BLASTP 2.2.12 [Aug-07-2005]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= BGIBMGA000733-TA|BGIBMGA000733-PA|IPR001173|Glycosyl
transferase, family 2, IPR000772|Ricin B lectin, IPR008997|Ricin
B-related lectin
(589 letters)
Database: uniref50
1,657,284 sequences; 575,637,011 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
UniRef50_Q16ZW8 Cluster: N-acetylgalactosaminyltransferase; n=4;... 353 1e-95
UniRef50_Q8MVS5 Cluster: Polypeptide N-acetylgalactosaminyltrans... 321 3e-86
UniRef50_Q8NCW6 Cluster: Polypeptide N-acetylgalactosaminyltrans... 316 1e-84
UniRef50_Q6WV19 Cluster: Polypeptide N-acetylgalactosaminyltrans... 288 2e-76
UniRef50_Q7K755 Cluster: Putative polypeptide N-acetylgalactosam... 286 1e-75
UniRef50_Q8I136 Cluster: Polypeptide N-acetylgalactosaminyltrans... 282 2e-74
UniRef50_P34678 Cluster: Polypeptide N-acetylgalactosaminyltrans... 279 1e-73
UniRef50_A7SZ28 Cluster: Predicted protein; n=1; Nematostella ve... 276 1e-72
UniRef50_A7SDQ3 Cluster: Predicted protein; n=1; Nematostella ve... 275 2e-72
UniRef50_Q10471 Cluster: Polypeptide N-acetylgalactosaminyltrans... 272 2e-71
UniRef50_A7RGG9 Cluster: Predicted protein; n=3; Eumetazoa|Rep: ... 266 8e-70
UniRef50_Q8N428 Cluster: Putative polypeptide N-acetylgalactosam... 266 1e-69
UniRef50_Q5DD76 Cluster: SJCHGC09400 protein; n=2; Schistosoma j... 264 5e-69
UniRef50_Q95ZJ1 Cluster: Polypeptide N-acetylgalactosaminyltrans... 263 1e-68
UniRef50_A2AQQ1 Cluster: UDP-N-acetyl-alpha-D-galactosamine: pol... 261 3e-68
UniRef50_Q10472 Cluster: Polypeptide N-acetylgalactosaminyltrans... 261 3e-68
UniRef50_A7RRV7 Cluster: Predicted protein; n=1; Nematostella ve... 260 6e-68
UniRef50_Q4RQL8 Cluster: Chromosome 2 SCAF15004, whole genome sh... 259 2e-67
UniRef50_UPI00015B515F Cluster: PREDICTED: similar to n-acetylga... 256 9e-67
UniRef50_Q96FL9 Cluster: Polypeptide N-acetylgalactosaminyltrans... 256 1e-66
UniRef50_UPI00015B5D50 Cluster: PREDICTED: similar to ENSANGP000... 254 4e-66
UniRef50_UPI00015B453F Cluster: PREDICTED: similar to GA20875-PA... 254 4e-66
UniRef50_Q8IXK2 Cluster: Polypeptide N-acetylgalactosaminyltrans... 253 8e-66
UniRef50_O61394 Cluster: Probable N-acetylgalactosaminyltransfer... 252 3e-65
UniRef50_UPI0000E4974C Cluster: PREDICTED: hypothetical protein;... 242 2e-62
UniRef50_UPI0000D564C6 Cluster: PREDICTED: similar to CG8182-PA,... 241 4e-62
UniRef50_Q9D4M9 Cluster: Putative polypeptide N-acetylgalactosam... 241 4e-62
UniRef50_Q86SR1 Cluster: Polypeptide N-acetylgalactosaminyltrans... 239 1e-61
UniRef50_Q6WV20 Cluster: Polypeptide N-acetylgalactosaminyltrans... 239 2e-61
UniRef50_Q9U2C4 Cluster: Probable N-acetylgalactosaminyltransfer... 238 3e-61
UniRef50_Q14435 Cluster: Polypeptide N-acetylgalactosaminyltrans... 238 3e-61
UniRef50_O45293 Cluster: Probable N-acetylgalactosaminyltransfer... 236 1e-60
UniRef50_UPI0000D56CDA Cluster: PREDICTED: similar to CG4445-PA;... 233 7e-60
UniRef50_O61397 Cluster: Probable N-acetylgalactosaminyltransfer... 232 2e-59
UniRef50_Q7Z7M9 Cluster: Polypeptide N-acetylgalactosaminyltrans... 231 4e-59
UniRef50_Q9Y117 Cluster: Polypeptide N-acetylgalactosaminyltrans... 227 5e-58
UniRef50_Q176D5 Cluster: N-acetylgalactosaminyltransferase; n=3;... 226 1e-57
UniRef50_Q16SH9 Cluster: N-acetylgalactosaminyltransferase; n=2;... 226 1e-57
UniRef50_Q8MV48 Cluster: N-acetylgalactosaminyltransferase 7; n=... 223 1e-56
UniRef50_UPI0000E461C0 Cluster: PREDICTED: hypothetical protein,... 222 2e-56
UniRef50_Q6V2D0 Cluster: UDP-N-acetyl-D-galactosamine:polypeptid... 221 4e-56
UniRef50_Q17NN8 Cluster: N-acetylgalactosaminyltransferase; n=4;... 219 2e-55
UniRef50_O45947 Cluster: Putative polypeptide N-acetylgalactosam... 217 7e-55
UniRef50_Q9HCQ5 Cluster: Polypeptide N-acetylgalactosaminyltrans... 215 3e-54
UniRef50_Q5TWJ3 Cluster: ENSANGP00000028412; n=1; Anopheles gamb... 213 8e-54
UniRef50_Q6WV16 Cluster: N-acetylgalactosaminyltransferase 6; n=... 210 8e-53
UniRef50_Q17M60 Cluster: N-acetylgalactosaminyltransferase; n=1;... 209 2e-52
UniRef50_Q8N3T1 Cluster: Polypeptide N-acetylgalactosaminyltrans... 207 6e-52
UniRef50_Q7TT15-2 Cluster: Isoform 2 of Q7TT15 ; n=9; Mammalia|R... 206 1e-51
UniRef50_UPI0000E4710F Cluster: PREDICTED: similar to pp-GalNAc-... 205 2e-51
UniRef50_UPI0000E46551 Cluster: PREDICTED: hypothetical protein,... 204 7e-51
UniRef50_Q6P9A2 Cluster: Putative polypeptide N-acetylgalactosam... 202 2e-50
UniRef50_Q16ZA7 Cluster: N-acetylgalactosaminyltransferase; n=7;... 202 2e-50
UniRef50_UPI000069E576 Cluster: Polypeptide N-acetylgalactosamin... 201 4e-50
UniRef50_UPI000069E1C8 Cluster: Polypeptide N-acetylgalactosamin... 201 4e-50
UniRef50_Q9NY28 Cluster: Probable polypeptide N-acetylgalactosam... 200 6e-50
UniRef50_Q8IA42 Cluster: N-acetylgalactosaminyltransferase 4; n=... 197 8e-49
UniRef50_Q86SF2 Cluster: N-acetylgalactosaminyltransferase 7; n=... 196 2e-48
UniRef50_Q9VUT6 Cluster: Polypeptide N-acetylgalactosaminyltrans... 194 4e-48
UniRef50_A0NGH9 Cluster: ENSANGP00000031751; n=1; Anopheles gamb... 194 5e-48
UniRef50_UPI000065D57A Cluster: Putative polypeptide N-acetylgal... 190 9e-47
UniRef50_Q8MYY6 Cluster: Putative polypeptide N-acetylgalactosam... 189 2e-46
UniRef50_Q5CKF0 Cluster: UDP-N-acetyl-D-galactosamine:polypeptid... 187 8e-46
UniRef50_Q8K1B9 Cluster: Putative polypeptide N-acetylgalactosam... 186 1e-45
UniRef50_Q8MM26 Cluster: UDP-N-acetyl-D-galactosamine:polypeptid... 180 1e-43
UniRef50_UPI0000E45D84 Cluster: PREDICTED: hypothetical protein;... 177 7e-43
UniRef50_Q5CY08 Cluster: Extracellular protein with a signal pep... 177 7e-43
UniRef50_UPI0000D9AA48 Cluster: PREDICTED: similar to UDP-N-acet... 172 2e-41
UniRef50_UPI000065D031 Cluster: Probable polypeptide N-acetylgal... 171 3e-41
UniRef50_UPI0000586DC6 Cluster: PREDICTED: similar to polypeptid... 171 6e-41
UniRef50_Q4RKI0 Cluster: Chromosome 21 SCAF15029, whole genome s... 169 1e-40
UniRef50_Q8IA43 Cluster: Putative polypeptide N-acetylgalactosam... 167 1e-39
UniRef50_Q6TBR4 Cluster: UDP-N-acetyl-D-galactosamine:polypeptid... 164 7e-39
UniRef50_Q5CYR4 Cluster: Extracellular protein with a signal pep... 164 7e-39
UniRef50_Q6YBY0 Cluster: UDP-N-acetyl-D-galactosamine:polypeptid... 161 4e-38
UniRef50_Q8IA41 Cluster: Putative polypeptide N-acetylgalactosam... 146 2e-33
UniRef50_UPI0000E46EB4 Cluster: PREDICTED: similar to MGC81846 p... 136 2e-30
UniRef50_Q6YK77 Cluster: UDP-N-acetyl-D-galactosamine:polypeptid... 135 4e-30
UniRef50_Q4RNJ5 Cluster: Chromosome 21 SCAF15012, whole genome s... 133 1e-29
UniRef50_Q4RNJ6 Cluster: Chromosome 21 SCAF15012, whole genome s... 125 4e-27
UniRef50_Q5CHA1 Cluster: Glycosyl transferase; n=4; Cryptosporid... 124 7e-27
UniRef50_Q4SKF7 Cluster: Chromosome 13 SCAF14566, whole genome s... 122 3e-26
UniRef50_Q8IA44 Cluster: Putative polypeptide N-acetylgalactosam... 115 4e-24
UniRef50_UPI0000E46FFD Cluster: PREDICTED: similar to n-acetylga... 110 1e-22
UniRef50_Q4STJ6 Cluster: Chromosome undetermined SCAF14183, whol... 106 1e-21
UniRef50_UPI000155C133 Cluster: PREDICTED: similar to UDP-N-acet... 103 1e-20
UniRef50_Q4T0W3 Cluster: Chromosome undetermined SCAF10824, whol... 103 2e-20
UniRef50_UPI0001554C17 Cluster: PREDICTED: similar to Polypeptid... 97 9e-19
UniRef50_A5D6B4 Cluster: Predicted glycosyltransferases; n=1; Pe... 91 8e-17
UniRef50_UPI0000D8AB1E Cluster: UDP-N-acetyl-alpha-D-galactosami... 89 3e-16
UniRef50_UPI00005A4710 Cluster: PREDICTED: similar to GalNAc tra... 88 7e-16
UniRef50_Q4RQK9 Cluster: Chromosome 2 SCAF15004, whole genome sh... 85 4e-15
UniRef50_Q7Q046 Cluster: ENSANGP00000016624; n=1; Anopheles gamb... 84 9e-15
UniRef50_Q2B871 Cluster: Glycosyl transferase, group 2 family pr... 83 3e-14
UniRef50_Q4SU00 Cluster: Chromosome undetermined SCAF14054, whol... 80 1e-13
UniRef50_UPI0000E234D0 Cluster: PREDICTED: similar to UDP-GalNAc... 80 2e-13
UniRef50_Q5BYW7 Cluster: SJCHGC07375 protein; n=1; Schistosoma j... 79 2e-13
UniRef50_Q4TCW9 Cluster: Chromosome undetermined SCAF6660, whole... 75 4e-12
UniRef50_UPI0000E46BB8 Cluster: PREDICTED: similar to UDP-GalNAc... 74 9e-12
UniRef50_UPI00005A4DE7 Cluster: PREDICTED: similar to Probable p... 74 1e-11
UniRef50_Q4T9B6 Cluster: Chromosome undetermined SCAF7602, whole... 73 2e-11
UniRef50_UPI0000F20FAB Cluster: PREDICTED: hypothetical protein;... 73 2e-11
UniRef50_Q01V98 Cluster: Glycosyl transferase, family 2; n=1; So... 73 2e-11
UniRef50_A4J8G0 Cluster: Glycosyl transferase, family 2; n=1; De... 73 2e-11
UniRef50_Q4RPK0 Cluster: Chromosome 12 SCAF15007, whole genome s... 72 4e-11
UniRef50_UPI0000E49DD9 Cluster: PREDICTED: hypothetical protein;... 71 7e-11
UniRef50_Q4SIA0 Cluster: Chromosome 5 SCAF14581, whole genome sh... 69 3e-10
UniRef50_UPI0000F1FCC6 Cluster: PREDICTED: hypothetical protein;... 67 1e-09
UniRef50_A7T195 Cluster: Predicted protein; n=1; Nematostella ve... 64 8e-09
UniRef50_UPI000069DFD6 Cluster: Polypeptide N-acetylgalactosamin... 62 5e-08
UniRef50_Q3A8Y3 Cluster: Glycosyl transferase, group 2 family; n... 60 2e-07
UniRef50_Q3A8Y2 Cluster: Glycosyl transferase, group 2 family; n... 58 9e-07
UniRef50_Q4TDW9 Cluster: Chromosome undetermined SCAF5986, whole... 50 2e-04
UniRef50_Q68VJ7 Cluster: Polypeptide N-acetylgalactosaminyltrans... 49 4e-04
UniRef50_Q1ARC2 Cluster: Glycosyl transferase, family 2; n=1; Ru... 47 0.002
UniRef50_Q4RAK4 Cluster: Chromosome undetermined SCAF23488, whol... 46 0.002
UniRef50_Q5CY12 Cluster: UDP-N-acetylgalactosamine: polypeptide ... 46 0.002
UniRef50_UPI000069E575 Cluster: Polypeptide N-acetylgalactosamin... 46 0.003
UniRef50_A6WEA2 Cluster: Glycosyl transferase family 2; n=1; Kin... 46 0.003
UniRef50_Q1Q6Z1 Cluster: Putative uncharacterized protein; n=1; ... 43 0.020
UniRef50_UPI0000499CE1 Cluster: SMC3 protein; n=1; Entamoeba his... 43 0.026
UniRef50_A2DKE3 Cluster: Viral A-type inclusion protein, putativ... 43 0.026
UniRef50_Q23ZG7 Cluster: Peptidase family M1 containing protein;... 42 0.035
UniRef50_A2EE02 Cluster: Putative uncharacterized protein; n=1; ... 42 0.046
UniRef50_A1BDD3 Cluster: Glycosyl transferase, family 2; n=1; Ch... 42 0.061
UniRef50_Q23G97 Cluster: Putative uncharacterized protein; n=1; ... 42 0.061
UniRef50_A2F531 Cluster: Viral A-type inclusion protein, putativ... 41 0.11
UniRef50_A0D876 Cluster: Chromosome undetermined scaffold_40, wh... 41 0.11
UniRef50_A6UUX2 Cluster: SMC domain protein; n=1; Methanococcus ... 41 0.11
UniRef50_A0E3J8 Cluster: Chromosome undetermined scaffold_76, wh... 40 0.14
UniRef50_P62134 Cluster: DNA double-strand break repair rad50 AT... 40 0.14
UniRef50_Q1WVM5 Cluster: N-acetylglucosaminyltransferase; n=1; L... 40 0.19
UniRef50_Q55EZ8 Cluster: Putative uncharacterized protein; n=1; ... 40 0.19
UniRef50_Q54SR9 Cluster: Leucine-rich repeat-containing protein;... 40 0.19
UniRef50_A2DD37 Cluster: Viral A-type inclusion protein, putativ... 40 0.25
UniRef50_A2SR79 Cluster: Glycosyl transferase, family 2; n=1; Me... 40 0.25
UniRef50_A0YT83 Cluster: Putative uncharacterized protein; n=1; ... 39 0.32
UniRef50_Q8MNV4 Cluster: Putative uncharacterized protein; n=2; ... 39 0.32
UniRef50_Q55FF7 Cluster: Putative uncharacterized protein; n=1; ... 39 0.32
UniRef50_Q9V2L6 Cluster: Dpm1 dolichol-phosphate mannosyltransfe... 39 0.32
UniRef50_Q5LW10 Cluster: Diguanylate cyclase, putative; n=3; Rho... 39 0.43
UniRef50_Q7NBF8 Cluster: Putative uncharacterized protein; n=1; ... 38 0.57
UniRef50_A5HZ17 Cluster: Exonuclease; n=4; Clostridium botulinum... 38 0.57
UniRef50_A0L591 Cluster: Glycosyl transferase, family 2; n=1; Ma... 38 0.57
UniRef50_Q8IBY8 Cluster: Putative uncharacterized protein PF07_0... 38 0.57
UniRef50_Q86KX8 Cluster: Similar to Dictyostelium discoideum (Sl... 38 0.57
UniRef50_Q22W40 Cluster: Putative uncharacterized protein; n=1; ... 38 0.57
UniRef50_A4YEH2 Cluster: Glycosyl transferase, family 2; n=1; Me... 38 0.57
UniRef50_UPI00006CE63C Cluster: hypothetical protein TTHERM_0070... 38 0.75
UniRef50_Q6MF69 Cluster: Putative uncharacterized protein; n=2; ... 38 0.75
UniRef50_Q1Q6W0 Cluster: Putative uncharacterized protein; n=1; ... 38 0.75
UniRef50_A1AUA8 Cluster: Glycosyl transferase, family 2; n=1; Pe... 38 0.75
UniRef50_Q23A50 Cluster: Putative uncharacterized protein; n=1; ... 38 0.75
UniRef50_A2FSZ8 Cluster: Viral A-type inclusion protein, putativ... 38 0.75
UniRef50_A2E4S4 Cluster: Viral A-type inclusion protein, putativ... 38 0.75
UniRef50_A2DKP8 Cluster: Viral A-type inclusion protein, putativ... 38 0.75
UniRef50_Q2S1Y8 Cluster: Glycosyl transferase, group 2 family pr... 38 0.99
UniRef50_A2FD36 Cluster: Viral A-type inclusion protein, putativ... 38 0.99
UniRef50_A0DTN4 Cluster: Chromosome undetermined scaffold_63, wh... 38 0.99
UniRef50_A0CKL2 Cluster: Chromosome undetermined scaffold_2, who... 38 0.99
UniRef50_Q6CYG5 Cluster: Similarity; n=2; Kluyveromyces lactis|R... 38 0.99
UniRef50_Q8TXA4 Cluster: Uncharacterized protein; n=2; cellular ... 38 0.99
UniRef50_UPI00006CD032 Cluster: Micro-fibrillar-associated prote... 37 1.3
UniRef50_UPI00006CB2DA Cluster: Viral A-type inclusion protein r... 37 1.3
UniRef50_Q7U947 Cluster: Putative uncharacterized protein; n=1; ... 37 1.3
UniRef50_Q319Q2 Cluster: Putative uncharacterized protein; n=1; ... 37 1.3
UniRef50_Q9GRG0 Cluster: Tetrin B protein; n=2; Tetrahymena ther... 37 1.3
UniRef50_Q8ILK5 Cluster: Putative uncharacterized protein; n=1; ... 37 1.3
UniRef50_Q6LF09 Cluster: Putative uncharacterized protein; n=6; ... 37 1.3
UniRef50_Q5DAE6 Cluster: SJCHGC05311 protein; n=1; Schistosoma j... 37 1.3
UniRef50_Q22869 Cluster: Non-muscle myosin heavy chain II; n=3; ... 37 1.3
UniRef50_A2EZ87 Cluster: Viral A-type inclusion protein, putativ... 37 1.3
UniRef50_Q75E63 Cluster: ABL193Cp; n=1; Eremothecium gossypii|Re... 37 1.3
UniRef50_Q0URH4 Cluster: Putative uncharacterized protein; n=1; ... 37 1.3
UniRef50_Q2FPR9 Cluster: Regulatory protein, ArsR; n=2; Methanom... 37 1.3
UniRef50_UPI00006CEB56 Cluster: hypothetical protein TTHERM_0037... 37 1.7
UniRef50_UPI00006CB6DE Cluster: hypothetical protein TTHERM_0049... 37 1.7
UniRef50_UPI00015A629B Cluster: UPI00015A629B related cluster; n... 37 1.7
UniRef50_UPI000069FF36 Cluster: M-phase phosphoprotein 1 (MPP1) ... 37 1.7
UniRef50_Q82L26 Cluster: Putative secreted alpha-galactosidase; ... 37 1.7
UniRef50_A5EY82 Cluster: Serine protease; n=1; Dichelobacter nod... 37 1.7
UniRef50_A2G5G4 Cluster: Putative uncharacterized protein; n=1; ... 37 1.7
UniRef50_A2EN31 Cluster: Viral A-type inclusion protein, putativ... 37 1.7
UniRef50_A2E7J3 Cluster: Putative uncharacterized protein; n=1; ... 37 1.7
UniRef50_A5DZR6 Cluster: Putative uncharacterized protein; n=1; ... 37 1.7
UniRef50_Q58718 Cluster: DNA double-strand break repair rad50 AT... 37 1.7
UniRef50_UPI00015B6253 Cluster: PREDICTED: similar to CG33715-PD... 36 2.3
UniRef50_Q2JCN5 Cluster: Glycosyl transferase, family 2 precurso... 36 2.3
UniRef50_O06764 Cluster: Phase variable surface lipoprotein P78 ... 36 2.3
UniRef50_A6WGG7 Cluster: Glycosyl transferase family 2; n=1; Kin... 36 2.3
UniRef50_A0V2D1 Cluster: Glycosyl transferase, family 2; n=1; Cl... 36 2.3
UniRef50_Q4Q0C8 Cluster: Phosphatidylinositol 3 kinase, putative... 36 2.3
UniRef50_O97294 Cluster: Putative uncharacterized protein PFC099... 36 2.3
UniRef50_A2EJ43 Cluster: Viral A-type inclusion protein, putativ... 36 2.3
UniRef50_A0CQE7 Cluster: Chromosome undetermined scaffold_24, wh... 36 2.3
UniRef50_Q4JC59 Cluster: Conserved Archaeal membrane protein; n=... 36 2.3
UniRef50_UPI000150A0FA Cluster: Type III restriction enzyme, res... 36 3.0
UniRef50_UPI00006CF2BD Cluster: Leucine Rich Repeat family prote... 36 3.0
UniRef50_UPI0000499464 Cluster: DNA repair protein Rad50; n=1; E... 36 3.0
UniRef50_Q1WV23 Cluster: Superfamily II DNA and RNA helicase; n=... 36 3.0
UniRef50_Q7RT92 Cluster: Phosphatidylinositol transfer protein 2... 36 3.0
UniRef50_A2ETW9 Cluster: Viral A-type inclusion protein, putativ... 36 3.0
UniRef50_Q0W1H0 Cluster: Putative glycosyltransferase; n=2; uncu... 36 3.0
UniRef50_A2STK1 Cluster: Glycosyl transferase, family 2; n=2; Me... 36 3.0
UniRef50_UPI000150A21E Cluster: hypothetical protein TTHERM_0019... 36 4.0
UniRef50_UPI00006CF1FD Cluster: hypothetical protein TTHERM_0054... 36 4.0
UniRef50_Q8KU52 Cluster: EF0109; n=1; Enterococcus faecalis|Rep:... 36 4.0
UniRef50_A5ZRG2 Cluster: Putative uncharacterized protein; n=1; ... 36 4.0
UniRef50_A4JHB1 Cluster: Putative uncharacterized protein; n=1; ... 36 4.0
UniRef50_Q7RSH4 Cluster: Putative uncharacterized protein PY0038... 36 4.0
UniRef50_Q7RBX2 Cluster: Mature-parasite-infected erythrocyte su... 36 4.0
UniRef50_Q4Z6V7 Cluster: Putative uncharacterized protein; n=3; ... 36 4.0
UniRef50_Q23YT2 Cluster: Kinesin motor domain containing protein... 36 4.0
UniRef50_Q22YR7 Cluster: Cyclic nucleotide-binding domain contai... 36 4.0
UniRef50_A2FX23 Cluster: Formin Homology 2 Domain containing pro... 36 4.0
UniRef50_A2F112 Cluster: Putative uncharacterized protein; n=1; ... 36 4.0
UniRef50_A2ERV7 Cluster: Putative uncharacterized protein; n=1; ... 36 4.0
UniRef50_A2EQA8 Cluster: Putative uncharacterized protein; n=1; ... 36 4.0
UniRef50_A0CXR3 Cluster: Chromosome undetermined scaffold_30, wh... 36 4.0
UniRef50_Q0UYN5 Cluster: Putative uncharacterized protein; n=1; ... 36 4.0
UniRef50_P37709 Cluster: Trichohyalin; n=2; Eutheria|Rep: Tricho... 36 4.0
UniRef50_UPI00006CB606 Cluster: hypothetical protein TTHERM_0044... 35 5.3
UniRef50_UPI0000498399 Cluster: Viral A-type inclusion protein r... 35 5.3
UniRef50_Q97H37 Cluster: Glycosyltransferase domain containing p... 35 5.3
UniRef50_Q8EWP8 Cluster: Predicted cytoskeletal protein; n=1; My... 35 5.3
UniRef50_Q50EX9 Cluster: P-553; n=5; Borrelia|Rep: P-553 - Borre... 35 5.3
UniRef50_Q1FP00 Cluster: Helix-turn-helix, AraC type precursor; ... 35 5.3
UniRef50_Q0TSM9 Cluster: Bacterial sugar transferase family prot... 35 5.3
UniRef50_A3ZWW6 Cluster: Putative uncharacterized protein; n=1; ... 35 5.3
UniRef50_A3N1J8 Cluster: Type I restriction enzyme, R subunit; n... 35 5.3
UniRef50_A0W610 Cluster: Glycosyl transferase, family 2; n=2; Ge... 35 5.3
UniRef50_Q8I569 Cluster: Putative uncharacterized protein; n=1; ... 35 5.3
UniRef50_Q8I4T0 Cluster: Putative uncharacterized protein; n=1; ... 35 5.3
UniRef50_Q7RIN9 Cluster: Putative uncharacterized protein PY0357... 35 5.3
UniRef50_Q54G05 Cluster: Putative uncharacterized protein; n=1; ... 35 5.3
UniRef50_Q4D320 Cluster: Putative uncharacterized protein; n=3; ... 35 5.3
UniRef50_Q28X57 Cluster: GA21145-PA; n=2; Sophophora|Rep: GA2114... 35 5.3
UniRef50_Q25662 Cluster: Repeat organellar protein; n=5; Plasmod... 35 5.3
UniRef50_Q236I9 Cluster: Putative uncharacterized protein; n=1; ... 35 5.3
UniRef50_A5KBH9 Cluster: Putative uncharacterized protein; n=1; ... 35 5.3
UniRef50_A2DDX5 Cluster: Viral A-type inclusion protein, putativ... 35 5.3
UniRef50_A0E4N3 Cluster: Chromosome undetermined scaffold_78, wh... 35 5.3
UniRef50_Q8SVH3 Cluster: Putative uncharacterized protein ECU05_... 35 5.3
UniRef50_Q5J6J3 Cluster: Dipeptidyl-peptidase IV; n=16; Pezizomy... 35 5.3
UniRef50_Q2FUG3 Cluster: Sensor protein; n=1; Methanospirillum h... 35 5.3
UniRef50_A3HAB6 Cluster: Putative uncharacterized protein; n=1; ... 35 5.3
UniRef50_Q10411 Cluster: Sporulation-specific protein 15; n=1; S... 35 5.3
UniRef50_UPI00015BC953 Cluster: UPI00015BC953 related cluster; n... 35 7.0
UniRef50_UPI00015B53AD Cluster: PREDICTED: similar to RHO kinase... 35 7.0
UniRef50_UPI0000DAE6EA Cluster: hypothetical protein Rgryl_01001... 35 7.0
UniRef50_Q8DX62 Cluster: Putative uncharacterized protein SAG199... 35 7.0
UniRef50_Q72ZS2 Cluster: Putative uncharacterized protein; n=1; ... 35 7.0
UniRef50_Q311W2 Cluster: Glycosyl transferase, group 2 family pr... 35 7.0
UniRef50_Q26H47 Cluster: Putative uncharacterized protein; n=1; ... 35 7.0
UniRef50_Q21IV0 Cluster: B-glycosyltransferase-like protein; n=1... 35 7.0
UniRef50_A7BMC2 Cluster: Glycosyl transferase, group 2 family; n... 35 7.0
UniRef50_A5FFS1 Cluster: Glycosyl transferase, family 2; n=15; B... 35 7.0
UniRef50_A5D3A4 Cluster: Predicted glycosyltransferases; n=1; Pe... 35 7.0
UniRef50_A4F5Y3 Cluster: Glycosyl transferase; n=1; Saccharopoly... 35 7.0
UniRef50_A3JJ43 Cluster: Glycosyl transferases-like protein; n=3... 35 7.0
UniRef50_Q8I1P1 Cluster: Putative uncharacterized protein PFD095... 35 7.0
UniRef50_Q7REL0 Cluster: Rhoptry protein; n=29; Plasmodium (Vinc... 35 7.0
UniRef50_Q54Y97 Cluster: Putative uncharacterized protein; n=1; ... 35 7.0
UniRef50_Q54MY6 Cluster: Putative uncharacterized protein; n=1; ... 35 7.0
UniRef50_Q4YPW4 Cluster: Putative uncharacterized protein; n=5; ... 35 7.0
UniRef50_Q233C6 Cluster: Putative uncharacterized protein; n=1; ... 35 7.0
UniRef50_Q22U14 Cluster: Putative uncharacterized protein; n=3; ... 35 7.0
UniRef50_Q22CU7 Cluster: Putative uncharacterized protein; n=1; ... 35 7.0
UniRef50_Q22BF6 Cluster: Putative uncharacterized protein; n=2; ... 35 7.0
UniRef50_A7TM62 Cluster: Putative uncharacterized protein; n=1; ... 35 7.0
UniRef50_Q8PXS7 Cluster: Dolichyl-phosphate mannose synthase rel... 35 7.0
UniRef50_Q2FTA5 Cluster: Glycosyl transferase, family 2; n=2; Me... 35 7.0
UniRef50_O80458 Cluster: Dehydrodolichyl diphosphate synthase 1;... 35 7.0
UniRef50_UPI00006CE5B4 Cluster: hypothetical protein TTHERM_0014... 34 9.2
UniRef50_Q149I6 Cluster: Upf3b protein; n=1; Mus musculus|Rep: U... 34 9.2
UniRef50_Q8RA31 Cluster: Glycosyltransferases involved in cell w... 34 9.2
UniRef50_Q6YPY2 Cluster: Putative uncharacterized protein; n=1; ... 34 9.2
UniRef50_Q3B6H2 Cluster: Glycosyl transferase; n=4; Chlorobium/P... 34 9.2
UniRef50_Q2B897 Cluster: Glycosyltransferase; n=1; Bacillus sp. ... 34 9.2
UniRef50_Q1IMJ5 Cluster: Glycosyl transferase, family 2; n=8; ce... 34 9.2
UniRef50_A5I5G6 Cluster: Putative uncharacterized protein; n=4; ... 34 9.2
UniRef50_A4WXV7 Cluster: TRAP dicarboxylate transporter-DctP sub... 34 9.2
UniRef50_A0IQ61 Cluster: Glycosyl transferase, family 2; n=1; Se... 34 9.2
UniRef50_Q8IPP9 Cluster: CG31551-PA; n=2; Eukaryota|Rep: CG31551... 34 9.2
UniRef50_Q8IDH0 Cluster: Putative uncharacterized protein MAL13P... 34 9.2
UniRef50_Q8I416 Cluster: ATP-dependent RNA helicase, putative; n... 34 9.2
UniRef50_Q7QA01 Cluster: ENSANGP00000012284; n=1; Anopheles gamb... 34 9.2
UniRef50_Q7PDJ9 Cluster: Related to CG6013 PROTEIN; n=3; Plasmod... 34 9.2
UniRef50_Q5CRD7 Cluster: Putative uncharacterized protein; n=2; ... 34 9.2
UniRef50_Q4Z043 Cluster: Related to CG6013 PROTEIN, putative; n=... 34 9.2
UniRef50_Q23JF3 Cluster: Putative uncharacterized protein; n=1; ... 34 9.2
UniRef50_Q23AP4 Cluster: Peptidyl-prolyl cis-trans isomerase, cy... 34 9.2
UniRef50_Q22RF4 Cluster: Viral A-type inclusion protein repeat c... 34 9.2
UniRef50_A2G6X0 Cluster: Putative uncharacterized protein; n=1; ... 34 9.2
UniRef50_A0BIX7 Cluster: Chromosome undetermined scaffold_11, wh... 34 9.2
UniRef50_A0BCU6 Cluster: Chromosome undetermined scaffold_10, wh... 34 9.2
UniRef50_Q9HQP4 Cluster: Putative uncharacterized protein; n=1; ... 34 9.2
UniRef50_Q5JJ24 Cluster: Dolichol-phosphate mannosyltransferase;... 34 9.2
UniRef50_Q09462 Cluster: Probable U3 small nucleolar RNA-associa... 34 9.2
>UniRef50_Q16ZW8 Cluster: N-acetylgalactosaminyltransferase; n=4;
Endopterygota|Rep: N-acetylgalactosaminyltransferase -
Aedes aegypti (Yellowfever mosquito)
Length = 662
Score = 353 bits (867), Expect = 1e-95
Identities = 157/258 (60%), Positives = 205/258 (79%), Gaps = 10/258 (3%)
Query: 246 ENSEVKNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRL 305
E + +KN+ +RL++ ++REGL+R+R+YGA N+ GDVL+FLDSHIEVNV W+ PLL+R
Sbjct: 251 ELNALKNS--KVRLIRNAEREGLMRSRVYGARNATGDVLIFLDSHIEVNVDWVEPLLQR- 307
Query: 306 SQGVDGVKVRYSARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDE 365
+K + A+ PVID+IN+DTF YS SPLVRGGFNWGLHFKWDNLPKGTL +
Sbjct: 308 ------IKTNKTILAM-PVIDIINSDTFIYSSSPLVRGGFNWGLHFKWDNLPKGTLAKES 360
Query: 366 DFMKPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCS 425
DF+ P +SPTMAGGLFA+ R+YF +G+YD GM+VWGGENLEISFR W CGGS+EL+PCS
Sbjct: 361 DFVGPFQSPTMAGGLFAVDRQYFKDLGEYDMGMDVWGGENLEISFRTWQCGGSIELVPCS 420
Query: 426 RVGHVFRKRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKA 485
R+GHVFRKRRPYG + D M++NS+R++RVWMDDY+K +E P A V+ GD+++R
Sbjct: 421 RIGHVFRKRRPYGSPDGSDTMIRNSLRLSRVWMDDYIKYFLENQPQAKKVDPGDLTDRHD 480
Query: 486 LRERLQCKTFKWYLDNMW 503
LR+RL CK+F+WYL N++
Sbjct: 481 LRKRLNCKSFEWYLKNIY 498
Score = 139 bits (336), Expect = 2e-31
Identities = 80/186 (43%), Positives = 110/186 (59%), Gaps = 15/186 (8%)
Query: 27 KNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQQAIK 86
+NN + LEGK D ++ S KY Y+++ E+R+ + K +
Sbjct: 87 QNNLVGLDPLEGKT-------DTGKSFSFFKDKYNRYRKEQEFRK---ISHKLMDELQPI 136
Query: 87 MSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQ 146
M T+ +FG++RNSE+ IRD GY HAFN L+S +IG R +PDTR+KLC Q
Sbjct: 137 MPNGTD-----EFGMVRNSEEQFIRDVGYRKHAFNVLVSNKIGPFRGVPDTRHKLCHEQS 191
Query: 147 YFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEA 206
Y LP ASII+CFYNEH ETL+RSV SI+ RT + EIILVDD SDL +L +++
Sbjct: 192 YDKVLPSASIIMCFYNEHLETLVRSVTSIIRRTPSYLLHEIILVDDCSDLDDLRDNLEHE 251
Query: 207 VDKLNN 212
++ L N
Sbjct: 252 LNALKN 257
Score = 101 bits (242), Expect = 5e-20
Identities = 44/91 (48%), Positives = 64/91 (70%), Gaps = 3/91 (3%)
Query: 502 MWFETDRSELVLGRTLCLDASNNV---APILGKCHEMGGTQEWKHKGTASSPIYNTAAGM 558
MW+ET+R+ELVLG+ LCL+A ++ +P+L KCHEMGG Q WKH+ T +PIYN A+G
Sbjct: 572 MWYETERAELVLGQLLCLEAPSSATKGSPMLNKCHEMGGDQAWKHRKTKGTPIYNIASGS 631
Query: 559 CLGVDRSYRGETVLMVICDDYSNNKWDIVRS 589
CL V ++ +G V + +C + + WD+V S
Sbjct: 632 CLAVKQATKGALVGLDLCVNSPRSTWDLVIS 662
>UniRef50_Q8MVS5 Cluster: Polypeptide
N-acetylgalactosaminyltransferase 35A (EC 2.4.1.41)
(Protein-UDP acetylgalactosaminyltransferase 35A) (UDP-
GalNAc:polypeptide N-acetylgalactosaminyltransferase
35A) (pp-GaNTase 35A) (Protein l(2)35Aa); n=2;
Sophophora|Rep: Polypeptide
N-acetylgalactosaminyltransferase 35A (EC 2.4.1.41)
(Protein-UDP acetylgalactosaminyltransferase 35A) (UDP-
GalNAc:polypeptide N-acetylgalactosaminyltransferase
35A) (pp-GaNTase 35A) (Protein l(2)35Aa) - Drosophila
melanogaster (Fruit fly)
Length = 632
Score = 321 bits (789), Expect = 3e-86
Identities = 145/250 (58%), Positives = 185/250 (74%), Gaps = 8/250 (3%)
Query: 256 NIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVR 315
N+R +K +REGLIR+R+ GA +VGDVLVFLDSHIEVN WL PLL+ +
Sbjct: 211 NLRYIKNEQREGLIRSRVIGAREAVGDVLVFLDSHIEVNQQWLEPLLRLIKSE------- 263
Query: 316 YSARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPT 375
+A PVID+INADTFEY+PSPLVRGGFNWGLHF+W+NLP+GTL EDF P +SPT
Sbjct: 264 -NATLAVPVIDLINADTFEYTPSPLVRGGFNWGLHFRWENLPEGTLKVPEDFRGPFRSPT 322
Query: 376 MAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRR 435
MAGGLFA+ R+YF +G+YD M++WGGEN+EISFR W CGG+++++PCSRVGH+FRKRR
Sbjct: 323 MAGGLFAVNRKYFQHLGEYDMAMDIWGGENIEISFRAWQCGGAIKIVPCSRVGHIFRKRR 382
Query: 436 PYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCKTF 495
PY + + ML+NS+R+A VWMD Y ++ + GDIS+R LRERLQC+ F
Sbjct: 383 PYTSPDGANTMLKNSLRLAHVWMDQYKDYYLKHEKVPKTYDYGDISDRLKLRERLQCRDF 442
Query: 496 KWYLDNMWFE 505
WYL N++ E
Sbjct: 443 AWYLKNVYPE 452
Score = 120 bits (288), Expect = 1e-25
Identities = 62/130 (47%), Positives = 90/130 (69%), Gaps = 9/130 (6%)
Query: 97 EQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFD--ELPRA 154
E G++RN +D IRD GY HAFN L+S IG R +PDTR+K+C Q+ + LP+A
Sbjct: 91 ELLGVVRNKQDKYIRDIGYKHHAFNALVSNNIGLFRAIPDTRHKVCDRQETTEAENLPQA 150
Query: 155 SIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDL----YNLHHDVQEAVDKL 210
SI++CFYNEH TLMRS+ ++++RT ++EIILVDD+SDL ++LH D++ + K
Sbjct: 151 SIVMCFYNEHKMTLMRSIKTVLERTPSYLLREIILVDDHSDLPELEFHLHGDLRARL-KY 209
Query: 211 NNV--IKKEE 218
+N+ IK E+
Sbjct: 210 DNLRYIKNEQ 219
Score = 87.0 bits (206), Expect = 1e-15
Identities = 38/101 (37%), Positives = 61/101 (60%), Gaps = 2/101 (1%)
Query: 490 LQCKTFKWYLDNMWFETDRSELVLGRTLCLDASNNVAPILGKCHEMGGTQEWKHKGTASS 549
LQ +T + + +W+ET+++E+VL + LCL+AS + + KCHEM G Q+W+H A+S
Sbjct: 511 LQLQTCRRTPNQLWYETEKAEIVLDKLLCLEASGDAQVTVNKCHEMLGDQQWRHTRNANS 570
Query: 550 PIYNTAAGMCLGVDRSYRGETVLMVIC--DDYSNNKWDIVR 588
P+YN A G CL G + + +C + + WDIV+
Sbjct: 571 PVYNMAKGTCLRAAAPTTGALISLDLCSKSNGAGGSWDIVQ 611
>UniRef50_Q8NCW6 Cluster: Polypeptide
N-acetylgalactosaminyltransferase 11; n=33;
Eumetazoa|Rep: Polypeptide
N-acetylgalactosaminyltransferase 11 - Homo sapiens
(Human)
Length = 608
Score = 316 bits (775), Expect = 1e-84
Identities = 145/249 (58%), Positives = 185/249 (74%), Gaps = 8/249 (3%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
I++++ +KREGLIR R+ GA ++ G+VLVFLDSH EVNV WL PLL + + R+
Sbjct: 214 IKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLAAIRED------RH 267
Query: 317 SARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTM 376
+ V PVID+I+ADT YS SP+VRGGFNWGLHFKWD +P L E P+KSPTM
Sbjct: 268 TV--VCPVIDIISADTLAYSSSPVVRGGFNWGLHFKWDLVPLSELGRAEGATAPIKSPTM 325
Query: 377 AGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRRP 436
AGGLFA+ R+YF+ +G+YD GM++WGGENLEISFRIWMCGG L +IPCSRVGH+FRKRRP
Sbjct: 326 AGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRP 385
Query: 437 YGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCKTFK 496
YG E QD M NS+R+A VW+D+Y ++ + P G+ISER LR++L CK+FK
Sbjct: 386 YGSPEGQDTMTHNSLRLAHVWLDEYKEQYFSLRPDLKTKSYGNISERVELRKKLGCKSFK 445
Query: 497 WYLDNMWFE 505
WYLDN++ E
Sbjct: 446 WYLDNVYPE 454
Score = 127 bits (306), Expect = 9e-28
Identities = 58/112 (51%), Positives = 77/112 (68%)
Query: 98 QFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASII 157
+ G+I N D +RD GY HAFN LIS R+G HRD+PDTRN C+ + Y +LP AS++
Sbjct: 97 ELGMIFNERDQELRDLGYQKHAFNMLISDRLGYHRDVPDTRNAACKEKFYPPDLPAASVV 156
Query: 158 ICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDK 209
ICFYNE + L+R+VHS++DRT + EIILVDD SD +L ++ E V K
Sbjct: 157 ICFYNEAFSALLRTVHSVIDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQK 208
Score = 58.0 bits (134), Expect = 7e-07
Identities = 32/91 (35%), Positives = 47/91 (51%), Gaps = 6/91 (6%)
Query: 500 DNMWFETDRSELVLGRTLCLDASNNVA---PILGKCHEMGGTQEWKHKGTASSPIYNTAA 556
+ +W + ELVL LCLD S + P L KCH GG+Q+W ++ +Y +
Sbjct: 518 NQIWIYNEEHELVLNSLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF--GKNNRLYQVSV 575
Query: 557 GMCL-GVDRSYRGETVLMVICDDYSNNKWDI 586
G CL VD + +V M ICD S+ +W +
Sbjct: 576 GQCLRAVDPLGQKGSVAMAICDGSSSQQWHL 606
>UniRef50_Q6WV19 Cluster: Polypeptide
N-acetylgalactosaminyltransferase 2; n=2;
Sophophora|Rep: Polypeptide
N-acetylgalactosaminyltransferase 2 - Drosophila
melanogaster (Fruit fly)
Length = 633
Score = 288 bits (707), Expect = 2e-76
Identities = 154/340 (45%), Positives = 205/340 (60%), Gaps = 24/340 (7%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
+R+++ KREGL+R+R+ GAD +V VL FLDSH+E N WL PLL+R+ +
Sbjct: 259 VRVIRNDKREGLVRSRVKGADAAVSSVLTFLDSHVECNEMWLEPLLERVRED-------- 310
Query: 317 SARAVTPVIDVINADTFEY-SPSPLVRGGFNWGLHFKWDNL-PKGTLINDEDFMKPLKSP 374
R V PVIDVI+ D F+Y S +RGGF+W L FKW+ L P + D +++P
Sbjct: 311 PTRVVCPVIDVISMDNFQYIGASADLRGGFDWNLIFKWEYLSPSERAMRHNDPTTAIRTP 370
Query: 375 TMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKR 434
+AGGLF I + YFN +GKYD M+VWGGENLEISFR+W CGGSLE+IPCSRVGHVFRKR
Sbjct: 371 MIAGGLFVIDKAYFNKLGKYDMKMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKR 430
Query: 435 RPYGV-GEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCK 493
PY G + +N+ R A VWMDDY + P A ++ G+I +R AL+E+L CK
Sbjct: 431 HPYTFPGGSGNVFARNTRRAAEVWMDDYKQHYYNAVPLAKNIPFGNIDDRLALKEKLHCK 490
Query: 494 TFKWYLDNMWFE---TDRSELVLGR---TLCLDASNN-VAPILG--KCHEMGGTQEWKHK 544
FKWYL+N++ + D E+ R T CLD + + +G CH GG QEW
Sbjct: 491 PFKWYLENVYPDLQAPDPQEVGQFRQDSTECLDTMGHLIDGTVGIFPCHNTGGNQEWAF- 549
Query: 545 GTASSPIYNTAAGMCLGVDRSYRGETVLMVICDDYSNNKW 584
T I + +CL + RG V++ CDD N +W
Sbjct: 550 -TKRGEIKHD--DLCLTLVTFARGSQVVLKACDDSENQRW 586
Score = 90.6 bits (215), Expect = 1e-16
Identities = 42/96 (43%), Positives = 67/96 (69%), Gaps = 6/96 (6%)
Query: 100 GLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIIC 159
G +RN ED IR++ FN S + +RD+PDTRN +C++++Y ++LP S+II
Sbjct: 156 GALRNGEDPYIRNR------FNQEASDALPSNRDIPDTRNPMCRTKKYREDLPETSVIIT 209
Query: 160 FYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSD 195
F+NE TL+R++ S+++R+ + I+EI+LVDDYSD
Sbjct: 210 FHNEARSTLLRTIVSVLNRSPEHLIREIVLVDDYSD 245
>UniRef50_Q7K755 Cluster: Putative polypeptide
N-acetylgalactosaminyltransferase 11; n=3;
Caenorhabditis|Rep: Putative polypeptide
N-acetylgalactosaminyltransferase 11 - Caenorhabditis
elegans
Length = 605
Score = 286 bits (701), Expect = 1e-75
Identities = 154/343 (44%), Positives = 210/343 (61%), Gaps = 40/343 (11%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
++ LKT K EGLIRA+++GA + G+VLVFLDSH EVN WLPPLL ++ Q
Sbjct: 224 VKFLKTDKNEGLIRAKIFGARRANGEVLVFLDSHCEVNEEWLPPLLDQIKQN-------- 275
Query: 317 SARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTM 376
R V P+ID+I+A T +Y SP+ GG NW + FKWD + + +++ PLKSPTM
Sbjct: 276 RRRVVCPIIDIIDAITMKYVESPVCTGGVNWAMTFKWDYPHRSYFEDPMNYVNPLKSPTM 335
Query: 377 AGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRRP 436
AGGLFAI +EYF IG YD GM+VWG EN+EIS RIW CGG L ++PCSRVGH+FR++RP
Sbjct: 336 AGGLFAIDKEYFFEIGSYDEGMDVWGAENVEISVRIWTCGGELLIMPCSRVGHIFRRQRP 395
Query: 437 YGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPS-AAHVEIGDISERKALRERLQCKTF 495
YG+ K D M +NS+R+ARVW+D+Y++ E P+ + GD++ R +LR LQCK F
Sbjct: 396 YGI--KTDSMGKNSVRLARVWLDEYLENFFEARPNYRTFTDYGDLTSRISLRRNLQCKPF 453
Query: 496 KWYLDNMWFET---------DRSELVLGR---------TLCLDASNNVAPI-------LG 530
KWYL+N++ E + LV G+ T CL A N+ I +
Sbjct: 454 KWYLENIYPELLPDNTPNQLNNQILVAGKKYLIKMANGTHCLSAENSQGRIANGNRVEMR 513
Query: 531 KCHEMGGTQEWKHKGTASSPIYNTAAGMCLGVDRSYRGETVLM 573
KC+ M Q+WK+ T ++ MCL S RG +V++
Sbjct: 514 KCNHMERMQQWKYSSTNELRPMGSSR-MCLD---SLRGISVIL 552
Score = 96.7 bits (230), Expect = 2e-18
Identities = 48/111 (43%), Positives = 68/111 (61%), Gaps = 1/111 (0%)
Query: 88 SKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQY 147
SK+ E D + G I + ++ +GY + FN L+S RIG R + D+RN C S Y
Sbjct: 96 SKEIEIDTD-LLGKINGKAEDDLQVEGYKKYQFNGLLSDRIGSRRKIKDSRNARCSSLTY 154
Query: 148 FDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYN 198
D LP ASI++C++NE L+R V+SI DRT +H+ EI+LVDD S+ N
Sbjct: 155 SDSLPAASIVVCYFNESPSVLIRMVNSIFDRTKPEHLHEILLVDDSSEWSN 205
>UniRef50_Q8I136 Cluster: Polypeptide
N-acetylgalactosaminyltransferase 4; n=3;
Caenorhabditis|Rep: Polypeptide
N-acetylgalactosaminyltransferase 4 - Caenorhabditis
elegans
Length = 589
Score = 282 bits (691), Expect = 2e-74
Identities = 150/335 (44%), Positives = 204/335 (60%), Gaps = 22/335 (6%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
I +L+ ++REGLIR+R+ GA + VL FLDSHIE N WL PLL R+++ V
Sbjct: 208 ITVLRNNQREGLIRSRVKGAQVARAPVLTFLDSHIECNQKWLEPLLARIAENPKAV---- 263
Query: 317 SARAVTPVIDVINADTFEY-SPSPLVRGGFNWGLHFKWDNLPKGTLINDEDF-MKPLKSP 374
V P+IDVIN D F Y S +RGGF+W L F+W+ + + P++SP
Sbjct: 264 ----VAPIIDVINVDNFNYVGASADLRGGFDWTLVFRWEFMNEQLRKERHAHPTAPIRSP 319
Query: 375 TMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKR 434
TMAGGLFAI +E+FN +G YD M VWGGENLE+SFR+W CGGSLE++PCSRVGHVFRK+
Sbjct: 320 TMAGGLFAISKEWFNELGTYDLDMEVWGGENLEMSFRVWQCGGSLEIMPCSRVGHVFRKK 379
Query: 435 RPYGV-GEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCK 493
PY G + +N+ R A VWMD+Y ++ PSA V GDI++R A+R+RLQCK
Sbjct: 380 HPYTFPGGSGNVFQKNTRRAAEVWMDEYKAIYLKNVPSARFVNFGDITDRLAIRDRLQCK 439
Query: 494 TFKWYLDNMWFETDRSELVLGRT-------LCLDA---SNNVAPILGKCHEMGGTQEWKH 543
+FKWYL+N++ + + G++ LCLD+ + AP L CH GG QEW
Sbjct: 440 SFKWYLENVYPQLEIPRKTPGKSFQMKIGNLCLDSMARKESEAPGLFGCHGTGGNQEWVF 499
Query: 544 KGTASSPIYNTAAGMCLGVDRSYRGETVLMVICDD 578
+ N + +CL + +TV MV C++
Sbjct: 500 -DQLTKTFKNAISQLCLDFSSNTENKTVTMVKCEN 533
Score = 52.0 bits (119), Expect = 4e-05
Identities = 25/81 (30%), Positives = 47/81 (58%), Gaps = 1/81 (1%)
Query: 115 YNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDE-LPRASIIICFYNEHYETLMRSVH 173
Y ++FN S + R +PD+R C+ Y + ++II ++NE +L+R+V
Sbjct: 113 YKANSFNQEASDALNPTRKIPDSREPQCRDVDYSKVGMQPTTVIITYHNEARSSLLRTVF 172
Query: 174 SIMDRTDQKHIKEIILVDDYS 194
S+ +++ ++ + EI+LVDD S
Sbjct: 173 SVFNQSPEELLLEIVLVDDNS 193
>UniRef50_P34678 Cluster: Polypeptide
N-acetylgalactosaminyltransferase 3; n=2;
Caenorhabditis|Rep: Polypeptide
N-acetylgalactosaminyltransferase 3 - Caenorhabditis
elegans
Length = 612
Score = 279 bits (685), Expect = 1e-73
Identities = 149/313 (47%), Positives = 197/313 (62%), Gaps = 22/313 (7%)
Query: 247 NSEVKNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLS 306
+S +K I L+ R GLIRARL G++ + G +L+FLD+H+EV GWL PL+ R++
Sbjct: 222 DSYIKMFPIPIHLVHLENRSGLIRARLTGSEMAKGKILLFLDAHVEVTDGWLEPLVSRVA 281
Query: 307 QGVDGVKVRYSARAVTPVIDVINADTFEY-SPSPLVRGGFNWGLHFKWDNLPKGTLIN-D 364
+ R V P+IDVI+ DTFEY + S GGFNW L+F+W +PK L
Sbjct: 282 ED--------RKRVVAPIIDVISDDTFEYVTASETTWGGFNWHLNFRWYAVPKRELNRRG 333
Query: 365 EDFMKPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPC 424
D P+++PT+AGGLFAI +++F IG YD GM VWGGENLEISFR+WMCGGSLE+ PC
Sbjct: 334 SDRSMPIQTPTIAGGLFAIDKQFFYDIGSYDEGMQVWGGENLEISFRVWMCGGSLEIHPC 393
Query: 425 SRVGHVFRKRRPYGV-GEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISER 483
SRVGHVFRK+ PY G + N+ R A VWMD+Y ++ P+A +VE GD+SER
Sbjct: 394 SRVGHVFRKQTPYTFPGGTAKVIHHNAARTAEVWMDEYKAFFYKMVPAARNVEAGDVSER 453
Query: 484 KALRERLQCKTFKWYLDNMWFE----TDRSEL--VLGR--TLCLDAS---NNVAPILGKC 532
K LRE LQCK+FKWYL+N++ E D L ++ R C+D + + AP + C
Sbjct: 454 KKLRETLQCKSFKWYLENIYPEAPLPADFRSLGAIVNRFTEKCVDTNGKKDGQAPGIQAC 513
Query: 533 HEMGGTQEWKHKG 545
H GG Q W G
Sbjct: 514 HGAGGNQAWSLTG 526
Score = 68.9 bits (161), Expect = 3e-10
Identities = 35/89 (39%), Positives = 60/89 (67%), Gaps = 3/89 (3%)
Query: 110 IRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDE---LPRASIIICFYNEHYE 166
I++K + + FN + S+ I +R LPD R+ C++ + +P+ SIII F+NE +
Sbjct: 125 IKEKRFLENQFNVVASEMISVNRTLPDYRSDACRTSGNNLKTAGMPKTSIIIVFHNEAWT 184
Query: 167 TLMRSVHSIMDRTDQKHIKEIILVDDYSD 195
TL+R++HS+++R+ + ++EIILVDD SD
Sbjct: 185 TLLRTLHSVINRSPRHLLEEIILVDDKSD 213
>UniRef50_A7SZ28 Cluster: Predicted protein; n=1; Nematostella
vectensis|Rep: Predicted protein - Nematostella
vectensis
Length = 454
Score = 276 bits (677), Expect = 1e-72
Identities = 144/337 (42%), Positives = 201/337 (59%), Gaps = 23/337 (6%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
I++++T +R GLI+AR+ GA+N+VG+V++FLD+H E N GWLPPLL+R++
Sbjct: 121 IKIVRTKERVGLIKARVIGANNAVGEVVIFLDAHCECNKGWLPPLLERIALN-------- 172
Query: 317 SARAVTPVIDVINADTFEYSP-SPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPT 375
AV P ID I+ TF+Y P P +RG FNW +K + + D + +KSP
Sbjct: 173 RRTAVCPTIDFIDHKTFQYKPMDPYIRGTFNWRFDYKERAVRPEEMAKRRDPTQEVKSPV 232
Query: 376 MAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRR 435
MAGGLFAI RE+F+ +G+YDPGM +WGGE EISF++W CGG LE IPCSRVGHV+R
Sbjct: 233 MAGGLFAINREFFSELGQYDPGMFIWGGEQYEISFKLWQCGGQLENIPCSRVGHVYRHHV 292
Query: 436 PYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCKTF 495
PY K D L N R+A VWMD+Y + + P V+ GDIS+R ALR+RL+CK+F
Sbjct: 293 PY-TYPKHDATLVNFRRVAEVWMDEYKDWLYDKRPEIKSVDYGDISDRIALRKRLKCKSF 351
Query: 496 KWYLDNMWFETDRSELVL-------GRTLCLDASNNVAPILG--KCHEMGGTQEWKHKGT 546
KWYL+N+ +T +++L G+ +CLD+ +G CH MGG Q +++
Sbjct: 352 KWYLENVANDTVKTKLCACFQVRNQGKNMCLDSMGRKDGHVGLASCHNMGGNQAFQYTYI 411
Query: 547 ASSPIYNTAAGMCLGVDRSYRGETVLMVICDDYSNNK 583
T C V S+ G V C + N+
Sbjct: 412 RELRTDET----CFDVHESFPGAKVHFFPCHEMKGNQ 444
Score = 84.2 bits (199), Expect = 9e-15
Identities = 38/109 (34%), Positives = 67/109 (61%)
Query: 105 SEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEH 164
+ED +D Y+ FN +S +I R + DTR++ C+ + Y LP+AS++I F+NE
Sbjct: 13 AEDESKKDAAYSEFGFNQFVSDQISLERTISDTRHQACKQRSYPINLPKASVVIVFHNEG 72
Query: 165 YETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNV 213
+ TLMR+VH+++ R+ ++EI++VDD+S+ L + + KL +
Sbjct: 73 WSTLMRTVHTVLLRSPPHMLQEIVMVDDFSNKDFLKQKLDDYTKKLGKI 121
>UniRef50_A7SDQ3 Cluster: Predicted protein; n=1; Nematostella
vectensis|Rep: Predicted protein - Nematostella
vectensis
Length = 575
Score = 275 bits (675), Expect = 2e-72
Identities = 150/344 (43%), Positives = 205/344 (59%), Gaps = 27/344 (7%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
++L++ +KREGLIR+R+ GA+ + G+VL FLDSH E N WL PLL R+ + +
Sbjct: 197 VKLIRNTKREGLIRSRVKGANLARGEVLTFLDSHCECNKNWLEPLLLRIKESPKTI---- 252
Query: 317 SARAVTPVIDVINADTFEYSPSPL-VRGGFNWGLHFKWDNLPKGTLINDEDFMK-PLKSP 374
V+P+IDVIN DTF+Y S +RGGF W L+FKWD LP L + P+KSP
Sbjct: 253 ----VSPIIDVINLDTFDYLGSSADLRGGFGWNLNFKWDFLPPHILAERQGKPTLPIKSP 308
Query: 375 TMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKR 434
+AGGLF++ +++F +GKYD M+VWGGENLEISFR W CGG++E+IPCSRVGHVFR R
Sbjct: 309 VIAGGLFSVAKKWFETLGKYDMQMDVWGGENLEISFRTWQCGGAMEIIPCSRVGHVFRNR 368
Query: 435 RPYGV-GEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCK 493
PY G + +N+ R VWMDDY + P A + GDI ER LR +L+C+
Sbjct: 369 HPYQFPGGSMNVFQKNTRRAVEVWMDDYKRYYYAAVPYAKNTPYGDIEERVELRRKLRCR 428
Query: 494 TFKWYLDNMWFE----TDRSELVLGR----TLCLDASNNV-APILG--KCHEMGGTQEWK 542
FKWY+ N++ E +D S G C+D ++ +G +CH GG Q W
Sbjct: 429 PFKWYVQNVYPELKLPSDESTKSFGEIKQGNQCVDTLGHMRGQTIGLFECHGAGGNQMWS 488
Query: 543 HKGTASSPIYNTAAGMCLGVDRSYRGETVLMVICDD-YSNNKWD 585
T SS + + MCLGV+ E V ++ CD+ S W+
Sbjct: 489 L--TKSSLLKHET--MCLGVNDGKATEPVQLLDCDENNSMQHWE 528
Score = 75.8 bits (178), Expect = 3e-12
Identities = 45/134 (33%), Positives = 77/134 (57%), Gaps = 6/134 (4%)
Query: 61 QDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAF 120
QD + E R++ KE+ +++ K+ +DL+ ++ N D Y +A+
Sbjct: 55 QDLQENQETSREIP-KEREEEEE---FDKRKISDLDPIKYIVENG--FHEGDDAYAKNAY 108
Query: 121 NTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTD 180
N S ++ R++PD R++ C+SQ + +LP +IIICF+NE L+R+V S ++R+
Sbjct: 109 NIKKSDQLPVDREVPDVRDQQCKSQVWPHDLPTTTIIICFHNEGRSALLRTVISALNRSP 168
Query: 181 QKHIKEIILVDDYS 194
+KEIILVDD+S
Sbjct: 169 PHLLKEIILVDDFS 182
>UniRef50_Q10471 Cluster: Polypeptide
N-acetylgalactosaminyltransferase 2 (EC 2.4.1.41)
(Protein-UDP acetylgalactosaminyltransferase 2) (UDP-
GalNAc:polypeptide N-acetylgalactosaminyltransferase 2)
(Polypeptide GalNAc transferase 2) (GalNAc-T2)
(pp-GaNTase 2) [Contains: Polypeptide
N-acetylgalactosaminyltransferase 2 soluble form]; n=32;
Coelomata|Rep: Polypeptide
N-acetylgalactosaminyltransferase 2 (EC 2.4.1.41)
(Protein-UDP acetylgalactosaminyltransferase 2) (UDP-
GalNAc:polypeptide N-acetylgalactosaminyltransferase 2)
(Polypeptide GalNAc transferase 2) (GalNAc-T2)
(pp-GaNTase 2) [Contains: Polypeptide
N-acetylgalactosaminyltransferase 2 soluble form] - Homo
sapiens (Human)
Length = 571
Score = 272 bits (667), Expect = 2e-71
Identities = 149/354 (42%), Positives = 208/354 (58%), Gaps = 26/354 (7%)
Query: 246 ENSEVKNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRL 305
E+ + + +R+L+ +REGL+R+R+ GAD + VL FLDSH E N WL PLL+R+
Sbjct: 182 EDGALLGKIEKVRVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLERV 241
Query: 306 SQGVDGVKVRYSARAVTPVIDVINADTFEY-SPSPLVRGGFNWGLHFKWDNL-PKGTLIN 363
++ R V+P+IDVIN D F+Y S ++GGF+W L FKWD + P+
Sbjct: 242 AED--------RTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSR 293
Query: 364 DEDFMKPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIP 423
+ + P+K+P +AGGLF + + YF +GKYD M+VWGGENLEISFR+W CGGSLE+IP
Sbjct: 294 QGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIP 353
Query: 424 CSRVGHVFRKRRPYGV-GEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISE 482
CSRVGHVFRK+ PY G +N+ R A VWMD+Y PSA +V G+I
Sbjct: 354 CSRVGHVFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSARNVPYGNIQS 413
Query: 483 RKALRERLQCKTFKWYLDNMWFE---TDRSELVLGR----TLCLDASNNVAP-ILG--KC 532
R LR++L CK FKWYL+N++ E D ++ G T CLD + A ++G +C
Sbjct: 414 RLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGALQQGTNCLDTLGHFADGVVGVYEC 473
Query: 533 HEMGGTQEWKHKGTASSPIYNTAAGMCLGVDRSYRGETVLMVIC-DDYSNNKWD 585
H GG QEW T + + +CL V G + + C ++ S KW+
Sbjct: 474 HNAGGNQEWAL--TKEKSVKH--MDLCLTVVDRAPGSLIKLQGCRENDSRQKWE 523
Score = 71.3 bits (167), Expect = 7e-11
Identities = 34/81 (41%), Positives = 53/81 (65%)
Query: 115 YNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHS 174
Y + FN + S ++ R +PDTR+ CQ +Q+ +LP S++I F+NE L+R+V S
Sbjct: 99 YARNKFNQVESDKLRMDRAIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVS 158
Query: 175 IMDRTDQKHIKEIILVDDYSD 195
++ ++ IKEIILVDDYS+
Sbjct: 159 VLKKSPPHLIKEIILVDDYSN 179
>UniRef50_A7RGG9 Cluster: Predicted protein; n=3; Eumetazoa|Rep:
Predicted protein - Nematostella vectensis
Length = 353
Score = 266 bits (653), Expect = 8e-70
Identities = 129/251 (51%), Positives = 166/251 (66%), Gaps = 9/251 (3%)
Query: 256 NIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVR 315
N+R+L+TSKREGLIRARL GA + GDV+ FLD+H E NV WL PLL R+ V V
Sbjct: 96 NVRVLRTSKREGLIRARLIGARAAKGDVITFLDAHCEANVDWLQPLLSRIHSDRTIVAV- 154
Query: 316 YSARAVTPVIDVINADTFEYSPSP-LVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSP 374
PVID+I++ F YS +P V GGF+W + F W +LP +D P+++P
Sbjct: 155 -------PVIDIISSTNFMYSGTPSAVIGGFSWDMQFTWHSLPNNRQSERKDRTAPIRTP 207
Query: 375 TMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKR 434
TMAGGLF+I R+YF G YD GM+VWGGENLE+SFRIW CGG LE++PCSRVGHVFR R
Sbjct: 208 TMAGGLFSIDRKYFFESGSYDEGMDVWGGENLEMSFRIWQCGGKLEILPCSRVGHVFRTR 267
Query: 435 RPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCKT 494
PY + N R+ VWMD+Y + V P ++ GDI+ R ALR +L+CK+
Sbjct: 268 FPYSFPGGYSEVSVNLARVVHVWMDEYNQYVYMKRPDLQSLKYGDITSRVALRNKLKCKS 327
Query: 495 FKWYLDNMWFE 505
FKWYL+N++ E
Sbjct: 328 FKWYLENVYPE 338
Score = 67.7 bits (158), Expect = 8e-10
Identities = 30/72 (41%), Positives = 47/72 (65%)
Query: 142 CQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHH 201
C S+ Y LP +++ICF+NE + TL+R+VHS++DR+ ++EI+L+DD+S L
Sbjct: 26 CSSKSYPSYLPSTTVVICFHNEAWSTLLRTVHSVIDRSPAHLLREILLIDDFSTHDYLKS 85
Query: 202 DVQEAVDKLNNV 213
+ V KL NV
Sbjct: 86 KLTAYVAKLRNV 97
>UniRef50_Q8N428 Cluster: Putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1; n=43;
Eumetazoa|Rep: Putative polypeptide
N-acetylgalactosaminyltransferase-like protein 1 - Homo
sapiens (Human)
Length = 558
Score = 266 bits (651), Expect = 1e-69
Identities = 119/251 (47%), Positives = 169/251 (67%), Gaps = 10/251 (3%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
++ L+ +REGLIR+R+ GAD + VL FLDSH EVN WLPP+L+R+ +
Sbjct: 180 VKCLRNDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKED-------- 231
Query: 317 SARAVTPVIDVINADTFEY-SPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPT 375
R V+P+IDVI+ D F Y + S +RGGF+W LHFKW+ +P + D +P+++P
Sbjct: 232 HTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPV 291
Query: 376 MAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRR 435
+AGG+F I + +FN +GKYD M++WGGEN E+SFR+WMCGGSLE++PCSRVGHVFRKR
Sbjct: 292 IAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRH 351
Query: 436 PYGVGEKQDY-MLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCKT 494
PY E ++N+ R A VWMD+Y + E PSA G ++ R R+++ CK+
Sbjct: 352 PYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSAIGKAFGSVATRIEQRKKMNCKS 411
Query: 495 FKWYLDNMWFE 505
F+WYL+N++ E
Sbjct: 412 FRWYLENVYPE 422
Score = 81.8 bits (193), Expect = 5e-14
Identities = 39/91 (42%), Positives = 58/91 (63%)
Query: 104 NSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNE 163
+++ L+ + Y HAFN L S ++ R + DTR+ C S Y +LP S+II F+NE
Sbjct: 75 SAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVIITFHNE 134
Query: 164 HYETLMRSVHSIMDRTDQKHIKEIILVDDYS 194
TL+R+V S+++RT I+EIILVDD+S
Sbjct: 135 ARSTLLRTVKSVLNRTPANLIQEIILVDDFS 165
>UniRef50_Q5DD76 Cluster: SJCHGC09400 protein; n=2; Schistosoma
japonicum|Rep: SJCHGC09400 protein - Schistosoma
japonicum (Blood fluke)
Length = 737
Score = 264 bits (647), Expect = 5e-69
Identities = 145/304 (47%), Positives = 189/304 (62%), Gaps = 28/304 (9%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
+++++T +REGLIRAR+ GA S G VLVFLDSHIE GWL PLL R++
Sbjct: 306 VKIVRTKRREGLIRARMLGAAQSSGKVLVFLDSHIECTTGWLEPLLDRIAYN-------- 357
Query: 317 SARAVTPVIDVINADTFEY---SPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKS 373
S+ V PVI VIN T +Y SPS + GGF+W L F W + P++S
Sbjct: 358 SSIVVVPVITVINDKTLKYDLPSPSRVQIGGFDWSLSFIWHEQTERHKNRPGAPYSPVQS 417
Query: 374 PTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRK 433
PTMAGGLFAI REYFN +G YDPGM VWGGENLE+SF+IWMCGGSLE++ CS+VGH+FR
Sbjct: 418 PTMAGGLFAISREYFNHLGMYDPGMEVWGGENLELSFKIWMCGGSLEIVICSQVGHIFRD 477
Query: 434 RRPY-GVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQC 492
R PY + +D + +N +R+A VW+DDY K+ V+IG++SERKALRE+L+C
Sbjct: 478 RSPYIWDVDVKDPLKRNLLRLADVWLDDY-KRFYHARIGFEMVDIGNVSERKALREKLKC 536
Query: 493 KTFKWYLDNMWFE--TDRSELVLG------RTLCLDA----SNNVAPILGK---CHEMGG 537
+F WYL N++ E L G CLDA N+ + ++ K CH+ GG
Sbjct: 537 HSFDWYLTNIYPELFVPSKALASGDIESAAGPHCLDAPLPSENDSSSVIIKTRPCHKQGG 596
Query: 538 TQEW 541
Q W
Sbjct: 597 NQFW 600
Score = 100 bits (239), Expect = 1e-19
Identities = 52/106 (49%), Positives = 70/106 (66%), Gaps = 1/106 (0%)
Query: 110 IRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLM 169
I DKG+ +AFN L S RI R LPD R C+ +Y LP ASIIICF+NE + L+
Sbjct: 203 IFDKGWKDNAFNQLASDRISVRRYLPDYREGTCKDNKYSRNLPSASIIICFHNEAWSVLL 262
Query: 170 RSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIK 215
RSVHS++DR+ + EIILVDD+SD +L ++E + K+ NV+K
Sbjct: 263 RSVHSVIDRSPSYLLHEIILVDDFSDRPHLKEALEEYM-KMLNVVK 307
>UniRef50_Q95ZJ1 Cluster: Polypeptide
N-acetylgalactosaminyltransferase 5; n=13;
Bilateria|Rep: Polypeptide
N-acetylgalactosaminyltransferase 5 - Caenorhabditis
elegans
Length = 626
Score = 263 bits (644), Expect = 1e-68
Identities = 129/253 (50%), Positives = 169/253 (66%), Gaps = 14/253 (5%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
+++L+ KREGLIRARL GA + G+VL +LDSH E GW+ PLL R+ R
Sbjct: 237 VKILRMEKREGLIRARLRGAAVATGEVLTYLDSHCECMEGWMEPLLDRIK--------RD 288
Query: 317 SARAVTPVIDVINADTFEYSPSPLVR---GGFNWGLHFKWDNLPKGTLINDEDFMKPLKS 373
V PVIDVI+ +TFEY S GGF+WGL F W ++P+ N + P++S
Sbjct: 289 PTTVVCPVIDVIDDNTFEYHHSKAYFTSVGGFDWGLQFNWHSIPERDRKNRTRPIDPVRS 348
Query: 374 PTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRK 433
PTMAGGLF+I ++YF +G YDPG ++WGGENLE+SF+IWMCGG+LE++PCS VGHVFRK
Sbjct: 349 PTMAGGLFSIDKKYFEKLGTYDPGFDIWGGENLELSFKIWMCGGTLEIVPCSHVGHVFRK 408
Query: 434 RRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIE-VNPSAAHVEIGDISERKALRERLQC 492
R PY + + +NS+R+A VW+DDY E +N + GDIS RK LRE L C
Sbjct: 409 RSPYKWRTGVNVLKRNSIRLAEVWLDDYKTYYYERINNQLG--DFGDISSRKKLREDLGC 466
Query: 493 KTFKWYLDNMWFE 505
K+FKWYLDN++ E
Sbjct: 467 KSFKWYLDNIYPE 479
Score = 93.5 bits (222), Expect = 1e-17
Identities = 41/112 (36%), Positives = 71/112 (63%)
Query: 104 NSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNE 163
++E+ DKG +AFN S I HR LP + C++++Y + LPR S+IICF+NE
Sbjct: 127 STEEKAKYDKGMLNNAFNQYASDMISVHRTLPTNIDAECKTEKYNENLPRTSVIICFHNE 186
Query: 164 HYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIK 215
+ L+R+VHS+++RT ++E++LVDD+SD+ + ++E + + +K
Sbjct: 187 AWSVLLRTVHSVLERTPDHLLEEVVLVDDFSDMDHTKRPLEEYMSQFGGKVK 238
>UniRef50_A2AQQ1 Cluster: UDP-N-acetyl-alpha-D-galactosamine:
polypeptide N- acetylgalactosaminyltransferase 13; n=10;
Coelomata|Rep: UDP-N-acetyl-alpha-D-galactosamine:
polypeptide N- acetylgalactosaminyltransferase 13 - Mus
musculus (Mouse)
Length = 592
Score = 261 bits (640), Expect = 3e-68
Identities = 144/331 (43%), Positives = 193/331 (58%), Gaps = 24/331 (7%)
Query: 243 KSTENSEVKNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLL 302
K T + VK +++++ +R GLIRARL GA S G V+ FLD+H E +GWL PLL
Sbjct: 163 KLTLENYVKTLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLL 222
Query: 303 KRLSQGVDGVKVRYSARAVTPVIDVINADTFEY-SPSPLVRGGFNWGLHFKWDNLPKGTL 361
R+ + V V P+IDVI+ DTFEY + S + GGFNW L+F+W +P+ +
Sbjct: 223 ARIKEDRKTV--------VCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREM 274
Query: 362 INDE-DFMKPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLE 420
+ D P+++PTMAGGLF+I R YF IG YD GM++WGGENLE+SFRIW CGGSLE
Sbjct: 275 DRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLE 334
Query: 421 LIPCSRVGHVFRKRRPYGV-GEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGD 479
++ CS VGHVFRK PY G + +N+ R+A VWMD++ ++P V+ GD
Sbjct: 335 IVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPGVVKVDYGD 394
Query: 480 ISERKALRERLQCKTFKWYLDNMWFETD--RSELVLGR------TLCLD---ASNNVAPI 528
+S RK LRE L+CK F WYL+N++ ++ R LG CLD N
Sbjct: 395 VSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNVETNQCLDNMGRKENEKVG 454
Query: 529 LGKCHEMGGTQEWKHKGTASSPIYNTAAGMC 559
+ CH MGG Q H S+P A C
Sbjct: 455 IFNCHGMGGNQ--VHDLCLSAPSLGVGAEEC 483
Score = 76.2 bits (179), Expect = 2e-12
Identities = 37/101 (36%), Positives = 63/101 (62%)
Query: 115 YNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHS 174
+ ++ FN + S I +R LPD R + C+++ Y DELP S++I F+NE + TL+R+V+S
Sbjct: 78 FKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYS 137
Query: 175 IMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIK 215
+++R+ + E+ILVDD S+ L ++ V L +K
Sbjct: 138 VINRSPHYLLSEVILVDDASERDFLKLTLENYVKTLEVPVK 178
>UniRef50_Q10472 Cluster: Polypeptide
N-acetylgalactosaminyltransferase 1 (EC 2.4.1.41)
(Protein-UDP acetylgalactosaminyltransferase 1) (UDP-
GalNAc:polypeptide N-acetylgalactosaminyltransferase 1)
(Polypeptide GalNAc transferase 1) (GalNAc-T1)
(pp-GaNTase 1) [Contains: Polypeptide
N-acetylgalactosaminyltransferase 1 soluble form]; n=66;
Eumetazoa|Rep: Polypeptide
N-acetylgalactosaminyltransferase 1 (EC 2.4.1.41)
(Protein-UDP acetylgalactosaminyltransferase 1) (UDP-
GalNAc:polypeptide N-acetylgalactosaminyltransferase 1)
(Polypeptide GalNAc transferase 1) (GalNAc-T1)
(pp-GaNTase 1) [Contains: Polypeptide
N-acetylgalactosaminyltransferase 1 soluble form] - Homo
sapiens (Human)
Length = 559
Score = 261 bits (640), Expect = 3e-68
Identities = 151/354 (42%), Positives = 204/354 (57%), Gaps = 31/354 (8%)
Query: 248 SEVKNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQ 307
S VK + +++ +R GLIRARL GA S G V+ FLD+H E VGWL PLL R
Sbjct: 169 SYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLAR--- 225
Query: 308 GVDGVKVRYSARAVT-PVIDVINADTFEY-SPSPLVRGGFNWGLHFKWDNLPKGTLINDE 365
+++ R V P+IDVI+ DTFEY + S + GGFNW L+F+W +P+ + +
Sbjct: 226 ------IKHDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRK 279
Query: 366 -DFMKPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPC 424
D P+++PTMAGGLF+I R+YF IG YD GM++WGGENLEISFRIW CGG+LE++ C
Sbjct: 280 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 339
Query: 425 SRVGHVFRKRRPYGV-GEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISER 483
S VGHVFRK PY G + +N+ R+A VWMD++ ++P V+ GDIS R
Sbjct: 340 SHVGHVFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKVDYGDISSR 399
Query: 484 KALRERLQCKTFKWYLDNMWFETD--RSELVLGR------TLCLD---ASNNVAPILGKC 532
LR +LQCK F WYL+N++ ++ R LG CLD N + C
Sbjct: 400 VGLRHKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNC 459
Query: 533 HEMGGTQEWKHKGTASSPIYNTAAGMCLGVDRSYRGETVLMVICDDYSNNK-WD 585
H MGG Q + + TA+ I +CL D S V M+ C N+ W+
Sbjct: 460 HGMGGNQVFSY--TANKEI--RTDDLCL--DVSKLNGPVTMLKCHHLKGNQLWE 507
Score = 83.8 bits (198), Expect = 1e-14
Identities = 43/116 (37%), Positives = 73/116 (62%), Gaps = 3/116 (2%)
Query: 106 EDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHY 165
ED + + ++ FN + S+ I +R LPD R + C+++ Y D LP S++I F+NE +
Sbjct: 70 EDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKVYPDNLPTTSVVIVFHNEAW 129
Query: 166 ETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLN---NVIKKEE 218
TL+R+VHS+++R+ + I+EI+LVDD S+ L ++ V KL +VI+ E+
Sbjct: 130 STLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKRPLESYVKKLKVPVHVIRMEQ 185
>UniRef50_A7RRV7 Cluster: Predicted protein; n=1; Nematostella
vectensis|Rep: Predicted protein - Nematostella
vectensis
Length = 513
Score = 260 bits (638), Expect = 6e-68
Identities = 147/347 (42%), Positives = 198/347 (57%), Gaps = 31/347 (8%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
+++L+ KREGL+RARL GA+ + GDVL FLDSH E GW PLL R++ V
Sbjct: 129 VKVLRMKKREGLVRARLQGANTAKGDVLTFLDSHCEATPGWAEPLLARIAADRRNV---- 184
Query: 317 SARAVTPVIDVINADTFEYSPSPLV--RGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSP 374
V P I+VINADTF Y S RGGF+W L FKW +P D P+++P
Sbjct: 185 ----VCPAIEVINADTFAYQGSTNADQRGGFSWDLFFKWKGIPPEEQKLRNDDSDPIRTP 240
Query: 375 TMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRK- 433
TMAGGLF+I+R+YF IG YD M++WGGENLE+SFR+WMCGG LE++ CSRVGHVFRK
Sbjct: 241 TMAGGLFSIHRQYFFDIGSYDEEMDIWGGENLELSFRVWMCGGRLEIVTCSRVGHVFRKY 300
Query: 434 RRPYGVGEKQDYML-QNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQC 492
PY + + L +N R+A VWMD+Y P A + + GDIS+R LR+RL+C
Sbjct: 301 TSPYKFPDGVERTLTKNFNRLAEVWMDEYKDLYYNKKPQAKNSDYGDISKRLELRKRLKC 360
Query: 493 KTFKWYLDNMWFETDRSEL---VLGR------TLCLDA-----SNNVAPILGKCHEMGGT 538
K+FKWY++N++ + EL G CLD+ +N + CH GG
Sbjct: 361 KSFKWYINNIYPDVQMPELDPPARGEVRNPSSNQCLDSLGAKPEHNARVGIYTCHGQGGN 420
Query: 539 QEWKHKGTASSPIYNTAAGMCLGVDRSYRGETVLMVICDDY-SNNKW 584
Q K+ I+ C V +++ G V ++ C N +W
Sbjct: 421 QVSKY--MPRELIFEEE--NCFDVSKTHPGAPVELMKCHGMRGNQEW 463
Score = 79.4 bits (187), Expect = 2e-13
Identities = 41/126 (32%), Positives = 75/126 (59%), Gaps = 6/126 (4%)
Query: 102 IRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQ--YFDELPRASIIIC 159
+ + E+ ++ +K + H+FN L+S +I R L D R++ C+++ Y +LP S+IIC
Sbjct: 16 LESEENKKLAEKYFANHSFNWLLSDKISLDRTLDDVRSERCKAKHNTYPAKLPTTSVIIC 75
Query: 160 FYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNV----IK 215
F+ E L+R+VHS+++RT + + E+I+VDD+S L + + V + V +K
Sbjct: 76 FHKERLSVLLRTVHSVINRTPPELLAEVIVVDDFSQDAKLGKPLDDHVAQFTKVKVLRMK 135
Query: 216 KEEEMI 221
K E ++
Sbjct: 136 KREGLV 141
Score = 39.1 bits (87), Expect = 0.32
Identities = 28/79 (35%), Positives = 37/79 (46%), Gaps = 7/79 (8%)
Query: 510 ELVLGRTLCLDASNNV--API-LGKCHEMGGTQEWKHKGTASSPIYNTAAGMCLGVDR-S 565
EL+ C D S AP+ L KCH M G QEWKH + ++ T CL DR S
Sbjct: 429 ELIFEEENCFDVSKTHPGAPVELMKCHGMRGNQEWKHDREKGTLMHFTTQ-QCL--DRGS 485
Query: 566 YRGETVLMVICDDYSNNKW 584
+ +M CD + +W
Sbjct: 486 PSDQYAVMNPCDGRESQRW 504
>UniRef50_Q4RQL8 Cluster: Chromosome 2 SCAF15004, whole genome
shotgun sequence; n=5; Euteleostomi|Rep: Chromosome 2
SCAF15004, whole genome shotgun sequence - Tetraodon
nigroviridis (Green puffer)
Length = 632
Score = 259 bits (634), Expect = 2e-67
Identities = 152/371 (40%), Positives = 206/371 (55%), Gaps = 36/371 (9%)
Query: 242 KKSTENSEVKNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPL 301
KK EN V+ +R+L+ +R GLIRARL GA + G V+ FLD+H E VGWL PL
Sbjct: 163 KKKLENY-VRTLEVPVRILRMEQRSGLIRARLRGAAATKGQVITFLDAHCECTVGWLEPL 221
Query: 302 LKRLSQG------------VDGVKVR---YSARAVTPVIDVINADTFEY-SPSPLVRGGF 345
L R+ + + R + V P+IDVI+ +TFEY + S + GGF
Sbjct: 222 LARIKEDRWDCNTALCVCVFERPSFRCFLFRTAVVCPIIDVISDETFEYMAGSDMTYGGF 281
Query: 346 NWGLHFKWDNLPKGTLINDE-DFMKPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGE 404
NW L+F+W +P+ + + D P+++PTMAGGLF+I + YF IG YDPGM++WGGE
Sbjct: 282 NWKLNFRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDKTYFEEIGSYDPGMDIWGGE 341
Query: 405 NLEISFRIWMCGGSLELIPCSRVGHVFRKRRPYGV-GEKQDYMLQNSMRMARVWMDDYVK 463
NLE+SFRIW CGGSLE++ CS VGHVFRK PY G + +N+ R+A VWMDD+
Sbjct: 342 NLEMSFRIWQCGGSLEIVTCSHVGHVFRKATPYSFPGGTGQVINKNNRRLAEVWMDDFKD 401
Query: 464 KVIEVNPSAAHVEIGDISERKALRERLQCKTFKWYLDNMWFETD--RSELVLGR------ 515
++P V+ GD+S RK LR+ L CK F WYL+N++ ++ R LG
Sbjct: 402 FFYIISPGVMRVDYGDVSSRKGLRDALHCKPFSWYLENIYPDSQIPRRYYSLGEIRNVET 461
Query: 516 TLCLD---ASNNVAPILGKCHEMGGTQEWKHKGTASSPIYNTAAGMCLGVDRSYRGETVL 572
C+D N CH MGG Q + + TA I +CL V R VL
Sbjct: 462 NQCVDNMGRKENEKVGFFNCHGMGGNQVFSY--TADKEI--RTDDLCLDVSR--LNGPVL 515
Query: 573 MVICDDYSNNK 583
M+ C N+
Sbjct: 516 MLKCHHMKGNQ 526
Score = 75.4 bits (177), Expect = 4e-12
Identities = 32/81 (39%), Positives = 56/81 (69%)
Query: 115 YNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHS 174
+ ++ FN + S I +R LPD R C+++ Y D++P S++I F+NE + TL+R+VHS
Sbjct: 78 FKINQFNLMASDMIALNRSLPDVRLDGCKTKVYPDDVPNTSVVIVFHNEAWSTLLRTVHS 137
Query: 175 IMDRTDQKHIKEIILVDDYSD 195
+++R+ + + EI+LVDD S+
Sbjct: 138 VINRSPRHLLVEIVLVDDASE 158
>UniRef50_UPI00015B515F Cluster: PREDICTED: similar to
n-acetylgalactosaminyltransferase; n=1; Nasonia
vitripennis|Rep: PREDICTED: similar to
n-acetylgalactosaminyltransferase - Nasonia vitripennis
Length = 826
Score = 256 bits (628), Expect = 9e-67
Identities = 140/284 (49%), Positives = 178/284 (62%), Gaps = 25/284 (8%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
++LL+ +R+GL+RARL GA ++ GDVL+FLD+H EV WL PLL+R+ + + V
Sbjct: 433 VKLLRLDERQGLVRARLKGAKSATGDVLMFLDAHCEVTKQWLEPLLQRIKEKKNAV---- 488
Query: 317 SARAVTPVIDVINADTFEYS----PSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLK 372
VTP+ID I+ +TFEYS PS GGF W HF W N+ + L + + P+K
Sbjct: 489 ----VTPIIDNISEETFEYSHSDEPSFFQVGGFTWSGHFTWINIQEADLKSKTSAISPVK 544
Query: 373 SPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFR 432
SPTMAGGLFAI R+YF IG YD M WGGENLE+SFRIW CGG LE IPCSRVGHVFR
Sbjct: 545 SPTMAGGLFAINRKYFWDIGSYDDKMEGWGGENLEMSFRIWQCGGVLETIPCSRVGHVFR 604
Query: 433 KRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVE---IGDISERKALRER 489
PY +D N+ R+A VWMDDY K++ ++ + IGDI ER LRE+
Sbjct: 605 NFLPYKFPMDKDTHGINTARLANVWMDDY-KRLYYLHREEYKDKPELIGDIKERVNLREK 663
Query: 490 LQCKTFKWYLDNMW---FETDRSELVLGR------TLCLDASNN 524
L+CK+FKWYLDN++ F D + GR LCLD N
Sbjct: 664 LKCKSFKWYLDNVYPEKFIPDENVQAFGRVQVQKGNLCLDNLQN 707
Score = 75.8 bits (178), Expect = 3e-12
Identities = 37/77 (48%), Positives = 52/77 (67%)
Query: 119 AFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDR 178
A N ++S +I R LPD R+ LC++ Y LP ASIII F+NE + L+R+V+S++
Sbjct: 337 AVNIILSNKIPLQRKLPDVRDPLCKNVTYDSVLPSASIIIIFHNEAFSVLLRTVYSVIKE 396
Query: 179 TDQKHIKEIILVDDYSD 195
T K +KEIILVDD S+
Sbjct: 397 TPPKLLKEIILVDDKSN 413
>UniRef50_Q96FL9 Cluster: Polypeptide
N-acetylgalactosaminyltransferase 14; n=30;
Tetrapoda|Rep: Polypeptide
N-acetylgalactosaminyltransferase 14 - Homo sapiens
(Human)
Length = 552
Score = 256 bits (627), Expect = 1e-66
Identities = 139/335 (41%), Positives = 196/335 (58%), Gaps = 24/335 (7%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
++ L+ ++R+GL+R+R+ GAD + G L FLDSH EVN WL PLL R VK Y
Sbjct: 168 VKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHR-------VKEDY 220
Query: 317 SARAVTPVIDVINADTFEYSPSPL-VRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPT 375
+ R V PVID+IN DTF Y S +RGGF+W LHF+W+ L D +P+++P
Sbjct: 221 T-RVVCPVIDIINLDTFTYIESASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPI 279
Query: 376 MAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRR 435
+AGGLF I + +F+ +GKYD M++WGGEN EISFR+WMCGGSLE++PCSRVGHVFRK+
Sbjct: 280 IAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKH 339
Query: 436 PYGV--GEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCK 493
PY G Y ++N+ R A VWMD+Y + P A G++ R LR+ L+C+
Sbjct: 340 PYVFPDGNANTY-IKNTKRTAEVWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQ 398
Query: 494 TFKWYLDNMWFETD---RSELVLG----RTLCLDA---SNNVAP--ILGKCHEMGGTQEW 541
+FKWYL+N++ E S + G R CL++ +N P L C ++ G
Sbjct: 399 SFKWYLENIYPELSIPKESSIQKGNIRQRQKCLESQRQNNQETPNLKLSPCAKVKGEDAK 458
Query: 542 KHKGTASSPIYNTAAGMCLGVDRSYRGETVLMVIC 576
+ +CL V + G V++V+C
Sbjct: 459 SQVWAFTYTQQILQEELCLSVITLFPGAPVVLVLC 493
Score = 90.6 bits (215), Expect = 1e-16
Identities = 50/115 (43%), Positives = 71/115 (61%), Gaps = 2/115 (1%)
Query: 83 QAIKMSKKTENDLEEQFGLIR--NSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNK 140
Q K S +DL +QF R N++ R+ D Y L+AFN S+RI +R +PDTR+
Sbjct: 40 QTPKPSDADWDDLWDQFDERRYLNAKKWRVGDDPYKLYAFNQRESERISSNRAIPDTRHL 99
Query: 141 LCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSD 195
C Y +LP SIII F+NE TL+R++ S+++RT I+EIILVDD+S+
Sbjct: 100 RCTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPTHLIREIILVDDFSN 154
>UniRef50_UPI00015B5D50 Cluster: PREDICTED: similar to
ENSANGP00000021852; n=1; Nasonia vitripennis|Rep:
PREDICTED: similar to ENSANGP00000021852 - Nasonia
vitripennis
Length = 612
Score = 254 bits (623), Expect = 4e-66
Identities = 125/251 (49%), Positives = 164/251 (65%), Gaps = 14/251 (5%)
Query: 258 RLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRYS 317
R+L++ KR GL+ ARL GA+ + G+VL FLD+H E GWL PLL+ +S+
Sbjct: 221 RVLRSDKRVGLVNARLMGANEAKGEVLTFLDAHCECTAGWLEPLLEAISKN--------R 272
Query: 318 ARAVTPVIDVINADTFEYSPS-PLVRGGFNWGLHFKWDNLPKGTLIND--EDFMKPLKSP 374
R V+PVID+IN DTF Y+ S L G FNW LHF+W L G L+ + E+ + P K+P
Sbjct: 273 TRVVSPVIDIINDDTFSYTRSFELHWGAFNWDLHFRWLML-NGALLRERRENIVDPFKTP 331
Query: 375 TMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKR 434
MAGGLF++ REYF +G YD M +WGGENLE+SFR+W CGGS+E+ PCS VGH+FRK
Sbjct: 332 AMAGGLFSMDREYFFELGSYDEHMRIWGGENLELSFRVWQCGGSVEIAPCSHVGHIFRKS 391
Query: 435 RPYGV-GEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHV-EIGDISERKALRERLQC 492
PY G + + N R+A VWMD++ K NP A V + I R LRERL+C
Sbjct: 392 SPYTFPGGVDEILYGNLARVALVWMDEWGKFYFNFNPQAQRVRDKQQIRSRLELRERLKC 451
Query: 493 KTFKWYLDNMW 503
K+F+WYLDN+W
Sbjct: 452 KSFEWYLDNVW 462
Score = 82.2 bits (194), Expect = 4e-14
Identities = 43/108 (39%), Positives = 69/108 (63%), Gaps = 1/108 (0%)
Query: 105 SEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQY-FDELPRASIIICFYNE 163
++D + + + ++ +N L S RI +R LPD R K C ++ +LP S+II F+NE
Sbjct: 110 AKDFQKMQQLFQINRYNLLASDRIPLNRTLPDVRKKKCITRYANLGDLPSTSVIIVFHNE 169
Query: 164 HYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLN 211
+ TL+R+VHS+++R+ +K ++EIILVDD SD L + E V +LN
Sbjct: 170 AWSTLLRTVHSVINRSPRKLLEEIILVDDNSDRDFLRKPLDEYVAQLN 217
>UniRef50_UPI00015B453F Cluster: PREDICTED: similar to GA20875-PA;
n=1; Nasonia vitripennis|Rep: PREDICTED: similar to
GA20875-PA - Nasonia vitripennis
Length = 793
Score = 254 bits (623), Expect = 4e-66
Identities = 142/288 (49%), Positives = 175/288 (60%), Gaps = 25/288 (8%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
++LL+ KR+GLIRARL GA + GDVLVFLD+H EV GWL PLL R +K R
Sbjct: 360 VKLLRLPKRQGLIRARLAGAQQATGDVLVFLDAHCEVTKGWLSPLLHR-------IKARP 412
Query: 317 SARAVTPVIDVINADTFEYS----PSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLK 372
+A + PVIDVI+A T EY S + GGF W F W N+ + P+
Sbjct: 413 NA-VLIPVIDVIDAKTLEYKLAARGSHMPIGGFKWTGDFTWINMEDSPKRTTASPIDPIN 471
Query: 373 SPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFR 432
+PTMAGGLFAI R+YF IG YD M+ WGGENLE+SFRIW CGGS+E++PCSRVGH+FR
Sbjct: 472 TPTMAGGLFAIDRKYFWVIGSYDELMDGWGGENLEMSFRIWQCGGSIEIVPCSRVGHIFR 531
Query: 433 KRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVI--EVNPSAAHVEIGDISERKALRERL 490
PY +D L N+ R A VWMDDY + N EIGD++ RK LRERL
Sbjct: 532 DFFPYEFPSSRDTYLINTARAAHVWMDDYKRLFFLHHKNMEGNTKEIGDLTARKKLRERL 591
Query: 491 QCKTFKWYLDNMW---FETDRSELVLG------RTLCLDA--SNNVAP 527
QC +FKWYL N++ F D + L G R LCLD+ SN+ P
Sbjct: 592 QCASFKWYLQNVYPEKFIPDENVLAYGRARSPRRNLCLDSITSNDEHP 639
Score = 72.1 bits (169), Expect = 4e-11
Identities = 33/81 (40%), Positives = 53/81 (65%)
Query: 115 YNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHS 174
Y+ A N ++S +I R + D R+ LC+S Y +LP S++I F+NE + L+R+V+S
Sbjct: 260 YSKRAVNVVLSNKIPLQRRIRDMRDPLCKSVTYDTKLPTTSVVIIFHNEAWSVLLRTVYS 319
Query: 175 IMDRTDQKHIKEIILVDDYSD 195
++ + K +KEIILVDD S+
Sbjct: 320 VLQESPPKFLKEIILVDDNSN 340
>UniRef50_Q8IXK2 Cluster: Polypeptide
N-acetylgalactosaminyltransferase 12; n=41;
Eumetazoa|Rep: Polypeptide
N-acetylgalactosaminyltransferase 12 - Homo sapiens
(Human)
Length = 581
Score = 253 bits (620), Expect = 8e-66
Identities = 148/344 (43%), Positives = 204/344 (59%), Gaps = 39/344 (11%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
+RL++ +KREGL+RARL GA + GDVL FLD H E + GWL PLL+R+ + V
Sbjct: 197 VRLIRANKREGLVRARLLGASAARGDVLTFLDCHCECHEGWLEPLLQRIHEEESAV---- 252
Query: 317 SARAVTPVIDVINADTFEY---SPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKS 373
V PVIDVI+ +TFEY S P + GGF+W L F W +P+ I + + ++S
Sbjct: 253 ----VCPVIDVIDWNTFEYLGNSGEPQI-GGFDWRLVFTWHTVPERERIRMQSPVDVIRS 307
Query: 374 PTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRK 433
PTMAGGLFA+ ++YF +G YD GM VWGGENLE SFRIW CGG LE PCS VGHVF K
Sbjct: 308 PTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGVLETHPCSHVGHVFPK 367
Query: 434 RRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCK 493
+ PY ++ L NS+R A VWMD++ + NP A GD++ERK LR++LQCK
Sbjct: 368 QAPY----SRNKALANSVRAAEVWMDEFKELYYHRNPRARLEPFGDVTERKQLRDKLQCK 423
Query: 494 TFKWYLDNMWFETDRSE-------LVLGRTL---CLDAS----NNVA---PILGKCHEMG 536
FKW+L+ ++ E E ++ + L C D + N + IL CH MG
Sbjct: 424 DFKWFLETVYPELHVPEDRPGFFGMLQNKGLTDYCFDYNPPDENQIVGHQVILYLCHGMG 483
Query: 537 GTQEWKHKGTASSPI-YNT-AAGMCLGVDRSYRGETVLMVICDD 578
Q +++ T+ I YNT C+ V+ +T++M +C++
Sbjct: 484 QNQFFEY--TSQKEIRYNTHQPEGCIAVEAGM--DTLIMHLCEE 523
Score = 96.3 bits (229), Expect = 2e-18
Identities = 45/109 (41%), Positives = 71/109 (65%), Gaps = 1/109 (0%)
Query: 106 EDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQY-FDELPRASIIICFYNEH 164
E+LR++++ LH N +S RI HR LP+ N LC+ ++Y +D LPR S+II FYNE
Sbjct: 89 EELRLQEESVRLHQINIYLSDRISLHRRLPERWNPLCKEKKYDYDNLPRTSVIIAFYNEA 148
Query: 165 YETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNV 213
+ TL+R+V+S+++ + ++E+ILVDDYSD +L + + L V
Sbjct: 149 WSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLKERLANELSGLPKV 197
>UniRef50_O61394 Cluster: Probable N-acetylgalactosaminyltransferase
6; n=4; Caenorhabditis|Rep: Probable
N-acetylgalactosaminyltransferase 6 - Caenorhabditis
elegans
Length = 618
Score = 252 bits (616), Expect = 3e-65
Identities = 123/254 (48%), Positives = 168/254 (66%), Gaps = 15/254 (5%)
Query: 256 NIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVR 315
+I+++++ +R GLIRAR+ GA + GDVL FLDSH E GWL PLL R ++
Sbjct: 219 DIKIIRSKERVGLIRARMMGAQEAQGDVLTFLDSHCECTKGWLEPLLTR---------IK 269
Query: 316 YSARAVT-PVIDVINADTFEYSPS-PLVRGGFNWGLHFKWDNLPKGTLINDE-DFMKPLK 372
+ +AV PVID+IN +TF+Y + RGGFNW L F+W +P D P++
Sbjct: 270 LNRKAVPCPVIDIINDNTFQYQKGIEMFRGGFNWNLQFRWYGMPTAMAKQHLLDPTGPIE 329
Query: 373 SPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFR 432
SPTMAGGLF+I R YF +G+YDPGM++WGGENLE+SFRIW CGG +E++PCS VGHVFR
Sbjct: 330 SPTMAGGLFSINRNYFEELGEYDPGMDIWGGENLEMSFRIWQCGGRVEILPCSHVGHVFR 389
Query: 433 KRRPYGV-GEKQDYMLQ-NSMRMARVWMDDYVKKVIEVNPSAAHVEIG-DISERKALRER 489
K P+ G+ +L N +R+A VWMDD+ ++ P A + D+SER LR++
Sbjct: 390 KSSPHDFPGKSSGKVLNTNLLRVAEVWMDDWKHYFYKIAPQAHRMRSSIDVSERVELRKK 449
Query: 490 LQCKTFKWYLDNMW 503
L CK+FKWYL N++
Sbjct: 450 LNCKSFKWYLQNVF 463
Score = 87.0 bits (206), Expect = 1e-15
Identities = 39/90 (43%), Positives = 60/90 (66%)
Query: 106 EDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHY 165
E ++ D + ++ FN L+S I R LP+ R C++ Y D LP S+II ++NE Y
Sbjct: 111 EQQKLADSTFAVNQFNLLVSDGISVRRSLPEIRKPSCRNMTYPDNLPTTSVIIVYHNEAY 170
Query: 166 ETLMRSVHSIMDRTDQKHIKEIILVDDYSD 195
TL+R+V S++DR+ ++ +KEIILVDD+SD
Sbjct: 171 STLLRTVWSVIDRSPKELLKEIILVDDFSD 200
>UniRef50_UPI0000E4974C Cluster: PREDICTED: hypothetical protein;
n=3; Strongylocentrotus purpuratus|Rep: PREDICTED:
hypothetical protein - Strongylocentrotus purpuratus
Length = 953
Score = 242 bits (592), Expect = 2e-62
Identities = 133/301 (44%), Positives = 179/301 (59%), Gaps = 28/301 (9%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
IR+++ KR GLI+AR+ G D S G+ FLDSH+EV +GWL PLL RL+ D V
Sbjct: 575 IRMVRAEKRLGLIKARMMGVDASEGETFTFLDSHVEVMIGWLEPLLARLAS--DRTIV-- 630
Query: 317 SARAVTPVIDVINADTFEYS--PSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSP 374
V PV+D IN DTF Y+ P PL RGGFNW ++W +P + + P+KSP
Sbjct: 631 ----VMPVVDEINKDTFNYNVVPEPLQRGGFNWRFEYRWKPIPNYDKRPSK--VAPIKSP 684
Query: 375 TMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKR 434
M GGL + R +F +G +D GM VWGGENLE S +IWMCGGS+E+IPCSRVGHV+R
Sbjct: 685 AMPGGLLTMDRSFFLELGGFDLGMEVWGGENLETSLKIWMCGGSIEIIPCSRVGHVYRDT 744
Query: 435 RPYG-VGEKQ-DYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQC 492
PY +G+ D + N+MR+ VW D++ + P + + GD+S+RK LRE L C
Sbjct: 745 SPYSFLGQNPLDIVEHNAMRVVEVWTDEHKYHFYDRLPMLKNRDFGDVSKRKKLRESLNC 804
Query: 493 KTFKWYLDNMWFE--TDRSELVL-------GRTLCLDAS--NNVA--PILG-KCHEMGGT 538
F WYL N++ E S VL G LC+D++ N A ++G CH +GG
Sbjct: 805 YDFNWYLANVYPELYVPSSSSVLRQTINNKGSKLCIDSNDQNGQAGKNLIGWHCHNLGGN 864
Query: 539 Q 539
+
Sbjct: 865 E 865
>UniRef50_UPI0000D564C6 Cluster: PREDICTED: similar to CG8182-PA,
isoform A; n=1; Tribolium castaneum|Rep: PREDICTED:
similar to CG8182-PA, isoform A - Tribolium castaneum
Length = 545
Score = 241 bits (590), Expect = 4e-62
Identities = 122/261 (46%), Positives = 163/261 (62%), Gaps = 15/261 (5%)
Query: 249 EVKNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQG 308
E + + +RL+ R GLIRARL GA + GDVL+FLD+H E W+ PLL R+ Q
Sbjct: 156 ETRLSSTKLRLIHLKTRMGLIRARLQGARIATGDVLIFLDAHCEATTDWMEPLLSRIEQE 215
Query: 309 VDGVKVRYSARAVTPVIDVINADTFEYSPSPLVR---GGFNWGLHFKWDNLPKGTLINDE 365
V V P+IDVI A+T YS + GGF+W HF W ++ D+
Sbjct: 216 PTAVLV--------PIIDVIEANTLAYSTNGDTSYQVGGFSWSGHFTWIDIQNE---EDK 264
Query: 366 DFMKPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCS 425
+ P+KSPTMAGGLFAI R++F IG YD M+ WGGENLE+SFRIW CGG LE +PCS
Sbjct: 265 HKLTPVKSPTMAGGLFAIDRKFFWEIGSYDEQMDGWGGENLEMSFRIWQCGGRLETVPCS 324
Query: 426 RVGHVFRKRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAA-HVEIGDISERK 484
RVGH+FR PY + +D N+ R+A VWMDDY + P+ + +GD++ RK
Sbjct: 325 RVGHIFRDFHPYSFPDNKDTHGINTARLAHVWMDDYKRFFFMYQPALENNPVVGDLTHRK 384
Query: 485 ALRERLQCKTFKWYLDNMWFE 505
LR++L+CK+FKWYL+N++ E
Sbjct: 385 QLRQKLRCKSFKWYLENVYPE 405
Score = 63.3 bits (147), Expect = 2e-08
Identities = 33/89 (37%), Positives = 56/89 (62%), Gaps = 1/89 (1%)
Query: 106 EDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHY 165
+D + +K A NT++S R+ R L D RN C++ Y +L +AS+++ FYNE
Sbjct: 55 QDAKEGEKALKKFALNTVLSDRMPLDRKLRDPRNPKCKTFTYNPKL-KASVVVIFYNELL 113
Query: 166 ETLMRSVHSIMDRTDQKHIKEIILVDDYS 194
++R+V S++ +T ++ ++EIILVDD S
Sbjct: 114 SVILRTVWSVILQTPKELLEEIILVDDAS 142
>UniRef50_Q9D4M9 Cluster: Putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5; n=9;
Eutheria|Rep: Putative polypeptide
N-acetylgalactosaminyltransferase-like protein 5 - Mus
musculus (Mouse)
Length = 431
Score = 241 bits (590), Expect = 4e-62
Identities = 115/251 (45%), Positives = 163/251 (64%), Gaps = 9/251 (3%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
++L++ KREGLIR+++ GA + GD+LVFLDSH EVN WL PLL +++ V
Sbjct: 177 VKLIRNKKREGLIRSKMIGASRASGDILVFLDSHCEVNRVWLEPLLHAIAKDHKMV---- 232
Query: 317 SARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTM 376
V P+IDVIN T +Y +P+VRG F+W L+ +WDN+ L E P++SP M
Sbjct: 233 ----VCPIIDVINELTLDYMAAPIVRGAFDWNLNLRWDNVFAYELDGPEGPSTPIRSPAM 288
Query: 377 AGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRRP 436
GG+FAI R YFN +G+YD GM++ GGEN+E+S RIWMCGG L ++PCSRVG+ +
Sbjct: 289 TGGIFAINRHYFNELGQYDNGMDICGGENVELSLRIWMCGGQLFILPCSRVGYNSKALSQ 348
Query: 437 YGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCKTFK 496
+ Q + +N +R+ VW+D+Y PS +V G+ISER LR+RL CK+F+
Sbjct: 349 HR-RANQSALSRNLLRVVHVWLDEYKGNFFLQRPSLTYVSCGNISERVELRKRLGCKSFQ 407
Query: 497 WYLDNMWFETD 507
WYLDN++ E +
Sbjct: 408 WYLDNIFPELE 418
Score = 92.3 bits (219), Expect = 3e-17
Identities = 47/132 (35%), Positives = 84/132 (63%), Gaps = 5/132 (3%)
Query: 68 EYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQR 127
E ++ +LK++ + A + ++ +E+ ++ N D + +G + N ++S+R
Sbjct: 36 ENEKEELLKKRSLGKNAHQQTRHSEDVTHDEV----NFSDPELI-QGLRRYGLNAIMSRR 90
Query: 128 IGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEI 187
+G R++PD+R+K+CQ + Y LP ASIIICFYNE + TL+R+V S+++ + Q ++EI
Sbjct: 91 LGIEREVPDSRDKICQQKHYPFNLPTASIIICFYNEEFNTLLRAVSSVVNLSPQHLLEEI 150
Query: 188 ILVDDYSDLYNL 199
ILVDD S+ +L
Sbjct: 151 ILVDDMSEFDDL 162
>UniRef50_Q86SR1 Cluster: Polypeptide
N-acetylgalactosaminyltransferase 10; n=77;
Coelomata|Rep: Polypeptide
N-acetylgalactosaminyltransferase 10 - Homo sapiens
(Human)
Length = 603
Score = 239 bits (586), Expect = 1e-61
Identities = 113/246 (45%), Positives = 157/246 (63%), Gaps = 13/246 (5%)
Query: 256 NIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVR 315
++R+L+T KREGLIR R+ GA + GDV+ FLDSH E NV WLPPLL R++ R
Sbjct: 205 SVRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRIA--------R 256
Query: 316 YSARAVTPVIDVINADTFEYSPSP--LVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKS 373
V P+IDVI+ D F Y +RG F+W +++K +P + D P +S
Sbjct: 257 NRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYK--RIPIPPELQKADPSDPFES 314
Query: 374 PTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRK 433
P MAGGLFA+ R++F +G YDPG+ +WGGE EISF++WMCGG +E IPCSRVGH++RK
Sbjct: 315 PVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRK 374
Query: 434 RRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCK 493
PY V + +N R+A VWMD+Y + + + P H+ GD++ +K LR L CK
Sbjct: 375 YVPYKVPAGVS-LARNLKRVAEVWMDEYAEYIYQRRPEYRHLSAGDVAVQKKLRSSLNCK 433
Query: 494 TFKWYL 499
+FKW++
Sbjct: 434 SFKWFM 439
Score = 89.4 bits (212), Expect = 2e-16
Identities = 44/116 (37%), Positives = 73/116 (62%), Gaps = 4/116 (3%)
Query: 112 DKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRS 171
D+ Y + FN +S +I +R LPD R+ C S++Y + LP SIII F+NE + +L+R+
Sbjct: 105 DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLPNTSIIIPFHNEGWSSLLRT 164
Query: 172 VHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNV----IKKEEEMIET 223
VHS+++R+ + + EI+LVDD+SD +L +++ + +V KK E +I T
Sbjct: 165 VHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPSVRILRTKKREGLIRT 220
Score = 37.9 bits (84), Expect = 0.75
Identities = 21/72 (29%), Positives = 35/72 (48%), Gaps = 6/72 (8%)
Query: 515 RTLCLDASNNVAPI-LGKCHEMGGTQEWKHKGTASSPIYNTAAGMCLGVDRSYRGETVLM 573
+ C DA ++ +P+ L CH M G Q WK++ +Y+ +G C+ D S + M
Sbjct: 520 KKFCFDAISHTSPVTLYDCHSMKGNQLWKYR--KDKTLYHPVSGSCM--DCSESDHRIFM 575
Query: 574 VICDDYS-NNKW 584
C+ S +W
Sbjct: 576 NTCNPSSLTQQW 587
>UniRef50_Q6WV20 Cluster: Polypeptide
N-acetylgalactosaminyltransferase 1; n=5; Diptera|Rep:
Polypeptide N-acetylgalactosaminyltransferase 1 -
Drosophila melanogaster (Fruit fly)
Length = 601
Score = 239 bits (584), Expect = 2e-61
Identities = 123/259 (47%), Positives = 160/259 (61%), Gaps = 18/259 (6%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
+ +L+ R GLIRARL GA + GDVL+FLD+H E N+GW PLL+R+ + V V
Sbjct: 213 VTILRLKNRLGLIRARLAGARIATGDVLIFLDAHCEGNIGWCEPLLQRIKESRTSVLV-- 270
Query: 317 SARAVTPVIDVINADTFEYSPSPLVR---GGFNWGLHFKWDNLPKGTL------INDEDF 367
P+IDVI+A+ F+YS + GGF W HF W NLP+ E
Sbjct: 271 ------PIIDVIDANDFQYSTNGYKSFQVGGFQWNGHFDWINLPEREKQRQRRECKQERE 324
Query: 368 MKPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRV 427
+ P SPTMAGGLFAI R YF +G YD M+ WGGENLE+SFRIW CGG++E IPCSRV
Sbjct: 325 ICPAYSPTMAGGLFAIDRRYFWEVGSYDEQMDGWGGENLEMSFRIWQCGGTIETIPCSRV 384
Query: 428 GHVFRKRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAA-HVEIGDISERKAL 486
GH+FR PY +D N+ RMA VWMD+Y+ P H +IGD++ R L
Sbjct: 385 GHIFRDFHPYKFPNDRDTHGINTARMALVWMDEYINIFFLNRPDLKFHADIGDVTHRVML 444
Query: 487 RERLQCKTFKWYLDNMWFE 505
R++L+CK+F+WYL N++ E
Sbjct: 445 RKKLRCKSFEWYLKNIYPE 463
Score = 77.8 bits (183), Expect = 8e-13
Identities = 46/117 (39%), Positives = 71/117 (60%), Gaps = 3/117 (2%)
Query: 82 QQAIKMS-KKTENDLEEQFGLIRNSEDLRIR-DKGYNLHAFNTLISQRIGDHRDLPDTRN 139
+Q I++ +K + L EQ + S + R D+ Y A N +S+++ +R + D RN
Sbjct: 76 EQIIQLDLQKQKVGLGEQGVAVHLSGAAKERGDEIYKKIALNEELSEQLTYNRSVGDHRN 135
Query: 140 KLCQSQQY-FDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSD 195
LC Q++ D LP AS++I F+NE Y L+R+VHS + ++K +KEIILVDD SD
Sbjct: 136 PLCAKQRFDSDSLPTASVVIIFFNEPYSVLLRTVHSTLSTCNEKALKEIILVDDGSD 192
>UniRef50_Q9U2C4 Cluster: Probable N-acetylgalactosaminyltransferase
9; n=3; Caenorhabditis|Rep: Probable
N-acetylgalactosaminyltransferase 9 - Caenorhabditis
elegans
Length = 579
Score = 238 bits (583), Expect = 3e-61
Identities = 118/250 (47%), Positives = 153/250 (61%), Gaps = 12/250 (4%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
+RL++ R GLIRA+L GA +VGD++VFLDSH E N GWL P+++R+S +
Sbjct: 196 VRLIRKHVRHGLIRAKLAGAREAVGDIIVFLDSHCEANHGWLEPIVQRISDERTAI---- 251
Query: 317 SARAVTPVIDVINADTFEYSPS-PLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPT 375
V P+ID I+ +T Y L GGF+W LHF W+ L + ++SPT
Sbjct: 252 ----VCPMIDSISDNTLAYHGDWSLSTGGFSWALHFTWEGLSEEEQKRRTKPTDYIRSPT 307
Query: 376 MAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRR 435
MAGGL A REYF +G YD M++WGGENLEISFR WMCGGS+E IPCS VGH+FR
Sbjct: 308 MAGGLLAANREYFFEVGGYDEEMDIWGGENLEISFRAWMCGGSIEFIPCSHVGHIFRAGH 367
Query: 436 PY---GVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQC 492
PY G +D NS R+A VWMDDY + ++GD++ R LR+RL C
Sbjct: 368 PYNMTGRNNNKDVHGTNSKRLAEVWMDDYKRLYYMHREDLRTKDVGDLTARHELRKRLNC 427
Query: 493 KTFKWYLDNM 502
K FKW+LDN+
Sbjct: 428 KPFKWFLDNI 437
Score = 64.9 bits (151), Expect = 6e-09
Identities = 31/96 (32%), Positives = 57/96 (59%), Gaps = 1/96 (1%)
Query: 121 NTLISQRIGDHRDLPDTRNKLCQSQQY-FDELPRASIIICFYNEHYETLMRSVHSIMDRT 179
N S +I RD+PD R + C+ +Y + LP+ S+II F +E + L+R+VHS+++R+
Sbjct: 102 NVHASDKISLDRDVPDPRIQACKDIKYDYAALPKTSVIIIFTDEAWTPLLRTVHSVINRS 161
Query: 180 DQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIK 215
+ ++E+IL+DD S L + E + + ++
Sbjct: 162 PPELLQEVILLDDNSKRQELQEPLDEHIKRFGGKVR 197
>UniRef50_Q14435 Cluster: Polypeptide
N-acetylgalactosaminyltransferase 3; n=53;
Euteleostomi|Rep: Polypeptide
N-acetylgalactosaminyltransferase 3 - Homo sapiens
(Human)
Length = 633
Score = 238 bits (582), Expect = 3e-61
Identities = 130/327 (39%), Positives = 194/327 (59%), Gaps = 33/327 (10%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
+++++ +R+GLI ARL GA + + L FLD+H E GWL PLL R+++ Y
Sbjct: 246 VKIVRQRERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLLARIAEN-------Y 298
Query: 317 SARAVTPVIDVINADTFEYS-PSPLV----RGGFNWGLHFKWDNLPKGTLINDEDFMKPL 371
+A V+P I I+ +TFE++ PSP RG F+W L F W++LP +D P+
Sbjct: 299 TA-VVSPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHEKQRRKDETYPI 357
Query: 372 KSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVF 431
K+PT AGGLF+I +EYF IG YD M +WGGEN+E+SFR+W CGG LE++PCS VGHVF
Sbjct: 358 KTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHVF 417
Query: 432 RKRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHV----EIGDISERKALR 487
R + P+ + + +N +R+A VWMD+Y + N AA + GD+S+R ++
Sbjct: 418 RSKSPHSFPKGTQVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKAFGDLSKRFEIK 477
Query: 488 ERLQCKTFKWYLDNMWFET---DRSELV------LGRTLCLDASNN----VAPILGKCHE 534
RL+CK F WYL+N++ E D + ++ +G+ LCLD N I+ CH
Sbjct: 478 HRLRCKNFTWYLNNIYPEVYVPDLNPVISGYIKSVGQPLCLDVGENNQGGKPLIMYTCHG 537
Query: 535 MGGTQEWKHKGTASSPI-YNTAAGMCL 560
+GG Q +++ +A I +N +CL
Sbjct: 538 LGGNQYFEY--SAQHEIRHNIQKELCL 562
Score = 79.0 bits (186), Expect = 3e-13
Identities = 46/111 (41%), Positives = 65/111 (58%), Gaps = 3/111 (2%)
Query: 106 EDLRIRDKGYNLHAFNTLISQRIGDHRDL-PDTRNKLCQSQQY--FDELPRASIIICFYN 162
E+ + +++G H FN S RI HRDL PDTR C Q++ LP S+II F+N
Sbjct: 136 EEQKEKERGEAKHCFNAFASDRISLHRDLGPDTRPPECIEQKFKRCPPLPTTSVIIVFHN 195
Query: 163 EHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNV 213
E + TL+R+VHS++ + +KEIILVDD S LH + E V + + V
Sbjct: 196 EAWSTLLRTVHSVLYSSPAILLKEIILVDDASVDEYLHDKLDEYVKQFSIV 246
>UniRef50_O45293 Cluster: Probable N-acetylgalactosaminyltransferase
8; n=2; Caenorhabditis|Rep: Probable
N-acetylgalactosaminyltransferase 8 - Caenorhabditis
elegans
Length = 421
Score = 236 bits (577), Expect = 1e-60
Identities = 106/250 (42%), Positives = 161/250 (64%), Gaps = 9/250 (3%)
Query: 259 LLKTSK-REGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRYS 317
++K S+ R+GLIRA+++ + + G+V+VF+DSH EV WL PLL+ + + +
Sbjct: 173 IIKRSEYRQGLIRAKVHASRLATGEVIVFMDSHCEVAERWLEPLLQPIKEDPKSI----- 227
Query: 318 ARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTMA 377
V PV+D+IN +F+YSPS + + GF+WG FKW LP E+ +KP SP M
Sbjct: 228 ---VLPVVDLINPVSFDYSPSMVAKSGFDWGFTFKWIYLPWEYFETPENNVKPFNSPAMP 284
Query: 378 GGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRRPY 437
GGL A+ +EYF +G+YD GM +WG EN+E+S + W+CGG + + PCSRVGHVFR RRPY
Sbjct: 285 GGLLAMRKEYFVELGEYDMGMEIWGSENIELSLKAWLCGGRVVVAPCSRVGHVFRMRRPY 344
Query: 438 GVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCKTFKW 497
D L N++R+A+ W+ +Y K V P A + GD++E +++RL+CK KW
Sbjct: 345 TSKPGMDTALYNAVRVAKTWLGEYESKFFAVKPRGAKMVFGDLTEPMQVKDRLKCKDMKW 404
Query: 498 YLDNMWFETD 507
+++N++ E +
Sbjct: 405 FIENVYPELE 414
Score = 56.8 bits (131), Expect = 2e-06
Identities = 29/108 (26%), Positives = 63/108 (58%), Gaps = 4/108 (3%)
Query: 114 GYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVH 173
G AF+ L S+++G +R++ +KLC+ ++Y D S+++ +NE T++R ++
Sbjct: 70 GIKSFAFDALSSEKLGPNRNVGKQAHKLCEEEKY-DASYSTSVVVIHHNEALSTILRMIN 128
Query: 174 SIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEMI 221
I++ T + +KEI+L +D S+ H + + ++K + E+++I
Sbjct: 129 GIIEFTPKSLLKEIVLYEDASE---EDHVLTKHLEKFAKIKGLEDKLI 173
>UniRef50_UPI0000D56CDA Cluster: PREDICTED: similar to CG4445-PA;
n=1; Tribolium castaneum|Rep: PREDICTED: similar to
CG4445-PA - Tribolium castaneum
Length = 602
Score = 233 bits (571), Expect = 7e-60
Identities = 120/250 (48%), Positives = 154/250 (61%), Gaps = 12/250 (4%)
Query: 258 RLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRYS 317
++L++ R GLI+ARL GA + G VL FLD+H E GWL LL + Q V
Sbjct: 213 KVLRSQARIGLIKARLKGALVAKGPVLTFLDAHCECTTGWLEALLSVIKQDRTAV----- 267
Query: 318 ARAVTPVIDVINADTFEYSPS-PLVRGGFNWGLHFKWDNLPKGTL-INDEDFMKPLKSPT 375
V PVID+IN DTF Y S L G FNW L F+W L L + D +P +PT
Sbjct: 268 ---VCPVIDIINDDTFAYVKSFELHWGAFNWNLQFRWFTLGGRELKLRKNDATQPFNTPT 324
Query: 376 MAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRR 435
MAGGLFAI REYF +G YD GMN+WGGENLE+SFRIW CGG +++ PCSRVGH+FRK
Sbjct: 325 MAGGLFAIDREYFFEMGAYDDGMNIWGGENLEMSFRIWQCGGKVQIAPCSRVGHLFRKSS 384
Query: 436 PYGV-GEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVE-IGDISERKALRERLQCK 493
PY G + N R+ARVWMDD+ + + N A ++ +++ R LR + +CK
Sbjct: 385 PYSFPGGINKTLFSNLARVARVWMDDWARFYFKFNEPADRIKNEQNVTSRIELRRKHKCK 444
Query: 494 TFKWYLDNMW 503
F+WYLDN+W
Sbjct: 445 GFEWYLDNVW 454
Score = 69.7 bits (163), Expect = 2e-10
Identities = 40/106 (37%), Positives = 63/106 (59%), Gaps = 1/106 (0%)
Query: 106 EDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQ-QYFDELPRASIIICFYNEH 164
+DL + + ++ FN L S RI +R LPD R K C + + P+ SIII F+NE
Sbjct: 103 KDLLKMQQYFQINRFNLLASDRIPLNRSLPDFRRKKCATLFGDYPTYPKTSIIIVFHNEA 162
Query: 165 YETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKL 210
+ TL+R+V S+++R+ + ++EIILVDD S+ L + + V L
Sbjct: 163 WSTLLRTVWSVINRSPPELLEEIILVDDSSERKFLKKPLDDYVANL 208
>UniRef50_O61397 Cluster: Probable N-acetylgalactosaminyltransferase
7; n=5; Bilateria|Rep: Probable
N-acetylgalactosaminyltransferase 7 - Caenorhabditis
elegans
Length = 601
Score = 232 bits (568), Expect = 2e-59
Identities = 120/287 (41%), Positives = 173/287 (60%), Gaps = 18/287 (6%)
Query: 259 LLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRYSA 318
+++T +REGLI AR GA +S G+V++FLD+H EVN WLPPLL + + + V
Sbjct: 220 VVRTEQREGLINARSIGAKHSTGEVVLFLDAHCEVNTNWLPPLLAPIKRNRKVMTV---- 275
Query: 319 RAVTPVIDVINADTFEY-----SPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKS 373
PVID I+++++EY SP+ G F WGL +K + + + + +P +S
Sbjct: 276 ----PVIDGIDSNSWEYRSVYGSPNAHHSGIFEWGLLYKETQITERETAHRKHNSQPFRS 331
Query: 374 PTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRK 433
PT AGGLFAI R +F +G YD G+ +WGGE E+SF+IW CGG + +PCS VGHV+R
Sbjct: 332 PTHAGGLFAINRLWFKELGYYDEGLQIWGGEQYELSFKIWQCGGGIVFVPCSHVGHVYRS 391
Query: 434 RRPYGVGE--KQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQ 491
PY G+ + + N MR+ + WMDDY K + P A +V GDIS + ALR++LQ
Sbjct: 392 HMPYSFGKFSGKPVISINMMRVVKTWMDDYSKYYLTREPQATNVNPGDISAQLALRDKLQ 451
Query: 492 CKTFKWYLDNMWFETDRSELVLGRTLCLDASNNVAPILGKC-HEMGG 537
CK+FKWY++N+ ++ +S +L + N P GKC MGG
Sbjct: 452 CKSFKWYMENVAYDVLKSYPMLPPNDVWGEARN--PATGKCLDRMGG 496
Score = 74.1 bits (174), Expect = 9e-12
Identities = 32/95 (33%), Positives = 60/95 (63%)
Query: 120 FNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRT 179
FNT +S I +R +PD R + C+ Y ++LP S+++ F+NE + L+R+VHS++ R+
Sbjct: 124 FNTYVSDMISMNRTIPDIRPEECKHWDYPEKLPTVSVVVVFHNEGWTPLLRTVHSVLLRS 183
Query: 180 DQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVI 214
+ I+++++VDD SD +L + + V + N +
Sbjct: 184 PPELIEQVVMVDDDSDKPHLKEKLDKYVTRFNGKV 218
>UniRef50_Q7Z7M9 Cluster: Polypeptide
N-acetylgalactosaminyltransferase 5; n=29;
Deuterostomia|Rep: Polypeptide
N-acetylgalactosaminyltransferase 5 - Homo sapiens
(Human)
Length = 940
Score = 231 bits (565), Expect = 4e-59
Identities = 118/287 (41%), Positives = 177/287 (61%), Gaps = 20/287 (6%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
+R+L+ +R GLIRARL GA N+ GDVL FLDSH+E NVGWL PLL+R+
Sbjct: 557 VRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLS-------- 608
Query: 317 SARAVTPVIDVINADTFEY-SPSPLVRGGFNWGLHFKWDNLPKGTLINDE-DFMKPLKSP 374
+ PVI+VIN Y + RG F W ++F W +P + + ++ P
Sbjct: 609 RKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWRTIPPDVIAKNRIKETDTIRCP 668
Query: 375 TMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKR 434
MAGGLF+I + YF +G YDPG++VWGGEN+E+SF++WMCGG +E+IPCSRVGH+FR
Sbjct: 669 VMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIFRND 728
Query: 435 RPYGV-GEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAH--VEIGDISERKALRERLQ 491
PY ++ + +N +R+A VW+D+Y + +++G++++++ LR++L+
Sbjct: 729 NPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYGHGDHLIDQGLDVGNLTQQRELRKKLK 788
Query: 492 CKTFKWYLDNMWFETDRSELVLGRTLCLDASNNVAPILGKCHEMGGT 538
CK+FKWYL+N+ F R+ +V + + NVA LGKC + T
Sbjct: 789 CKSFKWYLENV-FPDLRAPIVRASGVLI----NVA--LGKCISIENT 828
Score = 70.5 bits (165), Expect = 1e-10
Identities = 34/75 (45%), Positives = 48/75 (64%)
Query: 120 FNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRT 179
FN +S I R + DTR C Q + LP S+I+CF +E + TL+RSVHS+++R+
Sbjct: 464 FNVYLSDLIPVDRAIEDTRPAGCAEQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVINRS 523
Query: 180 DQKHIKEIILVDDYS 194
IKEI+LVDD+S
Sbjct: 524 PPHLIKEILLVDDFS 538
>UniRef50_Q9Y117 Cluster: Polypeptide
N-acetylgalactosaminyltransferase 3; n=2;
Sophophora|Rep: Polypeptide
N-acetylgalactosaminyltransferase 3 - Drosophila
melanogaster (Fruit fly)
Length = 667
Score = 227 bits (556), Expect = 5e-58
Identities = 131/309 (42%), Positives = 173/309 (55%), Gaps = 23/309 (7%)
Query: 248 SEVKNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQ 307
S VK R+ + KR GL+ ARL GA+N+ GDVL FLD+H E + GWL PLL R+ +
Sbjct: 203 SYVKVLAVPTRIFRMKKRSGLVPARLLGAENARGDVLTFLDAHCECSRGWLEPLLSRIKE 262
Query: 308 GVDGVKVRYSARAVTPVIDVINADTFEYSPSPLVR-GGFNWGLHFKW---DNLPKGTLIN 363
V + PVID+I+ D F Y+ + G FNW L F+W D + +
Sbjct: 263 SRKVV--------ICPVIDIISDDNFSYTKTFENHWGAFNWQLSFRWFSSDRKRQTAGNS 314
Query: 364 DEDFMKPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIP 423
+D P+ +P MAGGLFAI R+YF +G YD M VWGGEN+E+SFRIW CGG +E+ P
Sbjct: 315 SKDSTDPIATPGMAGGLFAIDRKYFYEMGSYDSNMRVWGGENVEMSFRIWQCGGRVEISP 374
Query: 424 CSRVGHVFRKRRPYGV-GEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGD--- 479
CS VGHVFR PY G + + N R A VWMDD+ + I + S + D
Sbjct: 375 CSHVGHVFRSSTPYTFPGGMSEVLTDNLARAATVWMDDW-QYFIMLYTSGLTLGAKDKVN 433
Query: 480 ISERKALRERLQCKTFKWYLDNMWFE--TDRSELVLGRTLCLDASNNVAPILGK-CHEMG 536
++ER ALRERLQCK F WYL+N+W E + G+ + LD A K +
Sbjct: 434 VTERVALRERLQCKPFSWYLENIWPEHFFPAPDRFFGKIIWLDGETECAQAYSKHMKNLP 493
Query: 537 G---TQEWK 542
G ++EWK
Sbjct: 494 GRALSREWK 502
Score = 74.9 bits (176), Expect = 5e-12
Identities = 38/96 (39%), Positives = 59/96 (61%)
Query: 115 YNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHS 174
+ L++FN L S RI +R L D R C+ ++Y LP S+II F+NE + L+R++ S
Sbjct: 113 FRLNSFNLLASDRIPLNRTLKDYRTPECRDKKYASGLPSTSVIIVFHNEAWSVLLRTITS 172
Query: 175 IMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKL 210
+++R+ + +KEIILVDD SD L ++ V L
Sbjct: 173 VINRSPRHLLKEIILVDDASDRSYLKRQLESYVKVL 208
>UniRef50_Q176D5 Cluster: N-acetylgalactosaminyltransferase; n=3;
Endopterygota|Rep: N-acetylgalactosaminyltransferase -
Aedes aegypti (Yellowfever mosquito)
Length = 661
Score = 226 bits (553), Expect = 1e-57
Identities = 116/265 (43%), Positives = 153/265 (57%), Gaps = 12/265 (4%)
Query: 242 KKSTENSEVKNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPL 301
K EN K V I +L+ +KREGL+ ARL GA + GD L FLD+H E + GWL PL
Sbjct: 191 KNDLENYVQKLPVV-ISILRLNKREGLVAARLMGARVATGDTLTFLDAHCECSPGWLEPL 249
Query: 302 LKRLSQGVDGVKVRYSARAVTPVIDVINADTFEYSPS-PLVRGGFNWGLHFKWDNLPKGT 360
L R+ + + V PVID+I+ D F Y S G FNW +HF+W L
Sbjct: 250 LARVQEN--------PKKVVCPVIDIISDDNFSYIKSFEFHWGAFNWQMHFRWYTLSDEE 301
Query: 361 LIND-EDFMKPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSL 419
L +D P +P MAGGLF I R+YF +G YD + +WGG+NLE+SFRIW CGG +
Sbjct: 302 LAERRKDTTMPFHTPAMAGGLFTIDRKYFFDVGAYDERLKIWGGDNLEMSFRIWQCGGEI 361
Query: 420 ELIPCSRVGHVFRKRRPYGV-GEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIG 478
E+ PCS VGH+FRK PY G + +N R+A VWMDD+ K + N +
Sbjct: 362 EIAPCSHVGHLFRKSSPYTFPGGVSGILNENLARVALVWMDDWAKFFFKFNKGTEEFKSL 421
Query: 479 DISERKALRERLQCKTFKWYLDNMW 503
++S R AL++ L CK+F WYL +W
Sbjct: 422 NVSSRVALKKHLSCKSFDWYLRKIW 446
Score = 88.6 bits (210), Expect = 4e-16
Identities = 45/114 (39%), Positives = 72/114 (63%)
Query: 101 LIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICF 160
++ ++DL + + + ++ +N L S R+ +R LPD R C S++Y +LP SIII F
Sbjct: 92 VVIQAKDLLLMQQLFQINRYNLLASDRVALNRSLPDVRKSKCVSKEYPSKLPTTSIIIVF 151
Query: 161 YNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVI 214
+NE + L+R+V S++ R+ + IKEI+LVDD SD L +D++ V KL VI
Sbjct: 152 HNEAWSVLLRTVWSVIIRSPRHLIKEILLVDDASDRRFLKNDLENYVQKLPVVI 205
>UniRef50_Q16SH9 Cluster: N-acetylgalactosaminyltransferase; n=2;
Culicidae|Rep: N-acetylgalactosaminyltransferase - Aedes
aegypti (Yellowfever mosquito)
Length = 569
Score = 226 bits (552), Expect = 1e-57
Identities = 105/253 (41%), Positives = 162/253 (64%), Gaps = 13/253 (5%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
+R + REGLIR+R G + GD ++FLDSH EVN GWL PL+ RL+ VD V
Sbjct: 175 VRFHRNFVREGLIRSRNIGVAYASGDFVLFLDSHCEVNRGWLEPLVDRLT--VDSTAV-- 230
Query: 317 SARAVTPVIDVINADTFEYSP-SPLVRGGFNWGLHFKWDNLPKGTLIN-DEDFMKPLKSP 374
++P+ID+I+AD+FEY P S +RGGF+W L F+W + + L + + D +P SP
Sbjct: 231 ----LSPIIDIIDADSFEYRPNSARLRGGFDWSLRFRWLPVAEEELEHRNHDESQPFYSP 286
Query: 375 TMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKR 434
++GG+F + + F +G +D G+ +WGGE+LE S + W+CG +E++PCSR+GHVFR++
Sbjct: 287 AISGGVFIVSKTLFQQLGGFDGGLEIWGGESLEFSLKAWLCGAHVEVVPCSRIGHVFRRK 346
Query: 435 RPYGV--GEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQC 492
PYG G Y L+N+ R+A VWMD++ + P A+ + +G + + K L+ RL C
Sbjct: 347 HPYGFPQGSAATY-LRNTKRIASVWMDEFQNFFYKTRPEASALSVGSLQQMKDLKRRLNC 405
Query: 493 KTFKWYLDNMWFE 505
+ F WY+ N++ +
Sbjct: 406 RKFSWYMQNVFLD 418
Score = 68.9 bits (161), Expect = 3e-10
Identities = 35/90 (38%), Positives = 58/90 (64%), Gaps = 3/90 (3%)
Query: 115 YNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRA-SIIICFYNEHYETLMRSVH 173
Y H FN +S RIG R LPDTR+ C+++++ R S+II F+NE TL+R++
Sbjct: 79 YQRHYFNLELSDRIGVDRTLPDTRHANCKTREFLTSPGRTTSVIITFHNEATSTLLRTIG 138
Query: 174 SIMDRTDQKHIKEIILVDDYSDLYNLHHDV 203
S++ +T + ++EII++DD S +L H++
Sbjct: 139 SVLKQTPPELLQEIIVIDDCST--SLEHNL 166
>UniRef50_Q8MV48 Cluster: N-acetylgalactosaminyltransferase 7; n=5;
Endopterygota|Rep: N-acetylgalactosaminyltransferase 7 -
Drosophila melanogaster (Fruit fly)
Length = 591
Score = 223 bits (545), Expect = 1e-56
Identities = 114/261 (43%), Positives = 160/261 (61%), Gaps = 22/261 (8%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
+++++ +REGLIR R GA + G+V+VFLD+H EVN WLPPLL + Y
Sbjct: 204 VKVIRNKEREGLIRTRSRGAMEATGEVIVFLDAHCEVNTNWLPPLLAPI----------Y 253
Query: 317 SARAV--TPVIDVINADTFEYSP----SPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKP 370
R V P+ID I+ FEY P RG F WG+ +K + +P+ +P
Sbjct: 254 RDRTVMTVPIIDGIDHKNFEYRPVYGTDNHFRGIFEWGMLYKENEVPRREQRRRAHNSEP 313
Query: 371 LKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHV 430
+SPT AGGLFAI REYF +G YDPG+ VWGGEN E+SF+IW CGGS+E +PCSRVGHV
Sbjct: 314 YRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGGSIEWVPCSRVGHV 373
Query: 431 FRKRRPYGVG-----EKQDYMLQNSMRMARVWMDDYVKKVIEV-NPSAAHVEIGDISERK 484
+R PY G +K + N R+ W DD K+ P A ++++GDISE+
Sbjct: 374 YRGFMPYNFGKLASKKKGPLITINYKRVIETWFDDTHKEYFYTREPLARYLDMGDISEQL 433
Query: 485 ALRERLQCKTFKWYLDNMWFE 505
AL++RL CK+F+W++D++ ++
Sbjct: 434 ALKKRLNCKSFQWFMDHIAYD 454
Score = 84.6 bits (200), Expect = 7e-15
Identities = 41/98 (41%), Positives = 59/98 (60%)
Query: 118 HAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMD 177
+ N S I HR + DTR + C+ Y +LPR S+II F+NE + LMR+VHS++D
Sbjct: 108 YGMNIACSDEISMHRSVRDTRLEECRHWDYPFDLPRTSVIIVFHNEGFSVLMRTVHSVID 167
Query: 178 RTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIK 215
R+ + EIILVDD+SD NL + E V + ++K
Sbjct: 168 RSPTHMLHEIILVDDFSDKENLRSQLDEYVLQFKGLVK 205
>UniRef50_UPI0000E461C0 Cluster: PREDICTED: hypothetical protein,
partial; n=1; Strongylocentrotus purpuratus|Rep:
PREDICTED: hypothetical protein, partial -
Strongylocentrotus purpuratus
Length = 639
Score = 222 bits (543), Expect = 2e-56
Identities = 111/254 (43%), Positives = 158/254 (62%), Gaps = 15/254 (5%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
+R+ + R GLIRAR++GA N++GD+L FLDSH EVNVGWL PLL + + V
Sbjct: 356 VRIERLPTRSGLIRARIHGALNAIGDILTFLDSHCEVNVGWLEPLLAVIDKDRRNV---- 411
Query: 317 SARAVTPVIDVINADTFEYSPSPLVR--GGFNWGLHFKWDNLPKGTLINDE-DFMKPLKS 373
VTP IDVI+ + Y S + G F W + F+W + L + + P++S
Sbjct: 412 ----VTPTIDVIDDNDLAYKGSDQLPQVGSFGWTMAFRWTAIQTMDLEEAKRNPTLPIRS 467
Query: 374 PTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRK 433
PTMAGGLF+I + YF +G YDPG +WG EN+E+SF+ WMCGGSL + CS VGH+FRK
Sbjct: 468 PTMAGGLFSIDKGYFMELGMYDPGFQIWGAENIELSFKTWMCGGSLYTMACSHVGHIFRK 527
Query: 434 RRPY-GVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQC 492
PY G+G Y +N+ R+ VW+ D +++P ++ GDI ++ LR++L C
Sbjct: 528 FAPYSGMG---SYFHRNNKRLIEVWLGDARAFYYKLHPDVLRIDAGDIQDQINLRKKLDC 584
Query: 493 KTFKWYLDNMWFET 506
K+F WYLDN++ E+
Sbjct: 585 KSFDWYLDNVFPES 598
Score = 80.6 bits (190), Expect = 1e-13
Identities = 40/98 (40%), Positives = 62/98 (63%)
Query: 112 DKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRS 171
D Y+ +AFN L S I +R LPD R + C+S Y + LP S+II F+NE + L+R+
Sbjct: 251 DALYHKNAFNLLASDMIAFNRSLPDVRPQQCKSLVYPEVLPTTSVIIIFHNEAFSALLRT 310
Query: 172 VHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDK 209
VHS+++R+ + +KEIILVDD S +L + + + +
Sbjct: 311 VHSVINRSPRHLLKEIILVDDASTQEHLKVKLDDYISR 348
>UniRef50_Q6V2D0 Cluster: UDP-N-acetyl-D-galactosamine:polypeptide
N- acetylgalactosaminyltransferase; n=1; Echinococcus
granulosus|Rep: UDP-N-acetyl-D-galactosamine:polypeptide
N- acetylgalactosaminyltransferase - Echinococcus
granulosus
Length = 659
Score = 221 bits (540), Expect = 4e-56
Identities = 114/274 (41%), Positives = 159/274 (58%), Gaps = 9/274 (3%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
+R+++ +R GLIRARL GA + DV++FLD+H E WL PLL R+ Q D V
Sbjct: 217 VRIVRLPQRTGLIRARLEGAKAATADVIIFLDAHCEATYRWLEPLLYRIWQKPDAVVCPA 276
Query: 317 SARAVTPVIDVINADTFEYSPS---PLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKS 373
A + + D Y+ L G F W F +++ P+ +++ + + ++S
Sbjct: 277 IANIDRFTLKIFRTDV-RYTEDGWLSLRVGSFAWDGMFVFEHPPRSSVVKRQSNVDTIES 335
Query: 374 PTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRK 433
TM GGLFAI+R+YF +G YD GM +WGGENLE+SFRIW CGGSLE PCS VGHV+R
Sbjct: 336 LTMPGGLFAIHRDYFFKLGGYDDGMEIWGGENLELSFRIWQCGGSLEFSPCSTVGHVYRA 395
Query: 434 RRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCK 493
PY K+DY N+ RMA VWMD Y + +++ GD+S+RK LR L C+
Sbjct: 396 IHPYSFPSKKDYNGYNTARMAEVWMDMYKENFYLARGDIKNMDYGDVSKRKKLRNDLGCR 455
Query: 494 TFKWYLDNM---WFETDRSELVLGRTLCLDASNN 524
F+W+LDN+ F R+ L G C +A N+
Sbjct: 456 NFQWFLDNIAPHKFVYSRNRLGYGS--CCNAENH 487
Score = 60.5 bits (140), Expect = 1e-07
Identities = 37/107 (34%), Positives = 60/107 (56%), Gaps = 6/107 (5%)
Query: 102 IRNSEDLRIRDK-GYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICF 160
I + E R+ D GYN HA + R HR K C + Y D+LP AS+I+ F
Sbjct: 107 ISDEEMKRVNDADGYNSHACKLVALDRSLGHRPA-----KECLAVIYPDKLPTASVILIF 161
Query: 161 YNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAV 207
+NE + ++R+V S+++R+ +KE+IL+DD S +L ++ + V
Sbjct: 162 FNEPFRLIIRTVFSVVNRSPPALLKEVILLDDGSTQSDLLDNLDKFV 208
>UniRef50_Q17NN8 Cluster: N-acetylgalactosaminyltransferase; n=4;
Endopterygota|Rep: N-acetylgalactosaminyltransferase -
Aedes aegypti (Yellowfever mosquito)
Length = 613
Score = 219 bits (534), Expect = 2e-55
Identities = 108/247 (43%), Positives = 154/247 (62%), Gaps = 12/247 (4%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
++L+ R GLI ARL GA + GDVL+ LDSH EVNV WLPPL++ +++
Sbjct: 215 VKLISLPVRSGLITARLTGAKAATGDVLIVLDSHTEVNVNWLPPLIEPIAEDY------- 267
Query: 317 SARAVTPVIDVINADTFEY-SPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPT 375
V P IDVI DTF+Y + RG F+W +K L ++ D +P +SP
Sbjct: 268 -RTCVCPFIDVIAHDTFQYRAQDEGKRGAFDWKFLYKRLPLRAQDMV---DPTEPFESPI 323
Query: 376 MAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRR 435
MAGGLFAI ++F +G YD G+++WGGE E+SF++W CGG + PCSRVGHV+R
Sbjct: 324 MAGGLFAISAKFFWELGGYDEGLDIWGGEQYELSFKVWQCGGRMVDAPCSRVGHVYRGYA 383
Query: 436 PYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCKTF 495
P+ +++ +N R+A VWMD+Y + + E NP + GD++++KALRERLQCK F
Sbjct: 384 PFPNPRGTNFVTRNFKRVAEVWMDEYKQFLYERNPQFDQTDAGDLTKQKALRERLQCKPF 443
Query: 496 KWYLDNM 502
KW+L+ +
Sbjct: 444 KWFLEEV 450
Score = 63.7 bits (148), Expect = 1e-08
Identities = 27/63 (42%), Positives = 45/63 (71%)
Query: 145 QQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQ 204
++Y ELP S+I+ FYNEH+ TL+R+V+S+++R+ +KEI+LV+D+S L +Q
Sbjct: 145 KRYLQELPTVSVIVIFYNEHWSTLLRTVYSVLNRSPSHLLKEIVLVNDHSTKEFLWEPLQ 204
Query: 205 EAV 207
+ V
Sbjct: 205 DFV 207
>UniRef50_O45947 Cluster: Putative polypeptide
N-acetylgalactosaminyltransferase 10; n=4;
Caenorhabditis|Rep: Putative polypeptide
N-acetylgalactosaminyltransferase 10 - Caenorhabditis
elegans
Length = 622
Score = 217 bits (530), Expect = 7e-55
Identities = 106/261 (40%), Positives = 159/261 (60%), Gaps = 14/261 (5%)
Query: 251 KNNVFNI-RLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGV 309
KN + +I ++L+T KREGLIR R GA ++ G++L+FLD+H E N WLPPLL +++
Sbjct: 214 KNKIDHIVKVLRTKKREGLIRGRQLGAQDATGEILIFLDAHSEANYNWLPPLLDPIAEDY 273
Query: 310 DGVKVRYSARAVTPVIDVINADTFEYSPSPL-VRGGFNWGLHFKWDNLPKGTLINDEDFM 368
V V P +DVI+ +T+E P RG F+W ++K LP T + E
Sbjct: 274 RTV--------VCPFVDVIDCETYEVRPQDEGARGSFDWAFNYK--RLPL-TKKDRESPT 322
Query: 369 KPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVG 428
KP SP MAGG FAI ++F +G YD G+++WGGE E+SF++W C G + PCSRV
Sbjct: 323 KPFNSPVMAGGYFAISAKWFWELGGYDEGLDIWGGEQYELSFKVWQCHGRMVDAPCSRVA 382
Query: 429 HVFR-KRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALR 487
H++R K P+ D++ +N R+A VWMDDY + + + P + + GD+ K +R
Sbjct: 383 HIYRCKYAPFKNAGMGDFVSRNYKRVAEVWMDDYKETLYKHRPGVGNADAGDLKLMKGIR 442
Query: 488 ERLQCKTFKWYLDNMWFETDR 508
E+LQCK+F W++ + F+ D+
Sbjct: 443 EKLQCKSFDWFMKEIAFDQDK 463
Score = 79.8 bits (188), Expect = 2e-13
Identities = 37/103 (35%), Positives = 67/103 (65%), Gaps = 2/103 (1%)
Query: 115 YNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHS 174
Y + +N IS I +R + D R+K C++ Y +LP S+I F+ EH TL+RSV+S
Sbjct: 120 YKANGYNAYISDMISLNRSIKDIRHKECKNMMYSAKLPTVSVIFPFHEEHNSTLLRSVYS 179
Query: 175 IMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAV--DKLNNVIK 215
+++R+ + +KEIILVDD+S+ L +++ + +K+++++K
Sbjct: 180 VINRSPPELLKEIILVDDFSEKPALRQPLEDFLKKNKIDHIVK 222
Score = 46.4 bits (105), Expect = 0.002
Identities = 30/75 (40%), Positives = 41/75 (54%), Gaps = 6/75 (8%)
Query: 514 GRTLCLDASNNV--AP-ILGKCHEMGGTQEWKHKGTASSPIYNTAAGMCLGVDRSYRGET 570
GR +C D S +V AP IL CH M G Q +K++ A IY+ +G CL D + +G
Sbjct: 528 GRKICFDCSTSVDKAPVILFDCHSMKGNQLFKYR-VAQKQIYHPISGQCLTADENGKG-F 585
Query: 571 VLMVICDDYSN-NKW 584
+ M CD S+ KW
Sbjct: 586 LHMKKCDSSSDLQKW 600
>UniRef50_Q9HCQ5 Cluster: Polypeptide
N-acetylgalactosaminyltransferase 9; n=51;
Euteleostomi|Rep: Polypeptide
N-acetylgalactosaminyltransferase 9 - Homo sapiens
(Human)
Length = 603
Score = 215 bits (525), Expect = 3e-54
Identities = 106/252 (42%), Positives = 151/252 (59%), Gaps = 13/252 (5%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
+++++ S+REGLIRARL G + V+ F D+H+E N GW P L R+ +
Sbjct: 214 VKIVRNSRREGLIRARLQGWKAATAPVVGFFDAHVEFNTGWAEPALSRIRED-------- 265
Query: 317 SARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTM 376
R V P ID I TFE G+NWGL + +P ++ D P+++P M
Sbjct: 266 RRRIVLPAIDNIKYSTFEVQQYANAAHGYNWGLRCMYI-IPPQDWLDRGDESAPIRTPAM 324
Query: 377 AGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRRP 436
G F + REYF IG DPGM V+GGEN+E+ R+W CGGS+E++PCSRV H+ R R+P
Sbjct: 325 IGCSFVVDREYFGDIGLLDPGMEVYGGENVELGMRVWQCGGSMEVLPCSRVAHIERTRKP 384
Query: 437 YGVGEKQDYMLQNSMRMARVWMDDYVKKVI---EVNPSAAHVEIGDISERKALRERLQCK 493
Y + Y +N++R A VWMDD+ V + S V+ GD+SER ALR+RL+C+
Sbjct: 385 YN-NDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMSNPGVDFGDVSERLALRQRLKCR 443
Query: 494 TFKWYLDNMWFE 505
+FKWYL+N++ E
Sbjct: 444 SFKWYLENVYPE 455
Score = 79.8 bits (188), Expect = 2e-13
Identities = 35/95 (36%), Positives = 57/95 (60%)
Query: 115 YNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHS 174
Y + +N +S RI R +PD R + C+ Y +LP+ S++ F NE ++RSVHS
Sbjct: 114 YEEYGYNAQLSDRISLDRSIPDYRPRKCRQMSYAQDLPQVSVVFIFVNEALSVILRSVHS 173
Query: 175 IMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDK 209
+++ T + +KE+ILVDD SD L ++ + V+K
Sbjct: 174 VVNHTPSQLLKEVILVDDNSDNVELKFNLDQYVNK 208
Score = 34.3 bits (75), Expect = 9.2
Identities = 22/70 (31%), Positives = 29/70 (41%), Gaps = 1/70 (1%)
Query: 518 CL-DASNNVAPILGKCHEMGGTQEWKHKGTASSPIYNTAAGMCLGVDRSYRGETVLMVIC 576
CL D P L KC ++ + T S PI + A G CL V+ S L ++
Sbjct: 525 CLVDDGTGRMPTLKKCEDVARPTQRLWDFTQSGPIVSRATGRCLEVEMSKDANFGLRLVV 584
Query: 577 DDYSNNKWDI 586
S KW I
Sbjct: 585 QRCSGQKWMI 594
>UniRef50_Q5TWJ3 Cluster: ENSANGP00000028412; n=1; Anopheles gambiae
str. PEST|Rep: ENSANGP00000028412 - Anopheles gambiae
str. PEST
Length = 523
Score = 213 bits (521), Expect = 8e-54
Identities = 119/332 (35%), Positives = 182/332 (54%), Gaps = 35/332 (10%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
+R+L+T KR GLI R++GA + D L+FLD+H E GWL PLL+ ++ + KV
Sbjct: 127 VRILRTPKRLGLITGRIFGAKRASADYLLFLDAHCECLAGWLEPLLELVASNQENRKV-- 184
Query: 317 SARAVTPVIDVINADTF--EYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSP 374
P ID +N T + S + G F+W L F+W E+ ++P +P
Sbjct: 185 ---VAVPTIDWLNETTLALQVGASSGLYGAFDWNLSFQWRPRYDRLQAPQENLLEPFDTP 241
Query: 375 TMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKR 434
MAGGLF I + +F +G YDPG+ V+GGEN+E+SF++WMCGG++ +PCS V H+ ++
Sbjct: 242 VMAGGLFCIEKAFFAQLGWYDPGLQVYGGENMELSFKVWMCGGAIRTVPCSHVAHIQKRN 301
Query: 435 RPY--GVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPS--------AAH-VEIGDISER 483
PY +++D ++NS+R+A VWMD+Y + + ++P +H + ++ R
Sbjct: 302 NPYIGSYTKERDLTMRNSLRVAEVWMDEYAEFLYRLHPDYRALLASRTSHSLSNVNLDAR 361
Query: 484 KALRERLQCKTFKWYLDNMWFETD-----------RSELVLGRTLCLD-ASNNVAPILGK 531
+ LR L CK+F+WYL +++ E D R E G+ LCL + + L
Sbjct: 362 RQLRSELGCKSFRWYLQHVFPEQDDPSEAQAAGWIRHENEAGQ-LCLTWPMRDRSLALLH 420
Query: 532 CHEMGGTQEWKHKGTASSPIYNTAAGMCLGVD 563
CH +GG Q W H+ T G CLGVD
Sbjct: 421 CHGLGGQQIWFHRKTGEI----AREGHCLGVD 448
Score = 70.9 bits (166), Expect = 9e-11
Identities = 32/86 (37%), Positives = 54/86 (62%), Gaps = 1/86 (1%)
Query: 114 GYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVH 173
G+ +N +S I R+LPD R+ C+ ++ LP SI+I F++E L+R+VH
Sbjct: 28 GWQRQGYNQFVSDLISVRRELPDVRDPWCRDRKR-SALPPVSIVIVFHDEALSVLLRTVH 86
Query: 174 SIMDRTDQKHIKEIILVDDYSDLYNL 199
S+++RT + ++EI+L+DD+S L L
Sbjct: 87 SVLNRTPPELVQEILLIDDWSSLVQL 112
>UniRef50_Q6WV16 Cluster: N-acetylgalactosaminyltransferase 6; n=4;
Diptera|Rep: N-acetylgalactosaminyltransferase 6 -
Drosophila melanogaster (Fruit fly)
Length = 666
Score = 210 bits (513), Expect = 8e-53
Identities = 107/252 (42%), Positives = 153/252 (60%), Gaps = 14/252 (5%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
+R+++ +R GLI AR GA N+ +VL+FLDSH+E N WLPPLL+ ++
Sbjct: 264 VRVVRLPRRTGLIGARAAGARNATAEVLIFLDSHVEANYNWLPPLLEPIALN-------- 315
Query: 317 SARAVTPVIDVINADTFEY-SPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPT 375
AV P IDVI+ F Y + RG F+W +K L L + D P KSP
Sbjct: 316 KRTAVCPFIDVIDHTNFHYRAQDEGARGAFDWEFFYKRLPLLPEDLKHPAD---PFKSPI 372
Query: 376 MAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRR 435
MAGGLFAI RE+F +G YD G+++WGGE E+SF+IWMCGG + PCSR+GH++R R
Sbjct: 373 MAGGLFAISREFFWELGGYDEGLDIWGGEQYELSFKIWMCGGEMYDAPCSRIGHIYRGPR 432
Query: 436 PYGVG-EKQDYMLQNSMRMARVWMDDYVKKVIEVNPSA-AHVEIGDISERKALRERLQCK 493
+ K DY+ +N R+A VWMD+Y + V+ GD++E+KA+R +L CK
Sbjct: 433 NHQPSPRKGDYLHKNYKRVAEVWMDEYKNYLYSHGDGLYESVDPGDLTEQKAIRTKLNCK 492
Query: 494 TFKWYLDNMWFE 505
+FKW+++ + F+
Sbjct: 493 SFKWFMEEVAFD 504
Score = 89.0 bits (211), Expect = 3e-16
Identities = 43/93 (46%), Positives = 64/93 (68%), Gaps = 1/93 (1%)
Query: 104 NSEDLRIRDKGYNL-HAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYN 162
+ E R +K +L + FN L+S I +R +PD R+ LC+ ++Y +LP S+II FYN
Sbjct: 153 DDESQRDLEKRMSLENGFNALLSDSISVNRSVPDIRHPLCRKKEYVAKLPTVSVIIIFYN 212
Query: 163 EHYETLMRSVHSIMDRTDQKHIKEIILVDDYSD 195
E+ LMRSVHS+++R+ + +KEIILVDD+SD
Sbjct: 213 EYLSVLMRSVHSLINRSPPELMKEIILVDDHSD 245
>UniRef50_Q17M60 Cluster: N-acetylgalactosaminyltransferase; n=1;
Aedes aegypti|Rep: N-acetylgalactosaminyltransferase -
Aedes aegypti (Yellowfever mosquito)
Length = 552
Score = 209 bits (510), Expect = 2e-52
Identities = 104/254 (40%), Positives = 153/254 (60%), Gaps = 12/254 (4%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
+R+L++ +R GLI+ARL GA N+ ++L FLD+H E GWL P L R+++ V +
Sbjct: 177 VRILRSPQRLGLIKARLMGARNATTEILTFLDAHCECTTGWLEPQLDRVARNPTTVAI-- 234
Query: 317 SARAVTPVIDVINADTFEY--SPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSP 374
P ID ++ + + S + G +WGL F W + E+ ++P +P
Sbjct: 235 ------PTIDWVDEHNLAFIANRSHIYYGACDWGLQFGWRGR-WDRKVKPENKLEPFPTP 287
Query: 375 TMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKR 434
MAGGLF+I + +F IG YD G+ ++GGEN+E+S + WMCGG LE IPCSRVGH+ +
Sbjct: 288 IMAGGLFSINKTFFAHIGWYDEGLGIYGGENVELSLKAWMCGGRLETIPCSRVGHIQKAG 347
Query: 435 RPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEV-NPSAAHVEIGDISERKALRERLQCK 493
PY G K D++ S+R+A VWMD Y + V ++ GD+S+RK LRE L CK
Sbjct: 348 HPYLDGVKTDWVRVGSVRVAEVWMDQYAQVVYDMFGGPEFRGNFGDVSDRKKLRESLNCK 407
Query: 494 TFKWYLDNMWFETD 507
+FKWYL+N + E +
Sbjct: 408 SFKWYLENAFPELE 421
Score = 70.1 bits (164), Expect = 2e-10
Identities = 33/97 (34%), Positives = 59/97 (60%), Gaps = 1/97 (1%)
Query: 110 IRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQ-QYFDELPRASIIICFYNEHYETL 168
+ ++ + A N S I HR LPD R+ C+ + + + LP +++I F+NE + L
Sbjct: 73 VMERQFKTFALNEYASALISAHRRLPDYRDPWCKVKGRIMEHLPETTVVIVFFNEPWSVL 132
Query: 169 MRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQE 205
+R+V+S++DR+ + IKE++LVDD S + + +QE
Sbjct: 133 VRTVYSVLDRSPPELIKEVLLVDDCSFMPHTKTQLQE 169
>UniRef50_Q8N3T1 Cluster: Polypeptide
N-acetylgalactosaminyltransferase-like protein 2; n=21;
Amniota|Rep: Polypeptide
N-acetylgalactosaminyltransferase-like protein 2 - Homo
sapiens (Human)
Length = 639
Score = 207 bits (506), Expect = 6e-52
Identities = 108/267 (40%), Positives = 164/267 (61%), Gaps = 13/267 (4%)
Query: 248 SEVKNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQ 307
SE + ++LL+++KR G IRAR+ GA + GDVLVF+D+H E + GWL PLL R++
Sbjct: 243 SEYVARLEGVKLLRSNKRLGAIRARMLGATRATGDVLVFMDAHCECHPGWLEPLLSRIAG 302
Query: 308 GVDGVKVRYSARAVTPVIDVINADTFEYSPSP-LVRGGFNWGLHFKWDNLPKGTLINDED 366
+R V+PVIDVI+ TF+Y PS L RG +W L F W+ LP+ +
Sbjct: 303 D--------RSRVVSPVIDVIDWKTFQYYPSKDLQRGVLDWKLDFHWEPLPEHVRKALQS 354
Query: 367 FMKPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSR 426
+ P++SP + G + A+ R YF G YD M++ GGENLE+SF+ W+CGGS+E++PCSR
Sbjct: 355 PISPIRSPVVPGEVVAMDRHYFQNTGAYDSLMSLRGGENLELSFKAWLCGGSVEILPCSR 414
Query: 427 VGHVFRKRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSA---AHVEIGDISER 483
VGH+++ + + + Q+ L+N +R+A W+ + + + +P A + E D ER
Sbjct: 415 VGHIYQNQDSHSPLD-QEATLRNRVRIAETWLGSFKETFYKHSPEAFSLSKAEKPDCMER 473
Query: 484 KALRERLQCKTFKWYLDNMWFETDRSE 510
L+ RL C+TF W+L N++ E SE
Sbjct: 474 LQLQRRLGCRTFHWFLANVYPELYPSE 500
Score = 81.0 bits (191), Expect = 8e-14
Identities = 41/90 (45%), Positives = 56/90 (62%)
Query: 124 ISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKH 183
+S RI R LP+ R+ LC Q D LP AS+I+CF++E + TL+R+VHSI+D +
Sbjct: 163 LSARIPLQRALPEVRHPLCLQQHPQDSLPTASVILCFHDEAWSTLLRTVHSILDTVPRAF 222
Query: 184 IKEIILVDDYSDLYNLHHDVQEAVDKLNNV 213
+KEIILVDD S L + E V +L V
Sbjct: 223 LKEIILVDDLSQQGQLKSALSEYVARLEGV 252
>UniRef50_Q7TT15-2 Cluster: Isoform 2 of Q7TT15 ; n=9; Mammalia|Rep:
Isoform 2 of Q7TT15 - Mus musculus (Mouse)
Length = 596
Score = 206 bits (504), Expect = 1e-51
Identities = 113/294 (38%), Positives = 168/294 (57%), Gaps = 24/294 (8%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
+++++ KREGLIRAR+ G + G V F D+H+E GW P+L R+ +
Sbjct: 213 VKVVRNQKREGLIRARIEGWKAATGQVTGFFDAHVEFTAGWAEPVLSRIQEN-------- 264
Query: 317 SARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTM 376
R + P ID I D FE G++W L + + PK + D P+++P M
Sbjct: 265 RKRVILPSIDNIKQDNFEVQRYENSAHGYSWELWCMYISPPKDWW-DAGDPSLPIRTPAM 323
Query: 377 AGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRRP 436
G F + R++F IG DPGM+V+GGEN+E+ ++W+CGGS+E++PCSRV H+ RK++P
Sbjct: 324 IGCSFVVNRKFFGEIGLLDPGMDVYGGENIELGIKVWLCGGSMEVLPCSRVAHIERKKKP 383
Query: 437 YGVGEKQDYMLQNSMRMARVWMDDYVKKV-IEVNPSAAH--VEIGDISERKALRERLQCK 493
Y Y +N++R+A VWMDDY V I N + ++IGD+SERKALR+ L+CK
Sbjct: 384 YN-SNIGFYTKRNALRVAEVWMDDYKSHVYIAWNLPLENPGIDIGDVSERKALRKSLKCK 442
Query: 494 TFKWYLDNMWFETDR--SELVLG-------RTLCLDAS--NNVAPILGKCHEMG 536
F+WYLD+++ E R + + G + +CLD N IL CH G
Sbjct: 443 NFQWYLDHVYPEMRRYNNTIAYGELRNNKAKDVCLDQGPLENHTAILYPCHGWG 496
Score = 68.5 bits (160), Expect = 5e-10
Identities = 32/75 (42%), Positives = 47/75 (62%)
Query: 118 HAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMD 177
+ +N+ +S++I R +PD R C+ +Y ELP+ SII F NE ++RSVHS ++
Sbjct: 118 YGYNSYLSEKISLDRSIPDYRPTKCKELKYSKELPQISIIFIFVNEALSVILRSVHSAVN 177
Query: 178 RTDQKHIKEIILVDD 192
T +KEIILVDD
Sbjct: 178 HTPTHLLKEIILVDD 192
>UniRef50_UPI0000E4710F Cluster: PREDICTED: similar to
pp-GalNAc-transferase 17; n=2; Strongylocentrotus
purpuratus|Rep: PREDICTED: similar to
pp-GalNAc-transferase 17 - Strongylocentrotus purpuratus
Length = 315
Score = 205 bits (501), Expect = 2e-51
Identities = 105/253 (41%), Positives = 148/253 (58%), Gaps = 11/253 (4%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
++L + SKREGLIR+R++GA+ S G VL FLD+H E + WL PLL ++ V
Sbjct: 38 VKLYRNSKREGLIRSRIFGAEQSRGQVLTFLDAHCECSPNWLVPLLTEIALNRTTV---- 93
Query: 317 SARAVTPVIDVINADTFEYSPSP--LVRGGFNWGLHFKWDNLPKGTLINDEDFM-KPLKS 373
V P +D I+AD FEY L RG +W +K + + +P S
Sbjct: 94 ----VCPTVDSISADNFEYRSQGDGLCRGAMDWDFWYKRIPVDLSRQRLGLKYQSEPYDS 149
Query: 374 PTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRK 433
P MAGGLFA+ RE+F +G YDPG+ +WGGEN EISF+ WMCGGSL+ +PCSRVGHV+RK
Sbjct: 150 PMMAGGLFALDREFFFELGGYDPGLQIWGGENFEISFKAWMCGGSLKFVPCSRVGHVYRK 209
Query: 434 RRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCK 493
PY + + N MR+A VW+D++ + P GDI E+ R+ K
Sbjct: 210 GVPYTYPDSGVPGVSNYMRVAEVWLDEFKEFFYTSRPDLRGKPYGDIGEQIRFRKHHCPK 269
Query: 494 TFKWYLDNMWFET 506
+FKW+++ + F++
Sbjct: 270 SFKWFMEEVAFDS 282
>UniRef50_UPI0000E46551 Cluster: PREDICTED: hypothetical protein,
partial; n=1; Strongylocentrotus purpuratus|Rep:
PREDICTED: hypothetical protein, partial -
Strongylocentrotus purpuratus
Length = 325
Score = 204 bits (497), Expect = 7e-51
Identities = 88/144 (61%), Positives = 110/144 (76%)
Query: 373 SPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFR 432
SPTMAGGLFA+ REYF+ +G YD GM++WGGENLEISFRIW CGG LE++PCSRVGHVFR
Sbjct: 1 SPTMAGGLFAMNREYFHKLGDYDEGMDIWGGENLEISFRIWQCGGKLEIVPCSRVGHVFR 60
Query: 433 KRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQC 492
KRRPYG +QD +N++R+A VWMD+Y + +V P A +++ GDIS R ALRE L+C
Sbjct: 61 KRRPYGSPNRQDTTTKNAVRVAEVWMDEYKEHFYQVQPKAKNIDYGDISSRVALREELKC 120
Query: 493 KTFKWYLDNMWFETDRSELVLGRT 516
K+FKWYLD ++ E GRT
Sbjct: 121 KSFKWYLDTVYPEMRTPNDTKGRT 144
Score = 52.8 bits (121), Expect = 2e-05
Identities = 29/86 (33%), Positives = 44/86 (51%), Gaps = 6/86 (6%)
Query: 503 WFETDRSELVLGRTLCLDAS-NNVA--PILGKCHEMGGTQEWKHKGTASSPIYNTAAGMC 559
W++T EL LG +C+D S +N A P L KC MGG+Q W+ K IY+ +G C
Sbjct: 208 WYQTSVEELKLGDAICMDMSESNSASLPQLRKCDGMGGSQRWRIK---DKNIYHPVSGQC 264
Query: 560 LGVDRSYRGETVLMVICDDYSNNKWD 585
L + + + + IC +W+
Sbjct: 265 LSIKQLGSIQMAQLDICSSDPMQEWE 290
>UniRef50_Q6P9A2 Cluster: Putative polypeptide
N-acetylgalactosaminyltransferase-like protein 4; n=28;
Euteleostomi|Rep: Putative polypeptide
N-acetylgalactosaminyltransferase-like protein 4 - Homo
sapiens (Human)
Length = 607
Score = 202 bits (494), Expect = 2e-50
Identities = 110/293 (37%), Positives = 170/293 (58%), Gaps = 25/293 (8%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
I++++ SK+EGLIR+R+ G + V+ D+H+E NVGW P+L R+ +
Sbjct: 220 IKVVRHSKQEGLIRSRVSGWRAATAPVVALFDAHVEFNVGWAEPVLTRIKEN-------- 271
Query: 317 SARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTM 376
R ++P D I D FE PL GF+W L ++ N PK E+ P++SP +
Sbjct: 272 RKRIISPSFDNIKYDNFEIEEYPLAAQGFDWELWCRYLNPPKAWW-KLENSTAPIRSPAL 330
Query: 377 AGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRRP 436
G F + R+YF IG D GM V+GGEN+E+ R+W CGGS+E++PCSR+ H+ R +P
Sbjct: 331 IG-CFIVDRQYFQEIGLLDEGMEVYGGENVELGIRVWQCGGSVEVLPCSRIAHIERAHKP 389
Query: 437 YGVGEKQDYMLQNSMRMARVWMDDYVKKVI---EVNPSAAHVEIGDISERKALRERLQCK 493
Y + ++ +N++R+A VWMD++ V + + ++IGDI+ RKALR++LQCK
Sbjct: 390 Y-TEDLTAHVRRNALRVAEVWMDEFKSHVYMAWNIPQEDSGIDIGDITARKALRKQLQCK 448
Query: 494 TFKWYLDNMWFETDR-SELV--------LGRTLCLDASNNV--APILGKCHEM 535
TF+WYL +++ E S+++ L LCLD + PI+ CH M
Sbjct: 449 TFRWYLVSVYPEMRMYSDIIAYGVLQNSLKTDLCLDQGPDTENVPIMYICHGM 501
Score = 90.2 bits (214), Expect = 1e-16
Identities = 50/141 (35%), Positives = 75/141 (53%)
Query: 72 KVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDH 131
K ++E AK + + T++ L +G + E R+ K + + +N +S R+
Sbjct: 74 KQHIQEAPAKPEEAEAEPFTDSSLFAHWGQELSPEGRRVALKQFQYYGYNAYLSDRLPLD 133
Query: 132 RDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVD 191
R LPD R C++ + D LP SI+ F NE L+RS+HS M+RT +KEIILVD
Sbjct: 134 RPLPDLRPSGCRNLSFPDSLPEVSIVFIFVNEALSVLLRSIHSAMERTPPHLLKEIILVD 193
Query: 192 DYSDLYNLHHDVQEAVDKLNN 212
D S L + E VDK+N+
Sbjct: 194 DNSSNEELKEKLTEYVDKVNS 214
>UniRef50_Q16ZA7 Cluster: N-acetylgalactosaminyltransferase; n=7;
Culicidae|Rep: N-acetylgalactosaminyltransferase - Aedes
aegypti (Yellowfever mosquito)
Length = 648
Score = 202 bits (493), Expect = 2e-50
Identities = 125/347 (36%), Positives = 179/347 (51%), Gaps = 35/347 (10%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
+R+L+ + R GLIRA++ GA N+ ++ FLD+H+E VGWL PLL ++++ + +
Sbjct: 265 VRILRAASRLGLIRAKMLGAWNTTAQIITFLDAHVECEVGWLEPLLNQVARNPTAIAI-- 322
Query: 317 SARAVTPVIDVINADTFEYSP--SPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSP 374
P +D I+ DT P S L+ G F+W +F+W + M+P SP
Sbjct: 323 ------PSMDWIDGDTMTLDPQVSQLIYGKFDWMGNFQWGLRRDRRQPQAKHPMEPFDSP 376
Query: 375 TMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKR 434
M GGLFAI R F +G YD +G E+LE+SF+ WMCGGS++++PCSRV HV +
Sbjct: 377 VMPGGLFAINRTLFAHLGWYDEQFETYGAEHLELSFKTWMCGGSMQIVPCSRVAHVQKPN 436
Query: 435 RPY--GVGEKQDYMLQNSMRMARVWMDDYVKKVIEV-NPSAAHVEIGDISERKALRERLQ 491
PY +D + +N +RMA VWMD+Y E + GD+S RK LR+ L
Sbjct: 437 HPYITKTSGSEDVIKRNLVRMAEVWMDEYALYYYETFGGPDKRGDFGDVSSRKQLRQHLN 496
Query: 492 CKTFKWYLDNMWFET-DRSELV---------LGRTLCLD--ASNNVAPILGKCHEMGGTQ 539
CK+F+WYL+N++ E D S V G CLD + N + CH G Q
Sbjct: 497 CKSFRWYLENVFPEQFDPSRAVGRGEFRNGENGTDRCLDWPLARNQCGVT-SCHGRGRHQ 555
Query: 540 EWKHKGTASSPIYNTAAGMCLGVDRSYRGETVLMVICDDYSNNK-WD 585
W T I T CL Y G+T+ M C N+ W+
Sbjct: 556 MWYF--TREGEI--TRKDHCL----DYDGKTLEMNRCHQMGGNQLWE 594
Score = 83.4 bits (197), Expect = 2e-14
Identities = 40/89 (44%), Positives = 58/89 (65%), Gaps = 1/89 (1%)
Query: 109 RIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLC-QSQQYFDELPRASIIICFYNEHYET 167
++ ++GYN FN ++S I R LPD R+ C Q +Y LP SI+I FYNE +
Sbjct: 160 KLVEEGYNDQGFNQVLSDLISVRRRLPDYRDSWCKQPGRYLKNLPDTSIVIVFYNEAWSV 219
Query: 168 LMRSVHSIMDRTDQKHIKEIILVDDYSDL 196
L+R+VHSI+DR+ ++EI+LVDD+S L
Sbjct: 220 LVRTVHSILDRSPPNLVREIVLVDDFSFL 248
>UniRef50_UPI000069E576 Cluster: Polypeptide
N-acetylgalactosaminyltransferase 5 (EC 2.4.1.41)
(Protein-UDP acetylgalactosaminyltransferase 5) (UDP-
GalNAc:polypeptide N-acetylgalactosaminyltransferase 5)
(Polypeptide GalNAc transferase 5) (GalNAc-T5)
(pp-GaNTase 5).; n=1; Xenopus tropicalis|Rep:
Polypeptide N-acetylgalactosaminyltransferase 5 (EC
2.4.1.41) (Protein-UDP acetylgalactosaminyltransferase
5) (UDP- GalNAc:polypeptide
N-acetylgalactosaminyltransferase 5) (Polypeptide GalNAc
transferase 5) (GalNAc-T5) (pp-GaNTase 5). - Xenopus
tropicalis
Length = 307
Score = 201 bits (491), Expect = 4e-50
Identities = 113/254 (44%), Positives = 159/254 (62%), Gaps = 38/254 (14%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
+R+L +R GLIRAR+ GA+ + G+VL FLDSH+E NVGWL PLL++ VR
Sbjct: 72 VRVLHLPERHGLIRARIAGANIATGEVLTFLDSHVECNVGWLEPLLEQ---------VRI 122
Query: 317 SARAVT-PVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPT 375
+ R V PVI+VI+A + S S + + T ++ F + P
Sbjct: 123 NRRKVACPVIEVISA--LDLSVSRFIS---------------QATYFHNVCFC--FRCPV 163
Query: 376 MAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRR 435
MAGGLF+I + YF +G YDPG++VWGGEN+EISF+IWMCGG +E+IPCSRVGH+FR
Sbjct: 164 MAGGLFSIEKNYFYELGTYDPGLDVWGGENMEISFKIWMCGGEIEIIPCSRVGHIFRNDN 223
Query: 436 PYGVGEKQDYMLQ-NSMRMARVWMDDYVKKVIEVNPSAAHV-----EIGDISERKALRER 489
PY + + ++ N +R+A VW+DDY K++ + H+ IGD++E+K LRER
Sbjct: 224 PYSFPKDRIKTVERNLVRVAEVWLDDY-KEIFYGH--GQHLLKYLPNIGDLTEQKQLRER 280
Query: 490 LQCKTFKWYLDNMW 503
LQCK F WY+ N++
Sbjct: 281 LQCKNFNWYIKNVF 294
Score = 68.9 bits (161), Expect = 3e-10
Identities = 35/72 (48%), Positives = 47/72 (65%)
Query: 142 CQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHH 201
C +Q D+LP SIIICF +E + TLMRSV+S+++R+ + IKEIILVDD+S L
Sbjct: 1 CSNQLIHDDLPTTSIIICFIDEVWSTLMRSVYSVLNRSPEHLIKEIILVDDFSTRDYLKE 60
Query: 202 DVQEAVDKLNNV 213
+ V KL V
Sbjct: 61 KLDTYVKKLPKV 72
>UniRef50_UPI000069E1C8 Cluster: Polypeptide
N-acetylgalactosaminyltransferase-like protein 2 (EC
2.4.1.41) (Protein-UDP
acetylgalactosaminyltransferase-like protein 2)
(UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase- like protein 2)
(Polypeptide GalNAc transferase-like protein; n=1;
Xenopus tropicalis|Rep: Polypeptide
N-acetylgalactosaminyltransferase-like protein 2 (EC
2.4.1.41) (Protein-UDP
acetylgalactosaminyltransferase-like protein 2)
(UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase- like protein 2)
(Polypeptide GalNAc transferase-like protein - Xenopus
tropicalis
Length = 611
Score = 201 bits (491), Expect = 4e-50
Identities = 104/270 (38%), Positives = 162/270 (60%), Gaps = 14/270 (5%)
Query: 248 SEVKNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQ 307
SE + + ++L++++KR G+I R+ GA + G+VL+F+DSH E + GWL PLL R+
Sbjct: 221 SEYISRIGGVKLIRSNKRLGVIGGRMLGAARATGEVLIFMDSHCECHPGWLEPLLSRIMH 280
Query: 308 GVDGVKVRYSARAVTPVIDVINADTFEYSPSPLVRGG-FNWGLHFKWDNLPKGTLINDED 366
+ R V+PVID I+ TFEYS S L++ G F+W L F W LP+ +
Sbjct: 281 NRN--------RIVSPVIDFIDWKTFEYSHSSLLQQGVFDWKLDFHWVPLPEHEEKVRQS 332
Query: 367 FMKPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSR 426
+ P +SP + G + A R YF IG +D G+N WG E E+S R+W+CGGS+E++PCSR
Sbjct: 333 PIIPFRSPVIPGYVLASDRHYFQNIGGFDTGINSWGVETTELSIRVWLCGGSVEIVPCSR 392
Query: 427 VGHVFRKRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAA---HVEIGDISER 483
VGH ++ + + + +L++ +R A +WMD Y K + N + ++ DI+E
Sbjct: 393 VGHAYQNHTMHN-SVQNEAVLRSKVRTAELWMDSY-KAIFYRNVGNSLLNRIQESDINEH 450
Query: 484 KALRERLQCKTFKWYLDNMWFETDRSELVL 513
+ LR+RL CK F+W+L N++ E + S L
Sbjct: 451 EQLRQRLGCKRFQWFLANVYPEINMSTSTL 480
Score = 89.0 bits (211), Expect = 3e-16
Identities = 41/98 (41%), Positives = 64/98 (65%)
Query: 116 NLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSI 175
N + F+ +S+ I HR +PD R+ C Q Y ++LP AS+IICF+NE + TL+R+VHS+
Sbjct: 133 NTNGFDEEVSKNIPLHRIIPDGRHPECLQQNYGEKLPIASVIICFHNEGWSTLLRTVHSV 192
Query: 176 MDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNV 213
+D + + +KEIILVDD S +L + E + ++ V
Sbjct: 193 LDNSPRTFLKEIILVDDLSHQEHLKSALSEYISRIGGV 230
>UniRef50_Q9NY28 Cluster: Probable polypeptide
N-acetylgalactosaminyltransferase 8; n=9; Theria|Rep:
Probable polypeptide N-acetylgalactosaminyltransferase 8
- Homo sapiens (Human)
Length = 637
Score = 200 bits (489), Expect = 6e-50
Identities = 109/301 (36%), Positives = 171/301 (56%), Gaps = 25/301 (8%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
+++++ +R+GL +AR G + + DV+ LD+HIEVNVGW P+L R+ + D +
Sbjct: 247 LKIIRHPERKGLAQARNTGWEAATADVVAILDAHIEVNVGWAEPILARIQE--DRTVI-- 302
Query: 317 SARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTM 376
V+PV D I DTF+ L GFNW L ++D LP+ I+ D P+KSP++
Sbjct: 303 ----VSPVFDNIRFDTFKLDKYELAVDGFNWELWCRYDALPQAW-IDLHDVTAPVKSPSI 357
Query: 377 AGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRRP 436
G + A R + IG D GM ++GGEN+E+S R+W CGG +E++PCSR+ H+ R +P
Sbjct: 358 MG-ILAANRHFLGEIGSLDGGMLIYGGENVELSLRVWQCGGKVEILPCSRIAHLERHHKP 416
Query: 437 YGVGEKQDYMLQNSMRMARVWMDDYVKKVI---EVNPSAAHVEIGDISERKALRERLQCK 493
Y + + + +N++R+A +WMD++ V + + ++ GD+S R ALRE+L+CK
Sbjct: 417 YAL-DLTAALKRNALRVAEIWMDEHKHMVYLAWNIPLQNSGIDFGDVSSRMALREKLKCK 475
Query: 494 TFKWYLDNMW-----FET----DRSELVLGRTLCLDAS--NNVAPILGKCHEMGGTQEWK 542
TF WYL N++ T R + +L +CLD PI+ CHE +
Sbjct: 476 TFDWYLKNVYPLLKPLHTIVGYGRMKNLLDENVCLDQGPVPGNTPIMYYCHEFSSQNVYY 535
Query: 543 H 543
H
Sbjct: 536 H 536
Score = 70.9 bits (166), Expect = 9e-11
Identities = 43/162 (26%), Positives = 79/162 (48%), Gaps = 2/162 (1%)
Query: 50 EETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLR 109
+E ++ T+ + +D R + + E K+ +M + L Q+G + +
Sbjct: 81 QENVNSTLKRAKDEVRPLLKAMETKVNE--TKKHKTQMKLFPHSQLFRQWGEDLSEAQQK 138
Query: 110 IRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLM 169
+ +N +S ++ +R +PDTR+ C + Y +LP S+I+ F NE +
Sbjct: 139 AAQDLFRKFGYNAYLSNQLPLNRTIPDTRDYRCLRKTYPSQLPSLSVILIFVNEALSIIQ 198
Query: 170 RSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLN 211
R++ SI++RT + +KEIILVDD+S L + E + N
Sbjct: 199 RAITSIINRTPSRLLKEIILVDDFSSNGELKVHLDEKIKLYN 240
>UniRef50_Q8IA42 Cluster: N-acetylgalactosaminyltransferase 4; n=2;
Sophophora|Rep: N-acetylgalactosaminyltransferase 4 -
Drosophila melanogaster (Fruit fly)
Length = 659
Score = 197 bits (480), Expect = 8e-49
Identities = 99/248 (39%), Positives = 155/248 (62%), Gaps = 17/248 (6%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
+ +++ +R+GLI AR+ GA +VG V+VF DSHIEVN WLPPL++ ++ + +
Sbjct: 256 VTIVRNPERQGLIGARIAGAKVAVGQVMVFFDSHIEVNYNWLPPLIEPIA-----INPKI 310
Query: 317 SARAVTPVIDVINADTFEYSPSPL--VRGGFNWGLHFKW-DNLPKGTLINDEDFMKPLKS 373
S P++D I+ + F Y RGGF+W + +K LP+ L D P +S
Sbjct: 311 ST---CPMVDTISHEDFSYFSGNKDGARGGFDWKMLYKQLPVLPEDAL----DKSMPYRS 363
Query: 374 PTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFR- 432
P M GGLFAI ++F +G YD +++WGGE E+SF+IWMCGG L +PCSRV H+FR
Sbjct: 364 PVMMGGLFAINTDFFWDLGGYDDQLDIWGGEQYELSFKIWMCGGMLLDVPCSRVAHIFRG 423
Query: 433 KRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSA-AHVEIGDISERKALRERLQ 491
+P G +++ +N R+A VWMD+Y + V + +P +++ GD++ ++ +RERL+
Sbjct: 424 PMKPRGNPRGHNFVAKNHKRVAEVWMDEYKQYVYKRDPKTYDNLDAGDLTRQRGVRERLK 483
Query: 492 CKTFKWYL 499
CK+F W++
Sbjct: 484 CKSFHWFM 491
Score = 92.7 bits (220), Expect = 2e-17
Identities = 38/94 (40%), Positives = 67/94 (71%)
Query: 102 IRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFY 161
I N ++ ++ + Y ++ FN LIS RI +R +PD R + C++++Y +LP S+I F+
Sbjct: 143 IENPDEKQLEKEHYEMNGFNGLISDRISVNRSVPDLRLEACKTRKYLAKLPNISVIFIFF 202
Query: 162 NEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSD 195
NEH+ TL+RS++S+++RT + +K+I+LVDD S+
Sbjct: 203 NEHFNTLLRSIYSVINRTPPELLKQIVLVDDGSE 236
>UniRef50_Q86SF2 Cluster: N-acetylgalactosaminyltransferase 7; n=31;
Euteleostomi|Rep: N-acetylgalactosaminyltransferase 7 -
Homo sapiens (Human)
Length = 657
Score = 196 bits (477), Expect = 2e-48
Identities = 113/309 (36%), Positives = 165/309 (53%), Gaps = 34/309 (11%)
Query: 257 IRLLKTSKREGLIRARLYGADNS-VGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVR 315
+++ + +REGLI+AR GA + +G VL++LD+H EV V W PL+ +S+
Sbjct: 269 VKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAPLVAPISKD------- 321
Query: 316 YSARAVTPVIDVINADTFEYSPSP------LVRGGFNWGLHFKWDNLPKGTLINDEDFMK 369
P+IDVIN +T+E P RG ++W + +K L + +
Sbjct: 322 -RTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWKRVPLTPQEKRLRKTKTE 380
Query: 370 PLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGH 429
P +SP MAGGLFAI RE+F +G YDPG+ +WGGEN EIS++IW CGG L +PCSRVGH
Sbjct: 381 PYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFVPCSRVGH 440
Query: 430 VFR----KRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKA 485
++R + P + L+N +R+ VW D+Y P + + GDISE K
Sbjct: 441 IYRLEGWQGNPPPIYVGSSPTLKNYVRVVEVWWDEYKDYFYASRPESQALPYGDISELKK 500
Query: 486 LRERLQCKTFKWYLDNMWFE-----------TDRSELVLGRT-LCLDA---SNNVAPILG 530
RE CK+FKW+++ + ++ D E+ T C+D+ +N LG
Sbjct: 501 FREDHNCKSFKWFMEEIAYDITSHYPLPPKNVDWGEIRGFETAYCIDSMGKTNGGFVELG 560
Query: 531 KCHEMGGTQ 539
CH MGG Q
Sbjct: 561 PCHRMGGNQ 569
Score = 77.4 bits (182), Expect = 1e-12
Identities = 35/96 (36%), Positives = 60/96 (62%)
Query: 120 FNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRT 179
FN + S I R + D R + C+ Y + L +S++I F+NE + TLMR+VHS++ RT
Sbjct: 175 FNMVASDMISLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRT 234
Query: 180 DQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIK 215
+K++ EI+L+DD+S+ +L + E + N ++K
Sbjct: 235 PRKYLAEIVLIDDFSNKEHLKEKLDEYIKLWNGLVK 270
>UniRef50_Q9VUT6 Cluster: Polypeptide
N-acetylgalactosaminyltransferase 8; n=1; Drosophila
melanogaster|Rep: Polypeptide
N-acetylgalactosaminyltransferase 8 - Drosophila
melanogaster (Fruit fly)
Length = 590
Score = 194 bits (474), Expect = 4e-48
Identities = 102/273 (37%), Positives = 156/273 (57%), Gaps = 27/273 (9%)
Query: 244 STENSEVKNNVFNIRLL------KTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGW 297
ST+ E N+ I+ L + + + GL+ AR+ GA+ ++ DVLVFLDSH+EV GW
Sbjct: 170 STQADEKLNDFIKIKFLNMVQHRRITTQVGLMHARVVGAELALADVLVFLDSHVEVTKGW 229
Query: 298 LPPLLKRLSQGVDGVKVRYSARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLP 357
L PL+ + + + TP+ID I+ D F Y RG FNW F + LP
Sbjct: 230 LEPLIAPILED--------NRTCTTPIIDTIDFDNFAYRRGKPSRGFFNW--EFNYIQLP 279
Query: 358 KGTLINDEDFMKPL--KSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMC 415
L+ +E P K+P M GGLFAI RE+F+ +G YD G+ +WG E E+S ++W+C
Sbjct: 280 ---LLKEEAVAMPAPHKNPIMNGGLFAIGREWFSELGGYDKGLKIWGAEQFELSLKLWLC 336
Query: 416 GGSLELIPCSRVGHVFR------KRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVN 469
GG + +PCSRVGH+FR + ++ + +N R+A +W+D+Y K+
Sbjct: 337 GGQILEVPCSRVGHLFRDGNFQIRYTNKDKNSEKKLISRNYRRVAEIWLDEYKDKLFANM 396
Query: 470 PSAAHVEIGDISERKALRERLQCKTFKWYLDNM 502
P + +G+++E++ L+ RL CK FKW+LDN+
Sbjct: 397 PHLTVIPVGNLAEQRDLKNRLHCKPFKWFLDNL 429
Score = 62.5 bits (145), Expect = 3e-08
Identities = 37/111 (33%), Positives = 61/111 (54%), Gaps = 10/111 (9%)
Query: 106 EDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQY-FDELPRASIIICFYNEH 164
E L K +N +S+RI R L D R++ C+ +Y ++LP S++I ++NE
Sbjct: 81 EQLEAIAKSQRETGYNAWLSKRISPERSLYDMRHRSCKKLKYPMEKLPSVSVVITYHNEE 140
Query: 165 YETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIK 215
L+R++ S+ RT + ++E+ILVDD S +A +KLN+ IK
Sbjct: 141 ASVLLRTLSSLRSRTPIQLLREVILVDDGS---------TQADEKLNDFIK 182
>UniRef50_A0NGH9 Cluster: ENSANGP00000031751; n=1; Anopheles gambiae
str. PEST|Rep: ENSANGP00000031751 - Anopheles gambiae
str. PEST
Length = 499
Score = 194 bits (473), Expect = 5e-48
Identities = 96/252 (38%), Positives = 149/252 (59%), Gaps = 13/252 (5%)
Query: 258 RLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRYS 317
R+++ KR GLIRAR+ G ++ D++ FLD+H+EV VGWL L++ + + + +
Sbjct: 126 RIVRAPKRLGLIRARMLGGKSTKTDLITFLDAHVEVTVGWLEALIQPVVESWTTIAI--- 182
Query: 318 ARAVTPVIDVINADTFEY--SPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPT 375
P ID I+ + +Y +P G ++W L+F W N M+P +P
Sbjct: 183 -----PTIDWIDENNMKYRDDKAPTFVGAYDWDLNFGWWGRWSQKKQNANK-MEPFDTPA 236
Query: 376 MAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRR 435
MAGGLFAI R +F IG YD G +++G EN+E+S + WMCGG + +PCSRVGH+ +
Sbjct: 237 MAGGLFAINRTFFERIGWYDDGFDIYGIENIELSVKSWMCGGKMVTVPCSRVGHIQKTGH 296
Query: 436 PYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVN--PSAAHVEIGDISERKALRERLQCK 493
PY + +D + NS+R+A VWMD+Y + + ++ P E G ++ RKA+RE +CK
Sbjct: 297 PYLYKQPKDVVRANSIRLAEVWMDEYKRIIFDIYGIPHYLEEEFGSVATRKAIRESAKCK 356
Query: 494 TFKWYLDNMWFE 505
F +YL+N + E
Sbjct: 357 PFSYYLENAFPE 368
Score = 68.1 bits (159), Expect = 6e-10
Identities = 32/88 (36%), Positives = 51/88 (57%), Gaps = 1/88 (1%)
Query: 110 IRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQ-QYFDELPRASIIICFYNEHYETL 168
+ +G FN S + R LP+ R+ C ++ +LP SI+I F+NE + +
Sbjct: 19 LTQQGIQTQGFNQYFSDLMSVRRRLPEIRDPWCAKPGRFLADLPATSIVIVFFNEAWSVV 78
Query: 169 MRSVHSIMDRTDQKHIKEIILVDDYSDL 196
+R+VHS++DR+ +KEI+LVDD S L
Sbjct: 79 LRTVHSVLDRSPAHLVKEIVLVDDCSTL 106
Score = 35.5 bits (78), Expect = 4.0
Identities = 25/88 (28%), Positives = 39/88 (44%), Gaps = 5/88 (5%)
Query: 500 DNMWFETDRSELVLGRTLCLDASNNVAPILGKCHEMGGTQEWKHKGTASSPIYNTAAGMC 559
D W EL R C+DA V + +CH G Q WK S I + A +C
Sbjct: 413 DQYWTHNYYQELNSYRN-CIDAVGTVVEVY-QCHRSRGNQAWKVL-VESQQILSVARNLC 469
Query: 560 LGVDRSYRGETVLMVICD-DYSNNKWDI 586
L ++ + T+L+ CD + +W++
Sbjct: 470 LALNLQTK-TTLLLEKCDATKPSQQWNV 496
>UniRef50_UPI000065D57A Cluster: Putative polypeptide
N-acetylgalactosaminyltransferase-like protein 3 (EC
2.4.1.41) (Protein-UDP
acetylgalactosaminyltransferase-like protein 3)
(UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase- like protein 3)
(Polypeptide GalNAc transferase-lik; n=1; Takifugu
rubripes|Rep: Putative polypeptide
N-acetylgalactosaminyltransferase-like protein 3 (EC
2.4.1.41) (Protein-UDP
acetylgalactosaminyltransferase-like protein 3)
(UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase- like protein 3)
(Polypeptide GalNAc transferase-lik - Takifugu rubripes
Length = 605
Score = 190 bits (463), Expect = 9e-47
Identities = 103/258 (39%), Positives = 153/258 (59%), Gaps = 16/258 (6%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
+++++ KREGLIRAR+ G + +V F D+H+E W P+L R +K Y
Sbjct: 192 VKIVRNQKREGLIRARIEGWKVASAEVTGFFDAHVEFTPSWAEPVLAR-------IKEDY 244
Query: 317 SARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLK---S 373
R + P ID I DTFE G+NW L + N PK ++ D P++ +
Sbjct: 245 K-RIILPSIDNIKHDTFEVERYENSGHGYNWELWCMYINPPKQWW-DEGDASAPIRHDPT 302
Query: 374 PTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRK 433
P M G F R+YF +G D GM+V+GGEN+E+ R+W+CGGS+E++PCSRV H+ R
Sbjct: 303 PAMIGCSFVANRDYFGELGLLDSGMDVYGGENIELGIRVWLCGGSMEVLPCSRVAHIARV 362
Query: 434 RRPYGVGEKQDYMLQNSMRMARVWMDDYVKKV-IEVN-PSAAH-VEIGDISERKALRERL 490
++PY + +N++R+A VWMD+Y V + N P H ++ GDIS+R ALR+ L
Sbjct: 363 KKPYH-SNIAYHTRRNALRVAEVWMDEYRSNVYLAWNIPMENHGIDYGDISQRVALRKSL 421
Query: 491 QCKTFKWYLDNMWFETDR 508
QCK+F+WYL+N++ E R
Sbjct: 422 QCKSFEWYLENVYPEMRR 439
Score = 64.5 bits (150), Expect = 8e-09
Identities = 32/78 (41%), Positives = 46/78 (58%)
Query: 118 HAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMD 177
+ +N +S RI R +PD R C+ Y +LP+ S+I F NE ++RSVHS ++
Sbjct: 16 YGYNAYLSDRISLDRTIPDHRPGKCRKVGYPRDLPQISLIFIFVNEALSVILRSVHSAVN 75
Query: 178 RTDQKHIKEIILVDDYSD 195
T +KEIILV+D SD
Sbjct: 76 HTPAHLLKEIILVNDNSD 93
>UniRef50_Q8MYY6 Cluster: Putative polypeptide
N-acetylgalactosaminyltransferase 13; n=1; Drosophila
melanogaster|Rep: Putative polypeptide
N-acetylgalactosaminyltransferase 13 - Drosophila
melanogaster (Fruit fly)
Length = 558
Score = 189 bits (460), Expect = 2e-46
Identities = 107/324 (33%), Positives = 172/324 (53%), Gaps = 32/324 (9%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
+ L+ +R GLI +R GA + G ++FLDSH EVN GWL PLL+RL+ +
Sbjct: 178 LTFLRNQERMGLIWSRNRGASLASGRYVLFLDSHCEVNEGWLEPLLERLALNTN------ 231
Query: 317 SARAVTPVIDVINADTFEYSP-SPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPT 375
AV+P++D I+ T Y + L++GGF+W LHF W K L N E P +SP
Sbjct: 232 --LAVSPLLDPIDPTTLSYRKGNELLKGGFDWSLHFHW---LKRQLTNQESLEMPYQSPA 286
Query: 376 MAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRR 435
AGG+ + RE+F +G ++P + +WGGE++E++ ++W+CGG +E++PCSR+GH+FR+R
Sbjct: 287 FAGGVLMMSREWFLKLGSFNPYLKIWGGESIELAIKLWLCGGQIEIVPCSRIGHIFRRRH 346
Query: 436 PYG--------VGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIG-DISERKAL 486
+ + Q+ L NS +A W+D+Y + P+A + + E + +
Sbjct: 347 AFDFPPQSDRQLSPAQETYLHNSKIIAESWLDEYKNMFYALRPAARRIPLDHTYDELQRM 406
Query: 487 RERLQCKTFKWYLDN------MWF-ETDRSELVLGRTLCLDA-SNNVAPILGKCHEMGGT 538
R+ +C F+WYL + M F E + + C+ A + PIL C+ +
Sbjct: 407 RKERRCHPFEWYLRHVSPELRMHFDELSATGTLRNEDRCVHARQKDSQPILASCY-LSDI 465
Query: 539 QEWKHKGTASSPIYNTAAGMCLGV 562
+W S +T +CL V
Sbjct: 466 TQWSM--LRQSGQLSTHRELCLAV 487
Score = 50.4 bits (115), Expect = 1e-04
Identities = 26/82 (31%), Positives = 47/82 (57%), Gaps = 3/82 (3%)
Query: 116 NLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELP---RASIIICFYNEHYETLMRSV 172
+ + +N +S +G R LP TR+ C ++ P S++I F+NE L+R++
Sbjct: 71 DFYQYNIHLSNALGLIRKLPVTRHHSCTTRNSILPAPLEANVSVVISFHNEARSMLLRTI 130
Query: 173 HSIMDRTDQKHIKEIILVDDYS 194
S++ R+ + ++ E+ILVDD S
Sbjct: 131 VSLLSRSPEDYLHELILVDDGS 152
>UniRef50_Q5CKF0 Cluster: UDP-N-acetyl-D-galactosamine:polypeptide
N- acetylgalactosaminyltransferase T3; n=4;
Eimeriorina|Rep:
UDP-N-acetyl-D-galactosamine:polypeptide N-
acetylgalactosaminyltransferase T3 - Cryptosporidium
hominis
Length = 732
Score = 187 bits (455), Expect = 8e-46
Identities = 101/255 (39%), Positives = 148/255 (58%), Gaps = 14/255 (5%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
+R++ ++R+G++ ARL G + V+V LDSHIE + WL P L RL + V
Sbjct: 345 VRVVHLTERKGIVGARLSGVRAASAPVIVILDSHIETSRQWLEPQLLRLKESPKSV---- 400
Query: 317 SARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTM 376
V P ID I+ F +S G L FK+ L + TL + P+KSP M
Sbjct: 401 ----VMPQIDSIDPVNFAFSNF----SGIGCRLGFKYSILEQATLTGPINDTTPIKSPMM 452
Query: 377 AGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRRP 436
AGGLFA+ R+YF +G YD WG EN+EISFRIWMCGG +E PCSRV H+FRK +
Sbjct: 453 AGGLFAMKRDYFWHLGGYDEKFRHWGAENVEISFRIWMCGGQIECTPCSRVFHIFRK-KG 511
Query: 437 YGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCKTFK 496
G + + N +R ARVWMD++ ++ E+ ++++G + L+++L+CK F+
Sbjct: 512 VGYSSPPESLWHNRLRTARVWMDEFY-QITEMLAPNPNIKLGSFDDMLHLKKKLKCKPFR 570
Query: 497 WYLDNMWFETDRSEL 511
W+LDN+ ET ++L
Sbjct: 571 WFLDNVAPETYITQL 585
Score = 63.3 bits (147), Expect = 2e-08
Identities = 37/97 (38%), Positives = 56/97 (57%), Gaps = 6/97 (6%)
Query: 104 NSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQY-FDELPRASIIICFYN 162
N ED + G+N + L R+ + R+ +C++ Y +L ASIII FYN
Sbjct: 238 NEEDFLAKGGGFNRQLSDFLSLDRVP-----LEVRDPICRNMIYPIKDLDDASIIITFYN 292
Query: 163 EHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNL 199
E TL+RSVHS+++ T ++EIILV+D SD+ +L
Sbjct: 293 EPLSTLLRSVHSVLNNTPPPLLREIILVNDGSDMIDL 329
>UniRef50_Q8K1B9 Cluster: Putative polypeptide
N-acetylgalactosaminyltransferase-like protein 4; n=9;
Euteleostomi|Rep: Putative polypeptide
N-acetylgalactosaminyltransferase-like protein 4 - Mus
musculus (Mouse)
Length = 622
Score = 186 bits (454), Expect = 1e-45
Identities = 110/308 (35%), Positives = 170/308 (55%), Gaps = 40/308 (12%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
I++++ SK+EGLIR+R+ G + V+ D+H+E NVGW P+L R+ +
Sbjct: 220 IKVVRHSKQEGLIRSRVSGWRAATAPVVALFDAHVEFNVGWAEPVLTRIKEN-------- 271
Query: 317 SARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTM 376
R ++P D I D FE PL GF+W L ++ N PK E+ P++SP +
Sbjct: 272 RKRIISPSFDNIKYDNFEIEEYPLAAQGFDWELWCRYLNPPKAWW-KLENSTAPIRSPAL 330
Query: 377 AGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRI---------------WMCGGSLEL 421
G F + R+YF IG D GM V+GGEN+E+ R+ W CGGS+E+
Sbjct: 331 IG-CFIVDRQYFEEIGLLDEGMEVYGGENVELGIRVSEISHTGLSSAPMMVWQCGGSVEV 389
Query: 422 IPCSRVGHVFRKRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVI---EVNPSAAHVEIG 478
+PCSR+ H+ R +PY + ++ +N++R+A VWMD++ V + + ++IG
Sbjct: 390 LPCSRIAHIERAHKPY-TEDLTAHVRRNALRVAEVWMDEFKSHVYMAWNIPQEDSGIDIG 448
Query: 479 DISERKALRERLQCKTFKWYLDNMWFETDR-SELV--------LGRTLCLDASNNV--AP 527
DI+ RKALR++LQCKTF+WYL +++ E S+++ L LCLD + P
Sbjct: 449 DITARKALRKQLQCKTFRWYLVSVYPEMRMYSDIIAYGVLQNSLKTDLCLDQGPDTENVP 508
Query: 528 ILGKCHEM 535
I+ CH M
Sbjct: 509 IVYICHGM 516
Score = 90.2 bits (214), Expect = 1e-16
Identities = 50/140 (35%), Positives = 74/140 (52%)
Query: 72 KVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDH 131
K ++E AK + + T++ L +G + E R+ K + + +N +S R+
Sbjct: 74 KQHIQEAPAKPEEAEAEPFTDSSLFAHWGQELSPEGRRVALKQFQYYGYNAYLSDRLPLD 133
Query: 132 RDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVD 191
R LPD R C++ + D LP SI+ F NE L+RS+HS M+RT +KEIILVD
Sbjct: 134 RPLPDLRPSGCRNLSFPDSLPEVSIVFIFVNEALSVLLRSIHSAMERTPSHLLKEIILVD 193
Query: 192 DYSDLYNLHHDVQEAVDKLN 211
D S L + E VDK+N
Sbjct: 194 DNSSNEELKEKLTEYVDKVN 213
>UniRef50_Q8MM26 Cluster: UDP-N-acetyl-D-galactosamine:polypeptide
N- acetylgalactosaminyltransferase T1; n=1; Toxoplasma
gondii|Rep: UDP-N-acetyl-D-galactosamine:polypeptide N-
acetylgalactosaminyltransferase T1 - Toxoplasma gondii
Length = 751
Score = 180 bits (437), Expect = 1e-43
Identities = 139/465 (29%), Positives = 215/465 (46%), Gaps = 47/465 (10%)
Query: 106 EDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQY-FDELPRASIIICFYNEH 164
E R+ KGY FNT +S + R +P+ + C+ Q+ FD + S + +
Sbjct: 196 EQKRLAHKGY---CFNTKVSDSLSLDRSVPEFASNYCRDQRLLFDNMTPPSAEQKW--KQ 250
Query: 165 YETLMRSVHSIMDRTDQKHIKEII-------LVDDYSDLYNLHHDVQEAVDKLNNVIKKE 217
L ++ + TD K ++ L D + + ++ + +++V+ +
Sbjct: 251 TRELSKAQSPQLSATDGKASSAVVPRATDGSLPDTSVVIVFYNENLSVLLRSIHSVLNRT 310
Query: 218 EEMIETNNIDMEXXXXXXXXXXXXKKSTENSEVKNNVFNIRLLKTSKREGLIRARLYGAD 277
+ I ++ K+ + + + RLL+ +R GL+ AR GA
Sbjct: 311 PPSLLKEIIVVDDFSDRQTHPWLGKQLEDY--ISGTLPKTRLLRLLQRRGLMGARAAGAA 368
Query: 278 NSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRYSARAVTPVIDVINADTFEYSP 337
+ + + FLDSHIE WL PLL + Q R P+I I+AD F
Sbjct: 369 AASAETVTFLDSHIECLPYWLQPLLFHVKQDW--------RRIAMPLIPTIDADNFRIKD 420
Query: 338 SPLVRGGFNWGL-HFKWDNLPKGTL--INDEDFMK----PLKSPTMAGGLFAIYREYFNA 390
L F WG+ H+ + + + + ++ K P SP MAGGLF I + +++
Sbjct: 421 GGLKTLAFTWGMSHYHIHDKIRHRIEELGQDEAAKNPDAPTMSPIMAGGLFTITKAWWDT 480
Query: 391 IGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRRPYGVGEKQD----YM 446
+G YD M ++GGE EISF+ WMCGGSL L+PCSRVGHVFR + G+ +
Sbjct: 481 LGGYDKEMQIYGGEEFEISFKTWMCGGSLHLVPCSRVGHVFRSNE-FWQGQVYTVPGALI 539
Query: 447 LQNSMRMARVWMDDYVKKVIEVNPSAAHVE-IGDISERKALRERLQCKTFKWYLDNMWFE 505
+N +R A VWM +Y + V V P + +GD++E KALR+RL+CK F WYL N++ E
Sbjct: 540 HRNKLRTAHVWMGEYARIVELVIPRLPQDKPLGDLTELKALRDRLKCKDFNWYLKNIYPE 599
Query: 506 TDRSELVLGRT---------LCLDASNNVAPILG--KCHEMGGTQ 539
+ L T CLD +G CH GTQ
Sbjct: 600 LEPPNLAHAMTGAMRNPKFNCCLDTLTTKNQEIGVYPCHFEHGTQ 644
>UniRef50_UPI0000E45D84 Cluster: PREDICTED: hypothetical protein;
n=3; Strongylocentrotus purpuratus|Rep: PREDICTED:
hypothetical protein - Strongylocentrotus purpuratus
Length = 662
Score = 177 bits (431), Expect = 7e-43
Identities = 86/179 (48%), Positives = 118/179 (65%), Gaps = 11/179 (6%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
++L++ S+REGLIR R+ GA +S GDVL++LD+H EV V WLPPLL ++
Sbjct: 468 LKLVRNSRREGLIRTRIIGARHSTGDVLLWLDAHCEVGVNWLPPLLTPIAVN-------- 519
Query: 317 SARAVTPVIDVINADTFEYSPSPLV---RGGFNWGLHFKWDNLPKGTLINDEDFMKPLKS 373
AV P+IDVI+ + P RGGF+W L++K +P+ + +P +S
Sbjct: 520 RTTAVCPIIDVIDNMDYRVYPQGTGDQDRGGFDWSLYWKHLPVPQFEKSRRQHASEPYRS 579
Query: 374 PTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFR 432
P MAGGLFA+ R+YF +G YD G+ +WGGEN E+SF+IWMCGGSL +PCSRVGHV+R
Sbjct: 580 PAMAGGLFAMDRKYFFELGAYDEGLEIWGGENFELSFKIWMCGGSLLWVPCSRVGHVYR 638
Score = 75.4 bits (177), Expect = 4e-12
Identities = 37/105 (35%), Positives = 65/105 (61%), Gaps = 1/105 (0%)
Query: 105 SEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEH 164
SE ++ D+ + FN +S +I R++ D R++ C+ Y + LP S+II F+NE
Sbjct: 358 SEKAKV-DRLIQEYGFNQYVSDQISLDRNIADLRSQQCKHWHYPETLPTTSVIIVFHNEG 416
Query: 165 YETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDK 209
+ TL+R+VHS+ +R+ + + EIILVDD+S +L +++ V +
Sbjct: 417 WSTLLRTVHSVFNRSPSQLLHEIILVDDFSTKEHLKERLEDYVQE 461
>UniRef50_Q5CY08 Cluster: Extracellular protein with a signal
peptide followed by family 2 glycosyltransferase and
ricin domains; n=3; Cryptosporidium|Rep: Extracellular
protein with a signal peptide followed by family 2
glycosyltransferase and ricin domains - Cryptosporidium
parvum Iowa II
Length = 637
Score = 177 bits (431), Expect = 7e-43
Identities = 101/257 (39%), Positives = 139/257 (54%), Gaps = 29/257 (11%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
+RL++ +KR G++ ARL G + + V LDSHIEV W P++KR+ +
Sbjct: 239 VRLIRNAKRSGIVGARLAGINACKSPIFVILDSHIEVQPVWAEPIVKRIQED-------- 290
Query: 317 SARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNL-----------PKGTLINDE 365
R V P ID I+++TFE+ V GG L F W + P+ +
Sbjct: 291 PRRIVMPQIDSIDSETFEF-----VNGGIGCTLGFLWKLIEHAFPQQISPDPRRRYAKNY 345
Query: 366 DFMKPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCS 425
D++ SPTMAGGL A +F IG YDP WG ENLE+SFR+WMCGG +E PCS
Sbjct: 346 DYVS---SPTMAGGLLAANVAFFKQIGSYDPQFEYWGTENLELSFRVWMCGGFIECAPCS 402
Query: 426 RVGHVFRKRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKA 485
RV HVFRK G +L+N +R +WMD++ V V+ G + ER
Sbjct: 403 RVFHVFRK-GGVGYSSPSHAVLKNKLRTLYLWMDEFGDLAWRV-MGRPRVDTGPLDERIK 460
Query: 486 LRERLQCKTFKWYLDNM 502
LRERL+C +FKW+L+N+
Sbjct: 461 LRERLRCNSFKWFLENV 477
Score = 62.9 bits (146), Expect = 2e-08
Identities = 33/77 (42%), Positives = 48/77 (62%), Gaps = 1/77 (1%)
Query: 120 FNTLISQRIGDHRDLPDTRNKLCQSQQY-FDELPRASIIICFYNEHYETLMRSVHSIMDR 178
FN +S + R++ D R+ C+ Y ++ S+II FYNE + TLMRSVHS+++R
Sbjct: 141 FNLNLSDSLPLDRNVSDYRDLQCKLISYDISKMDTISVIIVFYNEPFSTLMRSVHSVLNR 200
Query: 179 TDQKHIKEIILVDDYSD 195
T + EIILVDD S+
Sbjct: 201 TPPSLLDEIILVDDGSN 217
>UniRef50_UPI0000D9AA48 Cluster: PREDICTED: similar to
UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 5; n=2; Macaca
mulatta|Rep: PREDICTED: similar to
UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 5 - Macaca
mulatta
Length = 442
Score = 172 bits (419), Expect = 2e-41
Identities = 102/253 (40%), Positives = 139/253 (54%), Gaps = 42/253 (16%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
I++++ KREGLIRARL GA ++ GDVLVFLDSH EVN WL PLL +++ V
Sbjct: 228 IKIIRNKKREGLIRARLIGASHASGDVLVFLDSHCEVNRVWLEPLLHAIAKDPKMV---- 283
Query: 317 SARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTM 376
V P+IDVI+ T EY PSP+VRG F+W L FKWDN+ + E KP+
Sbjct: 284 ----VCPLIDVIDDRTLEYKPSPVVRGAFDWNLQFKWDNVFSYEMDGPEGPTKPI----- 334
Query: 377 AGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRRP 436
+ D GM +W MCGG L +IPCSRVGH+ K++
Sbjct: 335 ----------------RVDCGMRIW------------MCGGQLFIIPCSRVGHI-SKKQT 365
Query: 437 YGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCKTFK 496
+ N +R+ VW+D+Y ++ P +V G+I ER LR+RL CK+F+
Sbjct: 366 RKTSAIISATIHNYLRLVHVWLDEYKEQFFLRKPGLKYVTYGNIHERVQLRKRLGCKSFQ 425
Query: 497 WYLDNMWFETDRS 509
WYLDN++ E + S
Sbjct: 426 WYLDNVFPELEAS 438
Score = 92.3 bits (219), Expect = 3e-17
Identities = 47/128 (36%), Positives = 76/128 (59%), Gaps = 2/128 (1%)
Query: 89 KKTEND-LEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQY 147
K+T+ D E G+ N + + ++ + FN +IS+ +G R++PDTRNK+C + Y
Sbjct: 103 KRTDEDKAESTLGMDFNHTNPELHNELLK-YGFNVIISRSLGIEREVPDTRNKMCLQKHY 161
Query: 148 FDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAV 207
LP ASI+ICF+NE + L R+V S+M+ T ++EIILVDD S++ +L + +
Sbjct: 162 PARLPTASIVICFHNEEFHALFRTVSSVMNLTPHYFLEEIILVDDMSEVDDLKEKLDYHL 221
Query: 208 DKLNNVIK 215
+ IK
Sbjct: 222 ETFRGKIK 229
>UniRef50_UPI000065D031 Cluster: Probable polypeptide
N-acetylgalactosaminyltransferase 8 (EC 2.4.1.41)
(Protein-UDP acetylgalactosaminyltransferase 8) (UDP-
GalNAc:polypeptide N-acetylgalactosaminyltransferase 8)
(Polypeptide GalNAc transferase 8) (GalNAc-T8)
(pp-GaNTase 8).; n=1; Takifugu rubripes|Rep: Probable
polypeptide N-acetylgalactosaminyltransferase 8 (EC
2.4.1.41) (Protein-UDP acetylgalactosaminyltransferase
8) (UDP- GalNAc:polypeptide
N-acetylgalactosaminyltransferase 8) (Polypeptide GalNAc
transferase 8) (GalNAc-T8) (pp-GaNTase 8). - Takifugu
rubripes
Length = 565
Score = 171 bits (417), Expect = 3e-41
Identities = 101/276 (36%), Positives = 159/276 (57%), Gaps = 30/276 (10%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
+R ++ +++ GL +ARL G +VGDV+ LD+HIEV+V W PLL R+ + D +
Sbjct: 129 VRKVRHAEQLGLTQARLSGWKAAVGDVVAILDAHIEVHVQWAEPLLARIKE--DRTVI-- 184
Query: 317 SARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNL-PKGTLINDEDFMKPLKSPT 375
+TPV D + D F+W L +++ P+ + D+ P KSP+
Sbjct: 185 ----LTPVFDNVKYDDLTVLHYQPAADAFDWALWCMYESFRPEWYDLKDDSL--PGKSPS 238
Query: 376 MAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRR 435
+ G + A R++F IG D GM ++GGEN+E+ R W CGGS+E+IPCS++ H+ R +
Sbjct: 239 IMGIVVA-ERKFFGEIGSLDGGMKIYGGENVELGIRAWSCGGSIEVIPCSKIAHIERAMK 297
Query: 436 PYGVGEKQDYMLQNSMRMARVWMDDYVKKV-----IEVNPSA------AH------VEIG 478
PY + + M +N++R+A VWMD+Y V + + SA AH ++IG
Sbjct: 298 PY-LPDLSVTMKRNALRVAEVWMDEYKSNVNVAWNLPLVASASKMWLSAHFRANHGIDIG 356
Query: 479 DISERKALRERLQCKTFKWYLDNMWFETDRSELVLG 514
D+SERK LR+RL CK F WYL+N++ + D + ++G
Sbjct: 357 DVSERKKLRKRLNCKPFSWYLENIYPQLDPLDNLVG 392
Score = 72.1 bits (169), Expect = 4e-11
Identities = 30/108 (27%), Positives = 65/108 (60%)
Query: 104 NSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNE 163
+ E+ + ++ + + +N +S R+ +R++PDTR C ++Y +ELP S+++ + +E
Sbjct: 15 SEEEQKEAERLFQQYGYNAFLSDRLPLNREIPDTRPTRCAEKKYPEELPNISVVLIYLDE 74
Query: 164 HYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLN 211
+ R++ S++D+T + + EIILVDD+S +L + E + ++
Sbjct: 75 ALSVIKRAIRSLIDKTPARLLTEIILVDDHSSNEDLGKKLDEYIGSIH 122
>UniRef50_UPI0000586DC6 Cluster: PREDICTED: similar to polypeptide
N-acetylgalactosaminyltransferase 10; n=1;
Strongylocentrotus purpuratus|Rep: PREDICTED: similar to
polypeptide N-acetylgalactosaminyltransferase 10 -
Strongylocentrotus purpuratus
Length = 376
Score = 171 bits (415), Expect = 6e-41
Identities = 80/184 (43%), Positives = 114/184 (61%), Gaps = 3/184 (1%)
Query: 319 RAVTPVIDVINADTFEYSPSP--LVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTM 376
R V P+IDVI+ + F Y ++RG F+W L++K + + P ++P M
Sbjct: 29 RIVCPMIDVISNEDFHYESQAGDVMRGAFDWELYYKRIPISEAENKRRSHESDPFRTPIM 88
Query: 377 AGGLFAIYREYF-NAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRR 435
AGGLFA+ R+YF +G YD G+ +WGGE ++SF++WMCGG +E IPCSRVGH++RK
Sbjct: 89 AGGLFAVDRKYFMEELGGYDEGLEIWGGEQYDLSFKVWMCGGEMEEIPCSRVGHIYRKFM 148
Query: 436 PYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCKTF 495
Y V + +N +R+ VWMD++ K E P + GDIS++ ALRERLQCK F
Sbjct: 149 SYTVPGGAGVINKNLLRVVEVWMDEWGKYFYERRPYLKGQDYGDISKQLALRERLQCKNF 208
Query: 496 KWYL 499
W+L
Sbjct: 209 TWFL 212
>UniRef50_Q4RKI0 Cluster: Chromosome 21 SCAF15029, whole genome
shotgun sequence; n=2; Tetraodontidae|Rep: Chromosome 21
SCAF15029, whole genome shotgun sequence - Tetraodon
nigroviridis (Green puffer)
Length = 531
Score = 169 bits (412), Expect = 1e-40
Identities = 92/240 (38%), Positives = 140/240 (58%), Gaps = 23/240 (9%)
Query: 248 SEVKNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQ 307
SE +++ +RLL++++R G+ R GA + G++LVF+DSH E GWL PLL+R++Q
Sbjct: 231 SEYLSHLSRVRLLRSARRLGVAGCRALGASKAEGELLVFMDSHCECQKGWLEPLLERVAQ 290
Query: 308 GVDGVKVRYSARAVTPVIDVINADTFEYSPSPL-VRGGFNWGLHFKWDN---LPK---GT 360
R V+P+ID I+ TF Y+ + VRG FNW L F+W++ LP G+
Sbjct: 291 D--------RTRVVSPIIDAIDWRTFRYNATQWPVRGVFNWRLDFRWESHTLLPDKDPGS 342
Query: 361 LINDEDFMKPL------KSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWM 414
+ + L +SP + G +FAI R +F +G +DPGM +WG E +E+S R+W
Sbjct: 343 AVRALRLCRRLTETARFRSPVLGGEVFAIDRHFFQHVGGFDPGMLLWGEEQIELSIRVWS 402
Query: 415 CGGSLELIPCSRVGHVFRKRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAH 474
CGGS+E+ PCSRV H+ PY + QD + N +R+A +WM Y +K+ + AH
Sbjct: 403 CGGSMEVAPCSRVAHLDHHSLPYTFPD-QDLLENNKIRIAEIWMGAY-RKIFYRRDTLAH 460
Score = 91.1 bits (216), Expect = 8e-17
Identities = 39/96 (40%), Positives = 63/96 (65%)
Query: 118 HAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMD 177
+ FN +S+ I HR LP+ R+ C QQY + LP AS++ICF+NE + TL+R+VHS++
Sbjct: 145 YGFNEAVSEGISVHRRLPEARHPRCLQQQYSESLPSASVVICFHNEAWSTLLRTVHSVLS 204
Query: 178 RTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNV 213
++H++E++LVDD S +L + E + L+ V
Sbjct: 205 TAPRRHLRELLLVDDLSQHGHLKGVLSEYLSHLSRV 240
>UniRef50_Q8IA43 Cluster: Putative polypeptide
N-acetylgalactosaminyltransferase 10; n=1; Drosophila
melanogaster|Rep: Putative polypeptide
N-acetylgalactosaminyltransferase 10 - Drosophila
melanogaster (Fruit fly)
Length = 630
Score = 167 bits (405), Expect = 1e-39
Identities = 94/247 (38%), Positives = 138/247 (55%), Gaps = 17/247 (6%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
I +L+ +R G I+AR+ S VLVFLDSHIEVN WLPPLL+ +
Sbjct: 230 IHILRLPERRGSIKARMEAIRVSSCQVLVFLDSHIEVNTNWLPPLLE---------PIVI 280
Query: 317 SARAVT-PVIDVINADTFEYSP-SPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSP 374
+ VT P++D I+ TF Y+ + + R GFNW L ++LP D P ++P
Sbjct: 281 NPHIVTRPILDAISRKTFAYAKQNTMTRSGFNWWLES--ESLPIFPEDKSPD-STPYRTP 337
Query: 375 TMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHV-FRK 433
++G + AI R YF +G +D ++ W E EISF++WMCGG + +PC+RVGH+ R
Sbjct: 338 VLSGAM-AIDRNYFLNLGGFDEQLDTWEAEKFEISFKVWMCGGMMLYVPCARVGHIGKRP 396
Query: 434 RRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHV-EIGDISERKALRERLQC 492
+ +++ +N R+A VWMD+Y K V + NP + G + +RK R L+C
Sbjct: 397 MKSISSPGYHNFLARNYKRVAEVWMDNYKKYVYDKNPKLYKMANAGLLFQRKTKRNALEC 456
Query: 493 KTFKWYL 499
KTF WY+
Sbjct: 457 KTFDWYM 463
Score = 76.6 bits (180), Expect = 2e-12
Identities = 36/88 (40%), Positives = 58/88 (65%)
Query: 120 FNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRT 179
F +S RI +R LPDTR C+ ++Y + LP ++II F++EH L+RS+ SI++R+
Sbjct: 135 FYAELSDRIPLNRSLPDTRPISCRKRKYLENLPNVTVIIAFHDEHLSVLLRSITSIINRS 194
Query: 180 DQKHIKEIILVDDYSDLYNLHHDVQEAV 207
+ +K+I+LVDD S+L L ++E V
Sbjct: 195 PVELLKQIVLVDDDSNLPELGQQLEEIV 222
>UniRef50_Q6TBR4 Cluster: UDP-N-acetyl-D-galactosamine:polypeptide
N- acetylgalactosaminyltransferase T4; n=1; Toxoplasma
gondii|Rep: UDP-N-acetyl-D-galactosamine:polypeptide N-
acetylgalactosaminyltransferase T4 - Toxoplasma gondii
Length = 329
Score = 164 bits (398), Expect = 7e-39
Identities = 98/252 (38%), Positives = 132/252 (52%), Gaps = 21/252 (8%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
+RLL+ R+G+ AR G + V V LDSH+EV WL PL+ R++ + +
Sbjct: 98 VRLLRHQTRKGVTVARSTGIRAAKSHVFVILDSHVEVGYQWLEPLVARVASNPETI---- 153
Query: 317 SARAVTPVIDVINADTFEYSPSPLVRGGFNWG-LHFKWDNLPKGTLINDEDFMKPLKSPT 375
V PV+D ++ T E+ S + G W + + L L +P SPT
Sbjct: 154 ----VFPVVDAVDYRTLEFKSSGV---GLIWSVMEHGFVPLSPERLAYSPGAYRP--SPT 204
Query: 376 MAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRR 435
M G +FA + YF G YD GM G EN+E+S R W CGG LE PCSRV H+FR
Sbjct: 205 MMGSVFAADKNYFLQHGGYDEGMRFEGAENIELSLRQWQCGGRLECSPCSRVFHLFRS-- 262
Query: 436 PYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCKTF 495
G + N +R VWMD+Y V HV +GDIS+R LRERL CK+F
Sbjct: 263 ----GADAQPVTWNRLRTMAVWMDEYGDLAWRVT-GEPHVSLGDISDRIKLRERLGCKSF 317
Query: 496 KWYLDNMWFETD 507
+W+LDN+W E+D
Sbjct: 318 QWFLDNVWPESD 329
Score = 52.4 bits (120), Expect = 3e-05
Identities = 26/44 (59%), Positives = 33/44 (75%)
Query: 151 LPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYS 194
LP ASIII FYNE+ L R++HSI+ RT + ++EIILVDD S
Sbjct: 31 LPAASIIIAFYNEYPTALHRTLHSILHRTPLQLLEEIILVDDGS 74
>UniRef50_Q5CYR4 Cluster: Extracellular protein with a signal
peptide followed by a family 2 glycosyltransferase and
ricin domains; n=3; Cryptosporidium|Rep: Extracellular
protein with a signal peptide followed by a family 2
glycosyltransferase and ricin domains - Cryptosporidium
parvum Iowa II
Length = 545
Score = 164 bits (398), Expect = 7e-39
Identities = 109/319 (34%), Positives = 162/319 (50%), Gaps = 44/319 (13%)
Query: 247 NSEVKNNVFN-IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRL 305
N E+ ++ I++++ K EGLIR+R+ GAD S V+VF+D H W+ PL+ RL
Sbjct: 101 NKELPSSYLKYIKVIRLDKCEGLIRSRILGADASKSSVIVFMDGHCRPKENWIEPLINRL 160
Query: 306 SQGVDGVKVRYSARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDE 365
+ + V P+I+ I+ T++ + ++ F+W F W E
Sbjct: 161 KEKPKAI--------VCPMIEDIDRYTWKDLGTFGLKMMFDWNFEFNWY----------E 202
Query: 366 DFMKPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCS 425
DF + P +GGL+AI RE++ GKYDPGM WGGEN+E S RIW CGG + S
Sbjct: 203 DFTDVI--PIASGGLYAITREWWEESGKYDPGMLEWGGENIEQSIRIWRCGGEIVAEKKS 260
Query: 426 RVGHVF-RKRRPYGVGEKQDYMLQNSMRMARVWMD----DYVKKVIEVNPSAAHVEIG-D 479
RVGH+F R +P + + +N R A VW+D Y + + +V S + G D
Sbjct: 261 RVGHIFKRDPKPNPENKLVLQVQRNQKRAAMVWLDKKRYKYFETIHDVVKSLNETQSGVD 320
Query: 480 ISERKALRERLQCKTFKWYLDNMWFETDRSELVLGR---------TLCLDASNN------ 524
+ +R +++ERL+CK F WY+D DR L+L LCL AS N
Sbjct: 321 LEQRHSIKERLKCKPFSWYVDKFRASFDRGGLLLDNFRHFKHRKSGLCLTASLNEVVTGT 380
Query: 525 --VAPILGKCHEMGGTQEW 541
A + +C+E TQ+W
Sbjct: 381 EDKAVVFKECNERDDTQKW 399
>UniRef50_Q6YBY0 Cluster: UDP-N-acetyl-D-galactosamine:polypeptide
N- acetylgalactosaminyltransferase T3; n=1; Toxoplasma
gondii|Rep: UDP-N-acetyl-D-galactosamine:polypeptide N-
acetylgalactosaminyltransferase T3 - Toxoplasma gondii
Length = 635
Score = 161 bits (392), Expect = 4e-38
Identities = 96/257 (37%), Positives = 137/257 (53%), Gaps = 16/257 (6%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
+RL++ R+G++ AR+ G S + LDSHIEV+ WL PLL R+ + DG
Sbjct: 245 VRLIRNEVRKGIVGARMKGIRASRAPIFAILDSHIEVSPQWLEPLLLRIKE--DG----- 297
Query: 317 SARAVTPVIDVINADTFEYSPSPL-VRGGFNWGL-HFKWDNLPKGTLINDEDFMKPLK-- 372
R V P ID I+A+TF++ + + GF W L ++ L +E P
Sbjct: 298 -RRVVMPQIDGIDAETFKHIAGGIGCKLGFLWKLMEHSYEGHQTARLPPEERQPSPTDFQ 356
Query: 373 -SPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVF 431
SP MAGGLFA + +F +G YD WG ENLE+SFR+W CGG LE PCSRV H+F
Sbjct: 357 TSPAMAGGLFAANKAFFFDVGAYDEDFQFWGTENLELSFRLWQCGGVLECAPCSRVYHIF 416
Query: 432 RKRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEV-NPSAAHVEIGDISERKALRERL 490
RK G D + N MR +WMD+Y V + + +R+ R+R
Sbjct: 417 RKGGS-GYSSPGDSITINKMR-TMLWMDEYADLAWRVIGKPRVNYRPESLEKRREWRKRK 474
Query: 491 QCKTFKWYLDNMWFETD 507
CK+F+W+++N++ E D
Sbjct: 475 GCKSFRWFMENVFPEGD 491
Score = 74.1 bits (174), Expect = 9e-12
Identities = 37/84 (44%), Positives = 52/84 (61%), Gaps = 1/84 (1%)
Query: 120 FNTLISQRIGDHRDLPDTRNKLCQSQQY-FDELPRASIIICFYNEHYETLMRSVHSIMDR 178
FN +S + R PD R+ C+ Y LP+AS+II FYNE + TLMRSVHS+++
Sbjct: 146 FNLYLSDHLELDRTAPDARHASCRQLHYDLSTLPKASVIIVFYNEPFSTLMRSVHSVLNG 205
Query: 179 TDQKHIKEIILVDDYSDLYNLHHD 202
T + ++E+ILVDD S L + D
Sbjct: 206 TPPQILEELILVDDGSTLPYIRED 229
>UniRef50_Q8IA41 Cluster: Putative polypeptide
N-acetylgalactosaminyltransferase 11; n=2; Drosophila
melanogaster|Rep: Putative polypeptide
N-acetylgalactosaminyltransferase 11 - Drosophila
melanogaster (Fruit fly)
Length = 557
Score = 146 bits (353), Expect = 2e-33
Identities = 102/347 (29%), Positives = 163/347 (46%), Gaps = 44/347 (12%)
Query: 260 LKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRYSAR 319
L+ +G+I ARL GA + GD+LVFL+ H+EV GWLPPLL+ + + +
Sbjct: 171 LEMESSKGIIHARLTGAGVATGDILVFLNGHMEVTRGWLPPLLEPI--------LLNNQT 222
Query: 320 AVTPVIDVINADTFEY----SPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPT 375
P++D I+ ++F Y P L F+W L + L + + KP S
Sbjct: 223 VTEPIVDAISRESFAYRKLVEPEQLA---FDWQLDHIFLPLDQHSW---NSLPKPYPSSQ 276
Query: 376 MAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVF-RKR 434
+ G +FAI R++F +G +D G+ +GG+ LE+S ++W CGG + +PCSRVG ++ R
Sbjct: 277 LEGRVFAIDRKWFWHLGGWDEGLRDYGGDALELSLKVWQCGGLILAVPCSRVGIIYKRDE 336
Query: 435 RPYGVGEKQDYMLQ---NSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQ 491
+ ++ LQ N R+ VW+D+Y NP ++ + + + LR RL
Sbjct: 337 LEAQMAPNRNPSLQVQKNFKRVVDVWLDEYKLHFYRYNPKLRNLTAESLDKPRDLRRRLN 396
Query: 492 CKTFKWY-------LDNMWFETDRSELVLGRTL-------CLDASNNVAPILGKCHEMGG 537
CK+F+WY + N + + +G+ + CL P++ KCH
Sbjct: 397 CKSFEWYRSQVAPQIRNHFLHAGLTNYPIGKIMPFVAPHFCLSIKGGF-PVIRKCHST-N 454
Query: 538 TQEWKHKGTASSPIYNTAAGMCLGVDRSYRGETVLMVICDDYSNNKW 584
++W T +S MCL VD Y+ S N W
Sbjct: 455 FEDW----TLTSRCQLKHGNMCLDVD--YKNNVRATKCTKKLSKNPW 495
Score = 52.0 bits (119), Expect = 4e-05
Identities = 30/86 (34%), Positives = 46/86 (53%), Gaps = 2/86 (2%)
Query: 118 HAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELP-RASIIICFYNEHYETLMRSVHSIM 176
+ +N +S+RI R L D R+ C Y E SI+I EH TL+R ++S++
Sbjct: 75 YQYNAWLSERIPLKRTLEDYRDPQCLKINYSSEKTVTVSIVIAIQQEHPHTLLRGIYSVI 134
Query: 177 DRTDQKHIKEIILV-DDYSDLYNLHH 201
+T +KEI+LV D + DL + H
Sbjct: 135 TQTSPYLLKEIVLVHDGHPDLDLIRH 160
>UniRef50_UPI0000E46EB4 Cluster: PREDICTED: similar to MGC81846
protein, partial; n=2; Strongylocentrotus
purpuratus|Rep: PREDICTED: similar to MGC81846 protein,
partial - Strongylocentrotus purpuratus
Length = 358
Score = 136 bits (329), Expect = 2e-30
Identities = 60/114 (52%), Positives = 86/114 (75%)
Query: 96 EEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRAS 155
E++ G++R E+ IRD GY HAFN LISQRIG HR++ DTRN LC+ Q Y +ELP S
Sbjct: 133 EDELGMVRTDEERSIRDGGYRQHAFNELISQRIGFHRNVTDTRNPLCKYQVYSEELPTVS 192
Query: 156 IIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDK 209
I+ICFYNE + TL+R+V+S++DRT ++ I E+ILVDD+S+L +L ++ + + K
Sbjct: 193 IVICFYNEAWSTLLRTVYSVLDRTPRRLIHELILVDDFSELTHLKKELDQYMSK 246
Score = 129 bits (312), Expect = 2e-28
Identities = 63/130 (48%), Positives = 82/130 (63%), Gaps = 8/130 (6%)
Query: 242 KKSTENSEVKNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPL 301
KK + KN + ++ +REGLIRAR GA + GDVL+FLDSH EVN WL PL
Sbjct: 237 KKELDQYMSKNFNGLVHVIHNGQREGLIRARTIGARYATGDVLMFLDSHCEVNEQWLEPL 296
Query: 302 LKRLSQGVDGVKVRYSARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTL 361
L+R+ S V P+ID+IN DTF Y+ SPLV+GGFNWG+HFKWD + L
Sbjct: 297 LERIKAD--------SHTVVCPIIDIINHDTFAYTASPLVKGGFNWGMHFKWDTIRSRQL 348
Query: 362 INDEDFMKPL 371
+ ED++KP+
Sbjct: 349 VGKEDYVKPI 358
>UniRef50_Q6YK77 Cluster: UDP-N-acetyl-D-galactosamine:polypeptide
N- acetylgalactosaminyltransferase T2; n=1; Toxoplasma
gondii|Rep: UDP-N-acetyl-D-galactosamine:polypeptide N-
acetylgalactosaminyltransferase T2 - Toxoplasma gondii
Length = 692
Score = 135 bits (326), Expect = 4e-30
Identities = 86/257 (33%), Positives = 126/257 (49%), Gaps = 22/257 (8%)
Query: 258 RLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRYS 317
R+++ +GLIR R+ GA + D FLD H VGW PLL L K Y
Sbjct: 214 RVIRFDSPQGLIRGRVAGAAIATSDNFFFLDGHCRPKVGWAEPLLAHL-------KTNYR 266
Query: 318 ARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTMA 377
R P I I D++E + + F W F W ED + P +A
Sbjct: 267 -RIACPKIYDIYLDSWEDVGTHGTKMMFEWTFEFGWF----------EDLEDEV--PVLA 313
Query: 378 GGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRRPY 437
GG+ A+ ++++ G YD GM WGGENLE S R W+CGG + + S++GH+F +
Sbjct: 314 GGILAMTKKWWIESGLYDEGMLEWGGENLEQSIRSWLCGGEIVAVQESKIGHIFSRPPKP 373
Query: 438 GVGEKQDYMLQ-NSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALR-ERLQCKTF 495
G + +Q N R A+VW+D+Y + + + GDI++RK LR E+L C F
Sbjct: 374 NPGNRLVIQVQKNQKRGAKVWLDEYYFLFYKYHREVRGHQEGDITQRKKLRYEQLTCMPF 433
Query: 496 KWYLDNMWFETDRSELV 512
+WY++ DR L+
Sbjct: 434 QWYVEKFKTAFDRKGLL 450
>UniRef50_Q4RNJ5 Cluster: Chromosome 21 SCAF15012, whole genome
shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
Chromosome 21 SCAF15012, whole genome shotgun sequence -
Tetraodon nigroviridis (Green puffer)
Length = 364
Score = 133 bits (321), Expect = 1e-29
Identities = 80/181 (44%), Positives = 103/181 (56%), Gaps = 33/181 (18%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
+RL++ +KREGL+RARL GA + G+VL FLD H E + GWL PLL+R+ + V
Sbjct: 191 VRLIRATKREGLVRARLLGASITTGEVLTFLDCHCECHEGWLEPLLQRIKEEPSAV---- 246
Query: 317 SARAVTPVIDVINADTFEY--SPSPLVRGGFNWGLHFKWDNLPK---------------G 359
V PVIDVI+ +TFEY +P GGF+W L F W +P+ G
Sbjct: 247 ----VCPVIDVIHWNTFEYLGNPGEPQIGGFDWRLVFTWHIIPEYEQKRRRSPTDVIRYG 302
Query: 360 TLI-------NDEDFMKPLK-SPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFR 411
L D + SPTMAGGLFA+ + YF+ +G YD GM VWGGENLE SFR
Sbjct: 303 RLFRTLALRAGSSDVPSARRRSPTMAGGLFAVSKNYFHYLGTYDTGMEVWGGENLEFSFR 362
Query: 412 I 412
+
Sbjct: 363 V 363
Score = 84.6 bits (200), Expect = 7e-15
Identities = 39/114 (34%), Positives = 67/114 (58%), Gaps = 1/114 (0%)
Query: 101 LIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQY-FDELPRASIIIC 159
L + E+ + ++ H N +S ++ HR LP+ N C+ +Y + LP S++I
Sbjct: 78 LTLSEEEKQKEEESLQKHQINIYVSDQVSLHRRLPEKWNPRCRELEYDYRSLPTTSVVIA 137
Query: 160 FYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNV 213
FYNE + TL+R+VHS+++ + +KE++LVDDYSD +L +++ V L V
Sbjct: 138 FYNEAWSTLLRTVHSVLETSPDILLKEVVLVDDYSDRAHLKEPLEKYVSGLKKV 191
>UniRef50_Q4RNJ6 Cluster: Chromosome 21 SCAF15012, whole genome
shotgun sequence; n=2; Clupeocephala|Rep: Chromosome 21
SCAF15012, whole genome shotgun sequence - Tetraodon
nigroviridis (Green puffer)
Length = 534
Score = 125 bits (301), Expect = 4e-27
Identities = 92/262 (35%), Positives = 128/262 (48%), Gaps = 44/262 (16%)
Query: 321 VTPVIDVINADTFEY---SPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTMA 377
V PVID I+ +TFEY + P++ GGF+W L F+W ++P+ + P++ P
Sbjct: 210 VCPVIDTIDWNTFEYYMQTDEPMI-GGFDWRLTFQWHSVPERERKRRSSRIDPIR-PRCR 267
Query: 378 GGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRRPY 437
G L A+ EI +W CGGSLE+ PCS VGHVF K+ PY
Sbjct: 268 GALAAMSLSLAFR----------------EIRGNVWQCGGSLEIHPCSHVGHVFPKKAPY 311
Query: 438 GVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCKTFKW 497
+ LQN++R A VWMD Y + NP+A GDIS R LRE+L+C++F W
Sbjct: 312 A----RPNFLQNTVRAAEVWMDSYKQHFYNRNPAARKETYGDISGRLLLREKLKCQSFTW 367
Query: 498 YLDNMWFETDRSELVLG-----RTL-----CLDASNNVAPILGK------CHEMGGTQEW 541
YL N++ E E G R L CLD + + G CH GG Q +
Sbjct: 368 YLKNIYPELHIPEDRAGWHGAVRNLGISSECLDYNAPEHSVTGAQLSLFGCHGQGGNQYF 427
Query: 542 KHKGTASSPI-YNTAAGMCLGV 562
++ T+ I +NT +C V
Sbjct: 428 EY--TSQKEIRFNTVTELCAEV 447
Score = 71.3 bits (167), Expect = 7e-11
Identities = 33/78 (42%), Positives = 49/78 (62%), Gaps = 2/78 (2%)
Query: 118 HAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMD 177
+A N +S +I HR + D R C + LP S+II FYNE + TL+R++HS+++
Sbjct: 106 YAINIFVSDKISLHRHIQDHRMNECAFD--YRRLPTTSVIIAFYNEAWSTLLRTIHSVLE 163
Query: 178 RTDQKHIKEIILVDDYSD 195
T +KEIIL+DD+SD
Sbjct: 164 TTPAILLKEIILIDDFSD 181
>UniRef50_Q5CHA1 Cluster: Glycosyl transferase; n=4;
Cryptosporidium|Rep: Glycosyl transferase -
Cryptosporidium hominis
Length = 809
Score = 124 bits (299), Expect = 7e-27
Identities = 80/256 (31%), Positives = 127/256 (49%), Gaps = 35/256 (13%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
+++++ K EGLIR+++ GAD ++G + FLD H + GW L+K ++ Y
Sbjct: 333 VKIIRLKKCEGLIRSKIIGADAALGPNIFFLDGHCKPKKGWSEALVK-------SIRENY 385
Query: 317 SARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKW--DNLPKGTLINDEDFMKPLKSP 374
R V P++ I+ + + + W F W D LP + P
Sbjct: 386 K-RVVCPIVQSISNIDWSDIGTAGAKMMIEWNFAFHWYDDGLP--------------EIP 430
Query: 375 TMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRK- 433
+GG+ I + ++ GKYDPGM WGGEN+E SFRIW+CGG + ++ S VGH+F +
Sbjct: 431 IASGGILMITKRWWEESGKYDPGMLYWGGENIEQSFRIWLCGGEIHVVRNSLVGHIFERN 490
Query: 434 ---RRPYGVGEKQ---DYMLQNSMRMARVWMDDYVKKVIEVNPSA-AHVEIG---DISER 483
+R K+ D M N R A VW+ + + N ++ I +SER
Sbjct: 491 NSNKRNQDFQYKKMLIDNMNSNHQRTAFVWLSEQFYETYFKNYHVLGYLPISYTKGLSER 550
Query: 484 KALRERLQCKTFKWYL 499
+L+ L+CK F+WY+
Sbjct: 551 LSLKHILKCKPFEWYI 566
>UniRef50_Q4SKF7 Cluster: Chromosome 13 SCAF14566, whole genome
shotgun sequence; n=14; Clupeocephala|Rep: Chromosome 13
SCAF14566, whole genome shotgun sequence - Tetraodon
nigroviridis (Green puffer)
Length = 530
Score = 122 bits (294), Expect = 3e-26
Identities = 86/278 (30%), Positives = 140/278 (50%), Gaps = 43/278 (15%)
Query: 249 EVKNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQG 308
E +N ++ ++++GL AR+ G + DV+ LD+HIEV+ W PLL ++
Sbjct: 122 EKENPSVRFTRVRHTEQKGLSHARVSGWSAATADVVAILDAHIEVHEMWAEPLLTQIRAD 181
Query: 309 VDGVKVRYSARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNL-PKGTLINDEDF 367
V V+PV D +N D + F+W L ++ P+ + D
Sbjct: 182 RSVV--------VSPVFDRVNYDDLKVIKYSPAAHAFDWALWCMYEGFTPEYYKLADSSL 233
Query: 368 MKPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRV 427
P KSP++ G L A R++ IG D GM ++W CGGS+E++PCS++
Sbjct: 234 --PGKSPSVMGILVAD-RKFLGEIGVLDEGM------------KVWTCGGSIEVVPCSKI 278
Query: 428 GHVFRKRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKV---------IEVNPSAAH---- 474
H+ R + Y + + M +N++R+A VWMD+Y V + N +
Sbjct: 279 AHMERAHKRY-MPDLTLAMKRNALRVAEVWMDEYKHNVNLAWNLPFQVFENEKRSSGNKR 337
Query: 475 -----VEIGDISERKALRERLQCKTFKWYLDNMWFETD 507
++IG+++ERK LRERL+CK FKWYL+N++ + D
Sbjct: 338 RPNHGIDIGNVTERKQLRERLKCKPFKWYLENVYPKLD 375
Score = 68.1 bits (159), Expect = 6e-10
Identities = 37/127 (29%), Positives = 70/127 (55%), Gaps = 3/127 (2%)
Query: 92 ENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDEL 151
++ L +++G + D R + + + +N +S R+ R L DTR C + Y +L
Sbjct: 3 DSALFKEWGENLSEADQREAEALFKKYGYNVFLSDRLPLDRPLADTREPRCSKKSYPKDL 62
Query: 152 PRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQ---EAVD 208
P S+++ + NE + R++ SI++RT + +KEII+VDD S +L D+ +A++
Sbjct: 63 PTLSVVLIYLNEALSVIKRALRSILNRTPKHLLKEIIMVDDNSSNEDLKGDLDFYVKALE 122
Query: 209 KLNNVIK 215
K N ++
Sbjct: 123 KENPSVR 129
>UniRef50_Q8IA44 Cluster: Putative polypeptide
N-acetylgalactosaminyltransferase 12; n=2; Drosophila
melanogaster|Rep: Putative polypeptide
N-acetylgalactosaminyltransferase 12 - Drosophila
melanogaster (Fruit fly)
Length = 563
Score = 115 bits (276), Expect = 4e-24
Identities = 68/250 (27%), Positives = 123/250 (49%), Gaps = 16/250 (6%)
Query: 258 RLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRYS 317
R+L ++ GLI+AR A + + LVF+D+ +E GWL PLL +++ S
Sbjct: 179 RILHLPEQVGLIKARNLAASEAKAENLVFVDAQVEFTNGWLSPLLDTIAE--------QS 230
Query: 318 ARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTMA 377
TP++D ++ T Y S RG ++W L + L + P + +
Sbjct: 231 YTLATPILDNLDEQTLAYQRSIERRGMYDWSLTRREVPLSRA---RRSHLPWPYEVAAVR 287
Query: 378 GGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRRPY 437
+FAI +F I +D + +G LE+SF++W GG + +PCSRVGH+ K Y
Sbjct: 288 TSVFAIPAVWFQDISNFDNNLRGFGAAELELSFKVWCTGGRIVQVPCSRVGHLQPKDEDY 347
Query: 438 -----GVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQC 492
+ + + +N R+ VW D + + P ++ GD++E + L ++ +C
Sbjct: 348 LKRYGDLHKMGEQKSRNLKRIIEVWTGDLKSAIYKYQPHLLNISEGDLNEPRKLYKQNEC 407
Query: 493 KTFKWYLDNM 502
++FK +++++
Sbjct: 408 QSFKEFINDI 417
Score = 68.5 bits (160), Expect = 5e-10
Identities = 34/86 (39%), Positives = 56/86 (65%), Gaps = 3/86 (3%)
Query: 113 KGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQY---FDELPRASIIICFYNEHYETLM 169
+G+ + +N+ +++RI R LPD R+ CQ +Y DE+ ASII+ F NE L+
Sbjct: 68 QGWRYYLYNSWLAERIPLRRSLPDLRDHRCQKLEYDEDSDEMKPASIIMIFRNEQLVVLL 127
Query: 170 RSVHSIMDRTDQKHIKEIILVDDYSD 195
R++HS+++RT + E+ILV+D+SD
Sbjct: 128 RTLHSLVERTPKYLYIELILVNDHSD 153
>UniRef50_UPI0000E46FFD Cluster: PREDICTED: similar to
n-acetylgalactosaminyltransferase, partial; n=1;
Strongylocentrotus purpuratus|Rep: PREDICTED: similar to
n-acetylgalactosaminyltransferase, partial -
Strongylocentrotus purpuratus
Length = 405
Score = 110 bits (264), Expect = 1e-22
Identities = 57/155 (36%), Positives = 89/155 (57%), Gaps = 8/155 (5%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
+RL+ T+ REG+ RA++ GA + G+VLVFLD+H EVN WL P+L + QG V
Sbjct: 213 VRLVHTTHREGVARAKMRGAREARGEVLVFLDAHCEVNTHWLEPMLDLVHQGPTTV---- 268
Query: 317 SARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTM 376
V+P+ID I+ +TF + L R F W L + L + + ++P++SP
Sbjct: 269 ----VSPIIDKIDPETFGFEDGSLARVTFRWSLETRRIPLSQIEKAERLNPLEPVRSPLT 324
Query: 377 AGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFR 411
GG+FA+ + +F IG D G++ WG + L+ S +
Sbjct: 325 NGGIFAVSKSFFEKIGGIDAGLDGWGADGLDFSMK 359
Score = 83.4 bits (197), Expect = 2e-14
Identities = 40/95 (42%), Positives = 61/95 (64%), Gaps = 1/95 (1%)
Query: 101 LIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQY-FDELPRASIIIC 159
LI + +D+ K + H FN ++S +I R + DTR+ CQ Y F + P AS+II
Sbjct: 99 LILSGKDMEKAKKSRDQHNFNLVVSDKISLERTVKDTRDSRCQDITYRFSKFPTASVIIA 158
Query: 160 FYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYS 194
F+NE + TLMR+VHS+++RT + + E++LVDD S
Sbjct: 159 FHNEAWSTLMRTVHSVVNRTPRDILTEVVLVDDAS 193
Score = 37.1 bits (82), Expect = 1.3
Identities = 14/25 (56%), Positives = 19/25 (76%)
Query: 478 GDISERKALRERLQCKTFKWYLDNM 502
GD+SE K+LR RL C +F WYL ++
Sbjct: 364 GDVSEIKSLRARLSCHSFDWYLSHV 388
>UniRef50_Q4STJ6 Cluster: Chromosome undetermined SCAF14183, whole
genome shotgun sequence; n=2; Tetraodon
nigroviridis|Rep: Chromosome undetermined SCAF14183,
whole genome shotgun sequence - Tetraodon nigroviridis
(Green puffer)
Length = 344
Score = 106 bits (255), Expect = 1e-21
Identities = 47/88 (53%), Positives = 61/88 (69%), Gaps = 1/88 (1%)
Query: 319 RAVTPVIDVINADTFEY-SPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTMA 377
R V+PVID+IN DTF Y + S +RGGF+W LHFKW+ L D +P+K+P +A
Sbjct: 5 RVVSPVIDIINMDTFAYVAASADLRGGFDWSLHFKWEQLSPEQRARRTDPAQPIKTPIIA 64
Query: 378 GGLFAIYREYFNAIGKYDPGMNVWGGEN 405
GGLF I R +FN +GKYD M++WGGEN
Sbjct: 65 GGLFVIDRSWFNHLGKYDTAMDIWGGEN 92
>UniRef50_UPI000155C133 Cluster: PREDICTED: similar to
UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 4, partial; n=1;
Ornithorhynchus anatinus|Rep: PREDICTED: similar to
UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase-like 4, partial -
Ornithorhynchus anatinus
Length = 305
Score = 103 bits (247), Expect = 1e-20
Identities = 56/138 (40%), Positives = 83/138 (60%), Gaps = 15/138 (10%)
Query: 412 IWMCGGSLELIPCSRVGHVFRKRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVI---EV 468
+W CGGS+E++PCSR+ H+ R +PY + ++ +N++R+A VWMD++ V +
Sbjct: 105 VWQCGGSVEVLPCSRIAHIERAHKPY-TEDLTAHVRRNALRVAEVWMDEFKSHVYMAWNI 163
Query: 469 NPSAAHVEIGDISERKALRERLQCKTFKWYLDNMWFETDR-SELV--------LGRTLCL 519
+ ++IGDISERKALR+ LQCKTF+WYL N++ E S+ V L LCL
Sbjct: 164 PQEDSGIDIGDISERKALRKALQCKTFRWYLVNVYPEMRMYSDTVAYGVLQNSLKSDLCL 223
Query: 520 DASNNV--APILGKCHEM 535
D + PI+ CH M
Sbjct: 224 DQGPDTENIPIMYICHGM 241
>UniRef50_Q4T0W3 Cluster: Chromosome undetermined SCAF10824, whole
genome shotgun sequence; n=9; Euteleostomi|Rep:
Chromosome undetermined SCAF10824, whole genome shotgun
sequence - Tetraodon nigroviridis (Green puffer)
Length = 149
Score = 103 bits (246), Expect = 2e-20
Identities = 46/92 (50%), Positives = 62/92 (67%), Gaps = 4/92 (4%)
Query: 412 IWMCGGSLELIPCSRVGHVFRKRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPS 471
IW CGGSLE+ PCS VGHVF K+ PY ++ L NS+R A VWMD+Y + NP
Sbjct: 1 IWQCGGSLEIHPCSHVGHVFPKKAPYS----RNKALANSVRAAEVWMDEYKEIYYHRNPH 56
Query: 472 AAHVEIGDISERKALRERLQCKTFKWYLDNMW 503
A GD++ER+ LRE+L CK+F W+L+N++
Sbjct: 57 ARLEAFGDVTERRKLREKLGCKSFGWFLENIY 88
>UniRef50_UPI0001554C17 Cluster: PREDICTED: similar to Polypeptide
N-acetylgalactosaminyltransferase 17; n=1;
Ornithorhynchus anatinus|Rep: PREDICTED: similar to
Polypeptide N-acetylgalactosaminyltransferase 17 -
Ornithorhynchus anatinus
Length = 328
Score = 97.5 bits (232), Expect = 9e-19
Identities = 38/93 (40%), Positives = 61/93 (65%), Gaps = 1/93 (1%)
Query: 407 EISFRIWMCGGSLELIPCSRVGHVFRKRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVI 466
+++ ++WMCGG + +PCSRVGH++RK PY V + +N R+A WMD++ + +
Sbjct: 91 DVAVKVWMCGGGMFDVPCSRVGHIYRKYVPYKVPSGTS-LARNLKRVAETWMDEFAEYIY 149
Query: 467 EVNPSAAHVEIGDISERKALRERLQCKTFKWYL 499
+ P H+ GDIS +K LR+ L+CK FKW++
Sbjct: 150 QRRPEYRHLSTGDISAQKELRKHLKCKDFKWFM 182
>UniRef50_A5D6B4 Cluster: Predicted glycosyltransferases; n=1;
Pelotomaculum thermopropionicum SI|Rep: Predicted
glycosyltransferases - Pelotomaculum thermopropionicum
SI
Length = 274
Score = 91.1 bits (216), Expect = 8e-17
Identities = 81/258 (31%), Positives = 123/258 (47%), Gaps = 32/258 (12%)
Query: 252 NNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQ-GVD 310
NN ++L+ +S G RAR GA ++ G L+F D+HI V WL LL S+ GVD
Sbjct: 36 NNYERVKLISSSGL-GAARARNLGAASARGKYLIFCDAHITVPQNWLEALLDTFSRPGVD 94
Query: 311 GVKVRYSARAVTPVIDVINADTFEYSPSPLVRGG-FNWGLHFKWDNLPKGTLINDEDFMK 369
AV+P I + +P+ + G +N L W P G M
Sbjct: 95 ---------AVSPAIGSLE------NPAAVGYGQTWNSRLETVWLPPPGG--------MP 131
Query: 370 PLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGH 429
P + GG A+ F +G +D G VWG E+ E+S ++W+ G L ++P RV H
Sbjct: 132 AGPVPLLPGGCLAVRAGAFRRVGGFDEGFIVWGCEDAELSLKLWLFGCRLYVVPSVRVLH 191
Query: 430 VFRKRRPYGVGEKQDYMLQNSMRMA-RVWMDDYVKKVIE-VNPSAAHVE-IGDISERKAL 486
+FR R PY V D++ N +RMA + VKKV+ + P + + + + AL
Sbjct: 192 LFRSRHPYPV--TMDHVHHNLLRMALSHFKSSRVKKVMGLIEPCGRLADTVRRVLQGGAL 249
Query: 487 RERLQCKTFKWYLDNMWF 504
++R + + Y D+ WF
Sbjct: 250 QQRRRYLAGRMY-DDDWF 266
>UniRef50_UPI0000D8AB1E Cluster:
UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 9; n=2;
Euarchontoglires|Rep:
UDP-N-acetyl-alpha-D-galactosamine:polypeptide
N-acetylgalactosaminyltransferase 9 - Mus musculus
Length = 311
Score = 89.0 bits (211), Expect = 3e-16
Identities = 59/180 (32%), Positives = 84/180 (46%), Gaps = 32/180 (17%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
+++++ S+REGLIRARL G + ++ F D+H+E N GW P L R+ +
Sbjct: 139 VKVVRNSRREGLIRARLQGWKVATAPIVGFFDAHVEFNTGWAEPALARIQED-------- 190
Query: 317 SARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFK-----------------WD---NL 356
R + P ID I TFE G+NWGL W+ +
Sbjct: 191 RRRIILPAIDNIKYSTFEVQQYASAAHGYNWGLWCMYIIPPQDWLDRGEPGSVWELGSST 250
Query: 357 PKGTLINDEDFMK-PL---KSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRI 412
G+ E F P ++P M G F + REYF IG DPGM V+G EN+E+ R+
Sbjct: 251 KAGSTPKGESFPSVPTHRGRTPAMIGCSFVVDREYFGDIGLLDPGMEVYGAENIELGMRV 310
Score = 76.2 bits (179), Expect = 2e-12
Identities = 35/90 (38%), Positives = 55/90 (61%)
Query: 120 FNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRT 179
+N +S RI R +PD R K C+ Y ++LP+ S++ F NE ++RSVHS+++ T
Sbjct: 44 YNAQLSDRISLDRTIPDYRPKRCRQITYSEDLPQISVVFIFVNEALSVILRSVHSVVNHT 103
Query: 180 DQKHIKEIILVDDYSDLYNLHHDVQEAVDK 209
+ +KE+ILVDD SD L ++ + V K
Sbjct: 104 PSQLLKEVILVDDNSDNVELKFNLDQYVHK 133
>UniRef50_UPI00005A4710 Cluster: PREDICTED: similar to GalNAc
transferase 10 isoform a; n=1; Canis lupus
familiaris|Rep: PREDICTED: similar to GalNAc transferase
10 isoform a - Canis familiaris
Length = 216
Score = 87.8 bits (208), Expect = 7e-16
Identities = 40/89 (44%), Positives = 57/89 (64%)
Query: 112 DKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRS 171
D Y + FN +S I R LPD R+ C+ + Y + LP SIII F+NE + +L+R+
Sbjct: 81 DSAYRENGFNIFVSNSIALERSLPDIRHANCKHKMYLERLPNTSIIIPFHNEGWTSLLRT 140
Query: 172 VHSIMDRTDQKHIKEIILVDDYSDLYNLH 200
+HSI++RT + I EIILVDD+SD +H
Sbjct: 141 IHSIINRTPESLIAEIILVDDFSDRGKIH 169
>UniRef50_Q4RQK9 Cluster: Chromosome 2 SCAF15004, whole genome
shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
Chromosome 2 SCAF15004, whole genome shotgun sequence -
Tetraodon nigroviridis (Green puffer)
Length = 449
Score = 85.4 bits (202), Expect = 4e-15
Identities = 43/100 (43%), Positives = 66/100 (66%), Gaps = 4/100 (4%)
Query: 98 QFG---LIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRA 154
QFG L+ +SED ++R++ ++ FN +S RI R +PDTR + C D+LP
Sbjct: 25 QFGQAVLVSSSEDAQVRER-WDEGFFNVYLSDRIPVDRAVPDTRPESCAQSLIHDDLPST 83
Query: 155 SIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYS 194
S+I CF +E + TL+RSVHS+++R+ ++EIILVDD+S
Sbjct: 84 SVIFCFVDEVWSTLLRSVHSVLNRSPLHLLREIILVDDFS 123
Score = 82.6 bits (195), Expect = 3e-14
Identities = 47/103 (45%), Positives = 62/103 (60%), Gaps = 9/103 (8%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
+R+++ +R+GLIRARL GA + G+VL FLDSH+E NVGWL PLL+R+ +D KV
Sbjct: 142 VRIIRLQERQGLIRARLAGAAAATGEVLTFLDSHVECNVGWLEPLLERIY--LDRRKV-- 197
Query: 317 SARAVTPVIDVINADTFEYS-PSPLVRGGFNWGLHFKWDNLPK 358
PVI+VIN Y RG F W L F W +P+
Sbjct: 198 ----PCPVIEVINDKDMSYMLVDNFQRGIFKWPLVFGWSPVPE 236
>UniRef50_Q7Q046 Cluster: ENSANGP00000016624; n=1; Anopheles gambiae
str. PEST|Rep: ENSANGP00000016624 - Anopheles gambiae
str. PEST
Length = 205
Score = 84.2 bits (199), Expect = 9e-15
Identities = 42/94 (44%), Positives = 61/94 (64%), Gaps = 1/94 (1%)
Query: 113 KGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQ-QYFDELPRASIIICFYNEHYETLMRS 171
+GY+ N +S I R LPD R+ C ++ + ELP+ASI+I F+NE + L+R+
Sbjct: 110 QGYDQQGLNQYVSDLIPVRRRLPDLRDPWCTAETRLLPELPQASIVIVFFNEAWSVLVRT 169
Query: 172 VHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQE 205
VHSI+DRT I+EIILVDD+S+L +L + E
Sbjct: 170 VHSILDRTPHALIREIILVDDFSNLAHLRTQLDE 203
>UniRef50_Q2B871 Cluster: Glycosyl transferase, group 2 family
protein; n=1; Bacillus sp. NRRL B-14911|Rep: Glycosyl
transferase, group 2 family protein - Bacillus sp. NRRL
B-14911
Length = 297
Score = 82.6 bits (195), Expect = 3e-14
Identities = 75/252 (29%), Positives = 115/252 (45%), Gaps = 26/252 (10%)
Query: 256 NIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVR 315
NI L+ T+ G AR GA + G VLVF D+H+E WL L++ L G+
Sbjct: 61 NISLI-TTDGVGAANARNEGAKLAKGQVLVFCDAHLEFEDYWLDLLIEPLLTGLTD---- 115
Query: 316 YSARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPT 375
AVTP I I F L + + W+ + +D + P
Sbjct: 116 ----AVTPAIGAIGNPHFTGYGQTLWVNERSSKIRTHWN-------VKQDDLFETAILP- 163
Query: 376 MAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRR 435
GG FAI R F G ++ G VWG E++EIS ++W+ G + P ++V H+FRK +
Sbjct: 164 --GGCFAINRSVFEEAGGFETGFPVWGYEDVEISIKLWLFGYKCHVQPKAKVLHLFRKVQ 221
Query: 436 PYGVGEKQDYMLQNSMRMARVWMDD---YVKKVIEVNPSAAHVEIGDISERKALRERLQC 492
PY V E +Y N +R+A + Y + + +N + +E + + AL E+ Q
Sbjct: 222 PYRV-ELDEY-FYNLLRLAYLHFSPARIYKTRKMLINGNEKEIE-RKVLAQGAL-EKKQA 277
Query: 493 KTFKWYLDNMWF 504
+ D+ WF
Sbjct: 278 YLARRRYDDDWF 289
>UniRef50_Q4SU00 Cluster: Chromosome undetermined SCAF14054, whole
genome shotgun sequence; n=1; Tetraodon
nigroviridis|Rep: Chromosome undetermined SCAF14054,
whole genome shotgun sequence - Tetraodon nigroviridis
(Green puffer)
Length = 253
Score = 80.2 bits (189), Expect = 1e-13
Identities = 32/62 (51%), Positives = 42/62 (67%)
Query: 344 GFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGG 403
GF+W LHFKW+ L D +P+K+P +AGGLF I R +FN +GKYD M++WGG
Sbjct: 1 GFDWSLHFKWEQLSPEQRARRTDPAQPIKTPIIAGGLFVIDRSWFNHLGKYDTAMDIWGG 60
Query: 404 EN 405
EN
Sbjct: 61 EN 62
>UniRef50_UPI0000E234D0 Cluster: PREDICTED: similar to UDP-GalNAc:
polypeptide N-acetylgalactosaminyltransferase; n=1; Pan
troglodytes|Rep: PREDICTED: similar to UDP-GalNAc:
polypeptide N-acetylgalactosaminyltransferase - Pan
troglodytes
Length = 459
Score = 79.8 bits (188), Expect = 2e-13
Identities = 35/95 (36%), Positives = 57/95 (60%)
Query: 115 YNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHS 174
Y + +N +S RI R +PD R + C+ Y +LP+ S++ F NE ++RSVHS
Sbjct: 114 YEEYGYNAQLSDRISLDRSIPDYRPRKCRQMSYAQDLPQVSVVFIFVNEALSVILRSVHS 173
Query: 175 IMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDK 209
+++ T + +KE+ILVDD SD L ++ + V+K
Sbjct: 174 VVNHTPSQLLKEVILVDDNSDNVELKFNLDQYVNK 208
Score = 79.4 bits (187), Expect = 2e-13
Identities = 61/169 (36%), Positives = 86/169 (50%), Gaps = 27/169 (15%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQ--------G 308
+++++ S+REGLIRARL G + V+ F D+H+E N GW P L R+ +
Sbjct: 214 VKIVRNSRREGLIRARLQGWKAATAPVVGFFDAHVEFNTGWAEPALSRIREDSRPQFLLS 273
Query: 309 VDGVK-----VRYSARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLIN 363
D +K VRYS RA + +F S P +G GLH + L +L
Sbjct: 274 HDELKLSCFSVRYSRRAHHYAWLIFCIFSFPGS-WPSRQGS---GLHS--EVLTPLSL-- 325
Query: 364 DEDFMKPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRI 412
P ++P M G F + REYF IG DPGM V+GGEN+E+ R+
Sbjct: 326 ------PPRTPAMIGCSFVVDREYFGDIGLLDPGMEVYGGENVELGMRV 368
>UniRef50_Q5BYW7 Cluster: SJCHGC07375 protein; n=1; Schistosoma
japonicum|Rep: SJCHGC07375 protein - Schistosoma
japonicum (Blood fluke)
Length = 202
Score = 79.4 bits (187), Expect = 2e-13
Identities = 42/105 (40%), Positives = 65/105 (61%), Gaps = 2/105 (1%)
Query: 115 YNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDEL-P-RASIIICFYNEHYETLMRSV 172
++++ FN + S IG R+L D R+ C Q D+L P + S+II F+NE + L+R+V
Sbjct: 87 FSINEFNLVASDLIGLRRNLDDFRHPSCPRQIPLDKLIPFKTSVIIVFHNEAWSALLRTV 146
Query: 173 HSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIKKE 217
HS++DRT ++ + EIILVDD S +L H + + LN I+ E
Sbjct: 147 HSVLDRTPEQLLHEIILVDDASTQSHLGHQLDNYISSLNKPIRLE 191
>UniRef50_Q4TCW9 Cluster: Chromosome undetermined SCAF6660, whole
genome shotgun sequence; n=2; Deuterostomia|Rep:
Chromosome undetermined SCAF6660, whole genome shotgun
sequence - Tetraodon nigroviridis (Green puffer)
Length = 157
Score = 75.4 bits (177), Expect = 4e-12
Identities = 43/94 (45%), Positives = 56/94 (59%), Gaps = 9/94 (9%)
Query: 242 KKSTENSEVKNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPL 301
KK EN V+ +R+L+ +R GLIRARL GA + G V+ FLD+H E VGWL PL
Sbjct: 59 KKKLENY-VRTLEVPVRILRMEQRSGLIRARLRGAAATKGQVITFLDAHCECTVGWLEPL 117
Query: 302 LKRLSQGVDGVKVRYSARAVTPVIDVINADTFEY 335
L R+ + V V P+IDVI+ +TFEY
Sbjct: 118 LARIKEDRTAV--------VCPIIDVISDETFEY 143
Score = 59.3 bits (137), Expect = 3e-07
Identities = 23/54 (42%), Positives = 42/54 (77%)
Query: 142 CQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSD 195
C+++ Y D++P S++I F+NE + TL+R+VHS+++R+ + + EI+LVDD S+
Sbjct: 1 CKTKVYPDDVPNTSVVIVFHNEAWSTLLRTVHSVINRSPRHLLVEIVLVDDASE 54
>UniRef50_UPI0000E46BB8 Cluster: PREDICTED: similar to
UDP-GalNAc:polypeptide,
N-acetylgalactosaminyltransferase, partial; n=3;
Strongylocentrotus purpuratus|Rep: PREDICTED: similar to
UDP-GalNAc:polypeptide,
N-acetylgalactosaminyltransferase, partial -
Strongylocentrotus purpuratus
Length = 112
Score = 74.1 bits (174), Expect = 9e-12
Identities = 35/79 (44%), Positives = 53/79 (67%)
Query: 117 LHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIM 176
++ FN + S RI +R LPD R + C ++ Y +LP S+I+ ++NE TL+R+VHSI+
Sbjct: 33 INEFNLMASDRIALNRSLPDVRPRGCANKVYPKKLPTTSVILVYHNEARSTLLRNVHSII 92
Query: 177 DRTDQKHIKEIILVDDYSD 195
+R+ + EIILVDD SD
Sbjct: 93 NRSPHDLLAEIILVDDASD 111
>UniRef50_UPI00005A4DE7 Cluster: PREDICTED: similar to Probable
polypeptide N-acetylgalactosaminyltransferase 8
(Protein-UDP acetylgalactosaminyltransferase 8)
(UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 8) (Polypeptide GalNAc
transferase 8) (GalNAc-T8) (pp-GaNTase 8)...; n=1; Canis
lupus familiaris|Rep: PREDICTED: similar to Probable
polypeptide N-acetylgalactosaminyltransferase 8
(Protein-UDP acetylgalactosaminyltransferase 8)
(UDP-GalNAc:polypeptide
N-acetylgalactosaminyltransferase 8) (Polypeptide GalNAc
transferase 8) (GalNAc-T8) (pp-GaNTase 8)... - Canis
familiaris
Length = 437
Score = 73.7 bits (173), Expect = 1e-11
Identities = 40/91 (43%), Positives = 53/91 (58%), Gaps = 2/91 (2%)
Query: 321 VTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTMAGGL 380
V+PV D IN DTFE L GFNW L ++D LP+ + D P+KSP++ G+
Sbjct: 165 VSPVFDNINFDTFELDKYALAVDGFNWKLWCRYDPLPE-AWFDLHDVTAPIKSPSIM-GI 222
Query: 381 FAIYREYFNAIGKYDPGMNVWGGENLEISFR 411
A R + IG D GM V+GGEN+E+S R
Sbjct: 223 LAANRIFLGEIGSLDGGMLVYGGENVELSLR 253
Score = 70.1 bits (164), Expect = 2e-10
Identities = 41/150 (27%), Positives = 78/150 (52%), Gaps = 6/150 (4%)
Query: 45 QPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRN 104
Q +V+ T+ + KY+++ Q + + +K K+ K ++ L Q+G +
Sbjct: 3 QEENVDNTVERV--KYEEHPVQ----KTLKVKASETKEHKPKEILFPDSQLFRQWGEDLS 56
Query: 105 SEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEH 164
+ + + +N +S ++ +R +PDTR+ C + Y +LP +I+ F NE
Sbjct: 57 EAQQKKAEDLFQEFGYNVYLSNQLPLNRTIPDTRDSRCLQKTYSSQLPSLGVILIFMNEA 116
Query: 165 YETLMRSVHSIMDRTDQKHIKEIILVDDYS 194
+ R++ SI++RT + +KEIILVDD+S
Sbjct: 117 LSIIQRAITSIINRTPTQLLKEIILVDDFS 146
Score = 47.2 bits (107), Expect = 0.001
Identities = 18/38 (47%), Positives = 29/38 (76%)
Query: 466 IEVNPSAAHVEIGDISERKALRERLQCKTFKWYLDNMW 503
+E++ + ++ GDIS R ALR++L+CKTF WYL N++
Sbjct: 248 VELSLRNSGIDFGDISSRMALRKKLKCKTFDWYLKNVY 285
>UniRef50_Q4T9B6 Cluster: Chromosome undetermined SCAF7602, whole
genome shotgun sequence; n=1; Tetraodon
nigroviridis|Rep: Chromosome undetermined SCAF7602,
whole genome shotgun sequence - Tetraodon nigroviridis
(Green puffer)
Length = 451
Score = 73.3 bits (172), Expect = 2e-11
Identities = 31/82 (37%), Positives = 53/82 (64%)
Query: 120 FNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRT 179
FN + S I R + D R+ C+ Y + L +S++I F+NE + TLMR+VHS++ RT
Sbjct: 165 FNMVASDMISLDRTISDIRHDECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRT 224
Query: 180 DQKHIKEIILVDDYSDLYNLHH 201
++++ EI+++DD+S+ HH
Sbjct: 225 PRRYLAEIVMIDDFSNKEKAHH 246
>UniRef50_UPI0000F20FAB Cluster: PREDICTED: hypothetical protein;
n=1; Danio rerio|Rep: PREDICTED: hypothetical protein -
Danio rerio
Length = 204
Score = 72.9 bits (171), Expect = 2e-11
Identities = 32/54 (59%), Positives = 40/54 (74%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVD 310
+R+L+T KREGLIR RL GA + G V+ FLDSH E NV WLPPLL R++Q +
Sbjct: 140 VRILRTQKREGLIRTRLLGAAAARGQVITFLDSHCEANVNWLPPLLDRIAQNTN 193
Score = 64.1 bits (149), Expect = 1e-08
Identities = 34/86 (39%), Positives = 55/86 (63%), Gaps = 4/86 (4%)
Query: 142 CQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHH 201
C+ + Y +LP S+II F+NE + +L+R+VHS++DR+ I EIILVDD+SD +L
Sbjct: 69 CKLKLYTADLPNTSVIIPFHNEGWSSLLRTVHSVLDRSPPLLIAEIILVDDFSDKGHLKA 128
Query: 202 DVQEAVDKLNNV----IKKEEEMIET 223
+++ + +L V +K E +I T
Sbjct: 129 PLEQYMVRLPKVRILRTQKREGLIRT 154
>UniRef50_Q01V98 Cluster: Glycosyl transferase, family 2; n=1;
Solibacter usitatus Ellin6076|Rep: Glycosyl transferase,
family 2 - Solibacter usitatus (strain Ellin6076)
Length = 282
Score = 72.9 bits (171), Expect = 2e-11
Identities = 64/202 (31%), Positives = 95/202 (47%), Gaps = 24/202 (11%)
Query: 258 RLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRYS 317
R++KT+ G+ AR G + GD+L F D+HI + W PL + L KV
Sbjct: 50 RVIKTNGI-GVACARNLGVSKTTGDMLFFADAHIRLEKNWWQPLAEVLEDR----KVAAV 104
Query: 318 ARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTMA 377
A AVT + P+ RG +GL F +L L + P +P +
Sbjct: 105 APAVT------------HLPATRRRG---FGLTFTGPDLDARWL--PRQGVTPFSAPILP 147
Query: 378 GGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRRPY 437
G + R F+A+G +D G+ GG + E+S R+W+ G L + P V H+FR PY
Sbjct: 148 GCSLMMRRATFDAVGGWDGGLLHRGGVDNEMSVRLWLLGYELMVAPQVVVPHLFRSASPY 207
Query: 438 GVGEKQDYMLQNSMRMARVWMD 459
VG Q L N +R+A V ++
Sbjct: 208 PVGWPQ--YLHNRLRLAFVHLN 227
>UniRef50_A4J8G0 Cluster: Glycosyl transferase, family 2; n=1;
Desulfotomaculum reducens MI-1|Rep: Glycosyl
transferase, family 2 - Desulfotomaculum reducens MI-1
Length = 291
Score = 72.9 bits (171), Expect = 2e-11
Identities = 55/199 (27%), Positives = 94/199 (47%), Gaps = 26/199 (13%)
Query: 256 NIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVR 315
+++L+ T+ G AR GA G++LVF D+HI V WL + LS+G+ +
Sbjct: 58 SVKLINTTGI-GAANARNLGAQQCAGEILVFCDAHITVEPDWL----ENLSEGL----LE 108
Query: 316 YSARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPT 375
+ AV+P I +N + + G W + LP + + P
Sbjct: 109 RGSGAVSPGIANMNMNH-------AIGYGMTWNKQLEARWLPSTGDVAEV--------PI 153
Query: 376 MAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRR 435
GG A++R+ FN +G ++ G +G E+ E S ++W+ G +E+ P + H FR +
Sbjct: 154 APGGCVAVHRDVFNDVGGFETGFRTYGFEDAEFSLKLWLFGYRVEVDPSVVIQHHFRSKH 213
Query: 436 PYGVGEKQDYMLQNSMRMA 454
PY + ++Y N + MA
Sbjct: 214 PYSI-TMEEY-AYNGIHMA 230
>UniRef50_Q4RPK0 Cluster: Chromosome 12 SCAF15007, whole genome
shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
Chromosome 12 SCAF15007, whole genome shotgun sequence -
Tetraodon nigroviridis (Green puffer)
Length = 554
Score = 72.1 bits (169), Expect = 4e-11
Identities = 80/322 (24%), Positives = 130/322 (40%), Gaps = 44/322 (13%)
Query: 290 HIEVNVGWLPPLLKRLSQGVDGVKVRYSARAVTPVIDVINADTFEYSPSPLVRGGFNWGL 349
HI+ P+L R+ + R + P ID I +TFE G+NWGL
Sbjct: 243 HIDFRTRLAEPILTRMKED--------HTRIILPAIDNIKYNTFEVQQYANAAHGYNWGL 294
Query: 350 HF-------KWDNLPKGTL---INDEDFMKPLKSPTMAGGLFAIYREYFNAIGKYDPGMN 399
+W + +G I ++ K + A + + + FN G +
Sbjct: 295 WCMYIIPPQEWLDKGRGVTCLEIKKDELQTSSKKRSCARWILDLCFQ-FN--GFTPVSLC 351
Query: 400 VWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRRPYGVGEKQDYMLQNSMRMARVWMD 459
V + ++W CGGS+E++PC+RV H+ R ++PY + Y +N++R A VWMD
Sbjct: 352 V-------LPVQVWQCGGSMEVLPCARVAHIERTKKPYN-NDIDYYAKRNALRAAEVWMD 403
Query: 460 DYVKKV-------IEVNPSAAH---VEIGDISERKALR---ERLQCKTFKWYLDN-MWFE 505
+Y V + V S A ++ G + KA+ + + ++ + +
Sbjct: 404 EYKSHVYMAWNIPMNVRNSKASGYCLDQGAEDDDKAILYPCHGMSSQLARYSTEGLLQLG 463
Query: 506 TDRSELVLGRTLCL-DASNNVAPILGKCHEMGGTQEWKHKGTASSPIYNTAAGMCLGVDR 564
S L T CL D P L KC + + T + PI + G CL V+
Sbjct: 464 PLGSTTFLPDTKCLVDDGRGRTPRLKKCEAVSRNSQRLWDFTQNGPIISRDTGRCLEVEM 523
Query: 565 SYRGETVLMVICDDYSNNKWDI 586
S L ++ S KW I
Sbjct: 524 SKDANFGLRLVVQRCSGQKWMI 545
>UniRef50_UPI0000E49DD9 Cluster: PREDICTED: hypothetical protein;
n=2; Strongylocentrotus purpuratus|Rep: PREDICTED:
hypothetical protein - Strongylocentrotus purpuratus
Length = 220
Score = 71.3 bits (167), Expect = 7e-11
Identities = 35/96 (36%), Positives = 56/96 (58%)
Query: 104 NSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNE 163
N D D+ + FN +IS RI R + D R+ C+ Y LP ++++ F+NE
Sbjct: 99 NPNDQDKYDQSLKEYGFNMVISDRIALDRAVNDIRHDECKYWHYPKNLPNTTVVVVFHNE 158
Query: 164 HYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNL 199
+ TL+R+VHS+++ + + EIILVDD+SD +L
Sbjct: 159 GWSTLLRTVHSVINTSPPYLLHEIILVDDFSDKISL 194
>UniRef50_Q4SIA0 Cluster: Chromosome 5 SCAF14581, whole genome
shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
Chromosome 5 SCAF14581, whole genome shotgun sequence -
Tetraodon nigroviridis (Green puffer)
Length = 404
Score = 68.9 bits (161), Expect = 3e-10
Identities = 35/104 (33%), Positives = 59/104 (56%)
Query: 91 TENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDE 150
+++ L +G + ++ R+ K + + +N +S R+ R +PD R C++ Y
Sbjct: 35 SDSSLFAHWGQNLSPDNRRVALKMFQYYGYNGYLSDRLPLDRPIPDLRPDGCRNTTYPLS 94
Query: 151 LPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYS 194
LP+ SI+ F NE ++RS+HS ++RT +KEIILVDD S
Sbjct: 95 LPQVSIVFIFVNEALSVILRSIHSAINRTPSHLLKEIILVDDNS 138
Score = 66.1 bits (154), Expect = 2e-09
Identities = 36/90 (40%), Positives = 47/90 (52%), Gaps = 2/90 (2%)
Query: 322 TPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTMAGGLF 381
+P D I DTFE PL GF+W L ++ N PK P++SP + G F
Sbjct: 158 SPSFDNIKYDTFEIEEYPLSAQGFDWELWCRYLNPPKSWWFKGNK-SAPIQSPALIG-CF 215
Query: 382 AIYREYFNAIGKYDPGMNVWGGENLEISFR 411
+ R YF IG D GM V+GGEN+E+ R
Sbjct: 216 VVDRLYFEEIGLLDEGMEVYGGENVELGIR 245
Score = 58.0 bits (134), Expect = 7e-07
Identities = 25/48 (52%), Positives = 37/48 (77%)
Query: 458 MDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCKTFKWYLDNMWFE 505
M+ Y + +E+ + ++IGD+S+RKALR+RLQCKTF+WYL NM+ E
Sbjct: 232 MEVYGGENVELGIRDSGIDIGDVSDRKALRKRLQCKTFRWYLVNMYPE 279
>UniRef50_UPI0000F1FCC6 Cluster: PREDICTED: hypothetical protein;
n=1; Danio rerio|Rep: PREDICTED: hypothetical protein -
Danio rerio
Length = 213
Score = 66.9 bits (156), Expect = 1e-09
Identities = 33/78 (42%), Positives = 46/78 (58%)
Query: 118 HAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMD 177
+ +N +S +I R LPD R C+ + +LP+ SII F NE ++RSVHS ++
Sbjct: 132 YGYNAFLSDKISLDRSLPDYRPSKCKKAFFPRDLPQISIIFIFVNEALSVILRSVHSAVN 191
Query: 178 RTDQKHIKEIILVDDYSD 195
T +KEIILVDD SD
Sbjct: 192 HTPAHLLKEIILVDDNSD 209
>UniRef50_A7T195 Cluster: Predicted protein; n=1; Nematostella
vectensis|Rep: Predicted protein - Nematostella
vectensis
Length = 693
Score = 64.5 bits (150), Expect = 8e-09
Identities = 31/82 (37%), Positives = 49/82 (59%)
Query: 108 LRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYET 167
L+ + Y + FN IS +IG RD+PDTR+ C+ + Y LP SIII F+NE T
Sbjct: 24 LKEGEDAYGKNQFNQAISDKIGGDRDVPDTRHSHCRYEAYPSTLPATSIIITFHNEARST 83
Query: 168 LMRSVHSIMDRTDQKHIKEIIL 189
L+R+V S++ + + I +++
Sbjct: 84 LLRTVKSLLSIPNLERICRLLM 105
>UniRef50_UPI000069DFD6 Cluster: Polypeptide
N-acetylgalactosaminyltransferase 11 (EC 2.4.1.41)
(Protein-UDP acetylgalactosaminyltransferase 11) (UDP-
GalNAc:polypeptide N-acetylgalactosaminyltransferase 11)
(Polypeptide GalNAc transferase 11) (GalNAc-T11)
(pp-GaNTase 11).; n=1; Xenopus tropicalis|Rep:
Polypeptide N-acetylgalactosaminyltransferase 11 (EC
2.4.1.41) (Protein-UDP acetylgalactosaminyltransferase
11) (UDP- GalNAc:polypeptide
N-acetylgalactosaminyltransferase 11) (Polypeptide
GalNAc transferase 11) (GalNAc-T11) (pp-GaNTase 11). -
Xenopus tropicalis
Length = 343
Score = 61.7 bits (143), Expect = 5e-08
Identities = 26/54 (48%), Positives = 36/54 (66%)
Query: 86 KMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRN 139
++ + DL + G+I N +D +RD GY HAFN LIS R+G HRD+PDTR+
Sbjct: 78 QLETEANADLSPELGMIFNEQDQDVRDVGYQKHAFNLLISNRLGYHRDVPDTRD 131
Score = 56.4 bits (130), Expect = 2e-06
Identities = 32/91 (35%), Positives = 45/91 (49%), Gaps = 6/91 (6%)
Query: 500 DNMWFETDRSELVLGRTLCLDASNNVA---PILGKCHEMGGTQEWKHKGTASSPIYNTAA 556
+ +W + EL+L LCLD S + P L KCH GG+Q+W S+ +Y +
Sbjct: 253 EQVWSYNEEHELILSNLLCLDMSETRSSDPPRLMKCHGSGGSQQWVF--GKSNRLYQVSV 310
Query: 557 GMCLG-VDRSYRGETVLMVICDDYSNNKWDI 586
G CL VD R V M ICD + +W +
Sbjct: 311 GQCLKLVDPMSRKGYVSMAICDGSPSQQWHL 341
>UniRef50_Q3A8Y3 Cluster: Glycosyl transferase, group 2 family; n=1;
Carboxydothermus hydrogenoformans Z-2901|Rep: Glycosyl
transferase, group 2 family - Carboxydothermus
hydrogenoformans (strain Z-2901 / DSM 6008)
Length = 288
Score = 60.1 bits (139), Expect = 2e-07
Identities = 33/101 (32%), Positives = 57/101 (56%), Gaps = 3/101 (2%)
Query: 369 KPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVG 428
K + P + GGL I + F +G ++ M WG E+ E+S R+W+ G L ++P V
Sbjct: 145 KVAEIPVVPGGLMVIKSKVFFEVGGFEGLMERWGWEDAELSLRLWLMGYRLLVVPEVVVY 204
Query: 429 HVFRKRRPYGVGEKQDYMLQNSMRMARVWM-DDYVKKVIEV 468
H+FR+R+PY K L+N +A + ++ VKK++++
Sbjct: 205 HLFRERQPYPTSRKA--ALKNLFILALNHLSEERVKKILKL 243
Score = 34.3 bits (75), Expect = 9.2
Identities = 21/52 (40%), Positives = 31/52 (59%), Gaps = 2/52 (3%)
Query: 251 KNNVFN-IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPL 301
K++ +N I+L+KT + GL RA+ GA + G LVF D+H+ WL L
Sbjct: 50 KDSRYNQIKLIKT-EGIGLARAKNLGAKYASGKYLVFSDAHMSYQTFWLDHL 100
>UniRef50_Q3A8Y2 Cluster: Glycosyl transferase, group 2 family; n=1;
Carboxydothermus hydrogenoformans Z-2901|Rep: Glycosyl
transferase, group 2 family - Carboxydothermus
hydrogenoformans (strain Z-2901 / DSM 6008)
Length = 288
Score = 57.6 bits (133), Expect = 9e-07
Identities = 24/64 (37%), Positives = 38/64 (59%)
Query: 374 PTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRK 433
P + GG I + F A+G YD G+ +WG ++ E S R W+ G +L + P ++V H+FR
Sbjct: 149 PVLPGGFMLIKKNDFIALGGYDEGLKIWGYDDCEFSLRAWLMGFNLLVTPRTKVFHLFRS 208
Query: 434 RRPY 437
+ Y
Sbjct: 209 GQIY 212
Score = 43.6 bits (98), Expect = 0.015
Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 4/67 (5%)
Query: 244 STENSEV---KNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPP 300
ST+ S V N NI L+K R G+ RA+ GA+ + G+VL+F D+HI V WL
Sbjct: 41 STDQSTVFLNSGNFKNIDLIKLP-RSGVTRAKNAGANKARGEVLIFSDAHILVEDFWLEK 99
Query: 301 LLKRLSQ 307
+L+ L +
Sbjct: 100 MLEDLQE 106
>UniRef50_Q4TDW9 Cluster: Chromosome undetermined SCAF5986, whole
genome shotgun sequence; n=1; Tetraodon
nigroviridis|Rep: Chromosome undetermined SCAF5986,
whole genome shotgun sequence - Tetraodon nigroviridis
(Green puffer)
Length = 123
Score = 49.6 bits (113), Expect = 2e-04
Identities = 25/61 (40%), Positives = 33/61 (54%), Gaps = 1/61 (1%)
Query: 425 SRVGHVFRKRRPYGV-GEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISER 483
SRVGHVFRK+ PY G +N+ R A VWMD+Y PSA +V G + +
Sbjct: 1 SRVGHVFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSARNVPYGKVPDH 60
Query: 484 K 484
+
Sbjct: 61 Q 61
>UniRef50_Q68VJ7 Cluster: Polypeptide
N-acetylgalactosaminyltransferase 1; n=3;
Euteleostomi|Rep: Polypeptide
N-acetylgalactosaminyltransferase 1 - Homo sapiens
(Human)
Length = 170
Score = 48.8 bits (111), Expect = 4e-04
Identities = 41/120 (34%), Positives = 55/120 (45%), Gaps = 18/120 (15%)
Query: 478 GDISERKALRERLQCKTFKWYLDNMWFETD--RSELVLGR------TLCLD---ASNNVA 526
GDIS R LR +LQCK F WYL+N++ ++ R LG CLD N
Sbjct: 5 GDISSRVGLRHKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNVETNQCLDNMARKENEK 64
Query: 527 PILGKCHEMGGTQEWKHKGTASSPIYNTAAGMCLGVDRSYRGETVLMVICDDYSNNK-WD 585
+ CH MGG Q + + TA+ I +CL D S V M+ C N+ W+
Sbjct: 65 VGIFNCHGMGGNQVFSY--TANKEI--RTDDLCL--DVSKLNGPVTMLKCHHLKGNQLWE 118
>UniRef50_Q1ARC2 Cluster: Glycosyl transferase, family 2; n=1;
Rubrobacter xylanophilus DSM 9941|Rep: Glycosyl
transferase, family 2 - Rubrobacter xylanophilus (strain
DSM 9941 / NBRC 16129)
Length = 753
Score = 46.8 bits (106), Expect = 0.002
Identities = 52/189 (27%), Positives = 80/189 (42%), Gaps = 24/189 (12%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
+R+L+ +R G AR G + G+V+ F D EV GWL L L GV++
Sbjct: 397 VRVLRLERRAGQSAARNLGLRAARGEVVAFTDDDCEVLPGWLRALAAPLC--TPGVELA- 453
Query: 317 SARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTM 376
R ++P FE + SPL G P+G + + L S +
Sbjct: 454 GGRVLSPP-PAGRLGAFEAARSPLDMG-------------PEGGEVGPRGAVPYLPSCNL 499
Query: 377 AGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRRP 436
AG A+ R +G +D GM + GE+ ++ +R G + P +RV H R R
Sbjct: 500 AGDRRALLR-----LGGFDEGMEL--GEDADLVWRAVRAGLGVRYEPSARVVHRHRTRLA 552
Query: 437 YGVGEKQDY 445
+ + DY
Sbjct: 553 ALLARRADY 561
>UniRef50_Q4RAK4 Cluster: Chromosome undetermined SCAF23488, whole
genome shotgun sequence; n=2; Tetraodontidae|Rep:
Chromosome undetermined SCAF23488, whole genome shotgun
sequence - Tetraodon nigroviridis (Green puffer)
Length = 174
Score = 46.4 bits (105), Expect = 0.002
Identities = 35/130 (26%), Positives = 59/130 (45%), Gaps = 21/130 (16%)
Query: 470 PSAAHVEIGDISERKALRERLQCKTFKWYLDNMWFE-----------TDRSEL-VLGRTL 517
P + + GDISE K RE +CK+FKW+++ + ++ D E+ L +
Sbjct: 2 PESLTLAYGDISELKRFREEHRCKSFKWFMEEIAYDIPLHYPMPPKNVDWGEIRGLDTSY 61
Query: 518 CLDA---SNNVAPILGKCHEMGGTQEWKHKGTASSPIYNTAAGMCLGVDRSYRGETVLMV 574
C+D+ +N +G CH MGG Q ++ Y+ + G D S V++
Sbjct: 62 CIDSMGHTNGGNVEIGPCHRMGGNQLFRINEANQLMQYDQC--LTRGTDNS----GVIIT 115
Query: 575 ICDDYSNNKW 584
CD + +W
Sbjct: 116 HCDQNQHTEW 125
>UniRef50_Q5CY12 Cluster: UDP-N-acetylgalactosamine: polypeptide N-
acetylgalactosaminyltransferase, signal peptide; n=2;
Cryptosporidium|Rep: UDP-N-acetylgalactosamine:
polypeptide N- acetylgalactosaminyltransferase, signal
peptide - Cryptosporidium parvum Iowa II
Length = 414
Score = 46.4 bits (105), Expect = 0.002
Identities = 54/217 (24%), Positives = 98/217 (45%), Gaps = 31/217 (14%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
I++++T +E L + GA+NS G++++F+ S W+ P+++ LS +
Sbjct: 119 IKIIETELQE-LGELQNLGANNSTGEIILFVPSATLFPKNWMSPIMRSLSDNYKSI---- 173
Query: 317 SARAVTPVIDVINADTFEYSPS-PLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPT 375
+ P +N D + +S + P+ + F+ N+ TL N K P
Sbjct: 174 ----IVPRFKKLNKDKWTFSNNDPVYSPKMMFTKEFELTNI--HTLDN--------KVPM 219
Query: 376 MAGGLFAIYREYFNAIGKY-DPGMNV--WGGENLEISFRIWMCGGSLELIPCSRVGHVFR 432
+FAI + ++ I K DP +N+ N +IS R W CGG + I G V +
Sbjct: 220 FYSKIFAITKSWWLNISKLSDPTINLIFKTSINFDISLRSWNCGGRVAQIAELSFG-VTK 278
Query: 433 KRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVN 469
+ P E + +L++ W+D+ K++I N
Sbjct: 279 VKIPQPSLEIRQVLLES-------WIDEPTKQMIMNN 308
>UniRef50_UPI000069E575 Cluster: Polypeptide
N-acetylgalactosaminyltransferase 5 (EC 2.4.1.41)
(Protein-UDP acetylgalactosaminyltransferase 5) (UDP-
GalNAc:polypeptide N-acetylgalactosaminyltransferase 5)
(Polypeptide GalNAc transferase 5) (GalNAc-T5)
(pp-GaNTase 5).; n=1; Xenopus tropicalis|Rep:
Polypeptide N-acetylgalactosaminyltransferase 5 (EC
2.4.1.41) (Protein-UDP acetylgalactosaminyltransferase
5) (UDP- GalNAc:polypeptide
N-acetylgalactosaminyltransferase 5) (Polypeptide GalNAc
transferase 5) (GalNAc-T5) (pp-GaNTase 5). - Xenopus
tropicalis
Length = 124
Score = 46.0 bits (104), Expect = 0.003
Identities = 17/27 (62%), Positives = 23/27 (85%)
Query: 477 IGDISERKALRERLQCKTFKWYLDNMW 503
IGD++E+K LRERLQCK F WY+ N++
Sbjct: 2 IGDLTEQKQLRERLQCKNFNWYIKNVF 28
>UniRef50_A6WEA2 Cluster: Glycosyl transferase family 2; n=1;
Kineococcus radiotolerans SRS30216|Rep: Glycosyl
transferase family 2 - Kineococcus radiotolerans
SRS30216
Length = 289
Score = 46.0 bits (104), Expect = 0.003
Identities = 51/197 (25%), Positives = 77/197 (39%), Gaps = 34/197 (17%)
Query: 255 FNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKV 314
F +R + S R G+ AR G ++ DV++ D+ V VGW+ + + L Q
Sbjct: 60 FMLRRVDASARRGVAHARNAGCRAALADVILVCDADDVVGVGWVDAMARALEQA------ 113
Query: 315 RYSARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKW-DNLPKGTLINDEDFMKPLKS 373
D++ LV G N L +W P G L +
Sbjct: 114 -----------DLVGGT--------LVHGHLNTALVQQWRPTSPPGVLPTKLSHL----- 149
Query: 374 PTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRK 433
P G + RE F+A+G +D G V GG+++E S+R G L P + + +R
Sbjct: 150 PYAVGANVGLRREVFDALGGWDEGF-VAGGDDVEFSWRAQHAGFCLRSAPDAVI--AYRM 206
Query: 434 RRPYGVGEKQDYMLQNS 450
R KQ Y S
Sbjct: 207 RTTLSANVKQSYFYARS 223
>UniRef50_Q1Q6Z1 Cluster: Putative uncharacterized protein; n=1;
Candidatus Kuenenia stuttgartiensis|Rep: Putative
uncharacterized protein - Candidatus Kuenenia
stuttgartiensis
Length = 291
Score = 43.2 bits (97), Expect = 0.020
Identities = 27/69 (39%), Positives = 38/69 (55%), Gaps = 5/69 (7%)
Query: 244 STENS-----EVKNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWL 298
ST+NS +N + N+R+ S R G AR GA +VG+ LVF D+ EV GWL
Sbjct: 41 STDNSMEIVMRYQNRLPNLRIADASDRRGQAHARNIGARLAVGESLVFCDADDEVAPGWL 100
Query: 299 PPLLKRLSQ 307
+ + LS+
Sbjct: 101 AAMGEALSR 109
>UniRef50_UPI0000499CE1 Cluster: SMC3 protein; n=1; Entamoeba
histolytica HM-1:IMSS|Rep: SMC3 protein - Entamoeba
histolytica HM-1:IMSS
Length = 1188
Score = 42.7 bits (96), Expect = 0.026
Identities = 50/257 (19%), Positives = 110/257 (42%), Gaps = 15/257 (5%)
Query: 21 DIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFA 80
+ K+RIK +K IA E + + +EE + M + ++ + KE+
Sbjct: 643 ETKERIKEVEKDIARSEA------EKKRIEEEQKEIMKEMEEISSKIAEEEVKYEKERME 696
Query: 81 KQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNK 140
+ IK S++ ++ + I +E + R+ L N ++ I D ++ + K
Sbjct: 697 RMIKIKKSERIRESIKNKEKRIEENEKIIFRNTQKLLILQNEKDNKNIIDRSEIEKAKKK 756
Query: 141 LCQSQQYFDELPRASIIICFYNEHYETLMRSVH--SIMDRTD--QKHIKEIILVDDYSDL 196
L + Q EL + + I E+ ++R+ + I+ R + ++ ++E+ D SD+
Sbjct: 757 LEEIQNKVRELEKKRVEI----ENRRQVLRNEYQFGIISRINEIERKMREVESGGDESDI 812
Query: 197 YNLHHDVQEAVDKLNNVIKKEEEMI-ETNNIDMEXXXXXXXXXXXXKKSTENSEVKNNVF 255
+ +++++L + ++ E+ I E ++ + K E + +F
Sbjct: 813 EKYKEVLSKSMEELQRINEEIEKKIQEERTLEEQQEGIEKEKEKKEKIQNEREKKMARLF 872
Query: 256 NIRLLKTSKREGLIRAR 272
+ SKR+ LI+ R
Sbjct: 873 EKMTVLESKRKELIKRR 889
>UniRef50_A2DKE3 Cluster: Viral A-type inclusion protein, putative;
n=1; Trichomonas vaginalis G3|Rep: Viral A-type inclusion
protein, putative - Trichomonas vaginalis G3
Length = 2870
Score = 42.7 bits (96), Expect = 0.026
Identities = 45/203 (22%), Positives = 89/203 (43%), Gaps = 10/203 (4%)
Query: 29 NDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQQAIKMS 88
N+K++ + LS ++ E++ + K D + + + K Q+
Sbjct: 2356 NEKNVLLQQEISKLSSDLQEKEKSEKSLLQKQNDLISEISKLKNDIKDHKINLSQSTSSL 2415
Query: 89 KKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGD-HRDLPDTRNKLCQSQQY 147
KK D+ + I S +D+ NL N + ++I + L DT + L QS Q
Sbjct: 2416 KK---DISTKAKQIEQS-----KDELNNLQTENNSLKKKIQNLEAVLQDTEDSLAQSNQS 2467
Query: 148 FDELPRASIIICFYNEHYETLMRSVHSIMDR-TDQKHIKEIILVDDYSDLYNLHHDVQEA 206
++ + ++ E + L+ S ++R T++ KE L S+L N+ ++
Sbjct: 2468 QRQIKASYDLLNNKFEENQVLLNSKQKEIERLTNEVSDKEKELEKTKSELINIQERIRSD 2527
Query: 207 VDKLNNVIKKEEEMIETNNIDME 229
KLN I +++ +E+ NI++E
Sbjct: 2528 SSKLNQDINEKQTKLESLNIELE 2550
>UniRef50_Q23ZG7 Cluster: Peptidase family M1 containing protein;
n=1; Tetrahymena thermophila SB210|Rep: Peptidase family
M1 containing protein - Tetrahymena thermophila SB210
Length = 1721
Score = 42.3 bits (95), Expect = 0.035
Identities = 51/201 (25%), Positives = 86/201 (42%), Gaps = 18/201 (8%)
Query: 59 KYQDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLH 118
KY D + Q E +++ KEK + K+ L +Q I +K NL
Sbjct: 659 KYSDERAQFEQKQEEFSKEKEDLLNQFEKFKEENTILAKQIS--------EIEEKIENLI 710
Query: 119 AFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDR 178
+ N +S + D R L +T K + + Y DEL + I + E LM+ + +
Sbjct: 711 SENQELSIQNQDQRYLIETLEK--KQESYTDELKK-QFDISLRQKIEENLMQLTNRYNEE 767
Query: 179 TDQKHIKEII-----LVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEMIETNNIDMEXXXX 233
++ H EII +V++ L +Q +LN +I++ +E+ E NI+ E
Sbjct: 768 INKLH-NEIIKRNESVVEEKEKNRQLEEQIQRLKFELNKLIQETKELKEFANIE-EEQNY 825
Query: 234 XXXXXXXXKKSTENSEVKNNV 254
+ EN+EVKN +
Sbjct: 826 EQQKQYQQQLEFENNEVKNQI 846
>UniRef50_A2EE02 Cluster: Putative uncharacterized protein; n=1;
Trichomonas vaginalis G3|Rep: Putative uncharacterized
protein - Trichomonas vaginalis G3
Length = 766
Score = 41.9 bits (94), Expect = 0.046
Identities = 46/188 (24%), Positives = 84/188 (44%), Gaps = 10/188 (5%)
Query: 107 DLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRA--SIIICFYNEH 164
D++ D NL AF I++ I D + D +KL + D++ + S ++ +
Sbjct: 340 DMKKLDVDKNLLAF---IAKNISDDNSMGDV-SKLVANLTNKDDVACSLVSSLLLANMKQ 395
Query: 165 YETLMRSVHSIMDRTD-QKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEMIET 223
E L R V + R + + I+EI + + + L +QE +D L +K+ E++I+T
Sbjct: 396 QEQLQRMVDQVEQREEILQSIREIGVPPEKAAKIVLK--MQEEIDNLKKTVKENEQIIQT 453
Query: 224 NNIDMEXXXXXXXXXXXXK-KSTENSEVKNNVFNIRLLKTSKREGLIRARLYGADNSVGD 282
N ++ E K K TEN NV + K ++ + + R+ +DN +
Sbjct: 454 NKVEFENLEEQLEKLGEEKTKLTENLFNTENVLSDFRTKNNELQKELENRITESDNMKKE 513
Query: 283 VLVFLDSH 290
V D +
Sbjct: 514 YTVVKDQN 521
>UniRef50_A1BDD3 Cluster: Glycosyl transferase, family 2; n=1;
Chlorobium phaeobacteroides DSM 266|Rep: Glycosyl
transferase, family 2 - Chlorobium phaeobacteroides
(strain DSM 266)
Length = 994
Score = 41.5 bits (93), Expect = 0.061
Identities = 47/191 (24%), Positives = 82/191 (42%), Gaps = 24/191 (12%)
Query: 244 STENS-EVKNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWL---- 298
S +NS E N + I+L++ +G I + GA + G+ L+FL++ EV WL
Sbjct: 406 SPDNSYETLNKISQIKLIRNDCNKGFIHSCNSGASLATGEYLIFLNNDTEVLNAWLDSLI 465
Query: 299 PPLLKRLSQGVDGVKVRYSARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPK 358
P + + G+ G ++ Y + VI +D G N+G N P+
Sbjct: 466 APFIIHDNVGLVGSQIIYPDGRLQEAGGVILSD----------GSGLNYG-RLSDPNKPE 514
Query: 359 GTLINDEDFMKPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGS 418
+ + D+ +G AI + F++IG +D E+ +I+F + G
Sbjct: 515 YNFLREVDY--------CSGCSIAIKKSLFDSIGGFDTLFIPAYYEDTDIAFTVRKMGYK 566
Query: 419 LELIPCSRVGH 429
+ P S+V H
Sbjct: 567 VLYQPASKVIH 577
>UniRef50_Q23G97 Cluster: Putative uncharacterized protein; n=1;
Tetrahymena thermophila SB210|Rep: Putative
uncharacterized protein - Tetrahymena thermophila SB210
Length = 758
Score = 41.5 bits (93), Expect = 0.061
Identities = 49/251 (19%), Positives = 109/251 (43%), Gaps = 16/251 (6%)
Query: 16 RPLNWDIKDRIKNNDKSIAALEG-----KDLLSDQPRDVEETLS--KTMWKYQDYKRQSE 68
R L D DR + IA L+ K+ L + R++++ ++ K + + D+K+ +
Sbjct: 50 RQLELDFNDRTDVQIRQIANLKTEVDTLKETLKYKQRELDDAVAELKAIKEVSDHKQINT 109
Query: 69 YRRKVMLKEKFAKQQAIKMSKKT-ENDL----EEQFGLIRNSED--LRIRDKGYNLHAFN 121
R KV LK+ F Q +K ++ E L EE+ L+ + L I +
Sbjct: 110 ERLKVELKQSFENNQRLKEDNQSMETQLRWQKEEKKNLMERMDQLTLNIEEVMEKNEDLE 169
Query: 122 TLISQRIGDHRDLPDTRNKLCQSQQYF-DELPRASIIICFYNEHYETLMRSVHSIMDRTD 180
++ + D R+L + +L + +EL R + II ++ + ++
Sbjct: 170 RVLEESEKDKRNLENLHAQLQSEYELLNEELRRKNQIISELESQNSHQLKQILTLESDKG 229
Query: 181 QKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEMIETNNIDMEX-XXXXXXXXX 239
K + + L + +L +++ + +E + L+ +K ++ ++ N +++
Sbjct: 230 NKSSEVMKLKQELEELRHINKNQKEDIFNLDQCVKDKQSQLDQNYKELKSLQNNYTLALE 289
Query: 240 XXKKSTENSEV 250
KK+ +++EV
Sbjct: 290 ESKKAQDHAEV 300
>UniRef50_A2F531 Cluster: Viral A-type inclusion protein, putative;
n=1; Trichomonas vaginalis G3|Rep: Viral A-type
inclusion protein, putative - Trichomonas vaginalis G3
Length = 3748
Score = 40.7 bits (91), Expect = 0.11
Identities = 42/201 (20%), Positives = 90/201 (44%), Gaps = 11/201 (5%)
Query: 24 DRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQS---EYRRKVMLKEKFA 80
D I+N + + E +DL S + D + + + K D K+Q E + K + K+
Sbjct: 662 DEIENENDQLFE-EVEDLKS-KVDDAKILYNDMVDKIDDLKQQRSKVEQKYKDLEKQNKE 719
Query: 81 KQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNK 140
K I+ K ++L+E+ + +D + + A N I ++ ++ + + NK
Sbjct: 720 KSDEIEKVSKEISELKEKLDNLNQFKD-NTPELHQKVDAMNEQIVKKSQENEKIQEEMNK 778
Query: 141 LCQSQQYFD-ELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNL 199
L + Q+ + E+ ++ N+ ET+ + +I + ++K + D + L
Sbjct: 779 LNEELQHLENEMEEIEVV----NDERETIQEKIDNIKQQIEEKKKSNEEIQDIMNLLIEA 834
Query: 200 HHDVQEAVDKLNNVIKKEEEM 220
+D Q+ +D + V + EE+
Sbjct: 835 ENDAQKELDDIEIVEAQSEEI 855
>UniRef50_A0D876 Cluster: Chromosome undetermined scaffold_40, whole
genome shotgun sequence; n=2; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_40,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 537
Score = 40.7 bits (91), Expect = 0.11
Identities = 56/236 (23%), Positives = 107/236 (45%), Gaps = 16/236 (6%)
Query: 4 NQIVNKATKVHYRPLNWDIKDRIKNNDKSIAALEGKDL---LSDQPRDVE--ETLSKTMW 58
++++ + K+ + N +I ++I+ N + E K LS+ D E E L T+
Sbjct: 221 DKLIQENNKLQTQLTNAEI-EKIQMNQSQQSLTECKQCVQDLSNSNLDKESYELLQNTIK 279
Query: 59 KYQDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEE-QFGLIRNSEDLRIRDKGYNL 117
+Y+ + E RK+ L K + + K+ E LE+ Q GL + E I K NL
Sbjct: 280 EYEQSVQDMETERKMFLSAKSEISEKEQEKKQMEMKLEQKQHGLNKLKERNEICAK--NL 337
Query: 118 HAFNTLISQRIGDHRDLPDTRNKLCQSQQ-YFDELPRASIIICFYNEHYETLMRSVHSIM 176
A I + ++ + +K ++QQ Y +L + + ++ M ++ I
Sbjct: 338 KACQEQILKLQQQLKEKSELEDKAKEAQQKYQQQLIKLKDNFDKQQKELQSYMAQLNEIK 397
Query: 177 DRTDQKHIKEIILVDDY-SDLYNLHHDVQEAVDKLNNVIKKEEEMI--ETNNIDME 229
D+ +++ K I+L +++ S+ L + + + L KK EE I +T I+ E
Sbjct: 398 DKFEEEQKKNILLKEEFESEKKRLQEENKRNQEILQ---KKHEEAILQQTQRIEKE 450
>UniRef50_A6UUX2 Cluster: SMC domain protein; n=1; Methanococcus
aeolicus Nankai-3|Rep: SMC domain protein -
Methanococcus aeolicus Nankai-3
Length = 994
Score = 40.7 bits (91), Expect = 0.11
Identities = 51/229 (22%), Positives = 101/229 (44%), Gaps = 20/229 (8%)
Query: 4 NQIVNKATKVHYRPLNWDIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKT---MWKY 60
N +N+ TK L +D+ +I L+ ++LL + +EE K+ KY
Sbjct: 477 NAEINQ-TKDAIEKLKGTTEDKCPVCQSNIDGLKKQELLKQYNQLIEERKQKSNKLQIKY 535
Query: 61 QDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAF 120
Y + + + + K K + ++ +K N +EE+ LI+ + DL + G +L +
Sbjct: 536 NKYLSEKKDIKDKLDKINNLKNKYGQLKEKNNNLIEEENKLIKLNNDLNV--IGKDLEEY 593
Query: 121 NTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTD 180
N +I ++ NKL + +QY+ + + NE + L+ + + ++D
Sbjct: 594 NKII-------ENIKTKENKLKELEQYYKKYEYCENFLKDSNE--QELVDNKNKLLDIIG 644
Query: 181 QKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEMIETNNIDME 229
K I+ + L++ V E + LN + KE+ M + N ++ E
Sbjct: 645 NNTNKSIL-----NTKKELNNKVGELNELLNLIRNKEQNMKKLNVVNKE 688
Score = 35.1 bits (77), Expect = 5.3
Identities = 45/206 (21%), Positives = 91/206 (44%), Gaps = 9/206 (4%)
Query: 23 KDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQ---SEYRRKVMLKEKF 79
K+ KN + + LE + + E LS K D K+Q +EY+ K LK+
Sbjct: 382 KELNKNKENYLKYLELSKKSKELNNKLME-LSGIKEKENDLKQQIKSTEYKIK-QLKQDL 439
Query: 80 AKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRN 139
I + E +++ ++ I N DL +++K +A I + + +
Sbjct: 440 KDFNNIDIEINKEKEIKTKYEDIVNKIDL-LKEKIAQNNAEINQTKDAIEKLKGTTEDKC 498
Query: 140 KLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNL 199
+CQS D L + ++ YN+ E + + + + ++ ++ + D + NL
Sbjct: 499 PVCQSN--IDGLKKQELLKQ-YNQLIEERKQKSNKLQIKYNKYLSEKKDIKDKLDKINNL 555
Query: 200 HHDVQEAVDKLNNVIKKEEEMIETNN 225
+ + +K NN+I++E ++I+ NN
Sbjct: 556 KNKYGQLKEKNNNLIEEENKLIKLNN 581
>UniRef50_A0E3J8 Cluster: Chromosome undetermined scaffold_76, whole
genome shotgun sequence; n=2; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_76,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 827
Score = 40.3 bits (90), Expect = 0.14
Identities = 37/145 (25%), Positives = 73/145 (50%), Gaps = 14/145 (9%)
Query: 19 NWDIKDRIKN-NDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKE 77
N D+++++ + D+ AL KD L +D E+ L++ + QD + E + L+
Sbjct: 414 NDDLRNQLNDLQDQLTEALLDKDYLQKSLKDQEDELNRVNDQIQDLNNEKEQAQAAALEA 473
Query: 78 K-----FAKQQAIKMSKKTE-----NDLEEQFGLIRNS-EDLRIRDKGYNLHAFNTLISQ 126
K A ++A + + K + NDLE++ + + EDL + + L+ LI
Sbjct: 474 KQQLQDIADEKAQEDADKEKDQDRLNDLEDKVAELEDQIEDLE-KTRNRLLNQIQELID- 531
Query: 127 RIGDHRDLPDTRNKLCQSQQYFDEL 151
++ D R+L + +KLC Q++ ++L
Sbjct: 532 KLHDERELCEYYHKLCSDQEHQNKL 556
Score = 35.9 bits (79), Expect = 3.0
Identities = 17/53 (32%), Positives = 31/53 (58%)
Query: 177 DRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEMIETNNIDME 229
DR DQK E +L D + L +++A DK NN +++E++I+ + D++
Sbjct: 132 DRDDQKAFYEGLLADKDDLIAELRRQLKDADDKFNNYRREKEQIIKEKDYDIK 184
>UniRef50_P62134 Cluster: DNA double-strand break repair rad50
ATPase; n=3; Methanococcus maripaludis|Rep: DNA
double-strand break repair rad50 ATPase - Methanococcus
maripaludis
Length = 993
Score = 40.3 bits (90), Expect = 0.14
Identities = 47/204 (23%), Positives = 84/204 (41%), Gaps = 17/204 (8%)
Query: 60 YQDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHA 119
Y+ Y + +LKE ++++K +KK ++L+E L N E + I DK
Sbjct: 309 YESYNKLKTIEES-LLKELGVLKESLKDNKKNPDELKEN--LKENDEKILILDKIKEKIK 365
Query: 120 FNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSI---- 175
I ++I + + T L S + +D+ + + YE L++ +
Sbjct: 366 ELEFIEKQIYEIKIHKKTVETLFDSVKIYDDSIKTFEELKTKKNSYENLLKEKFDLEKKL 425
Query: 176 MDRTDQKH--IKEIILVDDYSDLYNL-------HHDVQEAVDKLNNVI-KKEEEMIETNN 225
+ TD+K I E+ + + NL + D+ E +DKLN ++ KKE ++ E N
Sbjct: 426 QNETDEKTKLISELTDFEKIEEKINLENELKEKYEDLSEKIDKLNEIVLKKESKISEYKN 485
Query: 226 IDMEXXXXXXXXXXXXKKSTENSE 249
E K TE +
Sbjct: 486 SKAELEKTKDSCHVCQSKITEEKK 509
>UniRef50_Q1WVM5 Cluster: N-acetylglucosaminyltransferase; n=1;
Lactobacillus salivarius subsp. salivarius UCC118|Rep:
N-acetylglucosaminyltransferase - Lactobacillus
salivarius subsp. salivarius (strain UCC118)
Length = 338
Score = 39.9 bits (89), Expect = 0.19
Identities = 27/115 (23%), Positives = 52/115 (45%), Gaps = 8/115 (6%)
Query: 196 LYNLHHDVQEAVDKLNNVIKKEEEMIETNNIDMEXXXXXXXXXXXXKKSTENSEVKNNVF 255
+YN+ + ++ + ++ K +E ++E ID K E + KN
Sbjct: 12 MYNVENSIERLIISISKAFKNQESLVEVLAID-------DGSTDKTVKIFEKLQKKNQAL 64
Query: 256 NIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVD 310
++L+K S G+ +AR G S G + F+DS E+ + + L +L+Q +
Sbjct: 65 ALKLIKNS-HGGVSKARNTGIKYSTGQYVTFVDSDDELTLVDVVDLKGKLAQNAE 118
>UniRef50_Q55EZ8 Cluster: Putative uncharacterized protein; n=1;
Dictyostelium discoideum AX4|Rep: Putative
uncharacterized protein - Dictyostelium discoideum AX4
Length = 568
Score = 39.9 bits (89), Expect = 0.19
Identities = 45/197 (22%), Positives = 88/197 (44%), Gaps = 15/197 (7%)
Query: 39 KDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQ 98
+DL + D +DYK + + + + KEK + K + E + EE+
Sbjct: 127 QDLFKEMIPDPSSNFEDLFGSDEDYKESLKQQGEEIEKEK----EKEKDTNNEEEEEEEE 182
Query: 99 FGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIII 158
I NS +++ G+ ++ +S D ++ D K+ Q + EL R ++
Sbjct: 183 EERIDNSNI--VKEGGFGSMSYAIDVSIDPEDTKENLD-EIKVKQLSELVTELERYQNLV 239
Query: 159 CFYNEHYETLMR-SVHSIMDRTDQKHIKEII-LVDDYSDLYNLHHDVQ----EAVDKLNN 212
N+ Y TL++ ++++ D H++ +I L + +D+ D+ + +D NN
Sbjct: 240 V--NDWYSTLIKININTQSDTEYSSHLRHVITLQQNINDIKAKSKDIDVLPFKNIDNQNN 297
Query: 213 VIKKEEEMIETNNIDME 229
+ EEE+ E I+ME
Sbjct: 298 NQENEEEIDEFEGIEME 314
>UniRef50_Q54SR9 Cluster: Leucine-rich repeat-containing protein;
n=1; Dictyostelium discoideum AX4|Rep: Leucine-rich
repeat-containing protein - Dictyostelium discoideum AX4
Length = 874
Score = 39.9 bits (89), Expect = 0.19
Identities = 40/148 (27%), Positives = 69/148 (46%), Gaps = 14/148 (9%)
Query: 85 IKMSKKTENDLEEQFGLIRNS---EDLRIRDKGYN----LHAFNTLISQRIGDHRDLPDT 137
I ++ ++EE I+NS E L I + N L+ NTL H DL +
Sbjct: 426 ISNNRLERTEIEELIAFIKNSRALESLNISNCSLNHDYFLYICNTLNKNEYIKHLDLNVS 485
Query: 138 RNKLCQSQQYF-DELPRASIIICFYNEH----YETLMRSVHSIMDRTDQKHIKEIILVDD 192
N++ +S F L +++ H Y+TL+R V SI R K+I+E+IL
Sbjct: 486 HNQISKSPSIFVSALEFLTMVNSLNLSHIPITYKTLIRVVDSI--RIYCKNIRELILDAC 543
Query: 193 YSDLYNLHHDVQEAVDKLNNVIKKEEEM 220
+S + HD + +++L +++ E +
Sbjct: 544 FSHSNDKSHDGIDFINELKQFLRERESL 571
>UniRef50_A2DD37 Cluster: Viral A-type inclusion protein, putative;
n=1; Trichomonas vaginalis G3|Rep: Viral A-type inclusion
protein, putative - Trichomonas vaginalis G3
Length = 1553
Score = 39.5 bits (88), Expect = 0.25
Identities = 52/266 (19%), Positives = 109/266 (40%), Gaps = 10/266 (3%)
Query: 27 KNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEK----FAKQ 82
K ND+ A + K L +Q + +++ L+K ++ + + + + KEK ++
Sbjct: 800 KFNDERQEAAKTKSDLQNQIQQLKDALAKAESNQKETQNKLDISNSDLEKEKDKSKSLEE 859
Query: 83 QAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLC 142
+ + K + EE+ L + E+ R + N + L S+ ++RDL + N+L
Sbjct: 860 ELAALKSKLQQVQEEKANLESDLENERQNNSSSNAELSDKL-SKLQQENRDLVNQINQLQ 918
Query: 143 QS-QQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHH 201
+Q E+ + S + N + L ++ + + D+ K LVDD L
Sbjct: 919 NDLKQKESEIQKVSSDLDNLNNVIQDLESQMNDMQGKNDELSKKLSNLVDDNERKDKLID 978
Query: 202 DVQEAVDKLNN---VIKKEEEMIETNNIDM-EXXXXXXXXXXXXKKSTENSEVKNNVFNI 257
D+ + LNN + + E+ +D+ ++S + KNN +
Sbjct: 979 DLNSQLSNLNNEKDSLTNKLSETESEKLDLANQNEKLLKVIEDLQRSLSEEKDKNNSSLL 1038
Query: 258 RLLKTSKREGLIRARLYGADNSVGDV 283
L K L++ ++ + V ++
Sbjct: 1039 SLGDFGKENALLKEKVADLEKQVSNL 1064
>UniRef50_A2SR79 Cluster: Glycosyl transferase, family 2; n=1;
Methanocorpusculum labreanum Z|Rep: Glycosyl
transferase, family 2 - Methanocorpusculum labreanum
(strain ATCC 43576 / DSM 4855 / Z)
Length = 376
Score = 39.5 bits (88), Expect = 0.25
Identities = 20/45 (44%), Positives = 26/45 (57%), Gaps = 1/45 (2%)
Query: 151 LPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSD 195
LP SI+IC YNE T+ R + SI T + E++LV D SD
Sbjct: 44 LPAISIVICAYNEE-RTIARKIQSISSCTYPNELMEVVLVIDCSD 87
>UniRef50_A0YT83 Cluster: Putative uncharacterized protein; n=1;
Lyngbya sp. PCC 8106|Rep: Putative uncharacterized
protein - Lyngbya sp. PCC 8106
Length = 342
Score = 39.1 bits (87), Expect = 0.32
Identities = 20/52 (38%), Positives = 30/52 (57%), Gaps = 3/52 (5%)
Query: 262 TSKRE---GLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVD 310
T KRE G A+L GA+ + G++++++DS E WL +L LSQ D
Sbjct: 73 TVKREPGIGYHEAKLLGAELATGEIVIYMDSDCEYEPQWLSSILTTLSQNYD 124
>UniRef50_Q8MNV4 Cluster: Putative uncharacterized protein; n=2;
Caenorhabditis elegans|Rep: Putative uncharacterized
protein - Caenorhabditis elegans
Length = 1046
Score = 39.1 bits (87), Expect = 0.32
Identities = 50/211 (23%), Positives = 95/211 (45%), Gaps = 16/211 (7%)
Query: 21 DIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDY----KRQSEYRRKVMLK 76
D+K+RI + K + +LL D+ R EE + K ++ K+Q R +
Sbjct: 323 DMKERIITSKKDD---DSNNLLQDELRRTEEKYQQAQKKIENLDETIKQQETQIRDLGRS 379
Query: 77 EKFAKQQAIKMSKKTENDLEEQFG--LIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDL 134
AK+Q KMS++ +N+ + G R+ E+ +++ L + + Q++ +L
Sbjct: 380 LDEAKRQLQKMSEQRQNEEVARQGEDSARSMEEKATKEEIKKLKS-QVQLQQQLEQDLEL 438
Query: 135 PDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYS 194
R + Q+ E +AS+ + + TLM S++S+ + Q + L +
Sbjct: 439 QKKRVQELTEQRKVLE-SKASVA-----DEFGTLMSSLNSLREENRQYEEETRSLQTNIR 492
Query: 195 DLYNLHHDVQEAVDKLNNVIKKEEEMIETNN 225
L + + Q+A+ + N +K EE IE N
Sbjct: 493 TLQDEVYQHQDAITEWKNRAEKAEEYIEKEN 523
>UniRef50_Q55FF7 Cluster: Putative uncharacterized protein; n=1;
Dictyostelium discoideum AX4|Rep: Putative
uncharacterized protein - Dictyostelium discoideum AX4
Length = 898
Score = 39.1 bits (87), Expect = 0.32
Identities = 28/119 (23%), Positives = 57/119 (47%), Gaps = 1/119 (0%)
Query: 23 KDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQ 82
KD+ K+ ++ + K+ + D+ ++ E+ K K +D ++ + R K KEK +
Sbjct: 124 KDKEKDKEREREKEKEKEKVKDREKEKEKEKEKEKEKVKDREKVKD-REKEKEKEKERDK 182
Query: 83 QAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKL 141
K SK E D+E++ R E +IRD+ + ++ N +I + + T+ +
Sbjct: 183 LKPKDSKIKERDIEKEKVRDREKEREKIRDREKDKNSNNNIIKPKEKKDESIAKTQKNI 241
>UniRef50_Q9V2L6 Cluster: Dpm1 dolichol-phosphate
mannosyltransferase; n=4; Thermococcaceae|Rep: Dpm1
dolichol-phosphate mannosyltransferase - Pyrococcus
abyssi
Length = 362
Score = 39.1 bits (87), Expect = 0.32
Identities = 19/66 (28%), Positives = 38/66 (57%), Gaps = 1/66 (1%)
Query: 252 NNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVD- 310
++V+ +++++ +GL A + G + GDV V +D+ ++ +P LLKR+ +G D
Sbjct: 64 SSVYPVKVIRRINEKGLSSAVIRGFKEASGDVFVVMDADLQHPPEVIPELLKRIKEGADL 123
Query: 311 GVKVRY 316
+ RY
Sbjct: 124 AIASRY 129
>UniRef50_Q5LW10 Cluster: Diguanylate cyclase, putative; n=3;
Rhodobacterales|Rep: Diguanylate cyclase, putative -
Silicibacter pomeroyi
Length = 513
Score = 38.7 bits (86), Expect = 0.43
Identities = 22/59 (37%), Positives = 34/59 (57%), Gaps = 2/59 (3%)
Query: 150 ELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVD 208
+LPR S+ + +E + +H++ TD+K II +DDY+DL + HH QEA D
Sbjct: 68 QLPRDSVTGLMLKDGFEGALTHIHAMAAETDRKSACFIIELDDYADLVD-HHG-QEAGD 124
>UniRef50_Q7NBF8 Cluster: Putative uncharacterized protein; n=1;
Mycoplasma gallisepticum|Rep: Putative uncharacterized
protein - Mycoplasma gallisepticum
Length = 1931
Score = 38.3 bits (85), Expect = 0.57
Identities = 27/105 (25%), Positives = 55/105 (52%), Gaps = 6/105 (5%)
Query: 14 HYRPLNWDIK-DRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWK-YQDYKRQSEYRR 71
HY+ L IK ++ K N++ +LL+DQ ++ K Y YK+Q + ++
Sbjct: 1317 HYQRLERAIKNEQHKLNNQKNNFFNKVELLNDQLNKKSSKIALLRSKIYNTYKQQQQ-QK 1375
Query: 72 KVMLKEKFAKQQAIKMSKKTENDLEE---QFGLIRNSEDLRIRDK 113
+++L+EK Q K KT+ +L + QF + + E+ +++++
Sbjct: 1376 QILLEEKHKNSQLRKSLLKTQEELHQQKAQFSIAKKQEEKKLKNQ 1420
>UniRef50_A5HZ17 Cluster: Exonuclease; n=4; Clostridium
botulinum|Rep: Exonuclease - Clostridium botulinum A
str. ATCC 3502
Length = 1176
Score = 38.3 bits (85), Expect = 0.57
Identities = 44/197 (22%), Positives = 95/197 (48%), Gaps = 14/197 (7%)
Query: 37 EGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKE-KFAKQQAIKMSKKTEND- 94
E +++++D+PRD+++ + + + ++ R V+L + KF+ + +K+S K+ D
Sbjct: 115 EEEEIIADKPRDIQKNIESIIGLTAE-----DFTRSVVLPQGKFS--EFLKLSGKSRRDM 167
Query: 95 LEEQFGLIRNSEDL--RIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELP 152
LE FGL + + L RIR + TLI R+ ++D+ ++ KL + + ++ L
Sbjct: 168 LERIFGLEKYGKKLLERIRKARNKEISSLTLIEGRLEQYKDI--SKEKLQELKIQYENLL 225
Query: 153 RASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNN 212
+ I E L +I + ++ +I L + L N+ + + V+K N
Sbjct: 226 KEKSKIAKEKEESNKLYEKYKNIWELQEELNIYLNKLENLKKGLLNI-EEKRIKVEKGKN 284
Query: 213 VIKKEEEMIETNNIDME 229
+ + + E + I+ +
Sbjct: 285 ALSVKPYIDELSKIESD 301
>UniRef50_A0L591 Cluster: Glycosyl transferase, family 2; n=1;
Magnetococcus sp. MC-1|Rep: Glycosyl transferase, family
2 - Magnetococcus sp. (strain MC-1)
Length = 332
Score = 38.3 bits (85), Expect = 0.57
Identities = 20/63 (31%), Positives = 34/63 (53%)
Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
I++L S+R G+ L G SVGD +++LDS ++ +P +L++ G D V
Sbjct: 66 IKILNMSRRFGVYECMLAGMIASVGDAVIYLDSDLQDPPELIPQMLEQWRNGADIVHTTR 125
Query: 317 SAR 319
+ R
Sbjct: 126 TER 128
>UniRef50_Q8IBY8 Cluster: Putative uncharacterized protein PF07_0042;
n=1; Plasmodium falciparum 3D7|Rep: Putative
uncharacterized protein PF07_0042 - Plasmodium falciparum
(isolate 3D7)
Length = 2910
Score = 38.3 bits (85), Expect = 0.57
Identities = 63/294 (21%), Positives = 135/294 (45%), Gaps = 17/294 (5%)
Query: 5 QIVNKATKVHYRPLNWDIK-DRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDY 63
QI+ K + LN D++ +R +NND I L+ + ++Q D E K + Q+
Sbjct: 1471 QILYDDGKNNISQLNIDLENERTRNNDLKIL-LDQEKKKNEQINDDLENERKRNNQLQNI 1529
Query: 64 KRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTL 123
+ E ++K L + +Q+ I + + EN+L++Q I + + ++ + + NT
Sbjct: 1530 LNE-EQKKKEQLNVSYEEQKNI--NHQLENELQKQ--RITYKKMIAKFERKFLMK--NTN 1582
Query: 124 ISQRIGDHRDLPDTRNKLCQSQQYFDE---LPRASIIICFYNEHYETLMRSVHSIMDRTD 180
+Q+I D + + DT+ K+ +Q+ D + ++ + +EH + D T
Sbjct: 1583 DTQKIKDTQQIIDTQ-KIIDTQKIIDTQKIIDTSNNVNQMNDEHKHVDQMNDAESEDNTF 1641
Query: 181 QKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEMIETNNIDMEXXXXXXXXXXX 240
+ E + + + L D ++ +D+LN ++KE+E+ + I ME
Sbjct: 1642 LELQLEKVKQVNIDMIIQLKKD-KKRIDELNLELEKEKEVNDKIIIQME--EYKMKIEHI 1698
Query: 241 XKKSTENSEVKNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVN 294
++ + E+ +N+ NIR+ K +++ + +L + + LD ++N
Sbjct: 1699 NEELEKEKEINHNL-NIRIEKDNEKNEQLNIQLDTEKKMNNQMSIELDEEKKMN 1751
>UniRef50_Q86KX8 Cluster: Similar to Dictyostelium discoideum (Slime
mold). Interaptin; n=2; Dictyostelium discoideum|Rep:
Similar to Dictyostelium discoideum (Slime mold).
Interaptin - Dictyostelium discoideum (Slime mold)
Length = 1781
Score = 38.3 bits (85), Expect = 0.57
Identities = 48/231 (20%), Positives = 95/231 (41%), Gaps = 15/231 (6%)
Query: 9 KATKVHYRPLNWDIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSE 68
K + H R N IK ++SI+ LE L S Q + + ++QD K QS
Sbjct: 1402 KESNTHLRKENDKDTLVIKQLEQSISQLE--HLHSQQTENYLKERELIQQQHQDEK-QSS 1458
Query: 69 YRRKVMLKEKFAKQQA--------IKMSKKTENDLEEQFGL----IRNSEDLRIRDKGYN 116
+ LK KF ++Q + SK+ N L+++F L I+ +D + N
Sbjct: 1459 IQSTHQLKSKFDEKQQQYDESLEKLSQSKQELNKLKQEFDLNILVIQKLQDDKQSQSDSN 1518
Query: 117 LHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIM 176
L + L Q++ + + Q + ++ I + + + S+H +
Sbjct: 1519 LQLKSNLEEQQLQNQESIEKISTLQQQVNHLQQQFNINTLEIQKLQDEKQLSIESIHQLK 1578
Query: 177 DRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEMIETNNID 227
+ D+K + ++ +DL +Q+ ++ N ++ +E I T ++
Sbjct: 1579 SKFDEKQQQYNESIEKSNDLQKQSDQLQQKLENSTNENQQLQEKISTIQLE 1629
>UniRef50_Q22W40 Cluster: Putative uncharacterized protein; n=1;
Tetrahymena thermophila SB210|Rep: Putative
uncharacterized protein - Tetrahymena thermophila SB210
Length = 970
Score = 38.3 bits (85), Expect = 0.57
Identities = 47/222 (21%), Positives = 97/222 (43%), Gaps = 15/222 (6%)
Query: 9 KATKVHYRPLNWDIKDRIKNNDKSIAALEGK-DLLSDQPRDVEETLSKTMWKYQDYKRQS 67
K ++ + + I IK N+ L+ K D RD + + WK + Y+ +
Sbjct: 16 KISQEKWEEIQEKIGSLIKTNETLNQMLKLKTDEYEILKRDNQRNVENENWKKKYYQLEQ 75
Query: 68 EYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKG---YNLHAFNTLI 124
E ++ ++++ A I + + + ++ E+ I N +L ++DK +L N +
Sbjct: 76 EMQKIAQIEQEVADYLPIMRNFEEKANMNEK--QIENL-NLILKDKENLIQDLSKANAQL 132
Query: 125 SQRIGDHRDLPDT----RNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTD 180
Q+ + L +N Q Q+ F+ L S+ + +EH L ++ S+MD+ +
Sbjct: 133 EQQCHSQQQLAQELSFFKNLSEQKQKSFENLKLNSVSL---DEHKRVLKENI-SLMDKLE 188
Query: 181 QKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEMIE 222
+K + L + + +Q+ + +NN+ KE E E
Sbjct: 189 KKEKEYQKLTVGLQQVNDYEQKMQQMENAVNNLRNKERENYE 230
>UniRef50_A4YEH2 Cluster: Glycosyl transferase, family 2; n=1;
Metallosphaera sedula DSM 5348|Rep: Glycosyl
transferase, family 2 - Metallosphaera sedula DSM 5348
Length = 421
Score = 38.3 bits (85), Expect = 0.57
Identities = 24/76 (31%), Positives = 38/76 (50%), Gaps = 1/76 (1%)
Query: 274 YGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRYSARAVTPVIDVINADTF 333
YG S G++LVFLD+ V+ L + LSQ + + +R R + V+ ++
Sbjct: 115 YGVSLSTGEILVFLDAEARVDPTILTRISAHLSQ-AEAMALRLRVRDPKNKLQVLYSEIT 173
Query: 334 EYSPSPLVRGGFNWGL 349
E+S L RG + GL
Sbjct: 174 EFSMDSLFRGRYLKGL 189
>UniRef50_UPI00006CE63C Cluster: hypothetical protein
TTHERM_00709570; n=1; Tetrahymena thermophila SB210|Rep:
hypothetical protein TTHERM_00709570 - Tetrahymena
thermophila SB210
Length = 684
Score = 37.9 bits (84), Expect = 0.75
Identities = 62/293 (21%), Positives = 133/293 (45%), Gaps = 31/293 (10%)
Query: 2 SGNQIVN--KATKVHYRPLNWDI-KDRIKNND-KSIAALEGKDLLSDQPRDVEETLSKTM 57
+GNQI + K Y L D ++++++N+ KS L+ K L ++ + V E + +
Sbjct: 84 TGNQIQDLYDKLKADYTKLMIDFQREQMESNNLKSQNRLDAKKL-NELDKLVAEQQQEIV 142
Query: 58 WKYQDYKRQSEYRRKVMLKEKFAKQQAIKMSKKT-ENDLEEQFGLIRNSEDLRIRDKGY- 115
+ + Q +L++ + ++Q +++K T EN Q + ++ L +++K Y
Sbjct: 143 TLNNEIQLQISQNN--VLRD-YLREQENRVTKITAENTTLNQE--LSSTRPLSLKNKSYT 197
Query: 116 NLHAFNTLISQRIGDHRDLPDTRNKLCQS-----QQYFDELPRASIIICFYNEHYETLMR 170
++ N L ++I + ++ + +N Q QQ ++L + C H + + R
Sbjct: 198 SVLQQNDLKYEQILESKECLERQNDQLQQHNSRLQQLHEQLNHVKMQFCTLENHSQQIQR 257
Query: 171 SVHSIMDRTDQKHIKEIILV-------DDYSDLYNLHHDVQEAVDKLNNVIKKEEEMIE- 222
++ + + Q I E +L +DY ++ N++ ++QE +L +I+K E+ I+
Sbjct: 258 NLKQLELESSQSKINESLLQKSYQKLEEDYREMQNMNINLQEHNKQLGILIQKLEDKIKI 317
Query: 223 -----TNNIDMEXXXXXXXXXXXXKKSTEN-SEVKNNVFNIRLLKTSKREGLI 269
N + E KS N E++N + +++ +RE I
Sbjct: 318 MEEQCNNTVFWEQTAKDFELQEEYFKSNSNIKEMENYMQKLKIQMKEERERYI 370
>UniRef50_Q6MF69 Cluster: Putative uncharacterized protein; n=2;
Candidatus Protochlamydia amoebophila UWE25|Rep:
Putative uncharacterized protein - Protochlamydia
amoebophila (strain UWE25)
Length = 449
Score = 37.9 bits (84), Expect = 0.75
Identities = 21/95 (22%), Positives = 52/95 (54%), Gaps = 4/95 (4%)
Query: 23 KDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQ 82
KD I++ KS+ ++ + + ++ +++++L ++++++ E ++ +K KQ
Sbjct: 330 KDYIQDTLKSLVKVQNESIHIEEKNEIQKSLKILKEDHEEFQKAIEEDLAILAAKKGKKQ 389
Query: 83 QAI----KMSKKTENDLEEQFGLIRNSEDLRIRDK 113
+ K+ K ENDLEE+ + R+ E + + +K
Sbjct: 390 RGEGDIEKLEAKLENDLEEKGNVERDLEAMHLVEK 424
>UniRef50_Q1Q6W0 Cluster: Putative uncharacterized protein; n=1;
Candidatus Kuenenia stuttgartiensis|Rep: Putative
uncharacterized protein - Candidatus Kuenenia
stuttgartiensis
Length = 302
Score = 37.9 bits (84), Expect = 0.75
Identities = 24/70 (34%), Positives = 35/70 (50%), Gaps = 1/70 (1%)
Query: 244 STENSEVKNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLK 303
ST+NS F ++LL +G AR G N+ G++L F DS V+ WL +K
Sbjct: 44 STDNSLQSIKAFPVKLLIEKDVKGSYAARNLGVKNAEGEILAFTDSDCVVDKYWLCNAIK 103
Query: 304 RL-SQGVDGV 312
++ V GV
Sbjct: 104 YFAAEDVGGV 113
>UniRef50_A1AUA8 Cluster: Glycosyl transferase, family 2; n=1;
Pelobacter propionicus DSM 2379|Rep: Glycosyl
transferase, family 2 - Pelobacter propionicus (strain
DSM 2379)
Length = 1268
Score = 37.9 bits (84), Expect = 0.75
Identities = 18/51 (35%), Positives = 28/51 (54%)
Query: 256 NIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLS 306
+ R+++ G A GAD + G +L+FL++ EV GW PPL +S
Sbjct: 64 HFRIVRNDYAHGFAAACNRGADVARGHLLLFLNNDTEVQPGWFPPLYALIS 114
>UniRef50_Q23A50 Cluster: Putative uncharacterized protein; n=1;
Tetrahymena thermophila SB210|Rep: Putative
uncharacterized protein - Tetrahymena thermophila SB210
Length = 1670
Score = 37.9 bits (84), Expect = 0.75
Identities = 27/115 (23%), Positives = 54/115 (46%), Gaps = 3/115 (2%)
Query: 64 KRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTL 123
K++S ++K++ K+ + + + K SK+ ++ LI N + K +F TL
Sbjct: 107 KQESNQKKKIINKQNYYQIEQSKASKQAYK-IDTFKQLIINQRQIFNSSKIEKKQSFGTL 165
Query: 124 ISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDR 178
+++R G + T L QSQQ ++ + + FY H E V +++ +
Sbjct: 166 VNERKGFSNEKAQT--PLWQSQQKYNFASKEGFLSEFYQTHSELKQGRVQALLQK 218
>UniRef50_A2FSZ8 Cluster: Viral A-type inclusion protein, putative;
n=1; Trichomonas vaginalis G3|Rep: Viral A-type inclusion
protein, putative - Trichomonas vaginalis G3
Length = 4045
Score = 37.9 bits (84), Expect = 0.75
Identities = 47/223 (21%), Positives = 99/223 (44%), Gaps = 21/223 (9%)
Query: 4 NQIVNKATKVHYRPLNWDIKDRIKNNDKSIAALEGK-DLLSDQPRDVEETLSKTMWKYQD 62
N I N+ +V+ + N D+K+ + + I A+ + +S + D+++ SK+ YQD
Sbjct: 1710 NSINNELRRVNSQ--NNDLKELLAKKESEINAINNELKRISSENNDLKDINSKSENNYQD 1767
Query: 63 YKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNT 122
Q + + + + K Q+ +K S + +N L+ DL I +K + + +
Sbjct: 1768 ---QLKNLKNQLTQLKNENQKLMKSSTEEKNKLK----------DL-INEKNIQIQSLQS 1813
Query: 123 LISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQK 182
+ + + NKL Q+ DE + ++ NE + + S + + DQK
Sbjct: 1814 KNEDLVNNQSKI---NNKLESIQKDLDEKENQNSVLISENEKLQNELMSSKTEIQTLDQK 1870
Query: 183 HIK-EIILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEMIETN 224
+ L + + +L + + +KLNN+ + E++ + N
Sbjct: 1871 ETEFNDKLREMERNNRSLSSQINDLKEKLNNLTETNEKISDEN 1913
>UniRef50_A2E4S4 Cluster: Viral A-type inclusion protein, putative;
n=1; Trichomonas vaginalis G3|Rep: Viral A-type inclusion
protein, putative - Trichomonas vaginalis G3
Length = 1795
Score = 37.9 bits (84), Expect = 0.75
Identities = 48/212 (22%), Positives = 93/212 (43%), Gaps = 18/212 (8%)
Query: 9 KATKVHYRPLNWDIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSE 68
K ++ + N D+I+++ K I + GK + D ++ ++L + + K ++
Sbjct: 1012 KENEIQEKMENLKKMDQIQSSQKIICGIVGKQTIDDAINEI-KSLKEQIQKLKNEISVKS 1070
Query: 69 YRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIR-NSEDLRIRDKGYNLHAFNTLISQR 127
+ +KEK +A TE DL + ++ +E+L+ DK FN+ IS
Sbjct: 1071 DKILNDIKEKMKLPEA-----TTEQDLVDAIAVMEIENEELKENDKKLR-EIFNSDIS-- 1122
Query: 128 IGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEI 187
+ L + N L Q + L + ++ + + L + I+D D+ I E+
Sbjct: 1123 VDTLEILKEVEN-LKQKEDQLKSLVESENLV----DEIQKLNENQQRILDECDKSDISEV 1177
Query: 188 ILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEE 219
I ++ DL L DV+ N+V + +EE
Sbjct: 1178 I--EEIKDLKKLQQDVENCF-PTNDVSQIKEE 1206
>UniRef50_A2DKP8 Cluster: Viral A-type inclusion protein, putative;
n=1; Trichomonas vaginalis G3|Rep: Viral A-type inclusion
protein, putative - Trichomonas vaginalis G3
Length = 1618
Score = 37.9 bits (84), Expect = 0.75
Identities = 39/239 (16%), Positives = 98/239 (41%), Gaps = 13/239 (5%)
Query: 37 EGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLE 96
+ DL+++ + +EE ++ + +Y++Q E + ++ + Q+ K ++ + E
Sbjct: 1028 QNNDLIANYKKQIEELSKQSNEEVVNYQKQVEDLKNKLIDLQQNNQEIAKYQQQIDELNE 1087
Query: 97 EQFGLIRNSEDL--RIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRA 154
E+ + +L ++ ++ + I +DL + ++ + Q D+L +
Sbjct: 1088 EKSNSEKQINELNQKLNQNNEEINKYQKQIEDLNQKLKDLQENNQEIAKYQNEVDDLKKK 1147
Query: 155 SIIICFYNEHYETLMRSVHSIMDRTDQKHIKEII-----LVDDYSDLYNLHHDVQEAVDK 209
+ NE + + M + +Q ++K+I L++ S++ NL+ + +
Sbjct: 1148 FDV---SNEEIANKEKEIEE-MKKKEQNYLKQISELNNHLMEKQSEIVNLNSKLDNQIYN 1203
Query: 210 LNNVIKKEEEMIETNNIDMEXXXXXXXXXXXXKKSTENSEVKNNVFNIRLLKTSKREGL 268
LN KK+ + N++ + K++ + N I L +K L
Sbjct: 1204 LNT--KKQNLEMNLNDLQTKLKQIEQENANLSKRNKDLENESQNQAKITLETQNKNVDL 1260
>UniRef50_Q2S1Y8 Cluster: Glycosyl transferase, group 2 family
protein; n=1; Salinibacter ruber DSM 13855|Rep: Glycosyl
transferase, group 2 family protein - Salinibacter ruber
(strain DSM 13855)
Length = 391
Score = 37.5 bits (83), Expect = 0.99
Identities = 43/176 (24%), Positives = 69/176 (39%), Gaps = 14/176 (7%)
Query: 281 GDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRYSARAVTP-VIDVINADTFEYSPSP 339
G +V L++ +EV GWL PL V+ R AV P ++ + FEY+
Sbjct: 126 GRFVVLLNNDVEVPPGWLHPL-------VEAAAGRPDVAAVQPKLLQYDDRGRFEYAGGA 178
Query: 340 LVRGGF--NWGLHFKWDNLPKGTLINDEDFMKPLKSPTMAGGLFAIYREYFNAIGKYDPG 397
GGF G F L + + + P G + R + +G D
Sbjct: 179 ---GGFLDRAGYPFTRGRLFETMERDRGQYDDPRDVFWATGAALLLRRSALDEVGPLDER 235
Query: 398 MNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRRPYGVGEKQDYMLQNSMRM 453
+ E +++ +R+W G + + P S V H+ P K Y +NS+ M
Sbjct: 236 FEMHM-EEIDLCWRLWRHGYRVRVAPESTVYHIGGASLPQSSPRKTYYNYRNSLLM 290
>UniRef50_A2FD36 Cluster: Viral A-type inclusion protein, putative;
n=1; Trichomonas vaginalis G3|Rep: Viral A-type inclusion
protein, putative - Trichomonas vaginalis G3
Length = 3977
Score = 37.5 bits (83), Expect = 0.99
Identities = 43/214 (20%), Positives = 89/214 (41%), Gaps = 14/214 (6%)
Query: 22 IKDRIKN-NDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFA 80
+K +K NDK+ G D+L + + +S + K +E + K + + K
Sbjct: 2677 LKSSLKELNDKNKELQNGNDILKQENETLTPKISSLESENSSLKSTNEIKDKEIEELKQK 2736
Query: 81 KQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNK 140
+ +++ + E+DL+ + R + + + L I R ++L + +
Sbjct: 2737 LSEISQLNSQHESDLDSR----RKQFEKELEELRNQLEKLQNEIQIREQRGKELSNQNEE 2792
Query: 141 LCQS-QQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQ------KHIKEI--ILVD 191
L + ++ EL A + ++ ETL +S+ DQ K I+E+ L+
Sbjct: 2793 LMNNLEKMKSELNDAKMNKEHSDQENETLKKSLEENQQNYDQLVDELSKEIEELKKQLLT 2852
Query: 192 DYSDLYNLHHDVQEAVDKLNNVIKKEEEMIETNN 225
+ + H++ E K+ N+ + E + TNN
Sbjct: 2853 KAEESNSSKHEIDELQSKIQNLSSENENLKSTNN 2886
Score = 37.1 bits (82), Expect = 1.3
Identities = 53/240 (22%), Positives = 112/240 (46%), Gaps = 27/240 (11%)
Query: 1 MSGNQIVNKATKVHYRPLNWDIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKY 60
+SGN++++ K+ D+ ++I + K ++L+ Q + + + + K
Sbjct: 3108 LSGNELLSNNEKLEQEQS--DLMNQINDLRKK------NEILNQQQANNNQIIKECQEKI 3159
Query: 61 QDYKRQS-EYRRKV---MLKEKFAKQQAIKMSKKTE----NDLEEQFGLIRNSEDLRI-- 110
Q+Y+ + E +RK+ M + AK Q ++ K E ND + L + E L+
Sbjct: 3160 QNYEESNNELQRKLNEAMNNNENAKNQIDQLKKLLEETKQNDDKLVEELTKEIEKLKNEQ 3219
Query: 111 RDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMR 170
+ K N++ + L + + D K +Q++++ + +I + E+L +
Sbjct: 3220 QSKDQNINDLSALNKDKSSLIQQNDDLSKK---TQEFYNSQQNQAQMIEDLKKQNESLQK 3276
Query: 171 SVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEE---MIETNNID 227
++ I + Q++I + L D SDL + HD + ++ LN++IK+ E +IE N +
Sbjct: 3277 NLE-INNNETQQNIDQ--LTKDKSDLASKLHDYEAKINDLNSLIKELNEKNAIIEKKNYE 3333
>UniRef50_A0DTN4 Cluster: Chromosome undetermined scaffold_63, whole
genome shotgun sequence; n=3; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_63,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 566
Score = 37.5 bits (83), Expect = 0.99
Identities = 25/112 (22%), Positives = 57/112 (50%), Gaps = 3/112 (2%)
Query: 39 KDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTEND--LE 96
K+ + ++ RD+ L K M K+++ +++++ +KV +K+K K ++ E D +
Sbjct: 187 KNEMKEKERDLRY-LKKEMIKFEEKQKETQQNQKVTVKKKEVIDIDDKKAEIIEKDQQIS 245
Query: 97 EQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYF 148
+Q +I ++ Y L + N LI + G ++ + R +L + +Y+
Sbjct: 246 QQEQIINQLKEKLQERTNYELRSKNALIDELQGLNQQVQQMREELKELNEYY 297
>UniRef50_A0CKL2 Cluster: Chromosome undetermined scaffold_2, whole
genome shotgun sequence; n=2; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_2,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 521
Score = 37.5 bits (83), Expect = 0.99
Identities = 23/79 (29%), Positives = 44/79 (55%), Gaps = 3/79 (3%)
Query: 27 KNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAK--QQA 84
K K + + ++L +Q ++++ + +Y D +Q E +R+ ++EKFAK Q
Sbjct: 366 KEEKKKLKMHKSQELEKEQVEEIKQKEEQFKKRYSDKLKQFENKRQ-KVEEKFAKKDQYL 424
Query: 85 IKMSKKTENDLEEQFGLIR 103
+ +K ++DLEE+F IR
Sbjct: 425 AEHLQKKKDDLEEKFDKIR 443
>UniRef50_Q6CYG5 Cluster: Similarity; n=2; Kluyveromyces lactis|Rep:
Similarity - Kluyveromyces lactis (Yeast) (Candida
sphaerica)
Length = 1748
Score = 37.5 bits (83), Expect = 0.99
Identities = 52/233 (22%), Positives = 106/233 (45%), Gaps = 25/233 (10%)
Query: 5 QIVNKATKVHYRPLNWDIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYK 64
+++ ++V R + + + ++ K +A +D+ +EE LS T K + +
Sbjct: 94 RLIEGNSQVTKRIIELEQEIEVERQQKELADASKQDIAESLNEKIEE-LSSTKAKLNEAQ 152
Query: 65 -RQSEYRRKVMLKE-KFAKQQAIKMSKKTEN-DLEEQFGLIRNSEDLRIRDKGYNLHAFN 121
E R+KV+ E + QQA+++ K+E +E++ L+R + D N
Sbjct: 153 GANKELRQKVVNTETELQTQQALELRSKSEILRMEQEITLLRENNDWLTNQLNTKTVQLN 212
Query: 122 TLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQ 181
I + L D++ K+ + E+ R S + L +SVHS+ ++ +Q
Sbjct: 213 EFRESTISE---LQDSQLKVSNMESEL-EIARTS---------NQKLKQSVHSLHEQLEQ 259
Query: 182 KHIKEIILVDDYS----DL---YNLHHDVQEAVDKLNNVIKKEEEMIETNNID 227
K + + D+Y+ +L +L + +A++K +KKE + + NN+D
Sbjct: 260 KLSENKEIKDEYNFSKQELTKEMSLKQRMIDALEKHMESLKKEMDATK-NNMD 311
>UniRef50_Q8TXA4 Cluster: Uncharacterized protein; n=2; cellular
organisms|Rep: Uncharacterized protein - Methanopyrus
kandleri
Length = 609
Score = 37.5 bits (83), Expect = 0.99
Identities = 36/116 (31%), Positives = 57/116 (49%), Gaps = 13/116 (11%)
Query: 42 LSDQPRDVEETLSKTMWKYQDYK----RQSEYRRKV-MLKEKFAK-QQAIKMSKKTENDL 95
LSDQ R + E L K KY + K R E ++V LK++ AK Q +K K +DL
Sbjct: 202 LSDQNRRLAENLKKLKEKYNEIKEERDRLKEETKEVGKLKDQLAKLQSKLKEVKSERDDL 261
Query: 96 EEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDEL 151
+ +RN E+ ++R K + L S+ + L D KL +++Q+ +L
Sbjct: 262 ANEVEALRN-ENEKLRKK------IDKLKSELSNLQKKLKDREKKLEKARQHIGKL 310
>UniRef50_UPI00006CD032 Cluster: Micro-fibrillar-associated protein
1 C-terminus containing protein; n=1; Tetrahymena
thermophila SB210|Rep: Micro-fibrillar-associated
protein 1 C-terminus containing protein - Tetrahymena
thermophila SB210
Length = 521
Score = 37.1 bits (82), Expect = 1.3
Identities = 23/97 (23%), Positives = 44/97 (45%), Gaps = 3/97 (3%)
Query: 2 SGNQIVNKATKVHYRPLNWDIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQ 61
+G + N ++ +N DIK +N ++ A LL Q + E+ +K + K Q
Sbjct: 150 TGQSLQNSEIQITTHNINGDIKVESRNEGRAAARAR---LLQRQKEEEEQKKNKQLSKQQ 206
Query: 62 DYKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQ 98
+ + E R+ LKEK + + + S D +++
Sbjct: 207 SEEMEIERRKSQQLKEKANESEKSEQSSSENEDNDDE 243
Score = 35.1 bits (77), Expect = 5.3
Identities = 25/97 (25%), Positives = 48/97 (49%), Gaps = 5/97 (5%)
Query: 28 NNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQQAIKM 87
+ ++S + +GKD ++D D++ + WK ++ KR + R + + +EK +Q +
Sbjct: 335 DEEQSDDSRQGKDFMNDDD-DMDREFEREQWKIRELKRIRKDRDEQIKREKELAEQERRS 393
Query: 88 SKKTENDLEE--QFGLIRNSEDLRI--RDKGYNLHAF 120
E +EE + GL + E +I K Y+ AF
Sbjct: 394 KMTNEEIIEEDKRLGLHQKKEKRQIGFMQKYYHKGAF 430
>UniRef50_UPI00006CB2DA Cluster: Viral A-type inclusion protein repeat
containing protein; n=1; Tetrahymena thermophila
SB210|Rep: Viral A-type inclusion protein repeat
containing protein - Tetrahymena thermophila SB210
Length = 2199
Score = 37.1 bits (82), Expect = 1.3
Identities = 51/233 (21%), Positives = 102/233 (43%), Gaps = 20/233 (8%)
Query: 9 KATKVHYRPLNWDIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSE 68
KA H + N + ++IK+N + I L+ K S+Q ++ + K +YQ+ +Q E
Sbjct: 1622 KAESYHVKIQNQE--EKIKSNAEMIQVLQEKLKTSEQQANLLKQQLKNK-QYQEDDQQRE 1678
Query: 69 YRRKVMLKEKFAKQQAIKMSKKTE--NDLEEQFGLIRNSEDLRIR----DKGYNLHAFNT 122
R+ V A+ ++ + + + E ++ + NS + +I+ +K N+ +
Sbjct: 1679 TRKSVSFLTSQAEMNKYQLDNQKQKWDQQEAEYKIKINSLNAQIQQLIEEKQSNIDMKKS 1738
Query: 123 LISQR----IGDHRDLPDTRNKLCQSQQYFDELPRA----SIIICFYNEHYETLMRSVHS 174
+ +R + + L D + QS++ + L + +I N+ E+L +
Sbjct: 1739 FMKERESVVVDKEKALRDLKQLYAQSRKNEESLEQKISEMEKVILNMNQEIESLRTQLIR 1798
Query: 175 IMDRTDQKHIKEIILVDDYSDLY--NLHHDVQ-EAVDKLNNVIKKEEEMIETN 224
+ +Q Y+DL NL +Q E++ LNN+ K+ M E N
Sbjct: 1799 ANQQIEQMAYARKYEASQYADLRSANLSKALQQESLPTLNNIQKQGSNMGELN 1851
>UniRef50_Q7U947 Cluster: Putative uncharacterized protein; n=1;
Synechococcus sp. WH 8102|Rep: Putative uncharacterized
protein - Synechococcus sp. (strain WH8102)
Length = 614
Score = 37.1 bits (82), Expect = 1.3
Identities = 26/111 (23%), Positives = 48/111 (43%), Gaps = 1/111 (0%)
Query: 344 GFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGG 403
GF +H + L L + P +++ + R+ F ++G + P
Sbjct: 496 GFPANIH-PYKGLSVQELEQRHPHLDPYPVDSLSAAMLLFERDRFLSVGGFHPAFGRGDF 554
Query: 404 ENLEISFRIWMCGGSLELIPCSRVGHVFRKRRPYGVGEKQDYMLQNSMRMA 454
E+LE+S R G L ++P +R+ H+ R+ G E + LQ + +A
Sbjct: 555 EDLELSQRWKQQQGELWMVPTARLMHLERQSMASGADESAAWALQANAWLA 605
>UniRef50_Q319Q2 Cluster: Putative uncharacterized protein; n=1;
Prochlorococcus marinus str. MIT 9312|Rep: Putative
uncharacterized protein - Prochlorococcus marinus
(strain MIT 9312)
Length = 292
Score = 37.1 bits (82), Expect = 1.3
Identities = 17/47 (36%), Positives = 32/47 (68%)
Query: 252 NNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWL 298
N+ I+ +K +KR G+ ++R G +S+G+V++FLDS ++ +G L
Sbjct: 56 NSSIKIKHIKPTKRAGVSKSRNIGIISSIGNVILFLDSDDKLIIGAL 102
>UniRef50_Q9GRG0 Cluster: Tetrin B protein; n=2; Tetrahymena
thermophila|Rep: Tetrin B protein - Tetrahymena
thermophila
Length = 731
Score = 37.1 bits (82), Expect = 1.3
Identities = 28/104 (26%), Positives = 49/104 (47%), Gaps = 4/104 (3%)
Query: 8 NKATKVHYRPLNWDIK-DRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQ 66
NKA + L W + DR+ KSI K+L + Q RD+E S+++ K Q +
Sbjct: 554 NKAKQAEAEQLYWKNQTDRVVRQ-KSIEYQVEKELTNSQLRDLER--SQSIEKIQKLRES 610
Query: 67 SEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRI 110
R + K++ A++ + S+ + E++ IRN + I
Sbjct: 611 DSIRNIIDYKDREAQELRLNQSRAISDLARERYEKIRNQTEAEI 654
>UniRef50_Q8ILK5 Cluster: Putative uncharacterized protein; n=1;
Plasmodium falciparum 3D7|Rep: Putative uncharacterized
protein - Plasmodium falciparum (isolate 3D7)
Length = 966
Score = 37.1 bits (82), Expect = 1.3
Identities = 34/165 (20%), Positives = 72/165 (43%), Gaps = 10/165 (6%)
Query: 70 RRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNS----EDLRIRDKGYNLHAFNTLIS 125
+R V+LKEK K E + EE + ++ +D+ +DKG N FN +
Sbjct: 101 KRLVLLKEKLKYNHYYYAGKLAEKEWEENYDNLKKKSQLFKDVLEKDKGKNFSTFNITKN 160
Query: 126 QRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIK 185
++I + ++ + + + + + +N+ Y S++ D +D ++K
Sbjct: 161 EKICNIKEKAQKKKQNKNQKNLKKKNFKKEHNDISFNDTYTKYSSSLNDFNDISDSLNLK 220
Query: 186 EIILVDDYSDLYNLHHDVQEAVDKLNNV----IKKEEEMIETNNI 226
I+ ++ + YNL + + ++ N+ KKE + NN+
Sbjct: 221 NDIINNE--EEYNLTNSLFQSFPMEQNLPLFKYKKENKQDYDNNV 263
>UniRef50_Q6LF09 Cluster: Putative uncharacterized protein; n=6;
Plasmodium|Rep: Putative uncharacterized protein -
Plasmodium falciparum (isolate 3D7)
Length = 947
Score = 37.1 bits (82), Expect = 1.3
Identities = 48/213 (22%), Positives = 96/213 (45%), Gaps = 29/213 (13%)
Query: 28 NNDKSIAALEGKDLLSDQPRDVEETLSKT--MWKYQDYKRQSEYRRKVMLKEKF---AKQ 82
N DK + + ++L ++ + E L KT + + Q K + +K L +K K+
Sbjct: 347 NKDKLLIE-KNTEILIEERNYINEELIKTQKLLESQINKNKELENKKTNLLDKIDLLEKK 405
Query: 83 QAIKMSKKTEN-----DLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDT 137
Q + K EN DL ++F L+ N ++ + +N + N L + +T
Sbjct: 406 QKDLIKKNNENEQKMDDLNKKFKLLTNENKIKENEILHNNNLINNLNNN---------NT 456
Query: 138 RNKLCQSQQYFD----ELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDY 193
+ K+ Q+++ E + S+ I H ++L + +I++ T + K D
Sbjct: 457 KMKIKLDQEFYKMKMLEKEKKSLSI-----HVKSLTYEIQTILNLTQETQNKFEQQKRDI 511
Query: 194 SDLYNLHHDVQEAVDKLNNVIKKEEEMIETNNI 226
+DL ++ V+K+++VIKK E+ + + I
Sbjct: 512 NDLIIEKEQTKKLVEKIDDVIKKNTEIAKKDKI 544
>UniRef50_Q5DAE6 Cluster: SJCHGC05311 protein; n=1; Schistosoma
japonicum|Rep: SJCHGC05311 protein - Schistosoma
japonicum (Blood fluke)
Length = 320
Score = 37.1 bits (82), Expect = 1.3
Identities = 22/98 (22%), Positives = 45/98 (45%), Gaps = 3/98 (3%)
Query: 124 ISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKH 183
+ Q + D P + +S + +P + +I H + +V + D T Q+H
Sbjct: 111 VHQPVAPFSDFPLPPKQKSESTSQVEPVPNSKPVIVEKESHR---LPTVEEVKDMTRQRH 167
Query: 184 IKEIILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEMI 221
+ ++ D+ +L NL +DV+ V+ +K+ EE +
Sbjct: 168 DQPVLKNIDFRELPNLVNDVESKVEYAKTKLKEFEETL 205
>UniRef50_Q22869 Cluster: Non-muscle myosin heavy chain II; n=3;
Caenorhabditis|Rep: Non-muscle myosin heavy chain II -
Caenorhabditis elegans
Length = 2003
Score = 37.1 bits (82), Expect = 1.3
Identities = 58/271 (21%), Positives = 112/271 (41%), Gaps = 14/271 (5%)
Query: 10 ATKVHYRPLNWDIKDRIKNNDKSIAAL--EGKDLLSDQPRDVEETLSKTMWK-YQDYKRQ 66
A V R D +++I+ K + +L E + L ++ R+V E L K K K +
Sbjct: 1356 AVAVEARDDALDAQEKIEKEVKEVKSLLAEARKKLDEENREVMEELRKKKEKELSAEKER 1415
Query: 67 SEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSE-DLRIRDKGYNLHAFNTLIS 125
++ + K + AK++AI+ ++ + +L + R E +R D+ NTL++
Sbjct: 1416 ADMAEQARDKAERAKKKAIQEAEDVQKELTDVVAATREMERKMRKFDQQLAEERNNTLLA 1475
Query: 126 QRIGD--HRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQ-- 181
Q+ D H+ L D K + +EL I+ + TL + ++ D
Sbjct: 1476 QQERDMAHQMLRDAETK---ALVLSNELSEKKDIVDQLEKDKRTLKLEIDNLASTKDDAG 1532
Query: 182 KHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEMIETNNIDMEXXXXXXXXXXXX 241
K++ E+ D L Q+ ++ L + ++ ++ ++M+
Sbjct: 1533 KNVYELEKTKRRLD-EELSRAEQQIIE-LEDALQLADDARSRVEVNMQAMRSEFERQLAS 1590
Query: 242 KKSTENSEVKNNVFNIRLLKTSKREGLIRAR 272
++ E+ K IR L T + E RAR
Sbjct: 1591 REEDEDDRKKGLTSKIRNL-TEELESEQRAR 1620
>UniRef50_A2EZ87 Cluster: Viral A-type inclusion protein, putative;
n=2; cellular organisms|Rep: Viral A-type inclusion
protein, putative - Trichomonas vaginalis G3
Length = 2271
Score = 37.1 bits (82), Expect = 1.3
Identities = 34/206 (16%), Positives = 92/206 (44%), Gaps = 13/206 (6%)
Query: 21 DIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFA 80
++ ++ +NND+ ++ L + + + L K + D ++ E + K + +
Sbjct: 588 ELTNKSQNNDELQNQIKQ---LKSELENTQNQLQKVTNEKGDKSKEIEEQNKKLKSQIEE 644
Query: 81 KQQAI-KMSKKTENDLE--EQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDT 137
+ Q I K+ + + E EQ + + + ++R++ ++A NT + + + + + D
Sbjct: 645 RDQMISKLQDENQKIAETAEQAAIKSSETNKKLREQFKKVYAENTSLKAK--NEKQVQDL 702
Query: 138 RNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLY 197
+L + ++ + Y + + L + +MD+ + + + L +D ++
Sbjct: 703 MQQLDEKEKQLQSKKDEN-----YKQENDQLKKENQDLMDKLKEIENERVELEEDVKNVT 757
Query: 198 NLHHDVQEAVDKLNNVIKKEEEMIET 223
D++E ++KL + E+ +ET
Sbjct: 758 TEKEDLEEEIEKLKEKVDVLEDQLET 783
>UniRef50_Q75E63 Cluster: ABL193Cp; n=1; Eremothecium gossypii|Rep:
ABL193Cp - Ashbya gossypii (Yeast) (Eremothecium
gossypii)
Length = 862
Score = 37.1 bits (82), Expect = 1.3
Identities = 20/84 (23%), Positives = 43/84 (51%), Gaps = 1/84 (1%)
Query: 21 DIKDRIKNNDKSIAALEGKDLLSD-QPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKF 79
D+++ + + +AALE + L D + + TL+ + + K+Q E + ++ ++
Sbjct: 195 DLREEMSRLQEELAALENRQLKKDGEISQLNRTLNDKDMQLAELKKQLESKTGEVISQEL 254
Query: 80 AKQQAIKMSKKTENDLEEQFGLIR 103
A + +K EN+LE+Q +R
Sbjct: 255 KAANAHQRAKTLENELEQQKNKVR 278
>UniRef50_Q0URH4 Cluster: Putative uncharacterized protein; n=1;
Phaeosphaeria nodorum|Rep: Putative uncharacterized
protein - Phaeosphaeria nodorum (Septoria nodorum)
Length = 448
Score = 37.1 bits (82), Expect = 1.3
Identities = 25/80 (31%), Positives = 39/80 (48%), Gaps = 5/80 (6%)
Query: 39 KDLLSDQPRDVEETLS--KTMWKYQDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLE 96
KD D D E S K + +Y Y+R + RR + E+ AK + ++SK+ + L
Sbjct: 310 KDDSEDPSEDPEYNPSAIKMVKRYNVYRRPDQLRRTL---EELAKTSSRRVSKRDKRSLH 366
Query: 97 EQFGLIRNSEDLRIRDKGYN 116
F IRN+ + R GY+
Sbjct: 367 SSFAEIRNTVEKPTRGPGYS 386
>UniRef50_Q2FPR9 Cluster: Regulatory protein, ArsR; n=2;
Methanomicrobiales|Rep: Regulatory protein, ArsR -
Methanospirillum hungatei (strain JF-1 / DSM 864)
Length = 252
Score = 37.1 bits (82), Expect = 1.3
Identities = 21/63 (33%), Positives = 36/63 (57%), Gaps = 3/63 (4%)
Query: 163 EHYETLMRSVHSIMDRTDQKHIKEII--LVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEM 220
+ +L +H M+RT+Q H+ II + ++YS L HH V+ A D L++ + E +M
Sbjct: 16 QEIHSLREDLHRFMERTNQIHVNAIISDIRNEYSGLL-AHHQVERAGDCLSHAMVHECKM 74
Query: 221 IET 223
+T
Sbjct: 75 HDT 77
>UniRef50_UPI00006CEB56 Cluster: hypothetical protein TTHERM_00370820;
n=1; Tetrahymena thermophila SB210|Rep: hypothetical
protein TTHERM_00370820 - Tetrahymena thermophila SB210
Length = 1792
Score = 36.7 bits (81), Expect = 1.7
Identities = 22/78 (28%), Positives = 39/78 (50%), Gaps = 1/78 (1%)
Query: 20 WDIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDY-KRQSEYRRKVMLKEK 78
W K + + K E K + +Q R ++E K ++Q++ K+Q E +++ K+K
Sbjct: 1397 WKEKKEVFDQQKQKLIEEQKKIQEEQLRRMQEEKEKKEKEFQEFNKKQIEKQQEEFKKQK 1456
Query: 79 FAKQQAIKMSKKTENDLE 96
+QQ KKT DL+
Sbjct: 1457 EKEQQVNNFKKKTNLDLK 1474
>UniRef50_UPI00006CB6DE Cluster: hypothetical protein
TTHERM_00494050; n=1; Tetrahymena thermophila SB210|Rep:
hypothetical protein TTHERM_00494050 - Tetrahymena
thermophila SB210
Length = 1181
Score = 36.7 bits (81), Expect = 1.7
Identities = 35/159 (22%), Positives = 69/159 (43%), Gaps = 11/159 (6%)
Query: 27 KNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQQAIK 86
KN+ K + D+ +D++ L++ + Q+ K Q E + ML+EK K+Q IK
Sbjct: 314 KNSLKIYQLQKELDISQQNTQDIQMQLAQAQKQIQELKNQCELK---MLEEKQMKEQIIK 370
Query: 87 MSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQ 146
S+ + ++ F L + +K + I Q DL D + K+ Q QQ
Sbjct: 371 ESEIKVDSQQKAFQLEQQKS-----EKEQQIRELKRDIEQL---KEDLQDQKEKVIQEQQ 422
Query: 147 YFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIK 185
+L + + E ++++ + D+ +K+ +
Sbjct: 423 KNKDLKNNEYSLTKDIQTLEEQLQNIQNDHDKLQEKYAR 461
>UniRef50_UPI00015A629B Cluster: UPI00015A629B related cluster; n=1;
Danio rerio|Rep: UPI00015A629B UniRef100 entry - Danio
rerio
Length = 2736
Score = 36.7 bits (81), Expect = 1.7
Identities = 52/258 (20%), Positives = 105/258 (40%), Gaps = 24/258 (9%)
Query: 15 YRPLNWDIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKR-----QSEY 69
+R L IKD+ + + K +A L+ DQ + +KT + Q K+ QSE
Sbjct: 368 HRSLEQKIKDQERESQKELAQLQSSYQALDQ--QFTQVKNKTSMEIQQAKKDHNVLQSEM 425
Query: 70 RRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLR-IRDKGYNLHAFNTLISQRI 128
+ LK + K+ K ++ Q ++ +E + + + N + Q +
Sbjct: 426 DKVTALKNRLEKELEELKQKLLRSEQALQASQVKEAETKKKFEEMQREKNTLNCQLDQGM 485
Query: 129 GDHRDLPD----TRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSV--HSIMDRTDQK 182
+ L D T L +++ D+L + NE L + + S+ + +
Sbjct: 486 KRVKQLEDEKQNTEQILAKNRMMVDDL---KVKTQTQNEELTELRKKMDHQSVSSAQELE 542
Query: 183 HIKEIIL------VDDYSDLYNLHHDVQEAVDKLNNVIKKEEEM-IETNNIDMEXXXXXX 235
++K+ ++ + ++L L HDV+ +K+ V K+ EE+ + +N+ E
Sbjct: 543 NLKKTLIEAEAKNMKTQAELQKLVHDVELKENKICAVEKENEELKMTSNSCQKELAEMKK 602
Query: 236 XXXXXXKKSTENSEVKNN 253
+ TE ++ NN
Sbjct: 603 EYDALLQWKTEKEQLINN 620
>UniRef50_UPI000069FF36 Cluster: M-phase phosphoprotein 1 (MPP1)
(Kinesin-related motor interacting with PIN1).; n=1;
Xenopus tropicalis|Rep: M-phase phosphoprotein 1 (MPP1)
(Kinesin-related motor interacting with PIN1). - Xenopus
tropicalis
Length = 755
Score = 36.7 bits (81), Expect = 1.7
Identities = 50/223 (22%), Positives = 89/223 (39%), Gaps = 19/223 (8%)
Query: 24 DRIKNNDKSIAALEGKDLLS-DQPRDVEETLSKTMWKYQDYKRQ----SEYRRKVMLKEK 78
D K +S+ LE ++ S +E+ L + KY+ Y++ +E RK+ K
Sbjct: 103 DSYKEKCESLKCLEEQNKASASDTLQLEQNLKEAQTKYEAYEKDITTLNEENRKLG-KNI 161
Query: 79 FAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTR 138
Q+ + ++KT D EEQ G +L +D L D + +
Sbjct: 162 TELQEKLNAAEKTSKDKEEQVGQSAKEIELLKKDLSQRASELKVLQLDLQRKEEDCTELK 221
Query: 139 NKLCQSQQYFDELPR--------ASIIICFYNEHYETLMRSVH---SIMDRTDQKHIKEI 187
+KL S++ ++ + ++ NE YE L + + RT Q+ KE
Sbjct: 222 DKLMDSKKQIQQVEKEVSGMREEKRLLTNKVNE-YEKLKNQMSRELEMKQRTIQQLKKES 280
Query: 188 ILVDDYSDLYNLHHDV-QEAVDKLNNVIKKEEEMIETNNIDME 229
+ D+ L+ QEA +K + +E +IE +E
Sbjct: 281 ADNEKNGDVMQLYQKACQEAQEKEKIIEDMKETLIEQEQTQVE 323
>UniRef50_Q82L26 Cluster: Putative secreted alpha-galactosidase;
n=1; Streptomyces avermitilis|Rep: Putative secreted
alpha-galactosidase - Streptomyces avermitilis
Length = 658
Score = 36.7 bits (81), Expect = 1.7
Identities = 26/91 (28%), Positives = 39/91 (42%), Gaps = 10/91 (10%)
Query: 503 WFETDRSELVLGRTLCLDA-----SNNVAPILGKCHEMGGTQEWKHKGTASSPIYNTAAG 557
W T R ELVL CLDA +N ++ C+ Q+W + I N AG
Sbjct: 570 WTYTSRKELVLYGNKCLDAYNLGTTNGTKVVIWDCNGQ-ANQKWNI--NSDGTITNVNAG 626
Query: 558 MCLGV--DRSYRGETVLMVICDDYSNNKWDI 586
+CL + G ++++ C N KW +
Sbjct: 627 LCLDAYNAATANGTSLVLWSCGTGDNQKWTV 657
>UniRef50_A5EY82 Cluster: Serine protease; n=1; Dichelobacter
nodosus VCS1703A|Rep: Serine protease - Dichelobacter
nodosus (strain VCS1703A)
Length = 467
Score = 36.7 bits (81), Expect = 1.7
Identities = 23/79 (29%), Positives = 36/79 (45%), Gaps = 3/79 (3%)
Query: 276 ADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY--SARAVTPVIDVINADTF 333
A VGD+L+ + H LPPL+ G D V++ Y + T + + N +T
Sbjct: 304 AQLKVGDILLSFNGHTINKASDLPPLVAMAPLGKD-VEIEYLRDGKKQTTTVKIENLETA 362
Query: 334 EYSPSPLVRGGFNWGLHFK 352
+ S + R NWG+ K
Sbjct: 363 DTSSAATSREMRNWGIELK 381
>UniRef50_A2G5G4 Cluster: Putative uncharacterized protein; n=1;
Trichomonas vaginalis G3|Rep: Putative uncharacterized
protein - Trichomonas vaginalis G3
Length = 346
Score = 36.7 bits (81), Expect = 1.7
Identities = 22/86 (25%), Positives = 44/86 (51%)
Query: 21 DIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFA 80
++K R +KS + + LL DQ ++V+ L + Y+++ E +K L+ K
Sbjct: 21 EVKKREDLKEKSEELKKEQTLLKDQSQNVKTELESINEMIEKYEQKRETSQKDNLRYKEQ 80
Query: 81 KQQAIKMSKKTENDLEEQFGLIRNSE 106
Q +I+ E++L+EQ L+ ++
Sbjct: 81 LQNSIQQKSTAESELKEQKQLLEEAK 106
>UniRef50_A2EN31 Cluster: Viral A-type inclusion protein, putative;
n=1; Trichomonas vaginalis G3|Rep: Viral A-type
inclusion protein, putative - Trichomonas vaginalis G3
Length = 5296
Score = 36.7 bits (81), Expect = 1.7
Identities = 50/239 (20%), Positives = 102/239 (42%), Gaps = 28/239 (11%)
Query: 21 DIKDRIKNNDKSIAALEGKDLLSDQ----PRDVEETLSKTMWKYQDYKRQSEYRRKVM-- 74
++K++I + + I AL +LL Q D +E + + +D K+Q E + K +
Sbjct: 378 NLKNKIADRESQIKAL---NLLIAQYQTDDEDKKEIIENLEKEIKDLKKQIEDKDKEIEV 434
Query: 75 LKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDL 134
LK K AK + I D E++ ++ + D+ + D FN ++++ +
Sbjct: 435 LKAKIAKIEEIP------EDEEDEDIVVAGTRDVDLGD-------FNEEEAEQVSLEDQV 481
Query: 135 PDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYS 194
+ KL ++ ++ +A + + E L + + DR D++ L S
Sbjct: 482 KQLKEKLDDKKKNGVQMKQA---LASKDAEIEKLNEQIQELKDRNDKQEQNIEELNTKNS 538
Query: 195 DLYNLHHDVQEAVDKLNNVIKKEEEMIETNNIDMEXXXXXXXXXXXXKKSTENSEVKNN 253
DL N + + ++ +D+L N +K ++ + + K EN+E K+N
Sbjct: 539 DLQNSNDEYKKLIDELQNQLK---DLAKNKAESSDLNNSENTKQDSEKAEDENAETKSN 594
>UniRef50_A2E7J3 Cluster: Putative uncharacterized protein; n=1;
Trichomonas vaginalis G3|Rep: Putative uncharacterized
protein - Trichomonas vaginalis G3
Length = 397
Score = 36.7 bits (81), Expect = 1.7
Identities = 19/67 (28%), Positives = 37/67 (55%), Gaps = 1/67 (1%)
Query: 48 DVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNS-E 106
D E L K + DY++Q ++ + V+ E+ A ++ IK ++ + L + F L+ N
Sbjct: 56 DTEPQLQKELNTLVDYQKQLQFYQNVLRHEQQAAEEEIKRARLDTSQLRKSFELLENELM 115
Query: 107 DLRIRDK 113
+L++R K
Sbjct: 116 ELQLRQK 122
>UniRef50_A5DZR6 Cluster: Putative uncharacterized protein; n=1;
Lodderomyces elongisporus NRRL YB-4239|Rep: Putative
uncharacterized protein - Lodderomyces elongisporus
(Yeast) (Saccharomyces elongisporus)
Length = 865
Score = 36.7 bits (81), Expect = 1.7
Identities = 25/68 (36%), Positives = 37/68 (54%), Gaps = 5/68 (7%)
Query: 64 KRQSEYRRKVMLKEKFAKQQAIKMSKKTEN---DLEEQFGLIRNSEDLRIRDKGYNLHAF 120
K ++E ++V+ K KF KQQ + KT+N +L+E FG I +DLR KG F
Sbjct: 213 KTKAEVMKEVIAKSKFYKQQRQRDYAKTQNQIDELDEDFGDI--MDDLRNTQKGVAKPQF 270
Query: 121 NTLISQRI 128
+T + I
Sbjct: 271 STKTPEEI 278
>UniRef50_Q58718 Cluster: DNA double-strand break repair rad50
ATPase; n=1; Methanocaldococcus jannaschii|Rep: DNA
double-strand break repair rad50 ATPase - Methanococcus
jannaschii
Length = 1005
Score = 36.7 bits (81), Expect = 1.7
Identities = 50/218 (22%), Positives = 89/218 (40%), Gaps = 19/218 (8%)
Query: 48 DVEETLSKTMWKYQDYKRQSEYRRKV--MLKE-KFAKQQAIKMSKKTE---NDLEEQFGL 101
+ ETL++ +Y+ YK + RK+ L+E K + +K++K+ E D+E+
Sbjct: 274 EARETLNRHKDEYEKYKSLVDEIRKIESRLRELKSHYEDYLKLTKQLEIIKGDIEKLKEF 333
Query: 102 IRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFY 161
I S + RD NL I I + D +L + +++ + I
Sbjct: 334 INKS---KYRDDIDNLDTLLNKIKDEIERVETIKDLLEELKNLNEEIEKIEKYKRICEEC 390
Query: 162 NEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEMI 221
E+YE + ++K ++ L +Y L +++ ++ L I K E
Sbjct: 391 KEYYEKYLE--------LEEKAVEYNKLTLEYITLLQEKKSIEKNINDLETRINKLLE-- 440
Query: 222 ETNNIDMEXXXXXXXXXXXXKKSTENSEVKNNVFNIRL 259
ET NID+E KK EN + + N +L
Sbjct: 441 ETKNIDIESIENSLKEIEEKKKVLENLQKEKIELNKKL 478
>UniRef50_UPI00015B6253 Cluster: PREDICTED: similar to CG33715-PD;
n=1; Nasonia vitripennis|Rep: PREDICTED: similar to
CG33715-PD - Nasonia vitripennis
Length = 7697
Score = 36.3 bits (80), Expect = 2.3
Identities = 22/74 (29%), Positives = 41/74 (55%), Gaps = 3/74 (4%)
Query: 22 IKDRIKNNDKSIAALEGKDLLSDQPRDVE--ETLSKTMWKYQDYKRQSEYRRKVMLKEKF 79
IK+ I+ D I +E K+ D+P+ E ET+ K + + ++ ++V KE+
Sbjct: 3010 IKEEIQQRDSQIGKVEEKETQQDKPKKDEPNETVVKKDEHQKKESQNEKFVKQVATKEES 3069
Query: 80 AKQQAI-KMSKKTE 92
K++A+ ++SKK E
Sbjct: 3070 RKEEAVEQVSKKEE 3083
>UniRef50_Q2JCN5 Cluster: Glycosyl transferase, family 2 precursor;
n=1; Frankia sp. CcI3|Rep: Glycosyl transferase, family
2 precursor - Frankia sp. (strain CcI3)
Length = 466
Score = 36.3 bits (80), Expect = 2.3
Identities = 47/176 (26%), Positives = 75/176 (42%), Gaps = 19/176 (10%)
Query: 267 GLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRYSARAVTPVID 326
GL RAR G + V+VF D +EV+ WL LL + G GV V + VT +I
Sbjct: 174 GLSRARNAGLAAATTPVVVFTDDDVEVDPRWLEFLLSGFAAG-SGV-VDETVGCVTGLIR 231
Query: 327 VINADTFEYSPSPL---VRGGFNWG-LHFKWDNLPKGTLINDEDFMKPLKSPTMAGGLFA 382
+ T P+ + GGF G + ++D T D + P + G +
Sbjct: 232 PLELST----PAQVWFEQFGGFGKGFVGRRFDR----TENRSGDLLYPYTAGVFGSGANS 283
Query: 383 IYR-EYFNAIGKYD----PGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRK 433
+R + +G +D G GGE+L+I + G L P + + H+ ++
Sbjct: 284 AFRTDTLRQLGGFDEFLGTGTAARGGEDLDIFLSVVRSGHVLVYEPAALIRHLHKR 339
>UniRef50_O06764 Cluster: Phase variable surface lipoprotein P78
precursor; n=1; Mycoplasma fermentans|Rep: Phase
variable surface lipoprotein P78 precursor - Mycoplasma
fermentans
Length = 680
Score = 36.3 bits (80), Expect = 2.3
Identities = 37/149 (24%), Positives = 74/149 (49%), Gaps = 25/149 (16%)
Query: 42 LSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGL 101
LSD+ +D+ E L+K +Y K ++ ++++ EK+ ++A+ K E D ++
Sbjct: 172 LSDKLKDINEFLTKNKSEYDAEKAKANPNKEII--EKY--ERALIRKAKYE-DTADKDSY 226
Query: 102 IRNSEDLRIRDKGYNLHAFN--TLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIIC 159
I++ E+L I+ Y + + N ++S R +P+ +N FD+
Sbjct: 227 IKSFENLNIK---YKIQSINDPKIVSDEFS--RSIPNFKNDSYIDYYVFDK--------- 272
Query: 160 FYNEHYETLMRSVHSIMDRTDQ--KHIKE 186
++ Y T+ S HS+ D T++ K+IK+
Sbjct: 273 --SDTYSTVSHSFHSVSDLTEEIKKYIKK 299
>UniRef50_A6WGG7 Cluster: Glycosyl transferase family 2; n=1;
Kineococcus radiotolerans SRS30216|Rep: Glycosyl
transferase family 2 - Kineococcus radiotolerans
SRS30216
Length = 419
Score = 36.3 bits (80), Expect = 2.3
Identities = 12/37 (32%), Positives = 23/37 (62%)
Query: 380 LFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCG 416
+ A+ R+ F+A+G ++ +GGE+ E + R W+ G
Sbjct: 172 VLAVTRDLFDAVGGFEEAFTAYGGEDWEFAHRCWLAG 208
>UniRef50_A0V2D1 Cluster: Glycosyl transferase, family 2; n=1;
Clostridium cellulolyticum H10|Rep: Glycosyl
transferase, family 2 - Clostridium cellulolyticum H10
Length = 333
Score = 36.3 bits (80), Expect = 2.3
Identities = 18/48 (37%), Positives = 27/48 (56%)
Query: 244 STENSEVKNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHI 291
ST+NS + ++ KT K G+ AR G + + GD+L FLDS +
Sbjct: 44 STDNSIEIAKKYPCKIFKTPKNGGVAAARNLGVEYASGDILFFLDSDV 91
>UniRef50_Q4Q0C8 Cluster: Phosphatidylinositol 3 kinase, putative;
n=3; Leishmania|Rep: Phosphatidylinositol 3 kinase,
putative - Leishmania major
Length = 2613
Score = 36.3 bits (80), Expect = 2.3
Identities = 28/116 (24%), Positives = 53/116 (45%), Gaps = 5/116 (4%)
Query: 15 YRPLNWDIKDRIKNNDKSIAALEGKDLLSD----QPRDVEETLSKTMWKYQDYKRQSEYR 70
Y PL + + D N + + A L + P + + +T +Y+ +Q R
Sbjct: 1178 YTPLVFPLLDGYGQNGQDVKAFTLSTLRTSGRVVAPENAQAQEEETAARYRVNTQQLAQR 1237
Query: 71 RKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQ 126
RK L+E FA+ ++I +++ E + E L + + +L +R N H F ++Q
Sbjct: 1238 RKTALEENFAQLRSILVARDRETEEEWNLWLKQLAVEL-LRSSPSNAHGFAFALAQ 1292
>UniRef50_O97294 Cluster: Putative uncharacterized protein PFC0990c;
n=2; Plasmodium|Rep: Putative uncharacterized protein
PFC0990c - Plasmodium falciparum (isolate 3D7)
Length = 753
Score = 36.3 bits (80), Expect = 2.3
Identities = 55/269 (20%), Positives = 114/269 (42%), Gaps = 25/269 (9%)
Query: 22 IKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVM-----LK 76
I++ KNN KD+ ++ D E L++ + Q K+ Y +K+ L
Sbjct: 381 IENLKKNNQIIYDKFLQKDISQNETNDTIEKLNQKLKSEQ--KQIYHYEQKINTLNDDLN 438
Query: 77 EKFAKQ-QAIKMSKKTENDL---EEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHR 132
+ K S +N L E+Q G ++N D+ I K + ++ R
Sbjct: 439 NSYQKYVYYYNKSDSNQNLLQQKEDQIGKLKNQLDIHINSKNDIQEKLKNIYTENNKVER 498
Query: 133 DLPDTRNKLCQSQQYFDELPRASIIICFYNEHYE-TLMRSVHSIMDRTDQKHIKEIILVD 191
D D +N+L +++ ++L + ++N+ Y + ++ I+ +++ + +++
Sbjct: 499 DNDDLKNELTKTKLNLEKL-KDEYEELYHNKQYVFSCYKNEEKILKENLERYKTKCAILE 557
Query: 192 DYSDLYNLHHDVQEAVDKLNNVIKKEEEMIETNNIDMEXXXXXXXXXXXXKKSTENSEVK 251
+ D + L ++ KL + + E+ M E + E +TE ++K
Sbjct: 558 NQKDCHILEDKYKQLEIKLKDT-ENEKYMYERTCLMNE--------KRQNDMATEIKDLK 608
Query: 252 NNVFNI--RLLKTSKREGL-IRARLYGAD 277
N +++ RL K SK + L ++ LY D
Sbjct: 609 NELYDCKNRLYKMSKSDVLDMKTTLYNQD 637
>UniRef50_A2EJ43 Cluster: Viral A-type inclusion protein, putative;
n=1; Trichomonas vaginalis G3|Rep: Viral A-type
inclusion protein, putative - Trichomonas vaginalis G3
Length = 940
Score = 36.3 bits (80), Expect = 2.3
Identities = 48/253 (18%), Positives = 112/253 (44%), Gaps = 17/253 (6%)
Query: 21 DIKDRIKN--NDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQS-EYRRKV--ML 75
+IK++ +N +K + E + D ++ EET + + +DYK Q E ++++ +
Sbjct: 49 EIKNQNENLQKEKENSLNEMNKQIDDLQKEKEETEKALIEENEDYKNQLSELKKQIEDLQ 108
Query: 76 KEKFAKQQAIKMSKKTEN----DLEEQFGLIRNS-EDLRIRDKGYNLHAFNTL--ISQRI 128
E K + +K + N DL++Q L++ S + +D+ + + + + Q++
Sbjct: 109 NENEEKVENLKKENEEFNNEIKDLQDQIELLKKSMSESEDKDQKFVIELNQQIEKLKQKV 168
Query: 129 GDHRDLPDTRNK--LCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKE 186
D +DL +++ + Q+ D + + + NE + + + D ++++ +K+
Sbjct: 169 SDEKDLIQVKDEEIIDLKQKNTDLSEQNNKLNEDKNELEKQIEELAQKLSDESEKEKLKQ 228
Query: 187 IILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEMI--ETNNIDMEXXXXXXXXXXXXKKS 244
I + S+ N D + ++ L + + E+ I +T ID +
Sbjct: 229 EI-NELKSEKENSEKDFNKKLENLTQKVTELEDSISQKTREIDEAETAKEDISLKLDNLA 287
Query: 245 TENSEVKNNVFNI 257
EN ++ N+ I
Sbjct: 288 EENEKLSQNLSEI 300
>UniRef50_A0CQE7 Cluster: Chromosome undetermined scaffold_24, whole
genome shotgun sequence; n=3; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_24,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 336
Score = 36.3 bits (80), Expect = 2.3
Identities = 42/215 (19%), Positives = 92/215 (42%), Gaps = 6/215 (2%)
Query: 8 NKATKVHYRPLNWDIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQS 67
N+ K + L + + N K L+ + L+ ++ +++ ++ + K +D ++ +
Sbjct: 92 NQYLKQRIKQLESQNNNYVSENKKLAHVLDQQIQLNQSLQEQQQSKNQIIKKLEDVQKMN 151
Query: 68 EYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQR 127
+Y++ K Q I +SKK NDLEE+ ++ N E+ ++ + +
Sbjct: 152 KYQQSNNSDLKQINDQLI-ISKKVVNDLEEKVQIVLN-ENQKLSELNERFQFTENQLKIE 209
Query: 128 IGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEI 187
I ++ + QQ + I N+ E L R + + + Q + ++
Sbjct: 210 IEKYKSKCSILESKMKQQQDESKCLELQRKIKKQNDQLEILARENYQLKQQLTQNN-NQV 268
Query: 188 ILVDDYSDLYNLHHD-VQEAVDKLNNVIKKEEEMI 221
DD LY +H+D +E + +L + ++MI
Sbjct: 269 SQQDD-KQLY-VHNDKFKETIQELECENQYLQQMI 301
>UniRef50_Q4JC59 Cluster: Conserved Archaeal membrane protein; n=4;
Sulfolobaceae|Rep: Conserved Archaeal membrane protein -
Sulfolobus acidocaldarius
Length = 342
Score = 36.3 bits (80), Expect = 2.3
Identities = 27/99 (27%), Positives = 46/99 (46%), Gaps = 5/99 (5%)
Query: 255 FNIRLLKTSKR----EGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVD 310
+N++++ ++K G I A+L G ++ GD++VF DS WL L+ LS V
Sbjct: 88 YNVKVVVSNKNCDICSGKINAQLEGLKHARGDIIVFADSDTWFPKYWLKELVSPLSNYVA 147
Query: 311 GVKVRYSARAVTPVIDVINADTFEYS-PSPLVRGGFNWG 348
++ + ++I A + S V G F WG
Sbjct: 148 TTVFSWAKPVRLTIGNIIRAGFWTLGFESQAVGGTFLWG 186
>UniRef50_UPI000150A0FA Cluster: Type III restriction enzyme, res
subunit family protein; n=1; Tetrahymena thermophila
SB210|Rep: Type III restriction enzyme, res subunit
family protein - Tetrahymena thermophila SB210
Length = 2730
Score = 35.9 bits (79), Expect = 3.0
Identities = 24/125 (19%), Positives = 55/125 (44%), Gaps = 1/125 (0%)
Query: 48 DVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSED 107
++E+T K K + + + K + F +SK+ LE+ L++++E+
Sbjct: 497 EIEDTF-KANAKTLRIFHKGQVKIKSIYTNNFYTAACFSISKRVITLLEQMDELVKSNEE 555
Query: 108 LRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYET 167
+ + D NL L Q++GDH L Q +++ D + + + Y+
Sbjct: 556 VNVNDLESNLSKLKDLHEQQVGDHLQLNSQLIYQEQVKKWCDFIQQKKTVFEIDYNKYDQ 615
Query: 168 LMRSV 172
+++++
Sbjct: 616 VLKNI 620
>UniRef50_UPI00006CF2BD Cluster: Leucine Rich Repeat family protein;
n=1; Tetrahymena thermophila SB210|Rep: Leucine Rich
Repeat family protein - Tetrahymena thermophila SB210
Length = 1504
Score = 35.9 bits (79), Expect = 3.0
Identities = 38/209 (18%), Positives = 82/209 (39%), Gaps = 6/209 (2%)
Query: 71 RKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGD 130
R+ +LK+K Q K E ++ +IR+ E+ +K ++ N L + G
Sbjct: 977 RENILKQKQILYQMNHELKLNETNINSSSEVIRSYEN----EKQNLIYEINQLKEENFGQ 1032
Query: 131 HRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILV 190
+ + + + + R I E YE +R ++ + ++ K+I
Sbjct: 1033 KLAIEEFQQIQIKCNSQLRSIERQEKIFKEEKEAYELRIRDYEELLKQIHSEYQKQIHQK 1092
Query: 191 D-DYSDLYNLHHDVQEAVDKLNNVIK-KEEEMIETNNIDMEXXXXXXXXXXXXKKSTENS 248
D + + N VQ+ + L ++ K+ ++ET + +K +NS
Sbjct: 1093 DIEIHESKNETRAVQKQISILQRELEFKQNTILETEKALLNRGEQSSILLRETEKKLQNS 1152
Query: 249 EVKNNVFNIRLLKTSKREGLIRARLYGAD 277
+ N F ++ + ++ ++YG D
Sbjct: 1153 IMMANEFKTKIEELYSENEELKDQVYGKD 1181
>UniRef50_UPI0000499464 Cluster: DNA repair protein Rad50; n=1;
Entamoeba histolytica HM-1:IMSS|Rep: DNA repair protein
Rad50 - Entamoeba histolytica HM-1:IMSS
Length = 1241
Score = 35.9 bits (79), Expect = 3.0
Identities = 54/265 (20%), Positives = 107/265 (40%), Gaps = 18/265 (6%)
Query: 5 QIVNKATKVHYRPLNWDIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYK 64
Q ++ KV+ + ++ +I N K LE DL + R++E+ +++ K Q +
Sbjct: 307 QSIDVQQKVNKEGQHQELSKQIHNQMKDQTLLEN-DLKNR--RELEKEINE---KIQGVE 360
Query: 65 RQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLI 124
E +K + KQ+ I+ KK E E N + ++ YN+H I
Sbjct: 361 SVEEKVNNEKIKNE-EKQKEIEEEKKKEEKELEDMSKEVNEIKNELENRKYNVH-----I 414
Query: 125 SQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHI 184
Q +++ + R K + Q+ +E+ I NE + L + + + ++K
Sbjct: 415 KQDDVNNKTTENERKKK-RDQEIQEEINEMKKDIIKKNEEIDDLKKQLSK--ESFEEKEQ 471
Query: 185 KEIILVDDY-SDLYNLHHDVQEAVDKLNNVIKKEEEM--IETNNIDMEXXXXXXXXXXXX 241
K I +++ D+ + +++ A++ + IK E M I N ++E
Sbjct: 472 KSKIKLEEIKKDIEEIDNEINRALENIQQQIKIERLMKEINENKTELENFKLTVGKDLQG 531
Query: 242 KKSTENSEVKNNVFNIRLLKTSKRE 266
K+ +K I +K E
Sbjct: 532 KEKDIKETIKKQKNEILSMKNDSEE 556
>UniRef50_Q1WV23 Cluster: Superfamily II DNA and RNA helicase; n=1;
Lactobacillus salivarius subsp. salivarius UCC118|Rep:
Superfamily II DNA and RNA helicase - Lactobacillus
salivarius subsp. salivarius (strain UCC118)
Length = 788
Score = 35.9 bits (79), Expect = 3.0
Identities = 22/77 (28%), Positives = 34/77 (44%), Gaps = 3/77 (3%)
Query: 141 LCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEII---LVDDYSDLY 197
+C FD L + N Y L+ H+++ R+ + KEI + D L
Sbjct: 347 ICDYNYLFDPLVKLQRFFTERNYDYTFLLDEAHNLVSRSRDMYTKEISSQQIKDLLDKLQ 406
Query: 198 NLHHDVQEAVDKLNNVI 214
L H Q+ VDKLN ++
Sbjct: 407 TLPHPPQKIVDKLNTLL 423
>UniRef50_Q7RT92 Cluster: Phosphatidylinositol transfer protein 2;
n=4; Plasmodium (Vinckeia)|Rep: Phosphatidylinositol
transfer protein 2 - Plasmodium yoelii yoelii
Length = 738
Score = 35.9 bits (79), Expect = 3.0
Identities = 28/113 (24%), Positives = 48/113 (42%), Gaps = 5/113 (4%)
Query: 188 ILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEMIETNNIDMEXXXXXXXXXXXXKKSTEN 247
I V+DY +L N+++DV+ + D + N ++ + I E K++ N
Sbjct: 597 IKVEDYKNLMNIYNDVENS-DNIENEVQPLNNL--NGKIKNENDKLSYIFKKNDKENENN 653
Query: 248 SEVKNNVFNIRLLKTSKREGL--IRARLYGADNSVGDVLVFLDSHIEVNVGWL 298
KN V N L+ SK I L +N+ L D +++N W+
Sbjct: 654 VNNKNGVNNHNTLQISKNNKYENIPITLQLFNNATDKKLKINDKKLKINDSWI 706
>UniRef50_A2ETW9 Cluster: Viral A-type inclusion protein, putative;
n=1; Trichomonas vaginalis G3|Rep: Viral A-type
inclusion protein, putative - Trichomonas vaginalis G3
Length = 2010
Score = 35.9 bits (79), Expect = 3.0
Identities = 42/208 (20%), Positives = 90/208 (43%), Gaps = 18/208 (8%)
Query: 22 IKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAK 81
I+ +++ K IA E D L +Q ++ E S + + K Q+E + KE
Sbjct: 788 IESENESSSKIIALTEEIDELKNQINNISEQKSTLEFTIDEIKAQNESEISQLKKENEDL 847
Query: 82 QQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKL 141
I+ K N+L+ + I+NS L + + N N L + + + D+ N+
Sbjct: 848 NSKIESLSKENNELKTEIENIQNSHSLSLLETEMN----NKLTN--LNEENDMLKNENEN 901
Query: 142 C--QSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNL 199
+ ++ E + F+ ++ + + D++ K I+L L
Sbjct: 902 IKREKEETLAENKSLKDTLDFFEKNLTKINEQNKDKTEELDKQ--KRIVLT-----LTGE 954
Query: 200 HHDVQEAVDKLNN---VIKKEEEMIETN 224
+++++ +DK+ N +++KE E +E++
Sbjct: 955 NNELKSKLDKIKNDYELLQKENEKLESD 982
>UniRef50_Q0W1H0 Cluster: Putative glycosyltransferase; n=2;
uncultured methanogenic archaeon RC-I|Rep: Putative
glycosyltransferase - Uncultured methanogenic archaeon
RC-I
Length = 234
Score = 35.9 bits (79), Expect = 3.0
Identities = 25/87 (28%), Positives = 43/87 (49%), Gaps = 3/87 (3%)
Query: 367 FMKPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSR 426
FM + +P +AG FA+ RE F+ G +D + GE++E+ RI G P S
Sbjct: 137 FMARINNPAVAGANFAVTREAFDKAGGFDESLVT--GEDIELCKRIKRY-GRFVFNPDSL 193
Query: 427 VGHVFRKRRPYGVGEKQDYMLQNSMRM 453
V R+ R +G + + N++++
Sbjct: 194 VYVSMRRVREWGYARFVAFHVTNTIKV 220
>UniRef50_A2STK1 Cluster: Glycosyl transferase, family 2; n=2;
Methanomicrobiales|Rep: Glycosyl transferase, family 2 -
Methanocorpusculum labreanum (strain ATCC 43576 / DSM
4855 / Z)
Length = 238
Score = 35.9 bits (79), Expect = 3.0
Identities = 25/104 (24%), Positives = 41/104 (39%), Gaps = 2/104 (1%)
Query: 207 VDKLNNVIKKEEEMIETNNIDMEXXXXXXXXXXXXKKSTENSEVKNNVFNIRLLKTSKRE 266
V+ L I K E +E E ++ E E K+ +RLL + +R+
Sbjct: 16 VEALKTAIPKSIEALEAYGKSFELIIAEDGSTDGSRECVEEWERKDP--RVRLLHSDERQ 73
Query: 267 GLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVD 310
G RA S G++ + D + ++ L LL + G D
Sbjct: 74 GRGRALNRALAESRGEIFCYYDVDLATDISHLSELLDHIEDGAD 117
>UniRef50_UPI000150A21E Cluster: hypothetical protein TTHERM_00191160;
n=1; Tetrahymena thermophila SB210|Rep: hypothetical
protein TTHERM_00191160 - Tetrahymena thermophila SB210
Length = 1590
Score = 35.5 bits (78), Expect = 4.0
Identities = 30/142 (21%), Positives = 57/142 (40%), Gaps = 4/142 (2%)
Query: 113 KGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSV 172
K +N +NTL D +D D+ N+ Q + LP+ + F ++ V
Sbjct: 1397 KIFNQERYNTLTEMNAKDLQDQSDS-NQEFSKQNSQNNLPKQESKVTFPTGKKNSIKSDV 1455
Query: 173 HSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEMIET-NNIDMEXX 231
K K ++ +D++ +YN + D + NN + ++EE +E+ I E
Sbjct: 1456 SKSQLSNKSKAFKNVV-ANDFN-VYNSNVDQEAVAANQNNYMLEDEEEVESQGQISNEDD 1513
Query: 232 XXXXXXXXXXKKSTENSEVKNN 253
+ ++ + V NN
Sbjct: 1514 HSQYQQNNRNQTNSNHQNVNNN 1535
>UniRef50_UPI00006CF1FD Cluster: hypothetical protein
TTHERM_00540130; n=1; Tetrahymena thermophila SB210|Rep:
hypothetical protein TTHERM_00540130 - Tetrahymena
thermophila SB210
Length = 1215
Score = 35.5 bits (78), Expect = 4.0
Identities = 26/97 (26%), Positives = 44/97 (45%), Gaps = 2/97 (2%)
Query: 5 QIVNKATKVHYRPLNWDIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWK--YQD 62
Q + K ++ +P IKD+I+ +K AA+E L P+ + + L K + Y +
Sbjct: 604 QFILKKNQILKQPSLETIKDKIEYYNKIEAAIEQTQLKESTPQQLRDRLFKLIRNMIYCE 663
Query: 63 YKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQF 99
K +Y +KEK K A ++ + D E F
Sbjct: 664 LKLTFKYEMIPQMKEKLQKSIADVLNTDIQADQNEYF 700
>UniRef50_Q8KU52 Cluster: EF0109; n=1; Enterococcus faecalis|Rep:
EF0109 - Enterococcus faecalis (Streptococcus faecalis)
Length = 1924
Score = 35.5 bits (78), Expect = 4.0
Identities = 22/64 (34%), Positives = 33/64 (51%), Gaps = 1/64 (1%)
Query: 251 KNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLP-PLLKRLSQGV 309
K N+++ +L T KR GL+RA L G N D+ D+ EV +P + R +
Sbjct: 441 KTNIWSSGILSTDKRGGLVRAALVGRTNGTIDIYFKDDTPQEVLNSEIPFQVWARFKEKK 500
Query: 310 DGVK 313
+GVK
Sbjct: 501 EGVK 504
>UniRef50_A5ZRG2 Cluster: Putative uncharacterized protein; n=1;
Ruminococcus obeum ATCC 29174|Rep: Putative
uncharacterized protein - Ruminococcus obeum ATCC 29174
Length = 927
Score = 35.5 bits (78), Expect = 4.0
Identities = 19/63 (30%), Positives = 35/63 (55%), Gaps = 4/63 (6%)
Query: 163 EHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEMIE 222
E ET ++ V+ D D I EI+ DD DL + + D++ L++++KK ++++E
Sbjct: 542 EEIETFIKQVNLTYDLIDDSEINEIL--DD--DLKDFNSDIESIYKILHDIVKKSDKIVE 597
Query: 223 TNN 225
N
Sbjct: 598 KIN 600
>UniRef50_A4JHB1 Cluster: Putative uncharacterized protein; n=1;
Burkholderia vietnamiensis G4|Rep: Putative
uncharacterized protein - Burkholderia vietnamiensis
(strain G4 / LMG 22486) (Burkholderiacepacia (strain
R1808))
Length = 218
Score = 35.5 bits (78), Expect = 4.0
Identities = 33/124 (26%), Positives = 60/124 (48%), Gaps = 9/124 (7%)
Query: 41 LLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTEND--LEEQ 98
LL+ +++EE +K + K + + KV KE+ A +Q K S + E+D ++
Sbjct: 83 LLNIPDKEIEEIATKIIAVVNGAKLPKKQQSKVEQKEEVASEQINKWSNQFEDDKKSDDP 142
Query: 99 FG--LIRNSED-LRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQS--QQYFDELPR 153
F L+ N E ++ + N+ F L+ + G R+L DT + +S QQ D L +
Sbjct: 143 FAITLVNNFESGIKEEEPKINILEFMKLVGK--GFDRELTDTEKSVVKSYYQQNSDLLAK 200
Query: 154 ASII 157
++
Sbjct: 201 DQLV 204
>UniRef50_Q7RSH4 Cluster: Putative uncharacterized protein PY00383;
n=2; Plasmodium (Vinckeia)|Rep: Putative uncharacterized
protein PY00383 - Plasmodium yoelii yoelii
Length = 859
Score = 35.5 bits (78), Expect = 4.0
Identities = 26/104 (25%), Positives = 51/104 (49%), Gaps = 5/104 (4%)
Query: 161 YNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEM 220
Y + Y T + I R ++ IK I + YSDL L +V +DKLNN KK +++
Sbjct: 155 YKKEYTTHQKCNKEI-SRQNKDGIKNIYSNNKYSDLPKLRKNV---LDKLNNADKKNDQL 210
Query: 221 IETNNIDMEXXXXXXXXXXXXKKSTENSEVKNNVFNIRLLKTSK 264
T +++++ K+ N++ ++++ + +T+K
Sbjct: 211 F-TLSLNVKNNKKSKLNYIDTKEKNYNNKSQSHIDKPNVTRTNK 253
>UniRef50_Q7RBX2 Cluster: Mature-parasite-infected erythrocyte
surface antigen; n=5; Plasmodium|Rep:
Mature-parasite-infected erythrocyte surface antigen -
Plasmodium yoelii yoelii
Length = 761
Score = 35.5 bits (78), Expect = 4.0
Identities = 22/75 (29%), Positives = 41/75 (54%), Gaps = 3/75 (4%)
Query: 23 KDRIKNNDKSIAALEGKDLLSDQPRDVEET--LSKTMWKYQDYKRQSEYRRKVMLKEKFA 80
+++IKN ++SI E KD++S++P +EE L K K+ D+ + ++ M+ +K
Sbjct: 488 REKIKN-ERSIFLKEIKDMVSNKPEKIEEENYLKKLEEKFTDFDDKILRKKMKMMSKKKN 546
Query: 81 KQQAIKMSKKTENDL 95
+ + T NDL
Sbjct: 547 RMNNLSNVGMTSNDL 561
>UniRef50_Q4Z6V7 Cluster: Putative uncharacterized protein; n=3;
Plasmodium (Vinckeia)|Rep: Putative uncharacterized
protein - Plasmodium berghei
Length = 746
Score = 35.5 bits (78), Expect = 4.0
Identities = 20/75 (26%), Positives = 37/75 (49%), Gaps = 3/75 (4%)
Query: 41 LLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFG 100
+L D +++ + K + Y ++ EY+ K++ ++ IK N++ E
Sbjct: 74 ILKDNKNNLKLYIKKIRSNFNSYWKECEYQLKILKTYYIENEKLIKEIVDENNNMIES-- 131
Query: 101 LIRNSEDLRIRDKGY 115
I N+EDL+I DK Y
Sbjct: 132 -ITNNEDLKIDDKTY 145
>UniRef50_Q23YT2 Cluster: Kinesin motor domain containing protein;
n=1; Tetrahymena thermophila SB210|Rep: Kinesin motor
domain containing protein - Tetrahymena thermophila
SB210
Length = 1736
Score = 35.5 bits (78), Expect = 4.0
Identities = 41/180 (22%), Positives = 85/180 (47%), Gaps = 20/180 (11%)
Query: 43 SDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLI 102
SD+ D+++ L K KY K+Q E + + + K + + IK SK+T+ +EQ +
Sbjct: 732 SDKVVDIKQ-LEK---KYLQKKQQREPTKFIYINSKKIEVKKIKNSKRTKKLSQEQSSIN 787
Query: 103 RNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYN 162
++S ++GY+ H N++ + D + + N++ D + I+ N
Sbjct: 788 KSS-----INEGYDSHQ-NSIEKYYVQDSLNQGNPNNQIFSQ----DHSLISQFILNKTN 837
Query: 163 EHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEMIE 222
EH + + SI+D + I +++ S+++ L Q + LNN I +++++
Sbjct: 838 EHNQNNL----SILDNERYQSINQLLQPSTESNIHTLKTSNQ--IGNLNNNISNNQDILK 891
>UniRef50_Q22YR7 Cluster: Cyclic nucleotide-binding domain containing
protein; n=1; Tetrahymena thermophila SB210|Rep: Cyclic
nucleotide-binding domain containing protein -
Tetrahymena thermophila SB210
Length = 1484
Score = 35.5 bits (78), Expect = 4.0
Identities = 16/68 (23%), Positives = 45/68 (66%), Gaps = 6/68 (8%)
Query: 50 EETLSKTMWKYQDYK-RQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQ----FGLIRN 104
++ +++T+ +++ K +Q+ Y+++++L +KF + QA + +K+ +DL ++ F + +N
Sbjct: 1291 QQDITETIQQFEQEKGKQNNYQKQILLYKKFKQNQAFENEQKSSHDLNKEGKNNFSIFQN 1350
Query: 105 SEDLRIRD 112
+ L ++D
Sbjct: 1351 PQ-LNLKD 1357
>UniRef50_A2FX23 Cluster: Formin Homology 2 Domain containing
protein; n=1; Trichomonas vaginalis G3|Rep: Formin
Homology 2 Domain containing protein - Trichomonas
vaginalis G3
Length = 2354
Score = 35.5 bits (78), Expect = 4.0
Identities = 54/226 (23%), Positives = 101/226 (44%), Gaps = 22/226 (9%)
Query: 9 KATKVHYRPLNWDIKDRIK-NNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDY-KRQ 66
+A K Y+ L I+++ K +K E +DL + + + E T+ + Y R
Sbjct: 491 RAIKQSYKKLQDQIEEKTKIEGEKEEMKKENEDLKA-RLKTAESTIVIQKAAAESYTSRV 549
Query: 67 SEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQ 126
++ ++K+ E +QQ +K +N E+ L S+ L+ +++ + +L SQ
Sbjct: 550 NDLQQKLAEYESKLQQQISANEEKIKNQENEKVTL---SQKLKEQEEE-SRKIIESLQSQ 605
Query: 127 RIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKE 186
+DL N++ + Q +E+ + N+ YETL + + DRT ++E
Sbjct: 606 S----KDLQKMNNEMQVNLQ--NEISILKSKLTESNQKYETLEQKSSNESDRTASA-LQE 658
Query: 187 IILVDDYSDLYNLHHDVQEAVDKLNNVIKKEE---EMIETNNIDME 229
+ + NL D++ KLN + K+ E IE N D+E
Sbjct: 659 LKTQNK-----NLESDIENLTSKLNEITKQNEMKSREIERLNADIE 699
>UniRef50_A2F112 Cluster: Putative uncharacterized protein; n=1;
Trichomonas vaginalis G3|Rep: Putative uncharacterized
protein - Trichomonas vaginalis G3
Length = 376
Score = 35.5 bits (78), Expect = 4.0
Identities = 42/233 (18%), Positives = 95/233 (40%), Gaps = 20/233 (8%)
Query: 22 IKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAK 81
I+D ++ DK I E K+ L D+E+ +S + + + + + E K
Sbjct: 141 IRDYCRSLDKKIEIEESKNYLKQTITDLEKKIS-------EREGNLNKEKSIEVTEHHQK 193
Query: 82 QQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKL 141
+ IK +K E L+++ +R D + N + T ++ + + +L ++
Sbjct: 194 SEEIK-NKNVE--LQKEIQKLRLDVDKETNEHKNNDNMHITRMNNALSEKTELTRKIEQM 250
Query: 142 CQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHH 201
Q++Q DE+ + Y + + +++ +++ + ++ IL DY
Sbjct: 251 SQTKQ--DEIK--------LDSEYRAISKVLNASLEQLTEIQNQKEILAKDYEGEKQKID 300
Query: 202 DVQEAVDKLNNVIKKEEEMIETNNIDMEXXXXXXXXXXXXKKSTENSEVKNNV 254
D E ++ LN I+ + I+ N +E TE +++++
Sbjct: 301 DYHEKINNLNKKIQDLNDKIQNNKKTLETELYHGETSKMLNYLTEQRNLRDSL 353
>UniRef50_A2ERV7 Cluster: Putative uncharacterized protein; n=1;
Trichomonas vaginalis G3|Rep: Putative uncharacterized
protein - Trichomonas vaginalis G3
Length = 1347
Score = 35.5 bits (78), Expect = 4.0
Identities = 18/69 (26%), Positives = 40/69 (57%), Gaps = 1/69 (1%)
Query: 30 DKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQQAIKMSK 89
+K + E K+ S +P +++E + T+ + ++ ++ E +K +KE+ KQ+ IK K
Sbjct: 787 EKKVEETEIKENTSTKPNEIKEEIKPTIEEKKEETKEIEQTKKDEIKEEPKKQEEIK-QK 845
Query: 90 KTENDLEEQ 98
+ E +++Q
Sbjct: 846 EAEQPVQKQ 854
>UniRef50_A2EQA8 Cluster: Putative uncharacterized protein; n=1;
Trichomonas vaginalis G3|Rep: Putative uncharacterized
protein - Trichomonas vaginalis G3
Length = 1190
Score = 35.5 bits (78), Expect = 4.0
Identities = 36/181 (19%), Positives = 88/181 (48%), Gaps = 16/181 (8%)
Query: 59 KYQDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTE-NDLEEQFGLIRNSEDLRIRDKGYNL 117
+Y Q Y+++ +K+K + Q K + + NDL+++ G + ++E + +K +
Sbjct: 660 EYTQQIEQKLYQKQEEMKKKDEENQKEKENLMNQINDLKKKLGDL-STEKRNLNEK---M 715
Query: 118 HAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLM----RSVH 173
+ + I + ++ +T L + +Y +E+ + ++ EH + L+ ++++
Sbjct: 716 EKEKDIFEEEIENLKEDNETIKNL--NDKYVEEINKLKELVAEELEHNKQLIEAHEKAMN 773
Query: 174 SIMDRTDQKHIKEIILVDDYSD-LYNL----HHDVQEAVDKLNNVIKKEEEMIETNNIDM 228
+ + ++KH KE++ V D D L+NL + EA+ ++ N + E ++ +
Sbjct: 774 DLQNDIEEKHQKELMQVGDEIDKLHNLITKKEEENTEALKQVKNQFRDEINKLKNEKEEA 833
Query: 229 E 229
E
Sbjct: 834 E 834
>UniRef50_A0CXR3 Cluster: Chromosome undetermined scaffold_30, whole
genome shotgun sequence; n=4; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_30,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 1104
Score = 35.5 bits (78), Expect = 4.0
Identities = 39/196 (19%), Positives = 90/196 (45%), Gaps = 15/196 (7%)
Query: 47 RDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLE--EQFGLIRN 104
+++++ L + + Q Y++Q++ K + + + +QQ +K + + +L+ +Q + N
Sbjct: 247 QELQDLLEASETQLQKYQQQNDKLNKQIKELQQKEQQLLKENLNAKENLQQCDQLQNLLN 306
Query: 105 SEDLRIRDKGYNLHAFNTLISQRIGDHRD-----LPDTRNKLCQSQQYFDELPRASIIIC 159
SE +R + +L+ N + ++ D ++ L + +SQQ D + I
Sbjct: 307 SELNDMRSRNESLNQLNQQLDRQNRDFKNECELTLKELTEVKRKSQQQMDLNLQLDEEIE 366
Query: 160 FYNEHYETLMRSVH-------SIMDRTDQKHIKEI-ILVDDYSDLYNLHHDVQEAVDKLN 211
Y E + H ++D+ +K ++I L + + N+ QE +D+L
Sbjct: 367 QYKVEIEQIKTKKHQEISKQRELLDQLKEKSNQKINELKNKLKEAQNIEQYQQEQLDELQ 426
Query: 212 NVIKKEEEMIETNNID 227
+IK+ E ++ I+
Sbjct: 427 ELIKQSENQLKQLQIN 442
>UniRef50_Q0UYN5 Cluster: Putative uncharacterized protein; n=1;
Phaeosphaeria nodorum|Rep: Putative uncharacterized
protein - Phaeosphaeria nodorum (Septoria nodorum)
Length = 693
Score = 35.5 bits (78), Expect = 4.0
Identities = 36/145 (24%), Positives = 63/145 (43%), Gaps = 11/145 (7%)
Query: 21 DIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFA 80
D K ++ AALE +Q R + K K + E R+ + +E+ A
Sbjct: 263 DTSKSSKRDEAHDAALEELRQELEQERAARLSAEKASKKSSSDSAEIEELRQELEQERHA 322
Query: 81 KQQAIKMSKK---TENDLEEQF--GLIRNSEDLRIRDKGY-----NLHAFNTLISQRIGD 130
+Q+A K SKK T+N E+ L + + ++K Y L NT++ ++
Sbjct: 323 RQKAEKASKKGTQTDNSQSEEIKKALEEEKRERKKQEKEYTKTLAELQGRNTVLDDKLSA 382
Query: 131 HRD-LPDTRNKLCQSQQYFDELPRA 154
R+ L T+ KL + + + + RA
Sbjct: 383 FREKLRTTKEKLKEKEAELERVDRA 407
>UniRef50_P37709 Cluster: Trichohyalin; n=2; Eutheria|Rep:
Trichohyalin - Oryctolagus cuniculus (Rabbit)
Length = 1407
Score = 35.5 bits (78), Expect = 4.0
Identities = 20/104 (19%), Positives = 50/104 (48%)
Query: 23 KDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQ 82
++R ++ + E ++L ++ R + E + ++ R+ E RK+ +E+ +Q
Sbjct: 648 RERKLREEEQLLRREEQELRQERERKLREEEQLLQEREEERLRRQERARKLREEEQLLRQ 707
Query: 83 QAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQ 126
+ ++ ++ E L E+ L+R E L +++ L L+ +
Sbjct: 708 EEQELRQERERKLREEEQLLRREEQLLRQERDRKLREEEQLLQE 751
>UniRef50_UPI00006CB606 Cluster: hypothetical protein
TTHERM_00444160; n=1; Tetrahymena thermophila SB210|Rep:
hypothetical protein TTHERM_00444160 - Tetrahymena
thermophila SB210
Length = 2098
Score = 35.1 bits (77), Expect = 5.3
Identities = 23/69 (33%), Positives = 41/69 (59%), Gaps = 5/69 (7%)
Query: 37 EGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLE 96
EG+ +S + R+V L K M + ++Y ++ E+++K+ KEK K QAIK KK ++
Sbjct: 792 EGQQPISKEEREVR--LKKFMEQVEEYTKEHEFQQKI--KEK-QKAQAIKFEKKKIEKIK 846
Query: 97 EQFGLIRNS 105
++ L +S
Sbjct: 847 KKKKLASSS 855
>UniRef50_UPI0000498399 Cluster: Viral A-type inclusion protein
repeat; n=1; Entamoeba histolytica HM-1:IMSS|Rep: Viral
A-type inclusion protein repeat - Entamoeba histolytica
HM-1:IMSS
Length = 1387
Score = 35.1 bits (77), Expect = 5.3
Identities = 47/256 (18%), Positives = 109/256 (42%), Gaps = 22/256 (8%)
Query: 4 NQIVNKATKVHYRPLNWDIKDRIKNNDKSIAALEGKDLLSDQPRDVE-ETLSKTMWKYQD 62
N+ +N+ K + + + +I N +K I ++ K+ ++ E ++K + ++
Sbjct: 945 NEQINEINK-EKENIQKEFEIQIDNKNKEINEIKEKNEKEINEIKIQIEEMNKEKNQLEN 1003
Query: 63 YKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNT 122
K+Q E +++ KE K++ K + E++ IRN + + R+ G +
Sbjct: 1004 LKKQLENENEIIKKENKKKEEENKEMGYLIKENEKKIESIRNEINSKERELGTKIK---- 1059
Query: 123 LISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQK 182
+ + I + +D+ + + + E+ +I I E + I+ + D+
Sbjct: 1060 -LIEMIKNEKDIME--------KDFKKEVDNKNIEIKRLQIDIEKKKNDITLIIQKNDED 1110
Query: 183 HIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEMIETNNIDMEXXXXXXXXXXXXK 242
K I + NL+ ++++ + N+V +KE+E I D++ +
Sbjct: 1111 KKKSI------EEKKNLNQEIEKIKSEKNDV-QKEKEQILLEKEDLQSDFNKYKTQMENE 1163
Query: 243 KSTENSEVKNNVFNIR 258
K E +NN+ N++
Sbjct: 1164 KLQIKEEHENNITNLQ 1179
>UniRef50_Q97H37 Cluster: Glycosyltransferase domain containing
protein; n=1; Clostridium acetobutylicum|Rep:
Glycosyltransferase domain containing protein -
Clostridium acetobutylicum
Length = 937
Score = 35.1 bits (77), Expect = 5.3
Identities = 20/66 (30%), Positives = 38/66 (57%), Gaps = 3/66 (4%)
Query: 150 ELPRASIIICFYN-EHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVD 208
E P SII+C N + Y L +S+ I+++ + E+I+VDD D L+ +++ ++
Sbjct: 21 ESPNVSIILCLINIKDYNKLDKSISLILNQEYRNF--ELIVVDDGQDDELLYTKIKDYIN 78
Query: 209 KLNNVI 214
K N ++
Sbjct: 79 KDNRIV 84
>UniRef50_Q8EWP8 Cluster: Predicted cytoskeletal protein; n=1;
Mycoplasma penetrans|Rep: Predicted cytoskeletal protein
- Mycoplasma penetrans
Length = 3317
Score = 35.1 bits (77), Expect = 5.3
Identities = 24/91 (26%), Positives = 47/91 (51%), Gaps = 5/91 (5%)
Query: 39 KDLLSDQPRDVEETLSKTMWKYQDYKR--QSEYRRKVM-LKEKFAK--QQAIKMSKKTEN 93
K L +D + + + K + K++D K Q+E+ +++ KE K ++ I ++ +
Sbjct: 3047 KLLFNDSSENQDPEIKKILSKFEDSKEIIQNEFNQELQAFKESIFKVREKEINDYRQQVS 3106
Query: 94 DLEEQFGLIRNSEDLRIRDKGYNLHAFNTLI 124
D+E++ IRNS + +K L A + LI
Sbjct: 3107 DIEKEVLKIRNSNSINDENKNKELEAIDNLI 3137
>UniRef50_Q50EX9 Cluster: P-553; n=5; Borrelia|Rep: P-553 - Borrelia
hermsii
Length = 760
Score = 35.1 bits (77), Expect = 5.3
Identities = 53/275 (19%), Positives = 104/275 (37%), Gaps = 12/275 (4%)
Query: 22 IKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAK 81
+KD + N ++ L+ K +++ +++ L K K + + K LK+ K
Sbjct: 316 LKDALHKNTHKLSELDNK--INNNKETLKDALHKNTHKLSELDNKIN-NNKETLKDALLK 372
Query: 82 Q--QAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRN 139
+ ++ K N+ E I+ DL + N N I QR+ + L D N
Sbjct: 373 NTHKLSELDDKINNNKETLNNNIQRLSDLDDKINN-NKETLNNNI-QRLSN---LDDKIN 427
Query: 140 KLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNL 199
+ + + R S + N + ETL ++ + D D+ + + L ++ L +L
Sbjct: 428 N--NKETLNNNIQRLSDLDDKINNNKETLNNNIQRLSDLDDKINNNKETLNNNIQRLSDL 485
Query: 200 HHDVQEAVDKLNNVIKKEEEMIETNNIDMEXXXXXXXXXXXXKKSTENSEVKNNVFNIRL 259
+ + LNN I++ ++ + N + E N+E F L
Sbjct: 486 DDKINNNKETLNNNIQRLSDLDDKINNNKETLNNNIQRLSDLDDKINNNEDTLLAFQKEL 545
Query: 260 LKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVN 294
+ G + +S+ V+ L I+ N
Sbjct: 546 IDLKNNHGTSLKNIENNLSSIEKVINILKDKIDKN 580
>UniRef50_Q1FP00 Cluster: Helix-turn-helix, AraC type precursor;
n=1; Clostridium phytofermentans ISDg|Rep:
Helix-turn-helix, AraC type precursor - Clostridium
phytofermentans ISDg
Length = 784
Score = 35.1 bits (77), Expect = 5.3
Identities = 22/87 (25%), Positives = 39/87 (44%)
Query: 11 TKVHYRPLNWDIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYR 70
TK Y P+N +K I N L+ +D L + E +S+ + D K +Y
Sbjct: 330 TKHIYVPVNKVVKQFIHKNKNISNDLKIEDELGFIVKSYENAVSQISIQQSDLKSSKKYI 389
Query: 71 RKVMLKEKFAKQQAIKMSKKTENDLEE 97
R +K + + + + + +ND+EE
Sbjct: 390 RSYWIKRLLMESKMLSLEELKKNDVEE 416
>UniRef50_Q0TSM9 Cluster: Bacterial sugar transferase family
protein; n=3; Bacteria|Rep: Bacterial sugar transferase
family protein - Clostridium perfringens (strain ATCC
13124 / NCTC 8237 / Type A)
Length = 200
Score = 35.1 bits (77), Expect = 5.3
Identities = 24/95 (25%), Positives = 43/95 (45%), Gaps = 3/95 (3%)
Query: 133 DLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDD 192
+LP N L + P + FY E Y+ +++ I D ++I E ++
Sbjct: 96 ELPQLYNVLKGDMSFVGPRPEVKKYVKFYEEEYDEILKIKPGITDLASIEYIDENTIISK 155
Query: 193 YSDLYNLHHDVQEAVDKLNNVIKKE-EEMIETNNI 226
YSD ++ ++E + K + K+ EEM N+I
Sbjct: 156 YSDPEKVY--IEEVLPKKLMLNKRYIEEMSIKNDI 188
>UniRef50_A3ZWW6 Cluster: Putative uncharacterized protein; n=1;
Blastopirellula marina DSM 3645|Rep: Putative
uncharacterized protein - Blastopirellula marina DSM
3645
Length = 286
Score = 35.1 bits (77), Expect = 5.3
Identities = 21/83 (25%), Positives = 36/83 (43%), Gaps = 1/83 (1%)
Query: 374 PTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSL-ELIPCSRVGHVFR 432
P M GG A++R+ + + YD WG E+ ++ R++ G + +R H++
Sbjct: 175 PRMKGGNIALWRDDYETVNGYDQDFVGWGLEDSDLQRRLYQAGVRFRSSMRWTRTHHLWH 234
Query: 433 KRRPYGVGEKQDYMLQNSMRMAR 455
R P V + MR R
Sbjct: 235 ARDPSYVARASGTDNEKLMRANR 257
>UniRef50_A3N1J8 Cluster: Type I restriction enzyme, R subunit; n=4;
Bacteria|Rep: Type I restriction enzyme, R subunit -
Actinobacillus pleuropneumoniae serotype 5b (strain L20)
Length = 1044
Score = 35.1 bits (77), Expect = 5.3
Identities = 25/86 (29%), Positives = 42/86 (48%), Gaps = 4/86 (4%)
Query: 110 IRDKGYNLHAFNTLISQRIGDH-RDLPDTRNKLCQSQ-QYFDELPRASIIICFYNEHYET 167
I D + L +Q+ ++ +DL R KL ++ +FDELP+ + YN T
Sbjct: 574 ICDSSDQARMMHFLFNQKYAENPQDLTAYREKLAENDPHFFDELPKVAESTATYNLAKRT 633
Query: 168 LMRSVHSIMDRTDQKHIKEIILVDDY 193
+ + + D D+ + K+ LVDDY
Sbjct: 634 VTSASLILSDEGDKTYRKD--LVDDY 657
>UniRef50_A0W610 Cluster: Glycosyl transferase, family 2; n=2;
Geobacter|Rep: Glycosyl transferase, family 2 -
Geobacter lovleyi SZ
Length = 280
Score = 35.1 bits (77), Expect = 5.3
Identities = 26/100 (26%), Positives = 43/100 (43%), Gaps = 4/100 (4%)
Query: 371 LKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHV 430
++S T+ G F +R F IG +D + E+ ++ R G L + + + H
Sbjct: 147 VRSGTVNGICFMAHRRVFETIGVFDENFRIGQYEDKDLFLRARRAGFRLGTVGAAFIHH- 205
Query: 431 FRKRRPYGVG---EKQDYMLQNSMRMARVWMDDYVKKVIE 467
F VG E +DY L N +R W + K++ E
Sbjct: 206 FGSITQKAVGARRETRDYALANKAYFSRKWHLPWWKRLSE 245
>UniRef50_Q8I569 Cluster: Putative uncharacterized protein; n=1;
Plasmodium falciparum 3D7|Rep: Putative uncharacterized
protein - Plasmodium falciparum (isolate 3D7)
Length = 1662
Score = 35.1 bits (77), Expect = 5.3
Identities = 47/214 (21%), Positives = 89/214 (41%), Gaps = 17/214 (7%)
Query: 22 IKDRIKNNDKSIAALEGK---DLLSDQPRDVEETLSKTMWKYQDYKRQSEYRR--KVMLK 76
IK ++ DK LEGK D DQ DV + + K ++ + R+ + K
Sbjct: 997 IKKIVEEKDK----LEGKSKEDKKGDQQSDVHQIIDKKQGGIENQSLEESKRKAKNIQQK 1052
Query: 77 EKFAKQQAIKMS-KKTENDLEEQFG-LIRNSEDLRIRDKGYNLHAFN--TLISQRIGDHR 132
+ + +K KKTE +L + N +D +D N+H N +++ +
Sbjct: 1053 DNTKNAELLKCDEKKTEGNLSNTLTHYVTNQQDNFDQDIENNIHNQNEENNLNKNTSKGK 1112
Query: 133 DLPDTRNKLCQSQQY-FDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVD 191
+ NK+ ++ +++ + I N L + + + IKE + D
Sbjct: 1113 KKIENNNKVVDTKILNIEDIIKMKSI---NNNDNIKLPEEIKEYIKKMKSGGIKENVEYD 1169
Query: 192 DYSDLYNLHHDVQEAVDKLNNVIKKEEEMIETNN 225
Y N + +V+E VD + V +KE + ++ N+
Sbjct: 1170 IYHKFINNYFNVKEYVDGIIVVEEKEVKAMKGND 1203
>UniRef50_Q8I4T0 Cluster: Putative uncharacterized protein; n=1;
Plasmodium falciparum 3D7|Rep: Putative uncharacterized
protein - Plasmodium falciparum (isolate 3D7)
Length = 656
Score = 35.1 bits (77), Expect = 5.3
Identities = 23/95 (24%), Positives = 42/95 (44%)
Query: 134 LPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDY 193
+ + +L + F +L + +I E Y+ + ++ D K L+ Y
Sbjct: 434 IENDEKELIDQKNIFLKLSKILNLIHLSYEQYDEIQNEKCTMTKNEDDYTKKYEELLIKY 493
Query: 194 SDLYNLHHDVQEAVDKLNNVIKKEEEMIETNNIDM 228
++ N D +E +D IKK E+M+ T NI+M
Sbjct: 494 ENVKNQVKDKKEILDNKKQEIKKVEDMLNTYNIEM 528
>UniRef50_Q7RIN9 Cluster: Putative uncharacterized protein PY03578;
n=8; Plasmodium (Vinckeia)|Rep: Putative uncharacterized
protein PY03578 - Plasmodium yoelii yoelii
Length = 1527
Score = 35.1 bits (77), Expect = 5.3
Identities = 44/231 (19%), Positives = 104/231 (45%), Gaps = 16/231 (6%)
Query: 5 QIVNKATKVHYRPLNWDIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYK 64
Q + TK L D + K N +++ + + ++ ++ + K KY+
Sbjct: 764 QKIELKTKEKIEELKQDFEKTQKINMENLEMEKESFINNNLENEINKMKEKLEEKYETQI 823
Query: 65 RQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQF--GLIRNSEDL---RIRDKGYNLHA 119
+++E + K +KE+ K + + +++ N E++ L +N D I +K + +
Sbjct: 824 KETEMKYKYQIKEEIEKTK--QNAEQNFNSKFEKYKENLEKNKNDFINNLIIEKNNEIES 881
Query: 120 F-NTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDR 178
F N + ++ + + KL Q+ F+ + + I N+H E + + + ++
Sbjct: 882 FKNDIEQKKFKEMEKFKEENEKLLQNN--FENM--KNHFIEEQNKHIENIKKE-YELIKN 936
Query: 179 TDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEMI---ETNNI 226
+ +++KE + + ++ N+ + + +K + +KKE E I E NN+
Sbjct: 937 NEIEYLKEEMKKNKIQEIENVELKLADEKNKHIDDMKKELENIYNVEINNL 987
>UniRef50_Q54G05 Cluster: Putative uncharacterized protein; n=1;
Dictyostelium discoideum AX4|Rep: Putative
uncharacterized protein - Dictyostelium discoideum AX4
Length = 1492
Score = 35.1 bits (77), Expect = 5.3
Identities = 50/294 (17%), Positives = 131/294 (44%), Gaps = 28/294 (9%)
Query: 1 MSGNQI-VNKA-TKVHYRPLNWDIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMW 58
+ NQ+ VN+ +K++ + +N I I+NN S+ L+ K L+++ ++ + +
Sbjct: 696 IQSNQVTVNELQSKLNEKEIN--INQLIENNQSSLDELQSK--LNEKQNEINQLIENNQS 751
Query: 59 KYQDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLH 118
+ + + + + + + + + I+ ++ + ++L+ + LI+ S++L+ +D+ L
Sbjct: 752 SSDELQSKLNEKHQEISELQSKLNELIENNESSSDELQSK--LIQLSDELKEKDE--KLK 807
Query: 119 AFNTLISQRIGDHRDLPDT-RNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMD 177
+ +++I + L + ++ L + Q +E NE E S + +
Sbjct: 808 SLDSIIIENQEKLVQLTKSNQDSLDELQSKLNEKQNE------INELIENNQSSSNELQS 861
Query: 178 RTDQKHIKEIILVDD--------YSDLYNLHHDVQEAVDKLNNVIKKEEEMIETNNIDME 229
+ ++K + +L+++ S L H ++ E KLN K E++E N +
Sbjct: 862 KLNEKQNEINLLIENNQSSSDELQSKLNEKHQEINELQSKLNEKQNKINELVENNESSSD 921
Query: 230 XXXXXXXXXXXXKKSTENSEVKNNVFNIRLLKTSKREGLIRARLYGADNSVGDV 283
+ EN ++K+ F +++ ++ ++++L N + +
Sbjct: 922 ELQSKLIQLSDQLQEKEN-QLKS--FESSIIERDEKLNQLQSKLNEKQNEIDQI 972
>UniRef50_Q4D320 Cluster: Putative uncharacterized protein; n=3;
Trypanosoma cruzi|Rep: Putative uncharacterized protein
- Trypanosoma cruzi
Length = 926
Score = 35.1 bits (77), Expect = 5.3
Identities = 43/203 (21%), Positives = 89/203 (43%), Gaps = 14/203 (6%)
Query: 96 EEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFD-----E 150
EE L+R+ + + R + NTL+S +I D +L + + Q YF +
Sbjct: 596 EESLILLRSEVEAQ-RKRCEEFKNMNTLLSDKIVDAENLLQEMSTASEKQIYFSHYFTRQ 654
Query: 151 LPRASIIICFYNEHY-ETLMRSVHSIMDRTDQKHIKE-IILVDDYSDLYNLHHDVQEAVD 208
L + ++ F++EHY + + V+ ++ D+ + KE ++D + L + + +E
Sbjct: 655 LDDTAGLLMFFSEHYVNWIFKKVNIVVMEFDKAYAKENERVLDALNHLRKVQEETKERYT 714
Query: 209 KLNNVIKKEE-EMIETNNIDMEXXXXXXXXXXXXKKSTENSE-VKNNVFNIRLLKTSKRE 266
L + E + E +E +K + +E +K ++ +R L + +
Sbjct: 715 NLQMKLNDAEFRLGEREKASVEKFENFLSEIRLIEKQRDVAEKMKKDI--LRKLDSVNDQ 772
Query: 267 GLIRARLYGADNSVGDVLVFLDS 289
L A + D + ++LV LD+
Sbjct: 773 HL--AEINKKDEQIHELLVNLDA 793
>UniRef50_Q28X57 Cluster: GA21145-PA; n=2; Sophophora|Rep:
GA21145-PA - Drosophila pseudoobscura (Fruit fly)
Length = 399
Score = 35.1 bits (77), Expect = 5.3
Identities = 20/77 (25%), Positives = 36/77 (46%)
Query: 375 TMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKR 434
T+ GG+ A+ RE+F A+ + WGGE+ ++S R+ + P + + K
Sbjct: 285 TIFGGVSAMTREHFQAVNGFSNSFFGWGGEDDDMSNRLKHANLFISRYPVNIARYKMLKH 344
Query: 435 RPYGVGEKQDYMLQNSM 451
+ K+ +QN M
Sbjct: 345 QKEKANPKRYENIQNGM 361
>UniRef50_Q25662 Cluster: Repeat organellar protein; n=5; Plasmodium
(Vinckeia)|Rep: Repeat organellar protein - Plasmodium
chabaudi
Length = 1939
Score = 35.1 bits (77), Expect = 5.3
Identities = 41/206 (19%), Positives = 97/206 (47%), Gaps = 17/206 (8%)
Query: 20 WDIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKF 79
++++D++K KSI AL K + + +EE L K + ++ + EY +++ K +F
Sbjct: 75 YELEDQLKETLKSITALSIK--VKEYEVKIEE-LEKELKLEKEKQINKEYEKELNEKSEF 131
Query: 80 AKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRN 139
K+Q +++ K+ E ++ + I N E + ++ + N + S+ I +++
Sbjct: 132 IKRQ-MELLKEKELNINLKENKINNKEIITLKRE----EKLNDIESEYIEKNKEKEKLNY 186
Query: 140 KLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQ-KHIKEIILVDD--YSDL 196
++ + D+L C E + L + ++++ + + +KE + + L
Sbjct: 187 EVTNIKMSLDKL------TCEVQEKKDNLEKINKKVIEKENNLRELKEFMKEKNEIIESL 240
Query: 197 YNLHHDVQEAVDKLNNVIKKEEEMIE 222
+D + A +KL +++ +MIE
Sbjct: 241 DGTINDKKNAYEKLEISFEEKRKMIE 266
Score = 35.1 bits (77), Expect = 5.3
Identities = 51/253 (20%), Positives = 112/253 (44%), Gaps = 20/253 (7%)
Query: 21 DIKDRIKNNDKSIAALEG-KDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKF 79
D+K++I + + LE K++L+D+ ++++ + K + K ++E + L +
Sbjct: 1265 DLKNKILDLSNELINLENMKNVLTDENNNLKKEIEIKDNKLNE-KEKNENTEILNLNDDI 1323
Query: 80 AK-QQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTR 138
K ++ I K E L ++ ++N D+ +K Y + N +I + ++ +
Sbjct: 1324 IKLKKEISEWKDEEEKLTKENIKLKN--DIEQINKEYKIKEENLMIKFN-ENINEVTSLK 1380
Query: 139 NKLCQSQQYFDELPRASIIICFYNEHYETLM---RSVHSIMDRTDQKHIKEIILVDDYSD 195
N++ + +EL N++YE L+ R + + D K ++ IL D S
Sbjct: 1381 NQIEIEKMKLEEL----------NKNYELLLAEKRETNMSISNDDNKIVENNILEDTDSK 1430
Query: 196 LYNLHHDVQEAVDKLNNVIKKEEEMIETNNI-DMEXXXXXXXXXXXXKKSTENSEVKNNV 254
NL+ +V++ N K ++ E + + D +K++ + +VKN
Sbjct: 1431 QNNLNKNVEDKTGDDINCEKNNDQAKEISYLKDEIKKISMLYGEELNRKNSYDEKVKNLT 1490
Query: 255 FNIRLLKTSKREG 267
++ LK ++G
Sbjct: 1491 NELKELKIRNKKG 1503
>UniRef50_Q236I9 Cluster: Putative uncharacterized protein; n=1;
Tetrahymena thermophila SB210|Rep: Putative
uncharacterized protein - Tetrahymena thermophila SB210
Length = 783
Score = 35.1 bits (77), Expect = 5.3
Identities = 36/187 (19%), Positives = 82/187 (43%), Gaps = 6/187 (3%)
Query: 39 KDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKV-MLKEKFAKQQAIKMSKKTENDLEE 97
+D L D+ + ++E L+++ + + Q Y + + K +++ +TE L+E
Sbjct: 352 RDKLKDKNKQLKEELNQSFKDKKLLEMQVNYEGYINQVNSKLEEKEKQLQRIQTEIKLKE 411
Query: 98 QFGLIRNSE--DLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRAS 155
+R E +++++ K Q I NKL Q Q+ +++
Sbjct: 412 AELKLRQDEIQNIKLQQKKQQSQNNTFNAQQSIQSCSSCEILNNKLQQEQEI--SFQKSN 469
Query: 156 IIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIK 215
+ N+ E + R + +D+ +QK+ + DY + +++ E +++LNN I
Sbjct: 470 ELQSQLNQQKEKV-RILEDDLDQVNQKYQEVCEKQKDYDQIIQDKNELNEQINQLNNTIN 528
Query: 216 KEEEMIE 222
+++ E
Sbjct: 529 EQKIKFE 535
>UniRef50_A5KBH9 Cluster: Putative uncharacterized protein; n=1;
Plasmodium vivax|Rep: Putative uncharacterized protein -
Plasmodium vivax
Length = 1860
Score = 35.1 bits (77), Expect = 5.3
Identities = 47/214 (21%), Positives = 95/214 (44%), Gaps = 18/214 (8%)
Query: 16 RPLNWDIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVML 75
R ++ +++++K +SI +L K + + +E+ L K + +D + Y +++
Sbjct: 79 RNKDYQLEEQLKETLRSITSLSTK--IVNYETKIED-LEKELKMEKDKQVDKAYEKELKE 135
Query: 76 KEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLP 135
KE F KQ+ I M + EN L E+ L N + +I D+ F + ++ D ++
Sbjct: 136 KENFIKQK-IGMLNEKENLLNEK-ELDINMREEKINDR----EMFISKKEDKLNDMQEQY 189
Query: 136 DTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSD 195
+NK + ++ E+ I + E L V D + K I+ + +
Sbjct: 190 LEKNK--EKEKLHFEIADIKISL-------EKLKYEVKDKKDCLENVSNKVILKENTLRE 240
Query: 196 LYNLHHDVQEAVDKLNNVIKKEEEMIETNNIDME 229
L + E ++ LN I ++E++ E D+E
Sbjct: 241 LKEFIREKNEMIESLNEKITEKEKIYEQLGKDVE 274
>UniRef50_A2DDX5 Cluster: Viral A-type inclusion protein, putative;
n=1; Trichomonas vaginalis G3|Rep: Viral A-type
inclusion protein, putative - Trichomonas vaginalis G3
Length = 1794
Score = 35.1 bits (77), Expect = 5.3
Identities = 36/219 (16%), Positives = 95/219 (43%), Gaps = 11/219 (5%)
Query: 21 DIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFA 80
++ + I+ NDK + ++L + ++ ++K + D K+ ++ ++ ++ +
Sbjct: 276 ELANLIEENDKKQGTQQNQNLNQNDEDAIQSLVTKYEEEIDDIKKNNQNEKENLINQINE 335
Query: 81 KQQAIKMSK-KTENDLEEQFGLIRNSE---DLRIRDKGYNLHAFNTLI---SQRIGDHRD 133
+ ++K + +ENDL E +I + + +I+D NL + + SQ++ + +
Sbjct: 336 LKNSLKNKEISSENDLNEMKIIIEQTSKDYETKIQDLMTNLEENSQKLNEMSQKLKESEE 395
Query: 134 LPDTRNKLCQSQQYFDELPRASII----ICFYNEHYETLMRSVHSIMDRTDQKHIKEIIL 189
N++ Q D I + NE +T++ ++ + L
Sbjct: 396 KNQKLNEMSMLQASNDAEKEKFIKEISNLTKENEKLQTVLNENEKNRTENERLVAENQKL 455
Query: 190 VDDYSDLYNLHHDVQEAVDKLNNVIKKEEEMIETNNIDM 228
D ++ ++ ++Q ++KL ++K E+ E + +
Sbjct: 456 NSDLHEIGEVNKNLQTEIEKLTEIMKSEQNNKENEMMSL 494
>UniRef50_A0E4N3 Cluster: Chromosome undetermined scaffold_78, whole
genome shotgun sequence; n=1; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_78,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 842
Score = 35.1 bits (77), Expect = 5.3
Identities = 21/95 (22%), Positives = 49/95 (51%), Gaps = 2/95 (2%)
Query: 19 NWDIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEK 78
+WD DRI++ + I + K+ + D +D E+ K + +D +R+ E ++ +++
Sbjct: 359 DWDRDDRIEDEQEPIKLKDKKEKIKD--KDNEKDREKEKQREKDKEREKEKEKEKEREKE 416
Query: 79 FAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDK 113
+++ + K+ E + E++ R E + R+K
Sbjct: 417 KEREREREKEKEREREREKEKEREREREKEKEREK 451
>UniRef50_Q8SVH3 Cluster: Putative uncharacterized protein
ECU05_1300; n=1; Encephalitozoon cuniculi|Rep: Putative
uncharacterized protein ECU05_1300 - Encephalitozoon
cuniculi
Length = 961
Score = 35.1 bits (77), Expect = 5.3
Identities = 22/72 (30%), Positives = 33/72 (45%), Gaps = 3/72 (4%)
Query: 463 KKVIEVNPSAAHVEIGDISERKALRERLQCKTFKW---YLDNMWFETDRSELVLGRTLCL 519
KK++ ++ + VEI D R +R +CK W L ++FE RS + GR L
Sbjct: 32 KKMLRISNRSKGVEINDEQMRTVIRSMHRCKDTHWVNAVLQRLYFEISRSYAIEGRIKSL 91
Query: 520 DASNNVAPILGK 531
A +GK
Sbjct: 92 ILKRFEASGIGK 103
>UniRef50_Q5J6J3 Cluster: Dipeptidyl-peptidase IV; n=16;
Pezizomycotina|Rep: Dipeptidyl-peptidase IV -
Trichophyton rubrum
Length = 775
Score = 35.1 bits (77), Expect = 5.3
Identities = 21/74 (28%), Positives = 39/74 (52%), Gaps = 3/74 (4%)
Query: 314 VRYSARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKS 373
V Y + +TP+++ + + Y+ S +GG+ + L ++ N+P L + +D KPLK+
Sbjct: 416 VSYDTKVMTPLVN--DKEAAYYTASFSAKGGY-YILSYQGPNVPYQELYSTKDSKKPLKT 472
Query: 374 PTMAGGLFAIYREY 387
T L +EY
Sbjct: 473 ITSNDALLEKLKEY 486
>UniRef50_Q2FUG3 Cluster: Sensor protein; n=1; Methanospirillum
hungatei JF-1|Rep: Sensor protein - Methanospirillum
hungatei (strain JF-1 / DSM 864)
Length = 765
Score = 35.1 bits (77), Expect = 5.3
Identities = 51/199 (25%), Positives = 83/199 (41%), Gaps = 20/199 (10%)
Query: 21 DIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQ----------DYKRQSEYR 70
DI DR + + + E LL+++ RD+ L WKY+ Y ++ YR
Sbjct: 285 DITDRKRMESALLDSEEKYRLLAEKSRDLIFMLRLPEWKYEYISPSVLEITGYTQEEFYR 344
Query: 71 RKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGD 130
+L+ A +++ +K DL G+I + + +I K L +SQR
Sbjct: 345 NPDLLRRCIA-PHSLEYFEKAYKDLLN--GIIPETYEFQIITKSGEL----KWVSQRNSP 397
Query: 131 HRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILV 190
D T L E + I+C + YETL SV + + TD+ II V
Sbjct: 398 ITDKSGTITALQGIVTDITERKTSEEIVCEAMQKYETLFSSVPTGIIVTDEN--GAIIEV 455
Query: 191 DDYSDLYNLHHDVQEAVDK 209
+ ++ LH +E +DK
Sbjct: 456 NQHAARI-LHVSREELIDK 473
>UniRef50_A3HAB6 Cluster: Putative uncharacterized protein; n=1;
Caldivirga maquilingensis IC-167|Rep: Putative
uncharacterized protein - Caldivirga maquilingensis
IC-167
Length = 184
Score = 35.1 bits (77), Expect = 5.3
Identities = 16/46 (34%), Positives = 27/46 (58%)
Query: 114 GYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIIC 159
GY L NT++++ + +H LP L ++++Y EL R+ IIC
Sbjct: 44 GYYLKVLNTVLNKNLKNHHPLPRLLASLERTKRYLLELSRSKQIIC 89
>UniRef50_Q10411 Cluster: Sporulation-specific protein 15; n=1;
Schizosaccharomyces pombe|Rep: Sporulation-specific
protein 15 - Schizosaccharomyces pombe (Fission yeast)
Length = 1957
Score = 35.1 bits (77), Expect = 5.3
Identities = 46/209 (22%), Positives = 94/209 (44%), Gaps = 16/209 (7%)
Query: 19 NWDIKDRIKNNDKSIAALEGK-DLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKE 77
N + D +KNN ++IA+L+ + + + D++ LS +Y++ S K L++
Sbjct: 1013 NERLMDDLKNNGENIASLQTEIEKKRAENDDLQSKLSVVSSEYENLLLISSQTNK-SLED 1071
Query: 78 KFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDT 137
K + + I+ + + D ++Q RN E + K L N I + R
Sbjct: 1072 KTNQLKYIEKNVQKLLDEKDQ----RNVELEELTSKYGKLGEENAQIKDELLALRKKSKK 1127
Query: 138 RNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLY 197
++ LC + + D+L S + E L + ++ +Q + LV++ SDL
Sbjct: 1128 QHDLCAN--FVDDLKEKS-------DALEQLTNEKNELIVSLEQSNSNNEALVEERSDLA 1178
Query: 198 NLHHDVQEAVDKLNNVIKK-EEEMIETNN 225
N D+++++ +NVI +++ N+
Sbjct: 1179 NRLSDMKKSLSDSDNVISVIRSDLVRVND 1207
>UniRef50_UPI00015BC953 Cluster: UPI00015BC953 related cluster; n=1;
unknown|Rep: UPI00015BC953 UniRef100 entry - unknown
Length = 514
Score = 34.7 bits (76), Expect = 7.0
Identities = 43/207 (20%), Positives = 98/207 (47%), Gaps = 17/207 (8%)
Query: 18 LNWDIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKE 77
+N DI R KS + G ++ R V ET +++ +Q QS+ +K E
Sbjct: 62 INEDIVRRETKAGKSKFYING---MTSNQRTVLETFGQSVM-FQAQSSQSKIFKKHHQLE 117
Query: 78 KFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDT 137
K + I+ KK + E F ++ E+L ++D + + + + I +++L ++
Sbjct: 118 ILDKDKDIQRKKK---EFVEYFDALKQKEEL-LKDLLFQKES----LEKEIEKNKELIES 169
Query: 138 RNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEI-ILVDDYSDL 196
L + ++++ + + + E T +++V + ++ +D I +I + +
Sbjct: 170 LEALNLEKDTYNDIKLKAEELQ-HAEKINTYIQNVLNALEYSDHSAISKINYSISQINQA 228
Query: 197 YNLHHDVQEAVDKLNNVIKKEEEMIET 223
+ D+Q+A+DKLN + +++++ET
Sbjct: 229 LSYKEDLQKAIDKLNAL---KDQLLET 252
>UniRef50_UPI00015B53AD Cluster: PREDICTED: similar to RHO kinase,
putative; n=1; Nasonia vitripennis|Rep: PREDICTED:
similar to RHO kinase, putative - Nasonia vitripennis
Length = 1419
Score = 34.7 bits (76), Expect = 7.0
Identities = 44/187 (23%), Positives = 82/187 (43%), Gaps = 19/187 (10%)
Query: 49 VEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLI--RNSE 106
+E+ K D +QS ++ +E + ++ + N EE L RN +
Sbjct: 816 IEQEQQKRNVLQSDLAQQSSEVSRLKAREHQLVGEVTQLREAKRNIEEELHHLKTQRNVD 875
Query: 107 DLRIRDKGYNLHA---FNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASII----IC 159
L+ ++ L A F+TL + + R+ D + +L QQ +E R+S++ +
Sbjct: 876 QLQTKELQEQLEAEAYFSTLYKTQAQELREELDEKTRL---QQELEE-ERSSLVHQLQLS 931
Query: 160 FYNEHYETLMRSV--HSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIKKE 217
E L RS+ ++ D ++ +KE+ +Y D + HH A D+L N +K+
Sbjct: 932 LARGDSEALARSIAEETVADLEKERTMKEL----EYKDSTSKHHQELNAKDQLINRLKES 987
Query: 218 EEMIETN 224
E + N
Sbjct: 988 EAEAKKN 994
>UniRef50_UPI0000DAE6EA Cluster: hypothetical protein
Rgryl_01001034; n=1; Rickettsiella grylli|Rep:
hypothetical protein Rgryl_01001034 - Rickettsiella
grylli
Length = 753
Score = 34.7 bits (76), Expect = 7.0
Identities = 22/94 (23%), Positives = 50/94 (53%), Gaps = 1/94 (1%)
Query: 124 ISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKH 183
I ++I ++L + RNK C ++ Y D +I C +NE+ + + +V IM + +K
Sbjct: 579 IWRKIVKLQNLAERRNKEC-AELYLDLAKSLQMIKCDFNENTDATLSAVLIIMQKIKEKS 637
Query: 184 IKEIILVDDYSDLYNLHHDVQEAVDKLNNVIKKE 217
+ + + + S L++ ++ E +++ + +K E
Sbjct: 638 LNAVRGLFNRSYLHDRVVELAERFERILHAMKAE 671
Database: uniref50
Posted date: Oct 5, 2007 11:19 AM
Number of letters in database: 575,637,011
Number of sequences in database: 1,657,284
Lambda K H
0.319 0.136 0.410
Gapped
Lambda K H
0.279 0.0580 0.190
Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 673,051,384
Number of Sequences: 1657284
Number of extensions: 29326428
Number of successful extensions: 82024
Number of sequences better than 10.0: 299
Number of HSP's better than 10.0 without gapping: 145
Number of HSP's successfully gapped in prelim test: 154
Number of HSP's that attempted gapping in prelim test: 81219
Number of HSP's gapped (non-prelim): 678
length of query: 589
length of database: 575,637,011
effective HSP length: 105
effective length of query: 484
effective length of database: 401,622,191
effective search space: 194385140444
effective search space used: 194385140444
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)
S2: 75 (34.3 bits)
- SilkBase 1999-2023 -