SilkBase

Last updated: 2022/11/18
BLASTP 2.2.12 [Aug-07-2005]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= BGIBMGA000733-TA|BGIBMGA000733-PA|IPR001173|Glycosyl
transferase, family 2, IPR000772|Ricin B lectin, IPR008997|Ricin
B-related lectin
         (589 letters)

Database: uniref50 
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

UniRef50_Q16ZW8 Cluster: N-acetylgalactosaminyltransferase; n=4;...   353   1e-95
UniRef50_Q8MVS5 Cluster: Polypeptide N-acetylgalactosaminyltrans...   321   3e-86
UniRef50_Q8NCW6 Cluster: Polypeptide N-acetylgalactosaminyltrans...   316   1e-84
UniRef50_Q6WV19 Cluster: Polypeptide N-acetylgalactosaminyltrans...   288   2e-76
UniRef50_Q7K755 Cluster: Putative polypeptide N-acetylgalactosam...   286   1e-75
UniRef50_Q8I136 Cluster: Polypeptide N-acetylgalactosaminyltrans...   282   2e-74
UniRef50_P34678 Cluster: Polypeptide N-acetylgalactosaminyltrans...   279   1e-73
UniRef50_A7SZ28 Cluster: Predicted protein; n=1; Nematostella ve...   276   1e-72
UniRef50_A7SDQ3 Cluster: Predicted protein; n=1; Nematostella ve...   275   2e-72
UniRef50_Q10471 Cluster: Polypeptide N-acetylgalactosaminyltrans...   272   2e-71
UniRef50_A7RGG9 Cluster: Predicted protein; n=3; Eumetazoa|Rep: ...   266   8e-70
UniRef50_Q8N428 Cluster: Putative polypeptide N-acetylgalactosam...   266   1e-69
UniRef50_Q5DD76 Cluster: SJCHGC09400 protein; n=2; Schistosoma j...   264   5e-69
UniRef50_Q95ZJ1 Cluster: Polypeptide N-acetylgalactosaminyltrans...   263   1e-68
UniRef50_A2AQQ1 Cluster: UDP-N-acetyl-alpha-D-galactosamine: pol...   261   3e-68
UniRef50_Q10472 Cluster: Polypeptide N-acetylgalactosaminyltrans...   261   3e-68
UniRef50_A7RRV7 Cluster: Predicted protein; n=1; Nematostella ve...   260   6e-68
UniRef50_Q4RQL8 Cluster: Chromosome 2 SCAF15004, whole genome sh...   259   2e-67
UniRef50_UPI00015B515F Cluster: PREDICTED: similar to n-acetylga...   256   9e-67
UniRef50_Q96FL9 Cluster: Polypeptide N-acetylgalactosaminyltrans...   256   1e-66
UniRef50_UPI00015B5D50 Cluster: PREDICTED: similar to ENSANGP000...   254   4e-66
UniRef50_UPI00015B453F Cluster: PREDICTED: similar to GA20875-PA...   254   4e-66
UniRef50_Q8IXK2 Cluster: Polypeptide N-acetylgalactosaminyltrans...   253   8e-66
UniRef50_O61394 Cluster: Probable N-acetylgalactosaminyltransfer...   252   3e-65
UniRef50_UPI0000E4974C Cluster: PREDICTED: hypothetical protein;...   242   2e-62
UniRef50_UPI0000D564C6 Cluster: PREDICTED: similar to CG8182-PA,...   241   4e-62
UniRef50_Q9D4M9 Cluster: Putative polypeptide N-acetylgalactosam...   241   4e-62
UniRef50_Q86SR1 Cluster: Polypeptide N-acetylgalactosaminyltrans...   239   1e-61
UniRef50_Q6WV20 Cluster: Polypeptide N-acetylgalactosaminyltrans...   239   2e-61
UniRef50_Q9U2C4 Cluster: Probable N-acetylgalactosaminyltransfer...   238   3e-61
UniRef50_Q14435 Cluster: Polypeptide N-acetylgalactosaminyltrans...   238   3e-61
UniRef50_O45293 Cluster: Probable N-acetylgalactosaminyltransfer...   236   1e-60
UniRef50_UPI0000D56CDA Cluster: PREDICTED: similar to CG4445-PA;...   233   7e-60
UniRef50_O61397 Cluster: Probable N-acetylgalactosaminyltransfer...   232   2e-59
UniRef50_Q7Z7M9 Cluster: Polypeptide N-acetylgalactosaminyltrans...   231   4e-59
UniRef50_Q9Y117 Cluster: Polypeptide N-acetylgalactosaminyltrans...   227   5e-58
UniRef50_Q176D5 Cluster: N-acetylgalactosaminyltransferase; n=3;...   226   1e-57
UniRef50_Q16SH9 Cluster: N-acetylgalactosaminyltransferase; n=2;...   226   1e-57
UniRef50_Q8MV48 Cluster: N-acetylgalactosaminyltransferase 7; n=...   223   1e-56
UniRef50_UPI0000E461C0 Cluster: PREDICTED: hypothetical protein,...   222   2e-56
UniRef50_Q6V2D0 Cluster: UDP-N-acetyl-D-galactosamine:polypeptid...   221   4e-56
UniRef50_Q17NN8 Cluster: N-acetylgalactosaminyltransferase; n=4;...   219   2e-55
UniRef50_O45947 Cluster: Putative polypeptide N-acetylgalactosam...   217   7e-55
UniRef50_Q9HCQ5 Cluster: Polypeptide N-acetylgalactosaminyltrans...   215   3e-54
UniRef50_Q5TWJ3 Cluster: ENSANGP00000028412; n=1; Anopheles gamb...   213   8e-54
UniRef50_Q6WV16 Cluster: N-acetylgalactosaminyltransferase 6; n=...   210   8e-53
UniRef50_Q17M60 Cluster: N-acetylgalactosaminyltransferase; n=1;...   209   2e-52
UniRef50_Q8N3T1 Cluster: Polypeptide N-acetylgalactosaminyltrans...   207   6e-52
UniRef50_Q7TT15-2 Cluster: Isoform 2 of Q7TT15 ; n=9; Mammalia|R...   206   1e-51
UniRef50_UPI0000E4710F Cluster: PREDICTED: similar to pp-GalNAc-...   205   2e-51
UniRef50_UPI0000E46551 Cluster: PREDICTED: hypothetical protein,...   204   7e-51
UniRef50_Q6P9A2 Cluster: Putative polypeptide N-acetylgalactosam...   202   2e-50
UniRef50_Q16ZA7 Cluster: N-acetylgalactosaminyltransferase; n=7;...   202   2e-50
UniRef50_UPI000069E576 Cluster: Polypeptide N-acetylgalactosamin...   201   4e-50
UniRef50_UPI000069E1C8 Cluster: Polypeptide N-acetylgalactosamin...   201   4e-50
UniRef50_Q9NY28 Cluster: Probable polypeptide N-acetylgalactosam...   200   6e-50
UniRef50_Q8IA42 Cluster: N-acetylgalactosaminyltransferase 4; n=...   197   8e-49
UniRef50_Q86SF2 Cluster: N-acetylgalactosaminyltransferase 7; n=...   196   2e-48
UniRef50_Q9VUT6 Cluster: Polypeptide N-acetylgalactosaminyltrans...   194   4e-48
UniRef50_A0NGH9 Cluster: ENSANGP00000031751; n=1; Anopheles gamb...   194   5e-48
UniRef50_UPI000065D57A Cluster: Putative polypeptide N-acetylgal...   190   9e-47
UniRef50_Q8MYY6 Cluster: Putative polypeptide N-acetylgalactosam...   189   2e-46
UniRef50_Q5CKF0 Cluster: UDP-N-acetyl-D-galactosamine:polypeptid...   187   8e-46
UniRef50_Q8K1B9 Cluster: Putative polypeptide N-acetylgalactosam...   186   1e-45
UniRef50_Q8MM26 Cluster: UDP-N-acetyl-D-galactosamine:polypeptid...   180   1e-43
UniRef50_UPI0000E45D84 Cluster: PREDICTED: hypothetical protein;...   177   7e-43
UniRef50_Q5CY08 Cluster: Extracellular protein with a signal pep...   177   7e-43
UniRef50_UPI0000D9AA48 Cluster: PREDICTED: similar to UDP-N-acet...   172   2e-41
UniRef50_UPI000065D031 Cluster: Probable polypeptide N-acetylgal...   171   3e-41
UniRef50_UPI0000586DC6 Cluster: PREDICTED: similar to polypeptid...   171   6e-41
UniRef50_Q4RKI0 Cluster: Chromosome 21 SCAF15029, whole genome s...   169   1e-40
UniRef50_Q8IA43 Cluster: Putative polypeptide N-acetylgalactosam...   167   1e-39
UniRef50_Q6TBR4 Cluster: UDP-N-acetyl-D-galactosamine:polypeptid...   164   7e-39
UniRef50_Q5CYR4 Cluster: Extracellular protein with a signal pep...   164   7e-39
UniRef50_Q6YBY0 Cluster: UDP-N-acetyl-D-galactosamine:polypeptid...   161   4e-38
UniRef50_Q8IA41 Cluster: Putative polypeptide N-acetylgalactosam...   146   2e-33
UniRef50_UPI0000E46EB4 Cluster: PREDICTED: similar to MGC81846 p...   136   2e-30
UniRef50_Q6YK77 Cluster: UDP-N-acetyl-D-galactosamine:polypeptid...   135   4e-30
UniRef50_Q4RNJ5 Cluster: Chromosome 21 SCAF15012, whole genome s...   133   1e-29
UniRef50_Q4RNJ6 Cluster: Chromosome 21 SCAF15012, whole genome s...   125   4e-27
UniRef50_Q5CHA1 Cluster: Glycosyl transferase; n=4; Cryptosporid...   124   7e-27
UniRef50_Q4SKF7 Cluster: Chromosome 13 SCAF14566, whole genome s...   122   3e-26
UniRef50_Q8IA44 Cluster: Putative polypeptide N-acetylgalactosam...   115   4e-24
UniRef50_UPI0000E46FFD Cluster: PREDICTED: similar to n-acetylga...   110   1e-22
UniRef50_Q4STJ6 Cluster: Chromosome undetermined SCAF14183, whol...   106   1e-21
UniRef50_UPI000155C133 Cluster: PREDICTED: similar to UDP-N-acet...   103   1e-20
UniRef50_Q4T0W3 Cluster: Chromosome undetermined SCAF10824, whol...   103   2e-20
UniRef50_UPI0001554C17 Cluster: PREDICTED: similar to Polypeptid...    97   9e-19
UniRef50_A5D6B4 Cluster: Predicted glycosyltransferases; n=1; Pe...    91   8e-17
UniRef50_UPI0000D8AB1E Cluster: UDP-N-acetyl-alpha-D-galactosami...    89   3e-16
UniRef50_UPI00005A4710 Cluster: PREDICTED: similar to GalNAc tra...    88   7e-16
UniRef50_Q4RQK9 Cluster: Chromosome 2 SCAF15004, whole genome sh...    85   4e-15
UniRef50_Q7Q046 Cluster: ENSANGP00000016624; n=1; Anopheles gamb...    84   9e-15
UniRef50_Q2B871 Cluster: Glycosyl transferase, group 2 family pr...    83   3e-14
UniRef50_Q4SU00 Cluster: Chromosome undetermined SCAF14054, whol...    80   1e-13
UniRef50_UPI0000E234D0 Cluster: PREDICTED: similar to UDP-GalNAc...    80   2e-13
UniRef50_Q5BYW7 Cluster: SJCHGC07375 protein; n=1; Schistosoma j...    79   2e-13
UniRef50_Q4TCW9 Cluster: Chromosome undetermined SCAF6660, whole...    75   4e-12
UniRef50_UPI0000E46BB8 Cluster: PREDICTED: similar to UDP-GalNAc...    74   9e-12
UniRef50_UPI00005A4DE7 Cluster: PREDICTED: similar to Probable p...    74   1e-11
UniRef50_Q4T9B6 Cluster: Chromosome undetermined SCAF7602, whole...    73   2e-11
UniRef50_UPI0000F20FAB Cluster: PREDICTED: hypothetical protein;...    73   2e-11
UniRef50_Q01V98 Cluster: Glycosyl transferase, family 2; n=1; So...    73   2e-11
UniRef50_A4J8G0 Cluster: Glycosyl transferase, family 2; n=1; De...    73   2e-11
UniRef50_Q4RPK0 Cluster: Chromosome 12 SCAF15007, whole genome s...    72   4e-11
UniRef50_UPI0000E49DD9 Cluster: PREDICTED: hypothetical protein;...    71   7e-11
UniRef50_Q4SIA0 Cluster: Chromosome 5 SCAF14581, whole genome sh...    69   3e-10
UniRef50_UPI0000F1FCC6 Cluster: PREDICTED: hypothetical protein;...    67   1e-09
UniRef50_A7T195 Cluster: Predicted protein; n=1; Nematostella ve...    64   8e-09
UniRef50_UPI000069DFD6 Cluster: Polypeptide N-acetylgalactosamin...    62   5e-08
UniRef50_Q3A8Y3 Cluster: Glycosyl transferase, group 2 family; n...    60   2e-07
UniRef50_Q3A8Y2 Cluster: Glycosyl transferase, group 2 family; n...    58   9e-07
UniRef50_Q4TDW9 Cluster: Chromosome undetermined SCAF5986, whole...    50   2e-04
UniRef50_Q68VJ7 Cluster: Polypeptide N-acetylgalactosaminyltrans...    49   4e-04
UniRef50_Q1ARC2 Cluster: Glycosyl transferase, family 2; n=1; Ru...    47   0.002
UniRef50_Q4RAK4 Cluster: Chromosome undetermined SCAF23488, whol...    46   0.002
UniRef50_Q5CY12 Cluster: UDP-N-acetylgalactosamine: polypeptide ...    46   0.002
UniRef50_UPI000069E575 Cluster: Polypeptide N-acetylgalactosamin...    46   0.003
UniRef50_A6WEA2 Cluster: Glycosyl transferase family 2; n=1; Kin...    46   0.003
UniRef50_Q1Q6Z1 Cluster: Putative uncharacterized protein; n=1; ...    43   0.020
UniRef50_UPI0000499CE1 Cluster: SMC3 protein; n=1; Entamoeba his...    43   0.026
UniRef50_A2DKE3 Cluster: Viral A-type inclusion protein, putativ...    43   0.026
UniRef50_Q23ZG7 Cluster: Peptidase family M1 containing protein;...    42   0.035
UniRef50_A2EE02 Cluster: Putative uncharacterized protein; n=1; ...    42   0.046
UniRef50_A1BDD3 Cluster: Glycosyl transferase, family 2; n=1; Ch...    42   0.061
UniRef50_Q23G97 Cluster: Putative uncharacterized protein; n=1; ...    42   0.061
UniRef50_A2F531 Cluster: Viral A-type inclusion protein, putativ...    41   0.11 
UniRef50_A0D876 Cluster: Chromosome undetermined scaffold_40, wh...    41   0.11 
UniRef50_A6UUX2 Cluster: SMC domain protein; n=1; Methanococcus ...    41   0.11 
UniRef50_A0E3J8 Cluster: Chromosome undetermined scaffold_76, wh...    40   0.14 
UniRef50_P62134 Cluster: DNA double-strand break repair rad50 AT...    40   0.14 
UniRef50_Q1WVM5 Cluster: N-acetylglucosaminyltransferase; n=1; L...    40   0.19 
UniRef50_Q55EZ8 Cluster: Putative uncharacterized protein; n=1; ...    40   0.19 
UniRef50_Q54SR9 Cluster: Leucine-rich repeat-containing protein;...    40   0.19 
UniRef50_A2DD37 Cluster: Viral A-type inclusion protein, putativ...    40   0.25 
UniRef50_A2SR79 Cluster: Glycosyl transferase, family 2; n=1; Me...    40   0.25 
UniRef50_A0YT83 Cluster: Putative uncharacterized protein; n=1; ...    39   0.32 
UniRef50_Q8MNV4 Cluster: Putative uncharacterized protein; n=2; ...    39   0.32 
UniRef50_Q55FF7 Cluster: Putative uncharacterized protein; n=1; ...    39   0.32 
UniRef50_Q9V2L6 Cluster: Dpm1 dolichol-phosphate mannosyltransfe...    39   0.32 
UniRef50_Q5LW10 Cluster: Diguanylate cyclase, putative; n=3; Rho...    39   0.43 
UniRef50_Q7NBF8 Cluster: Putative uncharacterized protein; n=1; ...    38   0.57 
UniRef50_A5HZ17 Cluster: Exonuclease; n=4; Clostridium botulinum...    38   0.57 
UniRef50_A0L591 Cluster: Glycosyl transferase, family 2; n=1; Ma...    38   0.57 
UniRef50_Q8IBY8 Cluster: Putative uncharacterized protein PF07_0...    38   0.57 
UniRef50_Q86KX8 Cluster: Similar to Dictyostelium discoideum (Sl...    38   0.57 
UniRef50_Q22W40 Cluster: Putative uncharacterized protein; n=1; ...    38   0.57 
UniRef50_A4YEH2 Cluster: Glycosyl transferase, family 2; n=1; Me...    38   0.57 
UniRef50_UPI00006CE63C Cluster: hypothetical protein TTHERM_0070...    38   0.75 
UniRef50_Q6MF69 Cluster: Putative uncharacterized protein; n=2; ...    38   0.75 
UniRef50_Q1Q6W0 Cluster: Putative uncharacterized protein; n=1; ...    38   0.75 
UniRef50_A1AUA8 Cluster: Glycosyl transferase, family 2; n=1; Pe...    38   0.75 
UniRef50_Q23A50 Cluster: Putative uncharacterized protein; n=1; ...    38   0.75 
UniRef50_A2FSZ8 Cluster: Viral A-type inclusion protein, putativ...    38   0.75 
UniRef50_A2E4S4 Cluster: Viral A-type inclusion protein, putativ...    38   0.75 
UniRef50_A2DKP8 Cluster: Viral A-type inclusion protein, putativ...    38   0.75 
UniRef50_Q2S1Y8 Cluster: Glycosyl transferase, group 2 family pr...    38   0.99 
UniRef50_A2FD36 Cluster: Viral A-type inclusion protein, putativ...    38   0.99 
UniRef50_A0DTN4 Cluster: Chromosome undetermined scaffold_63, wh...    38   0.99 
UniRef50_A0CKL2 Cluster: Chromosome undetermined scaffold_2, who...    38   0.99 
UniRef50_Q6CYG5 Cluster: Similarity; n=2; Kluyveromyces lactis|R...    38   0.99 
UniRef50_Q8TXA4 Cluster: Uncharacterized protein; n=2; cellular ...    38   0.99 
UniRef50_UPI00006CD032 Cluster: Micro-fibrillar-associated prote...    37   1.3  
UniRef50_UPI00006CB2DA Cluster: Viral A-type inclusion protein r...    37   1.3  
UniRef50_Q7U947 Cluster: Putative uncharacterized protein; n=1; ...    37   1.3  
UniRef50_Q319Q2 Cluster: Putative uncharacterized protein; n=1; ...    37   1.3  
UniRef50_Q9GRG0 Cluster: Tetrin B protein; n=2; Tetrahymena ther...    37   1.3  
UniRef50_Q8ILK5 Cluster: Putative uncharacterized protein; n=1; ...    37   1.3  
UniRef50_Q6LF09 Cluster: Putative uncharacterized protein; n=6; ...    37   1.3  
UniRef50_Q5DAE6 Cluster: SJCHGC05311 protein; n=1; Schistosoma j...    37   1.3  
UniRef50_Q22869 Cluster: Non-muscle myosin heavy chain II; n=3; ...    37   1.3  
UniRef50_A2EZ87 Cluster: Viral A-type inclusion protein, putativ...    37   1.3  
UniRef50_Q75E63 Cluster: ABL193Cp; n=1; Eremothecium gossypii|Re...    37   1.3  
UniRef50_Q0URH4 Cluster: Putative uncharacterized protein; n=1; ...    37   1.3  
UniRef50_Q2FPR9 Cluster: Regulatory protein, ArsR; n=2; Methanom...    37   1.3  
UniRef50_UPI00006CEB56 Cluster: hypothetical protein TTHERM_0037...    37   1.7  
UniRef50_UPI00006CB6DE Cluster: hypothetical protein TTHERM_0049...    37   1.7  
UniRef50_UPI00015A629B Cluster: UPI00015A629B related cluster; n...    37   1.7  
UniRef50_UPI000069FF36 Cluster: M-phase phosphoprotein 1 (MPP1) ...    37   1.7  
UniRef50_Q82L26 Cluster: Putative secreted alpha-galactosidase; ...    37   1.7  
UniRef50_A5EY82 Cluster: Serine protease; n=1; Dichelobacter nod...    37   1.7  
UniRef50_A2G5G4 Cluster: Putative uncharacterized protein; n=1; ...    37   1.7  
UniRef50_A2EN31 Cluster: Viral A-type inclusion protein, putativ...    37   1.7  
UniRef50_A2E7J3 Cluster: Putative uncharacterized protein; n=1; ...    37   1.7  
UniRef50_A5DZR6 Cluster: Putative uncharacterized protein; n=1; ...    37   1.7  
UniRef50_Q58718 Cluster: DNA double-strand break repair rad50 AT...    37   1.7  
UniRef50_UPI00015B6253 Cluster: PREDICTED: similar to CG33715-PD...    36   2.3  
UniRef50_Q2JCN5 Cluster: Glycosyl transferase, family 2 precurso...    36   2.3  
UniRef50_O06764 Cluster: Phase variable surface lipoprotein P78 ...    36   2.3  
UniRef50_A6WGG7 Cluster: Glycosyl transferase family 2; n=1; Kin...    36   2.3  
UniRef50_A0V2D1 Cluster: Glycosyl transferase, family 2; n=1; Cl...    36   2.3  
UniRef50_Q4Q0C8 Cluster: Phosphatidylinositol 3 kinase, putative...    36   2.3  
UniRef50_O97294 Cluster: Putative uncharacterized protein PFC099...    36   2.3  
UniRef50_A2EJ43 Cluster: Viral A-type inclusion protein, putativ...    36   2.3  
UniRef50_A0CQE7 Cluster: Chromosome undetermined scaffold_24, wh...    36   2.3  
UniRef50_Q4JC59 Cluster: Conserved Archaeal membrane protein; n=...    36   2.3  
UniRef50_UPI000150A0FA Cluster: Type III restriction enzyme, res...    36   3.0  
UniRef50_UPI00006CF2BD Cluster: Leucine Rich Repeat family prote...    36   3.0  
UniRef50_UPI0000499464 Cluster: DNA repair protein Rad50; n=1; E...    36   3.0  
UniRef50_Q1WV23 Cluster: Superfamily II DNA and RNA helicase; n=...    36   3.0  
UniRef50_Q7RT92 Cluster: Phosphatidylinositol transfer protein 2...    36   3.0  
UniRef50_A2ETW9 Cluster: Viral A-type inclusion protein, putativ...    36   3.0  
UniRef50_Q0W1H0 Cluster: Putative glycosyltransferase; n=2; uncu...    36   3.0  
UniRef50_A2STK1 Cluster: Glycosyl transferase, family 2; n=2; Me...    36   3.0  
UniRef50_UPI000150A21E Cluster: hypothetical protein TTHERM_0019...    36   4.0  
UniRef50_UPI00006CF1FD Cluster: hypothetical protein TTHERM_0054...    36   4.0  
UniRef50_Q8KU52 Cluster: EF0109; n=1; Enterococcus faecalis|Rep:...    36   4.0  
UniRef50_A5ZRG2 Cluster: Putative uncharacterized protein; n=1; ...    36   4.0  
UniRef50_A4JHB1 Cluster: Putative uncharacterized protein; n=1; ...    36   4.0  
UniRef50_Q7RSH4 Cluster: Putative uncharacterized protein PY0038...    36   4.0  
UniRef50_Q7RBX2 Cluster: Mature-parasite-infected erythrocyte su...    36   4.0  
UniRef50_Q4Z6V7 Cluster: Putative uncharacterized protein; n=3; ...    36   4.0  
UniRef50_Q23YT2 Cluster: Kinesin motor domain containing protein...    36   4.0  
UniRef50_Q22YR7 Cluster: Cyclic nucleotide-binding domain contai...    36   4.0  
UniRef50_A2FX23 Cluster: Formin Homology 2 Domain containing pro...    36   4.0  
UniRef50_A2F112 Cluster: Putative uncharacterized protein; n=1; ...    36   4.0  
UniRef50_A2ERV7 Cluster: Putative uncharacterized protein; n=1; ...    36   4.0  
UniRef50_A2EQA8 Cluster: Putative uncharacterized protein; n=1; ...    36   4.0  
UniRef50_A0CXR3 Cluster: Chromosome undetermined scaffold_30, wh...    36   4.0  
UniRef50_Q0UYN5 Cluster: Putative uncharacterized protein; n=1; ...    36   4.0  
UniRef50_P37709 Cluster: Trichohyalin; n=2; Eutheria|Rep: Tricho...    36   4.0  
UniRef50_UPI00006CB606 Cluster: hypothetical protein TTHERM_0044...    35   5.3  
UniRef50_UPI0000498399 Cluster: Viral A-type inclusion protein r...    35   5.3  
UniRef50_Q97H37 Cluster: Glycosyltransferase domain containing p...    35   5.3  
UniRef50_Q8EWP8 Cluster: Predicted cytoskeletal protein; n=1; My...    35   5.3  
UniRef50_Q50EX9 Cluster: P-553; n=5; Borrelia|Rep: P-553 - Borre...    35   5.3  
UniRef50_Q1FP00 Cluster: Helix-turn-helix, AraC type precursor; ...    35   5.3  
UniRef50_Q0TSM9 Cluster: Bacterial sugar transferase family prot...    35   5.3  
UniRef50_A3ZWW6 Cluster: Putative uncharacterized protein; n=1; ...    35   5.3  
UniRef50_A3N1J8 Cluster: Type I restriction enzyme, R subunit; n...    35   5.3  
UniRef50_A0W610 Cluster: Glycosyl transferase, family 2; n=2; Ge...    35   5.3  
UniRef50_Q8I569 Cluster: Putative uncharacterized protein; n=1; ...    35   5.3  
UniRef50_Q8I4T0 Cluster: Putative uncharacterized protein; n=1; ...    35   5.3  
UniRef50_Q7RIN9 Cluster: Putative uncharacterized protein PY0357...    35   5.3  
UniRef50_Q54G05 Cluster: Putative uncharacterized protein; n=1; ...    35   5.3  
UniRef50_Q4D320 Cluster: Putative uncharacterized protein; n=3; ...    35   5.3  
UniRef50_Q28X57 Cluster: GA21145-PA; n=2; Sophophora|Rep: GA2114...    35   5.3  
UniRef50_Q25662 Cluster: Repeat organellar protein; n=5; Plasmod...    35   5.3  
UniRef50_Q236I9 Cluster: Putative uncharacterized protein; n=1; ...    35   5.3  
UniRef50_A5KBH9 Cluster: Putative uncharacterized protein; n=1; ...    35   5.3  
UniRef50_A2DDX5 Cluster: Viral A-type inclusion protein, putativ...    35   5.3  
UniRef50_A0E4N3 Cluster: Chromosome undetermined scaffold_78, wh...    35   5.3  
UniRef50_Q8SVH3 Cluster: Putative uncharacterized protein ECU05_...    35   5.3  
UniRef50_Q5J6J3 Cluster: Dipeptidyl-peptidase IV; n=16; Pezizomy...    35   5.3  
UniRef50_Q2FUG3 Cluster: Sensor protein; n=1; Methanospirillum h...    35   5.3  
UniRef50_A3HAB6 Cluster: Putative uncharacterized protein; n=1; ...    35   5.3  
UniRef50_Q10411 Cluster: Sporulation-specific protein 15; n=1; S...    35   5.3  
UniRef50_UPI00015BC953 Cluster: UPI00015BC953 related cluster; n...    35   7.0  
UniRef50_UPI00015B53AD Cluster: PREDICTED: similar to RHO kinase...    35   7.0  
UniRef50_UPI0000DAE6EA Cluster: hypothetical protein Rgryl_01001...    35   7.0  
UniRef50_Q8DX62 Cluster: Putative uncharacterized protein SAG199...    35   7.0  
UniRef50_Q72ZS2 Cluster: Putative uncharacterized protein; n=1; ...    35   7.0  
UniRef50_Q311W2 Cluster: Glycosyl transferase, group 2 family pr...    35   7.0  
UniRef50_Q26H47 Cluster: Putative uncharacterized protein; n=1; ...    35   7.0  
UniRef50_Q21IV0 Cluster: B-glycosyltransferase-like protein; n=1...    35   7.0  
UniRef50_A7BMC2 Cluster: Glycosyl transferase, group 2 family; n...    35   7.0  
UniRef50_A5FFS1 Cluster: Glycosyl transferase, family 2; n=15; B...    35   7.0  
UniRef50_A5D3A4 Cluster: Predicted glycosyltransferases; n=1; Pe...    35   7.0  
UniRef50_A4F5Y3 Cluster: Glycosyl transferase; n=1; Saccharopoly...    35   7.0  
UniRef50_A3JJ43 Cluster: Glycosyl transferases-like protein; n=3...    35   7.0  
UniRef50_Q8I1P1 Cluster: Putative uncharacterized protein PFD095...    35   7.0  
UniRef50_Q7REL0 Cluster: Rhoptry protein; n=29; Plasmodium (Vinc...    35   7.0  
UniRef50_Q54Y97 Cluster: Putative uncharacterized protein; n=1; ...    35   7.0  
UniRef50_Q54MY6 Cluster: Putative uncharacterized protein; n=1; ...    35   7.0  
UniRef50_Q4YPW4 Cluster: Putative uncharacterized protein; n=5; ...    35   7.0  
UniRef50_Q233C6 Cluster: Putative uncharacterized protein; n=1; ...    35   7.0  
UniRef50_Q22U14 Cluster: Putative uncharacterized protein; n=3; ...    35   7.0  
UniRef50_Q22CU7 Cluster: Putative uncharacterized protein; n=1; ...    35   7.0  
UniRef50_Q22BF6 Cluster: Putative uncharacterized protein; n=2; ...    35   7.0  
UniRef50_A7TM62 Cluster: Putative uncharacterized protein; n=1; ...    35   7.0  
UniRef50_Q8PXS7 Cluster: Dolichyl-phosphate mannose synthase rel...    35   7.0  
UniRef50_Q2FTA5 Cluster: Glycosyl transferase, family 2; n=2; Me...    35   7.0  
UniRef50_O80458 Cluster: Dehydrodolichyl diphosphate synthase 1;...    35   7.0  
UniRef50_UPI00006CE5B4 Cluster: hypothetical protein TTHERM_0014...    34   9.2  
UniRef50_Q149I6 Cluster: Upf3b protein; n=1; Mus musculus|Rep: U...    34   9.2  
UniRef50_Q8RA31 Cluster: Glycosyltransferases involved in cell w...    34   9.2  
UniRef50_Q6YPY2 Cluster: Putative uncharacterized protein; n=1; ...    34   9.2  
UniRef50_Q3B6H2 Cluster: Glycosyl transferase; n=4; Chlorobium/P...    34   9.2  
UniRef50_Q2B897 Cluster: Glycosyltransferase; n=1; Bacillus sp. ...    34   9.2  
UniRef50_Q1IMJ5 Cluster: Glycosyl transferase, family 2; n=8; ce...    34   9.2  
UniRef50_A5I5G6 Cluster: Putative uncharacterized protein; n=4; ...    34   9.2  
UniRef50_A4WXV7 Cluster: TRAP dicarboxylate transporter-DctP sub...    34   9.2  
UniRef50_A0IQ61 Cluster: Glycosyl transferase, family 2; n=1; Se...    34   9.2  
UniRef50_Q8IPP9 Cluster: CG31551-PA; n=2; Eukaryota|Rep: CG31551...    34   9.2  
UniRef50_Q8IDH0 Cluster: Putative uncharacterized protein MAL13P...    34   9.2  
UniRef50_Q8I416 Cluster: ATP-dependent RNA helicase, putative; n...    34   9.2  
UniRef50_Q7QA01 Cluster: ENSANGP00000012284; n=1; Anopheles gamb...    34   9.2  
UniRef50_Q7PDJ9 Cluster: Related to CG6013 PROTEIN; n=3; Plasmod...    34   9.2  
UniRef50_Q5CRD7 Cluster: Putative uncharacterized protein; n=2; ...    34   9.2  
UniRef50_Q4Z043 Cluster: Related to CG6013 PROTEIN, putative; n=...    34   9.2  
UniRef50_Q23JF3 Cluster: Putative uncharacterized protein; n=1; ...    34   9.2  
UniRef50_Q23AP4 Cluster: Peptidyl-prolyl cis-trans isomerase, cy...    34   9.2  
UniRef50_Q22RF4 Cluster: Viral A-type inclusion protein repeat c...    34   9.2  
UniRef50_A2G6X0 Cluster: Putative uncharacterized protein; n=1; ...    34   9.2  
UniRef50_A0BIX7 Cluster: Chromosome undetermined scaffold_11, wh...    34   9.2  
UniRef50_A0BCU6 Cluster: Chromosome undetermined scaffold_10, wh...    34   9.2  
UniRef50_Q9HQP4 Cluster: Putative uncharacterized protein; n=1; ...    34   9.2  
UniRef50_Q5JJ24 Cluster: Dolichol-phosphate mannosyltransferase;...    34   9.2  
UniRef50_Q09462 Cluster: Probable U3 small nucleolar RNA-associa...    34   9.2  

>UniRef50_Q16ZW8 Cluster: N-acetylgalactosaminyltransferase; n=4;
           Endopterygota|Rep: N-acetylgalactosaminyltransferase -
           Aedes aegypti (Yellowfever mosquito)
          Length = 662

 Score =  353 bits (867), Expect = 1e-95
 Identities = 157/258 (60%), Positives = 205/258 (79%), Gaps = 10/258 (3%)

Query: 246 ENSEVKNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRL 305
           E + +KN+   +RL++ ++REGL+R+R+YGA N+ GDVL+FLDSHIEVNV W+ PLL+R 
Sbjct: 251 ELNALKNS--KVRLIRNAEREGLMRSRVYGARNATGDVLIFLDSHIEVNVDWVEPLLQR- 307

Query: 306 SQGVDGVKVRYSARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDE 365
                 +K   +  A+ PVID+IN+DTF YS SPLVRGGFNWGLHFKWDNLPKGTL  + 
Sbjct: 308 ------IKTNKTILAM-PVIDIINSDTFIYSSSPLVRGGFNWGLHFKWDNLPKGTLAKES 360

Query: 366 DFMKPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCS 425
           DF+ P +SPTMAGGLFA+ R+YF  +G+YD GM+VWGGENLEISFR W CGGS+EL+PCS
Sbjct: 361 DFVGPFQSPTMAGGLFAVDRQYFKDLGEYDMGMDVWGGENLEISFRTWQCGGSIELVPCS 420

Query: 426 RVGHVFRKRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKA 485
           R+GHVFRKRRPYG  +  D M++NS+R++RVWMDDY+K  +E  P A  V+ GD+++R  
Sbjct: 421 RIGHVFRKRRPYGSPDGSDTMIRNSLRLSRVWMDDYIKYFLENQPQAKKVDPGDLTDRHD 480

Query: 486 LRERLQCKTFKWYLDNMW 503
           LR+RL CK+F+WYL N++
Sbjct: 481 LRKRLNCKSFEWYLKNIY 498



 Score =  139 bits (336), Expect = 2e-31
 Identities = 80/186 (43%), Positives = 110/186 (59%), Gaps = 15/186 (8%)

Query: 27  KNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQQAIK 86
           +NN   +  LEGK        D  ++ S    KY  Y+++ E+R+   +  K   +    
Sbjct: 87  QNNLVGLDPLEGKT-------DTGKSFSFFKDKYNRYRKEQEFRK---ISHKLMDELQPI 136

Query: 87  MSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQ 146
           M   T+     +FG++RNSE+  IRD GY  HAFN L+S +IG  R +PDTR+KLC  Q 
Sbjct: 137 MPNGTD-----EFGMVRNSEEQFIRDVGYRKHAFNVLVSNKIGPFRGVPDTRHKLCHEQS 191

Query: 147 YFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEA 206
           Y   LP ASII+CFYNEH ETL+RSV SI+ RT    + EIILVDD SDL +L  +++  
Sbjct: 192 YDKVLPSASIIMCFYNEHLETLVRSVTSIIRRTPSYLLHEIILVDDCSDLDDLRDNLEHE 251

Query: 207 VDKLNN 212
           ++ L N
Sbjct: 252 LNALKN 257



 Score =  101 bits (242), Expect = 5e-20
 Identities = 44/91 (48%), Positives = 64/91 (70%), Gaps = 3/91 (3%)

Query: 502 MWFETDRSELVLGRTLCLDASNNV---APILGKCHEMGGTQEWKHKGTASSPIYNTAAGM 558
           MW+ET+R+ELVLG+ LCL+A ++    +P+L KCHEMGG Q WKH+ T  +PIYN A+G 
Sbjct: 572 MWYETERAELVLGQLLCLEAPSSATKGSPMLNKCHEMGGDQAWKHRKTKGTPIYNIASGS 631

Query: 559 CLGVDRSYRGETVLMVICDDYSNNKWDIVRS 589
           CL V ++ +G  V + +C +   + WD+V S
Sbjct: 632 CLAVKQATKGALVGLDLCVNSPRSTWDLVIS 662


>UniRef50_Q8MVS5 Cluster: Polypeptide
           N-acetylgalactosaminyltransferase 35A (EC 2.4.1.41)
           (Protein-UDP acetylgalactosaminyltransferase 35A) (UDP-
           GalNAc:polypeptide N-acetylgalactosaminyltransferase
           35A) (pp-GaNTase 35A) (Protein l(2)35Aa); n=2;
           Sophophora|Rep: Polypeptide
           N-acetylgalactosaminyltransferase 35A (EC 2.4.1.41)
           (Protein-UDP acetylgalactosaminyltransferase 35A) (UDP-
           GalNAc:polypeptide N-acetylgalactosaminyltransferase
           35A) (pp-GaNTase 35A) (Protein l(2)35Aa) - Drosophila
           melanogaster (Fruit fly)
          Length = 632

 Score =  321 bits (789), Expect = 3e-86
 Identities = 145/250 (58%), Positives = 185/250 (74%), Gaps = 8/250 (3%)

Query: 256 NIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVR 315
           N+R +K  +REGLIR+R+ GA  +VGDVLVFLDSHIEVN  WL PLL+ +          
Sbjct: 211 NLRYIKNEQREGLIRSRVIGAREAVGDVLVFLDSHIEVNQQWLEPLLRLIKSE------- 263

Query: 316 YSARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPT 375
            +A    PVID+INADTFEY+PSPLVRGGFNWGLHF+W+NLP+GTL   EDF  P +SPT
Sbjct: 264 -NATLAVPVIDLINADTFEYTPSPLVRGGFNWGLHFRWENLPEGTLKVPEDFRGPFRSPT 322

Query: 376 MAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRR 435
           MAGGLFA+ R+YF  +G+YD  M++WGGEN+EISFR W CGG+++++PCSRVGH+FRKRR
Sbjct: 323 MAGGLFAVNRKYFQHLGEYDMAMDIWGGENIEISFRAWQCGGAIKIVPCSRVGHIFRKRR 382

Query: 436 PYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCKTF 495
           PY   +  + ML+NS+R+A VWMD Y    ++        + GDIS+R  LRERLQC+ F
Sbjct: 383 PYTSPDGANTMLKNSLRLAHVWMDQYKDYYLKHEKVPKTYDYGDISDRLKLRERLQCRDF 442

Query: 496 KWYLDNMWFE 505
            WYL N++ E
Sbjct: 443 AWYLKNVYPE 452



 Score =  120 bits (288), Expect = 1e-25
 Identities = 62/130 (47%), Positives = 90/130 (69%), Gaps = 9/130 (6%)

Query: 97  EQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFD--ELPRA 154
           E  G++RN +D  IRD GY  HAFN L+S  IG  R +PDTR+K+C  Q+  +   LP+A
Sbjct: 91  ELLGVVRNKQDKYIRDIGYKHHAFNALVSNNIGLFRAIPDTRHKVCDRQETTEAENLPQA 150

Query: 155 SIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDL----YNLHHDVQEAVDKL 210
           SI++CFYNEH  TLMRS+ ++++RT    ++EIILVDD+SDL    ++LH D++  + K 
Sbjct: 151 SIVMCFYNEHKMTLMRSIKTVLERTPSYLLREIILVDDHSDLPELEFHLHGDLRARL-KY 209

Query: 211 NNV--IKKEE 218
           +N+  IK E+
Sbjct: 210 DNLRYIKNEQ 219



 Score = 87.0 bits (206), Expect = 1e-15
 Identities = 38/101 (37%), Positives = 61/101 (60%), Gaps = 2/101 (1%)

Query: 490 LQCKTFKWYLDNMWFETDRSELVLGRTLCLDASNNVAPILGKCHEMGGTQEWKHKGTASS 549
           LQ +T +   + +W+ET+++E+VL + LCL+AS +    + KCHEM G Q+W+H   A+S
Sbjct: 511 LQLQTCRRTPNQLWYETEKAEIVLDKLLCLEASGDAQVTVNKCHEMLGDQQWRHTRNANS 570

Query: 550 PIYNTAAGMCLGVDRSYRGETVLMVIC--DDYSNNKWDIVR 588
           P+YN A G CL       G  + + +C   + +   WDIV+
Sbjct: 571 PVYNMAKGTCLRAAAPTTGALISLDLCSKSNGAGGSWDIVQ 611


>UniRef50_Q8NCW6 Cluster: Polypeptide
           N-acetylgalactosaminyltransferase 11; n=33;
           Eumetazoa|Rep: Polypeptide
           N-acetylgalactosaminyltransferase 11 - Homo sapiens
           (Human)
          Length = 608

 Score =  316 bits (775), Expect = 1e-84
 Identities = 145/249 (58%), Positives = 185/249 (74%), Gaps = 8/249 (3%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           I++++ +KREGLIR R+ GA ++ G+VLVFLDSH EVNV WL PLL  + +       R+
Sbjct: 214 IKVIRNTKREGLIRGRMIGAAHATGEVLVFLDSHCEVNVMWLQPLLAAIRED------RH 267

Query: 317 SARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTM 376
           +   V PVID+I+ADT  YS SP+VRGGFNWGLHFKWD +P   L   E    P+KSPTM
Sbjct: 268 TV--VCPVIDIISADTLAYSSSPVVRGGFNWGLHFKWDLVPLSELGRAEGATAPIKSPTM 325

Query: 377 AGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRRP 436
           AGGLFA+ R+YF+ +G+YD GM++WGGENLEISFRIWMCGG L +IPCSRVGH+FRKRRP
Sbjct: 326 AGGLFAMNRQYFHELGQYDSGMDIWGGENLEISFRIWMCGGKLFIIPCSRVGHIFRKRRP 385

Query: 437 YGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCKTFK 496
           YG  E QD M  NS+R+A VW+D+Y ++   + P       G+ISER  LR++L CK+FK
Sbjct: 386 YGSPEGQDTMTHNSLRLAHVWLDEYKEQYFSLRPDLKTKSYGNISERVELRKKLGCKSFK 445

Query: 497 WYLDNMWFE 505
           WYLDN++ E
Sbjct: 446 WYLDNVYPE 454



 Score =  127 bits (306), Expect = 9e-28
 Identities = 58/112 (51%), Positives = 77/112 (68%)

Query: 98  QFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASII 157
           + G+I N  D  +RD GY  HAFN LIS R+G HRD+PDTRN  C+ + Y  +LP AS++
Sbjct: 97  ELGMIFNERDQELRDLGYQKHAFNMLISDRLGYHRDVPDTRNAACKEKFYPPDLPAASVV 156

Query: 158 ICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDK 209
           ICFYNE +  L+R+VHS++DRT    + EIILVDD SD  +L  ++ E V K
Sbjct: 157 ICFYNEAFSALLRTVHSVIDRTPAHLLHEIILVDDDSDFDDLKGELDEYVQK 208



 Score = 58.0 bits (134), Expect = 7e-07
 Identities = 32/91 (35%), Positives = 47/91 (51%), Gaps = 6/91 (6%)

Query: 500 DNMWFETDRSELVLGRTLCLDASNNVA---PILGKCHEMGGTQEWKHKGTASSPIYNTAA 556
           + +W   +  ELVL   LCLD S   +   P L KCH  GG+Q+W      ++ +Y  + 
Sbjct: 518 NQIWIYNEEHELVLNSLLCLDMSETRSSDPPRLMKCHGSGGSQQWTF--GKNNRLYQVSV 575

Query: 557 GMCL-GVDRSYRGETVLMVICDDYSNNKWDI 586
           G CL  VD   +  +V M ICD  S+ +W +
Sbjct: 576 GQCLRAVDPLGQKGSVAMAICDGSSSQQWHL 606


>UniRef50_Q6WV19 Cluster: Polypeptide
           N-acetylgalactosaminyltransferase 2; n=2;
           Sophophora|Rep: Polypeptide
           N-acetylgalactosaminyltransferase 2 - Drosophila
           melanogaster (Fruit fly)
          Length = 633

 Score =  288 bits (707), Expect = 2e-76
 Identities = 154/340 (45%), Positives = 205/340 (60%), Gaps = 24/340 (7%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           +R+++  KREGL+R+R+ GAD +V  VL FLDSH+E N  WL PLL+R+ +         
Sbjct: 259 VRVIRNDKREGLVRSRVKGADAAVSSVLTFLDSHVECNEMWLEPLLERVRED-------- 310

Query: 317 SARAVTPVIDVINADTFEY-SPSPLVRGGFNWGLHFKWDNL-PKGTLINDEDFMKPLKSP 374
             R V PVIDVI+ D F+Y   S  +RGGF+W L FKW+ L P    +   D    +++P
Sbjct: 311 PTRVVCPVIDVISMDNFQYIGASADLRGGFDWNLIFKWEYLSPSERAMRHNDPTTAIRTP 370

Query: 375 TMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKR 434
            +AGGLF I + YFN +GKYD  M+VWGGENLEISFR+W CGGSLE+IPCSRVGHVFRKR
Sbjct: 371 MIAGGLFVIDKAYFNKLGKYDMKMDVWGGENLEISFRVWQCGGSLEIIPCSRVGHVFRKR 430

Query: 435 RPYGV-GEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCK 493
            PY   G   +   +N+ R A VWMDDY +      P A ++  G+I +R AL+E+L CK
Sbjct: 431 HPYTFPGGSGNVFARNTRRAAEVWMDDYKQHYYNAVPLAKNIPFGNIDDRLALKEKLHCK 490

Query: 494 TFKWYLDNMWFE---TDRSELVLGR---TLCLDASNN-VAPILG--KCHEMGGTQEWKHK 544
            FKWYL+N++ +    D  E+   R   T CLD   + +   +G   CH  GG QEW   
Sbjct: 491 PFKWYLENVYPDLQAPDPQEVGQFRQDSTECLDTMGHLIDGTVGIFPCHNTGGNQEWAF- 549

Query: 545 GTASSPIYNTAAGMCLGVDRSYRGETVLMVICDDYSNNKW 584
            T    I +    +CL +    RG  V++  CDD  N +W
Sbjct: 550 -TKRGEIKHD--DLCLTLVTFARGSQVVLKACDDSENQRW 586



 Score = 90.6 bits (215), Expect = 1e-16
 Identities = 42/96 (43%), Positives = 67/96 (69%), Gaps = 6/96 (6%)

Query: 100 GLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIIC 159
           G +RN ED  IR++      FN   S  +  +RD+PDTRN +C++++Y ++LP  S+II 
Sbjct: 156 GALRNGEDPYIRNR------FNQEASDALPSNRDIPDTRNPMCRTKKYREDLPETSVIIT 209

Query: 160 FYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSD 195
           F+NE   TL+R++ S+++R+ +  I+EI+LVDDYSD
Sbjct: 210 FHNEARSTLLRTIVSVLNRSPEHLIREIVLVDDYSD 245


>UniRef50_Q7K755 Cluster: Putative polypeptide
           N-acetylgalactosaminyltransferase 11; n=3;
           Caenorhabditis|Rep: Putative polypeptide
           N-acetylgalactosaminyltransferase 11 - Caenorhabditis
           elegans
          Length = 605

 Score =  286 bits (701), Expect = 1e-75
 Identities = 154/343 (44%), Positives = 210/343 (61%), Gaps = 40/343 (11%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           ++ LKT K EGLIRA+++GA  + G+VLVFLDSH EVN  WLPPLL ++ Q         
Sbjct: 224 VKFLKTDKNEGLIRAKIFGARRANGEVLVFLDSHCEVNEEWLPPLLDQIKQN-------- 275

Query: 317 SARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTM 376
             R V P+ID+I+A T +Y  SP+  GG NW + FKWD   +    +  +++ PLKSPTM
Sbjct: 276 RRRVVCPIIDIIDAITMKYVESPVCTGGVNWAMTFKWDYPHRSYFEDPMNYVNPLKSPTM 335

Query: 377 AGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRRP 436
           AGGLFAI +EYF  IG YD GM+VWG EN+EIS RIW CGG L ++PCSRVGH+FR++RP
Sbjct: 336 AGGLFAIDKEYFFEIGSYDEGMDVWGAENVEISVRIWTCGGELLIMPCSRVGHIFRRQRP 395

Query: 437 YGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPS-AAHVEIGDISERKALRERLQCKTF 495
           YG+  K D M +NS+R+ARVW+D+Y++   E  P+     + GD++ R +LR  LQCK F
Sbjct: 396 YGI--KTDSMGKNSVRLARVWLDEYLENFFEARPNYRTFTDYGDLTSRISLRRNLQCKPF 453

Query: 496 KWYLDNMWFET---------DRSELVLGR---------TLCLDASNNVAPI-------LG 530
           KWYL+N++ E          +   LV G+         T CL A N+   I       + 
Sbjct: 454 KWYLENIYPELLPDNTPNQLNNQILVAGKKYLIKMANGTHCLSAENSQGRIANGNRVEMR 513

Query: 531 KCHEMGGTQEWKHKGTASSPIYNTAAGMCLGVDRSYRGETVLM 573
           KC+ M   Q+WK+  T       ++  MCL    S RG +V++
Sbjct: 514 KCNHMERMQQWKYSSTNELRPMGSSR-MCLD---SLRGISVIL 552



 Score = 96.7 bits (230), Expect = 2e-18
 Identities = 48/111 (43%), Positives = 68/111 (61%), Gaps = 1/111 (0%)

Query: 88  SKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQY 147
           SK+ E D +   G I    +  ++ +GY  + FN L+S RIG  R + D+RN  C S  Y
Sbjct: 96  SKEIEIDTD-LLGKINGKAEDDLQVEGYKKYQFNGLLSDRIGSRRKIKDSRNARCSSLTY 154

Query: 148 FDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYN 198
            D LP ASI++C++NE    L+R V+SI DRT  +H+ EI+LVDD S+  N
Sbjct: 155 SDSLPAASIVVCYFNESPSVLIRMVNSIFDRTKPEHLHEILLVDDSSEWSN 205


>UniRef50_Q8I136 Cluster: Polypeptide
           N-acetylgalactosaminyltransferase 4; n=3;
           Caenorhabditis|Rep: Polypeptide
           N-acetylgalactosaminyltransferase 4 - Caenorhabditis
           elegans
          Length = 589

 Score =  282 bits (691), Expect = 2e-74
 Identities = 150/335 (44%), Positives = 204/335 (60%), Gaps = 22/335 (6%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           I +L+ ++REGLIR+R+ GA  +   VL FLDSHIE N  WL PLL R+++    V    
Sbjct: 208 ITVLRNNQREGLIRSRVKGAQVARAPVLTFLDSHIECNQKWLEPLLARIAENPKAV---- 263

Query: 317 SARAVTPVIDVINADTFEY-SPSPLVRGGFNWGLHFKWDNLPKGTLINDEDF-MKPLKSP 374
               V P+IDVIN D F Y   S  +RGGF+W L F+W+ + +            P++SP
Sbjct: 264 ----VAPIIDVINVDNFNYVGASADLRGGFDWTLVFRWEFMNEQLRKERHAHPTAPIRSP 319

Query: 375 TMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKR 434
           TMAGGLFAI +E+FN +G YD  M VWGGENLE+SFR+W CGGSLE++PCSRVGHVFRK+
Sbjct: 320 TMAGGLFAISKEWFNELGTYDLDMEVWGGENLEMSFRVWQCGGSLEIMPCSRVGHVFRKK 379

Query: 435 RPYGV-GEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCK 493
            PY   G   +   +N+ R A VWMD+Y    ++  PSA  V  GDI++R A+R+RLQCK
Sbjct: 380 HPYTFPGGSGNVFQKNTRRAAEVWMDEYKAIYLKNVPSARFVNFGDITDRLAIRDRLQCK 439

Query: 494 TFKWYLDNMWFETDRSELVLGRT-------LCLDA---SNNVAPILGKCHEMGGTQEWKH 543
           +FKWYL+N++ + +      G++       LCLD+     + AP L  CH  GG QEW  
Sbjct: 440 SFKWYLENVYPQLEIPRKTPGKSFQMKIGNLCLDSMARKESEAPGLFGCHGTGGNQEWVF 499

Query: 544 KGTASSPIYNTAAGMCLGVDRSYRGETVLMVICDD 578
               +    N  + +CL    +   +TV MV C++
Sbjct: 500 -DQLTKTFKNAISQLCLDFSSNTENKTVTMVKCEN 533



 Score = 52.0 bits (119), Expect = 4e-05
 Identities = 25/81 (30%), Positives = 47/81 (58%), Gaps = 1/81 (1%)

Query: 115 YNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDE-LPRASIIICFYNEHYETLMRSVH 173
           Y  ++FN   S  +   R +PD+R   C+   Y    +   ++II ++NE   +L+R+V 
Sbjct: 113 YKANSFNQEASDALNPTRKIPDSREPQCRDVDYSKVGMQPTTVIITYHNEARSSLLRTVF 172

Query: 174 SIMDRTDQKHIKEIILVDDYS 194
           S+ +++ ++ + EI+LVDD S
Sbjct: 173 SVFNQSPEELLLEIVLVDDNS 193


>UniRef50_P34678 Cluster: Polypeptide
           N-acetylgalactosaminyltransferase 3; n=2;
           Caenorhabditis|Rep: Polypeptide
           N-acetylgalactosaminyltransferase 3 - Caenorhabditis
           elegans
          Length = 612

 Score =  279 bits (685), Expect = 1e-73
 Identities = 149/313 (47%), Positives = 197/313 (62%), Gaps = 22/313 (7%)

Query: 247 NSEVKNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLS 306
           +S +K     I L+    R GLIRARL G++ + G +L+FLD+H+EV  GWL PL+ R++
Sbjct: 222 DSYIKMFPIPIHLVHLENRSGLIRARLTGSEMAKGKILLFLDAHVEVTDGWLEPLVSRVA 281

Query: 307 QGVDGVKVRYSARAVTPVIDVINADTFEY-SPSPLVRGGFNWGLHFKWDNLPKGTLIN-D 364
           +           R V P+IDVI+ DTFEY + S    GGFNW L+F+W  +PK  L    
Sbjct: 282 ED--------RKRVVAPIIDVISDDTFEYVTASETTWGGFNWHLNFRWYAVPKRELNRRG 333

Query: 365 EDFMKPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPC 424
            D   P+++PT+AGGLFAI +++F  IG YD GM VWGGENLEISFR+WMCGGSLE+ PC
Sbjct: 334 SDRSMPIQTPTIAGGLFAIDKQFFYDIGSYDEGMQVWGGENLEISFRVWMCGGSLEIHPC 393

Query: 425 SRVGHVFRKRRPYGV-GEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISER 483
           SRVGHVFRK+ PY   G     +  N+ R A VWMD+Y     ++ P+A +VE GD+SER
Sbjct: 394 SRVGHVFRKQTPYTFPGGTAKVIHHNAARTAEVWMDEYKAFFYKMVPAARNVEAGDVSER 453

Query: 484 KALRERLQCKTFKWYLDNMWFE----TDRSEL--VLGR--TLCLDAS---NNVAPILGKC 532
           K LRE LQCK+FKWYL+N++ E     D   L  ++ R    C+D +   +  AP +  C
Sbjct: 454 KKLRETLQCKSFKWYLENIYPEAPLPADFRSLGAIVNRFTEKCVDTNGKKDGQAPGIQAC 513

Query: 533 HEMGGTQEWKHKG 545
           H  GG Q W   G
Sbjct: 514 HGAGGNQAWSLTG 526



 Score = 68.9 bits (161), Expect = 3e-10
 Identities = 35/89 (39%), Positives = 60/89 (67%), Gaps = 3/89 (3%)

Query: 110 IRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDE---LPRASIIICFYNEHYE 166
           I++K +  + FN + S+ I  +R LPD R+  C++     +   +P+ SIII F+NE + 
Sbjct: 125 IKEKRFLENQFNVVASEMISVNRTLPDYRSDACRTSGNNLKTAGMPKTSIIIVFHNEAWT 184

Query: 167 TLMRSVHSIMDRTDQKHIKEIILVDDYSD 195
           TL+R++HS+++R+ +  ++EIILVDD SD
Sbjct: 185 TLLRTLHSVINRSPRHLLEEIILVDDKSD 213


>UniRef50_A7SZ28 Cluster: Predicted protein; n=1; Nematostella
           vectensis|Rep: Predicted protein - Nematostella
           vectensis
          Length = 454

 Score =  276 bits (677), Expect = 1e-72
 Identities = 144/337 (42%), Positives = 201/337 (59%), Gaps = 23/337 (6%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           I++++T +R GLI+AR+ GA+N+VG+V++FLD+H E N GWLPPLL+R++          
Sbjct: 121 IKIVRTKERVGLIKARVIGANNAVGEVVIFLDAHCECNKGWLPPLLERIALN-------- 172

Query: 317 SARAVTPVIDVINADTFEYSP-SPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPT 375
              AV P ID I+  TF+Y P  P +RG FNW   +K   +    +    D  + +KSP 
Sbjct: 173 RRTAVCPTIDFIDHKTFQYKPMDPYIRGTFNWRFDYKERAVRPEEMAKRRDPTQEVKSPV 232

Query: 376 MAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRR 435
           MAGGLFAI RE+F+ +G+YDPGM +WGGE  EISF++W CGG LE IPCSRVGHV+R   
Sbjct: 233 MAGGLFAINREFFSELGQYDPGMFIWGGEQYEISFKLWQCGGQLENIPCSRVGHVYRHHV 292

Query: 436 PYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCKTF 495
           PY    K D  L N  R+A VWMD+Y   + +  P    V+ GDIS+R ALR+RL+CK+F
Sbjct: 293 PY-TYPKHDATLVNFRRVAEVWMDEYKDWLYDKRPEIKSVDYGDISDRIALRKRLKCKSF 351

Query: 496 KWYLDNMWFETDRSELVL-------GRTLCLDASNNVAPILG--KCHEMGGTQEWKHKGT 546
           KWYL+N+  +T +++L         G+ +CLD+       +G   CH MGG Q +++   
Sbjct: 352 KWYLENVANDTVKTKLCACFQVRNQGKNMCLDSMGRKDGHVGLASCHNMGGNQAFQYTYI 411

Query: 547 ASSPIYNTAAGMCLGVDRSYRGETVLMVICDDYSNNK 583
                  T    C  V  S+ G  V    C +   N+
Sbjct: 412 RELRTDET----CFDVHESFPGAKVHFFPCHEMKGNQ 444



 Score = 84.2 bits (199), Expect = 9e-15
 Identities = 38/109 (34%), Positives = 67/109 (61%)

Query: 105 SEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEH 164
           +ED   +D  Y+   FN  +S +I   R + DTR++ C+ + Y   LP+AS++I F+NE 
Sbjct: 13  AEDESKKDAAYSEFGFNQFVSDQISLERTISDTRHQACKQRSYPINLPKASVVIVFHNEG 72

Query: 165 YETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNV 213
           + TLMR+VH+++ R+    ++EI++VDD+S+   L   + +   KL  +
Sbjct: 73  WSTLMRTVHTVLLRSPPHMLQEIVMVDDFSNKDFLKQKLDDYTKKLGKI 121


>UniRef50_A7SDQ3 Cluster: Predicted protein; n=1; Nematostella
           vectensis|Rep: Predicted protein - Nematostella
           vectensis
          Length = 575

 Score =  275 bits (675), Expect = 2e-72
 Identities = 150/344 (43%), Positives = 205/344 (59%), Gaps = 27/344 (7%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           ++L++ +KREGLIR+R+ GA+ + G+VL FLDSH E N  WL PLL R+ +    +    
Sbjct: 197 VKLIRNTKREGLIRSRVKGANLARGEVLTFLDSHCECNKNWLEPLLLRIKESPKTI---- 252

Query: 317 SARAVTPVIDVINADTFEYSPSPL-VRGGFNWGLHFKWDNLPKGTLINDEDFMK-PLKSP 374
               V+P+IDVIN DTF+Y  S   +RGGF W L+FKWD LP   L   +     P+KSP
Sbjct: 253 ----VSPIIDVINLDTFDYLGSSADLRGGFGWNLNFKWDFLPPHILAERQGKPTLPIKSP 308

Query: 375 TMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKR 434
            +AGGLF++ +++F  +GKYD  M+VWGGENLEISFR W CGG++E+IPCSRVGHVFR R
Sbjct: 309 VIAGGLFSVAKKWFETLGKYDMQMDVWGGENLEISFRTWQCGGAMEIIPCSRVGHVFRNR 368

Query: 435 RPYGV-GEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCK 493
            PY   G   +   +N+ R   VWMDDY +      P A +   GDI ER  LR +L+C+
Sbjct: 369 HPYQFPGGSMNVFQKNTRRAVEVWMDDYKRYYYAAVPYAKNTPYGDIEERVELRRKLRCR 428

Query: 494 TFKWYLDNMWFE----TDRSELVLGR----TLCLDASNNV-APILG--KCHEMGGTQEWK 542
            FKWY+ N++ E    +D S    G       C+D   ++    +G  +CH  GG Q W 
Sbjct: 429 PFKWYVQNVYPELKLPSDESTKSFGEIKQGNQCVDTLGHMRGQTIGLFECHGAGGNQMWS 488

Query: 543 HKGTASSPIYNTAAGMCLGVDRSYRGETVLMVICDD-YSNNKWD 585
              T SS + +    MCLGV+     E V ++ CD+  S   W+
Sbjct: 489 L--TKSSLLKHET--MCLGVNDGKATEPVQLLDCDENNSMQHWE 528



 Score = 75.8 bits (178), Expect = 3e-12
 Identities = 45/134 (33%), Positives = 77/134 (57%), Gaps = 6/134 (4%)

Query: 61  QDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAF 120
           QD +   E  R++  KE+  +++     K+  +DL+    ++ N       D  Y  +A+
Sbjct: 55  QDLQENQETSREIP-KEREEEEE---FDKRKISDLDPIKYIVENG--FHEGDDAYAKNAY 108

Query: 121 NTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTD 180
           N   S ++   R++PD R++ C+SQ +  +LP  +IIICF+NE    L+R+V S ++R+ 
Sbjct: 109 NIKKSDQLPVDREVPDVRDQQCKSQVWPHDLPTTTIIICFHNEGRSALLRTVISALNRSP 168

Query: 181 QKHIKEIILVDDYS 194
              +KEIILVDD+S
Sbjct: 169 PHLLKEIILVDDFS 182


>UniRef50_Q10471 Cluster: Polypeptide
           N-acetylgalactosaminyltransferase 2 (EC 2.4.1.41)
           (Protein-UDP acetylgalactosaminyltransferase 2) (UDP-
           GalNAc:polypeptide N-acetylgalactosaminyltransferase 2)
           (Polypeptide GalNAc transferase 2) (GalNAc-T2)
           (pp-GaNTase 2) [Contains: Polypeptide
           N-acetylgalactosaminyltransferase 2 soluble form]; n=32;
           Coelomata|Rep: Polypeptide
           N-acetylgalactosaminyltransferase 2 (EC 2.4.1.41)
           (Protein-UDP acetylgalactosaminyltransferase 2) (UDP-
           GalNAc:polypeptide N-acetylgalactosaminyltransferase 2)
           (Polypeptide GalNAc transferase 2) (GalNAc-T2)
           (pp-GaNTase 2) [Contains: Polypeptide
           N-acetylgalactosaminyltransferase 2 soluble form] - Homo
           sapiens (Human)
          Length = 571

 Score =  272 bits (667), Expect = 2e-71
 Identities = 149/354 (42%), Positives = 208/354 (58%), Gaps = 26/354 (7%)

Query: 246 ENSEVKNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRL 305
           E+  +   +  +R+L+  +REGL+R+R+ GAD +   VL FLDSH E N  WL PLL+R+
Sbjct: 182 EDGALLGKIEKVRVLRNDRREGLMRSRVRGADAAQAKVLTFLDSHCECNEHWLEPLLERV 241

Query: 306 SQGVDGVKVRYSARAVTPVIDVINADTFEY-SPSPLVRGGFNWGLHFKWDNL-PKGTLIN 363
           ++           R V+P+IDVIN D F+Y   S  ++GGF+W L FKWD + P+     
Sbjct: 242 AED--------RTRVVSPIIDVINMDNFQYVGASADLKGGFDWNLVFKWDYMTPEQRRSR 293

Query: 364 DEDFMKPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIP 423
             + + P+K+P +AGGLF + + YF  +GKYD  M+VWGGENLEISFR+W CGGSLE+IP
Sbjct: 294 QGNPVAPIKTPMIAGGLFVMDKFYFEELGKYDMMMDVWGGENLEISFRVWQCGGSLEIIP 353

Query: 424 CSRVGHVFRKRRPYGV-GEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISE 482
           CSRVGHVFRK+ PY   G       +N+ R A VWMD+Y        PSA +V  G+I  
Sbjct: 354 CSRVGHVFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSARNVPYGNIQS 413

Query: 483 RKALRERLQCKTFKWYLDNMWFE---TDRSELVLGR----TLCLDASNNVAP-ILG--KC 532
           R  LR++L CK FKWYL+N++ E    D  ++  G     T CLD   + A  ++G  +C
Sbjct: 414 RLELRKKLSCKPFKWYLENVYPELRVPDHQDIAFGALQQGTNCLDTLGHFADGVVGVYEC 473

Query: 533 HEMGGTQEWKHKGTASSPIYNTAAGMCLGVDRSYRGETVLMVIC-DDYSNNKWD 585
           H  GG QEW    T    + +    +CL V     G  + +  C ++ S  KW+
Sbjct: 474 HNAGGNQEWAL--TKEKSVKH--MDLCLTVVDRAPGSLIKLQGCRENDSRQKWE 523



 Score = 71.3 bits (167), Expect = 7e-11
 Identities = 34/81 (41%), Positives = 53/81 (65%)

Query: 115 YNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHS 174
           Y  + FN + S ++   R +PDTR+  CQ +Q+  +LP  S++I F+NE    L+R+V S
Sbjct: 99  YARNKFNQVESDKLRMDRAIPDTRHDQCQRKQWRVDLPATSVVITFHNEARSALLRTVVS 158

Query: 175 IMDRTDQKHIKEIILVDDYSD 195
           ++ ++    IKEIILVDDYS+
Sbjct: 159 VLKKSPPHLIKEIILVDDYSN 179


>UniRef50_A7RGG9 Cluster: Predicted protein; n=3; Eumetazoa|Rep:
           Predicted protein - Nematostella vectensis
          Length = 353

 Score =  266 bits (653), Expect = 8e-70
 Identities = 129/251 (51%), Positives = 166/251 (66%), Gaps = 9/251 (3%)

Query: 256 NIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVR 315
           N+R+L+TSKREGLIRARL GA  + GDV+ FLD+H E NV WL PLL R+      V V 
Sbjct: 96  NVRVLRTSKREGLIRARLIGARAAKGDVITFLDAHCEANVDWLQPLLSRIHSDRTIVAV- 154

Query: 316 YSARAVTPVIDVINADTFEYSPSP-LVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSP 374
                  PVID+I++  F YS +P  V GGF+W + F W +LP       +D   P+++P
Sbjct: 155 -------PVIDIISSTNFMYSGTPSAVIGGFSWDMQFTWHSLPNNRQSERKDRTAPIRTP 207

Query: 375 TMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKR 434
           TMAGGLF+I R+YF   G YD GM+VWGGENLE+SFRIW CGG LE++PCSRVGHVFR R
Sbjct: 208 TMAGGLFSIDRKYFFESGSYDEGMDVWGGENLEMSFRIWQCGGKLEILPCSRVGHVFRTR 267

Query: 435 RPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCKT 494
            PY        +  N  R+  VWMD+Y + V    P    ++ GDI+ R ALR +L+CK+
Sbjct: 268 FPYSFPGGYSEVSVNLARVVHVWMDEYNQYVYMKRPDLQSLKYGDITSRVALRNKLKCKS 327

Query: 495 FKWYLDNMWFE 505
           FKWYL+N++ E
Sbjct: 328 FKWYLENVYPE 338



 Score = 67.7 bits (158), Expect = 8e-10
 Identities = 30/72 (41%), Positives = 47/72 (65%)

Query: 142 CQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHH 201
           C S+ Y   LP  +++ICF+NE + TL+R+VHS++DR+    ++EI+L+DD+S    L  
Sbjct: 26  CSSKSYPSYLPSTTVVICFHNEAWSTLLRTVHSVIDRSPAHLLREILLIDDFSTHDYLKS 85

Query: 202 DVQEAVDKLNNV 213
            +   V KL NV
Sbjct: 86  KLTAYVAKLRNV 97


>UniRef50_Q8N428 Cluster: Putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1; n=43;
           Eumetazoa|Rep: Putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 1 - Homo
           sapiens (Human)
          Length = 558

 Score =  266 bits (651), Expect = 1e-69
 Identities = 119/251 (47%), Positives = 169/251 (67%), Gaps = 10/251 (3%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           ++ L+  +REGLIR+R+ GAD +   VL FLDSH EVN  WLPP+L+R+ +         
Sbjct: 180 VKCLRNDRREGLIRSRVRGADVAAATVLTFLDSHCEVNTEWLPPMLQRVKED-------- 231

Query: 317 SARAVTPVIDVINADTFEY-SPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPT 375
             R V+P+IDVI+ D F Y + S  +RGGF+W LHFKW+ +P    +   D  +P+++P 
Sbjct: 232 HTRVVSPIIDVISLDNFAYLAASADLRGGFDWSLHFKWEQIPLEQKMTRTDPTRPIRTPV 291

Query: 376 MAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRR 435
           +AGG+F I + +FN +GKYD  M++WGGEN E+SFR+WMCGGSLE++PCSRVGHVFRKR 
Sbjct: 292 IAGGIFVIDKSWFNHLGKYDAQMDIWGGENFELSFRVWMCGGSLEIVPCSRVGHVFRKRH 351

Query: 436 PYGVGEKQDY-MLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCKT 494
           PY   E      ++N+ R A VWMD+Y +   E  PSA     G ++ R   R+++ CK+
Sbjct: 352 PYNFPEGNALTYIRNTKRTAEVWMDEYKQYYYEARPSAIGKAFGSVATRIEQRKKMNCKS 411

Query: 495 FKWYLDNMWFE 505
           F+WYL+N++ E
Sbjct: 412 FRWYLENVYPE 422



 Score = 81.8 bits (193), Expect = 5e-14
 Identities = 39/91 (42%), Positives = 58/91 (63%)

Query: 104 NSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNE 163
           +++ L+  +  Y  HAFN L S ++   R + DTR+  C S  Y  +LP  S+II F+NE
Sbjct: 75  SAKQLKAGEDPYRQHAFNQLESDKLSPDRPIRDTRHYSCPSVSYSSDLPATSVIITFHNE 134

Query: 164 HYETLMRSVHSIMDRTDQKHIKEIILVDDYS 194
              TL+R+V S+++RT    I+EIILVDD+S
Sbjct: 135 ARSTLLRTVKSVLNRTPANLIQEIILVDDFS 165


>UniRef50_Q5DD76 Cluster: SJCHGC09400 protein; n=2; Schistosoma
           japonicum|Rep: SJCHGC09400 protein - Schistosoma
           japonicum (Blood fluke)
          Length = 737

 Score =  264 bits (647), Expect = 5e-69
 Identities = 145/304 (47%), Positives = 189/304 (62%), Gaps = 28/304 (9%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           +++++T +REGLIRAR+ GA  S G VLVFLDSHIE   GWL PLL R++          
Sbjct: 306 VKIVRTKRREGLIRARMLGAAQSSGKVLVFLDSHIECTTGWLEPLLDRIAYN-------- 357

Query: 317 SARAVTPVIDVINADTFEY---SPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKS 373
           S+  V PVI VIN  T +Y   SPS +  GGF+W L F W    +           P++S
Sbjct: 358 SSIVVVPVITVINDKTLKYDLPSPSRVQIGGFDWSLSFIWHEQTERHKNRPGAPYSPVQS 417

Query: 374 PTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRK 433
           PTMAGGLFAI REYFN +G YDPGM VWGGENLE+SF+IWMCGGSLE++ CS+VGH+FR 
Sbjct: 418 PTMAGGLFAISREYFNHLGMYDPGMEVWGGENLELSFKIWMCGGSLEIVICSQVGHIFRD 477

Query: 434 RRPY-GVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQC 492
           R PY    + +D + +N +R+A VW+DDY K+          V+IG++SERKALRE+L+C
Sbjct: 478 RSPYIWDVDVKDPLKRNLLRLADVWLDDY-KRFYHARIGFEMVDIGNVSERKALREKLKC 536

Query: 493 KTFKWYLDNMWFE--TDRSELVLG------RTLCLDA----SNNVAPILGK---CHEMGG 537
            +F WYL N++ E       L  G         CLDA     N+ + ++ K   CH+ GG
Sbjct: 537 HSFDWYLTNIYPELFVPSKALASGDIESAAGPHCLDAPLPSENDSSSVIIKTRPCHKQGG 596

Query: 538 TQEW 541
            Q W
Sbjct: 597 NQFW 600



 Score =  100 bits (239), Expect = 1e-19
 Identities = 52/106 (49%), Positives = 70/106 (66%), Gaps = 1/106 (0%)

Query: 110 IRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLM 169
           I DKG+  +AFN L S RI   R LPD R   C+  +Y   LP ASIIICF+NE +  L+
Sbjct: 203 IFDKGWKDNAFNQLASDRISVRRYLPDYREGTCKDNKYSRNLPSASIIICFHNEAWSVLL 262

Query: 170 RSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIK 215
           RSVHS++DR+    + EIILVDD+SD  +L   ++E + K+ NV+K
Sbjct: 263 RSVHSVIDRSPSYLLHEIILVDDFSDRPHLKEALEEYM-KMLNVVK 307


>UniRef50_Q95ZJ1 Cluster: Polypeptide
           N-acetylgalactosaminyltransferase 5; n=13;
           Bilateria|Rep: Polypeptide
           N-acetylgalactosaminyltransferase 5 - Caenorhabditis
           elegans
          Length = 626

 Score =  263 bits (644), Expect = 1e-68
 Identities = 129/253 (50%), Positives = 169/253 (66%), Gaps = 14/253 (5%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           +++L+  KREGLIRARL GA  + G+VL +LDSH E   GW+ PLL R+         R 
Sbjct: 237 VKILRMEKREGLIRARLRGAAVATGEVLTYLDSHCECMEGWMEPLLDRIK--------RD 288

Query: 317 SARAVTPVIDVINADTFEYSPSPLVR---GGFNWGLHFKWDNLPKGTLINDEDFMKPLKS 373
               V PVIDVI+ +TFEY  S       GGF+WGL F W ++P+    N    + P++S
Sbjct: 289 PTTVVCPVIDVIDDNTFEYHHSKAYFTSVGGFDWGLQFNWHSIPERDRKNRTRPIDPVRS 348

Query: 374 PTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRK 433
           PTMAGGLF+I ++YF  +G YDPG ++WGGENLE+SF+IWMCGG+LE++PCS VGHVFRK
Sbjct: 349 PTMAGGLFSIDKKYFEKLGTYDPGFDIWGGENLELSFKIWMCGGTLEIVPCSHVGHVFRK 408

Query: 434 RRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIE-VNPSAAHVEIGDISERKALRERLQC 492
           R PY      + + +NS+R+A VW+DDY     E +N      + GDIS RK LRE L C
Sbjct: 409 RSPYKWRTGVNVLKRNSIRLAEVWLDDYKTYYYERINNQLG--DFGDISSRKKLREDLGC 466

Query: 493 KTFKWYLDNMWFE 505
           K+FKWYLDN++ E
Sbjct: 467 KSFKWYLDNIYPE 479



 Score = 93.5 bits (222), Expect = 1e-17
 Identities = 41/112 (36%), Positives = 71/112 (63%)

Query: 104 NSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNE 163
           ++E+    DKG   +AFN   S  I  HR LP   +  C++++Y + LPR S+IICF+NE
Sbjct: 127 STEEKAKYDKGMLNNAFNQYASDMISVHRTLPTNIDAECKTEKYNENLPRTSVIICFHNE 186

Query: 164 HYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIK 215
            +  L+R+VHS+++RT    ++E++LVDD+SD+ +    ++E + +    +K
Sbjct: 187 AWSVLLRTVHSVLERTPDHLLEEVVLVDDFSDMDHTKRPLEEYMSQFGGKVK 238


>UniRef50_A2AQQ1 Cluster: UDP-N-acetyl-alpha-D-galactosamine:
           polypeptide N- acetylgalactosaminyltransferase 13; n=10;
           Coelomata|Rep: UDP-N-acetyl-alpha-D-galactosamine:
           polypeptide N- acetylgalactosaminyltransferase 13 - Mus
           musculus (Mouse)
          Length = 592

 Score =  261 bits (640), Expect = 3e-68
 Identities = 144/331 (43%), Positives = 193/331 (58%), Gaps = 24/331 (7%)

Query: 243 KSTENSEVKNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLL 302
           K T  + VK     +++++  +R GLIRARL GA  S G V+ FLD+H E  +GWL PLL
Sbjct: 163 KLTLENYVKTLEVPVKIIRMEERSGLIRARLRGAAASKGQVITFLDAHCECTLGWLEPLL 222

Query: 303 KRLSQGVDGVKVRYSARAVTPVIDVINADTFEY-SPSPLVRGGFNWGLHFKWDNLPKGTL 361
            R+ +    V        V P+IDVI+ DTFEY + S +  GGFNW L+F+W  +P+  +
Sbjct: 223 ARIKEDRKTV--------VCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREM 274

Query: 362 INDE-DFMKPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLE 420
              + D   P+++PTMAGGLF+I R YF  IG YD GM++WGGENLE+SFRIW CGGSLE
Sbjct: 275 DRRKGDRTLPVRTPTMAGGLFSIDRNYFEEIGTYDAGMDIWGGENLEMSFRIWQCGGSLE 334

Query: 421 LIPCSRVGHVFRKRRPYGV-GEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGD 479
           ++ CS VGHVFRK  PY   G     + +N+ R+A VWMD++      ++P    V+ GD
Sbjct: 335 IVTCSHVGHVFRKATPYTFPGGTGHVINKNNRRLAEVWMDEFKDFFYIISPGVVKVDYGD 394

Query: 480 ISERKALRERLQCKTFKWYLDNMWFETD--RSELVLGR------TLCLD---ASNNVAPI 528
           +S RK LRE L+CK F WYL+N++ ++   R    LG         CLD      N    
Sbjct: 395 VSVRKTLRENLKCKPFSWYLENIYPDSQIPRRYYSLGEIRNVETNQCLDNMGRKENEKVG 454

Query: 529 LGKCHEMGGTQEWKHKGTASSPIYNTAAGMC 559
           +  CH MGG Q   H    S+P     A  C
Sbjct: 455 IFNCHGMGGNQ--VHDLCLSAPSLGVGAEEC 483



 Score = 76.2 bits (179), Expect = 2e-12
 Identities = 37/101 (36%), Positives = 63/101 (62%)

Query: 115 YNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHS 174
           + ++ FN + S  I  +R LPD R + C+++ Y DELP  S++I F+NE + TL+R+V+S
Sbjct: 78  FKINQFNLMASDLIALNRSLPDVRLEGCKTKVYPDELPNTSVVIVFHNEAWSTLLRTVYS 137

Query: 175 IMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIK 215
           +++R+    + E+ILVDD S+   L   ++  V  L   +K
Sbjct: 138 VINRSPHYLLSEVILVDDASERDFLKLTLENYVKTLEVPVK 178


>UniRef50_Q10472 Cluster: Polypeptide
           N-acetylgalactosaminyltransferase 1 (EC 2.4.1.41)
           (Protein-UDP acetylgalactosaminyltransferase 1) (UDP-
           GalNAc:polypeptide N-acetylgalactosaminyltransferase 1)
           (Polypeptide GalNAc transferase 1) (GalNAc-T1)
           (pp-GaNTase 1) [Contains: Polypeptide
           N-acetylgalactosaminyltransferase 1 soluble form]; n=66;
           Eumetazoa|Rep: Polypeptide
           N-acetylgalactosaminyltransferase 1 (EC 2.4.1.41)
           (Protein-UDP acetylgalactosaminyltransferase 1) (UDP-
           GalNAc:polypeptide N-acetylgalactosaminyltransferase 1)
           (Polypeptide GalNAc transferase 1) (GalNAc-T1)
           (pp-GaNTase 1) [Contains: Polypeptide
           N-acetylgalactosaminyltransferase 1 soluble form] - Homo
           sapiens (Human)
          Length = 559

 Score =  261 bits (640), Expect = 3e-68
 Identities = 151/354 (42%), Positives = 204/354 (57%), Gaps = 31/354 (8%)

Query: 248 SEVKNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQ 307
           S VK     + +++  +R GLIRARL GA  S G V+ FLD+H E  VGWL PLL R   
Sbjct: 169 SYVKKLKVPVHVIRMEQRSGLIRARLKGAAVSKGQVITFLDAHCECTVGWLEPLLAR--- 225

Query: 308 GVDGVKVRYSARAVT-PVIDVINADTFEY-SPSPLVRGGFNWGLHFKWDNLPKGTLINDE 365
                 +++  R V  P+IDVI+ DTFEY + S +  GGFNW L+F+W  +P+  +   +
Sbjct: 226 ------IKHDRRTVVCPIIDVISDDTFEYMAGSDMTYGGFNWKLNFRWYPVPQREMDRRK 279

Query: 366 -DFMKPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPC 424
            D   P+++PTMAGGLF+I R+YF  IG YD GM++WGGENLEISFRIW CGG+LE++ C
Sbjct: 280 GDRTLPVRTPTMAGGLFSIDRDYFQEIGTYDAGMDIWGGENLEISFRIWQCGGTLEIVTC 339

Query: 425 SRVGHVFRKRRPYGV-GEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISER 483
           S VGHVFRK  PY   G     + +N+ R+A VWMD++      ++P    V+ GDIS R
Sbjct: 340 SHVGHVFRKATPYTFPGGTGQIINKNNRRLAEVWMDEFKNFFYIISPGVTKVDYGDISSR 399

Query: 484 KALRERLQCKTFKWYLDNMWFETD--RSELVLGR------TLCLD---ASNNVAPILGKC 532
             LR +LQCK F WYL+N++ ++   R    LG         CLD      N    +  C
Sbjct: 400 VGLRHKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNVETNQCLDNMARKENEKVGIFNC 459

Query: 533 HEMGGTQEWKHKGTASSPIYNTAAGMCLGVDRSYRGETVLMVICDDYSNNK-WD 585
           H MGG Q + +  TA+  I      +CL  D S     V M+ C     N+ W+
Sbjct: 460 HGMGGNQVFSY--TANKEI--RTDDLCL--DVSKLNGPVTMLKCHHLKGNQLWE 507



 Score = 83.8 bits (198), Expect = 1e-14
 Identities = 43/116 (37%), Positives = 73/116 (62%), Gaps = 3/116 (2%)

Query: 106 EDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHY 165
           ED     + + ++ FN + S+ I  +R LPD R + C+++ Y D LP  S++I F+NE +
Sbjct: 70  EDQEKMKEMFKINQFNLMASEMIALNRSLPDVRLEGCKTKVYPDNLPTTSVVIVFHNEAW 129

Query: 166 ETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLN---NVIKKEE 218
            TL+R+VHS+++R+ +  I+EI+LVDD S+   L   ++  V KL    +VI+ E+
Sbjct: 130 STLLRTVHSVINRSPRHMIEEIVLVDDASERDFLKRPLESYVKKLKVPVHVIRMEQ 185


>UniRef50_A7RRV7 Cluster: Predicted protein; n=1; Nematostella
           vectensis|Rep: Predicted protein - Nematostella
           vectensis
          Length = 513

 Score =  260 bits (638), Expect = 6e-68
 Identities = 147/347 (42%), Positives = 198/347 (57%), Gaps = 31/347 (8%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           +++L+  KREGL+RARL GA+ + GDVL FLDSH E   GW  PLL R++     V    
Sbjct: 129 VKVLRMKKREGLVRARLQGANTAKGDVLTFLDSHCEATPGWAEPLLARIAADRRNV---- 184

Query: 317 SARAVTPVIDVINADTFEYSPSPLV--RGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSP 374
               V P I+VINADTF Y  S     RGGF+W L FKW  +P        D   P+++P
Sbjct: 185 ----VCPAIEVINADTFAYQGSTNADQRGGFSWDLFFKWKGIPPEEQKLRNDDSDPIRTP 240

Query: 375 TMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRK- 433
           TMAGGLF+I+R+YF  IG YD  M++WGGENLE+SFR+WMCGG LE++ CSRVGHVFRK 
Sbjct: 241 TMAGGLFSIHRQYFFDIGSYDEEMDIWGGENLELSFRVWMCGGRLEIVTCSRVGHVFRKY 300

Query: 434 RRPYGVGEKQDYML-QNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQC 492
             PY   +  +  L +N  R+A VWMD+Y        P A + + GDIS+R  LR+RL+C
Sbjct: 301 TSPYKFPDGVERTLTKNFNRLAEVWMDEYKDLYYNKKPQAKNSDYGDISKRLELRKRLKC 360

Query: 493 KTFKWYLDNMWFETDRSEL---VLGR------TLCLDA-----SNNVAPILGKCHEMGGT 538
           K+FKWY++N++ +    EL     G         CLD+      +N    +  CH  GG 
Sbjct: 361 KSFKWYINNIYPDVQMPELDPPARGEVRNPSSNQCLDSLGAKPEHNARVGIYTCHGQGGN 420

Query: 539 QEWKHKGTASSPIYNTAAGMCLGVDRSYRGETVLMVICDDY-SNNKW 584
           Q  K+       I+      C  V +++ G  V ++ C     N +W
Sbjct: 421 QVSKY--MPRELIFEEE--NCFDVSKTHPGAPVELMKCHGMRGNQEW 463



 Score = 79.4 bits (187), Expect = 2e-13
 Identities = 41/126 (32%), Positives = 75/126 (59%), Gaps = 6/126 (4%)

Query: 102 IRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQ--YFDELPRASIIIC 159
           + + E+ ++ +K +  H+FN L+S +I   R L D R++ C+++   Y  +LP  S+IIC
Sbjct: 16  LESEENKKLAEKYFANHSFNWLLSDKISLDRTLDDVRSERCKAKHNTYPAKLPTTSVIIC 75

Query: 160 FYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNV----IK 215
           F+ E    L+R+VHS+++RT  + + E+I+VDD+S    L   + + V +   V    +K
Sbjct: 76  FHKERLSVLLRTVHSVINRTPPELLAEVIVVDDFSQDAKLGKPLDDHVAQFTKVKVLRMK 135

Query: 216 KEEEMI 221
           K E ++
Sbjct: 136 KREGLV 141



 Score = 39.1 bits (87), Expect = 0.32
 Identities = 28/79 (35%), Positives = 37/79 (46%), Gaps = 7/79 (8%)

Query: 510 ELVLGRTLCLDASNNV--API-LGKCHEMGGTQEWKHKGTASSPIYNTAAGMCLGVDR-S 565
           EL+     C D S     AP+ L KCH M G QEWKH     + ++ T    CL  DR S
Sbjct: 429 ELIFEEENCFDVSKTHPGAPVELMKCHGMRGNQEWKHDREKGTLMHFTTQ-QCL--DRGS 485

Query: 566 YRGETVLMVICDDYSNNKW 584
              +  +M  CD   + +W
Sbjct: 486 PSDQYAVMNPCDGRESQRW 504


>UniRef50_Q4RQL8 Cluster: Chromosome 2 SCAF15004, whole genome
           shotgun sequence; n=5; Euteleostomi|Rep: Chromosome 2
           SCAF15004, whole genome shotgun sequence - Tetraodon
           nigroviridis (Green puffer)
          Length = 632

 Score =  259 bits (634), Expect = 2e-67
 Identities = 152/371 (40%), Positives = 206/371 (55%), Gaps = 36/371 (9%)

Query: 242 KKSTENSEVKNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPL 301
           KK  EN  V+     +R+L+  +R GLIRARL GA  + G V+ FLD+H E  VGWL PL
Sbjct: 163 KKKLENY-VRTLEVPVRILRMEQRSGLIRARLRGAAATKGQVITFLDAHCECTVGWLEPL 221

Query: 302 LKRLSQG------------VDGVKVR---YSARAVTPVIDVINADTFEY-SPSPLVRGGF 345
           L R+ +              +    R   +    V P+IDVI+ +TFEY + S +  GGF
Sbjct: 222 LARIKEDRWDCNTALCVCVFERPSFRCFLFRTAVVCPIIDVISDETFEYMAGSDMTYGGF 281

Query: 346 NWGLHFKWDNLPKGTLINDE-DFMKPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGE 404
           NW L+F+W  +P+  +   + D   P+++PTMAGGLF+I + YF  IG YDPGM++WGGE
Sbjct: 282 NWKLNFRWYPVPQREMDRRKGDRTLPVRTPTMAGGLFSIDKTYFEEIGSYDPGMDIWGGE 341

Query: 405 NLEISFRIWMCGGSLELIPCSRVGHVFRKRRPYGV-GEKQDYMLQNSMRMARVWMDDYVK 463
           NLE+SFRIW CGGSLE++ CS VGHVFRK  PY   G     + +N+ R+A VWMDD+  
Sbjct: 342 NLEMSFRIWQCGGSLEIVTCSHVGHVFRKATPYSFPGGTGQVINKNNRRLAEVWMDDFKD 401

Query: 464 KVIEVNPSAAHVEIGDISERKALRERLQCKTFKWYLDNMWFETD--RSELVLGR------ 515
               ++P    V+ GD+S RK LR+ L CK F WYL+N++ ++   R    LG       
Sbjct: 402 FFYIISPGVMRVDYGDVSSRKGLRDALHCKPFSWYLENIYPDSQIPRRYYSLGEIRNVET 461

Query: 516 TLCLD---ASNNVAPILGKCHEMGGTQEWKHKGTASSPIYNTAAGMCLGVDRSYRGETVL 572
             C+D      N       CH MGG Q + +  TA   I      +CL V R      VL
Sbjct: 462 NQCVDNMGRKENEKVGFFNCHGMGGNQVFSY--TADKEI--RTDDLCLDVSR--LNGPVL 515

Query: 573 MVICDDYSNNK 583
           M+ C     N+
Sbjct: 516 MLKCHHMKGNQ 526



 Score = 75.4 bits (177), Expect = 4e-12
 Identities = 32/81 (39%), Positives = 56/81 (69%)

Query: 115 YNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHS 174
           + ++ FN + S  I  +R LPD R   C+++ Y D++P  S++I F+NE + TL+R+VHS
Sbjct: 78  FKINQFNLMASDMIALNRSLPDVRLDGCKTKVYPDDVPNTSVVIVFHNEAWSTLLRTVHS 137

Query: 175 IMDRTDQKHIKEIILVDDYSD 195
           +++R+ +  + EI+LVDD S+
Sbjct: 138 VINRSPRHLLVEIVLVDDASE 158


>UniRef50_UPI00015B515F Cluster: PREDICTED: similar to
           n-acetylgalactosaminyltransferase; n=1; Nasonia
           vitripennis|Rep: PREDICTED: similar to
           n-acetylgalactosaminyltransferase - Nasonia vitripennis
          Length = 826

 Score =  256 bits (628), Expect = 9e-67
 Identities = 140/284 (49%), Positives = 178/284 (62%), Gaps = 25/284 (8%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           ++LL+  +R+GL+RARL GA ++ GDVL+FLD+H EV   WL PLL+R+ +  + V    
Sbjct: 433 VKLLRLDERQGLVRARLKGAKSATGDVLMFLDAHCEVTKQWLEPLLQRIKEKKNAV---- 488

Query: 317 SARAVTPVIDVINADTFEYS----PSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLK 372
               VTP+ID I+ +TFEYS    PS    GGF W  HF W N+ +  L +    + P+K
Sbjct: 489 ----VTPIIDNISEETFEYSHSDEPSFFQVGGFTWSGHFTWINIQEADLKSKTSAISPVK 544

Query: 373 SPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFR 432
           SPTMAGGLFAI R+YF  IG YD  M  WGGENLE+SFRIW CGG LE IPCSRVGHVFR
Sbjct: 545 SPTMAGGLFAINRKYFWDIGSYDDKMEGWGGENLEMSFRIWQCGGVLETIPCSRVGHVFR 604

Query: 433 KRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVE---IGDISERKALRER 489
              PY     +D    N+ R+A VWMDDY K++  ++      +   IGDI ER  LRE+
Sbjct: 605 NFLPYKFPMDKDTHGINTARLANVWMDDY-KRLYYLHREEYKDKPELIGDIKERVNLREK 663

Query: 490 LQCKTFKWYLDNMW---FETDRSELVLGR------TLCLDASNN 524
           L+CK+FKWYLDN++   F  D +    GR       LCLD   N
Sbjct: 664 LKCKSFKWYLDNVYPEKFIPDENVQAFGRVQVQKGNLCLDNLQN 707



 Score = 75.8 bits (178), Expect = 3e-12
 Identities = 37/77 (48%), Positives = 52/77 (67%)

Query: 119 AFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDR 178
           A N ++S +I   R LPD R+ LC++  Y   LP ASIII F+NE +  L+R+V+S++  
Sbjct: 337 AVNIILSNKIPLQRKLPDVRDPLCKNVTYDSVLPSASIIIIFHNEAFSVLLRTVYSVIKE 396

Query: 179 TDQKHIKEIILVDDYSD 195
           T  K +KEIILVDD S+
Sbjct: 397 TPPKLLKEIILVDDKSN 413


>UniRef50_Q96FL9 Cluster: Polypeptide
           N-acetylgalactosaminyltransferase 14; n=30;
           Tetrapoda|Rep: Polypeptide
           N-acetylgalactosaminyltransferase 14 - Homo sapiens
           (Human)
          Length = 552

 Score =  256 bits (627), Expect = 1e-66
 Identities = 139/335 (41%), Positives = 196/335 (58%), Gaps = 24/335 (7%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           ++ L+ ++R+GL+R+R+ GAD + G  L FLDSH EVN  WL PLL R       VK  Y
Sbjct: 168 VKCLRNNERQGLVRSRIRGADIAQGTTLTFLDSHCEVNRDWLQPLLHR-------VKEDY 220

Query: 317 SARAVTPVIDVINADTFEYSPSPL-VRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPT 375
           + R V PVID+IN DTF Y  S   +RGGF+W LHF+W+ L         D  +P+++P 
Sbjct: 221 T-RVVCPVIDIINLDTFTYIESASELRGGFDWSLHFQWEQLSPEQKARRLDPTEPIRTPI 279

Query: 376 MAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRR 435
           +AGGLF I + +F+ +GKYD  M++WGGEN EISFR+WMCGGSLE++PCSRVGHVFRK+ 
Sbjct: 280 IAGGLFVIDKAWFDYLGKYDMDMDIWGGENFEISFRVWMCGGSLEIVPCSRVGHVFRKKH 339

Query: 436 PYGV--GEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCK 493
           PY    G    Y ++N+ R A VWMD+Y +      P A     G++  R  LR+ L+C+
Sbjct: 340 PYVFPDGNANTY-IKNTKRTAEVWMDEYKQYYYAARPFALERPFGNVESRLDLRKNLRCQ 398

Query: 494 TFKWYLDNMWFETD---RSELVLG----RTLCLDA---SNNVAP--ILGKCHEMGGTQEW 541
           +FKWYL+N++ E      S +  G    R  CL++   +N   P   L  C ++ G    
Sbjct: 399 SFKWYLENIYPELSIPKESSIQKGNIRQRQKCLESQRQNNQETPNLKLSPCAKVKGEDAK 458

Query: 542 KHKGTASSPIYNTAAGMCLGVDRSYRGETVLMVIC 576
                 +         +CL V   + G  V++V+C
Sbjct: 459 SQVWAFTYTQQILQEELCLSVITLFPGAPVVLVLC 493



 Score = 90.6 bits (215), Expect = 1e-16
 Identities = 50/115 (43%), Positives = 71/115 (61%), Gaps = 2/115 (1%)

Query: 83  QAIKMSKKTENDLEEQFGLIR--NSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNK 140
           Q  K S    +DL +QF   R  N++  R+ D  Y L+AFN   S+RI  +R +PDTR+ 
Sbjct: 40  QTPKPSDADWDDLWDQFDERRYLNAKKWRVGDDPYKLYAFNQRESERISSNRAIPDTRHL 99

Query: 141 LCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSD 195
            C    Y  +LP  SIII F+NE   TL+R++ S+++RT    I+EIILVDD+S+
Sbjct: 100 RCTLLVYCTDLPPTSIIITFHNEARSTLLRTIRSVLNRTPTHLIREIILVDDFSN 154


>UniRef50_UPI00015B5D50 Cluster: PREDICTED: similar to
           ENSANGP00000021852; n=1; Nasonia vitripennis|Rep:
           PREDICTED: similar to ENSANGP00000021852 - Nasonia
           vitripennis
          Length = 612

 Score =  254 bits (623), Expect = 4e-66
 Identities = 125/251 (49%), Positives = 164/251 (65%), Gaps = 14/251 (5%)

Query: 258 RLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRYS 317
           R+L++ KR GL+ ARL GA+ + G+VL FLD+H E   GWL PLL+ +S+          
Sbjct: 221 RVLRSDKRVGLVNARLMGANEAKGEVLTFLDAHCECTAGWLEPLLEAISKN--------R 272

Query: 318 ARAVTPVIDVINADTFEYSPS-PLVRGGFNWGLHFKWDNLPKGTLIND--EDFMKPLKSP 374
            R V+PVID+IN DTF Y+ S  L  G FNW LHF+W  L  G L+ +  E+ + P K+P
Sbjct: 273 TRVVSPVIDIINDDTFSYTRSFELHWGAFNWDLHFRWLML-NGALLRERRENIVDPFKTP 331

Query: 375 TMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKR 434
            MAGGLF++ REYF  +G YD  M +WGGENLE+SFR+W CGGS+E+ PCS VGH+FRK 
Sbjct: 332 AMAGGLFSMDREYFFELGSYDEHMRIWGGENLELSFRVWQCGGSVEIAPCSHVGHIFRKS 391

Query: 435 RPYGV-GEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHV-EIGDISERKALRERLQC 492
            PY   G   + +  N  R+A VWMD++ K     NP A  V +   I  R  LRERL+C
Sbjct: 392 SPYTFPGGVDEILYGNLARVALVWMDEWGKFYFNFNPQAQRVRDKQQIRSRLELRERLKC 451

Query: 493 KTFKWYLDNMW 503
           K+F+WYLDN+W
Sbjct: 452 KSFEWYLDNVW 462



 Score = 82.2 bits (194), Expect = 4e-14
 Identities = 43/108 (39%), Positives = 69/108 (63%), Gaps = 1/108 (0%)

Query: 105 SEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQY-FDELPRASIIICFYNE 163
           ++D +   + + ++ +N L S RI  +R LPD R K C ++     +LP  S+II F+NE
Sbjct: 110 AKDFQKMQQLFQINRYNLLASDRIPLNRTLPDVRKKKCITRYANLGDLPSTSVIIVFHNE 169

Query: 164 HYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLN 211
            + TL+R+VHS+++R+ +K ++EIILVDD SD   L   + E V +LN
Sbjct: 170 AWSTLLRTVHSVINRSPRKLLEEIILVDDNSDRDFLRKPLDEYVAQLN 217


>UniRef50_UPI00015B453F Cluster: PREDICTED: similar to GA20875-PA;
           n=1; Nasonia vitripennis|Rep: PREDICTED: similar to
           GA20875-PA - Nasonia vitripennis
          Length = 793

 Score =  254 bits (623), Expect = 4e-66
 Identities = 142/288 (49%), Positives = 175/288 (60%), Gaps = 25/288 (8%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           ++LL+  KR+GLIRARL GA  + GDVLVFLD+H EV  GWL PLL R       +K R 
Sbjct: 360 VKLLRLPKRQGLIRARLAGAQQATGDVLVFLDAHCEVTKGWLSPLLHR-------IKARP 412

Query: 317 SARAVTPVIDVINADTFEYS----PSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLK 372
           +A  + PVIDVI+A T EY      S +  GGF W   F W N+           + P+ 
Sbjct: 413 NA-VLIPVIDVIDAKTLEYKLAARGSHMPIGGFKWTGDFTWINMEDSPKRTTASPIDPIN 471

Query: 373 SPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFR 432
           +PTMAGGLFAI R+YF  IG YD  M+ WGGENLE+SFRIW CGGS+E++PCSRVGH+FR
Sbjct: 472 TPTMAGGLFAIDRKYFWVIGSYDELMDGWGGENLEMSFRIWQCGGSIEIVPCSRVGHIFR 531

Query: 433 KRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVI--EVNPSAAHVEIGDISERKALRERL 490
              PY     +D  L N+ R A VWMDDY +       N      EIGD++ RK LRERL
Sbjct: 532 DFFPYEFPSSRDTYLINTARAAHVWMDDYKRLFFLHHKNMEGNTKEIGDLTARKKLRERL 591

Query: 491 QCKTFKWYLDNMW---FETDRSELVLG------RTLCLDA--SNNVAP 527
           QC +FKWYL N++   F  D + L  G      R LCLD+  SN+  P
Sbjct: 592 QCASFKWYLQNVYPEKFIPDENVLAYGRARSPRRNLCLDSITSNDEHP 639



 Score = 72.1 bits (169), Expect = 4e-11
 Identities = 33/81 (40%), Positives = 53/81 (65%)

Query: 115 YNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHS 174
           Y+  A N ++S +I   R + D R+ LC+S  Y  +LP  S++I F+NE +  L+R+V+S
Sbjct: 260 YSKRAVNVVLSNKIPLQRRIRDMRDPLCKSVTYDTKLPTTSVVIIFHNEAWSVLLRTVYS 319

Query: 175 IMDRTDQKHIKEIILVDDYSD 195
           ++  +  K +KEIILVDD S+
Sbjct: 320 VLQESPPKFLKEIILVDDNSN 340


>UniRef50_Q8IXK2 Cluster: Polypeptide
           N-acetylgalactosaminyltransferase 12; n=41;
           Eumetazoa|Rep: Polypeptide
           N-acetylgalactosaminyltransferase 12 - Homo sapiens
           (Human)
          Length = 581

 Score =  253 bits (620), Expect = 8e-66
 Identities = 148/344 (43%), Positives = 204/344 (59%), Gaps = 39/344 (11%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           +RL++ +KREGL+RARL GA  + GDVL FLD H E + GWL PLL+R+ +    V    
Sbjct: 197 VRLIRANKREGLVRARLLGASAARGDVLTFLDCHCECHEGWLEPLLQRIHEEESAV---- 252

Query: 317 SARAVTPVIDVINADTFEY---SPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKS 373
               V PVIDVI+ +TFEY   S  P + GGF+W L F W  +P+   I  +  +  ++S
Sbjct: 253 ----VCPVIDVIDWNTFEYLGNSGEPQI-GGFDWRLVFTWHTVPERERIRMQSPVDVIRS 307

Query: 374 PTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRK 433
           PTMAGGLFA+ ++YF  +G YD GM VWGGENLE SFRIW CGG LE  PCS VGHVF K
Sbjct: 308 PTMAGGLFAVSKKYFEYLGSYDTGMEVWGGENLEFSFRIWQCGGVLETHPCSHVGHVFPK 367

Query: 434 RRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCK 493
           + PY     ++  L NS+R A VWMD++ +     NP A     GD++ERK LR++LQCK
Sbjct: 368 QAPY----SRNKALANSVRAAEVWMDEFKELYYHRNPRARLEPFGDVTERKQLRDKLQCK 423

Query: 494 TFKWYLDNMWFETDRSE-------LVLGRTL---CLDAS----NNVA---PILGKCHEMG 536
            FKW+L+ ++ E    E       ++  + L   C D +    N +     IL  CH MG
Sbjct: 424 DFKWFLETVYPELHVPEDRPGFFGMLQNKGLTDYCFDYNPPDENQIVGHQVILYLCHGMG 483

Query: 537 GTQEWKHKGTASSPI-YNT-AAGMCLGVDRSYRGETVLMVICDD 578
             Q +++  T+   I YNT     C+ V+     +T++M +C++
Sbjct: 484 QNQFFEY--TSQKEIRYNTHQPEGCIAVEAGM--DTLIMHLCEE 523



 Score = 96.3 bits (229), Expect = 2e-18
 Identities = 45/109 (41%), Positives = 71/109 (65%), Gaps = 1/109 (0%)

Query: 106 EDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQY-FDELPRASIIICFYNEH 164
           E+LR++++   LH  N  +S RI  HR LP+  N LC+ ++Y +D LPR S+II FYNE 
Sbjct: 89  EELRLQEESVRLHQINIYLSDRISLHRRLPERWNPLCKEKKYDYDNLPRTSVIIAFYNEA 148

Query: 165 YETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNV 213
           + TL+R+V+S+++ +    ++E+ILVDDYSD  +L   +   +  L  V
Sbjct: 149 WSTLLRTVYSVLETSPDILLEEVILVDDYSDREHLKERLANELSGLPKV 197


>UniRef50_O61394 Cluster: Probable N-acetylgalactosaminyltransferase
           6; n=4; Caenorhabditis|Rep: Probable
           N-acetylgalactosaminyltransferase 6 - Caenorhabditis
           elegans
          Length = 618

 Score =  252 bits (616), Expect = 3e-65
 Identities = 123/254 (48%), Positives = 168/254 (66%), Gaps = 15/254 (5%)

Query: 256 NIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVR 315
           +I+++++ +R GLIRAR+ GA  + GDVL FLDSH E   GWL PLL R         ++
Sbjct: 219 DIKIIRSKERVGLIRARMMGAQEAQGDVLTFLDSHCECTKGWLEPLLTR---------IK 269

Query: 316 YSARAVT-PVIDVINADTFEYSPS-PLVRGGFNWGLHFKWDNLPKGTLINDE-DFMKPLK 372
            + +AV  PVID+IN +TF+Y     + RGGFNW L F+W  +P         D   P++
Sbjct: 270 LNRKAVPCPVIDIINDNTFQYQKGIEMFRGGFNWNLQFRWYGMPTAMAKQHLLDPTGPIE 329

Query: 373 SPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFR 432
           SPTMAGGLF+I R YF  +G+YDPGM++WGGENLE+SFRIW CGG +E++PCS VGHVFR
Sbjct: 330 SPTMAGGLFSINRNYFEELGEYDPGMDIWGGENLEMSFRIWQCGGRVEILPCSHVGHVFR 389

Query: 433 KRRPYGV-GEKQDYMLQ-NSMRMARVWMDDYVKKVIEVNPSAAHVEIG-DISERKALRER 489
           K  P+   G+    +L  N +R+A VWMDD+     ++ P A  +    D+SER  LR++
Sbjct: 390 KSSPHDFPGKSSGKVLNTNLLRVAEVWMDDWKHYFYKIAPQAHRMRSSIDVSERVELRKK 449

Query: 490 LQCKTFKWYLDNMW 503
           L CK+FKWYL N++
Sbjct: 450 LNCKSFKWYLQNVF 463



 Score = 87.0 bits (206), Expect = 1e-15
 Identities = 39/90 (43%), Positives = 60/90 (66%)

Query: 106 EDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHY 165
           E  ++ D  + ++ FN L+S  I   R LP+ R   C++  Y D LP  S+II ++NE Y
Sbjct: 111 EQQKLADSTFAVNQFNLLVSDGISVRRSLPEIRKPSCRNMTYPDNLPTTSVIIVYHNEAY 170

Query: 166 ETLMRSVHSIMDRTDQKHIKEIILVDDYSD 195
            TL+R+V S++DR+ ++ +KEIILVDD+SD
Sbjct: 171 STLLRTVWSVIDRSPKELLKEIILVDDFSD 200


>UniRef50_UPI0000E4974C Cluster: PREDICTED: hypothetical protein;
           n=3; Strongylocentrotus purpuratus|Rep: PREDICTED:
           hypothetical protein - Strongylocentrotus purpuratus
          Length = 953

 Score =  242 bits (592), Expect = 2e-62
 Identities = 133/301 (44%), Positives = 179/301 (59%), Gaps = 28/301 (9%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           IR+++  KR GLI+AR+ G D S G+   FLDSH+EV +GWL PLL RL+   D   V  
Sbjct: 575 IRMVRAEKRLGLIKARMMGVDASEGETFTFLDSHVEVMIGWLEPLLARLAS--DRTIV-- 630

Query: 317 SARAVTPVIDVINADTFEYS--PSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSP 374
               V PV+D IN DTF Y+  P PL RGGFNW   ++W  +P       +  + P+KSP
Sbjct: 631 ----VMPVVDEINKDTFNYNVVPEPLQRGGFNWRFEYRWKPIPNYDKRPSK--VAPIKSP 684

Query: 375 TMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKR 434
            M GGL  + R +F  +G +D GM VWGGENLE S +IWMCGGS+E+IPCSRVGHV+R  
Sbjct: 685 AMPGGLLTMDRSFFLELGGFDLGMEVWGGENLETSLKIWMCGGSIEIIPCSRVGHVYRDT 744

Query: 435 RPYG-VGEKQ-DYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQC 492
            PY  +G+   D +  N+MR+  VW D++     +  P   + + GD+S+RK LRE L C
Sbjct: 745 SPYSFLGQNPLDIVEHNAMRVVEVWTDEHKYHFYDRLPMLKNRDFGDVSKRKKLRESLNC 804

Query: 493 KTFKWYLDNMWFE--TDRSELVL-------GRTLCLDAS--NNVA--PILG-KCHEMGGT 538
             F WYL N++ E     S  VL       G  LC+D++  N  A   ++G  CH +GG 
Sbjct: 805 YDFNWYLANVYPELYVPSSSSVLRQTINNKGSKLCIDSNDQNGQAGKNLIGWHCHNLGGN 864

Query: 539 Q 539
           +
Sbjct: 865 E 865


>UniRef50_UPI0000D564C6 Cluster: PREDICTED: similar to CG8182-PA,
           isoform A; n=1; Tribolium castaneum|Rep: PREDICTED:
           similar to CG8182-PA, isoform A - Tribolium castaneum
          Length = 545

 Score =  241 bits (590), Expect = 4e-62
 Identities = 122/261 (46%), Positives = 163/261 (62%), Gaps = 15/261 (5%)

Query: 249 EVKNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQG 308
           E + +   +RL+    R GLIRARL GA  + GDVL+FLD+H E    W+ PLL R+ Q 
Sbjct: 156 ETRLSSTKLRLIHLKTRMGLIRARLQGARIATGDVLIFLDAHCEATTDWMEPLLSRIEQE 215

Query: 309 VDGVKVRYSARAVTPVIDVINADTFEYSPSPLVR---GGFNWGLHFKWDNLPKGTLINDE 365
              V V        P+IDVI A+T  YS +       GGF+W  HF W ++       D+
Sbjct: 216 PTAVLV--------PIIDVIEANTLAYSTNGDTSYQVGGFSWSGHFTWIDIQNE---EDK 264

Query: 366 DFMKPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCS 425
             + P+KSPTMAGGLFAI R++F  IG YD  M+ WGGENLE+SFRIW CGG LE +PCS
Sbjct: 265 HKLTPVKSPTMAGGLFAIDRKFFWEIGSYDEQMDGWGGENLEMSFRIWQCGGRLETVPCS 324

Query: 426 RVGHVFRKRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAA-HVEIGDISERK 484
           RVGH+FR   PY   + +D    N+ R+A VWMDDY +      P+   +  +GD++ RK
Sbjct: 325 RVGHIFRDFHPYSFPDNKDTHGINTARLAHVWMDDYKRFFFMYQPALENNPVVGDLTHRK 384

Query: 485 ALRERLQCKTFKWYLDNMWFE 505
            LR++L+CK+FKWYL+N++ E
Sbjct: 385 QLRQKLRCKSFKWYLENVYPE 405



 Score = 63.3 bits (147), Expect = 2e-08
 Identities = 33/89 (37%), Positives = 56/89 (62%), Gaps = 1/89 (1%)

Query: 106 EDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHY 165
           +D +  +K     A NT++S R+   R L D RN  C++  Y  +L +AS+++ FYNE  
Sbjct: 55  QDAKEGEKALKKFALNTVLSDRMPLDRKLRDPRNPKCKTFTYNPKL-KASVVVIFYNELL 113

Query: 166 ETLMRSVHSIMDRTDQKHIKEIILVDDYS 194
             ++R+V S++ +T ++ ++EIILVDD S
Sbjct: 114 SVILRTVWSVILQTPKELLEEIILVDDAS 142


>UniRef50_Q9D4M9 Cluster: Putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 5; n=9;
           Eutheria|Rep: Putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 5 - Mus
           musculus (Mouse)
          Length = 431

 Score =  241 bits (590), Expect = 4e-62
 Identities = 115/251 (45%), Positives = 163/251 (64%), Gaps = 9/251 (3%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           ++L++  KREGLIR+++ GA  + GD+LVFLDSH EVN  WL PLL  +++    V    
Sbjct: 177 VKLIRNKKREGLIRSKMIGASRASGDILVFLDSHCEVNRVWLEPLLHAIAKDHKMV---- 232

Query: 317 SARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTM 376
               V P+IDVIN  T +Y  +P+VRG F+W L+ +WDN+    L   E    P++SP M
Sbjct: 233 ----VCPIIDVINELTLDYMAAPIVRGAFDWNLNLRWDNVFAYELDGPEGPSTPIRSPAM 288

Query: 377 AGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRRP 436
            GG+FAI R YFN +G+YD GM++ GGEN+E+S RIWMCGG L ++PCSRVG+  +    
Sbjct: 289 TGGIFAINRHYFNELGQYDNGMDICGGENVELSLRIWMCGGQLFILPCSRVGYNSKALSQ 348

Query: 437 YGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCKTFK 496
           +     Q  + +N +R+  VW+D+Y        PS  +V  G+ISER  LR+RL CK+F+
Sbjct: 349 HR-RANQSALSRNLLRVVHVWLDEYKGNFFLQRPSLTYVSCGNISERVELRKRLGCKSFQ 407

Query: 497 WYLDNMWFETD 507
           WYLDN++ E +
Sbjct: 408 WYLDNIFPELE 418



 Score = 92.3 bits (219), Expect = 3e-17
 Identities = 47/132 (35%), Positives = 84/132 (63%), Gaps = 5/132 (3%)

Query: 68  EYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQR 127
           E  ++ +LK++   + A + ++ +E+   ++     N  D  +  +G   +  N ++S+R
Sbjct: 36  ENEKEELLKKRSLGKNAHQQTRHSEDVTHDEV----NFSDPELI-QGLRRYGLNAIMSRR 90

Query: 128 IGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEI 187
           +G  R++PD+R+K+CQ + Y   LP ASIIICFYNE + TL+R+V S+++ + Q  ++EI
Sbjct: 91  LGIEREVPDSRDKICQQKHYPFNLPTASIIICFYNEEFNTLLRAVSSVVNLSPQHLLEEI 150

Query: 188 ILVDDYSDLYNL 199
           ILVDD S+  +L
Sbjct: 151 ILVDDMSEFDDL 162


>UniRef50_Q86SR1 Cluster: Polypeptide
           N-acetylgalactosaminyltransferase 10; n=77;
           Coelomata|Rep: Polypeptide
           N-acetylgalactosaminyltransferase 10 - Homo sapiens
           (Human)
          Length = 603

 Score =  239 bits (586), Expect = 1e-61
 Identities = 113/246 (45%), Positives = 157/246 (63%), Gaps = 13/246 (5%)

Query: 256 NIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVR 315
           ++R+L+T KREGLIR R+ GA  + GDV+ FLDSH E NV WLPPLL R++        R
Sbjct: 205 SVRILRTKKREGLIRTRMLGASVATGDVITFLDSHCEANVNWLPPLLDRIA--------R 256

Query: 316 YSARAVTPVIDVINADTFEYSPSP--LVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKS 373
                V P+IDVI+ D F Y       +RG F+W +++K   +P    +   D   P +S
Sbjct: 257 NRKTIVCPMIDVIDHDDFRYETQAGDAMRGAFDWEMYYK--RIPIPPELQKADPSDPFES 314

Query: 374 PTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRK 433
           P MAGGLFA+ R++F  +G YDPG+ +WGGE  EISF++WMCGG +E IPCSRVGH++RK
Sbjct: 315 PVMAGGLFAVDRKWFWELGGYDPGLEIWGGEQYEISFKVWMCGGRMEDIPCSRVGHIYRK 374

Query: 434 RRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCK 493
             PY V      + +N  R+A VWMD+Y + + +  P   H+  GD++ +K LR  L CK
Sbjct: 375 YVPYKVPAGVS-LARNLKRVAEVWMDEYAEYIYQRRPEYRHLSAGDVAVQKKLRSSLNCK 433

Query: 494 TFKWYL 499
           +FKW++
Sbjct: 434 SFKWFM 439



 Score = 89.4 bits (212), Expect = 2e-16
 Identities = 44/116 (37%), Positives = 73/116 (62%), Gaps = 4/116 (3%)

Query: 112 DKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRS 171
           D+ Y  + FN  +S +I  +R LPD R+  C S++Y + LP  SIII F+NE + +L+R+
Sbjct: 105 DQAYRENGFNIYVSDKISLNRSLPDIRHPNCNSKRYLETLPNTSIIIPFHNEGWSSLLRT 164

Query: 172 VHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNV----IKKEEEMIET 223
           VHS+++R+  + + EI+LVDD+SD  +L   +++ +    +V     KK E +I T
Sbjct: 165 VHSVLNRSPPELVAEIVLVDDFSDREHLKKPLEDYMALFPSVRILRTKKREGLIRT 220



 Score = 37.9 bits (84), Expect = 0.75
 Identities = 21/72 (29%), Positives = 35/72 (48%), Gaps = 6/72 (8%)

Query: 515 RTLCLDASNNVAPI-LGKCHEMGGTQEWKHKGTASSPIYNTAAGMCLGVDRSYRGETVLM 573
           +  C DA ++ +P+ L  CH M G Q WK++      +Y+  +G C+  D S     + M
Sbjct: 520 KKFCFDAISHTSPVTLYDCHSMKGNQLWKYR--KDKTLYHPVSGSCM--DCSESDHRIFM 575

Query: 574 VICDDYS-NNKW 584
             C+  S   +W
Sbjct: 576 NTCNPSSLTQQW 587


>UniRef50_Q6WV20 Cluster: Polypeptide
           N-acetylgalactosaminyltransferase 1; n=5; Diptera|Rep:
           Polypeptide N-acetylgalactosaminyltransferase 1 -
           Drosophila melanogaster (Fruit fly)
          Length = 601

 Score =  239 bits (584), Expect = 2e-61
 Identities = 123/259 (47%), Positives = 160/259 (61%), Gaps = 18/259 (6%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           + +L+   R GLIRARL GA  + GDVL+FLD+H E N+GW  PLL+R+ +    V V  
Sbjct: 213 VTILRLKNRLGLIRARLAGARIATGDVLIFLDAHCEGNIGWCEPLLQRIKESRTSVLV-- 270

Query: 317 SARAVTPVIDVINADTFEYSPSPLVR---GGFNWGLHFKWDNLPKGTL------INDEDF 367
                 P+IDVI+A+ F+YS +       GGF W  HF W NLP+            E  
Sbjct: 271 ------PIIDVIDANDFQYSTNGYKSFQVGGFQWNGHFDWINLPEREKQRQRRECKQERE 324

Query: 368 MKPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRV 427
           + P  SPTMAGGLFAI R YF  +G YD  M+ WGGENLE+SFRIW CGG++E IPCSRV
Sbjct: 325 ICPAYSPTMAGGLFAIDRRYFWEVGSYDEQMDGWGGENLEMSFRIWQCGGTIETIPCSRV 384

Query: 428 GHVFRKRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAA-HVEIGDISERKAL 486
           GH+FR   PY     +D    N+ RMA VWMD+Y+       P    H +IGD++ R  L
Sbjct: 385 GHIFRDFHPYKFPNDRDTHGINTARMALVWMDEYINIFFLNRPDLKFHADIGDVTHRVML 444

Query: 487 RERLQCKTFKWYLDNMWFE 505
           R++L+CK+F+WYL N++ E
Sbjct: 445 RKKLRCKSFEWYLKNIYPE 463



 Score = 77.8 bits (183), Expect = 8e-13
 Identities = 46/117 (39%), Positives = 71/117 (60%), Gaps = 3/117 (2%)

Query: 82  QQAIKMS-KKTENDLEEQFGLIRNSEDLRIR-DKGYNLHAFNTLISQRIGDHRDLPDTRN 139
           +Q I++  +K +  L EQ   +  S   + R D+ Y   A N  +S+++  +R + D RN
Sbjct: 76  EQIIQLDLQKQKVGLGEQGVAVHLSGAAKERGDEIYKKIALNEELSEQLTYNRSVGDHRN 135

Query: 140 KLCQSQQY-FDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSD 195
            LC  Q++  D LP AS++I F+NE Y  L+R+VHS +   ++K +KEIILVDD SD
Sbjct: 136 PLCAKQRFDSDSLPTASVVIIFFNEPYSVLLRTVHSTLSTCNEKALKEIILVDDGSD 192


>UniRef50_Q9U2C4 Cluster: Probable N-acetylgalactosaminyltransferase
           9; n=3; Caenorhabditis|Rep: Probable
           N-acetylgalactosaminyltransferase 9 - Caenorhabditis
           elegans
          Length = 579

 Score =  238 bits (583), Expect = 3e-61
 Identities = 118/250 (47%), Positives = 153/250 (61%), Gaps = 12/250 (4%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           +RL++   R GLIRA+L GA  +VGD++VFLDSH E N GWL P+++R+S     +    
Sbjct: 196 VRLIRKHVRHGLIRAKLAGAREAVGDIIVFLDSHCEANHGWLEPIVQRISDERTAI---- 251

Query: 317 SARAVTPVIDVINADTFEYSPS-PLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPT 375
               V P+ID I+ +T  Y     L  GGF+W LHF W+ L +            ++SPT
Sbjct: 252 ----VCPMIDSISDNTLAYHGDWSLSTGGFSWALHFTWEGLSEEEQKRRTKPTDYIRSPT 307

Query: 376 MAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRR 435
           MAGGL A  REYF  +G YD  M++WGGENLEISFR WMCGGS+E IPCS VGH+FR   
Sbjct: 308 MAGGLLAANREYFFEVGGYDEEMDIWGGENLEISFRAWMCGGSIEFIPCSHVGHIFRAGH 367

Query: 436 PY---GVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQC 492
           PY   G    +D    NS R+A VWMDDY +            ++GD++ R  LR+RL C
Sbjct: 368 PYNMTGRNNNKDVHGTNSKRLAEVWMDDYKRLYYMHREDLRTKDVGDLTARHELRKRLNC 427

Query: 493 KTFKWYLDNM 502
           K FKW+LDN+
Sbjct: 428 KPFKWFLDNI 437



 Score = 64.9 bits (151), Expect = 6e-09
 Identities = 31/96 (32%), Positives = 57/96 (59%), Gaps = 1/96 (1%)

Query: 121 NTLISQRIGDHRDLPDTRNKLCQSQQY-FDELPRASIIICFYNEHYETLMRSVHSIMDRT 179
           N   S +I   RD+PD R + C+  +Y +  LP+ S+II F +E +  L+R+VHS+++R+
Sbjct: 102 NVHASDKISLDRDVPDPRIQACKDIKYDYAALPKTSVIIIFTDEAWTPLLRTVHSVINRS 161

Query: 180 DQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIK 215
             + ++E+IL+DD S    L   + E + +    ++
Sbjct: 162 PPELLQEVILLDDNSKRQELQEPLDEHIKRFGGKVR 197


>UniRef50_Q14435 Cluster: Polypeptide
           N-acetylgalactosaminyltransferase 3; n=53;
           Euteleostomi|Rep: Polypeptide
           N-acetylgalactosaminyltransferase 3 - Homo sapiens
           (Human)
          Length = 633

 Score =  238 bits (582), Expect = 3e-61
 Identities = 130/327 (39%), Positives = 194/327 (59%), Gaps = 33/327 (10%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           +++++  +R+GLI ARL GA  +  + L FLD+H E   GWL PLL R+++        Y
Sbjct: 246 VKIVRQRERKGLITARLLGATVATAETLTFLDAHCECFYGWLEPLLARIAEN-------Y 298

Query: 317 SARAVTPVIDVINADTFEYS-PSPLV----RGGFNWGLHFKWDNLPKGTLINDEDFMKPL 371
           +A  V+P I  I+ +TFE++ PSP      RG F+W L F W++LP       +D   P+
Sbjct: 299 TA-VVSPDIASIDLNTFEFNKPSPYGSNHNRGNFDWSLSFGWESLPDHEKQRRKDETYPI 357

Query: 372 KSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVF 431
           K+PT AGGLF+I +EYF  IG YD  M +WGGEN+E+SFR+W CGG LE++PCS VGHVF
Sbjct: 358 KTPTFAGGLFSISKEYFEYIGSYDEEMEIWGGENIEMSFRVWQCGGQLEIMPCSVVGHVF 417

Query: 432 RKRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHV----EIGDISERKALR 487
           R + P+   +    + +N +R+A VWMD+Y +     N  AA +      GD+S+R  ++
Sbjct: 418 RSKSPHSFPKGTQVIARNQVRLAEVWMDEYKEIFYRRNTDAAKIVKQKAFGDLSKRFEIK 477

Query: 488 ERLQCKTFKWYLDNMWFET---DRSELV------LGRTLCLDASNN----VAPILGKCHE 534
            RL+CK F WYL+N++ E    D + ++      +G+ LCLD   N       I+  CH 
Sbjct: 478 HRLRCKNFTWYLNNIYPEVYVPDLNPVISGYIKSVGQPLCLDVGENNQGGKPLIMYTCHG 537

Query: 535 MGGTQEWKHKGTASSPI-YNTAAGMCL 560
           +GG Q +++  +A   I +N    +CL
Sbjct: 538 LGGNQYFEY--SAQHEIRHNIQKELCL 562



 Score = 79.0 bits (186), Expect = 3e-13
 Identities = 46/111 (41%), Positives = 65/111 (58%), Gaps = 3/111 (2%)

Query: 106 EDLRIRDKGYNLHAFNTLISQRIGDHRDL-PDTRNKLCQSQQY--FDELPRASIIICFYN 162
           E+ + +++G   H FN   S RI  HRDL PDTR   C  Q++     LP  S+II F+N
Sbjct: 136 EEQKEKERGEAKHCFNAFASDRISLHRDLGPDTRPPECIEQKFKRCPPLPTTSVIIVFHN 195

Query: 163 EHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNV 213
           E + TL+R+VHS++  +    +KEIILVDD S    LH  + E V + + V
Sbjct: 196 EAWSTLLRTVHSVLYSSPAILLKEIILVDDASVDEYLHDKLDEYVKQFSIV 246


>UniRef50_O45293 Cluster: Probable N-acetylgalactosaminyltransferase
           8; n=2; Caenorhabditis|Rep: Probable
           N-acetylgalactosaminyltransferase 8 - Caenorhabditis
           elegans
          Length = 421

 Score =  236 bits (577), Expect = 1e-60
 Identities = 106/250 (42%), Positives = 161/250 (64%), Gaps = 9/250 (3%)

Query: 259 LLKTSK-REGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRYS 317
           ++K S+ R+GLIRA+++ +  + G+V+VF+DSH EV   WL PLL+ + +    +     
Sbjct: 173 IIKRSEYRQGLIRAKVHASRLATGEVIVFMDSHCEVAERWLEPLLQPIKEDPKSI----- 227

Query: 318 ARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTMA 377
              V PV+D+IN  +F+YSPS + + GF+WG  FKW  LP       E+ +KP  SP M 
Sbjct: 228 ---VLPVVDLINPVSFDYSPSMVAKSGFDWGFTFKWIYLPWEYFETPENNVKPFNSPAMP 284

Query: 378 GGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRRPY 437
           GGL A+ +EYF  +G+YD GM +WG EN+E+S + W+CGG + + PCSRVGHVFR RRPY
Sbjct: 285 GGLLAMRKEYFVELGEYDMGMEIWGSENIELSLKAWLCGGRVVVAPCSRVGHVFRMRRPY 344

Query: 438 GVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCKTFKW 497
                 D  L N++R+A+ W+ +Y  K   V P  A +  GD++E   +++RL+CK  KW
Sbjct: 345 TSKPGMDTALYNAVRVAKTWLGEYESKFFAVKPRGAKMVFGDLTEPMQVKDRLKCKDMKW 404

Query: 498 YLDNMWFETD 507
           +++N++ E +
Sbjct: 405 FIENVYPELE 414



 Score = 56.8 bits (131), Expect = 2e-06
 Identities = 29/108 (26%), Positives = 63/108 (58%), Gaps = 4/108 (3%)

Query: 114 GYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVH 173
           G    AF+ L S+++G +R++    +KLC+ ++Y D     S+++  +NE   T++R ++
Sbjct: 70  GIKSFAFDALSSEKLGPNRNVGKQAHKLCEEEKY-DASYSTSVVVIHHNEALSTILRMIN 128

Query: 174 SIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEMI 221
            I++ T +  +KEI+L +D S+     H + + ++K   +   E+++I
Sbjct: 129 GIIEFTPKSLLKEIVLYEDASE---EDHVLTKHLEKFAKIKGLEDKLI 173


>UniRef50_UPI0000D56CDA Cluster: PREDICTED: similar to CG4445-PA;
           n=1; Tribolium castaneum|Rep: PREDICTED: similar to
           CG4445-PA - Tribolium castaneum
          Length = 602

 Score =  233 bits (571), Expect = 7e-60
 Identities = 120/250 (48%), Positives = 154/250 (61%), Gaps = 12/250 (4%)

Query: 258 RLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRYS 317
           ++L++  R GLI+ARL GA  + G VL FLD+H E   GWL  LL  + Q    V     
Sbjct: 213 KVLRSQARIGLIKARLKGALVAKGPVLTFLDAHCECTTGWLEALLSVIKQDRTAV----- 267

Query: 318 ARAVTPVIDVINADTFEYSPS-PLVRGGFNWGLHFKWDNLPKGTL-INDEDFMKPLKSPT 375
              V PVID+IN DTF Y  S  L  G FNW L F+W  L    L +   D  +P  +PT
Sbjct: 268 ---VCPVIDIINDDTFAYVKSFELHWGAFNWNLQFRWFTLGGRELKLRKNDATQPFNTPT 324

Query: 376 MAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRR 435
           MAGGLFAI REYF  +G YD GMN+WGGENLE+SFRIW CGG +++ PCSRVGH+FRK  
Sbjct: 325 MAGGLFAIDREYFFEMGAYDDGMNIWGGENLEMSFRIWQCGGKVQIAPCSRVGHLFRKSS 384

Query: 436 PYGV-GEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVE-IGDISERKALRERLQCK 493
           PY   G     +  N  R+ARVWMDD+ +   + N  A  ++   +++ R  LR + +CK
Sbjct: 385 PYSFPGGINKTLFSNLARVARVWMDDWARFYFKFNEPADRIKNEQNVTSRIELRRKHKCK 444

Query: 494 TFKWYLDNMW 503
            F+WYLDN+W
Sbjct: 445 GFEWYLDNVW 454



 Score = 69.7 bits (163), Expect = 2e-10
 Identities = 40/106 (37%), Positives = 63/106 (59%), Gaps = 1/106 (0%)

Query: 106 EDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQ-QYFDELPRASIIICFYNEH 164
           +DL    + + ++ FN L S RI  +R LPD R K C +    +   P+ SIII F+NE 
Sbjct: 103 KDLLKMQQYFQINRFNLLASDRIPLNRSLPDFRRKKCATLFGDYPTYPKTSIIIVFHNEA 162

Query: 165 YETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKL 210
           + TL+R+V S+++R+  + ++EIILVDD S+   L   + + V  L
Sbjct: 163 WSTLLRTVWSVINRSPPELLEEIILVDDSSERKFLKKPLDDYVANL 208


>UniRef50_O61397 Cluster: Probable N-acetylgalactosaminyltransferase
           7; n=5; Bilateria|Rep: Probable
           N-acetylgalactosaminyltransferase 7 - Caenorhabditis
           elegans
          Length = 601

 Score =  232 bits (568), Expect = 2e-59
 Identities = 120/287 (41%), Positives = 173/287 (60%), Gaps = 18/287 (6%)

Query: 259 LLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRYSA 318
           +++T +REGLI AR  GA +S G+V++FLD+H EVN  WLPPLL  + +    + V    
Sbjct: 220 VVRTEQREGLINARSIGAKHSTGEVVLFLDAHCEVNTNWLPPLLAPIKRNRKVMTV---- 275

Query: 319 RAVTPVIDVINADTFEY-----SPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKS 373
               PVID I+++++EY     SP+    G F WGL +K   + +    + +   +P +S
Sbjct: 276 ----PVIDGIDSNSWEYRSVYGSPNAHHSGIFEWGLLYKETQITERETAHRKHNSQPFRS 331

Query: 374 PTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRK 433
           PT AGGLFAI R +F  +G YD G+ +WGGE  E+SF+IW CGG +  +PCS VGHV+R 
Sbjct: 332 PTHAGGLFAINRLWFKELGYYDEGLQIWGGEQYELSFKIWQCGGGIVFVPCSHVGHVYRS 391

Query: 434 RRPYGVGE--KQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQ 491
             PY  G+   +  +  N MR+ + WMDDY K  +   P A +V  GDIS + ALR++LQ
Sbjct: 392 HMPYSFGKFSGKPVISINMMRVVKTWMDDYSKYYLTREPQATNVNPGDISAQLALRDKLQ 451

Query: 492 CKTFKWYLDNMWFETDRSELVLGRTLCLDASNNVAPILGKC-HEMGG 537
           CK+FKWY++N+ ++  +S  +L        + N  P  GKC   MGG
Sbjct: 452 CKSFKWYMENVAYDVLKSYPMLPPNDVWGEARN--PATGKCLDRMGG 496



 Score = 74.1 bits (174), Expect = 9e-12
 Identities = 32/95 (33%), Positives = 60/95 (63%)

Query: 120 FNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRT 179
           FNT +S  I  +R +PD R + C+   Y ++LP  S+++ F+NE +  L+R+VHS++ R+
Sbjct: 124 FNTYVSDMISMNRTIPDIRPEECKHWDYPEKLPTVSVVVVFHNEGWTPLLRTVHSVLLRS 183

Query: 180 DQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVI 214
             + I+++++VDD SD  +L   + + V + N  +
Sbjct: 184 PPELIEQVVMVDDDSDKPHLKEKLDKYVTRFNGKV 218


>UniRef50_Q7Z7M9 Cluster: Polypeptide
           N-acetylgalactosaminyltransferase 5; n=29;
           Deuterostomia|Rep: Polypeptide
           N-acetylgalactosaminyltransferase 5 - Homo sapiens
           (Human)
          Length = 940

 Score =  231 bits (565), Expect = 4e-59
 Identities = 118/287 (41%), Positives = 177/287 (61%), Gaps = 20/287 (6%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           +R+L+  +R GLIRARL GA N+ GDVL FLDSH+E NVGWL PLL+R+           
Sbjct: 557 VRILRLKERHGLIRARLAGAQNATGDVLTFLDSHVECNVGWLEPLLERVYLS-------- 608

Query: 317 SARAVTPVIDVINADTFEY-SPSPLVRGGFNWGLHFKWDNLPKGTLINDE-DFMKPLKSP 374
             +   PVI+VIN     Y +     RG F W ++F W  +P   +  +       ++ P
Sbjct: 609 RKKVACPVIEVINDKDMSYMTVDNFQRGIFVWPMNFGWRTIPPDVIAKNRIKETDTIRCP 668

Query: 375 TMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKR 434
            MAGGLF+I + YF  +G YDPG++VWGGEN+E+SF++WMCGG +E+IPCSRVGH+FR  
Sbjct: 669 VMAGGLFSIDKSYFFELGTYDPGLDVWGGENMELSFKVWMCGGEIEIIPCSRVGHIFRND 728

Query: 435 RPYGV-GEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAH--VEIGDISERKALRERLQ 491
            PY    ++   + +N +R+A VW+D+Y +             +++G++++++ LR++L+
Sbjct: 729 NPYSFPKDRMKTVERNLVRVAEVWLDEYKELFYGHGDHLIDQGLDVGNLTQQRELRKKLK 788

Query: 492 CKTFKWYLDNMWFETDRSELVLGRTLCLDASNNVAPILGKCHEMGGT 538
           CK+FKWYL+N+ F   R+ +V    + +    NVA  LGKC  +  T
Sbjct: 789 CKSFKWYLENV-FPDLRAPIVRASGVLI----NVA--LGKCISIENT 828



 Score = 70.5 bits (165), Expect = 1e-10
 Identities = 34/75 (45%), Positives = 48/75 (64%)

Query: 120 FNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRT 179
           FN  +S  I   R + DTR   C  Q   + LP  S+I+CF +E + TL+RSVHS+++R+
Sbjct: 464 FNVYLSDLIPVDRAIEDTRPAGCAEQLVHNNLPTTSVIMCFVDEVWSTLLRSVHSVINRS 523

Query: 180 DQKHIKEIILVDDYS 194
               IKEI+LVDD+S
Sbjct: 524 PPHLIKEILLVDDFS 538


>UniRef50_Q9Y117 Cluster: Polypeptide
           N-acetylgalactosaminyltransferase 3; n=2;
           Sophophora|Rep: Polypeptide
           N-acetylgalactosaminyltransferase 3 - Drosophila
           melanogaster (Fruit fly)
          Length = 667

 Score =  227 bits (556), Expect = 5e-58
 Identities = 131/309 (42%), Positives = 173/309 (55%), Gaps = 23/309 (7%)

Query: 248 SEVKNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQ 307
           S VK      R+ +  KR GL+ ARL GA+N+ GDVL FLD+H E + GWL PLL R+ +
Sbjct: 203 SYVKVLAVPTRIFRMKKRSGLVPARLLGAENARGDVLTFLDAHCECSRGWLEPLLSRIKE 262

Query: 308 GVDGVKVRYSARAVTPVIDVINADTFEYSPSPLVR-GGFNWGLHFKW---DNLPKGTLIN 363
               V        + PVID+I+ D F Y+ +     G FNW L F+W   D   +    +
Sbjct: 263 SRKVV--------ICPVIDIISDDNFSYTKTFENHWGAFNWQLSFRWFSSDRKRQTAGNS 314

Query: 364 DEDFMKPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIP 423
            +D   P+ +P MAGGLFAI R+YF  +G YD  M VWGGEN+E+SFRIW CGG +E+ P
Sbjct: 315 SKDSTDPIATPGMAGGLFAIDRKYFYEMGSYDSNMRVWGGENVEMSFRIWQCGGRVEISP 374

Query: 424 CSRVGHVFRKRRPYGV-GEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGD--- 479
           CS VGHVFR   PY   G   + +  N  R A VWMDD+ +  I +  S   +   D   
Sbjct: 375 CSHVGHVFRSSTPYTFPGGMSEVLTDNLARAATVWMDDW-QYFIMLYTSGLTLGAKDKVN 433

Query: 480 ISERKALRERLQCKTFKWYLDNMWFE--TDRSELVLGRTLCLDASNNVAPILGK-CHEMG 536
           ++ER ALRERLQCK F WYL+N+W E      +   G+ + LD     A    K    + 
Sbjct: 434 VTERVALRERLQCKPFSWYLENIWPEHFFPAPDRFFGKIIWLDGETECAQAYSKHMKNLP 493

Query: 537 G---TQEWK 542
           G   ++EWK
Sbjct: 494 GRALSREWK 502



 Score = 74.9 bits (176), Expect = 5e-12
 Identities = 38/96 (39%), Positives = 59/96 (61%)

Query: 115 YNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHS 174
           + L++FN L S RI  +R L D R   C+ ++Y   LP  S+II F+NE +  L+R++ S
Sbjct: 113 FRLNSFNLLASDRIPLNRTLKDYRTPECRDKKYASGLPSTSVIIVFHNEAWSVLLRTITS 172

Query: 175 IMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKL 210
           +++R+ +  +KEIILVDD SD   L   ++  V  L
Sbjct: 173 VINRSPRHLLKEIILVDDASDRSYLKRQLESYVKVL 208


>UniRef50_Q176D5 Cluster: N-acetylgalactosaminyltransferase; n=3;
           Endopterygota|Rep: N-acetylgalactosaminyltransferase -
           Aedes aegypti (Yellowfever mosquito)
          Length = 661

 Score =  226 bits (553), Expect = 1e-57
 Identities = 116/265 (43%), Positives = 153/265 (57%), Gaps = 12/265 (4%)

Query: 242 KKSTENSEVKNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPL 301
           K   EN   K  V  I +L+ +KREGL+ ARL GA  + GD L FLD+H E + GWL PL
Sbjct: 191 KNDLENYVQKLPVV-ISILRLNKREGLVAARLMGARVATGDTLTFLDAHCECSPGWLEPL 249

Query: 302 LKRLSQGVDGVKVRYSARAVTPVIDVINADTFEYSPS-PLVRGGFNWGLHFKWDNLPKGT 360
           L R+ +           + V PVID+I+ D F Y  S     G FNW +HF+W  L    
Sbjct: 250 LARVQEN--------PKKVVCPVIDIISDDNFSYIKSFEFHWGAFNWQMHFRWYTLSDEE 301

Query: 361 LIND-EDFMKPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSL 419
           L    +D   P  +P MAGGLF I R+YF  +G YD  + +WGG+NLE+SFRIW CGG +
Sbjct: 302 LAERRKDTTMPFHTPAMAGGLFTIDRKYFFDVGAYDERLKIWGGDNLEMSFRIWQCGGEI 361

Query: 420 ELIPCSRVGHVFRKRRPYGV-GEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIG 478
           E+ PCS VGH+FRK  PY   G     + +N  R+A VWMDD+ K   + N      +  
Sbjct: 362 EIAPCSHVGHLFRKSSPYTFPGGVSGILNENLARVALVWMDDWAKFFFKFNKGTEEFKSL 421

Query: 479 DISERKALRERLQCKTFKWYLDNMW 503
           ++S R AL++ L CK+F WYL  +W
Sbjct: 422 NVSSRVALKKHLSCKSFDWYLRKIW 446



 Score = 88.6 bits (210), Expect = 4e-16
 Identities = 45/114 (39%), Positives = 72/114 (63%)

Query: 101 LIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICF 160
           ++  ++DL +  + + ++ +N L S R+  +R LPD R   C S++Y  +LP  SIII F
Sbjct: 92  VVIQAKDLLLMQQLFQINRYNLLASDRVALNRSLPDVRKSKCVSKEYPSKLPTTSIIIVF 151

Query: 161 YNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVI 214
           +NE +  L+R+V S++ R+ +  IKEI+LVDD SD   L +D++  V KL  VI
Sbjct: 152 HNEAWSVLLRTVWSVIIRSPRHLIKEILLVDDASDRRFLKNDLENYVQKLPVVI 205


>UniRef50_Q16SH9 Cluster: N-acetylgalactosaminyltransferase; n=2;
           Culicidae|Rep: N-acetylgalactosaminyltransferase - Aedes
           aegypti (Yellowfever mosquito)
          Length = 569

 Score =  226 bits (552), Expect = 1e-57
 Identities = 105/253 (41%), Positives = 162/253 (64%), Gaps = 13/253 (5%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           +R  +   REGLIR+R  G   + GD ++FLDSH EVN GWL PL+ RL+  VD   V  
Sbjct: 175 VRFHRNFVREGLIRSRNIGVAYASGDFVLFLDSHCEVNRGWLEPLVDRLT--VDSTAV-- 230

Query: 317 SARAVTPVIDVINADTFEYSP-SPLVRGGFNWGLHFKWDNLPKGTLIN-DEDFMKPLKSP 374
               ++P+ID+I+AD+FEY P S  +RGGF+W L F+W  + +  L + + D  +P  SP
Sbjct: 231 ----LSPIIDIIDADSFEYRPNSARLRGGFDWSLRFRWLPVAEEELEHRNHDESQPFYSP 286

Query: 375 TMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKR 434
            ++GG+F + +  F  +G +D G+ +WGGE+LE S + W+CG  +E++PCSR+GHVFR++
Sbjct: 287 AISGGVFIVSKTLFQQLGGFDGGLEIWGGESLEFSLKAWLCGAHVEVVPCSRIGHVFRRK 346

Query: 435 RPYGV--GEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQC 492
            PYG   G    Y L+N+ R+A VWMD++     +  P A+ + +G + + K L+ RL C
Sbjct: 347 HPYGFPQGSAATY-LRNTKRIASVWMDEFQNFFYKTRPEASALSVGSLQQMKDLKRRLNC 405

Query: 493 KTFKWYLDNMWFE 505
           + F WY+ N++ +
Sbjct: 406 RKFSWYMQNVFLD 418



 Score = 68.9 bits (161), Expect = 3e-10
 Identities = 35/90 (38%), Positives = 58/90 (64%), Gaps = 3/90 (3%)

Query: 115 YNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRA-SIIICFYNEHYETLMRSVH 173
           Y  H FN  +S RIG  R LPDTR+  C+++++     R  S+II F+NE   TL+R++ 
Sbjct: 79  YQRHYFNLELSDRIGVDRTLPDTRHANCKTREFLTSPGRTTSVIITFHNEATSTLLRTIG 138

Query: 174 SIMDRTDQKHIKEIILVDDYSDLYNLHHDV 203
           S++ +T  + ++EII++DD S   +L H++
Sbjct: 139 SVLKQTPPELLQEIIVIDDCST--SLEHNL 166


>UniRef50_Q8MV48 Cluster: N-acetylgalactosaminyltransferase 7; n=5;
           Endopterygota|Rep: N-acetylgalactosaminyltransferase 7 -
           Drosophila melanogaster (Fruit fly)
          Length = 591

 Score =  223 bits (545), Expect = 1e-56
 Identities = 114/261 (43%), Positives = 160/261 (61%), Gaps = 22/261 (8%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           +++++  +REGLIR R  GA  + G+V+VFLD+H EVN  WLPPLL  +          Y
Sbjct: 204 VKVIRNKEREGLIRTRSRGAMEATGEVIVFLDAHCEVNTNWLPPLLAPI----------Y 253

Query: 317 SARAV--TPVIDVINADTFEYSP----SPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKP 370
             R V   P+ID I+   FEY P        RG F WG+ +K + +P+          +P
Sbjct: 254 RDRTVMTVPIIDGIDHKNFEYRPVYGTDNHFRGIFEWGMLYKENEVPRREQRRRAHNSEP 313

Query: 371 LKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHV 430
            +SPT AGGLFAI REYF  +G YDPG+ VWGGEN E+SF+IW CGGS+E +PCSRVGHV
Sbjct: 314 YRSPTHAGGLFAINREYFLELGAYDPGLLVWGGENFELSFKIWQCGGSIEWVPCSRVGHV 373

Query: 431 FRKRRPYGVG-----EKQDYMLQNSMRMARVWMDDYVKKVIEV-NPSAAHVEIGDISERK 484
           +R   PY  G     +K   +  N  R+   W DD  K+      P A ++++GDISE+ 
Sbjct: 374 YRGFMPYNFGKLASKKKGPLITINYKRVIETWFDDTHKEYFYTREPLARYLDMGDISEQL 433

Query: 485 ALRERLQCKTFKWYLDNMWFE 505
           AL++RL CK+F+W++D++ ++
Sbjct: 434 ALKKRLNCKSFQWFMDHIAYD 454



 Score = 84.6 bits (200), Expect = 7e-15
 Identities = 41/98 (41%), Positives = 59/98 (60%)

Query: 118 HAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMD 177
           +  N   S  I  HR + DTR + C+   Y  +LPR S+II F+NE +  LMR+VHS++D
Sbjct: 108 YGMNIACSDEISMHRSVRDTRLEECRHWDYPFDLPRTSVIIVFHNEGFSVLMRTVHSVID 167

Query: 178 RTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIK 215
           R+    + EIILVDD+SD  NL   + E V +   ++K
Sbjct: 168 RSPTHMLHEIILVDDFSDKENLRSQLDEYVLQFKGLVK 205


>UniRef50_UPI0000E461C0 Cluster: PREDICTED: hypothetical protein,
           partial; n=1; Strongylocentrotus purpuratus|Rep:
           PREDICTED: hypothetical protein, partial -
           Strongylocentrotus purpuratus
          Length = 639

 Score =  222 bits (543), Expect = 2e-56
 Identities = 111/254 (43%), Positives = 158/254 (62%), Gaps = 15/254 (5%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           +R+ +   R GLIRAR++GA N++GD+L FLDSH EVNVGWL PLL  + +    V    
Sbjct: 356 VRIERLPTRSGLIRARIHGALNAIGDILTFLDSHCEVNVGWLEPLLAVIDKDRRNV---- 411

Query: 317 SARAVTPVIDVINADTFEYSPSPLVR--GGFNWGLHFKWDNLPKGTLINDE-DFMKPLKS 373
               VTP IDVI+ +   Y  S  +   G F W + F+W  +    L   + +   P++S
Sbjct: 412 ----VTPTIDVIDDNDLAYKGSDQLPQVGSFGWTMAFRWTAIQTMDLEEAKRNPTLPIRS 467

Query: 374 PTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRK 433
           PTMAGGLF+I + YF  +G YDPG  +WG EN+E+SF+ WMCGGSL  + CS VGH+FRK
Sbjct: 468 PTMAGGLFSIDKGYFMELGMYDPGFQIWGAENIELSFKTWMCGGSLYTMACSHVGHIFRK 527

Query: 434 RRPY-GVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQC 492
             PY G+G    Y  +N+ R+  VW+ D      +++P    ++ GDI ++  LR++L C
Sbjct: 528 FAPYSGMG---SYFHRNNKRLIEVWLGDARAFYYKLHPDVLRIDAGDIQDQINLRKKLDC 584

Query: 493 KTFKWYLDNMWFET 506
           K+F WYLDN++ E+
Sbjct: 585 KSFDWYLDNVFPES 598



 Score = 80.6 bits (190), Expect = 1e-13
 Identities = 40/98 (40%), Positives = 62/98 (63%)

Query: 112 DKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRS 171
           D  Y+ +AFN L S  I  +R LPD R + C+S  Y + LP  S+II F+NE +  L+R+
Sbjct: 251 DALYHKNAFNLLASDMIAFNRSLPDVRPQQCKSLVYPEVLPTTSVIIIFHNEAFSALLRT 310

Query: 172 VHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDK 209
           VHS+++R+ +  +KEIILVDD S   +L   + + + +
Sbjct: 311 VHSVINRSPRHLLKEIILVDDASTQEHLKVKLDDYISR 348


>UniRef50_Q6V2D0 Cluster: UDP-N-acetyl-D-galactosamine:polypeptide
           N- acetylgalactosaminyltransferase; n=1; Echinococcus
           granulosus|Rep: UDP-N-acetyl-D-galactosamine:polypeptide
           N- acetylgalactosaminyltransferase - Echinococcus
           granulosus
          Length = 659

 Score =  221 bits (540), Expect = 4e-56
 Identities = 114/274 (41%), Positives = 159/274 (58%), Gaps = 9/274 (3%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           +R+++  +R GLIRARL GA  +  DV++FLD+H E    WL PLL R+ Q  D V    
Sbjct: 217 VRIVRLPQRTGLIRARLEGAKAATADVIIFLDAHCEATYRWLEPLLYRIWQKPDAVVCPA 276

Query: 317 SARAVTPVIDVINADTFEYSPS---PLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKS 373
            A      + +   D   Y+      L  G F W   F +++ P+ +++  +  +  ++S
Sbjct: 277 IANIDRFTLKIFRTDV-RYTEDGWLSLRVGSFAWDGMFVFEHPPRSSVVKRQSNVDTIES 335

Query: 374 PTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRK 433
            TM GGLFAI+R+YF  +G YD GM +WGGENLE+SFRIW CGGSLE  PCS VGHV+R 
Sbjct: 336 LTMPGGLFAIHRDYFFKLGGYDDGMEIWGGENLELSFRIWQCGGSLEFSPCSTVGHVYRA 395

Query: 434 RRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCK 493
             PY    K+DY   N+ RMA VWMD Y +          +++ GD+S+RK LR  L C+
Sbjct: 396 IHPYSFPSKKDYNGYNTARMAEVWMDMYKENFYLARGDIKNMDYGDVSKRKKLRNDLGCR 455

Query: 494 TFKWYLDNM---WFETDRSELVLGRTLCLDASNN 524
            F+W+LDN+    F   R+ L  G   C +A N+
Sbjct: 456 NFQWFLDNIAPHKFVYSRNRLGYGS--CCNAENH 487



 Score = 60.5 bits (140), Expect = 1e-07
 Identities = 37/107 (34%), Positives = 60/107 (56%), Gaps = 6/107 (5%)

Query: 102 IRNSEDLRIRDK-GYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICF 160
           I + E  R+ D  GYN HA   +   R   HR       K C +  Y D+LP AS+I+ F
Sbjct: 107 ISDEEMKRVNDADGYNSHACKLVALDRSLGHRPA-----KECLAVIYPDKLPTASVILIF 161

Query: 161 YNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAV 207
           +NE +  ++R+V S+++R+    +KE+IL+DD S   +L  ++ + V
Sbjct: 162 FNEPFRLIIRTVFSVVNRSPPALLKEVILLDDGSTQSDLLDNLDKFV 208


>UniRef50_Q17NN8 Cluster: N-acetylgalactosaminyltransferase; n=4;
           Endopterygota|Rep: N-acetylgalactosaminyltransferase -
           Aedes aegypti (Yellowfever mosquito)
          Length = 613

 Score =  219 bits (534), Expect = 2e-55
 Identities = 108/247 (43%), Positives = 154/247 (62%), Gaps = 12/247 (4%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           ++L+    R GLI ARL GA  + GDVL+ LDSH EVNV WLPPL++ +++         
Sbjct: 215 VKLISLPVRSGLITARLTGAKAATGDVLIVLDSHTEVNVNWLPPLIEPIAEDY------- 267

Query: 317 SARAVTPVIDVINADTFEY-SPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPT 375
               V P IDVI  DTF+Y +     RG F+W   +K   L    ++   D  +P +SP 
Sbjct: 268 -RTCVCPFIDVIAHDTFQYRAQDEGKRGAFDWKFLYKRLPLRAQDMV---DPTEPFESPI 323

Query: 376 MAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRR 435
           MAGGLFAI  ++F  +G YD G+++WGGE  E+SF++W CGG +   PCSRVGHV+R   
Sbjct: 324 MAGGLFAISAKFFWELGGYDEGLDIWGGEQYELSFKVWQCGGRMVDAPCSRVGHVYRGYA 383

Query: 436 PYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCKTF 495
           P+      +++ +N  R+A VWMD+Y + + E NP     + GD++++KALRERLQCK F
Sbjct: 384 PFPNPRGTNFVTRNFKRVAEVWMDEYKQFLYERNPQFDQTDAGDLTKQKALRERLQCKPF 443

Query: 496 KWYLDNM 502
           KW+L+ +
Sbjct: 444 KWFLEEV 450



 Score = 63.7 bits (148), Expect = 1e-08
 Identities = 27/63 (42%), Positives = 45/63 (71%)

Query: 145 QQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQ 204
           ++Y  ELP  S+I+ FYNEH+ TL+R+V+S+++R+    +KEI+LV+D+S    L   +Q
Sbjct: 145 KRYLQELPTVSVIVIFYNEHWSTLLRTVYSVLNRSPSHLLKEIVLVNDHSTKEFLWEPLQ 204

Query: 205 EAV 207
           + V
Sbjct: 205 DFV 207


>UniRef50_O45947 Cluster: Putative polypeptide
           N-acetylgalactosaminyltransferase 10; n=4;
           Caenorhabditis|Rep: Putative polypeptide
           N-acetylgalactosaminyltransferase 10 - Caenorhabditis
           elegans
          Length = 622

 Score =  217 bits (530), Expect = 7e-55
 Identities = 106/261 (40%), Positives = 159/261 (60%), Gaps = 14/261 (5%)

Query: 251 KNNVFNI-RLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGV 309
           KN + +I ++L+T KREGLIR R  GA ++ G++L+FLD+H E N  WLPPLL  +++  
Sbjct: 214 KNKIDHIVKVLRTKKREGLIRGRQLGAQDATGEILIFLDAHSEANYNWLPPLLDPIAEDY 273

Query: 310 DGVKVRYSARAVTPVIDVINADTFEYSPSPL-VRGGFNWGLHFKWDNLPKGTLINDEDFM 368
             V        V P +DVI+ +T+E  P     RG F+W  ++K   LP  T  + E   
Sbjct: 274 RTV--------VCPFVDVIDCETYEVRPQDEGARGSFDWAFNYK--RLPL-TKKDRESPT 322

Query: 369 KPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVG 428
           KP  SP MAGG FAI  ++F  +G YD G+++WGGE  E+SF++W C G +   PCSRV 
Sbjct: 323 KPFNSPVMAGGYFAISAKWFWELGGYDEGLDIWGGEQYELSFKVWQCHGRMVDAPCSRVA 382

Query: 429 HVFR-KRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALR 487
           H++R K  P+      D++ +N  R+A VWMDDY + + +  P   + + GD+   K +R
Sbjct: 383 HIYRCKYAPFKNAGMGDFVSRNYKRVAEVWMDDYKETLYKHRPGVGNADAGDLKLMKGIR 442

Query: 488 ERLQCKTFKWYLDNMWFETDR 508
           E+LQCK+F W++  + F+ D+
Sbjct: 443 EKLQCKSFDWFMKEIAFDQDK 463



 Score = 79.8 bits (188), Expect = 2e-13
 Identities = 37/103 (35%), Positives = 67/103 (65%), Gaps = 2/103 (1%)

Query: 115 YNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHS 174
           Y  + +N  IS  I  +R + D R+K C++  Y  +LP  S+I  F+ EH  TL+RSV+S
Sbjct: 120 YKANGYNAYISDMISLNRSIKDIRHKECKNMMYSAKLPTVSVIFPFHEEHNSTLLRSVYS 179

Query: 175 IMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAV--DKLNNVIK 215
           +++R+  + +KEIILVDD+S+   L   +++ +  +K+++++K
Sbjct: 180 VINRSPPELLKEIILVDDFSEKPALRQPLEDFLKKNKIDHIVK 222



 Score = 46.4 bits (105), Expect = 0.002
 Identities = 30/75 (40%), Positives = 41/75 (54%), Gaps = 6/75 (8%)

Query: 514 GRTLCLDASNNV--AP-ILGKCHEMGGTQEWKHKGTASSPIYNTAAGMCLGVDRSYRGET 570
           GR +C D S +V  AP IL  CH M G Q +K++  A   IY+  +G CL  D + +G  
Sbjct: 528 GRKICFDCSTSVDKAPVILFDCHSMKGNQLFKYR-VAQKQIYHPISGQCLTADENGKG-F 585

Query: 571 VLMVICDDYSN-NKW 584
           + M  CD  S+  KW
Sbjct: 586 LHMKKCDSSSDLQKW 600


>UniRef50_Q9HCQ5 Cluster: Polypeptide
           N-acetylgalactosaminyltransferase 9; n=51;
           Euteleostomi|Rep: Polypeptide
           N-acetylgalactosaminyltransferase 9 - Homo sapiens
           (Human)
          Length = 603

 Score =  215 bits (525), Expect = 3e-54
 Identities = 106/252 (42%), Positives = 151/252 (59%), Gaps = 13/252 (5%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           +++++ S+REGLIRARL G   +   V+ F D+H+E N GW  P L R+ +         
Sbjct: 214 VKIVRNSRREGLIRARLQGWKAATAPVVGFFDAHVEFNTGWAEPALSRIRED-------- 265

Query: 317 SARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTM 376
             R V P ID I   TFE         G+NWGL   +  +P    ++  D   P+++P M
Sbjct: 266 RRRIVLPAIDNIKYSTFEVQQYANAAHGYNWGLRCMYI-IPPQDWLDRGDESAPIRTPAM 324

Query: 377 AGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRRP 436
            G  F + REYF  IG  DPGM V+GGEN+E+  R+W CGGS+E++PCSRV H+ R R+P
Sbjct: 325 IGCSFVVDREYFGDIGLLDPGMEVYGGENVELGMRVWQCGGSMEVLPCSRVAHIERTRKP 384

Query: 437 YGVGEKQDYMLQNSMRMARVWMDDYVKKVI---EVNPSAAHVEIGDISERKALRERLQCK 493
           Y   +   Y  +N++R A VWMDD+   V     +  S   V+ GD+SER ALR+RL+C+
Sbjct: 385 YN-NDIDYYAKRNALRAAEVWMDDFKSHVYMAWNIPMSNPGVDFGDVSERLALRQRLKCR 443

Query: 494 TFKWYLDNMWFE 505
           +FKWYL+N++ E
Sbjct: 444 SFKWYLENVYPE 455



 Score = 79.8 bits (188), Expect = 2e-13
 Identities = 35/95 (36%), Positives = 57/95 (60%)

Query: 115 YNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHS 174
           Y  + +N  +S RI   R +PD R + C+   Y  +LP+ S++  F NE    ++RSVHS
Sbjct: 114 YEEYGYNAQLSDRISLDRSIPDYRPRKCRQMSYAQDLPQVSVVFIFVNEALSVILRSVHS 173

Query: 175 IMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDK 209
           +++ T  + +KE+ILVDD SD   L  ++ + V+K
Sbjct: 174 VVNHTPSQLLKEVILVDDNSDNVELKFNLDQYVNK 208



 Score = 34.3 bits (75), Expect = 9.2
 Identities = 22/70 (31%), Positives = 29/70 (41%), Gaps = 1/70 (1%)

Query: 518 CL-DASNNVAPILGKCHEMGGTQEWKHKGTASSPIYNTAAGMCLGVDRSYRGETVLMVIC 576
           CL D      P L KC ++    +     T S PI + A G CL V+ S      L ++ 
Sbjct: 525 CLVDDGTGRMPTLKKCEDVARPTQRLWDFTQSGPIVSRATGRCLEVEMSKDANFGLRLVV 584

Query: 577 DDYSNNKWDI 586
              S  KW I
Sbjct: 585 QRCSGQKWMI 594


>UniRef50_Q5TWJ3 Cluster: ENSANGP00000028412; n=1; Anopheles gambiae
           str. PEST|Rep: ENSANGP00000028412 - Anopheles gambiae
           str. PEST
          Length = 523

 Score =  213 bits (521), Expect = 8e-54
 Identities = 119/332 (35%), Positives = 182/332 (54%), Gaps = 35/332 (10%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           +R+L+T KR GLI  R++GA  +  D L+FLD+H E   GWL PLL+ ++   +  KV  
Sbjct: 127 VRILRTPKRLGLITGRIFGAKRASADYLLFLDAHCECLAGWLEPLLELVASNQENRKV-- 184

Query: 317 SARAVTPVIDVINADTF--EYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSP 374
                 P ID +N  T   +   S  + G F+W L F+W           E+ ++P  +P
Sbjct: 185 ---VAVPTIDWLNETTLALQVGASSGLYGAFDWNLSFQWRPRYDRLQAPQENLLEPFDTP 241

Query: 375 TMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKR 434
            MAGGLF I + +F  +G YDPG+ V+GGEN+E+SF++WMCGG++  +PCS V H+ ++ 
Sbjct: 242 VMAGGLFCIEKAFFAQLGWYDPGLQVYGGENMELSFKVWMCGGAIRTVPCSHVAHIQKRN 301

Query: 435 RPY--GVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPS--------AAH-VEIGDISER 483
            PY     +++D  ++NS+R+A VWMD+Y + +  ++P          +H +   ++  R
Sbjct: 302 NPYIGSYTKERDLTMRNSLRVAEVWMDEYAEFLYRLHPDYRALLASRTSHSLSNVNLDAR 361

Query: 484 KALRERLQCKTFKWYLDNMWFETD-----------RSELVLGRTLCLD-ASNNVAPILGK 531
           + LR  L CK+F+WYL +++ E D           R E   G+ LCL     + +  L  
Sbjct: 362 RQLRSELGCKSFRWYLQHVFPEQDDPSEAQAAGWIRHENEAGQ-LCLTWPMRDRSLALLH 420

Query: 532 CHEMGGTQEWKHKGTASSPIYNTAAGMCLGVD 563
           CH +GG Q W H+ T          G CLGVD
Sbjct: 421 CHGLGGQQIWFHRKTGEI----AREGHCLGVD 448



 Score = 70.9 bits (166), Expect = 9e-11
 Identities = 32/86 (37%), Positives = 54/86 (62%), Gaps = 1/86 (1%)

Query: 114 GYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVH 173
           G+    +N  +S  I   R+LPD R+  C+ ++    LP  SI+I F++E    L+R+VH
Sbjct: 28  GWQRQGYNQFVSDLISVRRELPDVRDPWCRDRKR-SALPPVSIVIVFHDEALSVLLRTVH 86

Query: 174 SIMDRTDQKHIKEIILVDDYSDLYNL 199
           S+++RT  + ++EI+L+DD+S L  L
Sbjct: 87  SVLNRTPPELVQEILLIDDWSSLVQL 112


>UniRef50_Q6WV16 Cluster: N-acetylgalactosaminyltransferase 6; n=4;
           Diptera|Rep: N-acetylgalactosaminyltransferase 6 -
           Drosophila melanogaster (Fruit fly)
          Length = 666

 Score =  210 bits (513), Expect = 8e-53
 Identities = 107/252 (42%), Positives = 153/252 (60%), Gaps = 14/252 (5%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           +R+++  +R GLI AR  GA N+  +VL+FLDSH+E N  WLPPLL+ ++          
Sbjct: 264 VRVVRLPRRTGLIGARAAGARNATAEVLIFLDSHVEANYNWLPPLLEPIALN-------- 315

Query: 317 SARAVTPVIDVINADTFEY-SPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPT 375
              AV P IDVI+   F Y +     RG F+W   +K   L    L +  D   P KSP 
Sbjct: 316 KRTAVCPFIDVIDHTNFHYRAQDEGARGAFDWEFFYKRLPLLPEDLKHPAD---PFKSPI 372

Query: 376 MAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRR 435
           MAGGLFAI RE+F  +G YD G+++WGGE  E+SF+IWMCGG +   PCSR+GH++R  R
Sbjct: 373 MAGGLFAISREFFWELGGYDEGLDIWGGEQYELSFKIWMCGGEMYDAPCSRIGHIYRGPR 432

Query: 436 PYGVG-EKQDYMLQNSMRMARVWMDDYVKKVIEVNPSA-AHVEIGDISERKALRERLQCK 493
            +     K DY+ +N  R+A VWMD+Y   +          V+ GD++E+KA+R +L CK
Sbjct: 433 NHQPSPRKGDYLHKNYKRVAEVWMDEYKNYLYSHGDGLYESVDPGDLTEQKAIRTKLNCK 492

Query: 494 TFKWYLDNMWFE 505
           +FKW+++ + F+
Sbjct: 493 SFKWFMEEVAFD 504



 Score = 89.0 bits (211), Expect = 3e-16
 Identities = 43/93 (46%), Positives = 64/93 (68%), Gaps = 1/93 (1%)

Query: 104 NSEDLRIRDKGYNL-HAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYN 162
           + E  R  +K  +L + FN L+S  I  +R +PD R+ LC+ ++Y  +LP  S+II FYN
Sbjct: 153 DDESQRDLEKRMSLENGFNALLSDSISVNRSVPDIRHPLCRKKEYVAKLPTVSVIIIFYN 212

Query: 163 EHYETLMRSVHSIMDRTDQKHIKEIILVDDYSD 195
           E+   LMRSVHS+++R+  + +KEIILVDD+SD
Sbjct: 213 EYLSVLMRSVHSLINRSPPELMKEIILVDDHSD 245


>UniRef50_Q17M60 Cluster: N-acetylgalactosaminyltransferase; n=1;
           Aedes aegypti|Rep: N-acetylgalactosaminyltransferase -
           Aedes aegypti (Yellowfever mosquito)
          Length = 552

 Score =  209 bits (510), Expect = 2e-52
 Identities = 104/254 (40%), Positives = 153/254 (60%), Gaps = 12/254 (4%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           +R+L++ +R GLI+ARL GA N+  ++L FLD+H E   GWL P L R+++    V +  
Sbjct: 177 VRILRSPQRLGLIKARLMGARNATTEILTFLDAHCECTTGWLEPQLDRVARNPTTVAI-- 234

Query: 317 SARAVTPVIDVINADTFEY--SPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSP 374
                 P ID ++     +  + S +  G  +WGL F W        +  E+ ++P  +P
Sbjct: 235 ------PTIDWVDEHNLAFIANRSHIYYGACDWGLQFGWRGR-WDRKVKPENKLEPFPTP 287

Query: 375 TMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKR 434
            MAGGLF+I + +F  IG YD G+ ++GGEN+E+S + WMCGG LE IPCSRVGH+ +  
Sbjct: 288 IMAGGLFSINKTFFAHIGWYDEGLGIYGGENVELSLKAWMCGGRLETIPCSRVGHIQKAG 347

Query: 435 RPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEV-NPSAAHVEIGDISERKALRERLQCK 493
            PY  G K D++   S+R+A VWMD Y + V ++          GD+S+RK LRE L CK
Sbjct: 348 HPYLDGVKTDWVRVGSVRVAEVWMDQYAQVVYDMFGGPEFRGNFGDVSDRKKLRESLNCK 407

Query: 494 TFKWYLDNMWFETD 507
           +FKWYL+N + E +
Sbjct: 408 SFKWYLENAFPELE 421



 Score = 70.1 bits (164), Expect = 2e-10
 Identities = 33/97 (34%), Positives = 59/97 (60%), Gaps = 1/97 (1%)

Query: 110 IRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQ-QYFDELPRASIIICFYNEHYETL 168
           + ++ +   A N   S  I  HR LPD R+  C+ + +  + LP  +++I F+NE +  L
Sbjct: 73  VMERQFKTFALNEYASALISAHRRLPDYRDPWCKVKGRIMEHLPETTVVIVFFNEPWSVL 132

Query: 169 MRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQE 205
           +R+V+S++DR+  + IKE++LVDD S + +    +QE
Sbjct: 133 VRTVYSVLDRSPPELIKEVLLVDDCSFMPHTKTQLQE 169


>UniRef50_Q8N3T1 Cluster: Polypeptide
           N-acetylgalactosaminyltransferase-like protein 2; n=21;
           Amniota|Rep: Polypeptide
           N-acetylgalactosaminyltransferase-like protein 2 - Homo
           sapiens (Human)
          Length = 639

 Score =  207 bits (506), Expect = 6e-52
 Identities = 108/267 (40%), Positives = 164/267 (61%), Gaps = 13/267 (4%)

Query: 248 SEVKNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQ 307
           SE    +  ++LL+++KR G IRAR+ GA  + GDVLVF+D+H E + GWL PLL R++ 
Sbjct: 243 SEYVARLEGVKLLRSNKRLGAIRARMLGATRATGDVLVFMDAHCECHPGWLEPLLSRIAG 302

Query: 308 GVDGVKVRYSARAVTPVIDVINADTFEYSPSP-LVRGGFNWGLHFKWDNLPKGTLINDED 366
                     +R V+PVIDVI+  TF+Y PS  L RG  +W L F W+ LP+      + 
Sbjct: 303 D--------RSRVVSPVIDVIDWKTFQYYPSKDLQRGVLDWKLDFHWEPLPEHVRKALQS 354

Query: 367 FMKPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSR 426
            + P++SP + G + A+ R YF   G YD  M++ GGENLE+SF+ W+CGGS+E++PCSR
Sbjct: 355 PISPIRSPVVPGEVVAMDRHYFQNTGAYDSLMSLRGGENLELSFKAWLCGGSVEILPCSR 414

Query: 427 VGHVFRKRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSA---AHVEIGDISER 483
           VGH+++ +  +   + Q+  L+N +R+A  W+  + +   + +P A   +  E  D  ER
Sbjct: 415 VGHIYQNQDSHSPLD-QEATLRNRVRIAETWLGSFKETFYKHSPEAFSLSKAEKPDCMER 473

Query: 484 KALRERLQCKTFKWYLDNMWFETDRSE 510
             L+ RL C+TF W+L N++ E   SE
Sbjct: 474 LQLQRRLGCRTFHWFLANVYPELYPSE 500



 Score = 81.0 bits (191), Expect = 8e-14
 Identities = 41/90 (45%), Positives = 56/90 (62%)

Query: 124 ISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKH 183
           +S RI   R LP+ R+ LC  Q   D LP AS+I+CF++E + TL+R+VHSI+D   +  
Sbjct: 163 LSARIPLQRALPEVRHPLCLQQHPQDSLPTASVILCFHDEAWSTLLRTVHSILDTVPRAF 222

Query: 184 IKEIILVDDYSDLYNLHHDVQEAVDKLNNV 213
           +KEIILVDD S    L   + E V +L  V
Sbjct: 223 LKEIILVDDLSQQGQLKSALSEYVARLEGV 252


>UniRef50_Q7TT15-2 Cluster: Isoform 2 of Q7TT15 ; n=9; Mammalia|Rep:
           Isoform 2 of Q7TT15 - Mus musculus (Mouse)
          Length = 596

 Score =  206 bits (504), Expect = 1e-51
 Identities = 113/294 (38%), Positives = 168/294 (57%), Gaps = 24/294 (8%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           +++++  KREGLIRAR+ G   + G V  F D+H+E   GW  P+L R+ +         
Sbjct: 213 VKVVRNQKREGLIRARIEGWKAATGQVTGFFDAHVEFTAGWAEPVLSRIQEN-------- 264

Query: 317 SARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTM 376
             R + P ID I  D FE         G++W L   + + PK    +  D   P+++P M
Sbjct: 265 RKRVILPSIDNIKQDNFEVQRYENSAHGYSWELWCMYISPPKDWW-DAGDPSLPIRTPAM 323

Query: 377 AGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRRP 436
            G  F + R++F  IG  DPGM+V+GGEN+E+  ++W+CGGS+E++PCSRV H+ RK++P
Sbjct: 324 IGCSFVVNRKFFGEIGLLDPGMDVYGGENIELGIKVWLCGGSMEVLPCSRVAHIERKKKP 383

Query: 437 YGVGEKQDYMLQNSMRMARVWMDDYVKKV-IEVNPSAAH--VEIGDISERKALRERLQCK 493
           Y       Y  +N++R+A VWMDDY   V I  N    +  ++IGD+SERKALR+ L+CK
Sbjct: 384 YN-SNIGFYTKRNALRVAEVWMDDYKSHVYIAWNLPLENPGIDIGDVSERKALRKSLKCK 442

Query: 494 TFKWYLDNMWFETDR--SELVLG-------RTLCLDAS--NNVAPILGKCHEMG 536
            F+WYLD+++ E  R  + +  G       + +CLD     N   IL  CH  G
Sbjct: 443 NFQWYLDHVYPEMRRYNNTIAYGELRNNKAKDVCLDQGPLENHTAILYPCHGWG 496



 Score = 68.5 bits (160), Expect = 5e-10
 Identities = 32/75 (42%), Positives = 47/75 (62%)

Query: 118 HAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMD 177
           + +N+ +S++I   R +PD R   C+  +Y  ELP+ SII  F NE    ++RSVHS ++
Sbjct: 118 YGYNSYLSEKISLDRSIPDYRPTKCKELKYSKELPQISIIFIFVNEALSVILRSVHSAVN 177

Query: 178 RTDQKHIKEIILVDD 192
            T    +KEIILVDD
Sbjct: 178 HTPTHLLKEIILVDD 192


>UniRef50_UPI0000E4710F Cluster: PREDICTED: similar to
           pp-GalNAc-transferase 17; n=2; Strongylocentrotus
           purpuratus|Rep: PREDICTED: similar to
           pp-GalNAc-transferase 17 - Strongylocentrotus purpuratus
          Length = 315

 Score =  205 bits (501), Expect = 2e-51
 Identities = 105/253 (41%), Positives = 148/253 (58%), Gaps = 11/253 (4%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           ++L + SKREGLIR+R++GA+ S G VL FLD+H E +  WL PLL  ++     V    
Sbjct: 38  VKLYRNSKREGLIRSRIFGAEQSRGQVLTFLDAHCECSPNWLVPLLTEIALNRTTV---- 93

Query: 317 SARAVTPVIDVINADTFEYSPSP--LVRGGFNWGLHFKWDNLPKGTLINDEDFM-KPLKS 373
               V P +D I+AD FEY      L RG  +W   +K   +          +  +P  S
Sbjct: 94  ----VCPTVDSISADNFEYRSQGDGLCRGAMDWDFWYKRIPVDLSRQRLGLKYQSEPYDS 149

Query: 374 PTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRK 433
           P MAGGLFA+ RE+F  +G YDPG+ +WGGEN EISF+ WMCGGSL+ +PCSRVGHV+RK
Sbjct: 150 PMMAGGLFALDREFFFELGGYDPGLQIWGGENFEISFKAWMCGGSLKFVPCSRVGHVYRK 209

Query: 434 RRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCK 493
             PY   +     + N MR+A VW+D++ +      P       GDI E+   R+    K
Sbjct: 210 GVPYTYPDSGVPGVSNYMRVAEVWLDEFKEFFYTSRPDLRGKPYGDIGEQIRFRKHHCPK 269

Query: 494 TFKWYLDNMWFET 506
           +FKW+++ + F++
Sbjct: 270 SFKWFMEEVAFDS 282


>UniRef50_UPI0000E46551 Cluster: PREDICTED: hypothetical protein,
           partial; n=1; Strongylocentrotus purpuratus|Rep:
           PREDICTED: hypothetical protein, partial -
           Strongylocentrotus purpuratus
          Length = 325

 Score =  204 bits (497), Expect = 7e-51
 Identities = 88/144 (61%), Positives = 110/144 (76%)

Query: 373 SPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFR 432
           SPTMAGGLFA+ REYF+ +G YD GM++WGGENLEISFRIW CGG LE++PCSRVGHVFR
Sbjct: 1   SPTMAGGLFAMNREYFHKLGDYDEGMDIWGGENLEISFRIWQCGGKLEIVPCSRVGHVFR 60

Query: 433 KRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQC 492
           KRRPYG   +QD   +N++R+A VWMD+Y +   +V P A +++ GDIS R ALRE L+C
Sbjct: 61  KRRPYGSPNRQDTTTKNAVRVAEVWMDEYKEHFYQVQPKAKNIDYGDISSRVALREELKC 120

Query: 493 KTFKWYLDNMWFETDRSELVLGRT 516
           K+FKWYLD ++ E        GRT
Sbjct: 121 KSFKWYLDTVYPEMRTPNDTKGRT 144



 Score = 52.8 bits (121), Expect = 2e-05
 Identities = 29/86 (33%), Positives = 44/86 (51%), Gaps = 6/86 (6%)

Query: 503 WFETDRSELVLGRTLCLDAS-NNVA--PILGKCHEMGGTQEWKHKGTASSPIYNTAAGMC 559
           W++T   EL LG  +C+D S +N A  P L KC  MGG+Q W+ K      IY+  +G C
Sbjct: 208 WYQTSVEELKLGDAICMDMSESNSASLPQLRKCDGMGGSQRWRIK---DKNIYHPVSGQC 264

Query: 560 LGVDRSYRGETVLMVICDDYSNNKWD 585
           L + +    +   + IC      +W+
Sbjct: 265 LSIKQLGSIQMAQLDICSSDPMQEWE 290


>UniRef50_Q6P9A2 Cluster: Putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 4; n=28;
           Euteleostomi|Rep: Putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 4 - Homo
           sapiens (Human)
          Length = 607

 Score =  202 bits (494), Expect = 2e-50
 Identities = 110/293 (37%), Positives = 170/293 (58%), Gaps = 25/293 (8%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           I++++ SK+EGLIR+R+ G   +   V+   D+H+E NVGW  P+L R+ +         
Sbjct: 220 IKVVRHSKQEGLIRSRVSGWRAATAPVVALFDAHVEFNVGWAEPVLTRIKEN-------- 271

Query: 317 SARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTM 376
             R ++P  D I  D FE    PL   GF+W L  ++ N PK      E+   P++SP +
Sbjct: 272 RKRIISPSFDNIKYDNFEIEEYPLAAQGFDWELWCRYLNPPKAWW-KLENSTAPIRSPAL 330

Query: 377 AGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRRP 436
            G  F + R+YF  IG  D GM V+GGEN+E+  R+W CGGS+E++PCSR+ H+ R  +P
Sbjct: 331 IG-CFIVDRQYFQEIGLLDEGMEVYGGENVELGIRVWQCGGSVEVLPCSRIAHIERAHKP 389

Query: 437 YGVGEKQDYMLQNSMRMARVWMDDYVKKVI---EVNPSAAHVEIGDISERKALRERLQCK 493
           Y   +   ++ +N++R+A VWMD++   V     +    + ++IGDI+ RKALR++LQCK
Sbjct: 390 Y-TEDLTAHVRRNALRVAEVWMDEFKSHVYMAWNIPQEDSGIDIGDITARKALRKQLQCK 448

Query: 494 TFKWYLDNMWFETDR-SELV--------LGRTLCLDASNNV--APILGKCHEM 535
           TF+WYL +++ E    S+++        L   LCLD   +    PI+  CH M
Sbjct: 449 TFRWYLVSVYPEMRMYSDIIAYGVLQNSLKTDLCLDQGPDTENVPIMYICHGM 501



 Score = 90.2 bits (214), Expect = 1e-16
 Identities = 50/141 (35%), Positives = 75/141 (53%)

Query: 72  KVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDH 131
           K  ++E  AK +  +    T++ L   +G   + E  R+  K +  + +N  +S R+   
Sbjct: 74  KQHIQEAPAKPEEAEAEPFTDSSLFAHWGQELSPEGRRVALKQFQYYGYNAYLSDRLPLD 133

Query: 132 RDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVD 191
           R LPD R   C++  + D LP  SI+  F NE    L+RS+HS M+RT    +KEIILVD
Sbjct: 134 RPLPDLRPSGCRNLSFPDSLPEVSIVFIFVNEALSVLLRSIHSAMERTPPHLLKEIILVD 193

Query: 192 DYSDLYNLHHDVQEAVDKLNN 212
           D S    L   + E VDK+N+
Sbjct: 194 DNSSNEELKEKLTEYVDKVNS 214


>UniRef50_Q16ZA7 Cluster: N-acetylgalactosaminyltransferase; n=7;
           Culicidae|Rep: N-acetylgalactosaminyltransferase - Aedes
           aegypti (Yellowfever mosquito)
          Length = 648

 Score =  202 bits (493), Expect = 2e-50
 Identities = 125/347 (36%), Positives = 179/347 (51%), Gaps = 35/347 (10%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           +R+L+ + R GLIRA++ GA N+   ++ FLD+H+E  VGWL PLL ++++    + +  
Sbjct: 265 VRILRAASRLGLIRAKMLGAWNTTAQIITFLDAHVECEVGWLEPLLNQVARNPTAIAI-- 322

Query: 317 SARAVTPVIDVINADTFEYSP--SPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSP 374
                 P +D I+ DT    P  S L+ G F+W  +F+W           +  M+P  SP
Sbjct: 323 ------PSMDWIDGDTMTLDPQVSQLIYGKFDWMGNFQWGLRRDRRQPQAKHPMEPFDSP 376

Query: 375 TMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKR 434
            M GGLFAI R  F  +G YD     +G E+LE+SF+ WMCGGS++++PCSRV HV +  
Sbjct: 377 VMPGGLFAINRTLFAHLGWYDEQFETYGAEHLELSFKTWMCGGSMQIVPCSRVAHVQKPN 436

Query: 435 RPY--GVGEKQDYMLQNSMRMARVWMDDYVKKVIEV-NPSAAHVEIGDISERKALRERLQ 491
            PY       +D + +N +RMA VWMD+Y     E         + GD+S RK LR+ L 
Sbjct: 437 HPYITKTSGSEDVIKRNLVRMAEVWMDEYALYYYETFGGPDKRGDFGDVSSRKQLRQHLN 496

Query: 492 CKTFKWYLDNMWFET-DRSELV---------LGRTLCLD--ASNNVAPILGKCHEMGGTQ 539
           CK+F+WYL+N++ E  D S  V          G   CLD   + N   +   CH  G  Q
Sbjct: 497 CKSFRWYLENVFPEQFDPSRAVGRGEFRNGENGTDRCLDWPLARNQCGVT-SCHGRGRHQ 555

Query: 540 EWKHKGTASSPIYNTAAGMCLGVDRSYRGETVLMVICDDYSNNK-WD 585
            W    T    I  T    CL     Y G+T+ M  C     N+ W+
Sbjct: 556 MWYF--TREGEI--TRKDHCL----DYDGKTLEMNRCHQMGGNQLWE 594



 Score = 83.4 bits (197), Expect = 2e-14
 Identities = 40/89 (44%), Positives = 58/89 (65%), Gaps = 1/89 (1%)

Query: 109 RIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLC-QSQQYFDELPRASIIICFYNEHYET 167
           ++ ++GYN   FN ++S  I   R LPD R+  C Q  +Y   LP  SI+I FYNE +  
Sbjct: 160 KLVEEGYNDQGFNQVLSDLISVRRRLPDYRDSWCKQPGRYLKNLPDTSIVIVFYNEAWSV 219

Query: 168 LMRSVHSIMDRTDQKHIKEIILVDDYSDL 196
           L+R+VHSI+DR+    ++EI+LVDD+S L
Sbjct: 220 LVRTVHSILDRSPPNLVREIVLVDDFSFL 248


>UniRef50_UPI000069E576 Cluster: Polypeptide
           N-acetylgalactosaminyltransferase 5 (EC 2.4.1.41)
           (Protein-UDP acetylgalactosaminyltransferase 5) (UDP-
           GalNAc:polypeptide N-acetylgalactosaminyltransferase 5)
           (Polypeptide GalNAc transferase 5) (GalNAc-T5)
           (pp-GaNTase 5).; n=1; Xenopus tropicalis|Rep:
           Polypeptide N-acetylgalactosaminyltransferase 5 (EC
           2.4.1.41) (Protein-UDP acetylgalactosaminyltransferase
           5) (UDP- GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 5) (Polypeptide GalNAc
           transferase 5) (GalNAc-T5) (pp-GaNTase 5). - Xenopus
           tropicalis
          Length = 307

 Score =  201 bits (491), Expect = 4e-50
 Identities = 113/254 (44%), Positives = 159/254 (62%), Gaps = 38/254 (14%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           +R+L   +R GLIRAR+ GA+ + G+VL FLDSH+E NVGWL PLL++         VR 
Sbjct: 72  VRVLHLPERHGLIRARIAGANIATGEVLTFLDSHVECNVGWLEPLLEQ---------VRI 122

Query: 317 SARAVT-PVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPT 375
           + R V  PVI+VI+A   + S S  +                + T  ++  F    + P 
Sbjct: 123 NRRKVACPVIEVISA--LDLSVSRFIS---------------QATYFHNVCFC--FRCPV 163

Query: 376 MAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRR 435
           MAGGLF+I + YF  +G YDPG++VWGGEN+EISF+IWMCGG +E+IPCSRVGH+FR   
Sbjct: 164 MAGGLFSIEKNYFYELGTYDPGLDVWGGENMEISFKIWMCGGEIEIIPCSRVGHIFRNDN 223

Query: 436 PYGVGEKQDYMLQ-NSMRMARVWMDDYVKKVIEVNPSAAHV-----EIGDISERKALRER 489
           PY   + +   ++ N +R+A VW+DDY K++   +    H+      IGD++E+K LRER
Sbjct: 224 PYSFPKDRIKTVERNLVRVAEVWLDDY-KEIFYGH--GQHLLKYLPNIGDLTEQKQLRER 280

Query: 490 LQCKTFKWYLDNMW 503
           LQCK F WY+ N++
Sbjct: 281 LQCKNFNWYIKNVF 294



 Score = 68.9 bits (161), Expect = 3e-10
 Identities = 35/72 (48%), Positives = 47/72 (65%)

Query: 142 CQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHH 201
           C +Q   D+LP  SIIICF +E + TLMRSV+S+++R+ +  IKEIILVDD+S    L  
Sbjct: 1   CSNQLIHDDLPTTSIIICFIDEVWSTLMRSVYSVLNRSPEHLIKEIILVDDFSTRDYLKE 60

Query: 202 DVQEAVDKLNNV 213
            +   V KL  V
Sbjct: 61  KLDTYVKKLPKV 72


>UniRef50_UPI000069E1C8 Cluster: Polypeptide
           N-acetylgalactosaminyltransferase-like protein 2 (EC
           2.4.1.41) (Protein-UDP
           acetylgalactosaminyltransferase-like protein 2)
           (UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase- like protein 2)
           (Polypeptide GalNAc transferase-like protein; n=1;
           Xenopus tropicalis|Rep: Polypeptide
           N-acetylgalactosaminyltransferase-like protein 2 (EC
           2.4.1.41) (Protein-UDP
           acetylgalactosaminyltransferase-like protein 2)
           (UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase- like protein 2)
           (Polypeptide GalNAc transferase-like protein - Xenopus
           tropicalis
          Length = 611

 Score =  201 bits (491), Expect = 4e-50
 Identities = 104/270 (38%), Positives = 162/270 (60%), Gaps = 14/270 (5%)

Query: 248 SEVKNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQ 307
           SE  + +  ++L++++KR G+I  R+ GA  + G+VL+F+DSH E + GWL PLL R+  
Sbjct: 221 SEYISRIGGVKLIRSNKRLGVIGGRMLGAARATGEVLIFMDSHCECHPGWLEPLLSRIMH 280

Query: 308 GVDGVKVRYSARAVTPVIDVINADTFEYSPSPLVRGG-FNWGLHFKWDNLPKGTLINDED 366
             +        R V+PVID I+  TFEYS S L++ G F+W L F W  LP+      + 
Sbjct: 281 NRN--------RIVSPVIDFIDWKTFEYSHSSLLQQGVFDWKLDFHWVPLPEHEEKVRQS 332

Query: 367 FMKPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSR 426
            + P +SP + G + A  R YF  IG +D G+N WG E  E+S R+W+CGGS+E++PCSR
Sbjct: 333 PIIPFRSPVIPGYVLASDRHYFQNIGGFDTGINSWGVETTELSIRVWLCGGSVEIVPCSR 392

Query: 427 VGHVFRKRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAA---HVEIGDISER 483
           VGH ++    +    + + +L++ +R A +WMD Y K +   N   +    ++  DI+E 
Sbjct: 393 VGHAYQNHTMHN-SVQNEAVLRSKVRTAELWMDSY-KAIFYRNVGNSLLNRIQESDINEH 450

Query: 484 KALRERLQCKTFKWYLDNMWFETDRSELVL 513
           + LR+RL CK F+W+L N++ E + S   L
Sbjct: 451 EQLRQRLGCKRFQWFLANVYPEINMSTSTL 480



 Score = 89.0 bits (211), Expect = 3e-16
 Identities = 41/98 (41%), Positives = 64/98 (65%)

Query: 116 NLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSI 175
           N + F+  +S+ I  HR +PD R+  C  Q Y ++LP AS+IICF+NE + TL+R+VHS+
Sbjct: 133 NTNGFDEEVSKNIPLHRIIPDGRHPECLQQNYGEKLPIASVIICFHNEGWSTLLRTVHSV 192

Query: 176 MDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNV 213
           +D + +  +KEIILVDD S   +L   + E + ++  V
Sbjct: 193 LDNSPRTFLKEIILVDDLSHQEHLKSALSEYISRIGGV 230


>UniRef50_Q9NY28 Cluster: Probable polypeptide
           N-acetylgalactosaminyltransferase 8; n=9; Theria|Rep:
           Probable polypeptide N-acetylgalactosaminyltransferase 8
           - Homo sapiens (Human)
          Length = 637

 Score =  200 bits (489), Expect = 6e-50
 Identities = 109/301 (36%), Positives = 171/301 (56%), Gaps = 25/301 (8%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           +++++  +R+GL +AR  G + +  DV+  LD+HIEVNVGW  P+L R+ +  D   +  
Sbjct: 247 LKIIRHPERKGLAQARNTGWEAATADVVAILDAHIEVNVGWAEPILARIQE--DRTVI-- 302

Query: 317 SARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTM 376
               V+PV D I  DTF+     L   GFNW L  ++D LP+   I+  D   P+KSP++
Sbjct: 303 ----VSPVFDNIRFDTFKLDKYELAVDGFNWELWCRYDALPQAW-IDLHDVTAPVKSPSI 357

Query: 377 AGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRRP 436
            G + A  R +   IG  D GM ++GGEN+E+S R+W CGG +E++PCSR+ H+ R  +P
Sbjct: 358 MG-ILAANRHFLGEIGSLDGGMLIYGGENVELSLRVWQCGGKVEILPCSRIAHLERHHKP 416

Query: 437 YGVGEKQDYMLQNSMRMARVWMDDYVKKVI---EVNPSAAHVEIGDISERKALRERLQCK 493
           Y + +    + +N++R+A +WMD++   V     +    + ++ GD+S R ALRE+L+CK
Sbjct: 417 YAL-DLTAALKRNALRVAEIWMDEHKHMVYLAWNIPLQNSGIDFGDVSSRMALREKLKCK 475

Query: 494 TFKWYLDNMW-----FET----DRSELVLGRTLCLDAS--NNVAPILGKCHEMGGTQEWK 542
           TF WYL N++       T     R + +L   +CLD        PI+  CHE      + 
Sbjct: 476 TFDWYLKNVYPLLKPLHTIVGYGRMKNLLDENVCLDQGPVPGNTPIMYYCHEFSSQNVYY 535

Query: 543 H 543
           H
Sbjct: 536 H 536



 Score = 70.9 bits (166), Expect = 9e-11
 Identities = 43/162 (26%), Positives = 79/162 (48%), Gaps = 2/162 (1%)

Query: 50  EETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLR 109
           +E ++ T+ + +D  R      +  + E   K+   +M     + L  Q+G   +    +
Sbjct: 81  QENVNSTLKRAKDEVRPLLKAMETKVNE--TKKHKTQMKLFPHSQLFRQWGEDLSEAQQK 138

Query: 110 IRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLM 169
                +    +N  +S ++  +R +PDTR+  C  + Y  +LP  S+I+ F NE    + 
Sbjct: 139 AAQDLFRKFGYNAYLSNQLPLNRTIPDTRDYRCLRKTYPSQLPSLSVILIFVNEALSIIQ 198

Query: 170 RSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLN 211
           R++ SI++RT  + +KEIILVDD+S    L   + E +   N
Sbjct: 199 RAITSIINRTPSRLLKEIILVDDFSSNGELKVHLDEKIKLYN 240


>UniRef50_Q8IA42 Cluster: N-acetylgalactosaminyltransferase 4; n=2;
           Sophophora|Rep: N-acetylgalactosaminyltransferase 4 -
           Drosophila melanogaster (Fruit fly)
          Length = 659

 Score =  197 bits (480), Expect = 8e-49
 Identities = 99/248 (39%), Positives = 155/248 (62%), Gaps = 17/248 (6%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           + +++  +R+GLI AR+ GA  +VG V+VF DSHIEVN  WLPPL++ ++     +  + 
Sbjct: 256 VTIVRNPERQGLIGARIAGAKVAVGQVMVFFDSHIEVNYNWLPPLIEPIA-----INPKI 310

Query: 317 SARAVTPVIDVINADTFEYSPSPL--VRGGFNWGLHFKW-DNLPKGTLINDEDFMKPLKS 373
           S     P++D I+ + F Y        RGGF+W + +K    LP+  L    D   P +S
Sbjct: 311 ST---CPMVDTISHEDFSYFSGNKDGARGGFDWKMLYKQLPVLPEDAL----DKSMPYRS 363

Query: 374 PTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFR- 432
           P M GGLFAI  ++F  +G YD  +++WGGE  E+SF+IWMCGG L  +PCSRV H+FR 
Sbjct: 364 PVMMGGLFAINTDFFWDLGGYDDQLDIWGGEQYELSFKIWMCGGMLLDVPCSRVAHIFRG 423

Query: 433 KRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSA-AHVEIGDISERKALRERLQ 491
             +P G     +++ +N  R+A VWMD+Y + V + +P    +++ GD++ ++ +RERL+
Sbjct: 424 PMKPRGNPRGHNFVAKNHKRVAEVWMDEYKQYVYKRDPKTYDNLDAGDLTRQRGVRERLK 483

Query: 492 CKTFKWYL 499
           CK+F W++
Sbjct: 484 CKSFHWFM 491



 Score = 92.7 bits (220), Expect = 2e-17
 Identities = 38/94 (40%), Positives = 67/94 (71%)

Query: 102 IRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFY 161
           I N ++ ++  + Y ++ FN LIS RI  +R +PD R + C++++Y  +LP  S+I  F+
Sbjct: 143 IENPDEKQLEKEHYEMNGFNGLISDRISVNRSVPDLRLEACKTRKYLAKLPNISVIFIFF 202

Query: 162 NEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSD 195
           NEH+ TL+RS++S+++RT  + +K+I+LVDD S+
Sbjct: 203 NEHFNTLLRSIYSVINRTPPELLKQIVLVDDGSE 236


>UniRef50_Q86SF2 Cluster: N-acetylgalactosaminyltransferase 7; n=31;
           Euteleostomi|Rep: N-acetylgalactosaminyltransferase 7 -
           Homo sapiens (Human)
          Length = 657

 Score =  196 bits (477), Expect = 2e-48
 Identities = 113/309 (36%), Positives = 165/309 (53%), Gaps = 34/309 (11%)

Query: 257 IRLLKTSKREGLIRARLYGADNS-VGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVR 315
           +++ +  +REGLI+AR  GA  + +G VL++LD+H EV V W  PL+  +S+        
Sbjct: 269 VKVFRNERREGLIQARSIGAQKAKLGQVLIYLDAHCEVAVNWYAPLVAPISKD------- 321

Query: 316 YSARAVTPVIDVINADTFEYSPSP------LVRGGFNWGLHFKWDNLPKGTLINDEDFMK 369
                  P+IDVIN +T+E  P          RG ++W + +K   L        +   +
Sbjct: 322 -RTICTVPLIDVINGNTYEIIPQGGGDEDGYARGAWDWSMLWKRVPLTPQEKRLRKTKTE 380

Query: 370 PLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGH 429
           P +SP MAGGLFAI RE+F  +G YDPG+ +WGGEN EIS++IW CGG L  +PCSRVGH
Sbjct: 381 PYRSPAMAGGLFAIEREFFFELGLYDPGLQIWGGENFEISYKIWQCGGKLLFVPCSRVGH 440

Query: 430 VFR----KRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKA 485
           ++R    +  P  +       L+N +R+  VW D+Y        P +  +  GDISE K 
Sbjct: 441 IYRLEGWQGNPPPIYVGSSPTLKNYVRVVEVWWDEYKDYFYASRPESQALPYGDISELKK 500

Query: 486 LRERLQCKTFKWYLDNMWFE-----------TDRSELVLGRT-LCLDA---SNNVAPILG 530
            RE   CK+FKW+++ + ++            D  E+    T  C+D+   +N     LG
Sbjct: 501 FREDHNCKSFKWFMEEIAYDITSHYPLPPKNVDWGEIRGFETAYCIDSMGKTNGGFVELG 560

Query: 531 KCHEMGGTQ 539
            CH MGG Q
Sbjct: 561 PCHRMGGNQ 569



 Score = 77.4 bits (182), Expect = 1e-12
 Identities = 35/96 (36%), Positives = 60/96 (62%)

Query: 120 FNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRT 179
           FN + S  I   R + D R + C+   Y + L  +S++I F+NE + TLMR+VHS++ RT
Sbjct: 175 FNMVASDMISLDRSVNDLRQEECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRT 234

Query: 180 DQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIK 215
            +K++ EI+L+DD+S+  +L   + E +   N ++K
Sbjct: 235 PRKYLAEIVLIDDFSNKEHLKEKLDEYIKLWNGLVK 270


>UniRef50_Q9VUT6 Cluster: Polypeptide
           N-acetylgalactosaminyltransferase 8; n=1; Drosophila
           melanogaster|Rep: Polypeptide
           N-acetylgalactosaminyltransferase 8 - Drosophila
           melanogaster (Fruit fly)
          Length = 590

 Score =  194 bits (474), Expect = 4e-48
 Identities = 102/273 (37%), Positives = 156/273 (57%), Gaps = 27/273 (9%)

Query: 244 STENSEVKNNVFNIRLL------KTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGW 297
           ST+  E  N+   I+ L      + + + GL+ AR+ GA+ ++ DVLVFLDSH+EV  GW
Sbjct: 170 STQADEKLNDFIKIKFLNMVQHRRITTQVGLMHARVVGAELALADVLVFLDSHVEVTKGW 229

Query: 298 LPPLLKRLSQGVDGVKVRYSARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLP 357
           L PL+  + +         +    TP+ID I+ D F Y      RG FNW   F +  LP
Sbjct: 230 LEPLIAPILED--------NRTCTTPIIDTIDFDNFAYRRGKPSRGFFNW--EFNYIQLP 279

Query: 358 KGTLINDEDFMKPL--KSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMC 415
              L+ +E    P   K+P M GGLFAI RE+F+ +G YD G+ +WG E  E+S ++W+C
Sbjct: 280 ---LLKEEAVAMPAPHKNPIMNGGLFAIGREWFSELGGYDKGLKIWGAEQFELSLKLWLC 336

Query: 416 GGSLELIPCSRVGHVFR------KRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVN 469
           GG +  +PCSRVGH+FR      +        ++  + +N  R+A +W+D+Y  K+    
Sbjct: 337 GGQILEVPCSRVGHLFRDGNFQIRYTNKDKNSEKKLISRNYRRVAEIWLDEYKDKLFANM 396

Query: 470 PSAAHVEIGDISERKALRERLQCKTFKWYLDNM 502
           P    + +G+++E++ L+ RL CK FKW+LDN+
Sbjct: 397 PHLTVIPVGNLAEQRDLKNRLHCKPFKWFLDNL 429



 Score = 62.5 bits (145), Expect = 3e-08
 Identities = 37/111 (33%), Positives = 61/111 (54%), Gaps = 10/111 (9%)

Query: 106 EDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQY-FDELPRASIIICFYNEH 164
           E L    K      +N  +S+RI   R L D R++ C+  +Y  ++LP  S++I ++NE 
Sbjct: 81  EQLEAIAKSQRETGYNAWLSKRISPERSLYDMRHRSCKKLKYPMEKLPSVSVVITYHNEE 140

Query: 165 YETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIK 215
              L+R++ S+  RT  + ++E+ILVDD S          +A +KLN+ IK
Sbjct: 141 ASVLLRTLSSLRSRTPIQLLREVILVDDGS---------TQADEKLNDFIK 182


>UniRef50_A0NGH9 Cluster: ENSANGP00000031751; n=1; Anopheles gambiae
           str. PEST|Rep: ENSANGP00000031751 - Anopheles gambiae
           str. PEST
          Length = 499

 Score =  194 bits (473), Expect = 5e-48
 Identities = 96/252 (38%), Positives = 149/252 (59%), Gaps = 13/252 (5%)

Query: 258 RLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRYS 317
           R+++  KR GLIRAR+ G  ++  D++ FLD+H+EV VGWL  L++ + +    + +   
Sbjct: 126 RIVRAPKRLGLIRARMLGGKSTKTDLITFLDAHVEVTVGWLEALIQPVVESWTTIAI--- 182

Query: 318 ARAVTPVIDVINADTFEY--SPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPT 375
                P ID I+ +  +Y    +P   G ++W L+F W         N    M+P  +P 
Sbjct: 183 -----PTIDWIDENNMKYRDDKAPTFVGAYDWDLNFGWWGRWSQKKQNANK-MEPFDTPA 236

Query: 376 MAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRR 435
           MAGGLFAI R +F  IG YD G +++G EN+E+S + WMCGG +  +PCSRVGH+ +   
Sbjct: 237 MAGGLFAINRTFFERIGWYDDGFDIYGIENIELSVKSWMCGGKMVTVPCSRVGHIQKTGH 296

Query: 436 PYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVN--PSAAHVEIGDISERKALRERLQCK 493
           PY   + +D +  NS+R+A VWMD+Y + + ++   P     E G ++ RKA+RE  +CK
Sbjct: 297 PYLYKQPKDVVRANSIRLAEVWMDEYKRIIFDIYGIPHYLEEEFGSVATRKAIRESAKCK 356

Query: 494 TFKWYLDNMWFE 505
            F +YL+N + E
Sbjct: 357 PFSYYLENAFPE 368



 Score = 68.1 bits (159), Expect = 6e-10
 Identities = 32/88 (36%), Positives = 51/88 (57%), Gaps = 1/88 (1%)

Query: 110 IRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQ-QYFDELPRASIIICFYNEHYETL 168
           +  +G     FN   S  +   R LP+ R+  C    ++  +LP  SI+I F+NE +  +
Sbjct: 19  LTQQGIQTQGFNQYFSDLMSVRRRLPEIRDPWCAKPGRFLADLPATSIVIVFFNEAWSVV 78

Query: 169 MRSVHSIMDRTDQKHIKEIILVDDYSDL 196
           +R+VHS++DR+    +KEI+LVDD S L
Sbjct: 79  LRTVHSVLDRSPAHLVKEIVLVDDCSTL 106



 Score = 35.5 bits (78), Expect = 4.0
 Identities = 25/88 (28%), Positives = 39/88 (44%), Gaps = 5/88 (5%)

Query: 500 DNMWFETDRSELVLGRTLCLDASNNVAPILGKCHEMGGTQEWKHKGTASSPIYNTAAGMC 559
           D  W      EL   R  C+DA   V  +  +CH   G Q WK     S  I + A  +C
Sbjct: 413 DQYWTHNYYQELNSYRN-CIDAVGTVVEVY-QCHRSRGNQAWKVL-VESQQILSVARNLC 469

Query: 560 LGVDRSYRGETVLMVICD-DYSNNKWDI 586
           L ++   +  T+L+  CD    + +W++
Sbjct: 470 LALNLQTK-TTLLLEKCDATKPSQQWNV 496


>UniRef50_UPI000065D57A Cluster: Putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 3 (EC
           2.4.1.41) (Protein-UDP
           acetylgalactosaminyltransferase-like protein 3)
           (UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase- like protein 3)
           (Polypeptide GalNAc transferase-lik; n=1; Takifugu
           rubripes|Rep: Putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 3 (EC
           2.4.1.41) (Protein-UDP
           acetylgalactosaminyltransferase-like protein 3)
           (UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase- like protein 3)
           (Polypeptide GalNAc transferase-lik - Takifugu rubripes
          Length = 605

 Score =  190 bits (463), Expect = 9e-47
 Identities = 103/258 (39%), Positives = 153/258 (59%), Gaps = 16/258 (6%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           +++++  KREGLIRAR+ G   +  +V  F D+H+E    W  P+L R       +K  Y
Sbjct: 192 VKIVRNQKREGLIRARIEGWKVASAEVTGFFDAHVEFTPSWAEPVLAR-------IKEDY 244

Query: 317 SARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLK---S 373
             R + P ID I  DTFE         G+NW L   + N PK    ++ D   P++   +
Sbjct: 245 K-RIILPSIDNIKHDTFEVERYENSGHGYNWELWCMYINPPKQWW-DEGDASAPIRHDPT 302

Query: 374 PTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRK 433
           P M G  F   R+YF  +G  D GM+V+GGEN+E+  R+W+CGGS+E++PCSRV H+ R 
Sbjct: 303 PAMIGCSFVANRDYFGELGLLDSGMDVYGGENIELGIRVWLCGGSMEVLPCSRVAHIARV 362

Query: 434 RRPYGVGEKQDYMLQNSMRMARVWMDDYVKKV-IEVN-PSAAH-VEIGDISERKALRERL 490
           ++PY       +  +N++R+A VWMD+Y   V +  N P   H ++ GDIS+R ALR+ L
Sbjct: 363 KKPYH-SNIAYHTRRNALRVAEVWMDEYRSNVYLAWNIPMENHGIDYGDISQRVALRKSL 421

Query: 491 QCKTFKWYLDNMWFETDR 508
           QCK+F+WYL+N++ E  R
Sbjct: 422 QCKSFEWYLENVYPEMRR 439



 Score = 64.5 bits (150), Expect = 8e-09
 Identities = 32/78 (41%), Positives = 46/78 (58%)

Query: 118 HAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMD 177
           + +N  +S RI   R +PD R   C+   Y  +LP+ S+I  F NE    ++RSVHS ++
Sbjct: 16  YGYNAYLSDRISLDRTIPDHRPGKCRKVGYPRDLPQISLIFIFVNEALSVILRSVHSAVN 75

Query: 178 RTDQKHIKEIILVDDYSD 195
            T    +KEIILV+D SD
Sbjct: 76  HTPAHLLKEIILVNDNSD 93


>UniRef50_Q8MYY6 Cluster: Putative polypeptide
           N-acetylgalactosaminyltransferase 13; n=1; Drosophila
           melanogaster|Rep: Putative polypeptide
           N-acetylgalactosaminyltransferase 13 - Drosophila
           melanogaster (Fruit fly)
          Length = 558

 Score =  189 bits (460), Expect = 2e-46
 Identities = 107/324 (33%), Positives = 172/324 (53%), Gaps = 32/324 (9%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           +  L+  +R GLI +R  GA  + G  ++FLDSH EVN GWL PLL+RL+   +      
Sbjct: 178 LTFLRNQERMGLIWSRNRGASLASGRYVLFLDSHCEVNEGWLEPLLERLALNTN------ 231

Query: 317 SARAVTPVIDVINADTFEYSP-SPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPT 375
              AV+P++D I+  T  Y   + L++GGF+W LHF W    K  L N E    P +SP 
Sbjct: 232 --LAVSPLLDPIDPTTLSYRKGNELLKGGFDWSLHFHW---LKRQLTNQESLEMPYQSPA 286

Query: 376 MAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRR 435
            AGG+  + RE+F  +G ++P + +WGGE++E++ ++W+CGG +E++PCSR+GH+FR+R 
Sbjct: 287 FAGGVLMMSREWFLKLGSFNPYLKIWGGESIELAIKLWLCGGQIEIVPCSRIGHIFRRRH 346

Query: 436 PYG--------VGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIG-DISERKAL 486
            +         +   Q+  L NS  +A  W+D+Y      + P+A  + +     E + +
Sbjct: 347 AFDFPPQSDRQLSPAQETYLHNSKIIAESWLDEYKNMFYALRPAARRIPLDHTYDELQRM 406

Query: 487 RERLQCKTFKWYLDN------MWF-ETDRSELVLGRTLCLDA-SNNVAPILGKCHEMGGT 538
           R+  +C  F+WYL +      M F E   +  +     C+ A   +  PIL  C+ +   
Sbjct: 407 RKERRCHPFEWYLRHVSPELRMHFDELSATGTLRNEDRCVHARQKDSQPILASCY-LSDI 465

Query: 539 QEWKHKGTASSPIYNTAAGMCLGV 562
            +W       S   +T   +CL V
Sbjct: 466 TQWSM--LRQSGQLSTHRELCLAV 487



 Score = 50.4 bits (115), Expect = 1e-04
 Identities = 26/82 (31%), Positives = 47/82 (57%), Gaps = 3/82 (3%)

Query: 116 NLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELP---RASIIICFYNEHYETLMRSV 172
           + + +N  +S  +G  R LP TR+  C ++      P     S++I F+NE    L+R++
Sbjct: 71  DFYQYNIHLSNALGLIRKLPVTRHHSCTTRNSILPAPLEANVSVVISFHNEARSMLLRTI 130

Query: 173 HSIMDRTDQKHIKEIILVDDYS 194
            S++ R+ + ++ E+ILVDD S
Sbjct: 131 VSLLSRSPEDYLHELILVDDGS 152


>UniRef50_Q5CKF0 Cluster: UDP-N-acetyl-D-galactosamine:polypeptide
           N- acetylgalactosaminyltransferase T3; n=4;
           Eimeriorina|Rep:
           UDP-N-acetyl-D-galactosamine:polypeptide N-
           acetylgalactosaminyltransferase T3 - Cryptosporidium
           hominis
          Length = 732

 Score =  187 bits (455), Expect = 8e-46
 Identities = 101/255 (39%), Positives = 148/255 (58%), Gaps = 14/255 (5%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           +R++  ++R+G++ ARL G   +   V+V LDSHIE +  WL P L RL +    V    
Sbjct: 345 VRVVHLTERKGIVGARLSGVRAASAPVIVILDSHIETSRQWLEPQLLRLKESPKSV---- 400

Query: 317 SARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTM 376
               V P ID I+   F +S       G    L FK+  L + TL    +   P+KSP M
Sbjct: 401 ----VMPQIDSIDPVNFAFSNF----SGIGCRLGFKYSILEQATLTGPINDTTPIKSPMM 452

Query: 377 AGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRRP 436
           AGGLFA+ R+YF  +G YD     WG EN+EISFRIWMCGG +E  PCSRV H+FRK + 
Sbjct: 453 AGGLFAMKRDYFWHLGGYDEKFRHWGAENVEISFRIWMCGGQIECTPCSRVFHIFRK-KG 511

Query: 437 YGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCKTFK 496
            G     + +  N +R ARVWMD++  ++ E+     ++++G   +   L+++L+CK F+
Sbjct: 512 VGYSSPPESLWHNRLRTARVWMDEFY-QITEMLAPNPNIKLGSFDDMLHLKKKLKCKPFR 570

Query: 497 WYLDNMWFETDRSEL 511
           W+LDN+  ET  ++L
Sbjct: 571 WFLDNVAPETYITQL 585



 Score = 63.3 bits (147), Expect = 2e-08
 Identities = 37/97 (38%), Positives = 56/97 (57%), Gaps = 6/97 (6%)

Query: 104 NSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQY-FDELPRASIIICFYN 162
           N ED   +  G+N    + L   R+       + R+ +C++  Y   +L  ASIII FYN
Sbjct: 238 NEEDFLAKGGGFNRQLSDFLSLDRVP-----LEVRDPICRNMIYPIKDLDDASIIITFYN 292

Query: 163 EHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNL 199
           E   TL+RSVHS+++ T    ++EIILV+D SD+ +L
Sbjct: 293 EPLSTLLRSVHSVLNNTPPPLLREIILVNDGSDMIDL 329


>UniRef50_Q8K1B9 Cluster: Putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 4; n=9;
           Euteleostomi|Rep: Putative polypeptide
           N-acetylgalactosaminyltransferase-like protein 4 - Mus
           musculus (Mouse)
          Length = 622

 Score =  186 bits (454), Expect = 1e-45
 Identities = 110/308 (35%), Positives = 170/308 (55%), Gaps = 40/308 (12%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           I++++ SK+EGLIR+R+ G   +   V+   D+H+E NVGW  P+L R+ +         
Sbjct: 220 IKVVRHSKQEGLIRSRVSGWRAATAPVVALFDAHVEFNVGWAEPVLTRIKEN-------- 271

Query: 317 SARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTM 376
             R ++P  D I  D FE    PL   GF+W L  ++ N PK      E+   P++SP +
Sbjct: 272 RKRIISPSFDNIKYDNFEIEEYPLAAQGFDWELWCRYLNPPKAWW-KLENSTAPIRSPAL 330

Query: 377 AGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRI---------------WMCGGSLEL 421
            G  F + R+YF  IG  D GM V+GGEN+E+  R+               W CGGS+E+
Sbjct: 331 IG-CFIVDRQYFEEIGLLDEGMEVYGGENVELGIRVSEISHTGLSSAPMMVWQCGGSVEV 389

Query: 422 IPCSRVGHVFRKRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVI---EVNPSAAHVEIG 478
           +PCSR+ H+ R  +PY   +   ++ +N++R+A VWMD++   V     +    + ++IG
Sbjct: 390 LPCSRIAHIERAHKPY-TEDLTAHVRRNALRVAEVWMDEFKSHVYMAWNIPQEDSGIDIG 448

Query: 479 DISERKALRERLQCKTFKWYLDNMWFETDR-SELV--------LGRTLCLDASNNV--AP 527
           DI+ RKALR++LQCKTF+WYL +++ E    S+++        L   LCLD   +    P
Sbjct: 449 DITARKALRKQLQCKTFRWYLVSVYPEMRMYSDIIAYGVLQNSLKTDLCLDQGPDTENVP 508

Query: 528 ILGKCHEM 535
           I+  CH M
Sbjct: 509 IVYICHGM 516



 Score = 90.2 bits (214), Expect = 1e-16
 Identities = 50/140 (35%), Positives = 74/140 (52%)

Query: 72  KVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDH 131
           K  ++E  AK +  +    T++ L   +G   + E  R+  K +  + +N  +S R+   
Sbjct: 74  KQHIQEAPAKPEEAEAEPFTDSSLFAHWGQELSPEGRRVALKQFQYYGYNAYLSDRLPLD 133

Query: 132 RDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVD 191
           R LPD R   C++  + D LP  SI+  F NE    L+RS+HS M+RT    +KEIILVD
Sbjct: 134 RPLPDLRPSGCRNLSFPDSLPEVSIVFIFVNEALSVLLRSIHSAMERTPSHLLKEIILVD 193

Query: 192 DYSDLYNLHHDVQEAVDKLN 211
           D S    L   + E VDK+N
Sbjct: 194 DNSSNEELKEKLTEYVDKVN 213


>UniRef50_Q8MM26 Cluster: UDP-N-acetyl-D-galactosamine:polypeptide
           N- acetylgalactosaminyltransferase T1; n=1; Toxoplasma
           gondii|Rep: UDP-N-acetyl-D-galactosamine:polypeptide N-
           acetylgalactosaminyltransferase T1 - Toxoplasma gondii
          Length = 751

 Score =  180 bits (437), Expect = 1e-43
 Identities = 139/465 (29%), Positives = 215/465 (46%), Gaps = 47/465 (10%)

Query: 106 EDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQY-FDELPRASIIICFYNEH 164
           E  R+  KGY    FNT +S  +   R +P+  +  C+ Q+  FD +   S    +  + 
Sbjct: 196 EQKRLAHKGY---CFNTKVSDSLSLDRSVPEFASNYCRDQRLLFDNMTPPSAEQKW--KQ 250

Query: 165 YETLMRSVHSIMDRTDQKHIKEII-------LVDDYSDLYNLHHDVQEAVDKLNNVIKKE 217
              L ++    +  TD K    ++       L D    +   + ++   +  +++V+ + 
Sbjct: 251 TRELSKAQSPQLSATDGKASSAVVPRATDGSLPDTSVVIVFYNENLSVLLRSIHSVLNRT 310

Query: 218 EEMIETNNIDMEXXXXXXXXXXXXKKSTENSEVKNNVFNIRLLKTSKREGLIRARLYGAD 277
              +    I ++            K+  +   +   +   RLL+  +R GL+ AR  GA 
Sbjct: 311 PPSLLKEIIVVDDFSDRQTHPWLGKQLEDY--ISGTLPKTRLLRLLQRRGLMGARAAGAA 368

Query: 278 NSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRYSARAVTPVIDVINADTFEYSP 337
            +  + + FLDSHIE    WL PLL  + Q           R   P+I  I+AD F    
Sbjct: 369 AASAETVTFLDSHIECLPYWLQPLLFHVKQDW--------RRIAMPLIPTIDADNFRIKD 420

Query: 338 SPLVRGGFNWGL-HFKWDNLPKGTL--INDEDFMK----PLKSPTMAGGLFAIYREYFNA 390
             L    F WG+ H+   +  +  +  +  ++  K    P  SP MAGGLF I + +++ 
Sbjct: 421 GGLKTLAFTWGMSHYHIHDKIRHRIEELGQDEAAKNPDAPTMSPIMAGGLFTITKAWWDT 480

Query: 391 IGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRRPYGVGEKQD----YM 446
           +G YD  M ++GGE  EISF+ WMCGGSL L+PCSRVGHVFR    +  G+        +
Sbjct: 481 LGGYDKEMQIYGGEEFEISFKTWMCGGSLHLVPCSRVGHVFRSNE-FWQGQVYTVPGALI 539

Query: 447 LQNSMRMARVWMDDYVKKVIEVNPSAAHVE-IGDISERKALRERLQCKTFKWYLDNMWFE 505
            +N +R A VWM +Y + V  V P     + +GD++E KALR+RL+CK F WYL N++ E
Sbjct: 540 HRNKLRTAHVWMGEYARIVELVIPRLPQDKPLGDLTELKALRDRLKCKDFNWYLKNIYPE 599

Query: 506 TDRSELVLGRT---------LCLDASNNVAPILG--KCHEMGGTQ 539
            +   L    T          CLD        +G   CH   GTQ
Sbjct: 600 LEPPNLAHAMTGAMRNPKFNCCLDTLTTKNQEIGVYPCHFEHGTQ 644


>UniRef50_UPI0000E45D84 Cluster: PREDICTED: hypothetical protein;
           n=3; Strongylocentrotus purpuratus|Rep: PREDICTED:
           hypothetical protein - Strongylocentrotus purpuratus
          Length = 662

 Score =  177 bits (431), Expect = 7e-43
 Identities = 86/179 (48%), Positives = 118/179 (65%), Gaps = 11/179 (6%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           ++L++ S+REGLIR R+ GA +S GDVL++LD+H EV V WLPPLL  ++          
Sbjct: 468 LKLVRNSRREGLIRTRIIGARHSTGDVLLWLDAHCEVGVNWLPPLLTPIAVN-------- 519

Query: 317 SARAVTPVIDVINADTFEYSPSPLV---RGGFNWGLHFKWDNLPKGTLINDEDFMKPLKS 373
              AV P+IDVI+   +   P       RGGF+W L++K   +P+      +   +P +S
Sbjct: 520 RTTAVCPIIDVIDNMDYRVYPQGTGDQDRGGFDWSLYWKHLPVPQFEKSRRQHASEPYRS 579

Query: 374 PTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFR 432
           P MAGGLFA+ R+YF  +G YD G+ +WGGEN E+SF+IWMCGGSL  +PCSRVGHV+R
Sbjct: 580 PAMAGGLFAMDRKYFFELGAYDEGLEIWGGENFELSFKIWMCGGSLLWVPCSRVGHVYR 638



 Score = 75.4 bits (177), Expect = 4e-12
 Identities = 37/105 (35%), Positives = 65/105 (61%), Gaps = 1/105 (0%)

Query: 105 SEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEH 164
           SE  ++ D+    + FN  +S +I   R++ D R++ C+   Y + LP  S+II F+NE 
Sbjct: 358 SEKAKV-DRLIQEYGFNQYVSDQISLDRNIADLRSQQCKHWHYPETLPTTSVIIVFHNEG 416

Query: 165 YETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDK 209
           + TL+R+VHS+ +R+  + + EIILVDD+S   +L   +++ V +
Sbjct: 417 WSTLLRTVHSVFNRSPSQLLHEIILVDDFSTKEHLKERLEDYVQE 461


>UniRef50_Q5CY08 Cluster: Extracellular protein with a signal
           peptide followed by family 2 glycosyltransferase and
           ricin domains; n=3; Cryptosporidium|Rep: Extracellular
           protein with a signal peptide followed by family 2
           glycosyltransferase and ricin domains - Cryptosporidium
           parvum Iowa II
          Length = 637

 Score =  177 bits (431), Expect = 7e-43
 Identities = 101/257 (39%), Positives = 139/257 (54%), Gaps = 29/257 (11%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           +RL++ +KR G++ ARL G +     + V LDSHIEV   W  P++KR+ +         
Sbjct: 239 VRLIRNAKRSGIVGARLAGINACKSPIFVILDSHIEVQPVWAEPIVKRIQED-------- 290

Query: 317 SARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNL-----------PKGTLINDE 365
             R V P ID I+++TFE+     V GG    L F W  +           P+     + 
Sbjct: 291 PRRIVMPQIDSIDSETFEF-----VNGGIGCTLGFLWKLIEHAFPQQISPDPRRRYAKNY 345

Query: 366 DFMKPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCS 425
           D++    SPTMAGGL A    +F  IG YDP    WG ENLE+SFR+WMCGG +E  PCS
Sbjct: 346 DYVS---SPTMAGGLLAANVAFFKQIGSYDPQFEYWGTENLELSFRVWMCGGFIECAPCS 402

Query: 426 RVGHVFRKRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKA 485
           RV HVFRK    G       +L+N +R   +WMD++      V      V+ G + ER  
Sbjct: 403 RVFHVFRK-GGVGYSSPSHAVLKNKLRTLYLWMDEFGDLAWRV-MGRPRVDTGPLDERIK 460

Query: 486 LRERLQCKTFKWYLDNM 502
           LRERL+C +FKW+L+N+
Sbjct: 461 LRERLRCNSFKWFLENV 477



 Score = 62.9 bits (146), Expect = 2e-08
 Identities = 33/77 (42%), Positives = 48/77 (62%), Gaps = 1/77 (1%)

Query: 120 FNTLISQRIGDHRDLPDTRNKLCQSQQY-FDELPRASIIICFYNEHYETLMRSVHSIMDR 178
           FN  +S  +   R++ D R+  C+   Y   ++   S+II FYNE + TLMRSVHS+++R
Sbjct: 141 FNLNLSDSLPLDRNVSDYRDLQCKLISYDISKMDTISVIIVFYNEPFSTLMRSVHSVLNR 200

Query: 179 TDQKHIKEIILVDDYSD 195
           T    + EIILVDD S+
Sbjct: 201 TPPSLLDEIILVDDGSN 217


>UniRef50_UPI0000D9AA48 Cluster: PREDICTED: similar to
           UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 5; n=2; Macaca
           mulatta|Rep: PREDICTED: similar to
           UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 5 - Macaca
           mulatta
          Length = 442

 Score =  172 bits (419), Expect = 2e-41
 Identities = 102/253 (40%), Positives = 139/253 (54%), Gaps = 42/253 (16%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           I++++  KREGLIRARL GA ++ GDVLVFLDSH EVN  WL PLL  +++    V    
Sbjct: 228 IKIIRNKKREGLIRARLIGASHASGDVLVFLDSHCEVNRVWLEPLLHAIAKDPKMV---- 283

Query: 317 SARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTM 376
               V P+IDVI+  T EY PSP+VRG F+W L FKWDN+    +   E   KP+     
Sbjct: 284 ----VCPLIDVIDDRTLEYKPSPVVRGAFDWNLQFKWDNVFSYEMDGPEGPTKPI----- 334

Query: 377 AGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRRP 436
                           + D GM +W            MCGG L +IPCSRVGH+  K++ 
Sbjct: 335 ----------------RVDCGMRIW------------MCGGQLFIIPCSRVGHI-SKKQT 365

Query: 437 YGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCKTFK 496
                     + N +R+  VW+D+Y ++     P   +V  G+I ER  LR+RL CK+F+
Sbjct: 366 RKTSAIISATIHNYLRLVHVWLDEYKEQFFLRKPGLKYVTYGNIHERVQLRKRLGCKSFQ 425

Query: 497 WYLDNMWFETDRS 509
           WYLDN++ E + S
Sbjct: 426 WYLDNVFPELEAS 438



 Score = 92.3 bits (219), Expect = 3e-17
 Identities = 47/128 (36%), Positives = 76/128 (59%), Gaps = 2/128 (1%)

Query: 89  KKTEND-LEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQY 147
           K+T+ D  E   G+  N  +  + ++    + FN +IS+ +G  R++PDTRNK+C  + Y
Sbjct: 103 KRTDEDKAESTLGMDFNHTNPELHNELLK-YGFNVIISRSLGIEREVPDTRNKMCLQKHY 161

Query: 148 FDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAV 207
              LP ASI+ICF+NE +  L R+V S+M+ T    ++EIILVDD S++ +L   +   +
Sbjct: 162 PARLPTASIVICFHNEEFHALFRTVSSVMNLTPHYFLEEIILVDDMSEVDDLKEKLDYHL 221

Query: 208 DKLNNVIK 215
           +     IK
Sbjct: 222 ETFRGKIK 229


>UniRef50_UPI000065D031 Cluster: Probable polypeptide
           N-acetylgalactosaminyltransferase 8 (EC 2.4.1.41)
           (Protein-UDP acetylgalactosaminyltransferase 8) (UDP-
           GalNAc:polypeptide N-acetylgalactosaminyltransferase 8)
           (Polypeptide GalNAc transferase 8) (GalNAc-T8)
           (pp-GaNTase 8).; n=1; Takifugu rubripes|Rep: Probable
           polypeptide N-acetylgalactosaminyltransferase 8 (EC
           2.4.1.41) (Protein-UDP acetylgalactosaminyltransferase
           8) (UDP- GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 8) (Polypeptide GalNAc
           transferase 8) (GalNAc-T8) (pp-GaNTase 8). - Takifugu
           rubripes
          Length = 565

 Score =  171 bits (417), Expect = 3e-41
 Identities = 101/276 (36%), Positives = 159/276 (57%), Gaps = 30/276 (10%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           +R ++ +++ GL +ARL G   +VGDV+  LD+HIEV+V W  PLL R+ +  D   +  
Sbjct: 129 VRKVRHAEQLGLTQARLSGWKAAVGDVVAILDAHIEVHVQWAEPLLARIKE--DRTVI-- 184

Query: 317 SARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNL-PKGTLINDEDFMKPLKSPT 375
               +TPV D +  D             F+W L   +++  P+   + D+    P KSP+
Sbjct: 185 ----LTPVFDNVKYDDLTVLHYQPAADAFDWALWCMYESFRPEWYDLKDDSL--PGKSPS 238

Query: 376 MAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRR 435
           + G + A  R++F  IG  D GM ++GGEN+E+  R W CGGS+E+IPCS++ H+ R  +
Sbjct: 239 IMGIVVA-ERKFFGEIGSLDGGMKIYGGENVELGIRAWSCGGSIEVIPCSKIAHIERAMK 297

Query: 436 PYGVGEKQDYMLQNSMRMARVWMDDYVKKV-----IEVNPSA------AH------VEIG 478
           PY + +    M +N++R+A VWMD+Y   V     + +  SA      AH      ++IG
Sbjct: 298 PY-LPDLSVTMKRNALRVAEVWMDEYKSNVNVAWNLPLVASASKMWLSAHFRANHGIDIG 356

Query: 479 DISERKALRERLQCKTFKWYLDNMWFETDRSELVLG 514
           D+SERK LR+RL CK F WYL+N++ + D  + ++G
Sbjct: 357 DVSERKKLRKRLNCKPFSWYLENIYPQLDPLDNLVG 392



 Score = 72.1 bits (169), Expect = 4e-11
 Identities = 30/108 (27%), Positives = 65/108 (60%)

Query: 104 NSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNE 163
           + E+ +  ++ +  + +N  +S R+  +R++PDTR   C  ++Y +ELP  S+++ + +E
Sbjct: 15  SEEEQKEAERLFQQYGYNAFLSDRLPLNREIPDTRPTRCAEKKYPEELPNISVVLIYLDE 74

Query: 164 HYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLN 211
               + R++ S++D+T  + + EIILVDD+S   +L   + E +  ++
Sbjct: 75  ALSVIKRAIRSLIDKTPARLLTEIILVDDHSSNEDLGKKLDEYIGSIH 122


>UniRef50_UPI0000586DC6 Cluster: PREDICTED: similar to polypeptide
           N-acetylgalactosaminyltransferase 10; n=1;
           Strongylocentrotus purpuratus|Rep: PREDICTED: similar to
           polypeptide N-acetylgalactosaminyltransferase 10 -
           Strongylocentrotus purpuratus
          Length = 376

 Score =  171 bits (415), Expect = 6e-41
 Identities = 80/184 (43%), Positives = 114/184 (61%), Gaps = 3/184 (1%)

Query: 319 RAVTPVIDVINADTFEYSPSP--LVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTM 376
           R V P+IDVI+ + F Y      ++RG F+W L++K   + +           P ++P M
Sbjct: 29  RIVCPMIDVISNEDFHYESQAGDVMRGAFDWELYYKRIPISEAENKRRSHESDPFRTPIM 88

Query: 377 AGGLFAIYREYF-NAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRR 435
           AGGLFA+ R+YF   +G YD G+ +WGGE  ++SF++WMCGG +E IPCSRVGH++RK  
Sbjct: 89  AGGLFAVDRKYFMEELGGYDEGLEIWGGEQYDLSFKVWMCGGEMEEIPCSRVGHIYRKFM 148

Query: 436 PYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCKTF 495
            Y V      + +N +R+  VWMD++ K   E  P     + GDIS++ ALRERLQCK F
Sbjct: 149 SYTVPGGAGVINKNLLRVVEVWMDEWGKYFYERRPYLKGQDYGDISKQLALRERLQCKNF 208

Query: 496 KWYL 499
            W+L
Sbjct: 209 TWFL 212


>UniRef50_Q4RKI0 Cluster: Chromosome 21 SCAF15029, whole genome
           shotgun sequence; n=2; Tetraodontidae|Rep: Chromosome 21
           SCAF15029, whole genome shotgun sequence - Tetraodon
           nigroviridis (Green puffer)
          Length = 531

 Score =  169 bits (412), Expect = 1e-40
 Identities = 92/240 (38%), Positives = 140/240 (58%), Gaps = 23/240 (9%)

Query: 248 SEVKNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQ 307
           SE  +++  +RLL++++R G+   R  GA  + G++LVF+DSH E   GWL PLL+R++Q
Sbjct: 231 SEYLSHLSRVRLLRSARRLGVAGCRALGASKAEGELLVFMDSHCECQKGWLEPLLERVAQ 290

Query: 308 GVDGVKVRYSARAVTPVIDVINADTFEYSPSPL-VRGGFNWGLHFKWDN---LPK---GT 360
                      R V+P+ID I+  TF Y+ +   VRG FNW L F+W++   LP    G+
Sbjct: 291 D--------RTRVVSPIIDAIDWRTFRYNATQWPVRGVFNWRLDFRWESHTLLPDKDPGS 342

Query: 361 LINDEDFMKPL------KSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWM 414
            +      + L      +SP + G +FAI R +F  +G +DPGM +WG E +E+S R+W 
Sbjct: 343 AVRALRLCRRLTETARFRSPVLGGEVFAIDRHFFQHVGGFDPGMLLWGEEQIELSIRVWS 402

Query: 415 CGGSLELIPCSRVGHVFRKRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAH 474
           CGGS+E+ PCSRV H+     PY   + QD +  N +R+A +WM  Y +K+     + AH
Sbjct: 403 CGGSMEVAPCSRVAHLDHHSLPYTFPD-QDLLENNKIRIAEIWMGAY-RKIFYRRDTLAH 460



 Score = 91.1 bits (216), Expect = 8e-17
 Identities = 39/96 (40%), Positives = 63/96 (65%)

Query: 118 HAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMD 177
           + FN  +S+ I  HR LP+ R+  C  QQY + LP AS++ICF+NE + TL+R+VHS++ 
Sbjct: 145 YGFNEAVSEGISVHRRLPEARHPRCLQQQYSESLPSASVVICFHNEAWSTLLRTVHSVLS 204

Query: 178 RTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNV 213
              ++H++E++LVDD S   +L   + E +  L+ V
Sbjct: 205 TAPRRHLRELLLVDDLSQHGHLKGVLSEYLSHLSRV 240


>UniRef50_Q8IA43 Cluster: Putative polypeptide
           N-acetylgalactosaminyltransferase 10; n=1; Drosophila
           melanogaster|Rep: Putative polypeptide
           N-acetylgalactosaminyltransferase 10 - Drosophila
           melanogaster (Fruit fly)
          Length = 630

 Score =  167 bits (405), Expect = 1e-39
 Identities = 94/247 (38%), Positives = 138/247 (55%), Gaps = 17/247 (6%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           I +L+  +R G I+AR+     S   VLVFLDSHIEVN  WLPPLL+          +  
Sbjct: 230 IHILRLPERRGSIKARMEAIRVSSCQVLVFLDSHIEVNTNWLPPLLE---------PIVI 280

Query: 317 SARAVT-PVIDVINADTFEYSP-SPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSP 374
           +   VT P++D I+  TF Y+  + + R GFNW L    ++LP        D   P ++P
Sbjct: 281 NPHIVTRPILDAISRKTFAYAKQNTMTRSGFNWWLES--ESLPIFPEDKSPD-STPYRTP 337

Query: 375 TMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHV-FRK 433
            ++G + AI R YF  +G +D  ++ W  E  EISF++WMCGG +  +PC+RVGH+  R 
Sbjct: 338 VLSGAM-AIDRNYFLNLGGFDEQLDTWEAEKFEISFKVWMCGGMMLYVPCARVGHIGKRP 396

Query: 434 RRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHV-EIGDISERKALRERLQC 492
            +        +++ +N  R+A VWMD+Y K V + NP    +   G + +RK  R  L+C
Sbjct: 397 MKSISSPGYHNFLARNYKRVAEVWMDNYKKYVYDKNPKLYKMANAGLLFQRKTKRNALEC 456

Query: 493 KTFKWYL 499
           KTF WY+
Sbjct: 457 KTFDWYM 463



 Score = 76.6 bits (180), Expect = 2e-12
 Identities = 36/88 (40%), Positives = 58/88 (65%)

Query: 120 FNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRT 179
           F   +S RI  +R LPDTR   C+ ++Y + LP  ++II F++EH   L+RS+ SI++R+
Sbjct: 135 FYAELSDRIPLNRSLPDTRPISCRKRKYLENLPNVTVIIAFHDEHLSVLLRSITSIINRS 194

Query: 180 DQKHIKEIILVDDYSDLYNLHHDVQEAV 207
             + +K+I+LVDD S+L  L   ++E V
Sbjct: 195 PVELLKQIVLVDDDSNLPELGQQLEEIV 222


>UniRef50_Q6TBR4 Cluster: UDP-N-acetyl-D-galactosamine:polypeptide
           N- acetylgalactosaminyltransferase T4; n=1; Toxoplasma
           gondii|Rep: UDP-N-acetyl-D-galactosamine:polypeptide N-
           acetylgalactosaminyltransferase T4 - Toxoplasma gondii
          Length = 329

 Score =  164 bits (398), Expect = 7e-39
 Identities = 98/252 (38%), Positives = 132/252 (52%), Gaps = 21/252 (8%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           +RLL+   R+G+  AR  G   +   V V LDSH+EV   WL PL+ R++   + +    
Sbjct: 98  VRLLRHQTRKGVTVARSTGIRAAKSHVFVILDSHVEVGYQWLEPLVARVASNPETI---- 153

Query: 317 SARAVTPVIDVINADTFEYSPSPLVRGGFNWG-LHFKWDNLPKGTLINDEDFMKPLKSPT 375
               V PV+D ++  T E+  S +   G  W  +   +  L    L       +P  SPT
Sbjct: 154 ----VFPVVDAVDYRTLEFKSSGV---GLIWSVMEHGFVPLSPERLAYSPGAYRP--SPT 204

Query: 376 MAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRR 435
           M G +FA  + YF   G YD GM   G EN+E+S R W CGG LE  PCSRV H+FR   
Sbjct: 205 MMGSVFAADKNYFLQHGGYDEGMRFEGAENIELSLRQWQCGGRLECSPCSRVFHLFRS-- 262

Query: 436 PYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCKTF 495
               G     +  N +R   VWMD+Y      V     HV +GDIS+R  LRERL CK+F
Sbjct: 263 ----GADAQPVTWNRLRTMAVWMDEYGDLAWRVT-GEPHVSLGDISDRIKLRERLGCKSF 317

Query: 496 KWYLDNMWFETD 507
           +W+LDN+W E+D
Sbjct: 318 QWFLDNVWPESD 329



 Score = 52.4 bits (120), Expect = 3e-05
 Identities = 26/44 (59%), Positives = 33/44 (75%)

Query: 151 LPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYS 194
           LP ASIII FYNE+   L R++HSI+ RT  + ++EIILVDD S
Sbjct: 31  LPAASIIIAFYNEYPTALHRTLHSILHRTPLQLLEEIILVDDGS 74


>UniRef50_Q5CYR4 Cluster: Extracellular protein with a signal
           peptide followed by a family 2 glycosyltransferase and
           ricin domains; n=3; Cryptosporidium|Rep: Extracellular
           protein with a signal peptide followed by a family 2
           glycosyltransferase and ricin domains - Cryptosporidium
           parvum Iowa II
          Length = 545

 Score =  164 bits (398), Expect = 7e-39
 Identities = 109/319 (34%), Positives = 162/319 (50%), Gaps = 44/319 (13%)

Query: 247 NSEVKNNVFN-IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRL 305
           N E+ ++    I++++  K EGLIR+R+ GAD S   V+VF+D H      W+ PL+ RL
Sbjct: 101 NKELPSSYLKYIKVIRLDKCEGLIRSRILGADASKSSVIVFMDGHCRPKENWIEPLINRL 160

Query: 306 SQGVDGVKVRYSARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDE 365
            +    +        V P+I+ I+  T++   +  ++  F+W   F W           E
Sbjct: 161 KEKPKAI--------VCPMIEDIDRYTWKDLGTFGLKMMFDWNFEFNWY----------E 202

Query: 366 DFMKPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCS 425
           DF   +  P  +GGL+AI RE++   GKYDPGM  WGGEN+E S RIW CGG +     S
Sbjct: 203 DFTDVI--PIASGGLYAITREWWEESGKYDPGMLEWGGENIEQSIRIWRCGGEIVAEKKS 260

Query: 426 RVGHVF-RKRRPYGVGEKQDYMLQNSMRMARVWMD----DYVKKVIEVNPSAAHVEIG-D 479
           RVGH+F R  +P    +    + +N  R A VW+D     Y + + +V  S    + G D
Sbjct: 261 RVGHIFKRDPKPNPENKLVLQVQRNQKRAAMVWLDKKRYKYFETIHDVVKSLNETQSGVD 320

Query: 480 ISERKALRERLQCKTFKWYLDNMWFETDRSELVLGR---------TLCLDASNN------ 524
           + +R +++ERL+CK F WY+D      DR  L+L            LCL AS N      
Sbjct: 321 LEQRHSIKERLKCKPFSWYVDKFRASFDRGGLLLDNFRHFKHRKSGLCLTASLNEVVTGT 380

Query: 525 --VAPILGKCHEMGGTQEW 541
              A +  +C+E   TQ+W
Sbjct: 381 EDKAVVFKECNERDDTQKW 399


>UniRef50_Q6YBY0 Cluster: UDP-N-acetyl-D-galactosamine:polypeptide
           N- acetylgalactosaminyltransferase T3; n=1; Toxoplasma
           gondii|Rep: UDP-N-acetyl-D-galactosamine:polypeptide N-
           acetylgalactosaminyltransferase T3 - Toxoplasma gondii
          Length = 635

 Score =  161 bits (392), Expect = 4e-38
 Identities = 96/257 (37%), Positives = 137/257 (53%), Gaps = 16/257 (6%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           +RL++   R+G++ AR+ G   S   +   LDSHIEV+  WL PLL R+ +  DG     
Sbjct: 245 VRLIRNEVRKGIVGARMKGIRASRAPIFAILDSHIEVSPQWLEPLLLRIKE--DG----- 297

Query: 317 SARAVTPVIDVINADTFEYSPSPL-VRGGFNWGL-HFKWDNLPKGTLINDEDFMKPLK-- 372
             R V P ID I+A+TF++    +  + GF W L    ++      L  +E    P    
Sbjct: 298 -RRVVMPQIDGIDAETFKHIAGGIGCKLGFLWKLMEHSYEGHQTARLPPEERQPSPTDFQ 356

Query: 373 -SPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVF 431
            SP MAGGLFA  + +F  +G YD     WG ENLE+SFR+W CGG LE  PCSRV H+F
Sbjct: 357 TSPAMAGGLFAANKAFFFDVGAYDEDFQFWGTENLELSFRLWQCGGVLECAPCSRVYHIF 416

Query: 432 RKRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEV-NPSAAHVEIGDISERKALRERL 490
           RK    G     D +  N MR   +WMD+Y      V      +     + +R+  R+R 
Sbjct: 417 RKGGS-GYSSPGDSITINKMR-TMLWMDEYADLAWRVIGKPRVNYRPESLEKRREWRKRK 474

Query: 491 QCKTFKWYLDNMWFETD 507
            CK+F+W+++N++ E D
Sbjct: 475 GCKSFRWFMENVFPEGD 491



 Score = 74.1 bits (174), Expect = 9e-12
 Identities = 37/84 (44%), Positives = 52/84 (61%), Gaps = 1/84 (1%)

Query: 120 FNTLISQRIGDHRDLPDTRNKLCQSQQY-FDELPRASIIICFYNEHYETLMRSVHSIMDR 178
           FN  +S  +   R  PD R+  C+   Y    LP+AS+II FYNE + TLMRSVHS+++ 
Sbjct: 146 FNLYLSDHLELDRTAPDARHASCRQLHYDLSTLPKASVIIVFYNEPFSTLMRSVHSVLNG 205

Query: 179 TDQKHIKEIILVDDYSDLYNLHHD 202
           T  + ++E+ILVDD S L  +  D
Sbjct: 206 TPPQILEELILVDDGSTLPYIRED 229


>UniRef50_Q8IA41 Cluster: Putative polypeptide
           N-acetylgalactosaminyltransferase 11; n=2; Drosophila
           melanogaster|Rep: Putative polypeptide
           N-acetylgalactosaminyltransferase 11 - Drosophila
           melanogaster (Fruit fly)
          Length = 557

 Score =  146 bits (353), Expect = 2e-33
 Identities = 102/347 (29%), Positives = 163/347 (46%), Gaps = 44/347 (12%)

Query: 260 LKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRYSAR 319
           L+    +G+I ARL GA  + GD+LVFL+ H+EV  GWLPPLL+ +        +  +  
Sbjct: 171 LEMESSKGIIHARLTGAGVATGDILVFLNGHMEVTRGWLPPLLEPI--------LLNNQT 222

Query: 320 AVTPVIDVINADTFEY----SPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPT 375
              P++D I+ ++F Y     P  L    F+W L   +  L + +        KP  S  
Sbjct: 223 VTEPIVDAISRESFAYRKLVEPEQLA---FDWQLDHIFLPLDQHSW---NSLPKPYPSSQ 276

Query: 376 MAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVF-RKR 434
           + G +FAI R++F  +G +D G+  +GG+ LE+S ++W CGG +  +PCSRVG ++ R  
Sbjct: 277 LEGRVFAIDRKWFWHLGGWDEGLRDYGGDALELSLKVWQCGGLILAVPCSRVGIIYKRDE 336

Query: 435 RPYGVGEKQDYMLQ---NSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQ 491
               +   ++  LQ   N  R+  VW+D+Y       NP   ++    + + + LR RL 
Sbjct: 337 LEAQMAPNRNPSLQVQKNFKRVVDVWLDEYKLHFYRYNPKLRNLTAESLDKPRDLRRRLN 396

Query: 492 CKTFKWY-------LDNMWFETDRSELVLGRTL-------CLDASNNVAPILGKCHEMGG 537
           CK+F+WY       + N +     +   +G+ +       CL       P++ KCH    
Sbjct: 397 CKSFEWYRSQVAPQIRNHFLHAGLTNYPIGKIMPFVAPHFCLSIKGGF-PVIRKCHST-N 454

Query: 538 TQEWKHKGTASSPIYNTAAGMCLGVDRSYRGETVLMVICDDYSNNKW 584
            ++W    T +S        MCL VD  Y+            S N W
Sbjct: 455 FEDW----TLTSRCQLKHGNMCLDVD--YKNNVRATKCTKKLSKNPW 495



 Score = 52.0 bits (119), Expect = 4e-05
 Identities = 30/86 (34%), Positives = 46/86 (53%), Gaps = 2/86 (2%)

Query: 118 HAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELP-RASIIICFYNEHYETLMRSVHSIM 176
           + +N  +S+RI   R L D R+  C    Y  E     SI+I    EH  TL+R ++S++
Sbjct: 75  YQYNAWLSERIPLKRTLEDYRDPQCLKINYSSEKTVTVSIVIAIQQEHPHTLLRGIYSVI 134

Query: 177 DRTDQKHIKEIILV-DDYSDLYNLHH 201
            +T    +KEI+LV D + DL  + H
Sbjct: 135 TQTSPYLLKEIVLVHDGHPDLDLIRH 160


>UniRef50_UPI0000E46EB4 Cluster: PREDICTED: similar to MGC81846
           protein, partial; n=2; Strongylocentrotus
           purpuratus|Rep: PREDICTED: similar to MGC81846 protein,
           partial - Strongylocentrotus purpuratus
          Length = 358

 Score =  136 bits (329), Expect = 2e-30
 Identities = 60/114 (52%), Positives = 86/114 (75%)

Query: 96  EEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRAS 155
           E++ G++R  E+  IRD GY  HAFN LISQRIG HR++ DTRN LC+ Q Y +ELP  S
Sbjct: 133 EDELGMVRTDEERSIRDGGYRQHAFNELISQRIGFHRNVTDTRNPLCKYQVYSEELPTVS 192

Query: 156 IIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDK 209
           I+ICFYNE + TL+R+V+S++DRT ++ I E+ILVDD+S+L +L  ++ + + K
Sbjct: 193 IVICFYNEAWSTLLRTVYSVLDRTPRRLIHELILVDDFSELTHLKKELDQYMSK 246



 Score =  129 bits (312), Expect = 2e-28
 Identities = 63/130 (48%), Positives = 82/130 (63%), Gaps = 8/130 (6%)

Query: 242 KKSTENSEVKNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPL 301
           KK  +    KN    + ++   +REGLIRAR  GA  + GDVL+FLDSH EVN  WL PL
Sbjct: 237 KKELDQYMSKNFNGLVHVIHNGQREGLIRARTIGARYATGDVLMFLDSHCEVNEQWLEPL 296

Query: 302 LKRLSQGVDGVKVRYSARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTL 361
           L+R+           S   V P+ID+IN DTF Y+ SPLV+GGFNWG+HFKWD +    L
Sbjct: 297 LERIKAD--------SHTVVCPIIDIINHDTFAYTASPLVKGGFNWGMHFKWDTIRSRQL 348

Query: 362 INDEDFMKPL 371
           +  ED++KP+
Sbjct: 349 VGKEDYVKPI 358


>UniRef50_Q6YK77 Cluster: UDP-N-acetyl-D-galactosamine:polypeptide
           N- acetylgalactosaminyltransferase T2; n=1; Toxoplasma
           gondii|Rep: UDP-N-acetyl-D-galactosamine:polypeptide N-
           acetylgalactosaminyltransferase T2 - Toxoplasma gondii
          Length = 692

 Score =  135 bits (326), Expect = 4e-30
 Identities = 86/257 (33%), Positives = 126/257 (49%), Gaps = 22/257 (8%)

Query: 258 RLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRYS 317
           R+++    +GLIR R+ GA  +  D   FLD H    VGW  PLL  L       K  Y 
Sbjct: 214 RVIRFDSPQGLIRGRVAGAAIATSDNFFFLDGHCRPKVGWAEPLLAHL-------KTNYR 266

Query: 318 ARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTMA 377
            R   P I  I  D++E   +   +  F W   F W           ED    +  P +A
Sbjct: 267 -RIACPKIYDIYLDSWEDVGTHGTKMMFEWTFEFGWF----------EDLEDEV--PVLA 313

Query: 378 GGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRRPY 437
           GG+ A+ ++++   G YD GM  WGGENLE S R W+CGG +  +  S++GH+F +    
Sbjct: 314 GGILAMTKKWWIESGLYDEGMLEWGGENLEQSIRSWLCGGEIVAVQESKIGHIFSRPPKP 373

Query: 438 GVGEKQDYMLQ-NSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALR-ERLQCKTF 495
             G +    +Q N  R A+VW+D+Y     + +      + GDI++RK LR E+L C  F
Sbjct: 374 NPGNRLVIQVQKNQKRGAKVWLDEYYFLFYKYHREVRGHQEGDITQRKKLRYEQLTCMPF 433

Query: 496 KWYLDNMWFETDRSELV 512
           +WY++      DR  L+
Sbjct: 434 QWYVEKFKTAFDRKGLL 450


>UniRef50_Q4RNJ5 Cluster: Chromosome 21 SCAF15012, whole genome
           shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
           Chromosome 21 SCAF15012, whole genome shotgun sequence -
           Tetraodon nigroviridis (Green puffer)
          Length = 364

 Score =  133 bits (321), Expect = 1e-29
 Identities = 80/181 (44%), Positives = 103/181 (56%), Gaps = 33/181 (18%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           +RL++ +KREGL+RARL GA  + G+VL FLD H E + GWL PLL+R+ +    V    
Sbjct: 191 VRLIRATKREGLVRARLLGASITTGEVLTFLDCHCECHEGWLEPLLQRIKEEPSAV---- 246

Query: 317 SARAVTPVIDVINADTFEY--SPSPLVRGGFNWGLHFKWDNLPK---------------G 359
               V PVIDVI+ +TFEY  +P     GGF+W L F W  +P+               G
Sbjct: 247 ----VCPVIDVIHWNTFEYLGNPGEPQIGGFDWRLVFTWHIIPEYEQKRRRSPTDVIRYG 302

Query: 360 TLI-------NDEDFMKPLK-SPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFR 411
            L           D     + SPTMAGGLFA+ + YF+ +G YD GM VWGGENLE SFR
Sbjct: 303 RLFRTLALRAGSSDVPSARRRSPTMAGGLFAVSKNYFHYLGTYDTGMEVWGGENLEFSFR 362

Query: 412 I 412
           +
Sbjct: 363 V 363



 Score = 84.6 bits (200), Expect = 7e-15
 Identities = 39/114 (34%), Positives = 67/114 (58%), Gaps = 1/114 (0%)

Query: 101 LIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQY-FDELPRASIIIC 159
           L  + E+ +  ++    H  N  +S ++  HR LP+  N  C+  +Y +  LP  S++I 
Sbjct: 78  LTLSEEEKQKEEESLQKHQINIYVSDQVSLHRRLPEKWNPRCRELEYDYRSLPTTSVVIA 137

Query: 160 FYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNV 213
           FYNE + TL+R+VHS+++ +    +KE++LVDDYSD  +L   +++ V  L  V
Sbjct: 138 FYNEAWSTLLRTVHSVLETSPDILLKEVVLVDDYSDRAHLKEPLEKYVSGLKKV 191


>UniRef50_Q4RNJ6 Cluster: Chromosome 21 SCAF15012, whole genome
           shotgun sequence; n=2; Clupeocephala|Rep: Chromosome 21
           SCAF15012, whole genome shotgun sequence - Tetraodon
           nigroviridis (Green puffer)
          Length = 534

 Score =  125 bits (301), Expect = 4e-27
 Identities = 92/262 (35%), Positives = 128/262 (48%), Gaps = 44/262 (16%)

Query: 321 VTPVIDVINADTFEY---SPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTMA 377
           V PVID I+ +TFEY   +  P++ GGF+W L F+W ++P+         + P++ P   
Sbjct: 210 VCPVIDTIDWNTFEYYMQTDEPMI-GGFDWRLTFQWHSVPERERKRRSSRIDPIR-PRCR 267

Query: 378 GGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRRPY 437
           G L A+                       EI   +W CGGSLE+ PCS VGHVF K+ PY
Sbjct: 268 GALAAMSLSLAFR----------------EIRGNVWQCGGSLEIHPCSHVGHVFPKKAPY 311

Query: 438 GVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCKTFKW 497
                +   LQN++R A VWMD Y +     NP+A     GDIS R  LRE+L+C++F W
Sbjct: 312 A----RPNFLQNTVRAAEVWMDSYKQHFYNRNPAARKETYGDISGRLLLREKLKCQSFTW 367

Query: 498 YLDNMWFETDRSELVLG-----RTL-----CLDASNNVAPILGK------CHEMGGTQEW 541
           YL N++ E    E   G     R L     CLD +     + G       CH  GG Q +
Sbjct: 368 YLKNIYPELHIPEDRAGWHGAVRNLGISSECLDYNAPEHSVTGAQLSLFGCHGQGGNQYF 427

Query: 542 KHKGTASSPI-YNTAAGMCLGV 562
           ++  T+   I +NT   +C  V
Sbjct: 428 EY--TSQKEIRFNTVTELCAEV 447



 Score = 71.3 bits (167), Expect = 7e-11
 Identities = 33/78 (42%), Positives = 49/78 (62%), Gaps = 2/78 (2%)

Query: 118 HAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMD 177
           +A N  +S +I  HR + D R   C     +  LP  S+II FYNE + TL+R++HS+++
Sbjct: 106 YAINIFVSDKISLHRHIQDHRMNECAFD--YRRLPTTSVIIAFYNEAWSTLLRTIHSVLE 163

Query: 178 RTDQKHIKEIILVDDYSD 195
            T    +KEIIL+DD+SD
Sbjct: 164 TTPAILLKEIILIDDFSD 181


>UniRef50_Q5CHA1 Cluster: Glycosyl transferase; n=4;
           Cryptosporidium|Rep: Glycosyl transferase -
           Cryptosporidium hominis
          Length = 809

 Score =  124 bits (299), Expect = 7e-27
 Identities = 80/256 (31%), Positives = 127/256 (49%), Gaps = 35/256 (13%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           +++++  K EGLIR+++ GAD ++G  + FLD H +   GW   L+K        ++  Y
Sbjct: 333 VKIIRLKKCEGLIRSKIIGADAALGPNIFFLDGHCKPKKGWSEALVK-------SIRENY 385

Query: 317 SARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKW--DNLPKGTLINDEDFMKPLKSP 374
             R V P++  I+   +    +   +    W   F W  D LP              + P
Sbjct: 386 K-RVVCPIVQSISNIDWSDIGTAGAKMMIEWNFAFHWYDDGLP--------------EIP 430

Query: 375 TMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRK- 433
             +GG+  I + ++   GKYDPGM  WGGEN+E SFRIW+CGG + ++  S VGH+F + 
Sbjct: 431 IASGGILMITKRWWEESGKYDPGMLYWGGENIEQSFRIWLCGGEIHVVRNSLVGHIFERN 490

Query: 434 ---RRPYGVGEKQ---DYMLQNSMRMARVWMDDYVKKVIEVNPSA-AHVEIG---DISER 483
              +R      K+   D M  N  R A VW+ +   +    N     ++ I     +SER
Sbjct: 491 NSNKRNQDFQYKKMLIDNMNSNHQRTAFVWLSEQFYETYFKNYHVLGYLPISYTKGLSER 550

Query: 484 KALRERLQCKTFKWYL 499
            +L+  L+CK F+WY+
Sbjct: 551 LSLKHILKCKPFEWYI 566


>UniRef50_Q4SKF7 Cluster: Chromosome 13 SCAF14566, whole genome
           shotgun sequence; n=14; Clupeocephala|Rep: Chromosome 13
           SCAF14566, whole genome shotgun sequence - Tetraodon
           nigroviridis (Green puffer)
          Length = 530

 Score =  122 bits (294), Expect = 3e-26
 Identities = 86/278 (30%), Positives = 140/278 (50%), Gaps = 43/278 (15%)

Query: 249 EVKNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQG 308
           E +N       ++ ++++GL  AR+ G   +  DV+  LD+HIEV+  W  PLL ++   
Sbjct: 122 EKENPSVRFTRVRHTEQKGLSHARVSGWSAATADVVAILDAHIEVHEMWAEPLLTQIRAD 181

Query: 309 VDGVKVRYSARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNL-PKGTLINDEDF 367
              V        V+PV D +N D  +          F+W L   ++   P+   + D   
Sbjct: 182 RSVV--------VSPVFDRVNYDDLKVIKYSPAAHAFDWALWCMYEGFTPEYYKLADSSL 233

Query: 368 MKPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRV 427
             P KSP++ G L A  R++   IG  D GM            ++W CGGS+E++PCS++
Sbjct: 234 --PGKSPSVMGILVAD-RKFLGEIGVLDEGM------------KVWTCGGSIEVVPCSKI 278

Query: 428 GHVFRKRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKV---------IEVNPSAAH---- 474
            H+ R  + Y + +    M +N++R+A VWMD+Y   V         +  N   +     
Sbjct: 279 AHMERAHKRY-MPDLTLAMKRNALRVAEVWMDEYKHNVNLAWNLPFQVFENEKRSSGNKR 337

Query: 475 -----VEIGDISERKALRERLQCKTFKWYLDNMWFETD 507
                ++IG+++ERK LRERL+CK FKWYL+N++ + D
Sbjct: 338 RPNHGIDIGNVTERKQLRERLKCKPFKWYLENVYPKLD 375



 Score = 68.1 bits (159), Expect = 6e-10
 Identities = 37/127 (29%), Positives = 70/127 (55%), Gaps = 3/127 (2%)

Query: 92  ENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDEL 151
           ++ L +++G   +  D R  +  +  + +N  +S R+   R L DTR   C  + Y  +L
Sbjct: 3   DSALFKEWGENLSEADQREAEALFKKYGYNVFLSDRLPLDRPLADTREPRCSKKSYPKDL 62

Query: 152 PRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQ---EAVD 208
           P  S+++ + NE    + R++ SI++RT +  +KEII+VDD S   +L  D+    +A++
Sbjct: 63  PTLSVVLIYLNEALSVIKRALRSILNRTPKHLLKEIIMVDDNSSNEDLKGDLDFYVKALE 122

Query: 209 KLNNVIK 215
           K N  ++
Sbjct: 123 KENPSVR 129


>UniRef50_Q8IA44 Cluster: Putative polypeptide
           N-acetylgalactosaminyltransferase 12; n=2; Drosophila
           melanogaster|Rep: Putative polypeptide
           N-acetylgalactosaminyltransferase 12 - Drosophila
           melanogaster (Fruit fly)
          Length = 563

 Score =  115 bits (276), Expect = 4e-24
 Identities = 68/250 (27%), Positives = 123/250 (49%), Gaps = 16/250 (6%)

Query: 258 RLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRYS 317
           R+L   ++ GLI+AR   A  +  + LVF+D+ +E   GWL PLL  +++         S
Sbjct: 179 RILHLPEQVGLIKARNLAASEAKAENLVFVDAQVEFTNGWLSPLLDTIAE--------QS 230

Query: 318 ARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTMA 377
               TP++D ++  T  Y  S   RG ++W L  +   L +           P +   + 
Sbjct: 231 YTLATPILDNLDEQTLAYQRSIERRGMYDWSLTRREVPLSRA---RRSHLPWPYEVAAVR 287

Query: 378 GGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRRPY 437
             +FAI   +F  I  +D  +  +G   LE+SF++W  GG +  +PCSRVGH+  K   Y
Sbjct: 288 TSVFAIPAVWFQDISNFDNNLRGFGAAELELSFKVWCTGGRIVQVPCSRVGHLQPKDEDY 347

Query: 438 -----GVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISERKALRERLQC 492
                 + +  +   +N  R+  VW  D    + +  P   ++  GD++E + L ++ +C
Sbjct: 348 LKRYGDLHKMGEQKSRNLKRIIEVWTGDLKSAIYKYQPHLLNISEGDLNEPRKLYKQNEC 407

Query: 493 KTFKWYLDNM 502
           ++FK +++++
Sbjct: 408 QSFKEFINDI 417



 Score = 68.5 bits (160), Expect = 5e-10
 Identities = 34/86 (39%), Positives = 56/86 (65%), Gaps = 3/86 (3%)

Query: 113 KGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQY---FDELPRASIIICFYNEHYETLM 169
           +G+  + +N+ +++RI   R LPD R+  CQ  +Y    DE+  ASII+ F NE    L+
Sbjct: 68  QGWRYYLYNSWLAERIPLRRSLPDLRDHRCQKLEYDEDSDEMKPASIIMIFRNEQLVVLL 127

Query: 170 RSVHSIMDRTDQKHIKEIILVDDYSD 195
           R++HS+++RT +    E+ILV+D+SD
Sbjct: 128 RTLHSLVERTPKYLYIELILVNDHSD 153


>UniRef50_UPI0000E46FFD Cluster: PREDICTED: similar to
           n-acetylgalactosaminyltransferase, partial; n=1;
           Strongylocentrotus purpuratus|Rep: PREDICTED: similar to
           n-acetylgalactosaminyltransferase, partial -
           Strongylocentrotus purpuratus
          Length = 405

 Score =  110 bits (264), Expect = 1e-22
 Identities = 57/155 (36%), Positives = 89/155 (57%), Gaps = 8/155 (5%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           +RL+ T+ REG+ RA++ GA  + G+VLVFLD+H EVN  WL P+L  + QG   V    
Sbjct: 213 VRLVHTTHREGVARAKMRGAREARGEVLVFLDAHCEVNTHWLEPMLDLVHQGPTTV---- 268

Query: 317 SARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTM 376
               V+P+ID I+ +TF +    L R  F W L  +   L +       + ++P++SP  
Sbjct: 269 ----VSPIIDKIDPETFGFEDGSLARVTFRWSLETRRIPLSQIEKAERLNPLEPVRSPLT 324

Query: 377 AGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFR 411
            GG+FA+ + +F  IG  D G++ WG + L+ S +
Sbjct: 325 NGGIFAVSKSFFEKIGGIDAGLDGWGADGLDFSMK 359



 Score = 83.4 bits (197), Expect = 2e-14
 Identities = 40/95 (42%), Positives = 61/95 (64%), Gaps = 1/95 (1%)

Query: 101 LIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQY-FDELPRASIIIC 159
           LI + +D+    K  + H FN ++S +I   R + DTR+  CQ   Y F + P AS+II 
Sbjct: 99  LILSGKDMEKAKKSRDQHNFNLVVSDKISLERTVKDTRDSRCQDITYRFSKFPTASVIIA 158

Query: 160 FYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYS 194
           F+NE + TLMR+VHS+++RT +  + E++LVDD S
Sbjct: 159 FHNEAWSTLMRTVHSVVNRTPRDILTEVVLVDDAS 193



 Score = 37.1 bits (82), Expect = 1.3
 Identities = 14/25 (56%), Positives = 19/25 (76%)

Query: 478 GDISERKALRERLQCKTFKWYLDNM 502
           GD+SE K+LR RL C +F WYL ++
Sbjct: 364 GDVSEIKSLRARLSCHSFDWYLSHV 388


>UniRef50_Q4STJ6 Cluster: Chromosome undetermined SCAF14183, whole
           genome shotgun sequence; n=2; Tetraodon
           nigroviridis|Rep: Chromosome undetermined SCAF14183,
           whole genome shotgun sequence - Tetraodon nigroviridis
           (Green puffer)
          Length = 344

 Score =  106 bits (255), Expect = 1e-21
 Identities = 47/88 (53%), Positives = 61/88 (69%), Gaps = 1/88 (1%)

Query: 319 RAVTPVIDVINADTFEY-SPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTMA 377
           R V+PVID+IN DTF Y + S  +RGGF+W LHFKW+ L         D  +P+K+P +A
Sbjct: 5   RVVSPVIDIINMDTFAYVAASADLRGGFDWSLHFKWEQLSPEQRARRTDPAQPIKTPIIA 64

Query: 378 GGLFAIYREYFNAIGKYDPGMNVWGGEN 405
           GGLF I R +FN +GKYD  M++WGGEN
Sbjct: 65  GGLFVIDRSWFNHLGKYDTAMDIWGGEN 92


>UniRef50_UPI000155C133 Cluster: PREDICTED: similar to
           UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 4, partial; n=1;
           Ornithorhynchus anatinus|Rep: PREDICTED: similar to
           UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase-like 4, partial -
           Ornithorhynchus anatinus
          Length = 305

 Score =  103 bits (247), Expect = 1e-20
 Identities = 56/138 (40%), Positives = 83/138 (60%), Gaps = 15/138 (10%)

Query: 412 IWMCGGSLELIPCSRVGHVFRKRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVI---EV 468
           +W CGGS+E++PCSR+ H+ R  +PY   +   ++ +N++R+A VWMD++   V     +
Sbjct: 105 VWQCGGSVEVLPCSRIAHIERAHKPY-TEDLTAHVRRNALRVAEVWMDEFKSHVYMAWNI 163

Query: 469 NPSAAHVEIGDISERKALRERLQCKTFKWYLDNMWFETDR-SELV--------LGRTLCL 519
               + ++IGDISERKALR+ LQCKTF+WYL N++ E    S+ V        L   LCL
Sbjct: 164 PQEDSGIDIGDISERKALRKALQCKTFRWYLVNVYPEMRMYSDTVAYGVLQNSLKSDLCL 223

Query: 520 DASNNV--APILGKCHEM 535
           D   +    PI+  CH M
Sbjct: 224 DQGPDTENIPIMYICHGM 241


>UniRef50_Q4T0W3 Cluster: Chromosome undetermined SCAF10824, whole
           genome shotgun sequence; n=9; Euteleostomi|Rep:
           Chromosome undetermined SCAF10824, whole genome shotgun
           sequence - Tetraodon nigroviridis (Green puffer)
          Length = 149

 Score =  103 bits (246), Expect = 2e-20
 Identities = 46/92 (50%), Positives = 62/92 (67%), Gaps = 4/92 (4%)

Query: 412 IWMCGGSLELIPCSRVGHVFRKRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVNPS 471
           IW CGGSLE+ PCS VGHVF K+ PY     ++  L NS+R A VWMD+Y +     NP 
Sbjct: 1   IWQCGGSLEIHPCSHVGHVFPKKAPYS----RNKALANSVRAAEVWMDEYKEIYYHRNPH 56

Query: 472 AAHVEIGDISERKALRERLQCKTFKWYLDNMW 503
           A     GD++ER+ LRE+L CK+F W+L+N++
Sbjct: 57  ARLEAFGDVTERRKLREKLGCKSFGWFLENIY 88


>UniRef50_UPI0001554C17 Cluster: PREDICTED: similar to Polypeptide
           N-acetylgalactosaminyltransferase 17; n=1;
           Ornithorhynchus anatinus|Rep: PREDICTED: similar to
           Polypeptide N-acetylgalactosaminyltransferase 17 -
           Ornithorhynchus anatinus
          Length = 328

 Score = 97.5 bits (232), Expect = 9e-19
 Identities = 38/93 (40%), Positives = 61/93 (65%), Gaps = 1/93 (1%)

Query: 407 EISFRIWMCGGSLELIPCSRVGHVFRKRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVI 466
           +++ ++WMCGG +  +PCSRVGH++RK  PY V      + +N  R+A  WMD++ + + 
Sbjct: 91  DVAVKVWMCGGGMFDVPCSRVGHIYRKYVPYKVPSGTS-LARNLKRVAETWMDEFAEYIY 149

Query: 467 EVNPSAAHVEIGDISERKALRERLQCKTFKWYL 499
           +  P   H+  GDIS +K LR+ L+CK FKW++
Sbjct: 150 QRRPEYRHLSTGDISAQKELRKHLKCKDFKWFM 182


>UniRef50_A5D6B4 Cluster: Predicted glycosyltransferases; n=1;
           Pelotomaculum thermopropionicum SI|Rep: Predicted
           glycosyltransferases - Pelotomaculum thermopropionicum
           SI
          Length = 274

 Score = 91.1 bits (216), Expect = 8e-17
 Identities = 81/258 (31%), Positives = 123/258 (47%), Gaps = 32/258 (12%)

Query: 252 NNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQ-GVD 310
           NN   ++L+ +S   G  RAR  GA ++ G  L+F D+HI V   WL  LL   S+ GVD
Sbjct: 36  NNYERVKLISSSGL-GAARARNLGAASARGKYLIFCDAHITVPQNWLEALLDTFSRPGVD 94

Query: 311 GVKVRYSARAVTPVIDVINADTFEYSPSPLVRGG-FNWGLHFKWDNLPKGTLINDEDFMK 369
                    AV+P I  +       +P+ +  G  +N  L   W   P G        M 
Sbjct: 95  ---------AVSPAIGSLE------NPAAVGYGQTWNSRLETVWLPPPGG--------MP 131

Query: 370 PLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGH 429
               P + GG  A+    F  +G +D G  VWG E+ E+S ++W+ G  L ++P  RV H
Sbjct: 132 AGPVPLLPGGCLAVRAGAFRRVGGFDEGFIVWGCEDAELSLKLWLFGCRLYVVPSVRVLH 191

Query: 430 VFRKRRPYGVGEKQDYMLQNSMRMA-RVWMDDYVKKVIE-VNPSAAHVE-IGDISERKAL 486
           +FR R PY V    D++  N +RMA   +    VKKV+  + P     + +  + +  AL
Sbjct: 192 LFRSRHPYPV--TMDHVHHNLLRMALSHFKSSRVKKVMGLIEPCGRLADTVRRVLQGGAL 249

Query: 487 RERLQCKTFKWYLDNMWF 504
           ++R +    + Y D+ WF
Sbjct: 250 QQRRRYLAGRMY-DDDWF 266


>UniRef50_UPI0000D8AB1E Cluster:
           UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 9; n=2;
           Euarchontoglires|Rep:
           UDP-N-acetyl-alpha-D-galactosamine:polypeptide
           N-acetylgalactosaminyltransferase 9 - Mus musculus
          Length = 311

 Score = 89.0 bits (211), Expect = 3e-16
 Identities = 59/180 (32%), Positives = 84/180 (46%), Gaps = 32/180 (17%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           +++++ S+REGLIRARL G   +   ++ F D+H+E N GW  P L R+ +         
Sbjct: 139 VKVVRNSRREGLIRARLQGWKVATAPIVGFFDAHVEFNTGWAEPALARIQED-------- 190

Query: 317 SARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFK-----------------WD---NL 356
             R + P ID I   TFE         G+NWGL                    W+   + 
Sbjct: 191 RRRIILPAIDNIKYSTFEVQQYASAAHGYNWGLWCMYIIPPQDWLDRGEPGSVWELGSST 250

Query: 357 PKGTLINDEDFMK-PL---KSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRI 412
             G+    E F   P    ++P M G  F + REYF  IG  DPGM V+G EN+E+  R+
Sbjct: 251 KAGSTPKGESFPSVPTHRGRTPAMIGCSFVVDREYFGDIGLLDPGMEVYGAENIELGMRV 310



 Score = 76.2 bits (179), Expect = 2e-12
 Identities = 35/90 (38%), Positives = 55/90 (61%)

Query: 120 FNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRT 179
           +N  +S RI   R +PD R K C+   Y ++LP+ S++  F NE    ++RSVHS+++ T
Sbjct: 44  YNAQLSDRISLDRTIPDYRPKRCRQITYSEDLPQISVVFIFVNEALSVILRSVHSVVNHT 103

Query: 180 DQKHIKEIILVDDYSDLYNLHHDVQEAVDK 209
             + +KE+ILVDD SD   L  ++ + V K
Sbjct: 104 PSQLLKEVILVDDNSDNVELKFNLDQYVHK 133


>UniRef50_UPI00005A4710 Cluster: PREDICTED: similar to GalNAc
           transferase 10 isoform a; n=1; Canis lupus
           familiaris|Rep: PREDICTED: similar to GalNAc transferase
           10 isoform a - Canis familiaris
          Length = 216

 Score = 87.8 bits (208), Expect = 7e-16
 Identities = 40/89 (44%), Positives = 57/89 (64%)

Query: 112 DKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRS 171
           D  Y  + FN  +S  I   R LPD R+  C+ + Y + LP  SIII F+NE + +L+R+
Sbjct: 81  DSAYRENGFNIFVSNSIALERSLPDIRHANCKHKMYLERLPNTSIIIPFHNEGWTSLLRT 140

Query: 172 VHSIMDRTDQKHIKEIILVDDYSDLYNLH 200
           +HSI++RT +  I EIILVDD+SD   +H
Sbjct: 141 IHSIINRTPESLIAEIILVDDFSDRGKIH 169


>UniRef50_Q4RQK9 Cluster: Chromosome 2 SCAF15004, whole genome
           shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
           Chromosome 2 SCAF15004, whole genome shotgun sequence -
           Tetraodon nigroviridis (Green puffer)
          Length = 449

 Score = 85.4 bits (202), Expect = 4e-15
 Identities = 43/100 (43%), Positives = 66/100 (66%), Gaps = 4/100 (4%)

Query: 98  QFG---LIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRA 154
           QFG   L+ +SED ++R++ ++   FN  +S RI   R +PDTR + C      D+LP  
Sbjct: 25  QFGQAVLVSSSEDAQVRER-WDEGFFNVYLSDRIPVDRAVPDTRPESCAQSLIHDDLPST 83

Query: 155 SIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYS 194
           S+I CF +E + TL+RSVHS+++R+    ++EIILVDD+S
Sbjct: 84  SVIFCFVDEVWSTLLRSVHSVLNRSPLHLLREIILVDDFS 123



 Score = 82.6 bits (195), Expect = 3e-14
 Identities = 47/103 (45%), Positives = 62/103 (60%), Gaps = 9/103 (8%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           +R+++  +R+GLIRARL GA  + G+VL FLDSH+E NVGWL PLL+R+   +D  KV  
Sbjct: 142 VRIIRLQERQGLIRARLAGAAAATGEVLTFLDSHVECNVGWLEPLLERIY--LDRRKV-- 197

Query: 317 SARAVTPVIDVINADTFEYS-PSPLVRGGFNWGLHFKWDNLPK 358
                 PVI+VIN     Y       RG F W L F W  +P+
Sbjct: 198 ----PCPVIEVINDKDMSYMLVDNFQRGIFKWPLVFGWSPVPE 236


>UniRef50_Q7Q046 Cluster: ENSANGP00000016624; n=1; Anopheles gambiae
           str. PEST|Rep: ENSANGP00000016624 - Anopheles gambiae
           str. PEST
          Length = 205

 Score = 84.2 bits (199), Expect = 9e-15
 Identities = 42/94 (44%), Positives = 61/94 (64%), Gaps = 1/94 (1%)

Query: 113 KGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQ-QYFDELPRASIIICFYNEHYETLMRS 171
           +GY+    N  +S  I   R LPD R+  C ++ +   ELP+ASI+I F+NE +  L+R+
Sbjct: 110 QGYDQQGLNQYVSDLIPVRRRLPDLRDPWCTAETRLLPELPQASIVIVFFNEAWSVLVRT 169

Query: 172 VHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQE 205
           VHSI+DRT    I+EIILVDD+S+L +L   + E
Sbjct: 170 VHSILDRTPHALIREIILVDDFSNLAHLRTQLDE 203


>UniRef50_Q2B871 Cluster: Glycosyl transferase, group 2 family
           protein; n=1; Bacillus sp. NRRL B-14911|Rep: Glycosyl
           transferase, group 2 family protein - Bacillus sp. NRRL
           B-14911
          Length = 297

 Score = 82.6 bits (195), Expect = 3e-14
 Identities = 75/252 (29%), Positives = 115/252 (45%), Gaps = 26/252 (10%)

Query: 256 NIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVR 315
           NI L+ T+   G   AR  GA  + G VLVF D+H+E    WL  L++ L  G+      
Sbjct: 61  NISLI-TTDGVGAANARNEGAKLAKGQVLVFCDAHLEFEDYWLDLLIEPLLTGLTD---- 115

Query: 316 YSARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPT 375
               AVTP I  I    F      L     +  +   W+       +  +D  +    P 
Sbjct: 116 ----AVTPAIGAIGNPHFTGYGQTLWVNERSSKIRTHWN-------VKQDDLFETAILP- 163

Query: 376 MAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRR 435
             GG FAI R  F   G ++ G  VWG E++EIS ++W+ G    + P ++V H+FRK +
Sbjct: 164 --GGCFAINRSVFEEAGGFETGFPVWGYEDVEISIKLWLFGYKCHVQPKAKVLHLFRKVQ 221

Query: 436 PYGVGEKQDYMLQNSMRMARVWMDD---YVKKVIEVNPSAAHVEIGDISERKALRERLQC 492
           PY V E  +Y   N +R+A +       Y  + + +N +   +E   +  + AL E+ Q 
Sbjct: 222 PYRV-ELDEY-FYNLLRLAYLHFSPARIYKTRKMLINGNEKEIE-RKVLAQGAL-EKKQA 277

Query: 493 KTFKWYLDNMWF 504
              +   D+ WF
Sbjct: 278 YLARRRYDDDWF 289


>UniRef50_Q4SU00 Cluster: Chromosome undetermined SCAF14054, whole
           genome shotgun sequence; n=1; Tetraodon
           nigroviridis|Rep: Chromosome undetermined SCAF14054,
           whole genome shotgun sequence - Tetraodon nigroviridis
           (Green puffer)
          Length = 253

 Score = 80.2 bits (189), Expect = 1e-13
 Identities = 32/62 (51%), Positives = 42/62 (67%)

Query: 344 GFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGG 403
           GF+W LHFKW+ L         D  +P+K+P +AGGLF I R +FN +GKYD  M++WGG
Sbjct: 1   GFDWSLHFKWEQLSPEQRARRTDPAQPIKTPIIAGGLFVIDRSWFNHLGKYDTAMDIWGG 60

Query: 404 EN 405
           EN
Sbjct: 61  EN 62


>UniRef50_UPI0000E234D0 Cluster: PREDICTED: similar to UDP-GalNAc:
           polypeptide N-acetylgalactosaminyltransferase; n=1; Pan
           troglodytes|Rep: PREDICTED: similar to UDP-GalNAc:
           polypeptide N-acetylgalactosaminyltransferase - Pan
           troglodytes
          Length = 459

 Score = 79.8 bits (188), Expect = 2e-13
 Identities = 35/95 (36%), Positives = 57/95 (60%)

Query: 115 YNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHS 174
           Y  + +N  +S RI   R +PD R + C+   Y  +LP+ S++  F NE    ++RSVHS
Sbjct: 114 YEEYGYNAQLSDRISLDRSIPDYRPRKCRQMSYAQDLPQVSVVFIFVNEALSVILRSVHS 173

Query: 175 IMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDK 209
           +++ T  + +KE+ILVDD SD   L  ++ + V+K
Sbjct: 174 VVNHTPSQLLKEVILVDDNSDNVELKFNLDQYVNK 208



 Score = 79.4 bits (187), Expect = 2e-13
 Identities = 61/169 (36%), Positives = 86/169 (50%), Gaps = 27/169 (15%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQ--------G 308
           +++++ S+REGLIRARL G   +   V+ F D+H+E N GW  P L R+ +         
Sbjct: 214 VKIVRNSRREGLIRARLQGWKAATAPVVGFFDAHVEFNTGWAEPALSRIREDSRPQFLLS 273

Query: 309 VDGVK-----VRYSARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLIN 363
            D +K     VRYS RA      +    +F  S  P  +G    GLH   + L   +L  
Sbjct: 274 HDELKLSCFSVRYSRRAHHYAWLIFCIFSFPGS-WPSRQGS---GLHS--EVLTPLSL-- 325

Query: 364 DEDFMKPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRI 412
                 P ++P M G  F + REYF  IG  DPGM V+GGEN+E+  R+
Sbjct: 326 ------PPRTPAMIGCSFVVDREYFGDIGLLDPGMEVYGGENVELGMRV 368


>UniRef50_Q5BYW7 Cluster: SJCHGC07375 protein; n=1; Schistosoma
           japonicum|Rep: SJCHGC07375 protein - Schistosoma
           japonicum (Blood fluke)
          Length = 202

 Score = 79.4 bits (187), Expect = 2e-13
 Identities = 42/105 (40%), Positives = 65/105 (61%), Gaps = 2/105 (1%)

Query: 115 YNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDEL-P-RASIIICFYNEHYETLMRSV 172
           ++++ FN + S  IG  R+L D R+  C  Q   D+L P + S+II F+NE +  L+R+V
Sbjct: 87  FSINEFNLVASDLIGLRRNLDDFRHPSCPRQIPLDKLIPFKTSVIIVFHNEAWSALLRTV 146

Query: 173 HSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIKKE 217
           HS++DRT ++ + EIILVDD S   +L H +   +  LN  I+ E
Sbjct: 147 HSVLDRTPEQLLHEIILVDDASTQSHLGHQLDNYISSLNKPIRLE 191


>UniRef50_Q4TCW9 Cluster: Chromosome undetermined SCAF6660, whole
           genome shotgun sequence; n=2; Deuterostomia|Rep:
           Chromosome undetermined SCAF6660, whole genome shotgun
           sequence - Tetraodon nigroviridis (Green puffer)
          Length = 157

 Score = 75.4 bits (177), Expect = 4e-12
 Identities = 43/94 (45%), Positives = 56/94 (59%), Gaps = 9/94 (9%)

Query: 242 KKSTENSEVKNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPL 301
           KK  EN  V+     +R+L+  +R GLIRARL GA  + G V+ FLD+H E  VGWL PL
Sbjct: 59  KKKLENY-VRTLEVPVRILRMEQRSGLIRARLRGAAATKGQVITFLDAHCECTVGWLEPL 117

Query: 302 LKRLSQGVDGVKVRYSARAVTPVIDVINADTFEY 335
           L R+ +    V        V P+IDVI+ +TFEY
Sbjct: 118 LARIKEDRTAV--------VCPIIDVISDETFEY 143



 Score = 59.3 bits (137), Expect = 3e-07
 Identities = 23/54 (42%), Positives = 42/54 (77%)

Query: 142 CQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSD 195
           C+++ Y D++P  S++I F+NE + TL+R+VHS+++R+ +  + EI+LVDD S+
Sbjct: 1   CKTKVYPDDVPNTSVVIVFHNEAWSTLLRTVHSVINRSPRHLLVEIVLVDDASE 54


>UniRef50_UPI0000E46BB8 Cluster: PREDICTED: similar to
           UDP-GalNAc:polypeptide,
           N-acetylgalactosaminyltransferase, partial; n=3;
           Strongylocentrotus purpuratus|Rep: PREDICTED: similar to
           UDP-GalNAc:polypeptide,
           N-acetylgalactosaminyltransferase, partial -
           Strongylocentrotus purpuratus
          Length = 112

 Score = 74.1 bits (174), Expect = 9e-12
 Identities = 35/79 (44%), Positives = 53/79 (67%)

Query: 117 LHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIM 176
           ++ FN + S RI  +R LPD R + C ++ Y  +LP  S+I+ ++NE   TL+R+VHSI+
Sbjct: 33  INEFNLMASDRIALNRSLPDVRPRGCANKVYPKKLPTTSVILVYHNEARSTLLRNVHSII 92

Query: 177 DRTDQKHIKEIILVDDYSD 195
           +R+    + EIILVDD SD
Sbjct: 93  NRSPHDLLAEIILVDDASD 111


>UniRef50_UPI00005A4DE7 Cluster: PREDICTED: similar to Probable
           polypeptide N-acetylgalactosaminyltransferase 8
           (Protein-UDP acetylgalactosaminyltransferase 8)
           (UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 8) (Polypeptide GalNAc
           transferase 8) (GalNAc-T8) (pp-GaNTase 8)...; n=1; Canis
           lupus familiaris|Rep: PREDICTED: similar to Probable
           polypeptide N-acetylgalactosaminyltransferase 8
           (Protein-UDP acetylgalactosaminyltransferase 8)
           (UDP-GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 8) (Polypeptide GalNAc
           transferase 8) (GalNAc-T8) (pp-GaNTase 8)... - Canis
           familiaris
          Length = 437

 Score = 73.7 bits (173), Expect = 1e-11
 Identities = 40/91 (43%), Positives = 53/91 (58%), Gaps = 2/91 (2%)

Query: 321 VTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTMAGGL 380
           V+PV D IN DTFE     L   GFNW L  ++D LP+    +  D   P+KSP++  G+
Sbjct: 165 VSPVFDNINFDTFELDKYALAVDGFNWKLWCRYDPLPE-AWFDLHDVTAPIKSPSIM-GI 222

Query: 381 FAIYREYFNAIGKYDPGMNVWGGENLEISFR 411
            A  R +   IG  D GM V+GGEN+E+S R
Sbjct: 223 LAANRIFLGEIGSLDGGMLVYGGENVELSLR 253



 Score = 70.1 bits (164), Expect = 2e-10
 Identities = 41/150 (27%), Positives = 78/150 (52%), Gaps = 6/150 (4%)

Query: 45  QPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRN 104
           Q  +V+ T+ +   KY+++  Q    + + +K    K+   K     ++ L  Q+G   +
Sbjct: 3   QEENVDNTVERV--KYEEHPVQ----KTLKVKASETKEHKPKEILFPDSQLFRQWGEDLS 56

Query: 105 SEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEH 164
               +  +  +    +N  +S ++  +R +PDTR+  C  + Y  +LP   +I+ F NE 
Sbjct: 57  EAQQKKAEDLFQEFGYNVYLSNQLPLNRTIPDTRDSRCLQKTYSSQLPSLGVILIFMNEA 116

Query: 165 YETLMRSVHSIMDRTDQKHIKEIILVDDYS 194
              + R++ SI++RT  + +KEIILVDD+S
Sbjct: 117 LSIIQRAITSIINRTPTQLLKEIILVDDFS 146



 Score = 47.2 bits (107), Expect = 0.001
 Identities = 18/38 (47%), Positives = 29/38 (76%)

Query: 466 IEVNPSAAHVEIGDISERKALRERLQCKTFKWYLDNMW 503
           +E++   + ++ GDIS R ALR++L+CKTF WYL N++
Sbjct: 248 VELSLRNSGIDFGDISSRMALRKKLKCKTFDWYLKNVY 285


>UniRef50_Q4T9B6 Cluster: Chromosome undetermined SCAF7602, whole
           genome shotgun sequence; n=1; Tetraodon
           nigroviridis|Rep: Chromosome undetermined SCAF7602,
           whole genome shotgun sequence - Tetraodon nigroviridis
           (Green puffer)
          Length = 451

 Score = 73.3 bits (172), Expect = 2e-11
 Identities = 31/82 (37%), Positives = 53/82 (64%)

Query: 120 FNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRT 179
           FN + S  I   R + D R+  C+   Y + L  +S++I F+NE + TLMR+VHS++ RT
Sbjct: 165 FNMVASDMISLDRTISDIRHDECKYWHYDENLLTSSVVIVFHNEGWSTLMRTVHSVIKRT 224

Query: 180 DQKHIKEIILVDDYSDLYNLHH 201
            ++++ EI+++DD+S+    HH
Sbjct: 225 PRRYLAEIVMIDDFSNKEKAHH 246


>UniRef50_UPI0000F20FAB Cluster: PREDICTED: hypothetical protein;
           n=1; Danio rerio|Rep: PREDICTED: hypothetical protein -
           Danio rerio
          Length = 204

 Score = 72.9 bits (171), Expect = 2e-11
 Identities = 32/54 (59%), Positives = 40/54 (74%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVD 310
           +R+L+T KREGLIR RL GA  + G V+ FLDSH E NV WLPPLL R++Q  +
Sbjct: 140 VRILRTQKREGLIRTRLLGAAAARGQVITFLDSHCEANVNWLPPLLDRIAQNTN 193



 Score = 64.1 bits (149), Expect = 1e-08
 Identities = 34/86 (39%), Positives = 55/86 (63%), Gaps = 4/86 (4%)

Query: 142 CQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHH 201
           C+ + Y  +LP  S+II F+NE + +L+R+VHS++DR+    I EIILVDD+SD  +L  
Sbjct: 69  CKLKLYTADLPNTSVIIPFHNEGWSSLLRTVHSVLDRSPPLLIAEIILVDDFSDKGHLKA 128

Query: 202 DVQEAVDKLNNV----IKKEEEMIET 223
            +++ + +L  V     +K E +I T
Sbjct: 129 PLEQYMVRLPKVRILRTQKREGLIRT 154


>UniRef50_Q01V98 Cluster: Glycosyl transferase, family 2; n=1;
           Solibacter usitatus Ellin6076|Rep: Glycosyl transferase,
           family 2 - Solibacter usitatus (strain Ellin6076)
          Length = 282

 Score = 72.9 bits (171), Expect = 2e-11
 Identities = 64/202 (31%), Positives = 95/202 (47%), Gaps = 24/202 (11%)

Query: 258 RLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRYS 317
           R++KT+   G+  AR  G   + GD+L F D+HI +   W  PL + L       KV   
Sbjct: 50  RVIKTNGI-GVACARNLGVSKTTGDMLFFADAHIRLEKNWWQPLAEVLEDR----KVAAV 104

Query: 318 ARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTMA 377
           A AVT            + P+   RG   +GL F   +L    L      + P  +P + 
Sbjct: 105 APAVT------------HLPATRRRG---FGLTFTGPDLDARWL--PRQGVTPFSAPILP 147

Query: 378 GGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRRPY 437
           G    + R  F+A+G +D G+   GG + E+S R+W+ G  L + P   V H+FR   PY
Sbjct: 148 GCSLMMRRATFDAVGGWDGGLLHRGGVDNEMSVRLWLLGYELMVAPQVVVPHLFRSASPY 207

Query: 438 GVGEKQDYMLQNSMRMARVWMD 459
            VG  Q   L N +R+A V ++
Sbjct: 208 PVGWPQ--YLHNRLRLAFVHLN 227


>UniRef50_A4J8G0 Cluster: Glycosyl transferase, family 2; n=1;
           Desulfotomaculum reducens MI-1|Rep: Glycosyl
           transferase, family 2 - Desulfotomaculum reducens MI-1
          Length = 291

 Score = 72.9 bits (171), Expect = 2e-11
 Identities = 55/199 (27%), Positives = 94/199 (47%), Gaps = 26/199 (13%)

Query: 256 NIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVR 315
           +++L+ T+   G   AR  GA    G++LVF D+HI V   WL    + LS+G+    + 
Sbjct: 58  SVKLINTTGI-GAANARNLGAQQCAGEILVFCDAHITVEPDWL----ENLSEGL----LE 108

Query: 316 YSARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPT 375
             + AV+P I  +N +         +  G  W    +   LP    + +         P 
Sbjct: 109 RGSGAVSPGIANMNMNH-------AIGYGMTWNKQLEARWLPSTGDVAEV--------PI 153

Query: 376 MAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRR 435
             GG  A++R+ FN +G ++ G   +G E+ E S ++W+ G  +E+ P   + H FR + 
Sbjct: 154 APGGCVAVHRDVFNDVGGFETGFRTYGFEDAEFSLKLWLFGYRVEVDPSVVIQHHFRSKH 213

Query: 436 PYGVGEKQDYMLQNSMRMA 454
           PY +   ++Y   N + MA
Sbjct: 214 PYSI-TMEEY-AYNGIHMA 230


>UniRef50_Q4RPK0 Cluster: Chromosome 12 SCAF15007, whole genome
           shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
           Chromosome 12 SCAF15007, whole genome shotgun sequence -
           Tetraodon nigroviridis (Green puffer)
          Length = 554

 Score = 72.1 bits (169), Expect = 4e-11
 Identities = 80/322 (24%), Positives = 130/322 (40%), Gaps = 44/322 (13%)

Query: 290 HIEVNVGWLPPLLKRLSQGVDGVKVRYSARAVTPVIDVINADTFEYSPSPLVRGGFNWGL 349
           HI+       P+L R+ +           R + P ID I  +TFE         G+NWGL
Sbjct: 243 HIDFRTRLAEPILTRMKED--------HTRIILPAIDNIKYNTFEVQQYANAAHGYNWGL 294

Query: 350 HF-------KWDNLPKGTL---INDEDFMKPLKSPTMAGGLFAIYREYFNAIGKYDPGMN 399
                    +W +  +G     I  ++     K  + A  +  +  + FN  G     + 
Sbjct: 295 WCMYIIPPQEWLDKGRGVTCLEIKKDELQTSSKKRSCARWILDLCFQ-FN--GFTPVSLC 351

Query: 400 VWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRRPYGVGEKQDYMLQNSMRMARVWMD 459
           V       +  ++W CGGS+E++PC+RV H+ R ++PY   +   Y  +N++R A VWMD
Sbjct: 352 V-------LPVQVWQCGGSMEVLPCARVAHIERTKKPYN-NDIDYYAKRNALRAAEVWMD 403

Query: 460 DYVKKV-------IEVNPSAAH---VEIGDISERKALR---ERLQCKTFKWYLDN-MWFE 505
           +Y   V       + V  S A    ++ G   + KA+      +  +  ++  +  +   
Sbjct: 404 EYKSHVYMAWNIPMNVRNSKASGYCLDQGAEDDDKAILYPCHGMSSQLARYSTEGLLQLG 463

Query: 506 TDRSELVLGRTLCL-DASNNVAPILGKCHEMGGTQEWKHKGTASSPIYNTAAGMCLGVDR 564
              S   L  T CL D      P L KC  +    +     T + PI +   G CL V+ 
Sbjct: 464 PLGSTTFLPDTKCLVDDGRGRTPRLKKCEAVSRNSQRLWDFTQNGPIISRDTGRCLEVEM 523

Query: 565 SYRGETVLMVICDDYSNNKWDI 586
           S      L ++    S  KW I
Sbjct: 524 SKDANFGLRLVVQRCSGQKWMI 545


>UniRef50_UPI0000E49DD9 Cluster: PREDICTED: hypothetical protein;
           n=2; Strongylocentrotus purpuratus|Rep: PREDICTED:
           hypothetical protein - Strongylocentrotus purpuratus
          Length = 220

 Score = 71.3 bits (167), Expect = 7e-11
 Identities = 35/96 (36%), Positives = 56/96 (58%)

Query: 104 NSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNE 163
           N  D    D+    + FN +IS RI   R + D R+  C+   Y   LP  ++++ F+NE
Sbjct: 99  NPNDQDKYDQSLKEYGFNMVISDRIALDRAVNDIRHDECKYWHYPKNLPNTTVVVVFHNE 158

Query: 164 HYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNL 199
            + TL+R+VHS+++ +    + EIILVDD+SD  +L
Sbjct: 159 GWSTLLRTVHSVINTSPPYLLHEIILVDDFSDKISL 194


>UniRef50_Q4SIA0 Cluster: Chromosome 5 SCAF14581, whole genome
           shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
           Chromosome 5 SCAF14581, whole genome shotgun sequence -
           Tetraodon nigroviridis (Green puffer)
          Length = 404

 Score = 68.9 bits (161), Expect = 3e-10
 Identities = 35/104 (33%), Positives = 59/104 (56%)

Query: 91  TENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDE 150
           +++ L   +G   + ++ R+  K +  + +N  +S R+   R +PD R   C++  Y   
Sbjct: 35  SDSSLFAHWGQNLSPDNRRVALKMFQYYGYNGYLSDRLPLDRPIPDLRPDGCRNTTYPLS 94

Query: 151 LPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYS 194
           LP+ SI+  F NE    ++RS+HS ++RT    +KEIILVDD S
Sbjct: 95  LPQVSIVFIFVNEALSVILRSIHSAINRTPSHLLKEIILVDDNS 138



 Score = 66.1 bits (154), Expect = 2e-09
 Identities = 36/90 (40%), Positives = 47/90 (52%), Gaps = 2/90 (2%)

Query: 322 TPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTMAGGLF 381
           +P  D I  DTFE    PL   GF+W L  ++ N PK           P++SP + G  F
Sbjct: 158 SPSFDNIKYDTFEIEEYPLSAQGFDWELWCRYLNPPKSWWFKGNK-SAPIQSPALIG-CF 215

Query: 382 AIYREYFNAIGKYDPGMNVWGGENLEISFR 411
            + R YF  IG  D GM V+GGEN+E+  R
Sbjct: 216 VVDRLYFEEIGLLDEGMEVYGGENVELGIR 245



 Score = 58.0 bits (134), Expect = 7e-07
 Identities = 25/48 (52%), Positives = 37/48 (77%)

Query: 458 MDDYVKKVIEVNPSAAHVEIGDISERKALRERLQCKTFKWYLDNMWFE 505
           M+ Y  + +E+    + ++IGD+S+RKALR+RLQCKTF+WYL NM+ E
Sbjct: 232 MEVYGGENVELGIRDSGIDIGDVSDRKALRKRLQCKTFRWYLVNMYPE 279


>UniRef50_UPI0000F1FCC6 Cluster: PREDICTED: hypothetical protein;
           n=1; Danio rerio|Rep: PREDICTED: hypothetical protein -
           Danio rerio
          Length = 213

 Score = 66.9 bits (156), Expect = 1e-09
 Identities = 33/78 (42%), Positives = 46/78 (58%)

Query: 118 HAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMD 177
           + +N  +S +I   R LPD R   C+   +  +LP+ SII  F NE    ++RSVHS ++
Sbjct: 132 YGYNAFLSDKISLDRSLPDYRPSKCKKAFFPRDLPQISIIFIFVNEALSVILRSVHSAVN 191

Query: 178 RTDQKHIKEIILVDDYSD 195
            T    +KEIILVDD SD
Sbjct: 192 HTPAHLLKEIILVDDNSD 209


>UniRef50_A7T195 Cluster: Predicted protein; n=1; Nematostella
           vectensis|Rep: Predicted protein - Nematostella
           vectensis
          Length = 693

 Score = 64.5 bits (150), Expect = 8e-09
 Identities = 31/82 (37%), Positives = 49/82 (59%)

Query: 108 LRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYET 167
           L+  +  Y  + FN  IS +IG  RD+PDTR+  C+ + Y   LP  SIII F+NE   T
Sbjct: 24  LKEGEDAYGKNQFNQAISDKIGGDRDVPDTRHSHCRYEAYPSTLPATSIIITFHNEARST 83

Query: 168 LMRSVHSIMDRTDQKHIKEIIL 189
           L+R+V S++   + + I  +++
Sbjct: 84  LLRTVKSLLSIPNLERICRLLM 105


>UniRef50_UPI000069DFD6 Cluster: Polypeptide
           N-acetylgalactosaminyltransferase 11 (EC 2.4.1.41)
           (Protein-UDP acetylgalactosaminyltransferase 11) (UDP-
           GalNAc:polypeptide N-acetylgalactosaminyltransferase 11)
           (Polypeptide GalNAc transferase 11) (GalNAc-T11)
           (pp-GaNTase 11).; n=1; Xenopus tropicalis|Rep:
           Polypeptide N-acetylgalactosaminyltransferase 11 (EC
           2.4.1.41) (Protein-UDP acetylgalactosaminyltransferase
           11) (UDP- GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 11) (Polypeptide
           GalNAc transferase 11) (GalNAc-T11) (pp-GaNTase 11). -
           Xenopus tropicalis
          Length = 343

 Score = 61.7 bits (143), Expect = 5e-08
 Identities = 26/54 (48%), Positives = 36/54 (66%)

Query: 86  KMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRN 139
           ++  +   DL  + G+I N +D  +RD GY  HAFN LIS R+G HRD+PDTR+
Sbjct: 78  QLETEANADLSPELGMIFNEQDQDVRDVGYQKHAFNLLISNRLGYHRDVPDTRD 131



 Score = 56.4 bits (130), Expect = 2e-06
 Identities = 32/91 (35%), Positives = 45/91 (49%), Gaps = 6/91 (6%)

Query: 500 DNMWFETDRSELVLGRTLCLDASNNVA---PILGKCHEMGGTQEWKHKGTASSPIYNTAA 556
           + +W   +  EL+L   LCLD S   +   P L KCH  GG+Q+W      S+ +Y  + 
Sbjct: 253 EQVWSYNEEHELILSNLLCLDMSETRSSDPPRLMKCHGSGGSQQWVF--GKSNRLYQVSV 310

Query: 557 GMCLG-VDRSYRGETVLMVICDDYSNNKWDI 586
           G CL  VD   R   V M ICD   + +W +
Sbjct: 311 GQCLKLVDPMSRKGYVSMAICDGSPSQQWHL 341


>UniRef50_Q3A8Y3 Cluster: Glycosyl transferase, group 2 family; n=1;
           Carboxydothermus hydrogenoformans Z-2901|Rep: Glycosyl
           transferase, group 2 family - Carboxydothermus
           hydrogenoformans (strain Z-2901 / DSM 6008)
          Length = 288

 Score = 60.1 bits (139), Expect = 2e-07
 Identities = 33/101 (32%), Positives = 57/101 (56%), Gaps = 3/101 (2%)

Query: 369 KPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVG 428
           K  + P + GGL  I  + F  +G ++  M  WG E+ E+S R+W+ G  L ++P   V 
Sbjct: 145 KVAEIPVVPGGLMVIKSKVFFEVGGFEGLMERWGWEDAELSLRLWLMGYRLLVVPEVVVY 204

Query: 429 HVFRKRRPYGVGEKQDYMLQNSMRMARVWM-DDYVKKVIEV 468
           H+FR+R+PY    K    L+N   +A   + ++ VKK++++
Sbjct: 205 HLFRERQPYPTSRKA--ALKNLFILALNHLSEERVKKILKL 243



 Score = 34.3 bits (75), Expect = 9.2
 Identities = 21/52 (40%), Positives = 31/52 (59%), Gaps = 2/52 (3%)

Query: 251 KNNVFN-IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPL 301
           K++ +N I+L+KT +  GL RA+  GA  + G  LVF D+H+     WL  L
Sbjct: 50  KDSRYNQIKLIKT-EGIGLARAKNLGAKYASGKYLVFSDAHMSYQTFWLDHL 100


>UniRef50_Q3A8Y2 Cluster: Glycosyl transferase, group 2 family; n=1;
           Carboxydothermus hydrogenoformans Z-2901|Rep: Glycosyl
           transferase, group 2 family - Carboxydothermus
           hydrogenoformans (strain Z-2901 / DSM 6008)
          Length = 288

 Score = 57.6 bits (133), Expect = 9e-07
 Identities = 24/64 (37%), Positives = 38/64 (59%)

Query: 374 PTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRK 433
           P + GG   I +  F A+G YD G+ +WG ++ E S R W+ G +L + P ++V H+FR 
Sbjct: 149 PVLPGGFMLIKKNDFIALGGYDEGLKIWGYDDCEFSLRAWLMGFNLLVTPRTKVFHLFRS 208

Query: 434 RRPY 437
            + Y
Sbjct: 209 GQIY 212



 Score = 43.6 bits (98), Expect = 0.015
 Identities = 27/67 (40%), Positives = 39/67 (58%), Gaps = 4/67 (5%)

Query: 244 STENSEV---KNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPP 300
           ST+ S V     N  NI L+K   R G+ RA+  GA+ + G+VL+F D+HI V   WL  
Sbjct: 41  STDQSTVFLNSGNFKNIDLIKLP-RSGVTRAKNAGANKARGEVLIFSDAHILVEDFWLEK 99

Query: 301 LLKRLSQ 307
           +L+ L +
Sbjct: 100 MLEDLQE 106


>UniRef50_Q4TDW9 Cluster: Chromosome undetermined SCAF5986, whole
           genome shotgun sequence; n=1; Tetraodon
           nigroviridis|Rep: Chromosome undetermined SCAF5986,
           whole genome shotgun sequence - Tetraodon nigroviridis
           (Green puffer)
          Length = 123

 Score = 49.6 bits (113), Expect = 2e-04
 Identities = 25/61 (40%), Positives = 33/61 (54%), Gaps = 1/61 (1%)

Query: 425 SRVGHVFRKRRPYGV-GEKQDYMLQNSMRMARVWMDDYVKKVIEVNPSAAHVEIGDISER 483
           SRVGHVFRK+ PY   G       +N+ R A VWMD+Y        PSA +V  G + + 
Sbjct: 1   SRVGHVFRKQHPYTFPGGSGTVFARNTRRAAEVWMDEYKNFYYAAVPSARNVPYGKVPDH 60

Query: 484 K 484
           +
Sbjct: 61  Q 61


>UniRef50_Q68VJ7 Cluster: Polypeptide
           N-acetylgalactosaminyltransferase 1; n=3;
           Euteleostomi|Rep: Polypeptide
           N-acetylgalactosaminyltransferase 1 - Homo sapiens
           (Human)
          Length = 170

 Score = 48.8 bits (111), Expect = 4e-04
 Identities = 41/120 (34%), Positives = 55/120 (45%), Gaps = 18/120 (15%)

Query: 478 GDISERKALRERLQCKTFKWYLDNMWFETD--RSELVLGR------TLCLD---ASNNVA 526
           GDIS R  LR +LQCK F WYL+N++ ++   R    LG         CLD      N  
Sbjct: 5   GDISSRVGLRHKLQCKPFSWYLENIYPDSQIPRHYFSLGEIRNVETNQCLDNMARKENEK 64

Query: 527 PILGKCHEMGGTQEWKHKGTASSPIYNTAAGMCLGVDRSYRGETVLMVICDDYSNNK-WD 585
             +  CH MGG Q + +  TA+  I      +CL  D S     V M+ C     N+ W+
Sbjct: 65  VGIFNCHGMGGNQVFSY--TANKEI--RTDDLCL--DVSKLNGPVTMLKCHHLKGNQLWE 118


>UniRef50_Q1ARC2 Cluster: Glycosyl transferase, family 2; n=1;
           Rubrobacter xylanophilus DSM 9941|Rep: Glycosyl
           transferase, family 2 - Rubrobacter xylanophilus (strain
           DSM 9941 / NBRC 16129)
          Length = 753

 Score = 46.8 bits (106), Expect = 0.002
 Identities = 52/189 (27%), Positives = 80/189 (42%), Gaps = 24/189 (12%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           +R+L+  +R G   AR  G   + G+V+ F D   EV  GWL  L   L     GV++  
Sbjct: 397 VRVLRLERRAGQSAARNLGLRAARGEVVAFTDDDCEVLPGWLRALAAPLC--TPGVELA- 453

Query: 317 SARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTM 376
             R ++P         FE + SPL  G             P+G  +     +  L S  +
Sbjct: 454 GGRVLSPP-PAGRLGAFEAARSPLDMG-------------PEGGEVGPRGAVPYLPSCNL 499

Query: 377 AGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRRP 436
           AG   A+ R     +G +D GM +  GE+ ++ +R    G  +   P +RV H  R R  
Sbjct: 500 AGDRRALLR-----LGGFDEGMEL--GEDADLVWRAVRAGLGVRYEPSARVVHRHRTRLA 552

Query: 437 YGVGEKQDY 445
             +  + DY
Sbjct: 553 ALLARRADY 561


>UniRef50_Q4RAK4 Cluster: Chromosome undetermined SCAF23488, whole
           genome shotgun sequence; n=2; Tetraodontidae|Rep:
           Chromosome undetermined SCAF23488, whole genome shotgun
           sequence - Tetraodon nigroviridis (Green puffer)
          Length = 174

 Score = 46.4 bits (105), Expect = 0.002
 Identities = 35/130 (26%), Positives = 59/130 (45%), Gaps = 21/130 (16%)

Query: 470 PSAAHVEIGDISERKALRERLQCKTFKWYLDNMWFE-----------TDRSEL-VLGRTL 517
           P +  +  GDISE K  RE  +CK+FKW+++ + ++            D  E+  L  + 
Sbjct: 2   PESLTLAYGDISELKRFREEHRCKSFKWFMEEIAYDIPLHYPMPPKNVDWGEIRGLDTSY 61

Query: 518 CLDA---SNNVAPILGKCHEMGGTQEWKHKGTASSPIYNTAAGMCLGVDRSYRGETVLMV 574
           C+D+   +N     +G CH MGG Q ++         Y+    +  G D S     V++ 
Sbjct: 62  CIDSMGHTNGGNVEIGPCHRMGGNQLFRINEANQLMQYDQC--LTRGTDNS----GVIIT 115

Query: 575 ICDDYSNNKW 584
            CD   + +W
Sbjct: 116 HCDQNQHTEW 125


>UniRef50_Q5CY12 Cluster: UDP-N-acetylgalactosamine: polypeptide N-
           acetylgalactosaminyltransferase, signal peptide; n=2;
           Cryptosporidium|Rep: UDP-N-acetylgalactosamine:
           polypeptide N- acetylgalactosaminyltransferase, signal
           peptide - Cryptosporidium parvum Iowa II
          Length = 414

 Score = 46.4 bits (105), Expect = 0.002
 Identities = 54/217 (24%), Positives = 98/217 (45%), Gaps = 31/217 (14%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           I++++T  +E L   +  GA+NS G++++F+ S       W+ P+++ LS     +    
Sbjct: 119 IKIIETELQE-LGELQNLGANNSTGEIILFVPSATLFPKNWMSPIMRSLSDNYKSI---- 173

Query: 317 SARAVTPVIDVINADTFEYSPS-PLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKSPT 375
               + P    +N D + +S + P+      +   F+  N+   TL N        K P 
Sbjct: 174 ----IVPRFKKLNKDKWTFSNNDPVYSPKMMFTKEFELTNI--HTLDN--------KVPM 219

Query: 376 MAGGLFAIYREYFNAIGKY-DPGMNV--WGGENLEISFRIWMCGGSLELIPCSRVGHVFR 432
               +FAI + ++  I K  DP +N+      N +IS R W CGG +  I     G V +
Sbjct: 220 FYSKIFAITKSWWLNISKLSDPTINLIFKTSINFDISLRSWNCGGRVAQIAELSFG-VTK 278

Query: 433 KRRPYGVGEKQDYMLQNSMRMARVWMDDYVKKVIEVN 469
            + P    E +  +L++       W+D+  K++I  N
Sbjct: 279 VKIPQPSLEIRQVLLES-------WIDEPTKQMIMNN 308


>UniRef50_UPI000069E575 Cluster: Polypeptide
           N-acetylgalactosaminyltransferase 5 (EC 2.4.1.41)
           (Protein-UDP acetylgalactosaminyltransferase 5) (UDP-
           GalNAc:polypeptide N-acetylgalactosaminyltransferase 5)
           (Polypeptide GalNAc transferase 5) (GalNAc-T5)
           (pp-GaNTase 5).; n=1; Xenopus tropicalis|Rep:
           Polypeptide N-acetylgalactosaminyltransferase 5 (EC
           2.4.1.41) (Protein-UDP acetylgalactosaminyltransferase
           5) (UDP- GalNAc:polypeptide
           N-acetylgalactosaminyltransferase 5) (Polypeptide GalNAc
           transferase 5) (GalNAc-T5) (pp-GaNTase 5). - Xenopus
           tropicalis
          Length = 124

 Score = 46.0 bits (104), Expect = 0.003
 Identities = 17/27 (62%), Positives = 23/27 (85%)

Query: 477 IGDISERKALRERLQCKTFKWYLDNMW 503
           IGD++E+K LRERLQCK F WY+ N++
Sbjct: 2   IGDLTEQKQLRERLQCKNFNWYIKNVF 28


>UniRef50_A6WEA2 Cluster: Glycosyl transferase family 2; n=1;
           Kineococcus radiotolerans SRS30216|Rep: Glycosyl
           transferase family 2 - Kineococcus radiotolerans
           SRS30216
          Length = 289

 Score = 46.0 bits (104), Expect = 0.003
 Identities = 51/197 (25%), Positives = 77/197 (39%), Gaps = 34/197 (17%)

Query: 255 FNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKV 314
           F +R +  S R G+  AR  G   ++ DV++  D+   V VGW+  + + L Q       
Sbjct: 60  FMLRRVDASARRGVAHARNAGCRAALADVILVCDADDVVGVGWVDAMARALEQA------ 113

Query: 315 RYSARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKW-DNLPKGTLINDEDFMKPLKS 373
                      D++           LV G  N  L  +W    P G L      +     
Sbjct: 114 -----------DLVGGT--------LVHGHLNTALVQQWRPTSPPGVLPTKLSHL----- 149

Query: 374 PTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRK 433
           P   G    + RE F+A+G +D G  V GG+++E S+R    G  L   P + +   +R 
Sbjct: 150 PYAVGANVGLRREVFDALGGWDEGF-VAGGDDVEFSWRAQHAGFCLRSAPDAVI--AYRM 206

Query: 434 RRPYGVGEKQDYMLQNS 450
           R       KQ Y    S
Sbjct: 207 RTTLSANVKQSYFYARS 223


>UniRef50_Q1Q6Z1 Cluster: Putative uncharacterized protein; n=1;
           Candidatus Kuenenia stuttgartiensis|Rep: Putative
           uncharacterized protein - Candidatus Kuenenia
           stuttgartiensis
          Length = 291

 Score = 43.2 bits (97), Expect = 0.020
 Identities = 27/69 (39%), Positives = 38/69 (55%), Gaps = 5/69 (7%)

Query: 244 STENS-----EVKNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWL 298
           ST+NS       +N + N+R+   S R G   AR  GA  +VG+ LVF D+  EV  GWL
Sbjct: 41  STDNSMEIVMRYQNRLPNLRIADASDRRGQAHARNIGARLAVGESLVFCDADDEVAPGWL 100

Query: 299 PPLLKRLSQ 307
             + + LS+
Sbjct: 101 AAMGEALSR 109


>UniRef50_UPI0000499CE1 Cluster: SMC3 protein; n=1; Entamoeba
           histolytica HM-1:IMSS|Rep: SMC3 protein - Entamoeba
           histolytica HM-1:IMSS
          Length = 1188

 Score = 42.7 bits (96), Expect = 0.026
 Identities = 50/257 (19%), Positives = 110/257 (42%), Gaps = 15/257 (5%)

Query: 21  DIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFA 80
           + K+RIK  +K IA  E       + + +EE   + M + ++   +         KE+  
Sbjct: 643 ETKERIKEVEKDIARSEA------EKKRIEEEQKEIMKEMEEISSKIAEEEVKYEKERME 696

Query: 81  KQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNK 140
           +   IK S++    ++ +   I  +E +  R+    L   N   ++ I D  ++   + K
Sbjct: 697 RMIKIKKSERIRESIKNKEKRIEENEKIIFRNTQKLLILQNEKDNKNIIDRSEIEKAKKK 756

Query: 141 LCQSQQYFDELPRASIIICFYNEHYETLMRSVH--SIMDRTD--QKHIKEIILVDDYSDL 196
           L + Q    EL +  + I    E+   ++R+ +   I+ R +  ++ ++E+    D SD+
Sbjct: 757 LEEIQNKVRELEKKRVEI----ENRRQVLRNEYQFGIISRINEIERKMREVESGGDESDI 812

Query: 197 YNLHHDVQEAVDKLNNVIKKEEEMI-ETNNIDMEXXXXXXXXXXXXKKSTENSEVKNNVF 255
                 + +++++L  + ++ E+ I E   ++ +            K   E  +    +F
Sbjct: 813 EKYKEVLSKSMEELQRINEEIEKKIQEERTLEEQQEGIEKEKEKKEKIQNEREKKMARLF 872

Query: 256 NIRLLKTSKREGLIRAR 272
               +  SKR+ LI+ R
Sbjct: 873 EKMTVLESKRKELIKRR 889


>UniRef50_A2DKE3 Cluster: Viral A-type inclusion protein, putative;
            n=1; Trichomonas vaginalis G3|Rep: Viral A-type inclusion
            protein, putative - Trichomonas vaginalis G3
          Length = 2870

 Score = 42.7 bits (96), Expect = 0.026
 Identities = 45/203 (22%), Positives = 89/203 (43%), Gaps = 10/203 (4%)

Query: 29   NDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQQAIKMS 88
            N+K++   +    LS   ++ E++    + K  D   +    +  +   K    Q+    
Sbjct: 2356 NEKNVLLQQEISKLSSDLQEKEKSEKSLLQKQNDLISEISKLKNDIKDHKINLSQSTSSL 2415

Query: 89   KKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGD-HRDLPDTRNKLCQSQQY 147
            KK   D+  +   I  S     +D+  NL   N  + ++I +    L DT + L QS Q 
Sbjct: 2416 KK---DISTKAKQIEQS-----KDELNNLQTENNSLKKKIQNLEAVLQDTEDSLAQSNQS 2467

Query: 148  FDELPRASIIICFYNEHYETLMRSVHSIMDR-TDQKHIKEIILVDDYSDLYNLHHDVQEA 206
              ++  +  ++    E  + L+ S    ++R T++   KE  L    S+L N+   ++  
Sbjct: 2468 QRQIKASYDLLNNKFEENQVLLNSKQKEIERLTNEVSDKEKELEKTKSELINIQERIRSD 2527

Query: 207  VDKLNNVIKKEEEMIETNNIDME 229
              KLN  I +++  +E+ NI++E
Sbjct: 2528 SSKLNQDINEKQTKLESLNIELE 2550


>UniRef50_Q23ZG7 Cluster: Peptidase family M1 containing protein;
           n=1; Tetrahymena thermophila SB210|Rep: Peptidase family
           M1 containing protein - Tetrahymena thermophila SB210
          Length = 1721

 Score = 42.3 bits (95), Expect = 0.035
 Identities = 51/201 (25%), Positives = 86/201 (42%), Gaps = 18/201 (8%)

Query: 59  KYQDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLH 118
           KY D + Q E +++   KEK       +  K+    L +Q           I +K  NL 
Sbjct: 659 KYSDERAQFEQKQEEFSKEKEDLLNQFEKFKEENTILAKQIS--------EIEEKIENLI 710

Query: 119 AFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDR 178
           + N  +S +  D R L +T  K  + + Y DEL +    I    +  E LM+  +   + 
Sbjct: 711 SENQELSIQNQDQRYLIETLEK--KQESYTDELKK-QFDISLRQKIEENLMQLTNRYNEE 767

Query: 179 TDQKHIKEII-----LVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEMIETNNIDMEXXXX 233
            ++ H  EII     +V++      L   +Q    +LN +I++ +E+ E  NI+ E    
Sbjct: 768 INKLH-NEIIKRNESVVEEKEKNRQLEEQIQRLKFELNKLIQETKELKEFANIE-EEQNY 825

Query: 234 XXXXXXXXKKSTENSEVKNNV 254
                   +   EN+EVKN +
Sbjct: 826 EQQKQYQQQLEFENNEVKNQI 846


>UniRef50_A2EE02 Cluster: Putative uncharacterized protein; n=1;
           Trichomonas vaginalis G3|Rep: Putative uncharacterized
           protein - Trichomonas vaginalis G3
          Length = 766

 Score = 41.9 bits (94), Expect = 0.046
 Identities = 46/188 (24%), Positives = 84/188 (44%), Gaps = 10/188 (5%)

Query: 107 DLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRA--SIIICFYNEH 164
           D++  D   NL AF   I++ I D   + D  +KL  +    D++  +  S ++    + 
Sbjct: 340 DMKKLDVDKNLLAF---IAKNISDDNSMGDV-SKLVANLTNKDDVACSLVSSLLLANMKQ 395

Query: 165 YETLMRSVHSIMDRTD-QKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEMIET 223
            E L R V  +  R +  + I+EI +  + +    L   +QE +D L   +K+ E++I+T
Sbjct: 396 QEQLQRMVDQVEQREEILQSIREIGVPPEKAAKIVLK--MQEEIDNLKKTVKENEQIIQT 453

Query: 224 NNIDMEXXXXXXXXXXXXK-KSTENSEVKNNVFNIRLLKTSKREGLIRARLYGADNSVGD 282
           N ++ E            K K TEN     NV +    K ++ +  +  R+  +DN   +
Sbjct: 454 NKVEFENLEEQLEKLGEEKTKLTENLFNTENVLSDFRTKNNELQKELENRITESDNMKKE 513

Query: 283 VLVFLDSH 290
             V  D +
Sbjct: 514 YTVVKDQN 521


>UniRef50_A1BDD3 Cluster: Glycosyl transferase, family 2; n=1;
           Chlorobium phaeobacteroides DSM 266|Rep: Glycosyl
           transferase, family 2 - Chlorobium phaeobacteroides
           (strain DSM 266)
          Length = 994

 Score = 41.5 bits (93), Expect = 0.061
 Identities = 47/191 (24%), Positives = 82/191 (42%), Gaps = 24/191 (12%)

Query: 244 STENS-EVKNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWL---- 298
           S +NS E  N +  I+L++    +G I +   GA  + G+ L+FL++  EV   WL    
Sbjct: 406 SPDNSYETLNKISQIKLIRNDCNKGFIHSCNSGASLATGEYLIFLNNDTEVLNAWLDSLI 465

Query: 299 PPLLKRLSQGVDGVKVRYSARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPK 358
            P +   + G+ G ++ Y    +     VI +D            G N+G      N P+
Sbjct: 466 APFIIHDNVGLVGSQIIYPDGRLQEAGGVILSD----------GSGLNYG-RLSDPNKPE 514

Query: 359 GTLINDEDFMKPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGS 418
              + + D+         +G   AI +  F++IG +D        E+ +I+F +   G  
Sbjct: 515 YNFLREVDY--------CSGCSIAIKKSLFDSIGGFDTLFIPAYYEDTDIAFTVRKMGYK 566

Query: 419 LELIPCSRVGH 429
           +   P S+V H
Sbjct: 567 VLYQPASKVIH 577


>UniRef50_Q23G97 Cluster: Putative uncharacterized protein; n=1;
           Tetrahymena thermophila SB210|Rep: Putative
           uncharacterized protein - Tetrahymena thermophila SB210
          Length = 758

 Score = 41.5 bits (93), Expect = 0.061
 Identities = 49/251 (19%), Positives = 109/251 (43%), Gaps = 16/251 (6%)

Query: 16  RPLNWDIKDRIKNNDKSIAALEG-----KDLLSDQPRDVEETLS--KTMWKYQDYKRQSE 68
           R L  D  DR     + IA L+      K+ L  + R++++ ++  K + +  D+K+ + 
Sbjct: 50  RQLELDFNDRTDVQIRQIANLKTEVDTLKETLKYKQRELDDAVAELKAIKEVSDHKQINT 109

Query: 69  YRRKVMLKEKFAKQQAIKMSKKT-ENDL----EEQFGLIRNSED--LRIRDKGYNLHAFN 121
            R KV LK+ F   Q +K   ++ E  L    EE+  L+   +   L I +         
Sbjct: 110 ERLKVELKQSFENNQRLKEDNQSMETQLRWQKEEKKNLMERMDQLTLNIEEVMEKNEDLE 169

Query: 122 TLISQRIGDHRDLPDTRNKLCQSQQYF-DELPRASIIICFYNEHYETLMRSVHSIMDRTD 180
            ++ +   D R+L +   +L    +   +EL R + II          ++ + ++     
Sbjct: 170 RVLEESEKDKRNLENLHAQLQSEYELLNEELRRKNQIISELESQNSHQLKQILTLESDKG 229

Query: 181 QKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEMIETNNIDMEX-XXXXXXXXX 239
            K  + + L  +  +L +++ + +E +  L+  +K ++  ++ N  +++           
Sbjct: 230 NKSSEVMKLKQELEELRHINKNQKEDIFNLDQCVKDKQSQLDQNYKELKSLQNNYTLALE 289

Query: 240 XXKKSTENSEV 250
             KK+ +++EV
Sbjct: 290 ESKKAQDHAEV 300


>UniRef50_A2F531 Cluster: Viral A-type inclusion protein, putative;
           n=1; Trichomonas vaginalis G3|Rep: Viral A-type
           inclusion protein, putative - Trichomonas vaginalis G3
          Length = 3748

 Score = 40.7 bits (91), Expect = 0.11
 Identities = 42/201 (20%), Positives = 90/201 (44%), Gaps = 11/201 (5%)

Query: 24  DRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQS---EYRRKVMLKEKFA 80
           D I+N +  +   E +DL S +  D +   +  + K  D K+Q    E + K + K+   
Sbjct: 662 DEIENENDQLFE-EVEDLKS-KVDDAKILYNDMVDKIDDLKQQRSKVEQKYKDLEKQNKE 719

Query: 81  KQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNK 140
           K   I+   K  ++L+E+   +   +D    +    + A N  I ++  ++  + +  NK
Sbjct: 720 KSDEIEKVSKEISELKEKLDNLNQFKD-NTPELHQKVDAMNEQIVKKSQENEKIQEEMNK 778

Query: 141 LCQSQQYFD-ELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNL 199
           L +  Q+ + E+    ++    N+  ET+   + +I  + ++K      + D  + L   
Sbjct: 779 LNEELQHLENEMEEIEVV----NDERETIQEKIDNIKQQIEEKKKSNEEIQDIMNLLIEA 834

Query: 200 HHDVQEAVDKLNNVIKKEEEM 220
            +D Q+ +D +  V  + EE+
Sbjct: 835 ENDAQKELDDIEIVEAQSEEI 855


>UniRef50_A0D876 Cluster: Chromosome undetermined scaffold_40, whole
           genome shotgun sequence; n=2; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_40,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 537

 Score = 40.7 bits (91), Expect = 0.11
 Identities = 56/236 (23%), Positives = 107/236 (45%), Gaps = 16/236 (6%)

Query: 4   NQIVNKATKVHYRPLNWDIKDRIKNNDKSIAALEGKDL---LSDQPRDVE--ETLSKTMW 58
           ++++ +  K+  +  N +I ++I+ N    +  E K     LS+   D E  E L  T+ 
Sbjct: 221 DKLIQENNKLQTQLTNAEI-EKIQMNQSQQSLTECKQCVQDLSNSNLDKESYELLQNTIK 279

Query: 59  KYQDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEE-QFGLIRNSEDLRIRDKGYNL 117
           +Y+   +  E  RK+ L  K    +  +  K+ E  LE+ Q GL +  E   I  K  NL
Sbjct: 280 EYEQSVQDMETERKMFLSAKSEISEKEQEKKQMEMKLEQKQHGLNKLKERNEICAK--NL 337

Query: 118 HAFNTLISQRIGDHRDLPDTRNKLCQSQQ-YFDELPRASIIICFYNEHYETLMRSVHSIM 176
            A    I +     ++  +  +K  ++QQ Y  +L +         +  ++ M  ++ I 
Sbjct: 338 KACQEQILKLQQQLKEKSELEDKAKEAQQKYQQQLIKLKDNFDKQQKELQSYMAQLNEIK 397

Query: 177 DRTDQKHIKEIILVDDY-SDLYNLHHDVQEAVDKLNNVIKKEEEMI--ETNNIDME 229
           D+ +++  K I+L +++ S+   L  + +   + L    KK EE I  +T  I+ E
Sbjct: 398 DKFEEEQKKNILLKEEFESEKKRLQEENKRNQEILQ---KKHEEAILQQTQRIEKE 450


>UniRef50_A6UUX2 Cluster: SMC domain protein; n=1; Methanococcus
           aeolicus Nankai-3|Rep: SMC domain protein -
           Methanococcus aeolicus Nankai-3
          Length = 994

 Score = 40.7 bits (91), Expect = 0.11
 Identities = 51/229 (22%), Positives = 101/229 (44%), Gaps = 20/229 (8%)

Query: 4   NQIVNKATKVHYRPLNWDIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKT---MWKY 60
           N  +N+ TK     L    +D+      +I  L+ ++LL    + +EE   K+     KY
Sbjct: 477 NAEINQ-TKDAIEKLKGTTEDKCPVCQSNIDGLKKQELLKQYNQLIEERKQKSNKLQIKY 535

Query: 61  QDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAF 120
             Y  + +  +  + K    K +  ++ +K  N +EE+  LI+ + DL +   G +L  +
Sbjct: 536 NKYLSEKKDIKDKLDKINNLKNKYGQLKEKNNNLIEEENKLIKLNNDLNV--IGKDLEEY 593

Query: 121 NTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTD 180
           N +I        ++    NKL + +QY+ +       +   NE  + L+ + + ++D   
Sbjct: 594 NKII-------ENIKTKENKLKELEQYYKKYEYCENFLKDSNE--QELVDNKNKLLDIIG 644

Query: 181 QKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEMIETNNIDME 229
               K I+     +    L++ V E  + LN +  KE+ M + N ++ E
Sbjct: 645 NNTNKSIL-----NTKKELNNKVGELNELLNLIRNKEQNMKKLNVVNKE 688



 Score = 35.1 bits (77), Expect = 5.3
 Identities = 45/206 (21%), Positives = 91/206 (44%), Gaps = 9/206 (4%)

Query: 23  KDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQ---SEYRRKVMLKEKF 79
           K+  KN +  +  LE      +    + E LS    K  D K+Q   +EY+ K  LK+  
Sbjct: 382 KELNKNKENYLKYLELSKKSKELNNKLME-LSGIKEKENDLKQQIKSTEYKIK-QLKQDL 439

Query: 80  AKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRN 139
                I +    E +++ ++  I N  DL +++K    +A        I   +   + + 
Sbjct: 440 KDFNNIDIEINKEKEIKTKYEDIVNKIDL-LKEKIAQNNAEINQTKDAIEKLKGTTEDKC 498

Query: 140 KLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNL 199
            +CQS    D L +  ++   YN+  E   +  + +  + ++   ++  + D    + NL
Sbjct: 499 PVCQSN--IDGLKKQELLKQ-YNQLIEERKQKSNKLQIKYNKYLSEKKDIKDKLDKINNL 555

Query: 200 HHDVQEAVDKLNNVIKKEEEMIETNN 225
            +   +  +K NN+I++E ++I+ NN
Sbjct: 556 KNKYGQLKEKNNNLIEEENKLIKLNN 581


>UniRef50_A0E3J8 Cluster: Chromosome undetermined scaffold_76, whole
           genome shotgun sequence; n=2; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_76,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 827

 Score = 40.3 bits (90), Expect = 0.14
 Identities = 37/145 (25%), Positives = 73/145 (50%), Gaps = 14/145 (9%)

Query: 19  NWDIKDRIKN-NDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKE 77
           N D+++++ +  D+   AL  KD L    +D E+ L++   + QD   + E  +   L+ 
Sbjct: 414 NDDLRNQLNDLQDQLTEALLDKDYLQKSLKDQEDELNRVNDQIQDLNNEKEQAQAAALEA 473

Query: 78  K-----FAKQQAIKMSKKTE-----NDLEEQFGLIRNS-EDLRIRDKGYNLHAFNTLISQ 126
           K      A ++A + + K +     NDLE++   + +  EDL  + +   L+    LI  
Sbjct: 474 KQQLQDIADEKAQEDADKEKDQDRLNDLEDKVAELEDQIEDLE-KTRNRLLNQIQELID- 531

Query: 127 RIGDHRDLPDTRNKLCQSQQYFDEL 151
           ++ D R+L +  +KLC  Q++ ++L
Sbjct: 532 KLHDERELCEYYHKLCSDQEHQNKL 556



 Score = 35.9 bits (79), Expect = 3.0
 Identities = 17/53 (32%), Positives = 31/53 (58%)

Query: 177 DRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEMIETNNIDME 229
           DR DQK   E +L D    +  L   +++A DK NN  +++E++I+  + D++
Sbjct: 132 DRDDQKAFYEGLLADKDDLIAELRRQLKDADDKFNNYRREKEQIIKEKDYDIK 184


>UniRef50_P62134 Cluster: DNA double-strand break repair rad50
           ATPase; n=3; Methanococcus maripaludis|Rep: DNA
           double-strand break repair rad50 ATPase - Methanococcus
           maripaludis
          Length = 993

 Score = 40.3 bits (90), Expect = 0.14
 Identities = 47/204 (23%), Positives = 84/204 (41%), Gaps = 17/204 (8%)

Query: 60  YQDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHA 119
           Y+ Y +        +LKE    ++++K +KK  ++L+E   L  N E + I DK      
Sbjct: 309 YESYNKLKTIEES-LLKELGVLKESLKDNKKNPDELKEN--LKENDEKILILDKIKEKIK 365

Query: 120 FNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSI---- 175
               I ++I + +    T   L  S + +D+  +    +      YE L++    +    
Sbjct: 366 ELEFIEKQIYEIKIHKKTVETLFDSVKIYDDSIKTFEELKTKKNSYENLLKEKFDLEKKL 425

Query: 176 MDRTDQKH--IKEIILVDDYSDLYNL-------HHDVQEAVDKLNNVI-KKEEEMIETNN 225
            + TD+K   I E+   +   +  NL       + D+ E +DKLN ++ KKE ++ E  N
Sbjct: 426 QNETDEKTKLISELTDFEKIEEKINLENELKEKYEDLSEKIDKLNEIVLKKESKISEYKN 485

Query: 226 IDMEXXXXXXXXXXXXKKSTENSE 249
              E             K TE  +
Sbjct: 486 SKAELEKTKDSCHVCQSKITEEKK 509


>UniRef50_Q1WVM5 Cluster: N-acetylglucosaminyltransferase; n=1;
           Lactobacillus salivarius subsp. salivarius UCC118|Rep:
           N-acetylglucosaminyltransferase - Lactobacillus
           salivarius subsp. salivarius (strain UCC118)
          Length = 338

 Score = 39.9 bits (89), Expect = 0.19
 Identities = 27/115 (23%), Positives = 52/115 (45%), Gaps = 8/115 (6%)

Query: 196 LYNLHHDVQEAVDKLNNVIKKEEEMIETNNIDMEXXXXXXXXXXXXKKSTENSEVKNNVF 255
           +YN+ + ++  +  ++   K +E ++E   ID               K  E  + KN   
Sbjct: 12  MYNVENSIERLIISISKAFKNQESLVEVLAID-------DGSTDKTVKIFEKLQKKNQAL 64

Query: 256 NIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVD 310
            ++L+K S   G+ +AR  G   S G  + F+DS  E+ +  +  L  +L+Q  +
Sbjct: 65  ALKLIKNS-HGGVSKARNTGIKYSTGQYVTFVDSDDELTLVDVVDLKGKLAQNAE 118


>UniRef50_Q55EZ8 Cluster: Putative uncharacterized protein; n=1;
           Dictyostelium discoideum AX4|Rep: Putative
           uncharacterized protein - Dictyostelium discoideum AX4
          Length = 568

 Score = 39.9 bits (89), Expect = 0.19
 Identities = 45/197 (22%), Positives = 88/197 (44%), Gaps = 15/197 (7%)

Query: 39  KDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQ 98
           +DL  +   D            +DYK   + + + + KEK    +  K +   E + EE+
Sbjct: 127 QDLFKEMIPDPSSNFEDLFGSDEDYKESLKQQGEEIEKEK----EKEKDTNNEEEEEEEE 182

Query: 99  FGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIII 158
              I NS    +++ G+   ++   +S    D ++  D   K+ Q  +   EL R   ++
Sbjct: 183 EERIDNSNI--VKEGGFGSMSYAIDVSIDPEDTKENLD-EIKVKQLSELVTELERYQNLV 239

Query: 159 CFYNEHYETLMR-SVHSIMDRTDQKHIKEII-LVDDYSDLYNLHHDVQ----EAVDKLNN 212
              N+ Y TL++ ++++  D     H++ +I L  + +D+     D+     + +D  NN
Sbjct: 240 V--NDWYSTLIKININTQSDTEYSSHLRHVITLQQNINDIKAKSKDIDVLPFKNIDNQNN 297

Query: 213 VIKKEEEMIETNNIDME 229
             + EEE+ E   I+ME
Sbjct: 298 NQENEEEIDEFEGIEME 314


>UniRef50_Q54SR9 Cluster: Leucine-rich repeat-containing protein;
           n=1; Dictyostelium discoideum AX4|Rep: Leucine-rich
           repeat-containing protein - Dictyostelium discoideum AX4
          Length = 874

 Score = 39.9 bits (89), Expect = 0.19
 Identities = 40/148 (27%), Positives = 69/148 (46%), Gaps = 14/148 (9%)

Query: 85  IKMSKKTENDLEEQFGLIRNS---EDLRIRDKGYN----LHAFNTLISQRIGDHRDLPDT 137
           I  ++    ++EE    I+NS   E L I +   N    L+  NTL       H DL  +
Sbjct: 426 ISNNRLERTEIEELIAFIKNSRALESLNISNCSLNHDYFLYICNTLNKNEYIKHLDLNVS 485

Query: 138 RNKLCQSQQYF-DELPRASIIICFYNEH----YETLMRSVHSIMDRTDQKHIKEIILVDD 192
            N++ +S   F   L   +++      H    Y+TL+R V SI  R   K+I+E+IL   
Sbjct: 486 HNQISKSPSIFVSALEFLTMVNSLNLSHIPITYKTLIRVVDSI--RIYCKNIRELILDAC 543

Query: 193 YSDLYNLHHDVQEAVDKLNNVIKKEEEM 220
           +S   +  HD  + +++L   +++ E +
Sbjct: 544 FSHSNDKSHDGIDFINELKQFLRERESL 571


>UniRef50_A2DD37 Cluster: Viral A-type inclusion protein, putative;
            n=1; Trichomonas vaginalis G3|Rep: Viral A-type inclusion
            protein, putative - Trichomonas vaginalis G3
          Length = 1553

 Score = 39.5 bits (88), Expect = 0.25
 Identities = 52/266 (19%), Positives = 109/266 (40%), Gaps = 10/266 (3%)

Query: 27   KNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEK----FAKQ 82
            K ND+   A + K  L +Q + +++ L+K     ++ + + +     + KEK      ++
Sbjct: 800  KFNDERQEAAKTKSDLQNQIQQLKDALAKAESNQKETQNKLDISNSDLEKEKDKSKSLEE 859

Query: 83   QAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLC 142
            +   +  K +   EE+  L  + E+ R  +   N    + L S+   ++RDL +  N+L 
Sbjct: 860  ELAALKSKLQQVQEEKANLESDLENERQNNSSSNAELSDKL-SKLQQENRDLVNQINQLQ 918

Query: 143  QS-QQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHH 201
               +Q   E+ + S  +   N   + L   ++ +  + D+   K   LVDD      L  
Sbjct: 919  NDLKQKESEIQKVSSDLDNLNNVIQDLESQMNDMQGKNDELSKKLSNLVDDNERKDKLID 978

Query: 202  DVQEAVDKLNN---VIKKEEEMIETNNIDM-EXXXXXXXXXXXXKKSTENSEVKNNVFNI 257
            D+   +  LNN    +  +    E+  +D+              ++S    + KNN   +
Sbjct: 979  DLNSQLSNLNNEKDSLTNKLSETESEKLDLANQNEKLLKVIEDLQRSLSEEKDKNNSSLL 1038

Query: 258  RLLKTSKREGLIRARLYGADNSVGDV 283
             L    K   L++ ++   +  V ++
Sbjct: 1039 SLGDFGKENALLKEKVADLEKQVSNL 1064


>UniRef50_A2SR79 Cluster: Glycosyl transferase, family 2; n=1;
           Methanocorpusculum labreanum Z|Rep: Glycosyl
           transferase, family 2 - Methanocorpusculum labreanum
           (strain ATCC 43576 / DSM 4855 / Z)
          Length = 376

 Score = 39.5 bits (88), Expect = 0.25
 Identities = 20/45 (44%), Positives = 26/45 (57%), Gaps = 1/45 (2%)

Query: 151 LPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSD 195
           LP  SI+IC YNE   T+ R + SI   T    + E++LV D SD
Sbjct: 44  LPAISIVICAYNEE-RTIARKIQSISSCTYPNELMEVVLVIDCSD 87


>UniRef50_A0YT83 Cluster: Putative uncharacterized protein; n=1;
           Lyngbya sp. PCC 8106|Rep: Putative uncharacterized
           protein - Lyngbya sp. PCC 8106
          Length = 342

 Score = 39.1 bits (87), Expect = 0.32
 Identities = 20/52 (38%), Positives = 30/52 (57%), Gaps = 3/52 (5%)

Query: 262 TSKRE---GLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVD 310
           T KRE   G   A+L GA+ + G++++++DS  E    WL  +L  LSQ  D
Sbjct: 73  TVKREPGIGYHEAKLLGAELATGEIVIYMDSDCEYEPQWLSSILTTLSQNYD 124


>UniRef50_Q8MNV4 Cluster: Putative uncharacterized protein; n=2;
           Caenorhabditis elegans|Rep: Putative uncharacterized
           protein - Caenorhabditis elegans
          Length = 1046

 Score = 39.1 bits (87), Expect = 0.32
 Identities = 50/211 (23%), Positives = 95/211 (45%), Gaps = 16/211 (7%)

Query: 21  DIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDY----KRQSEYRRKVMLK 76
           D+K+RI  + K     +  +LL D+ R  EE   +   K ++     K+Q    R +   
Sbjct: 323 DMKERIITSKKDD---DSNNLLQDELRRTEEKYQQAQKKIENLDETIKQQETQIRDLGRS 379

Query: 77  EKFAKQQAIKMSKKTENDLEEQFG--LIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDL 134
              AK+Q  KMS++ +N+   + G    R+ E+   +++   L +    + Q++    +L
Sbjct: 380 LDEAKRQLQKMSEQRQNEEVARQGEDSARSMEEKATKEEIKKLKS-QVQLQQQLEQDLEL 438

Query: 135 PDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYS 194
              R +    Q+   E  +AS+      + + TLM S++S+ +   Q   +   L  +  
Sbjct: 439 QKKRVQELTEQRKVLE-SKASVA-----DEFGTLMSSLNSLREENRQYEEETRSLQTNIR 492

Query: 195 DLYNLHHDVQEAVDKLNNVIKKEEEMIETNN 225
            L +  +  Q+A+ +  N  +K EE IE  N
Sbjct: 493 TLQDEVYQHQDAITEWKNRAEKAEEYIEKEN 523


>UniRef50_Q55FF7 Cluster: Putative uncharacterized protein; n=1;
           Dictyostelium discoideum AX4|Rep: Putative
           uncharacterized protein - Dictyostelium discoideum AX4
          Length = 898

 Score = 39.1 bits (87), Expect = 0.32
 Identities = 28/119 (23%), Positives = 57/119 (47%), Gaps = 1/119 (0%)

Query: 23  KDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQ 82
           KD+ K+ ++     + K+ + D+ ++ E+   K   K +D ++  + R K   KEK   +
Sbjct: 124 KDKEKDKEREREKEKEKEKVKDREKEKEKEKEKEKEKVKDREKVKD-REKEKEKEKERDK 182

Query: 83  QAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKL 141
              K SK  E D+E++    R  E  +IRD+  + ++ N +I  +      +  T+  +
Sbjct: 183 LKPKDSKIKERDIEKEKVRDREKEREKIRDREKDKNSNNNIIKPKEKKDESIAKTQKNI 241


>UniRef50_Q9V2L6 Cluster: Dpm1 dolichol-phosphate
           mannosyltransferase; n=4; Thermococcaceae|Rep: Dpm1
           dolichol-phosphate mannosyltransferase - Pyrococcus
           abyssi
          Length = 362

 Score = 39.1 bits (87), Expect = 0.32
 Identities = 19/66 (28%), Positives = 38/66 (57%), Gaps = 1/66 (1%)

Query: 252 NNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVD- 310
           ++V+ +++++    +GL  A + G   + GDV V +D+ ++     +P LLKR+ +G D 
Sbjct: 64  SSVYPVKVIRRINEKGLSSAVIRGFKEASGDVFVVMDADLQHPPEVIPELLKRIKEGADL 123

Query: 311 GVKVRY 316
            +  RY
Sbjct: 124 AIASRY 129


>UniRef50_Q5LW10 Cluster: Diguanylate cyclase, putative; n=3;
           Rhodobacterales|Rep: Diguanylate cyclase, putative -
           Silicibacter pomeroyi
          Length = 513

 Score = 38.7 bits (86), Expect = 0.43
 Identities = 22/59 (37%), Positives = 34/59 (57%), Gaps = 2/59 (3%)

Query: 150 ELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVD 208
           +LPR S+      + +E  +  +H++   TD+K    II +DDY+DL + HH  QEA D
Sbjct: 68  QLPRDSVTGLMLKDGFEGALTHIHAMAAETDRKSACFIIELDDYADLVD-HHG-QEAGD 124


>UniRef50_Q7NBF8 Cluster: Putative uncharacterized protein; n=1;
            Mycoplasma gallisepticum|Rep: Putative uncharacterized
            protein - Mycoplasma gallisepticum
          Length = 1931

 Score = 38.3 bits (85), Expect = 0.57
 Identities = 27/105 (25%), Positives = 55/105 (52%), Gaps = 6/105 (5%)

Query: 14   HYRPLNWDIK-DRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWK-YQDYKRQSEYRR 71
            HY+ L   IK ++ K N++        +LL+DQ       ++    K Y  YK+Q + ++
Sbjct: 1317 HYQRLERAIKNEQHKLNNQKNNFFNKVELLNDQLNKKSSKIALLRSKIYNTYKQQQQ-QK 1375

Query: 72   KVMLKEKFAKQQAIKMSKKTENDLEE---QFGLIRNSEDLRIRDK 113
            +++L+EK    Q  K   KT+ +L +   QF + +  E+ +++++
Sbjct: 1376 QILLEEKHKNSQLRKSLLKTQEELHQQKAQFSIAKKQEEKKLKNQ 1420


>UniRef50_A5HZ17 Cluster: Exonuclease; n=4; Clostridium
           botulinum|Rep: Exonuclease - Clostridium botulinum A
           str. ATCC 3502
          Length = 1176

 Score = 38.3 bits (85), Expect = 0.57
 Identities = 44/197 (22%), Positives = 95/197 (48%), Gaps = 14/197 (7%)

Query: 37  EGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKE-KFAKQQAIKMSKKTEND- 94
           E +++++D+PRD+++ +   +    +     ++ R V+L + KF+  + +K+S K+  D 
Sbjct: 115 EEEEIIADKPRDIQKNIESIIGLTAE-----DFTRSVVLPQGKFS--EFLKLSGKSRRDM 167

Query: 95  LEEQFGLIRNSEDL--RIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELP 152
           LE  FGL +  + L  RIR       +  TLI  R+  ++D+  ++ KL + +  ++ L 
Sbjct: 168 LERIFGLEKYGKKLLERIRKARNKEISSLTLIEGRLEQYKDI--SKEKLQELKIQYENLL 225

Query: 153 RASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNN 212
           +    I    E    L     +I +  ++ +I    L +    L N+  + +  V+K  N
Sbjct: 226 KEKSKIAKEKEESNKLYEKYKNIWELQEELNIYLNKLENLKKGLLNI-EEKRIKVEKGKN 284

Query: 213 VIKKEEEMIETNNIDME 229
            +  +  + E + I+ +
Sbjct: 285 ALSVKPYIDELSKIESD 301


>UniRef50_A0L591 Cluster: Glycosyl transferase, family 2; n=1;
           Magnetococcus sp. MC-1|Rep: Glycosyl transferase, family
           2 - Magnetococcus sp. (strain MC-1)
          Length = 332

 Score = 38.3 bits (85), Expect = 0.57
 Identities = 20/63 (31%), Positives = 34/63 (53%)

Query: 257 IRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY 316
           I++L  S+R G+    L G   SVGD +++LDS ++     +P +L++   G D V    
Sbjct: 66  IKILNMSRRFGVYECMLAGMIASVGDAVIYLDSDLQDPPELIPQMLEQWRNGADIVHTTR 125

Query: 317 SAR 319
           + R
Sbjct: 126 TER 128


>UniRef50_Q8IBY8 Cluster: Putative uncharacterized protein PF07_0042;
            n=1; Plasmodium falciparum 3D7|Rep: Putative
            uncharacterized protein PF07_0042 - Plasmodium falciparum
            (isolate 3D7)
          Length = 2910

 Score = 38.3 bits (85), Expect = 0.57
 Identities = 63/294 (21%), Positives = 135/294 (45%), Gaps = 17/294 (5%)

Query: 5    QIVNKATKVHYRPLNWDIK-DRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDY 63
            QI+    K +   LN D++ +R +NND  I  L+ +   ++Q  D  E   K   + Q+ 
Sbjct: 1471 QILYDDGKNNISQLNIDLENERTRNNDLKIL-LDQEKKKNEQINDDLENERKRNNQLQNI 1529

Query: 64   KRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTL 123
              + E ++K  L   + +Q+ I  + + EN+L++Q   I   + +   ++ + +   NT 
Sbjct: 1530 LNE-EQKKKEQLNVSYEEQKNI--NHQLENELQKQ--RITYKKMIAKFERKFLMK--NTN 1582

Query: 124  ISQRIGDHRDLPDTRNKLCQSQQYFDE---LPRASIIICFYNEHYETLMRSVHSIMDRTD 180
             +Q+I D + + DT+ K+  +Q+  D    +  ++ +    +EH      +     D T 
Sbjct: 1583 DTQKIKDTQQIIDTQ-KIIDTQKIIDTQKIIDTSNNVNQMNDEHKHVDQMNDAESEDNTF 1641

Query: 181  QKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEMIETNNIDMEXXXXXXXXXXX 240
             +   E +   +   +  L  D ++ +D+LN  ++KE+E+ +   I ME           
Sbjct: 1642 LELQLEKVKQVNIDMIIQLKKD-KKRIDELNLELEKEKEVNDKIIIQME--EYKMKIEHI 1698

Query: 241  XKKSTENSEVKNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVN 294
             ++  +  E+ +N+ NIR+ K +++   +  +L         + + LD   ++N
Sbjct: 1699 NEELEKEKEINHNL-NIRIEKDNEKNEQLNIQLDTEKKMNNQMSIELDEEKKMN 1751


>UniRef50_Q86KX8 Cluster: Similar to Dictyostelium discoideum (Slime
            mold). Interaptin; n=2; Dictyostelium discoideum|Rep:
            Similar to Dictyostelium discoideum (Slime mold).
            Interaptin - Dictyostelium discoideum (Slime mold)
          Length = 1781

 Score = 38.3 bits (85), Expect = 0.57
 Identities = 48/231 (20%), Positives = 95/231 (41%), Gaps = 15/231 (6%)

Query: 9    KATKVHYRPLNWDIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSE 68
            K +  H R  N      IK  ++SI+ LE   L S Q  +  +       ++QD K QS 
Sbjct: 1402 KESNTHLRKENDKDTLVIKQLEQSISQLE--HLHSQQTENYLKERELIQQQHQDEK-QSS 1458

Query: 69   YRRKVMLKEKFAKQQA--------IKMSKKTENDLEEQFGL----IRNSEDLRIRDKGYN 116
             +    LK KF ++Q         +  SK+  N L+++F L    I+  +D +      N
Sbjct: 1459 IQSTHQLKSKFDEKQQQYDESLEKLSQSKQELNKLKQEFDLNILVIQKLQDDKQSQSDSN 1518

Query: 117  LHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIM 176
            L   + L  Q++ +   +        Q      +    ++ I    +  +  + S+H + 
Sbjct: 1519 LQLKSNLEEQQLQNQESIEKISTLQQQVNHLQQQFNINTLEIQKLQDEKQLSIESIHQLK 1578

Query: 177  DRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEMIETNNID 227
             + D+K  +    ++  +DL      +Q+ ++   N  ++ +E I T  ++
Sbjct: 1579 SKFDEKQQQYNESIEKSNDLQKQSDQLQQKLENSTNENQQLQEKISTIQLE 1629


>UniRef50_Q22W40 Cluster: Putative uncharacterized protein; n=1;
           Tetrahymena thermophila SB210|Rep: Putative
           uncharacterized protein - Tetrahymena thermophila SB210
          Length = 970

 Score = 38.3 bits (85), Expect = 0.57
 Identities = 47/222 (21%), Positives = 97/222 (43%), Gaps = 15/222 (6%)

Query: 9   KATKVHYRPLNWDIKDRIKNNDKSIAALEGK-DLLSDQPRDVEETLSKTMWKYQDYKRQS 67
           K ++  +  +   I   IK N+     L+ K D      RD +  +    WK + Y+ + 
Sbjct: 16  KISQEKWEEIQEKIGSLIKTNETLNQMLKLKTDEYEILKRDNQRNVENENWKKKYYQLEQ 75

Query: 68  EYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKG---YNLHAFNTLI 124
           E ++   ++++ A    I  + + + ++ E+   I N  +L ++DK     +L   N  +
Sbjct: 76  EMQKIAQIEQEVADYLPIMRNFEEKANMNEK--QIENL-NLILKDKENLIQDLSKANAQL 132

Query: 125 SQRIGDHRDLPDT----RNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTD 180
            Q+    + L       +N   Q Q+ F+ L   S+ +   +EH   L  ++ S+MD+ +
Sbjct: 133 EQQCHSQQQLAQELSFFKNLSEQKQKSFENLKLNSVSL---DEHKRVLKENI-SLMDKLE 188

Query: 181 QKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEMIE 222
           +K  +   L      + +    +Q+  + +NN+  KE E  E
Sbjct: 189 KKEKEYQKLTVGLQQVNDYEQKMQQMENAVNNLRNKERENYE 230


>UniRef50_A4YEH2 Cluster: Glycosyl transferase, family 2; n=1;
           Metallosphaera sedula DSM 5348|Rep: Glycosyl
           transferase, family 2 - Metallosphaera sedula DSM 5348
          Length = 421

 Score = 38.3 bits (85), Expect = 0.57
 Identities = 24/76 (31%), Positives = 38/76 (50%), Gaps = 1/76 (1%)

Query: 274 YGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRYSARAVTPVIDVINADTF 333
           YG   S G++LVFLD+   V+   L  +   LSQ  + + +R   R     + V+ ++  
Sbjct: 115 YGVSLSTGEILVFLDAEARVDPTILTRISAHLSQ-AEAMALRLRVRDPKNKLQVLYSEIT 173

Query: 334 EYSPSPLVRGGFNWGL 349
           E+S   L RG +  GL
Sbjct: 174 EFSMDSLFRGRYLKGL 189


>UniRef50_UPI00006CE63C Cluster: hypothetical protein
           TTHERM_00709570; n=1; Tetrahymena thermophila SB210|Rep:
           hypothetical protein TTHERM_00709570 - Tetrahymena
           thermophila SB210
          Length = 684

 Score = 37.9 bits (84), Expect = 0.75
 Identities = 62/293 (21%), Positives = 133/293 (45%), Gaps = 31/293 (10%)

Query: 2   SGNQIVN--KATKVHYRPLNWDI-KDRIKNND-KSIAALEGKDLLSDQPRDVEETLSKTM 57
           +GNQI +     K  Y  L  D  ++++++N+ KS   L+ K L ++  + V E   + +
Sbjct: 84  TGNQIQDLYDKLKADYTKLMIDFQREQMESNNLKSQNRLDAKKL-NELDKLVAEQQQEIV 142

Query: 58  WKYQDYKRQSEYRRKVMLKEKFAKQQAIKMSKKT-ENDLEEQFGLIRNSEDLRIRDKGY- 115
               + + Q       +L++ + ++Q  +++K T EN    Q   + ++  L +++K Y 
Sbjct: 143 TLNNEIQLQISQNN--VLRD-YLREQENRVTKITAENTTLNQE--LSSTRPLSLKNKSYT 197

Query: 116 NLHAFNTLISQRIGDHRDLPDTRNKLCQS-----QQYFDELPRASIIICFYNEHYETLMR 170
           ++   N L  ++I + ++  + +N   Q      QQ  ++L    +  C    H + + R
Sbjct: 198 SVLQQNDLKYEQILESKECLERQNDQLQQHNSRLQQLHEQLNHVKMQFCTLENHSQQIQR 257

Query: 171 SVHSIMDRTDQKHIKEIILV-------DDYSDLYNLHHDVQEAVDKLNNVIKKEEEMIE- 222
           ++  +   + Q  I E +L        +DY ++ N++ ++QE   +L  +I+K E+ I+ 
Sbjct: 258 NLKQLELESSQSKINESLLQKSYQKLEEDYREMQNMNINLQEHNKQLGILIQKLEDKIKI 317

Query: 223 -----TNNIDMEXXXXXXXXXXXXKKSTEN-SEVKNNVFNIRLLKTSKREGLI 269
                 N +  E             KS  N  E++N +  +++    +RE  I
Sbjct: 318 MEEQCNNTVFWEQTAKDFELQEEYFKSNSNIKEMENYMQKLKIQMKEERERYI 370


>UniRef50_Q6MF69 Cluster: Putative uncharacterized protein; n=2;
           Candidatus Protochlamydia amoebophila UWE25|Rep:
           Putative uncharacterized protein - Protochlamydia
           amoebophila (strain UWE25)
          Length = 449

 Score = 37.9 bits (84), Expect = 0.75
 Identities = 21/95 (22%), Positives = 52/95 (54%), Gaps = 4/95 (4%)

Query: 23  KDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQ 82
           KD I++  KS+  ++ + +  ++  +++++L      ++++++  E    ++  +K  KQ
Sbjct: 330 KDYIQDTLKSLVKVQNESIHIEEKNEIQKSLKILKEDHEEFQKAIEEDLAILAAKKGKKQ 389

Query: 83  QAI----KMSKKTENDLEEQFGLIRNSEDLRIRDK 113
           +      K+  K ENDLEE+  + R+ E + + +K
Sbjct: 390 RGEGDIEKLEAKLENDLEEKGNVERDLEAMHLVEK 424


>UniRef50_Q1Q6W0 Cluster: Putative uncharacterized protein; n=1;
           Candidatus Kuenenia stuttgartiensis|Rep: Putative
           uncharacterized protein - Candidatus Kuenenia
           stuttgartiensis
          Length = 302

 Score = 37.9 bits (84), Expect = 0.75
 Identities = 24/70 (34%), Positives = 35/70 (50%), Gaps = 1/70 (1%)

Query: 244 STENSEVKNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLK 303
           ST+NS      F ++LL     +G   AR  G  N+ G++L F DS   V+  WL   +K
Sbjct: 44  STDNSLQSIKAFPVKLLIEKDVKGSYAARNLGVKNAEGEILAFTDSDCVVDKYWLCNAIK 103

Query: 304 RL-SQGVDGV 312
              ++ V GV
Sbjct: 104 YFAAEDVGGV 113


>UniRef50_A1AUA8 Cluster: Glycosyl transferase, family 2; n=1;
           Pelobacter propionicus DSM 2379|Rep: Glycosyl
           transferase, family 2 - Pelobacter propionicus (strain
           DSM 2379)
          Length = 1268

 Score = 37.9 bits (84), Expect = 0.75
 Identities = 18/51 (35%), Positives = 28/51 (54%)

Query: 256 NIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLS 306
           + R+++     G   A   GAD + G +L+FL++  EV  GW PPL   +S
Sbjct: 64  HFRIVRNDYAHGFAAACNRGADVARGHLLLFLNNDTEVQPGWFPPLYALIS 114


>UniRef50_Q23A50 Cluster: Putative uncharacterized protein; n=1;
           Tetrahymena thermophila SB210|Rep: Putative
           uncharacterized protein - Tetrahymena thermophila SB210
          Length = 1670

 Score = 37.9 bits (84), Expect = 0.75
 Identities = 27/115 (23%), Positives = 54/115 (46%), Gaps = 3/115 (2%)

Query: 64  KRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTL 123
           K++S  ++K++ K+ + + +  K SK+    ++    LI N   +    K     +F TL
Sbjct: 107 KQESNQKKKIINKQNYYQIEQSKASKQAYK-IDTFKQLIINQRQIFNSSKIEKKQSFGTL 165

Query: 124 ISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDR 178
           +++R G   +   T   L QSQQ ++   +   +  FY  H E     V +++ +
Sbjct: 166 VNERKGFSNEKAQT--PLWQSQQKYNFASKEGFLSEFYQTHSELKQGRVQALLQK 218


>UniRef50_A2FSZ8 Cluster: Viral A-type inclusion protein, putative;
            n=1; Trichomonas vaginalis G3|Rep: Viral A-type inclusion
            protein, putative - Trichomonas vaginalis G3
          Length = 4045

 Score = 37.9 bits (84), Expect = 0.75
 Identities = 47/223 (21%), Positives = 99/223 (44%), Gaps = 21/223 (9%)

Query: 4    NQIVNKATKVHYRPLNWDIKDRIKNNDKSIAALEGK-DLLSDQPRDVEETLSKTMWKYQD 62
            N I N+  +V+ +  N D+K+ +   +  I A+  +   +S +  D+++  SK+   YQD
Sbjct: 1710 NSINNELRRVNSQ--NNDLKELLAKKESEINAINNELKRISSENNDLKDINSKSENNYQD 1767

Query: 63   YKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNT 122
               Q +  +  + + K   Q+ +K S + +N L+          DL I +K   + +  +
Sbjct: 1768 ---QLKNLKNQLTQLKNENQKLMKSSTEEKNKLK----------DL-INEKNIQIQSLQS 1813

Query: 123  LISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQK 182
                 + +   +    NKL   Q+  DE    + ++   NE  +  + S  + +   DQK
Sbjct: 1814 KNEDLVNNQSKI---NNKLESIQKDLDEKENQNSVLISENEKLQNELMSSKTEIQTLDQK 1870

Query: 183  HIK-EIILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEMIETN 224
              +    L +   +  +L   + +  +KLNN+ +  E++ + N
Sbjct: 1871 ETEFNDKLREMERNNRSLSSQINDLKEKLNNLTETNEKISDEN 1913


>UniRef50_A2E4S4 Cluster: Viral A-type inclusion protein, putative;
            n=1; Trichomonas vaginalis G3|Rep: Viral A-type inclusion
            protein, putative - Trichomonas vaginalis G3
          Length = 1795

 Score = 37.9 bits (84), Expect = 0.75
 Identities = 48/212 (22%), Positives = 93/212 (43%), Gaps = 18/212 (8%)

Query: 9    KATKVHYRPLNWDIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSE 68
            K  ++  +  N    D+I+++ K I  + GK  + D   ++ ++L + + K ++      
Sbjct: 1012 KENEIQEKMENLKKMDQIQSSQKIICGIVGKQTIDDAINEI-KSLKEQIQKLKNEISVKS 1070

Query: 69   YRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIR-NSEDLRIRDKGYNLHAFNTLISQR 127
             +    +KEK    +A      TE DL +   ++   +E+L+  DK      FN+ IS  
Sbjct: 1071 DKILNDIKEKMKLPEA-----TTEQDLVDAIAVMEIENEELKENDKKLR-EIFNSDIS-- 1122

Query: 128  IGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEI 187
            +     L +  N L Q +     L  +  ++    +  + L  +   I+D  D+  I E+
Sbjct: 1123 VDTLEILKEVEN-LKQKEDQLKSLVESENLV----DEIQKLNENQQRILDECDKSDISEV 1177

Query: 188  ILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEE 219
            I  ++  DL  L  DV+      N+V + +EE
Sbjct: 1178 I--EEIKDLKKLQQDVENCF-PTNDVSQIKEE 1206


>UniRef50_A2DKP8 Cluster: Viral A-type inclusion protein, putative;
            n=1; Trichomonas vaginalis G3|Rep: Viral A-type inclusion
            protein, putative - Trichomonas vaginalis G3
          Length = 1618

 Score = 37.9 bits (84), Expect = 0.75
 Identities = 39/239 (16%), Positives = 98/239 (41%), Gaps = 13/239 (5%)

Query: 37   EGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLE 96
            +  DL+++  + +EE   ++  +  +Y++Q E  +  ++  +   Q+  K  ++ +   E
Sbjct: 1028 QNNDLIANYKKQIEELSKQSNEEVVNYQKQVEDLKNKLIDLQQNNQEIAKYQQQIDELNE 1087

Query: 97   EQFGLIRNSEDL--RIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRA 154
            E+    +   +L  ++      ++ +   I       +DL +   ++ + Q   D+L + 
Sbjct: 1088 EKSNSEKQINELNQKLNQNNEEINKYQKQIEDLNQKLKDLQENNQEIAKYQNEVDDLKKK 1147

Query: 155  SIIICFYNEHYETLMRSVHSIMDRTDQKHIKEII-----LVDDYSDLYNLHHDVQEAVDK 209
              +    NE      + +   M + +Q ++K+I      L++  S++ NL+  +   +  
Sbjct: 1148 FDV---SNEEIANKEKEIEE-MKKKEQNYLKQISELNNHLMEKQSEIVNLNSKLDNQIYN 1203

Query: 210  LNNVIKKEEEMIETNNIDMEXXXXXXXXXXXXKKSTENSEVKNNVFNIRLLKTSKREGL 268
            LN   KK+   +  N++  +            K++ +      N   I L   +K   L
Sbjct: 1204 LNT--KKQNLEMNLNDLQTKLKQIEQENANLSKRNKDLENESQNQAKITLETQNKNVDL 1260


>UniRef50_Q2S1Y8 Cluster: Glycosyl transferase, group 2 family
           protein; n=1; Salinibacter ruber DSM 13855|Rep: Glycosyl
           transferase, group 2 family protein - Salinibacter ruber
           (strain DSM 13855)
          Length = 391

 Score = 37.5 bits (83), Expect = 0.99
 Identities = 43/176 (24%), Positives = 69/176 (39%), Gaps = 14/176 (7%)

Query: 281 GDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRYSARAVTP-VIDVINADTFEYSPSP 339
           G  +V L++ +EV  GWL PL       V+    R    AV P ++   +   FEY+   
Sbjct: 126 GRFVVLLNNDVEVPPGWLHPL-------VEAAAGRPDVAAVQPKLLQYDDRGRFEYAGGA 178

Query: 340 LVRGGF--NWGLHFKWDNLPKGTLINDEDFMKPLKSPTMAGGLFAIYREYFNAIGKYDPG 397
              GGF    G  F    L +    +   +  P       G    + R   + +G  D  
Sbjct: 179 ---GGFLDRAGYPFTRGRLFETMERDRGQYDDPRDVFWATGAALLLRRSALDEVGPLDER 235

Query: 398 MNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKRRPYGVGEKQDYMLQNSMRM 453
             +   E +++ +R+W  G  + + P S V H+     P     K  Y  +NS+ M
Sbjct: 236 FEMHM-EEIDLCWRLWRHGYRVRVAPESTVYHIGGASLPQSSPRKTYYNYRNSLLM 290


>UniRef50_A2FD36 Cluster: Viral A-type inclusion protein, putative;
            n=1; Trichomonas vaginalis G3|Rep: Viral A-type inclusion
            protein, putative - Trichomonas vaginalis G3
          Length = 3977

 Score = 37.5 bits (83), Expect = 0.99
 Identities = 43/214 (20%), Positives = 89/214 (41%), Gaps = 14/214 (6%)

Query: 22   IKDRIKN-NDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFA 80
            +K  +K  NDK+     G D+L  +   +   +S    +    K  +E + K + + K  
Sbjct: 2677 LKSSLKELNDKNKELQNGNDILKQENETLTPKISSLESENSSLKSTNEIKDKEIEELKQK 2736

Query: 81   KQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNK 140
              +  +++ + E+DL+ +    R   +  + +    L      I  R    ++L +   +
Sbjct: 2737 LSEISQLNSQHESDLDSR----RKQFEKELEELRNQLEKLQNEIQIREQRGKELSNQNEE 2792

Query: 141  LCQS-QQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQ------KHIKEI--ILVD 191
            L  + ++   EL  A +     ++  ETL +S+       DQ      K I+E+   L+ 
Sbjct: 2793 LMNNLEKMKSELNDAKMNKEHSDQENETLKKSLEENQQNYDQLVDELSKEIEELKKQLLT 2852

Query: 192  DYSDLYNLHHDVQEAVDKLNNVIKKEEEMIETNN 225
               +  +  H++ E   K+ N+  + E +  TNN
Sbjct: 2853 KAEESNSSKHEIDELQSKIQNLSSENENLKSTNN 2886



 Score = 37.1 bits (82), Expect = 1.3
 Identities = 53/240 (22%), Positives = 112/240 (46%), Gaps = 27/240 (11%)

Query: 1    MSGNQIVNKATKVHYRPLNWDIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKY 60
            +SGN++++   K+       D+ ++I +  K        ++L+ Q  +  + + +   K 
Sbjct: 3108 LSGNELLSNNEKLEQEQS--DLMNQINDLRKK------NEILNQQQANNNQIIKECQEKI 3159

Query: 61   QDYKRQS-EYRRKV---MLKEKFAKQQAIKMSKKTE----NDLEEQFGLIRNSEDLRI-- 110
            Q+Y+  + E +RK+   M   + AK Q  ++ K  E    ND +    L +  E L+   
Sbjct: 3160 QNYEESNNELQRKLNEAMNNNENAKNQIDQLKKLLEETKQNDDKLVEELTKEIEKLKNEQ 3219

Query: 111  RDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMR 170
            + K  N++  + L   +    +   D   K   +Q++++     + +I    +  E+L +
Sbjct: 3220 QSKDQNINDLSALNKDKSSLIQQNDDLSKK---TQEFYNSQQNQAQMIEDLKKQNESLQK 3276

Query: 171  SVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEE---MIETNNID 227
            ++  I +   Q++I +  L  D SDL +  HD +  ++ LN++IK+  E   +IE  N +
Sbjct: 3277 NLE-INNNETQQNIDQ--LTKDKSDLASKLHDYEAKINDLNSLIKELNEKNAIIEKKNYE 3333


>UniRef50_A0DTN4 Cluster: Chromosome undetermined scaffold_63, whole
           genome shotgun sequence; n=3; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_63,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 566

 Score = 37.5 bits (83), Expect = 0.99
 Identities = 25/112 (22%), Positives = 57/112 (50%), Gaps = 3/112 (2%)

Query: 39  KDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTEND--LE 96
           K+ + ++ RD+   L K M K+++ +++++  +KV +K+K       K ++  E D  + 
Sbjct: 187 KNEMKEKERDLRY-LKKEMIKFEEKQKETQQNQKVTVKKKEVIDIDDKKAEIIEKDQQIS 245

Query: 97  EQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYF 148
           +Q  +I   ++       Y L + N LI +  G ++ +   R +L +  +Y+
Sbjct: 246 QQEQIINQLKEKLQERTNYELRSKNALIDELQGLNQQVQQMREELKELNEYY 297


>UniRef50_A0CKL2 Cluster: Chromosome undetermined scaffold_2, whole
           genome shotgun sequence; n=2; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_2,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 521

 Score = 37.5 bits (83), Expect = 0.99
 Identities = 23/79 (29%), Positives = 44/79 (55%), Gaps = 3/79 (3%)

Query: 27  KNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAK--QQA 84
           K   K +   + ++L  +Q  ++++   +   +Y D  +Q E +R+  ++EKFAK  Q  
Sbjct: 366 KEEKKKLKMHKSQELEKEQVEEIKQKEEQFKKRYSDKLKQFENKRQ-KVEEKFAKKDQYL 424

Query: 85  IKMSKKTENDLEEQFGLIR 103
            +  +K ++DLEE+F  IR
Sbjct: 425 AEHLQKKKDDLEEKFDKIR 443


>UniRef50_Q6CYG5 Cluster: Similarity; n=2; Kluyveromyces lactis|Rep:
           Similarity - Kluyveromyces lactis (Yeast) (Candida
           sphaerica)
          Length = 1748

 Score = 37.5 bits (83), Expect = 0.99
 Identities = 52/233 (22%), Positives = 106/233 (45%), Gaps = 25/233 (10%)

Query: 5   QIVNKATKVHYRPLNWDIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYK 64
           +++   ++V  R +  + +  ++   K +A    +D+       +EE LS T  K  + +
Sbjct: 94  RLIEGNSQVTKRIIELEQEIEVERQQKELADASKQDIAESLNEKIEE-LSSTKAKLNEAQ 152

Query: 65  -RQSEYRRKVMLKE-KFAKQQAIKMSKKTEN-DLEEQFGLIRNSEDLRIRDKGYNLHAFN 121
               E R+KV+  E +   QQA+++  K+E   +E++  L+R + D             N
Sbjct: 153 GANKELRQKVVNTETELQTQQALELRSKSEILRMEQEITLLRENNDWLTNQLNTKTVQLN 212

Query: 122 TLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQ 181
                 I +   L D++ K+   +    E+ R S          + L +SVHS+ ++ +Q
Sbjct: 213 EFRESTISE---LQDSQLKVSNMESEL-EIARTS---------NQKLKQSVHSLHEQLEQ 259

Query: 182 KHIKEIILVDDYS----DL---YNLHHDVQEAVDKLNNVIKKEEEMIETNNID 227
           K  +   + D+Y+    +L    +L   + +A++K    +KKE +  + NN+D
Sbjct: 260 KLSENKEIKDEYNFSKQELTKEMSLKQRMIDALEKHMESLKKEMDATK-NNMD 311


>UniRef50_Q8TXA4 Cluster: Uncharacterized protein; n=2; cellular
           organisms|Rep: Uncharacterized protein - Methanopyrus
           kandleri
          Length = 609

 Score = 37.5 bits (83), Expect = 0.99
 Identities = 36/116 (31%), Positives = 57/116 (49%), Gaps = 13/116 (11%)

Query: 42  LSDQPRDVEETLSKTMWKYQDYK----RQSEYRRKV-MLKEKFAK-QQAIKMSKKTENDL 95
           LSDQ R + E L K   KY + K    R  E  ++V  LK++ AK Q  +K  K   +DL
Sbjct: 202 LSDQNRRLAENLKKLKEKYNEIKEERDRLKEETKEVGKLKDQLAKLQSKLKEVKSERDDL 261

Query: 96  EEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDEL 151
             +   +RN E+ ++R K       + L S+     + L D   KL +++Q+  +L
Sbjct: 262 ANEVEALRN-ENEKLRKK------IDKLKSELSNLQKKLKDREKKLEKARQHIGKL 310


>UniRef50_UPI00006CD032 Cluster: Micro-fibrillar-associated protein
           1 C-terminus containing protein; n=1; Tetrahymena
           thermophila SB210|Rep: Micro-fibrillar-associated
           protein 1 C-terminus containing protein - Tetrahymena
           thermophila SB210
          Length = 521

 Score = 37.1 bits (82), Expect = 1.3
 Identities = 23/97 (23%), Positives = 44/97 (45%), Gaps = 3/97 (3%)

Query: 2   SGNQIVNKATKVHYRPLNWDIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQ 61
           +G  + N   ++    +N DIK   +N  ++ A      LL  Q  + E+  +K + K Q
Sbjct: 150 TGQSLQNSEIQITTHNINGDIKVESRNEGRAAARAR---LLQRQKEEEEQKKNKQLSKQQ 206

Query: 62  DYKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQ 98
             + + E R+   LKEK  + +  + S     D +++
Sbjct: 207 SEEMEIERRKSQQLKEKANESEKSEQSSSENEDNDDE 243



 Score = 35.1 bits (77), Expect = 5.3
 Identities = 25/97 (25%), Positives = 48/97 (49%), Gaps = 5/97 (5%)

Query: 28  NNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQQAIKM 87
           + ++S  + +GKD ++D   D++    +  WK ++ KR  + R + + +EK   +Q  + 
Sbjct: 335 DEEQSDDSRQGKDFMNDDD-DMDREFEREQWKIRELKRIRKDRDEQIKREKELAEQERRS 393

Query: 88  SKKTENDLEE--QFGLIRNSEDLRI--RDKGYNLHAF 120
               E  +EE  + GL +  E  +I    K Y+  AF
Sbjct: 394 KMTNEEIIEEDKRLGLHQKKEKRQIGFMQKYYHKGAF 430


>UniRef50_UPI00006CB2DA Cluster: Viral A-type inclusion protein repeat
            containing protein; n=1; Tetrahymena thermophila
            SB210|Rep: Viral A-type inclusion protein repeat
            containing protein - Tetrahymena thermophila SB210
          Length = 2199

 Score = 37.1 bits (82), Expect = 1.3
 Identities = 51/233 (21%), Positives = 102/233 (43%), Gaps = 20/233 (8%)

Query: 9    KATKVHYRPLNWDIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSE 68
            KA   H +  N +  ++IK+N + I  L+ K   S+Q  ++ +   K   +YQ+  +Q E
Sbjct: 1622 KAESYHVKIQNQE--EKIKSNAEMIQVLQEKLKTSEQQANLLKQQLKNK-QYQEDDQQRE 1678

Query: 69   YRRKVMLKEKFAKQQAIKMSKKTE--NDLEEQFGLIRNSEDLRIR----DKGYNLHAFNT 122
             R+ V      A+    ++  + +  +  E ++ +  NS + +I+    +K  N+    +
Sbjct: 1679 TRKSVSFLTSQAEMNKYQLDNQKQKWDQQEAEYKIKINSLNAQIQQLIEEKQSNIDMKKS 1738

Query: 123  LISQR----IGDHRDLPDTRNKLCQSQQYFDELPRA----SIIICFYNEHYETLMRSVHS 174
             + +R    +   + L D +    QS++  + L +       +I   N+  E+L   +  
Sbjct: 1739 FMKERESVVVDKEKALRDLKQLYAQSRKNEESLEQKISEMEKVILNMNQEIESLRTQLIR 1798

Query: 175  IMDRTDQKHIKEIILVDDYSDLY--NLHHDVQ-EAVDKLNNVIKKEEEMIETN 224
               + +Q           Y+DL   NL   +Q E++  LNN+ K+   M E N
Sbjct: 1799 ANQQIEQMAYARKYEASQYADLRSANLSKALQQESLPTLNNIQKQGSNMGELN 1851


>UniRef50_Q7U947 Cluster: Putative uncharacterized protein; n=1;
           Synechococcus sp. WH 8102|Rep: Putative uncharacterized
           protein - Synechococcus sp. (strain WH8102)
          Length = 614

 Score = 37.1 bits (82), Expect = 1.3
 Identities = 26/111 (23%), Positives = 48/111 (43%), Gaps = 1/111 (0%)

Query: 344 GFNWGLHFKWDNLPKGTLINDEDFMKPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGG 403
           GF   +H  +  L    L      + P    +++  +    R+ F ++G + P       
Sbjct: 496 GFPANIH-PYKGLSVQELEQRHPHLDPYPVDSLSAAMLLFERDRFLSVGGFHPAFGRGDF 554

Query: 404 ENLEISFRIWMCGGSLELIPCSRVGHVFRKRRPYGVGEKQDYMLQNSMRMA 454
           E+LE+S R     G L ++P +R+ H+ R+    G  E   + LQ +  +A
Sbjct: 555 EDLELSQRWKQQQGELWMVPTARLMHLERQSMASGADESAAWALQANAWLA 605


>UniRef50_Q319Q2 Cluster: Putative uncharacterized protein; n=1;
           Prochlorococcus marinus str. MIT 9312|Rep: Putative
           uncharacterized protein - Prochlorococcus marinus
           (strain MIT 9312)
          Length = 292

 Score = 37.1 bits (82), Expect = 1.3
 Identities = 17/47 (36%), Positives = 32/47 (68%)

Query: 252 NNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWL 298
           N+   I+ +K +KR G+ ++R  G  +S+G+V++FLDS  ++ +G L
Sbjct: 56  NSSIKIKHIKPTKRAGVSKSRNIGIISSIGNVILFLDSDDKLIIGAL 102


>UniRef50_Q9GRG0 Cluster: Tetrin B protein; n=2; Tetrahymena
           thermophila|Rep: Tetrin B protein - Tetrahymena
           thermophila
          Length = 731

 Score = 37.1 bits (82), Expect = 1.3
 Identities = 28/104 (26%), Positives = 49/104 (47%), Gaps = 4/104 (3%)

Query: 8   NKATKVHYRPLNWDIK-DRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQ 66
           NKA +     L W  + DR+    KSI     K+L + Q RD+E   S+++ K Q  +  
Sbjct: 554 NKAKQAEAEQLYWKNQTDRVVRQ-KSIEYQVEKELTNSQLRDLER--SQSIEKIQKLRES 610

Query: 67  SEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRI 110
              R  +  K++ A++  +  S+   +   E++  IRN  +  I
Sbjct: 611 DSIRNIIDYKDREAQELRLNQSRAISDLARERYEKIRNQTEAEI 654


>UniRef50_Q8ILK5 Cluster: Putative uncharacterized protein; n=1;
           Plasmodium falciparum 3D7|Rep: Putative uncharacterized
           protein - Plasmodium falciparum (isolate 3D7)
          Length = 966

 Score = 37.1 bits (82), Expect = 1.3
 Identities = 34/165 (20%), Positives = 72/165 (43%), Gaps = 10/165 (6%)

Query: 70  RRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNS----EDLRIRDKGYNLHAFNTLIS 125
           +R V+LKEK          K  E + EE +  ++      +D+  +DKG N   FN   +
Sbjct: 101 KRLVLLKEKLKYNHYYYAGKLAEKEWEENYDNLKKKSQLFKDVLEKDKGKNFSTFNITKN 160

Query: 126 QRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIK 185
           ++I + ++    + +    +    +  +       +N+ Y     S++   D +D  ++K
Sbjct: 161 EKICNIKEKAQKKKQNKNQKNLKKKNFKKEHNDISFNDTYTKYSSSLNDFNDISDSLNLK 220

Query: 186 EIILVDDYSDLYNLHHDVQEAVDKLNNV----IKKEEEMIETNNI 226
             I+ ++  + YNL + + ++     N+     KKE +    NN+
Sbjct: 221 NDIINNE--EEYNLTNSLFQSFPMEQNLPLFKYKKENKQDYDNNV 263


>UniRef50_Q6LF09 Cluster: Putative uncharacterized protein; n=6;
           Plasmodium|Rep: Putative uncharacterized protein -
           Plasmodium falciparum (isolate 3D7)
          Length = 947

 Score = 37.1 bits (82), Expect = 1.3
 Identities = 48/213 (22%), Positives = 96/213 (45%), Gaps = 29/213 (13%)

Query: 28  NNDKSIAALEGKDLLSDQPRDVEETLSKT--MWKYQDYKRQSEYRRKVMLKEKF---AKQ 82
           N DK +   +  ++L ++   + E L KT  + + Q  K +    +K  L +K     K+
Sbjct: 347 NKDKLLIE-KNTEILIEERNYINEELIKTQKLLESQINKNKELENKKTNLLDKIDLLEKK 405

Query: 83  QAIKMSKKTEN-----DLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDT 137
           Q   + K  EN     DL ++F L+ N   ++  +  +N +  N L +          +T
Sbjct: 406 QKDLIKKNNENEQKMDDLNKKFKLLTNENKIKENEILHNNNLINNLNNN---------NT 456

Query: 138 RNKLCQSQQYFD----ELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDY 193
           + K+   Q+++     E  + S+ I     H ++L   + +I++ T +   K      D 
Sbjct: 457 KMKIKLDQEFYKMKMLEKEKKSLSI-----HVKSLTYEIQTILNLTQETQNKFEQQKRDI 511

Query: 194 SDLYNLHHDVQEAVDKLNNVIKKEEEMIETNNI 226
           +DL       ++ V+K+++VIKK  E+ + + I
Sbjct: 512 NDLIIEKEQTKKLVEKIDDVIKKNTEIAKKDKI 544


>UniRef50_Q5DAE6 Cluster: SJCHGC05311 protein; n=1; Schistosoma
           japonicum|Rep: SJCHGC05311 protein - Schistosoma
           japonicum (Blood fluke)
          Length = 320

 Score = 37.1 bits (82), Expect = 1.3
 Identities = 22/98 (22%), Positives = 45/98 (45%), Gaps = 3/98 (3%)

Query: 124 ISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKH 183
           + Q +    D P    +  +S    + +P +  +I     H    + +V  + D T Q+H
Sbjct: 111 VHQPVAPFSDFPLPPKQKSESTSQVEPVPNSKPVIVEKESHR---LPTVEEVKDMTRQRH 167

Query: 184 IKEIILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEMI 221
            + ++   D+ +L NL +DV+  V+     +K+ EE +
Sbjct: 168 DQPVLKNIDFRELPNLVNDVESKVEYAKTKLKEFEETL 205


>UniRef50_Q22869 Cluster: Non-muscle myosin heavy chain II; n=3;
            Caenorhabditis|Rep: Non-muscle myosin heavy chain II -
            Caenorhabditis elegans
          Length = 2003

 Score = 37.1 bits (82), Expect = 1.3
 Identities = 58/271 (21%), Positives = 112/271 (41%), Gaps = 14/271 (5%)

Query: 10   ATKVHYRPLNWDIKDRIKNNDKSIAAL--EGKDLLSDQPRDVEETLSKTMWK-YQDYKRQ 66
            A  V  R    D +++I+   K + +L  E +  L ++ R+V E L K   K     K +
Sbjct: 1356 AVAVEARDDALDAQEKIEKEVKEVKSLLAEARKKLDEENREVMEELRKKKEKELSAEKER 1415

Query: 67   SEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSE-DLRIRDKGYNLHAFNTLIS 125
            ++   +   K + AK++AI+ ++  + +L +     R  E  +R  D+       NTL++
Sbjct: 1416 ADMAEQARDKAERAKKKAIQEAEDVQKELTDVVAATREMERKMRKFDQQLAEERNNTLLA 1475

Query: 126  QRIGD--HRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQ-- 181
            Q+  D  H+ L D   K   +    +EL     I+    +   TL   + ++    D   
Sbjct: 1476 QQERDMAHQMLRDAETK---ALVLSNELSEKKDIVDQLEKDKRTLKLEIDNLASTKDDAG 1532

Query: 182  KHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEMIETNNIDMEXXXXXXXXXXXX 241
            K++ E+       D   L    Q+ ++ L + ++  ++      ++M+            
Sbjct: 1533 KNVYELEKTKRRLD-EELSRAEQQIIE-LEDALQLADDARSRVEVNMQAMRSEFERQLAS 1590

Query: 242  KKSTENSEVKNNVFNIRLLKTSKREGLIRAR 272
            ++  E+   K     IR L T + E   RAR
Sbjct: 1591 REEDEDDRKKGLTSKIRNL-TEELESEQRAR 1620


>UniRef50_A2EZ87 Cluster: Viral A-type inclusion protein, putative;
           n=2; cellular organisms|Rep: Viral A-type inclusion
           protein, putative - Trichomonas vaginalis G3
          Length = 2271

 Score = 37.1 bits (82), Expect = 1.3
 Identities = 34/206 (16%), Positives = 92/206 (44%), Gaps = 13/206 (6%)

Query: 21  DIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFA 80
           ++ ++ +NND+    ++    L  +  + +  L K   +  D  ++ E + K +  +   
Sbjct: 588 ELTNKSQNNDELQNQIKQ---LKSELENTQNQLQKVTNEKGDKSKEIEEQNKKLKSQIEE 644

Query: 81  KQQAI-KMSKKTENDLE--EQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDT 137
           + Q I K+  + +   E  EQ  +  +  + ++R++   ++A NT +  +  + + + D 
Sbjct: 645 RDQMISKLQDENQKIAETAEQAAIKSSETNKKLREQFKKVYAENTSLKAK--NEKQVQDL 702

Query: 138 RNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLY 197
             +L + ++        +     Y +  + L +    +MD+  +   + + L +D  ++ 
Sbjct: 703 MQQLDEKEKQLQSKKDEN-----YKQENDQLKKENQDLMDKLKEIENERVELEEDVKNVT 757

Query: 198 NLHHDVQEAVDKLNNVIKKEEEMIET 223
               D++E ++KL   +   E+ +ET
Sbjct: 758 TEKEDLEEEIEKLKEKVDVLEDQLET 783


>UniRef50_Q75E63 Cluster: ABL193Cp; n=1; Eremothecium gossypii|Rep:
           ABL193Cp - Ashbya gossypii (Yeast) (Eremothecium
           gossypii)
          Length = 862

 Score = 37.1 bits (82), Expect = 1.3
 Identities = 20/84 (23%), Positives = 43/84 (51%), Gaps = 1/84 (1%)

Query: 21  DIKDRIKNNDKSIAALEGKDLLSD-QPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKF 79
           D+++ +    + +AALE + L  D +   +  TL+    +  + K+Q E +   ++ ++ 
Sbjct: 195 DLREEMSRLQEELAALENRQLKKDGEISQLNRTLNDKDMQLAELKKQLESKTGEVISQEL 254

Query: 80  AKQQAIKMSKKTENDLEEQFGLIR 103
               A + +K  EN+LE+Q   +R
Sbjct: 255 KAANAHQRAKTLENELEQQKNKVR 278


>UniRef50_Q0URH4 Cluster: Putative uncharacterized protein; n=1;
           Phaeosphaeria nodorum|Rep: Putative uncharacterized
           protein - Phaeosphaeria nodorum (Septoria nodorum)
          Length = 448

 Score = 37.1 bits (82), Expect = 1.3
 Identities = 25/80 (31%), Positives = 39/80 (48%), Gaps = 5/80 (6%)

Query: 39  KDLLSDQPRDVEETLS--KTMWKYQDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLE 96
           KD   D   D E   S  K + +Y  Y+R  + RR +   E+ AK  + ++SK+ +  L 
Sbjct: 310 KDDSEDPSEDPEYNPSAIKMVKRYNVYRRPDQLRRTL---EELAKTSSRRVSKRDKRSLH 366

Query: 97  EQFGLIRNSEDLRIRDKGYN 116
             F  IRN+ +   R  GY+
Sbjct: 367 SSFAEIRNTVEKPTRGPGYS 386


>UniRef50_Q2FPR9 Cluster: Regulatory protein, ArsR; n=2;
           Methanomicrobiales|Rep: Regulatory protein, ArsR -
           Methanospirillum hungatei (strain JF-1 / DSM 864)
          Length = 252

 Score = 37.1 bits (82), Expect = 1.3
 Identities = 21/63 (33%), Positives = 36/63 (57%), Gaps = 3/63 (4%)

Query: 163 EHYETLMRSVHSIMDRTDQKHIKEII--LVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEM 220
           +   +L   +H  M+RT+Q H+  II  + ++YS L   HH V+ A D L++ +  E +M
Sbjct: 16  QEIHSLREDLHRFMERTNQIHVNAIISDIRNEYSGLL-AHHQVERAGDCLSHAMVHECKM 74

Query: 221 IET 223
            +T
Sbjct: 75  HDT 77


>UniRef50_UPI00006CEB56 Cluster: hypothetical protein TTHERM_00370820;
            n=1; Tetrahymena thermophila SB210|Rep: hypothetical
            protein TTHERM_00370820 - Tetrahymena thermophila SB210
          Length = 1792

 Score = 36.7 bits (81), Expect = 1.7
 Identities = 22/78 (28%), Positives = 39/78 (50%), Gaps = 1/78 (1%)

Query: 20   WDIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDY-KRQSEYRRKVMLKEK 78
            W  K  + +  K     E K +  +Q R ++E   K   ++Q++ K+Q E +++   K+K
Sbjct: 1397 WKEKKEVFDQQKQKLIEEQKKIQEEQLRRMQEEKEKKEKEFQEFNKKQIEKQQEEFKKQK 1456

Query: 79   FAKQQAIKMSKKTENDLE 96
              +QQ     KKT  DL+
Sbjct: 1457 EKEQQVNNFKKKTNLDLK 1474


>UniRef50_UPI00006CB6DE Cluster: hypothetical protein
           TTHERM_00494050; n=1; Tetrahymena thermophila SB210|Rep:
           hypothetical protein TTHERM_00494050 - Tetrahymena
           thermophila SB210
          Length = 1181

 Score = 36.7 bits (81), Expect = 1.7
 Identities = 35/159 (22%), Positives = 69/159 (43%), Gaps = 11/159 (6%)

Query: 27  KNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQQAIK 86
           KN+ K     +  D+     +D++  L++   + Q+ K Q E +   ML+EK  K+Q IK
Sbjct: 314 KNSLKIYQLQKELDISQQNTQDIQMQLAQAQKQIQELKNQCELK---MLEEKQMKEQIIK 370

Query: 87  MSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQ 146
            S+   +  ++ F L +        +K   +      I Q      DL D + K+ Q QQ
Sbjct: 371 ESEIKVDSQQKAFQLEQQKS-----EKEQQIRELKRDIEQL---KEDLQDQKEKVIQEQQ 422

Query: 147 YFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIK 185
              +L      +    +  E  ++++ +  D+  +K+ +
Sbjct: 423 KNKDLKNNEYSLTKDIQTLEEQLQNIQNDHDKLQEKYAR 461


>UniRef50_UPI00015A629B Cluster: UPI00015A629B related cluster; n=1;
           Danio rerio|Rep: UPI00015A629B UniRef100 entry - Danio
           rerio
          Length = 2736

 Score = 36.7 bits (81), Expect = 1.7
 Identities = 52/258 (20%), Positives = 105/258 (40%), Gaps = 24/258 (9%)

Query: 15  YRPLNWDIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKR-----QSEY 69
           +R L   IKD+ + + K +A L+      DQ     +  +KT  + Q  K+     QSE 
Sbjct: 368 HRSLEQKIKDQERESQKELAQLQSSYQALDQ--QFTQVKNKTSMEIQQAKKDHNVLQSEM 425

Query: 70  RRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLR-IRDKGYNLHAFNTLISQRI 128
            +   LK +  K+      K   ++   Q   ++ +E  +   +     +  N  + Q +
Sbjct: 426 DKVTALKNRLEKELEELKQKLLRSEQALQASQVKEAETKKKFEEMQREKNTLNCQLDQGM 485

Query: 129 GDHRDLPD----TRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSV--HSIMDRTDQK 182
              + L D    T   L +++   D+L    +     NE    L + +   S+    + +
Sbjct: 486 KRVKQLEDEKQNTEQILAKNRMMVDDL---KVKTQTQNEELTELRKKMDHQSVSSAQELE 542

Query: 183 HIKEIIL------VDDYSDLYNLHHDVQEAVDKLNNVIKKEEEM-IETNNIDMEXXXXXX 235
           ++K+ ++      +   ++L  L HDV+   +K+  V K+ EE+ + +N+   E      
Sbjct: 543 NLKKTLIEAEAKNMKTQAELQKLVHDVELKENKICAVEKENEELKMTSNSCQKELAEMKK 602

Query: 236 XXXXXXKKSTENSEVKNN 253
                 +  TE  ++ NN
Sbjct: 603 EYDALLQWKTEKEQLINN 620


>UniRef50_UPI000069FF36 Cluster: M-phase phosphoprotein 1 (MPP1)
           (Kinesin-related motor interacting with PIN1).; n=1;
           Xenopus tropicalis|Rep: M-phase phosphoprotein 1 (MPP1)
           (Kinesin-related motor interacting with PIN1). - Xenopus
           tropicalis
          Length = 755

 Score = 36.7 bits (81), Expect = 1.7
 Identities = 50/223 (22%), Positives = 89/223 (39%), Gaps = 19/223 (8%)

Query: 24  DRIKNNDKSIAALEGKDLLS-DQPRDVEETLSKTMWKYQDYKRQ----SEYRRKVMLKEK 78
           D  K   +S+  LE ++  S      +E+ L +   KY+ Y++     +E  RK+  K  
Sbjct: 103 DSYKEKCESLKCLEEQNKASASDTLQLEQNLKEAQTKYEAYEKDITTLNEENRKLG-KNI 161

Query: 79  FAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTR 138
              Q+ +  ++KT  D EEQ G      +L  +D          L         D  + +
Sbjct: 162 TELQEKLNAAEKTSKDKEEQVGQSAKEIELLKKDLSQRASELKVLQLDLQRKEEDCTELK 221

Query: 139 NKLCQSQQYFDELPR--------ASIIICFYNEHYETLMRSVH---SIMDRTDQKHIKEI 187
           +KL  S++   ++ +          ++    NE YE L   +     +  RT Q+  KE 
Sbjct: 222 DKLMDSKKQIQQVEKEVSGMREEKRLLTNKVNE-YEKLKNQMSRELEMKQRTIQQLKKES 280

Query: 188 ILVDDYSDLYNLHHDV-QEAVDKLNNVIKKEEEMIETNNIDME 229
              +   D+  L+    QEA +K   +   +E +IE     +E
Sbjct: 281 ADNEKNGDVMQLYQKACQEAQEKEKIIEDMKETLIEQEQTQVE 323


>UniRef50_Q82L26 Cluster: Putative secreted alpha-galactosidase;
           n=1; Streptomyces avermitilis|Rep: Putative secreted
           alpha-galactosidase - Streptomyces avermitilis
          Length = 658

 Score = 36.7 bits (81), Expect = 1.7
 Identities = 26/91 (28%), Positives = 39/91 (42%), Gaps = 10/91 (10%)

Query: 503 WFETDRSELVLGRTLCLDA-----SNNVAPILGKCHEMGGTQEWKHKGTASSPIYNTAAG 557
           W  T R ELVL    CLDA     +N    ++  C+     Q+W     +   I N  AG
Sbjct: 570 WTYTSRKELVLYGNKCLDAYNLGTTNGTKVVIWDCNGQ-ANQKWNI--NSDGTITNVNAG 626

Query: 558 MCLGV--DRSYRGETVLMVICDDYSNNKWDI 586
           +CL      +  G ++++  C    N KW +
Sbjct: 627 LCLDAYNAATANGTSLVLWSCGTGDNQKWTV 657


>UniRef50_A5EY82 Cluster: Serine protease; n=1; Dichelobacter
           nodosus VCS1703A|Rep: Serine protease - Dichelobacter
           nodosus (strain VCS1703A)
          Length = 467

 Score = 36.7 bits (81), Expect = 1.7
 Identities = 23/79 (29%), Positives = 36/79 (45%), Gaps = 3/79 (3%)

Query: 276 ADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRY--SARAVTPVIDVINADTF 333
           A   VGD+L+  + H       LPPL+     G D V++ Y    +  T  + + N +T 
Sbjct: 304 AQLKVGDILLSFNGHTINKASDLPPLVAMAPLGKD-VEIEYLRDGKKQTTTVKIENLETA 362

Query: 334 EYSPSPLVRGGFNWGLHFK 352
           + S +   R   NWG+  K
Sbjct: 363 DTSSAATSREMRNWGIELK 381


>UniRef50_A2G5G4 Cluster: Putative uncharacterized protein; n=1;
           Trichomonas vaginalis G3|Rep: Putative uncharacterized
           protein - Trichomonas vaginalis G3
          Length = 346

 Score = 36.7 bits (81), Expect = 1.7
 Identities = 22/86 (25%), Positives = 44/86 (51%)

Query: 21  DIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFA 80
           ++K R    +KS    + + LL DQ ++V+  L       + Y+++ E  +K  L+ K  
Sbjct: 21  EVKKREDLKEKSEELKKEQTLLKDQSQNVKTELESINEMIEKYEQKRETSQKDNLRYKEQ 80

Query: 81  KQQAIKMSKKTENDLEEQFGLIRNSE 106
            Q +I+     E++L+EQ  L+  ++
Sbjct: 81  LQNSIQQKSTAESELKEQKQLLEEAK 106


>UniRef50_A2EN31 Cluster: Viral A-type inclusion protein, putative;
           n=1; Trichomonas vaginalis G3|Rep: Viral A-type
           inclusion protein, putative - Trichomonas vaginalis G3
          Length = 5296

 Score = 36.7 bits (81), Expect = 1.7
 Identities = 50/239 (20%), Positives = 102/239 (42%), Gaps = 28/239 (11%)

Query: 21  DIKDRIKNNDKSIAALEGKDLLSDQ----PRDVEETLSKTMWKYQDYKRQSEYRRKVM-- 74
           ++K++I + +  I AL   +LL  Q      D +E +     + +D K+Q E + K +  
Sbjct: 378 NLKNKIADRESQIKAL---NLLIAQYQTDDEDKKEIIENLEKEIKDLKKQIEDKDKEIEV 434

Query: 75  LKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDL 134
           LK K AK + I        D E++  ++  + D+ + D       FN   ++++     +
Sbjct: 435 LKAKIAKIEEIP------EDEEDEDIVVAGTRDVDLGD-------FNEEEAEQVSLEDQV 481

Query: 135 PDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYS 194
              + KL   ++   ++ +A   +   +   E L   +  + DR D++      L    S
Sbjct: 482 KQLKEKLDDKKKNGVQMKQA---LASKDAEIEKLNEQIQELKDRNDKQEQNIEELNTKNS 538

Query: 195 DLYNLHHDVQEAVDKLNNVIKKEEEMIETNNIDMEXXXXXXXXXXXXKKSTENSEVKNN 253
           DL N + + ++ +D+L N +K   ++ +      +            K   EN+E K+N
Sbjct: 539 DLQNSNDEYKKLIDELQNQLK---DLAKNKAESSDLNNSENTKQDSEKAEDENAETKSN 594


>UniRef50_A2E7J3 Cluster: Putative uncharacterized protein; n=1;
           Trichomonas vaginalis G3|Rep: Putative uncharacterized
           protein - Trichomonas vaginalis G3
          Length = 397

 Score = 36.7 bits (81), Expect = 1.7
 Identities = 19/67 (28%), Positives = 37/67 (55%), Gaps = 1/67 (1%)

Query: 48  DVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNS-E 106
           D E  L K +    DY++Q ++ + V+  E+ A ++ IK ++   + L + F L+ N   
Sbjct: 56  DTEPQLQKELNTLVDYQKQLQFYQNVLRHEQQAAEEEIKRARLDTSQLRKSFELLENELM 115

Query: 107 DLRIRDK 113
           +L++R K
Sbjct: 116 ELQLRQK 122


>UniRef50_A5DZR6 Cluster: Putative uncharacterized protein; n=1;
           Lodderomyces elongisporus NRRL YB-4239|Rep: Putative
           uncharacterized protein - Lodderomyces elongisporus
           (Yeast) (Saccharomyces elongisporus)
          Length = 865

 Score = 36.7 bits (81), Expect = 1.7
 Identities = 25/68 (36%), Positives = 37/68 (54%), Gaps = 5/68 (7%)

Query: 64  KRQSEYRRKVMLKEKFAKQQAIKMSKKTEN---DLEEQFGLIRNSEDLRIRDKGYNLHAF 120
           K ++E  ++V+ K KF KQQ  +   KT+N   +L+E FG I   +DLR   KG     F
Sbjct: 213 KTKAEVMKEVIAKSKFYKQQRQRDYAKTQNQIDELDEDFGDI--MDDLRNTQKGVAKPQF 270

Query: 121 NTLISQRI 128
           +T   + I
Sbjct: 271 STKTPEEI 278


>UniRef50_Q58718 Cluster: DNA double-strand break repair rad50
           ATPase; n=1; Methanocaldococcus jannaschii|Rep: DNA
           double-strand break repair rad50 ATPase - Methanococcus
           jannaschii
          Length = 1005

 Score = 36.7 bits (81), Expect = 1.7
 Identities = 50/218 (22%), Positives = 89/218 (40%), Gaps = 19/218 (8%)

Query: 48  DVEETLSKTMWKYQDYKRQSEYRRKV--MLKE-KFAKQQAIKMSKKTE---NDLEEQFGL 101
           +  ETL++   +Y+ YK   +  RK+   L+E K   +  +K++K+ E    D+E+    
Sbjct: 274 EARETLNRHKDEYEKYKSLVDEIRKIESRLRELKSHYEDYLKLTKQLEIIKGDIEKLKEF 333

Query: 102 IRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFY 161
           I  S   + RD   NL      I   I     + D   +L    +  +++ +   I    
Sbjct: 334 INKS---KYRDDIDNLDTLLNKIKDEIERVETIKDLLEELKNLNEEIEKIEKYKRICEEC 390

Query: 162 NEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEMI 221
            E+YE  +          ++K ++   L  +Y  L      +++ ++ L   I K  E  
Sbjct: 391 KEYYEKYLE--------LEEKAVEYNKLTLEYITLLQEKKSIEKNINDLETRINKLLE-- 440

Query: 222 ETNNIDMEXXXXXXXXXXXXKKSTENSEVKNNVFNIRL 259
           ET NID+E            KK  EN + +    N +L
Sbjct: 441 ETKNIDIESIENSLKEIEEKKKVLENLQKEKIELNKKL 478


>UniRef50_UPI00015B6253 Cluster: PREDICTED: similar to CG33715-PD;
            n=1; Nasonia vitripennis|Rep: PREDICTED: similar to
            CG33715-PD - Nasonia vitripennis
          Length = 7697

 Score = 36.3 bits (80), Expect = 2.3
 Identities = 22/74 (29%), Positives = 41/74 (55%), Gaps = 3/74 (4%)

Query: 22   IKDRIKNNDKSIAALEGKDLLSDQPRDVE--ETLSKTMWKYQDYKRQSEYRRKVMLKEKF 79
            IK+ I+  D  I  +E K+   D+P+  E  ET+ K     +   +  ++ ++V  KE+ 
Sbjct: 3010 IKEEIQQRDSQIGKVEEKETQQDKPKKDEPNETVVKKDEHQKKESQNEKFVKQVATKEES 3069

Query: 80   AKQQAI-KMSKKTE 92
             K++A+ ++SKK E
Sbjct: 3070 RKEEAVEQVSKKEE 3083


>UniRef50_Q2JCN5 Cluster: Glycosyl transferase, family 2 precursor;
           n=1; Frankia sp. CcI3|Rep: Glycosyl transferase, family
           2 precursor - Frankia sp. (strain CcI3)
          Length = 466

 Score = 36.3 bits (80), Expect = 2.3
 Identities = 47/176 (26%), Positives = 75/176 (42%), Gaps = 19/176 (10%)

Query: 267 GLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVDGVKVRYSARAVTPVID 326
           GL RAR  G   +   V+VF D  +EV+  WL  LL   + G  GV V  +   VT +I 
Sbjct: 174 GLSRARNAGLAAATTPVVVFTDDDVEVDPRWLEFLLSGFAAG-SGV-VDETVGCVTGLIR 231

Query: 327 VINADTFEYSPSPL---VRGGFNWG-LHFKWDNLPKGTLINDEDFMKPLKSPTMAGGLFA 382
            +   T    P+ +     GGF  G +  ++D     T     D + P  +     G  +
Sbjct: 232 PLELST----PAQVWFEQFGGFGKGFVGRRFDR----TENRSGDLLYPYTAGVFGSGANS 283

Query: 383 IYR-EYFNAIGKYD----PGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRK 433
            +R +    +G +D     G    GGE+L+I   +   G  L   P + + H+ ++
Sbjct: 284 AFRTDTLRQLGGFDEFLGTGTAARGGEDLDIFLSVVRSGHVLVYEPAALIRHLHKR 339


>UniRef50_O06764 Cluster: Phase variable surface lipoprotein P78
           precursor; n=1; Mycoplasma fermentans|Rep: Phase
           variable surface lipoprotein P78 precursor - Mycoplasma
           fermentans
          Length = 680

 Score = 36.3 bits (80), Expect = 2.3
 Identities = 37/149 (24%), Positives = 74/149 (49%), Gaps = 25/149 (16%)

Query: 42  LSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGL 101
           LSD+ +D+ E L+K   +Y   K ++   ++++  EK+  ++A+    K E D  ++   
Sbjct: 172 LSDKLKDINEFLTKNKSEYDAEKAKANPNKEII--EKY--ERALIRKAKYE-DTADKDSY 226

Query: 102 IRNSEDLRIRDKGYNLHAFN--TLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIIC 159
           I++ E+L I+   Y + + N   ++S      R +P+ +N        FD+         
Sbjct: 227 IKSFENLNIK---YKIQSINDPKIVSDEFS--RSIPNFKNDSYIDYYVFDK--------- 272

Query: 160 FYNEHYETLMRSVHSIMDRTDQ--KHIKE 186
             ++ Y T+  S HS+ D T++  K+IK+
Sbjct: 273 --SDTYSTVSHSFHSVSDLTEEIKKYIKK 299


>UniRef50_A6WGG7 Cluster: Glycosyl transferase family 2; n=1;
           Kineococcus radiotolerans SRS30216|Rep: Glycosyl
           transferase family 2 - Kineococcus radiotolerans
           SRS30216
          Length = 419

 Score = 36.3 bits (80), Expect = 2.3
 Identities = 12/37 (32%), Positives = 23/37 (62%)

Query: 380 LFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCG 416
           + A+ R+ F+A+G ++     +GGE+ E + R W+ G
Sbjct: 172 VLAVTRDLFDAVGGFEEAFTAYGGEDWEFAHRCWLAG 208


>UniRef50_A0V2D1 Cluster: Glycosyl transferase, family 2; n=1;
           Clostridium cellulolyticum H10|Rep: Glycosyl
           transferase, family 2 - Clostridium cellulolyticum H10
          Length = 333

 Score = 36.3 bits (80), Expect = 2.3
 Identities = 18/48 (37%), Positives = 27/48 (56%)

Query: 244 STENSEVKNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHI 291
           ST+NS      +  ++ KT K  G+  AR  G + + GD+L FLDS +
Sbjct: 44  STDNSIEIAKKYPCKIFKTPKNGGVAAARNLGVEYASGDILFFLDSDV 91


>UniRef50_Q4Q0C8 Cluster: Phosphatidylinositol 3 kinase, putative;
            n=3; Leishmania|Rep: Phosphatidylinositol 3 kinase,
            putative - Leishmania major
          Length = 2613

 Score = 36.3 bits (80), Expect = 2.3
 Identities = 28/116 (24%), Positives = 53/116 (45%), Gaps = 5/116 (4%)

Query: 15   YRPLNWDIKDRIKNNDKSIAALEGKDLLSD----QPRDVEETLSKTMWKYQDYKRQSEYR 70
            Y PL + + D    N + + A     L +      P + +    +T  +Y+   +Q   R
Sbjct: 1178 YTPLVFPLLDGYGQNGQDVKAFTLSTLRTSGRVVAPENAQAQEEETAARYRVNTQQLAQR 1237

Query: 71   RKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQ 126
            RK  L+E FA+ ++I +++  E + E    L + + +L +R    N H F   ++Q
Sbjct: 1238 RKTALEENFAQLRSILVARDRETEEEWNLWLKQLAVEL-LRSSPSNAHGFAFALAQ 1292


>UniRef50_O97294 Cluster: Putative uncharacterized protein PFC0990c;
           n=2; Plasmodium|Rep: Putative uncharacterized protein
           PFC0990c - Plasmodium falciparum (isolate 3D7)
          Length = 753

 Score = 36.3 bits (80), Expect = 2.3
 Identities = 55/269 (20%), Positives = 114/269 (42%), Gaps = 25/269 (9%)

Query: 22  IKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVM-----LK 76
           I++  KNN         KD+  ++  D  E L++ +   Q  K+   Y +K+      L 
Sbjct: 381 IENLKKNNQIIYDKFLQKDISQNETNDTIEKLNQKLKSEQ--KQIYHYEQKINTLNDDLN 438

Query: 77  EKFAKQ-QAIKMSKKTENDL---EEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHR 132
             + K       S   +N L   E+Q G ++N  D+ I  K         + ++     R
Sbjct: 439 NSYQKYVYYYNKSDSNQNLLQQKEDQIGKLKNQLDIHINSKNDIQEKLKNIYTENNKVER 498

Query: 133 DLPDTRNKLCQSQQYFDELPRASIIICFYNEHYE-TLMRSVHSIMDRTDQKHIKEIILVD 191
           D  D +N+L +++   ++L +      ++N+ Y  +  ++   I+    +++  +  +++
Sbjct: 499 DNDDLKNELTKTKLNLEKL-KDEYEELYHNKQYVFSCYKNEEKILKENLERYKTKCAILE 557

Query: 192 DYSDLYNLHHDVQEAVDKLNNVIKKEEEMIETNNIDMEXXXXXXXXXXXXKKSTENSEVK 251
           +  D + L    ++   KL +  + E+ M E   +  E              +TE  ++K
Sbjct: 558 NQKDCHILEDKYKQLEIKLKDT-ENEKYMYERTCLMNE--------KRQNDMATEIKDLK 608

Query: 252 NNVFNI--RLLKTSKREGL-IRARLYGAD 277
           N +++   RL K SK + L ++  LY  D
Sbjct: 609 NELYDCKNRLYKMSKSDVLDMKTTLYNQD 637


>UniRef50_A2EJ43 Cluster: Viral A-type inclusion protein, putative;
           n=1; Trichomonas vaginalis G3|Rep: Viral A-type
           inclusion protein, putative - Trichomonas vaginalis G3
          Length = 940

 Score = 36.3 bits (80), Expect = 2.3
 Identities = 48/253 (18%), Positives = 112/253 (44%), Gaps = 17/253 (6%)

Query: 21  DIKDRIKN--NDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQS-EYRRKV--ML 75
           +IK++ +N   +K  +  E    + D  ++ EET    + + +DYK Q  E ++++  + 
Sbjct: 49  EIKNQNENLQKEKENSLNEMNKQIDDLQKEKEETEKALIEENEDYKNQLSELKKQIEDLQ 108

Query: 76  KEKFAKQQAIKMSKKTEN----DLEEQFGLIRNS-EDLRIRDKGYNLHAFNTL--ISQRI 128
            E   K + +K   +  N    DL++Q  L++ S  +   +D+ + +     +  + Q++
Sbjct: 109 NENEEKVENLKKENEEFNNEIKDLQDQIELLKKSMSESEDKDQKFVIELNQQIEKLKQKV 168

Query: 129 GDHRDLPDTRNK--LCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKE 186
            D +DL   +++  +   Q+  D   + + +    NE  + +      + D ++++ +K+
Sbjct: 169 SDEKDLIQVKDEEIIDLKQKNTDLSEQNNKLNEDKNELEKQIEELAQKLSDESEKEKLKQ 228

Query: 187 IILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEMI--ETNNIDMEXXXXXXXXXXXXKKS 244
            I  +  S+  N   D  + ++ L   + + E+ I  +T  ID                +
Sbjct: 229 EI-NELKSEKENSEKDFNKKLENLTQKVTELEDSISQKTREIDEAETAKEDISLKLDNLA 287

Query: 245 TENSEVKNNVFNI 257
            EN ++  N+  I
Sbjct: 288 EENEKLSQNLSEI 300


>UniRef50_A0CQE7 Cluster: Chromosome undetermined scaffold_24, whole
           genome shotgun sequence; n=3; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_24,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 336

 Score = 36.3 bits (80), Expect = 2.3
 Identities = 42/215 (19%), Positives = 92/215 (42%), Gaps = 6/215 (2%)

Query: 8   NKATKVHYRPLNWDIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQS 67
           N+  K   + L     + +  N K    L+ +  L+   ++ +++ ++ + K +D ++ +
Sbjct: 92  NQYLKQRIKQLESQNNNYVSENKKLAHVLDQQIQLNQSLQEQQQSKNQIIKKLEDVQKMN 151

Query: 68  EYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQR 127
           +Y++      K    Q I +SKK  NDLEE+  ++ N E+ ++ +           +   
Sbjct: 152 KYQQSNNSDLKQINDQLI-ISKKVVNDLEEKVQIVLN-ENQKLSELNERFQFTENQLKIE 209

Query: 128 IGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEI 187
           I  ++          + QQ   +       I   N+  E L R  + +  +  Q +  ++
Sbjct: 210 IEKYKSKCSILESKMKQQQDESKCLELQRKIKKQNDQLEILARENYQLKQQLTQNN-NQV 268

Query: 188 ILVDDYSDLYNLHHD-VQEAVDKLNNVIKKEEEMI 221
              DD   LY +H+D  +E + +L    +  ++MI
Sbjct: 269 SQQDD-KQLY-VHNDKFKETIQELECENQYLQQMI 301


>UniRef50_Q4JC59 Cluster: Conserved Archaeal membrane protein; n=4;
           Sulfolobaceae|Rep: Conserved Archaeal membrane protein -
           Sulfolobus acidocaldarius
          Length = 342

 Score = 36.3 bits (80), Expect = 2.3
 Identities = 27/99 (27%), Positives = 46/99 (46%), Gaps = 5/99 (5%)

Query: 255 FNIRLLKTSKR----EGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVD 310
           +N++++ ++K      G I A+L G  ++ GD++VF DS       WL  L+  LS  V 
Sbjct: 88  YNVKVVVSNKNCDICSGKINAQLEGLKHARGDIIVFADSDTWFPKYWLKELVSPLSNYVA 147

Query: 311 GVKVRYSARAVTPVIDVINADTFEYS-PSPLVRGGFNWG 348
                ++      + ++I A  +     S  V G F WG
Sbjct: 148 TTVFSWAKPVRLTIGNIIRAGFWTLGFESQAVGGTFLWG 186


>UniRef50_UPI000150A0FA Cluster: Type III restriction enzyme, res
           subunit family protein; n=1; Tetrahymena thermophila
           SB210|Rep: Type III restriction enzyme, res subunit
           family protein - Tetrahymena thermophila SB210
          Length = 2730

 Score = 35.9 bits (79), Expect = 3.0
 Identities = 24/125 (19%), Positives = 55/125 (44%), Gaps = 1/125 (0%)

Query: 48  DVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSED 107
           ++E+T  K   K      + + + K +    F       +SK+    LE+   L++++E+
Sbjct: 497 EIEDTF-KANAKTLRIFHKGQVKIKSIYTNNFYTAACFSISKRVITLLEQMDELVKSNEE 555

Query: 108 LRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYET 167
           + + D   NL     L  Q++GDH  L        Q +++ D + +   +       Y+ 
Sbjct: 556 VNVNDLESNLSKLKDLHEQQVGDHLQLNSQLIYQEQVKKWCDFIQQKKTVFEIDYNKYDQ 615

Query: 168 LMRSV 172
           +++++
Sbjct: 616 VLKNI 620


>UniRef50_UPI00006CF2BD Cluster: Leucine Rich Repeat family protein;
            n=1; Tetrahymena thermophila SB210|Rep: Leucine Rich
            Repeat family protein - Tetrahymena thermophila SB210
          Length = 1504

 Score = 35.9 bits (79), Expect = 3.0
 Identities = 38/209 (18%), Positives = 82/209 (39%), Gaps = 6/209 (2%)

Query: 71   RKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGD 130
            R+ +LK+K    Q     K  E ++     +IR+ E+    +K   ++  N L  +  G 
Sbjct: 977  RENILKQKQILYQMNHELKLNETNINSSSEVIRSYEN----EKQNLIYEINQLKEENFGQ 1032

Query: 131  HRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILV 190
               + + +    +       + R   I     E YE  +R    ++ +   ++ K+I   
Sbjct: 1033 KLAIEEFQQIQIKCNSQLRSIERQEKIFKEEKEAYELRIRDYEELLKQIHSEYQKQIHQK 1092

Query: 191  D-DYSDLYNLHHDVQEAVDKLNNVIK-KEEEMIETNNIDMEXXXXXXXXXXXXKKSTENS 248
            D +  +  N    VQ+ +  L   ++ K+  ++ET    +             +K  +NS
Sbjct: 1093 DIEIHESKNETRAVQKQISILQRELEFKQNTILETEKALLNRGEQSSILLRETEKKLQNS 1152

Query: 249  EVKNNVFNIRLLKTSKREGLIRARLYGAD 277
             +  N F  ++ +       ++ ++YG D
Sbjct: 1153 IMMANEFKTKIEELYSENEELKDQVYGKD 1181


>UniRef50_UPI0000499464 Cluster: DNA repair protein Rad50; n=1;
           Entamoeba histolytica HM-1:IMSS|Rep: DNA repair protein
           Rad50 - Entamoeba histolytica HM-1:IMSS
          Length = 1241

 Score = 35.9 bits (79), Expect = 3.0
 Identities = 54/265 (20%), Positives = 107/265 (40%), Gaps = 18/265 (6%)

Query: 5   QIVNKATKVHYRPLNWDIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYK 64
           Q ++   KV+    + ++  +I N  K    LE  DL +   R++E+ +++   K Q  +
Sbjct: 307 QSIDVQQKVNKEGQHQELSKQIHNQMKDQTLLEN-DLKNR--RELEKEINE---KIQGVE 360

Query: 65  RQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLI 124
              E      +K +  KQ+ I+  KK E    E      N     + ++ YN+H     I
Sbjct: 361 SVEEKVNNEKIKNE-EKQKEIEEEKKKEEKELEDMSKEVNEIKNELENRKYNVH-----I 414

Query: 125 SQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHI 184
            Q   +++   + R K  + Q+  +E+      I   NE  + L + +    +  ++K  
Sbjct: 415 KQDDVNNKTTENERKKK-RDQEIQEEINEMKKDIIKKNEEIDDLKKQLSK--ESFEEKEQ 471

Query: 185 KEIILVDDY-SDLYNLHHDVQEAVDKLNNVIKKEEEM--IETNNIDMEXXXXXXXXXXXX 241
           K  I +++   D+  + +++  A++ +   IK E  M  I  N  ++E            
Sbjct: 472 KSKIKLEEIKKDIEEIDNEINRALENIQQQIKIERLMKEINENKTELENFKLTVGKDLQG 531

Query: 242 KKSTENSEVKNNVFNIRLLKTSKRE 266
           K+      +K     I  +K    E
Sbjct: 532 KEKDIKETIKKQKNEILSMKNDSEE 556


>UniRef50_Q1WV23 Cluster: Superfamily II DNA and RNA helicase; n=1;
           Lactobacillus salivarius subsp. salivarius UCC118|Rep:
           Superfamily II DNA and RNA helicase - Lactobacillus
           salivarius subsp. salivarius (strain UCC118)
          Length = 788

 Score = 35.9 bits (79), Expect = 3.0
 Identities = 22/77 (28%), Positives = 34/77 (44%), Gaps = 3/77 (3%)

Query: 141 LCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEII---LVDDYSDLY 197
           +C     FD L +        N  Y  L+   H+++ R+   + KEI    + D    L 
Sbjct: 347 ICDYNYLFDPLVKLQRFFTERNYDYTFLLDEAHNLVSRSRDMYTKEISSQQIKDLLDKLQ 406

Query: 198 NLHHDVQEAVDKLNNVI 214
            L H  Q+ VDKLN ++
Sbjct: 407 TLPHPPQKIVDKLNTLL 423


>UniRef50_Q7RT92 Cluster: Phosphatidylinositol transfer protein 2;
           n=4; Plasmodium (Vinckeia)|Rep: Phosphatidylinositol
           transfer protein 2 - Plasmodium yoelii yoelii
          Length = 738

 Score = 35.9 bits (79), Expect = 3.0
 Identities = 28/113 (24%), Positives = 48/113 (42%), Gaps = 5/113 (4%)

Query: 188 ILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEMIETNNIDMEXXXXXXXXXXXXKKSTEN 247
           I V+DY +L N+++DV+ + D + N ++    +     I  E            K++  N
Sbjct: 597 IKVEDYKNLMNIYNDVENS-DNIENEVQPLNNL--NGKIKNENDKLSYIFKKNDKENENN 653

Query: 248 SEVKNNVFNIRLLKTSKREGL--IRARLYGADNSVGDVLVFLDSHIEVNVGWL 298
              KN V N   L+ SK      I   L   +N+    L   D  +++N  W+
Sbjct: 654 VNNKNGVNNHNTLQISKNNKYENIPITLQLFNNATDKKLKINDKKLKINDSWI 706


>UniRef50_A2ETW9 Cluster: Viral A-type inclusion protein, putative;
           n=1; Trichomonas vaginalis G3|Rep: Viral A-type
           inclusion protein, putative - Trichomonas vaginalis G3
          Length = 2010

 Score = 35.9 bits (79), Expect = 3.0
 Identities = 42/208 (20%), Positives = 90/208 (43%), Gaps = 18/208 (8%)

Query: 22  IKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAK 81
           I+   +++ K IA  E  D L +Q  ++ E  S   +   + K Q+E     + KE    
Sbjct: 788 IESENESSSKIIALTEEIDELKNQINNISEQKSTLEFTIDEIKAQNESEISQLKKENEDL 847

Query: 82  QQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKL 141
              I+   K  N+L+ +   I+NS  L + +   N    N L +  + +  D+    N+ 
Sbjct: 848 NSKIESLSKENNELKTEIENIQNSHSLSLLETEMN----NKLTN--LNEENDMLKNENEN 901

Query: 142 C--QSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNL 199
              + ++   E       + F+ ++   +        +  D++  K I+L      L   
Sbjct: 902 IKREKEETLAENKSLKDTLDFFEKNLTKINEQNKDKTEELDKQ--KRIVLT-----LTGE 954

Query: 200 HHDVQEAVDKLNN---VIKKEEEMIETN 224
           +++++  +DK+ N   +++KE E +E++
Sbjct: 955 NNELKSKLDKIKNDYELLQKENEKLESD 982


>UniRef50_Q0W1H0 Cluster: Putative glycosyltransferase; n=2;
           uncultured methanogenic archaeon RC-I|Rep: Putative
           glycosyltransferase - Uncultured methanogenic archaeon
           RC-I
          Length = 234

 Score = 35.9 bits (79), Expect = 3.0
 Identities = 25/87 (28%), Positives = 43/87 (49%), Gaps = 3/87 (3%)

Query: 367 FMKPLKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSR 426
           FM  + +P +AG  FA+ RE F+  G +D  +    GE++E+  RI    G     P S 
Sbjct: 137 FMARINNPAVAGANFAVTREAFDKAGGFDESLVT--GEDIELCKRIKRY-GRFVFNPDSL 193

Query: 427 VGHVFRKRRPYGVGEKQDYMLQNSMRM 453
           V    R+ R +G      + + N++++
Sbjct: 194 VYVSMRRVREWGYARFVAFHVTNTIKV 220


>UniRef50_A2STK1 Cluster: Glycosyl transferase, family 2; n=2;
           Methanomicrobiales|Rep: Glycosyl transferase, family 2 -
           Methanocorpusculum labreanum (strain ATCC 43576 / DSM
           4855 / Z)
          Length = 238

 Score = 35.9 bits (79), Expect = 3.0
 Identities = 25/104 (24%), Positives = 41/104 (39%), Gaps = 2/104 (1%)

Query: 207 VDKLNNVIKKEEEMIETNNIDMEXXXXXXXXXXXXKKSTENSEVKNNVFNIRLLKTSKRE 266
           V+ L   I K  E +E      E            ++  E  E K+    +RLL + +R+
Sbjct: 16  VEALKTAIPKSIEALEAYGKSFELIIAEDGSTDGSRECVEEWERKDP--RVRLLHSDERQ 73

Query: 267 GLIRARLYGADNSVGDVLVFLDSHIEVNVGWLPPLLKRLSQGVD 310
           G  RA       S G++  + D  +  ++  L  LL  +  G D
Sbjct: 74  GRGRALNRALAESRGEIFCYYDVDLATDISHLSELLDHIEDGAD 117


>UniRef50_UPI000150A21E Cluster: hypothetical protein TTHERM_00191160;
            n=1; Tetrahymena thermophila SB210|Rep: hypothetical
            protein TTHERM_00191160 - Tetrahymena thermophila SB210
          Length = 1590

 Score = 35.5 bits (78), Expect = 4.0
 Identities = 30/142 (21%), Positives = 57/142 (40%), Gaps = 4/142 (2%)

Query: 113  KGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSV 172
            K +N   +NTL      D +D  D+ N+    Q   + LP+    + F      ++   V
Sbjct: 1397 KIFNQERYNTLTEMNAKDLQDQSDS-NQEFSKQNSQNNLPKQESKVTFPTGKKNSIKSDV 1455

Query: 173  HSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEMIET-NNIDMEXX 231
                     K  K ++  +D++ +YN + D +      NN + ++EE +E+   I  E  
Sbjct: 1456 SKSQLSNKSKAFKNVV-ANDFN-VYNSNVDQEAVAANQNNYMLEDEEEVESQGQISNEDD 1513

Query: 232  XXXXXXXXXXKKSTENSEVKNN 253
                      + ++ +  V NN
Sbjct: 1514 HSQYQQNNRNQTNSNHQNVNNN 1535


>UniRef50_UPI00006CF1FD Cluster: hypothetical protein
           TTHERM_00540130; n=1; Tetrahymena thermophila SB210|Rep:
           hypothetical protein TTHERM_00540130 - Tetrahymena
           thermophila SB210
          Length = 1215

 Score = 35.5 bits (78), Expect = 4.0
 Identities = 26/97 (26%), Positives = 44/97 (45%), Gaps = 2/97 (2%)

Query: 5   QIVNKATKVHYRPLNWDIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWK--YQD 62
           Q + K  ++  +P    IKD+I+  +K  AA+E   L    P+ + + L K +    Y +
Sbjct: 604 QFILKKNQILKQPSLETIKDKIEYYNKIEAAIEQTQLKESTPQQLRDRLFKLIRNMIYCE 663

Query: 63  YKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQF 99
            K   +Y     +KEK  K  A  ++   + D  E F
Sbjct: 664 LKLTFKYEMIPQMKEKLQKSIADVLNTDIQADQNEYF 700


>UniRef50_Q8KU52 Cluster: EF0109; n=1; Enterococcus faecalis|Rep:
           EF0109 - Enterococcus faecalis (Streptococcus faecalis)
          Length = 1924

 Score = 35.5 bits (78), Expect = 4.0
 Identities = 22/64 (34%), Positives = 33/64 (51%), Gaps = 1/64 (1%)

Query: 251 KNNVFNIRLLKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVNVGWLP-PLLKRLSQGV 309
           K N+++  +L T KR GL+RA L G  N   D+    D+  EV    +P  +  R  +  
Sbjct: 441 KTNIWSSGILSTDKRGGLVRAALVGRTNGTIDIYFKDDTPQEVLNSEIPFQVWARFKEKK 500

Query: 310 DGVK 313
           +GVK
Sbjct: 501 EGVK 504


>UniRef50_A5ZRG2 Cluster: Putative uncharacterized protein; n=1;
           Ruminococcus obeum ATCC 29174|Rep: Putative
           uncharacterized protein - Ruminococcus obeum ATCC 29174
          Length = 927

 Score = 35.5 bits (78), Expect = 4.0
 Identities = 19/63 (30%), Positives = 35/63 (55%), Gaps = 4/63 (6%)

Query: 163 EHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEMIE 222
           E  ET ++ V+   D  D   I EI+  DD  DL + + D++     L++++KK ++++E
Sbjct: 542 EEIETFIKQVNLTYDLIDDSEINEIL--DD--DLKDFNSDIESIYKILHDIVKKSDKIVE 597

Query: 223 TNN 225
             N
Sbjct: 598 KIN 600


>UniRef50_A4JHB1 Cluster: Putative uncharacterized protein; n=1;
           Burkholderia vietnamiensis G4|Rep: Putative
           uncharacterized protein - Burkholderia vietnamiensis
           (strain G4 / LMG 22486) (Burkholderia cepacia (strain
           R1808))
          Length = 218

 Score = 35.5 bits (78), Expect = 4.0
 Identities = 33/124 (26%), Positives = 60/124 (48%), Gaps = 9/124 (7%)

Query: 41  LLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTEND--LEEQ 98
           LL+   +++EE  +K +      K   + + KV  KE+ A +Q  K S + E+D   ++ 
Sbjct: 83  LLNIPDKEIEEIATKIIAVVNGAKLPKKQQSKVEQKEEVASEQINKWSNQFEDDKKSDDP 142

Query: 99  FG--LIRNSED-LRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQS--QQYFDELPR 153
           F   L+ N E  ++  +   N+  F  L+ +  G  R+L DT   + +S  QQ  D L +
Sbjct: 143 FAITLVNNFESGIKEEEPKINILEFMKLVGK--GFDRELTDTEKSVVKSYYQQNSDLLAK 200

Query: 154 ASII 157
             ++
Sbjct: 201 DQLV 204


>UniRef50_Q7RSH4 Cluster: Putative uncharacterized protein PY00383;
           n=2; Plasmodium (Vinckeia)|Rep: Putative uncharacterized
           protein PY00383 - Plasmodium yoelii yoelii
          Length = 859

 Score = 35.5 bits (78), Expect = 4.0
 Identities = 26/104 (25%), Positives = 51/104 (49%), Gaps = 5/104 (4%)

Query: 161 YNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEM 220
           Y + Y T  +    I  R ++  IK I   + YSDL  L  +V   +DKLNN  KK +++
Sbjct: 155 YKKEYTTHQKCNKEI-SRQNKDGIKNIYSNNKYSDLPKLRKNV---LDKLNNADKKNDQL 210

Query: 221 IETNNIDMEXXXXXXXXXXXXKKSTENSEVKNNVFNIRLLKTSK 264
             T +++++            K+   N++ ++++    + +T+K
Sbjct: 211 F-TLSLNVKNNKKSKLNYIDTKEKNYNNKSQSHIDKPNVTRTNK 253


>UniRef50_Q7RBX2 Cluster: Mature-parasite-infected erythrocyte
           surface antigen; n=5; Plasmodium|Rep:
           Mature-parasite-infected erythrocyte surface antigen -
           Plasmodium yoelii yoelii
          Length = 761

 Score = 35.5 bits (78), Expect = 4.0
 Identities = 22/75 (29%), Positives = 41/75 (54%), Gaps = 3/75 (4%)

Query: 23  KDRIKNNDKSIAALEGKDLLSDQPRDVEET--LSKTMWKYQDYKRQSEYRRKVMLKEKFA 80
           +++IKN ++SI   E KD++S++P  +EE   L K   K+ D+  +   ++  M+ +K  
Sbjct: 488 REKIKN-ERSIFLKEIKDMVSNKPEKIEEENYLKKLEEKFTDFDDKILRKKMKMMSKKKN 546

Query: 81  KQQAIKMSKKTENDL 95
           +   +     T NDL
Sbjct: 547 RMNNLSNVGMTSNDL 561


>UniRef50_Q4Z6V7 Cluster: Putative uncharacterized protein; n=3;
           Plasmodium (Vinckeia)|Rep: Putative uncharacterized
           protein - Plasmodium berghei
          Length = 746

 Score = 35.5 bits (78), Expect = 4.0
 Identities = 20/75 (26%), Positives = 37/75 (49%), Gaps = 3/75 (4%)

Query: 41  LLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFG 100
           +L D   +++  + K    +  Y ++ EY+ K++       ++ IK      N++ E   
Sbjct: 74  ILKDNKNNLKLYIKKIRSNFNSYWKECEYQLKILKTYYIENEKLIKEIVDENNNMIES-- 131

Query: 101 LIRNSEDLRIRDKGY 115
            I N+EDL+I DK Y
Sbjct: 132 -ITNNEDLKIDDKTY 145


>UniRef50_Q23YT2 Cluster: Kinesin motor domain containing protein;
           n=1; Tetrahymena thermophila SB210|Rep: Kinesin motor
           domain containing protein - Tetrahymena thermophila
           SB210
          Length = 1736

 Score = 35.5 bits (78), Expect = 4.0
 Identities = 41/180 (22%), Positives = 85/180 (47%), Gaps = 20/180 (11%)

Query: 43  SDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLI 102
           SD+  D+++ L K   KY   K+Q E  + + +  K  + + IK SK+T+   +EQ  + 
Sbjct: 732 SDKVVDIKQ-LEK---KYLQKKQQREPTKFIYINSKKIEVKKIKNSKRTKKLSQEQSSIN 787

Query: 103 RNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYN 162
           ++S      ++GY+ H  N++    + D  +  +  N++       D    +  I+   N
Sbjct: 788 KSS-----INEGYDSHQ-NSIEKYYVQDSLNQGNPNNQIFSQ----DHSLISQFILNKTN 837

Query: 163 EHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEMIE 222
           EH +  +    SI+D    + I +++     S+++ L    Q  +  LNN I   +++++
Sbjct: 838 EHNQNNL----SILDNERYQSINQLLQPSTESNIHTLKTSNQ--IGNLNNNISNNQDILK 891


>UniRef50_Q22YR7 Cluster: Cyclic nucleotide-binding domain containing
            protein; n=1; Tetrahymena thermophila SB210|Rep: Cyclic
            nucleotide-binding domain containing protein -
            Tetrahymena thermophila SB210
          Length = 1484

 Score = 35.5 bits (78), Expect = 4.0
 Identities = 16/68 (23%), Positives = 45/68 (66%), Gaps = 6/68 (8%)

Query: 50   EETLSKTMWKYQDYK-RQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQ----FGLIRN 104
            ++ +++T+ +++  K +Q+ Y+++++L +KF + QA +  +K+ +DL ++    F + +N
Sbjct: 1291 QQDITETIQQFEQEKGKQNNYQKQILLYKKFKQNQAFENEQKSSHDLNKEGKNNFSIFQN 1350

Query: 105  SEDLRIRD 112
             + L ++D
Sbjct: 1351 PQ-LNLKD 1357


>UniRef50_A2FX23 Cluster: Formin Homology 2 Domain containing
           protein; n=1; Trichomonas vaginalis G3|Rep: Formin
           Homology 2 Domain containing protein - Trichomonas
           vaginalis G3
          Length = 2354

 Score = 35.5 bits (78), Expect = 4.0
 Identities = 54/226 (23%), Positives = 101/226 (44%), Gaps = 22/226 (9%)

Query: 9   KATKVHYRPLNWDIKDRIK-NNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDY-KRQ 66
           +A K  Y+ L   I+++ K   +K     E +DL + + +  E T+       + Y  R 
Sbjct: 491 RAIKQSYKKLQDQIEEKTKIEGEKEEMKKENEDLKA-RLKTAESTIVIQKAAAESYTSRV 549

Query: 67  SEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQ 126
           ++ ++K+   E   +QQ     +K +N   E+  L   S+ L+ +++  +     +L SQ
Sbjct: 550 NDLQQKLAEYESKLQQQISANEEKIKNQENEKVTL---SQKLKEQEEE-SRKIIESLQSQ 605

Query: 127 RIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKE 186
                +DL    N++  + Q  +E+      +   N+ YETL +   +  DRT    ++E
Sbjct: 606 S----KDLQKMNNEMQVNLQ--NEISILKSKLTESNQKYETLEQKSSNESDRTASA-LQE 658

Query: 187 IILVDDYSDLYNLHHDVQEAVDKLNNVIKKEE---EMIETNNIDME 229
           +   +      NL  D++    KLN + K+ E     IE  N D+E
Sbjct: 659 LKTQNK-----NLESDIENLTSKLNEITKQNEMKSREIERLNADIE 699


>UniRef50_A2F112 Cluster: Putative uncharacterized protein; n=1;
           Trichomonas vaginalis G3|Rep: Putative uncharacterized
           protein - Trichomonas vaginalis G3
          Length = 376

 Score = 35.5 bits (78), Expect = 4.0
 Identities = 42/233 (18%), Positives = 95/233 (40%), Gaps = 20/233 (8%)

Query: 22  IKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAK 81
           I+D  ++ DK I   E K+ L     D+E+ +S       + +      + + + E   K
Sbjct: 141 IRDYCRSLDKKIEIEESKNYLKQTITDLEKKIS-------EREGNLNKEKSIEVTEHHQK 193

Query: 82  QQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKL 141
            + IK +K  E  L+++   +R   D    +   N +   T ++  + +  +L     ++
Sbjct: 194 SEEIK-NKNVE--LQKEIQKLRLDVDKETNEHKNNDNMHITRMNNALSEKTELTRKIEQM 250

Query: 142 CQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHH 201
            Q++Q  DE+          +  Y  + + +++ +++  +   ++ IL  DY        
Sbjct: 251 SQTKQ--DEIK--------LDSEYRAISKVLNASLEQLTEIQNQKEILAKDYEGEKQKID 300

Query: 202 DVQEAVDKLNNVIKKEEEMIETNNIDMEXXXXXXXXXXXXKKSTENSEVKNNV 254
           D  E ++ LN  I+   + I+ N   +E               TE   +++++
Sbjct: 301 DYHEKINNLNKKIQDLNDKIQNNKKTLETELYHGETSKMLNYLTEQRNLRDSL 353


>UniRef50_A2ERV7 Cluster: Putative uncharacterized protein; n=1;
           Trichomonas vaginalis G3|Rep: Putative uncharacterized
           protein - Trichomonas vaginalis G3
          Length = 1347

 Score = 35.5 bits (78), Expect = 4.0
 Identities = 18/69 (26%), Positives = 40/69 (57%), Gaps = 1/69 (1%)

Query: 30  DKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQQAIKMSK 89
           +K +   E K+  S +P +++E +  T+ + ++  ++ E  +K  +KE+  KQ+ IK  K
Sbjct: 787 EKKVEETEIKENTSTKPNEIKEEIKPTIEEKKEETKEIEQTKKDEIKEEPKKQEEIK-QK 845

Query: 90  KTENDLEEQ 98
           + E  +++Q
Sbjct: 846 EAEQPVQKQ 854


>UniRef50_A2EQA8 Cluster: Putative uncharacterized protein; n=1;
           Trichomonas vaginalis G3|Rep: Putative uncharacterized
           protein - Trichomonas vaginalis G3
          Length = 1190

 Score = 35.5 bits (78), Expect = 4.0
 Identities = 36/181 (19%), Positives = 88/181 (48%), Gaps = 16/181 (8%)

Query: 59  KYQDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTE-NDLEEQFGLIRNSEDLRIRDKGYNL 117
           +Y     Q  Y+++  +K+K  + Q  K +   + NDL+++ G + ++E   + +K   +
Sbjct: 660 EYTQQIEQKLYQKQEEMKKKDEENQKEKENLMNQINDLKKKLGDL-STEKRNLNEK---M 715

Query: 118 HAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLM----RSVH 173
                +  + I + ++  +T   L  + +Y +E+ +   ++    EH + L+    ++++
Sbjct: 716 EKEKDIFEEEIENLKEDNETIKNL--NDKYVEEINKLKELVAEELEHNKQLIEAHEKAMN 773

Query: 174 SIMDRTDQKHIKEIILVDDYSD-LYNL----HHDVQEAVDKLNNVIKKEEEMIETNNIDM 228
            + +  ++KH KE++ V D  D L+NL      +  EA+ ++ N  + E   ++    + 
Sbjct: 774 DLQNDIEEKHQKELMQVGDEIDKLHNLITKKEEENTEALKQVKNQFRDEINKLKNEKEEA 833

Query: 229 E 229
           E
Sbjct: 834 E 834


>UniRef50_A0CXR3 Cluster: Chromosome undetermined scaffold_30, whole
           genome shotgun sequence; n=4; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_30,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 1104

 Score = 35.5 bits (78), Expect = 4.0
 Identities = 39/196 (19%), Positives = 90/196 (45%), Gaps = 15/196 (7%)

Query: 47  RDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLE--EQFGLIRN 104
           +++++ L  +  + Q Y++Q++   K + + +  +QQ +K +   + +L+  +Q   + N
Sbjct: 247 QELQDLLEASETQLQKYQQQNDKLNKQIKELQQKEQQLLKENLNAKENLQQCDQLQNLLN 306

Query: 105 SEDLRIRDKGYNLHAFNTLISQRIGDHRD-----LPDTRNKLCQSQQYFDELPRASIIIC 159
           SE   +R +  +L+  N  + ++  D ++     L +      +SQQ  D   +    I 
Sbjct: 307 SELNDMRSRNESLNQLNQQLDRQNRDFKNECELTLKELTEVKRKSQQQMDLNLQLDEEIE 366

Query: 160 FYNEHYETLMRSVH-------SIMDRTDQKHIKEI-ILVDDYSDLYNLHHDVQEAVDKLN 211
            Y    E +    H        ++D+  +K  ++I  L +   +  N+    QE +D+L 
Sbjct: 367 QYKVEIEQIKTKKHQEISKQRELLDQLKEKSNQKINELKNKLKEAQNIEQYQQEQLDELQ 426

Query: 212 NVIKKEEEMIETNNID 227
            +IK+ E  ++   I+
Sbjct: 427 ELIKQSENQLKQLQIN 442


>UniRef50_Q0UYN5 Cluster: Putative uncharacterized protein; n=1;
           Phaeosphaeria nodorum|Rep: Putative uncharacterized
           protein - Phaeosphaeria nodorum (Septoria nodorum)
          Length = 693

 Score = 35.5 bits (78), Expect = 4.0
 Identities = 36/145 (24%), Positives = 63/145 (43%), Gaps = 11/145 (7%)

Query: 21  DIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFA 80
           D     K ++   AALE      +Q R    +  K   K      + E  R+ + +E+ A
Sbjct: 263 DTSKSSKRDEAHDAALEELRQELEQERAARLSAEKASKKSSSDSAEIEELRQELEQERHA 322

Query: 81  KQQAIKMSKK---TENDLEEQF--GLIRNSEDLRIRDKGY-----NLHAFNTLISQRIGD 130
           +Q+A K SKK   T+N   E+    L     + + ++K Y      L   NT++  ++  
Sbjct: 323 RQKAEKASKKGTQTDNSQSEEIKKALEEEKRERKKQEKEYTKTLAELQGRNTVLDDKLSA 382

Query: 131 HRD-LPDTRNKLCQSQQYFDELPRA 154
            R+ L  T+ KL + +   + + RA
Sbjct: 383 FREKLRTTKEKLKEKEAELERVDRA 407


>UniRef50_P37709 Cluster: Trichohyalin; n=2; Eutheria|Rep:
           Trichohyalin - Oryctolagus cuniculus (Rabbit)
          Length = 1407

 Score = 35.5 bits (78), Expect = 4.0
 Identities = 20/104 (19%), Positives = 50/104 (48%)

Query: 23  KDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQ 82
           ++R    ++ +   E ++L  ++ R + E       + ++  R+ E  RK+  +E+  +Q
Sbjct: 648 RERKLREEEQLLRREEQELRQERERKLREEEQLLQEREEERLRRQERARKLREEEQLLRQ 707

Query: 83  QAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQ 126
           +  ++ ++ E  L E+  L+R  E L  +++   L     L+ +
Sbjct: 708 EEQELRQERERKLREEEQLLRREEQLLRQERDRKLREEEQLLQE 751


>UniRef50_UPI00006CB606 Cluster: hypothetical protein
           TTHERM_00444160; n=1; Tetrahymena thermophila SB210|Rep:
           hypothetical protein TTHERM_00444160 - Tetrahymena
           thermophila SB210
          Length = 2098

 Score = 35.1 bits (77), Expect = 5.3
 Identities = 23/69 (33%), Positives = 41/69 (59%), Gaps = 5/69 (7%)

Query: 37  EGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLE 96
           EG+  +S + R+V   L K M + ++Y ++ E+++K+  KEK  K QAIK  KK    ++
Sbjct: 792 EGQQPISKEEREVR--LKKFMEQVEEYTKEHEFQQKI--KEK-QKAQAIKFEKKKIEKIK 846

Query: 97  EQFGLIRNS 105
           ++  L  +S
Sbjct: 847 KKKKLASSS 855


>UniRef50_UPI0000498399 Cluster: Viral A-type inclusion protein
            repeat; n=1; Entamoeba histolytica HM-1:IMSS|Rep: Viral
            A-type inclusion protein repeat - Entamoeba histolytica
            HM-1:IMSS
          Length = 1387

 Score = 35.1 bits (77), Expect = 5.3
 Identities = 47/256 (18%), Positives = 109/256 (42%), Gaps = 22/256 (8%)

Query: 4    NQIVNKATKVHYRPLNWDIKDRIKNNDKSIAALEGKDLLSDQPRDVE-ETLSKTMWKYQD 62
            N+ +N+  K     +  + + +I N +K I  ++ K+        ++ E ++K   + ++
Sbjct: 945  NEQINEINK-EKENIQKEFEIQIDNKNKEINEIKEKNEKEINEIKIQIEEMNKEKNQLEN 1003

Query: 63   YKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNT 122
             K+Q E   +++ KE   K++  K       + E++   IRN  + + R+ G  +     
Sbjct: 1004 LKKQLENENEIIKKENKKKEEENKEMGYLIKENEKKIESIRNEINSKERELGTKIK---- 1059

Query: 123  LISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQK 182
             + + I + +D+ +        + +  E+   +I I       E     +  I+ + D+ 
Sbjct: 1060 -LIEMIKNEKDIME--------KDFKKEVDNKNIEIKRLQIDIEKKKNDITLIIQKNDED 1110

Query: 183  HIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEMIETNNIDMEXXXXXXXXXXXXK 242
              K I       +  NL+ ++++   + N+V +KE+E I     D++            +
Sbjct: 1111 KKKSI------EEKKNLNQEIEKIKSEKNDV-QKEKEQILLEKEDLQSDFNKYKTQMENE 1163

Query: 243  KSTENSEVKNNVFNIR 258
            K     E +NN+ N++
Sbjct: 1164 KLQIKEEHENNITNLQ 1179


>UniRef50_Q97H37 Cluster: Glycosyltransferase domain containing
           protein; n=1; Clostridium acetobutylicum|Rep:
           Glycosyltransferase domain containing protein -
           Clostridium acetobutylicum
          Length = 937

 Score = 35.1 bits (77), Expect = 5.3
 Identities = 20/66 (30%), Positives = 38/66 (57%), Gaps = 3/66 (4%)

Query: 150 ELPRASIIICFYN-EHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVD 208
           E P  SII+C  N + Y  L +S+  I+++  +    E+I+VDD  D   L+  +++ ++
Sbjct: 21  ESPNVSIILCLINIKDYNKLDKSISLILNQEYRNF--ELIVVDDGQDDELLYTKIKDYIN 78

Query: 209 KLNNVI 214
           K N ++
Sbjct: 79  KDNRIV 84


>UniRef50_Q8EWP8 Cluster: Predicted cytoskeletal protein; n=1;
            Mycoplasma penetrans|Rep: Predicted cytoskeletal protein
            - Mycoplasma penetrans
          Length = 3317

 Score = 35.1 bits (77), Expect = 5.3
 Identities = 24/91 (26%), Positives = 47/91 (51%), Gaps = 5/91 (5%)

Query: 39   KDLLSDQPRDVEETLSKTMWKYQDYKR--QSEYRRKVM-LKEKFAK--QQAIKMSKKTEN 93
            K L +D   + +  + K + K++D K   Q+E+ +++   KE   K  ++ I   ++  +
Sbjct: 3047 KLLFNDSSENQDPEIKKILSKFEDSKEIIQNEFNQELQAFKESIFKVREKEINDYRQQVS 3106

Query: 94   DLEEQFGLIRNSEDLRIRDKGYNLHAFNTLI 124
            D+E++   IRNS  +   +K   L A + LI
Sbjct: 3107 DIEKEVLKIRNSNSINDENKNKELEAIDNLI 3137


>UniRef50_Q50EX9 Cluster: P-553; n=5; Borrelia|Rep: P-553 - Borrelia
           hermsii
          Length = 760

 Score = 35.1 bits (77), Expect = 5.3
 Identities = 53/275 (19%), Positives = 104/275 (37%), Gaps = 12/275 (4%)

Query: 22  IKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAK 81
           +KD +  N   ++ L+ K  +++    +++ L K   K  +   +     K  LK+   K
Sbjct: 316 LKDALHKNTHKLSELDNK--INNNKETLKDALHKNTHKLSELDNKIN-NNKETLKDALLK 372

Query: 82  Q--QAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRN 139
              +  ++  K  N+ E     I+   DL  +    N    N  I QR+ +   L D  N
Sbjct: 373 NTHKLSELDDKINNNKETLNNNIQRLSDLDDKINN-NKETLNNNI-QRLSN---LDDKIN 427

Query: 140 KLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNL 199
                +   + + R S +    N + ETL  ++  + D  D+ +  +  L ++   L +L
Sbjct: 428 N--NKETLNNNIQRLSDLDDKINNNKETLNNNIQRLSDLDDKINNNKETLNNNIQRLSDL 485

Query: 200 HHDVQEAVDKLNNVIKKEEEMIETNNIDMEXXXXXXXXXXXXKKSTENSEVKNNVFNIRL 259
              +    + LNN I++  ++ +  N + E                 N+E     F   L
Sbjct: 486 DDKINNNKETLNNNIQRLSDLDDKINNNKETLNNNIQRLSDLDDKINNNEDTLLAFQKEL 545

Query: 260 LKTSKREGLIRARLYGADNSVGDVLVFLDSHIEVN 294
           +      G     +    +S+  V+  L   I+ N
Sbjct: 546 IDLKNNHGTSLKNIENNLSSIEKVINILKDKIDKN 580


>UniRef50_Q1FP00 Cluster: Helix-turn-helix, AraC type precursor;
           n=1; Clostridium phytofermentans ISDg|Rep:
           Helix-turn-helix, AraC type precursor - Clostridium
           phytofermentans ISDg
          Length = 784

 Score = 35.1 bits (77), Expect = 5.3
 Identities = 22/87 (25%), Positives = 39/87 (44%)

Query: 11  TKVHYRPLNWDIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYR 70
           TK  Y P+N  +K  I  N      L+ +D L    +  E  +S+   +  D K   +Y 
Sbjct: 330 TKHIYVPVNKVVKQFIHKNKNISNDLKIEDELGFIVKSYENAVSQISIQQSDLKSSKKYI 389

Query: 71  RKVMLKEKFAKQQAIKMSKKTENDLEE 97
           R   +K    + + + + +  +ND+EE
Sbjct: 390 RSYWIKRLLMESKMLSLEELKKNDVEE 416


>UniRef50_Q0TSM9 Cluster: Bacterial sugar transferase family
           protein; n=3; Bacteria|Rep: Bacterial sugar transferase
           family protein - Clostridium perfringens (strain ATCC
           13124 / NCTC 8237 / Type A)
          Length = 200

 Score = 35.1 bits (77), Expect = 5.3
 Identities = 24/95 (25%), Positives = 43/95 (45%), Gaps = 3/95 (3%)

Query: 133 DLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDD 192
           +LP   N L     +    P     + FY E Y+ +++    I D    ++I E  ++  
Sbjct: 96  ELPQLYNVLKGDMSFVGPRPEVKKYVKFYEEEYDEILKIKPGITDLASIEYIDENTIISK 155

Query: 193 YSDLYNLHHDVQEAVDKLNNVIKKE-EEMIETNNI 226
           YSD   ++  ++E + K   + K+  EEM   N+I
Sbjct: 156 YSDPEKVY--IEEVLPKKLMLNKRYIEEMSIKNDI 188


>UniRef50_A3ZWW6 Cluster: Putative uncharacterized protein; n=1;
           Blastopirellula marina DSM 3645|Rep: Putative
           uncharacterized protein - Blastopirellula marina DSM
           3645
          Length = 286

 Score = 35.1 bits (77), Expect = 5.3
 Identities = 21/83 (25%), Positives = 36/83 (43%), Gaps = 1/83 (1%)

Query: 374 PTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSL-ELIPCSRVGHVFR 432
           P M GG  A++R+ +  +  YD     WG E+ ++  R++  G      +  +R  H++ 
Sbjct: 175 PRMKGGNIALWRDDYETVNGYDQDFVGWGLEDSDLQRRLYQAGVRFRSSMRWTRTHHLWH 234

Query: 433 KRRPYGVGEKQDYMLQNSMRMAR 455
            R P  V        +  MR  R
Sbjct: 235 ARDPSYVARASGTDNEKLMRANR 257


>UniRef50_A3N1J8 Cluster: Type I restriction enzyme, R subunit; n=4;
           Bacteria|Rep: Type I restriction enzyme, R subunit -
           Actinobacillus pleuropneumoniae serotype 5b (strain L20)
          Length = 1044

 Score = 35.1 bits (77), Expect = 5.3
 Identities = 25/86 (29%), Positives = 42/86 (48%), Gaps = 4/86 (4%)

Query: 110 IRDKGYNLHAFNTLISQRIGDH-RDLPDTRNKLCQSQ-QYFDELPRASIIICFYNEHYET 167
           I D        + L +Q+  ++ +DL   R KL ++   +FDELP+ +     YN    T
Sbjct: 574 ICDSSDQARMMHFLFNQKYAENPQDLTAYREKLAENDPHFFDELPKVAESTATYNLAKRT 633

Query: 168 LMRSVHSIMDRTDQKHIKEIILVDDY 193
           +  +   + D  D+ + K+  LVDDY
Sbjct: 634 VTSASLILSDEGDKTYRKD--LVDDY 657


>UniRef50_A0W610 Cluster: Glycosyl transferase, family 2; n=2;
           Geobacter|Rep: Glycosyl transferase, family 2 -
           Geobacter lovleyi SZ
          Length = 280

 Score = 35.1 bits (77), Expect = 5.3
 Identities = 26/100 (26%), Positives = 43/100 (43%), Gaps = 4/100 (4%)

Query: 371 LKSPTMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHV 430
           ++S T+ G  F  +R  F  IG +D    +   E+ ++  R    G  L  +  + + H 
Sbjct: 147 VRSGTVNGICFMAHRRVFETIGVFDENFRIGQYEDKDLFLRARRAGFRLGTVGAAFIHH- 205

Query: 431 FRKRRPYGVG---EKQDYMLQNSMRMARVWMDDYVKKVIE 467
           F       VG   E +DY L N    +R W   + K++ E
Sbjct: 206 FGSITQKAVGARRETRDYALANKAYFSRKWHLPWWKRLSE 245


>UniRef50_Q8I569 Cluster: Putative uncharacterized protein; n=1;
            Plasmodium falciparum 3D7|Rep: Putative uncharacterized
            protein - Plasmodium falciparum (isolate 3D7)
          Length = 1662

 Score = 35.1 bits (77), Expect = 5.3
 Identities = 47/214 (21%), Positives = 89/214 (41%), Gaps = 17/214 (7%)

Query: 22   IKDRIKNNDKSIAALEGK---DLLSDQPRDVEETLSKTMWKYQDYKRQSEYRR--KVMLK 76
            IK  ++  DK    LEGK   D   DQ  DV + + K     ++   +   R+   +  K
Sbjct: 997  IKKIVEEKDK----LEGKSKEDKKGDQQSDVHQIIDKKQGGIENQSLEESKRKAKNIQQK 1052

Query: 77   EKFAKQQAIKMS-KKTENDLEEQFG-LIRNSEDLRIRDKGYNLHAFN--TLISQRIGDHR 132
            +     + +K   KKTE +L       + N +D   +D   N+H  N    +++     +
Sbjct: 1053 DNTKNAELLKCDEKKTEGNLSNTLTHYVTNQQDNFDQDIENNIHNQNEENNLNKNTSKGK 1112

Query: 133  DLPDTRNKLCQSQQY-FDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVD 191
               +  NK+  ++    +++ +   I    N     L   +   + +     IKE +  D
Sbjct: 1113 KKIENNNKVVDTKILNIEDIIKMKSI---NNNDNIKLPEEIKEYIKKMKSGGIKENVEYD 1169

Query: 192  DYSDLYNLHHDVQEAVDKLNNVIKKEEEMIETNN 225
             Y    N + +V+E VD +  V +KE + ++ N+
Sbjct: 1170 IYHKFINNYFNVKEYVDGIIVVEEKEVKAMKGND 1203


>UniRef50_Q8I4T0 Cluster: Putative uncharacterized protein; n=1;
           Plasmodium falciparum 3D7|Rep: Putative uncharacterized
           protein - Plasmodium falciparum (isolate 3D7)
          Length = 656

 Score = 35.1 bits (77), Expect = 5.3
 Identities = 23/95 (24%), Positives = 42/95 (44%)

Query: 134 LPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDY 193
           + +   +L   +  F +L +   +I    E Y+ +     ++    D    K   L+  Y
Sbjct: 434 IENDEKELIDQKNIFLKLSKILNLIHLSYEQYDEIQNEKCTMTKNEDDYTKKYEELLIKY 493

Query: 194 SDLYNLHHDVQEAVDKLNNVIKKEEEMIETNNIDM 228
            ++ N   D +E +D     IKK E+M+ T NI+M
Sbjct: 494 ENVKNQVKDKKEILDNKKQEIKKVEDMLNTYNIEM 528


>UniRef50_Q7RIN9 Cluster: Putative uncharacterized protein PY03578;
           n=8; Plasmodium (Vinckeia)|Rep: Putative uncharacterized
           protein PY03578 - Plasmodium yoelii yoelii
          Length = 1527

 Score = 35.1 bits (77), Expect = 5.3
 Identities = 44/231 (19%), Positives = 104/231 (45%), Gaps = 16/231 (6%)

Query: 5   QIVNKATKVHYRPLNWDIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYK 64
           Q +   TK     L  D +   K N +++   +   + ++   ++ +   K   KY+   
Sbjct: 764 QKIELKTKEKIEELKQDFEKTQKINMENLEMEKESFINNNLENEINKMKEKLEEKYETQI 823

Query: 65  RQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQF--GLIRNSEDL---RIRDKGYNLHA 119
           +++E + K  +KE+  K +  + +++  N   E++   L +N  D     I +K   + +
Sbjct: 824 KETEMKYKYQIKEEIEKTK--QNAEQNFNSKFEKYKENLEKNKNDFINNLIIEKNNEIES 881

Query: 120 F-NTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDR 178
           F N +  ++  +     +   KL Q+   F+ +   +  I   N+H E + +  + ++  
Sbjct: 882 FKNDIEQKKFKEMEKFKEENEKLLQNN--FENM--KNHFIEEQNKHIENIKKE-YELIKN 936

Query: 179 TDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIKKEEEMI---ETNNI 226
            + +++KE +  +   ++ N+   + +  +K  + +KKE E I   E NN+
Sbjct: 937 NEIEYLKEEMKKNKIQEIENVELKLADEKNKHIDDMKKELENIYNVEINNL 987


>UniRef50_Q54G05 Cluster: Putative uncharacterized protein; n=1;
           Dictyostelium discoideum AX4|Rep: Putative
           uncharacterized protein - Dictyostelium discoideum AX4
          Length = 1492

 Score = 35.1 bits (77), Expect = 5.3
 Identities = 50/294 (17%), Positives = 131/294 (44%), Gaps = 28/294 (9%)

Query: 1   MSGNQI-VNKA-TKVHYRPLNWDIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMW 58
           +  NQ+ VN+  +K++ + +N  I   I+NN  S+  L+ K  L+++  ++ + +     
Sbjct: 696 IQSNQVTVNELQSKLNEKEIN--INQLIENNQSSLDELQSK--LNEKQNEINQLIENNQS 751

Query: 59  KYQDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLH 118
              + + +   + + + + +    + I+ ++ + ++L+ +  LI+ S++L+ +D+   L 
Sbjct: 752 SSDELQSKLNEKHQEISELQSKLNELIENNESSSDELQSK--LIQLSDELKEKDE--KLK 807

Query: 119 AFNTLISQRIGDHRDLPDT-RNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMD 177
           + +++I +       L  + ++ L + Q   +E           NE  E    S + +  
Sbjct: 808 SLDSIIIENQEKLVQLTKSNQDSLDELQSKLNEKQNE------INELIENNQSSSNELQS 861

Query: 178 RTDQKHIKEIILVDD--------YSDLYNLHHDVQEAVDKLNNVIKKEEEMIETNNIDME 229
           + ++K  +  +L+++         S L   H ++ E   KLN    K  E++E N    +
Sbjct: 862 KLNEKQNEINLLIENNQSSSDELQSKLNEKHQEINELQSKLNEKQNKINELVENNESSSD 921

Query: 230 XXXXXXXXXXXXKKSTENSEVKNNVFNIRLLKTSKREGLIRARLYGADNSVGDV 283
                        +  EN ++K+  F   +++  ++   ++++L    N +  +
Sbjct: 922 ELQSKLIQLSDQLQEKEN-QLKS--FESSIIERDEKLNQLQSKLNEKQNEIDQI 972


>UniRef50_Q4D320 Cluster: Putative uncharacterized protein; n=3;
           Trypanosoma cruzi|Rep: Putative uncharacterized protein
           - Trypanosoma cruzi
          Length = 926

 Score = 35.1 bits (77), Expect = 5.3
 Identities = 43/203 (21%), Positives = 89/203 (43%), Gaps = 14/203 (6%)

Query: 96  EEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFD-----E 150
           EE   L+R+  + + R +       NTL+S +I D  +L    +   + Q YF      +
Sbjct: 596 EESLILLRSEVEAQ-RKRCEEFKNMNTLLSDKIVDAENLLQEMSTASEKQIYFSHYFTRQ 654

Query: 151 LPRASIIICFYNEHY-ETLMRSVHSIMDRTDQKHIKE-IILVDDYSDLYNLHHDVQEAVD 208
           L   + ++ F++EHY   + + V+ ++   D+ + KE   ++D  + L  +  + +E   
Sbjct: 655 LDDTAGLLMFFSEHYVNWIFKKVNIVVMEFDKAYAKENERVLDALNHLRKVQEETKERYT 714

Query: 209 KLNNVIKKEE-EMIETNNIDMEXXXXXXXXXXXXKKSTENSE-VKNNVFNIRLLKTSKRE 266
            L   +   E  + E     +E            +K  + +E +K ++  +R L +   +
Sbjct: 715 NLQMKLNDAEFRLGEREKASVEKFENFLSEIRLIEKQRDVAEKMKKDI--LRKLDSVNDQ 772

Query: 267 GLIRARLYGADNSVGDVLVFLDS 289
            L  A +   D  + ++LV LD+
Sbjct: 773 HL--AEINKKDEQIHELLVNLDA 793


>UniRef50_Q28X57 Cluster: GA21145-PA; n=2; Sophophora|Rep:
           GA21145-PA - Drosophila pseudoobscura (Fruit fly)
          Length = 399

 Score = 35.1 bits (77), Expect = 5.3
 Identities = 20/77 (25%), Positives = 36/77 (46%)

Query: 375 TMAGGLFAIYREYFNAIGKYDPGMNVWGGENLEISFRIWMCGGSLELIPCSRVGHVFRKR 434
           T+ GG+ A+ RE+F A+  +      WGGE+ ++S R+      +   P +   +   K 
Sbjct: 285 TIFGGVSAMTREHFQAVNGFSNSFFGWGGEDDDMSNRLKHANLFISRYPVNIARYKMLKH 344

Query: 435 RPYGVGEKQDYMLQNSM 451
           +      K+   +QN M
Sbjct: 345 QKEKANPKRYENIQNGM 361


>UniRef50_Q25662 Cluster: Repeat organellar protein; n=5; Plasmodium
           (Vinckeia)|Rep: Repeat organellar protein - Plasmodium
           chabaudi
          Length = 1939

 Score = 35.1 bits (77), Expect = 5.3
 Identities = 41/206 (19%), Positives = 97/206 (47%), Gaps = 17/206 (8%)

Query: 20  WDIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKF 79
           ++++D++K   KSI AL  K  + +    +EE L K +   ++ +   EY +++  K +F
Sbjct: 75  YELEDQLKETLKSITALSIK--VKEYEVKIEE-LEKELKLEKEKQINKEYEKELNEKSEF 131

Query: 80  AKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRN 139
            K+Q +++ K+ E ++  +   I N E + ++ +       N + S+ I  +++      
Sbjct: 132 IKRQ-MELLKEKELNINLKENKINNKEIITLKRE----EKLNDIESEYIEKNKEKEKLNY 186

Query: 140 KLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQ-KHIKEIILVDD--YSDL 196
           ++   +   D+L       C   E  + L +    ++++ +  + +KE +   +     L
Sbjct: 187 EVTNIKMSLDKL------TCEVQEKKDNLEKINKKVIEKENNLRELKEFMKEKNEIIESL 240

Query: 197 YNLHHDVQEAVDKLNNVIKKEEEMIE 222
               +D + A +KL    +++ +MIE
Sbjct: 241 DGTINDKKNAYEKLEISFEEKRKMIE 266



 Score = 35.1 bits (77), Expect = 5.3
 Identities = 51/253 (20%), Positives = 112/253 (44%), Gaps = 20/253 (7%)

Query: 21   DIKDRIKNNDKSIAALEG-KDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKF 79
            D+K++I +    +  LE  K++L+D+  ++++ +     K  + K ++E    + L +  
Sbjct: 1265 DLKNKILDLSNELINLENMKNVLTDENNNLKKEIEIKDNKLNE-KEKNENTEILNLNDDI 1323

Query: 80   AK-QQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDTR 138
             K ++ I   K  E  L ++   ++N  D+   +K Y +   N +I     +  ++   +
Sbjct: 1324 IKLKKEISEWKDEEEKLTKENIKLKN--DIEQINKEYKIKEENLMIKFN-ENINEVTSLK 1380

Query: 139  NKLCQSQQYFDELPRASIIICFYNEHYETLM---RSVHSIMDRTDQKHIKEIILVDDYSD 195
            N++   +   +EL          N++YE L+   R  +  +   D K ++  IL D  S 
Sbjct: 1381 NQIEIEKMKLEEL----------NKNYELLLAEKRETNMSISNDDNKIVENNILEDTDSK 1430

Query: 196  LYNLHHDVQEAVDKLNNVIKKEEEMIETNNI-DMEXXXXXXXXXXXXKKSTENSEVKNNV 254
              NL+ +V++      N  K  ++  E + + D              +K++ + +VKN  
Sbjct: 1431 QNNLNKNVEDKTGDDINCEKNNDQAKEISYLKDEIKKISMLYGEELNRKNSYDEKVKNLT 1490

Query: 255  FNIRLLKTSKREG 267
              ++ LK   ++G
Sbjct: 1491 NELKELKIRNKKG 1503


>UniRef50_Q236I9 Cluster: Putative uncharacterized protein; n=1;
           Tetrahymena thermophila SB210|Rep: Putative
           uncharacterized protein - Tetrahymena thermophila SB210
          Length = 783

 Score = 35.1 bits (77), Expect = 5.3
 Identities = 36/187 (19%), Positives = 82/187 (43%), Gaps = 6/187 (3%)

Query: 39  KDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKV-MLKEKFAKQQAIKMSKKTENDLEE 97
           +D L D+ + ++E L+++    +  + Q  Y   +  +  K  +++      +TE  L+E
Sbjct: 352 RDKLKDKNKQLKEELNQSFKDKKLLEMQVNYEGYINQVNSKLEEKEKQLQRIQTEIKLKE 411

Query: 98  QFGLIRNSE--DLRIRDKGYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRAS 155
               +R  E  +++++ K            Q I          NKL Q Q+      +++
Sbjct: 412 AELKLRQDEIQNIKLQQKKQQSQNNTFNAQQSIQSCSSCEILNNKLQQEQEI--SFQKSN 469

Query: 156 IIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIK 215
            +    N+  E + R +   +D+ +QK+ +      DY  +    +++ E +++LNN I 
Sbjct: 470 ELQSQLNQQKEKV-RILEDDLDQVNQKYQEVCEKQKDYDQIIQDKNELNEQINQLNNTIN 528

Query: 216 KEEEMIE 222
           +++   E
Sbjct: 529 EQKIKFE 535


>UniRef50_A5KBH9 Cluster: Putative uncharacterized protein; n=1;
           Plasmodium vivax|Rep: Putative uncharacterized protein -
           Plasmodium vivax
          Length = 1860

 Score = 35.1 bits (77), Expect = 5.3
 Identities = 47/214 (21%), Positives = 95/214 (44%), Gaps = 18/214 (8%)

Query: 16  RPLNWDIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVML 75
           R  ++ +++++K   +SI +L  K  + +    +E+ L K +   +D +    Y +++  
Sbjct: 79  RNKDYQLEEQLKETLRSITSLSTK--IVNYETKIED-LEKELKMEKDKQVDKAYEKELKE 135

Query: 76  KEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLP 135
           KE F KQ+ I M  + EN L E+  L  N  + +I D+      F +    ++ D ++  
Sbjct: 136 KENFIKQK-IGMLNEKENLLNEK-ELDINMREEKINDR----EMFISKKEDKLNDMQEQY 189

Query: 136 DTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSD 195
             +NK  + ++   E+    I +       E L   V    D  +    K I+  +   +
Sbjct: 190 LEKNK--EKEKLHFEIADIKISL-------EKLKYEVKDKKDCLENVSNKVILKENTLRE 240

Query: 196 LYNLHHDVQEAVDKLNNVIKKEEEMIETNNIDME 229
           L     +  E ++ LN  I ++E++ E    D+E
Sbjct: 241 LKEFIREKNEMIESLNEKITEKEKIYEQLGKDVE 274


>UniRef50_A2DDX5 Cluster: Viral A-type inclusion protein, putative;
           n=1; Trichomonas vaginalis G3|Rep: Viral A-type
           inclusion protein, putative - Trichomonas vaginalis G3
          Length = 1794

 Score = 35.1 bits (77), Expect = 5.3
 Identities = 36/219 (16%), Positives = 95/219 (43%), Gaps = 11/219 (5%)

Query: 21  DIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEKFA 80
           ++ + I+ NDK     + ++L  +    ++  ++K   +  D K+ ++  ++ ++ +   
Sbjct: 276 ELANLIEENDKKQGTQQNQNLNQNDEDAIQSLVTKYEEEIDDIKKNNQNEKENLINQINE 335

Query: 81  KQQAIKMSK-KTENDLEEQFGLIRNSE---DLRIRDKGYNLHAFNTLI---SQRIGDHRD 133
            + ++K  +  +ENDL E   +I  +    + +I+D   NL   +  +   SQ++ +  +
Sbjct: 336 LKNSLKNKEISSENDLNEMKIIIEQTSKDYETKIQDLMTNLEENSQKLNEMSQKLKESEE 395

Query: 134 LPDTRNKLCQSQQYFDELPRASII----ICFYNEHYETLMRSVHSIMDRTDQKHIKEIIL 189
                N++   Q   D      I     +   NE  +T++          ++   +   L
Sbjct: 396 KNQKLNEMSMLQASNDAEKEKFIKEISNLTKENEKLQTVLNENEKNRTENERLVAENQKL 455

Query: 190 VDDYSDLYNLHHDVQEAVDKLNNVIKKEEEMIETNNIDM 228
             D  ++  ++ ++Q  ++KL  ++K E+   E   + +
Sbjct: 456 NSDLHEIGEVNKNLQTEIEKLTEIMKSEQNNKENEMMSL 494


>UniRef50_A0E4N3 Cluster: Chromosome undetermined scaffold_78, whole
           genome shotgun sequence; n=1; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_78,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 842

 Score = 35.1 bits (77), Expect = 5.3
 Identities = 21/95 (22%), Positives = 49/95 (51%), Gaps = 2/95 (2%)

Query: 19  NWDIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKEK 78
           +WD  DRI++  + I   + K+ + D  +D E+   K   + +D +R+ E  ++   +++
Sbjct: 359 DWDRDDRIEDEQEPIKLKDKKEKIKD--KDNEKDREKEKQREKDKEREKEKEKEKEREKE 416

Query: 79  FAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDK 113
             +++  +  K+ E + E++    R  E  + R+K
Sbjct: 417 KEREREREKEKEREREREKEKEREREREKEKEREK 451


>UniRef50_Q8SVH3 Cluster: Putative uncharacterized protein
           ECU05_1300; n=1; Encephalitozoon cuniculi|Rep: Putative
           uncharacterized protein ECU05_1300 - Encephalitozoon
           cuniculi
          Length = 961

 Score = 35.1 bits (77), Expect = 5.3
 Identities = 22/72 (30%), Positives = 33/72 (45%), Gaps = 3/72 (4%)

Query: 463 KKVIEVNPSAAHVEIGDISERKALRERLQCKTFKW---YLDNMWFETDRSELVLGRTLCL 519
           KK++ ++  +  VEI D   R  +R   +CK   W    L  ++FE  RS  + GR   L
Sbjct: 32  KKMLRISNRSKGVEINDEQMRTVIRSMHRCKDTHWVNAVLQRLYFEISRSYAIEGRIKSL 91

Query: 520 DASNNVAPILGK 531
                 A  +GK
Sbjct: 92  ILKRFEASGIGK 103


>UniRef50_Q5J6J3 Cluster: Dipeptidyl-peptidase IV; n=16;
           Pezizomycotina|Rep: Dipeptidyl-peptidase IV -
           Trichophyton rubrum
          Length = 775

 Score = 35.1 bits (77), Expect = 5.3
 Identities = 21/74 (28%), Positives = 39/74 (52%), Gaps = 3/74 (4%)

Query: 314 VRYSARAVTPVIDVINADTFEYSPSPLVRGGFNWGLHFKWDNLPKGTLINDEDFMKPLKS 373
           V Y  + +TP+++  + +   Y+ S   +GG+ + L ++  N+P   L + +D  KPLK+
Sbjct: 416 VSYDTKVMTPLVN--DKEAAYYTASFSAKGGY-YILSYQGPNVPYQELYSTKDSKKPLKT 472

Query: 374 PTMAGGLFAIYREY 387
            T    L    +EY
Sbjct: 473 ITSNDALLEKLKEY 486


>UniRef50_Q2FUG3 Cluster: Sensor protein; n=1; Methanospirillum
           hungatei JF-1|Rep: Sensor protein - Methanospirillum
           hungatei (strain JF-1 / DSM 864)
          Length = 765

 Score = 35.1 bits (77), Expect = 5.3
 Identities = 51/199 (25%), Positives = 83/199 (41%), Gaps = 20/199 (10%)

Query: 21  DIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQ----------DYKRQSEYR 70
           DI DR +     + + E   LL+++ RD+   L    WKY+           Y ++  YR
Sbjct: 285 DITDRKRMESALLDSEEKYRLLAEKSRDLIFMLRLPEWKYEYISPSVLEITGYTQEEFYR 344

Query: 71  RKVMLKEKFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGD 130
              +L+   A   +++  +K   DL    G+I  + + +I  K   L      +SQR   
Sbjct: 345 NPDLLRRCIA-PHSLEYFEKAYKDLLN--GIIPETYEFQIITKSGEL----KWVSQRNSP 397

Query: 131 HRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILV 190
             D   T   L        E   +  I+C   + YETL  SV + +  TD+     II V
Sbjct: 398 ITDKSGTITALQGIVTDITERKTSEEIVCEAMQKYETLFSSVPTGIIVTDEN--GAIIEV 455

Query: 191 DDYSDLYNLHHDVQEAVDK 209
           + ++    LH   +E +DK
Sbjct: 456 NQHAARI-LHVSREELIDK 473


>UniRef50_A3HAB6 Cluster: Putative uncharacterized protein; n=1;
           Caldivirga maquilingensis IC-167|Rep: Putative
           uncharacterized protein - Caldivirga maquilingensis
           IC-167
          Length = 184

 Score = 35.1 bits (77), Expect = 5.3
 Identities = 16/46 (34%), Positives = 27/46 (58%)

Query: 114 GYNLHAFNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIIC 159
           GY L   NT++++ + +H  LP     L ++++Y  EL R+  IIC
Sbjct: 44  GYYLKVLNTVLNKNLKNHHPLPRLLASLERTKRYLLELSRSKQIIC 89


>UniRef50_Q10411 Cluster: Sporulation-specific protein 15; n=1;
            Schizosaccharomyces pombe|Rep: Sporulation-specific
            protein 15 - Schizosaccharomyces pombe (Fission yeast)
          Length = 1957

 Score = 35.1 bits (77), Expect = 5.3
 Identities = 46/209 (22%), Positives = 94/209 (44%), Gaps = 16/209 (7%)

Query: 19   NWDIKDRIKNNDKSIAALEGK-DLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKE 77
            N  + D +KNN ++IA+L+ + +    +  D++  LS    +Y++    S    K  L++
Sbjct: 1013 NERLMDDLKNNGENIASLQTEIEKKRAENDDLQSKLSVVSSEYENLLLISSQTNK-SLED 1071

Query: 78   KFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDT 137
            K  + + I+ + +   D ++Q    RN E   +  K   L   N  I   +   R     
Sbjct: 1072 KTNQLKYIEKNVQKLLDEKDQ----RNVELEELTSKYGKLGEENAQIKDELLALRKKSKK 1127

Query: 138  RNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEIILVDDYSDLY 197
            ++ LC +  + D+L   S       +  E L    + ++   +Q +     LV++ SDL 
Sbjct: 1128 QHDLCAN--FVDDLKEKS-------DALEQLTNEKNELIVSLEQSNSNNEALVEERSDLA 1178

Query: 198  NLHHDVQEAVDKLNNVIKK-EEEMIETNN 225
            N   D+++++   +NVI     +++  N+
Sbjct: 1179 NRLSDMKKSLSDSDNVISVIRSDLVRVND 1207


>UniRef50_UPI00015BC953 Cluster: UPI00015BC953 related cluster; n=1;
           unknown|Rep: UPI00015BC953 UniRef100 entry - unknown
          Length = 514

 Score = 34.7 bits (76), Expect = 7.0
 Identities = 43/207 (20%), Positives = 98/207 (47%), Gaps = 17/207 (8%)

Query: 18  LNWDIKDRIKNNDKSIAALEGKDLLSDQPRDVEETLSKTMWKYQDYKRQSEYRRKVMLKE 77
           +N DI  R     KS   + G   ++   R V ET  +++  +Q    QS+  +K    E
Sbjct: 62  INEDIVRRETKAGKSKFYING---MTSNQRTVLETFGQSVM-FQAQSSQSKIFKKHHQLE 117

Query: 78  KFAKQQAIKMSKKTENDLEEQFGLIRNSEDLRIRDKGYNLHAFNTLISQRIGDHRDLPDT 137
              K + I+  KK   +  E F  ++  E+L ++D  +   +    + + I  +++L ++
Sbjct: 118 ILDKDKDIQRKKK---EFVEYFDALKQKEEL-LKDLLFQKES----LEKEIEKNKELIES 169

Query: 138 RNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKHIKEI-ILVDDYSDL 196
              L   +  ++++   +  +  + E   T +++V + ++ +D   I +I   +   +  
Sbjct: 170 LEALNLEKDTYNDIKLKAEELQ-HAEKINTYIQNVLNALEYSDHSAISKINYSISQINQA 228

Query: 197 YNLHHDVQEAVDKLNNVIKKEEEMIET 223
            +   D+Q+A+DKLN +   +++++ET
Sbjct: 229 LSYKEDLQKAIDKLNAL---KDQLLET 252


>UniRef50_UPI00015B53AD Cluster: PREDICTED: similar to RHO kinase,
           putative; n=1; Nasonia vitripennis|Rep: PREDICTED:
           similar to RHO kinase, putative - Nasonia vitripennis
          Length = 1419

 Score = 34.7 bits (76), Expect = 7.0
 Identities = 44/187 (23%), Positives = 82/187 (43%), Gaps = 19/187 (10%)

Query: 49  VEETLSKTMWKYQDYKRQSEYRRKVMLKEKFAKQQAIKMSKKTENDLEEQFGLI--RNSE 106
           +E+   K      D  +QS    ++  +E     +  ++ +   N  EE   L   RN +
Sbjct: 816 IEQEQQKRNVLQSDLAQQSSEVSRLKAREHQLVGEVTQLREAKRNIEEELHHLKTQRNVD 875

Query: 107 DLRIRDKGYNLHA---FNTLISQRIGDHRDLPDTRNKLCQSQQYFDELPRASII----IC 159
            L+ ++    L A   F+TL   +  + R+  D + +L   QQ  +E  R+S++    + 
Sbjct: 876 QLQTKELQEQLEAEAYFSTLYKTQAQELREELDEKTRL---QQELEE-ERSSLVHQLQLS 931

Query: 160 FYNEHYETLMRSV--HSIMDRTDQKHIKEIILVDDYSDLYNLHHDVQEAVDKLNNVIKKE 217
                 E L RS+   ++ D   ++ +KE+    +Y D  + HH    A D+L N +K+ 
Sbjct: 932 LARGDSEALARSIAEETVADLEKERTMKEL----EYKDSTSKHHQELNAKDQLINRLKES 987

Query: 218 EEMIETN 224
           E   + N
Sbjct: 988 EAEAKKN 994


>UniRef50_UPI0000DAE6EA Cluster: hypothetical protein
           Rgryl_01001034; n=1; Rickettsiella grylli|Rep:
           hypothetical protein Rgryl_01001034 - Rickettsiella
           grylli
          Length = 753

 Score = 34.7 bits (76), Expect = 7.0
 Identities = 22/94 (23%), Positives = 50/94 (53%), Gaps = 1/94 (1%)

Query: 124 ISQRIGDHRDLPDTRNKLCQSQQYFDELPRASIIICFYNEHYETLMRSVHSIMDRTDQKH 183
           I ++I   ++L + RNK C ++ Y D      +I C +NE+ +  + +V  IM +  +K 
Sbjct: 579 IWRKIVKLQNLAERRNKEC-AELYLDLAKSLQMIKCDFNENTDATLSAVLIIMQKIKEKS 637

Query: 184 IKEIILVDDYSDLYNLHHDVQEAVDKLNNVIKKE 217
           +  +  + + S L++   ++ E  +++ + +K E
Sbjct: 638 LNAVRGLFNRSYLHDRVVELAERFERILHAMKAE 671


  Database: uniref50
    Posted date:  Oct 5, 2007 11:19 AM
  Number of letters in database: 575,637,011
  Number of sequences in database:  1,657,284
  
Lambda     K      H
   0.319    0.136    0.410 

Gapped
Lambda     K      H
   0.279   0.0580    0.190 


Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 673,051,384
Number of Sequences: 1657284
Number of extensions: 29326428
Number of successful extensions: 82024
Number of sequences better than 10.0: 299
Number of HSP's better than 10.0 without gapping: 145
Number of HSP's successfully gapped in prelim test: 154
Number of HSP's that attempted gapping in prelim test: 81219
Number of HSP's gapped (non-prelim): 678
length of query: 589
length of database: 575,637,011
effective HSP length: 105
effective length of query: 484
effective length of database: 401,622,191
effective search space: 194385140444
effective search space used: 194385140444
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)
S2: 75 (34.3 bits)
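
The Expect values listed above can be cross-checked against the footer statistics. The sketch below is an illustration, not part of the BLAST output: it assumes the standard Karlin-Altschul relations (bit score = (Lambda * raw - ln K) / ln 2, Expect = search space * 2^-bits) and uses the gapped Lambda and K and the effective search space printed in this report. The function names are hypothetical.

```python
import math

# Values copied from the footer of this report (gapped statistics).
GAPPED_LAMBDA = 0.279
GAPPED_K = 0.0580
EFFECTIVE_SEARCH_SPACE = 194_385_140_444


def bits_from_raw(raw_score: int) -> float:
    """Convert a raw alignment score to a bit score."""
    return (GAPPED_LAMBDA * raw_score - math.log(GAPPED_K)) / math.log(2)


def expect_from_bits(bit_score: float) -> float:
    """Expected number of chance hits with at least this bit score."""
    return EFFECTIVE_SEARCH_SPACE * 2.0 ** (-bit_score)


if __name__ == "__main__":
    # Raw scores 78, 77 and 76 are the ones shown in parentheses above.
    for raw in (78, 77, 76):
        bits = round(bits_from_raw(raw), 1)
        print(f"raw {raw}: {bits:.1f} bits, Expect ~ {expect_from_bits(bits):.1f}")
```

Under these assumptions, raw scores 78, 77 and 76 should come out near the 35.5, 35.1 and 34.7 bits and the Expect values 4.0, 5.3 and 7.0 reported for the hits in this section.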
