SilkBase

Last updated: 2022/11/18
BLASTP 2.2.12 [Aug-07-2005]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= BGIBMGA001273-TA|BGIBMGA001273-PA|IPR012776|Trimethyllysine
dioxygenase, IPR003819|Taurine catabolism dioxygenase TauD/TfdA
         (232 letters)

Database: uniref50 
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

UniRef50_Q4V6C2 Cluster: IP11527p; n=5; Sophophora|Rep: IP11527p...   215   1e-54
UniRef50_Q16V01 Cluster: Epsilon-trimethyllysine 2-oxoglutarate ...   199   4e-50
UniRef50_A7SLB9 Cluster: Predicted protein; n=1; Nematostella ve...   196   3e-49
UniRef50_Q9NVH6 Cluster: Trimethyllysine dioxygenase, mitochondr...   172   5e-42
UniRef50_Q6CCC7 Cluster: Similar to sp|Q96UB1 Neurospora crassa ...   166   4e-40
UniRef50_A5DCB6 Cluster: Trimethyllysine dioxygenase; n=6; Sacch...   161   1e-38
UniRef50_Q5KF50 Cluster: Mitochondrion protein, putative; n=2; F...   149   4e-35
UniRef50_A1D409 Cluster: Trimethyllysine dioxygenase, putative; ...   146   4e-34
UniRef50_Q21526 Cluster: Putative uncharacterized protein gbh-2;...   145   7e-34
UniRef50_UPI000023E495 Cluster: hypothetical protein FG06105.1; ...   143   4e-33
UniRef50_Q1E1M7 Cluster: Putative uncharacterized protein; n=1; ...   141   2e-32
UniRef50_Q0UJ11 Cluster: Putative uncharacterized protein; n=1; ...   137   2e-31
UniRef50_Q757P7 Cluster: AEL035Wp; n=2; Saccharomycetaceae|Rep: ...   136   6e-31
UniRef50_A0J760 Cluster: Taurine catabolism dioxygenase TauD/Tfd...   134   2e-30
UniRef50_Q4X023 Cluster: Gamma-butyrobetaine hydroxylase subfami...   125   8e-28
UniRef50_Q2GXQ0 Cluster: Putative uncharacterized protein; n=6; ...   121   2e-26
UniRef50_Q1VMP3 Cluster: Gamma-butyrobetaine hydroxylase; n=1; P...   115   1e-24
UniRef50_Q98KK0 Cluster: Probable gamma-butyrobetaine dioxygenas...   105   1e-21
UniRef50_Q2UCW9 Cluster: Predicted gamma-butyrobetaine; n=2; Asp...   104   2e-21
UniRef50_A6RF74 Cluster: Predicted protein; n=1; Ajellomyces cap...   104   2e-21
UniRef50_A6F7M8 Cluster: Gamma-butyrobetaine hydroxylase; n=1; M...   103   3e-21
UniRef50_A2RB24 Cluster: Contig An18c0170, complete genome; n=1;...   102   7e-21
UniRef50_Q4PCW2 Cluster: Putative uncharacterized protein; n=1; ...   101   2e-20
UniRef50_A3YAS9 Cluster: Gamma-butyrobetaine hydroxylase; n=1; M...   100   4e-20
UniRef50_A6QRR2 Cluster: Putative uncharacterized protein; n=1; ...   100   4e-20
UniRef50_UPI0000E48C37 Cluster: PREDICTED: similar to gamma buty...    98   1e-19
UniRef50_A3YI34 Cluster: Gamma-butyrobetaine hydroxylase; n=1; M...    98   2e-19
UniRef50_A2R5A1 Cluster: Catalytic activity: H. sapiens BBH conv...    97   2e-19
UniRef50_A3Y505 Cluster: Gamma-butyrobetaine hydroxylase, putati...    97   4e-19
UniRef50_UPI0000586B6F Cluster: PREDICTED: hypothetical protein;...    96   6e-19
UniRef50_O75936 Cluster: Gamma-butyrobetaine dioxygenase; n=26; ...    95   1e-18
UniRef50_Q1GKN1 Cluster: Gamma-butyrobetaine2-oxoglutarate dioxy...    94   2e-18
UniRef50_Q5AVW2 Cluster: Putative uncharacterized protein; n=1; ...    94   2e-18
UniRef50_A7S7D2 Cluster: Predicted protein; n=2; Nematostella ve...    94   3e-18
UniRef50_A4R0Y1 Cluster: Putative uncharacterized protein; n=1; ...    94   3e-18
UniRef50_Q1QTU1 Cluster: Gamma-butyrobetaine,2-oxoglutarate diox...    93   4e-18
UniRef50_A6SL62 Cluster: Putative uncharacterized protein; n=2; ...    92   1e-17
UniRef50_Q96UB1 Cluster: Trimethyllysine dioxygenase; n=2; Neuro...    92   1e-17
UniRef50_Q1QSP3 Cluster: Taurine catabolism dioxygenase TauD/Tfd...    91   2e-17
UniRef50_Q17KD9 Cluster: Epsilon-trimethyllysine 2-oxoglutarate ...    91   2e-17
UniRef50_Q1RPP2 Cluster: Gamma-butyrobetaine hydroxylase; n=2; C...    91   3e-17
UniRef50_Q4V6I6 Cluster: IP11337p; n=6; Sophophora|Rep: IP11337p...    90   4e-17
UniRef50_Q1E7N7 Cluster: Putative uncharacterized protein; n=1; ...    90   4e-17
UniRef50_Q112B1 Cluster: Taurine catabolism dioxygenase TauD/Tfd...    89   7e-17
UniRef50_Q6C1G9 Cluster: Similar to DEHA0C03839g Debaryomyces ha...    89   1e-16
UniRef50_UPI000023D763 Cluster: hypothetical protein FG05953.1; ...    88   2e-16
UniRef50_A0Z404 Cluster: Gamma-butyrobetaine,2-oxoglutarate diox...    86   6e-16
UniRef50_Q75A94 Cluster: ADR024Wp; n=1; Eremothecium gossypii|Re...    86   6e-16
UniRef50_A0YX02 Cluster: Gamma-butyrobetaine hydroxylase, putati...    85   1e-15
UniRef50_Q9UVG4 Cluster: Putative uncharacterized protein; n=2; ...    84   3e-15
UniRef50_P80193 Cluster: Gamma-butyrobetaine dioxygenase; n=13; ...    83   4e-15
UniRef50_UPI0000E486C9 Cluster: PREDICTED: similar to LOC535630 ...    83   6e-15
UniRef50_Q9NF72 Cluster: EG:BACR7A4.9 protein; n=4; Sophophora|R...    83   8e-15
UniRef50_UPI0000587DDD Cluster: PREDICTED: hypothetical protein;...    82   1e-14
UniRef50_A0YLG8 Cluster: Gamma-butyrobetaine hydroxylase; n=1; L...    79   9e-14
UniRef50_A7SHP2 Cluster: Predicted protein; n=4; Nematostella ve...    79   1e-13
UniRef50_Q5A0G4 Cluster: Potential gamma-butyrobetaine hydroxyla...    79   1e-13
UniRef50_Q0UUH0 Cluster: Putative uncharacterized protein; n=1; ...    79   1e-13
UniRef50_Q1GF28 Cluster: Gamma-butyrobetaine2-oxoglutarate dioxy...    78   2e-13
UniRef50_Q6CQT2 Cluster: Similar to sp|P23180 Saccharomyces cere...    77   4e-13
UniRef50_Q19000 Cluster: Probable gamma-butyrobetaine dioxygenas...    73   6e-12
UniRef50_A3LY61 Cluster: Gamma-butyrobetaine dioxygenase; n=3; S...    72   1e-11
UniRef50_Q5KP77 Cluster: Mitochondrion protein, putative; n=2; F...    71   2e-11
UniRef50_Q4P2H1 Cluster: Putative uncharacterized protein; n=1; ...    69   1e-10
UniRef50_A0YHS0 Cluster: Gamma-butyrobetaine,2-oxoglutarate diox...    67   4e-10
UniRef50_UPI0000586629 Cluster: PREDICTED: similar to gamma buty...    66   5e-10
UniRef50_Q097J3 Cluster: Gamma-butyrobetaine dioxygenase; n=1; S...    64   4e-09
UniRef50_Q1YSL8 Cluster: Gamma-butyrobetaine hydroxylase; n=1; g...    63   5e-09
UniRef50_UPI0000E47204 Cluster: PREDICTED: similar to Gamma-buty...    62   1e-08
UniRef50_Q7S3G2 Cluster: Putative uncharacterized protein NCU068...    59   1e-07
UniRef50_UPI0000587A47 Cluster: PREDICTED: hypothetical protein,...    58   1e-07
UniRef50_A7SHP3 Cluster: Predicted protein; n=1; Nematostella ve...    58   2e-07
UniRef50_Q2HBR6 Cluster: Putative uncharacterized protein; n=1; ...    57   4e-07
UniRef50_A0P0W2 Cluster: Putative uncharacterized protein; n=1; ...    54   3e-06
UniRef50_A1DGB7 Cluster: Haloacid dehalogenase-like hydrolase, p...    50   7e-05
UniRef50_Q9KQQ4 Cluster: PvcB protein; n=20; Proteobacteria|Rep:...    48   3e-04
UniRef50_A4D938 Cluster: CrpF; n=1; Nostoc sp. ATCC 53789|Rep: C...    47   5e-04
UniRef50_P23180 Cluster: Uncharacterized oxidoreductase YHL021C;...    46   0.001
UniRef50_Q19Q32 Cluster: Trimethyllysine hydroxylase-like; n=1; ...    45   0.002
UniRef50_A7SYU0 Cluster: Predicted protein; n=3; Nematostella ve...    41   0.030
UniRef50_Q6E7K0 Cluster: JamJ; n=3; Oscillatoriales|Rep: JamJ - ...    39   0.092
UniRef50_Q4FKY1 Cluster: Gab protein; n=2; Candidatus Pelagibact...    39   0.12 
UniRef50_Q6FMD9 Cluster: Similar to sp|P23180 Saccharomyces cere...    38   0.21 
UniRef50_Q9I6U7 Cluster: Putative uncharacterized protein; n=4; ...    38   0.28 
UniRef50_Q2C5Y7 Cluster: Putative uncharacterized protein; n=2; ...    37   0.37 
UniRef50_Q94534 Cluster: Beaten path precursor; n=5; Diptera|Rep...    37   0.37 
UniRef50_Q10Z25 Cluster: Putative uncharacterized protein; n=2; ...    36   0.86 
UniRef50_A3NJS9 Cluster: Taurine catabolism dioxygenase TauD, Tf...    36   0.86 
UniRef50_Q5QFY7 Cluster: ORF3; n=3; Proteobacteria|Rep: ORF3 - P...    36   1.1  
UniRef50_Q0AA88 Cluster: Flagellar hook capping protein; n=1; Al...    36   1.1  
UniRef50_A4KUB9 Cluster: TlmR3; n=2; Actinomycetales|Rep: TlmR3 ...    36   1.1  
UniRef50_Q05582 Cluster: Clavaminate synthase 2; n=5; Streptomyc...    35   1.5  
UniRef50_Q5ZU71 Cluster: Pyoverdine biosynthesis regulatory gene...    35   2.0  
UniRef50_Q118E5 Cluster: Gamma-butyrobetaine,2-oxoglutarate diox...    35   2.0  
UniRef50_A5EW81 Cluster: Putative uncharacterized protein; n=1; ...    35   2.0  
UniRef50_Q50E85 Cluster: Putative uncharacterized protein; n=1; ...    34   3.5  
UniRef50_A0BH33 Cluster: Chromosome undetermined scaffold_107, w...    34   3.5  
UniRef50_Q3IC42 Cluster: Putative oxidoreductase; n=2; Alteromon...    33   4.6  
UniRef50_Q9FB40 Cluster: SyrP-like protein; n=1; Streptomyces ve...    33   4.6  
UniRef50_Q29DR0 Cluster: GA10095-PA; n=2; pseudoobscura subgroup...    33   4.6  
UniRef50_A0DD98 Cluster: Chromosome undetermined scaffold_46, wh...    33   4.6  
UniRef50_Q6Q472 Cluster: Calcium activated chloride channel vari...    33   8.0  
UniRef50_Q9I1L4 Cluster: Pyoverdine biosynthesis protein PvcB; n...    33   8.0  
UniRef50_Q72TN9 Cluster: Syringomycin channel-forming protein; n...    33   8.0  
UniRef50_Q2CHG0 Cluster: Putative uncharacterized protein; n=1; ...    33   8.0  
UniRef50_Q9NGV3 Cluster: SP1173; n=4; Sophophora|Rep: SP1173 - D...    33   8.0  

>UniRef50_Q4V6C2 Cluster: IP11527p; n=5; Sophophora|Rep: IP11527p -
           Drosophila melanogaster (Fruit fly)
          Length = 366

 Score =  215 bits (524), Expect = 1e-54
 Identities = 100/231 (43%), Positives = 141/231 (61%), Gaps = 1/231 (0%)

Query: 1   YTFKSFNQVQPSAEATETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNI 60
           Y     + V P+A  TE   + +  +  T FG  W F+   DHADTAYT L L +H DN 
Sbjct: 136 YGIVFIDDVAPTANMTELALRRVFPLMKTFFGEMWTFSDNPDHADTAYTKLYLGSHTDNT 195

Query: 61  YWTEAAGLQILHCIEHTNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERR 120
           Y+ +AAGLQ LHCIEH+ G+GGE   VDG +    LK  +P  Y+ L + ++ GEYIE+ 
Sbjct: 196 YFCDAAGLQALHCIEHS-GSGGENFFVDGLHVVHELKRRYPAAYDVLCSVQVPGEYIEKG 254

Query: 121 HHFTHSAPVIQIDKNTEDIKQIRFNVYDRSAMAFRSGRDCRLYYRSLKNLARYYENKENQ 180
            H  H+AP+IQ+D  T++  Q+R NVYDR+        +   +Y SL+ L     +K+ Q
Sbjct: 255 EHHYHTAPIIQVDPLTQEFVQLRLNVYDRAVFNTIPQAEMAEFYDSLRQLLLIVRDKQQQ 314

Query: 181 WIFKLVPGLVMVIDNFRLLHGRNGFTGRRVLCGAYVSRSDWLDKARSLNLI 231
           W  KL PG +++ DN+R+LHGR  +TG R + G+YV R+D+L KAR L +I
Sbjct: 315 WALKLCPGSIVLFDNWRVLHGREAYTGSRTMSGSYVQRTDFLSKARVLGII 365


>UniRef50_Q16V01 Cluster: Epsilon-trimethyllysine 2-oxoglutarate
           dioxygenase; n=2; Culicidae|Rep: Epsilon-trimethyllysine
           2-oxoglutarate dioxygenase - Aedes aegypti (Yellowfever
           mosquito)
          Length = 713

 Score =  199 bits (486), Expect = 4e-50
 Identities = 92/231 (39%), Positives = 139/231 (60%), Gaps = 1/231 (0%)

Query: 1   YTFKSFNQVQPSAEATETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNI 60
           Y      +V  + ++TE   + +  I  T+FG  W F+   DH+DTAYT   L  H DN 
Sbjct: 483 YGVAFIEKVPANPQSTEMAVRRIFPIHKTLFGEMWTFSDSMDHSDTAYTKNYLGPHTDNT 542

Query: 61  YWTEAAGLQILHCIEHTNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERR 120
           Y+++A+GLQ+LHCI+   G+GG+TIL+DGF  A  L+   PE +E L  Y + GEY+E  
Sbjct: 543 YFSDASGLQVLHCIQF-KGSGGQTILIDGFKAAEQLRLKKPEVFERLCNYPVTGEYLEEG 601

Query: 121 HHFTHSAPVIQIDKNTEDIKQIRFNVYDRSAMAFRSGRDCRLYYRSLKNLARYYENKENQ 180
            H T+ AP+I+ +  T +++Q+RFN+YDR+ +          +Y   K L      +   
Sbjct: 602 KHHTYCAPIIKRNIITGEVEQLRFNIYDRAILKTIPQEQVPQFYADFKELGAEINEESMA 661

Query: 181 WIFKLVPGLVMVIDNFRLLHGRNGFTGRRVLCGAYVSRSDWLDKARSLNLI 231
           W F+L PG VM+ DN+R+LHGR  + G+RV+ G YV+R+D+   AR+L +I
Sbjct: 662 WTFQLTPGTVMIFDNWRVLHGRMAYNGKRVMSGCYVARTDYQSVARTLGII 712


>UniRef50_A7SLB9 Cluster: Predicted protein; n=1; Nematostella
           vectensis|Rep: Predicted protein - Nematostella
           vectensis
          Length = 337

 Score =  196 bits (479), Expect = 3e-49
 Identities = 98/220 (44%), Positives = 135/220 (61%), Gaps = 3/220 (1%)

Query: 1   YTFKSFNQVQPSAEATETVCKALGG-IQHTIFGATWEFTT-VADHADTAYTNLPLAAHND 58
           Y F   N       A E + K+LG  ++ T FG  W F+  V DHADTAYT+  L AHND
Sbjct: 118 YGFAFVNDTPTELSAVEKLAKSLGCFVRETHFGRLWAFSNEVMDHADTAYTSGFLHAHND 177

Query: 59  NIYWTEAAGLQILHCIEHTNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIE 118
           N Y+T  AGLQ+LHC+ H +G GGE++LVDGF  A  LK++HP  Y FLTT  +   YI+
Sbjct: 178 NTYYTSPAGLQMLHCVHH-DGKGGESLLVDGFNAANELKKEHPGAYTFLTTKVLPYRYID 236

Query: 119 RRHHFTHSAPVIQIDKNTEDIKQIRFNVYDRSAMAFRSGRDCRLYYRSLKNLARYYENKE 178
              H     P I++D  ++D  QIR+N YDR+ +      D   YY++++  A      E
Sbjct: 237 SERHLKAFGPTIELDPFSKDFHQIRYNHYDRAVIDCLESDDVPSYYKAIQAYAEILRRPE 296

Query: 179 NQWIFKLVPGLVMVIDNFRLLHGRNGFTGRRVLCGAYVSR 218
           +++ FKLVPG +MV+ N+R++HGRN FTGRR + G YV +
Sbjct: 297 SEYWFKLVPGQLMVMGNWRVMHGRNRFTGRRDMQGCYVDK 336


>UniRef50_Q9NVH6 Cluster: Trimethyllysine dioxygenase, mitochondrial
           precursor; n=33; Euteleostomi|Rep: Trimethyllysine
           dioxygenase, mitochondrial precursor - Homo sapiens
           (Human)
          Length = 421

 Score =  172 bits (419), Expect = 5e-42
 Identities = 91/234 (38%), Positives = 130/234 (55%), Gaps = 5/234 (2%)

Query: 1   YTFKSFNQVQPSAEATETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNI 60
           Y       V P+ E TE + + +  I+ TI+G  W FT+     DTAYT L L  H D  
Sbjct: 187 YGIAFVENVPPTQEHTEKLAERISLIRETIYGRMWYFTSDFSRGDTAYTKLALDRHTDTT 246

Query: 61  YWTEAAGLQILHCIEHTNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIER- 119
           Y+ E  G+Q+ HC++H  GTGG T+LVDGFY A  + +  PE++E L+   ++ EYIE  
Sbjct: 247 YFQEPCGIQVFHCLKH-EGTGGRTLLVDGFYAAEQVLQKAPEEFELLSKVPLKHEYIEDV 305

Query: 120 ---RHHFTHSAPVIQIDKNTEDIKQIRFNVYDRSAMAFRSGRDCRLYYRSLKNLARYYEN 176
               +H     PV+ I    +++  IR+N YDR+ +          +Y + + L      
Sbjct: 306 GECHNHMIGIGPVLNIYPWNKELYLIRYNNYDRAVINTVPYDVVHRWYTAHRTLTIELRR 365

Query: 177 KENQWIFKLVPGLVMVIDNFRLLHGRNGFTGRRVLCGAYVSRSDWLDKARSLNL 230
            EN++  KL PG V+ IDN+R+LHGR  FTG R LCG Y++R D L+ AR L L
Sbjct: 366 PENEFWVKLKPGRVLFIDNWRVLHGRECFTGYRQLCGCYLTRDDVLNTARLLGL 419


>UniRef50_Q6CCC7 Cluster: Similar to sp|Q96UB1 Neurospora crassa
           Trimethyllysine dioxygenase; n=1; Yarrowia
           lipolytica|Rep: Similar to sp|Q96UB1 Neurospora crassa
           Trimethyllysine dioxygenase - Yarrowia lipolytica
           (Candida lipolytica)
          Length = 382

 Score =  166 bits (404), Expect = 4e-40
 Identities = 89/225 (39%), Positives = 127/225 (56%), Gaps = 5/225 (2%)

Query: 9   VQPSAEATETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGL 68
           V  + E TE +C+ L  I+HT +G  W+FT      DTAYTN  LA+H D  YWT+  GL
Sbjct: 149 VPATPEDTEKLCERLAHIKHTHYGGFWDFTADLAMNDTAYTNFHLASHTDGTYWTDTPGL 208

Query: 69  QILHCIEHTNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYI-ERRHHFTHSA 127
           Q+ HC+ H +G GGE +LVDGF  A   K+ +PE YE L+   I      E     T   
Sbjct: 209 QLFHCLHH-DGKGGENMLVDGFRAAQEFKKLNPEGYELLSRVRIPAHSAGEDSVCITPEV 267

Query: 128 --PVIQIDKNTEDIKQIRFNVYDRSAM-AFRSGRDCRLYYRSLKNLARYYENKENQWIFK 184
             PV   D  T +++Q+R+N  DRS M  + S  D   +Y++++       + + +++ K
Sbjct: 268 PQPVFTHDPITGELQQVRWNNDDRSVMDTWDSPEDVPKFYKAIRQWNGILTDPKFEYVCK 327

Query: 185 LVPGLVMVIDNFRLLHGRNGFTGRRVLCGAYVSRSDWLDKARSLN 229
           LV G  ++ DN+R+LHGR GF G R +CGAY +R D+L   R  N
Sbjct: 328 LVAGECLIFDNWRVLHGRKGFVGNRRMCGAYHARDDFLSTFRLTN 372


>UniRef50_A5DCB6 Cluster: Trimethyllysine dioxygenase; n=6;
           Saccharomycetales|Rep: Trimethyllysine dioxygenase -
           Pichia guilliermondii (Yeast) (Candida guilliermondii)
          Length = 399

 Score =  161 bits (392), Expect = 1e-38
 Identities = 81/232 (34%), Positives = 130/232 (56%), Gaps = 6/232 (2%)

Query: 3   FKSFNQVQPSAEATETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYW 62
           F   + V    + TE +C+ L  I+ T +G  W+FT+     DTAYTN+ +++H D  YW
Sbjct: 161 FCFIDNVPVDPQETEKLCEKLMYIRPTHYGGFWDFTSDLSKNDTAYTNIDISSHTDGTYW 220

Query: 63  TEAAGLQILHCIEHTNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERRHH 122
           ++  GLQ+ H + H  GTGG T LVD F+ A  LK++HPE +E LT   +          
Sbjct: 221 SDTPGLQLFHLLMH-EGTGGTTSLVDAFHCAEILKKEHPESFELLTRIPVPAHSAGEEKV 279

Query: 123 FTH---SAPVIQIDKNTEDIKQIRFNVYDRSAM-AFRSGRDCRLYYRSLKNLARYYENKE 178
                   P+ ++D N E I Q+R+N  DRS M ++ +  +   +YR++K   +   +  
Sbjct: 280 CIQPDIPQPIFKLDTNGELI-QVRWNQSDRSTMDSWENPLEVVKFYRAIKQWHKIISDPA 338

Query: 179 NQWIFKLVPGLVMVIDNFRLLHGRNGFTGRRVLCGAYVSRSDWLDKARSLNL 230
           N+  ++L PG  ++ DN+R  H R  FTG+R +CGAY++R D++ +   LN+
Sbjct: 339 NELFYQLRPGQCLIFDNWRCFHSRTEFTGKRRMCGAYINRDDFVSRLNLLNI 390


>UniRef50_Q5KF50 Cluster: Mitochondrion protein, putative; n=2;
           Filobasidiella neoformans|Rep: Mitochondrion protein,
           putative - Cryptococcus neoformans (Filobasidiella
           neoformans)
          Length = 447

 Score =  149 bits (362), Expect = 4e-35
 Identities = 76/217 (35%), Positives = 122/217 (56%), Gaps = 6/217 (2%)

Query: 13  AEATETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGLQILH 72
           A+ TET+ K++G I+ T +G  W FT    H D AY+   L AH D  Y+T+ AGLQI H
Sbjct: 210 AKETETLIKSIGPIRQTHYGGFWSFTADLSHGDLAYSAQSLPAHTDTTYFTDPAGLQIFH 269

Query: 73  CIEHTN-GTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERRHHFTH---SAP 128
            + H + G GG+T+L DGF+ A+ L    P  Y  L+   I       +        S P
Sbjct: 270 LLSHPSPGQGGKTLLADGFHAASQLSAVDPASYSVLSRLPIPAHASGTKGTLLRPLISFP 329

Query: 129 VIQIDKNTEDIKQIRFNVYDRSAMAFR-SGRDCRLYYRSLKNLARYYENKENQWIFKLVP 187
           V++ D+    + Q+R+N  DR  +    S  + R +Y++ +      ++++N++  +L P
Sbjct: 330 VLRHDE-CGRLAQVRWNNEDRGIIGHGWSATEVRQWYQAAQRFESLVKSEQNEYWVQLNP 388

Query: 188 GLVMVIDNFRLLHGRNGFTGRRVLCGAYVSRSDWLDK 224
           G +++IDN+R++HGR+ FTG R +CGAY+   DW  +
Sbjct: 389 GTMLIIDNWRVMHGRSEFTGSRTMCGAYIGADDWYSR 425


>UniRef50_A1D409 Cluster: Trimethyllysine dioxygenase, putative;
           n=5; Trichocomaceae|Rep: Trimethyllysine dioxygenase,
           putative - Neosartorya fischeri (strain ATCC 1020 / DSM
           3700 / NRRL 181)(Aspergillus fischerianus (strain ATCC
           1020 / DSM 3700 / NRRL 181))
          Length = 375

 Score =  146 bits (354), Expect = 4e-34
 Identities = 73/219 (33%), Positives = 124/219 (56%), Gaps = 5/219 (2%)

Query: 14  EATETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGLQILHC 73
           E+T+ + + +  I+HT +G  W+FT+     DTAYT   L AH DN Y+T+ A LQ+ H 
Sbjct: 133 ESTKALLERIAFIRHTHYGGFWDFTSDLTFKDTAYTTEFLGAHTDNTYFTDPARLQLFHL 192

Query: 74  IEHTNGTGGETILVDGFYGATCLKEDHPEDYEFLTT----YEIEGEYIERRHHFTHSAPV 129
           + HT+G GG ++LVDGF  A+ +++++P+    L      Y   G   +     T  APV
Sbjct: 193 LSHTDGHGGASLLVDGFKAASIMRQENPKHCGVLAATKQPYHSSGNE-DVCIQPTEQAPV 251

Query: 130 IQIDKNTEDIKQIRFNVYDRSAMAFRSGRDCRLYYRSLKNLARYYENKENQWIFKLVPGL 189
            +I  +   + Q+R+N YDR+A      ++   +Y + ++     +    +   +L PG 
Sbjct: 252 FKIHPDLSRLYQVRWNNYDRAAKRNWGLKEQNRWYNAARHFNHIIQRPNVEIWTQLQPGT 311

Query: 190 VMVIDNFRLLHGRNGFTGRRVLCGAYVSRSDWLDKARSL 228
            ++ DN+R+LHGR+ FTG+R +CG Y++  D++ + R L
Sbjct: 312 ALIFDNWRMLHGRSEFTGKRRMCGGYINNDDFISRYRLL 350


>UniRef50_Q21526 Cluster: Putative uncharacterized protein gbh-2;
           n=2; Caenorhabditis|Rep: Putative uncharacterized
           protein gbh-2 - Caenorhabditis elegans
          Length = 409

 Score =  145 bits (352), Expect = 7e-34
 Identities = 81/233 (34%), Positives = 126/233 (54%), Gaps = 18/233 (7%)

Query: 9   VQPSAEATETVCKALGGIQHTIFGATWEFTTVAD-----HADTAYTNLPLAAHNDNIYWT 63
           V+ ++EATE +C++L  +  T FG  W F+  A      + DTAY +  +  H D  Y+ 
Sbjct: 167 VEGTSEATEKLCQSLVPVHDTFFGQFWVFSNSATNDEPAYEDTAYGSDEIGPHTDGTYFD 226

Query: 64  EAAGLQILHCIEHTNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERR--- 120
           +  G+Q+ HC+     TGG+T+LVD FY A  L+ + PED+E L   +I   Y+E     
Sbjct: 227 QTPGIQVFHCLTPAK-TGGDTVLVDSFYCAEKLRNESPEDFEILCNTKISHHYLEGSPPG 285

Query: 121 ---HHFTHSAPVIQIDKNTEDIKQIRFNVYDRSAMAFRSGRDCRL-----YYRSLKNLAR 172
              H  +   PVI+ + +  +I QIRFN YDR+  +  +  +        +Y + +  ++
Sbjct: 286 SSIHSVSLEKPVIERN-SFGNITQIRFNPYDRAPFSCLNSSEASAAETIKFYEAYEKFSK 344

Query: 173 YYENKENQWIFKLVPGLVMVIDNFRLLHGRNGFTGRRVLCGAYVSRSDWLDKA 225
              N +N     L PG V+ IDNFR+LH R  F G R +CG Y+SR +++ KA
Sbjct: 345 ICHNPDNSIEISLRPGSVIFIDNFRILHSRTSFQGYRQMCGCYLSRDNFMAKA 397


>UniRef50_UPI000023E495 Cluster: hypothetical protein FG06105.1;
           n=1; Gibberella zeae PH-1|Rep: hypothetical protein
           FG06105.1 - Gibberella zeae PH-1
          Length = 369

 Score =  143 bits (346), Expect = 4e-33
 Identities = 79/240 (32%), Positives = 124/240 (51%), Gaps = 14/240 (5%)

Query: 1   YTFKSFNQVQPSAEATETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNI 60
           Y F      +P+ EAT+   + +G I++T +G  ++F      ADTAYTN+ LA H D  
Sbjct: 96  YGFCLVENAEPTPEATQAFLEKIGPIRNTHYGGFYDFVPDLALADTAYTNIALAPHTDTT 155

Query: 61  YWTEAAGLQILHCIEH--------TNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEI 112
           Y++E AGLQ  HC+EH            GGE++LVDG   A  LK + P  ++ L    +
Sbjct: 156 YFSEPAGLQAFHCLEHEAPPGHNPDEPLGGESLLVDGLQAARLLKRETPNLFDTLRDIRV 215

Query: 113 EGEYIERRHHF---THSAPVIQIDKNTEDIKQIRFNVYDRSAMAFRSGRDCRLYYRSLKN 169
                  +        + PVI++D  T  I +IR+N  DR  +      D   +Y + + 
Sbjct: 216 PWHASGNKGIAIAPDRTYPVIEVDNETRRINRIRWNNDDRGVVHL---FDSPPWYVAARQ 272

Query: 170 LARYYENKENQWIFKLVPGLVMVIDNFRLLHGRNGFTGRRVLCGAYVSRSDWLDKARSLN 229
                  + +Q+ FKL PG +++ +N+R++HGR  F G R +CGAY+ R D++ + R  N
Sbjct: 273 WNDIINRERSQYRFKLTPGTIVIFNNWRVMHGRTAFKGTRRICGAYIPRDDFVSRYRETN 332


>UniRef50_Q1E1M7 Cluster: Putative uncharacterized protein; n=1;
           Coccidioides immitis|Rep: Putative uncharacterized
           protein - Coccidioides immitis
          Length = 450

 Score =  141 bits (341), Expect = 2e-32
 Identities = 78/222 (35%), Positives = 124/222 (55%), Gaps = 9/222 (4%)

Query: 14  EATETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGLQILHC 73
           EAT+ + + +  I+ T +G  W+FT+     D AYT   L  H DN Y+T+ +GLQ+ H 
Sbjct: 222 EATKKLLERIAFIRPTHYGGFWDFTSDLAMKDMAYTTQGLGVHTDNAYFTDPSGLQMFHL 281

Query: 74  IEHTNGTGGETILVDGFYGATCLKEDHPEDYEFLT----TYEIEG-EYIERRHHFTHSAP 128
           + HT+G GGE+ LVDGF  A  L  ++P+ Y  L+    ++   G E++      TH   
Sbjct: 282 LSHTDGDGGESTLVDGFEAARTLWSENPDAYAVLSNPIFSHHASGNEHVHIMPAKTHE-- 339

Query: 129 VIQIDKNTEDIKQIRFNVYDRSAMAFRSGRDCRL-YYRSLKNLARYYENKENQWIFKLVP 187
                + T ++ QIR+N  DR A  F   +D  L +Y + +  ++  +  +    FKL P
Sbjct: 340 TFSHRQPTGELYQIRWNDEDRGA-NFTGSQDSLLAWYVAAREWSQMLKRPKLLLKFKLEP 398

Query: 188 GLVMVIDNFRLLHGRNGFTGRRVLCGAYVSRSDWLDKARSLN 229
           G+ ++ DN+R+LHGR  FTG R +CG Y++R D++ +   LN
Sbjct: 399 GMPLIFDNWRMLHGRTAFTGARRMCGGYINRDDFISRYELLN 440


>UniRef50_Q0UJ11 Cluster: Putative uncharacterized protein; n=1;
           Phaeosphaeria nodorum|Rep: Putative uncharacterized
           protein - Phaeosphaeria nodorum (Septoria nodorum)
          Length = 324

 Score =  137 bits (332), Expect = 2e-31
 Identities = 77/216 (35%), Positives = 119/216 (55%), Gaps = 12/216 (5%)

Query: 7   NQVQPSAEATETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAA 66
           N    S   TE + K++  I+ T +G  ++FT      DTAYTN+ L AH D  Y+++ A
Sbjct: 99  NVPHESPSDTEQLLKSIAFIRETHYGGFYDFTADLASKDTAYTNIALEAHTDTTYFSDPA 158

Query: 67  GLQILHCIEHTNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEI-------EGEYIER 119
           GLQ  H + HT+G GG ++LVDGF  A  L +   E Y  L+T  +       EG  I+ 
Sbjct: 159 GLQAFHLLSHTDGEGGASLLVDGFKVAQELYDTDREAYRVLSTVNVHAHASGNEGISIQA 218

Query: 120 RHHFTHSAPVIQIDKNTEDIKQIRFNVYDRSAMAFRSGRDCRLYYRSLKNLARYYENKEN 179
              F    PV++ D  T  + ++R+N  DR+++        R +Y + +      + KEN
Sbjct: 219 YRGF----PVLEHDGATGALLRVRWNTADRASIELPIEETGR-WYDAARKFDGLLKKKEN 273

Query: 180 QWIFKLVPGLVMVIDNFRLLHGRNGFTGRRVLCGAY 215
           ++  +L PG V++ DN+R+LHGR+ FTG+R +CG Y
Sbjct: 274 EYWEQLTPGKVLIFDNWRVLHGRSSFTGKRRICGGY 309


>UniRef50_Q757P7 Cluster: AEL035Wp; n=2; Saccharomycetaceae|Rep:
           AEL035Wp - Ashbya gossypii (Yeast) (Eremothecium
           gossypii)
          Length = 412

 Score =  136 bits (328), Expect = 6e-31
 Identities = 82/240 (34%), Positives = 128/240 (53%), Gaps = 13/240 (5%)

Query: 1   YTFKSFNQVQPSAEATETVCKALGGIQHTIFGA-TWEFTTVADHADTAYTNLPLAAHNDN 59
           + F     V  S EAT+TV + +  I+ T +    WEFT+     DTAYT + ++ H D 
Sbjct: 161 FGFTFVRNVPVSIEATKTVSELISIIRPTHYDTGVWEFTSDLAKHDTAYTTVGISMHTDG 220

Query: 60  IYWTEAAGLQILHCIEHTNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIER 119
            YW E  GLQ+ H +EH+ G GGET +VD       L+E   +D  + TTY++  E+   
Sbjct: 221 NYWHELPGLQLFHLLEHSGGEGGETQIVDVAKVVDQLRELAAQDESWRTTYKVLTEHPLA 280

Query: 120 RHH--------FTHSAPVIQIDKNTEDIKQIRFNVYDR---SAMAFRSGRDCRLYYRSLK 168
            H         +    P + +D  T +++Q R+N  DR   + +A  S       Y++L 
Sbjct: 281 FHQSGELDSVFYQADYPTLTLDA-TGELEQCRWNTSDRISQAPLAPGSPYTVPQVYQALF 339

Query: 169 NLARYYENKENQWIFKLVPGLVMVIDNFRLLHGRNGFTGRRVLCGAYVSRSDWLDKARSL 228
            L    ++++N   FK+ PG + + DN+R+LH R  FTG R LCG+Y++R D+L + RS+
Sbjct: 340 RLDSLIKDEKNYVQFKMQPGTIFIFDNWRVLHARTSFTGCRRLCGSYLTRDDFLARFRSI 399


>UniRef50_A0J760 Cluster: Taurine catabolism dioxygenase TauD/TfdA;
           n=2; Shewanella|Rep: Taurine catabolism dioxygenase
           TauD/TfdA - Shewanella woodyi ATCC 51908
          Length = 371

 Score =  134 bits (324), Expect = 2e-30
 Identities = 74/228 (32%), Positives = 120/228 (52%), Gaps = 3/228 (1%)

Query: 1   YTFKSFNQVQPSAEATETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNI 60
           Y   +F+ +  + EAT+ +   +G I+ T+FG+ W+F+    H+D+AYT++ +  H D+ 
Sbjct: 139 YGLVTFSGMPSNMEATKKLLNQVGYIRDTVFGSLWDFSNNGAHSDSAYTSVGIGLHTDST 198

Query: 61  YWTEAAGLQILHCIEHTNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERR 120
           Y  +  GLQ+LHC+   +G G      DGF  A  +K + P  YE L   ++   YIE  
Sbjct: 199 YTIDPPGLQLLHCLAF-DGEGAFNQFADGFKVAQTIKSEDPAAYETLKRIKVPAHYIEPG 257

Query: 121 HHFTHSAPVIQIDKNTEDIKQIRFNVYDRSAMAFRSGRDCRLYYRSLKNLARYYENKENQ 180
                   V++ D N    +QI FN +DRS     S  + + +Y +     R   + + Q
Sbjct: 258 IQLRGQHEVVREDINGL-FEQICFNNFDRSPFML-STSEQKAFYHAYGLFQRLINDPKFQ 315

Query: 181 WIFKLVPGLVMVIDNFRLLHGRNGFTGRRVLCGAYVSRSDWLDKARSL 228
             F+L PG  +  DN+R+LH R+ F+G R L G Y +R D++ K  +L
Sbjct: 316 VSFQLQPGRAVWFDNWRVLHARSAFSGFRHLAGGYTNREDYISKKLTL 363


>UniRef50_Q4X023 Cluster: Gamma-butyrobetaine hydroxylase subfamily,
           putative; n=3; Trichocomaceae|Rep: Gamma-butyrobetaine
           hydroxylase subfamily, putative - Aspergillus fumigatus
           (Sartorya fumigata)
          Length = 483

 Score =  125 bits (302), Expect = 8e-28
 Identities = 72/225 (32%), Positives = 114/225 (50%), Gaps = 8/225 (3%)

Query: 9   VQPSAEATETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGL 68
           +  S E  E +   +G +++T +G TW+  +V    + AYTN+ L  H D +Y  E  G 
Sbjct: 222 IPDSREMVEKIATKMGPLRNTFYGPTWDVRSVPKAPNVAYTNVFLGFHMDLMYMNEPPGF 281

Query: 69  QILHCIEHTNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERRHHFTHSAP 128
           Q+LHC+E+ +  GGE++ VDGF  A  ++  +PE +E LT   +  EY  + H + +S P
Sbjct: 282 QLLHCLEN-SCEGGESLFVDGFRVAELIRWKYPEQFEDLTKLRLNYEYNHKEHIYNNSWP 340

Query: 129 VIQIDKNTEDIKQIRFNVYDRSAMAFRSGRDCRL----YYRSLKNLARYYENKENQWIFK 184
           V++ +      + +  N          S  + ++    Y R+L+  AR  E   N +  K
Sbjct: 341 VVETEDGDPKKRILHVNYSPPFQAPLLSDDNHQMPWIEYSRALRAFAREIERPYNVFQLK 400

Query: 185 LVPGLVMVIDNFRLLHGRNGFT---GRRVLCGAYVSRSDWLDKAR 226
           L PG  ++ +N R+LH RN F    G+R L G YV   D L   R
Sbjct: 401 LNPGECVIFENRRILHARNQFNTEQGKRWLAGTYVDEDDVLSTFR 445


>UniRef50_Q2GXQ0 Cluster: Putative uncharacterized protein; n=6;
           cellular organisms|Rep: Putative uncharacterized protein
           - Chaetomium globosum (Soil fungus)
          Length = 999

 Score =  121 bits (291), Expect = 2e-26
 Identities = 72/228 (31%), Positives = 117/228 (51%), Gaps = 11/228 (4%)

Query: 14  EATETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGLQILHC 73
           E T+ + + +  I+ T +G  ++F      ADTAYTN  LA H D  Y+T+ AGLQ  H 
Sbjct: 765 EHTKKLLERIAFIRQTHYGGFYDFKPDLAMADTAYTNQALALHTDTTYFTDPAGLQAFHM 824

Query: 74  IEHT------NGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERRHHFTHSA 127
           + H         TGGE++L+DG+  A  + ++ P  Y  L   ++       +     S 
Sbjct: 825 LSHEPAEGKDRATGGESVLLDGYNAAGIMHKESPAMYRLLAYLQLPWHSSGNKGIKITSD 884

Query: 128 ---PVIQIDKNTEDIKQIRFNVYDRSAMAFRSGRDCRLYYRSLKNLARYYENKENQWIFK 184
              PV + ++   DI +IR+N  DR  + + +      +Y +           E Q+ F+
Sbjct: 885 LKYPVFE-ERLAGDILKIRWNNDDRGVVPYGTITP-EEWYEAAGKWNEIINRPELQYWFQ 942

Query: 185 LVPGLVMVIDNFRLLHGRNGFTGRRVLCGAYVSRSDWLDKARSLNLIQ 232
           L PG V++ DN+R+LHGRN F G R +CG Y++R D++ +  + NL +
Sbjct: 943 LTPGRVLIFDNWRVLHGRNAFEGVRRICGGYINRDDFMSQWGNTNLTE 990


>UniRef50_Q1VMP3 Cluster: Gamma-butyrobetaine hydroxylase; n=1;
           Psychroflexus torquis ATCC 700755|Rep:
           Gamma-butyrobetaine hydroxylase - Psychroflexus torquis
           ATCC 700755
          Length = 234

 Score =  115 bits (276), Expect = 1e-24
 Identities = 73/232 (31%), Positives = 118/232 (50%), Gaps = 8/232 (3%)

Query: 1   YTFKSFNQVQPSAEATETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNI 60
           Y F     +  S         ++G ++ T FG  ++  +  D  D AYT+L LA H DN 
Sbjct: 1   YGFVVVKNIPTSKNYIVEFANSIGSVRRTNFGEYFDVKSKPDPNDLAYTSLALAPHTDNP 60

Query: 61  YWTEAAGLQILHCIEHTNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERR 120
           Y      +Q+LHCIE +  +GG + LVDG+     LK ++PE Y+ LT  ++   +I++ 
Sbjct: 61  YRNPVPCIQLLHCIE-SKVSGGLSTLVDGYTVTEDLKNEYPEFYKILTEVKVRFRFIDKE 119

Query: 121 HHFTHSAPVIQIDKNTEDIKQIRFNV-YDRSAMAFRSGRDCRLYYRSLKNLARYYENKEN 179
                 +P+I+++ + +  KQ+RF+   D   +  +   D  LYY + K ++  Y + + 
Sbjct: 120 VILETISPLIELN-DDKSFKQVRFSPRLDYVPILEKQKLD--LYYSARKKISEMYNSDKY 176

Query: 180 QWIFKLVPGLVMVIDNFRLLHGRNGF---TGRRVLCGAYVSRSDWLDKARSL 228
           +  FKL P  +M++DN RLLHGR  +    G R L G Y+       K R L
Sbjct: 177 RIEFKLEPKDLMMMDNHRLLHGRTVYDANEGERFLQGCYIDYDSTEGKLRHL 228


>UniRef50_Q98KK0 Cluster: Probable gamma-butyrobetaine dioxygenase;
           n=4; Alphaproteobacteria|Rep: Probable
           gamma-butyrobetaine dioxygenase - Rhizobium loti
           (Mesorhizobium loti)
          Length = 383

 Score =  105 bits (251), Expect = 1e-21
 Identities = 71/225 (31%), Positives = 107/225 (47%), Gaps = 5/225 (2%)

Query: 1   YTFKSFNQVQPSAEATETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNI 60
           Y F   + +   + A   V    G I+ T +G  +E     +  + AYTNL L AH DN 
Sbjct: 146 YGFAVMDGLPAESGALCKVSDLFGYIRETNYGRWFEVRAEVNPNNLAYTNLGLQAHTDNP 205

Query: 61  YWTEAAGLQILHCIEHTNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYI-ER 119
           Y      LQIL C+E+T   GGE+ ++DGF  A  L+ ++PE +  L++     EY    
Sbjct: 206 YRDPVPTLQILACVENT-VEGGESSVIDGFAVAAALQAENPEGFRLLSSCPARFEYAGSS 264

Query: 120 RHHFTHSAPVIQIDKNTEDIKQIRFNVYDRSAMAFRSGRDCRLYYRSLKNLARYYENKEN 179
                   P+I++  + E I  IRFN    + +      D   YY + +  A   E+ + 
Sbjct: 265 GVRLQAKRPMIELGPDGELI-CIRFNNRSLAPVVDVPFADMDAYYAAYRRFAELIEDPDF 323

Query: 180 QWIFKLVPGLVMVIDNFRLLHGRNGF--TGRRVLCGAYVSRSDWL 222
           +  FKL PG   ++DN R++H R  F  TG+R L G Y  +   L
Sbjct: 324 EVTFKLQPGQAFIVDNTRVMHARKAFSGTGKRWLQGCYADKDGLL 368


>UniRef50_Q2UCW9 Cluster: Predicted gamma-butyrobetaine; n=2;
           Aspergillus|Rep: Predicted gamma-butyrobetaine -
           Aspergillus oryzae
          Length = 475

 Score =  104 bits (249), Expect = 2e-21
 Identities = 69/226 (30%), Positives = 111/226 (49%), Gaps = 12/226 (5%)

Query: 9   VQPSAEATETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGL 68
           +  S    E +   +G +++T +G+TW+  TV +  + AYT+  L  H D +Y  E  G 
Sbjct: 222 IPDSRAEVEKLATRMGPLRNTFYGSTWDVRTVPEAKNVAYTSQFLGFHMDLMYMNEPPGY 281

Query: 69  QILHCIEHTNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERRHHFTHSAP 128
           Q+LHC+++ +  GGE++  D F  A  L  D PE ++ L    +  EY      +T+  P
Sbjct: 282 QLLHCLQN-SCDGGESLFADSFAVARQLSIDDPEAFKALCNLRLSYEYNHENDIYTNDWP 340

Query: 129 VIQ--IDKNTEDIKQIRFNVYDRSAMAFRSGR-----DCRLYYRSLKNLARYYENKENQW 181
           V Q  +D+ T+  + +  N Y     A   G+           R+L   A+  E+++  +
Sbjct: 341 VFQTYVDEYTQQQRLMHAN-YSPPFQAPMHGQRRPFNRTMSEMRALDKFAKMLEDEKYIY 399

Query: 182 IFKLVPGLVMVIDNFRLLHGRNGF---TGRRVLCGAYVSRSDWLDK 224
             KL PG  ++ +N R+LH R  F   TG+R L GAYV     L K
Sbjct: 400 ELKLNPGECVIFENRRVLHARRQFNTATGQRWLAGAYVDEDAVLSK 445


>UniRef50_A6RF74 Cluster: Predicted protein; n=1; Ajellomyces
           capsulatus NAm1|Rep: Predicted protein - Ajellomyces
           capsulatus NAm1
          Length = 485

 Score =  104 bits (249), Expect = 2e-21
 Identities = 62/213 (29%), Positives = 105/213 (49%), Gaps = 17/213 (7%)

Query: 9   VQPSAEATETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGL 68
           +  S E  E +   +G ++++ +G+TW+  +V D  + AYTN  L  H D +Y  +  G 
Sbjct: 186 IPESPEMVEKIATRMGPLRNSFYGSTWDVRSVPDAKNVAYTNKHLDFHMDLLYMKDPPGY 245

Query: 69  QILHCIEHTNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERRHHFTHSAP 128
           Q+LHC+ + + +GGE++  D F  A  L  + P  ++ L       EY     H+ +S P
Sbjct: 246 QLLHCLRN-SFSGGESLFSDTFQAAVRLLRNDPILFDILCKTPTRFEYKNNNQHYQYSHP 304

Query: 129 VIQIDKNTEDIKQ------IRFNVYDRSAMAFRS----------GRDCRLYYRSLKNLAR 172
            I+I+   E +K       + +  Y   +  F++          GRD +LY R++K  A 
Sbjct: 305 TIEIEGGEEFLKNPPKKNPVPYVNYVNYSPPFQAPSYLTKHLVDGRDIKLYVRAMKAFAA 364

Query: 173 YYENKENQWIFKLVPGLVMVIDNFRLLHGRNGF 205
               +EN +  KL PG  ++  N R++H RN F
Sbjct: 365 ELGKQENIFQVKLEPGQCVIFQNRRVVHARNAF 397


>UniRef50_A6F7M8 Cluster: Gamma-butyrobetaine hydroxylase; n=1;
           Moritella sp. PE36|Rep: Gamma-butyrobetaine hydroxylase
           - Moritella sp. PE36
          Length = 373

 Score =  103 bits (248), Expect = 3e-21
 Identities = 61/199 (30%), Positives = 105/199 (52%), Gaps = 6/199 (3%)

Query: 19  VCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGLQILHCIEHTN 78
           V +  G ++ T +G+ +E  +  +  + AYT  PL+ H DN Y      LQ+LHC+    
Sbjct: 147 VVEQFGFVRDTNYGSHFEVISEENPVNLAYTPKPLSLHTDNAYRHPVPTLQLLHCLISAE 206

Query: 79  GTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERRHHFTHSAPVIQIDKNTED 138
             GG T L DGFY A  L++  P+ Y+ LT+  +   +     H  H+  +I+++ N  +
Sbjct: 207 -QGGITALTDGFYAAQLLQQRFPQQYQLLTSTPVMYRFKNADTHLEHTGYIIELN-NRGE 264

Query: 139 IKQIRFNVYDRSAMAFR-SGRDCRLYYRSLKNLARYYENKENQWIFKLVPGLVMVIDNFR 197
           +++IR N  +R+  A +    +   +Y + +N +R   + E +++  L PG +M+ +N R
Sbjct: 265 LERIRLN--NRAIQAIKLPFAEMAAFYDAYQNFSRILHSDECKFLCTLQPGELMIFNNER 322

Query: 198 LLHGRN-GFTGRRVLCGAY 215
           +LHGR     G R L G Y
Sbjct: 323 ILHGREVAAEGARHLQGCY 341


>UniRef50_A2RB24 Cluster: Contig An18c0170, complete genome; n=1;
           Aspergillus niger|Rep: Contig An18c0170, complete genome
           - Aspergillus niger
          Length = 443

 Score =  102 bits (245), Expect = 7e-21
 Identities = 66/200 (33%), Positives = 107/200 (53%), Gaps = 14/200 (7%)

Query: 9   VQPSAEATETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGL 68
           V  + E+TE + K +  I++T +G     T++A   DTAYT   L AH DN Y+T+ A L
Sbjct: 224 VPTNPESTEALLKRIAFIRNTHYGKA---TSLA-FPDTAYTTEFLGAHTDNTYFTDPARL 279

Query: 69  QILHCIEHTNGTGGETILVDGFYGATCLKEDHPEDYEFLTT----YEIEGEYIERRHHFT 124
           Q+ H + HT+G GG ++LVDGF  A+ L+E+ P+D+E L +    Y   G   +      
Sbjct: 280 QLFHLLSHTDGDGGASLLVDGFRAASILREESPQDFEVLMSTNHPYHSSGNE-DVCVQPA 338

Query: 125 HSAPVIQIDKNTEDIKQIRFNVYDRSAMAFRSGRDCRLYYRSLKNLARYYENKENQWIFK 184
             APV+++    + + QIR+N YDR+A    +  D   +Y + ++       K+ +   +
Sbjct: 339 EQAPVLKVHPELQRLYQIRWNNYDRAAKKNWNWEDQVKWYTAARHWDEIIRRKDMEIWTQ 398

Query: 185 LVPGLVMV-----IDNFRLL 199
           L PG  ++     I  +RLL
Sbjct: 399 LEPGTALINNDDFISRYRLL 418


>UniRef50_Q4PCW2 Cluster: Putative uncharacterized protein; n=1;
           Ustilago maydis|Rep: Putative uncharacterized protein -
           Ustilago maydis (Smut fungus)
          Length = 777

 Score =  101 bits (241), Expect = 2e-20
 Identities = 53/147 (36%), Positives = 82/147 (55%), Gaps = 6/147 (4%)

Query: 80  TGGETILVDGFYGATCLKEDHPEDYEFLT-----TYEIEGEYIERRHHFTHSAPVIQIDK 134
           +GGE++LVDGF  A  LK+ HP+ YE L+     T+    E    R  F    P++Q D 
Sbjct: 607 SGGESLLVDGFLAAAVLKDVHPDAYETLSRVRIRTHSAGDENTMIRPLFEGGYPILQHDD 666

Query: 135 NTEDIKQIRFNVYDRSAMAFRSGRDCRLYYRSLKNLARYYENKENQWIFKLVPGLVMVID 194
            T ++  +R+N  DRS +   +  D   +Y +L+   +   N E ++  +L PG  ++ D
Sbjct: 667 ATGELVLVRYNNDDRSVLRIDAD-DVERFYDALRKWNQILTNPEGEYWVQLKPGSALIFD 725

Query: 195 NFRLLHGRNGFTGRRVLCGAYVSRSDW 221
           N R+LHGR+ F G R LCGAY++  D+
Sbjct: 726 NHRVLHGRSAFVGNRRLCGAYINHDDY 752



 Score = 72.5 bits (170), Expect = 8e-12
 Identities = 32/77 (41%), Positives = 45/77 (58%)

Query: 1   YTFKSFNQVQPSAEATETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNI 60
           Y F     V P+   TE + + +  I+ T +G  W+FT+   H DTAYT+L L AH D  
Sbjct: 486 YGFAFVTGVPPTPTDTEALIRRIAFIRETHYGGFWDFTSDLAHGDTAYTDLALQAHTDTT 545

Query: 61  YWTEAAGLQILHCIEHT 77
           Y+T+ AGLQ+ H + HT
Sbjct: 546 YFTDPAGLQMFHLLSHT 562


>UniRef50_A3YAS9 Cluster: Gamma-butyrobetaine hydroxylase; n=1;
           Marinomonas sp. MED121|Rep: Gamma-butyrobetaine
           hydroxylase - Marinomonas sp. MED121
          Length = 394

 Score =  100 bits (239), Expect = 4e-20
 Identities = 77/233 (33%), Positives = 107/233 (45%), Gaps = 10/233 (4%)

Query: 1   YTFKSFNQVQPSAEATETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNI 60
           Y      QV         V   +  I+ T FG  +     AD   TAYTNL L  H D  
Sbjct: 162 YGLALVTQVDTQTNTLVKVANRISFIRETNFGTIFNVQAKADANSTAYTNLRLPLHTDLP 221

Query: 61  YWTEAAGLQILHCIEHTNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERR 120
                 GLQ LHC+ + + TGGE+I VDGF  A  ++E +PED+  L+   +     ++ 
Sbjct: 222 TRELQPGLQFLHCLIN-DATGGESIFVDGFKIAEHMREHYPEDFASLSAIPMSFYNKDKE 280

Query: 121 HHFTHSAPVIQIDKNTEDIKQIRFNVYDRSAMAFRSGRDCRLY--YRSLKNLARYYENKE 178
             +      I  D N + I ++R   + R  +   S +   LY  YR   +L R  E K 
Sbjct: 281 TDYRFRGTAIVTDSNGK-IVEVRLANFLRGPIDVPSHQTMALYKAYRRFISLTR--ETK- 336

Query: 179 NQWIFKLVPGLVMVIDNFRLLHGRNGF---TGRRVLCGAYVSRSDWLDKARSL 228
            Q   +L  G ++V DN R+LH RN F    GRR L G Y+ R + L + R L
Sbjct: 337 FQHFQRLNQGDLIVFDNRRVLHARNAFDLKAGRRHLQGCYIDRDELLSRIRVL 389


>UniRef50_A6QRR2 Cluster: Putative uncharacterized protein; n=1;
           Ajellomyces capsulatus NAm1|Rep: Putative
           uncharacterized protein - Ajellomyces capsulatus NAm1
          Length = 306

 Score =  100 bits (239), Expect = 4e-20
 Identities = 46/111 (41%), Positives = 65/111 (58%)

Query: 14  EATETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGLQILHC 73
           EATE + + +  I+ T +G  W+FT+     D AYT   L  H D  Y+T+ AGLQ+ H 
Sbjct: 175 EATEKLLERIAFIRPTHYGGFWDFTSDLSLKDMAYTTEGLGGHTDTTYFTDPAGLQMFHM 234

Query: 74  IEHTNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERRHHFT 124
           + HTNG+GGE++LVDGF  A  L  + PE YE L  + ++G      H+ T
Sbjct: 235 LSHTNGSGGESLLVDGFEAAKTLYNEDPEAYEVLKEFGVDGHASGNEHYST 285


>UniRef50_UPI0000E48C37 Cluster: PREDICTED: similar to gamma
           butyrobetaine hydroxylase, partial; n=1;
           Strongylocentrotus purpuratus|Rep: PREDICTED: similar to
           gamma butyrobetaine hydroxylase, partial -
           Strongylocentrotus purpuratus
          Length = 318

 Score = 98.3 bits (234), Expect = 1e-19
 Identities = 62/194 (31%), Positives = 97/194 (50%), Gaps = 8/194 (4%)

Query: 17  ETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGLQILHCIEH 76
           E++ K +G ++ T++G T+E   +A  ++ AYT L L  H D   +     +Q+LHCI+ 
Sbjct: 86  ESIGKRVGHLRTTMYGHTFEVLAIASSSNLAYTTLKLGLHVDLPLYEVPPSVQMLHCIKQ 145

Query: 77  TNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIE-----GEYIERRHHFTHSAPVIQ 131
               GGE+   D       LKE  PE Y  LT  +++      +YI   +HF ++ P+I+
Sbjct: 146 CKTVGGESQFCDALKVTNDLKESDPEFYNTLTRVKVDIRLRGKDYIP--YHFQYARPIIE 203

Query: 132 IDKNTEDIKQIRFNVYDRSAMAFRSGRDCRLYYRSLKNLARYYENKENQWIFKLVPGLVM 191
           +D   +  K I  N   R+        D + +Y+SL  L      KEN   FKL  G V+
Sbjct: 204 LDDEGK-FKAITHNNGVRAPYMNLPVADVKTWYKSLACLDGKLNAKENMIQFKLKEGDVV 262

Query: 192 VIDNFRLLHGRNGF 205
             +N R++HGR  F
Sbjct: 263 TFNNNRVMHGRGSF 276


>UniRef50_A3YI34 Cluster: Gamma-butyrobetaine hydroxylase; n=1;
           Marinomonas sp. MED121|Rep: Gamma-butyrobetaine
           hydroxylase - Marinomonas sp. MED121
          Length = 372

 Score = 97.9 bits (233), Expect = 2e-19
 Identities = 57/201 (28%), Positives = 103/201 (51%), Gaps = 6/201 (2%)

Query: 19  VCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGLQILHCIEHTN 78
           V +  G ++ T FG ++   T  +  D AY ++ L  H DN Y     G+Q+LHCI++  
Sbjct: 155 VAERFGYVRETNFGKSFSVYTRPNSDDLAYRSVALGPHTDNPYRNPIPGIQLLHCIQNET 214

Query: 79  GTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERRHHFTHSAPVIQIDKNTED 138
             GG + LVD     + LK++ PE ++ L+   +   ++++    +    +IQ+D N + 
Sbjct: 215 -QGGLSTLVDSLSVVSQLKQEDPEGFDLLSRVPVRYRHLDKSICLSERRTMIQLDINGQ- 272

Query: 139 IKQIRFNVYDRSAMAFRSGRDCRLYYRSLKNLARYYENKENQWIFKLVPGLVMVIDNFRL 198
           ++ + ++      +      D  +++R+ K L +   + + +W FKL PG + +  N R+
Sbjct: 273 VEGVAYSP-RLDFLPLLKQDDLIVFHRARKRLGQLLSDPKFEWRFKLAPGQLQMFHNSRV 331

Query: 199 LHGRNGF---TGRRVLCGAYV 216
           LHGR  F    G R L GAY+
Sbjct: 332 LHGRTEFDPNEGLRYLQGAYI 352


>UniRef50_A2R5A1 Cluster: Catalytic activity: H. sapiens BBH
           converts gamma-butyrobetaine precursor; n=2;
           Fungi/Metazoa group|Rep: Catalytic activity: H. sapiens
           BBH converts gamma-butyrobetaine precursor - Aspergillus
           niger
          Length = 543

 Score = 97.5 bits (232), Expect = 2e-19
 Identities = 70/230 (30%), Positives = 107/230 (46%), Gaps = 17/230 (7%)

Query: 9   VQPSAEATETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGL 68
           +  S E  E +   +G I+ T +G TW+  ++    + AYT+  L  H D +Y  +  G 
Sbjct: 276 IPDSREMVEKIATRMGPIRDTFYGRTWDVRSIPQATNVAYTDQFLGFHMDLMYMNDPPGY 335

Query: 69  QILHCIEHTNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERRHHFTHSAP 128
           Q+LHC+++ +  GGE++ VD F  A  +K+D  ++Y  L  + I   Y    H +T+S P
Sbjct: 336 QLLHCLQN-SCEGGESLFVDTFRVAYDMKQDDHKNYSRLLHHHIPYHYNHPDHFYTNSWP 394

Query: 129 VIQ-------IDKNTEDIKQIRFNV-YDRSAMAFRS-----GRDCRLYYRSLKNLARYYE 175
           V +       + + T   K    +V Y     A R       R  R    +L   A   E
Sbjct: 395 VFETETFDNSVTEGTNFSKSRLVHVNYSPPFQAPRKVQSPVPRKFREKNEALAKFASLLE 454

Query: 176 NKENQWIFKLVPGLVMVIDNFRLLHGRNGF---TGRRVLCGAYVSRSDWL 222
           ++   +  KL PG  +V +N R+ H R GF   TG R L GAYV     L
Sbjct: 455 DERYMFELKLNPGECVVFENRRVAHARRGFKTSTGERWLAGAYVDEDAML 504


>UniRef50_A3Y505 Cluster: Gamma-butyrobetaine hydroxylase, putative;
           n=2; Bacteria|Rep: Gamma-butyrobetaine hydroxylase,
           putative - Marinomonas sp. MED121
          Length = 397

 Score = 96.7 bits (230), Expect = 4e-19
 Identities = 61/200 (30%), Positives = 101/200 (50%), Gaps = 7/200 (3%)

Query: 19  VCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGLQILHCIEHTN 78
           V    G ++ T +G  +E  T  +  + A+TNL L  H DN Y      +Q+LHC+E+T 
Sbjct: 168 VIDTFGYVRDTNYGKLFEVKTQVEPNNLAFTNLGLGLHADNPYRDPVPTVQLLHCLENT- 226

Query: 79  GTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERRHHFTHSAPVIQIDKNTED 138
             GGE+IL DGF  A  L+E+   D++ L+   I   + ++        P+I+++   + 
Sbjct: 227 VEGGESILGDGFKAARILREESQADFDLLSQTWINFRFQDKDTDLQSRVPLIEVNDKGQV 286

Query: 139 IKQIRFNVYDRSAMAFRSGR-DCRLYYRSLKNLARYYENKENQWIFKLVPGLVMVIDNFR 197
           +K +RFN  +RS       +   + +Y++ ++ A           FKL  G +++ DN R
Sbjct: 287 VK-VRFN--NRSIAPINIDKHKMKAFYKAYQHYAEILNRTSIMVDFKLTQGQLVMFDNTR 343

Query: 198 LLHGRNGF--TGRRVLCGAY 215
           + H R  F  +G R L GAY
Sbjct: 344 VFHARKAFSTSGSRWLQGAY 363


>UniRef50_UPI0000586B6F Cluster: PREDICTED: hypothetical protein;
           n=2; Strongylocentrotus purpuratus|Rep: PREDICTED:
           hypothetical protein - Strongylocentrotus purpuratus
          Length = 481

 Score = 96.3 bits (229), Expect = 6e-19
 Identities = 55/199 (27%), Positives = 99/199 (49%), Gaps = 8/199 (4%)

Query: 17  ETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGLQILHCIEH 76
           + +C  +G  + T +G+ +    + + +   +T   L  H D  Y+    G+Q L+C+  
Sbjct: 250 QNICDRVGYERFTCYGSDFRVENIFESSSLGFTTAALGLHLDLPYYDYRPGVQFLNCLRQ 309

Query: 77  TNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEG-----EYIERRHHFTHSAPVIQ 131
               GGE+  VD    A  LK++ PE YE++T  +++      +YI+   H  H+  +I+
Sbjct: 310 CEVKGGESQFVDAKRVAETLKKEEPEWYEYMTNVKLDFRLLGIDYID--SHLQHARNLIE 367

Query: 132 IDKNTEDIKQIRFNVYDRSAMAFRSGRDCRLYYRSLKNLARYYENKENQWIFKLVPGLVM 191
           +D+  E  K + +N   RS        +    Y++LK    +   KEN   +KL PG ++
Sbjct: 368 LDEQGE-FKTLAYNDQTRSPYMNVPVEEVNKIYQALKKFNEFLYRKENFIDYKLQPGEII 426

Query: 192 VIDNFRLLHGRNGFTGRRV 210
             DN R++HGR+ +T + V
Sbjct: 427 AFDNNRVMHGRSAYTVKYV 445


>UniRef50_O75936 Cluster: Gamma-butyrobetaine dioxygenase; n=26;
           Euteleostomi|Rep: Gamma-butyrobetaine dioxygenase - Homo
           sapiens (Human)
          Length = 387

 Score = 95.1 bits (226), Expect = 1e-18
 Identities = 65/210 (30%), Positives = 107/210 (50%), Gaps = 16/210 (7%)

Query: 21  KALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGLQILHCIEHTNGT 80
           K +G +  T +G TW+     D  + AYT   L+ H D        G+Q+LHCI+ T  T
Sbjct: 167 KRMGFLYLTFYGHTWQVQDKIDANNVAYTTGKLSFHTDYPALHHPPGVQLLHCIKQTV-T 225

Query: 81  GGETILVDGFYGATCLKEDHPEDYEFLTTY-----EIEGEYIERRHHFTHSAPVIQIDKN 135
           GG++ +VDGF     LK+++P+ ++ L++      +I  +Y +      H   +I++D  
Sbjct: 226 GGDSEIVDGFNVCQKLKKNNPQAFQILSSTFVDFTDIGVDYCDFSVQSKHK--IIELDDK 283

Query: 136 TEDIKQIRFNVYDRSAMAFRSGRDCRLYYRSLKNLARYYENKENQWIFKLVPGLVMVIDN 195
            + ++ I FN   R  +        + +Y +LK       +KE+++ FK+ PG V+  DN
Sbjct: 284 GQVVR-INFNNATRDTIFDVPVERVQPFYAALKEFVDLMNSKESKFTFKMNPGDVITFDN 342

Query: 196 FRLLHGRNGFTG----RRVLCGAYVSRSDW 221
           +RLLHGR  +       R L GAY   +DW
Sbjct: 343 WRLLHGRRSYEAGTEISRHLEGAY---ADW 369


>UniRef50_Q1GKN1 Cluster: Gamma-butyrobetaine2-oxoglutarate
           dioxygenase; n=3; Proteobacteria|Rep:
           Gamma-butyrobetaine2-oxoglutarate dioxygenase -
           Silicibacter sp. (strain TM1040)
          Length = 402

 Score = 94.3 bits (224), Expect = 2e-18
 Identities = 66/221 (29%), Positives = 103/221 (46%), Gaps = 6/221 (2%)

Query: 12  SAEATETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGLQIL 71
           S EA   V + +G ++ T FG T+E  +  +  + AYT + L  H D        G Q L
Sbjct: 182 STEAGMDVARRIGFLRQTNFGVTFEVKSKPNPNNLAYTPIALPLHTDLTNQELPPGFQFL 241

Query: 72  HCIEHTNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERRHHFTHSAPVIQ 131
           HC+ +    GG ++  DG+  A  L+ D PE +E L+T  +   + ++     +   VI 
Sbjct: 242 HCLAN-EARGGGSLFCDGYAIAEDLRRDDPESFELLSTVSVPFRFHDQDTDIRNRKKVIT 300

Query: 132 IDKNTEDIKQIRFNVYDRSAMAFRSGRDCRLYYRSLKNLARYYENKENQWIFKLVPGLVM 191
           +D++   I +I FN +             R YYR+ +       +       KL  G ++
Sbjct: 301 LDEDGRVI-EICFNAHLADIFDLEPALMQR-YYRAYRKFMILTRSTNYLVTLKLKGGEMV 358

Query: 192 VIDNFRLLHGRNGF---TGRRVLCGAYVSRSDWLDKARSLN 229
           V DN R+LHGR  F   TG R L G YV R ++  + R L+
Sbjct: 359 VFDNRRVLHGREAFDPQTGYRHLHGCYVDRGEFESRLRVLH 399


>UniRef50_Q5AVW2 Cluster: Putative uncharacterized protein; n=1;
           Emericella nidulans|Rep: Putative uncharacterized
           protein - Emericella nidulans (Aspergillus nidulans)
          Length = 555

 Score = 94.3 bits (224), Expect = 2e-18
 Identities = 59/220 (26%), Positives = 111/220 (50%), Gaps = 14/220 (6%)

Query: 9   VQPSAEATETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGL 68
           +  S E  E +   +G +++T +G+TW+   V +  + AYT+  L  H D +Y  +    
Sbjct: 270 IPDSREMVEKIATRIGPLRNTFYGSTWDVRKVPEAKNVAYTSQYLGFHMDLMYMKDPPAF 329

Query: 69  QILHCIEHTNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERRHHFTHSAP 128
           Q+LHC+ + +  GGE++  D F  A  L  + PE ++ L   ++  EY  +   ++++ P
Sbjct: 330 QLLHCLRN-SCDGGESLFADTFNVAGYLYRNRPEIFQILAKTKLRYEYQHKDQSYSNAWP 388

Query: 129 VIQ---IDKNTEDIKQIRFNVYDRSAMAFRSGRD------CRLYYRSLKNLARYYENKEN 179
           V++   +DK    + ++ ++   ++ +   S  D       +    +LK  A   E ++N
Sbjct: 389 VLERGPLDKG-HFLARVAYSPPFQAPILNDSNADPEYIAKLQTQLGALKYFASSLEREDN 447

Query: 180 QWIFKLVPGLVMVIDNFRLLHGRNGF---TGRRVLCGAYV 216
            +  KL PG  ++ +N R++H R  F   TG R L GAY+
Sbjct: 448 MFELKLQPGECVIFENRRIVHARRQFNTATGERWLAGAYL 487


>UniRef50_A7S7D2 Cluster: Predicted protein; n=2; Nematostella
           vectensis|Rep: Predicted protein - Nematostella
           vectensis
          Length = 432

 Score = 93.9 bits (223), Expect = 3e-18
 Identities = 71/225 (31%), Positives = 105/225 (46%), Gaps = 13/225 (5%)

Query: 15  ATETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGLQILHCI 74
           A E +   +G I+ T +G T++     D  + AYT   L  H D        G+Q+LHC+
Sbjct: 202 AVERLATRVGYIKDTHYGHTFDVNAKFDANNLAYTTADLPLHCDIPQSEYYPGVQMLHCL 261

Query: 75  EHTNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERRH----HFTHSAPVI 130
           +     GGE+I VDGF+ A  +KE HP  +  L T  I    I +      H  +    I
Sbjct: 262 QQAPTEGGESIFVDGFFIAQEIKEQHPRLFNLLATTPIPYVDIGKDEFGDFHLKNKRESI 321

Query: 131 QIDKNTEDIKQIRFNVYDRSAMAFRSGRDCRLYYRSLKNLARYYENKENQWIFKLVPGLV 190
           ++D+    I +  +N + R           +L Y++   L +   +  N   +KL PG V
Sbjct: 322 ELDE-LGHIVRFTYNNHVRDYFMDSPVEKVQLLYQAYLILGQMMRDPVNMLEYKLSPGEV 380

Query: 191 MVIDNFRLLHGRNGFT----GRRVLCGAYVSRSDW-LDKARSLNL 230
           +  +N R+LHGR G+T    G R L G Y+   DW L  AR  NL
Sbjct: 381 VSFNNSRVLHGRRGYTITGEGNRHLQGCYM---DWDLVNARLRNL 422


>UniRef50_A4R0Y1 Cluster: Putative uncharacterized protein; n=1;
           Magnaporthe grisea|Rep: Putative uncharacterized protein
           - Magnaporthe grisea (Rice blast fungus) (Pyricularia
           grisea)
          Length = 573

 Score = 93.9 bits (223), Expect = 3e-18
 Identities = 61/220 (27%), Positives = 106/220 (48%), Gaps = 13/220 (5%)

Query: 9   VQPSAEATETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGL 68
           V  S  A E +  A+G  Q T +G TW+  +     + AYTN+ L  H D +Y  +   L
Sbjct: 336 VPESETAVEEMACAVGHAQTTFYGKTWDVVSKPQAENVAYTNVFLCLHQDLLYMQDPPRL 395

Query: 69  QILHCIEHTNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERRHHFTHSAP 128
           Q+LHC+ + +  GGE++  DG   A  ++  +P+ +E L    +   Y +  H + ++ P
Sbjct: 396 QLLHCLAN-SCEGGESLFSDGIRAAEQVRSKNPKQFELLKNKPVYYHYDKNGHWYEYNRP 454

Query: 129 VIQIDKN-TEDIKQIRFNVYDRSAMAFRSGRDCRLYYRSL----KNLARYYENK----EN 179
           V+ + K+ +  I  I ++   +       G    +  +      +  AR + +     E+
Sbjct: 455 VVTLSKDGSGAIDSIGWSPPFQDNFPAPQGLSASINSQDALEEWRAAARSFRDSSTAPES 514

Query: 180 QWIFKLVPGLVMVIDNFRLLHGRNGF---TGRRVLCGAYV 216
            + +K+ PG   + DN R+LHGR  F   +G+R L G YV
Sbjct: 515 MFEYKMKPGECAIFDNMRILHGRRQFQLTSGKRWLKGTYV 554


>UniRef50_Q1QTU1 Cluster: Gamma-butyrobetaine,2-oxoglutarate
           dioxygenase precursor; n=1; Chromohalobacter salexigens
           DSM 3043|Rep: Gamma-butyrobetaine,2-oxoglutarate
           dioxygenase precursor - Chromohalobacter salexigens
           (strain DSM 3043 / ATCC BAA-138 / NCIMB13768)
          Length = 408

 Score = 93.5 bits (222), Expect = 4e-18
 Identities = 60/206 (29%), Positives = 100/206 (48%), Gaps = 6/206 (2%)

Query: 14  EATETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGLQILHC 73
           E    + +  G ++ T FGA ++  +  +  + AYT + L  H D   W     +Q+L+C
Sbjct: 172 EEVVRIAELFGPMRATNFGARFDVQSKPNPNNAAYTAIGLELHTDLPNWRHPPDIQLLYC 231

Query: 74  IEHTNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERRHHFTHSAPVIQID 133
           +E+    GGE++  DGF  A  L+ + PE +  L    I+  + +        APVI++D
Sbjct: 232 LEN-EAEGGESLFADGFAVAEALRHEAPELFLRLRDTPIDFRFQDEDSDIAVRAPVIEVD 290

Query: 134 KNTEDIKQIRFNVYDRSAMAFRSGRDCRLYYRSLKNLARYYENKENQWIFKLVPGLVMVI 193
            +T  I+++RFN + R  +      +   +Y +     +       +  F L PG ++  
Sbjct: 291 -DTGRIREVRFNNWIRDTLRL-PPEEADAWYEAYLVFWQRLREPRFRVDFALEPGQMVAF 348

Query: 194 DNFRLLHGRNGF---TGRRVLCGAYV 216
           DN R+LHGR  F   TGRR L G Y+
Sbjct: 349 DNRRVLHGRGAFDPNTGRRHLQGTYL 374


>UniRef50_A6SL62 Cluster: Putative uncharacterized protein; n=2;
           Sclerotiniaceae|Rep: Putative uncharacterized protein -
           Botryotinia fuckeliana B05.10
          Length = 467

 Score = 91.9 bits (218), Expect = 1e-17
 Identities = 67/238 (28%), Positives = 103/238 (43%), Gaps = 14/238 (5%)

Query: 1   YTFKSFNQVQPSAEATETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNI 60
           Y       V  S  +   + K +G ++ T +G TW+  +V    + AYT+  L  H D +
Sbjct: 179 YGLLILRDVPESETSVVDIAKRIGNLRDTFYGVTWDVKSVPQPKNVAYTSQYLGLHMDLL 238

Query: 61  YWTEAAGLQILHCIEHTNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERR 120
           Y     G Q LHC+ +T  +GG +I  D F+ A  L  D   +Y  L T ++   Y    
Sbjct: 239 YMANPPGFQFLHCLRNT-CSGGSSIFSDAFHAARQLDRD---NYIQLCTKKVGYHYRNAG 294

Query: 121 HHFTHSAPVIQI------DKNTEDIKQIRFNVYDRSAMA-FRSGRDCRLYYRSLKNLARY 173
            H+    PVI I      D ++     I++  Y     A F          R+L+  A  
Sbjct: 295 EHYHFKHPVISIHSKKGGDASSPSDNNIQYINYSPPFQATFDKPFGSLPIARALRQFASR 354

Query: 174 YENKENQWIFKLVPGLVMVIDNFRLLHGRNGF---TGRRVLCGAYVSRSDWLDKARSL 228
            E  EN + ++L  G  ++ +N R+LHGR  F    G R   GAY+    +  + R L
Sbjct: 355 VEAPENMYEYRLQEGECVIFNNRRVLHGRKEFDTSAGERWFKGAYIDTDVFRSRYRVL 412


>UniRef50_Q96UB1 Cluster: Trimethyllysine dioxygenase; n=2;
           Neurospora crassa|Rep: Trimethyllysine dioxygenase -
           Neurospora crassa
          Length = 471

 Score = 91.9 bits (218), Expect = 1e-17
 Identities = 46/153 (30%), Positives = 86/153 (56%), Gaps = 5/153 (3%)

Query: 81  GGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERRHHFTHSA----PVIQIDKNT 136
           GG+++LVDGF  A  LKE+ P  YE L++  +   +       T +     PV++++++T
Sbjct: 308 GGKSLLVDGFNAARILKEEDPRAYEILSSVRLPW-HASGNEGITIAPDKLYPVLELNEDT 366

Query: 137 EDIKQIRFNVYDRSAMAFRSGRDCRLYYRSLKNLARYYENKENQWIFKLVPGLVMVIDNF 196
            ++ ++R+N  DR  + F        +Y + +        K ++   +L PG  ++ DN+
Sbjct: 367 GELHRVRWNNDDRGVVPFGEKYSPSEWYEAARKWDGILRRKSSELWVQLEPGKPLIFDNW 426

Query: 197 RLLHGRNGFTGRRVLCGAYVSRSDWLDKARSLN 229
           R+LHGR+ F+G R +CG Y++R D++ + R+ N
Sbjct: 427 RVLHGRSAFSGIRRICGGYINRDDFISRWRNTN 459



 Score = 58.8 bits (136), Expect = 1e-07
 Identities = 27/63 (42%), Positives = 38/63 (60%)

Query: 14  EATETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGLQILHC 73
           + T  + + +  I+ T +G  ++FT     ADTAYTNL L AH D  Y+T+ AGLQ  H 
Sbjct: 209 DVTRQLLERIAFIRVTHYGGFYDFTPDLAMADTAYTNLALPAHTDTTYFTDPAGLQAFHL 268

Query: 74  IEH 76
           +EH
Sbjct: 269 LEH 271


>UniRef50_Q1QSP3 Cluster: Taurine catabolism dioxygenase TauD/TfdA;
           n=1; Chromohalobacter salexigens DSM 3043|Rep: Taurine
           catabolism dioxygenase TauD/TfdA - Chromohalobacter
           salexigens (strain DSM 3043 / ATCC BAA-138 / NCIMB13768)
          Length = 431

 Score = 91.5 bits (217), Expect = 2e-17
 Identities = 64/216 (29%), Positives = 97/216 (44%), Gaps = 7/216 (3%)

Query: 17  ETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGLQILHCIEH 76
           + + + +G  + T FG  ++     D    AYT++ L  H D        GLQ+LHC+E+
Sbjct: 204 DAIARRIGPPRTTNFGTLFDVRAKPDPDSNAYTSIALPPHVDLPTREYQPGLQLLHCLEN 263

Query: 77  TNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERRHHFTHSAPVIQIDKNT 136
               GG+ +++DGF  A  L+E HPE +  LT          R        P+I++D N 
Sbjct: 264 DT-VGGDAVMMDGFAVAEALRERHPEHFATLTRVRWCYANTARTTDHVWFDPMIKLDANG 322

Query: 137 EDIKQIRFNVYDRSAMAFRSGRDCRLYYRSLKNLARYYENKENQWIFKLVPGLVMVIDNF 196
               ++R   + R  +      D    Y +L  L R     E    F   PG +++ DN 
Sbjct: 323 H-FDEVRIADFLRGPL-MAPFEDVEPAYAALMALQRLLREPEFALRFSYAPGDMVIFDNR 380

Query: 197 RLLHGRNGFT----GRRVLCGAYVSRSDWLDKARSL 228
           RLLH R+ F     GRR L G Y+ R +   + R L
Sbjct: 381 RLLHARDAFDVGQGGRRWLQGCYLERDEARSRLRML 416


>UniRef50_Q17KD9 Cluster: Epsilon-trimethyllysine 2-oxoglutarate
           dioxygenase; n=3; Culicidae|Rep: Epsilon-trimethyllysine
           2-oxoglutarate dioxygenase - Aedes aegypti (Yellowfever
           mosquito)
          Length = 466

 Score = 91.5 bits (217), Expect = 2e-17
 Identities = 63/230 (27%), Positives = 111/230 (48%), Gaps = 17/230 (7%)

Query: 12  SAEATETVCKAL----GGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAG 67
           +A  TE  C+ L    G I+ T +G  +        ++ AY + PL  H D  Y+    G
Sbjct: 230 NAPLTEQECRRLAERVGFIRKTHYGEEFIVKAKEGTSNVAYLSTPLQMHTDLPYYDYKPG 289

Query: 68  LQILHCIEHTNGTGGETILVDGFYGATCLKEDHPEDY----EFLTTYEIEGEYIERRHHF 123
             +LHC+  +   GG+ +L D FY A  ++ ++P+D+    E L  +   GE    + H 
Sbjct: 290 CNLLHCLVQSRSQGGQNLLADAFYVADLMRREYPKDFRLLSETLVNWTDIGEDEGGQFHS 349

Query: 124 THSAPVIQIDKNTEDIKQIRFNVYDRSAMAFRSGRD-CRLYYRSLKNLARYYENKENQWI 182
            + APVI + ++  ++++I  +V  R +  F    D  + +YR++    +    +  +  
Sbjct: 350 IYRAPVICVGRD-GNLERINHSVPQRDSF-FNVPLDRVQPWYRAMARFVKLIHQEAVE-- 405

Query: 183 FKLVPGLVMVIDNFRLLHGRNGFTGRRV----LCGAYVSRSDWLDKARSL 228
           FK +PG ++   N R++HGR G+T   V    + GAY+   +   K R L
Sbjct: 406 FKTMPGDILTFSNVRMVHGRTGYTDTEVNTRHIVGAYLDWDEIYSKLRVL 455


>UniRef50_Q1RPP2 Cluster: Gamma-butyrobetaine hydroxylase; n=2;
           Chromohalobacter salexigens|Rep: Gamma-butyrobetaine
           hydroxylase - Chromohalobacter salexigens
          Length = 407

 Score = 90.6 bits (215), Expect = 3e-17
 Identities = 63/231 (27%), Positives = 101/231 (43%), Gaps = 6/231 (2%)

Query: 1   YTFKSFNQVQPSAEATETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNI 60
           Y F   + V   A+  + +   +G ++ T +G   +  +VA+  D   T   L  H DN 
Sbjct: 159 YGFVKVSGVPCEADGMQPLIDRIGPLRRTNWGGIADVKSVANAFDLTMTQRGLEPHTDNP 218

Query: 61  YWTEAAGLQILHCIEHTNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERR 120
           Y     G   LHC+ +    GG++ L DGF  A  LK + PED+  LT       Y +  
Sbjct: 219 YRDPIPGYIWLHCLSNA-ADGGDSTLTDGFMAAQRLKAEAPEDFACLTRLSPRFRYTDAT 277

Query: 121 HHFTHSAPVIQIDKNTEDIKQIRFNVYDRSAMAFRSGRDCRLYYRSLKNLARYYENKENQ 180
                  P+I++D     + ++R++      +A         YY + +   R   ++   
Sbjct: 278 TDLESEGPLIELDSRGR-LARVRYS-NRTERIAAHDAALLERYYAARQRFYRLITDEALT 335

Query: 181 WIFKLVPGLVMVIDNFRLLHGRNGFT---GRRVLCGAYVSRSDWLDKARSL 228
              KL PG ++++DN+RLLHGR  +    G R L   YV R     + R L
Sbjct: 336 VHLKLGPGDMLIMDNYRLLHGRTAYQLEGGVRHLRQGYVDRDSTASRRRVL 386


>UniRef50_Q4V6I6 Cluster: IP11337p; n=6; Sophophora|Rep: IP11337p -
           Drosophila melanogaster (Fruit fly)
          Length = 421

 Score = 90.2 bits (214), Expect = 4e-17
 Identities = 65/207 (31%), Positives = 95/207 (45%), Gaps = 14/207 (6%)

Query: 23  LGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGLQILHCIEHTNGTGG 82
           +G I+ T +G  +         + AY +L L  H D  Y+     + ILHC+  T+  GG
Sbjct: 202 VGFIRRTTYGEEFVVQAKPGAQNFAYLSLTLPLHTDLPYYEYKPSVNILHCVVQTDSPGG 261

Query: 83  ETILVDGFYGATCLKEDHPEDYEFLTTYEIE----GEYIERRHHFTHSAPVIQIDKNTED 138
             +LVDGF+ A  L+ DHPED+E L+   ++    G    R  H    APVI +D+    
Sbjct: 262 SNMLVDGFHVADLLRRDHPEDFERLSRIVVDWNDIGSEDGREFHNIWRAPVICLDEEGR- 320

Query: 139 IKQIRFNVYDRSAMAFRSGRDCRLYYRSLKNLARYYENKENQWIFKLVPGLVMVIDNFRL 198
             +I  +V  R +       +   +Y S     R      +   FK  PG V+  +N RL
Sbjct: 321 YTRINHSVPQRDSHFNVPLEEVLPWYESYALFVRL--AIADSHAFKTRPGDVLTFNNIRL 378

Query: 199 LHGRNGF----TGRRVLCGAYVSRSDW 221
           LHGR G+       R + GAY+   DW
Sbjct: 379 LHGRTGYDDSEESPRYIVGAYL---DW 402


>UniRef50_Q1E7N7 Cluster: Putative uncharacterized protein; n=1;
           Coccidioides immitis|Rep: Putative uncharacterized
           protein - Coccidioides immitis
          Length = 461

 Score = 90.2 bits (214), Expect = 4e-17
 Identities = 63/232 (27%), Positives = 107/232 (46%), Gaps = 13/232 (5%)

Query: 1   YTFKSFNQVQPSAEATETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNI 60
           Y       V  S E+   +   +G ++++ +G TW+  +V +  + AYTN  L  H D +
Sbjct: 207 YGLAFVKNVPESTESVSQIATRMGPLRNSFYGLTWDVRSVPEAKNVAYTNKFLGFHMDLL 266

Query: 61  YWTEAAGLQILHCIEHTNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERR 120
           Y  +    Q+LHC+ ++   GGE++ VD F  A  L ED   D + L+   +   Y    
Sbjct: 267 YMADPPAYQLLHCMNNSL-PGGESMFVDTFRAAQRLSED---DRKTLSDVALHYGYFNDG 322

Query: 121 HHFTHSAPVIQIDKNTEDIKQIRFNVYDRSAMAFRSGRDCRLYYRSLKNLARYYENKENQ 180
             + +S P IQ+DK + D++ + ++   ++     +    +    SL+  +   ++ E  
Sbjct: 323 QSYEYSRPTIQLDK-SGDLEYVNYSPPFQAPHFTPADYPMQQLAESLRKFSDLLQDPEGM 381

Query: 181 WIFKLVPGLVMVIDNFRLLHGRNGF-----TGR---RVLCGAYVSRSDWLDK 224
           +  KL PG  ++  N R+ H R  F      GR   R L GAYV     L K
Sbjct: 382 FELKLRPGECVIFANRRVAHARRAFDLSQSDGRKRSRWLRGAYVDEDALLSK 433


>UniRef50_Q112B1 Cluster: Taurine catabolism dioxygenase TauD/TfdA;
           n=1; Trichodesmium erythraeum IMS101|Rep: Taurine
           catabolism dioxygenase TauD/TfdA - Trichodesmium
           erythraeum (strain IMS101)
          Length = 371

 Score = 89.4 bits (212), Expect = 7e-17
 Identities = 55/208 (26%), Positives = 108/208 (51%), Gaps = 7/208 (3%)

Query: 13  AEATETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGLQILH 72
           A+  +   +++G I +  +G      T  +  D A T   ++ H D  YW     L  L+
Sbjct: 150 AKDLQATIESIGPIYNGDYGLFAPSKTTNEGKDLAETGNAMSFHTDYTYWHTPPLLTSLY 209

Query: 73  CIEHTNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGE--YIERRHHFTHSAPVI 130
           C+E++  +GGE+++VDGF      ++ HP+ ++ LT   I+ +  Y + ++ ++ + P++
Sbjct: 210 CVENS-ASGGESLIVDGFRVVDDFRQQHPDYFQILTQTPIQFKQVYTKWQYFYSRTQPIL 268

Query: 131 QIDKNTEDIKQIRFNVYDRSAMAFRSGRD-CRLYYRSLKNLARYYENKENQWIFKLVPGL 189
           ++D   E  K  R N  +  +  ++   D    +Y +     +Y +N   ++ F L PG 
Sbjct: 269 ELD---EYGKVTRINFANSHSYTWKLPFDQMEEFYAAYITFFQYVKNPVYEYCFSLEPGD 325

Query: 190 VMVIDNFRLLHGRNGFTGRRVLCGAYVS 217
           ++++++ R++HGR  FTG R L  A VS
Sbjct: 326 LLLMNDSRIMHGRKAFTGNRHLEIACVS 353


>UniRef50_Q6C1G9 Cluster: Similar to DEHA0C03839g Debaryomyces
           hansenii; n=1; Yarrowia lipolytica|Rep: Similar to
           DEHA0C03839g Debaryomyces hansenii - Yarrowia lipolytica
           (Candida lipolytica)
          Length = 453

 Score = 88.6 bits (210), Expect = 1e-16
 Identities = 64/226 (28%), Positives = 105/226 (46%), Gaps = 15/226 (6%)

Query: 17  ETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGLQILHCIEH 76
           E V K +G I+ T +G +W+  +V +  + AYT+  L  H D +Y+    G+Q+LH I++
Sbjct: 227 EEVGKKIGYIKETFYGRSWDTRSVPNPKNVAYTSQYLPLHMDLLYYESPPGIQLLHVIKN 286

Query: 77  TNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERRHHFTHSAPVI----QI 132
               GGE+I  D F  A  + E +P  Y  L    +   YI    H+ ++ P+I    Q 
Sbjct: 287 -QAVGGESIFTDSFASAKYVWEKNPAAYRALCEIPLTFHYINDGQHYHNTVPMIVEHQQT 345

Query: 133 DKNT-EDIKQIRF-----NVYDRSAMAFRSGRDCRLYYRSLKNLARYYENKENQWIFKLV 186
           DK+   + K I +       +D   +    G  C L+   L+    +  + EN+   K+ 
Sbjct: 346 DKSKWTNPKAINYAPPFQGPFDAVELV-EGGEKCELFREGLRLFEEHLTSAENELRTKME 404

Query: 187 PGLVMVIDNFRLLHGRNGF---TGRRVLCGAYVSRSDWLDKARSLN 229
               ++  N R+LH R  F   +G R L G Y+    +  K R L+
Sbjct: 405 ENSCVLFLNRRVLHSRTEFDAQSGVRWLKGTYLDIDAFYSKLRVLS 450


>UniRef50_UPI000023D763 Cluster: hypothetical protein FG05953.1;
           n=1; Gibberella zeae PH-1|Rep: hypothetical protein
           FG05953.1 - Gibberella zeae PH-1
          Length = 494

 Score = 88.2 bits (209), Expect = 2e-16
 Identities = 54/206 (26%), Positives = 98/206 (47%), Gaps = 6/206 (2%)

Query: 23  LGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGLQILHCIEHTNGTGG 82
           +  I+ T +G T++     D  + AYT+  L  H D +Y      +Q+LHC+E+ +  GG
Sbjct: 227 IANIKETFYGRTFDVRAKPDAENVAYTSGYLGLHQDLLYLESPPAIQLLHCMEN-SCEGG 285

Query: 83  ETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERRHHFTHSAPVIQIDKNTEDIKQI 142
           E++  DG +    L          L    +   Y +  + +    P++++  N E++  +
Sbjct: 286 ESLFSDGLFAGKLLFLQSSPTIRNLWKVMVPYHYEKHGYFYHQRRPILELGPN-ENLAGV 344

Query: 143 RFNVYDRSAMAFRSGRDCRLYYRSLKNLARYYENKENQWIFKLVPGLVMVIDNFRLLHGR 202
            ++   +   +  +  D R +    K   R   N +  +  K+ PG  ++ DN R++HGR
Sbjct: 345 NWSPPFQDQFS-SAAVDAREWLEPAKLFDRMINNPDVMYEMKMEPGECVLFDNTRIMHGR 403

Query: 203 NGFT---GRRVLCGAYVSRSDWLDKA 225
           N F    G R L GAY+SR D++ +A
Sbjct: 404 NKFDVGGGSRWLRGAYISREDFVSRA 429


>UniRef50_A0Z404 Cluster: Gamma-butyrobetaine,2-oxoglutarate
           dioxygenase; n=1; marine gamma proteobacterium
           HTCC2080|Rep: Gamma-butyrobetaine,2-oxoglutarate
           dioxygenase - marine gamma proteobacterium HTCC2080
          Length = 416

 Score = 86.2 bits (204), Expect = 6e-16
 Identities = 72/230 (31%), Positives = 104/230 (45%), Gaps = 16/230 (6%)

Query: 11  PSAEA-TETVCKALGGIQHTIFGATWEFT---TVADHADT---AYTNLPLAAHNDNIYWT 63
           PS E     +   +G ++ + FGA W+     ++A  A T   A T L L  H D     
Sbjct: 180 PSEEGFLNKLAARIGPVRDSNFGALWDVVADISLAGDAKTNTTANTGLRLGPHTDLPTRE 239

Query: 64  EAAGLQILHCIEHTNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERRHHF 123
              G Q LHC+ +    GGE+ L DG      LK  HP DYE L+T      +  R    
Sbjct: 240 IPPGYQFLHCLIN-EADGGESTLTDGAALVQELKMHHPADYELLSTRR--WVFFNRGPGI 296

Query: 124 TH--SAPVIQIDKNTEDIKQIRFNVYDRSAMAFRSGRDCRLYYRSLKNLARYYENKENQW 181
            H  SAP+I        +  +R   Y   A       D    Y +L+   +  ++   + 
Sbjct: 297 DHRWSAPIIDTS-GAHALPTLRA-FYPVRAFPDMPECDVAEAYEALRRFHKLADDPRFEL 354

Query: 182 IFKLVPGLVMVIDNFRLLHGRNGF--TGRRVLCGAYVSRSDWLDKARSLN 229
            F+L  G +M  DN R++HGR GF  +G+R L G Y+ R + L +AR+LN
Sbjct: 355 TFRLGAGDIMCFDNRRVMHGRKGFSGSGKRHLQGVYIDRDEILSRARALN 404


>UniRef50_Q75A94 Cluster: ADR024Wp; n=1; Eremothecium gossypii|Rep:
           ADR024Wp - Ashbya gossypii (Yeast) (Eremothecium
           gossypii)
          Length = 472

 Score = 86.2 bits (204), Expect = 6e-16
 Identities = 65/242 (26%), Positives = 106/242 (43%), Gaps = 30/242 (12%)

Query: 17  ETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGLQILHCIEH 76
           +T+ + +G I+HT +G  ++    A   + AYTN+PL  H D +Y     G Q+LH I +
Sbjct: 225 KTIAERIGNIRHTFYGELFDVINKAGAENIAYTNVPLPLHMDLLYLETVPGWQLLHAIRN 284

Query: 77  TNGTG--GETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERRHHFTHSAPVIQ--- 131
           + G    G     D F+ A  ++E   + Y+ LT   +   Y      +  S PVI+   
Sbjct: 285 STGATDLGMNYFADAFHAARYVRETDSDAYDALTHMPVNYGYNRDDKRYYRSRPVIEEHE 344

Query: 132 -----------------IDKNTEDIKQIRFNVYDRSAMAFRSGRDCRL----YYRSLKNL 170
                            ++ +    K   + ++++   A  S    +L     +R  +  
Sbjct: 345 FGEGTSLSSQFNRLIKCVNYSPPFQKPFTYGIWEKPKGAEVSTPQGKLTERFVFRDFQRG 404

Query: 171 ARYYE----NKENQWIFKLVPGLVMVIDNFRLLHGRNGFTGRRVLCGAYVSRSDWLDKAR 226
            R +E    N ENQ+  KL  G  ++ +N R+LH R GFTG R L G Y+     L K +
Sbjct: 405 LRLFEQYINNPENQFRVKLPEGTCVIFNNRRILHARTGFTGERWLKGCYLDSDSVLSKLQ 464

Query: 227 SL 228
            L
Sbjct: 465 YL 466


>UniRef50_A0YX02 Cluster: Gamma-butyrobetaine hydroxylase, putative;
           n=2; Lyngbya sp. PCC 8106|Rep: Gamma-butyrobetaine
           hydroxylase, putative - Lyngbya sp. PCC 8106
          Length = 378

 Score = 85.4 bits (202), Expect = 1e-15
 Identities = 53/170 (31%), Positives = 97/170 (57%), Gaps = 11/170 (6%)

Query: 53  LAAHNDNIYWTEAAGLQILHCIEHTNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEI 112
           L+ H D  + +    +Q+L+C+E+   TGGE++LVDGF  A   ++ HP+ +E LT   +
Sbjct: 190 LSPHTDITFMSTPPLVQLLYCVENL-ATGGESVLVDGFKVARDFQQHHPQYFEILTKVPV 248

Query: 113 EGE--YIERRHHFTHSAPVIQIDKNTEDIKQIRFNVYDRSA-MAFRSGRDCRLYYRSLKN 169
           + E  Y E  ++ + + P+I+++++   +  I F+  + S+ + F    +   +Y + K 
Sbjct: 249 KFEQFYQEWEYYVSRTTPIIELEQDGL-VSGIYFSHKNFSSQLPFDQVEE---FYEAYKT 304

Query: 170 LARYYENKENQWIFKLVPGLVMVIDNFRLLHGRNGF---TGRRVLCGAYV 216
              Y +N   Q+ F+L PG  ++++NFR+LHGR  F   +G R L  AY+
Sbjct: 305 FFLYLKNPAYQYWFRLEPGDCLLVENFRVLHGRKAFNPNSGMRHLEVAYM 354


>UniRef50_Q9UVG4 Cluster: Putative uncharacterized protein; n=2;
           Pichia|Rep: Putative uncharacterized protein - Pichia
           farinosa (Yeast)
          Length = 436

 Score = 83.8 bits (198), Expect = 3e-15
 Identities = 59/227 (25%), Positives = 103/227 (45%), Gaps = 19/227 (8%)

Query: 19  VCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGLQILHCIEHTN 78
           + +  G I+ T +G  ++     +  + AYT+  L  H D +Y+    GLQ LH I+++ 
Sbjct: 209 LARKFGYIKETFYGTLFDVKNKDEAENIAYTDTFLPLHMDLLYYESPPGLQFLHFIKNST 268

Query: 79  GTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERRHHFTHSAPVIQIDKNTED 138
             GGE++  DGF  A  +KE  PE Y+ L    I   Y    H++ H  P++  + + E 
Sbjct: 269 -EGGESMFADGFAIARKVKEQDPEAYDALKKVLITYRYENNGHYYYHKRPLVVEETSWET 327

Query: 139 -------IKQIRFNVYDRSAMAFRSGRD-------CRLYYRSLKNLARYYENKENQWIFK 184
                  IK++ ++   +    +    D        +L+ R           ++NQ   K
Sbjct: 328 LDYASGIIKEVNYSPPFQGHFEYGIHGDKPKESALFKLFLRGYLLFESLANQEQNQLSLK 387

Query: 185 LVPGLVMVIDNFRLLHGRNGFT----GRRVLCGAYVSRSDWLDKARS 227
           +  G+ ++ DN R+LH R  F+    G+R L G YV    +  + R+
Sbjct: 388 VPAGVCVIFDNRRILHSRKSFSSSNGGQRWLMGCYVDGDSFRSRLRT 434


>UniRef50_P80193 Cluster: Gamma-butyrobetaine dioxygenase; n=13;
           Proteobacteria|Rep: Gamma-butyrobetaine dioxygenase -
           Pseudomonas sp. (strain AK-1)
          Length = 383

 Score = 83.4 bits (197), Expect = 4e-15
 Identities = 65/213 (30%), Positives = 100/213 (46%), Gaps = 14/213 (6%)

Query: 19  VCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGLQILHCIEHTN 78
           + K +  I+ + FG  ++  + AD    AYT   L  H D        GLQ LHC+ + +
Sbjct: 172 LAKRISFIRESNFGVLFDVRSKADADSNAYTAFNLPLHTDLPTRELQPGLQFLHCLVN-D 230

Query: 79  GTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERRHHFTHSAPVIQIDKNTED 138
            TGG +  VDGF  A  L+ + P  Y  L    +E    +R   +  +APVI +D + E 
Sbjct: 231 ATGGNSTFVDGFAIAEALRIEAPAAYRLLCETPVEFRNKDRHSDYRCTAPVIALDSSGE- 289

Query: 139 IKQIRFNVYDRSAMAFRSGR--DCRLYYRSLKNLARYYENKENQWIF--KLVPGLVMVID 194
           +++IR   + R+     + R  D  L YR    + R     E ++ F  +L  G +   D
Sbjct: 290 VREIRLANFLRAPFQMDAQRMPDYYLAYRRFIQMTR-----EPRFCFTRRLEAGQLWCFD 344

Query: 195 NFRLLHGRNGF---TGRRVLCGAYVSRSDWLDK 224
           N R+LH R+ F   +G R   G YV R + L +
Sbjct: 345 NRRVLHARDAFDPASGDRHFQGCYVDRDELLSR 377


>UniRef50_UPI0000E486C9 Cluster: PREDICTED: similar to LOC535630
           protein, partial; n=1; Strongylocentrotus
           purpuratus|Rep: PREDICTED: similar to LOC535630 protein,
           partial - Strongylocentrotus purpuratus
          Length = 90

 Score = 83.0 bits (196), Expect = 6e-15
 Identities = 35/84 (41%), Positives = 56/84 (66%)

Query: 144 FNVYDRSAMAFRSGRDCRLYYRSLKNLARYYENKENQWIFKLVPGLVMVIDNFRLLHGRN 203
           FN +DRS +++ S  D   +Y + +   R  E+ E++++ KL PG ++V DN+RLLHGR 
Sbjct: 1   FNPHDRSTLSYTSYEDAERFYAAYRTFGRIIESPESKFLTKLQPGRMVVFDNWRLLHGRA 60

Query: 204 GFTGRRVLCGAYVSRSDWLDKARS 227
           GFTG+RV+CG Y +  D+ +  R+
Sbjct: 61  GFTGKRVMCGCYFNYDDFQNLKRT 84


>UniRef50_Q9NF72 Cluster: EG:BACR7A4.9 protein; n=4; Sophophora|Rep:
           EG:BACR7A4.9 protein - Drosophila melanogaster (Fruit
           fly)
          Length = 504

 Score = 82.6 bits (195), Expect = 8e-15
 Identities = 63/220 (28%), Positives = 99/220 (45%), Gaps = 13/220 (5%)

Query: 19  VCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGLQILHCIEHTN 78
           + + +G I+ T +G  +E  +  +  + AY   PL  H D  Y+   AG+ ILH +  + 
Sbjct: 281 LAERIGYIKRTTYGDVFEVKSKPNARNYAYLMTPLPLHTDMPYYEYKAGINILHTLVQSE 340

Query: 79  GTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYI------ERRHHFTHSAPVIQI 132
             GG   L DGF  A+ L++ HPED+E L +  +    I       +  H    APVI +
Sbjct: 341 SKGGANTLTDGFNVASQLQKLHPEDFEVLKSVPVNWFDIGHDGDDSKPFHSLWRAPVICL 400

Query: 133 DKNTEDIKQIRFNVYDRSAMAFRSGRDCRLYYRSLKNLARYYENKENQWIFKLVPGLVMV 192
           D +     +I  N   R +    S      +Y++        +++  +  FK   G V V
Sbjct: 401 DVDGR-FARINQNTTKRDSRFSVSLAQAVSWYKAYDKFLEIAQSEAVE--FKTQAGDVFV 457

Query: 193 IDNFRLLHGRNGFT----GRRVLCGAYVSRSDWLDKARSL 228
            +N R+LHGR  +      +R L GAYV       K R+L
Sbjct: 458 FNNLRMLHGRTAYEDAPGNKRHLVGAYVDWDIIYSKLRTL 497


>UniRef50_UPI0000587DDD Cluster: PREDICTED: hypothetical protein;
           n=1; Strongylocentrotus purpuratus|Rep: PREDICTED:
           hypothetical protein - Strongylocentrotus purpuratus
          Length = 395

 Score = 81.8 bits (193), Expect = 1e-14
 Identities = 54/182 (29%), Positives = 91/182 (50%), Gaps = 8/182 (4%)

Query: 29  TIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGLQILHCIEHTNGTGGETILVD 88
           TI+G  ++  T+ D    AYT   L  H D   +    G+Q+LHCI+  +  GG+   VD
Sbjct: 175 TIYGDIFDVITMYDACSLAYTAKQLGLHVDLPGFYNTPGVQMLHCIKQVDSEGGDNEFVD 234

Query: 89  GFYGATCLKEDHPEDYEFLTTYEIE-----GEYIERRHHFTHSAPVIQIDKNTEDIKQIR 143
           G   A  L++++P+  + LT  +++      EY+   +H     PVI+ D++    + I 
Sbjct: 235 GLRVAEQLEQEYPKILQTLTRMKVDFRTLGAEYVP--YHTMTQRPVIEYDQDGV-FQGIN 291

Query: 144 FNVYDRSAMAFRSGRDCRLYYRSLKNLARYYENKENQWIFKLVPGLVMVIDNFRLLHGRN 203
           +N   R+        +    YR+LK   R   ++ N   +K+  G +++ DN R+LHGR 
Sbjct: 292 YNDGVRAPYWSLPVEEITEGYRALKTFHRAMYDERNCIYYKMEKGDMVIFDNRRVLHGRL 351

Query: 204 GF 205
           GF
Sbjct: 352 GF 353


>UniRef50_A0YLG8 Cluster: Gamma-butyrobetaine hydroxylase; n=1;
           Lyngbya sp. PCC 8106|Rep: Gamma-butyrobetaine
           hydroxylase - Lyngbya sp. PCC 8106
          Length = 358

 Score = 79.0 bits (186), Expect = 9e-14
 Identities = 57/216 (26%), Positives = 104/216 (48%), Gaps = 16/216 (7%)

Query: 14  EATETVCKALGGIQHTIFGATWEFTTVADHADT--AYTNLPLAAHNDNIYWTEAAGLQIL 71
           E  E+   ++G I +  +G      T     ++  +    PL  H+D  YW     ++ L
Sbjct: 141 EKLESFLSSIGPIFNADYGTIMPLETRDKTTESLPSRDGCPLPPHHDLSYWGGHRLVEFL 200

Query: 72  HCIEHTNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERRH--HFTHSAPV 129
           +C+E+ N +GGE+ LVDGF  A    +D+P+ Y+ L    ++   +++ H   F + A +
Sbjct: 201 YCVENQN-SGGESTLVDGFQVAQDFSQDYPQYYQTLLETPVQFWLVDKTHQYRFCNIATI 259

Query: 130 IQIDKNTEDIKQIRFNVYD-RSAMAFRSGRDCRLYYRSLKNLARYYENKENQWIFKLVPG 188
           ++ D+   ++  +RF+  + R  + F    D   +Y++      Y +  + +  F+L   
Sbjct: 260 LECDR-YGNLTTVRFSKRNCRPHLPFEQLED---FYQAYHTFFHYLKKNDYKHQFQLRSH 315

Query: 189 LVMVIDNFRLLHGRNGF---TGRRVLCGAYVSRSDW 221
             ++  NFR+LHGR  F    G+R L   YV   DW
Sbjct: 316 NCLLFQNFRILHGRTAFDPALGKRKLNSGYV---DW 348


>UniRef50_A7SHP2 Cluster: Predicted protein; n=4; Nematostella
           vectensis|Rep: Predicted protein - Nematostella
           vectensis
          Length = 385

 Score = 78.6 bits (185), Expect = 1e-13
 Identities = 56/209 (26%), Positives = 94/209 (44%), Gaps = 13/209 (6%)

Query: 22  ALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGLQILHCIEHTNGTG 81
           A+G ++ T +G++    +       AYT   L  H D  Y+     + +LHCI+    +G
Sbjct: 166 AVGFLRKTFYGSSVALRSEPQARSLAYTGYELQPHTDLPYYEFKPSVILLHCIDQVRSSG 225

Query: 82  GETILVDGFYGATCLKEDHPEDYEFLTT----YEIEG-EYIERRHHFTHSAPVIQIDKNT 136
           GE   VDG+      + D+P+ ++ L +    + ++G E          + P+I++D   
Sbjct: 226 GENTFVDGYSILKAFRNDNPDGFDLLASTPVLHRVKGVEPTYGEFEQLFARPIIELDVKG 285

Query: 137 EDIKQIRFNVYDRSAMAFRSGRDCRLYYRSLKNLARYYENKENQWIFKLVPGLVMVIDNF 196
             I++I FN   R              YR+   L + +   +     K+ PG +  IDN 
Sbjct: 286 R-IRRINFNDPLREEFLDTPAEQIPKVYRAYHKLTQMFYEPKFIVRNKMAPGDICAIDND 344

Query: 197 RLLHGRNGFTGR----RVLCGAYVSRSDW 221
           RLLHGR+ F  +    R+L  AY+   DW
Sbjct: 345 RLLHGRSAFEVKSDDLRLLEQAYI---DW 370


>UniRef50_Q5A0G4 Cluster: Potential gamma-butyrobetaine hydroxylase;
           n=1; Candida albicans|Rep: Potential gamma-butyrobetaine
           hydroxylase - Candida albicans (Yeast)
          Length = 407

 Score = 78.6 bits (185), Expect = 1e-13
 Identities = 57/217 (26%), Positives = 93/217 (42%), Gaps = 13/217 (5%)

Query: 19  VCKALGGIQHTIFGATWEFTTVADHA-DTAYTNLPLAAHNDNIYWTEAAGLQILHCIEHT 77
           + +  G I+ T +G  ++     + A + AYTN  L  H D +Y+    GLQ+LH I+++
Sbjct: 187 IAEKFGYIKKTFYGTLFDVKNKKEKATNIAYTNTFLPLHMDLLYYESPPGLQLLHAIQNS 246

Query: 78  NGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERRHHFTHSAPVIQIDKNTE 137
              GGE I  D +  A  +++  P  Y  LT   I   Y     ++ +  P+I  D    
Sbjct: 247 T-LGGENIFCDSYLAAEHVRKTDPRAYTALTQTPITFHYDNNNEYYYYKRPLIVEDPEVG 305

Query: 138 D----IKQIRFNVYDRSAMAFRSGRDCRLYYRSLKNLARYYENKENQWIFKLVPGLVMVI 193
           D    I  I +    +         D   + R ++    +  +  N +  K+  G  ++ 
Sbjct: 306 DGFPKIASINYAPPFQGPFEVDPHPD---FIRGMQLFETFINDPANHFEIKMPEGTCVIF 362

Query: 194 DNFRLLHGRNGFT----GRRVLCGAYVSRSDWLDKAR 226
           +N R LH RN F+    G R L G YV    +  K R
Sbjct: 363 ENRRALHSRNAFSDSNNGDRWLMGTYVDGDSFRSKLR 399


>UniRef50_Q0UUH0 Cluster: Putative uncharacterized protein; n=1;
           Phaeosphaeria nodorum|Rep: Putative uncharacterized
           protein - Phaeosphaeria nodorum (Septoria nodorum)
          Length = 490

 Score = 78.6 bits (185), Expect = 1e-13
 Identities = 63/242 (26%), Positives = 101/242 (41%), Gaps = 15/242 (6%)

Query: 1   YTFKSFNQVQPSAEATETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNI 60
           Y       V  S  +   +   +G ++ T +G TW+  + A   + AYT   L  H D +
Sbjct: 231 YGLLFLRNVPDSETSVVDLASRIGTLKDTFYGRTWDVRSKAKAENIAYTPQFLGLHMDLL 290

Query: 61  YWTEAAGLQILHCIEHTNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERR 120
           Y +    LQ LH +      GGE+   D F+ A  L+      +  L T+ +  +Y    
Sbjct: 291 YTSNPPHLQFLHSL-RARCPGGESFFSDSFHAAHQLQRRSAFHFRTLCTFPVTYQYHHPT 349

Query: 121 HHFTHSAPVIQI-------DKNTEDIKQIRFNVYDRSAMAFRSGRDCRLYYRSLKNLARY 173
            H+  + PVI +       D     I ++ ++   +     R G +     RS    +  
Sbjct: 350 FHYHFTRPVIDLHPYPKYSDPTLLPIHRVNWSPPFQGPFEARIGSNNANSLRSFVAASHA 409

Query: 174 YE----NKENQWIFKLVPGLVMVIDNFRLLHGRNGF---TGRRVLCGAYVSRSDWLDKAR 226
           YE    ++EN + ++L  G  ++ DN R+LH R  F    G R L GAYV    +  K R
Sbjct: 410 YEKLISSEENLYEYRLNEGECVIFDNRRVLHARKAFDASKGERWLKGAYVDDDVFFSKLR 469

Query: 227 SL 228
            L
Sbjct: 470 VL 471


>UniRef50_Q1GF28 Cluster: Gamma-butyrobetaine2-oxoglutarate
           dioxygenase; n=4; Rhodobacteraceae|Rep:
           Gamma-butyrobetaine2-oxoglutarate dioxygenase -
           Silicibacter sp. (strain TM1040)
          Length = 382

 Score = 78.2 bits (184), Expect = 2e-13
 Identities = 66/227 (29%), Positives = 102/227 (44%), Gaps = 17/227 (7%)

Query: 11  PSAEATET-VCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGLQ 69
           P ++A  T   + +G ++ T FG  ++  T  +  +TAYT   L  H D      A G+Q
Sbjct: 156 PDSDAALTQTAELMGFVRPTFFGTYFDVKTHINPTNTAYTAGALELHTDTPAEEFAPGIQ 215

Query: 70  ILHCIEHTNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERRHHFTHSAP- 128
            LHC  +T   GGE++  DG   A   ++  PE +  L+   I   Y E   +   S   
Sbjct: 216 FLHCRINT-VDGGESLYADGVAVANDFRKRDPEGFRLLSEVPIP-FYCEHDTYDARSRQY 273

Query: 129 VIQIDKNTE----DIKQIRFNVYDRSAMAFRSGRDCRLYYRSLKNLARYYENKENQWIFK 184
           VI++D++ E     I Q   +++D              YY +     R  + ++    F 
Sbjct: 274 VIELDQHGEVEGLTISQHMADIFDLDQKLLDD------YYPAFCRFGRMLQEEKYMMRFL 327

Query: 185 LVPGLVMVIDNFRLLHGRNGFT---GRRVLCGAYVSRSDWLDKARSL 228
           +  G  MV DN R++HGR  +T   G R L G YV RS+     R+L
Sbjct: 328 MKGGECMVFDNHRIVHGRAAYTASSGDRYLRGCYVDRSEMRSTYRAL 374


>UniRef50_Q6CQT2 Cluster: Similar to sp|P23180 Saccharomyces
           cerevisiae YHL021c singleton; n=1; Kluyveromyces
           lactis|Rep: Similar to sp|P23180 Saccharomyces
           cerevisiae YHL021c singleton - Kluyveromyces lactis
           (Yeast) (Candida sphaerica)
          Length = 420

 Score = 77.0 bits (181), Expect = 4e-13
 Identities = 53/214 (24%), Positives = 91/214 (42%), Gaps = 9/214 (4%)

Query: 17  ETVCKALGGIQHTIFGATWEFTTVADHADT-AYTNLPLAAHNDNIYWTEAAGLQILHCIE 75
           + +C+ +G ++ T +G  ++    A  A+  AYT  PL  H D +Y     G Q+LHCI+
Sbjct: 201 QMICERIGHVRTTFYGELFDVKNQASQANNIAYTAKPLPLHMDLLYLENIPGWQLLHCIK 260

Query: 76  HTNG--TGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERRHHFTHSAPVIQID 133
           ++ G    G+   VD       +K   P   + L T  I   Y      +    P+++  
Sbjct: 261 NSEGLEENGQNYFVDSLGALNYIKNKDPSVLKALETIPITYHYRRDDKRYYQQRPLVE-H 319

Query: 134 KNTEDIKQIRFNVYDRSAMAFRSGRDCRL---YYRSLKNLARYYENKENQWIFKLVPGLV 190
           K  E +  + ++   +     +   D  L   + + L     Y  + +NQ+  KL     
Sbjct: 320 KKYETV--VNYSPPFQGPFNLKDITDIPLLNQFKKGLYMFEEYINDPKNQFQIKLPENSC 377

Query: 191 MVIDNFRLLHGRNGFTGRRVLCGAYVSRSDWLDK 224
           ++  N R+LH R  F G R L G Y+    +  K
Sbjct: 378 VIFHNRRILHARRQFDGERWLKGCYLDADTFSSK 411


>UniRef50_Q19000 Cluster: Probable gamma-butyrobetaine dioxygenase;
           n=3; Rhabditida|Rep: Probable gamma-butyrobetaine
           dioxygenase - Caenorhabditis elegans
          Length = 421

 Score = 72.9 bits (171), Expect = 6e-12
 Identities = 65/224 (29%), Positives = 102/224 (45%), Gaps = 22/224 (9%)

Query: 15  ATETVCKALGGIQHTIFGATWEFTTVADHADTAY-TNLPLAAHNDNIYWTEAAGLQILHC 73
           A E +   +G I+ T FG  +E +  AD ++ AY +N  L  H D    +    LQ+LH 
Sbjct: 180 AVEAIGDRIGMIKRTHFGLVFEVSLKADASNMAYASNGGLPFHTDFPSLSHPPQLQMLHM 239

Query: 74  IEHTNGTGGETILVDGFYGATCLKEDHPEDYEFLTT------------YEIEGEYIERRH 121
           ++     GG ++ VDGF+ A  L+ + PE ++ LTT            +EI G+ I   +
Sbjct: 240 LQSAE-EGGHSLFVDGFHVAEQLRVEKPEIFKILTTQSMEYIEEGYDVHEINGKTIRFDY 298

Query: 122 HFTHSAPVIQIDKNTEDIKQIRFNVYDRSAMAFRSGRDCRLYYRSLKNLARYYENKENQW 181
                  VI+++ + + + +I+F    RS          +  YR++K    Y     N  
Sbjct: 299 DMCARHKVIRLNDDGK-VNKIQFGNAMRSWFYDCEPSKVQDVYRAMKTFTEYCYQPRNML 357

Query: 182 IFKLVPGLVMVIDNFRLLHGRNGFTG----RRVLCGAYVSRSDW 221
            F+L  G  ++  N RLLH R+GF       R L G Y    DW
Sbjct: 358 KFRLEDGDTVLWANQRLLHTRDGFRNAPEKARTLTGCYF---DW 398


>UniRef50_A3LY61 Cluster: Gamma-butyrobetaine dioxygenase; n=3;
           Saccharomycetaceae|Rep: Gamma-butyrobetaine dioxygenase
           - Pichia stipitis (Yeast)
          Length = 453

 Score = 71.7 bits (168), Expect = 1e-11
 Identities = 60/228 (26%), Positives = 96/228 (42%), Gaps = 26/228 (11%)

Query: 24  GGIQHTIFGATWEFTTVADHA-DTAYTNLPLAAHNDNIYWTEAAGLQILHCIEHTNGTGG 82
           G I+ T +G  ++     + A + A TN  L  H D +Y+    GLQ+LH I+++  TGG
Sbjct: 217 GYIKKTFYGTLFDVKNEKEEAKNIANTNTFLPLHMDLLYYESPPGLQLLHFIKNST-TGG 275

Query: 83  ETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERRHHFTHSAPVI--QIDKNTEDIK 140
           E +  D F  A  +K   P  Y  LT   I   Y     H+    P++  ++  +T  IK
Sbjct: 276 ENVFCDSFLAAEHVKNVDPTAYVALTLVPITYHYDNNNEHYFFKRPLVVEEVKGDTARIK 335

Query: 141 QIRF--------------NVYDRSAMAFRSGRDCRL----YYRSLKNLARYYENKENQWI 182
           ++ +              N  +R  +         L    + R  +    +  +  N + 
Sbjct: 336 EVNYAPPFQGPFEFGITRNDSEREGLFLAKDTTDGLLFQDFIRGFQLFEDFINDPVNHYE 395

Query: 183 FKLVPGLVMVIDNFRLLHGRNGFT----GRRVLCGAYVSRSDWLDKAR 226
            K+  G  ++ DN R+LH R GF+    G R L G YV    +  K R
Sbjct: 396 IKMPEGSCVIFDNRRVLHSRLGFSDSNGGDRWLMGTYVDGDSFRSKLR 443


>UniRef50_Q5KP77 Cluster: Mitochondrion protein, putative; n=2;
           Filobasidiella neoformans|Rep: Mitochondrion protein,
           putative - Cryptococcus neoformans (Filobasidiella
           neoformans)
          Length = 575

 Score = 71.3 bits (167), Expect = 2e-11
 Identities = 38/115 (33%), Positives = 57/115 (49%), Gaps = 4/115 (3%)

Query: 19  VCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGLQILHCIEHTN 78
           V   +G I++T +G TW+  +V    + AYTNL L  H D +Y++     Q LHC+ +  
Sbjct: 319 VTDMIGKIRNTFYGETWDVKSVKQSKNIAYTNLNLGLHMDLLYFSSPPRFQALHCLRN-K 377

Query: 79  GTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERRHHFTHSAPVIQID 133
             GG +  VD F   + L  D    +EFL    I  +Y    H+F +  P+I  D
Sbjct: 378 VEGGSSYFVDSFRTVSDLPRD---QFEFLQKINITYQYDNDNHYFRYRHPIISSD 429


>UniRef50_Q4P2H1 Cluster: Putative uncharacterized protein; n=1;
            Ustilago maydis|Rep: Putative uncharacterized protein -
            Ustilago maydis (Smut fungus)
          Length = 1527

 Score = 68.5 bits (160), Expect = 1e-10
 Identities = 49/214 (22%), Positives = 94/214 (43%), Gaps = 20/214 (9%)

Query: 11   PSAEATETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGLQI 70
            P +     +   +G +++T +G  W+  + A   + AYTNL L  H D +Y+      Q 
Sbjct: 874  PQSARLRELANIMGELRNTFYGLLWDVRSKAGARNIAYTNLDLGLHMDLLYFQNPPRFQF 933

Query: 71   LHCIEHTNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERRHHFTHSAPVI 130
            LH + +    GG +I VD F  A  + E+  E +E LT   +   Y+    H+  + P  
Sbjct: 934  LHMLRN-KVRGGASIFVDSFKVAERMWEEDRELWEVLTKVPVGFHYVNDGRHYRFTHPTF 992

Query: 131  QIDKNTED-------------IKQIRFNVYDRSAMA------FRSGRDCRLYYRSLKNLA 171
            ++  +TE              +  + ++   +S +        ++    + +Y +LK  +
Sbjct: 993  ELAHDTEGHAGGPLAGTTMPRLSAVNYSPPFQSPIPLHPTKHLKTPEQRQTFYLALKRFS 1052

Query: 172  RYYENKENQWIFKLVPGLVMVIDNFRLLHGRNGF 205
                 +E ++  ++  G  ++ DN R+LH R GF
Sbjct: 1053 DLTLAEEFRYEKQMEEGECVIFDNRRVLHSRKGF 1086


>UniRef50_A0YHS0 Cluster: Gamma-butyrobetaine,2-oxoglutarate
           dioxygenase; n=1; marine gamma proteobacterium
           HTCC2143|Rep: Gamma-butyrobetaine,2-oxoglutarate
           dioxygenase - marine gamma proteobacterium HTCC2143
          Length = 394

 Score = 66.9 bits (156), Expect = 4e-10
 Identities = 60/216 (27%), Positives = 88/216 (40%), Gaps = 9/216 (4%)

Query: 19  VCKALGGIQHTIFGATWEFTTVA---DHADTAYTNLPLAAHNDNIYWTEAAGLQILHCIE 75
           V   +G  + T FG  W         +   TA T   L  H D        G Q LHC+E
Sbjct: 176 VTNRIGAQRDTNFGLAWSVKAEILGNEENSTANTPFRLGPHTDLPTREIPPGYQFLHCLE 235

Query: 76  HTNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERRHHFTHSAPVIQIDKN 135
           +T  TGG   + DG   A  LKE+ P+ ++ L +         R H    S P++    +
Sbjct: 236 NTV-TGGFATMADGEAIARHLKEEEPKIHQALASLNWIFFNRSRDHDHRWSGPMLDYGVS 294

Query: 136 TEDIKQIRFNVYDRSAMAFRSGRDCRLYYRSLKNLARYYENKENQWIFKLVPGLVMVIDN 195
              +    F  Y   A    +  D    YR+++   +   + + Q  +    G ++  DN
Sbjct: 295 QAPLSIRAF--YPVRAFPDMADEDVGRAYRAVRRFHQLAADPQFQISYPYQSGDLIGFDN 352

Query: 196 FRLLHGRNGF---TGRRVLCGAYVSRSDWLDKARSL 228
            RLLHGR+ F    GRR L G YV   +   + R L
Sbjct: 353 RRLLHGRDSFDPGAGRRHLRGTYVDHDEIHSRLRIL 388


>UniRef50_UPI0000586629 Cluster: PREDICTED: similar to gamma
           butyrobetaine hydroxylase; n=1; Strongylocentrotus
           purpuratus|Rep: PREDICTED: similar to gamma
           butyrobetaine hydroxylase - Strongylocentrotus
           purpuratus
          Length = 181

 Score = 66.5 bits (155), Expect = 5e-10
 Identities = 40/111 (36%), Positives = 61/111 (54%), Gaps = 9/111 (8%)

Query: 48  YTNLPLAAHNDNIYWTEAAGLQILHCIEHTNGTGGETILVDGFYGATCLKEDHPEDYEFL 107
           ++N  L  H D  Y+    G+ + HCI++T+  GG+++L+DGF  A  LK DH + +  L
Sbjct: 11  FSNGYLMPHTDFSYYVSPPGVALFHCIKNTSTVGGDSLLIDGFKAAMELKTDHHDAFNML 70

Query: 108 TTYEIEGEYIE-----RRHHFTHSA--PVIQIDKNTEDIKQIRFN-VYDRS 150
           T +EIE   I       +  F H A  P+I++D     +KQI FN +Y  S
Sbjct: 71  TKHEIEHVAISVNKKITQQGFYHHARHPLIRLDA-LGQLKQITFNEIYHAS 120


>UniRef50_Q097J3 Cluster: Gamma-butyrobetaine dioxygenase; n=1;
           Stigmatella aurantiaca DW4/3-1|Rep: Gamma-butyrobetaine
           dioxygenase - Stigmatella aurantiaca DW4/3-1
          Length = 357

 Score = 63.7 bits (148), Expect = 4e-09
 Identities = 46/182 (25%), Positives = 78/182 (42%), Gaps = 4/182 (2%)

Query: 38  TTVADHADTAYTNLPLAAHNDNIYWTEAAGLQILHCIEHTNGTGGETILVDGFYGATCLK 97
           TT  +     YT+  +  H D  +       Q+LH  +    TGG   +VDG   A  L 
Sbjct: 179 TTNRNTDQLGYTDSAVQLHTDQPFLDRPPRYQLLHS-QRPAETGGANFVVDGLAAARYLS 237

Query: 98  EDHPEDYEFLTTYEIEGEYIERRHHFTHSAPVIQIDKNTEDIKQIRFNVYDRSAMAFRSG 157
                 +E L T  +     ++       +P++  D       +IR++ Y   A   R  
Sbjct: 238 GLDRPAFELLRTVPVTFHRKQKSFERVLVSPILDFD--APGGFRIRYS-YFTLAPHQRPF 294

Query: 158 RDCRLYYRSLKNLARYYENKENQWIFKLVPGLVMVIDNFRLLHGRNGFTGRRVLCGAYVS 217
            +   +YR+    A+   ++ +Q+ F L  G  ++ DN+R+LH R  FTG R + G Y  
Sbjct: 295 AEMEAWYRAYNRFAKLVRDERHQYRFLLQTGDFLIYDNWRMLHARTSFTGARWVRGVYFD 354

Query: 218 RS 219
           ++
Sbjct: 355 KA 356


>UniRef50_Q1YSL8 Cluster: Gamma-butyrobetaine hydroxylase; n=1;
           gamma proteobacterium HTCC2207|Rep: Gamma-butyrobetaine
           hydroxylase - gamma proteobacterium HTCC2207
          Length = 366

 Score = 63.3 bits (147), Expect = 5e-09
 Identities = 47/205 (22%), Positives = 94/205 (45%), Gaps = 5/205 (2%)

Query: 13  AEATETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGLQILH 72
           + + E +   LG I+  +F      +      + A+T+L +  HND   ++    +Q LH
Sbjct: 146 SNSLEMLSTRLGPIREVLFERIHNVSIDTHVYNIAHTSLEVPPHNDFASYSWPPSVQALH 205

Query: 73  CIEHTNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERRHHFTHSAPVIQI 132
            + +    GGE+++VDG+     L+ D+P  ++ L ++ +     +  +      P++++
Sbjct: 206 MLAN-ECEGGESMIVDGYSVLNDLQNDNPNLFKILCSFPVPFREFDEENETYTKEPIVRL 264

Query: 133 DKNTEDIKQIRFNVYDRSAMAFRSGRDCRLYYRSLKNLARYYENKENQWIFKLVPGLVMV 192
           +   + I   RF+      M         L+Y +   L     +K+ +  F+L  G +++
Sbjct: 265 NSQNK-ITGFRFS-NQLMQMIDPIEDTLDLFYMAYHELCNRINSKKYKSKFRLESGHILL 322

Query: 193 IDNFRLLHGRNGF--TGRRVLCGAY 215
           +   R+LHGR  F   G+R L  AY
Sbjct: 323 VHGHRVLHGRCEFQPDGKRHLQDAY 347


>UniRef50_UPI0000E47204 Cluster: PREDICTED: similar to
           Gamma-butyrobetaine hydroxylase subfamily, putative,
           partial; n=1; Strongylocentrotus purpuratus|Rep:
           PREDICTED: similar to Gamma-butyrobetaine hydroxylase
           subfamily, putative, partial - Strongylocentrotus
           purpuratus
          Length = 124

 Score = 62.1 bits (144), Expect = 1e-08
 Identities = 35/115 (30%), Positives = 59/115 (51%), Gaps = 5/115 (4%)

Query: 19  VCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGLQILHCIEHTN 78
           V   +G +  T++G T++  T     + AY+++ L  H D  Y+    GLQ+LHC++  +
Sbjct: 1   VANMIGPVTETLYGHTFDVQTEDKPINVAYSSVGLGFHVDLAYYESPPGLQLLHCLQFDD 60

Query: 79  GT-GGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERRHHFTHSAPVIQI 132
              GGE+I +DGF  A   +E  P  +E LT   +   +  ++ HF  + P   I
Sbjct: 61  MVLGGESIFLDGFCIAEEFREKFPHHFETLT--RVPATF--QKFHFERANPAAYI 111


>UniRef50_Q7S3G2 Cluster: Putative uncharacterized protein
           NCU06891.1; n=1; Neurospora crassa|Rep: Putative
           uncharacterized protein NCU06891.1 - Neurospora crassa
          Length = 1261

 Score = 58.8 bits (136), Expect = 1e-07
 Identities = 30/91 (32%), Positives = 49/91 (53%), Gaps = 1/91 (1%)

Query: 17  ETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGLQILHCIEH 76
           E +   +G + HT +G TW+  +     + AYTN+ L  H D +Y      LQ+LHCI +
Sbjct: 887 EKIANRIGILMHTFYGFTWDVRSKPRAENVAYTNVFLGLHQDLMYIDPPPRLQLLHCISN 946

Query: 77  TNGTGGETILVDGFYGATCLKEDHPEDYEFL 107
            +  GGE++  DG   A  L+ ++P  ++ L
Sbjct: 947 -SFQGGESLFSDGARAAYSLELNNPLAFDQL 976



 Score = 34.3 bits (75), Expect = 2.6
 Identities = 15/58 (25%), Positives = 29/58 (50%)

Query: 148  DRSAMAFRSGRDCRLYYRSLKNLARYYENKENQWIFKLVPGLVMVIDNFRLLHGRNGF 205
            + + +    G++   +  + K   R    +EN +  K+  G  ++ DN+R+LHGR  F
Sbjct: 1055 EHAVVVEEKGKNMTKWVPAAKEFEREISAEENMFELKMKEGECVIFDNWRVLHGRREF 1112


>UniRef50_UPI0000587A47 Cluster: PREDICTED: hypothetical protein,
           partial; n=1; Strongylocentrotus purpuratus|Rep:
           PREDICTED: hypothetical protein, partial -
           Strongylocentrotus purpuratus
          Length = 311

 Score = 58.4 bits (135), Expect = 1e-07
 Identities = 47/174 (27%), Positives = 78/174 (44%), Gaps = 12/174 (6%)

Query: 19  VCKALGGIQHTIFGAT-WEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGLQILHCIEHT 77
           +C    G    I+G++ W F     H        PL  H+D  +  ++ G+  +HCI   
Sbjct: 145 ICGTSYGTTSQIYGSSDWAF----GHPSYGLEYRPL--HSDYSFIDDSHGVFAMHCISQI 198

Query: 78  NGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGE---YIERRHHFTHSAPV-IQID 133
           +G GGE   VDG+  A  L++ +PE ++ LT    EG    ++    H+ HS    I+++
Sbjct: 199 DGKGGEYFFVDGYKAAQDLQKTNPEAFQRLTKPCWEGRVKAWVVNDKHYHHSVSAPIKLN 258

Query: 134 KNTEDIKQIRFNVYDRSAMAFRSGRDCRLYYRSLKNLARYYENKENQWIFKLVP 187
           +  E  K    + +  S M    G  C   YR+L+ L      K++ +   L P
Sbjct: 259 EAGEVTKLTLCDFWRTSVMRLPVGEVCDT-YRALRALKDLLYRKDHTFCHNLQP 311


>UniRef50_A7SHP3 Cluster: Predicted protein; n=1; Nematostella
           vectensis|Rep: Predicted protein - Nematostella
           vectensis
          Length = 438

 Score = 57.6 bits (133), Expect = 2e-07
 Identities = 60/217 (27%), Positives = 90/217 (41%), Gaps = 14/217 (6%)

Query: 23  LGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNI--YWTEAAGLQILHCIEHTNGT 80
           LG ++   FG T+         DT  +   +A        Y     GL ++HC +   G 
Sbjct: 206 LGYLKTMWFGETYPVVNKIGSEDTGASPASIACVPKATCPYKEYRGGLHMIHCRQELGGE 265

Query: 81  GGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERRH----HFTHSAPVIQIDKNT 136
           GGE   VDGF  A  LK++ PE++  L T  I  +     H      + S PVI++D   
Sbjct: 266 GGEHTFVDGFNIAKQLKKEDPENFNRLCTARILYQRKVTNHDADLKMSFSHPVIRLDDKG 325

Query: 137 EDIKQIRFNVYDRSAMAFRSGRDCRLYYRSLKNLARYYENKENQWIFKLVPGLVMVIDNF 196
             +K+I  +   R +       +  L YR+    +R   N       KL  G ++ +D  
Sbjct: 326 R-LKRIICSEKYRVSFVNVPPNEVPLLYRAYFQFSRLIRNPGYTIKHKLREGDIITMDTD 384

Query: 197 RLLHGRNGF----TGRRVL-CGAYVSRSDWLDKARSL 228
           R+L GR+ +    +G  V  CG   S  D    AR L
Sbjct: 385 RVLCGRDAYGPEVSGTEVFECG--FSDKDMTSSARRL 419


>UniRef50_Q2HBR6 Cluster: Putative uncharacterized protein; n=1;
          Chaetomium globosum|Rep: Putative uncharacterized
          protein - Chaetomium globosum (Soil fungus)
          Length = 176

 Score = 56.8 bits (131), Expect = 4e-07
 Identities = 28/77 (36%), Positives = 40/77 (51%), Gaps = 1/77 (1%)

Query: 9  VQPSAEATETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGL 68
          V     A E V   +G +Q T +G TW+    A   + AYT+  L  H D +Y     GL
Sbjct: 24 VPADESAVERVASRIGPVQETFYGRTWDVRNKARAENVAYTDKFLCLHQDLMYHDPVPGL 83

Query: 69 QILHCIEHTNGTGGETI 85
          Q+LHC+ +T   GGE++
Sbjct: 84 QLLHCLANT-CEGGESL 99


>UniRef50_A0P0W2 Cluster: Putative uncharacterized protein; n=1;
           Stappia aggregata IAM 12614|Rep: Putative
           uncharacterized protein - Stappia aggregata IAM 12614
          Length = 272

 Score = 54.0 bits (124), Expect = 3e-06
 Identities = 47/172 (27%), Positives = 79/172 (45%), Gaps = 14/172 (8%)

Query: 42  DHADTAYTNLPLAAHNDNIYWTEAAGLQILHCIEHTNGTGGETILVDGFYGATCLKEDHP 101
           +H +  YT     AH D+ Y      + +L C +  + +GG +I++D       L  DH 
Sbjct: 95  EHRELIYTEKGQPAHCDSAYHETMPDIVMLGCSKAAS-SGGLSIIID----IRSLLSDHH 149

Query: 102 EDY--EFLTTYEIEGEYIERRHHFTHSAPVIQIDKNTEDIKQIRFNVYDRSAMAFRSGRD 159
            +Y  E L  ++ + + I  + +     P ++++ NTE   +  F  +  SAM F+S  D
Sbjct: 150 MEYLKERLRAHQ-QIDVIYSKRNIRVEKPFVKMNPNTERA-EFAFTPFALSAM-FKSKED 206

Query: 160 CRLYYRSLKNLARYYENKENQWIFKLVPGLVMVIDNFRLLHGRNGFTGRRVL 211
             LY   ++N      +      F L  G  +++DN  +LHGR  F G R L
Sbjct: 207 ANLYDCIIQNS----NSPTYSRYFLLEDGDFVILDNTSMLHGRTAFAGERSL 254


>UniRef50_A1DGB7 Cluster: Haloacid dehalogenase-like hydrolase,
           putative; n=3; Trichocomaceae|Rep: Haloacid
           dehalogenase-like hydrolase, putative - Neosartorya
           fischeri (strain ATCC 1020 / DSM 3700 / NRRL
           181)(Aspergillus fischerianus (strain ATCC 1020 / DSM
           3700 / NRRL 181))
          Length = 486

 Score = 49.6 bits (113), Expect = 7e-05
 Identities = 42/165 (25%), Positives = 69/165 (41%), Gaps = 19/165 (11%)

Query: 48  YTNLPLAAHNDNIYWTEAAGLQILHCIEHTNGTGGETILVDGFYGATCLKEDHPEDYEFL 107
           Y++  L  H D   W E   + ++  ++  +  GGE++L DG+     LK +  E YE +
Sbjct: 81  YSSEELFFHTDRSGWDEPPQI-LMSTLKSRSEAGGESLLADGYQVLEALKREDEELYELI 139

Query: 108 TTYEIEGEYIERRHHFTHSAPVIQIDKNTEDIKQIRFNVYDRSAMAFRSGRDCRLYYRSL 167
           T           +H    S   + + +   D K           + FR     +L    +
Sbjct: 140 TN---------SKHTSFRSDDEVFVPRAIFDRKN--------GILRFRFDDSIQLSASMV 182

Query: 168 KNLARYYEN-KENQWIFKLVPGLVMVIDNFRLLHGRNGFTGRRVL 211
              +R  +   EN ++  L PG   ++DN R LHGR  F+G R L
Sbjct: 183 SRFSRLQDIIYENAFVVSLQPGQGYILDNHRYLHGRASFSGSREL 227


>UniRef50_Q9KQQ4 Cluster: PvcB protein; n=20; Proteobacteria|Rep:
           PvcB protein - Vibrio cholerae
          Length = 287

 Score = 47.6 bits (108), Expect = 3e-04
 Identities = 48/198 (24%), Positives = 77/198 (38%), Gaps = 17/198 (8%)

Query: 20  CKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGLQILHCIEHT-N 78
           C+  G +    FG   +     D  D  + +  +  H D +Y  +    QI  C++    
Sbjct: 65  CERWGEVSVWPFGRVLDLVQKEDPGDHIFDSSYMPMHWDGMYRPQVPEYQIFQCVKAPLP 124

Query: 79  GTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERRHHFTHSAPV--IQIDKNT 136
           G GG T      +  T L   H    +     ++ G Y +R+  F HS  V  I +    
Sbjct: 125 GHGGRTT-----FSHTMLALQHAPQPDLELWQQVTGHY-QRKMEFYHSKTVSPIVMQHPY 178

Query: 137 EDIKQIRFNV--YDRSAMAFR------SGRDCRLYYRSLKNLARYYENKENQWIFKLVPG 188
            D + IR+N   ++ +           SG          K+L R   +  N +  +   G
Sbjct: 179 RDYQVIRYNEPHFEENGDLLNPPDVSLSGITPEQAIEFHKSLRRALYDPRNFYAHEWQTG 238

Query: 189 LVMVIDNFRLLHGRNGFT 206
            +++ DNF LLHGR  FT
Sbjct: 239 DIVITDNFSLLHGREAFT 256


>UniRef50_A4D938 Cluster: CrpF; n=1; Nostoc sp. ATCC 53789|Rep: CrpF
           - Nostoc sp. ATCC 53789
          Length = 294

 Score = 46.8 bits (106), Expect = 5e-04
 Identities = 40/165 (24%), Positives = 76/165 (46%), Gaps = 14/165 (8%)

Query: 42  DHADTAYTNLPLAAHNDNIYWTEAAGLQILHCIEHTNGTGGETILVDGFYGATCLKEDHP 101
           ++ +T  T+L L  H D  +      +  + C +     GG T L+DG      LK  +P
Sbjct: 83  EYVNTTTTDLSL--HTDGAFTITPPKVMAMQC-QIAAANGGFTKLIDGKLVYEHLKRTNP 139

Query: 102 EDYEFLTTYEIEGEYIERRHHFTHSAPVIQIDKNTEDIKQIRFNVYDRSAMAFRSGRDCR 161
                LT +  +   ++R +    + P+ + + +   I  +RF   + + ++  S     
Sbjct: 140 VG--LLTLFNPDAITVKRDNKKA-TKPIFE-EHHAGLI--VRFRADNAAHVSVESKS--- 190

Query: 162 LYYRSLKNLARYYENKENQWIFKLVPGLVMVIDNFRLLHGRNGFT 206
             + + K+   +  N +NQ IFKL    ++++DN R+LHGR  F+
Sbjct: 191 --FAAFKSFENFVNNPDNQVIFKLAQNQIIIVDNTRVLHGRTAFS 233


>UniRef50_P23180 Cluster: Uncharacterized oxidoreductase YHL021C;
           n=2; Saccharomyces cerevisiae|Rep: Uncharacterized
           oxidoreductase YHL021C - Saccharomyces cerevisiae
           (Baker's yeast)
          Length = 465

 Score = 45.6 bits (103), Expect = 0.001
 Identities = 34/128 (26%), Positives = 52/128 (40%), Gaps = 6/128 (4%)

Query: 17  ETVCKALGGIQHTIFG-ATWEFT-TVADHADTAYTNLPLAAHNDNIYWTEAAGLQILHCI 74
           + +C+ +G I+ T+ G  T++   + A   +  Y N  L  H D  +     G QIL  +
Sbjct: 205 QKICERIGPIRSTVHGEGTFDVNASQATSVNAHYANKDLPLHTDLPFLENVPGFQILQSL 264

Query: 75  EHTNGTGGET----ILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERRHHFTHSAPVI 130
             T G    T      VD FY    ++E   E YE L    +   Y      +  S P+I
Sbjct: 265 PATEGEDPNTRPMNYFVDAFYATRNVRESDFEAYEALQIVPVNYIYENGDKRYYQSKPLI 324

Query: 131 QIDKNTED 138
           +     ED
Sbjct: 325 EHHDINED 332


>UniRef50_Q19Q32 Cluster: Trimethyllysine hydroxylase-like; n=1;
           Belgica antarctica|Rep: Trimethyllysine hydroxylase-like
           - Belgica antarctica
          Length = 234

 Score = 44.8 bits (101), Expect = 0.002
 Identities = 30/105 (28%), Positives = 53/105 (50%), Gaps = 5/105 (4%)

Query: 9   VQPSAEATETVCKALGGIQ--HTIF--GATWEFTTVADHADTAYTNLPLAAHNDNIYWTE 64
           V P+ EA+E V   L  +Q    IF  G         + ++  YTN+ L+  +++ Y+ +
Sbjct: 130 VAPTMEASEEVVSILFELQSQRQIFCNGIANYSDAATEISNAKYTNVCLSPKSEHTYFND 189

Query: 65  AAGLQILHCIEHTNGTGGETILVDGFYGATCLKEDHPEDYEFLTT 109
             GL +++C+ H+      + L++    +  LK+ HPE Y  LTT
Sbjct: 190 GQGLLMINCVHHSPDC-ALSYLIEARKISNNLKQKHPEQYISLTT 233


>UniRef50_A7SYU0 Cluster: Predicted protein; n=3; Nematostella
           vectensis|Rep: Predicted protein - Nematostella
           vectensis
          Length = 437

 Score = 40.7 bits (91), Expect = 0.030
 Identities = 53/200 (26%), Positives = 75/200 (37%), Gaps = 20/200 (10%)

Query: 48  YTNLPLAAHNDNIYWTEAAGLQILHCIEHTNGTGGET-ILVDGFYGATCLKEDHPEDYEF 106
           YT      H D  Y+     L  L   E+       T   VD       ++++ PE +E 
Sbjct: 210 YTQDKHPVHADTSYFDVPVRLSGLLATEYDAPVEDTTNYFVDSVKVIEDIRKEEPEAFEL 269

Query: 107 LTTYE-------------IEGEYIERRHHFTH-SAPVIQIDKNTEDIKQIRF--NVYDRS 150
           L+T               +E E     H  T    P I  D   ED   +RF  N     
Sbjct: 270 LSTVPTRFSRRRMDVPEPVEPERAPAFHFETLIKTPFIGYDVG-EDRPSLRFSNNHCGLD 328

Query: 151 AMAFRSGRDCRLYYRSLKNLARYYENKENQWIFKLVPGLVMVIDNFRLLHGRNGF--TGR 208
             +F+  +  R Y+ +LK L     + EN     L  G   V +N+ + HGRN    T R
Sbjct: 329 PDSFKDPKTMRRYFEALKLLQDKLTDPENHQQLVLRQGWCAVFNNWHVCHGRNAVHPTTR 388

Query: 209 RVLCGAYVSRSDWLDKARSL 228
           R L  +Y+S   W  + R L
Sbjct: 389 RSLMLSYISNVTWQTRWRIL 408


>UniRef50_Q6E7K0 Cluster: JamJ; n=3; Oscillatoriales|Rep: JamJ -
           Lyngbya majuscula
          Length = 3302

 Score = 39.1 bits (87), Expect = 0.092
 Identities = 34/108 (31%), Positives = 55/108 (50%), Gaps = 15/108 (13%)

Query: 107 LTTYEIEGEYIE--RRHHFTHSAPVIQIDKNTEDIKQIRFNVYDRSAMAFRSGRDCRLYY 164
           L    +EG YIE  +R  ++HS    Q+ +   DIK   F++ +         RD +LYY
Sbjct: 496 LDVLALEGRYIEISKRKIWSHS----QVAQKRSDIKYFPFDLLEEF------NRDNQLYY 545

Query: 165 RSLKNLARYYENKE-NQWIFKLVPGLVMVIDNFRLLHGRNGFTGRRVL 211
           +  K L + +ENKE +  ++K  P    +++ FR L  R+   GR V+
Sbjct: 546 QIWKKLIQCFENKELHPLVYKTFPN-EDIVEAFRYLQ-RSKHIGRVVV 591


>UniRef50_Q4FKY1 Cluster: Gab protein; n=2; Candidatus Pelagibacter
           ubique|Rep: Gab protein - Pelagibacter ubique
          Length = 303

 Score = 38.7 bits (86), Expect = 0.12
 Identities = 44/159 (27%), Positives = 64/159 (40%), Gaps = 14/159 (8%)

Query: 47  AYTNLPLAAHNDNIYWTEAAGLQILHCIEHTNGTGGETILVDGFYGATCLKEDHPEDYEF 106
           AYTN+ L  H D  Y  E     ++  IE  N  GGET ++       C  ED   D   
Sbjct: 139 AYTNMDL--HTDGTYVKEITDWLLMTKIEEQNVQGGETAMLHLDDWEHC--EDLFNDPIG 194

Query: 107 LTTYEIEGEYIERRHHFTHSAPVIQIDKNTEDIKQIRFNVYDRSAMAFRSGRDCRLYYRS 166
              + + G    +   +    PV   D N +       N+            D  ++   
Sbjct: 195 KQNF-VWGSPKSKNIEYKVEHPVFTTDDNGKP------NISYIDQFPEPKNMDQGIF--- 244

Query: 167 LKNLARYYENKENQWIFKLVPGLVMVIDNFRLLHGRNGF 205
           L+ L+   E  +N+ I KLVPG  +V +N+  LHGR  F
Sbjct: 245 LQKLSDALEESKNKVITKLVPGSTIVANNYFWLHGRKPF 283


>UniRef50_Q6FMD9 Cluster: Similar to sp|P23180 Saccharomyces
           cerevisiae YHL021c; n=1; Candida glabrata|Rep: Similar
           to sp|P23180 Saccharomyces cerevisiae YHL021c - Candida
           glabrata (Yeast) (Torulopsis glabrata)
          Length = 508

 Score = 37.9 bits (84), Expect = 0.21
 Identities = 27/130 (20%), Positives = 56/130 (43%), Gaps = 12/130 (9%)

Query: 19  VCKALGGIQHTIFGATWEFTT--VADHA---------DTAYTNLPLAAHNDNIYWTEAAG 67
           +C+ +G ++ T +G  ++ T   + D+          +  + N+    H D  +     G
Sbjct: 241 LCERIGPLRKTFYGEVFDITNQNLKDYDQDLPPPHDYNIPFENIASPLHMDLQFLENVPG 300

Query: 68  LQILHCIEHTNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERRHHFTHSA 127
            +ILH +++ +      I VD  Y A  ++E   E YE L    I   +  +   +  S 
Sbjct: 301 FKILHALKNES-ENDTNIFVDSLYAARNIRETDNEAYEALQHVPINYTFKNKNKRYYQSK 359

Query: 128 PVIQIDKNTE 137
           P+++  ++ E
Sbjct: 360 PLVEEYESNE 369


>UniRef50_Q9I6U7 Cluster: Putative uncharacterized protein; n=4;
           Pseudomonas aeruginosa|Rep: Putative uncharacterized
           protein - Pseudomonas aeruginosa
          Length = 271

 Score = 37.5 bits (83), Expect = 0.28
 Identities = 40/157 (25%), Positives = 61/157 (38%), Gaps = 11/157 (7%)

Query: 49  TNLPLAAHNDNIYWTEAAGLQILHCIEHTNGTGGETILVDGFYGATCLKEDHPEDYEFLT 108
           T L    H D  +      L  L C+ +    GGET+L         L+   P  +  L 
Sbjct: 95  TPLEHKLHTDGAFLDTPEQLCSLQCVRNAR-EGGETLLASAGLAFERLRRRMPTKH--LG 151

Query: 109 TYEIEGEYIERRHHFTHSAPVIQIDKNTEDIKQIRFNVYDRSAMAFRSGRDCRLYYRSLK 168
               +   I R+H  + + PV +++     IK   F   D +A          +   +  
Sbjct: 152 LLRGDALTIVRKHQ-SSTQPVFRLNGEALGIK---FRQNDGAAEVVEHP----VAVEAFA 203

Query: 169 NLARYYENKENQWIFKLVPGLVMVIDNFRLLHGRNGF 205
            L    E+   Q   KL PG ++V+DN  +LHGR  F
Sbjct: 204 ELVAALEDPACQLRIKLEPGEILVLDNTAVLHGRTAF 240


>UniRef50_Q2C5Y7 Cluster: Putative uncharacterized protein; n=2;
           Vibrionaceae|Rep: Putative uncharacterized protein -
           Photobacterium sp. SKA34
          Length = 162

 Score = 37.1 bits (82), Expect = 0.37
 Identities = 21/84 (25%), Positives = 43/84 (51%), Gaps = 10/84 (11%)

Query: 98  EDHPEDYEFLTTYEIEGEYIERRHHFTHSAPVIQIDKNTEDIKQIR-FNVYDRSAMAFRS 156
           + + ED + +  Y  +G+Y+++R H T     +QI K  +DI   R +  Y+  ++ +R 
Sbjct: 78  QGNDEDVKLMELY-YDGDYLQKRDHLTLKFESVQIQKKHDDINLNRAYKEYEGHSILYR- 135

Query: 157 GRDCRLYYRSLKNLARYYENKENQ 180
                  Y+++ N   +Y + EN+
Sbjct: 136 -------YKNINNRLYFYTDSENE 152


>UniRef50_Q94534 Cluster: Beaten path precursor; n=5; Diptera|Rep:
           Beaten path precursor - Drosophila melanogaster (Fruit
           fly)
          Length = 427

 Score = 37.1 bits (82), Expect = 0.37
 Identities = 23/81 (28%), Positives = 34/81 (41%), Gaps = 2/81 (2%)

Query: 37  FTTVADHADTAYTNLPLAAHNDNIYW--TEAAGLQILHCIEHTNGTGGETILVDGFYGAT 94
           F     H D     L  +A   ++YW  TE   L+     +H NG  G  +  D FY   
Sbjct: 209 FVVTDQHFDNGKLKLRCSAQLHDVYWKTTEKIILETDLFPKHGNGANGNHVNPDDFYDQY 268

Query: 95  CLKEDHPEDYEFLTTYEIEGE 115
            L EDH  + +     +++GE
Sbjct: 269 ALHEDHLHNKKNSYLTQLQGE 289


>UniRef50_Q10Z25 Cluster: Putative uncharacterized protein; n=2;
           Trichodesmium erythraeum IMS101|Rep: Putative
           uncharacterized protein - Trichodesmium erythraeum
           (strain IMS101)
          Length = 365

 Score = 35.9 bits (79), Expect = 0.86
 Identities = 22/68 (32%), Positives = 38/68 (55%), Gaps = 8/68 (11%)

Query: 152 MAFRSGRDCRLY--YRSLKNLARYYENKENQWIFKLVPGLVMVIDNFRLLHGRNGFTGRR 209
           + F  G D   +  Y  +K+   Y++N E   +F    G ++++DN+R+ HGR  FTG+R
Sbjct: 303 VVFEDGSDLSFWDVYHIVKS---YWKNAE---LFSWQEGDIVILDNYRMGHGRLPFTGKR 356

Query: 210 VLCGAYVS 217
            +  A+ S
Sbjct: 357 KVYIAFSS 364


>UniRef50_A3NJS9 Cluster: Taurine catabolism dioxygenase TauD, TfdA
           family; n=6; Burkholderia pseudomallei|Rep: Taurine
           catabolism dioxygenase TauD, TfdA family - Burkholderia
           pseudomallei (strain 668)
          Length = 278

 Score = 35.9 bits (79), Expect = 0.86
 Identities = 21/72 (29%), Positives = 36/72 (50%), Gaps = 4/72 (5%)

Query: 134 KNTEDIKQIRFNVYDRSAMAFRSGRDCRLYYRSLKNLARYYENKENQWIFKLVPGLVMVI 193
           K  +D   I+F + D +A A  +     +Y     +LA +  + EN  +F L PG +++ 
Sbjct: 181 KEDDDGWSIKFRMNDGAATATPAPAAADMY----GSLACFLTDPENMLLFPLEPGQILIG 236

Query: 194 DNFRLLHGRNGF 205
           DN  + HGR  +
Sbjct: 237 DNTAVTHGRTSY 248


>UniRef50_Q5QFY7 Cluster: ORF3; n=3; Proteobacteria|Rep: ORF3 -
           Pseudomonas syringae pv. phaseolicola
          Length = 306

 Score = 35.5 bits (78), Expect = 1.1
 Identities = 41/158 (25%), Positives = 60/158 (37%), Gaps = 9/158 (5%)

Query: 49  TNLPLAAHNDNIYWTEAA-GLQILHCIEHTNGTGGETILVDGFYGATCLKEDHPEDYEFL 107
           T++PL  H D  Y         IL C E     GGE+IL D       L EDHP+    L
Sbjct: 115 TDVPL--HTDGSYLPIGTIKTSILFCRESA-ALGGESILFDSVSAFRALSEDHPDLARSL 171

Query: 108 ---TTYEIEGEYIERRHHFTHSAPVIQIDKNTEDIKQIRFNVYDRSAMAFRSGRDCRLYY 164
                +            + H  P+  + +   DI    F +   +   +    D R+  
Sbjct: 172 LADNAFRRRSTSTRSGRQYQHIGPMF-LRREDGDIVG-GFTLDITADWEYSRRMDARVID 229

Query: 165 RSLKNLARYYENKENQWIFKLVPGLVMVIDNFRLLHGR 202
            +   +    EN +    F L  G V++I N +L HGR
Sbjct: 230 AAAYLIRLASENSDYTLKFGLHKGQVLIIRNDQLSHGR 267


>UniRef50_Q0AA88 Cluster: Flagellar hook capping protein; n=1;
           Alkalilimnicola ehrlichei MLHE-1|Rep: Flagellar hook
           capping protein - Alkalilimnicola ehrlichei (strain
           MLHE-1)
          Length = 222

 Score = 35.5 bits (78), Expect = 1.1
 Identities = 39/150 (26%), Positives = 60/150 (40%), Gaps = 5/150 (3%)

Query: 4   KSFNQVQPSAEATETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWT 63
           KSF++ Q    A + + +A   +   +   T      AD   T    LP A  N NI+  
Sbjct: 70  KSFSEFQQDMRANQAL-QAASLVGREVLVETDAGRLPADGEMTGIVQLPSAVANANIHIH 128

Query: 64  EAAGLQILHCIEHTNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEYIERRHHF 123
            AAG ++         +G      DG   A   +   P  Y    + ++EGE  ER    
Sbjct: 129 NAAGERVRTLATGEQPSGDYRFAWDG--RADDGRTLPPGAYRVTASTQVEGE--ERSLRV 184

Query: 124 THSAPVIQIDKNTEDIKQIRFNVYDRSAMA 153
            +SAPV+ +    E+ +  R N+     MA
Sbjct: 185 MNSAPVVSVTLAGENERGPRVNLDGIGEMA 214


>UniRef50_A4KUB9 Cluster: TlmR3; n=2; Actinomycetales|Rep: TlmR3 -
           Streptoalloteichus hindustanus
          Length = 348

 Score = 35.5 bits (78), Expect = 1.1
 Identities = 18/28 (64%), Positives = 19/28 (67%), Gaps = 1/28 (3%)

Query: 187 PGLVMVIDNFRLLHGRNGFTG-RRVLCG 213
           PG VMV+DN    HGR  FTG RRVL G
Sbjct: 304 PGDVMVVDNLLSAHGREPFTGARRVLVG 331


>UniRef50_Q05582 Cluster: Clavaminate synthase 2; n=5; Streptomyces
           clavuligerus|Rep: Clavaminate synthase 2 - Streptomyces
           clavuligerus
          Length = 325

 Score = 35.1 bits (77), Expect = 1.5
 Identities = 15/25 (60%), Positives = 18/25 (72%)

Query: 184 KLVPGLVMVIDNFRLLHGRNGFTGR 208
           KLVPG V++IDNFR  H R  F+ R
Sbjct: 264 KLVPGDVLIIDNFRTTHARTPFSPR 288


>UniRef50_Q5ZU71 Cluster: Pyoverdine biosynthesis regulatory gene
           SyrP-like; n=4; Legionella pneumophila|Rep: Pyoverdine
           biosynthesis regulatory gene SyrP-like - Legionella
           pneumophila subsp. pneumophila (strain Philadelphia 1
           /ATCC 33152 / DSM 7513)
          Length = 353

 Score = 34.7 bits (76), Expect = 2.0
 Identities = 13/25 (52%), Positives = 18/25 (72%)

Query: 187 PGLVMVIDNFRLLHGRNGFTGRRVL 211
           PG VM++DNF  LHG+   TG R++
Sbjct: 320 PGDVMIVDNFSCLHGKTPHTGNRLI 344


>UniRef50_Q118E5 Cluster: Gamma-butyrobetaine,2-oxoglutarate
           dioxygenase; n=1; Trichodesmium erythraeum IMS101|Rep:
           Gamma-butyrobetaine,2-oxoglutarate dioxygenase -
           Trichodesmium erythraeum (strain IMS101)
          Length = 328

 Score = 34.7 bits (76), Expect = 2.0
 Identities = 19/58 (32%), Positives = 32/58 (55%), Gaps = 3/58 (5%)

Query: 163 YYRSLKNLARYYENKENQWIFKLVPGLVMVIDNFRLLHGRNGF---TGRRVLCGAYVS 217
           +Y +     RY ++ E Q+ F+      ++I NF++LHGR  F   +G R L  +YV+
Sbjct: 249 FYEAYSQFFRYLKSPEYQYHFRSEAEDCLMIQNFQVLHGRTAFDANSGSRHLEVSYVA 306


>UniRef50_A5EW81 Cluster: Putative uncharacterized protein; n=1;
           Dichelobacter nodosus VCS1703A|Rep: Putative
           uncharacterized protein - Dichelobacter nodosus (strain
           VCS1703A)
          Length = 584

 Score = 34.7 bits (76), Expect = 2.0
 Identities = 19/51 (37%), Positives = 27/51 (52%), Gaps = 1/51 (1%)

Query: 7   NQVQPSAEATET-VCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAH 56
           N ++ +  +TE  V +A+   QHT   A   FT +AD A   +TNL   AH
Sbjct: 313 NDIRATQASTEAKVQEAVDAFQHTATTAKESFTALADQATKQFTNLTQQAH 363


>UniRef50_Q50E85 Cluster: Putative uncharacterized protein; n=1;
           Streptomyces filamentosus|Rep: Putative uncharacterized
           protein - Streptomyces filamentosus (Streptomyces
           roseosporus)
          Length = 255

 Score = 33.9 bits (74), Expect = 3.5
 Identities = 16/38 (42%), Positives = 22/38 (57%)

Query: 183 FKLVPGLVMVIDNFRLLHGRNGFTGRRVLCGAYVSRSD 220
           F+L  G ++V+DN+R  HGR   TG R +    V  SD
Sbjct: 216 FRLDKGEILVLDNYRCWHGREAHTGDRAVRILTVRSSD 253


>UniRef50_A0BH33 Cluster: Chromosome undetermined scaffold_107, whole
            genome shotgun sequence; n=1; Paramecium tetraurelia|Rep:
            Chromosome undetermined scaffold_107, whole genome
            shotgun sequence - Paramecium tetraurelia
          Length = 1241

 Score = 33.9 bits (74), Expect = 3.5
 Identities = 24/83 (28%), Positives = 39/83 (46%), Gaps = 3/83 (3%)

Query: 101  PEDYEFLTTYEIEGEYIER--RHHFTHSAPVIQIDKNTEDIKQIRFNVYDRSAMAFRSGR 158
            P++Y    TY+I+  YIER  R +  +   + +I    E IK++    Y     +     
Sbjct: 1005 PQEYRSNITYDIKLNYIERNMRPYIDYDYYISRILTGYEIIKKMHMKHYQEVFNSADMDL 1064

Query: 159  DCRLYYRSLKNLARYYE-NKENQ 180
            D  + Y+  K L  Y+E N+ NQ
Sbjct: 1065 DNMMEYQEFKKLYYYFEVNQGNQ 1087


>UniRef50_Q3IC42 Cluster: Putative oxidoreductase; n=2;
           Alteromonadales|Rep: Putative oxidoreductase -
           Pseudoalteromonas haloplanktis (strain TAC 125)
          Length = 342

 Score = 33.5 bits (73), Expect = 4.6
 Identities = 21/62 (33%), Positives = 32/62 (51%), Gaps = 3/62 (4%)

Query: 32  GATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGLQILHCIEHTNGTGGETILVDGFY 91
           GA  E+T VA++A +    +P     D       AG+  L  ++  N T G+T+L+DG  
Sbjct: 108 GALSEYTKVANYAVSV---VPAELAGDMAATLPCAGMAALISLDKINITEGDTVLIDGGA 164

Query: 92  GA 93
           GA
Sbjct: 165 GA 166


>UniRef50_Q9FB40 Cluster: SyrP-like protein; n=1; Streptomyces
           verticillus|Rep: SyrP-like protein - Streptomyces
           verticillus
          Length = 328

 Score = 33.5 bits (73), Expect = 4.6
 Identities = 15/25 (60%), Positives = 19/25 (76%), Gaps = 1/25 (4%)

Query: 188 GLVMVIDNFRLLHGRNGFTG-RRVL 211
           G +M++DN R+ HGR  FTG RRVL
Sbjct: 296 GDIMLVDNLRMAHGREPFTGERRVL 320


>UniRef50_Q29DR0 Cluster: GA10095-PA; n=2; pseudoobscura
           subgroup|Rep: GA10095-PA - Drosophila pseudoobscura
           (Fruit fly)
          Length = 2483

 Score = 33.5 bits (73), Expect = 4.6
 Identities = 14/47 (29%), Positives = 19/47 (40%)

Query: 14  EATETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNI 60
           E T T      G++HT     WE     D  +   T  P  AHN+ +
Sbjct: 343 ELTTTAAAPNSGLEHTSENVVWEAKLTTDAPEALSTTTPAVAHNETV 389


>UniRef50_A0DD98 Cluster: Chromosome undetermined scaffold_46, whole
           genome shotgun sequence; n=1; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_46,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 1489

 Score = 33.5 bits (73), Expect = 4.6
 Identities = 17/61 (27%), Positives = 34/61 (55%), Gaps = 1/61 (1%)

Query: 134 KNTEDIKQIRFNVYDRSAMAFRSG-RDCRLYYRSLKNLARYYENKENQWIFKLVPGLVMV 192
           KN+ ++KQ  +N  D+++  F  G  D  ++Y     L ++ ENK+  +I+  +  L+ V
Sbjct: 852 KNSLNLKQDEYNNDDQASSKFSKGFLDTAIFYSKYNQLNKFLENKKLLFIYVTISLLLFV 911

Query: 193 I 193
           +
Sbjct: 912 L 912


>UniRef50_Q6Q472 Cluster: Calcium activated chloride channel
           variant; n=3; Murinae|Rep: Calcium activated chloride
           channel variant - Mus musculus (Mouse)
          Length = 843

 Score = 32.7 bits (71), Expect = 8.0
 Identities = 17/55 (30%), Positives = 25/55 (45%), Gaps = 1/55 (1%)

Query: 62  WTEAAGLQILHCIEHTNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEGEY 116
           +T   G ++   IE  +G   E +L+D   GA   K+D      F T Y + G Y
Sbjct: 541 YTPIIGARVTATIESNSGKTEELVLLDNGAGADAFKDDGVYS-RFFTAYSVNGRY 594


>UniRef50_Q9I1L4 Cluster: Pyoverdine biosynthesis protein PvcB; n=6;
           Pseudomonas aeruginosa|Rep: Pyoverdine biosynthesis
           protein PvcB - Pseudomonas aeruginosa
          Length = 291

 Score = 32.7 bits (71), Expect = 8.0
 Identities = 19/73 (26%), Positives = 28/73 (38%), Gaps = 1/73 (1%)

Query: 13  AEATETVCKALGGIQHTIFGATWEFTTVADHADTAYTNLPLAAHNDNIYWTEAAGLQILH 72
           AE+    C   G +    FGA  E        D  + N  +  H D +Y       Q+ H
Sbjct: 67  AESLTRYCHDFGEVMLWPFGAVLELVEQEGAEDHIFANNYVPLHWDGMYLETVPEFQVFH 126

Query: 73  CIEHT-NGTGGET 84
           C++   +  GG T
Sbjct: 127 CVDAPGDSDGGRT 139


>UniRef50_Q72TN9 Cluster: Syringomycin channel-forming protein; n=4;
           Leptospira|Rep: Syringomycin channel-forming protein -
           Leptospira interrogans serogroup Icterohaemorrhagiae
           serovar copenhageni
          Length = 262

 Score = 32.7 bits (71), Expect = 8.0
 Identities = 18/49 (36%), Positives = 28/49 (57%), Gaps = 2/49 (4%)

Query: 167 LKNLARYYENKENQWIFKLVPGLVMVIDNFRLLHGRNGFTGRRVLCGAY 215
           LK +   + N  N  +F    G ++VIDN+ + HGR+ FTG R +  A+
Sbjct: 214 LKQIQNVFWN--NISLFSWQNGDILVIDNYSVSHGRHPFTGPREIFVAW 260


>UniRef50_Q2CHG0 Cluster: Putative uncharacterized protein; n=1;
           Oceanicola granulosus HTCC2516|Rep: Putative
           uncharacterized protein - Oceanicola granulosus HTCC2516
          Length = 269

 Score = 32.7 bits (71), Expect = 8.0
 Identities = 13/25 (52%), Positives = 17/25 (68%)

Query: 183 FKLVPGLVMVIDNFRLLHGRNGFTG 207
           FK  PG ++ + N R+LHGR GF G
Sbjct: 228 FKAQPGEIVFMQNTRVLHGRRGFGG 252


>UniRef50_Q9NGV3 Cluster: SP1173; n=4; Sophophora|Rep: SP1173 -
           Drosophila melanogaster (Fruit fly)
          Length = 741

 Score = 32.7 bits (71), Expect = 8.0
 Identities = 14/45 (31%), Positives = 25/45 (55%), Gaps = 2/45 (4%)

Query: 70  ILHCIEHTNGTGGETILVDGFYGATCLKEDHPEDYEFLTTYEIEG 114
           + HC  +T  T   ++++D   G+   +E+H +DYE    Y +EG
Sbjct: 267 VKHCHAYTEDT--TSVVMDALMGSAISQEEHYDDYEGWCQYPLEG 309


  Database: uniref50
    Posted date:  Oct 5, 2007 11:19 AM
  Number of letters in database: 575,637,011
  Number of sequences in database:  1,657,284
  
Lambda     K      H
   0.322    0.137    0.428 

Gapped
Lambda     K      H
   0.279   0.0580    0.190 


Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 281,785,509
Number of Sequences: 1657284
Number of extensions: 12089472
Number of successful extensions: 23187
Number of sequences better than 10.0: 106
Number of HSP's better than 10.0 without gapping: 85
Number of HSP's successfully gapped in prelim test: 21
Number of HSP's that attempted gapping in prelim test: 22961
Number of HSP's gapped (non-prelim): 127
length of query: 232
length of database: 575,637,011
effective HSP length: 98
effective length of query: 134
effective length of database: 413,223,179
effective search space: 55371905986
effective search space used: 55371905986
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.9 bits)
S2: 71 (32.7 bits)
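
The figures in the statistics block above are consistent with one another, and the short Python sketch below checks that arithmetic. It assumes the usual Karlin-Altschul relations: the effective HSP length is subtracted once from the query and once per database sequence, and the expect value is approximated as search space times 2 to the power of minus the bit score. The function and variable names are illustrative only and are not part of BLAST. Because the report rounds bit scores to one decimal, the recomputed E-values are approximate.

```python
# Values copied from the report's statistics block.
QUERY_LENGTH = 232
DB_LETTERS = 575_637_011
DB_SEQUENCES = 1_657_284
EFFECTIVE_HSP_LENGTH = 98


def effective_search_space(query_len, db_letters, db_sequences, hsp_len):
    """Return (effective query length, effective db length, search space).

    Assumption: the HSP length correction is applied once to the query
    and once per database sequence, which reproduces the reported values.
    """
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_sequences * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def expect_from_bits(bit_score, search_space):
    """Approximate E-value from a bit score: E = search_space * 2**(-bits)."""
    return search_space * 2.0 ** (-bit_score)


eff_query, eff_db, space = effective_search_space(
    QUERY_LENGTH, DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH
)
print("effective length of query:   ", eff_query)
print("effective length of database:", eff_db)
print("effective search space:      ", space)

# Recompute the expect values for two hits listed in the report.
for bits in (47.6, 32.7):
    print(f"{bits} bits -> E ~ {expect_from_bits(bits, space):.2g}")
```

With the report's numbers, the recomputed effective lengths and search space equal the values printed in the statistics block, and the two recomputed E-values come out close to the 3e-04 and 8.0 shown for the 47.6-bit and 32.7-bit hits.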
