SilkBase

Last updated: 2022/11/18
BLASTP 2.2.12 [Aug-07-2005]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= BGIBMGA000704-TA|BGIBMGA000704-PA|IPR005123|2OG-Fe(II)
oxygenase, IPR001006|Procollagen-lysine 5-dioxygenase,
IPR006620|Prolyl 4-hydroxylase, alpha subunit
         (271 letters)

Database: uniref50 
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

UniRef50_Q9VTH0 Cluster: CG6199-PA, isoform A; n=9; Coelomata|Re...   408   e-113
UniRef50_O60568 Cluster: Procollagen-lysine,2-oxoglutarate 5-dio...   343   4e-93
UniRef50_Q20679 Cluster: Procollagen-lysine,2-oxoglutarate 5-dio...   322   4e-87
UniRef50_O00469 Cluster: Procollagen-lysine,2-oxoglutarate 5-dio...   308   1e-82
UniRef50_A7S477 Cluster: Predicted protein; n=1; Nematostella ve...   303   4e-81
UniRef50_Q96AR9 Cluster: PLOD2 protein; n=3; Eutheria|Rep: PLOD2...   267   2e-70
UniRef50_UPI0000E4A230 Cluster: PREDICTED: similar to Plod-prov ...   248   9e-65
UniRef50_Q5UQC3 Cluster: Probable procollagen-lysine,2-oxoglutar...   199   7e-50
UniRef50_Q4TBD7 Cluster: Chromosome undetermined SCAF7145, whole...   189   6e-47
UniRef50_Q1VL57 Cluster: Putative uncharacterized protein; n=1; ...    68   3e-10
UniRef50_A4RT30 Cluster: Protein Lysyl hydroxylase fusion protei...    60   4e-08
UniRef50_Q01F56 Cluster: SmkH; n=2; Ostreococcus tauri|Rep: SmkH...    58   2e-07
UniRef50_A7SXT6 Cluster: Predicted protein; n=3; Nematostella ve...    56   1e-06
UniRef50_Q15SJ2 Cluster: 2OG-Fe(II) oxygenase; n=6; Proteobacter...    50   5e-05
UniRef50_Q26DQ8 Cluster: Oxidoreductase, 20G-Fe(II) oxygenase su...    46   8e-04
UniRef50_A2ZPZ8 Cluster: Putative uncharacterized protein; n=2; ...    45   0.002
UniRef50_A3KGZ2 Cluster: 2-oxoglutarate and iron-dependent oxyge...    44   0.003
UniRef50_A3TG91 Cluster: Putative uncharacterized protein; n=1; ...    43   0.007
UniRef50_Q55BW6 Cluster: Putative uncharacterized protein; n=3; ...    43   0.007
UniRef50_Q4RGA7 Cluster: Chromosome 12 SCAF15104, whole genome s...    41   0.029
UniRef50_Q9LV19 Cluster: Gb|AAB72163.1; n=5; Magnoliophyta|Rep: ...    41   0.029
UniRef50_Q58LI8 Cluster: Possible dioxygenase; n=1; Cyanophage P...    39   0.16 
UniRef50_Q46JM1 Cluster: Putative uncharacterized protein; n=1; ...    36   0.83 
UniRef50_Q58MP6 Cluster: Dioxygenase; n=1; Cyanophage P-SSM2|Rep...    36   1.1  
UniRef50_Q5UNV6 Cluster: Uncharacterized protein R699; n=1; Acan...    36   1.1  
UniRef50_Q6N063 Cluster: 2-oxoglutarate and iron-dependent oxyge...    36   1.1  
UniRef50_UPI0000F21643 Cluster: PREDICTED: similar to pol polypr...    35   2.5  
UniRef50_A5PBA9 Cluster: Oxidoreductase, 2OG-Fe(II) oxygenase fa...    35   2.5  
UniRef50_P20792 Cluster: Cell surface receptor daf-1 precursor; ...    35   2.5  
UniRef50_UPI00015A7ABE Cluster: UPI00015A7ABE related cluster; n...    34   3.4  
UniRef50_Q4JN23 Cluster: Putative uncharacterized protein; n=1; ...    34   3.4  
UniRef50_A0BE00 Cluster: Chromosome undetermined scaffold_101, w...    34   4.4  
UniRef50_UPI0000DB7621 Cluster: PREDICTED: similar to DNA ligase...    33   5.9  
UniRef50_Q8DKV0 Cluster: Tlr0755 protein; n=1; Synechococcus elo...    33   5.9  
UniRef50_Q5LRQ3 Cluster: TPR domain protein; n=4; cellular organ...    33   5.9  
UniRef50_Q5GQB2 Cluster: Putative uncharacterized protein; n=1; ...    33   5.9  
UniRef50_UPI0000DB7C47 Cluster: PREDICTED: similar to fibroblast...    33   7.7  
UniRef50_Q0HJW2 Cluster: Prolyl 4-hydroxylase, alpha subunit; n=...    33   7.7  
UniRef50_A3WSE0 Cluster: Putative uncharacterized protein; n=1; ...    33   7.7  
UniRef50_Q6ZD92 Cluster: Proline-rich protein-like; n=3; Oryza s...    33   7.7  

>UniRef50_Q9VTH0 Cluster: CG6199-PA, isoform A; n=9; Coelomata|Rep:
           CG6199-PA, isoform A - Drosophila melanogaster (Fruit
           fly)
          Length = 721

 Score =  408 bits (1005), Expect = e-113
 Identities = 174/261 (66%), Positives = 206/261 (78%)

Query: 11  AVTYVQEGHDSDMAFCASLRELNIFMYVSNEEDFGHLVNPETYDITKTHPDMYQLFENNM 70
           A+++  +  D DMA C SLR   IFMY SN   FGHLVN + ++ T T PD Y LF N +
Sbjct: 461 AISFKHKEFDPDMAMCESLRNAGIFMYASNLRIFGHLVNADDFNTTVTRPDFYTLFSNEI 520

Query: 71  EWTTRYLHPEYLANFEEDRKHLMPCTDVYWFPLMSDRFCDEWIAIMEAYGKWSDGTNNDK 130
           +WT +Y+HP Y     E  K   PC DVYWF ++SD FCD+ +AIMEA+  WSDG+NND 
Sbjct: 521 DWTEKYIHPNYSLQLNESNKIQQPCPDVYWFQIVSDAFCDDLVAIMEAHNGWSDGSNNDN 580

Query: 131 RLESGYEAVPTRDIHMSQVGLERHWLQILKDYVRPLQELVFTGYYHNPPASIMNFVVRYR 190
           RLE GYEAVPTRDIHM QVGLER +L+ L+ +VRPLQE  FTGY+HNPP ++MNF+VRYR
Sbjct: 581 RLEGGYEAVPTRDIHMKQVGLERLYLKFLQMFVRPLQERAFTGYFHNPPRALMNFMVRYR 640

Query: 191 PDEQPSLRPHHDSSTYTINLALNTPNVDYEGGGCRFIRYNCSVRNTKKGWLLMHPGRLTH 250
           PDEQPSLRPHHDSSTYTIN+A+N   +DY+GGGCRFIRYNCSV +TKKGW+LMHPGRLTH
Sbjct: 641 PDEQPSLRPHHDSSTYTINIAMNRAGIDYQGGGCRFIRYNCSVTDTKKGWMLMHPGRLTH 700

Query: 251 YHEGLLVTKGTRYIMISFVDP 271
           YHEGLLVT GTRYIMISF+DP
Sbjct: 701 YHEGLLVTNGTRYIMISFIDP 721


>UniRef50_O60568 Cluster: Procollagen-lysine,2-oxoglutarate
           5-dioxygenase 3 precursor; n=75; Euteleostomi|Rep:
           Procollagen-lysine,2-oxoglutarate 5-dioxygenase 3
           precursor - Homo sapiens (Human)
          Length = 738

 Score =  343 bits (842), Expect = 4e-93
 Identities = 146/252 (57%), Positives = 185/252 (73%), Gaps = 1/252 (0%)

Query: 20  DSDMAFCASLRELNIFMYVSNEEDFGHLVNPETYDITKTHPDMYQLFENNMEWTTRYLHP 79
           D DMAFC S R+  IF+++SN+ +FG L+    YD    HPD++Q+F+N ++W  +Y+H 
Sbjct: 488 DPDMAFCKSFRDKGIFLHLSNQHEFGRLLATSRYDTEHLHPDLWQIFDNPVDWKEQYIHE 547

Query: 80  EYLANFEEDRKHLMPCTDVYWFPLMSDRFCDEWIAIMEAYGKWSDGTNNDKRLESGYEAV 139
            Y    E +     PC DVYWFPL+S++ CDE +A ME YG+WS G + D RL  GYE V
Sbjct: 548 NYSRALEGEGIVEQPCPDVYWFPLLSEQMCDELVAEMEHYGQWSGGRHEDSRLAGGYENV 607

Query: 140 PTRDIHMSQVGLERHWLQILKDYVRPLQELVFTGYYHNPPASIMNFVVRYRPDEQPSLRP 199
           PT DIHM QVG E  WLQ+L+ YV P+ E +F G YH    ++MNFVVRYRPDEQPSLRP
Sbjct: 608 PTVDIHMKQVGYEDQWLQLLRTYVGPMTESLFPG-YHTKARAVMNFVVRYRPDEQPSLRP 666

Query: 200 HHDSSTYTINLALNTPNVDYEGGGCRFIRYNCSVRNTKKGWLLMHPGRLTHYHEGLLVTK 259
           HHDSST+T+N+ALN   +DYEGGGCRF+RY+C + + +KGW L+HPGRLTHYHEGL  T 
Sbjct: 667 HHDSSTFTLNVALNHKGLDYEGGGCRFLRYDCVISSPRKGWALLHPGRLTHYHEGLPTTW 726

Query: 260 GTRYIMISFVDP 271
           GTRYIM+SFVDP
Sbjct: 727 GTRYIMVSFVDP 738


>UniRef50_Q20679 Cluster: Procollagen-lysine,2-oxoglutarate
           5-dioxygenase precursor; n=2; Caenorhabditis|Rep:
           Procollagen-lysine,2-oxoglutarate 5-dioxygenase
           precursor - Caenorhabditis elegans
          Length = 730

 Score =  322 bits (792), Expect = 4e-87
 Identities = 145/256 (56%), Positives = 181/256 (70%), Gaps = 4/256 (1%)

Query: 20  DSDMAFCASLRELNIFMYVSNEEDFGHLVNPETYDITKT----HPDMYQLFENNMEWTTR 75
           D DM+ C   R+   F+Y+ NE+ +G L+  + Y  T T    HP+M+Q+FEN   W  R
Sbjct: 475 DPDMSMCKFARDNGHFLYIDNEKYYGFLIVSDEYAETVTEGKWHPEMWQIFENRELWEAR 534

Query: 76  YLHPEYLANFEEDRKHLMPCTDVYWFPLMSDRFCDEWIAIMEAYGKWSDGTNNDKRLESG 135
           Y+HP Y    E +      C DVY FPLMS+RFC+E I  ME +G+WSDG+NNDKRL  G
Sbjct: 535 YIHPGYHKIMEPEHVVDQACPDVYDFPLMSERFCEELIEEMEGFGRWSDGSNNDKRLAGG 594

Query: 136 YEAVPTRDIHMSQVGLERHWLQILKDYVRPLQELVFTGYYHNPPASIMNFVVRYRPDEQP 195
           YE VPTRDIHM+QVG ER WL  +  YVRP+QE  F GYYH P  S M FVVRY+P+EQP
Sbjct: 595 YENVPTRDIHMNQVGFERQWLYFMDTYVRPVQEKTFIGYYHQPVESNMMFVVRYKPEEQP 654

Query: 196 SLRPHHDSSTYTINLALNTPNVDYEGGGCRFIRYNCSVRNTKKGWLLMHPGRLTHYHEGL 255
           SLRPHHD+ST++I++ALN    DYEGGG R+IRYNC+V   + G+ +M PGRLTH HEGL
Sbjct: 655 SLRPHHDASTFSIDIALNKKGRDYEGGGVRYIRYNCTVPADEVGYAMMFPGRLTHLHEGL 714

Query: 256 LVTKGTRYIMISFVDP 271
             TKGTRYIM+SF++P
Sbjct: 715 ATTKGTRYIMVSFINP 730


>UniRef50_O00469 Cluster: Procollagen-lysine,2-oxoglutarate
           5-dioxygenase 2 precursor; n=47; Deuterostomia|Rep:
           Procollagen-lysine,2-oxoglutarate 5-dioxygenase 2
           precursor - Homo sapiens (Human)
          Length = 737

 Score =  308 bits (755), Expect = 1e-82
 Identities = 129/258 (50%), Positives = 182/258 (70%), Gaps = 2/258 (0%)

Query: 14  YVQEGHDSDMAFCASLRELNIFMYVSNEEDFGHLVNPETYDITKTHPDMYQLFENNMEWT 73
           +V++  D DMA C + RE+ +FMY+SN  +FG L++   Y+ +  + D++Q+FEN ++W 
Sbjct: 482 FVRDKLDPDMALCRNAREMGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWK 541

Query: 74  TRYLHPEYLANFEEDRKHLMPCTDVYWFPLMSDRFCDEWIAIMEAYGKWSDGTNNDKRLE 133
            +Y++ +Y   F E+     PC DV+WFP+ S++ CDE +  ME YGKWS G ++D R+ 
Sbjct: 542 EKYINRDYSKIFTENIVE-QPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDSRIS 600

Query: 134 SGYEAVPTRDIHMSQVGLERHWLQILKDYVRPLQELVFTGYYHNPPASIMNFVVRYRPDE 193
            GYE VPT DIHM QV LE  WL  +++++ P+   VF GYY    A ++NFVV+Y P+ 
Sbjct: 601 GGYENVPTDDIHMKQVDLENVWLHFIREFIAPVTLKVFAGYYTKGFA-LLNFVVKYSPER 659

Query: 194 QPSLRPHHDSSTYTINLALNTPNVDYEGGGCRFIRYNCSVRNTKKGWLLMHPGRLTHYHE 253
           Q SLRPHHD+ST+TIN+ALN    D++GGGC+F+RYNCS+ + +KGW  MHPGRLTH HE
Sbjct: 660 QRSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHE 719

Query: 254 GLLVTKGTRYIMISFVDP 271
           GL V  GTRYI +SF+DP
Sbjct: 720 GLPVKNGTRYIAVSFIDP 737


>UniRef50_A7S477 Cluster: Predicted protein; n=1; Nematostella
           vectensis|Rep: Predicted protein - Nematostella
           vectensis
          Length = 729

 Score =  303 bits (743), Expect = 4e-81
 Identities = 143/267 (53%), Positives = 173/267 (64%), Gaps = 4/267 (1%)

Query: 7   PKAKAVTYVQEGHDSDMAFCASLRELNIFMYVSNEEDFGHLVNPETYDITKTHPDMYQLF 66
           PK K   Y     + D++F   LR+  IFMYV+N   FG L   +T      H D++Q+F
Sbjct: 465 PKLKHA-YSYGNLEPDLSFSKYLRDNGIFMYVTNMHYFGRLKETDTVTTNHLHNDLWQIF 523

Query: 67  ENNMEWTTRYLHPEYLANFEEDRKHLMPCTDVYWFPLMSDRFCDEWIAIMEAYGKWSDGT 126
           +N ++W  RYLHP Y  N  +     MPC DV+WFPLMS+ +    I  ME YGKWS G 
Sbjct: 524 DNQIDWEERYLHPNYSQNLNKSIPLKMPCNDVFWFPLMSETWATHMIEEMEHYGKWSGGK 583

Query: 127 NN--DKRLESGYEAVPTRDIHMSQVGLERHWLQILKDYVRPLQELVFTGYYHNPPASIMN 184
           +   D RL  GYE VPT DIHM+QVG ER WL +LK Y+ P+   +F GYY    A IMN
Sbjct: 584 HEPQDARLNGGYENVPTVDIHMNQVGWEREWLHLLKTYIVPVNTRIFPGYYSEGRA-IMN 642

Query: 185 FVVRYRPDEQPSLRPHHDSSTYTINLALNTPNVDYEGGGCRFIRYNCSVRNTKKGWLLMH 244
           FVV+Y P  Q  LRPHHDSSTYTIN+ LN P + Y GGG RFIR +C+V +T+ GW LMH
Sbjct: 643 FVVKYTPSGQYYLRPHHDSSTYTINIGLNKPGIHYGGGGSRFIRQDCAVTDTQVGWALMH 702

Query: 245 PGRLTHYHEGLLVTKGTRYIMISFVDP 271
           PGRLTHYHEGL  T GTRYIM+ FVDP
Sbjct: 703 PGRLTHYHEGLPTTWGTRYIMVCFVDP 729


>UniRef50_Q96AR9 Cluster: PLOD2 protein; n=3; Eutheria|Rep: PLOD2
           protein - Homo sapiens (Human)
          Length = 210

 Score =  267 bits (654), Expect = 2e-70
 Identities = 112/211 (53%), Positives = 152/211 (72%), Gaps = 2/211 (0%)

Query: 61  DMYQLFENNMEWTTRYLHPEYLANFEEDRKHLMPCTDVYWFPLMSDRFCDEWIAIMEAYG 120
           D++Q+FEN ++W  +Y++ +Y   F E+     PC DV+WFP+ S++ CDE +  ME YG
Sbjct: 2   DLWQIFENPVDWKEKYINRDYSKIFTENIVE-QPCPDVFWFPIFSEKACDELVEEMEHYG 60

Query: 121 KWSDGTNNDKRLESGYEAVPTRDIHMSQVGLERHWLQILKDYVRPLQELVFTGYYHNPPA 180
           KWS G ++D R+  GYE VPT DIHM QV LE  WL  +++++ P+   VF GYY    A
Sbjct: 61  KWSGGKHHDSRISGGYENVPTDDIHMKQVDLENVWLHFIREFIAPVTLKVFAGYYTKGFA 120

Query: 181 SIMNFVVRYRPDEQPSLRPHHDSSTYTINLALNTPNVDYEGGGCRFIRYNCSVRNTKKGW 240
            ++NFVV+Y P+ Q SLRPHHD+ST+TIN+ALN    D++GGGC+F+RYNCS+ + +KGW
Sbjct: 121 -LLNFVVKYSPERQRSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGW 179

Query: 241 LLMHPGRLTHYHEGLLVTKGTRYIMISFVDP 271
             MHPGRLTH HEGL V  GTRYI +SF+DP
Sbjct: 180 SFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 210


>UniRef50_UPI0000E4A230 Cluster: PREDICTED: similar to Plod-prov
           protein, partial; n=2; Strongylocentrotus
           purpuratus|Rep: PREDICTED: similar to Plod-prov protein,
           partial - Strongylocentrotus purpuratus
          Length = 609

 Score =  248 bits (608), Expect = 9e-65
 Identities = 117/206 (56%), Positives = 143/206 (69%), Gaps = 3/206 (1%)

Query: 17  EGHDSDMAFCASLRELNIFMYVSNEED-FGHLVNPETYDITKTHPDMYQLFENNMEWTTR 75
           E  D+DMA C  LR   IF+YV N ED +GH+V  + Y+ T  H DM++L+ N  +W  +
Sbjct: 405 EDLDTDMAICMDLRSKGIFLYVVNMEDSYGHIVTLDNYETTHLHNDMWELWNNKEDWEAK 464

Query: 76  YLHPEYLANFEEDRKHL-MPCTDVYWFPLMSDRFCDEWIAIMEAYGKWSDGTNNDKRLES 134
           YL P+Y    E DR ++ MPCTDVY FPLMS  +  E I  ME +G+WS G N DKRL  
Sbjct: 465 YLSPDYFVVKEMDRNNITMPCTDVYTFPLMSRTWAKELIEEMEHFGEWSGGGNQDKRLNG 524

Query: 135 GYEAVPTRDIHMSQVGLERHWLQILKDYVRPLQELVFTGYYHNPPASIMNFVVRYRPDEQ 194
           GYE VPTRDIHM+Q+G E+HWL  L++YV P+ E V+ GYY    A IMNFVVRY+PDEQ
Sbjct: 525 GYENVPTRDIHMNQIGFEQHWLYFLREYVVPICENVYPGYYSKAYA-IMNFVVRYKPDEQ 583

Query: 195 PSLRPHHDSSTYTINLALNTPNVDYE 220
            SLRPHHDSSTYTIN+ALN    DYE
Sbjct: 584 ASLRPHHDSSTYTINVALNERETDYE 609


>UniRef50_Q5UQC3 Cluster: Probable procollagen-lysine,2-oxoglutarate
           5-dioxygenase; n=1; Acanthamoeba polyphaga
           mimivirus|Rep: Probable
           procollagen-lysine,2-oxoglutarate 5-dioxygenase -
           Mimivirus
          Length = 895

 Score =  199 bits (485), Expect = 7e-50
 Identities = 100/265 (37%), Positives = 151/265 (56%), Gaps = 10/265 (3%)

Query: 13  TYVQEGHDSDMAFCASLRELNIFMYVSNEEDFGHL---VNPETYDITKTHPDMYQLFENN 69
           +++  G + DM  C +LR+ N+FMY+SN   +GH+   +N E      T   +Y L    
Sbjct: 634 SHMWNGSNIDMRLCHNLRKNNVFMYLSNLRPYGHIDDSINLEVLSGVPTEVTLYDLPTRK 693

Query: 70  MEWTTRYLHPEYLANFE--EDRKHLMPCTDVYWFPLMSDRFCDEWIAIMEAYGKWSDGTN 127
            EW  +YLHPE+L++ +  +D  +   C DVY FPL +  FC E I +M+    WS G +
Sbjct: 694 EEWEKKYLHPEFLSHLQNFKDFDYTEICNDVYSFPLFTPAFCKEVIEVMDKANLWSKGGD 753

Query: 128 N--DKRLESGYEAVPTRDIHMSQVGLERHWLQILKDYVRPLQELVFTGYYHNPPASIMNF 185
           +  D R+  G E+ PT+D  + +VGL++ W  ++ +YV P    ++  Y        + F
Sbjct: 754 SYFDPRI-GGVESYPTQDTQLYEVGLDKQWHYVVFNYVAPFVRHLYNNY--KTKDINLAF 810

Query: 186 VVRYRPDEQPSLRPHHDSSTYTINLALNTPNVDYEGGGCRFIRYNCSVRNTKKGWLLMHP 245
           VV+Y  + Q  L PHHDSSTYT+N+ALN    +Y  GGC FIR+    +  K G+  +H 
Sbjct: 811 VVKYDMERQSELAPHHDSSTYTLNIALNEYGKEYTAGGCEFIRHKFIWQGQKVGYATIHA 870

Query: 246 GRLTHYHEGLLVTKGTRYIMISFVD 270
           G+L  YH  L +T G RYI++SFV+
Sbjct: 871 GKLLAYHRALPITSGKRYILVSFVN 895


>UniRef50_Q4TBD7 Cluster: Chromosome undetermined SCAF7145, whole
           genome shotgun sequence; n=2; Tetraodontidae|Rep:
           Chromosome undetermined SCAF7145, whole genome shotgun
           sequence - Tetraodon nigroviridis (Green puffer)
          Length = 607

 Score =  189 bits (461), Expect = 6e-47
 Identities = 81/144 (56%), Positives = 103/144 (71%), Gaps = 2/144 (1%)

Query: 71  EWTTRYLHPEYLANFEEDRKHL-MPCTDVYWFPLMSDRFCDEWIAIMEAYGKWSDGTNND 129
           +W  +Y+H  Y   FEE    +  PC DVYWFP  S++ CD  +  MEA+G+WS G + D
Sbjct: 465 DWKEKYVHENYSRIFEEQESFVEQPCPDVYWFPAFSEKMCDHLVETMEAHGQWSSGGHKD 524

Query: 130 KRLESGYEAVPTRDIHMSQVGLERHWLQILKDYVRPLQELVFTGYYHNPPASIMNFVVRY 189
           +RL  GYE VPT D HM+Q+G E+ WL+ L+DY+ P+ E ++ GYY    A IMNFVVRY
Sbjct: 525 ERLSGGYENVPTVDTHMNQIGFEKEWLRFLRDYIVPVTEKLYPGYYPRAQA-IMNFVVRY 583

Query: 190 RPDEQPSLRPHHDSSTYTINLALN 213
           RPDEQPSLRPHHDSST+TIN+ALN
Sbjct: 584 RPDEQPSLRPHHDSSTFTINIALN 607


>UniRef50_Q1VL57 Cluster: Putative uncharacterized protein; n=1;
           Psychroflexus torquis ATCC 700755|Rep: Putative
           uncharacterized protein - Psychroflexus torquis ATCC
           700755
          Length = 364

 Score = 67.7 bits (158), Expect = 3e-10
 Identities = 70/256 (27%), Positives = 116/256 (45%), Gaps = 27/256 (10%)

Query: 14  YVQEGHDSDMAFCASLRELNIFMYVSNEEDF--GHLVNPETYDITKT-HPDMYQLFENNM 70
           YVQ+ H S+    A   E  IF     + +   G L NP T   T   H +  Q  +  M
Sbjct: 116 YVQKQHLSNKFSIACDVEGYIFTCYEPKIEVRNGQLYNPVTGCFTCAYHGNGGQKQKELM 175

Query: 71  EWTTRYLHPEYLANFEEDRKHLMPCTDVYWFPLMSDRFCDEWIAIMEAYGKWSDGTNNDK 130
           E   +     Y  N+   +K+ +   D+     MS+  C   I++ E             
Sbjct: 176 E---KVYSNFYGFNYIPTKKYDILSDDILLIDFMSEDMCQNMISLAE---------EKQF 223

Query: 131 RLESGYEAVPTRDIHMSQVGLERHWLQILKDYVRPLQELVFTGYYHNPPA--SIMN-FVV 187
           R+  G + VP +++ + +      W  + + + + + E+V+   Y NP     + + F++
Sbjct: 224 RIMEG-DKVPAQELRLKEF---EQWKLLEEHWNKIVYEIVWE--YWNPCHMWGLRDAFII 277

Query: 188 RYRPDEQPSLRPHHDSSTYTINLALNTPNVDYEGGGCRFIRYNCSVRNTKKGWLLMHPGR 247
           +Y  D+Q  LR H+D+S  T ++ LN    DYEGG   F R   +  +   G  L+ PG+
Sbjct: 278 KYEMDKQRELRLHNDASLVTGSIKLND---DYEGGVLEFPRQKVNNSDVPVGKCLLFPGQ 334

Query: 248 LTHYHEGLLVTKGTRY 263
           +TH H   L+TKGT+Y
Sbjct: 335 VTHGHSSSLLTKGTKY 350


>UniRef50_A4RT30 Cluster: Protein Lysyl hydroxylase fusion protein,
           putative; n=1; Ostreococcus lucimarinus CCE9901|Rep:
           Protein Lysyl hydroxylase fusion protein, putative -
           Ostreococcus lucimarinus CCE9901
          Length = 618

 Score = 60.5 bits (140), Expect = 4e-08
 Identities = 62/263 (23%), Positives = 110/263 (41%), Gaps = 25/263 (9%)

Query: 13  TYVQEGHDSDMAFCASLRELNIFMYVSNEEDFGHLVNPETYDITKTHPDMYQLFENNMEW 72
           T +   HD+D+     L    + +    +    H +    + + K H  + + F  ++ +
Sbjct: 372 TLLHNPHDTDV-----LHAYGVTLLSQGKWKLAHSIWQLAFKVDKAHLWLSKQFNKDVSY 426

Query: 73  TTRYLH--PEYLANFEEDRKHLMPCTDVYWFP--LMSDRFCDEWIAIMEAYGKWSDGTNN 128
                   PEY  +F+      +   ++Y      +S   C  WI   EA+     G + 
Sbjct: 427 AAYLCRKCPEYYVSFQT-----ISIDEIYLTTNSAISPSACSSWIKTAEAHATNRGGWDT 481

Query: 129 DKRLESGYEAVPTRDIHMSQV-GLERHWLQILKDYVRPLQELVFTGYYHNPPASIMNFVV 187
           D+     +++V T D+ + ++  + R W  I    + P  +  F             F+V
Sbjct: 482 DR-----HKSVATTDLPIHEIPSVLREWNLIFGQIIGPFIQERFRVDGDTNLRVHDAFIV 536

Query: 188 RY-RPDEQPSLRPHHDSSTYTINLALNTPNVDYEGGGCRFIRYNCSVRNTKKGWLLMHPG 246
           +Y   D Q  L  H D   ++I L+LN P + Y+GGG  F  +   VR  K G  +    
Sbjct: 537 KYDASDGQCQLPVHTDQGHFSITLSLNDP-IQYKGGGTIFPEHEFIVR-PKCGDFVAFRS 594

Query: 247 RLTHYHEGLLVTKGTRYIMISFV 269
            LTH   G+ +T G RYI+++F+
Sbjct: 595 YLTH--GGVPITSGVRYIVVAFL 615


>UniRef50_Q01F56 Cluster: SmkH; n=2; Ostreococcus tauri|Rep: SmkH -
           Ostreococcus tauri
          Length = 637

 Score = 58.0 bits (134), Expect = 2e-07
 Identities = 45/164 (27%), Positives = 77/164 (46%), Gaps = 12/164 (7%)

Query: 109 CDEWIAIMEAYGKWSDGTNNDKRLESGYEAVPTRDIHMSQV-GLERHWLQILKDYVRPLQ 167
           C  W+   E+  +   G +  +     ++AVPT D+ + ++ G+   W ++    + P  
Sbjct: 480 CPSWVEAAESVARSRGGWDTAR-----HKAVPTTDLPIHEIPGVMEQWNRLFSVVISPFI 534

Query: 168 ELVFTGYYHNPPASIMN-FVVRYRPDE-QPSLRPHHDSSTYTINLALNTPNVDYEGGGCR 225
              F          + + FVV+Y  +E Q  L  H D   +++ LAL+    DY GGG  
Sbjct: 535 RDRFRLPTSFGTLYVHDAFVVKYNANEGQRELPVHTDQGQFSLTLALHDTQ-DYSGGGTI 593

Query: 226 FIRYNCSVRNTKKGWLLMHPGRLTHYHEGLLVTKGTRYIMISFV 269
           F  + C VR  + G  +     LTH   G+ +T G RYI+++F+
Sbjct: 594 FPEHECIVR-PRCGDFVAFRSSLTH--GGVPITAGVRYIVVAFL 634


>UniRef50_A7SXT6 Cluster: Predicted protein; n=3; Nematostella
           vectensis|Rep: Predicted protein - Nematostella
           vectensis
          Length = 344

 Score = 56.0 bits (129), Expect = 1e-06
 Identities = 44/179 (24%), Positives = 86/179 (48%), Gaps = 21/179 (11%)

Query: 97  DVYWFPLMSDRFCDEWIAIMEAYGKWSDGTNNDKRLESGYEAVPTRDIHMSQVGLERHWL 156
           +VY  P+ ++ FC+++I  +E +   SD         + Y  +      +S +G + H++
Sbjct: 134 EVYRLPVFTESFCEQFIEELEHFES-SDVPRGRPNTMNNYGVL------LSDLGFDEHFI 186

Query: 157 QILK-DYVRPLQELVFTGYYHNPPASIMNFVVRYRPDEQPSLRPHHDSSTYTINLALNTP 215
             L+ +Y++P+  L+F  +  +   S   F V Y P +   L  H+D++  T+++ L   
Sbjct: 187 NPLRREYLQPITALLFPQWGGDGLDSHKAFTVHYMPGKDTELSYHYDNAEVTLSVCLGR- 245

Query: 216 NVDYEGGGCRF--IRY------NCSVRNTKKGWLLMHPGRLTHYHEGLLVTKGTRYIMI 266
             ++ GG   F  +R        C+    +  + L+H G+  H H  L  T+G+RY +I
Sbjct: 246 --EFSGGDLYFGDMRQVLLEDTQCTEVENRPTYGLLHRGQ--HMHGALPTTQGSRYNLI 300


>UniRef50_Q15SJ2 Cluster: 2OG-Fe(II) oxygenase; n=6;
           Proteobacteria|Rep: 2OG-Fe(II) oxygenase -
           Pseudoalteromonas atlantica (strain T6c / BAA-1087)
          Length = 306

 Score = 50.4 bits (115), Expect = 5e-05
 Identities = 29/117 (24%), Positives = 53/117 (45%), Gaps = 4/117 (3%)

Query: 157 QILKDYVRPLQELVFTGYYHNPPASIMNFVVRYRPDEQPSLRPHHDSSTYTINLALNTPN 216
           ++L  Y+RP+  L+F             F + Y+P+   S+RPH D+S  T+N+ LN P+
Sbjct: 158 EMLDRYMRPIARLLFPDIV-GYDTQTFGFSIHYKPNTDTSIRPHTDASAVTLNINLNLPD 216

Query: 217 VDYEGGGCRFIRYNCSVR---NTKKGWLLMHPGRLTHYHEGLLVTKGTRYIMISFVD 270
             + G    F             K G  ++H G + H  + +   + T +++  + D
Sbjct: 217 EVFTGSNVDFYDPTTGKMIGLAFKPGSAMIHRGNVVHAAQPITSGERTNFVLWLYGD 273


>UniRef50_Q26DQ8 Cluster: Oxidoreductase, 20G-Fe(II) oxygenase
           superfamily; n=2; Flavobacteria|Rep: Oxidoreductase,
           20G-Fe(II) oxygenase superfamily - Flavobacteria
           bacterium BBFL7
          Length = 320

 Score = 46.4 bits (105), Expect = 8e-04
 Identities = 22/87 (25%), Positives = 42/87 (48%), Gaps = 2/87 (2%)

Query: 140 PTRDIHMSQVGLERHWLQILKDYVRPLQELVFTGYYHNPPASIMNFVVRYRPDEQPSLRP 199
           P  + H++    +  +  I+  Y+RP+  L+   Y  +       F ++Y PD+   L  
Sbjct: 142 PRSEGHLAAPNFQSFYNTIMDRYMRPIARLLLGTYGFDNQT--FGFSIQYNPDKDKDLHA 199

Query: 200 HHDSSTYTINLALNTPNVDYEGGGCRF 226
           H D+S  T+N+ +N P+ ++ G    F
Sbjct: 200 HTDASAATLNININLPDEEFTGSQVDF 226


>UniRef50_A2ZPZ8 Cluster: Putative uncharacterized protein; n=2;
           Oryza sativa|Rep: Putative uncharacterized protein -
           Oryza sativa subsp. japonica (Rice)
          Length = 366

 Score = 45.2 bits (102), Expect = 0.002
 Identities = 49/196 (25%), Positives = 79/196 (40%), Gaps = 18/196 (9%)

Query: 84  NFEEDRKHLM--PCTDVYWFPLMSDRFCDEWIAIMEAYGKWSDGTNNDKRLESGYEAVPT 141
           N EE    +M  P   V+ FP++   FC   ++ +  + +W+   N      +  +    
Sbjct: 128 NTEESITSIMMEPAPGVFAFPMLKPSFCQMLMSEVNNFLRWAQSANQRIMRPTSLDR-HG 186

Query: 142 RDIHMSQVGLERHWLQILKDYVRPLQELVFTGYYHNPPASIMNFVVRYRPDEQPSLRPHH 201
           R   +S  GL+     ++KD++ P+  ++F     N   S   FV+ Y   E    R  H
Sbjct: 187 RGAALSDFGLQEMLDNLMKDFISPMSTVLFPEVGGNTLDSHHTFVLEY--GEADGARGFH 244

Query: 202 -DSSTYTINLAL--NTPNVDYEGGGCRFIRYNCS--------VRNTKKGWLLMHPGRLTH 250
            D S  T+N+ L  +    D    G R   +  S        V     G +L+H G  +H
Sbjct: 245 VDDSEVTLNICLGKHFTGADMYFRGIRCGNHVNSGTHDEEYFVHPNVPGQVLLHHG--SH 302

Query: 251 YHEGLLVTKGTRYIMI 266
            H    VT G R  M+
Sbjct: 303 RHGVFSVTSGRRVNMV 318


>UniRef50_A3KGZ2 Cluster: 2-oxoglutarate and iron-dependent
           oxygenase domain-containing protein 2; n=8;
           Deuterostomia|Rep: 2-oxoglutarate and iron-dependent
           oxygenase domain-containing protein 2 - Danio rerio
           (Zebrafish) (Brachydanio rerio)
          Length = 345

 Score = 44.4 bits (100), Expect = 0.003
 Identities = 43/175 (24%), Positives = 80/175 (45%), Gaps = 15/175 (8%)

Query: 98  VYWFPLMSDRFCDEWIAIMEAYGKWSDGTNNDKRLESGYEAVPTRDIHMSQVGLERHWLQ 157
           V+ F +    FC + +  +E + + SD         + Y  V      ++++G +  ++ 
Sbjct: 131 VFRFQVFRKEFCKDLLEELEHFEQ-SDAPKGRPNTMNNYGIV------LNELGFDEGFIT 183

Query: 158 ILKD-YVRPLQELVFTGYYHNPPASIMNFVVRYRPDEQPSLRPHHDSSTYTINLALNTPN 216
            L++ Y+RPL  L+++    N   S   FVV+Y   E  +L  H+D+S  T+N++L    
Sbjct: 184 PLREVYLRPLTALLYSDCGGNCLDSHKAFVVKYDMHEDLNLSYHYDNSEVTLNVSLGKDF 243

Query: 217 VDYE--GGGCRFI---RYNCSVRNTKKGWLLMHPGRLTHYHEGLLVTKGTRYIMI 266
            +     G  R +      C     +    L+H G+  H H  L ++ GTR+ +I
Sbjct: 244 TEGNLFFGDMRQVPLSETECVEVEHRVTEGLLHRGQ--HMHGALSISSGTRWNLI 296


>UniRef50_A3TG91 Cluster: Putative uncharacterized protein; n=1;
           Janibacter sp. HTCC2649|Rep: Putative uncharacterized
           protein - Janibacter sp. HTCC2649
          Length = 711

 Score = 43.2 bits (97), Expect = 0.007
 Identities = 25/86 (29%), Positives = 40/86 (46%), Gaps = 2/86 (2%)

Query: 185 FVVRYRPDEQPSLRPHHDSSTYTINLALNTPNVDYEGGGCRFIRYNCSVRNTKKGWLLMH 244
           FV  +    +P +  H D S +T+N+ L  P+    G     +     V   ++GW + H
Sbjct: 612 FVRHFSERTRPFIPFHPDDSHWTVNVPLEDPDQTSGGELVMLLDGGLRVVERRRGWAISH 671

Query: 245 PGRLTHYHEGLLVTKGTRYIMISFVD 270
           PG L H      VT G R+ +I+F +
Sbjct: 672 PGALIHGVR--RVTHGDRWSLIAFYE 695


>UniRef50_Q55BW6 Cluster: Putative uncharacterized protein; n=3;
           Dictyostelium discoideum|Rep: Putative uncharacterized
           protein - Dictyostelium discoideum AX4
          Length = 242

 Score = 43.2 bits (97), Expect = 0.007
 Identities = 31/131 (23%), Positives = 59/131 (45%), Gaps = 10/131 (7%)

Query: 96  TDVYWFPLMSDRFCDEWIAIMEAYGKWSDGTNNDKRLESGYEAVPTRDIHMSQVGLERHW 155
           T +Y F + +  FC + +  +E +      T     + + Y AV      + ++G    +
Sbjct: 58  TRIYSFRIFTMEFCTKLLEEIENFKNTGLPTARPNSMNN-YGAV------LDEMGFTEFF 110

Query: 156 LQILKDYVRPLQELVFTGYYHNPPASIMNFVVRYRPDEQPSLRPHHDSSTYTINLALNTP 215
            Q+ +DY+     +++  Y      S   F V+Y+ D++  L  H+D S  T+NL L + 
Sbjct: 111 KQLREDYLSLFTSILYKDYNGEKLNSHHAFAVQYKMDKEKELGFHYDESDITVNLCLGS- 169

Query: 216 NVDYEGGGCRF 226
             ++ GG   F
Sbjct: 170 --EFTGGSLYF 178


>UniRef50_Q4RGA7 Cluster: Chromosome 12 SCAF15104, whole genome
           shotgun sequence; n=2; Tetraodontidae|Rep: Chromosome 12
           SCAF15104, whole genome shotgun sequence - Tetraodon
           nigroviridis (Green puffer)
          Length = 593

 Score = 41.1 bits (92), Expect = 0.029
 Identities = 39/175 (22%), Positives = 76/175 (43%), Gaps = 15/175 (8%)

Query: 98  VYWFPLMSDRFCDEWIAIMEAYGKWSDGTNNDKRLESGYEAVPTRDIHMSQVGLERHWLQ 157
           VY FP+    FC++ +  +E + + S        +           + ++++GL+  ++ 
Sbjct: 381 VYRFPVFEKSFCEQLLEELEHFEQSSAPKGRPNTMNR-------HGVLLNELGLDEGFIT 433

Query: 158 ILKD-YVRPLQELVFTGYYHNPPASIMNFVVRYRPDEQPSLRPHHDSSTYTINLALNTPN 216
            L++ Y++P+  L++         S   FVV+Y   E   L  H+D++  T+N ++    
Sbjct: 434 PLREHYLQPVSALLYPECGGGRLDSHKAFVVKYDMKEDVELSYHYDNAEVTLNASIGKEF 493

Query: 217 VD---YEGG--GCRFIRYNCSVRNTKKGWLLMHPGRLTHYHEGLLVTKGTRYIMI 266
            D   Y G           CS    +    L+H G+  H H  L ++ G R+ +I
Sbjct: 494 TDGNLYFGALKQVPLGEAECSEVEHRVSEGLLHRGQ--HMHGALPISSGQRWNLI 546



 Score = 39.5 bits (88), Expect = 0.089
 Identities = 34/166 (20%), Positives = 73/166 (43%), Gaps = 14/166 (8%)

Query: 59  HPDMYQLFENNMEWTTRYLHPEYLANFEEDRKHLMPCTD-----VYWFPLMSDRFCDEWI 113
           HP +Y L E  +    R +  +Y    +  ++ L+   +     VY FP+    FC++ +
Sbjct: 30  HPQVYNLQERYLAPRFRQI-VQYCQRTDATKEGLLAFLEEAAVGVYRFPVFEKSFCEQLL 88

Query: 114 AIMEAYGKWSDGTNNDKRLESGYEAVPTRDIHMSQVGLERHWLQILKD-YVRPLQELVFT 172
             +E + + S        +           + ++++GL+  ++  L++ Y++P+  L++ 
Sbjct: 89  EELEHFEQSSAPKGRPNTMNR-------HGVLLNELGLDEGFITPLREHYLQPVSALLYP 141

Query: 173 GYYHNPPASIMNFVVRYRPDEQPSLRPHHDSSTYTINLALNTPNVD 218
                   S   FVV+Y   E   L  H+D++  T+N ++     D
Sbjct: 142 ECGGGRLDSHKAFVVKYDMKEDVELSYHYDNAEVTLNASIGKEFTD 187


>UniRef50_Q9LV19 Cluster: Gb|AAB72163.1; n=5; Magnoliophyta|Rep:
           Gb|AAB72163.1 - Arabidopsis thaliana (Mouse-ear cress)
          Length = 394

 Score = 41.1 bits (92), Expect = 0.029
 Identities = 34/142 (23%), Positives = 62/142 (43%), Gaps = 11/142 (7%)

Query: 84  NFEEDRKHLM--PCTDVYWFPLMSDRFCDEWIAIMEAYGKWSDGTN---NDKRLESGYEA 138
           N EE  ++++  P   V+ F ++   FC+  +A ++ + +W   T          + Y A
Sbjct: 153 NTEESFRNIISEPSPGVFVFDMLQPSFCEMMLAEIDNFERWVGETKFRIMRPNTMNKYGA 212

Query: 139 VPTRDIHMSQVGLERHWLQILKDYVRPLQELVFTGYYHNPPASIMNFVVRYRPDEQPSLR 198
           V      +   GL+    ++++ ++RP+ ++ F+        S   FVV Y  D    L 
Sbjct: 213 V------LDDFGLDTMLDKLMEGFIRPISKVFFSDVGGATLDSHHGFVVEYGKDRDVDLG 266

Query: 199 PHHDSSTYTINLALNTPNVDYE 220
            H D S  T+N+ L    V  E
Sbjct: 267 FHVDDSEVTLNVCLGNQFVGGE 288


>UniRef50_Q58LI8 Cluster: Possible dioxygenase; n=1; Cyanophage
           P-SSM4|Rep: Possible dioxygenase - Cyanophage P-SSM4
          Length = 196

 Score = 38.7 bits (86), Expect = 0.16
 Identities = 20/55 (36%), Positives = 28/55 (50%), Gaps = 1/55 (1%)

Query: 216 NVDYEGGGCRFIRYNCSVRNTKKGWLLMHPGRLTHYHEGLLVTKGTRYIMISFVD 270
           N D+EGG   F+ Y       K+G +L+ P   TH H GL    G +YI  S+ +
Sbjct: 139 NDDFEGGETEFL-YQHKRFKPKRGQVLIWPAGFTHTHRGLPPLDGAKYISTSWTE 192


>UniRef50_Q46JM1 Cluster: Putative uncharacterized protein; n=1;
           Prochlorococcus marinus str. NATL2A|Rep: Putative
           uncharacterized protein - Prochlorococcus marinus
           (strain NATL2A)
          Length = 199

 Score = 36.3 bits (80), Expect = 0.83
 Identities = 16/51 (31%), Positives = 27/51 (52%), Gaps = 1/51 (1%)

Query: 220 EGGGCRFIRYNCSVRNTKKGWLLMHPGRLTHYHEGLLVTKGTRYIMISFVD 270
           EGG   F  Y+  +   +KG  L+ P   TH H G ++  G +YI+  +++
Sbjct: 147 EGGSTYFSHYDLEIE-PRKGLTLIWPAEWTHAHRGNILKAGKKYIITGWIN 196


>UniRef50_Q58MP6 Cluster: Dioxygenase; n=1; Cyanophage P-SSM2|Rep:
           Dioxygenase - Cyanophage P-SSM2
          Length = 197

 Score = 35.9 bits (79), Expect = 1.1
 Identities = 21/68 (30%), Positives = 34/68 (50%), Gaps = 1/68 (1%)

Query: 203 SSTYTINLALNTPNVDYEGGGCRFIRYNCSVRNTKKGWLLMHPGRLTHYHEGLLVTKGTR 262
           +S Y   + L   N D+EGG   F+  N  ++  + G  ++ P   TH H G     G++
Sbjct: 122 TSPYRQLVTLLYLNDDFEGGETEFLYQNVRIK-PQAGKFIIFPPFWTHTHRGNPPIGGSK 180

Query: 263 YIMISFVD 270
           YI+ S+ D
Sbjct: 181 YIITSWAD 188


>UniRef50_Q5UNV6 Cluster: Uncharacterized protein R699; n=1;
           Acanthamoeba polyphaga mimivirus|Rep: Uncharacterized
           protein R699 - Mimivirus
          Length = 455

 Score = 35.9 bits (79), Expect = 1.1
 Identities = 14/30 (46%), Positives = 21/30 (70%)

Query: 19  HDSDMAFCASLRELNIFMYVSNEEDFGHLV 48
           +D DM  C SLR+  IFMY+ N  ++G++V
Sbjct: 426 NDKDMDLCFSLRKHTIFMYMINNNNYGYMV 455


>UniRef50_Q6N063 Cluster: 2-oxoglutarate and iron-dependent
           oxygenase domain-containing protein 2; n=13;
           Amniota|Rep: 2-oxoglutarate and iron-dependent oxygenase
           domain-containing protein 2 - Homo sapiens (Human)
          Length = 350

 Score = 35.9 bits (79), Expect = 1.1
 Identities = 27/116 (23%), Positives = 55/116 (47%), Gaps = 8/116 (6%)

Query: 98  VYWFPLMSDRFCDEWIAIMEAYGKWSDGTNNDKRLESGYEAVPTRDIHMSQVGLERHWLQ 157
           +Y  P+ +  FC   +  +E + + SD         + Y  +      + ++GL+   + 
Sbjct: 139 IYRVPVFTAPFCQALLEELEHFEQ-SDMPKGRPNTMNNYGVL------LHELGLDEPLMT 191

Query: 158 ILKD-YVRPLQELVFTGYYHNPPASIMNFVVRYRPDEQPSLRPHHDSSTYTINLAL 212
            L++ +++PL  L++         S   FVV+Y P +   L  H+D++  T+N+AL
Sbjct: 192 PLRERFLQPLMALLYPDCGGGRLDSHRAFVVKYAPGQDLELGCHYDNAELTLNVAL 247


>UniRef50_UPI0000F21643 Cluster: PREDICTED: similar to pol
            polyprotein; n=24; Danio rerio|Rep: PREDICTED: similar to
            pol polyprotein - Danio rerio
          Length = 1836

 Score = 34.7 bits (76), Expect = 2.5
 Identities = 21/72 (29%), Positives = 38/72 (52%), Gaps = 8/72 (11%)

Query: 102  PLMSDRFCDEWIAIMEAYGKWSDGTNN---DKRLESGYEAVPTRDIHMSQVGLERHWL-- 156
            P++++R C+EW+   +     +D   N   DK L+  YE    + ++ + V   R WL  
Sbjct: 1449 PVLAERDCEEWLRFYKELKITADVNTNKSKDKELKKCYEC---QVVYGTTVSFARDWLTW 1505

Query: 157  QILKDYVRPLQE 168
            ++L+  VRP +E
Sbjct: 1506 RVLRQNVRPKRE 1517


>UniRef50_A5PBA9 Cluster: Oxidoreductase, 2OG-Fe(II) oxygenase
           family protein; n=2; Sphingomonadales|Rep:
           Oxidoreductase, 2OG-Fe(II) oxygenase family protein -
           Erythrobacter sp. SD-21
          Length = 363

 Score = 34.7 bits (76), Expect = 2.5
 Identities = 22/77 (28%), Positives = 34/77 (44%), Gaps = 7/77 (9%)

Query: 198 RPHHDSSTY-----TINLALNTPNVDYEGGGCRFIRYNCSVRNTKKGWLLMHPGRLTHYH 252
           RPH D++T+        + +N    +YEGG  RF  +         G  ++    L H  
Sbjct: 261 RPHRDNTTFGTAHRRFAVTVNLNAEEYEGGNLRFPEFGQRTYRAPTGGAVVFSCSLLH-- 318

Query: 253 EGLLVTKGTRYIMISFV 269
           E   VT+G RY  + F+
Sbjct: 319 EATPVTRGERYAFLPFL 335


>UniRef50_P20792 Cluster: Cell surface receptor daf-1 precursor;
           n=2; Caenorhabditis elegans|Rep: Cell surface receptor
           daf-1 precursor - Caenorhabditis elegans
          Length = 669

 Score = 34.7 bits (76), Expect = 2.5
 Identities = 41/143 (28%), Positives = 65/143 (45%), Gaps = 22/143 (15%)

Query: 67  ENNMEWTTRYLHPEYL-ANFEEDRKHLMPCTDVYWFPL-MSDRFC---DEWIAIMEA--- 118
           EN    T RYL PE L +  +        C DVY F L M +  C   D  +   EA   
Sbjct: 460 ENYKCGTVRYLAPEILNSTMQFTVFESYQCADVYSFSLVMWETLCRCEDGDVLPREAATV 519

Query: 119 --YGKWSDGTNNDKRLESGYEAVPTRDIHMSQVGLERHWLQILKDY--VRPLQELVFTGY 174
             Y +W+D    D ++   ++ V TR +  ++  L   W    KD+  ++ + E++ T +
Sbjct: 520 IPYIEWTDRDPQDAQM---FDVVCTRRLRPTENPL---W----KDHPEMKHIMEIIKTCW 569

Query: 175 YHNPPASIMNFVVRYRPDEQPSL 197
             NP A   +++ R R DE+  L
Sbjct: 570 NGNPSARFTSYICRKRMDERQQL 592


>UniRef50_UPI00015A7ABE Cluster: UPI00015A7ABE related cluster; n=8;
           Danio rerio|Rep: UPI00015A7ABE UniRef100 entry - Danio
           rerio
          Length = 563

 Score = 34.3 bits (75), Expect = 3.4
 Identities = 36/169 (21%), Positives = 68/169 (40%), Gaps = 8/169 (4%)

Query: 55  ITKTHPDMYQLFENNMEWTTRYLHPEYLANFEEDRKHLMPCTDVYWFPLMSDRFCD-EWI 113
           + KT  DM QL ++ M+      H   L   + + K    C +V+   + S   C  E +
Sbjct: 180 LMKTQTDMQQLIQDRMKMIKEIQHSVELRK-KNNEKEKADCVEVFADLMRSIERCQRELL 238

Query: 114 AIMEAYGKWSDGTNND--KRLESGYEAVPTRDIHMSQVGLER---HWLQILKDYVRPLQE 168
            + E   K ++    +  K LE     +  R+  + ++       H LQ+     RPL  
Sbjct: 239 EVTEQKQKAAEKQAEELIKELEQEISELRRRNTELEELSHTEDHLHLLQMFPSLCRPLDI 298

Query: 169 LVFTGYYHNPPASIMNFVVRYRPDEQPSLRPHHDSSTYTINLALNTPNV 217
            ++TG   N   S+   +       Q ++    +++ Y++   LN  N+
Sbjct: 299 KIWTGININTGVSV-ETLRSALSQLQETIDEGFNNNEYSLKTILNIQNL 346


>UniRef50_Q4JN23 Cluster: Putative uncharacterized protein; n=1;
           uncultured bacterium BAC13K9BAC|Rep: Putative
           uncharacterized protein - uncultured bacterium
           BAC13K9BAC
          Length = 199

 Score = 34.3 bits (75), Expect = 3.4
 Identities = 16/54 (29%), Positives = 31/54 (57%), Gaps = 1/54 (1%)

Query: 216 NVDYEGGGCRFIRYNCSVRNTKKGWLLMHPGRLTHYHEGLLVTKGTRYIMISFV 269
           +V+  GG   F+     V+  ++G L++ P   TH H G+ + KG++YI  +++
Sbjct: 144 DVEGPGGETEFLHQKVKVK-PEEGKLVVFPPFWTHEHRGVTLKKGSKYIATTWI 196


>UniRef50_A0BE00 Cluster: Chromosome undetermined scaffold_101,
           whole genome shotgun sequence; n=2; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_101,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 643

 Score = 33.9 bits (74), Expect = 4.4
 Identities = 21/76 (27%), Positives = 33/76 (43%), Gaps = 4/76 (5%)

Query: 16  QEGHDSDMAFCASLRELNIFMY--VSNEEDFGHLVNPE--TYDITKTHPDMYQLFENNME 71
           +EGH +    C +L  ++ F+Y  V N    G   NP   T ++    P+ Y+  E N  
Sbjct: 30  EEGHKTLFGACLTLGLISFFLYLLVINLYTLGQRDNPTSLTTEVYHAQPEYYKFNEQNFT 89

Query: 72  WTTRYLHPEYLANFEE 87
            T     P+Y    +E
Sbjct: 90  LTFAIQSPDYATYIDE 105


>UniRef50_UPI0000DB7621 Cluster: PREDICTED: similar to DNA ligase 3
           (DNA ligase III) (Polydeoxyribonucleotide synthase [ATP]
           3); n=2; Apocrita|Rep: PREDICTED: similar to DNA ligase
           3 (DNA ligase III) (Polydeoxyribonucleotide synthase
           [ATP] 3) - Apis mellifera
          Length = 1009

 Score = 33.5 bits (73), Expect = 5.9
 Identities = 14/34 (41%), Positives = 20/34 (58%)

Query: 130 KRLESGYEAVPTRDIHMSQVGLERHWLQILKDYV 163
           K L  G E +  +DIH      +RHWL++ KDY+
Sbjct: 563 KILNMGLEGLVLKDIHSKYEPGKRHWLKVKKDYL 596


>UniRef50_Q8DKV0 Cluster: Tlr0755 protein; n=1; Synechococcus
           elongatus|Rep: Tlr0755 protein - Synechococcus elongatus
           (Thermosynechococcus elongatus)
          Length = 197

 Score = 33.5 bits (73), Expect = 5.9
 Identities = 16/54 (29%), Positives = 28/54 (51%), Gaps = 1/54 (1%)

Query: 216 NVDYEGGGCRFIRYNCSVRNTKKGWLLMHPGRLTHYHEGLLVTKGTRYIMISFV 269
           N D++GG   F R    +   + G +++ P   TH H  L V +GT+Y   +++
Sbjct: 143 NEDFQGGETYFDRQGVKI-TPRTGDIVVFPAYYTHPHAALPVVQGTKYAFATWL 195


>UniRef50_Q5LRQ3 Cluster: TPR domain protein; n=4; cellular
           organisms|Rep: TPR domain protein - Silicibacter
           pomeroyi
          Length = 557

 Score = 33.5 bits (73), Expect = 5.9
 Identities = 15/58 (25%), Positives = 25/58 (43%)

Query: 43  DFGHLVNPETYDITKTHPDMYQLFENNMEWTTRYLHPEYLANFEEDRKHLMPCTDVYW 100
           D+G+ +   +  +T   P+    F+  + WT  Y H E  A F    +H   C   +W
Sbjct: 5   DYGYDLGQYSCPVTTAAPEAQLWFDRGLIWTYGYNHAEAAACFRRALEHDPDCAMAHW 62


>UniRef50_Q5GQB2 Cluster: Putative uncharacterized protein; n=1;
           Cyanophage phage S-PM2|Rep: Putative uncharacterized
           protein - Cyanophage phage S-PM2
          Length = 238

 Score = 33.5 bits (73), Expect = 5.9
 Identities = 16/45 (35%), Positives = 25/45 (55%), Gaps = 1/45 (2%)

Query: 221 GGGCRFIRYNCSVRNTKKGWLLMHPGRLTHYHEGLLVTKGTRYIM 265
           GG   F+  +  +  TK G +++ P  +TH H G  V KG +YI+
Sbjct: 185 GGETEFLYQHKRISPTK-GTVVVFPAGMTHVHRGNTVLKGNKYIV 228


>UniRef50_UPI0000DB7C47 Cluster: PREDICTED: similar to fibroblast
          growth factor receptor substrate 2; n=1; Apis
          mellifera|Rep: PREDICTED: similar to fibroblast growth
          factor receptor substrate 2 - Apis mellifera
          Length = 483

 Score = 33.1 bits (72), Expect = 7.7
 Identities = 16/45 (35%), Positives = 26/45 (57%), Gaps = 1/45 (2%)

Query: 33 NIFMYVSNEEDFGHLVNPETYDITKTHPDMYQLFENNMEWTTRYL 77
          NIF  V N +D G+L+ P   ++T+T   +YQ  +  ++W  R L
Sbjct: 16 NIFQ-VMNVDDLGNLITPGRLEVTETDIVLYQRGKQPIKWPLRCL 59


>UniRef50_Q0HJW2 Cluster: Prolyl 4-hydroxylase, alpha subunit; n=9;
           Shewanella|Rep: Prolyl 4-hydroxylase, alpha subunit -
           Shewanella sp. (strain MR-4)
          Length = 231

 Score = 33.1 bits (72), Expect = 7.7
 Identities = 20/70 (28%), Positives = 33/70 (47%), Gaps = 1/70 (1%)

Query: 200 HHDSSTYTINLALNTPNVDYEGGGCRFIRYNCSVRNTKKGWLLMHPGRLTHYHEGLLVTK 259
           +H+ + + + L +   N   EGG   F  Y     + KKG +++ P   TH H G +   
Sbjct: 150 NHNEALHRVVLYMFYLNDVEEGGETEFY-YQQRKISPKKGTMVIAPAGFTHSHRGNMPIS 208

Query: 260 GTRYIMISFV 269
             +YI  S+V
Sbjct: 209 NDKYIATSWV 218


>UniRef50_A3WSE0 Cluster: Putative uncharacterized protein; n=1;
           Nitrobacter sp. Nb-311A|Rep: Putative uncharacterized
           protein - Nitrobacter sp. Nb-311A
          Length = 289

 Score = 33.1 bits (72), Expect = 7.7
 Identities = 21/88 (23%), Positives = 41/88 (46%), Gaps = 2/88 (2%)

Query: 4   FRAPKAKAVTYVQEGHDSDMAFCASLRELNI--FMYVSNEEDFGHLVNPETYDITKTHPD 61
           FR+ +AKA+ +    H + +A C  + E+ +  F+  SN+++     NP+   I      
Sbjct: 61  FRSKQAKALHFADLSHPNKVAVCRKISEMELRCFVVASNKKNMEGYTNPDAAKIPSQCWF 120

Query: 62  MYQLFENNMEWTTRYLHPEYLANFEEDR 89
              +    +E  TR++    + +F E R
Sbjct: 121 YCWMTRVLLERVTRFVLYRSMLDFGEPR 148


>UniRef50_Q6ZD92 Cluster: Proline-rich protein-like; n=3; Oryza
           sativa|Rep: Proline-rich protein-like - Oryza sativa
           subsp. japonica (Rice)
          Length = 451

 Score = 33.1 bits (72), Expect = 7.7
 Identities = 16/59 (27%), Positives = 29/59 (49%), Gaps = 1/59 (1%)

Query: 123 SDGTNNDKRLESGYEAVPTRDIHMSQVGLERHW-LQILKDYVRPLQELVFTGYYHNPPA 180
           +D     K +E  + A  +R +H ++ G+E  W L +L D+ R   + V   +  +P A
Sbjct: 211 NDREKQIKAIEDSFRAAKSRPVHQTKRGMEAEWVLPLLPDFDRYDDQFVMVNFDGDPTA 269


  Database: uniref50
    Posted date:  Oct 5, 2007 11:19 AM
  Number of letters in database: 575,637,011
  Number of sequences in database:  1,657,284
  
Lambda     K      H
   0.322    0.138    0.446 

Gapped
Lambda     K      H
   0.279   0.0580    0.190 


Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 346,879,650
Number of Sequences: 1657284
Number of extensions: 15223783
Number of successful extensions: 27410
Number of sequences better than 10.0: 40
Number of HSP's better than 10.0 without gapping: 19
Number of HSP's successfully gapped in prelim test: 21
Number of HSP's that attempted gapping in prelim test: 27361
Number of HSP's gapped (non-prelim): 41
length of query: 271
length of database: 575,637,011
effective HSP length: 99
effective length of query: 172
effective length of database: 411,565,895
effective search space: 70789333940
effective search space used: 70789333940
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.9 bits)
S2: 72 (33.1 bits)
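
Note: the footer figures above are mutually consistent, and the short sketch below shows how they appear to relate. It assumes the standard Karlin-Altschul relations (bit score = (Lambda*S - ln K) / ln 2, and E = search space * 2^-bits) and uses the gapped Lambda and K printed in this report. The function and constant names are illustrative, not part of BLAST. Because Lambda and K are printed to only three significant figures, the E values are reproduced only approximately.

```python
import math

# Values copied from the statistics footer of this report.
QUERY_LEN = 271
DB_LETTERS = 575_637_011
DB_SEQS = 1_657_284
EFF_HSP_LEN = 99
GAPPED_LAMBDA = 0.279   # rounded as printed
GAPPED_K = 0.0580       # rounded as printed


def effective_search_space():
    """Return (effective query length, effective db length, search space)."""
    eff_query = QUERY_LEN - EFF_HSP_LEN
    # Each database sequence is shortened by the effective HSP length.
    eff_db = DB_LETTERS - DB_SEQS * EFF_HSP_LEN
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score):
    """Convert a raw alignment score to bits (Karlin-Altschul)."""
    return (GAPPED_LAMBDA * raw_score - math.log(GAPPED_K)) / math.log(2)


def expect(raw_score):
    """Approximate E value for a raw score over the effective search space."""
    _, _, space = effective_search_space()
    return space * 2.0 ** (-bit_score(raw_score))


if __name__ == "__main__":
    print(effective_search_space())
    for raw in (72, 76):
        print(raw, round(bit_score(raw), 1), round(expect(raw), 2))
```

With these inputs the search space comes out as 172 * 411,565,895 = 70,789,333,940, matching the "effective search space" line, and raw scores 72 and 76 convert to about 33.1 and 34.7 bits, matching the Score lines in the alignments above.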
