BLASTP 2.2.12 [Aug-07-2005]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= BGIBMGA000704-TA|BGIBMGA000704-PA|IPR005123|2OG-Fe(II)
oxygenase, IPR001006|Procollagen-lysine 5-dioxygenase,
IPR006620|Prolyl 4-hydroxylase, alpha subunit
         (271 letters)

Database: uniref50
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

UniRef50_Q9VTH0 Cluster: CG6199-PA, isoform A; n=9; Coelomata|Re...   408   e-113
UniRef50_O60568 Cluster: Procollagen-lysine,2-oxoglutarate 5-dio...   343   4e-93
UniRef50_Q20679 Cluster: Procollagen-lysine,2-oxoglutarate 5-dio...   322   4e-87
UniRef50_O00469 Cluster: Procollagen-lysine,2-oxoglutarate 5-dio...   308   1e-82
UniRef50_A7S477 Cluster: Predicted protein; n=1; Nematostella ve...   303   4e-81
UniRef50_Q96AR9 Cluster: PLOD2 protein; n=3; Eutheria|Rep: PLOD2...   267   2e-70
UniRef50_UPI0000E4A230 Cluster: PREDICTED: similar to Plod-prov ...   248   9e-65
UniRef50_Q5UQC3 Cluster: Probable procollagen-lysine,2-oxoglutar...   199   7e-50
UniRef50_Q4TBD7 Cluster: Chromosome undetermined SCAF7145, whole...   189   6e-47
UniRef50_Q1VL57 Cluster: Putative uncharacterized protein; n=1; ...    68   3e-10
UniRef50_A4RT30 Cluster: Protein Lysyl hydroxylase fusion protei...    60   4e-08
UniRef50_Q01F56 Cluster: SmkH; n=2; Ostreococcus tauri|Rep: SmkH...    58   2e-07
UniRef50_A7SXT6 Cluster: Predicted protein; n=3; Nematostella ve...    56   1e-06
UniRef50_Q15SJ2 Cluster: 2OG-Fe(II) oxygenase; n=6; Proteobacter...    50   5e-05
UniRef50_Q26DQ8 Cluster: Oxidoreductase, 20G-Fe(II) oxygenase su...    46   8e-04
UniRef50_A2ZPZ8 Cluster: Putative uncharacterized protein; n=2; ...    45   0.002
UniRef50_A3KGZ2 Cluster: 2-oxoglutarate and iron-dependent oxyge...    44   0.003
UniRef50_A3TG91 Cluster: Putative uncharacterized protein; n=1; ...    43   0.007
UniRef50_Q55BW6 Cluster: Putative uncharacterized protein; n=3; ...    43   0.007
UniRef50_Q4RGA7 Cluster: Chromosome 12 SCAF15104, whole genome s...    41   0.029
UniRef50_Q9LV19 Cluster: Gb|AAB72163.1; n=5; Magnoliophyta|Rep: ...    41   0.029
UniRef50_Q58LI8 Cluster: Possible dioxygenase; n=1; Cyanophage P...    39   0.16
UniRef50_Q46JM1 Cluster: Putative uncharacterized protein; n=1; ...    36   0.83
UniRef50_Q58MP6 Cluster: Dioxygenase; n=1; Cyanophage P-SSM2|Rep...    36   1.1
UniRef50_Q5UNV6 Cluster: Uncharacterized protein R699; n=1; Acan...    36   1.1
UniRef50_Q6N063 Cluster: 2-oxoglutarate and iron-dependent oxyge...    36   1.1
UniRef50_UPI0000F21643 Cluster: PREDICTED: similar to pol polypr...    35   2.5
UniRef50_A5PBA9 Cluster: Oxidoreductase, 2OG-Fe(II) oxygenase fa...    35   2.5
UniRef50_P20792 Cluster: Cell surface receptor daf-1 precursor; ...    35   2.5
UniRef50_UPI00015A7ABE Cluster: UPI00015A7ABE related cluster; n...    34   3.4
UniRef50_Q4JN23 Cluster: Putative uncharacterized protein; n=1; ...    34   3.4
UniRef50_A0BE00 Cluster: Chromosome undetermined scaffold_101, w...    34   4.4
UniRef50_UPI0000DB7621 Cluster: PREDICTED: similar to DNA ligase...    33   5.9
UniRef50_Q8DKV0 Cluster: Tlr0755 protein; n=1; Synechococcus elo...    33   5.9
UniRef50_Q5LRQ3 Cluster: TPR domain protein; n=4; cellular organ...    33   5.9
UniRef50_Q5GQB2 Cluster: Putative uncharacterized protein; n=1; ...    33   5.9
UniRef50_UPI0000DB7C47 Cluster: PREDICTED: similar to fibroblast...    33   7.7
UniRef50_Q0HJW2 Cluster: Prolyl 4-hydroxylase, alpha subunit; n=...    33   7.7
UniRef50_A3WSE0 Cluster: Putative uncharacterized protein; n=1; ...    33   7.7
UniRef50_Q6ZD92 Cluster: Proline-rich protein-like; n=3; Oryza s...
33 7.7 >UniRef50_Q9VTH0 Cluster: CG6199-PA, isoform A; n=9; Coelomata|Rep: CG6199-PA, isoform A - Drosophila melanogaster (Fruit fly) Length = 721 Score = 408 bits (1005), Expect = e-113 Identities = 174/261 (66%), Positives = 206/261 (78%) Query: 11 AVTYVQEGHDSDMAFCASLRELNIFMYVSNEEDFGHLVNPETYDITKTHPDMYQLFENNM 70 A+++ + D DMA C SLR IFMY SN FGHLVN + ++ T T PD Y LF N + Sbjct: 461 AISFKHKEFDPDMAMCESLRNAGIFMYASNLRIFGHLVNADDFNTTVTRPDFYTLFSNEI 520 Query: 71 EWTTRYLHPEYLANFEEDRKHLMPCTDVYWFPLMSDRFCDEWIAIMEAYGKWSDGTNNDK 130 +WT +Y+HP Y E K PC DVYWF ++SD FCD+ +AIMEA+ WSDG+NND Sbjct: 521 DWTEKYIHPNYSLQLNESNKIQQPCPDVYWFQIVSDAFCDDLVAIMEAHNGWSDGSNNDN 580 Query: 131 RLESGYEAVPTRDIHMSQVGLERHWLQILKDYVRPLQELVFTGYYHNPPASIMNFVVRYR 190 RLE GYEAVPTRDIHM QVGLER +L+ L+ +VRPLQE FTGY+HNPP ++MNF+VRYR Sbjct: 581 RLEGGYEAVPTRDIHMKQVGLERLYLKFLQMFVRPLQERAFTGYFHNPPRALMNFMVRYR 640 Query: 191 PDEQPSLRPHHDSSTYTINLALNTPNVDYEGGGCRFIRYNCSVRNTKKGWLLMHPGRLTH 250 PDEQPSLRPHHDSSTYTIN+A+N +DY+GGGCRFIRYNCSV +TKKGW+LMHPGRLTH Sbjct: 641 PDEQPSLRPHHDSSTYTINIAMNRAGIDYQGGGCRFIRYNCSVTDTKKGWMLMHPGRLTH 700 Query: 251 YHEGLLVTKGTRYIMISFVDP 271 YHEGLLVT GTRYIMISF+DP Sbjct: 701 YHEGLLVTNGTRYIMISFIDP 721 >UniRef50_O60568 Cluster: Procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 precursor; n=75; Euteleostomi|Rep: Procollagen-lysine,2-oxoglutarate 5-dioxygenase 3 precursor - Homo sapiens (Human) Length = 738 Score = 343 bits (842), Expect = 4e-93 Identities = 146/252 (57%), Positives = 185/252 (73%), Gaps = 1/252 (0%) Query: 20 DSDMAFCASLRELNIFMYVSNEEDFGHLVNPETYDITKTHPDMYQLFENNMEWTTRYLHP 79 D DMAFC S R+ IF+++SN+ +FG L+ YD HPD++Q+F+N ++W +Y+H Sbjct: 488 DPDMAFCKSFRDKGIFLHLSNQHEFGRLLATSRYDTEHLHPDLWQIFDNPVDWKEQYIHE 547 Query: 80 EYLANFEEDRKHLMPCTDVYWFPLMSDRFCDEWIAIMEAYGKWSDGTNNDKRLESGYEAV 139 Y E + PC DVYWFPL+S++ CDE +A ME YG+WS G + D RL GYE V Sbjct: 548 NYSRALEGEGIVEQPCPDVYWFPLLSEQMCDELVAEMEHYGQWSGGRHEDSRLAGGYENV 607 Query: 140 PTRDIHMSQVGLERHWLQILKDYVRPLQELVFTGYYHNPPASIMNFVVRYRPDEQPSLRP 199 PT DIHM QVG E WLQ+L+ YV P+ E +F 
G YH ++MNFVVRYRPDEQPSLRP Sbjct: 608 PTVDIHMKQVGYEDQWLQLLRTYVGPMTESLFPG-YHTKARAVMNFVVRYRPDEQPSLRP 666 Query: 200 HHDSSTYTINLALNTPNVDYEGGGCRFIRYNCSVRNTKKGWLLMHPGRLTHYHEGLLVTK 259 HHDSST+T+N+ALN +DYEGGGCRF+RY+C + + +KGW L+HPGRLTHYHEGL T Sbjct: 667 HHDSSTFTLNVALNHKGLDYEGGGCRFLRYDCVISSPRKGWALLHPGRLTHYHEGLPTTW 726 Query: 260 GTRYIMISFVDP 271 GTRYIM+SFVDP Sbjct: 727 GTRYIMVSFVDP 738 >UniRef50_Q20679 Cluster: Procollagen-lysine,2-oxoglutarate 5-dioxygenase precursor; n=2; Caenorhabditis|Rep: Procollagen-lysine,2-oxoglutarate 5-dioxygenase precursor - Caenorhabditis elegans Length = 730 Score = 322 bits (792), Expect = 4e-87 Identities = 145/256 (56%), Positives = 181/256 (70%), Gaps = 4/256 (1%) Query: 20 DSDMAFCASLRELNIFMYVSNEEDFGHLVNPETYDITKT----HPDMYQLFENNMEWTTR 75 D DM+ C R+ F+Y+ NE+ +G L+ + Y T T HP+M+Q+FEN W R Sbjct: 475 DPDMSMCKFARDNGHFLYIDNEKYYGFLIVSDEYAETVTEGKWHPEMWQIFENRELWEAR 534 Query: 76 YLHPEYLANFEEDRKHLMPCTDVYWFPLMSDRFCDEWIAIMEAYGKWSDGTNNDKRLESG 135 Y+HP Y E + C DVY FPLMS+RFC+E I ME +G+WSDG+NNDKRL G Sbjct: 535 YIHPGYHKIMEPEHVVDQACPDVYDFPLMSERFCEELIEEMEGFGRWSDGSNNDKRLAGG 594 Query: 136 YEAVPTRDIHMSQVGLERHWLQILKDYVRPLQELVFTGYYHNPPASIMNFVVRYRPDEQP 195 YE VPTRDIHM+QVG ER WL + YVRP+QE F GYYH P S M FVVRY+P+EQP Sbjct: 595 YENVPTRDIHMNQVGFERQWLYFMDTYVRPVQEKTFIGYYHQPVESNMMFVVRYKPEEQP 654 Query: 196 SLRPHHDSSTYTINLALNTPNVDYEGGGCRFIRYNCSVRNTKKGWLLMHPGRLTHYHEGL 255 SLRPHHD+ST++I++ALN DYEGGG R+IRYNC+V + G+ +M PGRLTH HEGL Sbjct: 655 SLRPHHDASTFSIDIALNKKGRDYEGGGVRYIRYNCTVPADEVGYAMMFPGRLTHLHEGL 714 Query: 256 LVTKGTRYIMISFVDP 271 TKGTRYIM+SF++P Sbjct: 715 ATTKGTRYIMVSFINP 730 >UniRef50_O00469 Cluster: Procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 precursor; n=47; Deuterostomia|Rep: Procollagen-lysine,2-oxoglutarate 5-dioxygenase 2 precursor - Homo sapiens (Human) Length = 737 Score = 308 bits (755), Expect = 1e-82 Identities = 129/258 (50%), Positives = 182/258 (70%), Gaps = 2/258 (0%) Query: 14 YVQEGHDSDMAFCASLRELNIFMYVSNEEDFGHLVNPETYDITKTHPDMYQLFENNMEWT 73 +V++ D DMA 
C + RE+ +FMY+SN +FG L++ Y+ + + D++Q+FEN ++W Sbjct: 482 FVRDKLDPDMALCRNAREMGVFMYISNRHEFGRLLSTANYNTSHYNNDLWQIFENPVDWK 541 Query: 74 TRYLHPEYLANFEEDRKHLMPCTDVYWFPLMSDRFCDEWIAIMEAYGKWSDGTNNDKRLE 133 +Y++ +Y F E+ PC DV+WFP+ S++ CDE + ME YGKWS G ++D R+ Sbjct: 542 EKYINRDYSKIFTENIVE-QPCPDVFWFPIFSEKACDELVEEMEHYGKWSGGKHHDSRIS 600 Query: 134 SGYEAVPTRDIHMSQVGLERHWLQILKDYVRPLQELVFTGYYHNPPASIMNFVVRYRPDE 193 GYE VPT DIHM QV LE WL +++++ P+ VF GYY A ++NFVV+Y P+ Sbjct: 601 GGYENVPTDDIHMKQVDLENVWLHFIREFIAPVTLKVFAGYYTKGFA-LLNFVVKYSPER 659 Query: 194 QPSLRPHHDSSTYTINLALNTPNVDYEGGGCRFIRYNCSVRNTKKGWLLMHPGRLTHYHE 253 Q SLRPHHD+ST+TIN+ALN D++GGGC+F+RYNCS+ + +KGW MHPGRLTH HE Sbjct: 660 QRSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGWSFMHPGRLTHLHE 719 Query: 254 GLLVTKGTRYIMISFVDP 271 GL V GTRYI +SF+DP Sbjct: 720 GLPVKNGTRYIAVSFIDP 737 >UniRef50_A7S477 Cluster: Predicted protein; n=1; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 729 Score = 303 bits (743), Expect = 4e-81 Identities = 143/267 (53%), Positives = 173/267 (64%), Gaps = 4/267 (1%) Query: 7 PKAKAVTYVQEGHDSDMAFCASLRELNIFMYVSNEEDFGHLVNPETYDITKTHPDMYQLF 66 PK K Y + D++F LR+ IFMYV+N FG L +T H D++Q+F Sbjct: 465 PKLKHA-YSYGNLEPDLSFSKYLRDNGIFMYVTNMHYFGRLKETDTVTTNHLHNDLWQIF 523 Query: 67 ENNMEWTTRYLHPEYLANFEEDRKHLMPCTDVYWFPLMSDRFCDEWIAIMEAYGKWSDGT 126 +N ++W RYLHP Y N + MPC DV+WFPLMS+ + I ME YGKWS G Sbjct: 524 DNQIDWEERYLHPNYSQNLNKSIPLKMPCNDVFWFPLMSETWATHMIEEMEHYGKWSGGK 583 Query: 127 NN--DKRLESGYEAVPTRDIHMSQVGLERHWLQILKDYVRPLQELVFTGYYHNPPASIMN 184 + D RL GYE VPT DIHM+QVG ER WL +LK Y+ P+ +F GYY A IMN Sbjct: 584 HEPQDARLNGGYENVPTVDIHMNQVGWEREWLHLLKTYIVPVNTRIFPGYYSEGRA-IMN 642 Query: 185 FVVRYRPDEQPSLRPHHDSSTYTINLALNTPNVDYEGGGCRFIRYNCSVRNTKKGWLLMH 244 FVV+Y P Q LRPHHDSSTYTIN+ LN P + Y GGG RFIR +C+V +T+ GW LMH Sbjct: 643 FVVKYTPSGQYYLRPHHDSSTYTINIGLNKPGIHYGGGGSRFIRQDCAVTDTQVGWALMH 702 Query: 245 PGRLTHYHEGLLVTKGTRYIMISFVDP 271 PGRLTHYHEGL T GTRYIM+ FVDP Sbjct: 703 PGRLTHYHEGLPTTWGTRYIMVCFVDP 729 
>UniRef50_Q96AR9 Cluster: PLOD2 protein; n=3; Eutheria|Rep: PLOD2 protein - Homo sapiens (Human) Length = 210 Score = 267 bits (654), Expect = 2e-70 Identities = 112/211 (53%), Positives = 152/211 (72%), Gaps = 2/211 (0%) Query: 61 DMYQLFENNMEWTTRYLHPEYLANFEEDRKHLMPCTDVYWFPLMSDRFCDEWIAIMEAYG 120 D++Q+FEN ++W +Y++ +Y F E+ PC DV+WFP+ S++ CDE + ME YG Sbjct: 2 DLWQIFENPVDWKEKYINRDYSKIFTENIVE-QPCPDVFWFPIFSEKACDELVEEMEHYG 60 Query: 121 KWSDGTNNDKRLESGYEAVPTRDIHMSQVGLERHWLQILKDYVRPLQELVFTGYYHNPPA 180 KWS G ++D R+ GYE VPT DIHM QV LE WL +++++ P+ VF GYY A Sbjct: 61 KWSGGKHHDSRISGGYENVPTDDIHMKQVDLENVWLHFIREFIAPVTLKVFAGYYTKGFA 120 Query: 181 SIMNFVVRYRPDEQPSLRPHHDSSTYTINLALNTPNVDYEGGGCRFIRYNCSVRNTKKGW 240 ++NFVV+Y P+ Q SLRPHHD+ST+TIN+ALN D++GGGC+F+RYNCS+ + +KGW Sbjct: 121 -LLNFVVKYSPERQRSLRPHHDASTFTINIALNNVGEDFQGGGCKFLRYNCSIESPRKGW 179 Query: 241 LLMHPGRLTHYHEGLLVTKGTRYIMISFVDP 271 MHPGRLTH HEGL V GTRYI +SF+DP Sbjct: 180 SFMHPGRLTHLHEGLPVKNGTRYIAVSFIDP 210 >UniRef50_UPI0000E4A230 Cluster: PREDICTED: similar to Plod-prov protein, partial; n=2; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to Plod-prov protein, partial - Strongylocentrotus purpuratus Length = 609 Score = 248 bits (608), Expect = 9e-65 Identities = 117/206 (56%), Positives = 143/206 (69%), Gaps = 3/206 (1%) Query: 17 EGHDSDMAFCASLRELNIFMYVSNEED-FGHLVNPETYDITKTHPDMYQLFENNMEWTTR 75 E D+DMA C LR IF+YV N ED +GH+V + Y+ T H DM++L+ N +W + Sbjct: 405 EDLDTDMAICMDLRSKGIFLYVVNMEDSYGHIVTLDNYETTHLHNDMWELWNNKEDWEAK 464 Query: 76 YLHPEYLANFEEDRKHL-MPCTDVYWFPLMSDRFCDEWIAIMEAYGKWSDGTNNDKRLES 134 YL P+Y E DR ++ MPCTDVY FPLMS + E I ME +G+WS G N DKRL Sbjct: 465 YLSPDYFVVKEMDRNNITMPCTDVYTFPLMSRTWAKELIEEMEHFGEWSGGGNQDKRLNG 524 Query: 135 GYEAVPTRDIHMSQVGLERHWLQILKDYVRPLQELVFTGYYHNPPASIMNFVVRYRPDEQ 194 GYE VPTRDIHM+Q+G E+HWL L++YV P+ E V+ GYY A IMNFVVRY+PDEQ Sbjct: 525 GYENVPTRDIHMNQIGFEQHWLYFLREYVVPICENVYPGYYSKAYA-IMNFVVRYKPDEQ 583 Query: 195 PSLRPHHDSSTYTINLALNTPNVDYE 220 SLRPHHDSSTYTIN+ALN DYE Sbjct: 584 
ASLRPHHDSSTYTINVALNERETDYE 609 >UniRef50_Q5UQC3 Cluster: Probable procollagen-lysine,2-oxoglutarate 5-dioxygenase; n=1; Acanthamoeba polyphaga mimivirus|Rep: Probable procollagen-lysine,2-oxoglutarate 5-dioxygenase - Mimivirus Length = 895 Score = 199 bits (485), Expect = 7e-50 Identities = 100/265 (37%), Positives = 151/265 (56%), Gaps = 10/265 (3%) Query: 13 TYVQEGHDSDMAFCASLRELNIFMYVSNEEDFGHL---VNPETYDITKTHPDMYQLFENN 69 +++ G + DM C +LR+ N+FMY+SN +GH+ +N E T +Y L Sbjct: 634 SHMWNGSNIDMRLCHNLRKNNVFMYLSNLRPYGHIDDSINLEVLSGVPTEVTLYDLPTRK 693 Query: 70 MEWTTRYLHPEYLANFE--EDRKHLMPCTDVYWFPLMSDRFCDEWIAIMEAYGKWSDGTN 127 EW +YLHPE+L++ + +D + C DVY FPL + FC E I +M+ WS G + Sbjct: 694 EEWEKKYLHPEFLSHLQNFKDFDYTEICNDVYSFPLFTPAFCKEVIEVMDKANLWSKGGD 753 Query: 128 N--DKRLESGYEAVPTRDIHMSQVGLERHWLQILKDYVRPLQELVFTGYYHNPPASIMNF 185 + D R+ G E+ PT+D + +VGL++ W ++ +YV P ++ Y + F Sbjct: 754 SYFDPRI-GGVESYPTQDTQLYEVGLDKQWHYVVFNYVAPFVRHLYNNY--KTKDINLAF 810 Query: 186 VVRYRPDEQPSLRPHHDSSTYTINLALNTPNVDYEGGGCRFIRYNCSVRNTKKGWLLMHP 245 VV+Y + Q L PHHDSSTYT+N+ALN +Y GGC FIR+ + K G+ +H Sbjct: 811 VVKYDMERQSELAPHHDSSTYTLNIALNEYGKEYTAGGCEFIRHKFIWQGQKVGYATIHA 870 Query: 246 GRLTHYHEGLLVTKGTRYIMISFVD 270 G+L YH L +T G RYI++SFV+ Sbjct: 871 GKLLAYHRALPITSGKRYILVSFVN 895 >UniRef50_Q4TBD7 Cluster: Chromosome undetermined SCAF7145, whole genome shotgun sequence; n=2; Tetraodontidae|Rep: Chromosome undetermined SCAF7145, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 607 Score = 189 bits (461), Expect = 6e-47 Identities = 81/144 (56%), Positives = 103/144 (71%), Gaps = 2/144 (1%) Query: 71 EWTTRYLHPEYLANFEEDRKHL-MPCTDVYWFPLMSDRFCDEWIAIMEAYGKWSDGTNND 129 +W +Y+H Y FEE + PC DVYWFP S++ CD + MEA+G+WS G + D Sbjct: 465 DWKEKYVHENYSRIFEEQESFVEQPCPDVYWFPAFSEKMCDHLVETMEAHGQWSSGGHKD 524 Query: 130 KRLESGYEAVPTRDIHMSQVGLERHWLQILKDYVRPLQELVFTGYYHNPPASIMNFVVRY 189 +RL GYE VPT D HM+Q+G E+ WL+ L+DY+ P+ E ++ GYY A IMNFVVRY Sbjct: 525 
ERLSGGYENVPTVDTHMNQIGFEKEWLRFLRDYIVPVTEKLYPGYYPRAQA-IMNFVVRY 583 Query: 190 RPDEQPSLRPHHDSSTYTINLALN 213 RPDEQPSLRPHHDSST+TIN+ALN Sbjct: 584 RPDEQPSLRPHHDSSTFTINIALN 607 >UniRef50_Q1VL57 Cluster: Putative uncharacterized protein; n=1; Psychroflexus torquis ATCC 700755|Rep: Putative uncharacterized protein - Psychroflexus torquis ATCC 700755 Length = 364 Score = 67.7 bits (158), Expect = 3e-10 Identities = 70/256 (27%), Positives = 116/256 (45%), Gaps = 27/256 (10%) Query: 14 YVQEGHDSDMAFCASLRELNIFMYVSNEEDF--GHLVNPETYDITKT-HPDMYQLFENNM 70 YVQ+ H S+ A E IF + + G L NP T T H + Q + M Sbjct: 116 YVQKQHLSNKFSIACDVEGYIFTCYEPKIEVRNGQLYNPVTGCFTCAYHGNGGQKQKELM 175 Query: 71 EWTTRYLHPEYLANFEEDRKHLMPCTDVYWFPLMSDRFCDEWIAIMEAYGKWSDGTNNDK 130 E + Y N+ +K+ + D+ MS+ C I++ E Sbjct: 176 E---KVYSNFYGFNYIPTKKYDILSDDILLIDFMSEDMCQNMISLAE---------EKQF 223 Query: 131 RLESGYEAVPTRDIHMSQVGLERHWLQILKDYVRPLQELVFTGYYHNPPA--SIMN-FVV 187 R+ G + VP +++ + + W + + + + + E+V+ Y NP + + F++ Sbjct: 224 RIMEG-DKVPAQELRLKEF---EQWKLLEEHWNKIVYEIVWE--YWNPCHMWGLRDAFII 277 Query: 188 RYRPDEQPSLRPHHDSSTYTINLALNTPNVDYEGGGCRFIRYNCSVRNTKKGWLLMHPGR 247 +Y D+Q LR H+D+S T ++ LN DYEGG F R + + G L+ PG+ Sbjct: 278 KYEMDKQRELRLHNDASLVTGSIKLND---DYEGGVLEFPRQKVNNSDVPVGKCLLFPGQ 334 Query: 248 LTHYHEGLLVTKGTRY 263 +TH H L+TKGT+Y Sbjct: 335 VTHGHSSSLLTKGTKY 350 >UniRef50_A4RT30 Cluster: Protein Lysyl hydroxylase fusion protein, putative; n=1; Ostreococcus lucimarinus CCE9901|Rep: Protein Lysyl hydroxylase fusion protein, putative - Ostreococcus lucimarinus CCE9901 Length = 618 Score = 60.5 bits (140), Expect = 4e-08 Identities = 62/263 (23%), Positives = 110/263 (41%), Gaps = 25/263 (9%) Query: 13 TYVQEGHDSDMAFCASLRELNIFMYVSNEEDFGHLVNPETYDITKTHPDMYQLFENNMEW 72 T + HD+D+ L + + + H + + + K H + + F ++ + Sbjct: 372 TLLHNPHDTDV-----LHAYGVTLLSQGKWKLAHSIWQLAFKVDKAHLWLSKQFNKDVSY 426 Query: 73 TTRYLH--PEYLANFEEDRKHLMPCTDVYWFP--LMSDRFCDEWIAIMEAYGKWSDGTNN 128 PEY +F+ + ++Y +S C WI EA+ G + Sbjct: 427 
AAYLCRKCPEYYVSFQT-----ISIDEIYLTTNSAISPSACSSWIKTAEAHATNRGGWDT 481 Query: 129 DKRLESGYEAVPTRDIHMSQV-GLERHWLQILKDYVRPLQELVFTGYYHNPPASIMNFVV 187 D+ +++V T D+ + ++ + R W I + P + F F+V Sbjct: 482 DR-----HKSVATTDLPIHEIPSVLREWNLIFGQIIGPFIQERFRVDGDTNLRVHDAFIV 536 Query: 188 RY-RPDEQPSLRPHHDSSTYTINLALNTPNVDYEGGGCRFIRYNCSVRNTKKGWLLMHPG 246 +Y D Q L H D ++I L+LN P + Y+GGG F + VR K G + Sbjct: 537 KYDASDGQCQLPVHTDQGHFSITLSLNDP-IQYKGGGTIFPEHEFIVR-PKCGDFVAFRS 594 Query: 247 RLTHYHEGLLVTKGTRYIMISFV 269 LTH G+ +T G RYI+++F+ Sbjct: 595 YLTH--GGVPITSGVRYIVVAFL 615 >UniRef50_Q01F56 Cluster: SmkH; n=2; Ostreococcus tauri|Rep: SmkH - Ostreococcus tauri Length = 637 Score = 58.0 bits (134), Expect = 2e-07 Identities = 45/164 (27%), Positives = 77/164 (46%), Gaps = 12/164 (7%) Query: 109 CDEWIAIMEAYGKWSDGTNNDKRLESGYEAVPTRDIHMSQV-GLERHWLQILKDYVRPLQ 167 C W+ E+ + G + + ++AVPT D+ + ++ G+ W ++ + P Sbjct: 480 CPSWVEAAESVARSRGGWDTAR-----HKAVPTTDLPIHEIPGVMEQWNRLFSVVISPFI 534 Query: 168 ELVFTGYYHNPPASIMN-FVVRYRPDE-QPSLRPHHDSSTYTINLALNTPNVDYEGGGCR 225 F + + FVV+Y +E Q L H D +++ LAL+ DY GGG Sbjct: 535 RDRFRLPTSFGTLYVHDAFVVKYNANEGQRELPVHTDQGQFSLTLALHDTQ-DYSGGGTI 593 Query: 226 FIRYNCSVRNTKKGWLLMHPGRLTHYHEGLLVTKGTRYIMISFV 269 F + C VR + G + LTH G+ +T G RYI+++F+ Sbjct: 594 FPEHECIVR-PRCGDFVAFRSSLTH--GGVPITAGVRYIVVAFL 634 >UniRef50_A7SXT6 Cluster: Predicted protein; n=3; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 344 Score = 56.0 bits (129), Expect = 1e-06 Identities = 44/179 (24%), Positives = 86/179 (48%), Gaps = 21/179 (11%) Query: 97 DVYWFPLMSDRFCDEWIAIMEAYGKWSDGTNNDKRLESGYEAVPTRDIHMSQVGLERHWL 156 +VY P+ ++ FC+++I +E + SD + Y + +S +G + H++ Sbjct: 134 EVYRLPVFTESFCEQFIEELEHFES-SDVPRGRPNTMNNYGVL------LSDLGFDEHFI 186 Query: 157 QILK-DYVRPLQELVFTGYYHNPPASIMNFVVRYRPDEQPSLRPHHDSSTYTINLALNTP 215 L+ +Y++P+ L+F + + S F V Y P + L H+D++ T+++ L Sbjct: 187 NPLRREYLQPITALLFPQWGGDGLDSHKAFTVHYMPGKDTELSYHYDNAEVTLSVCLGR- 245 Query: 216 
NVDYEGGGCRF--IRY------NCSVRNTKKGWLLMHPGRLTHYHEGLLVTKGTRYIMI 266 ++ GG F +R C+ + + L+H G+ H H L T+G+RY +I Sbjct: 246 --EFSGGDLYFGDMRQVLLEDTQCTEVENRPTYGLLHRGQ--HMHGALPTTQGSRYNLI 300 >UniRef50_Q15SJ2 Cluster: 2OG-Fe(II) oxygenase; n=6; Proteobacteria|Rep: 2OG-Fe(II) oxygenase - Pseudoalteromonas atlantica (strain T6c / BAA-1087) Length = 306 Score = 50.4 bits (115), Expect = 5e-05 Identities = 29/117 (24%), Positives = 53/117 (45%), Gaps = 4/117 (3%) Query: 157 QILKDYVRPLQELVFTGYYHNPPASIMNFVVRYRPDEQPSLRPHHDSSTYTINLALNTPN 216 ++L Y+RP+ L+F F + Y+P+ S+RPH D+S T+N+ LN P+ Sbjct: 158 EMLDRYMRPIARLLFPDIV-GYDTQTFGFSIHYKPNTDTSIRPHTDASAVTLNINLNLPD 216 Query: 217 VDYEGGGCRFIRYNCSVR---NTKKGWLLMHPGRLTHYHEGLLVTKGTRYIMISFVD 270 + G F K G ++H G + H + + + T +++ + D Sbjct: 217 EVFTGSNVDFYDPTTGKMIGLAFKPGSAMIHRGNVVHAAQPITSGERTNFVLWLYGD 273 >UniRef50_Q26DQ8 Cluster: Oxidoreductase, 20G-Fe(II) oxygenase superfamily; n=2; Flavobacteria|Rep: Oxidoreductase, 20G-Fe(II) oxygenase superfamily - Flavobacteria bacterium BBFL7 Length = 320 Score = 46.4 bits (105), Expect = 8e-04 Identities = 22/87 (25%), Positives = 42/87 (48%), Gaps = 2/87 (2%) Query: 140 PTRDIHMSQVGLERHWLQILKDYVRPLQELVFTGYYHNPPASIMNFVVRYRPDEQPSLRP 199 P + H++ + + I+ Y+RP+ L+ Y + F ++Y PD+ L Sbjct: 142 PRSEGHLAAPNFQSFYNTIMDRYMRPIARLLLGTYGFDNQT--FGFSIQYNPDKDKDLHA 199 Query: 200 HHDSSTYTINLALNTPNVDYEGGGCRF 226 H D+S T+N+ +N P+ ++ G F Sbjct: 200 HTDASAATLNININLPDEEFTGSQVDF 226 >UniRef50_A2ZPZ8 Cluster: Putative uncharacterized protein; n=2; Oryza sativa|Rep: Putative uncharacterized protein - Oryza sativa subsp. 
japonica (Rice) Length = 366 Score = 45.2 bits (102), Expect = 0.002 Identities = 49/196 (25%), Positives = 79/196 (40%), Gaps = 18/196 (9%) Query: 84 NFEEDRKHLM--PCTDVYWFPLMSDRFCDEWIAIMEAYGKWSDGTNNDKRLESGYEAVPT 141 N EE +M P V+ FP++ FC ++ + + +W+ N + + Sbjct: 128 NTEESITSIMMEPAPGVFAFPMLKPSFCQMLMSEVNNFLRWAQSANQRIMRPTSLDR-HG 186 Query: 142 RDIHMSQVGLERHWLQILKDYVRPLQELVFTGYYHNPPASIMNFVVRYRPDEQPSLRPHH 201 R +S GL+ ++KD++ P+ ++F N S FV+ Y E R H Sbjct: 187 RGAALSDFGLQEMLDNLMKDFISPMSTVLFPEVGGNTLDSHHTFVLEY--GEADGARGFH 244 Query: 202 -DSSTYTINLAL--NTPNVDYEGGGCRFIRYNCS--------VRNTKKGWLLMHPGRLTH 250 D S T+N+ L + D G R + S V G +L+H G +H Sbjct: 245 VDDSEVTLNICLGKHFTGADMYFRGIRCGNHVNSGTHDEEYFVHPNVPGQVLLHHG--SH 302 Query: 251 YHEGLLVTKGTRYIMI 266 H VT G R M+ Sbjct: 303 RHGVFSVTSGRRVNMV 318 >UniRef50_A3KGZ2 Cluster: 2-oxoglutarate and iron-dependent oxygenase domain-containing protein 2; n=8; Deuterostomia|Rep: 2-oxoglutarate and iron-dependent oxygenase domain-containing protein 2 - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 345 Score = 44.4 bits (100), Expect = 0.003 Identities = 43/175 (24%), Positives = 80/175 (45%), Gaps = 15/175 (8%) Query: 98 VYWFPLMSDRFCDEWIAIMEAYGKWSDGTNNDKRLESGYEAVPTRDIHMSQVGLERHWLQ 157 V+ F + FC + + +E + + SD + Y V ++++G + ++ Sbjct: 131 VFRFQVFRKEFCKDLLEELEHFEQ-SDAPKGRPNTMNNYGIV------LNELGFDEGFIT 183 Query: 158 ILKD-YVRPLQELVFTGYYHNPPASIMNFVVRYRPDEQPSLRPHHDSSTYTINLALNTPN 216 L++ Y+RPL L+++ N S FVV+Y E +L H+D+S T+N++L Sbjct: 184 PLREVYLRPLTALLYSDCGGNCLDSHKAFVVKYDMHEDLNLSYHYDNSEVTLNVSLGKDF 243 Query: 217 VDYE--GGGCRFI---RYNCSVRNTKKGWLLMHPGRLTHYHEGLLVTKGTRYIMI 266 + G R + C + L+H G+ H H L ++ GTR+ +I Sbjct: 244 TEGNLFFGDMRQVPLSETECVEVEHRVTEGLLHRGQ--HMHGALSISSGTRWNLI 296 >UniRef50_A3TG91 Cluster: Putative uncharacterized protein; n=1; Janibacter sp. HTCC2649|Rep: Putative uncharacterized protein - Janibacter sp. 
HTCC2649 Length = 711 Score = 43.2 bits (97), Expect = 0.007 Identities = 25/86 (29%), Positives = 40/86 (46%), Gaps = 2/86 (2%) Query: 185 FVVRYRPDEQPSLRPHHDSSTYTINLALNTPNVDYEGGGCRFIRYNCSVRNTKKGWLLMH 244 FV + +P + H D S +T+N+ L P+ G + V ++GW + H Sbjct: 612 FVRHFSERTRPFIPFHPDDSHWTVNVPLEDPDQTSGGELVMLLDGGLRVVERRRGWAISH 671 Query: 245 PGRLTHYHEGLLVTKGTRYIMISFVD 270 PG L H VT G R+ +I+F + Sbjct: 672 PGALIHGVR--RVTHGDRWSLIAFYE 695 >UniRef50_Q55BW6 Cluster: Putative uncharacterized protein; n=3; Dictyostelium discoideum|Rep: Putative uncharacterized protein - Dictyostelium discoideum AX4 Length = 242 Score = 43.2 bits (97), Expect = 0.007 Identities = 31/131 (23%), Positives = 59/131 (45%), Gaps = 10/131 (7%) Query: 96 TDVYWFPLMSDRFCDEWIAIMEAYGKWSDGTNNDKRLESGYEAVPTRDIHMSQVGLERHW 155 T +Y F + + FC + + +E + T + + Y AV + ++G + Sbjct: 58 TRIYSFRIFTMEFCTKLLEEIENFKNTGLPTARPNSMNN-YGAV------LDEMGFTEFF 110 Query: 156 LQILKDYVRPLQELVFTGYYHNPPASIMNFVVRYRPDEQPSLRPHHDSSTYTINLALNTP 215 Q+ +DY+ +++ Y S F V+Y+ D++ L H+D S T+NL L + Sbjct: 111 KQLREDYLSLFTSILYKDYNGEKLNSHHAFAVQYKMDKEKELGFHYDESDITVNLCLGS- 169 Query: 216 NVDYEGGGCRF 226 ++ GG F Sbjct: 170 --EFTGGSLYF 178 >UniRef50_Q4RGA7 Cluster: Chromosome 12 SCAF15104, whole genome shotgun sequence; n=2; Tetraodontidae|Rep: Chromosome 12 SCAF15104, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 593 Score = 41.1 bits (92), Expect = 0.029 Identities = 39/175 (22%), Positives = 76/175 (43%), Gaps = 15/175 (8%) Query: 98 VYWFPLMSDRFCDEWIAIMEAYGKWSDGTNNDKRLESGYEAVPTRDIHMSQVGLERHWLQ 157 VY FP+ FC++ + +E + + S + + ++++GL+ ++ Sbjct: 381 VYRFPVFEKSFCEQLLEELEHFEQSSAPKGRPNTMNR-------HGVLLNELGLDEGFIT 433 Query: 158 ILKD-YVRPLQELVFTGYYHNPPASIMNFVVRYRPDEQPSLRPHHDSSTYTINLALNTPN 216 L++ Y++P+ L++ S FVV+Y E L H+D++ T+N ++ Sbjct: 434 PLREHYLQPVSALLYPECGGGRLDSHKAFVVKYDMKEDVELSYHYDNAEVTLNASIGKEF 493 Query: 217 VD---YEGG--GCRFIRYNCSVRNTKKGWLLMHPGRLTHYHEGLLVTKGTRYIMI 266 D Y G CS + L+H G+ H H L ++ G R+ +I Sbjct: 494 
TDGNLYFGALKQVPLGEAECSEVEHRVSEGLLHRGQ--HMHGALPISSGQRWNLI 546 Score = 39.5 bits (88), Expect = 0.089 Identities = 34/166 (20%), Positives = 73/166 (43%), Gaps = 14/166 (8%) Query: 59 HPDMYQLFENNMEWTTRYLHPEYLANFEEDRKHLMPCTD-----VYWFPLMSDRFCDEWI 113 HP +Y L E + R + +Y + ++ L+ + VY FP+ FC++ + Sbjct: 30 HPQVYNLQERYLAPRFRQI-VQYCQRTDATKEGLLAFLEEAAVGVYRFPVFEKSFCEQLL 88 Query: 114 AIMEAYGKWSDGTNNDKRLESGYEAVPTRDIHMSQVGLERHWLQILKD-YVRPLQELVFT 172 +E + + S + + ++++GL+ ++ L++ Y++P+ L++ Sbjct: 89 EELEHFEQSSAPKGRPNTMNR-------HGVLLNELGLDEGFITPLREHYLQPVSALLYP 141 Query: 173 GYYHNPPASIMNFVVRYRPDEQPSLRPHHDSSTYTINLALNTPNVD 218 S FVV+Y E L H+D++ T+N ++ D Sbjct: 142 ECGGGRLDSHKAFVVKYDMKEDVELSYHYDNAEVTLNASIGKEFTD 187 >UniRef50_Q9LV19 Cluster: Gb|AAB72163.1; n=5; Magnoliophyta|Rep: Gb|AAB72163.1 - Arabidopsis thaliana (Mouse-ear cress) Length = 394 Score = 41.1 bits (92), Expect = 0.029 Identities = 34/142 (23%), Positives = 62/142 (43%), Gaps = 11/142 (7%) Query: 84 NFEEDRKHLM--PCTDVYWFPLMSDRFCDEWIAIMEAYGKWSDGTN---NDKRLESGYEA 138 N EE ++++ P V+ F ++ FC+ +A ++ + +W T + Y A Sbjct: 153 NTEESFRNIISEPSPGVFVFDMLQPSFCEMMLAEIDNFERWVGETKFRIMRPNTMNKYGA 212 Query: 139 VPTRDIHMSQVGLERHWLQILKDYVRPLQELVFTGYYHNPPASIMNFVVRYRPDEQPSLR 198 V + GL+ ++++ ++RP+ ++ F+ S FVV Y D L Sbjct: 213 V------LDDFGLDTMLDKLMEGFIRPISKVFFSDVGGATLDSHHGFVVEYGKDRDVDLG 266 Query: 199 PHHDSSTYTINLALNTPNVDYE 220 H D S T+N+ L V E Sbjct: 267 FHVDDSEVTLNVCLGNQFVGGE 288 >UniRef50_Q58LI8 Cluster: Possible dioxygenase; n=1; Cyanophage P-SSM4|Rep: Possible dioxygenase - Cyanophage P-SSM4 Length = 196 Score = 38.7 bits (86), Expect = 0.16 Identities = 20/55 (36%), Positives = 28/55 (50%), Gaps = 1/55 (1%) Query: 216 NVDYEGGGCRFIRYNCSVRNTKKGWLLMHPGRLTHYHEGLLVTKGTRYIMISFVD 270 N D+EGG F+ Y K+G +L+ P TH H GL G +YI S+ + Sbjct: 139 NDDFEGGETEFL-YQHKRFKPKRGQVLIWPAGFTHTHRGLPPLDGAKYISTSWTE 192 >UniRef50_Q46JM1 Cluster: Putative uncharacterized protein; n=1; Prochlorococcus marinus str. 
NATL2A|Rep: Putative uncharacterized protein - Prochlorococcus marinus (strain NATL2A) Length = 199 Score = 36.3 bits (80), Expect = 0.83 Identities = 16/51 (31%), Positives = 27/51 (52%), Gaps = 1/51 (1%) Query: 220 EGGGCRFIRYNCSVRNTKKGWLLMHPGRLTHYHEGLLVTKGTRYIMISFVD 270 EGG F Y+ + +KG L+ P TH H G ++ G +YI+ +++ Sbjct: 147 EGGSTYFSHYDLEIE-PRKGLTLIWPAEWTHAHRGNILKAGKKYIITGWIN 196 >UniRef50_Q58MP6 Cluster: Dioxygenase; n=1; Cyanophage P-SSM2|Rep: Dioxygenase - Cyanophage P-SSM2 Length = 197 Score = 35.9 bits (79), Expect = 1.1 Identities = 21/68 (30%), Positives = 34/68 (50%), Gaps = 1/68 (1%) Query: 203 SSTYTINLALNTPNVDYEGGGCRFIRYNCSVRNTKKGWLLMHPGRLTHYHEGLLVTKGTR 262 +S Y + L N D+EGG F+ N ++ + G ++ P TH H G G++ Sbjct: 122 TSPYRQLVTLLYLNDDFEGGETEFLYQNVRIK-PQAGKFIIFPPFWTHTHRGNPPIGGSK 180 Query: 263 YIMISFVD 270 YI+ S+ D Sbjct: 181 YIITSWAD 188 >UniRef50_Q5UNV6 Cluster: Uncharacterized protein R699; n=1; Acanthamoeba polyphaga mimivirus|Rep: Uncharacterized protein R699 - Mimivirus Length = 455 Score = 35.9 bits (79), Expect = 1.1 Identities = 14/30 (46%), Positives = 21/30 (70%) Query: 19 HDSDMAFCASLRELNIFMYVSNEEDFGHLV 48 +D DM C SLR+ IFMY+ N ++G++V Sbjct: 426 NDKDMDLCFSLRKHTIFMYMINNNNYGYMV 455 >UniRef50_Q6N063 Cluster: 2-oxoglutarate and iron-dependent oxygenase domain-containing protein 2; n=13; Amniota|Rep: 2-oxoglutarate and iron-dependent oxygenase domain-containing protein 2 - Homo sapiens (Human) Length = 350 Score = 35.9 bits (79), Expect = 1.1 Identities = 27/116 (23%), Positives = 55/116 (47%), Gaps = 8/116 (6%) Query: 98 VYWFPLMSDRFCDEWIAIMEAYGKWSDGTNNDKRLESGYEAVPTRDIHMSQVGLERHWLQ 157 +Y P+ + FC + +E + + SD + Y + + ++GL+ + Sbjct: 139 IYRVPVFTAPFCQALLEELEHFEQ-SDMPKGRPNTMNNYGVL------LHELGLDEPLMT 191 Query: 158 ILKD-YVRPLQELVFTGYYHNPPASIMNFVVRYRPDEQPSLRPHHDSSTYTINLAL 212 L++ +++PL L++ S FVV+Y P + L H+D++ T+N+AL Sbjct: 192 PLRERFLQPLMALLYPDCGGGRLDSHRAFVVKYAPGQDLELGCHYDNAELTLNVAL 247 >UniRef50_UPI0000F21643 Cluster: PREDICTED: similar to pol polyprotein; n=24; 
Danio rerio|Rep: PREDICTED: similar to pol polyprotein - Danio rerio Length = 1836 Score = 34.7 bits (76), Expect = 2.5 Identities = 21/72 (29%), Positives = 38/72 (52%), Gaps = 8/72 (11%) Query: 102 PLMSDRFCDEWIAIMEAYGKWSDGTNN---DKRLESGYEAVPTRDIHMSQVGLERHWL-- 156 P++++R C+EW+ + +D N DK L+ YE + ++ + V R WL Sbjct: 1449 PVLAERDCEEWLRFYKELKITADVNTNKSKDKELKKCYEC---QVVYGTTVSFARDWLTW 1505 Query: 157 QILKDYVRPLQE 168 ++L+ VRP +E Sbjct: 1506 RVLRQNVRPKRE 1517 >UniRef50_A5PBA9 Cluster: Oxidoreductase, 2OG-Fe(II) oxygenase family protein; n=2; Sphingomonadales|Rep: Oxidoreductase, 2OG-Fe(II) oxygenase family protein - Erythrobacter sp. SD-21 Length = 363 Score = 34.7 bits (76), Expect = 2.5 Identities = 22/77 (28%), Positives = 34/77 (44%), Gaps = 7/77 (9%) Query: 198 RPHHDSSTY-----TINLALNTPNVDYEGGGCRFIRYNCSVRNTKKGWLLMHPGRLTHYH 252 RPH D++T+ + +N +YEGG RF + G ++ L H Sbjct: 261 RPHRDNTTFGTAHRRFAVTVNLNAEEYEGGNLRFPEFGQRTYRAPTGGAVVFSCSLLH-- 318 Query: 253 EGLLVTKGTRYIMISFV 269 E VT+G RY + F+ Sbjct: 319 EATPVTRGERYAFLPFL 335 >UniRef50_P20792 Cluster: Cell surface receptor daf-1 precursor; n=2; Caenorhabditis elegans|Rep: Cell surface receptor daf-1 precursor - Caenorhabditis elegans Length = 669 Score = 34.7 bits (76), Expect = 2.5 Identities = 41/143 (28%), Positives = 65/143 (45%), Gaps = 22/143 (15%) Query: 67 ENNMEWTTRYLHPEYL-ANFEEDRKHLMPCTDVYWFPL-MSDRFC---DEWIAIMEA--- 118 EN T RYL PE L + + C DVY F L M + C D + EA Sbjct: 460 ENYKCGTVRYLAPEILNSTMQFTVFESYQCADVYSFSLVMWETLCRCEDGDVLPREAATV 519 Query: 119 --YGKWSDGTNNDKRLESGYEAVPTRDIHMSQVGLERHWLQILKDY--VRPLQELVFTGY 174 Y +W+D D ++ ++ V TR + ++ L W KD+ ++ + E++ T + Sbjct: 520 IPYIEWTDRDPQDAQM---FDVVCTRRLRPTENPL---W----KDHPEMKHIMEIIKTCW 569 Query: 175 YHNPPASIMNFVVRYRPDEQPSL 197 NP A +++ R R DE+ L Sbjct: 570 NGNPSARFTSYICRKRMDERQQL 592 >UniRef50_UPI00015A7ABE Cluster: UPI00015A7ABE related cluster; n=8; Danio rerio|Rep: UPI00015A7ABE UniRef100 entry - Danio rerio Length = 563 Score = 34.3 bits (75), Expect = 3.4 Identities = 36/169 
(21%), Positives = 68/169 (40%), Gaps = 8/169 (4%) Query: 55 ITKTHPDMYQLFENNMEWTTRYLHPEYLANFEEDRKHLMPCTDVYWFPLMSDRFCD-EWI 113 + KT DM QL ++ M+ H L + + K C +V+ + S C E + Sbjct: 180 LMKTQTDMQQLIQDRMKMIKEIQHSVELRK-KNNEKEKADCVEVFADLMRSIERCQRELL 238 Query: 114 AIMEAYGKWSDGTNND--KRLESGYEAVPTRDIHMSQVGLER---HWLQILKDYVRPLQE 168 + E K ++ + K LE + R+ + ++ H LQ+ RPL Sbjct: 239 EVTEQKQKAAEKQAEELIKELEQEISELRRRNTELEELSHTEDHLHLLQMFPSLCRPLDI 298 Query: 169 LVFTGYYHNPPASIMNFVVRYRPDEQPSLRPHHDSSTYTINLALNTPNV 217 ++TG N S+ + Q ++ +++ Y++ LN N+ Sbjct: 299 KIWTGININTGVSV-ETLRSALSQLQETIDEGFNNNEYSLKTILNIQNL 346 >UniRef50_Q4JN23 Cluster: Putative uncharacterized protein; n=1; uncultured bacterium BAC13K9BAC|Rep: Putative uncharacterized protein - uncultured bacterium BAC13K9BAC Length = 199 Score = 34.3 bits (75), Expect = 3.4 Identities = 16/54 (29%), Positives = 31/54 (57%), Gaps = 1/54 (1%) Query: 216 NVDYEGGGCRFIRYNCSVRNTKKGWLLMHPGRLTHYHEGLLVTKGTRYIMISFV 269 +V+ GG F+ V+ ++G L++ P TH H G+ + KG++YI +++ Sbjct: 144 DVEGPGGETEFLHQKVKVK-PEEGKLVVFPPFWTHEHRGVTLKKGSKYIATTWI 196 >UniRef50_A0BE00 Cluster: Chromosome undetermined scaffold_101, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_101, whole genome shotgun sequence - Paramecium tetraurelia Length = 643 Score = 33.9 bits (74), Expect = 4.4 Identities = 21/76 (27%), Positives = 33/76 (43%), Gaps = 4/76 (5%) Query: 16 QEGHDSDMAFCASLRELNIFMY--VSNEEDFGHLVNPE--TYDITKTHPDMYQLFENNME 71 +EGH + C +L ++ F+Y V N G NP T ++ P+ Y+ E N Sbjct: 30 EEGHKTLFGACLTLGLISFFLYLLVINLYTLGQRDNPTSLTTEVYHAQPEYYKFNEQNFT 89 Query: 72 WTTRYLHPEYLANFEE 87 T P+Y +E Sbjct: 90 LTFAIQSPDYATYIDE 105 >UniRef50_UPI0000DB7621 Cluster: PREDICTED: similar to DNA ligase 3 (DNA ligase III) (Polydeoxyribonucleotide synthase [ATP] 3); n=2; Apocrita|Rep: PREDICTED: similar to DNA ligase 3 (DNA ligase III) (Polydeoxyribonucleotide synthase [ATP] 3) - Apis mellifera Length = 1009 Score = 33.5 bits (73), Expect = 5.9 Identities = 14/34 
(41%), Positives = 20/34 (58%) Query: 130 KRLESGYEAVPTRDIHMSQVGLERHWLQILKDYV 163 K L G E + +DIH +RHWL++ KDY+ Sbjct: 563 KILNMGLEGLVLKDIHSKYEPGKRHWLKVKKDYL 596 >UniRef50_Q8DKV0 Cluster: Tlr0755 protein; n=1; Synechococcus elongatus|Rep: Tlr0755 protein - Synechococcus elongatus (Thermosynechococcus elongatus) Length = 197 Score = 33.5 bits (73), Expect = 5.9 Identities = 16/54 (29%), Positives = 28/54 (51%), Gaps = 1/54 (1%) Query: 216 NVDYEGGGCRFIRYNCSVRNTKKGWLLMHPGRLTHYHEGLLVTKGTRYIMISFV 269 N D++GG F R + + G +++ P TH H L V +GT+Y +++ Sbjct: 143 NEDFQGGETYFDRQGVKI-TPRTGDIVVFPAYYTHPHAALPVVQGTKYAFATWL 195 >UniRef50_Q5LRQ3 Cluster: TPR domain protein; n=4; cellular organisms|Rep: TPR domain protein - Silicibacter pomeroyi Length = 557 Score = 33.5 bits (73), Expect = 5.9 Identities = 15/58 (25%), Positives = 25/58 (43%) Query: 43 DFGHLVNPETYDITKTHPDMYQLFENNMEWTTRYLHPEYLANFEEDRKHLMPCTDVYW 100 D+G+ + + +T P+ F+ + WT Y H E A F +H C +W Sbjct: 5 DYGYDLGQYSCPVTTAAPEAQLWFDRGLIWTYGYNHAEAAACFRRALEHDPDCAMAHW 62 >UniRef50_Q5GQB2 Cluster: Putative uncharacterized protein; n=1; Cyanophage phage S-PM2|Rep: Putative uncharacterized protein - Cyanophage phage S-PM2 Length = 238 Score = 33.5 bits (73), Expect = 5.9 Identities = 16/45 (35%), Positives = 25/45 (55%), Gaps = 1/45 (2%) Query: 221 GGGCRFIRYNCSVRNTKKGWLLMHPGRLTHYHEGLLVTKGTRYIM 265 GG F+ + + TK G +++ P +TH H G V KG +YI+ Sbjct: 185 GGETEFLYQHKRISPTK-GTVVVFPAGMTHVHRGNTVLKGNKYIV 228 >UniRef50_UPI0000DB7C47 Cluster: PREDICTED: similar to fibroblast growth factor receptor substrate 2; n=1; Apis mellifera|Rep: PREDICTED: similar to fibroblast growth factor receptor substrate 2 - Apis mellifera Length = 483 Score = 33.1 bits (72), Expect = 7.7 Identities = 16/45 (35%), Positives = 26/45 (57%), Gaps = 1/45 (2%) Query: 33 NIFMYVSNEEDFGHLVNPETYDITKTHPDMYQLFENNMEWTTRYL 77 NIF V N +D G+L+ P ++T+T +YQ + ++W R L Sbjct: 16 NIFQ-VMNVDDLGNLITPGRLEVTETDIVLYQRGKQPIKWPLRCL 59 >UniRef50_Q0HJW2 Cluster: Prolyl 4-hydroxylase, alpha subunit; n=9; 
Shewanella|Rep: Prolyl 4-hydroxylase, alpha subunit - Shewanella sp. (strain MR-4) Length = 231 Score = 33.1 bits (72), Expect = 7.7 Identities = 20/70 (28%), Positives = 33/70 (47%), Gaps = 1/70 (1%) Query: 200 HHDSSTYTINLALNTPNVDYEGGGCRFIRYNCSVRNTKKGWLLMHPGRLTHYHEGLLVTK 259 +H+ + + + L + N EGG F Y + KKG +++ P TH H G + Sbjct: 150 NHNEALHRVVLYMFYLNDVEEGGETEFY-YQQRKISPKKGTMVIAPAGFTHSHRGNMPIS 208 Query: 260 GTRYIMISFV 269 +YI S+V Sbjct: 209 NDKYIATSWV 218 >UniRef50_A3WSE0 Cluster: Putative uncharacterized protein; n=1; Nitrobacter sp. Nb-311A|Rep: Putative uncharacterized protein - Nitrobacter sp. Nb-311A Length = 289 Score = 33.1 bits (72), Expect = 7.7 Identities = 21/88 (23%), Positives = 41/88 (46%), Gaps = 2/88 (2%) Query: 4 FRAPKAKAVTYVQEGHDSDMAFCASLRELNI--FMYVSNEEDFGHLVNPETYDITKTHPD 61 FR+ +AKA+ + H + +A C + E+ + F+ SN+++ NP+ I Sbjct: 61 FRSKQAKALHFADLSHPNKVAVCRKISEMELRCFVVASNKKNMEGYTNPDAAKIPSQCWF 120 Query: 62 MYQLFENNMEWTTRYLHPEYLANFEEDR 89 + +E TR++ + +F E R Sbjct: 121 YCWMTRVLLERVTRFVLYRSMLDFGEPR 148 >UniRef50_Q6ZD92 Cluster: Proline-rich protein-like; n=3; Oryza sativa|Rep: Proline-rich protein-like - Oryza sativa subsp. 
japonica (Rice)
          Length = 451

 Score = 33.1 bits (72), Expect = 7.7
 Identities = 16/59 (27%), Positives = 29/59 (49%), Gaps = 1/59 (1%)

Query: 123 SDGTNNDKRLESGYEAVPTRDIHMSQVGLERHW-LQILKDYVRPLQELVFTGYYHNPPA 180
           +D     K +E  + A  +R +H ++ G+E  W L +L D+ R   + V   +  +P A
Sbjct: 211 NDREKQIKAIEDSFRAAKSRPVHQTKRGMEAEWVLPLLPDFDRYDDQFVMVNFDGDPTA 269

  Database: uniref50
    Posted date:  Oct 5, 2007 11:19 AM
  Number of letters in database: 575,637,011
  Number of sequences in database:  1,657,284

Lambda     K      H
   0.322    0.138    0.446

Gapped
Lambda     K      H
   0.279   0.0580    0.190

Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 346,879,650
Number of Sequences: 1657284
Number of extensions: 15223783
Number of successful extensions: 27410
Number of sequences better than 10.0: 40
Number of HSP's better than 10.0 without gapping: 19
Number of HSP's successfully gapped in prelim test: 21
Number of HSP's that attempted gapping in prelim test: 27361
Number of HSP's gapped (non-prelim): 41
length of query: 271
length of database: 575,637,011
effective HSP length: 99
effective length of query: 172
effective length of database: 411,565,895
effective search space: 70789333940
effective search space used: 70789333940
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.9 bits)
S2: 72 (33.1 bits)