BLASTP 2.2.12 [Aug-07-2005]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= BGIBMGA001797-TA|BGIBMGA001797-PA|IPR002737|Protein of unknown
function DUF52
         (304 letters)

Database: uniref50
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                         (bits)  Value

UniRef50_Q9VG04 Cluster: CG8031-PA; n=9; Eukaryota|Rep: CG8031-P...   383   e-105
UniRef50_Q9Y316 Cluster: Protein MEMO1; n=34; Bilateria|Rep: Pro...   340   3e-92
UniRef50_Q22915 Cluster: UPF0103 protein tag-253; n=9; Eumetazoa...   245   1e-63
UniRef50_Q5DEE7 Cluster: SJCHGC02434 protein; n=1; Schistosoma j...   235   8e-61
UniRef50_Q0V4V5 Cluster: Putative uncharacterized protein; n=1; ...   213   5e-54
UniRef50_Q4PAA6 Cluster: Putative uncharacterized protein; n=1; ...   204   3e-51
UniRef50_A0C8S3 Cluster: Chromosome undetermined scaffold_159, w...   194   2e-48
UniRef50_A0CYK3 Cluster: Chromosome undetermined scaffold_31, wh...   192   1e-47
UniRef50_A2QP92 Cluster: Similarity to hypothetical protein At2g...   183   4e-45
UniRef50_UPI000023E320 Cluster: hypothetical protein FG00949.1; ...   163   4e-39
UniRef50_Q4U9C7 Cluster: Putative uncharacterized protein; n=2; ...   162   1e-38
UniRef50_Q6CB70 Cluster: Similar to sp|P47085 Saccharomyces cere...   161   2e-38
UniRef50_A4R520 Cluster: Putative uncharacterized protein; n=2; ...   160   4e-38
UniRef50_A5K624 Cluster: Putative uncharacterized protein; n=4; ...   157   4e-37
UniRef50_A2DDC6 Cluster: Putative uncharacterized protein; n=1; ...   153   5e-36
UniRef50_Q38B52 Cluster: Putative uncharacterized protein; n=2; ...   150   5e-35
UniRef50_A2DWN3 Cluster: Putative uncharacterized protein; n=1; ...   150   5e-35
UniRef50_Q7S447 Cluster: Putative uncharacterized protein NCU024...   149   1e-34
UniRef50_Q10212 Cluster: UPF0103 protein C4H3.04c; n=1; Schizosa...   146   5e-34
UniRef50_A5DB33 Cluster: Putative uncharacterized protein; n=1; ...   143   6e-33
UniRef50_Q4WHW4 Cluster: DUF52 domain protein; n=6; Trichocomace...   141   2e-32
UniRef50_Q5KH61 Cluster: Putative uncharacterized protein; n=1; ...   141   2e-32
UniRef50_Q4Q1W0 Cluster: Putative uncharacterized protein; n=3; ...   139   7e-32
UniRef50_Q1DNQ3 Cluster: Putative uncharacterized protein; n=1; ...   136   5e-31
UniRef50_A3LWQ7 Cluster: Predicted protein; n=4; Saccharomycetal...   134   3e-30
UniRef50_P47085 Cluster: UPF0103 protein YJR008W; n=6; Saccharom...   128   2e-28
UniRef50_O15753 Cluster: 2034 protein; n=2; Dictyostelium discoi...   126   9e-28
UniRef50_A2FL46 Cluster: Putative uncharacterized protein; n=1; ...   114   3e-24
UniRef50_A7ATY0 Cluster: Putative uncharacterized protein; n=1; ...   111   2e-23
UniRef50_A6PTD3 Cluster: Putative uncharacterized protein; n=1; ...   108   2e-22
UniRef50_Q7RG18 Cluster: Putative uncharacterized protein PY0453...    95   2e-18
UniRef50_A1SXX4 Cluster: Putative uncharacterized protein; n=2; ...    88   2e-16
UniRef50_A6Q8X5 Cluster: Putative uncharacterized protein; n=2; ...    87   7e-16
UniRef50_Q1Q7G0 Cluster: Putative uncharacterized protein; n=1; ...    86   1e-15
UniRef50_Q6LSR4 Cluster: Putative uncharacterized protein; n=2; ...    85   2e-15
UniRef50_A6CYQ1 Cluster: Putative uncharacterized protein; n=1; ...    81   3e-14
UniRef50_Q2W0W5 Cluster: Predicted dioxygenase; n=4; Rhodospiril...    79   1e-13
UniRef50_A0L9L0 Cluster: Putative uncharacterized protein; n=1; ...    79   1e-13
UniRef50_A1RWV3 Cluster: Putative uncharacterized protein; n=1; ...    77   6e-13
UniRef50_A0LJS7 Cluster: AMMECR1 domain protein precursor; n=3; ...    76   1e-12
UniRef50_Q2BMM2 Cluster: Putative uncharacterized protein; n=1; ...    73   9e-12
UniRef50_Q5ZWB6 Cluster: Putative uncharacterized protein; n=4; ...    73   1e-11
UniRef50_Q2S9S7 Cluster: Predicted dioxygenase; n=15; Proteobact...    72   2e-11
UniRef50_Q3VWM2 Cluster: Putative uncharacterized protein; n=2; ...    71   3e-11
UniRef50_A6QB54 Cluster: Putative uncharacterized protein; n=1; ...    71   3e-11
UniRef50_A0X3C5 Cluster: Putative uncharacterized protein; n=3; ...    69   2e-10
UniRef50_A6DA73 Cluster: Putative uncharacterized protein; n=1; ...    69   2e-10
UniRef50_A1WY73 Cluster: Putative uncharacterized protein; n=1; ...    67   5e-10
UniRef50_Q6L0F9 Cluster: Hypothetical conserved protein DUF52; n...    66   1e-09
UniRef50_Q978N2 Cluster: UPF0103 protein TV1383; n=2; Thermoplas...    66   1e-09
UniRef50_A4MJZ4 Cluster: Putative uncharacterized protein; n=1; ...    65   2e-09
UniRef50_A4BK98 Cluster: Putative uncharacterized protein; n=1; ...    65   2e-09
UniRef50_A5FQ21 Cluster: Putative uncharacterized protein; n=3; ...    62   2e-08
UniRef50_Q7QUI2 Cluster: GLP_516_10373_9414; n=1; Giardia lambli...    62   2e-08
UniRef50_O59292 Cluster: UPF0103 protein PH1626; n=5; Thermococc...    62   2e-08
UniRef50_Q2NG05 Cluster: Putative uncharacterized protein; n=1; ...    62   2e-08
UniRef50_A7DR31 Cluster: Putative uncharacterized protein; n=1; ...    61   4e-08
UniRef50_O67039 Cluster: UPF0103 protein aq_890; n=2; Aquifex ae...    60   5e-08
UniRef50_Q5SHL9 Cluster: Putative uncharacterized protein TTHA17...    60   7e-08
UniRef50_Q57846 Cluster: UPF0103 protein MJ0403; n=8; Euryarchae...    60   7e-08
UniRef50_Q2LQ76 Cluster: Hypothetical cytosolic protein; n=1; Sy...    59   2e-07
UniRef50_A7IAG7 Cluster: Putative uncharacterized protein; n=1; ...    59   2e-07
UniRef50_A5UVY3 Cluster: Putative uncharacterized protein; n=2; ...    57   5e-07
UniRef50_O67355 Cluster: UPF0103 protein aq_1336; n=1; Aquifex a...    57   6e-07
UniRef50_A2BMN4 Cluster: Predicted dioxygenase; n=1; Hyperthermu...    56   9e-07
UniRef50_Q8ZYE1 Cluster: UPF0103 protein PAE0818; n=5; Thermopro...    56   9e-07
UniRef50_Q8G3N3 Cluster: Putative uncharacterized protein; n=1; ...    56   1e-06
UniRef50_Q30X41 Cluster: Putative uncharacterized protein; n=2; ...    56   2e-06
UniRef50_A5UN65 Cluster: Predicted dioxygenase; n=1; Methanobrev...    55   2e-06
UniRef50_Q96YW6 Cluster: UPF0103 protein ST2062; n=4; Sulfolobac...    54   3e-06
UniRef50_O26151 Cluster: UPF0103 protein MTH_45; n=1; Methanothe...    54   5e-06
UniRef50_Q1NJL5 Cluster: Putative uncharacterized protein; n=2; ...    52   2e-05
UniRef50_Q8TT38 Cluster: UPF0103 protein MA_0601; n=4; Methanosa...    52   2e-05
UniRef50_Q74NK0 Cluster: NEQ347; n=1; Nanoarchaeum equitans|Rep:...    50   7e-05
UniRef50_Q30PF9 Cluster: Putative uncharacterized protein; n=1; ...    49   1e-04
UniRef50_A7HMH8 Cluster: Putative uncharacterized protein; n=1; ...    49   1e-04
UniRef50_Q9WXU2 Cluster: UPF0103 protein TM_0087; n=2; Thermotog...    48   2e-04
UniRef50_A3JXY8 Cluster: Predicted dioxygenase; n=1; Sagittula s...    48   3e-04
UniRef50_Q1IL90 Cluster: Putative uncharacterized protein; n=2; ...    48   4e-04
UniRef50_A2BK85 Cluster: Universally conserved protein; n=3; Des...    47   7e-04
UniRef50_Q2IES1 Cluster: Putative uncharacterized protein; n=1; ...    46   0.001
UniRef50_Q66Q62 Cluster: Dor2; n=1; Sorangium cellulosum|Rep: Do...    46   0.002
UniRef50_A0LEC6 Cluster: Putative uncharacterized protein; n=1; ...    46   0.002
UniRef50_A0RY15 Cluster: Dioxygenase; n=1; Cenarchaeum symbiosum...    46   0.002
UniRef50_Q74C45 Cluster: Putative uncharacterized protein; n=7; ...    45   0.002
UniRef50_Q3A412 Cluster: Predicted dioxygenase; n=2; Desulfuromo...    45   0.002
UniRef50_Q0ABA7 Cluster: Dioxygenase-like protein; n=1; Alkalili...    45   0.002
UniRef50_A1VAM6 Cluster: Putative uncharacterized protein; n=2; ...    45   0.003
UniRef50_A2SR96 Cluster: Putative uncharacterized protein; n=1; ...    44   0.005
UniRef50_UPI0000498B94 Cluster: conserved hypothetical protein; ...    43   0.011
UniRef50_O51324 Cluster: Putative uncharacterized protein BB0349...    42   0.015
UniRef50_O27974 Cluster: UPF0103 protein AF_2310; n=2; Euryarcha...    41   0.035
UniRef50_Q9YB24 Cluster: UPF0103 protein APE_1771; n=1; Aeropyru...
39 0.14 UniRef50_Q56419 Cluster: UPF0103 protein TTHA0924; n=2; Thermus ... 39 0.18 UniRef50_Q5BSZ0 Cluster: SJCHGC03049 protein; n=1; Schistosoma j... 38 0.43 UniRef50_Q1HQS5 Cluster: Syndecan binding protein; n=5; Pancrust... 36 0.98 UniRef50_Q98GI9 Cluster: Encapsulation protein; CapA; n=1; Mesor... 36 1.7 UniRef50_Q1PVM2 Cluster: Putative uncharacterized protein; n=1; ... 36 1.7 UniRef50_A6REB9 Cluster: Predicted protein; n=1; Ajellomyces cap... 36 1.7 UniRef50_A3LYQ1 Cluster: Predicted protein; n=2; Saccharomycetac... 36 1.7 UniRef50_Q6C3Q3 Cluster: Yarrowia lipolytica chromosome E of str... 35 3.0 UniRef50_Q7SXQ0 Cluster: Zgc:66133; n=3; Danio rerio|Rep: Zgc:66... 33 9.2 UniRef50_Q6MQA3 Cluster: Iron-regulated protein A precursor; n=1... 33 9.2 UniRef50_A1SQY9 Cluster: Pentapeptide repeat protein; n=1; Psych... 33 9.2 >UniRef50_Q9VG04 Cluster: CG8031-PA; n=9; Eukaryota|Rep: CG8031-PA - Drosophila melanogaster (Fruit fly) Length = 295 Score = 383 bits (943), Expect = e-105 Identities = 184/281 (65%), Positives = 211/281 (75%), Gaps = 1/281 (0%) Query: 24 NSGSELSRQLDLWLSKADLTHGPARAIIAPHXXXXXXXXXXXXXXRQVSPVVVKRIFILG 83 +SG+ELSRQLD WL ADL+HGPARAIIAPH RQVSPVVVKRIFILG Sbjct: 15 DSGAELSRQLDRWLGAADLSHGPARAIIAPHAGYTYCGACAAFAYRQVSPVVVKRIFILG 74 Query: 84 PSHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDEQTDENEHSIEMHLP 143 PSHHVR+ GCALS KY+TPLYDL ID QI +ELE T +F MD +TDE+EHSIEMHLP Sbjct: 75 PSHHVRLRGCALSVAKKYRTPLYDLKIDAQINSELEKTGKFSWMDMKTDEDEHSIEMHLP 134 Query: 144 YIAKVMEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYLADPQNLLVISSDFCHWGSRFR 203 YIAKVME+YK FTI+PILVGSL PE+EA+YG++L+ YL DP NL VISSDFCHWG RF Sbjct: 135 YIAKVMEDYKDQFTIVPILVGSLNPEQEAQYGSLLSSYLMDPTNLFVISSDFCHWGHRFS 194 Query: 204 YTWKDSSRGHIYQSIEWLDKLGMDLIEKMDPKAFTDYLNKYGNTICGRHPIGVLLQAISK 263 YT+ DSS G I++SIE LDK GMD+IE ++P +FT+YL KY NTICGRHPIGV+L A+ Sbjct: 195 YTYYDSSCGAIHKSIEKLDKQGMDIIESLNPHSFTEYLRKYNNTICGRHPIGVMLGAVKA 254 Query: 264 LSSQSNAPKMSLKFLKYAQSSQCMNXXXXXXXXXXXXLVFE 304 L Q KMS KFLKYAQSSQC + LVFE Sbjct: 255 
LQDQ-GYDKMSFKFLKYAQSSQCQDIEDSSVSYASGSLVFE 294 >UniRef50_Q9Y316 Cluster: Protein MEMO1; n=34; Bilateria|Rep: Protein MEMO1 - Homo sapiens (Human) Length = 297 Score = 340 bits (835), Expect = 3e-92 Identities = 158/264 (59%), Positives = 195/264 (73%), Gaps = 2/264 (0%) Query: 25 SGSELSRQLDLWLSKADLTHGPARAIIAPHXXXXXXXXXXXXXXRQVSPVVVKRIFILGP 84 SG +L+ QL+ WLS+ T PARAIIAPH +QV P + +RIFILGP Sbjct: 20 SGPQLNAQLEGWLSQVQSTKRPARAIIAPHAGYTYCGSCAAHAYKQVDPSITRRIFILGP 79 Query: 85 SHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDEQTDENEHSIEMHLPY 144 SHHV ++ CALSS+D Y+TPLYDL ID++IY EL T F+RM QTDE+EHSIEMHLPY Sbjct: 80 SHHVPLSRCALSSVDIYRTPLYDLRIDQKIYGELWKTGMFERMSLQTDEDEHSIEMHLPY 139 Query: 145 IAKVMEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYLADPQNLLVISSDFCHWGSRFRY 204 AK ME +K FTIIP+LVG+L+ KE ++G + + YLADP NL V+SSDFCHWG RFRY Sbjct: 140 TAKAMESHKDEFTIIPVLVGALSESKEQEFGKLFSKYLADPSNLFVVSSDFCHWGQRFRY 199 Query: 205 TWKDSSRGHIYQSIEWLDKLGMDLIEKMDPKAFTDYLNKYGNTICGRHPIGVLLQAISKL 264 ++ D S+G IY+SIE LDK+GM +IE++DP +F++YL KY NTICGRHPIGVLL AI++L Sbjct: 200 SYYDESQGEIYRSIEHLDKMGMSIIEQLDPVSFSNYLKKYHNTICGRHPIGVLLNAITEL 259 Query: 265 SSQSNAPKMSLKFLKYAQSSQCMN 288 Q N MS FL YAQSSQC N Sbjct: 260 --QKNGMNMSFSFLNYAQSSQCRN 281 >UniRef50_Q22915 Cluster: UPF0103 protein tag-253; n=9; Eumetazoa|Rep: UPF0103 protein tag-253 - Caenorhabditis elegans Length = 350 Score = 245 bits (599), Expect = 1e-63 Identities = 126/259 (48%), Positives = 163/259 (62%), Gaps = 4/259 (1%) Query: 28 ELSRQLDLWLSKADLTHGPARAIIAPHXXXXXXXXXXXXXXRQVSPVVVKRIFILGPSHH 87 +L RQL WL A G ARA+I+PH +QV V+R+FILGPSH Sbjct: 74 DLDRQLTKWLDNAGPRIGTARALISPHAGYSYCGETAAYAFKQVVSSAVERVFILGPSHV 133 Query: 88 VRIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDEQTDENEHSIEMHLPYIAK 147 V + GCA+++ KY+TPL DL +D +I EL ATR FD MD + +E+EHSIEM LP+IAK Sbjct: 134 VALNGCAITTCSKYRTPLGDLIVDHKINEELRATRHFDLMDRRDEESEHSIEMQLPFIAK 193 Query: 148 VMEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYLADPQNLLVISSDFCHWGSRFRYTWK 207 VM + +TI+P+LVGSL ++ YG I A Y+ DP+NL VISSDFCHWG RF ++ Sbjct: 194 
VMGSKR--YTIVPVLVGSLPGSRQQTYGNIFAHYMEDPRNLFVISSDFCHWGERFSFSPY 251 Query: 208 D-SSRGHIYQSIEWLDKLGMDLIEKMDPKAFTDYLNKYGNTICGRHPIGVLLQAISKLSS 266 D S IY+ I +DK GM IE ++P AF DYL K NTICGR+PI ++LQA Sbjct: 252 DRHSSIPIYEQITNMDKQGMSAIETLNPAAFNDYLKKTQNTICGRNPILIMLQAAEHFRI 311 Query: 267 QSNAPKMSLKFLKYAQSSQ 285 SN +FL Y QS++ Sbjct: 312 -SNNHTHEFRFLHYTQSNK 329 >UniRef50_Q5DEE7 Cluster: SJCHGC02434 protein; n=1; Schistosoma japonicum|Rep: SJCHGC02434 protein - Schistosoma japonicum (Blood fluke) Length = 304 Score = 235 bits (576), Expect = 8e-61 Identities = 116/266 (43%), Positives = 165/266 (62%), Gaps = 5/266 (1%) Query: 27 SELSRQLDLWLSKAD---LTHGPARAIIAPHXXXXXXXXXXXXXXRQVSPVVVKRIFILG 83 ++LS QL WL + L+ RAII PH RQ++P ++RIFILG Sbjct: 23 TQLSSQLSTWLESCENSVLSGYSVRAIIVPHAGYRHSGFCAAHAYRQINPDKIERIFILG 82 Query: 84 PSHHVRIAG-CALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDEQTDENEHSIEMHL 142 PSH + I CAL+ + +Y+TP +L ID IY++L+ F + + DE EHS+EM L Sbjct: 83 PSHRLDIGDTCALTCVSEYETPFCNLKIDTDIYSDLKKLSYFKVLTKNQDEAEHSVEMQL 142 Query: 143 PYIAKVMEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYLADPQNLLVISSDFCHWGSRF 202 P+IA +M+ K ++I+P++VG L+ E++ +G +L+ YL D +NL VISSDFCHWG RF Sbjct: 143 PFIAYIMKGKKDQYSIVPVVVGCLSTERQELFGKLLSNYLLDEKNLFVISSDFCHWGKRF 202 Query: 203 RYTWKDSSRGHIYQSIEWLDKLGMDLIEKMDPKAFTDYLNKYGNTICGRHPIGVLLQAIS 262 RY + D S G I+QSIE LD LG+ I+ + P++F YL K+ NTICGR IG+LL I Sbjct: 203 RYQYYDKSDGAIWQSIEKLDHLGLGAIQSLKPESFLQYLKKFSNTICGRRSIGLLLFMID 262 Query: 263 KLSSQSNAPKMSLKFLKYAQSSQCMN 288 + Q + LK L Y QS++C + Sbjct: 263 SI-RQKQLFNLELKVLYYTQSNRCQS 287 >UniRef50_Q0V4V5 Cluster: Putative uncharacterized protein; n=1; Phaeosphaeria nodorum|Rep: Putative uncharacterized protein - Phaeosphaeria nodorum (Septoria nodorum) Length = 336 Score = 213 bits (520), Expect = 5e-54 Identities = 121/303 (39%), Positives = 171/303 (56%), Gaps = 44/303 (14%) Query: 24 NSGSELSRQLDLWLSKADLTHGP-----------------ARAIIAPHXXXXXXXXXXXX 66 ++G +LS+QLD WL + P ARAIIAPH Sbjct: 15 SNGKQLSQQLDGWLEAVPSSTTPIGTASSEQGDVSIPTPNARAIIAPHAGYSYSGPAAAW 74 
Query: 67 XXRQ-------VSPVVVKRIFILGPSHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAELE 119 + VSP + KR+F+LGPSHH ++G A ++ DKY TPL DL ID + E++ Sbjct: 75 AYKSADWANACVSPQLCKRVFLLGPSHHHYLSGAATTACDKYATPLGDLIIDTALVQEIK 134 Query: 120 ATRQFDRMDEQTDENEHSIEMHLPYIAKVMEEYK--TSFTIIPILVGSLTPEKEAKYGAI 177 + M + DE EHS+EMHLPYI K++ + +S ++PI++G+ +P E+KYG++ Sbjct: 135 QEWGLETMSQDVDEAEHSLEMHLPYIYKMLSLHNNPSSVPLVPIMIGNTSPSTESKYGSL 194 Query: 178 LAPYLADPQNLLVISSDFCHWGSRFRYTWKDSSRG----------------HIYQSIEWL 221 LAPYL+DP N+ VISSDFCHWGSRFRYT+ +S G I++SI+ + Sbjct: 195 LAPYLSDPTNIFVISSDFCHWGSRFRYTYYESPDGASATQLTRKSKIDEDWPIHESIKAV 254 Query: 222 DKLGMDLIEKMDPKAFTDYLNKYGNTICGRHPIGVLLQAISKLSSQSNAPKMSLKFLKYA 281 DK MD +E K F + L + GNT+CGRHPIGV + A+ S+ K KF++Y Sbjct: 255 DKESMDAVESGHHKRFLEQLKETGNTVCGRHPIGVFMAAVE--SADVGEGKGRFKFVRYE 312 Query: 282 QSS 284 +SS Sbjct: 313 RSS 315 >UniRef50_Q4PAA6 Cluster: Putative uncharacterized protein; n=1; Ustilago maydis|Rep: Putative uncharacterized protein - Ustilago maydis (Smut fungus) Length = 346 Score = 204 bits (497), Expect = 3e-51 Identities = 117/268 (43%), Positives = 157/268 (58%), Gaps = 33/268 (12%) Query: 48 RAIIAPHXXXXXXXXXXXXXXRQVSPVVVKRIFILGPSHHVRIAGCALSSLDKYQTPLYD 107 RAII PH R + +K +FILGPSHHV + GCA+S+ Y+TPL + Sbjct: 65 RAIIGPHAGYSYSGPAAAYAYRTIDTSAIKTVFILGPSHHVYLDGCAVSACSSYETPLGN 124 Query: 108 LTIDKQIYAELEATRQFDRMDEQTDENEHSIEMHLPYIAKVMEEYKTSFTIIPILVGSLT 167 L I++ + EL +T +F M + DE+EHSIEMHLPYI KV + T I+PILVG++ Sbjct: 125 LPINRSVTHELLSTGRFSTMSKTEDEDEHSIEMHLPYIYKVFK--GTGIQIVPILVGAIN 182 Query: 168 PEKEAKYGAILAPYLADPQNLLVISSDFCHWGSRFRYTW-------------KDSSRG-- 212 +E ++G +LA YL DP+N V+SSDFCHWGSRFRYT+ SSR Sbjct: 183 TARENEFGKLLAKYLNDPENFFVVSSDFCHWGSRFRYTYYKPCGSNIAMNLTSRSSRSMF 242 Query: 213 ---HIYQSIEWLDKLGMDLI--------EKM--DPK-AFTDYLNKYGNTICGRHPIGVLL 258 I+QSI LD+ G+ I +K D + AF YL++ NT+CGRHPIGVLL Sbjct: 243 EGKPIHQSIRELDEAGILAITYPWSRDRQKTAEDARLAFAKYLSETKNTVCGRHPIGVLL 302 Query: 259 QAISKLSSQSNAPKMSLKFLKYAQSSQC 286 A+++L + K +F +Y QSSQC Sbjct: 303 
AALAEL--ERRGQKTECRFTRYEQSSQC 328 >UniRef50_A0C8S3 Cluster: Chromosome undetermined scaffold_159, whole genome shotgun sequence; n=5; Oligohymenophorea|Rep: Chromosome undetermined scaffold_159, whole genome shotgun sequence - Paramecium tetraurelia Length = 345 Score = 194 bits (474), Expect = 2e-48 Identities = 99/266 (37%), Positives = 154/266 (57%), Gaps = 11/266 (4%) Query: 24 NSGSELSRQLDLWL--SKADLTH-GPARAIIAPHXXXXXXXXXXXXXXRQVS---PVVVK 77 + +EL Q++ WL +KA++T +A++ PH + + P Sbjct: 63 SKSNELKIQINCWLEQAKAEVTTVAQLKALVVPHAGYAYSGPTAAFSYKYLKKYPPSEKL 122 Query: 78 RIFILGPSHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDEQTDENEHS 137 ++FILGP H+V I C L+ + Y+TPL ++ +D + +L F++ D+ +E EHS Sbjct: 123 KVFILGPCHYVYITQCCLTRQEIYETPLGNIKVDLETVKQLHEQGLFEQSDKDAEEEEHS 182 Query: 138 IEMHLPYIAKVMEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYLADPQNLLVISSDFCH 197 IEM LP++A ++ +FTIIPI+VGS+ + E YG +L+ Y L +IS+DFCH Sbjct: 183 IEMQLPFLAHILGT--DNFTIIPIMVGSIDAKSEEYYGRLLSEYFDMDDTLFIISTDFCH 240 Query: 198 WGSRFRYTWKDSSRGHIYQSIEWLDKLGMDLIEKMDPKAFTDYLNKYGNTICGRHPIGVL 257 WG++F YT+ +S+ G I++SIE LD+ M+ IE D F DYL +Y N +CG+H I +L Sbjct: 241 WGTKFAYTYYNSADGEIFESIEKLDQKAMEHIELHDLDKFNDYLREYENNVCGKHCIAIL 300 Query: 258 LQAISKLSSQSNAPKMSLKFLKYAQS 283 L I + N M KF++YAQS Sbjct: 301 LHCI---AMSQNTHMMETKFIRYAQS 323 >UniRef50_A0CYK3 Cluster: Chromosome undetermined scaffold_31, whole genome shotgun sequence; n=4; Oligohymenophorea|Rep: Chromosome undetermined scaffold_31, whole genome shotgun sequence - Paramecium tetraurelia Length = 294 Score = 192 bits (468), Expect = 1e-47 Identities = 112/270 (41%), Positives = 153/270 (56%), Gaps = 20/270 (7%) Query: 23 LNSGSELSRQLDLWLSKADLTHGP-ARAIIAPHXXXXXXXXXXXXXXRQVSPVVVKRIFI 81 + G +L QL+ +LSKA P +AII PH + + R+F+ Sbjct: 19 IGDGKQLDAQLNDFLSKAKGETIPNIKAIIGPHAGFSYSGPTAAFAYQHLVQKERMRVFL 78 Query: 82 LGPSHHVRIAGCALSSLDKYQTPLYDLTID----KQIYAELEATRQFDRMDEQTDENEHS 137 LGP HH I G LS L++Y+TPL ++ +D KQ+ AEL+ F D +E EHS Sbjct: 79 LGPCHHTYIKGIGLSELEQYETPLGNIELDQPTIKQLSAELKKNYVFTNKD--IEEQEHS 136 
Query: 138 IEMHLPYIAKVMEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYLADPQNLLVISSDFCH 197 +EMHLP+I K+ + K +IPI+VG+ + E++A+ ++L Y DP + VISSDFCH Sbjct: 137 LEMHLPFIYKIFPKCK----LIPIMVGATSEEQDAQVASVLVKYFVDPNTVFVISSDFCH 192 Query: 198 WGSRFRYTWKDSSRGHIYQSIEWLDKLGMDLIEKMDPKAFTDYLNKYGNTICGRHPIGVL 257 WG RF+YT + G I+QSI LD + LIE + K F YL++ NTICGRHPI VL Sbjct: 193 WGKRFQYTPYNKEHGEIHQSIAQLDGQAIKLIESHNIKEFYKYLDETENTICGRHPICVL 252 Query: 258 LQAISKLSSQSNAPKMSLK--FLKYAQSSQ 285 L I N K+ LK +YAQSSQ Sbjct: 253 LNII-------NLSKLQLKTQLARYAQSSQ 275 >UniRef50_A2QP92 Cluster: Similarity to hypothetical protein At2g25280 - Arabidopsis thaliana; n=1; Aspergillus niger|Rep: Similarity to hypothetical protein At2g25280 - Arabidopsis thaliana - Aspergillus niger Length = 315 Score = 183 bits (446), Expect = 4e-45 Identities = 111/283 (39%), Positives = 150/283 (53%), Gaps = 29/283 (10%) Query: 27 SELSRQLDLWLSKA-DLTHG-------PARAIIAPHXXXXXXXXXXXXXXRQVSPVVVKR 78 S LS QLD WL + D G AR IIAPH + + KR Sbjct: 18 STLSYQLDHWLQEVPDEIEGIGQLPVPGARMIIAPHAGYAYSGRCAAFAYKALDLSQAKR 77 Query: 79 IFILGPSHHVRIAGCALSSLDKYQTPLYD--LTIDKQIYAELEATR---------QFDRM 127 IF++GPSHH AL Y TPL D L +D + A+L +TR QF M Sbjct: 78 IFVVGPSHHHYFTTLALPEFTSYHTPLSDDPLPLDTEFIAKLRSTRAGSRNGLELQFTTM 137 Query: 128 DEQTDENEHSIEMHLPYIAKVMEEYKTSFT------IIPILVGSLTPEKEAKYGAILAPY 181 DE EHSIE+HLPYI ++++ + + ++PILVG++T E +GA+LAPY Sbjct: 138 SRSVDEAEHSIELHLPYIHRLLQRQRPNQPTSEYPPLVPILVGAVTESTEKAFGALLAPY 197 Query: 182 LADPQNLLVISSDFCHWGSRFRYTWKDSSRGHIYQSIEWLDKLGMDLIEKMDPKAFTDYL 241 + DP+N VISSDFCHWG RFRYT S I++SI +D M I + F+ L Sbjct: 198 IDDPENAFVISSDFCHWGQRFRYT---SREPPIHESISAVDLATMAAITTGEYARFSTIL 254 Query: 242 NKYGNTICGRHPIGVLLQAISKLSSQSNAPKMSLKFLKYAQSS 284 GNT+CGRHPIGV++ I ++ ++ K F++Y +SS Sbjct: 255 KNTGNTVCGRHPIGVIMAGIEEI-RKNEGEKGRFHFIRYDRSS 296 >UniRef50_UPI000023E320 Cluster: hypothetical protein FG00949.1; n=1; Gibberella zeae PH-1|Rep: hypothetical protein FG00949.1 - Gibberella zeae PH-1 Length = 390 Score = 163 bits (397), Expect = 4e-39 Identities = 
94/260 (36%), Positives = 134/260 (51%), Gaps = 22/260 (8%) Query: 47 ARAIIAPHXXXXXXXXXXXXXXRQVSPVVVKRIFILGPSHHVRIAGCALSSLDKYQTPLY 106 AR +IAPH + + KR+F+LGPSH + GCA + KY TP Sbjct: 45 ARVVIAPHAGYEYSGPCAAWAYKTLDLSCAKRVFVLGPSHTYYLEGCAATIFGKYATPFG 104 Query: 107 DLTIDKQIYAELEATRQFDRMDEQTDENEHSIEMHLPYI-AKVMEEYKT--SF-TIIPIL 162 DL ID + ELE ++M Q + NEHS+EMH+PY+ + E ++T F I+P+L Sbjct: 105 DLEIDVDMAKELEDAIMMEKMPRQGEINEHSLEMHMPYLYLRCEETFETPDKFPKIVPVL 164 Query: 163 VGSLTPEKEAKYGAILAPYLADPQNLLVISSDFCHWGSRFRYTWKDSSRG---------- 212 VGS T ++E G L PYL DP+N +ISSDFCHWGS F Y + Sbjct: 165 VGSNTAKEEKVIGRALLPYLRDPENAFIISSDFCHWGSGFSYLPYSPTNSPSDLTQLKKR 224 Query: 213 -------HIYQSIEWLDKLGMDLIEKMDPKAFTDYLNKYGNTICGRHPIGVLLQAISKLS 265 I+++I +D+ MD +E AF L + NT+CGRHPIGV + A+ L Sbjct: 225 DPKPDGPPIHETIRVIDQAAMDAVETGSHDAFISTLKQTRNTVCGRHPIGVTMAALELLQ 284 Query: 266 SQSN-APKMSLKFLKYAQSS 284 ++ K ++Y +S+ Sbjct: 285 KEAGFEEKGRFSIIQYNRSN 304 >UniRef50_Q4U9C7 Cluster: Putative uncharacterized protein; n=2; Theileria|Rep: Putative uncharacterized protein - Theileria annulata Length = 297 Score = 162 bits (393), Expect = 1e-38 Identities = 91/218 (41%), Positives = 125/218 (57%), Gaps = 9/218 (4%) Query: 70 QVSPVVVKRIFILGPSHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDE 129 Q+ +K IF+LGPSHH + GCA+ QTPL L +D I +L + F ++ Sbjct: 67 QIDATSIKTIFVLGPSHHFFLRGCAVDRFSSLQTPLGVLQVDVDIVEKLSDLKGFSVINN 126 Query: 130 QTDENEHSIEMHLPYIAKVMEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYLADPQNLL 189 + E+EHSIEMHLP + V + K ++PI+VG + + L PY D L Sbjct: 127 EASEDEHSIEMHLPLLKFVFK--KEHVKVVPIMVGEFSESLADELTGALVPYFNDENTLF 184 Query: 190 VISSDFCHWGSRFRY--TWKDSSRGHIYQSIEWLDKLGMDLIEKMDPKAFTDYLNKYGNT 247 VISSDFCH+GSRF++ T +S +Y+ IE LDK G+DLI F YLN+ NT Sbjct: 185 VISSDFCHFGSRFQFSITGYESENKPLYEKIEMLDKRGIDLIVNHKYDDFLWYLNETENT 244 Query: 248 ICGRHPIGVLLQAISKLSSQSNAPKMSLKFLKYAQSSQ 285 ICGR+PI +LL +L + SN +S K L Y+QSS+ Sbjct: 245 ICGRNPILLLL----RLLAASNL-NISSKLLHYSQSSR 277 >UniRef50_Q6CB70 Cluster: Similar to sp|P47085 Saccharomyces cerevisiae YJR008w; n=1; 
Yarrowia lipolytica|Rep: Similar to sp|P47085 Saccharomyces cerevisiae YJR008w - Yarrowia lipolytica (Candida lipolytica) Length = 319 Score = 161 bits (392), Expect = 2e-38 Identities = 92/281 (32%), Positives = 144/281 (51%), Gaps = 23/281 (8%) Query: 26 GSELSRQLDLWLSKADLTHGP-ARAIIAPHXXXXXXXXXXXXXXRQVSPVVVKRIFILGP 84 G+E+ R L S + P AR ++ PH +KR+FILGP Sbjct: 22 GAEVDRHLANGASVLGKSAIPGARVLVGPHAGLAYAGPQLGETYAAFDFKNIKRLFILGP 81 Query: 85 SHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDEQTDENEHSIEMHLPY 144 SHHV + A S+ Y+TP ++ +D + +L + M TD++EHS EMH+P+ Sbjct: 82 SHHVYLEHAATSAFHSYETPFGNVNVDVETTQKLNDSGVTKYMSATTDKDEHSFEMHMPF 141 Query: 145 IAKVMEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYLADPQNLLVISSDFCHWGSRFRY 204 + +++ + I+PI+VG + E E + +L PY+ DP N VIS+DFCHWG+ FRY Sbjct: 142 LKRLVGDQNVK--IVPIMVGQTSQEYEKRLAKLLLPYVEDPTNAFVISTDFCHWGNNFRY 199 Query: 205 -TWKDS-------------------SRGHIYQSIEWLDKLGMDLIEKMDPKAFTDYLNKY 244 + DS S IY+SIE+LDK GM++ + +Y K Sbjct: 200 WGYADSENCDNVSQSREELRRALKRSNTPIYKSIEYLDKKGMEVASLTSYDKWKEYCKKT 259 Query: 245 GNTICGRHPIGVLLQAISKLSSQSNAPKMSLKFLKYAQSSQ 285 NTICGR P+ +L+ + + + +SL+++ Y+QS+Q Sbjct: 260 DNTICGRKPLAILISMLENYAIEKGDKPISLEWIGYSQSNQ 300 >UniRef50_A4R520 Cluster: Putative uncharacterized protein; n=2; Sordariomycetes|Rep: Putative uncharacterized protein - Magnaporthe grisea (Rice blast fungus) (Pyricularia grisea) Length = 364 Score = 160 bits (389), Expect = 4e-38 Identities = 88/217 (40%), Positives = 123/217 (56%), Gaps = 26/217 (11%) Query: 77 KRIFILGPSHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDEQTDENEH 136 KRIF+LGPSH ++GCAL++ Y+TPL +L +D +L T +F + DE+EH Sbjct: 102 KRIFVLGPSHTYYLSGCALTTYATYETPLGNLRVDLDTIKQLRDTGKFKDIPRDNDEDEH 161 Query: 137 SIEMHLPYIAKVMEEY---------KTSF-TIIPILVGSLTPEKEAKYGAILAPYLADPQ 186 S+EMHLPY+AK + + S+ ++PIL+G + E +G +L P+L DP Sbjct: 162 SLEMHLPYLAKRLTQTFGGGSDGDGDASWPPVVPILIGDNKRDAEKAFGELLLPHLRDPD 221 Query: 187 NLLVISSDFCHWGSRFRYTWKDS--------SRGH--------IYQSIEWLDKLGMDLIE 230 N ++SSDFCHWG+RF YT + S G I++ I LD L MD IE Sbjct: 222 
NAFIVSSDFCHWGNRFSYTKYTADGTVEGVRSLGRADRNLPVPIHEGIRVLDHLAMDAIE 281 Query: 231 KMDPKAFTDYLNKYGNTICGRHPIGVLLQAISKLSSQ 267 AF D L GNT+CGRHPIGV++ A+ L + Sbjct: 282 TGSHDAFYDNLKATGNTVCGRHPIGVVMAALEMLKKE 318 >UniRef50_A5K624 Cluster: Putative uncharacterized protein; n=4; Plasmodium|Rep: Putative uncharacterized protein - Plasmodium vivax Length = 296 Score = 157 bits (380), Expect = 4e-37 Identities = 92/269 (34%), Positives = 139/269 (51%), Gaps = 14/269 (5%) Query: 24 NSGSELSRQLDLWLSKADLTHGPARAIIAPHXXXXXXXXXXXXXXRQVSPVVVKRIFILG 83 +SG L +D +A I PH +S VK IFILG Sbjct: 16 SSGRALKNSIDTHFESISCKKQSVKAAICPHAGYDYALQTNSHVYACISVENVKNIFILG 75 Query: 84 PSHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAEL---EATRQFDRMDEQTDENEHSIEM 140 P+HH+ GC ++KY+TP L I++++ +E+ + FD + ++ DE EHSIEM Sbjct: 76 PNHHIYNKGCLFPHVEKYETPFGFLQINREVISEILQNDVDHLFDFIGDEDDEEEHSIEM 135 Query: 141 HLPYIAKVMEEYKTSFTIIPILVGSLTPE--KEAKYGAILAPYLADPQNLLVISSDFCHW 198 LP I +++E IIPI VG + + K ++ L Y D NL + SSDFCH+ Sbjct: 136 QLPLIKYIIKE--KDIKIIPIYVGCIGNDIQKIDRFCNPLKKYFQDEGNLFLFSSDFCHY 193 Query: 199 GSRFRYT--WKDSSRGHIYQSIEWLDKLGMDLIEKMDPKAFTDYLNKYGNTICGRHPIGV 256 G RF +T + + HI++ +E +D+ +I + D F +YLNK NTICG +PI + Sbjct: 194 GRRFSFTNILQKYNDTHIFKQVENMDRDAASIISRHDIADFIEYLNKTHNTICGSNPIKM 253 Query: 257 LLQAISKLSSQSNAPKMSLKFLKYAQSSQ 285 +LQ + L K+S K + Y+QS+Q Sbjct: 254 MLQLLQDLPG-----KVSTKLMHYSQSNQ 277 >UniRef50_A2DDC6 Cluster: Putative uncharacterized protein; n=1; Trichomonas vaginalis G3|Rep: Putative uncharacterized protein - Trichomonas vaginalis G3 Length = 292 Score = 153 bits (371), Expect = 5e-36 Identities = 83/244 (34%), Positives = 130/244 (53%), Gaps = 10/244 (4%) Query: 26 GSELSRQLDLWLSKADLTH---GPARAIIAPHXXXXXXXXXXXXXXRQVSPVVVKRIFIL 82 G EL LD S A+++ G +AIIAPH + + P R+ IL Sbjct: 18 GQELKEMLDESFSNANVSQDKKGIVKAIIAPHAGYVYSVATASYAYKAIDPSNFDRVVIL 77 Query: 83 GPSHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAEL--EATRQFDRMDEQTDENEHSIEM 140 GPSH + + C +++ D +TP + ID++ EL + F + EHS+EM Sbjct: 78 
GPSHRIYVKKCTIAAADGCETPYGTVPIDRKAADELLQKYPDSFQVLSIDQSAKEHSLEM 137 Query: 141 HLPYIAKVMEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYLADPQNLLVISSDFCHWGS 200 LP + V + F++IPI++G L + + L P ++DP+ LLVISSDFCHWG+ Sbjct: 138 QLPLLKYVFGD--KPFSVIPIMIGDLKEAQHKQVVEALTPIISDPKTLLVISSDFCHWGN 195 Query: 201 RFRYTW--KDSSRGH-IYQSIEWLDKLGMDLIEKMDPKAFTDYLNKYGNTICGRHPIGVL 257 F Y + K+ + +Y+ IE LDK+ + ++ DPK FT Y+++ NTICG PI + Sbjct: 196 NFDYFYLPKEIEKSEPVYKRIERLDKMAWEYVKDHDPKGFTKYISETENTICGYVPITMA 255 Query: 258 LQAI 261 ++ + Sbjct: 256 MEIL 259 >UniRef50_Q38B52 Cluster: Putative uncharacterized protein; n=2; Trypanosoma|Rep: Putative uncharacterized protein - Trypanosoma brucei Length = 323 Score = 150 bits (363), Expect = 5e-35 Identities = 87/227 (38%), Positives = 129/227 (56%), Gaps = 20/227 (8%) Query: 76 VKRIFILGPSHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAELE-----ATRQFDRMDEQ 130 + RIF+LGPSHH G + + +Y+TP L ++ ++ E+E A M Sbjct: 79 ITRIFLLGPSHHKGFDGVEVCAAQRYETPFGPLVVNAKVGQEVEKELRAAGVPVGTMHRM 138 Query: 131 TDENEHSIEMHLPYIAKVMEE----YKTSFT---IIPILVGSLTPEKEAKYGAILAPYLA 183 TDE+EHSIEM LP+I+ ++ YK + ++P+L+G + E G++L+ YL Sbjct: 139 TDEDEHSIEMQLPFISHLLHYPPNGYKPAMDRVELVPLLIGGTNRKMENLIGSVLSKYLK 198 Query: 184 DPQNLLVISSDFCHWGSRFRYT--WKDSSRGHIYQSIEWLDKLGMDLIEKMDPKAFTDYL 241 D QN VISSDFCHWG+RF+Y ++ + I +I +D GM L+E D + YL Sbjct: 199 DNQNFFVISSDFCHWGARFQYMYHYEKAEYPDIGDAIISMDHEGMRLLEARDMDGWYKYL 258 Query: 242 NKYGNTICGRHPIGVLLQAISKLSSQSNAPKMSLKFLKYAQSSQCMN 288 + NTICGR PI VL+ A L S+ A ++FL Y+QS++C N Sbjct: 259 STTNNTICGRRPISVLMAA---LDSKKEA---VVRFLHYSQSNRCKN 299 >UniRef50_A2DWN3 Cluster: Putative uncharacterized protein; n=1; Trichomonas vaginalis G3|Rep: Putative uncharacterized protein - Trichomonas vaginalis G3 Length = 286 Score = 150 bits (363), Expect = 5e-35 Identities = 79/221 (35%), Positives = 115/221 (52%), Gaps = 5/221 (2%) Query: 45 GPARAIIAPHXXXXXXXXXXXXXXRQVSPVVVKRIFILGPSHHVRIAGCALSSLDKYQTP 104 G +A+I+PH + P + R+ I+GPSH + I C +S ++TP Sbjct: 36 GKVKAVISPHAGYRHCAETASHAFATIDPSLYSRVIIMGPSHRLPIDYCTISEAKSFETP 95 
Query: 105 LYDLTIDKQIYAELEATRQ--FDRMDEQTDENEHSIEMHLPYIAKVMEEYKTSFTIIPIL 162 L ID I EL + F ++ +T EHS+E+ LP+I + + + T++PI+ Sbjct: 96 TRSLEIDP-IAEELTSKYGSIFKKLSIETSNREHSLELMLPWIDYIFKG--KNVTVVPIM 152 Query: 163 VGSLTPEKEAKYGAILAPYLADPQNLLVISSDFCHWGSRFRYTWKDSSRGHIYQSIEWLD 222 VG L K + + L PY+ DP LLVISSDF HWGSRF YT+ G I++ I +D Sbjct: 153 VGHLDQTKLEQAVSALKPYINDPSTLLVISSDFTHWGSRFSYTYLPEKDGEIWEKISAID 212 Query: 223 KLGMDLIEKMDPKAFTDYLNKYGNTICGRHPIGVLLQAISK 263 M+ I K F DY+ TICGR+PI + + A K Sbjct: 213 HGAMEAISTCKAKNFQDYIKSTRATICGRNPITIAMMAFDK 253 >UniRef50_Q7S447 Cluster: Putative uncharacterized protein NCU02459.1; n=4; Pezizomycotina|Rep: Putative uncharacterized protein NCU02459.1 - Neurospora crassa Length = 355 Score = 149 bits (360), Expect = 1e-34 Identities = 80/194 (41%), Positives = 110/194 (56%), Gaps = 12/194 (6%) Query: 23 LNSGSELSRQLDLWLSKA-------DLTHGPARAIIAPHXXXXXXXXXXXXXXRQVSPVV 75 L + + LS QLD ++S+ DL AR IIAPH + + Sbjct: 31 LGNAARLSSQLDEFMSRVPNKLDGRDLPIPGARVIIAPHAGYSYSGPCAAWAYKILDLAN 90 Query: 76 VKRIFILGPSHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDEQTDENE 135 VKR+F+LGPSH + GCALS+ KY TP DL +D + EL ++F + + D E Sbjct: 91 VKRVFLLGPSHTFYLKGCALSTFGKYSTPFGDLVVDGKAVDELMEDQKFSPIPVEYDIRE 150 Query: 136 HSIEMHLPYIAKVMEEY----KTSF-TIIPILVGSLTPEKEAKYGAILAPYLADPQNLLV 190 H +EMHLPY+ K +E+ + F I+P+LVG L+ + E G+ILAPYLADP+N + Sbjct: 151 HCLEMHLPYLWKRLEQTLGGDSSQFPPIVPVLVGDLSADGEKAVGSILAPYLADPKNAFI 210 Query: 191 ISSDFCHWGSRFRY 204 ISSDFCHWG + Y Sbjct: 211 ISSDFCHWGKNYHY 224 Score = 56.4 bits (130), Expect = 9e-07 Identities = 29/72 (40%), Positives = 44/72 (61%), Gaps = 1/72 (1%) Query: 214 IYQSIEWLDKLGMDLIEKMDPKAFTDYLNKYGNTICGRHPIGVLLQAISKL-SSQSNAPK 272 I++ I+ LD L MD I+ D + L NT+CGRHPIGV+L A+ K+ +QS K Sbjct: 265 IHEVIKALDDLVMDSIKTGDHSDYYSILKGTNNTVCGRHPIGVVLAALEKMGGAQSGESK 324 Query: 273 MSLKFLKYAQSS 284 +F++Y +S+ Sbjct: 325 GKFQFVQYQRSN 336 >UniRef50_Q10212 Cluster: UPF0103 protein C4H3.04c; n=1; Schizosaccharomyces pombe|Rep: UPF0103 protein C4H3.04c - Schizosaccharomyces 
pombe (Fission yeast) Length = 309 Score = 146 bits (355), Expect = 5e-34 Identities = 93/277 (33%), Positives = 137/277 (49%), Gaps = 27/277 (9%) Query: 29 LSRQLDLWLSKADLTHGPARAIIAPHXXXXXXXXXXXXXXRQVSPVVVKRIFILGPSHHV 88 L++QL ++ G R +I+PH +Q+ ++R+F+ GPSHH+ Sbjct: 22 LTKQLKSFIKNPTPETGK-RFVISPHAGYMYSGKVASQGFQQLDFSKIQRVFVFGPSHHI 80 Query: 89 RIAGCALSSLDKYQTPLYDLTIDKQIYAELEAT-RQFDRMDEQTDENEHSIEMHLPYIA- 146 C +S TPL DL +D+ + +L A+ FD M DE+EHS+EM P +A Sbjct: 81 FTRKCLVSRASICSTPLGDLKVDEDLCQKLVASDNSFDSMTLDVDESEHSLEMQFPLLAF 140 Query: 147 -KVMEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYLADPQNLLVISSDFCHWGSRFRYT 205 + + I+PI++G+LT L+ Y+ D N VISSDFCHWG RF YT Sbjct: 141 HLLKQGCLGKVKIVPIMIGALTSTTMMAAAKFLSQYIKDESNSFVISSDFCHWGRRFGYT 200 Query: 206 -------------WKDSSRG-----HIYQSIEWLDKLGMDLIEKMDPKAFTDYLNKYGNT 247 K RG IY+SI LD +GM +IE F++YL NT Sbjct: 201 LYLNDTNQLEDAVLKYKRRGGPTSPKIYESISNLDHIGMKIIETKSSDDFSEYLKTTQNT 260 Query: 248 ICGRHPIGVLLQAISKLSSQSNAPKMSLKFLKYAQSS 284 ICGR+PI ++++++ + KF+ YAQSS Sbjct: 261 ICGRYPIELIMKSMECANFSER-----FKFISYAQSS 292 >UniRef50_A5DB33 Cluster: Putative uncharacterized protein; n=1; Pichia guilliermondii|Rep: Putative uncharacterized protein - Pichia guilliermondii (Yeast) (Candida guilliermondii) Length = 328 Score = 143 bits (346), Expect = 6e-33 Identities = 95/271 (35%), Positives = 134/271 (49%), Gaps = 34/271 (12%) Query: 25 SGSELSRQLDLWLSKAD----LTHGP-----ARAIIAPHXXXXXXXXXXXXXXRQVSPVV 75 + + L+ Q++ ++SKA +HG AR +I PH Sbjct: 16 NNASLASQMERFISKAQNNLKKSHGGPHVPGARVLIGPHAGYTYSGTQLAETYEAWDTTG 75 Query: 76 VKRIFILGPSHHVRIAGCA-LSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDEQTDEN 134 VKR+FILGPSHHV + A +S D YQTP +L +D ++ +EL F M E+ DEN Sbjct: 76 VKRVFILGPSHHVYFSSTAKVSKFDSYQTPFGNLDVDTKVCSELVDKGAFSYMTEEEDEN 135 Query: 135 EHSIEMHLPYIA-KVMEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYLADPQNLLVISS 193 EHS EMH P+I K + S I+PI++ ++ K L PY AD N +SS Sbjct: 136 EHSFEMHAPFIRYKTKDLPHGSPKIVPIMISAMNERLYNKIVKALEPYFADKSNTFAVSS 195 Query: 194 DFCHWGSRFRYT-------------------WKDSSR----GHIYQSIEWLDKLGMDLIE 230 
DFCHWG+RF YT K SS+ I++SIE LDK M + Sbjct: 196 DFCHWGARFGYTKYLQKIPDSEGITSQSLVSLKSSSQLVQSIPIHRSIEILDKEAMKIAS 255 Query: 231 KMDPKAFTDYLNKYGNTICGRHPIGVLLQAI 261 K + Y+++ NTICG+ PI V+L+ + Sbjct: 256 KGTHTDWNRYIDETQNTICGQKPISVVLRLL 286 >UniRef50_Q4WHW4 Cluster: DUF52 domain protein; n=6; Trichocomaceae|Rep: DUF52 domain protein - Aspergillus fumigatus (Sartorya fumigata) Length = 402 Score = 141 bits (342), Expect = 2e-32 Identities = 86/205 (41%), Positives = 112/205 (54%), Gaps = 25/205 (12%) Query: 27 SELSRQLDLWLSKA--------DLTHGPARAIIAPHXXXXXXXXXXXXXXRQVSPVVVKR 78 S L+RQLD WL+ L +R IIAPH R + KR Sbjct: 54 STLTRQLDQWLAHVPNEIEGIGSLPVPGSRVIIAPHAGYAYSGPCAAYAYRALDLSKAKR 113 Query: 79 IFILGPSHHVRIAGCALSSLDKYQTPLYD--LTIDKQIYAEL---------EATRQFDRM 127 IFILGPSHH ++ AL L Y TPL D L +D ++ A+L +T F M Sbjct: 114 IFILGPSHHHYLSTLALPQLTSYYTPLSDEPLPLDTELIAKLLSAKAVKPNGSTVSFTTM 173 Query: 128 DEQTDENEHSIEMHLPYIAKVME-EYKTSFT-----IIPILVGSLTPEKEAKYGAILAPY 181 DE+EHSIE+HLPYI ++++ ++ T T ++PILVGS + E +GA+LA Y Sbjct: 174 TRSVDEDEHSIELHLPYIHRLLQLQHPTKRTSQYPPLVPILVGSTSASTEQAFGALLASY 233 Query: 182 LADPQNLLVISSDFCHWGSRFRYTW 206 L DP N+ VISSDFCHWG RF YT+ Sbjct: 234 LEDPSNVFVISSDFCHWGLRFSYTY 258 Score = 61.7 bits (143), Expect = 2e-08 Identities = 28/75 (37%), Positives = 44/75 (58%) Query: 214 IYQSIEWLDKLGMDLIEKMDPKAFTDYLNKYGNTICGRHPIGVLLQAISKLSSQSNAPKM 273 I++SI D M I + + F D + + GNT+CGRHPIGV++ AI +Q + K Sbjct: 313 IHESISAFDIATMAAIATGETENFLDVIQRTGNTVCGRHPIGVIMAAIEATRTQEDGKKG 372 Query: 274 SLKFLKYAQSSQCMN 288 + F++Y +SS +N Sbjct: 373 AFHFIRYERSSDAVN 387 >UniRef50_Q5KH61 Cluster: Putative uncharacterized protein; n=1; Filobasidiella neoformans|Rep: Putative uncharacterized protein - Cryptococcus neoformans (Filobasidiella neoformans) Length = 346 Score = 141 bits (341), Expect = 2e-32 Identities = 70/159 (44%), Positives = 90/159 (56%), Gaps = 1/159 (0%) Query: 47 ARAIIAPHXXXXXXXXXXXXXXRQVSPVVVKRIFILGPSHHVRIAGCALSSLDKYQTPLY 106 A+AIIAPH V +KR+F+LGPSHH + G ALS + Y+TPL Sbjct: 47 
AKAIIAPHAGYSYSGPAAAWAYAAVPTEKIKRVFLLGPSHHAYLPGVALSKFEAYETPLG 106 Query: 107 DLTIDKQIYAELEATRQFDRMDEQTDENEHSIEMHLPYIAKVMEEYKTSFTIIPILVGSL 166 D+ +D EL T F M TDE+EHS+EMHLPYI +++ + + ++PILVG Sbjct: 107 DIPLDTDTINELRDTGIFSDMKSSTDEDEHSLEMHLPYI-RLIFQGRDDLKLVPILVGHP 165 Query: 167 TPEKEAKYGAILAPYLADPQNLLVISSDFCHWGSRFRYT 205 + AK LA Y D + VISSDFCHWGSRF T Sbjct: 166 SASTSAKLSEALAKYWQDGETFFVISSDFCHWGSRFSCT 204 Score = 56.8 bits (131), Expect = 6e-07 Identities = 31/79 (39%), Positives = 47/79 (59%), Gaps = 5/79 (6%) Query: 214 IYQSIEWLDKLGMDLIEKMDPKAFTD----YLNKYGNTICGRHPIGVLLQAISKLSSQSN 269 I++SIE++D GMDL+ K + YL + NTICGR+PI VLL + + ++ Sbjct: 252 IWKSIEYMDHEGMDLLRKPGEDGAVEKWHGYLERTKNTICGRNPITVLLNLV-QFVYKNQ 310 Query: 270 APKMSLKFLKYAQSSQCMN 288 K F++Y QSS+C+N Sbjct: 311 PVKPEFVFVRYEQSSKCVN 329 >UniRef50_Q4Q1W0 Cluster: Putative uncharacterized protein; n=3; Leishmania|Rep: Putative uncharacterized protein - Leishmania major Length = 370 Score = 139 bits (337), Expect = 7e-32 Identities = 85/229 (37%), Positives = 123/229 (53%), Gaps = 23/229 (10%) Query: 76 VKRIFILGPSHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAEL-----EATRQFDRMDEQ 130 ++RIFILGPSH GC LS+ Y+TP L +D + + +A + Sbjct: 130 LERIFILGPSHTRGFEGCELSAASAYETPFGPLRVDTAVVDRVITDLRKAGVGAATASRR 189 Query: 131 TDENEHSIEMHLPYIAKVMEEYKTS-----------FTIIPILVGSLTPEKEAKYGAILA 179 TDE EHSIEM PY++ ++ T+ I+PI+VG + E +L Sbjct: 190 TDEAEHSIEMETPYLSHILHYPPTTTGAPVQPAAGRVAIVPIIVGWTNRQDEKAICDVLK 249 Query: 180 PYLADPQNLLVISSDFCHWGSRFRYT--WKDSSRGHIYQSIEWLDKLGMDLIEKMDPKAF 237 PY+ D +N + SSDFCHWG RF YT +K S +I SI +D M+L+EK D + + Sbjct: 250 PYMDDARNFFICSSDFCHWGERFSYTYHYKRSEYPNIGDSIIAMDHAAMELLEKRDLERW 309 Query: 238 TDYLNKYGNTICGRHPIGVLLQAISKLSSQSNAPKMSLKFLKYAQSSQC 286 YL NTICGR PI + +Q + +S+ N K +KF+ Y+QS++C Sbjct: 310 YAYLQMTKNTICGRAPISIGMQ---RWASKGN--KARVKFVHYSQSNKC 353 >UniRef50_Q1DNQ3 Cluster: Putative uncharacterized protein; n=1; Coccidioides immitis|Rep: Putative uncharacterized protein - Coccidioides immitis Length = 383 Score = 136 bits (330), Expect 
= 5e-31 Identities = 78/204 (38%), Positives = 112/204 (54%), Gaps = 21/204 (10%) Query: 24 NSGSELSRQLDLWLSKA--------DLTHGPARAIIAPHXXXXXXXXXXXXXXRQVSPVV 75 ++ + L+RQLD W+++ L AR IIAPH + + Sbjct: 15 DNAATLTRQLDEWMNRVPNEIEGIGSLPVAGARIIIAPHAGYAYSGPCAAFAYKSLDLSK 74 Query: 76 VKRIFILGPSHHVRIAGCALSSLDKYQTPLYD--LTIDKQIYAELEA-----TRQFDRMD 128 KRIF+LGPSHH + AL L Y TPL L +D++I EL T +F M+ Sbjct: 75 AKRIFLLGPSHHHPFSKIALPELSSYSTPLSQEPLPLDREIIDELSTRTENGTVRFTTMN 134 Query: 129 EQTDENEHSIEMHLPYIAKVME-----EYKTSFT-IIPILVGSLTPEKEAKYGAILAPYL 182 + DE EHS+E+HLPYI +++ E S+ ++P++VGS + E +G ILAPYL Sbjct: 135 QAIDEAEHSLELHLPYIHYLLQRLYPGEPAASYPKLVPMMVGSTSAPTEQAFGRILAPYL 194 Query: 183 ADPQNLLVISSDFCHWGSRFRYTW 206 A+P+N ++SSDFCHWG RF Y + Sbjct: 195 ANPENAFIVSSDFCHWGLRFAYAY 218 Score = 47.2 bits (107), Expect = 5e-04 Identities = 23/48 (47%), Positives = 28/48 (58%) Query: 214 IYQSIEWLDKLGMDLIEKMDPKAFTDYLNKYGNTICGRHPIGVLLQAI 261 I++SI D M I + F D L GNTICGRHPIGV++ AI Sbjct: 277 IHESISACDIACMSAIASGQTQTFLDALKSTGNTICGRHPIGVIMAAI 324 >UniRef50_A3LWQ7 Cluster: Predicted protein; n=4; Saccharomycetales|Rep: Predicted protein - Pichia stipitis (Yeast) Length = 345 Score = 134 bits (324), Expect = 3e-30 Identities = 100/312 (32%), Positives = 146/312 (46%), Gaps = 50/312 (16%) Query: 24 NSGSELSRQLDLWLSKADLTHGP--------ARAIIAPHXXXXXXXXXXXXXXRQVSPVV 75 N+ ++L QL+ + KA+ G AR +I PH Sbjct: 16 NNPTKLGLQLEAYFHKAESHSGEDSRHIIPGARILIGPHAGFAYSGERLAETFTVWDTSK 75 Query: 76 VKRIFILGPSHHVRIAGCAL-SSLDKYQTPLYDLTIDKQIYAELEATRQ----------- 123 VKRIF+LGPSHHV + S + Y+TP ++ +D + +L T+ Sbjct: 76 VKRIFMLGPSHHVYFKNSVMVSQFEWYETPFGNIPVDTETIEKLLHTKPQSHGHSLTHAK 135 Query: 124 ---FDRMDEQTDENEHSIEMHLPYIAKVMEEYKTSFT-IIPILVGSLTPEKEAKYGAILA 179 F M E+ DE+EHS EMH P+I + + IIPIL+ + + + + L Sbjct: 136 DSVFKYMSEEMDEDEHSFEMHAPFIYQKTHDLPQGIPKIIPILISGMDEKLNDEVVSALL 195 Query: 180 PYLADPQNLLVISSDFCHWGSRFRY--------------TWKDSSRGH----------IY 215 PYL + +N +ISSDFCHWGSRF Y T SS GH IY Sbjct: 196 
PYLENEENHFIISSDFCHWGSRFGYTKYVPQKVDSLQLLTENLSSLGHSLRTKPNELPIY 255 Query: 216 QSIEWLDKLGMDLIEKMDPKAFTDYLNKYGNTICGRHPIGVLLQAISK--LSSQSNAPKM 273 +SIE LDK M++ + Y+++ GNTICG+ PI V+L+ I K L++ Sbjct: 256 KSIEVLDKAAMEIASSGSYSDWKTYISQTGNTICGQKPIAVVLKLIQKYRLAAGDTDKAA 315 Query: 274 SLKFLKYAQSSQ 285 K++ Y+QS+Q Sbjct: 316 IFKWIGYSQSNQ 327 >UniRef50_P47085 Cluster: UPF0103 protein YJR008W; n=6; Saccharomycetales|Rep: UPF0103 protein YJR008W - Saccharomyces cerevisiae (Baker's yeast) Length = 338 Score = 128 bits (309), Expect = 2e-28 Identities = 101/308 (32%), Positives = 146/308 (47%), Gaps = 51/308 (16%) Query: 24 NSGSELSRQLDLWLSKADLTHGP---ARAIIAPHXXXXXXXXXXXXXXRQVS-PVVVKRI 79 N ELS+QL +L K+ L GP AR II PH + VKRI Sbjct: 15 NRAQELSQQLHTYLIKSTLK-GPIHNARIIICPHAGYRYCGPTMAYSYASLDLNRNVKRI 73 Query: 80 FILGPSHHVRIAGCAL-SSLDKYQTPLYDLTIDKQIYAEL-------EATRQFDRMDEQT 131 FILGPSHH+ L S+ + +TPL +L +D + L + F MD T Sbjct: 74 FILGPSHHIYFKNQILVSAFSELETPLGNLKVDTDLCKTLIQKEYPENGKKLFKPMDHDT 133 Query: 132 DENEHSIEMHLPYIAKVMEEYKTSFT---IIPILVGSLTPEKEAKYGAILAPYLADPQNL 188 D EHS+EM LP + + ++ + S + P++V + + + G IL+ Y+ DP NL Sbjct: 134 DMAEHSLEMQLPMLVETLKWREISLDTVKVFPMMVSHNSVDVDRCIGNILSEYIKDPNNL 193 Query: 189 LVISSDFCHWGSRFRYTWKDSSRGHI--------------------------YQSIEWLD 222 ++SSDFCHWG RF+YT S+ + +QSIE +D Sbjct: 194 FIVSSDFCHWGRRFQYTGYVGSKEELNDAIQEETEVEMLTARSKLSHHQVPIWQSIEIMD 253 Query: 223 KLGMDLI------EKMDPKAFTDYLNKYGNTICGRHPIGVLLQAISKLSSQSNAPKMSLK 276 + M + E+ D A+ YL GNTICG PI V+L A+SK+ + + + Sbjct: 254 RYAMKTLSDTPNGERYD--AWKQYLEITGNTICGEKPISVILSALSKI-RDAGPSGIKFQ 310 Query: 277 FLKYAQSS 284 + Y+QSS Sbjct: 311 WPNYSQSS 318 >UniRef50_O15753 Cluster: 2034 protein; n=2; Dictyostelium discoideum|Rep: 2034 protein - Dictyostelium discoideum (Slime mold) Length = 168 Score = 126 bits (303), Expect = 9e-28 Identities = 67/162 (41%), Positives = 93/162 (57%), Gaps = 6/162 (3%) Query: 23 LNSGSELSRQLDLWLSKADLTHGPARAIIAPHXXXXXXXXXXXXXXRQVSPVVVKRIFIL 82 L++ +L +QL WLS+A + ++IIAPH + P KR+FIL Sbjct: 13 
LDNARKLEKQLSDWLSEASRLNQNVKSIIAPHAGYSYSGRAAAYAYINLIPENYKRVFIL 72 Query: 83 GPSHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDEQTDENEHSIEMHL 142 GPSHHV + C L+ LD ++TP+ +L +DK +L T F + DE+EHS+E+ L Sbjct: 73 GPSHHVYMKTCGLTKLDTWETPIGNLKVDKDTTNKLFDTGSFIWNTKSVDEDEHSLELQL 132 Query: 143 PYIAKVMEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYLAD 184 PYIAKV E I+PI+VGSL+ + E YG ILAPY D Sbjct: 133 PYIAKVAE------NIVPIMVGSLSIDLEELYGKILAPYFDD 168 >UniRef50_A2FL46 Cluster: Putative uncharacterized protein; n=1; Trichomonas vaginalis G3|Rep: Putative uncharacterized protein - Trichomonas vaginalis G3 Length = 310 Score = 114 bits (274), Expect = 3e-24 Identities = 60/219 (27%), Positives = 113/219 (51%), Gaps = 5/219 (2%) Query: 48 RAIIAPHXXXXXXXXXXXXXXRQVSPVVVKRIFILGPSHHVRIAGCALSSLDKYQTPLYD 107 + +I+PH ++P +RI ILG HH+ + +S + +TP + Sbjct: 42 KGVISPHSCYQVCLRTAAYSFSCINPDKFERIIILGTCHHIALKAGLVSHATEVETPFGN 101 Query: 108 LTIDKQIYAEL--EATRQFDRMDEQTDENEHSIEMHLPYIAKVMEEYKTSFTIIPILVGS 165 L +D ++ +L E MD++ DENEHS+EM P I + ++ IIP+L+GS Sbjct: 102 LQVDTEVTEKLATEYGEAIQWMDQKVDENEHSLEMQYPLIKYIWQDRPVK--IIPMLIGS 159 Query: 166 LTPEKEAKYGAILAPYLADPQNLLVISSDFCHWGSRFRYT-WKDSSRGHIYQSIEWLDKL 224 L+ +E + L+P + D + +ISSDF HWG F +T + + + + Q ++ D+ Sbjct: 160 LSEPREIEIAEALSPIITDEKTFFIISSDFTHWGEIFHHTPIQSTKKKQLSQQLQIADER 219 Query: 225 GMDLIEKMDPKAFTDYLNKYGNTICGRHPIGVLLQAISK 263 + +I + + + F + +ICG + I ++L+ +++ Sbjct: 220 SIGIIHQFNYEHFRFICEEIHGSICGCYSICLMLRILAE 258 >UniRef50_A7ATY0 Cluster: Putative uncharacterized protein; n=1; Babesia bovis|Rep: Putative uncharacterized protein - Babesia bovis Length = 245 Score = 111 bits (268), Expect = 2e-23 Identities = 62/167 (37%), Positives = 90/167 (53%), Gaps = 5/167 (2%) Query: 79 IFILGPSHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDEQTDENEHSI 138 +FILGPSHH+ + GCA+ QTP +L +D I EL + F + ++ E EHSI Sbjct: 46 VFILGPSHHLPLKGCAVDVSSTLQTPFGELQVDNDITTELLKGKCFKELSKRNSEEEHSI 105 Query: 139 EMHLPYIAKVMEEYKTS-FTIIPILVGSLTPEKEAKYGAILAPYLADPQNLLVISSDFCH 197 EM LP + V + ++PI+VG + E G L PY + VISSDFCH 
Sbjct: 106 EMQLPILHYVANKSNADHIKVVPIVVGYMLNEGLEDVGQALLPYFEKEDTIFVISSDFCH 165 Query: 198 WGSRFRYT---WKDSSRGHIYQSIEWLDKLGMDLIEKMDPKAFTDYL 241 +G RF +T ++D I+++IE LD G+ LI + D + Y+ Sbjct: 166 FGKRFGFTRTGFEDQDM-PIWKAIESLDLDGVKLIVEHDLEVSNKYI 211 >UniRef50_A6PTD3 Cluster: Putative uncharacterized protein; n=1; Victivallis vadensis ATCC BAA-548|Rep: Putative uncharacterized protein - Victivallis vadensis ATCC BAA-548 Length = 295 Score = 108 bits (260), Expect = 2e-22 Identities = 67/221 (30%), Positives = 104/221 (47%), Gaps = 8/221 (3%) Query: 48 RAIIAPHXXXXXXXXXXXXXXRQVSPVVVKRIFILGPSHHVRIAGCALSSLDKYQTPLYD 107 R + PH R ++ +LGPSH+V G A ++ ++TP D Sbjct: 48 RGCVLPHAGYMFSLGVAMETLRAARHCGCSKVVLLGPSHYVGFRGIAAATFTSWRTPFGD 107 Query: 108 LTIDKQIYAELEATRQ-FDRMDEQTDENEHSIEMHLPYIAKVMEEYKTSFTIIPILVGSL 166 L+ + LEA R +++ NEHS+E+ P I + + + ++P++VG + Sbjct: 108 LSTATDLLDVLEAERNPLVMVNDDAHINEHSLEVQFPLI----QYFFDAPVVLPLVVGGI 163 Query: 167 TPEKEAKYGAILAPYLADPQNLLVISSDFCHWGSRFRYTWKDSSRGHIYQSIEWLDKLGM 226 + E GA LA L P L +ISSDF H+G +FRYT S + LD+ Sbjct: 164 SAEDAQSLGAALAK-LDAPDVLWLISSDFTHYGRKFRYTPFGESADPA--ELNRLDREAA 220 Query: 227 DLIEKMDPKAFTDYLNKYGNTICGRHPIGVLLQAISKLSSQ 267 +LI D F +L + G TICG HPI + L + +L + Sbjct: 221 ELIAARDLTGFVKFLGRTGATICGAHPIAIYLAMLDRLDPE 261 >UniRef50_Q7RG18 Cluster: Putative uncharacterized protein PY04533; n=1; Plasmodium yoelii yoelii|Rep: Putative uncharacterized protein PY04533 - Plasmodium yoelii yoelii Length = 264 Score = 95.5 bits (227), Expect = 2e-18 Identities = 74/249 (29%), Positives = 116/249 (46%), Gaps = 37/249 (14%) Query: 24 NSGSELSRQLDLWLSKADLTHGPARAIIAPHXXXXXXXXXXXXXXRQVSPVVVKRIFILG 83 ++ + L ++ K +L +A I PH ++ +K IFILG Sbjct: 16 DNSNVLKNSIESLFEKINLPKQQVKAAICPHAGYAYCLETSSHVYSCINVENIKNIFILG 75 Query: 84 PSHHVRIAGCALSSLDKYQT--------------------PL--YDL---TIDKQIYAEL 118 P+HH+ GC L +DKY+T PL Y L TI+ IY + Sbjct: 76 PNHHIYNKGCLLPQVDKYETPFGFLQINKDGNLPLATCHLPLATYHLPLTTINVYIYMFI 135 Query: 119 ------EATRQFDRMDEQTDENEHSIEMHLPYIAKVMEEYKTSFTIIPILVGSLTPE--K 170 
+ +D +DE DE EHSIEM LP I +++E I+PI VG + + K Sbjct: 136 SDIMNNDTQNLYDYIDEIDDEEEHSIEMQLPLIKYIIKE--KDIKIVPIYVGCIGNDVNK 193 Query: 171 EAKYGAILAPYLADPQNLLVISSDFCHWGSRFRYT--WKDSSRGHIYQSIEWLDKLGMDL 228 ++ L Y D N + SSDFCH+G RF +T + + +I++ IE +DK G+++ Sbjct: 194 INEFSNPLKKYFQDKTNAFIFSSDFCHFGRRFSFTNILEKYNDKYIHKKIENMDKDGINV 253 Query: 229 IEKMDPKAF 237 I K + + + Sbjct: 254 ITKHNVQGY 262 >UniRef50_A1SXX4 Cluster: Putative uncharacterized protein; n=2; Alteromonadales|Rep: Putative uncharacterized protein - Psychromonas ingrahamii (strain 37) Length = 282 Score = 88.2 bits (209), Expect = 2e-16 Identities = 47/176 (26%), Positives = 94/176 (53%), Gaps = 9/176 (5%) Query: 25 SGSELSRQLDLWLSKADLTHGPARAIIAPHXXXXXXXXXXXXXXRQVSPVV--VKRIFIL 82 + ++ ++L ++L+ + A+A+I PH + + + R+ +L Sbjct: 38 TADQIDQELSVFLNAPSESTTQAKALIVPHAGYCYSGAVAGYAYSYLKNIAHNINRVILL 97 Query: 83 GPSHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDEQTDENEHSIEMHL 142 GPSH V + GCA+SS D + TP+ + +DK Y +L + +++Q EHS+E+ L Sbjct: 98 GPSHRVALQGCAISSCDFFTTPIGPIPVDKSAYTQL-LDEKLVTINDQAHLLEHSLEVQL 156 Query: 143 PYIAKVMEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYLADPQNLLVISSDFCHW 198 P++ + ++ +F ++PI+VG + + ++ IL + +P L+V+SSD H+ Sbjct: 157 PFLQRSLQ----NFVLVPIVVGQCSVQHVSQILEILK--VNEPGTLVVVSSDLSHY 206 >UniRef50_A6Q8X5 Cluster: Putative uncharacterized protein; n=2; unclassified Epsilonproteobacteria|Rep: Putative uncharacterized protein - Sulfurovum sp. 
(strain NBC37-1) Length = 267 Score = 86.6 bits (205), Expect = 7e-16 Identities = 48/169 (28%), Positives = 84/169 (49%), Gaps = 9/169 (5%) Query: 30 SRQLDLWLSKADLTHGPARAIIAPHXXXXXXXXXXXXXXRQVSPVVVKRIFILGPSHHVR 89 +R ++ L +L H RA+I PH R + KR+ ++GPSH V Sbjct: 28 NRIIEEHLQNEELLHMKPRAVIVPHAGYVYSAFTANVAMRLLGNTEAKRVVVIGPSHRVY 87 Query: 90 IAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDEQTDENEHSIEMHLPYIAKVM 149 + G ++S D Y TPL L ID+++ EL++ +F +EHS E+ +P++ Sbjct: 88 LKGTSISDYDSYNTPLGALPIDRELVNELKS--RFGLQFVPDAHHEHSTEVQMPFV---- 141 Query: 150 EEYKTSFTIIPILVGSLTPEKEAKYGAILAPYLADPQNLLVISSDFCHW 198 + Y T +++ ++ G E A+ ++ L DP ++VIS+D H+ Sbjct: 142 KTYDTDASVVELVYGD---EDPARLAEVIDYLLDDPDTVVVISTDLSHY 187 >UniRef50_Q1Q7G0 Cluster: Putative uncharacterized protein; n=1; Candidatus Kuenenia stuttgartiensis|Rep: Putative uncharacterized protein - Candidatus Kuenenia stuttgartiensis Length = 347 Score = 85.8 bits (203), Expect = 1e-15 Identities = 65/234 (27%), Positives = 106/234 (45%), Gaps = 23/234 (9%) Query: 43 THGPARAIIAPHXXXXXXXXXXXXXXRQVSPVVVKRIFILGPSHHVR-IAGCALSSLDKY 101 ++G AII+PH + KR+ +L PSH R G ++ Y Sbjct: 64 SNGRPLAIISPHAGYVYSGQVAAYGYSAIKGHGFKRVIVLSPSHSGRRYRGASILKATSY 123 Query: 102 QTPLYDLTIDKQ---------IYAELEATRQFDRMDEQTD-----ENEHSIEMHLPYIAK 147 +TPL ++ID++ AE + R + D + EHS+EM LP++ Sbjct: 124 KTPLGKISIDQEACDYLLNTSFTAESKNKRNSSPLKLFGDYDGAYKGEHSLEMQLPFLQM 183 Query: 148 VMEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYLADPQNLLVISSDFCHWGSRFRYTWK 207 + + F ++PI++G L K + P L D + LLV+SSDF H+G +RY Sbjct: 184 TLGD----FNLVPIMIGILIDNDFDKVAEAIRPLL-DDKTLLVVSSDFTHYGDAYRYV-- 236 Query: 208 DSSRGHIYQSIEWLDKLGMDLIEKMDPKAFTDYLNKYGNTICGRHPIGVLLQAI 261 R ++ ++I+ LD + I D +Y + G CG PI +LL+ + Sbjct: 237 -PFRENVEENIKILDYGAFEKILNKDFDGLREYRKQTGINACGILPISILLKLL 289 >UniRef50_Q6LSR4 Cluster: Putative uncharacterized protein; n=2; Photobacterium profundum|Rep: Putative uncharacterized protein - Photobacterium profundum (Photobacterium sp. 
(strain SS9)) Length = 260 Score = 85.0 bits (201), Expect = 2e-15 Identities = 51/170 (30%), Positives = 80/170 (47%), Gaps = 8/170 (4%) Query: 29 LSRQLDLWLSKADLTHGPARAIIAPHXXXXXXXXXXXXXXRQVSPVVVKRIFILGPSHHV 88 L +QLD W S RA+I PH Q+ +K++ ++GPSH Sbjct: 20 LQKQLDDWCSPPTTHRDLIRALIVPHAGYIYSGEVAAKAYCQLQAETIKKVILIGPSHRY 79 Query: 89 RIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDEQTDENEHSIEMHLPYIAKV 148 GCA+ + D + TPL ++ID Q L ++ EQ EH +E+ LP++ Sbjct: 80 AFHGCAVPNSDYFSTPLGSVSIDVQSIDNLIKIDDI-KVSEQVHAQEHCLEVQLPFLQTC 138 Query: 149 MEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYLADPQNLLVISSDFCHW 198 + + FT++P+L +++ K AK I+ LLVISSD H+ Sbjct: 139 LHQ----FTLLPLLTSNVSFIKVAK---IIDALWQQDDTLLVISSDLSHF 181 >UniRef50_A6CYQ1 Cluster: Putative uncharacterized protein; n=1; Vibrio shilonii AK1|Rep: Putative uncharacterized protein - Vibrio shilonii AK1 Length = 267 Score = 81.4 bits (192), Expect = 3e-14 Identities = 48/153 (31%), Positives = 82/153 (53%), Gaps = 11/153 (7%) Query: 48 RAIIAPHXXXXXXXXXXXXXXRQVSPVVVK--RIFILGPSHHVRIAGCALSSLDKYQTPL 105 R +I PH Q+ V + R+ ++GPSH V GCAL S+ ++TPL Sbjct: 46 RGLIVPHAGYVFSGETAGLAYHQLQSVAQQFLRVILVGPSHRVAFHGCALPSVGAFETPL 105 Query: 106 YDLTIDKQIYAELEATRQFDRMDEQTDENEHSIEMHLPYIAKVMEEYKTSFTIIPILVGS 165 ++ID+ EL A +++Q EHS+E+ LP++ V+++ F ++PI+ G Sbjct: 106 GRVSIDRDC-VELLADNSMVSINDQAHAQEHSLEVQLPFLQTVLDD----FQLLPIVTGQ 160 Query: 166 LTPEKEAKYGAILAPYLADPQNLLVISSDFCHW 198 ++ + AK ++ P + D + LLVIS+D H+ Sbjct: 161 VSALEIAK---LIEP-IWDSKTLLVISTDLSHF 189 >UniRef50_Q2W0W5 Cluster: Predicted dioxygenase; n=4; Rhodospirillaceae|Rep: Predicted dioxygenase - Magnetospirillum magneticum (strain AMB-1 / ATCC 700264) Length = 456 Score = 79.4 bits (187), Expect = 1e-13 Identities = 62/201 (30%), Positives = 98/201 (48%), Gaps = 17/201 (8%) Query: 27 SELSRQLDLWLSKADLTHGPAR---AIIAPHXXXXXXXXXXXXXXRQVSPV--VVKRIFI 81 +E +RQL +L A R A+IAPH + P R+ + Sbjct: 19 AEANRQLTAFLDGAVAAPCAGRRPKALIAPHAGWVYSGPVAAGAYALLKPFRGSWSRVVL 78 Query: 82 LGPSHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDEQTDENEHSIEMH 141 
LGPSH V G ALSS D++ +PL + +DK ++ L +D Q EHS+E+H Sbjct: 79 LGPSHRVAFQGMALSSADQWASPLGAVPLDKD-WSRLAGVAGVGVLD-QAHAQEHSLEVH 136 Query: 142 LPYIAKVMEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYLADPQNLLVISSDFCHWGSR 201 +P++ + E FT++P+++G +PE A G + A + D + L+VIS+D H+ Sbjct: 137 VPFLQATIGE----FTLLPVVIGDSSPEMVA--GLLEALWGGD-ETLIVISTDLSHY--- 186 Query: 202 FRYTWKDSSRGHIYQSIEWLD 222 Y S+ G +IE +D Sbjct: 187 LPYEQCRSTDGQTVAAIEHMD 207 >UniRef50_A0L9L0 Cluster: Putative uncharacterized protein; n=1; Magnetococcus sp. MC-1|Rep: Putative uncharacterized protein - Magnetococcus sp. (strain MC-1) Length = 481 Score = 79.4 bits (187), Expect = 1e-13 Identities = 56/191 (29%), Positives = 94/191 (49%), Gaps = 14/191 (7%) Query: 14 QPGALIIVLLNSGSELSRQL-DLWLSKADLTH--GPARAIIAPHXXXXXXXXXXXXXXR- 69 +P A+ + + ++ RQL L +A H G RA +APH Sbjct: 33 RPAAVAGMFYPAQADALRQLVRSLLQQAPKRHDQGEPRAFVAPHAGYRYSGLTAAYAYNT 92 Query: 70 -QVSPVV-VKRIFILGPSHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRM 127 Q +P +R+F+LGPSH V + G +L + D ++TPL + +D + + A + Sbjct: 93 LQAAPKERPRRVFLLGPSHRVALHGASLGNYDAFETPLGLVEVDLPLVERMAAQESDLVL 152 Query: 128 DEQTDENEHSIEMHLPYIAKVMEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYLADPQN 187 D EHS+E+HLP+ ++E F ++P++ G + P + A+ ILA Y + + Sbjct: 153 DNAPHAQEHSLEVHLPF----LQESLAHFRLVPMVFGRIEPSRVAE---ILAKY-READD 204 Query: 188 LLVISSDFCHW 198 L+V SSD H+ Sbjct: 205 LIVGSSDLSHF 215 >UniRef50_A1RWV3 Cluster: Putative uncharacterized protein; n=1; Thermofilum pendens Hrk 5|Rep: Putative uncharacterized protein - Thermofilum pendens (strain Hrk 5) Length = 287 Score = 77.0 bits (181), Expect = 6e-13 Identities = 44/122 (36%), Positives = 67/122 (54%), Gaps = 5/122 (4%) Query: 79 IFILGPSHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDEQTDENEHSI 138 +FILGP+HH A AL + ++TPL D+ +D ++ EL + Q R D Q EHSI Sbjct: 83 VFILGPNHHALGAPIALDENEVWETPLGDVEVDFRVSKELASREQIIRFDFQAHAYEHSI 142 Query: 139 EMHLPYIAKVMEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYLADP--QNLLVISSDFC 196 E+ +P++ V E FTI+PI + TPE + G +A + + + +V SSD Sbjct: 143 
EVQVPFLQFVFGE---GFTIVPISMMLQTPEAARRVGEAIAGLIMEKGLRAYVVASSDMS 199 Query: 197 HW 198 H+ Sbjct: 200 HY 201 >UniRef50_A0LJS7 Cluster: AMMECR1 domain protein precursor; n=3; Syntrophobacter fumaroxidans MPOB|Rep: AMMECR1 domain protein precursor - Syntrophobacter fumaroxidans (strain DSM 10017 / MPOB) Length = 522 Score = 75.8 bits (178), Expect = 1e-12 Identities = 58/246 (23%), Positives = 110/246 (44%), Gaps = 26/246 (10%) Query: 28 ELSRQLDLWLSKAD--LTHGPARAIIAPHXXXXXXXXXXXXXXRQVSPVVVKRIFILGPS 85 EL +Q++ +L++ G A+I+PH + + + ++ PS Sbjct: 58 ELRKQIEGFLNRVPEPKPRGQLVALISPHAGTIYSGQVAAYGYKLLEKQKFASVIVISPS 117 Query: 86 HHVRIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDEQTDE---NEHSIEMHL 142 H R G A L +QTPL + +D+ + +EA R+ D+ E EH++E+ L Sbjct: 118 HRARFEGVATYELGGFQTPLGIVPLDRDL---IEALRRRDKRIAHRPEVHSEEHALEIQL 174 Query: 143 PYIAKVMEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYLADPQNLLVISSDFCHWGSRF 202 P++ V+EE+K ++P+++G + +A + + + L++ SSD H+ Sbjct: 175 PFLQTVLEEFK----LVPLIMGEQDFATCKRLAEAIADTVREKRVLVIASSDLSHF---- 226 Query: 203 RYTWKDSSRGHIYQSIEWLDKLGMDLIEKMDPKAFTDYLNKYGNTICGRHPIGVLLQAIS 262 H Y+ + LDK+ D + +DP+ + L CG P+ + A Sbjct: 227 ----------HPYERAKALDKVAADRVGALDPQGLSYSLAGGECEACGGGPMVTAMLAAM 276 Query: 263 KLSSQS 268 +L + S Sbjct: 277 RLGANS 282 >UniRef50_Q2BMM2 Cluster: Putative uncharacterized protein; n=1; Neptuniibacter caesariensis|Rep: Putative uncharacterized protein - Neptuniibacter caesariensis Length = 260 Score = 72.9 bits (171), Expect = 9e-12 Identities = 44/164 (26%), Positives = 82/164 (50%), Gaps = 9/164 (5%) Query: 38 SKADLTHGPARAIIAPHXXXXXXXXXXXXXXRQVSP-VVVKRIFILGPSHHVRIAGCALS 96 S+++ P ++ PH +Q++ +R+ +LGPSH V + G ALS Sbjct: 30 SQSEREGTPPSLLVVPHAGYQYSGTVAAQAYKQITDWSYYERVLLLGPSHRVPLRGMALS 89 Query: 97 SLDKYQTPLYDLTIDKQIYAELEATRQFDRMDEQTDENEHSIEMHLPYIAKVMEEYKTSF 156 DK+ +PL +L +D ++ AEL ++ + E EHS+E+ LP+ ++ Sbjct: 90 DADKFSSPLGELNLDTELIAELN-SQDLAAYNSAAHELEHSLEVQLPF----LQFLNCDL 144 Query: 157 TIIPILVGSLTPEKEAKYGAILAPYLADPQNLLVISSDFCHWGS 200 IIP++VG + P E +++ + L+++S+D H+ S Sbjct: 
145 PIIPVVVG-VAPRDEV--ASLIRIVEQSYRILVIVSTDLSHFHS 185 >UniRef50_Q5ZWB6 Cluster: Putative uncharacterized protein; n=4; Legionella pneumophila|Rep: Putative uncharacterized protein - Legionella pneumophila subsp. pneumophila (strain Philadelphia 1 /ATCC 33152 / DSM 7513) Length = 453 Score = 72.5 bits (170), Expect = 1e-11 Identities = 42/158 (26%), Positives = 76/158 (48%), Gaps = 11/158 (6%) Query: 44 HGPA-RAIIAPHXXXXXXXXXXXXXXRQVSPV--VVKRIFILGPSHHVRIAGCALSSLDK 100 H PA +AI+ PH + + +I +LGP+H + G A +DK Sbjct: 40 HKPAPKAILVPHAGYVYSGAVAASAYASLRDKKDTINKIILLGPAHRLYFKGIAYDPVDK 99 Query: 101 YQTPLYDLTIDKQIYAELEATRQFDRMDEQTDENEHSIEMHLPYIAKVMEEYKTSFTIIP 160 + TPL ++ DK++ ++ + E +NEH +E+ LP+ + ++K I+P Sbjct: 100 FATPLGEIDQDKELLTQIIDLPYVYSLPE-AHQNEHCLEVQLPFCQMIFSKFK----ILP 154 Query: 161 ILVGSLTPEKEAKYGAILAPYLADPQNLLVISSDFCHW 198 +++G P+ A+ ++A LL+ISSD H+ Sbjct: 155 LVIGETNPQDVAR---LIARIWGGDDTLLIISSDLSHY 189 >UniRef50_Q2S9S7 Cluster: Predicted dioxygenase; n=15; Proteobacteria|Rep: Predicted dioxygenase - Hahella chejuensis (strain KCTC 2396) Length = 259 Score = 72.1 bits (169), Expect = 2e-11 Identities = 50/191 (26%), Positives = 88/191 (46%), Gaps = 12/191 (6%) Query: 11 FFQQPGALIIVLLNSGSELSRQLDLWLSKA-DLTHGPARAIIAPHXXXXXXXXXXXXXXR 69 F ++P + + +LS + +++ + H P +AIIAPH Sbjct: 2 FVRKPAVSGLFYPANAEDLSETVSRYIATSPSFDHSP-KAIIAPHAGYVYSGAIAGVAYS 60 Query: 70 QV--SPVVVKRIFILGPSHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRM 127 + S + ++ +LGPSH V G A S D + TPL + ID +L + Q + Sbjct: 61 ALHNSAKRISKVVLLGPSHRVGFRGIAAPSSDAFSTPLGAIAIDADNLVKLASLPQVVTL 120 Query: 128 DEQTDENEHSIEMHLPYIAKVMEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYLADPQN 187 D EHS+E+HLP++ + ++ F + P+++G E A+ +L + Sbjct: 121 D-SAHAQEHSLEVHLPFLQQCLD----CFELTPLVIGDADAELVAE---VLELLWGGDET 172 Query: 188 LLVISSDFCHW 198 L+VIS+D H+ Sbjct: 173 LIVISTDLSHY 183 >UniRef50_Q3VWM2 Cluster: Putative uncharacterized protein; n=2; Chlorobiaceae|Rep: Putative uncharacterized protein - Prosthecochloris aestuarii DSM 271 Length = 297 Score = 71.3 bits (167), 
Expect = 3e-11 Identities = 52/179 (29%), Positives = 85/179 (47%), Gaps = 11/179 (6%) Query: 28 ELSRQLDLWLSKADLTHGPA----RAIIAPHXXXXXXXXXXXXXXRQVSPVVVKRIFILG 83 EL L+ LS++ T+ RA++ PH +++ + +FILG Sbjct: 31 ELDTFLESILSESTATNNSEKASIRALLVPHAGYAFSGRASAEAYSRLAGNQYRTVFILG 90 Query: 84 PSHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAELE--ATRQFDRMDEQTDENEHSIEMH 141 +H R G AL + +Q+PL + I+ + A R D +D ++H +E+ Sbjct: 91 NAHAYRFNGIALDTHHIWQSPLGRIPINMDAAEQFRTAAPRLIDYLD-IAHHSDHVLEVQ 149 Query: 142 LPYIAKVMEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYLADPQNLLVISSDFCHWGS 200 LP++ K + KT F+I+PIL G + K IL+ L P +LL+ SSD H+ S Sbjct: 150 LPFLQKTL---KTGFSILPILFGENAKDISLKTARILSDIL-QPDDLLIASSDLSHYPS 204 >UniRef50_A6QB54 Cluster: Putative uncharacterized protein; n=1; Sulfurovum sp. NBC37-1|Rep: Putative uncharacterized protein - Sulfurovum sp. (strain NBC37-1) Length = 273 Score = 71.3 bits (167), Expect = 3e-11 Identities = 45/151 (29%), Positives = 70/151 (46%), Gaps = 10/151 (6%) Query: 49 AIIAPHXXXXXXXXXXXXXXRQVSPVVVKRIFILGPSHHVRIAGCALSSLDKYQTPLYDL 108 AII PH R + KRI ++GPSHH G + + ++TP ++ Sbjct: 49 AIIVPHAGYIYSGFTANFAYRFLKHTKPKRIIVIGPSHHYYFKGISAGHFENFETPCGEI 108 Query: 109 TIDKQIYAELEATRQFD-RMDEQTDENEHSIEMHLPYIAKVMEEYKTSFTIIPILVGSLT 167 ID L ++F+ D + E EHS E+ +P+I + Y +I ++ G + Sbjct: 109 EIDNPYLFAL--AKEFNIGFDPKAHEKEHSTEVQMPFI----QHYFPKAKVIELVYGDV- 161 Query: 168 PEKEAKYGAILAPYLADPQNLLVISSDFCHW 198 P KE I+ L +P N +VISSD H+ Sbjct: 162 PAKE--LALIITALLKNPDNAVVISSDLSHF 190 >UniRef50_A0X3C5 Cluster: Putative uncharacterized protein; n=3; Shewanella|Rep: Putative uncharacterized protein - Shewanella pealeana ATCC 700345 Length = 303 Score = 68.9 bits (161), Expect = 2e-10 Identities = 45/179 (25%), Positives = 87/179 (48%), Gaps = 11/179 (6%) Query: 22 LLNSGSELSRQLDLWLSKADLTHGPARAIIAPHXXXXXXXXXXXXXXRQVSPVV--VKRI 79 L + S L+R+ D S+ D ++ + +I PH + P+ +K++ Sbjct: 55 LTQASSILARKTDCQNSQ-DESYPSPKVLIVPHAGYLYSGQVAAYAYALIQPLADTIKKV 113 Query: 80 FILGPSHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDEQTDENEHSIE 139 ++GP+H V + G AL 
++TPL + I E+ +Q + E + EHS+E 140 Query: 140 MHLPYIAKVMEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYLADPQNLLVISSDFCHW 198 + LP++ ++E F ++P+L+G P++ A +L + L+V+S+D H+ Sbjct: 173 VQLPFLQHFLKE----FELLPLLIGESEPKEMA---LLLEQVWGGNETLIVVSTDLSHF 224 >UniRef50_A6DA73 Cluster: Putative uncharacterized protein; n=1; Caminibacter mediatlanticus TB-2|Rep: Putative uncharacterized protein - Caminibacter mediatlanticus TB-2 Length = 263 Score = 68.5 bits (160), Expect = 2e-10 Identities = 44/151 (29%), Positives = 69/151 (45%), Gaps = 10/151 (6%) Query: 48 RAIIAPHXXXXXXXXXXXXXXRQVSPVVVKRIFILGPSHHVRIAGCALSSLDKYQTPLYD 107 +A+I PH R S KR+ ++GPSH I G + + D Y+TP Sbjct: 46 KALIVPHAGWMYSGFTANFAYRIASNTNPKRVVVIGPSHRFPIKGISTTLEDVYETPCGL 105 Query: 108 LTIDKQIYAELEATRQFDRMDEQTDENEHSIEMHLPYIAKVMEEYKTSFTIIPILVGSLT 167 L ID + EL + FD + + EHS E+ +P+I Y ++ ++ G Sbjct: 106 LPIDIEFAKEL--IKNFDVQNLEMVHQEHSTEVQMPFI----YHYFGKIPVVELIYGDYA 159 Query: 168 PEKEAKYGAILAPYLADPQNLLVISSDFCHW 198 PEK + + Y + +L+VISSD H+ Sbjct: 160 PEKLKE----IIKYAIEDNSLVVISSDLSHY 186 >UniRef50_A1WY73 Cluster: Putative uncharacterized protein; n=1; Halorhodospira halophila SL1|Rep: Putative uncharacterized protein - Halorhodospira halophila (strain DSM 244 / SL1) (Ectothiorhodospira halophila (strain DSM 244 / SL1)) Length = 268 Score = 67.3 bits (157), Expect = 5e-10 Identities = 42/160 (26%), Positives = 77/160 (48%), Gaps = 12/160 (7%) Query: 41 DLTHGPARAIIAPHXXXXXXXXXXXXXXRQVSPV--VVKRIFILGPSHHVRIAGCALSSL 98 D T P A++ PH +++ P+ ++ + +LGP+H V ++G AL + Sbjct: 42 DPTRAP-HAMVLPHAGYPFSGAAAARGYQRIVPIREQLRHVVLLGPAHFVDLSGIALPAA 100 Query: 99 DKYQTPLYDLTIDKQIYAELEATRQFDRMDEQTDENEHSIEMHLPYIAKVMEEYKTSFTI 158 D TPL + + + E +D+ E EHS+E+HLP++ ++++ F + Sbjct: 101 DALATPLGTVPVSATL-RERALEHPGVHIDDSAHEREHSLEVHLPFLQTLLDD----FDV 155 Query: 159 IPILVGSLTPEKEAKYGAILAPYLADPQNLLVISSDFCHW 198 +P++VG E + L L L+V+SSD H+ Sbjct: 156 LPLVVGRGPAESCGR----LIEQLWQDDTLVVVSSDLSHF 191 >UniRef50_Q6L0F9 Cluster:
Hypothetical conserved protein DUF52; n=2; Thermoplasmatales|Rep: Hypothetical conserved protein DUF52 - Picrophilus torridus Length = 268 Score = 66.1 bits (154), Expect = 1e-09 Identities = 59/241 (24%), Positives = 100/241 (41%), Gaps = 20/241 (8%) Query: 24 NSGSELSRQL-DLWLSKADLTHGPARAIIAPHXXXXXXXXXXXXXXRQVSPVVVKRIFIL 82 +S SEL +L + D+ ++ PH + +R I+ Sbjct: 14 DSESELLNYFKNLEPERFDIKFNKILGVVVPHAGYEYSGKIAWASYSILKEYNARRFLII 73 Query: 83 GPSHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDEQTDENEHSIEMHL 142 GP+H+ A+ S ++TPL D ID ++ +L + D +T EHSIE+ L Sbjct: 74 GPNHYGYPFYPAIYSNGSWRTPLGDSIIDNELSEQLIMKSGIIKNDPETHSTEHSIEVQL 133 Query: 143 PYIAKVMEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYLADPQNLLVISSDFCHWGSRF 202 P++ + +K FT +P+++G + E G + D L++ SSD H+ S Sbjct: 134 PFLQYI---FKNQFTFVPLILGDQSYEISRDLGETILS--LDRIPLIIASSDLNHYES-- 186 Query: 203 RYTWKDSSRGHIYQSIEWLDKLGMDLIEKMDPKAFTDYLNKYGNTICGRHPIGVLLQAIS 262 Y D++ ++ I + K F + + KY T CG I VL+ Sbjct: 187 ------------YDKNNEKDEIIINDIINLRIKDFFNDIYKYRITACGFGAIAVLMYITK 234 Query: 263 K 263 K Sbjct: 235 K 235 >UniRef50_Q978N2 Cluster: UPF0103 protein TV1383; n=2; Thermoplasma|Rep: UPF0103 protein TV1383 - Thermoplasma volcanium Length = 269 Score = 66.1 bits (154), Expect = 1e-09 Identities = 49/221 (22%), Positives = 95/221 (42%), Gaps = 25/221 (11%) Query: 50 IIAPHXXXXXXXXXXXXXXRQVSPVVVKRIFILGPSHHVRIAGCALSSLDKYQTPLYDLT 109 ++ PH + ++ I+GP+H ++ ++TPL + Sbjct: 40 VVVPHAGIVYSGRTAMYSYNALRNSSIRDFIIIGPNHRPMTPYASIFPSGSWETPLGNAI 99 Query: 110 IDKQIYAELEATRQFDRMDEQTDENEHSIEMHLPYIAKVMEEYKTSFTIIPILVGSLTPE 169 I++++ +EL Q+ DE++ EHSIE+ +P++ + + SFT +P+++G E Sbjct: 100 INEELASELYKNSQYIVKDEESHSVEHSIEVQIPFLQYM---FGNSFTFVPVILGD--QE 154 Query: 170 KEAKYGAILAPYLADPQNLLVISSDFCHWGSRFRYTWKDSSRGHIYQSIEWLDKLGMDLI 229 K A +L+ SSDF H Y+ + +++ MDLI Sbjct: 155 KVVANDIASALMRLSKPYILIASSDFTH-----------------YERSDIVERKDMDLI 197 Query: 230 EK---MDPKAFTDYLNKYGNTICGRHPIGVLLQAISKLSSQ 267 + +D F D + + T CG I +L+ K+ ++ Sbjct: 198 SRIVDLDIDGFYDTIERENVTACGYGAIAILMIIAKKIGAK 238 
>UniRef50_A4MJZ4 Cluster: Putative uncharacterized protein; n=1; Petrotoga mobilis SJ95|Rep: Putative uncharacterized protein - Petrotoga mobilis SJ95 Length = 274 Score = 64.9 bits (151), Expect = 2e-09 Identities = 51/215 (23%), Positives = 98/215 (45%), Gaps = 22/215 (10%) Query: 46 PARAIIAPHXXXXXXXXXXXXXXRQV-SPVVVKRIFILGPSHHVRIAGCALSSLDKYQTP 104 P I PH ++V + KR+F+LGP+H + ++ + ++TP Sbjct: 42 PPMGAIVPHAGYIYSGETAAKAYKKVFEKGIAKRVFLLGPNHTGLGSKISVFTSGSWKTP 101 Query: 105 LYDLTIDKQIYAELEATRQFD-RMDEQTDENEHSIEMHLPYIAKVMEEYKTSFTIIPILV 163 L + +D + ++ ++ D DE EHS+E+ LP++ + F I+PI + Sbjct: 102 LGTINVDGKTAGKI--LKELDIYNDESAHSREHSLEVQLPFLQYAI---GNDFEIVPICM 156 Query: 164 GSLTPEKEAKYGAILAPYLADPQNLLVISSDFCHWGSRFRYTWKDSSRGHIYQSIEWLDK 223 + E G ILA + + +L++ SSD H+ S + KD +K Sbjct: 157 MDQSLETSKNLGEILAD-IIEEGDLIIASSDMNHYESHEKTLLKD-------------EK 202 Query: 224 LGMDLIEKMDPKAFTDYLNKYGNTICGRHPIGVLL 258 + ++ ++ M+ + D + +Y ++CG P+ LL Sbjct: 203 V-IETLKNMNLQEMYDTIRRYNISMCGYGPVAALL 236 >UniRef50_A4BK98 Cluster: Putative uncharacterized protein; n=1; Reinekea sp. MED297|Rep: Putative uncharacterized protein - Reinekea sp. MED297 Length = 261 Score = 64.9 bits (151), Expect = 2e-09 Identities = 47/167 (28%), Positives = 82/167 (49%), Gaps = 14/167 (8%) Query: 32 QLDLWLSKADL--THGPARAIIAPHXXXXXXXXXXXXXXRQVSPVV--VKRIFILGPSHH 87 Q++ WL A + T + +IAPH + ++ V ++R+ +LGP+H Sbjct: 23 QMENWLESAPVKTTQSTPKVLIAPHSGFHYSGESAARAYQTLNAVYDRIRRVILLGPAHR 82 Query: 88 VRIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDEQT-DENEHSIEMHLPYIA 146 + L D + TPL + +DK L RQ + + T EHS+EM LP++ Sbjct: 83 TTVDHLVLPEDDVFATPLGQVPLDKTAVNWLR--RQPGVITDNTLHAPEHSLEMQLPFLQ 140 Query: 147 KVMEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYLADPQNLLVISS 193 +E+ F ++PI+VG + P+ A + A +L D +L+V+S+ Sbjct: 141 TALED----FFLVPIIVGQVDPDLVA--DILDALWLGD-DSLIVVST 180 >UniRef50_A5FQ21 Cluster: Putative uncharacterized protein; n=3; Dehalococcoides|Rep: Putative uncharacterized protein - Dehalococcoides sp. 
BAV1 Length = 438 Score = 62.1 bits (144), Expect = 2e-08 Identities = 49/189 (25%), Positives = 87/189 (46%), Gaps = 20/189 (10%) Query: 81 ILGPSHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDEQTDENEHSIEM 140 ILGPSH A A+ + +QTP+ ++ ID + + + + D + EHS+E+ Sbjct: 68 ILGPSHTGIGAEYAIMASGIWQTPMGEVEIDSPLAHSIMKYCRHIKADPSAHQYEHSVEV 127 Query: 141 HLPYIAKVMEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYLADP--QNLLVISSDFCHW 198 +P +++ +K I+PI V E A G +A L + + +++ SSD H+ Sbjct: 128 QIP----ILQYFKPDIKIVPITVSFGKSETLADIGYGIASALRETGREAIIIASSDMTHY 183 Query: 199 GSRFRYTWKDSSRGHIYQSIEWLDKLGMDLIEKMDPKAFTDYLNKYGNTICGRHPIGVLL 258 S+ KDS L +D I K+D + + T+CG P+ +L Sbjct: 184 ESQADAHLKDS--------------LALDAIIKLDAAEMLERIQANHITMCGYAPVAAML 229 Query: 259 QAISKLSSQ 267 A+ +L ++ Sbjct: 230 TAVKELGAK 238 >UniRef50_Q7QUI2 Cluster: GLP_516_10373_9414; n=1; Giardia lamblia ATCC 50803|Rep: GLP_516_10373_9414 - Giardia lamblia ATCC 50803 Length = 319 Score = 62.1 bits (144), Expect = 2e-08 Identities = 46/198 (23%), Positives = 88/198 (44%), Gaps = 14/198 (7%) Query: 71 VSPVVVKRIFILGPSHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDEQ 130 + P + +LG H G + S + PL + +E + Sbjct: 96 IDPTRYTSVVMLGVCHAFHQRGLSTSPFASWANPLMEKGSPS---LSMETIPGLPSCQKD 152 Query: 131 TDENEHSIEMHLPYIAKV----MEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYLADPQ 186 E EHS+E+ +P++A V +E F+ + G+ E ++ L Y+ + Sbjct: 153 DCEEEHSLELQIPFLAHVFANQIEAGTVKFSAVYCSYGATRTEIDS-----LMDYVTEHN 207 Query: 187 NLLVISSDFCHWGSRFRYTWKDSSRGHIYQSIEWLDKLGMDLIEKMDPKAFTDYLNKYGN 246 +L+V+SSDFCH+G RF++T + +++ LD ++ + + +F + L + N Sbjct: 208 SLIVVSSDFCHYGPRFQFTPMIQGK-TANETVTMLDNKCINGV-MLGANSFEEALKETQN 265 Query: 247 TICGRHPIGVLLQAISKL 264 T+CG + I L+ + L Sbjct: 266 TVCGHYTILTCLRVLEGL 283 >UniRef50_O59292 Cluster: UPF0103 protein PH1626; n=5; Thermococcaceae|Rep: UPF0103 protein PH1626 - Pyrococcus horikoshii Length = 291 Score = 62.1 bits (144), Expect = 2e-08 Identities = 54/254 (21%), Positives = 103/254 (40%), Gaps = 12/254 (4%) Query: 5 PGIDHGFFQQPGALIIVLLNSGSELSRQLDLWLSKADLTHGPARAIIAPHXXXXXXXXXX 64 P 
+ F+ + ALI +L + +L + +K +T G +APH Sbjct: 5 PAVAGQFYPEGDALIEMLSSFFKDLGEEG----TKRTITAG-----VAPHAGYVFSGFTA 55 Query: 65 XXXXRQVSPVVVKRIFIL-GPSHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQ 123 + + + +F++ GP+H + AL ++ TP+ + +D + E+ Sbjct: 56 SRTYKAIYEDGLPEVFVIFGPNHTGLGSPIALYPEGEWITPMGSIKVDSKFAKEIVKRSG 115 Query: 124 FDRMDEQTDENEHSIEMHLPYIAKVMEEYKTSFTIIPILVGSLTPEKEAKYGAIL--APY 181 +D+ + EHSIE+ LP+I + E+ I+PI +G E G + A Sbjct: 116 IADLDDLAHKYEHSIEVQLPFIQYIAEKAGVEVKIVPITLGIQDEEVSRSLGRSIFEAST 175 Query: 182 LADPQNLLVISSDFCHWGSRFRYTWKDSSRGHIYQSIEWLDKLGMDLIEKMDPKAFTDYL 241 +++ S+DF H+GS + Y + + D + I D + Sbjct: 176 SLGRDTIIIASTDFMHYGSFYGYVPFRGRPEELPNMVRDWDMRIIRRILDFDLDGMFSEI 235 Query: 242 NKYGNTICGRHPIG 255 + +T+CG +G Sbjct: 236 REMNHTMCGPGGVG 249 >UniRef50_Q2NG05 Cluster: Putative uncharacterized protein; n=1; Methanosphaera stadtmanae DSM 3091|Rep: Putative uncharacterized protein - Methanosphaera stadtmanae (strain DSM 3091) Length = 283 Score = 61.7 bits (143), Expect = 2e-08 Identities = 52/213 (24%), Positives = 94/213 (44%), Gaps = 25/213 (11%) Query: 48 RAIIAPHXXXXXXXXXXXXXXRQVSPV-VVKRIFILGPSHHVRIAGCALSSLDKYQTPLY 106 +A I PH ++ + + I+GP+H +L++ + +QTP+ Sbjct: 45 KAAIVPHAGYIYSGKTASYAYGDIARSGICDTVVIIGPNHTGYGDDISLTTSNTWQTPIG 104 Query: 107 DLTIDKQIYAELEATRQFDRMDEQTDENEHSIEMHLPYIAKVMEEYKTSFTIIPILVGSL 166 D+ +D + ELE + EHSIE+ LP++ + K SF I+PI++ Sbjct: 105 DVCVDSEFNNELEKINSNITFSPEAHIKEHSIEVELPFLQYISNIQKKSFKIVPIVI--- 161 Query: 167 TPEKEAKYGAILAPYLAD-----PQNLLVI-SSDFCHWGSRFRYTWKDSSRGHIYQSIEW 220 ++ + LA + D +N++V+ S+D H+ + KD Sbjct: 162 -TRQQKNFCVELAHSIYDVSKKLNRNIMVVASTDLTHYENATSAKNKD------------ 208 Query: 221 LDKLGMDLIEKMDPKAFTDYLNKYGNTICGRHP 253 +K+ + IE MD + + +NKY T+CG P Sbjct: 209 -EKI-LKSIENMDIDSLLNNINKYNITMCGYGP 239 >UniRef50_A7DR31 Cluster: Putative uncharacterized protein; n=1; Candidatus Nitrosopumilus maritimus SCM1|Rep: Putative uncharacterized protein - Candidatus Nitrosopumilus maritimus SCM1 Length = 275 Score = 60.9 bits (141), Expect = 4e-08 Identities = 49/217 
(22%), Positives = 93/217 (42%), Gaps = 17/217 (7%) Query: 50 IIAPHXXXXXXXXXXXXXXRQVSPVVVKRIFILGPSHHVRIAGCALSSLDKYQTPLYDLT 109 +I+PH + +S + + ILGP+H A +++TPL + Sbjct: 46 VISPHAGYVYSGPTACYSYKAISSKNPELVIILGPNHFGVGKDVATMVNAQWETPLGLVD 105 Query: 110 IDKQIYAELEATRQFDRMDEQTDENEHSIEMHLPYIAKVMEEYKTSFTIIPILVGSLTPE 169 +D + E+ ++ +DE + +HS+E+ +P + + E F I+PI++ + E Sbjct: 106 VDSEAAKEIANNSKYIEIDEFSHSRDHSLEVQIPMLQSIFSE---KFKILPIILRDQSLE 162 Query: 170 KEAKYGAILAPYLADPQNLLVISSDFCHWGSRFRYTWKDSSRGHIYQSIEWLDKLGMDLI 229 G +A ++V SSDF H ++++S H DK ++ I Sbjct: 163 MAKDVGNAVAQIAKSRNTMIVASSDFTH--------YEENSFAHSQ------DKALIEPI 208 Query: 230 EKMDPKAFTDYLNKYGNTICGRHPIGVLLQAISKLSS 266 +MD + F L + T CG + ++ A L + Sbjct: 209 LEMDVEKFYSVLMEKRVTACGYGAMASVMIACKNLGA 245 >UniRef50_O67039 Cluster: UPF0103 protein aq_890; n=2; Aquifex aeolicus|Rep: UPF0103 protein aq_890 - Aquifex aeolicus Length = 267 Score = 60.5 bits (140), Expect = 5e-08 Identities = 42/171 (24%), Positives = 79/171 (46%), Gaps = 6/171 (3%) Query: 28 ELSRQLDLWLSKADLTHGPARAIIAPHXXXXXXXXXXXXXXRQVSPVVVKRIFILGPSHH 87 EL++ +DL +AI+ PH +++ + +++ +LGP+H Sbjct: 19 ELNKLMDLLCGFEPKEKIKPKAILVPHAGYIYSGKTACEVYKRIE--IPEKVVLLGPNHT 76 Query: 88 VRIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDEQTDENEHSIEMHLPYIAK 147 ++ S D ++TP + ID ++ ++ + DE EHS+E+ LP++ + Sbjct: 77 GLGKPISVYSGDAWETPYGVVEIDGELREKI-LKYPYANPDEYAHLYEHSLEVQLPFLQR 135 Query: 148 VMEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYLADPQNLLVISSDFCHW 198 + F I+PI+V + E +G L L + L+VISSD H+ Sbjct: 136 YA---RREFKILPIVVTFVEYEVAKDFGRFLGEVLKEEDALIVISSDMSHY 183 >UniRef50_Q5SHL9 Cluster: Putative uncharacterized protein TTHA1711; n=8; Bacteria|Rep: Putative uncharacterized protein TTHA1711 - Thermus thermophilus (strain HB8 / ATCC 27634 / DSM 579) Length = 456 Score = 60.1 bits (139), Expect = 7e-08 Identities = 43/153 (28%), Positives = 72/153 (47%), Gaps = 10/153 (6%) Query: 48 RAIIAPHXXXXXXXXXXXXXXRQVSPV--VVKRIFILGPSHHVRIAGCALSSLDKYQTPL 105 R +++PH R +S +R+F+LGPSH V G A ++TPL Sbjct: 41 
RGVLSPHAGYAYAGRVMAEAFRALSAWRGKARRVFLLGPSHFVAFPGVAFFPYRAWRTPL 100 Query: 106 YDLTIDKQIYAELEATRQFDRMDEQTDENEHSIEMHLPYIAKVMEEYKTSFTIIPILVGS 165 ++ +D + L R + EHS+E+ LP++ + + I+P+L G Sbjct: 101 GEVAVDLEGGRRLLGQGAPFRAYREPFLEEHSLEVLLPFLQVALPQ----TPILPLLFGE 156 Query: 166 LTPEKEAKYGAILAPYLADPQNLLVISSDFCHW 198 + P + A+ L P L P++L+V SSD H+ Sbjct: 157 VDPGEVAE---ALLPELG-PKDLVVASSDLSHY 185 >UniRef50_Q57846 Cluster: UPF0103 protein MJ0403; n=8; Euryarchaeota|Rep: UPF0103 protein MJ0403 - Methanococcus jannaschii Length = 287 Score = 60.1 bits (139), Expect = 7e-08 Identities = 51/202 (25%), Positives = 95/202 (47%), Gaps = 19/202 (9%) Query: 69 RQVSPVVVKRIFILGPSHHVRIAGCALSSLDK-YQTPLYDLTIDKQIYAELEATRQFDRM 127 ++V + + ILGP+H G +S +D ++TPL D+ D++ EL + + Sbjct: 73 KRVDALEETTVVILGPNHTG--LGSGVSVMDGIWRTPLGDVKCDEEFVEELWRKCEIVDL 130 Query: 128 DEQTDENEHSIEMHLPYIAKVMEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYLADPQN 187 DE NEHSIE+ LP++ + F I+PI + E + G +A + Sbjct: 131 DETAHLNEHSIEVQLPFLKHLELLNIAKFKIVPICMMFQDYETAVEVGYFIAKIAKELNR 190 Query: 188 LLVI--SSDFCHWGSRFRYTWKDSSRGHIYQSIEWLDKLGMDLIEKMDPKAFTDYLNKYG 245 +V+ SSD H+ + + KD+ + D++E + + + D +N Y Sbjct: 191 RIVVIASSDLTHYEPQEIASKKDAI-------------VIKDILEMNEKELYEDVVN-YN 236 Query: 246 NTICGRHPIGVLLQAISKLSSQ 267 ++CG P+ +L+A+ L ++ Sbjct: 237 ISMCGYGPVIAMLKAMKTLGAE 258 >UniRef50_Q2LQ76 Cluster: Hypothetical cytosolic protein; n=1; Syntrophus aciditrophicus SB|Rep: Hypothetical cytosolic protein - Syntrophus aciditrophicus (strain SB) Length = 278 Score = 58.8 bits (136), Expect = 2e-07 Identities = 56/241 (23%), Positives = 95/241 (39%), Gaps = 28/241 (11%) Query: 45 GPARAIIAPHXXXXXXXXXXXXXXRQVSPVVVKRIFILGPSHHVRIAGCALSSLDKYQTP 104 G ++APH +++ +F++GPSH G +L Y+TP Sbjct: 41 GRILGLVAPHAGYMYSGQVAAHAYKEIKGQTYDVVFVIGPSHRAFFRGVSLFKEGGYETP 100 Query: 105 LYDLTIDKQIYAELEATRQFDRMDEQTDEN--EHSIEMHLPYIAKVMEEYKTSFTIIPIL 162 L + + + + A L Q R+ D + EHS+E+ LP++ + E F+ +P++ Sbjct: 101 LGIVDVHEDMAARL--LEQDPRIAFLPDVHLQEHSVEIQLPFLQVALGE----FSFVPLI 154 Query: 163 
VGSLTPEKEAKYGAILAPYLADPQNLLVISSDFCHWGSRFRYTWKDSSRGHIYQSIEWLD 222 +G E + + Q L+V SSD H+ H Y+ +D Sbjct: 155 MGDQDYETCRVLADAIVNCCGNKQVLIVGSSDLSHY--------------HGYEQAVRMD 200 Query: 223 KLGMDLIEKMDPKAFTDYLNKYGNTICGRHPIGVLLQAISKLSSQSNAPKMSLKFLKYAQ 282 ++ + KMD L+ CG P V + +L + A LKYA Sbjct: 201 SRILEHLRKMDECGLIRDLSSGTGEACGGGPAAVTMMVARQLGADKAA------VLKYAN 254 Query: 283 S 283 S Sbjct: 255 S 255 >UniRef50_A7IAG7 Cluster: Putative uncharacterized protein; n=1; Candidatus Methanoregula boonei 6A8|Rep: Putative uncharacterized protein - Methanoregula boonei (strain 6A8) Length = 262 Score = 58.8 bits (136), Expect = 2e-07 Identities = 42/155 (27%), Positives = 68/155 (43%), Gaps = 14/155 (9%) Query: 46 PARAIIAPHXXXXXXXXXXXXXXRQVSPVVVKRIFILGPSHHVRIAGCALSSLDKYQTPL 105 PA I++PH ++ P ++GPSHH + +S ++TPL Sbjct: 36 PALGIVSPHAGYIYSGQIAAYAFSRIDPGFSGTFVVIGPSHHGYLTS---ASAIPWETPL 92 Query: 106 YDLTIDKQIYAELEATRQFDRMDEQTDENEHSIEMHLPYIAKVMEEYKTSFTIIPILVGS 165 + ID + L+ +DE + E EHSIE+ LP+I + ++PI++G Sbjct: 93 GLVEIDAEFIDALDIP-----VDEPSHEEEHSIEVQLPFIRHRFPRAR----VVPIMMGE 143 Query: 166 LTPEKEAKYG--AILAPYLADPQNLLVISSDFCHW 198 P A + A L + +V SSDF H+ Sbjct: 144 QDPAHAAAVAEKIVAAQRLTKKEIRVVASSDFSHY 178 >UniRef50_A5UVY3 Cluster: Putative uncharacterized protein; n=2; Roseiflexus|Rep: Putative uncharacterized protein - Roseiflexus sp. 
RS-1 Length = 284 Score = 57.2 bits (132), Expect = 5e-07 Identities = 45/177 (25%), Positives = 79/177 (44%), Gaps = 9/177 (5%) Query: 27 SELSRQLDLWLSKAD--LTHGPARAIIAPHXXXXXXXXXXXXXXRQVSPVVVKRIFILGP 84 + L ++D +L++A+ + G ++APH V + I I P Sbjct: 18 AHLQHEIDRYLAQAEPPVLPGKVWGVLAPHAGVRYSGPIAAWAFACVRGRTPEIIVIASP 77 Query: 85 SHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAEL-EATRQFDR--MDEQTDENEHSIEMH 141 H + Y+TPL + +D A+L EA R+ + + ++EH++E+ Sbjct: 78 WHRGGPTPLITTGHTAYETPLGIVPVDNNAIAQLDEALRRRAGFGLTPRRHDDEHAVEIE 137 Query: 142 LPYIAKVMEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYLADPQNLLVISSDFCHW 198 LP++ +V SF ++P+++ + A GA LA L LLV SSD H+ Sbjct: 138 LPFLQRVFG----SFWLLPVMLADQSAVTSAALGAALAETLRGRDALLVASSDLSHY 190 >UniRef50_O67355 Cluster: UPF0103 protein aq_1336; n=1; Aquifex aeolicus|Rep: UPF0103 protein aq_1336 - Aquifex aeolicus Length = 374 Score = 56.8 bits (131), Expect = 6e-07 Identities = 45/158 (28%), Positives = 68/158 (43%), Gaps = 7/158 (4%) Query: 47 ARAIIAPHXXXXXXXXXXXXXXRQVSPVVVKRIFILGPSHHVRIAGCALSSLDKYQTPLY 106 AR I+ PH + + +LG SH+ ++ LD +TPL Sbjct: 136 ARGILVPHMDLRVASGVYGSVYSAIKENEYDTVVLLGVSHYFHETPFSVLPLD-LRTPLG 194 Query: 107 DLTIDKQIYAELEATRQFD-RMDEQTDENEHSIEMHLPYIAKVMEEYKTSFTIIPILVGS 165 DL +D + EL+ +D D +NEHSIE ++ + E K +IP +V Sbjct: 195 DLKVDIERVEELQKMFDYDLSHDVLAYKNEHSIEFQTIFLKYLFPEVK----VIPAIVSY 250 Query: 166 LTPEKEAKYGAILAPYLADPQNLLVISS-DFCHWGSRF 202 + + + L D QN L+ISS DF H G +F Sbjct: 251 GDTKSLKEIAHKITKVLEDSQNPLIISSVDFSHVGRKF 288 >UniRef50_A2BMN4 Cluster: Predicted dioxygenase; n=1; Hyperthermus butylicus DSM 5456|Rep: Predicted dioxygenase - Hyperthermus butylicus (strain DSM 5456 / JCM 9403) Length = 301 Score = 56.4 bits (130), Expect = 9e-07 Identities = 45/162 (27%), Positives = 79/162 (48%), Gaps = 10/162 (6%) Query: 101 YQTPLYDLTIDKQIYAELEATRQFDRMDEQTDENEHSIEMHLPYIAKVMEEYKTSFTIIP 160 + TPL + D + L+ D EHSIE+ LP++ + Y +F ++P Sbjct: 110 WATPLGTVETDIEFIELLKKLYPRLEDDYLAHMREHSIEVELPFLQYI---YGNNFKLVP 166 Query: 161 
ILVGSLTPEKEAKYGAILAPYLADP--QNLLVI-SSDFCHWGSRFRYTWKDSSRGHIYQS 217 I+V + E+ A+ A A+ + +LVI SSDF H G + Y + + ++ Sbjct: 167 IVVKEPS-ERMAREMAEAVKRAAEELGRRILVIASSDFTHHGYMYDYVLFTEN---VREN 222 Query: 218 IEWLDKLGMDLIEKMDPKAFTDYLNKYGNTICGRHPIGVLLQ 259 + LD ++ I ++D K F + + +YG T+CG I L++ Sbjct: 223 VAKLDMAIIEHILRLDTKGFLETIYRYGATVCGYGAIATLIE 264 >UniRef50_Q8ZYE1 Cluster: UPF0103 protein PAE0818; n=5; Thermoproteaceae|Rep: UPF0103 protein PAE0818 - Pyrobaculum aerophilum Length = 281 Score = 56.4 bits (130), Expect = 9e-07 Identities = 50/188 (26%), Positives = 90/188 (47%), Gaps = 24/188 (12%) Query: 81 ILGPSHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDEQ--TDENEHSI 138 I+GP+H+ A A+ ++TPL + +D+++ AE+ T F +++ EHS+ Sbjct: 81 IVGPNHYGIGAPVAIMKSGAWETPLGRVEVDREL-AEV-ITSHFKEVEDDFYAFSKEHSV 138 Query: 139 EMHLPYIAKVMEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYLAD--PQNLLVISSDFC 196 E+ +P+I + Y I+PI++ T + G +A L + + ++ SSDF Sbjct: 139 EVQVPFI----QYYFGDVKIVPIVMWRQTLSTSRELGRAIAKALKEYGRKAYVIASSDFN 194 Query: 197 HWGSRFRYTWKDSSRGHIYQSIEWLDKLGMDLIEKMDPKAFTDYLNKYGNTICGRHPIGV 256 H+ T K D++ + I K+D + +K+ +ICG PIGV Sbjct: 195 HYEPHDITTRK--------------DEMAISKILKLDEAGLFEISSKFDISICGIGPIGV 240 Query: 257 LLQAISKL 264 L+ A +L Sbjct: 241 LIAAAKEL 248 >UniRef50_Q8G3N3 Cluster: Putative uncharacterized protein; n=1; Bifidobacterium longum|Rep: Putative uncharacterized protein - Bifidobacterium longum Length = 596 Score = 56.0 bits (129), Expect = 1e-06 Identities = 53/215 (24%), Positives = 88/215 (40%), Gaps = 27/215 (12%) Query: 4 RPGIDHG-FFQQPGALIIVLLNSGSELSRQLDLWLSKADLTHGPARAIIAPHXXXXXXXX 62 RP G F+ + L+N + R+L L + L G RA+I PH Sbjct: 49 RPSAVAGSFYPADRTALKQLINQQLDYGRKL-LQQLEPTLPAGVPRAVIVPHAGYIYSGT 107 Query: 63 XXXXXXRQVSPV--VVKRIFILGPSHHVRIAGCALSSLDKYQTPLYDLTID--------- 111 + V R I+GP+H V + G A S+ ++TPL + +D Sbjct: 108 AAALAYALLERGRGSVTRAVIVGPTHRVAVRGVACSTAAAFETPLGTVPVDIAAERKALG 167 Query: 112 --------KQIYAELEATRQFDRMDEQTDENEHSIEMHLPYIAKVMEEYKTSFTIIPILV 163 +A A ++ T EH++E+ +P++ V+ TI+P+ Sbjct: 168 
LSVNEPLRSGTHARPGAPAPAMIVNAPTHAQEHAVEVQIPFLQTVL---GPDLTIVPLNA 224 Query: 164 GSLTPEKEAKYGAILAPYLADPQNLLVISSDFCHW 198 G TP+ + G +L P+ ++VISSD H+ Sbjct: 225 GDATPQ---EVGDVLRALWGGPETVIVISSDLSHY 256 >UniRef50_Q30X41 Cluster: Putative uncharacterized protein; n=2; Deltaproteobacteria|Rep: Putative uncharacterized protein - Desulfovibrio desulfuricans (strain G20) Length = 298 Score = 55.6 bits (128), Expect = 2e-06 Identities = 61/254 (24%), Positives = 110/254 (43%), Gaps = 31/254 (12%) Query: 24 NSGSELSRQLDLWLSKADLT-HGPARAIIAPHXXXXXXXXXXXXXXRQVSPVVVKRIFIL 82 +S +EL L +L +A P + PH Q ++ + +F+L Sbjct: 33 DSPAELQSMLRAYLDEAAAPPQKPTLLAMVPHAGYVFSGAVAGCTLAQA--MLPQTLFVL 90 Query: 83 GPSHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDEQTDENEHSIEMHL 142 GP+H R +G A+ ++TPL D+ +D + AE A R D EHS+E+ L Sbjct: 91 GPNHTGRGSGIAVWPEGVWRTPLGDVPVDNALAAEFCALCAPARPDTLAHSAEHSLEVVL 150 Query: 143 PYIAKVMEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYL------ADP--QN--LLVIS 192 P++ + + I+P+ +G + GA +A + ADP QN +V+S Sbjct: 151 PFLQLRVPRVR----IVPVSIGDPSLAVLTAAGAAMAQIIRRAAQTADPGGQNRIAMVVS 206 Query: 193 SDFCHWGSRFRYTWKDSSRGHIYQSIEWLDKLGMDLIEKMDPKAFTDYLNKYGNTICGRH 252 SD H + +D +R LD + ++ + ++P+ + + ++CG Sbjct: 207 SDMTH------FLPQDEARR--------LDAMALEQVTALNPQGLYTTVREKRISMCGVL 252 Query: 253 PIGVLLQAISKLSS 266 P+ L+A L + Sbjct: 253 PMTAALEACRLLGA 266 >UniRef50_A5UN65 Cluster: Predicted dioxygenase; n=1; Methanobrevibacter smithii ATCC 35061|Rep: Predicted dioxygenase - Methanobrevibacter smithii (strain PS / ATCC 35061 / DSM 861) Length = 282 Score = 55.2 bits (127), Expect = 2e-06 Identities = 50/230 (21%), Positives = 103/230 (44%), Gaps = 22/230 (9%) Query: 50 IIAPHXXXXXXXXXXXXXXRQVSPVVVKRIFIL-GPSHHVRIAGCALSSLDKYQTPLYDL 108 ++ PH +++ +FI+ GP+H + ++ + ++ TPL ++ Sbjct: 51 VMVPHAGFQYSGTIAAHSYCELAKNGFPEVFIIIGPNHTGLGSEVSVFNKGEWITPLGNI 110 Query: 109 TIDKQIYAELEATRQFDRMDEQTDENEHSIEMHLPYIAKVMEEYKTSFTIIPILVGSLTP 168 +D++ L + F D EHSIE+ LP+ ++ + F I+P+++GS T Sbjct: 111 QVDEEFADTLISFSDFASADFAAHMREHSIEVQLPF----LQYFSNDFKIVPVVLGSQTI 
166 Query: 169 EKEAKYGAIL--APYLADPQNLLVISSDFCHWGSRFRYTWKDSSRGHIYQSIEWLDKLGM 226 A + A D ++ SSD H+ ++ R D G + + IE +D+ Sbjct: 167 SAANDLAAAILKAGEKLDKSYCVIASSDLSHFNTQERANKVD---GFVLEDIENMDE--F 221 Query: 227 DLIEKMDPKAFTDYLNKYGNTICGRHPIGVLLQAISKLSSQSNAPKMSLK 276 L+E+ + +Y T+CG P+ + +SK+ ++ + ++ K Sbjct: 222 KLLEE---------IIQYNITMCGYGPV-MTTMILSKMCGKNTSEILAYK 261 >UniRef50_Q96YW6 Cluster: UPF0103 protein ST2062; n=4; Sulfolobaceae|Rep: UPF0103 protein ST2062 - Sulfolobus tokodaii Length = 284 Score = 54.4 bits (125), Expect = 3e-06 Identities = 49/164 (29%), Positives = 85/164 (51%), Gaps = 14/164 (8%) Query: 79 IFILGPSHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDEQTDENEHSI 138 + ILGP+H + +L K++TPL ++ ID+QI +L + +DE+ EHSI Sbjct: 81 VIILGPNHTGLGSYVSLWPKGKWKTPLGEIEIDEQIAMDLVRESEVIDIDEKAHLYEHSI 140 Query: 139 EMHLPYIAKVMEEYKTSFTIIPILVGSLTPEKEAKYGA-----ILAPYLADPQNLLVISS 193 E+ +P++ + KT I+PI++ TPE ++Y A I+ Y D +++ SS Sbjct: 141 EVQVPFLQYFFDS-KTK--IVPIVIMMQTPE-ISEYLAEGISKIMQKY-KDKDIVVIASS 195 Query: 194 DFCHWGSRFRYTWKDSSRGHIYQSIEWLDKLGM-DLIEKMDPKA 236 D H+ + KD+ + I LD G+ +++E+ D A Sbjct: 196 DMNHYEPHEKTIEKDNM---AIEKILSLDYKGLFNVVEEKDVTA 236 >UniRef50_O26151 Cluster: UPF0103 protein MTH_45; n=1; Methanothermobacter thermautotrophicus str. 
Delta H|Rep: UPF0103 protein MTH_45 - Methanobacterium thermoautotrophicum Length = 277 Score = 54.0 bits (124), Expect = 5e-06 Identities = 49/216 (22%), Positives = 91/216 (42%), Gaps = 21/216 (9%) Query: 48 RAIIAPHXXXXXXXXXXXXXXRQ-VSPVVVKRIFILGPSHHVRIAGCALSSLDKYQTPLY 106 + +IAPH + VS + + I+ P+H +G +L ++TPL Sbjct: 46 KGVIAPHAGYMYSGPVAAHAYHELVSDGIPGTLVIICPNHTGMGSGVSLMQQGAWETPLG 105 Query: 107 DLTIDKQIYAELEATRQFDRMDEQTDENEHSIEMHLPYIAKVMEEYKTSFTIIPILVGSL 166 + ID ++ + +DE EHS E+H+P+I + + +F I+P+ + Sbjct: 106 TVEIDSELAEAIVRESGIIDLDETAHLAEHSCEVHVPFI----QYFTDNFRIVPVTMWMQ 161 Query: 167 TPEKEAKYGAILAPYLADP--QNLLVISSDFCHWGSRFRYTWKDSSRGHIYQSIEWLDKL 224 E A G +A + + ++ S+DF H Y+ +D + E D+ Sbjct: 162 GHETAADVGHAVASAIRETGRDAAVIASTDFTH------YSPQDIA--------EATDRR 207 Query: 225 GMDLIEKMDPKAFTDYLNKYGNTICGRHPIGVLLQA 260 +D I MD +++ T+CG P+ + A Sbjct: 208 IIDRITAMDDTGMYGVISELNATMCGYGPVAATIIA 243 >UniRef50_Q1NJL5 Cluster: Putative uncharacterized protein; n=2; delta proteobacterium MLMS-1|Rep: Putative uncharacterized protein - delta proteobacterium MLMS-1 Length = 267 Score = 52.0 bits (119), Expect = 2e-05 Identities = 51/177 (28%), Positives = 82/177 (46%), Gaps = 15/177 (8%) Query: 46 PARAIIAPHXXXXXXX--XXXXXXXRQVSPVVVKRIFILGPSHHVRIAGCALSSLDKYQT 103 PA A++ PH Q+ P V+ +LGP+HH A A+ ++ Sbjct: 35 PALAVVMPHAGYIFSGPVAGATVAAAQIPPEVI----VLGPNHHGLGATAAVMDQGAWEM 90 Query: 104 PLYDLTIDKQIYAE-LEATRQFDRMDEQTDENEHSIEMHLPYIAKVMEEYKTSFTIIPIL 162 P + I+ + A+ LE F + DE EHS+E+ +P++ E + I+PI Sbjct: 91 PWGTVPINASLAAKVLEHCPDF-QADELAHRREHSLEVLVPFLHYRQPELQ----IVPIC 145 Query: 163 VGSLTPEKEAKYGAILAPYL-ADPQN-LLVISSDFCHWGSRFRYTWKDS-SRGHIYQ 216 + + + GA LA + A P+ LL S+D H+ SR T KD+ + GHI + Sbjct: 146 LSRSDYQFCQRAGAGLAAAIKAWPEPVLLAASTDMSHFESREATTTKDNLAIGHILE 202 >UniRef50_Q8TT38 Cluster: UPF0103 protein MA_0601; n=4; Methanosarcinales|Rep: UPF0103 protein MA_0601 - Methanosarcina acetivorans Length = 267 Score = 52.0 bits (119), Expect = 2e-05 Identities = 48/186 (25%), Positives = 88/186 (47%), Gaps = 
22/186 (11%) Query: 81 ILGPSHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDEQTDENEHSIEM 140 + GP+H + +LS + ++TPL + +D ++ A+ D DE EHSIE+ Sbjct: 70 LFGPNHTGYGSPVSLSR-ETWKTPLGTIDVDLEL-ADGFLGSIVDT-DELGHTYEHSIEV 126 Query: 141 HLPYIAKVMEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYLAD--PQNLLVISSDFCHW 198 LP++ + F I+PI +G + + G+++A +++ + +++ SSDF H+ Sbjct: 127 QLPFL---QYRFGRDFKILPICMGMQDKDTAVEVGSLVADLVSESGKRAVIIASSDFTHY 183 Query: 199 GSRFRYTWKDSSRGHIYQSIEWLDKLGMDLIEKMDPKAFTDYLNKYGNTICGRHPIGVLL 258 + H ++ D +D I K+D D L + ++CG PI +L Sbjct: 184 ----------ETAEHARET----DSEVIDAILKLDVPGMYDSLYRRNASVCGYGPIAAML 229 Query: 259 QAISKL 264 A KL Sbjct: 230 SASQKL 235 >UniRef50_Q74NK0 Cluster: NEQ347; n=1; Nanoarchaeum equitans|Rep: NEQ347 - Nanoarchaeum equitans Length = 266 Score = 50.0 bits (114), Expect = 7e-05 Identities = 52/178 (29%), Positives = 87/178 (48%), Gaps = 13/178 (7%) Query: 77 KRIFILGPSHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDEQTDENEH 136 K I+G ++H + A SL +TPL ID++ A + FD D++ EH Sbjct: 61 KTYAIIG-TNHTGLGSLANVSLMPIETPLGIAKIDEEA-AMIFMKNGFD-YDDRPFLYEH 117 Query: 137 SIEMHLPYIAKVMEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYLADPQNLLVISSDFC 196 S+E +P++ + + +F I+P ++ ++ + + G LA L + L V SSDF Sbjct: 118 SVENQIPFLQYL---HGDNFLIVPSVMFNVYRFAK-EVGKQLALELPERVRL-VASSDFT 172 Query: 197 HWGSRFRYTWKDSSRGHIYQSIEWLDKLGMDLIEKMDPKAFTDYLNKYGNTICGRHPI 254 H+G + Y K S G + ++ LD + I K+D F + + G T+CG PI Sbjct: 173 HYGDIYGY--KPFSDG---RKVKELDMKLISYILKLDSLGFYKEIVRTGATVCGWGPI 225 >UniRef50_Q30PF9 Cluster: Putative uncharacterized protein; n=1; Thiomicrospira denitrificans ATCC 33889|Rep: Putative uncharacterized protein - Thiomicrospira denitrificans (strain ATCC 33889 / DSM 1351) Length = 262 Score = 49.2 bits (112), Expect = 1e-04 Identities = 36/152 (23%), Positives = 62/152 (40%), Gaps = 9/152 (5%) Query: 47 ARAIIAPHXXXXXXXXXXXXXXRQVSPVVVKRIFILGPSHHVRIAGCALSSLDKYQTPLY 106 +R +I PH R + VK+ ++GPSH V G +L Y+TP Sbjct: 41 SRVVIVPHAGYIYSGYSANVAYRVLKKSGVKKFLVIGPSHRVGFEGISLGDFSSYETPFG 100 Query: 107 
DLTIDKQIYAELEATRQFDRMDEQTDENEHSIEMHLPYIAKVMEEYKTSFTIIPILVGSL 166 + + EL T F + EHS E+ P+I + Y +++ ++ + Sbjct: 101 AIPASLDLVEELSNT--FLLSCYRDTHFEHSTEVQFPFI----KYYIEGASVVELVYSYM 154 Query: 167 TPEKEAKYGAILAPYLADPQNLLVISSDFCHW 198 P +K I+ L ++IS+D H+ Sbjct: 155 KPSNLSK---IIDFALNHKDVGIIISTDLSHF 183 >UniRef50_A7HMH8 Cluster: Putative uncharacterized protein; n=1; Fervidobacterium nodosum Rt17-B1|Rep: Putative uncharacterized protein - Fervidobacterium nodosum Rt17-B1 Length = 267 Score = 49.2 bits (112), Expect = 1e-04 Identities = 51/190 (26%), Positives = 85/190 (44%), Gaps = 22/190 (11%) Query: 77 KRIFILGPSHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAEL-EATRQFDRMDEQTDENE 135 K I I GP+H ++ S +QTPL ++ ++ +I +L + T F DE E Sbjct: 71 KNIIIFGPNHTGYGELVSVWSEGIWQTPLGNIEVNSEIADKLIDNTVIFS--DEMAHLYE 128 Query: 136 HSIEMHLPYIAKVMEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYLAD-PQNLLVISSD 194 HSIE+ LP + E+K IIP+ + +K L + + P L+V SSD Sbjct: 129 HSIEVQLPLLQYAFGEFK----IIPVCMMDQRLSTVSKIVDKLKQIIKEYPDTLVVASSD 184 Query: 195 FCHWGSRFRYTWKDSSRGHIYQSIEWLDKLGMDLIEKMDPKAFTDYLNKYGNTICGRHPI 254 F H+ ++ DKL ++ I + D + + + K+ T+CG P+ Sbjct: 185 FNHYDP--------------HEITLEKDKLAIEKILEGDIEGLYERIKKHNITMCGPGPV 230 Query: 255 GVLLQAISKL 264 V+ S + Sbjct: 231 AVVRSLFSNV 240 >UniRef50_Q9WXU2 Cluster: UPF0103 protein TM_0087; n=2; Thermotoga|Rep: UPF0103 protein TM_0087 - Thermotoga maritima Length = 277 Score = 48.4 bits (110), Expect = 2e-04 Identities = 43/181 (23%), Positives = 84/181 (46%), Gaps = 19/181 (10%) Query: 79 IFILGPSHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDEQTDENEHSI 138 + I+GP+H + +++TPL + ++++ + + ++ D + EHSI Sbjct: 81 VVIIGPNHTGLGRPVGVWPEGEWETPLGTVPVNERAVEIVLSNSRYAEEDFMSHIREHSI 140 Query: 139 EMHLPYIAKVMEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYLAD-PQNLLVISSDFCH 197 E+ +P++ V E +I+PI + +P + LA +A+ P L++ S+D H Sbjct: 141 EVQIPFLQFVFGE----VSIVPICLMDQSPAVAEDLASALAKLVAEFPGVLIIASTDLNH 196 Query: 198 WGSRFRYTWKDSSRGHIYQSIEWLDKLGMDLIEKMDPKAFTDYLNKYGNTICGRHPIGVL 257 + + KDS +I ++ IE MDP +YL + ++CG + L Sbjct: 197 
YEDQRTTLRKDS---YI-----------IEAIEGMDPSLLYEYLVREDISMCGYGGVATL 242 Query: 258 L 258 L Sbjct: 243 L 243 >UniRef50_A3JXY8 Cluster: Predicted dioxygenase; n=1; Sagittula stellata E-37|Rep: Predicted dioxygenase - Sagittula stellata E-37 Length = 450 Score = 48.0 bits (109), Expect = 3e-04 Identities = 43/181 (23%), Positives = 71/181 (39%), Gaps = 10/181 (5%) Query: 29 LSRQLDLWLSKADLTHGPARAIIAPHXXXXXXXXXXXXXXRQVSPVVVKRIFILGPSHHV 88 L+ ++ L A P A+I+PH K I +L PSH Sbjct: 23 LAAEVAALLDGAPTAPEPPVAVISPHAGYRFSGRLTARALATTREAAPKSIAVLSPSHRH 82 Query: 89 RIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDEQTDENEHSIEMHLPYIAKV 148 G A S D + P ID A + A +++ + EH +E+ LP V Sbjct: 83 AFDGIAAPSQDAFALPTGTQRIDIATRAAMVAAGLI-HVEDAAHDQEHGVEVQLP----V 137 Query: 149 MEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYLADPQNLLVISSDFCHWGSRFRYTWKD 208 + ++P+++G ++ A L L + L+V+SSD H+ +R KD Sbjct: 138 LHALHPDVPVLPLVIGRTGNDRV----AALVDALPE-GTLIVLSSDLSHFLTRDDARAKD 192 Query: 209 S 209 + Sbjct: 193 A 193 >UniRef50_Q1IL90 Cluster: Putative uncharacterized protein; n=2; Bacteria|Rep: Putative uncharacterized protein - Acidobacteria bacterium (strain Ellin345) Length = 271 Score = 47.6 bits (108), Expect = 4e-04 Identities = 43/192 (22%), Positives = 85/192 (44%), Gaps = 20/192 (10%) Query: 77 KRIFILGPSHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDEQTDENEH 136 KR IL P+H A+ ++TPL D ID ++ +L A D EH Sbjct: 68 KRFVILCPNHTGAGHPLAVMREGSWRTPLGDAAIDAELADQLLAAFPLTSEDADAHRTEH 127 Query: 137 SIEMHLPYIAKVMEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYL--ADPQNLLVISSD 194 ++E+ LP++ ++ +F +P+ VG+ + + G +A + A + +++ SSD Sbjct: 128 ALEVQLPFLQILV----PNFRFVPVAVGTGRFDVLSALGESIAKVVQSAAERVMVIASSD 183 Query: 195 FCHWGSRFRYTWKDSSRGHIYQSIEWLDKLGMDLIEKMDPKAFTDYLNKYGNTICGRHPI 254 H+ + K D+L ++ + +D K D +++ ++CG P Sbjct: 184 MNHYENDADTRVK--------------DRLAIERLLALDAKGLYDVVHEKNISMCGYGPA 229 Query: 255 GVLLQAISKLSS 266 +L A ++ + Sbjct: 230 VAMLTAAKRVGA 241 >UniRef50_A2BK85 Cluster: Universally conserved protein; n=3; Desulfurococcales|Rep: Universally conserved protein - Hyperthermus 
butylicus (strain DSM 5456 / JCM 9403) Length = 297 Score = 46.8 bits (106), Expect = 7e-04 Identities = 25/89 (28%), Positives = 46/89 (51%), Gaps = 3/89 (3%) Query: 81 ILGPSHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDEQTDENEHSIEM 140 I+GP+H A ++ + TPL +L +D ++ L + +DE+ EHS+E+ Sbjct: 97 IVGPNHTGLGASVSVYPGTAWSTPLGELQVDTELARVLVKASSYAELDEKAHLYEHSVEV 156 Query: 141 HLPYIAKVMEEYKTSFTIIPILVGSLTPE 169 LP++ + + I+P++V TPE Sbjct: 157 QLPFLQYL---FNARVRILPVVVYEQTPE 182 >UniRef50_Q2IES1 Cluster: Putative uncharacterized protein; n=1; Anaeromyxobacter dehalogenans 2CP-C|Rep: Putative uncharacterized protein - Anaeromyxobacter dehalogenans (strain 2CP-C) Length = 301 Score = 46.0 bits (104), Expect = 0.001 Identities = 61/262 (23%), Positives = 98/262 (37%), Gaps = 19/262 (7%) Query: 12 FQQPGALIIVLLNSGSELSRQLDLWLS----KADLTHGPARAIIAPHXXXXXXXXXXXXX 67 F+ P V ++ L R LD WL+ P ++APH Sbjct: 19 FRPPACAGAVYPDAPGALRRALDRWLALPAGAPAAPPAPRGVVVAPHIDYARGAAGYAHA 78 Query: 68 XRQVSPVVVKRIFILGPSHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRM 127 R + I G +H L+ LD Y TPL + D+ + L D + Sbjct: 79 YRALEASRADLFVIFGTAHATPPRPFTLTRLD-YGTPLGPVRTDRALVDALCGALGEDAL 137 Query: 128 --DEQTDENEHSIEMHLPYIAKVMEEYKTSFTIIPIL---VGSLT--PEKEAKYGAILAP 180 DE +EHSIE+ +A + FT++P+L +G L A + LA Sbjct: 138 LGDELCHRDEHSIELQAVVLA---HRLRRPFTVLPVLCSAIGHLADPAAATAPFLDALAR 194 Query: 181 YLADPQNLLVISSDFCHWGSRFRYTWKDSSRGHIYQSIEWLDKLGMDLIEKMDPKAFTDY 240 +A V +D H G R+ + + + ++ D+ + +E DP F Sbjct: 195 AVAGRSVCWVAGADLAHVGPRYGDA-RPPAPAEL-AALAAADRRTLRYVEAGDPAGFHRD 252 Query: 241 LNKYG--NTICGRHPIGVLLQA 260 + G +CG PI L++ Sbjct: 253 AVRDGARRRLCGIAPIYAALRS 274 >UniRef50_Q66Q62 Cluster: Dor2; n=1; Sorangium cellulosum|Rep: Dor2 - Polyangium cellulosum (Sorangium cellulosum) Length = 422 Score = 45.6 bits (103), Expect = 0.002 Identities = 55/201 (27%), Positives = 84/201 (41%), Gaps = 20/201 (9%) Query: 76 VKRIFILGPSHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFD-RMDEQTDEN 134 V +LG SH A+ + TPL L D+++ AEL A +FD R D+ +N Sbjct: 195 
VDTFVLLGTSHAAMRRPYAVCE-KTFATPLGPLEPDREMIAELAAASRFDVREDQYLHKN 253 Query: 135 EHSIEMHLPYIAKVMEEYKTSFTIIPILVG-----------SLTPEKEAKYGAILAPYLA 183 EHSIE ++ ++ S I+PIL G + E+ A+ Sbjct: 254 EHSIEFQAVFVRHLLGGRAAS--IVPILCGLSECQARRRDPAQDDGAESFLRALRDALAK 311 Query: 184 DPQNLLVIS-SDFCHWGSRFRYTWKDSSRGHIYQSIEWLDKLGMDLIEKMD-PKAFTDYL 241 P +LVI+ +D H G RF R ++ D ++ +D P F D Sbjct: 312 RPGRVLVIAGADLAHVGPRFGDPAPLDERQR--TALRDRDLASIERATSIDAPGFFVDVA 369 Query: 242 NKYGN-TICGRHPIGVLLQAI 261 + +CG PI LL+A+ Sbjct: 370 RDLASRRVCGLGPIYTLLRAL 390 >UniRef50_A0LEC6 Cluster: Putative uncharacterized protein; n=1; Syntrophobacter fumaroxidans MPOB|Rep: Putative uncharacterized protein - Syntrophobacter fumaroxidans (strain DSM 10017 / MPOB) Length = 414 Score = 45.6 bits (103), Expect = 0.002 Identities = 60/233 (25%), Positives = 105/233 (45%), Gaps = 23/233 (9%) Query: 46 PARAIIAPHXXXXXXXXXXXXXXRQVSPVVVKRIFI-LGPSHHVRIAGCALSSLDKYQTP 104 P ++APH + + V R +I LG H + AL++ D ++TP Sbjct: 154 PVLGLVAPHIDIQAGGRCFAHAYKAAADSVSPRTWIVLGTGHELVSNYFALTAKD-FETP 212 Query: 105 LYDLTIDKQIYAELEATRQFDRM-DEQTDENEHSIEMHLPYIAKVMEEYKTSFTIIPILV 163 L + D++ A L + + D + E EH++E ++A V K I+P+L Sbjct: 213 LGLVGHDEECCAHLVNSAKRDILAGEYNHVREHTVEFQAVFLAYVQPGAK----IVPLLC 268 Query: 164 GSLTP--EKEAKYGAILAPYLAD---PQNLLVISS-DFCHWGSRF--RYTWKDSS-RGHI 214 E + +Y A L D +++ +++S D H G R+ R+ DS+ + H+ Sbjct: 269 SFSHEDLETDGEYIDHFAGLLRDLVLTRSVGILASVDLAHIGPRYGDRFQPTDSTVKDHM 328 Query: 215 YQSIEWLDKLGMDLIEKMDPKAFTDYLNKYGNT--ICGRHPIGVLLQAISKLS 265 D+ ++ + + D +AF + GN ICG P+ VL QA+S L+ Sbjct: 329 AS-----DRGLVESLRECDAEAFIRQIRLEGNRRKICGVAPLYVLAQALSGLA 376 >UniRef50_A0RY15 Cluster: Dioxygenase; n=1; Cenarchaeum symbiosum|Rep: Dioxygenase - Cenarchaeum symbiosum Length = 273 Score = 45.6 bits (103), Expect = 0.002 Identities = 37/124 (29%), Positives = 60/124 (48%), Gaps = 7/124 (5%) Query: 78 RIFIL-GPSHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDEQTDENEH 136 R+F++ GP+H G A ++ TP + D ELE R + D EH Sbjct: 74 
RLFVMAGPNHWGLGLGIAGIGACRWITPAGYVETDDAGSVELE--RCGIKEDFFAHSKEH 131 Query: 137 SIEMHLPYIAKVMEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYLADPQNLLVISSDFC 196 S+E+ +P +++E+ F I+PIL+ E+ AK G +A ++L+ SSD Sbjct: 132 SLEVIVP----MLQEFFGEFGILPILLSEQGEEQAAKVGGAMARAAKGRDSMLIGSSDLT 187 Query: 197 HWGS 200 H+ S Sbjct: 188 HYES 191 >UniRef50_Q74C45 Cluster: Putative uncharacterized protein; n=7; Desulfuromonadales|Rep: Putative uncharacterized protein - Geobacter sulfurreducens Length = 267 Score = 45.2 bits (102), Expect = 0.002 Identities = 51/220 (23%), Positives = 94/220 (42%), Gaps = 24/220 (10%) Query: 50 IIAPHXXXXXXXXXXXXXXRQVSPVVVKRIFILGPSHHVRIAGCALSSLDKYQTPLYDLT 109 +IAPH + V+ + + ILGP+HH A +L + +PL ++ Sbjct: 39 VIAPHAGYMYSGAIAGAVYGSI--VIPRTVVILGPNHHGLGAAASLYPDGTWLSPLGEVP 96 Query: 110 IDKQIYA-ELEATRQFDRMDEQTDENEHSIEMHLPYIAKVMEEYKTSFTIIPILVGSLTP 168 I++++ + LE Q + D EHS+E+ +P++ + + I+P+ +G Sbjct: 97 IEQRLSSLVLEHVPQAE-PDVIAHRFEHSLEVQVPFLRYL----NSDVAIVPMCLGGGGY 151 Query: 169 EKEAKYGAILAPYLA--DPQNLLVISSDFCHWGSRFRYTWKDSSRGHIYQSIEWLDKLGM 226 + G LA +A + L+V SSD H+ S +S D+ + Sbjct: 152 GWCRQVGEGLARAIAAYGEEVLIVASSDMTHYESA--------------ESARLKDEAAL 197 Query: 227 DLIEKMDPKAFTDYLNKYGNTICGRHPIGVLLQAISKLSS 266 + +D + + G T+CG P V+L A +L + Sbjct: 198 SCVLALDAEGLLKVCRQRGITMCGVIPSTVMLVAARELGA 237 >UniRef50_Q3A412 Cluster: Predicted dioxygenase; n=2; Desulfuromonadales|Rep: Predicted dioxygenase - Pelobacter carbinolicus (strain DSM 2380 / Gra Bd 1) Length = 267 Score = 45.2 bits (102), Expect = 0.002 Identities = 38/172 (22%), Positives = 72/172 (41%), Gaps = 9/172 (5%) Query: 29 LSRQLDLWLSKADLTHGPARAIIAPHXXXXXXXXXXXXXXRQVSPVVVKRIFILGPSHHV 88 L ++ +L KA +H PA ++ PH V + ++ ++GP+H Sbjct: 19 LRSMVETYLEKATQSH-PAIGLMVPHAGYVFSGAIAGQTFGCVD--IPSKVLVIGPNHTG 75 Query: 89 RIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDEQTDENEHSIEMHLPYIAKV 148 AL + + TPL ++ I + + + D+ EHS+E+ +P+ Sbjct: 76 YGESLALFAKGSWVTPLGEVPIAEGLADRVLQAHPRLMADDLAHRFEHSLEVQIPF---- 131 Query: 149 MEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYLADPQN--LLVISSDFCHW 
198 ++ I+P+ + + E+ G + LA + LLV SSD H+ Sbjct: 132 LQVRAPDVQIVPLCLAPVPYEELLALGNAIGQVLAAEKEPVLLVASSDMTHY 183 >UniRef50_Q0ABA7 Cluster: Dioxygenase-like protein; n=1; Alkalilimnicola ehrlichei MLHE-1|Rep: Dioxygenase-like protein - Alkalilimnicola ehrlichei (strain MLHE-1) Length = 225 Score = 45.2 bits (102), Expect = 0.002 Identities = 34/122 (27%), Positives = 55/122 (45%), Gaps = 6/122 (4%) Query: 70 QVSPVVVKRIFILGPSHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDE 129 Q + R+++L + H G A S ++ TPL LT+D L+ +D+ Sbjct: 64 QAAAAPPNRVYLLATTPHRTAEGPAFSGKRQFATPLGRLTLDAAGIERLQDDAG-GALDD 122 Query: 130 QTDENEHSIEMHLPYIAKVMEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYLADPQNLL 189 + EH +E LPY+ +V+ F ++P+L+ A G IL L D LL Sbjct: 123 RAHALEHRLEAPLPYLQRVL----PPFQLVPVLLPE-AGTTSAACGRILQLALEDRAGLL 177 Query: 190 VI 191 V+ Sbjct: 178 VV 179 >UniRef50_A1VAM6 Cluster: Putative uncharacterized protein; n=2; Desulfovibrio vulgaris subsp. vulgaris|Rep: Putative uncharacterized protein - Desulfovibrio vulgaris subsp. vulgaris (strain DP4) Length = 329 Score = 44.8 bits (101), Expect = 0.003 Identities = 55/227 (24%), Positives = 90/227 (39%), Gaps = 25/227 (11%) Query: 47 ARAIIAPHXXXXXXXXXXXXXXRQVSPVVVKRIFILGPSHHVRIAGCALSSLDKYQTPLY 106 A ++ PH QV V +F+LGP+H R A A+ + TPL Sbjct: 91 ALLVMLPHAGYVYSGRVAGRTLSQVRLAPV--VFMLGPNHTGRGAPLAVWPEGDWLTPLG 148 Query: 107 DLTIDKQIYAELEATRQFDRMDEQTDENEHSIEMHLPYIAKVMEEYKTSFTIIPILVGSL 166 + + ++ A L + E EHS+E+ LP +++ + +IIP+ V Sbjct: 149 SVPVHERAAAALLDKDGGYTANRTAHEGEHSLEVLLP----LLQVRHPALSIIPVAVSEQ 204 Query: 167 TPEKEAKYGAILAPYL-----ADPQNLLVISSDFCHWGSRFRYTWKDSSRGHIYQSIEWL 221 + GA LA + A + +V+SSD H+ +R E Sbjct: 205 DAGALQRAGASLARTMQELAAAGVPSSIVLSSDMSHYVTR--------------TQAEER 250 Query: 222 DKLGMDLIEKMDPKAFTDYLNKYGNTICGRHPIGVLLQAISKLSSQS 268 D L + + +DP+ + T+CG P V L A L ++S Sbjct: 251 DALALGRMAALDPEGLYATVRHNRITMCGVLPAVVALHACRALGAES 297 >UniRef50_A2SR96 Cluster: Putative uncharacterized protein; n=1; Methanocorpusculum labreanum Z|Rep: Putative uncharacterized protein - 
Methanocorpusculum labreanum (strain ATCC 43576 / DSM 4855 / Z)
          Length = 279

 Score = 44.0 bits (99), Expect = 0.005
 Identities = 42/174 (24%), Positives = 71/174 (40%), Gaps = 13/174 (7%)

Query: 28  ELSRQLDLWLSKADLTHGPARAIIAPHXXXXXXXXXXXXXXRQVSPVVVKRIFILGPSHH 87
           EL L + + + I+ PH +SP +LGPSH
Sbjct: 32  ELDALLSALFAATETSVSDPYGILVPHAGYVYSGKTAAYGYAAISPAFNGTFVLLGPSH- 90

Query: 88  VRIAGCALSSLDK-YQTPLYDLTIDKQIYAELEATRQFDRMDEQTDENEHSIEMHLPYIA 146
           AG S+ D ++TPL ++ D L A Q ++ E+S+E+ LP+I
Sbjct: 91  ---AGLETSTADMIWETPLGNVFPDSAFIEALSA--QIPVRNDLISAEENSLEVQLPFIR 145

Query: 147 KVMEEYKTSFTIIPILVGSLTPEKEAKYG-AIL-APYLADPQNLLVISSDFCHW 198
           + + I+PIL+G +P + A+L A + +++ S D H+
Sbjct: 146 YRFPKAR----IVPILMGDQSPNGAVRVAQAVLSAAETTGIRPIIIASGDGSHY 195

>UniRef50_UPI0000498B94 Cluster: conserved hypothetical protein; n=1; Entamoeba histolytica HM-1:IMSS|Rep: conserved hypothetical protein - Entamoeba histolytica HM-1:IMSS
          Length = 284

 Score = 42.7 bits (96), Expect = 0.011
 Identities = 53/235 (22%), Positives = 89/235 (37%), Gaps = 27/235 (11%)

Query: 25  SGSELSRQLDLW----LSKADLTHGPARAIIAPHXXXXXXXXXXX---XXXRQVSPVVVK 77
           +G+EL+ ++D + L+K G I+PH ++ S + K
Sbjct: 19  NGNELANEVDHYINNALNKLPSIQGKILGCISPHAGFRYSGQTAGYDFAALKRDSEINGK 78

Query: 78  R--IFILGPSHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDEQTDENE 135
           +FILG SH R A+ TP+ ID + R + + + E
Sbjct: 79  PDVVFILGFSHSSRFDCAAVMDGKAISTPIATTEIDNEAITMFCEGRNYLKCFYKPHNGE 138

Query: 136 HSIEMHLPYIAKVMEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYLADPQNLLVISSDF 195
           HS E LP++ + + K ++ +L+G+ E + L + + ++ SSD
Sbjct: 139 HSAENELPFVQRALPGVK----VVMVLIGTHKSEVLEQVSQGLQAVCSKKKMYVIASSDM 194

Query: 196 CHWGSRFRYTWKDSSRGHIYQSIEWLDKLGMDLIEKMDPKAFTDYLNKYGNTICG 250
           H D S + +E DK + L EKMD K + CG
Sbjct: 195 LH----------DES----HNLVEKTDKETIQLTEKMDIKGLLSKWSYENQIYCG 235

>UniRef50_O51324 Cluster: Putative uncharacterized protein BB0349; n=3; Borrelia burgdorferi group|Rep: Putative uncharacterized protein BB0349 - Borrelia burgdorferi (Lyme disease spirochete)
          Length = 246

 Score = 42.3 bits (95), Expect = 0.015
 Identities = 33/127 (25%), Positives = 57/127 (44%), Gaps = 6/127 (4%)

Query: 117 ELEATRQFDRMDEQTDENEHSIEMHLPYIAKVMEEYKTSFTIIPILVGSLTPEKEAKYGA 176
           +L T F +D++ EN+H IE+ L +I+ + E K IIPI+ G + K+
Sbjct: 90  KLLKTLNFINIDDKLIENDHKIEITLNFISNIKENIK----IIPIIFGKTCNKHLLKFCE 145

Query: 177 ILAPYLADPQNLLVISSDFCHWGSRFRYTWK-DSSRGHIYQSIEWLDKLGMDLIEKMDPK 235
           L P++ +N + S F + + K + + HI + L L + L K
Sbjct: 146 FLKPFINREENSFIFLSCFISKSTNIKKALKFEENLKHILLE-KKLPNLNLILENYKSKK 204

Query: 236 AFTDYLN 242
           F + +N
Sbjct: 205 IFPENIN 211

>UniRef50_O27974 Cluster: UPF0103 protein AF_2310; n=2; Euryarchaeota|Rep: UPF0103 protein AF_2310 - Archaeoglobus fulgidus
          Length = 261

 Score = 41.1 bits (92), Expect = 0.035
 Identities = 43/179 (24%), Positives = 80/179 (44%), Gaps = 21/179 (11%)

Query: 81  ILGPSHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDEQTDENEHSIEM 140
           I+GP+H A+S+ D + TPL ++ +D + + + DE EHS+E+
Sbjct: 66  IVGPNHTGYGLPVAVST-DTWLTPLGEVEVDTEFVEAMP--KIITAPDEIAHRYEHSLEV 122

Query: 141 HLPYIAKVMEEYKTSFTIIPILVGSLTPEKEAKYG-AILAPYLADPQNLLVISSDFCHWG 199
           +P++ + +++K I+PI +G E + IL + ++VI+S H
Sbjct: 123 QVPFLQYLHDDFK----IVPICLGMQDEETAMEVAEEILTAERETGRKVVVIASSDMH-- 176

Query: 200 SRFRYTWKDSSRGHIYQSIEWLDKLGMDLIEKMDPKAFTDYLNKYGNTICGRHPIGVLL 258
           Y + R LD + +D I MD K + + + + ++CG I V +
Sbjct: 177 ---HYLPDEECRR--------LDSIVIDAILSMDVKKYYETIYRLQASVCGYGCIAVAM 224

>UniRef50_Q9YB24 Cluster: UPF0103 protein APE_1771; n=1; Aeropyrum pernix|Rep: UPF0103 protein APE_1771 - Aeropyrum pernix
          Length = 281

 Score = 39.1 bits (87), Expect = 0.14
 Identities = 41/184 (22%), Positives = 77/184 (41%), Gaps = 19/184 (10%)

Query: 79  IFILGPSHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDEQTDENEHSI 138
           + +LGP+H +L ++TPL ++ +D + + D++ EHS+
Sbjct: 81  VVLLGPNHTGLGLAASLWDEGVWRTPLGEVEVDSEAGRLVVEYSGIVAPDDEGHIYEHSL 140

Query: 139 EMHLPYIAKVMEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYLAD--PQNLLVISSDFC 196
           E+ LP++ + Y F I+PI+V T + + + +LV +SD
Sbjct: 141 EVQLPFLQYL---YGGDFRIVPIVVLHQTLDISIRIARAYHRLREENGVNAVLVATSDLN 197

Query: 197 HWGSRFRYTWKDSSRGHIYQSIEWLDKLGMDLIEKMDPKAFTDYLNKYGNTICGRHPIGV 256
           H+ Y+ + D L + IE+ DP+A + + + CG PI
Sbjct: 198 HY--------------EPYEENKRKDLLLLKAIEEGDPEAVFKTIEAHAISACGPSPIAA 243

Query: 257 LLQA 260
           ++A
Sbjct: 244 AVEA 247

>UniRef50_Q56419 Cluster: UPF0103 protein TTHA0924; n=2; Thermus thermophilus|Rep: UPF0103 protein TTHA0924 - Thermus thermophilus (strain HB8 / ATCC 27634 / DSM 579)
          Length = 326

 Score = 38.7 bits (86), Expect = 0.18
 Identities = 35/127 (27%), Positives = 56/127 (44%), Gaps = 10/127 (7%)

Query: 77  KRIFILGPSHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDEQTD-ENE 135
           +RI+++G +H A + +QTP D L+A F+ + E
Sbjct: 124 ERIYLVGVAHRPLKEKAAALPVP-FQTPFGPALPDLPALQALDALLPFELFNTPLAFREE 182

Query: 136 HSIEMHLPYIAKVMEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYLADPQNLLVISSDF 195
           HS+E+ L ++ E + ++P+LV +PE G L L D LLV++ D
Sbjct: 183 HSLELPLFFLKGRFPEAR----VLPLLVARRSPE----LGEALKVVLRDFPGLLVLAVDL 234

Query: 196 CHWGSRF 202
           H G RF
Sbjct: 235 SHVGPRF 241

>UniRef50_Q5BSZ0 Cluster: SJCHGC03049 protein; n=1; Schistosoma japonicum|Rep: SJCHGC03049 protein - Schistosoma japonicum (Blood fluke)
          Length = 64

 Score = 37.5 bits (83), Expect = 0.43
 Identities = 17/36 (47%), Positives = 26/36 (72%)

Query: 188 LLVISSDFCHWGSRFRYTWKDSSRGHIYQSIEWLDK 223
           L++I + F G RF+YT+ D S+G I+QSI+ LD+
Sbjct: 26  LIMILAYFLSSGKRFQYTYYDQSKGPIWQSIQALDE 61

>UniRef50_Q1HQS5 Cluster: Syndecan binding protein; n=5; Pancrustacea|Rep: Syndecan binding protein - Aedes aegypti (Yellowfever mosquito)
          Length = 333

 Score = 36.3 bits (80), Expect = 0.98
 Identities = 26/81 (32%), Positives = 35/81 (43%), Gaps = 4/81 (4%)

Query: 105 LYDLTIDKQIYAELEATRQFDRMDEQTDENEHSIEMHLPYIAKVMEEYKTSFTIIPILVG 164
           L D+ +DK + ++ A Q + +H MH P A M Y ++P VG
Sbjct: 7   LEDMQVDKIMQSQNAA---ISNAIAQQQQQQHQFSMHDPPPAYTMNPYAQLSNLLPGAVG 63

Query: 165 SLTPEKE-AKYGAILAPYLAD 184
           S PE E AK P LAD
Sbjct: 64  STAPEPETAKKQEFFYPDLAD 84

>UniRef50_Q98GI9 Cluster: Encapsulation protein; CapA; n=1; Mesorhizobium loti|Rep: Encapsulation protein; CapA - Rhizobium loti (Mesorhizobium loti)
          Length = 561

 Score = 35.5 bits (78), Expect = 1.7
 Identities = 33/129 (25%), Positives = 52/129 (40%), Gaps = 7/129 (5%)

Query: 71  VSPVVVKRIFILGPSHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDEQ 130
           VS KRI IL P H + ++ + T L + D LEA D ++E
Sbjct: 82  VSGFRYKRIVILSPDHFHKTHKLYATTARGFDTVLGPVAADSDAVRLLEA--HGDMVEES 139

Query: 131 -TDENEHSIEMHLPYIAKVMEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYLADPQNLL 189
           + EH + LP++ E K I+P+ + + A + D L+
Sbjct: 140 CLFDKEHGVRAMLPFLHHYFPEAK----IVPVAMSVKAKRGDWDRLAEALKPIVDQDTLI 195

Query: 190 VISSDFCHW 198
           V S+DF H+
Sbjct: 196 VESTDFSHY 204

>UniRef50_Q1PVM2 Cluster: Putative uncharacterized protein; n=1; Candidatus Kuenenia stuttgartiensis|Rep: Putative uncharacterized protein - Candidatus Kuenenia stuttgartiensis
          Length = 267

 Score = 35.5 bits (78), Expect = 1.7
 Identities = 33/173 (19%), Positives = 69/173 (39%), Gaps = 9/173 (5%)

Query: 27  SELSRQLDLWLSKADLTHGPARAIIAPHXXXXXXXXXXXXXXRQVSPVVVKRIFILGPSH 86
           S L ++D ++ K D A ++PH ++ + + IL P+H
Sbjct: 17  SRLQHEIDTFIIK-DCEKQSALGAVSPHAGYMYSGSIAGSLYSHIT--IPDLVVILSPNH 73

Query: 87  HVRIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDEQTDENEHSIEMHLPYIA 146
           ++ + TP ++ ++++ EL + D++ EH+ E+ +P+I
Sbjct: 74  TGYGKPYSIWPGGSWITPFGEIAVNEEAVDELVNSCHLIERDKEAHLYEHAAEVQIPFI- 132

Query: 147 KVMEEYKTSFTIIPILVGSLTPEKEAKYGAILAPYL--ADPQNLLVISSDFCH 197
           + + I+ + + S + G ++ L P L+V SSD H
Sbjct: 133 ---QYFNQKTEIVVMTIASRKIQDLKTIGKCMSQMLQKLHPDALVVASSDMTH 182

>UniRef50_A6REB9 Cluster: Predicted protein; n=1; Ajellomyces capsulatus NAm1|Rep: Predicted protein - Ajellomyces capsulatus NAm1
          Length = 137

 Score = 35.5 bits (78), Expect = 1.7
 Identities = 23/67 (34%), Positives = 29/67 (43%), Gaps = 8/67 (11%)

Query: 29  LSRQLDLWLSKAD--------LTHGPARAIIAPHXXXXXXXXXXXXXXRQVSPVVVKRIF 80
           LS QL+ WL++ L AR IIAPH + + K IF
Sbjct: 60  LSSQLEKWLAQVPDELPGIGRLPIAGARVIIAPHAGYAYSGPCAAWAYKALDLSKAKSIF 119

Query: 81  ILGPSHH 87
           +LGPSHH
Sbjct: 120 LLGPSHH 126

>UniRef50_A3LYQ1 Cluster: Predicted protein; n=2; Saccharomycetaceae|Rep: Predicted protein - Pichia stipitis (Yeast)
          Length = 509

 Score = 35.5 bits (78), Expect = 1.7
 Identities = 24/117 (20%), Positives = 48/117 (41%), Gaps = 7/117 (5%)

Query: 101 YQTPLYDLTIDKQIYAELEATRQFDRMDEQTDENEHSIEMHLPYIAKVMEEYKTSFTIIP 160
           + P Y + K+ Y L A D D ++++ PY+ +++E FT++P
Sbjct: 307 FSQPFYGAALQKKAYDALLAGSNGDICQAWDD----AVDVKCPYVIALVQESLRYFTVLP 362

Query: 161 ILVGSLTPEKEAKYGAILAPYLADPQNLLVISSDFCHWGSRFRY---TWKDSSRGHI 214
           + + +T + GA + N + D C + S + + W D+ G +
Sbjct: 363 LGLPRITTKDIVYNGAFIPKETILIMNAFAANHDSCVFQSPYEFIPERWLDAETGEL 419

>UniRef50_Q6C3Q3 Cluster: Yarrowia lipolytica chromosome E of strain CLIB 122 of Yarrowia lipolytica; n=1; Yarrowia lipolytica|Rep: Yarrowia lipolytica chromosome E of strain CLIB 122 of Yarrowia lipolytica - Yarrowia lipolytica (Candida lipolytica)
          Length = 235

 Score = 34.7 bits (76), Expect = 3.0
 Identities = 23/75 (30%), Positives = 34/75 (45%), Gaps = 5/75 (6%)

Query: 159 IPILVGSLTPEKEAKYGAILAPYLADPQNLLVISSDFCHWGSRFRYTWKDSSRGHIYQSI 218
           + + + L +E +YG L DP L S +F W Y SSRGH Y +
Sbjct: 145 LDVRMARLRNREEQRYGDTLR---TDPAMGLK-SENFLSWAESLSYPHCTSSRGHDYH-L 199

Query: 219 EWLDKLGMDLIEKMD 233
           WLD G+ +++ D
Sbjct: 200 RWLDNCGVPVLKLGD 214

>UniRef50_Q7SXQ0 Cluster: Zgc:66133; n=3; Danio rerio|Rep: Zgc:66133 - Danio rerio (Zebrafish) (Brachydanio rerio)
          Length = 184

 Score = 33.1 bits (72), Expect = 9.2
 Identities = 18/70 (25%), Positives = 35/70 (50%)

Query: 85  SHHVRIAGCALSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDEQTDENEHSIEMHLPY 144
           SH +R++ S Y+ + D + ++ +++A Q R +E+T EN+ E + +
Sbjct: 112 SHKLRLSNVKPSDEGTYECRVIDFSENRVQRHQVQAYLQIQRTEEETSENQQKKEEQILH 171

Query: 145 IAKVMEEYKT 154
           + EE KT
Sbjct: 172 HHHLYEENKT 181

>UniRef50_Q6MQA3 Cluster: Iron-regulated protein A precursor; n=1; Bdellovibrio bacteriovorus|Rep: Iron-regulated protein A precursor - Bdellovibrio bacteriovorus
          Length = 361

 Score = 33.1 bits (72), Expect = 9.2
 Identities = 18/64 (28%), Positives = 32/64 (50%)

Query: 221 LDKLGMDLIEKMDPKAFTDYLNKYGNTICGRHPIGVLLQAISKLSSQSNAPKMSLKFLKY 280
           L+KL +D + + K D + G + G H + LL K+S+ A ++ K L+Y
Sbjct: 104 LNKLDLDSVMSSNRKITVDLVRALGTNLQGFHTLEYLLFGDGKVSNTKPAASLTAKQLEY 163

Query: 281 AQSS 284
           ++S
Sbjct: 164 LKAS 167

>UniRef50_A1SQY9 Cluster: Pentapeptide repeat protein; n=1; Psychromonas ingrahamii 37|Rep: Pentapeptide repeat protein - Psychromonas ingrahamii (strain 37)
          Length = 976

 Score = 33.1 bits (72), Expect = 9.2
 Identities = 23/85 (27%), Positives = 40/85 (47%), Gaps = 1/85 (1%)

Query: 95  LSSLDKYQTPLYDLTIDKQIYAELEATRQFDRMDEQTDENEHSIEMHLPYIAKVMEEYKT 154
           L ++D+ Q + + ++ A L+ +Q D + +Q + E P I K +EE
Sbjct: 489 LKAMDEVQAHMEAMAEKQKKEALLKVEQQLDELKQQAAQQPEMAEQLDPSI-KQLEEMLA 547

Query: 155 SFTIIPILVGSLTPEKEAKYGAILA 179
           S IP+L T E++ + A LA
Sbjct: 548 SIDAIPVLTRPDTVEQDTQLSAQLA 572

  Database: uniref50
    Posted date: Oct 5, 2007 11:19 AM
  Number of letters in database: 575,637,011
  Number of sequences in database: 1,657,284

Lambda     K      H
   0.321    0.137    0.412

Gapped
Lambda     K      H
   0.279   0.0580    0.190

Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 304,621,049
Number of Sequences: 1657284
Number of extensions: 11515953
Number of successful extensions: 26221
Number of sequences better than 10.0: 104
Number of HSP's better than 10.0 without gapping: 69
Number of HSP's successfully gapped in prelim test: 35
Number of HSP's that attempted gapping in prelim test: 26009
Number of HSP's gapped (non-prelim): 129
length of query: 304
length of database: 575,637,011
effective HSP length: 100
effective length of query: 204
effective length of database: 409,908,611
effective search space: 83621356644
effective search space used: 83621356644
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.9 bits)
S2: 72 (33.1 bits)