BLASTP 2.2.12 [Aug-07-2005]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= BGIBMGA000086-TA|BGIBMGA000086-PA|IPR002589|Appr-1-p processing
         (244 letters)

Database: uniref50
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

UniRef50_Q1HPZ5 Cluster: LRP16 protein; n=1; Bombyx mori|Rep: LR...   514   e-144
UniRef50_A1Z1Q3 Cluster: MACRO domain-containing protein 2; n=41...   231   2e-59
UniRef50_Q6PHJ5 Cluster: Zgc:65960; n=5; cellular organisms|Rep:...   225   8e-58
UniRef50_Q66HV6 Cluster: Zgc:92353; n=1; Danio rerio|Rep: Zgc:92...   218   9e-56
UniRef50_A7RJ44 Cluster: Predicted protein; n=3; Eukaryota|Rep: ...   216   5e-55
UniRef50_Q5DCZ3 Cluster: SJCHGC06209 protein; n=1; Schistosoma j...   206   3e-52
UniRef50_Q9BQ69 Cluster: MACRO domain-containing protein 1; n=18...   204   1e-51
UniRef50_UPI000023F24A Cluster: hypothetical protein FG04179.1; ...   186   6e-46
UniRef50_A6S485 Cluster: Putative uncharacterized protein; n=1; ...   182   7e-45
UniRef50_A4R3Q9 Cluster: Putative uncharacterized protein; n=1; ...   182   1e-44
UniRef50_Q8TQD0 Cluster: UPF0189 protein MA_1614; n=4; cellular ...   181   1e-44
UniRef50_A1IFK2 Cluster: Putative uncharacterized protein; n=1; ...   175   6e-43
UniRef50_Q0CQJ0 Cluster: Protein LRP16; n=5; cellular organisms|...   175   8e-43
UniRef50_Q0UQZ6 Cluster: Predicted protein; n=1; Phaeosphaeria n...   174   2e-42
UniRef50_Q5KCD7 Cluster: Putative uncharacterized protein; n=2; ...   172   8e-42
UniRef50_Q8KAE4 Cluster: UPF0189 protein CT2219; n=24; cellular ...   171   1e-41
UniRef50_A2FMC7 Cluster: Appr-1-p processing enzyme family prote...   168   1e-40
UniRef50_Q2GZS3 Cluster: Putative uncharacterized protein; n=1; ...
167 3e-40 UniRef50_Q4WYQ2 Cluster: LRP16 family protein; n=8; cellular org... 165 7e-40 UniRef50_Q17432 Cluster: Putative uncharacterized protein; n=2; ... 165 1e-39 UniRef50_Q0LI88 Cluster: Appr-1-p processing; n=2; cellular orga... 162 8e-39 UniRef50_UPI000049917F Cluster: conserved hypothetical protein; ... 160 3e-38 UniRef50_A5WHZ6 Cluster: Appr-1-p processing domain protein; n=2... 160 3e-38 UniRef50_Q985D2 Cluster: UPF0189 protein mll7730; n=54; cellular... 160 3e-38 UniRef50_Q1K4D1 Cluster: Appr-1-p processing; n=1; Desulfuromona... 157 2e-37 UniRef50_A2DTG7 Cluster: Appr-1-p processing enzyme family prote... 157 2e-37 UniRef50_Q01WP7 Cluster: Appr-1-p processing domain protein; n=1... 155 1e-36 UniRef50_A5TRW5 Cluster: Putative uncharacterized protein; n=1; ... 153 4e-36 UniRef50_Q4P1I0 Cluster: Putative uncharacterized protein; n=1; ... 153 5e-36 UniRef50_Q9HXU7 Cluster: UPF0189 protein PA3693; n=13; Bacteria|... 151 1e-35 UniRef50_A6NXN8 Cluster: Putative uncharacterized protein; n=1; ... 151 2e-35 UniRef50_Q8B4N1 Cluster: ORF-1; n=8; root|Rep: ORF-1 - Rock brea... 150 3e-35 UniRef50_Q8RB30 Cluster: UPF0189 protein TTE0995; n=20; Bacteria... 149 8e-35 UniRef50_A6BCW6 Cluster: Putative uncharacterized protein; n=2; ... 148 1e-34 UniRef50_A5V0Y4 Cluster: Appr-1-p processing domain protein; n=5... 147 3e-34 UniRef50_Q8EYT0 Cluster: UPF0189 protein LA_4133; n=11; cellular... 146 3e-34 UniRef50_A0LGZ1 Cluster: Appr-1-p processing domain protein; n=1... 146 4e-34 UniRef50_Q926Y8 Cluster: UPF0189 protein lin2902; n=14; Firmicut... 146 6e-34 UniRef50_Q4DSL4 Cluster: Putative uncharacterized protein; n=3; ... 145 1e-33 UniRef50_Q88SK6 Cluster: UPF0189 protein lp_3408; n=13; cellular... 145 1e-33 UniRef50_Q94JV1 Cluster: At1g69340/F10D13.28; n=9; Magnoliophyta... 144 1e-33 UniRef50_P67341 Cluster: UPF0189 protein ymdB; n=11; Bacteria|Re... 140 3e-32 UniRef50_Q8PHB6 Cluster: UPF0189 protein XAC3343; n=9; Proteobac... 
137 2e-31 UniRef50_UPI0000498CB9 Cluster: conserved hypothetical protein; ... 135 1e-30 UniRef50_A0H6G6 Cluster: Appr-1-p processing; n=1; Chloroflexus ... 133 4e-30 UniRef50_O22875 Cluster: Expressed protein; n=7; Magnoliophyta|R... 133 4e-30 UniRef50_UPI0000498318 Cluster: conserved hypothetical protein; ... 132 6e-30 UniRef50_A7T7L3 Cluster: Predicted protein; n=1; Nematostella ve... 132 6e-30 UniRef50_Q0B030 Cluster: Phosphatase; n=1; Syntrophomonas wolfei... 132 8e-30 UniRef50_A3ZLZ3 Cluster: Putative uncharacterized protein; n=2; ... 132 8e-30 UniRef50_A7T167 Cluster: Predicted protein; n=1; Nematostella ve... 132 8e-30 UniRef50_Q1R0S7 Cluster: Appr-1-p processing; n=1; Chromohalobac... 132 1e-29 UniRef50_Q9NXN4 Cluster: Ganglioside-induced differentiation-ass... 132 1e-29 UniRef50_Q0UG78 Cluster: Putative uncharacterized protein; n=1; ... 130 2e-29 UniRef50_UPI0000E4815A Cluster: PREDICTED: similar to LRP16 prot... 130 3e-29 UniRef50_A5ZAB5 Cluster: Putative uncharacterized protein; n=1; ... 130 4e-29 UniRef50_Q6AKL0 Cluster: Putative uncharacterized protein; n=1; ... 129 5e-29 UniRef50_Q5R014 Cluster: Predicted phosphatase; n=6; Bacteria|Re... 128 9e-29 UniRef50_Q59Z77 Cluster: Putative uncharacterized protein; n=2; ... 128 9e-29 UniRef50_Q0CEI7 Cluster: Putative uncharacterized protein; n=1; ... 128 2e-28 UniRef50_A7B8S3 Cluster: Putative uncharacterized protein; n=1; ... 127 2e-28 UniRef50_Q6ZED8 Cluster: Slr7060 protein; n=1; Synechocystis sp.... 126 5e-28 UniRef50_A1G783 Cluster: Appr-1-p processing; n=1; Salinispora a... 126 5e-28 UniRef50_Q8EP31 Cluster: Hypothetical conserved protein; n=1; Oc... 126 7e-28 UniRef50_Q6AAQ5 Cluster: Conserved protein; n=2; Bacteria|Rep: C... 126 7e-28 UniRef50_A6PEZ6 Cluster: Appr-1-p processing domain protein; n=1... 125 9e-28 UniRef50_Q93SX7 Cluster: UPF0189 protein; n=1; Acinetobacter sp.... 125 9e-28 UniRef50_Q87JZ5 Cluster: UPF0189 protein VPA0103; n=5; cellular ... 
124 2e-27 UniRef50_Q9HJ67 Cluster: UPF0189 protein Ta1105; n=2; Thermoplas... 124 2e-27 UniRef50_Q47EQ7 Cluster: Appr-1-p processing; n=1; Dechloromonas... 124 3e-27 UniRef50_A7BY23 Cluster: Putative uncharacterized protein; n=1; ... 124 3e-27 UniRef50_Q9WYX8 Cluster: UPF0189 protein TM_0508; n=4; Thermotog... 122 6e-27 UniRef50_Q97AU0 Cluster: UPF0189 protein TV0719; n=1; Thermoplas... 122 8e-27 UniRef50_A6F1P7 Cluster: Appr-1-p processing; n=1; Marinobacter ... 121 1e-26 UniRef50_P67344 Cluster: UPF0189 protein SA0314; n=13; Staphyloc... 121 1e-26 UniRef50_Q03IQ8 Cluster: Predicted phosphatase homologous to the... 121 2e-26 UniRef50_Q2TX23 Cluster: Predicted phosphatase homologous to the... 121 2e-26 UniRef50_Q18A61 Cluster: Putative uncharacterized protein; n=2; ... 120 3e-26 UniRef50_Q30ZH6 Cluster: Appr-1-p processing; n=1; Desulfovibrio... 120 4e-26 UniRef50_Q9ZBG3 Cluster: UPF0189 protein SCO6450; n=4; Actinomyc... 119 6e-26 UniRef50_A0J8J0 Cluster: Appr-1-p processing; n=1; Shewanella wo... 118 1e-25 UniRef50_A6PBP5 Cluster: Appr-1-p processing domain protein; n=1... 117 3e-25 UniRef50_A6GJ81 Cluster: Putative uncharacterized protein; n=1; ... 115 9e-25 UniRef50_Q93RG0 Cluster: UPF0189 protein in tap1-dppD intergenic... 115 9e-25 UniRef50_A2DE53 Cluster: Appr-1-p processing enzyme family prote... 115 1e-24 UniRef50_UPI0000519D2E Cluster: PREDICTED: similar to CG18812-PC... 114 2e-24 UniRef50_A0UYE8 Cluster: Appr-1-p processing; n=3; Bacteria|Rep:... 114 2e-24 UniRef50_A0X2G8 Cluster: Appr-1-p processing domain protein; n=1... 112 9e-24 UniRef50_Q7JUR6 Cluster: GH03014p; n=11; Endopterygota|Rep: GH03... 111 2e-23 UniRef50_Q22CT8 Cluster: Appr-1-p processing enzyme family prote... 109 8e-23 UniRef50_A1HMQ5 Cluster: Appr-1-p processing domain protein; n=4... 107 3e-22 UniRef50_A1D5K4 Cluster: Appr-1-p processing enzyme family prote... 107 3e-22 UniRef50_A3LYE6 Cluster: Putative uncharacterized protein; n=1; ... 
105 8e-22 UniRef50_UPI0000ECB76F Cluster: Poly [ADP-ribose] polymerase 14 ... 104 2e-21 UniRef50_Q5XC09 Cluster: UPF0189 protein M6_Spy0919; n=19; Strep... 104 2e-21 UniRef50_Q8ZXT3 Cluster: UPF0189 protein PAE1111; n=8; Thermopro... 104 2e-21 UniRef50_A5D049 Cluster: Predicted phosphatase; n=3; Bacteria|Re... 103 3e-21 UniRef50_UPI00006A2284 Cluster: UPI00006A2284 related cluster; n... 101 2e-20 UniRef50_A7HJC7 Cluster: Appr-1-p processing domain protein; n=1... 101 2e-20 UniRef50_A1RWM4 Cluster: Appr-1-p processing domain protein; n=2... 101 2e-20 UniRef50_A0CX10 Cluster: Chromosome undetermined scaffold_3, who... 99 5e-20 UniRef50_Q4T065 Cluster: Chromosome undetermined SCAF11328, whol... 99 1e-19 UniRef50_Q2SM57 Cluster: Predicted phosphatase; n=1; Hahella che... 98 2e-19 UniRef50_UPI0000E80997 Cluster: PREDICTED: similar to Poly [ADP-... 97 5e-19 UniRef50_A6SR30 Cluster: Putative uncharacterized protein; n=1; ... 96 6e-19 UniRef50_UPI0000660739 Cluster: ganglioside induced differentiat... 96 8e-19 UniRef50_A7EET2 Cluster: Putative uncharacterized protein; n=1; ... 95 1e-18 UniRef50_UPI0000F2CC13 Cluster: PREDICTED: similar to B aggressi... 95 1e-18 UniRef50_A2QSI2 Cluster: Contig An08c0280, complete genome; n=1;... 95 2e-18 UniRef50_Q6NRC6 Cluster: MGC83934 protein; n=2; Xenopus|Rep: MGC... 94 3e-18 UniRef50_Q54PT1 Cluster: Putative uncharacterized protein; n=1; ... 93 4e-18 UniRef50_UPI0000F3214F Cluster: UPI0000F3214F related cluster; n... 91 2e-17 UniRef50_A6LTB5 Cluster: Appr-1-p processing domain protein; n=1... 90 4e-17 UniRef50_UPI0000E8099B Cluster: PREDICTED: similar to PARP9 prot... 90 5e-17 UniRef50_UPI00006A1CA6 Cluster: poly (ADP-ribose) polymerase fam... 89 9e-17 UniRef50_Q460N5 Cluster: Poly [ADP-ribose] polymerase 14; n=23; ... 89 9e-17 UniRef50_Q4SK43 Cluster: Chromosome 2 SCAF14570, whole genome sh... 89 1e-16 UniRef50_Q10RP7 Cluster: Appr-1-p processing enzyme family prote... 
87 4e-16 UniRef50_A1L291 Cluster: LOC799852 protein; n=4; Danio rerio|Rep... 87 5e-16 UniRef50_A7S3X0 Cluster: Predicted protein; n=1; Nematostella ve... 86 7e-16 UniRef50_UPI000023E9A3 Cluster: hypothetical protein FG04612.1; ... 86 9e-16 UniRef50_O07733 Cluster: UPF0189 protein Rv1899c/MT1950; n=9; My... 85 2e-15 UniRef50_UPI0000660C67 Cluster: Homolog of Oncorhynchus mykiss "... 82 1e-14 UniRef50_Q55AK6 Cluster: U box domain-containing protein; n=3; E... 82 1e-14 UniRef50_Q8IXQ6 Cluster: Poly [ADP-ribose] polymerase 9; n=26; E... 81 2e-14 UniRef50_A7C4X9 Cluster: Putative uncharacterized protein; n=1; ... 79 8e-14 UniRef50_UPI0000F2CC14 Cluster: PREDICTED: similar to Poly [ADP-... 79 1e-13 UniRef50_Q5V4P3 Cluster: Putative uncharacterized protein; n=2; ... 79 1e-13 UniRef50_UPI00015A60CA Cluster: UPI00015A60CA related cluster; n... 78 2e-13 UniRef50_Q7QZY2 Cluster: GLP_23_42584_43678; n=1; Giardia lambli... 77 3e-13 UniRef50_O75367 Cluster: Core histone macro-H2A.1; n=179; Eukary... 77 4e-13 UniRef50_A1R2V6 Cluster: Putative uncharacterized protein; n=2; ... 75 2e-12 UniRef50_A0CX06 Cluster: Chromosome undetermined scaffold_3, who... 75 2e-12 UniRef50_Q9YBE9 Cluster: UPF0189 protein APE_1648.1; n=1; Aeropy... 75 2e-12 UniRef50_UPI0000F1EDA9 Cluster: PREDICTED: similar to Poly [ADP-... 75 2e-12 UniRef50_UPI000065ED3A Cluster: Homolog of Oncorhynchus mykiss "... 74 3e-12 UniRef50_Q4RG95 Cluster: Chromosome 12 SCAF15104, whole genome s... 72 1e-11 UniRef50_O67112 Cluster: UPF0189 protein aq_987; n=3; cellular o... 72 2e-11 UniRef50_Q5KUT6 Cluster: Hypothetical conserved protein; n=2; Ge... 71 2e-11 UniRef50_A2BJA7 Cluster: A1pp, Appr-1-p processing enzyme; n=1; ... 70 5e-11 UniRef50_A3DLM0 Cluster: Appr-1-p processing domain protein; n=1... 69 8e-11 UniRef50_UPI0001556316 Cluster: PREDICTED: similar to LRP16 prot... 69 1e-10 UniRef50_Q1YRE7 Cluster: Putative uncharacterized protein; n=1; ... 
69 1e-10 UniRef50_Q99IE7 Cluster: Non-structural polyprotein p200 (p200) ... 66 6e-10 UniRef50_UPI00004D69C1 Cluster: poly (ADP-ribose) polymerase fam... 66 1e-09 UniRef50_A3EXC9 Cluster: Replicase polyprotein 1ab (pp1ab) (ORF1... 65 1e-09 UniRef50_O28751 Cluster: UPF0189 protein AF_1521; n=25; Euryarch... 64 2e-09 UniRef50_Q9P0M6 Cluster: Core histone macro-H2A.2; n=74; Eukaryo... 64 4e-09 UniRef50_UPI00005A5611 Cluster: PREDICTED: similar to poly (ADP-... 63 5e-09 UniRef50_UPI0000ECC933 Cluster: C20orf133 protein.; n=3; Gallus ... 63 5e-09 UniRef50_Q4RPB9 Cluster: Chromosome 1 SCAF15008, whole genome sh... 62 9e-09 UniRef50_Q460N3 Cluster: Poly [ADP-ribose] polymerase 15; n=9; E... 62 9e-09 UniRef50_Q00XU1 Cluster: Hismacro and SEC14 domain-containing pr... 60 4e-08 UniRef50_Q4SK44 Cluster: Chromosome 2 SCAF14570, whole genome sh... 60 5e-08 UniRef50_UPI0000660C1F Cluster: Homolog of Gallus gallus "Histon... 58 2e-07 UniRef50_Q5M915 Cluster: D930010j01rik-prov protein; n=3; Xenopu... 58 2e-07 UniRef50_UPI000065F87F Cluster: Homolog of Gallus gallus "Histon... 57 4e-07 UniRef50_Q9WJC8 Cluster: Nonstructural polyprotein; n=12; Venezu... 57 5e-07 UniRef50_UPI0001555B8B Cluster: PREDICTED: similar to Poly [ADP-... 56 6e-07 UniRef50_A7BVQ6 Cluster: Appr-1-p processing enzyme family; n=1;... 56 6e-07 UniRef50_Q7REF6 Cluster: ATPase associated with chromosome archi... 56 8e-07 UniRef50_A3BF04 Cluster: Putative uncharacterized protein; n=1; ... 55 1e-06 UniRef50_Q0Q476 Cluster: Replicase polyprotein 1ab (pp1ab) (ORF1... 55 2e-06 UniRef50_A7AWQ8 Cluster: Putative uncharacterized protein; n=1; ... 54 2e-06 UniRef50_P18458 Cluster: Replicase polyprotein 1ab (pp1ab) (ORF1... 54 2e-06 UniRef50_Q6NIW9 Cluster: Putative uncharacterized protein; n=1; ... 54 3e-06 UniRef50_P87515 Cluster: Non-structural polyprotein (Polyprotein... 54 3e-06 UniRef50_UPI0000E1FED6 Cluster: PREDICTED: hypothetical protein ... 
53 6e-06 UniRef50_Q08X95 Cluster: Appr-1-p processing enzyme family prote... 53 6e-06 UniRef50_UPI0000EB30ED Cluster: UPI0000EB30ED related cluster; n... 52 1e-05 UniRef50_UPI0000F2EBB4 Cluster: PREDICTED: similar to LRP16 prot... 49 9e-05 UniRef50_Q4RPB7 Cluster: Chromosome 1 SCAF15008, whole genome sh... 49 1e-04 UniRef50_UPI000155BDA5 Cluster: PREDICTED: similar to LRP16 prot... 47 4e-04 UniRef50_A3EXG5 Cluster: Replicase polyprotein 1ab (pp1ab) (ORF1... 47 4e-04 UniRef50_Q8IBS9 Cluster: Putative uncharacterized protein MAL7P1... 46 7e-04 UniRef50_Q4YCG7 Cluster: Putative uncharacterized protein; n=3; ... 46 9e-04 UniRef50_Q4T4T2 Cluster: Chromosome undetermined SCAF9554, whole... 44 0.003 UniRef50_Q6QLN1 Cluster: Non-structural polyprotein; n=40; root|... 44 0.003 UniRef50_Q2V9U1 Cluster: Nonstructural protein 3; n=38; Eastern ... 44 0.005 UniRef50_A5KAG2 Cluster: Putative uncharacterized protein; n=1; ... 44 0.005 UniRef50_UPI0000F1E4D0 Cluster: PREDICTED: similar to collaborat... 43 0.006 UniRef50_A7QKZ8 Cluster: Chromosome chr8 scaffold_115, whole gen... 43 0.006 UniRef50_A4S5T1 Cluster: Predicted protein; n=1; Ostreococcus lu... 43 0.006 UniRef50_A6RX72 Cluster: Predicted protein; n=1; Botryotinia fuc... 43 0.008 UniRef50_P13886 Cluster: Non-structural polyprotein (Polyprotein... 43 0.008 UniRef50_Q10MW4 Cluster: Basic helix-loop-helix, putative, expre... 42 0.011 UniRef50_Q24DG1 Cluster: Putative uncharacterized protein; n=2; ... 42 0.014 UniRef50_Q7RF86 Cluster: GYF domain, putative; n=6; Plasmodium (... 42 0.019 UniRef50_P13887 Cluster: Non-structural polyprotein (Polyprotein... 40 0.043 UniRef50_A7BRB1 Cluster: Protein containing Appr-1-p processing ... 40 0.075 UniRef50_Q0Q467 Cluster: Replicase polyprotein 1ab (pp1ab) (ORF1... 40 0.075 UniRef50_Q22U36 Cluster: Cyclic nucleotide-binding domain contai... 39 0.099 UniRef50_Q6ZKH7 Cluster: Putative uncharacterized protein OJ1119... 
39 0.13 UniRef50_Q8ZN14 Cluster: Gifsy-1 prophage protein; n=4; Bacteria... 38 0.30 UniRef50_Q8JJX1 Cluster: Non-structural polyprotein (Polyprotein... 38 0.30 UniRef50_Q69HN2 Cluster: Putative uncharacterized protein; n=1; ... 37 0.40 UniRef50_A2E8H6 Cluster: Viral A-type inclusion protein, putativ... 37 0.40 UniRef50_Q3BBL7 Cluster: Putative uncharacterized protein; n=14;... 37 0.40 UniRef50_A6DE82 Cluster: Exonuclease SbcC; n=1; Caminibacter med... 37 0.53 UniRef50_A4GSN8 Cluster: Nuclear-pore anchor; n=7; Arabidopsis t... 37 0.53 UniRef50_Q54DH8 Cluster: Putative uncharacterized protein TAF1; ... 37 0.53 UniRef50_A0DTL5 Cluster: Chromosome undetermined scaffold_63, wh... 37 0.53 UniRef50_Q6FSG9 Cluster: Candida glabrata strain CBS138 chromoso... 37 0.53 UniRef50_UPI00004993C7 Cluster: hypothetical protein 3.t00030; n... 36 0.70 UniRef50_Q6MRT6 Cluster: Putative uncharacterized protein; n=1; ... 36 0.70 UniRef50_Q8I4Z1 Cluster: Putative uncharacterized protein; n=2; ... 36 0.70 UniRef50_A0CHZ3 Cluster: Chromosome undetermined scaffold_186, w... 36 0.70 UniRef50_Q6CT35 Cluster: Similar to sgd|S0006295 Saccharomyces c... 36 0.70 UniRef50_UPI000065F7D8 Cluster: Homolog of Homo sapiens "Splice ... 36 0.93 UniRef50_A7DT33 Cluster: Putative uncharacterized protein; n=3; ... 36 0.93 UniRef50_A2EMN0 Cluster: Putative uncharacterized protein; n=1; ... 36 0.93 UniRef50_UPI00006CE511 Cluster: hypothetical protein TTHERM_0014... 36 1.2 UniRef50_UPI000049880F Cluster: hypothetical protein 63.t00025; ... 36 1.2 UniRef50_Q0WYB5 Cluster: Nonstructural protein; n=141; Hepatitis... 36 1.2 UniRef50_Q1UZP6 Cluster: Putative uncharacterized protein; n=1; ... 36 1.2 UniRef50_Q9U0D4 Cluster: Sequestrin; n=2; Plasmodium falciparum|... 36 1.2 UniRef50_Q54KL2 Cluster: Putative uncharacterized protein; n=1; ... 36 1.2 UniRef50_Q24GP7 Cluster: Putative uncharacterized protein; n=2; ... 36 1.2 UniRef50_UPI00006CAB22 Cluster: hypothetical protein TTHERM_0078... 
35 1.6 UniRef50_Q6A5L0 Cluster: Anaerobic glycerol-3-phosphate dehydrog... 35 1.6 UniRef50_Q1FGW8 Cluster: Peptidase M23B precursor; n=1; Clostrid... 35 1.6 UniRef50_Q0PBQ1 Cluster: Putative uncharacterized protein; n=12;... 35 1.6 UniRef50_A3S6V5 Cluster: Putative uncharacterized protein; n=1; ... 35 1.6 UniRef50_Q331Z6 Cluster: Conserved hypothetical phage-related pr... 35 1.6 UniRef50_A4VE14 Cluster: Putative uncharacterized protein; n=1; ... 35 1.6 UniRef50_Q6LQJ9 Cluster: UPF0234 protein PBPRA2024; n=15; Proteo... 35 1.6 UniRef50_Q4SQ87 Cluster: Chromosome 4 SCAF14533, whole genome sh... 35 2.1 UniRef50_Q22DL4 Cluster: Putative uncharacterized protein; n=1; ... 35 2.1 UniRef50_Q22751 Cluster: Putative uncharacterized protein dnj-23... 35 2.1 UniRef50_Q4A7Z9 Cluster: ABC transporter permease protein; n=5; ... 34 2.8 UniRef50_A7S5A3 Cluster: Predicted protein; n=1; Nematostella ve... 34 2.8 UniRef50_A7AQ69 Cluster: Isy1-like splicing family protein; n=1;... 34 2.8 UniRef50_A0D3I1 Cluster: Chromosome undetermined scaffold_36, wh... 34 2.8 UniRef50_A0BUU6 Cluster: Chromosome undetermined scaffold_13, wh... 34 2.8 UniRef50_UPI0000ED8E89 Cluster: hypothetical protein CdifQ_04003... 34 3.7 UniRef50_UPI00006CD9EF Cluster: hypothetical protein TTHERM_0039... 34 3.7 UniRef50_Q897A5 Cluster: Conserved protein; n=1; Clostridium tet... 34 3.7 UniRef50_Q31C98 Cluster: Putative uncharacterized protein precur... 34 3.7 UniRef50_Q8LB56 Cluster: Nuclear RNA binding protein A-like prot... 34 3.7 UniRef50_Q8ILK6 Cluster: Putative uncharacterized protein; n=2; ... 34 3.7 UniRef50_Q4XYB9 Cluster: Putative uncharacterized protein; n=4; ... 34 3.7 UniRef50_A2EMF2 Cluster: Putative uncharacterized protein; n=1; ... 34 3.7 UniRef50_A2DDP1 Cluster: Viral A-type inclusion protein, putativ... 34 3.7 UniRef50_A0MV34 Cluster: Ventral nervous system defective 2; n=1... 34 3.7 UniRef50_A0C1X3 Cluster: Chromosome undetermined scaffold_143, w... 
34 3.7 UniRef50_UPI0000F2C318 Cluster: PREDICTED: similar to RIKEN cDNA... 33 4.9 UniRef50_A1L230 Cluster: Zgc:158614; n=2; Danio rerio|Rep: Zgc:1... 33 4.9 UniRef50_Q982Q7 Cluster: Mlr8538 protein; n=2; Mesorhizobium lot... 33 4.9 UniRef50_Q892P8 Cluster: Lipoate-protein ligase A; n=2; Clostrid... 33 4.9 UniRef50_Q2GBI0 Cluster: TonB-dependent receptor precursor; n=1;... 33 4.9 UniRef50_Q4HP54 Cluster: Putative uncharacterized protein; n=1; ... 33 4.9 UniRef50_A6LNV9 Cluster: S-layer domain protein; n=1; Thermosiph... 33 4.9 UniRef50_Q8IE35 Cluster: Putative uncharacterized protein PF13_0... 33 4.9 UniRef50_Q7RDH4 Cluster: Reticulocyte-binding protein 2 homolog ... 33 4.9 UniRef50_Q4RQ13 Cluster: Chromosome 17 SCAF15006, whole genome s... 33 6.5 UniRef50_Q9R8E0 Cluster: SapC; n=3; Campylobacter fetus|Rep: Sap... 33 6.5 UniRef50_Q4CAG5 Cluster: Forkhead-associated; n=1; Crocosphaera ... 33 6.5 UniRef50_A5TRS8 Cluster: Putative uncharacterized protein; n=1; ... 33 6.5 UniRef50_A3J217 Cluster: Putative uncharacterized protein; n=1; ... 33 6.5 UniRef50_Q5CE64 Cluster: Putative uncharacterized protein; n=2; ... 33 6.5 UniRef50_O01923 Cluster: Putative uncharacterized protein R155.3... 33 6.5 UniRef50_A5JZD2 Cluster: Putative uncharacterized protein; n=4; ... 33 6.5 UniRef50_A4KBK6 Cluster: Cathepsin L-like cysteine protease; n=3... 33 6.5 UniRef50_A2EYA1 Cluster: Viral A-type inclusion protein, putativ... 33 6.5 UniRef50_A0BPG7 Cluster: Chromosome undetermined scaffold_12, wh... 33 6.5 UniRef50_Q5ATT0 Cluster: Putative uncharacterized protein; n=1; ... 33 6.5 UniRef50_A6URX9 Cluster: Putative uncharacterized protein; n=1; ... 33 6.5 UniRef50_UPI0000EFB3C7 Cluster: hypothetical protein An07g06160;... 33 8.6 UniRef50_UPI0000D57675 Cluster: PREDICTED: similar to CG6649-PA;... 33 8.6 UniRef50_Q677U1 Cluster: Putative lipopolysaccharide-modifying e... 33 8.6 UniRef50_Q008X6 Cluster: Replicase polyprotein 1ab; n=2; White b... 
33 8.6 UniRef50_Q84IM8 Cluster: Hyaluronidase; n=1; Clostridium septicu... 33 8.6 UniRef50_A6KYZ4 Cluster: Putative uncharacterized protein; n=2; ... 33 8.6 UniRef50_A0HCW3 Cluster: Putative uncharacterized protein precur... 33 8.6 UniRef50_Q2PES5 Cluster: Putative uncharacterized protein; n=1; ... 33 8.6 UniRef50_Q8IL70 Cluster: Putative uncharacterized protein; n=1; ... 33 8.6 UniRef50_Q7RM41 Cluster: FtsJ cell division protein, putative; n... 33 8.6 UniRef50_Q7RGM2 Cluster: Putative uncharacterized protein PY0432... 33 8.6 UniRef50_A2DBJ5 Cluster: Putative uncharacterized protein; n=1; ... 33 8.6 UniRef50_Q59SM3 Cluster: Putative uncharacterized protein ORC5; ... 33 8.6 UniRef50_Q00799 Cluster: Reticulocyte-binding protein 2 precurso... 33 8.6 UniRef50_P15917 Cluster: Lethal factor precursor; n=3; Bacillus ... 33 8.6 >UniRef50_Q1HPZ5 Cluster: LRP16 protein; n=1; Bombyx mori|Rep: LRP16 protein - Bombyx mori (Silk moth) Length = 275 Score = 514 bits (1267), Expect = e-144 Identities = 244/244 (100%), Positives = 244/244 (100%) Query: 1 MVNSTKWEIEKNRILKLSLEEKRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDL 60 MVNSTKWEIEKNRILKLSLEEKRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDL Sbjct: 32 MVNSTKWEIEKNRILKLSLEEKRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDL 91 Query: 61 KEFEKIKINTEKNKSISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPF 120 KEFEKIKINTEKNKSISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPF Sbjct: 92 KEFEKIKINTEKNKSISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPF 151 Query: 121 LQAECDSIGGCPTGDAKVTGGYNLPAKYIIHTVGPQDGSAEKLESCYEKCLSFQQEYQIK 180 LQAECDSIGGCPTGDAKVTGGYNLPAKYIIHTVGPQDGSAEKLESCYEKCLSFQQEYQIK Sbjct: 152 LQAECDSIGGCPTGDAKVTGGYNLPAKYIIHTVGPQDGSAEKLESCYEKCLSFQQEYQIK 211 Query: 181 SIAFPCISTGIYGFPNRLAAHIALRTARKFLETNTEMNRIIFCTFLPIDVEIYETLMQLY 240 SIAFPCISTGIYGFPNRLAAHIALRTARKFLETNTEMNRIIFCTFLPIDVEIYETLMQLY Sbjct: 212 SIAFPCISTGIYGFPNRLAAHIALRTARKFLETNTEMNRIIFCTFLPIDVEIYETLMQLY 271 Query: 241 FPTL 244 FPTL Sbjct: 272 FPTL 275 >UniRef50_A1Z1Q3 Cluster: MACRO domain-containing protein 2; 
n=41; cellular organisms|Rep: MACRO domain-containing protein 2 - Homo sapiens (Human) Length = 448 Score = 231 bits (564), Expect = 2e-59 Identities = 122/242 (50%), Positives = 168/242 (69%), Gaps = 19/242 (7%) Query: 7 WEIEKNRILKLSLEEKRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDLKEFEKI 66 W EK R+LK++LEE+RK Y D+I L ++ W + + K +G + +++T +E ++ Sbjct: 11 WREEKERLLKMTLEERRKEYLR-DYIPLNSILSWKEEM-KGKGQNDEENT----QETSQV 64 Query: 67 KINTEKNKSISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECD 126 K KS++E+VS+++GDIT LE+DA+VNAAN+ L GGGVDG IHRAAGP L AEC Sbjct: 65 K------KSLTEKVSLYRGDITLLEVDAIVNAANASLLGGGGVDGCIHRAAGPCLLAECR 118 Query: 127 SIGGCPTGDAKVTGGYNLPAKYIIHTVGP-----QDGS-AEKLESCYEKCLSFQQEYQIK 180 ++ GC TG AK+T GY+LPAKY+IHTVGP +GS E L +CY+ L +E I+ Sbjct: 119 NLNGCDTGHAKITCGYDLPAKYVIHTVGPIARGHINGSHKEDLANCYKSSLKLVKENNIR 178 Query: 181 SIAFPCISTGIYGFPNRLAAHIALRTARKFLETN-TEMNRIIFCTFLPIDVEIYETLMQL 239 S+AFPCISTGIYGFPN AA IAL T +++L N E++RIIFC FL +D +IY+ M Sbjct: 179 SVAFPCISTGIYGFPNEPAAVIALNTIKEWLAKNHHEVDRIIFCVFLEVDFKIYKKKMNE 238 Query: 240 YF 241 +F Sbjct: 239 FF 240 >UniRef50_Q6PHJ5 Cluster: Zgc:65960; n=5; cellular organisms|Rep: Zgc:65960 - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 452 Score = 225 bits (550), Expect = 8e-58 Identities = 115/243 (47%), Positives = 163/243 (67%), Gaps = 25/243 (10%) Query: 6 KWEIEKNRILKLSLEEKRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDLKEFEK 65 +W EK R+L LSLE++RK Y+ + +++L+ + W+ + DS +T ++ Sbjct: 7 EWRAEKERLLSLSLEDRRKDYRGN-YLELDKIPTWANH-------DSNTATEEE------ 52 Query: 66 IKINTEKNKSISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAEC 125 ++ S++++VS++KGDIT LEIDA+VNAANS L GGGVDG IHRAAG L EC Sbjct: 53 ----EHQSSSLADKVSLYKGDITILEIDAIVNAANSSLLGGGGVDGCIHRAAGHLLYEEC 108 Query: 126 DSIGGCPTGDAKVTGGYNLPAKYIIHTVGP----QDGSAEK--LESCYEKCLSFQQEYQI 179 S+ GC TG AK+T GY+LPAKY+IHTVGP G +++ LESCY L ++ + Sbjct: 109 HSLNGCDTGKAKITCGYDLPAKYVIHTVGPIARGNVGQSQRDDLESCYYSSLKLMKDNNL 168 Query: 180 KSIAFPCISTGIYGFPNRLAAHIALRTARKFLETN-TEMNRIIFCTFLPIDVEIYETLMQ 238 
+S+AFPCISTGIYGFPN AA IAL+T ++++E + E++R+IFC FL D EIY+ M Sbjct: 169 RSVAFPCISTGIYGFPNEPAAEIALKTVQEWIEKHQDEIDRVIFCVFLETDYEIYKRKMS 228 Query: 239 LYF 241 +F Sbjct: 229 DFF 231 >UniRef50_Q66HV6 Cluster: Zgc:92353; n=1; Danio rerio|Rep: Zgc:92353 - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 248 Score = 218 bits (533), Expect = 9e-56 Identities = 111/243 (45%), Positives = 153/243 (62%), Gaps = 26/243 (10%) Query: 7 WEIEKNRILKLSLEEKRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDLKEFEKI 66 W+ K ++ + E++R++Y+ DFI LE+V WS + S Sbjct: 16 WKQAKTKLCSMDKEKRRELYRV-DFIPLEDVPVWSPSGDSS------------------C 56 Query: 67 KINTEKNKSISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECD 126 K E N+ ++ +VS+F GDITKLEIDAV NAAN L GGGVDGAIHR AGP L+ EC Sbjct: 57 KPRCEVNEELNMKVSLFGGDITKLEIDAVANAANKTLLGGGGVDGAIHRGAGPLLRKECA 116 Query: 127 SIGGCPTGDAKVTGGYNLPAKYIIHTVGP--QDGSAEK----LESCYEKCLSFQQEYQIK 180 ++ GC TG+AK+TG Y LPA+Y+IHTVGP D E+ L +CY CL ++ ++ Sbjct: 117 TLNGCETGEAKITGAYGLPARYVIHTVGPIVHDSVGEREEEALRNCYYNCLHTATKHHLR 176 Query: 181 SIAFPCISTGIYGFPNRLAAHIALRTARKFLETNTE-MNRIIFCTFLPIDVEIYETLMQL 239 ++AFPCISTG+YG+P A +AL+T R +LE N E ++R+IFC FL D ++YE L+ Sbjct: 177 TVAFPCISTGVYGYPPDQAVEVALKTVRDYLEQNPEKLDRVIFCVFLKSDKQLYENLLPA 236 Query: 240 YFP 242 YFP Sbjct: 237 YFP 239 >UniRef50_A7RJ44 Cluster: Predicted protein; n=3; Eukaryota|Rep: Predicted protein - Nematostella vectensis Length = 183 Score = 216 bits (527), Expect = 5e-55 Identities = 97/170 (57%), Positives = 128/170 (75%), Gaps = 3/170 (1%) Query: 76 ISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGD 135 ++++VS++ GDIT LEIDA+VNAAN+ L GGGVDG IHRAAG L EC + GC TG+ Sbjct: 5 LNDKVSLWTGDITALEIDAIVNAANTTLLGGGGVDGCIHRAAGDNLFKECRKLRGCQTGE 64 Query: 136 AKVTGGYNLPAKYIIHTVGPQDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGFP 195 AK+T G+ LPAKY+IHT GP + +KL+ CY+ CL +++ +K++AF CISTGIYG+P Sbjct: 65 AKITLGHRLPAKYVIHTAGPMGKNRKKLQDCYKNCLQLAKQHGVKTLAFCCISTGIYGYP 124 Query: 196 NRLAAHIALRTARKFLET---NTEMNRIIFCTFLPIDVEIYETLMQLYFP 242 N+ AAH+AL T R++LET N + RI+FCTFLP D 
EIYE L+ YFP Sbjct: 125 NKDAAHVALETVRQWLETDDNNDSVERIVFCTFLPKDTEIYERLLLCYFP 174 >UniRef50_Q5DCZ3 Cluster: SJCHGC06209 protein; n=1; Schistosoma japonicum|Rep: SJCHGC06209 protein - Schistosoma japonicum (Blood fluke) Length = 194 Score = 206 bits (504), Expect = 3e-52 Identities = 90/162 (55%), Positives = 120/162 (74%) Query: 76 ISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGD 135 + R+S+++GDIT L IDA+ NAAN +L+ GGGVDGAIHRAAGP L C +GGCPTGD Sbjct: 25 LGSRISLWRGDITHLRIDAIANAANRQLRGGGGVDGAIHRAAGPELLVACQKLGGCPTGD 84 Query: 136 AKVTGGYNLPAKYIIHTVGPQDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGFP 195 AK+T G+NLP+KY+IH VGP + L S Y+K L E+ I+SIAFPCISTG+YGFP Sbjct: 85 AKLTPGFNLPSKYVIHCVGPIGQNDAALGSTYQKALELCSEHNIQSIAFPCISTGVYGFP 144 Query: 196 NRLAAHIALRTARKFLETNTEMNRIIFCTFLPIDVEIYETLM 237 N AA +A+ T +++++ E+ R+IFC F+ ID +IYE L+ Sbjct: 145 NEAAAKVAIHTVLSYMKSHPEIQRVIFCIFMDIDYKIYEKLI 186 >UniRef50_Q9BQ69 Cluster: MACRO domain-containing protein 1; n=18; cellular organisms|Rep: MACRO domain-containing protein 1 - Homo sapiens (Human) Length = 325 Score = 204 bits (499), Expect = 1e-51 Identities = 107/246 (43%), Positives = 154/246 (62%), Gaps = 21/246 (8%) Query: 4 STKWEIEKNRILKLSLEEKRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDLKEF 63 ST W+ K+ + LS +++ + Y DF+ L+ + W + ++G+ K E Sbjct: 92 STDWKEAKSFLKGLSDKQREEHYFCKDFVRLKKIPTWKE---MAKGVAVK-------VEE 141 Query: 64 EKIKINTEKNKSISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQA 123 + K K+K ++E++S+ + DITKLE+DA+VNAANS L GGGVDG IHRAAGP L Sbjct: 142 PRYK----KDKQLNEKISLLRSDITKLEVDAIVNAANSSLLGGGGVDGCIHRAAGPLLTD 197 Query: 124 ECDSIGGCPTGDAKVTGGYNLPAKYIIHTVG------PQDGSAEKLESCYEKCLSFQQEY 177 EC ++ C TG AK+TGGY LPAKY+IHTVG P A +L SCY L E+ Sbjct: 198 ECRTLQSCKTGKAKITGGYRLPAKYVIHTVGPIAYGEPSASQAAELRSCYLSSLDLLLEH 257 Query: 178 QIKSIAFPCISTGIYGFPNRLAAHIALRTARKFLETNTE-MNRIIFCTFLPIDVEIYETL 236 +++S+AFPCISTG++G+P AA I L T R++LE + + ++R+I C FL D +IY + Sbjct: 258 RLRSVAFPCISTGVFGYPCEAAAEIVLATLREWLEQHKDKVDRLIICVFLEKDEDIYRSR 317 Query: 237 MQLYFP 242 + YFP 
Sbjct: 318 LPHYFP 323 >UniRef50_UPI000023F24A Cluster: hypothetical protein FG04179.1; n=1; Gibberella zeae PH-1|Rep: hypothetical protein FG04179.1 - Gibberella zeae PH-1 Length = 220 Score = 186 bits (452), Expect = 6e-46 Identities = 92/174 (52%), Positives = 118/174 (67%), Gaps = 6/174 (3%) Query: 75 SISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTG 134 SI+ R+ + +GDIT+L IDA+VNAAN L+ G GVDGAIH AAGP L E ++G TG Sbjct: 39 SINRRIGLIRGDITELRIDAIVNAANKSLRGGSGVDGAIHSAAGPDLVKESGALGPIDTG 98 Query: 135 DAKVTGGYNLPAKYIIHTVGPQDGSA----EKLESCYEKCLSFQQEYQIKSIAFPCISTG 190 DA +T GY LPAK++IHTVGP GS EKL CY +CL E +++IAF ISTG Sbjct: 99 DAVITKGYKLPAKHVIHTVGPIFGSERHPNEKLAMCYRECLKLAVENGVETIAFSAISTG 158 Query: 191 IYGFPNRLAAHIALRTARKFLET--NTEMNRIIFCTFLPIDVEIYETLMQLYFP 242 IYGFPN AA IA +T R+FLET +++R++F TF+P DV Y ++ FP Sbjct: 159 IYGFPNDPAAKIACQTVREFLETEEGNKLSRVVFVTFVPRDVNAYSKIISTIFP 212 >UniRef50_A6S485 Cluster: Putative uncharacterized protein; n=1; Botryotinia fuckeliana B05.10|Rep: Putative uncharacterized protein - Botryotinia fuckeliana B05.10 Length = 283 Score = 182 bits (443), Expect = 7e-45 Identities = 89/179 (49%), Positives = 117/179 (65%), Gaps = 8/179 (4%) Query: 73 NKSISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCP 132 N+ ++R+ + +GDIT LE+DA+VNAAN+ L GGGVDGAIHRAAGP L EC ++ GC Sbjct: 37 NQFFNDRIGLIRGDITHLEVDAIVNAANNSLLGGGGVDGAIHRAAGPDLLRECRTLNGCR 96 Query: 133 TGDAKVTGGYNLPAKYIIHTVGP------QDGSAEKLESCYEKCLSFQQEYQIKSIAFPC 186 TG AK+T Y LP K +IH VGP + S + LE CY L E K+IAF Sbjct: 97 TGSAKITDAYELPCKKVIHAVGPVYDSYKPEVSEQNLEGCYSTSLDLAVENGCKTIAFSA 156 Query: 187 ISTGIYGFPNRLAAHIALRTARKFLET--NTEMNRIIFCTFLPIDVEIYETLMQLYFPT 243 +STG+YG+P+ AA +AL T R+FLE+ ++M +IIFCTF+P DV Y + FPT Sbjct: 157 LSTGVYGYPSDEAAPVALMTVRRFLESKKGSKMEKIIFCTFVPKDVAAYNEWIPRIFPT 215 >UniRef50_A4R3Q9 Cluster: Putative uncharacterized protein; n=1; Magnaporthe grisea|Rep: Putative uncharacterized protein - Magnaporthe grisea (Rice blast fungus) (Pyricularia grisea) Length = 263 Score = 182 bits 
(442), Expect = 1e-44 Identities = 89/178 (50%), Positives = 119/178 (66%), Gaps = 8/178 (4%) Query: 73 NKSISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCP 132 NK ++R++++ GDITKL +DA+VNAAN L GGGVDG+IHRAAG L EC ++ GC Sbjct: 57 NKRFNDRIALYHGDITKLMVDAIVNAANETLLGGGGVDGSIHRAAGGGLLRECRTLDGCD 116 Query: 133 TGDAKVTGGYNLPAKYIIHTVGPQDGSAEK------LESCYEKCLSFQQEYQIKSIAFPC 186 TGDAKVT Y+LP K +IH VGP + L SCY + L E +SIAFP Sbjct: 117 TGDAKVTDAYDLPCKKVIHAVGPVYNERHREECEMLLSSCYTRSLELAVENGCRSIAFPA 176 Query: 187 ISTGIYGFPNRLAAHIALRTARKFLETN--TEMNRIIFCTFLPIDVEIYETLMQLYFP 242 ISTGIYG+P+R AA+ A+ RKFLE++ +++ ++FC FL D+EIY + L+FP Sbjct: 177 ISTGIYGYPSRRAANAAITAVRKFLESDQGDKISLVVFCCFLQKDMEIYTDKLPLWFP 234 >UniRef50_Q8TQD0 Cluster: UPF0189 protein MA_1614; n=4; cellular organisms|Rep: UPF0189 protein MA_1614 - Methanosarcina acetivorans Length = 195 Score = 181 bits (441), Expect = 1e-44 Identities = 94/179 (52%), Positives = 124/179 (69%), Gaps = 10/179 (5%) Query: 50 IDSKKSTTDDLKE-FEKIKINTEKNKSISERVSIFKGDITKLEIDAVVNAANSRLKAGGG 108 +D +K +LK K +N +N SER+ I + DIT+L++DA+VNAAN+ L GGG Sbjct: 1 MDPQKPYKKELKRNSRKRSLNMSQN---SERIRIIERDITELKVDAIVNAANNTLLGGGG 57 Query: 109 VDGAIHRAAGPFLQAECDSIGGCPTGDAKVTGGYNLPAKYIIHTVGP--QDGSA---EKL 163 VDGAIHRAAGP L EC ++ GCPTG+AK+T GY LPAKY+IHTVGP Q+G+ E L Sbjct: 58 VDGAIHRAAGPGLLEECRTLNGCPTGEAKITKGYLLPAKYVIHTVGPIWQEGTKGEDEFL 117 Query: 164 ESCYEKCLSFQQEYQIKSIAFPCISTGIYGFPNRLAAHIALRTARKFLETNTEMNRIIF 222 SCY K L ++Y +K+IAFP ISTG YGFP+ AA IA+ ++FL+ N E+ I+F Sbjct: 118 ASCYRKSLELARKYDVKTIAFPTISTGAYGFPSERAARIAVSQVKEFLKVN-ELPEIVF 175 >UniRef50_A1IFK2 Cluster: Putative uncharacterized protein; n=1; Candidatus Desulfococcus oleovorans Hxd3|Rep: Putative uncharacterized protein - Candidatus Desulfococcus oleovorans Hxd3 Length = 195 Score = 175 bits (427), Expect = 6e-43 Identities = 84/166 (50%), Positives = 109/166 (65%), Gaps = 5/166 (3%) Query: 74 KSISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPT 133 K I R+ +++GDIT LE+DA+VNAAN L GGGVDGAIHRAAGP L AEC 
++GGC T Sbjct: 23 KEILSRLKVWQGDITTLEVDAIVNAANKTLLGGGGVDGAIHRAAGPELLAECKTLGGCDT 82 Query: 134 GDAKVTGGYNLPAKYIIHTVGP-----QDGSAEKLESCYEKCLSFQQEYQIKSIAFPCIS 188 G AK+T GY LPAK++IHTVGP G A+ L CY L ++ + S+AFP +S Sbjct: 83 GQAKITRGYRLPAKFVIHTVGPVYSRSNPGVAKLLAGCYTNSLKLAKDQGLASVAFPAVS 142 Query: 189 TGIYGFPNRLAAHIALRTARKFLETNTEMNRIIFCTFLPIDVEIYE 234 G+YG+P + A IAL T FLET+ + ++IF F +YE Sbjct: 143 CGVYGYPMKEACRIALDTVCDFLETDRTIEQVIFALFSADAGRVYE 188 >UniRef50_Q0CQJ0 Cluster: Protein LRP16; n=5; cellular organisms|Rep: Protein LRP16 - Aspergillus terreus (strain NIH 2624) Length = 344 Score = 175 bits (426), Expect = 8e-43 Identities = 89/184 (48%), Positives = 118/184 (64%), Gaps = 14/184 (7%) Query: 73 NKSISERVSIFKGDITKL-EIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGC 131 +K +++R+S+ + DITKL ++D +VNAANS L GGGVDGAIHRAAGP L EC ++GGC Sbjct: 34 SKPLNDRISLIRHDITKLLDVDCIVNAANSSLLGGGGVDGAIHRAAGPGLVRECRTLGGC 93 Query: 132 PTGDAKVTGGYNLPAKYIIHTVGP------QDGSA---EKLESCYEKCLSFQQEYQIKSI 182 TGDAK T Y+LP +++IHTVGP Q G+A + L SCY +CL + +SI Sbjct: 94 ATGDAKTTAAYDLPCRWVIHTVGPIYPVERQKGAARPEQLLRSCYRRCLELAVRNKARSI 153 Query: 183 AFPCISTGIYGFPNRLAAHIALRTARKFLETN----TEMNRIIFCTFLPIDVEIYETLMQ 238 AFP ISTG+Y +P R AA IAL R FLE+ + +++FC F D YE + Sbjct: 154 AFPAISTGVYAYPKRRAARIALDETRAFLESEGTDIVTLEKVVFCNFEEEDQRAYEEAVP 213 Query: 239 LYFP 242 FP Sbjct: 214 DVFP 217 >UniRef50_Q0UQZ6 Cluster: Predicted protein; n=1; Phaeosphaeria nodorum|Rep: Predicted protein - Phaeosphaeria nodorum (Septoria nodorum) Length = 291 Score = 174 bits (423), Expect = 2e-42 Identities = 85/177 (48%), Positives = 120/177 (67%), Gaps = 9/177 (5%) Query: 75 SISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTG 134 ++++++SI + DIT L IDA+VNAAN+ L GGGVDGAIHRAAGP L EC+++ GC TG Sbjct: 36 TLNDKISIIRRDITTLAIDAIVNAANTSLLGGGGVDGAIHRAAGPKLYDECETLDGCETG 95 Query: 135 DAKVTGGYNLPAKYIIHTVGP------QDGSAEKLESCYEKCLSFQQEYQIKSIAFPCIS 188 +AK+T GY LP+K +IH VGP + SA+ L CY L + + +SIAF +S Sbjct: 96 
NAKMTRGYELPSKKVIHAVGPIYWKEGRSASAKLLSMCYRTSLQLAVDNECRSIAFSALS 155 Query: 189 TGIYGFPNRLAAHIALRTARKFLETN---TEMNRIIFCTFLPIDVEIYETLMQLYFP 242 TG+YG+P+ AA +AL+T R+FL+ + +++R+IFC FL D Y +Q YFP Sbjct: 156 TGVYGYPSDEAAVVALQTVRQFLDEDGKAEKLDRVIFCNFLEKDENAYYREIQKYFP 212 >UniRef50_Q5KCD7 Cluster: Putative uncharacterized protein; n=2; Filobasidiella neoformans|Rep: Putative uncharacterized protein - Cryptococcus neoformans (Filobasidiella neoformans) Length = 252 Score = 172 bits (418), Expect = 8e-42 Identities = 88/191 (46%), Positives = 116/191 (60%), Gaps = 6/191 (3%) Query: 58 DDLKEFEKIKINTEKNKSISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAA 117 D K E K +++RVSI++GDIT+LE D +VNAANS L GGGVDGAIHRAA Sbjct: 52 DHTNALNPTKPKYEFTKQLNDRVSIWRGDITELEADMIVNAANSSLLGGGGVDGAIHRAA 111 Query: 118 GPFLQAECDSIGGCPTGDAKVTGGYNLPAKYIIHTVG------PQDGSAEKLESCYEKCL 171 G L EC +GG TG+ K T GYNL +K I HTVG P +A+ L+SCY+ L Sbjct: 112 GKHLLEECKKLGGAQTGETKFTAGYNLSSKKIAHTVGPVYHSHPPQRAAQLLKSCYQSSL 171 Query: 172 SFQQEYQIKSIAFPCISTGIYGFPNRLAAHIALRTARKFLETNTEMNRIIFCTFLPIDVE 231 ++ I F ISTG+YG+P + A HIAL T R+FLE + + R+I+ F D + Sbjct: 172 EGCRDSGGGVIGFSSISTGVYGYPIKDATHIALETTRQFLEQDDSITRVIYVVFSKRDED 231 Query: 232 IYETLMQLYFP 242 +Y ++ YFP Sbjct: 232 VYREIIPQYFP 242 >UniRef50_Q8KAE4 Cluster: UPF0189 protein CT2219; n=24; cellular organisms|Rep: UPF0189 protein CT2219 - Chlorobium tepidum Length = 172 Score = 171 bits (416), Expect = 1e-41 Identities = 82/160 (51%), Positives = 104/160 (65%), Gaps = 5/160 (3%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKVT 139 + K DIT L +DA+VNAAN+ L GGGVDGAIHRAAGP L C +GGC TG+AK+T Sbjct: 7 IHAIKADITSLTVDAIVNAANTSLLGGGGVDGAIHRAAGPKLLEACRELGGCLTGEAKIT 66 Query: 140 GGYNLPAKYIIHTVGP-----QDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGF 194 GY LPA ++IHTVGP G AE L SCY L E+ ++IAFP ISTGIYG+ Sbjct: 67 KGYRLPATFVIHTVGPVWHGGNHGEAELLASCYRNSLKLAIEHHCRTIAFPSISTGIYGY 126 Query: 195 PNRLAAHIALRTARKFLETNTEMNRIIFCTFLPIDVEIYE 234 P AA IA+ T R+ L + ++IFC F D+++Y+ Sbjct: 127 
PVEQAAAIAITTVREMLADERGIEKVIFCCFSDRDLDVYQ 166 >UniRef50_A2FMC7 Cluster: Appr-1-p processing enzyme family protein; n=1; Trichomonas vaginalis G3|Rep: Appr-1-p processing enzyme family protein - Trichomonas vaginalis G3 Length = 361 Score = 168 bits (408), Expect = 1e-40 Identities = 90/183 (49%), Positives = 114/183 (62%), Gaps = 4/183 (2%) Query: 64 EKIKINTEKNKSISERVSIF-KGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQ 122 EK + + N I+E++S + +G+ KLE DAVVNAANS L GGG+ G +H AAG ++ Sbjct: 102 EKFEPLYKPNTEINEKISFWMRGNSVKLECDAVVNAANSHLYPGGGICGVLHSAAGEAME 161 Query: 123 AECDSIGGCPTGDAKVTGGYNLPAKYIIHTVGPQDGSAEKLESCYEKCLSFQQEYQIKSI 182 EC IG PTG VT GYNLPAKY IHTVGP +KL+ YE LS +I+S+ Sbjct: 162 RECSEIGYTPTGKCAVTLGYNLPAKYCIHTVGPIGEQPDKLQEAYESTLSCIDGKKIRSV 221 Query: 183 AFPCISTGIYGFPNRLAAHIALRTARKFLE--TNTE-MNRIIFCTFLPIDVEIYETLMQL 239 CISTGIYG+P A IAL+ RKFLE N E +RIIF F DV +Y+ + + Sbjct: 222 GLCCISTGIYGYPIENATPIALKVVRKFLEDPNNREKTDRIIFVVFERRDVVVYDRMRHI 281 Query: 240 YFP 242 YFP Sbjct: 282 YFP 284 >UniRef50_Q2GZS3 Cluster: Putative uncharacterized protein; n=1; Chaetomium globosum|Rep: Putative uncharacterized protein - Chaetomium globosum (Soil fungus) Length = 282 Score = 167 bits (405), Expect = 3e-40 Identities = 83/178 (46%), Positives = 112/178 (62%), Gaps = 8/178 (4%) Query: 73 NKSISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCP 132 +K++++RV + +GDITKL +DA+VNAAN L GGGVD AIHRAAGP L EC +GGC Sbjct: 47 SKTLNDRVGLIRGDITKLAVDAIVNAANRSLLGGGGVDEAIHRAAGPQLYLECRGLGGCE 106 Query: 133 TGDAKVTGGYNLPAKYIIHTVGP------QDGSAEKLESCYEKCLSFQQEYQIKSIAFPC 186 TG AK+T Y LP + +IH VGP +GS L CY + L E +++AF Sbjct: 107 TGSAKMTAAYALPCQRVIHAVGPVYNPFNPEGSERLLTGCYTRSLELAVEAGCRTVAFSA 166 Query: 187 ISTGIYGFPNRLAAHIALRTARKFL--ETNTEMNRIIFCTFLPIDVEIYETLMQLYFP 242 ISTG+YG+P+ AA AL RKFL ++++++ TF DVE Y ++ LYFP Sbjct: 167 ISTGVYGYPSEEAAPAALSAIRKFLVGPDGGKIDKVVVVTFERKDVEAYNEVLPLYFP 224 >UniRef50_Q4WYQ2 Cluster: LRP16 family protein; n=8; cellular organisms|Rep: LRP16 family protein - Aspergillus fumigatus (Sartorya 
fumigata) Length = 354 Score = 165 bits (402), Expect = 7e-40 Identities = 92/183 (50%), Positives = 110/183 (60%), Gaps = 13/183 (7%) Query: 73 NKSISERVSIFKGDITKLE-IDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGC 131 + S + +S+ + DITKLE +D +VNAAN L GGGVDGAIHRAAGP L EC ++ GC Sbjct: 34 SNSFNNIISLIRNDITKLENVDCIVNAANESLLGGGGVDGAIHRAAGPDLLRECRTLKGC 93 Query: 132 PTGDAKVTGGYNLPAKYIIHTVGP---------QDGSAEKLESCYEKCLSFQQEYQIKSI 182 TGDAK+T Y LP K +IHTVGP D L SCY + L E +KSI Sbjct: 94 RTGDAKITSAYELPCKKVIHTVGPIYHFELRKGDDRPEMLLRSCYRRSLELAVENNMKSI 153 Query: 183 AFPCISTGIYGFPNRLAAHIALRTARKFLET--NTE-MNRIIFCTFLPIDVEIYETLMQL 239 AF ISTG+YG+P+ AA AL RKFLE N E + RIIFC F D YE + L Sbjct: 154 AFAAISTGVYGYPSSEAAFAALDEVRKFLERPGNIEKLERIIFCNFERKDEVAYEQAIPL 213 Query: 240 YFP 242 FP Sbjct: 214 IFP 216 >UniRef50_Q17432 Cluster: Putative uncharacterized protein; n=2; Caenorhabditis|Rep: Putative uncharacterized protein - Caenorhabditis elegans Length = 203 Score = 165 bits (400), Expect = 1e-39 Identities = 92/179 (51%), Positives = 110/179 (61%), Gaps = 12/179 (6%) Query: 63 FEKIKINTEKNKSISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAG-PFL 121 FEK K+ K++ R+S++ GDITKL +DA+VNAANSRL GGGVDGAIHRAAG L Sbjct: 13 FEKFKVA----KNVLGRISVWDGDITKLSVDAIVNAANSRLAGGGGVDGAIHRAAGRKQL 68 Query: 122 QAECDSIGGCPTGDAKVTGGYNL-PAKYIIHTVGPQ------DGSAEKLESCYEKCLSFQ 174 Q EC GC GDA +T G N+ K IIHTVGPQ D E L +CY L Sbjct: 69 QEECQQYNGCAVGDAVITSGCNINHIKKIIHTVGPQVYGNVTDERRENLVACYRTSLDIA 128 Query: 175 QEYQIKSIAFPCISTGIYGFPNRLAAHIALRTARKFLETNTEMNRIIFCTFLPIDVEIY 233 E +KSIAF CISTG+YG+PN AA ++LE N + RI+ TFL ID E Y Sbjct: 129 IENGMKSIAFCCISTGVYGYPNDDAAKTVTNFLTEYLEKNDTIERIVLVTFLDIDNEHY 187 >UniRef50_Q0LI88 Cluster: Appr-1-p processing; n=2; cellular organisms|Rep: Appr-1-p processing - Herpetosiphon aurantiacus ATCC 23779 Length = 173 Score = 162 bits (393), Expect = 8e-39 Identities = 83/170 (48%), Positives = 110/170 (64%), Gaps = 5/170 (2%) Query: 76 ISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGD 135 +++R+ I +GDITK A+VNAANS L 
GGGVDGAIHRAAGP L EC +GGC TG Sbjct: 1 MNQRIEILQGDITKFAGAAIVNAANSSLLGGGGVDGAIHRAAGPKLGLECLMLGGCKTGQ 60 Query: 136 AKVTGGYNLPAKYIIHTVGP--QDGS---AEKLESCYEKCLSFQQEYQIKSIAFPCISTG 190 AK+T GY LP + IIHTVGP Q G+ AE L +CY++ L ++Q++++AFP IS G Sbjct: 61 AKMTKGYRLPVRSIIHTVGPVWQGGNKHEAELLTNCYQQSLELAAKHQLETLAFPAISCG 120 Query: 191 IYGFPNRLAAHIALRTARKFLETNTEMNRIIFCTFLPIDVEIYETLMQLY 240 IYG+P LAA IA++T FL TN+ ++ F + Y + Y Sbjct: 121 IYGYPVELAAPIAIQTIANFLTTNSIPEKVSLICFEATVYQAYCVAWEAY 170 >UniRef50_UPI000049917F Cluster: conserved hypothetical protein; n=1; Entamoeba histolytica HM-1:IMSS|Rep: conserved hypothetical protein - Entamoeba histolytica HM-1:IMSS Length = 316 Score = 160 bits (389), Expect = 3e-38 Identities = 84/180 (46%), Positives = 112/180 (62%), Gaps = 4/180 (2%) Query: 68 INT--EKNKSISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAE- 124 +NT EKN+ +++++ I GDITK+++D VVNAANS L+ GGGVDGAIH AAG L Sbjct: 37 VNTGYEKNEEMNKKIIIITGDITKIQVDVVVNAANSYLRGGGGVDGAIHCAAGYDLYDYL 96 Query: 125 CDSIGGCPTGDAKVTGGYNLPAKYIIHTVGPQDGSAEKLESCYEKCLSFQQEYQIKSIAF 184 C C TGD K + G+ +P K I+H VGP +A +L+S Y +CL + + KSIAF Sbjct: 97 CSHYTYCKTGDFKPSPGFKMPCKEILHGVGPIGENAIQLQSVYVRCLEYVRLKGYKSIAF 156 Query: 185 PCISTGIYGFPNRLAAHIALRTARKFLETNTEM-NRIIFCTFLPIDVEIYETLMQLYFPT 243 PCISTGI+G+ N A + L R +LE N +IIFC + D IY + YFPT Sbjct: 157 PCISTGIFGYNNNSACPVVLEVVRNWLEVNPLWEGKIIFCCYNLTDYNIYLKFLPYYFPT 216 >UniRef50_A5WHZ6 Cluster: Appr-1-p processing domain protein; n=2; Bacteria|Rep: Appr-1-p processing domain protein - Psychrobacter sp. 
PRwf-1 Length = 194 Score = 160 bits (388), Expect = 3e-38 Identities = 77/169 (45%), Positives = 109/169 (64%), Gaps = 9/169 (5%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKVT 139 +++ + DIT L++DA+VNAANS L GGGVDGAIHRAAGP L A C ++ GC TG+AK++ Sbjct: 26 LTLIQADITTLKVDAIVNAANSSLLGGGGVDGAIHRAAGPELVAYCRTLNGCATGEAKIS 85 Query: 140 GGYNLPAKYIIHTVGP-----QDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGF 194 G+ LPA+Y+I+TVGP G E L SCY L+ Q++ IKSIAFP ISTG+YG+ Sbjct: 86 PGFKLPAQYVIYTVGPVWHGGNQGEPELLASCYRNSLALAQQHDIKSIAFPAISTGVYGY 145 Query: 195 PNRLAAHIALRTARKFLE----TNTEMNRIIFCTFLPIDVEIYETLMQL 239 P A IA+ + ++ + + +I+C F D +Y+ + L Sbjct: 146 PIEQATDIAINSVIDSIQQASVSQLVITEVIYCCFSAADAAVYKQQLNL 194 >UniRef50_Q985D2 Cluster: UPF0189 protein mll7730; n=54; cellular organisms|Rep: UPF0189 protein mll7730 - Rhizobium loti (Mesorhizobium loti) Length = 176 Score = 160 bits (388), Expect = 3e-38 Identities = 79/161 (49%), Positives = 99/161 (61%), Gaps = 5/161 (3%) Query: 78 ERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAK 137 +R+ I GDITKL++DA+VNAAN+ L GGGVDGAIHRAAG L+ EC + GC GDAK Sbjct: 6 DRIRIHTGDITKLDVDAIVNAANTLLLGGGGVDGAIHRAAGRELEVECRMLNGCKVGDAK 65 Query: 138 VTGGYNLPAKYIIHTVGP-----QDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIY 192 +T GY LPA++IIHTVGP G AE L SCY L +S+AFP ISTG+Y Sbjct: 66 ITKGYKLPARHIIHTVGPVWQGGGKGEAELLASCYRSSLELAAANDCRSVAFPAISTGVY 125 Query: 193 GFPNRLAAHIALRTARKFLETNTEMNRIIFCTFLPIDVEIY 233 +P A IA+ T +E +IFC F ++Y Sbjct: 126 RYPKDEATGIAVGTVSMVIEEKAMPETVIFCCFDEQTAQLY 166 >UniRef50_Q1K4D1 Cluster: Appr-1-p processing; n=1; Desulfuromonas acetoxidans DSM 684|Rep: Appr-1-p processing - Desulfuromonas acetoxidans DSM 684 Length = 193 Score = 157 bits (382), Expect = 2e-37 Identities = 75/153 (49%), Positives = 98/153 (64%), Gaps = 5/153 (3%) Query: 78 ERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAK 137 +R+ I K DIT+L +DA+VN A ++L GGVDGAIH AAGP L EC + GC G AK Sbjct: 2 KRIEIIKADITQLNVDAIVNTATTKLLGSGGVDGAIHDAAGPELMEECRRLKGCLVGTAK 61 Query: 138 
VTGGYNLPAKYIIHTVGPQ--DGSAEK---LESCYEKCLSFQQEYQIKSIAFPCISTGIY 192 +T GYNLPA+Y+IHTVGPQ +G + L SCY C S +EY +K++AFP IS G Y Sbjct: 62 ITSGYNLPARYVIHTVGPQWDEGQGNEQALLASCYRACFSLAREYGLKTLAFPAISCGSY 121 Query: 193 GFPNRLAAHIALRTARKFLETNTEMNRIIFCTF 225 FP A IA+ + L N ++ R+IF + Sbjct: 122 QFPVPTACEIAMDVVEQCLRGNDQIERVIFVCY 154 >UniRef50_A2DTG7 Cluster: Appr-1-p processing enzyme family protein; n=2; Trichomonas vaginalis G3|Rep: Appr-1-p processing enzyme family protein - Trichomonas vaginalis G3 Length = 316 Score = 157 bits (382), Expect = 2e-37 Identities = 87/178 (48%), Positives = 108/178 (60%), Gaps = 7/178 (3%) Query: 69 NTEKNKSISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAG-PFLQAECDS 127 NTE NK IS + GD TKL+ DA+VNAANS L AGGG+ GAI AAG LQ CD Sbjct: 48 NTEINKKISFWMG---GDSTKLKCDAIVNAANSYLAAGGGICGAIFSAAGYEELQKACDE 104 Query: 128 IGGCPTGDAKVTGGYNLPAKYIIHTVGPQDGSAEKLESCYEKCLSFQQEYQIKSIAFPCI 187 G TG AK+T G+ LP+KY+IH VGP E L S Y L F ++KSIAF CI Sbjct: 105 QGYTETGGAKMTPGFRLPSKYVIHAVGPVGVHPEALRSAYNLTLGFMDNDKVKSIAFCCI 164 Query: 188 STGIYGFPNRLAAHIALRTARKFLETNTEM---NRIIFCTFLPIDVEIYETLMQLYFP 242 STGIYG+ A +AL T RK+LE + +R++F F+P D ++Y +YFP Sbjct: 165 STGIYGYSIEKATPVALDTVRKWLEVPENLAKTDRLVFVVFMPKDQQVYSHFAHVYFP 222 >UniRef50_Q01WP7 Cluster: Appr-1-p processing domain protein; n=1; Solibacter usitatus Ellin6076|Rep: Appr-1-p processing domain protein - Solibacter usitatus (strain Ellin6076) Length = 178 Score = 155 bits (376), Expect = 1e-36 Identities = 79/177 (44%), Positives = 108/177 (61%), Gaps = 10/177 (5%) Query: 71 EKNKSISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSI-- 128 E S +++ + +GDIT++ +D + NAANS L GGGVDGAIHRA GP + E D+I Sbjct: 2 EWTSSTGKKIVLIRGDITRIAVDVMANAANSALAGGGGVDGAIHRAGGPAIMRELDAIRA 61 Query: 129 --GGCPTGDAKVTGGYNLPAKYIIHTVGP-----QDGSAEKLESCYEKCLSFQQEYQIKS 181 GGCPTG A T +LPA+Y+ H VGP G E L +CY CL +E ++++ Sbjct: 62 RSGGCPTGSAVATSAGSLPARYVFHAVGPVWRGGGCGEPELLAACYRTCLDLARERKLRT 121 Query: 182 IAFPCISTGIYGFPNRLAAHIALRTARKFLE-TNTEMNRIIFCTFLPIDVEIYETLM 237 I+FP ISTGIYG+P 
+ AA IA+R + LE T + ++IF F P IY L+ Sbjct: 122 ISFPAISTGIYGYPLQAAAAIAIREVQSHLEDPTTSIEQVIFVLFDPHAENIYADLL 178 >UniRef50_A5TRW5 Cluster: Putative uncharacterized protein; n=1; Fusobacterium nucleatum subsp. polymorphum ATCC 10953|Rep: Putative uncharacterized protein - Fusobacterium nucleatum subsp. polymorphum ATCC 10953 Length = 175 Score = 153 bits (371), Expect = 4e-36 Identities = 79/143 (55%), Positives = 100/143 (69%), Gaps = 6/143 (4%) Query: 80 VSIFKGDITKL-EIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKV 138 + + GDITK+ E++A+VNAAN+ L+ GGGV GAI RAAG L EC IG C TG+A + Sbjct: 6 IKLVNGDITKIPEVEAIVNAANNYLEMGGGVCGAIFRAAGTELIKECKEIGSCKTGEAVI 65 Query: 139 TGGYNLPAKYIIHTVGP-----QDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYG 193 T GYNLP KYIIHTVGP ++G AEKL+S Y + L ++ I+ IAFP ISTGIY Sbjct: 66 TKGYNLPNKYIIHTVGPRYTNSENGEAEKLKSAYYESLKLAKKKGIRKIAFPSISTGIYR 125 Query: 194 FPNRLAAHIALRTARKFLETNTE 216 FP A IAL TA+KFL+ N++ Sbjct: 126 FPVDEGAEIALSTAKKFLDENSD 148 >UniRef50_Q4P1I0 Cluster: Putative uncharacterized protein; n=1; Ustilago maydis|Rep: Putative uncharacterized protein - Ustilago maydis (Smut fungus) Length = 220 Score = 153 bits (370), Expect = 5e-36 Identities = 86/175 (49%), Positives = 102/175 (58%), Gaps = 9/175 (5%) Query: 77 SERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDA 136 S +SIF GDIT L IDA+VNAAN+ L GGGVDGAIHRAAG L EC + GC TG A Sbjct: 35 SHLLSIFTGDITTLSIDAIVNAANNSLLGGGGVDGAIHRAAGRELVVECGKLNGCETGSA 94 Query: 137 KVTGGYNLPAKYIIHTVGPQDGSAEK------LESCYEKCLSFQQEYQIKSIAFPCISTG 190 K T GY LP+K++IHTVGP S+ L S Y L ++ KSIAFP ISTG Sbjct: 95 KTTLGYALPSKHVIHTVGPVYNSSRHEECERLLRSAYRSSLEELRKIGAKSIAFPSISTG 154 Query: 191 IYGFPNRLAAHIALRTARKFLETN---TEMNRIIFCTFLPIDVEIYETLMQLYFP 242 +YG+P AA AL +LE+N + RI+ C F D Y L FP Sbjct: 155 VYGYPFDTAATAALDEIGSWLESNENHKHIERIVLCCFSQKDYNKYLELAPTVFP 209 >UniRef50_Q9HXU7 Cluster: UPF0189 protein PA3693; n=13; Bacteria|Rep: UPF0189 protein PA3693 - Pseudomonas aeruginosa Length = 173 Score = 151 bits (367), Expect = 1e-35 Identities = 76/163 
(46%), Positives = 103/163 (63%), Gaps = 5/163 (3%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKVT 139 V +++GDIT+L +DA+VNAANS L GGGVDGAIHRAAG L A C + GC TG+AK+T Sbjct: 4 VRVWQGDITRLAVDAIVNAANSSLLGGGGVDGAIHRAAGAELVAACRLLHGCKTGEAKIT 63 Query: 140 GGYNLPAKYIIHTVGP-----QDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGF 194 G+ LPA ++IHTVGP +G AE L SCY + L+ ++ S+AFP IS GIYG+ Sbjct: 64 RGFRLPAAHVIHTVGPVWRGGDNGEAELLASCYRRSLALAEQAGAASVAFPAISCGIYGY 123 Query: 195 PNRLAAHIALRTARKFLETNTEMNRIIFCTFLPIDVEIYETLM 237 P AA IA+ + ++ + I+ F E Y+ L+ Sbjct: 124 PLEQAAAIAVEEVCRQRPAHSSLEEIVLVAFDSSMAERYQRLL 166 >UniRef50_A6NXN8 Cluster: Putative uncharacterized protein; n=1; Bacteroides capillosus ATCC 29799|Rep: Putative uncharacterized protein - Bacteroides capillosus ATCC 29799 Length = 347 Score = 151 bits (366), Expect = 2e-35 Identities = 72/140 (51%), Positives = 91/140 (65%), Gaps = 5/140 (3%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKVT 139 + I + DITK+++DA+VNAAN L GGGVDG IHRAAGP L EC+++ GC TG AK+T Sbjct: 3 LQIVRNDITKMKVDAIVNAANESLLGGGGVDGCIHRAAGPELLTECETLHGCKTGSAKIT 62 Query: 140 GGYNLPAKYIIHTVGP-----QDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGF 194 GY LP KY+IH VGP + G E L SCY L +EY +S AFP IS+GI+G+ Sbjct: 63 KGYKLPCKYVIHAVGPRWYDGRHGERELLTSCYRTSLMLAKEYGCESAAFPLISSGIFGY 122 Query: 195 PNRLAAHIALRTARKFLETN 214 P A +A+ T FL N Sbjct: 123 PKDQALKVAIDTISSFLLEN 142 >UniRef50_Q8B4N1 Cluster: ORF-1; n=8; root|Rep: ORF-1 - Rock bream iridovirus Length = 566 Score = 150 bits (364), Expect = 3e-35 Identities = 76/171 (44%), Positives = 107/171 (62%), Gaps = 9/171 (5%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKVT 139 VS+ DIT L +DA+VNAAN+ GGGVDG IHR AG L+ EC ++GG G+AK+T Sbjct: 392 VSVVLDDITSLRVDAIVNAANTVGLGGGGVDGRIHRVAGRELKRECRTLGGIGFGEAKIT 451 Query: 140 GGYNLPAKYIIHTVGP------QDGSAEK--LESCYEKCLSFQQEYQIKSIAFPCISTGI 191 GGY LPA Y+IHTVGP + A+K L SCY + L Q +++IAFP ISTG+ Sbjct: 452 GGYRLPATYVIHTVGPIINAGQRPTQADKRVLTSCYIQSLHVAQANGVRTIAFPSISTGV 
511 Query: 192 YGFPNRLAAHIALRTARKF-LETNTEMNRIIFCTFLPIDVEIYETLMQLYF 241 Y +P A H+A+ + R + ++ + I+FCT+ D ++Y + + YF Sbjct: 512 YNYPIEDAVHVAMSSVRAYVIQHPGAFDHIVFCTYSNADFDVYNSQLPTYF 562 >UniRef50_Q8RB30 Cluster: UPF0189 protein TTE0995; n=20; Bacteria|Rep: UPF0189 protein TTE0995 - Thermoanaerobacter tengcongensis Length = 175 Score = 149 bits (360), Expect = 8e-35 Identities = 80/167 (47%), Positives = 102/167 (61%), Gaps = 10/167 (5%) Query: 76 ISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSI----GGC 131 + E++ + KG+I E+DA+VNAANS L GGGVDGAIH+A GP + E I GGC Sbjct: 1 MKEKIKLIKGNIVDQEVDAIVNAANSSLIGGGGVDGAIHKAGGPAIAEELKVIREKQGGC 60 Query: 132 PTGDAKVTGGYNLPAKYIIHTVGP--QDGSAEK---LESCYEKCLSFQQEYQIKSIAFPC 186 PTG A +TG NL AKY+IH VGP + G+ + L S Y + L EY +K+IAFP Sbjct: 61 PTGHAVITGAGNLKAKYVIHAVGPIWKGGNHNEDNLLASAYIESLKLADEYNVKTIAFPS 120 Query: 187 ISTGIYGFPNRLAAHIALRTARKFLETNTEMNRIIFCTFLPIDVEIY 233 ISTG YGFP AA IALR +LE + + + F F D E+Y Sbjct: 121 ISTGAYGFPVERAARIALRVVSDYLE-GSSIKEVRFVLFSDRDYEVY 166 >UniRef50_A6BCW6 Cluster: Putative uncharacterized protein; n=2; Bacteria|Rep: Putative uncharacterized protein - Dorea longicatena DSM 13814 Length = 267 Score = 148 bits (358), Expect = 1e-34 Identities = 80/175 (45%), Positives = 115/175 (65%), Gaps = 17/175 (9%) Query: 78 ERVSIFKGDITKLEIDAVVNAANSRL-----KAGGGVDGAIHRAAGPFLQAECDSIGGC- 131 +++S+++GDIT+L +DA+VNAANS++ G +D AIH AAG L+ EC I Sbjct: 92 DKISLWRGDITRLSVDAIVNAANSQMLGCFVPCHGCIDNAIHSAAGIQLRNECAQIMEAQ 151 Query: 132 ----PTGDAKVTGGYNLPAKYIIHTVGPQDG------SAEKLESCYEKCLSFQQEYQIKS 181 PTG AK+T GYNLPAK++IHTVGP G E+L+SCY C+ ++ +KS Sbjct: 152 GHEEPTGKAKITKGYNLPAKHVIHTVGPIVGMQVTEKQEEELKSCYLNCMKLAEKEGLKS 211 Query: 182 IAFPCISTGIYGFPNRLAAHIALRTARKFLETNTEMNRIIFCTFLPIDVEIYETL 236 IAF CISTG + FPN+LAA IA++T K+L +++++ R+IF F D IY+ + Sbjct: 212 IAFCCISTGEFHFPNKLAAEIAVKTVDKYL-SSSKLERVIFNVFKEEDYNIYKKI 265 >UniRef50_A5V0Y4 Cluster: Appr-1-p processing domain protein; n=5; Bacteria|Rep: Appr-1-p processing domain protein - Roseiflexus sp. 
RS-1 Length = 181 Score = 147 bits (356), Expect = 3e-34 Identities = 77/163 (47%), Positives = 100/163 (61%), Gaps = 4/163 (2%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKVT 139 + + +G+I + ++DA+VNAAN L GGGV GAIHRAAGP L EC IGGCPTG+A++T Sbjct: 10 LELIRGNIVEQDVDAIVNAANETLAPGGGVSGAIHRAAGPELADECARIGGCPTGEARIT 69 Query: 140 GGYNLPAKYIIHTVGPQ-DGS---AEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGFP 195 GY L A+++IH VGP+ G+ AE L S Y L + ++SIAFP ISTGIYG+P Sbjct: 70 AGYRLKARHVIHAVGPRYSGNPRDAELLASAYRSALMLAASHGLQSIAFPSISTGIYGYP 129 Query: 196 NRLAAHIALRTARKFLETNTEMNRIIFCTFLPIDVEIYETLMQ 238 AA IAL T R L + + + F F YE Q Sbjct: 130 LDQAAPIALATCRDVLLNHPGVALVRFVLFDEETYRAYEQAAQ 172 >UniRef50_Q8EYT0 Cluster: UPF0189 protein LA_4133; n=11; cellular organisms|Rep: UPF0189 protein LA_4133 - Leptospira interrogans Length = 175 Score = 147 bits (355), Expect = 3e-34 Identities = 77/174 (44%), Positives = 107/174 (61%), Gaps = 9/174 (5%) Query: 76 ISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSI----GGC 131 ++ ++ + K DIT+LE+DA+VNAANS L GGGVDGAIHRA GP + EC I G C Sbjct: 1 MNNKIKLIKEDITQLEVDAIVNAANSSLLGGGGVDGAIHRAGGPEILEECYKIREKQGEC 60 Query: 132 PTGDAKVTGGYNLPAKYIIHTVGP-----QDGSAEKLESCYEKCLSFQQEYQIKSIAFPC 186 G+A +T L AK+IIHTVGP E L + Y+ L + + +K+IAFP Sbjct: 61 KVGEAVITTAGRLNAKFIIHTVGPIWSGGNKNEDELLSNAYKNSLLLAKNHSLKTIAFPN 120 Query: 187 ISTGIYGFPNRLAAHIALRTARKFLETNTEMNRIIFCTFLPIDVEIYETLMQLY 240 ISTGIY FP AA IA+++ +FL+ + ++ + F F ++EIY L+Q Y Sbjct: 121 ISTGIYHFPKERAAKIAIQSVTEFLKQDNQIQTVFFVCFDFENLEIYNKLLQTY 174 >UniRef50_A0LGZ1 Cluster: Appr-1-p processing domain protein; n=1; Syntrophobacter fumaroxidans MPOB|Rep: Appr-1-p processing domain protein - Syntrophobacter fumaroxidans (strain DSM 10017 / MPOB) Length = 175 Score = 146 bits (354), Expect = 4e-34 Identities = 75/164 (45%), Positives = 100/164 (60%), Gaps = 4/164 (2%) Query: 79 RVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKV 138 ++S+ +GD+T+L +DA+VNAAN L GGGV GAI GP +Q ECD+IGG G A + Sbjct: 9 
KISLVQGDLTELRVDAIVNAANRHLALGGGVAGAIRMKGGPTIQEECDAIGGTVVGQAVI 68 Query: 139 TGGYNLPAKYIIHTVGPQDGSA---EKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGFP 195 TGG NL A ++IH VGP+ G EKL + L E + SIAFP +STGI+GFP Sbjct: 69 TGGGNLKAAHVIHAVGPRYGEGDEDEKLRNATLNSLKRATEKSLASIAFPAVSTGIFGFP 128 Query: 196 NRLAAHIALRTARKFLE-TNTEMNRIIFCTFLPIDVEIYETLMQ 238 A I L A FL+ T + +IFC + D+EI+E +Q Sbjct: 129 KDRCAKIMLDAAVAFLDRETTSLRDVIFCLWSKEDLEIFEKTLQ 172 >UniRef50_Q926Y8 Cluster: UPF0189 protein lin2902; n=14; Firmicutes|Rep: UPF0189 protein lin2902 - Listeria innocua Length = 176 Score = 146 bits (353), Expect = 6e-34 Identities = 79/169 (46%), Positives = 105/169 (62%), Gaps = 11/169 (6%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAEC----DSIGGCPTGD 135 +++ KGDIT+ +D +VNAAN L GGGVDGAIH+AAGP L EC + IG CP G+ Sbjct: 3 ITVVKGDITEQNVDVIVNAANPGLLGGGGVDGAIHQAAGPDLLKECQEVINRIGSCPAGE 62 Query: 136 AKVTGGYNLPAKYIIHTVGP--QDG---SAEKLESCYEKCLSFQQEYQIKSIAFPCISTG 190 A +T +L A +IIH VGP +DG A KL SCY K L + SIAFP ISTG Sbjct: 63 AVITSAGDLKAHFIIHAVGPIWKDGEHQEANKLASCYWKALDLAAGKDLTSIAFPNISTG 122 Query: 191 IYGFPNRLAAHIALRTARKFLET--NTEMNRIIFCTFLPIDVEIYETLM 237 +YGFP +LAA +AL T RK+ E ++ + + F F ++ +Y L+ Sbjct: 123 VYGFPKKLAAEVALYTVRKWAEEEYDSSIKEVRFVCFDEENLTLYNKLI 171 >UniRef50_Q4DSL4 Cluster: Putative uncharacterized protein; n=3; Trypanosoma|Rep: Putative uncharacterized protein - Trypanosoma cruzi Length = 297 Score = 145 bits (351), Expect = 1e-33 Identities = 68/163 (41%), Positives = 96/163 (58%), Gaps = 2/163 (1%) Query: 71 EKNKSISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGG 130 + + I +++ G +T L++DA+VNAAN G GVDGAIH AAGP L EC + G Sbjct: 116 DPSHDILRHIALHNGPVTDLQLDAIVNAANKTCLGGKGVDGAIHAAAGPLLVRECATFNG 175 Query: 131 CPTGDAKVTGGYNLPAKYIIHTVGPQDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTG 190 C TG ++T GYNLPA+Y++HTVGP E L SCY LS +++SI F C+STG Sbjct: 176 CDTGQCRITKGYNLPARYVLHTVGPIGERPEALRSCYRSILSLAHRNRLRSIGFCCVSTG 235 Query: 191 IYGFPNRLAAHIALRTARKFLETNTEMNRIIFCTFLPIDVEIY 233 +YG+P A IA+ ++L+ + + C F +E Y Sbjct: 236 
VYGYPLIPATRIAVDETIEYLKQH--FSAFDLCCFACFKLEEY 276 >UniRef50_Q88SK6 Cluster: UPF0189 protein lp_3408; n=13; cellular organisms|Rep: UPF0189 protein lp_3408 - Lactobacillus plantarum Length = 172 Score = 145 bits (351), Expect = 1e-33 Identities = 69/139 (49%), Positives = 92/139 (66%), Gaps = 5/139 (3%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKVT 139 + + GDITK+ +DA+VNAAN+ L GGGVDGAIHRAAGP L A C + GC TG+AK+T Sbjct: 4 IKVIHGDITKMTVDAIVNAANTSLLGGGGVDGAIHRAAGPALLAACRPLHGCATGEAKIT 63 Query: 140 GGYNLPAKYIIHTVGP-----QDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGF 194 G+ LPAKY+IHT GP Q + L + Y L+ E +++AFP ISTG+Y F Sbjct: 64 PGFRLPAKYVIHTPGPVWQGGQHNELQLLANSYRNSLNLAAENHCQTVAFPSISTGVYHF 123 Query: 195 PNRLAAHIALRTARKFLET 213 P +AA +AL+T + +T Sbjct: 124 PLSIAAPLALKTLQATAQT 142 >UniRef50_Q94JV1 Cluster: At1g69340/F10D13.28; n=9; Magnoliophyta|Rep: At1g69340/F10D13.28 - Arabidopsis thaliana (Mouse-ear cress) Length = 562 Score = 144 bits (350), Expect = 1e-33 Identities = 73/174 (41%), Positives = 104/174 (59%), Gaps = 8/174 (4%) Query: 76 ISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGD 135 I+ R+ +++G+ LE+DAVVN+ N L G +H AAGP L +C ++GGC TG Sbjct: 83 INSRIYLWRGEPWNLEVDAVVNSTNENLDEAHSSPG-LHVAAGPGLAEQCATLGGCRTGM 141 Query: 136 AKVTGGYNLPAKYIIHTVGPQ------DGSAEKLESCYEKCLSFQQEYQIKSIAFPCIST 189 AKVT Y+LPA+ +IHTVGP+ + L CY CL + ++SIA CI T Sbjct: 142 AKVTNAYDLPARRVIHTVGPKYAVKYHTAAENALSHCYRSCLELLIDSGLQSIALGCIYT 201 Query: 190 GIYGFPNRLAAHIALRTARKFLETNTE-MNRIIFCTFLPIDVEIYETLMQLYFP 242 +P AAH+A+RT R+FLE + ++ ++FCT D EIY+ L+ LYFP Sbjct: 202 EAKNYPREPAAHVAIRTVRRFLEKQKDKISAVVFCTTTSSDTEIYKRLLPLYFP 255 >UniRef50_P67341 Cluster: UPF0189 protein ymdB; n=11; Bacteria|Rep: UPF0189 protein ymdB - Salmonella typhimurium Length = 179 Score = 140 bits (339), Expect = 3e-32 Identities = 74/171 (43%), Positives = 98/171 (57%), Gaps = 9/171 (5%) Query: 76 ISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSI----GGC 131 ++ R+ + +GDIT+L +DA+VNAAN+ L GGGVDGAIHRAAGP L C I G C 
Sbjct: 1 MTSRLQVIQGDITQLSVDAIVNAANASLMGGGGVDGAIHRAAGPALLDACKLIRQQQGEC 60 Query: 132 PTGDAKVTGGYNLPAKYIIHTVGP-----QDGSAEKLESCYEKCLSFQQEYQIKSIAFPC 186 TG A +T L AK +IHTVGP + AE LE Y CL + +SIAFP Sbjct: 61 QTGHAVITPAGKLSAKAVIHTVGPVWRGGEHQEAELLEEAYRNCLLLAEANHFRSIAFPA 120 Query: 187 ISTGIYGFPNRLAAHIALRTARKFLETNTEMNRIIFCTFLPIDVEIYETLM 237 ISTG+YG+P AA +A+RT F+ ++ F + +Y L+ Sbjct: 121 ISTGVYGYPRAQAAEVAVRTVSDFITRYALPEQVYFVCYDEETARLYARLL 171 >UniRef50_Q8PHB6 Cluster: UPF0189 protein XAC3343; n=9; Proteobacteria|Rep: UPF0189 protein XAC3343 - Xanthomonas axonopodis pv. citri Length = 179 Score = 137 bits (332), Expect = 2e-31 Identities = 69/167 (41%), Positives = 103/167 (61%), Gaps = 11/167 (6%) Query: 79 RVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIG------GCP 132 R+ +++GDIT+L++D +VNAAN L GGGVDGAIHRAAGP L C+++ CP Sbjct: 2 RIEVWQGDITELDVDVIVNAANESLLGGGGVDGAIHRAAGPRLLEACEALPQVRPGVRCP 61 Query: 133 TGDAKVTGGYNLPAKYIIHTVGP--QDG---SAEKLESCYEKCLSFQQEYQIKSIAFPCI 187 TG+ ++T G++L A++I HTVGP +DG E+L +CY + L ++ + SIAFP I Sbjct: 62 TGEIRITDGFDLKARHIFHTVGPVWRDGRHNEPEQLANCYWQSLKLAEQMMLHSIAFPAI 121 Query: 188 STGIYGFPNRLAAHIALRTARKFLETNTEMNRIIFCTFLPIDVEIYE 234 S GIYG+P AA IA+ R + ++ I+ + + Y+ Sbjct: 122 SCGIYGYPLHQAARIAVTETRDWQRSHKVPKHIVLVAYNEATYKAYQ 168 >UniRef50_UPI0000498CB9 Cluster: conserved hypothetical protein; n=1; Entamoeba histolytica HM-1:IMSS|Rep: conserved hypothetical protein - Entamoeba histolytica HM-1:IMSS Length = 348 Score = 135 bits (326), Expect = 1e-30 Identities = 76/197 (38%), Positives = 112/197 (56%), Gaps = 15/197 (7%) Query: 54 KSTTDDLKEFEKIKINTEKNKSISERVSIFKGDITKLEIDAVVNAANSRLKA-----GGG 108 +S ++ +++ + I+ NK S+ + ++KGDITKL+ID++VNAAN+ L Sbjct: 67 QSELGEIIDYKSLPIHPNLNKQFSKSIRVWKGDITKLKIDSIVNAANNTLVGCFIPLHSC 126 Query: 109 VDGAIHRAAGPFLQAECDSIGGC---PTGDAKVTGGYNLPAKYIIHTVGP-----QDGSA 160 VD IH AG L+ EC + T ++T GYNLPAKY+IH VGP + + Sbjct: 127 VDSIIHERAGVQLRHECSQLKTAYKATTTTTEITKGYNLPAKYVIHVVGPIVDTLKPKHS 186 Query: 161 
EKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGFPNRLAAHIALRTARKFLETNTEMNRI 220 L+ CY CL+ + SI F CISTG++GFPN AA IA++T FL+ + + Sbjct: 187 YLLQQCYLNCLNKAIKAGCTSIGFCCISTGMFGFPNEEAAKIAIQTVNNFLKNH--QIEV 244 Query: 221 IFCTFLPIDVEIYETLM 237 +FC F ID IY +L+ Sbjct: 245 VFCVFKEIDYNIYTSLL 261 >UniRef50_A0H6G6 Cluster: Appr-1-p processing; n=1; Chloroflexus aggregans DSM 9485|Rep: Appr-1-p processing - Chloroflexus aggregans DSM 9485 Length = 184 Score = 133 bits (321), Expect = 4e-30 Identities = 66/135 (48%), Positives = 91/135 (67%), Gaps = 7/135 (5%) Query: 79 RVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPF-LQAECDSIGGCPTGDAK 137 R+ + +GDI +DA+VNAAN +L+ GGGV GAI RAAG LQ CD++ CPTG+A+ Sbjct: 13 RIELCEGDIVTQSVDAIVNAANEQLRQGGGVCGAIFRAAGAADLQRACDAVAPCPTGEAR 72 Query: 138 VTGGYNLPAKYIIHTVGP-----QDGSAEK-LESCYEKCLSFQQEYQIKSIAFPCISTGI 191 +T G+ LPA+Y+IH VGP A++ L S Y L+ ++Y ++SIAFP I+TGI Sbjct: 73 ITPGFALPARYVIHAVGPIFDSYSPTEADRLLVSAYRASLALARQYGVRSIAFPSIATGI 132 Query: 192 YGFPNRLAAHIALRT 206 YGFP AA + +RT Sbjct: 133 YGFPVERAAPLVIRT 147 >UniRef50_O22875 Cluster: Expressed protein; n=7; Magnoliophyta|Rep: Expressed protein - Arabidopsis thaliana (Mouse-ear cress) Length = 193 Score = 133 bits (321), Expect = 4e-30 Identities = 76/164 (46%), Positives = 98/164 (59%), Gaps = 14/164 (8%) Query: 73 NKSISERVSIFKGDITKLEID----AVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSI 128 N S S + I KGDITK +D A+VN AN R+ GGG DGAIHRAAGP L+A C + Sbjct: 11 NLSDSSLLKILKGDITKWSVDSSSDAIVNPANERMLGGGGADGAIHRAAGPQLRAACYEV 70 Query: 129 G------GCPTGDAKVTGGYNLPAKYIIHTVGPQDGS----AEKLESCYEKCLSFQQEYQ 178 CPTG+A++T G+NLPA +IHTVGP S E L + Y+ L +E Sbjct: 71 PEVRPGVRCPTGEARITPGFNLPASRVIHTVGPIYDSDVNPQESLTNSYKNSLRVAKENN 130 Query: 179 IKSIAFPCISTGIYGFPNRLAAHIALRTARKFLETNTEMNRIIF 222 IK IAFP IS GIYG+P AA I + T ++F E++ ++F Sbjct: 131 IKYIAFPAISCGIYGYPFDEAAAIGISTIKQFSTDFKEVHFVLF 174 >UniRef50_UPI0000498318 Cluster: conserved hypothetical protein; n=1; Entamoeba histolytica HM-1:IMSS|Rep: conserved hypothetical protein - Entamoeba histolytica HM-1:IMSS Length 
= 627 Score = 132 bits (320), Expect = 6e-30 Identities = 90/245 (36%), Positives = 138/245 (56%), Gaps = 30/245 (12%) Query: 7 WEIEKNRILKLSLEEKRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDLKEFEKI 66 WEI ++ + ++ +E +K+ ++++ +DL + + NK + SK T LKE Sbjct: 73 WEIYRSLMNQIEPDECQKLCQNNELMDL--ISQMLQEKNKDV-VYSKNIIT--LKE---- 123 Query: 67 KINTEKNKSISERVSIFKGDITKLEIDAVVNAANSRLKA-----GGGVDGAIHRAAGPFL 121 + S +++++KGDITKL +DA+VNAAN++L +D AIH AGP L Sbjct: 124 ---QGHSFLFSNKLALWKGDITKLCVDAIVNAANNQLLGCFVPHHLCIDNAIHTFAGPQL 180 Query: 122 QAECDSIGGC-----PTGDAKVTGGYNLPAKYIIHTVGP------QDGSAEKLESCYEKC 170 + +C I PTG AKVT YNLP+KY+IHTVGP ++ L S Y C Sbjct: 181 RRDCSIIMNKQGFEEPTGYAKVTRAYNLPSKYVIHTVGPIVESQLKESHCNLLRSSYINC 240 Query: 171 LSFQQEYQIKSIAFPCISTGIYGFPNRLAAHIALRTARKFLETN--TEMNRIIFCTFLPI 228 L+ + ++SIAF CISTG++GFP +A+ IA+ T +L N T + ++IF F Sbjct: 241 LNIADDLHLESIAFSCISTGLFGFPQNVASVIAIETVINWLYENPFTSIKKVIFDVFSDN 300 Query: 229 DVEIY 233 D++IY Sbjct: 301 DLQIY 305 >UniRef50_A7T7L3 Cluster: Predicted protein; n=1; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 177 Score = 132 bits (320), Expect = 6e-30 Identities = 74/170 (43%), Positives = 103/170 (60%), Gaps = 8/170 (4%) Query: 76 ISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGD 135 ++++VS++ GDIT LEIDA+VNA N+ + G+D + P I C + Sbjct: 12 LNDKVSLWTGDITALEIDAIVNAGNTIMLMFIGIDVDSY----PNKVYSGRGIFKCFFFN 67 Query: 136 AKVT-GGYNLPAKYIIHTVGPQDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGF 194 V G +IHT GP + KL+ CY+ CL +++ +K++AF CISTGIYG+ Sbjct: 68 LSVLLKGSPYFGLDVIHTAGPMGKNRIKLQDCYKNCLQLAKQHGVKTLAFCCISTGIYGY 127 Query: 195 PNRLAAHIALRTARKFLET---NTEMNRIIFCTFLPIDVEIYETLMQLYF 241 PN+ AAH+AL T R++LET N + RIIFCTFLP D EIYE L+ YF Sbjct: 128 PNKDAAHVALETVRQWLETDDNNDSVERIIFCTFLPKDTEIYERLLLCYF 177 >UniRef50_Q0B030 Cluster: Phosphatase; n=1; Syntrophomonas wolfei subsp. wolfei str. Goettingen|Rep: Phosphatase - Syntrophomonas wolfei subsp. 
wolfei (strain Goettingen) Length = 176 Score = 132 bits (319), Expect = 8e-30 Identities = 73/160 (45%), Positives = 94/160 (58%), Gaps = 8/160 (5%) Query: 80 VSIFKGDITKLEIDAV-VNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKV 138 + + +GDIT+ E AV VNAANS L+ GGGVDGAIHRAAGP L+ E ++ G A + Sbjct: 8 IQVVQGDITRQEDMAVIVNAANSSLRGGGGVDGAIHRAAGPELKKESSALAPIGPGQAVI 67 Query: 139 TGGYNLPAKYIIHTVGPQDG----SAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGF 194 TG Y LP +Y+IH VGP G E L SCY L ++ Q+ SIAFP ISTG+YG+ Sbjct: 68 TGAYRLPNRYVIHCVGPVYGVHKPEDELLASCYRNALRLAEKQQLDSIAFPAISTGVYGY 127 Query: 195 PNRLAAHIALRTARKFLETNTEMNRIIFCTFLPIDVEIYE 234 P R AA + +T +E E+ I + D YE Sbjct: 128 PMREAAQVMFKT---IIEVIPELKHIKKIRIVLFDHPAYE 164 >UniRef50_A3ZLZ3 Cluster: Putative uncharacterized protein; n=2; Planctomycetaceae|Rep: Putative uncharacterized protein - Blastopirellula marina DSM 3645 Length = 191 Score = 132 bits (319), Expect = 8e-30 Identities = 70/156 (44%), Positives = 95/156 (60%), Gaps = 7/156 (4%) Query: 77 SERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDS--IGGCPTG 134 ++R+ + GDIT +D VVNAANSRL GGGVDGAIH A GP + E GCPTG Sbjct: 7 NQRIELAIGDITDQNVDIVVNAANSRLAGGGGVDGAIHAAGGPAIMEETRRRYPDGCPTG 66 Query: 135 DAKVTGGYNLPAKYIIHTVGP--QDGSA---EKLESCYEKCLSFQQEYQIKSIAFPCIST 189 +A ++ L A+Y+IH VGP Q G A ++LE+ Y +CL + SI FP +S Sbjct: 67 EAVISSAGKLSARYVIHAVGPIWQGGGAGEEKQLEAAYTRCLELAAAHDATSIVFPALSC 126 Query: 190 GIYGFPNRLAAHIALRTARKFLETNTEMNRIIFCTF 225 G YG+P LAA IAL+TA +++ +++ I F F Sbjct: 127 GAYGYPLDLAARIALKTAIRWIPYHSQPRLIRFVLF 162 >UniRef50_A7T167 Cluster: Predicted protein; n=1; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 502 Score = 132 bits (319), Expect = 8e-30 Identities = 70/177 (39%), Positives = 100/177 (56%), Gaps = 7/177 (3%) Query: 73 NKSISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAEC-DSIGGC 131 ++ I+ +V ++ GDITKL DA+VN N L G + +HRAAGP L EC + GC Sbjct: 46 DEEINAKVVLWNGDITKLAADAIVNTTNESLSDRGALSERVHRAAGPELMQECRQQLLGC 105 Query: 132 
PTGDAKVTGGYNLPAKYIIHTVGPQDGSAEK------LESCYEKCLSFQQEYQIKSIAFP 185 TG+AK++ GYNLPA+Y+IHTVGP+ + K L SCY + +E +I +I Sbjct: 106 RTGEAKISEGYNLPARYVIHTVGPRYNTKYKTAAESALFSCYRNTMRLVRENKISTIGVC 165 Query: 186 CISTGIYGFPNRLAAHIALRTARKFLETNTEMNRIIFCTFLPIDVEIYETLMQLYFP 242 ++T G+P AHIALRT R+FLE + + +Y +M +YFP Sbjct: 166 VVNTTKRGYPPEDGAHIALRTVRRFLEKYGSAVDTVAFVVEGAEAVVYAKVMPIYFP 222 >UniRef50_Q1R0S7 Cluster: Appr-1-p processing; n=1; Chromohalobacter salexigens DSM 3043|Rep: Appr-1-p processing - Chromohalobacter salexigens (strain DSM 3043 / ATCC BAA-138 / NCIMB13768) Length = 183 Score = 132 bits (318), Expect = 1e-29 Identities = 70/165 (42%), Positives = 96/165 (58%), Gaps = 12/165 (7%) Query: 79 RVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSI------GGCP 132 RV + GDIT+L++DA+VNAAN L GGGVDGAI+RAAGP L+ C ++ G P Sbjct: 9 RVDVVSGDITRLDVDAIVNAANHSLMGGGGVDGAIYRAAGPALKRACRALRETHWPDGLP 68 Query: 133 TGDAKVTGGYNLPAKYIIHTVGP----QDGSAEKLESCYEKCLSFQQEYQIKSIAFPCIS 188 G+ +T G+ LPA+Y+IHTVGP + L +CY ++ E + IAFP IS Sbjct: 69 DGEVALTEGFELPARYVIHTVGPVYAKTRDKSHLLANCYRNAVALAAETGCRRIAFPAIS 128 Query: 189 TGIYGFPNRLAAHIALRTARKFLETNTEMNRIIFCTFLPIDVEIY 233 TG+YG+P AAHI + T L + R+ C F D + + Sbjct: 129 TGVYGYPFDDAAHIVIDTLHDALAIHD--LRVTLCFFSERDYQAF 171 >UniRef50_Q9NXN4 Cluster: Ganglioside-induced differentiation-associated protein 2; n=28; Euteleostomi|Rep: Ganglioside-induced differentiation-associated protein 2 - Homo sapiens (Human) Length = 497 Score = 132 bits (318), Expect = 1e-29 Identities = 73/221 (33%), Positives = 120/221 (54%), Gaps = 11/221 (4%) Query: 29 SDFIDLENVDPWSKYLNKSQGIDSKKSTTDDLKEFEKIKINTEKNKSISERVSIFKGDIT 88 S F+D++ + W + Q + TT ++ + + ++ NK ++ +V ++KGD+ Sbjct: 8 SQFVDVDTLPSWG---DSCQDELNSSDTTAEIFQEDTVRSPFLYNKDVNGKVVLWKGDVA 64 Query: 89 KLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKVTGGYNLPAKY 148 L A+VN +N L V +I AGP L+ + + GC TG+AK+T G+NL A++ Sbjct: 65 LLNCTAIVNTSNESLTDKNPVSESIFMLAGPDLKEDLQKLKGCRTGEAKLTKGFNLAARF 124 Query: 149 
IIHTVGP------QDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGFPNRLAAHI 202 IIHTVGP + + L SCY L +E + S+ F I++ G+P A HI Sbjct: 125 IIHTVGPKYKSRYRTAAESSLYSCYRNVLQLAKEQSMSSVGFCVINSAKRGYPLEDATHI 184 Query: 203 ALRTARKFLETNTE-MNRIIFCTFLPIDVEIYETLMQLYFP 242 ALRT R+FLE + E + +++F ++ Y+ L+ LYFP Sbjct: 185 ALRTVRRFLEIHGETIEKVVFAV-SDLEEGTYQKLLPLYFP 224 >UniRef50_Q0UG78 Cluster: Putative uncharacterized protein; n=1; Phaeosphaeria nodorum|Rep: Putative uncharacterized protein - Phaeosphaeria nodorum (Septoria nodorum) Length = 2298 Score = 130 bits (315), Expect = 2e-29 Identities = 74/180 (41%), Positives = 106/180 (58%), Gaps = 11/180 (6%) Query: 73 NKSISERVSIFKGDITKLEIDAVVNAANSRLKA--GGGVDGAIHRAAGPFLQAECDSIGG 130 N + +S D+TKL++DA+VN+AN LK G ++ AIH+AAGP L E + G Sbjct: 654 NDKYNRIISFCHHDLTKLKVDAIVNSANKSLKMTRGDTLNNAIHKAAGPGLSVEA-RLTG 712 Query: 131 CPTGDAKVTGGYNLPAKYIIHTVGP----QDGSAE--KLESCYEKCLSFQQEYQIKSIAF 184 G A +TGG+NLP++++IH + P G E +L CY + L E +IK+IAF Sbjct: 713 RLEGQALITGGHNLPSEHVIHVLRPGYFRHKGMGEFNQLIDCYREVLKVAIENKIKTIAF 772 Query: 185 PCISTGIYGFPNRLAAHIALRTARKFLETNTEMN--RIIFCTFLPIDVEIYETLMQLYFP 242 PC+ TG GFP R+AA I L+ R++L+ + E N RIIFC D + Y + +YFP Sbjct: 773 PCLGTGGVGFPARVAARITLQEMREYLDAHPEHNLERIIFCVNTAADEKAYIDFLPVYFP 832 Score = 111 bits (267), Expect = 2e-23 Identities = 61/192 (31%), Positives = 104/192 (54%), Gaps = 8/192 (4%) Query: 60 LKEFEKIKINTEKNKSISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGP 119 L E E+ + + ++++ + + DITKLE+D +VN+ + + G +D + + G Sbjct: 1060 LGELEEKPTQAKPSAVFNDKIYLVREDITKLEVDVMVNSTDVSFRGMGTLDRTVLQKGGE 1119 Query: 120 FLQAECDSIGGCPTGDAKVTGGYNLPAKYIIHTVGPQD----GSAEKLESCYEKCLSFQQ 175 ++A + G C G+ + T GY LPAK+++H + P D G+ L+ Y + L Sbjct: 1120 QMRAAVTAFGQCKIGEVRHTEGYMLPAKHVLHII-PADRYNGGTKIVLKKLYREVLQEAV 1178 Query: 176 EYQIKSIAFPCISTGIYGFPNRLAAHIALRTARKFLET---NTEMNRIIFCTFLPIDVEI 232 + SIA P I TG+ +P R A +AL A++FLE+ N + +IIF F D + Sbjct: 1179 SMRATSIALPSIGTGMLNYPRRDVASVALEEAKRFLESAERNNPVEKIIFVVFSSNDEFV 1238 Query: 233 YETLMQLYFPTL 244 Y++LM +YFP + Sbjct: 1239 
YKSLMPVYFPPI 1250 >UniRef50_UPI0000E4815A Cluster: PREDICTED: similar to LRP16 protein; n=1; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to LRP16 protein - Strongylocentrotus purpuratus Length = 415 Score = 130 bits (314), Expect = 3e-29 Identities = 73/174 (41%), Positives = 101/174 (58%), Gaps = 21/174 (12%) Query: 8 EIEKNRILKLS--LEEKRKIYK--SSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDLKEF 63 +++K R L L+ L+EK + + D +DL V W Y + G+D+ ++ Sbjct: 98 KVKKTRALYLNKTLDEKAEEARWYRQDLVDLREVLTWPDYA-EDMGLDTPQAK------- 149 Query: 64 EKIKINTEKNKSISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQA 123 K + ++ RVS+++GDITKL++D +VNAAN L GGGVDGAIHRAAG L Sbjct: 150 ---KSTSAAKSDLNNRVSVWQGDITKLDVDCIVNAANRSLLGGGGVDGAIHRAAGSNLLQ 206 Query: 124 ECDSIGGCPTGDAKVTGGYNLPAKYIIHTVG------PQDGSAEKLESCYEKCL 171 EC + GC TGDAK+T GY LP++Y++HTVG P E L SCY CL Sbjct: 207 ECKKLAGCETGDAKLTAGYLLPSRYVLHTVGPMVYGQPMTNHREDLTSCYATCL 260 Score = 80.6 bits (190), Expect = 3e-14 Identities = 34/65 (52%), Positives = 50/65 (76%), Gaps = 1/65 (1%) Query: 179 IKSIAFPCISTGIYGFPNRLAAHIALRTARKFLETN-TEMNRIIFCTFLPIDVEIYETLM 237 I+S+AFPCISTG+YG+P A+ +AL T R++LE N E++RI+FC FL D+++YE L+ Sbjct: 335 IRSVAFPCISTGVYGYPQEEASRVALGTVREWLEENPEEVDRIVFCIFLDRDLKVYERLL 394 Query: 238 QLYFP 242 +FP Sbjct: 395 PTFFP 399 >UniRef50_A5ZAB5 Cluster: Putative uncharacterized protein; n=1; Eubacterium ventriosum ATCC 27560|Rep: Putative uncharacterized protein - Eubacterium ventriosum ATCC 27560 Length = 274 Score = 130 bits (313), Expect = 4e-29 Identities = 77/194 (39%), Positives = 110/194 (56%), Gaps = 23/194 (11%) Query: 66 IKINTEKNKSISERVSIFKGDITKLEIDAVVNAANSRL-----KAGGGVDGAIHRAAGPF 120 +K N +++++SI++GD+T+L++DA+VNAANS L +D AIH AG Sbjct: 79 VKEQHGSNNPLADKISIWQGDMTRLKVDAIVNAANSALLGCFVPCHRCIDNAIHSGAGME 138 Query: 121 LQAECDSIGGC-----------PTGDAKVTGGYNLPAKYIIHTVGP------QDGSAEKL 163 L+ EC+ I PTG A +T YNLP K +IHTVGP D L Sbjct: 139 LREECNKIMNQRKIKYGTNYEEPTGTATITEAYNLPCKKVIHTVGPICYFGLNDELCNDL 198 Query: 164 
ESCYEKCLSFQQEYQIKSIAFPCISTGIYGFPNRLAAHIALRTARKFL-ETNTEMNRIIF 222 ++CYE L+ E +K++AF CISTG + FPN+ AA IA T +FL + + R+IF Sbjct: 199 KNCYESVLNCCAENGLKTVAFCCISTGEFRFPNKEAAVIAKDTVERFLMKKENNIERVIF 258 Query: 223 CTFLPIDVEIYETL 236 C + +D EIY+ L Sbjct: 259 CVYKDLDREIYDKL 272 >UniRef50_Q6AKL0 Cluster: Putative uncharacterized protein; n=1; Desulfotalea psychrophila|Rep: Putative uncharacterized protein - Desulfotalea psychrophila Length = 176 Score = 129 bits (312), Expect = 5e-29 Identities = 69/136 (50%), Positives = 89/136 (65%), Gaps = 10/136 (7%) Query: 86 DITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIG-----GCPTGDAKVTG 140 +IT+ E+D +VNAAN RL GGGVDGAIH+AAGP L C I CPTG+A++TG Sbjct: 10 NITQAEVDVIVNAANPRLLGGGGVDGAIHQAAGPTLLDACMKIAEKDGVRCPTGEARITG 69 Query: 141 GYNLPAKYIIHTVGP---QDGSAEK--LESCYEKCLSFQQEYQIKSIAFPCISTGIYGFP 195 L AKY+IHTVGP ++G+A LES Y L+ E+ +SIAFP IS GIYG+P Sbjct: 70 AGRLAAKYVIHTVGPVFKREGAAAAALLESAYTNSLALALEHGCRSIAFPAISCGIYGYP 129 Query: 196 NRLAAHIALRTARKFL 211 AA IA++ + +L Sbjct: 130 LEEAAQIAVKACQPYL 145 >UniRef50_Q5R014 Cluster: Predicted phosphatase; n=6; Bacteria|Rep: Predicted phosphatase - Idiomarina loihiensis Length = 167 Score = 128 bits (310), Expect = 9e-29 Identities = 65/155 (41%), Positives = 96/155 (61%), Gaps = 5/155 (3%) Query: 85 GDITK-LEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKVTGGYN 143 GDI + EI+A+VNAAN++L+ GGGV GAIHRAAGP L+ S+ G+A +T ++ Sbjct: 8 GDINQQTEIEAIVNAANAKLQTGGGVAGAIHRAAGPELEKATRSLAPIKPGEAVITEAFD 67 Query: 144 LPAKYIIHTVGPQDGSAEK----LESCYEKCLSFQQEYQIKSIAFPCISTGIYGFPNRLA 199 LP KY+IH +GP GS E L CY+ L ++++++SIAFP ISTG +G+P A Sbjct: 68 LPNKYVIHCLGPVYGSDEPSDKLLADCYKNALDLTEKHKVESIAFPAISTGAFGYPFEEA 127 Query: 200 AHIALRTARKFLETNTEMNRIIFCTFLPIDVEIYE 234 +A++T + +E + + I F F D Y+ Sbjct: 128 TDLAIKTVKAHVEKLSHLKMIRFVLFSDSDFAYYQ 162 >UniRef50_Q59Z77 Cluster: Putative uncharacterized protein; n=2; Candida albicans|Rep: Putative uncharacterized protein - Candida albicans (Yeast) Length = 564 Score = 128 bits (310), Expect = 9e-29 Identities 
= 83/204 (40%), Positives = 118/204 (57%), Gaps = 21/204 (10%) Query: 58 DDLKEFEKIKINTEKNKSISERVSIFKGDITKLE-IDAVVNAANSRL-----KAGGGVDG 111 +D K ++ T + VS++KGDIT L + A+VNAANS L + +D Sbjct: 71 NDNKLHTSVQSLTNNYNIANTTVSLWKGDITTLSGVTAIVNAANSALLGCFQPSHKCIDN 130 Query: 112 AIHRAAGPFLQAECDSI---GGCPTGDAKVTGGYNLPAKYIIHTVGP--QDGSA-----E 161 IH AAGP L+ C ++ PTG AK+T G+NLPAKY+I TVGP +DG+ E Sbjct: 131 VIHTAAGPELRQACYNLMQGKSEPTGSAKITPGFNLPAKYVIQTVGPIIRDGNVTEREQE 190 Query: 162 KLESCYE---KCLSFQQEYQIKSIAFPCISTGIYGFPNRLAAHIALRTARKFLETNTE-- 216 +L +CY+ K L + + KSIAF CISTG++ FP LA+ IA+ T + +LET+ + Sbjct: 191 QLANCYQSSLKALETVNDEKDKSIAFCCISTGLFAFPKELASTIAINTVQHYLETHPDST 250 Query: 217 MNRIIFCTFLPIDVEIYETLMQLY 240 + I+F F D EIYE +Q + Sbjct: 251 IKHIVFNVFSDEDKEIYEKNLQSF 274 >UniRef50_Q0CEI7 Cluster: Putative uncharacterized protein; n=1; Aspergillus terreus NIH2624|Rep: Putative uncharacterized protein - Aspergillus terreus (strain NIH 2624) Length = 524 Score = 128 bits (308), Expect = 2e-28 Identities = 71/178 (39%), Positives = 101/178 (56%), Gaps = 9/178 (5%) Query: 73 NKSISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCP 132 N+ ++ +S+ DIT LE+D +V S + GG+DGA+H AAGP L C+ +G C Sbjct: 312 NQVANDIISLAHTDITTLEVDCIVTGI-SEPRGQGGLDGAVHAAAGPRLLDACNDLGKCW 370 Query: 133 TGDAKVTGGYNLPAKYIIHTVGP--QDGSAEK---LESCYEKCLSFQQEYQIKSIAFPCI 187 + +VT YNLP K +IHTV P DGSA+ L +CY +CL E +++IAFP + Sbjct: 371 VEEVQVTDAYNLPCKKVIHTVSPPYADGSADSKWLLRACYRRCLEIAIEGGMRTIAFPAL 430 Query: 188 STGIYGFPNRLAAHIALRTARKFLETNTEM---NRIIFCTFLPIDVEIYETLMQLYFP 242 STG GF + AA AL R FL+ + ++IIFC D+E+Y +FP Sbjct: 431 STGSKGFKSYEAATAALEEVRCFLDEPGHLLRFDKIIFCNIHQQDMEVYVAFTGQFFP 488 >UniRef50_A7B8S3 Cluster: Putative uncharacterized protein; n=1; Actinomyces odontolyticus ATCC 17982|Rep: Putative uncharacterized protein - Actinomyces odontolyticus ATCC 17982 Length = 270 Score = 127 bits (307), Expect = 2e-28 Identities = 77/189 (40%), Positives = 111/189 (58%), Gaps = 24/189 (12%) Query: 75 
SISERVSIFKGDITKLEIDAVVNAANSRL---KAGGG--VDGAIHRAAGPFLQAEC---- 125 S R+++++GDIT+LE+DA+VNAANS L +A G +D AIH AAG L+ C Sbjct: 82 STHPRMALWRGDITRLEVDAIVNAANSALLGCRAPGHTCIDNAIHSAAGLELRQACAEVM 141 Query: 126 ------DSIGGCPTGDAKVTGGYNLPAKYIIHTVGP------QDGSAEKLESCYEKCLSF 173 D G PTG+A +T G++LP++++IHTVGP D E L Y++CL Sbjct: 142 AERTRGDGPSGFPTGEAVLTPGFHLPSRFVIHTVGPIVNGELTDEHREALACSYQRCLEE 201 Query: 174 QQEYQIKSIAFPCISTGIYGFPNRLAAHIALRTARKFLETNT---EMNRIIFCTFLPIDV 230 + + ++AF CISTG++GFP AA IA+ T FLE++T R+IF F D Sbjct: 202 AAAHGLNTVAFCCISTGVFGFPQEEAARIAVSTVADFLESDTRGASEVRVIFDVFGDHDE 261 Query: 231 EIYETLMQL 239 +Y L++L Sbjct: 262 ALYRALLRL 270 >UniRef50_Q6ZED8 Cluster: Slr7060 protein; n=1; Synechocystis sp. PCC 6803|Rep: Slr7060 protein - Synechocystis sp. (strain PCC 6803) Length = 588 Score = 126 bits (304), Expect = 5e-28 Identities = 64/159 (40%), Positives = 88/159 (55%), Gaps = 5/159 (3%) Query: 85 GDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKVTGGYNL 144 GDITK + +A+VN+ + L G + AIH+AAGP L C + GC G AK+T G+NL Sbjct: 425 GDITKEKAEAIVNSTDRNLSNSGALSRAIHQAAGPELLQACQDLQGCTVGGAKLTPGFNL 484 Query: 145 PAKYIIHTVGPQ-----DGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGFPNRLA 199 A ++IHTV P+ G E L SCY+ CL I+S+AFP I+ G GFP +A Sbjct: 485 RANWVIHTVAPKWKGGNQGEEELLVSCYQNCLQLAVSQSIRSLAFPAIACGAMGFPPEIA 544 Query: 200 AHIALRTARKFLETNTEMNRIIFCTFLPIDVEIYETLMQ 238 A IAL T FL +N + + F ++ Y+ Q Sbjct: 545 ARIALETVSNFLLSNMAIGSVAFICADKETLQYYQEAFQ 583 >UniRef50_A1G783 Cluster: Appr-1-p processing; n=1; Salinispora arenicola CNS205|Rep: Appr-1-p processing - Salinispora arenicola CNS205 Length = 202 Score = 126 bits (304), Expect = 5e-28 Identities = 67/149 (44%), Positives = 88/149 (59%), Gaps = 8/149 (5%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKVT 139 + + GDIT+ +DA+V AAN L GGGVDGA+HRAAGP L +IG C GDA T Sbjct: 36 IEVVLGDITQQNVDAIVTAANESLLGGGGVDGAVHRAAGPRLAQAGGAIGPCAPGDAMPT 95 Query: 140 GGYNL--PAKYIIHTVGP-----QDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIY 192 ++L P ++IIHTVGP G A L SCY + L 
+ ++AFP I+TG+Y Sbjct: 96 PAFDLDPPVRHIIHTVGPVWRGGGHGEARVLASCYRRSLRIADDLDALTVAFPTIATGVY 155 Query: 193 GFPNRLAAHIALRTARKFLETNTEMNRII 221 GFP AA IA+ T R TN + R++ Sbjct: 156 GFPADQAARIAVATIRS-TPTNVQQVRLV 183 >UniRef50_Q8EP31 Cluster: Hypothetical conserved protein; n=1; Oceanobacillus iheyensis|Rep: Hypothetical conserved protein - Oceanobacillus iheyensis Length = 185 Score = 126 bits (303), Expect = 7e-28 Identities = 72/167 (43%), Positives = 97/167 (58%), Gaps = 13/167 (7%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAEC-----DSIGG--CP 132 + I GDITK + +VNAAN L GGGVDGAIH AAGP L C + + G P Sbjct: 10 LEIVVGDITKETTNVIVNAANGSLLGGGGVDGAIHHAAGPELLKACQEMRNNELNGEELP 69 Query: 133 TGDAKVTGGYNLPAKYIIHTVGP-----QDGSAEKLESCYEKCLSFQQEYQIKSIAFPCI 187 TG+ +T G+ LP+++IIHTVGP D E L +CY L + ++ SI+FP I Sbjct: 70 TGEVIITSGFQLPSRFIIHTVGPIWNQTPDLQEELLANCYRNALELVKVKKLSSISFPSI 129 Query: 188 STGIYGFPNRLAAHIALRTARKFLETNTEMNRIIFCTFLPIDVEIYE 234 STG+YG+P AA IAL+T +FL+ N ++ + F D IY+ Sbjct: 130 STGVYGYPIHEAAAIALQTIIQFLQEN-DVGLVKVVLFSERDYSIYQ 175 >UniRef50_Q6AAQ5 Cluster: Conserved protein; n=2; Bacteria|Rep: Conserved protein - Propionibacterium acnes Length = 223 Score = 126 bits (303), Expect = 7e-28 Identities = 69/156 (44%), Positives = 92/156 (58%), Gaps = 13/156 (8%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSI------GGCPT 133 ++I + DIT L++DAVVNAAN +L GGGVDGAIHRAAGP L C + G PT Sbjct: 56 ITILRADITTLDVDAVVNAANRQLAGGGGVDGAIHRAAGPELSQACRKLRETTLTDGLPT 115 Query: 134 GDAKVTGGYNLPAKYIIHTVGP----QDGSAEKLESCYEKCLSFQQEYQIKSIAFPCIST 189 G + T +PAK++IHTVGP +++L SCY L E ++IAFP IS Sbjct: 116 GQSVATTAGKMPAKWVIHTVGPVWAKTIDKSDQLASCYRTSLHVADEIGARTIAFPTISA 175 Query: 190 GIYGFPNRLAAHIALRTARKFLETNTEMNRIIFCTF 225 G+YG+P A IA+ T R +T T+++ I F Sbjct: 176 GVYGYPMDEATRIAVETCR---QTVTKVDTIYLVAF 208 >UniRef50_A6PEZ6 Cluster: Appr-1-p processing domain protein; n=1; Shewanella sediminis HAW-EB3|Rep: Appr-1-p processing domain protein - Shewanella sediminis HAW-EB3 Length = 268 Score = 125 bits 
(302), Expect = 9e-28 Identities = 73/178 (41%), Positives = 105/178 (58%), Gaps = 19/178 (10%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKA-----GGGVDGAIHRAAGPFLQAECDSIGGCP-- 132 V +++GDIT+L DA+VNAAN L+ +D AIH A+G L+ +C I Sbjct: 91 VKLWQGDITRLAADAIVNAANKELQGCFQPLHSCIDNAIHSASGVRLRDDCAVIIKAQGQ 150 Query: 133 ---TGDAKVTGGYNLPAKYIIHTVGP-----QDGSAEKL-ESCYEKCLSF-QQEYQIKSI 182 T AK+T GYNLP +Y++HTVGP G +KL + CYE CL+ Q I SI Sbjct: 151 FEETAKAKITSGYNLPCQYVLHTVGPIVQGNVTGEHQKLLQLCYENCLALADQTLGINSI 210 Query: 183 AFPCISTGIYGFPNRLAAHIALRTARKFL--ETNTEMNRIIFCTFLPIDVEIYETLMQ 238 AF CISTG++G+P + AA A+R +++L N+ ++ +IF TF P D +Y+ +Q Sbjct: 211 AFCCISTGVFGYPQKPAAQAAVRAVQQWLLNNPNSNIDTVIFNTFKPEDTRLYQQFLQ 268 >UniRef50_Q93SX7 Cluster: UPF0189 protein; n=1; Acinetobacter sp. ED45-25|Rep: UPF0189 protein - Acinetobacter sp. (strain ED45-25) Length = 183 Score = 125 bits (302), Expect = 9e-28 Identities = 67/169 (39%), Positives = 95/169 (56%), Gaps = 9/169 (5%) Query: 78 ERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSI----GGCPT 133 ++V + + DIT + A+VN+AN L GGG+D IH+ AGP ++ EC + GGCPT Sbjct: 2 KKVHLIQADITAFAVHAIVNSANKSLLGGGGLDYVIHKKAGPLMKEECVRLNQEKGGCPT 61 Query: 134 GDAKVTGGYNLPAKYIIHTVGPQ--DG---SAEKLESCYEKCLSFQQEYQIKSIAFPCIS 188 G A+VT NLPAKY+IH VGP+ DG + L Y L E +++FPCIS Sbjct: 62 GQAEVTTAGNLPAKYLIHAVGPRWLDGEHNEPQLLCDAYSNALFKANEIHALTVSFPCIS 121 Query: 189 TGIYGFPNRLAAHIALRTARKFLETNTEMNRIIFCTFLPIDVEIYETLM 237 TG+YGFP + AA IA+ T L + + F + IY+ ++ Sbjct: 122 TGVYGFPPQKAAEIAIGTILSMLPQYDHVAEVFFICREDENYLIYKNIL 170 >UniRef50_Q87JZ5 Cluster: UPF0189 protein VPA0103; n=5; cellular organisms|Rep: UPF0189 protein VPA0103 - Vibrio parahaemolyticus Length = 170 Score = 124 bits (300), Expect = 2e-27 Identities = 65/139 (46%), Positives = 88/139 (63%), Gaps = 9/139 (6%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAEC---DSIGG--CPTG 134 +S+ +GDIT +DA+VNAAN R+ GGGVDGAIHRAAGP L C D + G CP G Sbjct: 4 ISLVQGDITTAHVDAIVNAANPRMLGGGGVDGAIHRAAGPALINACYAVDDVDGIRCPFG 63 Query: 135 
DAKVTGGYNLPAKYIIHTVGP-QDGSAEK---LESCYEKCLSFQQEYQIKSIAFPCISTG 190 DA++T NL A+Y+IH VGP D A+ LES Y++ L +S+A P IS G Sbjct: 64 DARITEAGNLNARYVIHAVGPIYDKFADPKTVLESAYQRSLDLALANHCQSVALPAISCG 123 Query: 191 IYGFPNRLAAHIALRTARK 209 +YG+P + AA +A+ ++ Sbjct: 124 VYGYPPQEAAEVAMAVCQR 142 >UniRef50_Q9HJ67 Cluster: UPF0189 protein Ta1105; n=2; Thermoplasma acidophilum|Rep: UPF0189 protein Ta1105 - Thermoplasma acidophilum Length = 196 Score = 124 bits (300), Expect = 2e-27 Identities = 68/139 (48%), Positives = 84/139 (60%), Gaps = 11/139 (7%) Query: 85 GDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSI------GGCPTGDAKV 138 GDIT+ + +A+VNAANS L GGGVDGAIH AAGP L E I G P G+A + Sbjct: 16 GDITESDAEAIVNAANSSLMGGGGVDGAIHSAAGPELNGELVKIRRERYPNGLPPGEAVI 75 Query: 139 TGGYNLPAKYIIHTVGP-----QDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYG 193 T GY L A +IIHTVGP ++G + L Y CL +E+ I IAFP +STG YG Sbjct: 76 TRGYRLKASHIIHTVGPVWMGGRNGEDDVLYRSYRSCLDLAREFGIHDIAFPALSTGAYG 135 Query: 194 FPNRLAAHIALRTARKFLE 212 FP A IA+R+ FL+ Sbjct: 136 FPFDRAERIAIRSVIDFLK 154 >UniRef50_Q47EQ7 Cluster: Appr-1-p processing; n=1; Dechloromonas aromatica RCB|Rep: Appr-1-p processing - Dechloromonas aromatica (strain RCB) Length = 186 Score = 124 bits (298), Expect = 3e-27 Identities = 68/166 (40%), Positives = 90/166 (54%), Gaps = 11/166 (6%) Query: 79 RVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSI------GGCP 132 RV ++ GD+T +DA+VNAAN L GGGVDGAIHR GP + C + G P Sbjct: 13 RVRLYVGDLTDQAVDAIVNAANRTLLGGGGVDGAIHRRGGPAILDACRELRRSQWPDGLP 72 Query: 133 TGDAKVTGGYNLPAKYIIHTVGPQDG-----SAEKLESCYEKCLSFQQEYQIKSIAFPCI 187 TG +T G LPA Y+IHTVGP G AE L +CY + ++KS+AFP I Sbjct: 73 TGQVALTNGGKLPAPYVIHTVGPIYGQHRGKEAELLAACYRNAIELAAHLELKSLAFPSI 132 Query: 188 STGIYGFPNRLAAHIALRTARKFLETNTEMNRIIFCTFLPIDVEIY 233 STG +G+P AA I R+ K L+ ++ I F +E + Sbjct: 133 STGAFGYPPDKAALIVSRSMHKVLDEIAAIDEIRLVFFNASQMETF 178 >UniRef50_A7BY23 Cluster: Putative uncharacterized protein; n=1; Beggiatoa sp. PS|Rep: Putative uncharacterized protein - Beggiatoa sp. 
PS Length = 708 Score = 124 bits (298), Expect = 3e-27 Identities = 61/153 (39%), Positives = 90/153 (58%), Gaps = 6/153 (3%) Query: 79 RVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKV 138 ++ I +G+IT+ ++DA+VN + L G +D AI A G L+ C +G C +AK+ Sbjct: 532 KIHIIQGNITQQKVDAIVNTTDRSLSGSGAIDYAIQNAGGIELKEACRQLGTCSVAEAKI 591 Query: 139 TGGYNLPAKYIIHTVGPQ-DG----SAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYG 193 T GYNLPA+++IHTVGP +G AEKL CY CL+ ++ K IAFP I G G Sbjct: 592 TEGYNLPAQFVIHTVGPNWEGGNQKEAEKLAQCYRNCLALAEQQGFKIIAFPTIGVGGLG 651 Query: 194 FPNRLAAHIALRTARKFL-ETNTEMNRIIFCTF 225 F + LAA +A+ FL + N+ + ++I F Sbjct: 652 FSHELAAKVAIYEISSFLQQKNSSLEKVILVCF 684 >UniRef50_Q9WYX8 Cluster: UPF0189 protein TM_0508; n=4; Thermotogaceae|Rep: UPF0189 protein TM_0508 - Thermotoga maritima Length = 599 Score = 122 bits (295), Expect = 6e-27 Identities = 73/168 (43%), Positives = 93/168 (55%), Gaps = 11/168 (6%) Query: 78 ERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSI----GGCPT 133 +++ I KGDIT+ E+DA+VNAAN LK GGGV GAI RA G +Q E D I G PT Sbjct: 427 KKIRIVKGDITREEVDAIVNAANEYLKHGGGVAGAIVRAGGSVIQEESDRIVQERGRVPT 486 Query: 134 GDAKVTGGYNLPAKYIIHTVGP-----QDGSAEKLESCYEKCLSFQQEYQIKSIAFPCIS 188 G+A VT L AKY+IHTVGP G E L L E ++KSI+ P IS Sbjct: 487 GEAVVTSAGKLKAKYVIHTVGPVWRGGSHGEDELLYKAVYNALLRAHELKLKSISMPAIS 546 Query: 189 TGIYGFPNRLAAHIALRTARKFLE--TNTEMNRIIFCTFLPIDVEIYE 234 TGI+GFP A I + R F++ +T + I C +I+E Sbjct: 547 TGIFGFPKERAVGIFSKAIRDFIDQHPDTTLEEIRICNIDEETTKIFE 594 >UniRef50_Q97AU0 Cluster: UPF0189 protein TV0719; n=1; Thermoplasma volcanium|Rep: UPF0189 protein TV0719 - Thermoplasma volcanium Length = 186 Score = 122 bits (294), Expect = 8e-27 Identities = 67/142 (47%), Positives = 84/142 (59%), Gaps = 10/142 (7%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSI------GGCPT 133 + I +GDIT + +A+VNAAN L GGGVDGAIH G + EC + G P Sbjct: 11 IEIIEGDITDVNCEAIVNAANPSLMGGGGVDGAIHLKGGKTIDLECAELRRTKWPKGLPP 70 Query: 134 GDAKVTGGYNLPAKYIIHTVGP----QDGSAEKLESCYEKCLSFQQEYQIKSIAFPCIST 189 G+A +T G L 
AKY+IHTVGP Q+ AE L S Y + L + + IK IAFP IST Sbjct: 71 GEADITSGGKLKAKYVIHTVGPIYRGQEEDAETLYSSYYRSLEIAKIHGIKCIAFPAIST 130 Query: 190 GIYGFPNRLAAHIALRTARKFL 211 GIYG+P A+ IAL+ FL Sbjct: 131 GIYGYPFEEASVIALKAVTDFL 152 >UniRef50_A6F1P7 Cluster: Appr-1-p processing; n=1; Marinobacter algicola DG893|Rep: Appr-1-p processing - Marinobacter algicola DG893 Length = 183 Score = 121 bits (292), Expect = 1e-26 Identities = 65/167 (38%), Positives = 93/167 (55%), Gaps = 5/167 (2%) Query: 80 VSIFKGDITKLE-IDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKV 138 V +GDIT+ + ++AVVNAAN++L +GGGV GA+H AAGP L EC + G+A + Sbjct: 11 VECVRGDITRQDDLEAVVNAANAQLMSGGGVAGALHAAAGPGLAEECRPMAPIRLGEAVI 70 Query: 139 TGGYNLPAKYIIHTVGPQDGSAEK----LESCYEKCLSFQQEYQIKSIAFPCISTGIYGF 194 +G +NLP +YI+H +GP G E L CY L I+SIAFP IS G +G+ Sbjct: 71 SGAHNLPNQYIVHCLGPVYGVDEPSNHWLAECYRNALELADSKTIESIAFPAISAGAFGY 130 Query: 195 PNRLAAHIALRTARKFLETNTEMNRIIFCTFLPIDVEIYETLMQLYF 241 P AA +A+ T + L + + F F D ++ M+ F Sbjct: 131 PVEGAAEVAMATVSQVLPRLGSVRYVRFVLFSDADEAVFSRAMESAF 177 >UniRef50_P67344 Cluster: UPF0189 protein SA0314; n=13; Staphylococcus|Rep: UPF0189 protein SA0314 - Staphylococcus aureus (strain N315) Length = 266 Score = 121 bits (292), Expect = 1e-26 Identities = 68/174 (39%), Positives = 99/174 (56%), Gaps = 17/174 (9%) Query: 78 ERVSIFKGDITKLEIDAVVNAANSR----LKAGGG-VDGAIHRAAGPFLQAECDSI---- 128 + + +++GDIT L+IDA+VNAANSR ++A +D IH AG ++ +C I Sbjct: 85 DNIFVWQGDITTLKIDAIVNAANSRFLGCMQANHDCIDNIIHTKAGVQVRLDCAEIIRQQ 144 Query: 129 -GGCPTGDAKVTGGYNLPAKYIIHTVGPQ-------DGSAEKLESCYEKCLSFQQEYQIK 180 G AK T GYNLPAKYIIHTVGPQ + + L CY CL ++ + Sbjct: 145 GRNEGVGKAKKTRGYNLPAKYIIHTVGPQIRRLPVSKMNQDLLAKCYLSCLKLADQHSLN 204 Query: 181 SIAFPCISTGIYGFPNRLAAHIALRTARKFLETNTEMNRIIFCTFLPIDVEIYE 234 +AF CISTG++ FP AA IA+RT +L+ +++F F D+++Y+ Sbjct: 205 HVAFCCISTGVFAFPQDEAAEIAVRTVESYLKETNSTLKVVFNVFTDKDLQLYK 258 >UniRef50_Q03IQ8 Cluster: Predicted phosphatase homologous to the C-terminal domain of histone macroH2A1; n=3; Streptococcus 
thermophilus|Rep: Predicted phosphatase homologous to the C-terminal domain of histone macroH2A1 - Streptococcus thermophilus (strain ATCC BAA-491 / LMD-9) Length = 260 Score = 121 bits (291), Expect = 2e-26 Identities = 71/188 (37%), Positives = 112/188 (59%), Gaps = 16/188 (8%) Query: 66 IKINTEKNKSISERVSIFKGDITKLEIDAVVNAANSRLKA-----GGGVDGAIHRAAGPF 120 +++N+ ++ +R+ ++KGDIT+LEIDA+VNAAN L VD AIH AG Sbjct: 71 VQLNSLQSIPQDKRIYLWKGDITRLEIDAIVNAANKTLLGCMKPLHNCVDNAIHTYAGVQ 130 Query: 121 LQAECDSI---GGC--PTGDAKVTGGYNLPAKYIIHTVGPQDGSA------EKLESCYEK 169 L+ C + G P G AK+T YNLP+ ++IHTVGP+ G+ + L Y Sbjct: 131 LRQACFELILEQGYEEPVGMAKITPAYNLPSAFVIHTVGPKIGNQVTPIDEDLLIKSYLS 190 Query: 170 CLSFQQEYQIKSIAFPCISTGIYGFPNRLAAHIALRTARKFLETNTEMNRIIFCTFLPID 229 L+ ++ +I+SIA PCISTG + FP + AA IA++T + F++ + + ++IF F + Sbjct: 191 VLALAEKNKIESIAIPCISTGDFNFPKQKAAEIAIKTVKSFIDHSEIVKKVIFNVFDDEN 250 Query: 230 VEIYETLM 237 + IY+ L+ Sbjct: 251 LNIYQKLL 258 >UniRef50_Q2TX23 Cluster: Predicted phosphatase homologous to the C-terminal domain of histone macroH2A1; n=4; Trichocomaceae|Rep: Predicted phosphatase homologous to the C-terminal domain of histone macroH2A1 - Aspergillus oryzae Length = 615 Score = 121 bits (291), Expect = 2e-26 Identities = 84/227 (37%), Positives = 119/227 (52%), Gaps = 26/227 (11%) Query: 34 LENVDPWSKYLNKSQGIDSKKSTTDDLKEFEKIKINTEKNKSISERVSIFKGDITKL-EI 92 L+++D Y N + S S L EK+ S + +S++KGDIT L ++ Sbjct: 68 LDDIDTVITYRNNKTMLTSSTSIAPSLVLKPNNLKTVEKSSSKAINISLWKGDITSLTDV 127 Query: 93 DAVVNAANSRLKA-----GGGVDGAIHRAAGPFLQAECDSI--GGC---PTGDAKVTGGY 142 A+VNAANS+L +D IH AAGP L+ C+S+ C G KVT G+ Sbjct: 128 TAIVNAANSQLLGCFRPDHRCIDNIIHSAAGPRLRDACNSLMLKQCHPESVGSVKVTSGF 187 Query: 143 NLPAKYIIHTVGPQDGS--------AEKLESCYEKCLSFQQEYQI-----KSIAFPCIST 189 NLPA++++HTVGPQ S ++L SCY CL + K +AF CIST Sbjct: 188 NLPAQWVLHTVGPQVNSRKSPGTLQQQQLASCYSSCLDATESLPALPDGRKVVAFCCIST 247 Query: 190 GIYGFPNRLAAHIALRTARKFLETN--TEMNRIIFCTFLPIDVEIYE 234 G++ FP +AA IAL T ++ + T + IIF TFL D E+Y+ Sbjct: 248 
GLFAFPPDMAAKIALETVVQWCMNHPATSVTDIIFDTFLERDYELYQ 294 >UniRef50_Q18A61 Cluster: Putative uncharacterized protein; n=2; Clostridium difficile|Rep: Putative uncharacterized protein - Clostridium difficile (strain 630) Length = 284 Score = 120 bits (290), Expect = 3e-26 Identities = 75/194 (38%), Positives = 113/194 (58%), Gaps = 19/194 (9%) Query: 64 EKIKINTEKNKSISERVSIFKGDITKLEIDAVVNAANSRLKAGGG-----VDGAIHRAAG 118 E+ ++ + I E ++I++G+IT L DA+VNAAN++L VD IH AG Sbjct: 91 ERELVDVNDIEEIEEGIAIWRGNITNLRADAIVNAANNKLLGCLQPLHLCVDNEIHSCAG 150 Query: 119 PFLQAECDSI----GGCP-TGDAKVTGGYNLPAKYIIHTVGP--QDGSAEK-----LESC 166 P L+ +CD I G TGDAK+T GY LPAK+++HTVGP G K L C Sbjct: 151 PRLREDCDKIIKKQGHLEYTGDAKITRGYCLPAKFVVHTVGPIVSGGQPSKEQEKQLLHC 210 Query: 167 YEKCLSFQQEY-QIKSIAFPCISTGIYGFPNRLAAHIALRTARKFLETNTEMN-RIIFCT 224 Y+ CL+ +E +IK+I F ISTG++G+P + AA++A+ R +L+ N E N +++F Sbjct: 211 YKSCLNTIKEIDEIKNIVFCGISTGVFGYPKKEAANLAVSRVRLWLKENPEKNLKVVFNV 270 Query: 225 FLPIDVEIYETLMQ 238 F + E Y + + Sbjct: 271 FTEEEEEKYRRIFK 284 >UniRef50_Q30ZH6 Cluster: Appr-1-p processing; n=1; Desulfovibrio desulfuricans G20|Rep: Appr-1-p processing - Desulfovibrio desulfuricans (strain G20) Length = 183 Score = 120 bits (288), Expect = 4e-26 Identities = 66/152 (43%), Positives = 85/152 (55%), Gaps = 9/152 (5%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECD----SIGGCPTGD 135 + I +GD+T + DAVVNAANSRL GGGVDGA+H AAGP L A+C G P G Sbjct: 10 LEILQGDLTLFKADAVVNAANSRLAGGGGVDGALHAAAGPALLADCSRWVARHGLLPAGK 69 Query: 136 AKVTGGYNLPAKYIIHTVGP-----QDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTG 190 A VT + LPA+++IHTVGP ++ L YE C + + +AFP IS G Sbjct: 70 AMVTPAHRLPARHVIHTVGPVWRGGKNNEETTLRQAYESCFTLCRSNGFAHVAFPAISCG 129 Query: 191 IYGFPNRLAAHIALRTARKFLETNTEMNRIIF 222 YG+P AA +AL A + L +I F Sbjct: 130 TYGYPASPAARVALACAAQALACQGAPAKITF 161 >UniRef50_Q9ZBG3 Cluster: UPF0189 protein SCO6450; n=4; Actinomycetales|Rep: UPF0189 protein SCO6450 - Streptomyces coelicolor Length = 169 Score = 119 bits (287), Expect = 6e-26 Identities = 66/153 
(43%), Positives = 90/153 (58%), Gaps = 10/153 (6%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSI------GGCPT 133 +++ +GDIT+ DA+VNAANS L GGGVDGAIHR GP + AEC + G PT Sbjct: 4 ITLVQGDITRQSADAIVNAANSSLLGGGGVDGAIHRRGGPAILAECRRLRAGHLGKGLPT 63 Query: 134 GDAKVTGGYNLPAKYIIHTVGPQDGSAEK----LESCYEKCLSFQQEYQIKSIAFPCIST 189 G A T +L A+++IHTVGP + E L SCY + L E +++AFP IST Sbjct: 64 GRAVATTAGDLDARWVIHTVGPVWSATEDRSGLLASCYRESLRTADELGARTVAFPAIST 123 Query: 190 GIYGFPNRLAAHIALRTARKFLETNTEMNRIIF 222 G+Y +P AA IA+ T + TE+ ++F Sbjct: 124 GVYRWPMDDAARIAVETVATTKTSVTEIRFVLF 156 >UniRef50_A0J8J0 Cluster: Appr-1-p processing; n=1; Shewanella woodyi ATCC 51908|Rep: Appr-1-p processing - Shewanella woodyi ATCC 51908 Length = 296 Score = 118 bits (285), Expect = 1e-25 Identities = 70/176 (39%), Positives = 107/176 (60%), Gaps = 19/176 (10%) Query: 77 SERVSIFKGDITKLEIDAVVNAANSRLKA-----GGGVDGAIHRAAGPFLQAECDSI--- 128 + ++SI+ GDIT+L+IDAV NAAN+++ +D AI+ AAGP L+ +C+ + Sbjct: 109 ASKISIWNGDITRLKIDAVTNAANAQMLGCFQPFHSCIDNAINCAAGPQLREDCNQLMQL 168 Query: 129 --GGCPTGDAKVTGGYNLPAKYIIHTVGP--QDGSA------EKLESCYEKCLSFQQEYQ 178 TG AK+T YNLP+K+++HTVGP Q G+ ++L SCY+ CLS E Sbjct: 169 QGSDETTGSAKITRAYNLPSKFVLHTVGPIIQHGAVPSPRQIDELASCYDACLSLAAEAG 228 Query: 179 IKSIAFPCISTGIYGFPNRLAAHIALR-TARKFLETNTEMNRIIFCTFLPIDVEIY 233 +S+A ISTG++G+P AA++AL+ A FL +++ ++F TF EIY Sbjct: 229 AQSVAVCGISTGVFGYPAEKAANVALQAVANWFLVNPDKLDHLVFNTFGDNATEIY 284 >UniRef50_A6PBP5 Cluster: Appr-1-p processing domain protein; n=1; Shewanella sediminis HAW-EB3|Rep: Appr-1-p processing domain protein - Shewanella sediminis HAW-EB3 Length = 293 Score = 117 bits (281), Expect = 3e-25 Identities = 73/177 (41%), Positives = 103/177 (58%), Gaps = 20/177 (11%) Query: 81 SIFKGDITKLEIDAVVNAAN-----SRLKAGGGVDGAIHRAAGPFLQAECDSI----GGC 131 SI+ GDIT+L++DA++NAAN R +D IH AAG L+ +C +I GG Sbjct: 113 SIWVGDITQLKVDAIINAANVYLLGCRQPNHRCIDNVIHSAAGSRLRDDCATIIEQQGGL 172 Query: 132 -PTGDAKVTGGYNLPAKYIIHTVGP-------QDGSAEK-LESCYEKCLSFQQEY-QIKS 181 PTG AK+T GY 
LPAKY+IHTVGP D EK L+S Y+ CL+ E +K+ Sbjct: 173 EPTGSAKITRGYALPAKYVIHTVGPCLHSGYLPDEEDEKQLKSAYQSCLTLASEINDLKT 232 Query: 182 IAFPCISTGIYGFPNRLAAHIALRTARKFLETNTE-MNRIIFCTFLPIDVEIYETLM 237 +AF ISTG++ +P AA +AL T +L + + +++F + D IYE L+ Sbjct: 233 LAFCAISTGVFSYPKIDAASVALETVSDWLSEHPQHFEKVVFNLYTQADAAIYERLI 289 >UniRef50_A6GJ81 Cluster: Putative uncharacterized protein; n=1; Plesiocystis pacifica SIR-1|Rep: Putative uncharacterized protein - Plesiocystis pacifica SIR-1 Length = 173 Score = 115 bits (277), Expect = 9e-25 Identities = 60/136 (44%), Positives = 82/136 (60%), Gaps = 9/136 (6%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAEC---DSIGG--CPTG 134 +++ +GDIT++ DA+VNAAN ++ GGGVDGAIHRAAGP L A C + G CP G Sbjct: 5 ITLERGDITRVSCDAIVNAANPKMLGGGGVDGAIHRAAGPELLAACRRVPKVNGIRCPFG 64 Query: 135 DAKVTGGYNLPAKYIIHTVGPQDGSAEK----LESCYEKCLSFQQEYQIKSIAFPCISTG 190 +A++T + L A+++IH VGP +E L Y L + + +A P +STG Sbjct: 65 EARITPAFGLDARWVIHAVGPIYARSEDPKGVLARAYASALELAAAHDVTELACPALSTG 124 Query: 191 IYGFPNRLAAHIALRT 206 YGFP AA IAL T Sbjct: 125 AYGFPLDPAARIALET 140 >UniRef50_Q93RG0 Cluster: UPF0189 protein in tap1-dppD intergenic region; n=5; Bacteria|Rep: UPF0189 protein in tap1-dppD intergenic region - Treponema medium Length = 261 Score = 115 bits (277), Expect = 9e-25 Identities = 68/172 (39%), Positives = 92/172 (53%), Gaps = 16/172 (9%) Query: 82 IFKGDITKLEIDAVVNAANSRLKA-----GGGVDGAIHRAAGPFLQAECDSI-----GGC 131 +++GDIT L++DA+VNAANS + +D IH AG L+ C I Sbjct: 89 VWRGDITTLKVDAIVNAANSGMTGCWQPCHACIDNCIHTFAGVQLRTVCAGIMQEQGHEE 148 Query: 132 PTGDAKVTGGYNLPAKYIIHTVGP------QDGSAEKLESCYEKCLSFQQEYQIKSIAFP 185 PTG AK+T +NLP KY++HTVGP D L + Y CL+ E +KSIAF Sbjct: 149 PTGTAKITPAFNLPCKYVLHTVGPIISGQLTDRDCTLLANSYTSCLNLAAENGVKSIAFC 208 Query: 186 CISTGIYGFPNRLAAHIALRTARKFLETNTEMNRIIFCTFLPIDVEIYETLM 237 CISTG++ FP + AA IA+ T + N +I+F F D +Y LM Sbjct: 209 CISTGVFRFPAQKAAEIAVATVEDWKAKNNSAMKIVFNVFSEKDEALYNKLM 260 >UniRef50_A2DE53 Cluster: Appr-1-p processing enzyme family protein; n=1; Trichomonas 
vaginalis G3|Rep: Appr-1-p processing enzyme family protein - Trichomonas vaginalis G3 Length = 270 Score = 115 bits (276), Expect = 1e-24 Identities = 73/219 (33%), Positives = 108/219 (49%), Gaps = 13/219 (5%) Query: 28 SSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDLKEFEKIKINTEKNKSISERVSIFK-GD 86 S+ +DL +V WS D+ +++ ++ N I+ +SI+K GD Sbjct: 2 SNSIVDLASVPKWS---------DAGPQWMEEMPLPRRLHANIRPCPEINNLISIWKCGD 52 Query: 87 ITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKVTGGYNLPA 146 T+L+ DAV+N ++ +GG + +I+ AAGP L C IG C + VT G++LPA Sbjct: 53 STRLKCDAVINRTDNNFSSGGALFTSINNAAGPQLAQACRQIGHCDDCNTVVTPGFSLPA 112 Query: 147 KYIIHTVGPQDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGFPNRLAAHIALRT 206 KY+IHTVGP +LES + S I+SI GF A IA Sbjct: 113 KYVIHTVGPTGDDDPELESTMDSVFSHIDGESIRSIGMAPFFIENNGFSLGHATQIAFSK 172 Query: 207 ARKFL---ETNTEMNRIIFCTFLPIDVEIYETLMQLYFP 242 RKFL E +++RI+F P + I+ L+ LYFP Sbjct: 173 TRKFLENPENRQKVDRIVFIVTQPHSIPIFVRLLYLYFP 211 >UniRef50_UPI0000519D2E Cluster: PREDICTED: similar to CG18812-PC, isoform C, partial; n=2; Apocrita|Rep: PREDICTED: similar to CG18812-PC, isoform C, partial - Apis mellifera Length = 353 Score = 114 bits (275), Expect = 2e-24 Identities = 64/175 (36%), Positives = 97/175 (55%), Gaps = 7/175 (4%) Query: 75 SISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAEC-DSIGGCPT 133 +++ +++++ GDI+ L++DAVVN+ N + + I AG L+ E + I C T Sbjct: 54 TLNNKLALWTGDISILQVDAVVNSTNETMDDNSPMCQRIFVRAGSALKMEIFNEIKECKT 113 Query: 134 GDAKVTGGYNLPAKYIIHTVGP------QDGSAEKLESCYEKCLSFQQEYQIKSIAFPCI 187 G+ +VT + LPA++IIHTVGP Q + L CY L +E +++IA P I Sbjct: 114 GEVRVTQAHGLPARFIIHTVGPVYNVKYQTAAQNTLHCCYRNVLQKARELGLRTIALPVI 173 Query: 188 STGIYGFPNRLAAHIALRTARKFLETNTEMNRIIFCTFLPIDVEIYETLMQLYFP 242 ++ +P AHIALRT R+FLE + I P D+ IYE L+ LYFP Sbjct: 174 NSVRRNYPPDAGAHIALRTMRRFLEQYGDSVTCIVLVLEPCDLGIYEVLLPLYFP 228 >UniRef50_A0UYE8 Cluster: Appr-1-p processing; n=3; Bacteria|Rep: Appr-1-p processing - Clostridium cellulolyticum H10 Length = 341 Score = 114 bits (274), Expect = 2e-24 Identities = 61/149 (40%), 
Positives = 90/149 (60%), Gaps = 8/149 (5%) Query: 82 IFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPF-LQAECDSIGGCPTGDAKVTG 140 I + DITKL++DA+VNAAN+ L+ GGGV GAI +AAG LQA CD + TG+ +T Sbjct: 5 IVRQDITKLKVDAIVNAANTDLRMGGGVCGAIFKAAGAAQLQAVCDKLAPIKTGEVVITP 64 Query: 141 GYNLPAKYIIHTVGP------QDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGF 194 G+NL AK++IH GP ++ + L + Y L E + +SIAFP IS+GIYG+ Sbjct: 65 GFNLSAKFVIHAAGPVYRHWNREQGEQYLRAAYTNSLKCAVENKCESIAFPLISSGIYGY 124 Query: 195 PNRLAAHIALRTARKFL-ETNTEMNRIIF 222 P A +A F+ + + ++ ++F Sbjct: 125 PKDEALRVATSEIHNFITDHDIDVTLVVF 153 >UniRef50_A0X2G8 Cluster: Appr-1-p processing domain protein; n=1; Shewanella pealeana ATCC 700345|Rep: Appr-1-p processing domain protein - Shewanella pealeana ATCC 700345 Length = 304 Score = 112 bits (269), Expect = 9e-24 Identities = 70/180 (38%), Positives = 106/180 (58%), Gaps = 18/180 (10%) Query: 79 RVSIFKGDITKLEIDAVVNAANSRLKA-----GGGVDGAIHRAAGPFLQAECDSI----G 129 ++ ++KGDIT L +DA+VNAAN+++ +D AIH AG L+A+C+ I G Sbjct: 121 KIILWKGDITTLAVDAIVNAANNQMLGCFQPQHKCIDNAIHNRAGAQLRADCEVIMELQG 180 Query: 130 GCP-TGDAKVTGGYNLPAKYIIHTVGP------QDGSAEKLESCYEKCLSF-QQEYQIKS 181 TG AK+T YNLP+K++IHTVGP Q A +L S Y L+ +Q +I+S Sbjct: 181 NIEETGIAKITRAYNLPSKFVIHTVGPIVQNMIQPIHAGQLASSYRSILTLAKQTERIRS 240 Query: 182 IAFPCISTGIYGFPNRLAAHIALRTARKFL-ETNTEMNRIIFCTFLPIDVEIYETLMQLY 240 +AF ISTGI+G+P A +AL T ++L E + + I+F F D +Y++ ++ Y Sbjct: 241 LAFCSISTGIFGYPIEQATRVALDTVTQWLMENPDQFDTIVFNVFSEYDHHVYQSALEDY 300 >UniRef50_Q7JUR6 Cluster: GH03014p; n=11; Endopterygota|Rep: GH03014p - Drosophila melanogaster (Fruit fly) Length = 540 Score = 111 bits (267), Expect = 2e-23 Identities = 65/176 (36%), Positives = 93/176 (52%), Gaps = 9/176 (5%) Query: 74 KSISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDS-IGGCP 132 K ++ R I+ GD+T LE+DA+ N ++ L + I AG L+ E + + C Sbjct: 63 KDVNNRFVIWDGDMTTLEVDAITNTSDETLTESNSISERIFAVAGNQLREELSTTVKECR 122 Query: 133 TGDAKVTGGYNLPAKYIIHTVGP------QDGSAEKLESCYEKCLSFQQEYQIKSIAFPC 186 TGD ++T GYNLPAKY++HTV P + + L CY L +E + +IA 
Sbjct: 123 TGDVRITRGYNLPAKYVLHTVAPAYREKFKTAAENTLHCCYRNVLCKAKELNLHTIALCN 182 Query: 187 ISTGIYGFPNRLAAHIALRTARKFLETNTEMNRIIFCTFLPIDVEIYETLMQLYFP 242 IS FP +AAHIALRT R++L+ T + +I C + YE L LYFP Sbjct: 183 ISAHQKSFPADVAAHIALRTIRRYLDKCT-LQVVILCVG-SSERGTYEVLAPLYFP 236 >UniRef50_Q22CT8 Cluster: Appr-1-p processing enzyme family protein; n=1; Tetrahymena thermophila SB210|Rep: Appr-1-p processing enzyme family protein - Tetrahymena thermophila SB210 Length = 535 Score = 109 bits (261), Expect = 8e-23 Identities = 67/174 (38%), Positives = 94/174 (54%), Gaps = 11/174 (6%) Query: 79 RVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSI----GGCPTG 134 ++SI K D+T +DA+VNAAN+ L GGGV GAI R G +Q + I G Sbjct: 46 QISIVKNDLTMENVDAIVNAANNFLAHGGGVAGAICRKGGRIIQNQSYDIIKIRNRIENG 105 Query: 135 DAKVTGGYNLPAKYIIHTVGP--QDGSA---EKLESCYEKCLSFQQEYQIKSIAFPCIST 189 ++ T LP K +IHTVGP +DG + E+L C E L + Y++KSI+ P IS+ Sbjct: 106 ESVTTEAGQLPCKKVIHTVGPIWEDGDSNEKEELAKCMETILREAKFYKLKSISIPAISS 165 Query: 190 GIYGFPNRLAAHIALRTARKFL--ETNTEMNRIIFCTFLPIDVEIYETLMQLYF 241 GI+GFP L A I L +K L + + + I FC F V+++ Q F Sbjct: 166 GIFGFPKYLCAKILLEETQKLLKYDYSNQFEEIRFCNFDNETVQVFAEEFQKQF 219 >UniRef50_A1HMQ5 Cluster: Appr-1-p processing domain protein; n=4; Clostridiales|Rep: Appr-1-p processing domain protein - Thermosinus carboxydivorans Nor1 Length = 264 Score = 107 bits (257), Expect = 3e-22 Identities = 61/158 (38%), Positives = 88/158 (55%), Gaps = 8/158 (5%) Query: 74 KSISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSI----G 129 K + R+ I +GDIT+ DA+VN ANSRL GGG AI G + + + I G Sbjct: 82 KKDARRIIIKQGDITEETTDAIVNPANSRLVHGGGAARAIAVKGGEEIVRQSNEIIRKIG 141 Query: 130 GCPTGDAKVTGGYNLPAKYIIHTVGPQDGSAE---KLESCYEKCLSFQQEYQIKSIAFPC 186 PT A +TG LP K++IH VGPQ G + KL+ L+ + Y +++IA P Sbjct: 142 HLPTTKAVITGAGKLPCKFVIHVVGPQMGEGDEDSKLKRAVWNVLTLAENYNLQTIAMPA 201 Query: 187 ISTGIYGFPNRLAAHIALRTARKFLET-NTEMNRIIFC 223 IS+GI+GFP A + L TA +FL++ + +I+ C Sbjct: 202 ISSGIFGFPKPRCAEVLLSTAARFLDSCAVSLQQIVMC 239 >UniRef50_A1D5K4 Cluster: Appr-1-p 
processing enzyme family protein; n=1; Neosartorya fischeri NRRL 181|Rep: Appr-1-p processing enzyme family protein - Neosartorya fischeri (strain ATCC 1020 / DSM 3700 / NRRL 181)(Aspergillus fischerianus (strain ATCC 1020 / DSM 3700 / NRRL 181)) Length = 257 Score = 107 bits (257), Expect = 3e-22 Identities = 65/166 (39%), Positives = 87/166 (52%), Gaps = 12/166 (7%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECD---SIGGCPTGDA 136 VS + DI +L++D +VNAA L+ GGGVD A+H AAGP L C C G Sbjct: 92 VSFIEHDIARLQVDCIVNAAKESLQGGGGVDRAMHLAAGPKLNQACIKKLQDRQCSPGRV 151 Query: 137 KVTGGYNLPAKYIIHTVGP------QDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTG 190 +T G++L K +IHTVGP Q A+ L CY L+ ++SI FP IS G Sbjct: 152 FMTPGFHLRCKSVIHTVGPDCRQKQQIDYAQVLRQCYRNSLNKAVSKGLRSIVFPAISVG 211 Query: 191 IYGFPNRLAAHIALRTARKFLETN---TEMNRIIFCTFLPIDVEIY 233 +Y P + IAL T R FL+ + + ++RI FC P IY Sbjct: 212 VYACPAEATSEIALNTVRGFLDEHGRPSSLDRIGFCNLGPNIHAIY 257 >UniRef50_A3LYE6 Cluster: Putative uncharacterized protein; n=1; Pichia stipitis|Rep: Putative uncharacterized protein - Pichia stipitis (Yeast) Length = 583 Score = 105 bits (253), Expect = 8e-22 Identities = 68/184 (36%), Positives = 107/184 (58%), Gaps = 26/184 (14%) Query: 76 ISERVSIFKGDITKL-EIDAVVNAANSRL-----KAGGGVDGAIHRAAGPFLQAECDSIG 129 +S ++SI+KGDIT + ++ A+VNAANS L + +D IH AAGP L+ C ++ Sbjct: 91 LSPKLSIWKGDITTISDVTAIVNAANSALLGCFQPSHRCIDNIIHAAAGPDLRRACYNLV 150 Query: 130 GC------PTGDAKVTGGYNLPAKYIIHTVGPQ--DGS------AEKLESCYEKCLSFQQ 175 P G A++T G+NLPAK +IHTVGP GS +L +CY L+ + Sbjct: 151 EQRDFTQEPVGSAQITPGFNLPAKMVIHTVGPSLLPGSEPNQEEISQLAACYTSSLAKLE 210 Query: 176 EYQ----IKSIAFPCISTGIYGFPNRLAAHIALRTARKFL--ETNTEMNRIIFCTFLPID 229 E + KSI F CISTG++ FPN +A++IA+ + R + ++ ++ +IF F + Sbjct: 211 EQEEDGNDKSIVFCCISTGLFSFPNDIASNIAIESVRNYFSEHPHSSISEVIFNVFTETN 270 Query: 230 VEIY 233 +++Y Sbjct: 271 LKLY 274 >UniRef50_UPI0000ECB76F Cluster: Poly [ADP-ribose] polymerase 14 (EC 2.4.2.30) (PARP-14) (B aggressive lymphoma protein 2).; n=2; Gallus gallus|Rep: Poly 
[ADP-ribose] polymerase 14 (EC 2.4.2.30) (PARP-14) (B aggressive lymphoma protein 2). - Gallus gallus Length = 1636 Score = 104 bits (250), Expect = 2e-21 Identities = 58/143 (40%), Positives = 82/143 (57%), Gaps = 10/143 (6%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSI----GGCPTGD 135 ++++K D+ +D VVNA+N LK GG+ A+ +AAGP LQAECD + G GD Sbjct: 637 IAVYKADLCTHHVDVVVNASNEDLKHIGGLAWALLQAAGPELQAECDGVVRMSGSLQAGD 696 Query: 136 AKVTGGYNLPAKYIIHTVGP--QDGSAEK----LESCYEKCLSFQQEYQIKSIAFPCIST 189 A +TG LP K +IH VGP ++ AEK L+ +K L + Y +SIAFP +S Sbjct: 697 AVITGAGKLPCKQVIHAVGPRWKEQDAEKCVYLLKKTIKKSLQLAETYNHRSIAFPSVSG 756 Query: 190 GIYGFPNRLAAHIALRTARKFLE 212 GI+GFP + + +K LE Sbjct: 757 GIFGFPLHKCVNAIVSAIKKTLE 779 Score = 66.5 bits (155), Expect = 6e-10 Identities = 44/130 (33%), Positives = 63/130 (48%), Gaps = 9/130 (6%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAGGG-VDGAIHRAAGPFLQAEC--DSIGGCPT-GD 135 + + KG+I D VV + L+ G + A+ AGP LQ++ + +G P G Sbjct: 848 IMLKKGNIEDASTDGVVISVGGDLQLEKGQLAKALLSKAGPRLQSDLNDEGLGKSPVEGS 907 Query: 136 AKVTGGYNLPAKYIIHTVGP-----QDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTG 190 T GYNL Y+ H V P + + + L KCL +E +KSI FP I TG Sbjct: 908 VFTTRGYNLSCCYVFHAVTPGWSQGSESAVKILGKIVTKCLQTAEELSLKSITFPAIGTG 967 Query: 191 IYGFPNRLAA 200 I GFP+ + A Sbjct: 968 ILGFPSSVVA 977 Score = 57.2 bits (132), Expect = 4e-07 Identities = 61/240 (25%), Positives = 99/240 (41%), Gaps = 19/240 (7%) Query: 12 NRILKLSLEEKRKIYKSSDFI----DLENVDPWSKYLNKSQG--IDSKKSTTDDLKEFEK 65 +++ + S ++K + F+ D+ N+ +S + G +D + DL+ F Sbjct: 982 DKVYEFSSKKKTNSLREVHFLLHPKDVNNIQAFSNEFERRCGNDVDETEVKEQDLQTFFG 1041 Query: 66 IKINTEKN----KSISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFL 121 N ++ + S + GDITK D +VN +N GV AI AG + Sbjct: 1042 PISNPARDVYEMRIGSITFQVAAGDITKETGDVIVNISNQAFNLKTGVSKAILEGAGKEV 1101 Query: 122 QAECDSIGGCPTGDAKVTGGYNLPAKYIIHTVGPQDGSAEKLESCYEKCLSFQQEYQIKS 181 + EC + P T +LP K IIH V D ++ K L + Q S Sbjct: 1102 ENECAELALQPNDGYITTEAGSLPCKKIIHFVARDD-----IKVPVSKVLQECELQQYTS 1156 Query: 182 
IAFPCISTGIYG-FPNRLAAHIALRTARKFLETNT--EMNRIIFCTFLPIDVEIYETLMQ 238 + FP I TG G FP+ L A + F +N+ + I F P + ++ T M+ Sbjct: 1157 VTFPAIGTGQAGRFPD-LVADEMMDAITDFARSNSTPSVKTIKIVIFQPHLLNVFHTSMK 1215 >UniRef50_Q5XC09 Cluster: UPF0189 protein M6_Spy0919; n=19; Streptococcus|Rep: UPF0189 protein M6_Spy0919 - Streptococcus pyogenes serotype M6 Length = 270 Score = 104 bits (250), Expect = 2e-21 Identities = 70/177 (39%), Positives = 94/177 (53%), Gaps = 20/177 (11%) Query: 82 IFKGDITKLEIDAVVNAANSRLKA-----GGGVDGAIHRAAGPFLQAECDSI----GGCP 132 ++ GDI L +DA+VNAANS L G +D AIH AG L+ C +I G Sbjct: 88 LYHGDIRYLAVDAIVNAANSELLGCFIPNHGCIDNAIHTFAGSRLRLACQAIMTEQGRKE 147 Query: 133 T-GDAKVTGGYNLPAKYIIHTVGPQDGS--------AEKLESCYEKCLSFQQEYQIKSIA 183 G AK+T Y+LPA YIIHTVGP+ A+ L CY L + + S+A Sbjct: 148 AIGQAKLTSAYHLPASYIIHTVGPRITKGRHVSPIRADLLARCYRSSLDLAVKAGLTSLA 207 Query: 184 FPCISTGIYGFPNRLAAHIALRTARKFLETNTEMN--RIIFCTFLPIDVEIYETLMQ 238 F ISTG +GFP + AA IA++T K+ + E +IF TF D +Y+T +Q Sbjct: 208 FCSISTGEFGFPKKEAAQIAIKTVLKWQAEHPESKTLTVIFNTFTSEDKALYDTYLQ 264 >UniRef50_Q8ZXT3 Cluster: UPF0189 protein PAE1111; n=8; Thermoprotei|Rep: UPF0189 protein PAE1111 - Pyrobaculum aerophilum Length = 182 Score = 104 bits (249), Expect = 2e-21 Identities = 65/167 (38%), Positives = 87/167 (52%), Gaps = 9/167 (5%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAE----CDSIGGCPTGD 135 V + +GDIT++E DA+VNAANS L+ GGGV GAI R G +Q E G P GD Sbjct: 10 VVLMRGDITEVEADAIVNAANSYLEHGGGVAGAIVRKGGQVIQEESREWVRKHGPVPVGD 69 Query: 136 AKVTGGYNLPAKYIIHTVGPQDG--SAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYG 193 VT L AKY+IH VGP+ G EKL + L +E + SIA P ISTGI+G Sbjct: 70 VAVTSAGRLKAKYVIHAVGPRCGVEPIEKLAEAVKNALLKAEELGLVSIALPAISTGIFG 129 Query: 194 FPNRLAAHIALRTARKFLETNTEMNRIIFCTFLPIDVEIYETLMQLY 240 P AA R+ + RI+ + E Y+ ++++ Sbjct: 130 CPYDAAAEQMATAIREVAPALRSIRRILVVLY---GEEAYQKFLEVF 173 >UniRef50_A5D049 Cluster: Predicted phosphatase; n=3; Bacteria|Rep: Predicted phosphatase - Pelotomaculum thermopropionicum SI Length = 359 Score = 103 bits (248), 
Expect = 3e-21 Identities = 57/149 (38%), Positives = 81/149 (54%), Gaps = 3/149 (2%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKVT 139 + + KGDIT+L++DA+VNAAN+ L G GV GAI R G ++ E + G P G+A VT Sbjct: 2 IKVLKGDITELQVDAIVNAANNHLWMGAGVAGAIKRKGGAAIEEEAVAKGPIPVGEAVVT 61 Query: 140 GGYNLPAKYIIHTVG-PQD--GSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGFPN 196 G L A+Y++H QD AEK+ + L E +K+IAFP + TG+ G Sbjct: 62 GAGLLKARYVVHAAAMGQDLVTDAEKVRAATRNALLRAGELGLKTIAFPALGTGVGGLEF 121 Query: 197 RLAAHIALRTARKFLETNTEMNRIIFCTF 225 AA + + R+ L E +IF F Sbjct: 122 DTAARVMVGEVRRHLALGLEPGEVIFALF 150 >UniRef50_UPI00006A2284 Cluster: UPI00006A2284 related cluster; n=1; Xenopus tropicalis|Rep: UPI00006A2284 UniRef100 entry - Xenopus tropicalis Length = 694 Score = 101 bits (242), Expect = 2e-20 Identities = 60/152 (39%), Positives = 83/152 (54%), Gaps = 12/152 (7%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSI----GGCPTGD 135 V+++K D+ + +D VVNAAN LK GG+ GA+ RAAGP LQ +CD I G GD Sbjct: 3 VAVYKDDLARHSVDVVVNAANEDLKHIGGLAGALLRAAGPKLQTDCDQIIKIRGRLSAGD 62 Query: 136 AKVTGGYNLPAKYIIHTVGPQ-----DGSAEK-LESCYEKCLSFQQEYQIKSIAFPCIST 189 A +T NLP K +IH VGP G ++ L CL +SI P +S+ Sbjct: 63 AVITDAGNLPCKQVIHAVGPVWNAFFPGKCDRQLHKAITSCLDLAARKGHRSIGIPAVSS 122 Query: 190 GIYGFP-NRLAAHIALRTARKFLETNTEMNRI 220 GI+GFP R HI L + + ++E N+ + I Sbjct: 123 GIFGFPLKRCVTHI-LGSIKAYVEDNSAHSTI 153 Score = 57.2 bits (132), Expect = 4e-07 Identities = 44/161 (27%), Positives = 70/161 (43%), Gaps = 9/161 (5%) Query: 57 TDDLK-EFEKIKINT-EKNKSISERVSIFKGDITKLEIDAVVNAANSRLKAGGG-VDGAI 113 TD L+ E E++K T N+ + + + + I D +VN +L+ + A+ Sbjct: 170 TDALRAESEQLKEQTVTTNEGLI--IKVIQQAIEDSTTDVIVNNVGQKLQLNEWQISRAL 227 Query: 114 HRAAGPFLQ---AECDSIGGCPTGDAKVTGGYNLPAKYIIHTVGPQ-DGSAEKLESCYEK 169 AGP LQ + P G T G NL ++H V PQ D + L + Sbjct: 228 AARAGPQLQQLLSNSSQGASAPNGSVFSTDGCNLNCAKVLHVVMPQWDRRTQVLRKSIKS 287 Query: 170 CLSFQQEYQIKSIAFPCISTGIYGFPNRLAAHIALRTARKF 210 CL ++ ++SI+ P I TG G+P L A + + F Sbjct: 288 
CLKLTEQQSLQSISIPAIGTGKLGYPKDLVAAVTFKEILHF 328 >UniRef50_A7HJC7 Cluster: Appr-1-p processing domain protein; n=1; Fervidobacterium nodosum Rt17-B1|Rep: Appr-1-p processing domain protein - Fervidobacterium nodosum Rt17-B1 Length = 184 Score = 101 bits (241), Expect = 2e-20 Identities = 58/147 (39%), Positives = 77/147 (52%), Gaps = 8/147 (5%) Query: 85 GDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECD----SIGGCPTGDAKVTG 140 GDIT IDA+VNAANS L GGGV G I R GP +Q E D G G VTG Sbjct: 16 GDITTQNIDAIVNAANSYLSHGGGVAGVISRKGGPTIQKESDEYVKKYGPVEPGGVAVTG 75 Query: 141 GYNLPAKYIIHTVGP---QDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGFP-N 196 NL AKY++HTVGP + + + + C+ + E IK+IA P + TGI+G+P Sbjct: 76 AGNLSAKYVLHTVGPIGDKPQNDDIIVKCFINIIKKSDELGIKTIAIPFVGTGIFGYPLE 135 Query: 197 RLAAHIALRTARKFLETNTEMNRIIFC 223 R ++ + + +IIFC Sbjct: 136 RFIENVTKVLINYLKDYEGTLQKIIFC 162 >UniRef50_A1RWM4 Cluster: Appr-1-p processing domain protein; n=2; Thermoproteales|Rep: Appr-1-p processing domain protein - Thermofilum pendens (strain Hrk 5) Length = 189 Score = 101 bits (241), Expect = 2e-20 Identities = 66/166 (39%), Positives = 84/166 (50%), Gaps = 10/166 (6%) Query: 86 DITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDS----IGGCPTGDAKVTGG 141 DIT+ + +A+VNAANS LK GGGV AI R G +Q E D G P G+ VTG Sbjct: 19 DITEADTEAIVNAANSYLKHGGGVALAIVRKGGDVIQRESDEWVKRYGPVPEGEVAVTGA 78 Query: 142 YNLPAKYIIHTVGPQDGSA---EKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGFPNRL 198 L AKY+IH VGP+ G EKL L +E +KSIA P ISTG++G+P R Sbjct: 79 GKLKAKYVIHAVGPKYGDPLGDEKLARAISNSLLKAEELGLKSIALPAISTGVFGYPYRR 138 Query: 199 AAHIALRTARKFLETNTEMNRIIFCTFLPIDVEIYETLMQLYFPTL 244 A I A FL T ++ + E YE ++ L Sbjct: 139 CAEI---MADVFLATAGKLKSLRTVLVCLWGSEAYEAFRSVFLEKL 181 >UniRef50_A0CX10 Cluster: Chromosome undetermined scaffold_3, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_3, whole genome shotgun sequence - Paramecium tetraurelia Length = 183 Score = 99 bits (238), Expect = 5e-20 Identities = 69/173 (39%), Positives = 90/173 (52%), Gaps = 
14/173 (8%) Query: 80 VSIFKGDITKL-EIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAEC----DSIGGCPTG 134 V I K +I KL ++DA+VNAAN L GGGV GAI +AAG L+ EC G PT Sbjct: 6 VKIIKENIVKLVDVDAIVNAANQELLPGGGVCGAIFQAAGRELERECQQYIQQYGIVPTS 65 Query: 135 DAKVTGGYNLP---AKYIIHTVGP---QDGSAE-KLESCYEKCLSFQ-QEYQIKSIAFPC 186 VT L KYIIH VGP Q S E +L+ C L+ ++KS+A P Sbjct: 66 KLAVTSSCQLKKNNIKYIIHAVGPKYFQSSSPEDELQICVNNILNQSFNVLELKSVAIPA 125 Query: 187 ISTGIYGFPNRLAAHIALRTARKF-LETNTEMNRIIFCTFLPIDVEIYETLMQ 238 IS+GIYGFP L A I ++ +T+ + II C F I++ + Q Sbjct: 126 ISSGIYGFPKGLCAQIFKLVIEEYQKDTSNKQGEIILCNFDQETTTIFQKVFQ 178 >UniRef50_Q4T065 Cluster: Chromosome undetermined SCAF11328, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome undetermined SCAF11328, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 566 Score = 98.7 bits (235), Expect = 1e-19 Identities = 57/183 (31%), Positives = 92/183 (50%), Gaps = 11/183 (6%) Query: 29 SDFIDLENVDPWSKYLNKSQGIDSKKSTTDDLKEFEKIKINTEKNKSISERVSIFKGDIT 88 S F+D++ + W + L++ ++ + E + I+ ++ +FKGD+ Sbjct: 8 SHFLDVQTLPTWPQQLDQDG-----QAAAPEPSEDQGFPSPFPFRADINAKIVLFKGDVA 62 Query: 89 KLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKVTGGYNLPAKY 148 L ++VN ++ L V +IHR AGP L+ E + GC TG+AK+T G+ L A++ Sbjct: 63 LLNCTSIVNTSSESLNDKNPVSDSIHRLAGPELRDELLKLKGCRTGEAKLTKGFGLAARF 122 Query: 149 IIHTVGP------QDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGFPNRLAAHI 202 IIHTVGP + + L SCY L E + S+ I+T G+P A H+ Sbjct: 123 IIHTVGPKYKTKYRTAAESSLYSCYRSVLQLVVEQSMASVGLCTITTSKRGYPLEEATHM 182 Query: 203 ALR 205 ALR Sbjct: 183 ALR 185 >UniRef50_Q2SM57 Cluster: Predicted phosphatase; n=1; Hahella chejuensis KCTC 2396|Rep: Predicted phosphatase - Hahella chejuensis (strain KCTC 2396) Length = 180 Score = 98.3 bits (234), Expect = 2e-19 Identities = 59/166 (35%), Positives = 84/166 (50%), Gaps = 12/166 (7%) Query: 85 GDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGP-FLQAECDSIGGCPTGDAKVTGGYN 143 GDIT+LE+DA+V A+ L G G+ I AG L+A C GGC G A +T G+ Sbjct: 7 
GDITELEVDAIVCPAHKYLSKGRGLSAQIFEQAGEEALEAACSQAGGCKVGGACLTPGFK 66 Query: 144 LPAKYIIHTVGPQ-------DGS-AEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGFP 195 LPAK+IIHTV PQ GS L +CY+ + E +K+IAFP + G P Sbjct: 67 LPAKHIIHTVTPQWTGGDQWGGSDLHLLANCYDSVVRLALEQGVKTIAFPALGAGTNKTP 126 Query: 196 NRLAAHIALRTARKFLETNTEMNRIIFCTFLPIDVEIYETLMQLYF 241 +AAH L K+ ++ R+I C ++ + + +F Sbjct: 127 QSMAAHEGLEVLVKYADS---FERLIICLHWEAGLDTWRRTYEDFF 169 >UniRef50_UPI0000E80997 Cluster: PREDICTED: similar to Poly [ADP-ribose] polymerase 14 (PARP-14) (B aggressive lymphoma protein 2); n=3; Gallus gallus|Rep: PREDICTED: similar to Poly [ADP-ribose] polymerase 14 (PARP-14) (B aggressive lymphoma protein 2) - Gallus gallus Length = 1655 Score = 96.7 bits (230), Expect = 5e-19 Identities = 54/141 (38%), Positives = 78/141 (55%), Gaps = 10/141 (7%) Query: 82 IFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSI----GGCPTGDAK 137 ++KG++ +D VVNAA+ L+ G A+ +AAGP LQAECD + G GDA Sbjct: 646 VYKGNLCNYPVDVVVNAASEDLRHTDGFAWALLQAAGPELQAECDEVVRMTGSLQAGDAV 705 Query: 138 VTGGYNLPAKYIIHTVGPQ--DGSAEK----LESCYEKCLSFQQEYQIKSIAFPCISTGI 191 +TG LP K +IH +GPQ + ++ K L +K L + Y +SIAFP +S GI Sbjct: 706 ITGAGKLPCKQVIHAIGPQWKEKNSGKCMYLLMEAIKKSLQLAETYNHRSIAFPSVSGGI 765 Query: 192 YGFPNRLAAHIALRTARKFLE 212 +GFP + + +K LE Sbjct: 766 FGFPPHKCVNAIVSAIKKTLE 786 Score = 76.6 bits (180), Expect = 5e-13 Identities = 66/210 (31%), Positives = 95/210 (45%), Gaps = 17/210 (8%) Query: 10 EKNRILKLSLEEKRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDLKEFEKIKIN 69 E R+L+ +++ K KSS + + P N QG ++ DDL F Sbjct: 805 ETVRVLRETVQ-KEFTAKSSSSVLQQQCSP-----NHRQGESQREKRGDDL--FMATGGE 856 Query: 70 TEKNKSISERVSIFKGDITKLEIDAVVNAANSRLKAG-GGVDGAIHRAAGPFLQAECDSI 128 + R+ + K DI D +VN+ + LK G G + A+ + AGP LQ E D Sbjct: 857 NMITTAEGLRIQVEKKDIIDATTDVIVNSVGTDLKFGVGPLCRALLKEAGPELQMEFDKE 916 Query: 129 GG---CPTGDAKVTGGYNLPAKYIIHTVGPQ----DGSAEK-LESCYEKCLSFQQEYQIK 180 G G T GY L ++ H V PQ G A K LE+ KCL +E+ +K Sbjct: 917 KGQQVAGNGSVVCTKGYILDCTFVFHAVLPQWDRGSGQALKTLENTVHKCLMKAEEFGLK 976 Query: 181 
SIAFPCISTGIYGFPNRLAAHIALRTARKF 210 SIAFP I TG + FP+ + + + KF Sbjct: 977 SIAFPAIGTGGFSFPHTVVSKLMFDEVFKF 1006 Score = 56.0 bits (129), Expect = 8e-07 Identities = 40/114 (35%), Positives = 54/114 (47%), Gaps = 5/114 (4%) Query: 77 SERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDA 136 S + + GDITK + + +VN AN A GV AI AAG ++ EC+ GG Sbjct: 1072 SVTLKVTSGDITKEDTEVIVNIANQTFDATSGVFKAIMDAAGFDVKEECNQYGGLLQSGF 1131 Query: 137 KVTGGYNLPAKYIIHTVGPQDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTG 190 T G L + IIH + + + E +E C Q KS+AFP I TG Sbjct: 1132 ITTKGGALLCRRIIHLIHSMNVKNQVSEVLHE-C----QLRTYKSVAFPAIGTG 1180 >UniRef50_A6SR30 Cluster: Putative uncharacterized protein; n=1; Botryotinia fuckeliana B05.10|Rep: Putative uncharacterized protein - Botryotinia fuckeliana B05.10 Length = 474 Score = 96.3 bits (229), Expect = 6e-19 Identities = 61/169 (36%), Positives = 91/169 (53%), Gaps = 10/169 (5%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSI---GGCPTGDA 136 V + GD+ K +D +VNAAN +LK GGG+DGAIH AAGP LQ E + + G G Sbjct: 21 VEVLIGDMLKYPVDVIVNAANVKLKKGGGIDGAIHAAAGPELQGEMNELFQHPGQVGGAY 80 Query: 137 KVTGGYNLPA-KYIIHTVGPQDGSAEK-----LESCYEKCLSFQQEYQIKSIAFPCISTG 190 T +++ + +YIIH VGP E+ L + + L + +++SIAFP IS G Sbjct: 81 GTTSSWDIQSCRYIIHAVGPNWNIPEQQDGKFLFTAIQNSLDLAMKNKLRSIAFPGISMG 140 Query: 191 IYGFPNRLAAHIALRTARKF-LETNTEMNRIIFCTFLPIDVEIYETLMQ 238 I+ P LA + + R + ++ EM+RI + EI ET ++ Sbjct: 141 IFAMPKSLAGLVIISALRTWIIKYRGEMDRISILLLGYSEDEITETRLR 189 >UniRef50_UPI0000660739 Cluster: ganglioside induced differentiation associated protein 2; n=1; Takifugu rubripes|Rep: ganglioside induced differentiation associated protein 2 - Takifugu rubripes Length = 529 Score = 95.9 bits (228), Expect = 8e-19 Identities = 52/182 (28%), Positives = 93/182 (51%), Gaps = 11/182 (6%) Query: 29 SDFIDLENVDPWSKYLNKSQGIDSKKSTTDDLKEFEKIKINTEKNKSISERVSIFKGDIT 88 S F+D++ + W + L + ++T+ + + + + I+ ++ +FKGD+ Sbjct: 8 SQFVDIQTLPTWPQQLE-----EDGEATSLEQGDGQDVPSPFPFRPDINSKIILFKGDVA 62 Query: 89 
KLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKVTGGYNLPAKY 148 L ++VN ++ L V +IH+ AGP L+ E + GC TG+AK+T G+ L A++ Sbjct: 63 LLNCTSIVNTSSESLNDKNPVSDSIHQLAGPELRDELLKLKGCRTGEAKLTKGFGLAARF 122 Query: 149 IIHTVGP------QDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGFPNRLAAHI 202 IIHTVGP + + L SCY + E + S+ ++T G+P + H+ Sbjct: 123 IIHTVGPKFKTKYRTAAESSLHSCYRNIMQLVVEQSMASVGLCVVTTSKRGYPLEDSTHM 182 Query: 203 AL 204 AL Sbjct: 183 AL 184 >UniRef50_A7EET2 Cluster: Putative uncharacterized protein; n=1; Sclerotinia sclerotiorum 1980|Rep: Putative uncharacterized protein - Sclerotinia sclerotiorum 1980 Length = 506 Score = 95.5 bits (227), Expect = 1e-18 Identities = 63/176 (35%), Positives = 93/176 (52%), Gaps = 13/176 (7%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDS---IGGCPTGDA 136 V + GD+ K +D +VNAAN+ L G G+DG IHR AGP L AE + G G Sbjct: 21 VEVVDGDLLKYPVDVIVNAANASLVRGDGIDGEIHRQAGPELAAEMKTQFPHPGKQGGAY 80 Query: 137 KVTGGYNLPA-KYIIHTVG-----PQDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTG 190 T +++ + +YIIH VG P + L + Y LS + ++SIAFP IS G Sbjct: 81 GTTHSWDITSCQYIIHAVGPDWRQPNQRATGLLANAYHNSLSLAAKNNLRSIAFPAISVG 140 Query: 191 IYGFPNRLAAHIALRTARKFLETNT-EMNRI---IFCTFLPIDVEIYETLMQLYFP 242 I+ P +A ++T R +++++ EM+RI +F P VE+ +QLY P Sbjct: 141 IFQMPRGMAGVTVMKTIRSWIDSHQGEMDRIGILLFGFDQPEIVEMKYPNLQLYIP 196 >UniRef50_UPI0000F2CC13 Cluster: PREDICTED: similar to B aggressive lymphoma long; n=1; Monodelphis domestica|Rep: PREDICTED: similar to B aggressive lymphoma long - Monodelphis domestica Length = 1624 Score = 95.1 bits (226), Expect = 1e-18 Identities = 74/218 (33%), Positives = 115/218 (52%), Gaps = 26/218 (11%) Query: 2 VNSTKWEIEKNRILKLSLE-EKRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDL 60 V+S + + + L +S++ E KI KS + + + +D K Q + +S D Sbjct: 35 VHSWIESLMEQKSLHISIDNENLKILKSYESLFRDVID------KKFQCASNLESALDSA 88 Query: 61 KEFEKIKINTEKNKSISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPF 120 K F KI ++++ +S++K D+T+ DAVVNAAN RL GG+ A+ RA GP Sbjct: 89 KVF-KIMLSSQIE------LSVWKDDLTRHPADAVVNAANERLLHAGGLALALVRAGGPL 141 
Query: 121 LQAECDSI----GGCPTGDAKVTGGYNLPAKYIIHTVGPQ--DGSAEK----LESCYEKC 170 ++ E ++I G PT + VT G LP IIH VGP+ D +AE+ LE Sbjct: 142 IEKESEAIIMQRGEVPTSEIAVTTGGQLPCSCIIHAVGPRWSDWNAERCCQELERATANI 201 Query: 171 LSF--QQEYQIKSIAFPCISTGIYGFPNRLAAHIALRT 206 L++ + IK++A P +S+GI+GFP L I + T Sbjct: 202 LNYVTNDSHGIKTVAIPALSSGIFGFPLELCVQIIILT 239 Score = 72.1 bits (169), Expect = 1e-11 Identities = 47/162 (29%), Positives = 78/162 (48%), Gaps = 5/162 (3%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAG-GGVDGAIHRAAGPFLQAECDSIGGCPTGDAK- 137 + I +G I K ++D +VN+ ++ G V AI AGP ++ E + +K Sbjct: 296 LQIIEGFIEKQQVDVIVNSISASNSFDLGKVSNAILIHAGPEIEEEFSKTYSGMSESSKL 355 Query: 138 --VTGGYNLPAKYIIHTVGPQDGSAEK-LESCYEKCLSFQQEYQIKSIAFPCISTGIYGF 194 VT G+NL K++ H V P +K L+ +CL + + SI+FP + TG G Sbjct: 356 VVVTEGFNLACKHVYHVVWPSSYQTKKVLKEAVMRCLEKTCQENMNSISFPALGTGNIGL 415 Query: 195 PNRLAAHIALRTARKFLETNTEMNRIIFCTFLPIDVEIYETL 236 P R A I L+ +F + + + ++ P D E+YE + Sbjct: 416 PKREAISIMLKEIFQFSKNHPQKRLLVNFVVYPNDNELYEVM 457 >UniRef50_A2QSI2 Cluster: Contig An08c0280, complete genome; n=1; Aspergillus niger|Rep: Contig An08c0280, complete genome - Aspergillus niger Length = 603 Score = 94.7 bits (225), Expect = 2e-18 Identities = 70/202 (34%), Positives = 105/202 (51%), Gaps = 28/202 (13%) Query: 69 NTEKNKSISERVSIFKGDITKLE-IDAVVNAANSRL-----KAGGGVDGAIHRAAGPFLQ 122 ++ +K + + +++GDIT L+ + A+ NAAN ++ A +D IH AGP L+ Sbjct: 98 SSSSSKPLPATLHLWQGDITTLDGVTAITNAANEQMLGCFQPAHRCLDNVIHARAGPRLR 157 Query: 123 AEC-----DSIGGCPTGDAKVTGGYNLPAKYIIHTVGPQ-DG--------SAEKLESCYE 168 EC P G A T GY LPA Y+IHTVGPQ D ++L CYE Sbjct: 158 EECFHHMDQGQRTLPVGHACATKGYCLPAPYVIHTVGPQLDAGQPVPTAHQRQQLRQCYE 217 Query: 169 KCLSFQQ-----EYQIKSIAFPCISTGIYGFPNRLAAHIALRTARKFL--ETNTEMNRII 221 L + + + KSIA ISTG++ FP AA IA+++ +L +T + II Sbjct: 218 AVLDVAEALPASDPRGKSIALCGISTGLFAFPVEEAASIAIQSVLDWLRHHLHTSITNII 277 Query: 222 FCTFLPIDVEIY-ETLMQLYFP 242 F TF D +Y +TL ++++P Sbjct: 278 FNTFTDTDTAVYQQTLKKMHYP 299 >UniRef50_Q6NRC6 Cluster: MGC83934 protein; n=2; Xenopus|Rep: 
MGC83934 protein - Xenopus laevis (African clawed frog) Length = 914 Score = 93.9 bits (223), Expect = 3e-18 Identities = 61/170 (35%), Positives = 89/170 (52%), Gaps = 12/170 (7%) Query: 71 EKNKSISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAE----CD 126 EK S RVS++KGD+T+ +DAVVNAAN LK GG+ A+ +A G +Q E + Sbjct: 73 EKKLSEGLRVSVWKGDMTRQNVDAVVNAANEDLKHFGGLALALVKAGGAVIQDESRRHIE 132 Query: 127 SIGGCPTGDAKVTGGYNLPAKYIIHTVGPQ--DGSAEKLESCYEK-----CLSFQQEYQI 179 +G VT NLP K IIH VGP+ G K E ++ + E + Sbjct: 133 KYKKVKSGSIAVTSAGNLPCKMIIHAVGPEWSPGINAKCEQELKEVIRNVLMQVMNESNV 192 Query: 180 KSIAFPCISTGIYGFPNRLAAHIALRTARKFLETNTEMNRIIFCTFLPID 229 +S+A P +S+GI+ FP + I T +KF +T T +++ F+ ID Sbjct: 193 RSVAIPAVSSGIFRFPLQRCTEIIASTTKKFCDTET-YHKLAEIRFVNID 241 Score = 60.9 bits (141), Expect = 3e-08 Identities = 49/163 (30%), Positives = 70/163 (42%), Gaps = 8/163 (4%) Query: 84 KGDITKLEIDAVVNA--ANSRLKAGGGVDGAIHRAAGPFLQAEC--DSIGGCPTGDAKVT 139 KG I + + +VN+ AN L G + AI R AG L E S PT T Sbjct: 362 KGYIEEQKTAVIVNSLGANRNLNEGN-ISKAILRKAGNSLSQEVLDKSKYVSPTDIMIPT 420 Query: 140 GGYNLPAKYIIHTVGPQDGSAEK--LESCYEKCLSFQQEYQIKSIAFPCISTGIYGFPNR 197 GY LP ++ H + + GS +K L+ CL+ Y SI+FP + TG+ FP Sbjct: 421 RGYYLPCDFVYHVILQRSGSDQKKILKDGINACLNTALRYNTSSISFPALGTGMLCFPKP 480 Query: 198 LAAHIALRTARKFLETNTEMNRIIFCTFLPIDVEIYETLMQLY 240 + A + F + N N IF P D + Y + + Sbjct: 481 VVAKVMTDEVLSFAKEN-PCNMDIFFVIHPNDTDTYSEFKKAF 522 >UniRef50_Q54PT1 Cluster: Putative uncharacterized protein; n=1; Dictyostelium discoideum AX4|Rep: Putative uncharacterized protein - Dictyostelium discoideum AX4 Length = 568 Score = 93.5 bits (222), Expect = 4e-18 Identities = 54/186 (29%), Positives = 91/186 (48%), Gaps = 12/186 (6%) Query: 65 KIKINTEKNKSISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAE 124 + KI+TE I+ R+ ++ GDI L D +V + + L + I + G + + Sbjct: 48 QFKIDTE----INSRICLWMGDICNLNTDTIVYSNSKTLTESDTISDKIFKYGGSEMMND 103 Query: 125 CDSIGGCPTGDAKVTGGYNLPAKYIIHTVGPQ------DGSAEKLESCYEKCLSFQQEYQ 178 G C G++ +T G NLP+++++HTV P + L SCY + + Sbjct: 104 
IQKNGECRYGESIITSGGNLPSRFVVHTVCPTYNPKYLSAAENALNSCYRSAFHLSMDVK 163 Query: 179 IKSIAFPCISTGIYGFPNRLAAHIALRTARKFLET--NTEMNRIIFCTFLPIDVEIYETL 236 KSI+F + + FP+ HIALRT R+FLE + ++I D+ +YE + Sbjct: 164 SKSISFSTLHSEKRQFPSVGGCHIALRTIRRFLEKPFSKSFEKVILAINTFEDLRLYEQM 223 Query: 237 MQLYFP 242 + +YFP Sbjct: 224 LPIYFP 229 >UniRef50_UPI0000F3214F Cluster: UPI0000F3214F related cluster; n=1; Bos taurus|Rep: UPI0000F3214F UniRef100 entry - Bos Taurus Length = 166 Score = 91.5 bits (217), Expect = 2e-17 Identities = 61/165 (36%), Positives = 90/165 (54%), Gaps = 16/165 (9%) Query: 5 TKWEIEKNRILKLSLEEKRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDLKEFE 64 TKW K + L L ++RK+++ + L+ WS L K + +K + ++ Sbjct: 8 TKWREIKQQSGTLRLRDQRKLHRR---VALD----WSLILIKKK---MEKGRKEGKRKHC 57 Query: 65 KIKINTEKNKSISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAE 124 + N K+K+ + V ++K + + V AN+ L GGGVDG IHRAAGP L AE Sbjct: 58 QSGFNLRKHKT--KNVFLYKSTYFDICV-CVCMTANASLLGGGGVDGCIHRAAGPCLLAE 114 Query: 125 CDSIGGCPTGDAKVTGGYNLPAKYIIHTVGPQDGSAEKLESCYEK 169 C ++ GC TG AK+T GY+LPAKY +H + P S L SC+ K Sbjct: 115 CRNLNGCETGHAKITCGYDLPAKYFVHEMMPISYS---LFSCHGK 156 >UniRef50_A6LTB5 Cluster: Appr-1-p processing domain protein; n=1; Clostridium beijerinckii NCIMB 8052|Rep: Appr-1-p processing domain protein - Clostridium beijerinckii NCIMB 8052 Length = 214 Score = 90.2 bits (214), Expect = 4e-17 Identities = 48/99 (48%), Positives = 60/99 (60%), Gaps = 7/99 (7%) Query: 86 DITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKVTGGYNLP 145 DITK++ DA+VNAAN+ L GGGVDGAIH+A G L EC + GC TG +K+T YNL Sbjct: 10 DITKIKFDAIVNAANASLLGGGGVDGAIHKACGEKLLDECRQLNGCLTGRSKLTRSYNLS 69 Query: 146 ---AKYIIHTVGP---QDGSAEK-LESCYEKCLSFQQEY 177 ++IHTVGP +GS EK L + Y Y Sbjct: 70 DHGVHWVIHTVGPIYRNNGSEEKYLRNAYRSVFDIAANY 108 Score = 35.5 bits (78), Expect = 1.2 Identities = 21/65 (32%), Positives = 31/65 (47%), Gaps = 2/65 (3%) Query: 176 EYQIKSIAFPCISTGIYGFPNRLAAHIALRTARKFLETNTEMNRIIFCTFLPIDVEIYET 235 ++ IK+IA P ISTG Y +P A +IAL F+ N + + +D + Y Sbjct: 146 
DHPIKTIALPSISTGAYSYPLNEACNIALDEILSFI--NNSPDTFDEIAMVCLDEKTYNM 203 Query: 236 LMQLY 240 LY Sbjct: 204 YKSLY 208 >UniRef50_UPI0000E8099B Cluster: PREDICTED: similar to PARP9 protein; n=2; Gallus gallus|Rep: PREDICTED: similar to PARP9 protein - Gallus gallus Length = 796 Score = 89.8 bits (213), Expect = 5e-17 Identities = 54/144 (37%), Positives = 79/144 (54%), Gaps = 12/144 (8%) Query: 82 IFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSI----GGCPTGDAK 137 ++K D+T + DAVVNAAN L+ G + A+ A GP + E + G PTG Sbjct: 80 VYKDDLTSHKADAVVNAANESLEHSGALALALLNAGGPEIAEESRNFIRKHGKVPTGKIA 139 Query: 138 VTGGYNLPAKYIIHTVGPQDGSAEKLESCY--EKCLSFQQEY------QIKSIAFPCIST 189 VTGG LP K IIH +GP +EK + C E+ + +Y IKS+A P +S+ Sbjct: 140 VTGGGKLPCKKIIHAIGPIWYPSEKEKCCVLLEEAVVNVLKYASDPKNNIKSVAIPAVSS 199 Query: 190 GIYGFPNRLAAHIALRTARKFLET 213 G++GFP L A + + + + F+ET Sbjct: 200 GVFGFPVNLCAQVIVMSIKLFVET 223 Score = 59.7 bits (138), Expect = 7e-08 Identities = 45/153 (29%), Positives = 77/153 (50%), Gaps = 11/153 (7%) Query: 79 RVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAE-CDSIGGCPTG-DA 136 R+ I KG + K+ A+V++ +S + + A+ + AGP LQAE + + + Sbjct: 279 RLRIIKGYLEKIRTTAIVSSVSSDGEFCSQISTAMLQKAGPTLQAEILSQLKHLDSSKEL 338 Query: 137 KVTGGYNLPAKYIIHTVGPQDGS----AEKLESCYEKCLSFQQEYQIKSIAFPCISTGIY 192 VT GYNLP+ +++H + P E+L+ +CL F + Y + SIAFP + + Sbjct: 339 IVTSGYNLPSDFVLHVLWPCFNHVVLLCEQLKEIVNRCLYFVRNYPLPSIAFPEKNWSL- 397 Query: 193 GFPNRLAAHI----ALRTARKFLETNTEMNRII 221 P + A I L ARK+ ET ++ ++ Sbjct: 398 KLPVAIVAEIMIEEVLDFARKYPETKIDVQFVL 430 >UniRef50_UPI00006A1CA6 Cluster: poly (ADP-ribose) polymerase family, member 14; n=12; Xenopus tropicalis|Rep: poly (ADP-ribose) polymerase family, member 14 - Xenopus tropicalis Length = 1527 Score = 89.0 bits (211), Expect = 9e-17 Identities = 50/151 (33%), Positives = 77/151 (50%), Gaps = 10/151 (6%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSI----GGCPTGD 135 ++++K D+T+ +D VVNAA LK G+ A+ AAGP LQ ECD I G GD Sbjct: 526 
IAVYKDDLTRHRVDVVVNAAREDLKHTEGLALALLNAAGPKLQTECDHIIKREGKYSVGD 585 Query: 136 AKVTGGYNLPAKYIIHTVGP------QDGSAEKLESCYEKCLSFQQEYQIKSIAFPCIST 189 + +TG NLP K +IHTV P Q L +CL E + SI P + + Sbjct: 586 SVITGAGNLPCKQVIHTVSPKWDPNSQTRCTRLLRRGISRCLELAAENGLSSIGIPAVGS 645 Query: 190 GIYGFPNRLAAHIALRTARKFLETNTEMNRI 220 + GFP ++ + + R+++E+ ++ Sbjct: 646 QMSGFPVTVSVQNIVESVRQYVESPQRSRKV 676 Score = 66.5 bits (155), Expect = 6e-10 Identities = 52/161 (32%), Positives = 74/161 (45%), Gaps = 11/161 (6%) Query: 52 SKKSTTDDLKE-FEKIKINTEKNKSISERVSIFKGDITKLEIDAVVNAANSRLKAG-GGV 109 SK +T D KE + ++ K + I +G+I D +VN+ L G V Sbjct: 709 SKGNTNPDSKEPLRRSDVHMVTTKE-GVNIKIIQGNIQDATTDVIVNSVGKDLDLNTGAV 767 Query: 110 DGAIHRAAGPFLQAECDSIGG---CPTGDAKVTGGYNLPAKYIIHTVGP--QDG--SAEK 162 A++ AG LQ + + G VT G+ L K +IH V P G SAEK Sbjct: 768 SKALNAKAGTKLQQQLREMSRGTQVEEGSVFVTNGFGLNCKKVIHVVTPGWDQGKRSAEK 827 Query: 163 -LESCYEKCLSFQQEYQIKSIAFPCISTGIYGFPNRLAAHI 202 L + CLS ++ +++SI FP I TG GFP L A + Sbjct: 828 ILRTIMTNCLSTTEKEKLRSITFPAIGTGALGFPKDLVASL 868 Score = 61.7 bits (143), Expect = 2e-08 Identities = 41/137 (29%), Positives = 63/137 (45%), Gaps = 6/137 (4%) Query: 77 SERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDA 136 S + + GDITK D +VN++NS GV AI AAG ++ EC ++G Sbjct: 945 SLKYQVRTGDITKESTDVIVNSSNSSFTQKIGVSKAILEAAGKSIEDECATLGAQANKGY 1004 Query: 137 KVTGGYNLPAKYIIH--TVGPQDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGF 194 VT NLP ++IIH T+ D + ++C + + S+A P + TG G Sbjct: 1005 IVTQKGNLPCRHIIHVYTISTPDRIKASVLDVLQEC----ENLKATSVALPAVGTGAGGA 1060 Query: 195 PNRLAAHIALRTARKFL 211 + A L +F+ Sbjct: 1061 TSAAVAAAMLDAVEEFV 1077 >UniRef50_Q460N5 Cluster: Poly [ADP-ribose] polymerase 14; n=23; Euteleostomi|Rep: Poly [ADP-ribose] polymerase 14 - Homo sapiens (Human) Length = 1720 Score = 89.0 bits (211), Expect = 9e-17 Identities = 48/122 (39%), Positives = 71/122 (58%), Gaps = 10/122 (8%) Query: 84 KGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSI----GGCPTGDAKVT 139 +GD+ +L +D VVNA+N LK GG+ A+ +AAGP LQA+CD I G G+A ++ Sbjct: 727 
QGDLARLPVDVVVNASNEDLKHYGGLAAALSKAAGPELQADCDQIVKREGRLLPGNATIS 786 Query: 140 GGYNLPAKYIIHTVGPQDGSAEK------LESCYEKCLSFQQEYQIKSIAFPCISTGIYG 193 LP ++IH VGP+ E L + L ++Y+ +SIA P IS+G++G Sbjct: 787 KAGKLPYHHVIHAVGPRWSGYEAPRCVYLLRRAVQLSLCLAEKYKYRSIAIPAISSGVFG 846 Query: 194 FP 195 FP Sbjct: 847 FP 848 Score = 72.1 bits (169), Expect = 1e-11 Identities = 48/160 (30%), Positives = 74/160 (46%), Gaps = 12/160 (7%) Query: 67 KINTEKNKSISE---RVSIFKGDITKLEIDAVVNAANSRLKAGGG-VDGAIHRAAGPFLQ 122 K + EK +S ++ + K + + D VVN+ L G + ++ AGP LQ Sbjct: 919 KTSWEKGSLVSPGGLQMLLVKEGVQNAKTDVVVNSVPLDLVLSRGPLSKSLLEKAGPELQ 978 Query: 123 AECDSIG---GCPTGDAKVTGGYNLPAKYIIHTVGPQ--DGSAEKL---ESCYEKCLSFQ 174 E D++G G T +NL +Y++H V P+ +GS L E +C+ Sbjct: 979 EELDTVGQGVAVSMGTVLKTSSWNLDCRYVLHVVAPEWRNGSTSSLKIMEDIIRECMEIT 1038 Query: 175 QEYQIKSIAFPCISTGIYGFPNRLAAHIALRTARKFLETN 214 + +KSIAFP I TG GFP + A + + KF N Sbjct: 1039 ESLSLKSIAFPAIGTGNLGFPKNIFAELIISEVFKFSSKN 1078 Score = 59.3 bits (137), Expect = 9e-08 Identities = 44/160 (27%), Positives = 74/160 (46%), Gaps = 9/160 (5%) Query: 82 IFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKVTGG 141 + GDITK E D +VN+ ++ GV AI AG ++ EC D +TGG Sbjct: 1150 VASGDITKEEADVIVNSTSNSFNLKAGVSKAILECAGQNVERECSQQAQQRKNDYIITGG 1209 Query: 142 YNLPAKYIIHTVGPQDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTG-IYGFPNRLAA 200 L K IIH +G D + + S ++C ++ SI P I TG P+++A Sbjct: 1210 GFLRCKNIIHVIGGNDVKS-SVSSVLQEC----EKKNYSSICLPAIGTGNAKQHPDKVAE 1264 Query: 201 HIALRTARKFLETNT--EMNRIIFCTFLPIDVEIYETLMQ 238 I + F++ + + ++ FLP ++++ M+ Sbjct: 1265 AI-IDAIEDFVQKGSAQSVKKVKVVIFLPQVLDVFYANMK 1303 >UniRef50_Q4SK43 Cluster: Chromosome 2 SCAF14570, whole genome shotgun sequence; n=2; Tetraodontidae|Rep: Chromosome 2 SCAF14570, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 418 Score = 88.6 bits (210), Expect = 1e-16 Identities = 53/144 (36%), Positives = 76/144 (52%), Gaps = 11/144 (7%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSI----GGCPTGD 135 VS+ K D+T 
+DAVVNAAN RL+ GG+ A+ +A G +Q + D G TG+ Sbjct: 57 VSVHKADLTNFPVDAVVNAANERLQHVGGIALALSKAGGSQIQQDSDEYIRKNGVLRTGE 116 Query: 136 AKVTGGYNLPAKYIIHTVGPQ-------DGSAEKLESCYEKCLSFQQEYQIKSIAFPCIS 188 + +LP K IIHTVGP +A LE L E +++S+A P IS Sbjct: 117 SVAMDAGSLPCKKIIHTVGPHVTGHSLTASAANLLEKAVLNSLKKADECRLRSVALPAIS 176 Query: 189 TGIYGFPNRLAAHIALRTARKFLE 212 +GI+G+P + A ++ R F E Sbjct: 177 SGIFGYPLKECADTIVKAVRDFCE 200 Score = 46.0 bits (104), Expect = 9e-04 Identities = 39/146 (26%), Positives = 62/146 (42%), Gaps = 9/146 (6%) Query: 85 GDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGP-FLQA-ECDSIGGCPTGDAKVTGGY 142 G I + + + +VN G + AI + AG L+A +C ++G + VT Y Sbjct: 268 GRIDEEQTNVIVNTTQKD-SWDGQISTAILKKAGTKMLKALKCANVGN---RNVIVTEPY 323 Query: 143 NLPAKYIIHTV---GPQDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGFPNRLA 199 NL + HT+ G D + + L +CL + +SIAFP I TG G Sbjct: 324 NLRCAEVYHTLFTAGSTDKAYQILTDAVSECLQLAANHSRQSIAFPAIGTGGRGLEKEKV 383 Query: 200 AHIALRTARKFLETNTEMNRIIFCTF 225 A I KF +++ + F + Sbjct: 384 ASIMSEAVFKFANQSSKQMEVYFVIY 409 >UniRef50_Q10RP7 Cluster: Appr-1-p processing enzyme family protein, expressed; n=3; Magnoliophyta|Rep: Appr-1-p processing enzyme family protein, expressed - Oryza sativa subsp. 
japonica (Rice) Length = 460 Score = 87.0 bits (206), Expect = 4e-16 Identities = 43/102 (42%), Positives = 60/102 (58%), Gaps = 7/102 (6%) Query: 76 ISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGD 135 I+ ++ +++G LE+DAVVN+ N L G +H AAGP L EC ++GGC TG Sbjct: 95 INSKICLWRGHPWNLEVDAVVNSTNENLDEAHSSPG-LHAAAGPGLAEECTTLGGCRTGM 153 Query: 136 AKVTGGYNLPAKYIIHTVGPQ------DGSAEKLESCYEKCL 171 AK+T Y+LPA+ +IHTVGP+ + L CY CL Sbjct: 154 AKMTNAYDLPARKVIHTVGPKYAVKYHTAAENALSHCYRSCL 195 >UniRef50_A1L291 Cluster: LOC799852 protein; n=4; Danio rerio|Rep: LOC799852 protein - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 458 Score = 86.6 bits (205), Expect = 5e-16 Identities = 53/145 (36%), Positives = 81/145 (55%), Gaps = 14/145 (9%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSI----GGCPTGD 135 +S++K D+T+ +++AVVNAAN +L+ GGG+ A+ A GP +Q D I G TG+ Sbjct: 72 ISVWKDDLTQHKVEAVVNAANEKLQHGGGLAQALSMAGGPQIQRWSDDIIKRYGYVKTGE 131 Query: 136 AKVTGGYNLPAKYIIHTVG---PQDGSAEKLESC----YEKCLSFQQ---EYQIKSIAFP 185 A +T NLP KYIIH VG PQ+ + +++ Y S Q I S+A P Sbjct: 132 AVLTPAGNLPFKYIIHAVGPKVPQNPTQKEIGDATPLLYNAITSILQTVLRENITSVAIP 191 Query: 186 CISTGIYGFPNRLAAHIALRTARKF 210 +S+G++ FP A I ++ + F Sbjct: 192 ALSSGLFNFPRDRCADIIVKAIKTF 216 Score = 39.9 bits (89), Expect = 0.057 Identities = 41/153 (26%), Positives = 60/153 (39%), Gaps = 7/153 (4%) Query: 84 KGDITKLEIDAVVNAANSRLKAGGGV-DGAIHRAAGPFLQAEC-DSIGGCPTGDAKV--- 138 +G I +D +VN K GV AI + AG +Q E +KV Sbjct: 289 RGAIEDEMVDVLVNTIAPDCKLHQGVISRAILKKAGDEIQNEIYKKKSNTSFYSSKVLYK 348 Query: 139 TGGYNLPAKYIIHTVGPQDGSAEKLESCYEKCLSFQQEY--QIKSIAFPCISTGIYGFPN 196 T GYNL K + HTV ++ E + L ++ +SI+FP I TG F Sbjct: 349 TKGYNLYCKSVFHTVCAHRSDSKSNEILFNVVLESLKKAAEDYESISFPAIGTGNLDFKK 408 Query: 197 RLAAHIALRTARKFLETNTEMNRIIFCTFLPID 229 A I + +F + N ++ P D Sbjct: 409 WEVAKIMMDAVAEFAKQNKRKKLDVYFVVFPKD 441 >UniRef50_A7S3X0 Cluster: Predicted protein; n=1; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 143 Score = 86.2 bits 
(204), Expect = 7e-16 Identities = 56/143 (39%), Positives = 73/143 (51%), Gaps = 10/143 (6%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSI----GGCPTGD 135 V++++GDIT DAVVNAAN L GGGV GAI G +Q EC I G GD Sbjct: 1 VTVYQGDITNERADAVVNAANCDLIHGGGVAGAILAKGGWSIQEECYQIVGRFGRLEVGD 60 Query: 136 AKVTGGYNLPAKYIIHTVGPQ--DGSAEKLES-CYEKCLS---FQQEYQIKSIAFPCIST 189 A T L K +IH VGP + E++++ + CL + SIAFP IS+ Sbjct: 61 AVQTNAGKLLCKAVIHAVGPTWLGATPEQVKNQLFRACLESLYTADNINLCSIAFPAISS 120 Query: 190 GIYGFPNRLAAHIALRTARKFLE 212 GIYG P + A + L + E Sbjct: 121 GIYGVPKEICAQVMLDVVEHYAE 143 >UniRef50_UPI000023E9A3 Cluster: hypothetical protein FG04612.1; n=1; Gibberella zeae PH-1|Rep: hypothetical protein FG04612.1 - Gibberella zeae PH-1 Length = 606 Score = 85.8 bits (203), Expect = 9e-16 Identities = 66/184 (35%), Positives = 92/184 (50%), Gaps = 26/184 (14%) Query: 80 VSIFKGDITKLE-IDAVVNAANSR-----LKAGGGVDGAIHRAAGPFLQAECDSI----- 128 + ++KGDI L I A+ NAANS+ +D IH AGP L+ EC + Sbjct: 117 IHLWKGDIATLTGITAITNAANSQGLGCFQPTHRCIDNIIHTEAGPRLREECFWLMKKRS 176 Query: 129 GGCPTGDAKVTGGYNLPAKYIIHTVGPQ--------DGSAEKLESCYEKCLSFQQ----- 175 GD VTGG+ L A +IHTVGPQ D +L CY+ L + Sbjct: 177 KDLEPGDLLVTGGHALHASSVIHTVGPQLKRGASPTDLERSQLAKCYKGILDAVELLPPG 236 Query: 176 EYQIKSIAFPCISTGIYGFPNRLAAHIALRTARKFLE--TNTEMNRIIFCTFLPIDVEIY 233 E KS+A CISTG++ FP AA IA+ T +LE ++T + ++F TF D +IY Sbjct: 237 EDGRKSVALCCISTGLFAFPADEAAKIAVSTVTAWLESHSSTTITDVVFNTFTESDTKIY 296 Query: 234 ETLM 237 ++ Sbjct: 297 TAIL 300 >UniRef50_O07733 Cluster: UPF0189 protein Rv1899c/MT1950; n=9; Mycobacterium|Rep: UPF0189 protein Rv1899c/MT1950 - Mycobacterium tuberculosis Length = 359 Score = 84.6 bits (200), Expect = 2e-15 Identities = 53/153 (34%), Positives = 79/153 (51%), Gaps = 6/153 (3%) Query: 73 NKSISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCP 132 N S+ E + + + D+TKLE+DA+ NAAN+RL+ GGV AI RA GP LQ E Sbjct: 186 NVSMIE-LEVHQADVTKLELDAITNAANTRLRHAGGVAAAIARAGGPELQRESTEKAPIG 244 Query: 133 
TGDAKVTGGYNLPAKYIIHTVGPQDG---SAEKLESCYEKCLSFQQEYQIKSIAFPCIST 189 G+A T ++PA+Y+IH + G S E + + L E +S+A T Sbjct: 245 LGEAVETTAGDMPARYVIHAATMELGGPTSGEIITAATAATLRKADELGCRSLALVAFGT 304 Query: 190 GIYGFPNRLAAHIALRTARKFLETNTEMNRIIF 222 G+ GFP AA + + R+ + R++F Sbjct: 305 GVGGFPLDDAARLMVGAVRR--HRPGSLQRVVF 335 >UniRef50_UPI0000660C67 Cluster: Homolog of Oncorhynchus mykiss "VHSV-induced protein-10.; n=1; Takifugu rubripes|Rep: Homolog of Oncorhynchus mykiss "VHSV-induced protein-10. - Takifugu rubripes Length = 1476 Score = 82.2 bits (194), Expect = 1e-14 Identities = 51/132 (38%), Positives = 73/132 (55%), Gaps = 10/132 (7%) Query: 79 RVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSI----GGCPTG 134 +V + + +I L++DAVVNAAN LK GG+ A+ AAGP LQ ++ G TG Sbjct: 481 QVYVSEANICLLDVDAVVNAANEELKHIGGLALALLNAAGPELQKISNNYIARNGALCTG 540 Query: 135 DAKVTGGYNLPAKYIIHTVGPQ------DGSAEKLESCYEKCLSFQQEYQIKSIAFPCIS 188 D VT NLP K++IH VGP+ + S L+ + L ++ +IA P IS Sbjct: 541 DTVVTDACNLPCKHVIHAVGPRFSEHSPEDSVSLLKLVVTRSLKEAEKLNCSTIAMPAIS 600 Query: 189 TGIYGFPNRLAA 200 +G++GFP L A Sbjct: 601 SGMFGFPIDLCA 612 Score = 72.1 bits (169), Expect = 1e-11 Identities = 46/161 (28%), Positives = 75/161 (46%), Gaps = 10/161 (6%) Query: 79 RVSIFKGDITKLEIDAVVNAANSRLKA-GGGVDGAIHRAAGPFLQAECDSIGGCPT---G 134 RV ++KG+I +VN + + G + AI +AAG LQ G + G Sbjct: 683 RVILWKGNIEAQTSCVIVNTISESMNLMQGAISKAILQAAGQSLQTAIQKAAGVSSLLPG 742 Query: 135 DAKVTGGYNLPAKYIIHTVGPQ-----DGSAEKLESCYEKCLSFQQEYQIKSIAFPCIST 189 +T G+NL + + HTV P D + + L S +CL + ++KS++FP I T Sbjct: 743 SVVITDGFNLKCQKVFHTVCPMWTSASDQAEKTLTSIITQCLKEAERLKMKSLSFPAIGT 802 Query: 190 GIYGFPNRLAAHIALRTARKFLETNTEMNRI-IFCTFLPID 229 G+ FP + + + LR T T ++ + +F P D Sbjct: 803 GVLQFPREVVSRVLLREVHNHSRTKTPLHLVEVFIVVHPSD 843 Score = 58.4 bits (135), Expect = 2e-07 Identities = 37/112 (33%), Positives = 52/112 (46%), Gaps = 5/112 (4%) Query: 82 IFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAEC---DSIGGCPTGDAKV 138 + GDITK D ++N++N GV AI AG + EC G G + Sbjct: 899 
VVSGDITKETCDVIINSSNQNFTLKSGVSKAIMNGAGHSVWKECLVKVKAAGSQPGPMIL 958 Query: 139 TGGYNLPAKYIIHTVGPQDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTG 190 T LP + IIH VG Q+ A+ + Y L +E + +S AFP + TG Sbjct: 959 TSAGQLPCRAIIHVVG-QNNPADVKNTVY-SVLKLCEEQKFQSAAFPALGTG 1008 >UniRef50_Q55AK6 Cluster: U box domain-containing protein; n=3; Eukaryota|Rep: U box domain-containing protein - Dictyostelium discoideum AX4 Length = 1618 Score = 82.2 bits (194), Expect = 1e-14 Identities = 56/170 (32%), Positives = 85/170 (50%), Gaps = 8/170 (4%) Query: 73 NKSISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSI---- 128 N S + + I KGDITK + A+VN AN +LK GG +I AAG + C+S Sbjct: 911 NLSNGKIIRIIKGDITKQKTHAIVNPANEKLKNLGGAAFSIQEAAGATFKEFCESYYEKN 970 Query: 129 GGCPTGDAKVTGGYNLPAKYIIHTVGPQDGSAEK---LESCYEKCLSFQQEYQIKSIAFP 185 G TG + + + ++I+TVGP++ + K L L +SI+ P Sbjct: 971 GPIGTGCSVYGSKFKMGNIFVINTVGPKNDNPNKARILHMSIHSSLRSATALNCQSISIP 1030 Query: 186 CISTGIYGFPNRLAAHIALRTARKFLETN-TEMNRIIFCTFLPIDVEIYE 234 ISTGI+G+ + A I +++A +FL TN T +N + F I+E Sbjct: 1031 AISTGIFGYDPKEAVPIIIKSAIEFLLTNETTLNEVNFVDLNQSTANIFE 1080 >UniRef50_Q8IXQ6 Cluster: Poly [ADP-ribose] polymerase 9; n=26; Eutheria|Rep: Poly [ADP-ribose] polymerase 9 - Homo sapiens (Human) Length = 854 Score = 81.0 bits (191), Expect = 2e-14 Identities = 53/153 (34%), Positives = 77/153 (50%), Gaps = 12/153 (7%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSI----GGCPTGD 135 +S++K D+T +DAVVNAAN L GGG+ A+ +A G +Q E G G+ Sbjct: 120 LSVWKDDLTTHAVDAVVNAANEDLLHGGGLALALVKAGGFEIQEESKQFVARYGKVSAGE 179 Query: 136 AKVTGGYNLPAKYIIHTVGP------QDGSAEKLESCYEKCLSF--QQEYQIKSIAFPCI 187 VTG LP K IIH VGP + G KL+ L++ + IK++A P + Sbjct: 180 IAVTGAGRLPCKQIIHAVGPRWMEWDKQGCTGKLQRAIVSILNYVIYKNTHIKTVAIPAL 239 Query: 188 STGIYGFPNRLAAHIALRTARKFLETNTEMNRI 220 S+GI+ FP L + T R L+ M+ + Sbjct: 240 SSGIFQFPLNLCTKTIVETIRVSLQGKPMMSNL 272 Score = 51.2 bits (117), Expect = 2e-05 Identities = 42/159 (26%), Positives = 69/159 (43%), Gaps = 4/159 (2%) Query: 80 
VSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAK-- 137 + I +G I D +VN+ N G V +I + AG +++E + ++ Sbjct: 319 LQIVQGHIEWQTADVIVNSVNPHDITVGPVAKSILQQAGVEMKSEFLATKAKQFQRSQLV 378 Query: 138 -VTGGYNLPAKYIIHTVGPQD-GSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGFP 195 VT G+NL KYI H + + + L+ ++CL E I SI+FP + TG Sbjct: 379 LVTKGFNLFCKYIYHVLWHSEFPKPQILKHAMKECLEKCIEQNITSISFPALGTGNMEIK 438 Query: 196 NRLAAHIALRTARKFLETNTEMNRIIFCTFLPIDVEIYE 234 AA I F + + + + P D+EIY+ Sbjct: 439 KETAAEILFDEVLTFAKDHVKHQLTVKFVIFPTDLEIYK 477 >UniRef50_A7C4X9 Cluster: Putative uncharacterized protein; n=1; Beggiatoa sp. PS|Rep: Putative uncharacterized protein - Beggiatoa sp. PS Length = 220 Score = 79.4 bits (187), Expect = 8e-14 Identities = 54/155 (34%), Positives = 76/155 (49%), Gaps = 10/155 (6%) Query: 92 IDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSI----GGCPTGDAKVTGGYNLPAK 147 +D +VN ANS L GGG+ I AG L+ C I G A VT LP + Sbjct: 28 VDTIVNPANSGLSHGGGLAEQILLEAGSKLEEACHKIIQQQGKISVTKAVVTTAGQLPYQ 87 Query: 148 YIIHTVGPQDGSAE---KLESCYEKCLSFQQEYQIKSIAFPCISTGIYGFPNRLAAHIAL 204 +IH VGP+ G + K+E+ CL ++YQ KSIAFP ISTG++ P + A Sbjct: 88 GVIHAVGPRMGDGKEQSKIETTIINCLQIAEKYQWKSIAFPAISTGLFCVPKTVCAKAFD 147 Query: 205 RTARKFLET--NTEMNRIIFCTFLPIDVEIYETLM 237 + + E N+ + I C L D I+E ++ Sbjct: 148 KAISYYWENHPNSAIKNIWLC-LLTEDYPIFEKIL 181 >UniRef50_UPI0000F2CC14 Cluster: PREDICTED: similar to Poly [ADP-ribose] polymerase 14 (PARP-14) (B aggressive lymphoma protein 2); n=1; Monodelphis domestica|Rep: PREDICTED: similar to Poly [ADP-ribose] polymerase 14 (PARP-14) (B aggressive lymphoma protein 2) - Monodelphis domestica Length = 1874 Score = 78.6 bits (185), Expect = 1e-13 Identities = 52/142 (36%), Positives = 72/142 (50%), Gaps = 11/142 (7%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSI----GGCPTGD 135 +++ KGD+T+ D VVNAAN L+ GG+ A+ AAGP LQ ECD I G G Sbjct: 863 LTVQKGDLTQFPADVVVNAANEELQHHGGLAAALSEAAGPALQRECDQIIKQQGRIRPGC 922 Query: 136 AKVTGGYNLPAKYIIHTVGPQDGSAEK------LESCYEKCLSFQQEYQIKSIAFPCIST 189 A V+G LP + +IH VGP+ 
L++ +CL + SIA P +S+ Sbjct: 923 AVVSGAGQLPYQQVIHAVGPRWRKEHAYRCELLLKNAVTECLYQAELSGHTSIAIPALSS 982 Query: 190 GIYGFPNRLAAH-IALRTARKF 210 G + FP + IAL F Sbjct: 983 GHFDFPLKTCTETIALAIKENF 1004 Score = 66.1 bits (154), Expect = 8e-10 Identities = 54/191 (28%), Positives = 89/191 (46%), Gaps = 17/191 (8%) Query: 12 NRILKLSLEEKRKIYKSSDFI----DLENVDPW----SKYLNKSQGIDSKKSTTDDLKEF 63 + +LK S K K F+ D +N+ + S+Y + + D + +D ++F Sbjct: 1213 SEVLKFSSSRPLKSLKEVYFLLHPSDTDNIQAFKREFSRYTDGTTTSDRASNISDTEEDF 1272 Query: 64 EKIKINTE----KNKSISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGP 119 +++ K K S V + GDITK E + +VN+ N GV AI AAGP Sbjct: 1273 LDTIYDSDLGIYKGKIGSLTVQVAPGDITKEESEVIVNSTNESFLLKNGVSKAILDAAGP 1332 Query: 120 FLQAECDSIGGCPTGDAKVTGGYNLPAKYIIHTVGPQDGSAEKLESCYEKCLSFQQEYQI 179 +++EC + P + +T G NL K IIH +G D + + ++C ++ + Sbjct: 1333 AVESECAQLAVKPHQNYIITQGGNLGCKKIIHVIGGLD-VYKTITDVLQEC----EKMKY 1387 Query: 180 KSIAFPCISTG 190 SI+ P I TG Sbjct: 1388 TSISLPAIGTG 1398 Score = 62.1 bits (144), Expect = 1e-08 Identities = 46/140 (32%), Positives = 66/140 (47%), Gaps = 9/140 (6%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAGGG-VDGAIHRAAGPFLQAECDSIGGCPT---GD 135 + + K DI + D +VN + L+ + AI + AGP LQ E + +G T G Sbjct: 1079 IILIKRDIQDAKSDIIVNTIATDLQLDKAPLSQAILKKAGPELQKELNILGKETTVKPGH 1138 Query: 136 AKVTGGYNLPAKYIIHTVG-PQD---GSAEKL-ESCYEKCLSFQQEYQIKSIAFPCISTG 190 TG YNL K+I+H V P + G+A+ + + + CL + SI FP I TG Sbjct: 1139 VLPTGSYNLDCKFILHVVASPWNNGVGNAKMIMKESIKACLETTDSLSLTSITFPAIGTG 1198 Query: 191 IYGFPNRLAAHIALRTARKF 210 GFP A + L KF Sbjct: 1199 KLGFPKATFAKLILSEVLKF 1218 >UniRef50_Q5V4P3 Cluster: Putative uncharacterized protein; n=2; Halobacteriaceae|Rep: Putative uncharacterized protein - Haloarcula marismortui (Halobacterium marismortui) Length = 166 Score = 78.6 bits (185), Expect = 1e-13 Identities = 43/116 (37%), Positives = 59/116 (50%), Gaps = 3/116 (2%) Query: 82 IFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKVTGG 141 + +GDI DA+VNAAN+ L+ G GV GA+ RAAG L E + G G T Sbjct: 5 
VIQGDIAAQSADALVNAANTSLRMGSGVAGALKRAAGSGLNDEAVAKGPVDLGGVATTDA 64 Query: 142 YNLPAKYIIHTVGPQDG---SAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGF 194 Y+L A+Y+IH G +AE + + L+ +S+ FP I GI GF Sbjct: 65 YDLDAEYVIHAAAMPPGGQSTAESIRNATRNALAEADALNCESVVFPAIGCGIAGF 120 >UniRef50_UPI00015A60CA Cluster: UPI00015A60CA related cluster; n=1; Danio rerio|Rep: UPI00015A60CA UniRef100 entry - Danio rerio Length = 369 Score = 77.8 bits (183), Expect = 2e-13 Identities = 46/143 (32%), Positives = 75/143 (52%), Gaps = 10/143 (6%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSI---GGCPTGDA 136 +++ K D+ ++DAVV A L GG+ A+ AAGP LQ +CD + TGDA Sbjct: 4 ITVHKADMCSFQVDAVVGACKETLLLDGGLAKALSDAAGPKLQKDCDKLVKGRKFTTGDA 63 Query: 137 -KVTGGYNLPAKYIIHTVGPQDGSAEKLES------CYEKCLSFQQEYQIKSIAFPCIST 189 + G L K++I +GP S++ ES ++ L+ + +SIA P IS+ Sbjct: 64 VLLDAGGRLHCKHVILAIGPHYNSSKPQESEKLLKKAVKRSLNVADQESFQSIAIPAISS 123 Query: 190 GIYGFPNRLAAHIALRTARKFLE 212 G++GFP L A ++ ++F + Sbjct: 124 GVFGFPMDLCAFTIVKAIKEFCD 146 Score = 74.5 bits (175), Expect = 2e-12 Identities = 54/176 (30%), Positives = 81/176 (46%), Gaps = 9/176 (5%) Query: 44 LNKSQGIDSKKSTTDDLKEFEKIKINTEKNKSISER---VSIFKGDITKLEIDAVVNAAN 100 + K G+ + +T ++ K + ++ ++ +++ KG+I +D VVN + Sbjct: 174 VKKVYGVSDQSTTGSSSSSQQQNKASASPSQHQTKEGLTITLMKGNIEDTTMDVVVNTLS 233 Query: 101 SRLKAG-GGVDGAIHRAAGPFLQAECD--SIGGCPTGDAKVTGGYNLPAKYIIHTVGP-- 155 S LK G V A+ +AAGP LQ D + G +G T G NL K + H V P Sbjct: 234 SDLKLNVGAVSNALFKAAGPQLQDLLDQQATGPASSGAVFETAGANLKNKLVFHAVVPHW 293 Query: 156 -QDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGFPNRLAAHIALRTARKF 210 Q E LE+ + CL ++ Q SI F I TG GFP L L + KF Sbjct: 294 NQGQGNEVLENVMDTCLCKAEQRQQSSIVFSAIGTGNLGFPKSLVVSTMLDSVFKF 349 >UniRef50_Q7QZY2 Cluster: GLP_23_42584_43678; n=1; Giardia lamblia ATCC 50803|Rep: GLP_23_42584_43678 - Giardia lamblia ATCC 50803 Length = 364 Score = 77.4 bits (182), Expect = 3e-13 Identities = 56/200 (28%), Positives = 94/200 (47%), Gaps = 25/200 (12%) Query: 68 
INTEK-NKSISERVSIFKGDITKLEIDAVVNAANSRLKAGGG--VDGAIHRAAGPFLQAE 124 +N K N I++R+ + +GD+T L +A + + L G V+ +H AGP L E Sbjct: 116 VNIPKPNNEINKRICVVQGDLTALRTEAYIVPVSPSLSGADGSEVNALVHAKAGPQLHTE 175 Query: 125 CDSIGGC-PTGDAKVTGGYNLPAK------------YIIHTVGPQDGSAEKLESCYEKCL 171 +G TG+A +T YN+ A +++HT+ P+ A L+SCYE+ L Sbjct: 176 LKRVGATLRTGEACLTRAYNVGADDPDEETGLLYPMFLLHTLTPKTEDAAALKSCYERTL 235 Query: 172 SFQQEYQIKSIAFPCIS------TGIYGFPNRLAAHIALRTARKFLETNTEMNRI---IF 222 ++++IA P ++ G +P + H+ L R +L+ +R+ I Sbjct: 236 YIALSEELRTIATPILAGVPYPRAGTEYYPLVGSIHVMLSVLRSWLDRQDVRDRVDLFII 295 Query: 223 CTFLPIDVEIYETLMQLYFP 242 C + I + LM LYFP Sbjct: 296 CCATDRETHILQELMPLYFP 315 >UniRef50_O75367 Cluster: Core histone macro-H2A.1; n=179; Eukaryota|Rep: Core histone macro-H2A.1 - Homo sapiens (Human) Length = 372 Score = 77.0 bits (181), Expect = 4e-13 Identities = 54/203 (26%), Positives = 100/203 (49%), Gaps = 15/203 (7%) Query: 46 KSQGIDSKKSTTDDLKE---FEKIKINTEKNKSISERVSIFKGDITKL---EIDAVVNAA 99 K QG SK ++ D E + + + K+ + +++++ +I+ L E++A++N Sbjct: 160 KKQGEVSKAASADSTTEGTPADGFTVLSTKSLFLGQKLNLIHSEISNLAGFEVEAIINPT 219 Query: 100 NSRLKAGGGVDGAIHRAAGP-FLQAECD---SIGGCPTGDAKVTGGYNLPAKYIIHTVGP 155 N+ + + + + G F++A + G A V+ G+ LPAK++IH P Sbjct: 220 NADIDLKDDLGNTLEKKGGKEFVEAVLELRKKNGPLEVAGAAVSAGHGLPAKFVIHCNSP 279 Query: 156 ---QDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGFPNRLAAHIALRT-ARKFL 211 D E LE + CL+ + ++KSIAFP I +G GFP + AA + L+ + F+ Sbjct: 280 VWGADKCEELLEKTVKNCLALADDKKLKSIAFPSIGSGRNGFPKQTAAQLILKAISSYFV 339 Query: 212 ET-NTEMNRIIFCTFLPIDVEIY 233 T ++ + + F F + IY Sbjct: 340 STMSSSIKTVYFVLFDSESIGIY 362 >UniRef50_A1R2V6 Cluster: Putative uncharacterized protein; n=2; Micrococcineae|Rep: Putative uncharacterized protein - Arthrobacter aurescens (strain TC1) Length = 152 Score = 74.9 bits (176), Expect = 2e-12 Identities = 43/117 (36%), Positives = 60/117 (51%), Gaps = 10/117 (8%) Query: 109 VDGAIHRAAGPFLQAECDSIG------GCPTGDAKVTGGYNLPAKYIIHTVGPQDGSAEK 162 +DGAIHRAAG L C + G P G A T + LPA ++IHTVGP + + Sbjct: 
1 MDGAIHRAAGSELLEACRELRRTELPEGLPVGAAVATPAFRLPAHWVIHTVGPNRHAGQT 60 Query: 163 ----LESCYEKCLSFQQEYQIKSIAFPCISTGIYGFPNRLAAHIALRTARKFLETNT 215 L SC+ + L +S+AFP IS GIYG+ +R A +A F +++ Sbjct: 61 DPALLASCFRESLKVAAGLGARSLAFPAISAGIYGWDSRQVAEVAFDAVGSFSSSSS 117 >UniRef50_A0CX06 Cluster: Chromosome undetermined scaffold_3, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_3, whole genome shotgun sequence - Paramecium tetraurelia Length = 1064 Score = 74.9 bits (176), Expect = 2e-12 Identities = 58/155 (37%), Positives = 80/155 (51%), Gaps = 18/155 (11%) Query: 66 IKINTEKNKSISERVSIFKGDITKLE-IDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAE 124 +K K K + + + I DIT+++ +DA+VN A+ LK GG+ GA+ RAAG L E Sbjct: 690 VKKTPMKIKILEQSIIIHNQDITQIKGVDAIVNVADPNLKNRGGICGAVFRAAGENLLEE 749 Query: 125 -----CDSIGGCP--TGDAKVTGGYNLPA----KYIIHTVGP----QDG--SAEKLESCY 167 + +G T + VT Y L KYIIH VGP QD S E+L +C Sbjct: 750 EINMLFNKLGRKQPETSEVIVTKSYRLGQENGPKYIIHAVGPKYNPQDPQKSKEQLNTCI 809 Query: 168 EKCLSFQQEYQIKSIAFPCISTGIYGFPNRLAAHI 202 L QEY+I S+A P IS + FP ++ A I Sbjct: 810 VNILQKCQEYKITSVAIPPISEKNFDFPKQICAQI 844 >UniRef50_Q9YBE9 Cluster: UPF0189 protein APE_1648.1; n=1; Aeropyrum pernix|Rep: UPF0189 protein APE_1648.1 - Aeropyrum pernix Length = 189 Score = 74.9 bits (176), Expect = 2e-12 Identities = 42/124 (33%), Positives = 69/124 (55%), Gaps = 5/124 (4%) Query: 75 SISERV-SIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPT 133 ++ +RV ++ GD+TK+ +AVVN ANS + GGG GA+ RA G ++ E P Sbjct: 5 TLGDRVLAVSMGDLTKVRAEAVVNPANSLMIMGGGAAGALKRAGGSVIEEEAMRKAPVPV 64 Query: 134 GDAKVTGGYNLPAKYIIHTVGPQD-GSAEKLESCYE---KCLSFQQEYQIKSIAFPCIST 189 G+A +T G +LPA+++IH ++ G L + ++ L E I+S+A P + Sbjct: 65 GEAVITSGGSLPARFVIHAPTMEEPGMRIPLVNAFKASYAALRLASEAGIESVAMPAMGA 124 Query: 190 GIYG 193 G+ G Sbjct: 125 GVGG 128 >UniRef50_UPI0000F1EDA9 Cluster: PREDICTED: similar to Poly [ADP-ribose] polymerase 14 (PARP-14) (B aggressive lymphoma protein 2); n=1; Danio rerio|Rep: PREDICTED: similar to Poly 
[ADP-ribose] polymerase 14 (PARP-14) (B aggressive lymphoma protein 2) - Danio rerio Length = 1419 Score = 74.5 bits (175), Expect = 2e-12 Identities = 46/150 (30%), Positives = 73/150 (48%), Gaps = 4/150 (2%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKVT 139 + + GDITK++++AVVN+ N+ L GV GAI +A+GP + EC + P +T Sbjct: 904 IRVSSGDITKVKVEAVVNSTNTSLNLSSGVSGAILKASGPTVVKECKAKAPQPEDGVVLT 963 Query: 140 GGYNLP-AKYIIHTVGPQDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGFPNRL 198 NL +I+H VG S + S K L +E I+S++FP + TG P Sbjct: 964 RAGNLTNCTHIVHMVG--QTSRTGIRSSMAKVLKTCEENHIRSVSFPALGTGAGHLPAAA 1021 Query: 199 AAHIALRTARKFLETNTE-MNRIIFCTFLP 227 A F++ + + + R+ F P Sbjct: 1022 VADAMTTALADFVKDSPKHLKRVHIVIFQP 1051 Score = 43.2 bits (97), Expect = 0.006 Identities = 22/46 (47%), Positives = 26/46 (56%) Query: 84 KGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIG 129 KGDITK D +VN+ N L GV GAI +AAG + EC G Sbjct: 624 KGDITKEAADVIVNSTNKTLDLNTGVSGAILKAAGRSVVDECKKRG 669 Score = 41.9 bits (94), Expect = 0.014 Identities = 33/111 (29%), Positives = 48/111 (43%), Gaps = 15/111 (13%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKVT 139 + + KG IT + +VN N + GG D + LQ + DA VT Sbjct: 741 IEVRKGSITTESVRGIVNTTNRDMSRRGGQDVTVQHCP---LQGD----------DAAVT 787 Query: 140 GGYNLPAKYIIHTVGPQDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTG 190 L I+H +GP SA + + K L +E QI +++FP I TG Sbjct: 788 AAGLLHCDLILHMLGPH--SAAESRTRVRKVLERCEEKQITTVSFPAIGTG 836 Score = 39.5 bits (88), Expect = 0.075 Identities = 30/118 (25%), Positives = 51/118 (43%), Gaps = 3/118 (2%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKVT 139 +SI +G + L DA++ +S+L V A+ G + C + GD + Sbjct: 432 LSITEGALQHLAADALLCPLDSKLGFSDPVAQAVLHFRGESIADTCGTQKSPQPGDVLLG 491 Query: 140 GGYNLPAKYIIHTVGPQDGS---AEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGF 194 L ++ V PQ G +++L+S L +E+ SIA P + G +GF Sbjct: 492 SAGRLGVGMLLLAVLPQKGQPQDSQRLQSAVCNSLRKAEEHSCSSIALPPVGCGTFGF 549 >UniRef50_UPI000065ED3A Cluster: Homolog of Oncorhynchus mykiss 
"VHSV-induced protein-10.; n=1; Takifugu rubripes|Rep: Homolog of Oncorhynchus mykiss "VHSV-induced protein-10. - Takifugu rubripes Length = 1083 Score = 74.1 bits (174), Expect = 3e-12 Identities = 47/120 (39%), Positives = 64/120 (53%), Gaps = 9/120 (7%) Query: 90 LEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSI---GGCPTGDAKVTGGYNLPA 146 L++DAVVNAAN LK GG A+ AAG + + I G TGD VT NLP Sbjct: 362 LDVDAVVNAANEELKHIGGPALALLNAAGELQKISNNYIARNGALRTGDTVVTDACNLPC 421 Query: 147 KYIIHTVGPQ------DGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGFPNRLAA 200 K++IH VGP+ + S L+ + L ++ +IA P IS+G++GFP L A Sbjct: 422 KHVIHAVGPRFSEHSPEDSVPLLKLVVTRSLKEAEKLNCSTIAMPAISSGMFGFPIDLCA 481 >UniRef50_Q4RG95 Cluster: Chromosome 12 SCAF15104, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome 12 SCAF15104, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 1433 Score = 72.1 bits (169), Expect = 1e-11 Identities = 47/132 (35%), Positives = 68/132 (51%), Gaps = 10/132 (7%) Query: 79 RVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSI----GGCPTG 134 ++S+ + D+ L++DAVVN AN L+ GG+ A+ AAGP LQ + G G Sbjct: 501 QLSVSQADLCALQVDAVVNPANENLQHTGGLALALLEAAGPELQNTSNLYVAVNGALCAG 560 Query: 135 DAKVTGGYNLPAKYIIHTVGPQ--DGSAEK----LESCYEKCLSFQQEYQIKSIAFPCIS 188 T LP K++IH VGP+ D S E+ L + L + S+A P IS Sbjct: 561 QVIATDACRLPCKHVIHAVGPRFSDHSREESVLLLRRVVTQSLREAERLGCTSVAVPAIS 620 Query: 189 TGIYGFPNRLAA 200 +G++GFP L A Sbjct: 621 SGVFGFPLSLCA 632 Score = 65.3 bits (152), Expect = 1e-09 Identities = 50/163 (30%), Positives = 73/163 (44%), Gaps = 12/163 (7%) Query: 79 RVSIFKGDITKLEIDAVVNAANSRLKAG-GGVDGAIHRAAGPFLQA------ECDSIGGC 131 RV + KG+I +VN + + G V A+ RAAG LQA + Sbjct: 733 RVVLCKGNIEDQRSCVIVNTISETMNLDQGAVSRALLRAAGKGLQAAVLKEARLARLDQL 792 Query: 132 PTGDAKVTGGYNLPAKYIIHTVGPQDGS---AEK-LESCYEKCLSFQQEYQIKSIAFPCI 187 G VT G+ L + + H V PQ + AEK L S +CL + +++S++FP I Sbjct: 793 DPGSLLVTDGFKLRCQKVFHAVCPQWSASYQAEKTLTSIISRCLKEAERLKMRSLSFPAI 852 Query: 188 STGIYGFPNRLAAHIALRTARKFLETNTEMNRI-IFCTFLPID 229 
TG+ FP L A + L R F T + + +F P D Sbjct: 853 GTGLLSFPKDLVARVLLEEVRTFSRKKTPQHLLKVFVVVHPSD 895 Score = 60.5 bits (140), Expect = 4e-08 Identities = 39/117 (33%), Positives = 56/117 (47%), Gaps = 5/117 (4%) Query: 82 IFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDS---IGGCPTGDAKV 138 + GDIT+ D ++N++N GV AI AG +Q EC G P G V Sbjct: 942 VLSGDITRETCDVIINSSNRDFTLKSGVSKAILDGAGWAVQVECAQQARAQGHPPGHMIV 1001 Query: 139 TGGYNLPAKYIIHTVGPQDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGFP 195 T LP+K I+H V + A+ ++S L +E +S AFP + TG+ G P Sbjct: 1002 TSAGRLPSKAIVH-VSISNNPAD-IKSTVYAALKLCEEKTFRSAAFPALGTGVGGVP 1056 >UniRef50_O67112 Cluster: UPF0189 protein aq_987; n=3; cellular organisms|Rep: UPF0189 protein aq_987 - Aquifex aeolicus Length = 165 Score = 71.7 bits (168), Expect = 2e-11 Identities = 43/135 (31%), Positives = 62/135 (45%), Gaps = 4/135 (2%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKVT 139 + + KG IT+++ D +VN ANSR GGGV I R G ++ E P G A +T Sbjct: 3 IKVVKGSITEVDADVIVNPANSRGLMGGGVAVVIKRLGGEEIEREAVEKAPIPVGSAVLT 62 Query: 140 GGYNLPAKYIIHTVGPQD----GSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGFP 195 L K +IH ++ S EK+ L + K +A P + TG+ G P Sbjct: 63 TAGKLKFKGVIHAPTMEEPAMPSSEEKVRKATRAALELADKECFKIVAIPGMGTGVGGVP 122 Query: 196 NRLAAHIALRTARKF 210 +AA + RKF Sbjct: 123 KEVAARAMVEEIRKF 137 >UniRef50_Q5KUT6 Cluster: Hypothetical conserved protein; n=2; Geobacillus|Rep: Hypothetical conserved protein - Geobacillus kaustophilus Length = 161 Score = 71.3 bits (167), Expect = 2e-11 Identities = 45/121 (37%), Positives = 67/121 (55%), Gaps = 10/121 (8%) Query: 80 VSIFKGDITKLE-IDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAE----CDSIGGCPTG 134 +S GD+TK+E ++ + NAAN GGGV AIHRA G ++ E C + P G Sbjct: 2 ISAMVGDLTKVEGVEYICNAANGIGPMGGGVAAAIHRAGGRVIEEEAIRVCQAQDPQP-G 60 Query: 135 DAKVTGGYNLPAKYIIHTVGPQD----GSAEKLESCYEKCLSFQQEYQIKSIAFPCISTG 190 D VTG +LP + +IH V + S E + SC E+ ++ +E+ IK +A P + TG Sbjct: 61 DLYVTGAGSLPFRGVIHLVTMKQPAGATSYEIVRSCLERLVAHCREHGIKKVALPALGTG 120 Query: 191 I 191 + Sbjct: 121 V 121 
>UniRef50_A2BJA7 Cluster: A1pp, Appr-1-p processing enzyme; n=1; Hyperthermus butylicus DSM 5456|Rep: A1pp, Appr-1-p processing enzyme - Hyperthermus butylicus (strain DSM 5456 / JCM 9403) Length = 199 Score = 70.1 bits (164), Expect = 5e-11 Identities = 50/159 (31%), Positives = 75/159 (47%), Gaps = 7/159 (4%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKVT 139 V I +GDIT+ E +AVVN ANS + GGGV GA+ RAAGP ++ E P G+A T Sbjct: 16 VEIARGDITEAECEAVVNPANSLMIMGGGVAGALRRAAGPEVEEEARRKAPVPVGEAIHT 75 Query: 140 GGYNLP--AKYIIHTVGPQDGSAE----KLESCYEKCLSFQQEYQIKSIAFPCISTGIYG 193 G L KYIIH + + K+ L ++ + +A P + G+ G Sbjct: 76 GAGRLEPRIKYIIHAPTMERPAMRTTQGKVVKAVLAALREAEKLNVGCLALPAMGAGVGG 135 Query: 194 FPNRLAAHIALRTARKFLETNTEM-NRIIFCTFLPIDVE 231 R + + +FL + ++ RII + D + Sbjct: 136 LTARESLEAIMEALDEFLGSGGKLPPRIILVAYSERDAK 174 >UniRef50_A3DLM0 Cluster: Appr-1-p processing domain protein; n=1; Staphylothermus marinus F1|Rep: Appr-1-p processing domain protein - Staphylothermus marinus (strain ATCC 43588 / DSM 3639 / F1) Length = 192 Score = 69.3 bits (162), Expect = 8e-11 Identities = 43/133 (32%), Positives = 66/133 (49%), Gaps = 4/133 (3%) Query: 84 KGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKVTGGYN 143 KGDIT+L+++A+VN ANS + GGG+ G + R G ++ E P G A VT Sbjct: 21 KGDITELDVEAIVNPANSFMLMGGGLAGVLKRKGGEIIENEAKKFAPVPVGKAVVTIAGV 80 Query: 144 LPAKYIIHTVGPQDGSAE-KLESCYE---KCLSFQQEYQIKSIAFPCISTGIYGFPNRLA 199 L AKYIIH + + E+ Y+ L+ + + IA P + TG+ G A Sbjct: 81 LKAKYIIHAPTMEKPAMRINPENAYKATFAALTKAFDLSLNRIAVPGMGTGVGGLSPSDA 140 Query: 200 AHIALRTARKFLE 212 + ++FL+ Sbjct: 141 GKAMAKAIKEFLD 153 >UniRef50_UPI0001556316 Cluster: PREDICTED: similar to LRP16 protein; n=1; Ornithorhynchus anatinus|Rep: PREDICTED: similar to LRP16 protein - Ornithorhynchus anatinus Length = 169 Score = 68.9 bits (161), Expect = 1e-10 Identities = 32/78 (41%), Positives = 48/78 (61%), Gaps = 6/78 (7%) Query: 148 YIIHTVGP------QDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGFPNRLAAH 201 ++IHTVGP A++L SCY L E 
+++S+AFPCISTG++G+PN AA Sbjct: 79 HVIHTVGPIAQGEPSPSQAQELRSCYLNSLQLVLENRLRSVAFPCISTGVFGYPNEAAAK 138 Query: 202 IALRTARKFLETNTEMNR 219 + L R++LE + + R Sbjct: 139 VVLTALREWLEEHKDKIR 156 >UniRef50_Q1YRE7 Cluster: Putative uncharacterized protein; n=1; gamma proteobacterium HTCC2207|Rep: Putative uncharacterized protein - gamma proteobacterium HTCC2207 Length = 167 Score = 68.5 bits (160), Expect = 1e-10 Identities = 48/163 (29%), Positives = 78/163 (47%), Gaps = 18/163 (11%) Query: 79 RVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKV 138 R+ I +G I L ++AVV+ + GA+ R A A D + GD V Sbjct: 13 RIKIHQGKIATLNVEAVVSCYSQ--------SGALERLA----VASGDGLVPLRIGDVHV 60 Query: 139 TG-GYNLPAKYIIHTVGPQ----DGSAEK-LESCYEKCLSFQQEYQIKSIAFPCISTGIY 192 + ++ +I +GP+ D E+ L SCY K + ++Y ++SIAF IS G Sbjct: 61 VAEAVEVTSRILIEAIGPRWRGGDYQEEQQLASCYSKAMDVAKQYNVRSIAFTPISCGPL 120 Query: 193 GFPNRLAAHIALRTARKFLETNTEMNRIIFCTFLPIDVEIYET 235 GFP A ++A++ + L N + +IFC F P+ +Y + Sbjct: 121 GFPANRATNVAIQQIKLGLGRNPLIESVIFCCFDPVTTALYRS 163 >UniRef50_Q99IE7 Cluster: Non-structural polyprotein p200 (p200) [Contains: Protease p150 (EC 3.4.22.-) (p150); RNA-directed RNA polymerase/triphosphatase/helicase p90 (EC 2.7.7.48) (EC 3.6.1.15) (EC 3.6.1.-) (p90)]; n=113; root|Rep: Non-structural polyprotein p200 (p200) [Contains: Protease p150 (EC 3.4.22.-) (p150); RNA-directed RNA polymerase/triphosphatase/helicase p90 (EC 2.7.7.48) (EC 3.6.1.15) (EC 3.6.1.-) (p90)] - Rubella virus (strain TO-336 vaccine) (RUBV) Length = 2116 Score = 66.5 bits (155), Expect = 6e-10 Identities = 43/123 (34%), Positives = 58/123 (47%), Gaps = 10/123 (8%) Query: 95 VVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKVTGGYNLPAKYIIHTVG 154 VVNAAN L AG GV GAI A L A+C + CPTG+A T G+ +IIH V Sbjct: 836 VVNAANEGLLAGSGVCGAIFANATAALAADCRRLAPCPTGEAVATPGHGCGYTHIIHAVA 895 Query: 155 P---------QDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGFPNRLAAHIALR 205 P ++G A LE Y ++ + +A P + G+YG+ + AL Sbjct: 896 PRRPRDPAALEEGEA-LLERAYRSIVALAAARRWACVACPLLGAGVYGWSAAESLRAALA 954 
Query: 206 TAR 208 R Sbjct: 955 ATR 957 >UniRef50_UPI00004D69C1 Cluster: poly (ADP-ribose) polymerase family, member 15; n=1; Xenopus tropicalis|Rep: poly (ADP-ribose) polymerase family, member 15 - Xenopus tropicalis Length = 387 Score = 65.7 bits (153), Expect = 1e-09 Identities = 43/133 (32%), Positives = 63/133 (47%), Gaps = 3/133 (2%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAG-GGVDGAIHRAAGPFLQAECDSIGGCPTGDAKV 138 V + KGDIT DA+VN N L GV I AAG ++ EC +G P GD Sbjct: 9 VMLKKGDITAECTDAIVNINNDSLVQNFAGVSKEILSAAGDLVKEECYLLGQQPHGDVVE 68 Query: 139 TGGYNLPAKYIIHTVGPQDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGFPNRL 198 TG NL + +IH +G D + + + +K L + + S+AFP + TG G + Sbjct: 69 TGAGNLQCRKLIHVIGASDWYS--IIAGVKKVLEKCDQLHLISVAFPALGTGAGGLSAKR 126 Query: 199 AAHIALRTARKFL 211 + L ++L Sbjct: 127 SMEAILTATEEYL 139 >UniRef50_A3EXC9 Cluster: Replicase polyprotein 1ab (pp1ab) (ORF1ab polyprotein) [Includes: Replicase polyprotein 1a (pp1a) (ORF1A)] [Contains: Non-structural protein 1 (nsp1) (Leader protein); Non-structural protein 2 (nsp2) (p65 homolog); Non-structural protein 3 (EC 3.4.22.-) (nsp3) (Papain- like proteinase) (PL-PRO) (PL2-PRO); Non-structural protein 4 (nsp4); 3C-like proteinase (EC 3.4.22.-) (3CL-PRO) (3CLp) (nsp5); Non- structural protein 6 (nsp6); Non-structural protein 7 (nsp7); Non- structural protein 8 (nsp8); Non-structural protein 9 (nsp9); Non- structural protein 10 (nsp10) (Growth factor-like peptide) (GFL); RNA- directed RNA polymerase (EC 2.7.7.48) (RdRp) (Pol) (nsp12); Helicase (Hel) (nsp13); Exoribonuclease (EC 3.1.13.-) (ExoN) (nsp14); Uridylate-specific endoribonuclease (EC 3.1.-.-) (NendoU) (nsp15); Putative 2'-O-methyl transferase (EC 2.1.1.-) (nsp16)]; n=49; Coronavirus|Rep: Replicase polyprotein 1ab (pp1ab) (ORF1ab polyprotein) [Includes: Replicase polyprotein 1a (pp1a) (ORF1A)] [Contains: Non-structural protein 1 (nsp1) (Leader protein); Non-structural protein 2 (nsp2) (p65 homolog); Non-structural protein 3 (EC 3.4.22.-) (nsp3) (Papain- like proteinase) 
(PL-PRO) (PL2-PRO); Non-structural protein 4 (nsp4); 3C-like proteinase (EC 3.4.22.-) (3CL-PRO) (3CLp) (nsp5); Non- structural protein 6 (nsp6); Non-structural protein 7 (nsp7); Non- structural protein 8 (nsp8); Non-structural protein 9 (nsp9); Non- structural protein 10 (nsp10) (Growth factor-like peptide) (GFL); RNA- directed RNA polymerase (EC 2.7.7.48) (RdRp) (Pol) (nsp12); Helicase (Hel) (nsp13); Exoribonuclease (EC 3.1.13.-) (ExoN) (nsp14); Uridylate-specific endoribonuclease (EC 3.1.-.-) (NendoU) (nsp15); Putative 2'-O-methyl transferase (EC 2.1.1.-) (nsp16)] - Bat coronavirus HKU5 (BtCoV) (BtCoV/HKU5/2004) Length = 7182 Score = 65.3 bits (152), Expect = 1e-09 Identities = 50/170 (29%), Positives = 81/170 (47%), Gaps = 14/170 (8%) Query: 53 KKSTTDDLKEFEKIKINTEKNKSISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGA 112 K + LK F+ I +N + + +++ + E +VNAAN+ LK GGG+ A Sbjct: 1180 KPKAENPLKNFKHIVLNNDVTLVFGDAIAVARAT----EDCILVNAANTHLKHGGGIAAA 1235 Query: 113 IHRAAGPFLQAECDS----IGGCPTGDAKVTGGYNLPAKYIIHTVGPQDGSAEKLESCYE 168 I RA+G +QAE D G GD+ + G+ L A I+H VGP D A + + Sbjct: 1236 IDRASGGLVQAESDDYVNFYGPLNVGDSTLLKGHGL-ATGILHVVGP-DARANQDIQLLK 1293 Query: 169 KCLSFQQEYQIKSIAFPCISTGIYGFPNRLAAHIALRT--ARKFLETNTE 216 +C +Y + + P IS GI+ R++ L + ++ N+E Sbjct: 1294 RCYKAFNKYPL--VVSPLISAGIFCVEPRVSLEYLLSVVHTKTYVVVNSE 1341 >UniRef50_O28751 Cluster: UPF0189 protein AF_1521; n=25; Euryarchaeota|Rep: UPF0189 protein AF_1521 - Archaeoglobus fulgidus Length = 192 Score = 64.5 bits (150), Expect = 2e-09 Identities = 52/150 (34%), Positives = 71/150 (47%), Gaps = 19/150 (12%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRA----AGPFLQ----AECDSIGG- 130 + + +GDIT+ A+VNAAN RL+ GGGV AI +A AG + + A + G Sbjct: 14 LKLAQGDITQYPAKAIVNAANKRLEHGGGVAYAIAKACAGDAGLYTEISKKAMREQFGRD 73 Query: 131 -CPTGDAKVTGGYNLP---AKYIIHTVGP------QDGSAEKLESCYEKCLSFQQEYQIK 180 G+ VT NL KY+ HTVGP + EKL + L +E ++ Sbjct: 74 YIDHGEVVVTPAMNLEERGIKYVFHTVGPICSGMWSEELKEKLYKAFLGPLEKAEEMGVE 133 Query: 181 SIAFPCISTGIYGFPNRLAAHIALRTARKF 210 SIAFP 
+S GIYG L + F Sbjct: 134 SIAFPAVSAGIYGCDLEKVVETFLEAVKNF 163 >UniRef50_Q9P0M6 Cluster: Core histone macro-H2A.2; n=74; Eukaryota|Rep: Core histone macro-H2A.2 - Homo sapiens (Human) Length = 372 Score = 63.7 bits (148), Expect = 4e-09 Identities = 52/204 (25%), Positives = 99/204 (48%), Gaps = 16/204 (7%) Query: 46 KSQGIDS-KKSTTDDLKEF---EKIKINTEKNKSISERVSIFKGDIT---KLEIDAVVNA 98 KS+ DS K+ T++ E + I + K+ + +++S+ + DI+ + ++ +V+ Sbjct: 159 KSKPKDSDKEGTSNSTSEDGPGDGFTILSSKSLVLGQKLSLTQSDISHIGSMRVEGIVHP 218 Query: 99 ANSRLKAGGGVDGAIHRAAGP-FLQAECD---SIGGCPTGDAKVTGGYNLPAKYIIHTVG 154 + + + A+ +A G FL+ + S G +A V+ L AK++IH Sbjct: 219 TTAEIDLKEDIGKALEKAGGKEFLETVKELRKSQGPLEVAEAAVSQSSGLAAKFVIHCHI 278 Query: 155 PQDGS---AEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGFPNRLAAHIALRTARKFL 211 PQ GS E+LE + CLS ++ ++KS+AFP +G FP + AA + L+ Sbjct: 279 PQWGSDKCEEQLEETIKNCLSAAEDKKLKSVAFPPFPSGRNCFPKQTAAQVTLKAISAHF 338 Query: 212 ETN--TEMNRIIFCTFLPIDVEIY 233 + + + + + F F + IY Sbjct: 339 DDSSASSLKNVYFLLFDSESIGIY 362 >UniRef50_UPI00005A5611 Cluster: PREDICTED: similar to poly (ADP-ribose) polymerase family, member 14; n=1; Canis lupus familiaris|Rep: PREDICTED: similar to poly (ADP-ribose) polymerase family, member 14 - Canis familiaris Length = 575 Score = 63.3 bits (147), Expect = 5e-09 Identities = 47/152 (30%), Positives = 74/152 (48%), Gaps = 12/152 (7%) Query: 68 INTEKNKSISERVSIFKGDITKLEIDAVVNAANSRLKAGGG-VDGAIHRAAGPFLQAECD 126 +NT + S+S + D ++ D +VN L+ GGG + A+ + AGP LQ E Sbjct: 95 VNTPCDSSLSTTMD---DDDIRVVADVIVNTVPMNLQLGGGQLSQALLQKAGPELQKELY 151 Query: 127 SIG-GCP--TGDAKVTGGYNLPAKYIIHTVGPQ----DGSAEKL-ESCYEKCLSFQQEYQ 178 + G G +T G NL K ++H V P GS++++ + +KCL+ +E+ Sbjct: 152 ATRQGTEEEVGSIFMTSGCNLNCKAVLHVVAPHWDNGAGSSQQIMANIIKKCLTTVEEFS 211 Query: 179 IKSIAFPCISTGIYGFPNRLAAHIALRTARKF 210 SI FP I TG FP + A + L +F Sbjct: 212 FSSITFPMIGTGSLRFPKAIFAELILSEVFRF 243 Score = 60.5 bits (140), Expect = 4e-08 Identities = 37/112 (33%), Positives = 54/112 (48%), Gaps = 5/112 (4%) Query: 82 
IFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKVTGG 141 I GDITK + D +VN+ GV A+ AGP ++ EC P G+ +T G Sbjct: 319 IATGDITKEKADVIVNSTTRTFNLKSGVSKAVLEGAGPAVENECAVRAAQPHGEFIITQG 378 Query: 142 YNLPAKYIIHTVGPQDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYG 193 L K IIH +G D + + + E+C ++ + S+A P I TG G Sbjct: 379 GYLMCKIIIHVLGDND-VRKTVSAVLEEC----EQRKYTSVALPAIGTGSAG 425 >UniRef50_UPI0000ECC933 Cluster: C20orf133 protein.; n=3; Gallus gallus|Rep: C20orf133 protein. - Gallus gallus Length = 159 Score = 63.3 bits (147), Expect = 5e-09 Identities = 33/92 (35%), Positives = 55/92 (59%), Gaps = 12/92 (13%) Query: 7 WEIEKNRILKLSLEEKRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDLKEFEKI 66 W EK R+LK++LEE+RK Y +++ L+++ W + + D E Sbjct: 73 WREEKERLLKMTLEERRKEYLR-EYVALKDIPTWMEEMRSKNESDG-----------ENA 120 Query: 67 KINTEKNKSISERVSIFKGDITKLEIDAVVNA 98 K + + +S+SE+VS+++GDIT LE+DA+VNA Sbjct: 121 KEDVQGKRSLSEKVSLYRGDITLLEVDAIVNA 152 >UniRef50_Q4RPB9 Cluster: Chromosome 1 SCAF15008, whole genome shotgun sequence; n=2; Tetraodontidae|Rep: Chromosome 1 SCAF15008, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 227 Score = 62.5 bits (145), Expect = 9e-09 Identities = 29/44 (65%), Positives = 31/44 (70%) Query: 109 VDGAIHRAAGPFLQAECDSIGGCPTGDAKVTGGYNLPAKYIIHT 152 VDGAIHRAAGP L EC S+ GC TG AK+T GY LPA I T Sbjct: 58 VDGAIHRAAGPALLKECASLQGCETGQAKITCGYGLPANVTIGT 101 >UniRef50_Q460N3 Cluster: Poly [ADP-ribose] polymerase 15; n=9; Euteleostomi|Rep: Poly [ADP-ribose] polymerase 15 - Homo sapiens (Human) Length = 656 Score = 62.5 bits (145), Expect = 9e-09 Identities = 47/169 (27%), Positives = 78/169 (46%), Gaps = 12/169 (7%) Query: 45 NKSQGIDSKKSTTDDLKEFEKIKINTEKNKSISERVSIFKGDITKLEIDAVVNAANSRLK 104 ++S D+K S D L + + + + ++ + + GD+ + D +VN+ L+ Sbjct: 37 SRSMSRDNKFSKKDCLS-IRNVVASIQTKEGLN--LKLISGDVLYIWADVIVNSVPMNLQ 93 Query: 105 AGGG-VDGAIHRAAGPFLQAECDSIGGCP---TGDAKVTGGYNLPAKYIIHTVGPQ---- 156 GGG + A + AGP LQ E D G+ +T G NL K ++H V P Sbjct: 94 
LGGGPLSRAFLQKAGPMLQKELDDRRRETEEKVGNIFMTSGCNLDCKAVLHAVAPYWNNG 153 Query: 157 -DGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGFPNRLAAHIAL 204 + S + + + +KCL+ + SI FP I TG FP + A + L Sbjct: 154 AETSWQIMANIIKKCLTTVEVLSFSSITFPMIGTGSLQFPKAVFAKLIL 202 Score = 53.6 bits (123), Expect = 4e-06 Identities = 34/109 (31%), Positives = 52/109 (47%), Gaps = 5/109 (4%) Query: 85 GDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKVTGGYNL 144 GDI ++D +VN+ GV AI AG +++EC + P D +T G L Sbjct: 289 GDIATEQVDVIVNSTARTFNRKSGVSRAILEGAGQAVESECAVLAAQPHRDFIITPGGCL 348 Query: 145 PAKYIIHTVGPQDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYG 193 K IIH G +D + + S E+C ++ + S++ P I TG G Sbjct: 349 KCKIIIHVPGGKD-VRKTVTSVLEEC----EQRKYTSVSLPAIGTGNAG 392 >UniRef50_Q00XU1 Cluster: Hismacro and SEC14 domain-containing proteins; n=1; Ostreococcus tauri|Rep: Hismacro and SEC14 domain-containing proteins - Ostreococcus tauri Length = 598 Score = 60.5 bits (140), Expect = 4e-08 Identities = 53/167 (31%), Positives = 79/167 (47%), Gaps = 17/167 (10%) Query: 90 LEIDAVVNAANS---RLKAGGGV-DGAIHRAAGPFLQAECDSIGGCPTGDAKVTGGYNLP 145 +++DAV AAN R++ G +H AG L+ E S TG +T G LP Sbjct: 129 MDVDAVSCAANESMRRVRVGESERQRTLHALAGEELEREMASAERARTGGCAMTSGCRLP 188 Query: 146 AKYIIHTVGPQ------DGSAEKLESCYEKCLS-FQQEYQIKSIA--FPCISTGIYGFPN 196 A+ I+H VGP+ + L CY LS +E + +++A PC+ Y P Sbjct: 189 ARRIMHVVGPRYAEKYATAAENALCHCYVALLSKCVEECKARTVACTSPCLENKKY--PT 246 Query: 197 RLAAHIALRTARKFLET-NTEMNRIIFCTFLPIDVEIYETLMQLYFP 242 AA +A RT R+FLE ++ + I+ C +E Y +YFP Sbjct: 247 DKAAMVAARTIRRFLERWQSKFDAIVVCVEEEA-LEPYLEAFTVYFP 292 >UniRef50_Q4SK44 Cluster: Chromosome 2 SCAF14570, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome 2 SCAF14570, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 865 Score = 60.1 bits (139), Expect = 5e-08 Identities = 42/138 (30%), Positives = 63/138 (45%), Gaps = 7/138 (5%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAG-GGVDGAIHRAAGPFLQAECDSIGGCPT-GDAK 137 +++ G I D +VN+ L G + AI +AAGP LQ ++ T GD 
Sbjct: 103 IALATGKIEDATTDVIVNSVFKALNLKEGALSNAIFQAAGPQLQVLLNAKKSSGTVGDVI 162 Query: 138 VTGGYNLPAKYIIHTVGPQDGSAEK-----LESCYEKCLSFQQEYQIKSIAFPCISTGIY 192 VT G L + ++ H V P G+A+ L + CL+ ++ + SI+FP I TG Sbjct: 163 VTEGCQLKSMFVYHAVTPAKGTAQDQAMKALSGIFRDCLNKAEDRGMTSISFPTIGTGQL 222 Query: 193 GFPNRLAAHIALRTARKF 210 GF A + KF Sbjct: 223 GFSKDHVAQVLYGEISKF 240 Score = 44.8 bits (101), Expect = 0.002 Identities = 34/112 (30%), Positives = 50/112 (44%), Gaps = 2/112 (1%) Query: 43 YLNKSQGIDSKKSTTDDLKEFEKIKINTEKNKSISERVSI--FKGDITKLEIDAVVNAAN 100 Y + G + T L F KI +E +++ V+I GDITK D +VN++N Sbjct: 290 YYLHTVGCTFNRCTICILGHFSKIITTSEMHETKMGSVTIQAVTGDITKETTDVIVNSSN 349 Query: 101 SRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKVTGGYNLPAKYIIHT 152 GV AI AAG ++AEC + VT L ++ I+T Sbjct: 350 ENFTLKRGVSKAILEAAGQAVEAECQKLEWQQIVCQMVTANSTLHSRIRIYT 401 >UniRef50_UPI0000660C1F Cluster: Homolog of Gallus gallus "Histone macroH2A1.2.; n=1; Takifugu rubripes|Rep: Homolog of Gallus gallus "Histone macroH2A1.2. - Takifugu rubripes Length = 1044 Score = 58.4 bits (135), Expect = 2e-07 Identities = 45/164 (27%), Positives = 63/164 (38%), Gaps = 4/164 (2%) Query: 77 SERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDA 136 S + GDITK D +VN++N+ GV AI AAG ++ EC + P Sbjct: 705 SVTIQAVTGDITKETTDVIVNSSNNTFSLKKGVSKAILEAAGQAVEDECQKLAASPNAGI 764 Query: 137 KVTGGYNLPAKYIIHTVGPQDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGFPN 196 +T NL K I+H G A + + L S++FP I TG Sbjct: 765 IMTQPGNLQCKKIVHVTG--QTKAFLISKVVKSALQMCVANSYTSVSFPAIGTGQGNIKA 822 Query: 197 RLAAHIALRTARKFLETN--TEMNRIIFCTFLPIDVEIYETLMQ 238 A L N T +N + F P + + T MQ Sbjct: 823 TEVADAMFDAVIDELSQNSSTTLNTVRIVVFQPPMLNDFYTSMQ 866 Score = 55.6 bits (128), Expect = 1e-06 Identities = 42/152 (27%), Positives = 71/152 (46%), Gaps = 6/152 (3%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAG-GGVDGAIHRAAGPFLQAECDSIGGCPT-GDAK 137 +++ G+I D VN+ + L G + A+ AAG LQ + T G+ Sbjct: 520 ITLVVGNIEDATTDVTVNSVFNDLDLNRGALSRALLHAAGLQLQDFLKAQNSSGTLGEII 579 Query: 138 
VTGGYNLPAKYIIHTVGPQDGSAEKLESC---YEKCLSFQQEYQIKSIAFPCISTGIYGF 194 VT G L + ++ H V P +A+ +++ + CL ++ + SI+FP I TG GF Sbjct: 580 VTEGCQLKSMFVYHAVTPASYNAQAVQALGGIFRDCLKKAEDSGMTSISFPSIGTGGLGF 639 Query: 195 PNRLAAHIALRTARKFLETNTEMNRIIFCTFL 226 P LAA + KF + + R++ T + Sbjct: 640 PKDLAAQMLYDEILKF-SSKRQTKRLVEVTII 670 Score = 52.4 bits (120), Expect = 1e-05 Identities = 45/159 (28%), Positives = 72/159 (45%), Gaps = 17/159 (10%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSI----GGCPTGD 135 + + K DI + AVV+ AN + G+ A+ +AAGP LQ ECD + G GD Sbjct: 298 IFVCKADICSYPVHAVVSYANPDFRFTSGLQRALLKAAGPQLQEECDRLIHLKGRLKPGD 357 Query: 136 AKVT-GGYNLPAKYIIHTVGPQ-DGS-------AEKLESCYEKCLSFQQEYQIKSIAFPC 186 +T G L + IIH V P+ DG +L+ + L ++ S+A P Sbjct: 358 NVITAAGGQLCCRNIIHAVAPKLDGGQIIFVKRVAQLKKAIKGSLELAEKKGCVSVALPA 417 Query: 187 ISTGIYGFPNRLAAHIALRTARKFLE---TNTEMNRIIF 222 +S GF +L+ + R++ + N + R+ F Sbjct: 418 LSI-TSGFLLKLSVDPIITAVREYFDERHNNVVLKRVHF 455 >UniRef50_Q5M915 Cluster: D930010j01rik-prov protein; n=3; Xenopus|Rep: D930010j01rik-prov protein - Xenopus tropicalis (Western clawed frog) (Silurana tropicalis) Length = 170 Score = 58.4 bits (135), Expect = 2e-07 Identities = 33/94 (35%), Positives = 57/94 (60%), Gaps = 13/94 (13%) Query: 7 WEIEKNRILKLSLEEKRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDLKEFEKI 66 W+ K+ + L+ ++KR Y DFI L+ + W D+ K ++K+ E+ Sbjct: 58 WKEAKSYLKGLTNKQKRDHYSVKDFIKLKQIPVWK---------DTGKKV--NIKQQEEG 106 Query: 67 KINTEKNKSISERVSIFKGDITKLEIDAVVNAAN 100 K KNK+++E++S+F+GDITKLE+DA++NA + Sbjct: 107 KY--AKNKALNEKISLFRGDITKLEVDAIINAGS 138 >UniRef50_UPI000065F87F Cluster: Homolog of Gallus gallus "Histone macroH2A1.2.; n=1; Takifugu rubripes|Rep: Homolog of Gallus gallus "Histone macroH2A1.2. 
- Takifugu rubripes Length = 888 Score = 57.2 bits (132), Expect = 4e-07 Identities = 40/136 (29%), Positives = 65/136 (47%), Gaps = 5/136 (3%) Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAG-GGVDGAIHRAAGPFLQAECDSIGGCPT-GDAK 137 +++ G+I D VN+ + L G + A+ AAGP LQ + T G+ Sbjct: 712 ITLVVGNIEDATTDVTVNSVFNDLDLNRGALSRALLHAAGPQLQDFLKAQNSSGTLGEII 771 Query: 138 VTGGYNLPAKYIIHTVGPQDGSAEKLESC---YEKCLSFQQEYQIKSIAFPCISTGIYGF 194 +T G L + ++ H V P +A+ +++ + CL ++ + SI+FP I TG GF Sbjct: 772 MTEGCQLKSMFVYHAVTPASYNAQAVQALGGIFRDCLKKAEDSGMTSISFPSIGTGGLGF 831 Query: 195 PNRLAAHIALRTARKF 210 P LAA + KF Sbjct: 832 PKDLAAQMLYDEILKF 847 Score = 46.8 bits (106), Expect = 5e-04 Identities = 31/83 (37%), Positives = 42/83 (50%), Gaps = 6/83 (7%) Query: 86 DITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSI----GGCPTGDAKVT-G 140 DI + AVV+ AN + G+ A+ +AAGP LQ +CD + G GD +T Sbjct: 57 DICSYPVHAVVSYANPDFRFTSGLQRALLKAAGPQLQEDCDRLIHLKGRLKPGDNVITAA 116 Query: 141 GYNLPAKYIIHTVGPQ-DGSAEK 162 G L + IIH V P+ DG K Sbjct: 117 GGQLCCRNIIHAVAPKLDGGVSK 139 >UniRef50_Q9WJC8 Cluster: Nonstructural polyprotein; n=12; Venezuelan equine encephalitis virus|Rep: Nonstructural polyprotein - Venezuelan equine encephalitis virus Length = 2455 Score = 56.8 bits (131), Expect = 5e-07 Identities = 48/150 (32%), Positives = 74/150 (49%), Gaps = 20/150 (13%) Query: 82 IFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKVTGG 141 + +GDI E +VNAANSR + GGGV GA+++ E + G +++ G Sbjct: 1335 VVRGDIANAEEGVIVNAANSRGQPGGGVCGALYKRF-----PENFDLQPIEVGKSRLVKG 1389 Query: 142 YNLPAKYIIHTVGPQ-------DGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIY-G 193 AK+IIH VGP DG ++L YE + +++A P +STGI+ G Sbjct: 1390 ---AAKHIIHAVGPNFNKVSELDGD-KQLAEAYESVAKIINDNHYRTVAIPLLSTGIFAG 1445 Query: 194 FPNRLAAHIALRTARKFLETNTEMNRIIFC 223 +RL +L L+T T+ + I+C Sbjct: 1446 NKDRLMQ--SLNHLLTALDT-TDADVAIYC 1472 >UniRef50_UPI0001555B8B Cluster: PREDICTED: similar to Poly [ADP-ribose] polymerase 14 (PARP-14) (B aggressive lymphoma protein 2), partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED: 
similar to Poly [ADP-ribose] polymerase 14 (PARP-14) (B aggressive lymphoma protein 2), partial - Ornithorhynchus anatinus Length = 609 Score = 56.4 bits (130), Expect = 6e-07 Identities = 30/77 (38%), Positives = 44/77 (57%), Gaps = 4/77 (5%) Query: 79 RVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSI----GGCPTG 134 R+ + +GD+ + DAVVN ++ LK GG+ G + R AGP LQ C + G P G Sbjct: 318 RLVVRQGDLARYPADAVVNPSHEDLKHSGGLAGHLARHAGPELQEACRLLVRKSGPVPLG 377 Query: 135 DAKVTGGYNLPAKYIIH 151 +A TG ++LP +IH Sbjct: 378 EAVATGAWSLPFGRVIH 394 >UniRef50_A7BVQ6 Cluster: Appr-1-p processing enzyme family; n=1; Beggiatoa sp. PS|Rep: Appr-1-p processing enzyme family - Beggiatoa sp. PS Length = 252 Score = 56.4 bits (130), Expect = 6e-07 Identities = 51/211 (24%), Positives = 90/211 (42%), Gaps = 13/211 (6%) Query: 35 ENVDPWSKYLNKSQGIDSKKSTTDDLKEFEKIKINTEKNKSIS-ERVSIFKGDITKLEID 93 + + P S+++ ++ K + +I + K I+ E + I +GDIT +D Sbjct: 14 DKIGPLSRFVAAAKQTTEKLLLDAGFPKEPNKEITIQNIKQIATENIEILRGDITTFTVD 73 Query: 94 AVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKVTGGYNLPAKYIIHTV 153 A V + G + L+A ++ AK++ NLPA+YIIH V Sbjct: 74 ARVMTTAPNPEIGS-------ETSRYQLKAIFSALRRLNIYQAKISRTSNLPARYIIHIV 126 Query: 154 GP--QDGSAEKLESC---YEKCLSFQQEYQIKSIAFPCISTGIYGFPNRLAAHIALRTAR 208 Q G+ +++ S Y CL+ +K IAFP I + +P A + A + Sbjct: 127 ESTWQQGTQQEIASLANNYRSCLTSATRKSLKVIAFPDIICSMSQYPIAQAVYTAFKEVL 186 Query: 209 KFLETNTEMNRIIFCTFLPIDVEIYETLMQL 239 +FL +R F+ + EIY+ + + Sbjct: 187 EFLMDKPNKSRFKKVYFICQNEEIYQIYLDV 217 >UniRef50_Q7REF6 Cluster: ATPase associated with chromosome architecture/replication; n=3; Plasmodium|Rep: ATPase associated with chromosome architecture/replication - Plasmodium yoelii yoelii Length = 254 Score = 56.0 bits (129), Expect = 8e-07 Identities = 56/200 (28%), Positives = 85/200 (42%), Gaps = 20/200 (10%) Query: 41 SKYLNKSQGIDSKKSTTDDLKEFEKIKINT-EKNKSISERVSIFKG-----DITKLEI-- 92 +K +N + I +KK + +L + E I+I EK+ +S+ D+ + + Sbjct: 26 NKNINLDKLIRNKKIKSHELYKIEDIEILLQEKHHDVSQTYPTINNVNQIVDVKNIPVFK 85 Query: 93 
------DAVVNAANSRL---KAGGGVDGAIH--RAAGPFLQAECDSIGGCPTG-DAKVTG 140 DA+VN N K G G D + + + G L E I G + VT Sbjct: 86 KSENHGDAIVNGTNKIFELTKDGMGYDCSSNFLKTCGNKLYDEIKIIREKNIGKNILVTK 145 Query: 141 GYNLPAKYIIHTVGPQDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGFPNRLAA 200 GYN KYIIH + P +L+ CY+ L +E IK+I FP I +GI F Sbjct: 146 GYNSSYKYIIHVIEPYYNQINELKKCYKDALLIAKENDIKTIVFPLIGSGISLFKKYDVV 205 Query: 201 HIALRTARKFLETNTEMNRI 220 L +F++ N I Sbjct: 206 VCCLEGIYEFIKHKENFNFI 225 >UniRef50_A3BF04 Cluster: Putative uncharacterized protein; n=1; Oryza sativa (japonica cultivar-group)|Rep: Putative uncharacterized protein - Oryza sativa subsp. japonica (Rice) Length = 128 Score = 55.2 bits (127), Expect = 1e-06 Identities = 28/50 (56%), Positives = 33/50 (66%), Gaps = 4/50 (8%) Query: 80 VSIFKGDITKLEID----AVVNAANSRLKAGGGVDGAIHRAAGPFLQAEC 125 + + K DIT +D A+VNAAN R+ GGGVDGAIHRAAGP L C Sbjct: 24 LKLHKDDITLWSVDGATVAIVNAANERMLGGGGVDGAIHRAAGPELVEAC 73 >UniRef50_Q0Q476 Cluster: Replicase polyprotein 1ab (pp1ab) (ORF1ab polyprotein) [Includes: Replicase polyprotein 1a (pp1a) (ORF1A)] [Contains: Non-structural protein 1 (nsp1) (Leader protein); Non-structural protein 2 (nsp2) (p65 homolog); Non-structural protein 3 (EC 3.4.22.-) (nsp3) (Papain- like proteinase) (PL-PRO) (PL2-PRO); Non-structural protein 4 (nsp4); 3C-like proteinase (EC 3.4.22.-) (3CL-PRO) (3CLp) (nsp5); Non- structural protein 6 (nsp6); Non-structural protein 7 (nsp7); Non- structural protein 8 (nsp8); Non-structural protein 9 (nsp9); Non- structural protein 10 (nsp10) (Growth factor-like peptide) (GFL); RNA- directed RNA polymerase (EC 2.7.7.48) (RdRp) (Pol) (nsp12); Helicase (Hel) (nsp13); Exoribonuclease (EC 3.1.13.-) (ExoN) (nsp14); Uridylate-specific endoribonuclease (EC 3.1.-.-) (NendoU) (nsp15); Putative 2'-O-methyl transferase (EC 2.1.1.-) (nsp16)]; n=183; Coronavirus|Rep: Replicase polyprotein 1ab (pp1ab) (ORF1ab polyprotein) [Includes: Replicase polyprotein 1a (pp1a) (ORF1A)] [Contains: Non-structural protein 1 (nsp1) (Leader 
protein); Non-structural protein 2 (nsp2) (p65 homolog); Non-structural protein 3 (EC 3.4.22.-) (nsp3) (Papain- like proteinase) (PL-PRO) (PL2-PRO); Non-structural protein 4 (nsp4); 3C-like proteinase (EC 3.4.22.-) (3CL-PRO) (3CLp) (nsp5); Non- structural protein 6 (nsp6); Non-structural protein 7 (nsp7); Non- structural protein 8 (nsp8); Non-structural protein 9 (nsp9); Non- structural protein 10 (nsp10) (Growth factor-like peptide) (GFL); RNA- directed RNA polymerase (EC 2.7.7.48) (RdRp) (Pol) (nsp12); Helicase (Hel) (nsp13); Exoribonuclease (EC 3.1.13.-) (ExoN) (nsp14); Uridylate-specific endoribonuclease (EC 3.1.-.-) (NendoU) (nsp15); Putative 2'-O-methyl transferase (EC 2.1.1.-) (nsp16)] - Bat coronavirus 279/2005 (BtCoV) (BtCoV/279/2005) Length = 7079 Score = 54.8 bits (126), Expect = 2e-06 Identities = 37/118 (31%), Positives = 57/118 (48%), Gaps = 8/118 (6%) Query: 95 VVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSI----GGCPTGDAKVTGGYNLPAKYII 150 +VNAAN LK GGGV GA+++A +Q E D G G + + G+NL AK + Sbjct: 1033 IVNAANVHLKHGGGVAGALNKATNGAMQQESDDYIKKNGPLTVGGSCLLSGHNL-AKKCM 1091 Query: 151 HTVGPQDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGFPNRLAAHIALRTAR 208 H VGP + E ++ +F + + P +S GI+G + + + T R Sbjct: 1092 HVVGPNLNAGEDVQLLKAAYANFNSQ---DVLLAPLLSAGIFGAKPLQSLKMCVETVR 1146 >UniRef50_A7AWQ8 Cluster: Putative uncharacterized protein; n=1; Babesia bovis|Rep: Putative uncharacterized protein - Babesia bovis Length = 418 Score = 54.4 bits (125), Expect = 2e-06 Identities = 55/223 (24%), Positives = 94/223 (42%), Gaps = 16/223 (7%) Query: 30 DFIDLENVDPWSKYLNKSQGIDSKKSTTDDLKEFEKIKINTEK----NKSISERVSIFKG 85 DFI+ E + P K SQ +T + ++ E +T+ N ++ +V I Sbjct: 30 DFIEREPIKPIRKV---SQERMEPWTTCERWRKHEVPPSDTQPKFSVNHDVNNKVYIGTC 86 Query: 86 DITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKVTGGYNLP 145 DI +LE+ AV + IH +G + E C GD YN+ Sbjct: 87 DILELEVGAVAVFLDELSPFVSRTAKRIHIQSGKSMPYEEFEKMRC--GDVMTQRSYNIG 144 Query: 146 AKYIIHTVGPQ------DGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGFPNRLA 199 ++Y I+T+ P+ D SA + C + L + 
+ ++A P Y +P+ Sbjct: 145 SEYAIYTIAPRYASKYPDASANIVNMCVREVLKTAIDTGLDTVAIPLKMGREYTYPDEQF 204 Query: 200 AHIALRTARKFLETNTEMNRIIFCTFLPIDVEIYETLMQLYFP 242 LR+ R++LE N+I ID + Y +L++ +FP Sbjct: 205 TTAVLRSLRRWLEIPAVSNKIKRVFLFDIDTDAY-SLLRRFFP 246 >UniRef50_P18458 Cluster: Replicase polyprotein 1ab (pp1ab) (ORF1ab polyprotein) [Includes: Replicase polyprotein 1a (pp1a) (ORF1a)] [Contains: Non-structural protein 1 (nsp1); Non-structural protein 2 (nsp2); Non-structural protein 3 (nsp3); 3C-like serine proteinase (EC 3.4.21.-) (3CLSP) (M- PRO) (p27) (nsp4); Non-structural protein 5 (nsp5); Non-structural protein 6 (nsp6); Non-structural protein 7 (nsp7); Non-structural protein 8 (nsp8); Non-structural protein 9 (nsp9); RNA-directed RNA polymerase (EC 2.7.7.48) (RdRp) (Pol) (p100) (nsp11); Helicase (Hel) (p67) (nsp12); Exoribonuclease (EC 3.1.13.-) (ExoN) (nsp13); Non- structural protein 14 (nsp14); Uridylate-specific endoribonuclease (EC 3.1.-.-) (NendoU) (nsp15); Putative 2'-O-methyl transferase (EC 2.1.1.-) (nsp16)]; n=3; Torovirus|Rep: Replicase polyprotein 1ab (pp1ab) (ORF1ab polyprotein) [Includes: Replicase polyprotein 1a (pp1a) (ORF1a)] [Contains: Non-structural protein 1 (nsp1); Non-structural protein 2 (nsp2); Non-structural protein 3 (nsp3); 3C-like serine proteinase (EC 3.4.21.-) (3CLSP) (M- PRO) (p27) (nsp4); Non-structural protein 5 (nsp5); Non-structural protein 6 (nsp6); Non-structural protein 7 (nsp7); Non-structural protein 8 (nsp8); Non-structural protein 9 (nsp9); RNA-directed RNA polymerase (EC 2.7.7.48) (RdRp) (Pol) (p100) (nsp11); Helicase (Hel) (p67) (nsp12); Exoribonuclease (EC 3.1.13.-) (ExoN) (nsp13); Non- structural protein 14 (nsp14); Uridylate-specific endoribonuclease (EC 3.1.-.-) (NendoU) (nsp15); Putative 2'-O-methyl transferase (EC 2.1.1.-) (nsp16)] - Berne virus (BEV) Length = 6857 Score = 54.4 bits (125), Expect = 2e-06 Identities = 50/173 (28%), Positives = 83/173 (47%), Gaps = 14/173 (8%) Query: 31 
FIDLE-NVDPWSKYLNKSQGIDSKKSTTDDLKEFEKIKINTEKNKSISERV-SIFKGDIT 88 F+D + + W+ L+ +G DS + +++ + KI + + S+F+ + Sbjct: 1651 FVDYDVKKNEWT--LSPEEGEDSDDNLDLPFEQYYEFKIGQTNVVLVQDDFKSVFEFLKS 1708 Query: 89 KLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSI----GGCPTGDAKVTGGYNL 144 + +D VVN ANS+LK GGG+ I GP LQA ++ P A + G+ L Sbjct: 1709 EQGVDYVVNPANSQLKHGGGIAKVISCMCGPKLQAWSNNYITKNKTVPVTKAIKSPGFQL 1768 Query: 145 PAKY-IIHTVGPQ--DGSA-EKLESCYEKCLSFQQEYQIKSIAFPCISTGIYG 193 K IIH VGP+ DG +KL+ + ++ +I +STGI+G Sbjct: 1769 GKKVNIIHAVGPRVSDGDVFQKLDQAWRSVFDLCEDQH--TILTSMLSTGIFG 1819 >UniRef50_Q6NIW9 Cluster: Putative uncharacterized protein; n=1; Corynebacterium diphtheriae|Rep: Putative uncharacterized protein - Corynebacterium diphtheriae Length = 254 Score = 54.0 bits (124), Expect = 3e-06 Identities = 47/166 (28%), Positives = 68/166 (40%), Gaps = 16/166 (9%) Query: 74 KSISERVSIFKGDITKLEIDAVVNAANSRL-----KAGGGVDGAIHRAAGPFLQAEC--- 125 K+ + ++ GDIT+L A+V A L + + IH+ AG L+ EC Sbjct: 71 KATTPAATVVVGDITELPFSAMVVPATQTLIGPTSPSISDLAARIHQRAGFGLRLECARL 130 Query: 126 --DSIGGCPTGDAKVTGGYNLPAKYIIHTVGPQ------DGSAEKLESCYEKCLSFQQEY 177 +S G A VT G+ LP +IIH V PQ S E L C++ + Sbjct: 131 LKESHEHIEVGSAYVTSGFLLPTPWIIHIVTPQLNLAARGESIELLRQCFQNIFATAAGR 190 Query: 178 QIKSIAFPCISTGIYGFPNRLAAHIALRTARKFLETNTEMNRIIFC 223 K + P TG GFP + A I +T + +I C Sbjct: 191 DWKELTIPSQLTGPLGFPAGMEAQILSEELAAARKTGFSAHVVIVC 236 >UniRef50_P87515 Cluster: Non-structural polyprotein (Polyprotein nsP1234) (P1234) [Contains: P123; P123'; mRNA-capping enzyme nsP1 (EC 2.1.1.-) (EC 2.7.7.-) (Non- structural protein 1); Protease/triphosphatase/NTPase/helicase nsP2 (EC 3.4.22.-) (EC 3.1.3.33) (EC 3.6.1.15) (EC 3.6.1.-) (Non-structural protein 2) (nsP2); Non-structural protein 3 (nsP3); Non-structural protein 3' (nsP3'); RNA-directed RNA polymerase nsP4 (EC 2.7.7.48) (Non-structural protein 4) (nsP4)]; n=13; Alphavirus|Rep: Non-structural polyprotein (Polyprotein nsP1234) (P1234) [Contains: P123; P123'; mRNA-capping enzyme nsP1 (EC 2.1.1.-) (EC 
2.7.7.-) (Non- structural protein 1); Protease/triphosphatase/NTPase/helicase nsP2 (EC 3.4.22.-) (EC 3.1.3.33) (EC 3.6.1.15) (EC 3.6.1.-) (Non-structural protein 2) (nsP2); Non-structural protein 3 (nsP3); Non-structural protein 3' (nsP3'); RNA-directed RNA polymerase nsP4 (EC 2.7.7.48) (Non-structural protein 4) (nsP4)] - Barmah forest virus (BFV) Length = 2410 Score = 54.0 bits (124), Expect = 3e-06 Identities = 45/124 (36%), Positives = 61/124 (49%), Gaps = 19/124 (15%) Query: 84 KGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIG--GCPTGDAKVTGG 141 +GDI+ DAVVNAAN + G GV GAI+R P D+ G PTG A Sbjct: 1339 RGDISNAPEDAVVNAANQQGVKGAGVCGAIYR-KWP------DAFGDVATPTGTAV---S 1388 Query: 142 YNLPAKYIIHTVGP------QDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIY-GF 194 ++ K +IH VGP ++ L S Y + +I ++A P +STGIY G Sbjct: 1389 KSVQDKLVIHAVGPNFSKCSEEEGDRDLASAYRAAAEIVMDKKITTVAVPLLSTGIYAGG 1448 Query: 195 PNRL 198 NR+ Sbjct: 1449 KNRV 1452 >UniRef50_UPI0000E1FED6 Cluster: PREDICTED: hypothetical protein isoform 4; n=1; Pan troglodytes|Rep: PREDICTED: hypothetical protein isoform 4 - Pan troglodytes Length = 483 Score = 53.2 bits (122), Expect = 6e-06 Identities = 33/106 (31%), Positives = 51/106 (48%), Gaps = 5/106 (4%) Query: 85 GDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKVTGGYNL 144 GDI ++D +VN+ GV AI AG +++EC + P D +T G L Sbjct: 185 GDIATEQVDVIVNSTARTFNRKSGVSKAILEGAGQAVESECAVLAAQPHRDFIITPGGCL 244 Query: 145 PAKYIIHTVGPQDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTG 190 K IIH G +D + + S E+C ++ + S++ P I TG Sbjct: 245 KCKIIIHVPGRKD-VRKTVTSVLEEC----EQRKYTSVSLPAIGTG 285 Score = 44.8 bits (101), Expect = 0.002 Identities = 32/115 (27%), Positives = 54/115 (46%), Gaps = 7/115 (6%) Query: 45 NKSQGIDSKKSTTDDLKEFEKIKINTEKNKSISERVSIFKGDITKLEIDAVVNAANSRLK 104 ++S D+K S D L + + + + ++ + + GD+ + D +VN+ L+ Sbjct: 59 SRSMSRDNKFSKKDCLS-IRNVVASIQTKEGLN--LKLISGDVLYIWADVIVNSVPMNLQ 115 Query: 105 AGGG-VDGAIHRAAGPFLQAECDS---IGGCPTGDAKVTGGYNLPAKYIIHTVGP 155 GGG + A + AGP LQ E D G+ +T G NL K ++H V P Sbjct: 116 
LGGGPLSRAFLQKAGPMLQKELDDRRRETEEKVGNIFMTSGCNLDCKAVLHAVAP 170 >UniRef50_Q08X95 Cluster: Appr-1-p processing enzyme family protein; n=3; Bacteria|Rep: Appr-1-p processing enzyme family protein - Stigmatella aurantiaca DW4/3-1 Length = 229 Score = 53.2 bits (122), Expect = 6e-06 Identities = 40/128 (31%), Positives = 55/128 (42%), Gaps = 8/128 (6%) Query: 80 VSIFKGDITKLEIDAVVNAANSR-----LKAGGGVDGAIHRAAGPFLQAECDSIGGCPTG 134 + + +GD+ +DA+VNA N L GV GA+ R G E +G P G Sbjct: 77 IRVVEGDLLDQRVDAIVNAWNRNVLPWWLLVPQGVSGALKRRGGLQPFRELARMGPLPLG 136 Query: 135 DAKVTGGYNLPAKYIIHTVGPQ---DGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGI 191 A VT LP + IIH G S + + L+ +E +S+AFP I G Sbjct: 137 AAVVTSAGTLPYQGIIHVAGINLLWRASEQSIRDSVANALARARERGWRSLAFPLIGAGS 196 Query: 192 YGFPNRLA 199 GF A Sbjct: 197 GGFDEEKA 204 >UniRef50_UPI0000EB30ED Cluster: UPI0000EB30ED related cluster; n=1; Canis lupus familiaris|Rep: UPI0000EB30ED UniRef100 entry - Canis familiaris Length = 243 Score = 52.0 bits (119), Expect = 1e-05 Identities = 37/150 (24%), Positives = 72/150 (48%), Gaps = 13/150 (8%) Query: 46 KSQGIDSKKSTTDDLKE---FEKIKINTEKNKSISERVSIFKGDITKL---EIDAVVNAA 99 K QG SK ++ D E + + + K+ + +++++ +I+ L E++A++N Sbjct: 94 KKQGEVSKAASADSTTEGTPADGFTVLSTKSLFLGQKLNLIHSEISNLAGFEVEAIINPT 153 Query: 100 NSRLKAGGGVDGAIHRAAGP-FLQAECD---SIGGCPTGDAKVTGGYNLPAKYIIHTVGP 155 N+ + + + + G F++A + G A V+ G+ LPAK++IH P Sbjct: 154 NADIDLKDDLGNTLEKKGGKEFVEAVLELRKKNGPLEVAGAAVSAGHGLPAKFVIHCNSP 213 Query: 156 ---QDGSAEKLESCYEKCLSFQQEYQIKSI 182 D E LE + CL+ + ++KSI Sbjct: 214 VWGADKCEELLEKTVKNCLALADDKKLKSI 243 >UniRef50_UPI0000F2EBB4 Cluster: PREDICTED: similar to LRP16 protein; n=1; Monodelphis domestica|Rep: PREDICTED: similar to LRP16 protein - Monodelphis domestica Length = 168 Score = 49.2 bits (112), Expect = 9e-05 Identities = 29/82 (35%), Positives = 49/82 (59%), Gaps = 13/82 (15%) Query: 17 LSLEEKRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDLKEFEKIKINTEKNKSI 76 LS +++ + Y DFI L+ + W + + G KE E+ + K+K++ Sbjct: 81 
LSDKQREEHYFCRDFIRLKKIPTWKEMAKGAAG-----------KEAEEPQYR--KDKAL 127 Query: 77 SERVSIFKGDITKLEIDAVVNA 98 +E++S+F+GDITKLE+DA+VNA Sbjct: 128 NEKLSLFRGDITKLEVDAIVNA 149 >UniRef50_Q4RPB7 Cluster: Chromosome 1 SCAF15008, whole genome shotgun sequence; n=2; Tetraodontidae|Rep: Chromosome 1 SCAF15008, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 145 Score = 48.8 bits (111), Expect = 1e-04 Identities = 27/96 (28%), Positives = 52/96 (54%), Gaps = 18/96 (18%) Query: 17 LSLEEKRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDLKEFEKIKINTEKNKSI 76 + +EE+R+ Y++S F+ L++V W+ S+ + +N+ + Sbjct: 17 IKVEERREYYRTSSFVPLDDVPVWTPTAGASE------------------QPLYRRNEKL 58 Query: 77 SERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGA 112 +++S++ GDITKLEIDA+VNA +R + + G+ Sbjct: 59 DQKISLYSGDITKLEIDAIVNAEEARCRDPPSLPGS 94 >UniRef50_UPI000155BDA5 Cluster: PREDICTED: similar to LRP16 protein; n=1; Ornithorhynchus anatinus|Rep: PREDICTED: similar to LRP16 protein - Ornithorhynchus anatinus Length = 186 Score = 47.2 bits (107), Expect = 4e-04 Identities = 28/86 (32%), Positives = 51/86 (59%), Gaps = 14/86 (16%) Query: 17 LSLEEKRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDLKEFEKIKINTEKNKSI 76 LS +++ + Y DF+ L+ + W + ++G+ +K E K K K+K + Sbjct: 14 LSDKQREEHYFCRDFVRLKKIPTWKE---TAKGVQAKV-------EEPKYK----KDKQL 59 Query: 77 SERVSIFKGDITKLEIDAVVNAANSR 102 +E++S+ +GDITKLE+DA+VNA ++ Sbjct: 60 NEKISLLRGDITKLEVDAIVNAGAAK 85 >UniRef50_A3EXG5 Cluster: Replicase polyprotein 1ab (pp1ab) (ORF1ab polyprotein) [Includes: Replicase polyprotein 1a (pp1a) (ORF1A)] [Contains: Non-structural protein 1 (nsp1) (Leader protein); Non-structural protein 2 (nsp2) (p65 homolog); Non-structural protein 3 (EC 3.4.22.-) (nsp3) (Papain- like proteinase) (PL-PRO) (PL2-PRO); Non-structural protein 4 (nsp4); 3C-like proteinase (EC 3.4.22.-) (3CL-PRO) (3CLp) (nsp5); Non- structural protein 6 (nsp6); Non-structural protein 7 (nsp7); Non- structural protein 8 (nsp8); Non-structural protein 9 (nsp9); Non- structural protein 10 (nsp10) (Growth 
factor-like peptide) (GFL); RNA- directed RNA polymerase (EC 2.7.7.48) (RdRp) (Pol) (nsp12); Helicase (Hel) (nsp13); Exoribonuclease (EC 3.1.13.-) (ExoN) (nsp14); Uridylate-specific endoribonuclease (EC 3.1.-.-) (NendoU) (nsp15); Putative 2'-O-methyl transferase (EC 2.1.1.-) (nsp16)]; n=4; Bat coronavirus HKU9|Rep: Replicase polyprotein 1ab (pp1ab) (ORF1ab polyprotein) [Includes: Replicase polyprotein 1a (pp1a) (ORF1A)] [Contains: Non-structural protein 1 (nsp1) (Leader protein); Non-structural protein 2 (nsp2) (p65 homolog); Non-structural protein 3 (EC 3.4.22.-) (nsp3) (Papain- like proteinase) (PL-PRO) (PL2-PRO); Non-structural protein 4 (nsp4); 3C-like proteinase (EC 3.4.22.-) (3CL-PRO) (3CLp) (nsp5); Non- structural protein 6 (nsp6); Non-structural protein 7 (nsp7); Non- structural protein 8 (nsp8); Non-structural protein 9 (nsp9); Non- structural protein 10 (nsp10) (Growth factor-like peptide) (GFL); RNA- directed RNA polymerase (EC 2.7.7.48) (RdRp) (Pol) (nsp12); Helicase (Hel) (nsp13); Exoribonuclease (EC 3.1.13.-) (ExoN) (nsp14); Uridylate-specific endoribonuclease (EC 3.1.-.-) (NendoU) (nsp15); Putative 2'-O-methyl transferase (EC 2.1.1.-) (nsp16)] - Bat coronavirus HKU9 (BtCoV) (BtCoV/HKU9) Length = 6930 Score = 47.2 bits (107), Expect = 4e-04 Identities = 34/107 (31%), Positives = 48/107 (44%), Gaps = 13/107 (12%) Query: 95 VVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSI----GGCPTGDAKVTGGYNLPAKYII 150 +VNAAN L GGGV GA++RA +Q E G G + + L + I+ Sbjct: 962 LVNAANVNLHHGGGVAGALNRATNNAMQKESSEYIKANGSLQPGGHVLLSSHGLASHGIL 1021 Query: 151 HTVGPQDGSAEK---LESCYEKCLSFQQEYQIKSIAFPCISTGIYGF 194 H VGP + L++ Y F S+ P +S GI+GF Sbjct: 1022 HVVGPDKRLGQDLALLDAVYAAYTGFD------SVLTPLVSAGIFGF 1062 >UniRef50_Q8IBS9 Cluster: Putative uncharacterized protein MAL7P1.83; n=1; Plasmodium falciparum 3D7|Rep: Putative uncharacterized protein MAL7P1.83 - Plasmodium falciparum (isolate 3D7) Length = 936 Score = 46.4 bits (105), Expect = 7e-04 Identities = 48/211 (22%), Positives = 94/211 (44%), Gaps = 
15/211 (7%) Query: 45 NKSQGIDSKKSTTDDLKEFEKIKINTEK---NKSISERVSIFKGDITKLEIDAVVNAANS 101 +K + ID K+S D +K K + + + +++E++ + GDIT ++ A+V AN+ Sbjct: 328 DKKEIIDIKQSRYD-MKRLYKFSLQNKIYMIDNNLNEKIKTYNGDITNIKSHAIVLFANN 386 Query: 102 RLKAGGGVDGAIHRAAGPFLQAECD-SIGGCPTGDAKVTGGYNLPAKYIIHTVGPQDGSA 160 + + + ++ L+ E I +G+ +T Y+ KYI+H + P+ S Sbjct: 387 NYRYSKDICNNLFSSSLMKLEEEEKFEIKNKKSGEVYLTNSYDNIHKYILHIMLPKYNSK 446 Query: 161 ------EKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGFPNRLAAHIALRTARKFLETN 214 + C + L E +I+++ P I+ ++ FP + L++ R + Sbjct: 447 FILATHNTMNLCVYEILYVCFEKKIETLTIPIINFHMF-FPINIFLITLLKSIRSLIMIP 505 Query: 215 TEMNRIIFCTFLPIDVEIYETL---MQLYFP 242 N I F+ IY L M ++FP Sbjct: 506 QFYNTIKSIIFVTKSNHIYFLLLKYMSIFFP 536 >UniRef50_Q4YCG7 Cluster: Putative uncharacterized protein; n=3; Plasmodium (Vinckeia)|Rep: Putative uncharacterized protein - Plasmodium berghei Length = 851 Score = 46.0 bits (104), Expect = 9e-04 Identities = 52/231 (22%), Positives = 97/231 (41%), Gaps = 22/231 (9%) Query: 33 DLENVDPWSKYLNKSQGIDSKKSTTDDLKEFEKIKINTEKN-----------KSISERVS 81 D EN D + N + D K ++ +E++K I N K +++++ Sbjct: 311 DSENEDVNNINNNGNNFCDKNKRIDNEQEEYQKTDIGQAYNYSIKNKTYMVDKELNKKIK 370 Query: 82 IFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECD-SIGGCPTGDAKVTG 140 I+ GDI +E ++ AN+ K + ++ + L+ E I +G+ +T Sbjct: 371 IYNGDIANVESQGIILYANNNYKYSKSICENLYSSNLMKLEEEEKYEIRTKKSGEVYLTN 430 Query: 141 GYNLPAKYIIHTVGPQDGSA------EKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGF 194 Y+ KYI+H + P+ S + C + L E +I+SI+ P + ++ F Sbjct: 431 SYDNIHKYILHVMLPKYNSKFILATHNTMNLCVHEILYACFEKKIQSISIPIVCFSLF-F 489 Query: 195 PNRLAAHIALRTARKFLETNTEMNRIIFCTFLPIDVEIYETLMQ---LYFP 242 P + L++ R L N I F+ IY L++ ++FP Sbjct: 490 PINIFLITLLKSLRSLLLIPQFYNTIKNIIFVTNSNNIYFFLLKYISIFFP 540 >UniRef50_Q4T4T2 Cluster: Chromosome undetermined SCAF9554, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome undetermined SCAF9554, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 329 Score = 44.0 bits (99), Expect = 0.003 
Identities = 27/96 (28%), Positives = 47/96 (48%), Gaps = 5/96 (5%) Query: 144 LPAKYIIHTVGPQ---DGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGFPNRLAA 200 + A +I+H PQ D S ++LE CL ++ + S+AFP + GFP + AA Sbjct: 227 MAAGFILHCHAPQWGWDQSEQQLERTVRNCLWASEDRPLTSVAFPPLPAARNGFPRQTAA 286 Query: 201 HIALRT-ARKFL-ETNTEMNRIIFCTFLPIDVEIYE 234 + L+ F+ +++ + I+ C I V + E Sbjct: 287 QLVLKAICSHFVSSSSSSLKNILLCDSESISVYLQE 322 >UniRef50_Q6QLN1 Cluster: Non-structural polyprotein; n=40; root|Rep: Non-structural polyprotein - Avian hepatitis E virus Length = 1531 Score = 44.0 bits (99), Expect = 0.003 Identities = 38/127 (29%), Positives = 52/127 (40%), Gaps = 7/127 (5%) Query: 82 IFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKVTGG 141 + G++ + D +VN AN + GGG+ G HR P L C + PTG G Sbjct: 627 VIVGNLLDVAADWLVNPANRDHQPGGGLCGMFHR-RWPHLWPVCGEVQDLPTGPVIFQQG 685 Query: 142 YNLPAKYIIHTVGPQDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGFPNRLAAH 201 P K +IH GP + Q + ++A P IS GIY P R + Sbjct: 686 ---PPK-VIHAPGPDYRIKPDPDGLRRVYAVVHQAH--GTVASPLISAGIYRAPARESFE 739 Query: 202 IALRTAR 208 TAR Sbjct: 740 AWAATAR 746 >UniRef50_Q2V9U1 Cluster: Nonstructural protein 3; n=38; Eastern equine encephalitis virus|Rep: Nonstructural protein 3 - Eastern equine encephalitis virus (EEEV) (Eastern equineencephalomyelitis virus) Length = 539 Score = 43.6 bits (98), Expect = 0.005 Identities = 38/119 (31%), Positives = 58/119 (48%), Gaps = 19/119 (15%) Query: 82 IFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRA-AGPFLQAECDSIGGCPTGDAKVTG 140 + +GDI+K DA+VNAAN++ + G GV GA+++ G F D + TG A + Sbjct: 6 VIRGDISKSTDDAIVNAANNKGQPGAGVCGALYKKWPGAF-----DKV-PIATGTAHLV- 58 Query: 141 GYNLPAKYIIHTVGPQ-------DGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIY 192 + P IIH VGP +G+ +KL Y + ++ P +STG Y Sbjct: 59 -KHTP--NIIHAVGPNFSRVSEVEGN-QKLSEVYMDIAKIINRERYNKVSIPLLSTGTY 113 >UniRef50_A5KAG2 Cluster: Putative uncharacterized protein; n=1; Plasmodium vivax|Rep: Putative uncharacterized protein - Plasmodium vivax Length = 801 Score = 43.6 bits (98), Expect = 0.005 Identities = 37/165 
(22%), Positives = 74/165 (44%), Gaps = 8/165 (4%) Query: 76 ISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFL-QAECDSIGGCPTG 134 +++++ I GDI+ ++ +AVV AN + V ++ L + E I +G Sbjct: 348 MNKKIKIVNGDISAVDSEAVVLFANHNYRFSKRVCDDLYSCTLMKLDEEERIEIKSKKSG 407 Query: 135 DAKVTGGYNLPAKYIIHTVGPQDGSA------EKLESCYEKCLSFQQEYQIKSIAFPCIS 188 + +T Y+ KYI+H + P+ S + C ++ L E +++S++ P + Sbjct: 408 EVCLTNSYDGIHKYILHVMLPKYNSKYILATHNTMNLCVQEILCVCVEKRVQSVSIPIVC 467 Query: 189 TGIYGFPNRLAAHIALRTARKFLETNTEMNRIIFCTFLPIDVEIY 233 G++ FP + +++ R L N I F+ E+Y Sbjct: 468 FGLF-FPTNIFLVSLMKSLRSLLLLPQFYNAIRSIVFVTNSNELY 511 >UniRef50_UPI0000F1E4D0 Cluster: PREDICTED: similar to collaborator of STAT6; n=3; Danio rerio|Rep: PREDICTED: similar to collaborator of STAT6 - Danio rerio Length = 1279 Score = 43.2 bits (97), Expect = 0.006 Identities = 41/147 (27%), Positives = 61/147 (41%), Gaps = 4/147 (2%) Query: 85 GDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKVTGGYNL 144 GDIT DA+VN + + GV I AGP + A+ +G T Sbjct: 745 GDITNETTDAIVNTTDFKDFQTNGVCKDILTKAGPHVHAQLKG-AQVASGQIFTTPPGGF 803 Query: 145 PAKYIIHTVGPQDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGF-PNRLAAHIA 203 P K I+H G + S +++ ++ + + Q +S+A P I G G PN +A I Sbjct: 804 PCKTIMHVCGERSPSV--IKTLAKEIVVQCESGQYQSVAIPAICAGQEGMDPNVVAKSIL 861 Query: 204 LRTARKFLETNTEMNRIIFCTFLPIDV 230 E N + R I L I+V Sbjct: 862 DGVKEGVQEVNLQYLRNIRIILLKINV 888 >UniRef50_A7QKZ8 Cluster: Chromosome chr8 scaffold_115, whole genome shotgun sequence; n=2; Vitis vinifera|Rep: Chromosome chr8 scaffold_115, whole genome shotgun sequence - Vitis vinifera (Grape) Length = 738 Score = 43.2 bits (97), Expect = 0.006 Identities = 39/115 (33%), Positives = 57/115 (49%), Gaps = 18/115 (15%) Query: 27 KSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDLKEFEKI----KINTEKNKSISERVSI 82 K++D I LE V+ +++NK +++ + DL KI + + S + Sbjct: 269 KAADII-LEKVE---EFVNK---VENARLVLVDLSHGSKILSLVRAKAAQRNIDSNKFFT 321 Query: 83 FKGDITKL------EIDAVVNAANSRLK-AGGGVDGAIHRAAGPFLQAECDSIGG 130 F GDIT+L +A+ NAAN RLK GGG + AI AAGP L+ E G Sbjct: 322 
FVGDITRLYSKGGLRCNAIANAANWRLKPGGGGANAAIFSAAGPELEVETKKRAG 376

>UniRef50_A4S5T1 Cluster: Predicted protein; n=1; Ostreococcus lucimarinus CCE9901|Rep: Predicted protein - Ostreococcus lucimarinus CCE9901
Length = 381
Score = 43.2 bits (97), Expect = 0.006
Identities = 30/113 (26%), Positives = 56/113 (49%), Gaps = 9/113 (7%)
Query: 138 VTGGYNLPAKYIIHTVGPQ------DGSAEKLESCYEKCLSFQ-QEYQIKSIAFPCISTG 190
+T G LPA+ I H VGP+ + L CY L+ E + +++A
Sbjct: 1 MTSGGRLPARRIAHCVGPRYAEKYATAAEHALVHCYVSALTKAVDECKARTVACTPACDE 60
Query: 191 IYGFPNRLAAHIALRTARKFLET-NTEMNRIIFCTFLPIDVEIYETLMQLYFP 242
G+P+ AA + +RT R+FLE + +++ ++ C +++ Y + ++FP
Sbjct: 61 KKGYPSDSAAMVMVRTIRRFLEKWSGKLDCVVVCA-NAAEMDDYRAALSVFFP 112

>UniRef50_A6RX72 Cluster: Predicted protein; n=1; Botryotinia fuckeliana B05.10|Rep: Predicted protein - Botryotinia fuckeliana B05.10
Length = 736
Score = 42.7 bits (96), Expect = 0.008
Identities = 24/56 (42%), Positives = 33/56 (58%), Gaps = 8/56 (14%)
Query: 181 SIAFPCISTGIYGFPNRLAAHIALRTARKFLE-------TNTEMNRIIFCTFLPID 229
+IAFP ISTG FP+RLAA IA+ T R FL + +++FC + P+D
Sbjct: 378 TIAFPAISTGHKSFPHRLAARIAVGTVRDFLRHPIFGAVRRKMIRKVVFCVW-PVD 432

>UniRef50_P13886 Cluster: Non-structural polyprotein (Polyprotein nsP1234) (P1234) [Contains: P123; mRNA-capping enzyme nsP1 (EC 2.1.1.-) (EC 2.7.7.-) (Non- structural protein 1); Protease/triphosphatase/NTPase/helicase nsP2 (EC 3.4.22.-) (EC 3.1.3.33) (EC 3.6.1.15) (EC 3.6.1.-) (Non-structural protein 2) (nsP2); Non-structural protein 3 (nsP3); RNA-directed RNA polymerase nsP4 (EC 2.7.7.48) (Non-structural protein 4) (nsP4)]; n=122; Alphavirus|Rep: Non-structural polyprotein (Polyprotein nsP1234) (P1234) [Contains: P123; mRNA-capping enzyme nsP1 (EC 2.1.1.-) (EC 2.7.7.-) (Non- structural protein 1); Protease/triphosphatase/NTPase/helicase nsP2 (EC 3.4.22.-) (EC 3.1.3.33) (EC 3.6.1.15) (EC 3.6.1.-) (Non-structural protein 2) (nsP2); Non-structural protein 3 (nsP3); RNA-directed RNA polymerase nsP4 (EC 2.7.7.48) (Non-structural protein 4) (nsP4)] -
O'nyong-nyong virus (strain Gulu) (ONNV)
Length = 2514
Score = 42.7 bits (96), Expect = 0.008
Identities = 41/120 (34%), Positives = 53/120 (44%), Gaps = 15/120 (12%)
Query: 86 DITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKVTGGYNLP 145
DI K + VVNAAN R G GV A++R E P G AK P
Sbjct: 1343 DIAKNTEECVVNAANPRGVPGDGVCKAVYRK-----WPESFRNSATPVGTAKTIMCGQYP 1397
Query: 146 AKYIIHTVGPQDGS---AE---KLESCYEKCLSFQQEYQIKSIAFPCISTGIY-GFPNRL 198
+IH VGP + AE +L S Y + + S+A P +STG+Y G +RL
Sbjct: 1398 ---VIHAVGPNFSNYSEAEGDRELASVYREVAKEVSRLGVSSVAIPLLSTGVYSGGKDRL 1454

>UniRef50_Q10MW4 Cluster: Basic helix-loop-helix, putative, expressed; n=4; Oryza sativa|Rep: Basic helix-loop-helix, putative, expressed - Oryza sativa subsp. japonica (Rice)
Length = 572
Score = 42.3 bits (95), Expect = 0.011
Identities = 28/64 (43%), Positives = 35/64 (54%), Gaps = 7/64 (10%)
Query: 66 IKINTEKNKSISERVSIFKGDITKLE------IDAVVNAANSRLK-AGGGVDGAIHRAAG 118
+K K S R F GDIT+L+ + + NAAN RLK GGGV+ AI+ AAG
Sbjct: 155 VKEKAAKKNINSSRFFTFVGDITQLQSKGGLRCNVIANAANWRLKPGGGGVNAAIYNAAG 214
Query: 119 PFLQ 122
LQ
Sbjct: 215 EDLQ 218

>UniRef50_Q24DG1 Cluster: Putative uncharacterized protein; n=2; Tetrahymena thermophila SB210|Rep: Putative uncharacterized protein - Tetrahymena thermophila SB210
Length = 3154
Score = 41.9 bits (94), Expect = 0.014
Identities = 27/91 (29%), Positives = 43/91 (47%), Gaps = 1/91 (1%)
Query: 8 EIEKNRILKLSLEEKRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDLKEFEKIK 67
++ K R LS ++ KI K S FID EN K + S +ST +D K+ K
Sbjct: 2143 DLYKGRNQSLSFDDDIKINKKSTFIDFENKKQEQKQQSPQSFSQSHQSTNED-KQTPKSS 2201
Query: 68 INTEKNKSISERVSIFKGDITKLEIDAVVNA 98
IN + +++I + +IF + D N+
Sbjct: 2202 INKQDDENIEQIQNIFNSESKLYTFDKASNS 2232

>UniRef50_Q7RF86 Cluster: GYF domain, putative; n=6; Plasmodium (Vinckeia)|Rep: GYF domain, putative - Plasmodium yoelii yoelii
Length = 2031
Score = 41.5 bits (93), Expect = 0.019
Identities = 30/118 (25%), Positives = 55/118 (46%), Gaps = 3/118 (2%)
Query: 1
MVNSTKWEIEKNRILKLSLEEKRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDL 60
+++ST + EK K++ + + K K D D ++ D + K + DD
Sbjct: 1739 IISSTNKKTEKTTKNKVNKKNENKSDKGEDIGDKKSEDKKGED-KKGEDAKGDDKKGDDK 1797
Query: 61 KEFEKIKINTEKNKSISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAG 118
K+ EK+K +T + I + V I KG+ TK+ + + NS+ K G + ++ G
Sbjct: 1798 KKTEKMKWSTTGERKIEKLVDIMKGEETKINMQ--IKIENSKKKQENGNNNKNNKKLG 1853

>UniRef50_P13887 Cluster: Non-structural polyprotein (Polyprotein nsP1234) (P1234) [Contains: P123; mRNA-capping enzyme nsP1 (EC 2.1.1.-) (EC 2.7.7.-) (Non- structural protein 1); Protease/triphosphatase/NTPase/helicase nsP2 (EC 3.4.22.-) (EC 3.1.3.33) (EC 3.6.1.15) (EC 3.6.1.-) (Non-structural protein 2) (nsP2); Non-structural protein 3 (nsP3); RNA-directed RNA polymerase nsP4 (EC 2.7.7.48) (Non-structural protein 4) (nsP4)]; n=181; root|Rep: Non-structural polyprotein (Polyprotein nsP1234) (P1234) [Contains: P123; mRNA-capping enzyme nsP1 (EC 2.1.1.-) (EC 2.7.7.-) (Non- structural protein 1); Protease/triphosphatase/NTPase/helicase nsP2 (EC 3.4.22.-) (EC 3.1.3.33) (EC 3.6.1.15) (EC 3.6.1.-) (Non-structural protein 2) (nsP2); Non-structural protein 3 (nsP3); RNA-directed RNA polymerase nsP4 (EC 2.7.7.48) (Non-structural protein 4) (nsP4)] - Ross river virus (strain NB5092) (RRV)
Length = 2479
Score = 40.3 bits (90), Expect = 0.043
Identities = 46/160 (28%), Positives = 73/160 (45%), Gaps = 22/160 (13%)
Query: 86 DITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGC--PTGDAKVTGGYN 143
DI+ +AVVNAAN++ G GV A+ R P DS G P G AK+
Sbjct: 1341 DISGHAEEAVVNAANAKGTVGVGVCRAVAR-KWP------DSFKGAATPVGTAKLVQANG 1393
Query: 144 LPAKYIIHTVGPQDGSA------EKLESCYEKCLSFQQEYQIKSIAFPCISTGIY-GFPN 196
+ +IH VGP + +L + Y IKS+A P +STG++ G +
Sbjct: 1394 M---NVIHAVGPNFSTVTEAEGDRELAAAYRAVAGIINASNIKSVAIPLLSTGVFSGGKD 1450
Query: 197 RLAAHIALRTARKFLETNTEMNRIIFCTFLPIDVEIYETL 236
R+ +L ++T T+ + +I+C + +I E +
Sbjct: 1451 RVMQ--SLNHLFTAMDT-TDADVVIYCRDKAWEKKIQEAI 1487

>UniRef50_A7BRB1 Cluster: Protein containing Appr-1-p processing domain;
n=1; Beggiatoa sp. PS|Rep: Protein containing Appr-1-p processing domain - Beggiatoa sp. PS
Length = 217
Score = 39.5 bits (88), Expect = 0.075
Identities = 37/139 (26%), Positives = 57/139 (41%), Gaps = 15/139 (10%)
Query: 87 ITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGP-FLQAECDSIGGCPT---GDAKVTGGY 142
+ + +DA+V A + GGG +I AGP L+A P+ GD +T +
Sbjct: 49 LQNMAVDAIVYGAKDTGEMGGGAASSIIEEAGPKILEAARKEFALLPSKNIGDVVITDSF 108
Query: 143 NLP---AKYIIHTVG-----PQDG---SAEKLESCYEKCLSFQQEYQIKSIAFPCISTGI 191
NL K++ H + PQ S EKL K + + +SIAF + TG
Sbjct: 109 NLKERGIKFVCHLISIIKYTPQGAYCPSPEKLYDGVFKSIQLAYDKGARSIAFSAMGTGE 168
Query: 192 YGFPNRLAAHIALRTARKF 210
A + + A+ F
Sbjct: 169 GRLKPEHCARLMISAAKDF 187

>UniRef50_Q0Q467 Cluster: Replicase polyprotein 1ab (pp1ab) (ORF1ab polyprotein) [Includes: Replicase polyprotein 1a (pp1a) (ORF1a)] [Contains: Non-structural protein 1 (nsp1) (p9); Non-structural protein 2 (nsp2) (p87); Non- structural protein 3 (EC 3.4.22.-) (nsp3) (Papain-like proteinases 1/2) (PL1-PRO/PL2-PRO) (p195); Non-structural protein 4 (nsp4) (Peptide HD2); 3C-like proteinase (EC 3.4.22.-) (3CL-PRO) (3CLp) (M- PRO) (p34) (nsp5); Non-structural protein 6 (nsp6); Non-structural protein 7 (nsp7) (p5); Non-structural protein 8 (nsp8) (p23); Non- structural protein 9 (nsp9) (p12); Non-structural protein 10 (nsp10) (Growth factor-like peptide) (GFL) (p14); RNA-directed RNA polymerase (EC 2.7.7.48) (RdRp) (Pol) (p100) (nsp12); Helicase (Hel) (p66) (p66- HEL) (nsp13); Exoribonuclease (EC 3.1.13.-) (ExoN) (nsp14); Uridylate- specific endoribonuclease (EC 3.1.-.-) (NendoU) (nsp15); Putative 2'- O-methyl transferase (EC 2.1.1.-) (nsp16)]; n=225; root|Rep: Replicase polyprotein 1ab (pp1ab) (ORF1ab polyprotein) [Includes: Replicase polyprotein 1a (pp1a) (ORF1a)] [Contains: Non-structural protein 1 (nsp1) (p9); Non-structural protein 2 (nsp2) (p87); Non- structural protein 3 (EC 3.4.22.-) (nsp3) (Papain-like proteinases 1/2) (PL1-PRO/PL2-PRO) (p195); Non-structural protein 4 (nsp4) (Peptide HD2); 3C-like
proteinase (EC 3.4.22.-) (3CL-PRO) (3CLp) (M- PRO) (p34) (nsp5); Non-structural protein 6 (nsp6); Non-structural protein 7 (nsp7) (p5); Non-structural protein 8 (nsp8) (p23); Non- structural protein 9 (nsp9) (p12); Non-structural protein 10 (nsp10) (Growth factor-like peptide) (GFL) (p14); RNA-directed RNA polymerase (EC 2.7.7.48) (RdRp) (Pol) (p100) (nsp12); Helicase (Hel) (p66) (p66- HEL) (nsp13); Exoribonuclease (EC 3.1.13.-) (ExoN) (nsp14); Uridylate- specific endoribonuclease (EC 3.1.-.-) (NendoU) (nsp15); Putative 2'- O-methyl transferase (EC 2.1.1.-) (nsp16)] - Bat coronavirus 512/2005 (BtCoV) (BtCoV/512/2005)
Length = 6793
Score = 39.5 bits (88), Expect = 0.075
Identities = 46/152 (30%), Positives = 69/152 (45%), Gaps = 20/152 (13%)
Query: 78 ERVSIFKGDITKL---EIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTG 134
+ + ++G+++ L D VVNAAN +L GGG+ A+ LQ + G
Sbjct: 1303 KNIEFYQGELSALLSVNHDFVVNAANEQLSHGGGIAKALDDLTKGELQVLSNQYVS-RNG 1361
Query: 135 DAKVTGGYNLPAK--YIIHTVGPQDG--SAEKLESCYEKCLSFQQEYQIKSI-AFPCIST 189
KV G + K I++ VGP+ G +AE L Y F+Q K + P +S
Sbjct: 1362 SIKVGSGVLIKCKEHSILNVVGPRKGKHAAELLTKAY--TFVFKQ----KGVPLMPLLSV 1415
Query: 190 GIYGFP--NRLAAHIAL---RTARKFLETNTE 216
GI+ P LAA +A R + F T+ E
Sbjct: 1416 GIFKVPITESLAAFLACVGDRVCKCFCYTDKE 1447

>UniRef50_Q22U36 Cluster: Cyclic nucleotide-binding domain containing protein; n=1; Tetrahymena thermophila SB210|Rep: Cyclic nucleotide-binding domain containing protein - Tetrahymena thermophila SB210
Length = 913
Score = 39.1 bits (87), Expect = 0.099
Identities = 26/82 (31%), Positives = 45/82 (54%), Gaps = 6/82 (7%)
Query: 2 VNSTKWEIEKNRILKLSLEEKRK-IYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDL 60
++++++E+ +NR+ + E Y+ S D +N D +K NKS I T D L
Sbjct: 679 LDNSEFELNQNRLEHKQVNESNSDYYQKSKETDQQNEDSENKNTNKSIMI-----TQDVL 733
Query: 61 KEFEKIKINTEKNKSISERVSI 82
K+F + N+EK+K+ + VSI
Sbjct: 734 KDFNDLNQNSEKSKNFHKLVSI 755

>UniRef50_Q6ZKH7 Cluster: Putative uncharacterized protein OJ1119_D01.23; n=2; Oryza sativa (japonica cultivar-group)|Rep:
Putative uncharacterized protein OJ1119_D01.23 - Oryza sativa subsp. japonica (Rice)
Length = 267
Score = 38.7 bits (86), Expect = 0.13
Identities = 19/36 (52%), Positives = 24/36 (66%), Gaps = 4/36 (11%)
Query: 80 VSIFKGDITKLEID----AVVNAANSRLKAGGGVDG 111
+ + KGDIT +D A+VNAAN R+ GGGVDG
Sbjct: 83 LKLHKGDITLWSVDGATVAIVNAANERMLGGGGVDG 118

>UniRef50_Q8ZN14 Cluster: Gifsy-1 prophage protein; n=4; Bacteria|Rep: Gifsy-1 prophage protein - Salmonella typhimurium
Length = 274
Score = 37.5 bits (83), Expect = 0.30
Identities = 44/162 (27%), Positives = 67/162 (41%), Gaps = 21/162 (12%)
Query: 77 SERVSIFKGDITKL-EIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAEC------DSIG 129
+E V I G + E D +V+AANS GGVD AI GP LQ + +G
Sbjct: 24 TENVEIIPGPFETIPEFDCMVSAANSFGLMDGGVDAAITAYFGPQLQERVQQHILREYLG 83
Query: 130 GCPTGDAKVTGGYNLPAKYIIH------------TVGPQDGSAEKLESCYEKCLSFQQEY 177
P G A V N +++H T + + L + ++ S ++
Sbjct: 84 EQPVGTAFVIETGNSKYPWLVHAPTMRVPLIIDGTDAVYNATRAALLAIFQHNKSAGEDR 143
Query: 178 QIKSIAFPCISTGI-YGFPNRLAAHIALRTARKFLETNTEMN 218
+IKS+ FP + G P +A + L F+ TE+N
Sbjct: 144 KIKSVVFPAMGAGCGQVSPGSVARQMKL-AWDGFINCTTEIN 184

>UniRef50_Q8JJX1 Cluster: Non-structural polyprotein (Polyprotein nsP1234) (P1234) [Contains: P123; mRNA-capping enzyme nsP1 (EC 2.1.1.-) (EC 2.7.7.-) (Non- structural protein 1); Protease/triphosphatase/NTPase/helicase nsP2 (EC 3.4.22.-) (EC 3.1.3.33) (EC 3.6.1.15) (EC 3.6.1.-) (Non-structural protein 2) (nsP2); Non-structural protein 3 (nsP3); RNA-directed RNA polymerase nsP4 (EC 2.7.7.48) (Non-structural protein 4) (nsP4)]; n=62; Alphavirus|Rep: Non-structural polyprotein (Polyprotein nsP1234) (P1234) [Contains: P123; mRNA-capping enzyme nsP1 (EC 2.1.1.-) (EC 2.7.7.-) (Non- structural protein 1); Protease/triphosphatase/NTPase/helicase nsP2 (EC 3.4.22.-) (EC 3.1.3.33) (EC 3.6.1.15) (EC 3.6.1.-) (Non-structural protein 2) (nsP2); Non-structural protein 3 (nsP3); RNA-directed RNA polymerase nsP4 (EC 2.7.7.48) (Non-structural protein 4) (nsP4)] - Salmon pancreas disease virus (SPDV)
Length = 2601
Score = 37.5 bits (83), Expect = 0.30
Identities = 36/137 (26%), Positives = 57/137 (41%), Gaps = 16/137 (11%)
Query: 64 EKIKINTEKNKSISERVS--IFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFL 121
+K+K+ N + + +I E + +VNAANS + G GV GA++ A G
Sbjct: 1407 DKVKVAEILNSMVGAAPGYRVLNRNIITAEEEVLVNAANSNGRPGDGVCGALYGAFG--- 1463
Query: 122 QAECDSIGGCPTGDAKVTGGYNLPAKYIIHTVG------PQDGSAEKLESCYEKCLSFQQ 175
+ G G+A + G IIH G ++ A +L + Y +
Sbjct: 1464 --DAFPNGAIGAGNAVLVRGLEAT---IIHAAGADFREVDEETGARQLRAAYRAAATLVT 1518
Query: 176 EYQIKSIAFPCISTGIY 192
I S A P +ST I+
Sbjct: 1519 ANGITSAAIPLLSTHIF 1535

>UniRef50_Q69HN2 Cluster: Putative uncharacterized protein; n=1; Ciona intestinalis|Rep: Putative uncharacterized protein - Ciona intestinalis (Transparent sea squirt)
Length = 437
Score = 37.1 bits (82), Expect = 0.40
Identities = 30/111 (27%), Positives = 48/111 (43%), Gaps = 6/111 (5%)
Query: 86 DITKLEIDAVVNAANSRLKAGGG-VDGAIHRAAGPFLQAECDSIGGCPTGDAKVTGGYNL 144
D+TK I +VN+ + G V + R GP LQ EC + T ++T G NL
Sbjct: 90 DLTKSNI--IVNSVGPDFELSKGQVSAILLRRVGPQLQTECTNNPKFATESYRITTGGNL 147
Query: 145 PAKYIIHTVGPQDGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGFP 195
+I+H V P ++E + L + ++ P + +G G P
Sbjct: 148 -CDHIVHYVLP--NKEYRIEESIMELLEKCDNMEAITVVMPVLGSGNRGVP 195

>UniRef50_A2E8H6 Cluster: Viral A-type inclusion protein, putative; n=1; Trichomonas vaginalis G3|Rep: Viral A-type inclusion protein, putative - Trichomonas vaginalis G3
Length = 2458
Score = 37.1 bits (82), Expect = 0.40
Identities = 24/66 (36%), Positives = 35/66 (53%), Gaps = 2/66 (3%)
Query: 15 LKLSLEEKRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDL--KEFEKIKINTEK 72
LK +EE +K +SS+ E + W +++ ID+ KS ++L K E IK N EK
Sbjct: 1009 LKSEIEELKKKLESSEQNKEEENNGWGDENTETENIDNLKSEIEELNKKLDESIKSNDEK 1068
Query: 73 NKSISE 78
K I E
Sbjct: 1069 QKKIEE 1074

>UniRef50_Q3BBL7 Cluster: Putative uncharacterized protein; n=14; Pyrococcus|Rep: Putative uncharacterized protein - Pyrococcus sp.
322
Length = 96
Score = 37.1 bits (82), Expect = 0.40
Identities = 20/69 (28%), Positives = 30/69 (43%)
Query: 161 EKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGFPNRLAAHIALRTARKFLETNTEMNRI 220
+KL+ L E ++SIAFP IS GIYG P + T +FL+ + +
Sbjct: 12 DKLKPAILGALKKADELGVRSIAFPAISAGIYGCPLEKVVKVFKDTVEQFLKEAKNVKDV 71
Query: 221 IFCTFLPID 229
+ D
Sbjct: 72 FLVLYSETD 80

>UniRef50_A6DE82 Cluster: Exonuclease SbcC; n=1; Caminibacter mediatlanticus TB-2|Rep: Exonuclease SbcC - Caminibacter mediatlanticus TB-2
Length = 665
Score = 36.7 bits (81), Expect = 0.53
Identities = 30/97 (30%), Positives = 49/97 (50%), Gaps = 6/97 (6%)
Query: 1 MVNSTKWEIEKNRILKLSLEEKRKIYKSSDFID-LENVDPWSKYLNKSQGIDSKKSTT-D 58
++ K EIEK + K LEEK I+K + L+ P +K+ D+ S + D
Sbjct: 204 ILEKLKKEIEKLTLQKDKLEEKVLIFKFEKYRSYLKENTPCPLCGSKNHNFDNLDSVSED 263
Query: 59 DLKEFEK-IKINTEKNKSISE---RVSIFKGDITKLE 91
D+ E++ + I EKNK + + +I + +I KLE
Sbjct: 264 DINEYKNLVNILEEKNKEFEDKKIKQNILESEILKLE 300

>UniRef50_A4GSN8 Cluster: Nuclear-pore anchor; n=7; Arabidopsis thaliana|Rep: Nuclear-pore anchor - Arabidopsis thaliana (Mouse-ear cress)
Length = 2093
Score = 36.7 bits (81), Expect = 0.53
Identities = 25/75 (33%), Positives = 40/75 (53%), Gaps = 3/75 (4%)
Query: 3 NSTKWEIEKNRILKLSLE-EKRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDLK 61
N K E+EKN+ + +L KRK K D + +N +K L +++ K++TTD +
Sbjct: 1430 NKQKQELEKNKKIHYTLNMTKRKYEKEKDELSKQN-QSLAKQLEEAKEEAGKRTTTDAVV 1488
Query: 62 EFEKIKINTEKNKSI 76
E + +K EK K I
Sbjct: 1489 E-QSVKEREEKEKRI 1502

>UniRef50_Q54DH8 Cluster: Putative uncharacterized protein TAF1; n=2; cellular organisms|Rep: Putative uncharacterized protein TAF1 - Dictyostelium discoideum AX4
Length = 2310
Score = 36.7 bits (81), Expect = 0.53
Identities = 27/93 (29%), Positives = 50/93 (53%), Gaps = 5/93 (5%)
Query: 16 KLSLEEKRKIYKSSDFIDLENVDPWSKYLN--KSQGIDS--KKSTTDDLKEFEKIKINTE 71
+L +EE +++K DLE S++++ K+ GID K + TD + +K ++ E
Sbjct: 24 ELDVEENDQVFKDLKK-DLELFAKSSQHISFKKTIGIDEDDKNAVTDSVIVPDKNALDYE 82
Query: 72 KNKSISERVSIFKGDITKLEIDAVVNAANSRLK 104
++E + +
+I KL D + NAA +RL+
Sbjct: 83 DIDEVAEEIQSTENEINKLNADKLANAAIARLQ 115

>UniRef50_A0DTL5 Cluster: Chromosome undetermined scaffold_63, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_63, whole genome shotgun sequence - Paramecium tetraurelia
Length = 282
Score = 36.7 bits (81), Expect = 0.53
Identities = 30/91 (32%), Positives = 50/91 (54%), Gaps = 11/91 (12%)
Query: 8 EIEKNRILKLSL---EEKRKIY--KSSDFIDLEN-VDPWSKYLNKSQGIDSKKSTTDDLK 61
+IE+ +ILKL L E +K Y K + LE V+ + Y +K + + KK L+
Sbjct: 31 DIEQQKILKLQLSRIENLKKEYSKKEQEICRLEQQVEQFRIYYDKYENV--KKLLESALE 88
Query: 62 EFEKIKINTEKNKSISERVSIFKGDITKLEI 92
+ EKI+ +NKS+ +++S F+ KLE+
Sbjct: 89 QLEKIE---NQNKSLQKKLSDFQESYAKLEL 116

>UniRef50_Q6FSG9 Cluster: Candida glabrata strain CBS138 chromosome H complete sequence; n=4; Saccharomycetales|Rep: Candida glabrata strain CBS138 chromosome H complete sequence - Candida glabrata (Yeast) (Torulopsis glabrata)
Length = 451
Score = 36.7 bits (81), Expect = 0.53
Identities = 18/76 (23%), Positives = 39/76 (51%), Gaps = 2/76 (2%)
Query: 8 EIEKNRILKLSLEEKRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDLKEFEKIK 67
++ I++++++ R Y+ + D ++D + Y S G D+ K DD+ E E+ +
Sbjct: 309 DVYLKNIIEMAIDTVR--YRKKKYSDYYDLDDFGTYQAVSSGTDTSKDAKDDIMEIERKR 366
Query: 68 INTEKNKSISERVSIF 83
+ N+ I +S+F
Sbjct: 367 TISLTNEDIYTSLSLF 382

>UniRef50_UPI00004993C7 Cluster: hypothetical protein 3.t00030; n=1; Entamoeba histolytica HM-1:IMSS|Rep: hypothetical protein 3.t00030 - Entamoeba histolytica HM-1:IMSS
Length = 1144
Score = 36.3 bits (80), Expect = 0.70
Identities = 27/87 (31%), Positives = 43/87 (49%), Gaps = 7/87 (8%)
Query: 8 EIEKNRILKLSLEE-KRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDLKEF--- 63
E E+ K +EE K K+Y++ I E + + K + I+ K D++KE
Sbjct: 603 ETERQERKKEEIEEFKEKVYETEKKI--EGITNRIDEMVKKEEIEEIKQNIDNIKEIIKS 660
Query: 64 -EKIKINTEKNKSISERVSIFKGDITK 89
+++KIN EKNK I E + +I K
Sbjct: 661 IDEVKINNEKNKKIIEGIQKENEEIKK 687

>UniRef50_Q6MRT6 Cluster: Putative uncharacterized protein; n=1; Mycoplasma
mycoides subsp. mycoides SC|Rep: Putative uncharacterized protein - Mycoplasma mycoides subsp. mycoides SC
Length = 472
Score = 36.3 bits (80), Expect = 0.70
Identities = 33/109 (30%), Positives = 54/109 (49%), Gaps = 11/109 (10%)
Query: 4 STKWEIEKNRILKLSLEEKRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDLKEF 63
STK++ EK R+L E +K KS + DL+ LNK Q D +++ KE
Sbjct: 182 STKYQEEKIRLLSEEYENNKKNLKSQE-KDLKEKTEMLLMLNK-QKTDLEQTLVMLTKEK 239
Query: 64 EKIKINTEKNKS----ISERVS-----IFKGDITKLEIDAVVNAANSRL 103
+++ +N EK K+ IS+++S + K D +I V+N + L
Sbjct: 240 DQLLVNEEKLKNEISEISKKISDKKDELIKDDTALKKIKTVINGIDQNL 288

>UniRef50_Q8I4Z1 Cluster: Putative uncharacterized protein; n=2; Plasmodium|Rep: Putative uncharacterized protein - Plasmodium falciparum (isolate 3D7)
Length = 1846
Score = 36.3 bits (80), Expect = 0.70
Identities = 22/80 (27%), Positives = 41/80 (51%), Gaps = 2/80 (2%)
Query: 16 KLSLEEKRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDLKEFEK--IKINTEKN 73
+L LEEK K+ K + + E ++ Y+ K ++ KK+ + +++ K I+ + EK
Sbjct: 1418 QLLLEEKIKLQKEKELFENEKLERKMSYMLKINELEKKKNERNKMEKSYKRMIQKDKEKK 1477
Query: 74 KSISERVSIFKGDITKLEID 93
K R I +G+ K+ D
Sbjct: 1478 KKKESRDKIRRGEEEKMSAD 1497

>UniRef50_A0CHZ3 Cluster: Chromosome undetermined scaffold_186, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_186, whole genome shotgun sequence - Paramecium tetraurelia
Length = 1325
Score = 36.3 bits (80), Expect = 0.70
Identities = 31/99 (31%), Positives = 45/99 (45%), Gaps = 8/99 (8%)
Query: 2 VNSTKWEIEKNRILKLSLEEKRKIYKSSDFIDLENVDPW----SKYLNKSQGIDSKKSTT 57
+N ++ R+ KL EEK K K +D I P S + N+SQ ID+ K T
Sbjct: 47 INKDTAQVLSQRVEKLQ-EEKDKYKKQADEILKRTEGPGIHRSSIHSNRSQKIDNDKFTQ 105
Query: 58 DDLKEFEKIKINTE---KNKSISERVSIFKGDITKLEID 93
D +K+ E + + E K K + K DI +L D
Sbjct: 106 DQIKQREILALELEMNQKEKQFLAEIENLKMDIKQLTHD 144

>UniRef50_Q6CT35 Cluster: Similar to sgd|S0006295 Saccharomyces cerevisiae YPR091c; n=1; Kluyveromyces lactis|Rep: Similar to sgd|S0006295 Saccharomyces cerevisiae YPR091c -
Kluyveromyces lactis (Yeast) (Candida sphaerica)
Length = 783
Score = 36.3 bits (80), Expect = 0.70
Identities = 21/73 (28%), Positives = 36/73 (49%), Gaps = 3/73 (4%)
Query: 10 EKNRILKLSLEEKRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDLKEFEKIKIN 69
E N I +L E++ ++ +D V+ +N+S D +++TD + F K K N
Sbjct: 510 ENNDISNTNLAERQS---TNSAVDSPTVEESESTINESYNTDQPQTSTDSTRSFLKNKSN 566
Query: 70 TEKNKSISERVSI 82
+ N SI R S+
Sbjct: 567 DDSNVSIRSRSSV 579

>UniRef50_UPI000065F7D8 Cluster: Homolog of Homo sapiens "Splice Isoform 1 of Bromodomain-containing protein 4; n=1; Takifugu rubripes|Rep: Homolog of Homo sapiens "Splice Isoform 1 of Bromodomain-containing protein 4 - Takifugu rubripes
Length = 321
Score = 35.9 bits (79), Expect = 0.93
Identities = 15/49 (30%), Positives = 30/49 (61%)
Query: 32 IDLENVDPWSKYLNKSQGIDSKKSTTDDLKEFEKIKINTEKNKSISERV 80
I L+N D W++ ++S + S KS+ D ++F K + E+ K++ ++V
Sbjct: 162 IVLKNADSWARLASQSVALASGKSSKDAFQQFRKAALEKERVKALKKQV 210

>UniRef50_A7DT33 Cluster: Putative uncharacterized protein; n=3; Caenorhabditis|Rep: Putative uncharacterized protein - Caenorhabditis elegans
Length = 245
Score = 35.9 bits (79), Expect = 0.93
Identities = 28/93 (30%), Positives = 49/93 (52%), Gaps = 7/93 (7%)
Query: 4 STKWEIEKNRILKLSLEEKRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDLKEF 63
S K +IE + KL L ++R IY+ + ID +N D K + K++G+ ++ D K
Sbjct: 70 SYKTQIEM--VQKLILAKRRIIYEVQEKIDKKNYDK-MKLVEKTEGLGTQADNNVDTK-L 125
Query: 64 EKIKINTEKNKSISERVSIFKG---DITKLEID 93
+ IKI N+ ++ R + D+ K++ID
Sbjct: 126 KNIKITRIMNRLVAYRKTFLLQEIMDVFKIDID 158

>UniRef50_A2EMN0 Cluster: Putative uncharacterized protein; n=1; Trichomonas vaginalis G3|Rep: Putative uncharacterized protein - Trichomonas vaginalis G3
Length = 1077
Score = 35.9 bits (79), Expect = 0.93
Identities = 42/184 (22%), Positives = 74/184 (40%), Gaps = 17/184 (9%)
Query: 38 DPWSKYLNKSQGIDSKKSTTDDLKEFEKIKINTEKNKSISERVS---IFKGDITKLEIDA 94
+ W K K+ I+S S + KE E +++T N + ++S +F ++TK E+ A
Sbjct: 596 EEWEKLYGKTLTIESWMSNKTETKE-EIYEVSTGCNIKVLIQLSNKYVFGVNLTKAELVA
654
Query: 95 VVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKVTGGYNLPAKYIIHTVG 154
N P + + S G +KV LP ++ G
Sbjct: 655 EFTPENKEENCDDSYK------TNPAFRVDIPSRKAALDGVSKV-----LPLDFVCKKTG 703
Query: 155 PQDGSAEKLES--CYEKCLSFQQEYQIKSIAFPCISTGIYGFPNRLAAHIALRTARKFLE 212
+ +++S C E ++F+ S +FP I+ I PN L I +R + K +
Sbjct: 704 VFKINKFQMQSWGCVETSVTFEPAIIKASDSFPLITMSIENLPNELVQGICVRFSVKIVN 763
Query: 213 TNTE 216
T+
Sbjct: 764 NGTK 767

>UniRef50_UPI00006CE511 Cluster: hypothetical protein TTHERM_00141050; n=1; Tetrahymena thermophila SB210|Rep: hypothetical protein TTHERM_00141050 - Tetrahymena thermophila SB210
Length = 267
Score = 35.5 bits (78), Expect = 1.2
Identities = 42/166 (25%), Positives = 69/166 (41%), Gaps = 13/166 (7%)
Query: 80 VSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSI----GGCPTGD 135
+ I KG+I ID +VN + L + +A L+ E DS+ G D
Sbjct: 28 IIILKGNICNENIDCIVNWVDCFLMNERTY--ILKQALNDKLKKELDSVKHSKGILTLND 85
Query: 136 AKVTGGYNLP-AKYIIHTVGPQ-DGSAEK----LESCYEKCLSFQQEYQIKSIAFPCIST 189
+T L K IIH+ P G EK E +C+ + SI F S+
Sbjct: 86 CFITSPGKLQNTKKIIHSTLPLWRGGHEKELQYFEESITQCIQLAINQNMSSIGFTQDSS 145
Query: 190 GIYGFPNRLAAHIALRTARKFLE-TNTEMNRIIFCTFLPIDVEIYE 234
I+G P + A I +++ +F +T + R+ F +++Y+
Sbjct: 146 DIFGIPLQDCAEILIQSFYRFATFKDTSIKRVYFIHQDSSAIQVYK 191

>UniRef50_UPI000049880F Cluster: hypothetical protein 63.t00025; n=1; Entamoeba histolytica HM-1:IMSS|Rep: hypothetical protein 63.t00025 - Entamoeba histolytica HM-1:IMSS
Length = 1005
Score = 35.5 bits (78), Expect = 1.2
Identities = 23/99 (23%), Positives = 49/99 (49%), Gaps = 6/99 (6%)
Query: 1 MVNSTKWEIEKNRILKLSLEEKRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDL 60
++N EK +I+KL EE+ K + + ++++++ NK++ ++ KK DD+
Sbjct: 663 LINEIISTTEKTKIIKLGTEEEIKEFNEAKEKEMKSIEERKNKENKTKKVERKKRRVDDI 722
Query: 61 ------KEFEKIKINTEKNKSISERVSIFKGDITKLEID 93
KE K +I T N+ ++++ K + +D
Sbjct: 723 DIKDTNKEERKRRIETFLNEVKVKKLNELKEENVSFVLD 761

>UniRef50_Q0WYB5 Cluster: Nonstructural protein; n=141; Hepatitis E virus|Rep: Nonstructural protein - Hepatitis E virus
Length =
1717
Score = 35.5 bits (78), Expect = 1.2
Identities = 30/117 (25%), Positives = 54/117 (46%), Gaps = 15/117 (12%)
Query: 82 IFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKVTGG 141
++ G + + + D +VNA+N + GGG+ A F Q +S +
Sbjct: 814 VYAGSLFESDCDWLVNASNPGHRPGGGLCHA-------FYQRFPESFHPTDFIMREGLAA 866
Query: 142 YNLPAKYIIHTVGPQ---DGSAEKLESCYEKCLSFQQEYQIKSIAFPCISTGIYGFP 195
Y L + IIH V P + + ++LE+ Y + S ++ + A+P + +GIY P
Sbjct: 867 YTLTPRPIIHAVAPDYRIEQNPKRLEAAYRETCS-----RLGTAAYPLLGSGIYQVP 918

>UniRef50_Q1UZP6 Cluster: Putative uncharacterized protein; n=1; Candidatus Pelagibacter ubique HTCC1002|Rep: Putative uncharacterized protein - Candidatus Pelagibacter ubique HTCC1002
Length = 297
Score = 35.5 bits (78), Expect = 1.2
Identities = 22/77 (28%), Positives = 42/77 (54%), Gaps = 1/77 (1%)
Query: 25 IYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDL-KEFEKIKINTEKNKSISERVSIF 83
IY S FID E+ + ++++L++ I + T+DL KE E+++ + +++++ F
Sbjct: 113 IYLPSVFIDTEDAETYAEFLDEDIWIPFTEMLTEDLGKESEEVEKLKKAVENLNKYQDFF 172
Query: 84 KGDITKLEIDAVVNAAN 100
K D +K D AN
Sbjct: 173 KKDFSKYYTDIFNYDAN 189

>UniRef50_Q9U0D4 Cluster: Sequestrin; n=2; Plasmodium falciparum|Rep: Sequestrin - Plasmodium falciparum
Length = 652
Score = 35.5 bits (78), Expect = 1.2
Identities = 31/103 (30%), Positives = 51/103 (49%), Gaps = 10/103 (9%)
Query: 8 EIEKNRILKLSLEEKRKIYKSS-DFIDLENVDPWSKYL------NKSQGIDSKKSTTDDL 60
+IEK +I K+ +E KIY+ D +D + + +S Y+ N I ++K T D
Sbjct: 147 KIEKEKINKMDKDEIDKIYREELDKMDRDAI--YSMYIEDISNKNIKDLIKNEKETNKDK 204
Query: 61 KEFEKIKINTEKNKSISERVSIFKGDITKLEIDAVVNAANSRL 103
+ + I IN +K K I V I K DI K ++ + ++L
Sbjct: 205 NKKKDIDINKKKKKDIDIDVDIDK-DIHKDHVEELYGEVKNKL 246

>UniRef50_Q54KL2 Cluster: Putative uncharacterized protein; n=1; Dictyostelium discoideum AX4|Rep: Putative uncharacterized protein - Dictyostelium discoideum AX4
Length = 337
Score = 35.5 bits (78), Expect = 1.2
Identities = 20/56 (35%), Positives = 31/56 (55%), Gaps = 3/56 (5%)
Query: 17 LSLEEKRKIYKSSDFIDLENVD---PWSKYLNKSQGIDSKKSTTDDLKEFEKIKIN
69
+ LE+K++ Y SD+ D N+ +SKYL + K++ D +K FE I IN
Sbjct: 137 IDLEKKKEQYDESDWNDSSNISNPYSYSKYLAEKATWSYKENNADKVKSFEIIIIN 192

>UniRef50_Q24GP7 Cluster: Putative uncharacterized protein; n=2; cellular organisms|Rep: Putative uncharacterized protein - Tetrahymena thermophila SB210
Length = 2929
Score = 35.5 bits (78), Expect = 1.2
Identities = 24/84 (28%), Positives = 43/84 (51%), Gaps = 2/84 (2%)
Query: 5 TKWEIEKNRILKLSLEEKRKIYKSSDFIDLENVDPWSKYLNKSQGID-SKKSTTDDLKEF 63
T+W+I N + + +K+ I K SD+ + +VD ++ K + KKS+ + L+
Sbjct: 1747 TEWQIPNNLDILDYINQKQTIQKESDYQKISDVDLKKEFDEKEYSAEFIKKSSPNSLEIL 1806
Query: 64 E-KIKINTEKNKSISERVSIFKGD 86
E K I+ +K + S + I GD
Sbjct: 1807 EMKQNISNDKKEEQSYKSEIKLGD 1830

>UniRef50_UPI00006CAB22 Cluster: hypothetical protein TTHERM_00780730; n=1; Tetrahymena thermophila SB210|Rep: hypothetical protein TTHERM_00780730 - Tetrahymena thermophila SB210
Length = 132
Score = 35.1 bits (77), Expect = 1.6
Identities = 22/76 (28%), Positives = 38/76 (50%), Gaps = 1/76 (1%)
Query: 14 ILKLSLEEKRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDLKEFEKIKINTEKN 73
+++ S+E+ + K + ++ S+ +N SQ SKK D +E K + N +
Sbjct: 1 MIRYSIEDLANLMKGHNIFKVKAQTQQSQSVNGSQSQQSKKKKPDSHEEISKSEFNNSQ- 59
Query: 74 KSISERVSIFKGDITK 89
K ISE +S K D +K
Sbjct: 60 KLISEYLSADKSDKSK 75

>UniRef50_Q6A5L0 Cluster: Anaerobic glycerol-3-phosphate dehydrogenase subunit A; n=2; Actinomycetales|Rep: Anaerobic glycerol-3-phosphate dehydrogenase subunit A - Propionibacterium acnes
Length = 544
Score = 35.1 bits (77), Expect = 1.6
Identities = 35/136 (25%), Positives = 57/136 (41%), Gaps = 11/136 (8%)
Query: 33 DLENVDPWSKYLNKSQGIDSKKSTTDDLKEFEKIKINTEKNKSISERVSIFKGDITKLEI 92
DLE D W + KS+ + ST L+ ++ N I ++ G + ++
Sbjct: 97 DLEFSDQWVEGAKKSKVPFEEISTAQALRREPRL------NPGIKRAFAVQDGSVDGWQM 150
Query: 93 DAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKVTGGYNLPAKYIIHT 152
V AA+S ++ G V + AA + E D I D K + K++I+T
Sbjct: 151 --VWGAAHSAIEYGAKV---MTYAAVTEIIREGDQITAVVAHDLKHDEQIRIDCKFVINT 205
Query: 153 VGPQDGSAEKLESCYE 168
GP G +L CY+
Sbjct:
206 AGPWAGRIAELVGCYD 221

>UniRef50_Q1FGW8 Cluster: Peptidase M23B precursor; n=1; Clostridium phytofermentans ISDg|Rep: Peptidase M23B precursor - Clostridium phytofermentans ISDg
Length = 469
Score = 35.1 bits (77), Expect = 1.6
Identities = 21/92 (22%), Positives = 52/92 (56%), Gaps = 4/92 (4%)
Query: 10 EKNRILKLSLEEKRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDLKEFEKIKIN 69
E++ +L LS E+ ++I + ++ I N + +++Y N+ I +++ DD+KE E+ ++
Sbjct: 240 EQDTLLTLSAEKGKEIVRYTEAIGA-NEELFAEYSNE---IANQEKNIDDIKEEERKRVE 295
Query: 70 TEKNKSISERVSIFKGDITKLEIDAVVNAANS 101
++ K I E I + + + +++ + N+
Sbjct: 296 EQERKRIEEEARIKREEEARKKLELENQSPNA 327

>UniRef50_Q0PBQ1 Cluster: Putative uncharacterized protein; n=12; Campylobacter|Rep: Putative uncharacterized protein - Campylobacter jejuni
Length = 386
Score = 35.1 bits (77), Expect = 1.6
Identities = 29/106 (27%), Positives = 49/106 (46%), Gaps = 8/106 (7%)
Query: 10 EKNRILKLSLEEKRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDLKEFEKIKIN 69
E ++L L E I K DF D ENV K L K+ +D++ S + + E E +
Sbjct: 50 ETKKVLNSLLVEFLTILKKLDFFDDENVTKVIKALVKASIVDAQNSLYEYISEAELL--- 106
Query: 70 TEKNKSISERVSIFKGDITK--LEIDAVVNAANSRLKAGGGVDGAI 113
NK I + ++ K I+ E + ++ + + GG++ AI
Sbjct: 107 ---NKQIENQKNLIKNQISDNFFEFENILQECSFCDEFSGGLNDAI 149

>UniRef50_A3S6V5 Cluster: Putative uncharacterized protein; n=1; Prochlorococcus marinus str. MIT 9211|Rep: Putative uncharacterized protein - Prochlorococcus marinus str.
MIT 9211
Length = 113
Score = 35.1 bits (77), Expect = 1.6
Identities = 23/79 (29%), Positives = 40/79 (50%), Gaps = 2/79 (2%)
Query: 23 RKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDLKEFEKIKINTEKNKSISERVSI 82
+K K D LE ++ + K+QG+ KK+ DL E + +++ ++N +I E
Sbjct: 29 KKFKKDGDLAILETIEKSKANMAKAQGL--KKTKWYDLDEIDALRMLVKQNYTIIEDQKA 86
Query: 83 FKGDITKLEIDAVVNAANS 101
KG IT L I +++ S
Sbjct: 87 IKGWITFLGIVTLLSLIGS 105

>UniRef50_Q331Z6 Cluster: Conserved hypothetical phage-related protein; n=1; Clostridium phage c-st|Rep: Conserved hypothetical phage-related protein - Clostridium botulinum C bacteriophage
Length = 1662
Score = 35.1 bits (77), Expect = 1.6
Identities = 27/97 (27%), Positives = 41/97 (42%), Gaps = 2/97 (2%)
Query: 10 EKNRILKLSLEEKRKIY--KSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDLKEFEKIK 67
+K +L L+ +IY K D D E+ D +SK LNK Q SK D +
Sbjct: 1257 KKMDLLNEELKSYEEIYNAKIKDIDDKESEDKYSKELNKKQKEKSKLQIQHDALMMDSSL 1316
Query: 68 INTEKNKSISERVSIFKGDITKLEIDAVVNAANSRLK 104
K +S+ E + + DI + + D + LK
Sbjct: 1317 EAKAKRESLLEEIKKKQEDIDQFQHDRDITLRKKNLK 1353

>UniRef50_A4VE14 Cluster: Putative uncharacterized protein; n=1; Tetrahymena thermophila SB210|Rep: Putative uncharacterized protein - Tetrahymena thermophila SB210
Length = 399
Score = 35.1 bits (77), Expect = 1.6
Identities = 25/77 (32%), Positives = 40/77 (51%), Gaps = 7/77 (9%)
Query: 1 MVNSTKWEIEKNRILKLSLEEKRKIYKSSD--FIDLENVDPWSKYLNKSQG----IDSKK 54
+V+ + EI+K + L + LEE+ +Y D F D+ V+ SK+ N Q IDS+K
Sbjct: 278 LVSKFQEEIDKQKELDVKLEERLNVYLEQDKKFQDI-MVESNSKFTNYKQAHDSLIDSQK 336
Query: 55 STTDDLKEFEKIKINTE 71
T + E +K T+
Sbjct: 337 KTESSINELKKKNEKTD 353

>UniRef50_Q6LQJ9 Cluster: UPF0234 protein PBPRA2024; n=15; Proteobacteria|Rep: UPF0234 protein PBPRA2024 - Photobacterium profundum (Photobacterium sp.
(strain SS9))
Length = 161
Score = 35.1 bits (77), Expect = 1.6
Identities = 23/83 (27%), Positives = 40/83 (48%), Gaps = 3/83 (3%)
Query: 25 IYKSSDFIDLEN-VDPWSKYLNKSQGIDSKKSTTDDLKEFEKIKINTEKNKSISERVSIF 83
I DF+++ N VD ++ L D K + E +KI TE + +++ VSI
Sbjct: 6 IVSEVDFVEVRNAVDNSARELKTR--FDFKNVEASITFDKEIVKITTESDFQLTQLVSIL 63
Query: 84 KGDITKLEIDAVVNAANSRLKAG 106
+G++ K E+DA ++ G
Sbjct: 64 RGNLAKREVDAQSMTQKDTVRTG 86

>UniRef50_Q4SQ87 Cluster: Chromosome 4 SCAF14533, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome 4 SCAF14533, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer)
Length = 1780
Score = 34.7 bits (76), Expect = 2.1
Identities = 26/70 (37%), Positives = 31/70 (44%), Gaps = 3/70 (4%)
Query: 105 AGGGVDGAIHRAAGPF-LQAECDSIGGCPTGD-AKVTGGYNLPAKYIIHTV-GPQDGSAE 161
AGGG DG + AAG L+ E + CP G GG P T G GSA
Sbjct: 1503 AGGGEDGCLSCAAGRIHLREEGRCLLSCPRGRYHHSAGGSCEPCHASCRTCSGRLPGSAR 1562
Query: 162 KLESCYEKCL 171
E C++ CL
Sbjct: 1563 VCEDCHDSCL 1572

>UniRef50_Q22DL4 Cluster: Putative uncharacterized protein; n=1; Tetrahymena thermophila SB210|Rep: Putative uncharacterized protein - Tetrahymena thermophila SB210
Length = 895
Score = 34.7 bits (76), Expect = 2.1
Identities = 17/62 (27%), Positives = 30/62 (48%)
Query: 20 EEKRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDLKEFEKIKINTEKNKSISER 79
E + K+ K + DP N+ G+ KK D+ E E+ +IN+E+N ++
Sbjct: 578 ERQEKLEKMKNLKKRMKYDPRKAIQNEKNGVKDKKDDNDENDETEENRINSEENDEDDDQ 637
Query: 80 VS 81
V+
Sbjct: 638 VN 639

>UniRef50_Q22751 Cluster: Putative uncharacterized protein dnj-23; n=2; Caenorhabditis|Rep: Putative uncharacterized protein dnj-23 - Caenorhabditis elegans
Length = 242
Score = 34.7 bits (76), Expect = 2.1
Identities = 26/93 (27%), Positives = 45/93 (48%), Gaps = 2/93 (2%)
Query: 4 STKWEIEKNRILKLSLEEKRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDLKEF 63
+TK+++ LS EEKRKIY + +D + ++ K+ + KK T +D+ F
Sbjct: 57 TTKFQLLNKAYQILSDEEKRKIYDETGSVD-DEAGELNEDALKAWRMIFKKVTKEDIDSF 115
Query: 64
EK-IKINTEKNKSISERVSIFKGDITKLEIDAV 95 K + + E+ + F GDI K+ A+ Sbjct: 116 MKTYQGSREQKDELVVHYEKFNGDIAKIREYAI 148 >UniRef50_Q4A7Z9 Cluster: ABC transporter permease protein; n=5; Mycoplasma hyopneumoniae|Rep: ABC transporter permease protein - Mycoplasma hyopneumoniae (strain 7448) Length = 725 Score = 34.3 bits (75), Expect = 2.8 Identities = 30/107 (28%), Positives = 49/107 (45%), Gaps = 7/107 (6%) Query: 12 NRILKLSLEEKRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDLKEFEKIKINTE 71 N +L SL++K K YK+ L+ W K L ++ +K + +LKE+++ K Sbjct: 422 NLLLLKSLKQKIKSYKAQT---LKRFLEWEKNLISKFSLNIEKLSETELKEYQEYK---S 475 Query: 72 KNKSISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAG 118 KN SI E ++ T +++ N A K GG + A G Sbjct: 476 KNISIKEAINQAVLQ-TAEKVEITKNLAKKPTKLSGGQQQRVAIARG 521 >UniRef50_A7S5A3 Cluster: Predicted protein; n=1; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 670 Score = 34.3 bits (75), Expect = 2.8 Identities = 20/70 (28%), Positives = 39/70 (55%), Gaps = 2/70 (2%) Query: 8 EIEKNRILKLSLEE-KRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDLKEFEKI 66 ++ K +++L E+ RK+Y SS ++ E + P +KY++ K ST + K+ + Sbjct: 45 KLVKKELIELRKEKYSRKLYASSRHVNDETLTPHTKYVDVEVSTAEKNSTEETGKDKDP- 103 Query: 67 KINTEKNKSI 76 K N +NK++ Sbjct: 104 KTNEPENKTL 113 >UniRef50_A7AQ69 Cluster: Isy1-like splicing family protein; n=1; Babesia bovis|Rep: Isy1-like splicing family protein - Babesia bovis Length = 228 Score = 34.3 bits (75), Expect = 2.8 Identities = 30/109 (27%), Positives = 42/109 (38%), Gaps = 6/109 (5%) Query: 6 KWEIEKNRILKLSLEEKRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDLKEFEK 65 KW K+ + + RK +S+ D + W L K I + L EF Sbjct: 14 KWLRIKSGLAAHDTQLTRKPRHTSEVTDYRTAEHWRNLLVKDVMISISRIQNASLGEFAI 73 Query: 66 IKINTEKNKSI------SERVSIFKGDITKLEIDAVVNAANSRLKAGGG 108 +N E N+ I ERV G + A+ NA + LK GGG Sbjct: 74 RDLNDEINRLIGLRKRWDERVIELGGPDQRALSSAIENAHGAELKIGGG 122 >UniRef50_A0D3I1 Cluster: Chromosome undetermined scaffold_36, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined 
scaffold_36, whole genome shotgun sequence - Paramecium tetraurelia Length = 1351 Score = 34.3 bits (75), Expect = 2.8 Identities = 31/111 (27%), Positives = 51/111 (45%), Gaps = 6/111 (5%) Query: 16 KLSLEEKRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDLKEFEKIKINTEKNKS 75 K LE K+K + D I+L+N+ + ++ + Q + ++S D +K E K EK +S Sbjct: 1110 KQCLENKQKFEQQIDEINLKNILKNNDFIKQIQQL-QQQSQDDQVKLLELKKQLEEKEES 1168 Query: 76 ISERVSIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRAAGPFLQAECD 126 I E K +EI + N +RLK V + + + Q E D Sbjct: 1169 IKE-----KDGKHAIEIQLITNNYVNRLKDKDDVIQNLQQEIQSYQQVELD 1214 >UniRef50_A0BUU6 Cluster: Chromosome undetermined scaffold_13, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_13, whole genome shotgun sequence - Paramecium tetraurelia Length = 1010 Score = 34.3 bits (75), Expect = 2.8 Identities = 28/89 (31%), Positives = 47/89 (52%), Gaps = 3/89 (3%) Query: 6 KWEIEKNRILKLSLEEKRKIYKSSDFI-DLENVDPWSKYLNKSQGIDSKKSTTDDLKEFE 64 K ++E + +LK EEKRK ++ D + DL + K + I+S K +LK + Sbjct: 190 KQQLEIDDLLKKIEEEKRKSKEAQDRLQDLMKQNFDQKLQSLQNEINSLKQEVTNLKN-Q 248 Query: 65 KIKINTEKNKSISERVSIFKGDITKLEID 93 K + T+ N ++S+ V+ K I KL +D Sbjct: 249 KDDL-TKHNHNLSDEVNQLKDQIAKLTLD 276 >UniRef50_UPI0000ED8E89 Cluster: hypothetical protein CdifQ_04003614; n=1; Clostridium difficile QCD-32g58|Rep: hypothetical protein CdifQ_04003614 - Clostridium difficile QCD-32g58 Length = 1451 Score = 33.9 bits (74), Expect = 3.7 Identities = 25/85 (29%), Positives = 43/85 (50%), Gaps = 1/85 (1%) Query: 11 KNRILKLSLEEKRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDLKEFEKIKINT 70 K I K+ EEK+ I KS + ID+ + S+ ++ ID +S + K +E IK+ Sbjct: 881 KTSIEKIE-EEKKVIKKSIEDIDVNIFNLNSEKDRINRHIDDTESKINKFKIYEPIKLEN 939 Query: 71 EKNKSISERVSIFKGDITKLEIDAV 95 K + + + + D TK EI+ + Sbjct: 940 AKLEELEIKYKKYLEDPTKKEIETL 964 >UniRef50_UPI00006CD9EF Cluster: hypothetical protein TTHERM_00399290; n=3; Tetrahymena thermophila SB210|Rep: hypothetical protein TTHERM_00399290 - Tetrahymena thermophila SB210 Length = 
793 Score = 33.9 bits (74), Expect = 3.7 Identities = 26/67 (38%), Positives = 38/67 (56%), Gaps = 4/67 (5%) Query: 15 LKLSLEEKRKIYKSSDFIDLENV--DPWSKYLNKSQGIDSKKSTTDDLKEFEKIKINTEK 72 LKL+LE + Y + D LEN D K K Q D ++S TD+ + ++IK+N EK Sbjct: 334 LKLNLELINQ-YFNYDETSLENNQNDQEDKSSQKQQQFDLQQSLTDEQYQ-DEIKVNEEK 391 Query: 73 NKSISER 79 KS+ +R Sbjct: 392 FKSLRKR 398 >UniRef50_Q897A5 Cluster: Conserved protein; n=1; Clostridium tetani|Rep: Conserved protein - Clostridium tetani Length = 571 Score = 33.9 bits (74), Expect = 3.7 Identities = 25/71 (35%), Positives = 42/71 (59%), Gaps = 8/71 (11%) Query: 18 SLEEKRKIYKSSD--FIDLENVDPWSKYLNKSQGIDSKKS--TTDDLKEFEKIKINT--E 71 +L+E + +K D FIDL+N D W K+ + S I+SK + K+ KIK+++ E Sbjct: 472 NLKEIVEFFKEQDVEFIDLKNEDNWVKWEDIS--IESKNGDIKVNFPKDKYKIKVDSSKE 529 Query: 72 KNKSISERVSI 82 KNKS ++++ Sbjct: 530 KNKSFISKINV 540 >UniRef50_Q31C98 Cluster: Putative uncharacterized protein precursor; n=5; Prochlorococcus marinus|Rep: Putative uncharacterized protein precursor - Prochlorococcus marinus (strain MIT 9312) Length = 206 Score = 33.9 bits (74), Expect = 3.7 Identities = 19/75 (25%), Positives = 38/75 (50%), Gaps = 2/75 (2%) Query: 2 VNSTKWEIEKNRILKLSLEEKRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDLK 61 + +K +E +I + + E+++KI K ++ + ++ K K + I+ KS ++ K Sbjct: 72 IEKSKSVLENKKINEKNNEKRKKIEKPKSVLENKKIN--EKNNEKRKKIEKSKSVLENKK 129 Query: 62 EFEKIKINTEKNKSI 76 E KI +KN I Sbjct: 130 EINSEKIQKQKNNKI 144 >UniRef50_Q8LB56 Cluster: Nuclear RNA binding protein A-like protein; n=6; core eudicotyledons|Rep: Nuclear RNA binding protein A-like protein - Arabidopsis thaliana (Mouse-ear cress) Length = 360 Score = 33.9 bits (74), Expect = 3.7 Identities = 23/61 (37%), Positives = 36/61 (59%), Gaps = 7/61 (11%) Query: 19 LEEKRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDLKEFEKIKINTEKNKSISE 78 LEEK+K +++ ++ VD +K Q + SKKS D++ IK+ TEK+K I+E Sbjct: 225 LEEKKKALQATK-VEERKVD--TKAFEAMQQLSSKKSNNDEVF----IKLGTEKDKRITE 277 Query: 79 R 79 R Sbjct: 278 R 278 >UniRef50_Q8ILK6 Cluster: Putative 
uncharacterized protein; n=2; Plasmodium|Rep: Putative uncharacterized protein - Plasmodium falciparum (isolate 3D7) Length = 359 Score = 33.9 bits (74), Expect = 3.7 Identities = 24/76 (31%), Positives = 41/76 (53%), Gaps = 2/76 (2%) Query: 17 LSLEEKRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDLKEFEKIKINTEKNKSI 76 L L+E+++IY +D +N + NK Q I++KK +D K+ + K NT +N+ Sbjct: 31 LFLKEEKEIYTYKK-LDEQNKEKECND-NKDQEINNKKKKINDNKKEDMDKQNTTQNEEK 88 Query: 77 SERVSIFKGDITKLEI 92 + S+F I +EI Sbjct: 89 KDEDSVFFKRIINVEI 104 >UniRef50_Q4XYB9 Cluster: Putative uncharacterized protein; n=4; Plasmodium (Vinckeia)|Rep: Putative uncharacterized protein - Plasmodium chabaudi Length = 320 Score = 33.9 bits (74), Expect = 3.7 Identities = 26/79 (32%), Positives = 37/79 (46%), Gaps = 1/79 (1%) Query: 10 EKNRILKLSLEEKRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDLKEFEKIKI- 68 EK I K +K+K K+ D + EN D S+ + S+ DD K+ EKI I Sbjct: 6 EKTEIKKGDKVKKKKNKKNIDIKNGENNDEKSRLQKYMDELWGFSSSEDDDKKHEKINIT 65 Query: 69 NTEKNKSISERVSIFKGDI 87 + EK K S+ I K + Sbjct: 66 DLEKEKEYSDNDPILKNKL 84 >UniRef50_A2EMF2 Cluster: Putative uncharacterized protein; n=1; Trichomonas vaginalis G3|Rep: Putative uncharacterized protein - Trichomonas vaginalis G3 Length = 266 Score = 33.9 bits (74), Expect = 3.7 Identities = 22/66 (33%), Positives = 34/66 (51%), Gaps = 3/66 (4%) Query: 12 NRILKLSLEEKRKIYKSSDFIDLENVDPWSKYL-NKSQGIDSKKSTTDDLKEFEKIKINT 70 NR K L + I+ D+ NV+ S +L K GI K+ D+LKEFE + ++ Sbjct: 71 NRYTKSYLSPVKCIFDDFHVTDVNNVEDISSFLFYKEYGIKLYKN--DNLKEFESVNLDI 128 Query: 71 EKNKSI 76 + K+I Sbjct: 129 QTEKAI 134 >UniRef50_A2DDP1 Cluster: Viral A-type inclusion protein, putative; n=1; Trichomonas vaginalis G3|Rep: Viral A-type inclusion protein, putative - Trichomonas vaginalis G3 Length = 573 Score = 33.9 bits (74), Expect = 3.7 Identities = 26/105 (24%), Positives = 52/105 (49%), Gaps = 5/105 (4%) Query: 2 VNSTKWEIEK--NRILKLSLEEKRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDD 59 +N K + E+ I ++ E++ + D+ ++EN+D SK + + + ++ + Sbjct: 16 
INELKKQNEELLQEIEEIKQEDEEDRNQMHDY-EIENIDLRSKVSDYQNELSNLENLINS 74 Query: 60 LKEFEKIKINTEKNKSISERVSIFKGDITKLEIDAVVNAANSRLK 104 LK EKI + E NK + ++ FK D + E + + N R+K Sbjct: 75 LKS-EKINLEVE-NKDLMSQLERFKQDYSDYEESILESDENKRIK 117 >UniRef50_A0MV34 Cluster: Ventral nervous system defective 2; n=1; Acropora millepora|Rep: Ventral nervous system defective 2 - Acropora millepora (Coral) Length = 207 Score = 33.9 bits (74), Expect = 3.7 Identities = 22/76 (28%), Positives = 35/76 (46%) Query: 16 KLSLEEKRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDDLKEFEKIKINTEKNKS 75 ++SL+E+R + S D + D ++ + KS G+ STT + K ++E NK Sbjct: 27 QMSLQERRSLLICSPSSDEQEEDSSTQEIAKSSGLQVLSSTTSSAQLETSKKEHSESNKK 86 Query: 76 ISERVSIFKGDITKLE 91 RV K LE Sbjct: 87 RKRRVLFTKAQTFVLE 102 >UniRef50_A0C1X3 Cluster: Chromosome undetermined scaffold_143, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_143, whole genome shotgun sequence - Paramecium tetraurelia Length = 624 Score = 33.9 bits (74), Expect = 3.7 Identities = 16/43 (37%), Positives = 23/43 (53%) Query: 163 LESCYEKCLSFQQEYQIKSIAFPCISTGIYGFPNRLAAHIALR 205 +E + +E IK IAFP IS I+GF +A+ I L+ Sbjct: 277 IEQLIQNIFQLAKEKNIKQIAFPVISVEIFGFYMNMASQILLK 319 >UniRef50_UPI0000F2C318 Cluster: PREDICTED: similar to RIKEN cDNA 2610034M16 gene; n=3; Tetrapoda|Rep: PREDICTED: similar to RIKEN cDNA 2610034M16 gene - Monodelphis domestica Length = 1383 Score = 33.5 bits (73), Expect = 4.9 Identities = 18/57 (31%), Positives = 35/57 (61%), Gaps = 3/57 (5%) Query: 2 VNSTK-WEIEKNRILKLSLEEKRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTT 57 +NS + W++E N+++KLS E + ++S+ E +D W+K + Q +SKK ++ Sbjct: 921 LNSERDWKLEMNKLIKLSSEFPSRDSRASNSSQEEAIDQWAK--RRKQFKESKKCSS 975 >UniRef50_A1L230 Cluster: Zgc:158614; n=2; Danio rerio|Rep: Zgc:158614 - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 455 Score = 33.5 bits (73), Expect = 4.9 Identities = 17/65 (26%), Positives = 39/65 (60%), Gaps = 3/65 (4%) Query: 9 
IEKNRILK-LSLEEKRKIYKSSDFIDLENVDPWSKYLNKSQGIDSKKSTTDD--LKEFEK 65 ++K ++ K +++E ++ + + SDF+ ++ W+K KS D+K T D L+ +++ Sbjct: 156 LDKTKLSKAMNIEIEKVLLRQSDFLQQYGIEVWTKKEVKSVDTDAKTVTFQDGTLQNYDQ 215 Query: 66 IKINT 70 + I+T Sbjct: 216 LLIST 220 >UniRef50_Q982Q7 Cluster: Mlr8538 protein; n=2; Mesorhizobium loti|Rep: Mlr8538 protein - Rhizobium loti (Mesorhizobium loti) Length = 985 Score = 33.5 bits (73), Expect = 4.9 Identities = 24/85 (28%), Positives = 37/85 (43%), Gaps = 1/85 (1%) Query: 94 AVVNAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKVTGGYNLPAKYIIHTV 153 A N+ LK G+ G + RAAGP++ A +I G A + +L + Sbjct: 191 AAQGGLNALLKDSVGILGGVARAAGPWI-AALAAIYGAYRLIASFSAEASLGVDSATRAL 249 Query: 154 GPQDGSAEKLESCYEKCLSFQQEYQ 178 Q S E ++ + +S Q EYQ Sbjct: 250 AAQASSVESIDGKIKDLVSIQSEYQ 274 >UniRef50_Q892P8 Cluster: Lipoate-protein ligase A; n=2; Clostridia|Rep: Lipoate-protein ligase A - Clostridium tetani Length = 332 Score = 33.5 bits (73), Expect = 4.9 Identities = 24/82 (29%), Positives = 44/82 (53%), Gaps = 8/82 (9%) Query: 20 EEKRKIYKSSDFIDLENVDPWSKYLN------KSQGIDSKKSTTDDLKEFEKIKINTEKN 73 +E + + + +D+ NVD +YLN KS+GIDS +S +LKE K + Sbjct: 142 DEGKAYHHGTILVDV-NVDKLQRYLNVSSDKIKSKGIDSVRSRVINLKELHKDLTIDKIC 200 Query: 74 KSISERVS-IFKGDITKLEIDA 94 K++++ S I+ G++ L + + Sbjct: 201 KAMTKSFSRIYHGELNNLHVSS 222 >UniRef50_Q2GBI0 Cluster: TonB-dependent receptor precursor; n=1; Novosphingobium aromaticivorans DSM 12444|Rep: TonB-dependent receptor precursor - Novosphingobium aromaticivorans (strain DSM 12444) Length = 678 Score = 33.5 bits (73), Expect = 4.9 Identities = 30/127 (23%), Positives = 55/127 (43%), Gaps = 9/127 (7%) Query: 40 WSKYLNKSQGIDSKKSTTDDLKEFEK---IKINTEKNKSISERVSIFKGDITKLEIDAVV 96 WS + N + T+ +K+F++ + ++ +K+ I++ +++ G T+ DAV Sbjct: 310 WSMFSNPTY--TDPDGTSAQIKQFDRRWVLGLSAQKHWEIADSLAVSLG--TENRYDAVG 365 Query: 97 NAANSRLKAGGGVDGAIHRAAGPFLQAECDSIGGCPTGDAKVTGGYNLPAKYIIHTVGPQ 156 N R A ++ H G A + P +VTGG L Y ++V + Sbjct: 366 NVGVDRTAARAFLESLGHFRVGELSSALYGEVAWKPLAGLRVTGG--LRGDYYHYSVRAR 423 
Query: 157 DGSAEKL 163 D A L Sbjct: 424 DSVAASL 430

  Database: uniref50
    Posted date:  Oct 5, 2007 11:19 AM
  Number of letters in database: 575,637,011
  Number of sequences in database:  1,657,284

Lambda     K      H
   0.317    0.135    0.392

Gapped
Lambda     K      H
   0.279   0.0580    0.190

Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 270,321,251
Number of Sequences: 1657284
Number of extensions: 11280350
Number of successful extensions: 37708
Number of sequences better than 10.0: 282
Number of HSP's better than 10.0 without gapping: 191
Number of HSP's successfully gapped in prelim test: 91
Number of HSP's that attempted gapping in prelim test: 37118
Number of HSP's gapped (non-prelim): 353
length of query: 244
length of database: 575,637,011
effective HSP length: 99
effective length of query: 145
effective length of database: 411,565,895
effective search space: 59677054775
effective search space used: 59677054775
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.6 bits)
S2: 71 (32.7 bits)
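
The footer statistics are enough to recompute the bit scores and Expect values reported for each hit. The sketch below is not part of the BLAST output; it assumes the standard Karlin-Altschul relations (bit score S' = (lambda * S - ln K) / ln 2 and E = search space * 2^(-S')) and uses the gapped Lambda and K printed above. The function and constant names are illustrative only.

```python
import math

# Values copied from the statistics footer of this report.
GAPPED_LAMBDA = 0.279
GAPPED_K = 0.0580
QUERY_LENGTH = 244
DB_LETTERS = 575_637_011
DB_SEQUENCES = 1_657_284
EFFECTIVE_HSP_LENGTH = 99


def effective_search_space() -> int:
    """Effective query length times effective database length."""
    eff_query = QUERY_LENGTH - EFFECTIVE_HSP_LENGTH
    # Every database sequence is shortened by the effective HSP length.
    eff_db = DB_LETTERS - DB_SEQUENCES * EFFECTIVE_HSP_LENGTH
    return eff_query * eff_db


def bit_score(raw_score: int) -> float:
    """Convert a raw alignment score to a bit score (gapped parameters)."""
    return (GAPPED_LAMBDA * raw_score - math.log(GAPPED_K)) / math.log(2)


def expect(bits: float, search_space: int) -> float:
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


if __name__ == "__main__":
    space = effective_search_space()
    print("effective search space:", space)
    for raw in (77, 76, 75, 74, 73):
        bits = bit_score(raw)
        print(f"raw {raw}: {bits:.1f} bits, Expect = {expect(bits, space):.1f}")
```

Under these assumptions the script should reproduce the footer's effective search space (59677054775) and the per-hit lines above, for example "Score = 35.1 bits (77), Expect = 1.6".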