BLASTX 2.2.12 [Aug-07-2005]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= pg--0033.Seq
         (615 letters)

Database: uniref50
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

UniRef50_P75917 Cluster: Uncharacterized protein ymdA precursor;...   124   1e-27
UniRef50_A4R3Q9 Cluster: Putative uncharacterized protein; n=1; ...    69   7e-11
UniRef50_Q87JZ5 Cluster: UPF0189 protein VPA0103; n=5; cellular ...    62   8e-09
UniRef50_P67341 Cluster: UPF0189 protein ymdB; n=11; Bacteria|Re...    62   1e-08
UniRef50_UPI0000E4815A Cluster: PREDICTED: similar to LRP16 prot...    62   1e-08
UniRef50_Q9HXU7 Cluster: UPF0189 protein PA3693; n=13; Bacteria|...    61   2e-08
UniRef50_Q8PHB6 Cluster: UPF0189 protein XAC3343; n=9; Proteobac...    60   4e-08
UniRef50_Q88SK6 Cluster: UPF0189 protein lp_3408; n=13; cellular...    59   7e-08
UniRef50_Q01WP7 Cluster: Appr-1-p processing domain protein; n=1...    59   9e-08
UniRef50_Q8EYT0 Cluster: UPF0189 protein LA_4133; n=11; cellular...    59   9e-08
UniRef50_Q2GZS3 Cluster: Putative uncharacterized protein; n=1; ...    58   1e-07
UniRef50_Q9ZBG3 Cluster: UPF0189 protein SCO6450; n=4; Actinomyc...    58   2e-07
UniRef50_A6S485 Cluster: Putative uncharacterized protein; n=1; ...    58   2e-07
UniRef50_Q926Y8 Cluster: UPF0189 protein lin2902; n=14; Firmicut...    58   2e-07
UniRef50_Q4P1I0 Cluster: Putative uncharacterized protein; n=1; ...    57   3e-07
UniRef50_Q0UQZ6 Cluster: Predicted protein; n=1; Phaeosphaeria n...    57   3e-07
UniRef50_Q1R0S7 Cluster: Appr-1-p processing; n=1; Chromohalobac...    57   4e-07
UniRef50_A5WHZ6 Cluster: Appr-1-p processing domain protein; n=2...    57   4e-07
UniRef50_A1IFK2 Cluster: Putative uncharacterized protein; n=1; ...    57   4e-07
UniRef50_Q5KCD7 Cluster: Putative uncharacterized protein; n=2; ...    57   4e-07
UniRef50_Q4WYQ2 Cluster: LRP16 family protein; n=8; cellular org...    57   4e-07
UniRef50_A1Z1Q3 Cluster: MACRO domain-containing protein 2; n=41...    56   5e-07
UniRef50_Q9BQ69 Cluster: MACRO domain-containing protein 1; n=18...    56   5e-07
UniRef50_A6GJ81 Cluster: Putative uncharacterized protein; n=1; ...    56   7e-07
UniRef50_Q17432 Cluster: Putative uncharacterized protein; n=2; ...    56   7e-07
UniRef50_Q97AU0 Cluster: UPF0189 protein TV0719; n=1; Thermoplas...    56   7e-07
UniRef50_Q8KAE4 Cluster: UPF0189 protein CT2219; n=24; cellular ...    56   7e-07
UniRef50_A6NXN8 Cluster: Putative uncharacterized protein; n=1; ...    56   9e-07
UniRef50_Q985D2 Cluster: UPF0189 protein mll7730; n=54; cellular...    56   9e-07
UniRef50_Q0LI88 Cluster: Appr-1-p processing; n=2; cellular orga...    55   1e-06
UniRef50_Q8EP31 Cluster: Hypothetical conserved protein; n=1; Oc...    55   2e-06
UniRef50_Q6PHJ5 Cluster: Zgc:65960; n=5; cellular organisms|Rep:...    54   2e-06
UniRef50_Q8TQD0 Cluster: UPF0189 protein MA_1614; n=4; cellular ...    54   3e-06
UniRef50_Q47EQ7 Cluster: Appr-1-p processing; n=1; Dechloromonas...    54   4e-06
UniRef50_P0A1T0 Cluster: Uncharacterized protein ymdA precursor;...    54   4e-06
UniRef50_Q66HV6 Cluster: Zgc:92353; n=1; Danio rerio|Rep: Zgc:92...    53   5e-06
UniRef50_Q1HPZ5 Cluster: LRP16 protein; n=1; Bombyx mori|Rep: LR...    53   5e-06
UniRef50_Q0CQJ0 Cluster: Protein LRP16; n=5; cellular organisms|...    53   5e-06
UniRef50_UPI000049917F Cluster: conserved hypothetical protein; ...    53   6e-06
UniRef50_A1G783 Cluster: Appr-1-p processing; n=1; Salinispora a...    53   6e-06
UniRef50_A7RJ44 Cluster: Predicted protein; n=3; Eukaryota|Rep: ...    53   6e-06
UniRef50_UPI000023F24A Cluster: hypothetical protein FG04179.1; ...    52   1e-05
UniRef50_A6LTB5 Cluster: Appr-1-p processing domain protein; n=1...    52   1e-05
UniRef50_Q8RB30 Cluster: UPF0189 protein TTE0995; n=20; Bacteria...    52   1e-05
UniRef50_Q6AKL0 Cluster: Putative uncharacterized protein; n=1; ...    52   1e-05
UniRef50_Q5DCZ3 Cluster: SJCHGC06209 protein; n=1; Schistosoma j...    51   2e-05
UniRef50_A3ZLZ3 Cluster: Putative uncharacterized protein; n=2; ...    51   3e-05
UniRef50_Q6AAQ5 Cluster: Conserved protein; n=2; Bacteria|Rep: C...    50   3e-05
UniRef50_Q0B030 Cluster: Phosphatase; n=1; Syntrophomonas wolfei...    50   4e-05
UniRef50_Q9HJ67 Cluster: UPF0189 protein Ta1105; n=2; Thermoplas...    50   6e-05
UniRef50_Q1K4D1 Cluster: Appr-1-p processing; n=1; Desulfuromona...    49   1e-04
UniRef50_Q30ZH6 Cluster: Appr-1-p processing; n=1; Desulfovibrio...    47   3e-04
UniRef50_Q8B4N1 Cluster: ORF-1; n=8; root|Rep: ORF-1 - Rock brea...    46   5e-04
UniRef50_A7EET2 Cluster: Putative uncharacterized protein; n=1; ...    46   7e-04
UniRef50_Q5R014 Cluster: Predicted phosphatase; n=6; Bacteria|Re...    46   0.001
UniRef50_A6SR30 Cluster: Putative uncharacterized protein; n=1; ...    45   0.001
UniRef50_UPI00015C5846 Cluster: hypothetical protein CKO_02023; ...    45   0.002
UniRef50_A4W959 Cluster: Putative uncharacterized protein; n=1; ...    45   0.002
UniRef50_A0LGZ1 Cluster: Appr-1-p processing domain protein; n=1...    45   0.002
UniRef50_O22875 Cluster: Expressed protein; n=7; Magnoliophyta|R...    44   0.002
UniRef50_Q9WYX8 Cluster: UPF0189 protein TM_0508; n=4; Thermotog...    44   0.002
UniRef50_A1D5K4 Cluster: Appr-1-p processing enzyme family prote...    44   0.003
UniRef50_Q93SX7 Cluster: UPF0189 protein; n=1; Acinetobacter sp....    44   0.003
UniRef50_A5V0Y4 Cluster: Appr-1-p processing domain protein; n=5...    44   0.004
UniRef50_A6F1P7 Cluster: Appr-1-p processing; n=1; Marinobacter ...    43   0.005
UniRef50_Q4DSL4 Cluster: Putative uncharacterized protein; n=3; ...    43   0.007
UniRef50_A0UYE8 Cluster: Appr-1-p processing; n=3; Bacteria|Rep:...    42   0.009
UniRef50_A3BF04 Cluster: Putative uncharacterized protein; n=1; ...    42   0.012
UniRef50_UPI0000498318 Cluster: conserved hypothetical protein; ...    42   0.015
UniRef50_A5D049 Cluster: Predicted phosphatase; n=3; Bacteria|Re...    42   0.015
UniRef50_O07733 Cluster: UPF0189 protein Rv1899c/MT1950; n=9; My...    42   0.015
UniRef50_A7BY23 Cluster: Putative uncharacterized protein; n=1; ...    41   0.020
UniRef50_A5TRW5 Cluster: Putative uncharacterized protein; n=1; ...    40   0.036
UniRef50_Q8ZXT3 Cluster: UPF0189 protein PAE1111; n=8; Thermopro...    40   0.036
UniRef50_A0H6G6 Cluster: Appr-1-p processing; n=1; Chloroflexus ...    40   0.047
UniRef50_A5ZAB5 Cluster: Putative uncharacterized protein; n=1; ...    40   0.062
UniRef50_Q22CT8 Cluster: Appr-1-p processing enzyme family prote...    40   0.062
UniRef50_A7T167 Cluster: Predicted protein; n=1; Nematostella ve...    40   0.062
UniRef50_A6BCW6 Cluster: Putative uncharacterized protein; n=2; ...    39   0.082
UniRef50_A0CX06 Cluster: Chromosome undetermined scaffold_3, who...    39   0.082
UniRef50_Q5V4P3 Cluster: Putative uncharacterized protein; n=2; ...    39   0.082
UniRef50_A7HJC7 Cluster: Appr-1-p processing domain protein; n=1...    39   0.11
UniRef50_UPI0000F1EDA9 Cluster: PREDICTED: similar to Poly [ADP-...    38   0.19
UniRef50_Q6D5N1 Cluster: Putative exported protein; n=1; Pectoba...    38   0.19
UniRef50_A6PEZ6 Cluster: Appr-1-p processing domain protein; n=1...    38   0.19
UniRef50_A7S3X0 Cluster: Predicted protein; n=1; Nematostella ve...    38   0.19
UniRef50_A2DTG7 Cluster: Appr-1-p processing enzyme family prote...    38   0.19
UniRef50_A3DLM0 Cluster: Appr-1-p processing domain protein; n=1...    38   0.19
UniRef50_A7B8S3 Cluster: Putative uncharacterized protein; n=1; ...    38   0.25
UniRef50_P62605 Cluster: Type-1 fimbrial protein, C chain precur...    38   0.25
UniRef50_Q460N5 Cluster: Poly [ADP-ribose] polymerase 14; n=23; ...    37   0.33
UniRef50_Q0UG78 Cluster: Putative uncharacterized protein; n=1; ...    37   0.44
UniRef50_Q9YBE9 Cluster: UPF0189 protein APE_1648.1; n=1; Aeropy...    37   0.44
UniRef50_Q8ZJT7 Cluster: Putative periplasmic protein; n=3; Salm...    36   0.58
UniRef50_UPI0000498CB9 Cluster: conserved hypothetical protein; ...    36   0.77
UniRef50_Q2V9U1 Cluster: Nonstructural protein 3; n=38; Eastern ...    36   0.77
UniRef50_Q03IQ8 Cluster: Predicted phosphatase homologous to the...    36   0.77
UniRef50_A0X2G8 Cluster: Appr-1-p processing domain protein; n=1...    36   0.77
UniRef50_Q5XC09 Cluster: UPF0189 protein M6_Spy0919; n=19; Strep...    36   1.0
UniRef50_UPI0000F2CC14 Cluster: PREDICTED: similar to Poly [ADP-...    35   1.3
UniRef50_Q5KUT6 Cluster: Hypothetical conserved protein; n=2; Ge...    35   1.3
UniRef50_A2FMC7 Cluster: Appr-1-p processing enzyme family prote...    35   1.3
UniRef50_A0CX10 Cluster: Chromosome undetermined scaffold_3, who...    35   1.3
UniRef50_UPI00006A2284 Cluster: UPI00006A2284 related cluster; n...    35   1.8
UniRef50_Q6ZKH7 Cluster: Putative uncharacterized protein OJ1119...    35   1.8
UniRef50_A2BJA7 Cluster: A1pp, Appr-1-p processing enzyme; n=1; ...    35   1.8
UniRef50_Q9JMS9 Cluster: Uncharacterized protein yuaK; n=1; Esch...    35   1.8
UniRef50_P87515 Cluster: Non-structural polyprotein (Polyprotein...    35   1.8
UniRef50_Q4RG95 Cluster: Chromosome 12 SCAF15104, whole genome s...    34   2.3
UniRef50_Q9WJC8 Cluster: Nonstructural polyprotein; n=12; Venezu...    34   2.3
UniRef50_Q8IXQ6 Cluster: Poly [ADP-ribose] polymerase 9; n=26; E...    34   2.3
UniRef50_UPI0000660C67 Cluster: Homolog of Oncorhynchus mykiss "...    34   3.1
UniRef50_UPI0000F3214F Cluster: UPI0000F3214F related cluster; n...    34   3.1
UniRef50_Q6NRC6 Cluster: MGC83934 protein; n=2; Xenopus|Rep: MGC...    34   3.1
UniRef50_Q4SK43 Cluster: Chromosome 2 SCAF14570, whole genome sh...    34   3.1
UniRef50_Q5PDJ6 Cluster: Fimbrial subunit; n=4; Salmonella|Rep: ...    34   3.1
UniRef50_A5W6W0 Cluster: Putative uncharacterized protein precur...    34   3.1
UniRef50_A0IIY3 Cluster: Putative uncharacterized protein precur...    34   3.1
UniRef50_P55223 Cluster: Fimbrial subunit type 1 precursor; n=37...    34   3.1
UniRef50_Q8X3Q9 Cluster: Putative IS encoded protein; n=2; Esche...    33   4.1
UniRef50_O28751 Cluster: UPF0189 protein AF_1521; n=25; Euryarch...    33   4.1
UniRef50_UPI0000F2CC13 Cluster: PREDICTED: similar to B aggressi...    33   5.4
UniRef50_UPI0000E48437 Cluster: PREDICTED: similar to slowpoke b...    33   5.4
UniRef50_UPI0000E46337 Cluster: PREDICTED: similar to TRAAK; n=2...    33   5.4
UniRef50_Q6ZED8 Cluster: Slr7060 protein; n=1; Synechocystis sp....    33   5.4
UniRef50_Q38Y93 Cluster: Hypothetical cell surface protein; n=1;...    33   5.4
UniRef50_UPI00006A1CA6 Cluster: poly (ADP-ribose) polymerase fam...    33   7.1
UniRef50_Q327R1 Cluster: Putative uncharacterized protein; n=1; ...    33   7.1
UniRef50_Q7WTH2 Cluster: Putative uncharacterized protein; n=1; ...    33   7.1
UniRef50_P67344 Cluster: UPF0189 protein SA0314; n=13; Staphyloc...    33   7.1
UniRef50_Q4V2H0 Cluster: Putative uncharacterized protein; n=13;...    32   9.4
UniRef50_A6PBP5 Cluster: Appr-1-p processing domain protein; n=1...    32   9.4
UniRef50_A1HMQ5 Cluster: Appr-1-p processing domain protein; n=4...    32   9.4
UniRef50_Q0CEI7 Cluster: Putative uncharacterized protein; n=1; ...    32   9.4
UniRef50_O67112 Cluster: UPF0189 protein aq_987; n=3; cellular o...
                                                                       32   9.4

>UniRef50_P75917 Cluster: Uncharacterized protein ymdA precursor; n=19;
           Enterobacteriaceae|Rep: Uncharacterized protein ymdA precursor -
           Escherichia coli (strain K12)
          Length = 103

 Score = 124 bits (300), Expect = 1e-27
 Identities = 57/57 (100%), Positives = 57/57 (100%)
 Frame = +1

Query: 85  MFRPFLNSLMLGSLFFPFIAIAGSTVQGGVIHFYGQIVEPACDVSTQSSPVEMNCPQ 255
           MFRPFLNSLMLGSLFFPFIAIAGSTVQGGVIHFYGQIVEPACDVSTQSSPVEMNCPQ
Sbjct: 1   MFRPFLNSLMLGSLFFPFIAIAGSTVQGGVIHFYGQIVEPACDVSTQSSPVEMNCPQ 57

 Score = 98.3 bits (234), Expect = 1e-19
 Identities = 47/48 (97%), Positives = 47/48 (97%)
 Frame = +3

Query: 249 PTNGSIPGKTYSSKALMSGNVKNAQIASVKVQYLDKQKKLAVMNIEYN 392
           P NGSIPGKTYSSKALMSGNVKNAQIASVKVQYLDKQKKLAVMNIEYN
Sbjct: 56  PQNGSIPGKTYSSKALMSGNVKNAQIASVKVQYLDKQKKLAVMNIEYN 103

>UniRef50_A4R3Q9 Cluster: Putative uncharacterized protein; n=1;
           Magnaporthe grisea|Rep: Putative uncharacterized protein -
           Magnaporthe grisea (Rice blast fungus) (Pyricularia grisea)
          Length = 263

 Score = 69.3 bits (162), Expect = 7e-11
 Identities = 34/85 (40%), Positives = 49/85 (57%)
 Frame = +3

Query: 357 QKKLAVMNIEYN*VSEQLTLLSRKMRFNQKACCTLKKPEGVRRRYENAYSCVQGDITKLA 536
           + KL   N+ +    ++L   +  +   +   C L KP    +R+ +  +   GDITKL
Sbjct: 16  EAKLLSRNLTHLFQEKELKTQAEGLEMAKTVSCDLTKPPPPNKRFNDRIALYHGDITKLM 75

Query: 537 VDVIVNAANPSLMGSGGVDGAIHRA 611
           VD IVNAAN +L+G GGVDG+IHRA
Sbjct: 76  VDAIVNAANETLLGGGGVDGSIHRA 100

>UniRef50_Q87JZ5 Cluster: UPF0189 protein VPA0103; n=5; cellular
           organisms|Rep: UPF0189 protein VPA0103 - Vibrio parahaemolyticus
          Length = 170

 Score = 62.5 bits (145), Expect = 8e-09
 Identities = 29/39 (74%), Positives = 31/39 (79%)
 Frame = +3

Query: 495 NAYSCVQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611
           NA S VQGDIT   VD IVNAANP ++G GGVDGAIHRA
Sbjct: 2   NAISLVQGDITTAHVDAIVNAANPRMLGGGGVDGAIHRA 40

>UniRef50_P67341 Cluster: UPF0189 protein ymdB; n=11; Bacteria|Rep:
           UPF0189 protein ymdB - Salmonella typhimurium
          Length = 179

 Score = 62.1 bits (144), Expect = 1e-08
 Identities = 28/34 (82%), Positives = 31/34 (91%)
 Frame = +3

Query: 510 VQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611
           +QGDIT+L+VD IVNAAN SLMG GGVDGAIHRA
Sbjct: 8   IQGDITQLSVDAIVNAANASLMGGGGVDGAIHRA 41

>UniRef50_UPI0000E4815A Cluster: PREDICTED: similar to LRP16 protein; n=1;
           Strongylocentrotus purpuratus|Rep: PREDICTED: similar to LRP16
           protein - Strongylocentrotus purpuratus
          Length = 415

 Score = 61.7 bits (143), Expect = 1e-08
 Identities = 31/50 (62%), Positives = 33/50 (66%)
 Frame = +3

Query: 462 KKPEGVRRRYENAYSCVQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611
           K     +    N  S  QGDITKL VD IVNAAN SL+G GGVDGAIHRA
Sbjct: 150 KSTSAAKSDLNNRVSVWQGDITKLDVDCIVNAANRSLLGGGGVDGAIHRA 199

>UniRef50_Q9HXU7 Cluster: UPF0189 protein PA3693; n=13; Bacteria|Rep:
           UPF0189 protein PA3693 - Pseudomonas aeruginosa
          Length = 173

 Score = 61.3 bits (142), Expect = 2e-08
 Identities = 28/33 (84%), Positives = 30/33 (90%)
 Frame = +3

Query: 513 QGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611
           QGDIT+LAVD IVNAAN SL+G GGVDGAIHRA
Sbjct: 8   QGDITRLAVDAIVNAANSSLLGGGGVDGAIHRA 40

>UniRef50_Q8PHB6 Cluster: UPF0189 protein XAC3343; n=9; Proteobacteria|Rep:
           UPF0189 protein XAC3343 - Xanthomonas axonopodis pv. citri
          Length = 179

 Score = 60.1 bits (139), Expect = 4e-08
 Identities = 28/33 (84%), Positives = 30/33 (90%)
 Frame = +3

Query: 513 QGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611
           QGDIT+L VDVIVNAAN SL+G GGVDGAIHRA
Sbjct: 7   QGDITELDVDVIVNAANESLLGGGGVDGAIHRA 39

>UniRef50_Q88SK6 Cluster: UPF0189 protein lp_3408; n=13; cellular
           organisms|Rep: UPF0189 protein lp_3408 - Lactobacillus plantarum
          Length = 172

 Score = 59.3 bits (137), Expect = 7e-08
 Identities = 26/34 (76%), Positives = 29/34 (85%)
 Frame = +3

Query: 510 VQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611
           + GDITK+ VD IVNAAN SL+G GGVDGAIHRA
Sbjct: 7   IHGDITKMTVDAIVNAANTSLLGGGGVDGAIHRA 40

>UniRef50_Q01WP7 Cluster: Appr-1-p processing domain protein; n=1;
           Solibacter usitatus Ellin6076|Rep: Appr-1-p processing domain
           protein - Solibacter usitatus (strain Ellin6076)
          Length = 178

 Score = 58.8 bits (136), Expect = 9e-08
 Identities = 25/35 (71%), Positives = 31/35 (88%)
 Frame = +3

Query: 510 VQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRAG 614
           ++GDIT++AVDV+ NAAN +L G GGVDGAIHRAG
Sbjct: 14  IRGDITRIAVDVMANAANSALAGGGGVDGAIHRAG 48

>UniRef50_Q8EYT0 Cluster: UPF0189 protein LA_4133; n=11; cellular
           organisms|Rep: UPF0189 protein LA_4133 - Leptospira interrogans
          Length = 175

 Score = 58.8 bits (136), Expect = 9e-08
 Identities = 27/40 (67%), Positives = 31/40 (77%)
 Frame = +3

Query: 495 NAYSCVQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRAG 614
           N    ++ DIT+L VD IVNAAN SL+G GGVDGAIHRAG
Sbjct: 3   NKIKLIKEDITQLEVDAIVNAANSSLLGGGGVDGAIHRAG 42

>UniRef50_Q2GZS3 Cluster: Putative uncharacterized protein; n=1;
           Chaetomium globosum|Rep: Putative uncharacterized protein -
           Chaetomium globosum (Soil fungus)
          Length = 282

 Score = 58.4 bits (135), Expect = 1e-07
 Identities = 27/34 (79%), Positives = 30/34 (88%)
 Frame = +3

Query: 510 VQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611
           ++GDITKLAVD IVNAAN SL+G GGVD AIHRA
Sbjct: 57  IRGDITKLAVDAIVNAANRSLLGGGGVDEAIHRA 90

>UniRef50_Q9ZBG3 Cluster: UPF0189 protein SCO6450; n=4;
           Actinomycetales|Rep: UPF0189 protein SCO6450 - Streptomyces
           coelicolor
          Length = 169

 Score = 58.0 bits (134), Expect = 2e-07
 Identities = 26/35 (74%), Positives = 29/35 (82%)
 Frame = +3

Query: 510 VQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRAG 614
           VQGDIT+ + D IVNAAN SL+G GGVDGAIHR G
Sbjct: 7   VQGDITRQSADAIVNAANSSLLGGGGVDGAIHRRG 41

>UniRef50_A6S485 Cluster: Putative uncharacterized protein; n=1;
           Botryotinia fuckeliana B05.10|Rep: Putative uncharacterized
           protein - Botryotinia fuckeliana B05.10
          Length = 283

 Score = 57.6 bits (133), Expect = 2e-07
 Identities = 26/41 (63%), Positives = 31/41 (75%)
 Frame = +3

Query: 489 YENAYSCVQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611
           + +    ++GDIT L VD IVNAAN SL+G GGVDGAIHRA
Sbjct: 40  FNDRIGLIRGDITHLEVDAIVNAANNSLLGGGGVDGAIHRA 80

>UniRef50_Q926Y8 Cluster: UPF0189 protein lin2902; n=14; Firmicutes|Rep:
           UPF0189 protein lin2902 - Listeria innocua
          Length = 176

 Score = 57.6 bits (133), Expect = 2e-07
 Identities = 26/34 (76%), Positives = 30/34 (88%)
 Frame = +3

Query: 510 VQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611
           V+GDIT+  VDVIVNAANP L+G GGVDGAIH+A
Sbjct: 6   VKGDITEQNVDVIVNAANPGLLGGGGVDGAIHQA 39

>UniRef50_Q4P1I0 Cluster: Putative uncharacterized protein; n=1; Ustilago
           maydis|Rep: Putative uncharacterized protein - Ustilago maydis
           (Smut fungus)
          Length = 220

 Score = 57.2 bits (132), Expect = 3e-07
 Identities = 26/41 (63%), Positives = 31/41 (75%)
 Frame = +3

Query: 489 YENAYSCVQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611
           + +  S   GDIT L++D IVNAAN SL+G GGVDGAIHRA
Sbjct: 34  FSHLLSIFTGDITTLSIDAIVNAANNSLLGGGGVDGAIHRA 74

>UniRef50_Q0UQZ6 Cluster: Predicted protein; n=1; Phaeosphaeria
           nodorum|Rep: Predicted protein - Phaeosphaeria nodorum (Septoria
           nodorum)
          Length = 291

 Score = 57.2 bits (132), Expect = 3e-07
 Identities = 26/36 (72%), Positives = 30/36 (83%)
 Frame = +3

Query: 504 SCVQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611
           S ++ DIT LA+D IVNAAN SL+G GGVDGAIHRA
Sbjct: 42  SIIRRDITTLAIDAIVNAANTSLLGGGGVDGAIHRA 77

>UniRef50_Q1R0S7 Cluster: Appr-1-p processing; n=1; Chromohalobacter
           salexigens DSM 3043|Rep: Appr-1-p processing - Chromohalobacter
           salexigens (strain DSM 3043 / ATCC BAA-138 / NCIMB13768)
          Length = 183

 Score = 56.8 bits (131), Expect = 4e-07
 Identities = 27/34 (79%), Positives = 29/34 (85%)
 Frame = +3

Query: 510 VQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611
           V GDIT+L VD IVNAAN SLMG GGVDGAI+RA
Sbjct: 13  VSGDITRLDVDAIVNAANHSLMGGGGVDGAIYRA 46

>UniRef50_A5WHZ6 Cluster: Appr-1-p processing domain protein; n=2;
           Bacteria|Rep: Appr-1-p processing domain protein - Psychrobacter
           sp. PRwf-1
          Length = 194

 Score = 56.8 bits (131), Expect = 4e-07
 Identities = 26/34 (76%), Positives = 28/34 (82%)
 Frame = +3

Query: 510 VQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611
           +Q DIT L VD IVNAAN SL+G GGVDGAIHRA
Sbjct: 29  IQADITTLKVDAIVNAANSSLLGGGGVDGAIHRA 62

>UniRef50_A1IFK2 Cluster: Putative uncharacterized protein; n=1;
           Candidatus Desulfococcus oleovorans Hxd3|Rep: Putative
           uncharacterized protein - Candidatus Desulfococcus oleovorans Hxd3
          Length = 195

 Score = 56.8 bits (131), Expect = 4e-07
 Identities = 26/33 (78%), Positives = 28/33 (84%)
 Frame = +3

Query: 513 QGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611
           QGDIT L VD IVNAAN +L+G GGVDGAIHRA
Sbjct: 33  QGDITTLEVDAIVNAANKTLLGGGGVDGAIHRA 65

>UniRef50_Q5KCD7 Cluster: Putative uncharacterized protein; n=2;
           Filobasidiella neoformans|Rep: Putative uncharacterized protein -
           Cryptococcus neoformans (Filobasidiella neoformans)
          Length = 252

 Score = 56.8 bits (131), Expect = 4e-07
 Identities = 31/66 (46%), Positives = 41/66 (62%), Gaps = 1/66 (1%)
 Frame = +3

Query: 417 LSRKMRFNQKACCTLKKPE-GVRRRYENAYSCVQGDITKLAVDVIVNAANPSLMGSGGVD 593
           LS+  R +        KP+    ++  +  S  +GDIT+L  D+IVNAAN SL+G GGVD
Sbjct: 45  LSQLYRHDHTNALNPTKPKYEFTKQLNDRVSIWRGDITELEADMIVNAANSSLLGGGGVD 104

Query: 594 GAIHRA 611
           GAIHRA
Sbjct: 105 GAIHRA 110

>UniRef50_Q4WYQ2 Cluster: LRP16 family protein; n=8; cellular
           organisms|Rep: LRP16 family protein - Aspergillus fumigatus
           (Sartorya fumigata)
          Length = 354

 Score = 56.8 bits (131), Expect = 4e-07
 Identities = 28/42 (66%), Positives = 32/42 (76%), Gaps = 1/42 (2%)
 Frame = +3

Query: 489 YENAYSCVQGDITKLA-VDVIVNAANPSLMGSGGVDGAIHRA 611
           + N  S ++ DITKL  VD IVNAAN SL+G GGVDGAIHRA
Sbjct: 37  FNNIISLIRNDITKLENVDCIVNAANESLLGGGGVDGAIHRA 78

>UniRef50_A1Z1Q3 Cluster: MACRO domain-containing protein 2; n=41;
           cellular organisms|Rep: MACRO domain-containing protein 2 - Homo
           sapiens (Human)
          Length = 448

 Score = 56.4 bits (130), Expect = 5e-07
 Identities = 27/45 (60%), Positives = 31/45 (68%)
 Frame = +3

Query: 477 VRRRYENAYSCVQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611
           V++      S  +GDIT L VD IVNAAN SL+G GGVDG IHRA
Sbjct: 64  VKKSLTEKVSLYRGDITLLEVDAIVNAANASLLGGGGVDGCIHRA 108

>UniRef50_Q9BQ69 Cluster: MACRO domain-containing protein 1; n=18;
           cellular organisms|Rep: MACRO domain-containing protein 1 - Homo
           sapiens (Human)
          Length = 325

 Score = 56.4 bits (130), Expect = 5e-07
 Identities = 26/36 (72%), Positives = 29/36 (80%)
 Frame = +3

Query: 504 SCVQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611
           S ++ DITKL VD IVNAAN SL+G GGVDG IHRA
Sbjct: 155 SLLRSDITKLEVDAIVNAANSSLLGGGGVDGCIHRA 190

>UniRef50_A6GJ81 Cluster: Putative uncharacterized protein; n=1;
           Plesiocystis pacifica SIR-1|Rep: Putative uncharacterized protein -
           Plesiocystis pacifica SIR-1
          Length = 173

 Score = 56.0 bits (129), Expect = 7e-07
 Identities = 23/33 (69%), Positives = 29/33 (87%)
 Frame = +3

Query: 513 QGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611
           +GDIT+++ D IVNAANP ++G GGVDGAIHRA
Sbjct: 9   RGDITRVSCDAIVNAANPKMLGGGGVDGAIHRA 41

>UniRef50_Q17432 Cluster: Putative uncharacterized protein; n=2;
           Caenorhabditis|Rep: Putative uncharacterized protein -
           Caenorhabditis elegans
          Length = 203

 Score = 56.0 bits (129), Expect = 7e-07
 Identities = 26/32 (81%), Positives = 27/32 (84%)
 Frame = +3

Query: 516 GDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611
           GDITKL+VD IVNAAN  L G GGVDGAIHRA
Sbjct: 31  GDITKLSVDAIVNAANSRLAGGGGVDGAIHRA 62

>UniRef50_Q97AU0 Cluster: UPF0189 protein TV0719; n=1; Thermoplasma
           volcanium|Rep: UPF0189 protein TV0719 - Thermoplasma volcanium
          Length = 186

 Score = 56.0 bits (129), Expect = 7e-07
 Identities = 25/40 (62%), Positives = 29/40 (72%)
 Frame = +3

Query: 495 NAYSCVQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRAG 614
           N    ++GDIT +  + IVNAANPSLMG GGVDGAIH  G
Sbjct: 9   NLIEIIEGDITDVNCEAIVNAANPSLMGGGGVDGAIHLKG 48

>UniRef50_Q8KAE4 Cluster: UPF0189 protein CT2219; n=24; cellular
           organisms|Rep: UPF0189 protein CT2219 - Chlorobium tepidum
          Length = 172

 Score = 56.0 bits (129), Expect = 7e-07
 Identities = 25/34 (73%), Positives = 28/34 (82%)
 Frame = +3

Query: 510 VQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611
           ++ DIT L VD IVNAAN SL+G GGVDGAIHRA
Sbjct: 10  IKADITSLTVDAIVNAANTSLLGGGGVDGAIHRA 43

>UniRef50_A6NXN8 Cluster: Putative uncharacterized protein; n=1;
           Bacteroides capillosus ATCC 29799|Rep: Putative uncharacterized
           protein - Bacteroides capillosus ATCC 29799
          Length = 347

 Score = 55.6 bits (128), Expect = 9e-07
 Identities = 25/34 (73%), Positives = 28/34 (82%)
 Frame = +3

Query: 510 VQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611
           V+ DITK+ VD IVNAAN SL+G GGVDG IHRA
Sbjct: 6   VRNDITKMKVDAIVNAANESLLGGGGVDGCIHRA 39

>UniRef50_Q985D2 Cluster: UPF0189 protein mll7730; n=54; cellular
           organisms|Rep: UPF0189 protein mll7730 - Rhizobium loti
           (Mesorhizobium loti)
          Length = 176

 Score = 55.6 bits (128), Expect = 9e-07
 Identities = 26/32 (81%), Positives = 27/32 (84%)
 Frame = +3

Query: 516 GDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611
           GDITKL VD IVNAAN  L+G GGVDGAIHRA
Sbjct: 13  GDITKLDVDAIVNAANTLLLGGGGVDGAIHRA 44

>UniRef50_Q0LI88 Cluster: Appr-1-p processing; n=2; cellular
           organisms|Rep: Appr-1-p processing - Herpetosiphon aurantiacus
           ATCC 23779
          Length = 173

 Score = 55.2 bits (127), Expect = 1e-06
 Identities = 26/34 (76%), Positives = 28/34 (82%)
 Frame = +3

Query: 510 VQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611
           +QGDITK A   IVNAAN SL+G GGVDGAIHRA
Sbjct: 8   LQGDITKFAGAAIVNAANSSLLGGGGVDGAIHRA 41

>UniRef50_Q8EP31 Cluster: Hypothetical conserved protein; n=1;
           Oceanobacillus iheyensis|Rep: Hypothetical conserved protein -
           Oceanobacillus iheyensis
          Length = 185

 Score = 54.8 bits (126), Expect = 2e-06
 Identities = 26/40 (65%), Positives = 29/40 (72%)
 Frame = +3

Query: 492 ENAYSCVQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611
           +N    V GDITK   +VIVNAAN SL+G GGVDGAIH A
Sbjct: 7   DNTLEIVVGDITKETTNVIVNAANGSLLGGGGVDGAIHHA 46

>UniRef50_Q6PHJ5 Cluster: Zgc:65960; n=5; cellular organisms|Rep:
           Zgc:65960 - Danio rerio (Zebrafish) (Brachydanio rerio)
          Length = 452

 Score = 54.4 bits (125), Expect = 2e-06
 Identities = 25/36 (69%), Positives = 28/36 (77%)
 Frame = +3

Query: 504 SCVQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611
           S  +GDIT L +D IVNAAN SL+G GGVDG IHRA
Sbjct: 64  SLYKGDITILEIDAIVNAANSSLLGGGGVDGCIHRA 99

>UniRef50_Q8TQD0 Cluster: UPF0189 protein MA_1614; n=4; cellular
           organisms|Rep: UPF0189 protein MA_1614 - Methanosarcina acetivorans
          Length = 195

 Score = 54.0 bits (124), Expect = 3e-06
 Identities = 24/34 (70%), Positives = 29/34 (85%)
 Frame = +3

Query: 510 VQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611
           ++ DIT+L VD IVNAAN +L+G GGVDGAIHRA
Sbjct: 32  IERDITELKVDAIVNAANNTLLGGGGVDGAIHRA 65

>UniRef50_Q47EQ7 Cluster: Appr-1-p processing; n=1; Dechloromonas
           aromatica RCB|Rep: Appr-1-p processing - Dechloromonas aromatica
           (strain RCB)
          Length = 186

 Score = 53.6 bits (123), Expect = 4e-06
 Identities = 24/33 (72%), Positives = 27/33 (81%)
 Frame = +3

Query: 516 GDITKLAVDVIVNAANPSLMGSGGVDGAIHRAG 614
           GD+T  AVD IVNAAN +L+G GGVDGAIHR G
Sbjct: 19  GDLTDQAVDAIVNAANRTLLGGGGVDGAIHRRG 51

>UniRef50_P0A1T0 Cluster: Uncharacterized protein ymdA precursor; n=4;
           Salmonella|Rep: Uncharacterized protein ymdA precursor -
           Salmonella typhimurium
          Length = 106

 Score = 53.6 bits (123), Expect = 4e-06
 Identities = 22/41 (53%), Positives = 28/41 (68%)
 Frame = +1

Query: 127 FFPFIAIAGSTVQGGVIHFYGQIVEPACDVSTQSSPVEMNC 249
           FF   A+   TVQGGVIHF G IVEP CD+ST +  +++ C
Sbjct: 18  FFIHSAVGQQTVQGGVIHFRGAIVEPLCDISTHAENIDLTC 58

 Score = 36.7 bits (81), Expect = 0.44
 Identities = 16/22 (72%), Positives = 19/22 (86%)
 Frame = +3

Query: 324 IASVKVQYLDKQKKLAVMNIEY 389
           IA+V++ YLD QK LAVMNIEY
Sbjct: 84  IATVRLHYLDAQKSLAVMNIEY 105

>UniRef50_Q66HV6 Cluster: Zgc:92353; n=1; Danio rerio|Rep: Zgc:92353 -
           Danio rerio (Zebrafish) (Brachydanio rerio)
          Length = 248

 Score = 53.2 bits (122), Expect = 5e-06
 Identities = 22/31 (70%), Positives = 26/31 (83%)
 Frame = +3

Query: 516 GDITKLAVDVIVNAANPSLMGSGGVDGAIHR 608
           GDITKL +D + NAAN +L+G GGVDGAIHR
Sbjct: 75  GDITKLEIDAVANAANKTLLGGGGVDGAIHR 105

>UniRef50_Q1HPZ5 Cluster: LRP16 protein; n=1; Bombyx mori|Rep: LRP16
           protein - Bombyx mori (Silk moth)
          Length = 275

 Score = 53.2 bits (122), Expect = 5e-06
 Identities = 24/36 (66%), Positives = 27/36 (75%)
 Frame = +3

Query: 504 SCVQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611
           S  +GDITKL +D +VNAAN  L   GGVDGAIHRA
Sbjct: 112 SIFKGDITKLEIDAVVNAANSRLKAGGGVDGAIHRA 147

>UniRef50_Q0CQJ0 Cluster: Protein LRP16; n=5; cellular organisms|Rep:
           Protein LRP16 - Aspergillus terreus (strain NIH 2624)
          Length = 344

 Score = 53.2 bits (122), Expect = 5e-06
 Identities = 27/37 (72%), Positives = 30/37 (81%), Gaps = 1/37 (2%)
 Frame = +3

Query: 504 SCVQGDITKLA-VDVIVNAANPSLMGSGGVDGAIHRA 611
           S ++ DITKL  VD IVNAAN SL+G GGVDGAIHRA
Sbjct: 42  SLIRHDITKLLDVDCIVNAANSSLLGGGGVDGAIHRA 78

>UniRef50_UPI000049917F Cluster: conserved hypothetical protein; n=1;
           Entamoeba histolytica HM-1:IMSS|Rep: conserved hypothetical
           protein - Entamoeba histolytica HM-1:IMSS
          Length = 316

 Score = 52.8 bits (121), Expect = 6e-06
 Identities = 24/34 (70%), Positives = 27/34 (79%)
 Frame = +3

Query: 510 VQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611
           + GDITK+ VDV+VNAAN  L G GGVDGAIH A
Sbjct: 54  ITGDITKIQVDVVVNAANSYLRGGGGVDGAIHCA 87

>UniRef50_A1G783 Cluster: Appr-1-p processing; n=1; Salinispora arenicola
           CNS205|Rep: Appr-1-p processing - Salinispora arenicola CNS205
          Length = 202

 Score = 52.8 bits (121), Expect = 6e-06
 Identities = 25/38 (65%), Positives = 28/38 (73%)
 Frame = +3

Query: 498 AYSCVQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611
           A   V GDIT+  VD IV AAN SL+G GGVDGA+HRA
Sbjct: 35  AIEVVLGDITQQNVDAIVTAANESLLGGGGVDGAVHRA 72

>UniRef50_A7RJ44 Cluster: Predicted protein; n=3; Eukaryota|Rep:
           Predicted protein - Nematostella vectensis
          Length = 183

 Score = 52.8 bits (121), Expect = 6e-06
 Identities = 23/32 (71%), Positives = 26/32 (81%)
 Frame = +3

Query: 516 GDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611
           GDIT L +D IVNAAN +L+G GGVDG IHRA
Sbjct: 14  GDITALEIDAIVNAANTTLLGGGGVDGCIHRA 45

>UniRef50_UPI000023F24A Cluster: hypothetical protein FG04179.1; n=1;
           Gibberella zeae PH-1|Rep: hypothetical protein FG04179.1 -
           Gibberella zeae PH-1
          Length = 220

 Score = 52.0 bits (119), Expect = 1e-05
 Identities = 23/34 (67%), Positives = 27/34 (79%)
 Frame = +3

Query: 510 VQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611
           ++GDIT+L +D IVNAAN SL G  GVDGAIH A
Sbjct: 47  IRGDITELRIDAIVNAANKSLRGGSGVDGAIHSA 80

>UniRef50_A6LTB5 Cluster: Appr-1-p processing domain protein; n=1;
           Clostridium beijerinckii NCIMB 8052|Rep: Appr-1-p processing
           domain protein - Clostridium beijerinckii NCIMB 8052
          Length = 214

 Score = 52.0 bits (119), Expect = 1e-05
 Identities = 23/31 (74%), Positives = 26/31 (83%)
 Frame = +3

Query: 519 DITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611
           DITK+  D IVNAAN SL+G GGVDGAIH+A
Sbjct: 10  DITKIKFDAIVNAANASLLGGGGVDGAIHKA 40

>UniRef50_Q8RB30 Cluster: UPF0189 protein TTE0995; n=20; Bacteria|Rep:
           UPF0189 protein TTE0995 - Thermoanaerobacter tengcongensis
          Length = 175

 Score = 52.0 bits (119), Expect = 1e-05
 Identities = 23/35 (65%), Positives = 28/35 (80%)
 Frame = +3

Query: 510 VQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRAG 614
           ++G+I    VD IVNAAN SL+G GGVDGAIH+AG
Sbjct: 8   IKGNIVDQEVDAIVNAANSSLIGGGGVDGAIHKAG 42

>UniRef50_Q6AKL0 Cluster: Putative uncharacterized protein; n=1;
           Desulfotalea psychrophila|Rep: Putative uncharacterized protein -
           Desulfotalea psychrophila
          Length = 176

 Score = 51.6 bits (118), Expect = 1e-05
 Identities = 23/31 (74%), Positives = 27/31 (87%)
 Frame = +3

Query: 519 DITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611
           +IT+  VDVIVNAANP L+G GGVDGAIH+A
Sbjct: 10  NITQAEVDVIVNAANPRLLGGGGVDGAIHQA 40

>UniRef50_Q5DCZ3 Cluster: SJCHGC06209 protein; n=1; Schistosoma
           japonicum|Rep: SJCHGC06209 protein - Schistosoma japonicum (Blood
           fluke)
          Length = 194

 Score = 51.2 bits (117), Expect = 2e-05
 Identities = 23/33 (69%), Positives = 25/33 (75%)
 Frame = +3

Query: 513 QGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611
           +GDIT L +D I NAAN  L G GGVDGAIHRA
Sbjct: 33  RGDITHLRIDAIANAANRQLRGGGGVDGAIHRA 65

>UniRef50_A3ZLZ3 Cluster: Putative uncharacterized protein; n=2;
           Planctomycetaceae|Rep: Putative uncharacterized protein -
           Blastopirellula marina DSM 3645
          Length = 191

 Score = 50.8 bits (116), Expect = 3e-05
 Identities = 23/33 (69%), Positives = 25/33 (75%)
 Frame = +3

Query: 516 GDITKLAVDVIVNAANPSLMGSGGVDGAIHRAG 614
           GDIT   VD++VNAAN  L G GGVDGAIH AG
Sbjct: 15  GDITDQNVDIVVNAANSRLAGGGGVDGAIHAAG 47

>UniRef50_Q6AAQ5 Cluster: Conserved protein; n=2; Bacteria|Rep: Conserved
           protein - Propionibacterium acnes
          Length = 223

 Score = 50.4 bits (115), Expect = 3e-05
 Identities = 23/34 (67%), Positives = 26/34 (76%)
 Frame = +3

Query: 510 VQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611
           ++ DIT L VD +VNAAN  L G GGVDGAIHRA
Sbjct: 59  LRADITTLDVDAVVNAANRQLAGGGGVDGAIHRA 92

>UniRef50_Q0B030 Cluster: Phosphatase; n=1; Syntrophomonas wolfei subsp.
           wolfei str. Goettingen|Rep: Phosphatase - Syntrophomonas wolfei
           subsp. wolfei (strain Goettingen)
          Length = 176

 Score = 50.0 bits (114), Expect = 4e-05
 Identities = 26/35 (74%), Positives = 28/35 (80%), Gaps = 1/35 (2%)
 Frame = +3

Query: 510 VQGDITKLA-VDVIVNAANPSLMGSGGVDGAIHRA 611
           VQGDIT+   + VIVNAAN SL G GGVDGAIHRA
Sbjct: 11  VQGDITRQEDMAVIVNAANSSLRGGGGVDGAIHRA 45

>UniRef50_Q9HJ67 Cluster: UPF0189 protein Ta1105; n=2; Thermoplasma
           acidophilum|Rep: UPF0189 protein Ta1105 - Thermoplasma acidophilum
          Length = 196

 Score = 49.6 bits (113), Expect = 6e-05
 Identities = 23/32 (71%), Positives = 25/32 (78%)
 Frame = +3

Query: 516 GDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611
           GDIT+   + IVNAAN SLMG GGVDGAIH A
Sbjct: 16  GDITESDAEAIVNAANSSLMGGGGVDGAIHSA 47

>UniRef50_Q1K4D1 Cluster: Appr-1-p processing; n=1; Desulfuromonas
           acetoxidans DSM 684|Rep: Appr-1-p processing - Desulfuromonas
           acetoxidans DSM 684
          Length = 193

 Score = 48.8 bits (111), Expect = 1e-04
 Identities = 22/34 (64%), Positives = 26/34 (76%)
 Frame = +3

Query: 510 VQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611
           ++ DIT+L VD IVN A   L+GSGGVDGAIH A
Sbjct: 7   IKADITQLNVDAIVNTATTKLLGSGGVDGAIHDA 40

>UniRef50_Q30ZH6 Cluster: Appr-1-p processing; n=1; Desulfovibrio
           desulfuricans G20|Rep: Appr-1-p processing - Desulfovibrio
           desulfuricans (strain G20)
          Length = 183

 Score = 47.2 bits (107), Expect = 3e-04
 Identities = 20/34 (58%), Positives = 24/34 (70%)
 Frame = +3

Query: 510 VQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611
           +QGD+T    D +VNAAN  L G GGVDGA+H A
Sbjct: 13  LQGDLTLFKADAVVNAANSRLAGGGGVDGALHAA 46

>UniRef50_Q8B4N1 Cluster: ORF-1; n=8; root|Rep: ORF-1 - Rock bream
           iridovirus
          Length = 566

 Score = 46.4 bits (105), Expect = 5e-04
 Identities = 23/35 (65%), Positives = 24/35 (68%)
 Frame = +3

Query: 504 SCVQGDITKLAVDVIVNAANPSLMGSGGVDGAIHR 608
           S V  DIT L VD IVNAAN   +G GGVDG IHR
Sbjct: 393 SVVLDDITSLRVDAIVNAANTVGLGGGGVDGRIHR 427

>UniRef50_A7EET2 Cluster: Putative uncharacterized protein; n=1;
           Sclerotinia sclerotiorum 1980|Rep: Putative uncharacterized
           protein - Sclerotinia sclerotiorum 1980
          Length = 506

 Score = 46.0 bits (104), Expect = 7e-04
 Identities = 21/33 (63%), Positives = 24/33 (72%)
 Frame = +3

Query: 510 VQGDITKLAVDVIVNAANPSLMGSGGVDGAIHR 608
           V GD+ K  VDVIVNAAN SL+   G+DG IHR
Sbjct: 24  VDGDLLKYPVDVIVNAANASLVRGDGIDGEIHR 56

>UniRef50_Q5R014 Cluster: Predicted phosphatase; n=6; Bacteria|Rep:
           Predicted phosphatase - Idiomarina loihiensis
          Length = 167

 Score = 45.6 bits (103), Expect = 0.001
 Identities = 22/38 (57%), Positives = 25/38 (65%), Gaps = 1/38 (2%)
 Frame = +3

Query: 501 YSCVQGDITKLA-VDVIVNAANPSLMGSGGVDGAIHRA 611
           Y CV GDI +   ++ IVNAAN  L   GGV GAIHRA
Sbjct: 3   YECVHGDINQQTEIEAIVNAANAKLQTGGGVAGAIHRA 40

>UniRef50_A6SR30 Cluster: Putative uncharacterized protein; n=1;
           Botryotinia fuckeliana B05.10|Rep: Putative uncharacterized
           protein - Botryotinia fuckeliana B05.10
          Length = 474

 Score = 45.2 bits (102), Expect = 0.001
 Identities = 21/32 (65%), Positives = 23/32 (71%)
 Frame = +3

Query: 516 GDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611
           GD+ K  VDVIVNAAN  L   GG+DGAIH A
Sbjct: 26  GDMLKYPVDVIVNAANVKLKKGGGIDGAIHAA 57

>UniRef50_UPI00015C5846 Cluster: hypothetical protein CKO_02023; n=1;
           Citrobacter koseri ATCC BAA-895|Rep: hypothetical protein
           CKO_02023 - Citrobacter koseri ATCC BAA-895
          Length = 107

 Score = 44.8 bits (101), Expect = 0.002
 Identities = 19/42 (45%), Positives = 24/42 (57%)
 Frame = +1

Query: 124 LFFPFIAIAGSTVQGGVIHFYGQIVEPACDVSTQSSPVEMNC 249
           + F     A     GGVIHF GQIVEP C+VS +   ++M C
Sbjct: 18  ILFSSSVTAQQITTGGVIHFRGQIVEPPCEVSARQQNIDMTC 59

 Score = 40.3 bits (90), Expect = 0.036
 Identities = 18/45 (40%), Positives = 30/45 (66%)
 Frame = +3

Query: 255 NGSIPGKTYSSKALMSGNVKNAQIASVKVQYLDKQKKLAVMNIEY 389
           NG +    ++ + + +   +  QIASV+V YLD Q++LA+M+IEY
Sbjct: 62  NGQMKTNRFTLQQVTTAPQRLRQIASVRVNYLDPQQRLAIMSIEY 106

>UniRef50_A4W959 Cluster: Putative uncharacterized protein; n=1;
           Enterobacter sp. 638|Rep: Putative uncharacterized protein -
           Enterobacter sp. 638
          Length = 93

 Score = 44.8 bits (101), Expect = 0.002
 Identities = 20/49 (40%), Positives = 29/49 (59%)
 Frame = +1

Query: 109 LMLGSLFFPFIAIAGSTVQGGVIHFYGQIVEPACDVSTQSSPVEMNCPQ 255
           +ML S   P  A + + V GGVIHF G IVE  C+++ Q     ++CP+
Sbjct: 1   MMLSSALIP--AFSATLVNGGVIHFRGMIVENPCEITPQKQQFALSCPK 47

>UniRef50_A0LGZ1 Cluster: Appr-1-p processing domain protein; n=1;
           Syntrophobacter fumaroxidans MPOB|Rep: Appr-1-p processing domain
           protein - Syntrophobacter fumaroxidans (strain DSM 10017 / MPOB)
          Length = 175

 Score = 44.8 bits (101), Expect = 0.002
 Identities = 23/37 (62%), Positives = 25/37 (67%)
 Frame = +3

Query: 504 SCVQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRAG 614
           S VQGD+T+L VD IVNAAN  L   GGV GAI   G
Sbjct: 11  SLVQGDLTELRVDAIVNAANRHLALGGGVAGAIRMKG 47

>UniRef50_O22875 Cluster: Expressed protein; n=7; Magnoliophyta|Rep:
           Expressed protein - Arabidopsis thaliana (Mouse-ear cress)
          Length = 193

 Score = 44.4 bits (100), Expect = 0.002
 Identities = 22/38 (57%), Positives = 27/38 (71%), Gaps = 4/38 (10%)
 Frame = +3

Query: 510 VQGDITKLAVD----VIVNAANPSLMGSGGVDGAIHRA 611
           ++GDITK +VD     IVN AN  ++G GG DGAIHRA
Sbjct: 21  LKGDITKWSVDSSSDAIVNPANERMLGGGGADGAIHRA 58

>UniRef50_Q9WYX8 Cluster: UPF0189 protein TM_0508; n=4;
           Thermotogaceae|Rep: UPF0189 protein TM_0508 - Thermotoga maritima
          Length = 599

 Score = 44.4 bits (100), Expect = 0.002
 Identities = 23/35 (65%), Positives = 25/35 (71%)
 Frame = +3

Query: 510 VQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRAG 614
           V+GDIT+  VD IVNAAN  L   GGV GAI RAG
Sbjct: 432 VKGDITREEVDAIVNAANEYLKHGGGVAGAIVRAG 466

>UniRef50_A1D5K4 Cluster: Appr-1-p processing enzyme family protein; n=1;
           Neosartorya fischeri NRRL 181|Rep: Appr-1-p processing enzyme
           family protein - Neosartorya fischeri (strain ATCC 1020 / DSM
           3700 / NRRL 181)(Aspergillus fischerianus (strain ATCC 1020 / DSM
           3700 / NRRL 181))
          Length = 257

 Score = 44.0 bits (99), Expect = 0.003
 Identities = 21/36 (58%), Positives = 25/36 (69%)
 Frame = +3

Query: 504 SCVQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611
           S ++ DI +L VD IVNAA  SL G GGVD A+H A
Sbjct: 93  SFIEHDIARLQVDCIVNAAKESLQGGGGVDRAMHLA 128

>UniRef50_Q93SX7 Cluster: UPF0189 protein; n=1; Acinetobacter sp.
           ED45-25|Rep: UPF0189 protein - Acinetobacter sp. (strain ED45-25)
          Length = 183

 Score = 44.0 bits (99), Expect = 0.003
 Identities = 19/33 (57%), Positives = 24/33 (72%)
 Frame = +3

Query: 510 VQGDITKLAVDVIVNAANPSLMGSGGVDGAIHR 608
           +Q DIT  AV  IVN+AN SL+G GG+D  IH+
Sbjct: 7   IQADITAFAVHAIVNSANKSLLGGGGLDYVIHK 39

>UniRef50_A5V0Y4 Cluster: Appr-1-p processing domain protein; n=5;
           Bacteria|Rep: Appr-1-p processing domain protein - Roseiflexus sp.
RS-1 Length = 181 Score = 43.6 bits (98), Expect = 0.004 Identities = 20/34 (58%), Positives = 25/34 (73%) Frame = +3 Query: 510 VQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611 ++G+I + VD IVNAAN +L GGV GAIHRA Sbjct: 13 IRGNIVEQDVDAIVNAANETLAPGGGVSGAIHRA 46 >UniRef50_A6F1P7 Cluster: Appr-1-p processing; n=1; Marinobacter algicola DG893|Rep: Appr-1-p processing - Marinobacter algicola DG893 Length = 183 Score = 43.2 bits (97), Expect = 0.005 Identities = 20/36 (55%), Positives = 26/36 (72%), Gaps = 1/36 (2%) Frame = +3 Query: 507 CVQGDITKLA-VDVIVNAANPSLMGSGGVDGAIHRA 611 CV+GDIT+ ++ +VNAAN LM GGV GA+H A Sbjct: 13 CVRGDITRQDDLEAVVNAANAQLMSGGGVAGALHAA 48 >UniRef50_Q4DSL4 Cluster: Putative uncharacterized protein; n=3; Trypanosoma|Rep: Putative uncharacterized protein - Trypanosoma cruzi Length = 297 Score = 42.7 bits (96), Expect = 0.007 Identities = 19/32 (59%), Positives = 23/32 (71%) Frame = +3 Query: 516 GDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611 G +T L +D IVNAAN + +G GVDGAIH A Sbjct: 130 GPVTDLQLDAIVNAANKTCLGGKGVDGAIHAA 161 >UniRef50_A0UYE8 Cluster: Appr-1-p processing; n=3; Bacteria|Rep: Appr-1-p processing - Clostridium cellulolyticum H10 Length = 341 Score = 42.3 bits (95), Expect = 0.009 Identities = 22/34 (64%), Positives = 24/34 (70%) Frame = +3 Query: 510 VQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611 V+ DITKL VD IVNAAN L GGV GAI +A Sbjct: 6 VRQDITKLKVDAIVNAANTDLRMGGGVCGAIFKA 39 >UniRef50_A3BF04 Cluster: Putative uncharacterized protein; n=1; Oryza sativa (japonica cultivar-group)|Rep: Putative uncharacterized protein - Oryza sativa subsp. 
japonica (Rice) Length = 128 Score = 41.9 bits (94), Expect = 0.012 Identities = 22/35 (62%), Positives = 25/35 (71%), Gaps = 4/35 (11%) Frame = +3 Query: 519 DITKLAVD----VIVNAANPSLMGSGGVDGAIHRA 611 DIT +VD IVNAAN ++G GGVDGAIHRA Sbjct: 30 DITLWSVDGATVAIVNAANERMLGGGGVDGAIHRA 64 >UniRef50_UPI0000498318 Cluster: conserved hypothetical protein; n=1; Entamoeba histolytica HM-1:IMSS|Rep: conserved hypothetical protein - Entamoeba histolytica HM-1:IMSS Length = 627 Score = 41.5 bits (93), Expect = 0.015 Identities = 28/76 (36%), Positives = 44/76 (57%) Frame = +3 Query: 351 DKQKKLAVMNIEYN*VSEQLTLLSRKMRFNQKACCTLKKPEGVRRRYENAYSCVQGDITK 530 D+ +KL N + +S+ L ++ + ++ K TLK+ +G + N + +GDITK Sbjct: 86 DECQKLCQNNELMDLISQMLQEKNKDVVYS-KNIITLKE-QGHSFLFSNKLALWKGDITK 143 Query: 531 LAVDVIVNAANPSLMG 578 L VD IVNAAN L+G Sbjct: 144 LCVDAIVNAANNQLLG 159 >UniRef50_A5D049 Cluster: Predicted phosphatase; n=3; Bacteria|Rep: Predicted phosphatase - Pelotomaculum thermopropionicum SI Length = 359 Score = 41.5 bits (93), Expect = 0.015 Identities = 21/35 (60%), Positives = 24/35 (68%) Frame = +3 Query: 510 VQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRAG 614 ++GDIT+L VD IVNAAN L GV GAI R G Sbjct: 5 LKGDITELQVDAIVNAANNHLWMGAGVAGAIKRKG 39 >UniRef50_O07733 Cluster: UPF0189 protein Rv1899c/MT1950; n=9; Mycobacterium|Rep: UPF0189 protein Rv1899c/MT1950 - Mycobacterium tuberculosis Length = 359 Score = 41.5 bits (93), Expect = 0.015 Identities = 20/34 (58%), Positives = 23/34 (67%) Frame = +3 Query: 513 QGDITKLAVDVIVNAANPSLMGSGGVDGAIHRAG 614 Q D+TKL +D I NAAN L +GGV AI RAG Sbjct: 196 QADVTKLELDAITNAANTRLRHAGGVAAAIARAG 229 >UniRef50_A7BY23 Cluster: Putative uncharacterized protein; n=1; Beggiatoa sp. PS|Rep: Putative uncharacterized protein - Beggiatoa sp. 
PS Length = 708 Score = 41.1 bits (92), Expect = 0.020 Identities = 19/35 (54%), Positives = 24/35 (68%) Frame = +3 Query: 510 VQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRAG 614 +QG+IT+ VD IVN + SL GSG +D AI AG Sbjct: 536 IQGNITQQKVDAIVNTTDRSLSGSGAIDYAIQNAG 570 >UniRef50_A5TRW5 Cluster: Putative uncharacterized protein; n=1; Fusobacterium nucleatum subsp. polymorphum ATCC 10953|Rep: Putative uncharacterized protein - Fusobacterium nucleatum subsp. polymorphum ATCC 10953 Length = 175 Score = 40.3 bits (90), Expect = 0.036 Identities = 23/42 (54%), Positives = 27/42 (64%), Gaps = 1/42 (2%) Frame = +3 Query: 489 YENAYSCVQGDITKLA-VDVIVNAANPSLMGSGGVDGAIHRA 611 Y++ V GDITK+ V+ IVNAAN L GGV GAI RA Sbjct: 2 YKDIIKLVNGDITKIPEVEAIVNAANNYLEMGGGVCGAIFRA 43 >UniRef50_Q8ZXT3 Cluster: UPF0189 protein PAE1111; n=8; Thermoprotei|Rep: UPF0189 protein PAE1111 - Pyrobaculum aerophilum Length = 182 Score = 40.3 bits (90), Expect = 0.036 Identities = 20/35 (57%), Positives = 24/35 (68%) Frame = +3 Query: 510 VQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRAG 614 ++GDIT++ D IVNAAN L GGV GAI R G Sbjct: 13 MRGDITEVEADAIVNAANSYLEHGGGVAGAIVRKG 47 >UniRef50_A0H6G6 Cluster: Appr-1-p processing; n=1; Chloroflexus aggregans DSM 9485|Rep: Appr-1-p processing - Chloroflexus aggregans DSM 9485 Length = 184 Score = 39.9 bits (89), Expect = 0.047 Identities = 20/33 (60%), Positives = 22/33 (66%) Frame = +3 Query: 513 QGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611 +GDI +VD IVNAAN L GGV GAI RA Sbjct: 18 EGDIVTQSVDAIVNAANEQLRQGGGVCGAIFRA 50 >UniRef50_A5ZAB5 Cluster: Putative uncharacterized protein; n=1; Eubacterium ventriosum ATCC 27560|Rep: Putative uncharacterized protein - Eubacterium ventriosum ATCC 27560 Length = 274 Score = 39.5 bits (88), Expect = 0.062 Identities = 19/41 (46%), Positives = 26/41 (63%) Frame = +3 Query: 456 TLKKPEGVRRRYENAYSCVQGDITKLAVDVIVNAANPSLMG 578 T+K+ G + S QGD+T+L VD IVNAAN +L+G Sbjct: 78 TVKEQHGSNNPLADKISIWQGDMTRLKVDAIVNAANSALLG 118 >UniRef50_Q22CT8 Cluster: Appr-1-p processing enzyme family protein; n=1; 
Tetrahymena thermophila SB210|Rep: Appr-1-p processing enzyme family protein - Tetrahymena thermophila SB210 Length = 535 Score = 39.5 bits (88), Expect = 0.062 Identities = 22/41 (53%), Positives = 24/41 (58%) Frame = +3 Query: 492 ENAYSCVQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRAG 614 E S V+ D+T VD IVNAAN L GGV GAI R G Sbjct: 44 ETQISIVKNDLTMENVDAIVNAANNFLAHGGGVAGAICRKG 84 >UniRef50_A7T167 Cluster: Predicted protein; n=1; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 502 Score = 39.5 bits (88), Expect = 0.062 Identities = 18/32 (56%), Positives = 20/32 (62%) Frame = +3 Query: 516 GDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611 GDITKLA D IVN N SL G + +HRA Sbjct: 58 GDITKLAADAIVNTTNESLSDRGALSERVHRA 89 >UniRef50_A6BCW6 Cluster: Putative uncharacterized protein; n=2; Bacteria|Rep: Putative uncharacterized protein - Dorea longicatena DSM 13814 Length = 267 Score = 39.1 bits (87), Expect = 0.082 Identities = 20/38 (52%), Positives = 26/38 (68%), Gaps = 5/38 (13%) Frame = +3 Query: 513 QGDITKLAVDVIVNAANPSLMG-----SGGVDGAIHRA 611 +GDIT+L+VD IVNAAN ++G G +D AIH A Sbjct: 98 RGDITRLSVDAIVNAANSQMLGCFVPCHGCIDNAIHSA 135 >UniRef50_A0CX06 Cluster: Chromosome undetermined scaffold_3, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_3, whole genome shotgun sequence - Paramecium tetraurelia Length = 1064 Score = 39.1 bits (87), Expect = 0.082 Identities = 22/63 (34%), Positives = 35/63 (55%), Gaps = 1/63 (1%) Frame = +3 Query: 426 KMRFNQKACCTLKKPEGVRRRYENAYSCVQGDITKL-AVDVIVNAANPSLMGSGGVDGAI 602 +++FN + +KK + E + DIT++ VD IVN A+P+L GG+ GA+ Sbjct: 679 EIQFNNQKWIVVKKTPMKIKILEQSIIIHNQDITQIKGVDAIVNVADPNLKNRGGICGAV 738 Query: 603 HRA 611 RA Sbjct: 739 FRA 741 >UniRef50_Q5V4P3 Cluster: Putative uncharacterized protein; n=2; Halobacteriaceae|Rep: Putative uncharacterized protein - Haloarcula marismortui (Halobacterium marismortui) Length = 166 Score = 39.1 bits (87), Expect = 0.082 Identities = 18/37 (48%), Positives = 23/37 
(62%) Frame = +3 Query: 501 YSCVQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611 + +QGDI + D +VNAAN SL GV GA+ RA Sbjct: 3 FEVIQGDIAAQSADALVNAANTSLRMGSGVAGALKRA 39 >UniRef50_A7HJC7 Cluster: Appr-1-p processing domain protein; n=1; Fervidobacterium nodosum Rt17-B1|Rep: Appr-1-p processing domain protein - Fervidobacterium nodosum Rt17-B1 Length = 184 Score = 38.7 bits (86), Expect = 0.11 Identities = 20/35 (57%), Positives = 21/35 (60%) Frame = +3 Query: 510 VQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRAG 614 V GDIT +D IVNAAN L GGV G I R G Sbjct: 14 VVGDITTQNIDAIVNAANSYLSHGGGVAGVISRKG 48 >UniRef50_UPI0000F1EDA9 Cluster: PREDICTED: similar to Poly [ADP-ribose] polymerase 14 (PARP-14) (B aggressive lymphoma protein 2); n=1; Danio rerio|Rep: PREDICTED: similar to Poly [ADP-ribose] polymerase 14 (PARP-14) (B aggressive lymphoma protein 2) - Danio rerio Length = 1419 Score = 37.9 bits (84), Expect = 0.19 Identities = 18/32 (56%), Positives = 23/32 (71%) Frame = +3 Query: 516 GDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611 GDITK+ V+ +VN+ N SL S GV GAI +A Sbjct: 909 GDITKVKVEAVVNSTNTSLNLSSGVSGAILKA 940 Score = 37.1 bits (82), Expect = 0.33 Identities = 19/33 (57%), Positives = 24/33 (72%) Frame = +3 Query: 513 QGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611 +GDITK A DVIVN+ N +L + GV GAI +A Sbjct: 624 KGDITKEAADVIVNSTNKTLDLNTGVSGAILKA 656 >UniRef50_Q6D5N1 Cluster: Putative exported protein; n=1; Pectobacterium atrosepticum|Rep: Putative exported protein - Erwinia carotovora subsp. 
atroseptica (Pectobacterium atrosepticum) Length = 98 Score = 37.9 bits (84), Expect = 0.19 Identities = 15/27 (55%), Positives = 20/27 (74%) Frame = +1 Query: 169 GVIHFYGQIVEPACDVSTQSSPVEMNC 249 G+I F G IVEP CDV+T S+ V ++C Sbjct: 28 GIIRFSGAIVEPVCDVTTHSNNVTVSC 54 >UniRef50_A6PEZ6 Cluster: Appr-1-p processing domain protein; n=1; Shewanella sediminis HAW-EB3|Rep: Appr-1-p processing domain protein - Shewanella sediminis HAW-EB3 Length = 268 Score = 37.9 bits (84), Expect = 0.19 Identities = 21/38 (55%), Positives = 23/38 (60%), Gaps = 5/38 (13%) Frame = +3 Query: 513 QGDITKLAVDVIVNAANPSLMG-----SGGVDGAIHRA 611 QGDIT+LA D IVNAAN L G +D AIH A Sbjct: 95 QGDITRLAADAIVNAANKELQGCFQPLHSCIDNAIHSA 132 >UniRef50_A7S3X0 Cluster: Predicted protein; n=1; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 143 Score = 37.9 bits (84), Expect = 0.19 Identities = 19/34 (55%), Positives = 21/34 (61%) Frame = +3 Query: 513 QGDITKLAVDVIVNAANPSLMGSGGVDGAIHRAG 614 QGDIT D +VNAAN L+ GGV GAI G Sbjct: 5 QGDITNERADAVVNAANCDLIHGGGVAGAILAKG 38 >UniRef50_A2DTG7 Cluster: Appr-1-p processing enzyme family protein; n=2; Trichomonas vaginalis G3|Rep: Appr-1-p processing enzyme family protein - Trichomonas vaginalis G3 Length = 316 Score = 37.9 bits (84), Expect = 0.19 Identities = 19/32 (59%), Positives = 20/32 (62%) Frame = +3 Query: 516 GDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611 GD TKL D IVNAAN L GG+ GAI A Sbjct: 61 GDSTKLKCDAIVNAANSYLAAGGGICGAIFSA 92 >UniRef50_A3DLM0 Cluster: Appr-1-p processing domain protein; n=1; Staphylothermus marinus F1|Rep: Appr-1-p processing domain protein - Staphylothermus marinus (strain ATCC 43588 / DSM 3639 / F1) Length = 192 Score = 37.9 bits (84), Expect = 0.19 Identities = 17/35 (48%), Positives = 24/35 (68%) Frame = +3 Query: 510 VQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRAG 614 V+GDIT+L V+ IVN AN ++ GG+ G + R G Sbjct: 20 VKGDITELDVEAIVNPANSFMLMGGGLAGVLKRKG 54 >UniRef50_A7B8S3 Cluster: Putative uncharacterized protein; n=1; Actinomyces odontolyticus 
ATCC 17982|Rep: Putative uncharacterized protein - Actinomyces odontolyticus ATCC 17982 Length = 270 Score = 37.5 bits (83), Expect = 0.25 Identities = 20/38 (52%), Positives = 25/38 (65%), Gaps = 5/38 (13%) Frame = +3 Query: 513 QGDITKLAVDVIVNAANPSLMGSGG-----VDGAIHRA 611 +GDIT+L VD IVNAAN +L+G +D AIH A Sbjct: 91 RGDITRLEVDAIVNAANSALLGCRAPGHTCIDNAIHSA 128 >UniRef50_P62605 Cluster: Type-1 fimbrial protein, C chain precursor; n=70; Enterobacteriaceae|Rep: Type-1 fimbrial protein, C chain precursor - Escherichia coli Length = 180 Score = 37.5 bits (83), Expect = 0.25 Identities = 17/39 (43%), Positives = 24/39 (61%) Frame = +1 Query: 142 AIAGSTVQGGVIHFYGQIVEPACDVSTQSSPVEMNCPQM 258 A A +TV GG +HF G++V AC V+T S +N Q+ Sbjct: 22 ASAVTTVNGGTVHFKGEVVNAACAVNTNSFDQTVNLGQV 60 >UniRef50_Q460N5 Cluster: Poly [ADP-ribose] polymerase 14; n=23; Euteleostomi|Rep: Poly [ADP-ribose] polymerase 14 - Homo sapiens (Human) Length = 1720 Score = 37.1 bits (82), Expect = 0.33 Identities = 16/33 (48%), Positives = 23/33 (69%) Frame = +3 Query: 513 QGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611 QGD+ +L VDV+VNA+N L GG+ A+ +A Sbjct: 727 QGDLARLPVDVVVNASNEDLKHYGGLAAALSKA 759 >UniRef50_Q0UG78 Cluster: Putative uncharacterized protein; n=1; Phaeosphaeria nodorum|Rep: Putative uncharacterized protein - Phaeosphaeria nodorum (Septoria nodorum) Length = 2298 Score = 36.7 bits (81), Expect = 0.44 Identities = 20/44 (45%), Positives = 27/44 (61%), Gaps = 2/44 (4%) Frame = +3 Query: 486 RYENAYSCVQGDITKLAVDVIVNAANPSLMGSGG--VDGAIHRA 611 +Y S D+TKL VD IVN+AN SL + G ++ AIH+A Sbjct: 656 KYNRIISFCHHDLTKLKVDAIVNSANKSLKMTRGDTLNNAIHKA 699 Score = 34.7 bits (76), Expect = 1.8 Identities = 20/53 (37%), Positives = 29/53 (54%) Frame = +3 Query: 456 TLKKPEGVRRRYENAYSCVQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRAG 614 T KP V + + V+ DITKL VDV+VN+ + S G G +D + + G Sbjct: 1068 TQAKPSAV---FNDKIYLVREDITKLEVDVMVNSTDVSFRGMGTLDRTVLQKG 1117 >UniRef50_Q9YBE9 Cluster: UPF0189 protein APE_1648.1; n=1; Aeropyrum pernix|Rep: UPF0189 protein 
APE_1648.1 - Aeropyrum pernix Length = 189 Score = 36.7 bits (81), Expect = 0.44 Identities = 15/33 (45%), Positives = 22/33 (66%) Frame = +3 Query: 516 GDITKLAVDVIVNAANPSLMGSGGVDGAIHRAG 614 GD+TK+ + +VN AN ++ GG GA+ RAG Sbjct: 16 GDLTKVRAEAVVNPANSLMIMGGGAAGALKRAG 48 >UniRef50_Q8ZJT7 Cluster: Putative periplasmic protein; n=3; Salmonella|Rep: Putative periplasmic protein - Salmonella typhimurium Length = 104 Score = 36.3 bits (80), Expect = 0.58 Identities = 15/34 (44%), Positives = 22/34 (64%) Frame = +1 Query: 148 AGSTVQGGVIHFYGQIVEPACDVSTQSSPVEMNC 249 A + V GVIHF GQIVE C+++ +E++C Sbjct: 24 ATAPVSAGVIHFKGQIVEYGCNLAPHDRNIEVSC 57 >UniRef50_UPI0000498CB9 Cluster: conserved hypothetical protein; n=1; Entamoeba histolytica HM-1:IMSS|Rep: conserved hypothetical protein - Entamoeba histolytica HM-1:IMSS Length = 348 Score = 35.9 bits (79), Expect = 0.77 Identities = 22/52 (42%), Positives = 31/52 (59%), Gaps = 6/52 (11%) Frame = +3 Query: 477 VRRRYENAYSCVQGDITKLAVDVIVNAANPSLMG-----SGGVDGAIH-RAG 614 + +++ + +GDITKL +D IVNAAN +L+G VD IH RAG Sbjct: 85 LNKQFSKSIRVWKGDITKLKIDSIVNAANNTLVGCFIPLHSCVDSIIHERAG 136 >UniRef50_Q2V9U1 Cluster: Nonstructural protein 3; n=38; Eastern equine encephalitis virus|Rep: Nonstructural protein 3 - Eastern equine encephalitis virus (EEEV) (Eastern equineencephalomyelitis virus) Length = 539 Score = 35.9 bits (79), Expect = 0.77 Identities = 17/37 (45%), Positives = 23/37 (62%) Frame = +3 Query: 498 AYSCVQGDITKLAVDVIVNAANPSLMGSGGVDGAIHR 608 AY ++GDI+K D IVNAAN GV GA+++ Sbjct: 3 AYRVIRGDISKSTDDAIVNAANNKGQPGAGVCGALYK 39 >UniRef50_Q03IQ8 Cluster: Predicted phosphatase homologous to the C-terminal domain of histone macroH2A1; n=3; Streptococcus thermophilus|Rep: Predicted phosphatase homologous to the C-terminal domain of histone macroH2A1 - Streptococcus thermophilus (strain ATCC BAA-491 / LMD-9) Length = 260 Score = 35.9 bits (79), Expect = 0.77 Identities = 19/36 (52%), Positives = 24/36 (66%), Gaps = 5/36 (13%) Frame = +3 Query: 513 
QGDITKLAVDVIVNAANPSLMG-----SGGVDGAIH 605 +GDIT+L +D IVNAAN +L+G VD AIH Sbjct: 89 KGDITRLEIDAIVNAANKTLLGCMKPLHNCVDNAIH 124 >UniRef50_A0X2G8 Cluster: Appr-1-p processing domain protein; n=1; Shewanella pealeana ATCC 700345|Rep: Appr-1-p processing domain protein - Shewanella pealeana ATCC 700345 Length = 304 Score = 35.9 bits (79), Expect = 0.77 Identities = 22/40 (55%), Positives = 26/40 (65%), Gaps = 6/40 (15%) Frame = +3 Query: 513 QGDITKLAVDVIVNAANPSLMG-----SGGVDGAIH-RAG 614 +GDIT LAVD IVNAAN ++G +D AIH RAG Sbjct: 126 KGDITTLAVDAIVNAANNQMLGCFQPQHKCIDNAIHNRAG 165 >UniRef50_Q5XC09 Cluster: UPF0189 protein M6_Spy0919; n=19; Streptococcus|Rep: UPF0189 protein M6_Spy0919 - Streptococcus pyogenes serotype M6 Length = 270 Score = 35.5 bits (78), Expect = 1.0 Identities = 20/35 (57%), Positives = 22/35 (62%), Gaps = 5/35 (14%) Frame = +3 Query: 516 GDITKLAVDVIVNAANPSLMG-----SGGVDGAIH 605 GDI LAVD IVNAAN L+G G +D AIH Sbjct: 91 GDIRYLAVDAIVNAANSELLGCFIPNHGCIDNAIH 125 >UniRef50_UPI0000F2CC14 Cluster: PREDICTED: similar to Poly [ADP-ribose] polymerase 14 (PARP-14) (B aggressive lymphoma protein 2); n=1; Monodelphis domestica|Rep: PREDICTED: similar to Poly [ADP-ribose] polymerase 14 (PARP-14) (B aggressive lymphoma protein 2) - Monodelphis domestica Length = 1874 Score = 35.1 bits (77), Expect = 1.3 Identities = 15/33 (45%), Positives = 21/33 (63%) Frame = +3 Query: 513 QGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611 +GD+T+ DV+VNAAN L GG+ A+ A Sbjct: 867 KGDLTQFPADVVVNAANEELQHHGGLAAALSEA 899 >UniRef50_Q5KUT6 Cluster: Hypothetical conserved protein; n=2; Geobacillus|Rep: Hypothetical conserved protein - Geobacillus kaustophilus Length = 161 Score = 35.1 bits (77), Expect = 1.3 Identities = 20/38 (52%), Positives = 24/38 (63%), Gaps = 1/38 (2%) Frame = +3 Query: 504 SCVQGDITKL-AVDVIVNAANPSLMGSGGVDGAIHRAG 614 S + GD+TK+ V+ I NAAN GGV AIHRAG Sbjct: 3 SAMVGDLTKVEGVEYICNAANGIGPMGGGVAAAIHRAG 40 >UniRef50_A2FMC7 Cluster: Appr-1-p processing enzyme family protein; n=1; Trichomonas vaginalis 
G3|Rep: Appr-1-p processing enzyme family protein - Trichomonas vaginalis G3 Length = 361 Score = 35.1 bits (77), Expect = 1.3 Identities = 15/34 (44%), Positives = 21/34 (61%) Frame = +3 Query: 510 VQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611 ++G+ KL D +VNAAN L GG+ G +H A Sbjct: 122 MRGNSVKLECDAVVNAANSHLYPGGGICGVLHSA 155 >UniRef50_A0CX10 Cluster: Chromosome undetermined scaffold_3, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_3, whole genome shotgun sequence - Paramecium tetraurelia Length = 183 Score = 35.1 bits (77), Expect = 1.3 Identities = 19/35 (54%), Positives = 24/35 (68%), Gaps = 1/35 (2%) Frame = +3 Query: 510 VQGDITKLA-VDVIVNAANPSLMGSGGVDGAIHRA 611 ++ +I KL VD IVNAAN L+ GGV GAI +A Sbjct: 9 IKENIVKLVDVDAIVNAANQELLPGGGVCGAIFQA 43 >UniRef50_UPI00006A2284 Cluster: UPI00006A2284 related cluster; n=1; Xenopus tropicalis|Rep: UPI00006A2284 UniRef100 entry - Xenopus tropicalis Length = 694 Score = 34.7 bits (76), Expect = 1.8 Identities = 16/31 (51%), Positives = 22/31 (70%) Frame = +3 Query: 519 DITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611 D+ + +VDV+VNAAN L GG+ GA+ RA Sbjct: 9 DLARHSVDVVVNAANEDLKHIGGLAGALLRA 39 >UniRef50_Q6ZKH7 Cluster: Putative uncharacterized protein OJ1119_D01.23; n=2; Oryza sativa (japonica cultivar-group)|Rep: Putative uncharacterized protein OJ1119_D01.23 - Oryza sativa subsp. 
japonica (Rice) Length = 267 Score = 34.7 bits (76), Expect = 1.8 Identities = 18/32 (56%), Positives = 22/32 (68%), Gaps = 4/32 (12%) Frame = +3 Query: 513 QGDITKLAVD----VIVNAANPSLMGSGGVDG 596 +GDIT +VD IVNAAN ++G GGVDG Sbjct: 87 KGDITLWSVDGATVAIVNAANERMLGGGGVDG 118 >UniRef50_A2BJA7 Cluster: A1pp, Appr-1-p processing enzyme; n=1; Hyperthermus butylicus DSM 5456|Rep: A1pp, Appr-1-p processing enzyme - Hyperthermus butylicus (strain DSM 5456 / JCM 9403) Length = 199 Score = 34.7 bits (76), Expect = 1.8 Identities = 15/33 (45%), Positives = 22/33 (66%) Frame = +3 Query: 513 QGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611 +GDIT+ + +VN AN ++ GGV GA+ RA Sbjct: 20 RGDITEAECEAVVNPANSLMIMGGGVAGALRRA 52 >UniRef50_Q9JMS9 Cluster: Uncharacterized protein yuaK; n=1; Escherichia coli K12|Rep: Uncharacterized protein yuaK - Escherichia coli (strain K12) Length = 94 Score = 34.7 bits (76), Expect = 1.8 Identities = 15/15 (100%), Positives = 15/15 (100%) Frame = +2 Query: 2 YLSDIGCLEIQGASL 46 YLSDIGCLEIQGASL Sbjct: 31 YLSDIGCLEIQGASL 45 >UniRef50_P87515 Cluster: Non-structural polyprotein (Polyprotein nsP1234) (P1234) [Contains: P123; P123'; mRNA-capping enzyme nsP1 (EC 2.1.1.-) (EC 2.7.7.-) (Non- structural protein 1); Protease/triphosphatase/NTPase/helicase nsP2 (EC 3.4.22.-) (EC 3.1.3.33) (EC 3.6.1.15) (EC 3.6.1.-) (Non-structural protein 2) (nsP2); Non-structural protein 3 (nsP3); Non-structural protein 3' (nsP3'); RNA-directed RNA polymerase nsP4 (EC 2.7.7.48) (Non-structural protein 4) (nsP4)]; n=13; Alphavirus|Rep: Non-structural polyprotein (Polyprotein nsP1234) (P1234) [Contains: P123; P123'; mRNA-capping enzyme nsP1 (EC 2.1.1.-) (EC 2.7.7.-) (Non- structural protein 1); Protease/triphosphatase/NTPase/helicase nsP2 (EC 3.4.22.-) (EC 3.1.3.33) (EC 3.6.1.15) (EC 3.6.1.-) (Non-structural protein 2) (nsP2); Non-structural protein 3 (nsP3); Non-structural protein 3' (nsP3'); RNA-directed RNA polymerase nsP4 (EC 2.7.7.48) (Non-structural protein 4) (nsP4)] - Barmah forest virus (BFV) 
Length = 2410 Score = 34.7 bits (76), Expect = 1.8 Identities = 17/37 (45%), Positives = 22/37 (59%) Frame = +3 Query: 498 AYSCVQGDITKLAVDVIVNAANPSLMGSGGVDGAIHR 608 AY +GDI+ D +VNAAN + GV GAI+R Sbjct: 1334 AYRVKRGDISNAPEDAVVNAANQQGVKGAGVCGAIYR 1370 >UniRef50_Q4RG95 Cluster: Chromosome 12 SCAF15104, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome 12 SCAF15104, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 1433 Score = 34.3 bits (75), Expect = 2.3 Identities = 22/68 (32%), Positives = 33/68 (48%) Frame = +3 Query: 408 LTLLSRKMRFNQKACCTLKKPEGVRRRYENAYSCVQGDITKLAVDVIVNAANPSLMGSGG 587 + LLS + + C + P GV+ S Q D+ L VD +VN AN +L +GG Sbjct: 477 VALLSTHPQSSASCLCRVSAPSGVQ------LSVSQADLCALQVDAVVNPANENLQHTGG 530 Query: 588 VDGAIHRA 611 + A+ A Sbjct: 531 LALALLEA 538 >UniRef50_Q9WJC8 Cluster: Nonstructural polyprotein; n=12; Venezuelan equine encephalitis virus|Rep: Nonstructural polyprotein - Venezuelan equine encephalitis virus Length = 2455 Score = 34.3 bits (75), Expect = 2.3 Identities = 17/37 (45%), Positives = 22/37 (59%) Frame = +3 Query: 498 AYSCVQGDITKLAVDVIVNAANPSLMGSGGVDGAIHR 608 +Y V+GDI VIVNAAN GGV GA+++ Sbjct: 1332 SYHVVRGDIANAEEGVIVNAANSRGQPGGGVCGALYK 1368 >UniRef50_Q8IXQ6 Cluster: Poly [ADP-ribose] polymerase 9; n=26; Eutheria|Rep: Poly [ADP-ribose] polymerase 9 - Homo sapiens (Human) Length = 854 Score = 34.3 bits (75), Expect = 2.3 Identities = 16/32 (50%), Positives = 22/32 (68%) Frame = +3 Query: 519 DITKLAVDVIVNAANPSLMGSGGVDGAIHRAG 614 D+T AVD +VNAAN L+ GG+ A+ +AG Sbjct: 126 DLTTHAVDAVVNAANEDLLHGGGLALALVKAG 157 >UniRef50_UPI0000660C67 Cluster: Homolog of Oncorhynchus mykiss "VHSV-induced protein-10.; n=1; Takifugu rubripes|Rep: Homolog of Oncorhynchus mykiss "VHSV-induced protein-10. 
- Takifugu rubripes Length = 1476 Score = 33.9 bits (74), Expect = 3.1 Identities = 16/35 (45%), Positives = 21/35 (60%) Frame = +3 Query: 498 AYSCVQGDITKLAVDVIVNAANPSLMGSGGVDGAI 602 A+ V GDITK DVI+N++N + GV AI Sbjct: 896 AFEVVSGDITKETCDVIINSSNQNFTLKSGVSKAI 930 >UniRef50_UPI0000F3214F Cluster: UPI0000F3214F related cluster; n=1; Bos taurus|Rep: UPI0000F3214F UniRef100 entry - Bos Taurus Length = 166 Score = 33.9 bits (74), Expect = 3.1 Identities = 14/18 (77%), Positives = 15/18 (83%) Frame = +3 Query: 558 ANPSLMGSGGVDGAIHRA 611 AN SL+G GGVDG IHRA Sbjct: 89 ANASLLGGGGVDGCIHRA 106 >UniRef50_Q6NRC6 Cluster: MGC83934 protein; n=2; Xenopus|Rep: MGC83934 protein - Xenopus laevis (African clawed frog) Length = 914 Score = 33.9 bits (74), Expect = 3.1 Identities = 16/34 (47%), Positives = 23/34 (67%) Frame = +3 Query: 513 QGDITKLAVDVIVNAANPSLMGSGGVDGAIHRAG 614 +GD+T+ VD +VNAAN L GG+ A+ +AG Sbjct: 86 KGDMTRQNVDAVVNAANEDLKHFGGLALALVKAG 119 >UniRef50_Q4SK43 Cluster: Chromosome 2 SCAF14570, whole genome shotgun sequence; n=2; Tetraodontidae|Rep: Chromosome 2 SCAF14570, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 418 Score = 33.9 bits (74), Expect = 3.1 Identities = 15/34 (44%), Positives = 21/34 (61%) Frame = +3 Query: 513 QGDITKLAVDVIVNAANPSLMGSGGVDGAIHRAG 614 + D+T VD +VNAAN L GG+ A+ +AG Sbjct: 61 KADLTNFPVDAVVNAANERLQHVGGIALALSKAG 94 >UniRef50_Q5PDJ6 Cluster: Fimbrial subunit; n=4; Salmonella|Rep: Fimbrial subunit - Salmonella paratyphi-a Length = 180 Score = 33.9 bits (74), Expect = 3.1 Identities = 13/25 (52%), Positives = 18/25 (72%) Frame = +1 Query: 154 STVQGGVIHFYGQIVEPACDVSTQS 228 +TV GG ++F GQ+V+ AC VS S Sbjct: 27 TTVTGGTVNFVGQVVDAACSVSADS 51 >UniRef50_A5W6W0 Cluster: Putative uncharacterized protein precursor; n=1; Pseudomonas putida F1|Rep: Putative uncharacterized protein precursor - Pseudomonas putida F1 Length = 114 Score = 33.9 bits (74), Expect = 3.1 Identities = 16/40 (40%), Positives = 24/40 (60%), Gaps = 1/40 (2%) Frame 
= +1 Query: 142 AIAGSTVQGGVIHFYGQIVEPACDVSTQSSPVEM-NCPQM 258 A AG+ V G++ F G IVEP+C + S+ M +CP + Sbjct: 20 AQAGAVVASGMLQFTGSIVEPSCTTTVGSAGWRMDDCPAL 59 >UniRef50_A0IIY3 Cluster: Putative uncharacterized protein precursor; n=1; Serratia proteamaculans 568|Rep: Putative uncharacterized protein precursor - Serratia proteamaculans 568 Length = 102 Score = 33.9 bits (74), Expect = 3.1 Identities = 15/46 (32%), Positives = 22/46 (47%) Frame = +1 Query: 112 MLGSLFFPFIAIAGSTVQGGVIHFYGQIVEPACDVSTQSSPVEMNC 249 ++ + F + A + + GGVI F G IVE C V+ S C Sbjct: 8 LIAGMLFSWTAASYAGTTGGVIRFVGSIVESPCTVNIADSTANTQC 53 >UniRef50_P55223 Cluster: Fimbrial subunit type 1 precursor; n=37; Enterobacteriaceae|Rep: Fimbrial subunit type 1 precursor - Salmonella typhimurium Length = 185 Score = 33.9 bits (74), Expect = 3.1 Identities = 19/53 (35%), Positives = 28/53 (52%), Gaps = 4/53 (7%) Frame = +1 Query: 85 MFRPFLNSLMLGSLFFPFIAIAGS----TVQGGVIHFYGQIVEPACDVSTQSS 231 M + S + +F A+A +V GG IHF G++V AC VST+S+ Sbjct: 1 MRHKLMTSTIASLMFVAAAAVAADPTPVSVVGGTIHFEGKLVNAACAVSTKSA 53 >UniRef50_Q8X3Q9 Cluster: Putative IS encoded protein; n=2; Escherichia coli O157:H7|Rep: Putative IS encoded protein - Escherichia coli O157:H7 Length = 117 Score = 33.5 bits (73), Expect = 4.1 Identities = 14/17 (82%), Positives = 15/17 (88%) Frame = +2 Query: 2 YLSDIGCLEIQGASLTL 52 YLSD GCLEIQGASL + Sbjct: 31 YLSDTGCLEIQGASLVI 47 >UniRef50_O28751 Cluster: UPF0189 protein AF_1521; n=25; Euryarchaeota|Rep: UPF0189 protein AF_1521 - Archaeoglobus fulgidus Length = 192 Score = 33.5 bits (73), Expect = 4.1 Identities = 18/33 (54%), Positives = 20/33 (60%) Frame = +3 Query: 513 QGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611 QGDIT+ IVNAAN L GGV AI +A Sbjct: 18 QGDITQYPAKAIVNAANKRLEHGGGVAYAIAKA 50 >UniRef50_UPI0000F2CC13 Cluster: PREDICTED: similar to B aggressive lymphoma long; n=1; Monodelphis domestica|Rep: PREDICTED: similar to B aggressive lymphoma long - Monodelphis domestica Length = 1624 Score = 33.1 bits (72), Expect = 5.4 Identities = 
15/32 (46%), Positives = 22/32 (68%)
 Frame = +3

Query: 519 DITKLAVDVIVNAANPSLMGSGGVDGAIHRAG 614
           D+T+   D +VNAAN  L+ +GG+  A+ RAG
Sbjct: 107 DLTRHPADAVVNAANERLLHAGGLALALVRAG 138

>UniRef50_UPI0000E48437 Cluster: PREDICTED: similar to slowpoke binding
           protein; n=2; Strongylocentrotus purpuratus|Rep: PREDICTED:
           similar to slowpoke binding protein - Strongylocentrotus
           purpuratus
          Length = 687

 Score = 33.1 bits (72), Expect = 5.4
 Identities = 13/46 (28%), Positives = 24/46 (52%)
 Frame = +3

Query: 177 PFLWPNCGTGM*RQHPVITRRNELPTNGSIPGKTYSSKALMSGNVK 314
           P++WP     +  +H V+    E  +NGS+    Y++KA +  + K
Sbjct: 151 PYIWPTADIDLWEEHEVVIVTQEFNSNGSLKDYIYNAKAKLDWSNK 196

>UniRef50_UPI0000E46337 Cluster: PREDICTED: similar to TRAAK; n=2;
           Strongylocentrotus purpuratus|Rep: PREDICTED: similar to
           TRAAK - Strongylocentrotus purpuratus
          Length = 260

 Score = 33.1 bits (72), Expect = 5.4
 Identities = 13/31 (41%), Positives = 17/31 (54%)
 Frame = +1

Query: 118 GSLFFPFIAIAGSTVQGGVIHFYGQIVEPAC 210
           G  F  F A+ G  + GG+IH  GQI+   C
Sbjct: 137 GEAFCMFYAVIGIAITGGIIHSVGQILHATC 167

>UniRef50_Q6ZED8 Cluster: Slr7060 protein; n=1; Synechocystis sp. PCC
           6803|Rep: Slr7060 protein - Synechocystis sp. (strain PCC
           6803)
          Length = 588

 Score = 33.1 bits (72), Expect = 5.4
 Identities = 15/32 (46%), Positives = 21/32 (65%)
 Frame = +3

Query: 516 GDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611
           GDITK   + IVN+ + +L  SG +  AIH+A
Sbjct: 425 GDITKEKAEAIVNSTDRNLSNSGALSRAIHQA 456

>UniRef50_Q38Y93 Cluster: Hypothetical cell surface protein; n=1;
           Lactobacillus sakei subsp. sakei 23K|Rep: Hypothetical cell
           surface protein - Lactobacillus sakei subsp. sakei (strain
           23K)
          Length = 1987

 Score = 33.1 bits (72), Expect = 5.4
 Identities = 18/49 (36%), Positives = 28/49 (57%), Gaps = 2/49 (4%)
 Frame = +3

Query: 243  ELPTNGSIPGKTYSSK--ALMSGNVKNAQIASVKVQYLDKQKKLAVMNI 383
            ELPT+  + G+TY++K  A    N+K +Q ASV V   ++   L   N+
Sbjct: 1505 ELPTDALVAGQTYTAKVVATNGANMKESQSASVTVSEKEETDPLVDYNV 1553

>UniRef50_UPI00006A1CA6 Cluster: poly (ADP-ribose) polymerase family,
           member 14; n=12; Xenopus tropicalis|Rep: poly (ADP-ribose)
           polymerase family, member 14 - Xenopus tropicalis
          Length = 1527

 Score = 32.7 bits (71), Expect = 7.1
 Identities = 18/37 (48%), Positives = 21/37 (56%)
 Frame = +3

Query: 501 YSCVQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611
           Y    GDITK + DVIVN++N S     GV  AI  A
Sbjct: 948 YQVRTGDITKESTDVIVNSSNSSFTQKIGVSKAILEA 984

>UniRef50_Q327R1 Cluster: Putative uncharacterized protein; n=1;
           Shigella dysenteriae Sd197|Rep: Putative uncharacterized
           protein - Shigella dysenteriae serotype 1 (strain Sd197)
          Length = 70

 Score = 32.7 bits (71), Expect = 7.1
 Identities = 14/15 (93%), Positives = 14/15 (93%)
 Frame = +2

Query: 2  YLSDIGCLEIQGASL 46
          YLSD GCLEIQGASL
Sbjct: 13 YLSDTGCLEIQGASL 27

>UniRef50_Q7WTH2 Cluster: Putative uncharacterized protein; n=1;
           Escherichia coli|Rep: Putative uncharacterized protein -
           Escherichia coli
          Length = 80

 Score = 32.7 bits (71), Expect = 7.1
 Identities = 14/15 (93%), Positives = 14/15 (93%)
 Frame = +2

Query: 2  YLSDIGCLEIQGASL 46
          YLSD GCLEIQGASL
Sbjct: 13 YLSDTGCLEIQGASL 27

>UniRef50_P67344 Cluster: UPF0189 protein SA0314; n=13;
           Staphylococcus|Rep: UPF0189 protein SA0314 - Staphylococcus
           aureus (strain N315)
          Length = 266

 Score = 32.7 bits (71), Expect = 7.1
 Identities = 14/22 (63%), Positives = 16/22 (72%)
 Frame = +3

Query: 513 QGDITKLAVDVIVNAANPSLMG 578
           QGDIT L +D IVNAAN   +G
Sbjct: 91  QGDITTLKIDAIVNAANSRFLG 112

>UniRef50_Q4V2H0 Cluster: Putative uncharacterized protein; n=13;
           Burkholderia|Rep: Putative uncharacterized protein -
           Burkholderia mallei (Pseudomonas mallei)
          Length = 111

 Score = 32.3 bits (70), Expect = 9.4
 Identities = 19/47 (40%), Positives = 26/47 (55%), Gaps = 2/47 (4%)
 Frame = +1

Query: 85 MFRPFL--NSLMLGSLFFPFIAIAGSTVQGGVIHFYGQIVEPACDVS 219
          M RP L  +SL+  S+F   ++ A    Q G++ F G IVEP C  S
Sbjct: 1  MQRPSLIASSLLTASMFA--LSFAAHAQQTGIVRFTGMIVEPPCSFS 45

>UniRef50_A6PBP5 Cluster: Appr-1-p processing domain protein; n=1;
           Shewanella sediminis HAW-EB3|Rep: Appr-1-p processing domain
           protein - Shewanella sediminis HAW-EB3
          Length = 293

 Score = 32.3 bits (70), Expect = 9.4
 Identities = 14/21 (66%), Positives = 17/21 (80%)
 Frame = +3

Query: 516 GDITKLAVDVIVNAANPSLMG 578
           GDIT+L VD I+NAAN  L+G
Sbjct: 117 GDITQLKVDAIINAANVYLLG 137

>UniRef50_A1HMQ5 Cluster: Appr-1-p processing domain protein; n=4;
           Clostridiales|Rep: Appr-1-p processing domain protein -
           Thermosinus carboxydivorans Nor1
          Length = 264

 Score = 32.3 bits (70), Expect = 9.4
 Identities = 16/30 (53%), Positives = 18/30 (60%)
 Frame = +3

Query: 513 QGDITKLAVDVIVNAANPSLMGSGGVDGAI 602
           QGDIT+   D IVN AN  L+  GG   AI
Sbjct: 92  QGDITEETTDAIVNPANSRLVHGGGAARAI 121

>UniRef50_Q0CEI7 Cluster: Putative uncharacterized protein; n=1;
           Aspergillus terreus NIH2624|Rep: Putative uncharacterized
           protein - Aspergillus terreus (strain NIH 2624)
          Length = 524

 Score = 32.3 bits (70), Expect = 9.4
 Identities = 17/36 (47%), Positives = 20/36 (55%)
 Frame = +3

Query: 504 SCVQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRA 611
           S    DIT L VD IV   +    G GG+DGA+H A
Sbjct: 320 SLAHTDITTLEVDCIVTGISEP-RGQGGLDGAVHAA 354

>UniRef50_O67112 Cluster: UPF0189 protein aq_987; n=3; cellular
           organisms|Rep: UPF0189 protein aq_987 - Aquifex aeolicus
          Length = 165

 Score = 32.3 bits (70), Expect = 9.4
 Identities = 17/35 (48%), Positives = 21/35 (60%)
 Frame = +3

Query: 510 VQGDITKLAVDVIVNAANPSLMGSGGVDGAIHRAG 614
           V+G IT++  DVIVN AN   +  GGV   I R G
Sbjct: 6   VKGSITEVDADVIVNPANSRGLMGGGVAVVIKRLG 40

  Database: uniref50
    Posted date:  Oct 5, 2007 11:19 AM
  Number of letters in database: 575,637,011
  Number of sequences in database:  1,657,284

Lambda     K      H
   0.318    0.134    0.401

Gapped
Lambda     K      H
   0.279   0.0580    0.190

Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 621,798,236
Number of Sequences: 1657284
Number of extensions: 12470343
Number of successful extensions: 28637
Number of sequences better than 10.0: 135
Number of HSP's better than 10.0 without gapping: 27755
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 28630
length of database: 575,637,011
effective HSP length: 97
effective length of database: 414,880,463
effective search space used: 44392209541
frameshift window, decay const: 40, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)