BLASTX 2.2.12 [Aug-07-2005] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= fmgV10e16r (756 letters) Database: uniref50 1,657,284 sequences; 575,637,011 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value UniRef50_Q179J9 Cluster: Mitochondrial ATP synthase b chain; n=3... 316 4e-85 UniRef50_Q94516 Cluster: ATP synthase B chain, mitochondrial pre... 302 5e-81 UniRef50_Q5XUB3 Cluster: Putative ATP synthase-like protein; n=1... 233 5e-60 UniRef50_Q0PXW9 Cluster: Putative ATP synthase-like protein; n=1... 226 5e-58 UniRef50_UPI0000517B84 Cluster: PREDICTED: similar to ATP syntha... 225 9e-58 UniRef50_UPI0000585FFD Cluster: PREDICTED: similar to ATP syntha... 217 2e-55 UniRef50_P24539 Cluster: ATP synthase B chain, mitochondrial pre... 184 3e-45 UniRef50_Q5DI09 Cluster: SJCHGC09031 protein; n=1; Schistosoma j... 153 3e-36 UniRef50_A7RXX3 Cluster: Predicted protein; n=1; Nematostella ve... 131 2e-29 UniRef50_UPI0000E24DC6 Cluster: PREDICTED: similar to ATP syntha... 109 1e-22 UniRef50_UPI0000DD7E8D Cluster: PREDICTED: similar to ATP syntha... 89 1e-16 UniRef50_Q19126 Cluster: Atp synthase b homolog protein 2; n=4; ... 85 2e-15 UniRef50_Q6AWE2 Cluster: AT16129p; n=3; Drosophila melanogaster|... 57 5e-07 UniRef50_Q870C4 Cluster: ATP synthase subunit 4, mitochondrial p... 56 9e-07 UniRef50_Q5KL26 Cluster: ATP synthase, putative; n=1; Filobasidi... 52 2e-05 UniRef50_A3PHG2 Cluster: C-5 cytosine-specific DNA methylase; n=... 49 1e-04 UniRef50_P08123 Cluster: Collagen alpha-2(I) chain precursor; n=... 45 0.002 UniRef50_UPI0000E48567 Cluster: PREDICTED: hypothetical protein;... 43 0.007 UniRef50_UPI0000F30A81 Cluster: UPI0000F30A81 related cluster; n... 43 0.009 UniRef50_Q72KK1 Cluster: Prephenate dehydrogenase; n=1; Thermus ... 42 0.016 UniRef50_Q095Q3 Cluster: Putative uncharacterized protein; n=1; ... 42 0.016 UniRef50_A6GFZ9 Cluster: Serine/threonine kinase PKN8; n=1; Ples... 42 0.016 UniRef50_Q9N3D7 Cluster: Collagen protein 48; n=3; Chromadorea|R... 42 0.016 UniRef50_Q4RFZ6 Cluster: Chromosome undetermined SCAF15108, whol... 42 0.022 UniRef50_Q5C1A3 Cluster: SJCHGC09249 protein; n=1; Schistosoma j... 42 0.022 UniRef50_Q4PDX4 Cluster: Putative uncharacterized protein; n=1; ... 42 0.022 UniRef50_O93419 Cluster: Collagen XVIII precursor; n=3; Gallus g... 41 0.029 UniRef50_A5NYC5 Cluster: Putative uncharacterized protein; n=1; ... 41 0.038 UniRef50_Q4P3N6 Cluster: Putative uncharacterized protein; n=1; ... 41 0.038 UniRef50_Q9UQ35 Cluster: Serine/arginine repetitive matrix prote... 40 0.050 UniRef50_Q4SRH5 Cluster: L-lactate dehydrogenase; n=4; Euteleost... 40 0.066 UniRef50_Q4IYP6 Cluster: Putative uncharacterized protein; n=1; ... 40 0.066 UniRef50_Q22GI2 Cluster: UBX domain containing protein; n=1; Tet... 40 0.066 UniRef50_Q17A79 Cluster: Collagen alpha chain, anopheles; n=7; C... 40 0.066 UniRef50_UPI0000E80F2F Cluster: PREDICTED: hypothetical protein;... 40 0.088 UniRef50_A1G8K0 Cluster: Acyl-CoA dehydrogenase-like; n=2; Salin... 40 0.088 UniRef50_Q2VLH1 Cluster: Major ampullate spidroin 2; n=8; Aranei... 40 0.088 UniRef50_UPI000065EA11 Cluster: Collagen alpha-1(XV) chain precu... 39 0.12 UniRef50_Q73UH7 Cluster: Putative uncharacterized protein; n=2; ... 39 0.12 UniRef50_Q0SBU7 Cluster: Glycine rich protein; n=1; Rhodococcus ... 39 0.12 UniRef50_A5NZ47 Cluster: LigA; n=1; Methylobacterium sp. 4-46|Re... 39 0.12 UniRef50_A3C636 Cluster: Putative uncharacterized protein; n=3; ... 39 0.12 UniRef50_P90679 Cluster: Fibrillar collagen; n=2; Annelida/Echiu... 39 0.12 UniRef50_O97406 Cluster: Collagen pro alpha-chain precursor; n=1... 39 0.12 UniRef50_UPI0000DD81AD Cluster: PREDICTED: hypothetical protein;... 39 0.15 UniRef50_Q7UJU9 Cluster: Putative uncharacterized protein; n=1; ... 39 0.15 UniRef50_Q2I6N3 Cluster: Uncharacterized Gly-rich protein; n=3; ... 39 0.15 UniRef50_A5NR62 Cluster: Putative uncharacterized protein; n=1; ... 39 0.15 UniRef50_Q6EQL3 Cluster: Putative uncharacterized protein OSJNBa... 39 0.15 UniRef50_UPI00015B49AB Cluster: PREDICTED: similar to alpha-5 ty... 38 0.20 UniRef50_UPI0000F2D61A Cluster: PREDICTED: hypothetical protein;... 38 0.20 UniRef50_UPI0000EBDE87 Cluster: PREDICTED: hypothetical protein;... 38 0.20 UniRef50_UPI00004D1B58 Cluster: UPI00004D1B58 related cluster; n... 38 0.20 UniRef50_Q4RVK5 Cluster: Chromosome 15 SCAF14992, whole genome s... 38 0.20 UniRef50_Q4RTK0 Cluster: Chromosome 2 SCAF14997, whole genome sh... 38 0.20 UniRef50_Q9L060 Cluster: Putative uncharacterized protein SCO297... 38 0.20 UniRef50_Q5UBV9 Cluster: Resuscitation promoting factor; n=1; My... 38 0.20 UniRef50_A7DAS9 Cluster: Putative uncharacterized protein; n=1; ... 38 0.20 UniRef50_A5P2U3 Cluster: Putative PAS/PAC sensor protein; n=6; P... 38 0.20 UniRef50_Q60AW0 Cluster: Putative uncharacterized protein; n=1; ... 38 0.27 UniRef50_Q3WGB8 Cluster: Putative uncharacterized protein; n=8; ... 38 0.27 UniRef50_Q0RAQ2 Cluster: Putative uncharacterized protein; n=1; ... 38 0.27 UniRef50_A6W7I0 Cluster: Putative uncharacterized protein; n=1; ... 38 0.27 UniRef50_A5NM96 Cluster: Putative uncharacterized protein; n=1; ... 38 0.27 UniRef50_A3P9K7 Cluster: DNA ligase, ATP-dependent; n=12; Proteo... 38 0.27 UniRef50_A0QXB8 Cluster: Putative uncharacterized protein; n=1; ... 38 0.27 UniRef50_Q8MW55 Cluster: Precollagen-NG; n=2; Mytilus|Rep: Preco... 38 0.27 UniRef50_A7S046 Cluster: Predicted protein; n=1; Nematostella ve... 38 0.27 UniRef50_Q8U4L2 Cluster: Putative uncharacterized protein PF0070... 38 0.27 UniRef50_Q14050 Cluster: Collagen alpha-3(IX) chain precursor; n... 38 0.27 UniRef50_P20908 Cluster: Collagen alpha-1(V) chain precursor; n=... 38 0.27 UniRef50_UPI000065EAC0 Cluster: Homolog of Homo sapiens "PREDICT... 38 0.35 UniRef50_UPI000065E422 Cluster: Collagen alpha-1(XI) chain precu... 38 0.35 UniRef50_Q4SB89 Cluster: Chromosome undetermined SCAF14676, whol... 38 0.35 UniRef50_A1BM62 Cluster: Latency associated nuclear antigen (LAN... 38 0.35 UniRef50_Q82F52 Cluster: Putative GntR-family transcriptional re... 38 0.35 UniRef50_Q2J869 Cluster: Tetratricopeptide TPR_2; n=1; Frankia s... 38 0.35 UniRef50_Q5C2Y9 Cluster: SJCHGC09378 protein; n=2; Platyhelminth... 38 0.35 UniRef50_A1XVT1 Cluster: Fibrillar collagen precursor; n=1; Hydr... 38 0.35 UniRef50_P46804 Cluster: Spidroin-2; n=17; Orbiculariae|Rep: Spi... 38 0.35 UniRef50_UPI0000EBD1F0 Cluster: PREDICTED: hypothetical protein;... 37 0.47 UniRef50_UPI0000E7F798 Cluster: PREDICTED: hypothetical protein;... 37 0.47 UniRef50_Q2VIS4 Cluster: Filaggrin 2; n=3; Mus musculus|Rep: Fil... 37 0.47 UniRef50_Q3JU34 Cluster: Putative uncharacterized protein; n=5; ... 37 0.47 UniRef50_Q2W370 Cluster: Putative uncharacterized protein; n=3; ... 37 0.47 UniRef50_Q1QHE7 Cluster: OmpA/MotB precursor; n=2; Nitrobacter|R... 37 0.47 UniRef50_A1G361 Cluster: Putative uncharacterized protein; n=1; ... 37 0.47 UniRef50_Q0JLS5 Cluster: Os01g0575200 protein; n=1; Oryza sativa... 37 0.47 UniRef50_Q2H3Y0 Cluster: Putative uncharacterized protein; n=1; ... 37 0.47 UniRef50_P18503 Cluster: Short-chain collagen C4; n=2; Ephydatia... 37 0.47 UniRef50_UPI000155BCDD Cluster: PREDICTED: similar to Elongation... 37 0.62 UniRef50_UPI0000F2E1B1 Cluster: PREDICTED: hypothetical protein;... 37 0.62 UniRef50_UPI0000F2C810 Cluster: PREDICTED: hypothetical protein;... 37 0.62 UniRef50_UPI0000DA3A8A Cluster: PREDICTED: hypothetical protein;... 37 0.62 UniRef50_UPI000059FC02 Cluster: PREDICTED: hypothetical protein ... 37 0.62 UniRef50_UPI00003932A2 Cluster: hypothetical protein Blon0300011... 37 0.62 UniRef50_UPI000069E795 Cluster: UPI000069E795 related cluster; n... 37 0.62 UniRef50_Q4SPM7 Cluster: Chromosome 16 SCAF14537, whole genome s... 37 0.62 UniRef50_Q4S1P4 Cluster: Chromosome 6 SCAF14768, whole genome sh... 37 0.62 UniRef50_Q3JM63 Cluster: Peptide synthetase NRPS5-4-3; n=16; Bur... 37 0.62 UniRef50_A7HI44 Cluster: LigA; n=1; Anaeromyxobacter sp. Fw109-5... 37 0.62 UniRef50_A5P034 Cluster: Putative uncharacterized protein precur... 37 0.62 UniRef50_A5NQT8 Cluster: LigA; n=1; Methylobacterium sp. 4-46|Re... 37 0.62 UniRef50_A4VVK3 Cluster: ATP synthase B chain; n=3; Streptococcu... 37 0.62 UniRef50_Q0ISC2 Cluster: Os11g0538400 protein; n=1; Oryza sativa... 37 0.62 UniRef50_Q171W5 Cluster: Lava lamp protein; n=2; Culicidae|Rep: ... 37 0.62 UniRef50_P20849 Cluster: Collagen alpha-1(IX) chain precursor; n... 37 0.62 UniRef50_UPI0000F201FC Cluster: PREDICTED: similar to collagen, ... 36 0.82 UniRef50_UPI0000DD84BF Cluster: PREDICTED: hypothetical protein;... 36 0.82 UniRef50_UPI00005C000E Cluster: PREDICTED: similar to Apolipopro... 36 0.82 UniRef50_Q4SNW2 Cluster: Chromosome 15 SCAF14542, whole genome s... 36 0.82 UniRef50_Q4RWT3 Cluster: Chromosome 15 SCAF14981, whole genome s... 36 0.82 UniRef50_Q3JTD9 Cluster: Putative uncharacterized protein; n=1; ... 36 0.82 UniRef50_A5P3S2 Cluster: Putative uncharacterized protein; n=1; ... 36 0.82 UniRef50_A5P062 Cluster: Putative uncharacterized protein; n=2; ... 36 0.82 UniRef50_A5NX95 Cluster: Putative uncharacterized protein; n=1; ... 36 0.82 UniRef50_A5NVB2 Cluster: LigA; n=1; Methylobacterium sp. 4-46|Re... 36 0.82 UniRef50_A2W4K4 Cluster: Putative uncharacterized protein; n=5; ... 36 0.82 UniRef50_A0VWL6 Cluster: NAD-dependent epimerase/dehydratase; n=... 36 0.82 UniRef50_A0V8U7 Cluster: Acyl-CoA dehydrogenase, type 2-like; n=... 36 0.82 UniRef50_Q26640 Cluster: Alpha2(IV)-like collagen; n=4; Strongyl... 36 0.82 UniRef50_A6YIY0 Cluster: Major ampullate spidroin 2; n=3; Latrod... 36 0.82 UniRef50_P53420 Cluster: Collagen alpha-4(IV) chain precursor; n... 36 0.82 UniRef50_UPI0001555BF3 Cluster: PREDICTED: similar to Thy-1 prot... 36 1.1 UniRef50_UPI0000DD8409 Cluster: PREDICTED: hypothetical protein;... 36 1.1 UniRef50_UPI0000D9F288 Cluster: PREDICTED: hypothetical protein;... 36 1.1 UniRef50_UPI00015A783F Cluster: LOC553362 protein; n=1; Danio re... 36 1.1 UniRef50_UPI000069F5B9 Cluster: alpha 1 type XIII collagen isofo... 36 1.1 UniRef50_Q4SZ70 Cluster: Chromosome undetermined SCAF11805, whol... 36 1.1 UniRef50_Q4SZ69 Cluster: Chromosome undetermined SCAF11805, whol... 36 1.1 UniRef50_Q4RX03 Cluster: Chromosome 11 SCAF14979, whole genome s... 36 1.1 UniRef50_Q1LYN9 Cluster: Novel protein similar to vertebrate col... 36 1.1 UniRef50_Q6I7K4 Cluster: Orf663 protein; n=3; Proteobacteria|Rep... 36 1.1 UniRef50_Q4IVL7 Cluster: Putative uncharacterized protein precur... 36 1.1 UniRef50_A5P2Z2 Cluster: Integral membrane protein-like protein;... 36 1.1 UniRef50_A5P281 Cluster: LigA; n=1; Methylobacterium sp. 4-46|Re... 36 1.1 UniRef50_A0VWK8 Cluster: Putative uncharacterized protein; n=1; ... 36 1.1 UniRef50_A0VBD2 Cluster: Putative uncharacterized protein precur... 36 1.1 UniRef50_Q2QMM2 Cluster: Retrotransposon protein, putative, uncl... 36 1.1 UniRef50_A7E3J6 Cluster: Putative DUX4 protein; n=1; Procavia ca... 36 1.1 UniRef50_Q1JTJ3 Cluster: SET-domain protein, putative; n=1; Toxo... 36 1.1 UniRef50_Q19050 Cluster: Putative uncharacterized protein col-18... 36 1.1 UniRef50_A7S288 Cluster: Predicted protein; n=1; Nematostella ve... 36 1.1 UniRef50_Q4P641 Cluster: Putative uncharacterized protein; n=1; ... 36 1.1 UniRef50_P29143 Cluster: Halolysin precursor; n=5; Halobacterial... 36 1.1 UniRef50_P25067 Cluster: Collagen alpha-2(VIII) chain precursor;... 36 1.1 UniRef50_UPI000155DCFE Cluster: PREDICTED: hypothetical protein;... 36 1.4 UniRef50_UPI0000F2EAE8 Cluster: PREDICTED: hypothetical protein,... 36 1.4 UniRef50_UPI0000E813B5 Cluster: PREDICTED: hypothetical protein;... 36 1.4 UniRef50_UPI0000DD78A3 Cluster: PREDICTED: hypothetical protein;... 36 1.4 UniRef50_UPI00015A592A Cluster: Type IV collagen alpha 4 chain; ... 36 1.4 UniRef50_UPI000065FCBB Cluster: Homolog of Oncorhynchus mykiss "... 36 1.4 UniRef50_UPI0000ECA83C Cluster: Centrosome-associated protein CE... 36 1.4 UniRef50_Q4S5M8 Cluster: Chromosome 9 SCAF14729, whole genome sh... 36 1.4 UniRef50_Q9RX57 Cluster: Putative uncharacterized protein; n=1; ... 36 1.4 UniRef50_Q5PIF1 Cluster: Subunit S of type I restriction-modific... 36 1.4 UniRef50_Q2IFX3 Cluster: Putative uncharacterized protein precur... 36 1.4 UniRef50_Q9KXB9 Cluster: Tail fiber protein; n=7; root|Rep: Tail... 36 1.4 UniRef50_A7H8S3 Cluster: Putative uncharacterized protein precur... 36 1.4 UniRef50_A7DG98 Cluster: Putative uncharacterized protein; n=2; ... 36 1.4 UniRef50_A5P2L0 Cluster: Putative uncharacterized protein; n=3; ... 36 1.4 UniRef50_A5NQE3 Cluster: LigA; n=1; Methylobacterium sp. 4-46|Re... 36 1.4 UniRef50_A5NPK7 Cluster: Putative uncharacterized protein; n=3; ... 36 1.4 UniRef50_A5NMK3 Cluster: LigA; n=1; Methylobacterium sp. 4-46|Re... 36 1.4 UniRef50_Q6YW72 Cluster: Pr1-like protein; n=4; Oryza sativa (ja... 36 1.4 UniRef50_A7NUN9 Cluster: Chromosome chr18 scaffold_1, whole geno... 36 1.4 UniRef50_O45114 Cluster: Collagen protein 103; n=3; cellular org... 36 1.4 UniRef50_Q6ZQQ4 Cluster: CDNA FLJ46309 fis, clone TESTI4039744; ... 36 1.4 UniRef50_P33485 Cluster: Probable nuclear antigen; n=5; root|Rep... 36 1.4 UniRef50_Q02241 Cluster: Kinesin-like protein KIF23; n=34; Eumet... 36 1.4 UniRef50_P12107 Cluster: Collagen alpha-1(XI) chain precursor; n... 36 1.4 UniRef50_Q02388 Cluster: Collagen alpha-1(VII) chain precursor; ... 36 1.4 UniRef50_UPI000155CDC9 Cluster: PREDICTED: hypothetical protein;... 35 1.9 UniRef50_UPI0000F2DA9D Cluster: PREDICTED: hypothetical protein;... 35 1.9 UniRef50_UPI0000EBD3CA Cluster: PREDICTED: hypothetical protein;... 35 1.9 UniRef50_UPI0000E7F95D Cluster: PREDICTED: similar to MGC86401 p... 35 1.9 UniRef50_UPI0000E21BFC Cluster: PREDICTED: hypothetical protein;... 35 1.9 UniRef50_UPI0000DD85F5 Cluster: PREDICTED: hypothetical protein;... 35 1.9 UniRef50_UPI00006A1B4A Cluster: Collagen alpha-3(VI) chain precu... 35 1.9 UniRef50_UPI0000ECB838 Cluster: Hypothetical protein; n=1; Gallu... 35 1.9 UniRef50_Q6TEP5 Cluster: Hyaluronan-mediated motility receptor; ... 35 1.9 UniRef50_Q4TC25 Cluster: Chromosome undetermined SCAF7059, whole... 35 1.9 UniRef50_Q4S480 Cluster: Chromosome undetermined SCAF14743, whol... 35 1.9 UniRef50_Q9AD79 Cluster: Putative membrane protein; n=1; Strepto... 35 1.9 UniRef50_Q2INF4 Cluster: Putative uncharacterized protein; n=1; ... 35 1.9 UniRef50_Q3WG32 Cluster: Putative uncharacterized protein; n=1; ... 35 1.9 UniRef50_Q11AX3 Cluster: Cytochrome c, class I; n=3; Rhizobiales... 35 1.9 UniRef50_Q08UF8 Cluster: Tetratricopeptide repeat domain protein... 35 1.9 UniRef50_A5P378 Cluster: Putative uncharacterized protein; n=3; ... 35 1.9 UniRef50_A0QX11 Cluster: Putative uncharacterized protein; n=1; ... 35 1.9 UniRef50_Q6ZAE0 Cluster: Putative uncharacterized protein P0410E... 35 1.9 UniRef50_Q9VMV5 Cluster: CG16858-PA; n=6; Schizophora|Rep: CG168... 35 1.9 UniRef50_Q66S51 Cluster: Collagen repeat-containing protein; n=1... 35 1.9 UniRef50_Q07265 Cluster: 3 alpha procollagen; n=4; Strongylocent... 35 1.9 UniRef50_A7RPE2 Cluster: Predicted protein; n=1; Nematostella ve... 35 1.9 UniRef50_A4H5G1 Cluster: Putative uncharacterized protein; n=1; ... 35 1.9 UniRef50_A0CHT2 Cluster: Chromosome undetermined scaffold_184, w... 35 1.9 UniRef50_A6NF26 Cluster: Uncharacterized protein COL27A1; n=28; ... 35 1.9 UniRef50_Q750X0 Cluster: AGL181Cp; n=1; Eremothecium gossypii|Re... 35 1.9 UniRef50_UPI0001555CB8 Cluster: PREDICTED: similar to T-box 1; n... 35 2.5 UniRef50_UPI0000F2146D Cluster: PREDICTED: similar to alpha-1 ty... 35 2.5 UniRef50_UPI0000EBD4BD Cluster: PREDICTED: similar to alpha-3 ty... 35 2.5 UniRef50_UPI0000E801E7 Cluster: PREDICTED: similar to alpha 1 ty... 35 2.5 UniRef50_UPI0000E1F200 Cluster: PREDICTED: hypothetical protein;... 35 2.5 UniRef50_UPI00006D930E Cluster: hypothetical protein Paer2_01003... 35 2.5 UniRef50_UPI00006A0C93 Cluster: Collagen alpha-1(XIX) chain prec... 35 2.5 UniRef50_UPI000069E9BC Cluster: UPI000069E9BC related cluster; n... 35 2.5 UniRef50_UPI000065F78D Cluster: Homolog of Homo sapiens "Collage... 35 2.5 UniRef50_Q4TFV5 Cluster: Chromosome undetermined SCAF4174, whole... 35 2.5 UniRef50_Q4SWY6 Cluster: Chromosome undetermined SCAF13320, whol... 35 2.5 UniRef50_Q5GAF3 Cluster: Putative uncharacterized protein; n=2; ... 35 2.5 UniRef50_Q2IMJ3 Cluster: LigA; n=4; cellular organisms|Rep: LigA... 35 2.5 UniRef50_Q2IMH4 Cluster: Fe-S oxidoreductase; n=1; Anaeromyxobac... 35 2.5 UniRef50_A7FBU7 Cluster: Putative uncharacterized protein; n=1; ... 35 2.5 UniRef50_Q0AYI8 Cluster: Translation initiation factor IF-2; n=1... 35 2.5 UniRef50_Q08NL8 Cluster: Putative uncharacterized protein; n=1; ... 35 2.5 UniRef50_A5NMX6 Cluster: Cytochrome B561; n=1; Methylobacterium ... 35 2.5 UniRef50_A5NLP4 Cluster: LigA; n=1; Methylobacterium sp. 4-46|Re... 35 2.5 UniRef50_A0VAK1 Cluster: Uncharacterized protein UPF0065 precurs... 35 2.5 UniRef50_Q8S842 Cluster: Putative uncharacterized protein OSJNBa... 35 2.5 UniRef50_Q5GAB4 Cluster: PHANTASTICA-like protein; n=1; Selagine... 35 2.5 UniRef50_Q7QYY1 Cluster: GLP_164_20758_21504; n=1; Giardia lambl... 35 2.5 UniRef50_Q5TV76 Cluster: ENSANGP00000028104; n=1; Anopheles gamb... 35 2.5 UniRef50_Q29FV7 Cluster: GA17072-PA; n=1; Drosophila pseudoobscu... 35 2.5 UniRef50_Q26634 Cluster: Alpha-1 collagen; n=4; Echinoida|Rep: A... 35 2.5 UniRef50_Q20778 Cluster: Dumpy : shorter than wild-type protein ... 35 2.5 UniRef50_Q20142 Cluster: Collagen protein 172, isoform a; n=4; N... 35 2.5 UniRef50_O44174 Cluster: Collagen protein 104; n=2; Caenorhabdit... 35 2.5 UniRef50_Q8NFW1 Cluster: Collagen alpha-1(XXII) chain; n=23; Eut... 35 2.5 UniRef50_Q7SAE9 Cluster: Putative uncharacterized protein NCU070... 35 2.5 UniRef50_Q96JG9 Cluster: Zinc finger protein 469; n=5; Eutheria|... 35 2.5 UniRef50_P31568 Cluster: Protein ycf2; n=1; Oenothera picensis|R... 35 2.5 UniRef50_UPI0000F1ED14 Cluster: PREDICTED: similar to autoantige... 34 3.3 UniRef50_UPI0000F1E5D4 Cluster: PREDICTED: similar to collagen, ... 34 3.3 UniRef50_UPI0000EBCFCF Cluster: PREDICTED: hypothetical protein;... 34 3.3 UniRef50_UPI0000EBC639 Cluster: PREDICTED: hypothetical protein;... 34 3.3 UniRef50_UPI0000DB7985 Cluster: PREDICTED: similar to Collagen t... 34 3.3 UniRef50_Q4T971 Cluster: Chromosome undetermined SCAF7635, whole... 34 3.3 UniRef50_Q4SHQ0 Cluster: Chromosome 5 SCAF14581, whole genome sh... 34 3.3 UniRef50_Q4RAQ5 Cluster: Chromosome undetermined SCAF23104, whol... 34 3.3 UniRef50_Q9A567 Cluster: Putative uncharacterized protein; n=1; ... 34 3.3 UniRef50_Q2S573 Cluster: SpoOJ protein; n=1; Salinibacter ruber ... 34 3.3 UniRef50_Q3W4Q1 Cluster: Protein kinase; n=1; Frankia sp. EAN1pe... 34 3.3 UniRef50_Q08VS0 Cluster: Putative uncharacterized protein; n=1; ... 34 3.3 UniRef50_A7HAA8 Cluster: Ribonuclease R; n=4; cellular organisms... 34 3.3 UniRef50_A7DM25 Cluster: FMN-binding domain protein; n=2; Methyl... 34 3.3 UniRef50_A5P662 Cluster: Tetratricopeptide TPR_2 repeat protein;... 34 3.3 UniRef50_A5NRS7 Cluster: LigA; n=1; Methylobacterium sp. 4-46|Re... 34 3.3 UniRef50_A4X268 Cluster: Putative uncharacterized protein; n=1; ... 34 3.3 UniRef50_A0U0Q4 Cluster: Polysaccharide biosynthesis protein pre... 34 3.3 UniRef50_Q8GSW8 Cluster: Pr1-like protein; n=5; Oryza sativa (ja... 34 3.3 UniRef50_Q5JKI0 Cluster: Epstein-Barr virus EBNA-1-like; n=6; Or... 34 3.3 UniRef50_Q0JGY9 Cluster: Os01g0895300 protein; n=1; Oryza sativa... 34 3.3 UniRef50_Q8WP20 Cluster: Putative uncharacterized protein; n=2; ... 34 3.3 UniRef50_Q8T5C7 Cluster: Erythrocyte binding protein 1; n=51; ce... 34 3.3 UniRef50_A7RG67 Cluster: Predicted protein; n=1; Nematostella ve... 34 3.3 UniRef50_A2FKS2 Cluster: Putative uncharacterized protein; n=1; ... 34 3.3 UniRef50_Q6FPM9 Cluster: Similarities with tr|Q12218 Saccharomyc... 34 3.3 UniRef50_Q2UK29 Cluster: Predicted protein; n=3; Trichocomaceae|... 34 3.3 UniRef50_A4QZG0 Cluster: Predicted protein; n=1; Magnaporthe gri... 34 3.3 UniRef50_Q9LD55 Cluster: Eukaryotic translation initiation facto... 34 3.3 UniRef50_Q8WXF8 Cluster: DNA-binding death effector domain-conta... 34 3.3 UniRef50_P12105 Cluster: Collagen alpha-1(III) chain precursor; ... 34 3.3 UniRef50_UPI00015532AC Cluster: PREDICTED: hypothetical protein;... 34 4.4 UniRef50_UPI0000EBDF9C Cluster: PREDICTED: hypothetical protein;... 34 4.4 UniRef50_UPI0000EBC3A2 Cluster: PREDICTED: hypothetical protein;... 34 4.4 UniRef50_UPI0000E49418 Cluster: PREDICTED: similar to ENSANGP000... 34 4.4 UniRef50_UPI0000E47B28 Cluster: PREDICTED: hypothetical protein;... 34 4.4 UniRef50_UPI0000D9F367 Cluster: PREDICTED: hypothetical protein;... 34 4.4 UniRef50_UPI0000D9C121 Cluster: PREDICTED: hypothetical protein;... 34 4.4 UniRef50_UPI00006D9F3F Cluster: COG4783: Putative Zn-dependent p... 34 4.4 UniRef50_UPI000065E567 Cluster: Homolog of Brachydanio rerio "Co... 34 4.4 UniRef50_UPI0000EB1F38 Cluster: UPI0000EB1F38 related cluster; n... 34 4.4 UniRef50_UPI0000EB16EB Cluster: UPI0000EB16EB related cluster; n... 34 4.4 UniRef50_Q4SFX9 Cluster: Chromosome 7 SCAF14601, whole genome sh... 34 4.4 UniRef50_Q8PLD5 Cluster: Putative uncharacterized protein XAC186... 34 4.4 UniRef50_Q7WP99 Cluster: Putative exported protein; n=1; Bordete... 34 4.4 UniRef50_Q4R0X3 Cluster: RacL protein; n=6; Streptomyces|Rep: Ra... 34 4.4 UniRef50_Q3WE76 Cluster: Similar to Uncharacterized protein with... 34 4.4 UniRef50_Q3WDT3 Cluster: Putative uncharacterized protein; n=1; ... 34 4.4 UniRef50_Q3W4J7 Cluster: Putative uncharacterized protein; n=1; ... 34 4.4 UniRef50_Q2AC94 Cluster: Putative uncharacterized protein; n=1; ... 34 4.4 UniRef50_Q0RI45 Cluster: Putative uncharacterized protein; n=1; ... 34 4.4 UniRef50_Q0FPK6 Cluster: Putative uncharacterized protein; n=2; ... 34 4.4 UniRef50_Q0AAS2 Cluster: Flagellar assembly protein FliH; n=1; A... 34 4.4 UniRef50_Q08TN9 Cluster: Putative uncharacterized protein; n=1; ... 34 4.4 UniRef50_A5NWU4 Cluster: Small GTP-binding protein; n=1; Methylo... 34 4.4 UniRef50_A5EMI3 Cluster: Putative uncharacterized protein; n=2; ... 34 4.4 UniRef50_A4FPN6 Cluster: PE-PGRS family protein; n=1; Saccharopo... 34 4.4 UniRef50_A3TY96 Cluster: Putative uncharacterized protein; n=1; ... 34 4.4 UniRef50_A1GES0 Cluster: Putative uncharacterized protein; n=1; ... 34 4.4 UniRef50_A0UFI5 Cluster: Phage P2 GpE family protein; n=5; Prote... 34 4.4 UniRef50_A0TSH0 Cluster: LigA; n=1; Burkholderia cenocepacia MC0... 34 4.4 UniRef50_A0TLI8 Cluster: Putative uncharacterized protein; n=1; ... 34 4.4 UniRef50_Q7XSA2 Cluster: OSJNBa0005N02.3 protein; n=16; Magnolio... 34 4.4 UniRef50_Q5Z547 Cluster: Plant disease resistance polyprotein-li... 34 4.4 UniRef50_Q2QTA8 Cluster: Retrotransposon protein, putative, uncl... 34 4.4 UniRef50_Q08J84 Cluster: Putative long tail fiber protein; n=2; ... 34 4.4 UniRef50_Q9XWR2 Cluster: Putative uncharacterized protein col-12... 34 4.4 UniRef50_Q7QWN8 Cluster: GLP_26_54603_52153; n=1; Giardia lambli... 34 4.4 UniRef50_Q54IK0 Cluster: Putative uncharacterized protein; n=1; ... 34 4.4 UniRef50_Q4QAK9 Cluster: Putative uncharacterized protein; n=3; ... 34 4.4 UniRef50_O16161 Cluster: Precollagen P precursor; n=6; Mytilus|R... 34 4.4 UniRef50_A7SQ58 Cluster: Predicted protein; n=1; Nematostella ve... 34 4.4 UniRef50_A7SHG3 Cluster: Predicted protein; n=1; Nematostella ve... 34 4.4 UniRef50_A5KB95 Cluster: Putative uncharacterized protein; n=1; ... 34 4.4 UniRef50_A5K327 Cluster: DnaJ domain containing protein; n=5; Pl... 34 4.4 UniRef50_A2E4P1 Cluster: Putative uncharacterized protein; n=1; ... 34 4.4 UniRef50_Q96WL0 Cluster: TPR-containing protein Mql1; n=4; Dikar... 34 4.4 UniRef50_Q1E1B4 Cluster: Putative uncharacterized protein; n=2; ... 34 4.4 UniRef50_A4RCZ0 Cluster: Predicted protein; n=1; Magnaporthe gri... 34 4.4 UniRef50_A2QUT9 Cluster: Remark: alternate names for Drosophila ... 34 4.4 UniRef50_P38249 Cluster: Eukaryotic translation initiation facto... 34 4.4 UniRef50_Q86Y22 Cluster: Collagen alpha-1(XXIII) chain; n=7; Eut... 34 4.4 UniRef50_P08120 Cluster: Collagen alpha-1(IV) chain precursor; n... 34 4.4 UniRef50_P0C2W8 Cluster: Collagen alpha-1(I) chain; n=1; Mammut ... 34 4.4 UniRef50_O00555 Cluster: Voltage-dependent P/Q-type calcium chan... 34 4.4 UniRef50_UPI00015B5E0C Cluster: PREDICTED: similar to ENSANGP000... 33 5.8 UniRef50_UPI000155640C Cluster: PREDICTED: similar to NFI-B prot... 33 5.8 UniRef50_UPI0001554CC3 Cluster: PREDICTED: similar to MGC84826 p... 33 5.8 UniRef50_UPI0000EBD0F6 Cluster: PREDICTED: hypothetical protein;... 33 5.8 UniRef50_UPI0000EBC1A2 Cluster: PREDICTED: hypothetical protein;... 33 5.8 UniRef50_UPI0000E48B5F Cluster: PREDICTED: hypothetical protein;... 33 5.8 UniRef50_UPI0000DD8441 Cluster: PREDICTED: hypothetical protein;... 33 5.8 UniRef50_UPI0000DA2594 Cluster: PREDICTED: hypothetical protein;... 33 5.8 UniRef50_UPI0000382DCB Cluster: hypothetical protein Magn0300309... 33 5.8 UniRef50_UPI0000EB17DA Cluster: Membrane-associated guanylate ki... 33 5.8 UniRef50_Q58EB8 Cluster: LOC560949 protein; n=26; Danio rerio|Re... 33 5.8 UniRef50_Q4TBC0 Cluster: Chromosome undetermined SCAF7164, whole... 33 5.8 UniRef50_Q4TBB9 Cluster: Chromosome undetermined SCAF7164, whole... 33 5.8 UniRef50_Q4T320 Cluster: Chromosome undetermined SCAF10132, whol... 33 5.8 UniRef50_Q4RIV6 Cluster: Chromosome undetermined SCAF15041, whol... 33 5.8 UniRef50_Q4RGH6 Cluster: Chromosome 18 SCAF15100, whole genome s... 33 5.8 UniRef50_Q60CV8 Cluster: Pseudouridine synthase; n=8; Rhizobiale... 33 5.8 UniRef50_Q60CC9 Cluster: Putative uncharacterized protein; n=1; ... 33 5.8 UniRef50_Q3JQ60 Cluster: Putative uncharacterized protein; n=3; ... 33 5.8 UniRef50_Q4J5P2 Cluster: Putative uncharacterized protein; n=1; ... 33 5.8 UniRef50_Q44378 Cluster: Virulence protein; n=1; Plasmid Ti|Rep:... 33 5.8 UniRef50_Q3W548 Cluster: Amidase; n=1; Frankia sp. EAN1pec|Rep: ... 33 5.8 UniRef50_Q3VXS0 Cluster: Putative uncharacterized protein; n=1; ... 33 5.8 UniRef50_Q1N9Y1 Cluster: Glycosyl transferase, group 1 family pr... 33 5.8 UniRef50_Q1AXH7 Cluster: Allergen V5/Tpx-1 related precursor; n=... 33 5.8 UniRef50_Q0LT82 Cluster: ABC transporter related; n=1; Caulobact... 33 5.8 UniRef50_A7HCK9 Cluster: LigA; n=1; Anaeromyxobacter sp. Fw109-5... 33 5.8 UniRef50_A6GIG8 Cluster: Putative uncharacterized protein; n=1; ... 33 5.8 UniRef50_A6G4U2 Cluster: Putative uncharacterized protein; n=1; ... 33 5.8 UniRef50_A5P4P2 Cluster: Putative uncharacterized protein; n=1; ... 33 5.8 UniRef50_A5P2P7 Cluster: Putative uncharacterized protein; n=1; ... 33 5.8 UniRef50_A5NU64 Cluster: LigA precursor; n=1; Methylobacterium s... 33 5.8 UniRef50_A5NQG8 Cluster: Putative uncharacterized protein precur... 33 5.8 UniRef50_A5NMJ5 Cluster: LigA; n=2; Proteobacteria|Rep: LigA - M... 33 5.8 UniRef50_A1K7M5 Cluster: Putative xanthine dehydrogenase protein... 33 5.8 UniRef50_A1K3P0 Cluster: Pseudouridylate synthase; n=3; Betaprot... 33 5.8 UniRef50_A1G8C7 Cluster: Penicillin amidase; n=2; Salinispora|Re... 33 5.8 UniRef50_A0V8J7 Cluster: CoA-binding precursor; n=2; Comamonadac... 33 5.8 UniRef50_Q2QQ86 Cluster: Putative uncharacterized protein; n=1; ... 33 5.8 UniRef50_Q0IQZ3 Cluster: Os11g0696000 protein; n=1; Oryza sativa... 33 5.8 UniRef50_Q0E4C7 Cluster: Os02g0125700 protein; n=1; Oryza sativa... 33 5.8 UniRef50_Q0DTC4 Cluster: Os03g0256800 protein; n=29; Oryza sativ... 33 5.8 UniRef50_A2X4U4 Cluster: Putative uncharacterized protein; n=3; ... 33 5.8 UniRef50_Q8WT68 Cluster: Elongation factor-1 alpha; n=3; Endopte... 33 5.8 UniRef50_Q60R78 Cluster: Putative uncharacterized protein CBG214... 33 5.8 UniRef50_Q54C75 Cluster: SNF2-related domain-containing protein;... 33 5.8 UniRef50_Q4QE67 Cluster: Putative uncharacterized protein; n=4; ... 33 5.8 UniRef50_P91249 Cluster: Collagen protein 20; n=16; Chromadorea|... 33 5.8 UniRef50_A5KAZ6 Cluster: Putative uncharacterized protein; n=1; ... 33 5.8 UniRef50_A5K084 Cluster: Putative uncharacterized protein; n=1; ... 33 5.8 UniRef50_A2DJE2 Cluster: Putative uncharacterized protein; n=1; ... 33 5.8 UniRef50_P17437 Cluster: Skin secretory protein xP2 precursor; n... 33 5.8 UniRef50_Q5SQQ9 Cluster: Ventral anterior homeobox 1; n=22; Eute... 33 5.8 UniRef50_O75420 Cluster: PERQ amino acid-rich with GYF domain-co... 33 5.8 UniRef50_Q9U7C9 Cluster: Nucleomorphin; n=2; Dictyostelium disco... 33 5.8 UniRef50_P70331 Cluster: MyoD family inhibitor; n=2; Murinae|Rep... 33 5.8 UniRef50_Q9BXS0 Cluster: Collagen alpha-1(XXV) chain (CLAC-P) (A... 33 5.8 UniRef50_UPI00015B96D1 Cluster: UPI00015B96D1 related cluster; n... 33 7.6 UniRef50_UPI0000F2C29C Cluster: PREDICTED: hypothetical protein;... 33 7.6 UniRef50_UPI0000F2108E Cluster: PREDICTED: similar to putative u... 33 7.6 UniRef50_UPI0000E7F880 Cluster: PREDICTED: hypothetical protein;... 33 7.6 UniRef50_UPI0000E22814 Cluster: PREDICTED: hypothetical protein;... 33 7.6 UniRef50_UPI0000E21B5B Cluster: PREDICTED: hypothetical protein;... 33 7.6 UniRef50_UPI0000DD83EC Cluster: PREDICTED: hypothetical protein;... 33 7.6 UniRef50_UPI0000DD7C75 Cluster: PREDICTED: hypothetical protein;... 33 7.6 UniRef50_UPI0000DD7C3F Cluster: PREDICTED: hypothetical protein;... 33 7.6 UniRef50_UPI0000DD78CC Cluster: PREDICTED: hypothetical protein;... 33 7.6 UniRef50_UPI0000DA44CD Cluster: PREDICTED: similar to procollage... 33 7.6 UniRef50_UPI000023D933 Cluster: hypothetical protein FG09625.1; ... 33 7.6 UniRef50_UPI0000D8DCBD Cluster: UPI0000D8DCBD related cluster; n... 33 7.6 UniRef50_UPI0000EB0F94 Cluster: Collagen alpha-3(IX) chain precu... 33 7.6 UniRef50_UPI0000EB08AB Cluster: UPI0000EB08AB related cluster; n... 33 7.6 UniRef50_Q4T2J8 Cluster: Chromosome undetermined SCAF10255, whol... 33 7.6 UniRef50_Q4SZ73 Cluster: Chromosome undetermined SCAF11805, whol... 33 7.6 UniRef50_Q98IB0 Cluster: Mlr2485 protein; n=1; Mesorhizobium lot... 33 7.6 UniRef50_Q67LM1 Cluster: Magnesium chelatase; n=1; Symbiobacteri... 33 7.6 UniRef50_Q67LM0 Cluster: Putative chelatase; n=1; Symbiobacteriu... 33 7.6 UniRef50_Q3BMQ0 Cluster: Putative uncharacterized protein; n=1; ... 33 7.6 UniRef50_Q2RZJ1 Cluster: Putative uncharacterized protein; n=1; ... 33 7.6 UniRef50_Q2RYN8 Cluster: Putative uncharacterized protein; n=1; ... 33 7.6 UniRef50_O67450 Cluster: Replicative DNA helicase; n=1; Aquifex ... 33 7.6 UniRef50_Q3WAL8 Cluster: IucA/IucC; n=1; Frankia sp. EAN1pec|Rep... 33 7.6 UniRef50_Q3W1F9 Cluster: Putative uncharacterized protein; n=1; ... 33 7.6 UniRef50_Q1B057 Cluster: Putative uncharacterized protein; n=2; ... 33 7.6 UniRef50_O85783 Cluster: Defective in fruiting DifE; n=4; Cystob... 33 7.6 UniRef50_A7BRT2 Cluster: ATPase involved in DNA repair; n=1; Beg... 33 7.6 UniRef50_A6G454 Cluster: Putative uncharacterized protein; n=1; ... 33 7.6 UniRef50_A5P159 Cluster: Major facilitator superfamily MFS_1 pre... 33 7.6 UniRef50_A5NZH2 Cluster: Putative uncharacterized protein; n=1; ... 33 7.6 UniRef50_A5NRY5 Cluster: Cytochrome c, monohaem; n=5; Alphaprote... 33 7.6 UniRef50_A5NQ96 Cluster: Peptidase C39, bacteriocin processing; ... 33 7.6 UniRef50_A1G3D3 Cluster: Fumarate reductase/succinate dehydrogen... 33 7.6 UniRef50_A0TRP5 Cluster: Endonuclease/exonuclease/phosphatase; n... 33 7.6 UniRef50_A0H8S1 Cluster: Pseudouridine synthase, Rsu; n=2; Comam... 33 7.6 UniRef50_A0AWL8 Cluster: Putative uncharacterized protein; n=2; ... 33 7.6 UniRef50_Q8H576 Cluster: Putative uncharacterized protein OJ1656... 33 7.6 UniRef50_Q69TN1 Cluster: Putative uncharacterized protein OSJNBa... 33 7.6 UniRef50_Q0DNS9 Cluster: Os03g0736600 protein; n=1; Oryza sativa... 33 7.6 UniRef50_Q9VET6 Cluster: CG14889-PA; n=2; Sophophora|Rep: CG1488... 33 7.6 UniRef50_Q9VCD1 Cluster: CG6129-PB, isoform B; n=6; Diptera|Rep:... 33 7.6 UniRef50_Q86NZ7 Cluster: LP07855p; n=8; Endopterygota|Rep: LP078... 33 7.6 UniRef50_Q6QDY4 Cluster: Beta-giardin; n=1; Giardia intestinalis... 33 7.6 UniRef50_Q25467 Cluster: Cuticule collagen; n=3; Tylenchoidea|Re... 33 7.6 UniRef50_A2EJ43 Cluster: Viral A-type inclusion protein, putativ... 33 7.6 UniRef50_Q59F25 Cluster: Alpha 3 type VI collagen isoform 5 vari... 33 7.6 UniRef50_Q4P6N2 Cluster: Putative uncharacterized protein; n=1; ... 33 7.6 UniRef50_Q2U760 Cluster: Predicted protein; n=1; Aspergillus ory... 33 7.6 UniRef50_Q0U3K3 Cluster: Predicted protein; n=1; Phaeosphaeria n... 33 7.6 UniRef50_P20930 Cluster: Filaggrin; n=18; Catarrhini|Rep: Filagg... 33 7.6 UniRef50_P39060 Cluster: Collagen alpha-1(XVIII) chain precursor... 33 7.6 UniRef50_Q07092 Cluster: Collagen alpha-1(XVI) chain precursor; ... 33 7.6 UniRef50_P12111 Cluster: Collagen alpha-3(VI) chain precursor; n... 33 7.6 >UniRef50_Q179J9 Cluster: Mitochondrial ATP synthase b chain; n=3; Arthropoda|Rep: Mitochondrial ATP synthase b chain - Aedes aegypti (Yellowfever mosquito) Length = 238 Score = 316 bits (776), Expect = 4e-85 Identities = 154/244 (63%), Positives = 185/244 (75%), Gaps = 1/244 (0%) Frame = -2 Query: 746 MLSRVALRSGASKQTACTALVARGSASDVATHDQKTFARPVRGE-PGKVRLGFIPEEWFQ 570 MLSR AL + A K ++ARGSAS AT RPVR E PGKVR+GF+PEEWF Sbjct: 1 MLSRAALLAAAKKPAGL--ILARGSAS--ATDGN----RPVRAEHPGKVRMGFLPEEWFT 52 Query: 569 FFHSKTGVTGPYTFGVGLATYLCSKEIYVMEHEYYSGLSLLVMVYVAHVKFGPKLAAWLD 390 FF++KTGVTGPY FG GL TYLCSKEIYVMEHEYY+GLSL +MV A KFGP +AA+ D Sbjct: 53 FFYNKTGVTGPYVFGAGLLTYLCSKEIYVMEHEYYNGLSLAIMVIYAVKKFGPAVAAYCD 112 Query: 389 KEVEATENEWNEGRNQTVKALEDAIEGEKTEQWRAQGQELLIQAKKENVLLQLEAAYRER 210 KE++ E EW R ++ L A+E EK EQWRA+GQ LL++AKKENV LQLEAAYRER Sbjct: 113 KEIDRIEGEWKADRENNIQQLAQAMEDEKKEQWRAEGQTLLMEAKKENVALQLEAAYRER 172 Query: 209 LMYAYSEVKRRLDYQLEKSNVERRLAQKHMVDWIVSNVTKAITPDQEKQALDRCIADLAS 30 M Y EVK+RLDYQ+E+ NV+RR++QKHMVDWIV NV K+ITP+QEK+ L RCIADL + Sbjct: 173 AMTVYREVKKRLDYQVERQNVDRRISQKHMVDWIVKNVVKSITPEQEKETLSRCIADLGA 232 Query: 29 LARK 18 +A + Sbjct: 233 IAAR 236 >UniRef50_Q94516 Cluster: ATP synthase B chain, mitochondrial precursor; n=7; Endopterygota|Rep: ATP synthase B chain, mitochondrial precursor - Drosophila melanogaster (Fruit fly) Length = 243 Score = 302 bits (742), Expect = 5e-81 Identities = 146/241 (60%), Positives = 178/241 (73%) Frame = -2 Query: 746 MLSRVALRSGASKQTACTALVARGSASDVATHDQKTFARPVRGEPGKVRLGFIPEEWFQF 567 M SR AL + T A +A+ +++ RP PGKVRLGF+PEEWFQF Sbjct: 1 MFSRAALLTAQRPLTVAATRSAAAAAAPGGAIERRQ--RPEH--PGKVRLGFLPEEWFQF 56 Query: 566 FHSKTGVTGPYTFGVGLATYLCSKEIYVMEHEYYSGLSLLVMVYVAHVKFGPKLAAWLDK 387 F++KTGVTGPYTFGVGL TYLCSKEIYVMEHEYYSGLSL +M +A K GP +A W D Sbjct: 57 FYNKTGVTGPYTFGVGLITYLCSKEIYVMEHEYYSGLSLGIMAIIAVKKLGPVIAKWADG 116 Query: 386 EVEATENEWNEGRNQTVKALEDAIEGEKTEQWRAQGQELLIQAKKENVLLQLEAAYRERL 207 E++ E+EW EGR +K L DAIE EK EQWRA G LL++AKKEN+ LQLEAA+RER Sbjct: 117 EIDKIESEWKEGREAELKVLSDAIEAEKKEQWRADGALLLMEAKKENIALQLEAAFRERA 176 Query: 206 MYAYSEVKRRLDYQLEKSNVERRLAQKHMVDWIVSNVTKAITPDQEKQALDRCIADLASL 27 M YSEVKRRLDYQ+E +VERRL+QKHMV+WI +NV +I+P QEK+ L++CIADL++L Sbjct: 177 MNVYSEVKRRLDYQVECRHVERRLSQKHMVNWITTNVLASISPQQEKETLNKCIADLSAL 236 Query: 26 A 24 A Sbjct: 237 A 237 >UniRef50_Q5XUB3 Cluster: Putative ATP synthase-like protein; n=1; Toxoptera citricida|Rep: Putative ATP synthase-like protein - Toxoptera citricida (Brown citrus aphid) Length = 273 Score = 233 bits (569), Expect = 5e-60 Identities = 111/217 (51%), Positives = 149/217 (68%), Gaps = 1/217 (0%) Frame = -2 Query: 665 DVATHDQKTFARPVR-GEPGKVRLGFIPEEWFQFFHSKTGVTGPYTFGVGLATYLCSKEI 489 D D F R VR EP K R F+PEEWF+ F+ KTGVTGPY G+ TYL SKEI Sbjct: 56 DGPERDLVNFPRMVRLEEPAKTRYLFVPEEWFEVFYKKTGVTGPYVLAAGVTTYLLSKEI 115 Query: 488 YVMEHEYYSGLSLLVMVYVAHVKFGPKLAAWLDKEVEATENEWNEGRNQTVKALEDAIEG 309 +V+EHE+ L+ + + YV K G LAA+LDKE++ E N R + L++ IE Sbjct: 116 WVVEHEFPYVLATIGLFYVGWKKLGTSLAAFLDKEIDEYEASCNASRKSEIDGLKETIEH 175 Query: 308 EKTEQWRAQGQELLIQAKKENVLLQLEAAYRERLMYAYSEVKRRLDYQLEKSNVERRLAQ 129 +KTE WR + Q+ +IQAK+ENV LQLEA YRER + AY++VKRRLDYQL+ +N+ R + Q Sbjct: 176 QKTEIWRTEAQKHVIQAKRENVALQLEAIYRERALQAYNQVKRRLDYQLDLANLTRTVQQ 235 Query: 128 KHMVDWIVSNVTKAITPDQEKQALDRCIADLASLARK 18 +HMV+WI+ NV K++T +QEKQ+ +C+ADL +LA K Sbjct: 236 RHMVNWIIENVLKSLTNEQEKQSFKKCMADLQALAAK 272 >UniRef50_Q0PXW9 Cluster: Putative ATP synthase-like protein; n=1; Diaphorina citri|Rep: Putative ATP synthase-like protein - Diaphorina citri (Asian citrus psyllid) Length = 249 Score = 226 bits (552), Expect = 5e-58 Identities = 115/249 (46%), Positives = 169/249 (67%), Gaps = 6/249 (2%) Frame = -2 Query: 746 MLSRVALRSGASKQTACTALVARGSA----SDV-ATHDQKTFARPVRG-EPGKVRLGFIP 585 MLSR ++ +KQ+ L ARG+A SD D F RP R +P VR IP Sbjct: 1 MLSRFVMQHALTKQSPMIVL-ARGAALLPTSDKHPERDLVNFPRPKRLIDPEPVRHTCIP 59 Query: 584 EEWFQFFHSKTGVTGPYTFGVGLATYLCSKEIYVMEHEYYSGLSLLVMVYVAHVKFGPKL 405 E WF+FF+ + GVTGPYTF GL TYL SKEI+V+EH++ ++ +++V + H FG +L Sbjct: 60 ERWFEFFYPRLGVTGPYTFTFGLITYLLSKEIWVVEHDFGYVMASVIIVGLGHKLFGKQL 119 Query: 404 AAWLDKEVEATENEWNEGRNQTVKALEDAIEGEKTEQWRAQGQELLIQAKKENVLLQLEA 225 A +LDKE+ A E + + RN + +L+ AIE E Q R++ Q +L +AK+EN+ +QLEA Sbjct: 120 ANYLDKEIAAEEEQDDAARNDKLASLKGAIENELWNQERSKAQAVLYEAKRENIQMQLEA 179 Query: 224 AYRERLMYAYSEVKRRLDYQLEKSNVERRLAQKHMVDWIVSNVTKAITPDQEKQALDRCI 45 +RER ++AY +VK RL+YQ +++RR++QKHMV W+VS+V K+ITPDQ+KQ++ +CI Sbjct: 180 VFRERALFAYQQVKNRLEYQAALESIQRRISQKHMVSWVVSHVLKSITPDQDKQSIKKCI 239 Query: 44 ADLASLARK 18 +DL +LA + Sbjct: 240 SDLKALAAR 248 >UniRef50_UPI0000517B84 Cluster: PREDICTED: similar to ATP synthase B chain, mitochondrial precursor (FO-ATP synthase subunit B); n=1; Apis mellifera|Rep: PREDICTED: similar to ATP synthase B chain, mitochondrial precursor (FO-ATP synthase subunit B) - Apis mellifera Length = 238 Score = 225 bits (550), Expect = 9e-58 Identities = 109/242 (45%), Positives = 160/242 (66%) Frame = -2 Query: 746 MLSRVALRSGASKQTACTALVARGSASDVATHDQKTFARPVRGEPGKVRLGFIPEEWFQF 567 MLSR+ R+ S+ L + VA+ + RP+ +P VRLGFIP+EWF+F Sbjct: 1 MLSRLTFRNIPSQ---VKTLACGIQTTAVASSNGPRLKRPI--DPPPVRLGFIPDEWFKF 55 Query: 566 FHSKTGVTGPYTFGVGLATYLCSKEIYVMEHEYYSGLSLLVMVYVAHVKFGPKLAAWLDK 387 F+ KTGVTGPY F +TYL SKE YVMEHE+Y+GLSLL ++ KFG K+ A+LDK Sbjct: 56 FYPKTGVTGPYVFLTTFSTYLLSKEWYVMEHEFYNGLSLLSIIIYVQYKFGAKIGAFLDK 115 Query: 386 EVEATENEWNEGRNQTVKALEDAIEGEKTEQWRAQGQELLIQAKKENVLLQLEAAYRERL 207 E++ E E N +N+ ++ +++ I + E+WR GQ ++ KK+N+ +QLEA+YRE L Sbjct: 116 EIDKDEEELNNQKNENIEEIQNQINELEKEKWRIDGQLMVYDVKKQNIWMQLEASYRENL 175 Query: 206 MYAYSEVKRRLDYQLEKSNVERRLAQKHMVDWIVSNVTKAITPDQEKQALDRCIADLASL 27 +S+VK+ LDY + + RR++QKHM+ WI+++V +ITP+QEK L +CI DL SL Sbjct: 176 ATIHSQVKKILDYHAQIDIINRRISQKHMMQWIINSVLASITPEQEKANLLQCIKDLESL 235 Query: 26 AR 21 ++ Sbjct: 236 SK 237 >UniRef50_UPI0000585FFD Cluster: PREDICTED: similar to ATP synthase, H+ transporting, mitochondrial F0 complex, subunit b; n=1; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to ATP synthase, H+ transporting, mitochondrial F0 complex, subunit b - Strongylocentrotus purpuratus Length = 249 Score = 217 bits (531), Expect = 2e-55 Identities = 115/251 (45%), Positives = 158/251 (62%), Gaps = 10/251 (3%) Frame = -2 Query: 746 MLSRVALRSGASKQTACTALVARGSASDVATHDQKTF---ARPVR------GEPGKVRLG 594 MLSR+A+R+G+ A ++ R SA V+ QK + P R E GK+R G Sbjct: 1 MLSRLAMRNGS----AIASIALRSSAPCVSAAPQKMLLSTSTPQRMPNKMPEEAGKIRFG 56 Query: 593 FIPEEWFQFFHSKTGVTGPYTFGVGLATYLCSKEIYVMEHE-YYSGLSLLVMVYVAHVKF 417 F+PEEWFQF + KTGVTGPY FG GL +L +KEIYVM E ++ ++L + +Y K Sbjct: 57 FVPEEWFQFMYKKTGVTGPYVFGTGLILFLLNKEIYVMGPETVHAAVALGLFIY-GIKKL 115 Query: 416 GPKLAAWLDKEVEATENEWNEGRNQTVKALEDAIEGEKTEQWRAQGQELLIQAKKENVLL 237 GP +A W DK+ E T + GRN + A +DAIE EKTEQWR G++ L A++ENV + Sbjct: 116 GPGIAEWADKKREETLADAYAGRNANIAAYKDAIEHEKTEQWRLDGRKQLFDARRENVAM 175 Query: 236 QLEAAYRERLMYAYSEVKRRLDYQLEKSNVERRLAQKHMVDWIVSNVTKAITPDQEKQAL 57 ++E YRERL V++++DY +E N +RRL Q+HMV WI NV K+ITP QEK + Sbjct: 176 RMEIEYRERLQQVAQAVQKKMDYHVELENTKRRLEQQHMVRWIEQNVVKSITPQQEKDIM 235 Query: 56 DRCIADLASLA 24 CI++L +LA Sbjct: 236 STCISNLKNLA 246 >UniRef50_P24539 Cluster: ATP synthase B chain, mitochondrial precursor; n=35; Euteleostomi|Rep: ATP synthase B chain, mitochondrial precursor - Homo sapiens (Human) Length = 256 Score = 184 bits (447), Expect = 3e-45 Identities = 103/250 (41%), Positives = 147/250 (58%), Gaps = 7/250 (2%) Frame = -2 Query: 746 MLSRVALRSGASKQTAC--TALVARGSASDVAT-HDQKTFARPVRGEP---GKVRLGFIP 585 MLSRV L + A+ + A + G T H + PV P GKVR G IP Sbjct: 1 MLSRVVLSAAATAAPSLKNAAFLGPGVLQATRTFHTGQPHLVPVPPLPEYGGKVRYGLIP 60 Query: 584 EEWFQFFHSKTGVTGPYTFGVGLATYLCSKEIYVMEHEYYSGLSLL-VMVYVAHVKFGPK 408 EE+FQF + KTGVTGPY G GL Y SKEIYV+ E ++ LS+L VMVY K+GP Sbjct: 61 EEFFQFLYPKTGVTGPYVLGTGLILYALSKEIYVISAETFTALSVLGVMVY-GIKKYGPF 119 Query: 407 LAAWLDKEVEATENEWNEGRNQTVKALEDAIEGEKTEQWRAQGQELLIQAKKENVLLQLE 228 +A + DK E + E + +++ +++AI+ EK++Q Q + L ++ N+ + LE Sbjct: 120 VADFADKLNEQKLAQLEEAKQASIQHIQNAIDTEKSQQALVQKRHYLFDVQRNNIAMALE 179 Query: 227 AAYRERLMYAYSEVKRRLDYQLEKSNVERRLAQKHMVDWIVSNVTKAITPDQEKQALDRC 48 YRERL Y EVK RLDY + N+ RR Q+HM++W+ +V ++I+ QEK+ + +C Sbjct: 180 VTYRERLYRVYKEVKNRLDYHISVQNMMRRKEQEHMINWVEKHVVQSISTQQEKETIAKC 239 Query: 47 IADLASLARK 18 IADL LA+K Sbjct: 240 IADLKLLAKK 249 >UniRef50_Q5DI09 Cluster: SJCHGC09031 protein; n=1; Schistosoma japonicum|Rep: SJCHGC09031 protein - Schistosoma japonicum (Blood fluke) Length = 274 Score = 153 bits (372), Expect = 3e-36 Identities = 81/197 (41%), Positives = 114/197 (57%), Gaps = 1/197 (0%) Frame = -2 Query: 608 KVRLGFIPEEWFQFFHSKTGVTGPYTFGVGLATYLCSKEIYVMEHEYYSGLSLLVMVYVA 429 KVR+G P+ WF F+SKTGVTGPY F G +L +KEI++ + + L M V Sbjct: 70 KVRMGVFPDSWFHPFYSKTGVTGPYMFMFGSFMFLINKEIWLFDGHFLECLVFFGMSTVI 129 Query: 428 HVKFGPKLAAWLDKEVEATENE-WNEGRNQTVKALEDAIEGEKTEQWRAQGQELLIQAKK 252 K GP +LD+ + E +++ N+ L++ I+ + E R ++AK+ Sbjct: 130 IKKAGPYARKFLDECTQEDEQVMYHKPINEVKSYLDNTIKTCEVEVGRTTAVSEHVRAKE 189 Query: 251 ENVLLQLEAAYRERLMYAYSEVKRRLDYQLEKSNVERRLAQKHMVDWIVSNVTKAITPDQ 72 EN+ LQLEA YRERL Y V RRLDY +E N +R Q+HMV+W+V +V K ITP Q Sbjct: 190 ENIALQLEATYRERLQKVYRAVHRRLDYHVEWENTRKRYIQQHMVNWVVDHVVKGITPAQ 249 Query: 71 EKQALDRCIADLASLAR 21 EK+ L CI +L LA+ Sbjct: 250 EKETLAHCINELERLAQ 266 >UniRef50_A7RXX3 Cluster: Predicted protein; n=1; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 240 Score = 131 bits (316), Expect = 2e-29 Identities = 66/179 (36%), Positives = 102/179 (56%) Frame = -2 Query: 560 SKTGVTGPYTFGVGLATYLCSKEIYVMEHEYYSGLSLLVMVYVAHVKFGPKLAAWLDKEV 381 +KTG TG F GLA YL S EI ++ E Y + Y K G +A LD Sbjct: 61 AKTGETGQLMFFGGLAAYLLSNEILIIHEETYIAAVMGGTFYWLMKKAGGPIAEMLDNTS 120 Query: 380 EATENEWNEGRNQTVKALEDAIEGEKTEQWRAQGQELLIQAKKENVLLQLEAAYRERLMY 201 + + +N GRN ++K L+DAI+ EK + + +I+ +EN ++ +E YR + + Sbjct: 121 QEILDAFNVGRNASIKHLQDAIDNEKHLEHMLSCRTDIIEMMRENNVMGMELEYRNNVHH 180 Query: 200 AYSEVKRRLDYQLEKSNVERRLAQKHMVDWIVSNVTKAITPDQEKQALDRCIADLASLA 24 EVK+RLDYQ+E R++ Q H++DW+ V K+ITP QEK+++ +CI DL ++A Sbjct: 181 VVKEVKKRLDYQVEMETFHRKVEQAHIIDWVEKEVIKSITPQQEKESISQCIRDLKAMA 239 >UniRef50_UPI0000E24DC6 Cluster: PREDICTED: similar to ATP synthase, H+ transporting, mitochondrial F0 complex, subunit B1; n=1; Pan troglodytes|Rep: PREDICTED: similar to ATP synthase, H+ transporting, mitochondrial F0 complex, subunit B1 - Pan troglodytes Length = 274 Score = 109 bits (261), Expect = 1e-22 Identities = 55/148 (37%), Positives = 90/148 (60%), Gaps = 1/148 (0%) Frame = -2 Query: 497 KEIYVMEHEYYSGLSLL-VMVYVAHVKFGPKLAAWLDKEVEATENEWNEGRNQTVKALED 321 K IYV+ E ++ LS+L VMVY K+GP +A + DK E + E + +++ +++ Sbjct: 54 KGIYVISAETFTALSILGVMVYGIK-KYGPFVADFADKLNEQKLAQLEEAKQASIQQIQN 112 Query: 320 AIEGEKTEQWRAQGQELLIQAKKENVLLQLEAAYRERLMYAYSEVKRRLDYQLEKSNVER 141 AI+ EK++Q Q + L ++ N+ + LE YRERL Y EVK RLDY + N+ R Sbjct: 113 AIDMEKSQQALVQKRHYLFDVQRNNIAMALEVTYRERLYRVYKEVKNRLDYHISVQNMMR 172 Query: 140 RLAQKHMVDWIVSNVTKAITPDQEKQAL 57 R Q+HM++W+ +V ++I+ QEK+ + Sbjct: 173 RKEQEHMINWVEKHVVQSISTQQEKETI 200 >UniRef50_UPI0000DD7E8D Cluster: PREDICTED: similar to ATP synthase B chain, mitochondrial precursor; n=1; Homo sapiens|Rep: PREDICTED: similar to ATP synthase B chain, mitochondrial precursor - Homo sapiens Length = 423 Score = 88.6 bits (210), Expect = 1e-16 Identities = 47/128 (36%), Positives = 71/128 (55%) Frame = -2 Query: 611 GKVRLGFIPEEWFQFFHSKTGVTGPYTFGVGLATYLCSKEIYVMEHEYYSGLSLLVMVYV 432 GKVRLG I EE+ +F + K GVTGP G GL Y SKEIYV+ E +S +S++ + Sbjct: 275 GKVRLGLILEEFLRFLYLKAGVTGPCVLGTGLILYALSKEIYVIIAETFSTISVVGLPVY 334 Query: 431 AHVKFGPKLAAWLDKEVEATENEWNEGRNQTVKALEDAIEGEKTEQWRAQGQELLIQAKK 252 A K+G +A + K E + E + +K + D I+ EK++Q Q + L ++ Sbjct: 335 AIKKYGASVAEFAGKLNEQKLAQLEEAKQAPIKQIRDGIDLEKSQQALVQKRHYLFDVQR 394 Query: 251 ENVLLQLE 228 N+ + LE Sbjct: 395 NNIAMALE 402 >UniRef50_Q19126 Cluster: Atp synthase b homolog protein 2; n=4; Caenorhabditis|Rep: Atp synthase b homolog protein 2 - Caenorhabditis elegans Length = 305 Score = 85.0 bits (201), Expect = 2e-15 Identities = 64/208 (30%), Positives = 105/208 (50%), Gaps = 5/208 (2%) Frame = -2 Query: 635 ARPVRGEPGKVRLGFIPEEWFQFFHSKTGVTGPYTFGVGLATYLCSKEIYVMEHEYYSGL 456 ARP+ P K RL +P+ WF F TGV+GPY F GL +L +KE++V E + + + Sbjct: 99 ARPMY--PPKSRLLMMPDSWFTPFQKVTGVSGPYLFFGGLFAFLVNKELWVFEEQGHMTV 156 Query: 455 SLLVMVYVAHVKFGPKLAAWLDKEVEATENEWNEGRNQTVKALEDAIEGEKTEQWRAQG- 279 ++ + G K+ L + N + +G Q + L++A+E +KT + + Sbjct: 157 GWILFYLLVTRTAGYKIDQGLYNGYQERVN-FFKGLIQ--EDLKEAVEFKKTSAKQTESL 213 Query: 278 ---QELLIQAKKENVLLQLEAAYRERLMYAYSEVKRRLDYQLEKSNVERRLAQKHMVDWI 108 +E A KE++ LQLEA YR+ + +E+KRR+DY E + R+ ++ ++ I Sbjct: 214 NSIKESYPTALKESMALQLEATYRKNVQSVATELKRRIDYLKETEESKARVEREQLLKLI 273 Query: 107 VSNVTKAITPDQEK-QALDRCIADLASL 27 S V K + K + L I L L Sbjct: 274 NSEVDKEFSDRSFKDKYLQNAIQQLKGL 301 >UniRef50_Q6AWE2 Cluster: AT16129p; n=3; Drosophila melanogaster|Rep: AT16129p - Drosophila melanogaster (Fruit fly) Length = 194 Score = 56.8 bits (131), Expect = 5e-07 Identities = 38/130 (29%), Positives = 60/130 (46%), Gaps = 14/130 (10%) Frame = -2 Query: 674 SASDVATHDQKTFAR-PVRGEPGKVRLGFIPEEWFQFFHSKTGVTGPYTFGVGLATYLCS 498 ++ TH + +R P G PGKVR GF + W V GP GVGL Y+CS Sbjct: 56 TSRSATTHSAQGLSRLPGHGSPGKVRPGFPSDNW---------VKGP--MGVGLLAYICS 104 Query: 497 KEIYVMEHE-------------YYSGLSLLVMVYVAHVKFGPKLAAWLDKEVEATENEWN 357 + ++HE Y SG+++ ++ A ++ P + W D E+ E+E+ Sbjct: 105 GDCCAIKHEHSGLSLGIMEDGYYSSGITIGILTTFAVIRLLPAIVKWADSEIIKIESEYE 164 Query: 356 EGRNQTVKAL 327 + R +K L Sbjct: 165 KSRETKIKVL 174 >UniRef50_Q870C4 Cluster: ATP synthase subunit 4, mitochondrial precursor; n=17; Pezizomycotina|Rep: ATP synthase subunit 4, mitochondrial precursor - Paracoccidioides brasiliensis Length = 244 Score = 56.0 bits (129), Expect = 9e-07 Identities = 53/224 (23%), Positives = 90/224 (40%), Gaps = 2/224 (0%) Frame = -2 Query: 701 ACTALVARGSASDVATHDQKTFARPVRGE-PGKVRLGFIPEEWFQFFHSKTGVTGPYTFG 525 A T L + S S+V T D KT A+ + PG + SKT + G Sbjct: 27 AATTLTSTRSVSNVPTEDPKTKAQSIIDALPGNSLV------------SKTAILSA---G 71 Query: 524 VGLATYLCSKEIYVMEHEYYSGLSLLVMVYVAHVKFGPKLAAWLDKEVEATENEWNEGRN 345 GL+ S E+YV E + LL + GP W + +++ ++ N R Sbjct: 72 AGLSIAAISNELYVFSEETVAAFCLLSVFAGVAKMAGPMYKEWAETQIQKQKDILNGARA 131 Query: 344 QTVKALEDAIEGEKTEQWRAQGQELLIQAKKENVLLQLEAAYRERLMYAYSEVKRRLDYQ 165 A++ IE K + L + KE L+ +A E+ +E K+ LD Sbjct: 132 NHTNAVKQRIENVKQLSGVVDITKALFEVSKETARLEAQAYELEQRTALAAEAKKVLDSW 191 Query: 164 LEKSNVERRLAQKHMVDWIVSNVTKAI-TPDQEKQALDRCIADL 36 ++ + Q+ + ++S V K + P +Q L + + D+ Sbjct: 192 VQYEGQVKVRQQRELAQTVISKVQKELENPKVIQQILQQSVTDV 235 >UniRef50_Q5KL26 Cluster: ATP synthase, putative; n=1; Filobasidiella neoformans|Rep: ATP synthase, putative - Cryptococcus neoformans (Filobasidiella neoformans) Length = 237 Score = 51.6 bits (118), Expect = 2e-05 Identities = 42/176 (23%), Positives = 71/176 (40%), Gaps = 1/176 (0%) Frame = -2 Query: 545 TGPYTFGVGLATYLCSKEIYVMEHEYYSGLSLLVMVYVAHVKFGPKLAAWLDKEVEATEN 366 TG G GL S E+YV E + LV+ V A W + ++E ++ Sbjct: 58 TGGVILGTGLTAAAVSSELYVANEETVLLVGFLVIATVIGKSVSAPYAEWANGQIEKVKS 117 Query: 365 EWNEGRNQTVKALEDAIEGEKTEQWRAQGQELLIQAKKENVLLQLEAAYRERLMYAYSEV 186 N R + +A+ D I+ + E L KE L+ E + +E+ Sbjct: 118 ILNSAREEHTRAVTDRIDSVGQLKEVVPLTESLYAVAKETNKLEHENFILAQENAVKAEL 177 Query: 185 KRRLDYQLEKSNVERRLAQKHMVDWIVSNVTKAIT-PDQEKQALDRCIADLASLAR 21 K LD + +R Q +V + +NV + P +KQ L+ +A + +A+ Sbjct: 178 KSVLDSWVRYEQQQREAEQIALVKTVQANVEAELAKPAFKKQLLEEALAQVEQIAK 233 >UniRef50_A3PHG2 Cluster: C-5 cytosine-specific DNA methylase; n=1; Rhodobacter sphaeroides ATCC 17029|Rep: C-5 cytosine-specific DNA methylase - Rhodobacter sphaeroides (strain ATCC 17029 / ATH 2.4.9) Length = 446 Score = 48.8 bits (111), Expect = 1e-04 Identities = 33/83 (39%), Positives = 41/83 (49%) Frame = -1 Query: 309 REDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRP 130 RE +A A GA + G AA G LQG A + LR +A G PAR + G + Sbjct: 246 REPRGLADAERGAAE--RHGHTLGAAPGALQGAARQQRLRDDARHGDPARRLGDGLGAGL 303 Query: 129 EAHGRLDSEQRDQGDHSGPGEAG 61 E HGR +QRD+ GP AG Sbjct: 304 EGHGR-HGDQRDEPGRLGPDSAG 325 >UniRef50_P08123 Cluster: Collagen alpha-2(I) chain precursor; n=49; Chordata|Rep: Collagen alpha-2(I) chain precursor - Homo sapiens (Human) Length = 1366 Score = 45.2 bits (102), Expect = 0.002 Identities = 37/101 (36%), Positives = 44/101 (43%), Gaps = 2/101 (1%) Frame = -1 Query: 318 NRGREDGAVARAGTGAPH--PGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERG 145 N+G E G V GT P G GER A +GE LRGE G P R+ RG Sbjct: 620 NKG-EPGVVGAVGTAGPSGPSGLPGERGAAGIPGGKGEKGEPGLRGEI--GNPGRDGARG 676 Query: 144 APSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQ 22 AP A G + D+G+ G AGP PG G+ Sbjct: 677 APGAVGAPGPAGATG-DRGEAGAAGPAGPAGPRGSPGERGE 716 >UniRef50_UPI0000E48567 Cluster: PREDICTED: hypothetical protein; n=1; Strongylocentrotus purpuratus|Rep: PREDICTED: hypothetical protein - Strongylocentrotus purpuratus Length = 991 Score = 43.2 bits (97), Expect = 0.007 Identities = 34/96 (35%), Positives = 41/96 (42%), Gaps = 3/96 (3%) Frame = -1 Query: 279 TGAPHPGQ---EGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPEAHGRLD 109 TGAP + E ER P RL+ + +RG P RG P H R Sbjct: 693 TGAPTQSRMRPEMERPPRPSSRLEPPPMGQNIRGPRPGFQPGAMEHRGGP-----HDRHR 747 Query: 108 SEQRDQGDHSGPGEAGPGPLHRGPGFAGQEVNGSER 1 QRD G GP GPGP RGPG + + G +R Sbjct: 748 MPQRD-GRGPGPDGRGPGPDGRGPGPESRHMMGRDR 782 >UniRef50_UPI0000F30A81 Cluster: UPI0000F30A81 related cluster; n=1; Bos taurus|Rep: UPI0000F30A81 UniRef100 entry - Bos Taurus Length = 571 Score = 42.7 bits (96), Expect = 0.009 Identities = 31/101 (30%), Positives = 39/101 (38%), Gaps = 1/101 (0%) Frame = -1 Query: 336 ESTGGRNRGREDGAVARAGTGAPH-PGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAR 160 +S G +G D A GAP PG +G P + GR G R R A R Sbjct: 465 QSPGQGAQGTRDLRSCGAQAGAPRDPGTQGPTEPRSPGR--GAHGPRDPRSRGAQARVPR 522 Query: 159 EVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGP 37 + P P++HG RD G H PG +GP Sbjct: 523 DPGTQGPRDPQSHGAQAGAPRDPGTHGAAEPQSPGQGAQGP 563 >UniRef50_Q72KK1 Cluster: Prephenate dehydrogenase; n=1; Thermus thermophilus HB27|Rep: Prephenate dehydrogenase - Thermus thermophilus (strain HB27 / ATCC BAA-163 / DSM 7039) Length = 493 Score = 41.9 bits (94), Expect = 0.016 Identities = 39/108 (36%), Positives = 44/108 (40%) Frame = -1 Query: 345 PNRESTGGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLP 166 P GR GR + G G HPGQ RAP R +A RG A P Sbjct: 250 PGGPPGAGRPPGRARRVASGGGGGQAHPGQPPHRAPKPPPR---DARP---RGPGAG--P 301 Query: 165 AREVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQ 22 AR +R RP G Q +G H G GP PL R PG AG+ Sbjct: 302 ARG-DRQDRHRPGRGG--GEHQGHRGPHHPGGGGGPPPLLRHPGGAGK 346 Score = 33.5 bits (73), Expect = 5.8 Identities = 33/96 (34%), Positives = 37/96 (38%), Gaps = 3/96 (3%) Frame = -1 Query: 315 RGREDGAVARAGTGAPH-PGQEGERAPAARG-RLQGEAHVRLLRGEAASGLPAREVERGA 142 RG + A+ G PH PG G P G RL + G G P RGA Sbjct: 194 RGPKPHGGAKPPPGPPHVPGGGGLPRPHPGGLRLPQDEPGH---GGGEQGGPEGG-HRGA 249 Query: 141 PSRPEAHGRLDSEQRDQGDHSGPGEAGPG-PLHRGP 37 P P GR R G G+A PG P HR P Sbjct: 250 PGGPPGAGRPPGRARRVASGGGGGQAHPGQPPHRAP 285 >UniRef50_Q095Q3 Cluster: Putative uncharacterized protein; n=1; Stigmatella aurantiaca DW4/3-1|Rep: Putative uncharacterized protein - Stigmatella aurantiaca DW4/3-1 Length = 550 Score = 41.9 bits (94), Expect = 0.016 Identities = 31/87 (35%), Positives = 37/87 (42%), Gaps = 3/87 (3%) Frame = -1 Query: 303 DGAVARAGTGAPHPGQEGERAP---AARGRLQGEAHVRLLRGEAASGLPAREVERGAPSR 133 D A RAG G +G R P AA+ R G H RG +G R + +G +R Sbjct: 196 DPARGRAGGSGHEAGGDGRRLPDAHAAQHRADGAVHAH--RGVGHAGGGHRLLRQGRRAR 253 Query: 132 PEAHGRLDSEQRDQGDHSGPGEAGPGP 52 HG D R H GEAGP P Sbjct: 254 GHVHGDEDGHHRGAQAHD-QGEAGPHP 279 >UniRef50_A6GFZ9 Cluster: Serine/threonine kinase PKN8; n=1; Plesiocystis pacifica SIR-1|Rep: Serine/threonine kinase PKN8 - Plesiocystis pacifica SIR-1 Length = 1489 Score = 41.9 bits (94), Expect = 0.016 Identities = 30/85 (35%), Positives = 37/85 (43%), Gaps = 5/85 (5%) Frame = -1 Query: 273 APHPGQEGERAPAARG-RLQGEAHVRLLRGEAASGL----PAREVERGAPSRPEAHGRLD 109 APHP RA R +L+G A VR L A +GL P R G+ P AH R++ Sbjct: 1295 APHPDHPAPRARRLRAAKLRGGARVRGLADGALAGLVHAKPGRRRGHGSAPGPRAHRRVE 1354 Query: 108 SEQRDQGDHSGPGEAGPGPLHRGPG 34 Q + EA P R PG Sbjct: 1355 GPAGAQRGRAARAEADRQPRARAPG 1379 >UniRef50_Q9N3D7 Cluster: Collagen protein 48; n=3; Chromadorea|Rep: Collagen protein 48 - Caenorhabditis elegans Length = 285 Score = 41.9 bits (94), Expect = 0.016 Identities = 39/106 (36%), Positives = 43/106 (40%), Gaps = 5/106 (4%) Frame = -1 Query: 327 GGRNRGREDGAVARAGTGAPH--PGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREV 154 G R EDGA G P PGQ G+ P RG GE L G+A G Sbjct: 161 GNPGRNGEDGAPGPQGPSGPPGPPGQPGQ--PGQRGP-PGEPGALLPGGDAPPGPSGPPG 217 Query: 153 ERGAPSRPEAHGR--LDSEQRDQGDHSGPGEAG-PGPLHRGPGFAG 25 GAP +P G D D G PG+ G PGP PG AG Sbjct: 218 RPGAPGQPGKAGSPGQDGSNGDAGVAGEPGQRGPPGP----PGQAG 259 >UniRef50_Q4RFZ6 Cluster: Chromosome undetermined SCAF15108, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome undetermined SCAF15108, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 149 Score = 41.5 bits (93), Expect = 0.022 Identities = 37/99 (37%), Positives = 44/99 (44%), Gaps = 6/99 (6%) Frame = -1 Query: 327 GGRNRGREDGAVARAGTGAP--HPGQEGERAPAARGRLQGEAHVRL-LRGE--AASGLPA 163 GG+ G+ G R G P HPG R P + EAH+R LRG+ A G+P Sbjct: 7 GGKGLGK--GGAKRHPQGPPRQHPGHHQTRHPPPGSARRREAHLRPDLRGDPRGAEGVPG 64 Query: 162 REVERGAPSRPEAHGRLDSEQRD-QGDHSGPGEAGPGPL 49 ER P R HG E D G P EAGP P+ Sbjct: 65 ---ERD-PRRRHLHGARQEEDGDGHGRGVRPEEAGPHPV 99 >UniRef50_Q5C1A3 Cluster: SJCHGC09249 protein; n=1; Schistosoma japonicum|Rep: SJCHGC09249 protein - Schistosoma japonicum (Blood fluke) Length = 367 Score = 41.5 bits (93), Expect = 0.022 Identities = 32/97 (32%), Positives = 43/97 (44%), Gaps = 1/97 (1%) Frame = -1 Query: 339 RESTGGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAR 160 R + G R EDG + A PG GE P ++QG + +++ + A GLP Sbjct: 213 RGTKGPRGPPGEDGPPGKDSL-AGEPGPPGEPGPRGPIQVQGYDNGKVVGPQGAKGLPGY 271 Query: 159 EVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAG-PGP 52 RG P P A G + G+ PGE G PGP Sbjct: 272 PGPRGRPGIPGAPG----DPGPIGESGVPGEDGPPGP 304 >UniRef50_Q4PDX4 Cluster: Putative uncharacterized protein; n=1; Ustilago maydis|Rep: Putative uncharacterized protein - Ustilago maydis (Smut fungus) Length = 335 Score = 41.5 bits (93), Expect = 0.022 Identities = 30/81 (37%), Positives = 36/81 (44%), Gaps = 1/81 (1%) Frame = -1 Query: 300 GAVARAGTGAPHPGQEGERAPAARGRLQGEAHV-RLLRGEAASGLPAREVERGAPSRPEA 124 GA+A GTG G G AP + G QG+A RG G E RG S E Sbjct: 256 GAIA-TGTGTGGAGDAGGSAPVSSGAEQGDAEAGDEARGSEERGDDGTEDRRGGQS--EG 312 Query: 123 HGRLDSEQRDQGDHSGPGEAG 61 DS+ D+GD G+AG Sbjct: 313 DDDSDSDGNDEGDAGDAGDAG 333 >UniRef50_O93419 Cluster: Collagen XVIII precursor; n=3; Gallus gallus|Rep: Collagen XVIII precursor - Gallus gallus (Chicken) Length = 1344 Score = 41.1 bits (92), Expect = 0.029 Identities = 36/105 (34%), Positives = 44/105 (41%), Gaps = 2/105 (1%) Frame = -1 Query: 327 GGRNRGREDGAVARAGTGAP-HPGQEGERA-PAARGRLQGEAHVRLLRGEAASGLPAREV 154 G + E G AG G P GQ+GE P G L H + E +G P Sbjct: 336 GEKGEKGELGIKGSAGFGYPGSKGQKGEPGEPGPPGPLS--RHTDSMSLEQVTGPPG--- 390 Query: 153 ERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQE 19 G P + A GR D E D G+ PGE GP PG +GQ+ Sbjct: 391 PTGPPGKDGAPGR-DGEPGDPGEDGKPGEMGPQGFPGTPGESGQK 434 >UniRef50_A5NYC5 Cluster: Putative uncharacterized protein; n=1; Methylobacterium sp. 4-46|Rep: Putative uncharacterized protein - Methylobacterium sp. 4-46 Length = 945 Score = 40.7 bits (91), Expect = 0.038 Identities = 40/120 (33%), Positives = 48/120 (40%), Gaps = 10/120 (8%) Frame = -1 Query: 396 VGQGXXXXXXXXXXXXEP-NRESTGGRNRGREDGAVARAGTGAPHPGQEGERAPAAR--- 229 VG+G +P RE R R V R G GAPH E A AAR Sbjct: 414 VGEGHQVVDRVRGDQRQPVGREHPQRRVPARPVRRVRRVGEGAPHGQHREELAEAARHHH 473 Query: 228 -GRLQ----GEAHVRLLRGEAASGLPAREVERGAPSRP-EAHGRLDSEQRDQGDHSGPGE 67 GR + G H R +GE G E+E RP E GR++ D+G G GE Sbjct: 474 EGRERQEPSGRGHQR--QGEGVLGQDQPEIEPALEPRPGERRGRVEEADPDRGGGRGRGE 531 >UniRef50_Q4P3N6 Cluster: Putative uncharacterized protein; n=1; Ustilago maydis|Rep: Putative uncharacterized protein - Ustilago maydis (Smut fungus) Length = 313 Score = 40.7 bits (91), Expect = 0.038 Identities = 35/113 (30%), Positives = 53/113 (46%), Gaps = 6/113 (5%) Frame = -2 Query: 563 HSKTGVTGPYTFGVGLATYLCSKEIYVMEHEYYSGL-SLLVMVYVAHVKFGPKLAAWLDK 387 +S TG T G GL SKEIYV E + SL+ V V GP W D Sbjct: 55 NSLVSKTGWVTLGTGLTAVAISKEIYVANEETVILVGSLIFAVLVGRAITGP-YKEWADS 113 Query: 386 EVEATENEWNE-----GRNQTVKALEDAIEGEKTEQWRAQGQELLIQAKKENV 243 ++EAT+++ +E GR +T + +E A+ LL+ AK++++ Sbjct: 114 QIEATKDDRSEDSIANGRFKTY-VMISTLEFSDIGSQSARVMPLLLFAKQDDL 165 >UniRef50_Q9UQ35 Cluster: Serine/arginine repetitive matrix protein 2; n=8; Eumetazoa|Rep: Serine/arginine repetitive matrix protein 2 - Homo sapiens (Human) Length = 2752 Score = 40.3 bits (90), Expect = 0.050 Identities = 36/118 (30%), Positives = 46/118 (38%), Gaps = 4/118 (3%) Frame = -1 Query: 342 NRESTGGRNRGREDGAV---ARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASG 172 +R T R R R V +R+ + A G+ R PA RGR + R RG + S Sbjct: 615 SRSRTPARRRSRTRSPVRRRSRSRSPARRSGRSRSRTPARRGRSRSRTPAR--RGRSRSR 672 Query: 171 LPAREVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLH-RGPGFAGQEVNGSER 1 PAR R P GR S +G G H R P G+ + SER Sbjct: 673 TPARRSGRSRSRTPARRGRSRSRTPRRGRSRSRSLVRRGRSHSRTPQRRGRSGSSSER 730 >UniRef50_Q4SRH5 Cluster: L-lactate dehydrogenase; n=4; Euteleostomi|Rep: L-lactate dehydrogenase - Tetraodon nigroviridis (Green puffer) Length = 360 Score = 39.9 bits (89), Expect = 0.066 Identities = 28/86 (32%), Positives = 35/86 (40%), Gaps = 2/86 (2%) Frame = -1 Query: 288 RAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGE-AASGLPAREVERGAPSRPEAHGRL 112 R G PH E P+ G G+ HVR RG + L A + R + EAHGR Sbjct: 270 RPECGRPHREHRQEHEPSPPGLHHGQRHVRHRRGGLPVAALRAEQQRREQRGQHEAHGRR 329 Query: 111 DSEQRDQGDHS-GPGEAGPGPLHRGP 37 ++ H G E G L GP Sbjct: 330 GGPAEEERRHPVGHPEGPEGRLSTGP 355 >UniRef50_Q4IYP6 Cluster: Putative uncharacterized protein; n=1; Azotobacter vinelandii AvOP|Rep: Putative uncharacterized protein - Azotobacter vinelandii AvOP Length = 1006 Score = 39.9 bits (89), Expect = 0.066 Identities = 29/85 (34%), Positives = 37/85 (43%), Gaps = 6/85 (7%) Frame = -1 Query: 309 REDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRL-----LRGEAASGLPAREVERG 145 R+ G AR G G PGQ R PA +GR A R L G AR++ G Sbjct: 638 RQHGRPARGGGGLRRPGQRRHRRPARQGRAARRATGRADDHLPLDPRRLRGGRARDLRAG 697 Query: 144 APS-RPEAHGRLDSEQRDQGDHSGP 73 P+ RP H R E+ + +GP Sbjct: 698 PPARRPGLHRRRQHERHGRPVRAGP 722 >UniRef50_Q22GI2 Cluster: UBX domain containing protein; n=1; Tetrahymena thermophila SB210|Rep: UBX domain containing protein - Tetrahymena thermophila SB210 Length = 2004 Score = 39.9 bits (89), Expect = 0.066 Identities = 34/109 (31%), Positives = 54/109 (49%), Gaps = 9/109 (8%) Frame = -2 Query: 389 KEVEATENEWNEGRNQTVKALEDAIEGEKTEQWRAQG-------QELLIQAKKENVL--L 237 K+++ EN NE N+ +K L+++I E T + +E I+ +KE +L L Sbjct: 777 KKLQELENIKNEEENR-LKKLKESIGNEDTNKTNLNNNQNAKFEEEERIKREKEEILKKL 835 Query: 236 QLEAAYRERLMYAYSEVKRRLDYQLEKSNVERRLAQKHMVDWIVSNVTK 90 QLE A +ERL Y +VK+ + Q K V L K D ++ + K Sbjct: 836 QLEKAEKERLQQEYEKVKKEQEEQ--KRIVNENLLLKQEKDKLLEEIQK 882 >UniRef50_Q17A79 Cluster: Collagen alpha chain, anopheles; n=7; Coelomata|Rep: Collagen alpha chain, anopheles - Aedes aegypti (Yellowfever mosquito) Length = 1746 Score = 39.9 bits (89), Expect = 0.066 Identities = 37/109 (33%), Positives = 45/109 (41%), Gaps = 2/109 (1%) Frame = -1 Query: 345 PNRESTGGR-NRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAAS-G 172 P E T G E G G P PG+EG R A R +G ++ +GE G Sbjct: 461 PGPEGTKGEPGDNGEPGPRGLRGAHGP-PGREGRRGRAGRDGERGVVGLQGSKGEPGPVG 519 Query: 171 LPAREVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAG 25 LP E+G R HG + E+ QG GE GP L PG G Sbjct: 520 LPGMVGEKGERGR---HGNI-GEKGAQGHEGIQGEDGPPGLPGLPGELG 564 >UniRef50_UPI0000E80F2F Cluster: PREDICTED: hypothetical protein; n=1; Gallus gallus|Rep: PREDICTED: hypothetical protein - Gallus gallus Length = 211 Score = 39.5 bits (88), Expect = 0.088 Identities = 29/71 (40%), Positives = 34/71 (47%), Gaps = 2/71 (2%) Frame = -1 Query: 339 RESTGGRNRGREDG-AVARAGTGAPHPGQEGERAPAARGRLQ-GEAHVRLLRGEAASGLP 166 + + GGR R R G A R G A + +PAARGR + G GEA G P Sbjct: 114 QRAAGGRRRRRGSGDAEPRPGAAARWDPEPARPSPAARGRPRAGPGRATCSPGEA--GAP 171 Query: 165 AREVERGAPSR 133 R RGAPSR Sbjct: 172 GRCRRRGAPSR 182 >UniRef50_A1G8K0 Cluster: Acyl-CoA dehydrogenase-like; n=2; Salinispora|Rep: Acyl-CoA dehydrogenase-like - Salinispora arenicola CNS205 Length = 665 Score = 39.5 bits (88), Expect = 0.088 Identities = 37/113 (32%), Positives = 46/113 (40%), Gaps = 6/113 (5%) Frame = -1 Query: 321 RNRGREDGAVARAGTGA-----PHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPARE 157 R R GA +R+G+ A P G P R G+AH R RG PA Sbjct: 132 RRRAGLCGAASRSGSRAARGERPIRGPTNSVRPTCRAVPGGQAHARR-RGHRPRVRPATA 190 Query: 156 VER-GAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQEVNGSER 1 V R G P RP++HGR +R+ G G PG + G G V R Sbjct: 191 VRRSGGPRRPDSHGRPRRLRREGGVR---GRRPPGRPYGPTGPRGDRVRPGVR 240 >UniRef50_Q2VLH1 Cluster: Major ampullate spidroin 2; n=8; Araneidae|Rep: Major ampullate spidroin 2 - Argiope trifasciata (Banded garden spider) Length = 661 Score = 39.5 bits (88), Expect = 0.088 Identities = 37/116 (31%), Positives = 47/116 (40%), Gaps = 2/116 (1%) Frame = -1 Query: 345 PNRESTGGRNRGREDGAVARAGTGA--PHPGQEGERAPAARGRLQGEAHVRLLRGEAASG 172 P ++ GGR A A A G P GQ+G++A G+ QG G A G Sbjct: 248 PGQQGPGGRGPYGPSAAAAAAAAGGYGPGAGQQGQQAGQGSGQ-QGP-------GGAGQG 299 Query: 171 LPAREVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQEVNGSE 4 P RG P G + G GPG GP +GPG GQ+ GS+ Sbjct: 300 GP-----RG--QGPYGPGAATAAAAAAGPGYGPGAGQQGPGSQGPGSGGQQGPGSQ 348 Score = 37.1 bits (82), Expect = 0.47 Identities = 31/112 (27%), Positives = 46/112 (41%) Frame = -1 Query: 339 RESTGGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAR 160 ++ GG+ A A A G + G++ P + G+ G+ + G A G P Sbjct: 525 QQGPGGQGPYGPSAAAAAAAAGPGYGPGAGQQGPGSGGQQGGQGSGQQGPGGAGQGGP-- 582 Query: 159 EVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQEVNGSE 4 RG P G + G + GPG GP +GPG GQ+ GS+ Sbjct: 583 ---RG--QGPYGPGAAAAAAAAAGGY-GPGAGQQGPGSQGPGSGGQQGPGSQ 628 Score = 35.9 bits (79), Expect = 1.1 Identities = 31/112 (27%), Positives = 45/112 (40%) Frame = -1 Query: 339 RESTGGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAR 160 ++ GG+ A A A G + G++ P + G+ G + G A G P Sbjct: 385 QQGPGGQGPYGPSAAAAAAAAGPGYGPGAGQQGPGSGGQQGGPGSGQQGPGGAGQGGP-- 442 Query: 159 EVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQEVNGSE 4 RG P G + G + GPG GP +GPG GQ+ GS+ Sbjct: 443 ---RG--QGPYGPGAAAAAAAAAGGY-GPGAGQQGPGSQGPGSGGQQGPGSQ 488 Score = 33.9 bits (74), Expect = 4.4 Identities = 27/104 (25%), Positives = 37/104 (35%) Frame = -1 Query: 330 TGGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVE 151 +GG+ G+ G G G P +G P A A G G ++ Sbjct: 560 SGGQQGGQGSGQQGPGGAGQGGPRGQGPYGPGAAAAAAAAAG-GYGPGAGQQGPGSQGPG 618 Query: 150 RGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQE 19 G P + G +GPG GPG +GPG GQ+ Sbjct: 619 SGGQQGPGSQGPYGPSAAAAAAAAGPGY-GPGAGQQGPGSGGQQ 661 >UniRef50_UPI000065EA11 Cluster: Collagen alpha-1(XV) chain precursor [Contains: Endostatin (Endostatin-XV) (Restin) (Related to endostatin)].; n=1; Takifugu rubripes|Rep: Collagen alpha-1(XV) chain precursor [Contains: Endostatin (Endostatin-XV) (Restin) (Related to endostatin)]. - Takifugu rubripes Length = 1156 Score = 39.1 bits (87), Expect = 0.12 Identities = 32/93 (34%), Positives = 39/93 (41%), Gaps = 9/93 (9%) Frame = -1 Query: 270 PHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPEAHGRLDSEQRDQ 91 P+PG G R P GE+ +L G A G + E+G P P G+ D E Sbjct: 381 PYPGAPGPRGPPGPPGPPGESSEVILPG--APGKDGEDGEKGEPGLPGVDGK-DGEPGPA 437 Query: 90 GDHSGPGEAG----PGPL----HRG-PGFAGQE 19 GD GE G PGP H G PG G + Sbjct: 438 GDKGDKGEPGLTGQPGPKGDQGHPGIPGLQGPD 470 >UniRef50_Q73UH7 Cluster: Putative uncharacterized protein; n=2; Bacteria|Rep: Putative uncharacterized protein - Mycobacterium paratuberculosis Length = 388 Score = 39.1 bits (87), Expect = 0.12 Identities = 34/105 (32%), Positives = 39/105 (37%), Gaps = 2/105 (1%) Frame = -1 Query: 345 PNRESTGGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLP 166 P+ ++ GR RAG A A AARG AH R LRG +G P Sbjct: 287 PSGDAAAGRRHRGMGRTAGRAGARAARRRLAAPPAAAARGARPAVAHRRALRGVFGAGDP 346 Query: 165 AREVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGP--GPLHRGP 37 RG P+RP G L G P P GP R P Sbjct: 347 T--ARRGRPARP---GPLAGRSLRPGVRRRPDRRRPATGPAARRP 386 >UniRef50_Q0SBU7 Cluster: Glycine rich protein; n=1; Rhodococcus sp. RHA1|Rep: Glycine rich protein - Rhodococcus sp. (strain RHA1) Length = 176 Score = 39.1 bits (87), Expect = 0.12 Identities = 38/117 (32%), Positives = 46/117 (39%), Gaps = 8/117 (6%) Frame = -1 Query: 333 STGGRNRGREDGAVARAG---TGAPHPGQEGERAPAARGRLQGEAHV-RLLRGEAASGLP 166 S G E GA AG GAP G G AP G Q A G+ +G P Sbjct: 20 SAGAGIASAEPGAPGGAGGSAPGAPGVGAPGFGAPGTGGDAQSNAETGNANAGDGGAGAP 79 Query: 165 AREVERGAPS--RPEAHGRLDSEQRDQGD--HSGPGEAGPGPLHRGPGFAGQEVNGS 7 + G P+ G +SE GD ++ G+A GP G GF G V GS Sbjct: 80 G--ISFGGPTIGLNNGGGNGNSEVGSGGDGGNARSGDATTGPTTGGDGFGGWGVGGS 134 >UniRef50_A5NZ47 Cluster: LigA; n=1; Methylobacterium sp. 4-46|Rep: LigA - Methylobacterium sp. 4-46 Length = 593 Score = 39.1 bits (87), Expect = 0.12 Identities = 42/122 (34%), Positives = 45/122 (36%), Gaps = 13/122 (10%) Frame = -1 Query: 327 GGRNRGREDGAVARAGT-GAPHPGQEGERAPAARG-RLQGEAHVRLLRG-----EAASGL 169 GGR RGR G V RA G P PG RA A RG R + R + G A G Sbjct: 75 GGR-RGRPRGGVRRAARPGGPAPGPRARRARAGRGPRARHPGLSRPVAGPRRALRPARGH 133 Query: 168 PAREVERG------APSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQEVNGS 7 P G AP R A GR R PG A P R P G G Sbjct: 134 PRHAARAGAGRARRAPLR-HADGRGRGAARGPARRQSPGRADPPHQRRRPRRGGAGARGG 192 Query: 6 ER 1 +R Sbjct: 193 DR 194 >UniRef50_A3C636 Cluster: Putative uncharacterized protein; n=3; Oryza sativa (japonica cultivar-group)|Rep: Putative uncharacterized protein - Oryza sativa subsp. japonica (Rice) Length = 429 Score = 39.1 bits (87), Expect = 0.12 Identities = 33/91 (36%), Positives = 38/91 (41%), Gaps = 3/91 (3%) Frame = -1 Query: 339 RESTGGRNRGREDGAVARA--GTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAAS-GL 169 RE+ GG + GR DG VARA G G P G AR R + A L GEA GL Sbjct: 221 REAAGGADAGRRDGHVARARRGAGGPDAGVGAGVLLRARRRRREAAGAVLDGGEAGEPGL 280 Query: 168 PAREVERGAPSRPEAHGRLDSEQRDQGDHSG 76 R G P A R + +G G Sbjct: 281 RRRARRAGGPRAAAAARRPPAVGGARGGEGG 311 >UniRef50_P90679 Cluster: Fibrillar collagen; n=2; Annelida/Echiura/Pogonophora group|Rep: Fibrillar collagen - Arenicola marina (Lugworm) (Rock worm) Length = 684 Score = 39.1 bits (87), Expect = 0.12 Identities = 33/101 (32%), Positives = 38/101 (37%), Gaps = 5/101 (4%) Frame = -1 Query: 345 PNRESTGGRNRGREDGAVARAGTGAP----HPGQEGERAPAARGRLQGEAHVRLLRGE-A 181 P E G RG R G G P PGQ+G P G + RGE Sbjct: 79 PGVEGKAGP-RGATGSTGDRGGPGTPGGPGQPGQQGLVGPTGPAGAPGSPGSKGNRGEPG 137 Query: 180 ASGLPAREVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGP 58 A G P G P R + G EQ + GD PG +GP Sbjct: 138 AQGRPGDTGAAGEPGRDGSPG----EQGEDGDGGSPGPSGP 174 Score = 34.3 bits (75), Expect = 3.3 Identities = 34/106 (32%), Positives = 41/106 (38%), Gaps = 9/106 (8%) Frame = -1 Query: 342 NRESTGGRNRGREDGAVARAGT-GAP----HPGQEGERAPAARGRLQGEAHVRLLRGEAA 178 NR G + R + GA G G+P G G P+ QGE + L G Sbjct: 132 NRGEPGAQGRPGDTGAAGEPGRDGSPGEQGEDGDGGSPGPSGPPGQQGERGLVGLPGMRG 191 Query: 177 SGLPAREV-ERGAPSRPEAHGRLDS--EQRDQGDHSGPGEAG-PGP 52 PA + RG P P G S E+ G PG AG PGP Sbjct: 192 EPGPAGPLGSRGEPGPPGDDGSPGSRGERGSPGSSGSPGMAGQPGP 237 >UniRef50_O97406 Cluster: Collagen pro alpha-chain precursor; n=1; Haliotis discus|Rep: Collagen pro alpha-chain precursor - Haliotis discus (Abalone) Length = 1439 Score = 39.1 bits (87), Expect = 0.12 Identities = 33/107 (30%), Positives = 46/107 (42%), Gaps = 2/107 (1%) Frame = -1 Query: 339 RESTGGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAAS-GLPA 163 R STG + ++GA ++G+ PGQ+G R G + GE G+P Sbjct: 890 RGSTGNMGQAGKNGAPGQSGS----PGQKGNRGEDGSPGSSGPTGPQGASGERGEPGMPG 945 Query: 162 REVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAG-PGPLHRGPGFAG 25 E G P P+ +G PG+AG PGP+ GPG G Sbjct: 946 PPGETG-PGGPQGPNGARGANGRRGSDGLPGKAGPPGPV-GGPGSNG 990 >UniRef50_UPI0000DD81AD Cluster: PREDICTED: hypothetical protein; n=1; Homo sapiens|Rep: PREDICTED: hypothetical protein - Homo sapiens Length = 204 Score = 38.7 bits (86), Expect = 0.15 Identities = 43/120 (35%), Positives = 50/120 (41%), Gaps = 8/120 (6%) Frame = -1 Query: 336 ESTGGRNRGREDGAVARAGTGAPHPGQEGER-APAARGRLQG-EA-----HVRLLRGEAA 178 E +GGR RGR G A P G R A RGR G EA R L G A Sbjct: 80 ERSGGRGRGRGRGRPGAGAQAAARPVGGGTRHCSAVRGRAPGREAPGARRGSRALAGTGA 139 Query: 177 SGLPAREVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHR-GPGFAGQEVNGSER 1 +G R ERG P+R GR S + Q G G P HR G +G+ V+ R Sbjct: 140 AG-GVRNAERGCPARSGVLGR--SGRSCQ----GRGVRRPAEGHRDGAATSGEVVSAGAR 192 >UniRef50_Q7UJU9 Cluster: Putative uncharacterized protein; n=1; Pirellula sp.|Rep: Putative uncharacterized protein - Rhodopirellula baltica Length = 337 Score = 38.7 bits (86), Expect = 0.15 Identities = 35/109 (32%), Positives = 39/109 (35%), Gaps = 3/109 (2%) Frame = -1 Query: 327 GGRNRG---REDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPARE 157 G R RG R D R G G P EG+R P G R A G + Sbjct: 129 GSRERGDRERGDRERGRRGDGERGPRGEGDRGPRGDGERGARGEGRGPEDGARRGPRDGD 188 Query: 156 VERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQEVNG 10 ERG P G D G G+ G GP GPGF G +G Sbjct: 189 GERG----PRGDGDRGPRGEDGRGPRGEGDRGRGP---GPGFGGPSRDG 230 >UniRef50_Q2I6N3 Cluster: Uncharacterized Gly-rich protein; n=3; cellular organisms|Rep: Uncharacterized Gly-rich protein - uncultured delta proteobacterium DeepAnt-1F12 Length = 1293 Score = 38.7 bits (86), Expect = 0.15 Identities = 30/94 (31%), Positives = 32/94 (34%) Frame = -1 Query: 306 EDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPE 127 E GA AG A G GE PA GEA G A PA E + P Sbjct: 427 EAGAAGEAGP-AGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGPA 485 Query: 126 AHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAG 25 E G+ GEAGP G AG Sbjct: 486 GEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAG 519 Score = 38.7 bits (86), Expect = 0.15 Identities = 30/94 (31%), Positives = 32/94 (34%) Frame = -1 Query: 306 EDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPE 127 E GA AG A G GE PA GEA G A PA E + P Sbjct: 439 EAGAAGEAGP-AGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGPA 497 Query: 126 AHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAG 25 E G+ GEAGP G AG Sbjct: 498 GEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAG 531 Score = 38.7 bits (86), Expect = 0.15 Identities = 30/94 (31%), Positives = 32/94 (34%) Frame = -1 Query: 306 EDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPE 127 E GA AG A G GE PA GEA G A PA E + P Sbjct: 451 EAGAAGEAGP-AGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGPA 509 Query: 126 AHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAG 25 E G+ GEAGP G AG Sbjct: 510 GEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAG 543 Score = 38.7 bits (86), Expect = 0.15 Identities = 30/94 (31%), Positives = 32/94 (34%) Frame = -1 Query: 306 EDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPE 127 E GA AG A G GE PA GEA G A PA E + P Sbjct: 463 EAGAAGEAGP-AGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGPA 521 Query: 126 AHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAG 25 E G+ GEAGP G AG Sbjct: 522 GEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAG 555 Score = 38.7 bits (86), Expect = 0.15 Identities = 30/94 (31%), Positives = 32/94 (34%) Frame = -1 Query: 306 EDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPE 127 E GA AG A G GE PA GEA G A PA E + P Sbjct: 475 EAGAAGEAGP-AGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGPA 533 Query: 126 AHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAG 25 E G+ GEAGP G AG Sbjct: 534 GEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAG 567 Score = 38.7 bits (86), Expect = 0.15 Identities = 30/94 (31%), Positives = 32/94 (34%) Frame = -1 Query: 306 EDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPE 127 E GA AG A G GE PA GEA G A PA E + P Sbjct: 487 EAGAAGEAGP-AGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGPA 545 Query: 126 AHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAG 25 E G+ GEAGP G AG Sbjct: 546 GEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAG 579 Score = 38.7 bits (86), Expect = 0.15 Identities = 30/94 (31%), Positives = 32/94 (34%) Frame = -1 Query: 306 EDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPE 127 E GA AG A G GE PA GEA G A PA E + P Sbjct: 499 EAGAAGEAGP-AGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGPA 557 Query: 126 AHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAG 25 E G+ GEAGP G AG Sbjct: 558 GEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAG 591 Score = 38.7 bits (86), Expect = 0.15 Identities = 30/94 (31%), Positives = 32/94 (34%) Frame = -1 Query: 306 EDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPE 127 E GA AG A G GE PA GEA G A PA E + P Sbjct: 511 EAGAAGEAGP-AGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGPA 569 Query: 126 AHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAG 25 E G+ GEAGP G AG Sbjct: 570 GEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAG 603 Score = 38.7 bits (86), Expect = 0.15 Identities = 30/94 (31%), Positives = 32/94 (34%) Frame = -1 Query: 306 EDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPE 127 E GA AG A G GE PA GEA G A PA E + P Sbjct: 523 EAGAAGEAGP-AGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGPA 581 Query: 126 AHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAG 25 E G+ GEAGP G AG Sbjct: 582 GEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAG 615 Score = 38.7 bits (86), Expect = 0.15 Identities = 30/94 (31%), Positives = 32/94 (34%) Frame = -1 Query: 306 EDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPE 127 E GA AG A G GE PA GEA G A PA E + P Sbjct: 535 EAGAAGEAGP-AGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGPA 593 Query: 126 AHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAG 25 E G+ GEAGP G AG Sbjct: 594 GEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAG 627 Score = 38.7 bits (86), Expect = 0.15 Identities = 30/94 (31%), Positives = 32/94 (34%) Frame = -1 Query: 306 EDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPE 127 E GA AG A G GE PA GEA G A PA E + P Sbjct: 709 EAGAAGEAGP-AGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGPA 767 Query: 126 AHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAG 25 E G+ GEAGP G AG Sbjct: 768 GEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAG 801 Score = 37.1 bits (82), Expect = 0.47 Identities = 29/94 (30%), Positives = 31/94 (32%) Frame = -1 Query: 306 EDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPE 127 E G AG A G GE PA GEA G A PA E + P Sbjct: 271 EAGPAGEAGA-AGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGPA 329 Query: 126 AHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAG 25 E G+ GEAGP G AG Sbjct: 330 GEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAG 363 Score = 37.1 bits (82), Expect = 0.47 Identities = 29/94 (30%), Positives = 31/94 (32%) Frame = -1 Query: 306 EDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPE 127 E G AG A G GE PA GEA G A PA E + P Sbjct: 415 EAGPAGEAGA-AGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGPA 473 Query: 126 AHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAG 25 E G+ GEAGP G AG Sbjct: 474 GEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAG 507 Score = 37.1 bits (82), Expect = 0.47 Identities = 29/94 (30%), Positives = 31/94 (32%) Frame = -1 Query: 306 EDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPE 127 E G AG A G GE PA GEA G A PA E + P Sbjct: 697 EAGPAGEAGA-AGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGPA 755 Query: 126 AHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAG 25 E G+ GEAGP G AG Sbjct: 756 GEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAG 789 Score = 36.7 bits (81), Expect = 0.62 Identities = 30/103 (29%), Positives = 32/103 (31%), Gaps = 2/103 (1%) Frame = -1 Query: 327 GGRNRGREDGAVARAGTGAP--HPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREV 154 G DGA AG P G GE PA GEA G A A E Sbjct: 225 GAAGEAGADGARGPAGEAGPAGEAGAAGEAGPAGEAGPAGEAGAAGEAGPAGEAGAAGEA 284 Query: 153 ERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAG 25 + P E G+ GEAGP G AG Sbjct: 285 GAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAG 327 Score = 36.7 bits (81), Expect = 0.62 Identities = 30/103 (29%), Positives = 32/103 (31%), Gaps = 2/103 (1%) Frame = -1 Query: 327 GGRNRGREDGAVARAGTGAP--HPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREV 154 G DGA AG P G GE PA GEA G A A E Sbjct: 651 GAAGEAGADGARGPAGEAGPAGEAGAAGEAGPAGEAGPAGEAGAAGEAGPAGEAGAAGEA 710 Query: 153 ERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAG 25 + P E G+ GEAGP G AG Sbjct: 711 GAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAG 753 Score = 36.3 bits (80), Expect = 0.82 Identities = 31/94 (32%), Positives = 33/94 (35%) Frame = -1 Query: 306 EDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPE 127 E GA AG A G GE PA GEA RG A PA E + P Sbjct: 199 EAGAAGEAGA-AGEAGAAGEAGPAGEAGAAGEAGADGARGPAGEAGPAGEAGAAGEAGPA 257 Query: 126 AHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAG 25 E G+ GEAGP G AG Sbjct: 258 G------EAGPAGEAGAAGEAGPAGEAGAAGEAG 285 Score = 36.3 bits (80), Expect = 0.82 Identities = 31/94 (32%), Positives = 33/94 (35%) Frame = -1 Query: 306 EDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPE 127 E GA AG A G GE PA GEA RG A PA E + P Sbjct: 625 EAGAAGEAGA-AGEAGAAGEAGPAGEAGAAGEAGADGARGPAGEAGPAGEAGAAGEAGPA 683 Query: 126 AHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAG 25 E G+ GEAGP G AG Sbjct: 684 G------EAGPAGEAGAAGEAGPAGEAGAAGEAG 711 Score = 36.3 bits (80), Expect = 0.82 Identities = 34/104 (32%), Positives = 37/104 (35%), Gaps = 1/104 (0%) Frame = -1 Query: 333 STGGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREV 154 + G E GA AG A G GE PA GEA G A PA E Sbjct: 802 AAGEAGAAGEAGAAGEAGP-AGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGPAGEA 860 Query: 153 ERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGP-GFAG 25 + P D Q G+ GEAGP GP G AG Sbjct: 861 GAAGEAGPAG---ADGAQGPAGEAGAAGEAGPAG-EAGPVGEAG 900 Score = 35.9 bits (79), Expect = 1.1 Identities = 31/105 (29%), Positives = 33/105 (31%) Frame = -1 Query: 339 RESTGGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAR 160 R G E GA AG A G GE A GEA G A PA Sbjct: 236 RGPAGEAGPAGEAGAAGEAGP-AGEAGPAGEAGAAGEAGPAGEAGAAGEAGAAGEAGPAG 294 Query: 159 EVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAG 25 E + P E G+ GEAGP G AG Sbjct: 295 EAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAG 339 Score = 35.9 bits (79), Expect = 1.1 Identities = 31/105 (29%), Positives = 33/105 (31%) Frame = -1 Query: 339 RESTGGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAR 160 R G E GA AG A G GE A GEA G A PA Sbjct: 662 RGPAGEAGPAGEAGAAGEAGP-AGEAGPAGEAGAAGEAGPAGEAGAAGEAGAAGEAGPAG 720 Query: 159 EVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAG 25 E + P E G+ GEAGP G AG Sbjct: 721 EAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAG 765 Score = 35.5 bits (78), Expect = 1.4 Identities = 30/94 (31%), Positives = 32/94 (34%) Frame = -1 Query: 306 EDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPE 127 E GA AG A G GE PA GEA G A PA E + P Sbjct: 397 EAGAAGEAGP-AGEAGAAGEAGPAGEAGAAGEA------GAAGEAGPAGEAGAAGEAGPA 449 Query: 126 AHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAG 25 E G+ GEAGP G AG Sbjct: 450 GEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAG 483 Score = 35.5 bits (78), Expect = 1.4 Identities = 28/95 (29%), Positives = 31/95 (32%) Frame = -1 Query: 306 EDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPE 127 E G AG A G GE A GEA G A PA E + P Sbjct: 787 EAGPAGEAGA-AGEAGAAGEAGAAGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGPA 845 Query: 126 AHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQ 22 E G+ GEAGP G AG+ Sbjct: 846 GEAGAAGEAGPAGEAGAAGEAGPAGADGAQGPAGE 880 Score = 35.1 bits (77), Expect = 1.9 Identities = 34/100 (34%), Positives = 36/100 (36%), Gaps = 6/100 (6%) Frame = -1 Query: 306 EDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREV----ERGAP 139 E GA AG A G GE PA GEA G A PA E E GA Sbjct: 307 EAGAAGEAGP-AGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGAA 365 Query: 138 SRPEAHGRLDS--EQRDQGDHSGPGEAGPGPLHRGPGFAG 25 A G + E G+ GEAGP G AG Sbjct: 366 GEAGAAGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAG 405 Score = 35.1 bits (77), Expect = 1.9 Identities = 34/100 (34%), Positives = 36/100 (36%), Gaps = 6/100 (6%) Frame = -1 Query: 306 EDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREV----ERGAP 139 E GA AG A G GE PA GEA G A PA E E GA Sbjct: 745 EAGAAGEAGP-AGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGAA 803 Query: 138 SRPEAHGRLDS--EQRDQGDHSGPGEAGPGPLHRGPGFAG 25 A G + E G+ GEAGP G AG Sbjct: 804 GEAGAAGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAG 843 Score = 34.7 bits (76), Expect = 2.5 Identities = 28/94 (29%), Positives = 30/94 (31%) Frame = -1 Query: 306 EDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPE 127 E G AG A G GE A GEA G A PA E + P Sbjct: 337 EAGPAGEAGA-AGEAGPAGEAGAAGEAGAAGEAGAAGEAGAAGEAGPAGEAGAAGEAGPA 395 Query: 126 AHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAG 25 E G+ GEAGP G AG Sbjct: 396 GEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAG 429 Score = 34.7 bits (76), Expect = 2.5 Identities = 28/94 (29%), Positives = 30/94 (31%) Frame = -1 Query: 306 EDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPE 127 E G AG A G GE A GEA G A PA E + P Sbjct: 775 EAGPAGEAGA-AGEAGPAGEAGAAGEAGAAGEAGAAGEAGAAGEAGPAGEAGAAGEAGPA 833 Query: 126 AHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAG 25 E G+ GEAGP G AG Sbjct: 834 GEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAG 867 Score = 34.3 bits (75), Expect = 3.3 Identities = 34/109 (31%), Positives = 36/109 (33%), Gaps = 6/109 (5%) Frame = -1 Query: 333 STGGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREV 154 + G E GA AG A G GE PA GEA G A PA E Sbjct: 364 AAGEAGAAGEAGAAGEAGP-AGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGPAGEA 422 Query: 153 ----ERGAPSR--PEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAG 25 E GA P E G+ GEAGP G AG Sbjct: 423 GAAGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAG 471 Score = 33.9 bits (74), Expect = 4.4 Identities = 33/96 (34%), Positives = 35/96 (36%) Frame = -1 Query: 306 EDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPE 127 E GA AG A G GE PA GEA G A PA E GA Sbjct: 145 EAGAAGEAGP-AGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGPAGEA--GAAGEAG 201 Query: 126 AHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQE 19 A G E G+ GEAGP G AG + Sbjct: 202 AAG----EAGAAGEAGAAGEAGPAGEAGAAGEAGAD 233 Score = 33.9 bits (74), Expect = 4.4 Identities = 28/94 (29%), Positives = 30/94 (31%) Frame = -1 Query: 306 EDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPE 127 E G AG A G GE A GEA G A PA E + P Sbjct: 259 EAGPAGEAGA-AGEAGPAGEAGAAGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGPA 317 Query: 126 AHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAG 25 E G+ GEAGP G AG Sbjct: 318 GEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAG 351 Score = 33.9 bits (74), Expect = 4.4 Identities = 28/94 (29%), Positives = 30/94 (31%) Frame = -1 Query: 306 EDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPE 127 E G AG A G GE A GEA G A PA E + P Sbjct: 403 EAGPAGEAGA-AGEAGPAGEAGAAGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGPA 461 Query: 126 AHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAG 25 E G+ GEAGP G AG Sbjct: 462 GEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAG 495 Score = 33.9 bits (74), Expect = 4.4 Identities = 33/96 (34%), Positives = 35/96 (36%) Frame = -1 Query: 306 EDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPE 127 E GA AG A G GE PA GEA G A PA E GA Sbjct: 571 EAGAAGEAGP-AGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGPAGEA--GAAGEAG 627 Query: 126 AHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQE 19 A G E G+ GEAGP G AG + Sbjct: 628 AAG----EAGAAGEAGAAGEAGPAGEAGAAGEAGAD 659 Score = 33.9 bits (74), Expect = 4.4 Identities = 28/94 (29%), Positives = 30/94 (31%) Frame = -1 Query: 306 EDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPE 127 E G AG A G GE A GEA G A PA E + P Sbjct: 685 EAGPAGEAGA-AGEAGPAGEAGAAGEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAGPA 743 Query: 126 AHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAG 25 E G+ GEAGP G AG Sbjct: 744 GEAGAAGEAGPAGEAGAAGEAGPAGEAGAAGEAG 777 >UniRef50_A5NR62 Cluster: Putative uncharacterized protein; n=1; Methylobacterium sp. 4-46|Rep: Putative uncharacterized protein - Methylobacterium sp. 4-46 Length = 1171 Score = 38.7 bits (86), Expect = 0.15 Identities = 35/105 (33%), Positives = 40/105 (38%), Gaps = 2/105 (1%) Frame = -1 Query: 345 PNRESTGGRNRGREDGAVARAGTGAPHPGQEGER-APAARGRLQGEAHVR-LLRGEAASG 172 P R R+R R ARAG GAP P R PA G + R R + G Sbjct: 511 PRRAPRAARHRRRRPRRQARAGAGAPRPSDGAARPGPARAGAGRRRPGDRGAARSRSRQG 570 Query: 171 LPAREVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGP 37 L R RGAP R R + G + PG G G R P Sbjct: 571 L--RRPRRGAPGRAR---RRAPARPPAGAAADPGRLGGGGRPRRP 610 Score = 36.3 bits (80), Expect = 0.82 Identities = 32/98 (32%), Positives = 40/98 (40%), Gaps = 2/98 (2%) Frame = -1 Query: 324 GRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVR--LLRGEAASGLPAREVE 151 GR+ G G+ +R GA G+ R R +G A R RG A+GLP R Sbjct: 420 GRDHGCLGGSGSRGARGARPRGRRRARPRRGGRRARGGARHRGGPARGAGAAGLPRRPDH 479 Query: 150 RGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGP 37 G RP G + D+G G AG P R P Sbjct: 480 PGPRPRPPGRGGARA-LGDRGGGHGRAAAGAEP-RRAP 515 >UniRef50_Q6EQL3 Cluster: Putative uncharacterized protein OSJNBa0042H24.38; n=2; Oryza sativa (japonica cultivar-group)|Rep: Putative uncharacterized protein OSJNBa0042H24.38 - Oryza sativa subsp. japonica (Rice) Length = 288 Score = 38.7 bits (86), Expect = 0.15 Identities = 41/110 (37%), Positives = 49/110 (44%), Gaps = 1/110 (0%) Frame = -1 Query: 342 NRESTGGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPA 163 +RES R G + GA A G G+ PG+ R AA G +R GEA G P Sbjct: 41 SRESVH-RGPGPQGGA-AEHGHGSGRPGRATARGGAASCG-DGRC-MRESGGEARQGNPG 96 Query: 162 REVERGAPSRPEAHGRLDSEQRDQG-DHSGPGEAGPGPLHRGPGFAGQEV 16 ERG SRP A + +EQR G + G P GPG G EV Sbjct: 97 GPGERGGGSRPAALLGMKAEQRPGGVPRARTGRRRP----EGPGEDGGEV 142 >UniRef50_UPI00015B49AB Cluster: PREDICTED: similar to alpha-5 type IV collagen; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to alpha-5 type IV collagen - Nasonia vitripennis Length = 1702 Score = 38.3 bits (85), Expect = 0.20 Identities = 30/85 (35%), Positives = 37/85 (43%), Gaps = 1/85 (1%) Frame = -1 Query: 261 GQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPEAHGRLDSEQRDQGDH 82 G +G + PA R L G HV + G P RG RP GR E+ D G Sbjct: 576 GAQGPKGPAGRVILPGSHHVSPPGDKGDKGFPGIVGLRGIRGRPGKDGR-KGERGDTGFR 634 Query: 81 SGPGEAG-PGPLHRGPGFAGQEVNG 10 G +G PGP PGF+ Q +G Sbjct: 635 GLMGLSGEPGP----PGFSAQGPDG 655 >UniRef50_UPI0000F2D61A Cluster: PREDICTED: hypothetical protein; n=1; Monodelphis domestica|Rep: PREDICTED: hypothetical protein - Monodelphis domestica Length = 327 Score = 38.3 bits (85), Expect = 0.20 Identities = 29/93 (31%), Positives = 35/93 (37%) Frame = -1 Query: 336 ESTGGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPARE 157 E+ GGR R R + A+A P + P+AR L AA+ A Sbjct: 224 EAAGGRRRRRRERPTAQASGRPLAPTETPVPLPSARPALACALRKPAAAAAAAAAAAAAV 283 Query: 156 VERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGP 58 APS G E R QG G GEA P Sbjct: 284 TPAAAPSATRRRG---GEGRGQGKREGDGEASP 313 >UniRef50_UPI0000EBDE87 Cluster: PREDICTED: hypothetical protein; n=1; Bos taurus|Rep: PREDICTED: hypothetical protein - Bos taurus Length = 616 Score = 38.3 bits (85), Expect = 0.20 Identities = 28/74 (37%), Positives = 35/74 (47%), Gaps = 1/74 (1%) Frame = -1 Query: 276 GAPHPGQEGERAPAA-RGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPEAHGRLDSEQ 100 GAPHPG RAP A GR +G++ + G A S LPA V G GR+ + Sbjct: 348 GAPHPGPSAPRAPVALAGRAEGKSRIAPALG-AQSLLPAGGVSGG--------GRVGRKW 398 Query: 99 RDQGDHSGPGEAGP 58 R+ G G GP Sbjct: 399 RENGGRGRLGARGP 412 >UniRef50_UPI00004D1B58 Cluster: UPI00004D1B58 related cluster; n=1; Xenopus tropicalis|Rep: UPI00004D1B58 UniRef100 entry - Xenopus tropicalis Length = 634 Score = 38.3 bits (85), Expect = 0.20 Identities = 28/99 (28%), Positives = 45/99 (45%), Gaps = 2/99 (2%) Frame = -1 Query: 312 GREDGAVARAGTGAP-HPGQEGERAPAARGRLQGEAHVRLLRGEAA-SGLPAREVERGAP 139 G DGA + G+P PG +G+ P L G+ + GE +G P + G P Sbjct: 141 GSVDGAGGKGEPGSPGSPGAQGQAGPRGPTGLSGQKGEKGEPGEPGQNGEPGKS---GPP 197 Query: 138 SRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQ 22 + G+ + ++ ++GD PG+AG H G G+ Sbjct: 198 GQIGLRGK-EGDRGEKGDEGTPGDAGDPGEHGMKGAKGE 235 >UniRef50_Q4RVK5 Cluster: Chromosome 15 SCAF14992, whole genome shotgun sequence; n=2; Tetraodontidae|Rep: Chromosome 15 SCAF14992, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 1493 Score = 38.3 bits (85), Expect = 0.20 Identities = 32/129 (24%), Positives = 65/129 (50%), Gaps = 6/129 (4%) Frame = -2 Query: 494 EIYVMEHEYYSGLSLLVMVYVAHVKFGPKLAAWLDKEVEATENEWNEGRNQTVKALEDAI 315 E+ +YY L L+ + +K L++++ ++E + R+Q K+LEDA+ Sbjct: 994 ELLTRSSDYYKFLGELLK-NMEELKIRNTKIEMLEEQLRLLKDETKD-RDQKNKSLEDAL 1051 Query: 314 EGEKTEQWRAQGQELLIQAKKENVLLQLEAAYRERLMYAYSEVK------RRLDYQLEKS 153 K E +++ Q ++ K +LQ A +E L +++++ R+ YQLE+ Sbjct: 1052 ARYKLELSQSKEQLFSLEEVKRTTVLQANAT-KESLDSTHNQLQDLNDQLTRIKYQLEEE 1110 Query: 152 NVERRLAQK 126 ++RLA++ Sbjct: 1111 KRKKRLAEE 1119 >UniRef50_Q4RTK0 Cluster: Chromosome 2 SCAF14997, whole genome shotgun sequence; n=3; Eumetazoa|Rep: Chromosome 2 SCAF14997, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 1130 Score = 38.3 bits (85), Expect = 0.20 Identities = 37/99 (37%), Positives = 41/99 (41%), Gaps = 2/99 (2%) Frame = -1 Query: 345 PNRESTGGRNRGREDGAVARAGTGAPHP-GQEGERAPAARGRLQGEAHVRLLRGEAASGL 169 P E GGR R A RAG G P G G R A G G V L EAA Sbjct: 929 PPGEPRGGRRREAPGRAGERAGAGGRAPAGPRGGRGGPAGGGAPGPLPVGRL-AEAAH-- 985 Query: 168 PAREVERGAPSRPEAHGRLDSEQRDQ-GDHSGPGEAGPG 55 P R+ +G P AH + D E R + P EA PG Sbjct: 986 PHRD-HQGEQRAPSAHHQGDQETRQALREDLPPPEADPG 1023 >UniRef50_Q9L060 Cluster: Putative uncharacterized protein SCO2975; n=1; Streptomyces coelicolor|Rep: Putative uncharacterized protein SCO2975 - Streptomyces coelicolor Length = 1345 Score = 38.3 bits (85), Expect = 0.20 Identities = 35/103 (33%), Positives = 43/103 (41%), Gaps = 4/103 (3%) Frame = -1 Query: 333 STGGRNRGREDGAVAR-AGTGAP--HPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPA 163 +TGG G R AG GAP P EG AP A G+ H G A G PA Sbjct: 388 ATGGSGAGGPGAPAPRTAGRGAPGRDPYAEGPPAPGAARTGAGDPHSDG-PGPGAYGAPA 446 Query: 162 REVERGAPSRPEAHGRLDSEQRDQGDHSGP-GEAGPGPLHRGP 37 P +A+ R D+ +RD G G++ GP GP Sbjct: 447 PGTPGSDPHGRDAYDR-DAYERDPGGRDASYGQSLSGPDRTGP 488 >UniRef50_Q5UBV9 Cluster: Resuscitation promoting factor; n=1; Mycobacterium avium subsp. avium|Rep: Resuscitation promoting factor - Mycobacterium avium subsp. avium Length = 217 Score = 38.3 bits (85), Expect = 0.20 Identities = 37/103 (35%), Positives = 43/103 (41%), Gaps = 2/103 (1%) Frame = -1 Query: 324 GRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERG 145 GR R R AV RAG HPG A G G V +RG A PAR Sbjct: 33 GRARRRRVRAVGRAG----HPGATDRGRRAGAGH-PGPRRVAGVRGAAVRPDPARR---- 83 Query: 144 APSRPEAHGRLDSEQRDQGDHSGPGEAGPGP--LHRGPGFAGQ 22 +RP GR ++R G G AGPG L R G +G+ Sbjct: 84 --TRPGRSGRTRCQRRPAGPARGRAPAGPGAAGLLRPAGTSGR 124 >UniRef50_A7DAS9 Cluster: Putative uncharacterized protein; n=1; Methylobacterium extorquens PA1|Rep: Putative uncharacterized protein - Methylobacterium extorquens PA1 Length = 777 Score = 38.3 bits (85), Expect = 0.20 Identities = 35/102 (34%), Positives = 40/102 (39%), Gaps = 9/102 (8%) Frame = -1 Query: 327 GGRNRGREDGAVARAGTGAPHPGQ--------EGERAPAARGRLQGEAHVRLLRGEAASG 172 GGR++G E G G H G + E APA G+ QG H RL GEAA Sbjct: 442 GGRDQGEEVGRTGAEGDEGVHVGMAAQQVRHADPEEAPAGPGQHQGREH-RLHPGEAACA 500 Query: 171 LPAREVERGAPSRPEAHGRLDSEQRDQ-GDHSGPGEAGPGPL 49 AR A + H D R GD E G PL Sbjct: 501 EKARHRMVEARQQMAPHVEDDDRGRQHGGDDQVAAECGRLPL 542 >UniRef50_A5P2U3 Cluster: Putative PAS/PAC sensor protein; n=6; Proteobacteria|Rep: Putative PAS/PAC sensor protein - Methylobacterium sp. 4-46 Length = 639 Score = 38.3 bits (85), Expect = 0.20 Identities = 34/96 (35%), Positives = 38/96 (39%), Gaps = 2/96 (2%) Frame = -1 Query: 336 ESTGGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPARE 157 E GGR G G AR APH G + R P AR R +RL R A G R Sbjct: 351 ERRGGRAMGAVAGR-ARRRLRAPHRGLDRGRGPQAR-RAGARHRLRLRRDHAPGGAGGRA 408 Query: 156 VERG-APSRPEAHGRLDSEQRDQG-DHSGPGEAGPG 55 G P R A R + R +G PG G G Sbjct: 409 AGLGDRPRRLAADARPGAPARPRGRSRERPGRLGRG 444 >UniRef50_Q60AW0 Cluster: Putative uncharacterized protein; n=1; Methylococcus capsulatus|Rep: Putative uncharacterized protein - Methylococcus capsulatus Length = 946 Score = 37.9 bits (84), Expect = 0.27 Identities = 29/94 (30%), Positives = 39/94 (41%), Gaps = 1/94 (1%) Frame = -1 Query: 345 PNRESTGGRNRGREDGAVARAGTGAPHPGQEGERAPAA-RGRLQGEAHVRLLRGEAASGL 169 P+ + GG A T G++ + APA G+ Q + GEAA Sbjct: 623 PDLDEMGGTAEPDNHEETGAASTSVNDSGEQPDGAPATPHGKRQRDTPEPSTGGEAAEQ- 681 Query: 168 PAREVERGAPSRPEAHGRLDSEQRDQGDHSGPGE 67 P RE E G P R GR D + G + GPG+ Sbjct: 682 PGREAEYGGPRR-GVTGR-DGSETGVGSNKGPGD 713 >UniRef50_Q3WGB8 Cluster: Putative uncharacterized protein; n=8; Bacteria|Rep: Putative uncharacterized protein - Frankia sp. EAN1pec Length = 1835 Score = 37.9 bits (84), Expect = 0.27 Identities = 36/113 (31%), Positives = 44/113 (38%), Gaps = 1/113 (0%) Frame = -1 Query: 345 PNRESTGGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLP 166 P R + G +RGR G R G GQ G PAA + G G+AA LP Sbjct: 278 PGRPADAGADRGRHPG---RPAAGDHLAGQPGAALPAA-VQPAGARQPDARPGDAALLLP 333 Query: 165 AREVERGAPSRPEAHGR-LDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQEVNG 10 R++ A R H + +R DH P PG G G A Q G Sbjct: 334 GRQLAARADRRGAEHRHAVQPGRRPHRDHGRPDP--PGRPDHGAGAARQADRG 384 >UniRef50_Q0RAQ2 Cluster: Putative uncharacterized protein; n=1; Frankia alni ACN14a|Rep: Putative uncharacterized protein - Frankia alni (strain ACN14a) Length = 1214 Score = 37.9 bits (84), Expect = 0.27 Identities = 32/91 (35%), Positives = 39/91 (42%), Gaps = 1/91 (1%) Frame = -1 Query: 276 GAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSR-PEAHGRLDSEQ 100 G P P + G + AA GR+Q A V R P+ + P R P+A Q Sbjct: 272 GVPPPSERGPGS-AAPGRVQPAAPVDGTRTTRLPTPPSPQPAGPMPGRRPQAEPGPPPAQ 330 Query: 99 RDQGDHSGPGEAGPGPLHRGPGFAGQEVNGS 7 G +GPG AGPGP GP G GS Sbjct: 331 --VGRLTGPGSAGPGPAGSGPAGPGSIDAGS 359 >UniRef50_A6W7I0 Cluster: Putative uncharacterized protein; n=1; Kineococcus radiotolerans SRS30216|Rep: Putative uncharacterized protein - Kineococcus radiotolerans SRS30216 Length = 638 Score = 37.9 bits (84), Expect = 0.27 Identities = 29/86 (33%), Positives = 34/86 (39%), Gaps = 3/86 (3%) Frame = -1 Query: 282 GTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEA---ASGLPAREVERGAPSRPEAHGRL 112 G GAP G R L+ A RG ++G P R R P AHGR Sbjct: 19 GGGAPPSRDRGRGGSPVRSGLRHSAEGPPARGAGRGGSAGAPRRVSPRTRPPEVVAHGRG 78 Query: 111 DSEQRDQGDHSGPGEAGPGPLHRGPG 34 D + + P AGP P RGPG Sbjct: 79 DPPRSGRTRRETP--AGPVPPRRGPG 102 Score = 34.3 bits (75), Expect = 3.3 Identities = 32/98 (32%), Positives = 35/98 (35%) Frame = -1 Query: 318 NRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAP 139 +RGR G RA G R PA RL G+ H R R + R RG Sbjct: 182 HRGR--GRPRRAPRGHDDRRHRHRRGPAGGRRLAGQHHRRGQRHPHLAHRLRRGPRRGGR 239 Query: 138 SRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAG 25 RP R D G SG E G G R P G Sbjct: 240 GRPPRRDRRDL----PGSRSGHPELGGGGARRPPPLGG 273 >UniRef50_A5NM96 Cluster: Putative uncharacterized protein; n=1; Methylobacterium sp. 4-46|Rep: Putative uncharacterized protein - Methylobacterium sp. 4-46 Length = 152 Score = 37.9 bits (84), Expect = 0.27 Identities = 33/86 (38%), Positives = 35/86 (40%), Gaps = 5/86 (5%) Frame = -1 Query: 312 GREDGAVARAGTGAPHPGQEGERAPAA-RGRLQGEAHVRLLRGEAASGLPAREVERG--- 145 G EDG AG G HP RAP A RGR + A R G S P R G Sbjct: 66 GGEDGGADGAGDGVGHP----RRAPRADRGRDEPPARARRHPGRGRSPGPRRAPAPGQCP 121 Query: 144 -APSRPEAHGRLDSEQRDQGDHSGPG 70 A SR A GR + GD G G Sbjct: 122 AAGSRGRAQGRAGLDAARPGDRRGRG 147 >UniRef50_A3P9K7 Cluster: DNA ligase, ATP-dependent; n=12; Proteobacteria|Rep: DNA ligase, ATP-dependent - Burkholderia pseudomallei (strain 1106a) Length = 1163 Score = 37.9 bits (84), Expect = 0.27 Identities = 33/113 (29%), Positives = 43/113 (38%), Gaps = 7/113 (6%) Frame = -1 Query: 330 TGGRNRGR-EDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREV 154 TGGR R + DG A A HP + + A +G GEA R G + S P+ Sbjct: 765 TGGRTRRKARDGGARDAPPLARHPKRGDAGSSARKGARDGEAGKRAAAGSSPSSSPSSST 824 Query: 153 ER--GAPSRPEAHGRLDSEQR----DQGDHSGPGEAGPGPLHRGPGFAGQEVN 13 A R GR S R D+G + + P P AG V+ Sbjct: 825 STSISASGRTRGGGRSASRDRAGDADEGANEDANDHAPRERAGAPKVAGVRVS 877 >UniRef50_A0QXB8 Cluster: Putative uncharacterized protein; n=1; Mycobacterium smegmatis str. MC2 155|Rep: Putative uncharacterized protein - Mycobacterium smegmatis (strain ATCC 700084 / mc(2)155) Length = 474 Score = 37.9 bits (84), Expect = 0.27 Identities = 19/44 (43%), Positives = 25/44 (56%) Frame = -1 Query: 153 ERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQ 22 ERGA R + +G S+ D+G PG GPGP H GP +G+ Sbjct: 197 ERGANVRGQQNGGSASQAGDRGSRRAPG-FGPGPRHAGPDRSGR 239 >UniRef50_Q8MW55 Cluster: Precollagen-NG; n=2; Mytilus|Rep: Precollagen-NG - Mytilus galloprovincialis (Mediterranean mussel) Length = 905 Score = 37.9 bits (84), Expect = 0.27 Identities = 35/109 (32%), Positives = 40/109 (36%), Gaps = 6/109 (5%) Frame = -1 Query: 327 GGRNRGREDGAVARAGTGAP---HPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPARE 157 GG GA A G G P PG +G R PA QG G + G Sbjct: 204 GGAGASASAGAFATGGGGFPLPGAPGPQGPRGPAGPPGDQGHGGPPGPPGHSPQGPQGSR 263 Query: 156 VERGAPSRPEAHGR--LDSEQRDQGDHSGPGEAGPGPLHRGP-GFAGQE 19 GAP A+G G PG+AG P RGP G AG + Sbjct: 264 GAPGAPGEQGANGNPGQPGNAGAPGQPGAPGQAG-APGARGPSGAAGHQ 311 Score = 35.1 bits (77), Expect = 1.9 Identities = 38/118 (32%), Positives = 46/118 (38%), Gaps = 11/118 (9%) Frame = -1 Query: 345 PNRESTGGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLP 166 P + G N G+ A A GAP GQ G AP ARG H G G P Sbjct: 269 PGEQGANG-NPGQPGNAGAPGQPGAP--GQAG--APGARGPSGAAGHQGAQGGVDQPGSP 323 Query: 165 ARE---VERGAPSRPEAHGR--LDSEQRDQGDHSGPGE------AGPGPLHRGPGFAG 25 ++ + GAP P A G + G+ GPGE AGP + PG G Sbjct: 324 GQQGSAGQPGAPGNPGAPGAPGPTGQAGSVGNIGGPGERGAQGSAGPRGIQGRPGCKG 381 >UniRef50_A7S046 Cluster: Predicted protein; n=1; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 1400 Score = 37.9 bits (84), Expect = 0.27 Identities = 33/95 (34%), Positives = 37/95 (38%), Gaps = 2/95 (2%) Frame = -1 Query: 333 STGGRNRGREDGAVARAGT-GAPHP-GQEGERAPAARGRLQGEAHVRLLRGEAASGLPAR 160 S GG G +G G P P G G P+ G V+ GE G P R Sbjct: 836 SKGGEGTPGSQGMPGMSGPPGRPGPPGPPGPPGPSGPSGSNGRNGVKGSTGEG--GRPGR 893 Query: 159 EVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPG 55 + E G P P GR D E G PGE GPG Sbjct: 894 DGEPGEPGTP---GR-DGEPGIPGPDGRPGERGPG 924 >UniRef50_Q8U4L2 Cluster: Putative uncharacterized protein PF0070; n=4; Thermococcaceae|Rep: Putative uncharacterized protein PF0070 - Pyrococcus furiosus Length = 300 Score = 37.9 bits (84), Expect = 0.27 Identities = 32/120 (26%), Positives = 61/120 (50%), Gaps = 7/120 (5%) Frame = -2 Query: 389 KEVEATENEWNEGRNQTVKALEDAIEGEKTEQWRAQGQELLIQAK-----KENVLLQLEA 225 +E++ N W + R++ K LE EK +++A+ E+ + K KE + +L+ Sbjct: 32 EELQKELNVWIQKRDE--KNLEVRRLREKAREFKAKRDEINQKIKELKKNKEEINAKLDL 89 Query: 224 AYRERLMYAYS--EVKRRLDYQLEKSNVERRLAQKHMVDWIVSNVTKAITPDQEKQALDR 51 Y+E L Y E K+ ++ K +E R+ + ++W + ITP++EKQ +D+ Sbjct: 90 LYQEALEYKTKRDEFKQLRRLKMPKEKIEERIEK---LEWELQT-NPNITPEREKQIVDQ 145 >UniRef50_Q14050 Cluster: Collagen alpha-3(IX) chain precursor; n=31; Euteleostomi|Rep: Collagen alpha-3(IX) chain precursor - Homo sapiens (Human) Length = 684 Score = 37.9 bits (84), Expect = 0.27 Identities = 30/101 (29%), Positives = 42/101 (41%), Gaps = 1/101 (0%) Frame = -1 Query: 300 GAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAAS-GLPAREVERGAPSRPEA 124 GA +AG G EG R P G + G + G+P ++ + G P Sbjct: 259 GAPGKAGDRGER-GPEGFRGPKGDLGRPGPKGTPGVAGPSGEPGMPGKDGQNGVPG---- 313 Query: 123 HGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQEVNGSER 1 LD ++ + G + PGE GP L PG AG + ER Sbjct: 314 ---LDGQKGEAGRNGAPGEKGPNGLPGLPGRAGSKGEKGER 351 >UniRef50_P20908 Cluster: Collagen alpha-1(V) chain precursor; n=63; Coelomata|Rep: Collagen alpha-1(V) chain precursor - Homo sapiens (Human) Length = 1838 Score = 37.9 bits (84), Expect = 0.27 Identities = 27/83 (32%), Positives = 40/83 (48%), Gaps = 1/83 (1%) Frame = -1 Query: 264 PGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPEAHGRLDSEQRDQGD 85 PG++G + PA R LQG V L G P + ++G P G + D+G+ Sbjct: 1122 PGEKGPQGPAGRDGLQGP--VGLPGPAGPVGPPGEDGDKGEIGEPGQKG----SKGDKGE 1175 Query: 84 HSGPGEAGP-GPLHRGPGFAGQE 19 PG GP GP+ + PG +G + Sbjct: 1176 QGPPGPTGPQGPIGQ-PGPSGAD 1197 >UniRef50_UPI000065EAC0 Cluster: Homolog of Homo sapiens "PREDICTED "similar to matrilin 2 precursor; n=1; Takifugu rubripes|Rep: Homolog of Homo sapiens "PREDICTED "similar to matrilin 2 precursor - Takifugu rubripes Length = 910 Score = 37.5 bits (83), Expect = 0.35 Identities = 42/147 (28%), Positives = 57/147 (38%), Gaps = 6/147 (4%) Frame = -1 Query: 441 GVCGSREIRTKIGC--LVG-QGXXXXXXXXXXXXEPNRESTGGRNRGREDGAVARAGT-G 274 G CG+ ++ G LVG +G P + G R G + G G Sbjct: 226 GECGTPGVKGDAGPVGLVGTRGPRGLQGERGPTGPPGIQGETGIGRPGPKGDIGFQGQPG 285 Query: 273 APHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPEAHGRLD-SEQR 97 P P GE P QG V+ +G GLP + +RG P G+ + Sbjct: 286 PPGPRGIGELGPPGP---QGPQGVQGSKGPTGEGLPGPKGDRGLPGPRGPRGQQGVGIKG 342 Query: 96 DQGDHSGPGEAGP-GPLHRGPGFAGQE 19 D+GD PG GP GP G G G++ Sbjct: 343 DKGDFGPPGFPGPTGP--TGVGIQGEK 367 >UniRef50_UPI000065E422 Cluster: Collagen alpha-1(XI) chain precursor.; n=1; Takifugu rubripes|Rep: Collagen alpha-1(XI) chain precursor. - Takifugu rubripes Length = 1668 Score = 37.5 bits (83), Expect = 0.35 Identities = 31/103 (30%), Positives = 39/103 (37%), Gaps = 5/103 (4%) Frame = -1 Query: 306 EDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPE 127 E+G V G P PG G + P QG G ++G + E G P Sbjct: 1091 ENGDVGAMGPPGP-PGPRGPQGPGGTVGSQGPPG-----GIGSAGAVGEKGEAGEAGNPG 1144 Query: 126 AHGR-----LDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQEVN 13 HG E ++GD PG AGP L PG G + N Sbjct: 1145 PHGEPGMAGRKGETGEKGDTGPPGAAGPAGLRGPPGDDGPKGN 1187 >UniRef50_Q4SB89 Cluster: Chromosome undetermined SCAF14676, whole genome shotgun sequence; n=4; Percomorpha|Rep: Chromosome undetermined SCAF14676, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 1399 Score = 37.5 bits (83), Expect = 0.35 Identities = 30/100 (30%), Positives = 40/100 (40%), Gaps = 4/100 (4%) Frame = -1 Query: 306 EDGAVARAGTGAPH--PGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSR 133 +DG V G P G+ GE+ PA QG L + A G + E+G P Sbjct: 563 KDGEVGAQGPAGPAGLQGERGEQGPAGATGFQG-----LPGPQGAVGETGKPGEQGVPGE 617 Query: 132 PEAHGRLDS--EQRDQGDHSGPGEAGPGPLHRGPGFAGQE 19 G S ++ G+ PG AGP PG AG + Sbjct: 618 AGLPGPAGSRGDRGFPGERGAPGAAGPTGARGSPGPAGND 657 Score = 33.5 bits (73), Expect = 5.8 Identities = 33/105 (31%), Positives = 37/105 (35%), Gaps = 8/105 (7%) Frame = -1 Query: 315 RGREDGAVARAGTGAPHP-------GQEGERAPAARGRLQGEAHVRLLRG-EAASGLPAR 160 RG + A A G P P G+ GE+ L G A R RG G P Sbjct: 582 RGEQGPAGATGFQGLPGPQGAVGETGKPGEQGVPGEAGLPGPAGSRGDRGFPGERGAPGA 641 Query: 159 EVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAG 25 GA P G D + D G PG GP L PG G Sbjct: 642 AGPTGARGSPGPAGN-DGAKGDAGAPGNPGAQGPPGLQGMPGERG 685 >UniRef50_A1BM62 Cluster: Latency associated nuclear antigen (LANA)-like protein; n=6; root|Rep: Latency associated nuclear antigen (LANA)-like protein - Ovine herpesvirus 2 Length = 551 Score = 37.5 bits (83), Expect = 0.35 Identities = 35/112 (31%), Positives = 40/112 (35%), Gaps = 1/112 (0%) Frame = -1 Query: 336 ESTGGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQG-EAHVRLLRGEAASGLPAR 160 E GG G G V G PG EGE P G G E + GE G Sbjct: 200 EGPGGEGEG-PGGEVEGPGGEGEGPGGEGE-GPGGEGEGPGGEGEGPVGEGEGPGG---- 253 Query: 159 EVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQEVNGSE 4 G E G + + G+ GPG G GP G G G+E G E Sbjct: 254 ---EGEGPVGEGEGPVGEGEGPGGEGEGPGGEGEGPGGEGEGPGGEEGPGGE 302 Score = 35.5 bits (78), Expect = 1.4 Identities = 39/116 (33%), Positives = 41/116 (35%), Gaps = 4/116 (3%) Frame = -1 Query: 345 PNRESTGGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLP 166 P E GG G G V G PG E E P G G V GE P Sbjct: 106 PGGEGPGGEGEG-PGGEVEGPGGEGEGPGGEVE-GPGGEGEGPG-GEVEGPGGEGEG--P 160 Query: 165 AREVE----RGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQEVNG 10 EVE G E G E+ G+ GPG G GP G G G EV G Sbjct: 161 GGEVEGPGGEGKGPGGEVEGPGGEEEGPGGEGEGPGGEGEGPGGEGEG-PGGEVEG 215 Score = 35.1 bits (77), Expect = 1.9 Identities = 39/118 (33%), Positives = 43/118 (36%), Gaps = 7/118 (5%) Frame = -1 Query: 336 ESTGGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRL---QGEAHVRLLRGEAASGL- 169 E GG G G G PG EGE P G +GE V G G Sbjct: 214 EGPGGEGEGPGGEGEGPGGEGEG-PGGEGE-GPVGEGEGPGGEGEGPVGEGEGPVGEGEG 271 Query: 168 PAREVER--GAPSRPEAHGR-LDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQEVNGSE 4 P E E G P G E+ G+ GPG G GP GPG G+E G E Sbjct: 272 PGGEGEGPGGEGEGPGGEGEGPGGEEGPGGEGEGPGGEGEGPGGGGPG--GEEEEGEE 327 Score = 33.9 bits (74), Expect = 4.4 Identities = 35/108 (32%), Positives = 41/108 (37%), Gaps = 6/108 (5%) Frame = -1 Query: 315 RGREDGAVARAGTGAPH--PGQEGERAPAARGRL---QGEAHVRLLRGEAASGL-PAREV 154 RGR G + T PG EGE P G +GE + G G P EV Sbjct: 17 RGRRPGPKKKTVTEGKGEGPGGEGE-GPGGEGEGPGGEGEGPGGEVEGPGGEGEGPGGEV 75 Query: 153 ERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQEVNG 10 E P E G + G+ GPG G GP GPG G+ G Sbjct: 76 E--GPGG-EGEGPGGEVEGPGGEEEGPGGEGEGPGGEGPGGEGEGPGG 120 >UniRef50_Q82F52 Cluster: Putative GntR-family transcriptional regulator; n=1; Streptomyces avermitilis|Rep: Putative GntR-family transcriptional regulator - Streptomyces avermitilis Length = 478 Score = 37.5 bits (83), Expect = 0.35 Identities = 32/92 (34%), Positives = 39/92 (42%) Frame = -1 Query: 300 GAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPEAH 121 GA R GA PG+ G A RGR + A V + G A VERG RP Sbjct: 293 GAHGRVPRGAGGPGRAGGGGGAGRGRGRRGAAVGRVDGGAVRAGGGGAVERGRDGRPA-- 350 Query: 120 GRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAG 25 GR ++ R G + G P +HR AG Sbjct: 351 GRREAGGR--GRAAAAGRRAPCRVHRSGRSAG 380 >UniRef50_Q2J869 Cluster: Tetratricopeptide TPR_2; n=1; Frankia sp. CcI3|Rep: Tetratricopeptide TPR_2 - Frankia sp. (strain CcI3) Length = 654 Score = 37.5 bits (83), Expect = 0.35 Identities = 35/109 (32%), Positives = 42/109 (38%), Gaps = 6/109 (5%) Frame = -1 Query: 345 PNRESTGGRNRGREDGA-VARAGTGAPHPGQEGERAPA---ARGRLQGEAHVRLLRGEAA 178 P TGG G +RAG G G+R + AR R+ EA R RG Sbjct: 156 PEGTGTGGGRAPAAGGERFSRAGGPRGDRGASGDRGASGASARNRVAAEAAWRERRGRPG 215 Query: 177 SGLPAREVERGAPSRPEAHGR--LDSEQRDQGDHSGPGEAGPGPLHRGP 37 + R G PS HGR DS + + G GP P H GP Sbjct: 216 AEETGRS---GRPSSRPDHGRPGRDSSVSHRPGQARAGSDGPRPRHDGP 261 >UniRef50_Q5C2Y9 Cluster: SJCHGC09378 protein; n=2; Platyhelminthes|Rep: SJCHGC09378 protein - Schistosoma japonicum (Blood fluke) Length = 603 Score = 37.5 bits (83), Expect = 0.35 Identities = 36/106 (33%), Positives = 44/106 (41%), Gaps = 1/106 (0%) Frame = -1 Query: 339 RESTGGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAR 160 R G R + + G + G PH G G P +G A E + G P Sbjct: 93 RGQPGPRGKQGDAGEPGKPGEAGPH-GYPGFMGPPGEPGPEGPAG-----SEGSEGPPG- 145 Query: 159 EVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRG-PGFAG 25 E+G P + +G L E D GD PG AGP P RG PGF G Sbjct: 146 --EQGPPGK---NGEL-GEAGDVGDAGPPGRAGP-PGRRGHPGFPG 184 Score = 35.1 bits (77), Expect = 1.9 Identities = 36/133 (27%), Positives = 48/133 (36%), Gaps = 3/133 (2%) Frame = -1 Query: 441 GVCGSREIRTKIGCLVGQGXXXXXXXXXXXXEPNREST-GGRNRGREDGAVARAGTGAPH 265 G G +R G + G G P G R E G + G Sbjct: 202 GPIGRAGMRGPRGSVGGVGVAGSKGEQGLSGAPGSPGEIGPRGDSGEPGIPGKDG----R 257 Query: 264 PGQEGERAPAARGRLQGEAHVRLLRG-EAASGLPAREVERGAPSRPEAHGRLDSEQRDQG 88 PG++GE P +G A G + G P + RG P + G + S + +G Sbjct: 258 PGKQGEPGPKGAPGGKGPAGPPGPPGLDGPVGYPGDQGPRGPPGQVGERGPMGS-RGSRG 316 Query: 87 DHSGPGEAGP-GP 52 D PGE GP GP Sbjct: 317 DRGDPGEVGPLGP 329 >UniRef50_A1XVT1 Cluster: Fibrillar collagen precursor; n=1; Hydra vulgaris|Rep: Fibrillar collagen precursor - Hydra attenuata (Hydra) (Hydra vulgaris) Length = 1476 Score = 37.5 bits (83), Expect = 0.35 Identities = 34/111 (30%), Positives = 48/111 (43%), Gaps = 2/111 (1%) Frame = -1 Query: 345 PNRESTGGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLP 166 PN E G + E VA+ G P G +G++ P + +GE + GE G P Sbjct: 163 PNGEGLAG-SLETEGSVVAKEGPKGPM-GPDGKQGPRGQYGDRGEPGPQ---GEP--GDP 215 Query: 165 AREVERGAPSRPEAHGR--LDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQE 19 + +G P +GR L+ E+ D G PGE GP L G G + Sbjct: 216 GEKGAQGPAGSPGTNGRDGLNGERGDPGTIGPPGEPGPEGLQGPSGLDGPQ 266 Score = 33.1 bits (72), Expect = 7.6 Identities = 27/87 (31%), Positives = 36/87 (41%), Gaps = 1/87 (1%) Frame = -1 Query: 264 PGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPEAHGRLDSEQRDQGD 85 PG +GER P G R +GE G ++G+ + GR D + D+G Sbjct: 350 PGDQGERGPKGETGATGSRGERGEQGER--GKAGEPGQKGSKGPLGSRGR-DGFRGDKGS 406 Query: 84 HSGPGEAGPGPLHRGPGFAG-QEVNGS 7 PG GP L G G Q +GS Sbjct: 407 SGSPGSQGPRGLVGPRGHQGLQGSDGS 433 >UniRef50_P46804 Cluster: Spidroin-2; n=17; Orbiculariae|Rep: Spidroin-2 - Nephila clavipes (Golden silk orbweaver) Length = 627 Score = 37.5 bits (83), Expect = 0.35 Identities = 38/114 (33%), Positives = 46/114 (40%), Gaps = 1/114 (0%) Frame = -1 Query: 345 PNRESTGGRN-RGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGL 169 P R G + G A A AG+G PG G R G QG+ G AA+ Sbjct: 58 PGRYGPGQQGPSGPGSAAAAAAGSGQQGPGGYGPRQQGPGGYGQGQQGPS-GPGSAAAAS 116 Query: 168 PAREVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQEVNGS 7 A E G P +G Q+ G + GPG+ GPG GPG G GS Sbjct: 117 AAASAESGQQG-PGGYG---PGQQGPGGY-GPGQQGPGGY--GPGQQGPSGPGS 163 >UniRef50_UPI0000EBD1F0 Cluster: PREDICTED: hypothetical protein; n=1; Bos taurus|Rep: PREDICTED: hypothetical protein - Bos taurus Length = 217 Score = 37.1 bits (82), Expect = 0.47 Identities = 35/97 (36%), Positives = 40/97 (41%) Frame = -1 Query: 333 STGGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREV 154 S GG+ R RE G R G G P P GE AP A +G R + SGL A Sbjct: 93 SPGGQERERE-GKGERGGAG-PWPAG-GEGAPQAEPEGRGSE-----RPDKRSGLNATGS 144 Query: 153 ERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHR 43 + RP R + G SGPG PGP R Sbjct: 145 KESGSQRPR---RKKEQGAGLGARSGPGSGVPGPRAR 178 >UniRef50_UPI0000E7F798 Cluster: PREDICTED: hypothetical protein; n=2; Gallus gallus|Rep: PREDICTED: hypothetical protein - Gallus gallus Length = 1794 Score = 37.1 bits (82), Expect = 0.47 Identities = 38/119 (31%), Positives = 51/119 (42%), Gaps = 9/119 (7%) Frame = -1 Query: 330 TGGRNRGREDGAVARAG----TGAP-HPGQEGERAPAARGRLQGEAHVRLLRGEAAS-GL 169 TG R + E G +G GAP PG+ G R P GE ++ GE G+ Sbjct: 978 TGLRGKDGEPGKKGTSGMKGEAGAPGEPGERGMRGPLGLPGRPGEQGIKGDPGEPGKDGI 1037 Query: 168 PAREVERG-APSRPEAHGRLDSEQRDQGDHSGPGEAG-PGPL-HRGPGFAGQEVNGSER 1 + ++G + S E L E+ D+G GE G PGP GP G E+ ER Sbjct: 1038 TGEKGDKGESMSSLENTITLKGEKGDRGKDGLKGERGPPGPKGEPGPPGKGVEMKDLER 1096 Score = 33.1 bits (72), Expect = 7.6 Identities = 24/85 (28%), Positives = 34/85 (40%) Frame = -1 Query: 264 PGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPEAHGRLDSEQRDQGD 85 PG++G + +GE + +G P + E GAP P A G Q +G+ Sbjct: 1206 PGEKGTKGDRGESGQKGEPGIGFRGPVGQAGPPGLKGEPGAPGPPGAQG----IQGIRGN 1261 Query: 84 HSGPGEAGPGPLHRGPGFAGQEVNG 10 PG G PG GQ+ G Sbjct: 1262 AGIPGSQGDRGAPGLPGTPGQKAGG 1286 >UniRef50_Q2VIS4 Cluster: Filaggrin 2; n=3; Mus musculus|Rep: Filaggrin 2 - Mus musculus (Mouse) Length = 2362 Score = 37.1 bits (82), Expect = 0.47 Identities = 26/97 (26%), Positives = 40/97 (41%), Gaps = 1/97 (1%) Frame = -1 Query: 342 NRESTGGRNRGREDGAVARAGTGAPHP-GQEGERAPAARGRLQGEAHVRLLRGEAASGLP 166 +R+ G+ + + G+ HP EGE R G H +G+ +G Sbjct: 1256 SRQPQAGQGQPSQSGSGRSPRRSPVHPESSEGEEHSVVPQRHSGSGHGHG-QGQGQAGHQ 1314 Query: 165 AREVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPG 55 RE G P RPE + DS ++ Q P ++G G Sbjct: 1315 QRESVHGQPVRPEVPTQ-DSSRQPQAGQGQPSQSGSG 1350 Score = 36.7 bits (81), Expect = 0.62 Identities = 25/97 (25%), Positives = 39/97 (40%), Gaps = 1/97 (1%) Frame = -1 Query: 342 NRESTGGRNRGREDGAVARAGTGAPHP-GQEGERAPAARGRLQGEAHVRLLRGEAASGLP 166 +R+ G+ + + G+ HP EGE R G H +G+ +G Sbjct: 1880 SRQPQAGQGQPSQSGSGRSPRRSPVHPESSEGEEHSVVPQRHSGSGHGHG-QGQGQAGHQ 1938 Query: 165 AREVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPG 55 RE G P RP+ + DS + Q P ++G G Sbjct: 1939 QRESVHGQPVRPQGPSQ-DSSSQPQASQGQPSQSGSG 1974 Score = 33.1 bits (72), Expect = 7.6 Identities = 33/112 (29%), Positives = 45/112 (40%), Gaps = 10/112 (8%) Frame = -1 Query: 339 RESTGGRNRGREDGAVA------RAGTGAPHP---GQEGERAPAARGRLQGEAHVRLLRG 187 RES G+ RGR G +AG G P G+ R+P +GE H + + Sbjct: 1550 RESVHGQ-RGRPQGPSQDSSRQPQAGQGQPSQSGSGRSPRRSPVHPESSEGEEHSVVPQR 1608 Query: 186 EAASGLPAREVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPG-PLHRGPG 34 + SG + + G R HG+ Q D S +AG G P G G Sbjct: 1609 YSGSGHGHGQGQAGHQQRESVHGQRGRPQGPSQDSSRQPQAGQGQPSQSGSG 1660 >UniRef50_Q3JU34 Cluster: Putative uncharacterized protein; n=5; Burkholderiales|Rep: Putative uncharacterized protein - Burkholderia pseudomallei (strain 1710b) Length = 478 Score = 37.1 bits (82), Expect = 0.47 Identities = 26/79 (32%), Positives = 35/79 (44%), Gaps = 1/79 (1%) Frame = -1 Query: 315 RGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVE-RGAP 139 RG ++ AR A H E RG E H+ L RG+ PA + R Sbjct: 43 RGADEERDARIVEVAEHRDHERHERDRERGADGPERHLELQRGDLGRAEPAAHRDLRQQD 102 Query: 138 SRPEAHGRLDSEQRDQGDH 82 +P+ HGR E+R +GDH Sbjct: 103 HQPDPHGR---ERRARGDH 118 >UniRef50_Q2W370 Cluster: Putative uncharacterized protein; n=3; Magnetospirillum|Rep: Putative uncharacterized protein - Magnetospirillum magneticum (strain AMB-1 / ATCC 700264) Length = 429 Score = 37.1 bits (82), Expect = 0.47 Identities = 26/74 (35%), Positives = 33/74 (44%), Gaps = 1/74 (1%) Frame = -1 Query: 306 EDGA-VARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRP 130 ED A +A AG G P G GE A+ R Q + LLR A + REV R P P Sbjct: 286 EDAAHLAEAGLGRPDAGAGGEADAEAKARFQKGGELFLLRRFAEAAAAFREVLRLKPDHP 345 Query: 129 EAHGRLDSEQRDQG 88 + L + + G Sbjct: 346 RSLFNLAMAEAELG 359 >UniRef50_Q1QHE7 Cluster: OmpA/MotB precursor; n=2; Nitrobacter|Rep: OmpA/MotB precursor - Nitrobacter hamburgensis (strain X14 / DSM 10229) Length = 673 Score = 37.1 bits (82), Expect = 0.47 Identities = 19/51 (37%), Positives = 29/51 (56%) Frame = -1 Query: 186 EAASGLPAREVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPG 34 ++A PA + + +PS P A+G ++ +RD+ SGPG GP GPG Sbjct: 179 KSAPTTPAPQPQTTSPSTPPANGEPNATRRDERGRSGPGREHGGP--GGPG 227 >UniRef50_A1G361 Cluster: Putative uncharacterized protein; n=1; Salinispora arenicola CNS205|Rep: Putative uncharacterized protein - Salinispora arenicola CNS205 Length = 1159 Score = 37.1 bits (82), Expect = 0.47 Identities = 33/109 (30%), Positives = 42/109 (38%) Frame = -1 Query: 327 GGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVER 148 GG RG AV A A G G A RGRL R R A LP + Sbjct: 203 GGCRRGGHRTAVVPALLPAGPGGDPGAGAAGRRGRLP---RARARRRPAGDRLPRPGHPK 259 Query: 147 GAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQEVNGSER 1 P+ P ++Q + PG A PGP G AG+ V+ ++ Sbjct: 260 PVPTAPLGGSGQPADQ-SRPRRQRPGRAQPGPGGSGADLAGRRVDPGDQ 307 >UniRef50_Q0JLS5 Cluster: Os01g0575200 protein; n=1; Oryza sativa (japonica cultivar-group)|Rep: Os01g0575200 protein - Oryza sativa subsp. japonica (Rice) Length = 391 Score = 37.1 bits (82), Expect = 0.47 Identities = 33/91 (36%), Positives = 38/91 (41%), Gaps = 7/91 (7%) Frame = -1 Query: 333 STGGRNRGREDGAVARAGTGAPHPG----QEG-ERAPAARGRLQGEAHVRLLRGEAASGL 169 + G G DG V R G GAPHPG EG +RA R L A + R A Sbjct: 295 AAAGEPDGDGDGGVRRGGAGAPHPGMPQVDEGDQRAVRLRRHLLAAASSQGHRQHQAPD- 353 Query: 168 PAREVERGAPSRPEAHG--RLDSEQRDQGDH 82 R +ERG R + G R D R DH Sbjct: 354 RGRRLERGVVPRGDDEGGERRDHFLRPARDH 384 >UniRef50_Q2H3Y0 Cluster: Putative uncharacterized protein; n=1; Chaetomium globosum|Rep: Putative uncharacterized protein - Chaetomium globosum (Soil fungus) Length = 1874 Score = 37.1 bits (82), Expect = 0.47 Identities = 20/61 (32%), Positives = 29/61 (47%), Gaps = 5/61 (8%) Frame = -1 Query: 177 SGLPAREVERGAPSRPEAHGR---LDSE--QRDQGDHSGPGEAGPGPLHRGPGFAGQEVN 13 SG+P E+ERGAP +P G+ L+S R + PG+ P H +G+ Sbjct: 1027 SGIPGEEMERGAPRQPPPVGQFQNLESRLPSRQSPSAASPGQEAPASAHAPGSISGESAQ 1086 Query: 12 G 10 G Sbjct: 1087 G 1087 >UniRef50_P18503 Cluster: Short-chain collagen C4; n=2; Ephydatia muelleri|Rep: Short-chain collagen C4 - Ephydatia muelleri (Mueller's freshwater sponge) Length = 366 Score = 37.1 bits (82), Expect = 0.47 Identities = 28/73 (38%), Positives = 35/73 (47%) Frame = -1 Query: 276 GAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPEAHGRLDSEQR 97 GAP P Q AP A G L G A + +G+ GLP + + GAP P D + Sbjct: 43 GAPGP-QGAPGAPGAPG-LPGPAGPQGPKGD--KGLPGNDGQPGAPGAPG----YDGAKG 94 Query: 96 DQGDHSGPGEAGP 58 D+GD PG GP Sbjct: 95 DKGDTGAPGPQGP 107 >UniRef50_UPI000155BCDD Cluster: PREDICTED: similar to Elongation factor RNA polymerase II-like 3; n=1; Ornithorhynchus anatinus|Rep: PREDICTED: similar to Elongation factor RNA polymerase II-like 3 - Ornithorhynchus anatinus Length = 364 Score = 36.7 bits (81), Expect = 0.62 Identities = 30/110 (27%), Positives = 49/110 (44%), Gaps = 14/110 (12%) Frame = -1 Query: 339 RESTGGRNRGREDGAVARAGTGAP-------HPGQEGERAPAARGRLQGEAHVRLL--RG 187 RE+ GG +R + + R T P PG G+R+ A + R + + G Sbjct: 136 REAPGGISRDQLAQRLLRDQTPCPAREPEPRQPGPSGQRSQALQLRQPQASPDGPVPQEG 195 Query: 186 EAASGLPAREVERGAPSRPEAHGRLDSEQRDQG-----DHSGPGEAGPGP 52 + GLP+R+ +G + E G + E+ ++ DHS P + GP P Sbjct: 196 YSTEGLPSRDPGQGEERKEEEEGEEEKEEEEEEMALHLDHSPPAQTGPEP 245 >UniRef50_UPI0000F2E1B1 Cluster: PREDICTED: hypothetical protein; n=1; Monodelphis domestica|Rep: PREDICTED: hypothetical protein - Monodelphis domestica Length = 257 Score = 36.7 bits (81), Expect = 0.62 Identities = 35/112 (31%), Positives = 43/112 (38%), Gaps = 14/112 (12%) Frame = -1 Query: 345 PNRESTGGRNRGREDGAVARAGTGAP-HPGQEGE---RAPAAR------GRLQGEAHVRL 196 P R++ GR RE G R AP PG GE R P AR GRL ++ + Sbjct: 64 PERQTESGRGGPREPGRPRRVAGAAPASPGGLGEWPGRPPRARAASETGGRLGPQSDMAK 123 Query: 195 L----RGEAASGLPAREVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGP 52 R + L R E GAP + G + S P A PGP Sbjct: 124 CPPASRWPPTAPLMERAAEAGAPPQSSPDGAAWPPEEPNSPESSPSGARPGP 175 >UniRef50_UPI0000F2C810 Cluster: PREDICTED: hypothetical protein; n=1; Monodelphis domestica|Rep: PREDICTED: hypothetical protein - Monodelphis domestica Length = 117 Score = 36.7 bits (81), Expect = 0.62 Identities = 39/104 (37%), Positives = 44/104 (42%), Gaps = 9/104 (8%) Frame = -1 Query: 324 GRNRGREDGAVARAGTGAPHPGQEGERAPAA----RGRL---QGEAHVRLL--RGEAASG 172 G RGRE G R G G P PG +G R P + RG + Q E RLL R A G Sbjct: 8 GPLRGRETGG--RPGLG-PGPGAKGLRPPRSPLTIRGAVAAGQAELGARLLIQRPAAPFG 64 Query: 171 LPAREVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRG 40 L + RG R + G G GP A PGP RG Sbjct: 65 LSSSPGRRGPSDRSVSMG---GSALPPGLLPGPAPAPPGPRRRG 105 >UniRef50_UPI0000DA3A8A Cluster: PREDICTED: hypothetical protein; n=1; Rattus norvegicus|Rep: PREDICTED: hypothetical protein - Rattus norvegicus Length = 253 Score = 36.7 bits (81), Expect = 0.62 Identities = 31/109 (28%), Positives = 42/109 (38%), Gaps = 2/109 (1%) Frame = -1 Query: 345 PNRESTGGRNRGREDGAVARAGTGAPHPGQ-EGERAPAARGRLQGEAHVRLLRGEAASGL 169 P+ ++ N+G + R G G PG+ E R RGR +GE R R A Sbjct: 35 PDAKAPASPNQGAVQPGM-RLGLGRGAPGEAERRRGRGIRGRWEGEGSCRASRAGAQHNA 93 Query: 168 PAREVERGAPSRPEAHGRLDSEQRDQGDHSGPGE-AGPGPLHRGPGFAG 25 + G+PS P + +G SGPG L PG G Sbjct: 94 FSAAASPGSPSHPGLRPPARCSPKRRG-CSGPGRLPAASSLRAAPGTRG 141 >UniRef50_UPI000059FC02 Cluster: PREDICTED: hypothetical protein XP_848755; n=1; Canis lupus familiaris|Rep: PREDICTED: hypothetical protein XP_848755 - Canis familiaris Length = 295 Score = 36.7 bits (81), Expect = 0.62 Identities = 30/87 (34%), Positives = 36/87 (41%), Gaps = 2/87 (2%) Frame = -1 Query: 324 GRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGL-PAREVER 148 G RGR G G AP P E+ P A GR G +LL GL REV + Sbjct: 164 GWPRGRAHGLAGTVGAAAPRP----EQPPEAAGRALGPLRRQLLGNHGHGGLVRRREVTK 219 Query: 147 GAPSR-PEAHGRLDSEQRDQGDHSGPG 70 R PEA + +R G +G G Sbjct: 220 SPTDRLPEAAQKWTGRERLLGVGTGGG 246 >UniRef50_UPI00003932A2 Cluster: hypothetical protein Blon03000113; n=1; Bifidobacterium longum DJO10A|Rep: hypothetical protein Blon03000113 - Bifidobacterium longum DJO10A Length = 71 Score = 36.7 bits (81), Expect = 0.62 Identities = 19/36 (52%), Positives = 23/36 (63%) Frame = -1 Query: 207 HVRLLRGEAASGLPAREVERGAPSRPEAHGRLDSEQ 100 H R R AASGLP+ E ER A +R EAH R + E+ Sbjct: 18 HERENRHRAASGLPSLEEERAAAAREEAHVRREREK 53 >UniRef50_UPI000069E795 Cluster: UPI000069E795 related cluster; n=1; Xenopus tropicalis|Rep: UPI000069E795 UniRef100 entry - Xenopus tropicalis Length = 232 Score = 36.7 bits (81), Expect = 0.62 Identities = 35/117 (29%), Positives = 51/117 (43%), Gaps = 6/117 (5%) Frame = -1 Query: 339 RESTGGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAH--VRLLRGEAASGLP 166 RES+G N GRE + +G + G G R + G L E+ L R + +G Sbjct: 61 RESSGTGNSGRESSGIGNSGRESSSTGNLG-RESSGTGNLGRESSGTGNLGRESSGTGNS 119 Query: 165 AREVERGAPSRPEAHGRLDSEQRDQGDHSGPGE-AGPGPLHR---GPGFAGQEVNGS 7 RE S E+ G +S + G + E +G G HR G G G+E +G+ Sbjct: 120 GRESSGTGNSGRESSGIGNSGRESSGTGNSHRESSGTGNSHRESSGTGNLGRESSGT 176 >UniRef50_Q4SPM7 Cluster: Chromosome 16 SCAF14537, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome 16 SCAF14537, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 444 Score = 36.7 bits (81), Expect = 0.62 Identities = 35/111 (31%), Positives = 44/111 (39%), Gaps = 3/111 (2%) Frame = -1 Query: 345 PNRESTGGR-NRGREDGAVARAGTGAPH-PGQEGERAPAARGRLQGEAHVRLLRGEAAS- 175 P R G + +RG E R TGA PG G P +GE R G A Sbjct: 182 PERGPRGPKGDRGLEGSPGERGPTGAAGLPGPAGISGPMGLKGDKGERGERGGSGSAGQP 241 Query: 174 GLPAREVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQ 22 GLP + ++G P G + G + G GPG + PG AGQ Sbjct: 242 GLPGEKGQKGEQGDPGPPG-VPGVMGIPGINGKHGSPGPGGVRGDPGPAGQ 291 >UniRef50_Q4S1P4 Cluster: Chromosome 6 SCAF14768, whole genome shotgun sequence; n=2; Tetraodon nigroviridis|Rep: Chromosome 6 SCAF14768, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 1347 Score = 36.7 bits (81), Expect = 0.62 Identities = 27/75 (36%), Positives = 32/75 (42%), Gaps = 1/75 (1%) Frame = -1 Query: 333 STGGRNRGREDGAVARAGT-GAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPARE 157 S G + + E G++ G G HPGQ G A R +GE L GE GLP Sbjct: 417 SPGQKGQPGEPGSLDPPGAPGKVHPGQPGAPGKAGRPGNKGEDG---LPGEPGFGLPGPP 473 Query: 156 VERGAPSRPEAHGRL 112 G P RP G L Sbjct: 474 GPPGLPGRPSEVGEL 488 >UniRef50_Q3JM63 Cluster: Peptide synthetase NRPS5-4-3; n=16; Burkholderia|Rep: Peptide synthetase NRPS5-4-3 - Burkholderia pseudomallei (strain 1710b) Length = 1005 Score = 36.7 bits (81), Expect = 0.62 Identities = 26/79 (32%), Positives = 33/79 (41%), Gaps = 1/79 (1%) Frame = -1 Query: 288 RAGTGAPHPGQEGE-RAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPEAHGRL 112 R GA P + G R A GR + + +R AA L AR + R AP R + Sbjct: 199 RVRGGAARPRRAGRARRAAGAGRRRADGELRRAARRAAR-LSARALSRRAPLRVRRDAAV 257 Query: 111 DSEQRDQGDHSGPGEAGPG 55 GD GPG+ G G Sbjct: 258 RRRAARMGDGLGPGQRGDG 276 >UniRef50_A7HI44 Cluster: LigA; n=1; Anaeromyxobacter sp. Fw109-5|Rep: LigA - Anaeromyxobacter sp. Fw109-5 Length = 535 Score = 36.7 bits (81), Expect = 0.62 Identities = 29/74 (39%), Positives = 33/74 (44%) Frame = -1 Query: 339 RESTGGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAR 160 R + GR RGR +A G AP P + RAP GR G R RG+AA G PAR Sbjct: 357 RPAGAGRARGRRRARLAPCGA-APGPPRRRPRAPVG-GRPGGVGD-RGRRGQAARGTPAR 413 Query: 159 EVERGAPSRPEAHG 118 G R G Sbjct: 414 RRRGGDRGRGAGGG 427 >UniRef50_A5P034 Cluster: Putative uncharacterized protein precursor; n=1; Methylobacterium sp. 4-46|Rep: Putative uncharacterized protein precursor - Methylobacterium sp. 4-46 Length = 1034 Score = 36.7 bits (81), Expect = 0.62 Identities = 29/88 (32%), Positives = 34/88 (38%), Gaps = 1/88 (1%) Frame = -1 Query: 303 DGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPEA 124 DG R G A H GE P AR + H+ RG A GL R P+ Sbjct: 552 DGGSRRVGAAARHAQDRGECPPPARADQRRARHLED-RGRADGGLCRGFRRRRDPAGCRR 610 Query: 123 HGRLDSEQRDQGDHSGP-GEAGPGPLHR 43 HG + Q Q +GP G A G R Sbjct: 611 HGGVADRQEGQRAGAGPRGGARAGAYRR 638 >UniRef50_A5NQT8 Cluster: LigA; n=1; Methylobacterium sp. 4-46|Rep: LigA - Methylobacterium sp. 4-46 Length = 761 Score = 36.7 bits (81), Expect = 0.62 Identities = 35/109 (32%), Positives = 47/109 (43%), Gaps = 7/109 (6%) Frame = -1 Query: 312 GREDGAVARA-GTGAPHPGQEGERAPAARG----RLQGEAHVRLLRGEAAS-GLPAREVE 151 G G AR G P PG+ G A AARG L+G+ L G LPAR+ Sbjct: 506 GESPGPAARPPGLAGPAPGRRG--AAAARGGPPAHLRGDGGSLALPGPRLGLPLPARDDR 563 Query: 150 RGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPG-PLHRGPGFAGQEVNGS 7 + P+R R D+ R + +G + GP P PG +G G+ Sbjct: 564 QRLPARRRRLDRGDAGPRARRARAGARQRGPCLPRLPRPGASGARARGA 612 >UniRef50_A4VVK3 Cluster: ATP synthase B chain; n=3; Streptococcus suis|Rep: ATP synthase B chain - Streptococcus suis (strain 05ZYH33) Length = 168 Score = 36.7 bits (81), Expect = 0.62 Identities = 21/58 (36%), Positives = 35/58 (60%), Gaps = 1/58 (1%) Frame = -2 Query: 383 VEATENEWNEGRNQTVKALEDAIEGEKTEQWRAQGQ-ELLIQAKKENVLLQLEAAYRE 213 V+ E+E +GR ++ K ++DA+E K E+ R Q ++ IQ K+ L++EA RE Sbjct: 67 VQQREDELVQGRIESQKIIQDAVERAKLEKKRILEQADVEIQGLKQKAQLEIEAEKRE 124 >UniRef50_Q0ISC2 Cluster: Os11g0538400 protein; n=1; Oryza sativa (japonica cultivar-group)|Rep: Os11g0538400 protein - Oryza sativa subsp. japonica (Rice) Length = 610 Score = 36.7 bits (81), Expect = 0.62 Identities = 37/100 (37%), Positives = 42/100 (42%), Gaps = 1/100 (1%) Frame = -1 Query: 339 RESTGGRNRGREDGAVARAGTGAPHPGQEGERAPA-ARGRLQGEAHVRLLRGEAASGLPA 163 R + R R R++ AR G GAP +G R PA RG L EA RG P Sbjct: 210 RRARRRRRRPRQEAHHAR-GDGAPPRAADGRRVPAPRRGGLGEEAR----RGRRRRVPPG 264 Query: 162 REVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHR 43 RGAP R R + QG H G G G PL R Sbjct: 265 GAPRRGAPRRRRQVRRRAVVR--QGGH-GDGRRGVPPLPR 301 >UniRef50_Q171W5 Cluster: Lava lamp protein; n=2; Culicidae|Rep: Lava lamp protein - Aedes aegypti (Yellowfever mosquito) Length = 3407 Score = 36.7 bits (81), Expect = 0.62 Identities = 37/130 (28%), Positives = 64/130 (49%), Gaps = 5/130 (3%) Frame = -2 Query: 392 DKEVEATENEWNEGRNQTVKALEDAIE--GEKTEQWRAQGQELLIQAKKE-NVLLQLEAA 222 D+E E + E N RN ++ L+ +E G K + Q L+ A KE +L +L A Sbjct: 1064 DEEPELLKVELNS-RNDEIRELKKELELLGVKKAGEIEEAQAKLVAATKEIEILKELVAE 1122 Query: 221 YRERLMYAYSEVKRRLDYQLEKSNVERRLAQKHMVDWIVSNVTKAITPDQEKQALD--RC 48 +++L+ Y E + + +L++ AQK M D V ++ + + EK + D R Sbjct: 1123 QKQQLIETYQEHENEIAGKLKEIQDYENQAQK-MADQ-VEDLNRQLVEVGEKYSNDMKRQ 1180 Query: 47 IADLASLARK 18 + +L SL +K Sbjct: 1181 VEELKSLTQK 1190 >UniRef50_P20849 Cluster: Collagen alpha-1(IX) chain precursor; n=85; Euteleostomi|Rep: Collagen alpha-1(IX) chain precursor - Homo sapiens (Human) Length = 921 Score = 36.7 bits (81), Expect = 0.62 Identities = 28/91 (30%), Positives = 34/91 (37%), Gaps = 1/91 (1%) Frame = -1 Query: 327 GGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRG-EAASGLPAREVE 151 G R G + G H G+EG++ QG + LRG G + Sbjct: 427 GMRGHKGAKGEIGEPGRQG-HKGEEGDQGELGEVGAQGPPGAQGLRGITGIVGDKGEKGA 485 Query: 150 RGAPSRPEAHGRLDSEQRDQGDHSGPGEAGP 58 RG P G L DQG PGEAGP Sbjct: 486 RGLDGEPGPQG-LPGAPGDQGQRGPPGEAGP 515 >UniRef50_UPI0000F201FC Cluster: PREDICTED: similar to collagen, type XXVIII; n=1; Danio rerio|Rep: PREDICTED: similar to collagen, type XXVIII - Danio rerio Length = 766 Score = 36.3 bits (80), Expect = 0.82 Identities = 32/96 (33%), Positives = 39/96 (40%), Gaps = 2/96 (2%) Frame = -1 Query: 300 GAVARAGT-GAPHP-GQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPE 127 GA AG G+P P G G +G QG RG +G + E+G P P Sbjct: 113 GAKGDAGPPGSPGPLGMPGRGIQGEKGN-QGPVGPPGQRGNPGTGFTGPKGEQGFPGNPG 171 Query: 126 AHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQE 19 A G QR G+ GE G L PG GQ+ Sbjct: 172 APG-----QRGTGEPGPKGEPGLRGLAGDPGIPGQD 202 >UniRef50_UPI0000DD84BF Cluster: PREDICTED: hypothetical protein; n=4; Homo/Pan/Gorilla group|Rep: PREDICTED: hypothetical protein - Homo sapiens Length = 404 Score = 36.3 bits (80), Expect = 0.82 Identities = 34/98 (34%), Positives = 41/98 (41%), Gaps = 5/98 (5%) Frame = -1 Query: 315 RGREDGAVAR-----AGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVE 151 +GRED V R A G+ P G A A +GR+QG A R LRGE + P R Sbjct: 96 QGREDAGVGRRNRDPAEPGSLRPTSLGFPARAGQGRVQGAAPGRKLRGEPKTP-PGRSGP 154 Query: 150 RGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGP 37 G H +E + S GPGP R P Sbjct: 155 SGGVRSGAGHRAAAAEHQLGAGFS--VRNGPGPPARLP 190 >UniRef50_UPI00005C000E Cluster: PREDICTED: similar to Apolipoprotein B48 receptor; n=4; Laurasiatheria|Rep: PREDICTED: similar to Apolipoprotein B48 receptor - Bos taurus Length = 1020 Score = 36.3 bits (80), Expect = 0.82 Identities = 15/31 (48%), Positives = 17/31 (54%) Frame = -1 Query: 102 QRDQGDHSGPGEAGPGPLHRGPGFAGQEVNG 10 Q DQ P EAGPGP G AGQ+ +G Sbjct: 876 QEDQSTDEDPAEAGPGPQREADGSAGQDAHG 906 >UniRef50_Q4SNW2 Cluster: Chromosome 15 SCAF14542, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome 15 SCAF14542, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 1009 Score = 36.3 bits (80), Expect = 0.82 Identities = 30/99 (30%), Positives = 39/99 (39%), Gaps = 4/99 (4%) Frame = -1 Query: 303 DGAVARAGTGAPHPGQEGERA----PAARGRLQGEAHVRLLRGEAASGLPAREVERGAPS 136 DG G P PG +GE+ P G G V + + GLP E+G P Sbjct: 383 DGHPGDVGETGP-PGTDGEKGNTGRPGRSGPPGGPGDVGKKGDQGSPGLPGDPGEQGNPG 441 Query: 135 RPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQE 19 P G E +GD+ G GPG + G +G E Sbjct: 442 IPGDPG-APGEIGRRGDYGVKGSQGPGGIKGEKGESGPE 479 >UniRef50_Q4RWT3 Cluster: Chromosome 15 SCAF14981, whole genome shotgun sequence; n=2; Clupeocephala|Rep: Chromosome 15 SCAF14981, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 1877 Score = 36.3 bits (80), Expect = 0.82 Identities = 25/88 (28%), Positives = 38/88 (43%) Frame = -1 Query: 264 PGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPEAHGRLDSEQRDQGD 85 PG++G+ PA R +QG + L G P + ++G P G + D+G+ Sbjct: 1133 PGEKGDVGPAGRDGIQGP--IGLPGSAGPQGQPGEDGDKGEVGGPGQKG----SKGDKGE 1186 Query: 84 HSGPGEAGPGPLHRGPGFAGQEVNGSER 1 PG AG + PG AG + R Sbjct: 1187 LGPPGPAGLQGVIGAPGPAGSDGEAGPR 1214 Score = 34.3 bits (75), Expect = 3.3 Identities = 31/99 (31%), Positives = 36/99 (36%), Gaps = 1/99 (1%) Frame = -1 Query: 306 EDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPS-RP 130 E+G V G P PG G + P QG G A E P P Sbjct: 1260 ENGDVGAMGPPGP-PGPRGPQGPGGTVGPQGPPGGVGSAGAVGEKGEAGEAGNPGPQGEP 1318 Query: 129 EAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQEVN 13 GR E ++GD PG AGP L PG G + N Sbjct: 1319 GVMGR-KGETGEKGDTGPPGAAGPPGLRGPPGDDGPKGN 1356 Score = 33.1 bits (72), Expect = 7.6 Identities = 24/82 (29%), Positives = 35/82 (42%) Frame = -1 Query: 264 PGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPEAHGRLDSEQRDQGD 85 PGQ G P+ + G R G A G R+ E+G+ A G + + G Sbjct: 1390 PGQPGPPGPSGEAGVPGPPGKRGSVGPA--GKEGRQGEKGSKGEAGAEGPV-GKTGPVGP 1446 Query: 84 HSGPGEAGPGPLHRGPGFAGQE 19 PG++GP L PG G++ Sbjct: 1447 QGPPGKSGPEGLRGIPGPVGEQ 1468 >UniRef50_Q3JTD9 Cluster: Putative uncharacterized protein; n=1; Burkholderia pseudomallei 1710b|Rep: Putative uncharacterized protein - Burkholderia pseudomallei (strain 1710b) Length = 489 Score = 36.3 bits (80), Expect = 0.82 Identities = 28/74 (37%), Positives = 36/74 (48%), Gaps = 1/74 (1%) Frame = -1 Query: 339 RESTGGRNRGREDGAVARAGTGAPHPGQEGERAPA-ARGRLQGEAHVRLLRGEAASGLPA 163 R S GGR RGR + AR G+ A + G+ A AR +G AH + A L A Sbjct: 296 RASRGGRRRGRTARSPARRGSRAHARRRAGQGASRYARAAARGRAHADGGGRDRAGRLRA 355 Query: 162 REVERGAPSRPEAH 121 R ER P+R + H Sbjct: 356 R--ERCGPARRQPH 367 >UniRef50_A5P3S2 Cluster: Putative uncharacterized protein; n=1; Methylobacterium sp. 4-46|Rep: Putative uncharacterized protein - Methylobacterium sp. 4-46 Length = 1077 Score = 36.3 bits (80), Expect = 0.82 Identities = 39/117 (33%), Positives = 42/117 (35%), Gaps = 2/117 (1%) Frame = -1 Query: 345 PNRESTGGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLP 166 P RE G RGR+ G AR G PG G RGR +G A R R P Sbjct: 423 PCREP-GSDARGRDRGRGARPGGLGVRPGGRG------RGRARGGAAPRAPRPHPRPRRP 475 Query: 165 AREVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPG--PLHRGPGFAGQEVNGSER 1 P R R D QR G S P G G H GPG A + R Sbjct: 476 GGADRARTPVRGRGAERGD-RQRGPGFGSRPRRGGSGGRGRHPGPGTAAAAAQAARR 531 >UniRef50_A5P062 Cluster: Putative uncharacterized protein; n=2; Proteobacteria|Rep: Putative uncharacterized protein - Methylobacterium sp. 4-46 Length = 829 Score = 36.3 bits (80), Expect = 0.82 Identities = 34/106 (32%), Positives = 38/106 (35%), Gaps = 6/106 (5%) Frame = -1 Query: 339 RESTGGRNRGREDGAVARAGTGAPH--PG-QEGERAPAARGRLQGEAHVRLLRGEAASGL 169 R + GR RG D R G G PG + R GR G R +G Sbjct: 224 RAAARGRGRGEGDRRTPRPGDGQDREAPGARPSRRDHGGVGRADGRPRRPRRRDAPGTGA 283 Query: 168 PAREVE--RGAPSRPEAHGRLDSEQRDQGDHSGPG-EAGPGPLHRG 40 E RGAP R GR D + R GP G G HRG Sbjct: 284 GRAEAHHPRGAPPRQARRGR-DPDARIDDQRPGPDPRRGLGRRHRG 328 >UniRef50_A5NX95 Cluster: Putative uncharacterized protein; n=1; Methylobacterium sp. 4-46|Rep: Putative uncharacterized protein - Methylobacterium sp. 4-46 Length = 1270 Score = 36.3 bits (80), Expect = 0.82 Identities = 34/109 (31%), Positives = 40/109 (36%), Gaps = 6/109 (5%) Frame = -1 Query: 345 PNRESTGGRNRGREDGAVAR------AGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGE 184 P R GGR R GAV R G P P + RA AARG + + G Sbjct: 292 PARSGAGGRPRRSRRGAVGRWCRRHAPRRGGPLPAPQPRRARAARGARRYRRGLVAGGGH 351 Query: 183 AASGLPAREVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGP 37 + R + G P R A GR +R GPG A G P Sbjct: 352 GGAVGLRRRLREGRPRRRPARGR----RRRALPAEGPGRAPGGQRRHRP 396 >UniRef50_A5NVB2 Cluster: LigA; n=1; Methylobacterium sp. 4-46|Rep: LigA - Methylobacterium sp. 4-46 Length = 907 Score = 36.3 bits (80), Expect = 0.82 Identities = 31/95 (32%), Positives = 35/95 (36%), Gaps = 2/95 (2%) Frame = -1 Query: 312 GREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSR 133 G D A P P A R RL G RLL E + LP R+ + R Sbjct: 470 GDRDHRGASPAGRRPDPAHPAPPARPRRARLDGRFRHRLLLAELPARLPVRQDQ----DR 525 Query: 132 PEAHGRLDSEQRDQGDHSGPGEAGPGPL--HRGPG 34 P R E R + DH G A P HRG G Sbjct: 526 PLLRPRHGPEARRRRDHPGDRRARAQPRHDHRGGG 560 >UniRef50_A2W4K4 Cluster: Putative uncharacterized protein; n=5; Proteobacteria|Rep: Putative uncharacterized protein - Burkholderia cenocepacia PC184 Length = 1294 Score = 36.3 bits (80), Expect = 0.82 Identities = 35/101 (34%), Positives = 43/101 (42%), Gaps = 2/101 (1%) Frame = -1 Query: 339 RESTGGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGL--P 166 R++ G R R AV R A +G+R P AR R + HV R A G P Sbjct: 145 RDAAAGHERHRP--AVRRNAARARRA--QGDRDPDARVRRDDQRHVHPDRPSAEHGRVGP 200 Query: 165 AREVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHR 43 ARE R P+ + RD DH+ G A P PL R Sbjct: 201 AREAARHPDLVPDVAEAAVAVPRDVRDHASVG-AVPFPLAR 240 >UniRef50_A0VWL6 Cluster: NAD-dependent epimerase/dehydratase; n=1; Dinoroseobacter shibae DFL 12|Rep: NAD-dependent epimerase/dehydratase - Dinoroseobacter shibae DFL 12 Length = 575 Score = 36.3 bits (80), Expect = 0.82 Identities = 30/94 (31%), Positives = 37/94 (39%), Gaps = 2/94 (2%) Frame = -1 Query: 345 PNRESTGGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAH--VRLLRGEAASG 172 P GG + E G P +GER RG L G+ H LR +G Sbjct: 436 PAEGGPGGADHHGEFRLAPPLPAGHARPA-DGERLSHGRGLLPGQVHEFADHLRHGRPAG 494 Query: 171 LPAREVERGAPSRPEAHGRLDSEQRDQGDHSGPG 70 R+ ER PSR HG LD + R + PG Sbjct: 495 RHQRDRERD-PSRGGPHGHLDPQHRRRAPGPRPG 527 >UniRef50_A0V8U7 Cluster: Acyl-CoA dehydrogenase, type 2-like; n=13; Proteobacteria|Rep: Acyl-CoA dehydrogenase, type 2-like - Delftia acidovorans SPH-1 Length = 587 Score = 36.3 bits (80), Expect = 0.82 Identities = 34/95 (35%), Positives = 39/95 (41%), Gaps = 1/95 (1%) Frame = -1 Query: 315 RGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGL-PAREVERGAP 139 RGR +R G G P G R P ARG + + G A L R RG P Sbjct: 25 RGRSRPHRSRPGHGLPQVGHRAPRLP-ARGPARHRGRLHHHPGGADQFLGHGRRRGRGRP 83 Query: 138 SRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPG 34 RP RL + R GD +G A HRGPG Sbjct: 84 GRPGHPLRLPA-LRHPGDAAGDCRADRAG-HRGPG 116 >UniRef50_Q26640 Cluster: Alpha2(IV)-like collagen; n=4; Strongylocentrotus purpuratus|Rep: Alpha2(IV)-like collagen - Strongylocentrotus purpuratus (Purple sea urchin) Length = 1747 Score = 36.3 bits (80), Expect = 0.82 Identities = 35/111 (31%), Positives = 42/111 (37%), Gaps = 4/111 (3%) Frame = -1 Query: 339 RESTGGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRL---LRGEAAS-G 172 R G RG G G GA G +G R L+G+ + RGE + G Sbjct: 1262 RGQKGNSGRGGFPGFPGEPG-GAGEIGSQGPRGAKGYPGLKGDRGIPAPIGSRGEPGTPG 1320 Query: 171 LPAREVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQE 19 P ERG RP G Q +GD G GP P GPG G + Sbjct: 1321 TPGGPGERGYSGRPGFEGP-KGLQGMKGDLGEQGPPGPLPPDLGPGPQGMQ 1370 >UniRef50_A6YIY0 Cluster: Major ampullate spidroin 2; n=3; Latrodectus hesperus|Rep: Major ampullate spidroin 2 - Latrodectus hesperus Length = 3779 Score = 36.3 bits (80), Expect = 0.82 Identities = 31/121 (25%), Positives = 37/121 (30%) Frame = -1 Query: 387 GXXXXXXXXXXXXEPNRESTGGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEA 208 G P R+ G A A AG+G G G A AA Sbjct: 1893 GAGAAAAAAAGGAGPGRQQAYGPGGSGATAAAAAAGSGPSGYGPGGAGAAAAAAAGGAGP 1952 Query: 207 HVRLLRGEAASGLPAREVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFA 28 + G SG A P R + +G + S GPG G GPG A Sbjct: 1953 GRQQAYGPGGSGAAAAAASGAGPGRQQVYGPVGSGAAAAAAAGGPGYGGQQGY--GPGGA 2010 Query: 27 G 25 G Sbjct: 2011 G 2011 Score = 34.3 bits (75), Expect = 3.3 Identities = 35/100 (35%), Positives = 38/100 (38%), Gaps = 3/100 (3%) Frame = -1 Query: 300 GAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGA--PSRPE 127 GA A A G PG + P G A AASG E GA PS P Sbjct: 3248 GAAAAAAAGGAGPGTQQAYGPGGSGAAAAAA--------AASGPGPSGYEPGAAGPSGPA 3299 Query: 126 AHGRLDSEQRDQGDHSGPGEAGPGPLHRGP-GFAGQEVNG 10 G + G SGPG G GP GP G GQ+ G Sbjct: 3300 GAGAAAAAAAAGG--SGPGGYGQGPSGYGPSGPGGQQGYG 3337 Score = 33.5 bits (73), Expect = 5.8 Identities = 31/121 (25%), Positives = 36/121 (29%) Frame = -1 Query: 387 GXXXXXXXXXXXXEPNRESTGGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEA 208 G P R+ G A A AG+G G G A AA Sbjct: 1115 GAGAAAAAAAGGAGPGRQQAYGPGGSGATAAAAVAGSGPSGYGPGGAGAAAAAAAGGAGP 1174 Query: 207 HVRLLRGEAASGLPAREVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFA 28 + G SG A P R + +G S GPG G GPG A Sbjct: 1175 GRQQAYGPGGSGAAAAAASGAGPGRQQVYGPGGSGAAAAAAAGGPGYGGQQGY--GPGGA 1232 Query: 27 G 25 G Sbjct: 1233 G 1233 Score = 33.5 bits (73), Expect = 5.8 Identities = 35/136 (25%), Positives = 43/136 (31%), Gaps = 5/136 (3%) Frame = -1 Query: 393 GQGXXXXXXXXXXXXEPNRESTGGRNRGREDGAVARAGTG-----APHPGQEGERAPAAR 229 GQG P R+ G A A G G PG G A AA Sbjct: 3510 GQGGSGAAAAAAGGAGPGRQQGYGPGSSGAAAAAAAGGPGFGGQQGYGPGGSGAAAAAAA 3569 Query: 228 GRLQGEAHVRLLRGEAASGLPAREVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPL 49 G G + G SG A A S P +G + G G +GPG Sbjct: 3570 GGA-GPGRQQAY-GPGGSGAAAAAAA-AAGSGPSGYGPSAAGPSGPGGSGAAGGSGPGGF 3626 Query: 48 HRGPGFAGQEVNGSER 1 +GP G G ++ Sbjct: 3627 GQGPAGYGPSGPGGQQ 3642 >UniRef50_P53420 Cluster: Collagen alpha-4(IV) chain precursor; n=36; Euteleostomi|Rep: Collagen alpha-4(IV) chain precursor - Homo sapiens (Human) Length = 1690 Score = 36.3 bits (80), Expect = 0.82 Identities = 30/93 (32%), Positives = 38/93 (40%), Gaps = 1/93 (1%) Frame = -1 Query: 300 GAVARAGTGAPHPGQEGERAPA-ARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPEA 124 G + G P PG G PA A GR + G+ G P + RGAP P Sbjct: 1224 GPRGKKGPPGP-PGSSGPPGPAGATGRAPKDIPDPGPPGD--QGPPGPDGPRGAPGPPGL 1280 Query: 123 HGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAG 25 G +D + + GD PG GP PG+ G Sbjct: 1281 PGSVDLLRGEPGDCGLPGPPGPPGPPGPPGYKG 1313 >UniRef50_UPI0001555BF3 Cluster: PREDICTED: similar to Thy-1 protein; n=1; Ornithorhynchus anatinus|Rep: PREDICTED: similar to Thy-1 protein - Ornithorhynchus anatinus Length = 333 Score = 35.9 bits (79), Expect = 1.1 Identities = 28/89 (31%), Positives = 42/89 (47%), Gaps = 3/89 (3%) Frame = -1 Query: 309 REDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAP--- 139 R G +A AG G P PG + APA GR Q ++ +SGLP+ +E G+P Sbjct: 168 RTQGGLAVAGGGLPSPGMQ-RAAPAILGR-QIRYYIYSGVSNLSSGLPS--LESGSPPPF 223 Query: 138 SRPEAHGRLDSEQRDQGDHSGPGEAGPGP 52 S A R+ + ++ + S + G P Sbjct: 224 STSPARARVQEKPQESSERSPRTQVGGSP 252 >UniRef50_UPI0000DD8409 Cluster: PREDICTED: hypothetical protein; n=2; Eutheria|Rep: PREDICTED: hypothetical protein - Homo sapiens Length = 352 Score = 35.9 bits (79), Expect = 1.1 Identities = 25/93 (26%), Positives = 33/93 (35%) Frame = -1 Query: 312 GREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSR 133 G DG G GA H G + A GR + + G A RE ERG P++ Sbjct: 22 GSADGGARGGGAGAGHYFSGGRASAALSGRAERSCEAPVRSGRAGG---RREAERGRPAK 78 Query: 132 PEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPG 34 + S++ G G R PG Sbjct: 79 LQGRTAAGSDRPRAAGAGDRGGGGCCSCRRSPG 111 >UniRef50_UPI0000D9F288 Cluster: PREDICTED: hypothetical protein; n=1; Macaca mulatta|Rep: PREDICTED: hypothetical protein - Macaca mulatta Length = 341 Score = 35.9 bits (79), Expect = 1.1 Identities = 27/85 (31%), Positives = 32/85 (37%) Frame = -1 Query: 309 REDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRP 130 R G A + GA + G R ARG L G A P R R AP Sbjct: 96 RSPGGAACSRLGAQSESRWGTRGAVARGALPGGARGPGTPSVEPGPRP-RPARREAPLPT 154 Query: 129 EAHGRLDSEQRDQGDHSGPGEAGPG 55 AH R + G+ S PG+ G G Sbjct: 155 AAHARSRGAKAAGGEGSAPGQRGAG 179 >UniRef50_UPI00015A783F Cluster: LOC553362 protein; n=1; Danio rerio|Rep: LOC553362 protein - Danio rerio Length = 1353 Score = 35.9 bits (79), Expect = 1.1 Identities = 30/103 (29%), Positives = 40/103 (38%), Gaps = 5/103 (4%) Frame = -1 Query: 306 EDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPE 127 E G G+ P PGQ G R P +G + GE + G+ + E G P Sbjct: 728 EKGESGHVGSMGP-PGQHGPRGP--QGAIGGEGPQGMPGAVGQPGVVGEKGEDGEAGNPG 784 Query: 126 AHGRLD-----SEQRDQGDHSGPGEAGPGPLHRGPGFAGQEVN 13 G E ++GD PG AGP + PG G + N Sbjct: 785 NVGETGLVGEKGEVGEKGDAGPPGAAGPPGIRGIPGSDGPKGN 827 >UniRef50_UPI000069F5B9 Cluster: alpha 1 type XIII collagen isoform 1; n=1; Xenopus tropicalis|Rep: alpha 1 type XIII collagen isoform 1 - Xenopus tropicalis Length = 514 Score = 35.9 bits (79), Expect = 1.1 Identities = 27/84 (32%), Positives = 37/84 (44%), Gaps = 3/84 (3%) Frame = -1 Query: 264 PGQEGERAPAARGRLQGEA-HVRLLRGEAASGLPAREVERGAPSRPEAHGRLDSEQRDQG 88 PG++GE + +GE LL G+ + ERG P P HG + E+ D G Sbjct: 257 PGKKGEPGASGTPGKKGEVGESGLLGAPGLPGVAGAKGERGMPGMPGKHG-IKGEKGDTG 315 Query: 87 DHSG--PGEAGPGPLHRGPGFAGQ 22 + G GE G L PG G+ Sbjct: 316 NTIGGVKGEPGNPGLQGPPGPKGE 339 >UniRef50_Q4SZ70 Cluster: Chromosome undetermined SCAF11805, whole genome shotgun sequence; n=2; Fungi/Metazoa group|Rep: Chromosome undetermined SCAF11805, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 848 Score = 35.9 bits (79), Expect = 1.1 Identities = 38/106 (35%), Positives = 43/106 (40%), Gaps = 5/106 (4%) Frame = -1 Query: 327 GGRNRGREDGAVARAG-TGAPH-PGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREV 154 GG GA R G +G P PG GE+ A R + G A V+ G G P Sbjct: 713 GGPGGPGISGAPGRPGESGRPGAPGSPGEKGQAGRDGIPGPAGVKGEPGLPGYGGPG--- 769 Query: 153 ERGAPSRPEAHG--RLDSEQRDQGDHSGPGEAG-PGPLHRGPGFAG 25 G P P A G L QG GEAG PGP PG +G Sbjct: 770 SPGIPGSPGAKGDPGLPGPSGAQGFPGSKGEAGFPGP-PGPPGSSG 814 Score = 35.1 bits (77), Expect = 1.9 Identities = 33/92 (35%), Positives = 37/92 (40%), Gaps = 6/92 (6%) Frame = -1 Query: 279 TGAPH-PGQEGERAPAARGR-----LQGEAHVRLLRGEAASGLPAREVERGAPSRPEAHG 118 TG P PGQ G APA GR G + +GE GLP G P P G Sbjct: 261 TGFPGIPGQPG--APAGPGRPGVDGRPGAPGIPGPKGEPGFGLPGPP---GIPGVPGPKG 315 Query: 117 RLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQ 22 L + D G GPG G GPG G+ Sbjct: 316 -LPGPKGDPGFPGGPGSPGRSGFDGGPGLKGE 346 >UniRef50_Q4SZ69 Cluster: Chromosome undetermined SCAF11805, whole genome shotgun sequence; n=2; Tetraodon nigroviridis|Rep: Chromosome undetermined SCAF11805, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 471 Score = 35.9 bits (79), Expect = 1.1 Identities = 27/81 (33%), Positives = 38/81 (46%), Gaps = 1/81 (1%) Frame = -1 Query: 264 PGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPEAHGRLDSEQRDQGD 85 PG EG R P G ++GE + G+ G P ++ + G+ P + G L +GD Sbjct: 12 PGPEGPRGPPGSGGVKGEKGIPGAPGQ--PGFPGQKGDLGSSGIPGSPG-LPGAPGLKGD 68 Query: 84 HSGPGEAG-PGPLHRGPGFAG 25 PG +G PGP PG G Sbjct: 69 IGLPGVSGFPGP-KGDPGLPG 88 >UniRef50_Q4RX03 Cluster: Chromosome 11 SCAF14979, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome 11 SCAF14979, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 1208 Score = 35.9 bits (79), Expect = 1.1 Identities = 32/103 (31%), Positives = 38/103 (36%), Gaps = 3/103 (2%) Frame = -1 Query: 312 GREDGAVARAGTG-APHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPS 136 GR+ A R G + G GE P G + RG GLP + ERG P Sbjct: 981 GRQGDAGLRGPAGPSGEKGDAGEDGPVGPPGPSGPQGLAGQRGIV--GLPGQRGERGFPG 1038 Query: 135 RPEAHGRLDSEQRDQ--GDHSGPGEAGPGPLHRGPGFAGQEVN 13 P G + GD PG GP L G G+E N Sbjct: 1039 LPGPSGEPGKQGAPGTGGDRGPPGPVGPPGLTGPAGELGREFN 1081 Score = 35.1 bits (77), Expect = 1.9 Identities = 33/106 (31%), Positives = 41/106 (38%), Gaps = 2/106 (1%) Frame = -1 Query: 336 ESTGGRNRGRE-DGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAR 160 E G++ R G + G P+ G++GE PA G RGE S PA Sbjct: 818 EGAPGKDGARGLTGPIGPPGPSGPN-GEKGETGPAGPSGAPGTRGTPGDRGETGSPGPAG 876 Query: 159 EV-ERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAG 25 GA +P G E +GD PG GP PG AG Sbjct: 877 FAGPPGADGQPGIKGE-QGETGQKGDAGAPGPQGPS---GAPGPAG 918 >UniRef50_Q1LYN9 Cluster: Novel protein similar to vertebrate collagen family; n=3; Danio rerio|Rep: Novel protein similar to vertebrate collagen family - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 531 Score = 35.9 bits (79), Expect = 1.1 Identities = 29/94 (30%), Positives = 39/94 (41%), Gaps = 2/94 (2%) Frame = -1 Query: 300 GAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPEAH 121 G + +AG G+ GE P +QG +RG G+P +RG P P Sbjct: 164 GPIGQAGQAGQKVGEPGEPGPTGAQGIQG------IRGNP--GIPGSVGQRGQPGDPGEP 215 Query: 120 GRL-DSEQRDQGDHSG-PGEAGPGPLHRGPGFAG 25 GR D +R + +G G GP PG AG Sbjct: 216 GRQGDRGKRGKNGSAGAQGAIGPPGQPGPPGLAG 249 >UniRef50_Q6I7K4 Cluster: Orf663 protein; n=3; Proteobacteria|Rep: Orf663 protein - Myxococcus xanthus Length = 663 Score = 35.9 bits (79), Expect = 1.1 Identities = 18/42 (42%), Positives = 22/42 (52%), Gaps = 3/42 (7%) Frame = -1 Query: 339 RESTGGRNRGREDGAV---ARAGTGAPHPGQEGERAPAARGR 223 R GGR +GR G R G G PHP + ER P+ RG+ Sbjct: 606 RAPHGGRGQGRAPGCDWRRVRRGRGRPHPERRQERGPSVRGQ 647 >UniRef50_Q4IVL7 Cluster: Putative uncharacterized protein precursor; n=1; Azotobacter vinelandii AvOP|Rep: Putative uncharacterized protein precursor - Azotobacter vinelandii AvOP Length = 1343 Score = 35.9 bits (79), Expect = 1.1 Identities = 41/123 (33%), Positives = 51/123 (41%), Gaps = 19/123 (15%) Frame = -1 Query: 345 PNRESTGGRNRGR----EDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAA 178 P G R RG G R +G PG +AP AR R QG +R G+ Sbjct: 322 PALAGRGERGRGPLQPLRGGQSGRHRSGGALPGDALGQAPGARRRRQGR-RLRRAAGQTQ 380 Query: 177 SGLPAREVERGA---PS----------RPEAHGRLDS--EQRDQGDHSGPGEAGPGPLHR 43 G PA E++ GA P+ RP A L +Q QG +GP + PGPL Sbjct: 381 RGGPA-ELQAGAEPQPARRRGLGRPGHRPTARAELPGRRQQPAQGHRTGPRQ--PGPLGE 437 Query: 42 GPG 34 G G Sbjct: 438 GAG 440 >UniRef50_A5P2Z2 Cluster: Integral membrane protein-like protein; n=3; Alphaproteobacteria|Rep: Integral membrane protein-like protein - Methylobacterium sp. 4-46 Length = 418 Score = 35.9 bits (79), Expect = 1.1 Identities = 27/75 (36%), Positives = 32/75 (42%), Gaps = 7/75 (9%) Frame = -1 Query: 246 RAPAARGRLQGEAH---VRLLRGEAASGLPAREVERGAPSR---PEAHGR-LDSEQRDQG 88 R P AR R G+A +R R E G+ A + R P R P H R D +R Sbjct: 43 RRPGARQRQLGDARALPLRRRRAERLDGVAAHRLARRGPLRAGDPRPHRRDPDRARRGAA 102 Query: 87 DHSGPGEAGPGPLHR 43 H G GP PL R Sbjct: 103 AHRPGGRCGPAPLAR 117 >UniRef50_A5P281 Cluster: LigA; n=1; Methylobacterium sp. 4-46|Rep: LigA - Methylobacterium sp. 4-46 Length = 321 Score = 35.9 bits (79), Expect = 1.1 Identities = 30/89 (33%), Positives = 35/89 (39%), Gaps = 1/89 (1%) Frame = -1 Query: 297 AVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPEAHG 118 A ARAGTG+ P + G P A R + H SG AR + + PE G Sbjct: 60 ASARAGTGSRAPAESGN--PIAHCRSRAGPH-------GGSGSGARWSPHRSGAAPERAG 110 Query: 117 RLDSEQRDQGDHSGPGE-AGPGPLHRGPG 34 D H G AGPG R PG Sbjct: 111 EKDERLHGNPRHGARGRGAGPGTRRREPG 139 >UniRef50_A0VWK8 Cluster: Putative uncharacterized protein; n=1; Dinoroseobacter shibae DFL 12|Rep: Putative uncharacterized protein - Dinoroseobacter shibae DFL 12 Length = 527 Score = 35.9 bits (79), Expect = 1.1 Identities = 39/105 (37%), Positives = 42/105 (40%), Gaps = 3/105 (2%) Frame = -1 Query: 327 GGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASG-LPAREVE 151 GGR + R G R AP P EG APAA QG R G+A G L R Sbjct: 49 GGRAQARP-GRFGRGRFAAPPP-DEGAEAPAAS---QGGFAGRFGGGQAGQGGLRGRFAG 103 Query: 150 RGAPSRPEAHGRLDSEQRDQGDHSGPGEAGP--GPLHRGPGFAGQ 22 G P +P G QG G AGP P PG AGQ Sbjct: 104 MGQPGQPGQPGPAGQAAGGQGGGVGGRFAGPRGAPGAAAPG-AGQ 147 >UniRef50_A0VBD2 Cluster: Putative uncharacterized protein precursor; n=1; Delftia acidovorans SPH-1|Rep: Putative uncharacterized protein precursor - Delftia acidovorans SPH-1 Length = 608 Score = 35.9 bits (79), Expect = 1.1 Identities = 32/111 (28%), Positives = 45/111 (40%), Gaps = 4/111 (3%) Frame = -1 Query: 321 RNRGREDGAVARAGTGAPHPGQEG-ERAPAARGR--LQGEAHVRLLRGEAASGLPAREVE 151 R+R G AG H ++G E+AP +G L G AH + +A ++ Sbjct: 142 RHRNAVGGRQVAAGAEQHHGQRDGNEQAPVDQGHVDLAGLAHAGVAHFQARQIAQLYDLA 201 Query: 150 RGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFA-GQEVNGSER 1 R A + L S+ R G H G+ GPG H G G V +R Sbjct: 202 RDAEGARDQG--LRSDDRGHGGHQHQGQQGPGRGHHVEGILHGGRVGQQQR 250 >UniRef50_Q2QMM2 Cluster: Retrotransposon protein, putative, unclassified; n=5; Oryza sativa (japonica cultivar-group)|Rep: Retrotransposon protein, putative, unclassified - Oryza sativa subsp. japonica (Rice) Length = 1770 Score = 35.9 bits (79), Expect = 1.1 Identities = 29/87 (33%), Positives = 36/87 (41%), Gaps = 1/87 (1%) Frame = -1 Query: 312 GREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSR 133 GRE G A G P P G P A + G + + G A +G P R +PS Sbjct: 1094 GREQGGEAPEPNGGPRPPMAGAGPPPACPTVPGASDPQDGPG-ATAGRP-----RLSPSD 1147 Query: 132 PEAHG-RLDSEQRDQGDHSGPGEAGPG 55 PE G + R D PG+A PG Sbjct: 1148 PEVVGTEAECAPRGLSDEERPGDAAPG 1174 >UniRef50_A7E3J6 Cluster: Putative DUX4 protein; n=1; Procavia capensis|Rep: Putative DUX4 protein - Procavia capensis (Cape hyrax) (Rock dassie) Length = 481 Score = 35.9 bits (79), Expect = 1.1 Identities = 28/94 (29%), Positives = 39/94 (41%) Frame = -1 Query: 324 GRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERG 145 G R A A G+ GER +G+ QG+ + G+ G E + Sbjct: 64 GNRRASRSRKSRSASARASGGGEAGERQGQGQGQGQGQGQGQ---GQG-QGQGQDETQTQ 119 Query: 144 APSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHR 43 AP++PEA GRLD +R G P + P R Sbjct: 120 APAQPEAPGRLDEPER--GRRRRPADNFPAAARR 151 >UniRef50_Q1JTJ3 Cluster: SET-domain protein, putative; n=1; Toxoplasma gondii RH|Rep: SET-domain protein, putative - Toxoplasma gondii RH Length = 4382 Score = 35.9 bits (79), Expect = 1.1 Identities = 29/95 (30%), Positives = 46/95 (48%) Frame = -2 Query: 407 LAAWLDKEVEATENEWNEGRNQTVKALEDAIEGEKTEQWRAQGQELLIQAKKENVLLQLE 228 L W+ EA + W +G+++ EDA EGEKT R + Q++ +A++ Sbjct: 3289 LQLWVPLFCEAAQLLWGDGQSEA----EDASEGEKTN--REEEQKIYGRAERNREGRTAS 3342 Query: 227 AAYRERLMYAYSEVKRRLDYQLEKSNVERRLAQKH 123 + R A E K D LEKS+ +R A++H Sbjct: 3343 SPLRCDCEEARGERKSE-DADLEKSHCMQRSAERH 3376 >UniRef50_Q19050 Cluster: Putative uncharacterized protein col-186; n=2; Caenorhabditis|Rep: Putative uncharacterized protein col-186 - Caenorhabditis elegans Length = 333 Score = 35.9 bits (79), Expect = 1.1 Identities = 32/114 (28%), Positives = 46/114 (40%), Gaps = 5/114 (4%) Frame = -1 Query: 345 PNRESTGGRNR--GREDGAVARAGTGAPH-PGQEGERAPAARGRLQGEAHVRLLRGEA-- 181 P R+ GR GR G+ G P PG G P R + G + + G+ Sbjct: 194 PGRQGKQGRKGPPGRPGGSGLPGDKGLPGLPGSMGPVGPEGRPGIPGLPGGKGIPGKLIE 253 Query: 180 ASGLPAREVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQE 19 +GL + G P P G+ ++ D S PG+ G + GPG GQ+ Sbjct: 254 VAGLEGPPGQSGLPGMPGLQGKPGNDGYPGRDGS-PGDQGDDGIPGGPGKPGQK 306 >UniRef50_A7S288 Cluster: Predicted protein; n=1; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 1448 Score = 35.9 bits (79), Expect = 1.1 Identities = 34/107 (31%), Positives = 43/107 (40%), Gaps = 10/107 (9%) Frame = -1 Query: 315 RGREDGAVARAGTGAPHPGQEGERAPAARGRL--QGEAHVRLLRGEAASGLPAREVERGA 142 +G + A G GAP G G P ARG + G ++R GLP + +RG Sbjct: 104 KGTQGDAGTPGGNGAP--GLPGP--PGARGLIGPPGPPGKPVIRRVGIPGLPGKRGDRGT 159 Query: 141 PSR---PEAHGR-----LDSEQRDQGDHSGPGEAGPGPLHRGPGFAG 25 P P G + D GD PGEAGP + G AG Sbjct: 160 PGAKGPPGLAGEPGEPGTQGPRGDPGDVGEPGEAGPSGRNGANGTAG 206 Score = 34.7 bits (76), Expect = 2.5 Identities = 26/90 (28%), Positives = 32/90 (35%), Gaps = 6/90 (6%) Frame = -1 Query: 264 PGQEGERAPAARGRLQGE-AHVRLLRGEAASGLPAREVERGAPSRPEAHGRLDS-----E 103 PG +GER P QGE H + A G+P + G P G S Sbjct: 613 PGAQGERGPRGEQGKQGEKGHAGEGGADGAPGIPGEQGPMGPVGAPGPVGNAGSPGSPGP 672 Query: 102 QRDQGDHSGPGEAGPGPLHRGPGFAGQEVN 13 +GD PG+ GP G G N Sbjct: 673 SGPKGDSGEPGQQGPRGTQGSDGKRGANGN 702 >UniRef50_Q4P641 Cluster: Putative uncharacterized protein; n=1; Ustilago maydis|Rep: Putative uncharacterized protein - Ustilago maydis (Smut fungus) Length = 631 Score = 35.9 bits (79), Expect = 1.1 Identities = 30/99 (30%), Positives = 36/99 (36%), Gaps = 3/99 (3%) Frame = -1 Query: 342 NRESTG-GRNRGREDGAVARAGTGAPHPGQEG--ERAPAARGRLQGEAHVRLLRGEAASG 172 +R +G G G GA GTG+ G +G ++ P G G A G SG Sbjct: 457 SRSGSGTGSGSGTGSGAGTGTGTGSSDSGDDGPIDKGPPDTGAAAGGATGAPGTGADTSG 516 Query: 171 LPAREVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPG 55 P GAP P A G G PG PG Sbjct: 517 APGN----GAPGAPGAGGNGAPGAPGSGASGTPGNGAPG 551 Score = 33.1 bits (72), Expect = 7.6 Identities = 30/85 (35%), Positives = 31/85 (36%), Gaps = 5/85 (5%) Frame = -1 Query: 300 GAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVER-----GAPS 136 GA GTGA G G AP A G G G ASG P GAP Sbjct: 503 GATGAPGTGADTSGAPGNGAPGAPGA--GGNGAPGAPGSGASGTPGNGAPGAPGAPGAPG 560 Query: 135 RPEAHGRLDSEQRDQGDHSGPGEAG 61 P A G D+ G PG AG Sbjct: 561 APGAPGAPDAP--GNGAPGAPGAAG 583 >UniRef50_P29143 Cluster: Halolysin precursor; n=5; Halobacteriales|Rep: Halolysin precursor - Halophilic archaebacteria (strain 172p1) Length = 530 Score = 35.9 bits (79), Expect = 1.1 Identities = 21/55 (38%), Positives = 26/55 (47%), Gaps = 3/55 (5%) Frame = -1 Query: 210 AHVRLLRGEAASGLPAREVERGAPSRPEAHGRLDSEQ---RDQGDHSGPGEAGPG 55 AH L E S L V+ G S + HGR+D+ Q D GD G G+ G G Sbjct: 367 AHPNLSNAELRSHLQNTAVDVGLSSEEQGHGRVDAGQAVTTDPGDGGGGGDPGDG 421 >UniRef50_P25067 Cluster: Collagen alpha-2(VIII) chain precursor; n=52; Amniota|Rep: Collagen alpha-2(VIII) chain precursor - Homo sapiens (Human) Length = 703 Score = 35.9 bits (79), Expect = 1.1 Identities = 31/96 (32%), Positives = 39/96 (40%), Gaps = 2/96 (2%) Frame = -1 Query: 300 GAVARAGTGAPH-PGQEGERAPAARGRLQGEAHVRLLRGEAAS-GLPAREVERGAPSRPE 127 G + G G P PG +G+R PA L G+ RGE G P + +G P Sbjct: 308 GLIGPTGYGMPGLPGPKGDRGPAGVPGLLGD------RGEPGEDGEPGEQGPQGLGGPPG 361 Query: 126 AHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQE 19 G R +G GEAGPG PG G + Sbjct: 362 LPGSAGLPGR-RGPPGPKGEAGPGGPPGVPGIRGDQ 396 >UniRef50_UPI000155DCFE Cluster: PREDICTED: hypothetical protein; n=1; Equus caballus|Rep: PREDICTED: hypothetical protein - Equus caballus Length = 381 Score = 35.5 bits (78), Expect = 1.4 Identities = 29/88 (32%), Positives = 37/88 (42%), Gaps = 4/88 (4%) Frame = -1 Query: 309 REDGAVARAGTGAPH--PGQEGERAPAARGRLQGEAH--VRLLRGEAASGLPAREVERGA 142 R+ G V R G G P PG+ G A RG E H R RG LPAR ++ Sbjct: 197 RKPGPVPRGG-GFPRQAPGRSGPPAGPRRGHGSREHHHAARPARGRRQRRLPARPLQGPQ 255 Query: 141 PSRPEAHGRLDSEQRDQGDHSGPGEAGP 58 + + G L + + GPGE P Sbjct: 256 AALLQKRGLLPAHPPRRPSGRGPGEERP 283 >UniRef50_UPI0000F2EAE8 Cluster: PREDICTED: hypothetical protein, partial; n=1; Monodelphis domestica|Rep: PREDICTED: hypothetical protein, partial - Monodelphis domestica Length = 467 Score = 35.5 bits (78), Expect = 1.4 Identities = 41/116 (35%), Positives = 52/116 (44%), Gaps = 7/116 (6%) Frame = -1 Query: 345 PNRESTGGR-----NRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEA 181 P R GGR +RG+E GA AG GA G A RG+ +G A RG Sbjct: 361 PKRSPDGGRGHAARDRGQERGAA--AGRGAGGTG----HAARDRGQERGPA---AGRGAG 411 Query: 180 ASGLPARE--VERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQE 19 +G AR+ ERG P+ G RD+G GP AG G G G+A ++ Sbjct: 412 GTGYAARDRGQERG-PAAGRGAGGTGYAARDRGQERGPA-AGRGA--GGTGYAARD 463 >UniRef50_UPI0000E813B5 Cluster: PREDICTED: hypothetical protein; n=1; Gallus gallus|Rep: PREDICTED: hypothetical protein - Gallus gallus Length = 229 Score = 35.5 bits (78), Expect = 1.4 Identities = 34/107 (31%), Positives = 48/107 (44%), Gaps = 2/107 (1%) Frame = -1 Query: 339 RESTGGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAR 160 ++STG R +GR G PG++G+ P +RG+ + R +G +G P R Sbjct: 57 QDSTGARPQGRHP----TQGQHRRPPGRDGQ-GPPSRGQRRFAPLYRTPKGSPVAGRPRR 111 Query: 159 EVERGAPSRPEAHGRLDSEQRDQG--DHSGPGEAGPGPLHRGPGFAG 25 RGA + ++ SEQR G S P E G RG AG Sbjct: 112 RCPRGAARQRDSR----SEQRAAGARPRSRPLEGGSS---RGRAAAG 151 >UniRef50_UPI0000DD78A3 Cluster: PREDICTED: hypothetical protein; n=1; Homo sapiens|Rep: PREDICTED: hypothetical protein - Homo sapiens Length = 156 Score = 35.5 bits (78), Expect = 1.4 Identities = 27/93 (29%), Positives = 34/93 (36%), Gaps = 8/93 (8%) Frame = -1 Query: 291 ARAGTGAPHPGQEGERA----PAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPEA 124 A AG P P E A P A+G G A R + R + P+RP Sbjct: 49 AEAGKARPAPAGRAESAQRAGPRAKGAGGGRARARSVSAGLGPQPRGRTRPKPPPARPHP 108 Query: 123 HGRLDSEQRDQGDHSGPGEAG----PGPLHRGP 37 H +R+ H E+G PGP GP Sbjct: 109 HPGASERKRNGTRHGARAESGRIGAPGPSPHGP 141 >UniRef50_UPI00015A592A Cluster: Type IV collagen alpha 4 chain; n=3; Danio rerio|Rep: Type IV collagen alpha 4 chain - Danio rerio Length = 1639 Score = 35.5 bits (78), Expect = 1.4 Identities = 33/91 (36%), Positives = 39/91 (42%), Gaps = 7/91 (7%) Frame = -1 Query: 276 GAPHP-GQEGERAPAARGRLQGEAHVRLLRG-EAASGLPAREVERGAPSRPEAHGR--LD 109 GAP P G G + + G L+G G P R+ ERG P GR Sbjct: 711 GAPGPLGPSGVQGCQGPKGVPGPPGPIGLQGMSGVPGYPGRKGERGKDGAPGPPGRPGKS 770 Query: 108 SEQRDQGDHSGPGEAGPGPL--HRG-PGFAG 25 EQ D+GD PG+ G L HRG PG G Sbjct: 771 PEQCDKGDEGLPGKKGEQGLIGHRGYPGEKG 801 >UniRef50_UPI000065FCBB Cluster: Homolog of Oncorhynchus mykiss "Vitelline envelope protein alpha.; n=1; Takifugu rubripes|Rep: Homolog of Oncorhynchus mykiss "Vitelline envelope protein alpha. - Takifugu rubripes Length = 195 Score = 35.5 bits (78), Expect = 1.4 Identities = 30/94 (31%), Positives = 35/94 (37%), Gaps = 12/94 (12%) Frame = -1 Query: 279 TGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPEAH-GRLDSE 103 TG HPGQ GER P + G+ P + ER + E H G+ D Sbjct: 4 TGERHPGQTGERHPGQKSERHPGQKCERHPGQTGERHPGQRDERHPGQKSERHPGQTDER 63 Query: 102 QRDQ--GDHSG------PG---EAGPGPLHRGPG 34 Q G H G PG E PG R PG Sbjct: 64 HPGQKSGRHPGQRDERHPGQRDERHPGQTERHPG 97 >UniRef50_UPI0000ECA83C Cluster: Centrosome-associated protein CEP250 (Centrosomal protein 2) (Centrosomal Nek2-associated protein 1) (C-Nap1).; n=2; Gallus gallus|Rep: Centrosome-associated protein CEP250 (Centrosomal protein 2) (Centrosomal Nek2-associated protein 1) (C-Nap1). - Gallus gallus Length = 2424 Score = 35.5 bits (78), Expect = 1.4 Identities = 47/167 (28%), Positives = 78/167 (46%), Gaps = 7/167 (4%) Frame = -2 Query: 482 MEHEYYSGLSLLVMVYVAHVKFGPKLAAWLDKEVEATENEWNEGR--NQTVKALEDAIEG 309 M + + L LV+ + K+ L +++E +E E + R N ++ ED+ +G Sbjct: 405 MSNSHQQHLKSLVLALKCDCENLEKIRGELQQKLELSEQEASRLRQSNTELQLKEDSAQG 464 Query: 308 EKTEQWRAQGQELLIQAKKENVLLQLEAAYRERLMYAYSEVKRRLDYQLEKSNVERRL-- 135 EK EQ A + E VL L AA E+ +E+ + +LE+S+++R L Sbjct: 465 EKVEQQLAMER---AHHDHELVLKDL-AALEEKHSLLQNELVAARE-KLEESHLQRDLLK 519 Query: 134 AQKHMVDWIVSNVTK---AITPDQEKQALDRCIADLASLARK*TEAN 3 +KH + + K A+T Q K L+ IADL + A K + N Sbjct: 520 QEKHELTVALEKAEKSVAALTGAQNK--LNSEIADLHTAAAKMSSIN 564 >UniRef50_Q4S5M8 Cluster: Chromosome 9 SCAF14729, whole genome shotgun sequence; n=5; Euteleostomi|Rep: Chromosome 9 SCAF14729, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 1475 Score = 35.5 bits (78), Expect = 1.4 Identities = 34/105 (32%), Positives = 40/105 (38%), Gaps = 3/105 (2%) Frame = -1 Query: 333 STGGRNRGREDGAVARAGTGAPH--PGQEGERAPAARGRLQGEAHVRLLRGEAA-SGLPA 163 STG G V G P PG+EG G A V+ RG +G P Sbjct: 1062 STGSAGDRGPPGPVGPPGLTGPSGDPGREGAAGSDGPPGRDGAAGVKGERGNTGPAGAPG 1121 Query: 162 REVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFA 28 GAP P G L +Q D+G+ G AGP L G A Sbjct: 1122 AP---GAPGAPGPVGPL-GKQGDRGEAGAQGPAGPPGLAGARGMA 1162 >UniRef50_Q9RX57 Cluster: Putative uncharacterized protein; n=1; Deinococcus radiodurans|Rep: Putative uncharacterized protein - Deinococcus radiodurans Length = 839 Score = 35.5 bits (78), Expect = 1.4 Identities = 29/94 (30%), Positives = 37/94 (39%), Gaps = 3/94 (3%) Frame = -1 Query: 297 AVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPEAHG 118 A AR G+GA G APAA Q G+ AR + G PS A Sbjct: 529 AAARGGSGAAGGAAGGASAPAAARPAQTPGASAGGASGGGEGVSARPSQGGTPSGTPASA 588 Query: 117 RLDSEQRDQGDHSGPGEAGPG---PLHRGPGFAG 25 + + + G+ SG G +G G P PG G Sbjct: 589 PVAAGRPAGGEGSGSGTSGSGSGAPAAARPGQGG 622 >UniRef50_Q5PIF1 Cluster: Subunit S of type I restriction-modification system; n=2; Salmonella|Rep: Subunit S of type I restriction-modification system - Salmonella paratyphi-a Length = 462 Score = 35.5 bits (78), Expect = 1.4 Identities = 21/65 (32%), Positives = 27/65 (41%) Frame = -2 Query: 410 KLAAWLDKEVEATENEWNEGRNQTVKALEDAIEGEKTEQWRAQGQELLIQAKKENVLLQL 231 +L AW D + N N + T L A GE T QWRA+ L+ LL+ Sbjct: 385 QLFAWADTIEKQVNNALNRVNSLTQSILAKAFRGELTAQWRAENPSLISGENSAAALLEK 444 Query: 230 EAAYR 216 A R Sbjct: 445 IKAER 449 >UniRef50_Q2IFX3 Cluster: Putative uncharacterized protein precursor; n=1; Anaeromyxobacter dehalogenans 2CP-C|Rep: Putative uncharacterized protein precursor - Anaeromyxobacter dehalogenans (strain 2CP-C) Length = 293 Score = 35.5 bits (78), Expect = 1.4 Identities = 37/112 (33%), Positives = 45/112 (40%), Gaps = 5/112 (4%) Frame = -1 Query: 345 PNRESTGGRNRG-REDGAVARA--GTGAPHPGQ--EGERAPAARGRLQGEAHVRLLRGEA 181 P R G + RE A R G G G+ +GER RG + A R E Sbjct: 129 PARRGLGAASAALREAAARLRGLRGGGLRGGGERGDGERGDGERGDGERGAAPRRRGPEV 188 Query: 180 ASGLPAREVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAG 25 E+E G A GR + RD+ D SGP G G + R PG AG Sbjct: 189 VEVKSPAELEAGV-----ARGRPEPTYRDRADRSGPHMRG-GGVRRAPGAAG 234 >UniRef50_Q9KXB9 Cluster: Tail fiber protein; n=7; root|Rep: Tail fiber protein - Bacteriophage VT2-Sa Length = 645 Score = 35.5 bits (78), Expect = 1.4 Identities = 30/102 (29%), Positives = 40/102 (39%), Gaps = 1/102 (0%) Frame = -1 Query: 327 GGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAAS-GLPAREVE 151 G R GAV AG G++GER P L+G+ R +G+ G + + Sbjct: 277 GERGDVGAQGAVGPAGPRG-EKGEQGERGPQGIPGLKGDTGERGPKGDQGDMGPKGEKGD 335 Query: 150 RGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAG 25 G P+ P+ E QG GE G PG AG Sbjct: 336 PGGPAGPQGPKGERGEAGPQGPMGARGERGETGPRGEPGPAG 377 >UniRef50_A7H8S3 Cluster: Putative uncharacterized protein precursor; n=1; Anaeromyxobacter sp. Fw109-5|Rep: Putative uncharacterized protein precursor - Anaeromyxobacter sp. Fw109-5 Length = 298 Score = 35.5 bits (78), Expect = 1.4 Identities = 32/94 (34%), Positives = 42/94 (44%), Gaps = 6/94 (6%) Frame = -1 Query: 330 TGGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVE 151 TGG+ +D A R+ TG GQ +RAP+ H G +ASG ARE Sbjct: 31 TGGQRSPGDDAA--RSTTGNQGSGQGSDRAPSGSDGSTSSPHSSPQTGSSASG--ARETG 86 Query: 150 RGAPSRPEAHG-----RLDSEQRDQGDH-SGPGE 67 G+ + P G + D E+R Q H S GE Sbjct: 87 TGSATAPSPSGSQSQLKGDLEERIQELHASNQGE 120 >UniRef50_A7DG98 Cluster: Putative uncharacterized protein; n=2; Methylobacterium extorquens PA1|Rep: Putative uncharacterized protein - Methylobacterium extorquens PA1 Length = 695 Score = 35.5 bits (78), Expect = 1.4 Identities = 29/78 (37%), Positives = 35/78 (44%), Gaps = 6/78 (7%) Frame = -1 Query: 345 PNRESTGGR-NRGREDGAVARAGTGA---PHPGQEGERA-PAARGRLQGEAHV-RLLRGE 184 P+ +T GR R D A G+GA P P G A PA E V RL R Sbjct: 162 PSEPATAGRPRRSAFDAPTALRGSGAFSAPAPLGSGTTANPAPASEESEEPSVARLPRFR 221 Query: 183 AASGLPAREVERGAPSRP 130 +A+ LP RG P+RP Sbjct: 222 SATSLPGSAATRGTPARP 239 >UniRef50_A5P2L0 Cluster: Putative uncharacterized protein; n=3; cellular organisms|Rep: Putative uncharacterized protein - Methylobacterium sp. 4-46 Length = 1094 Score = 35.5 bits (78), Expect = 1.4 Identities = 32/79 (40%), Positives = 35/79 (44%), Gaps = 11/79 (13%) Frame = -1 Query: 327 GGRNRG--REDGAVARAGTGAPHPG-QEGERAPAARG---RLQGEAHVRLLRGEAASGLP 166 GG G R+DG R G GA G + G APAARG R + A R A GL Sbjct: 589 GGAGGGAERDDGGAGREGGGAGGGGGRAGGAAPAARGGDRRARRAARGRPSARRGARGLS 648 Query: 165 AREVER-----GAPSRPEA 124 R R G PS PEA Sbjct: 649 GRPAARPAAASGGPSLPEA 667 >UniRef50_A5NQE3 Cluster: LigA; n=1; Methylobacterium sp. 4-46|Rep: LigA - Methylobacterium sp. 4-46 Length = 494 Score = 35.5 bits (78), Expect = 1.4 Identities = 35/98 (35%), Positives = 40/98 (40%), Gaps = 2/98 (2%) Frame = -1 Query: 321 RNRGREDGAVAR-AGTGAPHPGQEGERAPAARGRLQGEAHVRLLR-GEAASGLPAREVER 148 R GR GA R AG A G P G +G RLLR G SG PA R Sbjct: 357 RPEGRHPGAARRHAGRAAIRRGLRRRLPP---GPPRGLRDPRLLRRGRRGSG-PAPHRPR 412 Query: 147 GAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPG 34 + P AH + D G +G +GP P R PG Sbjct: 413 RPAADPPAHRAAHPDAGDDGRPAGLSRSGPAPRCR-PG 449 >UniRef50_A5NPK7 Cluster: Putative uncharacterized protein; n=3; Proteobacteria|Rep: Putative uncharacterized protein - Methylobacterium sp. 4-46 Length = 706 Score = 35.5 bits (78), Expect = 1.4 Identities = 30/103 (29%), Positives = 34/103 (33%), Gaps = 3/103 (2%) Frame = -1 Query: 333 STGGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREV 154 S G GR A + + GAP PG P AR A + A Sbjct: 552 SASGAGAGRRPAAASSSAAGAPRPGPPAAGPPPARPAAASPAASAERPARPPAAAAASSS 611 Query: 153 ERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGP-GPL--HRGPG 34 P RP A GR + G PG P GP R PG Sbjct: 612 AGAGPRRPAAPGRPGLGRPGLG-RPAPGRPAPDGPAAGGRSPG 653 >UniRef50_A5NMK3 Cluster: LigA; n=1; Methylobacterium sp. 4-46|Rep: LigA - Methylobacterium sp. 4-46 Length = 157 Score = 35.5 bits (78), Expect = 1.4 Identities = 35/107 (32%), Positives = 39/107 (36%), Gaps = 4/107 (3%) Frame = -1 Query: 324 GRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERG 145 GR GR DG P G R PA G G R L G PA Sbjct: 49 GRAGGRRDGPQGGPARADPRSGLSPRRGPAFAGAPAGRP--RRLVPRVGIGKPA------ 100 Query: 144 APSRPEAHGRLDSEQRDQ-GDHSGP---GEAGPGPLHRGPGFAGQEV 16 SR A G L +R + GDH+ P A P P GFAG + Sbjct: 101 VTSRRAAAGELPQGRRARPGDHAPPRSRAAAAPAPSPPLSGFAGNAI 147 >UniRef50_Q6YW72 Cluster: Pr1-like protein; n=4; Oryza sativa (japonica cultivar-group)|Rep: Pr1-like protein - Oryza sativa subsp. japonica (Rice) Length = 289 Score = 35.5 bits (78), Expect = 1.4 Identities = 25/63 (39%), Positives = 31/63 (49%), Gaps = 1/63 (1%) Frame = -1 Query: 312 GREDGAVARAG-TGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPS 136 GR G+ R G TG PHP PA + +GE + E A+G RE ER PS Sbjct: 225 GRGGGSGGRKGMTGGPHPSARVAGGPARQRHARGE------KAEWAAG-ERRERERDGPS 277 Query: 135 RPE 127 RP+ Sbjct: 278 RPK 280 >UniRef50_A7NUN9 Cluster: Chromosome chr18 scaffold_1, whole genome shotgun sequence; n=3; core eudicotyledons|Rep: Chromosome chr18 scaffold_1, whole genome shotgun sequence - Vitis vinifera (Grape) Length = 873 Score = 35.5 bits (78), Expect = 1.4 Identities = 19/62 (30%), Positives = 33/62 (53%), Gaps = 4/62 (6%) Frame = -2 Query: 329 LEDAIEGEKTEQWRAQGQELLIQAKKENVLLQLEAAYRERLMYAYSEVKR----RLDYQL 162 +ED +E ++ E W+A Q + + KEN +LQ R+R ++ + + RL QL Sbjct: 523 VEDEVEIQRLEAWKADLQNRIAEESKENAVLQASLERRKRDLHEHRQALEQDVARLQEQL 582 Query: 161 EK 156 +K Sbjct: 583 QK 584 >UniRef50_O45114 Cluster: Collagen protein 103; n=3; cellular organisms|Rep: Collagen protein 103 - Caenorhabditis elegans Length = 371 Score = 35.5 bits (78), Expect = 1.4 Identities = 32/114 (28%), Positives = 39/114 (34%), Gaps = 2/114 (1%) Frame = -1 Query: 345 PNRESTGGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEA-ASGL 169 P G N E A A PG G A R G+ G A G Sbjct: 163 PGSNGGAGSNGASEGSAGGCKTCPAGPPGPPGPAGQAGRPGNDGQPGAPSFGGGVGAPGA 222 Query: 168 PAREVERGAPSRPEAHGRLDSEQRD-QGDHSGPGEAGPGPLHRGPGFAGQEVNG 10 P + G+P +P A G+ ++ QG S PG GP PG G G Sbjct: 223 PGPAGDAGSPGQPGAPGQPGRPGKNAQGGSSRPGPPGPAGPPGPPGNNGAPGGG 276 >UniRef50_Q6ZQQ4 Cluster: CDNA FLJ46309 fis, clone TESTI4039744; n=49; Homo/Pan/Gorilla group|Rep: CDNA FLJ46309 fis, clone TESTI4039744 - Homo sapiens (Human) Length = 385 Score = 35.5 bits (78), Expect = 1.4 Identities = 31/104 (29%), Positives = 38/104 (36%) Frame = -1 Query: 327 GGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVER 148 GGR + R G GAPH +G R+ L E G A LP + Sbjct: 53 GGRAEALPTSQMGRPGRGAPHLPDDG-RSGRGAPHLPDEGR----PGRGAPHLPGGAAGQ 107 Query: 147 GAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQEV 16 PS P GR ++ Q G G G H G AGQ + Sbjct: 108 RRPSPPRRGGRAEAPLTSQA-----GRPGRGAPHLPDGAAGQRL 146 >UniRef50_P33485 Cluster: Probable nuclear antigen; n=5; root|Rep: Probable nuclear antigen - Pseudorabies virus (strain Kaplan) (PRV) Length = 1733 Score = 35.5 bits (78), Expect = 1.4 Identities = 40/143 (27%), Positives = 47/143 (32%), Gaps = 6/143 (4%) Frame = -1 Query: 465 LRTVTAGHGVCGSREIRTKIGCLVGQGXXXXXXXXXXXXEPNRESTGGRNRGREDGAVAR 286 +R G GV G +G G G E+ GG R R Sbjct: 958 VRAGGGGAGVAGGAG-EAGLGAGAGLGAGAGLGAGGAGGPGAGEAGGGARRRRRRRWDDE 1016 Query: 285 AGTGAPHPGQEGE--RAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPEAHGRL 112 AG P GQ G R P RG L + RGE G+ + P AH R Sbjct: 1017 AGLLGPERGQAGRGLRGPGPRGGLGEPGRGHVGRGEEGRGVGPGGLAGAGPVHAVAHQRR 1076 Query: 111 DSEQRDQGDH----SGPGEAGPG 55 D+GD G AGPG Sbjct: 1077 HG-AGDEGDRVRGLPPLGRAGPG 1098 >UniRef50_Q02241 Cluster: Kinesin-like protein KIF23; n=34; Eumetazoa|Rep: Kinesin-like protein KIF23 - Homo sapiens (Human) Length = 960 Score = 35.5 bits (78), Expect = 1.4 Identities = 28/114 (24%), Positives = 54/114 (47%), Gaps = 4/114 (3%) Frame = -2 Query: 347 NQTVKALEDAIEGEKTEQWRA-QGQELLIQA-KKENVLLQLEAAYRERLMYAYSEVKRRL 174 + V + E+ ++G+ E+ + GQ+L I+ +K+N L+ + E+ Y E KR L Sbjct: 530 DNAVLSKENHMQGKLNEKEKMISGQKLEIERLEKKNKTLEYKIEILEKTTTIYEEDKRNL 589 Query: 173 DYQLEKSN--VERRLAQKHMVDWIVSNVTKAITPDQEKQALDRCIADLASLARK 18 +LE N ++R+ + K ++ + + T EK+ R A + K Sbjct: 590 QQELETQNQKLQRQFSDKRRLEARLQGMVTETTMKWEKECERRVAAKQLEMQNK 643 >UniRef50_P12107 Cluster: Collagen alpha-1(XI) chain precursor; n=83; Euteleostomi|Rep: Collagen alpha-1(XI) chain precursor - Homo sapiens (Human) Length = 1806 Score = 35.5 bits (78), Expect = 1.4 Identities = 39/118 (33%), Positives = 50/118 (42%), Gaps = 14/118 (11%) Frame = -1 Query: 333 STGGRNRGREDGAVARAGT-GAPHP-GQEGERA-PAARGR--LQGEAHVRLLRG-EAASG 172 S+G + + G G G P P G+ G+R P A G + GE + RG + G Sbjct: 548 SSGAKGESGDPGPQGPRGVQGPPGPTGKPGKRGRPGADGGRGMPGEPGAKGDRGFDGLPG 607 Query: 171 LPAREVERG-----APSRPEAHGRLDSEQRDQGDHSGPGEAGPGPL--HRG-PGFAGQ 22 LP + RG P P + E + G PGEAGP L RG PG GQ Sbjct: 608 LPGDKGHRGERGPQGPPGPPGDDGMRGEDGEIGPRGLPGEAGPRGLLGPRGTPGAPGQ 665 Score = 35.1 bits (77), Expect = 1.9 Identities = 31/102 (30%), Positives = 44/102 (43%), Gaps = 1/102 (0%) Frame = -1 Query: 327 GGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVER 148 G R R G AG PG++G + PA R +QG V L +G P + ++ Sbjct: 1072 GLRGRPGPQGPPGPAGEKGA-PGEKGPQGPAGRDGVQGP--VGLPGPAGPAGSPGEDGDK 1128 Query: 147 GAPSRPEAHGRLDSEQRDQGDHSGPGEAG-PGPLHRGPGFAG 25 G P G + +G++ PG G GP+ PG AG Sbjct: 1129 GEIGEPGQKG----SKGGKGENGPPGPPGLQGPV-GAPGIAG 1165 Score = 34.7 bits (76), Expect = 2.5 Identities = 29/99 (29%), Positives = 37/99 (37%), Gaps = 1/99 (1%) Frame = -1 Query: 306 EDGAVARAGTGAPHPGQEGERAPAARGRLQGE-AHVRLLRGEAASGLPAREVERGAPSRP 130 E+G V G P PG G + P QG V + G G P G P Sbjct: 1211 ENGDVGPMGPPGP-PGPRGPQGPNGADGPQGPPGSVGSVGGVGEKGEPGEAGNPGPPGEA 1269 Query: 129 EAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQEVN 13 G E+ ++G+ PG AGP PG G + N Sbjct: 1270 GVGGP-KGERGEKGEAGPPGAAGPPGAKGPPGDDGPKGN 1307 >UniRef50_Q02388 Cluster: Collagen alpha-1(VII) chain precursor; n=30; Eumetazoa|Rep: Collagen alpha-1(VII) chain precursor - Homo sapiens (Human) Length = 2944 Score = 35.5 bits (78), Expect = 1.4 Identities = 33/96 (34%), Positives = 36/96 (37%), Gaps = 3/96 (3%) Frame = -1 Query: 288 RAGTGAP-HPGQEGER-APAARGRLQGEAHVRLLRG-EAASGLPAREVERGAPSRPEAHG 118 R G P PG G AP +G G A + RG A G P G P P A G Sbjct: 1265 RGQVGPPGDPGLPGRTGAPGPQGP-PGSATAKGERGFPGADGRPGSPGRAGNPGTPGAPG 1323 Query: 117 RLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQEVNG 10 L G PGE GP PG GQ + G Sbjct: 1324 -LKGSPGLPGPRGDPGERGPRGPKGEPGAPGQVIGG 1358 Score = 34.3 bits (75), Expect = 3.3 Identities = 36/109 (33%), Positives = 45/109 (41%), Gaps = 5/109 (4%) Frame = -1 Query: 330 TGGRNRGREDGAVARAGT-GAPH-PGQEGERAPAARGRLQGEAHVRLLRGEAASGLPARE 157 TG DGA + G G+P PG G P +GE G+A GLP + Sbjct: 2253 TGKPGAPGRDGASGKDGDRGSPGVPGSPGLPGPVGP---KGEPGPTGAPGQAVVGLPGAK 2309 Query: 156 VERGAPSRPEAHGRLDSEQRDQGDHSGPGEAG-PGPLHRG--PGFAGQE 19 E+GAP G L E +GD PG G G R PG G++ Sbjct: 2310 GEKGAPG--GLAGDLVGEPGAKGDRGLPGPRGEKGEAGRAGEPGDPGED 2356 Score = 33.1 bits (72), Expect = 7.6 Identities = 40/127 (31%), Positives = 47/127 (37%), Gaps = 21/127 (16%) Frame = -1 Query: 327 GGRNRGREDGAVARAGTGAPH-----PGQEGERAPAARGRLQGEAHVRLLRG-------- 187 GG GA AG P G+ GE P RG L G L G Sbjct: 2180 GGHGDPGPPGAPGLAGPAGPQGPSGLKGEPGETGPPGRG-LTGPTGAVGLPGPPGPSGLV 2238 Query: 186 --EAASGLPAREVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAG-PGPL----HRGP-GF 31 + + GLP + E G P P G + D+G PG G PGP+ GP G Sbjct: 2239 GPQGSPGLPGQVGETGKPGAPGRDG-ASGKDGDRGSPGVPGSPGLPGPVGPKGEPGPTGA 2297 Query: 30 AGQEVNG 10 GQ V G Sbjct: 2298 PGQAVVG 2304 >UniRef50_UPI000155CDC9 Cluster: PREDICTED: hypothetical protein; n=1; Ornithorhynchus anatinus|Rep: PREDICTED: hypothetical protein - Ornithorhynchus anatinus Length = 330 Score = 35.1 bits (77), Expect = 1.9 Identities = 38/98 (38%), Positives = 47/98 (47%), Gaps = 6/98 (6%) Frame = -1 Query: 291 ARAG-TGAPHPGQEGERAPAARGRLQ-GE--AHVRLLR-GEAASGLP-AREVERGAPSRP 130 ARAG +GA + A + R + GE A VR R GE + + AR E GA R Sbjct: 4 ARAGESGAEVKTARPDEVDAEKRRARAGELGAEVRTARPGEVDAEMRRARVGEAGAEVRT 63 Query: 129 EAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQEV 16 HG +D+E R + GEAG G PG AG EV Sbjct: 64 AWHGEVDAEMR----WARAGEAGAGVRMDQPGEAGAEV 97 >UniRef50_UPI0000F2DA9D Cluster: PREDICTED: hypothetical protein; n=1; Monodelphis domestica|Rep: PREDICTED: hypothetical protein - Monodelphis domestica Length = 266 Score = 35.1 bits (77), Expect = 1.9 Identities = 32/101 (31%), Positives = 34/101 (33%), Gaps = 2/101 (1%) Frame = -1 Query: 339 RESTGGRNRGREDGAVARAGTGAPHPGQEGERAPA-ARGRLQGEAHVRLLRGEAASGLPA 163 RE GR RGR A G P G EG + P ARG +G R G G Sbjct: 2 REEARGRRRGRGSRGPA-GGAAEPGAGAEGGKGPGWARGAGRGGGGCRGCGGSGGGGGRG 60 Query: 162 REVERGAPSR-PEAHGRLDSEQRDQGDHSGPGEAGPGPLHR 43 PSR P L G G PGP R Sbjct: 61 GRARDTPPSRSPGGAAWLRRGFGSGGARGSSGRREPGPFVR 101 >UniRef50_UPI0000EBD3CA Cluster: PREDICTED: hypothetical protein; n=1; Bos taurus|Rep: PREDICTED: hypothetical protein - Bos taurus Length = 188 Score = 35.1 bits (77), Expect = 1.9 Identities = 27/70 (38%), Positives = 29/70 (41%), Gaps = 2/70 (2%) Frame = -1 Query: 333 STGGRNRG-REDGAVARAGTGAPHPGQEGERAPAA-RGRLQGEAHVRLLRGEAASGLPAR 160 S GGR R+ G A AG G+ P G APAA R R Q R R P R Sbjct: 3 SLGGRRAAERQPGLPAAAGNGSSAPAGYGSWAPAASRRRAQHILPTRRERASDTRPAPPR 62 Query: 159 EVERGAPSRP 130 A SRP Sbjct: 63 SAPSPAASRP 72 >UniRef50_UPI0000E7F95D Cluster: PREDICTED: similar to MGC86401 protein; n=2; Gallus gallus|Rep: PREDICTED: similar to MGC86401 protein - Gallus gallus Length = 509 Score = 35.1 bits (77), Expect = 1.9 Identities = 23/61 (37%), Positives = 29/61 (47%), Gaps = 2/61 (3%) Frame = -1 Query: 180 ASGLPAREVERGAPSRPEAHGRLDSEQRD--QGDHSGPGEAGPGPLHRGPGFAGQEVNGS 7 A GL A AP+RP A R+ S + D +G S GE G + R PG G G+ Sbjct: 10 AGGLAAAGAAAAAPARPHAQRRMGSARGDVMRGSRSSAGEGRGGAVVRPPG-RGTAAAGA 68 Query: 6 E 4 E Sbjct: 69 E 69 >UniRef50_UPI0000E21BFC Cluster: PREDICTED: hypothetical protein; n=1; Pan troglodytes|Rep: PREDICTED: hypothetical protein - Pan troglodytes Length = 235 Score = 35.1 bits (77), Expect = 1.9 Identities = 29/88 (32%), Positives = 36/88 (40%), Gaps = 3/88 (3%) Frame = -1 Query: 288 RAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPEAHGRLD 109 +A +G P EG+RAP A Q +R A G P R + PSR R Sbjct: 55 QAPSGVNFPPPEGDRAPRAP---QESDFLRSCHQRPARG-PPRSIHNQDPSRCSPQRRPP 110 Query: 108 SEQRDQ---GDHSGPGEAGPGPLHRGPG 34 + R+ H G G AGP PL G Sbjct: 111 AGLREAKVAAAHRGVGAAGPRPLRAAAG 138 >UniRef50_UPI0000DD85F5 Cluster: PREDICTED: hypothetical protein; n=1; Homo sapiens|Rep: PREDICTED: hypothetical protein - Homo sapiens Length = 240 Score = 35.1 bits (77), Expect = 1.9 Identities = 29/92 (31%), Positives = 36/92 (39%), Gaps = 3/92 (3%) Frame = -1 Query: 315 RGREDGAVARAGTGAPHPGQEGERAPAARGRLQ---GEAHVRLLRGEAASGLPAREVERG 145 RG E G+ R A + E AP + G R GE+A G+ A + R Sbjct: 126 RGGEPGSEPRPRARAILSARTSEPAPPGAEQYAAGPGAGRGRAGGGESAGGVGAGQAHRP 185 Query: 144 APSRPEAHGRLDSEQRDQGDHSGPGEAGPGPL 49 SRP R + Q G P AGP PL Sbjct: 186 GSSRPPGSARRGAAQPAPGTQP-PPRAGPAPL 216 >UniRef50_UPI00006A1B4A Cluster: Collagen alpha-3(VI) chain precursor.; n=5; Xenopus tropicalis|Rep: Collagen alpha-3(VI) chain precursor. - Xenopus tropicalis Length = 2535 Score = 35.1 bits (77), Expect = 1.9 Identities = 28/109 (25%), Positives = 40/109 (36%) Frame = -1 Query: 327 GGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVER 148 G + DG G P+ G +G R P +G +G+ SG+ + + Sbjct: 1941 GRQGEAGSDGVEGEPGNNGPN-GPQGRRGPPGLKGARGFPGEPGNKGD--SGIQGIQGSQ 1997 Query: 147 GAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQEVNGSER 1 G P P G R QGD PG +G PG G+ E+ Sbjct: 1998 GMPGPPGPQGPQGLSGR-QGDAGAPGASGSFGKPGAPGLKGEPGENGEK 2045 Score = 33.1 bits (72), Expect = 7.6 Identities = 32/103 (31%), Positives = 41/103 (39%), Gaps = 3/103 (2%) Frame = -1 Query: 312 GREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAAS-GLPAREVERGAPS 136 G E G R G G GE R L+G +R +G+ G+ E+G P Sbjct: 1829 GEEGGHGERGFRGLN--GTRGESGCPGRRGLKGARGIRGDKGDDGEFGIDGVPGEQGEPG 1886 Query: 135 RPEAHG-RLDS-EQRDQGDHSGPGEAGPGPLHRGPGFAGQEVN 13 A G R D+ Q +G PGE G L PG G +N Sbjct: 1887 GRGASGERGDTGAQGRKGPRGQPGEKGENGLRGDPGEPGTNIN 1929 >UniRef50_UPI0000ECB838 Cluster: Hypothetical protein; n=1; Gallus gallus|Rep: Hypothetical protein - Gallus gallus Length = 1550 Score = 35.1 bits (77), Expect = 1.9 Identities = 26/95 (27%), Positives = 44/95 (46%) Frame = -2 Query: 389 KEVEATENEWNEGRNQTVKALEDAIEGEKTEQWRAQGQELLIQAKKENVLLQLEAAYRER 210 K E ENE E R + +K + + EK ++W+ + ++ +QA+++ LL E + R Sbjct: 378 KIAEDHENELKEAREEVLKI--ETLYKEKEKKWKCESEDQRVQAEEKLSLLHTE--LQNR 433 Query: 209 LMYAYSEVKRRLDYQLEKSNVERRLAQKHMVDWIV 105 L Y K+ L + E + Q H IV Sbjct: 434 LEYE----KQNLQKEFEVREAQMNQLQDHQAAKIV 464 >UniRef50_Q6TEP5 Cluster: Hyaluronan-mediated motility receptor; n=4; Danio rerio|Rep: Hyaluronan-mediated motility receptor - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 903 Score = 35.1 bits (77), Expect = 1.9 Identities = 26/106 (24%), Positives = 50/106 (47%) Frame = -2 Query: 356 EGRNQTVKALEDAIEGEKTEQWRAQGQELLIQAKKENVLLQLEAAYRERLMYAYSEVKRR 177 E ++ ++ L+ +E E+ E+ RAQ Q Q ++++V + +A RL E++ Sbjct: 656 ETHSEELRCLQMDVEQERGEKERAQTQLEKEQKRRQSV--EGRSAEASRLRSHVEELEDE 713 Query: 176 LDYQLEKSNVERRLAQKHMVDWIVSNVTKAITPDQEKQALDRCIAD 39 + ER A+ H V+W ++E+Q L R +A+ Sbjct: 714 VSKLRRLMQEERDAAEHHTVEWQQERQQLCTQIEEERQDLHRQLAE 759 >UniRef50_Q4TC25 Cluster: Chromosome undetermined SCAF7059, whole genome shotgun sequence; n=2; Euteleostomi|Rep: Chromosome undetermined SCAF7059, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 354 Score = 35.1 bits (77), Expect = 1.9 Identities = 19/44 (43%), Positives = 20/44 (45%) Frame = -1 Query: 183 AASGLPAREVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGP 52 AA AR +R AP PE D G GPGE GPGP Sbjct: 242 AAGHQRARPADRSAPGEPE-RAAADDAPLAGGPGPGPGETGPGP 284 >UniRef50_Q4S480 Cluster: Chromosome undetermined SCAF14743, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome undetermined SCAF14743, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 624 Score = 35.1 bits (77), Expect = 1.9 Identities = 34/110 (30%), Positives = 44/110 (40%), Gaps = 10/110 (9%) Frame = -1 Query: 324 GRNRGREDGAVARAGTGAPH-----PGQEGERAPAARGRLQGEAHVRLLRGEAAS-GLPA 163 GR R + GA+ R G+ P PG GE+ P ++G A + RG A S L Sbjct: 324 GRERRQRRGAIGRGGSPGPVGPPGVPGSRGEKGPLGDSGVRGPAGPKGARGPAVSLQLEG 383 Query: 162 REVERGA--PSRP--EAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAG 25 + G P P + R + G H GP G LH G F G Sbjct: 384 MRMRAGGSEPLGPPGSSSERPAGPEALHGRHHGPALPVRG-LHPGQVFPG 432 >UniRef50_Q9AD79 Cluster: Putative membrane protein; n=1; Streptomyces coelicolor|Rep: Putative membrane protein - Streptomyces coelicolor Length = 221 Score = 35.1 bits (77), Expect = 1.9 Identities = 32/105 (30%), Positives = 43/105 (40%), Gaps = 10/105 (9%) Frame = -1 Query: 339 RESTGGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEA----ASG 172 R +GGR+ + AGT P G A +A G L G L+ G A A+G Sbjct: 65 RVESGGRDPSPDPATAPAAGT-VGEPSGAGPSATSAMGGLSGSPGPGLIPGLAPAPSATG 123 Query: 171 ----LPAREVER--GAPSRPEAHGRLDSEQRDQGDHSGPGEAGPG 55 LP R GAP P+ +R++GD +G PG Sbjct: 124 PAVPLPTAPPVRTPGAPETPKPGEGAGERERERGDDTGERAPAPG 168 >UniRef50_Q2INF4 Cluster: Putative uncharacterized protein; n=1; Anaeromyxobacter dehalogenans 2CP-C|Rep: Putative uncharacterized protein - Anaeromyxobacter dehalogenans (strain 2CP-C) Length = 781 Score = 35.1 bits (77), Expect = 1.9 Identities = 36/108 (33%), Positives = 42/108 (38%), Gaps = 2/108 (1%) Frame = -1 Query: 339 RESTGGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAR 160 R GR RGRE G V AP P EG+R R G R R +A R Sbjct: 235 RPGRAGRRRGRERGGVL-----APQPRPEGDRVLGLRRAPAGPGGAR-PRARSAPRARLR 288 Query: 159 EVERGAPSRPEAHG--RLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQ 22 A + +AH R + R G H P GP P R P F G+ Sbjct: 289 RGRGAARAGRDAHAARRFRAGPRRSGAHL-PRRHGPRP-RRLPVFRGR 334 >UniRef50_Q3WG32 Cluster: Putative uncharacterized protein; n=1; Frankia sp. EAN1pec|Rep: Putative uncharacterized protein - Frankia sp. EAN1pec Length = 494 Score = 35.1 bits (77), Expect = 1.9 Identities = 35/105 (33%), Positives = 41/105 (39%), Gaps = 11/105 (10%) Frame = -1 Query: 315 RGREDGAVARAGTGAPHPGQEGERAPAAR--------GRLQGEAHVRLLRGEAASGLPAR 160 +GR+ A +G P + G RA AAR G G+ R R A G PA Sbjct: 8 QGRDSDRAAGSGVRLGRPARPGPRAAAARPGAPDRRTGWFPGDQPGRAGR-LRAQGPPAG 66 Query: 159 EVERGAPSRPEAH---GRLDSEQRDQGDHSGPGEAGPGPLHRGPG 34 RG P RP H GR R H+ P PG HR G Sbjct: 67 RRLRGGPGRPGDHPGAGRRGGPAR--ALHAHPRRPEPGAAHRRDG 109 >UniRef50_Q11AX3 Cluster: Cytochrome c, class I; n=3; Rhizobiales|Rep: Cytochrome c, class I - Mesorhizobium sp. (strain BNC1) Length = 286 Score = 35.1 bits (77), Expect = 1.9 Identities = 26/85 (30%), Positives = 37/85 (43%), Gaps = 2/85 (2%) Frame = -1 Query: 306 EDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEA--ASGLPAREVERGAPSR 133 E+GA A A G P EGE AP A G E + A A G PA + + AP++ Sbjct: 202 EEGAAAPAEGGEQTPPAEGEAAPPAEGAAPAEGTAAPAQDGAAPAEGAPAGDTQ--APAQ 259 Query: 132 PEAHGRLDSEQRDQGDHSGPGEAGP 58 E + + + G +G + P Sbjct: 260 TEETTTPPATEGEAGGSTGETQPAP 284 >UniRef50_Q08UF8 Cluster: Tetratricopeptide repeat domain protein; n=1; Stigmatella aurantiaca DW4/3-1|Rep: Tetratricopeptide repeat domain protein - Stigmatella aurantiaca DW4/3-1 Length = 897 Score = 35.1 bits (77), Expect = 1.9 Identities = 28/72 (38%), Positives = 35/72 (48%), Gaps = 5/72 (6%) Frame = -1 Query: 345 PNRESTGGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLR-GEAASG- 172 P+R+ G R D A AG APHP G RA RLQ H R L+ +AA+G Sbjct: 798 PHRQHAGARGDHHRDPARGLAGDPAPHPQALGRRA-----RLQRRHHRRSLQEDDAAAGH 852 Query: 171 ---LPAREVERG 145 LP ++ RG Sbjct: 853 GDALPPVQLGRG 864 >UniRef50_A5P378 Cluster: Putative uncharacterized protein; n=3; Proteobacteria|Rep: Putative uncharacterized protein - Methylobacterium sp. 4-46 Length = 1338 Score = 35.1 bits (77), Expect = 1.9 Identities = 33/87 (37%), Positives = 35/87 (40%), Gaps = 3/87 (3%) Frame = -1 Query: 291 ARAGTGAPHPGQEGERA-PAARGRLQGEAHVRLLRGEAASGLPAREVERGA-PSRPEAH- 121 ARAG P + RA PA RGR R G A GLP R R A P RP Sbjct: 177 ARAGLDGPGLLADPRRARPARRGRRHRGRDQRARAGGAPRGLPGRARRRAARPRRPARRA 236 Query: 120 GRLDSEQRDQGDHSGPGEAGPGPLHRG 40 G D + R P GP L RG Sbjct: 237 GGRDPQPRP------PSRRGPRRLRRG 257 >UniRef50_A0QX11 Cluster: Putative uncharacterized protein; n=1; Mycobacterium smegmatis str. MC2 155|Rep: Putative uncharacterized protein - Mycobacterium smegmatis (strain ATCC 700084 / mc(2)155) Length = 212 Score = 35.1 bits (77), Expect = 1.9 Identities = 32/107 (29%), Positives = 44/107 (41%), Gaps = 3/107 (2%) Frame = -1 Query: 312 GREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASG---LPAREVERGA 142 GR DG +G G PH ++ +R QG H ++ G G L AR R A Sbjct: 57 GRRDGVAPCSGQGVPH--RDDDRHGEQEDD-QGPPHPGVVGGSTHPGGVVLRARAKPRVA 113 Query: 141 PSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQEVNGSER 1 + P A GR ++ +GPG P RGP A + + R Sbjct: 114 DAPPGA-GRAHHDRAVGAVEAGPGRPVPRHAQRGPRPARERQDDEHR 159 >UniRef50_Q6ZAE0 Cluster: Putative uncharacterized protein P0410E02.6; n=4; Oryza sativa|Rep: Putative uncharacterized protein P0410E02.6 - Oryza sativa subsp. japonica (Rice) Length = 357 Score = 35.1 bits (77), Expect = 1.9 Identities = 28/92 (30%), Positives = 38/92 (41%), Gaps = 2/92 (2%) Frame = -1 Query: 327 GGRNRGREDGA--VARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREV 154 GG R R G +A TG PHP PA + R +GE + RGE RE Sbjct: 14 GGGARARAGGGRGARKAMTGGPHPSARAAGGPACQRRARGEEPMGRRRGER-----GREE 68 Query: 153 ERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGP 58 + P++ G+ S + H GP + P Sbjct: 69 KWAEPAQERKGGKRTS---FRAWHDGPDTSPP 97 >UniRef50_Q9VMV5 Cluster: CG16858-PA; n=6; Schizophora|Rep: CG16858-PA - Drosophila melanogaster (Fruit fly) Length = 1940 Score = 35.1 bits (77), Expect = 1.9 Identities = 29/88 (32%), Positives = 36/88 (40%), Gaps = 1/88 (1%) Frame = -1 Query: 264 PGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPEAHGRLDSEQRDQGD 85 PG++G P +GE LRGE PA EV G P + + + QG Sbjct: 724 PGEDGYTGPKGVKGAKGEQGAIGLRGEIGDRGPAGEVIPG-PVGAKGYPGPTGDYGQQGA 782 Query: 84 HSGPGEAGPGPLHRGPGFAGQE-VNGSE 4 PG G L G G+ GQ V G E Sbjct: 783 PGLPGRDGEPGLDGGIGYKGQRGVPGQE 810 >UniRef50_Q66S51 Cluster: Collagen repeat-containing protein; n=1; Oikopleura dioica|Rep: Collagen repeat-containing protein - Oikopleura dioica (Tunicate) Length = 1041 Score = 35.1 bits (77), Expect = 1.9 Identities = 31/107 (28%), Positives = 42/107 (39%), Gaps = 4/107 (3%) Frame = -1 Query: 345 PNRESTGGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGE----AHVRLLRGEAA 178 P GG G GA +G PG +G+R P+ +QGE L+ Sbjct: 477 PGPSGEGGGPPGPA-GAKGHSGRRG-EPGPDGKRGPSGEPGVQGETGPPGEQGLIGPAGP 534 Query: 177 SGLPAREVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGP 37 G+P +G P G + +Q +QG PGE G P GP Sbjct: 535 PGVPGESGRQGKDGSPGPQG-VRGQQGNQGYPGEPGEPGQ-PGETGP 579 Score = 33.5 bits (73), Expect = 5.8 Identities = 30/94 (31%), Positives = 36/94 (38%) Frame = -1 Query: 303 DGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPEA 124 DG G P PG+ G + GE L G+A G P E G P Sbjct: 622 DGEPGADGQPGP-PGETGSKGHQGEAGPPGETGAAGLNGDA--GPPG---ETGPAGPPGE 675 Query: 123 HGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQ 22 GR EQ G+ GE+GP GPG G+ Sbjct: 676 SGR-PGEQGLTGETGMRGESGPQGPAGGPGLTGE 708 >UniRef50_Q07265 Cluster: 3 alpha procollagen; n=4; Strongylocentrotus purpuratus|Rep: 3 alpha procollagen - Strongylocentrotus purpuratus (Purple sea urchin) Length = 1752 Score = 35.1 bits (77), Expect = 1.9 Identities = 28/81 (34%), Positives = 32/81 (39%) Frame = -1 Query: 264 PGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPEAHGRLDSEQRDQGD 85 PG GE+ G +R L G+ G P ERG P GR G Sbjct: 806 PGAFGEKGDFGPQGNPGGQGLRGLTGQP--GQPGIGGERGNIGDPGTRGR----DGIPGQ 859 Query: 84 HSGPGEAGPGPLHRGPGFAGQ 22 G GE GPG L GPG G+ Sbjct: 860 AGGKGETGPGGLPGGPGIKGE 880 >UniRef50_A7RPE2 Cluster: Predicted protein; n=1; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 283 Score = 35.1 bits (77), Expect = 1.9 Identities = 16/40 (40%), Positives = 23/40 (57%) Frame = -1 Query: 177 SGLPAREVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGP 58 +GLP R+ G P R +G +D ++ D+GD PG GP Sbjct: 17 AGLPGRDGAMGPPGRDGQNG-VDGQKGDRGDMGPPGHPGP 55 >UniRef50_A4H5G1 Cluster: Putative uncharacterized protein; n=1; Leishmania braziliensis|Rep: Putative uncharacterized protein - Leishmania braziliensis Length = 2178 Score = 35.1 bits (77), Expect = 1.9 Identities = 29/84 (34%), Positives = 38/84 (45%), Gaps = 5/84 (5%) Frame = -1 Query: 297 AVARAG--TGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAA---SGLPAREVERGAPSR 133 +V RAG T AP G +R RG H++L+ G+ A S R +E R Sbjct: 803 SVDRAGLMTDAPRQGMSDKRKDK-RG------HLKLVEGDGAELRSLHLTRALEEVTIGR 855 Query: 132 PEAHGRLDSEQRDQGDHSGPGEAG 61 PE HG D + D+ D G E G Sbjct: 856 PEGHGPRDQVEEDEDDEDGTDEEG 879 >UniRef50_A0CHT2 Cluster: Chromosome undetermined scaffold_184, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_184, whole genome shotgun sequence - Paramecium tetraurelia Length = 315 Score = 35.1 bits (77), Expect = 1.9 Identities = 15/39 (38%), Positives = 26/39 (66%) Frame = -2 Query: 386 EVEATENEWNEGRNQTVKALEDAIEGEKTEQWRAQGQEL 270 +VEAT+ EW++G+N T K ++ +KT Q+R +E+ Sbjct: 177 KVEATKVEWHDGKNLTKKLIKKKQRNKKTGQFRVISKEV 215 >UniRef50_A6NF26 Cluster: Uncharacterized protein COL27A1; n=28; Euteleostomi|Rep: Uncharacterized protein COL27A1 - Homo sapiens (Human) Length = 1861 Score = 35.1 bits (77), Expect = 1.9 Identities = 33/116 (28%), Positives = 44/116 (37%), Gaps = 9/116 (7%) Frame = -1 Query: 345 PNRESTGG-RNRGREDGAVARAGT-GAPHP----GQEGERAPAARGRLQGEAHVRLLRG- 187 P +E G R + + G G G P P G EG + + G V+ L+G Sbjct: 1366 PGQEGVQGLRGKPGQQGQPGHPGPRGWPGPKGSKGAEGPKGKQGKAGAPGRRGVQGLQGL 1425 Query: 186 EAASGLPAREVERGAPSRPEAHGR--LDSEQRDQGDHSGPGEAGPGPLHRGPGFAG 25 G+ R+ G GR +Q +QGD PG GP PG AG Sbjct: 1426 PGPRGVVGRQGLEGIAGPDGLPGRDGQAGQQGEQGDDGDPGPMGPAGKRGNPGVAG 1481 >UniRef50_Q750X0 Cluster: AGL181Cp; n=1; Eremothecium gossypii|Rep: AGL181Cp - Ashbya gossypii (Yeast) (Eremothecium gossypii) Length = 711 Score = 35.1 bits (77), Expect = 1.9 Identities = 21/69 (30%), Positives = 33/69 (47%) Frame = -2 Query: 293 WRAQGQELLIQAKKENVLLQLEAAYRERLMYAYSEVKRRLDYQLEKSNVERRLAQKHMVD 114 W QGQ+++ EN L+ RL+Y +E+ R+L+ Q K N R H + Sbjct: 145 WYLQGQDVVPVRSGENRLVSGIRLPLSRLLYHCNELVRQLEAQ-SKLNTPRHYMVAHKLQ 203 Query: 113 WIVSNVTKA 87 W +S + A Sbjct: 204 WFMSQLLPA 212 >UniRef50_UPI0001555CB8 Cluster: PREDICTED: similar to T-box 1; n=1; Ornithorhynchus anatinus|Rep: PREDICTED: similar to T-box 1 - Ornithorhynchus anatinus Length = 614 Score = 34.7 bits (76), Expect = 2.5 Identities = 32/95 (33%), Positives = 37/95 (38%), Gaps = 8/95 (8%) Frame = -1 Query: 300 GAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPEAH 121 G+ A G PG G R PA + VR RG G REV R SRP A Sbjct: 250 GSKALVPGGGFFPGLSGRRTPAPHS-INRPPGVRAGRGGEGRG---REVPRRRQSRPRAS 305 Query: 120 GRLDS--------EQRDQGDHSGPGEAGPGPLHRG 40 R+ + R G +G G AGP P G Sbjct: 306 ARIKGPGRRGTARDVRLPGPDAGVGGAGPAPTPAG 340 >UniRef50_UPI0000F2146D Cluster: PREDICTED: similar to alpha-1 type XI collagen; n=1; Danio rerio|Rep: PREDICTED: similar to alpha-1 type XI collagen - Danio rerio Length = 616 Score = 34.7 bits (76), Expect = 2.5 Identities = 30/100 (30%), Positives = 39/100 (39%), Gaps = 2/100 (2%) Frame = -1 Query: 306 EDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRP- 130 E+G V G P PG G + P QG + G A G + E G P P Sbjct: 227 ENGDVGPMGPPGP-PGPRGPQGPPGADGPQGPPGG--IGGMGAVGEKGEQGEAGNPGPPG 283 Query: 129 -EAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQEVN 13 G E+ ++G+ PG AGP PG G + N Sbjct: 284 EPGPGGPKGERGEKGEAGPPGAAGPAGPKGPPGDDGPKGN 323 Score = 33.9 bits (74), Expect = 4.4 Identities = 26/83 (31%), Positives = 36/83 (43%), Gaps = 1/83 (1%) Frame = -1 Query: 264 PGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPEAHGRLDSEQRDQGD 85 PG+ G PA R +QG V L G P + ++G P G + D+G+ Sbjct: 108 PGERGPLGPAGRDGVQGP--VGLPGPAGPQGPPGEDGDKGEVGEPGQKG----SKADKGE 161 Query: 84 HSGPGEAG-PGPLHRGPGFAGQE 19 PG G GP+ PG AG + Sbjct: 162 QGPPGPPGLQGPI-GAPGPAGAD 183 >UniRef50_UPI0000EBD4BD Cluster: PREDICTED: similar to alpha-3 type IX collagen; n=1; Bos taurus|Rep: PREDICTED: similar to alpha-3 type IX collagen - Bos taurus Length = 403 Score = 34.7 bits (76), Expect = 2.5 Identities = 24/76 (31%), Positives = 30/76 (39%), Gaps = 5/76 (6%) Frame = -1 Query: 261 GQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPEAHGR-----LDSEQR 97 G GER P +G+ L G+P G P P G+ LD E+ Sbjct: 143 GDRGERGPEGFRGPKGD-----LGRPGPKGIPGMSGPSGEPGMPGKDGQDGVPGLDGEKG 197 Query: 96 DQGDHSGPGEAGPGPL 49 + G H PGE GP L Sbjct: 198 EAGRHGAPGEKGPNGL 213 >UniRef50_UPI0000E801E7 Cluster: PREDICTED: similar to alpha 1 type XIX collagen; n=1; Gallus gallus|Rep: PREDICTED: similar to alpha 1 type XIX collagen - Gallus gallus Length = 890 Score = 34.7 bits (76), Expect = 2.5 Identities = 31/109 (28%), Positives = 44/109 (40%), Gaps = 4/109 (3%) Frame = -1 Query: 333 STGGRNRGREDGAVARAGT-GAPH-PGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAR 160 S G + G + G+ G+P PG+ GER P G E G P Sbjct: 670 SEGSPGKPGPPGPPGKPGSPGSPGLPGEPGERGPIGDTGFPGP--------EGPQGKPGI 721 Query: 159 EVERGAPSRPEAHGRLDSE--QRDQGDHSGPGEAGPGPLHRGPGFAGQE 19 + G P P A GR + ++GD PG+ GP PG +G++ Sbjct: 722 NGKDGLPGPPGAVGRPGDRGPKGERGDQGIPGDKGPQGERGRPGPSGEK 770 >UniRef50_UPI0000E1F200 Cluster: PREDICTED: hypothetical protein; n=1; Pan troglodytes|Rep: PREDICTED: hypothetical protein - Pan troglodytes Length = 142 Score = 34.7 bits (76), Expect = 2.5 Identities = 30/88 (34%), Positives = 40/88 (45%), Gaps = 5/88 (5%) Frame = -1 Query: 282 GTGAPHPGQEGERAPAARGRLQGEA-HVRLLRGEAASGLPAREVER-GA---PSRPEAHG 118 G G P PG A GR +G A + G ++ P V R G+ P+ E+ G Sbjct: 45 GGGGPAPGY-------ATGRSRGGALRSAEMPGACSAVRPGLHVHRLGSAVGPAGAESAG 97 Query: 117 RLDSEQRDQGDHSGPGEAGPGPLHRGPG 34 + R G +GPG+AGPG GPG Sbjct: 98 PARALPRSIGLRAGPGQAGPGACSAGPG 125 >UniRef50_UPI00006D930E Cluster: hypothetical protein Paer2_01003155; n=1; Pseudomonas aeruginosa 2192|Rep: hypothetical protein Paer2_01003155 - Pseudomonas aeruginosa 2192 Length = 360 Score = 34.7 bits (76), Expect = 2.5 Identities = 26/71 (36%), Positives = 31/71 (43%), Gaps = 3/71 (4%) Frame = -1 Query: 312 GREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRL--LRGEAASGLPAREVE-RGA 142 G G V A APHPG+E +R R G A L + + GLP R + R A Sbjct: 53 GHAGGDVGIAVAVAPHPGREAQRRGGQRQAFAGAAQEYLVDVPKDVRQGLPERMFDHREA 112 Query: 141 PSRPEAHGRLD 109 P R GR D Sbjct: 113 PFRLVHRGRPD 123 >UniRef50_UPI00006A0C93 Cluster: Collagen alpha-1(XIX) chain precursor (Collagen alpha-1(Y) chain).; n=1; Xenopus tropicalis|Rep: Collagen alpha-1(XIX) chain precursor (Collagen alpha-1(Y) chain). - Xenopus tropicalis Length = 599 Score = 34.7 bits (76), Expect = 2.5 Identities = 30/95 (31%), Positives = 40/95 (42%), Gaps = 3/95 (3%) Frame = -1 Query: 300 GAVARAGTGAP--HPGQEGERAPAARGRLQGEAHVRLLRGEAAS-GLPAREVERGAPSRP 130 G R G P +PG+ GER P G + G+ + GLP GAP RP Sbjct: 312 GEKGRNGAPGPPGYPGESGERGPVGDIGFPGPEGPAGVPGKNGNDGLPGSV---GAPGRP 368 Query: 129 EAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAG 25 G + ++GD PG+ GP PG +G Sbjct: 369 GDRG----PKGERGDPGIPGDQGPQGERGKPGPSG 399 >UniRef50_UPI000069E9BC Cluster: UPI000069E9BC related cluster; n=4; Xenopus tropicalis|Rep: UPI000069E9BC UniRef100 entry - Xenopus tropicalis Length = 379 Score = 34.7 bits (76), Expect = 2.5 Identities = 37/126 (29%), Positives = 49/126 (38%), Gaps = 17/126 (13%) Frame = -1 Query: 327 GGRNRGREDGAVARAGTGAPH-----PGQ---EGERAPAARGRLQGEAHVRLLRGEAA-S 175 G + ++G+ G PH PGQ +GER LQG LRG A S Sbjct: 58 GDKGLAGQNGSQGLKGPAGPHGKAGPPGQKGNQGERGTNGEPGLQGPKGETGLRGPAGVS 117 Query: 174 GLPAREVERGAPSRPEAHGR--LDSEQRDQGD------HSGPGEAGPGPLHRGPGFAGQE 19 G P ++ G P G + + D+GD + G GP H G GQ+ Sbjct: 118 GPPGKDGFAGIPGASGIPGPSGIKGSKGDKGDPGLAGQNGSQGLKGPAGPHGKAGPPGQK 177 Query: 18 VNGSER 1 N ER Sbjct: 178 GNQGER 183 >UniRef50_UPI000065F78D Cluster: Homolog of Homo sapiens "Collagen alpha 1; n=2; Clupeocephala|Rep: Homolog of Homo sapiens "Collagen alpha 1 - Takifugu rubripes Length = 2683 Score = 34.7 bits (76), Expect = 2.5 Identities = 33/100 (33%), Positives = 40/100 (40%), Gaps = 5/100 (5%) Frame = -1 Query: 300 GAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPEAH 121 G+ +AG P PG G+ +G +GE GLP ERG P Sbjct: 2313 GSPGKAGPAGP-PGSPGQVGAPGTDGFKGN------KGEVGVGLPGLRGERGDPGPRGEA 2365 Query: 120 GR--LDSEQRDQ--GDHSGP-GEAGPGPLHRGPGFAGQEV 16 GR LD ++ Q G GP GE G G L G G V Sbjct: 2366 GRPGLDGDRGLQGLGGMQGPRGEKGDGGLQGDKGDKGDTV 2405 >UniRef50_Q4TFV5 Cluster: Chromosome undetermined SCAF4174, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome undetermined SCAF4174, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 185 Score = 34.7 bits (76), Expect = 2.5 Identities = 36/100 (36%), Positives = 42/100 (42%), Gaps = 10/100 (10%) Frame = -1 Query: 324 GRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREV--- 154 G R + +G A G G PG G AA GR+Q +RLL+ E A P E Sbjct: 44 GPGRRQPEGQQAERG-GRVRPGGGGALPGAAVGRVQ----LRLLQPELAGPGPVHEAGQR 98 Query: 153 ERGAPSR---PEAHGRLDSEQRDQ----GDHSGPGEAGPG 55 R P P A G + DQ G H GEAGPG Sbjct: 99 RRQGPQPLPVPHAAGPAAAAPVDQAAAAGHHQDRGEAGPG 138 >UniRef50_Q4SWY6 Cluster: Chromosome undetermined SCAF13320, whole genome shotgun sequence; n=2; Eukaryota|Rep: Chromosome undetermined SCAF13320, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 476 Score = 34.7 bits (76), Expect = 2.5 Identities = 27/87 (31%), Positives = 37/87 (42%), Gaps = 6/87 (6%) Frame = -1 Query: 261 GQEGERAPAARGRLQGEAHVRLLRGEAA--SGLPAREVERGAPSRPEAHGR--LDSEQRD 94 G+EG+ +Q R L + A S P + G P P + G + + D Sbjct: 125 GEEGQLVQLRSNLVQLNTSNRNLENKLANLSRTPGPPGKAGQPGPPGSKGGPGVPGSKGD 184 Query: 93 QGDHSGPGEAGPG--PLHRGPGFAGQE 19 QG PGEAGP P GPG G++ Sbjct: 185 QGPKGDPGEAGPAGPPGGSGPGAKGEK 211 >UniRef50_Q5GAF3 Cluster: Putative uncharacterized protein; n=2; Singapore grouper iridovirus|Rep: Putative uncharacterized protein - Grouper iridovirus Length = 367 Score = 34.7 bits (76), Expect = 2.5 Identities = 34/99 (34%), Positives = 41/99 (41%), Gaps = 4/99 (4%) Frame = -1 Query: 306 EDGAVARAGTGAPHPGQEGERA-PAARGRL--QGEAHVRLLRGEAASGLPAREVERGAPS 136 + GA G P PG +GE P A G QGE RGE G RGAP Sbjct: 84 KSGARGVPGPAGP-PGLQGETGDPGATGSPGPQGETGQPGPRGEP--GRDGESGPRGAPG 140 Query: 135 RPEAHGRLDSEQRDQGDHSGPGEAG-PGPLHRGPGFAGQ 22 +P G E+ + PG G PGP+ PG G+ Sbjct: 141 QPGPPGPKGDEESSGINPGPPGPPGPPGPI-GPPGVTGE 178 >UniRef50_Q2IMJ3 Cluster: LigA; n=4; cellular organisms|Rep: LigA - Anaeromyxobacter dehalogenans (strain 2CP-C) Length = 808 Score = 34.7 bits (76), Expect = 2.5 Identities = 29/95 (30%), Positives = 32/95 (33%) Frame = -1 Query: 321 RNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGA 142 R R R A +RA G PA R R A R R + G AR R A Sbjct: 156 RRRARRLAARSRAAEGHARGEARVLPRPAPRARRVPGAGARRHRRDEGRGGRARRRARPA 215 Query: 141 PSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGP 37 +RP R R PG G RGP Sbjct: 216 RARPRGRARPRRRARGAAGRGRPGRRRAGRAPRGP 250 Score = 33.5 bits (73), Expect = 5.8 Identities = 37/110 (33%), Positives = 42/110 (38%), Gaps = 8/110 (7%) Frame = -1 Query: 417 RTKIGCLVGQGXXXXXXXXXXXXEPNR-ESTGGRNRGREDGAVARAGTGAPHPGQEGERA 241 R + G +G G R E GR R R ARA G PG+ G Sbjct: 501 RARRGARLGGGHARRAGAGGERAGRGRAEGRAGRRRARP----ARARGGDRAPGRRGRAG 556 Query: 240 --PAARGRLQGEAHVRLLRGEAASGLPAREVER-----GAPSRPEAHGRL 112 P ARGR GEA R R G PA + R GA + P GRL Sbjct: 557 ARPPARGR--GEA-ARAARARRRGGRPAAQGGRVAPRDGAAAHPGRRGRL 603 >UniRef50_Q2IMH4 Cluster: Fe-S oxidoreductase; n=1; Anaeromyxobacter dehalogenans 2CP-C|Rep: Fe-S oxidoreductase - Anaeromyxobacter dehalogenans (strain 2CP-C) Length = 412 Score = 34.7 bits (76), Expect = 2.5 Identities = 24/74 (32%), Positives = 32/74 (43%) Frame = -1 Query: 273 APHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPEAHGRLDSEQRD 94 AP PG + AP+A EA V + + PA AP+ P + + D Sbjct: 186 APRPGAAAKSAPSAAPAAAPEADVAAAPRTSIAPAPAAPRAFVAPAAPAPAPAPRALRAD 245 Query: 93 QGDHSGPGEAGPGP 52 GD +GP AGP P Sbjct: 246 AGD-AGPPAAGPAP 258 >UniRef50_A7FBU7 Cluster: Putative uncharacterized protein; n=1; Acinetobacter baumannii ATCC 17978|Rep: Putative uncharacterized protein - Acinetobacter baumannii (strain ATCC 17978 / NCDC KC 755) Length = 366 Score = 34.7 bits (76), Expect = 2.5 Identities = 31/140 (22%), Positives = 60/140 (42%), Gaps = 1/140 (0%) Frame = -2 Query: 437 YVAHVKFGPKLAAWLDKEVEATENEWNEGRNQTVKALEDAIEGEKTEQWRAQGQELLIQA 258 YVA FG + +K + NEW+ R +K +++ I EK ++W A + Sbjct: 193 YVADPDFGEDMIELFNKNKSSQLNEWH--RTLFIKVIKE-ISCEKNKKWNAVNAIVKDPI 249 Query: 257 KKENVLLQLEAAYRERLMYAYSEVK-RRLDYQLEKSNVERRLAQKHMVDWIVSNVTKAIT 81 K ++ ++ L YA + + + Y K +E+ L + ++ SN + Sbjct: 250 VKTQFREIMKDQPKQNLDYALAGRRDYKQLYSQAKDRLEKELKKNAWLNSYASNTERRSH 309 Query: 80 PDQEKQALDRCIADLASLAR 21 + + LD IA+ +L + Sbjct: 310 AQERLKHLDMLIAEQETLEK 329 >UniRef50_Q0AYI8 Cluster: Translation initiation factor IF-2; n=1; Syntrophomonas wolfei subsp. wolfei str. Goettingen|Rep: Translation initiation factor IF-2 - Syntrophomonas wolfei subsp. wolfei (strain Goettingen) Length = 882 Score = 34.7 bits (76), Expect = 2.5 Identities = 28/92 (30%), Positives = 36/92 (39%), Gaps = 1/92 (1%) Frame = -1 Query: 321 RNRGREDGAVARAGTGAPHP-GQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERG 145 R R GA R AP G+ RAP A GR RG ++ P V R Sbjct: 150 RPENRSAGATGRTDNRAPGAAGRTDNRAPGAVGRTDN-------RGAGSASRPDNRVTRP 202 Query: 144 APSRPEAHGRLDSEQRDQGDHSGPGEAGPGPL 49 A RP+ G S+ + + PG P P+ Sbjct: 203 AAGRPDNKGSRPSDAKRPPQRTVPGNT-PRPV 233 >UniRef50_Q08NL8 Cluster: Putative uncharacterized protein; n=1; Stigmatella aurantiaca DW4/3-1|Rep: Putative uncharacterized protein - Stigmatella aurantiaca DW4/3-1 Length = 760 Score = 34.7 bits (76), Expect = 2.5 Identities = 35/92 (38%), Positives = 40/92 (43%), Gaps = 11/92 (11%) Frame = -1 Query: 339 RESTGGRNRGREDGAVARAGTGAPHPGQEGE--RAPAARGR-----LQGEAH--VRLLRG 187 R +G R E G G PH G+ + RA AR R +GE H LRG Sbjct: 77 RRPSGERPWRAEQGRRGNRRVG-PHGGRRRDDARAHLARAREGAEVRRGERHHGEGALRG 135 Query: 186 EAASGLPAREVE-RGAPSRPEAHGR-LDSEQR 97 E G P R+ RG P R AHGR L E R Sbjct: 136 EGRPGAPRRDPGLRGPPRRRSAHGRALQGELR 167 >UniRef50_A5NMX6 Cluster: Cytochrome B561; n=1; Methylobacterium sp. 4-46|Rep: Cytochrome B561 - Methylobacterium sp. 4-46 Length = 427 Score = 34.7 bits (76), Expect = 2.5 Identities = 37/99 (37%), Positives = 41/99 (41%), Gaps = 10/99 (10%) Frame = -1 Query: 327 GGRNRGREDGA---VARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPARE 157 G GR D A V G GAPHPG A AA GR G AH A LPA Sbjct: 105 GDHRPGRTDPARRAVPAGGRGAPHPGLRAAGAGAAGGR--GPAH------GGALALPAPG 156 Query: 156 VERG-APSR----PEAHGRLD--SEQRDQGDHSGPGEAG 61 +RG P+R P+ R D R GD P + G Sbjct: 157 GDRGPRPARLRQDPDRGRRADDLGLHRRGGDRPAPRDGG 195 >UniRef50_A5NLP4 Cluster: LigA; n=1; Methylobacterium sp. 4-46|Rep: LigA - Methylobacterium sp. 4-46 Length = 797 Score = 34.7 bits (76), Expect = 2.5 Identities = 33/102 (32%), Positives = 35/102 (34%), Gaps = 4/102 (3%) Frame = -1 Query: 327 GGRNRGREDGAVARAGTGAPHPGQEGERAP----AARGRLQGEAHVRLLRGEAASGLPAR 160 GG R R A G PG P A G + H R G PAR Sbjct: 584 GGAPRDRPRPGAAGTGDHRDRPGARARPRPGHPAAPEGARPADRHGRFRDGLLVPVEPAR 643 Query: 159 EVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPG 34 R RP A R E R GD + G AGP R PG Sbjct: 644 LPLRQDQDRPLAGPRRGREPRGGGDPA--GGAGPRARPRAPG 683 Score = 33.9 bits (74), Expect = 4.4 Identities = 30/94 (31%), Positives = 34/94 (36%) Frame = -1 Query: 327 GGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVER 148 GGR GR G AR G G P G+ R P RGR A + P Sbjct: 18 GGRPPGRRRGGAARRGAGRPVAGRL-RRDP--RGRSPAGARSAPGPADDRGRAPGPRRAG 74 Query: 147 GAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLH 46 A SRP+ G + R S G A P H Sbjct: 75 AARSRPDRRGDVPGRPRASRRRSRGGGADRCPRH 108 >UniRef50_A0VAK1 Cluster: Uncharacterized protein UPF0065 precursor; n=1; Delftia acidovorans SPH-1|Rep: Uncharacterized protein UPF0065 precursor - Delftia acidovorans SPH-1 Length = 743 Score = 34.7 bits (76), Expect = 2.5 Identities = 35/114 (30%), Positives = 49/114 (42%), Gaps = 12/114 (10%) Frame = -1 Query: 339 RESTGGRNRGREDGAVARAGTGAPH--PGQE-------GERAP---AARGRLQGEAHVRL 196 R G R+RG + A G H PG + GE+ P A RGR + Sbjct: 117 RSPGGRRHRGAKAALAAAHGARRAHGLPGADRAGRGLGGEQHPDHGAPRGRPLRAQRQQA 176 Query: 195 LRGEAASGLPAREVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPG 34 L + A+G R + R P R HG+ ++ +G +GPG+ GP GPG Sbjct: 177 LHHQCAAG---RCLHRHRPHRRRQHGQRGADGLLRGALAGPGD-GPALSQDGPG 226 >UniRef50_Q8S842 Cluster: Putative uncharacterized protein OSJNBa0053D03.15; n=2; Oryza sativa|Rep: Putative uncharacterized protein OSJNBa0053D03.15 - Oryza sativa (Rice) Length = 314 Score = 34.7 bits (76), Expect = 2.5 Identities = 44/163 (26%), Positives = 56/163 (34%), Gaps = 5/163 (3%) Frame = -1 Query: 477 ARILLRTVTAGHGVCGSREIRTKIGCLVGQGXXXXXXXXXXXXE---PNRESTGGRNRGR 307 A I +R + GHG G+R +R ++ + G G P R+ G R GR Sbjct: 94 AAIRVRGGSTGHGWEGAR-VRGEVAWITGGGGGAGREGARTGSTGVGPTRQPLGPRWTGR 152 Query: 306 EDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERG-APSRP 130 A A GAP G G R R RG AA P R +RG R Sbjct: 153 TRLTPAGAPRGAPEGGLAGTADGRRRARAPMVTAGDHRRGGAA---PERAEKRGKRKGRS 209 Query: 129 EAH-GRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQEVNGSE 4 AH G + + G G G P G+ E Sbjct: 210 TAHPGTTRTAETTTGVEESGGAVRDGEDDGAPAVGGRNGGADE 252 >UniRef50_Q5GAB4 Cluster: PHANTASTICA-like protein; n=1; Selaginella kraussiana|Rep: PHANTASTICA-like protein - Selaginella kraussiana Length = 404 Score = 34.7 bits (76), Expect = 2.5 Identities = 20/79 (25%), Positives = 36/79 (45%) Frame = -2 Query: 395 LDKEVEATENEWNEGRNQTVKALEDAIEGEKTEQWRAQGQELLIQAKKENVLLQLEAAYR 216 L KE+E + WN + L + + + E+ + Q++L K L + E Y Sbjct: 278 LVKELEENKESWNVQKKNAASTLRELKQQLECERIEKRKQKMLEVESKIQALRKEEKLYL 337 Query: 215 ERLMYAYSEVKRRLDYQLE 159 ++L Y+E+ +LD E Sbjct: 338 DKLELDYAELVAKLDRDAE 356 >UniRef50_Q7QYY1 Cluster: GLP_164_20758_21504; n=1; Giardia lamblia ATCC 50803|Rep: GLP_164_20758_21504 - Giardia lamblia ATCC 50803 Length = 248 Score = 34.7 bits (76), Expect = 2.5 Identities = 33/97 (34%), Positives = 45/97 (46%), Gaps = 5/97 (5%) Frame = -1 Query: 309 REDGAVARAGTGAPHPGQEGERAPAARGRLQGE---AHVRLLRGEAASGLPAREVERGAP 139 R+ A+A GA P G+R PA +GR + E H R+ ++ AR++ A Sbjct: 116 RDQLALAAQAGGARAPLAAGDRHPAGQGREEAEEASGHRRVFGQKSGDVYGARDLGH-AL 174 Query: 138 SRPEAHGRLDSEQRDQGDHSGPGEAG--PGPLHRGPG 34 P A G L R + + PG G PGP RGPG Sbjct: 175 GAPLAPG-LGLRPRGRRGRAPPGVRGALPGP-GRGPG 209 >UniRef50_Q5TV76 Cluster: ENSANGP00000028104; n=1; Anopheles gambiae str. PEST|Rep: ENSANGP00000028104 - Anopheles gambiae str. PEST Length = 309 Score = 34.7 bits (76), Expect = 2.5 Identities = 37/125 (29%), Positives = 48/125 (38%), Gaps = 18/125 (14%) Frame = -1 Query: 324 GRNRGREDGAVARAGTGAPHPGQEGERA-----PAARGRLQGEAHVRLLR---------G 187 GR GA+ A A G++ A PA RG Q A +R +R G Sbjct: 128 GRQSAARVGALPAAAVRAVRAGRDDRPAALPGRPARRGHWQ-RARLRPVRAGNARPGDGG 186 Query: 186 EAASGLPAREVERGAPSR----PEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQE 19 AA R V RG P A D+ +R +G H G GEAG G H G+ Sbjct: 187 AAAPRAAGRRVRRGVRGARGDAPPARAAADAVRRGEGRHPGVGEAG-GARHEPESVRGEA 245 Query: 18 VNGSE 4 ++ Sbjct: 246 ARDTD 250 >UniRef50_Q29FV7 Cluster: GA17072-PA; n=1; Drosophila pseudoobscura|Rep: GA17072-PA - Drosophila pseudoobscura (Fruit fly) Length = 1903 Score = 34.7 bits (76), Expect = 2.5 Identities = 24/98 (24%), Positives = 41/98 (41%), Gaps = 1/98 (1%) Frame = -1 Query: 324 GRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERG 145 G N E G+ G + ++ + + Q +V L ++ + +R+ Sbjct: 892 GANDSSEAGSAGNKGRPLSFAEWQRKKKVHSHSQSQSPGNVDKLSQDSRESMMSRDSGGD 951 Query: 144 A-PSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPG 34 A +R + G +D + + H G G GPGP H GPG Sbjct: 952 AGDARSRSRGPMDMDDQHSRLHRGFGGDGPGP-HSGPG 988 >UniRef50_Q26634 Cluster: Alpha-1 collagen; n=4; Echinoida|Rep: Alpha-1 collagen - Strongylocentrotus purpuratus (Purple sea urchin) Length = 1414 Score = 34.7 bits (76), Expect = 2.5 Identities = 38/114 (33%), Positives = 45/114 (39%), Gaps = 5/114 (4%) Frame = -1 Query: 327 GGRNRGREDGAVARAGTGAP----HPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAR 160 G R EDG + GAP PG+ GE A QG A R G + Sbjct: 552 GQRGERGEDGG--QGSPGAPGLTGEPGKRGEPGVAGPPGPQGSAGER-----GNQGPQGQ 604 Query: 159 EVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGP-GPLHRGPGFAGQEVNGSER 1 G P P A G + QGD+ PGE+GP GP PG G+ ER Sbjct: 605 AGSMGPPGPPGASGDAGA----QGDNGPPGESGPEGP----PGARGERGAPGER 650 >UniRef50_Q20778 Cluster: Dumpy : shorter than wild-type protein 17; n=4; Caenorhabditis|Rep: Dumpy : shorter than wild-type protein 17 - Caenorhabditis elegans Length = 352 Score = 34.7 bits (76), Expect = 2.5 Identities = 32/100 (32%), Positives = 42/100 (42%), Gaps = 6/100 (6%) Frame = -1 Query: 318 NRGREDGAVARAGTG-APHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVER-- 148 ++G+ R G A PG++G +P G L E +G P +VE Sbjct: 209 SQGKPGARGMRGARGQAAMPGRDG--SPGMPGSLGPIGPPGAAGEEGPTGEPGADVEHQI 266 Query: 147 ---GAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGP 37 GA P A G +Q +QGD G AGP P RGP Sbjct: 267 GLPGAKGTPGAPGE-SGDQGEQGDRGATGIAGP-PGERGP 304 >UniRef50_Q20142 Cluster: Collagen protein 172, isoform a; n=4; Nematoda|Rep: Collagen protein 172, isoform a - Caenorhabditis elegans Length = 341 Score = 34.7 bits (76), Expect = 2.5 Identities = 29/86 (33%), Positives = 35/86 (40%), Gaps = 1/86 (1%) Frame = -1 Query: 279 TGAP-HPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPEAHGRLDSE 103 TG P PGQ+G P LQG+ +GE G+P + G P RP G + Sbjct: 202 TGPPGFPGQKGPNGPRGSPGLQGQDG---KKGE--QGMPGPQGPTGRPGRPGPKGPKGED 256 Query: 102 QRDQGDHSGPGEAGPGPLHRGPGFAG 25 R G AGP L PG G Sbjct: 257 GRVIMVAGPAGPAGPPGLPGTPGKRG 282 >UniRef50_O44174 Cluster: Collagen protein 104; n=2; Caenorhabditis|Rep: Collagen protein 104 - Caenorhabditis elegans Length = 281 Score = 34.7 bits (76), Expect = 2.5 Identities = 29/88 (32%), Positives = 35/88 (39%), Gaps = 4/88 (4%) Frame = -1 Query: 264 PGQEGERAPAARGRLQGEAHVRLLRG-EAASGLPAREVERGAPSRPEAHGRLDSEQRD-- 94 PG G P +G LRG + G P G P P +G + RD Sbjct: 141 PGFPGPAGPPGAPGDKGNDGAPGLRGPDGHPGHPGNPGYGGEPGAPGNNGEPGDKGRDGS 200 Query: 93 QGDHSGPGEAGP-GPLHRGPGFAGQEVN 13 G PG GP GP+ + PGF GQ N Sbjct: 201 HGTKGAPGPQGPSGPVGQ-PGFPGQPGN 227 >UniRef50_Q8NFW1 Cluster: Collagen alpha-1(XXII) chain; n=23; Euteleostomi|Rep: Collagen alpha-1(XXII) chain - Homo sapiens (Human) Length = 1626 Score = 34.7 bits (76), Expect = 2.5 Identities = 33/118 (27%), Positives = 45/118 (38%), Gaps = 4/118 (3%) Frame = -1 Query: 342 NRESTGGRNRGREDGAVARAGT-GAP-HPGQEGERAPAARGRLQGEAHVRLLRGEAASGL 169 ++ S G R G AG GAP +PG+ G L + LL + + Sbjct: 1058 DKGSPGSRGLPGFPGPQGPAGRDGAPGNPGERGPPGKPGLSSLLSPGDINLLAKDVCNDC 1117 Query: 168 PAREVERGAPSRPEAHGR--LDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQEVNGSER 1 P G P P G + + +G GEAGP L PG AG + + ER Sbjct: 1118 PPGPP--GLPGLPGFKGDKGVPGKPGREGTEGKKGEAGPPGLPGPPGIAGPQGSQGER 1173 Score = 33.9 bits (74), Expect = 4.4 Identities = 30/100 (30%), Positives = 41/100 (41%), Gaps = 6/100 (6%) Frame = -1 Query: 306 EDGAVARAGTGAPHPGQEGERAPAARGRLQGEA-HVRLLRGEAASGLPAREVERGAPSRP 130 + G +G P G++G+ PA + G L+GE G P +GAP P Sbjct: 618 QQGRPGPSGVAGPQ-GEKGDVGPAGPPGVPGSVVQQEGLKGE--QGAPGPRGHQGAPGPP 674 Query: 129 EAHGRLDSEQRD-----QGDHSGPGEAGPGPLHRGPGFAG 25 A G + E RD QG G+ GP + PG G Sbjct: 675 GARGPIGPEGRDGPPGLQGLRGKKGDMGPPGI---PGLLG 711 >UniRef50_Q7SAE9 Cluster: Putative uncharacterized protein NCU07002.1; n=1; Neurospora crassa|Rep: Putative uncharacterized protein NCU07002.1 - Neurospora crassa Length = 2573 Score = 34.7 bits (76), Expect = 2.5 Identities = 27/87 (31%), Positives = 33/87 (37%), Gaps = 1/87 (1%) Frame = -1 Query: 330 TGGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQG-EAHVRLLRGEAASGLPAREV 154 TGG +R R G + AG+G+ H G G+ P R H G SG Sbjct: 2448 TGGSSRDRRSGRDSNAGSGSGHRGGGGDMPPPPPSREPSHRGHGSHRGGGDGSGHGQGGA 2507 Query: 153 ERGAPSRPEAHGRLDSEQRDQGDHSGP 73 R + HGR E R SGP Sbjct: 2508 PPMGGGRSDGHGRPPRESRSSRP-SGP 2533 >UniRef50_Q96JG9 Cluster: Zinc finger protein 469; n=5; Eutheria|Rep: Zinc finger protein 469 - Homo sapiens (Human) Length = 3925 Score = 34.7 bits (76), Expect = 2.5 Identities = 24/89 (26%), Positives = 35/89 (39%), Gaps = 1/89 (1%) Frame = -1 Query: 315 RGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPS 136 RG E+G P P +G +P + G + + G+ S + PS Sbjct: 3685 RGTENGMKPATPKAKPGPSSQGSGSPRPGTKTGGGSQPQPASGQLQSETATTPAKPSFPS 3744 Query: 135 RPEAHGRLDSEQRDQGDHSGPGEAG-PGP 52 R A RL + + + GP EAG GP Sbjct: 3745 RSPAPERLPARAQAKSCTKGPREAGEQGP 3773 >UniRef50_P31568 Cluster: Protein ycf2; n=1; Oenothera picensis|Rep: Protein ycf2 - Oenothera picensis (Oenothera odoarata) Length = 721 Score = 34.7 bits (76), Expect = 2.5 Identities = 18/50 (36%), Positives = 29/50 (58%) Frame = -2 Query: 392 DKEVEATENEWNEGRNQTVKALEDAIEGEKTEQWRAQGQELLIQAKKENV 243 ++EVE TE+E EG + V+ E+ +EG TE +G E ++ +E V Sbjct: 284 EEEVEGTEDEEVEGTEEEVEGTEEEVEG--TEDEEVEGTEEEVEGTEEEV 331 Score = 34.7 bits (76), Expect = 2.5 Identities = 18/50 (36%), Positives = 29/50 (58%) Frame = -2 Query: 392 DKEVEATENEWNEGRNQTVKALEDAIEGEKTEQWRAQGQELLIQAKKENV 243 ++EVE TE+E EG + V+ E+ +EG TE +G E ++ +E V Sbjct: 306 EEEVEGTEDEEVEGTEEEVEGTEEEVEG--TEDEEVEGTEEEVEGTEEEV 353 >UniRef50_UPI0000F1ED14 Cluster: PREDICTED: similar to autoantigen; n=5; Danio rerio|Rep: PREDICTED: similar to autoantigen - Danio rerio Length = 1375 Score = 34.3 bits (75), Expect = 3.3 Identities = 31/85 (36%), Positives = 36/85 (42%), Gaps = 2/85 (2%) Frame = -1 Query: 267 HPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPEAHGRLDSEQRDQG 88 HPGQEG R P G + RG P RG P P + E+ G Sbjct: 679 HPGQEGPRGPKGSAGESGSDGLPGPRGREGPAGP-----RGEPGPPGIGEK--GEKGSFG 731 Query: 87 DHSGPGEAG-PGPLHRGP-GFAGQE 19 D PG AG PGP+ GP G AG + Sbjct: 732 DVGAPGIAGPPGPV--GPKGDAGAQ 754 >UniRef50_UPI0000F1E5D4 Cluster: PREDICTED: similar to collagen, type VII, alpha 1 (epidermolysis bullosa, dystrophic, dominant and recessive),; n=1; Danio rerio|Rep: PREDICTED: similar to collagen, type VII, alpha 1 (epidermolysis bullosa, dystrophic, dominant and recessive), - Danio rerio Length = 1641 Score = 34.3 bits (75), Expect = 3.3 Identities = 36/113 (31%), Positives = 49/113 (43%), Gaps = 15/113 (13%) Frame = -1 Query: 300 GAVARAGT-GAP----HPGQEGERAPAARGRLQGEAHVR----LLRGEAAS----GLPAR 160 GAV G+ G P PG+ G P +GE + LL+GEA S GLP R Sbjct: 1061 GAVGLPGSQGLPGIRGDPGEPGLIGPQGSKGEKGEREMMDCYYLLQGEAGSAIGGGLPGR 1120 Query: 159 EVERGAPSRPEAHGR--LDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQEVNGS 7 + E G P P GR ++ + + G PG+ G PG + + GS Sbjct: 1121 KGEPGIPGIPGTPGRQGVNGAKGEPGARGLPGQDGRPGSQGTPGLSIKGDKGS 1173 >UniRef50_UPI0000EBCFCF Cluster: PREDICTED: hypothetical protein; n=1; Bos taurus|Rep: PREDICTED: hypothetical protein - Bos taurus Length = 441 Score = 34.3 bits (75), Expect = 3.3 Identities = 38/115 (33%), Positives = 47/115 (40%), Gaps = 3/115 (2%) Frame = -1 Query: 336 ESTGGRNRGRE---DGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLP 166 +ST G R R GA+ GT +P PG RA +ARGR G A G A G Sbjct: 277 KSTVGTVRNRRIPGGGAMLAVGTRSP-PG----RAESARGREPGRAGSG---GPGAGGRG 328 Query: 165 AREVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQEVNGSER 1 A G S G + +G+ GPG GP PG+ G G+ R Sbjct: 329 AGRRGGGEKSERRKRGEGTRGEGARGEPGGPGRTGP------PGWLGSARLGAVR 377 >UniRef50_UPI0000EBC639 Cluster: PREDICTED: hypothetical protein; n=1; Bos taurus|Rep: PREDICTED: hypothetical protein - Bos taurus Length = 234 Score = 34.3 bits (75), Expect = 3.3 Identities = 27/94 (28%), Positives = 39/94 (41%), Gaps = 1/94 (1%) Frame = -1 Query: 339 RESTGGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAR 160 R + G RNR ++ A A AG P ++ + RGR G + R G+ A R Sbjct: 142 RSALGARNR-QQRRARAAAGARGRGPRKQQRESRGRRGRATGASSPRRRSGDFAGDKAHR 200 Query: 159 EVERGAPSRPEAHGRLDSEQR-DQGDHSGPGEAG 61 E P+R RL + R +G + G G Sbjct: 201 ERRDARPARARGPRRLSAPSRASRGLQARAGAGG 234 >UniRef50_UPI0000DB7985 Cluster: PREDICTED: similar to Collagen type IV CG4145-PA, isoform A isoform 1; n=1; Apis mellifera|Rep: PREDICTED: similar to Collagen type IV CG4145-PA, isoform A isoform 1 - Apis mellifera Length = 1913 Score = 34.3 bits (75), Expect = 3.3 Identities = 29/97 (29%), Positives = 41/97 (42%), Gaps = 11/97 (11%) Frame = -1 Query: 264 PGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPEAHGRLD-------- 109 PG +GER P LQG+ + G A G + ++G P G L+ Sbjct: 320 PGSKGERGPPGPIGLQGQKGEKGNMGLAFEGPKGDKGQKGEIGPPGTPGPLEPFQGMREE 379 Query: 108 --SEQRDQGDHSGPGEAGPGPLHRGPGFAG-QEVNGS 7 + Q D+G+ GE GP PG G Q ++GS Sbjct: 380 VIAPQGDRGEKGDKGEMGPDGFKGEPGPIGDQGISGS 416 Score = 33.5 bits (73), Expect = 5.8 Identities = 23/82 (28%), Positives = 33/82 (40%), Gaps = 1/82 (1%) Frame = -1 Query: 261 GQEGERAPAARGRLQGEAHVRLLRG-EAASGLPAREVERGAPSRPEAHGRLDSEQRDQGD 85 G +GE P +G+ + G +G+P + GAP P G + + G Sbjct: 202 GDKGEPGPQGPRGTKGDRGKMGIPGFTGIAGVPGVQGPPGAPGIPGRDG-CNGTDGEPGA 260 Query: 84 HSGPGEAGPGPLHRGPGFAGQE 19 PGE GP PG GQ+ Sbjct: 261 RGYPGEVGPRGFRGPPGLKGQK 282 >UniRef50_Q4T971 Cluster: Chromosome undetermined SCAF7635, whole genome shotgun sequence; n=2; Tetraodontidae|Rep: Chromosome undetermined SCAF7635, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 715 Score = 34.3 bits (75), Expect = 3.3 Identities = 30/111 (27%), Positives = 40/111 (36%) Frame = -1 Query: 333 STGGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREV 154 S G R + G AG P +GE H LL + G+ Sbjct: 149 SPGARGKRGPVGLPGAAGPRGPPGVYQGEELCPNACSTGRTGHPGLLGMKGHKGVKG--- 205 Query: 153 ERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQEVNGSER 1 E G P R + H + +Q QG G GPG + PG G + +G R Sbjct: 206 ESGEPGR-QGHKGEEGDQGPQGVVGAQGPPGPGGVRGFPGIMGSKGDGGPR 255 Score = 34.3 bits (75), Expect = 3.3 Identities = 36/118 (30%), Positives = 48/118 (40%), Gaps = 10/118 (8%) Frame = -1 Query: 327 GGRNRGREDGAVARAGT-GAPH-PGQEGERAPAARGRL---QGEAHVRLLRGEAAS---- 175 G + G GA G GA PG G+R P + QG+ +RG + Sbjct: 247 GSKGDGGPRGAPGDVGPRGAQGGPGDAGQRGPTGEPGIPGAQGDGGPPGIRGAPGAKGDP 306 Query: 174 GLPAREVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAGQE-VNGSE 4 GLP + G P P + G G PG+AGP L PG GQ+ V+G + Sbjct: 307 GLPGPDGREGIPGLPGSKGL-------PGKSGAPGDAGPQGLPGLPGAYGQKGVSGEK 357 Score = 33.1 bits (72), Expect = 7.6 Identities = 29/87 (33%), Positives = 35/87 (40%), Gaps = 2/87 (2%) Frame = -1 Query: 315 RGREDGAVARAGTGAP-HPGQEGERAPAARGRLQGEAHVRLLRGE-AASGLPAREVERGA 142 RG R G P PG G + L G A R +G+ A GL + E+G Sbjct: 399 RGGPGSRGPRGHDGPPGSPGARGSKGDPGLPGLPGPAGYRGQKGDRGAVGLDGPKGEQG- 457 Query: 141 PSRPEAHGRLDSEQRDQGDHSGPGEAG 61 P+ E E D GDH PGE G Sbjct: 458 PAGAEG---TSGEPGDVGDHGEPGEKG 481 >UniRef50_Q4SHQ0 Cluster: Chromosome 5 SCAF14581, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome 5 SCAF14581, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 665 Score = 34.3 bits (75), Expect = 3.3 Identities = 23/93 (24%), Positives = 37/93 (39%) Frame = -1 Query: 312 GREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSR 133 G + A + G P + + + RG + ++ G PA GA S Sbjct: 94 GNQSDASMHSDQGDNDPSDAEQHSGSERGHQDEDEDDEDAGHQSDGGSPAGSGVSGAGSG 153 Query: 132 PEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPG 34 G + S++ + D P ++GPG H GPG Sbjct: 154 RSERGSVRSDRSPRSDPGTP-QSGPGTPHSGPG 185 >UniRef50_Q4RAQ5 Cluster: Chromosome undetermined SCAF23104, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome undetermined SCAF23104, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 206 Score = 34.3 bits (75), Expect = 3.3 Identities = 32/102 (31%), Positives = 40/102 (39%), Gaps = 8/102 (7%) Frame = -1 Query: 339 RESTGGRNRGREDGAVARAGTGAPHPGQEGERA------PAARGRLQGEAHVRLLRGEAA 178 R+ GG+ +E+ A G+ PG EG +A A R GE R GE Sbjct: 83 RDGDGGQGTSQEEERGAEEAGGSQGPGTEGGQARGRGGQRATRRGRAGEGQERGREGEQQ 142 Query: 177 S-GLPARE-VERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGP 58 S G RE P R A L R + D +G AGP Sbjct: 143 SRGREGREGTAEPGPPRRAAPSTLGENLRRELDRAGSEGAGP 184 >UniRef50_Q9A567 Cluster: Putative uncharacterized protein; n=1; Caulobacter vibrioides|Rep: Putative uncharacterized protein - Caulobacter crescentus (Caulobacter vibrioides) Length = 362 Score = 34.3 bits (75), Expect = 3.3 Identities = 32/113 (28%), Positives = 43/113 (38%), Gaps = 3/113 (2%) Frame = -1 Query: 345 PNRESTGGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLP 166 P RE +R + G A P+ Q+G+R P H RL EAA + Sbjct: 33 PRREEDRVHHRAEQRGQGADHRQPGPYLPQDGQRRPR---------HERL---EAALAVQ 80 Query: 165 AREVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGP---LHRGPGFAGQEV 16 R+ G RP+ H R + D D G A P L + PG E+ Sbjct: 81 GRDRPAGGEDRPQHHSRHEDLHADDEDDEQQGHAQKPPALTLGQAPGLKEPEL 133 Score = 33.5 bits (73), Expect = 5.8 Identities = 26/66 (39%), Positives = 29/66 (43%), Gaps = 4/66 (6%) Frame = -1 Query: 222 LQGEAHV---RLLRGEAASGLPAREVERGAPSRPEAHGRLDSEQRDQG-DHSGPGEAGPG 55 L+GE + R R AASGL R P R E +EQR QG DH PG P Sbjct: 3 LRGERWISSNRNARATAASGLRGERAMRRRPRREEDRVHHRAEQRGQGADHRQPGPYLPQ 62 Query: 54 PLHRGP 37 R P Sbjct: 63 DGQRRP 68 >UniRef50_Q2S573 Cluster: SpoOJ protein; n=1; Salinibacter ruber DSM 13855|Rep: SpoOJ protein - Salinibacter ruber (strain DSM 13855) Length = 326 Score = 34.3 bits (75), Expect = 3.3 Identities = 21/52 (40%), Positives = 23/52 (44%) Frame = -1 Query: 213 EAHVRLLRGEAASGLPAREVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGP 58 EA V LL A L REVER A E G DSE + D + E P Sbjct: 221 EAQVALLEETIAEDLSVREVERRARQWHEDEGAADSEAAEGADDTAVPETAP 272 >UniRef50_Q3W4Q1 Cluster: Protein kinase; n=1; Frankia sp. EAN1pec|Rep: Protein kinase - Frankia sp. EAN1pec Length = 870 Score = 34.3 bits (75), Expect = 3.3 Identities = 29/95 (30%), Positives = 34/95 (35%), Gaps = 1/95 (1%) Frame = -1 Query: 291 ARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSRPEAHGRL 112 A+A G P G G P + G G R G +SG P GA P GRL Sbjct: 435 AQALAGPPAGGSGGLSGPGSPGGAGGPGSRRGAGGPESSGAPGSPGAAGASDEP---GRL 491 Query: 111 DSEQRDQGDHSGPGEAGPGP-LHRGPGFAGQEVNG 10 D+ G + G P P G G NG Sbjct: 492 DAAGAAAGYDTSGGLGTPAPSAEDGAGMPESVANG 526 >UniRef50_Q08VS0 Cluster: Putative uncharacterized protein; n=1; Stigmatella aurantiaca DW4/3-1|Rep: Putative uncharacterized protein - Stigmatella aurantiaca DW4/3-1 Length = 567 Score = 34.3 bits (75), Expect = 3.3 Identities = 28/103 (27%), Positives = 37/103 (35%), Gaps = 2/103 (1%) Frame = -1 Query: 336 ESTGGRNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVR--LLRGEAASGLPA 163 E G RG+ + G H E E P A LQ H + A G P Sbjct: 26 EDAEGHERGQVPRQPTQHAGGREHEDGEREVTPEAEATLQPPRHGDDDHVGHHVARGHPG 85 Query: 162 REVERGAPSRPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPG 34 ++RGA +RP+ R + + H G G G G G Sbjct: 86 DLIQRGAKARPDVVERHVDDGDVEHRHQRRGHGGDGDACLGAG 128 >UniRef50_A7HAA8 Cluster: Ribonuclease R; n=4; cellular organisms|Rep: Ribonuclease R - Anaeromyxobacter sp. Fw109-5 Length = 910 Score = 34.3 bits (75), Expect = 3.3 Identities = 36/107 (33%), Positives = 41/107 (38%), Gaps = 2/107 (1%) Frame = -1 Query: 339 RESTGGRNRGREDGAVARAGTGAPHPGQEGE-RAPAARGRLQGEAHVRLLRGEAASGLPA 163 RE G R RGR R G G G EG R ARGR +GE RG A G Sbjct: 802 REQQGER-RGRGGEGRGRGGEGRGRGGGEGRGRGGEARGR-RGEGRG---RGGEARGRGG 856 Query: 162 REVERGAPSRP-EAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAG 25 RG R A G++ S + G G G+ P G G Sbjct: 857 EARGRGPSERAGGAPGKVGSRRGPPGRKGGAGKGAAKPSKGAKGRGG 903 >UniRef50_A7DM25 Cluster: FMN-binding domain protein; n=2; Methylobacterium extorquens PA1|Rep: FMN-binding domain protein - Methylobacterium extorquens PA1 Length = 847 Score = 34.3 bits (75), Expect = 3.3 Identities = 17/43 (39%), Positives = 18/43 (41%) Frame = -1 Query: 288 RAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAR 160 R G AP P +G P RGR G H R RG P R Sbjct: 106 RRGDPAPDPQHQGRPLPLGRGRAPGRRHARAGRGRRPVTRPGR 148 >UniRef50_A5P662 Cluster: Tetratricopeptide TPR_2 repeat protein; n=1; Methylobacterium sp. 4-46|Rep: Tetratricopeptide TPR_2 repeat protein - Methylobacterium sp. 4-46 Length = 425 Score = 34.3 bits (75), Expect = 3.3 Identities = 32/97 (32%), Positives = 40/97 (41%), Gaps = 2/97 (2%) Frame = -1 Query: 309 REDGAVA-RAGTGAPHPGQEGER-APAARGRLQGEAHVRLLRGEAASGLPAREVERGAPS 136 R GA+A R G A PG+E R P A+ Q + G A++ PAR P Sbjct: 9 RGGGALALRRGRRAAEPGREPPRIVPPAQ---QQHGLLPAHPGPASARDPARPARPRPPP 65 Query: 135 RPEAHGRLDSEQRDQGDHSGPGEAGPGPLHRGPGFAG 25 R R + G G + GP P RGP AG Sbjct: 66 RGAGPRRGRPRRARPGPRRGRRDRGPAPRGRGPRRAG 102 >UniRef50_A5NRS7 Cluster: LigA; n=1; Methylobacterium sp. 4-46|Rep: LigA - Methylobacterium sp. 4-46 Length = 355 Score = 34.3 bits (75), Expect = 3.3 Identities = 33/106 (31%), Positives = 37/106 (34%), Gaps = 7/106 (6%) Frame = -1 Query: 306 EDGAVARA-GTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEA-ASGLPAREVERGAPSR 133 + G VA G G PG+ E GR E + GE A P+RE Sbjct: 167 DPGPVAGCLGAGLAEPGERREPDRRQGGRQGDERRDQRAAGEDHAPASPSREAAPTPRLT 226 Query: 132 PEAHGRLDSEQRDQGDHSGPGEAGPGP-----LHRGPGFAGQEVNG 10 P G R G S P AGPG L RGP G G Sbjct: 227 PRRVGLEPGAGRGTGTESRPRAAGPGSRDGARLRRGPASTGSGFGG 272 >UniRef50_A4X268 Cluster: Putative uncharacterized protein; n=1; Salinispora tropica CNB-440|Rep: Putative uncharacterized protein - Salinispora tropica CNB-440 Length = 3754 Score = 34.3 bits (75), Expect = 3.3 Identities = 32/95 (33%), Positives = 40/95 (42%), Gaps = 5/95 (5%) Frame = -1 Query: 303 DGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGAPSR-PE 127 D V R T + PG + + G G+A +RLL A G PA + +R P+ Sbjct: 1394 DVCVERGLTASDLPGLDRDLI----GETHGDADIRLLPPAAGPGRPAGQPDRERRHHGPD 1449 Query: 126 AHGRLDSEQR----DQGDHSGPGEAGPGPLHRGPG 34 A G L S R G G GPG RGPG Sbjct: 1450 AGGDLGSVDRRVDVPSGRGGGVSVGGPGD-GRGPG 1483 >UniRef50_A0U0Q4 Cluster: Polysaccharide biosynthesis protein precursor; n=3; Burkholderia cepacia complex|Rep: Polysaccharide biosynthesis protein precursor - Burkholderia cenocepacia MC0-3 Length = 870 Score = 34.3 bits (75), Expect = 3.3 Identities = 28/84 (33%), Positives = 37/84 (44%) Frame = -1 Query: 321 RNRGREDGAVARAGTGAPHPGQEGERAPAARGRLQGEAHVRLLRGEAASGLPAREVERGA 142 R+RG E + A A + +R PAA+ R + E V AS AR V R Sbjct: 169 RDRGAEARSRPPAADAARARRAQADRLPAAKHRARREPAV----ARRASADDARVVRRRV 224 Query: 141 PSRPEAHGRLDSEQRDQGDHSGPG 70 +R GRL QR+ +GPG Sbjct: 225 RTRRPV-GRLPRRQREHAPRAGPG 247 Database: uniref50 Posted date: Oct 5, 2007 11:19 AM Number of letters in database: 575,637,011 Number of sequences in database: 1,657,284 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.279 0.0580 0.190 Matrix: BLOSUM62 Gap Penalties: Existence: 9, Extension: 2 Number of Hits to DB: 768,026,677 Number of Sequences: 1657284 Number of extensions: 16332894 Number of successful extensions: 72614 Number of sequences better than 10.0: 430 Number of HSP's better than 10.0 without gapping: 65556 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 71954 length of database: 575,637,011 effective HSP length: 99 effective length of database: 411,565,895 effective search space used: 62558016040 frameshift window, decay const: 40, 0.1 T: 12 A: 40 X1: 16 ( 7.3 bits) X2: 37 (14.9 bits) X3: 62 (25.0 bits) S1: 41 (21.7 bits)
- SilkBase 1999-2023 -