BLASTX 2.2.12 [Aug-07-2005]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= epV30264
         (516 letters)

Database: uniref50
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

UniRef50_UPI00015B4AC8 Cluster: PREDICTED: similar to sarcalumen...   280   1e-74
UniRef50_Q8I0D4 Cluster: CG9297-PB, isoform B; n=11; Endopterygo...   274   8e-73
UniRef50_Q86TD4 Cluster: Sarcalumenin precursor; n=40; Euteleost...   149   5e-35
UniRef50_Q54ST5 Cluster: Putative uncharacterized protein; n=1; ...    93   3e-18
UniRef50_Q584E9 Cluster: Sarcoplasmic reticulum glycoprotein, pu...    89   7e-17
UniRef50_Q4QDJ3 Cluster: Sarcoplasmic reticulum glycoprotein, pu...    84   2e-15
UniRef50_Q4DYK9 Cluster: Sarcoplasmic reticulum glycoprotein, pu...    81   1e-14
UniRef50_Q3EAA4 Cluster: Uncharacterized protein At4g05520.1; n=...    79   4e-14
UniRef50_Q4RPD4 Cluster: Chromosome 1 SCAF15008, whole genome sh...    78   1e-13
UniRef50_Q4Q2X3 Cluster: Putative uncharacterized protein; n=3; ...    78   1e-13
UniRef50_Q9NZN3 Cluster: EH domain-containing protein 3; n=136; ...    71   1e-11
UniRef50_UPI0000EBE67E Cluster: PREDICTED: similar to EHD1 prote...    71   2e-11
UniRef50_UPI0000498DD7 Cluster: EH-domain containing protein; n=...    69   6e-11
UniRef50_Q94919 Cluster: PAST-1; n=3; Coelomata|Rep: PAST-1 - Dr...    69   8e-11
UniRef50_O96909 Cluster: Sarcalumenin/eps15 homolog; n=7; Plasmo...    67   2e-10
UniRef50_UPI000065D43F Cluster: EH domain-containing protein 3.;...    62   1e-08
UniRef50_UPI0000E81040 Cluster: PREDICTED: sarcalumenin; n=1; Ga...    56   6e-07
UniRef50_UPI000065D539 Cluster: EH domain-containing protein 2.;...    52   1e-05
UniRef50_Q259P6 Cluster: H0818H01.5 protein; n=3; Oryza sativa|R...    48   2e-04
UniRef50_Q57XN3 Cluster: Putative uncharacterized protein; n=1; ...    47   2e-04
UniRef50_Q4QBU8 Cluster: Putative uncharacterized protein; n=3; ...    46   7e-04
UniRef50_Q4T253 Cluster: Chromosome undetermined SCAF10336, whol...    45   0.001
UniRef50_Q4DH62 Cluster: Putative uncharacterized protein; n=2; ...    42   0.006
UniRef50_UPI0000F21E62 Cluster: PREDICTED: hypothetical protein,...    42   0.011
UniRef50_A4J1X0 Cluster: Transcriptional regulator, XRE family; ...    41   0.015
UniRef50_A1I7P4 Cluster: Putative uncharacterized protein; n=1; ...    40   0.025
UniRef50_Q1EY64 Cluster: Rubrerythrin; n=1; Clostridium oremland...    37   0.24
UniRef50_O62361 Cluster: Putative uncharacterized protein fbxa-1...    37   0.31
UniRef50_A0CC98 Cluster: Chromosome undetermined scaffold_167, w...    36   0.55
UniRef50_Q9ZVE3 Cluster: Putative uncharacterized protein At2g06...    36   0.72
UniRef50_UPI0000D56BD7 Cluster: PREDICTED: similar to 5-azacytid...    35   0.96
UniRef50_Q9KBP6 Cluster: Transcriptional regulator; n=1; Bacillu...    35   1.3
UniRef50_A5Z8L6 Cluster: Putative uncharacterized protein; n=1; ...    35   1.3
UniRef50_Q0DBA9 Cluster: Os06g0585900 protein; n=21; Oryza sativ...    35   1.3
UniRef50_Q3SDK7 Cluster: Rab_C86 protein; n=2; Paramecium tetrau...    35   1.3
UniRef50_Q3IVV0 Cluster: Putative uncharacterized protein; n=2; ...    34   1.7
UniRef50_A1FWD0 Cluster: Putative uncharacterized protein; n=1; ...    34   2.2
UniRef50_Q9LP19 Cluster: F14D7.7 protein; n=2; Arabidopsis thali...    34   2.2
UniRef50_Q64DK1 Cluster: Putative uncharacterized protein; n=1; ...    34   2.2
UniRef50_A3DGE1 Cluster: ABC transporter related protein; n=2; C...    33   2.9
UniRef50_Q9XE96 Cluster: Putative uncharacterized protein T19B17...    33   2.9
UniRef50_UPI00015B46B2 Cluster: PREDICTED: similar to ankyrin re...    33   3.9
UniRef50_Q9SSD1 Cluster: Protein TOO MANY MOUTHS precursor; n=2;...    33   3.9
UniRef50_Q5R050 Cluster: Sensor protein; n=2; Idiomarina|Rep: Se...    33   5.1
UniRef50_Q30NU3 Cluster: ABC transporter-related protein; n=1; T...    33   5.1
UniRef50_A2E2M8 Cluster: Putative uncharacterized protein; n=1; ...    33   5.1
UniRef50_UPI0001555AA6 Cluster: PREDICTED: similar to hepatocell...    32   6.8
UniRef50_Q2LTL3 Cluster: Metal-dependent phosphohydrolase; n=1; ...    32   6.8
UniRef50_A0BCY4 Cluster: Chromosome undetermined scaffold_10, wh...    32   6.8
UniRef50_UPI00006A2CC4 Cluster: UPI00006A2CC4 related cluster; n...    32   8.9
UniRef50_Q4T6A1 Cluster: Chromosome undetermined SCAF8850, whole...    32   8.9
UniRef50_Q8RAS5 Cluster: GTPases; n=3; Firmicutes|Rep: GTPases -...    32   8.9
UniRef50_Q2IL99 Cluster: ABC transporter, ATPase subunit; n=2; B...    32   8.9
UniRef50_A1DA37 Cluster: Putative uncharacterized protein; n=1; ...    32   8.9
UniRef50_A7I6U0 Cluster: Dynamin family protein; n=1; Candidatus...    32   8.9
UniRef50_Q44633 Cluster: Probable tRNA modification GTPase trmE;...    32   8.9

>UniRef50_UPI00015B4AC8 Cluster: PREDICTED: similar to sarcalumenin; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to sarcalumenin - Nasonia vitripennis
          Length = 884

 Score = 280 bits (687), Expect = 1e-74
 Identities = 131/168 (77%), Positives = 148/168 (88%)
 Frame = +1

Query: 13  EIPENLRSREHINQILRLDEEASEAEITVERSADIVLRDLKRLYENSIKPLEGLYKYRDL 192
           +IPE LR R+HINQ+L+LDEE    E  VE+ AD+VLRDLKRLY+N+IKPLE LYKYRDL
Sbjct: 416 DIPEILRPRDHINQLLKLDEENELKEKAVEKVADVVLRDLKRLYDNAIKPLETLYKYRDL 475

Query: 193 SNRHFGDPEIFSKPLVLFMGPWSGGKSSIVNYLTGLEFTEWSLRTGAEPSPAYFNILMHG 372
           SNRHFGDPEIFSKPLVLFMGPWSGGKSSI+NYL  +E+   SLRTGAEPSPAYFNILMHG
Sbjct: 476 SNRHFGDPEIFSKPLVLFMGPWSGGKSSIINYLLNIEYKPTSLRTGAEPSPAYFNILMHG 535

Query: 373 KDPQVLDGTQLAADWTFSGLQKFGQGLEERLRGLKHPSKILEKVNIVE 516
           ++ +VLDGTQLAADWTFSGLQKFGQGL +RL+G K  SK+LEKVNIVE
Sbjct: 536 EEEEVLDGTQLAADWTFSGLQKFGQGLLDRLKGYKLKSKLLEKVNIVE 583

>UniRef50_Q8I0D4 Cluster: CG9297-PB, isoform B; n=11; Endopterygota|Rep: CG9297-PB, isoform B - Drosophila melanogaster (Fruit fly)
          Length = 952

 Score = 274 bits (672), Expect = 8e-73
 Identities = 129/172 (75%), Positives = 148/172 (86%)
 Frame = +1

Query: 1
LLTGEIPENLRSREHINQILRLDEEASEAEITVERSADIVLRDLKRLYENSIKPLEGLYK 180 L G IPEN RSR+HI ++L+LDEE + E + A+I+LRD+KR+YEN++KPLE LYK Sbjct: 496 LWEGHIPENRRSRQHITELLQLDEEFNAREKATDNVAEIILRDIKRIYENAVKPLETLYK 555 Query: 181 YRDLSNRHFGDPEIFSKPLVLFMGPWSGGKSSIVNYLTGLEFTEWSLRTGAEPSPAYFNI 360 YRDLSNRHF DPEIFSKPL+LFMGPWSGGKSSI+NYLT E+T SLRTGAEPSPAYFNI Sbjct: 556 YRDLSNRHFSDPEIFSKPLILFMGPWSGGKSSILNYLTDNEYTPNSLRTGAEPSPAYFNI 615 Query: 361 LMHGKDPQVLDGTQLAADWTFSGLQKFGQGLEERLRGLKHPSKILEKVNIVE 516 LM G + +VLDGTQLAAD+TF+GLQKFGQGLEERLRGLK SKILEKVNIVE Sbjct: 616 LMWGNETEVLDGTQLAADYTFAGLQKFGQGLEERLRGLKMKSKILEKVNIVE 667 >UniRef50_Q86TD4 Cluster: Sarcalumenin precursor; n=40; Euteleostomi|Rep: Sarcalumenin precursor - Homo sapiens (Human) Length = 932 Score = 149 bits (360), Expect = 5e-35 Identities = 76/165 (46%), Positives = 108/165 (65%), Gaps = 2/165 (1%) Frame = +1 Query: 28 LRSREHINQILRLDEEASEAEITVERSADIVLRDLKRLYENSIKPLEGLYKYRDLSNRHF 207 LR R HI + L L+E+ + + VL+ L+++Y +SIKPLE YKY +L Sbjct: 491 LRDRSHIEKTLMLNEDKPSDDYSA------VLQRLRKIYHSSIKPLEQSYKYNELRQHEI 544 Query: 208 GDPEIFSKPLVLFMGPWSGGKSSIVNYLTGLEFTEWSLRTGAEPSPAYFNILMHGKDPQV 387 D EI SKP+VLF+GPWS GKS+++NYL GLE T + L TGAEP+ + F +LMHG + Sbjct: 545 TDGEITSKPMVLFLGPWSVGKSTMINYLLGLENTRYQLYTGAEPTTSEFTVLMHGPKLKT 604 Query: 388 LDGTQLAAD--WTFSGLQKFGQGLEERLRGLKHPSKILEKVNIVE 516 ++G +AAD +FS L+KFGQ E+L G++ P K+LE+V V+ Sbjct: 605 IEGIVMAADSARSFSPLEKFGQNFLEKLIGIEVPHKLLERVTFVD 649 >UniRef50_Q54ST5 Cluster: Putative uncharacterized protein; n=1; Dictyostelium discoideum AX4|Rep: Putative uncharacterized protein - Dictyostelium discoideum AX4 Length = 568 Score = 93.5 bits (222), Expect = 3e-18 Identities = 51/132 (38%), Positives = 74/132 (56%), Gaps = 3/132 (2%) Frame = +1 Query: 130 LKRLYENSIKPLEGLYKYRDLSNRHFGDPEIFSKPLVLFMGPWSGGKSSIVNYLTGLEFT 309 LK LY + IKPLE L K+ D + D +I +KP++L +G +S GK+S +NYL F Sbjct: 23 LKSLYSSKIKPLEQLTKFGDFFSPTLTDADIAAKPMILLLGQYSTGKTSFINYLLEKPFV 82 Query: 310 EWSLRTGAEPSPAYFNILMHGKDPQVLDGTQL---AADWTFSGLQKFGQGLEERLRGLKH 480 EPS 
FN +MHG D ++L G + + D+ F GL+KFG G R + Sbjct: 83 --GSNVAVEPSTDRFNAVMHGTDDRILPGNIVCVQSQDFPFKGLEKFGNGFMGRFQCSLS 140 Query: 481 PSKILEKVNIVE 516 + ILEKV+ ++ Sbjct: 141 NAPILEKVSFID 152 >UniRef50_Q584E9 Cluster: Sarcoplasmic reticulum glycoprotein, putative; n=2; Trypanosoma|Rep: Sarcoplasmic reticulum glycoprotein, putative - Trypanosoma brucei Length = 624 Score = 88.6 bits (210), Expect = 7e-17 Identities = 51/157 (32%), Positives = 83/157 (52%), Gaps = 2/157 (1%) Frame = +1 Query: 52 QILRLDEEASEAEITVERSADIVLRDLKRLYENSIKPLEGLYKYRDLSNRHFGDPEIFSK 231 +++R E+ E +++ + VL +L+++Y I+P+E + Y FG+ + K Sbjct: 6 KLVRKAEDLQE-DLSWDSHIKHVLIELEKIYFQRIRPIEVKFDYDMCCPSWFGESMVQKK 64 Query: 232 PLVLFMGPWSGGKSSIVNYLTGLEFTEWSLRTGAEPSPAYFNILMHGKDPQVLDGTQLAA 411 P + F+GP+S GKS+ +NYL L TG +P F ++ H KD Q + G L A Sbjct: 65 PFITFLGPFSAGKSTFINYLLQGNL----LSTGPQPVTDRFTVISHAKDVQKIPGRVLMA 120 Query: 412 D--WTFSGLQKFGQGLEERLRGLKHPSKILEKVNIVE 516 D F GL +FG E L G+ HP IL+ V +++ Sbjct: 121 DSKQPFRGLNQFGGVFGEVLEGITHPHPILQSVTLID 157 >UniRef50_Q4QDJ3 Cluster: Sarcoplasmic reticulum glycoprotein, putative; n=3; Leishmania|Rep: Sarcoplasmic reticulum glycoprotein, putative - Leishmania major Length = 633 Score = 83.8 bits (198), Expect = 2e-15 Identities = 44/142 (30%), Positives = 76/142 (53%), Gaps = 2/142 (1%) Frame = +1 Query: 97 VERSADIVLRDLKRLYENSIKPLEGLYKYRDLSNRHFGDPEIFSKPLVLFMGPWSGGKSS 276 V S +++ L LY ++PLE +Y + + + + +P + GPWS GK++ Sbjct: 21 VPGSMGALIKKLHPLYTQRVRPLEEMYSFDVFRPSWYEETILNERPFITLFGPWSAGKTT 80 Query: 277 IVNYLTGLEFTEWSLRTGAEPSPAYFNILMHGKDPQVLDGTQLA--ADWTFSGLQKFGQG 450 +NYL L+ + L TG +P+ A F ++M+GK+P + G LA F GL FG+ Sbjct: 81 FINYL--LQSND--LWTGPQPTTAEFTVVMYGKEPGPVAGQALANSKHLPFKGLLDFGES 136 Query: 451 LEERLRGLKHPSKILEKVNIVE 516 L+G + P +LE+V +++ Sbjct: 137 FIRNLKGFQAPHALLERVTLID 158 >UniRef50_Q4DYK9 Cluster: Sarcoplasmic reticulum glycoprotein, putative; n=3; Trypanosoma|Rep: Sarcoplasmic reticulum glycoprotein, putative - Trypanosoma cruzi Length = 610 Score = 81.4 bits (192), Expect = 
1e-14 Identities = 44/149 (29%), Positives = 80/149 (53%), Gaps = 2/149 (1%) Frame = +1 Query: 76 ASEAEITVERSADIVLRDLKRLYENSIKPLEGLYKYRDLSNRHFGDPEIFSKPLVLFMGP 255 A+E+ D ++ L Y +KP+E +YKY F + + KP V F GP Sbjct: 85 ATESVAMEPEGLDELIEVLHTNYLKCVKPVEDMYKYDLFRPSWFEETILNQKPFVTFFGP 144 Query: 256 WSGGKSSIVNYLTGLEFTEWSLRTGAEPSPAYFNILMHGKDPQVLDGTQLAA--DWTFSG 429 WS GKS+ +N+L + L TG +P+ A F ++++G++ + G LA+ + F G Sbjct: 145 WSSGKSTFINHLLQDNY----LWTGPQPTTAEFTVVLYGEEVGPVSGHVLASAKNLPFKG 200 Query: 430 LQKFGQGLEERLRGLKHPSKILEKVNIVE 516 L +FG+ + +G + P ++L++V +++ Sbjct: 201 LTEFGESFLGKFQGYRVPHELLKRVTLID 229 >UniRef50_Q3EAA4 Cluster: Uncharacterized protein At4g05520.1; n=14; Magnoliophyta|Rep: Uncharacterized protein At4g05520.1 - Arabidopsis thaliana (Mouse-ear cress) Length = 546 Score = 79.4 bits (187), Expect = 4e-14 Identities = 39/135 (28%), Positives = 72/135 (53%), Gaps = 2/135 (1%) Frame = +1 Query: 118 VLRDLKRLYENSIKPLEGLYKYRDLSNRHFGDPEIFSKPLVLFMGPWSGGKSSIVNYLTG 297 ++ LKRLY +KPLE Y++ D ++ + +KP+V+ +G +S GK++ + +L G Sbjct: 160 IVDGLKRLYTEKLKPLEVTYRFNDFASPVLTSSDFDAKPMVMLLGQYSTGKTTFIKHLLG 219 Query: 298 LEFTEWSLRTGAEPSPAYFNILMHGKDPQVLDGTQLA--ADWTFSGLQKFGQGLEERLRG 471 ++ G EP+ F + M G D + + G +A AD F+GL FG + Sbjct: 220 CDYP--GAHIGPEPTTDRFVVAMSGPDERTIPGNTMAVQADMPFNGLTSFGGAFLSKFEC 277 Query: 472 LKHPSKILEKVNIVE 516 + P +L+++ +V+ Sbjct: 278 SQMPHPVLDQITLVD 292 >UniRef50_Q4RPD4 Cluster: Chromosome 1 SCAF15008, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome 1 SCAF15008, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 597 Score = 78.2 bits (184), Expect = 1e-13 Identities = 42/135 (31%), Positives = 69/135 (51%), Gaps = 2/135 (1%) Frame = +1 Query: 118 VLRDLKRLYENSIKPLEGLYKYRDLSNRHFGDPEIFSKPLVLFMGPWSGGKSSIVNYLTG 297 V L+RLY + PLE Y++ D + D + +KP+VL +G +S GK++ + +L Sbjct: 16 VSEGLRRLYRTKLFPLEDTYRFHDFHSPALEDADFDNKPMVLLVGQYSTGKTTFIRHLME 75 Query: 298 LEFTEWSLRTGAEPSPAYFNILMHGKDPQVLDGTQLAAD--WTFSGLQKFGQGLEERLRG 471 +F +R G EP+ 
F +MHG+ V+ G L D F L FG R Sbjct: 76 QDFP--GMRIGPEPTTDSFIAVMHGEQEGVIPGNALVVDPKKPFRKLNAFGNAFLNRFMC 133 Query: 472 LKHPSKILEKVNIVE 516 + P+ +LE ++I++ Sbjct: 134 AQMPNPVLESISIID 148 >UniRef50_Q4Q2X3 Cluster: Putative uncharacterized protein; n=3; Leishmania|Rep: Putative uncharacterized protein - Leishmania major Length = 541 Score = 78.2 bits (184), Expect = 1e-13 Identities = 48/151 (31%), Positives = 73/151 (48%), Gaps = 2/151 (1%) Frame = +1 Query: 70 EEASEAEITVERSADIVLRDLKRLYENSIKPLEGLYKYRDLSNRHFGDPEIFSKPLVLFM 249 +EA E + +R + L L Y + I+P+E Y Y F + PLV F+ Sbjct: 5 QEALEKKEAWDRHLESTLLQLSHFYTSRIEPVEASYNYNVFRPTWFAESIKQKMPLVTFL 64 Query: 250 GPWSGGKSSIVNYLTGLEFTEWSLRTGAEPSPAYFNILMHGKDPQVLDGTQLAAD--WTF 423 GP+S GKSS +NYL ++ L TG +P F ++M+G+ Q + G L AD + Sbjct: 65 GPFSSGKSSFINYLLQGDY----LMTGPQPVTDKFTVVMYGEQFQQIPGRVLMADSRLPY 120 Query: 424 SGLQKFGQGLEERLRGLKHPSKILEKVNIVE 516 L +FG E G+ P IL V+ ++ Sbjct: 121 RCLSQFGDAFAEFFAGVVAPHPILRSVSFID 151 >UniRef50_Q9NZN3 Cluster: EH domain-containing protein 3; n=136; Eukaryota|Rep: EH domain-containing protein 3 - Homo sapiens (Human) Length = 546 Score = 71.3 bits (167), Expect = 1e-11 Identities = 39/135 (28%), Positives = 69/135 (51%), Gaps = 2/135 (1%) Frame = +1 Query: 118 VLRDLKRLYENSIKPLEGLYKYRDLSNRHFGDPEIFSKPLVLFMGPWSGGKSSIVNYLTG 297 V LK+LY++ + PLE Y++ + + D + +KP+VL +G +S GK++ + YL Sbjct: 21 VSEGLKKLYKSKLLPLEEHYRFHEFHSPALEDADFDNKPMVLLVGQYSTGKTTFIRYLLE 80 Query: 298 LEFTEWSLRTGAEPSPAYFNILMHGKDPQVLDGTQLAAD--WTFSGLQKFGQGLEERLRG 471 +F +R G EP+ F +M G ++ G L D F L FG R Sbjct: 81 QDFP--GMRIGPEPTTDSFIAVMQGDMEGIIPGNALVVDPKKPFRKLNAFGNAFLNRFVC 138 Query: 472 LKHPSKILEKVNIVE 516 + P+ +LE +++++ Sbjct: 139 AQLPNPVLESISVID 153 >UniRef50_UPI0000EBE67E Cluster: PREDICTED: similar to EHD1 protein; n=1; Bos taurus|Rep: PREDICTED: similar to EHD1 protein - Bos taurus Length = 678 Score = 70.5 bits (165), Expect = 2e-11 Identities = 39/135 (28%), Positives = 68/135 (50%), Gaps = 2/135 (1%) Frame = +1 Query: 118 
VLRDLKRLYENSIKPLEGLYKYRDLSNRHFGDPEIFSKPLVLFMGPWSGGKSSIVNYLTG 297 V L++LY + PLE Y++ + + D + +KP+VL +G +S GK++ + +L Sbjct: 516 VSEGLRQLYAQKLLPLEEHYRFHEFHSPALEDADFDNKPMVLLVGQYSTGKTTFIRHLIE 575 Query: 298 LEFTEWSLRTGAEPSPAYFNILMHGKDPQVLDGTQLAAD--WTFSGLQKFGQGLEERLRG 471 +F +R G EP+ F +MHG V+ G L D F L FG R Sbjct: 576 QDFP--GMRIGPEPTTDSFIAVMHGPTEGVVPGNALVVDPRRPFRKLNAFGNAFLNRFMC 633 Query: 472 LKHPSKILEKVNIVE 516 + P+ +L+ ++I++ Sbjct: 634 AQLPNPVLDSISIID 648 >UniRef50_UPI0000498DD7 Cluster: EH-domain containing protein; n=3; Entamoeba histolytica HM-1:IMSS|Rep: EH-domain containing protein - Entamoeba histolytica HM-1:IMSS Length = 508 Score = 68.9 bits (161), Expect = 6e-11 Identities = 40/142 (28%), Positives = 71/142 (50%), Gaps = 2/142 (1%) Frame = +1 Query: 97 VERSADIVLRDLKRLYENSIKPLEGLYKYRDLSNRHFGDPEIFSKPLVLFMGPWSGGKSS 276 ++ S V+ +K++Y+ +K LE YKY L + + +KP+VLF+G +S GK++ Sbjct: 12 IDESYTSVIDGIKKIYDTKLKKLETDYKYDYLISPTMRPADFDAKPMVLFLGQYSTGKTT 71 Query: 277 IVNYLTGLEFTEWSLRTGAEPSPAYFNILMHGKDPQVLDGTQLA--ADWTFSGLQKFGQG 450 +NYL ++ ++ G EP+ F +MHG + G L D F+ L +FG Sbjct: 72 FINYLLNYDYPGSNI--GPEPTTDGFAAIMHGPTNGNVPGNTLCVQTDKPFTNLARFGND 129 Query: 451 LEERLRGLKHPSKILEKVNIVE 516 + G +LE + ++ Sbjct: 130 FMAKFSGAYCNLPLLEHMTFID 151 >UniRef50_Q94919 Cluster: PAST-1; n=3; Coelomata|Rep: PAST-1 - Drosophila melanogaster (Fruit fly) Length = 496 Score = 68.5 bits (160), Expect = 8e-11 Identities = 39/145 (26%), Positives = 71/145 (48%), Gaps = 2/145 (1%) Frame = +1 Query: 88 EITVERSADIVLRDLKRLYENSIKPLEGLYKYRDLSNRHFGDPEIFSKPLVLFMGPWSGG 267 E + + V+ +LK++Y + + PLE Y++ D + DP+ + P++L +G +S G Sbjct: 8 EKNTQEVVENVIGELKKIYRSKLLPLEEHYQFHDFHSPKLEDPDFDANPVILLVGLYSTG 67 Query: 268 KSSIVNYLTGLEFTEWSLRTGAEPSPAYFNILMHGKDPQVLDGTQLAAD--WTFSGLQKF 441 K++ + YL +F +R G EP+ F +M+ V+ G L D F L K+ Sbjct: 68 KTTFIRYLLERDFP--GIRIGPEPTTDRFIAVMYDDKEGVIPGNALVVDPNKQFRPLSKY 125 Query: 442 GQGLEERLRGLKHPSKILEKVNIVE 516 G R + S +L ++ V+ Sbjct: 126 GNAFLNRFQCSSVASPVLNAISNVD 150 >UniRef50_O96909 Cluster: 
Sarcalumenin/eps15 homolog; n=7; Plasmodium|Rep: Sarcalumenin/eps15 homolog - Plasmodium falciparum Length = 529 Score = 67.3 bits (157), Expect = 2e-10 Identities = 41/137 (29%), Positives = 68/137 (49%), Gaps = 2/137 (1%) Frame = +1 Query: 112 DIVLRDLKRLYENSIKPLEGLYKYRDLSNRHFGDPEIFSKPLVLFMGPWSGGKSSIVNYL 291 D VL L LY+ I LE + Y + SKP++L +G +S GK++ + +L Sbjct: 20 DNVLEGLYSLYKTYILDLEKEFMYYHFYKPLLTSGDFLSKPMILLLGQYSTGKTTFIKHL 79 Query: 292 TGLEFTEWSLRTGAEPSPAYFNILMHGKDPQVLDGTQLAADWT--FSGLQKFGQGLEERL 465 E+ +R G EP+ F +M+ + Q++ G L +D T FS L+ FG +L Sbjct: 80 IEKEYC--GMRIGPEPTTDKFVAVMYNEKEQLIPGNALVSDITKPFSQLESFGNSFLSKL 137 Query: 466 RGLKHPSKILEKVNIVE 516 S++L+ V I++ Sbjct: 138 ECSNTSSEVLKSVTIID 154 >UniRef50_UPI000065D43F Cluster: EH domain-containing protein 3.; n=1; Takifugu rubripes|Rep: EH domain-containing protein 3. - Takifugu rubripes Length = 587 Score = 61.7 bits (143), Expect = 1e-08 Identities = 43/145 (29%), Positives = 68/145 (46%), Gaps = 16/145 (11%) Frame = +1 Query: 130 LKRLYENSIKPLEGLYKYRDLSNRHFGDPEIFSKPLVLFMGPWSGGKSSIV--------- 282 LK+LY+ + PLE YK+ + + D + +KP+VL +G +S GK+S + Sbjct: 25 LKKLYKTKLLPLEENYKFHEFHSPALEDADFDNKPMVLLVGQYSTGKTSFIRVFVKHLIS 84 Query: 283 ---NYLTG--LEFTEWSLRTGAEPSPAYFNILMHGKDPQVLDGTQLAAD--WTFSGLQKF 441 N L LE +R G EP+ F +MHG V+ G L D F L F Sbjct: 85 CPCNLLNSYLLEQDFPGMRIGPEPTTDSFIAVMHGDTEGVIPGNALVVDPKKPFRKLNAF 144 Query: 442 GQGLEERLRGLKHPSKILEKVNIVE 516 G R + P+ +LE +++++ Sbjct: 145 GNAFLNRFVCAQLPNPVLESISVID 169 >UniRef50_UPI0000E81040 Cluster: PREDICTED: sarcalumenin; n=1; Gallus gallus|Rep: PREDICTED: sarcalumenin - Gallus gallus Length = 725 Score = 55.6 bits (128), Expect = 6e-07 Identities = 29/74 (39%), Positives = 43/74 (58%) Frame = +1 Query: 31 RSREHINQILRLDEEASEAEITVERSADIVLRDLKRLYENSIKPLEGLYKYRDLSNRHFG 210 R R H+ L+L+E+ + + VL+ L+++Y SIKPLE Y+Y +L Sbjct: 338 RDRSHLENTLKLNEDKPADDFSG------VLQRLRKIYHASIKPLEQSYRYNELRQHEIT 391 Query: 211 DPEIFSKPLVLFMG 252 D EI SKP+VLF+G Sbjct: 392 DGEITSKPMVLFLG 405 >UniRef50_UPI000065D539 
Cluster: EH domain-containing protein 2.; n=2; Takifugu rubripes|Rep: EH domain-containing protein 2. - Takifugu rubripes Length = 605 Score = 51.6 bits (118), Expect = 1e-05 Identities = 41/158 (25%), Positives = 68/158 (43%), Gaps = 25/158 (15%) Frame = +1 Query: 118 VLRDLKRLYENSIKPLEGLYKYRDLSNRHFGDPEIFSKPLVLFMGPWSGGKSSIVNYLTG 297 V LK LY + PLE Y + D + D + +KP+VL +G +S GK++ + T Sbjct: 6 VTEGLKSLYRKKLLPLEQYYGFHDFHSPSLEDADFDNKPMVLVVGQYSTGKTTFIK-TTH 64 Query: 298 LEFTEWSL-----------------------RTGAEPSPAYFNILMHGKDPQVLDGTQLA 408 + F ++ R G EP+ F +MHG+ V+ G L Sbjct: 65 VSFHSSAVEKNQIHTFCVCDRYLLEQDIPGSRVGPEPTTDCFTAIMHGEVESVIPGNALI 124 Query: 409 AD--WTFSGLQKFGQGLEERLRGLKHPSKILEKVNIVE 516 D F L FG R + + +++LE ++I++ Sbjct: 125 VDPNKPFRKLNPFGNTFLNRFQCAQMSNQVLESISIID 162 >UniRef50_Q259P6 Cluster: H0818H01.5 protein; n=3; Oryza sativa|Rep: H0818H01.5 protein - Oryza sativa (Rice) Length = 555 Score = 47.6 bits (108), Expect = 2e-04 Identities = 23/89 (25%), Positives = 45/89 (50%) Frame = +1 Query: 103 RSADIVLRDLKRLYENSIKPLEGLYKYRDLSNRHFGDPEIFSKPLVLFMGPWSGGKSSIV 282 RS ++ LK+ Y ++PLE Y++ D + + +KP+V+ +G +S GK++ + Sbjct: 122 RSVTSIIDGLKKSYIEKLRPLEKTYQFDDFVSPLLTSSDFDAKPMVMLLGQYSTGKTTFI 181 Query: 283 NYLTGLEFTEWSLRTGAEPSPAYFNILMH 369 +L + G EP+ F ++ H Sbjct: 182 KHLLKTNYP--GAHIGPEPTTDRFVVITH 208 >UniRef50_Q57XN3 Cluster: Putative uncharacterized protein; n=1; Trypanosoma brucei|Rep: Putative uncharacterized protein - Trypanosoma brucei Length = 600 Score = 47.2 bits (107), Expect = 2e-04 Identities = 25/74 (33%), Positives = 42/74 (56%), Gaps = 2/74 (2%) Frame = +1 Query: 229 KPLVLFMGPWSGGKSSIVNYLTGLEFTEWSLRTGAEPSPAYFNILMHGKDPQVLDGTQLA 408 +P+V+ +G S GKS+++N L G+E R+G P+ F ++ G+D DG Sbjct: 128 QPMVIVLGNHSAGKSTMINRLLGIELQ----RSGVSPTDDGFTVIQSGEDDITEDGPTAV 183 Query: 409 AD--WTFSGLQKFG 444 +D ++F L+KFG Sbjct: 184 SDPRYSFQELRKFG 197 >UniRef50_Q4QBU8 Cluster: Putative uncharacterized protein; n=3; Leishmania|Rep: Putative uncharacterized protein - Leishmania major Length = 444 Score 
= 45.6 bits (103), Expect = 7e-04 Identities = 24/72 (33%), Positives = 40/72 (55%), Gaps = 2/72 (2%) Frame = +1 Query: 235 LVLFMGPWSGGKSSIVNYLTGLEFTEWSLRTGAEPSPAYFNILMHGKDPQVLDGTQLAAD 414 +V+F+G S GKS+++NYL+G E E TG P+ F ++ G DG + ++ Sbjct: 1 MVMFLGNHSSGKSTLINYLSGCEVQE----TGVAPTDDGFTVIKRGAYDMDADGPSVVSN 56 Query: 415 --WTFSGLQKFG 444 + + LQ+FG Sbjct: 57 PKYQYQSLQQFG 68 >UniRef50_Q4T253 Cluster: Chromosome undetermined SCAF10336, whole genome shotgun sequence; n=3; Tetraodontidae|Rep: Chromosome undetermined SCAF10336, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 648 Score = 44.8 bits (101), Expect = 0.001 Identities = 39/160 (24%), Positives = 68/160 (42%), Gaps = 27/160 (16%) Frame = +1 Query: 118 VLRDLKRLYENSIKPLEGLYKYRDLSNRHFGDPEIFSKPLVLFMGPWSGGKSSIVN---- 285 V L+ LY + PLE Y + D + + SKP+VL +G +S GK++ ++ Sbjct: 22 VTEGLQALYAKKLLPLEDAYLFHDFHSPALEAADFQSKPMVLLVGQYSTGKTTFISTPLI 81 Query: 286 ---------------------YLTGLEFTEWSLRTGAEPSPAYFNILMHGKDPQVLDGTQ 402 YL +F +R G EP+ F +M+G + ++ G Sbjct: 82 FPAACRCRGDGHPPVPQQEHGYLLEQDFP--GMRIGPEPTTDGFIAVMYGDNEGIVPGNA 139 Query: 403 LAAD--WTFSGLQKFGQGLEERLRGLKHPSKILEKVNIVE 516 L D F L FG R + P+++L+ ++I++ Sbjct: 140 LVVDPKKPFRKLNVFGNSFLNRFICSQMPNQVLQSISIID 179 >UniRef50_Q4DH62 Cluster: Putative uncharacterized protein; n=2; Trypanosoma cruzi|Rep: Putative uncharacterized protein - Trypanosoma cruzi Length = 673 Score = 42.3 bits (95), Expect = 0.006 Identities = 27/80 (33%), Positives = 37/80 (46%), Gaps = 2/80 (2%) Frame = +1 Query: 235 LVLFMGPWSGGKSSIVNYLTGLEFTEWSLRTGAEPSPAYFNILMHGKDPQVLDGTQLAAD 414 +V+ +G S GKS+I+NYL G RTG P+ F I+ G DG +D Sbjct: 205 MVMLLGNHSAGKSTIINYLLGRAVQ----RTGVAPTDDGFTIIQRGDRDSEEDGPTSLSD 260 Query: 415 --WTFSGLQKFGQGLEERLR 468 + LQKFG R + Sbjct: 261 PRYQLQDLQKFGMHFVHRFK 280 >UniRef50_UPI0000F21E62 Cluster: PREDICTED: hypothetical protein, partial; n=1; Danio rerio|Rep: PREDICTED: hypothetical protein, partial - Danio rerio Length = 104 Score = 41.5 bits (93), Expect = 0.011 
Identities = 23/57 (40%), Positives = 34/57 (59%), Gaps = 2/57 (3%) Frame = +1 Query: 28 LRSREHINQILRL--DEEASEAEITVERSADIVLRDLKRLYENSIKPLEGLYKYRDL 192 LR R HI + LRL DE+A + ++R L+++Y NSI+P+E YKY +L Sbjct: 50 LRDRSHIEETLRLAADEKAGDYAAALQR--------LRKIYHNSIRPMEQAYKYNEL 98 >UniRef50_A4J1X0 Cluster: Transcriptional regulator, XRE family; n=1; Desulfotomaculum reducens MI-1|Rep: Transcriptional regulator, XRE family - Desulfotomaculum reducens MI-1 Length = 738 Score = 41.1 bits (92), Expect = 0.015 Identities = 19/38 (50%), Positives = 25/38 (65%), Gaps = 2/38 (5%) Frame = +1 Query: 226 SKPLVLFMGPWSGGKSSIVNYLTGLE--FTEWSLRTGA 333 SKPLV F+GP GKS ++N LTGL+ +W+ T A Sbjct: 126 SKPLVAFLGPSDAGKSRLINILTGLDVLLAQWTPTTAA 163 >UniRef50_A1I7P4 Cluster: Putative uncharacterized protein; n=1; Candidatus Desulfococcus oleovorans Hxd3|Rep: Putative uncharacterized protein - Candidatus Desulfococcus oleovorans Hxd3 Length = 494 Score = 40.3 bits (90), Expect = 0.025 Identities = 29/97 (29%), Positives = 46/97 (47%) Frame = +1 Query: 73 EASEAEITVERSADIVLRDLKRLYENSIKPLEGLYKYRDLSNRHFGDPEIFSKPLVLFMG 252 E S+A + E + +L L EN + P+ Y Y D + +I +PLVL +G Sbjct: 5 EKSDASMYTENYIRSLQAELLELVENRMTPIALRYGYSDSPL----ESKIKWRPLVLIIG 60 Query: 253 PWSGGKSSIVNYLTGLEFTEWSLRTGAEPSPAYFNIL 363 +S GKS+++N G + TG P+ F +L Sbjct: 61 NYSSGKSTLINDFLGADIQ----ATGQAPTDDSFTVL 93 >UniRef50_Q1EY64 Cluster: Rubrerythrin; n=1; Clostridium oremlandii OhILAs|Rep: Rubrerythrin - Clostridium oremlandii OhILAs Length = 165 Score = 37.1 bits (82), Expect = 0.24 Identities = 25/76 (32%), Positives = 39/76 (51%), Gaps = 3/76 (3%) Frame = +1 Query: 13 EIPENLRSREHINQILRLDEEASEAEITVERSADIVLRDLKRLYENSIKPLE-GLYK--Y 183 EI +N +EH+ + L+ E+ +IT+ R A ++ RD K YEN+ + E + K + Sbjct: 77 EIFDNRAEKEHVEEALK---ESDIPDITILRMAYLIERDYKEFYENAAQNAEDSIIKELF 133 Query: 184 RDLSNRHFGDPEIFSK 231 LS G IF K Sbjct: 134 ETLSKWEAGHESIFKK 149 >UniRef50_O62361 Cluster: Putative uncharacterized protein fbxa-135; n=1; Caenorhabditis elegans|Rep: Putative uncharacterized 
protein fbxa-135 - Caenorhabditis elegans Length = 345 Score = 36.7 bits (81), Expect = 0.31 Identities = 23/55 (41%), Positives = 31/55 (56%), Gaps = 1/55 (1%) Frame = +1 Query: 79 SEAEITVERSADIVLRDLKRLY-ENSIKPLEGLYKYRDLSNRHFGDPEIFSKPLV 240 SEA IT+++ + L+ LK+ + NSI P YR SN H EIF KPL+ Sbjct: 234 SEARITLDKISLDNLKFLKQHFTSNSINPFNIAISYRQRSNEHI--DEIFGKPLI 286 >UniRef50_A0CC98 Cluster: Chromosome undetermined scaffold_167, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_167, whole genome shotgun sequence - Paramecium tetraurelia Length = 569 Score = 35.9 bits (79), Expect = 0.55 Identities = 30/98 (30%), Positives = 48/98 (48%), Gaps = 5/98 (5%) Frame = +1 Query: 40 EHINQILRLDEEASEAEITVERSADIVLRDL--KRLYENSIKPLEGLY--KYRDLSNRHF 207 E++N+I +L + + VE + +L++L + Y N IK + L K DL F Sbjct: 62 ENMNKIKQLSDL-----LIVEEMQETILKNLVLEASYTNLIKKSKTLQVIKVMDLLKESF 116 Query: 208 GD-PEIFSKPLVLFMGPWSGGKSSIVNYLTGLEFTEWS 318 + S LV F+G GKS+ +NY G E E++ Sbjct: 117 DHIKSVNSMDLVFFLGGTGSGKSTSINYYLGHELEEYN 154 >UniRef50_Q9ZVE3 Cluster: Putative uncharacterized protein At2g06860; n=1; Arabidopsis thaliana|Rep: Putative uncharacterized protein At2g06860 - Arabidopsis thaliana (Mouse-ear cress) Length = 938 Score = 35.5 bits (78), Expect = 0.72 Identities = 21/58 (36%), Positives = 28/58 (48%) Frame = +3 Query: 342 ARLL*YPNAWKRSSSFGRYATGRRLDVLWLAEVWSRIGGATQGLETPQQNIRKGEHCG 515 ARLL +PN S+SFG + GR+L+V E+W G + I G CG Sbjct: 52 ARLLEFPNNPAWSASFGIFILGRQLEVTKPNEIWVLFAGTPIRFSLREFKIVTGLPCG 109 >UniRef50_UPI0000D56BD7 Cluster: PREDICTED: similar to 5-azacytidine-induced protein 1 (Pre-acrosome localization protein 1); n=5; Tribolium castaneum|Rep: PREDICTED: similar to 5-azacytidine-induced protein 1 (Pre-acrosome localization protein 1) - Tribolium castaneum Length = 773 Score = 35.1 bits (77), Expect = 0.96 Identities = 19/46 (41%), Positives = 29/46 (63%) Frame = +1 Query: 31 RSREHINQILRLDEEASEAEITVERSADIVLRDLKRLYENSIKPLE 168 R RE I RL++EAS+ ++ +E+S++ +R LK YE 
I LE Sbjct: 594 RDREIEAVIERLEKEASDNKLQIEQSSENRIRRLKEKYEKEILDLE 639 Score = 33.5 bits (73), Expect = 2.9 Identities = 18/46 (39%), Positives = 29/46 (63%) Frame = +1 Query: 31 RSREHINQILRLDEEASEAEITVERSADIVLRDLKRLYENSIKPLE 168 R R+ I RL++EAS+ ++ +E+S++ +R LK YE I LE Sbjct: 168 RDRKIEAVIERLEKEASDNKLQIEQSSENRIRRLKEKYEKEILDLE 213 >UniRef50_Q9KBP6 Cluster: Transcriptional regulator; n=1; Bacillus halodurans|Rep: Transcriptional regulator - Bacillus halodurans Length = 555 Score = 34.7 bits (76), Expect = 1.3 Identities = 21/64 (32%), Positives = 30/64 (46%), Gaps = 2/64 (3%) Frame = +1 Query: 175 YKYRDLSNRHFGDPEIFSKPL--VLFMGPWSGGKSSIVNYLTGLEFTEWSLRTGAEPSPA 348 Y + LSN+ P+ FSK ++F PW G IVN + + + E A+ PA Sbjct: 429 YFMKRLSNKEKKQPKYFSKDAYDLIFRYPWPGNVREIVNMVEHVIYLETGDLITADSLPA 488 Query: 349 YFNI 360 Y I Sbjct: 489 YLKI 492 >UniRef50_A5Z8L6 Cluster: Putative uncharacterized protein; n=1; Eubacterium ventriosum ATCC 27560|Rep: Putative uncharacterized protein - Eubacterium ventriosum ATCC 27560 Length = 246 Score = 34.7 bits (76), Expect = 1.3 Identities = 31/128 (24%), Positives = 60/128 (46%), Gaps = 5/128 (3%) Frame = +1 Query: 148 NSIKPLEGLYKYRDLSNR---HFGDPEIFSKPLVLFMGPWSGGKSSIVNYLTGLEFTEWS 318 N + + L K D S+R + D I + + +GP GKS+++ + G E E S Sbjct: 3 NEVLKISNLVKKYDNSDRVILNNLDLTINDEDFLCILGPSGCGKSTLIRCIAGFEDYEGS 62 Query: 319 LRTGAEP--SPAYFNILMHGKDPQVLDGTQLAADWTFSGLQKFGQGLEERLRGLKHPSKI 492 ++ +P P I++ Q+ + + T++ L+ G++++ + K Sbjct: 63 IKVDGQPVVKPGPDRIMVFQDFNQLFPWKTVLKNITYA-LKV--NGMKDKAEREQKAKKY 119 Query: 493 LEKVNIVE 516 LEKVN+V+ Sbjct: 120 LEKVNLVQ 127 >UniRef50_Q0DBA9 Cluster: Os06g0585900 protein; n=21; Oryza sativa|Rep: Os06g0585900 protein - Oryza sativa subsp. 
japonica (Rice) Length = 1948 Score = 34.7 bits (76), Expect = 1.3 Identities = 22/73 (30%), Positives = 39/73 (53%), Gaps = 5/73 (6%) Frame = +1 Query: 4 LTGEIPENLRSREHINQILRLDEEAS-EAEITVERSADIVLRDLK-RLYENSIKPLEGL- 174 LTGEIPE+L S + + ++ ++ S + + + + ++ DLK + SI P+ + Sbjct: 185 LTGEIPESLASSKSLQVLVLMNNALSGQLPVALFNCSSLIDLDLKHNSFLGSIPPITAIS 244 Query: 175 --YKYRDLSNRHF 207 KY DL + HF Sbjct: 245 LQMKYLDLEDNHF 257 >UniRef50_Q3SDK7 Cluster: Rab_C86 protein; n=2; Paramecium tetraurelia|Rep: Rab_C86 protein - Paramecium tetraurelia Length = 308 Score = 34.7 bits (76), Expect = 1.3 Identities = 19/51 (37%), Positives = 30/51 (58%), Gaps = 1/51 (1%) Frame = +1 Query: 211 DPEIFSKPL-VLFMGPWSGGKSSIVNYLTGLEFTEWSLRTGAEPSPAYFNI 360 D ++ KP VLF+G + GKSS++N + G + E S +TG+ + NI Sbjct: 68 DQQMLKKPKEVLFVGRSNVGKSSLINAILGQKVAETSSKTGSTLRLQFHNI 118 >UniRef50_Q3IVV0 Cluster: Putative uncharacterized protein; n=2; Rhodobacter sphaeroides|Rep: Putative uncharacterized protein - Rhodobacter sphaeroides (strain ATCC 17023 / 2.4.1 / NCIB 8253 / DSM158) Length = 569 Score = 34.3 bits (75), Expect = 1.7 Identities = 30/99 (30%), Positives = 45/99 (45%), Gaps = 2/99 (2%) Frame = +1 Query: 226 SKPLVLFMGPWSGGKSSIVNYLTGLEFTEWSLRTGAEPSPAYFNILMHGKDPQVLDGT-Q 402 ++P +L G +S GK+S+VN L G + S+ + A P P + D LDGT Q Sbjct: 20 ARPRILVAGEFSSGKTSLVNALLGEDLLPASVTSTALP-PIWIRHGEGAPDCLFLDGTVQ 78 Query: 403 LAADWTFSGLQKFGQGLEERLR-GLKHPSKILEKVNIVE 516 A G LE L HPS +L ++++ Sbjct: 79 RFASLAEMLAHLDGTDLERISHCRLAHPSPLLRAFDLID 117 >UniRef50_A1FWD0 Cluster: Putative uncharacterized protein; n=1; Stenotrophomonas maltophilia R551-3|Rep: Putative uncharacterized protein - Stenotrophomonas maltophilia R551-3 Length = 621 Score = 33.9 bits (74), Expect = 2.2 Identities = 17/47 (36%), Positives = 27/47 (57%) Frame = +1 Query: 211 DPEIFSKPLVLFMGPWSGGKSSIVNYLTGLEFTEWSLRTGAEPSPAY 351 +P+IF + +V +G +S GKS+ +N G +F S T A P+Y Sbjct: 287 NPDIFRRTVVGVVGGFSSGKSAFINSFLGKDFKLLSSVTPATVIPSY 333 >UniRef50_Q9LP19 Cluster: F14D7.7 protein; n=2; 
Arabidopsis thaliana|Rep: F14D7.7 protein - Arabidopsis thaliana (Mouse-ear cress)
          Length = 996

 Score = 33.9 bits (74), Expect = 2.2
 Identities = 17/39 (43%), Positives = 23/39 (58%)
 Frame = +3

Query: 342 ARLL*YPNAWKRSSSFGRYATGRRLDVLWLAEVWSRIGG 458
           ARLL +PN    S+SFG +  GR+L+V    E+W    G
Sbjct: 52  ARLLEFPNNHAWSASFGIFILGRQLEVTKPNEIWVLFAG 90

>UniRef50_Q64DK1 Cluster: Putative uncharacterized protein; n=1; uncultured archaeon GZfos18C8|Rep: Putative uncharacterized protein - uncultured archaeon GZfos18C8
          Length = 660

 Score = 33.9 bits (74), Expect = 2.2
 Identities = 21/51 (41%), Positives = 27/51 (52%), Gaps = 3/51 (5%)
 Frame = +1

Query: 178 KYRDLSNRHFG---DPEIFSKPLVLFMGPWSGGKSSIVNYLTGLEFTEWSL 321
           KYR L N   G    P + S+ +VL  GP  GGKSSI+  ++      WSL
Sbjct: 10  KYRHLENVKLGPFSSPSVSSEMVVL-AGPNGGGKSSILELISSALSNAWSL 59

>UniRef50_A3DGE1 Cluster: ABC transporter related protein; n=2; Clostridium|Rep: ABC transporter related protein - Clostridium thermocellum (strain ATCC 27405 / DSM 1237)
          Length = 638

 Score = 33.5 bits (73), Expect = 2.9
 Identities = 19/55 (34%), Positives = 29/55 (52%), Gaps = 1/55 (1%)
 Frame = +1

Query: 220 IFSKPLVLFMGPWSGGKSSIVNYLTG-LEFTEWSLRTGAEPSPAYFNILMHGKDP 381
           I  K  V  +GP   GKS+++  LTG ++  E S R G   +P Y++    G +P
Sbjct: 352 IRKKERVFILGPNGCGKSTLLKILTGKIDDYEGSFRYGHNVNPGYYDQEQEGLNP 406

>UniRef50_Q9XE96 Cluster: Putative uncharacterized protein T19B17.11; n=2; Arabidopsis thaliana|Rep: Putative uncharacterized protein T19B17.11 - Arabidopsis thaliana (Mouse-ear cress)
          Length = 963

 Score = 33.5 bits (73), Expect = 2.9
 Identities = 18/56 (32%), Positives = 27/56 (48%)
 Frame = +3

Query: 348 LL*YPNAWKRSSSFGRYATGRRLDVLWLAEVWSRIGGATQGLETPQQNIRKGEHCG 515
           LL +PN    S++FG Y  GR+LD+    E+W    G        + ++  G  CG
Sbjct: 54  LLDFPNKPAWSTTFGLYLLGRKLDIDKANEIWVVYAGTPVRFSLREFHLVTGLACG 109

>UniRef50_UPI00015B46B2 Cluster: PREDICTED: similar to ankyrin repeat protein, putative; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to ankyrin repeat protein, putative - Nasonia vitripennis
          Length = 2036

 Score = 33.1 bits (72), Expect = 3.9
 Identities = 16/48 (33%), Positives = 33/48 (68%)
 Frame = +1

Query: 7    TGEIPENLRSREHINQILRLDEEASEAEITVERSADIVLRDLKRLYEN 150
            +G+ PENL S ++I+++L + E+     + VE++   +++DLKR+ +N
Sbjct: 1435 SGKSPENLSSDKNISKVLNVIEKLF---LKVEKNDSTLIKDLKRIDDN 1479

>UniRef50_Q9SSD1 Cluster: Protein TOO MANY MOUTHS precursor; n=2; Arabidopsis thaliana|Rep: Protein TOO MANY MOUTHS precursor - Arabidopsis thaliana (Mouse-ear cress)
          Length = 496

 Score = 33.1 bits (72), Expect = 3.9
 Identities = 18/50 (36%), Positives = 28/50 (56%)
 Frame = +1

Query: 4   LTGEIPENLRSREHINQILRLDEEASEAEITVERSADIVLRDLKRLYENS 153
           LTGEIP   R  +H+++ LRL++ +    +  ER     +R   RLY N+
Sbjct: 363 LTGEIPLEFRDVKHLSE-LRLNDNSLTGPVPFERDTVWRMRRKLRLYNNA 411

>UniRef50_Q5R050 Cluster: Sensor protein; n=2; Idiomarina|Rep: Sensor protein - Idiomarina loihiensis
          Length = 539

 Score = 32.7 bits (71), Expect = 5.1
 Identities = 17/52 (32%), Positives = 29/52 (55%)
 Frame = +1

Query: 31  RSREHINQILRLDEEASEAEITVERSADIVLRDLKRLYENSIKPLEGLYKYR 186
           R ++HI++  R +EEAS+    +ER      + LK      +K LE L++Y+
Sbjct: 233 RVQQHISRQRRAEEEASQLNTELERQVTQRTQALKESNSELLKTLEQLHQYQ 284

>UniRef50_Q30NU3 Cluster: ABC transporter-related protein; n=1; Thiomicrospira denitrificans ATCC 33889|Rep: ABC transporter-related protein - Thiomicrospira denitrificans (strain ATCC 33889 / DSM 1351)
          Length = 254

 Score = 32.7 bits (71), Expect = 5.1
 Identities = 17/38 (44%), Positives = 23/38 (60%)
 Frame = +1

Query: 190 LSNRHFGDPEIFSKPLVLFMGPWSGGKSSIVNYLTGLE 303
           LSN  F   EIFS   +  +GP  GGK++++  L GLE
Sbjct: 23  LSNISF---EIFSSEYIAIIGPNGGGKTTLIRMLLGLE 57

>UniRef50_A2E2M8 Cluster: Putative uncharacterized protein; n=1; Trichomonas vaginalis G3|Rep: Putative uncharacterized protein - Trichomonas vaginalis G3
          Length = 752

 Score = 32.7 bits (71), Expect = 5.1
 Identities = 14/51 (27%), Positives = 29/51 (56%)
 Frame = +1

Query: 1   LLTGEIPENLRSREHINQILRLDEEASEAEITVERSADIVLRDLKRLYENS 153
           L   E+  +  +  HI ++ +L EE      + ++S+D ++R+L+R  +NS
Sbjct: 527 LSEAEVRNSQSADHHIQELKQLQEELDHLRTSFKQSSDTIIRELERERQNS 577

>UniRef50_UPI0001555AA6 Cluster: PREDICTED: similar to hepatocellular carcinoma-associated protein HCA11; n=1; Ornithorhynchus anatinus|Rep: PREDICTED: similar to hepatocellular carcinoma-associated protein HCA11 - Ornithorhynchus anatinus
          Length = 626

 Score = 32.3 bits (70), Expect = 6.8
 Identities = 21/81 (25%), Positives = 37/81 (45%), Gaps = 2/81 (2%)
 Frame = +1

Query: 280 VNYLTGLEFTEWSLRTGAEPSPAYFNILMHGKDPQVLDGTQLAAD--WTFSGLQKFGQGLEERLRGLKHPSKILEKVNIVE 516
           + YL   +F    +R G EP+   F  +M+G       G  L  D    F  L +FG     R    + P+++L+ ++I++
Sbjct: 26  LRYLLEQDFP--GMRIGPEPTTDSFIAVMYGDTEGSTPGNALVVDPKKPFRKLSRFGNAFLNRFMCSQLPNQVLKSISIID 104

>UniRef50_Q2LTL3 Cluster: Metal-dependent phosphohydrolase; n=1; Syntrophus aciditrophicus SB|Rep: Metal-dependent phosphohydrolase - Syntrophus aciditrophicus (strain SB)
          Length = 782

 Score = 32.3 bits (70), Expect = 6.8
 Identities = 25/91 (27%), Positives = 39/91 (42%), Gaps = 1/91 (1%)
 Frame = +1

Query: 211 DPEIFSKPLVLFMGPWSGGKSSIVNYLTGLEFTEWSLRTGAEPSPAYFNILMHGKDPQV-LDGTQLAADWTFSGLQKFGQGLEERLRGLKH 480
           DP I S P   F  P    ++     +   +  E S RT + P+PA  + L+H +  +V +DG     + T   L K  +     L G+ H
Sbjct: 668 DPSISSLPETDFRYPGPKPQTKEAGLVLLGDVLEASSRTLSNPTPARISSLVHDRIEKVFMDGQLDECELTMRDLSKIAEIFNRILNGIFH 758

>UniRef50_A0BCY4 Cluster: Chromosome undetermined scaffold_10, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_10, whole genome shotgun sequence - Paramecium tetraurelia
          Length = 470

 Score = 32.3 bits (70), Expect = 6.8
 Identities = 14/26 (53%), Positives = 19/26 (73%)
 Frame = +1

Query: 235 LVLFMGPWSGGKSSIVNYLTGLEFTE 312
           L+ F+G  + GK+SI+ YL GLEF E
Sbjct: 5   LIAFVGDPAVGKTSIIKYLKGLEFEE 30

>UniRef50_UPI00006A2CC4 Cluster: UPI00006A2CC4 related cluster; n=1; Xenopus tropicalis|Rep: UPI00006A2CC4 UniRef100 entry - Xenopus tropicalis
          Length = 423

 Score = 31.9 bits (69), Expect = 8.9
 Identities = 17/49 (34%), Positives = 27/49 (55%)
 Frame = +1

Query: 43  HINQI
                 +L EE    E     +  I   DLKRLYE+ ++ L+ +  Y D
Sbjct: 371 YVNEIHKLKEEIERLE-----NESIDTEDLKRLYEDKVEELDSIDDYSD 414

>UniRef50_Q4T6A1 Cluster: Chromosome undetermined SCAF8850, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome undetermined SCAF8850, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer)
          Length = 2055

 Score = 31.9 bits (69), Expect = 8.9
 Identities = 25/88 (28%), Positives = 39/88 (44%), Gaps = 1/88 (1%)
 Frame = +1

Query: 16  IPENLRSREHINQILRLDEEASEAEITVERSADIVLRDLKRLYENSIKPLEGLYKYR-DLSNRHFGDPEIFSKPLVLFMGPWSGGKSS 276
           +PE LR  E   ++ R  EEAS   +      DI    L R+  N+I+PL   +K      + +  +P   SK +   M  W+G   +
Sbjct: 141 VPEKLRMPESTLKVKRSIEEAS-LTLQYLSKLDITPEPLHRVVSNTIEPLTLFHKLGVGRLDMYVLNPVKESKEMQFLMQKWAGNSKA 227

>UniRef50_Q8RAS5 Cluster: GTPases; n=3; Firmicutes|Rep: GTPases - Thermoanaerobacter tengcongensis
          Length = 428

 Score = 31.9 bits (69), Expect = 8.9
 Identities = 30/121 (24%), Positives = 56/121 (46%), Gaps = 1/121 (0%)
 Frame = +1

Query: 100 ERSADIVLRDLKRLYENSIKPLEGLYKYRDLSNRHFGDPEIFSKPLVLFMGPWSGGKSSIVNYLTGLE-FTEWSLRTGAEPSPAYFNILMHGKDPQVLDGTQLAADWTFSGLQKFGQGLEE 459
           E   ++  R ++   +   + LE L K+R+L  +     +I   P+V  +G  + GKS+++N LTG + + E  L    +P+ A   +L  G++  + D            ++ F   LEE
Sbjct: 173 ETKLEVDRRHIRNRIKAIEEKLEELEKHRNLQRQRRKKNQI---PVVAIVGYTNAGKSTLLNALTGADAYVEDKLFATLDPT-ARKLVLPSGREVILTDTVGFIRKLPHDLVEAFKSTLEE 289

>UniRef50_Q2IL99 Cluster: ABC transporter, ATPase subunit; n=2; Bacteria|Rep: ABC transporter, ATPase subunit - Anaeromyxobacter dehalogenans (strain 2CP-C)
          Length = 248

 Score = 31.9 bits (69), Expect = 8.9
 Identities = 13/31 (41%), Positives = 21/31 (67%)
 Frame = +1

Query: 211 DPEIFSKPLVLFMGPWSGGKSSIVNYLTGLE 303
           D E++   LV+ +GP   GKS+++N L GL+
Sbjct: 42  DLELYDGELVVILGPSGSGKSTLLNILGGLD 72

>UniRef50_A1DA37 Cluster: Putative uncharacterized protein; n=1; Neosartorya fischeri NRRL 181|Rep: Putative uncharacterized protein - Neosartorya fischeri (strain ATCC 1020 / DSM 3700 / NRRL 181)(Aspergillus fischerianus (strain ATCC 1020 / DSM 3700 / NRRL 181))
          Length = 637

 Score = 31.9 bits (69), Expect = 8.9
 Identities = 16/36 (44%), Positives = 21/36 (58%)
 Frame = +1

Query: 199 RHFGDPEIFSKPLVLFMGPWSGGKSSIVNYLTGLEF 306
           R  G  E  S P ++ +G  S GKSS++  LTGL F
Sbjct: 23  RELGVSENVSLPQLVVVGDQSSGKSSLLKALTGLSF 58

>UniRef50_A7I6U0 Cluster: Dynamin family protein; n=1; Candidatus Methanoregula boonei 6A8|Rep: Dynamin family protein - Methanoregula boonei (strain 6A8)
          Length = 679

 Score = 31.9 bits (69), Expect = 8.9
 Identities = 30/109 (27%), Positives = 47/109 (43%), Gaps = 1/109 (0%)
 Frame = +1

Query: 169 GLYKYRDLSNRHFGDPEIFSKPLVLFMGPWSGGKSSIVNYLTGLEFTEWSLRTGAEPSPAYFNILMHGKDPQVLDGTQLAADWTFSGLQKFGQGLEE-RLRGLKHPSKI 492
           GL ++R   +      E  S  + +F G  S GKSS++N + G +     L  G  P  A    ++HGKDP +      A   TF   Q      E+     +KH +++
Sbjct: 178 GLVEFRGTIDSILDRAEDRSFEIAVF-GRVSSGKSSLLNAIIGTDV----LPVGVTPVTAVPTRIIHGKDPSLTVSFADAPARTFETRQLAEFATEQHNPNNIKHVTRV 281

>UniRef50_Q44633 Cluster: Probable tRNA modification GTPase trmE; n=4; Buchnera aphidicola|Rep: Probable tRNA modification GTPase trmE - Buchnera aphidicola subsp. Schizaphis graminum
          Length = 456

 Score = 31.9 bits (69), Expect = 8.9
 Identities = 24/94 (25%), Positives = 45/94 (47%), Gaps = 1/94 (1%)
 Frame = +1

Query: 238 VLFMGPWSGGKSSIVNYLTGLEFTEWSLRTGAEPSPAYFNILMHGKDPQVLDGTQLAADWTFSGLQKFG-QGLEERLRGLKHPSKILEKVNIVE 516
           ++ +GP + GKSS++N L+  +    +   G      Y NI +HG   +++D   L    T   ++K G Q   E ++   H   +++K   +E
Sbjct: 221 IVIVGPPNAGKSSLLNVLSCRDRAIVTDLPGTTRDVLYENINIHGISCEIIDTAGLRE--TEDKIEKIGIQRSWEMIKNSDHVLYVMDKTISLE 312

  Database: uniref50
    Posted date: Oct 5, 2007 11:19 AM
  Number of letters in database: 575,637,011
  Number of sequences in database: 1,657,284

Lambda     K      H
   0.317    0.138    0.404

Gapped
Lambda     K      H
   0.279   0.0580    0.190

Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 515,670,925
Number of Sequences: 1657284
Number of extensions: 10417237
Number of successful extensions: 31589
Number of sequences better than 10.0: 56
Number of HSP's better than 10.0 without gapping: 30754
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 31565
length of database: 575,637,011
effective HSP length: 95
effective length of database: 418,195,031
effective search space used: 31782822356
frameshift window, decay const: 40, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.6 bits)