BLASTX 2.2.12 [Aug-07-2005]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= NRPG0441
         (724 letters)

Database: uniref50
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

UniRef50_Q9VV43 Cluster: TPPP family protein CG4893; n=8; Endopt...   210   2e-53
UniRef50_O94811 Cluster: Tubulin polymerization-promoting protei...   102   8e-21
UniRef50_P91127 Cluster: TPPP family protein C32E8.3; n=2; Caeno...    88   2e-16
UniRef50_UPI0000E4647B Cluster: PREDICTED: hypothetical protein;...    85   1e-15
UniRef50_UPI00005852C1 Cluster: PREDICTED: hypothetical protein;...    84   4e-15
UniRef50_Q5TR29 Cluster: ENSANGP00000025926; n=1; Anopheles gamb...    82   2e-14
UniRef50_UPI00015613C9 Cluster: PREDICTED: similar to FKSG46; n=...    72   1e-11
UniRef50_Q5BZ16 Cluster: SJCHGC08790 protein; n=1; Schistosoma j...    69   2e-10
UniRef50_A7SHU5 Cluster: Predicted protein; n=2; Nematostella ve...    67   4e-10
UniRef50_Q4UA39 Cluster: Putative uncharacterized protein; n=2; ...    52   2e-05
UniRef50_Q7R2L7 Cluster: GLP_546_56018_56500; n=1; Giardia lambl...    51   3e-05
UniRef50_Q9VT66 Cluster: CG6709-PA; n=2; Sophophora|Rep: CG6709-...    50   6e-05
UniRef50_A7ATL8 Cluster: Putative uncharacterized protein; n=1; ...    50   6e-05
UniRef50_UPI0000D55823 Cluster: PREDICTED: similar to CG4893-PA,...    48   2e-04
UniRef50_Q86E15 Cluster: Clone ZZZ338 mRNA sequence; n=3; Schist...    46   0.001
UniRef50_A3FQJ6 Cluster: Putative uncharacterized protein; n=2; ...    46   0.001
UniRef50_UPI0000DB72DA Cluster: PREDICTED: hypothetical protein;...    39   0.11
UniRef50_Q0I8P7 Cluster: Lipoprotein, putative; n=2; Synechococc...    38   0.19
UniRef50_A2YHP8 Cluster: Putative uncharacterized protein; n=6; ...    36   0.77
UniRef50_Q4SJ96 Cluster: Chromosome 4 SCAF14575, whole genome sh...    35   2.3
UniRef50_Q22X54 Cluster: Putative uncharacterized protein; n=1; ...    35   2.3
UniRef50_A5I2J4 Cluster: Putative prophage head protein; n=2; Cl...    34   3.1
UniRef50_A0CF04 Cluster: Chromosome undetermined scaffold_173, w...    34   3.1
UniRef50_Q92541 Cluster: RNA polymerase-associated protein RTF1 ...    34   3.1
UniRef50_Q2HD62 Cluster: Putative uncharacterized protein; n=1; ...    27   3.4
UniRef50_UPI0000DB6CBD Cluster: PREDICTED: similar to rhinoceros...    34   4.1
UniRef50_Q22551 Cluster: Groundhog (Hedgehog-like family) protei...    33   5.4
UniRef50_UPI00015B58A5 Cluster: PREDICTED: similar to GA18227-PA...    33   7.1
UniRef50_UPI000049981A Cluster: hypothetical protein 515.t00001;...    33   7.1
UniRef50_Q8RFM6 Cluster: Putative uncharacterized protein FN0666...    33   7.1
UniRef50_Q31J45 Cluster: Oxidoreductase; n=1; Thiomicrospira cru...    33   9.4
UniRef50_A0YHU3 Cluster: Putative uncharacterized protein; n=1; ...    33   9.4
UniRef50_A0UZ41 Cluster: Putative uncharacterized protein; n=1; ...    33   9.4
UniRef50_Q4QG91 Cluster: Putative uncharacterized protein; n=2; ...    33   9.4
UniRef50_O17117 Cluster: Putative uncharacterized protein; n=1; ...
33 9.4 >UniRef50_Q9VV43 Cluster: TPPP family protein CG4893; n=8; Endopterygota|Rep: TPPP family protein CG4893 - Drosophila melanogaster (Fruit fly) Length = 192 Score = 210 bits (514), Expect = 2e-53 Identities = 101/135 (74%), Positives = 115/135 (85%) Frame = +3 Query: 318 GDPKSDGKAITLSQSDKWMKQAKVIDGKKITTTDTAIHFKKLKSVKLGIDDYQKFLDDLA 497 GD KSDGK ITLSQSDKWMKQAKVID KKITTTDT IHFKK K++K+ + DY KFLDDLA Sbjct: 48 GDSKSDGKLITLSQSDKWMKQAKVID-KKITTTDTGIHFKKFKAMKISLSDYNKFLDDLA 106 Query: 498 KNKKVELDEIKKKLTTCGQPGITSHVTKSPAAAAAVDRLTDTSKYTGSHRQRFDETGKGK 677 K KKVEL EIK+KL +CG PG+ S + AAAAVDRLTDTSKYTGSH++RFD +GKGK Sbjct: 107 KTKKVELSEIKQKLASCGAPGVVS--VSAGKAAAAVDRLTDTSKYTGSHKERFDASGKGK 164 Query: 678 GIAGRKDLVDGSGYV 722 GIAGR+++VDGSGYV Sbjct: 165 GIAGRRNVVDGSGYV 179 >UniRef50_O94811 Cluster: Tubulin polymerization-promoting protein; n=61; Euteleostomi|Rep: Tubulin polymerization-promoting protein - Homo sapiens (Human) Length = 219 Score = 102 bits (245), Expect = 8e-21 Identities = 65/169 (38%), Positives = 95/169 (56%), Gaps = 7/169 (4%) Frame = +3 Query: 237 GASNGTSSKSEDNALXXXXXXXXXXXXGDPKSDGKAITLSQSDKWMKQAKVIDGKKITTT 416 GA G ++ E +AL GD ++ G+ + K K +VIDG+ +T T Sbjct: 37 GAGEGAAASPELSALEEAFRRFAVH--GDARATGREMHGKNWSKLCKDCQVIDGRNVTVT 94 Query: 417 DTAIHFKKLKSVK---LGIDDYQKFLDDLAKN--KKVELDEIKKKLTTC--GQPGITSHV 575 D I F K+K + + +Q+ L++LAK K +E +++ G+ I S V Sbjct: 95 DVDIVFSKIKGKSCRTITFEQFQEALEELAKKRFKDKSSEEAVREVHRLIEGKAPIISGV 154 Query: 576 TKSPAAAAAVDRLTDTSKYTGSHRQRFDETGKGKGIAGRKDLVDGSGYV 722 TK+ ++ V RLTDT+K+TGSH++RFD +GKGKG AGR DLVD SGYV Sbjct: 155 TKA-ISSPTVSRLTDTTKFTGSHKERFDPSGKGKGKAGRVDLVDESGYV 202 >UniRef50_P91127 Cluster: TPPP family protein C32E8.3; n=2; Caenorhabditis|Rep: TPPP family protein C32E8.3 - Caenorhabditis elegans Length = 180 Score = 88.2 bits (209), Expect = 2e-16 Identities = 54/130 (41%), Positives = 71/130 (54%), Gaps = 10/130 (7%) Frame = +3 Query: 363 DKWMKQAKVIDGKKITTTDTAIHFKKLKSVK--LGIDDYQKFL----DDLAKNKKV---- 512 DKW+K A V+D K IT T T I F K+ K D+ +K 
L +D A+ K Sbjct: 38 DKWLKDAGVLDNKAITGTMTGIAFSKVTGPKKKATFDETKKVLAFVAEDRARQSKKPIQD 97 Query: 513 ELDEIKKKLTTCGQPGITSHVTKSPAAAAAVDRLTDTSKYTGSHRQRFDETGKGKGIAGR 692 ELD I +KL P + + AA RLTD +KYTG+H++RFD GKGKG +GR Sbjct: 98 ELDAITEKLAKLEAPSVGGAAKAN--AAGVYSRLTDHTKYTGAHKERFDAEGKGKGKSGR 155 Query: 693 KDLVDGSGYV 722 D + +GYV Sbjct: 156 ADTTENTGYV 165 >UniRef50_UPI0000E4647B Cluster: PREDICTED: hypothetical protein; n=1; Strongylocentrotus purpuratus|Rep: PREDICTED: hypothetical protein - Strongylocentrotus purpuratus Length = 171 Score = 85.4 bits (202), Expect = 1e-15 Identities = 55/123 (44%), Positives = 77/123 (62%), Gaps = 8/123 (6%) Frame = +3 Query: 354 SQSDKWMKQAKVIDGKKITTTDTAIHFKK--LKSV---KLGIDDYQKFLDDLAKNKKVEL 518 S+ K + K+ D KK T+TDT I F + +KS K+ ++K L+ A+ K Sbjct: 30 SKWGKMFRDLKLYD-KKFTSTDTDIIFNRPEVKSKTDRKINFAQFKKALELCAEKKYGSK 88 Query: 519 DEIKK---KLTTCGQPGITSHVTKSPAAAAAVDRLTDTSKYTGSHRQRFDETGKGKGIAG 689 D+++K K+ PG TS TK+ + A VDRLTD+SKYTGSH++RFDE+GKGKG+ G Sbjct: 89 DDVQKLIEKICAGKGPG-TSGATKA-SKAGGVDRLTDSSKYTGSHKERFDESGKGKGLDG 146 Query: 690 RKD 698 RKD Sbjct: 147 RKD 149 >UniRef50_UPI00005852C1 Cluster: PREDICTED: hypothetical protein; n=1; Strongylocentrotus purpuratus|Rep: PREDICTED: hypothetical protein - Strongylocentrotus purpuratus Length = 175 Score = 83.8 bits (198), Expect = 4e-15 Identities = 54/137 (39%), Positives = 76/137 (55%), Gaps = 9/137 (6%) Frame = +3 Query: 339 KAITLSQSDKWMKQAKVIDGKKITTTDTAIHFKKLK-SVKLGIDDYQKFLDDL---AKNK 506 K IT K MK+ ++D KK+ T+ I F++ K S KL + Y+KFL L AK+K Sbjct: 29 KDITSKNFSKMMKECDIMD-KKVNQTEIDIIFQRAKASPKLKVLTYEKFLTSLKMIAKSK 87 Query: 507 -----KVELDEIKKKLTTCGQPGITSHVTKSPAAAAAVDRLTDTSKYTGSHRQRFDETGK 671 + +IK ++ + P T S + VD TD +KYTG HR+RF++ G Sbjct: 88 YGTDEEENFGKIKNQIRSSSGPSTAG--TTSTSTTGKVDHFTDVTKYTGQHRERFEKDGT 145 Query: 672 GKGIAGRKDLVDGSGYV 722 GKG AGR+ LV+ SGYV Sbjct: 146 GKGKAGREYLVEESGYV 162 >UniRef50_Q5TR29 Cluster: ENSANGP00000025926; n=1; Anopheles gambiae str. PEST|Rep: ENSANGP00000025926 - Anopheles gambiae str. 
PEST Length = 115 Score = 81.8 bits (193), Expect = 2e-14 Identities = 39/81 (48%), Positives = 52/81 (64%) Frame = +3 Query: 327 KSDGKAITLSQSDKWMKQAKVIDGKKITTTDTAIHFKKLKSVKLGIDDYQKFLDDLAKNK 506 + DGK I LSQSD WM+QA +I K T T T + F + + L D+Y +FL L K Sbjct: 35 QGDGKRILLSQSDCWMQQANLIGPKHFTLTQTGLIFFEFRKSTLDYDEYLQFLALLCNEK 94 Query: 507 KVELDEIKKKLTTCGQPGITS 569 +V ++E+K+KLT CG PGITS Sbjct: 95 QVSVEEVKEKLTNCGPPGITS 115 >UniRef50_UPI00015613C9 Cluster: PREDICTED: similar to FKSG46; n=1; Equus caballus|Rep: PREDICTED: similar to FKSG46 - Equus caballus Length = 293 Score = 72.1 bits (169), Expect = 1e-11 Identities = 35/58 (60%), Positives = 45/58 (77%) Frame = +3 Query: 549 GQPGITSHVTKSPAAAAAVDRLTDTSKYTGSHRQRFDETGKGKGIAGRKDLVDGSGYV 722 G+ I S VTK+ ++ V RLTDT+K+TGSH++RFD +G+GKG AGR DLVD SGYV Sbjct: 220 GKAPIISGVTKA-ISSPTVSRLTDTTKFTGSHKERFDPSGRGKGKAGRVDLVDESGYV 276 >UniRef50_Q5BZ16 Cluster: SJCHGC08790 protein; n=1; Schistosoma japonicum|Rep: SJCHGC08790 protein - Schistosoma japonicum (Blood fluke) Length = 169 Score = 68.5 bits (160), Expect = 2e-10 Identities = 40/87 (45%), Positives = 54/87 (62%), Gaps = 8/87 (9%) Frame = +3 Query: 486 DDLAKNKKVE----LDEIKKKLTTCGQPGITSHVTKSPAAAAAVDRLTDTSKYTGSHRQR 653 ++ AK K+ +++IK K+ G P + H T + +A RLTD YTGSH++R Sbjct: 76 EEYAKYNKISQADAVNKIKNKIVNSGGPKL--HGTTQLSKDSATSRLTDVKGYTGSHKER 133 Query: 654 FD-ETGKGKGIAGRKDLVD---GSGYV 722 FD ETGKGKGI GR+D+VD SGYV Sbjct: 134 FDTETGKGKGIEGREDVVDSKAASGYV 160 >UniRef50_A7SHU5 Cluster: Predicted protein; n=2; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 172 Score = 67.3 bits (157), Expect = 4e-10 Identities = 46/117 (39%), Positives = 64/117 (54%), Gaps = 9/117 (7%) Frame = +3 Query: 399 KKITTTDTAIHFKKLK-----SVKLGIDDYQKFLDDLAKNKKVELDEIKKKLTT-C-GQP 557 +K T+TDT I F + + K+ + ++ L A+ K D++ K C G+ Sbjct: 44 QKFTSTDTDIIFSRTEVKPKTERKINFNQFKVALGLCAEKKFGSKDQVGKLTEKICKGKG 103 Query: 558 GITSHVTKSPAAAAAVDRLTDTSKYTGSHRQRFDETGKGKGIAGRKDLVD--GSGYV 722 TS TK+ V+RLTDT 
YTGSH++RFD++GKGKGI GR D D GYV Sbjct: 104 PATSGATKA-VKVGGVERLTDTKCYTGSHKERFDKSGKGKGIEGRVDRDDKAAQGYV 159 >UniRef50_Q4UA39 Cluster: Putative uncharacterized protein; n=2; Theileria|Rep: Putative uncharacterized protein - Theileria annulata Length = 257 Score = 51.6 bits (118), Expect = 2e-05 Identities = 26/69 (37%), Positives = 40/69 (57%), Gaps = 3/69 (4%) Frame = +3 Query: 504 KKVELDEIKKKLTTCGQPGITSHVTKSPAAA---AAVDRLTDTSKYTGSHRQRFDETGKG 674 K ++ E + K P + + K+P +RLTD +TGSHR+RFDE G+G Sbjct: 48 KSLKQPEARVKFPDSEVPRYHNKLFKNPEGLKERCVFERLTDHRFFTGSHRERFDENGRG 107 Query: 675 KGIAGRKDL 701 +G+AGR++L Sbjct: 108 RGLAGRENL 116 >UniRef50_Q7R2L7 Cluster: GLP_546_56018_56500; n=1; Giardia lamblia ATCC 50803|Rep: GLP_546_56018_56500 - Giardia lamblia ATCC 50803 Length = 160 Score = 51.2 bits (117), Expect = 3e-05 Identities = 29/62 (46%), Positives = 37/62 (59%), Gaps = 9/62 (14%) Frame = +3 Query: 564 TSHVTKS---PAAAAAV------DRLTDTSKYTGSHRQRFDETGKGKGIAGRKDLVDGSG 716 T H T S P AA+ V DRLTD S Y G+H++RF+ G G+G+AGR + GSG Sbjct: 85 TEHPTSSRDKPKAASQVKGGSIFDRLTDPSTYHGTHKERFNADGTGRGLAGRDSVAKGSG 144 Query: 717 YV 722 V Sbjct: 145 TV 146 >UniRef50_Q9VT66 Cluster: CG6709-PA; n=2; Sophophora|Rep: CG6709-PA - Drosophila melanogaster (Fruit fly) Length = 117 Score = 50.0 bits (114), Expect = 6e-05 Identities = 24/72 (33%), Positives = 44/72 (61%) Frame = +3 Query: 342 AITLSQSDKWMKQAKVIDGKKITTTDTAIHFKKLKSVKLGIDDYQKFLDDLAKNKKVELD 521 +I LSQ D W++QAK++ IT T T + + + K +L +D+ + L++LA + + +D Sbjct: 37 SILLSQLDAWLEQAKLMP-NPITRTQTGLIYMRYKKWRLEYEDFLEVLNNLASDNNLAID 95 Query: 522 EIKKKLTTCGQP 557 E+K+ + G P Sbjct: 96 EMKQIMIDAGVP 107 >UniRef50_A7ATL8 Cluster: Putative uncharacterized protein; n=1; Babesia bovis|Rep: Putative uncharacterized protein - Babesia bovis Length = 274 Score = 50.0 bits (114), Expect = 6e-05 Identities = 20/32 (62%), Positives = 27/32 (84%) Frame = +3 Query: 606 DRLTDTSKYTGSHRQRFDETGKGKGIAGRKDL 701 +RLTD +TGSHR+RFDE G G+G+AGR+D+ Sbjct: 101 ERLTDYRFFTGSHRERFDENGYGRGLAGREDV 132 
>UniRef50_UPI0000D55823 Cluster: PREDICTED: similar to CG4893-PA, partial; n=1; Tribolium castaneum|Rep: PREDICTED: similar to CG4893-PA, partial - Tribolium castaneum Length = 90 Score = 48.4 bits (110), Expect = 2e-04 Identities = 24/65 (36%), Positives = 39/65 (60%) Frame = +3 Query: 345 ITLSQSDKWMKQAKVIDGKKITTTDTAIHFKKLKSVKLGIDDYQKFLDDLAKNKKVELDE 524 ITL Q +KW+ AK++ +KI DT F K KS + + KFL +L++ K + + E Sbjct: 25 ITLEQINKWLTDAKLM-SEKIKPEDTKSCFDKFKSETIDFATFHKFLHELSERKGIPISE 83 Query: 525 IKKKL 539 +++KL Sbjct: 84 LEEKL 88 >UniRef50_Q86E15 Cluster: Clone ZZZ338 mRNA sequence; n=3; Schistosoma japonicum|Rep: Clone ZZZ338 mRNA sequence - Schistosoma japonicum (Blood fluke) Length = 171 Score = 46.0 bits (104), Expect = 0.001 Identities = 26/60 (43%), Positives = 35/60 (58%), Gaps = 1/60 (1%) Frame = +3 Query: 522 EIKKKLTTCGQPGITSHVTKSPAAAAAVDRLTDTSKYTGSHRQRFD-ETGKGKGIAGRKD 698 E+K+K+ P I H ++ RLTD +TGSH++RFD +TGKG G AGR D Sbjct: 94 ELKRKIAEAS-PAI--HGGTKISSDPTTSRLTDVKTFTGSHKERFDAQTGKGLGKAGRVD 150 >UniRef50_A3FQJ6 Cluster: Putative uncharacterized protein; n=2; Cryptosporidium|Rep: Putative uncharacterized protein - Cryptosporidium parvum Iowa II Length = 251 Score = 45.6 bits (103), Expect = 0.001 Identities = 21/50 (42%), Positives = 26/50 (52%) Frame = +3 Query: 570 HVTKSPAAAAAVDRLTDTSKYTGSHRQRFDETGKGKGIAGRKDLVDGSGY 719 H + + DRL D YTG H+ RFD+ G G G AGR+ L GY Sbjct: 77 HTIPEDSKTSVFDRLLDPKLYTGMHKYRFDKDGNGLGKAGREYLFREDGY 126 >UniRef50_UPI0000DB72DA Cluster: PREDICTED: hypothetical protein; n=1; Apis mellifera|Rep: PREDICTED: hypothetical protein - Apis mellifera Length = 91 Score = 39.1 bits (87), Expect = 0.11 Identities = 24/71 (33%), Positives = 38/71 (53%) Frame = +3 Query: 345 ITLSQSDKWMKQAKVIDGKKITTTDTAIHFKKLKSVKLGIDDYQKFLDDLAKNKKVELDE 524 I LSQSDKW+ A+++D +TTTDT DLA++K ++ ++ Sbjct: 40 IPLSQSDKWLISARILDMVTLTTTDT----------------------DLAESKNLDFED 77 Query: 525 IKKKLTTCGQP 557 +K K+ CG+P Sbjct: 78 MKYKMQICGKP 88 >UniRef50_Q0I8P7 Cluster: Lipoprotein, putative; n=2; 
Synechococcus|Rep: Lipoprotein, putative - Synechococcus sp. (strain CC9311) Length = 176 Score = 38.3 bits (85), Expect = 0.19 Identities = 25/95 (26%), Positives = 43/95 (45%), Gaps = 4/95 (4%) Frame = +3 Query: 399 KKITTTDTA---IHFKKLKSVKLGIDDYQKFLDDLAKNKKVELDEIKKKLTTCGQPGITS 569 K + T D A + KL + + Y+KF+ + +NK + L+E ++L P I + Sbjct: 78 KALATLDKAEVELQASKLNEYRDQVAIYEKFVGQIRQNKTMTLEEAAQQLKAQAAPVIAA 137 Query: 570 HVTKSPAA-AAAVDRLTDTSKYTGSHRQRFDETGK 671 H S V+ L D+ S + + D +GK Sbjct: 138 HEQLSETTDCIEVEELMDSDNQASSGKSKDDASGK 172 >UniRef50_A2YHP8 Cluster: Putative uncharacterized protein; n=6; Oryza sativa|Rep: Putative uncharacterized protein - Oryza sativa subsp. indica (Rice) Length = 1254 Score = 36.3 bits (80), Expect = 0.77 Identities = 28/103 (27%), Positives = 49/103 (47%) Frame = +3 Query: 372 MKQAKVIDGKKITTTDTAIHFKKLKSVKLGIDDYQKFLDDLAKNKKVELDEIKKKLTTCG 551 M+ A I + TD ++ + K +K D+ + +D K + +E DE+++++ C Sbjct: 142 MEAALEISSRWPRVTDASL-LRWRKKLKRTSDECSQIMDR-CKRRAMEDDEMEQEVRQCA 199 Query: 552 QPGITSHVTKSPAAAAAVDRLTDTSKYTGSHRQRFDETGKGKG 680 P +H TKS ++ + D S T S QRF+ G G Sbjct: 200 FPKRIAHATKSFISSFTGQKKVD-SLITTSTIQRFERFANGAG 241 >UniRef50_Q4SJ96 Cluster: Chromosome 4 SCAF14575, whole genome shotgun sequence; n=4; Tetraodontidae|Rep: Chromosome 4 SCAF14575, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 1048 Score = 34.7 bits (76), Expect = 2.3 Identities = 18/44 (40%), Positives = 22/44 (50%), Gaps = 4/44 (9%) Frame = -1 Query: 664 VSSKRCLCEPVYLLVSVNRSTAAAAAGDFVT----CDVIPGWPQ 545 + S RC C+P Y L + RS AA G FV PGWP+ Sbjct: 625 LGSYRCACDPGYELAADRRSCETAACGGFVAKLNGSLATPGWPK 668 >UniRef50_Q22X54 Cluster: Putative uncharacterized protein; n=1; Tetrahymena thermophila SB210|Rep: Putative uncharacterized protein - Tetrahymena thermophila SB210 Length = 806 Score = 34.7 bits (76), Expect = 2.3 Identities = 19/54 (35%), Positives = 29/54 (53%) Frame = +3 Query: 375 KQAKVIDGKKITTTDTAIHFKKLKSVKLGIDDYQKFLDDLAKNKKVELDEIKKK 536 ++ K+IDG T + I +KLK K I 
D K ++ KK E DE+++K Sbjct: 429 EKQKLIDGVLATRENAGIEIQKLKEQKKVIHDESKKEREILLQKKKEEDEVERK 482 >UniRef50_A5I2J4 Cluster: Putative prophage head protein; n=2; Clostridium botulinum|Rep: Putative prophage head protein - Clostridium botulinum A str. ATCC 3502 Length = 343 Score = 34.3 bits (75), Expect = 3.1 Identities = 15/68 (22%), Positives = 36/68 (52%), Gaps = 3/68 (4%) Frame = +3 Query: 351 LSQSDKWMKQAKVIDGKKITTTDT---AIHFKKLKSVKLGIDDYQKFLDDLAKNKKVELD 521 + D W K+++V+ GK+ TD ++H + ++++ D + F ++N ++ L Sbjct: 4 MKSKDYWKKRSEVVAGKQFKKTDNYILSLHLEYMEALSSIQKDIEVFYSRFSQNNEISLQ 63 Query: 522 EIKKKLTT 545 E ++ L + Sbjct: 64 EARRLLNS 71 >UniRef50_A0CF04 Cluster: Chromosome undetermined scaffold_173, whole genome shotgun sequence; n=5; Oligohymenophorea|Rep: Chromosome undetermined scaffold_173, whole genome shotgun sequence - Paramecium tetraurelia Length = 151 Score = 34.3 bits (75), Expect = 3.1 Identities = 32/111 (28%), Positives = 50/111 (45%), Gaps = 4/111 (3%) Frame = +3 Query: 324 PKSDGKAITLSQSDKWMKQAKVIDGKKITTTDTAIHFKKLKSV----KLGIDDYQKFLDD 491 P+ DGK K K ++D KK+T+TD + F K+K + ++K L Sbjct: 17 PEMDGKTFA-----KVSKDCHLLD-KKLTSTDVDLIFAKIKPTPAARSITYAQFEKGLQM 70 Query: 492 LAKNKKVELDEIKKKLTTCGQPGITSHVTKSPAAAAAVDRLTDTSKYTGSH 644 +A+ K V + ++ ++ G P TK A AV D + YTG H Sbjct: 71 MAEKKGVGVQDVHNQILNAGGPHFQG--TK----ADAVKFHDDKNLYTGVH 115 >UniRef50_Q92541 Cluster: RNA polymerase-associated protein RTF1 homolog; n=40; Eumetazoa|Rep: RNA polymerase-associated protein RTF1 homolog - Homo sapiens (Human) Length = 670 Score = 34.3 bits (75), Expect = 3.1 Identities = 33/141 (23%), Positives = 63/141 (44%) Frame = +3 Query: 150 STEAQNTDAAVEQVTQEVKDVKLENGNAPGASNGTSSKSEDNALXXXXXXXXXXXXGDPK 329 S+ + + D++ E E +V + N+ +S+ + S SED GD + Sbjct: 95 SSGSSDKDSSAESSAPEEGEVSDSDSNSSSSSSDSDSSSEDEEFHDGYGEDLM---GDEE 151 Query: 330 SDGKAITLSQSDKWMKQAKVIDGKKITTTDTAIHFKKLKSVKLGIDDYQKFLDDLAKNKK 509 + +++ ++ + I+ +++ I KKLK+ K +K + K KK Sbjct: 152 DRARLEQMTEKEREQELFNRIEKREVLKRRFEIK-KKLKTAK------KK--EKKEKKKK 202 
Query: 510 VELDEIKKKLTTCGQPGITSH 572 E ++ KKKLT + +TSH Sbjct: 203 QEEEQEKKKLTQIQESQVTSH 223 >UniRef50_Q2HD62 Cluster: Putative uncharacterized protein; n=1; Chaetomium globosum|Rep: Putative uncharacterized protein - Chaetomium globosum (Soil fungus) Length = 1113 Score = 27.5 bits (58), Expect(2) = 3.4 Identities = 12/32 (37%), Positives = 20/32 (62%) Frame = +3 Query: 318 GDPKSDGKAITLSQSDKWMKQAKVIDGKKITT 413 GDP +G+A T S +DK ++ + +G K+ T Sbjct: 85 GDPTDNGQAETTSNTDKIAEKHQKKEGLKVNT 116 Score = 25.4 bits (53), Expect(2) = 3.4 Identities = 13/44 (29%), Positives = 21/44 (47%) Frame = +3 Query: 534 KLTTCGQPGITSHVTKSPAAAAAVDRLTDTSKYTGSHRQRFDET 665 K+ T G PG S + +SP + ++ K +G H + ET Sbjct: 113 KVNTAGVPGAESELLRSPQPQHKLS-ISKIQKISGVHAPTYRET 155 >UniRef50_UPI0000DB6CBD Cluster: PREDICTED: similar to rhinoceros CG7036-PB, isoform B; n=1; Apis mellifera|Rep: PREDICTED: similar to rhinoceros CG7036-PB, isoform B - Apis mellifera Length = 2662 Score = 33.9 bits (74), Expect = 4.1 Identities = 21/72 (29%), Positives = 35/72 (48%), Gaps = 1/72 (1%) Frame = +3 Query: 327 KSDGKAITLSQSDKWMK-QAKVIDGKKITTTDTAIHFKKLKSVKLGIDDYQKFLDDLAKN 503 K K I ++ K + K+ID +K TD++ ++ K K+G D L++ K Sbjct: 1186 KQSVKVIEKKDEEQSKKDEQKIIDQEKAECTDSSSKSEEKKVKKIGSKDAINILEEEMKQ 1245 Query: 504 KKVELDEIKKKL 539 ++ E D KK L Sbjct: 1246 RRAERDSPKKSL 1257 >UniRef50_Q22551 Cluster: Groundhog (Hedgehog-like family) protein 6; n=2; Caenorhabditis|Rep: Groundhog (Hedgehog-like family) protein 6 - Caenorhabditis elegans Length = 559 Score = 33.5 bits (73), Expect = 5.4 Identities = 16/30 (53%), Positives = 20/30 (66%), Gaps = 1/30 (3%) Frame = +1 Query: 247 MERPVKARITPYLSRK-PSRRFPNLEIPSP 333 +ERPV AR TPY+ R P+R P +E P P Sbjct: 174 IERPVPARPTPYIERPVPARPAPYIERPEP 203 Score = 32.7 bits (71), Expect = 9.4 Identities = 17/41 (41%), Positives = 23/41 (56%), Gaps = 4/41 (9%) Frame = +1 Query: 247 MERPVKARITPYLSRKPSRRFPNLE----IPSPMEKPSRSR 357 +ERPV AR PY+ P+R P +E P P +P R+R Sbjct: 210 IERPVPARPAPYIEPTPARPAPYIEPSTAKPQPRPQPPRTR 
250 >UniRef50_UPI00015B58A5 Cluster: PREDICTED: similar to GA18227-PA; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to GA18227-PA - Nasonia vitripennis Length = 2301 Score = 33.1 bits (72), Expect = 7.1 Identities = 30/110 (27%), Positives = 54/110 (49%), Gaps = 8/110 (7%) Frame = +3 Query: 339 KAITLSQSDKWMKQAKVIDGKKITTTDTAIHFKKLKSVKLGIDDYQKFL-DDLAKNKKVE 515 K I+ +D + ID K + TD KK +VK ID+ +K + + + + KK Sbjct: 2020 KTISEEINDTKKVASDQIDKTKRSVTDRIDETKK--NVKDTIDETKKMVAEKIDETKKSI 2077 Query: 516 LDEIKKKLTTCGQPGITSHVTKSPAAA-------AAVDRLTDTSKYTGSH 644 D+I++ ++ + G T K +A ++VDR+ +TSKY ++ Sbjct: 2078 SDKIQRSISKSDEDGQTIDAEKQVSAEKKEGTSISSVDRILETSKYLATN 2127 >UniRef50_UPI000049981A Cluster: hypothetical protein 515.t00001; n=4; Entamoeba histolytica HM-1:IMSS|Rep: hypothetical protein 515.t00001 - Entamoeba histolytica HM-1:IMSS Length = 642 Score = 33.1 bits (72), Expect = 7.1 Identities = 24/126 (19%), Positives = 44/126 (34%), Gaps = 2/126 (1%) Frame = +3 Query: 168 TDAAVEQVTQEVKDVKLENGNAPGASNGTSSKSEDNALXXXXXXXXXXXXGDPKSDGKAI 347 T +E ++ KDVK EN N P + + N + + Sbjct: 437 TQNKIENTNEQQKDVKKENNNPPKTETNSKENTHTNEQQKDVKKENTNPPKPETNSKENK 496 Query: 348 TLSQSDKWMKQAKVIDGKKITTTDTAIHFKKLKSVKLGIDDYQKFLDDLAKNKKV--ELD 521 + QS KQ + + T I K +K +D ++ +K+ E++ Sbjct: 497 EIIQSSNTNKQINSLPSLPLNNTPLGIALKAIKPTPEQLDKLHSSFNNFVDAQKITYEIE 556 Query: 522 EIKKKL 539 K++L Sbjct: 557 SFKEQL 562 >UniRef50_Q8RFM6 Cluster: Putative uncharacterized protein FN0666; n=1; Fusobacterium nucleatum subsp. nucleatum|Rep: Putative uncharacterized protein FN0666 - Fusobacterium nucleatum subsp. 
nucleatum Length = 205 Score = 33.1 bits (72), Expect = 7.1 Identities = 18/43 (41%), Positives = 27/43 (62%), Gaps = 1/43 (2%) Frame = +3 Query: 414 TDTAIHFKKLKSVKLGIDDYQKFLDDLAKNKKVELD-EIKKKL 539 T T I + V++ I + KFL+D+AKN KVE+D + K K+ Sbjct: 93 TITEIKYNSPTEVEVYITENGKFLEDIAKNCKVEVDKKFKSKM 135 >UniRef50_Q31J45 Cluster: Oxidoreductase; n=1; Thiomicrospira crunogena XCL-2|Rep: Oxidoreductase - Thiomicrospira crunogena (strain XCL-2) Length = 735 Score = 32.7 bits (71), Expect = 9.4 Identities = 21/65 (32%), Positives = 33/65 (50%) Frame = +3 Query: 426 IHFKKLKSVKLGIDDYQKFLDDLAKNKKVELDEIKKKLTTCGQPGITSHVTKSPAAAAAV 605 IHF++ + + DDY + D VE+ + + G+PG+ +PA AAAV Sbjct: 664 IHFEQGRVKETNFDDYPALMMDETPEIIVEIVKSENGPGGYGEPGVP---PLAPALAAAV 720 Query: 606 DRLTD 620 +LTD Sbjct: 721 SQLTD 725 >UniRef50_A0YHU3 Cluster: Putative uncharacterized protein; n=1; marine gamma proteobacterium HTCC2143|Rep: Putative uncharacterized protein - marine gamma proteobacterium HTCC2143 Length = 490 Score = 32.7 bits (71), Expect = 9.4 Identities = 14/46 (30%), Positives = 24/46 (52%) Frame = +2 Query: 332 RWKSHHALAKRQMDEASQSH*WKENNNNGHGHSLQKTQIGKTRHRR 469 R+K HH + RQ+ + S W+ N ++ G S + + + K H R Sbjct: 203 RYKPHHYYSHRQVTHHTDSRRWRHNPHHRRGVSYRNSHVQKRFHPR 248 >UniRef50_A0UZ41 Cluster: Putative uncharacterized protein; n=1; Clostridium cellulolyticum H10|Rep: Putative uncharacterized protein - Clostridium cellulolyticum H10 Length = 104 Score = 32.7 bits (71), Expect = 9.4 Identities = 25/72 (34%), Positives = 37/72 (51%), Gaps = 3/72 (4%) Frame = +3 Query: 336 GKAITLSQSDKWM--KQAKVIDG-KKITTTDTAIHFKKLKSVKLGIDDYQKFLDDLAKNK 506 GK I DK + K K ID KK +T D + +L ++ +F ++ KN Sbjct: 6 GKKIQEMMQDKMVQSKVNKAIDMLKKDSTKDLEKKLSAINKDEL-MEKVNEFDEEKLKNI 64 Query: 507 KVELDEIKKKLT 542 K++ DEIKKK+T Sbjct: 65 KIDKDEIKKKIT 76 >UniRef50_Q4QG91 Cluster: Putative uncharacterized protein; n=2; Leishmania|Rep: Putative uncharacterized protein - Leishmania major Length = 302 Score = 32.7 bits (71), Expect = 9.4 Identities = 20/61 
(32%), Positives = 35/61 (57%) Frame = +3 Query: 417 DTAIHFKKLKSVKLGIDDYQKFLDDLAKNKKVELDEIKKKLTTCGQPGITSHVTKSPAAA 596 D A HF++L+S + DY L +++K+ + L+++ K++ G VT +PAAA Sbjct: 200 DVAAHFEELESQFFSLGDY---LREISKH-VIRLNDMSKQVNNGVNYGANGGVTGAPAAA 255 Query: 597 A 599 A Sbjct: 256 A 256 >UniRef50_O17117 Cluster: Putative uncharacterized protein; n=1; Caenorhabditis elegans|Rep: Putative uncharacterized protein - Caenorhabditis elegans Length = 591 Score = 32.7 bits (71), Expect = 9.4 Identities = 23/85 (27%), Positives = 43/85 (50%), Gaps = 4/85 (4%) Frame = +3 Query: 429 HFKKLKSVKLGIDDYQKFLDDLAK---NKKVELDEIKKKLTTCGQPGITSHVTKSPAAAA 599 + ++ +K + D Q+ L+ +AK NK+ E++E+KK ++S + K+ AA Sbjct: 216 YMNDMRDLKKALKDNQEGLEKIAKDVKNKEGEIEELKK--------SVSSEIVKATEAAH 267 Query: 600 AVDRL-TDTSKYTGSHRQRFDETGK 671 A D+L K H +R ++ K Sbjct: 268 ATDQLRKKLQKQQDEHEKRVEQEHK 292 Database: uniref50 Posted date: Oct 5, 2007 11:19 AM Number of letters in database: 575,637,011 Number of sequences in database: 1,657,284 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.279 0.0580 0.190 Matrix: BLOSUM62 Gap Penalties: Existence: 9, Extension: 2 Number of Hits to DB: 727,794,579 Number of Sequences: 1657284 Number of extensions: 14907890 Number of successful extensions: 48779 Number of sequences better than 10.0: 35 Number of HSP's better than 10.0 without gapping: 46086 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 48722 length of database: 575,637,011 effective HSP length: 98 effective length of database: 413,223,179 effective search space used: 58677691418 frameshift window, decay const: 40, 0.1 T: 12 A: 40 X1: 16 ( 7.3 bits) X2: 37 (14.9 bits) X3: 62 (25.0 bits) S1: 41 (21.7 bits)