BLASTX 2.2.12 [Aug-07-2005] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= MFBP03_F_I18 (867 letters) Database: uniref50 1,657,284 sequences; 575,637,011 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value UniRef50_A0NDG7 Cluster: ENSANGP00000030449; n=2; Diptera|Rep: E... 80 6e-14 UniRef50_Q9VF05 Cluster: CG5516-PA; n=2; Endopterygota|Rep: CG55... 75 2e-12 UniRef50_Q8TBK6 Cluster: Zinc finger CCHC domain-containing prot... 72 2e-11 UniRef50_UPI00015B5C23 Cluster: PREDICTED: similar to zinc finge... 67 6e-10 UniRef50_A7T6T1 Cluster: Predicted protein; n=1; Nematostella ve... 58 3e-07 UniRef50_Q54C90 Cluster: Putative uncharacterized protein; n=1; ... 51 3e-05 UniRef50_Q9FGX7 Cluster: Similarity to unknown protein; n=5; Mag... 50 8e-05 UniRef50_UPI0000587099 Cluster: PREDICTED: similar to ZCCHC10 pr... 50 1e-04 UniRef50_A0E2U0 Cluster: Chromosome undetermined scaffold_75, wh... 47 5e-04 UniRef50_Q55V50 Cluster: Putative uncharacterized protein; n=1; ... 47 5e-04 UniRef50_UPI00006CB052 Cluster: hypothetical protein TTHERM_0023... 46 0.002 UniRef50_Q0US55 Cluster: Putative uncharacterized protein; n=2; ... 42 0.020 UniRef50_A2QQ15 Cluster: Similarity: the predicted ORF may be N-... 38 0.33 UniRef50_Q9LHK6 Cluster: Similarity to non-LTR retroelement reve... 37 0.58 UniRef50_Q5NRP7 Cluster: Putative gag-pol polyprotein, identical... 37 0.58 UniRef50_Q868S9 Cluster: Gag-like protein; n=1; Anopheles gambia... 35 2.3 UniRef50_Q8MY21 Cluster: Gag-like protein; n=2; Forficula scudde... 34 5.4 UniRef50_Q4D9Y6 Cluster: Putative uncharacterized protein; n=1; ... 34 5.4 UniRef50_UPI00006CDE0D Cluster: hypothetical protein TTHERM_0029... 33 7.1 UniRef50_Q24362 Cluster: Putative ORF1; n=2; melanogaster subgro... 33 7.1 UniRef50_A7RK63 Cluster: Predicted protein; n=1; Nematostella ve... 33 7.1 UniRef50_UPI00006CFCAD Cluster: hypothetical protein TTHERM_0058... 33 9.4 UniRef50_Q55AJ7 Cluster: Putative uncharacterized protein; n=2; ... 33 9.4 >UniRef50_A0NDG7 Cluster: ENSANGP00000030449; n=2; Diptera|Rep: ENSANGP00000030449 - Anopheles gambiae str. PEST Length = 252 Score = 80.2 bits (189), Expect = 6e-14 Identities = 35/66 (53%), Positives = 46/66 (69%), Gaps = 2/66 (3%) Frame = +3 Query: 168 AERKKQXXXXXXFPQGIRCQKCLEFGHWSYECKGKRKILVRPSRTRIMHKNLK--AKEEG 341 A ++K+ FP+GIRCQKCLEFGHWSYEC+GKRK L R SRT+++ KNL + G Sbjct: 9 ALKQKEAKLAAAFPKGIRCQKCLEFGHWSYECQGKRKYLHRSSRTQVLKKNLNKIGSKNG 68 Query: 342 QCSNGS 359 + G+ Sbjct: 69 DAATGT 74 >UniRef50_Q9VF05 Cluster: CG5516-PA; n=2; Endopterygota|Rep: CG5516-PA - Drosophila melanogaster (Fruit fly) Length = 176 Score = 74.9 bits (176), Expect = 2e-12 Identities = 35/67 (52%), Positives = 43/67 (64%), Gaps = 1/67 (1%) Frame = +3 Query: 138 MTL-GTYNRHQAERKKQXXXXXXFPQGIRCQKCLEFGHWSYECKGKRKILVRPSRTRIMH 314 MTL G R A++K+ FP GIRCQKCL+ GHWSYECK KRK + R SRT+ + Sbjct: 1 MTLVGLAARKLAQKKRSAKQAADFPNGIRCQKCLQIGHWSYECKEKRKYVHRSSRTKQLS 60 Query: 315 KNLKAKE 335 K + KE Sbjct: 61 KRMSQKE 67 >UniRef50_Q8TBK6 Cluster: Zinc finger CCHC domain-containing protein 10; n=35; Euteleostomi|Rep: Zinc finger CCHC domain-containing protein 10 - Homo sapiens (Human) Length = 192 Score = 72.1 bits (169), Expect = 2e-11 Identities = 35/62 (56%), Positives = 40/62 (64%), Gaps = 3/62 (4%) Frame = +3 Query: 210 QGIRCQKCLEFGHWSYECKGKRKILVRPSRTRIMHKNLKAKEEG---QCSNGSCKIPNKK 380 Q +RCQKCLEFGHW+YEC GKRK L RPSRT + K LK KE Q S G + K Sbjct: 41 QHVRCQKCLEFGHWTYECTGKRKYLHRPSRTAELKKALKEKENRLLLQQSIGETNVERKA 100 Query: 381 KR 386 K+ Sbjct: 101 KK 102 >UniRef50_UPI00015B5C23 Cluster: PREDICTED: similar to zinc finger, CCHC domain containing 10; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to zinc finger, CCHC domain containing 10 - Nasonia vitripennis Length = 159 Score = 66.9 bits (156), Expect = 6e-10 Identities = 32/58 (55%), Positives = 39/58 (67%), Gaps = 2/58 (3%) Frame = +3 Query: 216 IRCQKCLEFGHWSYECKGKRKILVRPSRTRIMHKNLKAKEEGQCSN--GSCKIPNKKK 383 +RCQKCLE GHWSYECKGKRK L R SRT + K L+ K E + +N + + P KK Sbjct: 23 VRCQKCLEMGHWSYECKGKRKYLHRNSRTVQLKKALEKKPEDRFNNEASNTEEPTNKK 80 >UniRef50_A7T6T1 Cluster: Predicted protein; n=1; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 55 Score = 58.0 bits (134), Expect = 3e-07 Identities = 23/37 (62%), Positives = 28/37 (75%) Frame = +3 Query: 219 RCQKCLEFGHWSYECKGKRKILVRPSRTRIMHKNLKA 329 +CQKCL+ GHW+YEC KRK L R SRT +M+K KA Sbjct: 6 KCQKCLQIGHWTYECTNKRKYLHRMSRTTVMNKKWKA 42 >UniRef50_Q54C90 Cluster: Putative uncharacterized protein; n=1; Dictyostelium discoideum AX4|Rep: Putative uncharacterized protein - Dictyostelium discoideum AX4 Length = 204 Score = 51.2 bits (117), Expect = 3e-05 Identities = 24/56 (42%), Positives = 35/56 (62%), Gaps = 1/56 (1%) Frame = +3 Query: 222 CQKCLEFGHWSYECKGKRKILVRPSRTRIMH-KNLKAKEEGQCSNGSCKIPNKKKR 386 CQKCL+FGH S+ECK K + RP+RT +++ K L K GS K ++K++ Sbjct: 9 CQKCLQFGHLSFECKSKPLYVSRPTRTALLNDKKLLEKTLNSYQKGSTKDNDEKEK 64 >UniRef50_Q9FGX7 Cluster: Similarity to unknown protein; n=5; Magnoliophyta|Rep: Similarity to unknown protein - Arabidopsis thaliana (Mouse-ear cress) Length = 280 Score = 50.0 bits (114), Expect = 8e-05 Identities = 21/48 (43%), Positives = 31/48 (64%), Gaps = 1/48 (2%) Frame = +3 Query: 219 RCQKCLEFGHWSYECKGKRKILVRPSRT-RIMHKNLKAKEEGQCSNGS 359 +CQKC + GHW+YECK +R + RPSRT ++ + L+ K +GS Sbjct: 104 QCQKCFQAGHWTYECKNERVYISRPSRTQQLKNPKLRTKPSVDDLDGS 151 >UniRef50_UPI0000587099 Cluster: PREDICTED: similar to ZCCHC10 protein; n=1; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to ZCCHC10 protein - Strongylocentrotus purpuratus Length = 117 Score = 49.6 bits (113), Expect = 1e-04 Identities = 21/46 (45%), Positives = 31/46 (67%) Frame = +3 Query: 243 GHWSYECKGKRKILVRPSRTRIMHKNLKAKEEGQCSNGSCKIPNKK 380 GHW+YEC GKRK + RPSR ++K LK K++ + G+ ++ N K Sbjct: 2 GHWTYECSGKRKYVHRPSRISTLNKKLKQKQKEE-EEGAKRLENPK 46 >UniRef50_A0E2U0 Cluster: Chromosome undetermined scaffold_75, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_75, whole genome shotgun sequence - Paramecium tetraurelia Length = 167 Score = 47.2 bits (107), Expect = 5e-04 Identities = 17/29 (58%), Positives = 23/29 (79%) Frame = +3 Query: 216 IRCQKCLEFGHWSYECKGKRKILVRPSRT 302 +RCQ+CL++GH +YECK + L RPSRT Sbjct: 35 VRCQRCLKYGHLTYECKNENVYLYRPSRT 63 >UniRef50_Q55V50 Cluster: Putative uncharacterized protein; n=1; Filobasidiella neoformans|Rep: Putative uncharacterized protein - Cryptococcus neoformans (Filobasidiella neoformans) Length = 217 Score = 47.2 bits (107), Expect = 5e-04 Identities = 17/29 (58%), Positives = 23/29 (79%) Frame = +3 Query: 219 RCQKCLEFGHWSYECKGKRKILVRPSRTR 305 RCQKCL+FGH++Y+CK + RPSRT+ Sbjct: 20 RCQKCLKFGHYTYQCKNPMPYVPRPSRTK 48 >UniRef50_UPI00006CB052 Cluster: hypothetical protein TTHERM_00239330; n=1; Tetrahymena thermophila SB210|Rep: hypothetical protein TTHERM_00239330 - Tetrahymena thermophila SB210 Length = 364 Score = 45.6 bits (103), Expect = 0.002 Identities = 18/42 (42%), Positives = 29/42 (69%) Frame = +3 Query: 210 QGIRCQKCLEFGHWSYECKGKRKILVRPSRTRIMHKNLKAKE 335 + RCQKCL+ GH++YEC+ + + RPS T + +KN + K+ Sbjct: 22 ENYRCQKCLKLGHFTYECQNQAAYVYRPSAT-MQYKNQELKQ 62 >UniRef50_Q0US55 Cluster: Putative uncharacterized protein; n=2; Pezizomycotina|Rep: Putative uncharacterized protein - Phaeosphaeria nodorum (Septoria nodorum) Length = 342 Score = 41.9 bits (94), Expect = 0.020 Identities = 21/41 (51%), Positives = 28/41 (68%), Gaps = 4/41 (9%) Frame = +3 Query: 222 CQKCLEFGHWSYECKG---KRKILVRPSRT-RIMHKNLKAK 332 CQKCL+ GH+SYECK +R RPSRT ++++ LK K Sbjct: 18 CQKCLKRGHYSYECKATAQERPYQSRPSRTQQLLNPKLKPK 58 >UniRef50_A2QQ15 Cluster: Similarity: the predicted ORF may be N-terminally too short; n=5; Trichocomaceae|Rep: Similarity: the predicted ORF may be N-terminally too short - Aspergillus niger Length = 301 Score = 37.9 bits (84), Expect = 0.33 Identities = 17/31 (54%), Positives = 22/31 (70%), Gaps = 3/31 (9%) Frame = +3 Query: 222 CQKCLEFGHWSYEC---KGKRKILVRPSRTR 305 CQKCL+ GH+SYEC +R + RPSRT+ Sbjct: 22 CQKCLKRGHYSYECTVTAQERPYMSRPSRTQ 52 >UniRef50_Q9LHK6 Cluster: Similarity to non-LTR retroelement reverse transcriptase; n=1; Arabidopsis thaliana|Rep: Similarity to non-LTR retroelement reverse transcriptase - Arabidopsis thaliana (Mouse-ear cress) Length = 332 Score = 37.1 bits (82), Expect = 0.58 Identities = 16/40 (40%), Positives = 25/40 (62%) Frame = +1 Query: 571 SFSMLEGRCLFSDDFLNLMLVYYXPWYNSYFQQTXTSPTL 690 +F + E R LF++D + ++ W+N+ QQT TSPTL Sbjct: 246 NFVIFEDRILFAEDTVQKSILEAKRWHNAKLQQTPTSPTL 285 >UniRef50_Q5NRP7 Cluster: Putative gag-pol polyprotein, identical; n=2; Solanum demissum|Rep: Putative gag-pol polyprotein, identical - Solanum demissum (Wild potato) Length = 1588 Score = 37.1 bits (82), Expect = 0.58 Identities = 16/43 (37%), Positives = 25/43 (58%) Frame = +3 Query: 216 IRCQKCLEFGHWSYECKGKRKILVRPSRTRIMHKNLKAKEEGQ 344 I+C KC GH +EC +R IL+R + K+ + +EEG+ Sbjct: 342 IQCHKCHGRGHMMHECPSRRNILLRENGEYESEKSEREEEEGE 384 >UniRef50_Q868S9 Cluster: Gag-like protein; n=1; Anopheles gambiae|Rep: Gag-like protein - Anopheles gambiae (African malaria mosquito) Length = 724 Score = 35.1 bits (77), Expect = 2.3 Identities = 9/17 (52%), Positives = 15/17 (88%) Frame = +3 Query: 216 IRCQKCLEFGHWSYECK 266 +RC +CLE GHW+++C+ Sbjct: 660 VRCYRCLELGHWAHDCR 676 >UniRef50_Q8MY21 Cluster: Gag-like protein; n=2; Forficula scudderi|Rep: Gag-like protein - Forficula scudderi Length = 148 Score = 33.9 bits (74), Expect = 5.4 Identities = 11/20 (55%), Positives = 14/20 (70%) Frame = +3 Query: 219 RCQKCLEFGHWSYECKGKRK 278 +C KC FGH SYEC+G + Sbjct: 66 KCYKCQNFGHMSYECEGNNE 85 >UniRef50_Q4D9Y6 Cluster: Putative uncharacterized protein; n=1; Trypanosoma cruzi|Rep: Putative uncharacterized protein - Trypanosoma cruzi Length = 158 Score = 33.9 bits (74), Expect = 5.4 Identities = 14/43 (32%), Positives = 24/43 (55%), Gaps = 4/43 (9%) Frame = +3 Query: 222 CQKCLEFGHWSYECK----GKRKILVRPSRTRIMHKNLKAKEE 338 CQKC HW++EC+ RK R SR++++ +K + + Sbjct: 27 CQKCASTEHWTFECQNAADATRKTTARLSRSQMLRWGIKRRRQ 69 >UniRef50_UPI00006CDE0D Cluster: hypothetical protein TTHERM_00297160; n=1; Tetrahymena thermophila SB210|Rep: hypothetical protein TTHERM_00297160 - Tetrahymena thermophila SB210 Length = 2311 Score = 33.5 bits (73), Expect = 7.1 Identities = 29/78 (37%), Positives = 43/78 (55%), Gaps = 2/78 (2%) Frame = -2 Query: 260 LIAPVSKLQALLASYALRKSSSQSCLLFPL-SLMPVICSQSHIQNLKGKINKIRG*LNIC 84 +I P +K+Q L+ + RK SS + +L L + + S+ +N K +I+ L C Sbjct: 1377 IINPFTKIQTLVNN---RKYSSSNKILGDLVEYLNIGNGFSYFKNTGSK--RIQQLLIQC 1431 Query: 83 LELYCSNCNSL-LFXNLK 33 L+LYC N NSL F NLK Sbjct: 1432 LKLYCKNQNSLNCFNNLK 1449 >UniRef50_Q24362 Cluster: Putative ORF1; n=2; melanogaster subgroup|Rep: Putative ORF1 - Drosophila melanogaster (Fruit fly) Length = 426 Score = 33.5 bits (73), Expect = 7.1 Identities = 17/49 (34%), Positives = 24/49 (48%) Frame = +3 Query: 207 PQGIRCQKCLEFGHWSYECKGKRKILVRPSRTRIMHKNLKAKEEGQCSN 353 P +RC+KCL FGH + CK + + S T+ + K E C N Sbjct: 183 PLPLRCKKCLRFGHPTPICKSV-ETCINCSETKHTNDGEKCTNEKNCLN 230 >UniRef50_A7RK63 Cluster: Predicted protein; n=1; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 85 Score = 33.5 bits (73), Expect = 7.1 Identities = 19/57 (33%), Positives = 25/57 (43%) Frame = -1 Query: 381 FSCWVSCSCHCCIDLLLWLSDSYA*FVYVKDELGSCVCPCTHSSSVQTPSTSGILCL 211 F C V CSCHC + + + YV D L C C+ S+V+ S CL Sbjct: 30 FRCNVDCSCHCNVACSCHCNVECSCPCYV-DCLSHCYVDCSRHSNVECSSYCNFKCL 85 >UniRef50_UPI00006CFCAD Cluster: hypothetical protein TTHERM_00585470; n=1; Tetrahymena thermophila SB210|Rep: hypothetical protein TTHERM_00585470 - Tetrahymena thermophila SB210 Length = 305 Score = 33.1 bits (72), Expect = 9.4 Identities = 9/34 (26%), Positives = 19/34 (55%) Frame = +3 Query: 219 RCQKCLEFGHWSYECKGKRKILVRPSRTRIMHKN 320 RC CL++GH++Y C+ + + ++ + N Sbjct: 17 RCMNCLQYGHYTYNCRSDTQYFLNGGQSSLSQSN 50 >UniRef50_Q55AJ7 Cluster: Putative uncharacterized protein; n=2; Dictyostelium discoideum|Rep: Putative uncharacterized protein - Dictyostelium discoideum AX4 Length = 772 Score = 33.1 bits (72), Expect = 9.4 Identities = 13/46 (28%), Positives = 26/46 (56%), Gaps = 2/46 (4%) Frame = +3 Query: 132 LNMTLGTYNRHQAERKKQXXXXXXF--PQGIRCQKCLEFGHWSYEC 263 +N+ G N + +RK+Q + + I+C++C + H+S+EC Sbjct: 244 INVKYGNDNNNNNKRKRQSVTTSRYYMEESIKCERCGDHDHFSFEC 289 Database: uniref50 Posted date: Oct 5, 2007 11:19 AM Number of letters in database: 575,637,011 Number of sequences in database: 1,657,284 Lambda K H 0.318 0.134 0.401 Gapped Lambda K H 0.279 0.0580 0.190 Matrix: BLOSUM62 Gap Penalties: Existence: 9, Extension: 2 Number of Hits to DB: 606,229,629 Number of Sequences: 1657284 Number of extensions: 10213869 Number of successful extensions: 25909 Number of sequences better than 10.0: 23 Number of HSP's better than 10.0 without gapping: 24508 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 0 Number of HSP's gapped (non-prelim): 25867 length of database: 575,637,011 effective HSP length: 100 effective length of database: 409,908,611 effective search space used: 77062818868 frameshift window, decay const: 40, 0.1 T: 12 A: 40 X1: 16 ( 7.3 bits) X2: 37 (14.9 bits) X3: 62 (25.0 bits) S1: 41 (21.7 bits)
- SilkBase 1999-2023 -