BLASTP 2.2.12 [Aug-07-2005]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= BGIBMGA001439-TA|BGIBMGA001439-PA|undefined
         (195 letters)

Database: uniref50
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

UniRef50_Q28WW4 Cluster: GA12376-PA; n=1; Drosophila pseudoobscu...    54   2e-06
UniRef50_Q9W161 Cluster: CG13581-PA; n=2; Drosophila melanogaste...    53   5e-06
UniRef50_UPI0000F1F256 Cluster: PREDICTED: similar to SJCHGC0536...    47   3e-04
UniRef50_A2RUZ3 Cluster: Nefm protein; n=4; Danio rerio|Rep: Nef...    36   0.83
UniRef50_A7PQ74 Cluster: Chromosome chr18 scaffold_24, whole gen...    35   1.5
UniRef50_Q8Q0U6 Cluster: Putative uncharacterized protein; n=1; ...    35   1.5
UniRef50_Q4SW56 Cluster: Chromosome undetermined SCAF13690, whol...    34   2.5
UniRef50_A0DMH5 Cluster: Chromosome undetermined scaffold_56, wh...    34   2.5
UniRef50_UPI00015B4A4D Cluster: PREDICTED: similar to conserved ...    33   3.4
UniRef50_Q4Z3X9 Cluster: Pb-reticulocyte binding protein; n=2; P...    33   3.4
UniRef50_Q22M90 Cluster: Putative uncharacterized protein; n=1; ...    33   3.4
UniRef50_Q7R220 Cluster: GLP_630_73647_79199; n=1; Giardia lambl...    33   4.5
UniRef50_UPI000056383B Cluster: hypothetical protein GLP_165_633...    33   5.9
UniRef50_Q8G1W8 Cluster: Penicillin-binding protein, 1A family; ...    33   5.9
UniRef50_Q0I631 Cluster: Glycosyl transferase family protein; n=...    33   5.9
UniRef50_A7TJ52 Cluster: Putative uncharacterized protein; n=1; ...    33   5.9
UniRef50_UPI0000F2E010 Cluster: PREDICTED: similar to chondroiti...    32   7.8
UniRef50_A2SNA5 Cluster: Superfamily II DNA/RNA helicases SNF2 f...    32   7.8
UniRef50_Q8I293 Cluster: Putative uncharacterized protein PFA023...
32 7.8 >UniRef50_Q28WW4 Cluster: GA12376-PA; n=1; Drosophila pseudoobscura|Rep: GA12376-PA - Drosophila pseudoobscura (Fruit fly) Length = 166 Score = 54.4 bits (125), Expect = 2e-06 Identities = 30/77 (38%), Positives = 46/77 (59%), Gaps = 4/77 (5%) Query: 72 VSAVIHR-FRKPVPDYLLAKVETVR--KIQPPMIAATSSEKDILKESKKS-YLNMRNKRG 127 VS+ I+R R + DY L+ VE K PM + ++E ++L+ ++S YL R + Sbjct: 74 VSSRIYRPSRSLIFDYNLSPVEQQHFSKCSDPMKSVPAAELELLRSGQRSTYLERRYEHS 133 Query: 128 PDDKYLYMESENWKYGW 144 PDDKY Y E+ +W+YGW Sbjct: 134 PDDKYNYPEATSWRYGW 150 >UniRef50_Q9W161 Cluster: CG13581-PA; n=2; Drosophila melanogaster|Rep: CG13581-PA - Drosophila melanogaster (Fruit fly) Length = 208 Score = 52.8 bits (121), Expect = 5e-06 Identities = 23/53 (43%), Positives = 35/53 (66%), Gaps = 1/53 (1%) Query: 100 PMIAATSSEKDILKESKKS-YLNMRNKRGPDDKYLYMESENWKYGWKLNESEL 151 PM A ++E +L+ +++ YL R +R PDDKY Y E+ +W+YGW ES+L Sbjct: 147 PMKAVPAAELQLLQSGQRTTYLERRYERSPDDKYNYPEATSWRYGWFHRESDL 199 >UniRef50_UPI0000F1F256 Cluster: PREDICTED: similar to SJCHGC05363 protein; n=1; Danio rerio|Rep: PREDICTED: similar to SJCHGC05363 protein - Danio rerio Length = 180 Score = 46.8 bits (106), Expect = 3e-04 Identities = 22/86 (25%), Positives = 42/86 (48%) Query: 88 LAKVETVRKIQPPMIAATSSEKDILKESKKSYLNMRNKRGPDDKYLYMESENWKYGWKLN 147 L++ +R + P A + + YL R ++GP++K+ Y +W+YGW+L Sbjct: 83 LSEAPLMRPVSPQTSGALYQGISTEGKGRLLYLRKRAQKGPEEKFDYPILSSWEYGWRLG 142 Query: 148 ESELKLRGPEHGKINHLLHSLVSRVG 173 + E R P +G+ + + +R G Sbjct: 143 DFETDCRTPANGRSGVVKSAFYARNG 168 >UniRef50_A2RUZ3 Cluster: Nefm protein; n=4; Danio rerio|Rep: Nefm protein - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 707 Score = 35.5 bits (78), Expect = 0.83 Identities = 30/104 (28%), Positives = 51/104 (49%), Gaps = 11/104 (10%) Query: 19 YEKESR-LRAKWFNLHKEKIEKCATLKVDTKNYTHSDIAEATMISGMEAITRDHVSAVIH 77 Y++E + LR LHKEK A + +DT ++ D+ E+ R+H+ A I Sbjct: 123 YDRELQDLRCALEQLHKEK----AQILLDT-DHMDEDLQRIRERYEDESRLREHMDAAIR 177 Query: 78 
RFRKPVPDYLLAKVETVRKIQPPMIAATSSEKDILKESKKSYLN 121 +K D +L K+E RK+Q A E D L+++ + ++ Sbjct: 178 GMKKDKDDSVLMKMELERKVQ-----ALVDEMDFLRQNHEEEIS 216 >UniRef50_A7PQ74 Cluster: Chromosome chr18 scaffold_24, whole genome shotgun sequence; n=1; Vitis vinifera|Rep: Chromosome chr18 scaffold_24, whole genome shotgun sequence - Vitis vinifera (Grape) Length = 364 Score = 34.7 bits (76), Expect = 1.5 Identities = 32/137 (23%), Positives = 55/137 (40%), Gaps = 7/137 (5%) Query: 32 LHKEKIEKCATLKVDTKNYTHS-DIAEATMISGMEAITRDHVSAVIHRFRKP--VPDYLL 88 LHKE IE +K K T D+A ++S M + D V++ + LL Sbjct: 118 LHKELIE---IIKKRKKELTEKRDLAAQDLLSHM-LLVPDENGKVLNEMEISTYILGVLL 173 Query: 89 AKVETVRKIQPPMIAATSSEKDILKESKKSYLNMRNKRGPDDKYLYMESENWKYGWKLNE 148 A ET ++ S D+ K + + +GP++ + + +N K+ W + Sbjct: 174 ASHETTSTAITFVLKYLSEFPDVYDAVLKEQMEIAKSKGPEEFLNWNDIQNMKHSWNVAR 233 Query: 149 SELKLRGPEHGKINHLL 165 ++L P G L Sbjct: 234 ESMRLSPPGIGGFREAL 250 >UniRef50_Q8Q0U6 Cluster: Putative uncharacterized protein; n=1; Methanosarcina mazei|Rep: Putative uncharacterized protein - Methanosarcina mazei (Methanosarcina frisia) Length = 671 Score = 34.7 bits (76), Expect = 1.5 Identities = 26/107 (24%), Positives = 51/107 (47%), Gaps = 5/107 (4%) Query: 67 ITRDHVSAVIHRFR--KPVPDYLLAKVETVRKIQPPMIAATSSEKDILKESKKSYLNMRN 124 + R+ + VIH+ + + +++ +E V+ P + A D+ +K N++N Sbjct: 538 LVRNIIRGVIHKIKTEEEASEWMRKHLEKVKNSSPSLGDAGGKGYDLNNFLRKD--NLKN 595 Query: 125 -KRGPDDKYLYMESENWKYGWKLNESELKLRGPEHGKINHLLHSLVS 170 DD+YL+++S + K+ K+ +L G I +H LVS Sbjct: 596 VPEIDDDEYLFLDSPSNKWDLKITLEDLSRSGRSIEDILKEIHGLVS 642 >UniRef50_Q4SW56 Cluster: Chromosome undetermined SCAF13690, whole genome shotgun sequence; n=2; Tetraodontidae|Rep: Chromosome undetermined SCAF13690, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 537 Score = 33.9 bits (74), Expect = 2.5 Identities = 24/77 (31%), Positives = 38/77 (49%), Gaps = 5/77 (6%) Query: 22 ESRLRAKWFNLHKEKIEKCATLKVDTKNYTHSDIAEATMISGMEAITRDHVSAVIHRFRK 81 E+ LRA +H++K + +++D++ + DI 
EA RD A+I +K Sbjct: 120 EAELRAALEQIHRDKTQ----IQLDSE-HLEEDIQRLRERLDEEARIRDETEAIIRVLKK 174 Query: 82 PVPDYLLAKVETVRKIQ 98 D LAK E +KIQ Sbjct: 175 DTSDSELAKSELEKKIQ 191 >UniRef50_A0DMH5 Cluster: Chromosome undetermined scaffold_56, whole genome shotgun sequence; n=2; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_56, whole genome shotgun sequence - Paramecium tetraurelia Length = 417 Score = 33.9 bits (74), Expect = 2.5 Identities = 35/166 (21%), Positives = 70/166 (42%), Gaps = 4/166 (2%) Query: 16 IENYEKESRLRAKWFNLHKEKIEKCATLKVDTKNYTHSDIAEATMISGMEAITRDHVSAV 75 ++N E+ R K ++ KE +E+ T+K D+ N +S + + E + V+ Sbjct: 87 VQNLEENLREIVKKYDQSKEDLERERTIKYDS-NRNYSQLYQRYQDQEREVLKYQQVAKS 145 Query: 76 IHRFRKPVPDYLLAKVETVR-KIQPPMIAATSSEK--DILKESKKSYLNMRNKRGPDDKY 132 I +K V L + E K EK ILK+ ++ + + K + ++ Sbjct: 146 IETMQKQVQRELQEQKEKWNAKNNEIQEQKKVQEKLQSILKQKEREINDFKLKLKEEREF 205 Query: 133 LYMESENWKYGWKLNESELKLRGPEHGKINHLLHSLVSRVGPQPDP 178 E++ + +L L+ + + N+ L +L+ + QP P Sbjct: 206 RSYENQRLVQEFTQTYQDLTLQNDQLIQENNELRTLILEIDNQPQP 251 >UniRef50_UPI00015B4A4D Cluster: PREDICTED: similar to conserved hypothetical protein; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to conserved hypothetical protein - Nasonia vitripennis Length = 1832 Score = 33.5 bits (73), Expect = 3.4 Identities = 17/69 (24%), Positives = 39/69 (56%), Gaps = 5/69 (7%) Query: 9 PKVINFLIENYEKESRLRAKWFNLHKEK-----IEKCATLKVDTKNYTHSDIAEATMISG 63 P+VI+ +++ E+E+ + K + +E+ +++ T K D +N ++ A A I+ Sbjct: 483 PEVISKVVQEKEEEAEIFYKSSDSEEEQDEPMEVDEAGTAKKDNENTNENETATAVGIAD 542 Query: 64 MEAITRDHV 72 + +T+DH+ Sbjct: 543 TQKLTQDHI 551 >UniRef50_Q4Z3X9 Cluster: Pb-reticulocyte binding protein; n=2; Plasmodium (Vinckeia)|Rep: Pb-reticulocyte binding protein - Plasmodium berghei Length = 1913 Score = 33.5 bits (73), Expect = 3.4 Identities = 25/105 (23%), Positives = 54/105 (51%), Gaps = 3/105 (2%) Query: 71 HVSAVIHRFRKPVPDYLLAKVETVRKIQP-PMIAATSSEKDILKESKKSYLNMRNKRGPD 129 H+ + K + DY+ + I P ++ + + +KD + SKK +N+ NK+ + 
Sbjct: 394 HIPYLNQSTMKDIWDYVRLFYNVICYIDPIDLVKSLTYQKDKIIXSKKKSVNLGNKKMAE 453 Query: 130 DKYLYMESENWKYGWKLNESELKLRGPEHGKINHLLHSLVSRVGP 174 + ++++N K G K+++ E K K N+L+ + ++R+ P Sbjct: 454 NTSNTIDNQN-KNGIKISKGE-KRNSFLKTKKNNLMLTHLARINP 496 >UniRef50_Q22M90 Cluster: Putative uncharacterized protein; n=1; Tetrahymena thermophila SB210|Rep: Putative uncharacterized protein - Tetrahymena thermophila SB210 Length = 1698 Score = 33.5 bits (73), Expect = 3.4 Identities = 22/63 (34%), Positives = 28/63 (44%), Gaps = 2/63 (3%) Query: 103 AATSSEKDILKESKKSYLNMRNKRGPDDKYLY--MESENWKYGWKLNESELKLRGPEHGK 160 A E + SK+S LN KRG + ME N K LN S+ L+G + Sbjct: 362 AENEEETSSCRNSKQSSLNSSKKRGKSQQNSRRSMEISNQKRSSSLNSSKQSLKGYPQQQ 421 Query: 161 INH 163 INH Sbjct: 422 INH 424 >UniRef50_Q7R220 Cluster: GLP_630_73647_79199; n=1; Giardia lamblia ATCC 50803|Rep: GLP_630_73647_79199 - Giardia lamblia ATCC 50803 Length = 1850 Score = 33.1 bits (72), Expect = 4.5 Identities = 27/102 (26%), Positives = 49/102 (48%), Gaps = 10/102 (9%) Query: 20 EKESRLRAKWFNLH--KEKIEKCATLKVDTKNYTHSDIAEATMISGMEAITRDHVSAVIH 77 E S +R+K + K+K++K T +D+K+++ I EAT E +H + H Sbjct: 1165 ELSSTIRSKEDEISELKQKVKKYKTAYIDSKSFSSDAIKEAT---AQELAKYEHGLEIAH 1221 Query: 78 RFRKPVPDYLLAKVETVRKIQPPMIAATSSEKDILKESKKSY 119 K V + +A E ++ ++ + S+E + LK K Y Sbjct: 1222 ---KEVLELRMANAELKAALE--IVQSRSTEAEDLKHKSKKY 1258 >UniRef50_UPI000056383B Cluster: hypothetical protein GLP_165_63389_64429; n=1; Giardia lamblia ATCC 50803|Rep: hypothetical protein GLP_165_63389_64429 - Giardia lamblia ATCC 50803 Length = 346 Score = 32.7 bits (71), Expect = 5.9 Identities = 19/73 (26%), Positives = 36/73 (49%), Gaps = 4/73 (5%) Query: 14 FLIENYEKESRLRAKWFNLHKEKIEKCATLKVDTKNYTHSDIAEATMISGMEAITRDHVS 73 + I+ + K++ ++ + +E AT+K DTK+ T+ E SG E + S Sbjct: 245 YSIDRFVKDNAVKEEDIAYAREVFNVSATVKADTKDETN----ETRSTSGEEVKEEEPQS 300 Query: 74 AVIHRFRKPVPDY 86 + R ++P+P Y Sbjct: 301 TKVQRKKRPIPVY 313 >UniRef50_Q8G1W8 Cluster: Penicillin-binding protein, 1A family; n=11; 
Rhizobiales|Rep: Penicillin-binding protein, 1A family - Brucella suis Length = 718 Score = 32.7 bits (71), Expect = 5.9 Identities = 23/77 (29%), Positives = 42/77 (54%), Gaps = 7/77 (9%) Query: 49 NYTHSDIAEATMIS-GMEAITRDHVSAVIHRFRKPVPDYLLA-KVETVRKI-----QPPM 101 N S++ E+ +S G A+ R H ++VI R + PDY L + V+K+ Q + Sbjct: 279 NVVLSNMVESGFLSEGQVAVARRHPASVIDRAKDESPDYFLDWAFDEVKKVADRFNQHTL 338 Query: 102 IAATSSEKDILKESKKS 118 I T+ +++I K +++S Sbjct: 339 IVRTTLDRNIQKAAEES 355 >UniRef50_Q0I631 Cluster: Glycosyl transferase family protein; n=17; Cyanobacteria|Rep: Glycosyl transferase family protein - Synechococcus sp. (strain CC9311) Length = 357 Score = 32.7 bits (71), Expect = 5.9 Identities = 18/80 (22%), Positives = 39/80 (48%), Gaps = 2/80 (2%) Query: 34 KEKIEKCATLKVDTKNYTHSDIAEAT--MISGMEAITRDHVSAVIHRFRKPVPDYLLAKV 91 K+ + K + + +K T S+ EA M++G + + + HR R+P P L + Sbjct: 14 KQLLRKIGSGEHTSKGLTRSEADEAMELMLTGGASDVQIGAFLIAHRIRRPEPQELTGML 73 Query: 92 ETVRKIQPPMIAATSSEKDI 111 +T +++ P +++ + I Sbjct: 74 DTYKRLGPCLLSEPDQRRPI 93 >UniRef50_A7TJ52 Cluster: Putative uncharacterized protein; n=1; Vanderwaltozyma polyspora DSM 70294|Rep: Putative uncharacterized protein - Vanderwaltozyma polyspora DSM 70294 Length = 953 Score = 32.7 bits (71), Expect = 5.9 Identities = 22/71 (30%), Positives = 35/71 (49%), Gaps = 3/71 (4%) Query: 104 ATSSEKDILKE-SKKSYLNMRNKRGPDDKYLYMES--ENWKYGWKLNESELKLRGPEHGK 160 AT ++LK + +S + N D+ L + S ++W YGW L SE K R +GK Sbjct: 479 ATKKLLEVLKGFNSESSQHKANVSNLYDQQLVLSSSDDHWVYGWLLETSESKKRNSIYGK 538 Query: 161 INHLLHSLVSR 171 ++L + R Sbjct: 539 NSNLKQTTTRR 549 >UniRef50_UPI0000F2E010 Cluster: PREDICTED: similar to chondroitin polymerizing factor,; n=1; Monodelphis domestica|Rep: PREDICTED: similar to chondroitin polymerizing factor, - Monodelphis domestica Length = 1692 Score = 32.3 bits (70), Expect = 7.8 Identities = 17/50 (34%), Positives = 26/50 (52%) Query: 125 KRGPDDKYLYMESENWKYGWKLNESELKLRGPEHGKINHLLHSLVSRVGP 174 + G D + S + + W LN +ELK +GPE G + + LV + GP 
Sbjct: 218 QEGDDATFSLELSTSAQGAWFLNGAELKAKGPESGSRDEVQGYLVQQHGP 267 >UniRef50_A2SNA5 Cluster: Superfamily II DNA/RNA helicases SNF2 family-like protein; n=2; Burkholderiales|Rep: Superfamily II DNA/RNA helicases SNF2 family-like protein - Methylibium petroleiphilum (strain PM1) Length = 585 Score = 32.3 bits (70), Expect = 7.8 Identities = 17/45 (37%), Positives = 23/45 (51%), Gaps = 1/45 (2%) Query: 151 LKLRGPEHGKINHLLHSLVSRVGPQPDPVHYALPDTGYECCGGSI 195 LK +H + L+H L+SRV PDP ALP Y+ S+ Sbjct: 74 LKGLSADHVEAESLVHQLLSRVRA-PDPFELALPPRDYQAAAASL 117 >UniRef50_Q8I293 Cluster: Putative uncharacterized protein PFA0235w; n=2; Plasmodium|Rep: Putative uncharacterized protein PFA0235w - Plasmodium falciparum (isolate 3D7) Length = 1389 Score = 32.3 bits (70), Expect = 7.8 Identities = 21/63 (33%), Positives = 33/63 (52%), Gaps = 5/63 (7%) Query: 102 IAATSSEKDILKESKKSYL--NMRNKRGPDDKYLYMESENWKYGWKLNESELKLRGPEHG 159 I+ T +EK KE KK+Y+ N NK+ D Y + + + +KY N + ++ H Sbjct: 990 ISITINEK---KEKKKNYIYENYENKKQMDVLYDHKQDDIYKYDQLNNTNINNIKNLNHS 1046 Query: 160 KIN 162 KIN Sbjct: 1047 KIN 1049 Database: uniref50 Posted date: Oct 5, 2007 11:19 AM Number of letters in database: 575,637,011 Number of sequences in database: 1,657,284 Lambda K H 0.316 0.133 0.400 Gapped Lambda K H 0.279 0.0580 0.190 Matrix: BLOSUM62 Gap Penalties: Existence: 9, Extension: 2 Number of Hits to DB: 228,098,038 Number of Sequences: 1657284 Number of extensions: 8809318 Number of successful extensions: 26410 Number of sequences better than 10.0: 19 Number of HSP's better than 10.0 without gapping: 4 Number of HSP's successfully gapped in prelim test: 15 Number of HSP's that attempted gapping in prelim test: 26403 Number of HSP's gapped (non-prelim): 21 length of query: 195 length of database: 575,637,011 effective HSP length: 97 effective length of query: 98 effective length of database: 414,880,463 effective search space: 40658285374 effective search space used: 40658285374 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 37 (14.9 
bits) X3: 62 (25.0 bits) S1: 41 (21.6 bits) S2: 70 (32.3 bits)
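
The length-adjustment figures in the statistics above can be checked by hand. The sketch below is not part of the BLAST output; it assumes the usual edge-effect correction, in which the effective HSP length is subtracted once from the query and once per sequence from the database, and the standard relation E = (search space) x 2^(-bit score). The function names are hypothetical and exist only for this illustration.

```python
def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective database length, search space).

    Assumes the common length adjustment: the effective HSP length is
    removed from the query once and from every database sequence once.
    """
    eff_query = max(query_len - hsp_len, 1)
    eff_db = max(db_letters - db_seqs * hsp_len, 1)
    return eff_query, eff_db, eff_query * eff_db


def expect_from_bits(bit_score, search_space):
    """Expected number of chance hits for a given bit score: E = mn * 2^(-S')."""
    return search_space * 2.0 ** (-bit_score)


# Values copied from the report: query of 195 letters, uniref50 with
# 575,637,011 letters in 1,657,284 sequences, effective HSP length 97.
eff_query, eff_db, space = effective_search_space(195, 575_637_011, 1_657_284, 97)
print(eff_query, eff_db, space)  # 98 414880463 40658285374

# Top hit (UniRef50_Q28WW4) scored 54.4 bits; the report lists Expect = 2e-06.
print(f"{expect_from_bits(54.4, space):.0e}")
```

The three printed lengths agree with the "effective length of query", "effective length of database", and "effective search space" lines of the report, and the recomputed E-value for the top hit rounds to the reported 2e-06.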