SilkBase

Last updated: 2022/11/18
BLASTP 2.2.12 [Aug-07-2005]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= BGIBMGA001439-TA|BGIBMGA001439-PA|undefined
         (195 letters)

Database: uniref50 
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

UniRef50_Q28WW4 Cluster: GA12376-PA; n=1; Drosophila pseudoobscu...    54   2e-06
UniRef50_Q9W161 Cluster: CG13581-PA; n=2; Drosophila melanogaste...    53   5e-06
UniRef50_UPI0000F1F256 Cluster: PREDICTED: similar to SJCHGC0536...    47   3e-04
UniRef50_A2RUZ3 Cluster: Nefm protein; n=4; Danio rerio|Rep: Nef...    36   0.83 
UniRef50_A7PQ74 Cluster: Chromosome chr18 scaffold_24, whole gen...    35   1.5  
UniRef50_Q8Q0U6 Cluster: Putative uncharacterized protein; n=1; ...    35   1.5  
UniRef50_Q4SW56 Cluster: Chromosome undetermined SCAF13690, whol...    34   2.5  
UniRef50_A0DMH5 Cluster: Chromosome undetermined scaffold_56, wh...    34   2.5  
UniRef50_UPI00015B4A4D Cluster: PREDICTED: similar to conserved ...    33   3.4  
UniRef50_Q4Z3X9 Cluster: Pb-reticulocyte binding protein; n=2; P...    33   3.4  
UniRef50_Q22M90 Cluster: Putative uncharacterized protein; n=1; ...    33   3.4  
UniRef50_Q7R220 Cluster: GLP_630_73647_79199; n=1; Giardia lambl...    33   4.5  
UniRef50_UPI000056383B Cluster: hypothetical protein GLP_165_633...    33   5.9  
UniRef50_Q8G1W8 Cluster: Penicillin-binding protein, 1A family; ...    33   5.9  
UniRef50_Q0I631 Cluster: Glycosyl transferase family protein; n=...    33   5.9  
UniRef50_A7TJ52 Cluster: Putative uncharacterized protein; n=1; ...    33   5.9  
UniRef50_UPI0000F2E010 Cluster: PREDICTED: similar to chondroiti...    32   7.8  
UniRef50_A2SNA5 Cluster: Superfamily II DNA/RNA helicases SNF2 f...    32   7.8  
UniRef50_Q8I293 Cluster: Putative uncharacterized protein PFA023...    32   7.8  

>UniRef50_Q28WW4 Cluster: GA12376-PA; n=1; Drosophila
           pseudoobscura|Rep: GA12376-PA - Drosophila pseudoobscura
           (Fruit fly)
          Length = 166

 Score = 54.4 bits (125), Expect = 2e-06
 Identities = 30/77 (38%), Positives = 46/77 (59%), Gaps = 4/77 (5%)

Query: 72  VSAVIHR-FRKPVPDYLLAKVETVR--KIQPPMIAATSSEKDILKESKKS-YLNMRNKRG 127
           VS+ I+R  R  + DY L+ VE     K   PM +  ++E ++L+  ++S YL  R +  
Sbjct: 74  VSSRIYRPSRSLIFDYNLSPVEQQHFSKCSDPMKSVPAAELELLRSGQRSTYLERRYEHS 133

Query: 128 PDDKYLYMESENWKYGW 144
           PDDKY Y E+ +W+YGW
Sbjct: 134 PDDKYNYPEATSWRYGW 150


>UniRef50_Q9W161 Cluster: CG13581-PA; n=2; Drosophila
           melanogaster|Rep: CG13581-PA - Drosophila melanogaster
           (Fruit fly)
          Length = 208

 Score = 52.8 bits (121), Expect = 5e-06
 Identities = 23/53 (43%), Positives = 35/53 (66%), Gaps = 1/53 (1%)

Query: 100 PMIAATSSEKDILKESKKS-YLNMRNKRGPDDKYLYMESENWKYGWKLNESEL 151
           PM A  ++E  +L+  +++ YL  R +R PDDKY Y E+ +W+YGW   ES+L
Sbjct: 147 PMKAVPAAELQLLQSGQRTTYLERRYERSPDDKYNYPEATSWRYGWFHRESDL 199


>UniRef50_UPI0000F1F256 Cluster: PREDICTED: similar to SJCHGC05363
           protein; n=1; Danio rerio|Rep: PREDICTED: similar to
           SJCHGC05363 protein - Danio rerio
          Length = 180

 Score = 46.8 bits (106), Expect = 3e-04
 Identities = 22/86 (25%), Positives = 42/86 (48%)

Query: 88  LAKVETVRKIQPPMIAATSSEKDILKESKKSYLNMRNKRGPDDKYLYMESENWKYGWKLN 147
           L++   +R + P    A         + +  YL  R ++GP++K+ Y    +W+YGW+L 
Sbjct: 83  LSEAPLMRPVSPQTSGALYQGISTEGKGRLLYLRKRAQKGPEEKFDYPILSSWEYGWRLG 142

Query: 148 ESELKLRGPEHGKINHLLHSLVSRVG 173
           + E   R P +G+   +  +  +R G
Sbjct: 143 DFETDCRTPANGRSGVVKSAFYARNG 168


>UniRef50_A2RUZ3 Cluster: Nefm protein; n=4; Danio rerio|Rep: Nefm
           protein - Danio rerio (Zebrafish) (Brachydanio rerio)
          Length = 707

 Score = 35.5 bits (78), Expect = 0.83
 Identities = 30/104 (28%), Positives = 51/104 (49%), Gaps = 11/104 (10%)

Query: 19  YEKESR-LRAKWFNLHKEKIEKCATLKVDTKNYTHSDIAEATMISGMEAITRDHVSAVIH 77
           Y++E + LR     LHKEK    A + +DT ++   D+         E+  R+H+ A I 
Sbjct: 123 YDRELQDLRCALEQLHKEK----AQILLDT-DHMDEDLQRIRERYEDESRLREHMDAAIR 177

Query: 78  RFRKPVPDYLLAKVETVRKIQPPMIAATSSEKDILKESKKSYLN 121
             +K   D +L K+E  RK+Q     A   E D L+++ +  ++
Sbjct: 178 GMKKDKDDSVLMKMELERKVQ-----ALVDEMDFLRQNHEEEIS 216


>UniRef50_A7PQ74 Cluster: Chromosome chr18 scaffold_24, whole genome
           shotgun sequence; n=1; Vitis vinifera|Rep: Chromosome
           chr18 scaffold_24, whole genome shotgun sequence - Vitis
           vinifera (Grape)
          Length = 364

 Score = 34.7 bits (76), Expect = 1.5
 Identities = 32/137 (23%), Positives = 55/137 (40%), Gaps = 7/137 (5%)

Query: 32  LHKEKIEKCATLKVDTKNYTHS-DIAEATMISGMEAITRDHVSAVIHRFRKP--VPDYLL 88
           LHKE IE    +K   K  T   D+A   ++S M  +  D    V++       +   LL
Sbjct: 118 LHKELIE---IIKKRKKELTEKRDLAAQDLLSHM-LLVPDENGKVLNEMEISTYILGVLL 173

Query: 89  AKVETVRKIQPPMIAATSSEKDILKESKKSYLNMRNKRGPDDKYLYMESENWKYGWKLNE 148
           A  ET       ++   S   D+     K  + +   +GP++   + + +N K+ W +  
Sbjct: 174 ASHETTSTAITFVLKYLSEFPDVYDAVLKEQMEIAKSKGPEEFLNWNDIQNMKHSWNVAR 233

Query: 149 SELKLRGPEHGKINHLL 165
             ++L  P  G     L
Sbjct: 234 ESMRLSPPGIGGFREAL 250


>UniRef50_Q8Q0U6 Cluster: Putative uncharacterized protein; n=1;
           Methanosarcina mazei|Rep: Putative uncharacterized
           protein - Methanosarcina mazei (Methanosarcina frisia)
          Length = 671

 Score = 34.7 bits (76), Expect = 1.5
 Identities = 26/107 (24%), Positives = 51/107 (47%), Gaps = 5/107 (4%)

Query: 67  ITRDHVSAVIHRFR--KPVPDYLLAKVETVRKIQPPMIAATSSEKDILKESKKSYLNMRN 124
           + R+ +  VIH+ +  +   +++   +E V+   P +  A     D+    +K   N++N
Sbjct: 538 LVRNIIRGVIHKIKTEEEASEWMRKHLEKVKNSSPSLGDAGGKGYDLNNFLRKD--NLKN 595

Query: 125 -KRGPDDKYLYMESENWKYGWKLNESELKLRGPEHGKINHLLHSLVS 170
                DD+YL+++S + K+  K+   +L   G     I   +H LVS
Sbjct: 596 VPEIDDDEYLFLDSPSNKWDLKITLEDLSRSGRSIEDILKEIHGLVS 642


>UniRef50_Q4SW56 Cluster: Chromosome undetermined SCAF13690, whole
           genome shotgun sequence; n=2; Tetraodontidae|Rep:
           Chromosome undetermined SCAF13690, whole genome shotgun
           sequence - Tetraodon nigroviridis (Green puffer)
          Length = 537

 Score = 33.9 bits (74), Expect = 2.5
 Identities = 24/77 (31%), Positives = 38/77 (49%), Gaps = 5/77 (6%)

Query: 22  ESRLRAKWFNLHKEKIEKCATLKVDTKNYTHSDIAEATMISGMEAITRDHVSAVIHRFRK 81
           E+ LRA    +H++K +    +++D++ +   DI         EA  RD   A+I   +K
Sbjct: 120 EAELRAALEQIHRDKTQ----IQLDSE-HLEEDIQRLRERLDEEARIRDETEAIIRVLKK 174

Query: 82  PVPDYLLAKVETVRKIQ 98
              D  LAK E  +KIQ
Sbjct: 175 DTSDSELAKSELEKKIQ 191


>UniRef50_A0DMH5 Cluster: Chromosome undetermined scaffold_56, whole
           genome shotgun sequence; n=2; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_56,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 417

 Score = 33.9 bits (74), Expect = 2.5
 Identities = 35/166 (21%), Positives = 70/166 (42%), Gaps = 4/166 (2%)

Query: 16  IENYEKESRLRAKWFNLHKEKIEKCATLKVDTKNYTHSDIAEATMISGMEAITRDHVSAV 75
           ++N E+  R   K ++  KE +E+  T+K D+ N  +S + +       E +    V+  
Sbjct: 87  VQNLEENLREIVKKYDQSKEDLERERTIKYDS-NRNYSQLYQRYQDQEREVLKYQQVAKS 145

Query: 76  IHRFRKPVPDYLLAKVETVR-KIQPPMIAATSSEK--DILKESKKSYLNMRNKRGPDDKY 132
           I   +K V   L  + E    K           EK   ILK+ ++   + + K   + ++
Sbjct: 146 IETMQKQVQRELQEQKEKWNAKNNEIQEQKKVQEKLQSILKQKEREINDFKLKLKEEREF 205

Query: 133 LYMESENWKYGWKLNESELKLRGPEHGKINHLLHSLVSRVGPQPDP 178
              E++     +     +L L+  +  + N+ L +L+  +  QP P
Sbjct: 206 RSYENQRLVQEFTQTYQDLTLQNDQLIQENNELRTLILEIDNQPQP 251


>UniRef50_UPI00015B4A4D Cluster: PREDICTED: similar to conserved
           hypothetical protein; n=1; Nasonia vitripennis|Rep:
           PREDICTED: similar to conserved hypothetical protein -
           Nasonia vitripennis
          Length = 1832

 Score = 33.5 bits (73), Expect = 3.4
 Identities = 17/69 (24%), Positives = 39/69 (56%), Gaps = 5/69 (7%)

Query: 9   PKVINFLIENYEKESRLRAKWFNLHKEK-----IEKCATLKVDTKNYTHSDIAEATMISG 63
           P+VI+ +++  E+E+ +  K  +  +E+     +++  T K D +N   ++ A A  I+ 
Sbjct: 483 PEVISKVVQEKEEEAEIFYKSSDSEEEQDEPMEVDEAGTAKKDNENTNENETATAVGIAD 542

Query: 64  MEAITRDHV 72
            + +T+DH+
Sbjct: 543 TQKLTQDHI 551


>UniRef50_Q4Z3X9 Cluster: Pb-reticulocyte binding protein; n=2;
           Plasmodium (Vinckeia)|Rep: Pb-reticulocyte binding
           protein - Plasmodium berghei
          Length = 1913

 Score = 33.5 bits (73), Expect = 3.4
 Identities = 25/105 (23%), Positives = 54/105 (51%), Gaps = 3/105 (2%)

Query: 71  HVSAVIHRFRKPVPDYLLAKVETVRKIQP-PMIAATSSEKDILKESKKSYLNMRNKRGPD 129
           H+  +     K + DY+      +  I P  ++ + + +KD +  SKK  +N+ NK+  +
Sbjct: 394 HIPYLNQSTMKDIWDYVRLFYNVICYIDPIDLVKSLTYQKDKIIXSKKKSVNLGNKKMAE 453

Query: 130 DKYLYMESENWKYGWKLNESELKLRGPEHGKINHLLHSLVSRVGP 174
           +    ++++N K G K+++ E K       K N+L+ + ++R+ P
Sbjct: 454 NTSNTIDNQN-KNGIKISKGE-KRNSFLKTKKNNLMLTHLARINP 496


>UniRef50_Q22M90 Cluster: Putative uncharacterized protein; n=1;
           Tetrahymena thermophila SB210|Rep: Putative
           uncharacterized protein - Tetrahymena thermophila SB210
          Length = 1698

 Score = 33.5 bits (73), Expect = 3.4
 Identities = 22/63 (34%), Positives = 28/63 (44%), Gaps = 2/63 (3%)

Query: 103 AATSSEKDILKESKKSYLNMRNKRGPDDKYLY--MESENWKYGWKLNESELKLRGPEHGK 160
           A    E    + SK+S LN   KRG   +     ME  N K    LN S+  L+G    +
Sbjct: 362 AENEEETSSCRNSKQSSLNSSKKRGKSQQNSRRSMEISNQKRSSSLNSSKQSLKGYPQQQ 421

Query: 161 INH 163
           INH
Sbjct: 422 INH 424


>UniRef50_Q7R220 Cluster: GLP_630_73647_79199; n=1; Giardia lamblia
            ATCC 50803|Rep: GLP_630_73647_79199 - Giardia lamblia
            ATCC 50803
          Length = 1850

 Score = 33.1 bits (72), Expect = 4.5
 Identities = 27/102 (26%), Positives = 49/102 (48%), Gaps = 10/102 (9%)

Query: 20   EKESRLRAKWFNLH--KEKIEKCATLKVDTKNYTHSDIAEATMISGMEAITRDHVSAVIH 77
            E  S +R+K   +   K+K++K  T  +D+K+++   I EAT     E    +H   + H
Sbjct: 1165 ELSSTIRSKEDEISELKQKVKKYKTAYIDSKSFSSDAIKEAT---AQELAKYEHGLEIAH 1221

Query: 78   RFRKPVPDYLLAKVETVRKIQPPMIAATSSEKDILKESKKSY 119
               K V +  +A  E    ++  ++ + S+E + LK   K Y
Sbjct: 1222 ---KEVLELRMANAELKAALE--IVQSRSTEAEDLKHKSKKY 1258


>UniRef50_UPI000056383B Cluster: hypothetical protein
           GLP_165_63389_64429; n=1; Giardia lamblia ATCC
           50803|Rep: hypothetical protein GLP_165_63389_64429 -
           Giardia lamblia ATCC 50803
          Length = 346

 Score = 32.7 bits (71), Expect = 5.9
 Identities = 19/73 (26%), Positives = 36/73 (49%), Gaps = 4/73 (5%)

Query: 14  FLIENYEKESRLRAKWFNLHKEKIEKCATLKVDTKNYTHSDIAEATMISGMEAITRDHVS 73
           + I+ + K++ ++ +     +E     AT+K DTK+ T+    E    SG E    +  S
Sbjct: 245 YSIDRFVKDNAVKEEDIAYAREVFNVSATVKADTKDETN----ETRSTSGEEVKEEEPQS 300

Query: 74  AVIHRFRKPVPDY 86
             + R ++P+P Y
Sbjct: 301 TKVQRKKRPIPVY 313


>UniRef50_Q8G1W8 Cluster: Penicillin-binding protein, 1A family;
           n=11; Rhizobiales|Rep: Penicillin-binding protein, 1A
           family - Brucella suis
          Length = 718

 Score = 32.7 bits (71), Expect = 5.9
 Identities = 23/77 (29%), Positives = 42/77 (54%), Gaps = 7/77 (9%)

Query: 49  NYTHSDIAEATMIS-GMEAITRDHVSAVIHRFRKPVPDYLLA-KVETVRKI-----QPPM 101
           N   S++ E+  +S G  A+ R H ++VI R +   PDY L    + V+K+     Q  +
Sbjct: 279 NVVLSNMVESGFLSEGQVAVARRHPASVIDRAKDESPDYFLDWAFDEVKKVADRFNQHTL 338

Query: 102 IAATSSEKDILKESKKS 118
           I  T+ +++I K +++S
Sbjct: 339 IVRTTLDRNIQKAAEES 355


>UniRef50_Q0I631 Cluster: Glycosyl transferase family protein; n=17;
           Cyanobacteria|Rep: Glycosyl transferase family protein -
           Synechococcus sp. (strain CC9311)
          Length = 357

 Score = 32.7 bits (71), Expect = 5.9
 Identities = 18/80 (22%), Positives = 39/80 (48%), Gaps = 2/80 (2%)

Query: 34  KEKIEKCATLKVDTKNYTHSDIAEAT--MISGMEAITRDHVSAVIHRFRKPVPDYLLAKV 91
           K+ + K  + +  +K  T S+  EA   M++G  +  +     + HR R+P P  L   +
Sbjct: 14  KQLLRKIGSGEHTSKGLTRSEADEAMELMLTGGASDVQIGAFLIAHRIRRPEPQELTGML 73

Query: 92  ETVRKIQPPMIAATSSEKDI 111
           +T +++ P +++     + I
Sbjct: 74  DTYKRLGPCLLSEPDQRRPI 93


>UniRef50_A7TJ52 Cluster: Putative uncharacterized protein; n=1;
           Vanderwaltozyma polyspora DSM 70294|Rep: Putative
           uncharacterized protein - Vanderwaltozyma polyspora DSM
           70294
          Length = 953

 Score = 32.7 bits (71), Expect = 5.9
 Identities = 22/71 (30%), Positives = 35/71 (49%), Gaps = 3/71 (4%)

Query: 104 ATSSEKDILKE-SKKSYLNMRNKRGPDDKYLYMES--ENWKYGWKLNESELKLRGPEHGK 160
           AT    ++LK  + +S  +  N     D+ L + S  ++W YGW L  SE K R   +GK
Sbjct: 479 ATKKLLEVLKGFNSESSQHKANVSNLYDQQLVLSSSDDHWVYGWLLETSESKKRNSIYGK 538

Query: 161 INHLLHSLVSR 171
            ++L  +   R
Sbjct: 539 NSNLKQTTTRR 549


>UniRef50_UPI0000F2E010 Cluster: PREDICTED: similar to chondroitin
           polymerizing factor,; n=1; Monodelphis domestica|Rep:
           PREDICTED: similar to chondroitin polymerizing factor, -
           Monodelphis domestica
          Length = 1692

 Score = 32.3 bits (70), Expect = 7.8
 Identities = 17/50 (34%), Positives = 26/50 (52%)

Query: 125 KRGPDDKYLYMESENWKYGWKLNESELKLRGPEHGKINHLLHSLVSRVGP 174
           + G D  +    S + +  W LN +ELK +GPE G  + +   LV + GP
Sbjct: 218 QEGDDATFSLELSTSAQGAWFLNGAELKAKGPESGSRDEVQGYLVQQHGP 267


>UniRef50_A2SNA5 Cluster: Superfamily II DNA/RNA helicases SNF2
           family-like protein; n=2; Burkholderiales|Rep:
           Superfamily II DNA/RNA helicases SNF2 family-like
           protein - Methylibium petroleiphilum (strain PM1)
          Length = 585

 Score = 32.3 bits (70), Expect = 7.8
 Identities = 17/45 (37%), Positives = 23/45 (51%), Gaps = 1/45 (2%)

Query: 151 LKLRGPEHGKINHLLHSLVSRVGPQPDPVHYALPDTGYECCGGSI 195
           LK    +H +   L+H L+SRV   PDP   ALP   Y+    S+
Sbjct: 74  LKGLSADHVEAESLVHQLLSRVRA-PDPFELALPPRDYQAAAASL 117


>UniRef50_Q8I293 Cluster: Putative uncharacterized protein PFA0235w;
            n=2; Plasmodium|Rep: Putative uncharacterized protein
            PFA0235w - Plasmodium falciparum (isolate 3D7)
          Length = 1389

 Score = 32.3 bits (70), Expect = 7.8
 Identities = 21/63 (33%), Positives = 33/63 (52%), Gaps = 5/63 (7%)

Query: 102  IAATSSEKDILKESKKSYL--NMRNKRGPDDKYLYMESENWKYGWKLNESELKLRGPEHG 159
            I+ T +EK   KE KK+Y+  N  NK+  D  Y + + + +KY    N +   ++   H 
Sbjct: 990  ISITINEK---KEKKKNYIYENYENKKQMDVLYDHKQDDIYKYDQLNNTNINNIKNLNHS 1046

Query: 160  KIN 162
            KIN
Sbjct: 1047 KIN 1049


  Database: uniref50
    Posted date:  Oct 5, 2007 11:19 AM
  Number of letters in database: 575,637,011
  Number of sequences in database:  1,657,284
  
Lambda     K      H
   0.316    0.133    0.400 

Gapped
Lambda     K      H
   0.279   0.0580    0.190 


Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 228,098,038
Number of Sequences: 1657284
Number of extensions: 8809318
Number of successful extensions: 26410
Number of sequences better than 10.0: 19
Number of HSP's better than 10.0 without gapping: 4
Number of HSP's successfully gapped in prelim test: 15
Number of HSP's that attempted gapping in prelim test: 26403
Number of HSP's gapped (non-prelim): 21
length of query: 195
length of database: 575,637,011
effective HSP length: 97
effective length of query: 98
effective length of database: 414,880,463
effective search space: 40658285374
effective search space used: 40658285374
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.6 bits)
S2: 70 (32.3 bits)
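
The statistics above can be cross-checked against the alignment scores with a short calculation. The sketch below is a minimal illustration, assuming the standard Karlin-Altschul relations (bit score = (lambda * raw - ln K) / ln 2, and E = search space * 2^(-bit score)) and using the gapped Lambda and K printed in this report. The function names are hypothetical and not part of BLAST; the program may apply further corrections, so the E-value is only expected to agree approximately.

```python
import math

# Values copied from the statistics block of this report.
QUERY_LEN = 195
DB_LETTERS = 575_637_011
DB_SEQS = 1_657_284
EFF_HSP_LEN = 97
GAPPED_LAMBDA = 0.279
GAPPED_K = 0.0580


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Effective query length times effective database length."""
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, lam, k, search_space):
    """Approximate E-value for a raw score in the given search space."""
    return search_space * 2.0 ** (-bit_score(raw_score, lam, k))


space = effective_search_space(QUERY_LEN, DB_LETTERS, DB_SEQS, EFF_HSP_LEN)
print(space)  # 40658285374, matching "effective search space"

# Top hit (UniRef50_Q28WW4): raw score 125, reported as 54.4 bits.
print(round(bit_score(125, GAPPED_LAMBDA, GAPPED_K), 1))  # 54.4

# Approximate E-value for the top hit; the report lists 2e-06.
print(f"{expect_value(125, GAPPED_LAMBDA, GAPPED_K, space):.0e}")
```

The same conversion applied to the cutoff S2 (raw score 70) gives about 32.3 bits, in line with the value listed above.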

- SilkBase 1999-2023 -