SilkBase

Last updated: 2022/09/19
 
BLASTX 2.2.12 [Aug-07-2005]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= NV120039.Seq
         (672 letters)

Database: uniref50 
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

UniRef50_UPI0000DB74AD Cluster: PREDICTED: similar to Nopp140 CG...    45   0.002
UniRef50_Q9VNX6 Cluster: CG7421-PA, isoform A; n=3; Drosophila m...    41   0.024
UniRef50_Q54IE8 Cluster: Putative uncharacterized protein irlE; ...    37   0.39 
UniRef50_Q4Q3B3 Cluster: Putative uncharacterized protein; n=3; ...    35   1.6  
UniRef50_A0NJ37 Cluster: Putative uncharacterized protein; n=1; ...    35   2.1  
UniRef50_Q24IJ8 Cluster: Putative uncharacterized protein; n=1; ...    34   2.7  
UniRef50_O01465 Cluster: Groundhog (Hedgehog-like family) protei...    34   3.6  
UniRef50_Q58242 Cluster: Uncharacterized protein MJ0832 [Contain...    34   3.6  
UniRef50_Q9XJE6 Cluster: Replication initiation protein; n=13; r...    33   4.8  
UniRef50_Q22UL4 Cluster: Zinc carboxypeptidase family protein; n...    33   4.8  
UniRef50_A2DXH5 Cluster: Putative uncharacterized protein; n=1; ...    33   4.8  
UniRef50_A0E8L0 Cluster: Chromosome undetermined scaffold_83, wh...    33   4.8  
UniRef50_Q6GNV8 Cluster: MGC80840 protein; n=2; Xenopus|Rep: MGC...    33   6.3  
UniRef50_A0BRG1 Cluster: Chromosome undetermined scaffold_122, w...    33   6.3  
UniRef50_Q6L2Z0 Cluster: Type I restriction-modification system ...    33   6.3  
UniRef50_Q24FP6 Cluster: EF hand family protein; n=1; Tetrahymen...    33   8.3  
UniRef50_A2DVP9 Cluster: Putative uncharacterized protein; n=1; ...    33   8.3  
UniRef50_A0BZE2 Cluster: Chromosome undetermined scaffold_139, w...    33   8.3  

>UniRef50_UPI0000DB74AD Cluster: PREDICTED: similar to Nopp140
           CG7421-PB, isoform B; n=1; Apis mellifera|Rep:
           PREDICTED: similar to Nopp140 CG7421-PB, isoform B -
           Apis mellifera
          Length = 685

 Score = 44.8 bits (101), Expect = 0.002
 Identities = 22/56 (39%), Positives = 33/56 (58%)
 Frame = +3

Query: 93  QADVNSLVHQYLEKIDKSLAQTFMKKTKAKPRAKNQQTLLDIIAKFNQGNKAKAKV 260
           +  V++LV+ YL K D SLA+ F KKTKA    K   T+LD+   + + ++ K  V
Sbjct: 6   ELSVSALVYDYLLKKDASLAKVFQKKTKAPTLPKGAPTILDVYQHYQKTSQKKLNV 61


>UniRef50_Q9VNX6 Cluster: CG7421-PA, isoform A; n=3; Drosophila
           melanogaster|Rep: CG7421-PA, isoform A - Drosophila
           melanogaster (Fruit fly)
          Length = 720

 Score = 41.1 bits (92), Expect = 0.024
 Identities = 23/64 (35%), Positives = 40/64 (62%)
 Frame = +3

Query: 84  TEIQADVNSLVHQYLEKIDKSLAQTFMKKTKAKPRAKNQQTLLDIIAKFNQGNKAKAKVP 263
           T++    +++V +YL+  DK+LA+ F +KTKA   AK+   L +I+ +F Q  K+  K+P
Sbjct: 2   TDLLKIADAIVLEYLQSKDKNLAKVFQQKTKAASVAKSSPKLSEIL-QFYQ-TKSPKKIP 59

Query: 264 VTRA 275
             +A
Sbjct: 60  AIKA 63


>UniRef50_Q54IE8 Cluster: Putative uncharacterized protein irlE; n=1;
            Dictyostelium discoideum AX4|Rep: Putative
            uncharacterized protein irlE - Dictyostelium discoideum
            AX4
          Length = 1350

 Score = 37.1 bits (82), Expect = 0.39
 Identities = 35/126 (27%), Positives = 61/126 (48%), Gaps = 1/126 (0%)
 Frame = +3

Query: 51   ILDKNFKMN-LTTEIQADVNSLVHQYLEKIDKSLAQTFMKKTKAKPRAKNQQTLLDIIAK 227
            +LD     N L + I  D   L  + L   +K L++  +K+ K +   KN++ LLD + K
Sbjct: 696  LLDGTINSNILNSMIIKDKEKLDLKELFFKEKPLSEAELKE-KFEIATKNEKELLDQLKK 754

Query: 228  FNQGNKAKAKVPVTRAKKMPLRNQQFKQMGKFQQLRKKLKVQILVAPMMSHQNRHLSPTK 407
             N   K K K    + KK   +NQQ +Q  K Q  ++K + Q  +    +++N+H+   +
Sbjct: 755  ENDAEKLKKK---NKLKKQ--KNQQQQQQAKQQAQQQKQQHQQNI--QQNYENQHIEDQR 807

Query: 408  QLKNLT 425
            +    T
Sbjct: 808  KFNQQT 813


>UniRef50_Q4Q3B3 Cluster: Putative uncharacterized protein; n=3;
           Leishmania|Rep: Putative uncharacterized protein -
           Leishmania major
          Length = 692

 Score = 35.1 bits (77), Expect = 1.6
 Identities = 26/85 (30%), Positives = 40/85 (47%)
 Frame = +3

Query: 189 AKNQQTLLDIIAKFNQGNKAKAKVPVTRAKKMPLRNQQFKQMGKFQQLRKKLKVQILVAP 368
           A+ Q+  L I  + ++  KA  KV V  A+  PL+ +        Q LR +LK   L  P
Sbjct: 115 AQQQEPSLQIDQRASEAFKASCKVAV-EARVAPLQQE-------LQGLRHRLKALTLSRP 166

Query: 369 MMSHQNRHLSPTKQLKNLTLHQLRN 443
             SH NRH+  +    N +   +R+
Sbjct: 167 YSSHYNRHVRSSSVASNSSALTMRD 191


>UniRef50_A0NJ37 Cluster: Putative uncharacterized protein; n=1;
           Oenococcus oeni ATCC BAA-1163|Rep: Putative
           uncharacterized protein - Oenococcus oeni ATCC BAA-1163
          Length = 307

 Score = 34.7 bits (76), Expect = 2.1
 Identities = 42/163 (25%), Positives = 71/163 (43%), Gaps = 9/163 (5%)
 Frame = +3

Query: 9   VVYTVEAGNKEIH*ILDKNFKMNLTTEIQADVNSLVH-----QYLEKIDKSLA--QTFMK 167
           + Y++  G K    I   +F +N+ +  +    ++ +     + + K  K LA  Q F K
Sbjct: 57  ISYSINGGKKHSVRIRSNSFAINIPSSNKEQKVTIYNGNVSAKIVVKASKQLADYQKFAK 116

Query: 168 KTKAKPRAKNQQTLLDIIAKFNQGNKAKAKVPVTRAKKMPL-RNQQFKQMGKFQQLRKKL 344
           K      A +      II K N+  KA+     T A+   + R +Q +   K ++L++  
Sbjct: 117 KYNQSLIASSLPK--SIIKKANELKKAQVAKQTTAAEIARMSRTEQLQLEEKNKELQQDA 174

Query: 345 K-VQILVAPMMSHQNRHLSPTKQLKNLTLHQLRNKNHQMTATV 470
           K VQ   A   S     L P K+LKN   + + NKN+ +   V
Sbjct: 175 KEVQAATAKSKSENKDKLLP-KKLKNAIKNAVSNKNYIIRVNV 216


>UniRef50_Q24IJ8 Cluster: Putative uncharacterized protein; n=1;
           Tetrahymena thermophila SB210|Rep: Putative
           uncharacterized protein - Tetrahymena thermophila SB210
          Length = 1501

 Score = 34.3 bits (75), Expect = 2.7
 Identities = 21/87 (24%), Positives = 41/87 (47%)
 Frame = +3

Query: 129 EKIDKSLAQTFMKKTKAKPRAKNQQTLLDIIAKFNQGNKAKAKVPVTRAKKMPLRNQQFK 308
           +K++    Q   ++  ++ +  +++ + +    FN  N  K +   T   K     Q +K
Sbjct: 650 QKLNTEGNQRQPQRNSSQKQLSSEKNVHNNAQNFNNKNMQKQQAFSTVTDKKANEQQTYK 709

Query: 309 QMGKFQQLRKKLKVQILVAPMMSHQNR 389
           QMG  QQ +K +  Q+L     S+QN+
Sbjct: 710 QMGYQQQQQKDINSQLLQKKGSSYQNQ 736


>UniRef50_O01465 Cluster: Groundhog (Hedgehog-like family) protein
           9; n=2; Caenorhabditis|Rep: Groundhog (Hedgehog-like
           family) protein 9 - Caenorhabditis elegans
          Length = 588

 Score = 33.9 bits (74), Expect = 3.6
 Identities = 16/40 (40%), Positives = 24/40 (60%)
 Frame = +3

Query: 165 KKTKAKPRAKNQQTLLDIIAKFNQGNKAKAKVPVTRAKKM 284
           + T  KP A +Q T+LD + +  Q NK   ++PV R KK+
Sbjct: 370 ESTTIKPNAVSQPTILDFVERSKQQNKL-MRIPVYRGKKL 408


>UniRef50_Q58242 Cluster: Uncharacterized protein MJ0832 [Contains:
           Mja rnr-1 intein; Mja rnr-2 intein]; n=2;
           Methanococcales|Rep: Uncharacterized protein MJ0832
           [Contains: Mja rnr-1 intein; Mja rnr-2 intein] -
           Methanococcus jannaschii
          Length = 1750

 Score = 33.9 bits (74), Expect = 3.6
 Identities = 26/92 (28%), Positives = 45/92 (48%)
 Frame = +3

Query: 99  DVNSLVHQYLEKIDKSLAQTFMKKTKAKPRAKNQQTLLDIIAKFNQGNKAKAKVPVTRAK 278
           ++  +V+  L+KIDK +A+ +      K R   ++        F++   AKA +  T A 
Sbjct: 64  ELKDIVYNVLKKIDKDVAENYRNGIILKVRTSEKE-----FESFDKEKIAKALIRETGAD 118

Query: 279 KMPLRNQQFKQMGKFQQLRKKLKVQILVAPMM 374
           +   R    K   + ++  KKLKV+ L APM+
Sbjct: 119 EETAR----KIADEVERELKKLKVKYLTAPMI 146


>UniRef50_Q9XJE6 Cluster: Replication initiation protein; n=13;
           root|Rep: Replication initiation protein - Lactococcus
           lactis bacteriophage Tuc2009
          Length = 260

 Score = 33.5 bits (73), Expect = 4.8
 Identities = 19/88 (21%), Positives = 43/88 (48%), Gaps = 3/88 (3%)
 Frame = +3

Query: 99  DVNSLVHQYLEKIDKSLAQTFMKKTKAK---PRAKNQQTLLDIIAKFNQGNKAKAKVPVT 269
           D+NSL+ +YL+   +  ++   K+  A+    +  +++    +I   N     K + P  
Sbjct: 165 DINSLLSEYLDSFIEFSSKNIAKRAMAQVEFMKLSSEEKKQAVIGAKNYFEWYKQENPED 224

Query: 270 RAKKMPLRNQQFKQMGKFQQLRKKLKVQ 353
           + KK  + +  F +   F+  ++K+KV+
Sbjct: 225 KTKKFSINSYAFLESATFKSFQQKVKVK 252


>UniRef50_Q22UL4 Cluster: Zinc carboxypeptidase family protein; n=1;
            Tetrahymena thermophila SB210|Rep: Zinc carboxypeptidase
            family protein - Tetrahymena thermophila SB210
          Length = 1600

 Score = 33.5 bits (73), Expect = 4.8
 Identities = 21/65 (32%), Positives = 30/65 (46%)
 Frame = +3

Query: 129  EKIDKSLAQTFMKKTKAKPRAKNQQTLLDIIAKFNQGNKAKAKVPVTRAKKMPLRNQQFK 308
            EK+D+SL+    K    +    N Q  +D I +  Q NK K K    + +    R QQ +
Sbjct: 982  EKLDRSLSSRQQKSLLIESCQSNNQNQIDFIEEIEQDNKFKEKKYHAKVRNQE-REQQRE 1040

Query: 309  QMGKF 323
            Q  KF
Sbjct: 1041 QCIKF 1045


>UniRef50_A2DXH5 Cluster: Putative uncharacterized protein; n=1;
           Trichomonas vaginalis G3|Rep: Putative uncharacterized
           protein - Trichomonas vaginalis G3
          Length = 403

 Score = 33.5 bits (73), Expect = 4.8
 Identities = 25/115 (21%), Positives = 56/115 (48%), Gaps = 4/115 (3%)
 Frame = +3

Query: 123 YLEKIDKSLAQTFMK----KTKAKPRAKNQQTLLDIIAKFNQGNKAKAKVPVTRAKKMPL 290
           YL ++ + LA T+ +    + K +   K      +II K N+  K + +   ++ +++ +
Sbjct: 78  YLRELIRKLATTYQECVDLQKKYQEAKKGAGNSAEIITKQNEDLKIEIESLTSKLQELEV 137

Query: 291 RNQQFKQMGKFQQLRKKLKVQILVAPMMSHQNRHLSPTKQLKNLTLHQLRNKNHQ 455
           + Q  +Q  K +   K+ +++ L   + + QN H   + Q+++L    L+N   Q
Sbjct: 138 QKQSKEQSLKSKVSSKQNQLKALNDSITNLQNAHRELSNQVEDLKSTVLKNTGRQ 192


>UniRef50_A0E8L0 Cluster: Chromosome undetermined scaffold_83, whole
            genome shotgun sequence; n=2; Paramecium tetraurelia|Rep:
            Chromosome undetermined scaffold_83, whole genome shotgun
            sequence - Paramecium tetraurelia
          Length = 1221

 Score = 33.5 bits (73), Expect = 4.8
 Identities = 26/100 (26%), Positives = 49/100 (49%), Gaps = 2/100 (2%)
 Frame = +3

Query: 54   LDKNFKMNLTTEIQAD--VNSLVHQYLEKIDKSLAQTFMKKTKAKPRAKNQQTLLDIIAK 227
            L+K++   L  E Q    +    ++  EK  +++ Q   K+T  +  A  +Q    I+ K
Sbjct: 832  LEKDYDARLLEERQQRERMEKEYNKEKEKYRETIEQ-IRKETIGEIEALEEQNQQQILQK 890

Query: 228  FNQGNKAKAKVPVTRAKKMPLRNQQFKQMGKFQQLRKKLK 347
             +QG KAK++V +T+ K   L  ++ KQ    +   ++ K
Sbjct: 891  TDQGLKAKSEVSMTKKKIQSLLQEEEKQNENLKDYNEQKK 930


>UniRef50_Q6GNV8 Cluster: MGC80840 protein; n=2; Xenopus|Rep:
           MGC80840 protein - Xenopus laevis (African clawed frog)
          Length = 664

 Score = 33.1 bits (72), Expect = 6.3
 Identities = 16/53 (30%), Positives = 34/53 (64%)
 Frame = +3

Query: 87  EIQADVNSLVHQYLEKIDKSLAQTFMKKTKAKPRAKNQQTLLDIIAKFNQGNK 245
           +++ D  ++  +Y ++  K++ ++  KK +AKPR++N  TL+D+    N+ NK
Sbjct: 383 QVKGDAKNI--RYRDEATKNVIES--KKPEAKPRSQNLDTLIDVRNTSNKSNK 431


>UniRef50_A0BRG1 Cluster: Chromosome undetermined scaffold_122,
           whole genome shotgun sequence; n=2; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_122,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 1949

 Score = 33.1 bits (72), Expect = 6.3
 Identities = 25/116 (21%), Positives = 54/116 (46%), Gaps = 1/116 (0%)
 Frame = +3

Query: 117 HQYLEKIDKSLAQTFMKKTKAKPRAKNQQTLLDIIAKFNQGNKAKAKVPVTRAKKMPLRN 296
           +QYL+K    L +   KK + KP+ +     L  I K  + NK   +  +   +++    
Sbjct: 452 NQYLQKKISILEEELSKKKQPKPQKQQPDRDLQ-IQKLTEANKRYLEENLKLFEEIRQLR 510

Query: 297 QQFKQMGKFQQLRKKLKVQILVAPMMSHQ-NRHLSPTKQLKNLTLHQLRNKNHQMT 461
           ++F      Q   K  ++Q     +++HQ  + +   +Q+ NL + +++ K  ++T
Sbjct: 511 EKFDYSSVLQNSMKDSQIQ---ENVVNHQLEQQIERMQQIHNLEIQKMKKKIEKLT 563


>UniRef50_Q6L2Z0 Cluster: Type I restriction-modification system
            restriction subunit; n=1; Picrophilus torridus|Rep: Type
            I restriction-modification system restriction subunit -
            Picrophilus torridus
          Length = 996

 Score = 33.1 bits (72), Expect = 6.3
 Identities = 19/81 (23%), Positives = 33/81 (40%)
 Frame = +3

Query: 105  NSLVHQYLEKIDKSLAQTFMKKTKAKPRAKNQQTLLDIIAKFNQGNKAKAKVPVTRAKKM 284
            N L     EKI+K +     KK + K   K  Q L++ + + N G +    +P     K+
Sbjct: 865  NPLYETVSEKIEKIINDWNNKKIEIKNEYKELQDLINKVNEINDGERNSGLMPPAYIAKV 924

Query: 285  PLRNQQFKQMGKFQQLRKKLK 347
             + N      G   +   ++K
Sbjct: 925  IMDNMNINSTGMINKFNDEIK 945


>UniRef50_Q24FP6 Cluster: EF hand family protein; n=1; Tetrahymena
            thermophila SB210|Rep: EF hand family protein -
            Tetrahymena thermophila SB210
          Length = 1793

 Score = 32.7 bits (71), Expect = 8.3
 Identities = 24/97 (24%), Positives = 49/97 (50%), Gaps = 1/97 (1%)
 Frame = +3

Query: 54   LDKNFKMNLTTEIQADVNSLVHQYL-EKIDKSLAQTFMKKTKAKPRAKNQQTLLDIIAKF 230
            L +  +  L  +I++++ S + Q L E+I+  L Q + ++ + K +  +QQ+        
Sbjct: 847  LRQQIEQELKEQIESNLRSQIKQQLREQIENELQQKYSQQQQQKEKQNSQQS-------- 898

Query: 231  NQGNKAKAKVPVTRAKKMPLRNQQFKQMGKFQQLRKK 341
             Q  +  +K    + +K P +NQQ KQ+ +   L K+
Sbjct: 899  -QRLQRDSKQQSNQKQKQPNQNQQLKQLIEQLNLEKQ 934


>UniRef50_A2DVP9 Cluster: Putative uncharacterized protein; n=1;
           Trichomonas vaginalis G3|Rep: Putative uncharacterized
           protein - Trichomonas vaginalis G3
          Length = 889

 Score = 32.7 bits (71), Expect = 8.3
 Identities = 23/101 (22%), Positives = 43/101 (42%)
 Frame = +3

Query: 156 TFMKKTKAKPRAKNQQTLLDIIAKFNQGNKAKAKVPVTRAKKMPLRNQQFKQMGKFQQLR 335
           T  +  KA P+  + Q  + I   F   N A     +  + +    NQQ + +   Q + 
Sbjct: 293 TIQQPFKATPQQNSNQNAVIISPAFTMQNAAIQAQALNGSAQTFNNNQQNQVVSNLQNIP 352

Query: 336 KKLKVQILVAPMMSHQNRHLSPTKQLKNLTLHQLRNKNHQM 458
           K++  Q    P  + Q    +P+KQ   ++  Q  NK++ +
Sbjct: 353 KQISQQQAPTPQPTPQKAPGAPSKQQAQVSTPQNINKSNPL 393


>UniRef50_A0BZE2 Cluster: Chromosome undetermined scaffold_139,
           whole genome shotgun sequence; n=2; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_139,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 250

 Score = 32.7 bits (71), Expect = 8.3
 Identities = 26/116 (22%), Positives = 53/116 (45%), Gaps = 1/116 (0%)
 Frame = +3

Query: 57  DKNFKMN-LTTEIQADVNSLVHQYLEKIDKSLAQTFMKKTKAKPRAKNQQTLLDIIAKFN 233
           D  +  N +++ IQ  +N   +    K     +Q   K +K KP    Q+ LLD   +  
Sbjct: 112 DSQYNSNKVSSYIQEKINEKSNY--TKTTTGESQDDEKSSKQKPSIIYQKQLLDKDLQIM 169

Query: 234 QGNKAKAKVPVTRAKKMPLRNQQFKQMGKFQQLRKKLKVQILVAPMMSHQNRHLSP 401
           Q  K   ++  +  K   +  +  K + + Q+L+ ++++Q +    ++HQ   +SP
Sbjct: 170 QLQKELKQIKESNKKYQNIEKEYDKLLQENQKLKLQIQLQQVQITQLTHQQSLISP 225


  Database: uniref50
    Posted date:  Oct 5, 2007 11:19 AM
  Number of letters in database: 575,637,011
  Number of sequences in database:  1,657,284
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.279   0.0580    0.190 


Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 509,444,856
Number of Sequences: 1657284
Number of extensions: 7925945
Number of successful extensions: 23794
Number of sequences better than 10.0: 18
Number of HSP's better than 10.0 without gapping: 22917
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 23740
length of database: 575,637,011
effective HSP length: 98
effective length of database: 413,223,179
effective search space used: 51652897375
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)
