SilkBase

Last updated: 2022/11/18
BLASTP 2.2.12 [Aug-07-2005]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= BGIBMGA000917-TA|BGIBMGA000917-PA|undefined
         (147 letters)

Database: uniref50 
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

UniRef50_Q5KP60 Cluster: Putative uncharacterized protein; n=2; ...    35   0.62 
UniRef50_Q54IS5 Cluster: Putative uncharacterized protein; n=1; ...    35   0.82 
UniRef50_Q91LG7 Cluster: ORF73; n=1; Shrimp white spot syndrome ...    34   1.1  
UniRef50_A0AFQ6 Cluster: Complete genome; n=1; Listeria welshime...    34   1.1  
UniRef50_Q4PFN2 Cluster: Putative uncharacterized protein; n=1; ...    34   1.1  
UniRef50_UPI00015B5A4B Cluster: PREDICTED: similar to CG12398-PA...    34   1.4  
UniRef50_Q91749 Cluster: Oviduct specific protein-1A; n=2; Xenop...    34   1.4  
UniRef50_UPI00006CFB9F Cluster: hypothetical protein TTHERM_0048...    33   1.9  
UniRef50_A0BFE4 Cluster: Chromosome undetermined scaffold_104, w...    33   1.9  
UniRef50_Q41FU7 Cluster: Putative uncharacterized protein; n=1; ...    33   3.3  
UniRef50_A0DHC4 Cluster: Chromosome undetermined scaffold_50, wh...    33   3.3  
UniRef50_A2DCR9 Cluster: Leucine Rich Repeat family protein; n=1...    32   4.4  
UniRef50_Q5AL57 Cluster: Putative uncharacterized protein; n=1; ...    32   4.4  
UniRef50_A5DXA0 Cluster: Putative uncharacterized protein; n=1; ...    32   4.4  
UniRef50_UPI0000DB70C8 Cluster: PREDICTED: hypothetical protein;...    32   5.8  
UniRef50_A5P1H8 Cluster: RNA polymerase sigma factor; n=6; Alpha...    32   5.8  
UniRef50_Q9VT00 Cluster: CG3654-PD; n=6; Diptera|Rep: CG3654-PD ...    32   5.8  
UniRef50_Q4UEL7 Cluster: Putative uncharacterized protein; n=2; ...    32   5.8  
UniRef50_A0D1C6 Cluster: Chromosome undetermined scaffold_34, wh...    32   5.8  
UniRef50_UPI0000DB6FE2 Cluster: PREDICTED: similar to CG13980-PA...    31   7.6  
UniRef50_Q23JH8 Cluster: NLI interacting factor-like phosphatase...    31   7.6  
UniRef50_A2FL64 Cluster: Putative uncharacterized protein; n=1; ...    31   7.6  
UniRef50_A2FKB8 Cluster: Putative uncharacterized protein; n=1; ...    31   7.6  
UniRef50_A6RQZ1 Cluster: Putative uncharacterized protein; n=1; ...    31   7.6  

>UniRef50_Q5KP60 Cluster: Putative uncharacterized protein; n=2;
           Filobasidiella neoformans|Rep: Putative uncharacterized
           protein - Cryptococcus neoformans (Filobasidiella
           neoformans)
          Length = 1180

 Score = 35.1 bits (77), Expect = 0.62
 Identities = 21/77 (27%), Positives = 37/77 (48%), Gaps = 2/77 (2%)

Query: 3   TKARSNSLGTITQDINVPEIIFSTKDQIKHDWQQDRIPPAKRKRVETDSPQTEKVKKKPN 62
           +K  +++ G        PE+   +KD    D Q    PPA+R+R+++ SP  E    +  
Sbjct: 748 SKRPASAAGLSATQGTAPEV--RSKDIGGRDSQDLMPPPAQRRRIQSPSPSPEPGPSEDQ 805

Query: 63  TTTYAITTKNQFDILDS 79
               A  ++N+FD  D+
Sbjct: 806 APRRAAPSQNRFDFQDA 822


>UniRef50_Q54IS5 Cluster: Putative uncharacterized protein; n=1;
           Dictyostelium discoideum AX4|Rep: Putative
           uncharacterized protein - Dictyostelium discoideum AX4
          Length = 679

 Score = 34.7 bits (76), Expect = 0.82
 Identities = 19/55 (34%), Positives = 29/55 (52%), Gaps = 2/55 (3%)

Query: 21  EIIFSTKDQIKHDWQQDRIPPA--KRKRVETDSPQTEKVKKKPNTTTYAITTKNQ 73
           E +F+ KD ++   Q D   P   K+K     +  T+KV ++P TTT  ITT  +
Sbjct: 602 ENLFNLKDDLELQQQNDEKKPVEDKKKPAAAATTVTKKVTEEPTTTTTTITTSKK 656


>UniRef50_Q91LG7 Cluster: ORF73; n=1; Shrimp white spot syndrome
           virus|Rep: ORF73 - White spot syndrome virus (WSSV)
          Length = 1044

 Score = 34.3 bits (75), Expect = 1.1
 Identities = 25/94 (26%), Positives = 41/94 (43%), Gaps = 5/94 (5%)

Query: 25  STKDQIKHDWQQ--DRIPPAKRKRVETDSPQTEKVKKKPNTTTYAITTKNQFDILDSEES 82
           S KD I +D+++  D+   AKR  +    P T    KK N  T  I+T ++F     +E+
Sbjct: 771 SLKD-IFNDFEKTCDKYKTAKRAIIGAQDPSTSTPSKKENGITRIISTLSEFH--SKDEA 827

Query: 83  TTDNXXXXXXXXXXXXFVTGVNRTLKRKRVFNSF 116
           T                ++GV   ++   VF+ F
Sbjct: 828 TVSALLDKTMLLGSRTIMSGVRCVIRNNSVFSGF 861


>UniRef50_A0AFQ6 Cluster: Complete genome; n=1; Listeria welshimeri
           serovar 6b str. SLCC5334|Rep: Complete genome - Listeria
           welshimeri serovar 6b (strain ATCC 35897 / DSM 20650
           /SLCC5334)
          Length = 818

 Score = 34.3 bits (75), Expect = 1.1
 Identities = 21/79 (26%), Positives = 33/79 (41%)

Query: 30  IKHDWQQDRIPPAKRKRVETDSPQTEKVKKKPNTTTYAITTKNQFDILDSEESTTDNXXX 89
           +K D Q+D+    K+K  E      EK  +K N T  A  +    D  D++ ++ DN   
Sbjct: 230 VKEDEQKDKKEAEKKKEEEKQKELAEKEAEKNNETKEANGSNESNDAKDNKVASKDNESE 289

Query: 90  XXXXXXXXXFVTGVNRTLK 108
                      +G+N T K
Sbjct: 290 NDSNKTSGEKSSGLNATKK 308


>UniRef50_Q4PFN2 Cluster: Putative uncharacterized protein; n=1;
           Ustilago maydis|Rep: Putative uncharacterized protein -
           Ustilago maydis (Smut fungus)
          Length = 399

 Score = 34.3 bits (75), Expect = 1.1
 Identities = 16/63 (25%), Positives = 32/63 (50%), Gaps = 2/63 (3%)

Query: 24  FSTKDQIKHDWQQDRI--PPAKRKRVETDSPQTEKVKKKPNTTTYAITTKNQFDILDSEE 81
           F   DQI HD+  ++   PP       TD+ +T++VKK+ N    + +   ++ + + ++
Sbjct: 209 FYFADQITHDYLLEKAAPPPTANSTTSTDTSETQRVKKRQNFVPLSSSLATKWHLSERQQ 268

Query: 82  STT 84
             T
Sbjct: 269 RRT 271


>UniRef50_UPI00015B5A4B Cluster: PREDICTED: similar to CG12398-PA;
           n=1; Nasonia vitripennis|Rep: PREDICTED: similar to
           CG12398-PA - Nasonia vitripennis
          Length = 678

 Score = 33.9 bits (74), Expect = 1.4
 Identities = 20/70 (28%), Positives = 33/70 (47%), Gaps = 3/70 (4%)

Query: 13  ITQDINVPEIIFSTK--DQIKHDWQQDRIPPAKRKRVETDSPQTEKVKKKPNTTTYAITT 70
           +T + N P I+ + K  D IK DWQ     P +R+  E   P  ++ K      T ++  
Sbjct: 605 VTGNTNAPTIMIAEKGADMIKQDWQHHHHHPRRRRTSEKLKPDRQRRKDYARRKTASLNP 664

Query: 71  KNQ-FDILDS 79
             Q + +L+S
Sbjct: 665 LVQRYSVLES 674


>UniRef50_Q91749 Cluster: Oviduct specific protein-1A; n=2; Xenopus
           laevis|Rep: Oviduct specific protein-1A - Xenopus laevis
           (African clawed frog)
          Length = 480

 Score = 33.9 bits (74), Expect = 1.4
 Identities = 17/36 (47%), Positives = 21/36 (58%)

Query: 40  PPAKRKRVETDSPQTEKVKKKPNTTTYAITTKNQFD 75
           PPAK+K VE+D    EK  KKP     A  TK++ D
Sbjct: 164 PPAKKKPVESDEETKEKEDKKPVAVFAADKTKSEED 199


>UniRef50_UPI00006CFB9F Cluster: hypothetical protein
           TTHERM_00486710; n=1; Tetrahymena thermophila SB210|Rep:
           hypothetical protein TTHERM_00486710 - Tetrahymena
           thermophila SB210
          Length = 604

 Score = 33.5 bits (73), Expect = 1.9
 Identities = 22/68 (32%), Positives = 40/68 (58%), Gaps = 8/68 (11%)

Query: 12  TITQDINVPEIIFSTKDQIKHDWQQDRIPPAKRKRVETDSPQTEKVKKKPNTTTYAITTK 71
           +ITQDIN+ E+I   K QI++  QQ +   + ++     S   ++ K+K N ++Y  +T 
Sbjct: 42  SITQDINIQEVI--NKSQIQYQQQQQQYLLSLKR----SSEHLDQEKEKQNNSSY--STN 93

Query: 72  NQFDILDS 79
           NQ  +L++
Sbjct: 94  NQQTLLNT 101


>UniRef50_A0BFE4 Cluster: Chromosome undetermined scaffold_104,
           whole genome shotgun sequence; n=4; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_104,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 374

 Score = 33.5 bits (73), Expect = 1.9
 Identities = 17/51 (33%), Positives = 33/51 (64%), Gaps = 2/51 (3%)

Query: 31  KHDWQQDRIPPAKRKRVETDSPQTEKV-KKKPNT-TTYAITTKNQFDILDS 79
           K D +Q++ P  K+K ++ DSP  E++ +++PN+   Y++  ++  DIL S
Sbjct: 245 KKDIKQEKKPGRKKKILDNDSPNQEQIEQQQPNSQNVYSLLQQDVLDILIS 295


>UniRef50_Q41FU7 Cluster: Putative uncharacterized protein; n=1;
          Exiguobacterium sibiricum 255-15|Rep: Putative
          uncharacterized protein - Exiguobacterium sibiricum
          255-15
          Length = 245

 Score = 32.7 bits (71), Expect = 3.3
 Identities = 17/55 (30%), Positives = 28/55 (50%)

Query: 19 VPEIIFSTKDQIKHDWQQDRIPPAKRKRVETDSPQTEKVKKKPNTTTYAITTKNQ 73
          + E I ST+ ++K+ W + R    + K V T+S    K KKK    T +   +N+
Sbjct: 18 IAEKIESTEGKVKYAWSKYRKSLTENKAVSTESSPVAKPKKKKAGVTVSAPKQNE 72


>UniRef50_A0DHC4 Cluster: Chromosome undetermined scaffold_50, whole
           genome shotgun sequence; n=2; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_50,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 743

 Score = 32.7 bits (71), Expect = 3.3
 Identities = 20/80 (25%), Positives = 40/80 (50%), Gaps = 4/80 (5%)

Query: 4   KARSNSLGTITQDIN--VPEIIFSTKDQIKHDWQQDRIPPAKRKRVETDSPQTEKVKKKP 61
           K +  S  TI Q++   + ++     ++ K + QQ   P  +  + +T  P+T+K+K +P
Sbjct: 488 KPKEESDATINQEVQDYLKKLDSKKPEKAKEETQQQ--PEKQPPQQQTQVPKTQKLKTEP 545

Query: 62  NTTTYAITTKNQFDILDSEE 81
                 + TK Q + +DS +
Sbjct: 546 PKQEQKLQTKQQVEKVDSAD 565


>UniRef50_A2DCR9 Cluster: Leucine Rich Repeat family protein; n=1;
            Trichomonas vaginalis G3|Rep: Leucine Rich Repeat family
            protein - Trichomonas vaginalis G3
          Length = 1082

 Score = 32.3 bits (70), Expect = 4.4
 Identities = 18/68 (26%), Positives = 37/68 (54%), Gaps = 2/68 (2%)

Query: 21   EIIFSTKDQIKHDWQQDRIPPAKRKRV--ETDSPQTEKVKKKPNTTTYAITTKNQFDILD 78
            E I   K +I +D  ++ I P K+KR+  E+DS   E++ +K   +T    + +   +++
Sbjct: 941  EDILPKKKRIINDSDEEVILPKKKKRILLESDSNDEEEIPRKSKKSTPKKVSNSNRKVIN 1000

Query: 79   SEESTTDN 86
             + S +D+
Sbjct: 1001 MDSSDSDD 1008


>UniRef50_Q5AL57 Cluster: Putative uncharacterized protein; n=1;
           Candida albicans|Rep: Putative uncharacterized protein -
           Candida albicans (Yeast)
          Length = 139

 Score = 32.3 bits (70), Expect = 4.4
 Identities = 18/53 (33%), Positives = 29/53 (54%), Gaps = 3/53 (5%)

Query: 23  IFSTKDQIKHDWQQDRIPPAKRKRVETDSPQ--TEKVKKKPNTTTYAITTKNQ 73
           +F T DQ++ +W+ ++   A  K+ E +S Q    K K+KPN+T     T  Q
Sbjct: 53  VFKTLDQLREEWKAEK-EQANPKKEEENSNQKPVAKQKQKPNSTKKQKQTPKQ 104


>UniRef50_A5DXA0 Cluster: Putative uncharacterized protein; n=1;
           Lodderomyces elongisporus NRRL YB-4239|Rep: Putative
           uncharacterized protein - Lodderomyces elongisporus
           (Yeast) (Saccharomyces elongisporus)
          Length = 1637

 Score = 32.3 bits (70), Expect = 4.4
 Identities = 14/34 (41%), Positives = 22/34 (64%)

Query: 27  KDQIKHDWQQDRIPPAKRKRVETDSPQTEKVKKK 60
           ++QIKHD +Q R+   +RK +E    Q E+ KK+
Sbjct: 960 EEQIKHDEEQRRLKEERRKELEEKKRQKEEEKKQ 993


>UniRef50_UPI0000DB70C8 Cluster: PREDICTED: hypothetical protein;
           n=1; Apis mellifera|Rep: PREDICTED: hypothetical protein
           - Apis mellifera
          Length = 2470

 Score = 31.9 bits (69), Expect = 5.8
 Identities = 16/52 (30%), Positives = 25/52 (48%)

Query: 30  IKHDWQQDRIPPAKRKRVETDSPQTEKVKKKPNTTTYAITTKNQFDILDSEE 81
           I  DW+  RI   +     +DS +T K KK+   T+ +  + +     DSEE
Sbjct: 677 IPSDWENVRIKVERMSDENSDSKETRKKKKRKKATSSSSESSSSSSSSDSEE 728


>UniRef50_A5P1H8 Cluster: RNA polymerase sigma factor; n=6;
          Alphaproteobacteria|Rep: RNA polymerase sigma factor -
          Methylobacterium sp. 4-46
          Length = 281

 Score = 31.9 bits (69), Expect = 5.8
 Identities = 14/37 (37%), Positives = 17/37 (45%)

Query: 27 KDQIKHDWQQDRIPPAKRKRVETDSPQTEKVKKKPNT 63
          + +I H W   R PPA   R  TDS    + K  P T
Sbjct: 12 RSRISHSWSSFRCPPASPCRPMTDSSSARRAKPGPAT 48


>UniRef50_Q9VT00 Cluster: CG3654-PD; n=6; Diptera|Rep: CG3654-PD -
           Drosophila melanogaster (Fruit fly)
          Length = 2351

 Score = 31.9 bits (69), Expect = 5.8
 Identities = 14/45 (31%), Positives = 26/45 (57%)

Query: 35  QQDRIPPAKRKRVETDSPQTEKVKKKPNTTTYAITTKNQFDILDS 79
           +Q   PP+K ++ +  + Q ++ K+K N++ YA   K Q +  DS
Sbjct: 403 KQPETPPSKTEKEQEPTKQIKEAKEKTNSSIYAKEAKQQLENGDS 447


>UniRef50_Q4UEL7 Cluster: Putative uncharacterized protein; n=2;
           Theileria|Rep: Putative uncharacterized protein -
           Theileria annulata
          Length = 312

 Score = 31.9 bits (69), Expect = 5.8
 Identities = 14/44 (31%), Positives = 24/44 (54%)

Query: 36  QDRIPPAKRKRVETDSPQTEKVKKKPNTTTYAITTKNQFDILDS 79
           +D+IP  ++K ++T  P T     + + T   ITT N ++  DS
Sbjct: 191 KDKIPTVQKKPIQTQVPPTSSFNLRSSRTNRNITTSNYYNKDDS 234


>UniRef50_A0D1C6 Cluster: Chromosome undetermined scaffold_34, whole
           genome shotgun sequence; n=2; Paramecium
           tetraurelia|Rep: Chromosome undetermined scaffold_34,
           whole genome shotgun sequence - Paramecium tetraurelia
          Length = 770

 Score = 31.9 bits (69), Expect = 5.8
 Identities = 22/74 (29%), Positives = 39/74 (52%), Gaps = 6/74 (8%)

Query: 12  TITQDINVPEIIFSTKDQIKHDWQQDRIPPAKRKRVETDSPQTEKVKKKPNTTTYAITTK 71
           T+++ I+V E+IF       H ++Q + P +    +     + EK K K    T  I+ K
Sbjct: 540 TLSKLISVLELIFIRN----HIFKQHKDPYSSS--IVQPIKKEEKDKMKQKEQTIEISKK 593

Query: 72  NQFDILDSEESTTD 85
           N FD+++S+E  T+
Sbjct: 594 NSFDLIESDEEGTN 607


>UniRef50_UPI0000DB6FE2 Cluster: PREDICTED: similar to CG13980-PA;
           n=1; Apis mellifera|Rep: PREDICTED: similar to
           CG13980-PA - Apis mellifera
          Length = 1080

 Score = 31.5 bits (68), Expect = 7.6
 Identities = 19/80 (23%), Positives = 32/80 (40%), Gaps = 1/80 (1%)

Query: 36  QDRIPPAKRKRVETDSPQTEKVKKKPNTTTYAITTKNQFDILDSEESTTDNXXXXXXXXX 95
           +D  P      VE +  +T+KVKK P  +  A T +  F  +D   S++           
Sbjct: 652 RDLSPTKSNLEVEKEEDETKKVKKHPGQSDLA-TEEESFRYIDESSSSSPKSSHTRRSRP 710

Query: 96  XXXFVTGVNRTLKRKRVFNS 115
               +    R+  +K +F S
Sbjct: 711 STGKIKKTGRSTVKKCLFES 730


>UniRef50_Q23JH8 Cluster: NLI interacting factor-like phosphatase
           family protein; n=1; Tetrahymena thermophila SB210|Rep:
           NLI interacting factor-like phosphatase family protein -
           Tetrahymena thermophila SB210
          Length = 1190

 Score = 31.5 bits (68), Expect = 7.6
 Identities = 14/34 (41%), Positives = 19/34 (55%)

Query: 53  QTEKVKKKPNTTTYAITTKNQFDILDSEESTTDN 86
           +T  +K+ P +TT     KN F I D + STT N
Sbjct: 193 KTPHIKQPPTSTTSKSNKKNNFQIRDDQNSTTVN 226


>UniRef50_A2FL64 Cluster: Putative uncharacterized protein; n=1;
           Trichomonas vaginalis G3|Rep: Putative uncharacterized
           protein - Trichomonas vaginalis G3
          Length = 2102

 Score = 31.5 bits (68), Expect = 7.6
 Identities = 16/65 (24%), Positives = 33/65 (50%), Gaps = 1/65 (1%)

Query: 21  EIIFSTKDQIKHDWQQDRIPPAKRKRVETDSPQTE-KVKKKPNTTTYAITTKNQFDILDS 79
           E++   KD+IKH  +   + P K + +  D+P ++ K+  +   T+    ++N+ D  + 
Sbjct: 282 EVLGKLKDEIKHMEKPIPVNPEKSEPIHIDTPPSKHKISPERKNTSPVSASENKLDSENK 341

Query: 80  EESTT 84
           E   T
Sbjct: 342 ESPAT 346


>UniRef50_A2FKB8 Cluster: Putative uncharacterized protein; n=1;
           Trichomonas vaginalis G3|Rep: Putative uncharacterized
           protein - Trichomonas vaginalis G3
          Length = 296

 Score = 31.5 bits (68), Expect = 7.6
 Identities = 18/67 (26%), Positives = 32/67 (47%), Gaps = 5/67 (7%)

Query: 20  PEIIFSTKDQIKHDWQQDRIPPAKRKRVETDSPQTEKVKKKPNTTTYAITTKNQFDILDS 79
           PE  F+++D IK  W++ R    ++ + +  S +T +  K  N T   + +K      DS
Sbjct: 112 PEACFNSRDLIKEYWKKVREEDEQKTKRKRRSSETNQDNKSSNETEKTVKSKK-----DS 166

Query: 80  EESTTDN 86
            E   D+
Sbjct: 167 SEDNNDD 173


>UniRef50_A6RQZ1 Cluster: Putative uncharacterized protein; n=1;
           Botryotinia fuckeliana B05.10|Rep: Putative
           uncharacterized protein - Botryotinia fuckeliana B05.10
          Length = 426

 Score = 31.5 bits (68), Expect = 7.6
 Identities = 19/71 (26%), Positives = 33/71 (46%), Gaps = 2/71 (2%)

Query: 5   ARSNSLGTITQDINVPEIIFSTKDQIKHDWQQDRIPPAKRK--RVETDSPQTEKVKKKPN 62
           A   S   + Q+  VPE+    ++ I+   Q+D IP  KRK  R   DS + +    +  
Sbjct: 157 AEQESPEKVIQETQVPEMDMELEEDIEEPEQEDSIPEVKRKESRSHNDSRRRQMSVSRHR 216

Query: 63  TTTYAITTKNQ 73
             + + T +N+
Sbjct: 217 AGSASDTERNE 227


  Database: uniref50
    Posted date:  Oct 5, 2007 11:19 AM
  Number of letters in database: 575,637,011
  Number of sequences in database:  1,657,284
  
Lambda     K      H
   0.315    0.130    0.365 

Gapped
Lambda     K      H
   0.279   0.0580    0.190 


Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 143,394,989
Number of Sequences: 1657284
Number of extensions: 4992547
Number of successful extensions: 16095
Number of sequences better than 10.0: 24
Number of HSP's better than 10.0 without gapping: 10
Number of HSP's successfully gapped in prelim test: 14
Number of HSP's that attempted gapping in prelim test: 16079
Number of HSP's gapped (non-prelim): 29
length of query: 147
length of database: 575,637,011
effective HSP length: 93
effective length of query: 54
effective length of database: 421,509,599
effective search space: 22761518346
effective search space used: 22761518346
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.6 bits)
S2: 68 (31.5 bits)
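
The figures in the statistics block above can be cross-checked against each other. The sketch below is not part of the BLAST output; it assumes the standard Karlin-Altschul relations (bit score = (lambda * S - ln K) / ln 2 and E = K * N * exp(-lambda * S), with N the effective search space) and assumes the effective lengths are obtained by subtracting the effective HSP length once from the query and once per database sequence, which is consistent with the numbers reported here. All function and constant names are hypothetical.

```python
import math

# Values copied from the report's statistics block.
QUERY_LEN = 147
DB_LETTERS = 575_637_011
DB_SEQS = 1_657_284
EFF_HSP_LEN = 93
GAPPED_LAMBDA = 0.279   # "Gapped" Lambda
GAPPED_K = 0.0580       # "Gapped" K


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Effective query length times effective database length."""
    eff_query = query_len - hsp_len            # 147 - 93 = 54
    eff_db = db_letters - db_seqs * hsp_len    # 421,509,599
    return eff_query * eff_db


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect(raw_score, search_space, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Expected number of chance hits scoring at least raw_score."""
    return k * search_space * math.exp(-lam * raw_score)


if __name__ == "__main__":
    space = effective_search_space(QUERY_LEN, DB_LETTERS, DB_SEQS, EFF_HSP_LEN)
    print("effective search space:", space)
    # Raw scores taken from the top hit (77) and the S2 cutoff (68).
    for raw in (77, 68):
        print(raw, round(bit_score(raw), 1), round(expect(raw, space), 2))
```

Under these assumptions the top hit (raw score 77) should come out close to the reported 35.1 bits and Expect = 0.62, and the S2 cutoff (raw score 68) close to 31.5 bits and Expect = 7.6. Small differences are to be expected because Lambda and K are printed with only three significant digits.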
