SilkBase

Last updated: 2022/11/18
BLASTX 2.2.12 [Aug-07-2005]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= bmov10b10
         (712 letters)

Database: uniref50 
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

UniRef50_UPI00005454F5 Cluster: PREDICTED: hypothetical protein;...    60   7e-08
UniRef50_Q9H0W7 Cluster: THAP domain-containing protein 2; n=12;...    52   2e-05
UniRef50_Q8WY91 Cluster: THAP domain-containing protein 4; n=18;...    51   3e-05
UniRef50_Q1DGX1 Cluster: Putative uncharacterized protein; n=2; ...    49   1e-04
UniRef50_Q1JPT7 Cluster: Zgc:136597; n=4; Clupeocephala|Rep: Zgc...    48   2e-04
UniRef50_UPI00004A5BB5 Cluster: PREDICTED: similar to THAP domai...    48   3e-04
UniRef50_Q0V9A7 Cluster: Putative uncharacterized protein MGC147...    48   3e-04
UniRef50_Q9NVV9 Cluster: THAP domain-containing protein 1; n=14;...    48   3e-04
UniRef50_UPI0000DB71DF Cluster: PREDICTED: similar to THAP domai...    47   4e-04
UniRef50_UPI0000E493FC Cluster: PREDICTED: similar to transposas...    46   0.001
UniRef50_Q0P4A1 Cluster: Zgc:153243; n=3; Clupeocephala|Rep: Zgc...    45   0.002
UniRef50_Q4RTQ3 Cluster: Chromosome 2 SCAF14997, whole genome sh...    44   0.004
UniRef50_Q29K07 Cluster: GA10453-PA; n=1; Drosophila pseudoobscu...    44   0.004
UniRef50_UPI00015B5AC5 Cluster: PREDICTED: hypothetical protein;...    42   0.015
UniRef50_Q6PFI2 Cluster: L(3)mbt-like 2; n=3; Danio rerio|Rep: L...    42   0.015
UniRef50_A4IGI1 Cluster: Zgc:163143 protein; n=1; Danio rerio|Re...    42   0.015
UniRef50_Q58EG0 Cluster: THAP domain containing, apoptosis assoc...    42   0.020
UniRef50_Q7QG69 Cluster: ENSANGP00000020085; n=1; Anopheles gamb...    42   0.020
UniRef50_UPI0000F213AC Cluster: PREDICTED: hypothetical protein;...    41   0.035
UniRef50_Q4V6V0 Cluster: IP01025p; n=3; Sophophora|Rep: IP01025p...    41   0.035
UniRef50_Q16XI0 Cluster: Putative uncharacterized protein; n=1; ...    40   0.046
UniRef50_Q0VFB2 Cluster: LOC779531 protein; n=1; Xenopus tropica...    40   0.060
UniRef50_Q16XH9 Cluster: Putative uncharacterized protein; n=1; ...    40   0.060
UniRef50_UPI000058482A Cluster: PREDICTED: similar to MGC82205 p...    40   0.080
UniRef50_Q5PQ32 Cluster: LOC495997 protein; n=3; Xenopus|Rep: LO...    40   0.080
UniRef50_Q293X7 Cluster: GA19783-PA; n=1; Drosophila pseudoobscu...    40   0.080
UniRef50_A4QP83 Cluster: LOC100005466 protein; n=2; Danio rerio|...    38   0.18 
UniRef50_Q1KZX7 Cluster: Transposase; n=1; Anopheles gambiae str...    38   0.18 
UniRef50_UPI0000F1DF3B Cluster: PREDICTED: hypothetical protein;...    38   0.32 
UniRef50_Q08CB6 Cluster: Zgc:153292; n=2; Danio rerio|Rep: Zgc:1...    38   0.32 
UniRef50_UPI00015B5079 Cluster: PREDICTED: hypothetical protein;...    36   0.74 
UniRef50_UPI00015B45C1 Cluster: PREDICTED: similar to viral A-ty...    36   0.74 
UniRef50_Q9VIS5 Cluster: CG10631-PA; n=1; Drosophila melanogaste...    36   0.98 
UniRef50_Q8WTV1 Cluster: THAP domain-containing protein 3; n=12;...    36   0.98 
UniRef50_Q16MP6 Cluster: Putative uncharacterized protein; n=1; ...    36   1.3  
UniRef50_Q9BT49 Cluster: THAP domain-containing protein 7; n=18;...    36   1.3  
UniRef50_Q03D70 Cluster: ATP-dependent nuclease, subunit B; n=1;...    35   1.7  
UniRef50_Q96EK4 Cluster: THAP domain-containing protein 11; n=20...    35   1.7  
UniRef50_UPI0000D56D8D Cluster: PREDICTED: hypothetical protein;...    35   2.3  
UniRef50_Q566P8 Cluster: LOC553397 protein; n=5; Danio rerio|Rep...    35   2.3  
UniRef50_Q60MX8 Cluster: Putative uncharacterized protein CBG229...    35   2.3  
UniRef50_Q4V5S0 Cluster: IP06774p; n=4; Sophophora|Rep: IP06774p...    35   2.3  
UniRef50_A4VE92 Cluster: Putative uncharacterized protein; n=1; ...    35   2.3  
UniRef50_UPI0000DB74EF Cluster: PREDICTED: hypothetical protein;...    34   3.0  
UniRef50_UPI00015B5019 Cluster: PREDICTED: similar to GA20163-PA...    34   4.0  
UniRef50_UPI0000F1E17E Cluster: PREDICTED: hypothetical protein,...    34   4.0  
UniRef50_Q5TRK2 Cluster: ENSANGP00000029564; n=1; Anopheles gamb...    34   4.0  
UniRef50_Q17N87 Cluster: Putative uncharacterized protein; n=1; ...    34   4.0  
UniRef50_Q16PJ2 Cluster: Putative uncharacterized protein; n=1; ...    34   4.0  
UniRef50_UPI0001556045 Cluster: PREDICTED: similar to THAP domai...    33   5.2  
UniRef50_Q6NYT2 Cluster: Zgc:65871; n=2; Danio rerio|Rep: Zgc:65...    33   5.2  
UniRef50_Q1KZX8 Cluster: Transposase; n=2; Anopheles gambiae str...    33   5.2  
UniRef50_Q9P2Z0 Cluster: THAP domain-containing protein 10; n=6;...    33   5.2  
UniRef50_P34427 Cluster: Protein lin-36; n=2; Caenorhabditis ele...    33   5.2  
UniRef50_UPI00015B40B2 Cluster: PREDICTED: similar to THAP domai...    33   6.9  
UniRef50_UPI0000E87B1B Cluster: O-Antigen Polymerase; n=1; Methy...    33   6.9  
UniRef50_A2A8D4 Cluster: THAP domain containing, apoptosis assoc...    33   6.9  
UniRef50_Q8BJ25 Cluster: THAP domain-containing protein 3; n=9; ...    33   6.9  
UniRef50_Q6IR68 Cluster: LOC432088 protein; n=3; Xenopus|Rep: LO...    33   9.2  
UniRef50_A6T1S2 Cluster: Uncharacterized conserved protein; n=2;...    33   9.2  
UniRef50_Q7Q704 Cluster: ENSANGP00000017828; n=1; Anopheles gamb...    33   9.2  
UniRef50_Q1KZX9 Cluster: Transposase; n=1; Anopheles gambiae str...    33   9.2  

>UniRef50_UPI00005454F5 Cluster: PREDICTED: hypothetical protein;
           n=1; Danio rerio|Rep: PREDICTED: hypothetical protein -
           Danio rerio
          Length = 183

 Score = 59.7 bits (138), Expect = 7e-08
 Identities = 23/54 (42%), Positives = 35/54 (64%), Gaps = 1/54 (1%)
 Frame = +1

Query: 511 CDDCKNMDSIMFFRFPEDSSLRQIWTDLTGRN-NWTPTDFSYICIPHFSVDCFK 669
           CD C N+D + F++FP     R+IW+   GR+  WTP++ S +C  HF+ DCF+
Sbjct: 4   CDFCGNIDGVSFYKFPLQEERRRIWSVNMGRDVGWTPSETSSLCSAHFTPDCFE 57


>UniRef50_Q9H0W7 Cluster: THAP domain-containing protein 2; n=12;
           Mammalia|Rep: THAP domain-containing protein 2 - Homo
           sapiens (Human)
          Length = 228

 Score = 51.6 bits (118), Expect = 2e-05
 Identities = 24/70 (34%), Positives = 33/70 (47%), Gaps = 1/70 (1%)
 Frame = +1

Query: 496 CAVLGCDDCKNMD-SIMFFRFPEDSSLRQIWTDLTGRNNWTPTDFSYICIPHFSVDCFKL 672
           CA  GC    N   +I F RFP D   R+ W  L  R N+ P   +++C  HF   CF L
Sbjct: 5   CAAAGCATTYNKHINISFHRFPLDPKRRKEWVRLVRRKNFVPGKHTFLCSKHFEASCFDL 64

Query: 673 DNEDXMVLVD 702
             +   + +D
Sbjct: 65  TGQTRRLKMD 74


>UniRef50_Q8WY91 Cluster: THAP domain-containing protein 4; n=18;
           Euteleostomi|Rep: THAP domain-containing protein 4 -
           Homo sapiens (Human)
          Length = 577

 Score = 50.8 bits (116), Expect = 3e-05
 Identities = 25/67 (37%), Positives = 35/67 (52%), Gaps = 4/67 (5%)
 Frame = +1

Query: 496 CAVLGCDDCKNMD---SIMFFRFPEDSSLRQI-WTDLTGRNNWTPTDFSYICIPHFSVDC 663
           CA + C + +      ++ F RFP   S R I W     R+NWTPT +S++C  HF+ D 
Sbjct: 5   CAAVNCSNRQGKGEKRAVSFHRFPLKDSKRLIQWLKAVQRDNWTPTKYSFLCSEHFTKDS 64

Query: 664 FKLDNED 684
           F    ED
Sbjct: 65  FSKRLED 71


>UniRef50_Q1DGX1 Cluster: Putative uncharacterized protein; n=2;
           Aedes aegypti|Rep: Putative uncharacterized protein -
           Aedes aegypti (Yellowfever mosquito)
          Length = 688

 Score = 48.8 bits (111), Expect = 1e-04
 Identities = 21/57 (36%), Positives = 29/57 (50%), Gaps = 1/57 (1%)
 Frame = +1

Query: 484 ITKTCAVLGCDDCKNMDSIMFFRFPEDSSLRQIWTDLTGR-NNWTPTDFSYICIPHF 651
           + + C V GC           F FP D  LRQ W D+ G+ ++WT  + +YIC  HF
Sbjct: 1   MARCCVVTGCAASNQDFGTFLFSFPRDEKLRQQWIDVLGKPSSWTVPEAAYICWSHF 57


>UniRef50_Q1JPT7 Cluster: Zgc:136597; n=4; Clupeocephala|Rep:
           Zgc:136597 - Danio rerio (Zebrafish) (Brachydanio rerio)
          Length = 158

 Score = 48.0 bits (109), Expect = 2e-04
 Identities = 27/78 (34%), Positives = 40/78 (51%), Gaps = 2/78 (2%)
 Frame = +1

Query: 484 ITKTCAVLGCDDCKNMD-SIMFFRFP-EDSSLRQIWTDLTGRNNWTPTDFSYICIPHFSV 657
           + ++C+  GC +    D +I F +FP     +   W     R N+ PT +S IC  HF+ 
Sbjct: 1   MVQSCSAYGCKNRYQKDRNISFHKFPLARPEVCVQWVSAMSRRNFKPTKYSNICSQHFTS 60

Query: 658 DCFKLDNEDXMVLVDKAV 711
           DCFK +  +  VL D AV
Sbjct: 61  DCFKQECNN-RVLKDNAV 77


>UniRef50_UPI00004A5BB5 Cluster: PREDICTED: similar to THAP domain
           protein 1 isoform 1; n=1; Canis lupus familiaris|Rep:
           PREDICTED: similar to THAP domain protein 1 isoform 1 -
           Canis familiaris
          Length = 178

 Score = 47.6 bits (108), Expect = 3e-04
 Identities = 23/64 (35%), Positives = 35/64 (54%), Gaps = 2/64 (3%)
 Frame = +1

Query: 484 ITKTCAVLGCDDCKNMDS-IMFFRFP-EDSSLRQIWTDLTGRNNWTPTDFSYICIPHFSV 657
           + ++C+  GC +  + D  + F +FP    SL + W     R N+ PT +S IC  HF+ 
Sbjct: 1   MVQSCSAYGCKNRYDKDKPVSFHKFPLTRPSLCKKWEAAVRRKNFKPTKYSSICSEHFTP 60

Query: 658 DCFK 669
           DCFK
Sbjct: 61  DCFK 64


>UniRef50_Q0V9A7 Cluster: Putative uncharacterized protein
           MGC147467; n=2; Deuterostomia|Rep: Putative
           uncharacterized protein MGC147467 - Xenopus tropicalis
           (Western clawed frog) (Silurana tropicalis)
          Length = 502

 Score = 47.6 bits (108), Expect = 3e-04
 Identities = 19/44 (43%), Positives = 27/44 (61%), Gaps = 1/44 (2%)
 Frame = +1

Query: 538 IMFFRFPEDSSLR-QIWTDLTGRNNWTPTDFSYICIPHFSVDCF 666
           + F RFP    +R  +WTD   R+NWTP  +S++C  HFS + F
Sbjct: 22  VSFHRFPLKDHVRLSLWTDALQRDNWTPGPYSFLCSDHFSPESF 65


>UniRef50_Q9NVV9 Cluster: THAP domain-containing protein 1; n=14;
           Tetrapoda|Rep: THAP domain-containing protein 1 - Homo
           sapiens (Human)
          Length = 213

 Score = 47.6 bits (108), Expect = 3e-04
 Identities = 23/64 (35%), Positives = 35/64 (54%), Gaps = 2/64 (3%)
 Frame = +1

Query: 484 ITKTCAVLGCDDCKNMDS-IMFFRFP-EDSSLRQIWTDLTGRNNWTPTDFSYICIPHFSV 657
           + ++C+  GC +  + D  + F +FP    SL + W     R N+ PT +S IC  HF+ 
Sbjct: 1   MVQSCSAYGCKNRYDKDKPVSFHKFPLTRPSLCKEWEAAVRRKNFKPTKYSSICSEHFTP 60

Query: 658 DCFK 669
           DCFK
Sbjct: 61  DCFK 64


>UniRef50_UPI0000DB71DF Cluster: PREDICTED: similar to THAP domain
           containing, apoptosis associated protein 2; n=1; Apis
           mellifera|Rep: PREDICTED: similar to THAP domain
           containing, apoptosis associated protein 2 - Apis
           mellifera
          Length = 567

 Score = 47.2 bits (107), Expect = 4e-04
 Identities = 23/72 (31%), Positives = 36/72 (50%)
 Frame = +1

Query: 496 CAVLGCDDCKNMDSIMFFRFPEDSSLRQIWTDLTGRNNWTPTDFSYICIPHFSVDCFKLD 675
           CA +GC++      IM   FP D  LR+IW +   R +W P++ S++C  HF    + + 
Sbjct: 4   CAAVGCNNRSEKGYIMKC-FPRDPKLRKIWQERVARADWEPSNNSFLCHVHFEPQEWSIT 62

Query: 676 NEDXMVLVDKAV 711
               + L   AV
Sbjct: 63  QSGRIRLKKNAV 74


>UniRef50_UPI0000E493FC Cluster: PREDICTED: similar to transposase;
           n=2; Strongylocentrotus purpuratus|Rep: PREDICTED:
           similar to transposase - Strongylocentrotus purpuratus
          Length = 851

 Score = 45.6 bits (103), Expect = 0.001
 Identities = 21/63 (33%), Positives = 28/63 (44%)
 Frame = +1

Query: 496 CAVLGCDDCKNMDSIMFFRFPEDSSLRQIWTDLTGRNNWTPTDFSYICIPHFSVDCFKLD 675
           C   GC + K+      +RFP D   R+IW +   R  W PT  S +C  HF    F+  
Sbjct: 4   CCAFGCSN-KSEKGYKMYRFPADPQRRKIWENKVSRVGWKPTSSSCLCEIHFDESQFENG 62

Query: 676 NED 684
             D
Sbjct: 63  RAD 65


>UniRef50_Q0P4A1 Cluster: Zgc:153243; n=3; Clupeocephala|Rep:
           Zgc:153243 - Danio rerio (Zebrafish) (Brachydanio rerio)
          Length = 224

 Score = 45.2 bits (102), Expect = 0.002
 Identities = 26/70 (37%), Positives = 33/70 (47%), Gaps = 5/70 (7%)
 Frame = +1

Query: 517 DCKNM----DSIMFFRFP-EDSSLRQIWTDLTGRNNWTPTDFSYICIPHFSVDCFKLDNE 681
           DC N       I F+RFP     L + W    GR N+ PT  S +C  HF  DCF+ D  
Sbjct: 9   DCSNRFVKGSEIRFYRFPISKPQLAEQWVRSLGRKNFVPTQNSCLCSEHFQPDCFR-DYN 67

Query: 682 DXMVLVDKAV 711
             + L + AV
Sbjct: 68  GKLFLREDAV 77


>UniRef50_Q4RTQ3 Cluster: Chromosome 2 SCAF14997, whole genome
           shotgun sequence; n=3; Clupeocephala|Rep: Chromosome 2
           SCAF14997, whole genome shotgun sequence - Tetraodon
           nigroviridis (Green puffer)
          Length = 950

 Score = 44.0 bits (99), Expect = 0.004
 Identities = 25/75 (33%), Positives = 32/75 (42%), Gaps = 4/75 (5%)
 Frame = +1

Query: 439 KVAPELKQNADTDMGITKTCAVLGCDDCKNM----DSIMFFRFPEDSSLRQIWTDLTGRN 606
           +  P   Q ADT     K   V   + C+N      S+ F+RFP D   +Q W     R 
Sbjct: 437 RALPGGAQEADTRRRSQKLGCV-SAERCRNRRTPGTSLSFYRFPRDPERKQRWIAAVNRA 495

Query: 607 NWTPTDFSYICIPHF 651
            W P D S +C  HF
Sbjct: 496 GWVPNDGSRLCSTHF 510


>UniRef50_Q29K07 Cluster: GA10453-PA; n=1; Drosophila
            pseudoobscura|Rep: GA10453-PA - Drosophila pseudoobscura
            (Fruit fly)
          Length = 3625

 Score = 44.0 bits (99), Expect = 0.004
 Identities = 21/64 (32%), Positives = 34/64 (53%), Gaps = 1/64 (1%)
 Frame = +1

Query: 496  CAVLGCDDCKNMDSIMFFRFPEDSSLRQIW-TDLTGRNNWTPTDFSYICIPHFSVDCFKL 672
            CAVL C   K  D + F+++P D ++ + W T+L  R+    +    +C  HF+ DCF  
Sbjct: 2358 CAVLSCFQPKG-DGVRFYKYPSDIAMARRWATNLKHRSMQASSHGFLVCQSHFAADCFDP 2416

Query: 673  DNED 684
            +  D
Sbjct: 2417 ETGD 2420



 Score = 35.9 bits (79), Expect = 0.98
 Identities = 21/59 (35%), Positives = 26/59 (44%), Gaps = 3/59 (5%)
 Frame = +1

Query: 496  CAVLGCDDCKNMDSIMFFRFP-EDSSLRQIWTDLTGRNNWTPTDF--SYICIPHFSVDC 663
            CAV GC+  K    +  FRFP +D  +   W +    N   P D     IC  HF  DC
Sbjct: 1286 CAVEGCESSKEQPEVRLFRFPTDDDDMLWKWCNNLKMN---PVDCLGVRICNKHFDADC 1341


>UniRef50_UPI00015B5AC5 Cluster: PREDICTED: hypothetical protein;
           n=1; Nasonia vitripennis|Rep: PREDICTED: hypothetical
           protein - Nasonia vitripennis
          Length = 732

 Score = 41.9 bits (94), Expect = 0.015
 Identities = 21/59 (35%), Positives = 30/59 (50%), Gaps = 3/59 (5%)
 Frame = +1

Query: 493 TCAVLGCDDCK--NMDSIMFFRFPEDS-SLRQIWTDLTGRNNWTPTDFSYICIPHFSVD 660
           +C + GC+  K  N  +I +++FP D+  L + W       NW P D   IC  HF VD
Sbjct: 3   SCCLEGCESYKFLNRQTIGYYKFPFDNIPLLEKWLSQIRIPNWVPEDHHRICSTHFHVD 61


>UniRef50_Q6PFI2 Cluster: L(3)mbt-like 2; n=3; Danio rerio|Rep:
           L(3)mbt-like 2 - Danio rerio (Zebrafish) (Brachydanio
           rerio)
          Length = 805

 Score = 41.9 bits (94), Expect = 0.015
 Identities = 19/60 (31%), Positives = 27/60 (45%), Gaps = 2/60 (3%)
 Frame = +1

Query: 496 CAVLGCDDCKNMDSIMFFRFPEDSSLRQIWTDLTG--RNNWTPTDFSYICIPHFSVDCFK 669
           C   GC      + +  FRFP+D      W       R NW    +S++C  HF+ DCF+
Sbjct: 5   CVAYGCGKISGQN-VSMFRFPKDPEEFSKWQRQVQKTRRNWLANTYSHLCNEHFTKDCFE 63


>UniRef50_A4IGI1 Cluster: Zgc:163143 protein; n=1; Danio rerio|Rep:
           Zgc:163143 protein - Danio rerio (Zebrafish)
           (Brachydanio rerio)
          Length = 413

 Score = 41.9 bits (94), Expect = 0.015
 Identities = 21/63 (33%), Positives = 29/63 (46%), Gaps = 1/63 (1%)
 Frame = +1

Query: 496 CAVLGCDDCKNMDSIMFFRFP-EDSSLRQIWTDLTGRNNWTPTDFSYICIPHFSVDCFKL 672
           C+   C +     S+ F  FP +DSSL + W       +W P   S IC  HF   CF L
Sbjct: 5   CSAYNCKNTLRNKSVSFHLFPLKDSSLLKKWLKNLRWKDWKPNPNSKICSAHFEEKCFIL 64

Query: 673 DNE 681
           + +
Sbjct: 65  EGK 67


>UniRef50_Q58EG0 Cluster: THAP domain containing, apoptosis
           associated protein 3; n=2; Danio rerio|Rep: THAP domain
           containing, apoptosis associated protein 3 - Danio rerio
           (Zebrafish) (Brachydanio rerio)
          Length = 213

 Score = 41.5 bits (93), Expect = 0.020
 Identities = 24/63 (38%), Positives = 34/63 (53%), Gaps = 4/63 (6%)
 Frame = +1

Query: 490 KTCAVLGCDDCKNMDS--IMFFRFP--EDSSLRQIWTDLTGRNNWTPTDFSYICIPHFSV 657
           K+C+   C +  N  +  I F RFP  + S L+Q W D  GR+++ P     IC  HF+ 
Sbjct: 3   KSCSASNCTNRYNNKNPEITFHRFPFSKPSVLKQ-WLDNIGRDDFQPRKHMVICSLHFTP 61

Query: 658 DCF 666
           DCF
Sbjct: 62  DCF 64


>UniRef50_Q7QG69 Cluster: ENSANGP00000020085; n=1; Anopheles gambiae
           str. PEST|Rep: ENSANGP00000020085 - Anopheles gambiae
           str. PEST
          Length = 193

 Score = 41.5 bits (93), Expect = 0.020
 Identities = 22/62 (35%), Positives = 32/62 (51%), Gaps = 3/62 (4%)
 Frame = +1

Query: 493 TCAVLGCD-DCKNMDSIMFFRFPEDS-SLRQIWTDLTGRN-NWTPTDFSYICIPHFSVDC 663
           +C +  CD    + D + F +FP  S  L + W   TGR+ +W PT +S +C  HF    
Sbjct: 4   SCVIPDCDLKYTHSDDVSFHKFPLKSPELLKQWIQFTGRDESWHPTKWSALCSRHFVASD 63

Query: 664 FK 669
           FK
Sbjct: 64  FK 65


>UniRef50_UPI0000F213AC Cluster: PREDICTED: hypothetical protein;
           n=1; Danio rerio|Rep: PREDICTED: hypothetical protein -
           Danio rerio
          Length = 307

 Score = 40.7 bits (91), Expect = 0.035
 Identities = 25/75 (33%), Positives = 36/75 (48%), Gaps = 2/75 (2%)
 Frame = +1

Query: 493 TCAVLGCDDCKNMDS-IMFFRFPEDSSLR-QIWTDLTGRNNWTPTDFSYICIPHFSVDCF 666
           +C+ L C +  +  + I F RFP D   R Q W     R+N+ P+  + IC  HF   CF
Sbjct: 2   SCSALSCKNRPSPGTGISFHRFPLDDKDRLQKWLLNLRRDNFQPSPSARICSQHFEDGCF 61

Query: 667 KLDNEDXMVLVDKAV 711
             +N   + L   AV
Sbjct: 62  FTNNHGKLCLSKSAV 76


>UniRef50_Q4V6V0 Cluster: IP01025p; n=3; Sophophora|Rep: IP01025p -
           Drosophila melanogaster (Fruit fly)
          Length = 762

 Score = 40.7 bits (91), Expect = 0.035
 Identities = 23/64 (35%), Positives = 32/64 (50%), Gaps = 3/64 (4%)
 Frame = +1

Query: 496 CAVLGCDD-CKNMDSIMFFRFP-EDSSLRQIWTDLTGRN-NWTPTDFSYICIPHFSVDCF 666
           CAV+ C     +  SI F RFP +   L Q W + T R+  W P+ +S +C  HF  + F
Sbjct: 5   CAVINCSHKYVHAGSISFHRFPFKRKDLLQKWKEFTQRSAQWMPSKWSALCSRHFGDEDF 64

Query: 667 KLDN 678
              N
Sbjct: 65  NCSN 68


>UniRef50_Q16XI0 Cluster: Putative uncharacterized protein; n=1;
           Aedes aegypti|Rep: Putative uncharacterized protein -
           Aedes aegypti (Yellowfever mosquito)
          Length = 756

 Score = 40.3 bits (90), Expect = 0.046
 Identities = 17/44 (38%), Positives = 24/44 (54%), Gaps = 1/44 (2%)
 Frame = +1

Query: 523 KNMDSIMFFRFPEDSSLRQIWTDLTGRN-NWTPTDFSYICIPHF 651
           +   +I  F+FPED  LR +W     R+ +W P   + ICI HF
Sbjct: 20  RGQSTISTFKFPEDKHLRDLWITALNRDPSWQPGSTASICINHF 63


>UniRef50_Q0VFB2 Cluster: LOC779531 protein; n=1; Xenopus
           tropicalis|Rep: LOC779531 protein - Xenopus tropicalis
           (Western clawed frog) (Silurana tropicalis)
          Length = 318

 Score = 39.9 bits (89), Expect = 0.060
 Identities = 23/66 (34%), Positives = 31/66 (46%), Gaps = 6/66 (9%)
 Frame = +1

Query: 493 TCAVLGCDD-----CKNMDSIMFFRFPEDSSLRQI-WTDLTGRNNWTPTDFSYICIPHFS 654
           TC   GC++     CK      FFRFP     R   W     R NW P++ S IC  HF+
Sbjct: 4   TCVAYGCNNRFFKGCKKQ----FFRFPMKDRKRLFDWIAAIRRKNWMPSETSRICSDHFT 59

Query: 655 VDCFKL 672
           ++ + L
Sbjct: 60  LNDYML 65



 Score = 33.5 bits (73), Expect = 5.2
 Identities = 20/57 (35%), Positives = 24/57 (42%), Gaps = 1/57 (1%)
 Frame = +1

Query: 484 ITKTCAVLGCDDCKNMDSIMFFRFP-EDSSLRQIWTDLTGRNNWTPTDFSYICIPHF 651
           +   C  L  D C+N+    FFR P ED  L   W     +  W PT    IC  HF
Sbjct: 156 VANRCTNLFYDGCENV----FFRMPMEDPELLGKWVLAIQKKYWKPTISCRICSDHF 208


>UniRef50_Q16XH9 Cluster: Putative uncharacterized protein; n=1;
           Aedes aegypti|Rep: Putative uncharacterized protein -
           Aedes aegypti (Yellowfever mosquito)
          Length = 636

 Score = 39.9 bits (89), Expect = 0.060
 Identities = 17/42 (40%), Positives = 24/42 (57%), Gaps = 1/42 (2%)
 Frame = +1

Query: 547 FRFPEDSSLRQIWTDLTGR-NNWTPTDFSYICIPHFSVDCFK 669
           F+FP+D S+R+ W     R + W P   S IC+ HF  D F+
Sbjct: 28  FKFPDDPSIREQWVAAVARPDGWQPKKTSCICMNHFRKDDFE 69


>UniRef50_UPI000058482A Cluster: PREDICTED: similar to MGC82205
           protein; n=1; Strongylocentrotus purpuratus|Rep:
           PREDICTED: similar to MGC82205 protein -
           Strongylocentrotus purpuratus
          Length = 403

 Score = 39.5 bits (88), Expect = 0.080
 Identities = 20/62 (32%), Positives = 28/62 (45%), Gaps = 4/62 (6%)
 Frame = +1

Query: 493 TCAVLGCDDCKNMDSIM---FFRFP-EDSSLRQIWTDLTGRNNWTPTDFSYICIPHFSVD 660
           +CA   C +  +   +    F R P  D +L ++W    GR +W P   S IC  HF   
Sbjct: 3   SCAATNCKNRSDRAVVRGRSFHRLPLRDPALLKVWLSQMGRESWKPRPSSAICSDHFEKI 62

Query: 661 CF 666
           CF
Sbjct: 63  CF 64


>UniRef50_Q5PQ32 Cluster: LOC495997 protein; n=3; Xenopus|Rep:
           LOC495997 protein - Xenopus laevis (African clawed frog)
          Length = 440

 Score = 39.5 bits (88), Expect = 0.080
 Identities = 24/71 (33%), Positives = 36/71 (50%), Gaps = 12/71 (16%)
 Frame = +1

Query: 496 CAVLGCD--DCKNM--DSIMFFRFP-EDSSLRQIW------TDLTGRNNWTPT-DFSYIC 639
           C+ LGC   D +    ++I F R P +D   R +W      TD +G+  W P+ D+ Y C
Sbjct: 101 CSSLGCTTRDSRQTRNNNISFHRLPRKDDPRRNLWIANCQRTDPSGKGLWDPSSDYVYFC 160

Query: 640 IPHFSVDCFKL 672
             HF   CF++
Sbjct: 161 SKHFEKSCFEV 171



 Score = 33.5 bits (73), Expect = 5.2
 Identities = 17/44 (38%), Positives = 26/44 (59%), Gaps = 6/44 (13%)
 Frame = +1

Query: 538 IMFFRFPEDSSLRQIW-TDLTGRN-----NWTPTDFSYICIPHF 651
           I F RFP++ + RQ+W T +T  +     +WTP+  S +C  HF
Sbjct: 23  ITFHRFPKEQARRQLWITAVTHSHAAVGTDWTPSIHSSLCSQHF 66


>UniRef50_Q293X7 Cluster: GA19783-PA; n=1; Drosophila
           pseudoobscura|Rep: GA19783-PA - Drosophila pseudoobscura
           (Fruit fly)
          Length = 638

 Score = 39.5 bits (88), Expect = 0.080
 Identities = 21/57 (36%), Positives = 28/57 (49%), Gaps = 5/57 (8%)
 Frame = +1

Query: 496 CAVLGCDDC-----KNMDSIMFFRFPEDSSLRQIWTDLTGRNNWTPTDFSYICIPHF 651
           CAVL C +      KN D + FF+FP ++ L + W    G  N      + ICI HF
Sbjct: 3   CAVLNCKNARTASEKNQD-LCFFKFPRNADLAKQWVSFCGNKNALNLKNASICIKHF 58


>UniRef50_A4QP83 Cluster: LOC100005466 protein; n=2; Danio
           rerio|Rep: LOC100005466 protein - Danio rerio
           (Zebrafish) (Brachydanio rerio)
          Length = 619

 Score = 38.3 bits (85), Expect = 0.18
 Identities = 21/61 (34%), Positives = 27/61 (44%), Gaps = 4/61 (6%)
 Frame = +1

Query: 496 CAVLGCDDCKNMD----SIMFFRFPEDSSLRQIWTDLTGRNNWTPTDFSYICIPHFSVDC 663
           CA  GC + +        I F RFP D   RQ WT    R+ + P   S +C  HF  + 
Sbjct: 16  CAAYGCSNERTKKLKDKGITFHRFPRDVKRRQAWTLALRRDKFEPKPRSLLCSCHFRPED 75

Query: 664 F 666
           F
Sbjct: 76  F 76


>UniRef50_Q1KZX7 Cluster: Transposase; n=1; Anopheles gambiae str.
           PEST|Rep: Transposase - Anopheles gambiae str. PEST
          Length = 879

 Score = 38.3 bits (85), Expect = 0.18
 Identities = 22/67 (32%), Positives = 36/67 (53%), Gaps = 6/67 (8%)
 Frame = +1

Query: 490 KTCAVLGC-DDCKNMD----SIMFFRFPEDSSLRQIWTDLTGRN-NWTPTDFSYICIPHF 651
           ++CA   C ++ +N+     +I F  FP D SL + W D   R+ +W PT  S +C  HF
Sbjct: 3   RSCAAAFCKNNAENVKKRGLNITFHSFPSDDSLPK-WIDFCKRDEHWKPTKISTVCSLHF 61

Query: 652 SVDCFKL 672
             D +++
Sbjct: 62  KPDDYQM 68


>UniRef50_UPI0000F1DF3B Cluster: PREDICTED: hypothetical protein;
           n=1; Danio rerio|Rep: PREDICTED: hypothetical protein -
           Danio rerio
          Length = 620

 Score = 37.5 bits (83), Expect = 0.32
 Identities = 17/43 (39%), Positives = 22/43 (51%), Gaps = 1/43 (2%)
 Frame = +1

Query: 544 FFRFPEDSSLR-QIWTDLTGRNNWTPTDFSYICIPHFSVDCFK 669
           F +FP +  LR + W       NW PT  S +C  HF  DCF+
Sbjct: 28  FHKFPLEDGLRVREWLRRMRWQNWWPTGNSVLCSDHFEKDCFE 70


>UniRef50_Q08CB6 Cluster: Zgc:153292; n=2; Danio rerio|Rep:
           Zgc:153292 - Danio rerio (Zebrafish) (Brachydanio rerio)
          Length = 415

 Score = 37.5 bits (83), Expect = 0.32
 Identities = 17/43 (39%), Positives = 22/43 (51%), Gaps = 1/43 (2%)
 Frame = +1

Query: 544 FFRFPEDSSLR-QIWTDLTGRNNWTPTDFSYICIPHFSVDCFK 669
           F +FP +  LR + W       NW PT  S +C  HF  DCF+
Sbjct: 28  FHKFPLEDGLRVREWLRRMRWQNWWPTGNSVLCSDHFEKDCFE 70


>UniRef50_UPI00015B5079 Cluster: PREDICTED: hypothetical protein;
           n=1; Nasonia vitripennis|Rep: PREDICTED: hypothetical
           protein - Nasonia vitripennis
          Length = 657

 Score = 36.3 bits (80), Expect = 0.74
 Identities = 21/76 (27%), Positives = 35/76 (46%), Gaps = 3/76 (3%)
 Frame = +1

Query: 484 ITKTCAVLGCDDC--KNMD-SIMFFRFPEDSSLRQIWTDLTGRNNWTPTDFSYICIPHFS 654
           +   C+V  C++   K  D  I +F+FP+D      W     + N    + +++C  HF 
Sbjct: 1   MASNCSVKYCENENEKTKDRGIRYFKFPKDPETAAKWVKACSKKN-IDLNCAHVCSVHFQ 59

Query: 655 VDCFKLDNEDXMVLVD 702
            DCF  + +D  V  D
Sbjct: 60  EDCFIKNPDDLNVSGD 75


>UniRef50_UPI00015B45C1 Cluster: PREDICTED: similar to viral A-type
           inclusion protein, putative; n=1; Nasonia
           vitripennis|Rep: PREDICTED: similar to viral A-type
           inclusion protein, putative - Nasonia vitripennis
          Length = 450

 Score = 36.3 bits (80), Expect = 0.74
 Identities = 13/50 (26%), Positives = 26/50 (52%)
 Frame = +1

Query: 511 CDDCKNMDSIMFFRFPEDSSLRQIWTDLTGRNNWTPTDFSYICIPHFSVD 660
           C      ++  FF  P+D  +R++W +     ++T  + +Y+C  HF+ D
Sbjct: 2   CKSKNYKNTYSFFSAPKDPEIRRLWQEAIKIKDYTVNEDTYVCSKHFTKD 51


>UniRef50_Q9VIS5 Cluster: CG10631-PA; n=1; Drosophila
            melanogaster|Rep: CG10631-PA - Drosophila melanogaster
            (Fruit fly)
          Length = 3781

 Score = 35.9 bits (79), Expect = 0.98
 Identities = 21/59 (35%), Positives = 26/59 (44%), Gaps = 3/59 (5%)
 Frame = +1

Query: 496  CAVLGCDDCKNMDSIMFFRFP-EDSSLRQIWTDLTGRNNWTPTDFS--YICIPHFSVDC 663
            C V GC+  K    +  FRFP ED  +   W +    N   P D +   IC  HF  DC
Sbjct: 1425 CVVEGCEASKEQPDVRLFRFPTEDDDMLWKWCNNLKMN---PVDCTGVRICNKHFEADC 1480


>UniRef50_Q8WTV1 Cluster: THAP domain-containing protein 3; n=12;
           Theria|Rep: THAP domain-containing protein 3 - Homo
           sapiens (Human)
          Length = 239

 Score = 35.9 bits (79), Expect = 0.98
 Identities = 20/62 (32%), Positives = 27/62 (43%), Gaps = 3/62 (4%)
 Frame = +1

Query: 490 KTCAVLGCDD--CKNMDSIMFFRFP-EDSSLRQIWTDLTGRNNWTPTDFSYICIPHFSVD 660
           K+CA   C +        + F RFP     L + W    GR N+ P   + IC  HF  +
Sbjct: 3   KSCAARQCCNRYSSRRKQLTFHRFPFSRPELLKEWVLNIGRGNFKPKQHTVICSEHFRPE 62

Query: 661 CF 666
           CF
Sbjct: 63  CF 64


>UniRef50_Q16MP6 Cluster: Putative uncharacterized protein; n=1;
           Aedes aegypti|Rep: Putative uncharacterized protein -
           Aedes aegypti (Yellowfever mosquito)
          Length = 286

 Score = 35.5 bits (78), Expect = 1.3
 Identities = 18/68 (26%), Positives = 28/68 (41%), Gaps = 6/68 (8%)
 Frame = +1

Query: 484 ITKTCAVLGC-DDCKNMDSIMFFRFPEDSSLRQIWTDLTGRNNWTPTDFS-----YICIP 645
           ++K C    C +      S+ +F FP+ +     W    GR +            Y+C  
Sbjct: 1   MSKRCCAASCYNSVATNRSVEYFGFPKSNEYAAAWAKAAGREDLLEKSLCNIIKYYLCSE 60

Query: 646 HFSVDCFK 669
           HFS DCF+
Sbjct: 61  HFSDDCFQ 68


>UniRef50_Q9BT49 Cluster: THAP domain-containing protein 7; n=18;
           Eumetazoa|Rep: THAP domain-containing protein 7 - Homo
           sapiens (Human)
          Length = 309

 Score = 35.5 bits (78), Expect = 1.3
 Identities = 22/71 (30%), Positives = 34/71 (47%), Gaps = 12/71 (16%)
 Frame = +1

Query: 496 CAVLGCDDCKNMDS----IMFFRFPE-DSSLRQIWT------DLTGRNNWTP-TDFSYIC 639
           C+  GC      ++    I F R P+ D+  R +W       D +G+  W P +++ Y C
Sbjct: 5   CSAAGCCTRDTRETRNRGISFHRLPKKDNPRRGLWLANCQRLDPSGQGLWDPASEYIYFC 64

Query: 640 IPHFSVDCFKL 672
             HF  DCF+L
Sbjct: 65  SKHFEEDCFEL 75


>UniRef50_Q03D70 Cluster: ATP-dependent nuclease, subunit B; n=1;
           Pediococcus pentosaceus ATCC 25745|Rep: ATP-dependent
           nuclease, subunit B - Pediococcus pentosaceus (strain
           ATCC 25745 / 183-1w)
          Length = 1192

 Score = 35.1 bits (77), Expect = 1.7
 Identities = 22/81 (27%), Positives = 35/81 (43%)
 Frame = +1

Query: 349 SVDMNINLVYKMKDSLADFNDNKTKPTPIAKVAPELKQNADTDMGITKTCAVLGCDDCKN 528
           ++D  + LV K+   LA+F      P  + +VA  + +N+D  M I      LG      
Sbjct: 119 NIDAQLGLVQKIATQLAEFKQGNVGPDELGRVAENIAENSDAGMDIKAKLHDLGIIYSAY 178

Query: 529 MDSIMFFRFPEDSSLRQIWTD 591
            + I   RF + S +    TD
Sbjct: 179 EEEIQ-SRFIDASDITHTLTD 198


>UniRef50_Q96EK4 Cluster: THAP domain-containing protein 11; n=20;
           Mammalia|Rep: THAP domain-containing protein 11 - Homo
           sapiens (Human)
          Length = 313

 Score = 35.1 bits (77), Expect = 1.7
 Identities = 19/61 (31%), Positives = 30/61 (49%), Gaps = 8/61 (13%)
 Frame = +1

Query: 493 TCAVLGCDDCKNMDSIM-FFRFPEDSSLRQIWTDLTGR-------NNWTPTDFSYICIPH 648
           TC V GC +  + D  + F+ FP+D+ LR++W     R       + + PT    +C  H
Sbjct: 5   TCCVPGCYNNSHRDKALHFYTFPKDAELRRLWLKNVSRAGVSGCFSTFQPTTGHRLCSVH 64

Query: 649 F 651
           F
Sbjct: 65  F 65


>UniRef50_UPI0000D56D8D Cluster: PREDICTED: hypothetical protein;
           n=1; Tribolium castaneum|Rep: PREDICTED: hypothetical
           protein - Tribolium castaneum
          Length = 897

 Score = 34.7 bits (76), Expect = 2.3
 Identities = 16/53 (30%), Positives = 28/53 (52%)
 Frame = +1

Query: 517 DCKNMDSIMFFRFPEDSSLRQIWTDLTGRNNWTPTDFSYICIPHFSVDCFKLD 675
           +CKN  +     FP+ + + + W +     ++ P D S++C+ HFS D   LD
Sbjct: 16  NCKNT-ATTGIPFPKQAEILKQWLEALEIPDFVPDDTSFVCLDHFSNDSDVLD 67


>UniRef50_Q566P8 Cluster: LOC553397 protein; n=5; Danio rerio|Rep:
           LOC553397 protein - Danio rerio (Zebrafish) (Brachydanio
           rerio)
          Length = 216

 Score = 34.7 bits (76), Expect = 2.3
 Identities = 16/57 (28%), Positives = 31/57 (54%), Gaps = 4/57 (7%)
 Frame = +1

Query: 493 TCAVLGCDDCKNMDS----IMFFRFPEDSSLRQIWTDLTGRNNWTPTDFSYICIPHF 651
           +CA  GC + + + +    I F +FP+++ LR+ W     R  ++ ++ S +C  HF
Sbjct: 19  SCAAWGCKNRRAVQTKSRGITFHKFPKENVLRKQWEIALKRKGFSASESSVLCSEHF 75


>UniRef50_Q60MX8 Cluster: Putative uncharacterized protein CBG22961;
           n=1; Caenorhabditis briggsae|Rep: Putative
           uncharacterized protein CBG22961 - Caenorhabditis
           briggsae
          Length = 1047

 Score = 34.7 bits (76), Expect = 2.3
 Identities = 23/69 (33%), Positives = 30/69 (43%), Gaps = 13/69 (18%)
 Frame = +1

Query: 511 CDDCKNMDSI--MFFRFPEDSSLRQIWTDLTG-------RNNWTPTDFSY----ICIPHF 651
           C  C  +  I  M   FP D   R+IW +L G       R+   P  FS     IC  HF
Sbjct: 182 CTVCNRIMKIGEMHLNFPADLDRRRIWANLLGFKYKDILRSKMGPVSFSIAAGPICTEHF 241

Query: 652 SVDCFKLDN 678
           + +CF+  N
Sbjct: 242 AEECFRNHN 250


>UniRef50_Q4V5S0 Cluster: IP06774p; n=4; Sophophora|Rep: IP06774p -
           Drosophila melanogaster (Fruit fly)
          Length = 255

 Score = 34.7 bits (76), Expect = 2.3
 Identities = 20/61 (32%), Positives = 29/61 (47%), Gaps = 3/61 (4%)
 Frame = +1

Query: 496 CAVLGCDD---CKNMDSIMFFRFPEDSSLRQIWTDLTGRNNWTPTDFSYICIPHFSVDCF 666
           CAV  C +     N     +F FP++    Q W D   R+N  PT  + IC  HF+ + F
Sbjct: 3   CAVKNCGNNNRIANRTKWRYFHFPKEKPNLQRWIDFCQRDNINPTT-ACICNEHFAPNDF 61

Query: 667 K 669
           +
Sbjct: 62  E 62


>UniRef50_A4VE92 Cluster: Putative uncharacterized protein; n=1;
           Tetrahymena thermophila SB210|Rep: Putative
           uncharacterized protein - Tetrahymena thermophila SB210
          Length = 113

 Score = 34.7 bits (76), Expect = 2.3
 Identities = 26/63 (41%), Positives = 34/63 (53%), Gaps = 1/63 (1%)
 Frame = -3

Query: 470 SAFCFNSGATFAIGVGFV-LLSLKSAKLSFILYTRLIFISTEIQILPI*ISRLLSQNGIF 294
           S  CFNS ++F   VGFV LLSL+      +LY  L  +S +  +  I  + LL  N IF
Sbjct: 46  SGICFNSYSSFLSYVGFVQLLSLELTFNFCLLYDILCLLSIKTCMFFIKRNFLLCNNIIF 105

Query: 293 IIF 285
            IF
Sbjct: 106 CIF 108


>UniRef50_UPI0000DB74EF Cluster: PREDICTED: hypothetical protein;
           n=1; Apis mellifera|Rep: PREDICTED: hypothetical protein
           - Apis mellifera
          Length = 409

 Score = 34.3 bits (75), Expect = 3.0
 Identities = 16/52 (30%), Positives = 25/52 (48%), Gaps = 5/52 (9%)
 Frame = +1

Query: 490 KTCAVLGC-----DDCKNMDSIMFFRFPEDSSLRQIWTDLTGRNNWTPTDFS 630
           + C + GC        K+ +    F FP++  LR+ W     R NW+PT +S
Sbjct: 252 RRCCIPGCKGNYDSTLKSNNYASVFLFPKNEELRKKWLAAIPRKNWSPTKYS 303


>UniRef50_UPI00015B5019 Cluster: PREDICTED: similar to GA20163-PA;
           n=1; Nasonia vitripennis|Rep: PREDICTED: similar to
           GA20163-PA - Nasonia vitripennis
          Length = 843

 Score = 33.9 bits (74), Expect = 4.0
 Identities = 22/66 (33%), Positives = 28/66 (42%), Gaps = 6/66 (9%)
 Frame = +1

Query: 496 CAVLGCDDCKNMDSIMFFRFPEDSSLRQIWTDLTGRNNW---TPTDF--SY-ICIPHFSV 657
           CA   C      D   FFRFP+D    + W     RN+    TP +   SY +C  HF+ 
Sbjct: 9   CAFKNCRGLSRRDKRSFFRFPKDPQRSKQWVVACDRNDLLEKTPIELFNSYRVCAKHFTD 68

Query: 658 DCFKLD 675
             F  D
Sbjct: 69  TMFLND 74


>UniRef50_UPI0000F1E17E Cluster: PREDICTED: hypothetical protein,
           partial; n=1; Danio rerio|Rep: PREDICTED: hypothetical
           protein, partial - Danio rerio
          Length = 391

 Score = 33.9 bits (74), Expect = 4.0
 Identities = 15/49 (30%), Positives = 23/49 (46%)
 Frame = +1

Query: 505 LGCDDCKNMDSIMFFRFPEDSSLRQIWTDLTGRNNWTPTDFSYICIPHF 651
           L   + K +  + F+RFP D   ++ W     R N+ P   S +C  HF
Sbjct: 10  LNRSESKQLKKLSFYRFPCDEKQKRKWLQSIRRKNFYPNCNSRVCSWHF 58


>UniRef50_Q5TRK2 Cluster: ENSANGP00000029564; n=1; Anopheles gambiae
           str. PEST|Rep: ENSANGP00000029564 - Anopheles gambiae
           str. PEST
          Length = 455

 Score = 33.9 bits (74), Expect = 4.0
 Identities = 15/55 (27%), Positives = 27/55 (49%), Gaps = 1/55 (1%)
 Frame = +1

Query: 535 SIMFFRFPEDSSLRQIWTDLTGRN-NWTPTDFSYICIPHFSVDCFKLDNEDXMVL 696
           +++F  FP D +LR  W     R  +WTP     +C  HF  + +++ +   + L
Sbjct: 24  ALIFHAFPADEALRSRWMAFCRRGVDWTPFKTDAVCSAHFRHEDYQMAHSPLLKL 78


>UniRef50_Q17N87 Cluster: Putative uncharacterized protein; n=1;
           Aedes aegypti|Rep: Putative uncharacterized protein -
           Aedes aegypti (Yellowfever mosquito)
          Length = 199

 Score = 33.9 bits (74), Expect = 4.0
 Identities = 15/56 (26%), Positives = 24/56 (42%)
 Frame = +1

Query: 484 ITKTCAVLGCDDCKNMDSIMFFRFPEDSSLRQIWTDLTGRNNWTPTDFSYICIPHF 651
           I K+C V       +  ++   RFP+D   R+ W       + T  D + +C  HF
Sbjct: 5   IIKSCGVTSSIARNSAGAVSLHRFPKDKETRKRWVQFCNEPDLTNPDAAVLCSRHF 60


>UniRef50_Q16PJ2 Cluster: Putative uncharacterized protein; n=1;
           Aedes aegypti|Rep: Putative uncharacterized protein -
           Aedes aegypti (Yellowfever mosquito)
          Length = 157

 Score = 33.9 bits (74), Expect = 4.0
 Identities = 16/41 (39%), Positives = 23/41 (56%), Gaps = 1/41 (2%)
 Frame = +1

Query: 535 SIMFFRFPEDSSLRQIWTDLTG-RNNWTPTDFSYICIPHFS 654
           +I FFRFP D  L+ +W    G  +N+  T  S +C  HF+
Sbjct: 18  NIGFFRFPGDEDLQSVWKKFCGVSDNFEITACSRVCSFHFA 58


>UniRef50_UPI0001556045 Cluster: PREDICTED: similar to THAP domain
           containing 8, partial; n=1; Ornithorhynchus
           anatinus|Rep: PREDICTED: similar to THAP domain
           containing 8, partial - Ornithorhynchus anatinus
          Length = 214

 Score = 33.5 bits (73), Expect = 5.2
 Identities = 13/40 (32%), Positives = 20/40 (50%), Gaps = 1/40 (2%)
 Frame = +1

Query: 553 FP-EDSSLRQIWTDLTGRNNWTPTDFSYICIPHFSVDCFK 669
           FP +D +  Q W     +  W PT   ++C  HF+  CF+
Sbjct: 1   FPLQDPARLQEWLQQMRQEQWVPTRHQHLCSEHFAPSCFE 40


>UniRef50_Q6NYT2 Cluster: Zgc:65871; n=2; Danio rerio|Rep: Zgc:65871
           - Danio rerio (Zebrafish) (Brachydanio rerio)
          Length = 257

 Score = 33.5 bits (73), Expect = 5.2
 Identities = 19/61 (31%), Positives = 29/61 (47%), Gaps = 8/61 (13%)
 Frame = +1

Query: 493 TCAVLGCDDCKNMD-SIMFFRFPEDSSLRQIWTDLTGR-------NNWTPTDFSYICIPH 648
           TC V GC +  + D  + F+ FP+D + R+IW     R       + + PT    +C  H
Sbjct: 5   TCCVPGCYNNSHRDRDLRFYTFPKDPTQREIWLKNISRAGVSGCFSTFQPTTGHRVCSVH 64

Query: 649 F 651
           F
Sbjct: 65  F 65


>UniRef50_Q1KZX8 Cluster: Transposase; n=2; Anopheles gambiae str.
           PEST|Rep: Transposase - Anopheles gambiae str. PEST
          Length = 877

 Score = 33.5 bits (73), Expect = 5.2
 Identities = 16/46 (34%), Positives = 19/46 (41%), Gaps = 1/46 (2%)
 Frame = +1

Query: 538 IMFFRFPEDSSLRQIWTDLTGRN-NWTPTDFSYICIPHFSVDCFKL 672
           I F  FP    LR  W     R  NW P+    IC  HF    F++
Sbjct: 24  ISFHIFPNYDPLRHAWVQFCNREENWEPSKRDVICSAHFQESDFQM 69


>UniRef50_Q9P2Z0 Cluster: THAP domain-containing protein 10; n=6;
           Theria|Rep: THAP domain-containing protein 10 - Homo
           sapiens (Human)
          Length = 257

 Score = 33.5 bits (73), Expect = 5.2
 Identities = 17/47 (36%), Positives = 25/47 (53%), Gaps = 3/47 (6%)
 Frame = +1

Query: 547 FRFPEDSSLRQIWTDLTG--RNNW-TPTDFSYICIPHFSVDCFKLDN 678
           FRFP+D ++R +W       R +W    D S IC  HF+  CF + +
Sbjct: 21  FRFPKDRAVRLLWDRFVRGCRADWYGGNDRSVICSDHFAPACFDVSS 67


>UniRef50_P34427 Cluster: Protein lin-36; n=2; Caenorhabditis
           elegans|Rep: Protein lin-36 - Caenorhabditis elegans
          Length = 962

 Score = 33.5 bits (73), Expect = 5.2
 Identities = 20/57 (35%), Positives = 26/57 (45%), Gaps = 11/57 (19%)
 Frame = +1

Query: 541 MFFRFPEDSSLRQIWTDLTG-------RNNWTPTDFSY----ICIPHFSVDCFKLDN 678
           M   FP D   R+IW +L G       R+   P  FS     IC  HF+ +CF+  N
Sbjct: 178 MHLNFPADLDRRRIWANLLGFKYKDILRSKMGPVSFSIAAGPICTEHFAEECFRNHN 234


>UniRef50_UPI00015B40B2 Cluster: PREDICTED: similar to THAP domain
           containing 4; n=1; Nasonia vitripennis|Rep: PREDICTED:
           similar to THAP domain containing 4 - Nasonia
           vitripennis
          Length = 202

 Score = 33.1 bits (72), Expect = 6.9
 Identities = 14/35 (40%), Positives = 16/35 (45%)
 Frame = +1

Query: 547 FRFPEDSSLRQIWTDLTGRNNWTPTDFSYICIPHF 651
           FRFP     R+ W D   R NW P     +C  HF
Sbjct: 14  FRFPSSDVKRKQWLDAIRRPNWKPKKGHGLCGEHF 48


>UniRef50_UPI0000E87B1B Cluster: O-Antigen Polymerase; n=1;
           Methylophilales bacterium HTCC2181|Rep: O-Antigen
           Polymerase - Methylophilales bacterium HTCC2181
          Length = 435

 Score = 33.1 bits (72), Expect = 6.9
 Identities = 22/56 (39%), Positives = 30/56 (53%)
 Frame = -3

Query: 446 ATFAIGVGFVLLSLKSAKLSFILYTRLIFISTEIQILPI*ISRLLSQNGIFIIFTA 279
           A F   +GFV + +   ++SFIL+T  I I +      I  S+LL   G FII TA
Sbjct: 219 ALFLFFIGFVTVLITGERMSFILFTSSILIISLAMPTQI-KSKLLIIFGFFIILTA 273


>UniRef50_A2A8D4 Cluster: THAP domain containing, apoptosis
           associated protein 3; n=1; Mus musculus|Rep: THAP domain
           containing, apoptosis associated protein 3 - Mus
           musculus (Mouse)
          Length = 184

 Score = 33.1 bits (72), Expect = 6.9
 Identities = 19/62 (30%), Positives = 27/62 (43%), Gaps = 3/62 (4%)
 Frame = +1

Query: 490 KTCAVLGCDD--CKNMDSIMFFRFP-EDSSLRQIWTDLTGRNNWTPTDFSYICIPHFSVD 660
           K+CA   C +        + F RFP     L + W    GR ++ P   + IC  HF  +
Sbjct: 3   KSCAARQCCNRYSSRRKQLTFHRFPFSRPELLREWVLNIGRADFKPKQHTVICSEHFRPE 62

Query: 661 CF 666
           CF
Sbjct: 63  CF 64


>UniRef50_Q8BJ25 Cluster: THAP domain-containing protein 3; n=9;
           Eutheria|Rep: THAP domain-containing protein 3 - Mus
           musculus (Mouse)
          Length = 218

 Score = 33.1 bits (72), Expect = 6.9
 Identities = 19/62 (30%), Positives = 27/62 (43%), Gaps = 3/62 (4%)
 Frame = +1

Query: 490 KTCAVLGCDD--CKNMDSIMFFRFP-EDSSLRQIWTDLTGRNNWTPTDFSYICIPHFSVD 660
           K+CA   C +        + F RFP     L + W    GR ++ P   + IC  HF  +
Sbjct: 3   KSCAARQCCNRYSSRRKQLTFHRFPFSRPELLREWVLNIGRADFKPKQHTVICSEHFRPE 62

Query: 661 CF 666
           CF
Sbjct: 63  CF 64


>UniRef50_Q6IR68 Cluster: LOC432088 protein; n=3; Xenopus|Rep:
           LOC432088 protein - Xenopus laevis (African clawed frog)
          Length = 318

 Score = 32.7 bits (71), Expect = 9.2
 Identities = 19/61 (31%), Positives = 28/61 (45%), Gaps = 8/61 (13%)
 Frame = +1

Query: 493 TCAVLGCDDCKNMDS-IMFFRFPEDSSLRQIWTDLTGR-------NNWTPTDFSYICIPH 648
           TC V GC    + D  + F+ FP+D  LR +W     R       + + PT+   +C  H
Sbjct: 25  TCCVPGCYSNSHRDKGLHFYTFPKDPELRCLWLKNVSRGGVSGCFSTFQPTNGHRVCSLH 84

Query: 649 F 651
           F
Sbjct: 85  F 85


>UniRef50_A6T1S2 Cluster: Uncharacterized conserved protein; n=2;
           Janthinobacterium sp. Marseille|Rep: Uncharacterized
           conserved protein - Janthinobacterium sp. (strain
           Marseille) (Minibacterium massiliensis)
          Length = 901

 Score = 32.7 bits (71), Expect = 9.2
 Identities = 19/58 (32%), Positives = 30/58 (51%)
 Frame = +1

Query: 346 ISVDMNINLVYKMKDSLADFNDNKTKPTPIAKVAPELKQNADTDMGITKTCAVLGCDD 519
           I  D+  NL+ ++ D LA F D+   P PI   AP+++ + +    +TK  AV    D
Sbjct: 307 IIADLKGNLIKRL-DELASFRDSLFDPKPIEVKAPKIQADPELLQRLTKPKAVKPAQD 363


>UniRef50_Q7Q704 Cluster: ENSANGP00000017828; n=1; Anopheles gambiae
           str. PEST|Rep: ENSANGP00000017828 - Anopheles gambiae
           str. PEST
          Length = 162

 Score = 32.7 bits (71), Expect = 9.2
 Identities = 20/59 (33%), Positives = 25/59 (42%), Gaps = 6/59 (10%)
 Frame = +1

Query: 493 TCAVLGCDDC-----KNMDSIMFFRFPEDSSLRQIWTDLTGRN-NWTPTDFSYICIPHF 651
           +CAV  C++      K M  I F  FP D   RQ W     R  +W P     +C  HF
Sbjct: 5   SCAVADCNNNRRNVRKRMLDIGFHTFPSDPVQRQRWVKFCQREPSWQPKSCDSMCSVHF 63


>UniRef50_Q1KZX9 Cluster: Transposase; n=1; Anopheles gambiae str.
           PEST|Rep: Transposase - Anopheles gambiae str. PEST
          Length = 894

 Score = 32.7 bits (71), Expect = 9.2
 Identities = 13/49 (26%), Positives = 24/49 (48%), Gaps = 1/49 (2%)
 Frame = +1

Query: 535 SIMFFRFPEDSSLRQIWTDLTGRN-NWTPTDFSYICIPHFSVDCFKLDN 678
           ++ F +FPE    +Q W     R  +W P+  + +C  HF    ++L +
Sbjct: 25  NVCFHKFPEGKETKQKWIAFCQREISWIPSSSNVVCSQHFLPSDYQLSS 73


  Database: uniref50
    Posted date:  Oct 5, 2007 11:19 AM
  Number of letters in database: 575,637,011
  Number of sequences in database:  1,657,284
  
Lambda     K      H
   0.318    0.134    0.401 

Gapped
Lambda     K      H
   0.279   0.0580    0.190 


Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 603,311,763
Number of Sequences: 1657284
Number of extensions: 11163986
Number of successful extensions: 23639
Number of sequences better than 10.0: 62
Number of HSP's better than 10.0 without gapping: 22965
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 23625
length of database: 575,637,011
effective HSP length: 98
effective length of database: 413,223,179
effective search space used: 57024798702
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)
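
The bit scores and the effective database length printed in this report can be checked from the statistical parameters listed above. The sketch below is not part of the BLAST output. It assumes the standard Karlin-Altschul conversion, bit score = (Lambda * raw score - ln K) / ln 2, using the "Gapped" Lambda and K values, and it assumes the effective database length is the total number of letters minus one effective HSP length per database sequence. The function names are hypothetical, chosen for illustration only.

```python
import math

# Values copied from the statistics block of this report.
GAPPED_LAMBDA = 0.279          # "Gapped" Lambda
GAPPED_K = 0.0580              # "Gapped" K
DB_LETTERS = 575_637_011       # length of database
DB_SEQUENCES = 1_657_284       # Number of Sequences
EFFECTIVE_HSP_LENGTH = 98      # effective HSP length


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a bit score (assumed standard formula)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def effective_db_length(letters=DB_LETTERS, sequences=DB_SEQUENCES,
                        hsp_length=EFFECTIVE_HSP_LENGTH):
    """Subtract one effective HSP length per sequence (assumed edge correction)."""
    return letters - sequences * hsp_length


if __name__ == "__main__":
    # Raw scores that appear in parentheses on the "Score =" lines above.
    for raw in (76, 75, 74, 73, 72, 71):
        print(f"raw {raw} -> {bit_score(raw):.1f} bits")
    print("effective database length:", effective_db_length())
```

With the rounded parameters printed in the report, this reproduces the bit scores shown on the "Score =" lines (for example raw score 76 gives 34.7 bits) and the reported effective database length of 413,223,179. The Expect values are not recomputed here, because the rounded Lambda and K and any program-specific corrections make an exact match uncertain.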
