BLASTX 2.2.12 [Aug-07-2005]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= prgv0166
         (526 letters)

Database: uniref50
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

UniRef50_Q54DI3 Cluster: Putative uncharacterized protein; n=1; ...    37   0.33
UniRef50_A2DIR5 Cluster: Putative uncharacterized protein; n=1; ...    35   1.3
UniRef50_Q23H27 Cluster: Putative uncharacterized protein; n=1; ...    34   2.3
UniRef50_Q07265 Cluster: 3 alpha procollagen; n=4; Strongylocent...    33   3.0
UniRef50_Q5AU25 Cluster: Putative uncharacterized protein; n=1; ...    33   3.0
UniRef50_UPI0000DB7124 Cluster: PREDICTED: similar to CG32030-PA...    33   4.0
UniRef50_UPI00006CD0ED Cluster: hypothetical protein TTHERM_0012...    33   4.0
UniRef50_Q4KLP7 Cluster: MGC115718 protein; n=2; Xenopus|Rep: MG...    33   4.0
UniRef50_Q7PUR9 Cluster: ENSANGP00000008445; n=1; Anopheles gamb...    33   4.0
UniRef50_Q7ZWE3 Cluster: La-related protein 7; n=3; Clupeocephal...    33   4.0
UniRef50_Q5WXV5 Cluster: Putative uncharacterized protein; n=4; ...    33   5.3
UniRef50_Q399J8 Cluster: Putative uncharacterized protein; n=9; ...    33   5.3
UniRef50_A1ZJB8 Cluster: Putative uncharacterized protein; n=1; ...    33   5.3
UniRef50_Q9M1Q4 Cluster: Putative uncharacterized protein T17J13...    32   7.0
UniRef50_A7PS77 Cluster: Chromosome chr14 scaffold_27, whole gen...    32   7.0
UniRef50_A6R9E6 Cluster: Predicted protein; n=1; Ajellomyces cap...    32   7.0
UniRef50_UPI0001556400 Cluster: PREDICTED: similar to Dlx1, part...    32   9.3
UniRef50_Q4RN77 Cluster: Chromosome undetermined SCAF15016, whol...    32   9.3
UniRef50_Q22MK6 Cluster: Cyclic nucleotide-binding domain contai...    32   9.3
UniRef50_Q0E9Q0 Cluster: CG41441-PA; n=1; Drosophila melanogaste...    32   9.3
UniRef50_A2FMG8 Cluster: Putative uncharacterized protein; n=2; ...    32   9.3

>UniRef50_Q54DI3 Cluster: Putative uncharacterized protein; n=1; Dictyostelium discoideum AX4|Rep: Putative uncharacterized protein - Dictyostelium discoideum AX4
          Length = 1825

 Score = 36.7 bits (81), Expect = 0.33
 Identities = 26/87 (29%), Positives = 44/87 (50%)
 Frame = +1

Query: 247  KL*SATKSIKGNSMVSCVQMKSETSASSYQLNKLSNMSGLKVTKMTQNKQSPEEQSQANA 426
            +L S   SIK  S++  ++ KS T   + QLN  +  SG  V  + Q   +P  +S  ++
Sbjct: 1159 RLNSQMHSIK--SLLQNIESKSSTLVMNQQLNSSNGQSGSSVGGVAQTLMTPNSESFQSS 1216

Query: 427  SKRTIDAIQKLQMQGLLVKKPRLDSET 507
            SKR I  + + Q Q    ++ + DS +
Sbjct: 1217 SKRGI--VNRRQQQQQQQQQQQFDSSS 1241

>UniRef50_A2DIR5 Cluster: Putative uncharacterized protein; n=1; Trichomonas vaginalis G3|Rep: Putative uncharacterized protein - Trichomonas vaginalis G3
          Length = 861

 Score = 34.7 bits (76), Expect = 1.3
 Identities = 25/72 (34%), Positives = 35/72 (48%)
 Frame = +2

Query: 32  PPVPNVQSPRFGKPLKVSPGAIINPKRPNSEDGPFSCFKKPKESLLPISESSNLGPNDDG 211
           PP      P+  KP+++SP A  NP  P+S    F  F  P  S++P  E      +DDG
Sbjct: 527 PPSLLTPQPKESKPVEISPIAHDNPPAPSS----FDDFSIP-PSIVPSEEDDK---DDDG 578

Query: 212 HVEFASSSQTSK 247
            V+  S S + K
Sbjct: 579 VVKIPSDSDSDK 590

>UniRef50_Q23H27 Cluster: Putative uncharacterized protein; n=1; Tetrahymena thermophila SB210|Rep: Putative uncharacterized protein - Tetrahymena thermophila SB210
          Length = 402

 Score = 33.9 bits (74), Expect = 2.3
 Identities = 28/87 (32%), Positives = 42/87 (48%), Gaps = 6/87 (6%)
 Frame = +1

Query: 256 SATKSIKGNSMVSCVQMKSETSASSY----QLNKLSNMSGLKVTKMTQNKQSPEEQSQAN 423
           S  KS KG ++  C++    T   S+    Q NK++++SG K TK T   Q  E  +  +
Sbjct: 39  SKEKSTKGLNLSFCMKRDLSTLNKSFIMASQQNKMNDLSGSKSTKFTSQYQDQEFDTFKS 98

Query: 424 ASK--RTIDAIQKLQMQGLLVKKPRLD 498
             +  R+I    K  M+ LL K   LD
Sbjct: 99  RGRVLRSISGTNK--MRELLNKNQTLD 123

>UniRef50_Q07265 Cluster: 3 alpha procollagen; n=4; Strongylocentrotus purpuratus|Rep: 3 alpha procollagen - Strongylocentrotus purpuratus (Purple sea urchin)
          Length = 1752

 Score = 33.5 bits (73), Expect = 3.0
 Identities = 16/34 (47%), Positives = 20/34 (58%), Gaps = 2/34 (5%)
 Frame = -2

Query: 198 GPKFDDSEIGKRDSLGFL--KQLKGPSSEFGLFG 103
           GPK D  E G +   GFL  K +KGP  ++GL G
Sbjct: 259 GPKGDQGEYGDKGDKGFLGMKGMKGPKGQYGLKG 292

>UniRef50_Q5AU25 Cluster: Putative uncharacterized protein; n=1; Emericella nidulans|Rep: Putative uncharacterized protein - Emericella nidulans (Aspergillus nidulans)
          Length = 1347

 Score = 33.5 bits (73), Expect = 3.0
 Identities = 25/86 (29%), Positives = 40/86 (46%), Gaps = 1/86 (1%)
 Frame = +2

Query: 14  NTARLRP-PVPNVQSPRFGKPLKVSPGAIINPKRPNSEDGPFSCFKKPKESLLPISESSN 190
           NT RL P P P +  P    PL  +P ++      +   G  +      +S+ P++ SSN
Sbjct: 323 NTPRLHPVPSPGLPQP---SPLTTTPNSL------SGYFGSAANTPAAAQSMTPVAPSSN 373

Query: 191 LGPNDDGHVEFASSSQTSKNFRAQLN 268
            G   DGHV+  +++       A+LN
Sbjct: 374 EGSESDGHVKPRTAAVPGSGEVAELN 399

>UniRef50_UPI0000DB7124 Cluster: PREDICTED: similar to CG32030-PA, isoform A isoform 1; n=1; Apis mellifera|Rep: PREDICTED: similar to CG32030-PA, isoform A isoform 1 - Apis mellifera
          Length = 765

 Score = 33.1 bits (72), Expect = 4.0
 Identities = 23/54 (42%), Positives = 27/54 (50%), Gaps = 2/54 (3%)
 Frame = +2

Query: 11  PNTARLRPPVPNVQSPRFGKPLKV--SPGAIINPKRPNSEDGPFSCFKKPKESL 166
           P  ARL PPVP    P FG  LK   SP A  N   P S    F+  KK K+++
Sbjct: 607 PLGARLPPPVPQAPPPLFGVNLKSPRSPTATDNGNTPKSPPPAFT--KKSKKTV 658

>UniRef50_UPI00006CD0ED Cluster: hypothetical protein TTHERM_00125290; n=1; Tetrahymena thermophila SB210|Rep: hypothetical protein TTHERM_00125290 - Tetrahymena thermophila SB210
          Length = 2228

 Score = 33.1 bits (72), Expect = 4.0
 Identities = 19/76 (25%), Positives = 36/76 (47%), Gaps = 1/76 (1%)
 Frame = +1

Query: 265 KSIKGN-SMVSCVQMKSETSASSYQLNKLSNMSGLKVTKMTQNKQSPEEQSQANASKRTI 441
           K++K   + +S  + + ETS+  + +N+L  M     TK+TQ        + +N  ++
Sbjct: 223 KNVKAQLAPISSERGQIETSSGKFDINRLEEMISNNQTKVTQPNAITNNSNVSNKRQQQS 282

Query: 442 DAIQKLQMQGLLVKKP 489
           +  QK Q +G     P
Sbjct: 283 NLFQKKQQRGQKQNNP 298

>UniRef50_Q4KLP7 Cluster: MGC115718 protein; n=2; Xenopus|Rep: MGC115718 protein - Xenopus laevis (African clawed frog)
          Length = 640

 Score = 33.1 bits (72), Expect = 4.0
 Identities = 25/82 (30%), Positives = 38/82 (46%), Gaps = 1/82 (1%)
 Frame = +2

Query: 2   GTRPNTARLRPPVPNVQSPRFGKPLKVSPGAIINPKRPNSE-DGPFSCFKKPKESLLPIS 178
           G RP++A  RP  PN++ P   KP+K SP  +     P+S   G F   +  K+
Sbjct: 226 GKRPHSAPKRPS-PNIKFPPSAKPVKQSPVPVTKALPPHSAFKGVFCVAEGQKQGQEEQR 284

Query: 179 ESSNLGPNDDGHVEFASSSQTS 244
           ++ NL     G  E  SS + +
Sbjct: 285 KTCNLHSELKGGKEPESSKKVT 306

>UniRef50_Q7PUR9 Cluster: ENSANGP00000008445; n=1; Anopheles gambiae str. PEST|Rep: ENSANGP00000008445 - Anopheles gambiae str. PEST
          Length = 2086

 Score = 33.1 bits (72), Expect = 4.0
 Identities = 22/83 (26%), Positives = 34/83 (40%)
 Frame = +2

Query: 11  PNTARLRPPVPNVQSPRFGKPLKVSPGAIINPKRPNSEDGPFSCFKKPKESLLPISESSN 190
           PN + + PP P    P          GA+ +P+ P+ +DG        ++   PI+
Sbjct: 372 PNASSMGPPPPAGTPPNQSPSGAGGSGALGHPQHPSQQDGGHPPSPHGQQQQPPITSLVT 431

Query: 191 LGPNDDGHVEFASSSQTSKNFRA 259
            GP D   ++ AS   T  N  A
Sbjct: 432 TGP-DGAPLDEASQQSTLSNTSA 453

>UniRef50_Q7ZWE3 Cluster: La-related protein 7; n=3; Clupeocephala|Rep: La-related protein 7 - Danio rerio (Zebrafish) (Brachydanio rerio)
          Length = 555

 Score = 33.1 bits (72), Expect = 4.0
 Identities = 16/52 (30%), Positives = 23/52 (44%)
 Frame = +2

Query: 140 CFKKPKESLLPISESSNLGPNDDGHVEFASSSQTSKNFRAQLNLSKEILWFL 295
           C K     L P++    L  + +GHV F SS    K  +A+    K+  W L
Sbjct: 445 CIKDMLSELSPVAYVDLLDGDTEGHVRFKSSEDAQKVIKARFEFQKKYNWNL 496

>UniRef50_Q5WXV5 Cluster: Putative uncharacterized protein; n=4; Legionella pneumophila|Rep: Putative uncharacterized protein - Legionella pneumophila (strain Lens)
          Length = 392

 Score = 32.7 bits (71), Expect = 5.3
 Identities = 19/68 (27%), Positives = 33/68 (48%)
 Frame = +2

Query: 80  VSPGAIINPKRPNSEDGPFSCFKKPKESLLPISESSNLGPNDDGHVEFASSSQTSKNFRA 259
           ++PGA++      ++  P    K PK  LLP+ E+S+L   DD   E  ++       +A
Sbjct: 85  LAPGAVVRITPLQNKSIPELLIKTPKNQLLPLKEASSLYNQDD---EVGNNPLAITKHQA 141

Query: 260 QLNLSKEI 283
            L +  E+
Sbjct: 142 MLQIKPEL 149

>UniRef50_Q399J8 Cluster: Putative uncharacterized protein; n=9; Proteobacteria|Rep: Putative uncharacterized protein - Burkholderia sp. (strain 383) (Burkholderia cepacia (strain ATCC 17760/ NCIB 9086 / R18194))
          Length = 166

 Score = 32.7 bits (71), Expect = 5.3
 Identities = 16/43 (37%), Positives = 23/43 (53%), Gaps = 2/43 (4%)
 Frame = +2

Query: 2   GTRPNTARLRPPVPNVQSPRFGKPLKVSPGAIINP--KRPNSE 124
           GT P   +  PP P++ +P F K L   PG I++    RPN +
Sbjct: 87  GTGPLANKSNPPTPDLTTPAFKKRLNDYPGVIVSSVILRPNGD 129

>UniRef50_A1ZJB8 Cluster: Putative uncharacterized protein; n=1; Microscilla marina ATCC 23134|Rep: Putative uncharacterized protein - Microscilla marina ATCC 23134
          Length = 341

 Score = 32.7 bits (71), Expect = 5.3
 Identities = 20/72 (27%), Positives = 38/72 (52%)
 Frame = +1

Query: 247 KL*SATKSIKGNSMVSCVQMKSETSASSYQLNKLSNMSGLKVTKMTQNKQSPEEQSQANA 426
           +L S    I+G S+   +Q+K  +   +++LNK  ++SG K+T++ Q     ++   + A
Sbjct: 119 RLKSNINPIEG-SLTKLLQLKGVSYQYTFELNKYGDLSGEKITEIKQKTIDADKPYTSKA 177

Query: 427 SKRTIDAIQKLQ 462
            +R     Q LQ
Sbjct: 178 KQRLGFIAQDLQ 189

>UniRef50_Q9M1Q4 Cluster: Putative uncharacterized protein T17J13.160; n=2; Arabidopsis thaliana|Rep: Putative uncharacterized protein T17J13.160 - Arabidopsis thaliana (Mouse-ear cress)
          Length = 673

 Score = 32.3 bits (70), Expect = 7.0
 Identities = 26/86 (30%), Positives = 36/86 (41%)
 Frame = +2

Query: 8   RPNTARLRPPVPNVQSPRFGKPLKVSPGAIINPKRPNSEDGPFSCFKKPKESLLPISESS 187
           RP  A +RPP  NV  P   + L    G    P +    DGP    + P   LL   + S
Sbjct: 338 RPGAATMRPPYGNVFRPYRPENLNPPVGNGFRPMQHPRNDGP----RFPSPPLLTPLDIS 393

Query: 188 NLGPNDDGHVEFASSSQTSKNFRAQL 265
           NL  +     ++ S +Q   NF  Q+
Sbjct: 394 NLSVS-----QYPSQTQNRPNFNPQV 414

>UniRef50_A7PS77 Cluster: Chromosome chr14 scaffold_27, whole genome shotgun sequence; n=1; Vitis vinifera|Rep: Chromosome chr14 scaffold_27, whole genome shotgun sequence - Vitis vinifera (Grape)
          Length = 1905

 Score = 32.3 bits (70), Expect = 7.0
 Identities = 14/34 (41%), Positives = 20/34 (58%), Gaps = 1/34 (2%)
 Frame = +2

Query: 110 RPNSEDG-PFSCFKKPKESLLPISESSNLGPNDD 208
           RPN  DG P  CFK   +++ P+S+ S    +DD
Sbjct: 425 RPNDNDGEPSKCFKDQNKNISPVSQGSLCEEDDD 458

>UniRef50_A6R9E6 Cluster: Predicted protein; n=1; Ajellomyces capsulatus NAm1|Rep: Predicted protein - Ajellomyces capsulatus NAm1
          Length = 505

 Score = 32.3 bits (70), Expect = 7.0
 Identities = 12/22 (54%), Positives = 15/22 (68%)
 Frame = +1

Query: 34  TSAQCTKPQIWETFESFAWSYN 99
           TS  CTKP++WE F +F   YN
Sbjct: 204 TSPYCTKPELWEQFWTFNSGYN 225

>UniRef50_UPI0001556400 Cluster: PREDICTED: similar to Dlx1, partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED: similar to Dlx1, partial - Ornithorhynchus anatinus
          Length = 222

 Score = 31.9 bits (69), Expect = 9.3
 Identities = 15/62 (24%), Positives = 31/62 (50%)
 Frame = +2

Query: 35  PVPNVQSPRFGKPLKVSPGAIINPKRPNSEDGPFSCFKKPKESLLPISESSNLGPNDDGH 214
           P P     R  +PL++SP ++      +S      C  KP+++ +P+S++ +L  +
Sbjct: 32  PPPTCNPTRVYRPLQLSPPSVTFSTPQHSTAAGQRCSNKPRKTPVPVSDAPDLFDSSPRR 91

Query: 215 VE 220
           +E
Sbjct: 92  IE 93

>UniRef50_Q4RN77 Cluster: Chromosome undetermined SCAF15016, whole genome shotgun sequence; n=6; Euteleostomi|Rep: Chromosome undetermined SCAF15016, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer)
          Length = 164

 Score = 31.9 bits (69), Expect = 9.3
 Identities = 16/45 (35%), Positives = 24/45 (53%)
 Frame = +2

Query: 113 PNSEDGPFSCFKKPKESLLPISESSNLGPNDDGHVEFASSSQTSK 247
           P S D P   + + + S L  S+   L P+ D H++ +S SQT K
Sbjct: 108 PYSHDRPLQAYLRWRISQLAASDVHLLAPDGDSHLQPSSESQTRK 152

>UniRef50_Q22MK6 Cluster: Cyclic nucleotide-binding domain containing protein; n=1; Tetrahymena thermophila SB210|Rep: Cyclic nucleotide-binding domain containing protein - Tetrahymena thermophila SB210
          Length = 1467

 Score = 31.9 bits (69), Expect = 9.3
 Identities = 24/74 (32%), Positives = 35/74 (47%), Gaps = 4/74 (5%)
 Frame = +1

Query: 265 KSIKGNSMVSCVQMKSETSASSYQLNKLSNMSGLKVTKMTQNKQSPEEQSQANASKRT-- 438
           K+ KGN  +       E SA S  L +L  +   K+ +  QNK   EE +  NA   T
Sbjct: 737 KNEKGNKQIDVSDELFENSAFSQFLQELIKIKKQKLQEFKQNKSKEEEDNVENAEIFTTM 796

Query: 439 --IDAIQKLQMQGL 474
             ++ I+K QMQ +
Sbjct: 797 LYVEQIKK-QMQNI 809

>UniRef50_Q0E9Q0 Cluster: CG41441-PA; n=1; Drosophila melanogaster|Rep: CG41441-PA - Drosophila melanogaster (Fruit fly)
          Length = 195

 Score = 31.9 bits (69), Expect = 9.3
 Identities = 19/74 (25%), Positives = 26/74 (35%)
 Frame = -2

Query: 282 ISFDRFSCALKFLLVCEEDANSTCPSSFGPKFDDSEIGKRDSLGFLKQLKGPSSEFGLFG 103
           I+ D  +C   F  V     +  CP   GP F  +  G  +        K   S F  F
Sbjct: 60  IAVDELACRASFSAVLVPKNSFRCPRISGPPFSVNTAGAHNHAKLCPHQKLSGSSFARFA 119

Query: 102 LIIAPGETFKGFPN 61
           L   P   F+ + N
Sbjct: 120 LRSPPNRGFERYEN 133

>UniRef50_A2FMG8 Cluster: Putative uncharacterized protein; n=2; Trichomonas vaginalis G3|Rep: Putative uncharacterized protein - Trichomonas vaginalis G3
          Length = 411

 Score = 31.9 bits (69), Expect = 9.3
 Identities = 29/92 (31%), Positives = 40/92 (43%), Gaps = 5/92 (5%)
 Frame = +2

Query: 11  PNTARLRPPVPNV---QSPRFGKPLKVSPGAIINPKRPNSEDGPFSCFKK-PKESLLPIS 178
           P+T + RP        Q+ R     K  PG I N +RP   D   S F K P++   PI
Sbjct: 154 PSTIKARPATQQTNRKQTRRTKFQKKQHPGPIYNVERPIGSDARKSSFSKAPRD--FPIP 211

Query: 179 ESSNLGPND-DGHVEFASSSQTSKNFRAQLNL 271
           +S   GP D D   EF  S +     R  +++
Sbjct: 212 QSP--GPGDYDIPSEFGFSGKNLSRIRTTMHI 241

  Database: uniref50
    Posted date:  Oct 5, 2007 11:19 AM
  Number of letters in database: 575,637,011
  Number of sequences in database:  1,657,284

Lambda     K      H
   0.318    0.134    0.401

Gapped
Lambda     K      H
   0.279   0.0580    0.190

Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 520,655,698
Number of Sequences: 1657284
Number of extensions: 10141051
Number of successful extensions: 31311
Number of sequences better than 10.0: 21
Number of HSP's better than 10.0 without gapping: 30195
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 31286
length of database: 575,637,011
effective HSP length: 95
effective length of database: 418,195,031
effective search space used: 33037407449
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)