BLASTX 2.2.12 [Aug-07-2005]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= BmNP01_FL5_L13
         (850 letters)

Database: uniref50
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

UniRef50_Q9UMS4 Cluster: Pre-mRNA-processing factor 19; n=50; Fu...   237   3e-61
UniRef50_Q5DD07 Cluster: SJCHGC06229 protein; n=2; Schistosoma j...   200   3e-50
UniRef50_Q22W27 Cluster: Putative uncharacterized protein; n=1; ...   133   6e-30
UniRef50_Q3EBP5 Cluster: Uncharacterized protein At2g33340.2; n=...   132   1e-29
UniRef50_Q5AXS4 Cluster: Putative uncharacterized protein; n=2; ...   130   3e-29
UniRef50_Q5KKN8 Cluster: Nuclear matrix protein NMP200, putative...   129   8e-29
UniRef50_Q550P2 Cluster: WD40 repeat-containing protein; n=2; Di...   125   1e-27
UniRef50_A1CBT5 Cluster: Cell cycle control protein (Cwf8), puta...   121   2e-26
UniRef50_Q2H7I1 Cluster: Putative uncharacterized protein; n=5; ...   119   1e-25
UniRef50_A0CII1 Cluster: Chromosome undetermined scaffold_19, wh...   118   2e-25
UniRef50_A4RU49 Cluster: Predicted protein; n=2; Ostreococcus|Re...   114   2e-24
UniRef50_O14011 Cluster: Cell cycle control protein cwf8; n=1; S...   106   6e-22
UniRef50_O77325 Cluster: Conserved protein, putative; n=4; Plasm...   104   2e-21
UniRef50_Q4UCL6 Cluster: Putative uncharacterized protein; n=1; ...   104   3e-21
UniRef50_Q4MZ83 Cluster: Guanine nucleotide-binding protein, put...   104   3e-21
UniRef50_A3LZK6 Cluster: Predicted protein; n=2; Saccharomycetac...   104   3e-21
UniRef50_Q5CWJ2 Cluster: PRP19 non-snRNP sliceosome component re...   103   4e-21
UniRef50_A5KBA1 Cluster: WD domain, G-beta repeat domain contain...   101   3e-20
UniRef50_Q4FY68 Cluster: WD-repeat protein; n=3; Leishmania|Rep:...   100   5e-20
UniRef50_Q4PIF1 Cluster: Putative uncharacterized protein; n=1; ...   100   5e-20
UniRef50_Q5ADV1 Cluster: Putative uncharacterized protein PRP19;...    92   1e-17
UniRef50_Q75CH9 Cluster: ACL060Cp; n=1; Eremothecium gossypii|Re...    92   2e-17
UniRef50_Q4DPL8 Cluster: Putative uncharacterized protein; n=4; ...    89   2e-16
UniRef50_Q6FW58 Cluster: Similar to sp|P32523 Saccharomyces cere...    82   1e-14
UniRef50_A5CAR6 Cluster: Putative uncharacterized protein; n=1; ...    76   1e-12
UniRef50_Q6CS51 Cluster: Kluyveromyces lactis strain NRRL Y-1140...    74   5e-12
UniRef50_A7TRZ5 Cluster: Putative uncharacterized protein; n=1; ...    72   2e-11
UniRef50_P32523 Cluster: Pre-mRNA-splicing factor 19; n=3; Sacch...    70   9e-11
UniRef50_P93809 Cluster: F19P19.2 protein; n=1; Arabidopsis thal...    69   1e-10
UniRef50_A5E007 Cluster: Putative uncharacterized protein; n=1; ...    59   1e-07
UniRef50_Q3LWF9 Cluster: MRNA splicing protein PRP19; n=1; Bigel...    55   3e-06
UniRef50_A2EWI4 Cluster: Putative uncharacterized protein; n=2; ...    46   0.001
UniRef50_UPI00015B9740 Cluster: UPI00015B9740 related cluster; n...    38   0.32
UniRef50_UPI0000EBD1F1 Cluster: PREDICTED: hypothetical protein;...    38   0.32
UniRef50_Q4SHB0 Cluster: Chromosome 5 SCAF14581, whole genome sh...    38   0.32
UniRef50_Q9I9F1 Cluster: 5-methylcytosine G/T mismatch-specific ...    38   0.42
UniRef50_Q4Q975 Cluster: Putative uncharacterized protein; n=2; ...    38   0.42
UniRef50_UPI0000DD7D01 Cluster: PREDICTED: hypothetical protein;...    37   0.56
UniRef50_Q6YT10 Cluster: Putative lateral root primordia; n=6; O...    36   0.97
UniRef50_UPI0000EB2227 Cluster: Huntingtin-associated protein 1 ...    36   1.3
UniRef50_Q7EZ37 Cluster: Putative uncharacterized protein B1015H...    36   1.3
UniRef50_Q23QU7 Cluster: U-box domain containing protein; n=1; T...    36   1.3
UniRef50_Q8U2U5 Cluster: Putative uncharacterized protein PF0734...    36   1.3
UniRef50_UPI0000D99B29 Cluster: PREDICTED: hypothetical protein;...    36   1.7
UniRef50_Q8IWN7 Cluster: Retinitis pigmentosa 1-like 1 protein; ...    36   1.7
UniRef50_UPI0000F2EB03 Cluster: PREDICTED: hypothetical protein;...    35   2.2
UniRef50_A4MH05 Cluster: Oxidoreductase, 2OG-Fe(II) oxygenase fa...    35   2.2
UniRef50_A3B1M1 Cluster: Putative uncharacterized protein; n=1; ...    35   2.2
UniRef50_Q4TF27 Cluster: Chromosome undetermined SCAF4887, whole...    35   3.0
UniRef50_A3SIB5 Cluster: Putative uncharacterized protein; n=2; ...    35   3.0
UniRef50_Q4N4R0 Cluster: Peptidyl-prolyl cis-trans isomerase; n=...    35   3.0
UniRef50_UPI0000383927 Cluster: COG1529: Aerobic-type carbon mon...    34   3.9
UniRef50_Q9VEG1 Cluster: CG7431-PA; n=2; Sophophora|Rep: CG7431-...    34   3.9
UniRef50_Q7PWP8 Cluster: ENSANGP00000013932; n=1; Anopheles gamb...    34   3.9
UniRef50_Q1JSL3 Cluster: MRNA decapping enzyme, putative precurs...    34   3.9
UniRef50_Q4V2F4 Cluster: Putative uncharacterized protein; n=2; ...    34   5.2
UniRef50_Q166N8 Cluster: Putative uncharacterized protein; n=1; ...    34   5.2
UniRef50_Q0MX76 Cluster: Glycosyl transferase; n=12; Bacteria|Re...    34   5.2
UniRef50_Q86YZ3 Cluster: Hornerin; n=8; Theria|Rep: Hornerin - H...    34   5.2
UniRef50_UPI0000D9E76F Cluster: PREDICTED: hypothetical protein;...    33   6.9
UniRef50_Q9RWF2 Cluster: Leucyl aminopeptidase, putative; n=1; D...    33   6.9
UniRef50_Q1R2D4 Cluster: Putative uncharacterized protein; n=3; ...    33   6.9
UniRef50_A5NNR1 Cluster: LigA; n=2; cellular organisms|Rep: LigA...    33   6.9
UniRef50_A3L7L8 Cluster: Putative uncharacterized protein; n=2; ...    33   6.9
UniRef50_Q5DI17 Cluster: SJCHGC05395 protein; n=1; Schistosoma j...    33   6.9
UniRef50_Q5CUL0 Cluster: Ubiquitin-fusion degadation-2 (UFD2) fa...    33   6.9
UniRef50_A5K4P2 Cluster: Putative uncharacterized protein; n=1; ...    33   6.9
UniRef50_Q9ULR0 Cluster: Pre-mRNA-splicing factor ISY1 homolog; ...    33   6.9
UniRef50_A1GDA8 Cluster: Putative uncharacterized protein; n=1; ...    33   9.1
UniRef50_A0H4X8 Cluster: Putative uncharacterized protein; n=1; ...    33   9.1
UniRef50_Q9XVT2 Cluster: Putative uncharacterized protein; n=1; ...
33 9.1 UniRef50_A4HW23 Cluster: Putative uncharacterized protein; n=1; ... 33 9.1 UniRef50_Q8JZM8 Cluster: Mucin-4 precursor (Pancreatic adenocarc... 33 9.1 UniRef50_Q6NGC5 Cluster: Phospho-N-acetylmuramoyl-pentapeptide-t... 33 9.1 >UniRef50_Q9UMS4 Cluster: Pre-mRNA-processing factor 19; n=50; Fungi/Metazoa group|Rep: Pre-mRNA-processing factor 19 - Homo sapiens (Human) Length = 504 Score = 237 bits (580), Expect = 3e-61 Identities = 123/227 (54%), Positives = 150/227 (66%), Gaps = 7/227 (3%) Frame = +3 Query: 114 MSLYCAISNXXXXXXXXXXTSGAVFERRIIEKYIIENGVDPINGKELRVEDLXXXXXXXX 293 MSL C+ISN S V+ERR+IEKYI ENG DPIN + L E L Sbjct: 1 MSLICSISNEVPEHPCVSPVSNHVYERRLIEKYIAENGTDPINNQPLSEEQLIDIKVAHP 60 Query: 294 XXXXXXSATSIPATLKSMQDEWDALMLHAFTQRQQLQTARQELSHALYQHDAACRVIARL 473 SATSIPA LK++QDEWDA+MLH+FT RQQLQT RQELSHALYQHDAACRVIARL Sbjct: 61 IRPKPPSATSIPAILKALQDEWDAVMLHSFTLRQQLQTTRQELSHALYQHDAACRVIARL 120 Query: 474 TKEVTAAREALATLKPQAGIAAPQAPHPTE-------XXXXXXXXXXXXXDVVSRLQERA 632 TKEVTAAREALATLKPQAG+ PQA ++ +++ +LQ++A Sbjct: 121 TKEVTAAREALATLKPQAGLIVPQAVPSSQPSVVGAGEPMDLGELVGMTPEIIQKLQDKA 180 Query: 633 TALTQERKRRXRTLPXGLLAPXQIRXFLTLASHPGLHSXSVPGILSL 773 T LT ERK+R +T+P L+ P ++ + +ASH GLHS S+PGIL+L Sbjct: 181 TVLTTERKKRGKTVPEELVKPEELSKYRQVASHVGLHSASIPGILAL 227 >UniRef50_Q5DD07 Cluster: SJCHGC06229 protein; n=2; Schistosoma japonicum|Rep: SJCHGC06229 protein - Schistosoma japonicum (Blood fluke) Length = 535 Score = 200 bits (489), Expect = 3e-50 Identities = 124/245 (50%), Positives = 144/245 (58%), Gaps = 28/245 (11%) Frame = +3 Query: 114 MSLYCAISNXXXXXXXXXXTSGAVFERRIIEKYIIENGVDPINGKELRVEDLXXXXXXXX 293 MSL C++SN SG +FERR+IEKY+ ENG DPI+ + L VE+L Sbjct: 1 MSLLCSLSNEVPEHPVVSPRSGHIFERRLIEKYLSENGTDPIDQQPLAVEELIDIKASAF 60 Query: 294 XXXXXXSATSIPATLKSMQDEWDALMLHAFTQRQQLQTARQELSHALYQHDAACRVIARL 473 SATSIPA LK++QDEWDA+ML +FT RQQLQTARQELSHALYQHDAACRVIARL Sbjct: 61 VRPKPPSATSIPAILKTLQDEWDAVMLQSFTLRQQLQTARQELSHALYQHDAACRVIARL 120 Query: 474 
TKEVTAAREALATLKPQAGIAAP-----QA------PHPTEXXXXXXXXXXXXXDVV--- 611 TKEVTAAREALATLKPQAGI P QA PH V Sbjct: 121 TKEVTAAREALATLKPQAGIMQPTQMNVQAVNAAHQPHTNTAPATPQTSDTSDTSVTMDS 180 Query: 612 SRLQER------ATALTQER--------KRRXRTLPXGLLAPXQIRXFLTLASHPGLHSX 749 ++LQE + QER KRR +T+P GL I + LA+H GLHS Sbjct: 181 NQLQEEIGISEDVISTLQERASQLTAERKRRGKTVPEGLARSRAISEYQQLANHVGLHSA 240 Query: 750 SVPGI 764 S+PGI Sbjct: 241 SMPGI 245 >UniRef50_Q22W27 Cluster: Putative uncharacterized protein; n=1; Tetrahymena thermophila SB210|Rep: Putative uncharacterized protein - Tetrahymena thermophila SB210 Length = 609 Score = 133 bits (321), Expect = 6e-30 Identities = 87/237 (36%), Positives = 116/237 (48%), Gaps = 4/237 (1%) Frame = +3 Query: 96 SDIISKMSL-YCAISNXXXXXXXXXXTSGAVFERRIIEKYIIENGVDPINGKELRVEDLX 272 S ++ KMS YCAI+ SG VFE+R+IEK+I G PI G+ L +DL Sbjct: 96 SQLLIKMSWNYCAITGEQLHEPVVSKKSGHVFEKRVIEKHIQSTGQCPITGQALSNDDLI 155 Query: 273 XXXXXXXXXXXXXSATSIPATLKSMQDEWDALMLHAFTQRQQLQTARQELSHALYQHDAA 452 ++ SIP LK+ Q+EWDALML + +Q L+T RQELSHALYQHDAA Sbjct: 156 PVQLNVQTKPRSVTSNSIPGILKTFQNEWDALMLETYNLKQHLETVRQELSHALYQHDAA 215 Query: 453 CRVIARLTKEVTAAREALATL--KPQAGIAAPQAPHPTEXXXXXXXXXXXXXDVVSRLQE 626 CRVIARL KE AR +A L K + G + +++ + E Sbjct: 216 CRVIARLIKERDEARFEVAQLQEKLRQGKMELEEADAQNEQQQEQTNNKLPQNIIDSINE 275 Query: 627 RATALTQERKRRXRTLP-XGLLAPXQIRXFLTLASHPGLHSXSVPGILSLGHQSIRS 794 A L Q RK R + A ++ GLHS + PGI +L + R+ Sbjct: 276 TALKLNQIRKDRKKDAEYNSKFAATEVISSYGPKETVGLHSTTNPGINALDYNRQRN 332 >UniRef50_Q3EBP5 Cluster: Uncharacterized protein At2g33340.2; n=13; Euphyllophyta|Rep: Uncharacterized protein At2g33340.2 - Arabidopsis thaliana (Mouse-ear cress) Length = 537 Score = 132 bits (318), Expect = 1e-29 Identities = 86/229 (37%), Positives = 118/229 (51%), Gaps = 13/229 (5%) Frame = +3 Query: 126 CAISNXXXXXXXXXXTSGAVFERRIIEKYIIENGVDPINGKELRVEDLXXXXXXXXXXXX 305 CAIS SG +FERR+IE++I + G P+ G+ L ++D+ Sbjct: 3 CAISGEVPVEPVVSTKSGLLFERRLIERHISDYGKCPVTGEPLTIDDIVPIKTGEIIKPK 62 Query: 306 
XXSATSIPATLKSMQDEWDALMLHAFTQRQQLQTARQELSHALYQHDAACRVIARLTKEV 485 SIP L + Q+EWD LML F QQL TARQELSHALYQHD+ACRVIARL KE Sbjct: 63 TLHTASIPGLLGTFQNEWDGLMLSNFALEQQLHTARQELSHALYQHDSACRVIARLKKER 122 Query: 486 TAAREALATLK------PQAGIA------APQAPHPTE-XXXXXXXXXXXXXDVVSRLQE 626 AR+ LA ++ P+A A +A E ++++ L + Sbjct: 123 DEARQLLAEVERHIPAAPEAVTANAALSNGKRAAVDEELGPDAKKLCPGISAEIITELTD 182 Query: 627 RATALTQERKRRXRTLPXGLLAPXQIRXFLTLASHPGLHSXSVPGILSL 773 AL+Q+RK+ R +P L + + F L+SHP LH + PGI S+ Sbjct: 183 CNAALSQKRKK--RQIPQTLASIDTLERFTQLSSHP-LHKTNKPGICSM 228 >UniRef50_Q5AXS4 Cluster: Putative uncharacterized protein; n=2; Pezizomycotina|Rep: Putative uncharacterized protein - Emericella nidulans (Aspergillus nidulans) Length = 475 Score = 130 bits (315), Expect = 3e-29 Identities = 85/216 (39%), Positives = 111/216 (51%), Gaps = 4/216 (1%) Frame = +3 Query: 120 LYCAISNXXXXXXXXXXTSGAVFERRIIEKYIIENGVDPINGKELRVEDLXXXXXXXXXX 299 + CAIS SG+VFE+R++E YI ENG DP+NG+EL EDL Sbjct: 1 MLCAISGEAPQEPVVSPKSGSVFEKRLVEAYIAENGKDPVNGEELSTEDLIEVKTQRVVR 60 Query: 300 XXXXSATSIPATLKSMQDEWDALMLHAFTQRQQLQTARQELSHALYQHDAACRVIARLTK 479 + TSIP+ L Q+EWDAL L +T RQ L RQELS ALYQHDAA RVIARLTK Sbjct: 61 PRPPTLTSIPSLLSVFQEEWDALALETYTLRQTLAQTRQELSAALYQHDAAVRVIARLTK 120 Query: 480 EVTAAREALATLKPQAGIAAPQAPHPTEXXXXXXXXXXXXXDVVSRLQERATALTQERKR 659 E AR+AL+ + A + A +V++R++ +L+ + R Sbjct: 121 ERDEARDALSKVTVGARSSGAAA------DAMQVDSAGLPDEVLARVENTQASLS--KTR 172 Query: 660 RXRTLPXGLLAPXQIRXF----LTLASHPGLHSXSV 755 R R +P G I + T A +PG S SV Sbjct: 173 RKRPVPEGWATGEAISAYKPTESTDAFYPGGKSLSV 208 >UniRef50_Q5KKN8 Cluster: Nuclear matrix protein NMP200, putative; n=2; Filobasidiella neoformans|Rep: Nuclear matrix protein NMP200, putative - Cryptococcus neoformans (Filobasidiella neoformans) Length = 507 Score = 129 bits (312), Expect = 8e-29 Identities = 78/218 (35%), Positives = 116/218 (53%), Gaps = 1/218 (0%) Frame = +3 Query: 123 YCAISNXXXXXXXXXXTSGAVFERRIIEKYIIENGVDPINGKELRVEDLXXXXXXXXXXX 302 +CAIS TSGAV+E+ +IE+YI ENG DPI+G+ L +DL 
Sbjct: 3 FCAISGSPPTVPVVSKTSGAVYEKALIERYIEENGTDPISGEALTKDDLVDVKAKPSTIP 62 Query: 303 XXXS-ATSIPATLKSMQDEWDALMLHAFTQRQQLQTARQELSHALYQHDAACRVIARLTK 479 + TSIPA L ++Q E+D++ML + ++ Q++RQEL++ALY+ DAA RVIARL K Sbjct: 63 PRPANQTSIPALLTALQSEYDSIMLESLEIKKAFQSSRQELANALYREDAATRVIARLMK 122 Query: 480 EVTAAREALATLKPQAGIAAPQAPHPTEXXXXXXXXXXXXXDVVSRLQERATALTQERKR 659 E AR+AL++++ G P A DV +++ E AL+ RK+ Sbjct: 123 ERDEARQALSSIQSTIGFQPPAAAEEPAADVEMAQEGALPADVEAKVMETNQALSSVRKK 182 Query: 660 RXRTLPXGLLAPXQIRXFLTLASHPGLHSXSVPGILSL 773 R + P G I+ + + P LH+ GI +L Sbjct: 183 R-KPAP-GYKKVDDIKSYTQINHVPSLHATKPAGITAL 218 >UniRef50_Q550P2 Cluster: WD40 repeat-containing protein; n=2; Dictyostelium discoideum|Rep: WD40 repeat-containing protein - Dictyostelium discoideum AX4 Length = 514 Score = 125 bits (302), Expect = 1e-27 Identities = 63/133 (47%), Positives = 83/133 (62%) Frame = +3 Query: 120 LYCAISNXXXXXXXXXXTSGAVFERRIIEKYIIENGVDPINGKELRVEDLXXXXXXXXXX 299 + CAIS +G V+E+R+IEKYI NG +P G+ L + DL Sbjct: 1 MICAISGSTTEEPVISTKTGNVYEKRLIEKYIDTNGKEPTTGEPLGLSDLITVKIGKTVK 60 Query: 300 XXXXSATSIPATLKSMQDEWDALMLHAFTQRQQLQTARQELSHALYQHDAACRVIARLTK 479 +ATSIP+ L+ Q+EWD+LML FT +QQ +T RQEL+H++YQ+DAACRVIARL K Sbjct: 61 PRPTTATSIPSMLQLFQNEWDSLMLETFTLKQQHETVRQELAHSMYQYDAACRVIARLVK 120 Query: 480 EVTAAREALATLK 518 E AAR ALA + Sbjct: 121 ERDAARSALANAR 133 >UniRef50_A1CBT5 Cluster: Cell cycle control protein (Cwf8), putative; n=9; Pezizomycotina|Rep: Cell cycle control protein (Cwf8), putative - Aspergillus clavatus Length = 526 Score = 121 bits (292), Expect = 2e-26 Identities = 80/195 (41%), Positives = 103/195 (52%), Gaps = 4/195 (2%) Frame = +3 Query: 183 VFERRIIEKYIIENGVDPINGKELRVEDLXXXXXXXXXXXXXXSATSIPATLKSMQDEWD 362 VFERR+IE YI ENG DP+NG+EL +DL + TSIP+ L Q+EWD Sbjct: 72 VFERRLIEAYIAENGKDPVNGEELSTDDLIEVKTQRVVRPRPPTLTSIPSLLNVFQEEWD 131 Query: 363 ALMLHAFTQRQQLQTARQELSHALYQHDAACRVIARLTKEVTAAREALATLKPQAGIAAP 542 AL L +T RQ L RQELS ALYQHDAA RVIARLTKE AR+AL+ K G + Sbjct: 132 
ALALETYTLRQTLAQTRQELSVALYQHDAAVRVIARLTKERDEARDALS--KVTVGASRT 189 Query: 543 QAPHPTEXXXXXXXXXXXXXDVVSRLQERATALTQERKRRXRTLPXGLLAPXQIRXFL-- 716 + V++R++ AL+ + RR R +P G + + Sbjct: 190 GGDEAMQVDSTGLPDA-----VLARIESTQVALS--KTRRKRAIPEGWATSDALSTYKPT 242 Query: 717 -TL-ASHPGLHSXSV 755 TL S+PG + SV Sbjct: 243 ETLEPSYPGSKALSV 257 >UniRef50_Q2H7I1 Cluster: Putative uncharacterized protein; n=5; Pezizomycotina|Rep: Putative uncharacterized protein - Chaetomium globosum (Soil fungus) Length = 807 Score = 119 bits (286), Expect = 1e-25 Identities = 69/163 (42%), Positives = 90/163 (55%) Frame = +3 Query: 174 SGAVFERRIIEKYIIENGVDPINGKELRVEDLXXXXXXXXXXXXXXSATSIPATLKSMQD 353 +G VFE+R+I KYI ENG +P +EL EDL + TS+P+ LK+ QD Sbjct: 349 TGTVFEKRLILKYIEENGKEPGTDEELDPEDLLAVKTSRVVRPRPPNFTSLPSLLKAFQD 408 Query: 354 EWDALMLHAFTQRQQLQTARQELSHALYQHDAACRVIARLTKEVTAAREALATLKPQAGI 533 EWDAL+L + R+QL R+EL+ ALYQHDAA RVIARLT+E AAREAL+ + Sbjct: 409 EWDALVLETYNTREQLSRTREELATALYQHDAAVRVIARLTRERDAAREALSNV-----T 463 Query: 534 AAPQAPHPTEXXXXXXXXXXXXXDVVSRLQERATALTQERKRR 662 A A P +V + E LT+ RK+R Sbjct: 464 VAQAATGPANGDAMAVDNESLPEHLVEHVNELQQQLTKGRKKR 506 >UniRef50_A0CII1 Cluster: Chromosome undetermined scaffold_19, whole genome shotgun sequence; n=4; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_19, whole genome shotgun sequence - Paramecium tetraurelia Length = 489 Score = 118 bits (283), Expect = 2e-25 Identities = 59/131 (45%), Positives = 78/131 (59%) Frame = +3 Query: 126 CAISNXXXXXXXXXXTSGAVFERRIIEKYIIENGVDPINGKELRVEDLXXXXXXXXXXXX 305 CA+S SG ++E+R+IEK+I G PI G+ L +EDL Sbjct: 7 CALSGELIETPVISKVSGHIYEKRLIEKHIESTGTCPITGRPLNIEDLIEVKVSRVQKPR 66 Query: 306 XXSATSIPATLKSMQDEWDALMLHAFTQRQQLQTARQELSHALYQHDAACRVIARLTKEV 485 +ATSIP+ L +Q+EWDAL+L F +Q L+ R EL+HALYQHDAACRVIA+L KE Sbjct: 67 PVTATSIPSLLSLLQNEWDALLLEQFQLKQHLEQVRHELTHALYQHDAACRVIAKLIKER 126 Query: 486 TAAREALATLK 518 AR L+ L+ Sbjct: 127 DQARIELSQLQ 137 >UniRef50_A4RU49 Cluster: Predicted protein; n=2; Ostreococcus|Rep: Predicted 
protein - Ostreococcus lucimarinus CCE9901 Length = 478 Score = 114 bits (275), Expect = 2e-24 Identities = 70/181 (38%), Positives = 92/181 (50%) Frame = +3 Query: 120 LYCAISNXXXXXXXXXXTSGAVFERRIIEKYIIENGVDPINGKELRVEDLXXXXXXXXXX 299 ++CAIS G ++ER +I K I E G P+ + L V+DL Sbjct: 1 MFCAISGAAPARPVVTPR-GVLYERSLIVKAIEERGECPVTKESLSVDDLIELKAQKWVN 59 Query: 300 XXXXSATSIPATLKSMQDEWDALMLHAFTQRQQLQTARQELSHALYQHDAACRVIARLTK 479 + S+P L + +EWDALML T R++LQT RQELSHALYQHDAACRVIARL K Sbjct: 60 PRPEATMSVPGLLSAFHNEWDALMLETHTLRKELQTTRQELSHALYQHDAACRVIARLMK 119 Query: 480 EVTAAREALATLKPQAGIAAPQAPHPTEXXXXXXXXXXXXXDVVSRLQERATALTQERKR 659 E AR+ALA K A +A P VV+++ + L+ RK+ Sbjct: 120 ERDDARDALANAKGSAKRSAAGDAEPESKKVKAGLPAA----VVAKMNDVQKELSSGRKK 175 Query: 660 R 662 R Sbjct: 176 R 176 >UniRef50_O14011 Cluster: Cell cycle control protein cwf8; n=1; Schizosaccharomyces pombe|Rep: Cell cycle control protein cwf8 - Schizosaccharomyces pombe (Fission yeast) Length = 488 Score = 106 bits (255), Expect = 6e-22 Identities = 58/137 (42%), Positives = 75/137 (54%) Frame = +3 Query: 120 LYCAISNXXXXXXXXXXTSGAVFERRIIEKYIIENGVDPINGKELRVEDLXXXXXXXXXX 299 ++C+IS SG V+E+R+IE+ I E DP+ +E +EDL Sbjct: 1 MFCSISGETPKEPVISRVSGNVYEKRLIEQVIRETSKDPVTQQECTLEDLVPVKVPDFVR 60 Query: 300 XXXXSATSIPATLKSMQDEWDALMLHAFTQRQQLQTARQELSHALYQHDAACRVIARLTK 479 SATS+PA L Q+EWD++ L F R+ L +QELS ALY DAA RVI+RLTK Sbjct: 61 PRPPSATSLPALLSLFQEEWDSVALEQFELRRNLTETKQELSTALYSLDAALRVISRLTK 120 Query: 480 EVTAAREALATLKPQAG 530 E AREALA G Sbjct: 121 ERDEAREALAKFSDNIG 137 >UniRef50_O77325 Cluster: Conserved protein, putative; n=4; Plasmodium|Rep: Conserved protein, putative - Plasmodium falciparum (isolate 3D7) Length = 532 Score = 104 bits (250), Expect = 2e-21 Identities = 55/137 (40%), Positives = 77/137 (56%) Frame = +3 Query: 114 MSLYCAISNXXXXXXXXXXTSGAVFERRIIEKYIIENGVDPINGKELRVEDLXXXXXXXX 293 MS+ C IS T G +FE+R+IEK+II G+ P++G+ L +EDL Sbjct: 1 MSIICTISGQTPEEPVISKT-GYIFEKRLIEKHIINYGICPVSGEVLTLEDLYPIKNEKI 59 Query: 294 
XXXXXXSATSIPATLKSMQDEWDALMLHAFTQRQQLQTARQELSHALYQHDAACRVIARL 473 +A+SIP L Q EWD+++ F+ R + R ELSH+LYQ+DAA RVIA+L Sbjct: 60 VKPRPITASSIPGLLSIFQTEWDSIISEMFSLRTHVNDIRNELSHSLYQYDAATRVIAKL 119 Query: 474 TKEVTAAREALATLKPQ 524 KE +E + LK Q Sbjct: 120 LKEKNGYKEEIENLKKQ 136 >UniRef50_Q4UCL6 Cluster: Putative uncharacterized protein; n=1; Theileria annulata|Rep: Putative uncharacterized protein - Theileria annulata Length = 527 Score = 104 bits (249), Expect = 3e-21 Identities = 74/218 (33%), Positives = 101/218 (46%) Frame = +3 Query: 114 MSLYCAISNXXXXXXXXXXTSGAVFERRIIEKYIIENGVDPINGKELRVEDLXXXXXXXX 293 M+ C IS T G +FERR+IEK++ E+ V P G+ L +DL Sbjct: 1 MTFLCTISGVQPQEPCLSKT-GYIFERRLIEKHLEESPVCPATGEPLTPQDLINIKTDVV 59 Query: 294 XXXXXXSATSIPATLKSMQDEWDALMLHAFTQRQQLQTARQELSHALYQHDAACRVIARL 473 +A+SIP L +Q EWDAL L R + R++LS++LYQHDAA RVIARL Sbjct: 60 TKPRPVTASSIPGLLSLLQSEWDALALETHNMRSHVDEVRKQLSYSLYQHDAATRVIARL 119 Query: 474 TKEVTAAREALATLKPQAGIAAPQAPHPTEXXXXXXXXXXXXXDVVSRLQERATALTQER 653 K+ +A + + LK Q + D + RLQ+ A L ER Sbjct: 120 IKQRDSALQEVEALKQQLLLFRTN-------YDVNSLETEFDKDTMVRLQDLAKVLLSER 172 Query: 654 KRRXRTLPXGLLAPXQIRXFLTLASHPGLHSXSVPGIL 767 K+R + G L F A LHS + PG+L Sbjct: 173 KKRDLS---GYLDAEAFSKF-KCAGEFRLHSSTKPGVL 206 >UniRef50_Q4MZ83 Cluster: Guanine nucleotide-binding protein, putative; n=2; Piroplasmida|Rep: Guanine nucleotide-binding protein, putative - Theileria parva Length = 496 Score = 104 bits (249), Expect = 3e-21 Identities = 74/219 (33%), Positives = 105/219 (47%), Gaps = 1/219 (0%) Frame = +3 Query: 114 MSLYCAISNXXXXXXXXXXTSGAVFERRIIEKYIIENGVDPINGKELRVEDLXXXXXXXX 293 M+ C IS T G +FERR+IEK++ E+ V P G+ L ++DL Sbjct: 1 MTFLCTISGVQPQEPCLSKT-GYIFERRLIEKHLEESPVCPATGEPLTLQDLITIKTDVV 59 Query: 294 XXXXXXSATSIPATLKSMQDEWDALMLHAFTQRQQLQTARQELSHALYQHDAACRVIARL 473 +A+SIP L +Q EWDAL L R + R++LS++LYQHDAA RVIARL Sbjct: 60 TKPRPVTASSIPGLLSLLQSEWDALALETHNMRSHVDEVRKQLSYSLYQHDAATRVIARL 119 Query: 474 TKEVTAAREALATLKPQA-GIAAPQAPHPTEXXXXXXXXXXXXXDVVSRLQERATALTQE 650 K+ A + + +LK Q 
+ P+ E D + RLQ+ + L E Sbjct: 120 IKQRDTALQEVESLKQQLLQFRSNYDPNSLETEFDK--------DTLVRLQDFSKVLLAE 171 Query: 651 RKRRXRTLPXGLLAPXQIRXFLTLASHPGLHSXSVPGIL 767 RK+R + G + F A LHS + PG+L Sbjct: 172 RKKRDLS---GYVPSDAFTKF-KCAGEFRLHSSTKPGVL 206 >UniRef50_A3LZK6 Cluster: Predicted protein; n=2; Saccharomycetaceae|Rep: Predicted protein - Pichia stipitis (Yeast) Length = 514 Score = 104 bits (249), Expect = 3e-21 Identities = 61/139 (43%), Positives = 75/139 (53%), Gaps = 2/139 (1%) Frame = +3 Query: 120 LYCAISNXXXXXXXXXXTSGAVFERRIIEKYIIENGVDPINGKELRVEDLXXXXXXXXXX 299 + C+IS SGA+F+R+ IE YI G DPI+ + L VE+L Sbjct: 1 MICSISGQQASDPVISPKSGAIFDRKHIESYISTAGTDPISDQPLTVEELIAVKTSVSEV 60 Query: 300 XXXX--SATSIPATLKSMQDEWDALMLHAFTQRQQLQTARQELSHALYQHDAACRVIARL 473 SATSIPA L + Q+EWDAL L FT R+QL AR+ELS ALY HDAA RV A Sbjct: 61 IPPRISSATSIPALLSTFQNEWDALALEVFTLRKQLYKAREELSAALYHHDAAVRVAANA 120 Query: 474 TKEVTAAREALATLKPQAG 530 +E A+ AL L G Sbjct: 121 IRERDEAKAALQELAISIG 139 >UniRef50_Q5CWJ2 Cluster: PRP19 non-snRNP sliceosome component required for DNA repair; n=3; Cryptosporidium|Rep: PRP19 non-snRNP sliceosome component required for DNA repair - Cryptosporidium parvum Iowa II Length = 576 Score = 103 bits (248), Expect = 4e-21 Identities = 55/135 (40%), Positives = 76/135 (56%) Frame = +3 Query: 114 MSLYCAISNXXXXXXXXXXTSGAVFERRIIEKYIIENGVDPINGKELRVEDLXXXXXXXX 293 MSL C+IS T G +FE+R+IE+YI N PI EL ++DL Sbjct: 25 MSLICSISGTTPEDPVISKT-GYIFEKRLIEEYIRCNNSCPITKSELSLDDLIQVKSKSN 83 Query: 294 XXXXXXSATSIPATLKSMQDEWDALMLHAFTQRQQLQTARQELSHALYQHDAACRVIARL 473 TSIP L S++ EWDA+ + F R +L+ + +L+H+LYQHDAACRVIAR+ Sbjct: 84 LKPRLIKNTSIPGILDSLRTEWDAMAMEMFQLRSELEQTKSQLTHSLYQHDAACRVIARI 143 Query: 474 TKEVTAAREALATLK 518 T+E A LA ++ Sbjct: 144 TREKDQAISRLAEIQ 158 >UniRef50_A5KBA1 Cluster: WD domain, G-beta repeat domain containing protein; n=1; Plasmodium vivax|Rep: WD domain, G-beta repeat domain containing protein - Plasmodium vivax Length = 498 Score = 101 bits (241), Expect = 3e-20 Identities = 53/137 (38%), 
Positives = 78/137 (56%) Frame = +3 Query: 114 MSLYCAISNXXXXXXXXXXTSGAVFERRIIEKYIIENGVDPINGKELRVEDLXXXXXXXX 293 MS+ C IS T G VFE+R+IEK+I+ G+ P++G+ L ++DL Sbjct: 1 MSILCTISGQTPEEPVVSKT-GYVFEKRLIEKHILNYGICPVSGEVLTLQDLYPLKNEQV 59 Query: 294 XXXXXXSATSIPATLKSMQDEWDALMLHAFTQRQQLQTARQELSHALYQHDAACRVIARL 473 +A+SIP L +Q EWDA++ FT R + R +L+H+LYQ+DAA RVIA+L Sbjct: 60 VKPRPITASSIPGLLSILQTEWDAIISEMFTLRTHVNDIRNQLTHSLYQYDAATRVIAKL 119 Query: 474 TKEVTAAREALATLKPQ 524 KE +E + L+ Q Sbjct: 120 LKEKNDYKEEVKKLRNQ 136 >UniRef50_Q4FY68 Cluster: WD-repeat protein; n=3; Leishmania|Rep: WD-repeat protein - Leishmania major strain Friedlin Length = 513 Score = 100 bits (239), Expect = 5e-20 Identities = 60/152 (39%), Positives = 81/152 (53%), Gaps = 6/152 (3%) Frame = +3 Query: 120 LYCAISNXXXXXXXXXXTSGAVFERRIIEKYIIENGVDPINGKELRVEDLXXXXXXXXXX 299 L C IS SG VFER ++EKY+ E+G P+ G LR EDL Sbjct: 2 LRCNISQRVPTHPVVSVKSGLVFERSLVEKYVDEHGRCPVTGDPLRKEDLITAQGAAPDA 61 Query: 300 XXXXS----ATSIPATLKSMQDEWDALMLHAFTQRQQLQTARQELSHALYQHDAACRVIA 467 S A S+P L+ +Q EW+ + L F+ RQQ+ + EL+HAL Q+DAACRVIA Sbjct: 62 SIASSGSLGAASVPTLLERLQVEWEGVALEQFSLRQQVTQLQLELAHALQQYDAACRVIA 121 Query: 468 RLTKEVTAAR--EALATLKPQAGIAAPQAPHP 557 RL+KE+ + R +A+A + G A P P Sbjct: 122 RLSKELDSLRGGKAVAAEETDKGPAVVVVPAP 153 >UniRef50_Q4PIF1 Cluster: Putative uncharacterized protein; n=1; Ustilago maydis|Rep: Putative uncharacterized protein - Ustilago maydis (Smut fungus) Length = 551 Score = 100 bits (239), Expect = 5e-20 Identities = 73/245 (29%), Positives = 111/245 (45%), Gaps = 30/245 (12%) Frame = +3 Query: 120 LYCAISNXXXXXXXXXXTSGAVFERRIIEKYIIENGVDPINGKELRVEDLXXXXXXXXXX 299 ++CAIS SG ++E+R+I KYI ENG DP+ G L ++DL Sbjct: 1 MFCAISGEPPKVPVVSRKSGLIYEQRLIHKYINENGKDPVTGDTLELDDLIEIKSTFPIV 60 Query: 300 XXXXSA-----------------TSIPATLKSMQDEWDALMLHAFTQRQQLQTARQELSH 428 + +SIP+ L S+Q+E+DA++L FT ++ RQEL+H Sbjct: 61 RRGKAGRQAAAGPKTAVPRPPQHSSIPSLLTSLQNEYDAIILETFTLKKHYDNLRQELAH 120 Query: 429 
ALYQHDAACRVIARLTKEVTAAREALATLKPQAGIAAPQAPH-----------PTE-XXX 572 ALY +DA+ RVIARL E AREALA+++ G A P + P E Sbjct: 121 ALYANDASARVIARLLNERDQAREALASIQGTIG-AGPSSSRTAQDVEMSDSAPAENGSA 179 Query: 573 XXXXXXXXXXDVVSRLQERATALTQERK-RRXRTLPXGLLAPXQIRXFLTLASHPGLHSX 749 ++V + A L+ +RK + R P G + + F+ + P +H Sbjct: 180 GAAETGGLPKEIVDVIDSTAQRLSSQRKAKSKRKAPEGYASQSTVADFVEVQKLPSMHHA 239 Query: 750 SVPGI 764 G+ Sbjct: 240 RPAGV 244 >UniRef50_Q5ADV1 Cluster: Putative uncharacterized protein PRP19; n=1; Candida albicans|Rep: Putative uncharacterized protein PRP19 - Candida albicans (Yeast) Length = 446 Score = 92.3 bits (219), Expect = 1e-17 Identities = 56/155 (36%), Positives = 79/155 (50%), Gaps = 7/155 (4%) Frame = +3 Query: 120 LYCAISNXXXXXXXXXXTSGAVFERRIIEKYIIENGVDPINGKELRVEDLXXXXXXXXXX 299 + C+IS SGA+F+R+ I YI +G DPI + L +L Sbjct: 1 MICSISGEIATDPVVSPKSGAIFQRKHIVNYIATSGTDPITDEPLTESELISLKVNEKST 60 Query: 300 XXXX------SATSIPATLKSMQDEWDALMLHAFTQRQQLQTARQELSHALYQHDAACRV 461 S +SIP+ L + Q+EWDA++L FT ++QLQ+A+QELS ALY+ DAA V Sbjct: 61 AIAQPSPPDPSNSSIPSLLSTFQNEWDAIVLEVFTLKKQLQSAKQELSIALYRQDAAVNV 120 Query: 462 IARLTKEVTAAREALATLKPQAGIA-APQAPHPTE 563 A+ +E AREAL L ++ P P E Sbjct: 121 AAKAIRERDEAREALEKLSSSINLSDVPDTNTPPE 155 >UniRef50_Q75CH9 Cluster: ACL060Cp; n=1; Eremothecium gossypii|Rep: ACL060Cp - Ashbya gossypii (Yeast) (Eremothecium gossypii) Length = 504 Score = 91.9 bits (218), Expect = 2e-17 Identities = 60/152 (39%), Positives = 76/152 (50%), Gaps = 8/152 (5%) Frame = +3 Query: 120 LYCAISNXXXXXXXXXXTSGAVFERRIIEKYIIENGVDPINGKELRVEDLXXXXXXXXXX 299 ++CAIS S VFE+R+IE+YI E+GVDPI+ L + L Sbjct: 1 MFCAISGKPPITPVVSPESKCVFEKRLIEQYIDEHGVDPISKTSLTKDALIVIAQTPQQY 60 Query: 300 XXXX---SAT-----SIPATLKSMQDEWDALMLHAFTQRQQLQTARQELSHALYQHDAAC 455 SAT SIP L ++Q+EWDA+ML F R QL ++ELS ALY+ DAA Sbjct: 61 ALANAVNSATLNANYSIPNLLSTLQNEWDAVMLETFELRSQLDMCKKELSSALYKCDAAI 120 Query: 456 RVIARLTKEVTAAREALATLKPQAGIAAPQAP 551 RV AR +E R L L G A AP Sbjct: 121 RVAARAKQESDELRHTLTELTEAVGGQAADAP 152 >UniRef50_Q4DPL8 
Cluster: Putative uncharacterized protein; n=4; Trypanosoma|Rep: Putative uncharacterized protein - Trypanosoma cruzi Length = 513 Score = 88.6 bits (210), Expect = 2e-16 Identities = 46/132 (34%), Positives = 72/132 (54%), Gaps = 7/132 (5%) Frame = +3 Query: 120 LYCAISNXXXXXXXXXXTSGAVFERRIIEKYIIENGVDPINGKELRVEDLXXXXXXXXXX 299 ++C ISN SG ++ER ++E+YI E+G P+ G+ L+ +DL Sbjct: 1 MFCCISNRVPHEPVVSRLSGCLYERSLVEQYIAEHGRCPVTGEALQKDDLIAVRPTVLKS 60 Query: 300 XXXX-------SATSIPATLKSMQDEWDALMLHAFTQRQQLQTARQELSHALYQHDAACR 458 S+ ++P L + +WDA+ML F+ RQQL +QEL+ AL+Q+++ACR Sbjct: 61 VAGGVGGALSPSSETVPGILAKLHSQWDAIMLEQFSLRQQLAQTQQELAQALHQYESACR 120 Query: 459 VIARLTKEVTAA 494 VIA K+ AA Sbjct: 121 VIATFIKDRDAA 132 >UniRef50_Q6FW58 Cluster: Similar to sp|P32523 Saccharomyces cerevisiae YLL036c PRP19; n=1; Candida glabrata|Rep: Similar to sp|P32523 Saccharomyces cerevisiae YLL036c PRP19 - Candida glabrata (Yeast) (Torulopsis glabrata) Length = 533 Score = 82.2 bits (194), Expect = 1e-14 Identities = 43/122 (35%), Positives = 68/122 (55%), Gaps = 8/122 (6%) Frame = +3 Query: 120 LYCAISNXXXXXXXXXXTSGAVFERRIIEKYIIENGVDPINGKELRVEDLXXXXXXXXXX 299 +YCAIS S V+ERR++E+Y+ ++G DP+NG+ L VE L Sbjct: 1 MYCAISGKVPKEPVLSLESRCVYERRLVEEYVRQHGTDPVNGRPLAVEQLVEINVDPESM 60 Query: 300 XXXXSAT--------SIPATLKSMQDEWDALMLHAFTQRQQLQTARQELSHALYQHDAAC 455 +A SIP+ L ++Q+EWDA+ML F R+ +++ +++LS LY+ DAA Sbjct: 61 TLVNAANSATLNSNYSIPSLLSTLQNEWDAVMLENFELRKAVESLKKKLSTTLYERDAAK 120 Query: 456 RV 461 +V Sbjct: 121 KV 122 >UniRef50_A5CAR6 Cluster: Putative uncharacterized protein; n=1; Vitis vinifera|Rep: Putative uncharacterized protein - Vitis vinifera (Grape) Length = 209 Score = 76.2 bits (179), Expect = 1e-12 Identities = 41/89 (46%), Positives = 51/89 (57%) Frame = +3 Query: 174 SGAVFERRIIEKYIIENGVDPINGKELRVEDLXXXXXXXXXXXXXXSATSIPATLKSMQD 353 SG +FE+R+IE++ + G PI G+ L ++D+ A SIP L Q Sbjct: 85 SGLLFEKRLIERH--DYGKCPITGEPLTMDDIVPIQTGKIVKPRPVQAASIPGMLGMFQI 142 Query: 354 EWDALMLHAFTQRQQLQTARQELSHALYQ 440 EWD LML F QQL 
TARQELSHALYQ Sbjct: 143 EWDGLMLSNFALEQQLHTARQELSHALYQ 171 >UniRef50_Q6CS51 Cluster: Kluyveromyces lactis strain NRRL Y-1140 chromosome D of strain NRRL Y- 1140 of Kluyveromyces lactis; n=1; Kluyveromyces lactis|Rep: Kluyveromyces lactis strain NRRL Y-1140 chromosome D of strain NRRL Y- 1140 of Kluyveromyces lactis - Kluyveromyces lactis (Yeast) (Candida sphaerica) Length = 491 Score = 73.7 bits (173), Expect = 5e-12 Identities = 45/140 (32%), Positives = 65/140 (46%), Gaps = 8/140 (5%) Frame = +3 Query: 120 LYCAISNXXXXXXXXXXTSGAVFERRIIEKYIIENGVDPINGKELRVEDLXXXXXXXXXX 299 ++CAIS S +FE+ +IE+YI + G DPI L+ DL Sbjct: 1 MFCAISGKPPIKAVLSPNSKCIFEQHLIEQYIEQKGTDPITDDPLQKTDLVEINATPQQI 60 Query: 300 XXXXSATS--------IPATLKSMQDEWDALMLHAFTQRQQLQTARQELSHALYQHDAAC 455 S +S IP+ L ++Q EWDA+ML F R+QL ++ LS LY+ DA Sbjct: 61 SLSESLSSSTIANNYSIPSLLSTLQKEWDAVMLENFELRKQLDVCKKNLSDTLYRFDAVA 120 Query: 456 RVIARLTKEVTAAREALATL 515 A+ E ++ LA L Sbjct: 121 SAAAKAFVERDQLKQELAEL 140 >UniRef50_A7TRZ5 Cluster: Putative uncharacterized protein; n=1; Vanderwaltozyma polyspora DSM 70294|Rep: Putative uncharacterized protein - Vanderwaltozyma polyspora DSM 70294 Length = 504 Score = 71.7 bits (168), Expect = 2e-11 Identities = 40/129 (31%), Positives = 66/129 (51%), Gaps = 8/129 (6%) Frame = +3 Query: 120 LYCAISNXXXXXXXXXXTSGAVFERRIIEKYIIENGVDPINGKELRVEDLXXXXXXXXXX 299 ++CAIS S ++FE+ +IE+Y+ ++G DPI + L++ +L Sbjct: 1 MFCAISGKPAKFPVLSPKSKSIFEKALIEQYVEQSGKDPITNEPLKLSELVEISQTPQQT 60 Query: 300 XXXXSAT--------SIPATLKSMQDEWDALMLHAFTQRQQLQTARQELSHALYQHDAAC 455 + SIP L ++Q+EWDA+ML F R+QL ++LS A Y+ D+A Sbjct: 61 SLVNAVNASTLNTNYSIPNLLSTLQNEWDAIMLENFQLRKQLDAFTKQLSIAYYERDSAK 120 Query: 456 RVIARLTKE 482 + A+ KE Sbjct: 121 LIAAKTLKE 129 >UniRef50_P32523 Cluster: Pre-mRNA-splicing factor 19; n=3; Saccharomyces cerevisiae|Rep: Pre-mRNA-splicing factor 19 - Saccharomyces cerevisiae (Baker's yeast) Length = 503 Score = 69.7 bits (163), Expect = 9e-11 Identities = 44/144 (30%), Positives = 66/144 (45%), Gaps = 8/144 (5%) 
Frame = +3 Query: 120 LYCAISNXXXXXXXXXXTSGAVFERRIIEKYIIENGVDPINGKELRVEDLXXXXXXXXXX 299 + CAIS S +FE+ ++E+Y+ + G DPI + L +E++ Sbjct: 1 MLCAISGKVPRRPVLSPKSRTIFEKSLLEQYVKDTGNDPITNEPLSIEEIVEIVPSAQQA 60 Query: 300 XXXXSATS--------IPATLKSMQDEWDALMLHAFTQRQQLQTARQELSHALYQHDAAC 455 S S IP L S+Q+EWDA+ML F R L + ++LS +Y+ DAA Sbjct: 61 SLTESTNSATLKANYSIPNLLTSLQNEWDAIMLENFKLRSTLDSLTKKLSTVMYERDAAK 120 Query: 456 RVIARLTKEVTAAREALATLKPQA 527 V A+L E + L QA Sbjct: 121 LVAAQLLMEKNEDSKDLPKSSQQA 144 >UniRef50_P93809 Cluster: F19P19.2 protein; n=1; Arabidopsis thaliana|Rep: F19P19.2 protein - Arabidopsis thaliana (Mouse-ear cress) Length = 580 Score = 68.9 bits (161), Expect = 1e-10 Identities = 42/75 (56%), Positives = 47/75 (62%), Gaps = 11/75 (14%) Frame = +3 Query: 354 EWDALMLHAFTQRQQLQTARQELSHALY-----------QHDAACRVIARLTKEVTAARE 500 EWD+LML F QQL TARQELSHALY QHDAACRVIARL KE +R+ Sbjct: 137 EWDSLMLSNFALEQQLHTARQELSHALYQVIDGGYTFPLQHDAACRVIARLKKERDESRQ 196 Query: 501 ALATLKPQAGIAAPQ 545 LA + Q AAP+ Sbjct: 197 LLAEAERQLP-AAPE 210 Score = 38.7 bits (86), Expect = 0.18 Identities = 21/57 (36%), Positives = 34/57 (59%) Frame = +3 Query: 603 DVVSRLQERATALTQERKRRXRTLPXGLLAPXQIRXFLTLASHPGLHSXSVPGILSL 773 +V++ L + AL+Q+RK+ R +P L + + F L+SHP LH + PGI S+ Sbjct: 258 EVITELTDCNAALSQQRKK--RQIPKTLASVDALEKFTQLSSHP-LHKTNKPGIFSM 311 >UniRef50_A5E007 Cluster: Putative uncharacterized protein; n=1; Lodderomyces elongisporus NRRL YB-4239|Rep: Putative uncharacterized protein - Lodderomyces elongisporus (Yeast) (Saccharomyces elongisporus) Length = 555 Score = 59.3 bits (137), Expect = 1e-07 Identities = 27/68 (39%), Positives = 45/68 (66%) Frame = +3 Query: 312 SATSIPATLKSMQDEWDALMLHAFTQRQQLQTARQELSHALYQHDAACRVIARLTKEVTA 491 + +SIP+ L ++Q+EWD+++L FT R+ +Q +Q+LS ALY+ DA+ V A+ +E Sbjct: 104 ATSSIPSLLSTLQNEWDSIVLELFTLRKTVQLLKQQLSMALYRADASVNVAAKALRERDQ 163 Query: 492 AREALATL 515 AR + L Sbjct: 164 ARREIERL 171 Score = 44.4 bits (100), Expect = 0.004 Identities = 19/50 (38%), Positives = 30/50 (60%) Frame = 
+3 Query: 120 LYCAISNXXXXXXXXXXTSGAVFERRIIEKYIIENGVDPINGKELRVEDL 269 + CA+S SG+VFE++ IEKY++ +G DPIN + L + +L Sbjct: 1 MICALSGLPIQNPVASPKSGSVFEKKYIEKYVLTSGKDPINDEPLTIGEL 50 >UniRef50_Q3LWF9 Cluster: MRNA splicing protein PRP19; n=1; Bigelowiella natans|Rep: MRNA splicing protein PRP19 - Bigelowiella natans (Pedinomonas minutissima) (Chlorarachnion sp.(strain CCMP 621)) Length = 139 Score = 54.8 bits (126), Expect = 3e-06 Identities = 33/132 (25%), Positives = 57/132 (43%) Frame = +3 Query: 123 YCAISNXXXXXXXXXXTSGAVFERRIIEKYIIENGVDPINGKELRVEDLXXXXXXXXXXX 302 YC++S TSG +F+ I+ Y+ E PI G ++L Sbjct: 3 YCSMSGLFTTRPMILTTSGYIFDEYAIKSYLNEFKKCPITGMPSTHKNLIECKNSNNFKC 62 Query: 303 XXXSATSIPATLKSMQDEWDALMLHAFTQRQQLQTARQELSHALYQHDAACRVIARLTKE 482 T + L+ ++++W +L F + L RQEL + YQ+DAA R + ++ Sbjct: 63 VFSQHTDLITLLEEIKNQWQKYILEYFQLKNNLLYIRQELILSYYQNDAAYRALVSALRD 122 Query: 483 VTAAREALATLK 518 ++ + TLK Sbjct: 123 RNKLKKVIFTLK 134 >UniRef50_A2EWI4 Cluster: Putative uncharacterized protein; n=2; Trichomonas vaginalis G3|Rep: Putative uncharacterized protein - Trichomonas vaginalis G3 Length = 476 Score = 46.4 bits (105), Expect = 0.001 Identities = 30/100 (30%), Positives = 49/100 (49%), Gaps = 1/100 (1%) Frame = +3 Query: 177 GAVFERRIIEKYIIENGVDPINGKELRVEDLXXXXXXXXXXXXXXSAT-SIPATLKSMQD 353 G V+++ IE I + V P+ K L + DL S L S+Q+ Sbjct: 20 GIVYDKDSIEHQIQISPVCPVTDKSLTLADLIPLKIDMPVNKTQTVRNYSFGDYLLSLQN 79 Query: 354 EWDALMLHAFTQRQQLQTARQELSHALYQHDAACRVIARL 473 +W++ + R++L +EL+ ALY+ +AA RVIAR+ Sbjct: 80 QWNSKQKELYETRKKLAQCERELAQALYETEAAKRVIARI 119 >UniRef50_UPI00015B9740 Cluster: UPI00015B9740 related cluster; n=2; unknown|Rep: UPI00015B9740 UniRef100 entry - unknown Length = 515 Score = 37.9 bits (84), Expect = 0.32 Identities = 24/79 (30%), Positives = 33/79 (41%) Frame = +2 Query: 443 RRSVSCDSPTHKGGDGGARGPRHTETAGRHCSTPSTTPHGGVSRQCGSNRHVCGRGVSSA 622 RR + H+G G +RG + G HG +R C H CG+ +++A Sbjct: 267 RRGLRLRPDRHQGAGGQSRGDGRSHPGGA-----GRRRHGCGARLCRD--HACGQVLAAA 319 Query: 623 
GARHRTHAGTEAPXPHPAR 679 R R H G A HP R Sbjct: 320 APRRRPHRGDPARPRHPLR 338 >UniRef50_UPI0000EBD1F1 Cluster: PREDICTED: hypothetical protein; n=1; Bos taurus|Rep: PREDICTED: hypothetical protein - Bos taurus Length = 278 Score = 37.9 bits (84), Expect = 0.32 Identities = 32/86 (37%), Positives = 37/86 (43%), Gaps = 13/86 (15%) Frame = +2 Query: 473 HKGGDGGARGPRHTETAGRHCSTPST--TPHGGVSRQCGSNRH------VCGRGVSSAG- 625 H GG G G R + GRH PS TP G + GS H V GRG+ G Sbjct: 104 HPGGGGFLGGRRLSHAQGRHSPAPSVPYTP-GEPASPTGSRAHPGFRRGVGGRGMGGTGP 162 Query: 626 --ARHRTHA--GTEAPXPHPARXTAG 691 RTHA +AP P P R + G Sbjct: 163 GCGGGRTHARRSQDAPPPPPPRGSGG 188 >UniRef50_Q4SHB0 Cluster: Chromosome 5 SCAF14581, whole genome shotgun sequence; n=4; Tetraodontidae|Rep: Chromosome 5 SCAF14581, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 1680 Score = 37.9 bits (84), Expect = 0.32 Identities = 31/115 (26%), Positives = 43/115 (37%) Frame = +2 Query: 437 PTRRSVSCDSPTHKGGDGGARGPRHTETAGRHCSTPSTTPHGGVSRQCGSNRHVCGRGVS 616 P+ R + S G GA G R ++ + + + G S+ CG + HV V Sbjct: 235 PSARVSTAGSDVPSRGPIGAAGNRRSQGSKTSSQSSKDSSEGMTSQPCGMSEHVPSSAV- 293 Query: 617 SAGARHRTHAGTEAPXPHPARXTAGTRXDQXIPHSRIASWSSFXQCSWYPISGTS 781 +AG T A T P +A T Q P S W S S + S S Sbjct: 294 TAGPSTSTSAATSGSVSPPTSSSAATAIPQ--PSSNGKPWKSKSVSSKHAASSAS 346 >UniRef50_Q9I9F1 Cluster: 5-methylcytosine G/T mismatch-specific DNA glycosylase; n=3; Gallus gallus|Rep: 5-methylcytosine G/T mismatch-specific DNA glycosylase - Gallus gallus (Chicken) Length = 416 Score = 37.5 bits (83), Expect = 0.42 Identities = 22/70 (31%), Positives = 31/70 (44%) Frame = +2 Query: 446 RSVSCDSPTHKGGDGGARGPRHTETAGRHCSTPSTTPHGGVSRQCGSNRHVCGRGVSSAG 625 R+ D P GG+GGAR + C PS GG + +C + GR S+ Sbjct: 34 RAAPRDLPVRDGGEGGARSSQQRHGTAVRCERPSA--RGGKAERCATKAEAGGRARRSSA 91 Query: 626 ARHRTHAGTE 655 R+ AG+E Sbjct: 92 ---RSDAGSE 98 >UniRef50_Q4Q975 Cluster: Putative uncharacterized protein; n=2; Leishmania|Rep: Putative uncharacterized protein - Leishmania major 
Length = 1771 Score = 37.5 bits (83), Expect = 0.42 Identities = 26/94 (27%), Positives = 38/94 (40%), Gaps = 1/94 (1%) Frame = +2 Query: 434 VPTRRSVSCDSPTHKGGDGGARGPRHTETAGRHCSTPSTTPHGGVSRQCGSNRHVCGRGV 613 VP RR ++ T G AR H E G+ S+P RQ G+ R G + Sbjct: 955 VPARRVSLIEAVTPTNAPGRARSQPHAE--GQPRSSPGALFRSRSGRQYGNTRSFTGDSL 1012 Query: 614 SSAGARHRTHAGTEA-PXPHPARXTAGTRXDQXI 712 + G H +HA + P P P + T + + Sbjct: 1013 AGKGEAHASHAHSYTDPSPSPLQKRRPTALEDRL 1046 >UniRef50_UPI0000DD7D01 Cluster: PREDICTED: hypothetical protein; n=2; Homo/Pan/Gorilla group|Rep: PREDICTED: hypothetical protein - Homo sapiens Length = 188 Score = 37.1 bits (82), Expect = 0.56 Identities = 26/74 (35%), Positives = 32/74 (43%) Frame = +2 Query: 476 KGGDGGARGPRHTETAGRHCSTPSTTPHGGVSRQCGSNRHVCGRGVSSAGARHRTHAGTE 655 KGG+ GA PRH G + S P G + R+ S G + AGAR H T Sbjct: 119 KGGERGAPLPRHVPAFGSYPSV--FNPFGVMGRK--SKEEGVAPGQNPAGARMCNHPPTR 174 Query: 656 APXPHPARXTAGTR 697 + P P AG R Sbjct: 175 SSLPSPPPGNAGQR 188 >UniRef50_Q6YT10 Cluster: Putative lateral root primordia; n=6; Oryza sativa|Rep: Putative lateral root primordia - Oryza sativa subsp. japonica (Rice) Length = 324 Score = 36.3 bits (80), Expect = 0.97 Identities = 28/71 (39%), Positives = 34/71 (47%), Gaps = 2/71 (2%) Frame = +2 Query: 467 PTHKGGDGGARGPRHTETAGRHCSTPST--TPHGGVSRQCGSNRHVCGRGVSSAGARHRT 640 P H G GG G RH AG S+PST PHGG + GS+ GV++A A + Sbjct: 249 PEHSSGGGGGMGGRHA-AAGEAGSSPSTAAAPHGG--GEGGSS------GVAAAAAAVSS 299 Query: 641 HAGTEAPXPHP 673 A P P P Sbjct: 300 SAVVMDPYPTP 310 >UniRef50_UPI0000EB2227 Cluster: Huntingtin-associated protein 1 (HAP-1) (Neuroan 1).; n=1; Canis lupus familiaris|Rep: Huntingtin-associated protein 1 (HAP-1) (Neuroan 1). 
- Canis familiaris Length = 650 Score = 35.9 bits (79), Expect = 1.3 Identities = 27/98 (27%), Positives = 38/98 (38%), Gaps = 9/98 (9%) Frame = +2 Query: 479 GGDGGARGPRHTETAGRHCSTPSTTP---------HGGVSRQCGSNRHVCGRGVSSAGAR 631 G GG+ GP A R ++P+ P H Q +R C G A AR Sbjct: 12 GARGGSAGPSAVPLAPRPAASPAPEPSARPERGPAHAPAGAQAAGSRSACVSG-PQARAR 70 Query: 632 HRTHAGTEAPXPHPARXTAGTRXDQXIPHSRIASWSSF 745 + AG+ A P + + Q +P S A W+ F Sbjct: 71 PTSKAGSAAGAPRTSTFSVLQGNAQAVPRSLDAPWTRF 108 >UniRef50_Q7EZ37 Cluster: Putative uncharacterized protein B1015H11.133; n=1; Oryza sativa (japonica cultivar-group)|Rep: Putative uncharacterized protein B1015H11.133 - Oryza sativa subsp. japonica (Rice) Length = 162 Score = 35.9 bits (79), Expect = 1.3 Identities = 25/75 (33%), Positives = 32/75 (42%), Gaps = 3/75 (4%) Frame = +2 Query: 440 TRRSVSCD-SPTHKGGDGG--ARGPRHTETAGRHCSTPSTTPHGGVSRQCGSNRHVCGRG 610 T R SC S H G DG A A R C +T G +R+C + R C RG Sbjct: 10 TARHASCQGSARHDGWDGDRDATVAARQPIAVRPCGGGATADGGSAARRCSTRRWQC-RG 68 Query: 611 VSSAGARHRTHAGTE 655 + AGA +G + Sbjct: 69 AAEAGAVRHDGSGAD 83 >UniRef50_Q23QU7 Cluster: U-box domain containing protein; n=1; Tetrahymena thermophila SB210|Rep: U-box domain containing protein - Tetrahymena thermophila SB210 Length = 1130 Score = 35.9 bits (79), Expect = 1.3 Identities = 17/48 (35%), Positives = 26/48 (54%) Frame = +3 Query: 126 CAISNXXXXXXXXXXTSGAVFERRIIEKYIIENGVDPINGKELRVEDL 269 CAIS +S V ER II+K +++N +DP N L+++ L Sbjct: 927 CAISLDILKDPVMLPSSKCVVERSIIKKALLDNEIDPFNRSPLKIDQL 974 >UniRef50_Q8U2U5 Cluster: Putative uncharacterized protein PF0734; n=1; Pyrococcus furiosus|Rep: Putative uncharacterized protein PF0734 - Pyrococcus furiosus Length = 371 Score = 35.9 bits (79), Expect = 1.3 Identities = 28/88 (31%), Positives = 49/88 (55%), Gaps = 5/88 (5%) Frame = -1 Query: 343 LFSVAGIDVALGGFGFTIGGVL-ISIRSSTLNSLPLIG----STPFSIMYFSMILLSNTA 179 +FSV+GI +A+GGF F++ +L +S+ S+ + L G S + MI++ T+ Sbjct: 92 VFSVSGIMIAIGGFEFSVNDILNMSLVSTIYSCFLLFGVFLISGLIPAIILLMIVVIQTS 151 Query: 
178 PDVGDTTGTSETSFDIAQ*RDILLMISL 95 VG++ SE I + + L++ISL Sbjct: 152 LIVGNSGSISELVMGIPK-TENLILISL 178 >UniRef50_UPI0000D99B29 Cluster: PREDICTED: hypothetical protein; n=1; Macaca mulatta|Rep: PREDICTED: hypothetical protein - Macaca mulatta Length = 331 Score = 35.5 bits (78), Expect = 1.7 Identities = 27/73 (36%), Positives = 30/73 (41%), Gaps = 4/73 (5%) Frame = +2 Query: 467 PTHKGGDGGARGPRHTETAGRHCSTPSTTPHGGVSRQCGSNRHVCGRGVS----SAGARH 634 P GG GA GPR R +T +P G V + R V G GVS S G RH Sbjct: 128 PERSGGGEGAPGPR------REPATRGRSPAGRVREEGAGARGVRGGGVSQLKVSPGGRH 181 Query: 635 RTHAGTEAPXPHP 673 R G P P Sbjct: 182 RAALGLSRDSPPP 194 >UniRef50_Q8IWN7 Cluster: Retinitis pigmentosa 1-like 1 protein; n=8; Catarrhini|Rep: Retinitis pigmentosa 1-like 1 protein - Homo sapiens (Human) Length = 2480 Score = 35.5 bits (78), Expect = 1.7 Identities = 25/72 (34%), Positives = 33/72 (45%), Gaps = 2/72 (2%) Frame = +2 Query: 407 C*TRIKSCTVPTRRSVSCDSPTHKGGDGGARGPRHTETAGRHCS--TPSTTPHGGVSRQC 580 C T + P RRS SC S T ARGP + G TPS P+ G SR+ Sbjct: 853 CPTPPRGRPCPQRRSSSCGS-TGSSHQSTARGPGGSPQEGTRQPGPTPSPGPNSGASRRS 911 Query: 581 GSNRHVCGRGVS 616 +++ RG+S Sbjct: 912 SASQGAGSRGLS 923 >UniRef50_UPI0000F2EB03 Cluster: PREDICTED: hypothetical protein; n=1; Monodelphis domestica|Rep: PREDICTED: hypothetical protein - Monodelphis domestica Length = 439 Score = 35.1 bits (77), Expect = 2.2 Identities = 24/66 (36%), Positives = 30/66 (45%), Gaps = 4/66 (6%) Frame = +2 Query: 494 ARGPRHTETAGRHCSTPSTTPHGGVSR-QCGSNRHVCGRGVSSAGARHRT-HAGTE--AP 661 AR + AGR P G V + G+ R C VSSA +RH AG + +P Sbjct: 160 ARSQEEPQRAGRDFQEPPKPASGAVGEGRAGAQRPPCPASVSSASSRHPPGEAGAKRLSP 219 Query: 662 XPHPAR 679 P PAR Sbjct: 220 PPTPAR 225 >UniRef50_A4MH05 Cluster: Oxidoreductase, 2OG-Fe(II) oxygenase family, truncation/internal deletion; n=2; Burkholderia pseudomallei|Rep: Oxidoreductase, 2OG-Fe(II) oxygenase family, truncation/internal deletion - Burkholderia pseudomallei 305 Length = 119 Score = 35.1 bits (77), Expect = 2.2 Identities = 
29/89 (32%), Positives = 35/89 (39%), Gaps = 2/89 (2%) Frame = +2 Query: 443 RRSVSCDSPTHKGGDGGARGPRHTETAGRHCSTPSTTPHGGVSRQ--CGSNRHVCGRGVS 616 R +V+ +PT G GARG R + S S P G R+ GS R GRG Sbjct: 26 RATVAARAPTRVGWPRGARGSRRSSPG----SAGSRRPTSGSRRRGMSGSVRRAPGRGSC 81 Query: 617 SAGARHRTHAGTEAPXPHPARXTAGTRXD 703 A R T A T P R + D Sbjct: 82 LAARRASTRAPTSRSARRPRRRPSRRESD 110 >UniRef50_A3B1M1 Cluster: Putative uncharacterized protein; n=1; Oryza sativa (japonica cultivar-group)|Rep: Putative uncharacterized protein - Oryza sativa subsp. japonica (Rice) Length = 331 Score = 35.1 bits (77), Expect = 2.2 Identities = 26/78 (33%), Positives = 33/78 (42%), Gaps = 4/78 (5%) Frame = +2 Query: 434 VPTRRSVSC-DSPTHKG--GDGGARGPRHTETAGRHCSTPSTTPHGGVSRQCGSNRHVCG 604 +P R S + T KG GD G G E G H +TP+ HG + + G Sbjct: 97 LPCRLKASTIHAETGKGHHGDAGGHGATTMEAGGEHGATPAGGEHGATTMEAGGE----- 151 Query: 605 RGVSSAGARH-RTHAGTE 655 G + AG H T AG E Sbjct: 152 HGATPAGGEHGATPAGGE 169 >UniRef50_Q4TF27 Cluster: Chromosome undetermined SCAF4887, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome undetermined SCAF4887, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 133 Score = 34.7 bits (76), Expect = 3.0 Identities = 21/62 (33%), Positives = 22/62 (35%), Gaps = 3/62 (4%) Frame = +2 Query: 467 PTHKGGDGGARGPRHTETAGRHCSTPSTTPHGGVSRQCGSNRHVCGR---GVSSAGARHR 637 P H GD G GP G H S PH G + Q GR G RHR Sbjct: 26 PQHPPGDPGEAGPPAGRAEGAHPSAQDAQPHPGAAEQHQPQGDQRGRRQPGRRQQSTRHR 85 Query: 638 TH 643 H Sbjct: 86 LH 87 >UniRef50_A3SIB5 Cluster: Putative uncharacterized protein; n=2; Roseovarius|Rep: Putative uncharacterized protein - Roseovarius nubinhibens ISM Length = 543 Score = 34.7 bits (76), Expect = 3.0 Identities = 19/42 (45%), Positives = 27/42 (64%) Frame = +3 Query: 405 TARQELSHALYQHDAACRVIARLTKEVTAAREALATLKPQAG 530 +A+Q A + +AA R + LT+++ AAREALA L PQ G Sbjct: 168 SAKQAAQAARLRAEAAERDLKDLTEDLQAAREALAALVPQEG 209 >UniRef50_Q4N4R0 Cluster: Peptidyl-prolyl cis-trans 
isomerase; n=2; Theileria|Rep: Peptidyl-prolyl cis-trans isomerase - Theileria parva Length = 517 Score = 34.7 bits (76), Expect = 3.0 Identities = 12/31 (38%), Positives = 24/31 (77%) Frame = +3 Query: 177 GAVFERRIIEKYIIENGVDPINGKELRVEDL 269 G +F+ I++++I +GV+P+NG +L ++DL Sbjct: 57 GHIFDHDKIKEFVISHGVNPVNGAKLALDDL 87 >UniRef50_UPI0000383927 Cluster: COG1529: Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs; n=1; Magnetospirillum magnetotacticum MS-1|Rep: COG1529: Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs - Magnetospirillum magnetotacticum MS-1 Length = 868 Score = 34.3 bits (75), Expect = 3.9 Identities = 22/60 (36%), Positives = 25/60 (41%), Gaps = 1/60 (1%) Frame = +2 Query: 470 THKGGDGGARGPRHTETAGRHCSTPSTTPH-GGVSRQCGSNRHVCGRGVSSAGARHRTHA 646 T GG + P ETA HC P PH GV+R +N CG GA T A Sbjct: 618 TADGGAYASLSPAVIETALEHCCGPYIVPHVEGVARLVATNNGTCG-AFRGFGANEMTFA 676 >UniRef50_Q9VEG1 Cluster: CG7431-PA; n=2; Sophophora|Rep: CG7431-PA - Drosophila melanogaster (Fruit fly) Length = 631 Score = 34.3 bits (75), Expect = 3.9 Identities = 15/41 (36%), Positives = 19/41 (46%) Frame = +2 Query: 479 GGDGGARGPRHTETAGRHCSTPSTTPHGGVSRQCGSNRHVC 601 GG GG G + T G HC + P GGV G ++ C Sbjct: 378 GGAGGGGGGASSATGGTHCQSLLALPSGGVGGSMGCAKNGC 418 >UniRef50_Q7PWP8 Cluster: ENSANGP00000013932; n=1; Anopheles gambiae str. PEST|Rep: ENSANGP00000013932 - Anopheles gambiae str. 
PEST Length = 412 Score = 34.3 bits (75), Expect = 3.9 Identities = 18/57 (31%), Positives = 24/57 (42%) Frame = +2 Query: 479 GGDGGARGPRHTETAGRHCSTPSTTPHGGVSRQCGSNRHVCGRGVSSAGARHRTHAG 649 GG GGA G T H + P+ + G + G H ++AGA HAG Sbjct: 160 GGSGGAVGGHSVHTPSDHLTPPAQSGSNGGYQGSGGASHGYSNSFANAGAAAGAHAG 216 >UniRef50_Q1JSL3 Cluster: MRNA decapping enzyme, putative precursor; n=1; Toxoplasma gondii|Rep: MRNA decapping enzyme, putative precursor - Toxoplasma gondii Length = 512 Score = 34.3 bits (75), Expect = 3.9 Identities = 24/72 (33%), Positives = 31/72 (43%), Gaps = 1/72 (1%) Frame = +2 Query: 461 DSPTHKGGDGGARGPRHTETAGRHCSTPSTTPHGGVSRQCGSNRHVCGR-GVSSAGARHR 637 + P +GG R PR ++A ST T G+ ++ G NR GR VS AG Sbjct: 319 ERPPSRGGLS-PRSPRERDSAAEASSTAGTVSLVGMLKRSGENREAPGRSSVSGAGLLGS 377 Query: 638 THAGTEAPXPHP 673 G AP P Sbjct: 378 GLLGAPAPAGPP 389 >UniRef50_Q4V2F4 Cluster: Putative uncharacterized protein; n=2; Burkholderia mallei|Rep: Putative uncharacterized protein - Burkholderia mallei (Pseudomonas mallei) Length = 1477 Score = 33.9 bits (74), Expect = 5.2 Identities = 25/77 (32%), Positives = 29/77 (37%) Frame = +2 Query: 497 RGPRHTETAGRHCSTPSTTPHGGVSRQCGSNRHVCGRGVSSAGARHRTHAGTEAPXPHPA 676 R PRH PHG + R R AGAR R H G A H A Sbjct: 714 RDPRHHPRQHDQRGRQDARPHGAEPARAAGRRRRGARAKRRAGARGRLHRG--ARHRHRA 771 Query: 677 RXTAGTRXDQXIPHSRI 727 R + TR PH+R+ Sbjct: 772 RRSDRTRG----PHARV 784 >UniRef50_Q166N8 Cluster: Putative uncharacterized protein; n=1; Roseobacter denitrificans OCh 114|Rep: Putative uncharacterized protein - Roseobacter denitrificans (strain ATCC 33942 / OCh 114) (Erythrobactersp. 
(strain OCh 114)) (Roseobacter denitrificans) Length = 322 Score = 33.9 bits (74), Expect = 5.2 Identities = 25/79 (31%), Positives = 35/79 (44%), Gaps = 4/79 (5%) Frame = +2 Query: 437 PTRRSVSCDSPTHKGGDGGARGPRHTETAGRHCSTPSTTP--HGGVSRQCGSNRHVC--G 604 P + + + D+ KGGDGG +G H + + + C P P H ++ NR V Sbjct: 184 PDQENSAHDAGNLKGGDGG-KGRGHGDASPQRCE-PREPPDNHRAAKQESQDNRRVLKGR 241 Query: 605 RGVSSAGARHRTHAGTEAP 661 R V G R R H G P Sbjct: 242 RYVQHVGFRGRHHDGKPQP 260 >UniRef50_Q0MX76 Cluster: Glycosyl transferase; n=12; Bacteria|Rep: Glycosyl transferase - consortium cosmid clone pGZ1 Length = 531 Score = 33.9 bits (74), Expect = 5.2 Identities = 25/94 (26%), Positives = 34/94 (36%), Gaps = 1/94 (1%) Frame = +2 Query: 452 VSCDSPTHKGGDGGARGPRHTET-AGRHCSTPSTTPHGGVSRQCGSNRHVCGRGVSSAGA 628 V ++ +H GAR H T AG H T + T H G H + A Sbjct: 437 VGPNAVSHSDAYAGARAGAHAGTHAGTHAGTHAGT-HAGTHAGTHGGAHAGTHASAHAAT 495 Query: 629 RHRTHAGTEAPXPHPARXTAGTRXDQXIPHSRIA 730 THA A A+ + + I H+R A Sbjct: 496 HAATHASARAGTRSKAQAASSRASNAGIDHARTA 529 >UniRef50_Q86YZ3 Cluster: Hornerin; n=8; Theria|Rep: Hornerin - Homo sapiens (Human) Length = 2850 Score = 33.9 bits (74), Expect = 5.2 Identities = 20/57 (35%), Positives = 26/57 (45%), Gaps = 1/57 (1%) Frame = +2 Query: 482 GDGGARGPRHTETAGRHCSTPSTTPHG-GVSRQCGSNRHVCGRGVSSAGARHRTHAG 649 G G + P H + +PS HG G R S RH G G SS G H++ +G Sbjct: 884 GSGSGQSPGHGQRGSGSGQSPSYGRHGSGSGRSSSSGRHGSGSGQSS-GFGHKSSSG 939 Score = 33.5 bits (73), Expect = 6.9 Identities = 18/57 (31%), Positives = 24/57 (42%), Gaps = 1/57 (1%) Frame = +2 Query: 482 GDGGARGPRHTETAGRHCSTPSTTPHG-GVSRQCGSNRHVCGRGVSSAGARHRTHAG 649 G G + P H + +PS HG G R S +H G G SS H + +G Sbjct: 1119 GSGSGQSPGHGQRGSGSRQSPSYGRHGSGSGRSSSSGQHGSGLGESSGFGHHESSSG 1175 Score = 33.5 bits (73), Expect = 6.9 Identities = 18/57 (31%), Positives = 24/57 (42%), Gaps = 1/57 (1%) Frame = +2 Query: 482 GDGGARGPRHTETAGRHCSTPSTTPHG-GVSRQCGSNRHVCGRGVSSAGARHRTHAG 649 G G + P H + +PS HG G R S +H G G SS H + +G Sbjct: 1589 
GSGSGQSPGHGQRGSGSRQSPSYGRHGSGSGRSSSSGQHGSGLGESSGFGHHESSSG 1645 Score = 33.5 bits (73), Expect = 6.9 Identities = 18/57 (31%), Positives = 24/57 (42%), Gaps = 1/57 (1%) Frame = +2 Query: 482 GDGGARGPRHTETAGRHCSTPSTTPHG-GVSRQCGSNRHVCGRGVSSAGARHRTHAG 649 G G + P H + +PS HG G R S +H G G SS H + +G Sbjct: 2529 GSGSGQSPGHGQRGSGSRQSPSYGRHGSGSGRSSSSGQHGSGLGESSGFGHHESSSG 2585 >UniRef50_UPI0000D9E76F Cluster: PREDICTED: hypothetical protein; n=1; Macaca mulatta|Rep: PREDICTED: hypothetical protein - Macaca mulatta Length = 414 Score = 33.5 bits (73), Expect = 6.9 Identities = 25/81 (30%), Positives = 34/81 (41%), Gaps = 3/81 (3%) Frame = +2 Query: 464 SPTHKGGDGGARGPRHTETAGRHCSTPST-TPHGGVSRQCGSNRHVCGRGVSSA--GARH 634 +P H+G G + P T T G P TPH G ++ H RG + A G+ H Sbjct: 203 TPHHRGAQGPPQVP--TPTTGERRGHPQVPTPHHGGAQGPPPGPHTPPRGSAGATPGSPH 260 Query: 635 RTHAGTEAPXPHPARXTAGTR 697 T G + P P P G + Sbjct: 261 PTR-GVQGPPPSPHTHHGGAQ 280 >UniRef50_Q9RWF2 Cluster: Leucyl aminopeptidase, putative; n=1; Deinococcus radiodurans|Rep: Leucyl aminopeptidase, putative - Deinococcus radiodurans Length = 482 Score = 33.5 bits (73), Expect = 6.9 Identities = 13/28 (46%), Positives = 15/28 (53%) Frame = +1 Query: 604 TWCLVCRSAPPHSRRNGSAXAAPCPXDC 687 TW RS+P H+RR A PCP C Sbjct: 59 TWPQARRSSPEHARRGKRRSAGPCPHQC 86 >UniRef50_Q1R2D4 Cluster: Putative uncharacterized protein; n=3; Escherichia coli|Rep: Putative uncharacterized protein - Escherichia coli (strain UTI89 / UPEC) Length = 129 Score = 33.5 bits (73), Expect = 6.9 Identities = 16/51 (31%), Positives = 20/51 (39%), Gaps = 1/51 (1%) Frame = +2 Query: 458 CDSPTHKGGDGGARGPRHTETAGRHCST-PSTTPHGGVSRQCGSNRHVCGR 607 CD T GDGG R R ++G CS + P + NR R Sbjct: 69 CDPATRPSGDGGKRSSRPDRSSGERCSACVNCQPRSSAATDARKNRDAAAR 119 >UniRef50_A5NNR1 Cluster: LigA; n=2; cellular organisms|Rep: LigA - Methylobacterium sp. 
4-46 Length = 1001 Score = 33.5 bits (73), Expect = 6.9 Identities = 28/90 (31%), Positives = 33/90 (36%), Gaps = 5/90 (5%) Frame = +2 Query: 437 PTRRSVSCDSPTHKGGDGGARGPRHTETAGRHCSTPSTTPHGGVSRQCGSN--RHVCGRG 610 P R P GG GARGPR AGR + G R+ + R GRG Sbjct: 99 PARHGRRAALPPLAGGSAGARGPRRGGLAGRERARRGRGGRGARLRRGADHLGRPRRGRG 158 Query: 611 VSSAGARHRTHAGTEA---PXPHPARXTAG 691 ++ G R A P H AR G Sbjct: 159 LARRGGAPRRAGARPAPARPGMHGARGAGG 188 >UniRef50_A3L7L8 Cluster: Putative uncharacterized protein; n=2; Pseudomonas aeruginosa|Rep: Putative uncharacterized protein - Pseudomonas aeruginosa 2192 Length = 121 Score = 33.5 bits (73), Expect = 6.9 Identities = 20/59 (33%), Positives = 26/59 (44%) Frame = +2 Query: 473 HKGGDGGARGPRHTETAGRHCSTPSTTPHGGVSRQCGSNRHVCGRGVSSAGARHRTHAG 649 H GG GG +G H G + S+ HG + S+R RG+S A A T G Sbjct: 46 HGGGKGGGKGGSHGGNLGGNLGGHSSKGHGSATSGIASSRD--SRGLSQASAISATTPG 102 >UniRef50_Q5DI17 Cluster: SJCHGC05395 protein; n=1; Schistosoma japonicum|Rep: SJCHGC05395 protein - Schistosoma japonicum (Blood fluke) Length = 299 Score = 33.5 bits (73), Expect = 6.9 Identities = 13/32 (40%), Positives = 22/32 (68%) Frame = +3 Query: 171 TSGAVFERRIIEKYIIENGVDPINGKELRVED 266 TSGAV + +++ I + +DPINGK+++ D Sbjct: 239 TSGAVVTKEVVDTVIKKEMIDPINGKKMKPTD 270 >UniRef50_Q5CUL0 Cluster: Ubiquitin-fusion degadation-2 (UFD2) family protein with a UBOX at the C-terminus; n=2; Cryptosporidium|Rep: Ubiquitin-fusion degadation-2 (UFD2) family protein with a UBOX at the C-terminus - Cryptosporidium parvum Iowa II Length = 1041 Score = 33.5 bits (73), Expect = 6.9 Identities = 13/33 (39%), Positives = 22/33 (66%) Frame = +3 Query: 171 TSGAVFERRIIEKYIIENGVDPINGKELRVEDL 269 TS + +R++IE+ +I +GVDP N L ++L Sbjct: 989 TSSKIMDRKVIERILISDGVDPFNRLPLTKDEL 1021 >UniRef50_A5K4P2 Cluster: Putative uncharacterized protein; n=1; Plasmodium vivax|Rep: Putative uncharacterized protein - Plasmodium vivax Length = 767 Score = 33.5 bits (73), Expect = 6.9 Identities = 21/80 (26%), Positives = 33/80 (41%) Frame = 
+2 Query: 479 GGDGGARGPRHTETAGRHCSTPSTTPHGGVSRQCGSNRHVCGRGVSSAGARHRTHAGTEA 658 G DGG P TP P GG + G+NRH G S + + +H+ + + Sbjct: 68 GDDGGGESP-----PADVFFTPRNAPRGGADK--GANRHAGGHSASHSASHSASHSASHS 120 Query: 659 PXPHPARXTAGTRXDQXIPH 718 H A +A ++ + H Sbjct: 121 -ASHSANQSANQPANRAVTH 139 >UniRef50_Q9ULR0 Cluster: Pre-mRNA-splicing factor ISY1 homolog; n=40; Eukaryota|Rep: Pre-mRNA-splicing factor ISY1 homolog - Homo sapiens (Human) Length = 331 Score = 33.5 bits (73), Expect = 6.9 Identities = 17/46 (36%), Positives = 21/46 (45%) Frame = +2 Query: 443 RRSVSCDSPTHKGGDGGARGPRHTETAGRHCSTPSTTPHGGVSRQC 580 RR + C S T G A PR T S P+T+P G S +C Sbjct: 280 RRLLGCRSGTRPARSGSAPSPRATTAVPMGPSLPTTSPRGAPSCRC 325 >UniRef50_A1GDA8 Cluster: Putative uncharacterized protein; n=1; Salinispora arenicola CNS205|Rep: Putative uncharacterized protein - Salinispora arenicola CNS205 Length = 757 Score = 33.1 bits (72), Expect = 9.1 Identities = 21/60 (35%), Positives = 28/60 (46%), Gaps = 1/60 (1%) Frame = +3 Query: 366 LMLHAFTQRQQLQTARQELSHALYQHDAACRVIARLTKEVTAA-REALATLKPQAGIAAP 542 +ML +QQL RQE + + D R + L E+ A +A AT KP G A P Sbjct: 529 VMLEIAQAQQQLADLRQETWRSRQESDELQRQVTELRLELAGAPAQAAATAKPAVGTAKP 588 >UniRef50_A0H4X8 Cluster: Putative uncharacterized protein; n=1; Chloroflexus aggregans DSM 9485|Rep: Putative uncharacterized protein - Chloroflexus aggregans DSM 9485 Length = 508 Score = 33.1 bits (72), Expect = 9.1 Identities = 21/54 (38%), Positives = 25/54 (46%), Gaps = 1/54 (1%) Frame = +2 Query: 437 PTRRSVSCDSPTHKGGDGGARGPRHTETAGRHCSTPSTTPHG-GVSRQCGSNRH 595 PT ++ C TH G R RH RHC P+T PHG V R+ RH Sbjct: 50 PTCTALPCPDATHTHGIAVPR--RHPHA--RHCRAPTTHPHGIAVPRRHPHARH 99 >UniRef50_Q9XVT2 Cluster: Putative uncharacterized protein; n=1; Caenorhabditis elegans|Rep: Putative uncharacterized protein - Caenorhabditis elegans Length = 541 Score = 33.1 bits (72), Expect = 9.1 Identities = 21/85 (24%), Positives = 36/85 (42%) Frame = +2 Query: 416 RIKSCTVPTRRSVSCDSPTHKGGDGGARGPRHTETAGRHCSTPSTTPHGGVSRQCGSNRH 595 
R+K T+P R +S DS + G R R R + +++ G S++ +H Sbjct: 229 RLKIATIPGARDISTDS-SDAGSSSRRRRARRRSHRSRSRTYSTSSDRGSSSKRSRQQQH 287 Query: 596 VCGRGVSSAGARHRTHAGTEAPXPH 670 + + SS R R+ + P H Sbjct: 288 MKRKTPSSPSRRSRSRSPPPYPPGH 312 >UniRef50_A4HW23 Cluster: Putative uncharacterized protein; n=1; Leishmania infantum|Rep: Putative uncharacterized protein - Leishmania infantum Length = 4231 Score = 33.1 bits (72), Expect = 9.1 Identities = 21/68 (30%), Positives = 29/68 (42%), Gaps = 1/68 (1%) Frame = +2 Query: 497 RGPRHTETAGRHCSTPSTTPHGGVSRQCGSNRHVCGRGVSSAGARHRTHAGTEAPXP-HP 673 R P E G TPSTT G + + H R V + + + G ++P P HP Sbjct: 167 RLPNQPEVIGEALGTPSTTTAAG-TMEASPQVHRAARRVVTTTNQAASAEGEQSPPPRHP 225 Query: 674 ARXTAGTR 697 A T G + Sbjct: 226 AASTVGDK 233 >UniRef50_Q8JZM8 Cluster: Mucin-4 precursor (Pancreatic adenocarcinoma mucin) (Testis mucin) (Ascites sialoglycoprotein) (ASGP) [Contains: Mucin-4 alpha chain (Ascites sialoglycoprotein 1) (ASGP-1); Mucin-4 beta chain (Ascites sialoglycoprotein 2) (ASGP-2)]; n=16; Murinae|Rep: Mucin-4 precursor (Pancreatic adenocarcinoma mucin) (Testis mucin) (Ascites sialoglycoprotein) (ASGP) [Contains: Mucin-4 alpha chain (Ascites sialoglycoprotein 1) (ASGP-1); Mucin-4 beta chain (Ascites sialoglycoprotein 2) (ASGP-2)] - Mus musculus (Mouse) Length = 3443 Score = 33.1 bits (72), Expect = 9.1 Identities = 26/109 (23%), Positives = 43/109 (39%), Gaps = 3/109 (2%) Frame = +2 Query: 281 NTTYSKAKTSKCHINPCHTEKHAR*VGRPDVTCLHTETAVADC*TRIKSCTVP---TRRS 451 +TT +TS I T +H + T T+T++ + + + P T + Sbjct: 1577 STTNMLTRTSSTQITSGDT-RHTTAIVTQGSTPATTQTSLTPSSRNMSTVSTPITSTHKL 1635 Query: 452 VSCDSPTHKGGDGGARGPRHTETAGRHCSTPSTTPHGGVSRQCGSNRHV 598 + H G G + P+ T T STPS T H + + S R + Sbjct: 1636 STLPQRQHTGSKGTSSNPQTTTTPEMTTSTPSATSHDLIETETSSQRTI 1684 >UniRef50_Q6NGC5 Cluster: Phospho-N-acetylmuramoyl-pentapeptide-transferase; n=45; Actinobacteria (class)|Rep: Phospho-N-acetylmuramoyl-pentapeptide-transferase - Corynebacterium diphtheriae Length = 366 Score = 33.1 bits (72), 
Expect = 9.1
 Identities = 25/68 (36%), Positives = 37/68 (54%), Gaps = 2/68 (2%)
 Frame = -1

Query: 334 VAGIDVALGGFGFTIGGVLISI-RSSTLN-SLPLIGSTPFSIMYFSMILLSNTAPDVGDT 161
           V G+ +ALGG GF    + + + R+  LN    LIG    S+++ ++ILL   A   G T
Sbjct: 87  VLGLTLALGGLGFADDFIKLYMGRNLGLNKKAKLIGQLAISLIFGALILLFPNAD--GLT 144

Query: 160 TGTSETSF 137
            G+S  SF
Sbjct: 145 PGSSHLSF 152

  Database: uniref50
    Posted date:  Oct 5, 2007 11:19 AM
  Number of letters in database: 575,637,011
  Number of sequences in database:  1,657,284

Lambda     K      H
   0.318    0.134    0.401

Gapped
Lambda     K      H
   0.279   0.0580    0.190

Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 821,480,871
Number of Sequences: 1657284
Number of extensions: 16559419
Number of successful extensions: 53678
Number of sequences better than 10.0: 74
Number of HSP's better than 10.0 without gapping: 49935
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 53496
length of database: 575,637,011
effective HSP length: 100
effective length of database: 409,908,611
effective search space used: 74603367202
frameshift window, decay const: 40,  0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)
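Note on the statistics: the gapped Lambda and K values listed in this report relate each raw score (the number in parentheses after "bits") to its bit score through the standard Karlin-Altschul formula, bits = (Lambda * S - ln K) / ln 2. The following is a minimal sketch in Python, not part of the BLAST output; the function names are hypothetical, and the E-value estimate assumes the single "effective search space used" figure applies to every hit, so it only approximates the reported Expect values (for example about 1.3e-10 against the reported 1e-10 for the raw score 161).

```python
import math

# Parameters copied from the statistics block of this report.
GAPPED_LAMBDA = 0.279          # "Gapped Lambda"
GAPPED_K = 0.0580              # "Gapped K"
SEARCH_SPACE = 74603367202     # "effective search space used"


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Convert a raw alignment score to a bit score (Karlin-Altschul)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space=SEARCH_SPACE):
    """Rough E-value estimate: search space times 2 to the minus bit score.

    Assumption: one global search space for all hits, which is why the
    result only approximates the Expect values printed in the report.
    """
    return search_space * 2.0 ** (-bit_score(raw_score))


if __name__ == "__main__":
    # Raw scores taken from HSPs in this report.
    for raw in (161, 84, 72):
        print(raw, round(bit_score(raw), 1), f"{expect_value(raw):.2g}")
```

Rounded to one decimal, the bit scores computed this way for raw scores 161, 84 and 72 agree with the 68.9, 37.9 and 33.1 bits printed for the corresponding alignments.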