BLASTP 2.2.12 [Aug-07-2005]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= BGIBMGA000793-TA|BGIBMGA000793-PA|IPR005578|Hrf1
(372 letters)
Database: uniref50
1,657,284 sequences; 575,637,011 total letters
Searching..................................................done
                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value
UniRef50_UPI0000D56AF4 Cluster: PREDICTED: similar to CG5484-PB,... 301 1e-80
UniRef50_Q6NNZ6 Cluster: GM14490p; n=9; Diptera|Rep: GM14490p - ... 277 4e-73
UniRef50_Q5BJH7 Cluster: Yip1-interacting factor homolog B; n=16... 241 2e-62
UniRef50_A7RZV4 Cluster: Predicted protein; n=2; Eumetazoa|Rep: ... 234 2e-60
UniRef50_O95070 Cluster: Protein YIF1A; n=32; Euteleostomi|Rep: ... 234 3e-60
UniRef50_Q6PC24 Cluster: Protein YIF1A; n=17; Deuterostomia|Rep:... 234 3e-60
UniRef50_UPI0000E22B9A Cluster: PREDICTED: Yip1 interacting fact... 220 4e-56
UniRef50_Q2PJ77 Cluster: Putative uncharacterized protein; n=3; ... 166 9e-40
UniRef50_A6NGW1 Cluster: Uncharacterized protein YIF1A; n=7; The... 146 8e-34
UniRef50_Q5C3U4 Cluster: SJCHGC05273 protein; n=1; Schistosoma j... 144 3e-33
UniRef50_Q4P6M5 Cluster: Putative uncharacterized protein; n=1; ... 139 1e-31
UniRef50_Q54XV4 Cluster: Putative uncharacterized protein; n=1; ... 136 1e-30
UniRef50_Q6C4J2 Cluster: Similar to tr|P87148 Schizosaccharomyce... 129 1e-28
UniRef50_Q5KKT8 Cluster: ER to Golgi transport-related protein, ... 127 4e-28
UniRef50_A6R648 Cluster: Hrf1 domain protein; n=12; Pezizomycoti... 123 7e-27
UniRef50_P87148 Cluster: Protein transport protein yif1; n=1; Sc... 123 9e-27
UniRef50_Q4WD83 Cluster: ER to Golgi transport protein Yif1; n=6... 118 3e-25
UniRef50_A3LPC0 Cluster: Predicted protein; n=5; Saccharomycetal... 108 3e-22
UniRef50_Q5BSJ3 Cluster: SJCHGC04045 protein; n=1; Schistosoma j... 97 5e-19
UniRef50_P53845 Cluster: Protein transport protein YIF1; n=5; Sa... 95 2e-18
UniRef50_Q8STM7 Cluster: Similarity to HYPOTHETICAL TRANSMEMBRAN... 90 1e-16
UniRef50_UPI000049A0DA Cluster: conserved hypothetical protein; ... 86 1e-15
UniRef50_Q9FYH6 Cluster: F17F8.24; n=15; Magnoliophyta|Rep: F17F... 85 4e-15
UniRef50_A7TF63 Cluster: Putative uncharacterized protein; n=1; ... 84 5e-15
UniRef50_Q01E40 Cluster: Predicted membrane protein; n=2; Ostreo... 81 6e-14
UniRef50_Q5CS10 Cluster: Protein with 5 transmembrane domains; n... 78 3e-13
UniRef50_Q23K44 Cluster: Hrf1 family protein; n=1; Tetrahymena t... 77 6e-13
UniRef50_A0C565 Cluster: Chromosome undetermined scaffold_15, wh... 74 5e-12
UniRef50_Q4GZ21 Cluster: Putative uncharacterized protein; n=1; ... 65 3e-09
UniRef50_Q4CZI1 Cluster: Putative uncharacterized protein; n=2; ... 60 7e-08
UniRef50_Q4QCT2 Cluster: Putative uncharacterized protein; n=2; ... 59 2e-07
UniRef50_UPI0000498A87 Cluster: hypothetical protein 3.t00011; n... 50 7e-05
UniRef50_A2EGH1 Cluster: Putative uncharacterized protein; n=3; ... 49 2e-04
UniRef50_Q7QSE8 Cluster: GLP_426_11085_11759; n=1; Giardia lambl... 38 0.32
UniRef50_A5K5P4 Cluster: Putative uncharacterized protein; n=1; ... 38 0.32
UniRef50_A6TTN6 Cluster: Mur ligase, middle domain protein; n=1;... 34 6.9
UniRef50_A0QQR3 Cluster: NADH ubiquinone oxidoreductase subunit ... 34 6.9
>UniRef50_UPI0000D56AF4 Cluster: PREDICTED: similar to CG5484-PB,
isoform B; n=2; Endopterygota|Rep: PREDICTED: similar to
CG5484-PB, isoform B - Tribolium castaneum
Length = 360
Score = 301 bits (740), Expect = 1e-80
Identities = 160/356 (44%), Positives = 208/356 (58%), Gaps = 24/356 (6%)
Query: 20 RKAKRVSDVAAMG-TP------APNPAFSPTXXXXXXXXXXXXXXXQESFQLDANQDFNT 72
RK KRVSDV AMG TP PN + SF N
Sbjct: 15 RKVKRVSDVNAMGYTPYDPTQGMPNNLYPNLGSESPAPNPYQQNFASPSFSSTQNYGVPP 74
Query: 73 HXXXXXXXXXXXXXXXMTSPA--QISSMLQQPVVQDMAIQYGNQLAAQGKEAVQRELHKF 130
+ P Q + QP+VQDMA+QYG QLA GK +++E+ K+
Sbjct: 75 NNPQMYGFNPASPYPNAQPPPNPQFGGVFGQPMVQDMALQYGQQLANTGKSMIKQEVEKY 134
Query: 131 VPVSRLRYYFAVDTRYVIRKLMLIVFPYTHKEWMVKYDQDTPVQPRYDINAPDLYIPSMG 190
VPV+ L+YYFAVDT+YV+ KLML+ FP+THK+W VKY+QD PVQPR++INAPDLYIP+M
Sbjct: 135 VPVNSLKYYFAVDTKYVLSKLMLLFFPFTHKDWSVKYEQDGPVQPRFEINAPDLYIPTMA 194
Query: 191 YVTYVLLAGFMLGLQHRFSPEQIGIQASSALAYIIFEMXXXXXXXXXXXXXXXXKTLDLL 250
YVTYVL+AG +LG+Q +F+PEQIGI ASSALA+ + E+ +T DLL
Sbjct: 195 YVTYVLVAGMVLGMQQKFTPEQIGILASSALAWFVVELAVYSCTLYIANIKTTLRTFDLL 254
Query: 251 AYSGYKYTVMISSLLAGLLAGRTGYYCGLLYSSCALSYFLVKTLRLQLLSGSQGPEQPSY 310
A+SGYK+ +I S+L L+ +T YYC L+Y + AL++FLV+TL+ Q+L S Y
Sbjct: 255 AFSGYKFVGIIVSILVSLIGAKTAYYCCLIYVNLALAFFLVRTLKAQVLVESNAQPTSYY 314
Query: 311 GFPNPPYAANPYSDAWDKPTPGGTKRRVYFLLFVAITQPLLCWWLTYHLVASPPGE 366
G D P G KRR+YFLLFVA QP+L WWL++HL+ SP E
Sbjct: 315 G---------------DVAPPTGNKRRLYFLLFVAAVQPVLSWWLSFHLIGSPSPE 355
>UniRef50_Q6NNZ6 Cluster: GM14490p; n=9; Diptera|Rep: GM14490p -
Drosophila melanogaster (Fruit fly)
Length = 405
Score = 277 bits (678), Expect = 4e-73
Identities = 130/263 (49%), Positives = 178/263 (67%), Gaps = 6/263 (2%)
Query: 97 SMLQQPVVQDMAIQYGNQLAAQGKEAVQRELHKFVPVSRLRYYFAVDTRYVIRKLMLIVF 156
+M QQP+VQDMA+QYG +LA QGK+ ++ + K+VPV++L+YYFAVD YV RKL L+ F
Sbjct: 135 AMFQQPIVQDMAMQYGQKLADQGKQIMENQFEKWVPVAKLKYYFAVDNAYVGRKLRLLFF 194
Query: 157 PYTHKEWMVKYDQDTPVQPRYDINAPDLYIPSMGYVTYVLLAGFMLGLQHRFSPEQIGIQ 216
PY HK+W ++YDQ+ PVQPRYD+NAPDLY+P+MGY+TYV++AG +LG+Q RFSPEQ+GIQ
Sbjct: 195 PYMHKDWSLRYDQEHPVQPRYDVNAPDLYLPTMGYITYVIVAGLLLGMQKRFSPEQLGIQ 254
Query: 217 ASSALAYIIFEMXXXXXXXXXXXXXXXXKTLDLLAYSGYKYTVMISSLLAGLLAGRTGYY 276
ASSA+AY IFE+ KTLDLLA++GYKY ++ L+ L ++GYY
Sbjct: 255 ASSAMAYSIFELVIYSLALYVMNVKTSLKTLDLLAFTGYKYVNIVVCLMVSTLFFKSGYY 314
Query: 277 CGLLYSSCALSYFLVKTLRLQLLSGSQGPEQPSYGFPNPPYAANPYSDAWDKPTPGGTKR 336
L Y+S + +F+++TLR +LL P PS PY NP + GG KR
Sbjct: 315 IALAYTSFSFGFFMLRTLRTKLLQ-DNSPAAPSGAINYDPY-GNPQQFDYS----GGKKR 368
Query: 337 RVYFLLFVAITQPLLCWWLTYHL 359
++YFL + Q L + L+ HL
Sbjct: 369 KLYFLFMIVAGQALFAFLLSKHL 391
>UniRef50_Q5BJH7 Cluster: Yip1-interacting factor homolog B; n=16;
Theria|Rep: Yip1-interacting factor homolog B - Homo
sapiens (Human)
Length = 314
Score = 241 bits (590), Expect = 2e-62
Identities = 121/270 (44%), Positives = 170/270 (62%), Gaps = 21/270 (7%)
Query: 91 SPAQISSMLQQPVVQDMAIQYGNQLAAQGKEAVQRELHKFVPVSRLRYYFAVDTRYVIRK 150
SP ++ L PV +MA+ YG+ LAAQGKE V + + +F+P+++L+YYFAVDT YV RK
Sbjct: 65 SPTPHAAFLADPV-SNMAMAYGSSLAAQGKELVDKNIDRFIPITKLKYYFAVDTMYVGRK 123
Query: 151 LMLIVFPYTHKEWMVKYDQDTPVQPRYDINAPDLYIPSMGYVTYVLLAGFMLGLQHRFSP 210
L L+ FPY H++W V+Y QDTPV PR+D+NAPDLYIP+M ++TYVL+AG LG Q RFSP
Sbjct: 124 LGLLFFPYLHQDWEVQYQQDTPVAPRFDVNAPDLYIPAMAFITYVLVAGLALGTQDRFSP 183
Query: 211 EQIGIQASSALAYIIFEMXXXXXXXXXXXXXXXXKTLDLLAYSGYKYTVMISSLLAGLLA 270
+ +G+QASSALA++ E+ T+DL+A+ GYKY MI +L GLL
Sbjct: 184 DLLGLQASSALAWLTLEVLAILLSLYLVTVNTDLTTIDLVAFLGYKYVGMIGGVLMGLLF 243
Query: 271 GRTGYYCGLLYSSCALSYFLVKTLRLQLLSGSQGPEQPSYGFPNPPYAANPYSDAWDKPT 330
G+ GYY L + A+ F+++TLRL++L+ + P G N
Sbjct: 244 GKIGYYLVLGWCCVAIFVFMIRTLRLKILADAAAEGVPVRGARN---------------- 287
Query: 331 PGGTKRRVYFLLFVAITQPLLCWWLTYHLV 360
+ R+Y + VA QP+L +WLT+HLV
Sbjct: 288 ----QLRMYLTMAVAAAQPMLMYWLTFHLV 313
>UniRef50_A7RZV4 Cluster: Predicted protein; n=2; Eumetazoa|Rep:
Predicted protein - Nematostella vectensis
Length = 248
Score = 234 bits (573), Expect = 2e-60
Identities = 124/268 (46%), Positives = 172/268 (64%), Gaps = 29/268 (10%)
Query: 96 SSMLQQPVVQDMAIQYGNQLAAQGKEAVQRELHKFVPVSRLRYYFAVDTRYVIRKLMLIV 155
+ +QQP+ +MA QYG +A+QGKE V++ L +FV +S+L+YYFAVDT YV++KL L++
Sbjct: 6 ADFMQQPMT-NMAFQYGTNVASQGKEYVEKNLDRFVSISKLKYYFAVDTSYVVKKLGLLL 64
Query: 156 FPYTHKEWMVKYDQDTPVQPRYDINAPDLYIPSMGYVTYVLLAGFMLGLQHR----FSPE 211
FP+THK W V+Y+++ PV PRY++NAPDLYIP M +VTYVL+AG +LG Q+R F+PE
Sbjct: 65 FPFTHKNWAVQYNKEEPVAPRYEVNAPDLYIPVMAFVTYVLVAGLVLGTQNRQVVQFTPE 124
Query: 212 QIGIQASSALAYIIFEMXXXXXXXXXXXXXXXXKTLDLLAYSGYKYTVMISSLLAGLLAG 271
Q+GI ASSAL ++ E+ KT DLLA+ GYKY MI S LAGLL
Sbjct: 125 QLGITASSALIWLFVEIMAILFSMYLCNVQSEIKTFDLLAFCGYKYFGMILSCLAGLLFK 184
Query: 272 RTGYYCGLLYSSCALSYFLVKTLRLQLLSGSQGPEQPSYGFPNPPYAANPYSDAWDKPTP 331
GYYC +Y+S ++FL++TLRL ++ PE SD +
Sbjct: 185 SLGYYCVFIYTSITNAFFLIRTLRLVII-----PET---------------SDGIART-- 222
Query: 332 GGTKRRVYFLLFVAITQPLLCWWLTYHL 359
+KRR+Y LLF+A+ QP ++LT HL
Sbjct: 223 --SKRRIYLLLFIAVLQPFFMFFLTSHL 248
>UniRef50_O95070 Cluster: Protein YIF1A; n=32; Euteleostomi|Rep:
Protein YIF1A - Homo sapiens (Human)
Length = 293
Score = 234 bits (572), Expect = 3e-60
Identities = 121/266 (45%), Positives = 168/266 (63%), Gaps = 24/266 (9%)
Query: 95 ISSMLQQPVVQDMAIQYGNQLAAQGKEAVQRELHKFVPVSRLRYYFAVDTRYVIRKLMLI 154
++ +L P+ ++A+ YG+ +A+ GK+ V +ELH+FV VS+L+Y+FAVDT YV +KL L+
Sbjct: 51 VNHLLGDPMA-NVAMAYGSSIASHGKDMVHKELHRFVSVSKLKYFFAVDTAYVAKKLGLL 109
Query: 155 VFPYTHKEWMVKYDQDTPVQPRYDINAPDLYIPSMGYVTYVLLAGFMLGLQHRFSPEQIG 214
VFPYTH+ W V+Y +D P+ PR D+NAPDLYIP+M ++TYVLLAG LG+Q RFSPE +G
Sbjct: 110 VFPYTHQNWEVQYSRDAPLPPRQDLNAPDLYIPTMAFITYVLLAGMALGIQKRFSPEVLG 169
Query: 215 IQASSALAYIIFEMXXXXXXXXXXXXXXXXKTLDLLAYSGYKYTVMISSLLAGLLAGRTG 274
+ AS+AL +++ E+ T LLAYSGYKY MI S+L GLL G G
Sbjct: 170 LCASTALVWVVMEVLALLLGLYLATVRSDLSTFHLLAYSGYKYVGMILSVLTGLLFGSDG 229
Query: 275 YYCGLLYSSCALSYFLVKTLRLQLLSGSQGPEQPSYGFPNPPYAANPYSDAWDKPTPGGT 334
YY L ++S AL YF+V++LR L GP+ S G P P
Sbjct: 230 YYVALAWTSSALMYFIVRSLRTAAL----GPD--SMGGPVP-----------------RQ 266
Query: 335 KRRVYFLLFVAITQPLLCWWLTYHLV 360
+ ++Y L A QPL+ +WLT+HLV
Sbjct: 267 RLQLYLTLGAAAFQPLIIYWLTFHLV 292
>UniRef50_Q6PC24 Cluster: Protein YIF1A; n=17; Deuterostomia|Rep:
Protein YIF1A - Danio rerio (Zebrafish) (Brachydanio
rerio)
Length = 307
Score = 234 bits (572), Expect = 3e-60
Identities = 115/266 (43%), Positives = 174/266 (65%), Gaps = 20/266 (7%)
Query: 95 ISSMLQQPVVQDMAIQYGNQLAAQGKEAVQRELHKFVPVSRLRYYFAVDTRYVIRKLMLI 154
+ ++ P+ + A+ YG+ LA QGK+ V +E+++F+ V++L+Y+FAVDT+YV++KL+L+
Sbjct: 61 VGNIFADPMA-NAAMMYGSTLANQGKDIVNKEINRFMSVNKLKYFFAVDTKYVMKKLLLL 119
Query: 155 VFPYTHKEWMVKYDQDTPVQPRYDINAPDLYIPSMGYVTYVLLAGFMLGLQHRFSPEQIG 214
+FPYTH++W V+Y +DTP+ PR+D+NAPDLYIP+M ++TY+LLAG LG+Q RFSPE +G
Sbjct: 120 MFPYTHQDWEVRYHRDTPLTPRHDVNAPDLYIPTMAFITYILLAGMALGIQKRFSPEVLG 179
Query: 215 IQASSALAYIIFEMXXXXXXXXXXXXXXXXKTLDLLAYSGYKYTVMISSLLAGLLAGRTG 274
+ AS+AL ++I E+ T DL+AYSGYKY MI ++ GLL G G
Sbjct: 180 LCASTALVWMIIEVLVMLLSLYLLTVHTDLSTFDLVAYSGYKYVGMILTVFCGLLFGSDG 239
Query: 275 YYCGLLYSSCALSYFLVKTLRLQLLSGSQGPEQPSYGFPNPPYAANPYSDAWDKPTPGGT 334
YY L +SSCAL +F+V++L++++LS S G + A KP
Sbjct: 240 YYVALAWSSCALMFFIVRSLKMKILSSISA---DSMG-----------AGASAKP----- 280
Query: 335 KRRVYFLLFVAITQPLLCWWLTYHLV 360
+ R+Y + A QP + +WLT HLV
Sbjct: 281 RFRLYITVASAAFQPFIIYWLTAHLV 306
>UniRef50_UPI0000E22B9A Cluster: PREDICTED: Yip1 interacting factor
homolog isoform 4; n=2; Catarrhini|Rep: PREDICTED: Yip1
interacting factor homolog isoform 4 - Pan troglodytes
Length = 301
Score = 220 bits (538), Expect = 4e-56
Identities = 111/246 (45%), Positives = 155/246 (63%), Gaps = 9/246 (3%)
Query: 95 ISSMLQQPVVQDMAIQYGNQLAAQGKEAVQRELHKFVPVSRLRYYFAVDTRYVIRKLMLI 154
++ +L P+ ++A+ YG+ +A+ GK+ V +ELH+FV VS+L+Y+FAVDT YV +KL L+
Sbjct: 51 VNHLLGDPMA-NVAMAYGSSIASHGKDMVHKELHRFVSVSKLKYFFAVDTAYVAKKLGLL 109
Query: 155 VFPYTHKEWMVKYDQDTPVQPRYDINAPDLYIPSMGYVTYVLLAGFMLGLQHRFSPEQIG 214
VFPYTH+ W V+Y +D P+ PR D+NAPDLYIP+M ++TYVLLAG LG+Q RFSPE +G
Sbjct: 110 VFPYTHQNWEVQYSRDAPLPPRQDLNAPDLYIPTMAFITYVLLAGMALGIQKRFSPEVLG 169
Query: 215 IQASSALAYIIFEMXXXXXXXXXXXXXXXXKTLDLLAYSGYKYTVMISSLLAGLLAGRTG 274
+ AS+AL +++ E+ T LLAYSGYKY MI S+L GLL G G
Sbjct: 170 LCASTALVWVVMEVLALLLGLYLATVRSDLSTFHLLAYSGYKYVGMILSVLTGLLFGSDG 229
Query: 275 YYCGLLYSSCALSYFLVKTLR----LQLLSGSQGPEQPSYGFPNPPYA---ANPYS-DAW 326
YY L ++S AL YF+V + L L G P + + A P++ AW
Sbjct: 230 YYVALAWTSSALMYFIVSLVHPPIPLPSLEGKPPPPGQASVLLSSRCALCGQQPWAPTAW 289
Query: 327 DKPTPG 332
P+PG
Sbjct: 290 GAPSPG 295
>UniRef50_Q2PJ77 Cluster: Putative uncharacterized protein; n=3;
Caenorhabditis|Rep: Putative uncharacterized protein -
Caenorhabditis elegans
Length = 380
Score = 166 bits (403), Expect = 9e-40
Identities = 101/264 (38%), Positives = 143/264 (54%), Gaps = 29/264 (10%)
Query: 94 QISSMLQQPVVQDMAIQYGNQLAAQGKEAVQRELHKFVPVSRLRYYFAVDTRYVIRKLML 153
Q ++ P++ + A Q+G Q A Q KE +L K++ L+YYFAVD YV +KL +
Sbjct: 90 QPQQLMSDPML-NAAKQFGGQFAEQQKE----KLTKYLGTFNLKYYFAVDNAYVGKKLGI 144
Query: 154 IVFPYTHKEWMVKYDQDT-PVQPRYDINAPDLYIPSMGYVTYVLLAGFMLGLQHRFSPEQ 212
+ FP+ HK+W +K+ P R D+NAPDLYIP M ++TY+L++GF+LG Q RFSPE
Sbjct: 145 LFFPFFHKDWSLKFAGSADPAPAREDVNAPDLYIPLMSFLTYILVSGFVLGTQGRFSPEI 204
Query: 213 IGIQASSALAYIIFEMXXXXXXXXXXXXXXXXKTLDLLAYSGYKYTVMISSLLAGLLAGR 272
+GI S+AL ++I E LAYS YK+ MI LL ++ +
Sbjct: 205 LGILTSNALIWVILENIVIFISKYILNISQSLSVWHSLAYSTYKFAHMIVCLLLFMVGDK 264
Query: 273 TGYYCGLLYSSCALSYFLVKTLRLQLLSGSQGPEQPSYGFPNPPYAANPYSDAWDKPTPG 332
T YY L YSS AL FL++++ + S G SYG +
Sbjct: 265 TFYYGALAYSSLALVIFLLRSVS-HFMFDSSG----SYG------------------SEE 301
Query: 333 GTKRRVYFLLFVAITQPLLCWWLT 356
G KR++ + FV ITQPL+ WWLT
Sbjct: 302 GRKRKLILVAFVVITQPLIMWWLT 325
>UniRef50_A6NGW1 Cluster: Uncharacterized protein YIF1A; n=7;
Theria|Rep: Uncharacterized protein YIF1A - Homo sapiens
(Human)
Length = 241
Score = 146 bits (354), Expect = 8e-34
Identities = 62/113 (54%), Positives = 89/113 (78%), Gaps = 1/113 (0%)
Query: 95 ISSMLQQPVVQDMAIQYGNQLAAQGKEAVQRELHKFVPVSRLRYYFAVDTRYVIRKLMLI 154
++ +L P+ ++A+ YG+ +A+ GK+ V +ELH+FV VS+L+Y+FAVDT YV +KL L+
Sbjct: 51 VNHLLGDPMA-NVAMAYGSSIASHGKDMVHKELHRFVSVSKLKYFFAVDTAYVAKKLGLL 109
Query: 155 VFPYTHKEWMVKYDQDTPVQPRYDINAPDLYIPSMGYVTYVLLAGFMLGLQHR 207
VFPYTH+ W V+Y +D P+ PR D+NAPDLYIP+M ++TYVLLAG LG+Q R
Sbjct: 110 VFPYTHQNWEVQYSRDAPLPPRQDLNAPDLYIPTMAFITYVLLAGMALGIQKR 162
Score = 53.6 bits (123), Expect = 8e-06
Identities = 39/101 (38%), Positives = 52/101 (51%), Gaps = 23/101 (22%)
Query: 260 MISSLLAGLLAGRTGYYCGLLYSSCALSYFLVKTLRLQLLSGSQGPEQPSYGFPNPPYAA 319
MI S+L GLL G GYY L ++S AL YF+V++LR L GP+ S G P P
Sbjct: 163 MILSVLTGLLFGSDGYYVALAWTSSALMYFIVRSLRTAAL----GPD--SMGGPVP---- 212
Query: 320 NPYSDAWDKPTPGGTKRRVYFLLFVAITQPLLCWWLTYHLV 360
+ ++Y L A QPL+ +WLT+HLV
Sbjct: 213 -------------RQRLQLYLTLGAAAFQPLIIYWLTFHLV 240
>UniRef50_Q5C3U4 Cluster: SJCHGC05273 protein; n=1; Schistosoma
japonicum|Rep: SJCHGC05273 protein - Schistosoma
japonicum (Blood fluke)
Length = 237
Score = 144 bits (349), Expect = 3e-33
Identities = 65/161 (40%), Positives = 97/161 (60%)
Query: 97 SMLQQPVVQDMAIQYGNQLAAQGKEAVQRELHKFVPVSRLRYYFAVDTRYVIRKLMLIVF 156
S +Q + D+A +YG+ + +G VQ+ + ++V RL+YYF+V+ YV +K+ +I+F
Sbjct: 50 SFVQNQFIPDLAARYGSAMFDEGANFVQKNVDQYVNRLRLKYYFSVNNSYVAKKIGVILF 109
Query: 157 PYTHKEWMVKYDQDTPVQPRYDINAPDLYIPSMGYVTYVLLAGFMLGLQHRFSPEQIGIQ 216
P+ H +W + YD PV P DINAPDLYIP M +TYVLL G + G Q RFSPE +GI
Sbjct: 110 PFAHTKWAINYDPAGPVPPSDDINAPDLYIPLMATITYVLLCGVIFGFQGRFSPEYLGIL 169
Query: 217 ASSALAYIIFEMXXXXXXXXXXXXXXXXKTLDLLAYSGYKY 257
+S A +++ E+ LD++AY GYK+
Sbjct: 170 SSEAFGWLLLEVLLSLFAIYILNIQNNISYLDIVAYCGYKF 210
>UniRef50_Q4P6M5 Cluster: Putative uncharacterized protein; n=1;
Ustilago maydis|Rep: Putative uncharacterized protein -
Ustilago maydis (Smut fungus)
Length = 412
Score = 139 bits (336), Expect = 1e-31
Identities = 79/225 (35%), Positives = 121/225 (53%), Gaps = 26/225 (11%)
Query: 107 MAIQYGNQLAAQGKEAVQRELHKFVPVSRLRYYFAVDTRYVIRKLMLIVFPYTHKEWMVK 166
M +Q+G +AA G E VQ+ + +P+ L++YF V YV+ KL +++FP+ HK W
Sbjct: 142 MGVQFGQHMAAVGGEYVQKNFNALLPMPVLKHYFNVSNSYVLHKLRIVLFPWRHKPWSRA 201
Query: 167 YDQ------------DTP-------------VQPRYDINAPDLYIPSMGYVTYVLLAGFM 201
+ +TP + PR D+N+PDLYIP+M +VTY+++ +
Sbjct: 202 HRHSAAVGGVGSAYAETPSGIKTASSGAEGFLPPRDDVNSPDLYIPTMAFVTYIIVTSVI 261
Query: 202 LGLQHRFSPEQIGIQASSALAYIIFEMXXXXXXXXXXXXXXXXKTLDLLAYSGYKYTVMI 261
LGL+ RF PE +G++AS ALA I+ E+ +DLLAYSGYK+ +
Sbjct: 262 LGLESRFHPEVLGLRASRALAIILVELAAIKFGTYILNIQGDHTMMDLLAYSGYKFVGTL 321
Query: 262 SSLLAGLLAGR-TGYYCGLLYSSCALSYFLVKTLRLQLLSGSQGP 305
+LL GLL R Y+ LY A ++FL+++LR +L P
Sbjct: 322 ITLLVGLLKVRGLVYWSVFLYCFAANAFFLLRSLRYVVLPDPSSP 366
>UniRef50_Q54XV4 Cluster: Putative uncharacterized protein; n=1;
Dictyostelium discoideum AX4|Rep: Putative
uncharacterized protein - Dictyostelium discoideum AX4
Length = 380
Score = 136 bits (328), Expect = 1e-30
Identities = 76/225 (33%), Positives = 117/225 (52%), Gaps = 10/225 (4%)
Query: 95 ISSMLQQPVVQ---DMAIQYGNQLAAQGKEAVQRELHKFVPVSRLRYYFAVDTRYVIRKL 151
IS + P+ Q + YG L + GK+ V K+ S L+ YF V+ YV K+
Sbjct: 132 ISQISDNPLTQAGLTYGLNYGQTLFSGGKQYVDSNFGKYFSFSTLKSYFNVNNSYVFNKI 191
Query: 152 MLIVFPYTHKEWMVKY----DQDTPVQPRYDINAPDLYIPSMGYVTYVLLAGFMLGLQHR 207
L++FPYT K W + D D+ + PR DINAPDLYIP M ++TY LL GF +G++ +
Sbjct: 192 KLLIFPYTQKTWKRRIGRTSDVDSYLPPRDDINAPDLYIPLMAFITYFLLYGFQMGMEKK 251
Query: 208 FSPEQIGIQASSALAYIIFEMXXXXXXXXXXXXXXXXKTLDLLAYSGYKYTVMISSLLAG 267
FSP+ +G + + + E+ D+++YSGYKY +M+ +A
Sbjct: 252 FSPDYLGACITKGIVFWAIEL-LIFKCGFFFSNSNSIPFYDMISYSGYKYVLMVIFQIAT 310
Query: 268 LLAGRTGYYCGLLYSSCALSYFLVKTLRL--QLLSGSQGPEQPSY 310
+L G Y S ++++F++KTLRL +SG+ P Y
Sbjct: 311 ILLGSYVSYIIKCVLSVSIAFFMLKTLRLVFSSVSGAHDHISPDY 355
>UniRef50_Q6C4J2 Cluster: Similar to tr|P87148 Schizosaccharomyces
pombe; n=1; Yarrowia lipolytica|Rep: Similar to
tr|P87148 Schizosaccharomyces pombe - Yarrowia
lipolytica (Candida lipolytica)
Length = 340
Score = 129 bits (312), Expect = 1e-28
Identities = 67/207 (32%), Positives = 115/207 (55%), Gaps = 10/207 (4%)
Query: 107 MAIQYGNQLAAQGKEAVQRELHKFVPVSRLRYYFAVDTRYVIRKLMLIVFPYTHKEW--- 163
+ +Q G A G+E +++ +K+V VS+LRYYF V YV++KL L++FP+ HK W
Sbjct: 84 VGLQVGRSAVAAGQEYMEKNFNKYVSVSQLRYYFQVSNLYVVKKLGLVLFPFLHKPWTRD 143
Query: 164 MVKYDQDTPVQ----PRYDINAPDLYIPSMGYVTYVLLAGFMLGLQHRFSPEQIGIQASS 219
+V+ + ++ R DINAPD+YIP+M + TY++L + G+ F P+ G AS
Sbjct: 144 VVRSETTGEIEGYAPARDDINAPDMYIPTMAFTTYIILCSVLSGVHDHFHPQLFGTLASK 203
Query: 220 ALAYIIFEMXXXXXXXXXXXXXXXXKTLDLLAYSGYKYTVMISSLLAGLLAGRTGYYCGL 279
A++ ++FE+ + D AY+GYK ++ ++LA L G T G+
Sbjct: 204 AVSVMVFEL--LVLRLATYLLSADSQLFDFAAYAGYKLVGVLITILAASLTGSTYVKWGV 261
Query: 280 -LYSSCALSYFLVKTLRLQLLSGSQGP 305
LY+ A + FL+++++ ++ P
Sbjct: 262 FLYTYIANAMFLLRSIKYLIIPDGTSP 288
>UniRef50_Q5KKT8 Cluster: ER to Golgi transport-related protein,
putative; n=2; Filobasidiella neoformans|Rep: ER to
Golgi transport-related protein, putative - Cryptococcus
neoformans (Filobasidiella neoformans)
Length = 368
Score = 127 bits (307), Expect = 4e-28
Identities = 85/259 (32%), Positives = 129/259 (49%), Gaps = 27/259 (10%)
Query: 107 MAIQYGNQLAAQGKEAVQRELHKFVPVSRLRYYFAVDTRYVIRKLMLIVFPYTHKEWMVK 166
M +Q+G A G+E V++ +++P+ ++ F+V YV+ KL LI+FP+ HK W +
Sbjct: 126 MGMQFGKSAVAAGQEYVEKNFTRYLPLQLIKISFSVTNSYVLNKLRLILFPWRHKPWSRQ 185
Query: 167 YDQDTP-------VQPRYDINAPDLYIPSMGYVTYVLLAGFMLGLQHRFSPEQIGIQASS 219
+ T PR DINAPDLYIP+M VTY LL GLQ RF PE +G+ S
Sbjct: 186 SRRSTDNGAVEGWQAPRDDINAPDLYIPTMALVTYTLLCALASGLQSRFHPEVLGLSLSK 245
Query: 220 ALAYIIFEM-XXXXXXXXXXXXXXXXKTLDLLAYSGYKYTVMISSLLAGLLA-GRTGYYC 277
ALA +I E ++L+ Y GYK+ +I++++ LL G+
Sbjct: 246 ALAVVITEFCAIKLGCYLLDVRGSGASGVELVGYGGYKFVGIIATIVVSLLGLGKMITLG 305
Query: 278 GLLYSSCALSYFLVKTLRLQLLSGSQGPEQPSYGFPNPPYAANPYSDAWDKPTPGGTKRR 337
+Y+ A ++FL+++L+ LL P A+ S + + RR
Sbjct: 306 VFIYTFAANAFFLLRSLKYVLL----------------PDAS--VSSSVTTLSHSQRSRR 347
Query: 338 VYFLLFVAITQPLLCWWLT 356
V FL FVA+ Q L WL+
Sbjct: 348 VQFLFFVAVAQVLWMGWLS 366
>UniRef50_A6R648 Cluster: Hrf1 domain protein; n=12;
Pezizomycotina|Rep: Hrf1 domain protein - Ajellomyces
capsulatus NAm1
Length = 358
Score = 123 bits (297), Expect = 7e-27
Identities = 77/237 (32%), Positives = 125/237 (52%), Gaps = 26/237 (10%)
Query: 99 LQQPVVQDMAIQYGNQLAAQGKEAVQRELHKFVPVSRLRYYFAVDTRYVIRKLMLIVFPY 158
+ P Q M Q G G+E V++ L++++ + L++YF V YV+ K ML++FP+
Sbjct: 70 ISDPTAQ-MGFQVGKSAVMAGQEYVEQNLNRYISIPALKHYFNVSNSYVLNKTMLVLFPW 128
Query: 159 THKEWMVKYDQDTPVQ------------------PRYDINAPDLYIPSMGYVTYVLLAGF 200
HK W + + VQ PR D+N+PD+YIP+M VTY++L+
Sbjct: 129 RHKPWSRQQARLNAVQSSANGQIAQAQYTSIYLPPRDDLNSPDMYIPAMALVTYIILSTA 188
Query: 201 MLGLQHRFSPEQIGIQASSALAYIIFEMXXXXXXXXXXXXXXXXKTLDLLAYSGYKYTVM 260
+ GL+ F PE +G ++ALA +IFE+ + LDL+AYSGYK+ +
Sbjct: 189 LAGLRGVFHPELLGSITTTALAVVIFEILCLKIAMYILSISNDSQLLDLVAYSGYKFVGI 248
Query: 261 ISSLLAG--LLAGR-TGYYCG---LLYSSCALSYFLVKTLRLQLLSGSQGPEQPSYG 311
I +L++ L G+ TG + G Y+ A ++FL+++L+ LL S + P G
Sbjct: 249 IVTLVSSEVLTPGQGTGSWVGWTAFTYTFLANAFFLLRSLKYVLLPDSSS-DSPMRG 304
>UniRef50_P87148 Cluster: Protein transport protein yif1; n=1;
Schizosaccharomyces pombe|Rep: Protein transport protein
yif1 - Schizosaccharomyces pombe (Fission yeast)
Length = 293
Score = 123 bits (296), Expect = 9e-27
Identities = 81/271 (29%), Positives = 127/271 (46%), Gaps = 26/271 (9%)
Query: 93 AQISSMLQQPVVQDMAIQYGNQLAAQGKEAVQRELHKFVPVSRLRYYFAVDTRYVIRKLM 152
A S+ L M Q G G+E V++ K++ +RL +YF V YV+ KL+
Sbjct: 42 ANPSAYLPNSATAQMGFQLGKNAVNAGQEYVEQNFGKWLSTTRLHHYFTVTNSYVVAKLL 101
Query: 153 LIVFPYTHKEWMVKYDQ-------DTPVQPRYDINAPDLYIPSMGYVTYVLLAGFMLGLQ 205
LI+FP+ + W K + + P D+N+PD+YIP M + T++LL + GLQ
Sbjct: 102 LIIFPWRRRSWARKLRRSEINGSAEGYCPPAEDLNSPDMYIPLMAFTTHILLLCALAGLQ 161
Query: 206 HRFSPEQIGIQASSALAYIIFEMXXXXXXXXXXXXXXXXKTLDLLAYSGYKYTVMISSLL 265
F PE G++AS A A ++ E + LDLLA+SGYK+ +I + L
Sbjct: 162 DDFQPELFGLRASKACAVVLVEFLATRLGCYLLNISSQSQVLDLLAFSGYKFVGLILTSL 221
Query: 266 AGLLAGRTGYYCGLLYSSCALSYFLVKTLRLQLLSGSQGPEQPSYGFPNPPYAANPYSDA 325
+ L LY A ++FL+++L+ +L P A N +
Sbjct: 222 SKLFEMPWVTRFVFLYMYLATAFFLLRSLKYAVL-------------PESTMAINATITS 268
Query: 326 WDKPTPGGTKRRVYFLLFVAITQPLLCWWLT 356
+ RR+YFL F+A +Q L + L+
Sbjct: 269 HQR------SRRIYFLFFIAASQILFMYVLS 293
>UniRef50_Q4WD83 Cluster: ER to Golgi transport protein Yif1; n=6;
Pezizomycotina|Rep: ER to Golgi transport protein Yif1 -
Aspergillus fumigatus (Sartorya fumigata)
Length = 366
Score = 118 bits (283), Expect = 3e-25
Identities = 71/222 (31%), Positives = 117/222 (52%), Gaps = 25/222 (11%)
Query: 96 SSMLQQPVVQDMAIQYGNQLAAQGKEAVQRELHKFVPVSRLRYYFAVDTRYVIRKLMLIV 155
S + P Q M Q G A G+E +++ +++V + L++YF V YV+ KL L++
Sbjct: 67 SGFINDPTAQ-MGFQVGKTAMAAGQEYMEQNFNRYVSIPALKHYFNVSNSYVLNKLALVL 125
Query: 156 FPYTHKEWMVKYDQDTP------------------VQPRYDINAPDLYIPSMGYVTYVLL 197
FP+ HK W + + T + PR D+N+PD+YIP M VTY+LL
Sbjct: 126 FPWRHKPWSRQQARLTTSSAGPNGQIAQQQYSSMFLPPRDDLNSPDMYIPVMALVTYILL 185
Query: 198 AGFMLGLQHRFSPEQIGIQASSALAYIIFEMXXXXXXXXXXXXXXXXKTLDLLAYSGYKY 257
+ + G + +F PE +G ++A+A I+FE+ + LDL+AYSGYK+
Sbjct: 186 SAVLAGFRGQFHPELLGSITTTAIAVIVFEILCLKLAMYILSINNESQLLDLVAYSGYKF 245
Query: 258 TVMISSLLAG--LLAGR-TGYYCG---LLYSSCALSYFLVKT 293
+I++L+ L GR TG + G +Y+ A ++FL+ +
Sbjct: 246 VGIIATLVMSEILTPGRGTGGWVGWVVFMYTFLANAFFLLSS 287
>UniRef50_A3LPC0 Cluster: Predicted protein; n=5;
Saccharomycetales|Rep: Predicted protein - Pichia
stipitis (Yeast)
Length = 334
Score = 108 bits (259), Expect = 3e-22
Identities = 70/215 (32%), Positives = 105/215 (48%), Gaps = 19/215 (8%)
Query: 106 DMAIQYGNQLAAQGKEA----VQRELHKFVP-VSRLRYYFAVDTRYVIRKLMLIVFPYTH 160
D A +Q A G E+ +Q+ F+P S L+YYF V YV RK++L++FPY +
Sbjct: 85 DPATSLASQFARSGFESSNQYLQQNFGSFIPGTSDLKYYFQVSNSYVTRKILLVLFPYRN 144
Query: 161 KEWMVKYDQDTP-----------VQPRYDINAPDLYIPSMGYVTYVLLAGFMLGLQHRFS 209
K W Q+ P +D+NAPDLYIP M +VTY+LL GL +F
Sbjct: 145 KNWNRLTSQEATGDPSPNGQTSYAPPSHDVNAPDLYIPLMSFVTYILLWAAFQGLNEKFH 204
Query: 210 PEQIGIQASSALAYIIFEMXXXXXXXXXXXXXXXXKTLDLLAYSGYKYTVMISSLLAGLL 269
P+ G AS LA+ + ++ DL+A+S YKY V+I L L
Sbjct: 205 PKLFGYLASQTLAFSVVDI-AFFKIGLYLLNCSQSSMWDLVAFSSYKYVVIIVLLCWKHL 263
Query: 270 AGR--TGYYCGLLYSSCALSYFLVKTLRLQLLSGS 302
G Y+ ++ + L+ FL+++L+ +L S
Sbjct: 264 VGNGWVSYFPVVIVLTINLAVFLMRSLKFLVLPNS 298
>UniRef50_Q5BSJ3 Cluster: SJCHGC04045 protein; n=1; Schistosoma
japonicum|Rep: SJCHGC04045 protein - Schistosoma
japonicum (Blood fluke)
Length = 137
Score = 97.5 bits (232), Expect = 5e-19
Identities = 55/160 (34%), Positives = 85/160 (53%), Gaps = 24/160 (15%)
Query: 201 MLGLQHRFSPEQIGIQASSALAYIIFEMXXXXXXXXXXXXXXXXKTLDLLAYSGYKYTVM 260
+ G Q RFSPE +GI +S A +++ E+ LD++AY GYK+ M
Sbjct: 1 IFGFQGRFSPEYLGILSSEAFGWLLLEVLLSLFAIYILNIQNNISYLDIVAYCGYKFVSM 60
Query: 261 ISSLLAGLLAGRTGYYCGLLYSSCALSYFLVKTLRLQLLSGSQGPEQPSYGFPNPPYAAN 320
I L++ + R GYY GLLY S AL++FL+++L+L++L
Sbjct: 61 IVVLISYITLDRPGYYFGLLYVSVALAFFLIRSLKLKIL--------------------- 99
Query: 321 PYSDAWDKPTPGGTKRRVYFLLFVAITQPLLCWWLTYHLV 360
P+++A+ KRR+Y LL +A+ QPL+ WWLT +V
Sbjct: 100 PHAEAYPSEC---NKRRIYLLLLIALVQPLMMWWLTRRVV 136
>UniRef50_P53845 Cluster: Protein transport protein YIF1; n=5;
Saccharomycetales|Rep: Protein transport protein YIF1 -
Saccharomyces cerevisiae (Baker's yeast)
Length = 314
Score = 95.5 bits (227), Expect = 2e-18
Identities = 63/193 (32%), Positives = 96/193 (49%), Gaps = 11/193 (5%)
Query: 118 QGKEAVQRELHKFVPVSRLRYYFAVDTRYVIRKLMLIVFPYTH--KEWMVKYDQDTPVQP 175
Q +E V + ++ YF V TRYVI KL LI+ P+ + K W D + P
Sbjct: 86 QFQETVNKATANAAGSQQISTYFQVSTRYVINKLKLILVPFLNGTKNWQRIMDSGNFLPP 145
Query: 176 RYDINAPDLYIPSMGYVTYVLLAGFMLGLQHRFSPEQIGIQASSALAYIIFEMXXXXXXX 235
R D+N+PD+Y+P MG VTY+L+ GL+ F+PE + + SS LA++ ++
Sbjct: 146 RDDVNSPDMYMPIMGLVTYILIWNTQQGLKGSFNPEDLYYKLSSTLAFVCLDLLILKLGL 205
Query: 236 XXXXXXX--XXKTLDLLAYSGYKYTVMISSLLAGLLAGRT-GYYCGLL---YSSCALSYF 289
++LL Y GYK+ +I LA LL T + +L Y A F
Sbjct: 206 YLLIDSKIPSFSLVELLCYVGYKFVPLI---LAQLLTNVTMPFNLNILIKFYLFIAFGVF 262
Query: 290 LVKTLRLQLLSGS 302
L+++++ LLS S
Sbjct: 263 LLRSVKFNLLSRS 275
>UniRef50_Q8STM7 Cluster: Similarity to HYPOTHETICAL TRANSMEMBRANE
PROTEIN YNO3_YEAST; n=1; Encephalitozoon cuniculi|Rep:
Similarity to HYPOTHETICAL TRANSMEMBRANE PROTEIN
YNO3_YEAST - Encephalitozoon cuniculi
Length = 206
Score = 89.8 bits (213), Expect = 1e-16
Identities = 62/207 (29%), Positives = 103/207 (49%), Gaps = 16/207 (7%)
Query: 107 MAIQYGNQLAAQGKEAVQRELHKFVPVSRLRYYFAVDTRYVIRKLMLIVFPYTHKEWMVK 166
M I+ G + + E R L V + R YF +D +V++KL+LI+FP+ +KEW
Sbjct: 1 MEIEIGKEAIRKSTEYASRGLGG-VSLKPFRTYFDIDNTFVLKKLVLILFPFNNKEWT-- 57
Query: 167 YDQDTPVQPRYDINAPDLYIPSMGYVTYVLLAGFMLGLQHRFSPEQIGIQASSALAYIIF 226
D R P+LY+P+M +++Y+LL LGL+ FSPE++GI + + +
Sbjct: 58 --GDDEGMAR-----PELYVPAMSFISYILLRALYLGLEGMFSPERLGIVFTR--LFFLE 108
Query: 227 EMXXXXXXXXXXXXXXXXKTLDLLAYSGYKYTVMISSLLAGLLAGRTGYYCGLLYSSCAL 286
+ TLD++AYSGYKY ++ LL L R G +Y +
Sbjct: 109 AVCIALTRISGYFVDVGLSTLDVVAYSGYKYVIV---LLLQLNKMRYVQVIGGMYLYVSF 165
Query: 287 SYFLVKTLRLQLL-SGSQGPEQPSYGF 312
FL ++L+ +++ G++ + Y F
Sbjct: 166 FVFLSRSLKRRVMDKGAERMRRAYYLF 192
>UniRef50_UPI000049A0DA Cluster: conserved hypothetical protein;
n=1; Entamoeba histolytica HM-1:IMSS|Rep: conserved
hypothetical protein - Entamoeba histolytica HM-1:IMSS
Length = 265
Score = 86.2 bits (204), Expect = 1e-15
Identities = 55/182 (30%), Positives = 86/182 (47%), Gaps = 7/182 (3%)
Query: 118 QGKEAVQRELHKFVPVSRLRYYFAVDTRYVIRKLMLIVFPYTHK-EWMVKY----DQDTP 172
QG + ++ +L RYYF V+T +V++K+++I+ PY W KY DQ
Sbjct: 50 QGDQLLKSQLGGIFSFDAWRYYFNVNTSFVLKKILMIIMPYPFLGTWERKYVIGEDQSKL 109
Query: 173 VQ-PRYDINAPDLYIPSMGYVTYVLLAGFMLGLQHRFSPEQIGIQASSALAYIIFEMXXX 231
P+ DI APDLYIP MG+++YVL GF G + F+PE + + + L I E+
Sbjct: 110 YNVPQEDIYAPDLYIPLMGFISYVLAIGFYYGSKGTFTPETLSMTTTLCLILISLEVMIL 169
Query: 232 XXXXXXXXXXXXXKTLDLLAYSGYKYTVMISSLLAGLLAGRTGYYCGLLYSSCALSYFLV 291
+ L+Y Y + ++ G L Y LL+ + ++FL
Sbjct: 170 KFLEYMLFNYSSDFRI-YLSYVSYVFVPVLMCTFVGSLQIPYLTYIALLFFGTSYAFFLY 228
Query: 292 KT 293
KT
Sbjct: 229 KT 230
>UniRef50_Q9FYH6 Cluster: F17F8.24; n=15; Magnoliophyta|Rep:
F17F8.24 - Arabidopsis thaliana (Mouse-ear cress)
Length = 286
Score = 84.6 bits (200), Expect = 4e-15
Identities = 62/209 (29%), Positives = 97/209 (46%), Gaps = 23/209 (11%)
Query: 111 YGNQLAAQGKEAVQRELHKFVPVSRLRYYFAVDTRYVIRKLMLIVFPYTHKEW------M 164
YG ++ E VQ + ++ S +YYF V+ +YV KL +++FP+ H+ +
Sbjct: 45 YGERILGSSSEYVQSNISRYF--SDPQYYFQVNDQYVRNKLKVVLFPFLHRPFNCTIVGS 102
Query: 165 VKYDQ------DTPV-------QPRYDINAPDLYIPSMGYVTYVLLAGFMLGLQHRFSPE 211
Q PV P YDINAPDLYIP M + TYV+LAG LGL +F+PE
Sbjct: 103 ASNPQGHWTRISEPVGGRLSYKPPIYDINAPDLYIPFMAFGTYVVLAGLSLGLNGKFTPE 162
Query: 212 QIGIQASSALAYIIFEMXXXXXXXXXXXXXXXXKTLDLLAYSGYKYTVMISSLLAGLLAG 271
+ L ++ LD++AY GY + + + A ++ G
Sbjct: 163 ALNWLFVKGLVGWFLQV-MLLKVTLLSLGSGEAPLLDIVAYGGYAFAGLCLAGFAKIMWG 221
Query: 272 RTGYYCGLLYSSCALSYFLVKTLRLQLLS 300
+ YY + ++ FLVKT++ L +
Sbjct: 222 YS-YYALMPWTCLCTGIFLVKTMKRVLFA 249
>UniRef50_A7TF63 Cluster: Putative uncharacterized protein; n=1;
Vanderwaltozyma polyspora DSM 70294|Rep: Putative
uncharacterized protein - Vanderwaltozyma polyspora DSM
70294
Length = 322
Score = 84.2 bits (199), Expect = 5e-15
Identities = 62/215 (28%), Positives = 100/215 (46%), Gaps = 23/215 (10%)
Query: 105 QDMAIQYG-----NQLAAQGKEAVQRELHKFVP-VSRLRYYFAVDTRYVIRKLMLIVFPY 158
Q MA Q G N + Q Q + K S + +YF V T YV++K++L++FP+
Sbjct: 75 QSMAFQLGQSAFSNFIGQQNFSQFQETVSKAASGSSSVSHYFQVSTSYVLQKILLVLFPF 134
Query: 159 THKE-WM-VKYDQDTP------VQPRYDINAPDLYIPSMGYVTYVLLAGFMLGLQHRFSP 210
+K W + Q + + P+ DIN+PD+YIP MG VTY+L+ GL F+P
Sbjct: 135 MNKNNWQRIPESQSSGAGTVSFMPPKDDINSPDMYIPVMGLVTYILIWNTQQGLSGSFNP 194
Query: 211 EQIGIQASSALAYIIFEMXXXXXXXXXXXXXXXXKT--LDLLAYSGYKYTVMISSLLAGL 268
E + + SS +A++ ++ T +LL Y GYK+ + L
Sbjct: 195 ENLYYKLSSTVAFLALDLIILKLGLYLLVSTNSPTTSITELLCYVGYKFVPLTLVLFVPA 254
Query: 269 LAGRTGYYCGLL---YSSCALSYFLVKTLRLQLLS 300
L +Y L+ Y A FL++ ++ + S
Sbjct: 255 LP----FYLSLILKVYLFIAFGVFLLRAVKFNMFS 285
>UniRef50_Q01E40 Cluster: Predicted membrane protein; n=2;
Ostreococcus|Rep: Predicted membrane protein -
Ostreococcus tauri
Length = 499
Score = 80.6 bits (190), Expect = 6e-14
Identities = 58/204 (28%), Positives = 88/204 (43%), Gaps = 11/204 (5%)
Query: 111 YGNQLAAQGKEAVQRELHKFVPVSRLRYYFAVDTRYVIRKLMLIVFPYTHKEWMVKYDQD 170
YG + G V K+ + +R YF V YV KL L++ P+ HK + +
Sbjct: 119 YGGKFLNDGASFVSSNYAKYFSTASMRAYFDVTESYVFHKLRLLLCPFLHKGSWARLPES 178
Query: 171 TP-----VQPRYDINAPDLYIPSMGYVTYVLLAGFMLGLQHR---FSPEQIGIQA-SSAL 221
PR DINAPDLYIP M + TYVL A + F+PE + A S L
Sbjct: 179 VAGGTAYKPPRNDINAPDLYIPLMAFWTYVLTASIREVFSSKSGAFTPEALATHAWWSGL 238
Query: 222 AYIIFE--MXXXXXXXXXXXXXXXXKTLDLLAYSGYKYTVMISSLLAGLLAGRTGYYCGL 279
+ + + LD+ AY GY + +L++ +G Y+ L
Sbjct: 239 LWSVESAFIWIALRTASTSNHIVSAPMLDIAAYVGYSFVYGSVTLMSKFSSGSLIYWLFL 298
Query: 280 LYSSCALSYFLVKTLRLQLLSGSQ 303
+S+ + F+ KTL+ + S S+
Sbjct: 299 SWSAVCNAVFMAKTLKKIIFSESR 322
>UniRef50_Q5CS10 Cluster: Protein with 5 transmembrane domains; n=2;
Cryptosporidium|Rep: Protein with 5 transmembrane
domains - Cryptosporidium parvum Iowa II
Length = 410
Score = 78.2 bits (184), Expect = 3e-13
Identities = 57/186 (30%), Positives = 94/186 (50%), Gaps = 19/186 (10%)
Query: 133 VSRLRYYFAVDTRYVIRKLMLIVFPYT-----HKEWMVKYDQDTPVQPRYD---INAP-- 182
++ LR +FAV YVI+K++LI+ PY +K Y+ T + + +N P
Sbjct: 193 IASLRSHFAVSHEYVIKKILLIICPYITFFTQNKRKSFSYENHTSISSNANDGGVNLPTL 252
Query: 183 --DLYIPSMGYVTYVLLAGFMLGLQHRFSPEQIGIQASSALAYIIFEMXXXXXXXXXXXX 240
DLYIP MG++TY+L G + G+ +F+P+ +G A+ ++ +I E+
Sbjct: 253 FSDLYIPLMGFITYILADGVINGVFSQFNPQMLGSTATFSIVLLITEI-ILFQLVAYIFA 311
Query: 241 XXXXKTLDLLAYSGYKYTVMISSLLAGLLAG--RTGYYCGL-LY---SSCALSYFLVKTL 294
TLDL++ GYKYT ++ A L G +T + L +Y SS L Y ++K +
Sbjct: 312 ARVLSTLDLISTLGYKYTSIVLCDFALLSTGGIKTYLFWALFIYFSISSSLLVYMMLKVI 371
Query: 295 RLQLLS 300
+ S
Sbjct: 372 STRSFS 377
>UniRef50_Q23K44 Cluster: Hrf1 family protein; n=1; Tetrahymena
thermophila SB210|Rep: Hrf1 family protein - Tetrahymena
thermophila SB210
Length = 283
Score = 77.4 bits (182), Expect = 6e-13
Identities = 54/204 (26%), Positives = 97/204 (47%), Gaps = 11/204 (5%)
Query: 111 YGNQLAAQGKEAVQRELHKFVPVSRLRYYFAVDTRYVIRKLMLIVFPYTHK-EWMVKYDQ 169
+G QL K++ S +R +F +D Y++RK+ LI+FP+ + EW VK ++
Sbjct: 47 FGPQLIPDNIIPTGNFAEKWIFNSYVRSFFDIDNMYILRKMKLILFPFLQRGEWEVKVNE 106
Query: 170 DTP-------VQPRYDINAPDLYIPSMGYVTYVLLAGFMLGLQHRFSPEQIGIQASSALA 222
+ P+ + ++PDLY+P MG +T+VL++ +G+ F PE I S L
Sbjct: 107 YASSSQEQNFISPKDNPHSPDLYLPLMGLITFVLVSCLSVGIGDNFQPEIIQRNTSYCL- 165
Query: 223 YIIFEMXXXXXXXXXXXXXXXXKTLDLLAYSGYKYTVMISSLLAGLLAGRTGYYCGLLYS 282
+I F L++L++ Y++ + L+ L G + ++Y
Sbjct: 166 FITFIEIYLYKFLFFLVGIKNIGILNMLSHLSYRFLSLTCILICNLSFGGWFTFILMVYL 225
Query: 283 SCALSYFLVKTLR--LQLLSGSQG 304
+F+ KTL+ +Q +S S G
Sbjct: 226 LTTSVFFIFKTLKRYIQTMSDSFG 249
>UniRef50_A0C565 Cluster: Chromosome undetermined scaffold_15, whole
genome shotgun sequence; n=1; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_15,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 267
Score = 74.1 bits (174), Expect = 5e-12
Identities = 48/181 (26%), Positives = 85/181 (46%), Gaps = 4/181 (2%)
Query: 134 SRLRYYFAVDTRYVIRKLMLIVFPYTHKEWMVKYDQDTPVQPRYDINAPDLYIPSMGYVT 193
S+ R+YF VD YV++K ++ + PY ++ + + P +++APDLY+P M VT
Sbjct: 65 SQYRFYFDVDNMYVVKKSIMTLAPYLYRGNWTLNSEFQAISPTENVHAPDLYLPLMSLVT 124
Query: 194 YVLLAGFMLGLQHR--FSPEQIGIQASSALAYIIFEMXXXXXXXXXXXXXXXXKTLDLLA 251
+VLL LG+ + FSP I + E+ T+DL++
Sbjct: 125 FVLLRCLSLGINDKTQFSPGYIVDSFWKCFVISLLEV-IIIKIVFCFLDGIRVNTVDLVS 183
Query: 252 YSGYKYTVMISSLLAGLLAGRTGYYCGLLYSSCALSYFLVKTLRLQLLSGSQGP-EQPSY 310
+ Y+Y + + ++ +L + G +Y + F+ KTL+ S +Q E S+
Sbjct: 184 HLNYRYCSLCALMVFNILTNGIFSFVGTIYVLICQAIFIYKTLQRYSPSHNQSALELSSF 243
Query: 311 G 311
G
Sbjct: 244 G 244
>UniRef50_Q4GZ21 Cluster: Putative uncharacterized protein; n=1;
Trypanosoma brucei|Rep: Putative uncharacterized protein
- Trypanosoma brucei
Length = 300
Score = 64.9 bits (151), Expect = 3e-09
Identities = 46/125 (36%), Positives = 64/125 (51%), Gaps = 5/125 (4%)
Query: 175 PRYDINAPDLYIPSMGYVTYVLLAGFMLGLQHRFSP-EQIGIQASSALAYIIFEMXXXXX 233
P D++A DLY+P MG +TYV+L+GF+ GL H P EQ+ A S L ++ E+
Sbjct: 138 PTEDVHAFDLYVPLMGAITYVILSGFLYGLHHNSVPNEQLVGPAWSLLFWLQVEVFILKL 197
Query: 234 XXXXXXXXXXXKTLDLLAYSGYKY-TVMISSLLAGL--LAGRTGY-YCGLLYSSCALSYF 289
L+L A YKY T+ ++ LL + L G T Y + LLY A + F
Sbjct: 198 VCHLLRTTPASTILELTALCSYKYITICLAVLLREVLRLEGETVYTWAILLYVVLANATF 257
Query: 290 LVKTL 294
KTL
Sbjct: 258 AAKTL 262
Score = 37.5 bits (83), Expect = 0.56
Identities = 20/59 (33%), Positives = 33/59 (55%), Gaps = 3/59 (5%)
Query: 100 QQPVVQDMAIQYGNQLAAQGKEAVQRELHKFVPVSRLRYYFAVDTRYVIRKLMLIVFPY 158
QQ + ++ +QYG + +E Q + V+ R YF VD +YV RKL +++FP+
Sbjct: 32 QQNAMLEIGMQYGQNVL---QEKSQGFMSYISVVTGFRRYFRVDNQYVKRKLTMLLFPF 87
>UniRef50_Q4CZI1 Cluster: Putative uncharacterized protein; n=2;
Trypanosoma cruzi|Rep: Putative uncharacterized protein
- Trypanosoma cruzi
Length = 292
Score = 60.5 bits (140), Expect = 7e-08
Identities = 44/133 (33%), Positives = 66/133 (49%), Gaps = 5/133 (3%)
Query: 167 YDQDTPVQPRYDINAPDLYIPSMGYVTYVLLAGFMLGL-QHRFSPEQIGIQASSALAYII 225
Y + P ++ A DLY+P MG VTY++L+GF+ GL HR + E + AS+ + + +
Sbjct: 122 YPTTSTALPTNNVYALDLYLPLMGAVTYIILSGFVHGLHHHRVTNEDLLGFASALVFWFL 181
Query: 226 FEMXXXXXXXXXXXXXXXXKTLDLLAYSGYKY-TVMISSLLAGLLAGRT-GYYCGLL--Y 281
E+ L+L+A +GYKY TV I L LL + YY G + Y
Sbjct: 182 GEVFVLKMVSYILRIVPDINVLELMALTGYKYLTVSIIVFLRELLQFESDAYYIGTMATY 241
Query: 282 SSCALSYFLVKTL 294
A F+VK +
Sbjct: 242 ILFANGVFVVKNV 254
Score = 43.2 bits (97), Expect = 0.011
Identities = 20/59 (33%), Positives = 34/59 (57%), Gaps = 3/59 (5%)
Query: 101 QPVVQDMAIQYGNQLAAQGKEAVQRELHKFVPVSRLRYYFAVDTRYVIRKLMLIVFPYT 159
Q ++ M +QYG + G++ R + +S + YF VD +YV RKL +++FP+T
Sbjct: 27 QEIMLQMGLQYGQSMLQGGEQKFMRHMPV---ISNIYRYFRVDNQYVKRKLGILLFPFT 82
>UniRef50_Q4QCT2 Cluster: Putative uncharacterized protein; n=2;
Leishmania|Rep: Putative uncharacterized protein -
Leishmania major
Length = 320
Score = 58.8 bits (136), Expect = 2e-07
Identities = 43/143 (30%), Positives = 64/143 (44%), Gaps = 6/143 (4%)
Query: 172 PVQPRYDINAPDLYIPSMGYVTYVLLAGFMLGLQHRFSPEQIGIQASSALA---YIIFEM 228
P+ P D+ A DLYIP M +TYV+LA ++ G S S+A + E+
Sbjct: 155 PLLPLNDVFASDLYIPLMSVITYVVLAAYIFGANSPTSSVTAASLISTAWVIGIWFFLEV 214
Query: 229 XXXXXXXXXXXXXXXXKTLDLLAYSGYKYTVMISSLLAGLLAGRTGYYCGL--LYSSCAL 286
L+LLA GYKY ++ LLA + Y GL LY+ A
Sbjct: 215 VVLKGVAYALLVAPNPPLLELLALCGYKYVLLCLGLLASQCLPPSRLYSGLFMLYAVLAH 274
Query: 287 SYFLVKTLRLQLL-SGSQGPEQP 308
S+F V+ +Q + + + P +P
Sbjct: 275 SFFTVRVSGMQYMRNDGRVPPRP 297
Score = 39.9 bits (89), Expect = 0.10
Identities = 23/76 (30%), Positives = 39/76 (51%), Gaps = 6/76 (7%)
Query: 107 MAIQYGNQLAAQGKEAVQRELHKFVPVSR-LRYYFAVDTRYVIRKLMLIVFPYTHKEWMV 165
M + YG + + + + L ++P R +R YFAVD YV RKL+++ P+ +
Sbjct: 39 MGLSYGQNILQKHVQQGEAGLAYYMPFIRAIRNYFAVDNTYVKRKLIMLTMPF-----LT 93
Query: 166 KYDQDTPVQPRYDINA 181
KY + +PV D +
Sbjct: 94 KYVRKSPVGGESDFGS 109
>UniRef50_UPI0000498A87 Cluster: hypothetical protein 3.t00011; n=1;
Entamoeba histolytica HM-1:IMSS|Rep: hypothetical
protein 3.t00011 - Entamoeba histolytica HM-1:IMSS
Length = 231
Score = 50.4 bits (115), Expect = 7e-05
Identities = 30/103 (29%), Positives = 49/103 (47%), Gaps = 5/103 (4%)
Query: 130 FVPVSRLRYYFAVDTRYVIRKLMLIVFPYTH----KEWMVKYDQDTPVQPR-YDINAPDL 184
++ RYYF V VI +++++FP K +K D + P Y A +L
Sbjct: 26 YIEFDEWRYYFDVTVHSVIEHIIMVLFPLLFPTPWKPKCIKSDVNEFFLPSSYQRYATEL 85
Query: 185 YIPSMGYVTYVLLAGFMLGLQHRFSPEQIGIQASSALAYIIFE 227
Y P + T+V+ G G+ ++FSPE +G L +I F+
Sbjct: 86 YTPLVSGFTFVIFVGLWQGITNQFSPEHLGTLVLVLLIFINFQ 128
>UniRef50_A2EGH1 Cluster: Putative uncharacterized protein; n=3;
Trichomonas vaginalis G3|Rep: Putative uncharacterized
protein - Trichomonas vaginalis G3
Length = 291
Score = 49.2 bits (112), Expect = 2e-04
Identities = 45/216 (20%), Positives = 91/216 (42%), Gaps = 7/216 (3%)
Query: 92 PAQISSMLQQPVVQDMAIQYGNQLAAQGKEAVQRELHKFVPVSR--LRYYFAVDTRYVIR 149
P I + V++ ++ L + K ++L + + S+ + YFAV +I
Sbjct: 25 PQTIPQFMSPEVLKMASVMTNAYLPDEIKNLDPKQLEQRIAQSQALIPSYFAVTPNSIIH 84
Query: 150 KLMLIVFPYTHKEWMVKYDQDTPVQPRYDINAPDLYIPSMGYVTYVLLAGFMLGLQHRFS 209
++ + P+ K+W + P + NAP+LY P + LL+ + G+Q++FS
Sbjct: 85 RIKNLACPFFVKQWSRSVPEGQQFIPINNPNAPELYTPITFCFLFFLLSALISGVQNKFS 144
Query: 210 PEQIGIQASSALAYIIFEMXXXXXXXXXXXXXXXXKTLDLLA-YSGYKYTVMISSLLAGL 268
+ + +Q I E+ L L+A +S + + + +L +
Sbjct: 145 MDYLYLQIIKFGLIIFVEVAICKTLFKNVGVQGSYPILSLIADFSCLSFYMCVVTLFSWN 204
Query: 269 LAGRTGYYCGLLYSSCALSYFLVKTLRL-QLLSGSQ 303
A Y+ LY + + + ++TL Q ++G Q
Sbjct: 205 CA---LYWISFLYCAFSAMIWTLRTLNSEQCMAGRQ 237
>UniRef50_Q7QSE8 Cluster: GLP_426_11085_11759; n=1; Giardia lamblia
ATCC 50803|Rep: GLP_426_11085_11759 - Giardia lamblia
ATCC 50803
Length = 224
Score = 38.3 bits (85), Expect = 0.32
Identities = 41/157 (26%), Positives = 69/157 (43%), Gaps = 18/157 (11%)
Query: 136 LRYYFAVDTRYVIRKLMLIVFPYTHKEWMVKYDQDTPVQPRYDINAPDLYIPSMGYVTYV 195
LR YF + T +++L L++FPY V++++ + P DLY+P +G+ Y
Sbjct: 38 LRPYFNITTSTFLKRLGLLMFPY------VRFNERSTNLPL------DLYMPLVGHAAYG 85
Query: 196 LLAGFMLGLQ-HRFSPEQIGIQASSALAYIIFEMXXXXXXXXXXXXXXXXKTLDLLAYSG 254
++ F L F P + LA II + L+L+A S
Sbjct: 86 VIMAFTEILSTGGFRPAL--FTRNVILAAIIVGLEALVVIGISSSVSLKVPKLELIALSC 143
Query: 255 YK-YTVMISSLLAGLLAGRTGYYCGLLYSSCALSYFL 290
YK + +I+S ++ + R YY L++ AL L
Sbjct: 144 YKVFVSVIASFIS--IFNRVLYYIALVWLGVALGVHL 178
>UniRef50_A5K5P4 Cluster: Putative uncharacterized protein; n=1;
Plasmodium vivax|Rep: Putative uncharacterized protein -
Plasmodium vivax
Length = 730
Score = 38.3 bits (85), Expect = 0.32
Identities = 29/109 (26%), Positives = 46/109 (42%), Gaps = 10/109 (9%)
Query: 183 DLYIPSMGYVTYVLLAGFMLGLQHR---FSPEQIGIQASSALAYIIFEMXXXXXXXXXXX 239
DLYIP M +TY+LL + Q F+P+ + S + FE
Sbjct: 510 DLYIPLMSSITYILLYTLTVTAQKNNFVFNPDNLFSITSYVFLLLFFETAIIKFLFLLTC 569
Query: 240 XXXXXKTLDLLAYSGYKYTVMISSLLAGLLAGRTGYYCGLLYSSCALSY 288
L +L++ YK+ + L GL+ + +Y LL+ +CA Y
Sbjct: 570 RDINLSFLHILSFISYKFVI-----LCGLIVTKFFFY--LLHFTCASLY 611
>UniRef50_A6TTN6 Cluster: Mur ligase, middle domain protein; n=1;
Alkaliphilus metalliredigens QYMF|Rep: Mur ligase,
middle domain protein - Alkaliphilus metalliredigens
QYMF
Length = 403
Score = 33.9 bits (74), Expect = 6.9
Identities = 20/54 (37%), Positives = 30/54 (55%), Gaps = 4/54 (7%)
Query: 165 VKY-DQDTPVQPRYDINAPDLYIPSMGYVTYVLLAGFMLGLQH--RFSPEQIGI 215
+KY DQ Q + D D +IP+ GY V+ A F +G+ H +FSP +I +
Sbjct: 196 IKYVDQGMTFQVKLDDQLEDFFIPTFGYHN-VINALFAIGVSHHQKFSPSEIKV 248
>UniRef50_A0QQR3 Cluster: NADH ubiquinone oxidoreductase subunit 5;
n=3; Actinomycetales|Rep: NADH ubiquinone oxidoreductase
subunit 5 - Mycobacterium smegmatis (strain ATCC 700084
/ mc(2)155)
Length = 993
Score = 33.9 bits (74), Expect = 6.9
Identities = 21/44 (47%), Positives = 27/44 (61%), Gaps = 5/44 (11%)
Query: 258 TVMISSLLAGLLAGRTGYYCGLLYS-----SCALSYFLVKTLRL 296
TVM + L A LL G TGY CG +++ AL+ FLV+TL L
Sbjct: 650 TVMRNRLAAVLLVGVTGYGCGAIFAFHGAPDLALTQFLVETLVL 693
Database: uniref50
Posted date: Oct 5, 2007 11:19 AM
Number of letters in database: 575,637,011
Number of sequences in database: 1,657,284
Lambda K H
0.321 0.137 0.421
Gapped
Lambda K H
0.279 0.0580 0.190
Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 398,232,450
Number of Sequences: 1657284
Number of extensions: 16138416
Number of successful extensions: 31528
Number of sequences better than 10.0: 37
Number of HSP's better than 10.0 without gapping: 33
Number of HSP's successfully gapped in prelim test: 4
Number of HSP's that attempted gapping in prelim test: 31437
Number of HSP's gapped (non-prelim): 62
length of query: 372
length of database: 575,637,011
effective HSP length: 102
effective length of query: 270
effective length of database: 406,594,043
effective search space: 109780391610
effective search space used: 109780391610
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.9 bits)
S2: 73 (33.5 bits)
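
Note on the figures above: the effective lengths and the Expect values in this report can be cross-checked with a few lines of arithmetic. The sketch below is not part of the BLAST output. It assumes the usual length correction (the effective HSP length is subtracted from the query once and from the database once per sequence) and the standard relation E = search space * 2^(-bit score). The function names are illustrative only. Because the report prints bit scores rounded to one decimal, recomputed Expect values agree with the printed ones only approximately.

```python
def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Effective search space from the lengths printed in the report footer."""
    eff_query = query_len - hsp_len            # 372 - 102 = 270
    eff_db = db_letters - db_seqs * hsp_len    # subtract hsp_len once per database sequence
    return eff_query * eff_db


def expect_from_bits(bit_score, search_space):
    """Expect value implied by a bit score: E = search_space * 2^(-bit_score)."""
    return search_space * 2.0 ** (-bit_score)


# Values taken from this report.
space = effective_search_space(372, 575_637_011, 1_657_284, 102)
print(space)                          # 109780391610, the "effective search space" line

# Bit scores of two hits in this report (77.4 bits and 33.9 bits).
print(expect_from_bits(77.4, space))
print(expect_from_bits(33.9, space))
```

The first printed value reproduces the "effective search space" line exactly; the two Expect values land close to the 6e-13 and 6.9 reported for those hits, with the small difference coming from the rounded bit scores.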