BLASTP 2.2.12 [Aug-07-2005]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= BGIBMGA001438-TA|BGIBMGA001438-PA|IPR009730|Micro-fibrillar-
associated 1, C-terminal
(441 letters)
Database: uniref50
1,657,284 sequences; 575,637,011 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
UniRef50_Q9W062 Cluster: CG1017-PA; n=5; Endopterygota|Rep: CG10... 333 5e-90
UniRef50_P55081 Cluster: Microfibrillar-associated protein 1; n=... 309 9e-83
UniRef50_Q16U12 Cluster: Microfibril-associated protein; n=4; Bi... 277 4e-73
UniRef50_Q93712 Cluster: Putative uncharacterized protein; n=2; ... 206 1e-51
UniRef50_Q4RGL1 Cluster: Chromosome undetermined SCAF15099, whol... 192 2e-47
UniRef50_Q9FKN6 Cluster: Similarity to microfibrillar-associated... 167 7e-40
UniRef50_A7ATD4 Cluster: Micro-fibrillar-associated protein 1 C-... 134 6e-30
UniRef50_Q4U9V1 Cluster: Microfibrillar-associated protein, puta... 124 5e-27
UniRef50_A4RS12 Cluster: Predicted protein; n=2; Ostreococcus|Re... 124 6e-27
UniRef50_Q54SU3 Cluster: Putative uncharacterized protein; n=1; ... 116 2e-24
UniRef50_Q8IE75 Cluster: Microfibril-associated protein homologu... 109 1e-22
UniRef50_A5K2Z5 Cluster: Micro-fibrillar-associated protein 1 C-... 108 2e-22
UniRef50_UPI00006CD032 Cluster: Micro-fibrillar-associated prote... 102 2e-20
UniRef50_A0EFW2 Cluster: Chromosome undetermined scaffold_94, wh... 99 1e-19
UniRef50_Q9P7H6 Cluster: Microfibrillar-associated protein famil... 95 2e-18
UniRef50_Q7S7V7 Cluster: Predicted protein; n=7; Pezizomycotina|... 91 5e-17
UniRef50_Q4P301 Cluster: Putative uncharacterized protein; n=1; ... 90 9e-17
UniRef50_A1DDP7 Cluster: Microfibrillar-associated protein MfaP1... 84 6e-15
UniRef50_Q0TWF3 Cluster: Putative uncharacterized protein; n=1; ... 83 1e-14
UniRef50_A2ELS5 Cluster: Putative uncharacterized protein; n=1; ... 77 7e-13
UniRef50_UPI0000499156 Cluster: microfibril-associated protein; ... 76 2e-12
UniRef50_Q5C2C1 Cluster: SJCHGC04323 protein; n=1; Schistosoma j... 61 5e-08
UniRef50_UPI000155C08B Cluster: PREDICTED: similar to Microfibri... 57 8e-07
UniRef50_Q6CA21 Cluster: Similar to tr|Q8X0K0 Neurospora crassa ... 57 1e-06
UniRef50_UPI0000DC125F Cluster: UPI0000DC125F related cluster; n... 56 2e-06
UniRef50_Q5KAL3 Cluster: Putative uncharacterized protein; n=1; ... 55 4e-06
UniRef50_Q0E1X2 Cluster: Os02g0294000 protein; n=5; Oryza sativa... 52 2e-05
UniRef50_A6S856 Cluster: Putative uncharacterized protein; n=1; ... 38 0.53
UniRef50_Q1YGP6 Cluster: Putative uncharacterized protein; n=1; ... 36 1.6
UniRef50_Q4WBH2 Cluster: C6 transcription factor, putative; n=1;... 36 2.8
UniRef50_Q5BX93 Cluster: SJCHGC03879 protein; n=1; Schistosoma j... 35 3.7
UniRef50_UPI0000EBF1F9 Cluster: PREDICTED: similar to microfibri... 34 6.5
UniRef50_UPI0000D56E42 Cluster: PREDICTED: similar to CG32580-PA... 34 8.6
UniRef50_UPI00006CD58E Cluster: hypothetical protein TTHERM_0050... 34 8.6
UniRef50_A2RV13 Cluster: Zgc:85787 protein; n=3; Danio rerio|Rep... 34 8.6
>UniRef50_Q9W062 Cluster: CG1017-PA; n=5; Endopterygota|Rep:
CG1017-PA - Drosophila melanogaster (Fruit fly)
Length = 478
Score = 333 bits (819), Expect = 5e-90
Identities = 198/467 (42%), Positives = 249/467 (53%), Gaps = 34/467 (7%)
Query: 6 AQPTGIQSTAGAIPIRNEKGEISMQKVKVQRYISGKKPDYAQGMXXXXXXXXXXFIE-QQ 64
A +GIQSTAGAIP+RNEKGE+SMQKVKVQRYISGK+PDYA+ FI+ ++
Sbjct: 8 AAASGIQSTAGAIPMRNEKGELSMQKVKVQRYISGKRPDYARADSSSEESDDDDFIDTRK 67
Query: 65 RPERKQVLPQIITRKE-------------EHHSDSEKEVDDPRLRRLR----NIAQSPPR 107
R ER + + E + + EVDDPRLRRLR ++
Sbjct: 68 RLERHKAERHKLELSRQGGSAEGEERAAGEGQEEDDAEVDDPRLRRLRQRPVDMEDMERE 127
Query: 108 RAE-----HKPEII--DAEPEAESEIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 160
R E H+PEI+ D+E E E E
Sbjct: 128 RRERHRHIHEPEIMESDSEDEEEDEGAQGAIQRGTNKITLASESDTDAELSDTELENRRT 187
Query: 161 XXXXXVLGRXXXXXXXXXX----XXXSGSSDTEY---TDSEEDTGPRVKPVFVRASERMT 213
+L + S S +EY T+SEED PR+KP+FVR +R T
Sbjct: 188 KLRSRMLQQQREEEVLQKEDEKQSESSESESSEYEEETESEEDNEPRLKPLFVRKRDRAT 247
Query: 214 VAERERKMXXXXXXXXXXXXXXXXXXXXXXXXXXXTIRSEQRGAQGEQKEGNINDVCTDD 273
+ E+ER+ +++ + + E E I DVCTDD
Sbjct: 248 IQEKEREAQKQKQLEAEAKRAAKERRRATLRMVEESVKKDLEKTKPETNEACIEDVCTDD 307
Query: 274 ENDELEYEAWKLXXXXXXXXXXXXXXXXXXXXLTLERLRNMTEDERRLEQRINPKVVTNK 333
ENDE+EYEAWKL L ++R+RNMTE+ERR E R NPKVVTNK
Sbjct: 308 ENDEVEYEAWKLRELKRMKRDREERDNVEREKLDIDRMRNMTEEERRQELRQNPKVVTNK 367
Query: 334 AVKGKYKFLQKYYHRGAFYLDKEEDVFKQDFSGPTLDDHFDKTVLPKVMQVKKFGRSGRT 393
A KGKYKFLQKYYHRGAFYLD+E DV K+DF+ TL+DHFDKT+LPKVMQVK FGR GRT
Sbjct: 368 ATKGKYKFLQKYYHRGAFYLDEENDVLKRDFAQATLEDHFDKTILPKVMQVKNFGRCGRT 427
Query: 394 KYTHLVDQDTTEFDSAWSNETSA-ARLTN-FRGGMKQVFEKPSAERK 438
KYTHLVDQDTT+FDS W E+S+ + N GGM+Q F+KP+ ++
Sbjct: 428 KYTHLVDQDTTKFDSPWYAESSSNIKFHNEHAGGMRQQFDKPTGSKR 474
>UniRef50_P55081 Cluster: Microfibrillar-associated protein 1; n=25;
Eumetazoa|Rep: Microfibrillar-associated protein 1 -
Homo sapiens (Human)
Length = 439
Score = 309 bits (759), Expect = 9e-83
Identities = 182/447 (40%), Positives = 241/447 (53%), Gaps = 25/447 (5%)
Query: 2 NVLPAQPTGIQSTAGAIPIRNEKGEISMQKVKVQRYISGKKPDYAQGMXXXXXXXXXXFI 61
+ L QP IQSTAGA+P+RNEKGEISM+KVKV+RY+SGK+PDYA FI
Sbjct: 5 SALMKQPP-IQSTAGAVPVRNEKGEISMEKVKVKRYVSGKRPDYAPMESSDEEDEEFQFI 63
Query: 62 EQQRPERKQVLPQIITRKEEHHSDSEKEVDDPRLRRLRN-IAQSPPRR-AEHK----PEI 115
++ + + + EE DS DPRLRRL+N I++ R A H+ PE+
Sbjct: 64 KKAKEQEAE--------PEEQEEDSSS---DPRLRRLQNRISEDVEERLARHRKIVEPEV 112
Query: 116 I-DAEPEAESEIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVLGRXXXXX 174
+ +++ E E + +
Sbjct: 113 VGESDSEVEGDAWRMEREDSSEEEEEEIDDEEIERRRGMMRQRAQERKNEEMEVMEVEDE 172
Query: 175 XXXXXXXXSGSSDTEYTDSEEDTGPRVKPVFVRASERMTVAERERKMXXXXXXXXXXXXX 234
S S EYTDSE++ PR+KPVF+R +R+TV ERE +
Sbjct: 173 GRSGEESESESEYEEYTDSEDEMEPRLKPVFIRKKDRVTVQEREAEALKQKELEQEAKRM 232
Query: 235 XXXXXXXXXXXXXXTIRSEQRGAQGEQKEGNINDVCTDDENDELEYEAWKLXXXXXXXXX 294
E + ++ ++ + TDDENDE EYEAWK+
Sbjct: 233 AEERRQYTLQIVGEETPKELE--ENKRSLAALDALNTDDENDEEEYEAWKVRELKRIKRD 290
Query: 295 XXXXXXXXXXXLTLERLRNMTEDERRLEQRINPKVVTNKAVKGKYKFLQKYYHRGAFYLD 354
+ER+RN+TE+ERR E R N KV+TNKAVKGKYKFLQKYYHRGAF++D
Sbjct: 291 REDREALEKEKAEIERMRNLTEEERRAELRANGKVITNKAVKGKYKFLQKYYHRGAFFMD 350
Query: 355 KEEDVFKQDFSGPTLDDHFDKTVLPKVMQVKKFGRSGRTKYTHLVDQDTTEFDSAWSNET 414
++E+V+K+DFS PTL+DHF+KT+LPKVMQVK FGRSGRTKYTHLVDQDTT FDSAW E
Sbjct: 351 EDEEVYKRDFSAPTLEDHFNKTILPKVMQVKNFGRSGRTKYTHLVDQDTTSFDSAWGQE- 409
Query: 415 SAARLTNFR---GGMKQVFEKPSAERK 438
SA F+ G++ VFE+PSA+++
Sbjct: 410 SAQNTKFFKQKAAGVRDVFERPSAKKR 436
>UniRef50_Q16U12 Cluster: Microfibril-associated protein; n=4;
Bilateria|Rep: Microfibril-associated protein - Aedes
aegypti (Yellowfever mosquito)
Length = 492
Score = 277 bits (679), Expect = 4e-73
Identities = 138/255 (54%), Positives = 170/255 (66%), Gaps = 3/255 (1%)
Query: 183 SGSSDTEYTDSEEDTGPRVKPVFVRASERMTVAERERKMXXXXXXXXXXXXXXXXXXXXX 242
S SS+ E T+SEE+ PR+KP+FVR +R TV E+ER+
Sbjct: 232 SESSEYEETESEEENEPRLKPLFVRKKDRTTVIEKEREANKQKQLEYESKKAAKERRRQT 291
Query: 243 XXXXXXTIRSEQRGAQGEQKEGNINDVCTDDENDELEYEAWKLXXXXXXXXXXXXXXXXX 302
+I+ + A+ + E N+NDV TDDENDE+EYEAWKL
Sbjct: 292 LKLVEDSIKKDMEKAKVDN-EPNLNDVNTDDENDEVEYEAWKLRELKRIKRDREEKEALE 350
Query: 303 XXXLTLERLRNMTEDERRLEQRINPKVVTNKAVKGKYKFLQKYYHRGAFYLDKEEDVFKQ 362
L +ER+RNMTEDERR +QR+NPK VTNK VKGKYKFLQKYYHRGAFYLD+E+ V+KQ
Sbjct: 351 KEKLEIERIRNMTEDERRQDQRLNPKQVTNKTVKGKYKFLQKYYHRGAFYLDQEDQVYKQ 410
Query: 363 DFSGPTLDDHFDKTVLPKVMQVKKFGRSGRTKYTHLVDQDTTEFDSAW-SNETSAARLTN 421
DFS PTL+DHFDKT+LPKVMQVK FGR GRTKYTHLVDQDTT+ +S W ++ + + N
Sbjct: 411 DFSAPTLEDHFDKTILPKVMQVKNFGRCGRTKYTHLVDQDTTKAESPWFADSANNTKFYN 470
Query: 422 FR-GGMKQVFEKPSA 435
R GGM+QVFEKPS+
Sbjct: 471 ERAGGMRQVFEKPSS 485
Score = 100 bits (240), Expect = 7e-20
Identities = 58/136 (42%), Positives = 76/136 (55%), Gaps = 14/136 (10%)
Query: 4 LPAQPT--GIQSTAGAIPIRNEKGEISMQKVKVQRYISGKKPDYAQGMXXXXXXXXXXFI 61
+ QPT GIQSTAGAIP+RN KGE+SMQKVKV RY+SGK+P+YAQ FI
Sbjct: 1 MSGQPTIYGIQSTAGAIPVRNPKGELSMQKVKVHRYVSGKRPEYAQHSSSEEESDEEDFI 60
Query: 62 EQQRPERKQVLPQIITRKE--EHHSDSEKEVDDPRLRRLRNIAQSPPRRAE--------- 110
+ +R + R+E E D +VDDPR+RRL+ I + E
Sbjct: 61 DNRRTAEESYRESRRRREETDEEEDDLPGDVDDPRIRRLQAIRAAEAEEIERERRERHRV 120
Query: 111 -HKPEIIDAEPEAESE 125
H+PE++ +E E E E
Sbjct: 121 IHEPELVQSEEEEEDE 136
>UniRef50_Q93712 Cluster: Putative uncharacterized protein; n=2;
Caenorhabditis|Rep: Putative uncharacterized protein -
Caenorhabditis elegans
Length = 466
Score = 206 bits (502), Expect = 1e-51
Identities = 98/259 (37%), Positives = 152/259 (58%), Gaps = 3/259 (1%)
Query: 185 SSDTEYTDSEEDTGPRVKPVFVRASERMTVAERERKMXXXXXXXXXXXXXXXXXXXXXXX 244
SS+ E +D ++D PR+KP+F R +R+T+ E E++
Sbjct: 208 SSEEEDSDEDDDPVPRLKPIFTRKKDRITLQEAEKEKEKEILKKIEDEKRAEERKRESAK 267
Query: 245 XXXXTIRSEQRGAQGEQKEG-NINDVCTDDENDELEYEAWKLXXXXXXXXXXXXXXXXXX 303
++ E+ + + ++ +++ V TDDE + + YEAWKL
Sbjct: 268 LVEKVLQEEEAAEKRKTEDRVDLSSVLTDDETENMAYEAWKLREMKRLKRNRDEREEAAR 327
Query: 304 XXLTLERLRNMTEDERRLEQRINPKVVTNKAVKGKYKFLQKYYHRGAFYLDKEEDVFKQD 363
L+++ M+E+ER R+NPKV+TNK KGKYKFLQKY+HRGAF+LD+E++V K++
Sbjct: 328 EKAELDKIHAMSEEERLKYLRLNPKVITNKQDKGKYKFLQKYFHRGAFFLDEEDEVLKRN 387
Query: 364 FSGPTLDDHFDKTVLPKVMQVKKFGRSGRTKYTHLVDQDTTEFDSAW--SNETSAARLTN 421
F+ T DD FDKT+LPKVMQVK FG++ RTKYTHL ++DTT+ W +N+ ++ T
Sbjct: 388 FAEATNDDQFDKTILPKVMQVKNFGKASRTKYTHLTEEDTTDHQGVWASTNQLNSQFSTK 447
Query: 422 FRGGMKQVFEKPSAERKHN 440
GG + VFE+P+ +++ N
Sbjct: 448 RAGGSRPVFERPATKKRKN 466
Score = 57.6 bits (133), Expect = 6e-07
Identities = 39/112 (34%), Positives = 53/112 (47%), Gaps = 15/112 (13%)
Query: 14 TAGAIPIRNEKGEISMQKVKVQRYISGKKPDYAQGMXXXXXXXXXXFIEQQRPERKQVLP 73
T GAIPI+NEKG+ MQKVKV RY++GK P+YA+ E R +
Sbjct: 25 TLGAIPIKNEKGQTVMQKVKVSRYVAGKAPEYARNYDSDSSESDR---ETDRDD------ 75
Query: 74 QIITRKEEHHSDSEKEVDDPRLRRLRNIAQSPPRRAEHKPEIIDAEPEAESE 125
R+ +S E D R RR + + RR KPE++ + SE
Sbjct: 76 ---DRRRRRRRESSDEEDRRRHRRHEDYGR---RRQVEKPEVLGKVEDESSE 121
>UniRef50_Q4RGL1 Cluster: Chromosome undetermined SCAF15099, whole
genome shotgun sequence; n=2; Euteleostomi|Rep:
Chromosome undetermined SCAF15099, whole genome shotgun
sequence - Tetraodon nigroviridis (Green puffer)
Length = 413
Score = 192 bits (467), Expect = 2e-47
Identities = 86/134 (64%), Positives = 110/134 (82%), Gaps = 4/134 (2%)
Query: 308 LERLRNMTEDERRLEQRINPKVVTNKAVKGKYKFLQKYYHRGAFYLDKEEDVFKQDFSGP 367
+E+ NMT++ERR E R + KV+TNK KGKYKFLQKYYHRGAF++D+EEDV+K+DFS P
Sbjct: 280 IEKFHNMTDEERRAELRNSGKVITNKGTKGKYKFLQKYYHRGAFFMDEEEDVYKRDFSAP 339
Query: 368 TLDDHFDKTVLPKVMQVKKFGRSGRTKYTHLVDQDTTEFDSAWSNETSAARLTNFR---G 424
TL+DHF+KT+LPKVMQVK FGRSGRTKYTHLVDQDTT FDSAW+ E SA F+
Sbjct: 340 TLEDHFNKTILPKVMQVKNFGRSGRTKYTHLVDQDTTSFDSAWAQE-SAQNSKFFKQKAA 398
Query: 425 GMKQVFEKPSAERK 438
G++ VF++P+ +++
Sbjct: 399 GVRDVFDRPTVKKR 412
Score = 83.8 bits (198), Expect = 8e-15
Identities = 48/115 (41%), Positives = 68/115 (59%), Gaps = 13/115 (11%)
Query: 11 IQSTAGAIPIRNEKGEISMQKVKVQRYISGKKPDYAQGMXXXXXXXXXXFIEQQRPERKQ 70
IQSTAGA+P+RNEKGE+SM+KVKV+RY+SGK+PDYA F++ + K+
Sbjct: 14 IQSTAGAVPVRNEKGELSMEKVKVKRYVSGKRPDYAPMQSSDEEDEDFQFVK----KGKE 69
Query: 71 VLPQIITRKEEHHSDSEKEVDDPRLRRLRN-IAQSPPRRAEHKPEIIDAEPEAES 124
V P++ E ++ DPRLRRL N +++ R +I + E AES
Sbjct: 70 VEPEV--------EQEEDDMSDPRLRRLLNRVSEDVEERLARHRQISEPEVVAES 116
>UniRef50_Q9FKN6 Cluster: Similarity to microfibrillar-associated
protein 1; n=5; Magnoliophyta|Rep: Similarity to
microfibrillar-associated protein 1 - Arabidopsis
thaliana (Mouse-ear cress)
Length = 435
Score = 167 bits (405), Expect = 7e-40
Identities = 102/267 (38%), Positives = 140/267 (52%), Gaps = 16/267 (5%)
Query: 187 DTEY-TDSEEDTG--PRVKPVFVRASERMTVAERERKMXXXXXXXXXXXXXXXXXXXXXX 243
++EY TDSE+D +KPVFV +ER T+AERER
Sbjct: 164 ESEYETDSEDDMPGIAMIKPVFVPKAERDTIAERERLEAEEEALEELAKRKLEQRKLETK 223
Query: 244 XXXXXTIRSEQRGAQGEQ-KEGNINDVCTDDE-NDELEYEAWKLXXXXXXXXXXXXXXXX 301
+R ++ + +E NI DV TDDE N+ EYE WK
Sbjct: 224 QIVVEEVRKDEEIRKNILLEEANIGDVETDDELNEAEEYEVWKTREIGRIKRERDAREAM 283
Query: 302 XXXXLTLERLRNMTEDERRLEQRINPKVVTNKAVKGKYKFLQKYYHRGAFYLDKEED--- 358
+E+LRNMTE ERR +R NPK ++ + K K+ F+QKYYH+GAF+ +D
Sbjct: 284 LREREEIEKLRNMTEQERRDWERKNPKPLSAQPKK-KWNFMQKYYHKGAFFQADPDDEAG 342
Query: 359 ------VFKQDFSGPTLDDHFDKTVLPKVMQVKKFGRSGRTKYTHLVDQDTTEFDSAW-S 411
+F++DFS PT +D DK++LPKVMQVK FGRSGRTK+THLV++DTT++ + W S
Sbjct: 343 SAGTDGIFQRDFSAPTGEDRLDKSILPKVMQVKHFGRSGRTKWTHLVNEDTTDWSNPWTS 402
Query: 412 NETSAARLTNFRGGMKQVFEKPSAERK 438
N+ + GM KP +K
Sbjct: 403 NDPLREKYNKKMAGMDAPIAKPKGSKK 429
>UniRef50_A7ATD4 Cluster: Micro-fibrillar-associated protein 1
C-terminus containing protein; n=1; Babesia bovis|Rep:
Micro-fibrillar-associated protein 1 C-terminus
containing protein - Babesia bovis
Length = 437
Score = 134 bits (323), Expect = 6e-30
Identities = 85/256 (33%), Positives = 123/256 (48%), Gaps = 13/256 (5%)
Query: 185 SSDTEYTDSEEDTGPRVKPVFVRASERMTVAERERKMXXXXXXXXXXXXXXXXXXXXXXX 244
+ ++E S + KPVFV R+TV E++
Sbjct: 182 TEESEPVTSSQTINALAKPVFVPKKSRLTVKEKKEIEREEQKKIEAEQKRLEERRKQSKE 241
Query: 245 XXXXTIRSEQRGAQGEQKEGNINDVCTDDENDELEYEAWKLXXXXXXXXXXXXXXXXXXX 304
T+ +E ++ E +N V DE E EYE WK+
Sbjct: 242 LVIQTLVAEN---MHQEIENEVNCVDDKDELTEEEYELWKIRELKRIIRDRNERNAHERL 298
Query: 305 XLTLERLRNMTEDERRLEQRINPKVVTNKAVKGKYKFLQKYYHRGAFYLDKEED----VF 360
+ER R MTE+ER + + A K K KFLQKYYH+GAF++DK ED ++
Sbjct: 299 AAEVERRREMTEEERLEDDERIRQEKGPIAPKTKIKFLQKYYHKGAFFMDKLEDGSEPIY 358
Query: 361 KQDFSGPTLDDHFDKTVLPKVMQVKK--FGRSGRTKYTHLVDQDTTEFDSAWSNETSAAR 418
K+DF+ PT DD DK+++PK MQV++ +G+ GR+KYTHL +DTT+FD WS + A
Sbjct: 359 KRDFNAPTADDCVDKSLMPKSMQVRRGQYGKMGRSKYTHLTAEDTTKFDMPWSQQ--PAP 416
Query: 419 LTNFRGGMKQVFEKPS 434
T G + F++PS
Sbjct: 417 FT--PAGARDSFDRPS 430
>UniRef50_Q4U9V1 Cluster: Microfibrillar-associated protein,
putative; n=2; Theileria|Rep: Microfibrillar-associated
protein, putative - Theileria annulata
Length = 431
Score = 124 bits (299), Expect = 5e-27
Identities = 87/269 (32%), Positives = 135/269 (50%), Gaps = 26/269 (9%)
Query: 185 SSDTEYTDSE---EDTGPRVKPVFVRASERMTVAERERKMXXXXXXXXXXXXXXXXXXXX 241
S +++Y + E ED KPVFV R T +E+E+
Sbjct: 173 SEESDYQEDEAGVEDLDVLSKPVFVPKGSRKTESEKEQLRKEEVLRKENEKKRLMERKRD 232
Query: 242 XXXXXXXTIRSEQRGAQGEQKEGNINDVCTDDENDELEYEAWKLXXXXXXXXXXXXXXXX 301
++ + + E ++ I+D D DE EYE WK+
Sbjct: 233 TKEMVIQKVQELEE--EPEPEDELIDDT---DTFDEKEYELWKIRELKRILRDKEEREKF 287
Query: 302 XXXXLTLERLRNMTEDERRLEQRINPKVVTNKAVKGKYKFLQKYYHRGAFYLDKEED--- 358
++ R+MT++ER L+ + KVV K+ K +FLQKYYHRGAF++DK +D
Sbjct: 288 KKLEEEVKLRRSMTDEERELDNQKVDKVVVEKS---KLRFLQKYYHRGAFFMDKLQDKSE 344
Query: 359 -VFKQDFSGPTLDDHFDKTVLPKVMQVKK--FGRSGRTKYTHLVDQDTTEFDSAWSNETS 415
++ +DF+ PT +D DK++LPK M+V++ +G+ G+ K+THL D DTT+FD AWS +T
Sbjct: 345 PLYARDFNAPTAEDCVDKSLLPKPMRVRRGLYGKQGQVKHTHLKDVDTTQFD-AWS-KTD 402
Query: 416 AARLTNF-------RGGMKQVFEKPSAER 437
+LT G KQVF++PS ++
Sbjct: 403 KYKLTGLFSVIITQFSGTKQVFDRPSRKK 431
>UniRef50_A4RS12 Cluster: Predicted protein; n=2; Ostreococcus|Rep:
Predicted protein - Ostreococcus lucimarinus CCE9901
Length = 256
Score = 124 bits (298), Expect = 6e-27
Identities = 84/255 (32%), Positives = 122/255 (47%), Gaps = 21/255 (8%)
Query: 202 KPVFVRASERMTVAERERKMXXXXXXXXXXXXXXXXXXXXXXXXXXXTIRSEQRGAQGEQ 261
KPVFVR ER T+ ER++ + ++ E+ A
Sbjct: 3 KPVFVRKIERDTIEERDKMLAELDAEAAKTEAAKAAKKAESKKLVEVEVKREEALAAA-M 61
Query: 262 KEGNINDVCTDDE-NDELEYEAWKLXXXXXXXXXXXXXXXXXXXXLTLERLRNMTEDERR 320
E +DV TDDE +D LE++AWK ER+R+MTE+ER
Sbjct: 62 DEMEPSDVDTDDELDDALEFDAWKSRELERLKTDRIQRELIFREREEQERIRSMTEEERD 121
Query: 321 L--EQRINPKVVTNKAVKGKYKFLQKYYHRGAFYLDKEEDVF---------KQDFSGPTL 369
L +R+ + K K K F+QKYYH+GAF+ + +D F K+DFS PT
Sbjct: 122 LYHAKRLAKRAEQEKE-KPKMAFMQKYYHKGAFFQESADDAFGTAGPDEIYKRDFSAPTA 180
Query: 370 DDHFDKTVLPKVMQVK--KFGRSGRTKYTHLVDQDTT-----EFDSAWSNETSAARLTNF 422
++ FDK++LP MQV+ KFGR+G+TK+THL +DT+ + D WS + R
Sbjct: 181 EEKFDKSILPAAMQVRKGKFGRAGQTKWTHLAAEDTSAARKGDDDDLWSGRDKSVRAIKD 240
Query: 423 RGGMKQVFEKPSAER 437
+ KQ + +A R
Sbjct: 241 KMLAKQGGLRDAARR 255
>UniRef50_Q54SU3 Cluster: Putative uncharacterized protein; n=1;
Dictyostelium discoideum AX4|Rep: Putative
uncharacterized protein - Dictyostelium discoideum AX4
Length = 460
Score = 116 bits (278), Expect = 2e-24
Identities = 75/231 (32%), Positives = 108/231 (46%), Gaps = 8/231 (3%)
Query: 186 SDTEYTDSEE--DTGPRVKPVFVRASERMTVA---ERERKMXXXXXXXXXXXXXXXXXXX 240
+D+E D +E D P +P F++ +R T+ + E++
Sbjct: 197 TDSEEDDEDEYWDQPPIFRPTFIKKDDRGTIKTDEQWEKEEQEQQAQLEREKEQRKIEAH 256
Query: 241 XXXXXXXXTIRSEQRGAQGEQKEGNINDVCTDDENDELEYEAWKLXXXXXXXXXXXXXXX 300
R EQ + EQKE D D++ D + W
Sbjct: 257 RKLKDELDRDRKEQEAKELEQKEEEEYD--DDEDQDGSKKLLWIQRELERVRLEIHTRLL 314
Query: 301 XXXXXLTLERLRNMTEDERRLEQRINPKVVTNKAVKGKYKFLQKYYHRGAFYLDKEEDVF 360
R R MT+D+ E + + + K + KFLQ+ YHRGAF+ D +E +
Sbjct: 315 AEFEKKEFARRRAMTDDQILKEDPSRSRTNIDNSQKKQLKFLQRDYHRGAFFQD-DEYIK 373
Query: 361 KQDFSGPTLDDHFDKTVLPKVMQVKKFGRSGRTKYTHLVDQDTTEFDSAWS 411
+DFS PT +D F++ +LPKVMQVK FG++GRTKYTHL DQDTTE DS W+
Sbjct: 374 NKDFSAPTGEDKFNRELLPKVMQVKNFGKAGRTKYTHLKDQDTTEKDSLWN 424
>UniRef50_Q8IE75 Cluster: Microfibril-associated protein homologue,
putative; n=4; Plasmodium|Rep: Microfibril-associated
protein homologue, putative - Plasmodium falciparum
(isolate 3D7)
Length = 490
Score = 109 bits (263), Expect = 1e-22
Identities = 60/174 (34%), Positives = 95/174 (54%), Gaps = 11/174 (6%)
Query: 272 DDENDELEYEAWKLXXXXXXXXXXXXXXXXXXXXLTLERLRNMTEDERRLEQRINPKVVT 331
D+E +E EYE WK+ L +++ R MT+ E + + P
Sbjct: 322 DEELNEEEYELWKIRHINRLKRDELDRKKHEILELEIKKRRKMTDKEIIQDNKTLPN--K 379
Query: 332 NKAVKGKYKFLQKYYHRGAFYLDK----EEDVFKQDFSGPTLDDHFDKTVLPKVMQVK-- 385
K K K F+QKYYH+G FY D +E+++ +D++ P +D D+ LPKV+QV+
Sbjct: 380 EKKKKRKMLFMQKYYHKGGFYQDLFEEGKEEIYLRDYNEPVYEDKVDRQNLPKVLQVRRG 439
Query: 386 KFGRSGRTKYTHLVDQDTTEFDSAWSNETSAARLTNFRGGMKQVFEKPSAERKH 439
KFG+ G++KYTHL+D DT+ DS W+N S + F+ K F++P+ +K+
Sbjct: 440 KFGKQGQSKYTHLLDNDTSRKDSLWNNIESNMK---FKDKKKDQFDRPTYRKKN 490
>UniRef50_A5K2Z5 Cluster: Micro-fibrillar-associated protein 1
C-terminus domain containing protein; n=2;
Plasmodium|Rep: Micro-fibrillar-associated protein 1
C-terminus domain containing protein - Plasmodium vivax
Length = 478
Score = 108 bits (260), Expect = 2e-22
Identities = 74/265 (27%), Positives = 123/265 (46%), Gaps = 21/265 (7%)
Query: 185 SSDTEYTDSEEDTGPRVKPVFVRASERMTVAER-ERKMXXXXXXXXXXXXXXXXXXXXXX 243
S D Y + E+ + P +K +V ++R T+ E +++
Sbjct: 216 SGDENYMNGEDGSAP-MKHEYVFKTKRKTLLESFQKEQNEKQLQKSEATEKKIIEEEKKE 274
Query: 244 XXXXXTIRSEQRGAQGEQKEGNI--------NDVCTDDENDELEYEAWKLXXXXXXXXXX 295
TI +E Q + +E N+ +D + E DE EY+ WKL
Sbjct: 275 KAIEETIHNEIMIEQMKNQENNVFSSDENFDDDDADEGEPDEKEYQLWKLRHMSRLKRDE 334
Query: 296 XXXXXXXXXXLTLERLRNMTEDERRLEQRINPKVVTNKAVKGKYKFLQKYYHRGAFYLDK 355
+ R MT+ E + ++ P K K K F+QKYYHRG F+ D
Sbjct: 335 LDRRKHQLVQDEISERRKMTDREIMEQNKLLPH--KEKKKKKKMLFMQKYYHRGGFFQDL 392
Query: 356 ----EEDVFKQDFSGPTLDDHFDKTVLPKVMQVKK--FGRSGRTKYTHLVDQDTTEFDSA 409
+E+++++D++ P +D DK LPKV++V++ FG+ G++KYTHL+D DT+ DS
Sbjct: 393 FEEGKEEIYRRDYNEPVYEDKVDKENLPKVLRVRRGNFGKQGQSKYTHLLDNDTSRKDSL 452
Query: 410 WSNETSAARLTNFRGGMKQVFEKPS 434
W+N AR + + +FE+P+
Sbjct: 453 WANRDLEARRARRK---EDLFERPT 474
>UniRef50_UPI00006CD032 Cluster: Micro-fibrillar-associated protein
1 C-terminus containing protein; n=1; Tetrahymena
thermophila SB210|Rep: Micro-fibrillar-associated
protein 1 C-terminus containing protein - Tetrahymena
thermophila SB210
Length = 521
Score = 102 bits (245), Expect = 2e-20
Identities = 71/259 (27%), Positives = 119/259 (45%), Gaps = 15/259 (5%)
Query: 192 DSEEDTGP---RVKPVFVRASER-----MTVAERERKMXXXXXXXXXXXXXXXXXXXXXX 243
+ EE+ P +KPV++ SER + + E+E +
Sbjct: 267 EEEEEVRPVYKMMKPVYIPKSERDYQNQLDIEEQELEEQRKKQEQIAKQQIKMIIMEQKK 326
Query: 244 XXXXXTIRSEQRGAQGEQKEGNINDVCTDDENDELEYEAWKLXXXXXXXXXXXXXXXXXX 303
+ E++ Q + +ND DD + E E E WK+
Sbjct: 327 QQIIGNLGDEEQSDDSRQGKDFMND--DDDMDREFEREQWKIRELKRIRKDRDEQIKREK 384
Query: 304 XXLTLERLRNMTEDERRLEQRINPKVVTNKAVKGKYKFLQKYYHRGAFYLDKEEDVFKQD 363
ER MT +E E + + K K + F+QKYYH+GAFY D ++DV ++D
Sbjct: 385 ELAEQERRSKMTNEEIIEEDK--RLGLHQKKEKRQIGFMQKYYHKGAFYQDDDDDVLQRD 442
Query: 364 FSGPTLDDHFDKTVLPKVMQVKK--FGRSGRTKYTHLVDQDTTEFDSAWS-NETSAARLT 420
F+ P ++ DK+VLP +MQ ++ FG+ G++KYTHL DQDTT FD + +++ ++
Sbjct: 443 FNMPVGEELLDKSVLPHLMQKRRGNFGKKGQSKYTHLTDQDTTNFDPKYRVDDSLQKKML 502
Query: 421 NFRGGMKQVFEKPSAERKH 439
+ + G+K ++K+
Sbjct: 503 SKQAGLKAANNLDPRKKKY 521
>UniRef50_A0EFW2 Cluster: Chromosome undetermined scaffold_94, whole
genome shotgun sequence; n=1; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_94,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 410
Score = 99 bits (238), Expect = 1e-19
Identities = 53/182 (29%), Positives = 93/182 (51%), Gaps = 3/182 (1%)
Query: 249 TIRSEQRGAQGEQKEGNINDVCTDDENDELEYEAWKLXXXXXXXXXXXXXXXXXXXXLTL 308
+++++ A E+ + + +D DE EY WK+ +
Sbjct: 217 SVKADAAKAVNEESDDGKQKLNDEDTLDETEYALWKIRELKRIKQFNDEKNKYEIEKAEI 276
Query: 309 ERLRNMTEDERRLEQRINPKVVTNKAVKGKYKFLQKYYHRGAFYLDKEEDVFKQDFSGPT 368
+R RN+T+ +R E T K KY F+QKYY+ GAFY D ++ +F++D++ P
Sbjct: 277 DRRRNLTDMQRIQEDFKLGSDKTKMEDKTKYVFMQKYYNTGAFYKDMDDPIFQRDYNLPV 336
Query: 369 LDDHFDKTVLPKVMQVKK--FGRSGRTKYTHLVDQDTTEFDSAWS-NETSAARLTNFRGG 425
+D + K LP+++Q ++ FG+ G +KYTHL +DTT FD + +++ + N + G
Sbjct: 337 GEDLWRKDNLPQILQKRRGEFGKKGNSKYTHLTQEDTTNFDPTYQVDQSIRQKFLNQQAG 396
Query: 426 MK 427
K
Sbjct: 397 SK 398
>UniRef50_Q9P7H6 Cluster: Microfibrillar-associated protein familt
protein; n=1; Schizosaccharomyces pombe|Rep:
Microfibrillar-associated protein familt protein -
Schizosaccharomyces pombe (Fission yeast)
Length = 355
Score = 95.5 bits (227), Expect = 2e-18
Identities = 60/186 (32%), Positives = 94/186 (50%), Gaps = 10/186 (5%)
Query: 260 EQKEGN--INDVC-TDDENDELEYEAWKLXXXXXXXXXXXXXXXXXXXXLTLERLRNMTE 316
E K N +ND+ TD + + EYE WKL + +E R M
Sbjct: 171 ETKNNNELLNDIDDTDGIDPQSEYELWKLRHLLRKKRDKEKSLELEREKMAIEERRLMNS 230
Query: 317 DERRLEQRINPKVVTNKAVKGKYKFLQKYYHRGAFYLDKEEDVFKQDFSGPTLDDHFDKT 376
+ER + + + K +FLQKYYH+GAFY E+ V K+D+S T + +K
Sbjct: 231 EEREAQDLKDAEASRRGKKKSSMQFLQKYYHKGAFY-QNEDIVSKRDYSEATEGEVLNKD 289
Query: 377 VLPKVMQVK--KFGRSGRTKYTHLVDQDTTEFDSAWSNETSAARLTNFR--GGMKQVFEK 432
+LPK MQ++ F ++G+T++THL ++DTT+ SAW + + N GG+ +
Sbjct: 290 LLPKPMQIRGDLFAKAGQTRWTHLANEDTTKEGSAWYDPKNPILQKNLHRLGGLHS--DS 347
Query: 433 PSAERK 438
P ++RK
Sbjct: 348 PLSKRK 353
>UniRef50_Q7S7V7 Cluster: Predicted protein; n=7;
Pezizomycotina|Rep: Predicted protein - Neurospora
crassa
Length = 712
Score = 91.1 bits (216), Expect = 5e-17
Identities = 57/160 (35%), Positives = 82/160 (51%), Gaps = 9/160 (5%)
Query: 260 EQKEGNINDVCTDDENDELEYEAWKLXXXXXXXXXXXXXXXXXXXXLTLERLRNMTEDER 319
+ ++ I+D TDD + E EY AWKL +ER RN+TE+ER
Sbjct: 407 DPEDDQIDD--TDDIDPEAEYAAWKLRELKRVRREREAIEAKEKELAEIERRRNLTEEER 464
Query: 320 RLEQRIN-PKVVTNKAVKGKYKFLQKYYHRGAFYLD--KEEDVFKQDFSGPTL-DDHFDK 375
R E + + K KGK ++QKY+H+GAFY D KE + K+D G DD ++
Sbjct: 465 RAEDEKHLQQQKEEKEGKGKMAYMQKYFHKGAFYQDESKEMGLDKRDIMGARFADDVKNR 524
Query: 376 TVLPKVMQVK---KFGRSGRTKYTHLVDQDTTEFDSAWSN 412
+LPK +Q++ K GR G TKY L +DT ++ N
Sbjct: 525 ELLPKALQLRDMTKLGRKGATKYRDLKSEDTGQWGRLHDN 564
>UniRef50_Q4P301 Cluster: Putative uncharacterized protein; n=1;
Ustilago maydis|Rep: Putative uncharacterized protein -
Ustilago maydis (Smut fungus)
Length = 500
Score = 90.2 bits (214), Expect = 9e-17
Identities = 67/248 (27%), Positives = 110/248 (44%), Gaps = 15/248 (6%)
Query: 186 SDTEYTDSEEDTGPRVKPVFVRASERMTVAERERKMXXXXXXXXXXXXXXXXXXXXXXXX 245
S+ E +SE T P +KP+FV R T++
Sbjct: 160 SEGESEESETKTEPLLKPIFVPKQARTTISTDAAADQHQLELDAEAKAEAEAAVRRKEAH 219
Query: 246 XXXTIRSEQRGAQGEQKEGNINDVC-TDDENDELEYEAWKLXXXXXXXXXXXXXXXXXXX 304
+++ A+ E ++ + DV TD + E E++AW+
Sbjct: 220 DLAAAAIKRQLAEKEYQDTHQTDVDDTDGLDPEAEFQAWRERELARLRRDHEAILAKQRA 279
Query: 305 XLTLERLRNMTEDER-RL-EQRINPKVVTNKAVKGKYKFLQKYYHRGAFYLDKEEDVFKQ 362
++ +++ E E+ RL +R K +G FLQKYYH+G+F+ D D+ K+
Sbjct: 280 QQEIDAFKSLPEAEKERLGRERAAQLRAEKKEQRGNPAFLQKYYHKGSFFQDM--DILKR 337
Query: 363 DFSGPTLDDHFDKTVLPKVMQVKKFGRSGRTKYTHLVDQDTTEFDSAWSNETSAARLTNF 422
D++ T D D + LPK+MQV+ +G GR+K+THL ++DT++ A RL
Sbjct: 338 DYTEKTSKD-VDISKLPKMMQVRGYGEKGRSKWTHLANEDTSK---------GAMRLDVL 387
Query: 423 RGGMKQVF 430
+GG K F
Sbjct: 388 QGGSKGCF 395
>UniRef50_A1DDP7 Cluster: Microfibrillar-associated protein MfaP1,
putative; n=10; Eurotiomycetidae|Rep:
Microfibrillar-associated protein MfaP1, putative -
Neosartorya fischeri (strain ATCC 1020 / DSM 3700 / NRRL
181)(Aspergillus fischerianus (strain ATCC 1020 / DSM
3700 / NRRL 181))
Length = 512
Score = 84.2 bits (199), Expect = 6e-15
Identities = 51/160 (31%), Positives = 80/160 (50%), Gaps = 9/160 (5%)
Query: 262 KEGNINDVCTDDENDELEYEAWKLXXXXXXXXXXXXXXXXXXXXLTLERLRNMTEDERRL 321
+EG I+D D + E EY AWKL +ER RN+T +ER
Sbjct: 234 EEGAIDD--RDGVDPEAEYAAWKLRELKRIKREREAIEAAEKEREEIERRRNLTAEERER 291
Query: 322 EQR--INPKVVTNKAVKGKYKFLQKYYHRGAFYLD--KEEDVFKQDFSGPTLDDHFDKTV 377
E R I + +A +G+ F+Q+Y+H+GAF+ D + E + K++ G D +
Sbjct: 292 EDREFIEKQKQEKEASRGQTGFMQRYFHKGAFFRDDLEREGLDKRNVMGQRFADDVARET 351
Query: 378 LPKVMQVK---KFGRSGRTKYTHLVDQDTTEFDSAWSNET 414
LP+ MQ++ K G+ GRT+Y L +DT F ++N +
Sbjct: 352 LPEYMQIRDMTKLGKKGRTRYKDLRTEDTGRFGEGFNNRS 391
>UniRef50_Q0TWF3 Cluster: Putative uncharacterized protein; n=1;
Phaeosphaeria nodorum|Rep: Putative uncharacterized
protein - Phaeosphaeria nodorum (Septoria nodorum)
Length = 420
Score = 83.0 bits (196), Expect = 1e-14
Identities = 61/234 (26%), Positives = 106/234 (45%), Gaps = 13/234 (5%)
Query: 183 SGSSD---TEYTDSEEDTGPR--VKPVFVRASERMTVAERERKMXXXXXXXXXXXXXXXX 237
SGS D +E S ED P+ ++PVF++ +ER VA +
Sbjct: 117 SGSEDEDESEQESSSEDEAPKKLLRPVFLKKNERNKVAAPVKSAEEAAAEEEARRQEQSR 176
Query: 238 XXXXXXXXXXXTIRSE-QRGAQGEQKEGNINDVC-TDDENDELEYEAWKLXXXXXXXXXX 295
++ ++ + ++ +IN + TD + EY AWKL
Sbjct: 177 ALVQEQVEQRIAEKAAGKKDWDDDVEDADINAIDDTDGLDAAAEYAAWKLRELKRIKRER 236
Query: 296 XXXXXXXXXXLTLERLRNMTEDERRLEQR-INPKVVTNKAVKGKYKFLQKYYHRGAFYLD 354
+ER RN++ ER E R + ++A +G+ +++QKY+H+GAF+ D
Sbjct: 237 QAIEEAEAERAEIERRRNLSAAERDAEDRAFIDQQKEDRADRGEMQYMQKYFHKGAFFTD 296
Query: 355 --KEEDVFKQDFSGPTLDDHFDKTVLPKVMQVK---KFGRSGRTKYTHLVDQDT 403
KE V +++ +D ++ VLP+ MQ++ K G+ GRT+Y + +DT
Sbjct: 297 ELKELGVDRRNLMNARFEDQTNRDVLPEYMQIRDMTKLGKKGRTRYKDMKTEDT 350
>UniRef50_A2ELS5 Cluster: Putative uncharacterized protein; n=1;
Trichomonas vaginalis G3|Rep: Putative uncharacterized
protein - Trichomonas vaginalis G3
Length = 231
Score = 77.4 bits (182), Expect = 7e-13
Identities = 38/81 (46%), Positives = 52/81 (64%), Gaps = 5/81 (6%)
Query: 332 NKAVKGKYKFLQKYYHRGAFYLD---KEEDVFKQDFSGPTLDDHFDKTVLPKVMQVK--K 386
N KG KF QKYYH+GAF +D K E++ +D+ PT DD DKT LPK M V+
Sbjct: 126 NPKEKGHMKFYQKYYHKGAFSIDESEKAEELLNRDYLTPTGDDLLDKTALPKEMMVRGDD 185
Query: 387 FGRSGRTKYTHLVDQDTTEFD 407
+ + G++K+THL ++DTT D
Sbjct: 186 YNKRGKSKWTHLSNEDTTTVD 206
>UniRef50_UPI0000499156 Cluster: microfibril-associated protein;
n=1; Entamoeba histolytica HM-1:IMSS|Rep:
microfibril-associated protein - Entamoeba histolytica
HM-1:IMSS
Length = 242
Score = 76.2 bits (179), Expect = 2e-12
Identities = 60/221 (27%), Positives = 98/221 (44%), Gaps = 16/221 (7%)
Query: 189 EYTDSEEDTGPRVKPVFVRASERMTVAER--ERKMXXXXXXXXXXXXXXXXXXXXXXXXX 246
EYT+ E + +P+FV + + E+ E+++
Sbjct: 28 EYTEEESNEEEDEEPIFVPMRKEIIKKEQIEEKEIKENVFPPYKQQTQDINTNEINKKLI 87
Query: 247 XXTIRSEQRGAQGEQKEGNINDVCTDDENDELEYEAWKLXXXXXXXXXXXXXXXXXXXXL 306
TI+ E Q E +E D+ + E+EAW+
Sbjct: 88 QMTIQKELE--QKENEESTEEFSSGDEYGGKDEFEAWQQRELERLKKEYIEQLNYQHD-- 143
Query: 307 TLERLRNMTEDERRLEQRINPKVVTNKAVKGKYKFLQKYYHRGAFYLDKEE-DVFKQ--D 363
LE+L+ + E + + K + K+KF+QKYYH G+F+ D + DV K D
Sbjct: 144 -LEKLKEICSTESQNHEE------EKKKERKKWKFMQKYYHIGSFFRDGGKWDVSKGNWD 196
Query: 364 FSGPTLDDHFDKTVLPKVMQVKKFGRSGRTKYTHLVDQDTT 404
F T DD DK++LPK++Q K +G+ GR+K+T+L ++DTT
Sbjct: 197 FDAATGDDWMDKSLLPKILQTKDWGKKGRSKHTNLKEEDTT 237
>UniRef50_Q5C2C1 Cluster: SJCHGC04323 protein; n=1; Schistosoma
japonicum|Rep: SJCHGC04323 protein - Schistosoma
japonicum (Blood fluke)
Length = 241
Score = 61.3 bits (142), Expect = 5e-08
Identities = 25/36 (69%), Positives = 32/36 (88%)
Query: 11 IQSTAGAIPIRNEKGEISMQKVKVQRYISGKKPDYA 46
I STAGA+P++NEKG+ M KVKVQRY++GKKPD+A
Sbjct: 10 IHSTAGAVPVKNEKGQFYMVKVKVQRYVAGKKPDFA 45
Score = 42.7 bits (96), Expect = 0.019
Identities = 16/36 (44%), Positives = 27/36 (75%)
Query: 184 GSSDTEYTDSEEDTGPRVKPVFVRASERMTVAERER 219
G S+ EYT S+++ P++KPVFVRA +R+T+ + +
Sbjct: 205 GYSEEEYTSSDDEVAPKLKPVFVRARDRITLQAKHK 240
>UniRef50_UPI000155C08B Cluster: PREDICTED: similar to
Microfibrillar-associated protein 1, partial; n=1;
Ornithorhynchus anatinus|Rep: PREDICTED: similar to
Microfibrillar-associated protein 1, partial -
Ornithorhynchus anatinus
Length = 243
Score = 57.2 bits (132), Expect = 8e-07
Identities = 38/101 (37%), Positives = 55/101 (54%), Gaps = 12/101 (11%)
Query: 26 EISMQKVKVQRYISGKKPDYAQGMXXXXXXXXXXFIEQQRPERKQVLPQIITRKEEHHSD 85
EISM+KVKV+RY+SGK+PDYA FI ++ + +++ P EE D
Sbjct: 1 EISMEKVKVKRYVSGKRPDYAPMESSDEEDEEFQFI--KKAKEQEIEP------EEQEED 52
Query: 86 SEKEVDDPRLRRLRN-IAQSPPRRAEHKPEIIDAEPEAESE 125
S DPRLRRL+N I++ R +I++ E ES+
Sbjct: 53 SS---SDPRLRRLQNRISEDVEERLARHRKIVEPEVVGESD 90
>UniRef50_Q6CA21 Cluster: Similar to tr|Q8X0K0 Neurospora crassa
Related to microfibril- associated protein; n=1;
Yarrowia lipolytica|Rep: Similar to tr|Q8X0K0 Neurospora
crassa Related to microfibril- associated protein -
Yarrowia lipolytica (Candida lipolytica)
Length = 333
Score = 56.8 bits (131), Expect = 1e-06
Identities = 46/159 (28%), Positives = 65/159 (40%), Gaps = 14/159 (8%)
Query: 250 IRSEQRGAQGEQKEGNINDVC----TDDENDELEYEAWKLXXXXXXXXXXXXXXXXXXXX 305
IR+EQ + +E N + TDD++ E E E WK
Sbjct: 180 IRAEQEAQRALYEEENAAEFGGVDDTDDQDVEKELEDWKAREKARLDRDRQELISREEAL 239
Query: 306 LTLERLRNMTEDERRLEQRINPKVVTNKAVKGKYKFLQKYYHRGAFYLDKEEDVFKQDFS 365
+R E N KGK K GAFY +E+D+ K+D S
Sbjct: 240 AREDRKEEAGEQNEPSGDGSNHWEARGDDKKGKPK--------GAFY--QEQDILKRDLS 289
Query: 366 GPTLDDHFDKTVLPKVMQVKKFGRSGRTKYTHLVDQDTT 404
GP DD+ DKT +P+ + K G+ GR + L +QDT+
Sbjct: 290 GPLQDDYVDKTNVPQSLLGKNVGQKGRIMHKSLKEQDTS 328
>UniRef50_UPI0000DC125F Cluster: UPI0000DC125F related cluster;
n=1; Rattus norvegicus|Rep: UPI0000DC125F UniRef100
entry - Rattus norvegicus
Length = 274
Score = 55.6 bits (128), Expect = 2e-06
Identities = 24/36 (66%), Positives = 31/36 (86%)
Query: 11 IQSTAGAIPIRNEKGEISMQKVKVQRYISGKKPDYA 46
IQSTAG +PIR+EK EISM+K +V+ Y+ GK+PDYA
Sbjct: 18 IQSTAGTVPIRHEKCEISMEKGRVKLYVPGKRPDYA 53
>UniRef50_Q5KAL3 Cluster: Putative uncharacterized protein; n=1;
Filobasidiella neoformans|Rep: Putative uncharacterized
protein - Cryptococcus neoformans (Filobasidiella
neoformans)
Length = 513
Score = 54.8 bits (126), Expect = 4e-06
Identities = 44/167 (26%), Positives = 65/167 (38%), Gaps = 5/167 (2%)
Query: 186 SDTEYTDSEEDTGPRVKPVFVRASERMTVAERERKMXXXXXXXXXXXXXXXXXXXXXXXX 245
+D+E EE P +PVFV + R AE+
Sbjct: 132 TDSEEESEEEVKKPMFRPVFVPKNARNMTAEKAA--AEAEEARKREEEAEEQRKLASKEL 189
Query: 246 XXXTIRSEQRGAQGEQKEGNINDVCTDDENDELEYEAWKLXXXXXXXXXXXXXXXXXXXX 305
+IR E + ++D TD + E E+EAW+
Sbjct: 190 AGESIRRELVEKEAADIVPEVDD--TDGLDVEAEFEAWRARELARLLREKQAQAAKDEEK 247
Query: 306 LTLERLRNMTEDERRLEQRINPKVVTNKAVKGKYKFLQKYYHRGAFY 352
+ER R M E E+RL++ + T + KG+ FLQKYYH+GAF+
Sbjct: 248 EEIERRRAMPE-EQRLKEDMEFAARTREKEKGQMGFLQKYYHKGAFH 293
>UniRef50_Q0E1X2 Cluster: Os02g0294000 protein; n=5; Oryza
sativa|Rep: Os02g0294000 protein - Oryza sativa subsp.
japonica (Rice)
Length = 360
Score = 52.4 bits (120), Expect = 2e-05
Identities = 27/69 (39%), Positives = 44/69 (63%), Gaps = 13/69 (18%)
Query: 327 PKVVTNKAVKGKYKFLQKYYHRGAFYLDKEE----------DVFKQDFSGPTLDDHFDKT 376
PK +T +K + +F+++YYH+G F+ D + +++++DFSGPT D D +
Sbjct: 273 PKKMT---IKKQMRFMRRYYHKGCFFQDDADGAAQTAAGACEIYRRDFSGPTGLDKMDVS 329
Query: 377 VLPKVMQVK 385
VLPKVMQV+
Sbjct: 330 VLPKVMQVE 338
>UniRef50_A6S856 Cluster: Putative uncharacterized protein; n=1;
Botryotinia fuckeliana B05.10|Rep: Putative
uncharacterized protein - Botryotinia fuckeliana B05.10
Length = 243
Score = 37.9 bits (84), Expect = 0.53
Identities = 26/88 (29%), Positives = 39/88 (44%), Gaps = 3/88 (3%)
Query: 260 EQKEGNINDVCTDDENDELEYEAWKLXXXXXXXXXXXXXXXXXXXXLTLERLRNMTEDER 319
E++E ++D TDD + E E AWKL +ER RN+TE+ER
Sbjct: 157 EEEEDEVDD--TDDIDPEAELAAWKLRELKRIKRDREAIEEREKELEEVERRRNLTEEER 214
Query: 320 RLE-QRINPKVVTNKAVKGKYKFLQKYY 346
+ E K + +GK +QK +
Sbjct: 215 KKEDDEYIAKQKEEREGRGKMATMQKRF 242
>UniRef50_Q1YGP6 Cluster: Putative uncharacterized protein; n=1;
Aurantimonas sp. SI85-9A1|Rep: Putative uncharacterized
protein - Aurantimonas sp. SI85-9A1
Length = 188
Score = 36.3 bits (80), Expect = 1.6
Identities = 17/51 (33%), Positives = 32/51 (62%), Gaps = 1/51 (1%)
Query: 61 IEQQRPERKQVLPQIITRKEEHHSDSEKEVDDPRLRR-LRNIAQSPPRRAE 110
+ QQRP+RK L + + + +D ++DPRL++ LR+ A++ RR++
Sbjct: 137 VNQQRPDRKPKLRDLAPVEHQRVADMVARIEDPRLQKALRDFAETTLRRSK 187
>UniRef50_Q4WBH2 Cluster: C6 transcription factor, putative; n=1;
Aspergillus fumigatus|Rep: C6 transcription factor,
putative - Aspergillus fumigatus (Sartorya fumigata)
Length = 761
Score = 35.5 bits (78), Expect = 2.8
Identities = 16/34 (47%), Positives = 21/34 (61%), Gaps = 2/34 (5%)
Query: 63 QQRPERKQVLPQI--ITRKEEHHSDSEKEVDDPR 94
Q R E Q +PQ+ R E HHSDS+ +DDP+
Sbjct: 494 QSRSEADQKIPQLDDFLRLEAHHSDSDLNIDDPK 527
>UniRef50_Q5BX93 Cluster: SJCHGC03879 protein; n=1; Schistosoma
japonicum|Rep: SJCHGC03879 protein - Schistosoma
japonicum (Blood fluke)
Length = 193
Score = 35.1 bits (77), Expect = 3.7
Identities = 22/60 (36%), Positives = 32/60 (53%), Gaps = 3/60 (5%)
Query: 354 DKEEDVFKQDFSGPTLDDHFDKTVLPKVMQVKKFGRSGRTKYTHLVDQDTTEFDSAWSNE 413
D+EED + DFSG +D+ TV K+ + KK + KY L + + EFD S+E
Sbjct: 37 DEEEDTNEYDFSG--MDNVDLSTVNSKINKTKK-STDSKKKYQKLDEDENNEFDDNQSSE 93
>UniRef50_UPI0000EBF1F9 Cluster: PREDICTED: similar to
microfibrillar-associated protein 1, partial; n=2;
Eutheria|Rep: PREDICTED: similar to
microfibrillar-associated protein 1, partial - Bos
taurus
Length = 231
Score = 34.3 bits (75), Expect = 6.5
Identities = 27/102 (26%), Positives = 41/102 (40%), Gaps = 5/102 (4%)
Query: 186 SDTEYTDSEEDT--GPRVKPVFVRASERMTVAERERKMXXXXXXXXXXXXXXXXXXXXXX 243
SDT T G ++P+ R +R+TV ERE +
Sbjct: 118 SDTSEATEHAHTHLGGSLRPLCCR-KDRVTVQEREAEALKQKELEQEAKHMAEERRKYTL 176
Query: 244 XXXXXTIRSEQRGAQGEQKEGNINDVCTDDENDELEYEAWKL 285
+ E + ++ ++ + TDDENDE EYEAWK+
Sbjct: 177 KIVEEETKKELE--ENKRSLAALDALNTDDENDEEEYEAWKV 216
>UniRef50_UPI0000D56E42 Cluster: PREDICTED: similar to CG32580-PA;
n=2; Coelomata|Rep: PREDICTED: similar to CG32580-PA -
Tribolium castaneum
Length = 1766
Score = 33.9 bits (74), Expect = 8.6
Identities = 18/60 (30%), Positives = 33/60 (55%), Gaps = 4/60 (6%)
Query: 67 ERKQVLPQIITRKEEHHSDSEKEVDDPRLRRLRNIAQSPPRRAEHKPEIIDAEPEAESEI 126
E+K+ P+++ KEE E++ ++P + + + P E +PEI++ E E E EI
Sbjct: 459 EKKEEEPEVLEEKEEEPEIVEEKEEEPEIIEKK---EEEPEEKEEEPEIVE-EKEEEPEI 514
Score = 33.9 bits (74), Expect = 8.6
Identities = 19/60 (31%), Positives = 32/60 (53%), Gaps = 1/60 (1%)
Query: 67 ERKQVLPQIITRKEEHHSDSEKEVDDPRLRRLRNIAQSPPRRAEHKPEIIDAEPEAESEI 126
E K+ P+II +KEE EK+ ++P + ++ E +PEI++ E E E +I
Sbjct: 556 EEKKEEPKIIEKKEEEPEIIEKKKEEPEIIEIKKEEPEILEEKEEEPEILE-EKEEEPKI 614
>UniRef50_UPI00006CD58E Cluster: hypothetical protein
TTHERM_00509090; n=1; Tetrahymena thermophila SB210|Rep:
hypothetical protein TTHERM_00509090 - Tetrahymena
thermophila SB210
Length = 1143
Score = 33.9 bits (74), Expect = 8.6
Identities = 17/42 (40%), Positives = 24/42 (57%), Gaps = 2/42 (4%)
Query: 62 EQQRPERKQVLPQIITRKEEHHSDSEKEVDDPRLRRLRNIAQ 103
+QQ+P RK V P + RK+ +S KE + +L RN AQ
Sbjct: 324 KQQKPSRKSVSPAVTARKQHQQENSNKE--ESKLNTSRNGAQ 363
>UniRef50_A2RV13 Cluster: Zgc:85787 protein; n=3; Danio rerio|Rep:
Zgc:85787 protein - Danio rerio (Zebrafish) (Brachydanio
rerio)
Length = 871
Score = 33.9 bits (74), Expect = 8.6
Identities = 19/64 (29%), Positives = 30/64 (46%), Gaps = 1/64 (1%)
Query: 62 EQQRPERKQVLPQIITRKEEHHSDSEKEVDDPRLRRLRNIAQSPPRRAEHKPEIIDAEPE 121
++++P R I RKEE + +E DP R L+ Q+ + +H E D + E
Sbjct: 327 KREQPRRSIKKDYSIVRKEEEREEDRREDRDPPFRSLKEF-QNMSKEEDHDEEKEDDDEE 385
Query: 122 AESE 125
E E
Sbjct: 386 EEEE 389
Database: uniref50
Posted date: Oct 5, 2007 11:19 AM
Number of letters in database: 575,637,011
Number of sequences in database: 1,657,284
Lambda K H
0.312 0.130 0.365
Gapped
Lambda K H
0.279 0.0580 0.190
Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 381,529,422
Number of Sequences: 1657284
Number of extensions: 13214747
Number of successful extensions: 29962
Number of sequences better than 10.0: 35
Number of HSP's better than 10.0 without gapping: 28
Number of HSP's successfully gapped in prelim test: 7
Number of HSP's that attempted gapping in prelim test: 29819
Number of HSP's gapped (non-prelim): 82
length of query: 441
length of database: 575,637,011
effective HSP length: 103
effective length of query: 338
effective length of database: 404,936,759
effective search space: 136868624542
effective search space used: 136868624542
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 42 (21.9 bits)
S2: 74 (33.9 bits)