BLASTP 2.2.12 [Aug-07-2005]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= BGIBMGA001438-TA|BGIBMGA001438-PA|IPR009730|Micro-fibrillar-associated 1, C-terminal
         (441 letters)

Database: uniref50
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits)  Value

UniRef50_Q9W062 Cluster: CG1017-PA; n=5; Endopterygota|Rep: CG10...   333   5e-90
UniRef50_P55081 Cluster: Microfibrillar-associated protein 1; n=...   309   9e-83
UniRef50_Q16U12 Cluster: Microfibril-associated protein; n=4; Bi...   277   4e-73
UniRef50_Q93712 Cluster: Putative uncharacterized protein; n=2; ...   206   1e-51
UniRef50_Q4RGL1 Cluster: Chromosome undetermined SCAF15099, whol...   192   2e-47
UniRef50_Q9FKN6 Cluster: Similarity to microfibrillar-associated...   167   7e-40
UniRef50_A7ATD4 Cluster: Micro-fibrillar-associated protein 1 C-...   134   6e-30
UniRef50_Q4U9V1 Cluster: Microfibrillar-associated protein, puta...   124   5e-27
UniRef50_A4RS12 Cluster: Predicted protein; n=2; Ostreococcus|Re...   124   6e-27
UniRef50_Q54SU3 Cluster: Putative uncharacterized protein; n=1; ...   116   2e-24
UniRef50_Q8IE75 Cluster: Microfibril-associated protein homologu...   109   1e-22
UniRef50_A5K2Z5 Cluster: Micro-fibrillar-associated protein 1 C-...   108   2e-22
UniRef50_UPI00006CD032 Cluster: Micro-fibrillar-associated prote...   102   2e-20
UniRef50_A0EFW2 Cluster: Chromosome undetermined scaffold_94, wh...    99   1e-19
UniRef50_Q9P7H6 Cluster: Microfibrillar-associated protein famil...    95   2e-18
UniRef50_Q7S7V7 Cluster: Predicted protein; n=7; Pezizomycotina|...    91   5e-17
UniRef50_Q4P301 Cluster: Putative uncharacterized protein; n=1; ...    90   9e-17
UniRef50_A1DDP7 Cluster: Microfibrillar-associated protein MfaP1...    84   6e-15
UniRef50_Q0TWF3 Cluster: Putative uncharacterized protein; n=1; ...    83   1e-14
UniRef50_A2ELS5 Cluster: Putative uncharacterized protein; n=1; ...    77   7e-13
UniRef50_UPI0000499156 Cluster: microfibril-associated protein; ...    76   2e-12
UniRef50_Q5C2C1 Cluster: SJCHGC04323 protein; n=1; Schistosoma j...    61   5e-08
UniRef50_UPI000155C08B Cluster: PREDICTED: similar to Microfibri...    57   8e-07
UniRef50_Q6CA21 Cluster: Similar to tr|Q8X0K0 Neurospora crassa ...    57   1e-06
UniRef50_UPI0000DC125F Cluster: UPI0000DC125F related cluster; n...    56   2e-06
UniRef50_Q5KAL3 Cluster: Putative uncharacterized protein; n=1; ...    55   4e-06
UniRef50_Q0E1X2 Cluster: Os02g0294000 protein; n=5; Oryza sativa...    52   2e-05
UniRef50_A6S856 Cluster: Putative uncharacterized protein; n=1; ...    38   0.53
UniRef50_Q1YGP6 Cluster: Putative uncharacterized protein; n=1; ...    36   1.6
UniRef50_Q4WBH2 Cluster: C6 transcription factor, putative; n=1;...    36   2.8
UniRef50_Q5BX93 Cluster: SJCHGC03879 protein; n=1; Schistosoma j...    35   3.7
UniRef50_UPI0000EBF1F9 Cluster: PREDICTED: similar to microfibri...    34   6.5
UniRef50_UPI0000D56E42 Cluster: PREDICTED: similar to CG32580-PA...    34   8.6
UniRef50_UPI00006CD58E Cluster: hypothetical protein TTHERM_0050...    34   8.6
UniRef50_A2RV13 Cluster: Zgc:85787 protein; n=3; Danio rerio|Rep...
34 8.6 >UniRef50_Q9W062 Cluster: CG1017-PA; n=5; Endopterygota|Rep: CG1017-PA - Drosophila melanogaster (Fruit fly) Length = 478 Score = 333 bits (819), Expect = 5e-90 Identities = 198/467 (42%), Positives = 249/467 (53%), Gaps = 34/467 (7%) Query: 6 AQPTGIQSTAGAIPIRNEKGEISMQKVKVQRYISGKKPDYAQGMXXXXXXXXXXFIE-QQ 64 A +GIQSTAGAIP+RNEKGE+SMQKVKVQRYISGK+PDYA+ FI+ ++ Sbjct: 8 AAASGIQSTAGAIPMRNEKGELSMQKVKVQRYISGKRPDYARADSSSEESDDDDFIDTRK 67 Query: 65 RPERKQVLPQIITRKE-------------EHHSDSEKEVDDPRLRRLR----NIAQSPPR 107 R ER + + E + + EVDDPRLRRLR ++ Sbjct: 68 RLERHKAERHKLELSRQGGSAEGEERAAGEGQEEDDAEVDDPRLRRLRQRPVDMEDMERE 127 Query: 108 RAE-----HKPEII--DAEPEAESEIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 160 R E H+PEI+ D+E E E E Sbjct: 128 RRERHRHIHEPEIMESDSEDEEEDEGAQGAIQRGTNKITLASESDTDAELSDTELENRRT 187 Query: 161 XXXXXVLGRXXXXXXXXXX----XXXSGSSDTEY---TDSEEDTGPRVKPVFVRASERMT 213 +L + S S +EY T+SEED PR+KP+FVR +R T Sbjct: 188 KLRSRMLQQQREEEVLQKEDEKQSESSESESSEYEEETESEEDNEPRLKPLFVRKRDRAT 247 Query: 214 VAERERKMXXXXXXXXXXXXXXXXXXXXXXXXXXXTIRSEQRGAQGEQKEGNINDVCTDD 273 + E+ER+ +++ + + E E I DVCTDD Sbjct: 248 IQEKEREAQKQKQLEAEAKRAAKERRRATLRMVEESVKKDLEKTKPETNEACIEDVCTDD 307 Query: 274 ENDELEYEAWKLXXXXXXXXXXXXXXXXXXXXLTLERLRNMTEDERRLEQRINPKVVTNK 333 ENDE+EYEAWKL L ++R+RNMTE+ERR E R NPKVVTNK Sbjct: 308 ENDEVEYEAWKLRELKRMKRDREERDNVEREKLDIDRMRNMTEEERRQELRQNPKVVTNK 367 Query: 334 AVKGKYKFLQKYYHRGAFYLDKEEDVFKQDFSGPTLDDHFDKTVLPKVMQVKKFGRSGRT 393 A KGKYKFLQKYYHRGAFYLD+E DV K+DF+ TL+DHFDKT+LPKVMQVK FGR GRT Sbjct: 368 ATKGKYKFLQKYYHRGAFYLDEENDVLKRDFAQATLEDHFDKTILPKVMQVKNFGRCGRT 427 Query: 394 KYTHLVDQDTTEFDSAWSNETSA-ARLTN-FRGGMKQVFEKPSAERK 438 KYTHLVDQDTT+FDS W E+S+ + N GGM+Q F+KP+ ++ Sbjct: 428 KYTHLVDQDTTKFDSPWYAESSSNIKFHNEHAGGMRQQFDKPTGSKR 474 >UniRef50_P55081 Cluster: Microfibrillar-associated protein 1; n=25; Eumetazoa|Rep: Microfibrillar-associated protein 1 - Homo sapiens (Human) Length = 439 Score = 309 bits (759), Expect = 9e-83 Identities = 182/447 (40%), Positives = 241/447 (53%), Gaps = 25/447 
(5%) Query: 2 NVLPAQPTGIQSTAGAIPIRNEKGEISMQKVKVQRYISGKKPDYAQGMXXXXXXXXXXFI 61 + L QP IQSTAGA+P+RNEKGEISM+KVKV+RY+SGK+PDYA FI Sbjct: 5 SALMKQPP-IQSTAGAVPVRNEKGEISMEKVKVKRYVSGKRPDYAPMESSDEEDEEFQFI 63 Query: 62 EQQRPERKQVLPQIITRKEEHHSDSEKEVDDPRLRRLRN-IAQSPPRR-AEHK----PEI 115 ++ + + + EE DS DPRLRRL+N I++ R A H+ PE+ Sbjct: 64 KKAKEQEAE--------PEEQEEDSSS---DPRLRRLQNRISEDVEERLARHRKIVEPEV 112 Query: 116 I-DAEPEAESEIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVLGRXXXXX 174 + +++ E E + + Sbjct: 113 VGESDSEVEGDAWRMEREDSSEEEEEEIDDEEIERRRGMMRQRAQERKNEEMEVMEVEDE 172 Query: 175 XXXXXXXXSGSSDTEYTDSEEDTGPRVKPVFVRASERMTVAERERKMXXXXXXXXXXXXX 234 S S EYTDSE++ PR+KPVF+R +R+TV ERE + Sbjct: 173 GRSGEESESESEYEEYTDSEDEMEPRLKPVFIRKKDRVTVQEREAEALKQKELEQEAKRM 232 Query: 235 XXXXXXXXXXXXXXTIRSEQRGAQGEQKEGNINDVCTDDENDELEYEAWKLXXXXXXXXX 294 E + ++ ++ + TDDENDE EYEAWK+ Sbjct: 233 AEERRQYTLQIVGEETPKELE--ENKRSLAALDALNTDDENDEEEYEAWKVRELKRIKRD 290 Query: 295 XXXXXXXXXXXLTLERLRNMTEDERRLEQRINPKVVTNKAVKGKYKFLQKYYHRGAFYLD 354 +ER+RN+TE+ERR E R N KV+TNKAVKGKYKFLQKYYHRGAF++D Sbjct: 291 REDREALEKEKAEIERMRNLTEEERRAELRANGKVITNKAVKGKYKFLQKYYHRGAFFMD 350 Query: 355 KEEDVFKQDFSGPTLDDHFDKTVLPKVMQVKKFGRSGRTKYTHLVDQDTTEFDSAWSNET 414 ++E+V+K+DFS PTL+DHF+KT+LPKVMQVK FGRSGRTKYTHLVDQDTT FDSAW E Sbjct: 351 EDEEVYKRDFSAPTLEDHFNKTILPKVMQVKNFGRSGRTKYTHLVDQDTTSFDSAWGQE- 409 Query: 415 SAARLTNFR---GGMKQVFEKPSAERK 438 SA F+ G++ VFE+PSA+++ Sbjct: 410 SAQNTKFFKQKAAGVRDVFERPSAKKR 436 >UniRef50_Q16U12 Cluster: Microfibril-associated protein; n=4; Bilateria|Rep: Microfibril-associated protein - Aedes aegypti (Yellowfever mosquito) Length = 492 Score = 277 bits (679), Expect = 4e-73 Identities = 138/255 (54%), Positives = 170/255 (66%), Gaps = 3/255 (1%) Query: 183 SGSSDTEYTDSEEDTGPRVKPVFVRASERMTVAERERKMXXXXXXXXXXXXXXXXXXXXX 242 S SS+ E T+SEE+ PR+KP+FVR +R TV E+ER+ Sbjct: 232 SESSEYEETESEEENEPRLKPLFVRKKDRTTVIEKEREANKQKQLEYESKKAAKERRRQT 291 Query: 243 XXXXXXTIRSEQRGAQGEQKEGNINDVCTDDENDELEYEAWKLXXXXXXXXXXXXXXXXX 302 +I+ + A+ + E N+NDV 
TDDENDE+EYEAWKL Sbjct: 292 LKLVEDSIKKDMEKAKVDN-EPNLNDVNTDDENDEVEYEAWKLRELKRIKRDREEKEALE 350 Query: 303 XXXLTLERLRNMTEDERRLEQRINPKVVTNKAVKGKYKFLQKYYHRGAFYLDKEEDVFKQ 362 L +ER+RNMTEDERR +QR+NPK VTNK VKGKYKFLQKYYHRGAFYLD+E+ V+KQ Sbjct: 351 KEKLEIERIRNMTEDERRQDQRLNPKQVTNKTVKGKYKFLQKYYHRGAFYLDQEDQVYKQ 410 Query: 363 DFSGPTLDDHFDKTVLPKVMQVKKFGRSGRTKYTHLVDQDTTEFDSAW-SNETSAARLTN 421 DFS PTL+DHFDKT+LPKVMQVK FGR GRTKYTHLVDQDTT+ +S W ++ + + N Sbjct: 411 DFSAPTLEDHFDKTILPKVMQVKNFGRCGRTKYTHLVDQDTTKAESPWFADSANNTKFYN 470 Query: 422 FR-GGMKQVFEKPSA 435 R GGM+QVFEKPS+ Sbjct: 471 ERAGGMRQVFEKPSS 485 Score = 100 bits (240), Expect = 7e-20 Identities = 58/136 (42%), Positives = 76/136 (55%), Gaps = 14/136 (10%) Query: 4 LPAQPT--GIQSTAGAIPIRNEKGEISMQKVKVQRYISGKKPDYAQGMXXXXXXXXXXFI 61 + QPT GIQSTAGAIP+RN KGE+SMQKVKV RY+SGK+P+YAQ FI Sbjct: 1 MSGQPTIYGIQSTAGAIPVRNPKGELSMQKVKVHRYVSGKRPEYAQHSSSEEESDEEDFI 60 Query: 62 EQQRPERKQVLPQIITRKE--EHHSDSEKEVDDPRLRRLRNIAQSPPRRAE--------- 110 + +R + R+E E D +VDDPR+RRL+ I + E Sbjct: 61 DNRRTAEESYRESRRRREETDEEEDDLPGDVDDPRIRRLQAIRAAEAEEIERERRERHRV 120 Query: 111 -HKPEIIDAEPEAESE 125 H+PE++ +E E E E Sbjct: 121 IHEPELVQSEEEEEDE 136 >UniRef50_Q93712 Cluster: Putative uncharacterized protein; n=2; Caenorhabditis|Rep: Putative uncharacterized protein - Caenorhabditis elegans Length = 466 Score = 206 bits (502), Expect = 1e-51 Identities = 98/259 (37%), Positives = 152/259 (58%), Gaps = 3/259 (1%) Query: 185 SSDTEYTDSEEDTGPRVKPVFVRASERMTVAERERKMXXXXXXXXXXXXXXXXXXXXXXX 244 SS+ E +D ++D PR+KP+F R +R+T+ E E++ Sbjct: 208 SSEEEDSDEDDDPVPRLKPIFTRKKDRITLQEAEKEKEKEILKKIEDEKRAEERKRESAK 267 Query: 245 XXXXTIRSEQRGAQGEQKEG-NINDVCTDDENDELEYEAWKLXXXXXXXXXXXXXXXXXX 303 ++ E+ + + ++ +++ V TDDE + + YEAWKL Sbjct: 268 LVEKVLQEEEAAEKRKTEDRVDLSSVLTDDETENMAYEAWKLREMKRLKRNRDEREEAAR 327 Query: 304 XXLTLERLRNMTEDERRLEQRINPKVVTNKAVKGKYKFLQKYYHRGAFYLDKEEDVFKQD 363 L+++ M+E+ER R+NPKV+TNK KGKYKFLQKY+HRGAF+LD+E++V K++ Sbjct: 328 EKAELDKIHAMSEEERLKYLRLNPKVITNKQDKGKYKFLQKYFHRGAFFLDEEDEVLKRN 
387 Query: 364 FSGPTLDDHFDKTVLPKVMQVKKFGRSGRTKYTHLVDQDTTEFDSAW--SNETSAARLTN 421 F+ T DD FDKT+LPKVMQVK FG++ RTKYTHL ++DTT+ W +N+ ++ T Sbjct: 388 FAEATNDDQFDKTILPKVMQVKNFGKASRTKYTHLTEEDTTDHQGVWASTNQLNSQFSTK 447 Query: 422 FRGGMKQVFEKPSAERKHN 440 GG + VFE+P+ +++ N Sbjct: 448 RAGGSRPVFERPATKKRKN 466 Score = 57.6 bits (133), Expect = 6e-07 Identities = 39/112 (34%), Positives = 53/112 (47%), Gaps = 15/112 (13%) Query: 14 TAGAIPIRNEKGEISMQKVKVQRYISGKKPDYAQGMXXXXXXXXXXFIEQQRPERKQVLP 73 T GAIPI+NEKG+ MQKVKV RY++GK P+YA+ E R + Sbjct: 25 TLGAIPIKNEKGQTVMQKVKVSRYVAGKAPEYARNYDSDSSESDR---ETDRDD------ 75 Query: 74 QIITRKEEHHSDSEKEVDDPRLRRLRNIAQSPPRRAEHKPEIIDAEPEAESE 125 R+ +S E D R RR + + RR KPE++ + SE Sbjct: 76 ---DRRRRRRRESSDEEDRRRHRRHEDYGR---RRQVEKPEVLGKVEDESSE 121 >UniRef50_Q4RGL1 Cluster: Chromosome undetermined SCAF15099, whole genome shotgun sequence; n=2; Euteleostomi|Rep: Chromosome undetermined SCAF15099, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 413 Score = 192 bits (467), Expect = 2e-47 Identities = 86/134 (64%), Positives = 110/134 (82%), Gaps = 4/134 (2%) Query: 308 LERLRNMTEDERRLEQRINPKVVTNKAVKGKYKFLQKYYHRGAFYLDKEEDVFKQDFSGP 367 +E+ NMT++ERR E R + KV+TNK KGKYKFLQKYYHRGAF++D+EEDV+K+DFS P Sbjct: 280 IEKFHNMTDEERRAELRNSGKVITNKGTKGKYKFLQKYYHRGAFFMDEEEDVYKRDFSAP 339 Query: 368 TLDDHFDKTVLPKVMQVKKFGRSGRTKYTHLVDQDTTEFDSAWSNETSAARLTNFR---G 424 TL+DHF+KT+LPKVMQVK FGRSGRTKYTHLVDQDTT FDSAW+ E SA F+ Sbjct: 340 TLEDHFNKTILPKVMQVKNFGRSGRTKYTHLVDQDTTSFDSAWAQE-SAQNSKFFKQKAA 398 Query: 425 GMKQVFEKPSAERK 438 G++ VF++P+ +++ Sbjct: 399 GVRDVFDRPTVKKR 412 Score = 83.8 bits (198), Expect = 8e-15 Identities = 48/115 (41%), Positives = 68/115 (59%), Gaps = 13/115 (11%) Query: 11 IQSTAGAIPIRNEKGEISMQKVKVQRYISGKKPDYAQGMXXXXXXXXXXFIEQQRPERKQ 70 IQSTAGA+P+RNEKGE+SM+KVKV+RY+SGK+PDYA F++ + K+ Sbjct: 14 IQSTAGAVPVRNEKGELSMEKVKVKRYVSGKRPDYAPMQSSDEEDEDFQFVK----KGKE 69 Query: 71 VLPQIITRKEEHHSDSEKEVDDPRLRRLRN-IAQSPPRRAEHKPEIIDAEPEAES 124 V P++ E ++ DPRLRRL N 
+++ R +I + E AES Sbjct: 70 VEPEV--------EQEEDDMSDPRLRRLLNRVSEDVEERLARHRQISEPEVVAES 116 >UniRef50_Q9FKN6 Cluster: Similarity to microfibrillar-associated protein 1; n=5; Magnoliophyta|Rep: Similarity to microfibrillar-associated protein 1 - Arabidopsis thaliana (Mouse-ear cress) Length = 435 Score = 167 bits (405), Expect = 7e-40 Identities = 102/267 (38%), Positives = 140/267 (52%), Gaps = 16/267 (5%) Query: 187 DTEY-TDSEEDTG--PRVKPVFVRASERMTVAERERKMXXXXXXXXXXXXXXXXXXXXXX 243 ++EY TDSE+D +KPVFV +ER T+AERER Sbjct: 164 ESEYETDSEDDMPGIAMIKPVFVPKAERDTIAERERLEAEEEALEELAKRKLEQRKLETK 223 Query: 244 XXXXXTIRSEQRGAQGEQ-KEGNINDVCTDDE-NDELEYEAWKLXXXXXXXXXXXXXXXX 301 +R ++ + +E NI DV TDDE N+ EYE WK Sbjct: 224 QIVVEEVRKDEEIRKNILLEEANIGDVETDDELNEAEEYEVWKTREIGRIKRERDAREAM 283 Query: 302 XXXXLTLERLRNMTEDERRLEQRINPKVVTNKAVKGKYKFLQKYYHRGAFYLDKEED--- 358 +E+LRNMTE ERR +R NPK ++ + K K+ F+QKYYH+GAF+ +D Sbjct: 284 LREREEIEKLRNMTEQERRDWERKNPKPLSAQPKK-KWNFMQKYYHKGAFFQADPDDEAG 342 Query: 359 ------VFKQDFSGPTLDDHFDKTVLPKVMQVKKFGRSGRTKYTHLVDQDTTEFDSAW-S 411 +F++DFS PT +D DK++LPKVMQVK FGRSGRTK+THLV++DTT++ + W S Sbjct: 343 SAGTDGIFQRDFSAPTGEDRLDKSILPKVMQVKHFGRSGRTKWTHLVNEDTTDWSNPWTS 402 Query: 412 NETSAARLTNFRGGMKQVFEKPSAERK 438 N+ + GM KP +K Sbjct: 403 NDPLREKYNKKMAGMDAPIAKPKGSKK 429 >UniRef50_A7ATD4 Cluster: Micro-fibrillar-associated protein 1 C-terminus containing protein; n=1; Babesia bovis|Rep: Micro-fibrillar-associated protein 1 C-terminus containing protein - Babesia bovis Length = 437 Score = 134 bits (323), Expect = 6e-30 Identities = 85/256 (33%), Positives = 123/256 (48%), Gaps = 13/256 (5%) Query: 185 SSDTEYTDSEEDTGPRVKPVFVRASERMTVAERERKMXXXXXXXXXXXXXXXXXXXXXXX 244 + ++E S + KPVFV R+TV E++ Sbjct: 182 TEESEPVTSSQTINALAKPVFVPKKSRLTVKEKKEIEREEQKKIEAEQKRLEERRKQSKE 241 Query: 245 XXXXTIRSEQRGAQGEQKEGNINDVCTDDENDELEYEAWKLXXXXXXXXXXXXXXXXXXX 304 T+ +E ++ E +N V DE E EYE WK+ Sbjct: 242 LVIQTLVAEN---MHQEIENEVNCVDDKDELTEEEYELWKIRELKRIIRDRNERNAHERL 298 Query: 305 
XLTLERLRNMTEDERRLEQRINPKVVTNKAVKGKYKFLQKYYHRGAFYLDKEED----VF 360 +ER R MTE+ER + + A K K KFLQKYYH+GAF++DK ED ++ Sbjct: 299 AAEVERRREMTEEERLEDDERIRQEKGPIAPKTKIKFLQKYYHKGAFFMDKLEDGSEPIY 358 Query: 361 KQDFSGPTLDDHFDKTVLPKVMQVKK--FGRSGRTKYTHLVDQDTTEFDSAWSNETSAAR 418 K+DF+ PT DD DK+++PK MQV++ +G+ GR+KYTHL +DTT+FD WS + A Sbjct: 359 KRDFNAPTADDCVDKSLMPKSMQVRRGQYGKMGRSKYTHLTAEDTTKFDMPWSQQ--PAP 416 Query: 419 LTNFRGGMKQVFEKPS 434 T G + F++PS Sbjct: 417 FT--PAGARDSFDRPS 430 >UniRef50_Q4U9V1 Cluster: Microfibrillar-associated protein, putative; n=2; Theileria|Rep: Microfibrillar-associated protein, putative - Theileria annulata Length = 431 Score = 124 bits (299), Expect = 5e-27 Identities = 87/269 (32%), Positives = 135/269 (50%), Gaps = 26/269 (9%) Query: 185 SSDTEYTDSE---EDTGPRVKPVFVRASERMTVAERERKMXXXXXXXXXXXXXXXXXXXX 241 S +++Y + E ED KPVFV R T +E+E+ Sbjct: 173 SEESDYQEDEAGVEDLDVLSKPVFVPKGSRKTESEKEQLRKEEVLRKENEKKRLMERKRD 232 Query: 242 XXXXXXXTIRSEQRGAQGEQKEGNINDVCTDDENDELEYEAWKLXXXXXXXXXXXXXXXX 301 ++ + + E ++ I+D D DE EYE WK+ Sbjct: 233 TKEMVIQKVQELEE--EPEPEDELIDDT---DTFDEKEYELWKIRELKRILRDKEEREKF 287 Query: 302 XXXXLTLERLRNMTEDERRLEQRINPKVVTNKAVKGKYKFLQKYYHRGAFYLDKEED--- 358 ++ R+MT++ER L+ + KVV K+ K +FLQKYYHRGAF++DK +D Sbjct: 288 KKLEEEVKLRRSMTDEERELDNQKVDKVVVEKS---KLRFLQKYYHRGAFFMDKLQDKSE 344 Query: 359 -VFKQDFSGPTLDDHFDKTVLPKVMQVKK--FGRSGRTKYTHLVDQDTTEFDSAWSNETS 415 ++ +DF+ PT +D DK++LPK M+V++ +G+ G+ K+THL D DTT+FD AWS +T Sbjct: 345 PLYARDFNAPTAEDCVDKSLLPKPMRVRRGLYGKQGQVKHTHLKDVDTTQFD-AWS-KTD 402 Query: 416 AARLTNF-------RGGMKQVFEKPSAER 437 +LT G KQVF++PS ++ Sbjct: 403 KYKLTGLFSVIITQFSGTKQVFDRPSRKK 431 >UniRef50_A4RS12 Cluster: Predicted protein; n=2; Ostreococcus|Rep: Predicted protein - Ostreococcus lucimarinus CCE9901 Length = 256 Score = 124 bits (298), Expect = 6e-27 Identities = 84/255 (32%), Positives = 122/255 (47%), Gaps = 21/255 (8%) Query: 202 KPVFVRASERMTVAERERKMXXXXXXXXXXXXXXXXXXXXXXXXXXXTIRSEQRGAQGEQ 261 KPVFVR ER T+ ER++ + ++ E+ A Sbjct: 3 
KPVFVRKIERDTIEERDKMLAELDAEAAKTEAAKAAKKAESKKLVEVEVKREEALAAA-M 61 Query: 262 KEGNINDVCTDDE-NDELEYEAWKLXXXXXXXXXXXXXXXXXXXXLTLERLRNMTEDERR 320 E +DV TDDE +D LE++AWK ER+R+MTE+ER Sbjct: 62 DEMEPSDVDTDDELDDALEFDAWKSRELERLKTDRIQRELIFREREEQERIRSMTEEERD 121 Query: 321 L--EQRINPKVVTNKAVKGKYKFLQKYYHRGAFYLDKEEDVF---------KQDFSGPTL 369 L +R+ + K K K F+QKYYH+GAF+ + +D F K+DFS PT Sbjct: 122 LYHAKRLAKRAEQEKE-KPKMAFMQKYYHKGAFFQESADDAFGTAGPDEIYKRDFSAPTA 180 Query: 370 DDHFDKTVLPKVMQVK--KFGRSGRTKYTHLVDQDTT-----EFDSAWSNETSAARLTNF 422 ++ FDK++LP MQV+ KFGR+G+TK+THL +DT+ + D WS + R Sbjct: 181 EEKFDKSILPAAMQVRKGKFGRAGQTKWTHLAAEDTSAARKGDDDDLWSGRDKSVRAIKD 240 Query: 423 RGGMKQVFEKPSAER 437 + KQ + +A R Sbjct: 241 KMLAKQGGLRDAARR 255 >UniRef50_Q54SU3 Cluster: Putative uncharacterized protein; n=1; Dictyostelium discoideum AX4|Rep: Putative uncharacterized protein - Dictyostelium discoideum AX4 Length = 460 Score = 116 bits (278), Expect = 2e-24 Identities = 75/231 (32%), Positives = 108/231 (46%), Gaps = 8/231 (3%) Query: 186 SDTEYTDSEE--DTGPRVKPVFVRASERMTVA---ERERKMXXXXXXXXXXXXXXXXXXX 240 +D+E D +E D P +P F++ +R T+ + E++ Sbjct: 197 TDSEEDDEDEYWDQPPIFRPTFIKKDDRGTIKTDEQWEKEEQEQQAQLEREKEQRKIEAH 256 Query: 241 XXXXXXXXTIRSEQRGAQGEQKEGNINDVCTDDENDELEYEAWKLXXXXXXXXXXXXXXX 300 R EQ + EQKE D D++ D + W Sbjct: 257 RKLKDELDRDRKEQEAKELEQKEEEEYD--DDEDQDGSKKLLWIQRELERVRLEIHTRLL 314 Query: 301 XXXXXLTLERLRNMTEDERRLEQRINPKVVTNKAVKGKYKFLQKYYHRGAFYLDKEEDVF 360 R R MT+D+ E + + + K + KFLQ+ YHRGAF+ D +E + Sbjct: 315 AEFEKKEFARRRAMTDDQILKEDPSRSRTNIDNSQKKQLKFLQRDYHRGAFFQD-DEYIK 373 Query: 361 KQDFSGPTLDDHFDKTVLPKVMQVKKFGRSGRTKYTHLVDQDTTEFDSAWS 411 +DFS PT +D F++ +LPKVMQVK FG++GRTKYTHL DQDTTE DS W+ Sbjct: 374 NKDFSAPTGEDKFNRELLPKVMQVKNFGKAGRTKYTHLKDQDTTEKDSLWN 424 >UniRef50_Q8IE75 Cluster: Microfibril-associated protein homologue, putative; n=4; Plasmodium|Rep: Microfibril-associated protein homologue, putative - Plasmodium falciparum (isolate 3D7) Length = 490 Score = 109 bits (263), Expect = 1e-22 Identities = 
60/174 (34%), Positives = 95/174 (54%), Gaps = 11/174 (6%) Query: 272 DDENDELEYEAWKLXXXXXXXXXXXXXXXXXXXXLTLERLRNMTEDERRLEQRINPKVVT 331 D+E +E EYE WK+ L +++ R MT+ E + + P Sbjct: 322 DEELNEEEYELWKIRHINRLKRDELDRKKHEILELEIKKRRKMTDKEIIQDNKTLPN--K 379 Query: 332 NKAVKGKYKFLQKYYHRGAFYLDK----EEDVFKQDFSGPTLDDHFDKTVLPKVMQVK-- 385 K K K F+QKYYH+G FY D +E+++ +D++ P +D D+ LPKV+QV+ Sbjct: 380 EKKKKRKMLFMQKYYHKGGFYQDLFEEGKEEIYLRDYNEPVYEDKVDRQNLPKVLQVRRG 439 Query: 386 KFGRSGRTKYTHLVDQDTTEFDSAWSNETSAARLTNFRGGMKQVFEKPSAERKH 439 KFG+ G++KYTHL+D DT+ DS W+N S + F+ K F++P+ +K+ Sbjct: 440 KFGKQGQSKYTHLLDNDTSRKDSLWNNIESNMK---FKDKKKDQFDRPTYRKKN 490 >UniRef50_A5K2Z5 Cluster: Micro-fibrillar-associated protein 1 C-terminus domain containing protein; n=2; Plasmodium|Rep: Micro-fibrillar-associated protein 1 C-terminus domain containing protein - Plasmodium vivax Length = 478 Score = 108 bits (260), Expect = 2e-22 Identities = 74/265 (27%), Positives = 123/265 (46%), Gaps = 21/265 (7%) Query: 185 SSDTEYTDSEEDTGPRVKPVFVRASERMTVAER-ERKMXXXXXXXXXXXXXXXXXXXXXX 243 S D Y + E+ + P +K +V ++R T+ E +++ Sbjct: 216 SGDENYMNGEDGSAP-MKHEYVFKTKRKTLLESFQKEQNEKQLQKSEATEKKIIEEEKKE 274 Query: 244 XXXXXTIRSEQRGAQGEQKEGNI--------NDVCTDDENDELEYEAWKLXXXXXXXXXX 295 TI +E Q + +E N+ +D + E DE EY+ WKL Sbjct: 275 KAIEETIHNEIMIEQMKNQENNVFSSDENFDDDDADEGEPDEKEYQLWKLRHMSRLKRDE 334 Query: 296 XXXXXXXXXXLTLERLRNMTEDERRLEQRINPKVVTNKAVKGKYKFLQKYYHRGAFYLDK 355 + R MT+ E + ++ P K K K F+QKYYHRG F+ D Sbjct: 335 LDRRKHQLVQDEISERRKMTDREIMEQNKLLPH--KEKKKKKKMLFMQKYYHRGGFFQDL 392 Query: 356 ----EEDVFKQDFSGPTLDDHFDKTVLPKVMQVKK--FGRSGRTKYTHLVDQDTTEFDSA 409 +E+++++D++ P +D DK LPKV++V++ FG+ G++KYTHL+D DT+ DS Sbjct: 393 FEEGKEEIYRRDYNEPVYEDKVDKENLPKVLRVRRGNFGKQGQSKYTHLLDNDTSRKDSL 452 Query: 410 WSNETSAARLTNFRGGMKQVFEKPS 434 W+N AR + + +FE+P+ Sbjct: 453 WANRDLEARRARRK---EDLFERPT 474 >UniRef50_UPI00006CD032 Cluster: Micro-fibrillar-associated protein 1 C-terminus containing protein; n=1; Tetrahymena thermophila SB210|Rep: 
Micro-fibrillar-associated protein 1 C-terminus containing protein - Tetrahymena thermophila SB210 Length = 521 Score = 102 bits (245), Expect = 2e-20 Identities = 71/259 (27%), Positives = 119/259 (45%), Gaps = 15/259 (5%) Query: 192 DSEEDTGP---RVKPVFVRASER-----MTVAERERKMXXXXXXXXXXXXXXXXXXXXXX 243 + EE+ P +KPV++ SER + + E+E + Sbjct: 267 EEEEEVRPVYKMMKPVYIPKSERDYQNQLDIEEQELEEQRKKQEQIAKQQIKMIIMEQKK 326 Query: 244 XXXXXTIRSEQRGAQGEQKEGNINDVCTDDENDELEYEAWKLXXXXXXXXXXXXXXXXXX 303 + E++ Q + +ND DD + E E E WK+ Sbjct: 327 QQIIGNLGDEEQSDDSRQGKDFMND--DDDMDREFEREQWKIRELKRIRKDRDEQIKREK 384 Query: 304 XXLTLERLRNMTEDERRLEQRINPKVVTNKAVKGKYKFLQKYYHRGAFYLDKEEDVFKQD 363 ER MT +E E + + K K + F+QKYYH+GAFY D ++DV ++D Sbjct: 385 ELAEQERRSKMTNEEIIEEDK--RLGLHQKKEKRQIGFMQKYYHKGAFYQDDDDDVLQRD 442 Query: 364 FSGPTLDDHFDKTVLPKVMQVKK--FGRSGRTKYTHLVDQDTTEFDSAWS-NETSAARLT 420 F+ P ++ DK+VLP +MQ ++ FG+ G++KYTHL DQDTT FD + +++ ++ Sbjct: 443 FNMPVGEELLDKSVLPHLMQKRRGNFGKKGQSKYTHLTDQDTTNFDPKYRVDDSLQKKML 502 Query: 421 NFRGGMKQVFEKPSAERKH 439 + + G+K ++K+ Sbjct: 503 SKQAGLKAANNLDPRKKKY 521 >UniRef50_A0EFW2 Cluster: Chromosome undetermined scaffold_94, whole genome shotgun sequence; n=1; Paramecium tetraurelia|Rep: Chromosome undetermined scaffold_94, whole genome shotgun sequence - Paramecium tetraurelia Length = 410 Score = 99 bits (238), Expect = 1e-19 Identities = 53/182 (29%), Positives = 93/182 (51%), Gaps = 3/182 (1%) Query: 249 TIRSEQRGAQGEQKEGNINDVCTDDENDELEYEAWKLXXXXXXXXXXXXXXXXXXXXLTL 308 +++++ A E+ + + +D DE EY WK+ + Sbjct: 217 SVKADAAKAVNEESDDGKQKLNDEDTLDETEYALWKIRELKRIKQFNDEKNKYEIEKAEI 276 Query: 309 ERLRNMTEDERRLEQRINPKVVTNKAVKGKYKFLQKYYHRGAFYLDKEEDVFKQDFSGPT 368 +R RN+T+ +R E T K KY F+QKYY+ GAFY D ++ +F++D++ P Sbjct: 277 DRRRNLTDMQRIQEDFKLGSDKTKMEDKTKYVFMQKYYNTGAFYKDMDDPIFQRDYNLPV 336 Query: 369 LDDHFDKTVLPKVMQVKK--FGRSGRTKYTHLVDQDTTEFDSAWS-NETSAARLTNFRGG 425 +D + K LP+++Q ++ FG+ G +KYTHL +DTT FD + +++ + N + G Sbjct: 337 GEDLWRKDNLPQILQKRRGEFGKKGNSKYTHLTQEDTTNFDPTYQVDQSIRQKFLNQQAG 396 
Query: 426 MK 427 K Sbjct: 397 SK 398 >UniRef50_Q9P7H6 Cluster: Microfibrillar-associated protein familt protein; n=1; Schizosaccharomyces pombe|Rep: Microfibrillar-associated protein familt protein - Schizosaccharomyces pombe (Fission yeast) Length = 355 Score = 95.5 bits (227), Expect = 2e-18 Identities = 60/186 (32%), Positives = 94/186 (50%), Gaps = 10/186 (5%) Query: 260 EQKEGN--INDVC-TDDENDELEYEAWKLXXXXXXXXXXXXXXXXXXXXLTLERLRNMTE 316 E K N +ND+ TD + + EYE WKL + +E R M Sbjct: 171 ETKNNNELLNDIDDTDGIDPQSEYELWKLRHLLRKKRDKEKSLELEREKMAIEERRLMNS 230 Query: 317 DERRLEQRINPKVVTNKAVKGKYKFLQKYYHRGAFYLDKEEDVFKQDFSGPTLDDHFDKT 376 +ER + + + K +FLQKYYH+GAFY E+ V K+D+S T + +K Sbjct: 231 EEREAQDLKDAEASRRGKKKSSMQFLQKYYHKGAFY-QNEDIVSKRDYSEATEGEVLNKD 289 Query: 377 VLPKVMQVK--KFGRSGRTKYTHLVDQDTTEFDSAWSNETSAARLTNFR--GGMKQVFEK 432 +LPK MQ++ F ++G+T++THL ++DTT+ SAW + + N GG+ + Sbjct: 290 LLPKPMQIRGDLFAKAGQTRWTHLANEDTTKEGSAWYDPKNPILQKNLHRLGGLHS--DS 347 Query: 433 PSAERK 438 P ++RK Sbjct: 348 PLSKRK 353 >UniRef50_Q7S7V7 Cluster: Predicted protein; n=7; Pezizomycotina|Rep: Predicted protein - Neurospora crassa Length = 712 Score = 91.1 bits (216), Expect = 5e-17 Identities = 57/160 (35%), Positives = 82/160 (51%), Gaps = 9/160 (5%) Query: 260 EQKEGNINDVCTDDENDELEYEAWKLXXXXXXXXXXXXXXXXXXXXLTLERLRNMTEDER 319 + ++ I+D TDD + E EY AWKL +ER RN+TE+ER Sbjct: 407 DPEDDQIDD--TDDIDPEAEYAAWKLRELKRVRREREAIEAKEKELAEIERRRNLTEEER 464 Query: 320 RLEQRIN-PKVVTNKAVKGKYKFLQKYYHRGAFYLD--KEEDVFKQDFSGPTL-DDHFDK 375 R E + + K KGK ++QKY+H+GAFY D KE + K+D G DD ++ Sbjct: 465 RAEDEKHLQQQKEEKEGKGKMAYMQKYFHKGAFYQDESKEMGLDKRDIMGARFADDVKNR 524 Query: 376 TVLPKVMQVK---KFGRSGRTKYTHLVDQDTTEFDSAWSN 412 +LPK +Q++ K GR G TKY L +DT ++ N Sbjct: 525 ELLPKALQLRDMTKLGRKGATKYRDLKSEDTGQWGRLHDN 564 >UniRef50_Q4P301 Cluster: Putative uncharacterized protein; n=1; Ustilago maydis|Rep: Putative uncharacterized protein - Ustilago maydis (Smut fungus) Length = 500 Score = 90.2 bits (214), Expect = 9e-17 Identities = 67/248 (27%), 
Positives = 110/248 (44%), Gaps = 15/248 (6%) Query: 186 SDTEYTDSEEDTGPRVKPVFVRASERMTVAERERKMXXXXXXXXXXXXXXXXXXXXXXXX 245 S+ E +SE T P +KP+FV R T++ Sbjct: 160 SEGESEESETKTEPLLKPIFVPKQARTTISTDAAADQHQLELDAEAKAEAEAAVRRKEAH 219 Query: 246 XXXTIRSEQRGAQGEQKEGNINDVC-TDDENDELEYEAWKLXXXXXXXXXXXXXXXXXXX 304 +++ A+ E ++ + DV TD + E E++AW+ Sbjct: 220 DLAAAAIKRQLAEKEYQDTHQTDVDDTDGLDPEAEFQAWRERELARLRRDHEAILAKQRA 279 Query: 305 XLTLERLRNMTEDER-RL-EQRINPKVVTNKAVKGKYKFLQKYYHRGAFYLDKEEDVFKQ 362 ++ +++ E E+ RL +R K +G FLQKYYH+G+F+ D D+ K+ Sbjct: 280 QQEIDAFKSLPEAEKERLGRERAAQLRAEKKEQRGNPAFLQKYYHKGSFFQDM--DILKR 337 Query: 363 DFSGPTLDDHFDKTVLPKVMQVKKFGRSGRTKYTHLVDQDTTEFDSAWSNETSAARLTNF 422 D++ T D D + LPK+MQV+ +G GR+K+THL ++DT++ A RL Sbjct: 338 DYTEKTSKD-VDISKLPKMMQVRGYGEKGRSKWTHLANEDTSK---------GAMRLDVL 387 Query: 423 RGGMKQVF 430 +GG K F Sbjct: 388 QGGSKGCF 395 >UniRef50_A1DDP7 Cluster: Microfibrillar-associated protein MfaP1, putative; n=10; Eurotiomycetidae|Rep: Microfibrillar-associated protein MfaP1, putative - Neosartorya fischeri (strain ATCC 1020 / DSM 3700 / NRRL 181)(Aspergillus fischerianus (strain ATCC 1020 / DSM 3700 / NRRL 181)) Length = 512 Score = 84.2 bits (199), Expect = 6e-15 Identities = 51/160 (31%), Positives = 80/160 (50%), Gaps = 9/160 (5%) Query: 262 KEGNINDVCTDDENDELEYEAWKLXXXXXXXXXXXXXXXXXXXXLTLERLRNMTEDERRL 321 +EG I+D D + E EY AWKL +ER RN+T +ER Sbjct: 234 EEGAIDD--RDGVDPEAEYAAWKLRELKRIKREREAIEAAEKEREEIERRRNLTAEERER 291 Query: 322 EQR--INPKVVTNKAVKGKYKFLQKYYHRGAFYLD--KEEDVFKQDFSGPTLDDHFDKTV 377 E R I + +A +G+ F+Q+Y+H+GAF+ D + E + K++ G D + Sbjct: 292 EDREFIEKQKQEKEASRGQTGFMQRYFHKGAFFRDDLEREGLDKRNVMGQRFADDVARET 351 Query: 378 LPKVMQVK---KFGRSGRTKYTHLVDQDTTEFDSAWSNET 414 LP+ MQ++ K G+ GRT+Y L +DT F ++N + Sbjct: 352 LPEYMQIRDMTKLGKKGRTRYKDLRTEDTGRFGEGFNNRS 391 >UniRef50_Q0TWF3 Cluster: Putative uncharacterized protein; n=1; Phaeosphaeria nodorum|Rep: Putative uncharacterized protein - Phaeosphaeria nodorum (Septoria nodorum) Length = 420 Score = 83.0 bits 
(196), Expect = 1e-14 Identities = 61/234 (26%), Positives = 106/234 (45%), Gaps = 13/234 (5%) Query: 183 SGSSD---TEYTDSEEDTGPR--VKPVFVRASERMTVAERERKMXXXXXXXXXXXXXXXX 237 SGS D +E S ED P+ ++PVF++ +ER VA + Sbjct: 117 SGSEDEDESEQESSSEDEAPKKLLRPVFLKKNERNKVAAPVKSAEEAAAEEEARRQEQSR 176 Query: 238 XXXXXXXXXXXTIRSE-QRGAQGEQKEGNINDVC-TDDENDELEYEAWKLXXXXXXXXXX 295 ++ ++ + ++ +IN + TD + EY AWKL Sbjct: 177 ALVQEQVEQRIAEKAAGKKDWDDDVEDADINAIDDTDGLDAAAEYAAWKLRELKRIKRER 236 Query: 296 XXXXXXXXXXLTLERLRNMTEDERRLEQR-INPKVVTNKAVKGKYKFLQKYYHRGAFYLD 354 +ER RN++ ER E R + ++A +G+ +++QKY+H+GAF+ D Sbjct: 237 QAIEEAEAERAEIERRRNLSAAERDAEDRAFIDQQKEDRADRGEMQYMQKYFHKGAFFTD 296 Query: 355 --KEEDVFKQDFSGPTLDDHFDKTVLPKVMQVK---KFGRSGRTKYTHLVDQDT 403 KE V +++ +D ++ VLP+ MQ++ K G+ GRT+Y + +DT Sbjct: 297 ELKELGVDRRNLMNARFEDQTNRDVLPEYMQIRDMTKLGKKGRTRYKDMKTEDT 350 >UniRef50_A2ELS5 Cluster: Putative uncharacterized protein; n=1; Trichomonas vaginalis G3|Rep: Putative uncharacterized protein - Trichomonas vaginalis G3 Length = 231 Score = 77.4 bits (182), Expect = 7e-13 Identities = 38/81 (46%), Positives = 52/81 (64%), Gaps = 5/81 (6%) Query: 332 NKAVKGKYKFLQKYYHRGAFYLD---KEEDVFKQDFSGPTLDDHFDKTVLPKVMQVK--K 386 N KG KF QKYYH+GAF +D K E++ +D+ PT DD DKT LPK M V+ Sbjct: 126 NPKEKGHMKFYQKYYHKGAFSIDESEKAEELLNRDYLTPTGDDLLDKTALPKEMMVRGDD 185 Query: 387 FGRSGRTKYTHLVDQDTTEFD 407 + + G++K+THL ++DTT D Sbjct: 186 YNKRGKSKWTHLSNEDTTTVD 206 >UniRef50_UPI0000499156 Cluster: microfibril-associated protein; n=1; Entamoeba histolytica HM-1:IMSS|Rep: microfibril-associated protein - Entamoeba histolytica HM-1:IMSS Length = 242 Score = 76.2 bits (179), Expect = 2e-12 Identities = 60/221 (27%), Positives = 98/221 (44%), Gaps = 16/221 (7%) Query: 189 EYTDSEEDTGPRVKPVFVRASERMTVAER--ERKMXXXXXXXXXXXXXXXXXXXXXXXXX 246 EYT+ E + +P+FV + + E+ E+++ Sbjct: 28 EYTEEESNEEEDEEPIFVPMRKEIIKKEQIEEKEIKENVFPPYKQQTQDINTNEINKKLI 87 Query: 247 XXTIRSEQRGAQGEQKEGNINDVCTDDENDELEYEAWKLXXXXXXXXXXXXXXXXXXXXL 306 TI+ E Q E +E D+ + E+EAW+ Sbjct: 88 
QMTIQKELE--QKENEESTEEFSSGDEYGGKDEFEAWQQRELERLKKEYIEQLNYQHD-- 143 Query: 307 TLERLRNMTEDERRLEQRINPKVVTNKAVKGKYKFLQKYYHRGAFYLDKEE-DVFKQ--D 363 LE+L+ + E + + K + K+KF+QKYYH G+F+ D + DV K D Sbjct: 144 -LEKLKEICSTESQNHEE------EKKKERKKWKFMQKYYHIGSFFRDGGKWDVSKGNWD 196 Query: 364 FSGPTLDDHFDKTVLPKVMQVKKFGRSGRTKYTHLVDQDTT 404 F T DD DK++LPK++Q K +G+ GR+K+T+L ++DTT Sbjct: 197 FDAATGDDWMDKSLLPKILQTKDWGKKGRSKHTNLKEEDTT 237 >UniRef50_Q5C2C1 Cluster: SJCHGC04323 protein; n=1; Schistosoma japonicum|Rep: SJCHGC04323 protein - Schistosoma japonicum (Blood fluke) Length = 241 Score = 61.3 bits (142), Expect = 5e-08 Identities = 25/36 (69%), Positives = 32/36 (88%) Query: 11 IQSTAGAIPIRNEKGEISMQKVKVQRYISGKKPDYA 46 I STAGA+P++NEKG+ M KVKVQRY++GKKPD+A Sbjct: 10 IHSTAGAVPVKNEKGQFYMVKVKVQRYVAGKKPDFA 45 Score = 42.7 bits (96), Expect = 0.019 Identities = 16/36 (44%), Positives = 27/36 (75%) Query: 184 GSSDTEYTDSEEDTGPRVKPVFVRASERMTVAERER 219 G S+ EYT S+++ P++KPVFVRA +R+T+ + + Sbjct: 205 GYSEEEYTSSDDEVAPKLKPVFVRARDRITLQAKHK 240 >UniRef50_UPI000155C08B Cluster: PREDICTED: similar to Microfibrillar-associated protein 1, partial; n=1; Ornithorhynchus anatinus|Rep: PREDICTED: similar to Microfibrillar-associated protein 1, partial - Ornithorhynchus anatinus Length = 243 Score = 57.2 bits (132), Expect = 8e-07 Identities = 38/101 (37%), Positives = 55/101 (54%), Gaps = 12/101 (11%) Query: 26 EISMQKVKVQRYISGKKPDYAQGMXXXXXXXXXXFIEQQRPERKQVLPQIITRKEEHHSD 85 EISM+KVKV+RY+SGK+PDYA FI ++ + +++ P EE D Sbjct: 1 EISMEKVKVKRYVSGKRPDYAPMESSDEEDEEFQFI--KKAKEQEIEP------EEQEED 52 Query: 86 SEKEVDDPRLRRLRN-IAQSPPRRAEHKPEIIDAEPEAESE 125 S DPRLRRL+N I++ R +I++ E ES+ Sbjct: 53 SS---SDPRLRRLQNRISEDVEERLARHRKIVEPEVVGESD 90 >UniRef50_Q6CA21 Cluster: Similar to tr|Q8X0K0 Neurospora crassa Related to microfibril- associated protein; n=1; Yarrowia lipolytica|Rep: Similar to tr|Q8X0K0 Neurospora crassa Related to microfibril- associated protein - Yarrowia lipolytica (Candida lipolytica) Length = 333 Score = 56.8 bits 
(131), Expect = 1e-06 Identities = 46/159 (28%), Positives = 65/159 (40%), Gaps = 14/159 (8%) Query: 250 IRSEQRGAQGEQKEGNINDVC----TDDENDELEYEAWKLXXXXXXXXXXXXXXXXXXXX 305 IR+EQ + +E N + TDD++ E E E WK Sbjct: 180 IRAEQEAQRALYEEENAAEFGGVDDTDDQDVEKELEDWKAREKARLDRDRQELISREEAL 239 Query: 306 LTLERLRNMTEDERRLEQRINPKVVTNKAVKGKYKFLQKYYHRGAFYLDKEEDVFKQDFS 365 +R E N KGK K GAFY +E+D+ K+D S Sbjct: 240 AREDRKEEAGEQNEPSGDGSNHWEARGDDKKGKPK--------GAFY--QEQDILKRDLS 289 Query: 366 GPTLDDHFDKTVLPKVMQVKKFGRSGRTKYTHLVDQDTT 404 GP DD+ DKT +P+ + K G+ GR + L +QDT+ Sbjct: 290 GPLQDDYVDKTNVPQSLLGKNVGQKGRIMHKSLKEQDTS 328 >UniRef50_UPI0000DC125F Cluster: UPI0000DC125F related cluster; n=1; Rattus norvegicus|Rep: UPI0000DC125F UniRef100 entry - Rattus norvegicus Length = 274 Score = 55.6 bits (128), Expect = 2e-06 Identities = 24/36 (66%), Positives = 31/36 (86%) Query: 11 IQSTAGAIPIRNEKGEISMQKVKVQRYISGKKPDYA 46 IQSTAG +PIR+EK EISM+K +V+ Y+ GK+PDYA Sbjct: 18 IQSTAGTVPIRHEKCEISMEKGRVKLYVPGKRPDYA 53 >UniRef50_Q5KAL3 Cluster: Putative uncharacterized protein; n=1; Filobasidiella neoformans|Rep: Putative uncharacterized protein - Cryptococcus neoformans (Filobasidiella neoformans) Length = 513 Score = 54.8 bits (126), Expect = 4e-06 Identities = 44/167 (26%), Positives = 65/167 (38%), Gaps = 5/167 (2%) Query: 186 SDTEYTDSEEDTGPRVKPVFVRASERMTVAERERKMXXXXXXXXXXXXXXXXXXXXXXXX 245 +D+E EE P +PVFV + R AE+ Sbjct: 132 TDSEEESEEEVKKPMFRPVFVPKNARNMTAEKAA--AEAEEARKREEEAEEQRKLASKEL 189 Query: 246 XXXTIRSEQRGAQGEQKEGNINDVCTDDENDELEYEAWKLXXXXXXXXXXXXXXXXXXXX 305 +IR E + ++D TD + E E+EAW+ Sbjct: 190 AGESIRRELVEKEAADIVPEVDD--TDGLDVEAEFEAWRARELARLLREKQAQAAKDEEK 247 Query: 306 LTLERLRNMTEDERRLEQRINPKVVTNKAVKGKYKFLQKYYHRGAFY 352 +ER R M E E+RL++ + T + KG+ FLQKYYH+GAF+ Sbjct: 248 EEIERRRAMPE-EQRLKEDMEFAARTREKEKGQMGFLQKYYHKGAFH 293 >UniRef50_Q0E1X2 Cluster: Os02g0294000 protein; n=5; Oryza sativa|Rep: Os02g0294000 protein - Oryza sativa subsp. 
japonica (Rice) Length = 360 Score = 52.4 bits (120), Expect = 2e-05 Identities = 27/69 (39%), Positives = 44/69 (63%), Gaps = 13/69 (18%) Query: 327 PKVVTNKAVKGKYKFLQKYYHRGAFYLDKEE----------DVFKQDFSGPTLDDHFDKT 376 PK +T +K + +F+++YYH+G F+ D + +++++DFSGPT D D + Sbjct: 273 PKKMT---IKKQMRFMRRYYHKGCFFQDDADGAAQTAAGACEIYRRDFSGPTGLDKMDVS 329 Query: 377 VLPKVMQVK 385 VLPKVMQV+ Sbjct: 330 VLPKVMQVE 338 >UniRef50_A6S856 Cluster: Putative uncharacterized protein; n=1; Botryotinia fuckeliana B05.10|Rep: Putative uncharacterized protein - Botryotinia fuckeliana B05.10 Length = 243 Score = 37.9 bits (84), Expect = 0.53 Identities = 26/88 (29%), Positives = 39/88 (44%), Gaps = 3/88 (3%) Query: 260 EQKEGNINDVCTDDENDELEYEAWKLXXXXXXXXXXXXXXXXXXXXLTLERLRNMTEDER 319 E++E ++D TDD + E E AWKL +ER RN+TE+ER Sbjct: 157 EEEEDEVDD--TDDIDPEAELAAWKLRELKRIKRDREAIEEREKELEEVERRRNLTEEER 214 Query: 320 RLE-QRINPKVVTNKAVKGKYKFLQKYY 346 + E K + +GK +QK + Sbjct: 215 KKEDDEYIAKQKEEREGRGKMATMQKRF 242 >UniRef50_Q1YGP6 Cluster: Putative uncharacterized protein; n=1; Aurantimonas sp. SI85-9A1|Rep: Putative uncharacterized protein - Aurantimonas sp. 
SI85-9A1 Length = 188 Score = 36.3 bits (80), Expect = 1.6 Identities = 17/51 (33%), Positives = 32/51 (62%), Gaps = 1/51 (1%) Query: 61 IEQQRPERKQVLPQIITRKEEHHSDSEKEVDDPRLRR-LRNIAQSPPRRAE 110 + QQRP+RK L + + + +D ++DPRL++ LR+ A++ RR++ Sbjct: 137 VNQQRPDRKPKLRDLAPVEHQRVADMVARIEDPRLQKALRDFAETTLRRSK 187 >UniRef50_Q4WBH2 Cluster: C6 transcription factor, putative; n=1; Aspergillus fumigatus|Rep: C6 transcription factor, putative - Aspergillus fumigatus (Sartorya fumigata) Length = 761 Score = 35.5 bits (78), Expect = 2.8 Identities = 16/34 (47%), Positives = 21/34 (61%), Gaps = 2/34 (5%) Query: 63 QQRPERKQVLPQI--ITRKEEHHSDSEKEVDDPR 94 Q R E Q +PQ+ R E HHSDS+ +DDP+ Sbjct: 494 QSRSEADQKIPQLDDFLRLEAHHSDSDLNIDDPK 527 >UniRef50_Q5BX93 Cluster: SJCHGC03879 protein; n=1; Schistosoma japonicum|Rep: SJCHGC03879 protein - Schistosoma japonicum (Blood fluke) Length = 193 Score = 35.1 bits (77), Expect = 3.7 Identities = 22/60 (36%), Positives = 32/60 (53%), Gaps = 3/60 (5%) Query: 354 DKEEDVFKQDFSGPTLDDHFDKTVLPKVMQVKKFGRSGRTKYTHLVDQDTTEFDSAWSNE 413 D+EED + DFSG +D+ TV K+ + KK + KY L + + EFD S+E Sbjct: 37 DEEEDTNEYDFSG--MDNVDLSTVNSKINKTKK-STDSKKKYQKLDEDENNEFDDNQSSE 93 >UniRef50_UPI0000EBF1F9 Cluster: PREDICTED: similar to microfibrillar-associated protein 1, partial; n=2; Eutheria|Rep: PREDICTED: similar to microfibrillar-associated protein 1, partial - Bos taurus Length = 231 Score = 34.3 bits (75), Expect = 6.5 Identities = 27/102 (26%), Positives = 41/102 (40%), Gaps = 5/102 (4%) Query: 186 SDTEYTDSEEDT--GPRVKPVFVRASERMTVAERERKMXXXXXXXXXXXXXXXXXXXXXX 243 SDT T G ++P+ R +R+TV ERE + Sbjct: 118 SDTSEATEHAHTHLGGSLRPLCCR-KDRVTVQEREAEALKQKELEQEAKHMAEERRKYTL 176 Query: 244 XXXXXTIRSEQRGAQGEQKEGNINDVCTDDENDELEYEAWKL 285 + E + ++ ++ + TDDENDE EYEAWK+ Sbjct: 177 KIVEEETKKELE--ENKRSLAALDALNTDDENDEEEYEAWKV 216 >UniRef50_UPI0000D56E42 Cluster: PREDICTED: similar to CG32580-PA; n=2; Coelomata|Rep: PREDICTED: similar to CG32580-PA - Tribolium castaneum Length = 1766 Score = 33.9 bits (74), 
Expect = 8.6 Identities = 18/60 (30%), Positives = 33/60 (55%), Gaps = 4/60 (6%) Query: 67 ERKQVLPQIITRKEEHHSDSEKEVDDPRLRRLRNIAQSPPRRAEHKPEIIDAEPEAESEI 126 E+K+ P+++ KEE E++ ++P + + + P E +PEI++ E E E EI Sbjct: 459 EKKEEEPEVLEEKEEEPEIVEEKEEEPEIIEKK---EEEPEEKEEEPEIVE-EKEEEPEI 514 Score = 33.9 bits (74), Expect = 8.6 Identities = 19/60 (31%), Positives = 32/60 (53%), Gaps = 1/60 (1%) Query: 67 ERKQVLPQIITRKEEHHSDSEKEVDDPRLRRLRNIAQSPPRRAEHKPEIIDAEPEAESEI 126 E K+ P+II +KEE EK+ ++P + ++ E +PEI++ E E E +I Sbjct: 556 EEKKEEPKIIEKKEEEPEIIEKKKEEPEIIEIKKEEPEILEEKEEEPEILE-EKEEEPKI 614 >UniRef50_UPI00006CD58E Cluster: hypothetical protein TTHERM_00509090; n=1; Tetrahymena thermophila SB210|Rep: hypothetical protein TTHERM_00509090 - Tetrahymena thermophila SB210 Length = 1143 Score = 33.9 bits (74), Expect = 8.6 Identities = 17/42 (40%), Positives = 24/42 (57%), Gaps = 2/42 (4%) Query: 62 EQQRPERKQVLPQIITRKEEHHSDSEKEVDDPRLRRLRNIAQ 103 +QQ+P RK V P + RK+ +S KE + +L RN AQ Sbjct: 324 KQQKPSRKSVSPAVTARKQHQQENSNKE--ESKLNTSRNGAQ 363 >UniRef50_A2RV13 Cluster: Zgc:85787 protein; n=3; Danio rerio|Rep: Zgc:85787 protein - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 871 Score = 33.9 bits (74), Expect = 8.6 Identities = 19/64 (29%), Positives = 30/64 (46%), Gaps = 1/64 (1%) Query: 62 EQQRPERKQVLPQIITRKEEHHSDSEKEVDDPRLRRLRNIAQSPPRRAEHKPEIIDAEPE 121 ++++P R I RKEE + +E DP R L+ Q+ + +H E D + E Sbjct: 327 KREQPRRSIKKDYSIVRKEEEREEDRREDRDPPFRSLKEF-QNMSKEEDHDEEKEDDDEE 385 Query: 122 AESE 125 E E Sbjct: 386 EEEE 389 Database: uniref50 Posted date: Oct 5, 2007 11:19 AM Number of letters in database: 575,637,011 Number of sequences in database: 1,657,284 Lambda K H 0.312 0.130 0.365 Gapped Lambda K H 0.279 0.0580 0.190 Matrix: BLOSUM62 Gap Penalties: Existence: 9, Extension: 2 Number of Hits to DB: 381,529,422 Number of Sequences: 1657284 Number of extensions: 13214747 Number of successful extensions: 29962 Number of sequences better than 10.0: 35 Number of HSP's better than 10.0 without 
gapping: 28
Number of HSP's successfully gapped in prelim test: 7
Number of HSP's that attempted gapping in prelim test: 29819
Number of HSP's gapped (non-prelim): 82
length of query: 441
length of database: 575,637,011
effective HSP length: 103
effective length of query: 338
effective length of database: 404,936,759
effective search space: 136868624542
effective search space used: 136868624542
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 42 (21.9 bits)
S2: 74 (33.9 bits)