BLASTP 2.2.12 [Aug-07-2005]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= BGIBMGA001060-TA|BGIBMGA001060-PA|IPR000996|Clathrin light
chain
(210 letters)
Database: uniref50
1,657,284 sequences; 575,637,011 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
UniRef50_Q9VWA1 Cluster: Clathrin light chain; n=9; Endopterygot... 124 2e-27
UniRef50_Q2M0W4 Cluster: GA19975-PA; n=4; Endopterygota|Rep: GA1... 117 2e-25
UniRef50_P09496-2 Cluster: Isoform Non; n=45; Metazoa|Rep: Isofo... 102 6e-21
UniRef50_A7SZR4 Cluster: Predicted protein; n=1; Nematostella ve... 96 6e-19
UniRef50_UPI0000E4601B Cluster: PREDICTED: similar to clathryn l... 88 1e-16
UniRef50_P09496 Cluster: Clathrin light chain A; n=36; Euteleost... 87 2e-16
UniRef50_Q2PFR5 Cluster: Putative uncharacterized protein; n=7; ... 71 2e-11
UniRef50_Q5DH75 Cluster: SJCHGC00953 protein; n=1; Schistosoma j... 70 4e-11
UniRef50_P90961 Cluster: Clathrin light chain protein 1; n=2; Ca... 62 1e-08
UniRef50_Q4PEZ7 Cluster: Putative uncharacterized protein; n=1; ... 58 2e-07
UniRef50_A1CHU7 Cluster: Clathrin light chain; n=16; Pezizomycot... 52 1e-05
UniRef50_Q9USP6 Cluster: Clathrin light chain; n=1; Schizosaccha... 51 2e-05
UniRef50_Q5KC20 Cluster: Clathrin light chain, putative; n=1; Fi... 41 0.019
UniRef50_Q5DAY7 Cluster: SJCHGC02696 protein; n=1; Schistosoma j... 40 0.045
UniRef50_Q7SGU9 Cluster: Putative uncharacterized protein NCU032... 39 0.10
UniRef50_Q2GQI7 Cluster: Putative uncharacterized protein; n=1; ... 38 0.14
UniRef50_Q9W1R6 Cluster: CG11079-PA, isoform A; n=6; Diptera|Rep... 38 0.24
UniRef50_Q9W596 Cluster: Microtubule-associated protein futsch; ... 38 0.24
UniRef50_P41996 Cluster: Cytokinesis protein B0280.5 precursor; ... 37 0.42
UniRef50_Q57UL2 Cluster: Putative uncharacterized protein; n=3; ... 36 0.96
UniRef50_Q9XWR0 Cluster: Putative uncharacterized protein; n=2; ... 34 2.2
UniRef50_Q54TP5 Cluster: SAP DNA-binding domain-containing prote... 34 2.2
UniRef50_Q5KE83 Cluster: Putative uncharacterized protein; n=2; ... 34 2.2
UniRef50_A4S4M3 Cluster: Predicted protein; n=2; Ostreococcus|Re... 34 2.9
UniRef50_Q9ZHL0 Cluster: Large supernatant protein 2; n=4; Haemo... 33 3.9
UniRef50_Q6ALQ5 Cluster: Putative uncharacterized protein; n=1; ... 33 3.9
UniRef50_Q0JHZ7 Cluster: Os01g0834100 protein; n=6; Oryza sativa... 33 3.9
UniRef50_A2DRE4 Cluster: Putative uncharacterized protein; n=2; ... 33 3.9
UniRef50_Q9LUI2 Cluster: Centromere protein; n=3; Arabidopsis th... 33 5.1
UniRef50_Q00SK9 Cluster: Homology to unknown gene; n=1; Ostreoco... 33 5.1
UniRef50_A4H4M6 Cluster: Putative uncharacterized protein; n=1; ... 33 5.1
UniRef50_A2DUF4 Cluster: Clan SC, family S9, unassigned serine p... 33 5.1
UniRef50_Q5KPR4 Cluster: Transcriptional activator, putative; n=... 33 5.1
UniRef50_Q4WT36 Cluster: M protein repeat protein; n=6; Eurotiom... 33 5.1
UniRef50_Q4G3A5 Cluster: DNA-directed RNA polymerase subunit bet... 33 5.1
UniRef50_Q9BTC0 Cluster: Death-inducer obliterator 1; n=27; Eume... 33 5.1
UniRef50_UPI000150A032 Cluster: hypothetical protein TTHERM_0034... 33 6.8
UniRef50_UPI0000F1D35A Cluster: PREDICTED: hypothetical protein;... 33 6.8
UniRef50_Q89GN8 Cluster: UDP-N-acetylglucosamine 2-epimerase; n=... 33 6.8
UniRef50_Q1NWV7 Cluster: Putative uncharacterized protein; n=3; ... 33 6.8
UniRef50_A0VKY9 Cluster: Putative uncharacterized protein; n=1; ... 33 6.8
UniRef50_Q4Q843 Cluster: Glycoprotein 96-92, putative; n=5; Leis... 33 6.8
UniRef50_Q4E491 Cluster: Mucin-associated surface protein (MASP)... 33 6.8
UniRef50_Q4CZ55 Cluster: Histone deacetylase, putative; n=2; Try... 33 6.8
UniRef50_Q09EF7 Cluster: Putative uncharacterized protein; n=8; ... 33 6.8
UniRef50_A7AUT5 Cluster: Putative uncharacterized protein; n=1; ... 33 6.8
UniRef50_A4R5R2 Cluster: Putative uncharacterized protein; n=1; ... 33 6.8
UniRef50_UPI0001554761 Cluster: PREDICTED: similar to retinoic a... 32 9.0
UniRef50_Q1LXA8 Cluster: Novel protein similar to human extracel... 32 9.0
UniRef50_Q6MKC8 Cluster: Putative signal peptide protein contain... 32 9.0
UniRef50_Q9VC00 Cluster: CG13648-PA; n=1; Drosophila melanogaste... 32 9.0
UniRef50_Q7R414 Cluster: GLP_68_19620_20219; n=1; Giardia lambli... 32 9.0
UniRef50_Q55F94 Cluster: Putative dynamin family protein; n=1; D... 32 9.0
UniRef50_Q22515 Cluster: Putative uncharacterized protein; n=3; ... 32 9.0
UniRef50_A2EGE9 Cluster: Putative uncharacterized protein; n=1; ... 32 9.0
UniRef50_Q9LUZ9 Cluster: RING-H2 finger protein ATL5O; n=1; Arab... 32 9.0
>UniRef50_Q9VWA1 Cluster: Clathrin light chain; n=9;
Endopterygota|Rep: Clathrin light chain - Drosophila
melanogaster (Fruit fly)
Length = 219
Score = 124 bits (299), Expect = 2e-27
Identities = 85/216 (39%), Positives = 113/216 (52%), Gaps = 29/216 (13%)
Query: 3 DFGDSFVEPE-VDPAADFLAREQNQLAGLEDELE--TSAPPPAISTS---------TNGF 50
DFGD F E VDPAA+FLAREQ+ L LE E+ +++ PPA ST T
Sbjct: 2 DFGDDFAAKEDVDPAAEFLAREQSALGDLEAEITGGSASAPPAASTDEGLGELLGGTASE 61
Query: 51 DDFVEV-------PSASAFDANGLLDDAPLGSTPTTVFKQEREEPEKIKIWREEQXXXXX 103
D + S +F+ G + P+G + REEPEKI+ WREEQ
Sbjct: 62 GDLLSAGGTGGLESSTGSFEVIGGESNEPVGISGPP---PSREEPEKIRKWREEQKQRLE 118
Query: 104 XXXXXXXXXXXXMLQIAKKELEDWYKSHEEQISKTKAANRESAKNAERAQARGSESSVEE 163
+ Q +KKEL+DW + E ISKTK A+R NAE+ A ++E
Sbjct: 119 EKDIEEERKKEELRQQSKKELDDWLRQIGESISKTKLASR----NAEKQAATLENGTIEP 174
Query: 164 GNEWARVSELCDFGP---RRGRDVARLRSIVLQLKQ 196
G EW R+++LCDF P + G+DV+R+RSI L LKQ
Sbjct: 175 GTEWERIAKLCDFNPKVNKAGKDVSRMRSIYLHLKQ 210
>UniRef50_Q2M0W4 Cluster: GA19975-PA; n=4; Endopterygota|Rep:
GA19975-PA - Drosophila pseudoobscura (Fruit fly)
Length = 222
Score = 117 bits (281), Expect = 2e-25
Identities = 83/216 (38%), Positives = 110/216 (50%), Gaps = 26/216 (12%)
Query: 3 DFGDSF-VEPEVDPAADFLAREQNQLAGLEDEL---ETSAPPPAISTSTNGFD----DFV 54
DFGD F ++ EVDPAA+FLAREQ+ L LE E+ +AP A T G
Sbjct: 2 DFGDDFALKEEVDPAAEFLAREQSALGDLEAEITGGSGTAPDAATVDDTLGLGLGGLSGA 61
Query: 55 EVPSASAFDANGLLDDAPLGS---------TPTTVF--KQEREEPEKIKIWREEQXXXXX 103
S A G ++ GS P + REEPEKI+ WREEQ
Sbjct: 62 GAELGSELSATGGGLESSTGSFEVIGGESNEPVGISGPPPSREEPEKIRKWREEQKQRLE 121
Query: 104 XXXXXXXXXXXXMLQIAKKELEDWYKSHEEQISKTKAANRESAKNAERAQARGSESSVEE 163
+ Q +KKEL+DW + E ISKTK +S++NAE+ A ++E
Sbjct: 122 EKDVEEERKKEELRQQSKKELDDWLRQIGESISKTK----QSSRNAEKQAASLENGTIEP 177
Query: 164 GNEWARVSELCDFGP---RRGRDVARLRSIVLQLKQ 196
G EW R+++LCDF P + G+DV+R+RSI L LKQ
Sbjct: 178 GTEWERIAKLCDFNPKVNKAGKDVSRMRSIYLHLKQ 213
>UniRef50_P09496-2 Cluster: Isoform Non; n=45; Metazoa|Rep: Isoform
Non - Homo sapiens (Human)
Length = 218
Score = 102 bits (245), Expect = 6e-21
Identities = 70/196 (35%), Positives = 102/196 (52%), Gaps = 19/196 (9%)
Query: 12 EVDPAADFLAREQNQLAGLEDE-----LETSAPPPAISTSTNGFDDFVE-VPSASAF-DA 64
E DPAA FLA++++++AG+E++ L+ AP P G D V+ V + + ++
Sbjct: 28 EEDPAAAFLAQQESEIAGIENDEAFAILDGGAPGPQPHGEPPGGPDAVDGVMNGEYYQES 87
Query: 65 NGLLDDAPLGSTPTTVFKQEREEPEKIKIWREEQXXXXXXXXXXXXXXXXXMLQIAKKEL 124
NG D S + + EPE I+ WREEQ + A KEL
Sbjct: 88 NGPTDSYAAISQVDRL----QSEPESIRKWREEQMERLEALDANSRKQEAEWKEKAIKEL 143
Query: 125 EDWYKSHEEQISKTKAANRESAKNAERAQARGSESSVEEGNEWARVSELCDFGP---RRG 181
E+WY +EQ+ KTKA NR AE A + S G EW RV+ LCDF P ++
Sbjct: 144 EEWYARQDEQLQKTKANNRA----AEEAFVNDIDES-SPGTEWERVARLCDFNPKSSKQA 198
Query: 182 RDVARLRSIVLQLKQA 197
+DV+R+RS+++ LKQA
Sbjct: 199 KDVSRMRSVLISLKQA 214
>UniRef50_A7SZR4 Cluster: Predicted protein; n=1; Nematostella
vectensis|Rep: Predicted protein - Nematostella
vectensis
Length = 198
Score = 95.9 bits (228), Expect = 6e-19
Identities = 72/199 (36%), Positives = 95/199 (47%), Gaps = 18/199 (9%)
Query: 2 DDFGDSFVEPEVDPAADFLAREQNQLAGLEDELETSAPPPAISTSTNGFD-DFVEVPSAS 60
D F D E VDPAA+FLAREQ+ LA L ++L P GFD E P +
Sbjct: 8 DTFSD---EQAVDPAAEFLAREQDDLAELGEDL----GGPNSDVEGVGFDMSGGEEPVMN 60
Query: 61 AFDANGLLDDAPLGSTPTTVFKQEREEPEKIKIWREEQXXXXXXXXXXXXXXXXXMLQIA 120
F+ G + PT V ++ E E ++ WREE+ + A
Sbjct: 61 GFEDEG-ESSVSQQTAPTPVTRE--IEHESVRKWREEKAAQLEKMDEEEKAEIEEWREQA 117
Query: 121 KKELEDWYKSHEEQISKTKAANRESAKNAERAQARGSESSVEEGNEWARVSELCDFGP-- 178
KEL DWY EQ+ KTK +NR + E A ++S G EW +V CDF P
Sbjct: 118 HKELNDWYDRRNEQLGKTKNSNR---ADEESFVAERDDTST-PGTEWEKVCRACDFNPKA 173
Query: 179 -RRGRDVARLRSIVLQLKQ 196
+ +DV+R+RSI LQLKQ
Sbjct: 174 TKNTKDVSRMRSIFLQLKQ 192
>UniRef50_UPI0000E4601B Cluster: PREDICTED: similar to clathryn
light chain (LCA3), partial; n=4; Strongylocentrotus
purpuratus|Rep: PREDICTED: similar to clathryn light
chain (LCA3), partial - Strongylocentrotus purpuratus
Length = 169
Score = 88.2 bits (209), Expect = 1e-16
Identities = 48/119 (40%), Positives = 63/119 (52%), Gaps = 8/119 (6%)
Query: 87 EPEKIKIWREEQXXXXXXXXXXXXXXXXXMLQIAKKELEDWYKSHEEQISKTKAANRESA 146
EPEKI++WREEQ AKKE+ DWY EEQ K KA+NR
Sbjct: 55 EPEKIRLWREEQKEILEKKDEEADELEVEWKVSAKKEISDWYARREEQGVKAKASNRA-- 112
Query: 147 KNAERAQARGSESSVEEGNEWARVSELCDFGPRRG---RDVARLRSIVLQLKQAGSRPN 202
AE A + + G EW R++ LCDF P+ +D+ R RSI+L LKQ+G +P+
Sbjct: 113 --AEEAFIQ-ERDEITPGQEWERIARLCDFNPKNNKNLKDITRFRSILLHLKQSGVQPS 168
>UniRef50_P09496 Cluster: Clathrin light chain A; n=36;
Euteleostomi|Rep: Clathrin light chain A - Homo sapiens
(Human)
Length = 248
Score = 87.4 bits (207), Expect = 2e-16
Identities = 69/221 (31%), Positives = 107/221 (48%), Gaps = 39/221 (17%)
Query: 12 EVDPAADFLAREQNQLAGLEDE-----LETSAPPPAISTSTNGFDDFVE-VPSASAF-DA 64
E DPAA FLA++++++AG+E++ L+ AP P G D V+ V + + ++
Sbjct: 28 EEDPAAAFLAQQESEIAGIENDEAFAILDGGAPGPQPHGEPPGGPDAVDGVMNGEYYQES 87
Query: 65 NGLLDDAPLGSTPTTVFKQEREEPEKIKIWREEQXXXXXXXXXXXXXXXXXMLQIAKKEL 124
NG D S + + EPE I+ WREEQ + A KEL
Sbjct: 88 NGPTDSYAAISQVDRL----QSEPESIRKWREEQMERLEALDANSRKQEAEWKEKAIKEL 143
Query: 125 EDWYKSHEEQISKTKAANRESAK----------------------NAERAQARGSESSVE 162
E+WY +EQ+ KTKA NR + + + E+A + ++
Sbjct: 144 EEWYARQDEQLQKTKANNRVADEAFYKQPFADVIGYVTNINHPCYSLEQAAEEAFVNDID 203
Query: 163 E---GNEWARVSELCDFGP---RRGRDVARLRSIVLQLKQA 197
E G EW RV+ LCDF P ++ +DV+R+RS+++ LKQA
Sbjct: 204 ESSPGTEWERVARLCDFNPKSSKQAKDVSRMRSVLISLKQA 244
>UniRef50_Q2PFR5 Cluster: Putative uncharacterized protein; n=7;
Eutheria|Rep: Putative uncharacterized protein - Macaca
fascicularis (Crab eating macaque) (Cynomolgus monkey)
Length = 101
Score = 71.3 bits (167), Expect = 2e-11
Identities = 38/81 (46%), Positives = 51/81 (62%), Gaps = 8/81 (9%)
Query: 120 AKKELEDWYKSHEEQISKTKAANRESAKNAERAQARGSESSVEEGNEWARVSELCDFGP- 178
A KELE+WY +EQ+ KTKA NR AE A + S G EW RV+ LCDF P
Sbjct: 22 AIKELEEWYARQDEQLQKTKANNRA----AEEAFVNDIDES-SPGTEWERVARLCDFNPK 76
Query: 179 --RRGRDVARLRSIVLQLKQA 197
++ +DV+R+RS+++ LKQA
Sbjct: 77 SSKQAKDVSRMRSVLISLKQA 97
>UniRef50_Q5DH75 Cluster: SJCHGC00953 protein; n=1; Schistosoma
japonicum|Rep: SJCHGC00953 protein - Schistosoma
japonicum (Blood fluke)
Length = 201
Score = 70.1 bits (164), Expect = 4e-11
Identities = 52/196 (26%), Positives = 85/196 (43%), Gaps = 12/196 (6%)
Query: 12 EVDPAADFLAREQNQLAGLEDELETSAPPPAIST-STNGFDD--FVEVPSASAFDANGLL 68
+ DP +DFLAREQ L L D+ + T S D +V + S + + +
Sbjct: 3 DFDPVSDFLAREQEALGDLGDDFKLDEGTAGTETRSYEPVSDASYVFLNGQSGMNGSHVG 62
Query: 69 D-----DAPLGSTPTTVFKQEREEP-EKIKIWREEQXXXXXXXXXXXXXXXXXMLQIAKK 122
D D + + + E + + WREE +++I KK
Sbjct: 63 DYLTKPDCTVSDSDHDFNRVESDSNLSSTESWREEFNKRIKTKDAEEEKKCIELMEIGKK 122
Query: 123 ELEDWYKSHEEQISKTKAANRESAKNAE-RAQARGSESSVEEGNEWARVSELCDF--GPR 179
EL DWY+++ +Q+ RE + + G++ SV++ W + LCDF P+
Sbjct: 123 ELNDWYRNYHQQLETRSRELREKKSDLNGQLMNGGNKPSVKDSAVWESICNLCDFQSKPK 182
Query: 180 RGRDVARLRSIVLQLK 195
D +R+RSI+L LK
Sbjct: 183 TAIDTSRMRSILLSLK 198
>UniRef50_P90961 Cluster: Clathrin light chain protein 1; n=2;
Caenorhabditis|Rep: Clathrin light chain protein 1 -
Caenorhabditis elegans
Length = 226
Score = 62.1 bits (144), Expect = 1e-08
Identities = 64/227 (28%), Positives = 98/227 (43%), Gaps = 46/227 (20%)
Query: 14 DPAADFLAREQNQLAGLE-----------DELETSAPPPAISTSTNGF----DDFVEV-- 56
DP ADFLAREQN A + D E AP PA+ D+ V
Sbjct: 3 DPVADFLAREQNLFADFDGAPPAAAAANPDAPEADAPAPALDDDFGDLQIAGDEPPPVVH 62
Query: 57 PSASAFDANGLLDD---APL-------------------GST-PTTVFKQ-EREEPEKIK 92
P+ S D +GL+DD AP GS P+ + R E EKI+
Sbjct: 63 PTDSGVDLDGLVDDNAAAPAIVVPAVEPMVNGNHSASSGGSKGPSPILSTVPRIEAEKIR 122
Query: 93 IWREEQXXXXXXXXXXXXXXXXXMLQIAKKELEDWYKSHEEQISKTKAANRESAKNAERA 152
+W+ +Q + AKKELE+WYK E+ + + N ++ K+ +
Sbjct: 123 LWKAQQEQLLSKKDEAEEKKKIELRANAKKELEEWYKQREKTLQLSHDENLKNEKSNQEL 182
Query: 153 QARGSESSVEEGNEWARVSELCD-FGPRRGRDVARLRSIVLQLKQAG 198
A+ + +W V++L D + G+D++RL++++ LK AG
Sbjct: 183 FAKQQDGDA----QWETVNKLVDQQKSKSGKDLSRLKTLLAGLKHAG 225
>UniRef50_Q4PEZ7 Cluster: Putative uncharacterized protein; n=1;
Ustilago maydis|Rep: Putative uncharacterized protein -
Ustilago maydis (Smut fungus)
Length = 291
Score = 57.6 bits (133), Expect = 2e-07
Identities = 35/146 (23%), Positives = 68/146 (46%), Gaps = 7/146 (4%)
Query: 44 STSTNG---FDDFVEVPSASAFDANGLLDDAPLGSTPTTVFKQEREEPEKIKIWREEQXX 100
S +TNG DD ++ SA A + ++ + S P+ +++ EEPE ++ WRE Q
Sbjct: 119 SANTNGGSYHDDADDLMSARAAAPSRVIPTSVPASQPSYSYEEPTEEPEAVRQWRETQKD 178
Query: 101 XXXXXXXXXXXXXXXMLQIAKKELEDWYKSHEEQISKTKAANRESAKNAERAQARGSESS 160
+ A+++++++Y + + K AAN+E+ + R
Sbjct: 179 AIAKRDAEDERKKAEAISKAEQDIDNFYAEYNAKKEKNIAANKENEAKFHEERTR----E 234
Query: 161 VEEGNEWARVSELCDFGPRRGRDVAR 186
+ EG W RV+++ D + + +AR
Sbjct: 235 LAEGTTWDRVTKMVDLKNSQSKTIAR 260
Score = 34.7 bits (76), Expect = 1.7
Identities = 30/77 (38%), Positives = 38/77 (49%), Gaps = 9/77 (11%)
Query: 3 DFGDSFVEPEVDPAADFLAREQNQLAGLEDELETSAPPPAISTSTNGFDDFVEVPSASAF 62
DFG+S +P DP ADFLARE+ L D+ + + DDF SA+AF
Sbjct: 4 DFGES--KPS-DPTADFLAREREAAGVLSDDADLFGSSNTAGIGASA-DDFER--SATAF 57
Query: 63 DANGLLDDA-PLGSTPT 78
A L DD P S P+
Sbjct: 58 PA--LDDDGLPAASAPS 72
>UniRef50_A1CHU7 Cluster: Clathrin light chain; n=16;
Pezizomycotina|Rep: Clathrin light chain - Aspergillus
clavatus
Length = 246
Score = 51.6 bits (118), Expect = 1e-05
Identities = 34/137 (24%), Positives = 60/137 (43%), Gaps = 10/137 (7%)
Query: 71 APLGSTPTTVFKQEREEPEKIKIWREEQXXXXXXXXXXXXXXXXXMLQIAKKELEDWYKS 130
AP T + + EEPE ++ WRE + + A+++++D+Y S
Sbjct: 109 APFPPTGYASYGEPSEEPEPVREWRERRDAEITRRAEISNEKKEATINKAREDIDDFYVS 168
Query: 131 HEEQISKTKAANRESAKNAERAQARGSESSVEEGNEWARVSELCDF------GPRRGRDV 184
+ + K +A R +AE+ A ++S G W R+++L D G G
Sbjct: 169 YNNKTDKLRAQTR---ADAEQFLANREDTSA-GGTSWERIAKLVDISGKGTKGGASGSGK 224
Query: 185 ARLRSIVLQLKQAGSRP 201
R R ++L LK+ + P
Sbjct: 225 ERFRELLLDLKKDQNAP 241
>UniRef50_Q9USP6 Cluster: Clathrin light chain; n=1;
Schizosaccharomyces pombe|Rep: Clathrin light chain -
Schizosaccharomyces pombe (Fission yeast)
Length = 229
Score = 50.8 bits (116), Expect = 2e-05
Identities = 50/216 (23%), Positives = 88/216 (40%), Gaps = 19/216 (8%)
Query: 1 MDDFGDSFVEPEVDPA---ADFLAREQNQLAGLEDELETSAPPPAISTSTNGFDD----F 53
++DF D V VD + DFL RE+ L + ET A+ N + F
Sbjct: 7 LEDFDDGLVTAPVDDSKNNTDFLEREKLALGEDAGQFETPEDKDALLNFENDSEAEQTRF 66
Query: 54 VE--VPSASAFDANGLLD--DAP-LGSTPTTVFKQEREEPEKIKIWREEQXXXXXXXXXX 108
+ P + A+G AP +G + E +PE ++ W+E+Q
Sbjct: 67 EQNFPPIDAEMQASGTFSAPKAPYMGQAEVHPPEDESGDPEPVRKWKEDQMKRIQERDES 126
Query: 109 XXXXXXXMLQIAKKELEDWYKSHEEQISKTKAANRESAKNAERAQARGSESSVEEGNEWA 168
++ A+K ++D+Y++ ++ K A +R K E+ +ES W
Sbjct: 127 SKKLRESNIEKARKAIDDFYENFNDKRDKVIAKSR---KEQEKL-LEENESKSTGTTSWE 182
Query: 169 RVSELCDFGPR---RGRDVARLRSIVLQLKQAGSRP 201
R+ +L D + GR R R +++ L + + P
Sbjct: 183 RILKLIDLSDKPEAHGRSTERFRELLISLAKDSNAP 218
>UniRef50_Q5KC20 Cluster: Clathrin light chain, putative; n=1;
Filobasidiella neoformans|Rep: Clathrin light chain,
putative - Cryptococcus neoformans (Filobasidiella
neoformans)
Length = 288
Score = 41.1 bits (92), Expect = 0.019
Identities = 30/125 (24%), Positives = 51/125 (40%), Gaps = 14/125 (11%)
Query: 86 EEPEKIKIWREEQXXXXXXXXXXXXXXXXXMLQIAKKELEDWYKSHEEQISKTKAANRES 145
E+ E IK W+ Q M A+K ++ +Y E+ +K K N
Sbjct: 161 EDTEPIKAWKARQAEEIQKRDEADKKRRDEMSDRAEKAIDQFY----EEYNKMKEKNIRE 216
Query: 146 AKNAERAQARGSESSVEEGNEWARVSELCDFGPRR----------GRDVARLRSIVLQLK 195
K +E + V +G W R+S+L + G D+AR++ I+L L+
Sbjct: 217 NKESEAEFLEKLQEGVAKGTAWERISDLISLENSQSKTIRPSVPGGSDLARMKEILLALR 276
Query: 196 QAGSR 200
+ G +
Sbjct: 277 REGDK 281
>UniRef50_Q5DAY7 Cluster: SJCHGC02696 protein; n=1; Schistosoma
japonicum|Rep: SJCHGC02696 protein - Schistosoma
japonicum (Blood fluke)
Length = 189
Score = 39.9 bits (89), Expect = 0.045
Identities = 29/133 (21%), Positives = 52/133 (39%), Gaps = 7/133 (5%)
Query: 51 DDFVEVPSASAFDANGLLDDAPLGSTPTTVFKQEREEPEKIKIWREEQXXXXXXXXXXXX 110
+DF+ + L+ P G + + +EPE I+ W
Sbjct: 8 NDFLAREKKDLYGLESDLNFLPNGDNNGSEHPKVNDEPECIRRWAISFQKRIDVKDDAER 67
Query: 111 XXXXXMLQIAKKELEDWYKSHEEQISKTKAANRESAKNAERAQARGSESS-----VEEGN 165
+ + K+L DW + + +S+ + +R K AQ + +E+S V+ G
Sbjct: 68 KSLSVLEEQGLKDLHDWALQYRDSLSRGQKLSRAHDKELRSAQKKAAEASVGMAQVDAGQ 127
Query: 166 E--WARVSELCDF 176
W RV +LC+F
Sbjct: 128 AVLWERVCQLCNF 140
>UniRef50_Q7SGU9 Cluster: Putative uncharacterized protein
NCU03220.1; n=1; Neurospora crassa|Rep: Putative
uncharacterized protein NCU03220.1 - Neurospora crassa
Length = 1627
Score = 38.7 bits (86), Expect = 0.10
Identities = 22/58 (37%), Positives = 30/58 (51%), Gaps = 1/58 (1%)
Query: 22 REQNQLAGLEDELETSAPPPAISTSTNGFDDFVEVPSASAFDANGLLDDAPLGSTPTT 79
R QNQ L+ L+T PPPA+ + G P+A A N L+ +P GS+P T
Sbjct: 108 RSQNQDYDLDYALDTGPPPPAVPEAPTGPPMSYRYPAAMAEQQNSYLNSSP-GSSPLT 164
>UniRef50_Q2GQI7 Cluster: Putative uncharacterized protein; n=1;
Chaetomium globosum|Rep: Putative uncharacterized
protein - Chaetomium globosum (Soil fungus)
Length = 469
Score = 38.3 bits (85), Expect = 0.14
Identities = 30/73 (41%), Positives = 36/73 (49%), Gaps = 6/73 (8%)
Query: 6 DSFVEPEVDPAADFLAREQNQLAGLEDELETSAPPPAISTSTNGFDDFVEVPSASAFDAN 65
+ F P D AA F AR Q A + L APP I T G+ V P+ SA+ N
Sbjct: 311 ECFKVPAADIAAFFEARATQQAA---NSLNLPAPPSPIETHFLGYAMRVSCPNGSAY--N 365
Query: 66 GLL-DDAPLGSTP 77
GLL D+A L S P
Sbjct: 366 GLLGDNAALSSLP 378
>UniRef50_Q9W1R6 Cluster: CG11079-PA, isoform A; n=6; Diptera|Rep:
CG11079-PA, isoform A - Drosophila melanogaster (Fruit
fly)
Length = 318
Score = 37.5 bits (83), Expect = 0.24
Identities = 24/75 (32%), Positives = 37/75 (49%), Gaps = 4/75 (5%)
Query: 120 AKKELEDWYKSHEEQI----SKTKAANRESAKNAERAQARGSESSVEEGNEWARVSELCD 175
A +E E +YK +EQ+ +KT+ E+ KNA + +R SV + R +L
Sbjct: 33 AAREEEFFYKQQKEQLKNLKTKTEPKAPEAPKNAAKRNSRQISVSVSAADNGRRSGDLKT 92
Query: 176 FGPRRGRDVARLRSI 190
GPRR + R S+
Sbjct: 93 HGPRRQMETVRTGSL 107
>UniRef50_Q9W596 Cluster: Microtubule-associated protein futsch; n=6;
melanogaster subgroup|Rep: Microtubule-associated protein
futsch - Drosophila melanogaster (Fruit fly)
Length = 5412
Score = 37.5 bits (83), Expect = 0.24
Identities = 43/202 (21%), Positives = 84/202 (41%), Gaps = 14/202 (6%)
Query: 21 AREQNQLAGLEDELETSAPPPAISTSTNGFDDFVEV----PSASAFDANGLLDD-APL-- 73
+R ++ G ETS P ++ +G DD E+ + + +A + D+ +PL
Sbjct: 1887 SRRESVKDGAAQSRETSRPASVAESAKDGADDLKELSRPESTTQSKEAGSIKDEKSPLAS 1946
Query: 74 --GSTPTTVFKQEREEPEKIK-IWREEQXXXXXXXXXXXXXXXXXMLQIAKKELE-DWYK 129
S P +V + ++E EK K R E + + K E E +
Sbjct: 1947 EEASRPASVAESVKDEAEKSKEESRRESVAEKSPLPSKEASRPASVAESIKDEAEKSKEE 2006
Query: 130 SHEEQIS-KTKAANRESAKNAERAQARGSESSVEEGNEWARVSELCDFGPRRGRDVARLR 188
S E ++ K+ ++E+++ A A++ E+ E+ E +R + + P ++ +R
Sbjct: 2007 SRRESVAEKSPLPSKEASRPASVAESIKDEA--EKSKEESRRESVAEKSPLPSKEASRPA 2064
Query: 189 SIVLQLKQAGSRPNYPSRATKV 210
S+ +K + SR V
Sbjct: 2065 SVAESIKDEAEKSKEESRRESV 2086
>UniRef50_P41996 Cluster: Cytokinesis protein B0280.5 precursor;
n=2; Caenorhabditis elegans|Rep: Cytokinesis protein
B0280.5 precursor - Caenorhabditis elegans
Length = 524
Score = 36.7 bits (81), Expect = 0.42
Identities = 18/50 (36%), Positives = 25/50 (50%)
Query: 23 EQNQLAGLEDELETSAPPPAISTSTNGFDDFVEVPSASAFDANGLLDDAP 72
EQNQ GL++ L P + + NG D E PS+ F+ L+ D P
Sbjct: 399 EQNQCVGLDNGLHAIGCSPRVLSCQNGHVDIFECPSSLVFNDQSLICDYP 448
>UniRef50_Q57UL2 Cluster: Putative uncharacterized protein; n=3;
Trypanosoma|Rep: Putative uncharacterized protein -
Trypanosoma brucei
Length = 941
Score = 35.5 bits (78), Expect = 0.96
Identities = 21/81 (25%), Positives = 42/81 (51%)
Query: 117 LQIAKKELEDWYKSHEEQISKTKAANRESAKNAERAQARGSESSVEEGNEWARVSELCDF 176
LQ A +E+E YK+ E ++ + A +E+A+ + ++ ES V R EL +
Sbjct: 625 LQHALEEIEARYKACERRLDDERQALKETAEKLQESRDAAEESKVAHRRTQTRRQELEEE 684
Query: 177 GPRRGRDVARLRSIVLQLKQA 197
R+ ++A + + +L++A
Sbjct: 685 LQRKNSELAAVGERITELEEA 705
>UniRef50_Q9XWR0 Cluster: Putative uncharacterized protein; n=2;
Caenorhabditis|Rep: Putative uncharacterized protein -
Caenorhabditis elegans
Length = 1464
Score = 34.3 bits (75), Expect = 2.2
Identities = 23/84 (27%), Positives = 42/84 (50%), Gaps = 4/84 (4%)
Query: 117 LQIAKKELEDWYKSHEEQISKTKAANRESAKNA-ERAQARGSESSVEEGNEWARVSELCD 175
L+ A + ++W K HEE I ++K K A +RA+A E+ +++ S +
Sbjct: 1313 LKKALADCDEWKKKHEESIVESKTEILMERKRAMDRAEACEKETELKQSRMATIESARME 1372
Query: 176 FGPRRGR---DVARLRSIVLQLKQ 196
G R ++ R R I++QL++
Sbjct: 1373 LGGELARTQSELDRCRQIIIQLEE 1396
>UniRef50_Q54TP5 Cluster: SAP DNA-binding domain-containing protein;
n=1; Dictyostelium discoideum AX4|Rep: SAP DNA-binding
domain-containing protein - Dictyostelium discoideum AX4
Length = 1216
Score = 34.3 bits (75), Expect = 2.2
Identities = 16/56 (28%), Positives = 33/56 (58%)
Query: 117 LQIAKKELEDWYKSHEEQISKTKAANRESAKNAERAQARGSESSVEEGNEWARVSE 172
LQI + + +Y+++++ K K + +ES K +E+ + + ES E+ +E + SE
Sbjct: 274 LQILEFLINKYYRNNDDNNKKEKESEKESEKESEKEKEKEKESEKEKESEKEKESE 329
>UniRef50_Q5KE83 Cluster: Putative uncharacterized protein; n=2;
Filobasidiella neoformans|Rep: Putative uncharacterized
protein - Cryptococcus neoformans (Filobasidiella
neoformans)
Length = 1057
Score = 34.3 bits (75), Expect = 2.2
Identities = 22/84 (26%), Positives = 45/84 (53%), Gaps = 4/84 (4%)
Query: 117 LQIAKKELEDWYKSHEEQISKTKAANRESAKNAERAQARGSESSVEEGNEWARVSELCDF 176
L++ +++ED K H +Q+ + K AN E +N R Q ++ + +EW S++ +
Sbjct: 744 LEVESRDVED-LKYHFDQLKQVKTANEEEIENLLR-QIEKLKAERRQEDEWK--SKVEEL 799
Query: 177 GPRRGRDVARLRSIVLQLKQAGSR 200
R + R +++ L+L++ SR
Sbjct: 800 SKRVDMEGMRRQNVELELEEVRSR 823
>UniRef50_A4S4M3 Cluster: Predicted protein; n=2; Ostreococcus|Rep:
Predicted protein - Ostreococcus lucimarinus CCE9901
Length = 408
Score = 33.9 bits (74), Expect = 2.9
Identities = 23/63 (36%), Positives = 32/63 (50%), Gaps = 3/63 (4%)
Query: 28 AGLEDELETSA--PPPAISTSTNGFDDFVEVPSASAFDANGLLDDAPLGSTPTTVFKQER 85
AG ED ++ A PPP T T+G ++ VEV + ++ AP+GS P V Q R
Sbjct: 233 AGDEDGSDSEALPPPPMPPTGTSGGNE-VEVAPPRVTETTTVVRRAPVGSMPRVVPVQRR 291
Query: 86 EEP 88
P
Sbjct: 292 PPP 294
>UniRef50_Q9ZHL0 Cluster: Large supernatant protein 2; n=4;
Haemophilus ducreyi|Rep: Large supernatant protein 2 -
Haemophilus ducreyi
Length = 4919
Score = 33.5 bits (73), Expect = 3.9
Identities = 27/89 (30%), Positives = 38/89 (42%), Gaps = 4/89 (4%)
Query: 4 FGDSFVEPEVDPAADFLAREQNQLAGLEDELETSAP---PPAISTSTNGFDDFVEVPSAS 60
+G PE A+ A E Q +G + ++ P PPA+ T D EVPS
Sbjct: 2674 YGTINKSPEAIARANAKADEAIQASGYDPRIKPVVPEDAPPALPPRTQSLIDSTEVPSYR 2733
Query: 61 AFDANGLLDDAPLGSTPTTV-FKQEREEP 88
+ AN DDA P+ + K +EP
Sbjct: 2734 SALANVKFDDASPWPQPSALRSKAFADEP 2762
>UniRef50_Q6ALQ5 Cluster: Putative uncharacterized protein; n=1;
Desulfotalea psychrophila|Rep: Putative uncharacterized
protein - Desulfotalea psychrophila
Length = 3196
Score = 33.5 bits (73), Expect = 3.9
Identities = 21/68 (30%), Positives = 33/68 (48%), Gaps = 1/68 (1%)
Query: 136 SKTKAANRESAKNAERAQARGSESSVEEGNEWARVSELCDFGPRRGRDVARLRSIVLQLK 195
+K KA +ESAK A ++ +G E +E+ E ++ D R D L + + L
Sbjct: 1105 AKAKAKKKESAKPAAKSTPKGKEEKIEDFGEKIGGAKK-DLYTRSLTDAEHLDTATVPLS 1163
Query: 196 QAGSRPNY 203
+A PNY
Sbjct: 1164 KAFPAPNY 1171
>UniRef50_Q0JHZ7 Cluster: Os01g0834100 protein; n=6; Oryza
sativa|Rep: Os01g0834100 protein - Oryza sativa subsp.
japonica (Rice)
Length = 444
Score = 33.5 bits (73), Expect = 3.9
Identities = 19/71 (26%), Positives = 39/71 (54%)
Query: 117 LQIAKKELEDWYKSHEEQISKTKAANRESAKNAERAQARGSESSVEEGNEWARVSELCDF 176
L+ +L+D K +EQ+ T+ + R + + AE A+A+ + +S + + A+++EL
Sbjct: 3 LESQLSQLQDELKKAKEQLLSTEHSKRRALQEAEDARAQAAAASAQVRDSEAQLAELSSA 62
Query: 177 GPRRGRDVARL 187
R ++ RL
Sbjct: 63 EESRLLELRRL 73
>UniRef50_A2DRE4 Cluster: Putative uncharacterized protein; n=2;
Trichomonas vaginalis G3|Rep: Putative uncharacterized
protein - Trichomonas vaginalis G3
Length = 193
Score = 33.5 bits (73), Expect = 3.9
Identities = 32/114 (28%), Positives = 51/114 (44%), Gaps = 11/114 (9%)
Query: 87 EPEKIKI---WREEQXXXXXXXXXXXXXXXXXMLQIAKKELEDWYKSHEEQISKTKAANR 143
EPEK+ W + + M Q A ++L +++K EE +T+A +
Sbjct: 85 EPEKVSALVEWEQNRNKIIADQDDEEEKQISAMRQKASEDLSNFHKKIEEG-QETRAHHN 143
Query: 144 ESAKNAERAQARGSESSVEEGNEWARVSELCDFGPR--RGRDVARLRSIVLQLK 195
+AE A ES E N+W V DF +DV+R++ ++LQLK
Sbjct: 144 VEV-DAETKAAL--ESHPE--NQWEGVVSYIDFNRSDLHEKDVSRMKGLLLQLK 192
>UniRef50_Q9LUI2 Cluster: Centromere protein; n=3; Arabidopsis
thaliana|Rep: Centromere protein - Arabidopsis thaliana
(Mouse-ear cress)
Length = 1728
Score = 33.1 bits (72), Expect = 5.1
Identities = 19/59 (32%), Positives = 28/59 (47%)
Query: 132 EEQISKTKAANRESAKNAERAQARGSESSVEEGNEWARVSELCDFGPRRGRDVARLRSI 190
EE + T AN E + E + ES +GN R SELCD R+ ++ L ++
Sbjct: 1173 EEMLKATHNANAELCEAVEELRKDCKESRKLKGNLEKRNSELCDLAGRQDEEIKILSNL 1231
>UniRef50_Q00SK9 Cluster: Homology to unknown gene; n=1;
Ostreococcus tauri|Rep: Homology to unknown gene -
Ostreococcus tauri
Length = 502
Score = 33.1 bits (72), Expect = 5.1
Identities = 27/120 (22%), Positives = 49/120 (40%), Gaps = 2/120 (1%)
Query: 82 KQEREEPEKIKIWREEQXXXXXXXXXXXXXXXXXMLQIAKKELEDWYKSHEEQISKTKAA 141
++ RE +K W E + +++ K LE+ + E +K A
Sbjct: 353 ERARERALAVKAWEEHKTQESVEQEAKRIKEEARRVEMEAKRLEEAKQRVREHTTKDLNA 412
Query: 142 NRESAKNAERAQARGSESSVE-EGNEWARVSELCDFGPRRGRD-VARLRSIVLQLKQAGS 199
E AK E+ + R +E E A+ E + G + R+ +AR S+ +LK + +
Sbjct: 413 KMEQAKRDEQERDRIERERIEAEEARKAKERERKEKGLAKARELLARKNSVKAELKSSAT 472
>UniRef50_A4H4M6 Cluster: Putative uncharacterized protein; n=1;
Leishmania braziliensis|Rep: Putative uncharacterized
protein - Leishmania braziliensis
Length = 1455
Score = 33.1 bits (72), Expect = 5.1
Identities = 16/34 (47%), Positives = 20/34 (58%), Gaps = 2/34 (5%)
Query: 143 RESAKNAERAQA--RGSESSVEEGNEWARVSELC 174
R AKN RA A +GS +VEE EW+R+ C
Sbjct: 1047 RSRAKNTSRASAALQGSRHAVEEAEEWSRLLSRC 1080
>UniRef50_A2DUF4 Cluster: Clan SC, family S9, unassigned serine
peptidase; n=1; Trichomonas vaginalis G3|Rep: Clan SC,
family S9, unassigned serine peptidase - Trichomonas
vaginalis G3
Length = 441
Score = 33.1 bits (72), Expect = 5.1
Identities = 16/46 (34%), Positives = 26/46 (56%)
Query: 121 KKELEDWYKSHEEQISKTKAANRESAKNAERAQARGSESSVEEGNE 166
KKE+E+ +K +EE+ +K KA AK QA+ ++ E N+
Sbjct: 379 KKEMEERHKKNEEKRAKKKAKAEAKAKKLAEEQAKLNQQENTEANQ 424
>UniRef50_Q5KPR4 Cluster: Transcriptional activator, putative; n=2;
Filobasidiella neoformans|Rep: Transcriptional
activator, putative - Cryptococcus neoformans
(Filobasidiella neoformans)
Length = 1290
Score = 33.1 bits (72), Expect = 5.1
Identities = 40/164 (24%), Positives = 64/164 (39%), Gaps = 11/164 (6%)
Query: 30 LEDELETSAPP--PAISTSTNGFDDFVEVPSASAFDANGLLDDAPLGSTPTTVFKQEREE 87
L+D+ T+ P P ++ NGF F S DA L + P V +++
Sbjct: 528 LQDDFLTARVPSTPLMTPPLNGFGSFPVTRRPSQSDATSPLPEV---GPPEQVAEKDPLA 584
Query: 88 PEKIKIWREEQXXXXXXXXXXXXXXXXXMLQIAKKELEDWYKSHEEQISKTK-AANRESA 146
+ K + + L + KKE E K EE+ + K AA +E+A
Sbjct: 585 AQVWKAYARARDTLPNGQRMENLTWRMMHLTLKKKEEEQAAKEKEEREKEDKEAAEKEAA 644
Query: 147 KNAERAQARGSESSVEEGNEWARVSELCDFGPRRGRDVARLRSI 190
+ A A A + ++ A +EL RRGR + R +
Sbjct: 645 EAAAAATAAAATAAAA-----AAAAELPPVEERRGRTKGKSRIV 683
>UniRef50_Q4WT36 Cluster: M protein repeat protein; n=6;
Eurotiomycetidae|Rep: M protein repeat protein -
Aspergillus fumigatus (Sartorya fumigata)
Length = 878
Score = 33.1 bits (72), Expect = 5.1
Identities = 25/84 (29%), Positives = 36/84 (42%), Gaps = 1/84 (1%)
Query: 77 PTTVFKQEREEPEKIKIWREEQXXXXXXXXXXXXXXXXXMLQIA-KKELEDWYKSHEEQI 135
P +V KQ RE+ EKI + EE Q+A +++ K E++
Sbjct: 289 PGSVEKQLREKDEKIALLLEEGQKLSKSEMDHRTAIKKLRQQLADNSKIQMETKKRTEKL 348
Query: 136 SKTKAANRESAKNAERAQARGSES 159
+ A AK AE A+ R SES
Sbjct: 349 ERDLANLEARAKRAEAAEKRASES 372
>UniRef50_Q4G3A5 Cluster: DNA-directed RNA polymerase subunit beta'';
n=1; Emiliania huxleyi|Rep: DNA-directed RNA polymerase
subunit beta'' - Emiliania huxleyi
Length = 1267
Score = 33.1 bits (72), Expect = 5.1
Identities = 18/34 (52%), Positives = 22/34 (64%), Gaps = 1/34 (2%)
Query: 26 QLAGLEDELETSAPPPAISTSTNGFDDFVEVPSA 59
QL GL++E++T A P I T N DFVEV SA
Sbjct: 1025 QLIGLDNEVQTLALKPGIKTKFNS-GDFVEVSSA 1057
>UniRef50_Q9BTC0 Cluster: Death-inducer obliterator 1; n=27;
Eumetazoa|Rep: Death-inducer obliterator 1 - Homo sapiens
(Human)
Length = 2240
Score = 33.1 bits (72), Expect = 5.1
Identities = 15/47 (31%), Positives = 23/47 (48%)
Query: 126 DWYKSHEEQISKTKAANRESAKNAERAQARGSESSVEEGNEWARVSE 172
DW + E + K ++R+ +N ER+ R E + G EW R E
Sbjct: 2128 DWDRPREWDRHRDKDSSRDWDRNRERSANRDREREADRGKEWDRSRE 2174
>UniRef50_UPI000150A032 Cluster: hypothetical protein
TTHERM_00343370; n=1; Tetrahymena thermophila SB210|Rep:
hypothetical protein TTHERM_00343370 - Tetrahymena
thermophila SB210
Length = 1138
Score = 32.7 bits (71), Expect = 6.8
Identities = 19/65 (29%), Positives = 33/65 (50%), Gaps = 4/65 (6%)
Query: 89 EKIKIWREEQXXXXXXXXXXXXXXXXXMLQIAK--KELEDWYKSHEEQISKTKAANRESA 146
E+++IWR++ +QI K KE+E+ + +EE I KTK ++ A
Sbjct: 623 EEVEIWRQKNECEDKINQLSSHHEQHTTIQIEKFTKEIENQKQEYEEHIKKTK--KQQQA 680
Query: 147 KNAER 151
+N +R
Sbjct: 681 ENDDR 685
>UniRef50_UPI0000F1D35A Cluster: PREDICTED: hypothetical protein;
n=1; Danio rerio|Rep: PREDICTED: hypothetical protein -
Danio rerio
Length = 178
Score = 32.7 bits (71), Expect = 6.8
Identities = 16/43 (37%), Positives = 23/43 (53%)
Query: 120 AKKELEDWYKSHEEQISKTKAANRESAKNAERAQARGSESSVE 162
+KKE + HEE+ TK RE+ KNA+ RG+ + E
Sbjct: 87 SKKEKRTTDQPHEEKAETTKRHGREAPKNAKETAGRGTNARHE 129
>UniRef50_Q89GN8 Cluster: UDP-N-acetylglucosamine 2-epimerase; n=3;
cellular organisms|Rep: UDP-N-acetylglucosamine
2-epimerase - Bradyrhizobium japonicum
Length = 841
Score = 32.7 bits (71), Expect = 6.8
Identities = 18/64 (28%), Positives = 30/64 (46%), Gaps = 3/64 (4%)
Query: 33 ELETSAPPPAISTSTNGFDDFVEVPS---ASAFDANGLLDDAPLGSTPTTVFKQEREEPE 89
E + PP I T G+ DFV++ S A D+ G+ ++A + P + E PE
Sbjct: 301 EAKIDLPPNLIWTEPVGYSDFVKLLSNCIAVITDSGGVQEEACILGVPCVTIRDRTERPE 360
Query: 90 KIKI 93
+ +
Sbjct: 361 TVDV 364
>UniRef50_Q1NWV7 Cluster: Putative uncharacterized protein; n=3;
delta proteobacterium MLMS-1|Rep: Putative
uncharacterized protein - delta proteobacterium MLMS-1
Length = 687
Score = 32.7 bits (71), Expect = 6.8
Identities = 22/87 (25%), Positives = 36/87 (41%)
Query: 2 DDFGDSFVEPEVDPAADFLAREQNQLAGLEDELETSAPPPAISTSTNGFDDFVEVPSASA 61
DDFGD F + + AA +E A ++ E AP A ++ DF +
Sbjct: 164 DDFGDLFGDDQQTAAAGGGDQETTPAAAADEAGEEQAPAVAAASEAKDEFDFDDDFDIDG 223
Query: 62 FDANGLLDDAPLGSTPTTVFKQEREEP 88
FD + + D P + V + ++P
Sbjct: 224 FDFDDSIPDIPDEEALSAVAGDDTQQP 250
>UniRef50_A0VKY9 Cluster: Putative uncharacterized protein; n=1;
Delftia acidovorans SPH-1|Rep: Putative uncharacterized
protein - Delftia acidovorans SPH-1
Length = 301
Score = 32.7 bits (71), Expect = 6.8
Identities = 19/53 (35%), Positives = 29/53 (54%), Gaps = 3/53 (5%)
Query: 49 GFDDFVEVPSASAFDANGLLDDAPLGSTPTTVFKQEREE---PEKIKIWREEQ 98
GFDD V +A +A GLL D + + K+ER + E++K +RE+Q
Sbjct: 67 GFDDGVTQKIVAAMEARGLLVDGMVAAWTKRQVKRERTDDSSTERVKAFREKQ 119
>UniRef50_Q4Q843 Cluster: Glycoprotein 96-92, putative; n=5;
Leishmania|Rep: Glycoprotein 96-92, putative -
Leishmania major
Length = 716
Score = 32.7 bits (71), Expect = 6.8
Identities = 27/131 (20%), Positives = 55/131 (41%), Gaps = 3/131 (2%)
Query: 82 KQEREEPEKIKIWREEQXXXXXXXXXXXXXXXXXMLQIAKK-ELEDWYKSHEEQISKTKA 140
++E+EE + ++ E++ + +K ELE+ + EE+ +
Sbjct: 510 EREQEEARQRRVAEEKEAQKKAEKKAEEAEDELAATRRQRKGELEELQRQREEEEKQRIE 569
Query: 141 ANRESAKNAERAQARGSESSVEEGNE--WARVSELCDFGPRRGRDVARLRSIVLQLKQAG 198
R+ + A+R + + E ++E E R EL + RR R+ R V +L+ G
Sbjct: 570 MVRKQREEAQRKREKLKERDIKEAEEIKRQRKEELAELQKRREREQEVQRKKVEELRTKG 629
Query: 199 SRPNYPSRATK 209
+ + + K
Sbjct: 630 KKDSKKEQILK 640
>UniRef50_Q4E491 Cluster: Mucin-associated surface protein (MASP),
putative; n=3; Trypanosoma cruzi|Rep: Mucin-associated
surface protein (MASP), putative - Trypanosoma cruzi
Length = 461
Score = 32.7 bits (71), Expect = 6.8
Identities = 19/55 (34%), Positives = 26/55 (47%), Gaps = 3/55 (5%)
Query: 37 SAPPPAISTSTNGFDDFVEVPSASAFD---ANGLLDDAPLGSTPTTVFKQEREEP 88
SAPPP IST +NG + V S + + L P P+ V+ QE+ P
Sbjct: 165 SAPPPFISTGSNGSQTVIGVSSTQQMEYSKESANLLKKPAEDIPSKVYPQEKPAP 219
>UniRef50_Q4CZ55 Cluster: Histone deacetylase, putative; n=2;
Trypanosoma cruzi|Rep: Histone deacetylase, putative -
Trypanosoma cruzi
Length = 661
Score = 32.7 bits (71), Expect = 6.8
Identities = 15/48 (31%), Positives = 28/48 (58%), Gaps = 2/48 (4%)
Query: 125 EDWYKSHEEQISKTKAANRESAKNAERAQARGSE-SSVEEGNEWARVS 171
+DW++ H+ + + + E A+NAE+ Q + + S EEG+ W R +
Sbjct: 355 DDWHQ-HQTKEEEKEEEEEEEAENAEKVQRQNPQLSPEEEGDGWKRAA 401
>UniRef50_Q09EF7 Cluster: Putative uncharacterized protein; n=8;
Caenorhabditis|Rep: Putative uncharacterized protein -
Caenorhabditis elegans
Length = 1911
Score = 32.7 bits (71), Expect = 6.8
Identities = 19/68 (27%), Positives = 33/68 (48%), Gaps = 1/68 (1%)
Query: 24 QNQLAGLEDELETSAPPPAISTSTNGFDDFVEVP-SASAFDANGLLDDAPLGSTPTTVFK 82
Q++LA L+ S P S S +V++P +AS+ N D+ PL S+P+ F
Sbjct: 1246 QSKLANLQKSAVESLQNPMSSNSRQNRSIYVDIPRAASSIGLNENSDEVPLRSSPSVRFA 1305
Query: 83 QEREEPEK 90
+ ++
Sbjct: 1306 DSSQNMQR 1313
>UniRef50_A7AUT5 Cluster: Putative uncharacterized protein; n=1;
Babesia bovis|Rep: Putative uncharacterized protein -
Babesia bovis
Length = 691
Score = 32.7 bits (71), Expect = 6.8
Identities = 22/72 (30%), Positives = 38/72 (52%), Gaps = 5/72 (6%)
Query: 28 AGLEDELETSAPPPAISTSTNGFDDFVEVPSASAFDANGLLDDAPL--GSTPTTVFKQE- 84
A + +L + P + T+++ D +E P+ S DA+ +D +P GST + FK +
Sbjct: 198 ADTQSQLPNNTNPETLETASSVLDSHIESPTQSQVDAD--MDSSPSAPGSTVDSTFKVKF 255
Query: 85 REEPEKIKIWRE 96
R + IK +RE
Sbjct: 256 RLSLQSIKDYRE 267
>UniRef50_A4R5R2 Cluster: Putative uncharacterized protein; n=1;
Magnaporthe grisea|Rep: Putative uncharacterized protein
- Magnaporthe grisea (Rice blast fungus) (Pyricularia
grisea)
Length = 1153
Score = 32.7 bits (71), Expect = 6.8
Identities = 15/47 (31%), Positives = 29/47 (61%), Gaps = 3/47 (6%)
Query: 117 LQIAKKELEDWYKSHEEQISKTKAANRESAKNAERAQARGSESSVEE 163
L+ +K+LED SH E++ K A + + +N E+ A G+E+++ +
Sbjct: 841 LEATRKDLED---SHNEEVKKLVALHTNALQNLEQQGAAGTEATIAQ 884
>UniRef50_UPI0001554761 Cluster: PREDICTED: similar to retinoic acid
induced 1; n=1; Ornithorhynchus anatinus|Rep: PREDICTED:
similar to retinoic acid induced 1 - Ornithorhynchus
anatinus
Length = 1389
Score = 32.3 bits (70), Expect = 9.0
Identities = 23/61 (37%), Positives = 29/61 (47%), Gaps = 5/61 (8%)
Query: 3 DFGDSFVEPEVDPAADFLAREQNQLAG--LEDELETSAPPPAISTSTNGFDDFVEVPSAS 60
DFG E DPAA+F A E + AG L E S P ++ T FD F + + S
Sbjct: 684 DFGQQLFE---DPAAEFTAPESKKAAGPPLPYNPEPSLPAAPANSETATFDCFPDPATGS 740
Query: 61 A 61
A
Sbjct: 741 A 741
>UniRef50_Q1LXA8 Cluster: Novel protein similar to human
extracellular matrix protein 2, female organ and
adipocyte specific; n=4; Euteleostomi|Rep: Novel protein
similar to human extracellular matrix protein 2, female
organ and adipocyte specific - Danio rerio (Zebrafish)
(Brachydanio rerio)
Length = 666
Score = 32.3 bits (70), Expect = 9.0
Identities = 24/94 (25%), Positives = 42/94 (44%), Gaps = 6/94 (6%)
Query: 82 KQEREEPEKIKIWREEQXXXXXXXXXXXXXXXXXMLQIAKKELEDWYKSHEEQISKTKAA 141
K+E E +K+K +E + + KK++E+ ++ EE+ + +
Sbjct: 199 KEEAERQKKLKEAKEAKEKEAEQRRLREEEEEKAAAEERKKKMEEQRRAEEERARRLEME 258
Query: 142 NRESAK----NAERAQ--ARGSESSVEEGNEWAR 169
RE + AERAQ RG E++ +E W R
Sbjct: 259 QREMMRALEEAAERAQEGLRGDEATEDEDIVWLR 292
>UniRef50_Q6MKC8 Cluster: Putative signal peptide protein containing
LysM motif precursor; n=1; Bdellovibrio
bacteriovorus|Rep: Putative signal peptide protein
containing LysM motif precursor - Bdellovibrio
bacteriovorus
Length = 549
Score = 32.3 bits (70), Expect = 9.0
Identities = 22/83 (26%), Positives = 37/83 (44%), Gaps = 4/83 (4%)
Query: 14 DPAADFLAREQNQLAGLEDELETSAPPPAISTSTNGFDDFVEVPSASAFDANGLLDDAPL 73
+P AD A ++A E +E A PPA + T + F VP+ A G D
Sbjct: 61 EPGADVPAPPAPEVASPEPTMEDIA-PPAPTPETQTVETFSPVPATGAV---GSEPDYAR 116
Query: 74 GSTPTTVFKQEREEPEKIKIWRE 96
+ ++K E+P +++W +
Sbjct: 117 EAEFHRIYKNYNEQPTSVELWEK 139
>UniRef50_Q9VC00 Cluster: CG13648-PA; n=1; Drosophila
melanogaster|Rep: CG13648-PA - Drosophila melanogaster
(Fruit fly)
Length = 2768
Score = 32.3 bits (70), Expect = 9.0
Identities = 20/67 (29%), Positives = 33/67 (49%), Gaps = 1/67 (1%)
Query: 20 LAREQNQLAGLEDELETSAPPPAISTSTNGFDDFVEVPSASAFDANGLLDDAPLGSTPTT 79
++ E ++ ED+L +S AI++ST G D ++SA G D+A + PT
Sbjct: 835 ISEESTEVPVAEDDLSSSTSASAIASSTEGVQDAASETTSSAPARAGDKDEAAT-TVPTA 893
Query: 80 VFKQERE 86
K + E
Sbjct: 894 QDKDDEE 900
>UniRef50_Q7R414 Cluster: GLP_68_19620_20219; n=1; Giardia lamblia
ATCC 50803|Rep: GLP_68_19620_20219 - Giardia lamblia
ATCC 50803
Length = 199
Score = 32.3 bits (70), Expect = 9.0
Identities = 17/51 (33%), Positives = 28/51 (54%), Gaps = 4/51 (7%)
Query: 123 ELEDWYKSH-EEQISKTKAANRESAKNAERAQARGSESSVEEGNEWARVSE 172
+LE WY +H EE+I + A R+ KN E+ + E G + A+V++
Sbjct: 66 DLEQWYLAHPEERIKDKEEAERQRQKNREKKE---KEKEKRPGQQSAKVAQ 113
>UniRef50_Q55F94 Cluster: Putative dynamin family protein; n=1;
Dictyostelium discoideum AX4|Rep: Putative dynamin
family protein - Dictyostelium discoideum AX4
Length = 880
Score = 32.3 bits (70), Expect = 9.0
Identities = 37/149 (24%), Positives = 57/149 (38%), Gaps = 14/149 (9%)
Query: 31 EDELETSAPPPAISTSTNGFDDFVEVPS--ASAFDANGLLDDAPLGSTPTTVFKQEREEP 88
E+ + + P + STS + + AS NG + GSTPT EE
Sbjct: 19 EETPQGTFQPASQSTSNTNLNSLASSVNNGASVGSTNGSTPNNSNGSTPTYNHNNSAEEL 78
Query: 89 EKIKIWREEQXXXXXXXXXXXXXXXXXMLQIAKKELEDWYKSHEEQISKTKAANRESAKN 148
EK K +E+ +AKK+ ED K +EQ+ + E +
Sbjct: 79 EKQKKEEDEKRKKSELEAAAAV--------VAKKK-EDEEKQRKEQVELERKRRDEEIR- 128
Query: 149 AERAQARGSESSVEEGNEWARVSELCDFG 177
R A + ++ +E NE +S L G
Sbjct: 129 --RTNAAAANAANKELNEQVEISSLEQMG 155
>UniRef50_Q22515 Cluster: Putative uncharacterized protein; n=3;
Caenorhabditis|Rep: Putative uncharacterized protein -
Caenorhabditis elegans
Length = 921
Score = 32.3 bits (70), Expect = 9.0
Identities = 16/51 (31%), Positives = 26/51 (50%), Gaps = 2/51 (3%)
Query: 118 QIAKKELEDWYKSHEEQISKTKA--ANRESAKNAERAQARGSESSVEEGNE 166
++ KKE ++W K EE K KA +E K E + + +++ EE E
Sbjct: 550 EVDKKEFDEWEKEQEELKKKEKAEKKEKEEKKKTEGEEEKNEDAAGEEKTE 600
>UniRef50_A2EGE9 Cluster: Putative uncharacterized protein; n=1;
Trichomonas vaginalis G3|Rep: Putative uncharacterized
protein - Trichomonas vaginalis G3
Length = 1157
Score = 32.3 bits (70), Expect = 9.0
Identities = 42/183 (22%), Positives = 71/183 (38%), Gaps = 9/183 (4%)
Query: 7 SFVEPEVDPAADFLAREQNQLAGLEDELETSAPPPAISTSTNGFDDFVEVPSASAFDANG 66
S V ++ +F+ + NQL +E E PP +T + P +DA
Sbjct: 869 STVYNQLQTLNNFIKEQYNQLRAKMNE-EEIIPPQITATQKSTLKSIDFTPLT--YDAGS 925
Query: 67 LLDDAPLGSTPTTVFKQEREEPEKIKIWREEQXXXXXXXXXXXXXXXXXMLQIAKKELED 126
++ ++E+EE E+ KI E++ + +KE E+
Sbjct: 926 --EEKKKKEEEEERKRKEKEEEERKKIEEEKKRKEEEKKRKEEEEKRKKEEERKRKEEEE 983
Query: 127 WY-KSHEEQISKTKAANRESAKNAERAQARGSESSVEEGNEWARVSELCDFGPRRGRDVA 185
K EE+ + + R+ + ER + E ++E E R L + RR D A
Sbjct: 984 RKRKEEEEERKRREEEERKRKEEEERKRREEEERRIKEEKERQR---LIEEERRRQEDEA 1040
Query: 186 RLR 188
RLR
Sbjct: 1041 RLR 1043
>UniRef50_Q9LUZ9 Cluster: RING-H2 finger protein ATL5O; n=1;
Arabidopsis thaliana|Rep: RING-H2 finger protein ATL5O -
Arabidopsis thaliana (Mouse-ear cress)
Length = 308
Score = 32.3 bits (70), Expect = 9.0
Identities = 16/47 (34%), Positives = 26/47 (55%)
Query: 44 STSTNGFDDFVEVPSASAFDANGLLDDAPLGSTPTTVFKQEREEPEK 90
S GFDD V P+A+A + LD + + S P V+++ EE ++
Sbjct: 88 SLPLGGFDDGVSSPAATATRDDKGLDSSVISSIPLFVYEENEEEEDE 134
Database: uniref50
Posted date: Oct 5, 2007 11:19 AM
Number of letters in database: 575,637,011
Number of sequences in database: 1,657,284
Lambda K H
0.309 0.127 0.358
Gapped
Lambda K H
0.279 0.0580 0.190
Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 222,051,070
Number of Sequences: 1657284
Number of extensions: 8399227
Number of successful extensions: 25539
Number of sequences better than 10.0: 56
Number of HSP's better than 10.0 without gapping: 22
Number of HSP's successfully gapped in prelim test: 34
Number of HSP's that attempted gapping in prelim test: 25474
Number of HSP's gapped (non-prelim): 72
length of query: 210
length of database: 575,637,011
effective HSP length: 97
effective length of query: 113
effective length of database: 414,880,463
effective search space: 46881492319
effective search space used: 46881492319
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 42 (21.7 bits)
S2: 70 (32.3 bits)
- SilkBase 1999-2023 -