BLASTP 2.2.12 [Aug-07-2005]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= BGIBMGA000355-TA|BGIBMGA000355-PA|IPR002853|Transcription
factor TFIIE, alpha subunit
(421 letters)
Database: uniref50
1,657,284 sequences; 575,637,011 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
UniRef50_O96880 Cluster: CG10415-PA; n=7; Endopterygota|Rep: CG1... 517 e-145
UniRef50_P29083 Cluster: Transcription initiation factor IIE sub... 330 3e-89
UniRef50_Q0P4H8 Cluster: General transcription factor IIE, polyp... 299 1e-79
UniRef50_Q7ZTQ1 Cluster: Tfiiealpha protein; n=2; Tetrapoda|Rep:... 293 6e-78
UniRef50_UPI0000588A8B Cluster: PREDICTED: hypothetical protein;... 285 2e-75
UniRef50_A7SES2 Cluster: Predicted protein; n=1; Nematostella ve... 266 6e-70
UniRef50_Q5C3D9 Cluster: SJCHGC08129 protein; n=1; Schistosoma j... 175 1e-42
UniRef50_O62513 Cluster: Putative uncharacterized protein; n=2; ... 150 8e-35
UniRef50_Q6C0A8 Cluster: Yarrowia lipolytica chromosome F of str... 102 2e-20
UniRef50_Q4P883 Cluster: Putative uncharacterized protein; n=1; ... 101 5e-20
UniRef50_P36100 Cluster: Transcription initiation factor IIE sub... 96 1e-18
UniRef50_A2QZ57 Cluster: Contig An12c0110, complete genome; n=9;... 92 3e-17
UniRef50_Q557M8 Cluster: Transcription factor IIE; n=5; Dictyost... 89 2e-16
UniRef50_A7TS52 Cluster: Putative uncharacterized protein; n=1; ... 87 1e-15
UniRef50_A5DK07 Cluster: Putative uncharacterized protein; n=1; ... 86 2e-15
UniRef50_Q9HEM3 Cluster: Related to transcription initiation fac... 84 6e-15
UniRef50_Q6BLR4 Cluster: Debaryomyces hansenii chromosome F of s... 83 1e-14
UniRef50_Q8SRL9 Cluster: TRANSCRIPTION INITIATION FACTOR TFIIE A... 82 2e-14
UniRef50_A4R1J8 Cluster: Putative uncharacterized protein; n=2; ... 74 6e-12
UniRef50_Q5KK86 Cluster: Transcription initiation factor TFIIE a... 71 6e-11
UniRef50_A6SDF4 Cluster: Putative uncharacterized protein; n=2; ... 71 6e-11
UniRef50_Q0U5E8 Cluster: Putative uncharacterized protein; n=1; ... 65 4e-09
UniRef50_A4RRX3 Cluster: Predicted protein; n=2; Ostreococcus|Re... 59 2e-07
UniRef50_Q9ZVS9 Cluster: F15K9.12 protein; n=7; Magnoliophyta|Re... 58 6e-07
UniRef50_Q9P3W1 Cluster: Transcription initiation factor IIE sub... 58 6e-07
UniRef50_A2FXI4 Cluster: TFIIE alpha subunit family protein; n=1... 55 4e-06
UniRef50_Q7RRA7 Cluster: Putative uncharacterized protein PY0082... 52 4e-05
UniRef50_Q9SVG6 Cluster: Putative uncharacterized protein F21C20... 47 8e-04
UniRef50_Q4WT96 Cluster: C2H2 transcription factor, putative; n=... 42 0.040
UniRef50_Q5CFE9 Cluster: Transcription initiation factor iie, al... 38 0.50
UniRef50_A0DI32 Cluster: Chromosome undetermined scaffold_51, wh... 38 0.50
UniRef50_Q2RAU3 Cluster: Transcription initiation factor IIE, pu... 38 0.66
UniRef50_Q5F437 Cluster: Putative uncharacterized protein; n=58;... 37 1.1
UniRef50_Q7Q1Z2 Cluster: ENSANGP00000020855; n=3; Endopterygota|... 37 1.1
UniRef50_Q4UIR5 Cluster: Transcription factor TFIIE, putative; n... 36 1.5
UniRef50_UPI0000DB6E7A Cluster: PREDICTED: similar to zinc finge... 36 2.0
UniRef50_A5K1L8 Cluster: Putative uncharacterized protein; n=1; ... 36 2.0
UniRef50_UPI0000F2EAC5 Cluster: PREDICTED: similar to Zinc finge... 36 2.6
UniRef50_UPI00006A2359 Cluster: UPI00006A2359 related cluster; n... 36 2.6
UniRef50_UPI000065DC2F Cluster: Homolog of Homo sapiens "Zinc fi... 36 2.6
UniRef50_A0LSB8 Cluster: ATPase, BadF/BadG/BcrA/BcrD type; n=2; ... 36 2.6
UniRef50_Q17AY2 Cluster: Putative uncharacterized protein; n=1; ... 36 2.6
UniRef50_UPI00015BB057 Cluster: Transcription factor TFIIE, alph... 35 3.5
UniRef50_UPI000155C8D6 Cluster: PREDICTED: similar to novel KRAB... 35 3.5
UniRef50_UPI000155491E Cluster: PREDICTED: hypothetical protein;... 35 3.5
UniRef50_Q9YAD5 Cluster: Transcription factor E; n=1; Aeropyrum ... 35 3.5
UniRef50_Q9H0M5 Cluster: Zinc finger protein 700; n=149; Eutheri... 35 3.5
UniRef50_UPI0000F1FD74 Cluster: PREDICTED: similar to zinc finge... 35 4.6
UniRef50_A5VI23 Cluster: Transposase, IS605 OrfB family; n=6; La... 35 4.6
UniRef50_Q0IQ75 Cluster: Os12g0140200 protein; n=1; Oryza sativa... 35 4.6
UniRef50_Q5C1C7 Cluster: SJCHGC07628 protein; n=1; Schistosoma j... 35 4.6
UniRef50_UPI0000F2DB49 Cluster: PREDICTED: similar to zinc finge... 34 6.1
UniRef50_UPI0000F2D3EE Cluster: PREDICTED: similar to novel KRAB... 34 6.1
UniRef50_UPI0000D9C692 Cluster: PREDICTED: similar to zinc finge... 34 6.1
UniRef50_UPI00015A6608 Cluster: UPI00015A6608 related cluster; n... 34 6.1
UniRef50_Q4SC94 Cluster: Chromosome undetermined SCAF14659, whol... 34 6.1
UniRef50_A3DDQ2 Cluster: Putative uncharacterized protein; n=2; ... 34 6.1
UniRef50_Q4R3I4 Cluster: Testis cDNA clone: QtsA-16729, similar ... 34 6.1
UniRef50_Q4V6Y6 Cluster: IP01303p; n=3; Sophophora|Rep: IP01303p... 34 6.1
UniRef50_A7AVU0 Cluster: Putative uncharacterized protein; n=1; ... 34 6.1
UniRef50_A5K7P3 Cluster: Putative uncharacterized protein; n=1; ... 34 6.1
UniRef50_A3LWV1 Cluster: Predicted protein; n=2; Pichia|Rep: Pre... 34 6.1
UniRef50_UPI0000F2E8AD Cluster: PREDICTED: similar to novel KRAB... 34 8.1
UniRef50_UPI0000F1DD89 Cluster: PREDICTED: hypothetical protein;... 34 8.1
UniRef50_UPI0000588499 Cluster: PREDICTED: hypothetical protein;... 34 8.1
UniRef50_Q4TGZ6 Cluster: Chromosome undetermined SCAF3363, whole... 34 8.1
UniRef50_Q4SH16 Cluster: Chromosome 8 SCAF14587, whole genome sh... 34 8.1
UniRef50_Q4RT41 Cluster: Chromosome 12 SCAF14999, whole genome s... 34 8.1
UniRef50_Q8YQ19 Cluster: Alr4017 protein; n=11; Cyanobacteria|Re... 34 8.1
UniRef50_Q9VFB9 Cluster: CG6654-PA; n=2; Sophophora|Rep: CG6654-... 34 8.1
UniRef50_Q7QY61 Cluster: GLP_572_40344_41573; n=1; Giardia lambl... 34 8.1
UniRef50_A0BLR1 Cluster: Chromosome undetermined scaffold_114, w... 34 8.1
UniRef50_Q2GW72 Cluster: Putative uncharacterized protein; n=1; ... 34 8.1
UniRef50_Q57878 Cluster: 7-cyano-7-deazaguanine tRNA-ribosyltran... 34 8.1
>UniRef50_O96880 Cluster: CG10415-PA; n=7; Endopterygota|Rep:
CG10415-PA - Drosophila melanogaster (Fruit fly)
Length = 429
Score = 517 bits (1276), Expect = e-145
Identities = 264/426 (61%), Positives = 317/426 (74%), Gaps = 18/426 (4%)
Query: 2 TEERYVTEVPSSLKQLARLVVRGFYTIEDALIVDMLVRNPCMKEDDICELLKFERKMLRA 61
TE RYVTEVPSSLKQLARLVVRGFY++EDALI+DMLVRNPCMKEDDI ELL+FE+K LRA
Sbjct: 16 TEVRYVTEVPSSLKQLARLVVRGFYSLEDALIIDMLVRNPCMKEDDIGELLRFEKKQLRA 75
Query: 62 RISILKNDKFIQVRLKMETGLDGKAQKVNYYFINYKTFVNVVKYKLDLMRKRLETEERDA 121
RI+ L+ DKFIQ+RLKMETG DGKAQKVNYYFINYKTFVNVVKYKLDLMRKR+ETEERDA
Sbjct: 76 RITTLRTDKFIQIRLKMETGPDGKAQKVNYYFINYKTFVNVVKYKLDLMRKRMETEERDA 135
Query: 122 TSRASFKCPACGKTFTDLEADQLYDMATQEFQCTFCSAVVDEDMSALPKKDSRLLLAKFN 181
TSRASFKC +C KTFTDLEADQL+DMAT EF+CTFC + V+ED +A+PKKDSRL+LA FN
Sbjct: 136 TSRASFKCSSCSKTFTDLEADQLFDMATLEFRCTFCGSSVEEDSAAMPKKDSRLMLAHFN 195
Query: 182 EQLETLYILLREVEGIKLAPEILEPEPVDINTIRGLTSKQSALRPGGEQWSGEATRNQGM 241
EQL+ LY LLREVEGIKLAPE+LEPEPVDI+TIRGL +K +A RP G WSGEATRNQG
Sbjct: 196 EQLQPLYDLLREVEGIKLAPEVLEPEPVDIDTIRGL-NKPNATRPDGMAWSGEATRNQGF 254
Query: 242 LVEETRVDVTIGDDKPARDAGALRKERPVWMVESTIASNEQSDSAHSTDXXXXXXXXXXX 301
VEETRVDVTIG D + DA RK RP+WM EST+ ++ +D+A
Sbjct: 255 AVEETRVDVTIGGDDTS-DAVIERKSRPIWMTESTVITD--TDAADG----AADAVQTAS 307
Query: 302 XXXXXGKEKNDDIMSVLLAHEKQPSAGNPVSNAVKGLXXXXXXXXXXXXXXPYKLKDELA 361
+++N+DIMSVLL HEKQP P +KG+ K E +
Sbjct: 308 GSGHRNRKENEDIMSVLLQHEKQPGQKEP---HMKGMRVGSSNANSSDSSDDEK-DIENS 363
Query: 362 AVAEMEXXXXXXXXNA------PSVMVNGKTVPLTSVDDDVIAQMTPTEKETYIQIYQEY 415
+ +++ +A P+V+V G+ PL +DD++IAQMTP EKE YI +YQ++
Sbjct: 364 KIPDVDFDNYINSDSAEEDDDVPTVLVAGRPHPLDQLDDNLIAQMTPQEKENYIHVYQQH 423
Query: 416 YSHMYD 421
YSH+++
Sbjct: 424 YSHIFE 429
>UniRef50_P29083 Cluster: Transcription initiation factor IIE
subunit alpha; n=28; Euteleostomi|Rep: Transcription
initiation factor IIE subunit alpha - Homo sapiens
(Human)
Length = 439
Score = 330 bits (812), Expect = 3e-89
Identities = 183/441 (41%), Positives = 257/441 (58%), Gaps = 22/441 (4%)
Query: 1 MTEERYVTEVPSSLKQLARLVVRGFYTIEDALIVDMLVRNPCMKEDDICELLKFERKMLR 60
M + +TEVP++LK+LA+ V+RGFY IE AL +D+L+RN C+KE+D+ ELLKF+RK LR
Sbjct: 1 MADPDVLTEVPAALKRLAKYVIRGFYGIEHALALDILIRNSCVKEEDMLELLKFDRKQLR 60
Query: 61 ARISILKNDKFIQVRLKMETGLDGKAQKVNYYFINYKTFVNVVKYKLDLMRKRLETEERD 120
+ ++ LK DKFI+ R+++ET DGK + NYYFINY+T VNVVKYKLD MR+R+ET+ERD
Sbjct: 61 SVLNNLKGDKFIKCRMRVETAADGKTTRHNYYFINYRTLVNVVKYKLDHMRRRIETDERD 120
Query: 121 ATSRASFKCPACGKTFTDLEADQLYDMATQEFQCTFCSAVVDEDMSALPKKDSRLLLAKF 180
+T+RASFKCP C TFTDLEA+QL+D T F+CTFC V+ED SA+PKKD+R LLA+F
Sbjct: 121 STNRASFKCPVCSSTFTDLEANQLFDPMTGTFRCTFCHTEVEEDESAMPKKDARTLLARF 180
Query: 181 NEQLETLYILLREVEGIKLAPEILEPEPVDINTIR-----GLTSKQSALRPGG---EQWS 232
NEQ+E +Y LLRE E + LA EILEPEP +I ++ T+ +A GG E W+
Sbjct: 181 NEQIEPIYALLRETEDVNLAYEILEPEPTEIPALKQSKDHAATTAGAASLAGGHHREAWA 240
Query: 233 GEATRNQGMLVEETRVDVTIGDD-KPARDAGALRKERPVWMVESTIASNEQSDSAHSTDX 291
+ + + + +++ +D A G KERP+W+ EST+ S+ +
Sbjct: 241 TKGPSYEDLYTQNVVINMDDQEDLHRASLEGKSAKERPIWLRESTVQGAYGSEDMK--EG 298
Query: 292 XXXXXXXXXXXXXXXGKEKNDDIMSVLLAHEKQPSAG--NPVSNAVKGLXXXXXXXXXXX 349
G + N+++M LL HEK+ S+ V A
Sbjct: 299 GIDMDAFQEREEGHAGPDDNEEVMRALLIHEKKTSSAMAGSVGAAAPVTAANGSDSESET 358
Query: 350 XXXPYKLKDELAAVA-------EMEXXXXXXXXNAPSVMVNGKTVPLTSVDD--DVIAQM 400
AAVA E E + P VMV G+ + V +++AQM
Sbjct: 359 SESDDDSPPRPAAVAVHKREEDEEEDDEFEEVADDPIVMVAGRPFSYSEVSQRPELVAQM 418
Query: 401 TPTEKETYIQIYQEYYSHMYD 421
TP EKE YI + Q + +++
Sbjct: 419 TPEEKEAYIAMGQRMFEDLFE 439
>UniRef50_Q0P4H8 Cluster: General transcription factor IIE,
polypeptide 1, alpha 56kDa; n=2; Tetrapoda|Rep: General
transcription factor IIE, polypeptide 1, alpha 56kDa -
Xenopus tropicalis (Western clawed frog) (Silurana
tropicalis)
Length = 421
Score = 299 bits (733), Expect = 1e-79
Identities = 167/430 (38%), Positives = 260/430 (60%), Gaps = 18/430 (4%)
Query: 1 MTEERYVTEVPSSLKQLARLVVRGFYTIEDALIVDMLVRNPCMKEDDICELLKFERKMLR 60
M ++ +TEVP++LK+LA+ +VRGFY +E +L +D+L+R PC+KEDDI LLKFE+K LR
Sbjct: 1 MGDQETMTEVPAALKRLAKYMVRGFYGLEYSLTLDVLIRYPCVKEDDIGLLLKFEKKQLR 60
Query: 61 ARISILKNDKFIQVRLKMETGLDGKAQKVNYYFINYKTFVNVVKYKLDLMRKRLETEERD 120
++ LK DKFI+ R+++ETG +GK+ + NYY+INYK V+VVKYKLD +R+++E++ERD
Sbjct: 61 TILNTLKADKFIKCRMRVETGPNGKSTRHNYYYINYKVLVDVVKYKLDHVRRKIESDERD 120
Query: 121 ATSRASFKCPACGKTFTDLEADQLYDMATQEFQCTFCSAVVDEDMSALPKKDSRLLLAKF 180
+T+RASFKCP C T++DLE +QL+D T+ F+CT+C+ V+ED S+LPK+D+R LLA+F
Sbjct: 121 STTRASFKCPGCLSTYSDLEVNQLFDPFTELFRCTYCNVEVEEDSSSLPKRDARTLLARF 180
Query: 181 NEQLETLYILLREVEGIKLAPEILEPEPVDINTIRGLTSKQS---ALRPGGEQ--WSGEA 235
NEQ+E +++LL+E E I L E+LEP+P +I + G +QS AL G+Q W+ ++
Sbjct: 181 NEQIEPIFVLLQETEDIILPCELLEPQPTEIPELCGSFDQQSSSLALDLQGQQGKWANKS 240
Query: 236 TRNQGMLVEETRVDVTIGDDKPARDAGALRKERPVWMVESTIASNEQSDSAHSTDXXXXX 295
+ M V+ ++V D K + KE+PVWM +ST+ + +S+ S
Sbjct: 241 SVG-NMYVQNVTINVRESDFKKKGKERKV-KEQPVWMKDSTVHGSPPEESSASFKTEAPL 298
Query: 296 XXXXXXXXXXXGKEKNDD--IMSVLLAHEKQPSAGNPVSNAVKGLXXXXXXXXXXXXXXP 353
KE N D ++ LL HE + +G V++ K
Sbjct: 299 IEDENANL----KEDNPDNEVIRTLLIHEMKSMSGPAVNSFPKSDSGSDTSESDEEKKST 354
Query: 354 YKLKDELAAVAEMEXXXXXXXXNAPSVMVNGKTVPLTSVDDD--VIAQMTPTEKETYIQI 411
E + AE E P VMV+G+ + V + +++ MT E+E YI +
Sbjct: 355 KPAPGESHSNAEQEEESETVD---PVVMVSGQPHVYSEVSQNPSLVSFMTEEEREAYITV 411
Query: 412 YQEYYSHMYD 421
Q+ + +++
Sbjct: 412 GQKMFQSVFE 421
>UniRef50_Q7ZTQ1 Cluster: Tfiiealpha protein; n=2; Tetrapoda|Rep:
Tfiiealpha protein - Xenopus laevis (African clawed
frog)
Length = 263
Score = 293 bits (719), Expect = 6e-78
Identities = 136/215 (63%), Positives = 171/215 (79%)
Query: 1 MTEERYVTEVPSSLKQLARLVVRGFYTIEDALIVDMLVRNPCMKEDDICELLKFERKMLR 60
MT+ TEVP+ LK+LA+ VVRGFY +E AL +D+L+RNPC+KE+D+ ELLKF+RK LR
Sbjct: 1 MTDPDVATEVPAVLKRLAKYVVRGFYGLEHALALDILIRNPCVKEEDMMELLKFDRKQLR 60
Query: 61 ARISILKNDKFIQVRLKMETGLDGKAQKVNYYFINYKTFVNVVKYKLDLMRKRLETEERD 120
A ++ LK DKFI+ R+++ET DGK + NYYFINYK VNVVKYKLD MR+R+ET+ERD
Sbjct: 61 AVLNTLKGDKFIKCRMRVETATDGKTTRHNYYFINYKLLVNVVKYKLDHMRRRIETDERD 120
Query: 121 ATSRASFKCPACGKTFTDLEADQLYDMATQEFQCTFCSAVVDEDMSALPKKDSRLLLAKF 180
+T+RASFKCP C TFTDLEA+QL+D T F+CTFC V+ED SA+PKKD+R L+A+F
Sbjct: 121 STNRASFKCPNCCSTFTDLEANQLFDPMTGMFRCTFCQTEVEEDESAMPKKDARTLVARF 180
Query: 181 NEQLETLYILLREVEGIKLAPEILEPEPVDINTIR 215
NEQ+E +Y LLRE E I LA EILEPEP DI +R
Sbjct: 181 NEQIEPIYALLRETEDINLAYEILEPEPTDIPALR 215
>UniRef50_UPI0000588A8B Cluster: PREDICTED: hypothetical protein;
n=1; Strongylocentrotus purpuratus|Rep: PREDICTED:
hypothetical protein - Strongylocentrotus purpuratus
Length = 436
Score = 285 bits (698), Expect = 2e-75
Identities = 167/439 (38%), Positives = 247/439 (56%), Gaps = 25/439 (5%)
Query: 1 MTEERYVTEVPSSLKQLARLVVRGFYTIEDALIVDMLVRNPCMKEDDICELLKFERKMLR 60
++E+ +T VP LKQLA+ ++ GFY++E AL+VDMLVRN MKEDD+ +LLKF++K LR
Sbjct: 5 LSEQGVLTVVPDKLKQLAKYIMHGFYSVEHALVVDMLVRNTIMKEDDLADLLKFDKKQLR 64
Query: 61 ARISILKNDKFIQVRLKMETGLDGKAQKVNYYFINYKTFVNVVKYKLDLMRKRLETEERD 120
+ +KNDK ++ +++ET DG+A + YY+INYKTFVNVV++KLD MR+++E ERD
Sbjct: 65 TLLMKIKNDKLLKQVMRVETQADGRAMRHYYYYINYKTFVNVVRFKLDRMRQKIENRERD 124
Query: 121 ATSRASFKCPACGKTFTDLEADQLYDMATQEFQCTFCSAVVDEDMSALPKKDSRLLLAKF 180
+T +A FKC C +T+ + +A LYD T +CT+C A V ED SA+PK+D+ + F
Sbjct: 125 STCKALFKCTECTRTYNEFDAGNLYDPITMAMKCTYCGAEVQEDESAVPKRDAVTQMVTF 184
Query: 181 NEQLETLYILLREVEGIKLAPEILEPEPVDINTI-RGLTSKQSALRPGGEQ--WSGEATR 237
NEQ++ ++ LL++VE +KL+ ++LEP PV ++ RGL + L GG Q W E R
Sbjct: 185 NEQMKPMFDLLKDVEHVKLSQQLLEPTPVPLSDAQRGLLGNR-GLNVGGPQRGWR-EGDR 242
Query: 238 NQGMLVEETRVDVTIGDDK-PARDAGALRKERPVWMVESTIASNEQSDSAHSTDXXXXXX 296
L + V + IG+++ PA D A R ERPVWM +ST+ E + + +
Sbjct: 243 GPAELYHQD-VTINIGENQGPATDQPAPR-ERPVWMTDSTV---EGAITEAAPSGSGVET 297
Query: 297 XXXXXXXXXXGKEKNDDIMSVLLAHEK----QPS---AGNPVS-NAVKGLXXXXXXXXXX 348
GK +IM LLAHE+ +PS P S N
Sbjct: 298 MEAERAGTKAGKSHEGEIMKALLAHERKVGQRPSFHHEHEPNSDNESDASASDDEYSQPQ 357
Query: 349 XXXXPYKLKDELAAVAEMEXXXXXXXXNAPS------VMVNGKTVPLTSVDDDVIAQMTP 402
P +D LA M VMV G+ VPL V ++I++M
Sbjct: 358 GRTEPPLSEDRLAFGTMMGDVTMGRHEEEEEEEEDMIVMVQGRPVPLDEVTPEMISEMDQ 417
Query: 403 TEKETYIQIYQEYYSHMYD 421
EK+ YI++ Q+ ++ MY+
Sbjct: 418 EEKDEYIRLAQQAHADMYE 436
>UniRef50_A7SES2 Cluster: Predicted protein; n=1; Nematostella
vectensis|Rep: Predicted protein - Nematostella
vectensis
Length = 421
Score = 266 bits (653), Expect = 6e-70
Identities = 152/423 (35%), Positives = 226/423 (53%), Gaps = 13/423 (3%)
Query: 3 EERYVTEVPSSLKQLARLVVRGFYTIEDALIVDMLV--RNPCMKEDDICELLKFERKMLR 60
E +TE+P LK+LA++VVRGFY E I+++L ++PC+KEDD+ +L++F+++ LR
Sbjct: 7 EPELLTEIPPVLKRLAKVVVRGFYDTEQVAIINVLTNAKHPCVKEDDLMDLVRFDKRQLR 66
Query: 61 ARISILKNDKFIQVRLKMETGLD-GKAQKVNYYFINYKTFVNVVKYKLDLMRKRLETEER 119
+ LKNDK I+ R+ E D G NYYFIN+K FVNVVKYKLD +RK++E++E+
Sbjct: 67 QALVRLKNDKLIKQRIHKEKAPDTGMTLTFNYYFINFKVFVNVVKYKLDHVRKKIESDEK 126
Query: 120 DATSRASFKCPACGKTFTDLEADQLYDMATQEFQCTFCSAVVDEDMSALPK-KDSRLLLA 178
A +R SF C C ++DLE D+L DM T +CT+CS +V+ED S + + + R LA
Sbjct: 127 QAKNRPSFVCSECHNKYSDLEVDRLIDMTTGLLKCTYCSGIVEEDTSEIQEGNNKRTSLA 186
Query: 179 KFNEQLETLYILLREVEGIKLAPEILEPEPVDINTIRGLTSKQSALRPGGEQWSGEATRN 238
KFNEQ+E ++ LLR+ E I LA ILEPEP D+ +I S G SG A+
Sbjct: 187 KFNEQIEPIFKLLRDSENINLADAILEPEP-DLTSINKHVSHSHI--SGHASRSGWASDR 243
Query: 239 QGMLVEETRVDVTIGDDKPARDAGALRKERPVWMVESTIASNEQSDSAHSTDXXXXXXXX 298
GM + + +++G + A A KE P+WM +ST+ S ++ + +T
Sbjct: 244 YGMEATDLGIKISMGAEDEAGAANKRVKEAPIWMKQSTVHSAPEAVAQAATPSAASTSEK 303
Query: 299 XXXXXXXXGKEKNDDIMSVLLAHEKQPSAGNPVSNAVKGLXXXXXXXXXXXXXXPYKLKD 358
D++++ LLAHE A G
Sbjct: 304 HALDHST-----EDEVLADLLAHESVAKKPKLDIKAALGEDENSSSSDSDSEKASNATAP 358
Query: 359 ELAAVAE-MEXXXXXXXXNAPSVMVNGKTVPLTSVDDDVIAQMTPTEKETYIQIYQEYYS 417
V + M+ + VNGK V L V DD++ M+P E+E Y + +++ YS
Sbjct: 359 THDKVTDFMDAADSDDEEEEHKIKVNGKVVALGDVTDDMMKLMSPEEEEAYHKAFEQAYS 418
Query: 418 HMY 420
HMY
Sbjct: 419 HMY 421
>UniRef50_Q5C3D9 Cluster: SJCHGC08129 protein; n=1; Schistosoma
japonicum|Rep: SJCHGC08129 protein - Schistosoma
japonicum (Blood fluke)
Length = 185
Score = 175 bits (427), Expect = 1e-42
Identities = 82/144 (56%), Positives = 109/144 (75%)
Query: 1 MTEERYVTEVPSSLKQLARLVVRGFYTIEDALIVDMLVRNPCMKEDDICELLKFERKMLR 60
++ ++ T+VP L +L R +VR FY E +LIVDMLVRN MKEDD+CE L+FERK LR
Sbjct: 42 LSNQKDSTKVPVCLIKLVRSIVRTFYLREHSLIVDMLVRNTIMKEDDLCERLRFERKQLR 101
Query: 61 ARISILKNDKFIQVRLKMETGLDGKAQKVNYYFINYKTFVNVVKYKLDLMRKRLETEERD 120
+ LK D+FI+ +L++ET DGK K+ +YFI+YK FVNVVKY+LD M++RLE E+R
Sbjct: 102 QYLHTLKCDQFIKSKLQLETDADGKTTKITHYFIDYKLFVNVVKYRLDQMQRRLEAEQRQ 161
Query: 121 ATSRASFKCPACGKTFTDLEADQL 144
+TSRASFKC +C T+TDLE D+L
Sbjct: 162 STSRASFKCSSCNTTYTDLEVDRL 185
>UniRef50_O62513 Cluster: Putative uncharacterized protein; n=2;
Caenorhabditis|Rep: Putative uncharacterized protein -
Caenorhabditis elegans
Length = 433
Score = 150 bits (363), Expect = 8e-35
Identities = 86/300 (28%), Positives = 161/300 (53%), Gaps = 11/300 (3%)
Query: 2 TEERYVTEVPSSLKQLARLVVRGFYTIEDALIVDMLVRNPCMKEDDICELLKFERKMLRA 61
TE + V E+P +L + +VV+ F++ E +IV ++R C++E+++ ++F++KMLR
Sbjct: 15 TETQVVDEIPEALNTILLMVVKNFFSSEHFIIVYHIMRAQCIREENLKARIQFDQKMLRQ 74
Query: 62 RISILKNDKFIQVRLKMETGLDGKAQKVNYYFINYKTFVNVVKYKLDLMRKRLETEERDA 121
++ LK +K ++ R + + + + +Y+INY+ +NVV+YK+D MR++LE+ E+
Sbjct: 75 LLASLKAEKLVKERTITQKNENNRTVSIIFYYINYRAVLNVVRYKIDHMRQKLESREQMD 134
Query: 122 TSRASFKCPACGKTFTDLEADQLYDMATQEFQCTFCSAVVDEDMSALPKKDSRLLLAKFN 181
T+RA ++C AC ++ LE +++ D + C C V D + +P + +R +A+FN
Sbjct: 135 TNRAHYRCGACQSSYDMLEINRILDAESGRLICWRCHGDVLADETVVPSRTTRTAVARFN 194
Query: 182 EQLETLYILLREVEGIKLAPEILEPE---PVDINTIRGLTSKQSALRPGGE-----QWSG 233
EQ+ L+ + + GI+LAP +LEP+ ++ + L +Q GG Q G
Sbjct: 195 EQMTPLFSHICALNGIQLAPHLLEPDITKYLEDDKELQLQQQQMDFTSGGGGGGRIQLGG 254
Query: 234 EATRNQGML-VEETRVDVTIGDDKPARDAGALRKER--PVWMVESTIASNEQSDSAHSTD 290
A Q + + D D + G + + + P W+ ++ I E S + H D
Sbjct: 255 VAHSYQNIASINYQNGDAVFVDLNADINKGPVEEAKIMPEWLKDNAIGGGEASHNEHVLD 314
>UniRef50_Q6C0A8 Cluster: Yarrowia lipolytica chromosome F of strain
CLIB122 of Yarrowia lipolytica; n=1; Yarrowia
lipolytica|Rep: Yarrowia lipolytica chromosome F of
strain CLIB122 of Yarrowia lipolytica - Yarrowia
lipolytica (Candida lipolytica)
Length = 382
Score = 102 bits (245), Expect = 2e-20
Identities = 72/278 (25%), Positives = 130/278 (46%), Gaps = 18/278 (6%)
Query: 13 SLKQLARLVVRGFYTIEDALIVDMLVRNPCMKEDDICELLKFERKMLRARISILKNDKFI 72
++K+L + V RGFY + L++D L+++ + ++++ +LL K +R + LK+DK +
Sbjct: 3 NIKRLLQYVTRGFYDTKSILVMDALLKHVVLSDEELHQLLSIPAKEIRQICAKLKDDKLL 62
Query: 73 QVRLKMET-----GLDGKAQKVNYYFINYKTFVNVVKYKLDLMRKRLETEERDATSRASF 127
+ + E G + QK YY+I+Y ++ +K+K+ M K+ E + +
Sbjct: 63 KDHTQREQQENTYGYNKNYQK-TYYYIHYTVTIDAIKWKVHSMNKQAEEALGKKSQPQGY 121
Query: 128 KCPACGKTFTDLEADQLYDMATQEFQCTFCSAVVDEDMSALPKKDSRLLLAKFNEQLETL 187
CP C K FT LEA M F C CS V+ ED ++ + + L K +Q++ +
Sbjct: 122 VCPLCTKKFTTLEAVTNVQM-DGTFTCDVCSTVIVEDTTSDESRVHQDRLEKLMQQIKPI 180
Query: 188 YILLREVEGIKLAPEILEPE-----PVDINTIRGLTSKQSALRPGGEQWSGEATRNQGML 242
LR+++ I++A E P + G T S+++ G + T +
Sbjct: 181 IDELRKIDDIQVAENTFETSLAKAVPAQLEVTAGST---SSIKTAGRGTTSSQTGSNKSS 237
Query: 243 VEETRVDVTIGDDKPARDAGALRKER---PVWMVESTI 277
+ D+ RD A E+ P W +EST+
Sbjct: 238 TLTVNLSTGAEDEAAQRDEKAKLAEQNALPAWYMESTV 275
>UniRef50_Q4P883 Cluster: Putative uncharacterized protein; n=1;
Ustilago maydis|Rep: Putative uncharacterized protein -
Ustilago maydis (Smut fungus)
Length = 433
Score = 101 bits (241), Expect = 5e-20
Identities = 61/194 (31%), Positives = 102/194 (52%), Gaps = 4/194 (2%)
Query: 12 SSLKQLARLVVRGFYTIEDALIVDMLVRNPCMKEDDICELLKFERKMLRARISILKNDKF 71
S+++++A++V R FY +++D LV + D + L + K L A S L DK
Sbjct: 25 SAIRRMAQIVARIFYDDRHIVLMDQLVSITVLPADVLAHRLGIQVKELAALSSKLLEDKL 84
Query: 72 IQV--RLKMETGLDGKAQKVNYYFINYKTFVNVVKYKLDLMRKRLETEERDATSRASFKC 129
I R ++ + ++ YY++++K F++V K+++ +RK+++T R+ + C
Sbjct: 85 ICTFRRNEIRDTITNRSVPRTYYYLDFKLFLDVTKWRMMSIRKKIDTRLRNELDNKGYVC 144
Query: 130 PACGKTFTDLEADQLYDMATQEFQCTF--CSAVVDEDMSALPKKDSRLLLAKFNEQLETL 187
P C K+++ LE L DM F C CS + ++ A K S+ L +FNEQL TL
Sbjct: 145 PRCKKSYSTLEVAHLLDMFRNVFVCDTPGCSTELVDNEEAEDVKRSKDSLMRFNEQLSTL 204
Query: 188 YILLREVEGIKLAP 201
LR EGI L P
Sbjct: 205 LGGLRRTEGITLPP 218
>UniRef50_P36100 Cluster: Transcription initiation factor IIE
subunit alpha; n=5; Saccharomycetales|Rep: Transcription
initiation factor IIE subunit alpha - Saccharomyces
cerevisiae (Baker's yeast)
Length = 482
Score = 96.3 bits (229), Expect = 1e-18
Identities = 50/192 (26%), Positives = 100/192 (52%)
Query: 14 LKQLARLVVRGFYTIEDALIVDMLVRNPCMKEDDICELLKFERKMLRARISILKNDKFIQ 73
+K L + VVRGFY L++D ++ + + EDD+ +LL + L I+ L++D+ I
Sbjct: 9 VKNLLKFVVRGFYGGSFVLVLDAILFHSVLAEDDLKQLLSINKTELGPLIARLRSDRLIS 68
Query: 74 VRLKMETGLDGKAQKVNYYFINYKTFVNVVKYKLDLMRKRLETEERDATSRASFKCPACG 133
+ + E + K+ + YY++ Y ++ +K+K+ + +RL+ + + + CP C
Sbjct: 69 IHKQREYPPNSKSVERVYYYVKYPHAIDAIKWKVHQVVQRLKDDLDKNSEPNGYMCPICL 128
Query: 134 KTFTDLEADQLYDMATQEFQCTFCSAVVDEDMSALPKKDSRLLLAKFNEQLETLYILLRE 193
+T LEA QL + EF C+ C + ED S K+ + L + +Q++ + L++
Sbjct: 129 TKYTQLEAVQLLNFDRTEFLCSLCDEPLVEDDSGKKNKEKQDKLNRLMDQIQPIIDSLKK 188
Query: 194 VEGIKLAPEILE 205
++ ++ E
Sbjct: 189 IDDSRIEENTFE 200
>UniRef50_A2QZ57 Cluster: Contig An12c0110, complete genome; n=9;
Eurotiomycetidae|Rep: Contig An12c0110, complete genome
- Aspergillus niger
Length = 453
Score = 91.9 bits (218), Expect = 3e-17
Identities = 71/282 (25%), Positives = 130/282 (46%), Gaps = 21/282 (7%)
Query: 17 LARLVVRGFYTIEDALIVDMLVRNPCMKEDDICELLKFERKMLRARISILKNDKFIQVRL 76
L R VVR FY L+VD L + + +D+ LL ++K LR + L+ D+ I V
Sbjct: 7 LIRSVVRAFYETRHILVVDALFIHSVLHAEDLAFLLGMQQKDLRKLCAKLREDRLISVST 66
Query: 77 KMETGLDGKAQKVN--YYFINYKTFVNVVKYKLDLMRKRLETEERDATSRASFKCPACGK 134
+ E DG + VN YY+I V+ +KYK+ + ++ + + R + C CG
Sbjct: 67 RAEI-RDGSTRPVNREYYYIPLHPVVDAIKYKVSKLTSTIKAQYTPSQERKEYICLRCGS 125
Query: 135 TFTDLEADQLYDMATQEFQCTFCSAVVD--EDMSALPKKD--SRLLLAKFNEQLETLYIL 190
+T+L+ LY + + F+C C A+++ ED+ D +K QL+T+ L
Sbjct: 126 EWTELDVLSLY--SEEGFECQNCGAILERTEDVKGAEGIDRTGHEKNSKLMNQLDTMLKL 183
Query: 191 LREVEGIKLAPEILE---PEPVDINTIRGLTSKQSALRPGGEQWSGEATRNQGMLVEETR 247
L++++ +++ P + +D+ + ++A+ +Q N
Sbjct: 184 LKQIDSVEIPPNDFDTAWDHKIDVVRNQQTHPTRAAVIVPSKQQQEAVRGNTKTDATALE 243
Query: 248 VDVTIGDDKPARD--AGALRKER-------PVWMVESTIASN 280
+ +T ++K A + A A RK PVW ST++++
Sbjct: 244 ISLTSSEEKSAAEQAAEAARKAAVEKQNALPVWHTHSTVSTS 285
>UniRef50_Q557M8 Cluster: Transcription factor IIE; n=5;
Dictyostelium discoideum|Rep: Transcription factor IIE -
Dictyostelium discoideum AX4
Length = 456
Score = 89.4 bits (212), Expect = 2e-16
Identities = 51/186 (27%), Positives = 99/186 (53%), Gaps = 3/186 (1%)
Query: 14 LKQLARLVVRGFYTIEDALIVDMLVRNPC-MKEDDICELLKFERKMLRARISILKNDKFI 72
L L ++V+R FY E A+I+D L+R +K++D+ L+ ++K +R + LK D +
Sbjct: 11 LDDLVKMVIRAFYPDEYAVIIDGLLREKKRIKDEDLALRLRIQQKYVRKILMDLKGDSMV 70
Query: 73 QVR-LKMET-GLDGKAQKVNYYFINYKTFVNVVKYKLDLMRKRLETEERDATSRASFKCP 130
+ +K+E G + + ++I+YK +++VKYKL + RK++E+ + ++KC
Sbjct: 71 KSSDVKVEAKGPNERGSTHLLWYIDYKHIIDIVKYKLYMFRKKMESVKVQKIDVQTYKCQ 130
Query: 131 ACGKTFTDLEADQLYDMATQEFQCTFCSAVVDEDMSALPKKDSRLLLAKFNEQLETLYIL 190
C K +T L+ +L +M T C C ++E+++ + + QL +
Sbjct: 131 TCHKVYTALDIPKLLNMDTGALACEICDGELEEELNNESLTQTAKHQSDLFSQLRKIIEQ 190
Query: 191 LREVEG 196
L++ EG
Sbjct: 191 LKKTEG 196
>UniRef50_A7TS52 Cluster: Putative uncharacterized protein; n=1;
Vanderwaltozyma polyspora DSM 70294|Rep: Putative
uncharacterized protein - Vanderwaltozyma polyspora DSM
70294
Length = 506
Score = 86.6 bits (205), Expect = 1e-15
Identities = 47/192 (24%), Positives = 98/192 (51%)
Query: 14 LKQLARLVVRGFYTIEDALIVDMLVRNPCMKEDDICELLKFERKMLRARISILKNDKFIQ 73
+K L + VVRGFY L++D ++ + + E+D+ +LL + L I+ L++D +
Sbjct: 9 VKNLLKFVVRGFYGGSYILVLDAILYHSVLAEEDLKQLLGINKTDLGPLIARLRSDGLLS 68
Query: 74 VRLKMETGLDGKAQKVNYYFINYKTFVNVVKYKLDLMRKRLETEERDATSRASFKCPACG 133
+ E + K+ + Y++I + ++ +K+K+ + +RL+ + + + CP C
Sbjct: 69 THKQREYPPNSKSIERVYFYIKFPHAIDAIKWKVHQVVQRLKDDLDKFSEPNGYMCPICL 128
Query: 134 KTFTDLEADQLYDMATQEFQCTFCSAVVDEDMSALPKKDSRLLLAKFNEQLETLYILLRE 193
+T LEA QL + EF C+ C + ED S K+ + L + +Q++ + L++
Sbjct: 129 TKYTQLEAVQLLNFDRTEFLCSLCDEPLVEDDSGKKNKEKQDRLNRLMDQVQPIIDYLKK 188
Query: 194 VEGIKLAPEILE 205
++ ++ E
Sbjct: 189 IDDSRIEENTFE 200
>UniRef50_A5DK07 Cluster: Putative uncharacterized protein; n=1;
Pichia guilliermondii|Rep: Putative uncharacterized
protein - Pichia guilliermondii (Yeast) (Candida
guilliermondii)
Length = 380
Score = 85.8 bits (203), Expect = 2e-15
Identities = 49/200 (24%), Positives = 94/200 (47%), Gaps = 1/200 (0%)
Query: 6 YVTEVPSSLKQLARLVVRGFYTIEDALIVDMLVRNPCMKEDDICELLKFERKMLRARISI 65
Y ++ +++ L R V RGFY LI+D L+ + + E+D+ LL + K +RA+
Sbjct: 32 YSPQMEDTVRSLLRFVARGFYDKAVVLIIDALIVHSVLSEEDLVYLLGMKPKEVRAQCYR 91
Query: 66 LKNDKFIQVRLKMETGLDGKAQKVNYYFINYKTFVNVVKYKLDLMRKRLETEERDATSRA 125
L +D+ I + E + +V YY+I+ ++ +K+K+ + ++ E +
Sbjct: 92 LVDDRIILSHFQREESQNRVFNRV-YYYIHVTKAIDAIKWKVHFLVHTMKEEMTQYGNPQ 150
Query: 126 SFKCPACGKTFTDLEADQLYDMATQEFQCTFCSAVVDEDMSALPKKDSRLLLAKFNEQLE 185
+ CP CGK + L+A L F+C C + ED S+ + L K Q++
Sbjct: 151 GYLCPRCGKRVSQLDAISLLSADRTHFECDTCGGTLTEDDSSQQAYMRQARLEKLMVQVD 210
Query: 186 TLYILLREVEGIKLAPEILE 205
+ L+ ++ + + E
Sbjct: 211 PVIAHLKRIDDMTIQENTFE 230
>UniRef50_Q9HEM3 Cluster: Related to transcription initiation factor
IIE chain TFA1; n=2; Sordariales|Rep: Related to
transcription initiation factor IIE chain TFA1 -
Neurospora crassa
Length = 353
Score = 84.2 bits (199), Expect = 6e-15
Identities = 49/190 (25%), Positives = 101/190 (53%), Gaps = 6/190 (3%)
Query: 15 KQLARLVVRGFYTIEDALIVDMLVRNPCMKEDDICELLKFERKMLRARISILKNDKFIQV 74
K L R V+R FY+ ++ LI+D LV++ C+++DD+ L+K K L + L++ +F+ V
Sbjct: 5 KTLIRCVMRAFYSTQEILIIDALVQHSCLRDDDLGHLMKLGNKDLHKACAGLRDARFLVV 64
Query: 75 RLKMETGLDGKAQKVN--YYFINYKTFVNVVKYKLDLMRKRLETEERDATSRASFKCPAC 132
+ E GK + N YY+I+Y+ ++ +K+++ K ++ + + + CP
Sbjct: 65 HTRPELQA-GKTRPQNKTYYYIDYRQTIDAIKWRVYKTDKDMQGIAKPSEENKEYVCPRV 123
Query: 133 GKTFTDLEADQLYDMATQEFQCTFCSAVVDEDMSALPKKDSRLLLAKFNEQLETLYILLR 192
G + + L ++++ F C C AV+++ + L++ N Q + + +L+
Sbjct: 124 GCEAQWSQMEVLDSVSSRGFTCQRCGAVLEQAKER--EAPGHQQLSRMNNQFKFMTDMLQ 181
Query: 193 EVEGIKLAPE 202
EV+ + + PE
Sbjct: 182 EVDKV-VIPE 190
>UniRef50_Q6BLR4 Cluster: Debaryomyces hansenii chromosome F of
strain CBS767 of Debaryomyces hansenii; n=4;
Saccharomycetales|Rep: Debaryomyces hansenii chromosome
F of strain CBS767 of Debaryomyces hansenii -
Debaryomyces hansenii (Yeast) (Torulaspora hansenii)
Length = 397
Score = 83.0 bits (196), Expect = 1e-14
Identities = 46/183 (25%), Positives = 89/183 (48%)
Query: 13 SLKQLARLVVRGFYTIEDALIVDMLVRNPCMKEDDICELLKFERKMLRARISILKNDKFI 72
+++ L R V RGFY+ LI+D ++ + + EDD+ LL RK LR+ + L D+ +
Sbjct: 4 TIRSLIRFVARGFYSKPYVLILDAVLLHSVLSEDDLIYLLSIHRKELRSLCNKLVEDRLL 63
Query: 73 QVRLKMETGLDGKAQKVNYYFINYKTFVNVVKYKLDLMRKRLETEERDATSRASFKCPAC 132
++ E + Y++I+ ++ +K+K+ + ++ E + + CP C
Sbjct: 64 VNHIQKEENAQQRLITRTYFYIHTTEAIDSIKWKVHSIVNIIKEEMTHYGNPQGYVCPRC 123
Query: 133 GKTFTDLEADQLYDMATQEFQCTFCSAVVDEDMSALPKKDSRLLLAKFNEQLETLYILLR 192
GK + L+A L F+C C V+ ED S+ + L K Q++ + L+
Sbjct: 124 GKKVSQLDAISLLSDDKTNFECDNCGGVLIEDDSSKQASLRQAKLEKLMNQVDPVISYLK 183
Query: 193 EVE 195
+++
Sbjct: 184 KID 186
>UniRef50_Q8SRL9 Cluster: TRANSCRIPTION INITIATION FACTOR TFIIE
ALPHA SUBUNIT; n=1; Encephalitozoon cuniculi|Rep:
TRANSCRIPTION INITIATION FACTOR TFIIE ALPHA SUBUNIT -
Encephalitozoon cuniculi
Length = 291
Score = 82.2 bits (194), Expect = 2e-14
Identities = 51/182 (28%), Positives = 88/182 (48%), Gaps = 5/182 (2%)
Query: 14 LKQLARLVVRGFYTIEDALIVDMLVRNPCMKEDDICELLKFERKMLRARISILKNDKFIQ 73
+ L + VVR FY +I D+L+R + + ++CE +K K + I L+ DK I+
Sbjct: 7 MNDLIKKVVRKFYEPHHVVIADILLRKTLLYDTELCERMKMLSKEVNRLIIKLREDKIIK 66
Query: 74 VRLKMETGLDGKAQKVNYYFINYKTFVNVVKYKLDLMRKRLETEERDATSRASFKCPACG 133
K+E+ D Y+INY +V+KYK+ M K LE + A + C CG
Sbjct: 67 YETKVESREDNGQILRTVYYINYAEVRDVIKYKIFKMTKNLENNIKMAQVE-GYVCMECG 125
Query: 134 KTFTDLEADQLYDMATQEFQCTFCSAVVDEDMSALPKKDSRLLLAKFNEQLETLYILLRE 193
K ++ L+A L M F+C C + E+ + ++ + +L+ + +LL+E
Sbjct: 126 KEYSSLDAQCL--MENYVFKCEDCKGDLVENKK--DRSADCMMYSNLMSELDDIVLLLKE 181
Query: 194 VE 195
+
Sbjct: 182 TD 183
>UniRef50_A4R1J8 Cluster: Putative uncharacterized protein; n=2;
Sordariomycetes|Rep: Putative uncharacterized protein -
Magnaporthe grisea (Rice blast fungus) (Pyricularia
grisea)
Length = 418
Score = 74.1 bits (174), Expect = 6e-12
Identities = 53/194 (27%), Positives = 98/194 (50%), Gaps = 14/194 (7%)
Query: 15 KQLARLVVRGFYTIEDALIVDMLVRNPCMKEDDICELLKFERKMLRARISILKNDKFIQV 74
+ L R VVR FY + L+VD LV + +++DD+ L+ K L LK D+F+ V
Sbjct: 5 RTLVRSVVRAFYDTKHILVVDALVIHSALRDDDLAYLMNMNTKDLHKLCGRLKEDRFLTV 64
Query: 75 RLKMETGLDGKAQKVN--YYFINYKTFVNVVKYKLDLMRKRLETEERDATSRASFKC--P 130
+ E +G+ + VN YYFI+Y+ ++ +K+++ + K+++ A R + C
Sbjct: 65 HTRPEL-KEGQQRPVNRMYYFIDYRQTIDAIKWRVYTVDKQMQGVTVPADERKEYFCLRV 123
Query: 131 ACGKTFTDLEADQLYDMATQEFQCTFCSAVV--DEDMSALPKKDSRLLLAKFNEQLETLY 188
C K ++ +E L + + F C C +V+ D D K S + N+Q + +
Sbjct: 124 GCKKEYSLMEV--LDKPSARGFLCHDCGSVLKHDPDGGGGGHKQS----TRMNDQFKFIT 177
Query: 189 ILLREVEGIKLAPE 202
+L +++ + + PE
Sbjct: 178 GMLPQIDSV-VVPE 190
>UniRef50_Q5KK86 Cluster: Transcription initiation factor TFIIE
alpha subunit, putative; n=1; Filobasidiella
neoformans|Rep: Transcription initiation factor TFIIE
alpha subunit, putative - Cryptococcus neoformans
(Filobasidiella neoformans)
Length = 416
Score = 70.9 bits (166), Expect = 6e-11
Identities = 33/120 (27%), Positives = 66/120 (55%), Gaps = 3/120 (2%)
Query: 83 DGKAQKVNYYFINYKTFVNVVKYKLDLMRKRLETEERDATSRASFKCPACGKTFTDLEAD 142
D + + V+Y++++Y+ F NV KY+L +MRK ++ + + ++CP G+ + L+
Sbjct: 110 DSRTRDVHYWYLDYREFANVTKYRLAMMRKGIDERIKSEVGQRGYQCPQDGRVYDTLDVG 169
Query: 143 QLYDMATQEFQCTFCSA-VVDEDMSALPKKDSRL--LLAKFNEQLETLYILLREVEGIKL 199
L+D T F+C C A +++ D + + +S L ++ +FN + L+ VE + L
Sbjct: 170 HLFDPTTSTFRCEDCQAELIEHDPTIDQENNSSLQDMMQRFNIATAPIRDALKAVEVLTL 229
>UniRef50_A6SDF4 Cluster: Putative uncharacterized protein; n=2;
Sclerotiniaceae|Rep: Putative uncharacterized protein -
Botryotinia fuckeliana B05.10
Length = 449
Score = 70.9 bits (166), Expect = 6e-11
Identities = 46/193 (23%), Positives = 89/193 (46%), Gaps = 7/193 (3%)
Query: 15 KQLARLVVRGFYTIEDALIVDMLVRNPCMKEDDICELLKFERKMLRARISILKNDKFIQV 74
+ L R +R FY + L++D LV + +++DD+ L+ K L LK D+F+ V
Sbjct: 5 QMLVRSCMRSFYDTKHILVIDALVIHSALRDDDLAYLMSINTKELHKLCGKLKEDRFLAV 64
Query: 75 RLKMETGLDGKAQKVN--YYFINYKTFVNVVKYKLDLMRKRLETEERDATSRASFKCPAC 132
+ E +G+ + +N YYFI+Y+ ++ +K+++ + K ++ R + CP C
Sbjct: 65 HSRPEI-KEGQQRPINRTYYFIDYRATIDAIKWRVFQIDKAVQGNTVPDDERKEYFCPRC 123
Query: 133 GKTFTDLEADQLYDMATQEFQCTFCSAVVDEDMSALPKKDSRLLLAKFNEQLETLYILLR 192
+T E L + F C C ++ D + +K N Q + + LL
Sbjct: 124 KSEWTMFEV--LDKRNYEGFLCHKCDYLLVHDPD--NNRGGHEQSSKLNAQFKFITDLLP 179
Query: 193 EVEGIKLAPEILE 205
+++ + + E
Sbjct: 180 KIDQVVIPANTFE 192
>UniRef50_Q0U5E8 Cluster: Putative uncharacterized protein; n=1;
Phaeosphaeria nodorum|Rep: Putative uncharacterized
protein - Phaeosphaeria nodorum (Septoria nodorum)
Length = 412
Score = 64.9 bits (151), Expect = 4e-09
Identities = 51/205 (24%), Positives = 92/205 (44%), Gaps = 11/205 (5%)
Query: 15 KQLARLVVRGFYTIEDALIVDMLVRNPCMKEDDICELLKFER--KMLRARISILKNDKFI 72
KQL R VVR FY IE +++D L + + +D+ +L+ + K + + LK
Sbjct: 9 KQLVRTVVRMFYEIEHVVVMDALCYHGALPVNDLVLVLEAGKNTKHVGKIVGKLKEAGMC 68
Query: 73 QVRLK--METGLDGKAQKVNYYFINYKTFVNVVKYKLDLMRKRLETEERDATSRASFKCP 130
V + M G + + YY+I+Y+ ++ KYK+ + + ++ + +A KC
Sbjct: 69 AVYTQQVMREGATKQTSR-EYYYIDYRRAIDACKYKIHKIDEMVKKNAKPTAEKAELKCA 127
Query: 131 ACGKTFTDLEA----DQLYDMATQEFQCTFCSAVVDE-DMSALPKKDSRLLLAKFNEQLE 185
C +T ++ D + F C C +DE D + AKFN+
Sbjct: 128 RCRSQYTTMDVLNSIDPEPSAESSGFLCLRCGHPLDEIDAGGQADDMADDTPAKFNKMFS 187
Query: 186 TLYILLREVEGIKLAPEILEPEPVD 210
L L+ E++ +K+ P + + VD
Sbjct: 188 PLLNLMAEIDQMKI-PHVEGKDAVD 211
>UniRef50_A4RRX3 Cluster: Predicted protein; n=2; Ostreococcus|Rep:
Predicted protein - Ostreococcus lucimarinus CCE9901
Length = 424
Score = 59.3 bits (137), Expect = 2e-07
Identities = 70/327 (21%), Positives = 128/327 (39%), Gaps = 32/327 (9%)
Query: 32 LIVDMLVRNPCMKEDDICELLKFERKMLRARISILKNDKFI---QVRLKMETGLDGKAQK 88
++VD L R +EDD+ LK K +R ++ L+ +K + VR K + A++
Sbjct: 77 VVVDALTRRTWTREDDLANDLKLSFKQVRKLLTYLEREKLVSRAHVREKDKARAQALAER 136
Query: 89 -----------VNYYFINYKTFVNVVKYKLDLMRKRLETEERDATSRASFKCPA----CG 133
V++ +NY ++V +Y+L +M+K+ + C CG
Sbjct: 137 GITDAPDVKKTVSWCSLNYGRALDVTRYRLHVMKKQARQRVDKGKIHEVYVCVGPTEFCG 196
Query: 134 KTFTDLEADQLYDMATQEFQCTFCSAVV---DEDMSALPKKDSRLLLAKFNEQLETLYIL 190
K ++ L+A L D F+C+ C V +D + P+ +R + + L
Sbjct: 197 KIYSSLDAAALLDPVEMVFKCSNCGCEVRQAGKDGAPEPEGAARETKESLQRRYDELERQ 256
Query: 191 LREVEGIKLAPEILEPEPV-DINTIRGLTSKQSALRPGGEQWSGEATRNQGML---VEET 246
+ +E +LA + P P+ T + K+ + GG G R L +EET
Sbjct: 257 FKPLEA-QLAKAMKTPAPLYGTLTEWAVARKRYSQNKGGGGGGGGGRRGGAALDVHLEET 315
Query: 247 RVDVTIGDDKPARDAGA---LRKERPVWMVESTIASNEQSDSAHSTDXXXXXXXXXXXXX 303
+ V +G + A K +P W+ + E+ D+A
Sbjct: 316 KFVVELGSTAEEQQKQAEIDAIKVQPEWVTRNQF---EKEDAAQGAAANGEDAKDAAAKP 372
Query: 304 XXXGKEKNDDIMSVLLAHEKQPSAGNP 330
+ + + ++L A +KQ P
Sbjct: 373 TADNEVQEQYLQALLSALQKQQGGKKP 399
>UniRef50_Q9ZVS9 Cluster: F15K9.12 protein; n=7; Magnoliophyta|Rep:
F15K9.12 protein - Arabidopsis thaliana (Mouse-ear
cress)
Length = 506
Score = 57.6 bits (133), Expect = 6e-07
Identities = 74/305 (24%), Positives = 131/305 (42%), Gaps = 61/305 (20%)
Query: 31 ALIVDMLVRNPCMKEDDICELLKFERKMLRARISILKNDKFIQVRLKMETG--------- 81
A+++D L R ++E+D+ + L+ K LR I + + +K I + ET
Sbjct: 51 AVVLDALARRQWVREEDLAKDLQLHAKQLRKIIRLFEEEKLIMRDHRKETAKGAKMYSAA 110
Query: 82 ----LDGKAQ-KV-----NYYFINY------KTFV---NVVKYKLDLMRKRLETEERDAT 122
DG+A+ KV +Y ++Y +F+ +VV+++L M+KRL+ E D
Sbjct: 111 VAATTDGRAEDKVKLHTHSYCCLDYAQARFISSFLQICDVVRFRLHRMKKRLKDELEDKN 170
Query: 123 SRASFKCPACGKTFTDLEADQLYDMATQEFQCTFCSA-------------VVDEDMSALP 169
+ + CP C + + L+A +L M F C C+ VVD D +A
Sbjct: 171 TVQEYGCPNCQRKYNALDALRLISMVDDSFHCENCNGELVVECNKLTSEEVVDGDDNARR 230
Query: 170 KKDSRL--LLAKFNEQLETLYILLREVEGIKLAPEILEPEPVDINTIRGLTSKQSALRPG 227
++ L +L K Q++ L L V+ + P P + + A R
Sbjct: 231 RRRENLKNMLQKLEVQMKPLMDQLNRVKDL--------PIPEFGSFLAWEARAAMAAREN 282
Query: 228 GEQWSGEATRNQG-------MLVEETRVDVTIGD---DKPARDAGALRKERPVWMVESTI 277
G+ + R+QG + ET+V+V +GD D ++ + K P WM++ +
Sbjct: 283 GDLNPNDPLRSQGGYGSTPMPFLGETKVEVNLGDGNEDVKSKGGDSSLKVLPPWMIKEGM 342
Query: 278 ASNEQ 282
E+
Sbjct: 343 NLTEE 347
>UniRef50_Q9P3W1 Cluster: Transcription initiation factor IIE
subunit alpha; n=1; Schizosaccharomyces pombe|Rep:
Transcription initiation factor IIE subunit alpha -
Schizosaccharomyces pombe (Fission yeast)
Length = 448
Score = 57.6 bits (133), Expect = 6e-07
Identities = 45/209 (21%), Positives = 92/209 (44%), Gaps = 22/209 (10%)
Query: 7 VTEVPSSLKQLARLVVRGFYTIEDALIVDMLVRNPCMKEDDICELLKFER---------- 56
++ P +++L ++++R FY + +D ++R+ I +L F R
Sbjct: 1 MSNAPEIVQRLIKMIMRAFYETRHIIFMDAILRHSAYVLHKIRTILTFGRLTDEQTALLM 60
Query: 57 ----KMLRARISILKNDKF--IQVRLKMETGLDGKAQKVNYYFINYKTFVNVVKYKLDLM 110
K R L+ D+ IQ R +M+ G + Y++I++ + ++ +K+++ +
Sbjct: 61 GIPIKECRFIAGKLREDRLLAIQSRTEMKEGQQ-RQYHTTYFYIDFCSTIDSIKWRMHQL 119
Query: 111 RKRLETEERDATSRASFKCPACGKTFTDLEADQLYDMATQE--FQCTFCSAVVDEDMSAL 168
K +E R+ + CP C K F+ L+ + + T E F C C + +D +
Sbjct: 120 VKTVEDRMRNDFDSKGYVCPFCNKKFSSLD---VLSLVTNEGTFACNVCGTELKDDEESA 176
Query: 169 PKKDSRLLLAKFNEQLETLYILLREVEGI 197
S+ L K Q+ + L+ V+ I
Sbjct: 177 EMMSSQKRLGKLMGQVNGIIDALKRVDEI 205
>UniRef50_A2FXI4 Cluster: TFIIE alpha subunit family protein; n=1;
Trichomonas vaginalis G3|Rep: TFIIE alpha subunit family
protein - Trichomonas vaginalis G3
Length = 334
Score = 54.8 bits (126), Expect = 4e-06
Identities = 65/278 (23%), Positives = 123/278 (44%), Gaps = 34/278 (12%)
Query: 7 VTEVPSSLKQLARLVVRGFYTIEDALIVD-MLVRNPCMKEDDICELLKFERKMLRARISI 65
+ + +K+L R + FY E +I++ L+ + M +I E L ++K++ I I
Sbjct: 1 MASITQQMKELVRKITYMFYEKEKVMIMEGFLMFDEPMTLKEIGEKLHLQKKIIDDCIGI 60
Query: 66 LKNDKFIQVRLKMET-GLD---------GKAQKVN----YYFINYKTFVNVVKYKLDLMR 111
L+ D I + ++ G D QK N YY I+YK F + V+ K+ L++
Sbjct: 61 LRRDGMISSKQTLDLEGWDLTKKPISQMSDTQKKNRTVYYYAIDYKVFCDSVRLKIQLVK 120
Query: 112 KRLETEERDATSRASFKCPACGKTFTDLEADQLYDMATQEFQCTFCSAVV-DEDMSALPK 170
L+ + S++C +C TF +E ++L + + F C C + D D S
Sbjct: 121 NHLK-KLCGKADNISYRCESCHTTFNFIEINRLSEYS--NFTCPECGGTLKDLDNSDEVN 177
Query: 171 KDSRLLLAKFNEQLETLYILLREVEGIKLAPEILEPE------PVDINTIRGLTSKQSAL 224
+ R K+ E ++R++ + ++ + E + P + T+ +K+ +
Sbjct: 178 ANKR----KYEEFCHLTDDIIRQITNL-VSNMVFEDDFDFRTNPAKLMTLESYNAKEKYI 232
Query: 225 RPGGEQWSGEATRNQGMLVEETR--VDVTIGDDKPARD 260
+ E SG+ R + + + + VDV D PA D
Sbjct: 233 KENAE--SGKIIRAEEKVDTDKQNVVDVKTVDIGPAID 268
>UniRef50_Q7RRA7 Cluster: Putative uncharacterized protein PY00824;
n=1; Plasmodium yoelii yoelii|Rep: Putative
uncharacterized protein PY00824 - Plasmodium yoelii
yoelii
Length = 369
Score = 51.6 bits (118), Expect = 4e-05
Identities = 39/144 (27%), Positives = 65/144 (45%), Gaps = 8/144 (5%)
Query: 21 VVRGFYTIEDALIVDMLVRNPCM-KEDDICELLKFERKMLRARISILKNDKFIQVRLKME 79
V R F E+ +I DM V N C+ E DI + + +R+ +S L +K+I K +
Sbjct: 23 VSRFFMNDEEIIIFDMFVHNECLYLEKDIISSINMNEQKIRSILSKLLKEKYIIQVQKYK 82
Query: 80 TGLDGKAQKVNYYFINYKTFVNVVKYKLDLMRKRLETEERDATSRASFKCPACGKTFTDL 139
G + Y NY V V+ Y++ M L+ ++ + + C C T++ L
Sbjct: 83 NNEKGSYYQTYYCLNNY--IVYVIDYRIKQMELELQKKKNECD---VYICKFCNATYSQL 137
Query: 140 EADQL-YDMATQEFQCTFCSAVVD 162
+A L D F C FC+ ++
Sbjct: 138 DAQILPLDPYDAHFLC-FCNNKIE 160
>UniRef50_Q9SVG6 Cluster: Putative uncharacterized protein
F21C20.160; n=1; Arabidopsis thaliana|Rep: Putative
uncharacterized protein F21C20.160 - Arabidopsis
thaliana (Mouse-ear cress)
Length = 416
Score = 47.2 bits (107), Expect = 8e-04
Identities = 30/110 (27%), Positives = 56/110 (50%), Gaps = 9/110 (8%)
Query: 32 LIVDMLVRNPCMKEDDICELLKFERKMLRARISILKNDKFI-QVRLKMETGL--DGKAQ- 87
L++D L R ++E+D+ + LK K LR + + +FI +V K ++ +G+ +
Sbjct: 34 LVLDALTRRQWVREEDLAKELKLNTKQLRTILRYFEEQQFIMRVHRKEKSSATTNGRGED 93
Query: 88 --KVNYYF---INYKTFVNVVKYKLDLMRKRLETEERDATSRASFKCPAC 132
KV+ Y ++Y +V++YKL M+K + D + + CP C
Sbjct: 94 KVKVHMYSYCCLDYSQIYDVIRYKLHRMKKEFKDVLEDKDNVQEYGCPNC 143
>UniRef50_Q4WT96 Cluster: C2H2 transcription factor, putative; n=5;
Trichocomaceae|Rep: C2H2 transcription factor, putative
- Aspergillus fumigatus (Sartorya fumigata)
Length = 570
Score = 41.5 bits (93), Expect = 0.040
Identities = 24/65 (36%), Positives = 30/65 (46%), Gaps = 3/65 (4%)
Query: 95 NYKTFVNVVKYKLDLMRKRLETEERDATSRASFKCPACGKTFTD---LEADQLYDMATQE 151
NYK F V KL + LE ER +KC CGK+FTD L+ +
Sbjct: 432 NYKNFKCSVCGKLFARQATLERHERSHRGEKPYKCTECGKSFTDSSELKTHSRTHTGEKP 491
Query: 152 FQCTF 156
F+CTF
Sbjct: 492 FKCTF 496
>UniRef50_Q5CFE9 Cluster: Transcription initiation factor iie, alpha
subunit; n=2; Cryptosporidium|Rep: Transcription
initiation factor iie, alpha subunit - Cryptosporidium
hominis
Length = 503
Score = 37.9 bits (84), Expect = 0.50
Identities = 61/338 (18%), Positives = 125/338 (36%), Gaps = 28/338 (8%)
Query: 102 VVKYKLDLMRKRLETEERDATSRASFKCPACGKTFTDLEADQLYDMATQE--FQCTFCSA 159
VV+++ + + + ++ E +DA + C C ++ LEA L D+ + F C FC+
Sbjct: 162 VVEWQYNTIIREIDQEIKDAVNLDELMCNRCNAKYSSLEALSL-DLNPDDGLFLCRFCNE 220
Query: 160 VVDEDMSALPKKDSRLLLAKFNEQLETLYILLREVEG--IKLAPEILEPEPVDINTIRGL 217
+ SA + ++ + QL+ L L V+ I + P ++ N +
Sbjct: 221 KLKSVDSASFRNAAKDKAERVRSQLQILSNSLELVKSMHIPVFPPYQSKNDINFNKTKLP 280
Query: 218 TSKQSALRPGGEQWSGEATRNQGMLVEETRVDVTIGDDK-PARDAGALRKE--RPVWMVE 274
S ++ + + T+N L ET ++I ++ P + K P +
Sbjct: 281 NSLENDGNIDSKISDDQITKNS--LSNETSTPISIDNNSSPISNQSNFNKNTNHPSSGIR 338
Query: 275 STIASNEQSDSAHSTDXXXXXXXXXXXXXXXXGKEKNDDIMSVLLAHEKQPSAGNPV--- 331
++ +++ S S+ S + I+ +H + S N +
Sbjct: 339 NSNSTSSISSSS-SPQVLPQNTISKVKFGIKLSAKSTSSIIGTSNSHYSKLSGNNAIKQE 397
Query: 332 -SNAVKGLXXXXXXXXXXXXXXPYKLKDELAAVAEMEX----XXXXXXXNAPSVM----- 381
SN + + D + ++ + +P+ +
Sbjct: 398 NSNTISSSNLSNNFTNSSNIKITGIITDNIKSIDQESNRHTPTTASSSTKSPAKIEEPTF 457
Query: 382 ----VNGKTVPLTSVDDDVIAQMTPTEKETYIQIYQEY 415
+ K +T +DDD+I QMT TE Y ++ Q+Y
Sbjct: 458 SVSAIKDKVFKITEIDDDIINQMTDTEYLKYDELLQQY 495
>UniRef50_A0DI32 Cluster: Chromosome undetermined scaffold_51, whole
genome shotgun sequence; n=1; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_51,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 363
Score = 37.9 bits (84), Expect = 0.50
Identities = 31/126 (24%), Positives = 69/126 (54%), Gaps = 9/126 (7%)
Query: 1 MTEERYVTEVPSSLKQLARLVVRGFYTIEDALIVDML-VRNPCMK--EDDICELLKFER- 56
+TE++ +VP +LKQ + + Y +D L+V +++ +K + D+ +++K+ +
Sbjct: 30 LTEKKVTFKVPLNLKQYRQKLQNLIYFQQDHLLVKSTPIKHKLLKFLQTDLQKMIKYMKI 89
Query: 57 --KMLRARISILKNDKFIQVRLKMETGLDGKAQKVNYYFINYKTFVNVVKYKLDLMRKRL 114
K++ A +SILKN I + +++ + K++ ++ I + +++K K M + L
Sbjct: 90 NFKLMPA-LSILKNQVIINSQNQLDLTIFWKSRDIS--TIQSQILRSIIKQKQYSMEELL 146
Query: 115 ETEERD 120
E E +D
Sbjct: 147 EIEPKD 152
>UniRef50_Q2RAU3 Cluster: Transcription initiation factor IIE,
putative; n=5; Oryza sativa|Rep: Transcription
initiation factor IIE, putative - Oryza sativa subsp.
japonica (Rice)
Length = 334
Score = 37.5 bits (83), Expect = 0.66
Identities = 17/57 (29%), Positives = 29/57 (50%)
Query: 101 NVVKYKLDLMRKRLETEERDATSRASFKCPACGKTFTDLEADQLYDMATQEFQCTFC 157
+VV+Y++ MRK+L+ D + + CP C + ++ +A QL F C C
Sbjct: 4 DVVRYRIHRMRKKLKDGLDDRDTVQHYVCPNCKRRYSAFDALQLVSDMDDYFHCEHC 60
>UniRef50_Q5F437 Cluster: Putative uncharacterized protein; n=58;
Tetrapoda|Rep: Putative uncharacterized protein - Gallus
gallus (Chicken)
Length = 490
Score = 36.7 bits (81), Expect = 1.1
Identities = 19/47 (40%), Positives = 23/47 (48%), Gaps = 3/47 (6%)
Query: 114 LETEERDATSRASFKCPACGKTFT---DLEADQLYDMATQEFQCTFC 157
L +R T +KCP CGK FT DL QL A + F+C C
Sbjct: 213 LSYHQRIHTGERPYKCPECGKGFTGSSDLSRHQLIHTAERPFKCHEC 259
>UniRef50_Q7Q1Z2 Cluster: ENSANGP00000020855; n=3;
Endopterygota|Rep: ENSANGP00000020855 - Anopheles
gambiae str. PEST
Length = 526
Score = 36.7 bits (81), Expect = 1.1
Identities = 19/50 (38%), Positives = 26/50 (52%), Gaps = 3/50 (6%)
Query: 111 RKRLETEERDATSRASFKCPACGKTFTDLE-ADQLYDMATQE--FQCTFC 157
++ L R T + ++CP CGKTFT E + T E FQCT+C
Sbjct: 11 KEHLTNHVRQHTGESPYRCPYCGKTFTRKEHLTNHVRLHTGETPFQCTYC 60
>UniRef50_Q4UIR5 Cluster: Transcription factor TFIIE, putative; n=2;
Theileria|Rep: Transcription factor TFIIE, putative -
Theileria annulata
Length = 398
Score = 36.3 bits (80), Expect = 1.5
Identities = 52/207 (25%), Positives = 83/207 (40%), Gaps = 21/207 (10%)
Query: 13 SLKQLARLVVRGFYTIEDALIVDM-LVRNPCMKEDDICELLKFERKMLRARISILKNDKF 71
+ L R F+ E+ +IVD+ L + E D+ + L LR +S L+
Sbjct: 29 TFSSLLECCTRLFFCDEEIVIVDLFLATERAISEKDLEDELGLPENRLREHLSRLERHGI 88
Query: 72 I-QVRLKMETGLDGKAQKV-------------NYYFINYKTFVNVVKYKLDLMRKRLETE 117
+ + T + K Q+ Y+ IN + V+ YKL M + L+ +
Sbjct: 89 LTRFSNTSVTNIFQKPQRAYKKDSTRESSSTHTYWRINNHVII-VIHYKLTKMEEILQQK 147
Query: 118 ERDATSRASFKCPACGKTFTDLEADQL-YDMATQEFQCTFCSAVVDEDMSALPKKDSRLL 176
+ F CP C T+ L L D F C + V +D SA K+D
Sbjct: 148 LKGLYESDKFICPKCESTYDSLTVQTLEMDGFDAHFICKCGTKVELDDRSA--KEDIYSS 205
Query: 177 LAK-FNEQLETLYILLREVEGIKLAPE 202
K F EQ++ L L + G+++ PE
Sbjct: 206 QHKRFQEQVKNLKKCLYDAWGMEV-PE 231
>UniRef50_UPI0000DB6E7A Cluster: PREDICTED: similar to zinc finger
protein 560, partial; n=1; Apis mellifera|Rep: PREDICTED:
similar to zinc finger protein 560, partial - Apis
mellifera
Length = 1241
Score = 35.9 bits (79), Expect = 2.0
Identities = 29/113 (25%), Positives = 49/113 (43%), Gaps = 4/113 (3%)
Query: 48 ICELLKFERKMLRARISILKNDKFIQVRLKMETGLDGKAQKVNYYFINYKTFVNVVKYKL 107
IC + R LRA + + DK + +L + +A K +Y I+ + +V + K
Sbjct: 1006 ICGKEFYSRSRLRAHMIVHNKDKAVMCKLCSAYLSNAEALKTHYKNIHMQDYVCNICGKH 1065
Query: 108 DLMRKRLETEERDATSRASFKCPACG---KTFTDLEADQLYDMATQEFQCTFC 157
RK L + + + A FKC C KT L+ L ++++C C
Sbjct: 1066 VKSRKALHNHQ-NVHAAARFKCTLCPNVYKTSQILKEHLLKHEGIRKYKCNIC 1117
>UniRef50_A5K1L8 Cluster: Putative uncharacterized protein; n=1;
Plasmodium vivax|Rep: Putative uncharacterized protein -
Plasmodium vivax
Length = 1376
Score = 35.9 bits (79), Expect = 2.0
Identities = 22/62 (35%), Positives = 34/62 (54%), Gaps = 4/62 (6%)
Query: 49 CELLKFERKMLRARISILKNDKFIQVRLKMETGL----DGKAQKVNYYFINYKTFVNVVK 104
C+ K E K L +I IL+ + F ++ M +GL DGK NY + N K+F++V +
Sbjct: 1305 CKQYKNEIKNLMVQIKILREELFKCKQMIMASGLNGGNDGKVNPCNYNWTNTKSFLDVYE 1364
Query: 105 YK 106
K
Sbjct: 1365 KK 1366
>UniRef50_UPI0000F2EAC5 Cluster: PREDICTED: similar to Zinc finger
protein 628; n=2; Mammalia|Rep: PREDICTED: similar to
Zinc finger protein 628 - Monodelphis domestica
Length = 1125
Score = 35.5 bits (78), Expect = 2.6
Identities = 19/47 (40%), Positives = 23/47 (48%), Gaps = 3/47 (6%)
Query: 114 LETEERDATSRASFKCPACGKTFT---DLEADQLYDMATQEFQCTFC 157
L +R T F+CPAC KTFT +L Q A + F CT C
Sbjct: 611 LRQHQRVHTGERPFRCPACPKTFTHSSNLLLHQRTHSAERPFACTVC 657
>UniRef50_UPI00006A2359 Cluster: UPI00006A2359 related cluster; n=3;
Xenopus tropicalis|Rep: UPI00006A2359 UniRef100 entry -
Xenopus tropicalis
Length = 938
Score = 35.5 bits (78), Expect = 2.6
Identities = 23/77 (29%), Positives = 34/77 (44%), Gaps = 7/77 (9%)
Query: 110 MRKRLETEERDATSRASFKCPACGKTFTD---LEADQLYDMATQEFQCTFC----SAVVD 162
++ L +R T F C CGK+F+ L++ Q + F CT C S +
Sbjct: 271 LKNSLVRHQRVHTGEKPFTCTLCGKSFSTKCWLQSHQTVHTGEKPFACTVCGKRFSCEMG 330
Query: 163 EDMSALPKKDSRLLLAK 179
M+ LP+ S LL K
Sbjct: 331 SLMAGLPQTQSELLRIK 347
>UniRef50_UPI000065DC2F Cluster: Homolog of Homo sapiens "Zinc
finger protein 484; n=1; Takifugu rubripes|Rep: Homolog
of Homo sapiens "Zinc finger protein 484 - Takifugu
rubripes
Length = 249
Score = 35.5 bits (78), Expect = 2.6
Identities = 14/37 (37%), Positives = 22/37 (59%), Gaps = 3/37 (8%)
Query: 124 RASFKCPACGKTF---TDLEADQLYDMATQEFQCTFC 157
R S++CP CG+T+ T+L+ Q Y + F+C C
Sbjct: 192 RPSYECPECGRTYGRMTELKMHQRYHTGDKPFECACC 228
>UniRef50_A0LSB8 Cluster: ATPase, BadF/BadG/BcrA/BcrD type; n=2;
Acidothermus cellulolyticus 11B|Rep: ATPase,
BadF/BadG/BcrA/BcrD type - Acidothermus cellulolyticus
(strain ATCC 43068 / 11B)
Length = 299
Score = 35.5 bits (78), Expect = 2.6
Identities = 23/76 (30%), Positives = 37/76 (48%), Gaps = 2/76 (2%)
Query: 203 ILEPEPVD--INTIRGLTSKQSALRPGGEQWSGEATRNQGMLVEETRVDVTIGDDKPARD 260
+L P+ V + I+G + + L G + S EA + + +L ET V V + DD A
Sbjct: 46 LLHPDAVSRLVALIQGSRASAAGLGLAGIRGSQEAEKLRAVLAAETGVTVAVADDTEAAF 105
Query: 261 AGALRKERPVWMVEST 276
GA R E + ++ T
Sbjct: 106 LGAFRGEPGIIVIAGT 121
>UniRef50_Q17AY2 Cluster: Putative uncharacterized protein; n=1;
Aedes aegypti|Rep: Putative uncharacterized protein -
Aedes aegypti (Yellowfever mosquito)
Length = 835
Score = 35.5 bits (78), Expect = 2.6
Identities = 20/46 (43%), Positives = 24/46 (52%), Gaps = 3/46 (6%)
Query: 116 TEERDATSRASFKCPACGKTFTDLEADQLYDMATQEFQCTFCSAVV 161
+E +TS KCP C K F+ E DQL + EF C CSA V
Sbjct: 191 SETGTSTSDGELKCPKCEKVFS--EIDQL-ERHNCEFICNICSATV 233
>UniRef50_UPI00015BB057 Cluster: Transcription factor TFIIE, alpha
subunit; n=1; Ignicoccus hospitalis KIN4/I|Rep:
Transcription factor TFIIE, alpha subunit - Ignicoccus
hospitalis KIN4/I
Length = 173
Score = 35.1 bits (77), Expect = 3.5
Identities = 41/154 (26%), Positives = 75/154 (48%), Gaps = 20/154 (12%)
Query: 37 LVRNPCMKEDDICELLKFERKMLRARISILKNDKFIQVR-LKMETGLDGKAQKVNYYFIN 95
L+R + E+ + E L + +R ++ K +++ VR K+ DG + Y++++
Sbjct: 29 LMRYEDISEEQLTETLGMKPNDVRR--ALYKLERYGLVRNYKIRNENDGTY--IYYWYVD 84
Query: 96 YKTFV-NVVKYK---LDLMRKRLETEERDATSRASFKCPACGKTFTDLEADQLYDMATQE 151
+T N++K K L+ +++RLE+EE F CPACG F+ EA YD
Sbjct: 85 RETLNRNLLKIKKSVLEKLKRRLESEEDQY-----FYCPACGLQFSYEEA-MGYD----- 133
Query: 152 FQCTFCSAVVDEDMSALPKKDSRLLLAKFNEQLE 185
F C C ++ S KK ++ + E+++
Sbjct: 134 FTCPRCGEPLELAESNPRKKLLEKIVKRLEEEIK 167
>UniRef50_UPI000155C8D6 Cluster: PREDICTED: similar to novel KRAB
box containing protein; n=1; Ornithorhynchus
anatinus|Rep: PREDICTED: similar to novel KRAB box
containing protein - Ornithorhynchus anatinus
Length = 538
Score = 35.1 bits (77), Expect = 3.5
Identities = 20/61 (32%), Positives = 34/61 (55%), Gaps = 7/61 (11%)
Query: 108 DLMRKRLETEERDATSRASFKCPACGKTFTDLEADQLY-DMATQE---FQCTFCSAVVDE 163
+LM K L+T ++ + S++CPAC K F A +++ D ++ FQCT CS +
Sbjct: 347 NLMEKHLQTHKQ---IKGSYRCPACEKVFVTQSARRMHKDCHGRKPDLFQCTKCSCFYET 403
Query: 164 D 164
+
Sbjct: 404 E 404
>UniRef50_UPI000155491E Cluster: PREDICTED: hypothetical protein;
n=1; Ornithorhynchus anatinus|Rep: PREDICTED:
hypothetical protein - Ornithorhynchus anatinus
Length = 598
Score = 35.1 bits (77), Expect = 3.5
Identities = 32/138 (23%), Positives = 64/138 (46%), Gaps = 3/138 (2%)
Query: 67 KNDKFIQVRLKMETGLDGKAQKVNYYFINYKTFVNVVKYKLD-LMRKRLETEERDATSRA 125
KN+KF++V + L+ K +N + ++V +LD LM+ + + AT +A
Sbjct: 385 KNNKFLEVTKMKKQQLEEKKMSLNEEIREIQEKCSIVSGQLDKLMKTSQSVKAKAATMKA 444
Query: 126 SFKCPACGKTFTDLEADQLYD-MATQEFQCTFCSAVVDEDMSALPKKDSRLLLAKFNEQL 184
S++ K + ++L + F+ V+D+ +S L + R+ L + N
Sbjct: 445 SYELKLRQKANLEKSIEELKESFEFSNFKKEHGQMVIDQLLSDLESCEERIELEQ-NAFE 503
Query: 185 ETLYILLREVEGIKLAPE 202
+ L +++ IK+A E
Sbjct: 504 KLLQARQDDLKNIKIAQE 521
>UniRef50_Q9YAD5 Cluster: Transcription factor E; n=1; Aeropyrum
pernix|Rep: Transcription factor E - Aeropyrum pernix
Length = 189
Score = 35.1 bits (77), Expect = 3.5
Identities = 34/126 (26%), Positives = 60/126 (47%), Gaps = 13/126 (10%)
Query: 40 NPCMKEDDICELLKFERKMLRARISILKNDKFIQVRLKMETGLDGKAQKVNYYF-INYKT 98
N + +DD+ L +++ +R RI L DK I V K G + + Y++ I+ T
Sbjct: 44 NGGISDDDLESLTGYKQSDIR-RILRLLGDKRIIVSRK---GRHPRKEATRYFWRIDSDT 99
Query: 99 F-VNVVKYKLDLMRKRLETEERDATSRASFKCPACGKTFTDLEADQLYDMATQEFQCTFC 157
V+++ K ++ K + E D+ + + CP CG ++ EA T +F C C
Sbjct: 100 INVSLLTLKKKVLEKLVVKEAHDS-GNSYYTCPRCGSKYSFDEA------FTLDFTCPRC 152
Query: 158 SAVVDE 163
V++E
Sbjct: 153 GEVLEE 158
>UniRef50_Q9H0M5 Cluster: Zinc finger protein 700; n=149;
Eutheria|Rep: Zinc finger protein 700 - Homo sapiens
(Human)
Length = 742
Score = 35.1 bits (77), Expect = 3.5
Identities = 15/50 (30%), Positives = 23/50 (46%), Gaps = 3/50 (6%)
Query: 111 RKRLETEERDATSRASFKCPACGKTF---TDLEADQLYDMATQEFQCTFC 157
R + T+ERD T + C CGKTF + + + ++C FC
Sbjct: 178 RPSIRTQERDHTGEKPYACKVCGKTFIFHSSIRRHMVMHSGDGTYKCKFC 227
>UniRef50_UPI0000F1FD74 Cluster: PREDICTED: similar to zinc finger
protein 93; n=1; Danio rerio|Rep: PREDICTED: similar to
zinc finger protein 93 - Danio rerio
Length = 608
Score = 34.7 bits (76), Expect = 4.6
Identities = 20/55 (36%), Positives = 26/55 (47%), Gaps = 4/55 (7%)
Query: 106 KLDLMRKRLETEERDATSR---ASFKCPACGKTFTDLEADQLYDMATQEFQCTFC 157
K L+R LET R + + F C CGK+F DL A +L + F C C
Sbjct: 195 KFTLLRA-LETHLRKHSQKFEKKKFPCATCGKSFRDLAAHELVHAEVKPFTCETC 248
>UniRef50_A5VI23 Cluster: Transposase, IS605 OrfB family; n=6;
Lactobacillus|Rep: Transposase, IS605 OrfB family -
Lactobacillus reuteri F275
Length = 391
Score = 34.7 bits (76), Expect = 4.6
Identities = 20/73 (27%), Positives = 37/73 (50%), Gaps = 3/73 (4%)
Query: 95 NYKTFVNVVKYKLDLMRKRLETEERDATSRASFKCPACGKTFTDLEADQLYDMATQEFQC 154
++ V++++YK + K+L TS+ C CGK L ++L +A +E+ C
Sbjct: 310 SWSKLVDILQYKCNWYGKKLIQVNPSYTSQI---CANCGKNNHRLGLNKLEWLAVREWDC 366
Query: 155 TFCSAVVDEDMSA 167
C +D D++A
Sbjct: 367 PNCGKHLDRDINA 379
>UniRef50_Q0IQ75 Cluster: Os12g0140200 protein; n=1; Oryza sativa
(japonica cultivar-group)|Rep: Os12g0140200 protein -
Oryza sativa subsp. japonica (Rice)
Length = 431
Score = 34.7 bits (76), Expect = 4.6
Identities = 28/124 (22%), Positives = 56/124 (45%), Gaps = 20/124 (16%)
Query: 32 LIVDMLVRNPCMKEDDICELLKFERKMLRARISILKNDKFIQ-VRLKMETGLD------- 83
+++D L R + + + + LK ++K L + L+ F++ +K +TG +
Sbjct: 1 MVLDALTRYQWVPDTHLAKSLKVQKKKLCLILEFLEKQMFVRRCEVKAKTGRNVSNTATT 60
Query: 84 ------GKAQKVN------YYFINYKTFVNVVKYKLDLMRKRLETEERDATSRASFKCPA 131
+ +KV Y INY +VV+Y + M L+++ + + + CP
Sbjct: 61 AGVSAIPRNEKVKSKHPKWYCCINYAKICSVVRYHIMQMEANLKSQLENTNTVDKYTCPN 120
Query: 132 CGKT 135
CGK+
Sbjct: 121 CGKS 124
>UniRef50_Q5C1C7 Cluster: SJCHGC07628 protein; n=1; Schistosoma
japonicum|Rep: SJCHGC07628 protein - Schistosoma
japonicum (Blood fluke)
Length = 190
Score = 34.7 bits (76), Expect = 4.6
Identities = 17/52 (32%), Positives = 26/52 (50%), Gaps = 3/52 (5%)
Query: 109 LMRKRLETEERDATSRASFKCPACGKTF---TDLEADQLYDMATQEFQCTFC 157
++++ LE ER T FKC C KTF T+L+ + + F+C C
Sbjct: 63 VLKQALEGHERTHTGEKPFKCSYCDKTFSVGTNLKRHERVHTGEKPFKCDVC 114
>UniRef50_UPI0000F2DB49 Cluster: PREDICTED: similar to zinc finger
protein 75; n=1; Monodelphis domestica|Rep: PREDICTED:
similar to zinc finger protein 75 - Monodelphis
domestica
Length = 582
Score = 34.3 bits (75), Expect = 6.1
Identities = 17/51 (33%), Positives = 26/51 (50%), Gaps = 3/51 (5%)
Query: 110 MRKRLETEERDATSRASFKCPACGKTF---TDLEADQLYDMATQEFQCTFC 157
+RK +ET ++ T FKC CGK+F +DL Q + ++C C
Sbjct: 413 LRKLMETHKKAYTGEKPFKCHICGKSFKVSSDLIKHQRVHTEERPYRCQEC 463
>UniRef50_UPI0000F2D3EE Cluster: PREDICTED: similar to novel KRAB
box and zinc finger, C2H2 type domain containing
protein; n=1; Monodelphis domestica|Rep: PREDICTED:
similar to novel KRAB box and zinc finger, C2H2 type
domain containing protein - Monodelphis domestica
Length = 660
Score = 34.3 bits (75), Expect = 6.1
Identities = 17/54 (31%), Positives = 25/54 (46%), Gaps = 3/54 (5%)
Query: 110 MRKRLETEERDATSRASFKCPACGKTFTD---LEADQLYDMATQEFQCTFCSAV 160
M L +R T ++C CGK FT+ L A Q + + ++CT C V
Sbjct: 376 MSNNLPEHQRIHTGEKPYECTQCGKAFTEKGSLAAHQRIHIGEKPYECTQCGKV 429
>UniRef50_UPI0000D9C692 Cluster: PREDICTED: similar to zinc finger
protein 157; n=1; Macaca mulatta|Rep: PREDICTED: similar
to zinc finger protein 157 - Macaca mulatta
Length = 521
Score = 34.3 bits (75), Expect = 6.1
Identities = 20/83 (24%), Positives = 37/83 (44%), Gaps = 6/83 (7%)
Query: 110 MRKRLETEERDATSRASFKCPACGKTFTDLEADQLYDM---ATQEFQCTFCSAVVDEDMS 166
++ L +R T ++CP CGKTF + A Y++ ++C C + S
Sbjct: 405 VKSNLVVHQRTHTGEKPYRCPECGKTFYEKSALTKYELNHTGENPYECNKCRKTFSQ-RS 463
Query: 167 ALPKKDSRLLLAKFNEQLETLYI 189
+L K + + K + TL++
Sbjct: 464 SLTKHQRK--IHKKKTPINTLHV 484
>UniRef50_UPI00015A6608 Cluster: UPI00015A6608 related cluster; n=1;
Danio rerio|Rep: UPI00015A6608 UniRef100 entry - Danio
rerio
Length = 556
Score = 34.3 bits (75), Expect = 6.1
Identities = 18/53 (33%), Positives = 24/53 (45%), Gaps = 1/53 (1%)
Query: 106 KLDLMRKRLETEERDATSRASFKCPACGKTFTDLEADQLYDMA-TQEFQCTFC 157
K L ++LE +R F+C CGK F L Q + + EFQC C
Sbjct: 270 KTFLTSEKLEDHQRCHLGEKPFECEECGKCFVQLTNLQQHQRSHKSEFQCQMC 322
>UniRef50_Q4SC94 Cluster: Chromosome undetermined SCAF14659, whole
genome shotgun sequence; n=3; Clupeocephala|Rep:
Chromosome undetermined SCAF14659, whole genome shotgun
sequence - Tetraodon nigroviridis (Green puffer)
Length = 856
Score = 34.3 bits (75), Expect = 6.1
Identities = 16/47 (34%), Positives = 23/47 (48%), Gaps = 3/47 (6%)
Query: 114 LETEERDATSRASFKCPACGKTFT---DLEADQLYDMATQEFQCTFC 157
L + T A + C CGKTFT +L+ QL + +QC +C
Sbjct: 397 LNLHRKRHTGEARYTCRLCGKTFTTSGNLKRHQLVHSGEKPYQCDYC 443
>UniRef50_A3DDQ2 Cluster: Putative uncharacterized protein; n=2;
Clostridium|Rep: Putative uncharacterized protein -
Clostridium thermocellum (strain ATCC 27405 / DSM 1237)
Length = 136
Score = 34.3 bits (75), Expect = 6.1
Identities = 19/58 (32%), Positives = 28/58 (48%), Gaps = 2/58 (3%)
Query: 100 VNVVKYKLDLMRKRLETEERDATSRASFKCPACGKTFTDLEADQLYDMATQEFQCTFC 157
V+ + L+ + L EE D A +CP C K F DL D + D + F+C +C
Sbjct: 58 VDSIDEDLEEFARILFDEEEDDDVLAQIECPHCNKVF-DLTEDMI-DGDSDSFECPYC 113
>UniRef50_Q4R3I4 Cluster: Testis cDNA clone: QtsA-16729, similar to
human similar to hypothetical protein (LOC388506),; n=2;
Macaca|Rep: Testis cDNA clone: QtsA-16729, similar to
human similar to hypothetical protein (LOC388506), -
Macaca fascicularis (Crab eating macaque) (Cynomolgus
monkey)
Length = 435
Score = 34.3 bits (75), Expect = 6.1
Identities = 15/50 (30%), Positives = 23/50 (46%), Gaps = 3/50 (6%)
Query: 111 RKRLETEERDATSRASFKCPACGKTF---TDLEADQLYDMATQEFQCTFC 157
R T+ERD T + C CGKTF + ++ + ++C FC
Sbjct: 124 RPSFRTQERDHTGEKPYVCKECGKTFVFYSSIQRHMVIHKGDGPYKCKFC 173
>UniRef50_Q4V6Y6 Cluster: IP01303p; n=3; Sophophora|Rep: IP01303p -
Drosophila melanogaster (Fruit fly)
Length = 562
Score = 34.3 bits (75), Expect = 6.1
Identities = 22/83 (26%), Positives = 40/83 (48%), Gaps = 8/83 (9%)
Query: 84 GKAQKVNYYF-INYKTFVNVVKYKLDLMRKRLETEE------RDATSRASFKCPACGKTF 136
GK +V+Y ++ +T N+ Y + KR ++ + R +S F C AC KTF
Sbjct: 333 GKTFRVSYSLTLHLRTHTNIRPYVCTVCNKRFKSHQVYSHHLRIHSSERQFSCDACPKTF 392
Query: 137 -TDLEADQLYDMATQEFQCTFCS 158
T ++ + T+ ++C C+
Sbjct: 393 RTSVQLYAHKNTHTKPYRCAVCN 415
>UniRef50_A7AVU0 Cluster: Putative uncharacterized protein; n=1;
Babesia bovis|Rep: Putative uncharacterized protein -
Babesia bovis
Length = 420
Score = 34.3 bits (75), Expect = 6.1
Identities = 25/92 (27%), Positives = 39/92 (42%), Gaps = 2/92 (2%)
Query: 97 KTFVNVVKYKLDLMRKRLETEERDATSRASFKCPACGKTFTDLEADQL-YDMATQEFQCT 155
K F+ VV YKL+ M + L+ R F C C + L+ +L D F C
Sbjct: 130 KYFIVVVHYKLNKMEEILQQRRRSLHECDRFICGKCNAVYDSLDVQKLELDGFDAHFIC- 188
Query: 156 FCSAVVDEDMSALPKKDSRLLLAKFNEQLETL 187
+C + V+ D + + EQ++TL
Sbjct: 189 YCGSKVELDDTETKDSMYSSQQQRCEEQVKTL 220
>UniRef50_A5K7P3 Cluster: Putative uncharacterized protein; n=1;
Plasmodium vivax|Rep: Putative uncharacterized protein -
Plasmodium vivax
Length = 995
Score = 34.3 bits (75), Expect = 6.1
Identities = 21/77 (27%), Positives = 38/77 (49%), Gaps = 4/77 (5%)
Query: 45 EDDICELLKFERKMLRARISILKNDKFIQVRLKMETGLDGKAQKVNYYFIN--YKTFVNV 102
E+ + ++ + L L+N K V+L+M Q+ N Y++N YK F++
Sbjct: 100 ENILTQIYAYNENKLHRYSKYLENLK--NVKLEMFNDTINNIQRENIYYVNFVYKNFLSK 157
Query: 103 VKYKLDLMRKRLETEER 119
+ DL +K+L+ E R
Sbjct: 158 IDLLSDLNKKKLDKERR 174
>UniRef50_A3LWV1 Cluster: Predicted protein; n=2; Pichia|Rep:
Predicted protein - Pichia stipitis (Yeast)
Length = 1176
Score = 34.3 bits (75), Expect = 6.1
Identities = 18/68 (26%), Positives = 35/68 (51%)
Query: 163 EDMSALPKKDSRLLLAKFNEQLETLYILLREVEGIKLAPEILEPEPVDINTIRGLTSKQS 222
ED+ PKK +L + K N+ L L L + +K+ L+P+ + + I+ +S ++
Sbjct: 807 EDIIEKPKKSPQLSIPKVNDVLYQLASGLHYLHSLKIVHRDLKPQNILVADIKKTSSSKA 866
Query: 223 ALRPGGEQ 230
+P E+
Sbjct: 867 TTKPSEEE 874
>UniRef50_UPI0000F2E8AD Cluster: PREDICTED: similar to novel KRAB
box and zinc finger, C2H2 type domain containing
protein; n=1; Monodelphis domestica|Rep: PREDICTED:
similar to novel KRAB box and zinc finger, C2H2 type
domain containing protein - Monodelphis domestica
Length = 572
Score = 33.9 bits (74), Expect = 8.1
Identities = 16/50 (32%), Positives = 23/50 (46%), Gaps = 3/50 (6%)
Query: 111 RKRLETEERDATSRASFKCPACGKTFT---DLEADQLYDMATQEFQCTFC 157
R L +R T S++C CGKTFT +L Q + ++C C
Sbjct: 312 RSHLAVHQRKHTGEKSYECKQCGKTFTWRGNLAEHQRIHTGQKSYKCKHC 361
Score = 33.9 bits (74), Expect = 8.1
Identities = 19/59 (32%), Positives = 25/59 (42%), Gaps = 3/59 (5%)
Query: 111 RKRLETEERDATSRASFKCPACGKTFT---DLEADQLYDMATQEFQCTFCSAVVDEDMS 166
R L +R T + S+KC CGKTF L A Q + ++C C E S
Sbjct: 340 RGNLAEHQRIHTGQKSYKCKHCGKTFAMRGQLAAHQAVHSGEKSYECKQCGKAFAERAS 398
>UniRef50_UPI0000F1DD89 Cluster: PREDICTED: hypothetical protein;
n=1; Danio rerio|Rep: PREDICTED: hypothetical protein -
Danio rerio
Length = 333
Score = 33.9 bits (74), Expect = 8.1
Identities = 15/51 (29%), Positives = 24/51 (47%), Gaps = 3/51 (5%)
Query: 110 MRKRLETEERDATSRASFKCPACGKTFT---DLEADQLYDMATQEFQCTFC 157
+++ LE +R T F C CGK+F+ +L+ + F C FC
Sbjct: 92 LKQNLEVHKRTHTGEKPFSCQQCGKSFSQKQNLKVHMRVHTGEKPFSCPFC 142
>UniRef50_UPI0000588499 Cluster: PREDICTED: hypothetical protein;
n=1; Strongylocentrotus purpuratus|Rep: PREDICTED:
hypothetical protein - Strongylocentrotus purpuratus
Length = 763
Score = 33.9 bits (74), Expect = 8.1
Identities = 22/75 (29%), Positives = 36/75 (48%), Gaps = 6/75 (8%)
Query: 88 KVNYYFINYKTFVNVVKYKLDLMRKRLETEERDA-TSRASFKCPACGKTFTDLEADQLYD 146
K Y++ + F K ++RK++ E++ T A+FKC C K FT EA +
Sbjct: 544 KSTNYYLQQRPFKCRFCPKRYVLRKKVNEHEKECHTGEAAFKCTHCPKIFTS-EARMMDH 602
Query: 147 MATQE----FQCTFC 157
+ E ++CT C
Sbjct: 603 VKCHEQHRMYRCTLC 617
>UniRef50_Q4TGZ6 Cluster: Chromosome undetermined SCAF3363, whole
genome shotgun sequence; n=2; Tetraodon
nigroviridis|Rep: Chromosome undetermined SCAF3363,
whole genome shotgun sequence - Tetraodon nigroviridis
(Green puffer)
Length = 332
Score = 33.9 bits (74), Expect = 8.1
Identities = 17/51 (33%), Positives = 26/51 (50%), Gaps = 3/51 (5%)
Query: 111 RKRLETEERDATSRASFKCPACGKTF---TDLEADQLYDMATQEFQCTFCS 158
R+ +E + +SR F C CGK F T L Q A +E++C+ C+
Sbjct: 257 RQSVELHQVTHSSRKPFTCGVCGKAFKLLTGLRCHQRTHQALKEYRCSQCA 307
>UniRef50_Q4SH16 Cluster: Chromosome 8 SCAF14587, whole genome shotgun
sequence; n=2; Tetraodontidae|Rep: Chromosome 8
SCAF14587, whole genome shotgun sequence - Tetraodon
nigroviridis (Green puffer)
Length = 2317
Score = 33.9 bits (74), Expect = 8.1
Identities = 17/46 (36%), Positives = 23/46 (50%), Gaps = 3/46 (6%)
Query: 115 ETEERDATSRASFKCPACGKTF---TDLEADQLYDMATQEFQCTFC 157
+TE+ D+ S F C CGKTF LE QL + + +C C
Sbjct: 1370 DTEDSDSDSADYFPCHVCGKTFLTSESLEDHQLCHLGKKPHECAEC 1415
>UniRef50_Q4RT41 Cluster: Chromosome 12 SCAF14999, whole genome
shotgun sequence; n=4; Eumetazoa|Rep: Chromosome 12
SCAF14999, whole genome shotgun sequence - Tetraodon
nigroviridis (Green puffer)
Length = 1488
Score = 33.9 bits (74), Expect = 8.1
Identities = 38/152 (25%), Positives = 64/152 (42%), Gaps = 8/152 (5%)
Query: 139 LEAD-QLYDMATQEFQCTFCSAVVDEDMSALPKKDSRLLLAKFNEQLETLYI----LLRE 193
LEAD ++ A ++ + SAV + SAL K +SR +A Q +TL ++
Sbjct: 1085 LEADMKVLQTAKEQLEEQHKSAVEEISASALAKAESRSSIADLTAQKKTLQAERDEAAQQ 1144
Query: 194 VEGIKLAPEILEPEPVDINTIRGLTSK-QSALRPGGEQWSGEATRNQGMLVEETRVDVTI 252
+ +++ + + V++ +R SK L EQ E R + + E D+
Sbjct: 1145 IRQLQIQLKNASAKQVEMKELRAENSKYHEDLSASKEQLCTETQRTKSLCQEIE--DLKT 1202
Query: 253 GDDKPARDAGALRKERPVWMVESTIASNEQSD 284
D + AL+ E E IA QS+
Sbjct: 1203 ADSAKTQSLQALKDENDKLTQELDIAHGGQSE 1234
>UniRef50_Q8YQ19 Cluster: Alr4017 protein; n=11; Cyanobacteria|Rep:
Alr4017 protein - Anabaena sp. (strain PCC 7120)
Length = 418
Score = 33.9 bits (74), Expect = 8.1
Identities = 36/149 (24%), Positives = 63/149 (42%), Gaps = 5/149 (3%)
Query: 29 EDALIVDMLVRNPCMKEDD--ICELLKFERKMLRARISILKNDKFIQVRL-KMETGLDGK 85
E L+ L NP E + I EL + E + + ++ K +Q +L + +T L
Sbjct: 79 ETTLLGSELPNNPLYTEAEQRIAELQRTEAALTKEIANLQATYKILQGQLSETQTALGRI 138
Query: 86 AQKVNYYFINYKTFVNVVKYKLDLMRKRLETEERDATSRASFKCPACGKTFTDLEADQLY 145
Q+ K + + +L+ ++R+ E R + AS + F D A L
Sbjct: 139 VQESLAQLEQRKQALQISVEQLERRQERIRNEMRTTFAGASQDLAIRVQGFKDYLAGSLQ 198
Query: 146 DMATQEFQCTFCSAVVDEDMSALPKKDSR 174
D+A Q AVV+ + + P KD++
Sbjct: 199 DLAVSAEQLQLVPAVVEREKA--PVKDTK 225
>UniRef50_Q9VFB9 Cluster: CG6654-PA; n=2; Sophophora|Rep: CG6654-PA
- Drosophila melanogaster (Fruit fly)
Length = 639
Score = 33.9 bits (74), Expect = 8.1
Identities = 18/62 (29%), Positives = 28/62 (45%), Gaps = 1/62 (1%)
Query: 114 LETEERDATSRASFKCPACGKTFTDLEADQLYDMATQEFQCTFCSAVVDEDMSALPKKDS 173
L+ R T + CP C KTFT Q++ Q + C V +E+ ++P+ S
Sbjct: 539 LKNHRRTHTGERPYVCPFCSKTFTQRGDCQMHQRTHQGERIYIC-PVCNEEFKSMPEMRS 597
Query: 174 RL 175
L
Sbjct: 598 HL 599
>UniRef50_Q7QY61 Cluster: GLP_572_40344_41573; n=1; Giardia lamblia
ATCC 50803|Rep: GLP_572_40344_41573 - Giardia lamblia
ATCC 50803
Length = 409
Score = 33.9 bits (74), Expect = 8.1
Identities = 18/55 (32%), Positives = 26/55 (47%), Gaps = 2/55 (3%)
Query: 92 YFINYKTFVNVVKYKLDLMRKRLETEERDATS--RASFKCPACGKTFTDLEADQL 144
Y N K F+ Y LD+ +LE E +D S + C C K FT+ + +L
Sbjct: 140 YLFNLKKFLITSFYNLDIYVTKLEAELKDPISMGHVKYHCEYCDKRFTEADLYKL 194
>UniRef50_A0BLR1 Cluster: Chromosome undetermined scaffold_114,
whole genome shotgun sequence; n=1; Paramecium
tetraurelia|Rep: Chromosome undetermined scaffold_114,
whole genome shotgun sequence - Paramecium tetraurelia
Length = 550
Score = 33.9 bits (74), Expect = 8.1
Identities = 28/104 (26%), Positives = 49/104 (47%), Gaps = 8/104 (7%)
Query: 152 FQCTFCSAVVDEDMSALPKKDSRLLLAKFN-----EQLETLYILLREVEGIKLAPEILEP 206
F+ C +++ + + AL D L++ N + E L +L + E IKLA IL
Sbjct: 269 FRKAECLSLLGQQIEALEILDEILMVYPNNTDFLWRKAECLSLLGKHQEAIKLADVILNV 328
Query: 207 EPVDINTIRGLTSKQSALRPGGEQWSGEATRNQGMLVEETRVDV 250
P +NT L+ K L G Q N+G+L+++ +++
Sbjct: 329 NPKHVNT---LSRKAQCLSLLGLQVEAMIWINEGLLIDKNHINL 369
>UniRef50_Q2GW72 Cluster: Putative uncharacterized protein; n=1;
Chaetomium globosum|Rep: Putative uncharacterized
protein - Chaetomium globosum (Soil fungus)
Length = 1454
Score = 33.9 bits (74), Expect = 8.1
Identities = 32/135 (23%), Positives = 58/135 (42%), Gaps = 5/135 (3%)
Query: 134 KTFTDLEADQLYDMATQEFQCTFCSAVVDEDMSALPKKDSRLLLAKFNEQLETLYILLRE 193
K TD EA Q +D AT+ Q + +++ + L+AK +Q++ R
Sbjct: 657 KLHTDSEARQAFDEATETRQFAEAAMAERDELREKLAMGADGLVAKLQKQIDEQ---ARF 713
Query: 194 VEGIKLAPEILEPEPVDINTIRGLTSKQSALRPGGEQWSGEATRNQGMLVEETRVDVTIG 253
++ + E L+ E ++ T+R +++ L + Q + V +G
Sbjct: 714 IDAQRRQTEGLKSELENLQTLRAKEAQRYELET--RELYLMLRDAQDVAASNAAKGVKVG 771
Query: 254 DDKPARDAGALRKER 268
D+ PAR G L +ER
Sbjct: 772 DEDPARMQGILDRER 786
>UniRef50_Q57878 Cluster: 7-cyano-7-deazaguanine
tRNA-ribosyltransferase; n=7; Methanococcales|Rep:
7-cyano-7-deazaguanine tRNA-ribosyltransferase -
Methanococcus jannaschii
Length = 655
Score = 33.9 bits (74), Expect = 8.1
Identities = 18/62 (29%), Positives = 32/62 (51%), Gaps = 5/62 (8%)
Query: 139 LEADQLYDMATQEFQCTFCSAVVDEDMSALPKKDSRLLLAKFNEQLETLYILLREVEGIK 198
L +++ D+ C CS+ +++++L KK+ LLA+ N LY+ E+ IK
Sbjct: 268 LHLEEIKDLKAFPCSCPVCSSYTPKELASLNKKERERLLAEHN-----LYVTFEEINRIK 322
Query: 199 LA 200
A
Sbjct: 323 QA 324
Database: uniref50
Posted date: Oct 5, 2007 11:19 AM
Number of letters in database: 575,637,011
Number of sequences in database: 1,657,284
Lambda K H
0.315 0.132 0.368
Gapped
Lambda K H
0.279 0.0580 0.190
Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 403,068,338
Number of Sequences: 1657284
Number of extensions: 15087609
Number of successful extensions: 47830
Number of sequences better than 10.0: 74
Number of HSP's better than 10.0 without gapping: 30
Number of HSP's successfully gapped in prelim test: 44
Number of HSP's that attempted gapping in prelim test: 47634
Number of HSP's gapped (non-prelim): 216
length of query: 421
length of database: 575,637,011
effective HSP length: 103
effective length of query: 318
effective length of database: 404,936,759
effective search space: 128769889362
effective search space used: 128769889362
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.6 bits)
S2: 74 (33.9 bits)