BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 036605
(340 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|297743397|emb|CBI36264.3| unnamed protein product [Vitis vinifera]
Length = 637
Score = 370 bits (951), Expect = e-100, Method: Compositional matrix adjust.
Identities = 195/374 (52%), Positives = 248/374 (66%), Gaps = 47/374 (12%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVGKKRRKERGT-LLLALGTSNGDILAVDV 59
IW+T+ SL+AEWK PD + + YSC+ C FVGKKRRKERG LLLAL T++G+IL VD+
Sbjct: 32 IWNTSYQSLVAEWKGPDVDSGIGYSCMTCSFVGKKRRKERGACLLLALVTNDGNILVVDI 91
Query: 60 LTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPIS 119
GEMKW+S+ HPGG+ GL+FA KGR LHVVGTNGM S +KSE GE+I EFKAS+KPIS
Sbjct: 92 FAGEMKWESSRYHPGGIVGLSFANKGRILHVVGTNGMVSTLKSENGELIGEFKASKKPIS 151
Query: 120 SSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKH 179
S AF +EKI A+A S++R+LSLENG+E+LKF DD+ +Q + S+ A I+T+G+GEKH
Sbjct: 152 SLAFSSDEKILAIAGSKIRVLSLENGKELLKFPDDLESVQSIWISNDANTIVTSGFGEKH 211
Query: 180 LQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVS 239
LQVW CD++ KTV+KGP LSM+H PV +CK+ N +DG VIL+V+ SG AY W+LKT S
Sbjct: 212 LQVWNCDLNRKTVSKGPVLSMKHPPVVFECKHGWNEDDGLVILSVSSSGTAYLWNLKTTS 271
Query: 240 QDEKTNPAKITVKLKKADADQQNSVNVKKSRTSIIAARLNDLEADGQKT----------- 288
E NP KI VK K+A+ DQ++S + KKSRTSII ARLN L++DG T
Sbjct: 272 AQE-YNPTKIMVKTKEAEIDQKHSGSTKKSRTSIITARLNALQSDGWVTALVCYGSINSP 330
Query: 289 ----------------------------------ITGLHDLESEAATASAQNEKKSKKRA 314
+ DLE EA QN+K +KKRA
Sbjct: 331 QFNLLEISNPGEDIVVAATDNIVKTVAEAGRENGVLAGKDLELEAVAEPIQNKKSNKKRA 390
Query: 315 ASDPDLETTRDVVD 328
ASDPDL ++VD
Sbjct: 391 ASDPDLAAAENMVD 404
>gi|255553081|ref|XP_002517583.1| conserved hypothetical protein [Ricinus communis]
gi|223543215|gb|EEF44747.1| conserved hypothetical protein [Ricinus communis]
Length = 634
Score = 349 bits (896), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 171/291 (58%), Positives = 220/291 (75%), Gaps = 4/291 (1%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVL 60
IWST +GSLLAEWKQ D + V SYSC+AC FVGKKR++E T LLALGT GDILA D
Sbjct: 31 IWSTGNGSLLAEWKQSDDDSV-SYSCMACSFVGKKRKQEHSTFLLALGTHGGDILATDAF 89
Query: 61 TGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISS 120
TG+MKWKS+G HPGG+ GLAFA KGRSL VGTNG ASEM S+ GE+ EFKAS++PISS
Sbjct: 90 TGDMKWKSSGHHPGGVIGLAFANKGRSLRSVGTNGKASEMNSKSGELTVEFKASKRPISS 149
Query: 121 SAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHL 180
AF +E++ A SS R+LSL NG+E+ KF DD+G +Q++S S AK I+T G+GEK L
Sbjct: 150 LAFSSDEEVLAAVSSRARLLSLVNGKELFKFPDDLGTVQHISISKSAKNIVTTGFGEKIL 209
Query: 181 QVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPN--GEDGTVILAVAESGVAYSWDLKTV 238
+WRC++SSK+++ GP LSMRH P+ I+CK++ + EDG ++L+V+ESGV Y W+LKT
Sbjct: 210 HLWRCNLSSKSLSAGPVLSMRHPPLVIECKSTGDEKEEDGLIVLSVSESGVVYVWNLKTS 269
Query: 239 SQDEKTNPAKITVKLKKADADQQNSVNVKKSRTSIIAARLNDLEADGQKTI 289
SQDE NP KITVK K++ +Q N KK R SI+AA+L+D++ D Q T+
Sbjct: 270 SQDE-VNPTKITVKGSKSETNQHNGDYSKKHRPSILAAQLHDIKDDKQFTV 319
>gi|356528954|ref|XP_003533062.1| PREDICTED: WD repeat-containing protein 43-like [Glycine max]
Length = 620
Score = 291 bits (745), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 159/361 (44%), Positives = 221/361 (61%), Gaps = 37/361 (10%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVL 60
IW+TN G LLAEWK DG+ + YSC+AC F+GKKRRKE+GT LLALGT +G +LAVDV
Sbjct: 32 IWNTNTGHLLAEWKPSDGDHDIRYSCIACSFIGKKRRKEQGTCLLALGTIDGSVLAVDVS 91
Query: 61 TGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISS 120
TGE K T HPGG+ GL+FA KGR L +VG NG+A + +E GE+++EFK ++K I+S
Sbjct: 92 TGERKL--TTSHPGGICGLSFANKGRLLRIVGHNGVAYGVNTETGELLKEFKITKKSITS 149
Query: 121 SAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHL 180
AF +EK A+ SS R++S E GEE+LKF +D+G ++++S S AK ++T+ + KHL
Sbjct: 150 LAFSNDEKYLAIVSSRPRVISWEIGEEILKFPNDLGNVEHISISSDAKNLVTSDFEGKHL 209
Query: 181 QVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQ 240
QVW+CD++S + +GP L +RH P+ +DC + N ED V+LAV G AY W+L S+
Sbjct: 210 QVWKCDLNSGNLCRGPTLPIRHPPLVLDCHSGCNKED-VVVLAVTGRGSAYIWNLSASSK 268
Query: 241 DEKTNPAKITVKLKKADADQQNSVNVKKSRTSIIAARLNDLEADGQ-------------- 286
D+ P K+T + K + D++N V+ KK T IIA+RL +E + Q
Sbjct: 269 DQ-IQPTKLTTETKIVETDKENGVSSKKRHTLIIASRLQPVEENKQMKALVTYGSVDHPQ 327
Query: 287 ------------------KTITGLHDLESEAATASAQNEKKSKKR-AASDPDLETTRDVV 327
+ +S + A KK+KKR A SDPDL TT + V
Sbjct: 328 FSILNISNSGENIVLYVGDETDSVQQHDSPSGKAIPMESKKAKKRQATSDPDLPTTTNEV 387
Query: 328 D 328
D
Sbjct: 388 D 388
>gi|356522359|ref|XP_003529814.1| PREDICTED: WD repeat-containing protein 43-like [Glycine max]
Length = 622
Score = 284 bits (727), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 142/286 (49%), Positives = 196/286 (68%), Gaps = 4/286 (1%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVL 60
IW+TN G LLAEWK G+ + YSC+AC F+G+K RKE+GT LLAL T +G ++AVDV
Sbjct: 32 IWNTNTGHLLAEWKPSKGDHDIHYSCIACSFIGEKHRKEQGTSLLALSTIDGSVIAVDVS 91
Query: 61 TGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISS 120
TGE K T HPGG+ GL+FA KGR L +VG NG+A E+ +E GEV++EFK S+K I+S
Sbjct: 92 TGERKL--TTSHPGGICGLSFANKGRLLRIVGHNGVAYEVNTETGEVLKEFKISKKSITS 149
Query: 121 SAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHL 180
AF +EK A+ SS +RI+S E G+E+LKF +D+G +Q +S S AK ++T+ + KHL
Sbjct: 150 LAFSNDEKYLAIVSSRLRIISWEIGKEILKFPNDLGNVQLISISSDAKNLVTSDFEGKHL 209
Query: 181 QVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQ 240
QVW+CD++S V +GP L +RH P+ +DC + N ED V+LAV G AY W+L S+
Sbjct: 210 QVWKCDLNSGNVGRGPTLPIRHPPLILDCHSGCNKED-VVVLAVTGRGSAYIWNLNASSE 268
Query: 241 DEKTNPAKITVKLKKADADQQNSVNVKKSRTSIIAARLNDLEADGQ 286
D+ P K+ K K + +++N V+ KK SIIA+RL +E D Q
Sbjct: 269 DQ-IQPTKLNTKTKIVETEKENGVSSKKRHASIIASRLQPVEEDKQ 313
>gi|449436601|ref|XP_004136081.1| PREDICTED: WD repeat-containing protein 43-like [Cucumis sativus]
gi|449491125|ref|XP_004158808.1| PREDICTED: WD repeat-containing protein 43-like [Cucumis sativus]
Length = 624
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 140/279 (50%), Positives = 197/279 (70%), Gaps = 3/279 (1%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVL 60
IWST DGSLLAEWK DG+ YSC+AC F+GKKR+ ++A+GT++GD+LAV+
Sbjct: 32 IWSTRDGSLLAEWKDLDGKNDFGYSCMACCFLGKKRKS--SYCVVAIGTNSGDVLAVNAS 89
Query: 61 TGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISS 120
GE KW S G HPGG+ GL+FA KG L VG+NGMASEM +E G +I+EFKAS+K ISS
Sbjct: 90 NGEKKWVSAGCHPGGVIGLSFANKGCRLRTVGSNGMASEMDTETGNIIKEFKASKKSISS 149
Query: 121 SAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHL 180
SAF +E+ +A +++ILS ++G+E++ D +GP++ VS SD AK IIT+ G KHL
Sbjct: 150 SAFSLDERYLVVAGKKLKILSTDDGDELIVHPDKLGPVKLVSVSDDAKTIITSELGAKHL 209
Query: 181 QVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQ 240
QVW C+IS+ ++GP LSM+H P +C+N N ED V+L+V+ SG AY W LK +S+
Sbjct: 210 QVWWCNISAGKFSRGPILSMKHPPFVSECRNVSNQEDSVVVLSVSVSGAAYLWKLKVLSE 269
Query: 241 DEKTNPAKITVKLKKADADQQNSVNVKKSRTSIIAARLN 279
DE T P K++VK + ++N + KK+R S++A+R++
Sbjct: 270 DEVT-PTKVSVKANDNQSAEENHGSAKKNRASVLASRIH 307
>gi|359482669|ref|XP_003632804.1| PREDICTED: uncharacterized WD repeat-containing protein
all2124-like [Vitis vinifera]
Length = 234
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 119/192 (61%), Positives = 152/192 (79%), Gaps = 1/192 (0%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVGKKRRKERGT-LLLALGTSNGDILAVDV 59
IW+T+ SL+AEWK PD + + YSC+ C FVGKKRRKERG LLLAL T++G+IL VD+
Sbjct: 32 IWNTSYQSLVAEWKGPDVDSGIGYSCMTCSFVGKKRRKERGACLLLALVTNDGNILVVDI 91
Query: 60 LTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPIS 119
GEMKW+S+ HPGG+ GL+FA KGR LHVVGTNGM S +KSE GE+I EFKAS+KPIS
Sbjct: 92 FAGEMKWESSRYHPGGIVGLSFANKGRILHVVGTNGMVSTLKSENGELIGEFKASKKPIS 151
Query: 120 SSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKH 179
S AF +EKI A+A S++R+LSLENG+E+LKF DD+ +Q + S+ A I+T+G+GEKH
Sbjct: 152 SLAFSSDEKILAIAGSKIRVLSLENGKELLKFPDDLESVQSIWISNDANTIVTSGFGEKH 211
Query: 180 LQVWRCDISSKT 191
LQVW CD++ KT
Sbjct: 212 LQVWNCDLNRKT 223
>gi|147767919|emb|CAN62455.1| hypothetical protein VITISV_028174 [Vitis vinifera]
Length = 282
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 129/254 (50%), Positives = 167/254 (65%), Gaps = 31/254 (12%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVGKKRRKERGT-LLLALGTSNGDILAVDV 59
IW+T+ SL+AEWK PD + +SYSC+ C FVGKKRRKE G LLLAL T++G IL VD+
Sbjct: 32 IWNTSYQSLVAEWKGPDVDSSISYSCMTCSFVGKKRRKEHGACLLLALVTNDGSILVVDI 91
Query: 60 LTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPIS 119
GEMKW+S+ HPGG+ GL+FA KGR LHVVGTNGM S +KSE GE+IREFKAS+KPIS
Sbjct: 92 FAGEMKWESSRYHPGGIVGLSFANKGRILHVVGTNGMVSTLKSENGELIREFKASKKPIS 151
Query: 120 SSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKH 179
S AF +EKI A+A S++R+LSLENG+E+LKF +D+ +Q + S+ A I+T
Sbjct: 152 SLAFSSDEKILAIAGSKIRVLSLENGKELLKFPNDLEFVQSIWISNDANTIVT------- 204
Query: 180 LQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVS 239
H P +CK+ N + G VIL+V+ G+AY W+LKT S
Sbjct: 205 ----------------------HPPAVFECKHGWNEDGGLVILSVSSLGIAYLWNLKTTS 242
Query: 240 QDEKTNPAKITVKL 253
+ NP KI + L
Sbjct: 243 A-QGYNPTKIMLLL 255
>gi|147856060|emb|CAN80733.1| hypothetical protein VITISV_024551 [Vitis vinifera]
Length = 546
Score = 197 bits (500), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 97/201 (48%), Positives = 135/201 (67%), Gaps = 27/201 (13%)
Query: 49 TSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVI 108
T++G IL V++ GEMKW+S+G HPGG+ L+FA KGR LHVVGTNGM
Sbjct: 165 TNDGSILVVNIFAGEMKWESSGYHPGGIVRLSFANKGRILHVVGTNGM------------ 212
Query: 109 REFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAK 168
F ++KI A+ASS++R+LSLENG+E+LKF DD+ +Q + S+ A
Sbjct: 213 --------------FFRDKKILAIASSKIRVLSLENGKELLKFPDDLESVQSIWISNDAN 258
Query: 169 IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESG 228
I+T+G+GEKHLQVW D++SKT++KG LSM+H PV +CK+ N +DG VIL+V+ SG
Sbjct: 259 TIVTSGFGEKHLQVWNYDLNSKTISKGHVLSMKHLPVVFECKHGWNEDDGLVILSVSSSG 318
Query: 229 VAYSWDLKTVSQDEKTNPAKI 249
+ Y W+LKT + ++ NP KI
Sbjct: 319 ITYLWNLKT-TLAQRYNPTKI 338
>gi|115438873|ref|NP_001043716.1| Os01g0649000 [Oryza sativa Japonica Group]
gi|55296883|dbj|BAD68336.1| unknown protein [Oryza sativa Japonica Group]
gi|113533247|dbj|BAF05630.1| Os01g0649000 [Oryza sativa Japonica Group]
gi|125571385|gb|EAZ12900.1| hypothetical protein OsJ_02821 [Oryza sativa Japonica Group]
Length = 607
Score = 187 bits (475), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 105/285 (36%), Positives = 165/285 (57%), Gaps = 14/285 (4%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVL 60
+W T G ++ W P P SYSC+AC V KK +K+ +L+A+GT+NG +L +D
Sbjct: 34 VWRTGGGEIIEGWTDPISAPDDSYSCIACCSVQKKHKKDGNLILVAVGTTNGQVLVLDS- 92
Query: 61 TGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISS 120
TG + WK+ H + L FA+ GR L+ G +G+ E+ S GE KA++KPI+S
Sbjct: 93 TGVI-WKN-APHTCKVVSLHFARHGRVLYTAGMDGIICELNSRTGESKDTIKATKKPINS 150
Query: 121 SAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHL 180
+EK ++S R+ S+ +E+L+ DVGP+Q +S SD + +++ K +
Sbjct: 151 FTLSHDEKFMGVSSKITRLFSVSEKKEILRIPSDVGPVQLMSVSDDGRFLVSHVDNNKEV 210
Query: 181 QVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQ 240
QVW CD +S T+ +L+M++ P ++C S + DG ++LAV++ GVA+ W L+T+SQ
Sbjct: 211 QVWSCDQNSCTIVSTASLTMQNQPKIVECTRSTSYGDGGIVLAVSKKGVAHVWHLQTLSQ 270
Query: 241 DEKTNPAKITVKLKKADADQQNSVNVKKSRTSIIAARLNDLEADG 285
+E P KI+VK NS++ KK R II+A+L D D
Sbjct: 271 NE-VLPTKISVK---------NSLD-KKGRIPIISAKLCDTNEDN 304
>gi|125527065|gb|EAY75179.1| hypothetical protein OsI_03069 [Oryza sativa Indica Group]
Length = 607
Score = 187 bits (475), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 105/285 (36%), Positives = 165/285 (57%), Gaps = 14/285 (4%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVL 60
+W T G ++ W P P SYSC+AC V KK +K+ +L+A+GT+NG +L +D
Sbjct: 34 VWRTGGGEIIEGWTDPISAPDDSYSCIACCSVQKKHKKDGNLILVAVGTTNGQVLVLDS- 92
Query: 61 TGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISS 120
TG + WK+ H + L FA+ GR L+ G +G+ E+ S GE KA++KPI+S
Sbjct: 93 TGVI-WKN-APHTCKVVSLHFARHGRVLYTAGMDGIICELNSRTGESKDTIKATKKPINS 150
Query: 121 SAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHL 180
+EK ++S R+ S+ +E+L+ DVGP+Q +S SD + +++ K +
Sbjct: 151 FTLSHDEKFMGVSSKITRLFSVSEKKEILRIPSDVGPVQLMSVSDDGRFLVSHVDNNKEV 210
Query: 181 QVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQ 240
QVW CD +S T+ +L+M++ P ++C S + DG ++LAV++ GVA+ W L+T+SQ
Sbjct: 211 QVWSCDQNSCTIVSTASLTMQNQPKIVECTRSTSYGDGGIVLAVSKKGVAHVWHLQTLSQ 270
Query: 241 DEKTNPAKITVKLKKADADQQNSVNVKKSRTSIIAARLNDLEADG 285
+E P KI+VK NS++ KK R II+A+L D D
Sbjct: 271 NE-VLPTKISVK---------NSLD-KKGRIPIISAKLCDTNEDN 304
>gi|359477163|ref|XP_002271327.2| PREDICTED: WD repeat-containing protein 43-like [Vitis vinifera]
gi|296083309|emb|CBI22945.3| unnamed protein product [Vitis vinifera]
Length = 628
Score = 152 bits (385), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 97/295 (32%), Positives = 162/295 (54%), Gaps = 24/295 (8%)
Query: 1 IWSTNDGSLLAE------------WKQPD-GEPVVSYSCLA-CGFVGKKRRKERGTLLLA 46
IW T G + E + +P+ G V Y+CL KK+RK R +LL+
Sbjct: 31 IWDTLKGQVQTEFADIISTDTTNLYSKPERGHLSVDYTCLKWLSLDSKKKRKLRCSLLV- 89
Query: 47 LGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGE 106
LGT +GD+LA+DV GE+KW+ HPGG++ ++ + G ++ G +GM S++ S G
Sbjct: 90 LGTGSGDVLALDVSAGELKWRIGDCHPGGVSAISCSTDGSCIYAAGADGMISKIDSSAGN 149
Query: 107 VIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDG 166
++ +F+AS K ISS + + K+ A A+++++IL+ + +++ KFS G ++ + +D
Sbjct: 150 LLGKFRASTKAISSMSVSSDGKVLATAAAQLKILNSSDHKKIQKFSGHPGAVRCMIFTDD 209
Query: 167 AKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGE---DGTVILA 223
+ I+++ GE+++ VWR D SK + L+M H V +D + NG+ G +LA
Sbjct: 210 GEYILSSAIGERYIAVWRID-GSKKQSASCVLAMEHPAVFLDSRCIDNGDTDKSGLYVLA 268
Query: 224 VAESGVAYSWDLKTVSQDEKTNPAKITVKLKKADADQQNSVNVKKSRTSIIAARL 278
++E GV Y W K + T P K++V L + S N K S +I AA+L
Sbjct: 269 ISEIGVCYFWYAKNMEDLRNTKPTKVSVSL-----EDNFSKNHKGSLPTIFAAKL 318
>gi|242041231|ref|XP_002468010.1| hypothetical protein SORBIDRAFT_01g037980 [Sorghum bicolor]
gi|241921864|gb|EER95008.1| hypothetical protein SORBIDRAFT_01g037980 [Sorghum bicolor]
Length = 633
Score = 150 bits (378), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 84/265 (31%), Positives = 146/265 (55%), Gaps = 11/265 (4%)
Query: 18 GEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLA 77
G + Y+C+ + K++++ G+ L+ LGT +GD+LA+DV G KWK + HPGG+
Sbjct: 64 GHLALDYTCMKWVQLSSKKKRKAGSSLIVLGTGSGDVLALDVAAGNWKWKVSDCHPGGVT 123
Query: 78 GLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEV 137
+A++K GRS++ G +GM + + G V+ +FK+S K IS+ A + + A A+ ++
Sbjct: 124 AVAYSKHGRSVYTAGADGMVCRIDASDGAVVGKFKSSSKAISALAVSSDGNVLATAAGQL 183
Query: 138 RILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPA 197
R + +++ KFS ++ + S+ ++ ++++G GE+++ +W+ S KT +
Sbjct: 184 RTFDTSSNKKIKKFSGHPVAVRSMVFSNDSQYVLSSGVGERYIAIWKLG-SGKTQSSNCI 242
Query: 198 LSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQDEKTNPAKITVKLKKAD 257
LSM H + +DCK S GE T ILA++E GV Y W V P KI +
Sbjct: 243 LSMEHPAIFVDCKCSDEGE--THILAISEIGVCYFWSANDVDDLRNKKPTKIIL------ 294
Query: 258 ADQQNSVNVKKSRTSIIAARLNDLE 282
++SV SI AA+L ++
Sbjct: 295 --SESSVPTVHQAFSIFAAKLQGVD 317
>gi|449531051|ref|XP_004172501.1| PREDICTED: uncharacterized LOC101213788, partial [Cucumis sativus]
Length = 547
Score = 148 bits (374), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 98/310 (31%), Positives = 161/310 (51%), Gaps = 21/310 (6%)
Query: 1 IWSTNDGSLLAEW-------------KQPDGEPVVSYSCLACGFVGKKRRKERGTLLLAL 47
IW T G + E+ K G V Y C+ + KKR+++R LLL L
Sbjct: 31 IWDTLKGQIQTEFADFFTSDSTSILTKPEKGHLSVDYKCMKWLSLEKKRKRKRQCLLLLL 90
Query: 48 GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
GT +GD+LA+DV GE+KWK + HPGG+A ++F G ++ G +GM E+ S G +
Sbjct: 91 GTGSGDVLALDVAAGELKWKISDCHPGGVASISFPTHGSCIYTAGADGMLCEINSLTGNL 150
Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGA 167
+R+FKAS K IS + + KI A A+S+V+I + N +++ KFS G ++ + ++
Sbjct: 151 LRKFKASTKAISCISVSPDGKIIATAASQVKIFNCSNHKKIQKFSGHPGAVRCMVFTEDG 210
Query: 168 KIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGT--VILAVA 225
+ I+++ GE+++ VW D K + L+M H + +D + S +G D T ILA++
Sbjct: 211 RYILSSAVGERYIVVWSVD-GGKEQSASCVLAMEHPAIFVDSRCSNDGGDETALYILAIS 269
Query: 226 ESGVAYSWDLKTVSQDEKTNPAKITVKLKKADADQQNSVNVKKSRTSIIAARLNDLEADG 285
E G Y W + + + P KI + + S + K++ +I AA+L + G
Sbjct: 270 EIGACYLWYGQNLEELRTAKPTKILMS-----GNDIFSKSKKRAIPAIYAAKLQGVPKSG 324
Query: 286 QKTITGLHDL 295
+ H L
Sbjct: 325 SGQVFLAHGL 334
>gi|449464758|ref|XP_004150096.1| PREDICTED: uncharacterized protein LOC101213788 [Cucumis sativus]
Length = 570
Score = 148 bits (373), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 98/310 (31%), Positives = 161/310 (51%), Gaps = 21/310 (6%)
Query: 1 IWSTNDGSLLAEW-------------KQPDGEPVVSYSCLACGFVGKKRRKERGTLLLAL 47
IW T G + E+ K G V Y C+ + KKR+++R LLL L
Sbjct: 31 IWDTLKGQIQTEFADFFTSDSTSILTKPEKGHLSVDYKCMKWLSLEKKRKRKRQCLLLLL 90
Query: 48 GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
GT +GD+LA+DV GE+KWK + HPGG+A ++F G ++ G +GM E+ S G +
Sbjct: 91 GTGSGDVLALDVAAGELKWKISDCHPGGVASISFPTHGSCIYTAGADGMLCEINSLTGNL 150
Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGA 167
+R+FKAS K IS + + KI A A+S+V+I + N +++ KFS G ++ + ++
Sbjct: 151 LRKFKASTKAISCISVSPDGKIIATAASQVKIFNCSNHKKIQKFSGHPGAVRCMVFTEDG 210
Query: 168 KIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGT--VILAVA 225
+ I+++ GE+++ VW D K + L+M H + +D + S +G D T ILA++
Sbjct: 211 RYILSSAVGERYIVVWSVD-GGKEQSASCVLAMEHPAIFVDSRCSNDGGDETALYILAIS 269
Query: 226 ESGVAYSWDLKTVSQDEKTNPAKITVKLKKADADQQNSVNVKKSRTSIIAARLNDLEADG 285
E G Y W + + + P KI + + S + K++ +I AA+L + G
Sbjct: 270 EIGACYLWYGQNLEELRTAKPTKILMS-----GNDIFSKSKKRAIPAIYAAKLQGVPKSG 324
Query: 286 QKTITGLHDL 295
+ H L
Sbjct: 325 SGQVFLAHGL 334
>gi|115452511|ref|NP_001049856.1| Os03g0300300 [Oryza sativa Japonica Group]
gi|108707682|gb|ABF95477.1| transducin family protein, putative, expressed [Oryza sativa
Japonica Group]
gi|113548327|dbj|BAF11770.1| Os03g0300300 [Oryza sativa Japonica Group]
gi|215704632|dbj|BAG94260.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222624762|gb|EEE58894.1| hypothetical protein OsJ_10517 [Oryza sativa Japonica Group]
Length = 637
Score = 147 bits (372), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 84/267 (31%), Positives = 150/267 (56%), Gaps = 11/267 (4%)
Query: 18 GEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLA 77
G + Y+C+ + K++++ G+ LL LGT +GD+LA+DV G+ KW+ T HPGG+
Sbjct: 65 GHLALDYTCMKWVQLSSKKKRKAGSSLLVLGTGSGDVLALDVAAGQWKWRVTDCHPGGVT 124
Query: 78 GLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEV 137
+A++K GRSL+ GT+GM + + G V+ +FK+S K IS+ A + +I A A+ ++
Sbjct: 125 AVAYSKHGRSLYTGGTDGMVCRINASDGSVVEKFKSSSKAISALAVSPDGEILATAAGQL 184
Query: 138 RILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPA 197
R + +++ KFS ++ + S ++ ++++G GE+++ +W+ S KT +
Sbjct: 185 RTFDASDNKKIQKFSGHPVAVRSMVFSGDSQYVLSSGVGERYVAIWKLG-SGKTQSSSCI 243
Query: 198 LSMRHSPVAIDCKNSP-NGEDGTV-ILAVAESGVAYSWDLKTVSQDEKTNPAKITVKLKK 255
LSM H + +DCK S N +G + +LA++E GV Y W + P KI +
Sbjct: 244 LSMEHPAIFVDCKCSDINDTEGEIHVLAISEIGVCYFWSGTNMDDLRNKKPTKIAL---- 299
Query: 256 ADADQQNSVNVKKSRTSIIAARLNDLE 282
+S++ K +I AA+L ++
Sbjct: 300 ----SDSSLSRSKQGFAIFAAKLQGID 322
>gi|224110316|ref|XP_002315482.1| predicted protein [Populus trichocarpa]
gi|222864522|gb|EEF01653.1| predicted protein [Populus trichocarpa]
Length = 617
Score = 147 bits (372), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 88/294 (29%), Positives = 158/294 (53%), Gaps = 22/294 (7%)
Query: 1 IWSTNDGSLLAEW------------KQPDGEPVVSYSCLACGFVGKKRRKERGTLLLALG 48
IW T G + E+ K G V Y C+ + KK++++ G+ LL LG
Sbjct: 33 IWDTLKGQVQTEFADITSSEGDFYAKPERGHLSVDYKCMKWLSLDKKKKRKLGSSLLVLG 92
Query: 49 TSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVI 108
T +GD+LA+DV G++KW + HPGG++ ++F+ + ++ G +GM + + G +
Sbjct: 93 TGSGDVLALDVSAGQLKWSVSDCHPGGVSAVSFSTRDSCIYTSGADGMICKTDPQTGNTL 152
Query: 109 REFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAK 168
+F+AS K IS + + KI A A+++++I + + +++ KFS G ++ + +D K
Sbjct: 153 GKFRASTKAISCMSVSSDGKILATAAAQLKIFNCSDHKKMQKFSGHPGAVRSMVFTDDGK 212
Query: 169 IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPN-GED--GTVILAVA 225
I+++ GE+++ +WR D K + L+M H V IDCK N G D G +LA++
Sbjct: 213 YILSSAVGERYVALWRID-GGKRQSASCVLAMEHPAVFIDCKCFENEGVDDAGLYVLAIS 271
Query: 226 ESGVAYSWDLKTVSQDEKTNPAKITVKLKKADADQQNSVNVKKSRTSIIAARLN 279
E+GV Y+W K V + + P K+ + ++ N K + +++AA+L
Sbjct: 272 ETGVCYTWCGKNVEELRNSKPTKVLLSY------EEFPKNHKGALPTVLAAKLQ 319
>gi|414866377|tpg|DAA44934.1| TPA: hypothetical protein ZEAMMB73_517326 [Zea mays]
Length = 629
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 81/265 (30%), Positives = 145/265 (54%), Gaps = 11/265 (4%)
Query: 18 GEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLA 77
G + Y+C+ + K++++ G+ L+ LGT +GD+LA+DV G KWK + HPGG+
Sbjct: 62 GHLALDYTCMKWVQLSSKKKRKAGSSLIVLGTGSGDVLALDVAAGHWKWKVSDCHPGGVT 121
Query: 78 GLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEV 137
+ ++K GR+++ G +GM + + G V+ +FK+S K IS+ A + + A A+ ++
Sbjct: 122 AVEYSKHGRNVYTAGADGMVCRIDASNGAVVGKFKSSSKAISALAVSSDGNMLATAAGQL 181
Query: 138 RILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPA 197
R + +++ KFS ++ + S+ ++ ++++G GE+++ +W+ S KT +
Sbjct: 182 RTFDTSSNKKIKKFSGHPVAVRSMVFSNDSQYVLSSGVGERYIAIWKLG-SGKTQSSNCI 240
Query: 198 LSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQDEKTNPAKITVKLKKAD 257
LSM H + +DCK S GE ILA++E GV Y W V P KI +
Sbjct: 241 LSMEHPAMFVDCKCSDKGE--IHILAISEIGVCYFWSANVVDDLRNKKPTKIIL------ 292
Query: 258 ADQQNSVNVKKSRTSIIAARLNDLE 282
++SV+ SI AA+L ++
Sbjct: 293 --SESSVSTTHQALSIFAAKLQGVD 315
>gi|218192639|gb|EEC75066.1| hypothetical protein OsI_11186 [Oryza sativa Indica Group]
Length = 637
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 84/267 (31%), Positives = 150/267 (56%), Gaps = 11/267 (4%)
Query: 18 GEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLA 77
G + Y+C+ + K++++ G+ LL LGT +GD+LA+DV G+ KW+ T HPGG+
Sbjct: 65 GHLALDYTCMKWVQLLSKKKRKAGSSLLVLGTGSGDVLALDVAAGQWKWRVTDCHPGGVT 124
Query: 78 GLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEV 137
+A++K GRSL+ GT+GM + + G V+ +FK+S K IS+ A + +I A A+ ++
Sbjct: 125 AVAYSKHGRSLYTGGTDGMVCRINASDGSVVEKFKSSSKAISALAVSPDGEILATAAGQL 184
Query: 138 RILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPA 197
R + +++ KFS ++ + S ++ ++++G GE+++ +W+ S KT +
Sbjct: 185 RTFDASDNKKIQKFSGHPVAVRSMVFSGDSQYVLSSGVGERYVAIWKLG-SGKTQSSSCI 243
Query: 198 LSMRHSPVAIDCKNSP-NGEDGTV-ILAVAESGVAYSWDLKTVSQDEKTNPAKITVKLKK 255
LSM H + +DCK S N +G + +LA++E GV Y W + P KI +
Sbjct: 244 LSMEHPAIFVDCKCSDINDTEGEIHVLAISEIGVCYFWSGTNMDDLRNKKPTKIAL---- 299
Query: 256 ADADQQNSVNVKKSRTSIIAARLNDLE 282
+S++ K +I AA+L ++
Sbjct: 300 ----SDSSLSRSKQGFAIFAAKLQGID 322
>gi|357112630|ref|XP_003558111.1| PREDICTED: uncharacterized protein LOC100843293 [Brachypodium
distachyon]
Length = 625
Score = 146 bits (368), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 89/296 (30%), Positives = 157/296 (53%), Gaps = 23/296 (7%)
Query: 1 IWSTNDGSLLAEWKQ---------PD---GEPVVSYSCLACGFVGKKRRKERGTLLLALG 48
+W GSL E+ P+ G + Y+C+ + K++++ G+ LL LG
Sbjct: 33 VWDAVRGSLQTEFADIPAVEVGAVPETKRGHLALDYTCMKWVQLSGKKKRKAGSSLLVLG 92
Query: 49 TSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVI 108
T +GD+LA+DV G+ KW+ HPGG+ +A+++ GRS++ G +GM + S G V
Sbjct: 93 TGSGDVLALDVAAGQWKWRINDCHPGGVTAVAYSRHGRSVYSAGADGMVCRIDSSDGSVA 152
Query: 109 REFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAK 168
+FK+S K IS+ A + I A A+ ++R + ++V KFS ++ ++ SD K
Sbjct: 153 GKFKSSSKAISALAVSPDGNILATAAGQLRTFDASDNKKVQKFSGHPVAVRSMTFSDNGK 212
Query: 169 IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNS-PNGEDGTV-ILAVAE 226
I+++G GE+++ +W+ S K + LSM H + +DCK S + DG + +LA++E
Sbjct: 213 YILSSGVGERYVAIWKLG-SGKAQSSNCILSMEHPAIFVDCKCSDTDATDGEIHVLAISE 271
Query: 227 SGVAYSWDLKTVSQDEKTNPAKITVKLKKADADQQNSVNVKKSRTSIIAARLNDLE 282
G+ Y W ++ P KI + ++S++ K +I AA+L ++
Sbjct: 272 VGICYFWSGNNMNDLRNKKPTKIAL--------SESSLSRAKQPFTIFAAKLKGID 319
>gi|255551877|ref|XP_002516984.1| nucleotide binding protein, putative [Ricinus communis]
gi|223544072|gb|EEF45598.1| nucleotide binding protein, putative [Ricinus communis]
Length = 586
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 87/295 (29%), Positives = 160/295 (54%), Gaps = 24/295 (8%)
Query: 1 IWSTNDGSLLAE------------WKQPD-GEPVVSYSCLACGFVGKKRRKERGTLLLAL 47
IW T G L E + +P+ G + Y+C+ + +KR+++ G+ LL L
Sbjct: 31 IWDTVKGQLQTEFADITSSNNADFYTKPERGHLSIDYTCMKWLSLDRKRKRKLGSSLLVL 90
Query: 48 GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
GT +GD+LA+DV G++KW + HPGG++ ++F+ ++ G +GM ++ + G +
Sbjct: 91 GTGSGDVLALDVSAGQLKWTVSDCHPGGVSAISFSTSDSCIYTAGADGMCCKLDPQTGNM 150
Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGA 167
+ +F+AS K ISS + + KI A A+++++ ++L + +++ KFS G ++ + ++
Sbjct: 151 LGKFRASTKAISSVSVSPDGKILATAAAQLKTINLSDNKKIQKFSGHPGAVRAMIFTEDG 210
Query: 168 KIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPN-GED--GTVILAV 224
K I+++ GE+++ +WR D K L+M H V +D K N G D G ++A+
Sbjct: 211 KYILSSAVGERYIALWRTD-GVKKQTASCVLAMEHPAVFLDSKCLENKGVDDAGLCVMAI 269
Query: 225 AESGVAYSWDLKTVSQDEKTNPAKITVKLKKADADQQNSVNVKKSRTSIIAARLN 279
+E+GV Y+W + V + P KI + ++S N K +I AA+L
Sbjct: 270 SETGVCYTWCGQNVEELRIAKPTKIVLF-------SEDSKNQKGGLRTIFAAKLQ 317
>gi|30683535|ref|NP_196685.2| transducin/WD40 domain-containing protein [Arabidopsis thaliana]
gi|209414540|gb|ACI46510.1| At5g11240 [Arabidopsis thaliana]
gi|332004268|gb|AED91651.1| transducin/WD40 domain-containing protein [Arabidopsis thaliana]
Length = 615
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 89/298 (29%), Positives = 155/298 (52%), Gaps = 20/298 (6%)
Query: 1 IWSTNDGSLLAEW-------------KQPDGEPVVSYSCLACGFVGKKRRKERGTLLLAL 47
IW T G + E+ K G V Y+C+ + KK++++ GT +L L
Sbjct: 31 IWDTVKGQVQTEFADIASTEETNIYTKVGKGHLSVDYTCMKWLSLEKKKKRKLGTSVLVL 90
Query: 48 GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
GT GD+LA+DV +G++KW+ + HPGG+ ++ + K ++ G +GM ++ G +
Sbjct: 91 GTGGGDVLALDVASGQLKWRISDCHPGGVNAVSSSAKASCIYSGGADGMVCQIDPHSGNL 150
Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGA 167
IR+FKAS K +SS + KI AS++++ + + +++ KF+ G ++ V+ ++
Sbjct: 151 IRKFKASTKTVSSLCVSPDGKILVTASTQLKTFNCSDLKKIQKFTGHPGVVRCVAFTEDG 210
Query: 168 KIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAES 227
K ++++ GE+++ VW+ D +K + L++ H PV +D N E G +LA++E
Sbjct: 211 KYVLSSAVGERYIAVWKTD-GAKKQSASCVLALEHPPVFVDSWGETN-EKGLYVLAISEI 268
Query: 228 GVAYSWDLKTVSQDEKTNPAKITVKLKKADADQQNSVNVKKSRTSIIAARLNDLEADG 285
GV Y W V + P K V L AD+ + K S I AA+L + G
Sbjct: 269 GVCYFWYGSNVEELCNATPTK--VALATADSSLK---PYKSSLPLIFAAKLQGILKPG 321
>gi|8953386|emb|CAB96659.1| putative protein [Arabidopsis thaliana]
Length = 606
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 89/298 (29%), Positives = 155/298 (52%), Gaps = 20/298 (6%)
Query: 1 IWSTNDGSLLAEW-------------KQPDGEPVVSYSCLACGFVGKKRRKERGTLLLAL 47
IW T G + E+ K G V Y+C+ + KK++++ GT +L L
Sbjct: 31 IWDTVKGQVQTEFADIASTEETNIYTKVGKGHLSVDYTCMKWLSLEKKKKRKLGTSVLVL 90
Query: 48 GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
GT GD+LA+DV +G++KW+ + HPGG+ ++ + K ++ G +GM ++ G +
Sbjct: 91 GTGGGDVLALDVASGQLKWRISDCHPGGVNAVSSSAKASCIYSGGADGMVCQIDPHSGNL 150
Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGA 167
IR+FKAS K +SS + KI AS++++ + + +++ KF+ G ++ V+ ++
Sbjct: 151 IRKFKASTKTVSSLCVSPDGKILVTASTQLKTFNCSDLKKIQKFTGHPGVVRCVAFTEDG 210
Query: 168 KIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAES 227
K ++++ GE+++ VW+ D +K + L++ H PV +D N E G +LA++E
Sbjct: 211 KYVLSSAVGERYIAVWKTD-GAKKQSASCVLALEHPPVFVDSWGETN-EKGLYVLAISEI 268
Query: 228 GVAYSWDLKTVSQDEKTNPAKITVKLKKADADQQNSVNVKKSRTSIIAARLNDLEADG 285
GV Y W V + P K V L AD+ + K S I AA+L + G
Sbjct: 269 GVCYFWYGSNVEELCNATPTK--VALATADSSLK---PYKSSLPLIFAAKLQGILKPG 321
>gi|356521817|ref|XP_003529547.1| PREDICTED: uncharacterized protein LOC100815458 [Glycine max]
Length = 626
Score = 140 bits (352), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 80/275 (29%), Positives = 149/275 (54%), Gaps = 11/275 (4%)
Query: 17 DGEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGL 76
+G + Y+C+ KKR+++ + LL LGT GD+LA+DV + ++ W+ T HPGG+
Sbjct: 63 NGHLALDYTCIKWFSFEKKRKRKHISSLLVLGTGGGDVLALDVASAQLTWRLTDCHPGGV 122
Query: 77 AGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSE 136
+A + S++ G +GM + G ++ +FKAS KP+S + + A A+++
Sbjct: 123 RAIASSANVSSIYTAGVDGMVCVIDFMTGNLLEKFKASTKPVSCMSVSPDGNTLATAAAQ 182
Query: 137 VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGP 196
++I + N +++ KFS G ++ + ++ K ++++ GE+++ VWR D +K +
Sbjct: 183 LKIFNCSNHKKIQKFSGHPGSVRCMLFTEDGKYVLSSAVGERYVAVWRID-GAKKQSASC 241
Query: 197 ALSMRHSPVAIDCKNSPNGED---GTVILAVAESGVAYSWDLKTVSQDEKTNPAKITVKL 253
L+M H V +D + +GE G +LA++E G+ Y W ++ + P KI++ L
Sbjct: 242 VLAMEHPAVFLDSRCIDSGEHDEAGICVLAISEIGICYLWFGNSIEELRNAKPTKISLSL 301
Query: 254 KKADADQQNSVNVKKSRTSIIAARLNDLE--ADGQ 286
+ S N K + +I AA+L ++ A GQ
Sbjct: 302 -----EDMPSRNYKGALPAIYAAKLQGIQKPASGQ 331
>gi|356564408|ref|XP_003550446.1| PREDICTED: uncharacterized protein LOC100784148 [Glycine max]
Length = 626
Score = 139 bits (351), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 82/275 (29%), Positives = 152/275 (55%), Gaps = 11/275 (4%)
Query: 17 DGEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGL 76
+G + Y+C+ KKR+++ + LL LGT GD+LA+D+ +G++ W+ T HPGG+
Sbjct: 63 NGHLALDYTCIKWFSFDKKRKRKHISSLLVLGTGGGDVLALDIASGQLTWRLTDCHPGGV 122
Query: 77 AGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSE 136
+A + +++ G +GM + G ++ +FKAS KP+S + + K A A+++
Sbjct: 123 RAIASSANVSTIYTAGADGMVCLIDFMTGNLLEKFKASTKPVSCMSVSPDGKTLATAAAQ 182
Query: 137 VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGP 196
++I + N +++ KFS G ++ + ++ K I+++ GE+++ VWR + +K +
Sbjct: 183 LKIFNCSNHKKIQKFSGHPGSVRCMVFTEDGKHILSSAVGERYVAVWRIE-GAKKQSASC 241
Query: 197 ALSMRHSPVAID--CKNS-PNGEDGTVILAVAESGVAYSWDLKTVSQDEKTNPAKITVKL 253
L+M H V +D C +S + E G +LA++E G+ Y W +V + P KI++ L
Sbjct: 242 VLAMEHPAVFLDSRCIDSEEHDEAGICVLAISEIGICYLWFGNSVEELRNAKPTKISLSL 301
Query: 254 KKADADQQNSVNVKKSRTSIIAARLNDLE--ADGQ 286
+ S N K + +I AA+L ++ A GQ
Sbjct: 302 EDVP-----SRNYKGALPAIYAAKLQGIQKPASGQ 331
>gi|26451883|dbj|BAC43034.1| unknown protein [Arabidopsis thaliana]
Length = 615
Score = 138 bits (347), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 89/298 (29%), Positives = 154/298 (51%), Gaps = 20/298 (6%)
Query: 1 IWSTNDGSLLAEW-------------KQPDGEPVVSYSCLACGFVGKKRRKERGTLLLAL 47
IW T G + E+ K G V Y+C+ + KK++++ GT +L L
Sbjct: 31 IWDTVKGQVQTEFADIASTEETNIYTKVGKGHLSVDYTCMKWLSLEKKKKRKLGTSVLVL 90
Query: 48 GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
GT GD+LA+DV +G++KW+ + HPGG+ ++ + K ++ G +GM ++ G +
Sbjct: 91 GTGGGDVLALDVASGQLKWRISDCHPGGVNAVSSSAKASCIYSGGADGMVCQIDPHSGNL 150
Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGA 167
IR+FKAS K +SS + KI AS++++ + + +++ KF+ G ++ V+ ++
Sbjct: 151 IRKFKASTKTVSSLCVSPDGKILVTASTQLKTFNCSDLKKIQKFTGHPGVVRCVAFTEDG 210
Query: 168 KIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAES 227
K ++++ GE+++ VW+ D +K + L++ H PV D N E G +LA++E
Sbjct: 211 KYVLSSAVGERYIAVWKTD-GAKKQSASCVLALEHPPVFADSWGETN-EKGLYVLAISEI 268
Query: 228 GVAYSWDLKTVSQDEKTNPAKITVKLKKADADQQNSVNVKKSRTSIIAARLNDLEADG 285
GV Y W V + P K V L AD+ + K S I AA+L + G
Sbjct: 269 GVCYFWYGSNVEELCNATPTK--VALATADSSLK---PYKSSLPLIFAAKLQGILKPG 321
>gi|313184296|emb|CBL94162.1| putative transducin family protein / WD-40 repeat family protein
[Malus x domestica]
Length = 629
Score = 137 bits (346), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 78/254 (30%), Positives = 140/254 (55%), Gaps = 6/254 (2%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVL 60
I ++ D S+ A K+ G V Y+C+ +K++++ G+ L LGT GD+LA+DV
Sbjct: 46 IVASEDTSIYA--KRERGHLSVDYTCMKWFSSERKKKRKLGSSFLVLGTGGGDVLALDVA 103
Query: 61 TGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISS 120
G +KW+ H GG++ ++FA+ L+ G +GM ++ S G + +FKAS K ISS
Sbjct: 104 AGHLKWRVNDCHAGGVSSISFARNTSCLYTAGADGMICQIDSLTGNLSGKFKASTKAISS 163
Query: 121 SAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHL 180
+ I A A+++++I +L + +++ KFS G ++ + ++ K I+++ GE+++
Sbjct: 164 MSVSSNGNILATAAAQMKIFNLSDHKKIQKFSGHPGAVRCMIFTEDGKYILSSAVGERYI 223
Query: 181 QVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTV---ILAVAESGVAYSWDLKT 237
VW D K + L M H + +D + +GE G V +LA++E GV Y W +
Sbjct: 224 AVWSID-GGKKQSASVVLKMEHPAIFVDSRCIDSGEVGDVGLYVLAISEVGVCYFWFGQN 282
Query: 238 VSQDEKTNPAKITV 251
+ + P KI++
Sbjct: 283 IEELRSAKPTKISL 296
>gi|297807165|ref|XP_002871466.1| hypothetical protein ARALYDRAFT_325654 [Arabidopsis lyrata subsp.
lyrata]
gi|297317303|gb|EFH47725.1| hypothetical protein ARALYDRAFT_325654 [Arabidopsis lyrata subsp.
lyrata]
Length = 615
Score = 137 bits (346), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 88/298 (29%), Positives = 155/298 (52%), Gaps = 20/298 (6%)
Query: 1 IWSTNDGSLLAEW-------------KQPDGEPVVSYSCLACGFVGKKRRKERGTLLLAL 47
IW T G + E+ K G V Y+C+ + KK++++ GT +L L
Sbjct: 31 IWDTVKGQVQTEFADIASTEETNIYTKVGKGHLSVDYTCMKWLSLEKKKKRKLGTSVLVL 90
Query: 48 GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
GT GD+LA+DV +G++KW+ + HPGG+ ++ + K ++ G +GM ++ G +
Sbjct: 91 GTGGGDVLALDVASGQLKWRISDCHPGGVNAVSSSAKASCIYSGGADGMVCQIDPHSGNL 150
Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGA 167
IR+FKAS K +SS + KI AS++++ + + +++ KF+ G ++ V+ ++
Sbjct: 151 IRKFKASTKTVSSLCVSPDGKILVTASAQLKTFNCSDLKKIQKFTGHPGGVRCVAFTEDG 210
Query: 168 KIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAES 227
K ++++ GE+++ VW+ + +K + L++ H PV +D N E G +LA++E
Sbjct: 211 KYVLSSAVGERYIAVWKTN-GAKKQSASCVLALEHPPVFVDSWGETN-EKGLYVLAISEI 268
Query: 228 GVAYSWDLKTVSQDEKTNPAKITVKLKKADADQQNSVNVKKSRTSIIAARLNDLEADG 285
GV Y W V + P K V L AD+ + K S I AA+L + G
Sbjct: 269 GVCYFWYGSNVEELCNATPTK--VALATADSSLKPH---KGSLPLIFAAKLQGILKPG 321
>gi|356537448|ref|XP_003537239.1| PREDICTED: uncharacterized protein LOC100778281 [Glycine max]
Length = 557
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 80/278 (28%), Positives = 148/278 (53%), Gaps = 14/278 (5%)
Query: 17 DGEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGL 76
+G + Y+C+ +KR+++ + LL LGT GD+LA+DV + ++ W+ T HPGG+
Sbjct: 63 NGHLALDYTCIKWFSFEEKRKRKHISSLLVLGTGGGDVLALDVASAQLTWRLTDCHPGGV 122
Query: 77 AGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSE 136
+A + S++ G +GM + G ++ +FKAS KP+S + + A A+++
Sbjct: 123 RVIASSANVSSIYTAGVDGMVCVIDFMTGNLLEKFKASTKPVSCMSVSPDGNTLATAAAQ 182
Query: 137 VRILSLENGEEVLKFSDDVGPLQYVSA---SDGAKIIITAGYGEKHLQVWRCDISSKTVN 193
++I + N +++ KFS G L V ++ K ++++ GE+++ VWR + +K +
Sbjct: 183 LKIFNCSNHKKIQKFSGHPGSLGSVRCMLFTEEGKYVLSSAIGERYVAVWRIE-GAKKQS 241
Query: 194 KGPALSMRHSPVAIDCKNSPNGED---GTVILAVAESGVAYSWDLKTVSQDEKTNPAKIT 250
L+M H V +D + +GE G +LA++E + Y W ++ + T P KI+
Sbjct: 242 ASCVLAMEHPAVFLDSRCIDSGEHDEVGICVLAISEIAICYLWFGNSIEELCNTKPTKIS 301
Query: 251 VKLKKADADQQNSVNVKKSRTSIIAARLNDLE--ADGQ 286
+ L + S N K + +I AA+L ++ A GQ
Sbjct: 302 LSL-----EGMPSRNYKGALPAIYAAKLQGIQKPASGQ 334
>gi|357478843|ref|XP_003609707.1| WD40 repeat-containing protein SMU1 [Medicago truncatula]
gi|355510762|gb|AES91904.1| WD40 repeat-containing protein SMU1 [Medicago truncatula]
Length = 625
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 80/275 (29%), Positives = 145/275 (52%), Gaps = 12/275 (4%)
Query: 18 GEPVVSYSCLA-CGFVGKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGL 76
G V Y+C+ F K++RK + LL LGT +GD+LA+DV G++ WK HPGG+
Sbjct: 60 GHLSVDYTCIKWLSFQSKRKRKHSSSSLLLLGTGSGDVLALDVSAGQLSWKINDCHPGGV 119
Query: 77 AGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSE 136
++ G + + G +GM G+++ +FKAS K +S + + K A A+++
Sbjct: 120 RAISSLANGSTFYTAGADGMICATDFSTGKLLEKFKASSKAVSCISVSPDGKKLATAAAQ 179
Query: 137 VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGP 196
++I + N +++ K S G ++ + ++ +K I+++ GE+++ VW D SK +
Sbjct: 180 LKIFNCSNKKKIQKCSGHPGSVRCMVFTEDSKYILSSAVGERYVAVWSID-GSKKQSASC 238
Query: 197 ALSMRHSPVAIDCKNSPNGED---GTVILAVAESGVAYSWDLKTVSQDEKTNPAKITVKL 253
L+M H V +D + GE G +LA++E+G+ Y W ++ + P KI++
Sbjct: 239 VLAMEHPAVFLDSRCIDKGEHDEAGICVLAISEAGLCYLWFGNSIEELRNAKPTKISLS- 297
Query: 254 KKADADQQNSVNVKKSRTSIIAARLNDLE--ADGQ 286
++ ++ N K + +I AA L + A GQ
Sbjct: 298 ----SEDMSTKNYKGALPAIYAANLQSTQKPASGQ 328
>gi|147853966|emb|CAN79552.1| hypothetical protein VITISV_025726 [Vitis vinifera]
Length = 692
Score = 105 bits (261), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 64/212 (30%), Positives = 117/212 (55%), Gaps = 9/212 (4%)
Query: 70 GRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKI 129
G+ G++ ++ + G ++ G +GM S++ S G ++ +F+AS K ISS + + K+
Sbjct: 177 GKWRRGVSAISCSTDGSCIYAAGADGMISKIDSSAGNLLGKFRASTKAISSMSVSSDGKV 236
Query: 130 FALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISS 189
A A+++++IL+ + +++ KFS G ++ + +D + I+++ GE+++ VWR D S
Sbjct: 237 LATAAAQLKILNSSDHKKIQKFSGHPGAVRCMIFTDDGEYILSSAIGERYIAVWRID-GS 295
Query: 190 KTVNKGPALSMRHSPVAIDCKNSPNGE---DGTVILAVAESGVAYSWDLKTVSQDEKTNP 246
K + L+M H V +D + NG+ G +LA++E GV Y W K + T P
Sbjct: 296 KKQSASCVLAMEHPAVFLDSRCIDNGDTDKSGLYVLAISEIGVCYFWYAKNMEDLRNTKP 355
Query: 247 AKITVKLKKADADQQNSVNVKKSRTSIIAARL 278
K++V L + S N K S +I AA+L
Sbjct: 356 TKVSVSL-----EDNFSKNHKGSLPTIFAAKL 382
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 34/88 (38%), Positives = 47/88 (53%), Gaps = 15/88 (17%)
Query: 1 IWSTNDGSLLAE------------WKQPD-GEPVVSYSCLA-CGFVGKKRRKERGTLLLA 46
IW T G + E + +P+ G V Y+CL KK+RK R +LL+
Sbjct: 31 IWDTLKGQVQTEFADIISTDTTNLYSKPERGHLSVDYTCLKWLSLDSKKKRKLRCSLLV- 89
Query: 47 LGTSNGDILAVDVLTGEMKWKSTGRHPG 74
LGT +GD+LA+DV GE+KW+ HPG
Sbjct: 90 LGTGSGDVLALDVSAGELKWRIGDCHPG 117
>gi|168036674|ref|XP_001770831.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162677890|gb|EDQ64355.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 732
Score = 101 bits (251), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 74/290 (25%), Positives = 133/290 (45%), Gaps = 58/290 (20%)
Query: 18 GEPVVSYSCLACGFVGKK---------RRKERGTLLLALGTSNGDILAVDVLTGEMKWKS 68
G + Y+C+A G + +++ G ++ +GT +GDI++ D GE+KW++
Sbjct: 53 GHLALDYTCMAWGALPSNSSTKKKKKGAKEKGGAAVVVVGTGSGDIISWDTTLGELKWRA 112
Query: 69 TGRHPGGLA---------------------------------------------GLAFAK 83
+ H G +A +AF+K
Sbjct: 113 SDCHAGSVAVSSSLNPEFQFSLLMNALQGRELLIQMFHLFVNDVKSCLNLKSVTSVAFSK 172
Query: 84 KGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLE 143
G+ L+ VG + E+ S G+V+ +F+A++ P+S A + A+AS+EV++ L
Sbjct: 173 SGKRLYSVGHDAHVCELDSSSGQVLTKFRAAKTPLSCVAVAEDGGTVAVASAEVKLFDLS 232
Query: 144 NGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHS 203
+ + + KF P++ + + K +++A GE+H+ +W + + +LS+ H
Sbjct: 233 SKKRLRKFPGHPTPVKTLCFTPDGKYLLSAATGERHVVLWHVEGGKSETSAPISLSVEHP 292
Query: 204 PVAIDCKNSPNGEDGTV--ILAVAESGVAYSWDLKTVSQDEKTNPAKITV 251
ID S EDG V ++AV+E G AY W TV++ E T P I+V
Sbjct: 293 VATIDI--SGFDEDGDVLRVIAVSEGGCAYIWSAPTVNELETTKPTIISV 340
>gi|302759104|ref|XP_002962975.1| hypothetical protein SELMODRAFT_404454 [Selaginella moellendorffii]
gi|300169836|gb|EFJ36438.1| hypothetical protein SELMODRAFT_404454 [Selaginella moellendorffii]
Length = 617
Score = 100 bits (249), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 77/287 (26%), Positives = 142/287 (49%), Gaps = 25/287 (8%)
Query: 2 WSTNDGSLLAEWKQP--------DGEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGD 53
W G + AE+ + DG ++Y LA G KK++K+ L+ LGT GD
Sbjct: 25 WDVASGHVCAEFSEIAASSSQALDGHLALNYKSLAWGASSKKKKKK--ASLIVLGTGGGD 82
Query: 54 ILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKA 113
+LA D + GE+ W++ + GG+ + FAK SL G +G+ E++ G+++ +
Sbjct: 83 VLAWDPILGELTWRTNDCNTGGVTSVCFAKNQGSLFSAGIDGIVCELEVASGKILSTLQC 142
Query: 114 SEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITA 173
S+ I++ + + + +A++E+R+ + + + KF+ + ++ + +++A
Sbjct: 143 SKSSITNISSSPDASLLLVAANELRLFDMRTRKRLRKFTGHASSVNVTVFTEDGRYVLSA 202
Query: 174 GYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGT--VILAVAESGVAY 231
G++++ VW + K AL+M H VA+D +DGT ILAV+E+G AY
Sbjct: 203 AVGDRNIAVWDTNEVGKEAKV--ALAMEHPAVALDSWGFE--KDGTERKILAVSETGEAY 258
Query: 232 SWDLKTVSQDEKTNPAKITVKLKKADADQQNSVNVKKSRTSIIAARL 278
W ++ + K P ++ V K + K S+ SI+ RL
Sbjct: 259 IWSGLSLEELGKAEPVRVRVGSKNSG---------KASKQSILLGRL 296
>gi|302824612|ref|XP_002993948.1| hypothetical protein SELMODRAFT_431903 [Selaginella moellendorffii]
gi|300138220|gb|EFJ04995.1| hypothetical protein SELMODRAFT_431903 [Selaginella moellendorffii]
Length = 616
Score = 95.1 bits (235), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 75/285 (26%), Positives = 137/285 (48%), Gaps = 21/285 (7%)
Query: 2 WSTNDGSLLAEW--------KQPDGEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGD 53
W G + AE+ + DG ++Y LA G KK++K+ L+ LGT GD
Sbjct: 25 WDVASGHVCAEFSGIAASSSQALDGHLALNYKTLAWGVSSKKKKKKAS--LIVLGTGGGD 82
Query: 54 ILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKA 113
+LA D + GE+ W++ + GG+ + FAK SL G +G+ E++ G+++ +
Sbjct: 83 VLAWDPILGELTWRTNDCNTGGVTSVCFAKNQGSLFSAGIDGIVCELEVASGKILSTLQC 142
Query: 114 SEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITA 173
S+ I+S + + + +A++E+R+ + + + KF+ + ++ + +++A
Sbjct: 143 SKSSITSISSSPDASLLLVAANELRLFDMRTRKRLRKFTGHASSVNVTVFTEDGRYVLSA 202
Query: 174 GYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSW 233
GE+++ VW + K L+M H VA+D ILAV+E+G AY W
Sbjct: 203 AVGERNIAVWDTNEVGKEAKV--TLAMEHPAVALDSWGFEKDCTERKILAVSETGEAYIW 260
Query: 234 DLKTVSQDEKTNPAKITVKLKKADADQQNSVNVKKSRTSIIAARL 278
++ + K P ++ V K + K S+ SI+ RL
Sbjct: 261 SGLSLEELGKAEPVRVRVGSKNSG---------KASKQSILLGRL 296
>gi|302838626|ref|XP_002950871.1| hypothetical protein VOLCADRAFT_104876 [Volvox carteri f.
nagariensis]
gi|300263988|gb|EFJ48186.1| hypothetical protein VOLCADRAFT_104876 [Volvox carteri f.
nagariensis]
Length = 747
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 60/215 (27%), Positives = 100/215 (46%), Gaps = 16/215 (7%)
Query: 33 GKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVG 92
GKK++ T + ALGT+ GD+ +V GE+KW++ GG+ LAF+ + L G
Sbjct: 66 GKKKQSVESTGI-ALGTAAGDVKLYNVQLGELKWRAMDAVQGGVRSLAFSSEQGQLLATG 124
Query: 93 TNGMASEMKSEMGEVIREFKASEKPISSSAF---LCEEKIFALA--------SSEVRILS 141
NG + GEV + ++AS+ PI++ A E K +A S +++
Sbjct: 125 NNGSLVSLSIPDGEVKKRYEASKYPITAIAIAPGCIEAKSLLIADGKHVFGGGSSMQLWD 184
Query: 142 LENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPA---L 198
+ K++ ++ V G ++AG GE+ + VW + K+ + PA L
Sbjct: 185 VSESSRSAKYTGHPNEVRAVVFVPGQPHAVSAGSGERQVAVWEVPPAKKSKKQHPAVTTL 244
Query: 199 SMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSW 233
++ V +D GE V AV+E G A+ W
Sbjct: 245 NLEDPAVQLDAACVSEGETFNVA-AVSEGGEAFVW 278
>gi|440802916|gb|ELR23832.1| WD repeat domain 43 isoform 5, putative [Acanthamoeba castellanii
str. Neff]
Length = 252
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 44/158 (27%), Positives = 76/158 (48%), Gaps = 6/158 (3%)
Query: 32 VGKKRRKERGTLLL----ALGTSNGDILAVDVLTGEMKWKSTGR--HPGGLAGLAFAKKG 85
VG+ R +R L A+GT+ G I+ DVL GE+ + G+ H G + L F +
Sbjct: 65 VGQTERSKRAKKQLGGAVAIGTAKGTIVVWDVLRGEVHKELGGKEGHTGKVNDLVFTEGA 124
Query: 86 RSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENG 145
L + + E GE++ +FKA +PISS A + A AS+ +++ L +
Sbjct: 125 TFLFSCSDDLQVIKWNVENGEIVAKFKADNQPISSLALSADNSTLATASAMIKMWDLASS 184
Query: 146 EEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVW 183
+ KF+ + ++ S K +++A E++ VW
Sbjct: 185 KSQKKFTGHTSAVTSLNFSANGKFLVSASATERYTSVW 222
>gi|301090479|ref|XP_002895452.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262098632|gb|EEY56684.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 642
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 56/215 (26%), Positives = 102/215 (47%), Gaps = 21/215 (9%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFV-----GKKRRKERGTLLLALGTSNGDIL 55
+W ++G+L E K+ D +SY + + GKKR + G LLALGTS+G ++
Sbjct: 22 VWDVSNGALQQELKERDH---LSYRYTSLAWTQPKTKGKKRSGDSGLGLLALGTSSGIVV 78
Query: 56 AVDVLTGEMK--WKSTGRH-PGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFK 112
D+ TGE+K ++ H G + L F +G L+ ++ E G+V R+F+
Sbjct: 79 VWDLATGEVKHTLQADAVHGSGAVQALTFNPQGSLLYSSSSDKHVLEWNVSNGKVERKFR 138
Query: 113 ASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLK----FSDDVGPLQYVSASDG-- 166
S+ A + +I A+ S +R L +G++ K S V L++ + G
Sbjct: 139 CGSGGASALAVSADGEILAVGGSALRTFDLSSGKKSRKLVSGLSTAVTQLRFANVETGIE 198
Query: 167 -AKIIITAGYGEKHLQVWRCDISSKTVNKGPALSM 200
++ + A G + + ++ ++ T + PAL+
Sbjct: 199 SSRFLFAATAGARFVNMYDLEL---TEHDAPALTF 230
>gi|301122917|ref|XP_002909185.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262099947|gb|EEY57999.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 642
Score = 57.8 bits (138), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 55/215 (25%), Positives = 100/215 (46%), Gaps = 21/215 (9%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFV-----GKKRRKERGTLLLALGTSNGDIL 55
+W +G+L E K+ D +SY + + KKR + G LLALGTS+G ++
Sbjct: 22 VWDVANGALQQELKERDH---LSYRYTSLAWTQPKTKSKKRSGDSGLGLLALGTSSGIVV 78
Query: 56 AVDVLTGEMK--WKSTGRH-PGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFK 112
D+ TGE+K ++ H G + L F +G L+ ++ E G+V R+F+
Sbjct: 79 VWDLATGEVKHTLQADAAHGSGAVQALTFNPQGSLLYSSSSDKHILEWNVSSGKVERKFR 138
Query: 113 ASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLK----FSDDVGPLQYVSASDG-- 166
S+ A + +I A+ S +R L +G++ K S V L++ + G
Sbjct: 139 CGSGGASALAVSADGEILAVGGSALRTFDLSSGKKSRKLVSGLSTAVTQLRFANVETGIE 198
Query: 167 -AKIIITAGYGEKHLQVWRCDISSKTVNKGPALSM 200
++ + A G + + ++ ++ T + PAL+
Sbjct: 199 SSRFLFAATAGARFVNMYDLEL---TEHDAPALTF 230
>gi|384245316|gb|EIE18810.1| NUC189-domain-containing protein [Coccomyxa subellipsoidea C-169]
Length = 671
Score = 57.4 bits (137), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 41/165 (24%), Positives = 77/165 (46%), Gaps = 3/165 (1%)
Query: 33 GKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAF-AKKGRS--LH 89
G+ G L+L +GT+ G++ A T + W + G + GG+ LAF A G + L+
Sbjct: 69 GRAVANAAGRLMLVVGTAAGNVRAFSAATAKELWAANGCNEGGVTSLAFEADAGAAGLLY 128
Query: 90 VVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVL 149
VG+N + G + +FKA P+S + + + SS + + ++ + V
Sbjct: 129 SVGSNNHVCALDVASGAQVVKFKAGSHPLSCVGAAPDCQSVLVGSSSLALWNVAEQKRVA 188
Query: 150 KFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNK 194
KF P++ ++ S ++A GE+ + +W + T +K
Sbjct: 189 KFMGHSLPVRAIAFSPSGSHALSASEGERQVALWALPTIAATTSK 233
>gi|359483009|ref|XP_003632878.1| PREDICTED: uncharacterized protein LOC100265278 isoform 2 [Vitis
vinifera]
gi|359483011|ref|XP_002281427.2| PREDICTED: uncharacterized protein LOC100265278 isoform 1 [Vitis
vinifera]
Length = 357
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 42/123 (34%), Positives = 55/123 (44%), Gaps = 45/123 (36%)
Query: 251 VKLKKADADQQNSVNVKKSRTSIIAARLNDLEADG------------------------- 285
VK K+A+ DQ++S + KKSRTSII ARLN L++DG
Sbjct: 2 VKTKEAEIDQKHSGSTKKSRTSIITARLNALQSDGWVTALVCYGSINSPQFNLLEISNPG 61
Query: 286 --------------------QKTITGLHDLESEAATASAQNEKKSKKRAASDPDLETTRD 325
+ + DLE EA QN+K +KKRAASDPDL +
Sbjct: 62 EDIVVAATDNIVKTVAEAGRENGVLAGKDLELEAVAEPIQNKKSNKKRAASDPDLAAAEN 121
Query: 326 VVD 328
+VD
Sbjct: 122 MVD 124
>gi|348675861|gb|EGZ15679.1| hypothetical protein PHYSODRAFT_505395 [Phytophthora sojae]
Length = 664
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 54/217 (24%), Positives = 92/217 (42%), Gaps = 21/217 (9%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVGKKRRKERGTL---------LLALGTSN 51
+W G+L E K+ D Y+CLA + + LLALGTSN
Sbjct: 22 LWDVAGGALQQELKERD-HLSYRYTCLAWTQANASTSSSKKSKKRSGASDLGLLALGTSN 80
Query: 52 GDILAVDVLTGEMKWK-STGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIRE 110
G ++ D+ TGE+K G + LAF +G L+ ++ E G V R+
Sbjct: 81 GSVVVWDLATGEVKHTLQADASSGAVQALAFNPQGSLLYSSSSDKHVLEWSVSSGSVERK 140
Query: 111 FKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLK----FSDDVGPLQYVSASDG 166
F+ S+ A + +I A S +R L +G++ K S V L++ + DG
Sbjct: 141 FRVGSGGASALAVSADGEILAAGGSALRTFDLASGKKSRKLVSGLSSAVTQLRFAAVEDG 200
Query: 167 ---AKIIITAGYGEKHLQVWRCDISSKTVNKGPALSM 200
++ + A G + + ++ ++ + PAL+
Sbjct: 201 IEASRFLFAATAGARFVNMYDLQLAE---HDAPALTF 234
>gi|159469634|ref|XP_001692968.1| predicted protein [Chlamydomonas reinhardtii]
gi|158277770|gb|EDP03537.1| predicted protein [Chlamydomonas reinhardtii]
Length = 546
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 42/145 (28%), Positives = 69/145 (47%), Gaps = 4/145 (2%)
Query: 92 GTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKF 151
G N + E G V + F+AS+ PI++ A + K S +++ + + E K+
Sbjct: 8 GPNCSLVSVGVEDGAVKKRFEASKYPITAIALAPDGKHVFGGGSTMQLWDVTSEERAAKY 67
Query: 152 SDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPA---LSMRHSPVAID 208
+ ++ V+ G I+A GE+H+ VW + K+ + PA LS+ VA+D
Sbjct: 68 TGHPTEVRAVAFVPGQPHAISAATGERHVAVWDVPPAKKSKKQHPAVTTLSLEEPAVAVD 127
Query: 209 CKNSPNGEDGTVILAVAESGVAYSW 233
GE +V AV+E G AY W
Sbjct: 128 AACVSEGETFSVG-AVSEGGEAYVW 151
>gi|195122410|ref|XP_002005704.1| GI18931 [Drosophila mojavensis]
gi|193910772|gb|EDW09639.1| GI18931 [Drosophila mojavensis]
Length = 638
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 58/236 (24%), Positives = 106/236 (44%), Gaps = 27/236 (11%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVG------KKRRKER--------GTLLLA 46
IW T +L E+ P + ++ C A ++ KK RK G +A
Sbjct: 30 IWDTESNTLKQEYT-PSLQ--LTGHCTALAWISINAQRIKKSRKSLQINDSTTIGKHYIA 86
Query: 47 LGTSNGDILAVDVLTGEMKWKSTG-RHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMG 105
LGTS G ++ V G++ G +H G + +A+ G L+ VG++G A
Sbjct: 87 LGTSKGTVVLFSVTEGKIDRVLKGDKHDGAVTSIAYDNDGH-LYTVGSDGRAIVWSVAEE 145
Query: 106 EVIREFKA-SEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSA- 163
IR++ EKP+ + A L + + A+AS ++++ ++ E V F+ G + +++
Sbjct: 146 RSIRQWPVGPEKPL-NVAILPKSRTLAVASRQLKVYDVDKKELVETFTGHSGEINAINSF 204
Query: 164 -SDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVA--IDCKNSPNGE 216
+ + +IT E+ + W+ D K NK ++ VA + C+ NG+
Sbjct: 205 EFNDNEYVITTARMERVISFWKID--RKGRNKASVCTLLMEDVAHSLACEVRDNGQ 258
>gi|66392231|ref|NP_001018176.1| WD repeat-containing protein 43 [Danio rerio]
gi|21105462|gb|AAM34674.1|AF506230_1 KIAA0007-like protein [Danio rerio]
gi|66267379|gb|AAH95298.1| WD repeat domain 43 [Danio rerio]
Length = 650
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 62/244 (25%), Positives = 102/244 (41%), Gaps = 23/244 (9%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACG-------FVGKKRRK------ERGTLLLAL 47
IW+T +L E+ P + +C+ G +KRRK + LLAL
Sbjct: 33 IWNTESKTLQQEYV-PSAHLSAACTCVTWGPCRAVQEVPQRKRRKCDAASSAEQSDLLAL 91
Query: 48 GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
GT+ G IL L G++ G H G + + + + L+ + +E + G+V
Sbjct: 92 GTAAGTILIYSTLKGDLHCTLDGGHSGPVNSVQWHPEDSVLYSGSDDTHIAEWDLKTGKV 151
Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKF---SDDVGPLQYVSA- 163
++KA ISS + K+ A +++ LE E KF S V L + +
Sbjct: 152 CCKWKADRSAISSLCISPDGKMLLSAGMTIKMWDLETKEVYRKFTGHSTMVTTLCFATTR 211
Query: 164 ---SDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTV 220
S+G ++ ++ L VW+ K N + ++ P ID + S N +D V
Sbjct: 212 PPDSNGM-YFLSGAVHDRLLSVWQVRSDGKDKNSVVSFTLTDEPQHIDLQTS-NSKDEAV 269
Query: 221 ILAV 224
LAV
Sbjct: 270 RLAV 273
>gi|161611380|gb|AAI55573.1| WD repeat domain 43 [Danio rerio]
gi|190336877|gb|AAI62316.1| WD repeat domain 43 [Danio rerio]
gi|190336879|gb|AAI62617.1| WD repeat domain 43 [Danio rerio]
Length = 650
Score = 52.0 bits (123), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 62/244 (25%), Positives = 102/244 (41%), Gaps = 23/244 (9%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACG-------FVGKKRRK------ERGTLLLAL 47
IW+T +L E+ P + +C+ G +KRRK + LLAL
Sbjct: 33 IWNTESKTLQQEYV-PSAHLSAACTCVTWGPCRAVQEVPQRKRRKCDAASSAEQSDLLAL 91
Query: 48 GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
GT+ G IL L G++ G H G + + + + L+ + +E + G+V
Sbjct: 92 GTAAGTILIYSTLKGDLHCTLDGGHSGPVNSVQWHPEDSVLYSGSDDTHIAEWDLKTGKV 151
Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKF---SDDVGPLQYVSA- 163
++KA ISS + K+ A +++ LE E KF S V L + +
Sbjct: 152 CCKWKADRSAISSLCISPDGKMLLSAGMTIKMWDLETKEVYRKFTGHSTMVTTLCFATTR 211
Query: 164 ---SDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTV 220
S+G ++ ++ L VW+ K N + ++ P ID + S N +D V
Sbjct: 212 PPDSNGM-YFLSGAVHDRLLSVWQVRSDGKDKNSVVSFTLTDEPQHIDLQTS-NSKDEAV 269
Query: 221 ILAV 224
LAV
Sbjct: 270 RLAV 273
>gi|195997585|ref|XP_002108661.1| hypothetical protein TRIADDRAFT_51818 [Trichoplax adhaerens]
gi|190589437|gb|EDV29459.1| hypothetical protein TRIADDRAFT_51818 [Trichoplax adhaerens]
Length = 628
Score = 51.6 bits (122), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 39/175 (22%), Positives = 83/175 (47%), Gaps = 7/175 (4%)
Query: 42 TLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMK 101
T LLA G+ G + ++V GE+ K H + + ++ + L+ +G+
Sbjct: 94 TNLLAGGSPTGSLAILNVNKGEVCSKMENGHHAAVTCICWSSED-VLYSASEDGLVIIWD 152
Query: 102 SEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYV 161
S E+ +++KA + PIS+ + LAS +++ ++E + + K++ + + Y+
Sbjct: 153 SNTAEIKKQWKAEKSPISAMCTNSDGTKLILASRSIKVWNVETQQILTKYTGHINDVAYL 212
Query: 162 SA-----SDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPA-LSMRHSPVAIDCK 210
+ A+ I+ +K+L VW+ +V+ A L++ + V +DCK
Sbjct: 213 KVLPSQKTSNAEYFISGAERDKYLNVWQMSTEKSSVHSPLATLALTNYSVGLDCK 267
>gi|195474843|ref|XP_002089699.1| GE19236 [Drosophila yakuba]
gi|194175800|gb|EDW89411.1| GE19236 [Drosophila yakuba]
Length = 626
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 60/273 (21%), Positives = 125/273 (45%), Gaps = 28/273 (10%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVG------KKRRKERGT-------LLLAL 47
IW+T +L E+ P+ + S SC A +V KK RK + L +AL
Sbjct: 30 IWNTETNTLYQEYT-PNLQ--TSGSCTALTWVSVSSPRPKKSRKSIQSQDAGGDQLYIAL 86
Query: 48 GTSNGDILAVDVLTGEMKWKSTG-RHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGE 106
GT G ++ + G++ +G H G + + + G+ L+ VG +
Sbjct: 87 GTFKGSVVIYSLAEGKISRTFSGDSHDGAVTSITYDNAGQ-LYTVGADCRVVVWSWTEER 145
Query: 107 VIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKF---SDDVGPLQYVSA 163
I E++ + + +L + A+ S ++++ ++ E V F S ++ + V++
Sbjct: 146 SIAEWQVGPEKPQNIVYLQKSGTLAIGSRQLKVYDIKTKELVETFTGHSGEINTISSVAS 205
Query: 164 SDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTV-IL 222
+DG + ++T E+ + W+ D K NK A ++ +A C S +DG + +
Sbjct: 206 TDGTEFVLTTAKMERVICFWKIDKKGK--NKASACTLLMEDLAY-CLTSEVRDDGQLRVA 262
Query: 223 AVAESGV--AYSWDLKTVSQDEKTNPAKITVKL 253
+V +G+ Y ++ ++S ++ P K+++++
Sbjct: 263 SVTRNGIIHIYLLNVNSLSAEKFVKP-KVSLQI 294
>gi|160773180|gb|AAI55127.1| WD repeat domain 43 [Danio rerio]
Length = 650
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 61/244 (25%), Positives = 101/244 (41%), Gaps = 23/244 (9%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACG-------FVGKKRRK------ERGTLLLAL 47
IW+T +L E+ P + +C+ G +KRRK + LLAL
Sbjct: 33 IWNTESKTLQQEYV-PSAHLSAACTCVTWGPCRAVQEVPQRKRRKCDAASSAEQSDLLAL 91
Query: 48 GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
T+ G IL L G++ G H G + + + + L+ + +E + G+V
Sbjct: 92 ATAAGTILIYSTLKGDLHCTLDGGHSGPVNSVQWHPEDSVLYSGSDDTHIAEWDLKTGKV 151
Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKF---SDDVGPLQYVSA- 163
++KA ISS + K+ A +++ LE E KF S V L + +
Sbjct: 152 CCKWKADRSAISSLCISPDGKMLLSAGMTIKMWDLETKEVYRKFTGHSTMVTTLCFATTR 211
Query: 164 ---SDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTV 220
S+G ++ ++ L VW+ K N + ++ P ID + S N +D V
Sbjct: 212 PPDSNGM-YFLSGAVHDRLLSVWQVRSDGKDKNSVVSFTLTDEPQHIDLQTS-NSKDEAV 269
Query: 221 ILAV 224
LAV
Sbjct: 270 RLAV 273
>gi|195332612|ref|XP_002032991.1| GM21075 [Drosophila sechellia]
gi|194124961|gb|EDW47004.1| GM21075 [Drosophila sechellia]
Length = 566
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 57/256 (22%), Positives = 113/256 (44%), Gaps = 24/256 (9%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFV---GKKRRKERGTLL---------LALG 48
IW+T +L E+ P+ + +S SC A +V G + +K R +L +ALG
Sbjct: 30 IWNTETNTLYQEFT-PNLQ--LSGSCTALTWVSVAGSRPKKSRKSLQAQDAGDQLHIALG 86
Query: 49 TSNGDILAVDVLTGEMKWKSTG-RHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
T G ++ + G++ G H G + + + G L+ VG +
Sbjct: 87 TYKGSVVIYSLAEGKISRTLIGDSHDGPVTSITYDNAGH-LYTVGADCRVVVWSLTDERS 145
Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKF---SDDVGPLQYVSAS 164
I E++ + + +L + A+ S ++++ + E V F S ++ + V+++
Sbjct: 146 IAEWQVGPEKPQNIVYLQKSGTLAIGSRQLKVYDINTKELVETFTGHSGEINTISSVAST 205
Query: 165 DGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTV-ILA 223
DG + ++T E+ + W+ D K NK A ++ +A C S +DG + + +
Sbjct: 206 DGTEFVLTTAKMERVICFWKIDKKGK--NKASACTLLMEDLAY-CLTSEVRDDGQLRVAS 262
Query: 224 VAESGVAYSWDLKTVS 239
V +G+ + + L S
Sbjct: 263 VTRNGIIHIYLLNVNS 278
>gi|121934022|gb|AAI27561.1| Wdr43 protein [Danio rerio]
Length = 408
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 62/244 (25%), Positives = 102/244 (41%), Gaps = 23/244 (9%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACG-------FVGKKRRK------ERGTLLLAL 47
IW+T +L E+ P + +C+ G +KRRK + LLAL
Sbjct: 33 IWNTESKTLQQEYV-PSAHLSAACTCVTWGPCRAVQEVPQRKRRKCDAASSAEQSDLLAL 91
Query: 48 GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
GT+ G IL L G++ G H G + + + + L+ + +E + G+V
Sbjct: 92 GTAAGTILIYSTLKGDLHCTLDGGHSGPVNSVQWHPEDSVLYSGSDDTHIAEWDLKTGKV 151
Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKF---SDDVGPLQYVSA- 163
++KA ISS + K+ A +++ LE E KF S V L + +
Sbjct: 152 CCKWKADRSAISSLCISPDGKMLLSAGMTIKMWDLETKEVYRKFTGHSTMVTTLCFATTR 211
Query: 164 ---SDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTV 220
S+G ++ ++ L VW+ K N + ++ P ID + S N +D V
Sbjct: 212 PPDSNGM-YFLSGAVHDRLLSVWQVWSDGKDKNSVVSFTLTDEPQHIDLQTS-NSKDEAV 269
Query: 221 ILAV 224
LAV
Sbjct: 270 RLAV 273
>gi|195581695|ref|XP_002080669.1| GD10611 [Drosophila simulans]
gi|194192678|gb|EDX06254.1| GD10611 [Drosophila simulans]
Length = 533
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 59/272 (21%), Positives = 126/272 (46%), Gaps = 27/272 (9%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFV---GKKRRKERGTLL---------LALG 48
IW+T +L E+ P+ + +S SC A +V G + +K R +L +ALG
Sbjct: 30 IWNTETNTLYQEFT-PNLQ--LSGSCTALTWVSVAGSRPKKSRKSLHAQDAGDQLHIALG 86
Query: 49 TSNGDILAVDVLTGEMKWKSTG-RHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
T G ++ + G++ +G H G + + + G L+ VG +
Sbjct: 87 TYKGSVVIYSLAEGKISRTLSGDSHDGPVTSITYDNAGH-LYTVGADCRVVVWSLTDERS 145
Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKF---SDDVGPLQYVSAS 164
I E++ + + +L + A+ S ++++ ++ E V F S ++ + V+++
Sbjct: 146 IAEWQVGPEKPQNIVYLQKSGTLAIGSRQLKVYDIKTKELVETFTGHSGEINTISSVAST 205
Query: 165 DGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTV-ILA 223
DG + ++T E+ + W+ D K NK A ++ +A C S +DG + + +
Sbjct: 206 DGTEFVLTTAKMERVICFWKIDKKGK--NKASACTLLMEDLAY-CLTSEVRDDGQLRVAS 262
Query: 224 VAESGV--AYSWDLKTVSQDEKTNPAKITVKL 253
V +G+ Y ++ ++S ++ P K+++++
Sbjct: 263 VTRNGIIHIYLLNVNSLSAEKFVKP-KVSLQI 293
>gi|291229286|ref|XP_002734606.1| PREDICTED: will die slowly-like [Saccoglossus kowalevskii]
Length = 649
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 55/291 (18%), Positives = 111/291 (38%), Gaps = 39/291 (13%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFV----GKKRRKERGTL------------- 43
+W T G+L ++ P + +C+A + G RRK+R +
Sbjct: 26 LWDTTTGTLNQQYT-PSSHLSATCTCVAWSTLRESRGAPRRKKRKSEGIRKGVASSITEN 84
Query: 44 --LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMK 101
LLALGTS G IL + G+++ K G H + L + + SL+ + +E
Sbjct: 85 VDLLALGTSVGSILLYSAVKGDLQSKMDGGHADTVNCLCWHLEEDSLYSCSDDHHITEWS 144
Query: 102 SEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYV 161
G+V ++K + I + + A+ +++ +L+N E + +++ P+ +
Sbjct: 145 VSTGKVKCKWKGDKSAIHCISICPGGTVMLSAARTIKVWNLKNKELLKRYTGHASPVSQI 204
Query: 162 SASDGAK-------------------IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRH 202
+ A ++ ++ L W +SK N A S+
Sbjct: 205 NCVPMATRTISISNDEDDALHDVNGLYFLSTAVHDRVLNAWHIKSNSKDKNAVAAFSLIE 264
Query: 203 SPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQDEKTNPAKITVKL 253
P++ D + I + +G +D + +K + TV++
Sbjct: 265 EPISFDVAQCLQSDQSIFIAVLTRTGQVQVFDPTLNGKSKKPIKPRCTVQV 315
>gi|325189290|emb|CCA23810.1| conserved hypothetical protein [Albugo laibachii Nc14]
Length = 1146
Score = 48.9 bits (115), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 47/212 (22%), Positives = 97/212 (45%), Gaps = 15/212 (7%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLA-CGFVGKKRRKERGTLL----LALGTSNGDIL 55
IW+ + GSL+ E ++ D Y+C++ + K+RG + +A+GT+ G IL
Sbjct: 537 IWNVSSGSLVHELRERDHLKN-KYTCISWTSSPNRSDSKKRGNSVSLGRIAVGTTQGSIL 595
Query: 56 AVDVLTGEMKW-----KSTGRHPGGLAGLAFAKKGRSLHVVGTN-GMASEMKSEMGEVIR 109
D+ GEM +S+ H + +AF +G L+ + A E G+V+R
Sbjct: 596 LWDLDRGEMTQTLGSKESSVSHSHAVQDIAFNAQGTRLYSCASQEKKALEWDVVAGKVLR 655
Query: 110 EFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDV-GPLQYVSASDGAK 168
FK S++ + ++ + A+ + +R+ L +G++ +F + + ++ S
Sbjct: 656 SFKVSKEGGNRLCISQDDDLLAVGGNSIRVFDLSSGKKSRQFRAGIPSTVHQLNFSSCKN 715
Query: 169 IIITAGYGEKHLQVWRCDISSKTVNKGPALSM 200
+ + E+ + ++ D+ + LSM
Sbjct: 716 FLFASALSERFINIY--DLEKDVDDTAMILSM 745
>gi|260818743|ref|XP_002604542.1| hypothetical protein BRAFLDRAFT_122327 [Branchiostoma floridae]
gi|229289869|gb|EEN60553.1| hypothetical protein BRAFLDRAFT_122327 [Branchiostoma floridae]
Length = 657
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 58/258 (22%), Positives = 106/258 (41%), Gaps = 37/258 (14%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFV------GKKRRKERGTL-------LLAL 47
+W T+ G+L E+ P + SCLA G + KKR+++ + ++A+
Sbjct: 34 LWDTDSGALRQEYT-PSAHLSATCSCLAWGPLREARNEHKKRKRKSDQMSALSSLDIVAM 92
Query: 48 GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
GT+ G IL V+ G+++ + TG H + L + + SL + E +V
Sbjct: 93 GTTAGTILLYSVVKGDLQSQLTGGHDDIVNCLCWRAEENSLFSCSDDQHIVEWTVTTAQV 152
Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDD---VGPLQYVS-- 162
++KA P+ + + K A+ +++ LE E +L F+ V L + +
Sbjct: 153 KCKWKADRGPVQTISLGPGGKTLLSAARTIKLWDLEKREVLLSFTGHASLVTSLLFATLP 212
Query: 163 ----------------ASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVA 206
A DG + A + ++ + W+ + SK + S+ PV
Sbjct: 213 PPTANGRRKSVTNQEAAVDGLYFLSGATH-DRLMNAWQIKMKSKEKTAVVSFSLTEEPVY 271
Query: 207 IDCKNSPNGEDGTVILAV 224
+D P +D LAV
Sbjct: 272 VDVLR-PRQKDQPAHLAV 288
>gi|443684116|gb|ELT88135.1| hypothetical protein CAPTEDRAFT_102327 [Capitella teleta]
Length = 564
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 52/248 (20%), Positives = 106/248 (42%), Gaps = 24/248 (9%)
Query: 34 KKRRKERGTL-----LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSL 88
K+R+ E G L+A+GT++G +L + G + + G H + + ++ +G+ L
Sbjct: 62 KRRKSEVGKAVHSLELVAIGTTSGSLLLYSINKGSLHSELNGGHTDQVHCVCWSSEGQRL 121
Query: 89 HVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEV 148
+ E E V ++K + P++S ++ A +++ LE E +
Sbjct: 122 FSSSEDQHIIEWSLETASVKHKWKGDKTPVTSVLVCPNDRHLLSAGRSIKLWDLETREML 181
Query: 149 LKF---SDDVGPLQYVS---------ASDG------AKIIITAGYGEKHLQVWRCDISSK 190
KF S +V L VS A++G ++A G++ + W+ +++SK
Sbjct: 182 KKFTGHSSEVISLFAVSPACNVENGAATNGHCSDIDGSYFLSAAKGDRMINAWQINLASK 241
Query: 191 TVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQDEKTNPAKIT 250
N + S+ P A+ + P ++ V SG ++ + +K KI+
Sbjct: 242 DKNSLASFSLPEEPCAMHL-SQPQQAKPFLMSVVTRSGQLLVFEYTLNGRAKKPLTPKIS 300
Query: 251 VKLKKADA 258
+++ DA
Sbjct: 301 IQVATEDA 308
>gi|434404035|ref|YP_007146920.1| WD40 repeat-containing protein [Cylindrospermum stagnale PCC 7417]
gi|428258290|gb|AFZ24240.1| WD40 repeat-containing protein [Cylindrospermum stagnale PCC 7417]
Length = 1215
Score = 47.4 bits (111), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 46/189 (24%), Positives = 90/189 (47%), Gaps = 17/189 (8%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVL 60
+W+ DG LL +KQP +V+ L GK LA+ S+GDI +++
Sbjct: 992 LWNVADGRLLKTFKQP--RSIVA--DLNFSPDGKT---------LAVACSDGDIKILNLK 1038
Query: 61 TGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISS 120
T + +S H + ++F+ G+ L G++ +E G ++ + +++
Sbjct: 1039 TATLT-QSFPAHSSWVNTISFSPNGKILASGGSDSKVKLWNAENGRLLFTLEGHLSNVTN 1097
Query: 121 SAFLCEEKIFALAS--SEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEK 178
+F + KI A +S S VR+ ++ENG E+ +G + V S K + +AG +
Sbjct: 1098 ISFSPDSKILASSSDDSTVRVWNVENGLEISILEGHLGSVTSVMFSPDGKTLASAGL-DN 1156
Query: 179 HLQVWRCDI 187
+++W+ ++
Sbjct: 1157 TIKMWKLEL 1165
>gi|390562647|ref|ZP_10244835.1| hypothetical protein NITHO_3120009 [Nitrolancetus hollandicus Lb]
gi|390172775|emb|CCF84147.1| hypothetical protein NITHO_3120009 [Nitrolancetus hollandicus Lb]
Length = 271
Score = 47.4 bits (111), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 42/143 (29%), Positives = 67/143 (46%), Gaps = 6/143 (4%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
+ A G+ +G I DV +G+ G H G+ +AF+ GR L G +G
Sbjct: 85 IAASGSGDGSIRLWDVASGKEAATLPG-HAWGVWSVAFSHDGRQLLSGGGDGALRLWDVT 143
Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSE---VRILSLENGEEVLKFSDDVGPLQY 160
GE IR F E + ++A + + AL+ S +R+ LE GEE+ K+ G ++
Sbjct: 144 TGEEIRHFFGHEDWVYTAA-ISPDGRHALSGSRDETIRLWDLETGEEIRKYEGHEGTVES 202
Query: 161 VSASDGAKIIITAGYGEKHLQVW 183
V S + I+ G E L++W
Sbjct: 203 VVFSPDGRRFISHGEDET-LRLW 224
>gi|321478820|gb|EFX89777.1| hypothetical protein DAPPUDRAFT_303119 [Daphnia pulex]
Length = 673
Score = 47.0 bits (110), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 45/225 (20%), Positives = 97/225 (43%), Gaps = 6/225 (2%)
Query: 33 GKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVG 92
G++R G+ ++ALGT++G+IL + E+ + G H G + L F G SL
Sbjct: 77 GEERSGVAGSDIIALGTASGNILLYSLNRAELVQQLKGGHTGKIRSLCFNSDGSSLFSAS 136
Query: 93 TNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFS 152
++ E G + +K + +S L ++ +A ++ ++E+ V F+
Sbjct: 137 SDQKIVEWDPVEGVMKSHWKVEKIHVSRICMLQDDCTLVIAGRSIQCWNVESRSIVATFT 196
Query: 153 DDVGPLQYVSAS--DGAKI--IITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAID 208
+ ++ G K+ ++A G++ + WR K + +++ P
Sbjct: 197 GHTTEITHLLPVYLPGTKVGYFLSAADGDRCISAWRFVGEEKNGQSQASFTLQDDPYNFT 256
Query: 209 CKNSPNGEDGTVILAVAESGVAYSWDLKTVSQDEKTNPAKITVKL 253
+ +D V+ AV +G+ + ++ + +K P + VK+
Sbjct: 257 VSKPVSDKDPVVLAAVTVTGILHIFEHHLNGRSKK--PLQPVVKI 299
>gi|427783733|gb|JAA57318.1| Hypothetical protein [Rhipicephalus pulchellus]
Length = 637
Score = 47.0 bits (110), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 64/304 (21%), Positives = 121/304 (39%), Gaps = 38/304 (12%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACG------FVGKKRRKERGTL----------- 43
+W T+ G L E+ P +CL+ G F K+RK + T
Sbjct: 28 LWDTSTGRLKQEYT-PSAHLSAKCTCLSWGPTRQQAFAAHKKRKRQKTTDDAGDVLSEAS 86
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
LLA+GT+ G +L + G++ + TG H + G+++ SL V +
Sbjct: 87 LLAMGTAEGTVLLYSLTRGDLHSELTGGHTAAVNGVSWHASSDSLFSVSDDQHIVHWSVS 146
Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSA 163
+V ++KA + + A + + ASS ++ +++ V+KF+ V ++ +
Sbjct: 147 SCKVKSKWKADRHSVYAVAAVDTNTVLT-ASSSIKWWNVDTKSIVMKFTGHVTEVRQI-- 203
Query: 164 SDGAKIIITAGYGEKHLQVWRC-----DISSKTVNKGPALS------MRHSPVAIDCKNS 212
+ + + GE C +S+ + KG S + V I +S
Sbjct: 204 ---LPVFVPSKSGEVRTYFLSCAANDRQVSAWHLEKGAGSSSLANFVLSEEAVHISVSHS 260
Query: 213 PNGEDGTVILAVAESGVAYSWDLKTVSQDEKTNPAKITVKLKKADADQQNSVNVKKSRTS 272
+ AV +SG + ++L+ + + K TV++ + QN K +
Sbjct: 261 SENNKTAHLSAVTKSGALHIFELQLNGRCKTPMQPKYTVQVA---TETQNGRAPKPQPVA 317
Query: 273 IIAA 276
I+AA
Sbjct: 318 ILAA 321
>gi|257094183|ref|YP_003167824.1| WD-40 repeat-containing protein [Candidatus Accumulibacter phosphatis
clade IIA str. UW-1]
gi|257046707|gb|ACV35895.1| WD-40 repeat protein [Candidatus Accumulibacter phosphatis clade IIA
str. UW-1]
Length = 1737
Score = 46.6 bits (109), Expect = 0.016, Method: Composition-based stats.
Identities = 42/178 (23%), Positives = 80/178 (44%), Gaps = 21/178 (11%)
Query: 67 KSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCE 126
+S H +A +AF+ GR L + +E G+ IR F P++S AF +
Sbjct: 1330 RSFAGHQSVVASVAFSPDGRHLVSGSWDDSLLLWNAETGQEIRSFVGHHGPVASVAFSPD 1389
Query: 127 EK--IFALASSEVRILSLENGEEVLKFSDDVGPLQYV-SASDGAKIIITAGYGEKHLQVW 183
+ + +R+ E G+E+ ++ GP+ V S++DG +++ +G + L++W
Sbjct: 1390 GRRLLSGTWDQTLRLWDAETGQEIRSYTGHQGPVAGVASSADGRRLL--SGSDDHTLRLW 1447
Query: 184 RCDISSK----TVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKT 237
+ + ++GPA S+ SP DG +L+ ++ WD +T
Sbjct: 1448 DAETGQEIRFFAGHQGPATSVAFSP------------DGRRLLSGSDDHTLRLWDAET 1493
Score = 46.2 bits (108), Expect = 0.021, Method: Composition-based stats.
Identities = 50/179 (27%), Positives = 86/179 (48%), Gaps = 14/179 (7%)
Query: 58 DVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKP 117
D TGE + +S H GG+A +AF+ GR L + +E G+ IR F +
Sbjct: 1112 DAETGE-EIRSFAGHQGGVASVAFSPDGRRLLSGSDDQTLRLWDAETGQEIRSFTGHQGG 1170
Query: 118 ISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVSAS-DGAKIIITAG 174
+ S AF + + S + +R+ E G+E+ F+ + V+ S DG +++ +G
Sbjct: 1171 VLSVAFSPDGRRLLSGSRDQTLRLWDAETGQEIRSFAGHQSAVTSVALSPDGRRLL--SG 1228
Query: 175 YGEKHLQVWRC----DISSKTVNKGPALSMRHSPVAIDCKNSPNGE-DGTVILAVAESG 228
++ L++W +I S T ++G S+ SP D + +G D T+ L AE+G
Sbjct: 1229 SHDRTLRLWDAETGQEIRSFTGHQGGVASVAFSP---DGRRLLSGSFDQTLRLWDAETG 1284
Score = 44.7 bits (104), Expect = 0.068, Method: Composition-based stats.
Identities = 45/189 (23%), Positives = 84/189 (44%), Gaps = 18/189 (9%)
Query: 3 STNDGSLLAEWKQPDGEPVVSYS-----CLACGFVGKKRRKERGTLLLALGTSNGDILAV 57
S +D L W G+ + S++ + F RR L G+ + +
Sbjct: 1479 SGSDDHTLRLWDAETGQEIRSFAGHQDWVTSVAFSPDGRR-------LLSGSHDHTLRLW 1531
Query: 58 DVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKP 117
D +G+ + +S H G + +AF+ GR L + +E G+ IR F + P
Sbjct: 1532 DAESGQ-EIRSFAGHQGWVLSVAFSPDGRRLLSGSDDQTLRLWDAESGQEIRSFAGHQGP 1590
Query: 118 ISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVSAS-DGAKIIITAG 174
++S AF + + S + +R+ E G+E+ F+ GP+ V+ S DG +++ +G
Sbjct: 1591 VTSVAFSPDGRRLLSGSRDQTLRLWDAETGQEIRSFAGHQGPVASVAFSPDGRRLL--SG 1648
Query: 175 YGEKHLQVW 183
+ L++W
Sbjct: 1649 SHDGTLRLW 1657
>gi|195430950|ref|XP_002063511.1| GK21367 [Drosophila willistoni]
gi|194159596|gb|EDW74497.1| GK21367 [Drosophila willistoni]
Length = 653
Score = 46.6 bits (109), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 54/243 (22%), Positives = 103/243 (42%), Gaps = 26/243 (10%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFV---------GKKRRKERG-------TLL 44
IW T SL E+ +S C A +V KK RK + L
Sbjct: 30 IWDTEANSLKQEFT---ANLYLSGGCTALTWVSVGAPPTPRAKKSRKSQNDSVVGSNKLY 86
Query: 45 LALGTSNGDILAVDVLTGEMKWKSTG-RHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
+ALGT G + + G+++ +G H G + +A+ + L+ VGT+ A
Sbjct: 87 IALGTLKGIVSLYSLAEGQIERTLSGDNHDGPVTSIAYNQSKGHLYTVGTDCRALVWSIA 146
Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKF---SDDVGPLQY 160
I +++ + + +L + + A+ + ++I +E E V F S D+ +
Sbjct: 147 EERCIADWQVGPEKPHNIVYLAKSRSLAVGARSLKIFDVETKELVETFTGHSSDINAMTS 206
Query: 161 VSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTV 220
++ +D + I+T E+ + W+ D + NK A ++ +A + + EDG++
Sbjct: 207 IADND-LEYILTTARMERVISFWKIDKRGR--NKASACTLLMEDLAYCLASEVHTEDGSL 263
Query: 221 ILA 223
+A
Sbjct: 264 RVA 266
>gi|403414246|emb|CCM00946.1| predicted protein [Fibroporia radiculosa]
Length = 359
Score = 46.6 bits (109), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 47/184 (25%), Positives = 85/184 (46%), Gaps = 21/184 (11%)
Query: 72 HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFA 131
H ++ L F+ G L +G+ ++ GE+I + ++ IS A+ + + A
Sbjct: 64 HRRSISSLKFSFDGTKLASSAADGLVKIWDADSGEIIHTLQGHDEGISDIAWSPDNEFLA 123
Query: 132 LASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDIS- 188
AS + +RI S+E V + V+ + + ++++ G+ E ++VW D++
Sbjct: 124 SASDDKTIRIWSMETMSSVNVLKGHTNFVFCVNFNPKSNLLVSGGFDET-VRVW--DVAR 180
Query: 189 SKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWD------LKTVSQDE 242
+T+ PA S PV N DGT+I + A G+ WD LKT++ D+
Sbjct: 181 GRTLKTLPAHS---DPVTAVTFN----HDGTLIASCAMDGLIRIWDSESGQCLKTLADDD 233
Query: 243 KTNP 246
NP
Sbjct: 234 --NP 235
>gi|195425626|ref|XP_002061096.1| GK10631 [Drosophila willistoni]
gi|194157181|gb|EDW72082.1| GK10631 [Drosophila willistoni]
Length = 638
Score = 46.2 bits (108), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 47/217 (21%), Positives = 100/217 (46%), Gaps = 10/217 (4%)
Query: 43 LLLALGTSNGDILAVDVLTGEMKWKSTG-RHPGGLAGLAFAKKGRSLHVVGTNGMASEMK 101
L +ALGT G + + G+++ +G H G + +A+ + L+ VG + A
Sbjct: 342 LYIALGTMGGIVSLYSLAEGQIERTLSGDNHDGPVTSIAYNQSKGHLYTVGEDCRALVWS 401
Query: 102 SEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKF---SDDVGPL 158
I +++ + + +L + + A+ + ++I +E E V F S D+ +
Sbjct: 402 ITEERCIADWQVGPEKPQNVVYLAKSRSLAVGARSLKIFDVETKELVETFTGHSSDINAM 461
Query: 159 QYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDG 218
++ +D + I+T + E+ + W+ D K NK A ++R A EDG
Sbjct: 462 TSIADND-LEFILTTAHMERVISFWKIDKRGK--NKASACTLRMEDPAYCLACEVRTEDG 518
Query: 219 TVILA-VAESGV-AYSWDLKTVSQDEKTNPAKITVKL 253
++ +A V +G+ Y + +S ++ P K+++++
Sbjct: 519 SLRVASVTRNGIHIYLLKINGLSSEKHIKP-KVSLQI 554
>gi|405971706|gb|EKC36529.1| WD repeat-containing protein 43 [Crassostrea gigas]
Length = 630
Score = 46.2 bits (108), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 49/258 (18%), Positives = 106/258 (41%), Gaps = 30/258 (11%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACG------FVGKKRRKERGTL-------LLAL 47
+W ++G L E+ P + +CL+ G G+K+R++ G + L+A+
Sbjct: 27 LWEADNGVLKQEYT-PSAHLSATCTCLSWGPKRNLDTPGRKKRRKSGGVAITEQYDLIAV 85
Query: 48 GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
GT+ G IL ++ G++K H ++ + + + ++ + G+V
Sbjct: 86 GTTAGSILIYSIVKGDVKTIMEEGHSDSVSDVCWHPENNTIFSCSCDRHIIHWDVISGKV 145
Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDV-----------G 156
++K I S LC + + +++ LE + KF+
Sbjct: 146 KHKWKGDSGSIHSIC-LCAQNHLLSSGRSIKLWDLETYSVLKKFTGHATEVFRLLPILSN 204
Query: 157 PLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGE 216
P + + +G ++A ++ + VW+ + + K + S+ PV +D + + +
Sbjct: 205 PTELPDSPEGV-YFLSAALNDRLVNVWQVNTAVKDKSSTATFSLPEDPVMLDLRKN---Q 260
Query: 217 DGTVILAVAESGVAYSWD 234
D ++ AV SG +D
Sbjct: 261 DEVILAAVTRSGQVLIFD 278
>gi|427737264|ref|YP_007056808.1| WD40 repeat-containing protein [Rivularia sp. PCC 7116]
gi|427372305|gb|AFY56261.1| WD40 repeat-containing protein [Rivularia sp. PCC 7116]
Length = 927
Score = 45.8 bits (107), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 56/204 (27%), Positives = 84/204 (41%), Gaps = 24/204 (11%)
Query: 43 LLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAG---LAFAKKGRSLHVVGTNGMASE 99
L +A G S +I D+ +GE K + H G AG L F+ G+ L G +
Sbjct: 695 LTIASGDSKNNIYIWDINSGE-KIRILEGHTGRFAGVNSLKFSPDGQILASAGGDKTVKL 753
Query: 100 MKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGE--EVLKFSDDV 155
G I K E+ +SS AF + KIFA S++ L GE E K +D++
Sbjct: 754 WNLNTGAEIMTLKGHERWVSSVAFSPDGKIFASGSADETANFWDLTTGEILETFKHNDEI 813
Query: 156 GPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVN-KGPALSMRHSPVAIDCKNSPN 214
+ + + I G + +++W + KG S+R+ SPN
Sbjct: 814 RSIAFSPNGE----IFATGSNDNTIKLWSVSNKEEVCTLKGHKRSIRYITF------SPN 863
Query: 215 GEDGTVILAVAESGVAYS-WDLKT 237
GE ILA + G WD+ T
Sbjct: 864 GE----ILATSSYGNDIKLWDMNT 883
>gi|84995168|ref|XP_952306.1| pre-mRNA splicing factor (PRP17 homologue) [Theileria annulata
strain Ankara]
gi|65302467|emb|CAI74574.1| pre-mRNA splicing factor (PRP17 homologue), putative [Theileria
annulata]
Length = 693
Score = 45.4 bits (106), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 32/149 (21%), Positives = 72/149 (48%), Gaps = 5/149 (3%)
Query: 41 GTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEM 100
G +LL+ G +G + D+ + + G H G+ ++F + G + +
Sbjct: 417 GNVLLSCGM-DGFVKVWDINSQKCLRNYKG-HAKGVRDVSFIENGNKFYSASFDSNVILW 474
Query: 101 KSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLE--NGEEVLKFSDDVGPL 158
+E G+VI +K + P + + ++ IF + +++ ++ +GE VL++S+ +G +
Sbjct: 475 DTEYGKVIGVYKIEKTPYCLTPYPLDDNIFLVGGDSNKVIQMDARSGEMVLEYSEHMGCV 534
Query: 159 QYVSASDGAKIIITAGYGEKHLQVWRCDI 187
++ D + I+T +K + VW +I
Sbjct: 535 NTITFIDNNRRIVTTA-DDKRVLVWDYNI 562
>gi|296269459|ref|YP_003652091.1| pentapeptide repeat-containing protein [Thermobispora bispora DSM
43833]
gi|296092246|gb|ADG88198.1| pentapeptide repeat protein [Thermobispora bispora DSM 43833]
Length = 1807
Score = 45.4 bits (106), Expect = 0.036, Method: Composition-based stats.
Identities = 53/202 (26%), Positives = 85/202 (42%), Gaps = 25/202 (12%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
LLA G G + D TGE+ + TG HPGG+ +AF G +L T G
Sbjct: 1307 LLAAGDGQGVLCLWDPYTGELLHRLTG-HPGGICAIAFHPDGHALVSGDTEGTVRLWDPH 1365
Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPL-QY 160
G+++ E I AF ++F SE VR+ S +GE++ + S G + +
Sbjct: 1366 TGQLMGTLSGHEGAIYHVAFSPSGELFVTGDSEGVVRVWS-ASGEQLAELSGHRGSVWPF 1424
Query: 161 VSASDGAKIIITAGYGEKHLQVW-----RCDISSKTVNKGPALSMRHSPVAIDCKNSPNG 215
G +++ ++ G +++W RC + V +G R + VA
Sbjct: 1425 AFHPKGHRLVTSSSDG--MIRLWDPRTGRC----RRVLRGHG--RRINSVAFSA------ 1470
Query: 216 EDGTVILAVAESGVAYSWDLKT 237
DG ++ A G WD +T
Sbjct: 1471 -DGRMLAACGSDGYVRLWDPQT 1491
>gi|186681444|ref|YP_001864640.1| hypothetical protein Npun_F0966 [Nostoc punctiforme PCC 73102]
gi|186463896|gb|ACC79697.1| WD-40 repeat protein [Nostoc punctiforme PCC 73102]
Length = 1211
Score = 45.4 bits (106), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 42/170 (24%), Positives = 83/170 (48%), Gaps = 20/170 (11%)
Query: 72 HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFA 131
H G ++ ++F+ G+++ ++ A + + G++++EFK ++ ++S +F + K A
Sbjct: 763 HQGEVSSVSFSPDGKTIATASSDKTA-RLWNLQGQLLQEFKGHQRGVNSVSFSLDGKTIA 821
Query: 132 LASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVW---RCD 186
ASS+ R+ +L+ G+ + +F G + VS S K I T+ +K ++W R
Sbjct: 822 TASSDKTARLWNLQ-GQLLQEFKGHQGLVLSVSFSPDGKTIATSS-DDKTARLWNLQRQL 879
Query: 187 ISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLK 236
+ ++G S+ SP DG I +E G A W+L+
Sbjct: 880 LQEFKGHQGEVSSVSFSP------------DGKTIATASEDGTAQLWNLQ 917
Score = 38.1 bits (87), Expect = 5.4, Method: Compositional matrix adjust.
Identities = 29/111 (26%), Positives = 61/111 (54%), Gaps = 5/111 (4%)
Query: 79 LAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSE-- 136
++F+ G+++ ++ A ++ + G++++EFK + + S +F + K A ASS+
Sbjct: 1064 VSFSPDGKTIATASSDNTA-QLWNLQGQLLQEFKGHQGLVLSVSFSPDGKTIATASSDNT 1122
Query: 137 VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDI 187
R+ +L+ G+ + +F + VS S K I TA Y +K +++W D+
Sbjct: 1123 ARLWNLQ-GQLLQEFKGHQRGVNSVSFSPDGKTIATASY-DKTIKLWDLDL 1171
>gi|24651910|ref|NP_610421.1| CG30349 [Drosophila melanogaster]
gi|21645568|gb|AAF59028.2| CG30349 [Drosophila melanogaster]
gi|281183429|gb|ADA53583.1| FI03455p [Drosophila melanogaster]
Length = 620
Score = 45.4 bits (106), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 58/273 (21%), Positives = 124/273 (45%), Gaps = 28/273 (10%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFV---GKKRRKERGTLL----------LAL 47
IW+T +L E+ P+ + +S SC A +V G + +K R +L +AL
Sbjct: 30 IWNTETNTLYQEFT-PNLQ--LSGSCTALTWVLVAGSRPKKSRKSLQAQDAAGDQLHIAL 86
Query: 48 GTSNGDILAVDVLTGEMKWKSTG-RHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGE 106
GT G ++ + G++ +G H G + + G L+ VG +
Sbjct: 87 GTYKGSVVIYSLAEGKISRTLSGDSHDGPVTSITNDNAGH-LYTVGADCRVVVWSLTDER 145
Query: 107 VIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKF---SDDVGPLQYVSA 163
I E++ + + +L + A+ ++++ ++ E V F S ++ + V++
Sbjct: 146 SIAEWQVGPEKPQNIVYLQKSGTLAIGYRQLKVYDIKTKELVETFTGHSGEINTISSVAS 205
Query: 164 SDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTV-IL 222
+DG + ++T E+ + W+ D K NK A ++ +A C S +DG + +
Sbjct: 206 TDGTEFVLTTAKMERVICFWKIDKKGK--NKASACTLLMEDLAY-CLTSEVRDDGQLRVA 262
Query: 223 AVAESGV--AYSWDLKTVSQDEKTNPAKITVKL 253
+V +G+ Y ++ ++S ++ P K+++++
Sbjct: 263 SVTRNGIIHIYLLNVNSLSAEKFVKP-KVSLQI 294
>gi|19527533|gb|AAL89881.1| RE25260p [Drosophila melanogaster]
Length = 620
Score = 45.4 bits (106), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 58/273 (21%), Positives = 124/273 (45%), Gaps = 28/273 (10%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFV---GKKRRKERGTLL----------LAL 47
IW+T +L E+ P+ + +S SC A +V G + +K R +L +AL
Sbjct: 30 IWNTETNTLYQEFT-PNLQ--LSGSCTALTWVLVAGSRPKKSRKSLQAQDAAGDQLHIAL 86
Query: 48 GTSNGDILAVDVLTGEMKWKSTG-RHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGE 106
GT G ++ + G++ +G H G + + G L+ VG +
Sbjct: 87 GTYKGSVVIYSLAEGKISRTLSGDSHDGPVTSITNDNAGH-LYTVGADCRVVVWSLTDER 145
Query: 107 VIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKF---SDDVGPLQYVSA 163
I E++ + + +L + A+ ++++ ++ E V F S ++ + V++
Sbjct: 146 SIAEWQVGPEKPQNIVYLQKSGTLAIGYRQLKVYDIKTKELVETFTGHSGEINTISSVAS 205
Query: 164 SDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTV-IL 222
+DG + ++T E+ + W+ D K NK A ++ +A C S +DG + +
Sbjct: 206 TDGTEFVLTTAKMERVICFWKIDKKGK--NKASACTLLMEDLAY-CLTSEVRDDGQLRVA 262
Query: 223 AVAESGV--AYSWDLKTVSQDEKTNPAKITVKL 253
+V +G+ Y ++ ++S ++ P K+++++
Sbjct: 263 SVTRNGIIHIYLLNVNSLSAEKFVKP-KVSLQI 294
>gi|117558475|gb|AAI25999.1| LOC398447 protein [Xenopus laevis]
Length = 687
Score = 45.4 bits (106), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 56/254 (22%), Positives = 103/254 (40%), Gaps = 28/254 (11%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACG-------FVGKKRRKERGTL------LLAL 47
+W T G L E+ P + +CLA KK+RK + LL +
Sbjct: 56 VWDTQGGGLRKEYV-PSAHLSATCTCLAWAQGRADKETHQKKKRKSEASDRAGHYDLLTI 114
Query: 48 GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
GT+ G IL +++ GE++ K G H G + + + + SL+ + E ++ +V
Sbjct: 115 GTATGTILLYNIVKGELQSKLVGGHDGRVNCVRWHHENNSLYSCSEDKHIIEWNTQTCKV 174
Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKF---SDDVGPLQY--VS 162
++K +SS + KI A +++ LE E +F S V L + V
Sbjct: 175 KCKWKGDNSSVSSLCISPDGKILLSAGRTIKLWDLETKEVYRQFTGHSTAVTSLMFLTVQ 234
Query: 163 ASDGAKII--------ITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPN 214
G+ I ++ ++ + VW+ S K + + ++ P+ +D S +
Sbjct: 235 PPQGSHPIKDSTGLYFLSGAVHDRLISVWQVR-SEKEKSSVLSFTLTEPPLFVDLSPSES 293
Query: 215 GEDGTVILAVAESG 228
E+ + V G
Sbjct: 294 KEEPLKLAVVCRDG 307
>gi|27882005|gb|AAH43856.1| LOC398447 protein, partial [Xenopus laevis]
Length = 665
Score = 45.4 bits (106), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 56/254 (22%), Positives = 103/254 (40%), Gaps = 28/254 (11%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACG-------FVGKKRRKERGTL------LLAL 47
+W T G L E+ P + +CLA KK+RK + LL +
Sbjct: 35 VWDTQGGGLRKEY-VPSAHLSATCTCLAWAQGRADKETHQKKKRKSEASDRAGHYDLLTI 93
Query: 48 GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
GT+ G IL +++ GE++ K G H G + + + + SL+ + E ++ +V
Sbjct: 94 GTATGTILLYNIVKGELQSKLVGGHDGRVNCVRWHHENNSLYSCSEDKHIIEWNTQTCKV 153
Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKF---SDDVGPLQY--VS 162
++K +SS + KI A +++ LE E +F S V L + V
Sbjct: 154 KCKWKGDNSSVSSLCISPDGKILLSAGRTIKLWDLETKEVYRQFTGHSTAVTSLMFLTVQ 213
Query: 163 ASDGAKII--------ITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPN 214
G+ I ++ ++ + VW+ S K + + ++ P+ +D S +
Sbjct: 214 PPQGSHPIKDSTGLYFLSGAVHDRLISVWQVR-SEKEKSSVLSFTLTEPPLFVDLSPSES 272
Query: 215 GEDGTVILAVAESG 228
E+ + V G
Sbjct: 273 KEEPLKLAVVCRDG 286
>gi|348515929|ref|XP_003445492.1| PREDICTED: WD repeat-containing protein 43 [Oreochromis niloticus]
Length = 654
Score = 45.1 bits (105), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 55/248 (22%), Positives = 102/248 (41%), Gaps = 22/248 (8%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFV-----GKKRRK--------ERGTLLLAL 47
IW+T+ +L E+ P + +C+A G G +R+K E LLA+
Sbjct: 35 IWNTDSKTLHQEY-VPSAHLSATCTCIAWGPCRTVKEGPQRKKRKSEAVQVEEKADLLAM 93
Query: 48 GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
GT+ G +L G + G H GG+ + + + L+ + E + G+
Sbjct: 94 GTAAGTVLIYSTAKGALHCTLDGAHSGGVNCVQWHPEESLLYSGSDDTNIVEWDLQTGKT 153
Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKF---SDDVGPLQYVSA- 163
++KA ++S + K+ A +++ L+ E KF S V L++ +
Sbjct: 154 RSKWKADRAAVTSLCVSPDGKLLLSAGHIIKMWDLDTKEVYRKFTGHSTAVTTLRFATTR 213
Query: 164 ---SDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTV 220
S+G + A + ++ L VW+ K N + ++ P ID S + E+
Sbjct: 214 PPDSNGLYFLSGAAH-DRLLSVWQVREDGKDKNSVVSFTLTDEPQHIDLVTSNSKEEAVR 272
Query: 221 ILAVAESG 228
+ V + G
Sbjct: 273 LAVVCKDG 280
>gi|50927202|gb|AAH79723.1| LOC398447 protein, partial [Xenopus laevis]
Length = 667
Score = 45.1 bits (105), Expect = 0.045, Method: Compositional matrix adjust.
Identities = 56/254 (22%), Positives = 103/254 (40%), Gaps = 28/254 (11%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACG-------FVGKKRRKERGTL------LLAL 47
+W T G L E+ P + +CLA KK+RK + LL +
Sbjct: 37 VWDTQGGGLRKEYV-PSAHLSATCTCLAWAQGRADKETHQKKKRKSEASDRAGHYDLLTI 95
Query: 48 GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
GT+ G IL +++ GE++ K G H G + + + + SL+ + E ++ +V
Sbjct: 96 GTATGTILLYNIVKGELQSKLVGGHDGRVNCVRWHHENNSLYSCSEDKHIIEWNTQTCKV 155
Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKF---SDDVGPLQY--VS 162
++K +SS + KI A +++ LE E +F S V L + V
Sbjct: 156 KCKWKGDNSSVSSLCISPDGKILLSAGRTIKLWDLETKEVYRQFTGHSTAVTSLMFLTVQ 215
Query: 163 ASDGAKII--------ITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPN 214
G+ I ++ ++ + VW+ S K + + ++ P+ +D S +
Sbjct: 216 PPQGSHPIKDSTGLYFLSGAVHDRLISVWQVR-SEKEKSSVLSFTLTEPPLFVDLSPSES 274
Query: 215 GEDGTVILAVAESG 228
E+ + V G
Sbjct: 275 KEEPLKLAVVCRDG 288
>gi|425449150|ref|ZP_18828993.1| Genome sequencing data, contig C309 (fragment) [Microcystis
aeruginosa PCC 7941]
gi|389764351|emb|CCI09334.1| Genome sequencing data, contig C309 (fragment) [Microcystis
aeruginosa PCC 7941]
Length = 277
Score = 45.1 bits (105), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 50/195 (25%), Positives = 87/195 (44%), Gaps = 10/195 (5%)
Query: 45 LALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEM 104
LA G+S+ I DV TG+ TG H ++ ++F+ G++L +G E
Sbjct: 52 LATGSSDNTIKLWDVETGQQIRTLTG-HNSYVSSVSFSSDGKTLVSGSDDGTIKLWNVET 110
Query: 105 GEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVS 162
G+ IR ++S++F + K A S + +++ ++E GEE+ S G + VS
Sbjct: 111 GQEIRTLSGHNYSVNSASFSNDGKTLATGSRDDTIKLWNVETGEEIRTLSGHNGYVNSVS 170
Query: 163 ASDGAKIIITAGYGEKHLQVWRCDISSKTVN-KGPALSMRHSPVAIDCKNSPNG-EDGTV 220
S K + T + + +++W + + G S+ + D K G +DGT+
Sbjct: 171 FSPDGKTLATGSW-DSTIKLWNVETGEEIRTLSGHNYSVNSVSFSPDGKTLATGSDDGTI 229
Query: 221 ILAVAESGVAYSWDL 235
L E Y W L
Sbjct: 230 KLWNGE----YGWGL 240
>gi|148657484|ref|YP_001277689.1| protein kinase [Roseiflexus sp. RS-1]
gi|148569594|gb|ABQ91739.1| protein kinase [Roseiflexus sp. RS-1]
Length = 1242
Score = 44.7 bits (104), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 44/169 (26%), Positives = 74/169 (43%), Gaps = 14/169 (8%)
Query: 72 HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFA 131
H +A +AF+ GR G + + + E G VI + + + S F + +A
Sbjct: 881 HTDSIASVAFSPDGRYALSGGGDRVIRLWEIENGRVICKLEGHTLAVYSVVF-SPDGHYA 939
Query: 132 LASS---EVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDIS 188
L+ S +R+ + G EV +F V + V+ S + II+AG+ E +++W
Sbjct: 940 LSGSWDKTIRLWEVATGREVNRFDRHVNFVNSVAFSPDGRYIISAGWDET-IRLW----- 993
Query: 189 SKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKT 237
T ++ + V SP DG IL+ +E G WD+KT
Sbjct: 994 -DTTTGHEMYCLKDTDVIWSVCFSP---DGLYILSGSEDGSVKLWDIKT 1038
>gi|298248183|ref|ZP_06971988.1| WD40 repeat, subgroup [Ktedonobacter racemifer DSM 44963]
gi|297550842|gb|EFH84708.1| WD40 repeat, subgroup [Ktedonobacter racemifer DSM 44963]
Length = 433
Score = 44.7 bits (104), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 36/131 (27%), Positives = 61/131 (46%), Gaps = 3/131 (2%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
+LA G+ +G I D G + TG H GG+ LA++ G L G + +
Sbjct: 159 ILASGSIDGSIKLWDPSRGHLLHTLTG-HGGGVFALAWSPSGGLLVSGGQDSAIKLWDPQ 217
Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYV 161
G+++R + + S A + + AS++ VR+ L+ G +L F D PL V
Sbjct: 218 SGKLLRSLEGHGNAVRSLALGTDGQTLVSASTDQTVRLWDLQTGRLLLPFIDHPSPLYSV 277
Query: 162 SASDGAKIIIT 172
+ S +II++
Sbjct: 278 AMSPNHQIIVS 288
>gi|126658553|ref|ZP_01729700.1| Peptidase C14, caspase catalytic subunit p20 [Cyanothece sp. CCY0110]
gi|126620140|gb|EAZ90862.1| Peptidase C14, caspase catalytic subunit p20 [Cyanothece sp. CCY0110]
Length = 1523
Score = 44.7 bits (104), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 49/196 (25%), Positives = 91/196 (46%), Gaps = 12/196 (6%)
Query: 44 LLALGTSNGDILAV-DVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
+LA + N +I+ + ++ TG++ ++ H G+ ++F+ G++L +
Sbjct: 940 ILASSSINHNIIEIWNLETGKVI-RTLKEHNEGVQSVSFSFDGKTLASGSNDNTIKLWDV 998
Query: 103 EMGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEV--LKFSDDVGPL 158
+ GEVI K +PISS +F KI A S + V++ +LE GE + LK +D G +
Sbjct: 999 KTGEVIHTLKGHNEPISSVSFSPNGKILASGSDDNTVKLWNLETGELIRTLKGHNDSGFV 1058
Query: 159 QYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKG------PALSMRHSPVAIDCKNS 212
+S S +++ + G K+ + +I + + K S+ SP +
Sbjct: 1059 TSLSFSPNGQLLASGSNGSKNGSIILWNIKTGQIIKNLENREVTIWSVSFSPDGKSLASG 1118
Query: 213 PNGEDGTVILAVAESG 228
+D TV L E+G
Sbjct: 1119 SGSDDNTVKLWDIETG 1134
>gi|145515818|ref|XP_001443803.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124411203|emb|CAK76406.1| unnamed protein product [Paramecium tetraurelia]
Length = 2929
Score = 44.3 bits (103), Expect = 0.085, Method: Composition-based stats.
Identities = 59/270 (21%), Positives = 116/270 (42%), Gaps = 26/270 (9%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
LLA G+ + I+ + TG++ K G H + +AF+ G L + + ++
Sbjct: 2385 LLASGSEDQSIILWHIKTGKLITKLLG-HSDSVQSVAFSCDGSRLASASGDYLVKIWDTK 2443
Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYV 161
+G+ I E + F +I A A + +++ +G++++K +Q +
Sbjct: 2444 LGQEILELSEHNDSLQCVIFSPNGQILASAGGDYIIQLWDAVSGQDIMKLEGHTDAVQSI 2503
Query: 162 SASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMR----HSPVAIDCKNSPNGED 217
+ K++ +G + +++W DI++ T M+ H+ SPNGE
Sbjct: 2504 AFYPDGKVL-ASGSSDHSIRIW--DITTGT-------EMQKIDGHTGCVYSIAFSPNGE- 2552
Query: 218 GTVILAVAESGVAYSWDLKTVSQDEKTNPAKITVKLKKADADQQNSVNVKKSRTSIIAAR 277
+++ +E W+ K++ + ++ N + + DQQ S+ + SI R
Sbjct: 2553 --ALVSASEDNSILLWNTKSIKEMQQINGDTMWIYSVAQSPDQQ-SLALACIDYSI---R 2606
Query: 278 LNDL--EADGQKTITGLHDLESEAATASAQ 305
L DL E + QK I +E A +A Q
Sbjct: 2607 LWDLKSEKERQKLIGHSDQVEVIAFSADGQ 2636
Score = 41.6 bits (96), Expect = 0.49, Method: Composition-based stats.
Identities = 50/204 (24%), Positives = 91/204 (44%), Gaps = 12/204 (5%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
L+A G+S+ + DV +G+ K G H + +AF+ K L + +
Sbjct: 2343 LIASGSSDTSVRLWDVESGKEISKLEG-HLNWVCSVAFSPKEDLLASGSEDQSIILWHIK 2401
Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYV 161
G++I + + S AF C+ A AS + V+I + G+E+L+ S+ LQ V
Sbjct: 2402 TGKLITKLLGHSDSVQSVAFSCDGSRLASASGDYLVKIWDTKLGQEILELSEHNDSLQCV 2461
Query: 162 SASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVI 221
S +I+ +AG G+ +Q+W +S + + K H+ P DG V+
Sbjct: 2462 IFSPNGQILASAG-GDYIIQLWDA-VSGQDIMKLEG----HTDAVQSIAFYP---DGKVL 2512
Query: 222 LAVAESGVAYSWDLKTVSQDEKTN 245
+ + WD+ T ++ +K +
Sbjct: 2513 ASGSSDHSIRIWDITTGTEMQKID 2536
>gi|449545659|gb|EMD36630.1| hypothetical protein CERSUDRAFT_115662 [Ceriporiopsis subvermispora
B]
Length = 825
Score = 44.3 bits (103), Expect = 0.088, Method: Compositional matrix adjust.
Identities = 48/190 (25%), Positives = 89/190 (46%), Gaps = 26/190 (13%)
Query: 45 LALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEM 104
LA G+ + + + + L+G+ + H G+ LA++ GR+L +G+ +E+
Sbjct: 547 LASGSHDTNAIIWNYLSGDQH--AVLCHTAGIFDLAYSADGRTLATGDLDGVVHLWDAEL 604
Query: 105 GEVIREFKASEKPISSSAFLCEEKIFALASS--EVRILSLENGEEVLKFSDDVGPL-QYV 161
GE R F ++ I+ AF +K + LA S V + + E G+ ++ G + Q+
Sbjct: 605 GERHRSFPSAAGAITQLAFSPNDK-YVLAQSLFSVNVWNAETGDLHAALNEHTGIIWQFA 663
Query: 162 SASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRH---SPVAIDCKNSPNGE-- 216
+GA+++ + + +VW ++ G L + H SPV I + SP+GE
Sbjct: 664 IDREGARVVTAS--EDTTARVW-------SLETGDTLFINHEHTSPVWI-AQFSPDGERI 713
Query: 217 -----DGTVI 221
DGT +
Sbjct: 714 STVSSDGTFV 723
>gi|389742109|gb|EIM83296.1| WD40 repeat-containing protein [Stereum hirsutum FP-91666 SS1]
Length = 328
Score = 43.9 bits (102), Expect = 0.098, Method: Compositional matrix adjust.
Identities = 45/184 (24%), Positives = 86/184 (46%), Gaps = 21/184 (11%)
Query: 72 HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFA 131
H ++ + F G L ++ + ++ GE+++ + IS A+ + + A
Sbjct: 36 HTMSISSIKFNPDGNVLASAASDKLIKLWDTDSGEILKTLMGHTEGISDIAWSNDGEYLA 95
Query: 132 LASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDIS- 188
AS + +RI S+E G EV + V+ + + ++++ G+ E ++VW D++
Sbjct: 96 SASDDKTIRIWSMETGTEVKVLYGHTNFVFCVNYNPKSNLLVSGGFDET-VRVW--DVAR 152
Query: 189 SKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWD------LKTVSQDE 242
K++ PA S + VA + DGT+I++ A G+ WD LKT+ D+
Sbjct: 153 GKSLKVLPAHSDPVTAVAFN-------HDGTLIVSCAMDGLIRIWDADSGQCLKTLVDDD 205
Query: 243 KTNP 246
NP
Sbjct: 206 --NP 207
>gi|291386985|ref|XP_002709985.1| PREDICTED: WD repeat domain 43 [Oryctolagus cuniculus]
Length = 676
Score = 43.5 bits (101), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 58/262 (22%), Positives = 101/262 (38%), Gaps = 29/262 (11%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVG---------KKRRKE-----RGTLLLA 46
+W T + L E+ P + +CLA V KKR+ E T LLA
Sbjct: 40 VWETANNRLHQEY-VPSAHLSGTCTCLAWAPVRLQAKESPQRKKRKSEALGTSNQTDLLA 98
Query: 47 LGTSNGDILAVDVLTGEMKWKST-GRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMG 105
LGT+ G IL + GE+ K T G H + + + + L+ + E +
Sbjct: 99 LGTTVGSILLYSTVKGELHSKLTSGGHDNRVNCIQWHQDSGCLYSCSDDKHIVEWNVQTC 158
Query: 106 EVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSAS- 164
+V ++K +SS + K+ A +++ LE E F+ P+ + +
Sbjct: 159 KVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLETKEVYRHFTGHATPVSSLKFTT 218
Query: 165 ----------DGAK--IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNS 212
DG ++ ++ L VW+ +K N + ++ PV ID S
Sbjct: 219 IRPPNESQPFDGITGLYFLSGAVHDRLLNVWQVRSENKEKNAVMSFTVTDEPVYIDLTLS 278
Query: 213 PNGEDGTVILAVAESGVAYSWD 234
N E+ + V G + ++
Sbjct: 279 ENKEEPVKLAVVCRDGQVHLFE 300
>gi|195057884|ref|XP_001995342.1| GH23108 [Drosophila grimshawi]
gi|193899548|gb|EDV98414.1| GH23108 [Drosophila grimshawi]
Length = 626
Score = 43.5 bits (101), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 59/277 (21%), Positives = 123/277 (44%), Gaps = 32/277 (11%)
Query: 1 IWSTNDGSLLAEWKQPDGE-----PVVSYSCLACGFVGKKRRKERGTLL--------LAL 47
IW T +L E+ P+ + +++ +A K R+ ++ T + +AL
Sbjct: 30 IWDTEANTLKQEYT-PNLQLAGPCTALTWLTIAAQRPKKTRKTQQQTQISSDNDREYIAL 88
Query: 48 GTSNGDILAVDVLTGEM-KWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGE 106
GT G ++ + G++ + S H G + + + L+ VGT+ A
Sbjct: 89 GTLKGAVVLYSLTEGKIERTLSNESHHGAVTAIVYDAHNGHLYSVGTDCRALIWSVAEAR 148
Query: 107 VIREFKA-SEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQ----YV 161
I +F EKP++ FL + + A+AS +++I ++N E V + G + +
Sbjct: 149 CISDFSVGPEKPVNC-VFLAKSRTLAVASRQLKIYDVDNVELVETCTGHSGEINAINSFT 207
Query: 162 SASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVA--IDCKNSPNGEDGT 219
+ + ++T E+ + W+ I K NK ++ VA + C+ EDG
Sbjct: 208 CTKNNVEYVMTTAKMERVISFWK--IEKKGRNKASMCTLLMEDVAHSLTCEVR---EDGQ 262
Query: 220 VILA-VAESG--VAYSWDLKTVSQDEKTNPAKITVKL 253
+ +A V +G Y +++++S ++ P K+++++
Sbjct: 263 LRVASVTRNGNIHIYMLNVESLSTEKYIKP-KVSLQI 298
>gi|330805930|ref|XP_003290929.1| hypothetical protein DICPUDRAFT_81631 [Dictyostelium purpureum]
gi|325078927|gb|EGC32553.1| hypothetical protein DICPUDRAFT_81631 [Dictyostelium purpureum]
Length = 616
Score = 43.5 bits (101), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 55/220 (25%), Positives = 87/220 (39%), Gaps = 47/220 (21%)
Query: 29 CGFVGKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGR------HPGGLAGLAFA 82
CGF+ + KE+ L T + VD+ +S+G+ P G ++
Sbjct: 188 CGFIVTENEKEKTYKL----TQHQIAKQVDI-------QSSGKVFDLNLDPNGPYNFEYS 236
Query: 83 KKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSL 142
K GR L + G G S + + G+ + E + I + FL E +FALA + +
Sbjct: 237 KNGRYLLLAGARGHYSIVDWQRGKKLTE-RHLGGAIRDACFLHNETMFALAQKKYTYIYN 295
Query: 143 ENGEEVLKFSD-------DVGPLQY--VSASDGAKIIITAGYGEKHLQVWRCDISSKTVN 193
++G E+ D D P Y VSA++ KII DIS V
Sbjct: 296 QDGVELHWLKDHYDPKFIDFLPYHYLLVSATNKGKIIYE-------------DIS---VG 339
Query: 194 KGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSW 233
+ S H P++ C+N N V+ +G+ W
Sbjct: 340 QKVVESHYHGPLSAMCQNPQNA----VMNLGFSTGIVQMW 375
>gi|354567841|ref|ZP_08987008.1| serine/threonine protein kinase with WD40 repeats [Fischerella sp.
JSC-11]
gi|353541515|gb|EHC10982.1| serine/threonine protein kinase with WD40 repeats [Fischerella sp.
JSC-11]
Length = 688
Score = 43.5 bits (101), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 46/195 (23%), Positives = 87/195 (44%), Gaps = 21/195 (10%)
Query: 72 HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFA 131
H G + G+ F++ G++L + G IR FK + ++S AF + + A
Sbjct: 437 HSGWIWGVVFSRDGQTLASASADQTVKLWDLATGREIRTFKGHKAGVTSVAFSPDGQTLA 496
Query: 132 LASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISS 189
A + V++ ++E G+E+ G + V+ S + + +G +K +++W + ++
Sbjct: 497 TAGLDKTVKLWNVETGKEICTLVGHSGAIASVAFSPDGQ-TLASGSWDKTIKLWNVN-TA 554
Query: 190 KTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQDEKTNPAKI 249
K + HS + I SP DGT + + ++ WDL T K
Sbjct: 555 KNIRTFTG----HSDLIISVAFSP---DGTSLASGSKDKTIKLWDLAT---------GKA 598
Query: 250 TVKLKKADADQQNSV 264
T+ LK+ D+ NS+
Sbjct: 599 TLTLKE-HTDKVNSI 612
>gi|66814050|ref|XP_641204.1| WD40 repeat-containing protein [Dictyostelium discoideum AX4]
gi|60469228|gb|EAL67223.1| WD40 repeat-containing protein [Dictyostelium discoideum AX4]
Length = 622
Score = 43.1 bits (100), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 42/152 (27%), Positives = 62/152 (40%), Gaps = 26/152 (17%)
Query: 73 PGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFAL 132
P G + ++K GR L + G G S + + G+ E + PI + FL E +FAL
Sbjct: 233 PNGPYKIQYSKNGRYLLLAGAKGHFSMIDWQRGKKFTE-RHLASPIRDAVFLQNESMFAL 291
Query: 133 ASSEVRILSLENGEEVLKFSDDVGP--LQY-------VSASDGAKIIITAGYGEKHLQVW 183
A + + ++G E+ D P L Y VSA++ KII
Sbjct: 292 AQKKYTYIYNQDGVEMHWLKDHYDPKFLDYLPYHFLLVSATNKGKIIYE----------- 340
Query: 184 RCDISSKTVNKGPALSMRHSPVAIDCKNSPNG 215
DIS V + S P++ CKN N
Sbjct: 341 --DIS---VGQKVVESFYSGPLSAMCKNPQNA 367
>gi|288916826|ref|ZP_06411199.1| WD-40 repeat protein [Frankia sp. EUN1f]
gi|288351711|gb|EFC85915.1| WD-40 repeat protein [Frankia sp. EUN1f]
Length = 682
Score = 43.1 bits (100), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 49/195 (25%), Positives = 80/195 (41%), Gaps = 40/195 (20%)
Query: 3 STNDGSLLAEWKQPDGEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVLTG 62
S + G++LA PDG VVS G ++G + D+ G
Sbjct: 271 SGHHGAVLAVALTPDGRQVVS------------------------GGADGVVRLWDLADG 306
Query: 63 EMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSA 122
+ TG H G ++ +A GR + G +GM + E GE E P +S A
Sbjct: 307 TERAALTG-HTGEVSAVALTPDGRRVVTGGGDGMLRIWELETGEEQAHIPPGEIPATSRA 365
Query: 123 -FLCEEKIFALA-------------SSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAK 168
FL +++ ALA S +RI L G + + D P+ V+ + +
Sbjct: 366 PFLGPDRMSALAVAPDGRWAASGDLSGALRIWDLAGGRRLGCLTSDGVPVWAVAVTPDGR 425
Query: 169 IIITAGYGEKHLQVW 183
I+T G+G+ +++W
Sbjct: 426 RIVT-GHGDGTVRLW 439
>gi|126656712|ref|ZP_01727926.1| WD-40 repeat protein [Cyanothece sp. CCY0110]
gi|126621932|gb|EAZ92640.1| WD-40 repeat protein [Cyanothece sp. CCY0110]
Length = 1072
Score = 43.1 bits (100), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 54/198 (27%), Positives = 87/198 (43%), Gaps = 20/198 (10%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
LLA +G+I D L+G+ K+ TG H + + F+++ L +G+ E
Sbjct: 618 LLASAGRDGEIYIWDALSGKHKFTLTG-HDDSIREIVFSQESNLLISRSDDGIIKLWNLE 676
Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSL--ENGEEVLKFSDDVGPLQ 159
GE+I +EK I++ A + I A E V++ SL ++ +L ++
Sbjct: 677 KGELIEHPLNNEKGINTFALSSQLSIVATGQKEGKVKLWSLAQKDFPTLLALKSSQNSIE 736
Query: 160 YVS-ASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDG 218
+V SD K+II H +W D++ K +N P S P D + N
Sbjct: 737 FVQFTSDETKLIIVENNNNIH--IW--DMNKKIINSIP--STDQCPFG-DIQVKGN---- 785
Query: 219 TVILAVAE-SGVAYSWDL 235
ILAVA+ +G WD
Sbjct: 786 --ILAVAQINGTIQIWDF 801
>gi|434387970|ref|YP_007098581.1| WD40 repeat-containing protein [Chamaesiphon minutus PCC 6605]
gi|428018960|gb|AFY95054.1| WD40 repeat-containing protein [Chamaesiphon minutus PCC 6605]
Length = 1221
Score = 43.1 bits (100), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 40/143 (27%), Positives = 66/143 (46%), Gaps = 6/143 (4%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
LLA G NG I D+ TGE TG H G + GLAF+ G+ L ++ +
Sbjct: 613 LLATGDVNGQIYLWDIATGEPILCCTG-HAGWVHGLAFSHDGKMLASASSDLTVKLWDTF 671
Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYV 161
G +R F + + + AF + + A SS+ +R+ +G+ + S + V
Sbjct: 672 DGSCLRTFTGHHQRVRAIAFSPDSQSIASGSSDATIRLWDTRSGKCLKILSGHQSYIWSV 731
Query: 162 SAS-DGAKIIITAGYGEKHLQVW 183
+ S DG I +G +K +++W
Sbjct: 732 AFSPDGTT--IASGSEDKSVRLW 752
>gi|428320673|ref|YP_007118555.1| serine/threonine protein kinase with WD40 repeats [Oscillatoria
nigro-viridis PCC 7112]
gi|428244353|gb|AFZ10139.1| serine/threonine protein kinase with WD40 repeats [Oscillatoria
nigro-viridis PCC 7112]
Length = 689
Score = 43.1 bits (100), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 32/132 (24%), Positives = 65/132 (49%), Gaps = 3/132 (2%)
Query: 45 LALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEM 104
LA G+ + + DV TG + ++ H GG+ +AF++ G++L +
Sbjct: 508 LASGSDDKTVRLWDVKTGS-RLRTLSGHAGGVNAIAFSRDGQTLASGSDDKTVRLWNLNT 566
Query: 105 GEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVS 162
GEV R P+++ AF K A AS++ +R+ ++++G+ F G ++ ++
Sbjct: 567 GEVRRIITGHGGPVNAVAFSPNGKTVASASTDNTIRLSNVQDGKRTRTFKGHSGRVRTIA 626
Query: 163 ASDGAKIIITAG 174
S ++ +I+ G
Sbjct: 627 FSPDSRTLISGG 638
>gi|148706445|gb|EDL38392.1| mCG5393 [Mus musculus]
Length = 688
Score = 43.1 bits (100), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 58/261 (22%), Positives = 104/261 (39%), Gaps = 28/261 (10%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACG---------FVGKKRRKE-RGTL----LLA 46
+W T + L E+ P + +CLA KKR+ E GT LLA
Sbjct: 51 VWETANNRLHQEY-VPSAHLSGTCTCLAWAPARLQAKESHQRKKRKSEVTGTKDQADLLA 109
Query: 47 LGTSNGDILAVDVLTGEMKWKST-GRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMG 105
LGT+ G IL + GE+ K T G H + + + + L+ + E ++
Sbjct: 110 LGTAVGSILLYSTVRGELHSKLTSGGHENRVNCIQWHQDNDCLYSCSDDKYIVEWSTQTC 169
Query: 106 EVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGP---LQYVS 162
+V ++K +SS + K+ A +++ LE E F+ P L++ +
Sbjct: 170 KVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLETKEVYRHFTGHATPVSSLRFTT 229
Query: 163 -------ASDGAK--IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSP 213
SDG ++ ++ L VW+ +K + + ++ PV +D S
Sbjct: 230 IRPNESQPSDGITGLYFLSGAVHDRLLNVWQVRSENKEKSAVMSFTVTDEPVYVDLTLSE 289
Query: 214 NGEDGTVILAVAESGVAYSWD 234
N E+ + V G + ++
Sbjct: 290 NKEEPVKLAVVCRDGQVHLFE 310
>gi|37359726|dbj|BAC97841.1| mKIAA0007 protein [Mus musculus]
Length = 665
Score = 43.1 bits (100), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 58/261 (22%), Positives = 104/261 (39%), Gaps = 28/261 (10%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACG---------FVGKKRRKE-RGTL----LLA 46
+W T + L E+ P + +CLA KKR+ E GT LLA
Sbjct: 28 VWETANNRLHQEY-VPSAHLSGTCTCLAWAPARLQAKESHQRKKRKSEVTGTKDQADLLA 86
Query: 47 LGTSNGDILAVDVLTGEMKWKST-GRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMG 105
LGT+ G IL + GE+ K T G H + + + + L+ + E ++
Sbjct: 87 LGTAVGSILLYSTVRGELHSKLTSGGHENRVNCIQWHQDNDCLYSCSDDKYIVEWSTQTC 146
Query: 106 EVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGP---LQYVS 162
+V ++K +SS + K+ A +++ LE E F+ P L++ +
Sbjct: 147 KVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLETKEVYRHFTGHATPVSSLRFTT 206
Query: 163 -------ASDGAK--IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSP 213
SDG ++ ++ L VW+ +K + + ++ PV +D S
Sbjct: 207 IRPNESQPSDGITGLYFLSGAVHDRLLNVWQVRSENKEKSAVMSFTVTDEPVYVDLTLSE 266
Query: 214 NGEDGTVILAVAESGVAYSWD 234
N E+ + V G + ++
Sbjct: 267 NKEEPVKLAVVCRDGQVHLFE 287
>gi|196000408|ref|XP_002110072.1| hypothetical protein TRIADDRAFT_20616 [Trichoplax adhaerens]
gi|190588196|gb|EDV28238.1| hypothetical protein TRIADDRAFT_20616 [Trichoplax adhaerens]
Length = 1700
Score = 43.1 bits (100), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 41/100 (41%), Gaps = 23/100 (23%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVV-----SYSCLACGFVGKKRRKERGTLLLALGTSNGDIL 55
IW + + L ++K P P S C ACGF +NG +
Sbjct: 682 IWDIDTYAQLFDFKAPGEVPCAICYHPSQQCFACGF------------------TNGTVR 723
Query: 56 AVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNG 95
V T + + + HPG + GLAF+ GR ++ GT G
Sbjct: 724 TFHVGTSSLLAEHSKNHPGRIIGLAFSPSGRFMYSAGTLG 763
>gi|75911009|ref|YP_325305.1| WD-40 repeat-containing protein [Anabaena variabilis ATCC 29413]
gi|75704734|gb|ABA24410.1| WD-40 repeat-containing protein [Anabaena variabilis ATCC 29413]
Length = 1477
Score = 43.1 bits (100), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 44/168 (26%), Positives = 86/168 (51%), Gaps = 13/168 (7%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
+ G +GDI + L + K K+ G + L+ + G+ + G + +A ++ +
Sbjct: 817 FIVTGGEDGDI-NLWSLQEKQKIKNWMAEQGAIYSLSISSDGQYIATAGKDRIA-KLWNL 874
Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALA--SSEVRILSLENGEEVLKFSDDVGPLQYV 161
+G+ + EFK+ S +F + ++ A A S+ R+ L +GE++ +F VG ++ V
Sbjct: 875 VGQKLSEFKSPNGSFRSISFSPDGRLLATAGDDSKARLWKL-SGEQLAEFKGHVGWVRDV 933
Query: 162 SASDGAKIIITAGYGEKHLQVWRCDISSKTV-----NKGPALSMRHSP 204
S S K++ TAG + +++W +S K + ++G LS+R SP
Sbjct: 934 SFSPDGKLLATAG-DDGKVRLWH--LSGKQLIEFKGHQGGVLSVRFSP 978
Score = 38.5 bits (88), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 40/133 (30%), Positives = 60/133 (45%), Gaps = 14/133 (10%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
L AL I V+ LTG H G + G+ F+ GR + G++G +
Sbjct: 702 LFALQKILDKIFEVNQLTG---------HQGWVRGIRFSPNGRLIVTSGSDGTV-RIWDY 751
Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYV 161
+G+ EFKA I S F + K+ A AS + VRI +L E+L ++ V
Sbjct: 752 LGKQQIEFKAHWGSILSVNFSPDSKLIATASDDGMVRIWNLLG--EMLSEYKHQNVIRDV 809
Query: 162 SASDGAKIIITAG 174
+ S +K I+T G
Sbjct: 810 AFSPDSKFIVTGG 822
>gi|241896997|ref|NP_783570.1| WD repeat-containing protein 43 [Mus musculus]
gi|158518600|sp|Q6ZQL4.2|WDR43_MOUSE RecName: Full=WD repeat-containing protein 43
gi|74214553|dbj|BAE31123.1| unnamed protein product [Mus musculus]
Length = 677
Score = 42.7 bits (99), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 58/261 (22%), Positives = 104/261 (39%), Gaps = 28/261 (10%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACG---------FVGKKRRKE-RGTL----LLA 46
+W T + L E+ P + +CLA KKR+ E GT LLA
Sbjct: 40 VWETANNRLHQEY-VPSAHLSGTCTCLAWAPARLQAKESHQRKKRKSEVTGTKDQADLLA 98
Query: 47 LGTSNGDILAVDVLTGEMKWKST-GRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMG 105
LGT+ G IL + GE+ K T G H + + + + L+ + E ++
Sbjct: 99 LGTAVGSILLYSTVRGELHSKLTSGGHENRVNCIQWHQDNDCLYSCSDDKYIVEWSTQTC 158
Query: 106 EVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGP---LQYVS 162
+V ++K +SS + K+ A +++ LE E F+ P L++ +
Sbjct: 159 KVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLETKEVYRHFTGHATPVSSLRFTT 218
Query: 163 -------ASDGAK--IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSP 213
SDG ++ ++ L VW+ +K + + ++ PV +D S
Sbjct: 219 IRPNESQPSDGITGLYFLSGAVHDRLLNVWQVRSENKEKSAVMSFTVTDEPVYVDLTLSE 278
Query: 214 NGEDGTVILAVAESGVAYSWD 234
N E+ + V G + ++
Sbjct: 279 NKEEPVKLAVVCRDGQVHLFE 299
>gi|195492305|ref|XP_002093934.1| GE20481 [Drosophila yakuba]
gi|194180035|gb|EDW93646.1| GE20481 [Drosophila yakuba]
Length = 1229
Score = 42.7 bits (99), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 23/69 (33%), Positives = 38/69 (55%), Gaps = 2/69 (2%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRH-PGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
+A+GTS+G IL DV T ++W +H G +A LAF L + G+ + + +
Sbjct: 131 FIAIGTSHGHILNFDV-TQTLRWAHQDKHGQGAVASLAFNADSTRLLAGYSRGLVAMLDT 189
Query: 103 EMGEVIREF 111
+ G+V+RE
Sbjct: 190 QTGDVLREL 198
>gi|194867662|ref|XP_001972124.1| GG14056 [Drosophila erecta]
gi|190653907|gb|EDV51150.1| GG14056 [Drosophila erecta]
Length = 1229
Score = 42.7 bits (99), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 23/69 (33%), Positives = 38/69 (55%), Gaps = 2/69 (2%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRH-PGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
+A+GTS+G IL DV T ++W +H G +A LAF L + G+ + + +
Sbjct: 131 FIAIGTSHGHILNFDV-TQTLRWAHQDKHGQGAVASLAFNADSTRLLAGYSRGLVAMLDT 189
Query: 103 EMGEVIREF 111
+ G+V+RE
Sbjct: 190 QTGDVLREL 198
>gi|320593547|gb|EFX05956.1| mRNA splicing factor [Grosmannia clavigera kw1407]
Length = 542
Score = 42.7 bits (99), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 40/185 (21%), Positives = 76/185 (41%), Gaps = 18/185 (9%)
Query: 9 LLAEWKQPDGEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKS 68
L+ WK P G+PV + E G +LL+ G+++ + DV ++
Sbjct: 238 LVHVWKSPTGKPVTALRLFP----------ESGHVLLS-GSADNTVRVWDVYHERELLRT 286
Query: 69 TGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKP--ISSSAFLCE 126
H L L+F++ GR + +E G+ IR F+ + P ++ + +
Sbjct: 287 YAGHSRALTDLSFSRDGRQFISGSHDRFVKLWDTETGQCIRRFRTGKTPHCLAFNPSVEG 346
Query: 127 EKIFALASSEVRILSLE----NGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQV 182
F S +IL + + E V + + + ++ D A+ +T +K L+V
Sbjct: 347 AHEFLTGMSNNKILQWDSRAGDNEPVQDYDHHLAAINTITFVDEARRFMTTS-DDKSLRV 405
Query: 183 WRCDI 187
W +I
Sbjct: 406 WDYNI 410
>gi|51259783|gb|AAH79913.1| Wrd43 protein, partial [Mus musculus]
Length = 592
Score = 42.7 bits (99), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 44/204 (21%), Positives = 84/204 (41%), Gaps = 13/204 (6%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKST-GRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
LLALGT+ G IL + GE+ K T G H + + + + L+ + E +
Sbjct: 11 LLALGTAVGSILLYSTVRGELHSKLTSGGHENRVNCIQWHQDNDCLYSCSDDKYIVEWST 70
Query: 103 EMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGP---LQ 159
+ +V ++K +SS + K+ A +++ LE E F+ P L+
Sbjct: 71 QTCKVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLETKEVYRHFTGHATPVSSLR 130
Query: 160 YVS-------ASDGAK--IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCK 210
+ + SDG ++ ++ L VW+ +K + + ++ PV +D
Sbjct: 131 FTTIRPNESQPSDGITGLYFLSGAVHDRLLNVWQVRSENKEKSAVMSFTVTDEPVYVDLT 190
Query: 211 NSPNGEDGTVILAVAESGVAYSWD 234
S N E+ + V G + ++
Sbjct: 191 LSENKEEPVKLAVVCRDGQVHLFE 214
>gi|425460715|ref|ZP_18840196.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9808]
gi|389826568|emb|CCI22821.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9808]
Length = 1108
Score = 42.4 bits (98), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 51/200 (25%), Positives = 83/200 (41%), Gaps = 26/200 (13%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
++A+ +G I +D +++ +T + +AF + + G NG ++ S+
Sbjct: 842 IIAIANKDGQITLLDSQGKKIREFTTKMR--SIYSIAFHPDSNQIAITGRNGKV-QIWSQ 898
Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFS---DDVGPL 158
G ++REF AS+ PI S AF E +SE V+ L N L S DD
Sbjct: 899 KGTMLREFTASQVPIYSLAFNGEGTAIITGTSEGKVQYWYLSNHRPQLINSWTVDDSIIY 958
Query: 159 QYVSASDGAKIIITAGYGEKHLQVWRCD---ISSKTVNKGPALSMRHSPVAIDCKNSPNG 215
V + D KI TA G+ +++W + + P + SP
Sbjct: 959 DLVFSPDHQKIA-TATRGK--IKIWDLQGNLLKEIKTDSFPVYGVSFSP----------- 1004
Query: 216 EDGTVILAVAESGVAYSWDL 235
DG I A++ G A WD+
Sbjct: 1005 -DGEKIAAISRDGTARRWDI 1023
>gi|393247627|gb|EJD55134.1| WD40 repeat-like protein [Auricularia delicata TFB-10046 SS5]
Length = 353
Score = 42.4 bits (98), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 46/184 (25%), Positives = 83/184 (45%), Gaps = 21/184 (11%)
Query: 72 HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFA 131
H ++ + F+ G L +G+ + GE++R FK K IS A+ + A
Sbjct: 59 HTMSISAIKFSPDGSFLASSAGDGLVKLWDAYTGEILRTFKGHVKGISDIAWARDSLYLA 118
Query: 132 LASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISS 189
AS + VRI +++ G V + + V+ + + ++ +G ++ +++W D++
Sbjct: 119 SASDDKTVRIWNVQLGSTVKILTGHTSQVMCVNFNPQSN-LLASGSVDETVRIW--DVAR 175
Query: 190 KTVNKGPALSMRHSPV-AIDCKNSPNGEDGTVILAVAESGVAYSWD------LKTVSQDE 242
+ LS PV A+D DGT+I++ A G+ WD LKT+ D
Sbjct: 176 GKCMR--TLSAHSDPVTAVDFN-----RDGTMIVSCAYDGLIRIWDTASGQCLKTIVDD- 227
Query: 243 KTNP 246
NP
Sbjct: 228 -ANP 230
>gi|444914132|ref|ZP_21234277.1| hypothetical protein D187_06447 [Cystobacter fuscus DSM 2262]
gi|444715066|gb|ELW55939.1| hypothetical protein D187_06447 [Cystobacter fuscus DSM 2262]
Length = 2276
Score = 42.4 bits (98), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 49/190 (25%), Positives = 86/190 (45%), Gaps = 11/190 (5%)
Query: 45 LALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEM 104
LA +G + D +G K +S H G + ++++K GR L G +G +
Sbjct: 1342 LASAGEDGTVRLWDAESGR-KLRSLSGHKGWVRSVSWSKDGRRLASAGDDGSVRLWDTAS 1400
Query: 105 GEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVS 162
G ++R + + S ++ + + A A + VR+ + E+G E+ G + VS
Sbjct: 1401 GRMLRSLSGEKGRVWSVSWSADGRRLASAGDDGTVRLWNAESGHELHSLPGHKGMIFSVS 1460
Query: 163 ASDGAKIIITAGYGEKHLQVWRC----DISSKTVNKGPALSMRHSPVAIDCKNSPNGEDG 218
S ++ + G G HL W ++ S + +KG S+ S A + + +G DG
Sbjct: 1461 WSADGRLASSGGDGTVHL--WDAESGHELHSLSGHKGWVFSVSWS--ADGRRLASSGRDG 1516
Query: 219 TVILAVAESG 228
TV L A+SG
Sbjct: 1517 TVRLWDAQSG 1526
>gi|328790602|ref|XP_392867.4| PREDICTED: WD repeat-containing protein 43-like [Apis mellifera]
Length = 582
Score = 42.4 bits (98), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 59/270 (21%), Positives = 112/270 (41%), Gaps = 48/270 (17%)
Query: 34 KKRRK-------ERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAG------LA 80
K+RRK ++G ++A+G++NG I D T + + G H G + +
Sbjct: 71 KRRRKSISEESIQKG--IIAMGSTNGKITLYDTTTSSVTAQLDG-HSGSVTAATWSENVG 127
Query: 81 FAKKGRSLHVVG----TNGMASEMKSEMGEVIREFKASEKPISSSA--FLCEEKIFALAS 134
F H++ NG+ + KS G K + +S+ L E++
Sbjct: 128 FITASDDHHIIQWNLQENGVKCKWKSGKG------KTTSLAVSTDGNNLLSGERV----- 176
Query: 135 SEVRILSLENGEEVLKFS---DDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKT 191
V+ L + + F+ + V L + + G +I+ YG+ +L VW D
Sbjct: 177 --VKWWDLNTKQLIRTFTGHANQVTCLHTIKMTSGNNYVISGAYGDGYLSVWALDEQRNE 234
Query: 192 VNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQDEKTNPAKITV 251
+L+++ P+++ S N + +++ V SG A L + ++ P K ++
Sbjct: 235 RTSVASLALQDDPISVSANVSANSQ--VMVVVVTRSGQA---QLFQYQPNGRSKPLKPSL 289
Query: 252 KLKKA-DADQQNSVNVKKSRTSIIAARLND 280
+ A D Q++ V + SI+ A L D
Sbjct: 290 NIAVASDISQKDEVQ----QISILNAMLTD 315
>gi|348574548|ref|XP_003473052.1| PREDICTED: LOW QUALITY PROTEIN: WD repeat-containing protein
43-like [Cavia porcellus]
Length = 675
Score = 42.4 bits (98), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 45/205 (21%), Positives = 83/205 (40%), Gaps = 14/205 (6%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWK-STGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
LLALGT+ G IL + GE+ K TG H + + + + L+ + E +
Sbjct: 96 LLALGTAVGSILLYSTVKGELHSKLVTGGHDNRVNCIQWHQDSDCLYSCSDDKHIVEWNT 155
Query: 103 EMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGP---LQ 159
+V ++K ++S + K+ A +++ LE E F+ P L+
Sbjct: 156 RTCKVKCKWKGDNSSVTSLCISPDGKMLLSAGRTIKLWVLETKEVYRHFTGHATPVSSLK 215
Query: 160 YVSA--------SDGAK--IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDC 209
+ + SDG ++ ++ L VW+ +K N + ++ PV ID
Sbjct: 216 FTTVRPPNESQTSDGITGLYFLSGAVHDRLLNVWQVRSENKEKNAVMSFTVTDEPVYIDL 275
Query: 210 KNSPNGEDGTVILAVAESGVAYSWD 234
S N E+ + V G + ++
Sbjct: 276 TLSENKEEPVKLAVVCRDGQVHLFE 300
>gi|299470075|emb|CBN79252.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 570
Score = 42.4 bits (98), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 36/143 (25%), Positives = 61/143 (42%), Gaps = 5/143 (3%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
++ G +G I + D TG + W H GG+ L + R L G G +
Sbjct: 339 MMLTGWRDGQIRSHDADTGHLLWSIDNAHAGGVTSLVVSHNERFLVTGGMEGELRVWELR 398
Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSEVRIL---SLENGEEVLKFSDDVGPLQY 160
+++ K + ++S A C++ + AL+ S R L L + +V S +G +
Sbjct: 399 SRDLVSHLKQHGQRVTSLALFCDD-VHALSVSRDRSLICWDLRSESQVSHHSQRMGGVNA 457
Query: 161 VSASDGAKIIITAGYGEKHLQVW 183
V+ S +IT G EK + W
Sbjct: 458 VALSQDETTVITMGQ-EKRVTYW 479
>gi|390562646|ref|ZP_10244834.1| hypothetical protein NITHO_3120008 [Nitrolancetus hollandicus Lb]
gi|390172774|emb|CCF84146.1| hypothetical protein NITHO_3120008 [Nitrolancetus hollandicus Lb]
Length = 320
Score = 42.4 bits (98), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 35/132 (26%), Positives = 59/132 (44%), Gaps = 3/132 (2%)
Query: 48 GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
G+ +G I DV TG + G HP + LA++ GR L G + E GE
Sbjct: 95 GSGDGLIRIWDVETGGEVARLEG-HPDWVTALAYSPDGRFLLSGGRDATVRLWDVEQGEE 153
Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEV--RILSLENGEEVLKFSDDVGPLQYVSASD 165
+ FK + I+S F + S+++ R+ LE +++ F + P++ + S
Sbjct: 154 VHLFKGHTRLITSIGFSPDGTRIVSTSADLGTRLWDLETRKQLRHFMGHLAPVRAAAFSP 213
Query: 166 GAKIIITAGYGE 177
+ I+T G E
Sbjct: 214 DGQRIVTGGQDE 225
>gi|195588248|ref|XP_002083870.1| GD13124 [Drosophila simulans]
gi|194195879|gb|EDX09455.1| GD13124 [Drosophila simulans]
Length = 1229
Score = 42.4 bits (98), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 23/69 (33%), Positives = 37/69 (53%), Gaps = 2/69 (2%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRH-PGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
+A+GTS+G IL DV T ++W +H G +A LAF L + G+ + + +
Sbjct: 131 FIAIGTSHGHILNFDV-TQTLRWAHQDKHGQGAVASLAFNADSTRLLTGFSRGLVAMLDT 189
Query: 103 EMGEVIREF 111
G+V+RE
Sbjct: 190 HTGDVLREL 198
>gi|302695713|ref|XP_003037535.1| hypothetical protein SCHCODRAFT_47625 [Schizophyllum commune H4-8]
gi|300111232|gb|EFJ02633.1| hypothetical protein SCHCODRAFT_47625 [Schizophyllum commune H4-8]
Length = 361
Score = 42.4 bits (98), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 45/192 (23%), Positives = 82/192 (42%), Gaps = 20/192 (10%)
Query: 72 HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFA 131
H ++ + F+ G+ L + E+GE++ K E+ IS A+ + + A
Sbjct: 51 HARSISAVKFSPDGKLLASCAADKTIKIWDPEIGEIVHTLKGHEEGISDIAWSNDGQYIA 110
Query: 132 LASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVW------ 183
AS + VRI S+ + +EV + + V+ + +I++ G+ E +++W
Sbjct: 111 SASDDKTVRIWSVADAKEVKRLIGHTNFVFCVAYNPAGNLIVSGGFDET-IRIWDASKGE 169
Query: 184 --RCDISSKTVNKGPALSMRHSPVAIDCKNSPN-GEDGTVILAVAESGVAYSWD------ 234
R + + +R P D S DGT+I++ A G+ WD
Sbjct: 170 YGRLCTCREADDLYAGKFVRFMPAHSDPVTSVGFSHDGTMIVSCAMDGLIRIWDTESGQC 229
Query: 235 LKTVSQDEKTNP 246
LKT+ D+ NP
Sbjct: 230 LKTLVDDD--NP 239
>gi|389750241|gb|EIM91412.1| WD40 repeat-like protein [Stereum hirsutum FP-91666 SS1]
Length = 540
Score = 42.4 bits (98), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 39/154 (25%), Positives = 69/154 (44%), Gaps = 17/154 (11%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
LA SNG+I D+ +GE+KW STG H + L F+ G L V + + S
Sbjct: 20 FLAATFSNGNIHLWDIQSGEIKWTSTG-HSMTVKSLRFSPNGTML-VSSSRDCTVRVWSV 77
Query: 104 MG--EVIREFKASEKPISSSAFLCEEKIFALA-----SSEVRILSLENGEEVLK------ 150
+ E I+ K + +SS+ FL E + ++ S++ I L + E K
Sbjct: 78 LDGTETIQPCKHHRRYVSSATFLSNESVIYISWGHDKGSDICIWDLYSDEVTKKVKNVLD 137
Query: 151 -FSDDVGPLQYVSASDGAKIIITAGYGEKHLQVW 183
+ P+ +S + +++A Y + +++W
Sbjct: 138 LYIGHTKPIYSISFAPDETRVVSASYDDT-IRIW 170
>gi|218506025|gb|ACK77654.1| RE12102p [Drosophila melanogaster]
Length = 1096
Score = 42.4 bits (98), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 23/69 (33%), Positives = 37/69 (53%), Gaps = 2/69 (2%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRH-PGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
+A+GTS+G IL DV T ++W +H G +A LAF L + G+ + + +
Sbjct: 131 FIAIGTSHGHILNFDV-TQTLRWAHQDKHGQGAVASLAFNADSTRLLAGFSRGLVAMLDT 189
Query: 103 EMGEVIREF 111
G+V+RE
Sbjct: 190 HTGDVLREL 198
>gi|75911051|ref|YP_325347.1| WD-40 repeat-containing serine/threonine protein kinase [Anabaena
variabilis ATCC 29413]
gi|75704776|gb|ABA24452.1| serine/threonine protein kinase with WD40 repeats [Anabaena
variabilis ATCC 29413]
Length = 676
Score = 42.4 bits (98), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 47/211 (22%), Positives = 90/211 (42%), Gaps = 21/211 (9%)
Query: 72 HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFA 131
H G + +AF+ G++L G + G IR K + ++S AF + K A
Sbjct: 432 HSGWVWAIAFSPDGKTLVSAGADKTIKLWNLATGTEIRTLKGHSQGVASVAFSPDGKTLA 491
Query: 132 LASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISS 189
S + +++ +L G+E+ S+ + V+ S K + +G +K +++W
Sbjct: 492 SGSLDKTIKLWNLATGKEIRTLSEHSNVVANVAFSPDGK-TLASGSWDKTIKLWNL---- 546
Query: 190 KTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQDEKTNPAKI 249
T NK HS + + +P DG + + ++ W+L A
Sbjct: 547 -TTNKVFRTLEGHSDLVMSVVFNP---DGKTLASASKDKTIRLWNL----------AAGK 592
Query: 250 TVKLKKADADQQNSVNVKKSRTSIIAARLND 280
T++ K +D+ NSV ++++A+ ND
Sbjct: 593 TIRTLKGHSDKVNSVVYVPRNSTVLASGSND 623
>gi|24659383|ref|NP_648048.1| CG10144 [Drosophila melanogaster]
gi|7295341|gb|AAF50660.1| CG10144 [Drosophila melanogaster]
Length = 1229
Score = 42.0 bits (97), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 23/69 (33%), Positives = 37/69 (53%), Gaps = 2/69 (2%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRH-PGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
+A+GTS+G IL DV T ++W +H G +A LAF L + G+ + + +
Sbjct: 131 FIAIGTSHGHILNFDV-TQTLRWAHQDKHGQGAVASLAFNADSTRLLAGFSRGLVAMLDT 189
Query: 103 EMGEVIREF 111
G+V+RE
Sbjct: 190 HTGDVLREL 198
>gi|145476893|ref|XP_001424469.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124391533|emb|CAK57071.1| unnamed protein product [Paramecium tetraurelia]
Length = 784
Score = 42.0 bits (97), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 67/278 (24%), Positives = 112/278 (40%), Gaps = 48/278 (17%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
LLA G+ + I D+ TGE K++ G H G ++ ++F+ G +L +
Sbjct: 169 LLASGSRDKSIRLWDIKTGEEKYRLEG-HNGYVSTISFSFDGITLASGSGDKTIRLWDII 227
Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSE----VRILSLENGEEVLKFSDDVGPLQ 159
G+ I+ + +SS F IF LAS +R+ + + G++ +F +
Sbjct: 228 TGKEIQRLEGHNGYVSSVCF--SPDIFTLASCGEDKCIRLWNAKTGQQASQFFGHTHQVY 285
Query: 160 YVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALS--MRHSPVAIDCKNSPNGED 217
+ S ++ +G +K +++W V +G +S HS I SP D
Sbjct: 286 SICFSPNGN-LLASGSDDKSIRLW-------DVKEGQQISKLQGHSGGVISVCFSP---D 334
Query: 218 GTVILAVAESGVAYSWDLKTVSQDEK-----------------TNPA--------KITVK 252
GT IL+ + WD+K+ Q K TN A +I
Sbjct: 335 GTTILSGSADQSIRLWDVKSGQQQSKLIGHKCGVYSVCFSQKGTNVASGSYDQSIRIWET 394
Query: 253 LKKADADQQNSVNVKKSRTSIIAARLND---LEADGQK 287
+K+ D Q NS+ V +S +N +AD QK
Sbjct: 395 IKRFDKKQINSLKVSRSEKKTNFTDINQNIHFKADQQK 432
>gi|442318640|ref|YP_007358661.1| putative lipoprotein [Myxococcus stipitatus DSM 14675]
gi|441486282|gb|AGC42977.1| putative lipoprotein [Myxococcus stipitatus DSM 14675]
Length = 366
Score = 42.0 bits (97), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 32/108 (29%), Positives = 50/108 (46%), Gaps = 12/108 (11%)
Query: 48 GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
GT GD++ + + TG +W+ TG P L L A G +++VV +G + + G+V
Sbjct: 94 GTVAGDLVRLALSTGTEQWRLTGEPP-VLTPLVMAPGGTAVYVVAPDGAVRALAVDTGKV 152
Query: 108 IREFKASEKP----------ISSSAFLCEEKIFALASSEVRILSLENG 145
R A KP + S + AL + +R LSLE+G
Sbjct: 153 -RWRVAPPKPDEAHLDTRRGLPSPVLAGGRLVVALGDAGLRALSLEDG 199
>gi|78779428|ref|YP_397540.1| WD-40 repeat-containing protein [Prochlorococcus marinus str. MIT
9312]
gi|78712927|gb|ABB50104.1| WD-40 repeat-containing protein [Prochlorococcus marinus str. MIT
9312]
Length = 357
Score = 42.0 bits (97), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 48/188 (25%), Positives = 78/188 (41%), Gaps = 17/188 (9%)
Query: 22 VSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAF 81
VS +ACG+ K ++ +G G I A + TG++ WK H GGL +A
Sbjct: 21 VSDYAIACGWALKGKQ-------FIVGDVAGGIFAFEGDTGKIIWKKENTHSGGLLAMAI 73
Query: 82 AKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILS 141
+G G +G G+VI+ + + + + A+ASS+ +
Sbjct: 74 HPEGEIFVTSGQDGNVQICNCHEGKVIKTLDLGKGWVEHLKWSNDGLFLAIASSKKVYVF 133
Query: 142 LENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVN-----KGP 196
E GEE D + ++ S+ + + TA YG +V DI + N +G
Sbjct: 134 NEIGEEKWISEDHPSTVSAITWSNKNE-LATACYG----RVTFFDIVNNKTNQKLEWQGS 188
Query: 197 ALSMRHSP 204
+SM SP
Sbjct: 189 LVSMELSP 196
>gi|240849657|gb|ACS54286.1| AT14809p [Drosophila melanogaster]
Length = 1229
Score = 42.0 bits (97), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 23/69 (33%), Positives = 37/69 (53%), Gaps = 2/69 (2%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRH-PGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
+A+GTS+G IL DV T ++W +H G +A LAF L + G+ + + +
Sbjct: 131 FIAIGTSHGHILNFDV-TQTLRWAHQDKHGQGAVASLAFNADSTRLLAGFSRGLVAMLDT 189
Query: 103 EMGEVIREF 111
G+V+RE
Sbjct: 190 HTGDVLREL 198
>gi|195337941|ref|XP_002035584.1| GM13838 [Drosophila sechellia]
gi|194128677|gb|EDW50720.1| GM13838 [Drosophila sechellia]
Length = 1229
Score = 42.0 bits (97), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 23/69 (33%), Positives = 37/69 (53%), Gaps = 2/69 (2%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRH-PGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
+A+GTS+G IL DV T ++W +H G +A LAF L + G+ + + +
Sbjct: 131 FIAIGTSHGHILNFDV-TQTLRWAHQDKHGQGAVASLAFNADSTRLLAGFSRGLVAMLDT 189
Query: 103 EMGEVIREF 111
G+V+RE
Sbjct: 190 HTGDVLREL 198
>gi|440696814|ref|ZP_20879262.1| trypsin [Streptomyces turgidiscabies Car8]
gi|440280845|gb|ELP68526.1| trypsin [Streptomyces turgidiscabies Car8]
Length = 1389
Score = 42.0 bits (97), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 36/147 (24%), Positives = 63/147 (42%), Gaps = 3/147 (2%)
Query: 45 LALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEM 104
LA G+ I D+ T +++ TG H G+ L F+ GR+L G +
Sbjct: 1159 LATGSDTKYIRLWDLATRKIRRTLTGHH-DGVNALEFSPDGRTLATAGGDSRVLIWDLAT 1217
Query: 105 GEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVS 162
G+V + P+++ AF + ++ A AS + R+ G + VG L +
Sbjct: 1218 GKVRVTLTGHDAPVNALAFSPDGRVLATASDDGTARVWDAVTGRARSILTKHVGWLSALD 1277
Query: 163 ASDGAKIIITAGYGEKHLQVWRCDISS 189
S + + TAG + +++W D S
Sbjct: 1278 FSPDGRTLATAGGYDGTVRLWDADTGS 1304
>gi|390562645|ref|ZP_10244833.1| hypothetical protein NITHO_3120007 [Nitrolancetus hollandicus Lb]
gi|390172773|emb|CCF84145.1| hypothetical protein NITHO_3120007 [Nitrolancetus hollandicus Lb]
Length = 305
Score = 42.0 bits (97), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 40/174 (22%), Positives = 75/174 (43%), Gaps = 16/174 (9%)
Query: 72 HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEK-IF 130
H G+ AF+ GR + G + E G+ IR+F ++S F + + +F
Sbjct: 14 HTDGVEQAAFSPDGRQIASAGWDATVRLWDVETGKEIRQFIGHPAGVNSVVFSRDGRSLF 73
Query: 131 ALASSE-------VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVW 183
+ S+E +R +E GEEVL+ + + S K+ I+ G + +++W
Sbjct: 74 SCGSAEGGTSDRTIRKWDIETGEEVLRLRGHTAGVNAIDCSRDGKLAISGGM-DGSVRLW 132
Query: 184 RCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKT 237
+ +T+ H+ V SP+ + I AE+ V + WD+++
Sbjct: 133 DLERGVETLQ----FLGSHNFVTTSVAFSPDAK--RAISGSAEA-VVWLWDVES 179
Score = 40.4 bits (93), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 50/195 (25%), Positives = 87/195 (44%), Gaps = 17/195 (8%)
Query: 48 GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
GTS+ I D+ TGE + G H G+ + ++ G+ G +G E G
Sbjct: 81 GTSDRTIRKWDIETGEEVLRLRG-HTAGVNAIDCSRDGKLAISGGMDGSVRLWDLERGVE 139
Query: 108 IREFKASEKPISSS-AFLCEEK--IFALASSEVRILSLENGEEVLKFSDDVGPLQYVSAS 164
+F S +++S AF + K I A + V + +E+GEE+ +F VG + V+ +
Sbjct: 140 TLQFLGSHNFVTTSVAFSPDAKRAISGSAEAVVWLWDVESGEEIRRFRGHVGVITSVAYA 199
Query: 165 DGAKIIITAGYGEKHLQVWRCDISSK----TVNKGPALSMRHSPV---AIDCKNSPNGED 217
+ ++ G ++ ++ W + + T + G S+ +SPV A+ C D
Sbjct: 200 PDGRTALSGGL-DRSVRQWDIETGKERRRLTGHIGDVTSVAYSPVDPRALSCAG-----D 253
Query: 218 GTVILAVAESGVAYS 232
GT+ L E G S
Sbjct: 254 GTIRLWNLEDGTEIS 268
>gi|201027430|ref|NP_001032880.2| WD repeat domain 43 [Rattus norvegicus]
gi|149050703|gb|EDM02876.1| rCG61879 [Rattus norvegicus]
Length = 674
Score = 42.0 bits (97), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 43/204 (21%), Positives = 82/204 (40%), Gaps = 13/204 (6%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWK-STGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
LLALGT+ G IL + GE+ K +G H + + + + L+ + E +
Sbjct: 96 LLALGTAVGSILLYSTVKGELHSKLISGGHEKRVNCIQWHQDNDCLYSCSDDKYIVEWST 155
Query: 103 EMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVS 162
+M V ++K +SS + K+ A +++ LE E F+ P+ +
Sbjct: 156 QMCRVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLETKEVYRHFTGHATPVSSLR 215
Query: 163 AS----------DGAK--IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCK 210
+ DG ++ ++ L VW+ +K + + ++ PV ID
Sbjct: 216 FTTIRPNESQPFDGVTGLYFLSGAVHDRLLNVWQVRSENKEKSAVMSFTVTDEPVYIDLT 275
Query: 211 NSPNGEDGTVILAVAESGVAYSWD 234
S N E+ + V G + ++
Sbjct: 276 LSENKEEPVKLAVVCRDGQVHLFE 299
>gi|328868055|gb|EGG16435.1| WD40 repeat-containing protein [Dictyostelium fasciculatum]
Length = 578
Score = 42.0 bits (97), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 27/109 (24%), Positives = 53/109 (48%), Gaps = 22/109 (20%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYS---------CLACGFVGKKRRKERGTLLLALGTSN 51
+W+T + S EP+ +YS CL+ K ++G ++A+GT+
Sbjct: 31 VWNTTNSS----------EPLRTYSDDRESSKLTCLSIFTSTKASSIQKG--VIAIGTNY 78
Query: 52 GDILAVDVLTGEMKWKSTGRHPG-GLAGLAFAKKGRSLHVVGTNGMASE 99
GD+L ++L+G+M+ + TG H G + G+ F + + +G ++
Sbjct: 79 GDVLIYNILSGKMEKRLTGAHAGCKVVGVVFGTTEKDIFSCDIDGRVAQ 127
>gi|393222446|gb|EJD07930.1| WD40 repeat-like protein [Fomitiporia mediterranea MF3/22]
Length = 1569
Score = 41.6 bits (96), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 43/147 (29%), Positives = 68/147 (46%), Gaps = 9/147 (6%)
Query: 45 LALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEM 104
+A GT+ G I D TG++K H G + F+ G+ L G + E
Sbjct: 910 IACGTTTGTISIRDGQTGDVKVPPIRAHTGQTTSVVFSPDGKLLASSGQDQTIRIWDVES 969
Query: 105 GEVIREFKASEKPISSSAFLCE-EKIFALASSE-VRILSLEN----GEEVLKFS--DDVG 156
G + + ISS AFL + ++I A A+ +RI +EN GE + +S D +G
Sbjct: 970 GMPDGDVMEIDTAISSLAFLPDGKRIIAGANDRTIRIWDVENRKQVGEPIQGYSVGDHIG 1029
Query: 157 PLQYVSASDGAKIIITAGYGEKHLQVW 183
++ V+ S + +A G K LQ+W
Sbjct: 1030 TIRDVAVSPDGRYFASASDG-KVLQIW 1055
>gi|299755778|ref|XP_001828881.2| WD40 repeat-containing protein [Coprinopsis cinerea okayama7#130]
gi|298411378|gb|EAU92888.2| WD40 repeat-containing protein [Coprinopsis cinerea okayama7#130]
Length = 325
Score = 41.6 bits (96), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 61/238 (25%), Positives = 101/238 (42%), Gaps = 40/238 (16%)
Query: 85 GRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSL 142
G L G + +E G++I F+ ++ I+ A+ + + A AS + V I SL
Sbjct: 47 GSMLASAGPDKTIKLWDTESGDIIHTFRGHKEGINDLAWAPDGEFIASASDDKTVIIWSL 106
Query: 143 ENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDIS-SKTVNKGPALSMR 201
E E V S + ++ + + ++++ GY E + +W D++ K + PA S
Sbjct: 107 ELREPVKTLSRHTSVVFCINYNPNSNLLVSGGYDETVI-IW--DVARGKALKTLPAHSDP 163
Query: 202 HSPVAIDCKNSPNGEDGTVILAVAESGVAYSWD------LKTVSQDEKTNPAKITVKL-- 253
+ V + +DGT+I++ A G+ WD LKT+ D+ NP V
Sbjct: 164 VTAVGFN-------DDGTLIISCAMDGLIRLWDAESGQCLKTLVDDD--NPICSHVCFSP 214
Query: 254 --KKADADQQNSV----NVKKS-----------RTSIIAARLNDLEADGQKTITGLHD 294
K A A Q+S N++ S RT I A + GQ +TG D
Sbjct: 215 NSKFALASTQDSTIRLWNIQSSRCVKTYTGHVNRTYCIPACFATKSSKGQYIVTGSED 272
>gi|194752173|ref|XP_001958397.1| GF10899 [Drosophila ananassae]
gi|190625679|gb|EDV41203.1| GF10899 [Drosophila ananassae]
Length = 1229
Score = 41.6 bits (96), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 23/68 (33%), Positives = 37/68 (54%), Gaps = 2/68 (2%)
Query: 45 LALGTSNGDILAVDVLTGEMKWKSTGRH-PGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
+A+GTS+G IL DV T ++W +H G +A LAF L G+ + + ++
Sbjct: 132 IAVGTSHGHILNFDV-TQTLRWAHQDKHGQGAVASLAFNADSTRLLAGFARGLVTMLDTQ 190
Query: 104 MGEVIREF 111
G+V+RE
Sbjct: 191 TGDVLREL 198
>gi|126696474|ref|YP_001091360.1| WD-40 repeat-containing G-protein [Prochlorococcus marinus str. MIT
9301]
gi|126543517|gb|ABO17759.1| G-protein beta WD-40 repeat-containing protein [Prochlorococcus
marinus str. MIT 9301]
Length = 357
Score = 41.6 bits (96), Expect = 0.50, Method: Compositional matrix adjust.
Identities = 44/184 (23%), Positives = 76/184 (41%), Gaps = 9/184 (4%)
Query: 22 VSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAF 81
VS +ACG+ K ++ +G G I A + TG++ WK H GGL ++
Sbjct: 21 VSDYAIACGWALKGKQ-------FIVGDVAGGIFAFEGDTGKIIWKKENTHSGGLLAMSI 73
Query: 82 AKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILS 141
+G G +G G+VI+ + + + + A+ASS+ +
Sbjct: 74 HPEGEIFATSGQDGNVQICNCHEGKVIKTLNLGKGWVEHLKWSNDGLFLAIASSKKVYVF 133
Query: 142 LENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGE-KHLQVWRCDISSKTVNKGPALSM 200
E GEE D + ++ S+ + + TA YG + + + K +G +SM
Sbjct: 134 NEIGEEKWISEDHPSTVSAITWSNKNE-LATACYGRVTFFDIVKNKTNQKLEWQGSLVSM 192
Query: 201 RHSP 204
SP
Sbjct: 193 ELSP 196
>gi|333984288|ref|YP_004513498.1| hypothetical protein [Methylomonas methanica MC09]
gi|333808329|gb|AEG00999.1| WD40 repeat-containing protein [Methylomonas methanica MC09]
Length = 1227
Score = 41.6 bits (96), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 29/125 (23%), Positives = 59/125 (47%), Gaps = 5/125 (4%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
+A G+++ I +D ++ + S G + +AF+ GR L G + +
Sbjct: 589 FIATGSNDNQIRVIDTACHDILFSSILN--GAVTAVAFSPDGRYLAAAGADPTVQLIDVS 646
Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYV 161
G +R+FK P+++ AF + A+AS++ VR++ + +G EV + +
Sbjct: 647 TGRTLRQFKLV-GPVANIAFSPDGSRLAVASADKSVRLIDVSSGREVFRVKHGAEATHVI 705
Query: 162 SASDG 166
++DG
Sbjct: 706 FSADG 710
>gi|193213052|ref|YP_001999005.1| WD-40 repeat-containing protein, partial [Chlorobaculum parvum NCIB
8327]
gi|193086529|gb|ACF11805.1| WD-40 repeat protein [Chlorobaculum parvum NCIB 8327]
Length = 241
Score = 41.6 bits (96), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 44/186 (23%), Positives = 80/186 (43%), Gaps = 14/186 (7%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVL 60
+W T G + ++K SY AC F +K T + T N ++ +D
Sbjct: 7 LWDTLTGKRIIKYKS-------SYFFRACAF-SPDGKKILATTDVNTDTRNSLLMLLDAN 58
Query: 61 TGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISS 120
+ E+ G HP + AF+ G+ + SE G+ I F+ + +
Sbjct: 59 SAELLMTIQG-HPRDIVSCAFSPNGKKFLSTSLDSTLKLWDSETGQCINTFEDHIEAVWD 117
Query: 121 SAFLCE-EKIFALASSE-VRILSLENGEEVLKFSDDVGPLQYVSA-SDGAKIIITAGYGE 177
AF + +KI + +S +++ + +G ++ FS P+ SDG +II +G +
Sbjct: 118 CAFSPDGDKILSSSSDHTLKLWDVNSGHCLITFSGHSSPVFSCRFNSDGTRII--SGSSD 175
Query: 178 KHLQVW 183
K L++W
Sbjct: 176 KTLKLW 181
>gi|393238055|gb|EJD45594.1| WD40 repeat-like protein [Auricularia delicata TFB-10046 SS5]
Length = 584
Score = 41.6 bits (96), Expect = 0.53, Method: Compositional matrix adjust.
Identities = 28/110 (25%), Positives = 51/110 (46%), Gaps = 7/110 (6%)
Query: 54 ILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKA 113
++ D G +WK G G + + GR L +VG G + + G + E +
Sbjct: 113 VVGSDAAAGRREWKLDG----GSYRSRYTRNGRHLAIVGKKGHVATFDWQTGTLHSELQL 168
Query: 114 SEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGP--LQYV 161
E FL ++ +FA+A S+ + +NG E+ + S+ + P L+Y+
Sbjct: 169 RET-CRDITFLQDQSMFAVAQSKHVYIYDQNGVELHRLSNHIEPTRLEYL 217
>gi|119490221|ref|ZP_01622734.1| Peptidase C14, caspase catalytic subunit p20 [Lyngbya sp. PCC 8106]
gi|119454107|gb|EAW35260.1| Peptidase C14, caspase catalytic subunit p20 [Lyngbya sp. PCC 8106]
Length = 1245
Score = 41.6 bits (96), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 46/189 (24%), Positives = 95/189 (50%), Gaps = 14/189 (7%)
Query: 48 GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKS-EMGE 106
G+ + I ++ TGE+ TG H G+ ++ + +++ V G++ ++ + E GE
Sbjct: 696 GSGDNTIKVWNLETGELIRTLTG-HRYGVRSVSISNDSKTI-VSGSDDKTIKVWNLETGE 753
Query: 107 VIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVSAS 164
+IR K ++ +SS + + K S + +++ + E G E+ + ++ VS S
Sbjct: 754 LIRTLKGHDREVSSVSISNDSKTIVSGSDDKTIKVWNRETGAEIRTLTGHRYGVRSVSIS 813
Query: 165 DGAKIIITAGYGEKHLQVWRC----DISSKTVNKGPALSMRHSPVAIDCKNSPNG-EDGT 219
+ +K I+ +G G+ ++VW +IS+ T + G S+ ++ D K +G ED T
Sbjct: 814 NDSKTIV-SGSGDNTIKVWNLQTGKEISNLTGHNGQVWSV---SISNDSKTIVSGSEDST 869
Query: 220 VILAVAESG 228
+ + E+G
Sbjct: 870 IKVWNLETG 878
>gi|256380418|ref|YP_003104078.1| hypothetical protein Amir_6431 [Actinosynnema mirum DSM 43827]
gi|255924721|gb|ACU40232.1| WD-40 repeat protein [Actinosynnema mirum DSM 43827]
Length = 1766
Score = 41.6 bits (96), Expect = 0.57, Method: Composition-based stats.
Identities = 42/164 (25%), Positives = 71/164 (43%), Gaps = 20/164 (12%)
Query: 76 LAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASS 135
+ LA A GR L G++ + + + GE + E E I++ AF + ASS
Sbjct: 1260 ITALALAPDGRRL-ATGSSAGTAHVWTAGGEHVAELAGHENWINAVAFSPDGARVTTASS 1318
Query: 136 EVRILS-LENGEEVLKFSDDVGPLQYVSAS-DGAKIIITAGYGEKHLQVWRCD---ISSK 190
+ + +G +V +DDVGP+ ++ S DG + A G H VW D +++
Sbjct: 1319 DRTARTWTTDGTQVAVLTDDVGPVTALAHSPDGKHVATGASDGTGH--VWTADGSLVATL 1376
Query: 191 TVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWD 234
++G S+ +SP DG +I A +W+
Sbjct: 1377 LGHQGVITSIAYSP------------DGAIITTAGSDKTARTWN 1408
>gi|126657815|ref|ZP_01728969.1| beta transducin-like protein [Cyanothece sp. CCY0110]
gi|126621032|gb|EAZ91747.1| beta transducin-like protein [Cyanothece sp. CCY0110]
Length = 345
Score = 41.6 bits (96), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 55/235 (23%), Positives = 100/235 (42%), Gaps = 15/235 (6%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
LL +GT +G++ A+D+ T E + S + +AF + + + VVG + +
Sbjct: 74 LLLVGTGDGELTAIDLETKETLY-SKAVLVNNYSSIAFNVE-KDILVVGDDQTVMVLSLS 131
Query: 104 MGEVIREFKASEKPISSSAFLCE-EKIFALASSE--VRILSLENGEEVLKFSDDVGPLQY 160
G+ + + +S A + I +++ + +RI LE+GE + ++GP
Sbjct: 132 TGQKLSFLREHTGKVSDVAISPDGNNIVSVSGDDQTIRIWDLESGELIKTIGANIGPTTS 191
Query: 161 VSASDGAKIIITAGYG-EKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGT 219
V + + IT G ++ L+ W + K +N S + D K S DGT
Sbjct: 192 VQYTPDGTMFITGAIGSDRTLKFWDA-TTFKLLNT----SSQQPGFINDLKIS---NDGT 243
Query: 220 VILAVAESGVAYSWDLKTVSQDEKTNPAKITVKLKKADADQQNSVNVKKSRTSII 274
++A + V SWDL T+ Q T K+ + D + K T ++
Sbjct: 244 QLVAAVRNFVK-SWDLTTLQQVWSTKGPKLEINTIAVSPDNRTVATANKEGTIML 297
>gi|296120844|ref|YP_003628622.1| serine/threonine protein kinase-related protein [Planctomyces
limnophilus DSM 3776]
gi|296013184|gb|ADG66423.1| Serine/threonine protein kinase-related protein [Planctomyces
limnophilus DSM 3776]
Length = 1856
Score = 41.6 bits (96), Expect = 0.58, Method: Compositional matrix adjust.
Identities = 45/187 (24%), Positives = 80/187 (42%), Gaps = 20/187 (10%)
Query: 61 TGEMKWKSTGRHPGGLAG-LAFAKKGRSLHVVGTNGMASEM---------KSEMGEVIRE 110
T + K S R P G+ L ++ GR L + +E+ EMG + +
Sbjct: 1477 TADAKLVSEYRLPEGMVNSLEISRDGRLLAIANAETRQAEILVLPGFKRIPMEMGGSVVD 1536
Query: 111 FKASEKPISSSAFLCE-EKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKI 169
K + + AF E E + + ++ R+ +L +G E++ FS + G + S +
Sbjct: 1537 LKKRGALLWAVAFGPEAETLMTIGGTDARLWNLRDGREMMSFSPN-GVVASARYSPDNQW 1595
Query: 170 IITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGV 229
I+T+ + + ++VW+ V L H+ SP+GE +IL ++ G
Sbjct: 1596 IVTSSW-DNSVKVWKAATGESMVR----LEGGHTSAVNMASFSPDGE---LILTASDDGT 1647
Query: 230 AYSWDLK 236
A WD K
Sbjct: 1648 AKLWDWK 1654
>gi|194335525|ref|YP_002017319.1| WD-40 repeat-containing protein [Pelodictyon phaeoclathratiforme
BU-1]
gi|194308002|gb|ACF42702.1| WD-40 repeat protein [Pelodictyon phaeoclathratiforme BU-1]
Length = 316
Score = 41.6 bits (96), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 27/107 (25%), Positives = 55/107 (51%), Gaps = 3/107 (2%)
Query: 79 LAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCE-EKIFALASSE- 136
LAF+ R + G + M + + GE+ R F+ + + S F + ++ + A+ E
Sbjct: 208 LAFSPDDRFIAFCGRDAMVKILDAASGEITRVFEGHQDAVRSVCFTPDGSRVVSAANDET 267
Query: 137 VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVW 183
VR+ +E+G+++ + V +Q V S K II +G ++ +++W
Sbjct: 268 VRLWDIESGKQLHLYRGHVLEVQSVDVSPDGK-IIASGSDDRKIKLW 313
>gi|340726869|ref|XP_003401774.1| PREDICTED: WD repeat-containing protein 43-like [Bombus terrestris]
Length = 629
Score = 41.6 bits (96), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 58/271 (21%), Positives = 116/271 (42%), Gaps = 45/271 (16%)
Query: 34 KKRRK--------ERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKK- 84
KKRRK ++G ++A+G++NG + D T + + G H G + + +++
Sbjct: 70 KKRRKKSISEESNQKG--IVAMGSTNGRVTLYDTTTSLVTAQLNG-HSGTVTAVTWSENV 126
Query: 85 -----GRSLHVVG----TNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASS 135
H++ NG+ + KS G+ + S A + K
Sbjct: 127 GLFSAADDHHIIQWNLQENGVRCKWKSGKGKTV-----------SLAISPDGKSLLSGER 175
Query: 136 EVRILSLENGEEVLKFS---DDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTV 192
++ +L+ + V F+ ++V L V + G +I+ G+ +L VW D
Sbjct: 176 VIKWWNLDTKQLVRTFTGHANEVSCLHTVQTTSGNNYVISGACGDGYLNVWALDEQRNER 235
Query: 193 NKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQDEKTNPAKITVK 252
+L+++ P+++ S N + ++L V SG A+ L + ++ P K ++
Sbjct: 236 TAVASLALQDDPISVSTHISENFQ--VIVLVVTRSGQAH---LFQYQPNGRSKPLKPSLN 290
Query: 253 LKKA-DADQQNSVNVKKSRTSIIAARLNDLE 282
+ A D Q++SV + SI+ +L + E
Sbjct: 291 IAVASDISQKDSVQ----QISILNGKLTEDE 317
>gi|291230364|ref|XP_002735136.1| PREDICTED: katanin p80 subunit-like [Saccoglossus kowalevskii]
Length = 666
Score = 41.6 bits (96), Expect = 0.61, Method: Compositional matrix adjust.
Identities = 26/114 (22%), Positives = 49/114 (42%), Gaps = 2/114 (1%)
Query: 72 HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFA 131
H + + F+ GR + G +G+A G++I EFK P+++ F E + A
Sbjct: 143 HTNRINSVRFSPDGRWVASAGEDGLAKLWDLAAGKLINEFKHHTGPVNNIEFHPNEFLLA 202
Query: 132 LASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVW 183
S++ V+ LEN V + P++ + ++ + G H+ W
Sbjct: 203 TGSADRTVKFWDLENFNLVGTTDKEASPIRCILFHQDGNVLFSGGQDSLHVYSW 256
>gi|428174103|gb|EKX43001.1| hypothetical protein GUITHDRAFT_73545 [Guillardia theta CCMP2712]
Length = 623
Score = 41.2 bits (95), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 47/162 (29%), Positives = 74/162 (45%), Gaps = 12/162 (7%)
Query: 29 CGFVGKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSL 88
CG K R L+A G+ NG + V TG+ WK H + +++ + GR L
Sbjct: 2 CGSFSKDSR------LVATGSKNGLVRLWCVETGKC-WKELPGHETMIRSISWGQDGRML 54
Query: 89 HVVGTNGMASEMKSEMGEVIREFKASE-KPISSSAFLCEEKIFALASSE--VRILSLENG 145
V NG K E + A + I+S + + K+ A AS+E V + ++E G
Sbjct: 55 AAVSQNGTTRVWKEPWTEEACQVLAGHGETINSVSLGPDGKMLAAASNEKKVWVWNVETG 114
Query: 146 EEVLKFSDDV-GPLQYVSASDGAKIIITAGYGEKHLQVWRCD 186
E L+ + V G + V+ K +I +G +K +QVW D
Sbjct: 115 EVRLELKEQVEGGVTSVAWRPDGK-MIASGSLDKAIQVWEVD 155
>gi|47220108|emb|CAF99021.1| unnamed protein product [Tetraodon nigroviridis]
Length = 651
Score = 41.2 bits (95), Expect = 0.63, Method: Compositional matrix adjust.
Identities = 55/248 (22%), Positives = 99/248 (39%), Gaps = 22/248 (8%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVG--------KKRRKERGTL-----LLAL 47
IW+T+ +L E+ P + C++ G KKR+ E G + LLA+
Sbjct: 35 IWNTDSKTLHQEY-VPSAHLSATCICISWGPCRTLKERPQRKKRKSEAGQVEESADLLAM 93
Query: 48 GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
GT+ G +L G + G H GG+ + + L+ + E + G+
Sbjct: 94 GTAAGTVLIYSTAKGALHCTLDGGHSGGVNCVQWHPDDSLLYSGSDDTNIVEWDLQTGKT 153
Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKF---SDDVGPLQYVSA- 163
++KA ++S + K+ A +++ L E KF S V L + +
Sbjct: 154 KSKWKADRAAVTSLCVSPDGKLLLSAGQTIKMWDLNTKEVYRKFTGHSTAVTTLCFATTR 213
Query: 164 ---SDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTV 220
S+G + A + ++ L VW+ K N + ++ P ID S + E+
Sbjct: 214 PPDSNGLYFLSGAAH-DRLLSVWQVREDGKDKNSVVSFALTDEPQHIDLIPSKSREEVMR 272
Query: 221 ILAVAESG 228
+ V + G
Sbjct: 273 LAVVCKDG 280
>gi|166366565|ref|YP_001658838.1| WD repeat-containing protein [Microcystis aeruginosa NIES-843]
gi|166088938|dbj|BAG03646.1| WD-repeat protein [Microcystis aeruginosa NIES-843]
Length = 962
Score = 41.2 bits (95), Expect = 0.66, Method: Compositional matrix adjust.
Identities = 44/187 (23%), Positives = 78/187 (41%), Gaps = 22/187 (11%)
Query: 56 AVDVLTGEMKWKSTGR----HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREF 111
++ L + WKS H + + F+ G++L V + E GE IR
Sbjct: 571 VMNTLQALLNWKSERNRLEGHKDFVTSVNFSPDGKTLVSVSRDNTIKLWNVETGEEIRTL 630
Query: 112 KASEKPISSSAFLCEEKIFALAS--SEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKI 169
K + + S F + K S + +++ ++E GEE+ GP+ V+ S K
Sbjct: 631 KGHD-GVQSVNFSPDGKTLVSGSWDNTIKLWNVETGEEIRTLKGHNGPVNSVNFSPDGKT 689
Query: 170 IITAGYGEKHLQVWRCDISSKTVN-KGPALSMRHSPVAIDCKNSPNGE-------DGTVI 221
++ +G G+K +++W + + KG H I SP+G+ D T+
Sbjct: 690 LV-SGSGDKTIKLWNVETGQEIRTLKG------HDNSVISVNFSPDGKTLVSGSGDNTIK 742
Query: 222 LAVAESG 228
L E+G
Sbjct: 743 LWNVETG 749
>gi|329663507|ref|NP_001192527.1| WD repeat-containing protein 43 [Bos taurus]
gi|296482332|tpg|DAA24447.1| TPA: WD repeat domain 43 [Bos taurus]
Length = 681
Score = 41.2 bits (95), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 54/262 (20%), Positives = 101/262 (38%), Gaps = 29/262 (11%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFV--------GKKRRKERGTL------LLA 46
+W T + L E+ P + +CLA +K+RK + LLA
Sbjct: 40 VWETANNRLHQEY-VPSAHLSGTCTCLAWAPARLQAKEGPQRKKRKSEAIVTSNQADLLA 98
Query: 47 LGTSNGDILAVDVLTGEMKWK-STGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMG 105
LGT+ G IL + GE+ K G H + + + + L+ + E ++
Sbjct: 99 LGTAVGSILLYSTVKGELHSKLINGGHDNRVNCIQWHQDNGCLYSCSDDKHIVEWNTQTC 158
Query: 106 EVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSAS- 164
+V ++K +SS + K+ A +++ LE E F+ P+ ++ +
Sbjct: 159 KVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLETKEVYRHFTGHATPVSSLTFTT 218
Query: 165 ----------DGAK--IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNS 212
DG ++ ++ L VW+ +K N + ++ PV ID S
Sbjct: 219 IRPPNEIQPFDGITGLYFLSGAVHDRLLNVWQVRSENKEKNAVMSFTVTDEPVYIDLTLS 278
Query: 213 PNGEDGTVILAVAESGVAYSWD 234
N E+ + V G + ++
Sbjct: 279 ENKEEPVKLAVVCRDGQVHLFE 300
>gi|195377108|ref|XP_002047334.1| GJ11984 [Drosophila virilis]
gi|194154492|gb|EDW69676.1| GJ11984 [Drosophila virilis]
Length = 1229
Score = 41.2 bits (95), Expect = 0.68, Method: Composition-based stats.
Identities = 23/68 (33%), Positives = 38/68 (55%), Gaps = 2/68 (2%)
Query: 45 LALGTSNGDILAVDVLTGEMKWKSTGRHP-GGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
+A+GTS+G IL DV T ++W +H G ++ LAF L V G+ + + ++
Sbjct: 131 IAVGTSHGHILNFDV-TQTLRWAHQDKHSQGAVSSLAFNGDSTRLLVGYARGLVTMLDTQ 189
Query: 104 MGEVIREF 111
G+V+RE
Sbjct: 190 TGDVLREL 197
>gi|428181079|gb|EKX49944.1| hypothetical protein GUITHDRAFT_161985 [Guillardia theta CCMP2712]
Length = 676
Score = 41.2 bits (95), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 52/242 (21%), Positives = 99/242 (40%), Gaps = 14/242 (5%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACG------FVGKKRRKERGTLLLALGTSNGDI 54
IW G+ +E+ P G+ +C++ G KK++ +RG L+A+GT G I
Sbjct: 24 IWDVTSGAASSEF-SPTGKHAAELTCVSWGKALPVTSSAKKKKSKRGADLVAIGTGAGKI 82
Query: 55 LAVDVLTGEMKWKSTGR---HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREF 111
L D + GE++ G H G + + G +L+ + S E GE+ +
Sbjct: 83 LLWDCVKGELRKTLGGEKSGHTGRINDIVLT-SGSNLYCCTDDCFFSCWNIETGEMTWKV 141
Query: 112 KASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIII 171
++ + A E ASS +++ L + K+ + ++ S ++ ++
Sbjct: 142 AVGKQSANRLALRSSEDEIVTASSAIKVWDLATRKTKRKYPGHASRVVCLAYSADSRFLV 201
Query: 172 TAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAY 231
+ G ++ + +W + ++ SP I GE +LAV+E G
Sbjct: 202 S-GASDRFISLWDSSFETDNTVALKTFTLDSSPTQISLVGG-TGE-SMQMLAVSEGGSVS 258
Query: 232 SW 233
W
Sbjct: 259 VW 260
>gi|153869314|ref|ZP_01998955.1| WD-40 repeat protein [Beggiatoa sp. PS]
gi|152074160|gb|EDN71045.1| WD-40 repeat protein [Beggiatoa sp. PS]
Length = 1207
Score = 41.2 bits (95), Expect = 0.68, Method: Compositional matrix adjust.
Identities = 56/230 (24%), Positives = 101/230 (43%), Gaps = 30/230 (13%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVL 60
WS G ++ WK PD ++ S G L + GT N I ++
Sbjct: 685 FWSLPTGKIIKRWKAPDKVKALALS-------------PDGKYLASAGTDN-KITLWNLE 730
Query: 61 TGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISS 120
T + + +G H ++GLAF+ G L +G A + + G+V+ KA +
Sbjct: 731 TDQPQQIFSG-HKDQISGLAFSSDGELLASASYDGTARLWQVKTGKVLHTLKAHTDHVQK 789
Query: 121 SAFLCEEKIFALASSE--VRILSLENG--EEVLK-FSDDVGPLQYVSASDGAKIIITAGY 175
AF + + A +S + +R+ ++ +G E VL+ + ++++ D + +++A
Sbjct: 790 VAFSHDNQWLATSSKDATIRLWNVNSGKTERVLRGHKQIIFDIRFI---DHGQTLVSAS- 845
Query: 176 GEKHLQVWRCDISS---KTVNKGPALSMRHSPVAIDCKNSPNGEDGTVIL 222
++ L++W DI S K V +G + D K +DGTVIL
Sbjct: 846 DDRTLRLW--DIQSGVTKRVFQGHTAGVT-GIATFDNKIFSASDDGTVIL 892
>gi|427419559|ref|ZP_18909742.1| WD40 repeat-containing protein [Leptolyngbya sp. PCC 7375]
gi|425762272|gb|EKV03125.1| WD40 repeat-containing protein [Leptolyngbya sp. PCC 7375]
Length = 1143
Score = 41.2 bits (95), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 34/151 (22%), Positives = 65/151 (43%), Gaps = 9/151 (5%)
Query: 67 KSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCE 126
K+ +H GG+ G+AF+ G++L G + M ++ G ++ + P+ S F
Sbjct: 905 KTLTKHQGGVWGVAFSPDGQTLASAGGDNMVKLWHAD-GTFLKTLEGHRAPVWSVMFNPN 963
Query: 127 EKIFALASSEVRI-LSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRC 185
+ A S + L ++G+ + F +D L ++ S ++T G + +++W+
Sbjct: 964 GRTLATTSGDATAKLWNQDGKVITTFDNDGIILFDIAFSPDGHTLVTGG-SDGIVKLWQA 1022
Query: 186 DISSKTVNKGPALSMRHSPVAIDCKNSPNGE 216
D + G H SPNGE
Sbjct: 1023 DGTLLNTMVG------HGAAVFQVAFSPNGE 1047
>gi|440899093|gb|ELR50459.1| WD repeat-containing protein 43, partial [Bos grunniens mutus]
Length = 703
Score = 41.2 bits (95), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 54/262 (20%), Positives = 101/262 (38%), Gaps = 29/262 (11%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFV--------GKKRRKERGTL------LLA 46
+W T + L E+ P + +CLA +K+RK + LLA
Sbjct: 62 VWETANNRLHQEY-VPSAHLSGTCTCLAWAPARLQAKEGPQRKKRKSEAIVTSNQADLLA 120
Query: 47 LGTSNGDILAVDVLTGEMKWK-STGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMG 105
LGT+ G IL + GE+ K G H + + + + L+ + E ++
Sbjct: 121 LGTAVGSILLYSTVKGELHSKLINGGHDNRVNCIQWHQDNGCLYSCSDDKHIVEWNTQTC 180
Query: 106 EVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSAS- 164
+V ++K +SS + K+ A +++ LE E F+ P+ ++ +
Sbjct: 181 KVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLETKEVYRHFTGHATPVSSLTFTT 240
Query: 165 ----------DGAK--IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNS 212
DG ++ ++ L VW+ +K N + ++ PV ID S
Sbjct: 241 IRPPNESQPFDGITGLYFLSGAVHDRLLNVWQVRSENKEKNAVMSFTVTDEPVYIDLTLS 300
Query: 213 PNGEDGTVILAVAESGVAYSWD 234
N E+ + V G + ++
Sbjct: 301 ENKEEPVKLAVVCRDGQVHLFE 322
>gi|262198770|ref|YP_003269979.1| serine/threonine protein kinase with WD40 repeats [Haliangium
ochraceum DSM 14365]
gi|262082117|gb|ACY18086.1| serine/threonine protein kinase with WD40 repeats [Haliangium
ochraceum DSM 14365]
Length = 1823
Score = 41.2 bits (95), Expect = 0.70, Method: Composition-based stats.
Identities = 31/104 (29%), Positives = 47/104 (45%), Gaps = 3/104 (2%)
Query: 76 LAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASS 135
+A +F+ G L G +G GE+IR F E I ++AF + A A S
Sbjct: 1589 IAAASFSPDGERLATAGDDGRVHIWNVATGELIRAFVGHEGTIKAAAFAPSGQHLATAGS 1648
Query: 136 E--VRILSLENGEEVLKFSDDVGPLQYVS-ASDGAKIIITAGYG 176
+ VR+ GE + F+ P+ V SDG+++I A G
Sbjct: 1649 DRSVRLWDASTGERLQTFTGHTLPINTVHFNSDGSRLISAAEDG 1692
>gi|66820176|ref|XP_643725.1| WD40 repeat-containing protein [Dictyostelium discoideum AX4]
gi|60471897|gb|EAL69851.1| WD40 repeat-containing protein [Dictyostelium discoideum AX4]
Length = 569
Score = 41.2 bits (95), Expect = 0.76, Method: Compositional matrix adjust.
Identities = 38/156 (24%), Positives = 67/156 (42%), Gaps = 11/156 (7%)
Query: 34 KKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTG---RHPGGLAGLAFAKKGRSLHV 90
K +KE L+A+ S G I D + +K G H + + K R L
Sbjct: 57 KSEKKEYYKNLVAVSNS-GKIYMYDCI----DFKKKGIFDTHSKNITAVVLGK--RYLFT 109
Query: 91 VGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLK 150
G S+ + +IR F++SEK ISS E A+A++++ + L N +E K
Sbjct: 110 SSHGGKVSQWDIKTKHIIRSFESSEKSISSMGLNNRESQLAIANNDITFIDLSNFKESKK 169
Query: 151 FSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCD 186
S ++ + + K +I Y + +Q++ +
Sbjct: 170 VSTGSNSIKQIQFTQDDKFLICNRY-QPTIQIYNIE 204
>gi|426226305|ref|XP_004007288.1| PREDICTED: WD repeat-containing protein 43 [Ovis aries]
Length = 716
Score = 41.2 bits (95), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 43/202 (21%), Positives = 82/202 (40%), Gaps = 14/202 (6%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWK-STGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
LLALGT+ G IL + GE+ K +G H + + + + L+ + E +
Sbjct: 131 LLALGTAVGSILLYSTVKGELHSKLISGGHDNRVNCIQWHQDNGCLYSCSDDKHIVEWNT 190
Query: 103 EMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVS 162
+ +V ++K +SS + K+ A +++ LE E F+ P+ ++
Sbjct: 191 QTCKVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLETKEVYRHFTGHATPVSSLT 250
Query: 163 AS-----------DGAK--IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDC 209
+ DG ++ ++ L VW+ +K N + ++ PV ID
Sbjct: 251 FTTIRPPNESQPFDGITGLYFLSGAVHDRLLNVWQVRSENKEKNAVMSFTVTDEPVFIDL 310
Query: 210 KNSPNGEDGTVILAVAESGVAY 231
S N E+ + V + G +
Sbjct: 311 TLSENKEEPVKLAVVCKDGQVH 332
>gi|334312883|ref|XP_001380617.2| PREDICTED: WD repeat-containing protein 43 [Monodelphis domestica]
Length = 682
Score = 40.8 bits (94), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 46/208 (22%), Positives = 85/208 (40%), Gaps = 20/208 (9%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWK-STGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
LLA+GT+ G IL + + GE++ K +G H + + + + L+ + E +
Sbjct: 96 LLAIGTAVGSILLYNTVKGELQSKLVSGGHDNKVNCIQWHQDNDCLYSCSDDKYIVEWNT 155
Query: 103 EMGEVIREFKASEKPISSSAFLC---EEKIFALASSEVRILSLENGEEVLKFSDD----- 154
+ V ++K ISS + LC + K+ A +++ LE E F+
Sbjct: 156 QTCRVKCKWKGD---ISSVSALCISPDGKMLLSAGRTIKLWDLETKEVYRHFTGHATAVS 212
Query: 155 ------VGPLQYVSASDGAK--IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVA 206
+ PL DG ++ ++ L VW+ +K N + ++ PV
Sbjct: 213 SLIFTTIRPLNESQPFDGITGLYFLSGAVHDRLLNVWQIRSENKEKNAVMSFTVTDEPVY 272
Query: 207 IDCKNSPNGEDGTVILAVAESGVAYSWD 234
ID S N E+ + V G + ++
Sbjct: 273 IDLTLSENKEEPVKLAVVCRDGQVHLFE 300
>gi|170288772|ref|YP_001739010.1| WD-40 repeat-containing protein [Thermotoga sp. RQ2]
gi|170176275|gb|ACB09327.1| WD-40 repeat protein [Thermotoga sp. RQ2]
Length = 580
Score = 40.8 bits (94), Expect = 0.82, Method: Compositional matrix adjust.
Identities = 53/221 (23%), Positives = 99/221 (44%), Gaps = 15/221 (6%)
Query: 13 WKQPDGEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRH 72
W PDGE + S L V + R +G +NG + D L+ + H
Sbjct: 131 WSFPDGEEISS-QKLGPSVVQIAAYENR----YVVGLANGLAVIRD-LSNQSFSIPLKAH 184
Query: 73 PGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFL-CEEKIFA 131
P G+ + F++ G+ + G N + S G ++ ++ + ++ AFL ++ IF
Sbjct: 185 PEGIKKIVFSENGQLIATCGGNTVKVWNASN-GNLVLQYDHA-ITVNDLAFLDNDDLIFV 242
Query: 132 LASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYG-EKHLQVWRCDISSK 190
+ +L+++ E V + +VS+ DG+ A YG +K ++VW D+ +
Sbjct: 243 ADDYKAVVLNIQKNEVVKSIDAHNNFVLFVSSDDGS----IATYGMDKTVKVWNADLELQ 298
Query: 191 TVNKGPALSMRHSPVAIDCKNSPNG-EDGTVILAVAESGVA 230
G LS+ ++ D K +G +D +++ AE G+A
Sbjct: 299 YSLYGHQLSVNTVALSSDGKFIVSGSDDREILIWNAEKGIA 339
>gi|443659552|ref|ZP_21132351.1| WD domain, G-beta repeat family protein [Microcystis aeruginosa
DIANCHI905]
gi|443332697|gb|ELS47292.1| WD domain, G-beta repeat family protein [Microcystis aeruginosa
DIANCHI905]
Length = 1108
Score = 40.8 bits (94), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 50/200 (25%), Positives = 83/200 (41%), Gaps = 26/200 (13%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
++A+ +G I +D +++ +T + +AF + + G NG ++ S+
Sbjct: 842 IIAIANKDGQITLLDSQGKKIREFTTKMR--SIYSIAFHPDSNQIAITGRNGKV-QIWSQ 898
Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFS---DDVGPL 158
G +++EF AS+ PI S AF E +SE V+ L N L S DD
Sbjct: 899 KGTMLQEFTASQVPIYSLAFNGEGTAIITGTSEGKVQYWHLNNHRPQLINSWTVDDSIIY 958
Query: 159 QYVSASDGAKIIITAGYGEKHLQVWRCD---ISSKTVNKGPALSMRHSPVAIDCKNSPNG 215
V + D KI TA G+ +++W + + P + SP
Sbjct: 959 DLVFSPDHQKIA-TATRGK--IKIWDLQGNLLKEIKTDSFPVYGVSFSP----------- 1004
Query: 216 EDGTVILAVAESGVAYSWDL 235
DG I A++ G A WD+
Sbjct: 1005 -DGEKIAAISRDGTARRWDI 1023
>gi|422304841|ref|ZP_16392180.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
9806]
gi|389789952|emb|CCI14119.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
9806]
Length = 559
Score = 40.8 bits (94), Expect = 0.89, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 59/106 (55%), Gaps = 6/106 (5%)
Query: 79 LAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALAS-SEV 137
LAF+ G+ L G++ + + GE+I+ ++A E+ I S A + A AS +E+
Sbjct: 385 LAFSNDGQYLISGGSDKVIKIWDIKTGEIIKSWQAHEQAIISIAINPHRHLIASASRTEI 444
Query: 138 RILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVW 183
+I + G E++K PL++ + DG + +IT YG+K +++W
Sbjct: 445 KIWQGQTG-ELIKVLRGTAPLKF--SPDG-QFLITGSYGDK-VKIW 485
>gi|403362279|gb|EJY80863.1| hypothetical protein OXYTRI_21745 [Oxytricha trifallax]
Length = 524
Score = 40.8 bits (94), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 33/151 (21%), Positives = 68/151 (45%), Gaps = 6/151 (3%)
Query: 41 GTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEM 100
G LLL+ + +G + DV+T ++ H + + F+ GR G + +
Sbjct: 246 GHLLLS-ASHDGTVKIWDVMTHRKCLRTYMGHTKAVRDICFSNDGRRFLSAGFDRVIQLW 304
Query: 101 KSEMGEVIREFKASEKPISSSAFLCEEK--IFALASSEVRILSLE-NGEEV-LKFSDDVG 156
+E G+VIR F + P ++K IF + +IL + N E+ L++ + +G
Sbjct: 305 DTETGKVIRSFTNRKTPFCVKFHPSDDKQNIFLAGCANKKILQYDTNSSEITLQYEEHLG 364
Query: 157 PLQYVSASDGAKIIITAGYGEKHLQVWRCDI 187
+ ++ +G + ++ +K + +W I
Sbjct: 365 SINTITFIEGGRRFVSTA-DDKKIFLWEFGI 394
>gi|159029360|emb|CAO90736.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
Length = 1078
Score = 40.8 bits (94), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 50/200 (25%), Positives = 83/200 (41%), Gaps = 26/200 (13%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
++A+ +G I +D +++ +T + +AF + + G NG ++ S+
Sbjct: 812 IIAIANKDGQITLLDSQGKKIREFTTKMR--SIYSIAFHPDSNQIAITGRNGKV-QIWSQ 868
Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFS---DDVGPL 158
G +++EF AS+ PI S AF E +SE V+ L N L S DD
Sbjct: 869 KGTMLQEFTASQVPIYSLAFNGEGTAIITGTSEGKVQYWHLNNHRPQLINSWTVDDSIIY 928
Query: 159 QYVSASDGAKIIITAGYGEKHLQVWRCD---ISSKTVNKGPALSMRHSPVAIDCKNSPNG 215
V + D KI TA G+ +++W + + P + SP
Sbjct: 929 DLVFSPDHQKIA-TATRGK--IKIWDLQGNLLKEIKTDSFPVYGVSFSP----------- 974
Query: 216 EDGTVILAVAESGVAYSWDL 235
DG I A++ G A WD+
Sbjct: 975 -DGEKIAAISRDGTARRWDI 993
>gi|431911947|gb|ELK14091.1| WD repeat-containing protein 43 [Pteropus alecto]
Length = 680
Score = 40.8 bits (94), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 48/220 (21%), Positives = 88/220 (40%), Gaps = 19/220 (8%)
Query: 34 KKRRKE-----RGTLLLALGTSNGDILAVDVLTGEMKWK-STGRHPGGLAGLAFAKKGRS 87
KKR+ E T LLALGT+ G IL + GE+ K +G H + + + +
Sbjct: 81 KKRKSEPIGTSNQTDLLALGTAVGSILLYSTVKGELHSKLISGGHDNKVNCIQWHQDNGC 140
Query: 88 LHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEE 147
L+ + E ++ +V ++K +SS + K+ A +++ LE E
Sbjct: 141 LYSCSDDKHIVEWNTQTCKVKCKWKGDSSGVSSLCISPDGKMLLSAGRTIKLWVLETKEL 200
Query: 148 VLKFSDDVGPLQYVSAS-----------DGAK--IIITAGYGEKHLQVWRCDISSKTVNK 194
F+ P+ + + DG ++ ++ L VW+ +K N
Sbjct: 201 YRHFTGHATPVSSLMFTTIRPPNESQPFDGITGLYFLSGAVHDRLLNVWQVRSENKEKNA 260
Query: 195 GPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWD 234
+ ++ PV ID S N E+ + V G + ++
Sbjct: 261 VMSFTITDEPVYIDLTLSENKEEPVKLAVVCRDGQVHLFE 300
>gi|425445434|ref|ZP_18825464.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9443]
gi|389734586|emb|CCI01787.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9443]
Length = 1107
Score = 40.8 bits (94), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 50/200 (25%), Positives = 82/200 (41%), Gaps = 26/200 (13%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
++A+ +G I +D +++ +T + +AF G + + G NG ++ S+
Sbjct: 842 IIAIANKDGQITLLDSQGKKIREFTTKMR--SIYSIAFHPDGNQMGITGRNGKV-QIWSK 898
Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFS---DDVGPL 158
G +++EF AS+ PI S AF E +SE V+ L N L S DD
Sbjct: 899 KGTMLQEFTASQVPIYSLAFNGEGTAIITGTSEGKVQYWHLSNHRPQLINSWTADDNIIY 958
Query: 159 QYVSASDGAKIIITAGYGEKHLQVWRCD---ISSKTVNKGPALSMRHSPVAIDCKNSPNG 215
V + D KI TA G+ +++W + P + SP
Sbjct: 959 DLVFSPDHQKIA-TAARGK--IKIWDLQGNLFEEIKTDSFPVYGVSFSP----------- 1004
Query: 216 EDGTVILAVAESGVAYSWDL 235
DG I ++ G A WD+
Sbjct: 1005 -DGEKIATISRDGTARRWDI 1023
>gi|345565400|gb|EGX48350.1| hypothetical protein AOL_s00080g320 [Arthrobotrys oligospora ATCC
24927]
Length = 365
Score = 40.8 bits (94), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 44/189 (23%), Positives = 85/189 (44%), Gaps = 20/189 (10%)
Query: 63 EMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSA 122
+K+ TG H GG++ + F+ G+ + V + S GE+ + F+A +S A
Sbjct: 66 RLKYTLTG-HNGGVSSVKFSPDGKWIASVSADKSLRVWDSRTGELEQIFEAHTAGVSDVA 124
Query: 123 FLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHL 180
+ + K A S + +R+ L++G + + ++ + +I++ Y E +
Sbjct: 125 WSPDSKTLATGSDDKTIRLWELKSGRMIRILKGHHNYVYCLNFNPQGNMIVSGSYDEA-V 183
Query: 181 QVWRCDISSKTVNKGPALSMRHSPVA-IDCKNSPNGEDGTVILAVAESGVAYSWD----- 234
++W DI S K L PV+ +D DGT+I++ + + WD
Sbjct: 184 RIW--DIRSGNCQK--TLPAHQDPVSGVDFI-----RDGTMIVSCSHDKLIRIWDTNTGQ 234
Query: 235 -LKTVSQDE 242
LKT+ ++E
Sbjct: 235 CLKTLVEEE 243
>gi|430743609|ref|YP_007202738.1| WD40 repeat-containing protein [Singulisphaera acidiphila DSM
18658]
gi|430015329|gb|AGA27043.1| WD40 repeat-containing protein [Singulisphaera acidiphila DSM
18658]
Length = 360
Score = 40.4 bits (93), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 36/140 (25%), Positives = 64/140 (45%), Gaps = 15/140 (10%)
Query: 75 GLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALAS 134
G+ LAFA GR+L V +G S G F+A + S AF +++ A S
Sbjct: 198 GVTSLAFASDGRTLAVGTVDGTVRLWDSAQGRQFPRFRAHGSEVISLAFAEADRMLASCS 257
Query: 135 SEVRILSLENGE----EVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVW------- 183
R++ L + + +LK D +Q + + G++ + TAG+ + +++W
Sbjct: 258 HNERMVRLWDVDVDAGRLLKEFDGRSAIQAFAFAPGSRELATAGF-DGAVRLWDVATGDS 316
Query: 184 RCDISSKTVNKGPALSMRHS 203
R D+S+ GP ++ S
Sbjct: 317 RWDLSA---GDGPVTALDFS 333
>gi|295676345|ref|YP_003604869.1| hypothetical protein BC1002_1278 [Burkholderia sp. CCGE1002]
gi|295436188|gb|ADG15358.1| WD40 repeat, subgroup [Burkholderia sp. CCGE1002]
Length = 1445
Score = 40.4 bits (93), Expect = 1.1, Method: Composition-based stats.
Identities = 42/186 (22%), Positives = 77/186 (41%), Gaps = 20/186 (10%)
Query: 58 DVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKP 117
D TG+ + G H G + AF+ GR + + A + G+ I + +
Sbjct: 894 DAATGKQIVQLNG-HQGPVFSAAFSPDGRRVVSASADRTARVWDAATGQAIAQLIGHREL 952
Query: 118 ISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGY 175
+SS+AF + + AS + R+ NG+ + + + GP+ + S + ++TA
Sbjct: 953 VSSAAFSPDGRRVVSASDDKTARVWDAANGQVITQLTGHQGPVFSAAFSPDGRRVVTAS- 1011
Query: 176 GEKHLQVWRCD----ISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAY 231
+K +VW I+ T ++GP S +P DG ++ ++ A
Sbjct: 1012 DDKTARVWDAATGHVITQLTGHQGPVSSAAFTP------------DGLRVVTASDDKTAR 1059
Query: 232 SWDLKT 237
WD T
Sbjct: 1060 VWDAAT 1065
Score = 40.0 bits (92), Expect = 1.7, Method: Composition-based stats.
Identities = 35/145 (24%), Positives = 65/145 (44%), Gaps = 6/145 (4%)
Query: 58 DVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKP 117
D TG+ +G H G + AF+ GR + +G A + G+ I F ++
Sbjct: 1270 DAATGKQILVLSGHH-GTVFSAAFSPDGRRVVTAAADGTARVWDAATGKQIARFGGHQRA 1328
Query: 118 ISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGY 175
+SS+AF + + AS++ R+ G + + + GP+ + S + ++TA
Sbjct: 1329 VSSAAFSPDGQRVVTASADQTARVWDAATGRVIAQLAGHRGPVSSAAFSPDGQRVVTAS- 1387
Query: 176 GEKHLQVWRCDISSKTVNKGPALSM 200
++ +VW I N+G L+M
Sbjct: 1388 ADQTARVW--PIRWLMQNRGRGLAM 1410
Score = 37.4 bits (85), Expect = 9.7, Method: Composition-based stats.
Identities = 42/188 (22%), Positives = 74/188 (39%), Gaps = 20/188 (10%)
Query: 58 DVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKP 117
D G++ + TG H G + AF+ GR + + A + G VI + + P
Sbjct: 978 DAANGQVITQLTG-HQGPVFSAAFSPDGRRVVTASDDKTARVWDAATGHVITQLTGHQGP 1036
Query: 118 ISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGY 175
+SS+AF + AS + R+ G+ + + GP+ S + ++TA
Sbjct: 1037 VSSAAFTPDGLRVVTASDDKTARVWDAATGQMIAQLIGHEGPVNVAVFSLDGQRVLTASR 1096
Query: 176 GEKHLQVWRCDISSKTVNKGPA---LSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYS 232
D +++ + G LS PV ++ G DG ++ + A
Sbjct: 1097 ----------DGTARAWDAGQGILLLSGHQEPVV----SAAFGPDGQRVVTASRDRTARV 1142
Query: 233 WDLKTVSQ 240
WD+ T Q
Sbjct: 1143 WDVATGRQ 1150
>gi|302851563|ref|XP_002957305.1| hypothetical protein VOLCADRAFT_98340 [Volvox carteri f. nagariensis]
gi|300257400|gb|EFJ41649.1| hypothetical protein VOLCADRAFT_98340 [Volvox carteri f. nagariensis]
Length = 2174
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 33/118 (27%), Positives = 54/118 (45%), Gaps = 2/118 (1%)
Query: 58 DVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKP 117
D+ TG+M K H G+ F G V + A+ + + GEVI+ F E
Sbjct: 1138 DLNTGKMLRKYDEIHNEGIYDAWFTPDGLRCVSVSCDKTAAIFELQSGEVIQRFVGHEHW 1197
Query: 118 ISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITA 173
+ +A +EKI A AS++ VRI ++ G + D G + Y++ I+ +A
Sbjct: 1198 VRFAALSPDEKILATASNDNTVRIWNVAKGNVLHIIRDHRGTMSYINFVHPTWIVTSA 1255
>gi|196009674|ref|XP_002114702.1| hypothetical protein TRIADDRAFT_58457 [Trichoplax adhaerens]
gi|190582764|gb|EDV22836.1| hypothetical protein TRIADDRAFT_58457 [Trichoplax adhaerens]
Length = 1089
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 27/101 (26%), Positives = 47/101 (46%), Gaps = 4/101 (3%)
Query: 117 PISSSAFLCEEKIFALA---SSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITA 173
PI++SA C K A++ S+ V++ +L N ++V + Q V + K II+A
Sbjct: 785 PITASALSCNSKYLAVSVRGSNAVQVWNLSNQKQVAFYRGHTDYTQAVHFAPNGKTIISA 844
Query: 174 GYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPN 214
+K ++ W C I K + M+ P +C+ N
Sbjct: 845 S-DDKTVKFWSCQIDLKNTVHVRSKRMQFKPTPFECRFVDN 884
>gi|345782155|ref|XP_532920.3| PREDICTED: WD repeat-containing protein 43 [Canis lupus familiaris]
Length = 681
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 43/205 (20%), Positives = 83/205 (40%), Gaps = 14/205 (6%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWK-STGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
LLALGT+ G IL + GE+ K +G H + + + + L+ + E +
Sbjct: 96 LLALGTAVGSILLYSTVKGELHSKLISGGHDNKVNCIQWHQDNGCLYSCSDDKYIVEWNT 155
Query: 103 EMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVS 162
+ +V ++K +SS + K+ A +++ LE E F+ P+ ++
Sbjct: 156 QTCKVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLETKEIYRHFTGHATPVSSLT 215
Query: 163 AS-----------DGAK--IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDC 209
+ DG ++ ++ L VW+ +K N + ++ PV ID
Sbjct: 216 FTTIRPPNESQPFDGITGLYFLSGAVHDRLLNVWQVRSENKEKNAVMSFTVTDEPVYIDL 275
Query: 210 KNSPNGEDGTVILAVAESGVAYSWD 234
S N E+ + V G + ++
Sbjct: 276 ALSENKEEPVKLAVVCRDGEVHLFE 300
>gi|350421517|ref|XP_003492868.1| PREDICTED: WD repeat-containing protein 43-like [Bombus impatiens]
Length = 629
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 53/253 (20%), Positives = 111/253 (43%), Gaps = 41/253 (16%)
Query: 34 KKRRK--------ERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKK- 84
KKRRK ++G ++A+G++NG + D T + + G H G + + +++
Sbjct: 70 KKRRKKSISEESNQKG--IVAMGSTNGRVTLYDTTTSLVTAQLNG-HSGTVTAVTWSENV 126
Query: 85 -----GRSLHVVG----TNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASS 135
H++ NG+ + KS G+ + + + + L E++
Sbjct: 127 GLFSAASDHHIIQWNLQENGVRCKWKSGKGKTVSLAVSPD----GKSLLSGERV------ 176
Query: 136 EVRILSLENGEEVLKFS---DDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTV 192
++ +L+ + V F+ ++V L V + G +I+ G+ +L VW D
Sbjct: 177 -IKWWNLDTKQLVRTFTGHANEVSCLHTVQMTSGNNYVISGACGDGYLNVWALDEQRNER 235
Query: 193 NKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQDEKTNPAKITVK 252
+L+++ P+++ S N + ++L V SG A+ L + ++ P K ++
Sbjct: 236 TAVASLALQDDPISVSTHISENFQ--VIVLVVTRSGQAH---LFQYQPNGRSKPLKPSLN 290
Query: 253 LKKA-DADQQNSV 264
+ A D Q++SV
Sbjct: 291 IAVASDISQKDSV 303
>gi|166365791|ref|YP_001658064.1| hypothetical protein MAE_30500 [Microcystis aeruginosa NIES-843]
gi|166088164|dbj|BAG02872.1| hypothetical protein MAE_30500 [Microcystis aeruginosa NIES-843]
Length = 438
Score = 40.4 bits (93), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 57/106 (53%), Gaps = 6/106 (5%)
Query: 79 LAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALAS-SEV 137
LAF+ G+ L G++ + GE+I+ ++A E+ I S A + A AS +E+
Sbjct: 264 LAFSNDGQYLISGGSDKTIKIWDIKTGEIIKSWQAHEQAIISIAINPHRHLIASASRTEI 323
Query: 138 RILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVW 183
+I + G E++K PL++ + DG +IT YG+K +++W
Sbjct: 324 KIWQGQTG-ELIKILRGTAPLKF--SPDG-HFLITGSYGDK-VKIW 364
>gi|148745071|gb|AAI42562.1| LOC100101293 protein [Xenopus laevis]
Length = 655
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 54/254 (21%), Positives = 103/254 (40%), Gaps = 28/254 (11%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACG--------FVGKKRRKERGTL-----LLAL 47
+W T G L E+ P + +CLA KKR+ E LL +
Sbjct: 33 VWDTQGGGLRKEYV-PSAHLSATCTCLAWAQGRADKETHQKKKRKSEAADRDGQYDLLTI 91
Query: 48 GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
GT+ G IL ++ GE++ K G H + + + + SL+ + E ++ +V
Sbjct: 92 GTATGTILLYSIVKGELQSKLVGGHDSRVNCVRWHHENGSLYSCSEDKHIIEWNTQTCKV 151
Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKF---SDDVGPLQYV--- 161
++K +SS + K+ A +++ LE E +F S V L ++
Sbjct: 152 KCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWDLETKEVYRQFTGHSTAVTSLIFLTVQ 211
Query: 162 ------SASDGAKI-IITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPN 214
S D A + +++ ++ + VW+ S+K + + ++ PV +D + +
Sbjct: 212 PPRESRSIQDTAGLYFLSSAVHDRLVSVWQVR-SAKDKSSVLSFTLTEPPVFMDLSTTES 270
Query: 215 GEDGTVILAVAESG 228
E+ + V G
Sbjct: 271 KEEPLKLAVVCRDG 284
>gi|158313622|ref|YP_001506130.1| WD-40 repeat-containing protein [Frankia sp. EAN1pec]
gi|158109027|gb|ABW11224.1| WD-40 repeat protein [Frankia sp. EAN1pec]
Length = 1427
Score = 40.4 bits (93), Expect = 1.3, Method: Composition-based stats.
Identities = 48/183 (26%), Positives = 71/183 (38%), Gaps = 13/183 (7%)
Query: 5 NDGSLLAEWKQPDGEPVVSYSCLACGFVGKKRR---KERGTLLLALGTSNGDILAVDVLT 61
+DGSL W G P LA G + R G ++A G +G I D LT
Sbjct: 959 DDGSLRT-WDATTGTPR-----LAVPVTGGRLRCCATGPGGAVVATGGEDGTIRLHDPLT 1012
Query: 62 GEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSS 121
GE+ + G H G + LAF G L G +G + G +P+ +
Sbjct: 1013 GEILRRLAG-HAGPVLALAFGPDGSWLVSAGEDGTLRRWDTAAGRQTGVLSDGSRPVRAC 1071
Query: 122 AFLCE-EKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHL 180
A + + A A + + L G + + + G V A DG+ I YG +
Sbjct: 1072 AVAPDGSYLVAPAGDAISVRDLPTGGQRAELTGATGTRACVVAPDGSWIASAGRYGT--I 1129
Query: 181 QVW 183
+VW
Sbjct: 1130 RVW 1132
>gi|332663664|ref|YP_004446452.1| metallophosphoesterase [Haliscomenobacter hydrossis DSM 1100]
gi|332332478|gb|AEE49579.1| metallophosphoesterase [Haliscomenobacter hydrossis DSM 1100]
Length = 628
Score = 40.4 bits (93), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 34/124 (27%), Positives = 65/124 (52%), Gaps = 7/124 (5%)
Query: 50 SNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIR 109
S G ++A+DV TG+ KW T R G + K R + + T+ + + + G+++
Sbjct: 326 SAGQLVALDVKTGKQKWTYTTR--GKIFSTPALGKDRVV-LASTDSVVYCLALQTGKLLW 382
Query: 110 EFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGA 167
+ K + K I +S + E I+ L SSE R L+L++G+E+ F+ G ++ + +D
Sbjct: 383 KLK-TNKSIVASPLIDGETIY-LGSSEGKFRALTLKDGKELWSFNGVAGFVESIPCADAE 440
Query: 168 KIII 171
++
Sbjct: 441 RVYF 444
>gi|425470863|ref|ZP_18849723.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
gi|389883367|emb|CCI36241.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
Length = 1108
Score = 40.0 bits (92), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 50/200 (25%), Positives = 82/200 (41%), Gaps = 26/200 (13%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
++A+ +G I +D ++ +T + +AF + + G NG ++ S+
Sbjct: 842 IIAIANKDGQITLLDSQGKNIREFATKMR--SIYSIAFHPDDNQIAITGRNGKV-QIWSQ 898
Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFS---DDVGPL 158
G +++EF AS+ PI S AF E +SE V+ L N L S DD
Sbjct: 899 KGTMLQEFTASQAPIYSLAFNGEGTAIITGTSEGKVQYWHLSNHRTKLINSWTVDDSIIY 958
Query: 159 QYVSASDGAKIIITAGYGEKHLQVWRCD---ISSKTVNKGPALSMRHSPVAIDCKNSPNG 215
V + D KI TA G+ +++W + + P + SP
Sbjct: 959 DLVFSPDHQKIA-TATRGK--IKIWDLQGNLLKEIKTDSFPVYGVSFSP----------- 1004
Query: 216 EDGTVILAVAESGVAYSWDL 235
DG I A++ G A WD+
Sbjct: 1005 -DGEKIAAISRDGTARRWDM 1023
>gi|148270092|ref|YP_001244552.1| WD-40 repeat-containing protein [Thermotoga petrophila RKU-1]
gi|147735636|gb|ABQ46976.1| WD-40 repeat protein [Thermotoga petrophila RKU-1]
Length = 580
Score = 40.0 bits (92), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 52/221 (23%), Positives = 95/221 (42%), Gaps = 15/221 (6%)
Query: 13 WKQPDGEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRH 72
W PDG+ + S L V + R +G +NG L D+ H
Sbjct: 131 WSFPDGKEI-SSQRLGPSVVKIAAHENR----YVMGLANGLALISDISNPSFSIPLKA-H 184
Query: 73 PGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEK-IFA 131
P G+ + F++ G+ + G N + S G ++ ++ ++ AFL + IF
Sbjct: 185 PEGIKKIVFSENGQLIATCGGNTVKVWNASN-GRLVLQYDHV-ITVNDVAFLDNDNLIFV 242
Query: 132 LASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYG-EKHLQVWRCDISSK 190
+ +L+++ E V + +VS+ DG+ A YG +K ++VW D+ +
Sbjct: 243 TDDYKAVVLNIQKNEVVKSIDAHNNFVLFVSSDDGS----IATYGMDKTVKVWNADLELQ 298
Query: 191 TVNKGPALSMRHSPVAIDCKNSPNG-EDGTVILAVAESGVA 230
G LS+ ++ D K +G +D +++ AE G+A
Sbjct: 299 YSLYGHQLSVNTVALSSDGKFIVSGSDDREILIWNAEKGIA 339
>gi|149175385|ref|ZP_01854006.1| vegetatible incompatibility protein HET-E1 [Planctomyces maris DSM
8797]
gi|148845653|gb|EDL59995.1| vegetatible incompatibility protein HET-E1 [Planctomyces maris DSM
8797]
Length = 1097
Score = 40.0 bits (92), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 46/216 (21%), Positives = 92/216 (42%), Gaps = 23/216 (10%)
Query: 44 LLALG----TSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASE 99
LLA+G + +G+I D TG++ K H + G+ F++ G+SL +
Sbjct: 885 LLAVGGGDPSRSGEITIWDAATGKLVKKLGDAHSDTVLGIRFSRSGKSLLTGAADKFVKI 944
Query: 100 MKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEE---VLKFSDD 154
G ++ F+ + A+ +E + ++ +++ ++E GE+ + +S
Sbjct: 945 FDVASGNFVKSFEGHTHHVLDVAWKADESTIVSSGADNVIKVWNIETGEQKRTISGYSKQ 1004
Query: 155 VGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPN 214
V + ++ D I +G G+K + R +++ N R+ + D S
Sbjct: 1005 VTSIAFLGLGDN----IVSGGGDKTV---RMHLTTNGSN------YRNLSGSTDYVYSVA 1051
Query: 215 G-EDGTVILAVAESGVAYSWDLKTVSQDEKTNPAKI 249
G D ++++A E G+ WD KT NP +
Sbjct: 1052 GSRDESIVIAGGEDGILRVWDAKTGKLLNSFNPPPV 1087
>gi|410955542|ref|XP_003984410.1| PREDICTED: WD repeat-containing protein 43 [Felis catus]
Length = 680
Score = 40.0 bits (92), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 43/205 (20%), Positives = 83/205 (40%), Gaps = 14/205 (6%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWK-STGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
LLALGT+ G IL + GE+ K +G H + + + + L+ + E +
Sbjct: 96 LLALGTAVGSILLYSTVKGELHSKLISGGHDNKVNCIQWHQDNGCLYSCSDDKHIVEWNT 155
Query: 103 EMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVS 162
+ +V ++K +SS + K+ A +++ LE E F+ P+ +
Sbjct: 156 QTCKVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLETKEIYRHFTGHATPVSSLV 215
Query: 163 AS-----------DGAK--IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDC 209
+ DG ++ ++ L VW+ +K N + ++ PV ID
Sbjct: 216 FTTVRPPSESQPFDGVTGLYFLSGAVHDRLLNVWQVRSENKEKNAVMSFTVTDEPVYIDL 275
Query: 210 KNSPNGEDGTVILAVAESGVAYSWD 234
S N E+ + V + G + ++
Sbjct: 276 TLSENKEEPVKLAVVCKDGQVHLFE 300
>gi|307152332|ref|YP_003887716.1| WD40 repeat-containing protein [Cyanothece sp. PCC 7822]
gi|306982560|gb|ADN14441.1| WD40 repeat, subgroup [Cyanothece sp. PCC 7822]
Length = 380
Score = 40.0 bits (92), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 47/202 (23%), Positives = 86/202 (42%), Gaps = 18/202 (8%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVS--YSCLACGFVGKKRRKER----GTLLLALGTS-NGD 53
IW + G ++A PDGE +VS + + +R R G + AL S +G
Sbjct: 162 IWLEHQGQVMALRVTPDGEILVSGGLDGIRIWTLNPRRPLYRLTGLGHPVYALAISPDGV 221
Query: 54 ILAVDVLTGEMK-WK--------STGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEM 104
ILA L GE+K W + H + GL F G+ L +
Sbjct: 222 ILASGSLDGEVKFWNIKEGKLLSTFYPHQATITGLVFTPDGKKLITSSQDKTIKVWDLAT 281
Query: 105 GEVIREFKASEKPISSSAFLCEEKIFALASSE-VRILSLENGEEVLKFSDDVGPLQYVSA 163
G++I I + A + KI A ++ +RI ++E GE+ + ++ +Q ++
Sbjct: 282 GQLIYTLAGHTGRIRAIALNPDGKILASGGNDGIRIWNIETGEQYNQIIENYDWVQSLAF 341
Query: 164 SDGAKIIITAGYGEKHLQVWRC 185
S + + + + + +++W+C
Sbjct: 342 SPDGQFLASGSF-DFQVKIWQC 362
>gi|334119173|ref|ZP_08493260.1| WD40 repeat-containing protein [Microcoleus vaginatus FGP-2]
gi|333458644|gb|EGK87261.1| WD40 repeat-containing protein [Microcoleus vaginatus FGP-2]
Length = 547
Score = 40.0 bits (92), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 37/143 (25%), Positives = 68/143 (47%), Gaps = 6/143 (4%)
Query: 45 LALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEM 104
LA G+ +G I ++ +G+ +G++ L+ LAF G+SL +NG +
Sbjct: 406 LASGSKDGSIRLWNLASGQAIRTISGKNLSVLS-LAFTPDGKSLAAGNSNGTIGLWNAGN 464
Query: 105 GEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVS 162
G++IR + S AF + S + VR+ + +G+ S G + V+
Sbjct: 465 GQLIRRLSGHTDGVWSVAFSRDGTTLVTGSWDKSVRLWDVRSGDLRGNLSGHAGYVSAVA 524
Query: 163 ASDGAKIIITAGY-GEKHLQVWR 184
S K I++AG+ GE +++W+
Sbjct: 525 ISSDGKTIVSAGWLGE--IKIWK 545
>gi|428215135|ref|YP_007088279.1| WD40 repeat-containing protein [Oscillatoria acuminata PCC 6304]
gi|428003516|gb|AFY84359.1| WD40 repeat-containing protein [Oscillatoria acuminata PCC 6304]
Length = 306
Score = 40.0 bits (92), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 34/143 (23%), Positives = 64/143 (44%), Gaps = 4/143 (2%)
Query: 45 LALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEM 104
L G+ + I +V GE +++ H + + F+ G L + K++
Sbjct: 159 LVSGSWDRTIKLWNVAIGE-SYRTIQAHSNPIESVKFSPDGEMLASSSLDSTVKLWKTQT 217
Query: 105 GEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVS 162
GE+I I S AF + + A ASS+ ++I ++E GEE+ D + ++
Sbjct: 218 GELIHTLTGHTDGIRSVAFSPDGRYLASASSDKTIKIWAVETGEELATLGDHSSYVFAIA 277
Query: 163 ASDGAKIIITAGYGEKHLQVWRC 185
S + + T G +K +++WR
Sbjct: 278 FSPDGQTLATGG-DDKTIKLWRA 299
>gi|310817435|ref|YP_003949793.1| wd domain g-beta repeat/pbs lyase heat-like repeat protein
[Stigmatella aurantiaca DW4/3-1]
gi|309390507|gb|ADO67966.1| WD domain G-beta repeat/PBS lyase HEAT-like repeat protein
[Stigmatella aurantiaca DW4/3-1]
Length = 2185
Score = 40.0 bits (92), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 42/156 (26%), Positives = 64/156 (41%), Gaps = 18/156 (11%)
Query: 62 GEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSS 121
G + + G HPGG LA G+ L+ VG +G G ++E++AS +P+ +
Sbjct: 82 GSQELELAGAHPGGCTALALGLNGQRLYSVGMDGALRAWALASGSKLQEWQASAQPLRAV 141
Query: 122 AFLCEE------------KIFALASSEVRILSLENGE-EVLKFSDDVGPLQYVSASDGAK 168
A + F LAS R + +G L F+ G + VSA D +
Sbjct: 142 AVAPNNATVACAGDDAVVRSFTLASRAQRDMPGHDGPVRALAFTPRDG--RLVSAGDDGR 199
Query: 169 IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSP 204
I I G + R D S + GP L++ P
Sbjct: 200 IRIWYLAGAVEFET-RGDKDSG--HAGPVLALLFPP 232
>gi|157413501|ref|YP_001484367.1| WD-40 repeat-containing G-protein [Prochlorococcus marinus str. MIT
9215]
gi|157388076|gb|ABV50781.1| G-protein beta WD-40 repeats [Prochlorococcus marinus str. MIT
9215]
Length = 357
Score = 40.0 bits (92), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 46/188 (24%), Positives = 77/188 (40%), Gaps = 17/188 (9%)
Query: 22 VSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAF 81
VS +ACG+ K ++ +G G I A + G++ W+ H GGL +A
Sbjct: 21 VSDYAIACGWALKGKQ-------FIVGDVAGGIFAFEGDNGKIIWEKENTHSGGLLAMAI 73
Query: 82 AKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILS 141
+G G +G G+VI+ + + + + A+ASS+ +
Sbjct: 74 HPEGEIFATSGQDGNVQICNCHEGKVIKTLDLGKGWVEHLKWSNDGLFLAIASSKKVYVF 133
Query: 142 LENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVN-----KGP 196
E GEE D + ++ S+ + + TA YG +V DI + N +G
Sbjct: 134 NEVGEEKWTSEDHPSTVSAITWSNKNE-LATACYG----RVTFFDIVNNKTNQKLEWQGS 188
Query: 197 ALSMRHSP 204
+SM SP
Sbjct: 189 LVSMELSP 196
>gi|124022824|ref|YP_001017131.1| hypothetical protein P9303_11171 [Prochlorococcus marinus str. MIT
9303]
gi|123963110|gb|ABM77866.1| G-protein beta WD-40 repeats [Prochlorococcus marinus str. MIT
9303]
Length = 357
Score = 40.0 bits (92), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 53/229 (23%), Positives = 91/229 (39%), Gaps = 30/229 (13%)
Query: 26 CLACGFVGKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKG 85
+ACG+V G LL + G L + LTG++ WK H GL +A
Sbjct: 25 AMACGWV------HNGEALLVADVTGGLYL-FNGLTGQINWKINDIHQAGLLSVAIHPSK 77
Query: 86 RSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENG 145
G NG + SE +V++ + + + + + A++ S + +G
Sbjct: 78 AVFATAGQNGCLTIWNSEKYDVVKSIELGGGWVEHLHWSPDGQYLAVSMSRYAYVFDIDG 137
Query: 146 EEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVN-----KGPALSM 200
EE + S+ + ++ S ++ + TA YG QV D++ +N +G +SM
Sbjct: 138 EEKWRSSEHTSTVSAIAWSSSSE-LATASYG----QVTFFDVTQGAINQKLEWQGSLVSM 192
Query: 201 RHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQDEKTN-PAK 248
SP DG V+ ++ + W T + E T P K
Sbjct: 193 VLSP------------DGDVVACGSQDNSVHFWRRSTGNDAEMTGYPGK 229
>gi|452847062|gb|EME48994.1| hypothetical protein DOTSEDRAFT_67892 [Dothistroma septosporum
NZE10]
Length = 446
Score = 40.0 bits (92), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 47/111 (42%), Gaps = 3/111 (2%)
Query: 63 EMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSA 122
+ W+ T RH G LA++ GR L+ GT+G+ SE G V + P SS
Sbjct: 77 DTAWR-TKRHKGSCRCLAYSVDGRQLYSAGTDGIVKAADSETGRVTGKIAIPVDPSSSRD 135
Query: 123 FLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITA 173
+ + S + +LS ++G L D P V SD +++
Sbjct: 136 GVDAPTVVHALSPQTLLLSTDSG--ALHLYDLRDPAPSVPVSDSKTAFVSS 184
>gi|428212952|ref|YP_007086096.1| WD40 repeat-containing protein [Oscillatoria acuminata PCC 6304]
gi|428001333|gb|AFY82176.1| WD40 repeat-containing protein [Oscillatoria acuminata PCC 6304]
Length = 795
Score = 40.0 bits (92), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 29/121 (23%), Positives = 59/121 (48%), Gaps = 9/121 (7%)
Query: 71 RHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIF 130
R+ + + F G+ + T G + E GE +R F A + I + A +++I
Sbjct: 595 RNSEAVYAVIFLNNGQQIIASDTRGSVAFWHRETGEELRRFNAHQGMIRALAISPDDRIL 654
Query: 131 ALASSE--VRILSLENGEEVLKF---SDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRC 185
A AS E +++ L+ G+E+ F +D V + + + DG ++ +G + L++W+
Sbjct: 655 ATASDEGIIKLWQLQTGQEICVFKTHNDAVNAIAF--SPDGQ--LLASGSTDMTLKLWQV 710
Query: 186 D 186
+
Sbjct: 711 N 711
>gi|425437527|ref|ZP_18817942.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9432]
gi|389677472|emb|CCH93583.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9432]
Length = 1108
Score = 40.0 bits (92), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 50/200 (25%), Positives = 83/200 (41%), Gaps = 26/200 (13%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
++A+ +G I +D +++ +T + +AF + + G NG ++ S+
Sbjct: 842 IIAIANKDGQITLLDSQGKKIREFTTKMR--SIYSIAFHPDSNQIAITGRNGKV-QIWSQ 898
Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFS---DDVGPL 158
G +++EF AS+ PI S AF E +SE V+ L N L S DD
Sbjct: 899 KGTMLQEFTASQVPIYSLAFNGEGTAIITGTSEGKVQYWHLSNYRPQLINSWTADDNIIY 958
Query: 159 QYVSASDGAKIIITAGYGEKHLQVWRCD---ISSKTVNKGPALSMRHSPVAIDCKNSPNG 215
V + D KI TA G+ +++W + + P + SP
Sbjct: 959 DLVFSPDHQKIA-TATRGK--IKIWDLQGNLLKEIKTDSFPVYGVSFSP----------- 1004
Query: 216 EDGTVILAVAESGVAYSWDL 235
DG I A++ G A WD+
Sbjct: 1005 -DGEKIAAISRDGTARRWDI 1023
>gi|256422452|ref|YP_003123105.1| hypothetical protein Cpin_3437 [Chitinophaga pinensis DSM 2588]
gi|256037360|gb|ACU60904.1| WD-40 repeat protein [Chitinophaga pinensis DSM 2588]
Length = 1160
Score = 40.0 bits (92), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 45/193 (23%), Positives = 76/193 (39%), Gaps = 18/193 (9%)
Query: 45 LALGTSNGDILAVDVLTGEM--KWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
LA+ T+ GD+ DV TG++ W G P + F + + L + A + +
Sbjct: 250 LAVVTTEGDVQLWDVYTGKLIADWPDVGLRP---TSIEFVQHDKQLLLSTKFEAAILLDA 306
Query: 103 EMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVS 162
G++++ F A S +EK +A ++ I ++ + V Y S
Sbjct: 307 RSGKILQSFDADLVRFSPD----QEKYAVVAQAQATIRNVNDSTSYTSLRGHVSWSVYAS 362
Query: 163 ASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVIL 222
SD A I YG+ L+VW + P L + + I + DG I
Sbjct: 363 -SDTANKHIAVCYGDGSLRVWSL------ITGMPELLLADTAAGI--TRTVISPDGRTIA 413
Query: 223 AVAESGVAYSWDL 235
++ G +WDL
Sbjct: 414 GSSQYGTISAWDL 426
>gi|301776542|ref|XP_002923691.1| PREDICTED: WD repeat-containing protein 43-like [Ailuropoda
melanoleuca]
Length = 679
Score = 40.0 bits (92), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 43/205 (20%), Positives = 81/205 (39%), Gaps = 14/205 (6%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWK-STGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
LLALGT+ G IL + GE+ K +G H + + + + L+ + E +
Sbjct: 96 LLALGTAVGSILLYSTVKGELHSKLISGGHDNKVNCIQWHQDNGCLYSCSDDKHIVEWNT 155
Query: 103 EMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYV- 161
+ +V ++K +SS + K+ A +++ LE E F+ P+ +
Sbjct: 156 QTCKVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLETKEIYRHFTGHATPVSSLI 215
Query: 162 ----------SASDGAK--IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDC 209
DG ++ ++ L VW+ +K N + ++ PV ID
Sbjct: 216 FTTIRPPNESQPFDGITGLYFLSGAVHDRLLNVWQVRAENKEKNAVMSFTVTDEPVYIDL 275
Query: 210 KNSPNGEDGTVILAVAESGVAYSWD 234
S N E+ + V G + ++
Sbjct: 276 TLSENKEEPVKLAVVCRDGQVHLFE 300
>gi|115379662|ref|ZP_01466743.1| conserved hypothetical protein [Stigmatella aurantiaca DW4/3-1]
gi|115363327|gb|EAU62481.1| conserved hypothetical protein [Stigmatella aurantiaca DW4/3-1]
Length = 1558
Score = 40.0 bits (92), Expect = 1.7, Method: Composition-based stats.
Identities = 43/156 (27%), Positives = 64/156 (41%), Gaps = 18/156 (11%)
Query: 62 GEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSS 121
G + + G HPGG LA G+ L+ VG +G G ++E++AS +P+ +
Sbjct: 82 GSQELELAGAHPGGCTALALGLNGQRLYSVGMDGALRAWALASGSKLQEWQASAQPLRAV 141
Query: 122 AFLCEE------------KIFALASSEVRILSLENGE-EVLKFSDDVGPLQYVSASDGAK 168
A + F LAS R + +G L F+ G L VSA D +
Sbjct: 142 AVAPNNATVACAGDDAVVRSFTLASRAQRDMPXHDGPVRALAFTPRDGRL--VSAGDDGR 199
Query: 169 IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSP 204
I I G + R D S + GP L++ P
Sbjct: 200 IRIWYLAGAVEFET-RGDKDSG--HAGPVLALLFPP 232
>gi|425450672|ref|ZP_18830496.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
7941]
gi|389768356|emb|CCI06492.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
7941]
Length = 559
Score = 40.0 bits (92), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 57/106 (53%), Gaps = 6/106 (5%)
Query: 79 LAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALAS-SEV 137
LAF+ G+ L G++ + GE+I+ ++A E+ I S A + A AS +E+
Sbjct: 385 LAFSNDGQYLISGGSDKTIKIWDIKTGEIIKSWQAHEQAIISIAINPHRHLIASASRTEI 444
Query: 138 RILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVW 183
+I + G E++K PL++ + DG + +IT YG K +++W
Sbjct: 445 KIWQGQTG-ELIKVLRGTAPLKF--SPDG-QFLITGSYGHK-VKIW 485
>gi|425439391|ref|ZP_18819716.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
9717]
gi|389720381|emb|CCH95904.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
9717]
Length = 559
Score = 40.0 bits (92), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 58/106 (54%), Gaps = 6/106 (5%)
Query: 79 LAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALAS-SEV 137
LAF+ G+ L G++ + GE+I+ ++A E+ I S A + A AS +E+
Sbjct: 385 LAFSNDGQYLISGGSDKTIKIWDIKTGEIIKSWQAHEQAIISIAINPHRHLIASASRTEI 444
Query: 138 RILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVW 183
+I + G E++K PL++ + DG + +IT YG+K +++W
Sbjct: 445 KIWQGQTG-ELIKVLRGTAPLKF--SPDG-QFLITCSYGDK-VKIW 485
>gi|443652967|ref|ZP_21130939.1| WD domain, G-beta repeat family protein [Microcystis aeruginosa
DIANCHI905]
gi|159028403|emb|CAO89845.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|443334193|gb|ELS48718.1| WD domain, G-beta repeat family protein [Microcystis aeruginosa
DIANCHI905]
Length = 1201
Score = 40.0 bits (92), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 71/147 (48%), Gaps = 7/147 (4%)
Query: 45 LALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMAS--EMKS 102
LA G+ +G I DV TG+ + ++ H G + ++F+ G++L G +
Sbjct: 669 LATGSDDGTIKLWDVETGQ-EIRTLSGHNGKVNSVSFSSDGKTLAFDSDGGTIKLWYIDI 727
Query: 103 EMGEVIREFKASEKP-ISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQ 159
E G+ IR + + S +F + K A S++ +++ ++E GEE+ S G +
Sbjct: 728 ETGKEIRTLSEWNRGCVYSVSFSNDGKTLATGSADKTIKLWNVETGEEIRTLSGHNGKVN 787
Query: 160 YVSASDGAKIIITAGYGEKHLQVWRCD 186
VS S K + T G +K +++W +
Sbjct: 788 SVSFSSDGKTLAT-GSADKTIKLWNVE 813
>gi|425466442|ref|ZP_18845740.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
9809]
gi|389831075|emb|CCI26486.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
9809]
Length = 559
Score = 39.7 bits (91), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 57/106 (53%), Gaps = 6/106 (5%)
Query: 79 LAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALAS-SEV 137
LAF+ G+ L G++ + GE+I+ ++A E+ I S A + A AS +E+
Sbjct: 385 LAFSNDGQYLISGGSDKTIKIWDIKTGEIIKSWQAHEQAIISLAINPHRHLIASASRTEI 444
Query: 138 RILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVW 183
+I + G E++K PL++ + DG +IT YG+K +++W
Sbjct: 445 KIWQGQTG-ELIKILRGTAPLKF--SPDG-HFLITGSYGDK-VKIW 485
>gi|432096779|gb|ELK27357.1| WD repeat-containing protein 43 [Myotis davidii]
Length = 678
Score = 39.7 bits (91), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 42/205 (20%), Positives = 82/205 (40%), Gaps = 14/205 (6%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWK-STGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
LLALGT+ G IL + GE+ K +G H + + + + L+ + E +
Sbjct: 97 LLALGTAVGSILLYSTVKGELHSKLVSGGHDNKVNCIQWHQDNGCLYSCSDDKHIVEWNT 156
Query: 103 EMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVS 162
+ +V ++K +SS + K+ A +++ LE E F+ P+ +
Sbjct: 157 QTCKVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLETKELYRHFTGHATPVSSLM 216
Query: 163 AS-----------DGAK--IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDC 209
+ DG ++ ++ L VW+ +K N + ++ P+ ID
Sbjct: 217 FTTIRPPNESQPFDGISGLYFLSGAVHDRLLNVWQVRSENKQKNAVMSFTVTDEPIYIDL 276
Query: 210 KNSPNGEDGTVILAVAESGVAYSWD 234
S N E+ + V G + ++
Sbjct: 277 TLSENKEEPVKLAVVCRDGQVHLFE 301
>gi|395828782|ref|XP_003787544.1| PREDICTED: WD repeat-containing protein 43 [Otolemur garnettii]
Length = 680
Score = 39.7 bits (91), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 44/205 (21%), Positives = 83/205 (40%), Gaps = 14/205 (6%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWK-STGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
LLALGT+ G IL + GE+ K +G H + L + + L+ + E
Sbjct: 96 LLALGTAVGSILLYSTVKGELHSKLISGGHDNRVNCLQWHQDSGCLYSCSDDKHIVEWNV 155
Query: 103 EMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGP---LQ 159
+ +V ++K +SS + K+ A +++ LE E F+ P L+
Sbjct: 156 QTCKVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLETKEVYRHFTGHATPVSSLR 215
Query: 160 YVS--------ASDGAK--IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDC 209
+ + DG ++ ++ + VW+ +K N + ++ PV ID
Sbjct: 216 FTTIRPPNESQPCDGITGLYFLSGAVHDRLINVWQVRSENKEKNAVMSFTVTDEPVCIDL 275
Query: 210 KNSPNGEDGTVILAVAESGVAYSWD 234
S N E+ + V + G + ++
Sbjct: 276 TLSENKEEPVKLAVVCKDGQVHLFE 300
>gi|114763302|ref|ZP_01442726.1| WD domain/cytochrome c family protein [Pelagibaca bermudensis
HTCC2601]
gi|114544100|gb|EAU47110.1| WD domain/cytochrome c family protein [Roseovarius sp. HTCC2601]
Length = 436
Score = 39.7 bits (91), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 46/111 (41%), Gaps = 18/111 (16%)
Query: 138 RILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQV-----WRCDISSKTV 192
R+L L+ GE++ + D P+ + A G + G GE + + WR +
Sbjct: 216 RVLDLDTGEQIADLTLDRRPILAL-AHHGPTNQLAVGDGEGFIMIVDTRDWRITRDFRAT 274
Query: 193 NKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQDEK 243
KGP ++ SP DG VI A VAY+W + + Q E
Sbjct: 275 RKGPVWALAFSP------------DGAVIHAGGLDDVAYAWPVAMLDQFEP 313
>gi|425460391|ref|ZP_18839872.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
9808]
gi|389826904|emb|CCI22215.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
9808]
Length = 559
Score = 39.7 bits (91), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 57/106 (53%), Gaps = 6/106 (5%)
Query: 79 LAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALAS-SEV 137
LAF+ G+ L G++ + GE+I+ ++A E+ I S A + A AS +E+
Sbjct: 385 LAFSNDGQYLISGGSDKTIKIWDIKTGEIIKSWQAHEQAIISIAINPHRHLIASASRTEI 444
Query: 138 RILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVW 183
+I + G E++K PL++ + DG + +IT YG K +++W
Sbjct: 445 KIWQGQTG-ELIKVLRGTAPLKF--SPDG-QFLITGSYGHK-VKIW 485
>gi|440755325|ref|ZP_20934527.1| dnaJ domain protein [Microcystis aeruginosa TAIHU98]
gi|440175531|gb|ELP54900.1| dnaJ domain protein [Microcystis aeruginosa TAIHU98]
Length = 559
Score = 39.7 bits (91), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 57/106 (53%), Gaps = 6/106 (5%)
Query: 79 LAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALAS-SEV 137
LAF+ G+ L G++ + GE+I+ ++A E+ I S A + A AS +E+
Sbjct: 385 LAFSNDGQYLISGGSDKTIKIWDIKTGEIIKSWQAHEQAIISIAINPHRHLIASASRTEI 444
Query: 138 RILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVW 183
+I + G E++K PL++ + DG + +IT YG K +++W
Sbjct: 445 KIWQGQTG-ELIKVLRGTAPLKF--SPDG-QFLITGSYGHK-VKIW 485
>gi|451980751|ref|ZP_21929137.1| exported hypothetical protein [Nitrospina gracilis 3/211]
gi|451762087|emb|CCQ90376.1| exported hypothetical protein [Nitrospina gracilis 3/211]
Length = 734
Score = 39.7 bits (91), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 42/198 (21%), Positives = 81/198 (40%), Gaps = 20/198 (10%)
Query: 47 LGTSNGDILAV-DVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMG 105
L T+ GD L + D TG + + RH G L AF+ G+S+ G +G + G
Sbjct: 202 LVTAQGDALKLWDASTG-TRLRVFSRHNGKLFAAAFSPDGKSIASAGQDGTVRLFSTATG 260
Query: 106 EVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVSA 163
E++ K + +++ AF E A ++ VR+ +G + F + VS
Sbjct: 261 ELLYALKGHNEKVNAVAFSPEGAHLLSAGTDNTVRLWKTNDGTLLHTFEGHTKEVTSVSF 320
Query: 164 SDGAKIIITAGYGEKHLQVW---------------RCDISSKTVNKGPALSMRHSPVAID 208
S + +++ G ++ +++W + + + KT P +
Sbjct: 321 SPDNRFVVS-GSADQTVRLWDLAPHIAPTVVAEAPKSETTPKTETSAPESGPKSQTPTPP 379
Query: 209 CKNSPNGEDGTVILAVAE 226
+ SP+ E T+++ E
Sbjct: 380 VEESPDTEPPTIVITSHE 397
Score = 38.1 bits (87), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 52/206 (25%), Positives = 92/206 (44%), Gaps = 27/206 (13%)
Query: 41 GTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEM 100
G +++ G +G ++ D TG+ +WK+ H G + +A + GRSL G +
Sbjct: 64 GKFIVSAG-GDGALILWDARTGD-RWKTLSGHNGAVNAIAISPDGRSLATGGADTRIKVW 121
Query: 101 KSEMGEVIREFKASEKPISSSAFLCE-EKIFALASSEVRIL-SLENGEEVLKFSD--DVG 156
+ G +R ++ AF + ++ + E IL + G+ + F+D D G
Sbjct: 122 DIQSGNEVRSVPGHFDEVTGVAFFPDGTRLISSGLGESVILWDIRTGQPLRVFADQNDSG 181
Query: 157 -------PLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDC 209
P++ V+AS K ++TA G+ L++W D S+ T + + RH+
Sbjct: 182 SEFVALEPVRSVAASPSGKTLVTA-QGDA-LKLW--DASTGTRLR---VFSRHNGKLFAA 234
Query: 210 KNSPN-------GEDGTVILAVAESG 228
SP+ G+DGTV L +G
Sbjct: 235 AFSPDGKSIASAGQDGTVRLFSTATG 260
>gi|355728913|gb|AES09699.1| WD repeat domain 43 [Mustela putorius furo]
Length = 678
Score = 39.7 bits (91), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 42/205 (20%), Positives = 81/205 (39%), Gaps = 14/205 (6%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWK-STGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
LLA+GT+ G IL + GE+ K +G H + + + + L+ + E +
Sbjct: 93 LLAIGTAVGSILLYSTVKGELHSKLISGGHDNKVNCIQWHQDNGCLYSCSDDKYIVEWNT 152
Query: 103 EMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQY-- 160
+ +V ++K +SS + K+ A +++ LE E F+ P+
Sbjct: 153 QTCKVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLETKEIYRHFTGHATPVSTLI 212
Query: 161 ---------VSASDGAK--IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDC 209
+ DG ++ ++ L VW+ +K N + ++ PV ID
Sbjct: 213 FTTIRPPNEIQPFDGITGLYFLSGAVHDRLLNVWQVRSENKEKNAVMSFTVTDEPVYIDL 272
Query: 210 KNSPNGEDGTVILAVAESGVAYSWD 234
S N E+ + V G + ++
Sbjct: 273 TLSENKEEPVKLAVVCRDGQVHLFE 297
>gi|390594263|gb|EIN03676.1| WD40 repeat-like protein, partial [Punctularia strigosozonata
HHB-11173 SS5]
Length = 312
Score = 39.7 bits (91), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 48/187 (25%), Positives = 82/187 (43%), Gaps = 18/187 (9%)
Query: 58 DVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIRE-FKASEK 116
DV TG+ + H + +AF+ G + +G + G+ I E F+
Sbjct: 124 DVQTGQQIGQPLEGHTDWVYSVAFSPDGIRIVSRSRDGTLRLWDAHTGQAIGESFRGHSN 183
Query: 117 PISSSAFLCEEKIFALASSE--VRILSLENGEEV---LKFSDDVGPLQYVSAS-DGAKII 170
++S+AF + K A SS+ +R+L E G+ V L+ D G + V+ S DGA+I+
Sbjct: 184 WVNSAAFSPDGKHIASGSSDDTIRLLDAETGQPVGDPLQGHD--GWVWSVAYSPDGARIV 241
Query: 171 ITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVA 230
+G + +++W V GP + D + DG I++ +E G
Sbjct: 242 --SGSVDNTIRIWNAQTRQTVV--GPLQGHKK-----DVNSVAFSPDGKYIVSGSEDGTM 292
Query: 231 YSWDLKT 237
WD +T
Sbjct: 293 RIWDAQT 299
>gi|335775362|gb|AEH58546.1| WD repeat-containing protein 43-like protein [Equus caballus]
Length = 550
Score = 39.7 bits (91), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 43/205 (20%), Positives = 82/205 (40%), Gaps = 14/205 (6%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWK-STGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
LLALGT+ G IL + GE+ K +G H + + + + L+ + E +
Sbjct: 28 LLALGTAVGSILLYSTVKGELHSKLISGGHDNKVNCIQWHQDNGYLYSCSDDKHIVEWNT 87
Query: 103 EMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYV- 161
+ +V ++K +SS + K+ A +++ LE E F+ P+ +
Sbjct: 88 QTCKVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLETKEIYRHFTGHATPVSSLL 147
Query: 162 ----------SASDGAK--IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDC 209
DG ++ ++ L VW+ +K N + ++ PV ID
Sbjct: 148 FTTIRPPNESQPFDGITGLYFLSGAVHDRLLNVWQVRSENKEKNAVMSFTVTDEPVYIDL 207
Query: 210 KNSPNGEDGTVILAVAESGVAYSWD 234
S N E+ + V + G + ++
Sbjct: 208 TLSENKEEPVKLAVVCKDGQVHLFE 232
>gi|281345819|gb|EFB21403.1| hypothetical protein PANDA_012871 [Ailuropoda melanoleuca]
Length = 605
Score = 39.7 bits (91), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 43/205 (20%), Positives = 81/205 (39%), Gaps = 14/205 (6%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWK-STGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
LLALGT+ G IL + GE+ K +G H + + + + L+ + E +
Sbjct: 118 LLALGTAVGSILLYSTVKGELHSKLISGGHDNKVNCIQWHQDNGCLYSCSDDKHIVEWNT 177
Query: 103 EMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYV- 161
+ +V ++K +SS + K+ A +++ LE E F+ P+ +
Sbjct: 178 QTCKVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLETKEIYRHFTGHATPVSSLI 237
Query: 162 ----------SASDGAK--IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDC 209
DG ++ ++ L VW+ +K N + ++ PV ID
Sbjct: 238 FTTIRPPNESQPFDGITGLYFLSGAVHDRLLNVWQVRAENKEKNAVMSFTVTDEPVYIDL 297
Query: 210 KNSPNGEDGTVILAVAESGVAYSWD 234
S N E+ + V G + ++
Sbjct: 298 TLSENKEEPVKLAVVCRDGQVHLFE 322
>gi|148231147|ref|NP_001085834.1| apoptotic peptidase activating factor 1 [Xenopus laevis]
gi|49115497|gb|AAH73405.1| MGC80868 protein [Xenopus laevis]
Length = 1248
Score = 39.3 bits (90), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 37/182 (20%), Positives = 79/182 (43%), Gaps = 13/182 (7%)
Query: 72 HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFA 131
H + F+ GR G + +SE GE + E +A + + AF +EK+ A
Sbjct: 614 HKDAVFHACFSPDGRKFASCGADKTLQVFRSETGEKLLELEAHDDEVLCCAFSADEKLLA 673
Query: 132 LASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDG--AKIIITAGYGEKHLQVWRCDI 187
S++ V+I + + G+ + + + + ++G A ++ T + + +W
Sbjct: 674 TCSADRKVKIWNAKTGKPIRVYEEHTEQVNCCQFTNGLSAPLLATCS-NDCFIMLW---- 728
Query: 188 SSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQDEKTNPA 247
S+T L + H+ C+ SP D + + + G WD+++ ++++ A
Sbjct: 729 DSETEYSRNTL-IGHTGAVYHCRYSP---DDRYLASSSMDGSLKIWDVESANEEKSIEVA 784
Query: 248 KI 249
K+
Sbjct: 785 KL 786
>gi|149173262|ref|ZP_01851893.1| WD40-repeat containing protein [Planctomyces maris DSM 8797]
gi|148848068|gb|EDL62400.1| WD40-repeat containing protein [Planctomyces maris DSM 8797]
Length = 1705
Score = 39.3 bits (90), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 41/174 (23%), Positives = 78/174 (44%), Gaps = 10/174 (5%)
Query: 61 TGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISS 120
TG+ + K +H G + + ++ G+ + +G + +E G ++R S + S
Sbjct: 1482 TGQAEKKLEQKHNGYVNTVRYSPDGKRILTSSEDGTSKIWNAESGAMLRSLDQSGTHVKS 1541
Query: 121 SAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVSAS-DGAKIIITAGYGE 177
+ F + AS + + + E G+++ F P++ V+ S DG ++I +G +
Sbjct: 1542 AIFSPDGSQIVTASDDKTLVMWDAETGKKIKTFKGHEWPVREVAYSHDGKRLI--SGSED 1599
Query: 178 KHLQVWRCDISSKTVNKG---PALSMRHSPVAIDCKNSPNGEDGTVILAVAESG 228
+W D + KTV G P S+ SP D + +DGT L ++G
Sbjct: 1600 NTAIIWDIDTAKKTVLSGHTAPVASVVFSP--DDSRAFTASDDGTAKLWDTDTG 1651
Score = 37.4 bits (85), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 39/146 (26%), Positives = 70/146 (47%), Gaps = 9/146 (6%)
Query: 64 MKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSA- 122
+K + +H + ++F+ G+ L + A G +EF ++SSA
Sbjct: 1094 LKGRVLAQHVDAILDVSFSHDGKQLVTASRDKTAISWDVSTGNPNKEFTEGHAFLASSAV 1153
Query: 123 FLCEEKIFALAS--SEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHL 180
FL + K A A+ + VRI ++ G E +F + G + S +K+++T G EK +
Sbjct: 1154 FLPDGKRLATAAVDNSVRIWDIQTGTEHKRF-EHTGRSAAIDVSFDSKLLVT-GSDEKTV 1211
Query: 181 QVWRCDISSKTVNKGPALSMRHSPVA 206
++W DI++ + K LS HS V+
Sbjct: 1212 RIW--DIATGELLK--ELSGHHSEVS 1233
>gi|57530361|ref|NP_001006396.1| WD repeat-containing protein 43 [Gallus gallus]
gi|53127376|emb|CAG31071.1| hypothetical protein RCJMB04_2a19 [Gallus gallus]
Length = 661
Score = 39.3 bits (90), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 49/234 (20%), Positives = 86/234 (36%), Gaps = 22/234 (9%)
Query: 13 WKQPDGEPVVSYSCLACGFVGKKRRKERGTL-----LLALGTSNGDILAVDVLTGEMKWK 67
W P G P G KKR+ E + +LA+GT+ G IL + GE++ K
Sbjct: 65 WAPPGGRPPPDKD----GPQRKKRKSEAAEVDKQLDILAIGTAVGSILLYSTVKGELQSK 120
Query: 68 STGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEE 127
G H + + + + L+ + E ++ +V ++K +SS +
Sbjct: 121 LDGGHDSRINCVRWHQDNCCLYSCSEDKHIVEWNTQTCKVKCKWKGDNSSVSSLCISPDG 180
Query: 128 KIFALASSEVRILSLENGEEVLKFS-----------DDVGPLQYVSASDGAK--IIITAG 174
K+ A +++ LE E F+ V P+ DG ++
Sbjct: 181 KMLLSAGRTIKLWDLETKEVYRHFTGHATSVSSLMFTTVKPMNENKPFDGITGLYFLSGA 240
Query: 175 YGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESG 228
++ L VW+ K N + ++ P ID S E+ + V G
Sbjct: 241 IHDRLLSVWQIRSDRKEKNAVMSFTVTDEPTFIDLTVSELKEEPVKLAVVCRDG 294
>gi|281410841|gb|ADA68831.1| NWDp3 [Podospora anserina]
Length = 210
Score = 39.3 bits (90), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 39/169 (23%), Positives = 78/169 (46%), Gaps = 13/169 (7%)
Query: 72 HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFA 131
H G + +AF+ G + +G +++ G+ +R+ + + S AF +
Sbjct: 46 HSGSVRSVAFSPDGSRIVSASDDGTIRIWEAKSGKEVRKLEGHSNWVRSVAFSPDSSRIV 105
Query: 132 LASSE--VRILSLENGEEVLKFSDDVGPLQYVSAS-DGAKIIITAGYGEKHLQVWRCDIS 188
AS + +RI ++G+EV K G ++ V+ S DG++I+ + ++ +++W S
Sbjct: 106 SASDDGTIRIWEAKSGKEVRKLEGHSGSVRSVAFSPDGSRIV--SASNDQTIRIWEAK-S 162
Query: 189 SKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKT 237
K V K HS + + SP DG+ I++ + W+ K+
Sbjct: 163 GKEVRKLEG----HSGLVLSVAFSP---DGSRIVSASNDQTIRIWEAKS 204
>gi|254410842|ref|ZP_05024620.1| hypothetical protein MC7420_320 [Coleofasciculus chthonoplastes PCC
7420]
gi|196182197|gb|EDX77183.1| hypothetical protein MC7420_320 [Coleofasciculus chthonoplastes PCC
7420]
Length = 1312
Score = 39.3 bits (90), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 43/197 (21%), Positives = 78/197 (39%), Gaps = 39/197 (19%)
Query: 1 IWSTNDGSLLAEWK-----------QPDGEPVVSYSCLACGFVGKKRRKERGTLLLALGT 49
+W T G LL ++ PDG+ +VS S L L
Sbjct: 1049 LWDTTSGKLLHTFRGHEASVSAVAFSPDGQTIVSGSTDTT---------------LKLWD 1093
Query: 50 SNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIR 109
++G++L D G HPGG+ +AF+ G+ + +G + G+++
Sbjct: 1094 TSGNLL--DTFRG---------HPGGVTAVAFSPDGKRIVSGSGDGTLKLWDTTSGKLLH 1142
Query: 110 EFKASEKPISSSAFLCEEKIFALASSEVRI-LSLENGEEVLKFSDDVGPLQYVSASDGAK 168
F+ E +S+ AF + + S++ + L +G + F + V+ S K
Sbjct: 1143 TFRGHEASVSAVAFSPDGQTIVSGSTDTTLKLWDTSGNLLDTFRGHEDAVDAVAFSPDGK 1202
Query: 169 IIITAGYGEKHLQVWRC 185
II+ Y + ++WR
Sbjct: 1203 RIISGSY-DNTFKLWRA 1218
Score = 38.1 bits (87), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 30/127 (23%), Positives = 61/127 (48%), Gaps = 11/127 (8%)
Query: 66 WKSTGR-------HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPI 118
W ++G+ HPGG+ +AF+ G+ + +G + G+++ F+ E +
Sbjct: 1009 WDTSGKLLHTFRGHPGGVTAVAFSPDGKRIVSGSGDGTLKLWDTTSGKLLHTFRGHEASV 1068
Query: 119 SSSAFLCEEKIFALASSEVRI-LSLENGEEVLKFSDDVGPLQYVSAS-DGAKIIITAGYG 176
S+ AF + + S++ + L +G + F G + V+ S DG +I+ +G G
Sbjct: 1069 SAVAFSPDGQTIVSGSTDTTLKLWDTSGNLLDTFRGHPGGVTAVAFSPDGKRIV--SGSG 1126
Query: 177 EKHLQVW 183
+ L++W
Sbjct: 1127 DGTLKLW 1133
>gi|254409388|ref|ZP_05023169.1| YD repeat protein [Coleofasciculus chthonoplastes PCC 7420]
gi|196183385|gb|EDX78368.1| YD repeat protein [Coleofasciculus chthonoplastes PCC 7420]
Length = 1162
Score = 39.3 bits (90), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 50/196 (25%), Positives = 96/196 (48%), Gaps = 18/196 (9%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
++A +S+G + D+ G+++ + H GG+A +AF+ G+ + ++G+A +
Sbjct: 907 MIATASSDGTVRLWDI-QGKLQ-RRLPNHSGGVAQVAFSPDGQLIATASSDGIA-RLWDI 963
Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYV 161
G ++++ + + S AF + A ASS+ VR+ L+ G + G ++ V
Sbjct: 964 QGNLLQDLIGHQGWVRSLAFSPDGTQIATASSDRTVRLWDLQ-GNLRQELKGHQGWVKSV 1022
Query: 162 SASDGAKIIITAGYGEKHLQVWRCDIS-SKTVNKGPALSMRHSPVAIDCKNSPNGEDGTV 220
+ S I TA + +++W D + K +N+ P+ + H +A SP DGT
Sbjct: 1023 AFSPNGDYIATASI-DGIVRLWDTDGNLVKELNQHPS-GITH--IAF----SP---DGTR 1071
Query: 221 ILAVAESGVAYSWDLK 236
I + G+A WDL+
Sbjct: 1072 IATASFEGIARLWDLQ 1087
>gi|195359262|ref|XP_002045331.1| GM24417 [Drosophila sechellia]
gi|194127361|gb|EDW49404.1| GM24417 [Drosophila sechellia]
Length = 787
Score = 39.3 bits (90), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 59/279 (21%), Positives = 109/279 (39%), Gaps = 23/279 (8%)
Query: 62 GEMKWKSTGRHPGGLAGLAFAKKG-RSLHVVGTNGMASEMKSEMGEV-IREFKASEKPIS 119
G++ W G H L G A + ++ V+ + G+ S + + E I+
Sbjct: 23 GDIAWSPNGHHLYCLNGSAVNQVDVQTSQVIKSYGLTSPLADNKKPAELDNIDVEEDTIT 82
Query: 120 SSAFLCEEKIFALASSEVRILSLENGEEV-LKFSDDVGPLQYVSASDGAKIIITAGYGEK 178
E + A S +R+ L+ G+ V L + GP+ V S ++I T+G +
Sbjct: 83 CFGLSQEHLVTAHRSGLLRLWELDTGKLVKLWKAQHKGPVIRVEFSPCGRLICTSGGADA 142
Query: 179 HLQVWRCDISSKTV------NKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYS 232
L++W D S+ + GPA + P + + G D +V A++
Sbjct: 143 TLRLW--DYSNNSCLGALKDFPGPAWLLVFHPNVLRKEIYAGGSDNSV--------YAWN 192
Query: 233 WDLKTVSQDEKTNPAKIT-VKLKKADADQQNSVNVKKSRTSIIAARLNDLEADGQKTITG 291
++ KT+ + + +++T + K +D V V + + I+ LE+ K +
Sbjct: 193 YETKTLVHKMRGHLSQVTGLSFKSNASDCNEIVTVSRDKVMIVWQLQEQLESKQLKVLPL 252
Query: 292 LHDLESEAATASAQN---EKKSKKRAASDPDLETTRDVV 327
+LE Q S K DP RD++
Sbjct: 253 YDELEGVVYADEGQQIIVASGSGKLQQVDPKSWKIRDIL 291
>gi|123968669|ref|YP_001009527.1| hypothetical protein A9601_11361 [Prochlorococcus marinus str.
AS9601]
gi|123198779|gb|ABM70420.1| G-protein beta WD-40 repeats [Prochlorococcus marinus str. AS9601]
Length = 357
Score = 39.3 bits (90), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 44/183 (24%), Positives = 76/183 (41%), Gaps = 17/183 (9%)
Query: 27 LACGFVGKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGR 86
+ACG+ K ++ +G G + + + TG++ WK H GGL +A +G
Sbjct: 26 IACGWALKGKQ-------FIVGDVAGGLFSFEGDTGKIIWKKENTHSGGLLAMAIHPEGE 78
Query: 87 SLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGE 146
G +G G+VI+ + + + + A+ASS+ + E GE
Sbjct: 79 IFATSGQDGNVQICNCHEGKVIKTLDLGKGWVEHLKWSNDGLFLAIASSKKVYVFNEIGE 138
Query: 147 EVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVN-----KGPALSMR 201
E D + ++ S+ + + TA YG +V DI + N +G +SM
Sbjct: 139 EKWISEDHPSTVSAITWSNKNE-LATACYG----RVTFFDIVNNKTNQKLEWQGSLVSME 193
Query: 202 HSP 204
SP
Sbjct: 194 LSP 196
>gi|427725983|ref|YP_007073260.1| YD repeat protein [Leptolyngbya sp. PCC 7376]
gi|427357703|gb|AFY40426.1| YD repeat protein [Leptolyngbya sp. PCC 7376]
Length = 1808
Score = 39.3 bits (90), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 49/173 (28%), Positives = 69/173 (39%), Gaps = 24/173 (13%)
Query: 72 HPG--GLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKI 129
HP G+ +A++ + G +G +E GE IR + E PI F K
Sbjct: 1190 HPSAQGINAVAYSPDESMVATGGADGNIRLWSAE-GESIRTLEDHEAPIYEMEFSPNGK- 1247
Query: 130 FALASSE---VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCD 186
F L+ SE R+ E GE + F D + VS S ++II TA
Sbjct: 1248 FLLSGSEDFTARLWDPETGELLRTFEDHDNSIYGVSFSPDSQIIATASV----------- 1296
Query: 187 ISSKTVNKGPALSMRHSPVAIDCKN---SPNGEDGTVILAVAESGVAYSWDLK 236
TVN + ID +N S N DG+ I +E G+ WDL+
Sbjct: 1297 --DGTVNIYSVEGQLLQTLEIDLENYDVSFNA-DGSAIATASEDGILRFWDLE 1346
>gi|390594203|gb|EIN03616.1| hypothetical protein PUNSTDRAFT_139337 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 1087
Score = 38.9 bits (89), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 48/204 (23%), Positives = 86/204 (42%), Gaps = 26/204 (12%)
Query: 45 LALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEM 104
LA +++G + DV TG+ + H + +AF+ G + + +
Sbjct: 853 LASASTDGTVRLWDVETGQRIGQPLEEHTNWVCCVAFSPDGNRIVSGSVDRTLRLWDAHT 912
Query: 105 GEVIRE-FKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYV 161
G+ I E F+ + S AF + K A SS+ +R+ E GE V + PLQ
Sbjct: 913 GQAIGEPFRGHSDYVQSVAFSPDGKHIASGSSDSTIRLWDAETGEPVGE------PLQGH 966
Query: 162 SAS--------DGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSP 213
++S DG +I+ +G +K +++W D ++ GP + D +
Sbjct: 967 NSSVFSVAYSPDGTRIV--SGSYDKTIRIW--DTQTRQTVVGPLQGHKK-----DVNSVA 1017
Query: 214 NGEDGTVILAVAESGVAYSWDLKT 237
DG +++ +E G WD +T
Sbjct: 1018 FSPDGKHVVSGSEDGTMRIWDTQT 1041
>gi|193215005|ref|YP_001996204.1| WD40 domain-containing protein [Chloroherpeton thalassium ATCC
35110]
gi|193088482|gb|ACF13757.1| WD-40 repeat protein [Chloroherpeton thalassium ATCC 35110]
Length = 671
Score = 38.9 bits (89), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 40/153 (26%), Positives = 70/153 (45%), Gaps = 7/153 (4%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
LLA G ++G + D E + S H + +AF+ GR L G + +A +
Sbjct: 455 LLASGGNDGWVKTWDT-RKESEIDSFHEHEDAVTSVAFSSDGRFLASAGNDKIAVLWNAG 513
Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENG-EEVLKFSDDVGPLQY 160
G+ +P++ AF K A S + +++ +LE G EE+ VG + +
Sbjct: 514 TGKKKHTLVGHSRPVTCVAFSPNAKFLATGSWDRSIKLWNLETGLEEICLAGHPVG-IDF 572
Query: 161 VSASDGAKIIITAGYGE-KHLQVWRC-DISSKT 191
++ S K++I +GY + L + R DI K+
Sbjct: 573 IAFSPNGKMMIASGYNRVRKLSIMRLWDIEKKS 605
>gi|406834758|ref|ZP_11094352.1| serine/threonine protein kinase-related protein [Schlesneria
paludicola DSM 18645]
Length = 1244
Score = 38.9 bits (89), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 35/131 (26%), Positives = 57/131 (43%), Gaps = 5/131 (3%)
Query: 45 LALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEM 104
A G+++G + DV TG W+S+ H + +AF+ G + G N + S +
Sbjct: 950 FATGSTDGFVRLWDVSTGRALWESS--HGSEINSVAFSPNGALVAAAGENRITSIWDTRT 1007
Query: 105 GEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVS 162
G+ ++ E + S AF + +S + R+ SLE G + G L S
Sbjct: 1008 GQRLKSLVGHENEVMSVAFSPNGRSVLTSSHDGTARVWSLETGGQQFLLPHR-GALLEAS 1066
Query: 163 ASDGAKIIITA 173
S K+I TA
Sbjct: 1067 FSPDGKLIATA 1077
>gi|290976462|ref|XP_002670959.1| predicted protein [Naegleria gruberi]
gi|284084523|gb|EFC38215.1| predicted protein [Naegleria gruberi]
Length = 623
Score = 38.9 bits (89), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 42/184 (22%), Positives = 74/184 (40%), Gaps = 13/184 (7%)
Query: 27 LACGFVGKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGR 86
++C F+ K+ ++LL G S+G I AVD TGE W H + + + +
Sbjct: 381 VSCSFI-----KDDISILLT-GWSDGSIRAVDCETGEQLWTLANAHRNAVTKIDTSFNNK 434
Query: 87 SLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASS---EVRILSLE 143
G G + +++ FK ++ L E+ + +SS E+ L
Sbjct: 435 FFVSAGQEGDVRVYAMKTRQLVCSFKEHTTMVTCLQIL-EDDVHVFSSSKDREIFCYDLR 493
Query: 144 NGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHS 203
N V F+ VG + + II+ G ++++ W ISS + + S +
Sbjct: 494 NERRVAAFNMKVGSVNGFVVCKDHETIISIG-SDRYITFWT--ISSDSPTRIVPYSKKFE 550
Query: 204 PVAI 207
P I
Sbjct: 551 PKCI 554
>gi|430746307|ref|YP_007205436.1| WD40 repeat-containing protein [Singulisphaera acidiphila DSM
18658]
gi|430018027|gb|AGA29741.1| WD40 repeat-containing protein [Singulisphaera acidiphila DSM
18658]
Length = 939
Score = 38.9 bits (89), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 40/156 (25%), Positives = 76/156 (48%), Gaps = 14/156 (8%)
Query: 85 GRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCE--EKIFALASSEVRILSL 142
G VG + + + G V++E ASE+P+++ A + + ALA+ +RI +
Sbjct: 287 GSRFATVGPDKVVRVYNAADGAVVKEIPASEEPVAALALKADGSQIAIALANRAIRICNT 346
Query: 143 ENGEEVLKFSDDVGPLQYVS-ASDGAKIIITAGYGEKHLQVWRCDISSKTVNK-GPALSM 200
+G EV K P+ ++ +DGA++ + GE + + D ++ TV K P
Sbjct: 347 ADGTEVKKSEPLPSPVSALAFRADGAQLAVA---GEDTI-IRVLDAAAATVVKEFPG--- 399
Query: 201 RHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLK 236
H + +PN DG ++++ ++ +A WD+K
Sbjct: 400 -HGAKVNALEFAPN--DGNLLISASDDKLAKLWDVK 432
Score = 37.7 bits (86), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 25/115 (21%), Positives = 56/115 (48%), Gaps = 6/115 (5%)
Query: 73 PGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFAL 132
P ++ LAF G L V G + + + + V++EF +++ F + +
Sbjct: 359 PSPVSALAFRADGAQLAVAGEDTIIRVLDAAAATVVKEFPGHGAKVNALEFAPNDGNLLI 418
Query: 133 ASSE---VRILSLENGEEVLKFSDDVGPLQYVSAS-DGAKIIITAGYGEKHLQVW 183
++S+ ++ ++ G+ + F+ PL ++ S DG+K++ G +K ++VW
Sbjct: 419 SASDDKLAKLWDVKEGKLIRDFAGHTEPLLTLNVSRDGSKLVT--GSADKSIKVW 471
>gi|162149625|ref|YP_001192004.2| leucyl-tRNA synthetase [Metallosphaera sedula DSM 5348]
Length = 938
Score = 38.9 bits (89), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 43/161 (26%), Positives = 67/161 (41%), Gaps = 16/161 (9%)
Query: 91 VGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEV------------- 137
VG + +M+ E+GE + F S+ ++A L E IF + V
Sbjct: 183 VGMHDTKGDMEPEIGEYVVIFFESKMGALAAATLRPETIFGAVAVWVNPKATYTVAEIWG 242
Query: 138 -RILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGP 196
+++ E E LKF DV L+ VS SD KI+ K + + D T G
Sbjct: 243 KKVIVSEKAAEKLKFQTDVKVLEKVSGSDLLKIVAINPITGKEIPILPADFVDPTTATGV 302
Query: 197 ALSMR-HSPV-AIDCKNSPNGEDGTVILAVAESGVAYSWDL 235
+S+ H+P K + G + ++AV G A + DL
Sbjct: 303 VMSVPAHAPFDYFYLKKAKVGIEPIPVVAVEGQGDAPAKDL 343
>gi|189082975|sp|A4YI28.1|SYL2_METS5 RecName: Full=Leucine--tRNA ligase 2; AltName: Full=Leucyl-tRNA
synthetase 2; Short=LeuRS 2
gi|145702938|gb|ABP96080.1| leucyl-tRNA synthetase [Metallosphaera sedula DSM 5348]
Length = 950
Score = 38.9 bits (89), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 43/161 (26%), Positives = 67/161 (41%), Gaps = 16/161 (9%)
Query: 91 VGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEV------------- 137
VG + +M+ E+GE + F S+ ++A L E IF + V
Sbjct: 195 VGMHDTKGDMEPEIGEYVVIFFESKMGALAAATLRPETIFGAVAVWVNPKATYTVAEIWG 254
Query: 138 -RILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGP 196
+++ E E LKF DV L+ VS SD KI+ K + + D T G
Sbjct: 255 KKVIVSEKAAEKLKFQTDVKVLEKVSGSDLLKIVAINPITGKEIPILPADFVDPTTATGV 314
Query: 197 ALSMR-HSPV-AIDCKNSPNGEDGTVILAVAESGVAYSWDL 235
+S+ H+P K + G + ++AV G A + DL
Sbjct: 315 VMSVPAHAPFDYFYLKKAKVGIEPIPVVAVEGQGDAPAKDL 355
>gi|428308856|ref|YP_007119833.1| WD40 repeat-containing protein [Microcoleus sp. PCC 7113]
gi|428250468|gb|AFZ16427.1| WD40 repeat-containing protein [Microcoleus sp. PCC 7113]
Length = 1205
Score = 38.9 bits (89), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 28/106 (26%), Positives = 51/106 (48%), Gaps = 7/106 (6%)
Query: 45 LALGTSNGDILAVDVLTGEM--KWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
LA G+ +G + D+ TG+ W++ H G + +AF+ +GR+L G +
Sbjct: 689 LASGSGDGTLRCWDLNTGQCLKMWQA---HLGQVWSVAFSPQGRTLASSGADNTMKLWDV 745
Query: 103 EMGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGE 146
G+ ++ F++ + S AF + KI A ++ VR + GE
Sbjct: 746 STGQCLKTFQSDNNQVQSVAFSPDGKILASGGNDCLVRCWDINTGE 791
>gi|425457085|ref|ZP_18836791.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
gi|389801671|emb|CCI19204.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
Length = 1108
Score = 38.9 bits (89), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 50/199 (25%), Positives = 83/199 (41%), Gaps = 26/199 (13%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
++A+ +G I +D +++ +T + +AF G + + G +G ++ S+
Sbjct: 842 IIAIANKDGQITLLDSQGKKIREFTTKMR--SIYSIAFHPDGNQIAITGRSGKV-QIWSK 898
Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFS---DDVGPL 158
G +++EF AS+ PI S AF E +SE V+ L N L S DD
Sbjct: 899 KGTMLQEFTASQVPIYSLAFNGEGTAIITGTSEGKVQYWHLSNYRPQLINSWTADDNIIY 958
Query: 159 QYVSASDGAKIIITAGYGEKHLQVWRCD---ISSKTVNKGPALSMRHSPVAIDCKNSPNG 215
V + D KI TA G+ +++W + + P + SP
Sbjct: 959 DLVFSPDHQKIA-TATRGK--IKIWDLQGNLLKEIKTDSFPVYGVSFSP----------- 1004
Query: 216 EDGTVILAVAESGVAYSWD 234
DG I A++ G A WD
Sbjct: 1005 -DGEKIAAISRDGTARRWD 1022
>gi|444919762|ref|ZP_21239726.1| High-affnity carbon uptake protein Hat/HatR [Cystobacter fuscus DSM
2262]
gi|444707968|gb|ELW49101.1| High-affnity carbon uptake protein Hat/HatR [Cystobacter fuscus DSM
2262]
Length = 953
Score = 38.9 bits (89), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 44/173 (25%), Positives = 77/173 (44%), Gaps = 20/173 (11%)
Query: 71 RHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIF 130
RH + +AF+ GRS+ +G A GE + + + E+P+++ AF E K
Sbjct: 716 RHADAVNAVAFSPDGRSVATASEDGTARLWSVATGEPLGKPFSHERPVTAVAFSPEGKSL 775
Query: 131 ALASSE--VRILSLENGEEV---LKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRC 185
A AS++ R+ + GE + L+ + L + + DG + + G +++W
Sbjct: 776 ATASTDNTARLWNTATGEPLGSPLRHDALITSLAF--SPDGQSLATASDDGS--VRLW-- 829
Query: 186 DISSKTVNKGPALSMRHSPVAI-DCKNSPNGEDGTVILAVAESGVAYSWDLKT 237
D+++ G S H P A+ SP DG + +E A WD+ T
Sbjct: 830 DVAT-----GSERSRLHHPNAVTSVAFSP---DGKSLATGSEDDSARLWDVAT 874
>gi|395507078|ref|XP_003757855.1| PREDICTED: WD repeat-containing protein 43 [Sarcophilus harrisii]
Length = 688
Score = 38.9 bits (89), Expect = 3.7, Method: Compositional matrix adjust.
Identities = 53/250 (21%), Positives = 103/250 (41%), Gaps = 24/250 (9%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWK-STGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
LLA+GT+ G IL + L GE++ K +G H + + + + L+ + E +
Sbjct: 96 LLAIGTAVGSILLYNTLKGELQSKLVSGGHDNKVNCIQWHQDNDCLYSCSDDKHIVEWNT 155
Query: 103 EMGEVIREFKASEKPISSSAFLC---EEKIFALASSEVRILSLENGEEVLKFSDD----- 154
+ V ++K ISS + LC + K+ A +++ LE E F+
Sbjct: 156 QTCRVKCKWKGD---ISSVSALCISPDGKMLLSAGRTIKLWDLETKEVYRHFTGHATAVS 212
Query: 155 ------VGPLQYVSASDGAK--IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVA 206
+ P + DG ++ ++ L VW+ +K N + ++ PV
Sbjct: 213 SLMFTTLRPPKESQPFDGITGLYFLSGAVHDRLLNVWQVRSENKEKNAVMSFTVTDEPVY 272
Query: 207 IDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQDEKTNPAKITVKLKKADADQQNSVNV 266
ID S N E+ + V G + ++ +K + T+++ + D++++
Sbjct: 273 IDLTLSENKEEPVKLAVVCRDGQVHLFEHILNGYCKKPLTSNCTIQIATSGKDKEST--- 329
Query: 267 KKSRTSIIAA 276
S I+AA
Sbjct: 330 -PSPIPILAA 338
>gi|295838049|ref|ZP_06824982.1| WD-repeat protein [Streptomyces sp. SPB74]
gi|295826815|gb|EDY46831.2| WD-repeat protein [Streptomyces sp. SPB74]
Length = 1631
Score = 38.9 bits (89), Expect = 3.7, Method: Composition-based stats.
Identities = 41/147 (27%), Positives = 64/147 (43%), Gaps = 7/147 (4%)
Query: 41 GTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEM 100
GT L + G S+G + DV TG+ + GR L AF+ G L G+ G
Sbjct: 1288 GTRLASAG-SDGTVRLWDVATGQRTHELLGRG-DRLISAAFSPAGTVLATAGSTGHVYLW 1345
Query: 101 KSEMGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPL 158
+E G +RE + AF + + A A+ + VR+ G L+ G +
Sbjct: 1346 DAENGTFLRELDVETDRTWAEAFSADGEEIATANDDDSVRLWRRATGSHGLRLDGHEGRV 1405
Query: 159 QYVS-ASDGAKIIITAGYGEKHLQVWR 184
+ V+ A DGA I G + +++WR
Sbjct: 1406 RSVAFAKDGAS--IATGCDDGRVRLWR 1430
>gi|381207419|ref|ZP_09914490.1| hypothetical protein SclubJA_17565 [SAR324 cluster bacterium
JCVI-SC AAA005]
Length = 348
Score = 38.9 bits (89), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 34/136 (25%), Positives = 62/136 (45%), Gaps = 7/136 (5%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
L A+G ++G + A D +G+ WK+T H GG+ L+ + G L VG +G +
Sbjct: 36 LFAVGDASGSVHAFDGTSGQEIWKNTEAHVGGIMSLSVSPNGTQLASVGQDGQLTIWNFA 95
Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVL---KFSDDVGPLQY 160
G++ + + + + A+ A++S V +GE++ K V L +
Sbjct: 96 DGQLQHQTEIASGWVEHVAWSKSGNYLAVSSGRVVTNLSSSGEKIWSSEKHPSTVSALGW 155
Query: 161 VSASDGAKIIITAGYG 176
V+ + + TA YG
Sbjct: 156 VTDQE----LSTACYG 167
>gi|193216393|ref|YP_001997592.1| WD40 domain-containing protein [Chloroherpeton thalassium ATCC
35110]
gi|193089870|gb|ACF15145.1| WD-40 repeat protein [Chloroherpeton thalassium ATCC 35110]
Length = 514
Score = 38.9 bits (89), Expect = 3.8, Method: Compositional matrix adjust.
Identities = 35/139 (25%), Positives = 67/139 (48%), Gaps = 8/139 (5%)
Query: 54 ILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMAS--EMKSEMGEVI--R 109
I+ ++ GE+ +K TG H G + +A +K L G++ S ++ S+ I R
Sbjct: 292 IIFWNLRNGEITYKITG-HSGWITSIALNEKNTLLASGGSDETVSLWDIASQPDSAIELR 350
Query: 110 EFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGA 167
F + + S AF + KI A + ++ ++E G E+ + ++ ++ S
Sbjct: 351 RFPGHKDYVRSIAFAPDGKILASGGDDRTLKFWNVETGREIYEIEGHASWVRSIAYSPNG 410
Query: 168 KIIITAGYGEKHLQVWRCD 186
KII TAG +K +++W +
Sbjct: 411 KIIATAG-DDKLIKLWNAE 428
>gi|406833633|ref|ZP_11093227.1| serine/threonine protein kinase-related protein [Schlesneria
paludicola DSM 18645]
Length = 1664
Score = 38.9 bits (89), Expect = 3.9, Method: Compositional matrix adjust.
Identities = 40/155 (25%), Positives = 67/155 (43%), Gaps = 11/155 (7%)
Query: 81 FAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSE--VR 138
F+ G + ++ A ++ GE+IREFK E + F + + S + R
Sbjct: 1501 FSPNGDYIVTTSSDKTARLWNAKSGELIREFKGHEWAVVCVDFSKDGRWIVTGSEDNTAR 1560
Query: 139 ILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPAL 198
+ ++E E++L S + V+ S + IIT G ++ ++W +KT + L
Sbjct: 1561 VWNVETAEQLLTLSGHTASVTSVNFSPDSMRIITGGQ-DQAAKLW----DAKTGKEILTL 1615
Query: 199 SMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSW 233
S RH+ SP DG IL + G A W
Sbjct: 1616 S-RHTEEVTSVAFSP---DGHQILTGSRDGTAVIW 1646
>gi|403222008|dbj|BAM40140.1| uncharacterized protein TOT_020000403 [Theileria orientalis strain
Shintoku]
Length = 685
Score = 38.9 bits (89), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 26/118 (22%), Positives = 56/118 (47%), Gaps = 3/118 (2%)
Query: 72 HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFA 131
H G+ ++F ++G + + + +E G++I + + P + +E +F
Sbjct: 438 HSKGVRDISFIEEGSKFYSCSFDSNSILWDTEYGKIIGIYTVEKTPYCLTVCPKDEWVFI 497
Query: 132 LASSEVRILSLE--NGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDI 187
+ + + GE VL++++ +G + V+ DG + I+T G +K L VW +I
Sbjct: 498 VGGENKKASQFDARTGEVVLEYAEHMGCVNTVTFIDGNRRILTTG-DDKKLLVWDYNI 554
>gi|427718396|ref|YP_007066390.1| serine/threonine protein kinase with WD40 repeats [Calothrix sp.
PCC 7507]
gi|427350832|gb|AFY33556.1| serine/threonine protein kinase with WD40 repeats [Calothrix sp.
PCC 7507]
Length = 661
Score = 38.5 bits (88), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 30/122 (24%), Positives = 53/122 (43%), Gaps = 9/122 (7%)
Query: 72 HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLC------ 125
H G + +AF+ G +L + GE IR K ++S A+L
Sbjct: 538 HSGLVLAVAFSPDGINLASGSKDKTIKLWNLVTGEAIRTLKGHTDKVNSVAYLPKSGDNK 597
Query: 126 -EEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQV 182
+ I S++ V++ +LE G+E+ D G + V+ S K I + G + +++
Sbjct: 598 NQNTILISGSNDNTVKLWNLETGKEIRTLKRDSGYIYSVAVSADGKTIASGGSADNIIKI 657
Query: 183 WR 184
WR
Sbjct: 658 WR 659
>gi|332707508|ref|ZP_08427552.1| WD-40 repeat-containing protein [Moorea producens 3L]
gi|332353722|gb|EGJ33218.1| WD-40 repeat-containing protein [Moorea producens 3L]
Length = 1182
Score = 38.5 bits (88), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 50/194 (25%), Positives = 81/194 (41%), Gaps = 16/194 (8%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
LLA + + + DV G+ + TG H + + F GR L + + +
Sbjct: 915 LLASASLDNTVKLWDVDNGKEIYTLTG-HTSNVRSITFRSDGRILASGSDDRTIKLWRVQ 973
Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSEVRIL--SLENGEEVLKFSDDVGPLQYV 161
GE++R FK I +F + + A AS + RIL +E+G V F + L +
Sbjct: 974 DGELLRTFKGHLHSIRDLSFTPDGQNIATASFDGRILFWQVEDGRMVKVFDNIDSWLATI 1033
Query: 162 SASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGE----- 216
S S K++ + G G + +++W + V + P H + SPNG+
Sbjct: 1034 SISPNGKLLASGG-GYRGIKLWN-NSDGTIVKELPG----HGIWIRSLRFSPNGKLLASG 1087
Query: 217 --DGTVILAVAESG 228
D TV L E G
Sbjct: 1088 SFDRTVKLWRVEDG 1101
>gi|444918527|ref|ZP_21238596.1| WD-40 repeat protein [Cystobacter fuscus DSM 2262]
gi|444709706|gb|ELW50706.1| WD-40 repeat protein [Cystobacter fuscus DSM 2262]
Length = 182
Score = 38.5 bits (88), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 27/115 (23%), Positives = 56/115 (48%), Gaps = 5/115 (4%)
Query: 72 HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE-MGEVIREFKASEKPISSSAFLCEEKIF 130
H G ++ AF+ +G+ + G +G +++ GE + + PI S+AF + +
Sbjct: 18 HEGQVSSAAFSPEGQRVITSGADGTVRVWRADGTGEPL-VLDGRQGPIRSAAFSPDGQRI 76
Query: 131 ALASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVW 183
A + + R+ + E L GP+Q+V+ S + ++TAG ++ ++W
Sbjct: 77 ATSGDDGTARVWRADGTGEPLLLKGHTGPVQFVAFSPDGRQLVTAG-ADRTARIW 130
>gi|425468738|ref|ZP_18847730.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
9701]
gi|389884607|emb|CCI35113.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
9701]
Length = 559
Score = 38.5 bits (88), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 32/106 (30%), Positives = 57/106 (53%), Gaps = 6/106 (5%)
Query: 79 LAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALAS-SEV 137
LAF+ G+ L G++ + GE+I+ ++A E+ I S A + A AS +E+
Sbjct: 385 LAFSNDGQYLISGGSDKTIKIWDIKTGEIIKSWQAHEQAIISIAINPHRHLIASASRTEI 444
Query: 138 RILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVW 183
+I + G E++K PL++ + DG + +IT YG K +++W
Sbjct: 445 KIWQGQTG-ELIKVLRGTAPLKF--SPDG-QFLITGSYGGK-VKIW 485
>gi|344280461|ref|XP_003412002.1| PREDICTED: LOW QUALITY PROTEIN: WD repeat-containing protein
43-like [Loxodonta africana]
Length = 677
Score = 38.5 bits (88), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 56/262 (21%), Positives = 100/262 (38%), Gaps = 29/262 (11%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVG---------KKRRKER-GTL----LLA 46
+W T + L E+ P + +CLA KKR+ E GT LLA
Sbjct: 40 VWETGNNRLHQEY-VPSAHLSGTCTCLAWAPARLQAKESPQRKKRKSEAIGTSDQVDLLA 98
Query: 47 LGTSNGDILAVDVLTGEMKWK-STGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMG 105
LGT+ G IL + GE+ K +G H + + + + L+ + E ++
Sbjct: 99 LGTAVGSILLYSTVKGELHSKLISGGHDNKVNCIQWHQDNGYLYSCSDDKHIVEWNAQTC 158
Query: 106 EVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSAS- 164
V ++K +SS + K+ A +++ LE E F+ P+ + +
Sbjct: 159 RVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLETKEVYRHFTGHATPVSSLMFTT 218
Query: 165 ----------DGAK--IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNS 212
DG ++ ++ L +W+ K N + ++ PV ID S
Sbjct: 219 IRPPNESQPFDGITGLYFLSGAVHDRLLNIWQVRSEKKEKNAVMSFTVTDEPVYIDLTLS 278
Query: 213 PNGEDGTVILAVAESGVAYSWD 234
N E+ + V G + ++
Sbjct: 279 ENKEEPVRLAVVCRDGQVHLFE 300
>gi|321478591|gb|EFX89548.1| hypothetical protein DAPPUDRAFT_303234 [Daphnia pulex]
Length = 907
Score = 38.5 bits (88), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 23/62 (37%), Positives = 39/62 (62%), Gaps = 4/62 (6%)
Query: 132 LASSEVRILSLENGEEVLKFSDDVGPLQYVSA-SDGAKIIITAGYGEKHLQVWRCDISSK 190
LAS E+ + +L+ E ++KF D GP+ ++ +DGA I+IT G + HL +W D+ +
Sbjct: 214 LASGEMCLHNLKFDETLIKFKQDWGPITALTFRTDGAPIMIT-GSSQGHLAIW--DLEQQ 270
Query: 191 TV 192
T+
Sbjct: 271 TL 272
>gi|195582010|ref|XP_002080821.1| GD10690 [Drosophila simulans]
gi|194192830|gb|EDX06406.1| GD10690 [Drosophila simulans]
Length = 787
Score = 38.5 bits (88), Expect = 4.2, Method: Compositional matrix adjust.
Identities = 60/284 (21%), Positives = 110/284 (38%), Gaps = 23/284 (8%)
Query: 62 GEMKWKSTGRHPGGLAGLAFAKKG-RSLHVVGTNGMASEMKSEMGEV-IREFKASEKPIS 119
G++ W G H L G A + ++ V+ + G+ S + + E I+
Sbjct: 23 GDIAWSPNGHHLYCLNGSAVNQVDVQTSQVLKSYGLTSPLADNKKPAELDNIDVEEDTIT 82
Query: 120 SSAFLCEEKIFALASSEVRILSLENGEEV-LKFSDDVGPLQYVSASDGAKIIITAGYGEK 178
E + A S +R+ L+ G+ V L + GP+ V S ++I T+G +
Sbjct: 83 CFGLSQEHLVTAHRSGLLRLWELDTGKLVKLWKAQHKGPVIRVEFSPCGRLICTSGGADA 142
Query: 179 HLQVWRCDISSKTV------NKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYS 232
L++W D S+ + GPA + P + + G D +V A++
Sbjct: 143 TLRLW--DYSNNSCLGALKDFPGPAWLLVFHPNVLRKEIYAGGSDNSV--------YAWN 192
Query: 233 WDLKTVSQDEKTNPAKIT-VKLKKADADQQNSVNVKKSRTSIIAARLNDLEADGQKTITG 291
++ KT+ + + +++T + K +D V V + + I+ LE+ K +
Sbjct: 193 YETKTLVHKMRGHLSQVTGLSFKSNASDCNEIVTVSRDKVMIVWQLQEQLESKQLKVLPL 252
Query: 292 LHDLESEAATASAQN---EKKSKKRAASDPDLETTRDVVDGECF 332
+LE Q S K DP RD++ F
Sbjct: 253 YDELEGVVYADEGQQIIVASGSGKLQQVDPKSWKIRDILSQSDF 296
>gi|156407954|ref|XP_001641622.1| predicted protein [Nematostella vectensis]
gi|156228761|gb|EDO49559.1| predicted protein [Nematostella vectensis]
Length = 686
Score = 38.5 bits (88), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 71/323 (21%), Positives = 124/323 (38%), Gaps = 54/323 (16%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVGKK-----RRKERGTL----------LL 45
+W +GSL ++ P + +CL + RRK++ T ++
Sbjct: 51 VWDCTNGSLKNQYT-PSSHLSTTCACLKWCNSSRNVEQTPRRKKQKTAHHSTSVSTSDII 109
Query: 46 ALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMG 105
ALGT++GDIL + GE+ K G H + + +++ G L+ ++ E +E
Sbjct: 110 ALGTTSGDILLYSLSAGEVHKKLEGGHSSKVNDVEWSRNGE-LYSCSSDKYIVEWCTEKA 168
Query: 106 EVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKF-------------- 151
E ++KA + +SS A +++ L+ E + KF
Sbjct: 169 EHKSKWKADKHGVSSLRVSPLGNHLLSAGRTIKLWDLDTKELLKKFTGHSSIITDLLFLP 228
Query: 152 ----SDDVGPLQYVSASDGAKIIITAGY-----------GEKHLQV-WRCDISSKTVNKG 195
SD G S +G + +A + G HL V W ++ +K N
Sbjct: 229 NPTSSDSNGNHVSDSTINGWYFLSSAEHDRLLNIWQVFIGVIHLLVNWYIELETKAKNSL 288
Query: 196 PALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQDEKTNPAKITVKLKK 255
+LS+ P++ID I V + G + ++ +K T++LK
Sbjct: 289 ISLSLTDEPISIDVTRHDTRPKPVHISVVTKDGRLHLFEKSLNGGGKKPLLPSRTIQLKS 348
Query: 256 ADADQQNSVNVKKSRTSIIAARL 278
AD + KS S I A+L
Sbjct: 349 AD-------DKAKSPISFICAKL 364
>gi|300709644|ref|YP_003735458.1| cell surface protein/ lipoprotein [Halalkalicoccus jeotgali B3]
gi|448297585|ref|ZP_21487631.1| cell surface protein/ lipoprotein [Halalkalicoccus jeotgali B3]
gi|299123327|gb|ADJ13666.1| cell surface protein/ lipoprotein [Halalkalicoccus jeotgali B3]
gi|445579894|gb|ELY34287.1| cell surface protein/ lipoprotein [Halalkalicoccus jeotgali B3]
Length = 400
Score = 38.5 bits (88), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 32/133 (24%), Positives = 63/133 (47%), Gaps = 9/133 (6%)
Query: 40 RGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASE 99
GT+ + G G A+D +GE +W+ G G A G +++ +G
Sbjct: 264 EGTVFV--GDRGGGFYALDAASGEQRWR---VETGDWTGSGPAVVGGTVYAANRDGNVYA 318
Query: 100 MKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQ 159
+E GE + F+ + + SS + E+++ +A+ E + +L+ E ++S +GP+
Sbjct: 319 FDAESGEQLWRFQTGGE-VWSSPTVAGERVY-VANREGSVYALDAAEGTNQWSLSIGPVT 376
Query: 160 YVSAS--DGAKII 170
+ S + DGA +
Sbjct: 377 FSSPAVVDGALYV 389
>gi|159900554|ref|YP_001546801.1| serine/threonine protein kinase [Herpetosiphon aurantiacus DSM 785]
gi|159893593|gb|ABX06673.1| serine/threonine protein kinase [Herpetosiphon aurantiacus DSM 785]
Length = 774
Score = 38.5 bits (88), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 36/150 (24%), Positives = 65/150 (43%), Gaps = 17/150 (11%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
++ G+ +G I+A+D+ G ++WK + GG+ A A L N + + + E
Sbjct: 599 MVIFGSGDGSIVALDLFKGGIRWKQ--KLQGGIVSSATANDKMVLVGCMDNNLHA-LDLE 655
Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSD---------- 153
G I +F+ S SS + + + L+NG++V +++
Sbjct: 656 GGWPIWKFRTSHYVNSSPIIIGNRAFVGGIDGNIYAVDLKNGKQVWQYNTGAQIVSSPVA 715
Query: 154 DVGPLQYVSASDGAKIIITAGYGEKHLQVW 183
D G + Y+ A+DG + AG G VW
Sbjct: 716 DSGRI-YIGAADGTVYCLDAGSGTP---VW 741
>gi|386852528|ref|YP_006270541.1| Vegetative incompatibility protein HET-E-1 [Actinoplanes sp.
SE50/110]
gi|359840032|gb|AEV88473.1| Vegetative incompatibility protein HET-E-1 [Actinoplanes sp.
SE50/110]
Length = 395
Score = 38.5 bits (88), Expect = 4.3, Method: Compositional matrix adjust.
Identities = 39/168 (23%), Positives = 79/168 (47%), Gaps = 13/168 (7%)
Query: 74 GGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVI-REFKASEKPISSSAFLCEEKIFAL 132
G ++ + F++ G+ L + +++ G I + S P+ +++F + K+ A
Sbjct: 169 GAVSSVVFSRDGKRLATTAADSTVRLWQADSGRPIGKALIVSPGPVYATSFSPDGKLLAA 228
Query: 133 ASSE--VRILSLENGEEVLK-FSDDVGPLQYVSASDGAKIIITAGYGEKHLQVW----RC 185
+ VR+ + GE + GP+ ++ S G +++ T+ G+ +++W R
Sbjct: 229 SGGGDVVRLWNPATGEPAGEPIVAGPGPVYALTFSRGTRLLATSSGGDDTVRLWDTATRH 288
Query: 186 DISSKTV-NKGPALSMRHSPVAIDCKNSPNG-EDGTVILAVAESGVAY 231
+++ + GP +MR P D K G +DGTV L A +G A+
Sbjct: 289 PVAAPLAGHTGPVRAMRFGP---DGKLLATGSDDGTVRLWDAVTGRAH 333
>gi|403301810|ref|XP_003941571.1| PREDICTED: WD repeat-containing protein 43 [Saimiri boliviensis
boliviensis]
Length = 678
Score = 38.5 bits (88), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 47/220 (21%), Positives = 88/220 (40%), Gaps = 19/220 (8%)
Query: 34 KKRRKE-----RGTLLLALGTSNGDILAVDVLTGEMKWK-STGRHPGGLAGLAFAKKGRS 87
KKR+ E T LLALGT+ G IL + GE+ K +G H + + + + G
Sbjct: 81 KKRKSEAVGTSNQTDLLALGTAVGSILLYSTVKGELHSKLISGGHDNRVNCIQWHQDGGC 140
Query: 88 LHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEE 147
L+ + E + +V ++K ++S + K+ A +++ LE E
Sbjct: 141 LYSCSDDKHIVEWNVQTCKVKCKWKGDNSSVTSLCVSPDGKMLLSAGRTIKLWVLETKEV 200
Query: 148 VLKFSDDVGPLQYVSAS-----------DGAK--IIITAGYGEKHLQVWRCDISSKTVNK 194
F+ P+ + + DG ++ ++ L VW+ +K +
Sbjct: 201 YRHFTGHATPVSSLMFTTIRPPNESQPFDGITGLYFLSGAVHDRLLNVWQVRSENKEKSA 260
Query: 195 GPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWD 234
+ ++ PV ID S N E+ + V G + ++
Sbjct: 261 VMSFTVTDEPVYIDLTLSENKEEPVKLAVVCRDGQVHLFE 300
>gi|269961433|ref|ZP_06175797.1| conserved hypothetical protein [Vibrio harveyi 1DA3]
gi|269833810|gb|EEZ87905.1| conserved hypothetical protein [Vibrio harveyi 1DA3]
Length = 383
Score = 38.5 bits (88), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 48/205 (23%), Positives = 91/205 (44%), Gaps = 31/205 (15%)
Query: 47 LGTSNGDILAVDVLTGE-MKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMG 105
LG SNG + V+++TG +++ + H + +A + GR G + A +E G
Sbjct: 192 LGLSNGKAIHVNLVTGRRLEFLA---HQEKVNSVAISPNGRFALTGGNDYKAYLWDTETG 248
Query: 106 EVIREFKASEKPISSSAFLCEEKIFALAS---SEVRILSLENGEEVLKFSDDVGPLQYVS 162
+V+R F+ ++ + + L + A+ S ++ I LE GEE+ + S L + +
Sbjct: 249 QVLRSFEHEQRVVRVA--LQRDGKLAMTSDGGNQAIIWDLETGEELTQLSSWSRQLIFST 306
Query: 163 A--SDGAKIIITAGYGEKHLQVW---------RCDISSKTVNKGPALSMRHSPVAIDCKN 211
A SD ++++T G + VW R ++ K + P + + A D K
Sbjct: 307 ARFSDDGRLLVT-GTPSGRVMVWDTHTGKRVDRFEVEPKKDTRPPRAVVYDA--AFDSKQ 363
Query: 212 SPNGEDGTVILAVAESGVAYSWDLK 236
I+ +G+A +W L+
Sbjct: 364 R--------IITATSAGIAQAWQLE 380
>gi|162456674|ref|YP_001619041.1| WD repeat-containing protein [Sorangium cellulosum So ce56]
gi|161167256|emb|CAN98561.1| WD-repeat protein [Sorangium cellulosum So ce56]
Length = 1759
Score = 38.5 bits (88), Expect = 4.4, Method: Composition-based stats.
Identities = 42/200 (21%), Positives = 83/200 (41%), Gaps = 20/200 (10%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
+LA G+++ + +V +G G H + F+ GR+L + + E
Sbjct: 1496 MLASGSNDTTVRLWEVESGRALRVFEG-HGKAATSVVFSPDGRTLASGSNDTTVRLWEVE 1554
Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYV 161
G V+R F K ++S F + + A S++ VR+ +E+G +L F D V
Sbjct: 1555 SGRVLRTFGGHGKVVTSVVFSPDGRTLASGSNDTTVRLWEVESGRALLVFEDHGKGATSV 1614
Query: 162 SASDGAKIIITAGYGEKHLQVWRCD----ISSKTVNKGPALSMRHSPVAIDCKNSPNGED 217
+ S + + + Y + +++W + + + P +S+ SP D
Sbjct: 1615 AFSPDGRTLASGSY-DTMVRLWEAGSGRFLGALRGHTAPVVSVSFSP------------D 1661
Query: 218 GTVILAVAESGVAYSWDLKT 237
GT++ + + G W + T
Sbjct: 1662 GTLLASASSDGTLRLWRVAT 1681
>gi|367038693|ref|XP_003649727.1| hypothetical protein THITE_2108573 [Thielavia terrestris NRRL 8126]
gi|346996988|gb|AEO63391.1| hypothetical protein THITE_2108573 [Thielavia terrestris NRRL 8126]
Length = 554
Score = 38.5 bits (88), Expect = 4.4, Method: Compositional matrix adjust.
Identities = 19/63 (30%), Positives = 31/63 (49%)
Query: 72 HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFA 131
HPGG+ +A++ + L G N +A + GE + E+PISS + + + F
Sbjct: 314 HPGGIGSVAWSPDSKLLVSCGKNHVAKVWHVDTGECVSTLSGFEEPISSCVWPADGRTFV 373
Query: 132 LAS 134
L S
Sbjct: 374 LGS 376
>gi|73670605|ref|YP_306620.1| hypothetical protein Mbar_A3154 [Methanosarcina barkeri str.
Fusaro]
gi|72397767|gb|AAZ72040.1| hypothetical protein Mbar_A3154 [Methanosarcina barkeri str.
Fusaro]
Length = 631
Score = 38.5 bits (88), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 26/99 (26%), Positives = 51/99 (51%), Gaps = 7/99 (7%)
Query: 50 SNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIR 109
+NG ILA+ V +GE WK+ G A + K + L G + + + + GEV++
Sbjct: 397 NNGTILALSVESGEEVWKTNA---GEWADIIEVKNDKMLVNTGKSFL---LNASTGEVLK 450
Query: 110 EFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEV 148
E E P++SS + ++ +++ +R+ + + G+ V
Sbjct: 451 ELPYPELPVASS-MITDKYVYSTGPYNIRVFNSDTGKPV 488
>gi|166365267|ref|YP_001657540.1| WD repeat-containing protein [Microcystis aeruginosa NIES-843]
gi|166087640|dbj|BAG02348.1| WD-repeat protein Hat [Microcystis aeruginosa NIES-843]
Length = 1108
Score = 38.5 bits (88), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 50/199 (25%), Positives = 80/199 (40%), Gaps = 26/199 (13%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
++A+ +G I +D ++ +T + +AF + + G NG ++ S+
Sbjct: 842 IIAIANKDGQITLLDSQGKNIREFATKMR--SIYSIAFHPDSNQMAITGRNGKV-QIWSQ 898
Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFS---DDVGPL 158
G +++EF AS+ PI S AF E +SE V+ L N L S DD
Sbjct: 899 KGTMLQEFTASQVPIYSLAFNGEGTAIITGTSEGKVQYWHLSNYRPQLINSWTADDNIIY 958
Query: 159 QYVSASDGAKIIITAGYGEKHLQVWRCD---ISSKTVNKGPALSMRHSPVAIDCKNSPNG 215
V + D KI TA G+ +++W + P + SP
Sbjct: 959 DLVFSPDHQKIA-TATRGK--IKIWDLQGNLFEEIKTDSFPVYGVSFSP----------- 1004
Query: 216 EDGTVILAVAESGVAYSWD 234
DG I A++ G A WD
Sbjct: 1005 -DGEKIAAISRDGTARRWD 1022
>gi|24652208|ref|NP_610526.1| CG1671 [Drosophila melanogaster]
gi|7303841|gb|AAF58888.1| CG1671 [Drosophila melanogaster]
gi|28416343|gb|AAO42644.1| LD47550p [Drosophila melanogaster]
gi|220946482|gb|ACL85784.1| CG1671-PA [synthetic construct]
Length = 787
Score = 38.5 bits (88), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 52/245 (21%), Positives = 102/245 (41%), Gaps = 20/245 (8%)
Query: 62 GEMKWKSTGRHPGGLAGLAFAKKG-RSLHVVGTNGMAS-EMKSEMGEVIREFKASEKPIS 119
G++ W G H L G A + ++ ++ + G++S ++ + E I+
Sbjct: 23 GDIAWSPNGHHLYCLNGSAVNQVDVQTSQILNSYGLSSPSADNKRPAELDNIDVEEDTIT 82
Query: 120 SSAFLCEEKIFALASSEVRILSLENGEEV-LKFSDDVGPLQYVSASDGAKIIITAGYGEK 178
E + A S +R+ L+ G+ V L + GP+ V S ++I T+G +
Sbjct: 83 CFGLSQEHLVTAHRSGLLRLWQLDTGKLVKLWKAQHKGPVIRVEFSPCGRLICTSGGADA 142
Query: 179 HLQVWRCDISSKTV------NKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYS 232
L++W D S+ + GPA + P + + G D TV A++
Sbjct: 143 TLRLW--DYSNNSCLGALKDFPGPAWLLVFHPNVLRKEIYAGGSDNTV--------YAWN 192
Query: 233 WDLKTVSQDEKTNPAKIT-VKLKKADADQQNSVNVKKSRTSIIAARLNDLEADGQKTITG 291
++ KT+ + + +++T + K +D V V + + I+ LE+ K +
Sbjct: 193 YETKTLVHKMRGHLSQVTGLSFKSNASDCNEIVTVSRDKVMIVWQLQEQLESKQLKVLPL 252
Query: 292 LHDLE 296
+LE
Sbjct: 253 YDELE 257
>gi|50547673|ref|XP_501306.1| YALI0C00913p [Yarrowia lipolytica]
gi|49647173|emb|CAG81601.1| YALI0C00913p [Yarrowia lipolytica CLIB122]
Length = 912
Score = 38.5 bits (88), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 64/265 (24%), Positives = 123/265 (46%), Gaps = 31/265 (11%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGT---NGMASEM 100
LLA G ++G I D +G++ G H + L F G L V G+ N + ++
Sbjct: 88 LLAAGYTDGSIRVFDYRSGQVMTTFKG-HKSSITCLEFDASGTRL-VSGSRDSNVILWDL 145
Query: 101 KSEMGEVIREFKASEKPISSSAFLCEEKIFALASS-EVRILSLENGEEVLKFSDDVGPLQ 159
E+G + ++ ++ FL E+K+ A V+I LE+ + + VG +
Sbjct: 146 VEEVG--VSRLRSHRDQVTGVKFLDEKKLITCAKDGLVKIWDLESRDCI---ETRVGGSE 200
Query: 160 YVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHS------PVAIDCKNSP 213
+ + ++T G EK ++VW+ D+ +K G LS R S ID + S
Sbjct: 201 VWAMDVYSDTVMTVGR-EKEVKVWQVDLDAK---DGEMLSQRGSLERVSNDRGIDIEFSK 256
Query: 214 NGEDGTVILAVAESGVAYSWDLKTVSQDEKTNPAKITVKLKKADA---DQQNSVNVKKSR 270
+G G VI + + V W +T+++ +K+ KI ++KK D D+ + +V +
Sbjct: 257 DG--GHVIFSGNDKTVEL-WRRRTLAEIKKSIAKKIKRRVKKGDTVDEDEYDEKSVNEIW 313
Query: 271 TSII----AARLNDLEADGQKTITG 291
++ +A++ ++ G ++++G
Sbjct: 314 VQVVNFKTSAKIRNVTWTGSESVSG 338
>gi|380018983|ref|XP_003693398.1| PREDICTED: LOW QUALITY PROTEIN: WD repeat-containing protein
43-like [Apis florea]
Length = 623
Score = 38.5 bits (88), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 59/270 (21%), Positives = 110/270 (40%), Gaps = 48/270 (17%)
Query: 34 KKRRK-------ERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAG------LA 80
K+RRK ++G ++A+G++NG I D T + + G H G + +
Sbjct: 71 KRRRKSISEESIQKG--IIAMGSTNGKITLYDTTTSSVTAQLDG-HSGSVTAATWSENVG 127
Query: 81 FAKKGRSLHVVG----TNGMASEMKSEMGEVIREFKASEKPISSSA--FLCEEKIFALAS 134
F H++ NG+ + KS G K + +S+ L E++
Sbjct: 128 FITASDDHHIIQWNFQENGVKCKWKSGKG------KTTSLAVSTDGNNLLSGERV----- 176
Query: 135 SEVRILSLENGEEVLKFS---DDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKT 191
V+ L + F+ + V L + + G +I+ G+ +L VW D
Sbjct: 177 --VKWWDLNTKRLIRTFTGHANQVTCLHTIKMTSGNNYVISGACGDGYLSVWALDEQRNE 234
Query: 192 VNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQDEKTNPAKITV 251
+L+++ P+++ S N + ++L V SG A L + ++ P K ++
Sbjct: 235 RTSVASLALQDDPISVSANVSANSQ--VMVLVVTRSGQA---QLFQYQPNGRSKPLKPSL 289
Query: 252 KLKKA-DADQQNSVNVKKSRTSIIAARLND 280
+ A D Q++ V + SI+ A L D
Sbjct: 290 NIAVASDISQKDEVQ----QISILNAMLTD 315
>gi|33861590|ref|NP_893151.1| hypothetical protein PMM1034 [Prochlorococcus marinus subsp.
pastoris str. CCMP1986]
gi|33634167|emb|CAE19493.1| G-protein beta WD-40 repeats [Prochlorococcus marinus subsp.
pastoris str. CCMP1986]
Length = 357
Score = 38.5 bits (88), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 43/183 (23%), Positives = 77/183 (42%), Gaps = 17/183 (9%)
Query: 27 LACGFVGKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGR 86
+ CG+ K + L +G G + D TG++ WK H GGL ++ G
Sbjct: 26 IVCGWALKGK-------LFIVGDVVGGLYGFDGDTGKLIWKQDEAHSGGLLAMSIHPNGE 78
Query: 87 SLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGE 146
G +G+ + GE I+ K + + ++ A+A S+ + ++G
Sbjct: 79 IFATSGQDGIVRILNCNNGEEIKAIKLGKGWVEHLSWSNGGLYLAVAFSKKVYVFNQDGN 138
Query: 147 EVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVN-----KGPALSMR 201
E + + +S S+ + + TA YG +V DI++K N +G +SM+
Sbjct: 139 EQWGSEEHPSTVSAISWSNKNE-LATACYG----RVTFFDIANKKTNQKLEWQGSLVSMQ 193
Query: 202 HSP 204
SP
Sbjct: 194 LSP 196
>gi|384109844|ref|ZP_10010704.1| ABC-type xylose transport system, periplasmic component [Treponema
sp. JC4]
gi|383868594|gb|EID84233.1| ABC-type xylose transport system, periplasmic component [Treponema
sp. JC4]
Length = 1177
Score = 38.5 bits (88), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 32/130 (24%), Positives = 60/130 (46%), Gaps = 4/130 (3%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
+A G+S+ + D TG + + H +A ++++K GR + + +++
Sbjct: 148 YIASGSSDKSVKFWDAATGMLLQTLSDTHTKTVAAISYSKSGRYVATASWDN-TTKIYHA 206
Query: 104 MGEVIREFKASEK-PISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQY 160
MG + R + + F +EK A S++ +RI +ENG+ V +D G +
Sbjct: 207 MGGIERNLLTGHNGAVYAVEFSPDEKYIATGSADNSIRIYDVENGKFVRAITDITGEVWT 266
Query: 161 VSASDGAKII 170
+S S KII
Sbjct: 267 ISYSPDGKII 276
>gi|430813347|emb|CCJ29317.1| unnamed protein product [Pneumocystis jirovecii]
Length = 887
Score = 38.5 bits (88), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 26/98 (26%), Positives = 48/98 (48%), Gaps = 12/98 (12%)
Query: 43 LLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
+LL +G SNGDIL ++ + +G H G + GLAF+ + + G +G
Sbjct: 117 ILLCIGLSNGDILVFSTTQDKVILRLSGGHTGDVYGLAFSGDTQ-IWSCGADGK------ 169
Query: 103 EMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRIL 140
+ +++ E I S++ +C E + AL S+ ++
Sbjct: 170 -----VVQWELFESKILSTSTVCTESLLALVYSDNEVI 202
>gi|400598576|gb|EJP66285.1| transcriptional repressor TUP1 [Beauveria bassiana ARSEF 2860]
Length = 1055
Score = 38.5 bits (88), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 23/68 (33%), Positives = 37/68 (54%), Gaps = 1/68 (1%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
L+A G GDIL D+ +G K + +G H GG+ LAF+ G+ L G + +E
Sbjct: 140 LVATGGGGGDILIWDMESGSRKHRISG-HDGGVRALAFSNDGKVLASSGGDQTVCLWDAE 198
Query: 104 MGEVIREF 111
++I++F
Sbjct: 199 SADLIKKF 206
>gi|398788810|ref|ZP_10550903.1| hypothetical protein SU9_31091 [Streptomyces auratus AGR0001]
gi|396991853|gb|EJJ02976.1| hypothetical protein SU9_31091 [Streptomyces auratus AGR0001]
Length = 1980
Score = 38.5 bits (88), Expect = 4.9, Method: Composition-based stats.
Identities = 35/114 (30%), Positives = 52/114 (45%), Gaps = 4/114 (3%)
Query: 25 SCLACGFVGKKRRKERGT--LLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFA 82
S AC F + R R T +L+ +G + G + D TG+ + G H +A LAF+
Sbjct: 1585 SVWACRFRPRSRFGARETDPMLVTIG-NEGLVRLWDTSTGQGRRILRG-HGRRVASLAFS 1642
Query: 83 KKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSE 136
G L G +G+A +S G REF + S+ F+ A ASS+
Sbjct: 1643 PDGTHLAACGNDGVARVWESATGRRTREFAGDHDRLVSALFVPGSHQLATASSD 1696
>gi|390441802|ref|ZP_10229833.1| conserved hypothetical protein [Microcystis sp. T1-4]
gi|389834856|emb|CCI33959.1| conserved hypothetical protein [Microcystis sp. T1-4]
Length = 1107
Score = 38.5 bits (88), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 45/168 (26%), Positives = 69/168 (41%), Gaps = 24/168 (14%)
Query: 76 LAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASS 135
+ +AF + + G NG ++ S+ G +++EF AS+ PI S AF E +S
Sbjct: 872 IYSIAFHPDDNQIAITGRNGKV-QIWSKKGTMLQEFTASQVPIYSLAFNGEGTAIITGTS 930
Query: 136 E--VRILSLENGEEVLKFS---DDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCD---I 187
E V+ L N L S DD V + D KI TA G+ +++W +
Sbjct: 931 EGKVQYWHLSNHRTKLINSWTVDDSIIYDLVFSPDHQKIA-TATRGK--IKIWDLQGNIL 987
Query: 188 SSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDL 235
+ P + SP DG I A++ G A WD+
Sbjct: 988 KEIKTDSFPVYGVSFSP------------DGEKIAAISRDGTARRWDI 1023
>gi|425452441|ref|ZP_18832258.1| conserved hypothetical protein [Microcystis aeruginosa PCC 7941]
gi|389765762|emb|CCI08435.1| conserved hypothetical protein [Microcystis aeruginosa PCC 7941]
Length = 1108
Score = 38.5 bits (88), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 50/199 (25%), Positives = 82/199 (41%), Gaps = 26/199 (13%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
++A+ +G I +D +++ +T + +AF + + G NG ++ S+
Sbjct: 842 IIAIANKDGQITLLDSQGKKIREFATKMR--SIYSIAFHPDSNQIAITGRNGKV-QIWSQ 898
Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFS---DDVGPL 158
G +++EF AS+ PI S AF E +SE V+ L N L S DD
Sbjct: 899 KGTMLQEFTASQVPIYSLAFNGEGTAIITGTSEGKVQYWHLGNYRPQLINSWTADDNIIY 958
Query: 159 QYVSASDGAKIIITAGYGEKHLQVWRCD---ISSKTVNKGPALSMRHSPVAIDCKNSPNG 215
V + D KI TA G+ +++W + + P + SP
Sbjct: 959 DLVFSPDHQKIA-TATRGK--IKIWDLQGNLLKEIKTDSFPVYGVSFSP----------- 1004
Query: 216 EDGTVILAVAESGVAYSWD 234
DG I A++ G A WD
Sbjct: 1005 -DGEKIAAISRDGTARRWD 1022
>gi|254501092|ref|ZP_05113243.1| Cytochrome c subfamily, putative [Labrenzia alexandrii DFL-11]
gi|222437163|gb|EEE43842.1| Cytochrome c subfamily, putative [Labrenzia alexandrii DFL-11]
Length = 370
Score = 38.5 bits (88), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 47/212 (22%), Positives = 89/212 (41%), Gaps = 43/212 (20%)
Query: 72 HPGGLAGLAFAKKGRSLHVVGTNG----MASEMKSE--------MG-EVIREFKASEKPI 118
H G+ +AF K G+SL+ +G E ++E G V++ ++ P
Sbjct: 76 HNAGVNAVAFTKDGKSLYSASMDGSIILWDVEERAEKRVVDRNGFGINVLKLGGPADGPD 135
Query: 119 SSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSAS-DGAKIIITAGYGE 177
+ A+ ++ + RIL LE G+ + +FS + P+ + S DG + + +G G+
Sbjct: 136 TWLAYGAQDGV-------TRILDLETGDRLYEFSLERRPILGLDVSPDGTR--LASGDGQ 186
Query: 178 KHLQV-----WRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYS 232
++ V W + + KGP ++ S DG+V+ A + YS
Sbjct: 187 GYITVFETGSWGIETDFRAALKGPIWALAFS------------NDGSVLHAAGIENIVYS 234
Query: 233 W---DLKTVSQDEKTNPAKITVKLKKADADQQ 261
W DL + PA + + ++ ++Q
Sbjct: 235 WPVADLSSFDPMATQTPAFLRKPEEMSNGERQ 266
>gi|113478009|ref|YP_724070.1| WD-40 repeat-containing protein [Trichodesmium erythraeum IMS101]
gi|110169057|gb|ABG53597.1| WD-40 repeat [Trichodesmium erythraeum IMS101]
Length = 1510
Score = 38.5 bits (88), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 57/240 (23%), Positives = 94/240 (39%), Gaps = 38/240 (15%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYS----CLACGFVGKKRRKERGTLLLALGTSNGDILA 56
+W T +G++LA D V++S +A K R T NG +LA
Sbjct: 1116 LWDTENGNVLATLLHQDLVIAVAFSPDGKTIATASWDKTAR--------LWDTENGKVLA 1167
Query: 57 VDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEK 116
T H + +AF+ G+++ ++ A +E G+V+ +
Sbjct: 1168 ------------TLNHQSSVRAVAFSPDGKTIATASSDKTARLWDTENGKVLATLN-HQS 1214
Query: 117 PISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAG 174
+++ AF + K A ASS+ R+ ENG +VL + ++ V+ S K I TA
Sbjct: 1215 SVNAVAFSPDGKTIATASSDKTARLWDTENG-KVLATLNHQSSVRAVAFSPDGKTIATAS 1273
Query: 175 YGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWD 234
+K ++W T N ++ H SP DG I + A WD
Sbjct: 1274 -SDKTARLW------DTENGKVLATLNHQSRVFAVAFSP---DGKTIATASSDKTARLWD 1323
>gi|329890618|ref|ZP_08268961.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Brevundimonas diminuta ATCC 11568]
gi|328845919|gb|EGF95483.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
protein [Brevundimonas diminuta ATCC 11568]
Length = 642
Score = 38.5 bits (88), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 30/97 (30%), Positives = 51/97 (52%), Gaps = 2/97 (2%)
Query: 215 GEDGTVILAVAESGVAYSWD-LKTVSQDEKTNPAKITVKLKKADADQQNSVNVKKSRTSI 273
G +G IL VA VA+ + L T S+ A+ LK+AD ++ + +V R+++
Sbjct: 170 GVEGRFILPVAIGVVAFIFFCLTTRSRLHAAGEAEAR-ALKEADGKRRQAESVMAGRSAL 228
Query: 274 IAARLNDLEADGQKTITGLHDLESEAATASAQNEKKS 310
+AA +DL +TG H+L AA +SA ++ +
Sbjct: 229 LAAVAHDLRTPIGAILTGAHELTRVAAPSSATRQQTA 265
>gi|401401154|ref|XP_003880944.1| putative pre-mRNA splicing factor [Neospora caninum Liverpool]
gi|325115356|emb|CBZ50911.1| putative pre-mRNA splicing factor [Neospora caninum Liverpool]
Length = 612
Score = 38.5 bits (88), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 31/150 (20%), Positives = 68/150 (45%), Gaps = 6/150 (4%)
Query: 41 GTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEM 100
G LLL+ + + + DVL +++ H + + +A++GR + +
Sbjct: 336 GHLLLS-ASMDSTVKIWDVLNQRKLYRTYTAHKQAVRDIQWAEEGRRFYSCSFDNTVKLW 394
Query: 101 KSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLE--NGEEVLKFSDDVGPL 158
+E G+VI F + P + + +F + S+ R + + G +++++ +G +
Sbjct: 395 DTEAGKVIGSFGNGKTPYCVTVNPNDNNVFVVGSANRRAVQFDARTGNIEVEYAEHIGSV 454
Query: 159 QYVS-ASDGAKIIITAGYGEKHLQVWRCDI 187
V+ +G +++ TA +K L VW I
Sbjct: 455 NTVTFCEEGRRLVTTA--DDKKLFVWEYGI 482
>gi|321466195|gb|EFX77192.1| hypothetical protein DAPPUDRAFT_305906 [Daphnia pulex]
Length = 1099
Score = 38.5 bits (88), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 38/169 (22%), Positives = 75/169 (44%), Gaps = 16/169 (9%)
Query: 40 RGTLLLALGTSNGDILAVDVLTGEMKWK-------STGRHPGGLAGLAFAKKGRSLHVVG 92
R LL+ + T G+++A++ TG++KWK + L K SL+++G
Sbjct: 48 RDDLLVLVSTLEGNLIAINKRTGKVKWKLEDEPVIKLSKELSKTFNLLPDPKDGSLYMLG 107
Query: 93 TNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEK-------IFALASSEVRILSLENG 145
+G +E +++ I E ++ SS L K I L + +LS +
Sbjct: 108 NSG--AEALTKLPFTIPELVSASPSQSSDGMLYMGKKLDMWFVIDPLTGEKQEVLSFQGL 165
Query: 146 EEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNK 194
E + +GP ++ ++ + I++ + E+H V D +S T+ +
Sbjct: 166 ETACPRNKPLGPSIFIGRTEYSLILLDSRTRERHWNVTYFDYTSSTLGQ 214
>gi|332661895|ref|YP_004451364.1| WD40 repeat-containing protein [Haliscomenobacter hydrossis DSM
1100]
gi|332337392|gb|AEE54491.1| WD40 repeat-containing protein [Haliscomenobacter hydrossis DSM
1100]
Length = 598
Score = 38.5 bits (88), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 42/172 (24%), Positives = 74/172 (43%), Gaps = 13/172 (7%)
Query: 72 HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFA 131
H L+ +AF+ G+ L ++ A E G+ + K +SS AF + K A
Sbjct: 184 HSSYLSSVAFSPDGKRLATGSSDHSAKIWDVESGKQVLSLKGHSSYVSSVAFSPDGKRLA 243
Query: 132 LASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISS 189
S + +I +E+G++ L + V+ S K + T G G+K ++W +
Sbjct: 244 TGSDDKSAKIWDVESGKQTLSLEGHSSYVSSVAFSPDGKRLAT-GSGDKSAKIWDVESGK 302
Query: 190 KTVNKGPALSMR-HSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQ 240
+T LS+ HS SP DG ++ ++ A WD+++ Q
Sbjct: 303 QT------LSLEGHSDYVWSVAFSP---DGKRLVTGSQDQSAKIWDVESGKQ 345
>gi|119490763|ref|ZP_01623095.1| WD-repeat protein [Lyngbya sp. PCC 8106]
gi|119453747|gb|EAW34905.1| WD-repeat protein [Lyngbya sp. PCC 8106]
Length = 991
Score = 38.5 bits (88), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 22/72 (30%), Positives = 36/72 (50%), Gaps = 1/72 (1%)
Query: 45 LALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEM 104
LA + +G I D+ TG + +G H GG+ G+ ++ +G+ L G +GM
Sbjct: 686 LASSSKDGTIRLWDIQTGHCRQVWSGHH-GGVNGIVWSHRGQMLATCGEDGMIRLWDVRR 744
Query: 105 GEVIREFKASEK 116
G V RE +A +
Sbjct: 745 GRVYREMEAKSR 756
>gi|390604124|gb|EIN13515.1| WD40 repeat-containing protein [Punctularia strigosozonata
HHB-11173 SS5]
Length = 291
Score = 38.1 bits (87), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 42/180 (23%), Positives = 80/180 (44%), Gaps = 21/180 (11%)
Query: 76 LAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASS 135
++ + F+ G+ L + + + GE+I+ + IS ++ + + A AS
Sbjct: 3 ISSVEFSPDGKVLASAAADKLVKLWDTATGEIIKTLVGHTEGISDVSWSPDGEFLASASD 62
Query: 136 E--VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDIS-SKTV 192
+ +RI SLE G + S + + ++++ G+ E +++W DI+ K++
Sbjct: 63 DKTIRIWSLETGLTAKVLLGHTNFVFCASFNPKSNLLVSGGFDET-VRIW--DIARGKSI 119
Query: 193 NKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWD------LKTVSQDEKTNP 246
PA S + V DGT+I++ A G+ WD LKT+ D+ NP
Sbjct: 120 KVLPAHSDPVTAVGF-------SHDGTLIVSCAMDGLIRVWDTNSGQCLKTIVDDD--NP 170
>gi|449133882|ref|ZP_21769395.1| WD-containing repeat protein [Rhodopirellula europaea 6C]
gi|448887443|gb|EMB17819.1| WD-containing repeat protein [Rhodopirellula europaea 6C]
Length = 1173
Score = 38.1 bits (87), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 41/152 (26%), Positives = 62/152 (40%), Gaps = 10/152 (6%)
Query: 85 GRSLHVVGTNGMASEMKSEMGEVIREFKA-SEKPISSSAFLCEEKIFALASSEVRILSLE 143
G+ L G + A +E G+++ E ++ IS AF + K A V I + E
Sbjct: 921 GQRLVSGGWDATACLWDTETGDLLLELDTKTDSSISDVAFTADGKHIATTGDSVNIWNAE 980
Query: 144 NGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHS 203
G+ L F++ + VS SD + + T G +K VW + + K P H
Sbjct: 981 TGKHELAFAEHSDHVFAVSFSDDGRWVATGGAKDKTAMVWDRETGEVKL-KLPD----HR 1035
Query: 204 PVAIDCKNSPNGEDGTVILAVAESGVAYSWDL 235
D +PN E+ LA SG W L
Sbjct: 1036 FWVYDLCFTPNNEE----LATVASGTVTIWRL 1063
>gi|443316153|ref|ZP_21045609.1| WD40 repeat-containing protein [Leptolyngbya sp. PCC 6406]
gi|442784253|gb|ELR94137.1| WD40 repeat-containing protein [Leptolyngbya sp. PCC 6406]
Length = 686
Score = 38.1 bits (87), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 25/105 (23%), Positives = 52/105 (49%), Gaps = 3/105 (2%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
+L GT+ G + A+D+ +G +W+ T P LA G+ + + G +G ++++
Sbjct: 459 ILYSGTAAGTLQAIDIDSGIPRWEYTAA-PTAFNVLARTPDGQQIMIGGADGTIHIVQAQ 517
Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGE 146
GE++R A + +++ A + + S+ + I +L GE
Sbjct: 518 TGELLRTISAHQGAVNTLAATADGQALISGGSDRHIYIWTLATGE 562
>gi|168701051|ref|ZP_02733328.1| NB-ARC domain protein [Gemmata obscuriglobus UQM 2246]
Length = 491
Score = 38.1 bits (87), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 27/108 (25%), Positives = 50/108 (46%), Gaps = 2/108 (1%)
Query: 67 KSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCE 126
KS H + G+ + G+ L G + E GE IR+ + +K +++ AF+ +
Sbjct: 350 KSFEGHTHHVMGVGWTPDGKKLASCGADNFVKVWDYEKGEKIRDMQGHQKQVTALAFVGK 409
Query: 127 EKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIIT 172
F S + VR+ + +NG V F + VSAS +++++
Sbjct: 410 TSQFVTGSGDASVRMWNADNGGNVRSFPGASDFVYAVSASPDGEVVVS 457
>gi|296421914|ref|XP_002840508.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295636726|emb|CAZ84699.1| unnamed protein product [Tuber melanosporum]
Length = 333
Score = 38.1 bits (87), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 38/133 (28%), Positives = 59/133 (44%), Gaps = 20/133 (15%)
Query: 144 NGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHS 203
NGE V + VG L V D A ++ G + +++W V G L M
Sbjct: 41 NGERVGTYKGHVGALWTVDV-DPATTLLATGSADNTIRIW-------DVRTGKQLKMWEF 92
Query: 204 PVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQDEKTNPAKITVKLKKADADQQN- 262
P A+ K EDGT++LAV E + + L TV ITV L+ D+ N
Sbjct: 93 PTAV--KRVEFSEDGTLLLAVTEQRMGH---LGTVV------VFPITVDLEAEQVDEPNF 141
Query: 263 SVNVKKSRTSIIA 275
++ ++S+ ++ A
Sbjct: 142 TITFRESKATVAA 154
>gi|353243507|emb|CCA75040.1| related to WD40-repeat protein (notchless protein) [Piriformospora
indica DSM 11827]
Length = 1186
Score = 38.1 bits (87), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 40/159 (25%), Positives = 75/159 (47%), Gaps = 33/159 (20%)
Query: 137 VRILSLENGEEVLK-FSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKG 195
+RI ++ G+++ + F G + ++ S ++++A + +Q+W D+ S +K
Sbjct: 851 IRIWNVNTGQQMGRPFQGHKGSVWAIAFSPDGSLLVSAS-EDNTIQIW--DVESGRPSK- 906
Query: 196 PALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQDEKTNPAKITVKLKK 255
ALS RH + SP DG++I++V+E + WD+ T S P ++ +
Sbjct: 907 -ALSRRHKDLITSVAFSP---DGSLIVSVSEDKIIRLWDVYTGS------PWGELLQGQP 956
Query: 256 ADADQQNSVNVKKSRTSIIAARLNDLEADGQKTITGLHD 294
DA +IA + +DG + I+GLHD
Sbjct: 957 VDA-------------PVIA-----ISSDGSRIISGLHD 977
>gi|171679856|ref|XP_001904874.1| hypothetical protein [Podospora anserina S mat+]
gi|170939554|emb|CAP64781.1| unnamed protein product [Podospora anserina S mat+]
Length = 1108
Score = 38.1 bits (87), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 39/169 (23%), Positives = 78/169 (46%), Gaps = 13/169 (7%)
Query: 72 HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFA 131
H G + +AF+ G + +G +++ G+ +R+ + + S AF +
Sbjct: 796 HSGSVRSVAFSPDGSRIVSASDDGTIRIWEAKSGKEVRKLEGHSNWVRSVAFSPDSSRIV 855
Query: 132 LASSE--VRILSLENGEEVLKFSDDVGPLQYVSAS-DGAKIIITAGYGEKHLQVWRCDIS 188
AS + +RI ++G+EV K G ++ V+ S DG++I+ + ++ +++W S
Sbjct: 856 SASDDGTIRIWEAKSGKEVRKLEGHSGSVRSVAFSPDGSRIV--SASNDQTIRIWEAK-S 912
Query: 189 SKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKT 237
K V K HS + + SP DG+ I++ + W+ K+
Sbjct: 913 GKEVRKLEG----HSGLVLSVAFSP---DGSRIVSASNDQTIRIWEAKS 954
>gi|442323770|ref|YP_007363791.1| hypothetical protein MYSTI_06834 [Myxococcus stipitatus DSM 14675]
gi|441491412|gb|AGC48107.1| hypothetical protein MYSTI_06834 [Myxococcus stipitatus DSM 14675]
Length = 816
Score = 38.1 bits (87), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 35/145 (24%), Positives = 66/145 (45%), Gaps = 7/145 (4%)
Query: 58 DVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKP 117
DV+TG+ G G + GLA++ G+ L + A E GE++ +F E
Sbjct: 182 DVVTGKCLRVLEGHDIGEILGLAWSPDGKKLASGSRSHDARVWDVETGELLHDFPKQEGR 241
Query: 118 ISSSAFLCEEKIFALASS--EVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGY 175
++S AF + K+ A+ + V + LE+GE+V + V+ +++ +A
Sbjct: 242 VTSVAFSPDGKLLAVGNLGWRVHLFDLESGEKVRTLKGHQQSVLCVAFHPSGRLLASAA- 300
Query: 176 GEKHLQVWR----CDISSKTVNKGP 196
+ +++W ++S T N P
Sbjct: 301 SDDTVRIWDMTTGAQVASITTNATP 325
>gi|357393107|ref|YP_004907948.1| hypothetical protein KSE_62270 [Kitasatospora setae KM-6054]
gi|311899584|dbj|BAJ31992.1| hypothetical protein KSE_62270 [Kitasatospora setae KM-6054]
Length = 1389
Score = 38.1 bits (87), Expect = 6.2, Method: Composition-based stats.
Identities = 42/170 (24%), Positives = 66/170 (38%), Gaps = 15/170 (8%)
Query: 49 TSNGDILAVDVLTGEMKWK-STGRHPGGLAG-------LAFAKKGRSLHVVGTNGMASEM 100
+ +G LA + W +TGR LAG LAF+ GR+L G +
Sbjct: 831 SPDGRTLATGGEGAALLWDVATGRTTATLAGFTGAVFSLAFSPDGRTLATGGWDRTVRLW 890
Query: 101 KSEMGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPL 158
G ++S AF + A AS + R+ + G F++ GP+
Sbjct: 891 DPATGRTTATLTGHTANVASLAFSPDGSTLATASEDGTARLWDVATGRTTATFTNSSGPV 950
Query: 159 QYVSASDGAKIIITAGYGEKHLQVWRC----DISSKTVNKGPALSMRHSP 204
V+ S + + T G GE +W I++ T + G S+ SP
Sbjct: 951 GAVAFSPDGRTLATGG-GEGAALLWEVATGRTIATLTGHTGAVFSLAFSP 999
>gi|297204688|ref|ZP_06922085.1| WD repeat protein [Streptomyces sviceus ATCC 29083]
gi|297148756|gb|EDY54791.2| WD repeat protein [Streptomyces sviceus ATCC 29083]
Length = 1167
Score = 38.1 bits (87), Expect = 6.2, Method: Composition-based stats.
Identities = 48/188 (25%), Positives = 73/188 (38%), Gaps = 15/188 (7%)
Query: 45 LALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEM 104
LA G+S+ + D TG M G H G + LAF G L +
Sbjct: 984 LATGSSDTTVRLWDPSTGAMVRILNG-HRGPVRALAFHPDGTFLATASHDRTVRIWDPST 1042
Query: 105 GEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVS 162
G+V+R + + AF + ++ A SS+ VR+ G V S GP++ V+
Sbjct: 1043 GDVVRSLVGHTDQLHTVAFSPDGRLLATGSSDTTVRLWDASTGAMVRMLSGHRGPVRAVA 1102
Query: 163 ASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAI-DCKNSPNGEDGTVI 221
S + + G E I G AL+M + A+ C S DG V+
Sbjct: 1103 FSPDGSCLASGGADET--------IRIHAPASGEALTMMRTDSAVWSCSWSA---DGRVL 1151
Query: 222 LAVAESGV 229
A +G+
Sbjct: 1152 FAGTTAGL 1159
Score = 37.7 bits (86), Expect = 7.3, Method: Composition-based stats.
Identities = 41/152 (26%), Positives = 68/152 (44%), Gaps = 10/152 (6%)
Query: 60 LTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPIS 119
+TGE TG H G + +AF+ GR L G + A + G+ +R + + P+
Sbjct: 610 VTGEALHTLTG-HQGPVRAVAFSPDGRLLVTGGRDATARIWDATTGQPVRTMRGHDGPVL 668
Query: 120 SSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGE 177
+ AF + + A SS+ VRI G EVL + G L D + +G +
Sbjct: 669 AVAFSPDGSLLATGSSDTTVRIWDPATG-EVLHTASGHGGLVSAVVFDRDGSRLASGGAD 727
Query: 178 KHLQVWRCDISSKTVNK----GPALSMRHSPV 205
++W D++S ++ GP ++R S V
Sbjct: 728 TTARLW--DLTSPGPDRRPGDGPPRALRASRV 757
>gi|290979537|ref|XP_002672490.1| transcription initiation factor TFIID subunit [Naegleria gruberi]
gi|284086067|gb|EFC39746.1| transcription initiation factor TFIID subunit [Naegleria gruberi]
Length = 716
Score = 38.1 bits (87), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 31/100 (31%), Positives = 45/100 (45%), Gaps = 3/100 (3%)
Query: 45 LALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEM 104
+A G+S+ + DV TGE G H G + LAF+ GR G +G
Sbjct: 548 VATGSSDKTVRLWDVQTGECMRMFIGHH-GSINTLAFSHDGRYCASAGDDGQVLVWDIGS 606
Query: 105 GEVIREFKASEKPISSSAFLCEEKIFALAS--SEVRILSL 142
G++ +F KP+ S F + A AS S VR+ S+
Sbjct: 607 GKIAYKFIGHTKPVWSLDFNRNDTFLASASLDSTVRVWSM 646
>gi|432941441|ref|XP_004082853.1| PREDICTED: WD repeat-containing protein 43-like [Oryzias latipes]
Length = 646
Score = 38.1 bits (87), Expect = 6.3, Method: Compositional matrix adjust.
Identities = 49/248 (19%), Positives = 101/248 (40%), Gaps = 22/248 (8%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFV-----GKKRRKERGTL--------LLAL 47
IW+T+ ++ E+ P + +C+A G G +R+K + LLA+
Sbjct: 35 IWNTDSKTIHQEY-VPSAHLSATCTCIAWGPCRTVKEGPQRKKRKSEAVQVEEKADLLAM 93
Query: 48 GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
GT+ G +L G + H G+ + + + L+ + E + G++
Sbjct: 94 GTAAGSVLLYSTAKGSLHCSLDEAHSAGVNCVHWHPEDSVLYSGSDDTNIVEWDLQSGKM 153
Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKF---SDDVGPLQYVSA- 163
++KA ++S + K+ A +++ L+ E KF S V L++ +
Sbjct: 154 RSKWKADRAAVTSLCVSPDGKLLLSAGHVIKMWDLDTKELYRKFTGHSTAVTTLRFATTR 213
Query: 164 ---SDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTV 220
S+G + A + ++ + VW+ K N + ++ P +D S + E+
Sbjct: 214 PPDSNGLYFVSGAAH-DRLISVWQVREDGKDKNSVVSFTLTDEPRHVDLVPSNSKEEAVR 272
Query: 221 ILAVAESG 228
+ V + G
Sbjct: 273 LAVVCQDG 280
>gi|50304115|ref|XP_452007.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|74690425|sp|Q6CVN2.1|JIP5_KLULA RecName: Full=WD repeat-containing protein JIP5
gi|49641139|emb|CAH02400.1| KLLA0B10736p [Kluyveromyces lactis]
Length = 525
Score = 38.1 bits (87), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 45/192 (23%), Positives = 79/192 (41%), Gaps = 26/192 (13%)
Query: 63 EMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFK---ASEKPIS 119
E+ WK T RH G + G++F G L+ +G + + + S G+V+++ +SEK
Sbjct: 114 ELLWK-TRRHKGSVRGMSFNNDGSKLYTIGIDNVLKKANSLTGKVMKKVTLPLSSEKNHY 172
Query: 120 SSAFLCEEKIFALASSEV--------RILSLENGEEVLKFSDDVGPL-QYVSASDGAKII 170
+ + F L E+ L L+N + + D + + Q+V G +
Sbjct: 173 TKLVTSKTHPFLLLGDELGNIHVLNNDTLQLQNTIKSIHNGDAINDIFQFV----GKSVY 228
Query: 171 ITAGYGEKHLQVWRCDISSKTVNKGPALSM---RHSPVAID------CKNSPNGEDGTVI 221
G+ L W S+++ PA + R V+ D C + N EDG ++
Sbjct: 229 KFISLGQTTLAYWDSRESNESDASIPADDLDAKRKVYVSDDQEDEMICGSFVNPEDGDIL 288
Query: 222 LAVAESGVAYSW 233
+ GV W
Sbjct: 289 VCGMGEGVLTVW 300
>gi|358459346|ref|ZP_09169545.1| WD40 repeat-containing protein [Frankia sp. CN3]
gi|357077324|gb|EHI86784.1| WD40 repeat-containing protein [Frankia sp. CN3]
Length = 1532
Score = 37.7 bits (86), Expect = 7.0, Method: Composition-based stats.
Identities = 50/201 (24%), Positives = 76/201 (37%), Gaps = 22/201 (10%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
+LA S+ DV TG TG H GG+ +AF+ G +L G +
Sbjct: 1281 MLATAGSDRTTRLWDVATGREIRTLTG-HGGGVLAVAFSPDGNTLTTAGNDRTVRLWDVA 1339
Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYV 161
G R + S AF + A A S+ R+ L G+E FS G + V
Sbjct: 1340 TGRETRTLTGHRGVVWSVAFSPDGNALATAGSDGTARLWDLATGQETRTFSGHRGIVWSV 1399
Query: 162 SAS-DGAKIIITAGYGEKHLQVWRC----DISSKTVNKGPALSMRHSPVAIDCKNSPNGE 216
+ + DG + A G L W +I + ++ L + SP
Sbjct: 1400 AFTPDGGSLATAADDGVARL--WEVATGREIRTIAGHQDWLLGVAFSP------------ 1445
Query: 217 DGTVILAVAESGVAYSWDLKT 237
DG + A+ G A WD+++
Sbjct: 1446 DGRTLATAADDGTARLWDVES 1466
>gi|298706017|emb|CBJ29131.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 664
Score = 37.7 bits (86), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 43/163 (26%), Positives = 67/163 (41%), Gaps = 13/163 (7%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVGKKRRKERGTL---------LLALGTSN 51
IW+T G L ++ +P Y+C+A K+ K + ++ALGT
Sbjct: 34 IWNTASGDLRQQYVEPR-HLAKQYTCIAWHRRSSKKSKRGSSGASKPSSSLGMIALGTDK 92
Query: 52 GDILAVDVLTGEMKWKSTGRHPG--GLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIR 109
G + D+ G + + S G G + F+ G SL + E E G V R
Sbjct: 93 GSVSVWDLKRGALAY-SLGEGEGFPSVTSGGFSADGSSLFSASSGKEVVEWNVETGTVAR 151
Query: 110 EFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFS 152
+ K + + KI A+ASS +R+L L G+ K S
Sbjct: 152 KLKGFKHGATKICLHPAGKILAIASSAIRLLDLTTGKTARKLS 194
>gi|296817773|ref|XP_002849223.1| WD repeat protein [Arthroderma otae CBS 113480]
gi|238839676|gb|EEQ29338.1| WD repeat protein [Arthroderma otae CBS 113480]
Length = 414
Score = 37.7 bits (86), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 26/87 (29%), Positives = 38/87 (43%), Gaps = 11/87 (12%)
Query: 66 WKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKAS---EKPISSSA 122
W+ T RH G L F G L+ GT+G KSE G+V+ +F +KP+ +
Sbjct: 62 WR-TRRHKGSCRSLTFGIDGEMLYSAGTDGWVKAAKSETGKVVTKFAVPMPRDKPLHDTD 120
Query: 123 FLC-------EEKIFALASSEVRILSL 142
C + + A SS + I L
Sbjct: 121 MPCLLHALSPQTLLLATDSSALHIFDL 147
>gi|425438823|ref|ZP_18819165.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
gi|389717405|emb|CCH98490.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
Length = 1108
Score = 37.7 bits (86), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 49/199 (24%), Positives = 81/199 (40%), Gaps = 26/199 (13%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
++A+ +G I +D +++ +T + +AF + + G NG ++ S+
Sbjct: 842 IIAIANKDGQITLLDSQGKKIREFATKMR--SIYSIAFHPDSNQIAITGRNGKV-QIWSK 898
Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFS---DDVGPL 158
G +++EF AS+ PI S AF E +SE ++ L N L S DD
Sbjct: 899 KGTMLQEFTASQVPIYSLAFNGEGTAIITGTSEGKIQYWHLSNHRPQLINSWTADDNIIY 958
Query: 159 QYVSASDGAKIIITAGYGEKHLQVWRCD---ISSKTVNKGPALSMRHSPVAIDCKNSPNG 215
V + D KI TA G+ +++W + P + SP
Sbjct: 959 DLVFSPDHQKIA-TATRGK--IKIWDLQGNLFEEIKTDSFPVYGVSFSP----------- 1004
Query: 216 EDGTVILAVAESGVAYSWD 234
DG I A++ G A WD
Sbjct: 1005 -DGEKIAAISRDGTARRWD 1022
>gi|145346364|ref|XP_001417659.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577887|gb|ABO95952.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 534
Score = 37.7 bits (86), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 26/93 (27%), Positives = 45/93 (48%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
+LA G+ +G I V TG K H GG+ + F+K G + +G+ +
Sbjct: 285 MLASGSQDGKIKVWRVSTGTCLRKFEKAHQGGVTSVTFSKDGSQVLSGSFDGLVRVHGLK 344
Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSE 136
G++++EF+ ++S AF +E ASS+
Sbjct: 345 SGKLLKEFRGHTSYVNSVAFTEDETDVVSASSD 377
>gi|428318693|ref|YP_007116575.1| WD-40 repeat-containing protein [Oscillatoria nigro-viridis PCC
7112]
gi|428242373|gb|AFZ08159.1| WD-40 repeat-containing protein [Oscillatoria nigro-viridis PCC
7112]
Length = 547
Score = 37.7 bits (86), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 36/143 (25%), Positives = 67/143 (46%), Gaps = 6/143 (4%)
Query: 45 LALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEM 104
L G+ +G I ++ +G+ +G++ L+ LAF G+SL +NG +
Sbjct: 406 LVSGSKDGSIRLWNLASGQAIRTISGKNLSVLS-LAFTPDGKSLAAGNSNGTVGLWNAGN 464
Query: 105 GEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVS 162
G++IR + S AF + S + VR+ + +G+ S G + V+
Sbjct: 465 GQLIRRLSGHTDGVWSVAFSRDGTTLVTGSWDKSVRLWDVRSGDLRGTLSGHSGYVSAVA 524
Query: 163 ASDGAKIIITAGY-GEKHLQVWR 184
S K I++AG+ GE +++W+
Sbjct: 525 ISSDGKTIVSAGWLGE--IKIWK 545
>gi|260806123|ref|XP_002597934.1| hypothetical protein BRAFLDRAFT_79824 [Branchiostoma floridae]
gi|229283204|gb|EEN53946.1| hypothetical protein BRAFLDRAFT_79824 [Branchiostoma floridae]
Length = 415
Score = 37.7 bits (86), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 27/105 (25%), Positives = 50/105 (47%), Gaps = 4/105 (3%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
L A+G+ NG + D+ TG + K + HPGG+ LA+A+ G L+ +G+ S
Sbjct: 312 LAAVGSLNGTLGVWDIPTGVQRHKCS--HPGGIVRLAWAESGYLLYTSCLDGVTRIWDSR 369
Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGE 146
G +++ + I A + + +S + R+ +L+ E
Sbjct: 370 TGGTVQQLTGHSEEILDMAVSRDGNVVLTSSGDNTARVFNLQKPE 414
>gi|66813758|ref|XP_641058.1| hypothetical protein DDB_G0280575 [Dictyostelium discoideum AX4]
gi|60469093|gb|EAL67089.1| hypothetical protein DDB_G0280575 [Dictyostelium discoideum AX4]
Length = 1019
Score = 37.7 bits (86), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 36/150 (24%), Positives = 70/150 (46%), Gaps = 16/150 (10%)
Query: 98 SEMKSEMGEVIREFKASEKPISSSAF----LCEEKI--FA--LASSEVRILSLENGEEVL 149
+E E+G+++ E + K I S L + I FA ++S +++L + +
Sbjct: 473 NEFDDEVGQLMGEMLVTNKSIKSLDLSFNDLADNTIQGFAEAISSPSCALVNLNLSDCAM 532
Query: 150 KFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDC 209
++D G Q+ A G + I + L +W CD+SS+ V + ++++RH+ D
Sbjct: 533 SVNEDTGA-QFFQALAGNRSI-------QRLLLWSCDLSSQLVKEELSIAIRHNITITDL 584
Query: 210 KNSPNGEDGTVILAVAESGVAYSWDLKTVS 239
N + I + G+ ++ L+TVS
Sbjct: 585 SLGFNDLTSSDISIILHKGLKFNKTLQTVS 614
>gi|434394209|ref|YP_007129156.1| WD40 repeat-containing protein [Gloeocapsa sp. PCC 7428]
gi|428266050|gb|AFZ31996.1| WD40 repeat-containing protein [Gloeocapsa sp. PCC 7428]
Length = 356
Score = 37.7 bits (86), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 28/117 (23%), Positives = 57/117 (48%), Gaps = 3/117 (2%)
Query: 79 LAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSE-- 136
+AF+ ++L G + GE+ R A +K +++ +F + + FA AS +
Sbjct: 241 VAFSPDSKTLASGSNIGEVKLWNTSNGELRRTIAAHKKEVTAISFSSDGETFATASEDRV 300
Query: 137 VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVN 193
+R+ +++NGE V +D + V+ S + G ++ +++WR SS V+
Sbjct: 301 IRLWNIDNGEVVRNLADHSQGITCVAFSQNG-LNFATGSKDRTIKIWRVSPSSVPVS 356
>gi|162452827|ref|YP_001615194.1| protein kinase [Sorangium cellulosum So ce56]
gi|161163409|emb|CAN94714.1| Protein kinase [Sorangium cellulosum So ce56]
Length = 1529
Score = 37.7 bits (86), Expect = 8.4, Method: Composition-based stats.
Identities = 58/258 (22%), Positives = 109/258 (42%), Gaps = 34/258 (13%)
Query: 72 HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFA 131
HPGG+ G +++ G+ + + A K++M + E +SS+AF + +
Sbjct: 883 HPGGVMGASYSPDGKYVVTASLDRNARVWKADMTGEPLVLRGHENGVSSAAFSPDGRWIV 942
Query: 132 LAS--SEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISS 189
AS VR+ + E L D G + + S + I +A + ++ ++VW+ D +
Sbjct: 943 TASWDGTVRVWKADGTGEPLVLRDHEGRVNSAAFSPDGQRIASASH-DRTVRVWKADGTG 1001
Query: 190 KTV----NKGPALSMRHSP----VAIDCKN------SPNGEDGTVILAVAE---SGVAYS 232
+ + + P ++ SP +A + S +G V+L E GVA+S
Sbjct: 1002 EPLVLRGHDAPVYAVAFSPDGKRIATGSYDHTARVWSADGSGEPVVLRGHEHEVQGVAFS 1061
Query: 233 WD---LKTVSQDEKTNPAKITVKLKKADADQQNSVNVKKSRTSIIAARLNDLEADGQKTI 289
D L T S D+ T ++ KA+ + V ++ + +A + DG++ +
Sbjct: 1062 PDGQQLVTASSDK-------TARVWKANGTGEPLV-LRGHEAPVYSAAFS---PDGRRIV 1110
Query: 290 TGLHDLESEAATASAQNE 307
TG D + A E
Sbjct: 1111 TGSRDKTARVWKADGTGE 1128
>gi|426335145|ref|XP_004029093.1| PREDICTED: WD repeat-containing protein 43 [Gorilla gorilla
gorilla]
Length = 676
Score = 37.7 bits (86), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 47/220 (21%), Positives = 87/220 (39%), Gaps = 19/220 (8%)
Query: 34 KKRRKE-----RGTLLLALGTSNGDILAVDVLTGEMKWK-STGRHPGGLAGLAFAKKGRS 87
KKR+ E T LLALGT+ G IL + GE+ K +G H + + + +
Sbjct: 81 KKRKSEAVGTSNQTDLLALGTAVGSILLYSTVKGELHSKLISGGHDNRVNCIQWHQDSGC 140
Query: 88 LHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEE 147
L+ + E + +V ++K +SS + K+ A +++ LE E
Sbjct: 141 LYSCSDDKHIVEWNVQTCKVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLETKEI 200
Query: 148 VLKFSDDVGPLQYVSAS-----------DGAK--IIITAGYGEKHLQVWRCDISSKTVNK 194
F+ P+ + + DG ++ ++ L VW+ +K +
Sbjct: 201 YRHFTGHATPVSSLMFTTIRPPNESQPFDGITGLYFLSGAVHDRLLNVWQVRSENKEKSA 260
Query: 195 GPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWD 234
+ ++ PV ID S N E+ + V G + ++
Sbjct: 261 VMSFTVTDEPVYIDLTLSENKEEPVKLAVVCRDGQVHLFE 300
>gi|393214198|gb|EJC99691.1| WD40 repeat-like protein [Fomitiporia mediterranea MF3/22]
Length = 1229
Score = 37.7 bits (86), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 52/240 (21%), Positives = 94/240 (39%), Gaps = 24/240 (10%)
Query: 1 IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVL 60
IW+ G +L E+ + +G V S F RR +A GT G I D+
Sbjct: 611 IWNVESGEVLCEFSEGNGAEVNSVV-----FSPDGRR-------IAFGTCRGTISIWDIE 658
Query: 61 TGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISS 120
+ E+ H G + G+AF+ G + + E +R + + S
Sbjct: 659 SKELVSGPFKGHTGSVRGVAFSPDGMHITSGSADTTIRVWDIEKASTLRVLEGHTASVWS 718
Query: 121 SAFLCEEKIFALASSE--VRILSLENGEEVLK-FSDDVGPLQYVSASDGAKIIITAGYGE 177
AF + S + +R+ E G+ + K F +Q V+ S K I++ G +
Sbjct: 719 VAFSSDGNCIVSGSEDKTLRVWDPETGQAIGKPFVGHTDGVQCVAISPDCKCIVS-GSND 777
Query: 178 KHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKT 237
++VW + S+ V GP + K+ DG +++ ++ WD+++
Sbjct: 778 FTVRVW--GMESEKVVAGPFWHLTF------VKSVAFSSDGRRVVSASDDFSIVVWDMES 829
>gi|383778350|ref|YP_005462916.1| hypothetical protein AMIS_31800 [Actinoplanes missouriensis 431]
gi|381371582|dbj|BAL88400.1| hypothetical protein AMIS_31800 [Actinoplanes missouriensis 431]
Length = 405
Score = 37.7 bits (86), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 49/205 (23%), Positives = 86/205 (41%), Gaps = 23/205 (11%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
LLA +G + + +G+ TG H GG+ + F+ G+ L G G ++
Sbjct: 158 LLATAGDDGTVRLWEAASGDPVRTLTG-HTGGVRSVVFSPDGKLLVTAGATGSDKTTDTD 216
Query: 104 MGEVIREFKA-SEKPISSSAFLCEEKIFALA-SSEVRILSLENGEEVLKFSD------DV 155
+R + + +P+ + + +FA++ + +L+ G ++ D D
Sbjct: 217 DDNTVRLWHTRTGRPVGKALSISPGPVFAVSFTPGGELLATSGGGRAIRLWDTATATPDG 276
Query: 156 GPLQYVSA-------SDGAKIIITAGYGEKHLQVW-----RCDISSKTVNKGPALSMRHS 203
PL SA S G K++ T+ G+ +Q+W R + T + GP +MR S
Sbjct: 277 DPLVSSSAPVFALTFSAGGKLLATSSGGDNTVQLWDTASRRPATAPLTGHTGPVRAMRFS 336
Query: 204 PVAIDCKNSPNGEDGTVILAVAESG 228
P +DGTV L A +G
Sbjct: 337 PGGNLLAT--GSDDGTVRLWNAATG 359
>gi|387127265|ref|YP_006295870.1| cell surface protein/ lipoprotein [Methylophaga sp. JAM1]
gi|386274327|gb|AFI84225.1| putative cell surface protein/ lipoprotein [Methylophaga sp. JAM1]
Length = 346
Score = 37.4 bits (85), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 28/104 (26%), Positives = 42/104 (40%), Gaps = 3/104 (2%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
+L +G G + AVD+ TGE W G + A K + + T+G + E
Sbjct: 204 MLYVGNDAGVLFAVDLQTGEPVWL---FESGSRMFASPALKNQQIIFAATDGRVFALNKE 260
Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEE 147
GE +F A SS A E+ + L + NGE+
Sbjct: 261 SGEPFWQFDAQATIFSSPAIAGEQIYIGAMDGYLYALDINNGEQ 304
>gi|123966361|ref|YP_001011442.1| WD-40 repeat-containing G-protein [Prochlorococcus marinus str. MIT
9515]
gi|123200727|gb|ABM72335.1| G-protein beta WD-40 repeats [Prochlorococcus marinus str. MIT
9515]
Length = 357
Score = 37.4 bits (85), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 49/215 (22%), Positives = 87/215 (40%), Gaps = 25/215 (11%)
Query: 41 GTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEM 100
G L +G G + + TG++ W+ H GGL ++ G G +G+
Sbjct: 33 GGKLFIVGDVAGGVYGFEGDTGKIVWEKNEAHSGGLLAMSIHPNGNIFATSGQDGIVRIC 92
Query: 101 KSEMGEVIREFKASEKPISSSAFLCEEKIFALA-SSEVRILSLENGEEVLKFSDDVGPLQ 159
GE I++ K + + ++ + A A S +V +L+ E E + + +
Sbjct: 93 NCNNGEKIKDIKLGKCWVEHLSWSNDGSFLAAAFSKKVYVLNQEGNENWVS-EEHPSTVS 151
Query: 160 YVSASDGAKIIITAGYGEKHLQVWRCDISSKTVN-----KGPALSMRHSPVAIDCKNSPN 214
+S S+ + + TA YG QV D+++ N +G +SM+ SP
Sbjct: 152 AISWSNKNE-LATACYG----QVTFFDVANNKTNQKLKWQGSLVSMKLSP---------- 196
Query: 215 GEDGTVILAVAESGVAYSWDLKTVSQDEKTN-PAK 248
DG ++ ++ + W T E T PAK
Sbjct: 197 --DGDIVACGSQDNSVHFWRRSTGQDAEMTGYPAK 229
>gi|452523|dbj|BAA05499.1| KIAA0007 [Homo sapiens]
Length = 686
Score = 37.4 bits (85), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 47/220 (21%), Positives = 87/220 (39%), Gaps = 19/220 (8%)
Query: 34 KKRRKE-----RGTLLLALGTSNGDILAVDVLTGEMKWK-STGRHPGGLAGLAFAKKGRS 87
KKR+ E T LLALGT+ G IL + GE+ K +G H + + + +
Sbjct: 90 KKRKSEAVGMSNQTDLLALGTAVGSILLYSTVKGELHSKLISGGHDNRVNCIQWHQDSGC 149
Query: 88 LHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEE 147
L+ + E + +V ++K +SS + K+ A +++ LE E
Sbjct: 150 LYSCSDDKHIVEWNVQTCKVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLETKEV 209
Query: 148 VLKFSDDVGPLQYVSAS-----------DGAK--IIITAGYGEKHLQVWRCDISSKTVNK 194
F+ P+ + + DG ++ ++ L VW+ +K +
Sbjct: 210 YRHFTGHATPVSSLMFTTIRPPNESQPFDGITGLYFLSGAVHDRLLNVWQVRSENKEKSA 269
Query: 195 GPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWD 234
+ ++ PV ID S N E+ + V G + ++
Sbjct: 270 VMSFTVTDEPVYIDLTLSENKEEPVKLAVVCRDGQVHLFE 309
>gi|427414552|ref|ZP_18904739.1| WD40 repeat-containing protein [Leptolyngbya sp. PCC 7375]
gi|425755696|gb|EKU96559.1| WD40 repeat-containing protein [Leptolyngbya sp. PCC 7375]
Length = 1494
Score = 37.4 bits (85), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 31/141 (21%), Positives = 64/141 (45%), Gaps = 4/141 (2%)
Query: 45 LALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEM 104
LA+G + D++TG++ W G H +A + F+ G+ L + + +
Sbjct: 881 LAIGDFKNTVQIWDIVTGQVVWFCLG-HSDWVASVTFSSDGKLLASGSDDHVVKLWSTNS 939
Query: 105 GEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVS 162
G+ IR F + S AF + K AS + +++ +E+G+ + F + V+
Sbjct: 940 GKCIRTFTGHSGWVLSVAFSSDTKTLVSASKDHTIKLWCIESGKCLRTFEGHSDSVWSVA 999
Query: 163 ASDGAKIIITAGYGEKHLQVW 183
S K + +G ++ +++W
Sbjct: 1000 ISPDGK-TLASGSRDRTIKLW 1019
>gi|145348642|ref|XP_001418755.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578985|gb|ABO97048.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 461
Score = 37.4 bits (85), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 48/215 (22%), Positives = 84/215 (39%), Gaps = 19/215 (8%)
Query: 42 TLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMK 101
T LL+ G ++G ++ D K+ H GG+ ++ + G+ G + +
Sbjct: 107 TRLLS-GGADGALMVWDASDNFELLKTMRAHRGGVCAISAHRSGKVALTSGCDAHVAMWD 165
Query: 102 SEMGEVIREFKASEKPISSSAFLCE-EKIFALASSEVRILSLENGEEVLKFS-------- 152
G V +FK E+ + AF C+ + + + ++ + +E G V F+
Sbjct: 166 MRRGRVAYKFKTPER-VEGLAFTCDGSEYVSRLTQKITLTDVEAGSVVTSFTTPAKTLTF 224
Query: 153 DDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNS 212
D G + YV G ++ A K V R + +G A++ +
Sbjct: 225 DARGRMVYVGCEGGDVLVYDARMAAKEGAVSRIAKAHPQRVRGLAITSAKGDETV----- 279
Query: 213 PNGEDGTVILAVAES-GVAYSWDLKTVSQDEKTNP 246
G+ G L A S GV +WDL+ S NP
Sbjct: 280 --GDSGPSALVTAGSEGVVRAWDLRRASGPRDENP 312
>gi|347441927|emb|CCD34848.1| similar to prolyl oligopeptidase [Botryotinia fuckeliana]
Length = 446
Score = 37.4 bits (85), Expect = 9.4, Method: Compositional matrix adjust.
Identities = 34/142 (23%), Positives = 65/142 (45%), Gaps = 4/142 (2%)
Query: 44 LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
++A G+ + I DV TGE K G H + +AF+ G+ + +
Sbjct: 102 VVASGSYDKTIRLWDVATGESLQKLEG-HSHWVNSVAFSSDGKVVASGSNDNTIRLWDVA 160
Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYV 161
GE ++ F+ K ++S AF + K+ A S + +R+ + GE + F ++ V
Sbjct: 161 TGESVQTFEGHSKWVNSVAFSPDGKVVASGSYDETIRLWDVATGESLQTFEGHSESVKSV 220
Query: 162 SASDGAKIIITAGYGEKHLQVW 183
+ S K++ + Y E +++W
Sbjct: 221 AFSPDGKVVASGSYDET-IRLW 241
>gi|282901415|ref|ZP_06309340.1| WD-40 repeat protein [Cylindrospermopsis raciborskii CS-505]
gi|281193694|gb|EFA68666.1| WD-40 repeat protein [Cylindrospermopsis raciborskii CS-505]
Length = 1341
Score = 37.4 bits (85), Expect = 9.4, Method: Composition-based stats.
Identities = 31/114 (27%), Positives = 57/114 (50%), Gaps = 5/114 (4%)
Query: 72 HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFA 131
H GG+ + F+ G+S+ +G A + + GE I++F E ++S +F + +I A
Sbjct: 882 HEGGVTSICFSPDGQSIGTGSEDGTA-RLWNLQGENIQQFHGHEDWVTSVSFSPDGQILA 940
Query: 132 LASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVW 183
S + VR+ +L+ GE + +F + VS S K + T +K ++W
Sbjct: 941 TTSVDKTVRLWNLQ-GETIQQFHGHENWVTSVSFSPDGKTLATTSV-DKTARLW 992
>gi|406836180|ref|ZP_11095774.1| hypothetical protein SpalD1_31214 [Schlesneria paludicola DSM
18645]
Length = 998
Score = 37.4 bits (85), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 32/152 (21%), Positives = 72/152 (47%), Gaps = 13/152 (8%)
Query: 44 LLALG----TSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASE 99
LLA G + +G++ +V T + + H + GL F++ G+SL + +
Sbjct: 786 LLATGGGEPSRSGELFLWNVETRSVAKQFVDAHSDTVFGLDFSRDGKSLVSGAADKFVKQ 845
Query: 100 MKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEE---VLKFSDD 154
E G+++R F+ + + + A A ++ +++ ++E GE+ + +S
Sbjct: 846 FDVESGKLVRSFEGHTHHVLGVTWKGDGSRIASAGADNAIKVWNVETGEQHRTIQNYSKQ 905
Query: 155 VGPLQYVSASDGAKIIITAGYGEKHLQVWRCD 186
V + ++ ASD + +G G+K +++ R +
Sbjct: 906 VTAIHFIGASDN----LISGSGDKTVKMHRSN 933
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.311 0.128 0.362
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,260,860,934
Number of Sequences: 23463169
Number of extensions: 208945587
Number of successful extensions: 535335
Number of sequences better than 100.0: 532
Number of HSP's better than 100.0 without gapping: 61
Number of HSP's successfully gapped in prelim test: 471
Number of HSP's that attempted gapping in prelim test: 534697
Number of HSP's gapped (non-prelim): 971
length of query: 340
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 197
effective length of database: 9,003,962,200
effective search space: 1773780553400
effective search space used: 1773780553400
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 77 (34.3 bits)