BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 036605
         (340 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|297743397|emb|CBI36264.3| unnamed protein product [Vitis vinifera]
          Length = 637

 Score =  370 bits (951), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 195/374 (52%), Positives = 248/374 (66%), Gaps = 47/374 (12%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVGKKRRKERGT-LLLALGTSNGDILAVDV 59
           IW+T+  SL+AEWK PD +  + YSC+ C FVGKKRRKERG  LLLAL T++G+IL VD+
Sbjct: 32  IWNTSYQSLVAEWKGPDVDSGIGYSCMTCSFVGKKRRKERGACLLLALVTNDGNILVVDI 91

Query: 60  LTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPIS 119
             GEMKW+S+  HPGG+ GL+FA KGR LHVVGTNGM S +KSE GE+I EFKAS+KPIS
Sbjct: 92  FAGEMKWESSRYHPGGIVGLSFANKGRILHVVGTNGMVSTLKSENGELIGEFKASKKPIS 151

Query: 120 SSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKH 179
           S AF  +EKI A+A S++R+LSLENG+E+LKF DD+  +Q +  S+ A  I+T+G+GEKH
Sbjct: 152 SLAFSSDEKILAIAGSKIRVLSLENGKELLKFPDDLESVQSIWISNDANTIVTSGFGEKH 211

Query: 180 LQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVS 239
           LQVW CD++ KTV+KGP LSM+H PV  +CK+  N +DG VIL+V+ SG AY W+LKT S
Sbjct: 212 LQVWNCDLNRKTVSKGPVLSMKHPPVVFECKHGWNEDDGLVILSVSSSGTAYLWNLKTTS 271

Query: 240 QDEKTNPAKITVKLKKADADQQNSVNVKKSRTSIIAARLNDLEADGQKT----------- 288
             E  NP KI VK K+A+ DQ++S + KKSRTSII ARLN L++DG  T           
Sbjct: 272 AQE-YNPTKIMVKTKEAEIDQKHSGSTKKSRTSIITARLNALQSDGWVTALVCYGSINSP 330

Query: 289 ----------------------------------ITGLHDLESEAATASAQNEKKSKKRA 314
                                             +    DLE EA     QN+K +KKRA
Sbjct: 331 QFNLLEISNPGEDIVVAATDNIVKTVAEAGRENGVLAGKDLELEAVAEPIQNKKSNKKRA 390

Query: 315 ASDPDLETTRDVVD 328
           ASDPDL    ++VD
Sbjct: 391 ASDPDLAAAENMVD 404


>gi|255553081|ref|XP_002517583.1| conserved hypothetical protein [Ricinus communis]
 gi|223543215|gb|EEF44747.1| conserved hypothetical protein [Ricinus communis]
          Length = 634

 Score =  349 bits (896), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 171/291 (58%), Positives = 220/291 (75%), Gaps = 4/291 (1%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVL 60
           IWST +GSLLAEWKQ D + V SYSC+AC FVGKKR++E  T LLALGT  GDILA D  
Sbjct: 31  IWSTGNGSLLAEWKQSDDDSV-SYSCMACSFVGKKRKQEHSTFLLALGTHGGDILATDAF 89

Query: 61  TGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISS 120
           TG+MKWKS+G HPGG+ GLAFA KGRSL  VGTNG ASEM S+ GE+  EFKAS++PISS
Sbjct: 90  TGDMKWKSSGHHPGGVIGLAFANKGRSLRSVGTNGKASEMNSKSGELTVEFKASKRPISS 149

Query: 121 SAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHL 180
            AF  +E++ A  SS  R+LSL NG+E+ KF DD+G +Q++S S  AK I+T G+GEK L
Sbjct: 150 LAFSSDEEVLAAVSSRARLLSLVNGKELFKFPDDLGTVQHISISKSAKNIVTTGFGEKIL 209

Query: 181 QVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPN--GEDGTVILAVAESGVAYSWDLKTV 238
            +WRC++SSK+++ GP LSMRH P+ I+CK++ +   EDG ++L+V+ESGV Y W+LKT 
Sbjct: 210 HLWRCNLSSKSLSAGPVLSMRHPPLVIECKSTGDEKEEDGLIVLSVSESGVVYVWNLKTS 269

Query: 239 SQDEKTNPAKITVKLKKADADQQNSVNVKKSRTSIIAARLNDLEADGQKTI 289
           SQDE  NP KITVK  K++ +Q N    KK R SI+AA+L+D++ D Q T+
Sbjct: 270 SQDE-VNPTKITVKGSKSETNQHNGDYSKKHRPSILAAQLHDIKDDKQFTV 319


>gi|356528954|ref|XP_003533062.1| PREDICTED: WD repeat-containing protein 43-like [Glycine max]
          Length = 620

 Score =  291 bits (745), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 159/361 (44%), Positives = 221/361 (61%), Gaps = 37/361 (10%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVL 60
           IW+TN G LLAEWK  DG+  + YSC+AC F+GKKRRKE+GT LLALGT +G +LAVDV 
Sbjct: 32  IWNTNTGHLLAEWKPSDGDHDIRYSCIACSFIGKKRRKEQGTCLLALGTIDGSVLAVDVS 91

Query: 61  TGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISS 120
           TGE K   T  HPGG+ GL+FA KGR L +VG NG+A  + +E GE+++EFK ++K I+S
Sbjct: 92  TGERKL--TTSHPGGICGLSFANKGRLLRIVGHNGVAYGVNTETGELLKEFKITKKSITS 149

Query: 121 SAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHL 180
            AF  +EK  A+ SS  R++S E GEE+LKF +D+G ++++S S  AK ++T+ +  KHL
Sbjct: 150 LAFSNDEKYLAIVSSRPRVISWEIGEEILKFPNDLGNVEHISISSDAKNLVTSDFEGKHL 209

Query: 181 QVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQ 240
           QVW+CD++S  + +GP L +RH P+ +DC +  N ED  V+LAV   G AY W+L   S+
Sbjct: 210 QVWKCDLNSGNLCRGPTLPIRHPPLVLDCHSGCNKED-VVVLAVTGRGSAYIWNLSASSK 268

Query: 241 DEKTNPAKITVKLKKADADQQNSVNVKKSRTSIIAARLNDLEADGQ-------------- 286
           D+   P K+T + K  + D++N V+ KK  T IIA+RL  +E + Q              
Sbjct: 269 DQ-IQPTKLTTETKIVETDKENGVSSKKRHTLIIASRLQPVEENKQMKALVTYGSVDHPQ 327

Query: 287 ------------------KTITGLHDLESEAATASAQNEKKSKKR-AASDPDLETTRDVV 327
                                  +   +S +  A     KK+KKR A SDPDL TT + V
Sbjct: 328 FSILNISNSGENIVLYVGDETDSVQQHDSPSGKAIPMESKKAKKRQATSDPDLPTTTNEV 387

Query: 328 D 328
           D
Sbjct: 388 D 388


>gi|356522359|ref|XP_003529814.1| PREDICTED: WD repeat-containing protein 43-like [Glycine max]
          Length = 622

 Score =  284 bits (727), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 142/286 (49%), Positives = 196/286 (68%), Gaps = 4/286 (1%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVL 60
           IW+TN G LLAEWK   G+  + YSC+AC F+G+K RKE+GT LLAL T +G ++AVDV 
Sbjct: 32  IWNTNTGHLLAEWKPSKGDHDIHYSCIACSFIGEKHRKEQGTSLLALSTIDGSVIAVDVS 91

Query: 61  TGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISS 120
           TGE K   T  HPGG+ GL+FA KGR L +VG NG+A E+ +E GEV++EFK S+K I+S
Sbjct: 92  TGERKL--TTSHPGGICGLSFANKGRLLRIVGHNGVAYEVNTETGEVLKEFKISKKSITS 149

Query: 121 SAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHL 180
            AF  +EK  A+ SS +RI+S E G+E+LKF +D+G +Q +S S  AK ++T+ +  KHL
Sbjct: 150 LAFSNDEKYLAIVSSRLRIISWEIGKEILKFPNDLGNVQLISISSDAKNLVTSDFEGKHL 209

Query: 181 QVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQ 240
           QVW+CD++S  V +GP L +RH P+ +DC +  N ED  V+LAV   G AY W+L   S+
Sbjct: 210 QVWKCDLNSGNVGRGPTLPIRHPPLILDCHSGCNKED-VVVLAVTGRGSAYIWNLNASSE 268

Query: 241 DEKTNPAKITVKLKKADADQQNSVNVKKSRTSIIAARLNDLEADGQ 286
           D+   P K+  K K  + +++N V+ KK   SIIA+RL  +E D Q
Sbjct: 269 DQ-IQPTKLNTKTKIVETEKENGVSSKKRHASIIASRLQPVEEDKQ 313


>gi|449436601|ref|XP_004136081.1| PREDICTED: WD repeat-containing protein 43-like [Cucumis sativus]
 gi|449491125|ref|XP_004158808.1| PREDICTED: WD repeat-containing protein 43-like [Cucumis sativus]
          Length = 624

 Score =  252 bits (644), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 140/279 (50%), Positives = 197/279 (70%), Gaps = 3/279 (1%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVL 60
           IWST DGSLLAEWK  DG+    YSC+AC F+GKKR+      ++A+GT++GD+LAV+  
Sbjct: 32  IWSTRDGSLLAEWKDLDGKNDFGYSCMACCFLGKKRKS--SYCVVAIGTNSGDVLAVNAS 89

Query: 61  TGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISS 120
            GE KW S G HPGG+ GL+FA KG  L  VG+NGMASEM +E G +I+EFKAS+K ISS
Sbjct: 90  NGEKKWVSAGCHPGGVIGLSFANKGCRLRTVGSNGMASEMDTETGNIIKEFKASKKSISS 149

Query: 121 SAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHL 180
           SAF  +E+   +A  +++ILS ++G+E++   D +GP++ VS SD AK IIT+  G KHL
Sbjct: 150 SAFSLDERYLVVAGKKLKILSTDDGDELIVHPDKLGPVKLVSVSDDAKTIITSELGAKHL 209

Query: 181 QVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQ 240
           QVW C+IS+   ++GP LSM+H P   +C+N  N ED  V+L+V+ SG AY W LK +S+
Sbjct: 210 QVWWCNISAGKFSRGPILSMKHPPFVSECRNVSNQEDSVVVLSVSVSGAAYLWKLKVLSE 269

Query: 241 DEKTNPAKITVKLKKADADQQNSVNVKKSRTSIIAARLN 279
           DE T P K++VK     + ++N  + KK+R S++A+R++
Sbjct: 270 DEVT-PTKVSVKANDNQSAEENHGSAKKNRASVLASRIH 307


>gi|359482669|ref|XP_003632804.1| PREDICTED: uncharacterized WD repeat-containing protein
           all2124-like [Vitis vinifera]
          Length = 234

 Score =  249 bits (635), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 119/192 (61%), Positives = 152/192 (79%), Gaps = 1/192 (0%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVGKKRRKERGT-LLLALGTSNGDILAVDV 59
           IW+T+  SL+AEWK PD +  + YSC+ C FVGKKRRKERG  LLLAL T++G+IL VD+
Sbjct: 32  IWNTSYQSLVAEWKGPDVDSGIGYSCMTCSFVGKKRRKERGACLLLALVTNDGNILVVDI 91

Query: 60  LTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPIS 119
             GEMKW+S+  HPGG+ GL+FA KGR LHVVGTNGM S +KSE GE+I EFKAS+KPIS
Sbjct: 92  FAGEMKWESSRYHPGGIVGLSFANKGRILHVVGTNGMVSTLKSENGELIGEFKASKKPIS 151

Query: 120 SSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKH 179
           S AF  +EKI A+A S++R+LSLENG+E+LKF DD+  +Q +  S+ A  I+T+G+GEKH
Sbjct: 152 SLAFSSDEKILAIAGSKIRVLSLENGKELLKFPDDLESVQSIWISNDANTIVTSGFGEKH 211

Query: 180 LQVWRCDISSKT 191
           LQVW CD++ KT
Sbjct: 212 LQVWNCDLNRKT 223


>gi|147767919|emb|CAN62455.1| hypothetical protein VITISV_028174 [Vitis vinifera]
          Length = 282

 Score =  245 bits (626), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 129/254 (50%), Positives = 167/254 (65%), Gaps = 31/254 (12%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVGKKRRKERGT-LLLALGTSNGDILAVDV 59
           IW+T+  SL+AEWK PD +  +SYSC+ C FVGKKRRKE G  LLLAL T++G IL VD+
Sbjct: 32  IWNTSYQSLVAEWKGPDVDSSISYSCMTCSFVGKKRRKEHGACLLLALVTNDGSILVVDI 91

Query: 60  LTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPIS 119
             GEMKW+S+  HPGG+ GL+FA KGR LHVVGTNGM S +KSE GE+IREFKAS+KPIS
Sbjct: 92  FAGEMKWESSRYHPGGIVGLSFANKGRILHVVGTNGMVSTLKSENGELIREFKASKKPIS 151

Query: 120 SSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKH 179
           S AF  +EKI A+A S++R+LSLENG+E+LKF +D+  +Q +  S+ A  I+T       
Sbjct: 152 SLAFSSDEKILAIAGSKIRVLSLENGKELLKFPNDLEFVQSIWISNDANTIVT------- 204

Query: 180 LQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVS 239
                                 H P   +CK+  N + G VIL+V+  G+AY W+LKT S
Sbjct: 205 ----------------------HPPAVFECKHGWNEDGGLVILSVSSLGIAYLWNLKTTS 242

Query: 240 QDEKTNPAKITVKL 253
             +  NP KI + L
Sbjct: 243 A-QGYNPTKIMLLL 255


>gi|147856060|emb|CAN80733.1| hypothetical protein VITISV_024551 [Vitis vinifera]
          Length = 546

 Score =  197 bits (500), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 97/201 (48%), Positives = 135/201 (67%), Gaps = 27/201 (13%)

Query: 49  TSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVI 108
           T++G IL V++  GEMKW+S+G HPGG+  L+FA KGR LHVVGTNGM            
Sbjct: 165 TNDGSILVVNIFAGEMKWESSGYHPGGIVRLSFANKGRILHVVGTNGM------------ 212

Query: 109 REFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAK 168
                         F  ++KI A+ASS++R+LSLENG+E+LKF DD+  +Q +  S+ A 
Sbjct: 213 --------------FFRDKKILAIASSKIRVLSLENGKELLKFPDDLESVQSIWISNDAN 258

Query: 169 IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESG 228
            I+T+G+GEKHLQVW  D++SKT++KG  LSM+H PV  +CK+  N +DG VIL+V+ SG
Sbjct: 259 TIVTSGFGEKHLQVWNYDLNSKTISKGHVLSMKHLPVVFECKHGWNEDDGLVILSVSSSG 318

Query: 229 VAYSWDLKTVSQDEKTNPAKI 249
           + Y W+LKT +  ++ NP KI
Sbjct: 319 ITYLWNLKT-TLAQRYNPTKI 338


>gi|115438873|ref|NP_001043716.1| Os01g0649000 [Oryza sativa Japonica Group]
 gi|55296883|dbj|BAD68336.1| unknown protein [Oryza sativa Japonica Group]
 gi|113533247|dbj|BAF05630.1| Os01g0649000 [Oryza sativa Japonica Group]
 gi|125571385|gb|EAZ12900.1| hypothetical protein OsJ_02821 [Oryza sativa Japonica Group]
          Length = 607

 Score =  187 bits (475), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 105/285 (36%), Positives = 165/285 (57%), Gaps = 14/285 (4%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVL 60
           +W T  G ++  W  P   P  SYSC+AC  V KK +K+   +L+A+GT+NG +L +D  
Sbjct: 34  VWRTGGGEIIEGWTDPISAPDDSYSCIACCSVQKKHKKDGNLILVAVGTTNGQVLVLDS- 92

Query: 61  TGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISS 120
           TG + WK+   H   +  L FA+ GR L+  G +G+  E+ S  GE     KA++KPI+S
Sbjct: 93  TGVI-WKN-APHTCKVVSLHFARHGRVLYTAGMDGIICELNSRTGESKDTIKATKKPINS 150

Query: 121 SAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHL 180
                +EK   ++S   R+ S+   +E+L+   DVGP+Q +S SD  + +++     K +
Sbjct: 151 FTLSHDEKFMGVSSKITRLFSVSEKKEILRIPSDVGPVQLMSVSDDGRFLVSHVDNNKEV 210

Query: 181 QVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQ 240
           QVW CD +S T+    +L+M++ P  ++C  S +  DG ++LAV++ GVA+ W L+T+SQ
Sbjct: 211 QVWSCDQNSCTIVSTASLTMQNQPKIVECTRSTSYGDGGIVLAVSKKGVAHVWHLQTLSQ 270

Query: 241 DEKTNPAKITVKLKKADADQQNSVNVKKSRTSIIAARLNDLEADG 285
           +E   P KI+VK         NS++ KK R  II+A+L D   D 
Sbjct: 271 NE-VLPTKISVK---------NSLD-KKGRIPIISAKLCDTNEDN 304


>gi|125527065|gb|EAY75179.1| hypothetical protein OsI_03069 [Oryza sativa Indica Group]
          Length = 607

 Score =  187 bits (475), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 105/285 (36%), Positives = 165/285 (57%), Gaps = 14/285 (4%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVL 60
           +W T  G ++  W  P   P  SYSC+AC  V KK +K+   +L+A+GT+NG +L +D  
Sbjct: 34  VWRTGGGEIIEGWTDPISAPDDSYSCIACCSVQKKHKKDGNLILVAVGTTNGQVLVLDS- 92

Query: 61  TGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISS 120
           TG + WK+   H   +  L FA+ GR L+  G +G+  E+ S  GE     KA++KPI+S
Sbjct: 93  TGVI-WKN-APHTCKVVSLHFARHGRVLYTAGMDGIICELNSRTGESKDTIKATKKPINS 150

Query: 121 SAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHL 180
                +EK   ++S   R+ S+   +E+L+   DVGP+Q +S SD  + +++     K +
Sbjct: 151 FTLSHDEKFMGVSSKITRLFSVSEKKEILRIPSDVGPVQLMSVSDDGRFLVSHVDNNKEV 210

Query: 181 QVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQ 240
           QVW CD +S T+    +L+M++ P  ++C  S +  DG ++LAV++ GVA+ W L+T+SQ
Sbjct: 211 QVWSCDQNSCTIVSTASLTMQNQPKIVECTRSTSYGDGGIVLAVSKKGVAHVWHLQTLSQ 270

Query: 241 DEKTNPAKITVKLKKADADQQNSVNVKKSRTSIIAARLNDLEADG 285
           +E   P KI+VK         NS++ KK R  II+A+L D   D 
Sbjct: 271 NE-VLPTKISVK---------NSLD-KKGRIPIISAKLCDTNEDN 304


>gi|359477163|ref|XP_002271327.2| PREDICTED: WD repeat-containing protein 43-like [Vitis vinifera]
 gi|296083309|emb|CBI22945.3| unnamed protein product [Vitis vinifera]
          Length = 628

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 97/295 (32%), Positives = 162/295 (54%), Gaps = 24/295 (8%)

Query: 1   IWSTNDGSLLAE------------WKQPD-GEPVVSYSCLA-CGFVGKKRRKERGTLLLA 46
           IW T  G +  E            + +P+ G   V Y+CL       KK+RK R +LL+ 
Sbjct: 31  IWDTLKGQVQTEFADIISTDTTNLYSKPERGHLSVDYTCLKWLSLDSKKKRKLRCSLLV- 89

Query: 47  LGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGE 106
           LGT +GD+LA+DV  GE+KW+    HPGG++ ++ +  G  ++  G +GM S++ S  G 
Sbjct: 90  LGTGSGDVLALDVSAGELKWRIGDCHPGGVSAISCSTDGSCIYAAGADGMISKIDSSAGN 149

Query: 107 VIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDG 166
           ++ +F+AS K ISS +   + K+ A A+++++IL+  + +++ KFS   G ++ +  +D 
Sbjct: 150 LLGKFRASTKAISSMSVSSDGKVLATAAAQLKILNSSDHKKIQKFSGHPGAVRCMIFTDD 209

Query: 167 AKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGE---DGTVILA 223
            + I+++  GE+++ VWR D  SK  +    L+M H  V +D +   NG+    G  +LA
Sbjct: 210 GEYILSSAIGERYIAVWRID-GSKKQSASCVLAMEHPAVFLDSRCIDNGDTDKSGLYVLA 268

Query: 224 VAESGVAYSWDLKTVSQDEKTNPAKITVKLKKADADQQNSVNVKKSRTSIIAARL 278
           ++E GV Y W  K +     T P K++V L     +   S N K S  +I AA+L
Sbjct: 269 ISEIGVCYFWYAKNMEDLRNTKPTKVSVSL-----EDNFSKNHKGSLPTIFAAKL 318


>gi|242041231|ref|XP_002468010.1| hypothetical protein SORBIDRAFT_01g037980 [Sorghum bicolor]
 gi|241921864|gb|EER95008.1| hypothetical protein SORBIDRAFT_01g037980 [Sorghum bicolor]
          Length = 633

 Score =  150 bits (378), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 84/265 (31%), Positives = 146/265 (55%), Gaps = 11/265 (4%)

Query: 18  GEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLA 77
           G   + Y+C+    +  K++++ G+ L+ LGT +GD+LA+DV  G  KWK +  HPGG+ 
Sbjct: 64  GHLALDYTCMKWVQLSSKKKRKAGSSLIVLGTGSGDVLALDVAAGNWKWKVSDCHPGGVT 123

Query: 78  GLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEV 137
            +A++K GRS++  G +GM   + +  G V+ +FK+S K IS+ A   +  + A A+ ++
Sbjct: 124 AVAYSKHGRSVYTAGADGMVCRIDASDGAVVGKFKSSSKAISALAVSSDGNVLATAAGQL 183

Query: 138 RILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPA 197
           R     + +++ KFS     ++ +  S+ ++ ++++G GE+++ +W+   S KT +    
Sbjct: 184 RTFDTSSNKKIKKFSGHPVAVRSMVFSNDSQYVLSSGVGERYIAIWKLG-SGKTQSSNCI 242

Query: 198 LSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQDEKTNPAKITVKLKKAD 257
           LSM H  + +DCK S  GE  T ILA++E GV Y W    V       P KI +      
Sbjct: 243 LSMEHPAIFVDCKCSDEGE--THILAISEIGVCYFWSANDVDDLRNKKPTKIIL------ 294

Query: 258 ADQQNSVNVKKSRTSIIAARLNDLE 282
              ++SV       SI AA+L  ++
Sbjct: 295 --SESSVPTVHQAFSIFAAKLQGVD 317


>gi|449531051|ref|XP_004172501.1| PREDICTED: uncharacterized LOC101213788, partial [Cucumis sativus]
          Length = 547

 Score =  148 bits (374), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 98/310 (31%), Positives = 161/310 (51%), Gaps = 21/310 (6%)

Query: 1   IWSTNDGSLLAEW-------------KQPDGEPVVSYSCLACGFVGKKRRKERGTLLLAL 47
           IW T  G +  E+             K   G   V Y C+    + KKR+++R  LLL L
Sbjct: 31  IWDTLKGQIQTEFADFFTSDSTSILTKPEKGHLSVDYKCMKWLSLEKKRKRKRQCLLLLL 90

Query: 48  GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
           GT +GD+LA+DV  GE+KWK +  HPGG+A ++F   G  ++  G +GM  E+ S  G +
Sbjct: 91  GTGSGDVLALDVAAGELKWKISDCHPGGVASISFPTHGSCIYTAGADGMLCEINSLTGNL 150

Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGA 167
           +R+FKAS K IS  +   + KI A A+S+V+I +  N +++ KFS   G ++ +  ++  
Sbjct: 151 LRKFKASTKAISCISVSPDGKIIATAASQVKIFNCSNHKKIQKFSGHPGAVRCMVFTEDG 210

Query: 168 KIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGT--VILAVA 225
           + I+++  GE+++ VW  D   K  +    L+M H  + +D + S +G D T   ILA++
Sbjct: 211 RYILSSAVGERYIVVWSVD-GGKEQSASCVLAMEHPAIFVDSRCSNDGGDETALYILAIS 269

Query: 226 ESGVAYSWDLKTVSQDEKTNPAKITVKLKKADADQQNSVNVKKSRTSIIAARLNDLEADG 285
           E G  Y W  + + +     P KI +       +   S + K++  +I AA+L  +   G
Sbjct: 270 EIGACYLWYGQNLEELRTAKPTKILMS-----GNDIFSKSKKRAIPAIYAAKLQGVPKSG 324

Query: 286 QKTITGLHDL 295
              +   H L
Sbjct: 325 SGQVFLAHGL 334


>gi|449464758|ref|XP_004150096.1| PREDICTED: uncharacterized protein LOC101213788 [Cucumis sativus]
          Length = 570

 Score =  148 bits (373), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 98/310 (31%), Positives = 161/310 (51%), Gaps = 21/310 (6%)

Query: 1   IWSTNDGSLLAEW-------------KQPDGEPVVSYSCLACGFVGKKRRKERGTLLLAL 47
           IW T  G +  E+             K   G   V Y C+    + KKR+++R  LLL L
Sbjct: 31  IWDTLKGQIQTEFADFFTSDSTSILTKPEKGHLSVDYKCMKWLSLEKKRKRKRQCLLLLL 90

Query: 48  GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
           GT +GD+LA+DV  GE+KWK +  HPGG+A ++F   G  ++  G +GM  E+ S  G +
Sbjct: 91  GTGSGDVLALDVAAGELKWKISDCHPGGVASISFPTHGSCIYTAGADGMLCEINSLTGNL 150

Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGA 167
           +R+FKAS K IS  +   + KI A A+S+V+I +  N +++ KFS   G ++ +  ++  
Sbjct: 151 LRKFKASTKAISCISVSPDGKIIATAASQVKIFNCSNHKKIQKFSGHPGAVRCMVFTEDG 210

Query: 168 KIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGT--VILAVA 225
           + I+++  GE+++ VW  D   K  +    L+M H  + +D + S +G D T   ILA++
Sbjct: 211 RYILSSAVGERYIVVWSVD-GGKEQSASCVLAMEHPAIFVDSRCSNDGGDETALYILAIS 269

Query: 226 ESGVAYSWDLKTVSQDEKTNPAKITVKLKKADADQQNSVNVKKSRTSIIAARLNDLEADG 285
           E G  Y W  + + +     P KI +       +   S + K++  +I AA+L  +   G
Sbjct: 270 EIGACYLWYGQNLEELRTAKPTKILMS-----GNDIFSKSKKRAIPAIYAAKLQGVPKSG 324

Query: 286 QKTITGLHDL 295
              +   H L
Sbjct: 325 SGQVFLAHGL 334


>gi|115452511|ref|NP_001049856.1| Os03g0300300 [Oryza sativa Japonica Group]
 gi|108707682|gb|ABF95477.1| transducin family protein, putative, expressed [Oryza sativa
           Japonica Group]
 gi|113548327|dbj|BAF11770.1| Os03g0300300 [Oryza sativa Japonica Group]
 gi|215704632|dbj|BAG94260.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222624762|gb|EEE58894.1| hypothetical protein OsJ_10517 [Oryza sativa Japonica Group]
          Length = 637

 Score =  147 bits (372), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 84/267 (31%), Positives = 150/267 (56%), Gaps = 11/267 (4%)

Query: 18  GEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLA 77
           G   + Y+C+    +  K++++ G+ LL LGT +GD+LA+DV  G+ KW+ T  HPGG+ 
Sbjct: 65  GHLALDYTCMKWVQLSSKKKRKAGSSLLVLGTGSGDVLALDVAAGQWKWRVTDCHPGGVT 124

Query: 78  GLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEV 137
            +A++K GRSL+  GT+GM   + +  G V+ +FK+S K IS+ A   + +I A A+ ++
Sbjct: 125 AVAYSKHGRSLYTGGTDGMVCRINASDGSVVEKFKSSSKAISALAVSPDGEILATAAGQL 184

Query: 138 RILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPA 197
           R     + +++ KFS     ++ +  S  ++ ++++G GE+++ +W+   S KT +    
Sbjct: 185 RTFDASDNKKIQKFSGHPVAVRSMVFSGDSQYVLSSGVGERYVAIWKLG-SGKTQSSSCI 243

Query: 198 LSMRHSPVAIDCKNSP-NGEDGTV-ILAVAESGVAYSWDLKTVSQDEKTNPAKITVKLKK 255
           LSM H  + +DCK S  N  +G + +LA++E GV Y W    +       P KI +    
Sbjct: 244 LSMEHPAIFVDCKCSDINDTEGEIHVLAISEIGVCYFWSGTNMDDLRNKKPTKIAL---- 299

Query: 256 ADADQQNSVNVKKSRTSIIAARLNDLE 282
                 +S++  K   +I AA+L  ++
Sbjct: 300 ----SDSSLSRSKQGFAIFAAKLQGID 322


>gi|224110316|ref|XP_002315482.1| predicted protein [Populus trichocarpa]
 gi|222864522|gb|EEF01653.1| predicted protein [Populus trichocarpa]
          Length = 617

 Score =  147 bits (372), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 88/294 (29%), Positives = 158/294 (53%), Gaps = 22/294 (7%)

Query: 1   IWSTNDGSLLAEW------------KQPDGEPVVSYSCLACGFVGKKRRKERGTLLLALG 48
           IW T  G +  E+            K   G   V Y C+    + KK++++ G+ LL LG
Sbjct: 33  IWDTLKGQVQTEFADITSSEGDFYAKPERGHLSVDYKCMKWLSLDKKKKRKLGSSLLVLG 92

Query: 49  TSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVI 108
           T +GD+LA+DV  G++KW  +  HPGG++ ++F+ +   ++  G +GM  +   + G  +
Sbjct: 93  TGSGDVLALDVSAGQLKWSVSDCHPGGVSAVSFSTRDSCIYTSGADGMICKTDPQTGNTL 152

Query: 109 REFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAK 168
            +F+AS K IS  +   + KI A A+++++I +  + +++ KFS   G ++ +  +D  K
Sbjct: 153 GKFRASTKAISCMSVSSDGKILATAAAQLKIFNCSDHKKMQKFSGHPGAVRSMVFTDDGK 212

Query: 169 IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPN-GED--GTVILAVA 225
            I+++  GE+++ +WR D   K  +    L+M H  V IDCK   N G D  G  +LA++
Sbjct: 213 YILSSAVGERYVALWRID-GGKRQSASCVLAMEHPAVFIDCKCFENEGVDDAGLYVLAIS 271

Query: 226 ESGVAYSWDLKTVSQDEKTNPAKITVKLKKADADQQNSVNVKKSRTSIIAARLN 279
           E+GV Y+W  K V +   + P K+ +        ++   N K +  +++AA+L 
Sbjct: 272 ETGVCYTWCGKNVEELRNSKPTKVLLSY------EEFPKNHKGALPTVLAAKLQ 319


>gi|414866377|tpg|DAA44934.1| TPA: hypothetical protein ZEAMMB73_517326 [Zea mays]
          Length = 629

 Score =  146 bits (369), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 81/265 (30%), Positives = 145/265 (54%), Gaps = 11/265 (4%)

Query: 18  GEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLA 77
           G   + Y+C+    +  K++++ G+ L+ LGT +GD+LA+DV  G  KWK +  HPGG+ 
Sbjct: 62  GHLALDYTCMKWVQLSSKKKRKAGSSLIVLGTGSGDVLALDVAAGHWKWKVSDCHPGGVT 121

Query: 78  GLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEV 137
            + ++K GR+++  G +GM   + +  G V+ +FK+S K IS+ A   +  + A A+ ++
Sbjct: 122 AVEYSKHGRNVYTAGADGMVCRIDASNGAVVGKFKSSSKAISALAVSSDGNMLATAAGQL 181

Query: 138 RILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPA 197
           R     + +++ KFS     ++ +  S+ ++ ++++G GE+++ +W+   S KT +    
Sbjct: 182 RTFDTSSNKKIKKFSGHPVAVRSMVFSNDSQYVLSSGVGERYIAIWKLG-SGKTQSSNCI 240

Query: 198 LSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQDEKTNPAKITVKLKKAD 257
           LSM H  + +DCK S  GE    ILA++E GV Y W    V       P KI +      
Sbjct: 241 LSMEHPAMFVDCKCSDKGE--IHILAISEIGVCYFWSANVVDDLRNKKPTKIIL------ 292

Query: 258 ADQQNSVNVKKSRTSIIAARLNDLE 282
              ++SV+      SI AA+L  ++
Sbjct: 293 --SESSVSTTHQALSIFAAKLQGVD 315


>gi|218192639|gb|EEC75066.1| hypothetical protein OsI_11186 [Oryza sativa Indica Group]
          Length = 637

 Score =  146 bits (369), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 84/267 (31%), Positives = 150/267 (56%), Gaps = 11/267 (4%)

Query: 18  GEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLA 77
           G   + Y+C+    +  K++++ G+ LL LGT +GD+LA+DV  G+ KW+ T  HPGG+ 
Sbjct: 65  GHLALDYTCMKWVQLLSKKKRKAGSSLLVLGTGSGDVLALDVAAGQWKWRVTDCHPGGVT 124

Query: 78  GLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEV 137
            +A++K GRSL+  GT+GM   + +  G V+ +FK+S K IS+ A   + +I A A+ ++
Sbjct: 125 AVAYSKHGRSLYTGGTDGMVCRINASDGSVVEKFKSSSKAISALAVSPDGEILATAAGQL 184

Query: 138 RILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPA 197
           R     + +++ KFS     ++ +  S  ++ ++++G GE+++ +W+   S KT +    
Sbjct: 185 RTFDASDNKKIQKFSGHPVAVRSMVFSGDSQYVLSSGVGERYVAIWKLG-SGKTQSSSCI 243

Query: 198 LSMRHSPVAIDCKNSP-NGEDGTV-ILAVAESGVAYSWDLKTVSQDEKTNPAKITVKLKK 255
           LSM H  + +DCK S  N  +G + +LA++E GV Y W    +       P KI +    
Sbjct: 244 LSMEHPAIFVDCKCSDINDTEGEIHVLAISEIGVCYFWSGTNMDDLRNKKPTKIAL---- 299

Query: 256 ADADQQNSVNVKKSRTSIIAARLNDLE 282
                 +S++  K   +I AA+L  ++
Sbjct: 300 ----SDSSLSRSKQGFAIFAAKLQGID 322


>gi|357112630|ref|XP_003558111.1| PREDICTED: uncharacterized protein LOC100843293 [Brachypodium
           distachyon]
          Length = 625

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 89/296 (30%), Positives = 157/296 (53%), Gaps = 23/296 (7%)

Query: 1   IWSTNDGSLLAEWKQ---------PD---GEPVVSYSCLACGFVGKKRRKERGTLLLALG 48
           +W    GSL  E+           P+   G   + Y+C+    +  K++++ G+ LL LG
Sbjct: 33  VWDAVRGSLQTEFADIPAVEVGAVPETKRGHLALDYTCMKWVQLSGKKKRKAGSSLLVLG 92

Query: 49  TSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVI 108
           T +GD+LA+DV  G+ KW+    HPGG+  +A+++ GRS++  G +GM   + S  G V 
Sbjct: 93  TGSGDVLALDVAAGQWKWRINDCHPGGVTAVAYSRHGRSVYSAGADGMVCRIDSSDGSVA 152

Query: 109 REFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAK 168
            +FK+S K IS+ A   +  I A A+ ++R     + ++V KFS     ++ ++ SD  K
Sbjct: 153 GKFKSSSKAISALAVSPDGNILATAAGQLRTFDASDNKKVQKFSGHPVAVRSMTFSDNGK 212

Query: 169 IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNS-PNGEDGTV-ILAVAE 226
            I+++G GE+++ +W+   S K  +    LSM H  + +DCK S  +  DG + +LA++E
Sbjct: 213 YILSSGVGERYVAIWKLG-SGKAQSSNCILSMEHPAIFVDCKCSDTDATDGEIHVLAISE 271

Query: 227 SGVAYSWDLKTVSQDEKTNPAKITVKLKKADADQQNSVNVKKSRTSIIAARLNDLE 282
            G+ Y W    ++      P KI +         ++S++  K   +I AA+L  ++
Sbjct: 272 VGICYFWSGNNMNDLRNKKPTKIAL--------SESSLSRAKQPFTIFAAKLKGID 319


>gi|255551877|ref|XP_002516984.1| nucleotide binding protein, putative [Ricinus communis]
 gi|223544072|gb|EEF45598.1| nucleotide binding protein, putative [Ricinus communis]
          Length = 586

 Score =  145 bits (365), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 87/295 (29%), Positives = 160/295 (54%), Gaps = 24/295 (8%)

Query: 1   IWSTNDGSLLAE------------WKQPD-GEPVVSYSCLACGFVGKKRRKERGTLLLAL 47
           IW T  G L  E            + +P+ G   + Y+C+    + +KR+++ G+ LL L
Sbjct: 31  IWDTVKGQLQTEFADITSSNNADFYTKPERGHLSIDYTCMKWLSLDRKRKRKLGSSLLVL 90

Query: 48  GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
           GT +GD+LA+DV  G++KW  +  HPGG++ ++F+     ++  G +GM  ++  + G +
Sbjct: 91  GTGSGDVLALDVSAGQLKWTVSDCHPGGVSAISFSTSDSCIYTAGADGMCCKLDPQTGNM 150

Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGA 167
           + +F+AS K ISS +   + KI A A+++++ ++L + +++ KFS   G ++ +  ++  
Sbjct: 151 LGKFRASTKAISSVSVSPDGKILATAAAQLKTINLSDNKKIQKFSGHPGAVRAMIFTEDG 210

Query: 168 KIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPN-GED--GTVILAV 224
           K I+++  GE+++ +WR D   K       L+M H  V +D K   N G D  G  ++A+
Sbjct: 211 KYILSSAVGERYIALWRTD-GVKKQTASCVLAMEHPAVFLDSKCLENKGVDDAGLCVMAI 269

Query: 225 AESGVAYSWDLKTVSQDEKTNPAKITVKLKKADADQQNSVNVKKSRTSIIAARLN 279
           +E+GV Y+W  + V +     P KI +         ++S N K    +I AA+L 
Sbjct: 270 SETGVCYTWCGQNVEELRIAKPTKIVLF-------SEDSKNQKGGLRTIFAAKLQ 317


>gi|30683535|ref|NP_196685.2| transducin/WD40 domain-containing protein [Arabidopsis thaliana]
 gi|209414540|gb|ACI46510.1| At5g11240 [Arabidopsis thaliana]
 gi|332004268|gb|AED91651.1| transducin/WD40 domain-containing protein [Arabidopsis thaliana]
          Length = 615

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 89/298 (29%), Positives = 155/298 (52%), Gaps = 20/298 (6%)

Query: 1   IWSTNDGSLLAEW-------------KQPDGEPVVSYSCLACGFVGKKRRKERGTLLLAL 47
           IW T  G +  E+             K   G   V Y+C+    + KK++++ GT +L L
Sbjct: 31  IWDTVKGQVQTEFADIASTEETNIYTKVGKGHLSVDYTCMKWLSLEKKKKRKLGTSVLVL 90

Query: 48  GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
           GT  GD+LA+DV +G++KW+ +  HPGG+  ++ + K   ++  G +GM  ++    G +
Sbjct: 91  GTGGGDVLALDVASGQLKWRISDCHPGGVNAVSSSAKASCIYSGGADGMVCQIDPHSGNL 150

Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGA 167
           IR+FKAS K +SS     + KI   AS++++  +  + +++ KF+   G ++ V+ ++  
Sbjct: 151 IRKFKASTKTVSSLCVSPDGKILVTASTQLKTFNCSDLKKIQKFTGHPGVVRCVAFTEDG 210

Query: 168 KIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAES 227
           K ++++  GE+++ VW+ D  +K  +    L++ H PV +D     N E G  +LA++E 
Sbjct: 211 KYVLSSAVGERYIAVWKTD-GAKKQSASCVLALEHPPVFVDSWGETN-EKGLYVLAISEI 268

Query: 228 GVAYSWDLKTVSQDEKTNPAKITVKLKKADADQQNSVNVKKSRTSIIAARLNDLEADG 285
           GV Y W    V +     P K  V L  AD+  +     K S   I AA+L  +   G
Sbjct: 269 GVCYFWYGSNVEELCNATPTK--VALATADSSLK---PYKSSLPLIFAAKLQGILKPG 321


>gi|8953386|emb|CAB96659.1| putative protein [Arabidopsis thaliana]
          Length = 606

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 89/298 (29%), Positives = 155/298 (52%), Gaps = 20/298 (6%)

Query: 1   IWSTNDGSLLAEW-------------KQPDGEPVVSYSCLACGFVGKKRRKERGTLLLAL 47
           IW T  G +  E+             K   G   V Y+C+    + KK++++ GT +L L
Sbjct: 31  IWDTVKGQVQTEFADIASTEETNIYTKVGKGHLSVDYTCMKWLSLEKKKKRKLGTSVLVL 90

Query: 48  GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
           GT  GD+LA+DV +G++KW+ +  HPGG+  ++ + K   ++  G +GM  ++    G +
Sbjct: 91  GTGGGDVLALDVASGQLKWRISDCHPGGVNAVSSSAKASCIYSGGADGMVCQIDPHSGNL 150

Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGA 167
           IR+FKAS K +SS     + KI   AS++++  +  + +++ KF+   G ++ V+ ++  
Sbjct: 151 IRKFKASTKTVSSLCVSPDGKILVTASTQLKTFNCSDLKKIQKFTGHPGVVRCVAFTEDG 210

Query: 168 KIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAES 227
           K ++++  GE+++ VW+ D  +K  +    L++ H PV +D     N E G  +LA++E 
Sbjct: 211 KYVLSSAVGERYIAVWKTD-GAKKQSASCVLALEHPPVFVDSWGETN-EKGLYVLAISEI 268

Query: 228 GVAYSWDLKTVSQDEKTNPAKITVKLKKADADQQNSVNVKKSRTSIIAARLNDLEADG 285
           GV Y W    V +     P K  V L  AD+  +     K S   I AA+L  +   G
Sbjct: 269 GVCYFWYGSNVEELCNATPTK--VALATADSSLK---PYKSSLPLIFAAKLQGILKPG 321


>gi|356521817|ref|XP_003529547.1| PREDICTED: uncharacterized protein LOC100815458 [Glycine max]
          Length = 626

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 80/275 (29%), Positives = 149/275 (54%), Gaps = 11/275 (4%)

Query: 17  DGEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGL 76
           +G   + Y+C+      KKR+++  + LL LGT  GD+LA+DV + ++ W+ T  HPGG+
Sbjct: 63  NGHLALDYTCIKWFSFEKKRKRKHISSLLVLGTGGGDVLALDVASAQLTWRLTDCHPGGV 122

Query: 77  AGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSE 136
             +A +    S++  G +GM   +    G ++ +FKAS KP+S  +   +    A A+++
Sbjct: 123 RAIASSANVSSIYTAGVDGMVCVIDFMTGNLLEKFKASTKPVSCMSVSPDGNTLATAAAQ 182

Query: 137 VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGP 196
           ++I +  N +++ KFS   G ++ +  ++  K ++++  GE+++ VWR D  +K  +   
Sbjct: 183 LKIFNCSNHKKIQKFSGHPGSVRCMLFTEDGKYVLSSAVGERYVAVWRID-GAKKQSASC 241

Query: 197 ALSMRHSPVAIDCKNSPNGED---GTVILAVAESGVAYSWDLKTVSQDEKTNPAKITVKL 253
            L+M H  V +D +   +GE    G  +LA++E G+ Y W   ++ +     P KI++ L
Sbjct: 242 VLAMEHPAVFLDSRCIDSGEHDEAGICVLAISEIGICYLWFGNSIEELRNAKPTKISLSL 301

Query: 254 KKADADQQNSVNVKKSRTSIIAARLNDLE--ADGQ 286
                +   S N K +  +I AA+L  ++  A GQ
Sbjct: 302 -----EDMPSRNYKGALPAIYAAKLQGIQKPASGQ 331


>gi|356564408|ref|XP_003550446.1| PREDICTED: uncharacterized protein LOC100784148 [Glycine max]
          Length = 626

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 82/275 (29%), Positives = 152/275 (55%), Gaps = 11/275 (4%)

Query: 17  DGEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGL 76
           +G   + Y+C+      KKR+++  + LL LGT  GD+LA+D+ +G++ W+ T  HPGG+
Sbjct: 63  NGHLALDYTCIKWFSFDKKRKRKHISSLLVLGTGGGDVLALDIASGQLTWRLTDCHPGGV 122

Query: 77  AGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSE 136
             +A +    +++  G +GM   +    G ++ +FKAS KP+S  +   + K  A A+++
Sbjct: 123 RAIASSANVSTIYTAGADGMVCLIDFMTGNLLEKFKASTKPVSCMSVSPDGKTLATAAAQ 182

Query: 137 VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGP 196
           ++I +  N +++ KFS   G ++ +  ++  K I+++  GE+++ VWR +  +K  +   
Sbjct: 183 LKIFNCSNHKKIQKFSGHPGSVRCMVFTEDGKHILSSAVGERYVAVWRIE-GAKKQSASC 241

Query: 197 ALSMRHSPVAID--CKNS-PNGEDGTVILAVAESGVAYSWDLKTVSQDEKTNPAKITVKL 253
            L+M H  V +D  C +S  + E G  +LA++E G+ Y W   +V +     P KI++ L
Sbjct: 242 VLAMEHPAVFLDSRCIDSEEHDEAGICVLAISEIGICYLWFGNSVEELRNAKPTKISLSL 301

Query: 254 KKADADQQNSVNVKKSRTSIIAARLNDLE--ADGQ 286
           +        S N K +  +I AA+L  ++  A GQ
Sbjct: 302 EDVP-----SRNYKGALPAIYAAKLQGIQKPASGQ 331


>gi|26451883|dbj|BAC43034.1| unknown protein [Arabidopsis thaliana]
          Length = 615

 Score =  138 bits (347), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 89/298 (29%), Positives = 154/298 (51%), Gaps = 20/298 (6%)

Query: 1   IWSTNDGSLLAEW-------------KQPDGEPVVSYSCLACGFVGKKRRKERGTLLLAL 47
           IW T  G +  E+             K   G   V Y+C+    + KK++++ GT +L L
Sbjct: 31  IWDTVKGQVQTEFADIASTEETNIYTKVGKGHLSVDYTCMKWLSLEKKKKRKLGTSVLVL 90

Query: 48  GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
           GT  GD+LA+DV +G++KW+ +  HPGG+  ++ + K   ++  G +GM  ++    G +
Sbjct: 91  GTGGGDVLALDVASGQLKWRISDCHPGGVNAVSSSAKASCIYSGGADGMVCQIDPHSGNL 150

Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGA 167
           IR+FKAS K +SS     + KI   AS++++  +  + +++ KF+   G ++ V+ ++  
Sbjct: 151 IRKFKASTKTVSSLCVSPDGKILVTASTQLKTFNCSDLKKIQKFTGHPGVVRCVAFTEDG 210

Query: 168 KIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAES 227
           K ++++  GE+++ VW+ D  +K  +    L++ H PV  D     N E G  +LA++E 
Sbjct: 211 KYVLSSAVGERYIAVWKTD-GAKKQSASCVLALEHPPVFADSWGETN-EKGLYVLAISEI 268

Query: 228 GVAYSWDLKTVSQDEKTNPAKITVKLKKADADQQNSVNVKKSRTSIIAARLNDLEADG 285
           GV Y W    V +     P K  V L  AD+  +     K S   I AA+L  +   G
Sbjct: 269 GVCYFWYGSNVEELCNATPTK--VALATADSSLK---PYKSSLPLIFAAKLQGILKPG 321


>gi|313184296|emb|CBL94162.1| putative transducin family protein / WD-40 repeat family protein
           [Malus x domestica]
          Length = 629

 Score =  137 bits (346), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 78/254 (30%), Positives = 140/254 (55%), Gaps = 6/254 (2%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVL 60
           I ++ D S+ A  K+  G   V Y+C+      +K++++ G+  L LGT  GD+LA+DV 
Sbjct: 46  IVASEDTSIYA--KRERGHLSVDYTCMKWFSSERKKKRKLGSSFLVLGTGGGDVLALDVA 103

Query: 61  TGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISS 120
            G +KW+    H GG++ ++FA+    L+  G +GM  ++ S  G +  +FKAS K ISS
Sbjct: 104 AGHLKWRVNDCHAGGVSSISFARNTSCLYTAGADGMICQIDSLTGNLSGKFKASTKAISS 163

Query: 121 SAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHL 180
            +      I A A+++++I +L + +++ KFS   G ++ +  ++  K I+++  GE+++
Sbjct: 164 MSVSSNGNILATAAAQMKIFNLSDHKKIQKFSGHPGAVRCMIFTEDGKYILSSAVGERYI 223

Query: 181 QVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTV---ILAVAESGVAYSWDLKT 237
            VW  D   K  +    L M H  + +D +   +GE G V   +LA++E GV Y W  + 
Sbjct: 224 AVWSID-GGKKQSASVVLKMEHPAIFVDSRCIDSGEVGDVGLYVLAISEVGVCYFWFGQN 282

Query: 238 VSQDEKTNPAKITV 251
           + +     P KI++
Sbjct: 283 IEELRSAKPTKISL 296


>gi|297807165|ref|XP_002871466.1| hypothetical protein ARALYDRAFT_325654 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297317303|gb|EFH47725.1| hypothetical protein ARALYDRAFT_325654 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 615

 Score =  137 bits (346), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 88/298 (29%), Positives = 155/298 (52%), Gaps = 20/298 (6%)

Query: 1   IWSTNDGSLLAEW-------------KQPDGEPVVSYSCLACGFVGKKRRKERGTLLLAL 47
           IW T  G +  E+             K   G   V Y+C+    + KK++++ GT +L L
Sbjct: 31  IWDTVKGQVQTEFADIASTEETNIYTKVGKGHLSVDYTCMKWLSLEKKKKRKLGTSVLVL 90

Query: 48  GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
           GT  GD+LA+DV +G++KW+ +  HPGG+  ++ + K   ++  G +GM  ++    G +
Sbjct: 91  GTGGGDVLALDVASGQLKWRISDCHPGGVNAVSSSAKASCIYSGGADGMVCQIDPHSGNL 150

Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGA 167
           IR+FKAS K +SS     + KI   AS++++  +  + +++ KF+   G ++ V+ ++  
Sbjct: 151 IRKFKASTKTVSSLCVSPDGKILVTASAQLKTFNCSDLKKIQKFTGHPGGVRCVAFTEDG 210

Query: 168 KIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAES 227
           K ++++  GE+++ VW+ +  +K  +    L++ H PV +D     N E G  +LA++E 
Sbjct: 211 KYVLSSAVGERYIAVWKTN-GAKKQSASCVLALEHPPVFVDSWGETN-EKGLYVLAISEI 268

Query: 228 GVAYSWDLKTVSQDEKTNPAKITVKLKKADADQQNSVNVKKSRTSIIAARLNDLEADG 285
           GV Y W    V +     P K  V L  AD+  +     K S   I AA+L  +   G
Sbjct: 269 GVCYFWYGSNVEELCNATPTK--VALATADSSLKPH---KGSLPLIFAAKLQGILKPG 321


>gi|356537448|ref|XP_003537239.1| PREDICTED: uncharacterized protein LOC100778281 [Glycine max]
          Length = 557

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 80/278 (28%), Positives = 148/278 (53%), Gaps = 14/278 (5%)

Query: 17  DGEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGL 76
           +G   + Y+C+      +KR+++  + LL LGT  GD+LA+DV + ++ W+ T  HPGG+
Sbjct: 63  NGHLALDYTCIKWFSFEEKRKRKHISSLLVLGTGGGDVLALDVASAQLTWRLTDCHPGGV 122

Query: 77  AGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSE 136
             +A +    S++  G +GM   +    G ++ +FKAS KP+S  +   +    A A+++
Sbjct: 123 RVIASSANVSSIYTAGVDGMVCVIDFMTGNLLEKFKASTKPVSCMSVSPDGNTLATAAAQ 182

Query: 137 VRILSLENGEEVLKFSDDVGPLQYVSA---SDGAKIIITAGYGEKHLQVWRCDISSKTVN 193
           ++I +  N +++ KFS   G L  V     ++  K ++++  GE+++ VWR +  +K  +
Sbjct: 183 LKIFNCSNHKKIQKFSGHPGSLGSVRCMLFTEEGKYVLSSAIGERYVAVWRIE-GAKKQS 241

Query: 194 KGPALSMRHSPVAIDCKNSPNGED---GTVILAVAESGVAYSWDLKTVSQDEKTNPAKIT 250
               L+M H  V +D +   +GE    G  +LA++E  + Y W   ++ +   T P KI+
Sbjct: 242 ASCVLAMEHPAVFLDSRCIDSGEHDEVGICVLAISEIAICYLWFGNSIEELCNTKPTKIS 301

Query: 251 VKLKKADADQQNSVNVKKSRTSIIAARLNDLE--ADGQ 286
           + L     +   S N K +  +I AA+L  ++  A GQ
Sbjct: 302 LSL-----EGMPSRNYKGALPAIYAAKLQGIQKPASGQ 334


>gi|357478843|ref|XP_003609707.1| WD40 repeat-containing protein SMU1 [Medicago truncatula]
 gi|355510762|gb|AES91904.1| WD40 repeat-containing protein SMU1 [Medicago truncatula]
          Length = 625

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 80/275 (29%), Positives = 145/275 (52%), Gaps = 12/275 (4%)

Query: 18  GEPVVSYSCLA-CGFVGKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGL 76
           G   V Y+C+    F  K++RK   + LL LGT +GD+LA+DV  G++ WK    HPGG+
Sbjct: 60  GHLSVDYTCIKWLSFQSKRKRKHSSSSLLLLGTGSGDVLALDVSAGQLSWKINDCHPGGV 119

Query: 77  AGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSE 136
             ++    G + +  G +GM        G+++ +FKAS K +S  +   + K  A A+++
Sbjct: 120 RAISSLANGSTFYTAGADGMICATDFSTGKLLEKFKASSKAVSCISVSPDGKKLATAAAQ 179

Query: 137 VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGP 196
           ++I +  N +++ K S   G ++ +  ++ +K I+++  GE+++ VW  D  SK  +   
Sbjct: 180 LKIFNCSNKKKIQKCSGHPGSVRCMVFTEDSKYILSSAVGERYVAVWSID-GSKKQSASC 238

Query: 197 ALSMRHSPVAIDCKNSPNGED---GTVILAVAESGVAYSWDLKTVSQDEKTNPAKITVKL 253
            L+M H  V +D +    GE    G  +LA++E+G+ Y W   ++ +     P KI++  
Sbjct: 239 VLAMEHPAVFLDSRCIDKGEHDEAGICVLAISEAGLCYLWFGNSIEELRNAKPTKISLS- 297

Query: 254 KKADADQQNSVNVKKSRTSIIAARLNDLE--ADGQ 286
               ++  ++ N K +  +I AA L   +  A GQ
Sbjct: 298 ----SEDMSTKNYKGALPAIYAANLQSTQKPASGQ 328


>gi|147853966|emb|CAN79552.1| hypothetical protein VITISV_025726 [Vitis vinifera]
          Length = 692

 Score =  105 bits (261), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 64/212 (30%), Positives = 117/212 (55%), Gaps = 9/212 (4%)

Query: 70  GRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKI 129
           G+   G++ ++ +  G  ++  G +GM S++ S  G ++ +F+AS K ISS +   + K+
Sbjct: 177 GKWRRGVSAISCSTDGSCIYAAGADGMISKIDSSAGNLLGKFRASTKAISSMSVSSDGKV 236

Query: 130 FALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISS 189
            A A+++++IL+  + +++ KFS   G ++ +  +D  + I+++  GE+++ VWR D  S
Sbjct: 237 LATAAAQLKILNSSDHKKIQKFSGHPGAVRCMIFTDDGEYILSSAIGERYIAVWRID-GS 295

Query: 190 KTVNKGPALSMRHSPVAIDCKNSPNGE---DGTVILAVAESGVAYSWDLKTVSQDEKTNP 246
           K  +    L+M H  V +D +   NG+    G  +LA++E GV Y W  K +     T P
Sbjct: 296 KKQSASCVLAMEHPAVFLDSRCIDNGDTDKSGLYVLAISEIGVCYFWYAKNMEDLRNTKP 355

Query: 247 AKITVKLKKADADQQNSVNVKKSRTSIIAARL 278
            K++V L     +   S N K S  +I AA+L
Sbjct: 356 TKVSVSL-----EDNFSKNHKGSLPTIFAAKL 382



 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 34/88 (38%), Positives = 47/88 (53%), Gaps = 15/88 (17%)

Query: 1   IWSTNDGSLLAE------------WKQPD-GEPVVSYSCLA-CGFVGKKRRKERGTLLLA 46
           IW T  G +  E            + +P+ G   V Y+CL       KK+RK R +LL+ 
Sbjct: 31  IWDTLKGQVQTEFADIISTDTTNLYSKPERGHLSVDYTCLKWLSLDSKKKRKLRCSLLV- 89

Query: 47  LGTSNGDILAVDVLTGEMKWKSTGRHPG 74
           LGT +GD+LA+DV  GE+KW+    HPG
Sbjct: 90  LGTGSGDVLALDVSAGELKWRIGDCHPG 117


>gi|168036674|ref|XP_001770831.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162677890|gb|EDQ64355.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 732

 Score =  101 bits (251), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 74/290 (25%), Positives = 133/290 (45%), Gaps = 58/290 (20%)

Query: 18  GEPVVSYSCLACGFVGKK---------RRKERGTLLLALGTSNGDILAVDVLTGEMKWKS 68
           G   + Y+C+A G +             +++ G  ++ +GT +GDI++ D   GE+KW++
Sbjct: 53  GHLALDYTCMAWGALPSNSSTKKKKKGAKEKGGAAVVVVGTGSGDIISWDTTLGELKWRA 112

Query: 69  TGRHPGGLA---------------------------------------------GLAFAK 83
           +  H G +A                                              +AF+K
Sbjct: 113 SDCHAGSVAVSSSLNPEFQFSLLMNALQGRELLIQMFHLFVNDVKSCLNLKSVTSVAFSK 172

Query: 84  KGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLE 143
            G+ L+ VG +    E+ S  G+V+ +F+A++ P+S  A   +    A+AS+EV++  L 
Sbjct: 173 SGKRLYSVGHDAHVCELDSSSGQVLTKFRAAKTPLSCVAVAEDGGTVAVASAEVKLFDLS 232

Query: 144 NGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHS 203
           + + + KF     P++ +  +   K +++A  GE+H+ +W  +      +   +LS+ H 
Sbjct: 233 SKKRLRKFPGHPTPVKTLCFTPDGKYLLSAATGERHVVLWHVEGGKSETSAPISLSVEHP 292

Query: 204 PVAIDCKNSPNGEDGTV--ILAVAESGVAYSWDLKTVSQDEKTNPAKITV 251
              ID   S   EDG V  ++AV+E G AY W   TV++ E T P  I+V
Sbjct: 293 VATIDI--SGFDEDGDVLRVIAVSEGGCAYIWSAPTVNELETTKPTIISV 340


>gi|302759104|ref|XP_002962975.1| hypothetical protein SELMODRAFT_404454 [Selaginella moellendorffii]
 gi|300169836|gb|EFJ36438.1| hypothetical protein SELMODRAFT_404454 [Selaginella moellendorffii]
          Length = 617

 Score =  100 bits (249), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 77/287 (26%), Positives = 142/287 (49%), Gaps = 25/287 (8%)

Query: 2   WSTNDGSLLAEWKQP--------DGEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGD 53
           W    G + AE+ +         DG   ++Y  LA G   KK++K+    L+ LGT  GD
Sbjct: 25  WDVASGHVCAEFSEIAASSSQALDGHLALNYKSLAWGASSKKKKKK--ASLIVLGTGGGD 82

Query: 54  ILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKA 113
           +LA D + GE+ W++   + GG+  + FAK   SL   G +G+  E++   G+++   + 
Sbjct: 83  VLAWDPILGELTWRTNDCNTGGVTSVCFAKNQGSLFSAGIDGIVCELEVASGKILSTLQC 142

Query: 114 SEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITA 173
           S+  I++ +   +  +  +A++E+R+  +   + + KF+     +     ++  + +++A
Sbjct: 143 SKSSITNISSSPDASLLLVAANELRLFDMRTRKRLRKFTGHASSVNVTVFTEDGRYVLSA 202

Query: 174 GYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGT--VILAVAESGVAY 231
             G++++ VW  +   K      AL+M H  VA+D       +DGT   ILAV+E+G AY
Sbjct: 203 AVGDRNIAVWDTNEVGKEAKV--ALAMEHPAVALDSWGFE--KDGTERKILAVSETGEAY 258

Query: 232 SWDLKTVSQDEKTNPAKITVKLKKADADQQNSVNVKKSRTSIIAARL 278
            W   ++ +  K  P ++ V  K +          K S+ SI+  RL
Sbjct: 259 IWSGLSLEELGKAEPVRVRVGSKNSG---------KASKQSILLGRL 296


>gi|302824612|ref|XP_002993948.1| hypothetical protein SELMODRAFT_431903 [Selaginella moellendorffii]
 gi|300138220|gb|EFJ04995.1| hypothetical protein SELMODRAFT_431903 [Selaginella moellendorffii]
          Length = 616

 Score = 95.1 bits (235), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 75/285 (26%), Positives = 137/285 (48%), Gaps = 21/285 (7%)

Query: 2   WSTNDGSLLAEW--------KQPDGEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGD 53
           W    G + AE+        +  DG   ++Y  LA G   KK++K+    L+ LGT  GD
Sbjct: 25  WDVASGHVCAEFSGIAASSSQALDGHLALNYKTLAWGVSSKKKKKKAS--LIVLGTGGGD 82

Query: 54  ILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKA 113
           +LA D + GE+ W++   + GG+  + FAK   SL   G +G+  E++   G+++   + 
Sbjct: 83  VLAWDPILGELTWRTNDCNTGGVTSVCFAKNQGSLFSAGIDGIVCELEVASGKILSTLQC 142

Query: 114 SEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITA 173
           S+  I+S +   +  +  +A++E+R+  +   + + KF+     +     ++  + +++A
Sbjct: 143 SKSSITSISSSPDASLLLVAANELRLFDMRTRKRLRKFTGHASSVNVTVFTEDGRYVLSA 202

Query: 174 GYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSW 233
             GE+++ VW  +   K       L+M H  VA+D            ILAV+E+G AY W
Sbjct: 203 AVGERNIAVWDTNEVGKEAKV--TLAMEHPAVALDSWGFEKDCTERKILAVSETGEAYIW 260

Query: 234 DLKTVSQDEKTNPAKITVKLKKADADQQNSVNVKKSRTSIIAARL 278
              ++ +  K  P ++ V  K +          K S+ SI+  RL
Sbjct: 261 SGLSLEELGKAEPVRVRVGSKNSG---------KASKQSILLGRL 296


>gi|302838626|ref|XP_002950871.1| hypothetical protein VOLCADRAFT_104876 [Volvox carteri f.
           nagariensis]
 gi|300263988|gb|EFJ48186.1| hypothetical protein VOLCADRAFT_104876 [Volvox carteri f.
           nagariensis]
          Length = 747

 Score = 71.2 bits (173), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 60/215 (27%), Positives = 100/215 (46%), Gaps = 16/215 (7%)

Query: 33  GKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVG 92
           GKK++    T + ALGT+ GD+   +V  GE+KW++     GG+  LAF+ +   L   G
Sbjct: 66  GKKKQSVESTGI-ALGTAAGDVKLYNVQLGELKWRAMDAVQGGVRSLAFSSEQGQLLATG 124

Query: 93  TNGMASEMKSEMGEVIREFKASEKPISSSAF---LCEEKIFALA--------SSEVRILS 141
            NG    +    GEV + ++AS+ PI++ A      E K   +A         S +++  
Sbjct: 125 NNGSLVSLSIPDGEVKKRYEASKYPITAIAIAPGCIEAKSLLIADGKHVFGGGSSMQLWD 184

Query: 142 LENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPA---L 198
           +       K++     ++ V    G    ++AG GE+ + VW    + K+  + PA   L
Sbjct: 185 VSESSRSAKYTGHPNEVRAVVFVPGQPHAVSAGSGERQVAVWEVPPAKKSKKQHPAVTTL 244

Query: 199 SMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSW 233
           ++    V +D      GE   V  AV+E G A+ W
Sbjct: 245 NLEDPAVQLDAACVSEGETFNVA-AVSEGGEAFVW 278


>gi|440802916|gb|ELR23832.1| WD repeat domain 43 isoform 5, putative [Acanthamoeba castellanii
           str. Neff]
          Length = 252

 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 44/158 (27%), Positives = 76/158 (48%), Gaps = 6/158 (3%)

Query: 32  VGKKRRKERGTLLL----ALGTSNGDILAVDVLTGEMKWKSTGR--HPGGLAGLAFAKKG 85
           VG+  R +R    L    A+GT+ G I+  DVL GE+  +  G+  H G +  L F +  
Sbjct: 65  VGQTERSKRAKKQLGGAVAIGTAKGTIVVWDVLRGEVHKELGGKEGHTGKVNDLVFTEGA 124

Query: 86  RSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENG 145
             L     +    +   E GE++ +FKA  +PISS A   +    A AS+ +++  L + 
Sbjct: 125 TFLFSCSDDLQVIKWNVENGEIVAKFKADNQPISSLALSADNSTLATASAMIKMWDLASS 184

Query: 146 EEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVW 183
           +   KF+     +  ++ S   K +++A   E++  VW
Sbjct: 185 KSQKKFTGHTSAVTSLNFSANGKFLVSASATERYTSVW 222


>gi|301090479|ref|XP_002895452.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262098632|gb|EEY56684.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 642

 Score = 61.2 bits (147), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 56/215 (26%), Positives = 102/215 (47%), Gaps = 21/215 (9%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFV-----GKKRRKERGTLLLALGTSNGDIL 55
           +W  ++G+L  E K+ D    +SY   +  +      GKKR  + G  LLALGTS+G ++
Sbjct: 22  VWDVSNGALQQELKERDH---LSYRYTSLAWTQPKTKGKKRSGDSGLGLLALGTSSGIVV 78

Query: 56  AVDVLTGEMK--WKSTGRH-PGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFK 112
             D+ TGE+K   ++   H  G +  L F  +G  L+   ++    E     G+V R+F+
Sbjct: 79  VWDLATGEVKHTLQADAVHGSGAVQALTFNPQGSLLYSSSSDKHVLEWNVSNGKVERKFR 138

Query: 113 ASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLK----FSDDVGPLQYVSASDG-- 166
                 S+ A   + +I A+  S +R   L +G++  K     S  V  L++ +   G  
Sbjct: 139 CGSGGASALAVSADGEILAVGGSALRTFDLSSGKKSRKLVSGLSTAVTQLRFANVETGIE 198

Query: 167 -AKIIITAGYGEKHLQVWRCDISSKTVNKGPALSM 200
            ++ +  A  G + + ++  ++   T +  PAL+ 
Sbjct: 199 SSRFLFAATAGARFVNMYDLEL---TEHDAPALTF 230


>gi|301122917|ref|XP_002909185.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262099947|gb|EEY57999.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 642

 Score = 57.8 bits (138), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 55/215 (25%), Positives = 100/215 (46%), Gaps = 21/215 (9%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFV-----GKKRRKERGTLLLALGTSNGDIL 55
           +W   +G+L  E K+ D    +SY   +  +       KKR  + G  LLALGTS+G ++
Sbjct: 22  VWDVANGALQQELKERDH---LSYRYTSLAWTQPKTKSKKRSGDSGLGLLALGTSSGIVV 78

Query: 56  AVDVLTGEMK--WKSTGRH-PGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFK 112
             D+ TGE+K   ++   H  G +  L F  +G  L+   ++    E     G+V R+F+
Sbjct: 79  VWDLATGEVKHTLQADAAHGSGAVQALTFNPQGSLLYSSSSDKHILEWNVSSGKVERKFR 138

Query: 113 ASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLK----FSDDVGPLQYVSASDG-- 166
                 S+ A   + +I A+  S +R   L +G++  K     S  V  L++ +   G  
Sbjct: 139 CGSGGASALAVSADGEILAVGGSALRTFDLSSGKKSRKLVSGLSTAVTQLRFANVETGIE 198

Query: 167 -AKIIITAGYGEKHLQVWRCDISSKTVNKGPALSM 200
            ++ +  A  G + + ++  ++   T +  PAL+ 
Sbjct: 199 SSRFLFAATAGARFVNMYDLEL---TEHDAPALTF 230


>gi|384245316|gb|EIE18810.1| NUC189-domain-containing protein [Coccomyxa subellipsoidea C-169]
          Length = 671

 Score = 57.4 bits (137), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 41/165 (24%), Positives = 77/165 (46%), Gaps = 3/165 (1%)

Query: 33  GKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAF-AKKGRS--LH 89
           G+      G L+L +GT+ G++ A    T +  W + G + GG+  LAF A  G +  L+
Sbjct: 69  GRAVANAAGRLMLVVGTAAGNVRAFSAATAKELWAANGCNEGGVTSLAFEADAGAAGLLY 128

Query: 90  VVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVL 149
            VG+N     +    G  + +FKA   P+S      + +   + SS + + ++   + V 
Sbjct: 129 SVGSNNHVCALDVASGAQVVKFKAGSHPLSCVGAAPDCQSVLVGSSSLALWNVAEQKRVA 188

Query: 150 KFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNK 194
           KF     P++ ++ S      ++A  GE+ + +W     + T +K
Sbjct: 189 KFMGHSLPVRAIAFSPSGSHALSASEGERQVALWALPTIAATTSK 233


>gi|359483009|ref|XP_003632878.1| PREDICTED: uncharacterized protein LOC100265278 isoform 2 [Vitis
           vinifera]
 gi|359483011|ref|XP_002281427.2| PREDICTED: uncharacterized protein LOC100265278 isoform 1 [Vitis
           vinifera]
          Length = 357

 Score = 57.0 bits (136), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 42/123 (34%), Positives = 55/123 (44%), Gaps = 45/123 (36%)

Query: 251 VKLKKADADQQNSVNVKKSRTSIIAARLNDLEADG------------------------- 285
           VK K+A+ DQ++S + KKSRTSII ARLN L++DG                         
Sbjct: 2   VKTKEAEIDQKHSGSTKKSRTSIITARLNALQSDGWVTALVCYGSINSPQFNLLEISNPG 61

Query: 286 --------------------QKTITGLHDLESEAATASAQNEKKSKKRAASDPDLETTRD 325
                               +  +    DLE EA     QN+K +KKRAASDPDL    +
Sbjct: 62  EDIVVAATDNIVKTVAEAGRENGVLAGKDLELEAVAEPIQNKKSNKKRAASDPDLAAAEN 121

Query: 326 VVD 328
           +VD
Sbjct: 122 MVD 124


>gi|348675861|gb|EGZ15679.1| hypothetical protein PHYSODRAFT_505395 [Phytophthora sojae]
          Length = 664

 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 54/217 (24%), Positives = 92/217 (42%), Gaps = 21/217 (9%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVGKKRRKERGTL---------LLALGTSN 51
           +W    G+L  E K+ D      Y+CLA           + +          LLALGTSN
Sbjct: 22  LWDVAGGALQQELKERD-HLSYRYTCLAWTQANASTSSSKKSKKRSGASDLGLLALGTSN 80

Query: 52  GDILAVDVLTGEMKWK-STGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIRE 110
           G ++  D+ TGE+K         G +  LAF  +G  L+   ++    E     G V R+
Sbjct: 81  GSVVVWDLATGEVKHTLQADASSGAVQALAFNPQGSLLYSSSSDKHVLEWSVSSGSVERK 140

Query: 111 FKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLK----FSDDVGPLQYVSASDG 166
           F+      S+ A   + +I A   S +R   L +G++  K     S  V  L++ +  DG
Sbjct: 141 FRVGSGGASALAVSADGEILAAGGSALRTFDLASGKKSRKLVSGLSSAVTQLRFAAVEDG 200

Query: 167 ---AKIIITAGYGEKHLQVWRCDISSKTVNKGPALSM 200
              ++ +  A  G + + ++   ++    +  PAL+ 
Sbjct: 201 IEASRFLFAATAGARFVNMYDLQLAE---HDAPALTF 234


>gi|159469634|ref|XP_001692968.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158277770|gb|EDP03537.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 546

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 42/145 (28%), Positives = 69/145 (47%), Gaps = 4/145 (2%)

Query: 92  GTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKF 151
           G N     +  E G V + F+AS+ PI++ A   + K      S +++  + + E   K+
Sbjct: 8   GPNCSLVSVGVEDGAVKKRFEASKYPITAIALAPDGKHVFGGGSTMQLWDVTSEERAAKY 67

Query: 152 SDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPA---LSMRHSPVAID 208
           +     ++ V+   G    I+A  GE+H+ VW    + K+  + PA   LS+    VA+D
Sbjct: 68  TGHPTEVRAVAFVPGQPHAISAATGERHVAVWDVPPAKKSKKQHPAVTTLSLEEPAVAVD 127

Query: 209 CKNSPNGEDGTVILAVAESGVAYSW 233
                 GE  +V  AV+E G AY W
Sbjct: 128 AACVSEGETFSVG-AVSEGGEAYVW 151


>gi|195122410|ref|XP_002005704.1| GI18931 [Drosophila mojavensis]
 gi|193910772|gb|EDW09639.1| GI18931 [Drosophila mojavensis]
          Length = 638

 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 58/236 (24%), Positives = 106/236 (44%), Gaps = 27/236 (11%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVG------KKRRKER--------GTLLLA 46
           IW T   +L  E+  P  +  ++  C A  ++       KK RK          G   +A
Sbjct: 30  IWDTESNTLKQEYT-PSLQ--LTGHCTALAWISINAQRIKKSRKSLQINDSTTIGKHYIA 86

Query: 47  LGTSNGDILAVDVLTGEMKWKSTG-RHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMG 105
           LGTS G ++   V  G++     G +H G +  +A+   G  L+ VG++G A        
Sbjct: 87  LGTSKGTVVLFSVTEGKIDRVLKGDKHDGAVTSIAYDNDGH-LYTVGSDGRAIVWSVAEE 145

Query: 106 EVIREFKA-SEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSA- 163
             IR++    EKP+ + A L + +  A+AS ++++  ++  E V  F+   G +  +++ 
Sbjct: 146 RSIRQWPVGPEKPL-NVAILPKSRTLAVASRQLKVYDVDKKELVETFTGHSGEINAINSF 204

Query: 164 -SDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVA--IDCKNSPNGE 216
             +  + +IT    E+ +  W+ D   K  NK    ++    VA  + C+   NG+
Sbjct: 205 EFNDNEYVITTARMERVISFWKID--RKGRNKASVCTLLMEDVAHSLACEVRDNGQ 258


>gi|66392231|ref|NP_001018176.1| WD repeat-containing protein 43 [Danio rerio]
 gi|21105462|gb|AAM34674.1|AF506230_1 KIAA0007-like protein [Danio rerio]
 gi|66267379|gb|AAH95298.1| WD repeat domain 43 [Danio rerio]
          Length = 650

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 62/244 (25%), Positives = 102/244 (41%), Gaps = 23/244 (9%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACG-------FVGKKRRK------ERGTLLLAL 47
           IW+T   +L  E+  P      + +C+  G          +KRRK         + LLAL
Sbjct: 33  IWNTESKTLQQEYV-PSAHLSAACTCVTWGPCRAVQEVPQRKRRKCDAASSAEQSDLLAL 91

Query: 48  GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
           GT+ G IL    L G++     G H G +  + +  +   L+    +   +E   + G+V
Sbjct: 92  GTAAGTILIYSTLKGDLHCTLDGGHSGPVNSVQWHPEDSVLYSGSDDTHIAEWDLKTGKV 151

Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKF---SDDVGPLQYVSA- 163
             ++KA    ISS     + K+   A   +++  LE  E   KF   S  V  L + +  
Sbjct: 152 CCKWKADRSAISSLCISPDGKMLLSAGMTIKMWDLETKEVYRKFTGHSTMVTTLCFATTR 211

Query: 164 ---SDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTV 220
              S+G    ++    ++ L VW+     K  N   + ++   P  ID + S N +D  V
Sbjct: 212 PPDSNGM-YFLSGAVHDRLLSVWQVRSDGKDKNSVVSFTLTDEPQHIDLQTS-NSKDEAV 269

Query: 221 ILAV 224
            LAV
Sbjct: 270 RLAV 273


>gi|161611380|gb|AAI55573.1| WD repeat domain 43 [Danio rerio]
 gi|190336877|gb|AAI62316.1| WD repeat domain 43 [Danio rerio]
 gi|190336879|gb|AAI62617.1| WD repeat domain 43 [Danio rerio]
          Length = 650

 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 62/244 (25%), Positives = 102/244 (41%), Gaps = 23/244 (9%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACG-------FVGKKRRK------ERGTLLLAL 47
           IW+T   +L  E+  P      + +C+  G          +KRRK         + LLAL
Sbjct: 33  IWNTESKTLQQEYV-PSAHLSAACTCVTWGPCRAVQEVPQRKRRKCDAASSAEQSDLLAL 91

Query: 48  GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
           GT+ G IL    L G++     G H G +  + +  +   L+    +   +E   + G+V
Sbjct: 92  GTAAGTILIYSTLKGDLHCTLDGGHSGPVNSVQWHPEDSVLYSGSDDTHIAEWDLKTGKV 151

Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKF---SDDVGPLQYVSA- 163
             ++KA    ISS     + K+   A   +++  LE  E   KF   S  V  L + +  
Sbjct: 152 CCKWKADRSAISSLCISPDGKMLLSAGMTIKMWDLETKEVYRKFTGHSTMVTTLCFATTR 211

Query: 164 ---SDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTV 220
              S+G    ++    ++ L VW+     K  N   + ++   P  ID + S N +D  V
Sbjct: 212 PPDSNGM-YFLSGAVHDRLLSVWQVRSDGKDKNSVVSFTLTDEPQHIDLQTS-NSKDEAV 269

Query: 221 ILAV 224
            LAV
Sbjct: 270 RLAV 273


>gi|195997585|ref|XP_002108661.1| hypothetical protein TRIADDRAFT_51818 [Trichoplax adhaerens]
 gi|190589437|gb|EDV29459.1| hypothetical protein TRIADDRAFT_51818 [Trichoplax adhaerens]
          Length = 628

 Score = 51.6 bits (122), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 39/175 (22%), Positives = 83/175 (47%), Gaps = 7/175 (4%)

Query: 42  TLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMK 101
           T LLA G+  G +  ++V  GE+  K    H   +  + ++ +   L+    +G+     
Sbjct: 94  TNLLAGGSPTGSLAILNVNKGEVCSKMENGHHAAVTCICWSSED-VLYSASEDGLVIIWD 152

Query: 102 SEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYV 161
           S   E+ +++KA + PIS+     +     LAS  +++ ++E  + + K++  +  + Y+
Sbjct: 153 SNTAEIKKQWKAEKSPISAMCTNSDGTKLILASRSIKVWNVETQQILTKYTGHINDVAYL 212

Query: 162 SA-----SDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPA-LSMRHSPVAIDCK 210
                  +  A+  I+    +K+L VW+      +V+   A L++ +  V +DCK
Sbjct: 213 KVLPSQKTSNAEYFISGAERDKYLNVWQMSTEKSSVHSPLATLALTNYSVGLDCK 267


>gi|195474843|ref|XP_002089699.1| GE19236 [Drosophila yakuba]
 gi|194175800|gb|EDW89411.1| GE19236 [Drosophila yakuba]
          Length = 626

 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 60/273 (21%), Positives = 125/273 (45%), Gaps = 28/273 (10%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVG------KKRRKERGT-------LLLAL 47
           IW+T   +L  E+  P+ +   S SC A  +V       KK RK   +       L +AL
Sbjct: 30  IWNTETNTLYQEYT-PNLQ--TSGSCTALTWVSVSSPRPKKSRKSIQSQDAGGDQLYIAL 86

Query: 48  GTSNGDILAVDVLTGEMKWKSTG-RHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGE 106
           GT  G ++   +  G++    +G  H G +  + +   G+ L+ VG +            
Sbjct: 87  GTFKGSVVIYSLAEGKISRTFSGDSHDGAVTSITYDNAGQ-LYTVGADCRVVVWSWTEER 145

Query: 107 VIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKF---SDDVGPLQYVSA 163
            I E++   +   +  +L +    A+ S ++++  ++  E V  F   S ++  +  V++
Sbjct: 146 SIAEWQVGPEKPQNIVYLQKSGTLAIGSRQLKVYDIKTKELVETFTGHSGEINTISSVAS 205

Query: 164 SDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTV-IL 222
           +DG + ++T    E+ +  W+ D   K  NK  A ++    +A  C  S   +DG + + 
Sbjct: 206 TDGTEFVLTTAKMERVICFWKIDKKGK--NKASACTLLMEDLAY-CLTSEVRDDGQLRVA 262

Query: 223 AVAESGV--AYSWDLKTVSQDEKTNPAKITVKL 253
           +V  +G+   Y  ++ ++S ++   P K+++++
Sbjct: 263 SVTRNGIIHIYLLNVNSLSAEKFVKP-KVSLQI 294


>gi|160773180|gb|AAI55127.1| WD repeat domain 43 [Danio rerio]
          Length = 650

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 61/244 (25%), Positives = 101/244 (41%), Gaps = 23/244 (9%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACG-------FVGKKRRK------ERGTLLLAL 47
           IW+T   +L  E+  P      + +C+  G          +KRRK         + LLAL
Sbjct: 33  IWNTESKTLQQEYV-PSAHLSAACTCVTWGPCRAVQEVPQRKRRKCDAASSAEQSDLLAL 91

Query: 48  GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
            T+ G IL    L G++     G H G +  + +  +   L+    +   +E   + G+V
Sbjct: 92  ATAAGTILIYSTLKGDLHCTLDGGHSGPVNSVQWHPEDSVLYSGSDDTHIAEWDLKTGKV 151

Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKF---SDDVGPLQYVSA- 163
             ++KA    ISS     + K+   A   +++  LE  E   KF   S  V  L + +  
Sbjct: 152 CCKWKADRSAISSLCISPDGKMLLSAGMTIKMWDLETKEVYRKFTGHSTMVTTLCFATTR 211

Query: 164 ---SDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTV 220
              S+G    ++    ++ L VW+     K  N   + ++   P  ID + S N +D  V
Sbjct: 212 PPDSNGM-YFLSGAVHDRLLSVWQVRSDGKDKNSVVSFTLTDEPQHIDLQTS-NSKDEAV 269

Query: 221 ILAV 224
            LAV
Sbjct: 270 RLAV 273


>gi|195332612|ref|XP_002032991.1| GM21075 [Drosophila sechellia]
 gi|194124961|gb|EDW47004.1| GM21075 [Drosophila sechellia]
          Length = 566

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 57/256 (22%), Positives = 113/256 (44%), Gaps = 24/256 (9%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFV---GKKRRKERGTLL---------LALG 48
           IW+T   +L  E+  P+ +  +S SC A  +V   G + +K R +L          +ALG
Sbjct: 30  IWNTETNTLYQEFT-PNLQ--LSGSCTALTWVSVAGSRPKKSRKSLQAQDAGDQLHIALG 86

Query: 49  TSNGDILAVDVLTGEMKWKSTG-RHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
           T  G ++   +  G++     G  H G +  + +   G  L+ VG +             
Sbjct: 87  TYKGSVVIYSLAEGKISRTLIGDSHDGPVTSITYDNAGH-LYTVGADCRVVVWSLTDERS 145

Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKF---SDDVGPLQYVSAS 164
           I E++   +   +  +L +    A+ S ++++  +   E V  F   S ++  +  V+++
Sbjct: 146 IAEWQVGPEKPQNIVYLQKSGTLAIGSRQLKVYDINTKELVETFTGHSGEINTISSVAST 205

Query: 165 DGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTV-ILA 223
           DG + ++T    E+ +  W+ D   K  NK  A ++    +A  C  S   +DG + + +
Sbjct: 206 DGTEFVLTTAKMERVICFWKIDKKGK--NKASACTLLMEDLAY-CLTSEVRDDGQLRVAS 262

Query: 224 VAESGVAYSWDLKTVS 239
           V  +G+ + + L   S
Sbjct: 263 VTRNGIIHIYLLNVNS 278


>gi|121934022|gb|AAI27561.1| Wdr43 protein [Danio rerio]
          Length = 408

 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 62/244 (25%), Positives = 102/244 (41%), Gaps = 23/244 (9%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACG-------FVGKKRRK------ERGTLLLAL 47
           IW+T   +L  E+  P      + +C+  G          +KRRK         + LLAL
Sbjct: 33  IWNTESKTLQQEYV-PSAHLSAACTCVTWGPCRAVQEVPQRKRRKCDAASSAEQSDLLAL 91

Query: 48  GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
           GT+ G IL    L G++     G H G +  + +  +   L+    +   +E   + G+V
Sbjct: 92  GTAAGTILIYSTLKGDLHCTLDGGHSGPVNSVQWHPEDSVLYSGSDDTHIAEWDLKTGKV 151

Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKF---SDDVGPLQYVSA- 163
             ++KA    ISS     + K+   A   +++  LE  E   KF   S  V  L + +  
Sbjct: 152 CCKWKADRSAISSLCISPDGKMLLSAGMTIKMWDLETKEVYRKFTGHSTMVTTLCFATTR 211

Query: 164 ---SDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTV 220
              S+G    ++    ++ L VW+     K  N   + ++   P  ID + S N +D  V
Sbjct: 212 PPDSNGM-YFLSGAVHDRLLSVWQVWSDGKDKNSVVSFTLTDEPQHIDLQTS-NSKDEAV 269

Query: 221 ILAV 224
            LAV
Sbjct: 270 RLAV 273


>gi|195581695|ref|XP_002080669.1| GD10611 [Drosophila simulans]
 gi|194192678|gb|EDX06254.1| GD10611 [Drosophila simulans]
          Length = 533

 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 59/272 (21%), Positives = 126/272 (46%), Gaps = 27/272 (9%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFV---GKKRRKERGTLL---------LALG 48
           IW+T   +L  E+  P+ +  +S SC A  +V   G + +K R +L          +ALG
Sbjct: 30  IWNTETNTLYQEFT-PNLQ--LSGSCTALTWVSVAGSRPKKSRKSLHAQDAGDQLHIALG 86

Query: 49  TSNGDILAVDVLTGEMKWKSTG-RHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
           T  G ++   +  G++    +G  H G +  + +   G  L+ VG +             
Sbjct: 87  TYKGSVVIYSLAEGKISRTLSGDSHDGPVTSITYDNAGH-LYTVGADCRVVVWSLTDERS 145

Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKF---SDDVGPLQYVSAS 164
           I E++   +   +  +L +    A+ S ++++  ++  E V  F   S ++  +  V+++
Sbjct: 146 IAEWQVGPEKPQNIVYLQKSGTLAIGSRQLKVYDIKTKELVETFTGHSGEINTISSVAST 205

Query: 165 DGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTV-ILA 223
           DG + ++T    E+ +  W+ D   K  NK  A ++    +A  C  S   +DG + + +
Sbjct: 206 DGTEFVLTTAKMERVICFWKIDKKGK--NKASACTLLMEDLAY-CLTSEVRDDGQLRVAS 262

Query: 224 VAESGV--AYSWDLKTVSQDEKTNPAKITVKL 253
           V  +G+   Y  ++ ++S ++   P K+++++
Sbjct: 263 VTRNGIIHIYLLNVNSLSAEKFVKP-KVSLQI 293


>gi|291229286|ref|XP_002734606.1| PREDICTED: will die slowly-like [Saccoglossus kowalevskii]
          Length = 649

 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 55/291 (18%), Positives = 111/291 (38%), Gaps = 39/291 (13%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFV----GKKRRKERGTL------------- 43
           +W T  G+L  ++  P      + +C+A   +    G  RRK+R +              
Sbjct: 26  LWDTTTGTLNQQYT-PSSHLSATCTCVAWSTLRESRGAPRRKKRKSEGIRKGVASSITEN 84

Query: 44  --LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMK 101
             LLALGTS G IL    + G+++ K  G H   +  L +  +  SL+    +   +E  
Sbjct: 85  VDLLALGTSVGSILLYSAVKGDLQSKMDGGHADTVNCLCWHLEEDSLYSCSDDHHITEWS 144

Query: 102 SEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYV 161
              G+V  ++K  +  I   +      +   A+  +++ +L+N E + +++    P+  +
Sbjct: 145 VSTGKVKCKWKGDKSAIHCISICPGGTVMLSAARTIKVWNLKNKELLKRYTGHASPVSQI 204

Query: 162 SASDGAK-------------------IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRH 202
           +    A                      ++    ++ L  W    +SK  N   A S+  
Sbjct: 205 NCVPMATRTISISNDEDDALHDVNGLYFLSTAVHDRVLNAWHIKSNSKDKNAVAAFSLIE 264

Query: 203 SPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQDEKTNPAKITVKL 253
            P++ D       +    I  +  +G    +D     + +K    + TV++
Sbjct: 265 EPISFDVAQCLQSDQSIFIAVLTRTGQVQVFDPTLNGKSKKPIKPRCTVQV 315


>gi|325189290|emb|CCA23810.1| conserved hypothetical protein [Albugo laibachii Nc14]
          Length = 1146

 Score = 48.9 bits (115), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 47/212 (22%), Positives = 97/212 (45%), Gaps = 15/212 (7%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLA-CGFVGKKRRKERGTLL----LALGTSNGDIL 55
           IW+ + GSL+ E ++ D      Y+C++      +   K+RG  +    +A+GT+ G IL
Sbjct: 537 IWNVSSGSLVHELRERDHLKN-KYTCISWTSSPNRSDSKKRGNSVSLGRIAVGTTQGSIL 595

Query: 56  AVDVLTGEMKW-----KSTGRHPGGLAGLAFAKKGRSLHVVGTN-GMASEMKSEMGEVIR 109
             D+  GEM       +S+  H   +  +AF  +G  L+   +    A E     G+V+R
Sbjct: 596 LWDLDRGEMTQTLGSKESSVSHSHAVQDIAFNAQGTRLYSCASQEKKALEWDVVAGKVLR 655

Query: 110 EFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDV-GPLQYVSASDGAK 168
            FK S++  +      ++ + A+  + +R+  L +G++  +F   +   +  ++ S    
Sbjct: 656 SFKVSKEGGNRLCISQDDDLLAVGGNSIRVFDLSSGKKSRQFRAGIPSTVHQLNFSSCKN 715

Query: 169 IIITAGYGEKHLQVWRCDISSKTVNKGPALSM 200
            +  +   E+ + ++  D+     +    LSM
Sbjct: 716 FLFASALSERFINIY--DLEKDVDDTAMILSM 745


>gi|260818743|ref|XP_002604542.1| hypothetical protein BRAFLDRAFT_122327 [Branchiostoma floridae]
 gi|229289869|gb|EEN60553.1| hypothetical protein BRAFLDRAFT_122327 [Branchiostoma floridae]
          Length = 657

 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 58/258 (22%), Positives = 106/258 (41%), Gaps = 37/258 (14%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFV------GKKRRKERGTL-------LLAL 47
           +W T+ G+L  E+  P      + SCLA G +       KKR+++   +       ++A+
Sbjct: 34  LWDTDSGALRQEYT-PSAHLSATCSCLAWGPLREARNEHKKRKRKSDQMSALSSLDIVAM 92

Query: 48  GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
           GT+ G IL   V+ G+++ + TG H   +  L +  +  SL     +    E      +V
Sbjct: 93  GTTAGTILLYSVVKGDLQSQLTGGHDDIVNCLCWRAEENSLFSCSDDQHIVEWTVTTAQV 152

Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDD---VGPLQYVS-- 162
             ++KA   P+ + +     K    A+  +++  LE  E +L F+     V  L + +  
Sbjct: 153 KCKWKADRGPVQTISLGPGGKTLLSAARTIKLWDLEKREVLLSFTGHASLVTSLLFATLP 212

Query: 163 ----------------ASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVA 206
                           A DG   +  A + ++ +  W+  + SK      + S+   PV 
Sbjct: 213 PPTANGRRKSVTNQEAAVDGLYFLSGATH-DRLMNAWQIKMKSKEKTAVVSFSLTEEPVY 271

Query: 207 IDCKNSPNGEDGTVILAV 224
           +D    P  +D    LAV
Sbjct: 272 VDVLR-PRQKDQPAHLAV 288


>gi|443684116|gb|ELT88135.1| hypothetical protein CAPTEDRAFT_102327 [Capitella teleta]
          Length = 564

 Score = 47.8 bits (112), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 52/248 (20%), Positives = 106/248 (42%), Gaps = 24/248 (9%)

Query: 34  KKRRKERGTL-----LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSL 88
           K+R+ E G       L+A+GT++G +L   +  G +  +  G H   +  + ++ +G+ L
Sbjct: 62  KRRKSEVGKAVHSLELVAIGTTSGSLLLYSINKGSLHSELNGGHTDQVHCVCWSSEGQRL 121

Query: 89  HVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEV 148
                +    E   E   V  ++K  + P++S      ++    A   +++  LE  E +
Sbjct: 122 FSSSEDQHIIEWSLETASVKHKWKGDKTPVTSVLVCPNDRHLLSAGRSIKLWDLETREML 181

Query: 149 LKF---SDDVGPLQYVS---------ASDG------AKIIITAGYGEKHLQVWRCDISSK 190
            KF   S +V  L  VS         A++G          ++A  G++ +  W+ +++SK
Sbjct: 182 KKFTGHSSEVISLFAVSPACNVENGAATNGHCSDIDGSYFLSAAKGDRMINAWQINLASK 241

Query: 191 TVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQDEKTNPAKIT 250
             N   + S+   P A+   + P      ++  V  SG    ++     + +K    KI+
Sbjct: 242 DKNSLASFSLPEEPCAMHL-SQPQQAKPFLMSVVTRSGQLLVFEYTLNGRAKKPLTPKIS 300

Query: 251 VKLKKADA 258
           +++   DA
Sbjct: 301 IQVATEDA 308


>gi|434404035|ref|YP_007146920.1| WD40 repeat-containing protein [Cylindrospermum stagnale PCC 7417]
 gi|428258290|gb|AFZ24240.1| WD40 repeat-containing protein [Cylindrospermum stagnale PCC 7417]
          Length = 1215

 Score = 47.4 bits (111), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 46/189 (24%), Positives = 90/189 (47%), Gaps = 17/189 (8%)

Query: 1    IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVL 60
            +W+  DG LL  +KQP    +V+   L     GK          LA+  S+GDI  +++ 
Sbjct: 992  LWNVADGRLLKTFKQP--RSIVA--DLNFSPDGKT---------LAVACSDGDIKILNLK 1038

Query: 61   TGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISS 120
            T  +  +S   H   +  ++F+  G+ L   G++       +E G ++   +     +++
Sbjct: 1039 TATLT-QSFPAHSSWVNTISFSPNGKILASGGSDSKVKLWNAENGRLLFTLEGHLSNVTN 1097

Query: 121  SAFLCEEKIFALAS--SEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEK 178
             +F  + KI A +S  S VR+ ++ENG E+      +G +  V  S   K + +AG  + 
Sbjct: 1098 ISFSPDSKILASSSDDSTVRVWNVENGLEISILEGHLGSVTSVMFSPDGKTLASAGL-DN 1156

Query: 179  HLQVWRCDI 187
             +++W+ ++
Sbjct: 1157 TIKMWKLEL 1165


>gi|390562647|ref|ZP_10244835.1| hypothetical protein NITHO_3120009 [Nitrolancetus hollandicus Lb]
 gi|390172775|emb|CCF84147.1| hypothetical protein NITHO_3120009 [Nitrolancetus hollandicus Lb]
          Length = 271

 Score = 47.4 bits (111), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 42/143 (29%), Positives = 67/143 (46%), Gaps = 6/143 (4%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
           + A G+ +G I   DV +G+      G H  G+  +AF+  GR L   G +G        
Sbjct: 85  IAASGSGDGSIRLWDVASGKEAATLPG-HAWGVWSVAFSHDGRQLLSGGGDGALRLWDVT 143

Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSE---VRILSLENGEEVLKFSDDVGPLQY 160
            GE IR F   E  + ++A +  +   AL+ S    +R+  LE GEE+ K+    G ++ 
Sbjct: 144 TGEEIRHFFGHEDWVYTAA-ISPDGRHALSGSRDETIRLWDLETGEEIRKYEGHEGTVES 202

Query: 161 VSASDGAKIIITAGYGEKHLQVW 183
           V  S   +  I+ G  E  L++W
Sbjct: 203 VVFSPDGRRFISHGEDET-LRLW 224


>gi|321478820|gb|EFX89777.1| hypothetical protein DAPPUDRAFT_303119 [Daphnia pulex]
          Length = 673

 Score = 47.0 bits (110), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 45/225 (20%), Positives = 97/225 (43%), Gaps = 6/225 (2%)

Query: 33  GKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVG 92
           G++R    G+ ++ALGT++G+IL   +   E+  +  G H G +  L F   G SL    
Sbjct: 77  GEERSGVAGSDIIALGTASGNILLYSLNRAELVQQLKGGHTGKIRSLCFNSDGSSLFSAS 136

Query: 93  TNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFS 152
           ++    E     G +   +K  +  +S    L ++    +A   ++  ++E+   V  F+
Sbjct: 137 SDQKIVEWDPVEGVMKSHWKVEKIHVSRICMLQDDCTLVIAGRSIQCWNVESRSIVATFT 196

Query: 153 DDVGPLQYVSAS--DGAKI--IITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAID 208
                + ++      G K+   ++A  G++ +  WR     K      + +++  P    
Sbjct: 197 GHTTEITHLLPVYLPGTKVGYFLSAADGDRCISAWRFVGEEKNGQSQASFTLQDDPYNFT 256

Query: 209 CKNSPNGEDGTVILAVAESGVAYSWDLKTVSQDEKTNPAKITVKL 253
                + +D  V+ AV  +G+ + ++     + +K  P +  VK+
Sbjct: 257 VSKPVSDKDPVVLAAVTVTGILHIFEHHLNGRSKK--PLQPVVKI 299


>gi|427783733|gb|JAA57318.1| Hypothetical protein [Rhipicephalus pulchellus]
          Length = 637

 Score = 47.0 bits (110), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 64/304 (21%), Positives = 121/304 (39%), Gaps = 38/304 (12%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACG------FVGKKRRKERGTL----------- 43
           +W T+ G L  E+  P        +CL+ G      F   K+RK + T            
Sbjct: 28  LWDTSTGRLKQEYT-PSAHLSAKCTCLSWGPTRQQAFAAHKKRKRQKTTDDAGDVLSEAS 86

Query: 44  LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
           LLA+GT+ G +L   +  G++  + TG H   + G+++     SL  V  +         
Sbjct: 87  LLAMGTAEGTVLLYSLTRGDLHSELTGGHTAAVNGVSWHASSDSLFSVSDDQHIVHWSVS 146

Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSA 163
             +V  ++KA    + + A +    +   ASS ++  +++    V+KF+  V  ++ +  
Sbjct: 147 SCKVKSKWKADRHSVYAVAAVDTNTVLT-ASSSIKWWNVDTKSIVMKFTGHVTEVRQI-- 203

Query: 164 SDGAKIIITAGYGEKHLQVWRC-----DISSKTVNKGPALS------MRHSPVAIDCKNS 212
                + + +  GE       C      +S+  + KG   S      +    V I   +S
Sbjct: 204 ---LPVFVPSKSGEVRTYFLSCAANDRQVSAWHLEKGAGSSSLANFVLSEEAVHISVSHS 260

Query: 213 PNGEDGTVILAVAESGVAYSWDLKTVSQDEKTNPAKITVKLKKADADQQNSVNVKKSRTS 272
                   + AV +SG  + ++L+   + +     K TV++     + QN    K    +
Sbjct: 261 SENNKTAHLSAVTKSGALHIFELQLNGRCKTPMQPKYTVQVA---TETQNGRAPKPQPVA 317

Query: 273 IIAA 276
           I+AA
Sbjct: 318 ILAA 321


>gi|257094183|ref|YP_003167824.1| WD-40 repeat-containing protein [Candidatus Accumulibacter phosphatis
            clade IIA str. UW-1]
 gi|257046707|gb|ACV35895.1| WD-40 repeat protein [Candidatus Accumulibacter phosphatis clade IIA
            str. UW-1]
          Length = 1737

 Score = 46.6 bits (109), Expect = 0.016,   Method: Composition-based stats.
 Identities = 42/178 (23%), Positives = 80/178 (44%), Gaps = 21/178 (11%)

Query: 67   KSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCE 126
            +S   H   +A +AF+  GR L     +       +E G+ IR F     P++S AF  +
Sbjct: 1330 RSFAGHQSVVASVAFSPDGRHLVSGSWDDSLLLWNAETGQEIRSFVGHHGPVASVAFSPD 1389

Query: 127  EK--IFALASSEVRILSLENGEEVLKFSDDVGPLQYV-SASDGAKIIITAGYGEKHLQVW 183
             +  +       +R+   E G+E+  ++   GP+  V S++DG +++  +G  +  L++W
Sbjct: 1390 GRRLLSGTWDQTLRLWDAETGQEIRSYTGHQGPVAGVASSADGRRLL--SGSDDHTLRLW 1447

Query: 184  RCDISSK----TVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKT 237
              +   +      ++GPA S+  SP            DG  +L+ ++      WD +T
Sbjct: 1448 DAETGQEIRFFAGHQGPATSVAFSP------------DGRRLLSGSDDHTLRLWDAET 1493



 Score = 46.2 bits (108), Expect = 0.021,   Method: Composition-based stats.
 Identities = 50/179 (27%), Positives = 86/179 (48%), Gaps = 14/179 (7%)

Query: 58   DVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKP 117
            D  TGE + +S   H GG+A +AF+  GR L     +       +E G+ IR F   +  
Sbjct: 1112 DAETGE-EIRSFAGHQGGVASVAFSPDGRRLLSGSDDQTLRLWDAETGQEIRSFTGHQGG 1170

Query: 118  ISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVSAS-DGAKIIITAG 174
            + S AF  + +     S +  +R+   E G+E+  F+     +  V+ S DG +++  +G
Sbjct: 1171 VLSVAFSPDGRRLLSGSRDQTLRLWDAETGQEIRSFAGHQSAVTSVALSPDGRRLL--SG 1228

Query: 175  YGEKHLQVWRC----DISSKTVNKGPALSMRHSPVAIDCKNSPNGE-DGTVILAVAESG 228
              ++ L++W      +I S T ++G   S+  SP   D +   +G  D T+ L  AE+G
Sbjct: 1229 SHDRTLRLWDAETGQEIRSFTGHQGGVASVAFSP---DGRRLLSGSFDQTLRLWDAETG 1284



 Score = 44.7 bits (104), Expect = 0.068,   Method: Composition-based stats.
 Identities = 45/189 (23%), Positives = 84/189 (44%), Gaps = 18/189 (9%)

Query: 3    STNDGSLLAEWKQPDGEPVVSYS-----CLACGFVGKKRRKERGTLLLALGTSNGDILAV 57
            S +D   L  W    G+ + S++       +  F    RR       L  G+ +  +   
Sbjct: 1479 SGSDDHTLRLWDAETGQEIRSFAGHQDWVTSVAFSPDGRR-------LLSGSHDHTLRLW 1531

Query: 58   DVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKP 117
            D  +G+ + +S   H G +  +AF+  GR L     +       +E G+ IR F   + P
Sbjct: 1532 DAESGQ-EIRSFAGHQGWVLSVAFSPDGRRLLSGSDDQTLRLWDAESGQEIRSFAGHQGP 1590

Query: 118  ISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVSAS-DGAKIIITAG 174
            ++S AF  + +     S +  +R+   E G+E+  F+   GP+  V+ S DG +++  +G
Sbjct: 1591 VTSVAFSPDGRRLLSGSRDQTLRLWDAETGQEIRSFAGHQGPVASVAFSPDGRRLL--SG 1648

Query: 175  YGEKHLQVW 183
              +  L++W
Sbjct: 1649 SHDGTLRLW 1657


>gi|195430950|ref|XP_002063511.1| GK21367 [Drosophila willistoni]
 gi|194159596|gb|EDW74497.1| GK21367 [Drosophila willistoni]
          Length = 653

 Score = 46.6 bits (109), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 54/243 (22%), Positives = 103/243 (42%), Gaps = 26/243 (10%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFV---------GKKRRKERG-------TLL 44
           IW T   SL  E+        +S  C A  +V          KK RK +         L 
Sbjct: 30  IWDTEANSLKQEFT---ANLYLSGGCTALTWVSVGAPPTPRAKKSRKSQNDSVVGSNKLY 86

Query: 45  LALGTSNGDILAVDVLTGEMKWKSTG-RHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
           +ALGT  G +    +  G+++   +G  H G +  +A+ +    L+ VGT+  A      
Sbjct: 87  IALGTLKGIVSLYSLAEGQIERTLSGDNHDGPVTSIAYNQSKGHLYTVGTDCRALVWSIA 146

Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKF---SDDVGPLQY 160
               I +++   +   +  +L + +  A+ +  ++I  +E  E V  F   S D+  +  
Sbjct: 147 EERCIADWQVGPEKPHNIVYLAKSRSLAVGARSLKIFDVETKELVETFTGHSSDINAMTS 206

Query: 161 VSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTV 220
           ++ +D  + I+T    E+ +  W+ D   +  NK  A ++    +A    +  + EDG++
Sbjct: 207 IADND-LEYILTTARMERVISFWKIDKRGR--NKASACTLLMEDLAYCLASEVHTEDGSL 263

Query: 221 ILA 223
            +A
Sbjct: 264 RVA 266


>gi|403414246|emb|CCM00946.1| predicted protein [Fibroporia radiculosa]
          Length = 359

 Score = 46.6 bits (109), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 47/184 (25%), Positives = 85/184 (46%), Gaps = 21/184 (11%)

Query: 72  HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFA 131
           H   ++ L F+  G  L     +G+     ++ GE+I   +  ++ IS  A+  + +  A
Sbjct: 64  HRRSISSLKFSFDGTKLASSAADGLVKIWDADSGEIIHTLQGHDEGISDIAWSPDNEFLA 123

Query: 132 LASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDIS- 188
            AS +  +RI S+E    V         +  V+ +  + ++++ G+ E  ++VW  D++ 
Sbjct: 124 SASDDKTIRIWSMETMSSVNVLKGHTNFVFCVNFNPKSNLLVSGGFDET-VRVW--DVAR 180

Query: 189 SKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWD------LKTVSQDE 242
            +T+   PA S    PV     N     DGT+I + A  G+   WD      LKT++ D+
Sbjct: 181 GRTLKTLPAHS---DPVTAVTFN----HDGTLIASCAMDGLIRIWDSESGQCLKTLADDD 233

Query: 243 KTNP 246
             NP
Sbjct: 234 --NP 235


>gi|195425626|ref|XP_002061096.1| GK10631 [Drosophila willistoni]
 gi|194157181|gb|EDW72082.1| GK10631 [Drosophila willistoni]
          Length = 638

 Score = 46.2 bits (108), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 47/217 (21%), Positives = 100/217 (46%), Gaps = 10/217 (4%)

Query: 43  LLLALGTSNGDILAVDVLTGEMKWKSTG-RHPGGLAGLAFAKKGRSLHVVGTNGMASEMK 101
           L +ALGT  G +    +  G+++   +G  H G +  +A+ +    L+ VG +  A    
Sbjct: 342 LYIALGTMGGIVSLYSLAEGQIERTLSGDNHDGPVTSIAYNQSKGHLYTVGEDCRALVWS 401

Query: 102 SEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKF---SDDVGPL 158
                 I +++   +   +  +L + +  A+ +  ++I  +E  E V  F   S D+  +
Sbjct: 402 ITEERCIADWQVGPEKPQNVVYLAKSRSLAVGARSLKIFDVETKELVETFTGHSSDINAM 461

Query: 159 QYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDG 218
             ++ +D  + I+T  + E+ +  W+ D   K  NK  A ++R    A         EDG
Sbjct: 462 TSIADND-LEFILTTAHMERVISFWKIDKRGK--NKASACTLRMEDPAYCLACEVRTEDG 518

Query: 219 TVILA-VAESGV-AYSWDLKTVSQDEKTNPAKITVKL 253
           ++ +A V  +G+  Y   +  +S ++   P K+++++
Sbjct: 519 SLRVASVTRNGIHIYLLKINGLSSEKHIKP-KVSLQI 554


>gi|405971706|gb|EKC36529.1| WD repeat-containing protein 43 [Crassostrea gigas]
          Length = 630

 Score = 46.2 bits (108), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 49/258 (18%), Positives = 106/258 (41%), Gaps = 30/258 (11%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACG------FVGKKRRKERGTL-------LLAL 47
           +W  ++G L  E+  P      + +CL+ G        G+K+R++ G +       L+A+
Sbjct: 27  LWEADNGVLKQEYT-PSAHLSATCTCLSWGPKRNLDTPGRKKRRKSGGVAITEQYDLIAV 85

Query: 48  GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
           GT+ G IL   ++ G++K      H   ++ + +  +  ++     +          G+V
Sbjct: 86  GTTAGSILIYSIVKGDVKTIMEEGHSDSVSDVCWHPENNTIFSCSCDRHIIHWDVISGKV 145

Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDV-----------G 156
             ++K     I S   LC +     +   +++  LE    + KF+               
Sbjct: 146 KHKWKGDSGSIHSIC-LCAQNHLLSSGRSIKLWDLETYSVLKKFTGHATEVFRLLPILSN 204

Query: 157 PLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGE 216
           P +   + +G    ++A   ++ + VW+ + + K  +     S+   PV +D + +   +
Sbjct: 205 PTELPDSPEGV-YFLSAALNDRLVNVWQVNTAVKDKSSTATFSLPEDPVMLDLRKN---Q 260

Query: 217 DGTVILAVAESGVAYSWD 234
           D  ++ AV  SG    +D
Sbjct: 261 DEVILAAVTRSGQVLIFD 278


>gi|427737264|ref|YP_007056808.1| WD40 repeat-containing protein [Rivularia sp. PCC 7116]
 gi|427372305|gb|AFY56261.1| WD40 repeat-containing protein [Rivularia sp. PCC 7116]
          Length = 927

 Score = 45.8 bits (107), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 56/204 (27%), Positives = 84/204 (41%), Gaps = 24/204 (11%)

Query: 43  LLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAG---LAFAKKGRSLHVVGTNGMASE 99
           L +A G S  +I   D+ +GE K +    H G  AG   L F+  G+ L   G +     
Sbjct: 695 LTIASGDSKNNIYIWDINSGE-KIRILEGHTGRFAGVNSLKFSPDGQILASAGGDKTVKL 753

Query: 100 MKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGE--EVLKFSDDV 155
                G  I   K  E+ +SS AF  + KIFA  S++       L  GE  E  K +D++
Sbjct: 754 WNLNTGAEIMTLKGHERWVSSVAFSPDGKIFASGSADETANFWDLTTGEILETFKHNDEI 813

Query: 156 GPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVN-KGPALSMRHSPVAIDCKNSPN 214
             + +    +    I   G  +  +++W      +    KG   S+R+         SPN
Sbjct: 814 RSIAFSPNGE----IFATGSNDNTIKLWSVSNKEEVCTLKGHKRSIRYITF------SPN 863

Query: 215 GEDGTVILAVAESGVAYS-WDLKT 237
           GE    ILA +  G     WD+ T
Sbjct: 864 GE----ILATSSYGNDIKLWDMNT 883


>gi|84995168|ref|XP_952306.1| pre-mRNA splicing factor (PRP17 homologue) [Theileria annulata
           strain Ankara]
 gi|65302467|emb|CAI74574.1| pre-mRNA splicing factor (PRP17 homologue), putative [Theileria
           annulata]
          Length = 693

 Score = 45.4 bits (106), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 32/149 (21%), Positives = 72/149 (48%), Gaps = 5/149 (3%)

Query: 41  GTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEM 100
           G +LL+ G  +G +   D+ + +      G H  G+  ++F + G   +    +      
Sbjct: 417 GNVLLSCGM-DGFVKVWDINSQKCLRNYKG-HAKGVRDVSFIENGNKFYSASFDSNVILW 474

Query: 101 KSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLE--NGEEVLKFSDDVGPL 158
            +E G+VI  +K  + P   + +  ++ IF +     +++ ++  +GE VL++S+ +G +
Sbjct: 475 DTEYGKVIGVYKIEKTPYCLTPYPLDDNIFLVGGDSNKVIQMDARSGEMVLEYSEHMGCV 534

Query: 159 QYVSASDGAKIIITAGYGEKHLQVWRCDI 187
             ++  D  + I+T    +K + VW  +I
Sbjct: 535 NTITFIDNNRRIVTTA-DDKRVLVWDYNI 562


>gi|296269459|ref|YP_003652091.1| pentapeptide repeat-containing protein [Thermobispora bispora DSM
            43833]
 gi|296092246|gb|ADG88198.1| pentapeptide repeat protein [Thermobispora bispora DSM 43833]
          Length = 1807

 Score = 45.4 bits (106), Expect = 0.036,   Method: Composition-based stats.
 Identities = 53/202 (26%), Positives = 85/202 (42%), Gaps = 25/202 (12%)

Query: 44   LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
            LLA G   G +   D  TGE+  + TG HPGG+  +AF   G +L    T G        
Sbjct: 1307 LLAAGDGQGVLCLWDPYTGELLHRLTG-HPGGICAIAFHPDGHALVSGDTEGTVRLWDPH 1365

Query: 104  MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPL-QY 160
             G+++      E  I   AF    ++F    SE  VR+ S  +GE++ + S   G +  +
Sbjct: 1366 TGQLMGTLSGHEGAIYHVAFSPSGELFVTGDSEGVVRVWS-ASGEQLAELSGHRGSVWPF 1424

Query: 161  VSASDGAKIIITAGYGEKHLQVW-----RCDISSKTVNKGPALSMRHSPVAIDCKNSPNG 215
                 G +++ ++  G   +++W     RC    + V +G     R + VA         
Sbjct: 1425 AFHPKGHRLVTSSSDG--MIRLWDPRTGRC----RRVLRGHG--RRINSVAFSA------ 1470

Query: 216  EDGTVILAVAESGVAYSWDLKT 237
             DG ++ A    G    WD +T
Sbjct: 1471 -DGRMLAACGSDGYVRLWDPQT 1491


>gi|186681444|ref|YP_001864640.1| hypothetical protein Npun_F0966 [Nostoc punctiforme PCC 73102]
 gi|186463896|gb|ACC79697.1| WD-40 repeat protein [Nostoc punctiforme PCC 73102]
          Length = 1211

 Score = 45.4 bits (106), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 42/170 (24%), Positives = 83/170 (48%), Gaps = 20/170 (11%)

Query: 72  HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFA 131
           H G ++ ++F+  G+++    ++  A  + +  G++++EFK  ++ ++S +F  + K  A
Sbjct: 763 HQGEVSSVSFSPDGKTIATASSDKTA-RLWNLQGQLLQEFKGHQRGVNSVSFSLDGKTIA 821

Query: 132 LASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVW---RCD 186
            ASS+   R+ +L+ G+ + +F    G +  VS S   K I T+   +K  ++W   R  
Sbjct: 822 TASSDKTARLWNLQ-GQLLQEFKGHQGLVLSVSFSPDGKTIATSS-DDKTARLWNLQRQL 879

Query: 187 ISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLK 236
           +     ++G   S+  SP            DG  I   +E G A  W+L+
Sbjct: 880 LQEFKGHQGEVSSVSFSP------------DGKTIATASEDGTAQLWNLQ 917



 Score = 38.1 bits (87), Expect = 5.4,   Method: Compositional matrix adjust.
 Identities = 29/111 (26%), Positives = 61/111 (54%), Gaps = 5/111 (4%)

Query: 79   LAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSE-- 136
            ++F+  G+++    ++  A ++ +  G++++EFK  +  + S +F  + K  A ASS+  
Sbjct: 1064 VSFSPDGKTIATASSDNTA-QLWNLQGQLLQEFKGHQGLVLSVSFSPDGKTIATASSDNT 1122

Query: 137  VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDI 187
             R+ +L+ G+ + +F      +  VS S   K I TA Y +K +++W  D+
Sbjct: 1123 ARLWNLQ-GQLLQEFKGHQRGVNSVSFSPDGKTIATASY-DKTIKLWDLDL 1171


>gi|24651910|ref|NP_610421.1| CG30349 [Drosophila melanogaster]
 gi|21645568|gb|AAF59028.2| CG30349 [Drosophila melanogaster]
 gi|281183429|gb|ADA53583.1| FI03455p [Drosophila melanogaster]
          Length = 620

 Score = 45.4 bits (106), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 58/273 (21%), Positives = 124/273 (45%), Gaps = 28/273 (10%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFV---GKKRRKERGTLL----------LAL 47
           IW+T   +L  E+  P+ +  +S SC A  +V   G + +K R +L           +AL
Sbjct: 30  IWNTETNTLYQEFT-PNLQ--LSGSCTALTWVLVAGSRPKKSRKSLQAQDAAGDQLHIAL 86

Query: 48  GTSNGDILAVDVLTGEMKWKSTG-RHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGE 106
           GT  G ++   +  G++    +G  H G +  +     G  L+ VG +            
Sbjct: 87  GTYKGSVVIYSLAEGKISRTLSGDSHDGPVTSITNDNAGH-LYTVGADCRVVVWSLTDER 145

Query: 107 VIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKF---SDDVGPLQYVSA 163
            I E++   +   +  +L +    A+   ++++  ++  E V  F   S ++  +  V++
Sbjct: 146 SIAEWQVGPEKPQNIVYLQKSGTLAIGYRQLKVYDIKTKELVETFTGHSGEINTISSVAS 205

Query: 164 SDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTV-IL 222
           +DG + ++T    E+ +  W+ D   K  NK  A ++    +A  C  S   +DG + + 
Sbjct: 206 TDGTEFVLTTAKMERVICFWKIDKKGK--NKASACTLLMEDLAY-CLTSEVRDDGQLRVA 262

Query: 223 AVAESGV--AYSWDLKTVSQDEKTNPAKITVKL 253
           +V  +G+   Y  ++ ++S ++   P K+++++
Sbjct: 263 SVTRNGIIHIYLLNVNSLSAEKFVKP-KVSLQI 294


>gi|19527533|gb|AAL89881.1| RE25260p [Drosophila melanogaster]
          Length = 620

 Score = 45.4 bits (106), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 58/273 (21%), Positives = 124/273 (45%), Gaps = 28/273 (10%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFV---GKKRRKERGTLL----------LAL 47
           IW+T   +L  E+  P+ +  +S SC A  +V   G + +K R +L           +AL
Sbjct: 30  IWNTETNTLYQEFT-PNLQ--LSGSCTALTWVLVAGSRPKKSRKSLQAQDAAGDQLHIAL 86

Query: 48  GTSNGDILAVDVLTGEMKWKSTG-RHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGE 106
           GT  G ++   +  G++    +G  H G +  +     G  L+ VG +            
Sbjct: 87  GTYKGSVVIYSLAEGKISRTLSGDSHDGPVTSITNDNAGH-LYTVGADCRVVVWSLTDER 145

Query: 107 VIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKF---SDDVGPLQYVSA 163
            I E++   +   +  +L +    A+   ++++  ++  E V  F   S ++  +  V++
Sbjct: 146 SIAEWQVGPEKPQNIVYLQKSGTLAIGYRQLKVYDIKTKELVETFTGHSGEINTISSVAS 205

Query: 164 SDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTV-IL 222
           +DG + ++T    E+ +  W+ D   K  NK  A ++    +A  C  S   +DG + + 
Sbjct: 206 TDGTEFVLTTAKMERVICFWKIDKKGK--NKASACTLLMEDLAY-CLTSEVRDDGQLRVA 262

Query: 223 AVAESGV--AYSWDLKTVSQDEKTNPAKITVKL 253
           +V  +G+   Y  ++ ++S ++   P K+++++
Sbjct: 263 SVTRNGIIHIYLLNVNSLSAEKFVKP-KVSLQI 294


>gi|117558475|gb|AAI25999.1| LOC398447 protein [Xenopus laevis]
          Length = 687

 Score = 45.4 bits (106), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 56/254 (22%), Positives = 103/254 (40%), Gaps = 28/254 (11%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACG-------FVGKKRRKERGTL------LLAL 47
           +W T  G L  E+  P      + +CLA            KK+RK   +       LL +
Sbjct: 56  VWDTQGGGLRKEYV-PSAHLSATCTCLAWAQGRADKETHQKKKRKSEASDRAGHYDLLTI 114

Query: 48  GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
           GT+ G IL  +++ GE++ K  G H G +  + +  +  SL+    +    E  ++  +V
Sbjct: 115 GTATGTILLYNIVKGELQSKLVGGHDGRVNCVRWHHENNSLYSCSEDKHIIEWNTQTCKV 174

Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKF---SDDVGPLQY--VS 162
             ++K     +SS     + KI   A   +++  LE  E   +F   S  V  L +  V 
Sbjct: 175 KCKWKGDNSSVSSLCISPDGKILLSAGRTIKLWDLETKEVYRQFTGHSTAVTSLMFLTVQ 234

Query: 163 ASDGAKII--------ITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPN 214
              G+  I        ++    ++ + VW+   S K  +   + ++   P+ +D   S +
Sbjct: 235 PPQGSHPIKDSTGLYFLSGAVHDRLISVWQVR-SEKEKSSVLSFTLTEPPLFVDLSPSES 293

Query: 215 GEDGTVILAVAESG 228
            E+   +  V   G
Sbjct: 294 KEEPLKLAVVCRDG 307


>gi|27882005|gb|AAH43856.1| LOC398447 protein, partial [Xenopus laevis]
          Length = 665

 Score = 45.4 bits (106), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 56/254 (22%), Positives = 103/254 (40%), Gaps = 28/254 (11%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACG-------FVGKKRRKERGTL------LLAL 47
           +W T  G L  E+  P      + +CLA            KK+RK   +       LL +
Sbjct: 35  VWDTQGGGLRKEY-VPSAHLSATCTCLAWAQGRADKETHQKKKRKSEASDRAGHYDLLTI 93

Query: 48  GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
           GT+ G IL  +++ GE++ K  G H G +  + +  +  SL+    +    E  ++  +V
Sbjct: 94  GTATGTILLYNIVKGELQSKLVGGHDGRVNCVRWHHENNSLYSCSEDKHIIEWNTQTCKV 153

Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKF---SDDVGPLQY--VS 162
             ++K     +SS     + KI   A   +++  LE  E   +F   S  V  L +  V 
Sbjct: 154 KCKWKGDNSSVSSLCISPDGKILLSAGRTIKLWDLETKEVYRQFTGHSTAVTSLMFLTVQ 213

Query: 163 ASDGAKII--------ITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPN 214
              G+  I        ++    ++ + VW+   S K  +   + ++   P+ +D   S +
Sbjct: 214 PPQGSHPIKDSTGLYFLSGAVHDRLISVWQVR-SEKEKSSVLSFTLTEPPLFVDLSPSES 272

Query: 215 GEDGTVILAVAESG 228
            E+   +  V   G
Sbjct: 273 KEEPLKLAVVCRDG 286


>gi|348515929|ref|XP_003445492.1| PREDICTED: WD repeat-containing protein 43 [Oreochromis niloticus]
          Length = 654

 Score = 45.1 bits (105), Expect = 0.043,   Method: Compositional matrix adjust.
 Identities = 55/248 (22%), Positives = 102/248 (41%), Gaps = 22/248 (8%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFV-----GKKRRK--------ERGTLLLAL 47
           IW+T+  +L  E+  P      + +C+A G       G +R+K        E    LLA+
Sbjct: 35  IWNTDSKTLHQEY-VPSAHLSATCTCIAWGPCRTVKEGPQRKKRKSEAVQVEEKADLLAM 93

Query: 48  GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
           GT+ G +L      G +     G H GG+  + +  +   L+    +    E   + G+ 
Sbjct: 94  GTAAGTVLIYSTAKGALHCTLDGAHSGGVNCVQWHPEESLLYSGSDDTNIVEWDLQTGKT 153

Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKF---SDDVGPLQYVSA- 163
             ++KA    ++S     + K+   A   +++  L+  E   KF   S  V  L++ +  
Sbjct: 154 RSKWKADRAAVTSLCVSPDGKLLLSAGHIIKMWDLDTKEVYRKFTGHSTAVTTLRFATTR 213

Query: 164 ---SDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTV 220
              S+G   +  A + ++ L VW+     K  N   + ++   P  ID   S + E+   
Sbjct: 214 PPDSNGLYFLSGAAH-DRLLSVWQVREDGKDKNSVVSFTLTDEPQHIDLVTSNSKEEAVR 272

Query: 221 ILAVAESG 228
           +  V + G
Sbjct: 273 LAVVCKDG 280


>gi|50927202|gb|AAH79723.1| LOC398447 protein, partial [Xenopus laevis]
          Length = 667

 Score = 45.1 bits (105), Expect = 0.045,   Method: Compositional matrix adjust.
 Identities = 56/254 (22%), Positives = 103/254 (40%), Gaps = 28/254 (11%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACG-------FVGKKRRKERGTL------LLAL 47
           +W T  G L  E+  P      + +CLA            KK+RK   +       LL +
Sbjct: 37  VWDTQGGGLRKEYV-PSAHLSATCTCLAWAQGRADKETHQKKKRKSEASDRAGHYDLLTI 95

Query: 48  GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
           GT+ G IL  +++ GE++ K  G H G +  + +  +  SL+    +    E  ++  +V
Sbjct: 96  GTATGTILLYNIVKGELQSKLVGGHDGRVNCVRWHHENNSLYSCSEDKHIIEWNTQTCKV 155

Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKF---SDDVGPLQY--VS 162
             ++K     +SS     + KI   A   +++  LE  E   +F   S  V  L +  V 
Sbjct: 156 KCKWKGDNSSVSSLCISPDGKILLSAGRTIKLWDLETKEVYRQFTGHSTAVTSLMFLTVQ 215

Query: 163 ASDGAKII--------ITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPN 214
              G+  I        ++    ++ + VW+   S K  +   + ++   P+ +D   S +
Sbjct: 216 PPQGSHPIKDSTGLYFLSGAVHDRLISVWQVR-SEKEKSSVLSFTLTEPPLFVDLSPSES 274

Query: 215 GEDGTVILAVAESG 228
            E+   +  V   G
Sbjct: 275 KEEPLKLAVVCRDG 288


>gi|425449150|ref|ZP_18828993.1| Genome sequencing data, contig C309 (fragment) [Microcystis
           aeruginosa PCC 7941]
 gi|389764351|emb|CCI09334.1| Genome sequencing data, contig C309 (fragment) [Microcystis
           aeruginosa PCC 7941]
          Length = 277

 Score = 45.1 bits (105), Expect = 0.055,   Method: Compositional matrix adjust.
 Identities = 50/195 (25%), Positives = 87/195 (44%), Gaps = 10/195 (5%)

Query: 45  LALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEM 104
           LA G+S+  I   DV TG+     TG H   ++ ++F+  G++L     +G       E 
Sbjct: 52  LATGSSDNTIKLWDVETGQQIRTLTG-HNSYVSSVSFSSDGKTLVSGSDDGTIKLWNVET 110

Query: 105 GEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVS 162
           G+ IR        ++S++F  + K  A  S +  +++ ++E GEE+   S   G +  VS
Sbjct: 111 GQEIRTLSGHNYSVNSASFSNDGKTLATGSRDDTIKLWNVETGEEIRTLSGHNGYVNSVS 170

Query: 163 ASDGAKIIITAGYGEKHLQVWRCDISSKTVN-KGPALSMRHSPVAIDCKNSPNG-EDGTV 220
            S   K + T  + +  +++W  +   +     G   S+     + D K    G +DGT+
Sbjct: 171 FSPDGKTLATGSW-DSTIKLWNVETGEEIRTLSGHNYSVNSVSFSPDGKTLATGSDDGTI 229

Query: 221 ILAVAESGVAYSWDL 235
            L   E    Y W L
Sbjct: 230 KLWNGE----YGWGL 240


>gi|148657484|ref|YP_001277689.1| protein kinase [Roseiflexus sp. RS-1]
 gi|148569594|gb|ABQ91739.1| protein kinase [Roseiflexus sp. RS-1]
          Length = 1242

 Score = 44.7 bits (104), Expect = 0.058,   Method: Compositional matrix adjust.
 Identities = 44/169 (26%), Positives = 74/169 (43%), Gaps = 14/169 (8%)

Query: 72   HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFA 131
            H   +A +AF+  GR     G + +    + E G VI + +     + S  F   +  +A
Sbjct: 881  HTDSIASVAFSPDGRYALSGGGDRVIRLWEIENGRVICKLEGHTLAVYSVVF-SPDGHYA 939

Query: 132  LASS---EVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDIS 188
            L+ S    +R+  +  G EV +F   V  +  V+ S   + II+AG+ E  +++W     
Sbjct: 940  LSGSWDKTIRLWEVATGREVNRFDRHVNFVNSVAFSPDGRYIISAGWDET-IRLW----- 993

Query: 189  SKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKT 237
              T        ++ + V      SP   DG  IL+ +E G    WD+KT
Sbjct: 994  -DTTTGHEMYCLKDTDVIWSVCFSP---DGLYILSGSEDGSVKLWDIKT 1038


>gi|298248183|ref|ZP_06971988.1| WD40 repeat, subgroup [Ktedonobacter racemifer DSM 44963]
 gi|297550842|gb|EFH84708.1| WD40 repeat, subgroup [Ktedonobacter racemifer DSM 44963]
          Length = 433

 Score = 44.7 bits (104), Expect = 0.065,   Method: Compositional matrix adjust.
 Identities = 36/131 (27%), Positives = 61/131 (46%), Gaps = 3/131 (2%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
           +LA G+ +G I   D   G +    TG H GG+  LA++  G  L   G +        +
Sbjct: 159 ILASGSIDGSIKLWDPSRGHLLHTLTG-HGGGVFALAWSPSGGLLVSGGQDSAIKLWDPQ 217

Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYV 161
            G+++R  +     + S A   + +    AS++  VR+  L+ G  +L F D   PL  V
Sbjct: 218 SGKLLRSLEGHGNAVRSLALGTDGQTLVSASTDQTVRLWDLQTGRLLLPFIDHPSPLYSV 277

Query: 162 SASDGAKIIIT 172
           + S   +II++
Sbjct: 278 AMSPNHQIIVS 288


>gi|126658553|ref|ZP_01729700.1| Peptidase C14, caspase catalytic subunit p20 [Cyanothece sp. CCY0110]
 gi|126620140|gb|EAZ90862.1| Peptidase C14, caspase catalytic subunit p20 [Cyanothece sp. CCY0110]
          Length = 1523

 Score = 44.7 bits (104), Expect = 0.068,   Method: Compositional matrix adjust.
 Identities = 49/196 (25%), Positives = 91/196 (46%), Gaps = 12/196 (6%)

Query: 44   LLALGTSNGDILAV-DVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
            +LA  + N +I+ + ++ TG++  ++   H  G+  ++F+  G++L     +        
Sbjct: 940  ILASSSINHNIIEIWNLETGKVI-RTLKEHNEGVQSVSFSFDGKTLASGSNDNTIKLWDV 998

Query: 103  EMGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEV--LKFSDDVGPL 158
            + GEVI   K   +PISS +F    KI A  S +  V++ +LE GE +  LK  +D G +
Sbjct: 999  KTGEVIHTLKGHNEPISSVSFSPNGKILASGSDDNTVKLWNLETGELIRTLKGHNDSGFV 1058

Query: 159  QYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKG------PALSMRHSPVAIDCKNS 212
              +S S   +++ +   G K+  +   +I +  + K          S+  SP      + 
Sbjct: 1059 TSLSFSPNGQLLASGSNGSKNGSIILWNIKTGQIIKNLENREVTIWSVSFSPDGKSLASG 1118

Query: 213  PNGEDGTVILAVAESG 228
               +D TV L   E+G
Sbjct: 1119 SGSDDNTVKLWDIETG 1134


>gi|145515818|ref|XP_001443803.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124411203|emb|CAK76406.1| unnamed protein product [Paramecium tetraurelia]
          Length = 2929

 Score = 44.3 bits (103), Expect = 0.085,   Method: Composition-based stats.
 Identities = 59/270 (21%), Positives = 116/270 (42%), Gaps = 26/270 (9%)

Query: 44   LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
            LLA G+ +  I+   + TG++  K  G H   +  +AF+  G  L     + +     ++
Sbjct: 2385 LLASGSEDQSIILWHIKTGKLITKLLG-HSDSVQSVAFSCDGSRLASASGDYLVKIWDTK 2443

Query: 104  MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYV 161
            +G+ I E       +    F    +I A A  +  +++    +G++++K       +Q +
Sbjct: 2444 LGQEILELSEHNDSLQCVIFSPNGQILASAGGDYIIQLWDAVSGQDIMKLEGHTDAVQSI 2503

Query: 162  SASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMR----HSPVAIDCKNSPNGED 217
            +     K++  +G  +  +++W  DI++ T        M+    H+        SPNGE 
Sbjct: 2504 AFYPDGKVL-ASGSSDHSIRIW--DITTGT-------EMQKIDGHTGCVYSIAFSPNGE- 2552

Query: 218  GTVILAVAESGVAYSWDLKTVSQDEKTNPAKITVKLKKADADQQNSVNVKKSRTSIIAAR 277
               +++ +E      W+ K++ + ++ N   + +       DQQ S+ +     SI   R
Sbjct: 2553 --ALVSASEDNSILLWNTKSIKEMQQINGDTMWIYSVAQSPDQQ-SLALACIDYSI---R 2606

Query: 278  LNDL--EADGQKTITGLHDLESEAATASAQ 305
            L DL  E + QK I     +E  A +A  Q
Sbjct: 2607 LWDLKSEKERQKLIGHSDQVEVIAFSADGQ 2636



 Score = 41.6 bits (96), Expect = 0.49,   Method: Composition-based stats.
 Identities = 50/204 (24%), Positives = 91/204 (44%), Gaps = 12/204 (5%)

Query: 44   LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
            L+A G+S+  +   DV +G+   K  G H   +  +AF+ K   L     +        +
Sbjct: 2343 LIASGSSDTSVRLWDVESGKEISKLEG-HLNWVCSVAFSPKEDLLASGSEDQSIILWHIK 2401

Query: 104  MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYV 161
             G++I +       + S AF C+    A AS +  V+I   + G+E+L+ S+    LQ V
Sbjct: 2402 TGKLITKLLGHSDSVQSVAFSCDGSRLASASGDYLVKIWDTKLGQEILELSEHNDSLQCV 2461

Query: 162  SASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVI 221
              S   +I+ +AG G+  +Q+W   +S + + K       H+         P   DG V+
Sbjct: 2462 IFSPNGQILASAG-GDYIIQLWDA-VSGQDIMKLEG----HTDAVQSIAFYP---DGKVL 2512

Query: 222  LAVAESGVAYSWDLKTVSQDEKTN 245
             + +       WD+ T ++ +K +
Sbjct: 2513 ASGSSDHSIRIWDITTGTEMQKID 2536


>gi|449545659|gb|EMD36630.1| hypothetical protein CERSUDRAFT_115662 [Ceriporiopsis subvermispora
           B]
          Length = 825

 Score = 44.3 bits (103), Expect = 0.088,   Method: Compositional matrix adjust.
 Identities = 48/190 (25%), Positives = 89/190 (46%), Gaps = 26/190 (13%)

Query: 45  LALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEM 104
           LA G+ + + +  + L+G+    +   H  G+  LA++  GR+L     +G+     +E+
Sbjct: 547 LASGSHDTNAIIWNYLSGDQH--AVLCHTAGIFDLAYSADGRTLATGDLDGVVHLWDAEL 604

Query: 105 GEVIREFKASEKPISSSAFLCEEKIFALASS--EVRILSLENGEEVLKFSDDVGPL-QYV 161
           GE  R F ++   I+  AF   +K + LA S   V + + E G+     ++  G + Q+ 
Sbjct: 605 GERHRSFPSAAGAITQLAFSPNDK-YVLAQSLFSVNVWNAETGDLHAALNEHTGIIWQFA 663

Query: 162 SASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRH---SPVAIDCKNSPNGE-- 216
              +GA+++  +   +   +VW       ++  G  L + H   SPV I  + SP+GE  
Sbjct: 664 IDREGARVVTAS--EDTTARVW-------SLETGDTLFINHEHTSPVWI-AQFSPDGERI 713

Query: 217 -----DGTVI 221
                DGT +
Sbjct: 714 STVSSDGTFV 723


>gi|389742109|gb|EIM83296.1| WD40 repeat-containing protein [Stereum hirsutum FP-91666 SS1]
          Length = 328

 Score = 43.9 bits (102), Expect = 0.098,   Method: Compositional matrix adjust.
 Identities = 45/184 (24%), Positives = 86/184 (46%), Gaps = 21/184 (11%)

Query: 72  HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFA 131
           H   ++ + F   G  L    ++ +     ++ GE+++      + IS  A+  + +  A
Sbjct: 36  HTMSISSIKFNPDGNVLASAASDKLIKLWDTDSGEILKTLMGHTEGISDIAWSNDGEYLA 95

Query: 132 LASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDIS- 188
            AS +  +RI S+E G EV         +  V+ +  + ++++ G+ E  ++VW  D++ 
Sbjct: 96  SASDDKTIRIWSMETGTEVKVLYGHTNFVFCVNYNPKSNLLVSGGFDET-VRVW--DVAR 152

Query: 189 SKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWD------LKTVSQDE 242
            K++   PA S   + VA +        DGT+I++ A  G+   WD      LKT+  D+
Sbjct: 153 GKSLKVLPAHSDPVTAVAFN-------HDGTLIVSCAMDGLIRIWDADSGQCLKTLVDDD 205

Query: 243 KTNP 246
             NP
Sbjct: 206 --NP 207


>gi|291386985|ref|XP_002709985.1| PREDICTED: WD repeat domain 43 [Oryctolagus cuniculus]
          Length = 676

 Score = 43.5 bits (101), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 58/262 (22%), Positives = 101/262 (38%), Gaps = 29/262 (11%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVG---------KKRRKE-----RGTLLLA 46
           +W T +  L  E+  P      + +CLA   V          KKR+ E       T LLA
Sbjct: 40  VWETANNRLHQEY-VPSAHLSGTCTCLAWAPVRLQAKESPQRKKRKSEALGTSNQTDLLA 98

Query: 47  LGTSNGDILAVDVLTGEMKWKST-GRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMG 105
           LGT+ G IL    + GE+  K T G H   +  + + +    L+    +    E   +  
Sbjct: 99  LGTTVGSILLYSTVKGELHSKLTSGGHDNRVNCIQWHQDSGCLYSCSDDKHIVEWNVQTC 158

Query: 106 EVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSAS- 164
           +V  ++K     +SS     + K+   A   +++  LE  E    F+    P+  +  + 
Sbjct: 159 KVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLETKEVYRHFTGHATPVSSLKFTT 218

Query: 165 ----------DGAK--IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNS 212
                     DG      ++    ++ L VW+    +K  N   + ++   PV ID   S
Sbjct: 219 IRPPNESQPFDGITGLYFLSGAVHDRLLNVWQVRSENKEKNAVMSFTVTDEPVYIDLTLS 278

Query: 213 PNGEDGTVILAVAESGVAYSWD 234
            N E+   +  V   G  + ++
Sbjct: 279 ENKEEPVKLAVVCRDGQVHLFE 300


>gi|195057884|ref|XP_001995342.1| GH23108 [Drosophila grimshawi]
 gi|193899548|gb|EDV98414.1| GH23108 [Drosophila grimshawi]
          Length = 626

 Score = 43.5 bits (101), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 59/277 (21%), Positives = 123/277 (44%), Gaps = 32/277 (11%)

Query: 1   IWSTNDGSLLAEWKQPDGE-----PVVSYSCLACGFVGKKRRKERGTLL--------LAL 47
           IW T   +L  E+  P+ +       +++  +A     K R+ ++ T +        +AL
Sbjct: 30  IWDTEANTLKQEYT-PNLQLAGPCTALTWLTIAAQRPKKTRKTQQQTQISSDNDREYIAL 88

Query: 48  GTSNGDILAVDVLTGEM-KWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGE 106
           GT  G ++   +  G++ +  S   H G +  + +      L+ VGT+  A         
Sbjct: 89  GTLKGAVVLYSLTEGKIERTLSNESHHGAVTAIVYDAHNGHLYSVGTDCRALIWSVAEAR 148

Query: 107 VIREFKA-SEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQ----YV 161
            I +F    EKP++   FL + +  A+AS +++I  ++N E V   +   G +     + 
Sbjct: 149 CISDFSVGPEKPVNC-VFLAKSRTLAVASRQLKIYDVDNVELVETCTGHSGEINAINSFT 207

Query: 162 SASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVA--IDCKNSPNGEDGT 219
              +  + ++T    E+ +  W+  I  K  NK    ++    VA  + C+     EDG 
Sbjct: 208 CTKNNVEYVMTTAKMERVISFWK--IEKKGRNKASMCTLLMEDVAHSLTCEVR---EDGQ 262

Query: 220 VILA-VAESG--VAYSWDLKTVSQDEKTNPAKITVKL 253
           + +A V  +G    Y  +++++S ++   P K+++++
Sbjct: 263 LRVASVTRNGNIHIYMLNVESLSTEKYIKP-KVSLQI 298


>gi|330805930|ref|XP_003290929.1| hypothetical protein DICPUDRAFT_81631 [Dictyostelium purpureum]
 gi|325078927|gb|EGC32553.1| hypothetical protein DICPUDRAFT_81631 [Dictyostelium purpureum]
          Length = 616

 Score = 43.5 bits (101), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 55/220 (25%), Positives = 87/220 (39%), Gaps = 47/220 (21%)

Query: 29  CGFVGKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGR------HPGGLAGLAFA 82
           CGF+  +  KE+   L    T +     VD+       +S+G+       P G     ++
Sbjct: 188 CGFIVTENEKEKTYKL----TQHQIAKQVDI-------QSSGKVFDLNLDPNGPYNFEYS 236

Query: 83  KKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSL 142
           K GR L + G  G  S +  + G+ + E +     I  + FL  E +FALA  +   +  
Sbjct: 237 KNGRYLLLAGARGHYSIVDWQRGKKLTE-RHLGGAIRDACFLHNETMFALAQKKYTYIYN 295

Query: 143 ENGEEVLKFSD-------DVGPLQY--VSASDGAKIIITAGYGEKHLQVWRCDISSKTVN 193
           ++G E+    D       D  P  Y  VSA++  KII               DIS   V 
Sbjct: 296 QDGVELHWLKDHYDPKFIDFLPYHYLLVSATNKGKIIYE-------------DIS---VG 339

Query: 194 KGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSW 233
           +    S  H P++  C+N  N     V+     +G+   W
Sbjct: 340 QKVVESHYHGPLSAMCQNPQNA----VMNLGFSTGIVQMW 375


>gi|354567841|ref|ZP_08987008.1| serine/threonine protein kinase with WD40 repeats [Fischerella sp.
           JSC-11]
 gi|353541515|gb|EHC10982.1| serine/threonine protein kinase with WD40 repeats [Fischerella sp.
           JSC-11]
          Length = 688

 Score = 43.5 bits (101), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 46/195 (23%), Positives = 87/195 (44%), Gaps = 21/195 (10%)

Query: 72  HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFA 131
           H G + G+ F++ G++L     +          G  IR FK  +  ++S AF  + +  A
Sbjct: 437 HSGWIWGVVFSRDGQTLASASADQTVKLWDLATGREIRTFKGHKAGVTSVAFSPDGQTLA 496

Query: 132 LASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISS 189
            A  +  V++ ++E G+E+       G +  V+ S   +  + +G  +K +++W  + ++
Sbjct: 497 TAGLDKTVKLWNVETGKEICTLVGHSGAIASVAFSPDGQ-TLASGSWDKTIKLWNVN-TA 554

Query: 190 KTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQDEKTNPAKI 249
           K +         HS + I    SP   DGT + + ++      WDL T          K 
Sbjct: 555 KNIRTFTG----HSDLIISVAFSP---DGTSLASGSKDKTIKLWDLAT---------GKA 598

Query: 250 TVKLKKADADQQNSV 264
           T+ LK+   D+ NS+
Sbjct: 599 TLTLKE-HTDKVNSI 612


>gi|66814050|ref|XP_641204.1| WD40 repeat-containing protein [Dictyostelium discoideum AX4]
 gi|60469228|gb|EAL67223.1| WD40 repeat-containing protein [Dictyostelium discoideum AX4]
          Length = 622

 Score = 43.1 bits (100), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 42/152 (27%), Positives = 62/152 (40%), Gaps = 26/152 (17%)

Query: 73  PGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFAL 132
           P G   + ++K GR L + G  G  S +  + G+   E +    PI  + FL  E +FAL
Sbjct: 233 PNGPYKIQYSKNGRYLLLAGAKGHFSMIDWQRGKKFTE-RHLASPIRDAVFLQNESMFAL 291

Query: 133 ASSEVRILSLENGEEVLKFSDDVGP--LQY-------VSASDGAKIIITAGYGEKHLQVW 183
           A  +   +  ++G E+    D   P  L Y       VSA++  KII             
Sbjct: 292 AQKKYTYIYNQDGVEMHWLKDHYDPKFLDYLPYHFLLVSATNKGKIIYE----------- 340

Query: 184 RCDISSKTVNKGPALSMRHSPVAIDCKNSPNG 215
             DIS   V +    S    P++  CKN  N 
Sbjct: 341 --DIS---VGQKVVESFYSGPLSAMCKNPQNA 367


>gi|288916826|ref|ZP_06411199.1| WD-40 repeat protein [Frankia sp. EUN1f]
 gi|288351711|gb|EFC85915.1| WD-40 repeat protein [Frankia sp. EUN1f]
          Length = 682

 Score = 43.1 bits (100), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 49/195 (25%), Positives = 80/195 (41%), Gaps = 40/195 (20%)

Query: 3   STNDGSLLAEWKQPDGEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVLTG 62
           S + G++LA    PDG  VVS                        G ++G +   D+  G
Sbjct: 271 SGHHGAVLAVALTPDGRQVVS------------------------GGADGVVRLWDLADG 306

Query: 63  EMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSA 122
             +   TG H G ++ +A    GR +   G +GM    + E GE        E P +S A
Sbjct: 307 TERAALTG-HTGEVSAVALTPDGRRVVTGGGDGMLRIWELETGEEQAHIPPGEIPATSRA 365

Query: 123 -FLCEEKIFALA-------------SSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAK 168
            FL  +++ ALA             S  +RI  L  G  +   + D  P+  V+ +   +
Sbjct: 366 PFLGPDRMSALAVAPDGRWAASGDLSGALRIWDLAGGRRLGCLTSDGVPVWAVAVTPDGR 425

Query: 169 IIITAGYGEKHLQVW 183
            I+T G+G+  +++W
Sbjct: 426 RIVT-GHGDGTVRLW 439


>gi|126656712|ref|ZP_01727926.1| WD-40 repeat protein [Cyanothece sp. CCY0110]
 gi|126621932|gb|EAZ92640.1| WD-40 repeat protein [Cyanothece sp. CCY0110]
          Length = 1072

 Score = 43.1 bits (100), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 54/198 (27%), Positives = 87/198 (43%), Gaps = 20/198 (10%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
           LLA    +G+I   D L+G+ K+  TG H   +  + F+++   L     +G+      E
Sbjct: 618 LLASAGRDGEIYIWDALSGKHKFTLTG-HDDSIREIVFSQESNLLISRSDDGIIKLWNLE 676

Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSL--ENGEEVLKFSDDVGPLQ 159
            GE+I     +EK I++ A   +  I A    E  V++ SL  ++   +L        ++
Sbjct: 677 KGELIEHPLNNEKGINTFALSSQLSIVATGQKEGKVKLWSLAQKDFPTLLALKSSQNSIE 736

Query: 160 YVS-ASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDG 218
           +V   SD  K+II       H  +W  D++ K +N  P  S    P   D +   N    
Sbjct: 737 FVQFTSDETKLIIVENNNNIH--IW--DMNKKIINSIP--STDQCPFG-DIQVKGN---- 785

Query: 219 TVILAVAE-SGVAYSWDL 235
             ILAVA+ +G    WD 
Sbjct: 786 --ILAVAQINGTIQIWDF 801


>gi|434387970|ref|YP_007098581.1| WD40 repeat-containing protein [Chamaesiphon minutus PCC 6605]
 gi|428018960|gb|AFY95054.1| WD40 repeat-containing protein [Chamaesiphon minutus PCC 6605]
          Length = 1221

 Score = 43.1 bits (100), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 40/143 (27%), Positives = 66/143 (46%), Gaps = 6/143 (4%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
           LLA G  NG I   D+ TGE     TG H G + GLAF+  G+ L    ++       + 
Sbjct: 613 LLATGDVNGQIYLWDIATGEPILCCTG-HAGWVHGLAFSHDGKMLASASSDLTVKLWDTF 671

Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYV 161
            G  +R F    + + + AF  + +  A  SS+  +R+    +G+ +   S     +  V
Sbjct: 672 DGSCLRTFTGHHQRVRAIAFSPDSQSIASGSSDATIRLWDTRSGKCLKILSGHQSYIWSV 731

Query: 162 SAS-DGAKIIITAGYGEKHLQVW 183
           + S DG    I +G  +K +++W
Sbjct: 732 AFSPDGTT--IASGSEDKSVRLW 752


>gi|428320673|ref|YP_007118555.1| serine/threonine protein kinase with WD40 repeats [Oscillatoria
           nigro-viridis PCC 7112]
 gi|428244353|gb|AFZ10139.1| serine/threonine protein kinase with WD40 repeats [Oscillatoria
           nigro-viridis PCC 7112]
          Length = 689

 Score = 43.1 bits (100), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 32/132 (24%), Positives = 65/132 (49%), Gaps = 3/132 (2%)

Query: 45  LALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEM 104
           LA G+ +  +   DV TG  + ++   H GG+  +AF++ G++L     +          
Sbjct: 508 LASGSDDKTVRLWDVKTGS-RLRTLSGHAGGVNAIAFSRDGQTLASGSDDKTVRLWNLNT 566

Query: 105 GEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVS 162
           GEV R       P+++ AF    K  A AS++  +R+ ++++G+    F    G ++ ++
Sbjct: 567 GEVRRIITGHGGPVNAVAFSPNGKTVASASTDNTIRLSNVQDGKRTRTFKGHSGRVRTIA 626

Query: 163 ASDGAKIIITAG 174
            S  ++ +I+ G
Sbjct: 627 FSPDSRTLISGG 638


>gi|148706445|gb|EDL38392.1| mCG5393 [Mus musculus]
          Length = 688

 Score = 43.1 bits (100), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 58/261 (22%), Positives = 104/261 (39%), Gaps = 28/261 (10%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACG---------FVGKKRRKE-RGTL----LLA 46
           +W T +  L  E+  P      + +CLA              KKR+ E  GT     LLA
Sbjct: 51  VWETANNRLHQEY-VPSAHLSGTCTCLAWAPARLQAKESHQRKKRKSEVTGTKDQADLLA 109

Query: 47  LGTSNGDILAVDVLTGEMKWKST-GRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMG 105
           LGT+ G IL    + GE+  K T G H   +  + + +    L+    +    E  ++  
Sbjct: 110 LGTAVGSILLYSTVRGELHSKLTSGGHENRVNCIQWHQDNDCLYSCSDDKYIVEWSTQTC 169

Query: 106 EVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGP---LQYVS 162
           +V  ++K     +SS     + K+   A   +++  LE  E    F+    P   L++ +
Sbjct: 170 KVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLETKEVYRHFTGHATPVSSLRFTT 229

Query: 163 -------ASDGAK--IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSP 213
                   SDG      ++    ++ L VW+    +K  +   + ++   PV +D   S 
Sbjct: 230 IRPNESQPSDGITGLYFLSGAVHDRLLNVWQVRSENKEKSAVMSFTVTDEPVYVDLTLSE 289

Query: 214 NGEDGTVILAVAESGVAYSWD 234
           N E+   +  V   G  + ++
Sbjct: 290 NKEEPVKLAVVCRDGQVHLFE 310


>gi|37359726|dbj|BAC97841.1| mKIAA0007 protein [Mus musculus]
          Length = 665

 Score = 43.1 bits (100), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 58/261 (22%), Positives = 104/261 (39%), Gaps = 28/261 (10%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACG---------FVGKKRRKE-RGTL----LLA 46
           +W T +  L  E+  P      + +CLA              KKR+ E  GT     LLA
Sbjct: 28  VWETANNRLHQEY-VPSAHLSGTCTCLAWAPARLQAKESHQRKKRKSEVTGTKDQADLLA 86

Query: 47  LGTSNGDILAVDVLTGEMKWKST-GRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMG 105
           LGT+ G IL    + GE+  K T G H   +  + + +    L+    +    E  ++  
Sbjct: 87  LGTAVGSILLYSTVRGELHSKLTSGGHENRVNCIQWHQDNDCLYSCSDDKYIVEWSTQTC 146

Query: 106 EVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGP---LQYVS 162
           +V  ++K     +SS     + K+   A   +++  LE  E    F+    P   L++ +
Sbjct: 147 KVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLETKEVYRHFTGHATPVSSLRFTT 206

Query: 163 -------ASDGAK--IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSP 213
                   SDG      ++    ++ L VW+    +K  +   + ++   PV +D   S 
Sbjct: 207 IRPNESQPSDGITGLYFLSGAVHDRLLNVWQVRSENKEKSAVMSFTVTDEPVYVDLTLSE 266

Query: 214 NGEDGTVILAVAESGVAYSWD 234
           N E+   +  V   G  + ++
Sbjct: 267 NKEEPVKLAVVCRDGQVHLFE 287


>gi|196000408|ref|XP_002110072.1| hypothetical protein TRIADDRAFT_20616 [Trichoplax adhaerens]
 gi|190588196|gb|EDV28238.1| hypothetical protein TRIADDRAFT_20616 [Trichoplax adhaerens]
          Length = 1700

 Score = 43.1 bits (100), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 41/100 (41%), Gaps = 23/100 (23%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVV-----SYSCLACGFVGKKRRKERGTLLLALGTSNGDIL 55
           IW  +  + L ++K P   P       S  C ACGF                  +NG + 
Sbjct: 682 IWDIDTYAQLFDFKAPGEVPCAICYHPSQQCFACGF------------------TNGTVR 723

Query: 56  AVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNG 95
              V T  +  + +  HPG + GLAF+  GR ++  GT G
Sbjct: 724 TFHVGTSSLLAEHSKNHPGRIIGLAFSPSGRFMYSAGTLG 763


>gi|75911009|ref|YP_325305.1| WD-40 repeat-containing protein [Anabaena variabilis ATCC 29413]
 gi|75704734|gb|ABA24410.1| WD-40 repeat-containing protein [Anabaena variabilis ATCC 29413]
          Length = 1477

 Score = 43.1 bits (100), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 44/168 (26%), Positives = 86/168 (51%), Gaps = 13/168 (7%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
            +  G  +GDI  +  L  + K K+     G +  L+ +  G+ +   G + +A ++ + 
Sbjct: 817 FIVTGGEDGDI-NLWSLQEKQKIKNWMAEQGAIYSLSISSDGQYIATAGKDRIA-KLWNL 874

Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALA--SSEVRILSLENGEEVLKFSDDVGPLQYV 161
           +G+ + EFK+      S +F  + ++ A A   S+ R+  L +GE++ +F   VG ++ V
Sbjct: 875 VGQKLSEFKSPNGSFRSISFSPDGRLLATAGDDSKARLWKL-SGEQLAEFKGHVGWVRDV 933

Query: 162 SASDGAKIIITAGYGEKHLQVWRCDISSKTV-----NKGPALSMRHSP 204
           S S   K++ TAG  +  +++W   +S K +     ++G  LS+R SP
Sbjct: 934 SFSPDGKLLATAG-DDGKVRLWH--LSGKQLIEFKGHQGGVLSVRFSP 978



 Score = 38.5 bits (88), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 40/133 (30%), Positives = 60/133 (45%), Gaps = 14/133 (10%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
           L AL      I  V+ LTG         H G + G+ F+  GR +   G++G    +   
Sbjct: 702 LFALQKILDKIFEVNQLTG---------HQGWVRGIRFSPNGRLIVTSGSDGTV-RIWDY 751

Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYV 161
           +G+   EFKA    I S  F  + K+ A AS +  VRI +L    E+L        ++ V
Sbjct: 752 LGKQQIEFKAHWGSILSVNFSPDSKLIATASDDGMVRIWNLLG--EMLSEYKHQNVIRDV 809

Query: 162 SASDGAKIIITAG 174
           + S  +K I+T G
Sbjct: 810 AFSPDSKFIVTGG 822


>gi|241896997|ref|NP_783570.1| WD repeat-containing protein 43 [Mus musculus]
 gi|158518600|sp|Q6ZQL4.2|WDR43_MOUSE RecName: Full=WD repeat-containing protein 43
 gi|74214553|dbj|BAE31123.1| unnamed protein product [Mus musculus]
          Length = 677

 Score = 42.7 bits (99), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 58/261 (22%), Positives = 104/261 (39%), Gaps = 28/261 (10%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACG---------FVGKKRRKE-RGTL----LLA 46
           +W T +  L  E+  P      + +CLA              KKR+ E  GT     LLA
Sbjct: 40  VWETANNRLHQEY-VPSAHLSGTCTCLAWAPARLQAKESHQRKKRKSEVTGTKDQADLLA 98

Query: 47  LGTSNGDILAVDVLTGEMKWKST-GRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMG 105
           LGT+ G IL    + GE+  K T G H   +  + + +    L+    +    E  ++  
Sbjct: 99  LGTAVGSILLYSTVRGELHSKLTSGGHENRVNCIQWHQDNDCLYSCSDDKYIVEWSTQTC 158

Query: 106 EVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGP---LQYVS 162
           +V  ++K     +SS     + K+   A   +++  LE  E    F+    P   L++ +
Sbjct: 159 KVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLETKEVYRHFTGHATPVSSLRFTT 218

Query: 163 -------ASDGAK--IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSP 213
                   SDG      ++    ++ L VW+    +K  +   + ++   PV +D   S 
Sbjct: 219 IRPNESQPSDGITGLYFLSGAVHDRLLNVWQVRSENKEKSAVMSFTVTDEPVYVDLTLSE 278

Query: 214 NGEDGTVILAVAESGVAYSWD 234
           N E+   +  V   G  + ++
Sbjct: 279 NKEEPVKLAVVCRDGQVHLFE 299


>gi|195492305|ref|XP_002093934.1| GE20481 [Drosophila yakuba]
 gi|194180035|gb|EDW93646.1| GE20481 [Drosophila yakuba]
          Length = 1229

 Score = 42.7 bits (99), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 23/69 (33%), Positives = 38/69 (55%), Gaps = 2/69 (2%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWKSTGRH-PGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
            +A+GTS+G IL  DV T  ++W    +H  G +A LAF      L    + G+ + + +
Sbjct: 131 FIAIGTSHGHILNFDV-TQTLRWAHQDKHGQGAVASLAFNADSTRLLAGYSRGLVAMLDT 189

Query: 103 EMGEVIREF 111
           + G+V+RE 
Sbjct: 190 QTGDVLREL 198


>gi|194867662|ref|XP_001972124.1| GG14056 [Drosophila erecta]
 gi|190653907|gb|EDV51150.1| GG14056 [Drosophila erecta]
          Length = 1229

 Score = 42.7 bits (99), Expect = 0.25,   Method: Compositional matrix adjust.
 Identities = 23/69 (33%), Positives = 38/69 (55%), Gaps = 2/69 (2%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWKSTGRH-PGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
            +A+GTS+G IL  DV T  ++W    +H  G +A LAF      L    + G+ + + +
Sbjct: 131 FIAIGTSHGHILNFDV-TQTLRWAHQDKHGQGAVASLAFNADSTRLLAGYSRGLVAMLDT 189

Query: 103 EMGEVIREF 111
           + G+V+RE 
Sbjct: 190 QTGDVLREL 198


>gi|320593547|gb|EFX05956.1| mRNA splicing factor [Grosmannia clavigera kw1407]
          Length = 542

 Score = 42.7 bits (99), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 40/185 (21%), Positives = 76/185 (41%), Gaps = 18/185 (9%)

Query: 9   LLAEWKQPDGEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKS 68
           L+  WK P G+PV +               E G +LL+ G+++  +   DV       ++
Sbjct: 238 LVHVWKSPTGKPVTALRLFP----------ESGHVLLS-GSADNTVRVWDVYHERELLRT 286

Query: 69  TGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKP--ISSSAFLCE 126
              H   L  L+F++ GR       +       +E G+ IR F+  + P  ++ +  +  
Sbjct: 287 YAGHSRALTDLSFSRDGRQFISGSHDRFVKLWDTETGQCIRRFRTGKTPHCLAFNPSVEG 346

Query: 127 EKIFALASSEVRILSLE----NGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQV 182
              F    S  +IL  +    + E V  +   +  +  ++  D A+  +T    +K L+V
Sbjct: 347 AHEFLTGMSNNKILQWDSRAGDNEPVQDYDHHLAAINTITFVDEARRFMTTS-DDKSLRV 405

Query: 183 WRCDI 187
           W  +I
Sbjct: 406 WDYNI 410


>gi|51259783|gb|AAH79913.1| Wrd43 protein, partial [Mus musculus]
          Length = 592

 Score = 42.7 bits (99), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 44/204 (21%), Positives = 84/204 (41%), Gaps = 13/204 (6%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWKST-GRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
           LLALGT+ G IL    + GE+  K T G H   +  + + +    L+    +    E  +
Sbjct: 11  LLALGTAVGSILLYSTVRGELHSKLTSGGHENRVNCIQWHQDNDCLYSCSDDKYIVEWST 70

Query: 103 EMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGP---LQ 159
           +  +V  ++K     +SS     + K+   A   +++  LE  E    F+    P   L+
Sbjct: 71  QTCKVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLETKEVYRHFTGHATPVSSLR 130

Query: 160 YVS-------ASDGAK--IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCK 210
           + +        SDG      ++    ++ L VW+    +K  +   + ++   PV +D  
Sbjct: 131 FTTIRPNESQPSDGITGLYFLSGAVHDRLLNVWQVRSENKEKSAVMSFTVTDEPVYVDLT 190

Query: 211 NSPNGEDGTVILAVAESGVAYSWD 234
            S N E+   +  V   G  + ++
Sbjct: 191 LSENKEEPVKLAVVCRDGQVHLFE 214


>gi|425460715|ref|ZP_18840196.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9808]
 gi|389826568|emb|CCI22821.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9808]
          Length = 1108

 Score = 42.4 bits (98), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 51/200 (25%), Positives = 83/200 (41%), Gaps = 26/200 (13%)

Query: 44   LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
            ++A+   +G I  +D    +++  +T      +  +AF      + + G NG   ++ S+
Sbjct: 842  IIAIANKDGQITLLDSQGKKIREFTTKMR--SIYSIAFHPDSNQIAITGRNGKV-QIWSQ 898

Query: 104  MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFS---DDVGPL 158
             G ++REF AS+ PI S AF  E       +SE  V+   L N    L  S   DD    
Sbjct: 899  KGTMLREFTASQVPIYSLAFNGEGTAIITGTSEGKVQYWYLSNHRPQLINSWTVDDSIIY 958

Query: 159  QYVSASDGAKIIITAGYGEKHLQVWRCD---ISSKTVNKGPALSMRHSPVAIDCKNSPNG 215
              V + D  KI  TA  G+  +++W      +     +  P   +  SP           
Sbjct: 959  DLVFSPDHQKIA-TATRGK--IKIWDLQGNLLKEIKTDSFPVYGVSFSP----------- 1004

Query: 216  EDGTVILAVAESGVAYSWDL 235
             DG  I A++  G A  WD+
Sbjct: 1005 -DGEKIAAISRDGTARRWDI 1023


>gi|393247627|gb|EJD55134.1| WD40 repeat-like protein [Auricularia delicata TFB-10046 SS5]
          Length = 353

 Score = 42.4 bits (98), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 46/184 (25%), Positives = 83/184 (45%), Gaps = 21/184 (11%)

Query: 72  HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFA 131
           H   ++ + F+  G  L     +G+     +  GE++R FK   K IS  A+  +    A
Sbjct: 59  HTMSISAIKFSPDGSFLASSAGDGLVKLWDAYTGEILRTFKGHVKGISDIAWARDSLYLA 118

Query: 132 LASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISS 189
            AS +  VRI +++ G  V   +     +  V+ +  +  ++ +G  ++ +++W  D++ 
Sbjct: 119 SASDDKTVRIWNVQLGSTVKILTGHTSQVMCVNFNPQSN-LLASGSVDETVRIW--DVAR 175

Query: 190 KTVNKGPALSMRHSPV-AIDCKNSPNGEDGTVILAVAESGVAYSWD------LKTVSQDE 242
               +   LS    PV A+D        DGT+I++ A  G+   WD      LKT+  D 
Sbjct: 176 GKCMR--TLSAHSDPVTAVDFN-----RDGTMIVSCAYDGLIRIWDTASGQCLKTIVDD- 227

Query: 243 KTNP 246
             NP
Sbjct: 228 -ANP 230


>gi|444914132|ref|ZP_21234277.1| hypothetical protein D187_06447 [Cystobacter fuscus DSM 2262]
 gi|444715066|gb|ELW55939.1| hypothetical protein D187_06447 [Cystobacter fuscus DSM 2262]
          Length = 2276

 Score = 42.4 bits (98), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 49/190 (25%), Positives = 86/190 (45%), Gaps = 11/190 (5%)

Query: 45   LALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEM 104
            LA    +G +   D  +G  K +S   H G +  ++++K GR L   G +G      +  
Sbjct: 1342 LASAGEDGTVRLWDAESGR-KLRSLSGHKGWVRSVSWSKDGRRLASAGDDGSVRLWDTAS 1400

Query: 105  GEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVS 162
            G ++R     +  + S ++  + +  A A  +  VR+ + E+G E+       G +  VS
Sbjct: 1401 GRMLRSLSGEKGRVWSVSWSADGRRLASAGDDGTVRLWNAESGHELHSLPGHKGMIFSVS 1460

Query: 163  ASDGAKIIITAGYGEKHLQVWRC----DISSKTVNKGPALSMRHSPVAIDCKNSPNGEDG 218
             S   ++  + G G  HL  W      ++ S + +KG   S+  S  A   + + +G DG
Sbjct: 1461 WSADGRLASSGGDGTVHL--WDAESGHELHSLSGHKGWVFSVSWS--ADGRRLASSGRDG 1516

Query: 219  TVILAVAESG 228
            TV L  A+SG
Sbjct: 1517 TVRLWDAQSG 1526


>gi|328790602|ref|XP_392867.4| PREDICTED: WD repeat-containing protein 43-like [Apis mellifera]
          Length = 582

 Score = 42.4 bits (98), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 59/270 (21%), Positives = 112/270 (41%), Gaps = 48/270 (17%)

Query: 34  KKRRK-------ERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAG------LA 80
           K+RRK       ++G  ++A+G++NG I   D  T  +  +  G H G +        + 
Sbjct: 71  KRRRKSISEESIQKG--IIAMGSTNGKITLYDTTTSSVTAQLDG-HSGSVTAATWSENVG 127

Query: 81  FAKKGRSLHVVG----TNGMASEMKSEMGEVIREFKASEKPISSSA--FLCEEKIFALAS 134
           F       H++      NG+  + KS  G      K +   +S+     L  E++     
Sbjct: 128 FITASDDHHIIQWNLQENGVKCKWKSGKG------KTTSLAVSTDGNNLLSGERV----- 176

Query: 135 SEVRILSLENGEEVLKFS---DDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKT 191
             V+   L   + +  F+   + V  L  +  + G   +I+  YG+ +L VW  D     
Sbjct: 177 --VKWWDLNTKQLIRTFTGHANQVTCLHTIKMTSGNNYVISGAYGDGYLSVWALDEQRNE 234

Query: 192 VNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQDEKTNPAKITV 251
                +L+++  P+++    S N +   +++ V  SG A    L     + ++ P K ++
Sbjct: 235 RTSVASLALQDDPISVSANVSANSQ--VMVVVVTRSGQA---QLFQYQPNGRSKPLKPSL 289

Query: 252 KLKKA-DADQQNSVNVKKSRTSIIAARLND 280
            +  A D  Q++ V     + SI+ A L D
Sbjct: 290 NIAVASDISQKDEVQ----QISILNAMLTD 315


>gi|348574548|ref|XP_003473052.1| PREDICTED: LOW QUALITY PROTEIN: WD repeat-containing protein
           43-like [Cavia porcellus]
          Length = 675

 Score = 42.4 bits (98), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 45/205 (21%), Positives = 83/205 (40%), Gaps = 14/205 (6%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWK-STGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
           LLALGT+ G IL    + GE+  K  TG H   +  + + +    L+    +    E  +
Sbjct: 96  LLALGTAVGSILLYSTVKGELHSKLVTGGHDNRVNCIQWHQDSDCLYSCSDDKHIVEWNT 155

Query: 103 EMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGP---LQ 159
              +V  ++K     ++S     + K+   A   +++  LE  E    F+    P   L+
Sbjct: 156 RTCKVKCKWKGDNSSVTSLCISPDGKMLLSAGRTIKLWVLETKEVYRHFTGHATPVSSLK 215

Query: 160 YVSA--------SDGAK--IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDC 209
           + +         SDG      ++    ++ L VW+    +K  N   + ++   PV ID 
Sbjct: 216 FTTVRPPNESQTSDGITGLYFLSGAVHDRLLNVWQVRSENKEKNAVMSFTVTDEPVYIDL 275

Query: 210 KNSPNGEDGTVILAVAESGVAYSWD 234
             S N E+   +  V   G  + ++
Sbjct: 276 TLSENKEEPVKLAVVCRDGQVHLFE 300


>gi|299470075|emb|CBN79252.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 570

 Score = 42.4 bits (98), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 36/143 (25%), Positives = 61/143 (42%), Gaps = 5/143 (3%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
           ++  G  +G I + D  TG + W     H GG+  L  +   R L   G  G     +  
Sbjct: 339 MMLTGWRDGQIRSHDADTGHLLWSIDNAHAGGVTSLVVSHNERFLVTGGMEGELRVWELR 398

Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSEVRIL---SLENGEEVLKFSDDVGPLQY 160
             +++   K   + ++S A  C++ + AL+ S  R L    L +  +V   S  +G +  
Sbjct: 399 SRDLVSHLKQHGQRVTSLALFCDD-VHALSVSRDRSLICWDLRSESQVSHHSQRMGGVNA 457

Query: 161 VSASDGAKIIITAGYGEKHLQVW 183
           V+ S     +IT G  EK +  W
Sbjct: 458 VALSQDETTVITMGQ-EKRVTYW 479


>gi|390562646|ref|ZP_10244834.1| hypothetical protein NITHO_3120008 [Nitrolancetus hollandicus Lb]
 gi|390172774|emb|CCF84146.1| hypothetical protein NITHO_3120008 [Nitrolancetus hollandicus Lb]
          Length = 320

 Score = 42.4 bits (98), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 35/132 (26%), Positives = 59/132 (44%), Gaps = 3/132 (2%)

Query: 48  GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
           G+ +G I   DV TG    +  G HP  +  LA++  GR L   G +        E GE 
Sbjct: 95  GSGDGLIRIWDVETGGEVARLEG-HPDWVTALAYSPDGRFLLSGGRDATVRLWDVEQGEE 153

Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEV--RILSLENGEEVLKFSDDVGPLQYVSASD 165
           +  FK   + I+S  F  +       S+++  R+  LE  +++  F   + P++  + S 
Sbjct: 154 VHLFKGHTRLITSIGFSPDGTRIVSTSADLGTRLWDLETRKQLRHFMGHLAPVRAAAFSP 213

Query: 166 GAKIIITAGYGE 177
             + I+T G  E
Sbjct: 214 DGQRIVTGGQDE 225


>gi|195588248|ref|XP_002083870.1| GD13124 [Drosophila simulans]
 gi|194195879|gb|EDX09455.1| GD13124 [Drosophila simulans]
          Length = 1229

 Score = 42.4 bits (98), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 23/69 (33%), Positives = 37/69 (53%), Gaps = 2/69 (2%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWKSTGRH-PGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
            +A+GTS+G IL  DV T  ++W    +H  G +A LAF      L    + G+ + + +
Sbjct: 131 FIAIGTSHGHILNFDV-TQTLRWAHQDKHGQGAVASLAFNADSTRLLTGFSRGLVAMLDT 189

Query: 103 EMGEVIREF 111
             G+V+RE 
Sbjct: 190 HTGDVLREL 198


>gi|302695713|ref|XP_003037535.1| hypothetical protein SCHCODRAFT_47625 [Schizophyllum commune H4-8]
 gi|300111232|gb|EFJ02633.1| hypothetical protein SCHCODRAFT_47625 [Schizophyllum commune H4-8]
          Length = 361

 Score = 42.4 bits (98), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 45/192 (23%), Positives = 82/192 (42%), Gaps = 20/192 (10%)

Query: 72  HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFA 131
           H   ++ + F+  G+ L     +        E+GE++   K  E+ IS  A+  + +  A
Sbjct: 51  HARSISAVKFSPDGKLLASCAADKTIKIWDPEIGEIVHTLKGHEEGISDIAWSNDGQYIA 110

Query: 132 LASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVW------ 183
            AS +  VRI S+ + +EV +       +  V+ +    +I++ G+ E  +++W      
Sbjct: 111 SASDDKTVRIWSVADAKEVKRLIGHTNFVFCVAYNPAGNLIVSGGFDET-IRIWDASKGE 169

Query: 184 --RCDISSKTVNKGPALSMRHSPVAIDCKNSPN-GEDGTVILAVAESGVAYSWD------ 234
             R     +  +      +R  P   D   S     DGT+I++ A  G+   WD      
Sbjct: 170 YGRLCTCREADDLYAGKFVRFMPAHSDPVTSVGFSHDGTMIVSCAMDGLIRIWDTESGQC 229

Query: 235 LKTVSQDEKTNP 246
           LKT+  D+  NP
Sbjct: 230 LKTLVDDD--NP 239


>gi|389750241|gb|EIM91412.1| WD40 repeat-like protein [Stereum hirsutum FP-91666 SS1]
          Length = 540

 Score = 42.4 bits (98), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 39/154 (25%), Positives = 69/154 (44%), Gaps = 17/154 (11%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
            LA   SNG+I   D+ +GE+KW STG H   +  L F+  G  L V  +      + S 
Sbjct: 20  FLAATFSNGNIHLWDIQSGEIKWTSTG-HSMTVKSLRFSPNGTML-VSSSRDCTVRVWSV 77

Query: 104 MG--EVIREFKASEKPISSSAFLCEEKIFALA-----SSEVRILSLENGEEVLK------ 150
           +   E I+  K   + +SS+ FL  E +  ++      S++ I  L + E   K      
Sbjct: 78  LDGTETIQPCKHHRRYVSSATFLSNESVIYISWGHDKGSDICIWDLYSDEVTKKVKNVLD 137

Query: 151 -FSDDVGPLQYVSASDGAKIIITAGYGEKHLQVW 183
            +     P+  +S +     +++A Y +  +++W
Sbjct: 138 LYIGHTKPIYSISFAPDETRVVSASYDDT-IRIW 170


>gi|218506025|gb|ACK77654.1| RE12102p [Drosophila melanogaster]
          Length = 1096

 Score = 42.4 bits (98), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 23/69 (33%), Positives = 37/69 (53%), Gaps = 2/69 (2%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWKSTGRH-PGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
            +A+GTS+G IL  DV T  ++W    +H  G +A LAF      L    + G+ + + +
Sbjct: 131 FIAIGTSHGHILNFDV-TQTLRWAHQDKHGQGAVASLAFNADSTRLLAGFSRGLVAMLDT 189

Query: 103 EMGEVIREF 111
             G+V+RE 
Sbjct: 190 HTGDVLREL 198


>gi|75911051|ref|YP_325347.1| WD-40 repeat-containing serine/threonine protein kinase [Anabaena
           variabilis ATCC 29413]
 gi|75704776|gb|ABA24452.1| serine/threonine protein kinase with WD40 repeats [Anabaena
           variabilis ATCC 29413]
          Length = 676

 Score = 42.4 bits (98), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 47/211 (22%), Positives = 90/211 (42%), Gaps = 21/211 (9%)

Query: 72  HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFA 131
           H G +  +AF+  G++L   G +          G  IR  K   + ++S AF  + K  A
Sbjct: 432 HSGWVWAIAFSPDGKTLVSAGADKTIKLWNLATGTEIRTLKGHSQGVASVAFSPDGKTLA 491

Query: 132 LASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISS 189
             S +  +++ +L  G+E+   S+    +  V+ S   K  + +G  +K +++W      
Sbjct: 492 SGSLDKTIKLWNLATGKEIRTLSEHSNVVANVAFSPDGK-TLASGSWDKTIKLWNL---- 546

Query: 190 KTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQDEKTNPAKI 249
            T NK       HS + +    +P   DG  + + ++      W+L           A  
Sbjct: 547 -TTNKVFRTLEGHSDLVMSVVFNP---DGKTLASASKDKTIRLWNL----------AAGK 592

Query: 250 TVKLKKADADQQNSVNVKKSRTSIIAARLND 280
           T++  K  +D+ NSV      ++++A+  ND
Sbjct: 593 TIRTLKGHSDKVNSVVYVPRNSTVLASGSND 623


>gi|24659383|ref|NP_648048.1| CG10144 [Drosophila melanogaster]
 gi|7295341|gb|AAF50660.1| CG10144 [Drosophila melanogaster]
          Length = 1229

 Score = 42.0 bits (97), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 23/69 (33%), Positives = 37/69 (53%), Gaps = 2/69 (2%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWKSTGRH-PGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
            +A+GTS+G IL  DV T  ++W    +H  G +A LAF      L    + G+ + + +
Sbjct: 131 FIAIGTSHGHILNFDV-TQTLRWAHQDKHGQGAVASLAFNADSTRLLAGFSRGLVAMLDT 189

Query: 103 EMGEVIREF 111
             G+V+RE 
Sbjct: 190 HTGDVLREL 198


>gi|145476893|ref|XP_001424469.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124391533|emb|CAK57071.1| unnamed protein product [Paramecium tetraurelia]
          Length = 784

 Score = 42.0 bits (97), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 67/278 (24%), Positives = 112/278 (40%), Gaps = 48/278 (17%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
           LLA G+ +  I   D+ TGE K++  G H G ++ ++F+  G +L     +         
Sbjct: 169 LLASGSRDKSIRLWDIKTGEEKYRLEG-HNGYVSTISFSFDGITLASGSGDKTIRLWDII 227

Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSE----VRILSLENGEEVLKFSDDVGPLQ 159
            G+ I+  +     +SS  F     IF LAS      +R+ + + G++  +F      + 
Sbjct: 228 TGKEIQRLEGHNGYVSSVCF--SPDIFTLASCGEDKCIRLWNAKTGQQASQFFGHTHQVY 285

Query: 160 YVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALS--MRHSPVAIDCKNSPNGED 217
            +  S     ++ +G  +K +++W        V +G  +S    HS   I    SP   D
Sbjct: 286 SICFSPNGN-LLASGSDDKSIRLW-------DVKEGQQISKLQGHSGGVISVCFSP---D 334

Query: 218 GTVILAVAESGVAYSWDLKTVSQDEK-----------------TNPA--------KITVK 252
           GT IL+ +       WD+K+  Q  K                 TN A        +I   
Sbjct: 335 GTTILSGSADQSIRLWDVKSGQQQSKLIGHKCGVYSVCFSQKGTNVASGSYDQSIRIWET 394

Query: 253 LKKADADQQNSVNVKKSRTSIIAARLND---LEADGQK 287
           +K+ D  Q NS+ V +S        +N     +AD QK
Sbjct: 395 IKRFDKKQINSLKVSRSEKKTNFTDINQNIHFKADQQK 432


>gi|442318640|ref|YP_007358661.1| putative lipoprotein [Myxococcus stipitatus DSM 14675]
 gi|441486282|gb|AGC42977.1| putative lipoprotein [Myxococcus stipitatus DSM 14675]
          Length = 366

 Score = 42.0 bits (97), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 32/108 (29%), Positives = 50/108 (46%), Gaps = 12/108 (11%)

Query: 48  GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
           GT  GD++ + + TG  +W+ TG  P  L  L  A  G +++VV  +G    +  + G+V
Sbjct: 94  GTVAGDLVRLALSTGTEQWRLTGEPP-VLTPLVMAPGGTAVYVVAPDGAVRALAVDTGKV 152

Query: 108 IREFKASEKP----------ISSSAFLCEEKIFALASSEVRILSLENG 145
            R   A  KP          + S        + AL  + +R LSLE+G
Sbjct: 153 -RWRVAPPKPDEAHLDTRRGLPSPVLAGGRLVVALGDAGLRALSLEDG 199


>gi|78779428|ref|YP_397540.1| WD-40 repeat-containing protein [Prochlorococcus marinus str. MIT
           9312]
 gi|78712927|gb|ABB50104.1| WD-40 repeat-containing protein [Prochlorococcus marinus str. MIT
           9312]
          Length = 357

 Score = 42.0 bits (97), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 48/188 (25%), Positives = 78/188 (41%), Gaps = 17/188 (9%)

Query: 22  VSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAF 81
           VS   +ACG+  K ++         +G   G I A +  TG++ WK    H GGL  +A 
Sbjct: 21  VSDYAIACGWALKGKQ-------FIVGDVAGGIFAFEGDTGKIIWKKENTHSGGLLAMAI 73

Query: 82  AKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILS 141
             +G      G +G         G+VI+     +  +    +  +    A+ASS+   + 
Sbjct: 74  HPEGEIFVTSGQDGNVQICNCHEGKVIKTLDLGKGWVEHLKWSNDGLFLAIASSKKVYVF 133

Query: 142 LENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVN-----KGP 196
            E GEE     D    +  ++ S+  + + TA YG    +V   DI +   N     +G 
Sbjct: 134 NEIGEEKWISEDHPSTVSAITWSNKNE-LATACYG----RVTFFDIVNNKTNQKLEWQGS 188

Query: 197 ALSMRHSP 204
            +SM  SP
Sbjct: 189 LVSMELSP 196


>gi|240849657|gb|ACS54286.1| AT14809p [Drosophila melanogaster]
          Length = 1229

 Score = 42.0 bits (97), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 23/69 (33%), Positives = 37/69 (53%), Gaps = 2/69 (2%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWKSTGRH-PGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
            +A+GTS+G IL  DV T  ++W    +H  G +A LAF      L    + G+ + + +
Sbjct: 131 FIAIGTSHGHILNFDV-TQTLRWAHQDKHGQGAVASLAFNADSTRLLAGFSRGLVAMLDT 189

Query: 103 EMGEVIREF 111
             G+V+RE 
Sbjct: 190 HTGDVLREL 198


>gi|195337941|ref|XP_002035584.1| GM13838 [Drosophila sechellia]
 gi|194128677|gb|EDW50720.1| GM13838 [Drosophila sechellia]
          Length = 1229

 Score = 42.0 bits (97), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 23/69 (33%), Positives = 37/69 (53%), Gaps = 2/69 (2%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWKSTGRH-PGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
            +A+GTS+G IL  DV T  ++W    +H  G +A LAF      L    + G+ + + +
Sbjct: 131 FIAIGTSHGHILNFDV-TQTLRWAHQDKHGQGAVASLAFNADSTRLLAGFSRGLVAMLDT 189

Query: 103 EMGEVIREF 111
             G+V+RE 
Sbjct: 190 HTGDVLREL 198


>gi|440696814|ref|ZP_20879262.1| trypsin [Streptomyces turgidiscabies Car8]
 gi|440280845|gb|ELP68526.1| trypsin [Streptomyces turgidiscabies Car8]
          Length = 1389

 Score = 42.0 bits (97), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 36/147 (24%), Positives = 63/147 (42%), Gaps = 3/147 (2%)

Query: 45   LALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEM 104
            LA G+    I   D+ T +++   TG H  G+  L F+  GR+L   G +          
Sbjct: 1159 LATGSDTKYIRLWDLATRKIRRTLTGHH-DGVNALEFSPDGRTLATAGGDSRVLIWDLAT 1217

Query: 105  GEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVS 162
            G+V       + P+++ AF  + ++ A AS +   R+     G      +  VG L  + 
Sbjct: 1218 GKVRVTLTGHDAPVNALAFSPDGRVLATASDDGTARVWDAVTGRARSILTKHVGWLSALD 1277

Query: 163  ASDGAKIIITAGYGEKHLQVWRCDISS 189
             S   + + TAG  +  +++W  D  S
Sbjct: 1278 FSPDGRTLATAGGYDGTVRLWDADTGS 1304


>gi|390562645|ref|ZP_10244833.1| hypothetical protein NITHO_3120007 [Nitrolancetus hollandicus Lb]
 gi|390172773|emb|CCF84145.1| hypothetical protein NITHO_3120007 [Nitrolancetus hollandicus Lb]
          Length = 305

 Score = 42.0 bits (97), Expect = 0.42,   Method: Compositional matrix adjust.
 Identities = 40/174 (22%), Positives = 75/174 (43%), Gaps = 16/174 (9%)

Query: 72  HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEK-IF 130
           H  G+   AF+  GR +   G +        E G+ IR+F      ++S  F  + + +F
Sbjct: 14  HTDGVEQAAFSPDGRQIASAGWDATVRLWDVETGKEIRQFIGHPAGVNSVVFSRDGRSLF 73

Query: 131 ALASSE-------VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVW 183
           +  S+E       +R   +E GEEVL+       +  +  S   K+ I+ G  +  +++W
Sbjct: 74  SCGSAEGGTSDRTIRKWDIETGEEVLRLRGHTAGVNAIDCSRDGKLAISGGM-DGSVRLW 132

Query: 184 RCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKT 237
             +   +T+         H+ V      SP+ +    I   AE+ V + WD+++
Sbjct: 133 DLERGVETLQ----FLGSHNFVTTSVAFSPDAK--RAISGSAEA-VVWLWDVES 179



 Score = 40.4 bits (93), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 50/195 (25%), Positives = 87/195 (44%), Gaps = 17/195 (8%)

Query: 48  GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
           GTS+  I   D+ TGE   +  G H  G+  +  ++ G+     G +G       E G  
Sbjct: 81  GTSDRTIRKWDIETGEEVLRLRG-HTAGVNAIDCSRDGKLAISGGMDGSVRLWDLERGVE 139

Query: 108 IREFKASEKPISSS-AFLCEEK--IFALASSEVRILSLENGEEVLKFSDDVGPLQYVSAS 164
             +F  S   +++S AF  + K  I   A + V +  +E+GEE+ +F   VG +  V+ +
Sbjct: 140 TLQFLGSHNFVTTSVAFSPDAKRAISGSAEAVVWLWDVESGEEIRRFRGHVGVITSVAYA 199

Query: 165 DGAKIIITAGYGEKHLQVWRCDISSK----TVNKGPALSMRHSPV---AIDCKNSPNGED 217
              +  ++ G  ++ ++ W  +   +    T + G   S+ +SPV   A+ C       D
Sbjct: 200 PDGRTALSGGL-DRSVRQWDIETGKERRRLTGHIGDVTSVAYSPVDPRALSCAG-----D 253

Query: 218 GTVILAVAESGVAYS 232
           GT+ L   E G   S
Sbjct: 254 GTIRLWNLEDGTEIS 268


>gi|201027430|ref|NP_001032880.2| WD repeat domain 43 [Rattus norvegicus]
 gi|149050703|gb|EDM02876.1| rCG61879 [Rattus norvegicus]
          Length = 674

 Score = 42.0 bits (97), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 43/204 (21%), Positives = 82/204 (40%), Gaps = 13/204 (6%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWK-STGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
           LLALGT+ G IL    + GE+  K  +G H   +  + + +    L+    +    E  +
Sbjct: 96  LLALGTAVGSILLYSTVKGELHSKLISGGHEKRVNCIQWHQDNDCLYSCSDDKYIVEWST 155

Query: 103 EMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVS 162
           +M  V  ++K     +SS     + K+   A   +++  LE  E    F+    P+  + 
Sbjct: 156 QMCRVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLETKEVYRHFTGHATPVSSLR 215

Query: 163 AS----------DGAK--IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCK 210
            +          DG      ++    ++ L VW+    +K  +   + ++   PV ID  
Sbjct: 216 FTTIRPNESQPFDGVTGLYFLSGAVHDRLLNVWQVRSENKEKSAVMSFTVTDEPVYIDLT 275

Query: 211 NSPNGEDGTVILAVAESGVAYSWD 234
            S N E+   +  V   G  + ++
Sbjct: 276 LSENKEEPVKLAVVCRDGQVHLFE 299


>gi|328868055|gb|EGG16435.1| WD40 repeat-containing protein [Dictyostelium fasciculatum]
          Length = 578

 Score = 42.0 bits (97), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 27/109 (24%), Positives = 53/109 (48%), Gaps = 22/109 (20%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYS---------CLACGFVGKKRRKERGTLLLALGTSN 51
           +W+T + S          EP+ +YS         CL+     K    ++G  ++A+GT+ 
Sbjct: 31  VWNTTNSS----------EPLRTYSDDRESSKLTCLSIFTSTKASSIQKG--VIAIGTNY 78

Query: 52  GDILAVDVLTGEMKWKSTGRHPG-GLAGLAFAKKGRSLHVVGTNGMASE 99
           GD+L  ++L+G+M+ + TG H G  + G+ F    + +     +G  ++
Sbjct: 79  GDVLIYNILSGKMEKRLTGAHAGCKVVGVVFGTTEKDIFSCDIDGRVAQ 127


>gi|393222446|gb|EJD07930.1| WD40 repeat-like protein [Fomitiporia mediterranea MF3/22]
          Length = 1569

 Score = 41.6 bits (96), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 43/147 (29%), Positives = 68/147 (46%), Gaps = 9/147 (6%)

Query: 45   LALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEM 104
            +A GT+ G I   D  TG++K      H G    + F+  G+ L   G +        E 
Sbjct: 910  IACGTTTGTISIRDGQTGDVKVPPIRAHTGQTTSVVFSPDGKLLASSGQDQTIRIWDVES 969

Query: 105  GEVIREFKASEKPISSSAFLCE-EKIFALASSE-VRILSLEN----GEEVLKFS--DDVG 156
            G    +    +  ISS AFL + ++I A A+   +RI  +EN    GE +  +S  D +G
Sbjct: 970  GMPDGDVMEIDTAISSLAFLPDGKRIIAGANDRTIRIWDVENRKQVGEPIQGYSVGDHIG 1029

Query: 157  PLQYVSASDGAKIIITAGYGEKHLQVW 183
             ++ V+ S   +   +A  G K LQ+W
Sbjct: 1030 TIRDVAVSPDGRYFASASDG-KVLQIW 1055


>gi|299755778|ref|XP_001828881.2| WD40 repeat-containing protein [Coprinopsis cinerea okayama7#130]
 gi|298411378|gb|EAU92888.2| WD40 repeat-containing protein [Coprinopsis cinerea okayama7#130]
          Length = 325

 Score = 41.6 bits (96), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 61/238 (25%), Positives = 101/238 (42%), Gaps = 40/238 (16%)

Query: 85  GRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSL 142
           G  L   G +       +E G++I  F+  ++ I+  A+  + +  A AS +  V I SL
Sbjct: 47  GSMLASAGPDKTIKLWDTESGDIIHTFRGHKEGINDLAWAPDGEFIASASDDKTVIIWSL 106

Query: 143 ENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDIS-SKTVNKGPALSMR 201
           E  E V   S     +  ++ +  + ++++ GY E  + +W  D++  K +   PA S  
Sbjct: 107 ELREPVKTLSRHTSVVFCINYNPNSNLLVSGGYDETVI-IW--DVARGKALKTLPAHSDP 163

Query: 202 HSPVAIDCKNSPNGEDGTVILAVAESGVAYSWD------LKTVSQDEKTNPAKITVKL-- 253
            + V  +       +DGT+I++ A  G+   WD      LKT+  D+  NP    V    
Sbjct: 164 VTAVGFN-------DDGTLIISCAMDGLIRLWDAESGQCLKTLVDDD--NPICSHVCFSP 214

Query: 254 --KKADADQQNSV----NVKKS-----------RTSIIAARLNDLEADGQKTITGLHD 294
             K A A  Q+S     N++ S           RT  I A      + GQ  +TG  D
Sbjct: 215 NSKFALASTQDSTIRLWNIQSSRCVKTYTGHVNRTYCIPACFATKSSKGQYIVTGSED 272


>gi|194752173|ref|XP_001958397.1| GF10899 [Drosophila ananassae]
 gi|190625679|gb|EDV41203.1| GF10899 [Drosophila ananassae]
          Length = 1229

 Score = 41.6 bits (96), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 23/68 (33%), Positives = 37/68 (54%), Gaps = 2/68 (2%)

Query: 45  LALGTSNGDILAVDVLTGEMKWKSTGRH-PGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
           +A+GTS+G IL  DV T  ++W    +H  G +A LAF      L      G+ + + ++
Sbjct: 132 IAVGTSHGHILNFDV-TQTLRWAHQDKHGQGAVASLAFNADSTRLLAGFARGLVTMLDTQ 190

Query: 104 MGEVIREF 111
            G+V+RE 
Sbjct: 191 TGDVLREL 198


>gi|126696474|ref|YP_001091360.1| WD-40 repeat-containing G-protein [Prochlorococcus marinus str. MIT
           9301]
 gi|126543517|gb|ABO17759.1| G-protein beta WD-40 repeat-containing protein [Prochlorococcus
           marinus str. MIT 9301]
          Length = 357

 Score = 41.6 bits (96), Expect = 0.50,   Method: Compositional matrix adjust.
 Identities = 44/184 (23%), Positives = 76/184 (41%), Gaps = 9/184 (4%)

Query: 22  VSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAF 81
           VS   +ACG+  K ++         +G   G I A +  TG++ WK    H GGL  ++ 
Sbjct: 21  VSDYAIACGWALKGKQ-------FIVGDVAGGIFAFEGDTGKIIWKKENTHSGGLLAMSI 73

Query: 82  AKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILS 141
             +G      G +G         G+VI+     +  +    +  +    A+ASS+   + 
Sbjct: 74  HPEGEIFATSGQDGNVQICNCHEGKVIKTLNLGKGWVEHLKWSNDGLFLAIASSKKVYVF 133

Query: 142 LENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGE-KHLQVWRCDISSKTVNKGPALSM 200
            E GEE     D    +  ++ S+  + + TA YG      + +   + K   +G  +SM
Sbjct: 134 NEIGEEKWISEDHPSTVSAITWSNKNE-LATACYGRVTFFDIVKNKTNQKLEWQGSLVSM 192

Query: 201 RHSP 204
             SP
Sbjct: 193 ELSP 196


>gi|333984288|ref|YP_004513498.1| hypothetical protein [Methylomonas methanica MC09]
 gi|333808329|gb|AEG00999.1| WD40 repeat-containing protein [Methylomonas methanica MC09]
          Length = 1227

 Score = 41.6 bits (96), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 29/125 (23%), Positives = 59/125 (47%), Gaps = 5/125 (4%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
            +A G+++  I  +D    ++ + S     G +  +AF+  GR L   G +     +   
Sbjct: 589 FIATGSNDNQIRVIDTACHDILFSSILN--GAVTAVAFSPDGRYLAAAGADPTVQLIDVS 646

Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYV 161
            G  +R+FK    P+++ AF  +    A+AS++  VR++ + +G EV +          +
Sbjct: 647 TGRTLRQFKLV-GPVANIAFSPDGSRLAVASADKSVRLIDVSSGREVFRVKHGAEATHVI 705

Query: 162 SASDG 166
            ++DG
Sbjct: 706 FSADG 710


>gi|193213052|ref|YP_001999005.1| WD-40 repeat-containing protein, partial [Chlorobaculum parvum NCIB
           8327]
 gi|193086529|gb|ACF11805.1| WD-40 repeat protein [Chlorobaculum parvum NCIB 8327]
          Length = 241

 Score = 41.6 bits (96), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 44/186 (23%), Positives = 80/186 (43%), Gaps = 14/186 (7%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVL 60
           +W T  G  + ++K        SY   AC F     +K   T  +   T N  ++ +D  
Sbjct: 7   LWDTLTGKRIIKYKS-------SYFFRACAF-SPDGKKILATTDVNTDTRNSLLMLLDAN 58

Query: 61  TGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISS 120
           + E+     G HP  +   AF+  G+       +       SE G+ I  F+   + +  
Sbjct: 59  SAELLMTIQG-HPRDIVSCAFSPNGKKFLSTSLDSTLKLWDSETGQCINTFEDHIEAVWD 117

Query: 121 SAFLCE-EKIFALASSE-VRILSLENGEEVLKFSDDVGPLQYVSA-SDGAKIIITAGYGE 177
            AF  + +KI + +S   +++  + +G  ++ FS    P+      SDG +II  +G  +
Sbjct: 118 CAFSPDGDKILSSSSDHTLKLWDVNSGHCLITFSGHSSPVFSCRFNSDGTRII--SGSSD 175

Query: 178 KHLQVW 183
           K L++W
Sbjct: 176 KTLKLW 181


>gi|393238055|gb|EJD45594.1| WD40 repeat-like protein [Auricularia delicata TFB-10046 SS5]
          Length = 584

 Score = 41.6 bits (96), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 28/110 (25%), Positives = 51/110 (46%), Gaps = 7/110 (6%)

Query: 54  ILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKA 113
           ++  D   G  +WK  G    G     + + GR L +VG  G  +    + G +  E + 
Sbjct: 113 VVGSDAAAGRREWKLDG----GSYRSRYTRNGRHLAIVGKKGHVATFDWQTGTLHSELQL 168

Query: 114 SEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGP--LQYV 161
            E       FL ++ +FA+A S+   +  +NG E+ + S+ + P  L+Y+
Sbjct: 169 RET-CRDITFLQDQSMFAVAQSKHVYIYDQNGVELHRLSNHIEPTRLEYL 217


>gi|119490221|ref|ZP_01622734.1| Peptidase C14, caspase catalytic subunit p20 [Lyngbya sp. PCC 8106]
 gi|119454107|gb|EAW35260.1| Peptidase C14, caspase catalytic subunit p20 [Lyngbya sp. PCC 8106]
          Length = 1245

 Score = 41.6 bits (96), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 46/189 (24%), Positives = 95/189 (50%), Gaps = 14/189 (7%)

Query: 48  GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKS-EMGE 106
           G+ +  I   ++ TGE+    TG H  G+  ++ +   +++ V G++    ++ + E GE
Sbjct: 696 GSGDNTIKVWNLETGELIRTLTG-HRYGVRSVSISNDSKTI-VSGSDDKTIKVWNLETGE 753

Query: 107 VIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVSAS 164
           +IR  K  ++ +SS +   + K     S +  +++ + E G E+   +     ++ VS S
Sbjct: 754 LIRTLKGHDREVSSVSISNDSKTIVSGSDDKTIKVWNRETGAEIRTLTGHRYGVRSVSIS 813

Query: 165 DGAKIIITAGYGEKHLQVWRC----DISSKTVNKGPALSMRHSPVAIDCKNSPNG-EDGT 219
           + +K I+ +G G+  ++VW      +IS+ T + G   S+    ++ D K   +G ED T
Sbjct: 814 NDSKTIV-SGSGDNTIKVWNLQTGKEISNLTGHNGQVWSV---SISNDSKTIVSGSEDST 869

Query: 220 VILAVAESG 228
           + +   E+G
Sbjct: 870 IKVWNLETG 878


>gi|256380418|ref|YP_003104078.1| hypothetical protein Amir_6431 [Actinosynnema mirum DSM 43827]
 gi|255924721|gb|ACU40232.1| WD-40 repeat protein [Actinosynnema mirum DSM 43827]
          Length = 1766

 Score = 41.6 bits (96), Expect = 0.57,   Method: Composition-based stats.
 Identities = 42/164 (25%), Positives = 71/164 (43%), Gaps = 20/164 (12%)

Query: 76   LAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASS 135
            +  LA A  GR L   G++   + + +  GE + E    E  I++ AF  +      ASS
Sbjct: 1260 ITALALAPDGRRL-ATGSSAGTAHVWTAGGEHVAELAGHENWINAVAFSPDGARVTTASS 1318

Query: 136  EVRILS-LENGEEVLKFSDDVGPLQYVSAS-DGAKIIITAGYGEKHLQVWRCD---ISSK 190
            +    +   +G +V   +DDVGP+  ++ S DG  +   A  G  H  VW  D   +++ 
Sbjct: 1319 DRTARTWTTDGTQVAVLTDDVGPVTALAHSPDGKHVATGASDGTGH--VWTADGSLVATL 1376

Query: 191  TVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWD 234
              ++G   S+ +SP            DG +I        A +W+
Sbjct: 1377 LGHQGVITSIAYSP------------DGAIITTAGSDKTARTWN 1408


>gi|126657815|ref|ZP_01728969.1| beta transducin-like protein [Cyanothece sp. CCY0110]
 gi|126621032|gb|EAZ91747.1| beta transducin-like protein [Cyanothece sp. CCY0110]
          Length = 345

 Score = 41.6 bits (96), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 55/235 (23%), Positives = 100/235 (42%), Gaps = 15/235 (6%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
           LL +GT +G++ A+D+ T E  + S        + +AF  + + + VVG +     +   
Sbjct: 74  LLLVGTGDGELTAIDLETKETLY-SKAVLVNNYSSIAFNVE-KDILVVGDDQTVMVLSLS 131

Query: 104 MGEVIREFKASEKPISSSAFLCE-EKIFALASSE--VRILSLENGEEVLKFSDDVGPLQY 160
            G+ +   +     +S  A   +   I +++  +  +RI  LE+GE +     ++GP   
Sbjct: 132 TGQKLSFLREHTGKVSDVAISPDGNNIVSVSGDDQTIRIWDLESGELIKTIGANIGPTTS 191

Query: 161 VSASDGAKIIITAGYG-EKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGT 219
           V  +    + IT   G ++ L+ W    + K +N     S +      D K S    DGT
Sbjct: 192 VQYTPDGTMFITGAIGSDRTLKFWDA-TTFKLLNT----SSQQPGFINDLKIS---NDGT 243

Query: 220 VILAVAESGVAYSWDLKTVSQDEKTNPAKITVKLKKADADQQNSVNVKKSRTSII 274
            ++A   + V  SWDL T+ Q   T   K+ +       D +      K  T ++
Sbjct: 244 QLVAAVRNFVK-SWDLTTLQQVWSTKGPKLEINTIAVSPDNRTVATANKEGTIML 297


>gi|296120844|ref|YP_003628622.1| serine/threonine protein kinase-related protein [Planctomyces
            limnophilus DSM 3776]
 gi|296013184|gb|ADG66423.1| Serine/threonine protein kinase-related protein [Planctomyces
            limnophilus DSM 3776]
          Length = 1856

 Score = 41.6 bits (96), Expect = 0.58,   Method: Compositional matrix adjust.
 Identities = 45/187 (24%), Positives = 80/187 (42%), Gaps = 20/187 (10%)

Query: 61   TGEMKWKSTGRHPGGLAG-LAFAKKGRSLHVVGTNGMASEM---------KSEMGEVIRE 110
            T + K  S  R P G+   L  ++ GR L +       +E+           EMG  + +
Sbjct: 1477 TADAKLVSEYRLPEGMVNSLEISRDGRLLAIANAETRQAEILVLPGFKRIPMEMGGSVVD 1536

Query: 111  FKASEKPISSSAFLCE-EKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKI 169
             K     + + AF  E E +  +  ++ R+ +L +G E++ FS + G +     S   + 
Sbjct: 1537 LKKRGALLWAVAFGPEAETLMTIGGTDARLWNLRDGREMMSFSPN-GVVASARYSPDNQW 1595

Query: 170  IITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGV 229
            I+T+ + +  ++VW+       V     L   H+        SP+GE   +IL  ++ G 
Sbjct: 1596 IVTSSW-DNSVKVWKAATGESMVR----LEGGHTSAVNMASFSPDGE---LILTASDDGT 1647

Query: 230  AYSWDLK 236
            A  WD K
Sbjct: 1648 AKLWDWK 1654


>gi|194335525|ref|YP_002017319.1| WD-40 repeat-containing protein [Pelodictyon phaeoclathratiforme
           BU-1]
 gi|194308002|gb|ACF42702.1| WD-40 repeat protein [Pelodictyon phaeoclathratiforme BU-1]
          Length = 316

 Score = 41.6 bits (96), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 27/107 (25%), Positives = 55/107 (51%), Gaps = 3/107 (2%)

Query: 79  LAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCE-EKIFALASSE- 136
           LAF+   R +   G + M   + +  GE+ R F+  +  + S  F  +  ++ + A+ E 
Sbjct: 208 LAFSPDDRFIAFCGRDAMVKILDAASGEITRVFEGHQDAVRSVCFTPDGSRVVSAANDET 267

Query: 137 VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVW 183
           VR+  +E+G+++  +   V  +Q V  S   K II +G  ++ +++W
Sbjct: 268 VRLWDIESGKQLHLYRGHVLEVQSVDVSPDGK-IIASGSDDRKIKLW 313


>gi|340726869|ref|XP_003401774.1| PREDICTED: WD repeat-containing protein 43-like [Bombus terrestris]
          Length = 629

 Score = 41.6 bits (96), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 58/271 (21%), Positives = 116/271 (42%), Gaps = 45/271 (16%)

Query: 34  KKRRK--------ERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKK- 84
           KKRRK        ++G  ++A+G++NG +   D  T  +  +  G H G +  + +++  
Sbjct: 70  KKRRKKSISEESNQKG--IVAMGSTNGRVTLYDTTTSLVTAQLNG-HSGTVTAVTWSENV 126

Query: 85  -----GRSLHVVG----TNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASS 135
                    H++      NG+  + KS  G+ +           S A   + K       
Sbjct: 127 GLFSAADDHHIIQWNLQENGVRCKWKSGKGKTV-----------SLAISPDGKSLLSGER 175

Query: 136 EVRILSLENGEEVLKFS---DDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTV 192
            ++  +L+  + V  F+   ++V  L  V  + G   +I+   G+ +L VW  D      
Sbjct: 176 VIKWWNLDTKQLVRTFTGHANEVSCLHTVQTTSGNNYVISGACGDGYLNVWALDEQRNER 235

Query: 193 NKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQDEKTNPAKITVK 252
               +L+++  P+++    S N +   ++L V  SG A+   L     + ++ P K ++ 
Sbjct: 236 TAVASLALQDDPISVSTHISENFQ--VIVLVVTRSGQAH---LFQYQPNGRSKPLKPSLN 290

Query: 253 LKKA-DADQQNSVNVKKSRTSIIAARLNDLE 282
           +  A D  Q++SV     + SI+  +L + E
Sbjct: 291 IAVASDISQKDSVQ----QISILNGKLTEDE 317


>gi|291230364|ref|XP_002735136.1| PREDICTED: katanin p80 subunit-like [Saccoglossus kowalevskii]
          Length = 666

 Score = 41.6 bits (96), Expect = 0.61,   Method: Compositional matrix adjust.
 Identities = 26/114 (22%), Positives = 49/114 (42%), Gaps = 2/114 (1%)

Query: 72  HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFA 131
           H   +  + F+  GR +   G +G+A       G++I EFK    P+++  F   E + A
Sbjct: 143 HTNRINSVRFSPDGRWVASAGEDGLAKLWDLAAGKLINEFKHHTGPVNNIEFHPNEFLLA 202

Query: 132 LASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVW 183
             S++  V+   LEN   V     +  P++ +       ++ + G    H+  W
Sbjct: 203 TGSADRTVKFWDLENFNLVGTTDKEASPIRCILFHQDGNVLFSGGQDSLHVYSW 256


>gi|428174103|gb|EKX43001.1| hypothetical protein GUITHDRAFT_73545 [Guillardia theta CCMP2712]
          Length = 623

 Score = 41.2 bits (95), Expect = 0.63,   Method: Compositional matrix adjust.
 Identities = 47/162 (29%), Positives = 74/162 (45%), Gaps = 12/162 (7%)

Query: 29  CGFVGKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSL 88
           CG   K  R      L+A G+ NG +    V TG+  WK    H   +  +++ + GR L
Sbjct: 2   CGSFSKDSR------LVATGSKNGLVRLWCVETGKC-WKELPGHETMIRSISWGQDGRML 54

Query: 89  HVVGTNGMASEMKSEMGEVIREFKASE-KPISSSAFLCEEKIFALASSE--VRILSLENG 145
             V  NG     K    E   +  A   + I+S +   + K+ A AS+E  V + ++E G
Sbjct: 55  AAVSQNGTTRVWKEPWTEEACQVLAGHGETINSVSLGPDGKMLAAASNEKKVWVWNVETG 114

Query: 146 EEVLKFSDDV-GPLQYVSASDGAKIIITAGYGEKHLQVWRCD 186
           E  L+  + V G +  V+     K +I +G  +K +QVW  D
Sbjct: 115 EVRLELKEQVEGGVTSVAWRPDGK-MIASGSLDKAIQVWEVD 155


>gi|47220108|emb|CAF99021.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 651

 Score = 41.2 bits (95), Expect = 0.63,   Method: Compositional matrix adjust.
 Identities = 55/248 (22%), Positives = 99/248 (39%), Gaps = 22/248 (8%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVG--------KKRRKERGTL-----LLAL 47
           IW+T+  +L  E+  P      +  C++ G           KKR+ E G +     LLA+
Sbjct: 35  IWNTDSKTLHQEY-VPSAHLSATCICISWGPCRTLKERPQRKKRKSEAGQVEESADLLAM 93

Query: 48  GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
           GT+ G +L      G +     G H GG+  + +      L+    +    E   + G+ 
Sbjct: 94  GTAAGTVLIYSTAKGALHCTLDGGHSGGVNCVQWHPDDSLLYSGSDDTNIVEWDLQTGKT 153

Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKF---SDDVGPLQYVSA- 163
             ++KA    ++S     + K+   A   +++  L   E   KF   S  V  L + +  
Sbjct: 154 KSKWKADRAAVTSLCVSPDGKLLLSAGQTIKMWDLNTKEVYRKFTGHSTAVTTLCFATTR 213

Query: 164 ---SDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTV 220
              S+G   +  A + ++ L VW+     K  N   + ++   P  ID   S + E+   
Sbjct: 214 PPDSNGLYFLSGAAH-DRLLSVWQVREDGKDKNSVVSFALTDEPQHIDLIPSKSREEVMR 272

Query: 221 ILAVAESG 228
           +  V + G
Sbjct: 273 LAVVCKDG 280


>gi|166366565|ref|YP_001658838.1| WD repeat-containing protein [Microcystis aeruginosa NIES-843]
 gi|166088938|dbj|BAG03646.1| WD-repeat protein [Microcystis aeruginosa NIES-843]
          Length = 962

 Score = 41.2 bits (95), Expect = 0.66,   Method: Compositional matrix adjust.
 Identities = 44/187 (23%), Positives = 78/187 (41%), Gaps = 22/187 (11%)

Query: 56  AVDVLTGEMKWKSTGR----HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREF 111
            ++ L   + WKS       H   +  + F+  G++L  V  +        E GE IR  
Sbjct: 571 VMNTLQALLNWKSERNRLEGHKDFVTSVNFSPDGKTLVSVSRDNTIKLWNVETGEEIRTL 630

Query: 112 KASEKPISSSAFLCEEKIFALAS--SEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKI 169
           K  +  + S  F  + K     S  + +++ ++E GEE+       GP+  V+ S   K 
Sbjct: 631 KGHD-GVQSVNFSPDGKTLVSGSWDNTIKLWNVETGEEIRTLKGHNGPVNSVNFSPDGKT 689

Query: 170 IITAGYGEKHLQVWRCDISSKTVN-KGPALSMRHSPVAIDCKNSPNGE-------DGTVI 221
           ++ +G G+K +++W  +   +    KG      H    I    SP+G+       D T+ 
Sbjct: 690 LV-SGSGDKTIKLWNVETGQEIRTLKG------HDNSVISVNFSPDGKTLVSGSGDNTIK 742

Query: 222 LAVAESG 228
           L   E+G
Sbjct: 743 LWNVETG 749


>gi|329663507|ref|NP_001192527.1| WD repeat-containing protein 43 [Bos taurus]
 gi|296482332|tpg|DAA24447.1| TPA: WD repeat domain 43 [Bos taurus]
          Length = 681

 Score = 41.2 bits (95), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 54/262 (20%), Positives = 101/262 (38%), Gaps = 29/262 (11%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFV--------GKKRRKERGTL------LLA 46
           +W T +  L  E+  P      + +CLA             +K+RK    +      LLA
Sbjct: 40  VWETANNRLHQEY-VPSAHLSGTCTCLAWAPARLQAKEGPQRKKRKSEAIVTSNQADLLA 98

Query: 47  LGTSNGDILAVDVLTGEMKWK-STGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMG 105
           LGT+ G IL    + GE+  K   G H   +  + + +    L+    +    E  ++  
Sbjct: 99  LGTAVGSILLYSTVKGELHSKLINGGHDNRVNCIQWHQDNGCLYSCSDDKHIVEWNTQTC 158

Query: 106 EVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSAS- 164
           +V  ++K     +SS     + K+   A   +++  LE  E    F+    P+  ++ + 
Sbjct: 159 KVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLETKEVYRHFTGHATPVSSLTFTT 218

Query: 165 ----------DGAK--IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNS 212
                     DG      ++    ++ L VW+    +K  N   + ++   PV ID   S
Sbjct: 219 IRPPNEIQPFDGITGLYFLSGAVHDRLLNVWQVRSENKEKNAVMSFTVTDEPVYIDLTLS 278

Query: 213 PNGEDGTVILAVAESGVAYSWD 234
            N E+   +  V   G  + ++
Sbjct: 279 ENKEEPVKLAVVCRDGQVHLFE 300


>gi|195377108|ref|XP_002047334.1| GJ11984 [Drosophila virilis]
 gi|194154492|gb|EDW69676.1| GJ11984 [Drosophila virilis]
          Length = 1229

 Score = 41.2 bits (95), Expect = 0.68,   Method: Composition-based stats.
 Identities = 23/68 (33%), Positives = 38/68 (55%), Gaps = 2/68 (2%)

Query: 45  LALGTSNGDILAVDVLTGEMKWKSTGRHP-GGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
           +A+GTS+G IL  DV T  ++W    +H  G ++ LAF      L V    G+ + + ++
Sbjct: 131 IAVGTSHGHILNFDV-TQTLRWAHQDKHSQGAVSSLAFNGDSTRLLVGYARGLVTMLDTQ 189

Query: 104 MGEVIREF 111
            G+V+RE 
Sbjct: 190 TGDVLREL 197


>gi|428181079|gb|EKX49944.1| hypothetical protein GUITHDRAFT_161985 [Guillardia theta CCMP2712]
          Length = 676

 Score = 41.2 bits (95), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 52/242 (21%), Positives = 99/242 (40%), Gaps = 14/242 (5%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACG------FVGKKRRKERGTLLLALGTSNGDI 54
           IW    G+  +E+  P G+     +C++ G         KK++ +RG  L+A+GT  G I
Sbjct: 24  IWDVTSGAASSEF-SPTGKHAAELTCVSWGKALPVTSSAKKKKSKRGADLVAIGTGAGKI 82

Query: 55  LAVDVLTGEMKWKSTGR---HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREF 111
           L  D + GE++    G    H G +  +     G +L+    +   S    E GE+  + 
Sbjct: 83  LLWDCVKGELRKTLGGEKSGHTGRINDIVLT-SGSNLYCCTDDCFFSCWNIETGEMTWKV 141

Query: 112 KASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIII 171
              ++  +  A    E     ASS +++  L   +   K+      +  ++ S  ++ ++
Sbjct: 142 AVGKQSANRLALRSSEDEIVTASSAIKVWDLATRKTKRKYPGHASRVVCLAYSADSRFLV 201

Query: 172 TAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAY 231
           + G  ++ + +W     +         ++  SP  I       GE    +LAV+E G   
Sbjct: 202 S-GASDRFISLWDSSFETDNTVALKTFTLDSSPTQISLVGG-TGE-SMQMLAVSEGGSVS 258

Query: 232 SW 233
            W
Sbjct: 259 VW 260


>gi|153869314|ref|ZP_01998955.1| WD-40 repeat protein [Beggiatoa sp. PS]
 gi|152074160|gb|EDN71045.1| WD-40 repeat protein [Beggiatoa sp. PS]
          Length = 1207

 Score = 41.2 bits (95), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 56/230 (24%), Positives = 101/230 (43%), Gaps = 30/230 (13%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVL 60
            WS   G ++  WK PD    ++ S               G  L + GT N  I   ++ 
Sbjct: 685 FWSLPTGKIIKRWKAPDKVKALALS-------------PDGKYLASAGTDN-KITLWNLE 730

Query: 61  TGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISS 120
           T + +   +G H   ++GLAF+  G  L     +G A   + + G+V+   KA    +  
Sbjct: 731 TDQPQQIFSG-HKDQISGLAFSSDGELLASASYDGTARLWQVKTGKVLHTLKAHTDHVQK 789

Query: 121 SAFLCEEKIFALASSE--VRILSLENG--EEVLK-FSDDVGPLQYVSASDGAKIIITAGY 175
            AF  + +  A +S +  +R+ ++ +G  E VL+     +  ++++   D  + +++A  
Sbjct: 790 VAFSHDNQWLATSSKDATIRLWNVNSGKTERVLRGHKQIIFDIRFI---DHGQTLVSAS- 845

Query: 176 GEKHLQVWRCDISS---KTVNKGPALSMRHSPVAIDCKNSPNGEDGTVIL 222
            ++ L++W  DI S   K V +G    +       D K     +DGTVIL
Sbjct: 846 DDRTLRLW--DIQSGVTKRVFQGHTAGVT-GIATFDNKIFSASDDGTVIL 892


>gi|427419559|ref|ZP_18909742.1| WD40 repeat-containing protein [Leptolyngbya sp. PCC 7375]
 gi|425762272|gb|EKV03125.1| WD40 repeat-containing protein [Leptolyngbya sp. PCC 7375]
          Length = 1143

 Score = 41.2 bits (95), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 34/151 (22%), Positives = 65/151 (43%), Gaps = 9/151 (5%)

Query: 67   KSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCE 126
            K+  +H GG+ G+AF+  G++L   G + M     ++ G  ++  +    P+ S  F   
Sbjct: 905  KTLTKHQGGVWGVAFSPDGQTLASAGGDNMVKLWHAD-GTFLKTLEGHRAPVWSVMFNPN 963

Query: 127  EKIFALASSEVRI-LSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRC 185
             +  A  S +    L  ++G+ +  F +D   L  ++ S     ++T G  +  +++W+ 
Sbjct: 964  GRTLATTSGDATAKLWNQDGKVITTFDNDGIILFDIAFSPDGHTLVTGG-SDGIVKLWQA 1022

Query: 186  DISSKTVNKGPALSMRHSPVAIDCKNSPNGE 216
            D +      G      H         SPNGE
Sbjct: 1023 DGTLLNTMVG------HGAAVFQVAFSPNGE 1047


>gi|440899093|gb|ELR50459.1| WD repeat-containing protein 43, partial [Bos grunniens mutus]
          Length = 703

 Score = 41.2 bits (95), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 54/262 (20%), Positives = 101/262 (38%), Gaps = 29/262 (11%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFV--------GKKRRKERGTL------LLA 46
           +W T +  L  E+  P      + +CLA             +K+RK    +      LLA
Sbjct: 62  VWETANNRLHQEY-VPSAHLSGTCTCLAWAPARLQAKEGPQRKKRKSEAIVTSNQADLLA 120

Query: 47  LGTSNGDILAVDVLTGEMKWK-STGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMG 105
           LGT+ G IL    + GE+  K   G H   +  + + +    L+    +    E  ++  
Sbjct: 121 LGTAVGSILLYSTVKGELHSKLINGGHDNRVNCIQWHQDNGCLYSCSDDKHIVEWNTQTC 180

Query: 106 EVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSAS- 164
           +V  ++K     +SS     + K+   A   +++  LE  E    F+    P+  ++ + 
Sbjct: 181 KVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLETKEVYRHFTGHATPVSSLTFTT 240

Query: 165 ----------DGAK--IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNS 212
                     DG      ++    ++ L VW+    +K  N   + ++   PV ID   S
Sbjct: 241 IRPPNESQPFDGITGLYFLSGAVHDRLLNVWQVRSENKEKNAVMSFTVTDEPVYIDLTLS 300

Query: 213 PNGEDGTVILAVAESGVAYSWD 234
            N E+   +  V   G  + ++
Sbjct: 301 ENKEEPVKLAVVCRDGQVHLFE 322


>gi|262198770|ref|YP_003269979.1| serine/threonine protein kinase with WD40 repeats [Haliangium
            ochraceum DSM 14365]
 gi|262082117|gb|ACY18086.1| serine/threonine protein kinase with WD40 repeats [Haliangium
            ochraceum DSM 14365]
          Length = 1823

 Score = 41.2 bits (95), Expect = 0.70,   Method: Composition-based stats.
 Identities = 31/104 (29%), Positives = 47/104 (45%), Gaps = 3/104 (2%)

Query: 76   LAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASS 135
            +A  +F+  G  L   G +G         GE+IR F   E  I ++AF    +  A A S
Sbjct: 1589 IAAASFSPDGERLATAGDDGRVHIWNVATGELIRAFVGHEGTIKAAAFAPSGQHLATAGS 1648

Query: 136  E--VRILSLENGEEVLKFSDDVGPLQYVS-ASDGAKIIITAGYG 176
            +  VR+     GE +  F+    P+  V   SDG+++I  A  G
Sbjct: 1649 DRSVRLWDASTGERLQTFTGHTLPINTVHFNSDGSRLISAAEDG 1692


>gi|66820176|ref|XP_643725.1| WD40 repeat-containing protein [Dictyostelium discoideum AX4]
 gi|60471897|gb|EAL69851.1| WD40 repeat-containing protein [Dictyostelium discoideum AX4]
          Length = 569

 Score = 41.2 bits (95), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 38/156 (24%), Positives = 67/156 (42%), Gaps = 11/156 (7%)

Query: 34  KKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTG---RHPGGLAGLAFAKKGRSLHV 90
           K  +KE    L+A+  S G I   D +     +K  G    H   +  +   K  R L  
Sbjct: 57  KSEKKEYYKNLVAVSNS-GKIYMYDCI----DFKKKGIFDTHSKNITAVVLGK--RYLFT 109

Query: 91  VGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLK 150
               G  S+   +   +IR F++SEK ISS      E   A+A++++  + L N +E  K
Sbjct: 110 SSHGGKVSQWDIKTKHIIRSFESSEKSISSMGLNNRESQLAIANNDITFIDLSNFKESKK 169

Query: 151 FSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCD 186
            S     ++ +  +   K +I   Y +  +Q++  +
Sbjct: 170 VSTGSNSIKQIQFTQDDKFLICNRY-QPTIQIYNIE 204


>gi|426226305|ref|XP_004007288.1| PREDICTED: WD repeat-containing protein 43 [Ovis aries]
          Length = 716

 Score = 41.2 bits (95), Expect = 0.79,   Method: Compositional matrix adjust.
 Identities = 43/202 (21%), Positives = 82/202 (40%), Gaps = 14/202 (6%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWK-STGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
           LLALGT+ G IL    + GE+  K  +G H   +  + + +    L+    +    E  +
Sbjct: 131 LLALGTAVGSILLYSTVKGELHSKLISGGHDNRVNCIQWHQDNGCLYSCSDDKHIVEWNT 190

Query: 103 EMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVS 162
           +  +V  ++K     +SS     + K+   A   +++  LE  E    F+    P+  ++
Sbjct: 191 QTCKVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLETKEVYRHFTGHATPVSSLT 250

Query: 163 AS-----------DGAK--IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDC 209
            +           DG      ++    ++ L VW+    +K  N   + ++   PV ID 
Sbjct: 251 FTTIRPPNESQPFDGITGLYFLSGAVHDRLLNVWQVRSENKEKNAVMSFTVTDEPVFIDL 310

Query: 210 KNSPNGEDGTVILAVAESGVAY 231
             S N E+   +  V + G  +
Sbjct: 311 TLSENKEEPVKLAVVCKDGQVH 332


>gi|334312883|ref|XP_001380617.2| PREDICTED: WD repeat-containing protein 43 [Monodelphis domestica]
          Length = 682

 Score = 40.8 bits (94), Expect = 0.82,   Method: Compositional matrix adjust.
 Identities = 46/208 (22%), Positives = 85/208 (40%), Gaps = 20/208 (9%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWK-STGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
           LLA+GT+ G IL  + + GE++ K  +G H   +  + + +    L+    +    E  +
Sbjct: 96  LLAIGTAVGSILLYNTVKGELQSKLVSGGHDNKVNCIQWHQDNDCLYSCSDDKYIVEWNT 155

Query: 103 EMGEVIREFKASEKPISSSAFLC---EEKIFALASSEVRILSLENGEEVLKFSDD----- 154
           +   V  ++K     ISS + LC   + K+   A   +++  LE  E    F+       
Sbjct: 156 QTCRVKCKWKGD---ISSVSALCISPDGKMLLSAGRTIKLWDLETKEVYRHFTGHATAVS 212

Query: 155 ------VGPLQYVSASDGAK--IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVA 206
                 + PL      DG      ++    ++ L VW+    +K  N   + ++   PV 
Sbjct: 213 SLIFTTIRPLNESQPFDGITGLYFLSGAVHDRLLNVWQIRSENKEKNAVMSFTVTDEPVY 272

Query: 207 IDCKNSPNGEDGTVILAVAESGVAYSWD 234
           ID   S N E+   +  V   G  + ++
Sbjct: 273 IDLTLSENKEEPVKLAVVCRDGQVHLFE 300


>gi|170288772|ref|YP_001739010.1| WD-40 repeat-containing protein [Thermotoga sp. RQ2]
 gi|170176275|gb|ACB09327.1| WD-40 repeat protein [Thermotoga sp. RQ2]
          Length = 580

 Score = 40.8 bits (94), Expect = 0.82,   Method: Compositional matrix adjust.
 Identities = 53/221 (23%), Positives = 99/221 (44%), Gaps = 15/221 (6%)

Query: 13  WKQPDGEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRH 72
           W  PDGE + S   L    V     + R      +G +NG  +  D L+ +        H
Sbjct: 131 WSFPDGEEISS-QKLGPSVVQIAAYENR----YVVGLANGLAVIRD-LSNQSFSIPLKAH 184

Query: 73  PGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFL-CEEKIFA 131
           P G+  + F++ G+ +   G N +     S  G ++ ++  +   ++  AFL  ++ IF 
Sbjct: 185 PEGIKKIVFSENGQLIATCGGNTVKVWNASN-GNLVLQYDHA-ITVNDLAFLDNDDLIFV 242

Query: 132 LASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYG-EKHLQVWRCDISSK 190
               +  +L+++  E V         + +VS+ DG+     A YG +K ++VW  D+  +
Sbjct: 243 ADDYKAVVLNIQKNEVVKSIDAHNNFVLFVSSDDGS----IATYGMDKTVKVWNADLELQ 298

Query: 191 TVNKGPALSMRHSPVAIDCKNSPNG-EDGTVILAVAESGVA 230
               G  LS+    ++ D K   +G +D  +++  AE G+A
Sbjct: 299 YSLYGHQLSVNTVALSSDGKFIVSGSDDREILIWNAEKGIA 339


>gi|443659552|ref|ZP_21132351.1| WD domain, G-beta repeat family protein [Microcystis aeruginosa
            DIANCHI905]
 gi|443332697|gb|ELS47292.1| WD domain, G-beta repeat family protein [Microcystis aeruginosa
            DIANCHI905]
          Length = 1108

 Score = 40.8 bits (94), Expect = 0.85,   Method: Compositional matrix adjust.
 Identities = 50/200 (25%), Positives = 83/200 (41%), Gaps = 26/200 (13%)

Query: 44   LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
            ++A+   +G I  +D    +++  +T      +  +AF      + + G NG   ++ S+
Sbjct: 842  IIAIANKDGQITLLDSQGKKIREFTTKMR--SIYSIAFHPDSNQIAITGRNGKV-QIWSQ 898

Query: 104  MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFS---DDVGPL 158
             G +++EF AS+ PI S AF  E       +SE  V+   L N    L  S   DD    
Sbjct: 899  KGTMLQEFTASQVPIYSLAFNGEGTAIITGTSEGKVQYWHLNNHRPQLINSWTVDDSIIY 958

Query: 159  QYVSASDGAKIIITAGYGEKHLQVWRCD---ISSKTVNKGPALSMRHSPVAIDCKNSPNG 215
              V + D  KI  TA  G+  +++W      +     +  P   +  SP           
Sbjct: 959  DLVFSPDHQKIA-TATRGK--IKIWDLQGNLLKEIKTDSFPVYGVSFSP----------- 1004

Query: 216  EDGTVILAVAESGVAYSWDL 235
             DG  I A++  G A  WD+
Sbjct: 1005 -DGEKIAAISRDGTARRWDI 1023


>gi|422304841|ref|ZP_16392180.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
           9806]
 gi|389789952|emb|CCI14119.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
           9806]
          Length = 559

 Score = 40.8 bits (94), Expect = 0.89,   Method: Compositional matrix adjust.
 Identities = 32/106 (30%), Positives = 59/106 (55%), Gaps = 6/106 (5%)

Query: 79  LAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALAS-SEV 137
           LAF+  G+ L   G++ +      + GE+I+ ++A E+ I S A      + A AS +E+
Sbjct: 385 LAFSNDGQYLISGGSDKVIKIWDIKTGEIIKSWQAHEQAIISIAINPHRHLIASASRTEI 444

Query: 138 RILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVW 183
           +I   + G E++K      PL++  + DG + +IT  YG+K +++W
Sbjct: 445 KIWQGQTG-ELIKVLRGTAPLKF--SPDG-QFLITGSYGDK-VKIW 485


>gi|403362279|gb|EJY80863.1| hypothetical protein OXYTRI_21745 [Oxytricha trifallax]
          Length = 524

 Score = 40.8 bits (94), Expect = 0.91,   Method: Compositional matrix adjust.
 Identities = 33/151 (21%), Positives = 68/151 (45%), Gaps = 6/151 (3%)

Query: 41  GTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEM 100
           G LLL+  + +G +   DV+T     ++   H   +  + F+  GR     G + +    
Sbjct: 246 GHLLLS-ASHDGTVKIWDVMTHRKCLRTYMGHTKAVRDICFSNDGRRFLSAGFDRVIQLW 304

Query: 101 KSEMGEVIREFKASEKPISSSAFLCEEK--IFALASSEVRILSLE-NGEEV-LKFSDDVG 156
            +E G+VIR F   + P        ++K  IF    +  +IL  + N  E+ L++ + +G
Sbjct: 305 DTETGKVIRSFTNRKTPFCVKFHPSDDKQNIFLAGCANKKILQYDTNSSEITLQYEEHLG 364

Query: 157 PLQYVSASDGAKIIITAGYGEKHLQVWRCDI 187
            +  ++  +G +  ++    +K + +W   I
Sbjct: 365 SINTITFIEGGRRFVSTA-DDKKIFLWEFGI 394


>gi|159029360|emb|CAO90736.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
          Length = 1078

 Score = 40.8 bits (94), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 50/200 (25%), Positives = 83/200 (41%), Gaps = 26/200 (13%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
           ++A+   +G I  +D    +++  +T      +  +AF      + + G NG   ++ S+
Sbjct: 812 IIAIANKDGQITLLDSQGKKIREFTTKMR--SIYSIAFHPDSNQIAITGRNGKV-QIWSQ 868

Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFS---DDVGPL 158
            G +++EF AS+ PI S AF  E       +SE  V+   L N    L  S   DD    
Sbjct: 869 KGTMLQEFTASQVPIYSLAFNGEGTAIITGTSEGKVQYWHLNNHRPQLINSWTVDDSIIY 928

Query: 159 QYVSASDGAKIIITAGYGEKHLQVWRCD---ISSKTVNKGPALSMRHSPVAIDCKNSPNG 215
             V + D  KI  TA  G+  +++W      +     +  P   +  SP           
Sbjct: 929 DLVFSPDHQKIA-TATRGK--IKIWDLQGNLLKEIKTDSFPVYGVSFSP----------- 974

Query: 216 EDGTVILAVAESGVAYSWDL 235
            DG  I A++  G A  WD+
Sbjct: 975 -DGEKIAAISRDGTARRWDI 993


>gi|431911947|gb|ELK14091.1| WD repeat-containing protein 43 [Pteropus alecto]
          Length = 680

 Score = 40.8 bits (94), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 48/220 (21%), Positives = 88/220 (40%), Gaps = 19/220 (8%)

Query: 34  KKRRKE-----RGTLLLALGTSNGDILAVDVLTGEMKWK-STGRHPGGLAGLAFAKKGRS 87
           KKR+ E       T LLALGT+ G IL    + GE+  K  +G H   +  + + +    
Sbjct: 81  KKRKSEPIGTSNQTDLLALGTAVGSILLYSTVKGELHSKLISGGHDNKVNCIQWHQDNGC 140

Query: 88  LHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEE 147
           L+    +    E  ++  +V  ++K     +SS     + K+   A   +++  LE  E 
Sbjct: 141 LYSCSDDKHIVEWNTQTCKVKCKWKGDSSGVSSLCISPDGKMLLSAGRTIKLWVLETKEL 200

Query: 148 VLKFSDDVGPLQYVSAS-----------DGAK--IIITAGYGEKHLQVWRCDISSKTVNK 194
              F+    P+  +  +           DG      ++    ++ L VW+    +K  N 
Sbjct: 201 YRHFTGHATPVSSLMFTTIRPPNESQPFDGITGLYFLSGAVHDRLLNVWQVRSENKEKNA 260

Query: 195 GPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWD 234
             + ++   PV ID   S N E+   +  V   G  + ++
Sbjct: 261 VMSFTITDEPVYIDLTLSENKEEPVKLAVVCRDGQVHLFE 300


>gi|425445434|ref|ZP_18825464.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9443]
 gi|389734586|emb|CCI01787.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9443]
          Length = 1107

 Score = 40.8 bits (94), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 50/200 (25%), Positives = 82/200 (41%), Gaps = 26/200 (13%)

Query: 44   LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
            ++A+   +G I  +D    +++  +T      +  +AF   G  + + G NG   ++ S+
Sbjct: 842  IIAIANKDGQITLLDSQGKKIREFTTKMR--SIYSIAFHPDGNQMGITGRNGKV-QIWSK 898

Query: 104  MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFS---DDVGPL 158
             G +++EF AS+ PI S AF  E       +SE  V+   L N    L  S   DD    
Sbjct: 899  KGTMLQEFTASQVPIYSLAFNGEGTAIITGTSEGKVQYWHLSNHRPQLINSWTADDNIIY 958

Query: 159  QYVSASDGAKIIITAGYGEKHLQVWRCD---ISSKTVNKGPALSMRHSPVAIDCKNSPNG 215
              V + D  KI  TA  G+  +++W            +  P   +  SP           
Sbjct: 959  DLVFSPDHQKIA-TAARGK--IKIWDLQGNLFEEIKTDSFPVYGVSFSP----------- 1004

Query: 216  EDGTVILAVAESGVAYSWDL 235
             DG  I  ++  G A  WD+
Sbjct: 1005 -DGEKIATISRDGTARRWDI 1023


>gi|345565400|gb|EGX48350.1| hypothetical protein AOL_s00080g320 [Arthrobotrys oligospora ATCC
           24927]
          Length = 365

 Score = 40.8 bits (94), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 44/189 (23%), Positives = 85/189 (44%), Gaps = 20/189 (10%)

Query: 63  EMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSA 122
            +K+  TG H GG++ + F+  G+ +  V  +       S  GE+ + F+A    +S  A
Sbjct: 66  RLKYTLTG-HNGGVSSVKFSPDGKWIASVSADKSLRVWDSRTGELEQIFEAHTAGVSDVA 124

Query: 123 FLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHL 180
           +  + K  A  S +  +R+  L++G  +         +  ++ +    +I++  Y E  +
Sbjct: 125 WSPDSKTLATGSDDKTIRLWELKSGRMIRILKGHHNYVYCLNFNPQGNMIVSGSYDEA-V 183

Query: 181 QVWRCDISSKTVNKGPALSMRHSPVA-IDCKNSPNGEDGTVILAVAESGVAYSWD----- 234
           ++W  DI S    K   L     PV+ +D        DGT+I++ +   +   WD     
Sbjct: 184 RIW--DIRSGNCQK--TLPAHQDPVSGVDFI-----RDGTMIVSCSHDKLIRIWDTNTGQ 234

Query: 235 -LKTVSQDE 242
            LKT+ ++E
Sbjct: 235 CLKTLVEEE 243


>gi|430743609|ref|YP_007202738.1| WD40 repeat-containing protein [Singulisphaera acidiphila DSM
           18658]
 gi|430015329|gb|AGA27043.1| WD40 repeat-containing protein [Singulisphaera acidiphila DSM
           18658]
          Length = 360

 Score = 40.4 bits (93), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 36/140 (25%), Positives = 64/140 (45%), Gaps = 15/140 (10%)

Query: 75  GLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALAS 134
           G+  LAFA  GR+L V   +G      S  G     F+A    + S AF   +++ A  S
Sbjct: 198 GVTSLAFASDGRTLAVGTVDGTVRLWDSAQGRQFPRFRAHGSEVISLAFAEADRMLASCS 257

Query: 135 SEVRILSLENGE----EVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVW------- 183
              R++ L + +     +LK  D    +Q  + + G++ + TAG+ +  +++W       
Sbjct: 258 HNERMVRLWDVDVDAGRLLKEFDGRSAIQAFAFAPGSRELATAGF-DGAVRLWDVATGDS 316

Query: 184 RCDISSKTVNKGPALSMRHS 203
           R D+S+     GP  ++  S
Sbjct: 317 RWDLSA---GDGPVTALDFS 333


>gi|295676345|ref|YP_003604869.1| hypothetical protein BC1002_1278 [Burkholderia sp. CCGE1002]
 gi|295436188|gb|ADG15358.1| WD40 repeat, subgroup [Burkholderia sp. CCGE1002]
          Length = 1445

 Score = 40.4 bits (93), Expect = 1.1,   Method: Composition-based stats.
 Identities = 42/186 (22%), Positives = 77/186 (41%), Gaps = 20/186 (10%)

Query: 58   DVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKP 117
            D  TG+   +  G H G +   AF+  GR +     +  A    +  G+ I +     + 
Sbjct: 894  DAATGKQIVQLNG-HQGPVFSAAFSPDGRRVVSASADRTARVWDAATGQAIAQLIGHREL 952

Query: 118  ISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGY 175
            +SS+AF  + +    AS +   R+    NG+ + + +   GP+   + S   + ++TA  
Sbjct: 953  VSSAAFSPDGRRVVSASDDKTARVWDAANGQVITQLTGHQGPVFSAAFSPDGRRVVTAS- 1011

Query: 176  GEKHLQVWRCD----ISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAY 231
             +K  +VW       I+  T ++GP  S   +P            DG  ++  ++   A 
Sbjct: 1012 DDKTARVWDAATGHVITQLTGHQGPVSSAAFTP------------DGLRVVTASDDKTAR 1059

Query: 232  SWDLKT 237
             WD  T
Sbjct: 1060 VWDAAT 1065



 Score = 40.0 bits (92), Expect = 1.7,   Method: Composition-based stats.
 Identities = 35/145 (24%), Positives = 65/145 (44%), Gaps = 6/145 (4%)

Query: 58   DVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKP 117
            D  TG+     +G H G +   AF+  GR +     +G A    +  G+ I  F   ++ 
Sbjct: 1270 DAATGKQILVLSGHH-GTVFSAAFSPDGRRVVTAAADGTARVWDAATGKQIARFGGHQRA 1328

Query: 118  ISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGY 175
            +SS+AF  + +    AS++   R+     G  + + +   GP+   + S   + ++TA  
Sbjct: 1329 VSSAAFSPDGQRVVTASADQTARVWDAATGRVIAQLAGHRGPVSSAAFSPDGQRVVTAS- 1387

Query: 176  GEKHLQVWRCDISSKTVNKGPALSM 200
             ++  +VW   I     N+G  L+M
Sbjct: 1388 ADQTARVW--PIRWLMQNRGRGLAM 1410



 Score = 37.4 bits (85), Expect = 9.7,   Method: Composition-based stats.
 Identities = 42/188 (22%), Positives = 74/188 (39%), Gaps = 20/188 (10%)

Query: 58   DVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKP 117
            D   G++  + TG H G +   AF+  GR +     +  A    +  G VI +    + P
Sbjct: 978  DAANGQVITQLTG-HQGPVFSAAFSPDGRRVVTASDDKTARVWDAATGHVITQLTGHQGP 1036

Query: 118  ISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGY 175
            +SS+AF  +      AS +   R+     G+ + +     GP+     S   + ++TA  
Sbjct: 1037 VSSAAFTPDGLRVVTASDDKTARVWDAATGQMIAQLIGHEGPVNVAVFSLDGQRVLTASR 1096

Query: 176  GEKHLQVWRCDISSKTVNKGPA---LSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYS 232
                      D +++  + G     LS    PV     ++  G DG  ++  +    A  
Sbjct: 1097 ----------DGTARAWDAGQGILLLSGHQEPVV----SAAFGPDGQRVVTASRDRTARV 1142

Query: 233  WDLKTVSQ 240
            WD+ T  Q
Sbjct: 1143 WDVATGRQ 1150


>gi|302851563|ref|XP_002957305.1| hypothetical protein VOLCADRAFT_98340 [Volvox carteri f. nagariensis]
 gi|300257400|gb|EFJ41649.1| hypothetical protein VOLCADRAFT_98340 [Volvox carteri f. nagariensis]
          Length = 2174

 Score = 40.4 bits (93), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 33/118 (27%), Positives = 54/118 (45%), Gaps = 2/118 (1%)

Query: 58   DVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKP 117
            D+ TG+M  K    H  G+    F   G     V  +  A+  + + GEVI+ F   E  
Sbjct: 1138 DLNTGKMLRKYDEIHNEGIYDAWFTPDGLRCVSVSCDKTAAIFELQSGEVIQRFVGHEHW 1197

Query: 118  ISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITA 173
            +  +A   +EKI A AS++  VRI ++  G  +    D  G + Y++      I+ +A
Sbjct: 1198 VRFAALSPDEKILATASNDNTVRIWNVAKGNVLHIIRDHRGTMSYINFVHPTWIVTSA 1255


>gi|196009674|ref|XP_002114702.1| hypothetical protein TRIADDRAFT_58457 [Trichoplax adhaerens]
 gi|190582764|gb|EDV22836.1| hypothetical protein TRIADDRAFT_58457 [Trichoplax adhaerens]
          Length = 1089

 Score = 40.4 bits (93), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 27/101 (26%), Positives = 47/101 (46%), Gaps = 4/101 (3%)

Query: 117 PISSSAFLCEEKIFALA---SSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITA 173
           PI++SA  C  K  A++   S+ V++ +L N ++V  +       Q V  +   K II+A
Sbjct: 785 PITASALSCNSKYLAVSVRGSNAVQVWNLSNQKQVAFYRGHTDYTQAVHFAPNGKTIISA 844

Query: 174 GYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPN 214
              +K ++ W C I  K      +  M+  P   +C+   N
Sbjct: 845 S-DDKTVKFWSCQIDLKNTVHVRSKRMQFKPTPFECRFVDN 884


>gi|345782155|ref|XP_532920.3| PREDICTED: WD repeat-containing protein 43 [Canis lupus familiaris]
          Length = 681

 Score = 40.4 bits (93), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 43/205 (20%), Positives = 83/205 (40%), Gaps = 14/205 (6%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWK-STGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
           LLALGT+ G IL    + GE+  K  +G H   +  + + +    L+    +    E  +
Sbjct: 96  LLALGTAVGSILLYSTVKGELHSKLISGGHDNKVNCIQWHQDNGCLYSCSDDKYIVEWNT 155

Query: 103 EMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVS 162
           +  +V  ++K     +SS     + K+   A   +++  LE  E    F+    P+  ++
Sbjct: 156 QTCKVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLETKEIYRHFTGHATPVSSLT 215

Query: 163 AS-----------DGAK--IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDC 209
            +           DG      ++    ++ L VW+    +K  N   + ++   PV ID 
Sbjct: 216 FTTIRPPNESQPFDGITGLYFLSGAVHDRLLNVWQVRSENKEKNAVMSFTVTDEPVYIDL 275

Query: 210 KNSPNGEDGTVILAVAESGVAYSWD 234
             S N E+   +  V   G  + ++
Sbjct: 276 ALSENKEEPVKLAVVCRDGEVHLFE 300


>gi|350421517|ref|XP_003492868.1| PREDICTED: WD repeat-containing protein 43-like [Bombus impatiens]
          Length = 629

 Score = 40.4 bits (93), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 53/253 (20%), Positives = 111/253 (43%), Gaps = 41/253 (16%)

Query: 34  KKRRK--------ERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKK- 84
           KKRRK        ++G  ++A+G++NG +   D  T  +  +  G H G +  + +++  
Sbjct: 70  KKRRKKSISEESNQKG--IVAMGSTNGRVTLYDTTTSLVTAQLNG-HSGTVTAVTWSENV 126

Query: 85  -----GRSLHVVG----TNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASS 135
                    H++      NG+  + KS  G+ +    + +      + L  E++      
Sbjct: 127 GLFSAASDHHIIQWNLQENGVRCKWKSGKGKTVSLAVSPD----GKSLLSGERV------ 176

Query: 136 EVRILSLENGEEVLKFS---DDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTV 192
            ++  +L+  + V  F+   ++V  L  V  + G   +I+   G+ +L VW  D      
Sbjct: 177 -IKWWNLDTKQLVRTFTGHANEVSCLHTVQMTSGNNYVISGACGDGYLNVWALDEQRNER 235

Query: 193 NKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQDEKTNPAKITVK 252
               +L+++  P+++    S N +   ++L V  SG A+   L     + ++ P K ++ 
Sbjct: 236 TAVASLALQDDPISVSTHISENFQ--VIVLVVTRSGQAH---LFQYQPNGRSKPLKPSLN 290

Query: 253 LKKA-DADQQNSV 264
           +  A D  Q++SV
Sbjct: 291 IAVASDISQKDSV 303


>gi|166365791|ref|YP_001658064.1| hypothetical protein MAE_30500 [Microcystis aeruginosa NIES-843]
 gi|166088164|dbj|BAG02872.1| hypothetical protein MAE_30500 [Microcystis aeruginosa NIES-843]
          Length = 438

 Score = 40.4 bits (93), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 32/106 (30%), Positives = 57/106 (53%), Gaps = 6/106 (5%)

Query: 79  LAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALAS-SEV 137
           LAF+  G+ L   G++        + GE+I+ ++A E+ I S A      + A AS +E+
Sbjct: 264 LAFSNDGQYLISGGSDKTIKIWDIKTGEIIKSWQAHEQAIISIAINPHRHLIASASRTEI 323

Query: 138 RILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVW 183
           +I   + G E++K      PL++  + DG   +IT  YG+K +++W
Sbjct: 324 KIWQGQTG-ELIKILRGTAPLKF--SPDG-HFLITGSYGDK-VKIW 364


>gi|148745071|gb|AAI42562.1| LOC100101293 protein [Xenopus laevis]
          Length = 655

 Score = 40.4 bits (93), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 54/254 (21%), Positives = 103/254 (40%), Gaps = 28/254 (11%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACG--------FVGKKRRKERGTL-----LLAL 47
           +W T  G L  E+  P      + +CLA             KKR+ E         LL +
Sbjct: 33  VWDTQGGGLRKEYV-PSAHLSATCTCLAWAQGRADKETHQKKKRKSEAADRDGQYDLLTI 91

Query: 48  GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
           GT+ G IL   ++ GE++ K  G H   +  + +  +  SL+    +    E  ++  +V
Sbjct: 92  GTATGTILLYSIVKGELQSKLVGGHDSRVNCVRWHHENGSLYSCSEDKHIIEWNTQTCKV 151

Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKF---SDDVGPLQYV--- 161
             ++K     +SS     + K+   A   +++  LE  E   +F   S  V  L ++   
Sbjct: 152 KCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWDLETKEVYRQFTGHSTAVTSLIFLTVQ 211

Query: 162 ------SASDGAKI-IITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPN 214
                 S  D A +  +++   ++ + VW+   S+K  +   + ++   PV +D   + +
Sbjct: 212 PPRESRSIQDTAGLYFLSSAVHDRLVSVWQVR-SAKDKSSVLSFTLTEPPVFMDLSTTES 270

Query: 215 GEDGTVILAVAESG 228
            E+   +  V   G
Sbjct: 271 KEEPLKLAVVCRDG 284


>gi|158313622|ref|YP_001506130.1| WD-40 repeat-containing protein [Frankia sp. EAN1pec]
 gi|158109027|gb|ABW11224.1| WD-40 repeat protein [Frankia sp. EAN1pec]
          Length = 1427

 Score = 40.4 bits (93), Expect = 1.3,   Method: Composition-based stats.
 Identities = 48/183 (26%), Positives = 71/183 (38%), Gaps = 13/183 (7%)

Query: 5    NDGSLLAEWKQPDGEPVVSYSCLACGFVGKKRR---KERGTLLLALGTSNGDILAVDVLT 61
            +DGSL   W    G P      LA    G + R      G  ++A G  +G I   D LT
Sbjct: 959  DDGSLRT-WDATTGTPR-----LAVPVTGGRLRCCATGPGGAVVATGGEDGTIRLHDPLT 1012

Query: 62   GEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSS 121
            GE+  +  G H G +  LAF   G  L   G +G      +  G          +P+ + 
Sbjct: 1013 GEILRRLAG-HAGPVLALAFGPDGSWLVSAGEDGTLRRWDTAAGRQTGVLSDGSRPVRAC 1071

Query: 122  AFLCE-EKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHL 180
            A   +   + A A   + +  L  G +  + +   G    V A DG+ I     YG   +
Sbjct: 1072 AVAPDGSYLVAPAGDAISVRDLPTGGQRAELTGATGTRACVVAPDGSWIASAGRYGT--I 1129

Query: 181  QVW 183
            +VW
Sbjct: 1130 RVW 1132


>gi|332663664|ref|YP_004446452.1| metallophosphoesterase [Haliscomenobacter hydrossis DSM 1100]
 gi|332332478|gb|AEE49579.1| metallophosphoesterase [Haliscomenobacter hydrossis DSM 1100]
          Length = 628

 Score = 40.4 bits (93), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 34/124 (27%), Positives = 65/124 (52%), Gaps = 7/124 (5%)

Query: 50  SNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIR 109
           S G ++A+DV TG+ KW  T R  G +       K R + +  T+ +   +  + G+++ 
Sbjct: 326 SAGQLVALDVKTGKQKWTYTTR--GKIFSTPALGKDRVV-LASTDSVVYCLALQTGKLLW 382

Query: 110 EFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGA 167
           + K + K I +S  +  E I+ L SSE   R L+L++G+E+  F+   G ++ +  +D  
Sbjct: 383 KLK-TNKSIVASPLIDGETIY-LGSSEGKFRALTLKDGKELWSFNGVAGFVESIPCADAE 440

Query: 168 KIII 171
           ++  
Sbjct: 441 RVYF 444


>gi|425470863|ref|ZP_18849723.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
 gi|389883367|emb|CCI36241.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9701]
          Length = 1108

 Score = 40.0 bits (92), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 50/200 (25%), Positives = 82/200 (41%), Gaps = 26/200 (13%)

Query: 44   LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
            ++A+   +G I  +D     ++  +T      +  +AF      + + G NG   ++ S+
Sbjct: 842  IIAIANKDGQITLLDSQGKNIREFATKMR--SIYSIAFHPDDNQIAITGRNGKV-QIWSQ 898

Query: 104  MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFS---DDVGPL 158
             G +++EF AS+ PI S AF  E       +SE  V+   L N    L  S   DD    
Sbjct: 899  KGTMLQEFTASQAPIYSLAFNGEGTAIITGTSEGKVQYWHLSNHRTKLINSWTVDDSIIY 958

Query: 159  QYVSASDGAKIIITAGYGEKHLQVWRCD---ISSKTVNKGPALSMRHSPVAIDCKNSPNG 215
              V + D  KI  TA  G+  +++W      +     +  P   +  SP           
Sbjct: 959  DLVFSPDHQKIA-TATRGK--IKIWDLQGNLLKEIKTDSFPVYGVSFSP----------- 1004

Query: 216  EDGTVILAVAESGVAYSWDL 235
             DG  I A++  G A  WD+
Sbjct: 1005 -DGEKIAAISRDGTARRWDM 1023


>gi|148270092|ref|YP_001244552.1| WD-40 repeat-containing protein [Thermotoga petrophila RKU-1]
 gi|147735636|gb|ABQ46976.1| WD-40 repeat protein [Thermotoga petrophila RKU-1]
          Length = 580

 Score = 40.0 bits (92), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 52/221 (23%), Positives = 95/221 (42%), Gaps = 15/221 (6%)

Query: 13  WKQPDGEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRH 72
           W  PDG+ + S   L    V     + R      +G +NG  L  D+            H
Sbjct: 131 WSFPDGKEI-SSQRLGPSVVKIAAHENR----YVMGLANGLALISDISNPSFSIPLKA-H 184

Query: 73  PGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEK-IFA 131
           P G+  + F++ G+ +   G N +     S  G ++ ++      ++  AFL  +  IF 
Sbjct: 185 PEGIKKIVFSENGQLIATCGGNTVKVWNASN-GRLVLQYDHV-ITVNDVAFLDNDNLIFV 242

Query: 132 LASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYG-EKHLQVWRCDISSK 190
               +  +L+++  E V         + +VS+ DG+     A YG +K ++VW  D+  +
Sbjct: 243 TDDYKAVVLNIQKNEVVKSIDAHNNFVLFVSSDDGS----IATYGMDKTVKVWNADLELQ 298

Query: 191 TVNKGPALSMRHSPVAIDCKNSPNG-EDGTVILAVAESGVA 230
               G  LS+    ++ D K   +G +D  +++  AE G+A
Sbjct: 299 YSLYGHQLSVNTVALSSDGKFIVSGSDDREILIWNAEKGIA 339


>gi|149175385|ref|ZP_01854006.1| vegetatible incompatibility protein HET-E1 [Planctomyces maris DSM
            8797]
 gi|148845653|gb|EDL59995.1| vegetatible incompatibility protein HET-E1 [Planctomyces maris DSM
            8797]
          Length = 1097

 Score = 40.0 bits (92), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 46/216 (21%), Positives = 92/216 (42%), Gaps = 23/216 (10%)

Query: 44   LLALG----TSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASE 99
            LLA+G    + +G+I   D  TG++  K    H   + G+ F++ G+SL     +     
Sbjct: 885  LLAVGGGDPSRSGEITIWDAATGKLVKKLGDAHSDTVLGIRFSRSGKSLLTGAADKFVKI 944

Query: 100  MKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEE---VLKFSDD 154
                 G  ++ F+     +   A+  +E     + ++  +++ ++E GE+   +  +S  
Sbjct: 945  FDVASGNFVKSFEGHTHHVLDVAWKADESTIVSSGADNVIKVWNIETGEQKRTISGYSKQ 1004

Query: 155  VGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPN 214
            V  + ++   D     I +G G+K +   R  +++   N       R+   + D   S  
Sbjct: 1005 VTSIAFLGLGDN----IVSGGGDKTV---RMHLTTNGSN------YRNLSGSTDYVYSVA 1051

Query: 215  G-EDGTVILAVAESGVAYSWDLKTVSQDEKTNPAKI 249
            G  D ++++A  E G+   WD KT       NP  +
Sbjct: 1052 GSRDESIVIAGGEDGILRVWDAKTGKLLNSFNPPPV 1087


>gi|410955542|ref|XP_003984410.1| PREDICTED: WD repeat-containing protein 43 [Felis catus]
          Length = 680

 Score = 40.0 bits (92), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 43/205 (20%), Positives = 83/205 (40%), Gaps = 14/205 (6%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWK-STGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
           LLALGT+ G IL    + GE+  K  +G H   +  + + +    L+    +    E  +
Sbjct: 96  LLALGTAVGSILLYSTVKGELHSKLISGGHDNKVNCIQWHQDNGCLYSCSDDKHIVEWNT 155

Query: 103 EMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVS 162
           +  +V  ++K     +SS     + K+   A   +++  LE  E    F+    P+  + 
Sbjct: 156 QTCKVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLETKEIYRHFTGHATPVSSLV 215

Query: 163 AS-----------DGAK--IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDC 209
            +           DG      ++    ++ L VW+    +K  N   + ++   PV ID 
Sbjct: 216 FTTVRPPSESQPFDGVTGLYFLSGAVHDRLLNVWQVRSENKEKNAVMSFTVTDEPVYIDL 275

Query: 210 KNSPNGEDGTVILAVAESGVAYSWD 234
             S N E+   +  V + G  + ++
Sbjct: 276 TLSENKEEPVKLAVVCKDGQVHLFE 300


>gi|307152332|ref|YP_003887716.1| WD40 repeat-containing protein [Cyanothece sp. PCC 7822]
 gi|306982560|gb|ADN14441.1| WD40 repeat, subgroup [Cyanothece sp. PCC 7822]
          Length = 380

 Score = 40.0 bits (92), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 47/202 (23%), Positives = 86/202 (42%), Gaps = 18/202 (8%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVS--YSCLACGFVGKKRRKER----GTLLLALGTS-NGD 53
           IW  + G ++A    PDGE +VS     +    +  +R   R    G  + AL  S +G 
Sbjct: 162 IWLEHQGQVMALRVTPDGEILVSGGLDGIRIWTLNPRRPLYRLTGLGHPVYALAISPDGV 221

Query: 54  ILAVDVLTGEMK-WK--------STGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEM 104
           ILA   L GE+K W         +   H   + GL F   G+ L     +          
Sbjct: 222 ILASGSLDGEVKFWNIKEGKLLSTFYPHQATITGLVFTPDGKKLITSSQDKTIKVWDLAT 281

Query: 105 GEVIREFKASEKPISSSAFLCEEKIFALASSE-VRILSLENGEEVLKFSDDVGPLQYVSA 163
           G++I         I + A   + KI A   ++ +RI ++E GE+  +  ++   +Q ++ 
Sbjct: 282 GQLIYTLAGHTGRIRAIALNPDGKILASGGNDGIRIWNIETGEQYNQIIENYDWVQSLAF 341

Query: 164 SDGAKIIITAGYGEKHLQVWRC 185
           S   + + +  + +  +++W+C
Sbjct: 342 SPDGQFLASGSF-DFQVKIWQC 362


>gi|334119173|ref|ZP_08493260.1| WD40 repeat-containing protein [Microcoleus vaginatus FGP-2]
 gi|333458644|gb|EGK87261.1| WD40 repeat-containing protein [Microcoleus vaginatus FGP-2]
          Length = 547

 Score = 40.0 bits (92), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 37/143 (25%), Positives = 68/143 (47%), Gaps = 6/143 (4%)

Query: 45  LALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEM 104
           LA G+ +G I   ++ +G+     +G++   L+ LAF   G+SL    +NG      +  
Sbjct: 406 LASGSKDGSIRLWNLASGQAIRTISGKNLSVLS-LAFTPDGKSLAAGNSNGTIGLWNAGN 464

Query: 105 GEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVS 162
           G++IR        + S AF  +       S +  VR+  + +G+     S   G +  V+
Sbjct: 465 GQLIRRLSGHTDGVWSVAFSRDGTTLVTGSWDKSVRLWDVRSGDLRGNLSGHAGYVSAVA 524

Query: 163 ASDGAKIIITAGY-GEKHLQVWR 184
            S   K I++AG+ GE  +++W+
Sbjct: 525 ISSDGKTIVSAGWLGE--IKIWK 545


>gi|428215135|ref|YP_007088279.1| WD40 repeat-containing protein [Oscillatoria acuminata PCC 6304]
 gi|428003516|gb|AFY84359.1| WD40 repeat-containing protein [Oscillatoria acuminata PCC 6304]
          Length = 306

 Score = 40.0 bits (92), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 34/143 (23%), Positives = 64/143 (44%), Gaps = 4/143 (2%)

Query: 45  LALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEM 104
           L  G+ +  I   +V  GE  +++   H   +  + F+  G  L     +      K++ 
Sbjct: 159 LVSGSWDRTIKLWNVAIGE-SYRTIQAHSNPIESVKFSPDGEMLASSSLDSTVKLWKTQT 217

Query: 105 GEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVS 162
           GE+I         I S AF  + +  A ASS+  ++I ++E GEE+    D    +  ++
Sbjct: 218 GELIHTLTGHTDGIRSVAFSPDGRYLASASSDKTIKIWAVETGEELATLGDHSSYVFAIA 277

Query: 163 ASDGAKIIITAGYGEKHLQVWRC 185
            S   + + T G  +K +++WR 
Sbjct: 278 FSPDGQTLATGG-DDKTIKLWRA 299


>gi|310817435|ref|YP_003949793.1| wd domain g-beta repeat/pbs lyase heat-like repeat protein
           [Stigmatella aurantiaca DW4/3-1]
 gi|309390507|gb|ADO67966.1| WD domain G-beta repeat/PBS lyase HEAT-like repeat protein
           [Stigmatella aurantiaca DW4/3-1]
          Length = 2185

 Score = 40.0 bits (92), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 42/156 (26%), Positives = 64/156 (41%), Gaps = 18/156 (11%)

Query: 62  GEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSS 121
           G  + +  G HPGG   LA    G+ L+ VG +G         G  ++E++AS +P+ + 
Sbjct: 82  GSQELELAGAHPGGCTALALGLNGQRLYSVGMDGALRAWALASGSKLQEWQASAQPLRAV 141

Query: 122 AFLCEE------------KIFALASSEVRILSLENGE-EVLKFSDDVGPLQYVSASDGAK 168
           A                 + F LAS   R +   +G    L F+   G  + VSA D  +
Sbjct: 142 AVAPNNATVACAGDDAVVRSFTLASRAQRDMPGHDGPVRALAFTPRDG--RLVSAGDDGR 199

Query: 169 IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSP 204
           I I    G    +  R D  S   + GP L++   P
Sbjct: 200 IRIWYLAGAVEFET-RGDKDSG--HAGPVLALLFPP 232


>gi|157413501|ref|YP_001484367.1| WD-40 repeat-containing G-protein [Prochlorococcus marinus str. MIT
           9215]
 gi|157388076|gb|ABV50781.1| G-protein beta WD-40 repeats [Prochlorococcus marinus str. MIT
           9215]
          Length = 357

 Score = 40.0 bits (92), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 46/188 (24%), Positives = 77/188 (40%), Gaps = 17/188 (9%)

Query: 22  VSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAF 81
           VS   +ACG+  K ++         +G   G I A +   G++ W+    H GGL  +A 
Sbjct: 21  VSDYAIACGWALKGKQ-------FIVGDVAGGIFAFEGDNGKIIWEKENTHSGGLLAMAI 73

Query: 82  AKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILS 141
             +G      G +G         G+VI+     +  +    +  +    A+ASS+   + 
Sbjct: 74  HPEGEIFATSGQDGNVQICNCHEGKVIKTLDLGKGWVEHLKWSNDGLFLAIASSKKVYVF 133

Query: 142 LENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVN-----KGP 196
            E GEE     D    +  ++ S+  + + TA YG    +V   DI +   N     +G 
Sbjct: 134 NEVGEEKWTSEDHPSTVSAITWSNKNE-LATACYG----RVTFFDIVNNKTNQKLEWQGS 188

Query: 197 ALSMRHSP 204
            +SM  SP
Sbjct: 189 LVSMELSP 196


>gi|124022824|ref|YP_001017131.1| hypothetical protein P9303_11171 [Prochlorococcus marinus str. MIT
           9303]
 gi|123963110|gb|ABM77866.1| G-protein beta WD-40 repeats [Prochlorococcus marinus str. MIT
           9303]
          Length = 357

 Score = 40.0 bits (92), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 53/229 (23%), Positives = 91/229 (39%), Gaps = 30/229 (13%)

Query: 26  CLACGFVGKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKG 85
            +ACG+V        G  LL    + G  L  + LTG++ WK    H  GL  +A     
Sbjct: 25  AMACGWV------HNGEALLVADVTGGLYL-FNGLTGQINWKINDIHQAGLLSVAIHPSK 77

Query: 86  RSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENG 145
                 G NG  +   SE  +V++  +     +    +  + +  A++ S    +   +G
Sbjct: 78  AVFATAGQNGCLTIWNSEKYDVVKSIELGGGWVEHLHWSPDGQYLAVSMSRYAYVFDIDG 137

Query: 146 EEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVN-----KGPALSM 200
           EE  + S+    +  ++ S  ++ + TA YG    QV   D++   +N     +G  +SM
Sbjct: 138 EEKWRSSEHTSTVSAIAWSSSSE-LATASYG----QVTFFDVTQGAINQKLEWQGSLVSM 192

Query: 201 RHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQDEKTN-PAK 248
             SP            DG V+   ++    + W   T +  E T  P K
Sbjct: 193 VLSP------------DGDVVACGSQDNSVHFWRRSTGNDAEMTGYPGK 229


>gi|452847062|gb|EME48994.1| hypothetical protein DOTSEDRAFT_67892 [Dothistroma septosporum
           NZE10]
          Length = 446

 Score = 40.0 bits (92), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 30/111 (27%), Positives = 47/111 (42%), Gaps = 3/111 (2%)

Query: 63  EMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSA 122
           +  W+ T RH G    LA++  GR L+  GT+G+     SE G V  +      P SS  
Sbjct: 77  DTAWR-TKRHKGSCRCLAYSVDGRQLYSAGTDGIVKAADSETGRVTGKIAIPVDPSSSRD 135

Query: 123 FLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITA 173
            +    +    S +  +LS ++G   L   D   P   V  SD     +++
Sbjct: 136 GVDAPTVVHALSPQTLLLSTDSG--ALHLYDLRDPAPSVPVSDSKTAFVSS 184


>gi|428212952|ref|YP_007086096.1| WD40 repeat-containing protein [Oscillatoria acuminata PCC 6304]
 gi|428001333|gb|AFY82176.1| WD40 repeat-containing protein [Oscillatoria acuminata PCC 6304]
          Length = 795

 Score = 40.0 bits (92), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 29/121 (23%), Positives = 59/121 (48%), Gaps = 9/121 (7%)

Query: 71  RHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIF 130
           R+   +  + F   G+ +    T G  +    E GE +R F A +  I + A   +++I 
Sbjct: 595 RNSEAVYAVIFLNNGQQIIASDTRGSVAFWHRETGEELRRFNAHQGMIRALAISPDDRIL 654

Query: 131 ALASSE--VRILSLENGEEVLKF---SDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRC 185
           A AS E  +++  L+ G+E+  F   +D V  + +  + DG   ++ +G  +  L++W+ 
Sbjct: 655 ATASDEGIIKLWQLQTGQEICVFKTHNDAVNAIAF--SPDGQ--LLASGSTDMTLKLWQV 710

Query: 186 D 186
           +
Sbjct: 711 N 711


>gi|425437527|ref|ZP_18817942.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9432]
 gi|389677472|emb|CCH93583.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9432]
          Length = 1108

 Score = 40.0 bits (92), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 50/200 (25%), Positives = 83/200 (41%), Gaps = 26/200 (13%)

Query: 44   LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
            ++A+   +G I  +D    +++  +T      +  +AF      + + G NG   ++ S+
Sbjct: 842  IIAIANKDGQITLLDSQGKKIREFTTKMR--SIYSIAFHPDSNQIAITGRNGKV-QIWSQ 898

Query: 104  MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFS---DDVGPL 158
             G +++EF AS+ PI S AF  E       +SE  V+   L N    L  S   DD    
Sbjct: 899  KGTMLQEFTASQVPIYSLAFNGEGTAIITGTSEGKVQYWHLSNYRPQLINSWTADDNIIY 958

Query: 159  QYVSASDGAKIIITAGYGEKHLQVWRCD---ISSKTVNKGPALSMRHSPVAIDCKNSPNG 215
              V + D  KI  TA  G+  +++W      +     +  P   +  SP           
Sbjct: 959  DLVFSPDHQKIA-TATRGK--IKIWDLQGNLLKEIKTDSFPVYGVSFSP----------- 1004

Query: 216  EDGTVILAVAESGVAYSWDL 235
             DG  I A++  G A  WD+
Sbjct: 1005 -DGEKIAAISRDGTARRWDI 1023


>gi|256422452|ref|YP_003123105.1| hypothetical protein Cpin_3437 [Chitinophaga pinensis DSM 2588]
 gi|256037360|gb|ACU60904.1| WD-40 repeat protein [Chitinophaga pinensis DSM 2588]
          Length = 1160

 Score = 40.0 bits (92), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 45/193 (23%), Positives = 76/193 (39%), Gaps = 18/193 (9%)

Query: 45  LALGTSNGDILAVDVLTGEM--KWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
           LA+ T+ GD+   DV TG++   W   G  P     + F +  + L +      A  + +
Sbjct: 250 LAVVTTEGDVQLWDVYTGKLIADWPDVGLRP---TSIEFVQHDKQLLLSTKFEAAILLDA 306

Query: 103 EMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVS 162
             G++++ F A     S      +EK   +A ++  I ++ +          V    Y S
Sbjct: 307 RSGKILQSFDADLVRFSPD----QEKYAVVAQAQATIRNVNDSTSYTSLRGHVSWSVYAS 362

Query: 163 ASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVIL 222
            SD A   I   YG+  L+VW        +   P L +  +   I    +    DG  I 
Sbjct: 363 -SDTANKHIAVCYGDGSLRVWSL------ITGMPELLLADTAAGI--TRTVISPDGRTIA 413

Query: 223 AVAESGVAYSWDL 235
             ++ G   +WDL
Sbjct: 414 GSSQYGTISAWDL 426


>gi|301776542|ref|XP_002923691.1| PREDICTED: WD repeat-containing protein 43-like [Ailuropoda
           melanoleuca]
          Length = 679

 Score = 40.0 bits (92), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 43/205 (20%), Positives = 81/205 (39%), Gaps = 14/205 (6%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWK-STGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
           LLALGT+ G IL    + GE+  K  +G H   +  + + +    L+    +    E  +
Sbjct: 96  LLALGTAVGSILLYSTVKGELHSKLISGGHDNKVNCIQWHQDNGCLYSCSDDKHIVEWNT 155

Query: 103 EMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYV- 161
           +  +V  ++K     +SS     + K+   A   +++  LE  E    F+    P+  + 
Sbjct: 156 QTCKVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLETKEIYRHFTGHATPVSSLI 215

Query: 162 ----------SASDGAK--IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDC 209
                        DG      ++    ++ L VW+    +K  N   + ++   PV ID 
Sbjct: 216 FTTIRPPNESQPFDGITGLYFLSGAVHDRLLNVWQVRAENKEKNAVMSFTVTDEPVYIDL 275

Query: 210 KNSPNGEDGTVILAVAESGVAYSWD 234
             S N E+   +  V   G  + ++
Sbjct: 276 TLSENKEEPVKLAVVCRDGQVHLFE 300


>gi|115379662|ref|ZP_01466743.1| conserved hypothetical protein [Stigmatella aurantiaca DW4/3-1]
 gi|115363327|gb|EAU62481.1| conserved hypothetical protein [Stigmatella aurantiaca DW4/3-1]
          Length = 1558

 Score = 40.0 bits (92), Expect = 1.7,   Method: Composition-based stats.
 Identities = 43/156 (27%), Positives = 64/156 (41%), Gaps = 18/156 (11%)

Query: 62  GEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSS 121
           G  + +  G HPGG   LA    G+ L+ VG +G         G  ++E++AS +P+ + 
Sbjct: 82  GSQELELAGAHPGGCTALALGLNGQRLYSVGMDGALRAWALASGSKLQEWQASAQPLRAV 141

Query: 122 AFLCEE------------KIFALASSEVRILSLENGE-EVLKFSDDVGPLQYVSASDGAK 168
           A                 + F LAS   R +   +G    L F+   G L  VSA D  +
Sbjct: 142 AVAPNNATVACAGDDAVVRSFTLASRAQRDMPXHDGPVRALAFTPRDGRL--VSAGDDGR 199

Query: 169 IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSP 204
           I I    G    +  R D  S   + GP L++   P
Sbjct: 200 IRIWYLAGAVEFET-RGDKDSG--HAGPVLALLFPP 232


>gi|425450672|ref|ZP_18830496.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
           7941]
 gi|389768356|emb|CCI06492.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
           7941]
          Length = 559

 Score = 40.0 bits (92), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 32/106 (30%), Positives = 57/106 (53%), Gaps = 6/106 (5%)

Query: 79  LAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALAS-SEV 137
           LAF+  G+ L   G++        + GE+I+ ++A E+ I S A      + A AS +E+
Sbjct: 385 LAFSNDGQYLISGGSDKTIKIWDIKTGEIIKSWQAHEQAIISIAINPHRHLIASASRTEI 444

Query: 138 RILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVW 183
           +I   + G E++K      PL++  + DG + +IT  YG K +++W
Sbjct: 445 KIWQGQTG-ELIKVLRGTAPLKF--SPDG-QFLITGSYGHK-VKIW 485


>gi|425439391|ref|ZP_18819716.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
           9717]
 gi|389720381|emb|CCH95904.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
           9717]
          Length = 559

 Score = 40.0 bits (92), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 32/106 (30%), Positives = 58/106 (54%), Gaps = 6/106 (5%)

Query: 79  LAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALAS-SEV 137
           LAF+  G+ L   G++        + GE+I+ ++A E+ I S A      + A AS +E+
Sbjct: 385 LAFSNDGQYLISGGSDKTIKIWDIKTGEIIKSWQAHEQAIISIAINPHRHLIASASRTEI 444

Query: 138 RILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVW 183
           +I   + G E++K      PL++  + DG + +IT  YG+K +++W
Sbjct: 445 KIWQGQTG-ELIKVLRGTAPLKF--SPDG-QFLITCSYGDK-VKIW 485


>gi|443652967|ref|ZP_21130939.1| WD domain, G-beta repeat family protein [Microcystis aeruginosa
           DIANCHI905]
 gi|159028403|emb|CAO89845.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
 gi|443334193|gb|ELS48718.1| WD domain, G-beta repeat family protein [Microcystis aeruginosa
           DIANCHI905]
          Length = 1201

 Score = 40.0 bits (92), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 38/147 (25%), Positives = 71/147 (48%), Gaps = 7/147 (4%)

Query: 45  LALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMAS--EMKS 102
           LA G+ +G I   DV TG+ + ++   H G +  ++F+  G++L      G      +  
Sbjct: 669 LATGSDDGTIKLWDVETGQ-EIRTLSGHNGKVNSVSFSSDGKTLAFDSDGGTIKLWYIDI 727

Query: 103 EMGEVIREFKASEKP-ISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQ 159
           E G+ IR      +  + S +F  + K  A  S++  +++ ++E GEE+   S   G + 
Sbjct: 728 ETGKEIRTLSEWNRGCVYSVSFSNDGKTLATGSADKTIKLWNVETGEEIRTLSGHNGKVN 787

Query: 160 YVSASDGAKIIITAGYGEKHLQVWRCD 186
            VS S   K + T G  +K +++W  +
Sbjct: 788 SVSFSSDGKTLAT-GSADKTIKLWNVE 813


>gi|425466442|ref|ZP_18845740.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
           9809]
 gi|389831075|emb|CCI26486.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
           9809]
          Length = 559

 Score = 39.7 bits (91), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 32/106 (30%), Positives = 57/106 (53%), Gaps = 6/106 (5%)

Query: 79  LAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALAS-SEV 137
           LAF+  G+ L   G++        + GE+I+ ++A E+ I S A      + A AS +E+
Sbjct: 385 LAFSNDGQYLISGGSDKTIKIWDIKTGEIIKSWQAHEQAIISLAINPHRHLIASASRTEI 444

Query: 138 RILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVW 183
           +I   + G E++K      PL++  + DG   +IT  YG+K +++W
Sbjct: 445 KIWQGQTG-ELIKILRGTAPLKF--SPDG-HFLITGSYGDK-VKIW 485


>gi|432096779|gb|ELK27357.1| WD repeat-containing protein 43 [Myotis davidii]
          Length = 678

 Score = 39.7 bits (91), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 42/205 (20%), Positives = 82/205 (40%), Gaps = 14/205 (6%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWK-STGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
           LLALGT+ G IL    + GE+  K  +G H   +  + + +    L+    +    E  +
Sbjct: 97  LLALGTAVGSILLYSTVKGELHSKLVSGGHDNKVNCIQWHQDNGCLYSCSDDKHIVEWNT 156

Query: 103 EMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVS 162
           +  +V  ++K     +SS     + K+   A   +++  LE  E    F+    P+  + 
Sbjct: 157 QTCKVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLETKELYRHFTGHATPVSSLM 216

Query: 163 AS-----------DGAK--IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDC 209
            +           DG      ++    ++ L VW+    +K  N   + ++   P+ ID 
Sbjct: 217 FTTIRPPNESQPFDGISGLYFLSGAVHDRLLNVWQVRSENKQKNAVMSFTVTDEPIYIDL 276

Query: 210 KNSPNGEDGTVILAVAESGVAYSWD 234
             S N E+   +  V   G  + ++
Sbjct: 277 TLSENKEEPVKLAVVCRDGQVHLFE 301


>gi|395828782|ref|XP_003787544.1| PREDICTED: WD repeat-containing protein 43 [Otolemur garnettii]
          Length = 680

 Score = 39.7 bits (91), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 44/205 (21%), Positives = 83/205 (40%), Gaps = 14/205 (6%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWK-STGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
           LLALGT+ G IL    + GE+  K  +G H   +  L + +    L+    +    E   
Sbjct: 96  LLALGTAVGSILLYSTVKGELHSKLISGGHDNRVNCLQWHQDSGCLYSCSDDKHIVEWNV 155

Query: 103 EMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGP---LQ 159
           +  +V  ++K     +SS     + K+   A   +++  LE  E    F+    P   L+
Sbjct: 156 QTCKVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLETKEVYRHFTGHATPVSSLR 215

Query: 160 YVS--------ASDGAK--IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDC 209
           + +          DG      ++    ++ + VW+    +K  N   + ++   PV ID 
Sbjct: 216 FTTIRPPNESQPCDGITGLYFLSGAVHDRLINVWQVRSENKEKNAVMSFTVTDEPVCIDL 275

Query: 210 KNSPNGEDGTVILAVAESGVAYSWD 234
             S N E+   +  V + G  + ++
Sbjct: 276 TLSENKEEPVKLAVVCKDGQVHLFE 300


>gi|114763302|ref|ZP_01442726.1| WD domain/cytochrome c family protein [Pelagibaca bermudensis
           HTCC2601]
 gi|114544100|gb|EAU47110.1| WD domain/cytochrome c family protein [Roseovarius sp. HTCC2601]
          Length = 436

 Score = 39.7 bits (91), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 30/111 (27%), Positives = 46/111 (41%), Gaps = 18/111 (16%)

Query: 138 RILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQV-----WRCDISSKTV 192
           R+L L+ GE++   + D  P+  + A  G    +  G GE  + +     WR     +  
Sbjct: 216 RVLDLDTGEQIADLTLDRRPILAL-AHHGPTNQLAVGDGEGFIMIVDTRDWRITRDFRAT 274

Query: 193 NKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQDEK 243
            KGP  ++  SP            DG VI A     VAY+W +  + Q E 
Sbjct: 275 RKGPVWALAFSP------------DGAVIHAGGLDDVAYAWPVAMLDQFEP 313


>gi|425460391|ref|ZP_18839872.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
           9808]
 gi|389826904|emb|CCI22215.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
           9808]
          Length = 559

 Score = 39.7 bits (91), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 32/106 (30%), Positives = 57/106 (53%), Gaps = 6/106 (5%)

Query: 79  LAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALAS-SEV 137
           LAF+  G+ L   G++        + GE+I+ ++A E+ I S A      + A AS +E+
Sbjct: 385 LAFSNDGQYLISGGSDKTIKIWDIKTGEIIKSWQAHEQAIISIAINPHRHLIASASRTEI 444

Query: 138 RILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVW 183
           +I   + G E++K      PL++  + DG + +IT  YG K +++W
Sbjct: 445 KIWQGQTG-ELIKVLRGTAPLKF--SPDG-QFLITGSYGHK-VKIW 485


>gi|440755325|ref|ZP_20934527.1| dnaJ domain protein [Microcystis aeruginosa TAIHU98]
 gi|440175531|gb|ELP54900.1| dnaJ domain protein [Microcystis aeruginosa TAIHU98]
          Length = 559

 Score = 39.7 bits (91), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 32/106 (30%), Positives = 57/106 (53%), Gaps = 6/106 (5%)

Query: 79  LAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALAS-SEV 137
           LAF+  G+ L   G++        + GE+I+ ++A E+ I S A      + A AS +E+
Sbjct: 385 LAFSNDGQYLISGGSDKTIKIWDIKTGEIIKSWQAHEQAIISIAINPHRHLIASASRTEI 444

Query: 138 RILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVW 183
           +I   + G E++K      PL++  + DG + +IT  YG K +++W
Sbjct: 445 KIWQGQTG-ELIKVLRGTAPLKF--SPDG-QFLITGSYGHK-VKIW 485


>gi|451980751|ref|ZP_21929137.1| exported hypothetical protein [Nitrospina gracilis 3/211]
 gi|451762087|emb|CCQ90376.1| exported hypothetical protein [Nitrospina gracilis 3/211]
          Length = 734

 Score = 39.7 bits (91), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 42/198 (21%), Positives = 81/198 (40%), Gaps = 20/198 (10%)

Query: 47  LGTSNGDILAV-DVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMG 105
           L T+ GD L + D  TG  + +   RH G L   AF+  G+S+   G +G      +  G
Sbjct: 202 LVTAQGDALKLWDASTG-TRLRVFSRHNGKLFAAAFSPDGKSIASAGQDGTVRLFSTATG 260

Query: 106 EVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVSA 163
           E++   K   + +++ AF  E      A ++  VR+    +G  +  F      +  VS 
Sbjct: 261 ELLYALKGHNEKVNAVAFSPEGAHLLSAGTDNTVRLWKTNDGTLLHTFEGHTKEVTSVSF 320

Query: 164 SDGAKIIITAGYGEKHLQVW---------------RCDISSKTVNKGPALSMRHSPVAID 208
           S   + +++ G  ++ +++W               + + + KT    P    +       
Sbjct: 321 SPDNRFVVS-GSADQTVRLWDLAPHIAPTVVAEAPKSETTPKTETSAPESGPKSQTPTPP 379

Query: 209 CKNSPNGEDGTVILAVAE 226
            + SP+ E  T+++   E
Sbjct: 380 VEESPDTEPPTIVITSHE 397



 Score = 38.1 bits (87), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 52/206 (25%), Positives = 92/206 (44%), Gaps = 27/206 (13%)

Query: 41  GTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEM 100
           G  +++ G  +G ++  D  TG+ +WK+   H G +  +A +  GRSL   G +      
Sbjct: 64  GKFIVSAG-GDGALILWDARTGD-RWKTLSGHNGAVNAIAISPDGRSLATGGADTRIKVW 121

Query: 101 KSEMGEVIREFKASEKPISSSAFLCE-EKIFALASSEVRIL-SLENGEEVLKFSD--DVG 156
             + G  +R        ++  AF  +  ++ +    E  IL  +  G+ +  F+D  D G
Sbjct: 122 DIQSGNEVRSVPGHFDEVTGVAFFPDGTRLISSGLGESVILWDIRTGQPLRVFADQNDSG 181

Query: 157 -------PLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDC 209
                  P++ V+AS   K ++TA  G+  L++W  D S+ T  +   +  RH+      
Sbjct: 182 SEFVALEPVRSVAASPSGKTLVTA-QGDA-LKLW--DASTGTRLR---VFSRHNGKLFAA 234

Query: 210 KNSPN-------GEDGTVILAVAESG 228
             SP+       G+DGTV L    +G
Sbjct: 235 AFSPDGKSIASAGQDGTVRLFSTATG 260


>gi|355728913|gb|AES09699.1| WD repeat domain 43 [Mustela putorius furo]
          Length = 678

 Score = 39.7 bits (91), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 42/205 (20%), Positives = 81/205 (39%), Gaps = 14/205 (6%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWK-STGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
           LLA+GT+ G IL    + GE+  K  +G H   +  + + +    L+    +    E  +
Sbjct: 93  LLAIGTAVGSILLYSTVKGELHSKLISGGHDNKVNCIQWHQDNGCLYSCSDDKYIVEWNT 152

Query: 103 EMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQY-- 160
           +  +V  ++K     +SS     + K+   A   +++  LE  E    F+    P+    
Sbjct: 153 QTCKVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLETKEIYRHFTGHATPVSTLI 212

Query: 161 ---------VSASDGAK--IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDC 209
                    +   DG      ++    ++ L VW+    +K  N   + ++   PV ID 
Sbjct: 213 FTTIRPPNEIQPFDGITGLYFLSGAVHDRLLNVWQVRSENKEKNAVMSFTVTDEPVYIDL 272

Query: 210 KNSPNGEDGTVILAVAESGVAYSWD 234
             S N E+   +  V   G  + ++
Sbjct: 273 TLSENKEEPVKLAVVCRDGQVHLFE 297


>gi|390594263|gb|EIN03676.1| WD40 repeat-like protein, partial [Punctularia strigosozonata
           HHB-11173 SS5]
          Length = 312

 Score = 39.7 bits (91), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 48/187 (25%), Positives = 82/187 (43%), Gaps = 18/187 (9%)

Query: 58  DVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIRE-FKASEK 116
           DV TG+   +    H   +  +AF+  G  +     +G      +  G+ I E F+    
Sbjct: 124 DVQTGQQIGQPLEGHTDWVYSVAFSPDGIRIVSRSRDGTLRLWDAHTGQAIGESFRGHSN 183

Query: 117 PISSSAFLCEEKIFALASSE--VRILSLENGEEV---LKFSDDVGPLQYVSAS-DGAKII 170
            ++S+AF  + K  A  SS+  +R+L  E G+ V   L+  D  G +  V+ S DGA+I+
Sbjct: 184 WVNSAAFSPDGKHIASGSSDDTIRLLDAETGQPVGDPLQGHD--GWVWSVAYSPDGARIV 241

Query: 171 ITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVA 230
             +G  +  +++W        V  GP    +      D  +     DG  I++ +E G  
Sbjct: 242 --SGSVDNTIRIWNAQTRQTVV--GPLQGHKK-----DVNSVAFSPDGKYIVSGSEDGTM 292

Query: 231 YSWDLKT 237
             WD +T
Sbjct: 293 RIWDAQT 299


>gi|335775362|gb|AEH58546.1| WD repeat-containing protein 43-like protein [Equus caballus]
          Length = 550

 Score = 39.7 bits (91), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 43/205 (20%), Positives = 82/205 (40%), Gaps = 14/205 (6%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWK-STGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
           LLALGT+ G IL    + GE+  K  +G H   +  + + +    L+    +    E  +
Sbjct: 28  LLALGTAVGSILLYSTVKGELHSKLISGGHDNKVNCIQWHQDNGYLYSCSDDKHIVEWNT 87

Query: 103 EMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYV- 161
           +  +V  ++K     +SS     + K+   A   +++  LE  E    F+    P+  + 
Sbjct: 88  QTCKVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLETKEIYRHFTGHATPVSSLL 147

Query: 162 ----------SASDGAK--IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDC 209
                        DG      ++    ++ L VW+    +K  N   + ++   PV ID 
Sbjct: 148 FTTIRPPNESQPFDGITGLYFLSGAVHDRLLNVWQVRSENKEKNAVMSFTVTDEPVYIDL 207

Query: 210 KNSPNGEDGTVILAVAESGVAYSWD 234
             S N E+   +  V + G  + ++
Sbjct: 208 TLSENKEEPVKLAVVCKDGQVHLFE 232


>gi|281345819|gb|EFB21403.1| hypothetical protein PANDA_012871 [Ailuropoda melanoleuca]
          Length = 605

 Score = 39.7 bits (91), Expect = 2.3,   Method: Compositional matrix adjust.
 Identities = 43/205 (20%), Positives = 81/205 (39%), Gaps = 14/205 (6%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWK-STGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
           LLALGT+ G IL    + GE+  K  +G H   +  + + +    L+    +    E  +
Sbjct: 118 LLALGTAVGSILLYSTVKGELHSKLISGGHDNKVNCIQWHQDNGCLYSCSDDKHIVEWNT 177

Query: 103 EMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYV- 161
           +  +V  ++K     +SS     + K+   A   +++  LE  E    F+    P+  + 
Sbjct: 178 QTCKVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLETKEIYRHFTGHATPVSSLI 237

Query: 162 ----------SASDGAK--IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDC 209
                        DG      ++    ++ L VW+    +K  N   + ++   PV ID 
Sbjct: 238 FTTIRPPNESQPFDGITGLYFLSGAVHDRLLNVWQVRAENKEKNAVMSFTVTDEPVYIDL 297

Query: 210 KNSPNGEDGTVILAVAESGVAYSWD 234
             S N E+   +  V   G  + ++
Sbjct: 298 TLSENKEEPVKLAVVCRDGQVHLFE 322


>gi|148231147|ref|NP_001085834.1| apoptotic peptidase activating factor 1 [Xenopus laevis]
 gi|49115497|gb|AAH73405.1| MGC80868 protein [Xenopus laevis]
          Length = 1248

 Score = 39.3 bits (90), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 37/182 (20%), Positives = 79/182 (43%), Gaps = 13/182 (7%)

Query: 72  HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFA 131
           H   +    F+  GR     G +      +SE GE + E +A +  +   AF  +EK+ A
Sbjct: 614 HKDAVFHACFSPDGRKFASCGADKTLQVFRSETGEKLLELEAHDDEVLCCAFSADEKLLA 673

Query: 132 LASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDG--AKIIITAGYGEKHLQVWRCDI 187
             S++  V+I + + G+ +  + +    +     ++G  A ++ T    +  + +W    
Sbjct: 674 TCSADRKVKIWNAKTGKPIRVYEEHTEQVNCCQFTNGLSAPLLATCS-NDCFIMLW---- 728

Query: 188 SSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQDEKTNPA 247
            S+T      L + H+     C+ SP   D   + + +  G    WD+++ ++++    A
Sbjct: 729 DSETEYSRNTL-IGHTGAVYHCRYSP---DDRYLASSSMDGSLKIWDVESANEEKSIEVA 784

Query: 248 KI 249
           K+
Sbjct: 785 KL 786


>gi|149173262|ref|ZP_01851893.1| WD40-repeat containing protein [Planctomyces maris DSM 8797]
 gi|148848068|gb|EDL62400.1| WD40-repeat containing protein [Planctomyces maris DSM 8797]
          Length = 1705

 Score = 39.3 bits (90), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 41/174 (23%), Positives = 78/174 (44%), Gaps = 10/174 (5%)

Query: 61   TGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISS 120
            TG+ + K   +H G +  + ++  G+ +     +G +    +E G ++R    S   + S
Sbjct: 1482 TGQAEKKLEQKHNGYVNTVRYSPDGKRILTSSEDGTSKIWNAESGAMLRSLDQSGTHVKS 1541

Query: 121  SAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVSAS-DGAKIIITAGYGE 177
            + F  +      AS +  + +   E G+++  F     P++ V+ S DG ++I  +G  +
Sbjct: 1542 AIFSPDGSQIVTASDDKTLVMWDAETGKKIKTFKGHEWPVREVAYSHDGKRLI--SGSED 1599

Query: 178  KHLQVWRCDISSKTVNKG---PALSMRHSPVAIDCKNSPNGEDGTVILAVAESG 228
                +W  D + KTV  G   P  S+  SP   D +     +DGT  L   ++G
Sbjct: 1600 NTAIIWDIDTAKKTVLSGHTAPVASVVFSP--DDSRAFTASDDGTAKLWDTDTG 1651



 Score = 37.4 bits (85), Expect = 8.9,   Method: Compositional matrix adjust.
 Identities = 39/146 (26%), Positives = 70/146 (47%), Gaps = 9/146 (6%)

Query: 64   MKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSA- 122
            +K +   +H   +  ++F+  G+ L     +  A       G   +EF      ++SSA 
Sbjct: 1094 LKGRVLAQHVDAILDVSFSHDGKQLVTASRDKTAISWDVSTGNPNKEFTEGHAFLASSAV 1153

Query: 123  FLCEEKIFALAS--SEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHL 180
            FL + K  A A+  + VRI  ++ G E  +F +  G    +  S  +K+++T G  EK +
Sbjct: 1154 FLPDGKRLATAAVDNSVRIWDIQTGTEHKRF-EHTGRSAAIDVSFDSKLLVT-GSDEKTV 1211

Query: 181  QVWRCDISSKTVNKGPALSMRHSPVA 206
            ++W  DI++  + K   LS  HS V+
Sbjct: 1212 RIW--DIATGELLK--ELSGHHSEVS 1233


>gi|57530361|ref|NP_001006396.1| WD repeat-containing protein 43 [Gallus gallus]
 gi|53127376|emb|CAG31071.1| hypothetical protein RCJMB04_2a19 [Gallus gallus]
          Length = 661

 Score = 39.3 bits (90), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 49/234 (20%), Positives = 86/234 (36%), Gaps = 22/234 (9%)

Query: 13  WKQPDGEPVVSYSCLACGFVGKKRRKERGTL-----LLALGTSNGDILAVDVLTGEMKWK 67
           W  P G P         G   KKR+ E   +     +LA+GT+ G IL    + GE++ K
Sbjct: 65  WAPPGGRPPPDKD----GPQRKKRKSEAAEVDKQLDILAIGTAVGSILLYSTVKGELQSK 120

Query: 68  STGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEE 127
             G H   +  + + +    L+    +    E  ++  +V  ++K     +SS     + 
Sbjct: 121 LDGGHDSRINCVRWHQDNCCLYSCSEDKHIVEWNTQTCKVKCKWKGDNSSVSSLCISPDG 180

Query: 128 KIFALASSEVRILSLENGEEVLKFS-----------DDVGPLQYVSASDGAK--IIITAG 174
           K+   A   +++  LE  E    F+             V P+      DG      ++  
Sbjct: 181 KMLLSAGRTIKLWDLETKEVYRHFTGHATSVSSLMFTTVKPMNENKPFDGITGLYFLSGA 240

Query: 175 YGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESG 228
             ++ L VW+     K  N   + ++   P  ID   S   E+   +  V   G
Sbjct: 241 IHDRLLSVWQIRSDRKEKNAVMSFTVTDEPTFIDLTVSELKEEPVKLAVVCRDG 294


>gi|281410841|gb|ADA68831.1| NWDp3 [Podospora anserina]
          Length = 210

 Score = 39.3 bits (90), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 39/169 (23%), Positives = 78/169 (46%), Gaps = 13/169 (7%)

Query: 72  HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFA 131
           H G +  +AF+  G  +     +G     +++ G+ +R+ +     + S AF  +     
Sbjct: 46  HSGSVRSVAFSPDGSRIVSASDDGTIRIWEAKSGKEVRKLEGHSNWVRSVAFSPDSSRIV 105

Query: 132 LASSE--VRILSLENGEEVLKFSDDVGPLQYVSAS-DGAKIIITAGYGEKHLQVWRCDIS 188
            AS +  +RI   ++G+EV K     G ++ V+ S DG++I+  +   ++ +++W    S
Sbjct: 106 SASDDGTIRIWEAKSGKEVRKLEGHSGSVRSVAFSPDGSRIV--SASNDQTIRIWEAK-S 162

Query: 189 SKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKT 237
            K V K       HS + +    SP   DG+ I++ +       W+ K+
Sbjct: 163 GKEVRKLEG----HSGLVLSVAFSP---DGSRIVSASNDQTIRIWEAKS 204


>gi|254410842|ref|ZP_05024620.1| hypothetical protein MC7420_320 [Coleofasciculus chthonoplastes PCC
            7420]
 gi|196182197|gb|EDX77183.1| hypothetical protein MC7420_320 [Coleofasciculus chthonoplastes PCC
            7420]
          Length = 1312

 Score = 39.3 bits (90), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 43/197 (21%), Positives = 78/197 (39%), Gaps = 39/197 (19%)

Query: 1    IWSTNDGSLLAEWK-----------QPDGEPVVSYSCLACGFVGKKRRKERGTLLLALGT 49
            +W T  G LL  ++            PDG+ +VS S                   L L  
Sbjct: 1049 LWDTTSGKLLHTFRGHEASVSAVAFSPDGQTIVSGSTDTT---------------LKLWD 1093

Query: 50   SNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIR 109
            ++G++L  D   G         HPGG+  +AF+  G+ +     +G      +  G+++ 
Sbjct: 1094 TSGNLL--DTFRG---------HPGGVTAVAFSPDGKRIVSGSGDGTLKLWDTTSGKLLH 1142

Query: 110  EFKASEKPISSSAFLCEEKIFALASSEVRI-LSLENGEEVLKFSDDVGPLQYVSASDGAK 168
             F+  E  +S+ AF  + +     S++  + L   +G  +  F      +  V+ S   K
Sbjct: 1143 TFRGHEASVSAVAFSPDGQTIVSGSTDTTLKLWDTSGNLLDTFRGHEDAVDAVAFSPDGK 1202

Query: 169  IIITAGYGEKHLQVWRC 185
             II+  Y +   ++WR 
Sbjct: 1203 RIISGSY-DNTFKLWRA 1218



 Score = 38.1 bits (87), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 30/127 (23%), Positives = 61/127 (48%), Gaps = 11/127 (8%)

Query: 66   WKSTGR-------HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPI 118
            W ++G+       HPGG+  +AF+  G+ +     +G      +  G+++  F+  E  +
Sbjct: 1009 WDTSGKLLHTFRGHPGGVTAVAFSPDGKRIVSGSGDGTLKLWDTTSGKLLHTFRGHEASV 1068

Query: 119  SSSAFLCEEKIFALASSEVRI-LSLENGEEVLKFSDDVGPLQYVSAS-DGAKIIITAGYG 176
            S+ AF  + +     S++  + L   +G  +  F    G +  V+ S DG +I+  +G G
Sbjct: 1069 SAVAFSPDGQTIVSGSTDTTLKLWDTSGNLLDTFRGHPGGVTAVAFSPDGKRIV--SGSG 1126

Query: 177  EKHLQVW 183
            +  L++W
Sbjct: 1127 DGTLKLW 1133


>gi|254409388|ref|ZP_05023169.1| YD repeat protein [Coleofasciculus chthonoplastes PCC 7420]
 gi|196183385|gb|EDX78368.1| YD repeat protein [Coleofasciculus chthonoplastes PCC 7420]
          Length = 1162

 Score = 39.3 bits (90), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 50/196 (25%), Positives = 96/196 (48%), Gaps = 18/196 (9%)

Query: 44   LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
            ++A  +S+G +   D+  G+++ +    H GG+A +AF+  G+ +    ++G+A  +   
Sbjct: 907  MIATASSDGTVRLWDI-QGKLQ-RRLPNHSGGVAQVAFSPDGQLIATASSDGIA-RLWDI 963

Query: 104  MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYV 161
             G ++++    +  + S AF  +    A ASS+  VR+  L+ G    +     G ++ V
Sbjct: 964  QGNLLQDLIGHQGWVRSLAFSPDGTQIATASSDRTVRLWDLQ-GNLRQELKGHQGWVKSV 1022

Query: 162  SASDGAKIIITAGYGEKHLQVWRCDIS-SKTVNKGPALSMRHSPVAIDCKNSPNGEDGTV 220
            + S     I TA   +  +++W  D +  K +N+ P+  + H  +A     SP   DGT 
Sbjct: 1023 AFSPNGDYIATASI-DGIVRLWDTDGNLVKELNQHPS-GITH--IAF----SP---DGTR 1071

Query: 221  ILAVAESGVAYSWDLK 236
            I   +  G+A  WDL+
Sbjct: 1072 IATASFEGIARLWDLQ 1087


>gi|195359262|ref|XP_002045331.1| GM24417 [Drosophila sechellia]
 gi|194127361|gb|EDW49404.1| GM24417 [Drosophila sechellia]
          Length = 787

 Score = 39.3 bits (90), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 59/279 (21%), Positives = 109/279 (39%), Gaps = 23/279 (8%)

Query: 62  GEMKWKSTGRHPGGLAGLAFAKKG-RSLHVVGTNGMASEMKSEMGEV-IREFKASEKPIS 119
           G++ W   G H   L G A  +   ++  V+ + G+ S +        +      E  I+
Sbjct: 23  GDIAWSPNGHHLYCLNGSAVNQVDVQTSQVIKSYGLTSPLADNKKPAELDNIDVEEDTIT 82

Query: 120 SSAFLCEEKIFALASSEVRILSLENGEEV-LKFSDDVGPLQYVSASDGAKIIITAGYGEK 178
                 E  + A  S  +R+  L+ G+ V L  +   GP+  V  S   ++I T+G  + 
Sbjct: 83  CFGLSQEHLVTAHRSGLLRLWELDTGKLVKLWKAQHKGPVIRVEFSPCGRLICTSGGADA 142

Query: 179 HLQVWRCDISSKTV------NKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYS 232
            L++W  D S+ +         GPA  +   P  +  +    G D +V         A++
Sbjct: 143 TLRLW--DYSNNSCLGALKDFPGPAWLLVFHPNVLRKEIYAGGSDNSV--------YAWN 192

Query: 233 WDLKTVSQDEKTNPAKIT-VKLKKADADQQNSVNVKKSRTSIIAARLNDLEADGQKTITG 291
           ++ KT+    + + +++T +  K   +D    V V + +  I+      LE+   K +  
Sbjct: 193 YETKTLVHKMRGHLSQVTGLSFKSNASDCNEIVTVSRDKVMIVWQLQEQLESKQLKVLPL 252

Query: 292 LHDLESEAATASAQN---EKKSKKRAASDPDLETTRDVV 327
             +LE        Q       S K    DP     RD++
Sbjct: 253 YDELEGVVYADEGQQIIVASGSGKLQQVDPKSWKIRDIL 291


>gi|123968669|ref|YP_001009527.1| hypothetical protein A9601_11361 [Prochlorococcus marinus str.
           AS9601]
 gi|123198779|gb|ABM70420.1| G-protein beta WD-40 repeats [Prochlorococcus marinus str. AS9601]
          Length = 357

 Score = 39.3 bits (90), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 44/183 (24%), Positives = 76/183 (41%), Gaps = 17/183 (9%)

Query: 27  LACGFVGKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGR 86
           +ACG+  K ++         +G   G + + +  TG++ WK    H GGL  +A   +G 
Sbjct: 26  IACGWALKGKQ-------FIVGDVAGGLFSFEGDTGKIIWKKENTHSGGLLAMAIHPEGE 78

Query: 87  SLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGE 146
                G +G         G+VI+     +  +    +  +    A+ASS+   +  E GE
Sbjct: 79  IFATSGQDGNVQICNCHEGKVIKTLDLGKGWVEHLKWSNDGLFLAIASSKKVYVFNEIGE 138

Query: 147 EVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVN-----KGPALSMR 201
           E     D    +  ++ S+  + + TA YG    +V   DI +   N     +G  +SM 
Sbjct: 139 EKWISEDHPSTVSAITWSNKNE-LATACYG----RVTFFDIVNNKTNQKLEWQGSLVSME 193

Query: 202 HSP 204
            SP
Sbjct: 194 LSP 196


>gi|427725983|ref|YP_007073260.1| YD repeat protein [Leptolyngbya sp. PCC 7376]
 gi|427357703|gb|AFY40426.1| YD repeat protein [Leptolyngbya sp. PCC 7376]
          Length = 1808

 Score = 39.3 bits (90), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 49/173 (28%), Positives = 69/173 (39%), Gaps = 24/173 (13%)

Query: 72   HPG--GLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKI 129
            HP   G+  +A++     +   G +G      +E GE IR  +  E PI    F    K 
Sbjct: 1190 HPSAQGINAVAYSPDESMVATGGADGNIRLWSAE-GESIRTLEDHEAPIYEMEFSPNGK- 1247

Query: 130  FALASSE---VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCD 186
            F L+ SE    R+   E GE +  F D    +  VS S  ++II TA             
Sbjct: 1248 FLLSGSEDFTARLWDPETGELLRTFEDHDNSIYGVSFSPDSQIIATASV----------- 1296

Query: 187  ISSKTVNKGPALSMRHSPVAIDCKN---SPNGEDGTVILAVAESGVAYSWDLK 236
                TVN           + ID +N   S N  DG+ I   +E G+   WDL+
Sbjct: 1297 --DGTVNIYSVEGQLLQTLEIDLENYDVSFNA-DGSAIATASEDGILRFWDLE 1346


>gi|390594203|gb|EIN03616.1| hypothetical protein PUNSTDRAFT_139337 [Punctularia strigosozonata
            HHB-11173 SS5]
          Length = 1087

 Score = 38.9 bits (89), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 48/204 (23%), Positives = 86/204 (42%), Gaps = 26/204 (12%)

Query: 45   LALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEM 104
            LA  +++G +   DV TG+   +    H   +  +AF+  G  +     +       +  
Sbjct: 853  LASASTDGTVRLWDVETGQRIGQPLEEHTNWVCCVAFSPDGNRIVSGSVDRTLRLWDAHT 912

Query: 105  GEVIRE-FKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYV 161
            G+ I E F+     + S AF  + K  A  SS+  +R+   E GE V +      PLQ  
Sbjct: 913  GQAIGEPFRGHSDYVQSVAFSPDGKHIASGSSDSTIRLWDAETGEPVGE------PLQGH 966

Query: 162  SAS--------DGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSP 213
            ++S        DG +I+  +G  +K +++W  D  ++    GP    +      D  +  
Sbjct: 967  NSSVFSVAYSPDGTRIV--SGSYDKTIRIW--DTQTRQTVVGPLQGHKK-----DVNSVA 1017

Query: 214  NGEDGTVILAVAESGVAYSWDLKT 237
               DG  +++ +E G    WD +T
Sbjct: 1018 FSPDGKHVVSGSEDGTMRIWDTQT 1041


>gi|193215005|ref|YP_001996204.1| WD40 domain-containing protein [Chloroherpeton thalassium ATCC
           35110]
 gi|193088482|gb|ACF13757.1| WD-40 repeat protein [Chloroherpeton thalassium ATCC 35110]
          Length = 671

 Score = 38.9 bits (89), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 40/153 (26%), Positives = 70/153 (45%), Gaps = 7/153 (4%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
           LLA G ++G +   D    E +  S   H   +  +AF+  GR L   G + +A    + 
Sbjct: 455 LLASGGNDGWVKTWDT-RKESEIDSFHEHEDAVTSVAFSSDGRFLASAGNDKIAVLWNAG 513

Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENG-EEVLKFSDDVGPLQY 160
            G+         +P++  AF    K  A  S +  +++ +LE G EE+      VG + +
Sbjct: 514 TGKKKHTLVGHSRPVTCVAFSPNAKFLATGSWDRSIKLWNLETGLEEICLAGHPVG-IDF 572

Query: 161 VSASDGAKIIITAGYGE-KHLQVWRC-DISSKT 191
           ++ S   K++I +GY   + L + R  DI  K+
Sbjct: 573 IAFSPNGKMMIASGYNRVRKLSIMRLWDIEKKS 605


>gi|406834758|ref|ZP_11094352.1| serine/threonine protein kinase-related protein [Schlesneria
            paludicola DSM 18645]
          Length = 1244

 Score = 38.9 bits (89), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 35/131 (26%), Positives = 57/131 (43%), Gaps = 5/131 (3%)

Query: 45   LALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEM 104
             A G+++G +   DV TG   W+S+  H   +  +AF+  G  +   G N + S   +  
Sbjct: 950  FATGSTDGFVRLWDVSTGRALWESS--HGSEINSVAFSPNGALVAAAGENRITSIWDTRT 1007

Query: 105  GEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVS 162
            G+ ++     E  + S AF    +    +S +   R+ SLE G +        G L   S
Sbjct: 1008 GQRLKSLVGHENEVMSVAFSPNGRSVLTSSHDGTARVWSLETGGQQFLLPHR-GALLEAS 1066

Query: 163  ASDGAKIIITA 173
             S   K+I TA
Sbjct: 1067 FSPDGKLIATA 1077


>gi|290976462|ref|XP_002670959.1| predicted protein [Naegleria gruberi]
 gi|284084523|gb|EFC38215.1| predicted protein [Naegleria gruberi]
          Length = 623

 Score = 38.9 bits (89), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 42/184 (22%), Positives = 74/184 (40%), Gaps = 13/184 (7%)

Query: 27  LACGFVGKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGR 86
           ++C F+     K+  ++LL  G S+G I AVD  TGE  W     H   +  +  +   +
Sbjct: 381 VSCSFI-----KDDISILLT-GWSDGSIRAVDCETGEQLWTLANAHRNAVTKIDTSFNNK 434

Query: 87  SLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASS---EVRILSLE 143
                G  G       +  +++  FK     ++    L E+ +   +SS   E+    L 
Sbjct: 435 FFVSAGQEGDVRVYAMKTRQLVCSFKEHTTMVTCLQIL-EDDVHVFSSSKDREIFCYDLR 493

Query: 144 NGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHS 203
           N   V  F+  VG +         + II+ G  ++++  W   ISS +  +    S +  
Sbjct: 494 NERRVAAFNMKVGSVNGFVVCKDHETIISIG-SDRYITFWT--ISSDSPTRIVPYSKKFE 550

Query: 204 PVAI 207
           P  I
Sbjct: 551 PKCI 554


>gi|430746307|ref|YP_007205436.1| WD40 repeat-containing protein [Singulisphaera acidiphila DSM
           18658]
 gi|430018027|gb|AGA29741.1| WD40 repeat-containing protein [Singulisphaera acidiphila DSM
           18658]
          Length = 939

 Score = 38.9 bits (89), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 40/156 (25%), Positives = 76/156 (48%), Gaps = 14/156 (8%)

Query: 85  GRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCE--EKIFALASSEVRILSL 142
           G     VG + +     +  G V++E  ASE+P+++ A   +  +   ALA+  +RI + 
Sbjct: 287 GSRFATVGPDKVVRVYNAADGAVVKEIPASEEPVAALALKADGSQIAIALANRAIRICNT 346

Query: 143 ENGEEVLKFSDDVGPLQYVS-ASDGAKIIITAGYGEKHLQVWRCDISSKTVNK-GPALSM 200
            +G EV K      P+  ++  +DGA++ +    GE  + +   D ++ TV K  P    
Sbjct: 347 ADGTEVKKSEPLPSPVSALAFRADGAQLAVA---GEDTI-IRVLDAAAATVVKEFPG--- 399

Query: 201 RHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLK 236
            H       + +PN  DG ++++ ++  +A  WD+K
Sbjct: 400 -HGAKVNALEFAPN--DGNLLISASDDKLAKLWDVK 432



 Score = 37.7 bits (86), Expect = 8.9,   Method: Compositional matrix adjust.
 Identities = 25/115 (21%), Positives = 56/115 (48%), Gaps = 6/115 (5%)

Query: 73  PGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFAL 132
           P  ++ LAF   G  L V G + +   + +    V++EF      +++  F   +    +
Sbjct: 359 PSPVSALAFRADGAQLAVAGEDTIIRVLDAAAATVVKEFPGHGAKVNALEFAPNDGNLLI 418

Query: 133 ASSE---VRILSLENGEEVLKFSDDVGPLQYVSAS-DGAKIIITAGYGEKHLQVW 183
           ++S+    ++  ++ G+ +  F+    PL  ++ S DG+K++   G  +K ++VW
Sbjct: 419 SASDDKLAKLWDVKEGKLIRDFAGHTEPLLTLNVSRDGSKLVT--GSADKSIKVW 471


>gi|162149625|ref|YP_001192004.2| leucyl-tRNA synthetase [Metallosphaera sedula DSM 5348]
          Length = 938

 Score = 38.9 bits (89), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 43/161 (26%), Positives = 67/161 (41%), Gaps = 16/161 (9%)

Query: 91  VGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEV------------- 137
           VG +    +M+ E+GE +  F  S+    ++A L  E IF   +  V             
Sbjct: 183 VGMHDTKGDMEPEIGEYVVIFFESKMGALAAATLRPETIFGAVAVWVNPKATYTVAEIWG 242

Query: 138 -RILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGP 196
            +++  E   E LKF  DV  L+ VS SD  KI+       K + +   D    T   G 
Sbjct: 243 KKVIVSEKAAEKLKFQTDVKVLEKVSGSDLLKIVAINPITGKEIPILPADFVDPTTATGV 302

Query: 197 ALSMR-HSPV-AIDCKNSPNGEDGTVILAVAESGVAYSWDL 235
            +S+  H+P      K +  G +   ++AV   G A + DL
Sbjct: 303 VMSVPAHAPFDYFYLKKAKVGIEPIPVVAVEGQGDAPAKDL 343


>gi|189082975|sp|A4YI28.1|SYL2_METS5 RecName: Full=Leucine--tRNA ligase 2; AltName: Full=Leucyl-tRNA
           synthetase 2; Short=LeuRS 2
 gi|145702938|gb|ABP96080.1| leucyl-tRNA synthetase [Metallosphaera sedula DSM 5348]
          Length = 950

 Score = 38.9 bits (89), Expect = 3.4,   Method: Compositional matrix adjust.
 Identities = 43/161 (26%), Positives = 67/161 (41%), Gaps = 16/161 (9%)

Query: 91  VGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEV------------- 137
           VG +    +M+ E+GE +  F  S+    ++A L  E IF   +  V             
Sbjct: 195 VGMHDTKGDMEPEIGEYVVIFFESKMGALAAATLRPETIFGAVAVWVNPKATYTVAEIWG 254

Query: 138 -RILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGP 196
            +++  E   E LKF  DV  L+ VS SD  KI+       K + +   D    T   G 
Sbjct: 255 KKVIVSEKAAEKLKFQTDVKVLEKVSGSDLLKIVAINPITGKEIPILPADFVDPTTATGV 314

Query: 197 ALSMR-HSPV-AIDCKNSPNGEDGTVILAVAESGVAYSWDL 235
            +S+  H+P      K +  G +   ++AV   G A + DL
Sbjct: 315 VMSVPAHAPFDYFYLKKAKVGIEPIPVVAVEGQGDAPAKDL 355


>gi|428308856|ref|YP_007119833.1| WD40 repeat-containing protein [Microcoleus sp. PCC 7113]
 gi|428250468|gb|AFZ16427.1| WD40 repeat-containing protein [Microcoleus sp. PCC 7113]
          Length = 1205

 Score = 38.9 bits (89), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 28/106 (26%), Positives = 51/106 (48%), Gaps = 7/106 (6%)

Query: 45  LALGTSNGDILAVDVLTGEM--KWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
           LA G+ +G +   D+ TG+    W++   H G +  +AF+ +GR+L   G +        
Sbjct: 689 LASGSGDGTLRCWDLNTGQCLKMWQA---HLGQVWSVAFSPQGRTLASSGADNTMKLWDV 745

Query: 103 EMGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGE 146
             G+ ++ F++    + S AF  + KI A   ++  VR   +  GE
Sbjct: 746 STGQCLKTFQSDNNQVQSVAFSPDGKILASGGNDCLVRCWDINTGE 791


>gi|425457085|ref|ZP_18836791.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
 gi|389801671|emb|CCI19204.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9807]
          Length = 1108

 Score = 38.9 bits (89), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 50/199 (25%), Positives = 83/199 (41%), Gaps = 26/199 (13%)

Query: 44   LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
            ++A+   +G I  +D    +++  +T      +  +AF   G  + + G +G   ++ S+
Sbjct: 842  IIAIANKDGQITLLDSQGKKIREFTTKMR--SIYSIAFHPDGNQIAITGRSGKV-QIWSK 898

Query: 104  MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFS---DDVGPL 158
             G +++EF AS+ PI S AF  E       +SE  V+   L N    L  S   DD    
Sbjct: 899  KGTMLQEFTASQVPIYSLAFNGEGTAIITGTSEGKVQYWHLSNYRPQLINSWTADDNIIY 958

Query: 159  QYVSASDGAKIIITAGYGEKHLQVWRCD---ISSKTVNKGPALSMRHSPVAIDCKNSPNG 215
              V + D  KI  TA  G+  +++W      +     +  P   +  SP           
Sbjct: 959  DLVFSPDHQKIA-TATRGK--IKIWDLQGNLLKEIKTDSFPVYGVSFSP----------- 1004

Query: 216  EDGTVILAVAESGVAYSWD 234
             DG  I A++  G A  WD
Sbjct: 1005 -DGEKIAAISRDGTARRWD 1022


>gi|444919762|ref|ZP_21239726.1| High-affnity carbon uptake protein Hat/HatR [Cystobacter fuscus DSM
           2262]
 gi|444707968|gb|ELW49101.1| High-affnity carbon uptake protein Hat/HatR [Cystobacter fuscus DSM
           2262]
          Length = 953

 Score = 38.9 bits (89), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 44/173 (25%), Positives = 77/173 (44%), Gaps = 20/173 (11%)

Query: 71  RHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIF 130
           RH   +  +AF+  GRS+     +G A       GE + +  + E+P+++ AF  E K  
Sbjct: 716 RHADAVNAVAFSPDGRSVATASEDGTARLWSVATGEPLGKPFSHERPVTAVAFSPEGKSL 775

Query: 131 ALASSE--VRILSLENGEEV---LKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRC 185
           A AS++   R+ +   GE +   L+    +  L +  + DG  +   +  G   +++W  
Sbjct: 776 ATASTDNTARLWNTATGEPLGSPLRHDALITSLAF--SPDGQSLATASDDGS--VRLW-- 829

Query: 186 DISSKTVNKGPALSMRHSPVAI-DCKNSPNGEDGTVILAVAESGVAYSWDLKT 237
           D+++     G   S  H P A+     SP   DG  +   +E   A  WD+ T
Sbjct: 830 DVAT-----GSERSRLHHPNAVTSVAFSP---DGKSLATGSEDDSARLWDVAT 874


>gi|395507078|ref|XP_003757855.1| PREDICTED: WD repeat-containing protein 43 [Sarcophilus harrisii]
          Length = 688

 Score = 38.9 bits (89), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 53/250 (21%), Positives = 103/250 (41%), Gaps = 24/250 (9%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWK-STGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
           LLA+GT+ G IL  + L GE++ K  +G H   +  + + +    L+    +    E  +
Sbjct: 96  LLAIGTAVGSILLYNTLKGELQSKLVSGGHDNKVNCIQWHQDNDCLYSCSDDKHIVEWNT 155

Query: 103 EMGEVIREFKASEKPISSSAFLC---EEKIFALASSEVRILSLENGEEVLKFSDD----- 154
           +   V  ++K     ISS + LC   + K+   A   +++  LE  E    F+       
Sbjct: 156 QTCRVKCKWKGD---ISSVSALCISPDGKMLLSAGRTIKLWDLETKEVYRHFTGHATAVS 212

Query: 155 ------VGPLQYVSASDGAK--IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVA 206
                 + P +     DG      ++    ++ L VW+    +K  N   + ++   PV 
Sbjct: 213 SLMFTTLRPPKESQPFDGITGLYFLSGAVHDRLLNVWQVRSENKEKNAVMSFTVTDEPVY 272

Query: 207 IDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQDEKTNPAKITVKLKKADADQQNSVNV 266
           ID   S N E+   +  V   G  + ++       +K   +  T+++  +  D++++   
Sbjct: 273 IDLTLSENKEEPVKLAVVCRDGQVHLFEHILNGYCKKPLTSNCTIQIATSGKDKEST--- 329

Query: 267 KKSRTSIIAA 276
             S   I+AA
Sbjct: 330 -PSPIPILAA 338


>gi|295838049|ref|ZP_06824982.1| WD-repeat protein [Streptomyces sp. SPB74]
 gi|295826815|gb|EDY46831.2| WD-repeat protein [Streptomyces sp. SPB74]
          Length = 1631

 Score = 38.9 bits (89), Expect = 3.7,   Method: Composition-based stats.
 Identities = 41/147 (27%), Positives = 64/147 (43%), Gaps = 7/147 (4%)

Query: 41   GTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEM 100
            GT L + G S+G +   DV TG+   +  GR    L   AF+  G  L   G+ G     
Sbjct: 1288 GTRLASAG-SDGTVRLWDVATGQRTHELLGRG-DRLISAAFSPAGTVLATAGSTGHVYLW 1345

Query: 101  KSEMGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPL 158
             +E G  +RE         + AF  + +  A A+ +  VR+     G   L+     G +
Sbjct: 1346 DAENGTFLRELDVETDRTWAEAFSADGEEIATANDDDSVRLWRRATGSHGLRLDGHEGRV 1405

Query: 159  QYVS-ASDGAKIIITAGYGEKHLQVWR 184
            + V+ A DGA   I  G  +  +++WR
Sbjct: 1406 RSVAFAKDGAS--IATGCDDGRVRLWR 1430


>gi|381207419|ref|ZP_09914490.1| hypothetical protein SclubJA_17565 [SAR324 cluster bacterium
           JCVI-SC AAA005]
          Length = 348

 Score = 38.9 bits (89), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 34/136 (25%), Positives = 62/136 (45%), Gaps = 7/136 (5%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
           L A+G ++G + A D  +G+  WK+T  H GG+  L+ +  G  L  VG +G  +     
Sbjct: 36  LFAVGDASGSVHAFDGTSGQEIWKNTEAHVGGIMSLSVSPNGTQLASVGQDGQLTIWNFA 95

Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVL---KFSDDVGPLQY 160
            G++  + + +   +   A+       A++S  V      +GE++    K    V  L +
Sbjct: 96  DGQLQHQTEIASGWVEHVAWSKSGNYLAVSSGRVVTNLSSSGEKIWSSEKHPSTVSALGW 155

Query: 161 VSASDGAKIIITAGYG 176
           V+  +    + TA YG
Sbjct: 156 VTDQE----LSTACYG 167


>gi|193216393|ref|YP_001997592.1| WD40 domain-containing protein [Chloroherpeton thalassium ATCC
           35110]
 gi|193089870|gb|ACF15145.1| WD-40 repeat protein [Chloroherpeton thalassium ATCC 35110]
          Length = 514

 Score = 38.9 bits (89), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 35/139 (25%), Positives = 67/139 (48%), Gaps = 8/139 (5%)

Query: 54  ILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMAS--EMKSEMGEVI--R 109
           I+  ++  GE+ +K TG H G +  +A  +K   L   G++   S  ++ S+    I  R
Sbjct: 292 IIFWNLRNGEITYKITG-HSGWITSIALNEKNTLLASGGSDETVSLWDIASQPDSAIELR 350

Query: 110 EFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGA 167
            F   +  + S AF  + KI A    +  ++  ++E G E+ +       ++ ++ S   
Sbjct: 351 RFPGHKDYVRSIAFAPDGKILASGGDDRTLKFWNVETGREIYEIEGHASWVRSIAYSPNG 410

Query: 168 KIIITAGYGEKHLQVWRCD 186
           KII TAG  +K +++W  +
Sbjct: 411 KIIATAG-DDKLIKLWNAE 428


>gi|406833633|ref|ZP_11093227.1| serine/threonine protein kinase-related protein [Schlesneria
            paludicola DSM 18645]
          Length = 1664

 Score = 38.9 bits (89), Expect = 3.9,   Method: Compositional matrix adjust.
 Identities = 40/155 (25%), Positives = 67/155 (43%), Gaps = 11/155 (7%)

Query: 81   FAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSE--VR 138
            F+  G  +    ++  A    ++ GE+IREFK  E  +    F  + +     S +   R
Sbjct: 1501 FSPNGDYIVTTSSDKTARLWNAKSGELIREFKGHEWAVVCVDFSKDGRWIVTGSEDNTAR 1560

Query: 139  ILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPAL 198
            + ++E  E++L  S     +  V+ S  +  IIT G  ++  ++W     +KT  +   L
Sbjct: 1561 VWNVETAEQLLTLSGHTASVTSVNFSPDSMRIITGGQ-DQAAKLW----DAKTGKEILTL 1615

Query: 199  SMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSW 233
            S RH+        SP   DG  IL  +  G A  W
Sbjct: 1616 S-RHTEEVTSVAFSP---DGHQILTGSRDGTAVIW 1646


>gi|403222008|dbj|BAM40140.1| uncharacterized protein TOT_020000403 [Theileria orientalis strain
           Shintoku]
          Length = 685

 Score = 38.9 bits (89), Expect = 4.0,   Method: Compositional matrix adjust.
 Identities = 26/118 (22%), Positives = 56/118 (47%), Gaps = 3/118 (2%)

Query: 72  HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFA 131
           H  G+  ++F ++G   +    +  +    +E G++I  +   + P   +    +E +F 
Sbjct: 438 HSKGVRDISFIEEGSKFYSCSFDSNSILWDTEYGKIIGIYTVEKTPYCLTVCPKDEWVFI 497

Query: 132 LASSEVRILSLE--NGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDI 187
           +     +    +   GE VL++++ +G +  V+  DG + I+T G  +K L VW  +I
Sbjct: 498 VGGENKKASQFDARTGEVVLEYAEHMGCVNTVTFIDGNRRILTTG-DDKKLLVWDYNI 554


>gi|427718396|ref|YP_007066390.1| serine/threonine protein kinase with WD40 repeats [Calothrix sp.
           PCC 7507]
 gi|427350832|gb|AFY33556.1| serine/threonine protein kinase with WD40 repeats [Calothrix sp.
           PCC 7507]
          Length = 661

 Score = 38.5 bits (88), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 30/122 (24%), Positives = 53/122 (43%), Gaps = 9/122 (7%)

Query: 72  HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLC------ 125
           H G +  +AF+  G +L     +          GE IR  K     ++S A+L       
Sbjct: 538 HSGLVLAVAFSPDGINLASGSKDKTIKLWNLVTGEAIRTLKGHTDKVNSVAYLPKSGDNK 597

Query: 126 -EEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQV 182
            +  I    S++  V++ +LE G+E+     D G +  V+ S   K I + G  +  +++
Sbjct: 598 NQNTILISGSNDNTVKLWNLETGKEIRTLKRDSGYIYSVAVSADGKTIASGGSADNIIKI 657

Query: 183 WR 184
           WR
Sbjct: 658 WR 659


>gi|332707508|ref|ZP_08427552.1| WD-40 repeat-containing protein [Moorea producens 3L]
 gi|332353722|gb|EGJ33218.1| WD-40 repeat-containing protein [Moorea producens 3L]
          Length = 1182

 Score = 38.5 bits (88), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 50/194 (25%), Positives = 81/194 (41%), Gaps = 16/194 (8%)

Query: 44   LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
            LLA  + +  +   DV  G+  +  TG H   +  + F   GR L     +      + +
Sbjct: 915  LLASASLDNTVKLWDVDNGKEIYTLTG-HTSNVRSITFRSDGRILASGSDDRTIKLWRVQ 973

Query: 104  MGEVIREFKASEKPISSSAFLCEEKIFALASSEVRIL--SLENGEEVLKFSDDVGPLQYV 161
             GE++R FK     I   +F  + +  A AS + RIL   +E+G  V  F +    L  +
Sbjct: 974  DGELLRTFKGHLHSIRDLSFTPDGQNIATASFDGRILFWQVEDGRMVKVFDNIDSWLATI 1033

Query: 162  SASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGE----- 216
            S S   K++ + G G + +++W  +     V + P     H       + SPNG+     
Sbjct: 1034 SISPNGKLLASGG-GYRGIKLWN-NSDGTIVKELPG----HGIWIRSLRFSPNGKLLASG 1087

Query: 217  --DGTVILAVAESG 228
              D TV L   E G
Sbjct: 1088 SFDRTVKLWRVEDG 1101


>gi|444918527|ref|ZP_21238596.1| WD-40 repeat protein [Cystobacter fuscus DSM 2262]
 gi|444709706|gb|ELW50706.1| WD-40 repeat protein [Cystobacter fuscus DSM 2262]
          Length = 182

 Score = 38.5 bits (88), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 27/115 (23%), Positives = 56/115 (48%), Gaps = 5/115 (4%)

Query: 72  HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE-MGEVIREFKASEKPISSSAFLCEEKIF 130
           H G ++  AF+ +G+ +   G +G     +++  GE +      + PI S+AF  + +  
Sbjct: 18  HEGQVSSAAFSPEGQRVITSGADGTVRVWRADGTGEPL-VLDGRQGPIRSAAFSPDGQRI 76

Query: 131 ALASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVW 183
           A +  +   R+   +   E L      GP+Q+V+ S   + ++TAG  ++  ++W
Sbjct: 77  ATSGDDGTARVWRADGTGEPLLLKGHTGPVQFVAFSPDGRQLVTAG-ADRTARIW 130


>gi|425468738|ref|ZP_18847730.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
           9701]
 gi|389884607|emb|CCI35113.1| conserved membrane hypothetical protein [Microcystis aeruginosa PCC
           9701]
          Length = 559

 Score = 38.5 bits (88), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 32/106 (30%), Positives = 57/106 (53%), Gaps = 6/106 (5%)

Query: 79  LAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALAS-SEV 137
           LAF+  G+ L   G++        + GE+I+ ++A E+ I S A      + A AS +E+
Sbjct: 385 LAFSNDGQYLISGGSDKTIKIWDIKTGEIIKSWQAHEQAIISIAINPHRHLIASASRTEI 444

Query: 138 RILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVW 183
           +I   + G E++K      PL++  + DG + +IT  YG K +++W
Sbjct: 445 KIWQGQTG-ELIKVLRGTAPLKF--SPDG-QFLITGSYGGK-VKIW 485


>gi|344280461|ref|XP_003412002.1| PREDICTED: LOW QUALITY PROTEIN: WD repeat-containing protein
           43-like [Loxodonta africana]
          Length = 677

 Score = 38.5 bits (88), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 56/262 (21%), Positives = 100/262 (38%), Gaps = 29/262 (11%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVG---------KKRRKER-GTL----LLA 46
           +W T +  L  E+  P      + +CLA              KKR+ E  GT     LLA
Sbjct: 40  VWETGNNRLHQEY-VPSAHLSGTCTCLAWAPARLQAKESPQRKKRKSEAIGTSDQVDLLA 98

Query: 47  LGTSNGDILAVDVLTGEMKWK-STGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMG 105
           LGT+ G IL    + GE+  K  +G H   +  + + +    L+    +    E  ++  
Sbjct: 99  LGTAVGSILLYSTVKGELHSKLISGGHDNKVNCIQWHQDNGYLYSCSDDKHIVEWNAQTC 158

Query: 106 EVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSAS- 164
            V  ++K     +SS     + K+   A   +++  LE  E    F+    P+  +  + 
Sbjct: 159 RVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLETKEVYRHFTGHATPVSSLMFTT 218

Query: 165 ----------DGAK--IIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNS 212
                     DG      ++    ++ L +W+     K  N   + ++   PV ID   S
Sbjct: 219 IRPPNESQPFDGITGLYFLSGAVHDRLLNIWQVRSEKKEKNAVMSFTVTDEPVYIDLTLS 278

Query: 213 PNGEDGTVILAVAESGVAYSWD 234
            N E+   +  V   G  + ++
Sbjct: 279 ENKEEPVRLAVVCRDGQVHLFE 300


>gi|321478591|gb|EFX89548.1| hypothetical protein DAPPUDRAFT_303234 [Daphnia pulex]
          Length = 907

 Score = 38.5 bits (88), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 23/62 (37%), Positives = 39/62 (62%), Gaps = 4/62 (6%)

Query: 132 LASSEVRILSLENGEEVLKFSDDVGPLQYVSA-SDGAKIIITAGYGEKHLQVWRCDISSK 190
           LAS E+ + +L+  E ++KF  D GP+  ++  +DGA I+IT G  + HL +W  D+  +
Sbjct: 214 LASGEMCLHNLKFDETLIKFKQDWGPITALTFRTDGAPIMIT-GSSQGHLAIW--DLEQQ 270

Query: 191 TV 192
           T+
Sbjct: 271 TL 272


>gi|195582010|ref|XP_002080821.1| GD10690 [Drosophila simulans]
 gi|194192830|gb|EDX06406.1| GD10690 [Drosophila simulans]
          Length = 787

 Score = 38.5 bits (88), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 60/284 (21%), Positives = 110/284 (38%), Gaps = 23/284 (8%)

Query: 62  GEMKWKSTGRHPGGLAGLAFAKKG-RSLHVVGTNGMASEMKSEMGEV-IREFKASEKPIS 119
           G++ W   G H   L G A  +   ++  V+ + G+ S +        +      E  I+
Sbjct: 23  GDIAWSPNGHHLYCLNGSAVNQVDVQTSQVLKSYGLTSPLADNKKPAELDNIDVEEDTIT 82

Query: 120 SSAFLCEEKIFALASSEVRILSLENGEEV-LKFSDDVGPLQYVSASDGAKIIITAGYGEK 178
                 E  + A  S  +R+  L+ G+ V L  +   GP+  V  S   ++I T+G  + 
Sbjct: 83  CFGLSQEHLVTAHRSGLLRLWELDTGKLVKLWKAQHKGPVIRVEFSPCGRLICTSGGADA 142

Query: 179 HLQVWRCDISSKTV------NKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYS 232
            L++W  D S+ +         GPA  +   P  +  +    G D +V         A++
Sbjct: 143 TLRLW--DYSNNSCLGALKDFPGPAWLLVFHPNVLRKEIYAGGSDNSV--------YAWN 192

Query: 233 WDLKTVSQDEKTNPAKIT-VKLKKADADQQNSVNVKKSRTSIIAARLNDLEADGQKTITG 291
           ++ KT+    + + +++T +  K   +D    V V + +  I+      LE+   K +  
Sbjct: 193 YETKTLVHKMRGHLSQVTGLSFKSNASDCNEIVTVSRDKVMIVWQLQEQLESKQLKVLPL 252

Query: 292 LHDLESEAATASAQN---EKKSKKRAASDPDLETTRDVVDGECF 332
             +LE        Q       S K    DP     RD++    F
Sbjct: 253 YDELEGVVYADEGQQIIVASGSGKLQQVDPKSWKIRDILSQSDF 296


>gi|156407954|ref|XP_001641622.1| predicted protein [Nematostella vectensis]
 gi|156228761|gb|EDO49559.1| predicted protein [Nematostella vectensis]
          Length = 686

 Score = 38.5 bits (88), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 71/323 (21%), Positives = 124/323 (38%), Gaps = 54/323 (16%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVGKK-----RRKERGTL----------LL 45
           +W   +GSL  ++  P      + +CL      +      RRK++ T           ++
Sbjct: 51  VWDCTNGSLKNQYT-PSSHLSTTCACLKWCNSSRNVEQTPRRKKQKTAHHSTSVSTSDII 109

Query: 46  ALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMG 105
           ALGT++GDIL   +  GE+  K  G H   +  + +++ G  L+   ++    E  +E  
Sbjct: 110 ALGTTSGDILLYSLSAGEVHKKLEGGHSSKVNDVEWSRNGE-LYSCSSDKYIVEWCTEKA 168

Query: 106 EVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKF-------------- 151
           E   ++KA +  +SS            A   +++  L+  E + KF              
Sbjct: 169 EHKSKWKADKHGVSSLRVSPLGNHLLSAGRTIKLWDLDTKELLKKFTGHSSIITDLLFLP 228

Query: 152 ----SDDVGPLQYVSASDGAKIIITAGY-----------GEKHLQV-WRCDISSKTVNKG 195
               SD  G     S  +G   + +A +           G  HL V W  ++ +K  N  
Sbjct: 229 NPTSSDSNGNHVSDSTINGWYFLSSAEHDRLLNIWQVFIGVIHLLVNWYIELETKAKNSL 288

Query: 196 PALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQDEKTNPAKITVKLKK 255
            +LS+   P++ID            I  V + G  + ++       +K      T++LK 
Sbjct: 289 ISLSLTDEPISIDVTRHDTRPKPVHISVVTKDGRLHLFEKSLNGGGKKPLLPSRTIQLKS 348

Query: 256 ADADQQNSVNVKKSRTSIIAARL 278
           AD       +  KS  S I A+L
Sbjct: 349 AD-------DKAKSPISFICAKL 364


>gi|300709644|ref|YP_003735458.1| cell surface protein/ lipoprotein [Halalkalicoccus jeotgali B3]
 gi|448297585|ref|ZP_21487631.1| cell surface protein/ lipoprotein [Halalkalicoccus jeotgali B3]
 gi|299123327|gb|ADJ13666.1| cell surface protein/ lipoprotein [Halalkalicoccus jeotgali B3]
 gi|445579894|gb|ELY34287.1| cell surface protein/ lipoprotein [Halalkalicoccus jeotgali B3]
          Length = 400

 Score = 38.5 bits (88), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 32/133 (24%), Positives = 63/133 (47%), Gaps = 9/133 (6%)

Query: 40  RGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASE 99
            GT+ +  G   G   A+D  +GE +W+      G   G   A  G +++    +G    
Sbjct: 264 EGTVFV--GDRGGGFYALDAASGEQRWR---VETGDWTGSGPAVVGGTVYAANRDGNVYA 318

Query: 100 MKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQ 159
             +E GE +  F+   + + SS  +  E+++ +A+ E  + +L+  E   ++S  +GP+ 
Sbjct: 319 FDAESGEQLWRFQTGGE-VWSSPTVAGERVY-VANREGSVYALDAAEGTNQWSLSIGPVT 376

Query: 160 YVSAS--DGAKII 170
           + S +  DGA  +
Sbjct: 377 FSSPAVVDGALYV 389


>gi|159900554|ref|YP_001546801.1| serine/threonine protein kinase [Herpetosiphon aurantiacus DSM 785]
 gi|159893593|gb|ABX06673.1| serine/threonine protein kinase [Herpetosiphon aurantiacus DSM 785]
          Length = 774

 Score = 38.5 bits (88), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 36/150 (24%), Positives = 65/150 (43%), Gaps = 17/150 (11%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
           ++  G+ +G I+A+D+  G ++WK   +  GG+   A A     L     N + + +  E
Sbjct: 599 MVIFGSGDGSIVALDLFKGGIRWKQ--KLQGGIVSSATANDKMVLVGCMDNNLHA-LDLE 655

Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFSD---------- 153
            G  I +F+ S    SS   +            +  + L+NG++V +++           
Sbjct: 656 GGWPIWKFRTSHYVNSSPIIIGNRAFVGGIDGNIYAVDLKNGKQVWQYNTGAQIVSSPVA 715

Query: 154 DVGPLQYVSASDGAKIIITAGYGEKHLQVW 183
           D G + Y+ A+DG    + AG G     VW
Sbjct: 716 DSGRI-YIGAADGTVYCLDAGSGTP---VW 741


>gi|386852528|ref|YP_006270541.1| Vegetative incompatibility protein HET-E-1 [Actinoplanes sp.
           SE50/110]
 gi|359840032|gb|AEV88473.1| Vegetative incompatibility protein HET-E-1 [Actinoplanes sp.
           SE50/110]
          Length = 395

 Score = 38.5 bits (88), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 39/168 (23%), Positives = 79/168 (47%), Gaps = 13/168 (7%)

Query: 74  GGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVI-REFKASEKPISSSAFLCEEKIFAL 132
           G ++ + F++ G+ L     +      +++ G  I +    S  P+ +++F  + K+ A 
Sbjct: 169 GAVSSVVFSRDGKRLATTAADSTVRLWQADSGRPIGKALIVSPGPVYATSFSPDGKLLAA 228

Query: 133 ASSE--VRILSLENGEEVLK-FSDDVGPLQYVSASDGAKIIITAGYGEKHLQVW----RC 185
           +     VR+ +   GE   +      GP+  ++ S G +++ T+  G+  +++W    R 
Sbjct: 229 SGGGDVVRLWNPATGEPAGEPIVAGPGPVYALTFSRGTRLLATSSGGDDTVRLWDTATRH 288

Query: 186 DISSKTV-NKGPALSMRHSPVAIDCKNSPNG-EDGTVILAVAESGVAY 231
            +++    + GP  +MR  P   D K    G +DGTV L  A +G A+
Sbjct: 289 PVAAPLAGHTGPVRAMRFGP---DGKLLATGSDDGTVRLWDAVTGRAH 333


>gi|403301810|ref|XP_003941571.1| PREDICTED: WD repeat-containing protein 43 [Saimiri boliviensis
           boliviensis]
          Length = 678

 Score = 38.5 bits (88), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 47/220 (21%), Positives = 88/220 (40%), Gaps = 19/220 (8%)

Query: 34  KKRRKE-----RGTLLLALGTSNGDILAVDVLTGEMKWK-STGRHPGGLAGLAFAKKGRS 87
           KKR+ E       T LLALGT+ G IL    + GE+  K  +G H   +  + + + G  
Sbjct: 81  KKRKSEAVGTSNQTDLLALGTAVGSILLYSTVKGELHSKLISGGHDNRVNCIQWHQDGGC 140

Query: 88  LHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEE 147
           L+    +    E   +  +V  ++K     ++S     + K+   A   +++  LE  E 
Sbjct: 141 LYSCSDDKHIVEWNVQTCKVKCKWKGDNSSVTSLCVSPDGKMLLSAGRTIKLWVLETKEV 200

Query: 148 VLKFSDDVGPLQYVSAS-----------DGAK--IIITAGYGEKHLQVWRCDISSKTVNK 194
              F+    P+  +  +           DG      ++    ++ L VW+    +K  + 
Sbjct: 201 YRHFTGHATPVSSLMFTTIRPPNESQPFDGITGLYFLSGAVHDRLLNVWQVRSENKEKSA 260

Query: 195 GPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWD 234
             + ++   PV ID   S N E+   +  V   G  + ++
Sbjct: 261 VMSFTVTDEPVYIDLTLSENKEEPVKLAVVCRDGQVHLFE 300


>gi|269961433|ref|ZP_06175797.1| conserved hypothetical protein [Vibrio harveyi 1DA3]
 gi|269833810|gb|EEZ87905.1| conserved hypothetical protein [Vibrio harveyi 1DA3]
          Length = 383

 Score = 38.5 bits (88), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 48/205 (23%), Positives = 91/205 (44%), Gaps = 31/205 (15%)

Query: 47  LGTSNGDILAVDVLTGE-MKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMG 105
           LG SNG  + V+++TG  +++ +   H   +  +A +  GR     G +  A    +E G
Sbjct: 192 LGLSNGKAIHVNLVTGRRLEFLA---HQEKVNSVAISPNGRFALTGGNDYKAYLWDTETG 248

Query: 106 EVIREFKASEKPISSSAFLCEEKIFALAS---SEVRILSLENGEEVLKFSDDVGPLQYVS 162
           +V+R F+  ++ +  +  L  +   A+ S   ++  I  LE GEE+ + S     L + +
Sbjct: 249 QVLRSFEHEQRVVRVA--LQRDGKLAMTSDGGNQAIIWDLETGEELTQLSSWSRQLIFST 306

Query: 163 A--SDGAKIIITAGYGEKHLQVW---------RCDISSKTVNKGPALSMRHSPVAIDCKN 211
           A  SD  ++++T G     + VW         R ++  K   + P   +  +  A D K 
Sbjct: 307 ARFSDDGRLLVT-GTPSGRVMVWDTHTGKRVDRFEVEPKKDTRPPRAVVYDA--AFDSKQ 363

Query: 212 SPNGEDGTVILAVAESGVAYSWDLK 236
                    I+    +G+A +W L+
Sbjct: 364 R--------IITATSAGIAQAWQLE 380


>gi|162456674|ref|YP_001619041.1| WD repeat-containing protein [Sorangium cellulosum So ce56]
 gi|161167256|emb|CAN98561.1| WD-repeat protein [Sorangium cellulosum So ce56]
          Length = 1759

 Score = 38.5 bits (88), Expect = 4.4,   Method: Composition-based stats.
 Identities = 42/200 (21%), Positives = 83/200 (41%), Gaps = 20/200 (10%)

Query: 44   LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
            +LA G+++  +   +V +G       G H      + F+  GR+L     +      + E
Sbjct: 1496 MLASGSNDTTVRLWEVESGRALRVFEG-HGKAATSVVFSPDGRTLASGSNDTTVRLWEVE 1554

Query: 104  MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYV 161
             G V+R F    K ++S  F  + +  A  S++  VR+  +E+G  +L F D       V
Sbjct: 1555 SGRVLRTFGGHGKVVTSVVFSPDGRTLASGSNDTTVRLWEVESGRALLVFEDHGKGATSV 1614

Query: 162  SASDGAKIIITAGYGEKHLQVWRCD----ISSKTVNKGPALSMRHSPVAIDCKNSPNGED 217
            + S   + + +  Y +  +++W       + +   +  P +S+  SP            D
Sbjct: 1615 AFSPDGRTLASGSY-DTMVRLWEAGSGRFLGALRGHTAPVVSVSFSP------------D 1661

Query: 218  GTVILAVAESGVAYSWDLKT 237
            GT++ + +  G    W + T
Sbjct: 1662 GTLLASASSDGTLRLWRVAT 1681


>gi|367038693|ref|XP_003649727.1| hypothetical protein THITE_2108573 [Thielavia terrestris NRRL 8126]
 gi|346996988|gb|AEO63391.1| hypothetical protein THITE_2108573 [Thielavia terrestris NRRL 8126]
          Length = 554

 Score = 38.5 bits (88), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 19/63 (30%), Positives = 31/63 (49%)

Query: 72  HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFA 131
           HPGG+  +A++   + L   G N +A     + GE +      E+PISS  +  + + F 
Sbjct: 314 HPGGIGSVAWSPDSKLLVSCGKNHVAKVWHVDTGECVSTLSGFEEPISSCVWPADGRTFV 373

Query: 132 LAS 134
           L S
Sbjct: 374 LGS 376


>gi|73670605|ref|YP_306620.1| hypothetical protein Mbar_A3154 [Methanosarcina barkeri str.
           Fusaro]
 gi|72397767|gb|AAZ72040.1| hypothetical protein Mbar_A3154 [Methanosarcina barkeri str.
           Fusaro]
          Length = 631

 Score = 38.5 bits (88), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 26/99 (26%), Positives = 51/99 (51%), Gaps = 7/99 (7%)

Query: 50  SNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIR 109
           +NG ILA+ V +GE  WK+     G  A +   K  + L   G + +   + +  GEV++
Sbjct: 397 NNGTILALSVESGEEVWKTNA---GEWADIIEVKNDKMLVNTGKSFL---LNASTGEVLK 450

Query: 110 EFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEV 148
           E    E P++SS  + ++ +++     +R+ + + G+ V
Sbjct: 451 ELPYPELPVASS-MITDKYVYSTGPYNIRVFNSDTGKPV 488


>gi|166365267|ref|YP_001657540.1| WD repeat-containing protein [Microcystis aeruginosa NIES-843]
 gi|166087640|dbj|BAG02348.1| WD-repeat protein Hat [Microcystis aeruginosa NIES-843]
          Length = 1108

 Score = 38.5 bits (88), Expect = 4.6,   Method: Compositional matrix adjust.
 Identities = 50/199 (25%), Positives = 80/199 (40%), Gaps = 26/199 (13%)

Query: 44   LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
            ++A+   +G I  +D     ++  +T      +  +AF      + + G NG   ++ S+
Sbjct: 842  IIAIANKDGQITLLDSQGKNIREFATKMR--SIYSIAFHPDSNQMAITGRNGKV-QIWSQ 898

Query: 104  MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFS---DDVGPL 158
             G +++EF AS+ PI S AF  E       +SE  V+   L N    L  S   DD    
Sbjct: 899  KGTMLQEFTASQVPIYSLAFNGEGTAIITGTSEGKVQYWHLSNYRPQLINSWTADDNIIY 958

Query: 159  QYVSASDGAKIIITAGYGEKHLQVWRCD---ISSKTVNKGPALSMRHSPVAIDCKNSPNG 215
              V + D  KI  TA  G+  +++W            +  P   +  SP           
Sbjct: 959  DLVFSPDHQKIA-TATRGK--IKIWDLQGNLFEEIKTDSFPVYGVSFSP----------- 1004

Query: 216  EDGTVILAVAESGVAYSWD 234
             DG  I A++  G A  WD
Sbjct: 1005 -DGEKIAAISRDGTARRWD 1022


>gi|24652208|ref|NP_610526.1| CG1671 [Drosophila melanogaster]
 gi|7303841|gb|AAF58888.1| CG1671 [Drosophila melanogaster]
 gi|28416343|gb|AAO42644.1| LD47550p [Drosophila melanogaster]
 gi|220946482|gb|ACL85784.1| CG1671-PA [synthetic construct]
          Length = 787

 Score = 38.5 bits (88), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 52/245 (21%), Positives = 102/245 (41%), Gaps = 20/245 (8%)

Query: 62  GEMKWKSTGRHPGGLAGLAFAKKG-RSLHVVGTNGMAS-EMKSEMGEVIREFKASEKPIS 119
           G++ W   G H   L G A  +   ++  ++ + G++S    ++    +      E  I+
Sbjct: 23  GDIAWSPNGHHLYCLNGSAVNQVDVQTSQILNSYGLSSPSADNKRPAELDNIDVEEDTIT 82

Query: 120 SSAFLCEEKIFALASSEVRILSLENGEEV-LKFSDDVGPLQYVSASDGAKIIITAGYGEK 178
                 E  + A  S  +R+  L+ G+ V L  +   GP+  V  S   ++I T+G  + 
Sbjct: 83  CFGLSQEHLVTAHRSGLLRLWQLDTGKLVKLWKAQHKGPVIRVEFSPCGRLICTSGGADA 142

Query: 179 HLQVWRCDISSKTV------NKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYS 232
            L++W  D S+ +         GPA  +   P  +  +    G D TV         A++
Sbjct: 143 TLRLW--DYSNNSCLGALKDFPGPAWLLVFHPNVLRKEIYAGGSDNTV--------YAWN 192

Query: 233 WDLKTVSQDEKTNPAKIT-VKLKKADADQQNSVNVKKSRTSIIAARLNDLEADGQKTITG 291
           ++ KT+    + + +++T +  K   +D    V V + +  I+      LE+   K +  
Sbjct: 193 YETKTLVHKMRGHLSQVTGLSFKSNASDCNEIVTVSRDKVMIVWQLQEQLESKQLKVLPL 252

Query: 292 LHDLE 296
             +LE
Sbjct: 253 YDELE 257


>gi|50547673|ref|XP_501306.1| YALI0C00913p [Yarrowia lipolytica]
 gi|49647173|emb|CAG81601.1| YALI0C00913p [Yarrowia lipolytica CLIB122]
          Length = 912

 Score = 38.5 bits (88), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 64/265 (24%), Positives = 123/265 (46%), Gaps = 31/265 (11%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGT---NGMASEM 100
           LLA G ++G I   D  +G++     G H   +  L F   G  L V G+   N +  ++
Sbjct: 88  LLAAGYTDGSIRVFDYRSGQVMTTFKG-HKSSITCLEFDASGTRL-VSGSRDSNVILWDL 145

Query: 101 KSEMGEVIREFKASEKPISSSAFLCEEKIFALASS-EVRILSLENGEEVLKFSDDVGPLQ 159
             E+G  +   ++    ++   FL E+K+   A    V+I  LE+ + +      VG  +
Sbjct: 146 VEEVG--VSRLRSHRDQVTGVKFLDEKKLITCAKDGLVKIWDLESRDCI---ETRVGGSE 200

Query: 160 YVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHS------PVAIDCKNSP 213
             +    +  ++T G  EK ++VW+ D+ +K    G  LS R S         ID + S 
Sbjct: 201 VWAMDVYSDTVMTVGR-EKEVKVWQVDLDAK---DGEMLSQRGSLERVSNDRGIDIEFSK 256

Query: 214 NGEDGTVILAVAESGVAYSWDLKTVSQDEKTNPAKITVKLKKADA---DQQNSVNVKKSR 270
           +G  G VI +  +  V   W  +T+++ +K+   KI  ++KK D    D+ +  +V +  
Sbjct: 257 DG--GHVIFSGNDKTVEL-WRRRTLAEIKKSIAKKIKRRVKKGDTVDEDEYDEKSVNEIW 313

Query: 271 TSII----AARLNDLEADGQKTITG 291
             ++    +A++ ++   G ++++G
Sbjct: 314 VQVVNFKTSAKIRNVTWTGSESVSG 338


>gi|380018983|ref|XP_003693398.1| PREDICTED: LOW QUALITY PROTEIN: WD repeat-containing protein
           43-like [Apis florea]
          Length = 623

 Score = 38.5 bits (88), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 59/270 (21%), Positives = 110/270 (40%), Gaps = 48/270 (17%)

Query: 34  KKRRK-------ERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAG------LA 80
           K+RRK       ++G  ++A+G++NG I   D  T  +  +  G H G +        + 
Sbjct: 71  KRRRKSISEESIQKG--IIAMGSTNGKITLYDTTTSSVTAQLDG-HSGSVTAATWSENVG 127

Query: 81  FAKKGRSLHVVG----TNGMASEMKSEMGEVIREFKASEKPISSSA--FLCEEKIFALAS 134
           F       H++      NG+  + KS  G      K +   +S+     L  E++     
Sbjct: 128 FITASDDHHIIQWNFQENGVKCKWKSGKG------KTTSLAVSTDGNNLLSGERV----- 176

Query: 135 SEVRILSLENGEEVLKFS---DDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKT 191
             V+   L     +  F+   + V  L  +  + G   +I+   G+ +L VW  D     
Sbjct: 177 --VKWWDLNTKRLIRTFTGHANQVTCLHTIKMTSGNNYVISGACGDGYLSVWALDEQRNE 234

Query: 192 VNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQDEKTNPAKITV 251
                +L+++  P+++    S N +   ++L V  SG A    L     + ++ P K ++
Sbjct: 235 RTSVASLALQDDPISVSANVSANSQ--VMVLVVTRSGQA---QLFQYQPNGRSKPLKPSL 289

Query: 252 KLKKA-DADQQNSVNVKKSRTSIIAARLND 280
            +  A D  Q++ V     + SI+ A L D
Sbjct: 290 NIAVASDISQKDEVQ----QISILNAMLTD 315


>gi|33861590|ref|NP_893151.1| hypothetical protein PMM1034 [Prochlorococcus marinus subsp.
           pastoris str. CCMP1986]
 gi|33634167|emb|CAE19493.1| G-protein beta WD-40 repeats [Prochlorococcus marinus subsp.
           pastoris str. CCMP1986]
          Length = 357

 Score = 38.5 bits (88), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 43/183 (23%), Positives = 77/183 (42%), Gaps = 17/183 (9%)

Query: 27  LACGFVGKKRRKERGTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGR 86
           + CG+  K +       L  +G   G +   D  TG++ WK    H GGL  ++    G 
Sbjct: 26  IVCGWALKGK-------LFIVGDVVGGLYGFDGDTGKLIWKQDEAHSGGLLAMSIHPNGE 78

Query: 87  SLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGE 146
                G +G+   +    GE I+  K  +  +   ++       A+A S+   +  ++G 
Sbjct: 79  IFATSGQDGIVRILNCNNGEEIKAIKLGKGWVEHLSWSNGGLYLAVAFSKKVYVFNQDGN 138

Query: 147 EVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVN-----KGPALSMR 201
           E     +    +  +S S+  + + TA YG    +V   DI++K  N     +G  +SM+
Sbjct: 139 EQWGSEEHPSTVSAISWSNKNE-LATACYG----RVTFFDIANKKTNQKLEWQGSLVSMQ 193

Query: 202 HSP 204
            SP
Sbjct: 194 LSP 196


>gi|384109844|ref|ZP_10010704.1| ABC-type xylose transport system, periplasmic component [Treponema
           sp. JC4]
 gi|383868594|gb|EID84233.1| ABC-type xylose transport system, periplasmic component [Treponema
           sp. JC4]
          Length = 1177

 Score = 38.5 bits (88), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 32/130 (24%), Positives = 60/130 (46%), Gaps = 4/130 (3%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
            +A G+S+  +   D  TG +    +  H   +A ++++K GR +     +   +++   
Sbjct: 148 YIASGSSDKSVKFWDAATGMLLQTLSDTHTKTVAAISYSKSGRYVATASWDN-TTKIYHA 206

Query: 104 MGEVIREFKASEK-PISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQY 160
           MG + R         + +  F  +EK  A  S++  +RI  +ENG+ V   +D  G +  
Sbjct: 207 MGGIERNLLTGHNGAVYAVEFSPDEKYIATGSADNSIRIYDVENGKFVRAITDITGEVWT 266

Query: 161 VSASDGAKII 170
           +S S   KII
Sbjct: 267 ISYSPDGKII 276


>gi|430813347|emb|CCJ29317.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 887

 Score = 38.5 bits (88), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 26/98 (26%), Positives = 48/98 (48%), Gaps = 12/98 (12%)

Query: 43  LLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKS 102
           +LL +G SNGDIL       ++  + +G H G + GLAF+   + +   G +G       
Sbjct: 117 ILLCIGLSNGDILVFSTTQDKVILRLSGGHTGDVYGLAFSGDTQ-IWSCGADGK------ 169

Query: 103 EMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRIL 140
                + +++  E  I S++ +C E + AL  S+  ++
Sbjct: 170 -----VVQWELFESKILSTSTVCTESLLALVYSDNEVI 202


>gi|400598576|gb|EJP66285.1| transcriptional repressor TUP1 [Beauveria bassiana ARSEF 2860]
          Length = 1055

 Score = 38.5 bits (88), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 23/68 (33%), Positives = 37/68 (54%), Gaps = 1/68 (1%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
           L+A G   GDIL  D+ +G  K + +G H GG+  LAF+  G+ L   G +       +E
Sbjct: 140 LVATGGGGGDILIWDMESGSRKHRISG-HDGGVRALAFSNDGKVLASSGGDQTVCLWDAE 198

Query: 104 MGEVIREF 111
             ++I++F
Sbjct: 199 SADLIKKF 206


>gi|398788810|ref|ZP_10550903.1| hypothetical protein SU9_31091 [Streptomyces auratus AGR0001]
 gi|396991853|gb|EJJ02976.1| hypothetical protein SU9_31091 [Streptomyces auratus AGR0001]
          Length = 1980

 Score = 38.5 bits (88), Expect = 4.9,   Method: Composition-based stats.
 Identities = 35/114 (30%), Positives = 52/114 (45%), Gaps = 4/114 (3%)

Query: 25   SCLACGFVGKKRRKERGT--LLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFA 82
            S  AC F  + R   R T  +L+ +G + G +   D  TG+ +    G H   +A LAF+
Sbjct: 1585 SVWACRFRPRSRFGARETDPMLVTIG-NEGLVRLWDTSTGQGRRILRG-HGRRVASLAFS 1642

Query: 83   KKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSE 136
              G  L   G +G+A   +S  G   REF      + S+ F+      A ASS+
Sbjct: 1643 PDGTHLAACGNDGVARVWESATGRRTREFAGDHDRLVSALFVPGSHQLATASSD 1696


>gi|390441802|ref|ZP_10229833.1| conserved hypothetical protein [Microcystis sp. T1-4]
 gi|389834856|emb|CCI33959.1| conserved hypothetical protein [Microcystis sp. T1-4]
          Length = 1107

 Score = 38.5 bits (88), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 45/168 (26%), Positives = 69/168 (41%), Gaps = 24/168 (14%)

Query: 76   LAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASS 135
            +  +AF      + + G NG   ++ S+ G +++EF AS+ PI S AF  E       +S
Sbjct: 872  IYSIAFHPDDNQIAITGRNGKV-QIWSKKGTMLQEFTASQVPIYSLAFNGEGTAIITGTS 930

Query: 136  E--VRILSLENGEEVLKFS---DDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCD---I 187
            E  V+   L N    L  S   DD      V + D  KI  TA  G+  +++W      +
Sbjct: 931  EGKVQYWHLSNHRTKLINSWTVDDSIIYDLVFSPDHQKIA-TATRGK--IKIWDLQGNIL 987

Query: 188  SSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDL 235
                 +  P   +  SP            DG  I A++  G A  WD+
Sbjct: 988  KEIKTDSFPVYGVSFSP------------DGEKIAAISRDGTARRWDI 1023


>gi|425452441|ref|ZP_18832258.1| conserved hypothetical protein [Microcystis aeruginosa PCC 7941]
 gi|389765762|emb|CCI08435.1| conserved hypothetical protein [Microcystis aeruginosa PCC 7941]
          Length = 1108

 Score = 38.5 bits (88), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 50/199 (25%), Positives = 82/199 (41%), Gaps = 26/199 (13%)

Query: 44   LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
            ++A+   +G I  +D    +++  +T      +  +AF      + + G NG   ++ S+
Sbjct: 842  IIAIANKDGQITLLDSQGKKIREFATKMR--SIYSIAFHPDSNQIAITGRNGKV-QIWSQ 898

Query: 104  MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFS---DDVGPL 158
             G +++EF AS+ PI S AF  E       +SE  V+   L N    L  S   DD    
Sbjct: 899  KGTMLQEFTASQVPIYSLAFNGEGTAIITGTSEGKVQYWHLGNYRPQLINSWTADDNIIY 958

Query: 159  QYVSASDGAKIIITAGYGEKHLQVWRCD---ISSKTVNKGPALSMRHSPVAIDCKNSPNG 215
              V + D  KI  TA  G+  +++W      +     +  P   +  SP           
Sbjct: 959  DLVFSPDHQKIA-TATRGK--IKIWDLQGNLLKEIKTDSFPVYGVSFSP----------- 1004

Query: 216  EDGTVILAVAESGVAYSWD 234
             DG  I A++  G A  WD
Sbjct: 1005 -DGEKIAAISRDGTARRWD 1022


>gi|254501092|ref|ZP_05113243.1| Cytochrome c subfamily, putative [Labrenzia alexandrii DFL-11]
 gi|222437163|gb|EEE43842.1| Cytochrome c subfamily, putative [Labrenzia alexandrii DFL-11]
          Length = 370

 Score = 38.5 bits (88), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 47/212 (22%), Positives = 89/212 (41%), Gaps = 43/212 (20%)

Query: 72  HPGGLAGLAFAKKGRSLHVVGTNG----MASEMKSE--------MG-EVIREFKASEKPI 118
           H  G+  +AF K G+SL+    +G       E ++E         G  V++    ++ P 
Sbjct: 76  HNAGVNAVAFTKDGKSLYSASMDGSIILWDVEERAEKRVVDRNGFGINVLKLGGPADGPD 135

Query: 119 SSSAFLCEEKIFALASSEVRILSLENGEEVLKFSDDVGPLQYVSAS-DGAKIIITAGYGE 177
           +  A+  ++ +        RIL LE G+ + +FS +  P+  +  S DG +  + +G G+
Sbjct: 136 TWLAYGAQDGV-------TRILDLETGDRLYEFSLERRPILGLDVSPDGTR--LASGDGQ 186

Query: 178 KHLQV-----WRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYS 232
            ++ V     W  +   +   KGP  ++  S             DG+V+ A     + YS
Sbjct: 187 GYITVFETGSWGIETDFRAALKGPIWALAFS------------NDGSVLHAAGIENIVYS 234

Query: 233 W---DLKTVSQDEKTNPAKITVKLKKADADQQ 261
           W   DL +        PA +    + ++ ++Q
Sbjct: 235 WPVADLSSFDPMATQTPAFLRKPEEMSNGERQ 266


>gi|113478009|ref|YP_724070.1| WD-40 repeat-containing protein [Trichodesmium erythraeum IMS101]
 gi|110169057|gb|ABG53597.1| WD-40 repeat [Trichodesmium erythraeum IMS101]
          Length = 1510

 Score = 38.5 bits (88), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 57/240 (23%), Positives = 94/240 (39%), Gaps = 38/240 (15%)

Query: 1    IWSTNDGSLLAEWKQPDGEPVVSYS----CLACGFVGKKRRKERGTLLLALGTSNGDILA 56
            +W T +G++LA     D    V++S     +A     K  R           T NG +LA
Sbjct: 1116 LWDTENGNVLATLLHQDLVIAVAFSPDGKTIATASWDKTAR--------LWDTENGKVLA 1167

Query: 57   VDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEK 116
                        T  H   +  +AF+  G+++    ++  A    +E G+V+      + 
Sbjct: 1168 ------------TLNHQSSVRAVAFSPDGKTIATASSDKTARLWDTENGKVLATLN-HQS 1214

Query: 117  PISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAG 174
             +++ AF  + K  A ASS+   R+   ENG +VL   +    ++ V+ S   K I TA 
Sbjct: 1215 SVNAVAFSPDGKTIATASSDKTARLWDTENG-KVLATLNHQSSVRAVAFSPDGKTIATAS 1273

Query: 175  YGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWD 234
              +K  ++W       T N     ++ H         SP   DG  I   +    A  WD
Sbjct: 1274 -SDKTARLW------DTENGKVLATLNHQSRVFAVAFSP---DGKTIATASSDKTARLWD 1323


>gi|329890618|ref|ZP_08268961.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
           protein [Brevundimonas diminuta ATCC 11568]
 gi|328845919|gb|EGF95483.1| histidine kinase-, DNA gyrase B-, and HSP90-like ATPase family
           protein [Brevundimonas diminuta ATCC 11568]
          Length = 642

 Score = 38.5 bits (88), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 30/97 (30%), Positives = 51/97 (52%), Gaps = 2/97 (2%)

Query: 215 GEDGTVILAVAESGVAYSWD-LKTVSQDEKTNPAKITVKLKKADADQQNSVNVKKSRTSI 273
           G +G  IL VA   VA+ +  L T S+      A+    LK+AD  ++ + +V   R+++
Sbjct: 170 GVEGRFILPVAIGVVAFIFFCLTTRSRLHAAGEAEAR-ALKEADGKRRQAESVMAGRSAL 228

Query: 274 IAARLNDLEADGQKTITGLHDLESEAATASAQNEKKS 310
           +AA  +DL       +TG H+L   AA +SA  ++ +
Sbjct: 229 LAAVAHDLRTPIGAILTGAHELTRVAAPSSATRQQTA 265


>gi|401401154|ref|XP_003880944.1| putative pre-mRNA splicing factor [Neospora caninum Liverpool]
 gi|325115356|emb|CBZ50911.1| putative pre-mRNA splicing factor [Neospora caninum Liverpool]
          Length = 612

 Score = 38.5 bits (88), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 31/150 (20%), Positives = 68/150 (45%), Gaps = 6/150 (4%)

Query: 41  GTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEM 100
           G LLL+  + +  +   DVL     +++   H   +  + +A++GR  +    +      
Sbjct: 336 GHLLLS-ASMDSTVKIWDVLNQRKLYRTYTAHKQAVRDIQWAEEGRRFYSCSFDNTVKLW 394

Query: 101 KSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLE--NGEEVLKFSDDVGPL 158
            +E G+VI  F   + P   +    +  +F + S+  R +  +   G   +++++ +G +
Sbjct: 395 DTEAGKVIGSFGNGKTPYCVTVNPNDNNVFVVGSANRRAVQFDARTGNIEVEYAEHIGSV 454

Query: 159 QYVS-ASDGAKIIITAGYGEKHLQVWRCDI 187
             V+   +G +++ TA   +K L VW   I
Sbjct: 455 NTVTFCEEGRRLVTTA--DDKKLFVWEYGI 482


>gi|321466195|gb|EFX77192.1| hypothetical protein DAPPUDRAFT_305906 [Daphnia pulex]
          Length = 1099

 Score = 38.5 bits (88), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 38/169 (22%), Positives = 75/169 (44%), Gaps = 16/169 (9%)

Query: 40  RGTLLLALGTSNGDILAVDVLTGEMKWK-------STGRHPGGLAGLAFAKKGRSLHVVG 92
           R  LL+ + T  G+++A++  TG++KWK          +       L    K  SL+++G
Sbjct: 48  RDDLLVLVSTLEGNLIAINKRTGKVKWKLEDEPVIKLSKELSKTFNLLPDPKDGSLYMLG 107

Query: 93  TNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEK-------IFALASSEVRILSLENG 145
            +G  +E  +++   I E  ++    SS   L   K       I  L   +  +LS +  
Sbjct: 108 NSG--AEALTKLPFTIPELVSASPSQSSDGMLYMGKKLDMWFVIDPLTGEKQEVLSFQGL 165

Query: 146 EEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNK 194
           E     +  +GP  ++  ++ + I++ +   E+H  V   D +S T+ +
Sbjct: 166 ETACPRNKPLGPSIFIGRTEYSLILLDSRTRERHWNVTYFDYTSSTLGQ 214


>gi|332661895|ref|YP_004451364.1| WD40 repeat-containing protein [Haliscomenobacter hydrossis DSM
           1100]
 gi|332337392|gb|AEE54491.1| WD40 repeat-containing protein [Haliscomenobacter hydrossis DSM
           1100]
          Length = 598

 Score = 38.5 bits (88), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 42/172 (24%), Positives = 74/172 (43%), Gaps = 13/172 (7%)

Query: 72  HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFA 131
           H   L+ +AF+  G+ L    ++  A     E G+ +   K     +SS AF  + K  A
Sbjct: 184 HSSYLSSVAFSPDGKRLATGSSDHSAKIWDVESGKQVLSLKGHSSYVSSVAFSPDGKRLA 243

Query: 132 LASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISS 189
             S +   +I  +E+G++ L        +  V+ S   K + T G G+K  ++W  +   
Sbjct: 244 TGSDDKSAKIWDVESGKQTLSLEGHSSYVSSVAFSPDGKRLAT-GSGDKSAKIWDVESGK 302

Query: 190 KTVNKGPALSMR-HSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQ 240
           +T      LS+  HS        SP   DG  ++  ++   A  WD+++  Q
Sbjct: 303 QT------LSLEGHSDYVWSVAFSP---DGKRLVTGSQDQSAKIWDVESGKQ 345


>gi|119490763|ref|ZP_01623095.1| WD-repeat protein [Lyngbya sp. PCC 8106]
 gi|119453747|gb|EAW34905.1| WD-repeat protein [Lyngbya sp. PCC 8106]
          Length = 991

 Score = 38.5 bits (88), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 22/72 (30%), Positives = 36/72 (50%), Gaps = 1/72 (1%)

Query: 45  LALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEM 104
           LA  + +G I   D+ TG  +   +G H GG+ G+ ++ +G+ L   G +GM        
Sbjct: 686 LASSSKDGTIRLWDIQTGHCRQVWSGHH-GGVNGIVWSHRGQMLATCGEDGMIRLWDVRR 744

Query: 105 GEVIREFKASEK 116
           G V RE +A  +
Sbjct: 745 GRVYREMEAKSR 756


>gi|390604124|gb|EIN13515.1| WD40 repeat-containing protein [Punctularia strigosozonata
           HHB-11173 SS5]
          Length = 291

 Score = 38.1 bits (87), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 42/180 (23%), Positives = 80/180 (44%), Gaps = 21/180 (11%)

Query: 76  LAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASS 135
           ++ + F+  G+ L     + +     +  GE+I+      + IS  ++  + +  A AS 
Sbjct: 3   ISSVEFSPDGKVLASAAADKLVKLWDTATGEIIKTLVGHTEGISDVSWSPDGEFLASASD 62

Query: 136 E--VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDIS-SKTV 192
           +  +RI SLE G            +   S +  + ++++ G+ E  +++W  DI+  K++
Sbjct: 63  DKTIRIWSLETGLTAKVLLGHTNFVFCASFNPKSNLLVSGGFDET-VRIW--DIARGKSI 119

Query: 193 NKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWD------LKTVSQDEKTNP 246
              PA S   + V           DGT+I++ A  G+   WD      LKT+  D+  NP
Sbjct: 120 KVLPAHSDPVTAVGF-------SHDGTLIVSCAMDGLIRVWDTNSGQCLKTIVDDD--NP 170


>gi|449133882|ref|ZP_21769395.1| WD-containing repeat protein [Rhodopirellula europaea 6C]
 gi|448887443|gb|EMB17819.1| WD-containing repeat protein [Rhodopirellula europaea 6C]
          Length = 1173

 Score = 38.1 bits (87), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 41/152 (26%), Positives = 62/152 (40%), Gaps = 10/152 (6%)

Query: 85   GRSLHVVGTNGMASEMKSEMGEVIREFKA-SEKPISSSAFLCEEKIFALASSEVRILSLE 143
            G+ L   G +  A    +E G+++ E    ++  IS  AF  + K  A     V I + E
Sbjct: 921  GQRLVSGGWDATACLWDTETGDLLLELDTKTDSSISDVAFTADGKHIATTGDSVNIWNAE 980

Query: 144  NGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHS 203
             G+  L F++    +  VS SD  + + T G  +K   VW  +     + K P     H 
Sbjct: 981  TGKHELAFAEHSDHVFAVSFSDDGRWVATGGAKDKTAMVWDRETGEVKL-KLPD----HR 1035

Query: 204  PVAIDCKNSPNGEDGTVILAVAESGVAYSWDL 235
                D   +PN E+    LA   SG    W L
Sbjct: 1036 FWVYDLCFTPNNEE----LATVASGTVTIWRL 1063


>gi|443316153|ref|ZP_21045609.1| WD40 repeat-containing protein [Leptolyngbya sp. PCC 6406]
 gi|442784253|gb|ELR94137.1| WD40 repeat-containing protein [Leptolyngbya sp. PCC 6406]
          Length = 686

 Score = 38.1 bits (87), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 25/105 (23%), Positives = 52/105 (49%), Gaps = 3/105 (2%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
           +L  GT+ G + A+D+ +G  +W+ T   P     LA    G+ + + G +G    ++++
Sbjct: 459 ILYSGTAAGTLQAIDIDSGIPRWEYTAA-PTAFNVLARTPDGQQIMIGGADGTIHIVQAQ 517

Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGE 146
            GE++R   A +  +++ A   + +      S+  + I +L  GE
Sbjct: 518 TGELLRTISAHQGAVNTLAATADGQALISGGSDRHIYIWTLATGE 562


>gi|168701051|ref|ZP_02733328.1| NB-ARC domain protein [Gemmata obscuriglobus UQM 2246]
          Length = 491

 Score = 38.1 bits (87), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 27/108 (25%), Positives = 50/108 (46%), Gaps = 2/108 (1%)

Query: 67  KSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCE 126
           KS   H   + G+ +   G+ L   G +        E GE IR+ +  +K +++ AF+ +
Sbjct: 350 KSFEGHTHHVMGVGWTPDGKKLASCGADNFVKVWDYEKGEKIRDMQGHQKQVTALAFVGK 409

Query: 127 EKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIIT 172
              F   S +  VR+ + +NG  V  F      +  VSAS   +++++
Sbjct: 410 TSQFVTGSGDASVRMWNADNGGNVRSFPGASDFVYAVSASPDGEVVVS 457


>gi|296421914|ref|XP_002840508.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295636726|emb|CAZ84699.1| unnamed protein product [Tuber melanosporum]
          Length = 333

 Score = 38.1 bits (87), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 38/133 (28%), Positives = 59/133 (44%), Gaps = 20/133 (15%)

Query: 144 NGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHS 203
           NGE V  +   VG L  V   D A  ++  G  +  +++W        V  G  L M   
Sbjct: 41  NGERVGTYKGHVGALWTVDV-DPATTLLATGSADNTIRIW-------DVRTGKQLKMWEF 92

Query: 204 PVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQDEKTNPAKITVKLKKADADQQN- 262
           P A+  K     EDGT++LAV E  + +   L TV          ITV L+    D+ N 
Sbjct: 93  PTAV--KRVEFSEDGTLLLAVTEQRMGH---LGTVV------VFPITVDLEAEQVDEPNF 141

Query: 263 SVNVKKSRTSIIA 275
           ++  ++S+ ++ A
Sbjct: 142 TITFRESKATVAA 154


>gi|353243507|emb|CCA75040.1| related to WD40-repeat protein (notchless protein) [Piriformospora
           indica DSM 11827]
          Length = 1186

 Score = 38.1 bits (87), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 40/159 (25%), Positives = 75/159 (47%), Gaps = 33/159 (20%)

Query: 137 VRILSLENGEEVLK-FSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKG 195
           +RI ++  G+++ + F    G +  ++ S    ++++A   +  +Q+W  D+ S   +K 
Sbjct: 851 IRIWNVNTGQQMGRPFQGHKGSVWAIAFSPDGSLLVSAS-EDNTIQIW--DVESGRPSK- 906

Query: 196 PALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKTVSQDEKTNPAKITVKLKK 255
            ALS RH  +      SP   DG++I++V+E  +   WD+ T S      P    ++ + 
Sbjct: 907 -ALSRRHKDLITSVAFSP---DGSLIVSVSEDKIIRLWDVYTGS------PWGELLQGQP 956

Query: 256 ADADQQNSVNVKKSRTSIIAARLNDLEADGQKTITGLHD 294
            DA              +IA     + +DG + I+GLHD
Sbjct: 957 VDA-------------PVIA-----ISSDGSRIISGLHD 977


>gi|171679856|ref|XP_001904874.1| hypothetical protein [Podospora anserina S mat+]
 gi|170939554|emb|CAP64781.1| unnamed protein product [Podospora anserina S mat+]
          Length = 1108

 Score = 38.1 bits (87), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 39/169 (23%), Positives = 78/169 (46%), Gaps = 13/169 (7%)

Query: 72  HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFA 131
           H G +  +AF+  G  +     +G     +++ G+ +R+ +     + S AF  +     
Sbjct: 796 HSGSVRSVAFSPDGSRIVSASDDGTIRIWEAKSGKEVRKLEGHSNWVRSVAFSPDSSRIV 855

Query: 132 LASSE--VRILSLENGEEVLKFSDDVGPLQYVSAS-DGAKIIITAGYGEKHLQVWRCDIS 188
            AS +  +RI   ++G+EV K     G ++ V+ S DG++I+  +   ++ +++W    S
Sbjct: 856 SASDDGTIRIWEAKSGKEVRKLEGHSGSVRSVAFSPDGSRIV--SASNDQTIRIWEAK-S 912

Query: 189 SKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKT 237
            K V K       HS + +    SP   DG+ I++ +       W+ K+
Sbjct: 913 GKEVRKLEG----HSGLVLSVAFSP---DGSRIVSASNDQTIRIWEAKS 954


>gi|442323770|ref|YP_007363791.1| hypothetical protein MYSTI_06834 [Myxococcus stipitatus DSM 14675]
 gi|441491412|gb|AGC48107.1| hypothetical protein MYSTI_06834 [Myxococcus stipitatus DSM 14675]
          Length = 816

 Score = 38.1 bits (87), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 35/145 (24%), Positives = 66/145 (45%), Gaps = 7/145 (4%)

Query: 58  DVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKP 117
           DV+TG+      G   G + GLA++  G+ L     +  A     E GE++ +F   E  
Sbjct: 182 DVVTGKCLRVLEGHDIGEILGLAWSPDGKKLASGSRSHDARVWDVETGELLHDFPKQEGR 241

Query: 118 ISSSAFLCEEKIFALASS--EVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGY 175
           ++S AF  + K+ A+ +    V +  LE+GE+V         +  V+     +++ +A  
Sbjct: 242 VTSVAFSPDGKLLAVGNLGWRVHLFDLESGEKVRTLKGHQQSVLCVAFHPSGRLLASAA- 300

Query: 176 GEKHLQVWR----CDISSKTVNKGP 196
            +  +++W       ++S T N  P
Sbjct: 301 SDDTVRIWDMTTGAQVASITTNATP 325


>gi|357393107|ref|YP_004907948.1| hypothetical protein KSE_62270 [Kitasatospora setae KM-6054]
 gi|311899584|dbj|BAJ31992.1| hypothetical protein KSE_62270 [Kitasatospora setae KM-6054]
          Length = 1389

 Score = 38.1 bits (87), Expect = 6.2,   Method: Composition-based stats.
 Identities = 42/170 (24%), Positives = 66/170 (38%), Gaps = 15/170 (8%)

Query: 49  TSNGDILAVDVLTGEMKWK-STGRHPGGLAG-------LAFAKKGRSLHVVGTNGMASEM 100
           + +G  LA       + W  +TGR    LAG       LAF+  GR+L   G +      
Sbjct: 831 SPDGRTLATGGEGAALLWDVATGRTTATLAGFTGAVFSLAFSPDGRTLATGGWDRTVRLW 890

Query: 101 KSEMGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPL 158
               G            ++S AF  +    A AS +   R+  +  G     F++  GP+
Sbjct: 891 DPATGRTTATLTGHTANVASLAFSPDGSTLATASEDGTARLWDVATGRTTATFTNSSGPV 950

Query: 159 QYVSASDGAKIIITAGYGEKHLQVWRC----DISSKTVNKGPALSMRHSP 204
             V+ S   + + T G GE    +W       I++ T + G   S+  SP
Sbjct: 951 GAVAFSPDGRTLATGG-GEGAALLWEVATGRTIATLTGHTGAVFSLAFSP 999


>gi|297204688|ref|ZP_06922085.1| WD repeat protein [Streptomyces sviceus ATCC 29083]
 gi|297148756|gb|EDY54791.2| WD repeat protein [Streptomyces sviceus ATCC 29083]
          Length = 1167

 Score = 38.1 bits (87), Expect = 6.2,   Method: Composition-based stats.
 Identities = 48/188 (25%), Positives = 73/188 (38%), Gaps = 15/188 (7%)

Query: 45   LALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEM 104
            LA G+S+  +   D  TG M     G H G +  LAF   G  L     +          
Sbjct: 984  LATGSSDTTVRLWDPSTGAMVRILNG-HRGPVRALAFHPDGTFLATASHDRTVRIWDPST 1042

Query: 105  GEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVS 162
            G+V+R        + + AF  + ++ A  SS+  VR+     G  V   S   GP++ V+
Sbjct: 1043 GDVVRSLVGHTDQLHTVAFSPDGRLLATGSSDTTVRLWDASTGAMVRMLSGHRGPVRAVA 1102

Query: 163  ASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAI-DCKNSPNGEDGTVI 221
             S     + + G  E         I       G AL+M  +  A+  C  S    DG V+
Sbjct: 1103 FSPDGSCLASGGADET--------IRIHAPASGEALTMMRTDSAVWSCSWSA---DGRVL 1151

Query: 222  LAVAESGV 229
             A   +G+
Sbjct: 1152 FAGTTAGL 1159



 Score = 37.7 bits (86), Expect = 7.3,   Method: Composition-based stats.
 Identities = 41/152 (26%), Positives = 68/152 (44%), Gaps = 10/152 (6%)

Query: 60  LTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPIS 119
           +TGE     TG H G +  +AF+  GR L   G +  A    +  G+ +R  +  + P+ 
Sbjct: 610 VTGEALHTLTG-HQGPVRAVAFSPDGRLLVTGGRDATARIWDATTGQPVRTMRGHDGPVL 668

Query: 120 SSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGE 177
           + AF  +  + A  SS+  VRI     G EVL  +   G L      D     + +G  +
Sbjct: 669 AVAFSPDGSLLATGSSDTTVRIWDPATG-EVLHTASGHGGLVSAVVFDRDGSRLASGGAD 727

Query: 178 KHLQVWRCDISSKTVNK----GPALSMRHSPV 205
              ++W  D++S   ++    GP  ++R S V
Sbjct: 728 TTARLW--DLTSPGPDRRPGDGPPRALRASRV 757


>gi|290979537|ref|XP_002672490.1| transcription initiation factor TFIID subunit [Naegleria gruberi]
 gi|284086067|gb|EFC39746.1| transcription initiation factor TFIID subunit [Naegleria gruberi]
          Length = 716

 Score = 38.1 bits (87), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 31/100 (31%), Positives = 45/100 (45%), Gaps = 3/100 (3%)

Query: 45  LALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEM 104
           +A G+S+  +   DV TGE      G H G +  LAF+  GR     G +G         
Sbjct: 548 VATGSSDKTVRLWDVQTGECMRMFIGHH-GSINTLAFSHDGRYCASAGDDGQVLVWDIGS 606

Query: 105 GEVIREFKASEKPISSSAFLCEEKIFALAS--SEVRILSL 142
           G++  +F    KP+ S  F   +   A AS  S VR+ S+
Sbjct: 607 GKIAYKFIGHTKPVWSLDFNRNDTFLASASLDSTVRVWSM 646


>gi|432941441|ref|XP_004082853.1| PREDICTED: WD repeat-containing protein 43-like [Oryzias latipes]
          Length = 646

 Score = 38.1 bits (87), Expect = 6.3,   Method: Compositional matrix adjust.
 Identities = 49/248 (19%), Positives = 101/248 (40%), Gaps = 22/248 (8%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFV-----GKKRRKERGTL--------LLAL 47
           IW+T+  ++  E+  P      + +C+A G       G +R+K +           LLA+
Sbjct: 35  IWNTDSKTIHQEY-VPSAHLSATCTCIAWGPCRTVKEGPQRKKRKSEAVQVEEKADLLAM 93

Query: 48  GTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEV 107
           GT+ G +L      G +       H  G+  + +  +   L+    +    E   + G++
Sbjct: 94  GTAAGSVLLYSTAKGSLHCSLDEAHSAGVNCVHWHPEDSVLYSGSDDTNIVEWDLQSGKM 153

Query: 108 IREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKF---SDDVGPLQYVSA- 163
             ++KA    ++S     + K+   A   +++  L+  E   KF   S  V  L++ +  
Sbjct: 154 RSKWKADRAAVTSLCVSPDGKLLLSAGHVIKMWDLDTKELYRKFTGHSTAVTTLRFATTR 213

Query: 164 ---SDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTV 220
              S+G   +  A + ++ + VW+     K  N   + ++   P  +D   S + E+   
Sbjct: 214 PPDSNGLYFVSGAAH-DRLISVWQVREDGKDKNSVVSFTLTDEPRHVDLVPSNSKEEAVR 272

Query: 221 ILAVAESG 228
           +  V + G
Sbjct: 273 LAVVCQDG 280


>gi|50304115|ref|XP_452007.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
 gi|74690425|sp|Q6CVN2.1|JIP5_KLULA RecName: Full=WD repeat-containing protein JIP5
 gi|49641139|emb|CAH02400.1| KLLA0B10736p [Kluyveromyces lactis]
          Length = 525

 Score = 38.1 bits (87), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 45/192 (23%), Positives = 79/192 (41%), Gaps = 26/192 (13%)

Query: 63  EMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFK---ASEKPIS 119
           E+ WK T RH G + G++F   G  L+ +G + +  +  S  G+V+++     +SEK   
Sbjct: 114 ELLWK-TRRHKGSVRGMSFNNDGSKLYTIGIDNVLKKANSLTGKVMKKVTLPLSSEKNHY 172

Query: 120 SSAFLCEEKIFALASSEV--------RILSLENGEEVLKFSDDVGPL-QYVSASDGAKII 170
           +     +   F L   E+          L L+N  + +   D +  + Q+V    G  + 
Sbjct: 173 TKLVTSKTHPFLLLGDELGNIHVLNNDTLQLQNTIKSIHNGDAINDIFQFV----GKSVY 228

Query: 171 ITAGYGEKHLQVWRCDISSKTVNKGPALSM---RHSPVAID------CKNSPNGEDGTVI 221
                G+  L  W    S+++    PA  +   R   V+ D      C +  N EDG ++
Sbjct: 229 KFISLGQTTLAYWDSRESNESDASIPADDLDAKRKVYVSDDQEDEMICGSFVNPEDGDIL 288

Query: 222 LAVAESGVAYSW 233
           +     GV   W
Sbjct: 289 VCGMGEGVLTVW 300


>gi|358459346|ref|ZP_09169545.1| WD40 repeat-containing protein [Frankia sp. CN3]
 gi|357077324|gb|EHI86784.1| WD40 repeat-containing protein [Frankia sp. CN3]
          Length = 1532

 Score = 37.7 bits (86), Expect = 7.0,   Method: Composition-based stats.
 Identities = 50/201 (24%), Positives = 76/201 (37%), Gaps = 22/201 (10%)

Query: 44   LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
            +LA   S+      DV TG      TG H GG+  +AF+  G +L   G +         
Sbjct: 1281 MLATAGSDRTTRLWDVATGREIRTLTG-HGGGVLAVAFSPDGNTLTTAGNDRTVRLWDVA 1339

Query: 104  MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYV 161
             G   R        + S AF  +    A A S+   R+  L  G+E   FS   G +  V
Sbjct: 1340 TGRETRTLTGHRGVVWSVAFSPDGNALATAGSDGTARLWDLATGQETRTFSGHRGIVWSV 1399

Query: 162  SAS-DGAKIIITAGYGEKHLQVWRC----DISSKTVNKGPALSMRHSPVAIDCKNSPNGE 216
            + + DG  +   A  G   L  W      +I +   ++   L +  SP            
Sbjct: 1400 AFTPDGGSLATAADDGVARL--WEVATGREIRTIAGHQDWLLGVAFSP------------ 1445

Query: 217  DGTVILAVAESGVAYSWDLKT 237
            DG  +   A+ G A  WD+++
Sbjct: 1446 DGRTLATAADDGTARLWDVES 1466


>gi|298706017|emb|CBJ29131.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 664

 Score = 37.7 bits (86), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 43/163 (26%), Positives = 67/163 (41%), Gaps = 13/163 (7%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVGKKRRKERGTL---------LLALGTSN 51
           IW+T  G L  ++ +P       Y+C+A      K+ K   +          ++ALGT  
Sbjct: 34  IWNTASGDLRQQYVEPR-HLAKQYTCIAWHRRSSKKSKRGSSGASKPSSSLGMIALGTDK 92

Query: 52  GDILAVDVLTGEMKWKSTGRHPG--GLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIR 109
           G +   D+  G + + S G   G   +    F+  G SL    +     E   E G V R
Sbjct: 93  GSVSVWDLKRGALAY-SLGEGEGFPSVTSGGFSADGSSLFSASSGKEVVEWNVETGTVAR 151

Query: 110 EFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEEVLKFS 152
           + K  +   +        KI A+ASS +R+L L  G+   K S
Sbjct: 152 KLKGFKHGATKICLHPAGKILAIASSAIRLLDLTTGKTARKLS 194


>gi|296817773|ref|XP_002849223.1| WD repeat protein [Arthroderma otae CBS 113480]
 gi|238839676|gb|EEQ29338.1| WD repeat protein [Arthroderma otae CBS 113480]
          Length = 414

 Score = 37.7 bits (86), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 26/87 (29%), Positives = 38/87 (43%), Gaps = 11/87 (12%)

Query: 66  WKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKAS---EKPISSSA 122
           W+ T RH G    L F   G  L+  GT+G     KSE G+V+ +F      +KP+  + 
Sbjct: 62  WR-TRRHKGSCRSLTFGIDGEMLYSAGTDGWVKAAKSETGKVVTKFAVPMPRDKPLHDTD 120

Query: 123 FLC-------EEKIFALASSEVRILSL 142
             C       +  + A  SS + I  L
Sbjct: 121 MPCLLHALSPQTLLLATDSSALHIFDL 147


>gi|425438823|ref|ZP_18819165.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
 gi|389717405|emb|CCH98490.1| conserved hypothetical protein [Microcystis aeruginosa PCC 9717]
          Length = 1108

 Score = 37.7 bits (86), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 49/199 (24%), Positives = 81/199 (40%), Gaps = 26/199 (13%)

Query: 44   LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
            ++A+   +G I  +D    +++  +T      +  +AF      + + G NG   ++ S+
Sbjct: 842  IIAIANKDGQITLLDSQGKKIREFATKMR--SIYSIAFHPDSNQIAITGRNGKV-QIWSK 898

Query: 104  MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFS---DDVGPL 158
             G +++EF AS+ PI S AF  E       +SE  ++   L N    L  S   DD    
Sbjct: 899  KGTMLQEFTASQVPIYSLAFNGEGTAIITGTSEGKIQYWHLSNHRPQLINSWTADDNIIY 958

Query: 159  QYVSASDGAKIIITAGYGEKHLQVWRCD---ISSKTVNKGPALSMRHSPVAIDCKNSPNG 215
              V + D  KI  TA  G+  +++W            +  P   +  SP           
Sbjct: 959  DLVFSPDHQKIA-TATRGK--IKIWDLQGNLFEEIKTDSFPVYGVSFSP----------- 1004

Query: 216  EDGTVILAVAESGVAYSWD 234
             DG  I A++  G A  WD
Sbjct: 1005 -DGEKIAAISRDGTARRWD 1022


>gi|145346364|ref|XP_001417659.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144577887|gb|ABO95952.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 534

 Score = 37.7 bits (86), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 26/93 (27%), Positives = 45/93 (48%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
           +LA G+ +G I    V TG    K    H GG+  + F+K G  +     +G+      +
Sbjct: 285 MLASGSQDGKIKVWRVSTGTCLRKFEKAHQGGVTSVTFSKDGSQVLSGSFDGLVRVHGLK 344

Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSE 136
            G++++EF+     ++S AF  +E     ASS+
Sbjct: 345 SGKLLKEFRGHTSYVNSVAFTEDETDVVSASSD 377


>gi|428318693|ref|YP_007116575.1| WD-40 repeat-containing protein [Oscillatoria nigro-viridis PCC
           7112]
 gi|428242373|gb|AFZ08159.1| WD-40 repeat-containing protein [Oscillatoria nigro-viridis PCC
           7112]
          Length = 547

 Score = 37.7 bits (86), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 36/143 (25%), Positives = 67/143 (46%), Gaps = 6/143 (4%)

Query: 45  LALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEM 104
           L  G+ +G I   ++ +G+     +G++   L+ LAF   G+SL    +NG      +  
Sbjct: 406 LVSGSKDGSIRLWNLASGQAIRTISGKNLSVLS-LAFTPDGKSLAAGNSNGTVGLWNAGN 464

Query: 105 GEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVS 162
           G++IR        + S AF  +       S +  VR+  + +G+     S   G +  V+
Sbjct: 465 GQLIRRLSGHTDGVWSVAFSRDGTTLVTGSWDKSVRLWDVRSGDLRGTLSGHSGYVSAVA 524

Query: 163 ASDGAKIIITAGY-GEKHLQVWR 184
            S   K I++AG+ GE  +++W+
Sbjct: 525 ISSDGKTIVSAGWLGE--IKIWK 545


>gi|260806123|ref|XP_002597934.1| hypothetical protein BRAFLDRAFT_79824 [Branchiostoma floridae]
 gi|229283204|gb|EEN53946.1| hypothetical protein BRAFLDRAFT_79824 [Branchiostoma floridae]
          Length = 415

 Score = 37.7 bits (86), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 27/105 (25%), Positives = 50/105 (47%), Gaps = 4/105 (3%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
           L A+G+ NG +   D+ TG  + K +  HPGG+  LA+A+ G  L+    +G+     S 
Sbjct: 312 LAAVGSLNGTLGVWDIPTGVQRHKCS--HPGGIVRLAWAESGYLLYTSCLDGVTRIWDSR 369

Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGE 146
            G  +++     + I   A   +  +   +S +   R+ +L+  E
Sbjct: 370 TGGTVQQLTGHSEEILDMAVSRDGNVVLTSSGDNTARVFNLQKPE 414


>gi|66813758|ref|XP_641058.1| hypothetical protein DDB_G0280575 [Dictyostelium discoideum AX4]
 gi|60469093|gb|EAL67089.1| hypothetical protein DDB_G0280575 [Dictyostelium discoideum AX4]
          Length = 1019

 Score = 37.7 bits (86), Expect = 7.8,   Method: Compositional matrix adjust.
 Identities = 36/150 (24%), Positives = 70/150 (46%), Gaps = 16/150 (10%)

Query: 98  SEMKSEMGEVIREFKASEKPISSSAF----LCEEKI--FA--LASSEVRILSLENGEEVL 149
           +E   E+G+++ E   + K I S       L +  I  FA  ++S    +++L   +  +
Sbjct: 473 NEFDDEVGQLMGEMLVTNKSIKSLDLSFNDLADNTIQGFAEAISSPSCALVNLNLSDCAM 532

Query: 150 KFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDC 209
             ++D G  Q+  A  G + I       + L +W CD+SS+ V +  ++++RH+    D 
Sbjct: 533 SVNEDTGA-QFFQALAGNRSI-------QRLLLWSCDLSSQLVKEELSIAIRHNITITDL 584

Query: 210 KNSPNGEDGTVILAVAESGVAYSWDLKTVS 239
               N    + I  +   G+ ++  L+TVS
Sbjct: 585 SLGFNDLTSSDISIILHKGLKFNKTLQTVS 614


>gi|434394209|ref|YP_007129156.1| WD40 repeat-containing protein [Gloeocapsa sp. PCC 7428]
 gi|428266050|gb|AFZ31996.1| WD40 repeat-containing protein [Gloeocapsa sp. PCC 7428]
          Length = 356

 Score = 37.7 bits (86), Expect = 8.4,   Method: Compositional matrix adjust.
 Identities = 28/117 (23%), Positives = 57/117 (48%), Gaps = 3/117 (2%)

Query: 79  LAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSE-- 136
           +AF+   ++L      G      +  GE+ R   A +K +++ +F  + + FA AS +  
Sbjct: 241 VAFSPDSKTLASGSNIGEVKLWNTSNGELRRTIAAHKKEVTAISFSSDGETFATASEDRV 300

Query: 137 VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVN 193
           +R+ +++NGE V   +D    +  V+ S    +    G  ++ +++WR   SS  V+
Sbjct: 301 IRLWNIDNGEVVRNLADHSQGITCVAFSQNG-LNFATGSKDRTIKIWRVSPSSVPVS 356


>gi|162452827|ref|YP_001615194.1| protein kinase [Sorangium cellulosum So ce56]
 gi|161163409|emb|CAN94714.1| Protein kinase [Sorangium cellulosum So ce56]
          Length = 1529

 Score = 37.7 bits (86), Expect = 8.4,   Method: Composition-based stats.
 Identities = 58/258 (22%), Positives = 109/258 (42%), Gaps = 34/258 (13%)

Query: 72   HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFA 131
            HPGG+ G +++  G+ +     +  A   K++M       +  E  +SS+AF  + +   
Sbjct: 883  HPGGVMGASYSPDGKYVVTASLDRNARVWKADMTGEPLVLRGHENGVSSAAFSPDGRWIV 942

Query: 132  LAS--SEVRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISS 189
             AS    VR+   +   E L   D  G +   + S   + I +A + ++ ++VW+ D + 
Sbjct: 943  TASWDGTVRVWKADGTGEPLVLRDHEGRVNSAAFSPDGQRIASASH-DRTVRVWKADGTG 1001

Query: 190  KTV----NKGPALSMRHSP----VAIDCKN------SPNGEDGTVILAVAE---SGVAYS 232
            + +    +  P  ++  SP    +A    +      S +G    V+L   E    GVA+S
Sbjct: 1002 EPLVLRGHDAPVYAVAFSPDGKRIATGSYDHTARVWSADGSGEPVVLRGHEHEVQGVAFS 1061

Query: 233  WD---LKTVSQDEKTNPAKITVKLKKADADQQNSVNVKKSRTSIIAARLNDLEADGQKTI 289
             D   L T S D+       T ++ KA+   +  V ++     + +A  +    DG++ +
Sbjct: 1062 PDGQQLVTASSDK-------TARVWKANGTGEPLV-LRGHEAPVYSAAFS---PDGRRIV 1110

Query: 290  TGLHDLESEAATASAQNE 307
            TG  D  +    A    E
Sbjct: 1111 TGSRDKTARVWKADGTGE 1128


>gi|426335145|ref|XP_004029093.1| PREDICTED: WD repeat-containing protein 43 [Gorilla gorilla
           gorilla]
          Length = 676

 Score = 37.7 bits (86), Expect = 8.6,   Method: Compositional matrix adjust.
 Identities = 47/220 (21%), Positives = 87/220 (39%), Gaps = 19/220 (8%)

Query: 34  KKRRKE-----RGTLLLALGTSNGDILAVDVLTGEMKWK-STGRHPGGLAGLAFAKKGRS 87
           KKR+ E       T LLALGT+ G IL    + GE+  K  +G H   +  + + +    
Sbjct: 81  KKRKSEAVGTSNQTDLLALGTAVGSILLYSTVKGELHSKLISGGHDNRVNCIQWHQDSGC 140

Query: 88  LHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEE 147
           L+    +    E   +  +V  ++K     +SS     + K+   A   +++  LE  E 
Sbjct: 141 LYSCSDDKHIVEWNVQTCKVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLETKEI 200

Query: 148 VLKFSDDVGPLQYVSAS-----------DGAK--IIITAGYGEKHLQVWRCDISSKTVNK 194
              F+    P+  +  +           DG      ++    ++ L VW+    +K  + 
Sbjct: 201 YRHFTGHATPVSSLMFTTIRPPNESQPFDGITGLYFLSGAVHDRLLNVWQVRSENKEKSA 260

Query: 195 GPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWD 234
             + ++   PV ID   S N E+   +  V   G  + ++
Sbjct: 261 VMSFTVTDEPVYIDLTLSENKEEPVKLAVVCRDGQVHLFE 300


>gi|393214198|gb|EJC99691.1| WD40 repeat-like protein [Fomitiporia mediterranea MF3/22]
          Length = 1229

 Score = 37.7 bits (86), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 52/240 (21%), Positives = 94/240 (39%), Gaps = 24/240 (10%)

Query: 1   IWSTNDGSLLAEWKQPDGEPVVSYSCLACGFVGKKRRKERGTLLLALGTSNGDILAVDVL 60
           IW+   G +L E+ + +G  V S       F    RR       +A GT  G I   D+ 
Sbjct: 611 IWNVESGEVLCEFSEGNGAEVNSVV-----FSPDGRR-------IAFGTCRGTISIWDIE 658

Query: 61  TGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISS 120
           + E+       H G + G+AF+  G  +     +        E    +R  +     + S
Sbjct: 659 SKELVSGPFKGHTGSVRGVAFSPDGMHITSGSADTTIRVWDIEKASTLRVLEGHTASVWS 718

Query: 121 SAFLCEEKIFALASSE--VRILSLENGEEVLK-FSDDVGPLQYVSASDGAKIIITAGYGE 177
            AF  +       S +  +R+   E G+ + K F      +Q V+ S   K I++ G  +
Sbjct: 719 VAFSSDGNCIVSGSEDKTLRVWDPETGQAIGKPFVGHTDGVQCVAISPDCKCIVS-GSND 777

Query: 178 KHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWDLKT 237
             ++VW   + S+ V  GP   +         K+     DG  +++ ++      WD+++
Sbjct: 778 FTVRVW--GMESEKVVAGPFWHLTF------VKSVAFSSDGRRVVSASDDFSIVVWDMES 829


>gi|383778350|ref|YP_005462916.1| hypothetical protein AMIS_31800 [Actinoplanes missouriensis 431]
 gi|381371582|dbj|BAL88400.1| hypothetical protein AMIS_31800 [Actinoplanes missouriensis 431]
          Length = 405

 Score = 37.7 bits (86), Expect = 8.8,   Method: Compositional matrix adjust.
 Identities = 49/205 (23%), Positives = 86/205 (41%), Gaps = 23/205 (11%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
           LLA    +G +   +  +G+     TG H GG+  + F+  G+ L   G  G      ++
Sbjct: 158 LLATAGDDGTVRLWEAASGDPVRTLTG-HTGGVRSVVFSPDGKLLVTAGATGSDKTTDTD 216

Query: 104 MGEVIREFKA-SEKPISSSAFLCEEKIFALA-SSEVRILSLENGEEVLKFSD------DV 155
               +R +   + +P+  +  +    +FA++ +    +L+   G   ++  D      D 
Sbjct: 217 DDNTVRLWHTRTGRPVGKALSISPGPVFAVSFTPGGELLATSGGGRAIRLWDTATATPDG 276

Query: 156 GPLQYVSA-------SDGAKIIITAGYGEKHLQVW-----RCDISSKTVNKGPALSMRHS 203
            PL   SA       S G K++ T+  G+  +Q+W     R   +  T + GP  +MR S
Sbjct: 277 DPLVSSSAPVFALTFSAGGKLLATSSGGDNTVQLWDTASRRPATAPLTGHTGPVRAMRFS 336

Query: 204 PVAIDCKNSPNGEDGTVILAVAESG 228
           P           +DGTV L  A +G
Sbjct: 337 PGGNLLAT--GSDDGTVRLWNAATG 359


>gi|387127265|ref|YP_006295870.1| cell surface protein/ lipoprotein [Methylophaga sp. JAM1]
 gi|386274327|gb|AFI84225.1| putative cell surface protein/ lipoprotein [Methylophaga sp. JAM1]
          Length = 346

 Score = 37.4 bits (85), Expect = 8.9,   Method: Compositional matrix adjust.
 Identities = 28/104 (26%), Positives = 42/104 (40%), Gaps = 3/104 (2%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
           +L +G   G + AVD+ TGE  W       G     + A K + +    T+G    +  E
Sbjct: 204 MLYVGNDAGVLFAVDLQTGEPVWL---FESGSRMFASPALKNQQIIFAATDGRVFALNKE 260

Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEE 147
            GE   +F A     SS A   E+         +  L + NGE+
Sbjct: 261 SGEPFWQFDAQATIFSSPAIAGEQIYIGAMDGYLYALDINNGEQ 304


>gi|123966361|ref|YP_001011442.1| WD-40 repeat-containing G-protein [Prochlorococcus marinus str. MIT
           9515]
 gi|123200727|gb|ABM72335.1| G-protein beta WD-40 repeats [Prochlorococcus marinus str. MIT
           9515]
          Length = 357

 Score = 37.4 bits (85), Expect = 9.0,   Method: Compositional matrix adjust.
 Identities = 49/215 (22%), Positives = 87/215 (40%), Gaps = 25/215 (11%)

Query: 41  GTLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEM 100
           G  L  +G   G +   +  TG++ W+    H GGL  ++    G      G +G+    
Sbjct: 33  GGKLFIVGDVAGGVYGFEGDTGKIVWEKNEAHSGGLLAMSIHPNGNIFATSGQDGIVRIC 92

Query: 101 KSEMGEVIREFKASEKPISSSAFLCEEKIFALA-SSEVRILSLENGEEVLKFSDDVGPLQ 159
               GE I++ K  +  +   ++  +    A A S +V +L+ E  E  +   +    + 
Sbjct: 93  NCNNGEKIKDIKLGKCWVEHLSWSNDGSFLAAAFSKKVYVLNQEGNENWVS-EEHPSTVS 151

Query: 160 YVSASDGAKIIITAGYGEKHLQVWRCDISSKTVN-----KGPALSMRHSPVAIDCKNSPN 214
            +S S+  + + TA YG    QV   D+++   N     +G  +SM+ SP          
Sbjct: 152 AISWSNKNE-LATACYG----QVTFFDVANNKTNQKLKWQGSLVSMKLSP---------- 196

Query: 215 GEDGTVILAVAESGVAYSWDLKTVSQDEKTN-PAK 248
             DG ++   ++    + W   T    E T  PAK
Sbjct: 197 --DGDIVACGSQDNSVHFWRRSTGQDAEMTGYPAK 229


>gi|452523|dbj|BAA05499.1| KIAA0007 [Homo sapiens]
          Length = 686

 Score = 37.4 bits (85), Expect = 9.0,   Method: Compositional matrix adjust.
 Identities = 47/220 (21%), Positives = 87/220 (39%), Gaps = 19/220 (8%)

Query: 34  KKRRKE-----RGTLLLALGTSNGDILAVDVLTGEMKWK-STGRHPGGLAGLAFAKKGRS 87
           KKR+ E       T LLALGT+ G IL    + GE+  K  +G H   +  + + +    
Sbjct: 90  KKRKSEAVGMSNQTDLLALGTAVGSILLYSTVKGELHSKLISGGHDNRVNCIQWHQDSGC 149

Query: 88  LHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSEVRILSLENGEE 147
           L+    +    E   +  +V  ++K     +SS     + K+   A   +++  LE  E 
Sbjct: 150 LYSCSDDKHIVEWNVQTCKVKCKWKGDNSSVSSLCISPDGKMLLSAGRTIKLWVLETKEV 209

Query: 148 VLKFSDDVGPLQYVSAS-----------DGAK--IIITAGYGEKHLQVWRCDISSKTVNK 194
              F+    P+  +  +           DG      ++    ++ L VW+    +K  + 
Sbjct: 210 YRHFTGHATPVSSLMFTTIRPPNESQPFDGITGLYFLSGAVHDRLLNVWQVRSENKEKSA 269

Query: 195 GPALSMRHSPVAIDCKNSPNGEDGTVILAVAESGVAYSWD 234
             + ++   PV ID   S N E+   +  V   G  + ++
Sbjct: 270 VMSFTVTDEPVYIDLTLSENKEEPVKLAVVCRDGQVHLFE 309


>gi|427414552|ref|ZP_18904739.1| WD40 repeat-containing protein [Leptolyngbya sp. PCC 7375]
 gi|425755696|gb|EKU96559.1| WD40 repeat-containing protein [Leptolyngbya sp. PCC 7375]
          Length = 1494

 Score = 37.4 bits (85), Expect = 9.2,   Method: Compositional matrix adjust.
 Identities = 31/141 (21%), Positives = 64/141 (45%), Gaps = 4/141 (2%)

Query: 45   LALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEM 104
            LA+G     +   D++TG++ W   G H   +A + F+  G+ L     + +     +  
Sbjct: 881  LAIGDFKNTVQIWDIVTGQVVWFCLG-HSDWVASVTFSSDGKLLASGSDDHVVKLWSTNS 939

Query: 105  GEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYVS 162
            G+ IR F      + S AF  + K    AS +  +++  +E+G+ +  F      +  V+
Sbjct: 940  GKCIRTFTGHSGWVLSVAFSSDTKTLVSASKDHTIKLWCIESGKCLRTFEGHSDSVWSVA 999

Query: 163  ASDGAKIIITAGYGEKHLQVW 183
             S   K  + +G  ++ +++W
Sbjct: 1000 ISPDGK-TLASGSRDRTIKLW 1019


>gi|145348642|ref|XP_001418755.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144578985|gb|ABO97048.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 461

 Score = 37.4 bits (85), Expect = 9.3,   Method: Compositional matrix adjust.
 Identities = 48/215 (22%), Positives = 84/215 (39%), Gaps = 19/215 (8%)

Query: 42  TLLLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMK 101
           T LL+ G ++G ++  D        K+   H GG+  ++  + G+     G +   +   
Sbjct: 107 TRLLS-GGADGALMVWDASDNFELLKTMRAHRGGVCAISAHRSGKVALTSGCDAHVAMWD 165

Query: 102 SEMGEVIREFKASEKPISSSAFLCE-EKIFALASSEVRILSLENGEEVLKFS-------- 152
              G V  +FK  E+ +   AF C+  +  +  + ++ +  +E G  V  F+        
Sbjct: 166 MRRGRVAYKFKTPER-VEGLAFTCDGSEYVSRLTQKITLTDVEAGSVVTSFTTPAKTLTF 224

Query: 153 DDVGPLQYVSASDGAKIIITAGYGEKHLQVWRCDISSKTVNKGPALSMRHSPVAIDCKNS 212
           D  G + YV    G  ++  A    K   V R   +     +G A++       +     
Sbjct: 225 DARGRMVYVGCEGGDVLVYDARMAAKEGAVSRIAKAHPQRVRGLAITSAKGDETV----- 279

Query: 213 PNGEDGTVILAVAES-GVAYSWDLKTVSQDEKTNP 246
             G+ G   L  A S GV  +WDL+  S     NP
Sbjct: 280 --GDSGPSALVTAGSEGVVRAWDLRRASGPRDENP 312


>gi|347441927|emb|CCD34848.1| similar to prolyl oligopeptidase [Botryotinia fuckeliana]
          Length = 446

 Score = 37.4 bits (85), Expect = 9.4,   Method: Compositional matrix adjust.
 Identities = 34/142 (23%), Positives = 65/142 (45%), Gaps = 4/142 (2%)

Query: 44  LLALGTSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASEMKSE 103
           ++A G+ +  I   DV TGE   K  G H   +  +AF+  G+ +     +         
Sbjct: 102 VVASGSYDKTIRLWDVATGESLQKLEG-HSHWVNSVAFSSDGKVVASGSNDNTIRLWDVA 160

Query: 104 MGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEEVLKFSDDVGPLQYV 161
            GE ++ F+   K ++S AF  + K+ A  S +  +R+  +  GE +  F      ++ V
Sbjct: 161 TGESVQTFEGHSKWVNSVAFSPDGKVVASGSYDETIRLWDVATGESLQTFEGHSESVKSV 220

Query: 162 SASDGAKIIITAGYGEKHLQVW 183
           + S   K++ +  Y E  +++W
Sbjct: 221 AFSPDGKVVASGSYDET-IRLW 241


>gi|282901415|ref|ZP_06309340.1| WD-40 repeat protein [Cylindrospermopsis raciborskii CS-505]
 gi|281193694|gb|EFA68666.1| WD-40 repeat protein [Cylindrospermopsis raciborskii CS-505]
          Length = 1341

 Score = 37.4 bits (85), Expect = 9.4,   Method: Composition-based stats.
 Identities = 31/114 (27%), Positives = 57/114 (50%), Gaps = 5/114 (4%)

Query: 72  HPGGLAGLAFAKKGRSLHVVGTNGMASEMKSEMGEVIREFKASEKPISSSAFLCEEKIFA 131
           H GG+  + F+  G+S+     +G A  + +  GE I++F   E  ++S +F  + +I A
Sbjct: 882 HEGGVTSICFSPDGQSIGTGSEDGTA-RLWNLQGENIQQFHGHEDWVTSVSFSPDGQILA 940

Query: 132 LASSE--VRILSLENGEEVLKFSDDVGPLQYVSASDGAKIIITAGYGEKHLQVW 183
             S +  VR+ +L+ GE + +F      +  VS S   K + T    +K  ++W
Sbjct: 941 TTSVDKTVRLWNLQ-GETIQQFHGHENWVTSVSFSPDGKTLATTSV-DKTARLW 992


>gi|406836180|ref|ZP_11095774.1| hypothetical protein SpalD1_31214 [Schlesneria paludicola DSM
           18645]
          Length = 998

 Score = 37.4 bits (85), Expect = 9.7,   Method: Compositional matrix adjust.
 Identities = 32/152 (21%), Positives = 72/152 (47%), Gaps = 13/152 (8%)

Query: 44  LLALG----TSNGDILAVDVLTGEMKWKSTGRHPGGLAGLAFAKKGRSLHVVGTNGMASE 99
           LLA G    + +G++   +V T  +  +    H   + GL F++ G+SL     +    +
Sbjct: 786 LLATGGGEPSRSGELFLWNVETRSVAKQFVDAHSDTVFGLDFSRDGKSLVSGAADKFVKQ 845

Query: 100 MKSEMGEVIREFKASEKPISSSAFLCEEKIFALASSE--VRILSLENGEE---VLKFSDD 154
              E G+++R F+     +    +  +    A A ++  +++ ++E GE+   +  +S  
Sbjct: 846 FDVESGKLVRSFEGHTHHVLGVTWKGDGSRIASAGADNAIKVWNVETGEQHRTIQNYSKQ 905

Query: 155 VGPLQYVSASDGAKIIITAGYGEKHLQVWRCD 186
           V  + ++ ASD     + +G G+K +++ R +
Sbjct: 906 VTAIHFIGASDN----LISGSGDKTVKMHRSN 933


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.311    0.128    0.362 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,260,860,934
Number of Sequences: 23463169
Number of extensions: 208945587
Number of successful extensions: 535335
Number of sequences better than 100.0: 532
Number of HSP's better than 100.0 without gapping: 61
Number of HSP's successfully gapped in prelim test: 471
Number of HSP's that attempted gapping in prelim test: 534697
Number of HSP's gapped (non-prelim): 971
length of query: 340
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 197
effective length of database: 9,003,962,200
effective search space: 1773780553400
effective search space used: 1773780553400
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 77 (34.3 bits)