RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy17902
         (454 letters)



>gnl|CDD|238121 cd00200, WD40, WD40 domain, found in a number of eukaryotic
           proteins that cover a wide variety of functions
           including adaptor/regulatory modules in signal
           transduction, pre-mRNA processing and cytoskeleton
           assembly; typically contains a GH dipeptide 11-24
           residues from its N-terminus and the WD dipeptide at its
           C-terminus and is 40 residues long, hence the name WD40;
           between GH and WD lies a conserved core; serves as a
           stable propeller-like platform to which proteins can
           bind either stably or reversibly; forms a propeller-like
           structure with several blades where each blade is
           composed of a four-stranded anti-parallel b-sheet;
           instances with few detectable copies are hypothesized to
           form larger structures by dimerization; each WD40
           sequence repeat forms the first three strands of one
           blade and the last strand in the next blade; the last
           C-terminal WD40 repeat completes the blade structure of
           the first WD40 repeat to create the closed ring
           propeller-structure; residues on the top and bottom
           surface of the propeller are proposed to coordinate
           interactions with other proteins and/or small ligands; 7
           copies of the repeat are present in this alignment.
          Length = 289

 Score =  152 bits (385), Expect = 1e-42
 Identities = 77/299 (25%), Positives = 136/299 (45%), Gaps = 36/299 (12%)

Query: 139 KPHRASITCIAVTSDSKHVYSACKDGIIIKWLLETRSKVCS----SPFVKTDRKSYPPIL 194
           K H   +TC+A + D K + +   DG I  W LET   + +    +  V+          
Sbjct: 6   KGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRD--------- 56

Query: 195 ALAVANNNELVASGDNNSNIKLWSSSLEH-LHTFTGHKAPITGLAISFECDTLYSSSNDR 253
             A A+   L +   + + I+LW       + T TGH + ++ +A S +   L SSS D+
Sbjct: 57  VAASADGTYLASGSSDKT-IRLWDLETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDK 115

Query: 254 SIKVWNIEDRAYVETLFGHQSAITSLDILAKNRVISSGGTDKSVRVWKIQEES-QLVFNG 312
           +IKVW++E    + TL GH   + S+        ++S   D ++++W ++         G
Sbjct: 116 TIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTG 175

Query: 313 HSRGIDMTRRV----NDQQFITCGEDGTLSLWSILKKVPTHTIPVAHGYDSSSSEPNWIT 368
           H+  +     V    + ++ ++   DGT+ LW +       T+   H         N + 
Sbjct: 176 HTGEV---NSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTL-RGH--------ENGVN 223

Query: 369 SVACLHNSDLVASGSYDGCVKLWKITPKRTLEFLQDIPL-SGVINSLVFTSDHKHLVAG 426
           SVA   +  L+ASGS DG +++W +   RT E +Q +   +  + SL ++ D K L +G
Sbjct: 224 SVAFSPDGYLLASGSEDGTIRVWDL---RTGECVQTLSGHTNSVTSLAWSPDGKRLASG 279



 Score =  132 bits (335), Expect = 1e-35
 Identities = 62/257 (24%), Positives = 113/257 (43%), Gaps = 18/257 (7%)

Query: 139 KPHRASITCIAVTSDSKHVYSACKDGIIIKWLLETRSKVCSSPFVKTDRKSYPPILALAV 198
           K H   +  +A ++D  ++ S   D  I  W LET   V +   +         + ++A 
Sbjct: 48  KGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGECVRT---LTGHTS---YVSSVAF 101

Query: 199 ANNNELVASGDNNSNIKLWS-SSLEHLHTFTGHKAPITGLAISFECDTLYSSSNDRSIKV 257
           + +  +++S   +  IK+W   + + L T  GH   +  +A S +   + SSS D +IK+
Sbjct: 102 SPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKL 161

Query: 258 WNIEDRAYVETLFGHQSAITSLDILAKNRVISSGGTDKSVRVWKIQEESQL-VFNGHSRG 316
           W++     V TL GH   + S+        + S  +D ++++W +     L    GH  G
Sbjct: 162 WDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENG 221

Query: 317 IDMTRRVNDQQFIT-CGEDGTLSLWSILKKVPTHTIPVAHGYDSSSSEPNWITSVACLHN 375
           ++      D   +    EDGT+ +W +       T+   H         N +TS+A   +
Sbjct: 222 VNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTLS-GH--------TNSVTSLAWSPD 272

Query: 376 SDLVASGSYDGCVKLWK 392
              +ASGS DG +++W 
Sbjct: 273 GKRLASGSADGTIRIWD 289



 Score =  115 bits (289), Expect = 5e-29
 Identities = 57/235 (24%), Positives = 105/235 (44%), Gaps = 24/235 (10%)

Query: 223 HLHTFTGHKAPITGLAISFECDTLYSSSNDRSIKVWNIEDRAYVETLFGHQSAITSLDIL 282
              T  GH   +T +A S +   L + S D +IKVW++E    + TL GH   +  +   
Sbjct: 1   LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAAS 60

Query: 283 AKNRVISSGGTDKSVRVWKIQEESQL-VFNGHS---RGIDMTRRVNDQQFITCGEDGTLS 338
           A    ++SG +DK++R+W ++    +    GH+     +  +   + +   +   D T+ 
Sbjct: 61  ADGTYLASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSP--DGRILSSSSRDKTIK 118

Query: 339 LWSILKKVPTHTIPVAHGYDSSSSEPNWITSVACLHNSDLVASGSYDGCVKLWKITPKRT 398
           +W +       T+   H         +W+ SVA   +   VAS S DG +KLW +   + 
Sbjct: 119 VWDVETGKCLTTLR-GH--------TDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKC 169

Query: 399 LEFLQDIPLSGVINSLVFTSDHKHLVAGIGT------EHRFGRW-KTIKGVKNTV 446
           +  L     +G +NS+ F+ D + L++          +   G+   T++G +N V
Sbjct: 170 VATLTG--HTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGV 222


>gnl|CDD|225201 COG2319, COG2319, FOG: WD40 repeat [General function prediction
           only].
          Length = 466

 Score =  110 bits (276), Expect = 2e-26
 Identities = 89/360 (24%), Positives = 151/360 (41%), Gaps = 18/360 (5%)

Query: 91  EKEKLNDSKDKHVLKKLETVVLEQEGKLRVNIADRLANATVVCTKFNG---KPHRASITC 147
            K KL    +        +++     +  + +   L+++ V     +    + H  SIT 
Sbjct: 11  NKSKLLKKSELGPSLNSLSLLSLGSSESGILLLALLSDSLVSLPDLSSLLLRGHEDSITS 70

Query: 148 IAVTSDSKHVYSACKDGIIIKWLLETRSKVCSSPFVKTDRKSYPPILALAVANNNELVAS 207
           IA + D + + S   DG I  W L+   K+  S        S   +   +   N+ L+AS
Sbjct: 71  IAFSPDGELLLSGSSDGTIKLWDLDNGEKLIKS-LEGLHDSSVSKLALSSPDGNSILLAS 129

Query: 208 GDNNSNIKLW--SSSLEHLHTFTGHKAPITGLAISFECDTL-YSSSNDRSIKVWNIEDRA 264
              +  +KLW  S+  + + T  GH   +T LA S +   L   SS D +IK+W++    
Sbjct: 130 SSLDGTVKLWDLSTPGKLIRTLEGHSESVTSLAFSPDGKLLASGSSLDGTIKLWDLRTGK 189

Query: 265 YVETLFGHQSAITSLDILAKNR-VISSGGTDKSVRVWKIQ--EESQLVFNGHSRGIDMTR 321
            + TL GH   ++SL        +I+SG +D ++R+W +   +  +   +GHS  +  + 
Sbjct: 190 PLSTLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTGKLLRSTLSGHSDSVVSSF 249

Query: 322 RVNDQQFITCGEDGTLSLWSILKKVPTHTIPVAHGYDSSSSEPNWITSVACLHNSDLVAS 381
             +     +   DGT+ LW +            H           + SVA   +  L+AS
Sbjct: 250 SPDGSLLASGSSDGTIRLWDLRSSSSLLRTLSGHSS--------SVLSVAFSPDGKLLAS 301

Query: 382 GSYDGCVKLWKITPKRTLEFLQDIPLSGVINSLVFTSDHKHLVAGIGTEHRFGRWKTIKG 441
           GS DG V+LW +   + L  L      G ++SL F+ D   LV+G   +     W    G
Sbjct: 302 GSSDGTVRLWDLETGKLLSSLTLKGHEGPVSSLSFSPDGSLLVSGGSDDGTIRLWDLRTG 361



 Score = 84.4 bits (207), Expect = 2e-17
 Identities = 60/262 (22%), Positives = 116/262 (44%), Gaps = 22/262 (8%)

Query: 139 KPHRASITCIAVTSDSK-HVYSACKDGIIIKWLLETRSKVCSSPFVKTDRKSYPPILALA 197
             H   ++ +A + D    + S   DG I  W L T         +++    +   +  +
Sbjct: 195 AGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLST------GKLLRSTLSGHSDSVVSS 248

Query: 198 VANNNELVASGDNNSNIKLWS--SSLEHLHTFTGHKAPITGLAISFECDTLYSSSNDRSI 255
            + +  L+ASG ++  I+LW   SS   L T +GH + +  +A S +   L S S+D ++
Sbjct: 249 FSPDGSLLASGSSDGTIRLWDLRSSSSLLRTLSGHSSSVLSVAFSPDGKLLASGSSDGTV 308

Query: 256 KVWNIE--DRAYVETLFGHQSAITSLDILAKN-RVISSGGTDKSVRVWKIQEESQLV-FN 311
           ++W++E        TL GH+  ++SL        ++S G  D ++R+W ++    L    
Sbjct: 309 RLWDLETGKLLSSLTLKGHEGPVSSLSFSPDGSLLVSGGSDDGTIRLWDLRTGKPLKTLE 368

Query: 312 GHSRGIDMTRRVNDQQFITCGEDGTLSLWSILKKVPTHTIPVAHGYDSSSSEPNWITSVA 371
           GHS  + ++   + +   +   DGT+ LW +                +     + +TS+ 
Sbjct: 369 GHSNVLSVSFSPDGRVVSSGSTDGTVRLWDLSTGSLLR---------NLDGHTSRVTSLD 419

Query: 372 CLHNSDLVASGSYDGCVKLWKI 393
              +   +ASGS D  ++LW +
Sbjct: 420 FSPDGKSLASGSSDNTIRLWDL 441



 Score = 73.2 bits (178), Expect = 6e-14
 Identities = 52/209 (24%), Positives = 97/209 (46%), Gaps = 12/209 (5%)

Query: 146 TCIAVTSDSKHVYSACKDGIIIKWLLETRSKVCSSPFVKTDRKSYPPILALAVANNNELV 205
              + + D   + S   DG I  W L + S +     ++T       +L++A + + +L+
Sbjct: 245 VVSSFSPDGSLLASGSSDGTIRLWDLRSSSSL-----LRTLSGHSSSVLSVAFSPDGKLL 299

Query: 206 ASGDNNSNIKLWSSS---LEHLHTFTGHKAPITGLAISFECDTLYSS-SNDRSIKVWNIE 261
           ASG ++  ++LW      L    T  GH+ P++ L+ S +   L S  S+D +I++W++ 
Sbjct: 300 ASGSSDGTVRLWDLETGKLLSSLTLKGHEGPVSSLSFSPDGSLLVSGGSDDGTIRLWDLR 359

Query: 262 DRAYVETLFGHQSAITSLDILAKNRVISSGGTDKSVRVWKIQEESQLVFNGHSRG--IDM 319
               ++TL GH S + S+      RV+SSG TD +VR+W +   S L            +
Sbjct: 360 TGKPLKTLEGH-SNVLSVSFSPDGRVVSSGSTDGTVRLWDLSTGSLLRNLDGHTSRVTSL 418

Query: 320 TRRVNDQQFITCGEDGTLSLWSILKKVPT 348
               + +   +   D T+ LW +   + +
Sbjct: 419 DFSPDGKSLASGSSDNTIRLWDLKTSLKS 447



 Score = 67.8 bits (164), Expect = 3e-12
 Identities = 40/164 (24%), Positives = 82/164 (50%), Gaps = 5/164 (3%)

Query: 141 HRASITCIAVTSDSKHVYSACKDGIIIKWLLETRSKVCSSPFVKTDRKSYPPILALAVA- 199
           H +S+  +A + D K + S   DG +  W LET   + S     T +    P+ +L+ + 
Sbjct: 283 HSSSVLSVAFSPDGKLLASGSSDGTVRLWDLETGKLLSS----LTLKGHEGPVSSLSFSP 338

Query: 200 NNNELVASGDNNSNIKLWSSSLEHLHTFTGHKAPITGLAISFECDTLYSSSNDRSIKVWN 259
           + + LV+ G ++  I+LW              + +  ++ S +   + S S D ++++W+
Sbjct: 339 DGSLLVSGGSDDGTIRLWDLRTGKPLKTLEGHSNVLSVSFSPDGRVVSSGSTDGTVRLWD 398

Query: 260 IEDRAYVETLFGHQSAITSLDILAKNRVISSGGTDKSVRVWKIQ 303
           +   + +  L GH S +TSLD     + ++SG +D ++R+W ++
Sbjct: 399 LSTGSLLRNLDGHTSRVTSLDFSPDGKSLASGSSDNTIRLWDLK 442



 Score = 48.5 bits (114), Expect = 5e-06
 Identities = 28/124 (22%), Positives = 60/124 (48%), Gaps = 9/124 (7%)

Query: 141 HRASITCIAVTSDSKHVYSAC-KDGIIIKWLLETRSKVCSSPFVKTDRKSYPPILALAVA 199
           H   ++ ++ + D   + S    DG I  W L T   +          + +  +L+++ +
Sbjct: 327 HEGPVSSLSFSPDGSLLVSGGSDDGTIRLWDLRTGKPL-------KTLEGHSNVLSVSFS 379

Query: 200 NNNELVASGDNNSNIKLWS-SSLEHLHTFTGHKAPITGLAISFECDTLYSSSNDRSIKVW 258
            +  +V+SG  +  ++LW  S+   L    GH + +T L  S +  +L S S+D +I++W
Sbjct: 380 PDGRVVSSGSTDGTVRLWDLSTGSLLRNLDGHTSRVTSLDFSPDGKSLASGSSDNTIRLW 439

Query: 259 NIED 262
           +++ 
Sbjct: 440 DLKT 443


>gnl|CDD|201208 pfam00400, WD40, WD domain, G-beta repeat. 
          Length = 39

 Score = 45.0 bits (107), Expect = 8e-07
 Identities = 14/38 (36%), Positives = 24/38 (63%)

Query: 222 EHLHTFTGHKAPITGLAISFECDTLYSSSNDRSIKVWN 259
           + L T  GH  P+T +A S + + L S S+D +++VW+
Sbjct: 2   KLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39



 Score = 39.3 bits (92), Expect = 8e-05
 Identities = 13/36 (36%), Positives = 20/36 (55%)

Query: 266 VETLFGHQSAITSLDILAKNRVISSGGTDKSVRVWK 301
           + TL GH   +TS+       +++SG  D +VRVW 
Sbjct: 4   LRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39



 Score = 37.7 bits (88), Expect = 3e-04
 Identities = 13/28 (46%), Positives = 19/28 (67%)

Query: 365 NWITSVACLHNSDLVASGSYDGCVKLWK 392
             +TSVA   + +L+ASGS DG V++W 
Sbjct: 12  GPVTSVAFSPDGNLLASGSDDGTVRVWD 39



 Score = 33.5 bits (77), Expect = 0.012
 Identities = 8/29 (27%), Positives = 13/29 (44%)

Query: 141 HRASITCIAVTSDSKHVYSACKDGIIIKW 169
           H   +T +A + D   + S   DG +  W
Sbjct: 10  HTGPVTSVAFSPDGNLLASGSDDGTVRVW 38



 Score = 28.1 bits (63), Expect = 0.95
 Identities = 6/39 (15%), Positives = 15/39 (38%), Gaps = 1/39 (2%)

Query: 304 EESQLVFNGHSRGI-DMTRRVNDQQFITCGEDGTLSLWS 341
            +      GH+  +  +    +     +  +DGT+ +W 
Sbjct: 1   GKLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39



 Score = 28.1 bits (63), Expect = 1.00
 Identities = 7/27 (25%), Positives = 18/27 (66%)

Query: 192 PILALAVANNNELVASGDNNSNIKLWS 218
           P+ ++A + +  L+ASG ++  +++W 
Sbjct: 13  PVTSVAFSPDGNLLASGSDDGTVRVWD 39


>gnl|CDD|197651 smart00320, WD40, WD40 repeats.  Note that these repeats are
           permuted with respect to the structural repeats (blades)
           of the beta propeller domain.
          Length = 40

 Score = 43.8 bits (104), Expect = 2e-06
 Identities = 17/40 (42%), Positives = 24/40 (60%)

Query: 220 SLEHLHTFTGHKAPITGLAISFECDTLYSSSNDRSIKVWN 259
           S E L T  GH  P+T +A S +   L S S+D +IK+W+
Sbjct: 1   SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40



 Score = 38.1 bits (89), Expect = 2e-04
 Identities = 10/36 (27%), Positives = 21/36 (58%)

Query: 266 VETLFGHQSAITSLDILAKNRVISSGGTDKSVRVWK 301
           ++TL GH   +TS+      + ++SG  D ++++W 
Sbjct: 5   LKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40



 Score = 38.1 bits (89), Expect = 3e-04
 Identities = 13/28 (46%), Positives = 17/28 (60%)

Query: 365 NWITSVACLHNSDLVASGSYDGCVKLWK 392
             +TSVA   +   +ASGS DG +KLW 
Sbjct: 13  GPVTSVAFSPDGKYLASGSDDGTIKLWD 40



 Score = 34.6 bits (80), Expect = 0.004
 Identities = 11/31 (35%), Positives = 16/31 (51%)

Query: 139 KPHRASITCIAVTSDSKHVYSACKDGIIIKW 169
           K H   +T +A + D K++ S   DG I  W
Sbjct: 9   KGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39



 Score = 28.4 bits (64), Expect = 0.59
 Identities = 9/27 (33%), Positives = 18/27 (66%)

Query: 192 PILALAVANNNELVASGDNNSNIKLWS 218
           P+ ++A + + + +ASG ++  IKLW 
Sbjct: 14  PVTSVAFSPDGKYLASGSDDGTIKLWD 40



 Score = 28.0 bits (63), Expect = 1.1
 Identities = 7/34 (20%), Positives = 15/34 (44%), Gaps = 1/34 (2%)

Query: 309 VFNGHSRGI-DMTRRVNDQQFITCGEDGTLSLWS 341
              GH+  +  +    + +   +  +DGT+ LW 
Sbjct: 7   TLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40


>gnl|CDD|240412 PTZ00420, PTZ00420, coronin; Provisional.
          Length = 568

 Score = 38.0 bits (88), Expect = 0.010
 Identities = 29/91 (31%), Positives = 41/91 (45%), Gaps = 14/91 (15%)

Query: 229 GHKAPITGLAISFE---CDTLYSSSNDRSIKVWNI--EDRAYVET------LFGHQSAIT 277
           GH + I  L + F     + L S S D +I+VW I   D +  E       L GH+  I+
Sbjct: 72  GHTSSI--LDLQFNPCFSEILASGSEDLTIRVWEIPHNDESVKEIKDPQCILKGHKKKIS 129

Query: 278 SLDILAKNRVI-SSGGTDKSVRVWKIQEESQ 307
            +D    N  I  S G D  V +W I+ E +
Sbjct: 130 IIDWNPMNYYIMCSSGFDSFVNIWDIENEKR 160



 Score = 33.4 bits (76), Expect = 0.25
 Identities = 24/89 (26%), Positives = 39/89 (43%), Gaps = 12/89 (13%)

Query: 266 VETLFGHQSAITSLDI-LAKNRVISSGGTDKSVRVWKIQEES---------QLVFNGHSR 315
           V  L GH S+I  L      + +++SG  D ++RVW+I             Q +  GH +
Sbjct: 67  VIKLKGHTSSILDLQFNPCFSEILASGSEDLTIRVWEIPHNDESVKEIKDPQCILKGHKK 126

Query: 316 GIDMTRRVNDQQFITC--GEDGTLSLWSI 342
            I +        +I C  G D  +++W I
Sbjct: 127 KISIIDWNPMNYYIMCSSGFDSFVNIWDI 155


>gnl|CDD|177776 PLN00181, PLN00181, protein SPA1-RELATED; Provisional.
          Length = 793

 Score = 35.4 bits (81), Expect = 0.067
 Identities = 40/183 (21%), Positives = 74/183 (40%), Gaps = 22/183 (12%)

Query: 205 VASGDNNSNIKLWSSSLEHLHT-FTGHKAPITGLAISFECDTLYSS-SNDRSIKVWNIED 262
           VAS +    +++W  +   L T    H+  +  +  S    TL +S S+D S+K+W+I  
Sbjct: 548 VASSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSADPTLLASGSDDGSVKLWSINQ 607

Query: 263 RAYVETLFGHQSAITSLDILAKNRVISSGGTDKSVRVWKIQEES--QLVFNGHSRGIDMT 320
              + T+    +           R ++ G  D  V  + ++          GHS+ +   
Sbjct: 608 GVSIGTIKTKANICCVQFPSESGRSLAFGSADHKVYYYDLRNPKLPLCTMIGHSKTVSYV 667

Query: 321 RRVNDQQFITCGEDGTLSLWSI------LKKVPTHT------------IPVAHGYDSSSS 362
           R V+    ++   D TL LW +      + + P H+            + V+ GY ++ S
Sbjct: 668 RFVDSSTLVSSSTDNTLKLWDLSMSISGINETPLHSFMGHTNVKNFVGLSVSDGYIATGS 727

Query: 363 EPN 365
           E N
Sbjct: 728 ETN 730


>gnl|CDD|173611 PTZ00421, PTZ00421, coronin; Provisional.
          Length = 493

 Score = 34.5 bits (79), Expect = 0.10
 Identities = 21/87 (24%), Positives = 39/87 (44%), Gaps = 5/87 (5%)

Query: 216 LWSSSLEHLHTFTGHKAPITGLAISFE---CDTLYSSSNDRSIKVWNIEDRAYVETLFGH 272
           L  +  + +    GH   + G+ +SF     + L S+  D  + VW++E    VE +  H
Sbjct: 110 LTQNISDPIVHLQGHTKKV-GI-VSFHPSAMNVLASAGADMVVNVWDVERGKAVEVIKCH 167

Query: 273 QSAITSLDILAKNRVISSGGTDKSVRV 299
              ITSL+      ++ +   DK + +
Sbjct: 168 SDQITSLEWNLDGSLLCTTSKDKKLNI 194



 Score = 33.3 bits (76), Expect = 0.26
 Identities = 25/85 (29%), Positives = 40/85 (47%), Gaps = 11/85 (12%)

Query: 229 GHKAPITGLAIS-FECDTLYSSSNDRSIKVWNIEDRAYVET-------LFGH--QSAITS 278
           G + PI  +A + F+   L+++S D +I  W I +    +        L GH  +  I S
Sbjct: 73  GQEGPIIDVAFNPFDPQKLFTASEDGTIMGWGIPEEGLTQNISDPIVHLQGHTKKVGIVS 132

Query: 279 LDILAKNRVISSGGTDKSVRVWKIQ 303
               A N V++S G D  V VW ++
Sbjct: 133 FHPSAMN-VLASAGADMVVNVWDVE 156



 Score = 31.0 bits (70), Expect = 1.2
 Identities = 35/162 (21%), Positives = 72/162 (44%), Gaps = 12/162 (7%)

Query: 153 DSKHVYSACKDGIIIKWLL--ETRSKVCSSPFV--KTDRKSYPPILALAVANNNELVASG 208
           D + +++A +DG I+ W +  E  ++  S P V  +   K    +     A N  ++AS 
Sbjct: 87  DPQKLFTASEDGTIMGWGIPEEGLTQNISDPIVHLQGHTKKVGIVSFHPSAMN--VLASA 144

Query: 209 DNNSNIKLWS-SSLEHLHTFTGHKAPITGLAISFECDTLYSSSNDRSIKVWNIEDRAYVE 267
             +  + +W     + +     H   IT L  + +   L ++S D+ + + +  D   V 
Sbjct: 145 GADMVVNVWDVERGKAVEVIKCHSDQITSLEWNLDGSLLCTTSKDKKLNIIDPRDGTIVS 204

Query: 268 TLFGHQSAITSLDILAKNR-VISSGGTDKS----VRVWKIQE 304
           ++  H SA +   + AK + +I + G  KS    + +W  ++
Sbjct: 205 SVEAHASAKSQRCLWAKRKDLIITLGCSKSQQRQIMLWDTRK 246


>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
          Length = 509

 Score = 33.8 bits (78), Expect = 0.20
 Identities = 21/95 (22%), Positives = 42/95 (44%), Gaps = 2/95 (2%)

Query: 7   NKKPAATNNNSLKSKKRK-FSEKPTKSKKTVKPDLDEELTSDEDDEGIGFDSQDFEEHSE 65
            KK     ++S K++K+    +    +       L++    D+DD+    D  D ++  +
Sbjct: 93  KKKLKDELDSSKKAEKKNALDKDDDLNYVKDIDVLNQADDDDDDDDDDDLDDDDIDDDDD 152

Query: 66  VEEDLTEQERKVILAKKYLEEIEKVEKEKLNDSKD 100
            E+D  + +   +  +   E+ E  E EKL+D  D
Sbjct: 153 DEDDDEDDDDDDVDDEDE-EKKEAKELEKLSDDDD 186


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 33.5 bits (77), Expect = 0.21
 Identities = 20/82 (24%), Positives = 34/82 (41%), Gaps = 1/82 (1%)

Query: 38  PDLDEELTSDEDDEGIGFDSQ-DFEEHSEVEEDLTEQERKVILAKKYLEEIEKVEKEKLN 96
            D D E  SD++    GF+S+ +        +  T+ +R     +K LE   K EK+   
Sbjct: 245 SDDDGEEESDDESAWEGFESEYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKK 304

Query: 97  DSKDKHVLKKLETVVLEQEGKL 118
                  LK++   V ++E   
Sbjct: 305 KLAQLARLKEIAKEVAQKEKAR 326


>gnl|CDD|215521 PLN02967, PLN02967, kinase.
          Length = 581

 Score = 33.9 bits (77), Expect = 0.21
 Identities = 21/82 (25%), Positives = 39/82 (47%), Gaps = 8/82 (9%)

Query: 6   RNKKPAATNNNSLKSKKRKFSEKPTKSKKTVKPDLDEELTSDEDDEGIGFDSQDFEEHSE 65
           R ++ AA  ++ ++ +K   +EK  + ++ VK   +     D +D+G   +  D EE   
Sbjct: 116 RTRRKAAAASSDVEEEK---TEKKVRKRRKVKKMDE-----DVEDQGSESEVSDVEESEF 167

Query: 66  VEEDLTEQERKVILAKKYLEEI 87
           V     E E ++ L K   E+I
Sbjct: 168 VTSLENESEEELDLEKDDGEDI 189


>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
           subunit (TFIIF-alpha).  Transcription initiation factor
           IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
           II-associating protein 74 (RAP74) is the large subunit
           of transcription factor IIF (TFIIF), which is essential
           for accurate initiation and stimulates elongation by RNA
           polymerase II.
          Length = 528

 Score = 33.4 bits (76), Expect = 0.24
 Identities = 31/109 (28%), Positives = 45/109 (41%), Gaps = 17/109 (15%)

Query: 19  KSKKRKFSEKPTKSKKTVKPDLDEELTSDEDDEGIGFDSQ--DFEEH--------SEVEE 68
           + K +K  +K  K+KK  K D D++     DD+   +DS   D E          S    
Sbjct: 238 EEKSKKKKKKLAKNKK--KLDDDKKGKRGGDDDADEYDSDDGDDEGREEDYISDSSASGN 295

Query: 69  DLTEQERKVILAKKYLEEIEKVEKEKLNDSKDKHVLKKLETVVLEQEGK 117
           D  E+E      K   E   K E E+  DS++    K  E   L ++GK
Sbjct: 296 DPEERED-----KLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGK 339


>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
           unknown].
          Length = 984

 Score = 33.7 bits (77), Expect = 0.26
 Identities = 24/95 (25%), Positives = 43/95 (45%), Gaps = 17/95 (17%)

Query: 39  DLDEELTSDEDDEGIGFDSQDFEEHSEVEEDLTEQERKVILAKKYLEEIEKVEKEKLNDS 98
           DL +  T+D +D         F E +  E+ LT++E ++   +  LE +   E  +L+ S
Sbjct: 715 DLFDCGTADTEDA--------FREAAREEQQLTQRESRLESLEAQLEGVAA-EAYELSAS 765

Query: 99  KDKHVLKKLETVVLE--------QEGKLRVNIADR 125
            D+  LK+ E  +LE        +  +L   +A  
Sbjct: 766 LDQRELKEEELALLEEAIDALDEEVEELHAQVAAL 800


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 32.8 bits (75), Expect = 0.29
 Identities = 16/51 (31%), Positives = 23/51 (45%), Gaps = 9/51 (17%)

Query: 24  KFSEKPTKSKKTVKPDLDEELTSDEDDEGIGFDSQDFEEHSEVEEDLTEQE 74
           K S    K K+    D+DEE   DE++E         EE  E +ED  + +
Sbjct: 152 KLSMLEKKLKELEAEDVDEEDEKDEEEE---------EEEEEEDEDFDDDD 193


>gnl|CDD|218312 pfam04889, Cwf_Cwc_15, Cwf15/Cwc15 cell cycle control protein.
           This family represents Cwf15/Cwc15 (from
           Schizosaccharomyces pombe and Saccharomyces cerevisiae
           respectively) and their homologues. The function of
           these proteins is unknown, but they form part of the
           spliceosome and are thus thought to be involved in mRNA
           splicing.
          Length = 241

 Score = 32.8 bits (75), Expect = 0.30
 Identities = 17/107 (15%), Positives = 39/107 (36%), Gaps = 15/107 (14%)

Query: 6   RNKKPAATNNNSLKS--KKRKFSEKPTKSKKTVKPDLDEE---LTSDEDDEGIGFDSQDF 60
           +  K A  + +   +     +  E   +  +  +  ++E+     +D+ D     DS D 
Sbjct: 74  KENKLAIEDADKSTNLDASNEGDEDDDEEDEIKRKRIEEDARNSDADDSDSSSDSDSSDD 133

Query: 61  EEHSEVEEDLTE----------QERKVILAKKYLEEIEKVEKEKLND 97
           +   +  ED T           +ER     ++  E+  + EK +  +
Sbjct: 134 DSDDDDSEDETAALLRELEKIKKERAEEKEREEEEKAAEEEKAREEE 180


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 33.1 bits (76), Expect = 0.31
 Identities = 21/66 (31%), Positives = 38/66 (57%), Gaps = 8/66 (12%)

Query: 70  LTEQERKVILAKKYLEEIEKVEKEKLN----DSKDKHVLKKLETVVLEQEGKLRVN--IA 123
           LTE+ RK +L + Y  E++++EKE       + K +  L++LE  VL++E +L     +A
Sbjct: 445 LTEEHRKELLEE-YTAELKRIEKELKEIEEKERKLRKELRELEK-VLKKESELIKLKELA 502

Query: 124 DRLANA 129
           ++L   
Sbjct: 503 EQLKEL 508



 Score = 28.5 bits (64), Expect = 10.0
 Identities = 19/60 (31%), Positives = 30/60 (50%), Gaps = 4/60 (6%)

Query: 64  SEVEEDLTEQERKVILAKKYL----EEIEKVEKEKLNDSKDKHVLKKLETVVLEQEGKLR 119
            E E++L E  R++      L    EE+EK+EKE     + K  +++LE  +   EG  R
Sbjct: 196 KEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKR 255


>gnl|CDD|218771 pfam05835, Synaphin, Synaphin protein.  This family consists of
           several eukaryotic synaphin 1 and 2 proteins.
           Synaphin/complexin is a cytosolic protein that
           preferentially binds to syntaxin within the SNARE
           complex. Synaphin promotes SNAREs to form precomplexes
           that oligomerise into higher order structures. A peptide
           from the central, syntaxin binding domain of synaphin
           competitively inhibits these two proteins from
           interacting and prevents SNARE complexes from
           oligomerising. It is thought that oligomerisation of
           SNARE complexes into a higher order structure creates a
           SNARE scaffold for efficient, regulated fusion of
           synaptic vesicles. Synaphin promotes neuronal exocytosis
           by promoting interaction between the complementary
           syntaxin and synaptobrevin transmembrane regions that
           reside in opposing membranes prior to fusion.
          Length = 139

 Score = 31.8 bits (72), Expect = 0.34
 Identities = 26/87 (29%), Positives = 38/87 (43%), Gaps = 9/87 (10%)

Query: 33  KKTVKPDLDEELTSDEDDEGIGFDSQDFEEHSEVEEDLTEQE--RKVILAKKYLEEIEKV 90
           K+ V   L +        E  G +S   EE  E++E L E E  RK    K  +EE    
Sbjct: 6   KQMVGNQLKDVKGMLGGKEDEGDESDAEEEDEEIQEALREAEEERKAKHRK--MEE---- 59

Query: 91  EKEKL-NDSKDKHVLKKLETVVLEQEG 116
           E+E +    +DK+ +KK E    E + 
Sbjct: 60  EREVMRQGIRDKYGIKKKEEDEEEPQA 86


>gnl|CDD|197742 smart00466, SRA, SET and RING finger associated domain.  Domain of
           unknown function in SET domain containing proteins and
           in Deinococcus radiodurans DRA1533.
          Length = 155

 Score = 32.0 bits (73), Expect = 0.35
 Identities = 22/102 (21%), Positives = 36/102 (35%), Gaps = 14/102 (13%)

Query: 271 GHQSAITSLDILAKNR----VISSGGTDKSVRVWKIQEESQLVFNGHSRGIDMTRRVNDQ 326
             Q+ I  L+          V+SSGG +         +   L++ G         +  DQ
Sbjct: 31  PTQAGIDGLESDEGEPGATSVVSSGGYED-----DTDDGDVLIYTGQGGRDMTHGQPEDQ 85

Query: 327 QFITCGEDGTLSL-WSILKKVPTHTIPVAHGYDSSSSEPNWI 367
           +     E G L+L  S  K +P   +    GY   +    +I
Sbjct: 86  KL----ERGNLALEASCRKGIPVRVVRGMKGYSKYAPGKGYI 123


>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
          (DUF1754).  This is a eukaryotic protein family of
          unknown function.
          Length = 90

 Score = 30.5 bits (69), Expect = 0.42
 Identities = 24/74 (32%), Positives = 38/74 (51%), Gaps = 3/74 (4%)

Query: 16 NSLKSKKRKFSEKPTKSKKTVKPDLDEELTSDEDDEGIGFDSQDFEEHSEVEED--LTEQ 73
            +K KK+K  +K    ++ V    +EE +S E D   G + +D E+  + E+   LTE 
Sbjct: 17 IDVKKKKKKKKKKNKSKEEVVTEKEEEEKSSAESDLKEGEEDEDNEKIEQEEDGMNLTEA 76

Query: 74 ERKVILA-KKYLEE 86
          ER    A +K L+E
Sbjct: 77 ERAFEEAQRKRLKE 90


>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding.  This
           family is the highly conserved central region of a
           number of metazoan proteins referred to as growth-arrest
           proteins. In mouse, Gas8 is predominantly a testicular
           protein, whose expression is developmentally regulated
           during puberty and spermatogenesis. In humans, it is
           absent in infertile males who lack the ability to
           generate gametes. The localisation of Gas8 in the
           motility apparatus of post-meiotic gametocytes and
           mature spermatozoa, together with the detection of Gas8
           also in cilia at the apical surfaces of epithelial cells
           lining the pulmonary bronchi and Fallopian tubes
           suggests that the Gas8 protein may have a role in the
           functioning of motile cellular appendages. Gas8 is a
           microtubule-binding protein localised to regions of
           dynein regulation in mammalian cells.
          Length = 201

 Score = 31.8 bits (73), Expect = 0.45
 Identities = 22/61 (36%), Positives = 34/61 (55%), Gaps = 2/61 (3%)

Query: 61  EEHSEVEEDLTEQERKVILA--KKYLEEIEKVEKEKLNDSKDKHVLKKLETVVLEQEGKL 118
           EEH+E       QE K ++   KK  EE+E++ K+  +  KDK  LK L+  + E E +L
Sbjct: 43  EEHNEKLMAEIAQENKRLVEPLKKAEEEVEELRKKLKDYEKDKQSLKNLKARLKELEKEL 102

Query: 119 R 119
           +
Sbjct: 103 K 103


>gnl|CDD|222926 PHA02745, PHA02745, hypothetical protein; Provisional.
          Length = 265

 Score = 31.8 bits (72), Expect = 0.63
 Identities = 23/102 (22%), Positives = 41/102 (40%), Gaps = 1/102 (0%)

Query: 5   IRNKKPAATNNNSLKSKKRKFSEKPTKSKKTVKPDLDEELTSDEDDEGIGFDSQDFEEHS 64
           +R     A   + +K   R+         K +  DL E+++ D D      D    EEH 
Sbjct: 18  LRLLHSPAIRPSVIKEPTRRNYLSNCSQVKKLLEDLFEDISGDLDIRTSTGDYVSEEEHF 77

Query: 65  EVEEDLTEQERKVILAKKYLEEIEKVEKEKLNDSKDKHVLKK 106
               D  E +    L  K  +++  +E  K ++S+ ++ L K
Sbjct: 78  -GRVDTQEVDEFRGLNAKIRDKVLPIEDNKHDNSETRNPLNK 118


>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family.  This family of
           proteins contain a band 4.1 domain (pfam00373), at their
           amino terminus. This family represents the rest of these
           proteins.
          Length = 244

 Score = 31.6 bits (72), Expect = 0.74
 Identities = 18/95 (18%), Positives = 37/95 (38%), Gaps = 22/95 (23%)

Query: 19  KSKKRKFSEKPTKSKKTVKPDLDEELTSDEDDEGIGFDSQDFEEHSEVEEDLTEQERKVI 78
           + +  + +  PT       P   E+L  D++ E     S D E   ++ +D +E+ER   
Sbjct: 123 RQELLEAAAAPTAPPHVAAPVNGEQLEPDDNGEEA---SADLETDPDM-KDRSEEER--- 175

Query: 79  LAKKYLEEIEKVEKEKLNDSKDKHVLKKLETVVLE 113
                   +   E       K++ +  +L+ +  E
Sbjct: 176 --------VTYAE-------KNERLQTQLQALKSE 195


>gnl|CDD|109884 pfam00846, Hanta_nucleocap, Hantavirus nucleocapsid protein. 
          Length = 428

 Score = 31.8 bits (72), Expect = 0.75
 Identities = 25/80 (31%), Positives = 44/80 (55%), Gaps = 11/80 (13%)

Query: 65  EVEEDLTEQERKVILAKKYLEEIEK-VEKEKLNDSKDKHVLKKLETVVLEQEGK---LRV 120
           E++E++T  E+++++A++ L++ EK VE +   D  +K  L+     V   E K   L+ 
Sbjct: 6   ELQEEITAHEQQLVIARQKLKDAEKAVEVDP--DDVNKSTLQSRRAAVSALEAKLAELKR 63

Query: 121 NIADRLANATVVCTKFNGKP 140
            +ADR+A       K  GKP
Sbjct: 64  QLADRIATQ-----KLAGKP 78


>gnl|CDD|235229 PRK04156, gltX, glutamyl-tRNA synthetase; Provisional.
          Length = 567

 Score = 31.7 bits (73), Expect = 0.96
 Identities = 16/60 (26%), Positives = 31/60 (51%), Gaps = 9/60 (15%)

Query: 66  VEE--DLTEQERKVILAKKYLEEIEKVEKEKLNDSKDKHVLKKLETVVLEQEGKLRVNIA 123
           VEE   L+ +E++  L +   E +E+ E++K    ++K  L  L      ++GK+ +  A
Sbjct: 55  VEEVNSLSLEEQRERLEELAPELLEEEEEKK----EEKKGLPPLPNA---EKGKVVMRFA 107


>gnl|CDD|222512 pfam14048, MBD_C, C-terminal domain of methyl-CpG binding protein
          2 and 3.  CpG-methylation is a frequently occurring
          epigenetic modification of vertebrate genomes resulting
          in transcriptional repression. This domain was found at
          the C-terminus of the methyl-CpG-binding domain (MBD)
          containing proteins MBD2 and MBD3, the latter was shown
          to not bind directly to methyl-CpG DNA but rather
          interact with components of the NuRD/Mi2 complex, an
          abundant deacetylase complex. The domain is subject to
          structure determination by the Joint Center of
          Structural Genomics.
          Length = 96

 Score = 29.6 bits (67), Expect = 1.0
 Identities = 11/21 (52%), Positives = 14/21 (66%)

Query: 67 EEDLTEQERKVILAKKYLEEI 87
          EED+  QER+V  A+K L E 
Sbjct: 75 EEDIRRQERRVKDARKRLAEA 95


>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein. 
          Length = 529

 Score = 31.3 bits (71), Expect = 1.2
 Identities = 20/70 (28%), Positives = 33/70 (47%), Gaps = 6/70 (8%)

Query: 30 TKSKKTVKPDLDEELTSDEDDEGIGFDSQDFEEHSEVEEDLTEQERKVILAKKYLEEIEK 89
           K  +   PD +EE    E+ +     + D EE  + EE+  E+++K     K ++E   
Sbjct: 28 EKEVEKEVPD-EEEEEEKEEKKEEEEKTTDKEEEVDEEEEKEEKKKK----TKKVKE-TT 81

Query: 90 VEKEKLNDSK 99
           E E LN +K
Sbjct: 82 TEWELLNKTK 91


>gnl|CDD|200601 cd10979, CE4_PuuE_like, Putative catalytic domain of
           uncharacterized prokaryotic polysaccharide deacetylases
           similar to bacterial PuuE allantoinases.  The family
           includes a group of uncharacterized prokaryotic
           polysaccharide deacetylases (DCAs) that show high
           sequence similarity to the catalytic domain of bacterial
           PuuE (purine utilization E) allantoinases. PuuE
           allantoinase specifically catalyzes the hydrolysis of
           (S)-allantoin into allantoic acid. It functions as a
           homotetramer. Its monomer is composed of a 7-stranded
           barrel with detectable sequence similarity to the
           6-stranded barrel NodB homology domain of DCA-like
           proteins in the CE4 superfamily, which removes N-linked
           or O-linked acetyl groups from cell wall
           polysaccharides. PuuE allantoinase appears to be
           metal-independent and acts on a small substrate
           molecule, which is distinct from the common feature of
           DCAs which are normally metal ion dependent and
           recognize multimeric substrates.
          Length = 281

 Score = 30.7 bits (70), Expect = 1.3
 Identities = 13/62 (20%), Positives = 25/62 (40%), Gaps = 11/62 (17%)

Query: 190 YPPILALAVANNNELVASGDNNSNI----------KLWSSSLEHLHTFTGHKAPITGLAI 239
           YP ++        E +A G +NS +          ++ + SL+ +   TG + P   L+ 
Sbjct: 91  YPELIEAIRERGWEFIAHGISNSTLHAGLDEAQEREVIAESLDRIEKATG-QRPRGWLSP 149

Query: 240 SF 241
             
Sbjct: 150 GL 151


>gnl|CDD|236410 PRK09202, nusA, transcription elongation factor NusA; Validated.
          Length = 470

 Score = 31.0 bits (71), Expect = 1.4
 Identities = 29/108 (26%), Positives = 44/108 (40%), Gaps = 18/108 (16%)

Query: 20  SKKRKFSEKPTKSKKTVKPDLDEELTSDEDDEGIGFDS---------QDFEEHSEVEEDL 70
           S+KR+              D+DEE+     +EG  F S         ++  E    +E+ 
Sbjct: 344 SEKRQAEFDAILDLFMEALDIDEEIAQLLVEEG--FSSLEELAYVPVEELLEIEGFDEET 401

Query: 71  TEQERKVILAKKYLEEIEKVEKEKLNDSKDKHVLKKLETVVLEQEGKL 118
            E+ R+   AK+ LE     ++EKL D      L  LE +  E   KL
Sbjct: 402 VEELRER--AKEALETEALAQEEKLADD-----LLSLEGLDRELAFKL 442


>gnl|CDD|227447 COG5117, NOC3, Protein involved in the nuclear export of
           pre-ribosomes [Translation, ribosomal structure and
           biogenesis / Intracellular trafficking and secretion].
          Length = 657

 Score = 31.1 bits (70), Expect = 1.4
 Identities = 19/94 (20%), Positives = 34/94 (36%), Gaps = 9/94 (9%)

Query: 5   IRNKKPAATNNNSLKSKKRKFSEKPTK--SKKTVKPDLDEELTSDEDDEGIGFDSQDFEE 62
           +R +  ++ NN S      K  +      +         E   S E D  I  + Q   E
Sbjct: 60  LRPRVSSSWNNESYNRLPIKTKDNVVADVNNGEEFLSESESEASLEIDSDIKDEKQKSLE 119

Query: 63  HSEVEED-------LTEQERKVILAKKYLEEIEK 89
             ++  +        +E+ER   +  K +EE E+
Sbjct: 120 EQKIAPEIPVKQQIDSEKERIASICTKIIEEPEE 153


>gnl|CDD|227429 COG5098, COG5098, Chromosome condensation complex Condensin,
           subunit D2 [Chromatin structure and dynamics / Cell
           division and chromosome partitioning].
          Length = 1128

 Score = 30.7 bits (69), Expect = 1.7
 Identities = 18/79 (22%), Positives = 28/79 (35%)

Query: 8   KKPAATNNNSLKSKKRKFSEKPTKSKKTVKPDLDEELTSDEDDEGIGFDSQDFEEHSEVE 67
           +K   +  N L S K    E     +K V+ D  +  T  E       +S+   E   + 
Sbjct: 433 EKNLGSAENQLNSLKSGLQETLCDGEKEVEQDEGQCRTELEGSFNKSAESRIENEVENIN 492

Query: 68  EDLTEQERKVILAKKYLEE 86
              T    K+ L   Y E+
Sbjct: 493 ATNTSVLMKLKLMIVYYED 511


>gnl|CDD|227772 COG5485, COG5485, Predicted ester cyclase [General function
           prediction only].
          Length = 131

 Score = 29.3 bits (66), Expect = 1.9
 Identities = 11/48 (22%), Positives = 21/48 (43%), Gaps = 6/48 (12%)

Query: 226 TFTGHKA------PITGLAISFECDTLYSSSNDRSIKVWNIEDRAYVE 267
           TF    +      P  G  + F  +  Y   N + ++VW++ D+  +E
Sbjct: 80  TFDCTPSGEIMGIPPNGKRVRFSENVFYEFENGKIVEVWSVIDKMAIE 127


>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein.  This family includes proteins
           related to Mpp10 (M phase phosphoprotein 10). The U3
           small nucleolar ribonucleoprotein (snoRNP) is required
           for three cleavage events that generate the mature 18S
           rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
           depletion of Mpp10, a U3 snoRNP-specific protein, halts
           18S rRNA production and impairs cleavage at the three U3
           snoRNP-dependent sites.
          Length = 613

 Score = 30.7 bits (69), Expect = 1.9
 Identities = 18/81 (22%), Positives = 35/81 (43%), Gaps = 4/81 (4%)

Query: 19  KSKKRKFSEKPTKSKKTVKPDLDEELTSDEDDEGIGFDSQDFEEHSEVEE-DLTEQERKV 77
             K+    E  +K ++  + D ++E   DE++       +   +   +E  DL   +   
Sbjct: 280 PDKEAVKKEADSKPEEEDEEDDEQEDDQDEEEPPEAAMDKVKLDEPVLEGVDLESPKELS 339

Query: 78  ILAK---KYLEEIEKVEKEKL 95
              K   K  ++IE++EKE L
Sbjct: 340 SFEKRQAKLKQQIEQLEKENL 360


>gnl|CDD|220284 pfam09538, FYDLN_acid, Protein of unknown function (FYDLN_acid). 
          Members of this family are bacterial proteins with a
          conserved motif [KR]FYDLN, sometimes flanked by a pair
          of CXXC motifs, followed by a long region of low
          complexity sequence in which roughly half the residues
          are Asp and Glu, including multiple runs of five or
          more acidic residues. The function of members of this
          family is unknown.
          Length = 104

 Score = 28.8 bits (65), Expect = 2.1
 Identities = 8/53 (15%), Positives = 21/53 (39%)

Query: 17 SLKSKKRKFSEKPTKSKKTVKPDLDEELTSDEDDEGIGFDSQDFEEHSEVEED 69
          +        +E   K  +  + + D  L  D+DD+         ++  ++++D
Sbjct: 41 AKSRAPAADAEDAAKKDEDEEDEDDVVLDDDDDDDDDDDLPDLDDDDVDLDDD 93


>gnl|CDD|107089 PHA01819, PHA01819, hypothetical protein.
          Length = 129

 Score = 29.2 bits (64), Expect = 2.2
 Identities = 27/85 (31%), Positives = 40/85 (47%), Gaps = 9/85 (10%)

Query: 30  TKSKKTVKPDLDEELTSDEDDEGIGFDSQDFEEHSEVEEDLTEQERKVILAKKYLEEIEK 89
           T+  + VKPD D+E TS+  +E +    +  E   E  E+ TE         K +E IE+
Sbjct: 2   TEFDEIVKPD-DKEETSESTEENLESTEETSESTEESTEESTED--------KTVETIEE 52

Query: 90  VEKEKLNDSKDKHVLKKLETVVLEQ 114
             + KL  +       K + VVLEQ
Sbjct: 53  ENENKLEPTTTDEDSSKFDPVVLEQ 77


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 30.2 bits (69), Expect = 2.4
 Identities = 23/78 (29%), Positives = 37/78 (47%), Gaps = 13/78 (16%)

Query: 53  IGFDSQDFEEHSE-VEEDLTEQERKVILAKKYLEEIEKV------EKEKLNDSKDKHVLK 105
           IG D +   E    +EE   E E+K   A+  L+E EK+      +KEKL + +DK + +
Sbjct: 511 IGEDKEKLNELIASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEE 570

Query: 106 KLETVVLEQEGKLRVNIA 123
                  E+E +  +  A
Sbjct: 571 ------AEKEAQQAIKEA 582


>gnl|CDD|219668 pfam07964, Red1, Rec10 / Red1.  Rec10 / Red1 is involved in meiotic
           recombination and chromosome segregation during
           homologous chromosome formation. This protein localises
           to the synaptonemal complex in S. cerevisiae and the
           analogous structures (linear elements) in S. pombe. This
           family is currently only found in fungi.
          Length = 706

 Score = 30.2 bits (68), Expect = 2.7
 Identities = 30/128 (23%), Positives = 47/128 (36%), Gaps = 7/128 (5%)

Query: 9   KPAATNNNSLKSKKRKFSEKPTKSKKTVKPDLDEELTSDEDDEGIGFDSQDFEEHSEVEE 68
           K    NNNS ++ K K   K   + K    D+ E  +   + E    D Q     S ++ 
Sbjct: 485 KKQLANNNS-QNIKSKKVVKAKTNNKANLQDVGECSSPPNNKE--KNDKQTSTSSSVLKS 541

Query: 69  DLTEQERKVILAK-KYLEEIEKVEKEKLNDSKDKHVLKKLETV---VLEQEGKLRVNIAD 124
           D +  E +   A  K LE+     K       + + L  + T    V   +    VNI D
Sbjct: 542 DRSSIEVRNPNANVKKLEDTTYNAKFPTVSKNNAYTLVDISTSEDAVNSADDTRSVNITD 601

Query: 125 RLANATVV 132
              + T+ 
Sbjct: 602 ISESTTIS 609


>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein.  This protein is found to be part
           of a large ribonucleoprotein complex containing the U3
           snoRNA. Depletion of the Utp proteins impedes production
           of the 18S rRNA, indicating that they are part of the
           active pre-rRNA processing complex. This large RNP
           complex has been termed the small subunit (SSU)
           processome.
          Length = 728

 Score = 30.0 bits (68), Expect = 3.1
 Identities = 34/134 (25%), Positives = 53/134 (39%), Gaps = 39/134 (29%)

Query: 7   NKKPAATNNNSLKSKKRKFSEKPTKSKKTVKPDLDEELTSDEDDEGIGFDS--------- 57
           +K          K K     +     + ++K D+D+E   DEDDE + F           
Sbjct: 528 SKAAVKVKKKKKKEKSIDLDDDLIDEEDSIKLDVDDE--EDEDDEELPFLFKQKDLIKEA 585

Query: 58  -------QDFE-EHSEV-------EEDLT-------------EQERKVILAKKYLEEIEK 89
                   +FE E  EV       E DLT             ++++K    +++L +IE 
Sbjct: 586 FAGDDVVAEFEKEKKEVIEEEDPKEIDLTLPGWGSWAGDGIKKRKKKRKRKRRFLTKIEG 645

Query: 90  VEKEKLNDSKDKHV 103
           V+KEK  D K K+V
Sbjct: 646 VKKEKRKDKKLKNV 659


>gnl|CDD|221490 pfam12253, CAF1A, Chromatin assembly factor 1 subunit A.  The
          CAF-1 or chromatin assembly factor-1 consists of three
          subunits, and this is the first, or A. The A domain is
          uniquely required for the progression of S phase in
          mouse cells, independent of its ability to promote
          histone deposition but dependent on its ability to
          interact with HP1 - heterochromatin protein 1-rich
          heterochromatin domains next to centromeres that are
          crucial for chromosome segregation during mitosis. This
          HP1-CAF-1 interaction module functions as a built-in
          replication control for heterochromatin, which, like a
          control barrier, has an impact on S-phase progression
          in addition to DNA-based checkpoints.
          Length = 76

 Score = 27.6 bits (62), Expect = 3.1
 Identities = 13/40 (32%), Positives = 21/40 (52%), Gaps = 4/40 (10%)

Query: 34 KTVKPDLDEELTSD----EDDEGIGFDSQDFEEHSEVEED 69
              P LD +  SD    E++EG   +S+D E+  E ++D
Sbjct: 32 SQDLPGLDYDYDSDAEWEEEEEGEDLESEDEEDEEEDDDD 71


>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain.  Axonemal
           dynein light chain proteins play a dynamic role in
           flagellar and cilia motility. Eukaryotic cilia and
           flagella are complex organelles consisting of a core
           structure, the axoneme, which is composed of nine
           microtubule doublets forming a cylinder that surrounds a
           pair of central singlet microtubules. This
           ultra-structural arrangement seems to be one of the most
           stable micro-tubular assemblies known and is responsible
           for the flagellar and ciliary movement of a large number
           of organisms ranging from protozoan to mammals. This
           light chain interacts directly with the N-terminal half
           of the heavy chains.
          Length = 189

 Score = 29.1 bits (66), Expect = 3.1
 Identities = 14/43 (32%), Positives = 25/43 (58%)

Query: 60  FEEHSEVEEDLTEQERKVILAKKYLEEIEKVEKEKLNDSKDKH 102
            +E  ++EE+  E E++V   +  LE IEK E+E+    + +H
Sbjct: 126 EQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRH 168


>gnl|CDD|234096 TIGR03031, cas_csx12, CRISPR-associated protein Cas9/Csx12, subtype
           II-B/NMENI.  Members of this family of CRISPR-associated
           (cas) protein are found, so far, in CRISPR/cas loci in
           Wolinella succinogenes DSM 1740, Legionella pneumophila
           str. Paris, and Francisella tularensis, where the last
           probably is an example of a degenerate CRISPR locus,
           having neither repeats nor a functional Cas1. The
           characteristic repeat length is 37 base pairs and period
           is about 72. One region of this large protein shows
           sequence similarity to pfam01844, HNH endonuclease
           [Mobile and extrachromosomal element functions, Other].
          Length = 802

 Score = 30.0 bits (67), Expect = 3.5
 Identities = 23/97 (23%), Positives = 36/97 (37%), Gaps = 12/97 (12%)

Query: 39  DLDE----ELTSDEDDEGIGFDSQDFEEHSEVEEDLTEQERKVILAKKYLEEIEKVEKEK 94
           DLDE        +   E + F  +   E + ++  L + +        Y EE  K+   K
Sbjct: 104 DLDEYIKDATEINLLKELLPFKLRSISEDNFIDWFLQKIQSSADYLANYSEEFRKILVSK 163

Query: 95  LNDSKDKHVLKKLETVVLEQEGKLRVNIADRLANATV 131
             + KD   LK           K + NI + L  AT+
Sbjct: 164 TEEKKDDKELK--------NAVKDKYNIKEFLTGATI 192


>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown.
          Length = 214

 Score = 28.9 bits (65), Expect = 4.0
 Identities = 15/72 (20%), Positives = 23/72 (31%)

Query: 7   NKKPAATNNNSLKSKKRKFSEKPTKSKKTVKPDLDEELTSDEDDEGIGFDSQDFEEHSEV 66
            KK        ++  K +  E      K  K D ++E    E++     +    E   E 
Sbjct: 51  AKKSDDQETAEIEEVKEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEESSDENEKET 110

Query: 67  EEDLTEQERKVI 78
           EE       K I
Sbjct: 111 EEKTESNVEKEI 122


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 29.7 bits (67), Expect = 4.0
 Identities = 20/104 (19%), Positives = 41/104 (39%), Gaps = 14/104 (13%)

Query: 16  NSLKSKKRKFSEKPTKSKKTVKPDLDEELTSDEDDEGIGFDSQDFEEHSEVEEDLTEQER 75
             L+ ++ K  E+  + ++ +   L++E+ + +             E  E+E  + E E 
Sbjct: 726 EQLEQEEEKLKERLEELEEDLS-SLEQEIENVKS------------ELKELEARIEELEE 772

Query: 76  KVILAKKYLEEIEKVEKEKLNDSKDKHVLKKLETVVLEQEGKLR 119
            +   ++ L ++E               L KLE  V   E +LR
Sbjct: 773 DLHKLEEALNDLEARLSHSRIPEIQAE-LSKLEEEVSRIEARLR 815


>gnl|CDD|223642 COG0568, RpoD, DNA-directed RNA polymerase, sigma subunit
           (sigma70/sigma32) [Transcription].
          Length = 342

 Score = 29.2 bits (66), Expect = 4.1
 Identities = 29/104 (27%), Positives = 43/104 (41%), Gaps = 32/104 (30%)

Query: 39  DLDEELTSDEDDEGIGF----DSQDFEEHSE-----------VEEDLTEQERKVILA--- 80
            LD  +  DED E   F     S   E+  E           + E LTE+ER+VI     
Sbjct: 234 SLDTPIGDDEDSELGDFLEDDKSVSPEDAVERESLKEDLNEVLAEALTERERRVIRLRFG 293

Query: 81  -----KKYLEEIEKVEKEKLNDSKDKHVLKKLETVVLEQEGKLR 119
                 K LEE+     E+   S+++  ++++E   L    KLR
Sbjct: 294 LDDGEPKTLEEL----GEEFGISRER--VRQIEAKALR---KLR 328


>gnl|CDD|218598 pfam05470, eIF-3c_N, Eukaryotic translation initiation factor 3
           subunit 8 N-terminus.  The largest of the mammalian
           translation initiation factors, eIF3, consists of at
           least eight subunits ranging in mass from 35 to 170 kDa.
           eIF3 binds to the 40 S ribosome in an early step of
           translation initiation and promotes the binding of
           methionyl-tRNAi and mRNA.
          Length = 593

 Score = 29.4 bits (66), Expect = 4.4
 Identities = 40/139 (28%), Positives = 60/139 (43%), Gaps = 30/139 (21%)

Query: 8   KKPAATNN----NSLKSKKRK----FSEKPTKSKKTVKPDL-DEELTSDEDDEGIGFDSQ 58
           KK  + NN    N+LK K +K    F +  T+ ++   P+  DEE   DEDD+  G D +
Sbjct: 99  KKKMSKNNAKALNTLKQKVKKNNKQFEDDITRYRED--PESEDEEEEEDEDDDDDGSDDE 156

Query: 59  DFEEHS----------------EVEEDLTEQERKVILAKKYLEEIEKVEKEKLNDSKDKH 102
           D +E                   V+ED  E E   +  K  LEE +  +K +     D  
Sbjct: 157 DEDEDGVGATEEVAASSESGVDRVKEDDEEDEDADLSKKDVLEEPKMFKKPEEITWDD-- 214

Query: 103 VLKKLETVVLEQEGKLRVN 121
           V KKL+  ++   GK   +
Sbjct: 215 VFKKLKE-IMSARGKKTTD 232


>gnl|CDD|233819 TIGR02307, RNA_lig_RNL2, RNA ligase, Rnl2 family.  Members of this
           family ligate (seal breaks in) RNA. Members so far
           include phage proteins that can counteract a host
           defense of cleavage of specific tRNA molecules,
           trypanosome ligases involved in RNA editing, but no
           prokaryotic host proteins [Mobile and extrachromosomal
           element functions, Prophage functions, Transcription,
           RNA processing].
          Length = 325

 Score = 29.0 bits (65), Expect = 4.7
 Identities = 27/120 (22%), Positives = 47/120 (39%), Gaps = 12/120 (10%)

Query: 5   IRNKKPAATNNNS-LKSKKRKFSEKPTKSKKTVKPDLDEELTSDEDDEGIGFDSQDFEEH 63
            R  K     N + +K K  KFSEK  K+ K +   +   + S+  ++ +        E+
Sbjct: 198 CRPGKWLRNGNRTIIKCKNSKFSEKKNKAGKPILAKV---VLSEAQNKLVALLLCYVTEN 254

Query: 64  ------SEVEEDLTEQERKV--ILAKKYLEEIEKVEKEKLNDSKDKHVLKKLETVVLEQE 115
                 S++  +  +    V  +LA+  L+E  +      N   D   L K   V + QE
Sbjct: 255 RLNNVLSKIGTEPPKDFGMVMGLLAQDALKEFLRETPGITNTQADNRKLIKRSLVFMAQE 314


>gnl|CDD|227494 COG5165, POB3, Nucleosome-binding factor SPN, POB3 subunit
           [Transcription / DNA replication, recombination, and
           repair / Chromatin structure and dynamics].
          Length = 508

 Score = 29.2 bits (65), Expect = 5.2
 Identities = 23/105 (21%), Positives = 45/105 (42%), Gaps = 10/105 (9%)

Query: 1   MSFFIRNKKPAATNNNSLKSKKRKFSEKPTKSKK-TVKPDLDEE-----LTSDEDDEGIG 54
           ++ F+R+  P +   N++   ++   E+   SK    + +  +E     L S  D E I 
Sbjct: 405 LTLFLRS--PGSYTFNNISKDEQGALEQFLHSKGIKARNEEVQERLQTDLGSISDSEDIN 462

Query: 55  FDSQDFEEHSEVEEDLTEQERKVILAKKYLEEIEKVEKEKLNDSK 99
             S   E+ SE E+     +  V  A++Y  +    + E  +D +
Sbjct: 463 MGSAGEEDESEDEDFQMVSDSDV--AEEYDLQAALSDAEGGSDEE 505


>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
           subunit.  This is a family of proteins which are
           subunits of the eukaryotic translation initiation factor
           3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
           cerevisiae protein eIF3j (HCR1) has been shown to be
           required for processing of 20S pre-rRNA and binds to 18S
           rRNA and eIF3 subunits Rpg1p and Prt1p.
          Length = 242

 Score = 28.9 bits (65), Expect = 5.4
 Identities = 20/72 (27%), Positives = 31/72 (43%), Gaps = 3/72 (4%)

Query: 47  DEDDEGIGFDSQDFEEHSEVEEDLTEQE---RKVILAKKYLEEIEKVEKEKLNDSKDKHV 103
           DED++    DS D EE  E EE+  +     +     K  +EE EK ++EK      +  
Sbjct: 26  DEDEDDDVKDSWDEEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELE 85

Query: 104 LKKLETVVLEQE 115
               E  + E+ 
Sbjct: 86  EDTPEDELAEKL 97


>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO.  Bacillus
           spores are protected by a protein shell consisting of
           over 50 different polypeptides, known as the coat. This
           family of proteins has an important morphogenetic role
           in coat assembly, it is involved in the assembly of at
           least 5 different coat proteins including CotB, CotG,
           CotS, CotSA and CotW. It is likely to act at a late
           stage of coat assembly.
          Length = 185

 Score = 28.6 bits (64), Expect = 5.5
 Identities = 20/93 (21%), Positives = 37/93 (39%), Gaps = 2/93 (2%)

Query: 2   SFFIRN--KKPAATNNNSLKSKKRKFSEKPTKSKKTVKPDLDEELTSDEDDEGIGFDSQD 59
           +F I+   +K     N+    K ++  +K    +   + +  E    D  ++    +   
Sbjct: 30  TFIIKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQ 89

Query: 60  FEEHSEVEEDLTEQERKVILAKKYLEEIEKVEK 92
            EE  E  ED+ +QE      KK  +E+   EK
Sbjct: 90  EEEKEEEAEDVKQQEVFSFKRKKPFKEMNLEEK 122


>gnl|CDD|220491 pfam09958, DUF2192, Uncharacterized protein conserved in archaea
           (DUF2192).  This domain, found in various hypothetical
           archaeal proteins, has no known function.
          Length = 231

 Score = 28.4 bits (64), Expect = 5.8
 Identities = 28/95 (29%), Positives = 39/95 (41%), Gaps = 21/95 (22%)

Query: 28  KPTKSKKTVKPDL-DEELTS-----------DEDDEGIGFDSQDFEEHSEVEEDLTEQER 75
           KP +      PDL D+EL S           DED  G       F++   +E    E+  
Sbjct: 45  KPFRGLS--PPDLYDKELASLYVVGKYGLGLDEDYPG------VFDKLFYIENRY-EEAL 95

Query: 76  KVILAKKYLEEIEKVEKEKLNDSKDKHVLKKLETV 110
           ++IL K+  E  EKV  E    S +  V + L  V
Sbjct: 96  EIILEKEPEEAREKVTGELGGKSDENKVARVLRLV 130


>gnl|CDD|223683 COG0610, COG0610, Type I site-specific restriction-modification
           system, R (restriction) subunit and related helicases
           [Defense mechanisms].
          Length = 962

 Score = 29.3 bits (66), Expect = 6.0
 Identities = 15/86 (17%), Positives = 34/86 (39%), Gaps = 3/86 (3%)

Query: 18  LKSKKRKFSEKPTKSKKTVKP--DLDEELTSDEDDEGIGFDSQDFEEHSEVEEDLTEQER 75
           L++   K     +  +   +   +  E+L   +       ++ +     +++     +E 
Sbjct: 822 LQTFNDKNGAYESLKELIERIIKEWIEDLRQKKKLIERLIEAINQYRAKKLDTAEKLEE- 880

Query: 76  KVILAKKYLEEIEKVEKEKLNDSKDK 101
             ILAKK  E  +  E+E LN+ +  
Sbjct: 881 LYILAKKEEEFKQFAEEEGLNEEELA 906


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 29.2 bits (65), Expect = 6.1
 Identities = 17/54 (31%), Positives = 25/54 (46%), Gaps = 8/54 (14%)

Query: 41  DEELTSDEDDEGIGFDSQDFEEHSEVEEDLTEQERKVILAKKYLEEIEKVEKEK 94
            +E   D      G DS++ EE  E EE+  E+E +        EE E+ E E+
Sbjct: 847 QDEKGVDGGGGSDGGDSEEEEEEEEEEEEEEEEEEE--------EEEEEEENEE 892


>gnl|CDD|227466 COG5137, COG5137, Histone chaperone involved in gene silencing
           [Transcription / Chromatin structure and dynamics].
          Length = 279

 Score = 28.4 bits (63), Expect = 6.9
 Identities = 22/115 (19%), Positives = 43/115 (37%), Gaps = 9/115 (7%)

Query: 5   IRNKKPAATNNNSLKSKKRKFSEKPTKSKKTVKPDLDEE----LTSDEDDEGIGFDSQDF 60
           I  +KP  T  N +        E P       +PD+D E    L   +  E    +    
Sbjct: 144 ILAEKPRVTRFNIVWDNDEDNDEAPP-----AQPDVDNEEEERLEESDGREEEEDEEVGS 198

Query: 61  EEHSEVEEDLTEQERKVILAKKYLEEIEKVEKEKLNDSKDKHVLKKLETVVLEQE 115
           + + E   +L E+E +        E++   E E+++  + +    + E + L + 
Sbjct: 199 DSYGEGNRELNEEEEEEAEGSDDGEDVVDYEGERIDKKQGEEEEMEEEVINLFEI 253


>gnl|CDD|227448 COG5118, BDP1, Transcription initiation factor TFIIIB, Bdp1 subunit
           [Transcription].
          Length = 507

 Score = 28.9 bits (64), Expect = 7.3
 Identities = 21/116 (18%), Positives = 41/116 (35%), Gaps = 20/116 (17%)

Query: 16  NSLKSKKRKFSEKPTKSKKTVKPDL-------------DEELTSDEDDEGIGFDSQDFEE 62
           N    K+ +   KP+ +KK  K                 E  TS E DE   +     + 
Sbjct: 149 NLKTFKRHRVLGKPSSAKKPAKISPPTAMTDSLDRNFSSETSTSREADENENYVISKVK- 207

Query: 63  HSEVEEDLTEQERKVILAKKYLEEIEKVEKEKLNDSKDKHVLKKLETVVLEQEGKL 118
             ++ + + + E     AK +++E      E    +    + +  E   + ++ KL
Sbjct: 208 --DIPKKVRDGES----AKYFIDEENFTMAELCKPNFPIQISENFEKSKMAKKAKL 257


>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein.  This family
           consists of various plant development proteins which are
           homologues of floricaula (FLO) and Leafy (LFY) proteins
           which are floral meristem identity proteins. Mutations
           in the sequences of these proteins affect flower and
           leaf development.
          Length = 382

 Score = 28.4 bits (64), Expect = 7.4
 Identities = 23/102 (22%), Positives = 44/102 (43%), Gaps = 24/102 (23%)

Query: 11  AATNNNSLKSKKRKFSEKPTKSKKTVKPDLDEELTSDEDDEGIGFDSQDFE-EH------ 63
               ++S   KK++  ++  K  K ++ D D++   D+D EG G   ++ + EH      
Sbjct: 173 GVPGHSSDSEKKKQRKKQRRKRSKELREDDDDDEDEDDDGEGGGEGGEERQREHPFIVTE 232

Query: 64  -SEVEE----------DLTEQERKVILAKKYLEEIEKVEKEK 94
             EV             L EQ R      ++L +++ + KE+
Sbjct: 233 PGEVARGKKNGLDYLFHLYEQCR------EFLIQVQNIAKER 268


>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein.  The proteins in this family
           are designated YL1. These proteins have been shown to be
           DNA-binding and may be a transcription factor.
          Length = 238

 Score = 28.5 bits (64), Expect = 7.4
 Identities = 14/63 (22%), Positives = 26/63 (41%), Gaps = 3/63 (4%)

Query: 41  DEELTSDEDDEGIGFDSQDFEEHSEVEEDLTEQERKVILAKKYLEE-IEKVEKEKLNDSK 99
           D +    EDDE       + E   E++ +   +++K +  K Y E   +K +K+      
Sbjct: 55  DSDFDDSEDDEPE--SDDEEEGEKELQREERLKKKKRVKTKAYKEPTKKKKKKDPTAAKS 112

Query: 100 DKH 102
            K 
Sbjct: 113 PKA 115


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 29.0 bits (65), Expect = 7.5
 Identities = 16/76 (21%), Positives = 34/76 (44%), Gaps = 2/76 (2%)

Query: 40  LDEELTSDEDDEGIGFDSQDFEEHSEVEEDLTEQERKVILAKKYLEEIEKVEKEKLNDSK 99
           LDE    ++ ++      +  +E     E+L  + +   L +   + +E +E+E     K
Sbjct: 160 LDELFGLEKYEKLSELLKEVIKEAKAKIEEL--EGQLSELLEDIEDLLEALEEELKELKK 217

Query: 100 DKHVLKKLETVVLEQE 115
            + + ++ E   LEQE
Sbjct: 218 LEEIQEEQEEEELEQE 233


>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family.  Emg1 and Nop14 are novel
           proteins whose interaction is required for the
           maturation of the 18S rRNA and for 40S ribosome
           production.
          Length = 809

 Score = 28.8 bits (65), Expect = 7.8
 Identities = 21/84 (25%), Positives = 37/84 (44%), Gaps = 11/84 (13%)

Query: 39  DLDEELTSDEDDEGIGFDSQDFEE-HSEVEEDLTEQERKVILAKKYLEEIEKVEK----- 92
           D D++L  +E+D  +  + +D E+  S+ E+D  E+E +    KK   E  + E      
Sbjct: 337 DDDDDLEEEEEDVDLSDEEEDEEDEDSDDEDDEEEEEEEKEKKKKKSAESTRSELPFTFP 396

Query: 93  -----EKLNDSKDKHVLKKLETVV 111
                E+  +      L+   TVV
Sbjct: 397 CPKSHEEFLELLKGVSLEDQPTVV 420


>gnl|CDD|219461 pfam07543, PGA2, Protein trafficking PGA2.  A Saccharomyces
           cerevisiae member of this family (PGA2) is an ER protein
           which has been implicated in protein trafficking.
          Length = 139

 Score = 27.5 bits (61), Expect = 9.2
 Identities = 20/91 (21%), Positives = 37/91 (40%), Gaps = 18/91 (19%)

Query: 19  KSKKRKFSEKPTKSKKTVKPDL---------DEELTSDEDDEGIGFDSQDFEEHSEVEED 69
             K+R   E+  + K  + P+           EE T DE+DE       DF   S V + 
Sbjct: 46  HEKERAEREEAREKKAKISPNALRGGATAGHGEEDTDDEEDEE------DFATPSAVPQW 99

Query: 70  LTEQERKVILAKKYLEEIEKVEKEKLNDSKD 100
             +  ++    +K + ++ + E++   D  D
Sbjct: 100 GKKARKR---QRKVIRKLLEAEEQLREDQYD 127


>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
          Length = 925

 Score = 28.5 bits (64), Expect = 9.2
 Identities = 16/73 (21%), Positives = 32/73 (43%), Gaps = 3/73 (4%)

Query: 39  DLDEELTSDEDDEGIGFDSQDFEEHSEVEED---LTEQERKVILAKKYLEEIEKVEKEKL 95
           +L +++   E++       Q  E     EE      EQ+      K  L EI++ E+E  
Sbjct: 806 ELIDQIIESEEERLKALRIQREEMLMRPEELELINEEQKNLKQEIKLELSEIQEAEEEIQ 865

Query: 96  NDSKDKHVLKKLE 108
           N +++K+   + +
Sbjct: 866 NINENKNEFVEFK 878


>gnl|CDD|218747 pfam05786, Cnd2, Condensin complex subunit 2.  This family consists
           of several Barren protein homologues from several
           eukaryotic organisms. In Drosophila Barren (barr) is
           required for sister-chromatid segregation in mitosis.
           barr encodes a novel protein that is present in
           proliferating cells and has homologues in yeast and
           human. Mitotic defects in barr embryos become apparent
           during cycle 16, resulting in a loss of PNS and CNS
           neurons. Centromeres move apart at the
           metaphase-anaphase transition and Cyclin B is degraded,
           but sister chromatids remain connected, resulting in
           chromatin bridging. Barren protein localises to
           chromatin throughout mitosis. Colocalisation and
           biochemical experiments indicate that Barren associates
           with Topoisomerase II throughout mitosis and alters the
           activity of Topoisomerase II. It has been suggested that
           this association is required for proper chromosomal
           segregation by facilitating the decatenation of
           chromatids at anaphase. This family forms one of the
           three non-structural maintenance of chromosomes (SMC)
           subunits of the mitotic condensation complex along with
           Cnd1 and Cnd3.
          Length = 719

 Score = 28.6 bits (64), Expect = 9.3
 Identities = 18/71 (25%), Positives = 31/71 (43%), Gaps = 9/71 (12%)

Query: 39  DLDEELTSDEDDEGIGFDSQDFEEHSEVEEDLTEQERKVILAKKYLEEIEKVEKEKLNDS 98
           D D +   +   EG   +  D EE S+ EED  +++ K    +K    +EK        +
Sbjct: 151 DSDNKEAPETGREGDDGEDDD-EEGSDGEEDGAKKKAKKKRQRKPESTLEK--------N 201

Query: 99  KDKHVLKKLET 109
            +   +KKL+ 
Sbjct: 202 FEALNVKKLDL 212


>gnl|CDD|225435 COG2880, COG2880, Uncharacterized protein conserved in archaea
          [Function unknown].
          Length = 67

 Score = 25.9 bits (57), Expect = 9.7
 Identities = 10/38 (26%), Positives = 22/38 (57%)

Query: 59 DFEEHSEVEEDLTEQERKVILAKKYLEEIEKVEKEKLN 96
          D +E  +V+  +  +E+   + K  L+EI+++ +E L 
Sbjct: 23 DLKEGEKVKIVIRVKEKIYEILKGSLKEIKEILEEILE 60


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 28.3 bits (64), Expect = 9.8
 Identities = 18/73 (24%), Positives = 31/73 (42%), Gaps = 1/73 (1%)

Query: 4   FIRNKKPAATNNNSLKSKKRKFSEKPTKSKKTVKPDLDEELTSDEDDEGIGFDSQDFEEH 63
           F+   K A      +  K  K  E+  K KK       ++   +E+++    + +  EE 
Sbjct: 401 FLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKE-KKEEEKEEEE 459

Query: 64  SEVEEDLTEQERK 76
            E EE+  E+E K
Sbjct: 460 EEAEEEKEEEEEK 472


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.314    0.130    0.377 

Gapped
Lambda     K      H
   0.267   0.0701    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 22,406,942
Number of extensions: 2126527
Number of successful extensions: 2304
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2148
Number of HSP's successfully gapped: 173
Length of query: 454
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 354
Effective length of database: 6,502,202
Effective search space: 2301779508
Effective search space used: 2301779508
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 61 (27.3 bits)