RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy6276
         (271 letters)



>gnl|CDD|240217 cd05791, S1_CSL4, S1_CSL4: CSL4, S1-like RNA-binding domain.
           S1-like RNA-binding domains are found in a wide variety
           of RNA-associated proteins. ScCSL4 protein is a subunit
           of the exosome complex. The exosome plays a central role
           in 3' to 5' RNA processing and degradation in eukarytes
           and archaea. Its functions include the removal of
           incorrectly processed RNA and the maintenance of proper
           levels of mRNA, rRNA and a number of small RNA species.
           In S. cerevisiae, the exosome includes nine core
           components, six of which are homologous to bacterial
           RNase PH. These form a hexameric ring structure. The
           other three subunits (RrP4, Rrp40, and Csl4) contain an
           S1 RNA binding domain and are part of the "S1 pore
           structure".
          Length = 92

 Score =  118 bits (299), Expect = 3e-34
 Identities = 43/92 (46%), Positives = 64/92 (69%), Gaps = 1/92 (1%)

Query: 65  VIVPTPGDVVTARIMAVQQHLCKAHVISVGNTALSRTFRALLRRENVRATEIDRIEMYKC 124
           +++P  G +V AR+  +     K  ++ VG   L  +FR ++R+E++RATE D++EMYKC
Sbjct: 1   LVLPKVGSIVIARVTRINPRFAKVDILCVGGRPLKESFRGVIRKEDIRATEKDKVEMYKC 60

Query: 125 YRPGDIILAR-LPLKELHSYQLSTAENELGVV 155
           +RPGDI+ A+ + L +  SY LSTAENELGVV
Sbjct: 61  FRPGDIVRAKVISLGDASSYYLSTAENELGVV 92


>gnl|CDD|192591 pfam10447, EXOSC1, Exosome component EXOSC1/CSL4.  This family of
           proteins are components of the exosome 3'->5'
           exoribonuclease complex. The exosome mediates
           degradation of unstable mRNAs that contain AU-rich
           elements (AREs) within their 3' untranslated regions.
          Length = 72

 Score = 96.5 bits (241), Expect = 8e-26
 Identities = 30/68 (44%), Positives = 49/68 (72%)

Query: 67  VPTPGDVVTARIMAVQQHLCKAHVISVGNTALSRTFRALLRRENVRATEIDRIEMYKCYR 126
           +P  GD+V AR+  V        +++VG TAL  TF+ ++R+++VRATE DR+++Y+C+R
Sbjct: 1   LPKVGDIVLARVTRVTPRQAYCEILAVGGTALKETFKGIIRKQDVRATEKDRVKIYECFR 60

Query: 127 PGDIILAR 134
           PGDI+ A+
Sbjct: 61  PGDIVRAK 68


>gnl|CDD|224021 COG1096, COG1096, Predicted RNA-binding protein (consists of S1
           domain and a Zn-ribbon domain) [Translation, ribosomal
           structure and biogenesis].
          Length = 188

 Score = 79.3 bits (196), Expect = 4e-18
 Identities = 38/139 (27%), Positives = 63/139 (45%), Gaps = 11/139 (7%)

Query: 23  GRGTYVRQGYLYSTLAGTVSINKHVEDNKEVSVIEVSSYKKEVIVPTPGDVVTARIMAVQ 82
           G GTY   G + +   G V  +     N+ +SV      KK   +P  GD+V  R+  V+
Sbjct: 23  GEGTYEEGGEIRAAATGVVRRDD---KNRVISVK---PGKKTPPLPKGGDIVYGRVTDVR 76

Query: 83  QHLCKAHVISV--GNTALSRTFRALLRRENVRATEIDRIEMYKCYRPGDIILARLPLKEL 140
           +      ++ V      L+ +  A +    VR   ++++     +R GDI+ AR+ +   
Sbjct: 77  EQRALVRIVGVEGKERELATSGAADIHVSQVRDGYVEKLS--DAFRIGDIVRARV-ISTG 133

Query: 141 HSYQLSTAENELGVVIALS 159
              QLST  N+LGV+ A  
Sbjct: 134 DPIQLSTKGNDLGVIYARC 152


>gnl|CDD|236547 PRK09521, PRK09521, exosome complex RNA-binding protein Csl4;
           Provisional.
          Length = 189

 Score = 67.3 bits (165), Expect = 1e-13
 Identities = 38/152 (25%), Positives = 75/152 (49%), Gaps = 12/152 (7%)

Query: 9   PGQRICQLDSDIVGGRGTYVRQGYLYSTLAGTVSINKHVEDNKEVSVIEVSSYKKEVIVP 68
           PG  +  ++ + + G GTY   G +Y+++ G V I+     N+++SVI    +KK   + 
Sbjct: 10  PGDYLAVIE-EYLPGEGTYEDNGEVYASVVGKVFIDD---INRKISVI---PFKKTPPLL 62

Query: 69  TPGDVVTARIMAVQQHLCKAHVISV--GNTALSRTFRALLRRENVRATEIDRIEMYKCYR 126
             GD+V  R++ V++      ++S+      L+ +  A +    V    ++   +   ++
Sbjct: 63  KKGDIVYGRVVDVKEQRALVRIVSIEGSERELATSKLAYIHISQVSDGYVE--SLTDAFK 120

Query: 127 PGDIILARLPLKELHSYQLSTAENELGVVIAL 158
            GDI+ A+  +      QLST   +LGV+ A+
Sbjct: 121 IGDIVRAK-VISYTDPLQLSTKGKDLGVIYAM 151


>gnl|CDD|206550 pfam14382, ECR1_N, Exosome complex exonuclease RRP4 N-terminal
          region.  ECR1_N is an N-terminal region of the exosome
          complex exonuclease RRP proteins. It is a G-rich domain
          which structurally is a rudimentary single hybrid fold
          with a permuted topology.
          Length = 39

 Score = 55.9 bits (136), Expect = 5e-11
 Identities = 17/40 (42%), Positives = 26/40 (65%), Gaps = 1/40 (2%)

Query: 6  ICIPGQRICQLDSDIVGGRGTYVRQGYLYSTLAGTVSINK 45
          I +PG+R+    S+ + G GTYV  G +Y+++AG VSI  
Sbjct: 1  IVVPGERLGT-ASEYMPGHGTYVEDGNIYASVAGVVSIVN 39


>gnl|CDD|239901 cd04454, S1_Rrp4_like, S1_Rrp4_like: Rrp4-like, S1-like RNA-binding
           domain. S1-like RNA-binding domains are found in a wide
           variety of RNA-associated proteins. Rrp4 protein, and
           Rrp40 and Csl4 proteins, also represented in this group,
           are subunits of the exosome complex. The exosome plays a
           central role in 3' to 5' RNA processing and degradation
           in eukarytes and archaea. Its functions include the
           removal of incorrectly processed RNA and the maintenance
           of proper levels of mRNA, rRNA and a number of small RNA
           species. In Saccharomyces cerevisiae, the exosome
           includes nine core components, six of which are
           homologous to bacterial RNase PH. These form a hexameric
           ring structure. The other three subunits (RrP4, Rrp40,
           and Csl4) contain an S1 RNA binding domain and are part
           of the "S1 pore structure".
          Length = 82

 Score = 52.9 bits (127), Expect = 2e-09
 Identities = 31/90 (34%), Positives = 48/90 (53%), Gaps = 11/90 (12%)

Query: 67  VPTPGDVVTARIMAVQQHLCKAHVISVGNTALSRTFRALLRRENVRATEIDRIEMYKCYR 126
           +P  GD+V   +  V     K  ++S G   L          E+  ATE D+ E+ K  +
Sbjct: 3   LPDVGDIVIGIVTEVNSRFWKVDILSRGTARL----------EDSSATEKDKKEIRKSLQ 52

Query: 127 PGDIILAR-LPLKELHSYQLSTAENELGVV 155
           PGD+ILA+ + L +  +  L+TA+NELGV+
Sbjct: 53  PGDLILAKVISLGDDMNVLLTTADNELGVI 82


>gnl|CDD|224022 COG1097, RRP4, RNA-binding protein Rrp4 and related proteins
           (contain S1 domain and KH domain) [Translation,
           ribosomal structure and biogenesis].
          Length = 239

 Score = 49.6 bits (119), Expect = 3e-07
 Identities = 33/136 (24%), Positives = 55/136 (40%), Gaps = 22/136 (16%)

Query: 2   SNPPICIPGQRICQLDSDIVGGRGTYVRQGYLYSTLAGTVSINKHVEDNKEVSVIEVSS- 60
           S   I +PG  +   +     G GTY   G +YS++ G + +       K V VI +   
Sbjct: 6   SMRKIVLPGDLVLA-EGSYKLGHGTYFEGGKIYSSVVGLLDV-----KGKLVRVIPLEGR 59

Query: 61  YKKEVIVPTPGDVVTARIMAVQQHLCKAHVISVGNTALSRTFRALLRRENV--RATEIDR 118
           Y     +P  GDVV  +I+ V         + +G+      + ALL   +   R  E   
Sbjct: 60  Y-----IPEVGDVVIGKIIEVGP---SGWKVDIGS-----PYPALLSLSDFLRRKFENAE 106

Query: 119 IEMYKCYRPGDIILAR 134
            ++      GD++ A+
Sbjct: 107 KDLRPFLNVGDLVYAK 122


>gnl|CDD|235233 PRK04163, PRK04163, exosome complex RNA-binding protein Rrp4;
           Provisional.
          Length = 235

 Score = 41.0 bits (97), Expect = 3e-04
 Identities = 33/135 (24%), Positives = 54/135 (40%), Gaps = 29/135 (21%)

Query: 6   ICIPGQRICQLDSDIVGGRGTYVRQGYLYSTLAGTVSINKHVEDNKEVSVIEVSSYKKEV 65
           I +PG  + +   +   GRGTY   G +YST+ G V I      + +V VI +       
Sbjct: 10  IVVPGDLLAE--GEFKAGRGTYKENGKIYSTVVGLVDI-----KDDKVRVIPLEG----K 58

Query: 66  IVPTPGDVVTARIMAVQQHLCKAHVISVGNTALS----RTFRALLRRENV--RATEIDRI 119
            +P  GD+V  +            V  V  +         ++A L    V  R   ++  
Sbjct: 59  YIPKVGDLVIGK------------VTDVTFSGWEVDINSPYKAYLPVSEVLGRPVNVEGT 106

Query: 120 EMYKCYRPGDIILAR 134
           ++ K    GD I+A+
Sbjct: 107 DLRKYLDIGDYIIAK 121


>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein.  Members of this
           protein family have a signal peptide, a strongly
           conserved SH3 domain, a variable region, and then a
           C-terminal hydrophobic transmembrane alpha helix region.
          Length = 198

 Score = 39.6 bits (93), Expect = 6e-04
 Identities = 27/93 (29%), Positives = 50/93 (53%)

Query: 178 LKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELNKKNQSESKL 237
           L+QE  +L EEL+  Q++   L+Q   +L+  L  +      ++ ++  + + + +  +L
Sbjct: 71  LQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIEL 130

Query: 238 SDTIRELNEENIGLKQKVELQEAENQLIQTTKN 270
            +  REL EE   LKQ+ E  EAEN+ +Q  + 
Sbjct: 131 DEENRELREELAELKQENEALEAENERLQENEQ 163


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 38.5 bits (90), Expect = 0.004
 Identities = 35/130 (26%), Positives = 67/130 (51%), Gaps = 3/130 (2%)

Query: 138 KELHSYQLSTAENELGVVIALSEAASLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENE 197
           +EL   +    E E  +     E  SL +  E+L  +IE L++E  +L E+L   ++E E
Sbjct: 793 EELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELE 852

Query: 198 ILRQTTGDLEVRLMQMNEVKGEMEFKIKELNKKNQSESKLSDTIRELNEENIGLKQKVEL 257
            L +   +L+  L ++   K E+E ++KEL ++   + +L + +REL  E   LK+++E 
Sbjct: 853 ELEKELEELKEELEELEAEKEELEDELKELEEE---KEELEEELRELESELAELKEEIEK 909

Query: 258 QEAENQLIQT 267
                + ++ 
Sbjct: 910 LRERLEELEA 919



 Score = 34.7 bits (80), Expect = 0.047
 Identities = 27/97 (27%), Positives = 52/97 (53%), Gaps = 4/97 (4%)

Query: 169 EKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELN 228
           E+L  ++E L++   KL EE+   +++ + L++   +LE    ++ E +  ++   +EL 
Sbjct: 761 EELEEELESLEEALAKLKEEIEELEEKRQALQEELEELE---EELEEAERRLDALERELE 817

Query: 229 KKNQSESKLSDTIRELNEENIGLKQKV-ELQEAENQL 264
              Q   +L   I EL EE   L++K+ EL+E   +L
Sbjct: 818 SLEQRRERLEQEIEELEEEIEELEEKLDELEEELEEL 854



 Score = 31.2 bits (71), Expect = 0.63
 Identities = 28/114 (24%), Positives = 56/114 (49%), Gaps = 11/114 (9%)

Query: 169 EKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELN 228
           E+L  ++  L++E  +L EEL  ++KE E L+    +L   L ++ E   E++ +I+EL 
Sbjct: 242 EELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELE 301

Query: 229 KK-----------NQSESKLSDTIRELNEENIGLKQKVELQEAENQLIQTTKNE 271
            +                +L + + EL E+   LK+++E +E   + ++    E
Sbjct: 302 GEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAE 355



 Score = 30.8 bits (70), Expect = 0.79
 Identities = 32/146 (21%), Positives = 66/146 (45%), Gaps = 7/146 (4%)

Query: 130 IILARLPLKELHSYQLSTAENELGVVIALSEAASLFRSSEKLTNDIEFLKQEKLKLYEEL 189
            +  +  LKEL   +L+  E +L       E  SL      L + +E L+++  +L  +L
Sbjct: 662 SLAQKRELKEL-EEELAELEAQLE--KLEEELKSLKNELRSLEDLLEELRRQLEELERQL 718

Query: 190 SSSQKENEILRQTTGDLEVRL----MQMNEVKGEMEFKIKELNKKNQSESKLSDTIRELN 245
              ++E   L +    L+ RL     ++ E++ E+E   + L +  +    L + + +L 
Sbjct: 719 EELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLK 778

Query: 246 EENIGLKQKVELQEAENQLIQTTKNE 271
           EE   L++K +  + E + ++    E
Sbjct: 779 EEIEELEEKRQALQEELEELEEELEE 804



 Score = 29.7 bits (67), Expect = 2.1
 Identities = 26/95 (27%), Positives = 51/95 (53%), Gaps = 3/95 (3%)

Query: 174 DIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELNKKNQS 233
            ++ L++E  +L EELS  ++E   L +   +LE    ++ E+K E+E   +EL +  + 
Sbjct: 233 KLKELRKELEELEEELSRLEEE---LEELQEELEEAEKEIEELKSELEELREELEELQEE 289

Query: 234 ESKLSDTIRELNEENIGLKQKVELQEAENQLIQTT 268
             +L + I EL  E   L++++E  E E + ++  
Sbjct: 290 LLELKEEIEELEGEISLLRERLEELENELEELEER 324



 Score = 28.5 bits (64), Expect = 5.2
 Identities = 29/121 (23%), Positives = 53/121 (43%), Gaps = 8/121 (6%)

Query: 137 LKELHSYQLSTAENELGVVIALSEAASLFRSSEKLTNDIEFLKQ-------EKLKLYEEL 189
            +     +    E E  +     +   L    E+L  ++E LK+       EK +L +EL
Sbjct: 820 EQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDEL 879

Query: 190 SSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELNKKNQS-ESKLSDTIRELNEEN 248
              ++E E L +   +LE  L ++ E   ++  +++EL  K +  E +L +   EL EE 
Sbjct: 880 KELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEY 939

Query: 249 I 249
            
Sbjct: 940 E 940


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 36.2 bits (84), Expect = 0.017
 Identities = 28/131 (21%), Positives = 62/131 (47%), Gaps = 7/131 (5%)

Query: 144 QLSTAENELGVVIALSEAASLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTT 203
           +L  AE  L      +E   L    E+L  +IE  ++ + KL EE +  ++E E LR   
Sbjct: 316 ELEDAEERL--AKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAEL 373

Query: 204 GDLEVRLM----QMNEVKGEMEFKIKELNKKNQSESKLSDTIRELNEENIGLKQKVE-LQ 258
            +++        ++ + + ++E   +E+N+  +   +L + ++ L+EE   L   +  ++
Sbjct: 374 EEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIE 433

Query: 259 EAENQLIQTTK 269
              N+L +  +
Sbjct: 434 AKINELEEEKE 444



 Score = 32.7 bits (75), Expect = 0.23
 Identities = 26/143 (18%), Positives = 64/143 (44%), Gaps = 4/143 (2%)

Query: 133 ARLPLKELHSYQLSTAENELGVVIALSEAASLFRSSEKLTNDIEFLKQEKLKLYEELSSS 192
           AR+   E   ++L  A N+L   ++ S    +     KL  ++  ++    ++ ++L+  
Sbjct: 765 ARIEELEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRL 824

Query: 193 QKENEILRQTTGDLEVRLM----QMNEVKGEMEFKIKELNKKNQSESKLSDTIRELNEEN 248
             E E L +   +L+ + +    Q+  ++ E+E    +  +  +   +L   +R+L    
Sbjct: 825 TLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRL 884

Query: 249 IGLKQKVELQEAENQLIQTTKNE 271
             LK++ +  EA+ + ++    E
Sbjct: 885 GDLKKERDELEAQLRELERKIEE 907



 Score = 31.6 bits (72), Expect = 0.57
 Identities = 27/120 (22%), Positives = 50/120 (41%), Gaps = 11/120 (9%)

Query: 160 EAASLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGE 219
           E        EKL  +I  LK+E  +L EEL    +E   L      +E ++ ++ E K +
Sbjct: 386 ELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKED 445

Query: 220 MEFKIKE-----------LNKKNQSESKLSDTIRELNEENIGLKQKVELQEAENQLIQTT 268
              +IK+           L+K  Q    L +    + +E   L++++   EA+ +  +  
Sbjct: 446 KALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARASEER 505



 Score = 30.0 bits (68), Expect = 1.5
 Identities = 28/106 (26%), Positives = 53/106 (50%), Gaps = 2/106 (1%)

Query: 157 ALSEAASLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEV 216
              E +   R   ++  +IE L+QE+ KL E L   +++   L Q   +++  L ++   
Sbjct: 707 LSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEAR 766

Query: 217 KGEMEFKIKELNKK-NQSESKLSDT-IRELNEENIGLKQKVELQEA 260
             E+E  + +L +  N  E++LS + I E+  E   L+++V   EA
Sbjct: 767 IEELEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEA 812



 Score = 28.9 bits (65), Expect = 3.9
 Identities = 28/120 (23%), Positives = 58/120 (48%), Gaps = 8/120 (6%)

Query: 153 GVVIALSEAASLFRSSEK---LTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVR 209
           G++ + SE A L R  E+   L  ++  L+ E  ++   L    +E     +  G++E  
Sbjct: 665 GILFSRSEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKE 724

Query: 210 LMQMNEVKGEMEFKIKELNKKNQS-ESKLSDTIREL--NEENIGLKQ--KVELQEAENQL 264
           + Q+ + + +++ +++EL +   S E ++ +   EL   E  I   +    +L+EA N L
Sbjct: 725 IEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDL 784



 Score = 28.1 bits (63), Expect = 5.7
 Identities = 30/125 (24%), Positives = 47/125 (37%), Gaps = 22/125 (17%)

Query: 169 EKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQ----MNEVKGEMEFKI 224
           +KLT +   LK+E   L  EL    KE    R    D   +L +    +NE+K E++   
Sbjct: 353 DKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQ 412

Query: 225 KELN------------------KKNQSESKLSDTIRELNEENIGLKQKVELQEAENQLIQ 266
           +EL                   K N+ E +  D   E+ ++   L+Q         Q + 
Sbjct: 413 EELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELY 472

Query: 267 TTKNE 271
             K E
Sbjct: 473 DLKEE 477


>gnl|CDD|227141 COG4804, COG4804, Predicted nuclease of restriction
           endonuclease-like fold [General    function prediction
           only].
          Length = 159

 Score = 33.6 bits (77), Expect = 0.049
 Identities = 22/109 (20%), Positives = 39/109 (35%), Gaps = 10/109 (9%)

Query: 163 SLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEF 222
           S ++    + N  + L  E   L    S  Q E    RQ    L  R +         + 
Sbjct: 2   SHYKLLLSVKNLDKRLFYEIETLRNRWSVRQLE----RQIASQLYERTL-----LSHDKE 52

Query: 223 KIKELNKKNQSESKLSDTIRE-LNEENIGLKQKVELQEAENQLIQTTKN 270
           K+K+ + + + +      I +    E + LK+     + E  LIQ  + 
Sbjct: 53  KVKQQSLQIEDDLTPEQIIMDPYVLEFLSLKRNYLEFDLEYALIQILQQ 101


>gnl|CDD|219947 pfam08639, SLD3, DNA replication regulator SLD3.  The SLD3 DNA
           replication regulator is required for loading and
           maintenance of Cdc45 on chromatin during DNA
           replication.
          Length = 437

 Score = 32.9 bits (75), Expect = 0.16
 Identities = 25/111 (22%), Positives = 49/111 (44%), Gaps = 2/111 (1%)

Query: 159 SEAASLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKG 218
           S        S   ++ IEFLK+E  +      SS+  +++       L+ R + +++   
Sbjct: 289 SRRTPSLLRSRTNSSLIEFLKRESSENLLPSLSSRTSSDL--LKNKRLQKRQVDLSDSSR 346

Query: 219 EMEFKIKELNKKNQSESKLSDTIRELNEENIGLKQKVELQEAENQLIQTTK 269
           + E K+K+    N+ + +L   I  L + N  L  K  ++ AE +  Q  +
Sbjct: 347 QHEEKLKKKQMLNEQKKELKRAISALKKSNRELSSKDIVETAEKRSSQFGQ 397


>gnl|CDD|237970 PRK15453, PRK15453, phosphoribulokinase; Provisional.
          Length = 290

 Score = 32.5 bits (75), Expect = 0.18
 Identities = 14/46 (30%), Positives = 20/46 (43%), Gaps = 14/46 (30%)

Query: 90  VISV------GNTALSRTFRALLRRENVRATEI--------DRIEM 121
           +I+V      G T + R F  + RREN+ A  +         R EM
Sbjct: 7   IIAVTGSSGAGTTTVKRAFEKIFRRENINAAVVEGDSFHRYTRPEM 52


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 33.1 bits (76), Expect = 0.19
 Identities = 31/134 (23%), Positives = 62/134 (46%), Gaps = 7/134 (5%)

Query: 138 KELHSYQLSTAENELGVVIALSEAASLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENE 197
           KEL++     +  E    I     A+L R  E+L   +E L+ +  +L EEL+  +++ E
Sbjct: 288 KELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLE 347

Query: 198 ILRQTTGDLEVRLMQMNEVKGEMEFKIKELNKK-NQSESKLSDTIRELNEENIGLKQKVE 256
            L++    LE  L ++     E+E +++EL ++     SK++    ++   N       E
Sbjct: 348 ELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNN------E 401

Query: 257 LQEAENQLIQTTKN 270
           ++  E +L +    
Sbjct: 402 IERLEARLERLEDR 415



 Score = 30.4 bits (69), Expect = 1.0
 Identities = 30/110 (27%), Positives = 50/110 (45%), Gaps = 11/110 (10%)

Query: 159 SEAASLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLM--QMNEV 216
           S+ A L      L N+IE L+         L   +   E L+Q   +L  +L   ++ E+
Sbjct: 386 SKVAQLELQIASLNNEIERLE-------ARLERLEDRRERLQQEIEELLKKLEEAELKEL 438

Query: 217 KGEMEFKIKELNKKNQSESKLSDTIRELNEENIGLKQKVELQEAENQLIQ 266
           + E+E   +EL +  +   +L + + EL EE    +Q   L  AE +L Q
Sbjct: 439 QAELEELEEELEELQEELERLEEALEELREELEEAEQ--ALDAAERELAQ 486



 Score = 30.0 bits (68), Expect = 1.6
 Identities = 34/128 (26%), Positives = 55/128 (42%), Gaps = 14/128 (10%)

Query: 138 KELHSYQLSTAENELGVVIALSEAASLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENE 197
           +EL   Q    E E        E   L    ++L   +E L+ E  +L EE+   QKE  
Sbjct: 239 EELEELQEELKEAE-------EELEELTAELQELEEKLEELRLEVSELEEEIEELQKELY 291

Query: 198 ILRQTTGDLEVRLMQMNEVKGEMEFKIKELNKK-NQSESKLSDTIRELNEENIGLKQKVE 256
            L      LE +   + E    +E +++EL  +  + ESKL +   EL E       + +
Sbjct: 292 ALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAE------LEEK 345

Query: 257 LQEAENQL 264
           L+E + +L
Sbjct: 346 LEELKEEL 353



 Score = 29.6 bits (67), Expect = 1.9
 Identities = 28/105 (26%), Positives = 56/105 (53%), Gaps = 4/105 (3%)

Query: 171 LTNDIEFLKQEKLKLYEELSSSQKENEI----LRQTTGDLEVRLMQMNEVKGEMEFKIKE 226
           L   +E L++E  +L EEL  +++E E     L++    LE   ++++E++ E+E   KE
Sbjct: 230 LVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKE 289

Query: 227 LNKKNQSESKLSDTIRELNEENIGLKQKVELQEAENQLIQTTKNE 271
           L       S+L    + L E    L++++E  EA+ + +++  +E
Sbjct: 290 LYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDE 334



 Score = 29.3 bits (66), Expect = 2.8
 Identities = 28/133 (21%), Positives = 49/133 (36%), Gaps = 6/133 (4%)

Query: 138 KELHSYQLSTAENELGVVIALSEAASLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENE 197
            EL       A     +       A+  R  E L   IE L ++   L  E+   ++  E
Sbjct: 810 AELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIE 869

Query: 198 ILRQTTGDLEVRLMQMNE----VKGEMEFKIKELNKKNQSESKLSDTIRELNEENIGLKQ 253
            L      L      + E    ++ E+E   +EL +     S+L   + EL E+   L+ 
Sbjct: 870 ELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLEL 929

Query: 254 KVELQEAENQLIQ 266
           +  L+  E ++  
Sbjct: 930 R--LEGLEVRIDN 940



 Score = 28.9 bits (65), Expect = 3.4
 Identities = 23/106 (21%), Positives = 49/106 (46%), Gaps = 3/106 (2%)

Query: 166 RSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIK 225
           R  E+L   IE L+++  +L + L+  +KE E L +    L   L +++     +   + 
Sbjct: 677 REIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLA 736

Query: 226 ELNKKNQSESKLSDTIRELNEENIGLKQKVELQEAENQLIQTTKNE 271
            L  + +   +L + I +L++E   L+ ++E  E   +  +    E
Sbjct: 737 RLEAEVE---QLEERIAQLSKELTELEAEIEELEERLEEAEEELAE 779



 Score = 28.1 bits (63), Expect = 6.2
 Identities = 22/115 (19%), Positives = 50/115 (43%), Gaps = 3/115 (2%)

Query: 157 ALSEAASLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEV 216
              E  +L  + ++L  ++  L +E   L E L S ++      +   DLE ++ +++E 
Sbjct: 794 LKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSE- 852

Query: 217 KGEMEFKIKELNKKNQSESKLSDTIRELNEENIGLKQKVELQEAENQLIQTTKNE 271
             ++E    E+ +  +   +L   +  L  E   L++ + L  +E + +     E
Sbjct: 853 --DIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRE 905


>gnl|CDD|219934 pfam08614, ATG16, Autophagy protein 16 (ATG16).  Autophagy is a
           ubiquitous intracellular degradation system for
           eukaryotic cells. During autophagy, cytoplasmic
           components are enclosed in autophagosomes and delivered
           to lysosomes/vacuoles. ATG16 (also known as Apg16) has
           been shown to be bind to Apg5 and is required for the
           function of the Apg12p-Apg5p conjugate in the yeast
           autophagy pathway.
          Length = 194

 Score = 32.1 bits (73), Expect = 0.19
 Identities = 29/114 (25%), Positives = 45/114 (39%), Gaps = 3/114 (2%)

Query: 157 ALSEAASLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEV 216
             S ++S           I  ++Q+  KL EEL+   K+   L Q    L   L Q+   
Sbjct: 51  QSSHSSSPSADGPGSDAAIAEMEQKLAKLREELTELHKKRGELAQRLLLLNDELEQLRRE 110

Query: 217 KGEMEFKIKELNKKNQSESKLSDTIRELNEENIGLKQKVELQEAENQLIQTTKN 270
             ++E  I EL  +  S   L   IR+L EE    ++  E  + E   +    N
Sbjct: 111 IQQLEKTIAELRSEITS---LETEIRDLREELQEKEKDNETLQDELISLNIELN 161



 Score = 29.4 bits (66), Expect = 1.4
 Identities = 26/104 (25%), Positives = 42/104 (40%), Gaps = 10/104 (9%)

Query: 160 EAASLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGE 219
           E   L +   +L   +  L  E  +L  E+   +K    LR     LE  +  + E    
Sbjct: 82  ELTELHKKRGELAQRLLLLNDELEQLRREIQQLEKTIAELRSEITSLETEIRDLREE--- 138

Query: 220 MEFKIKELNKKNQSESKLSDTIRELNEENIGLKQKVELQEAENQ 263
               ++E  K N+    L D +  LN E   L++K+   + ENQ
Sbjct: 139 ----LQEKEKDNE---TLQDELISLNIELNALEEKLRKLQKENQ 175


>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
           partitioning / Cytoskeleton].
          Length = 373

 Score = 31.9 bits (73), Expect = 0.32
 Identities = 19/101 (18%), Positives = 45/101 (44%), Gaps = 8/101 (7%)

Query: 172 TNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELNKKN 231
           T ++ +      KL    +S +   + + +   + E R ++    +     KI+E  K+ 
Sbjct: 279 TENLLYENYRTEKLSGLKNSGEPSLKEIHEARLNEEERELKKKFTE-----KIREKEKRL 333

Query: 232 -QSESKLSDTIRELNEENIGLKQKVELQEAENQLIQTTKNE 271
            + E  L +  +ELN +    + + +L++ E +L +   N+
Sbjct: 334 EELEQNLIEERKELNSKL--EEIQKKLEDLEKRLEKLKSNK 372


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 31.8 bits (73), Expect = 0.33
 Identities = 30/107 (28%), Positives = 50/107 (46%), Gaps = 17/107 (15%)

Query: 179 KQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEV-KGEMEFKIKELNKKNQSESKL 237
           K+E  KL  EL   ++E +  R     LE RL+Q  E    +ME   K+     + E +L
Sbjct: 57  KEEVHKLRAEL---ERELKERRNELQRLERRLLQREETLDRKMESLDKKEENLEKKEKEL 113

Query: 238 SDTIRELNE-----ENIGLKQKVELQ--------EAENQLIQTTKNE 271
           S+  + L+E     E +  +Q+ EL+        EA+  L++  + E
Sbjct: 114 SNKEKNLDEKEEELEELIAEQREELERISGLTQEEAKEILLEEVEEE 160


>gnl|CDD|240215 cd05789, S1_Rrp4, S1_Rrp4: Rrp4 S1-like RNA-binding domain. S1-like
           RNA-binding domains are found in a wide variety of
           RNA-associated proteins. Rrp4 protein is a subunit of
           the exosome complex. The exosome plays a central role in
           3' to 5' RNA processing and degradation in eukarytes and
           archaea. Its functions include the removal of
           incorrectly processed RNA and the maintenance of proper
           levels of mRNA, rRNA and a number of small RNA species.
           In Saccharomyces cerevisiae, the exosome includes nine
           core components, six of which are homologous to
           bacterial RNase PH. These form a hexameric ring
           structure. The other three subunits (RrP4, Rrp40, and
           Csl4) contain an S1 RNA binding domain and are part of
           the "S1 pore structure".
          Length = 86

 Score = 29.8 bits (68), Expect = 0.38
 Identities = 18/70 (25%), Positives = 28/70 (40%), Gaps = 10/70 (14%)

Query: 67  VPTPGDVVTARIMAVQQHLCKAHVISVGNTALSRTFRALLRRENV--RATEIDRIEMYKC 124
           +P  GDVV  R+  V     K   + + +        A+L    V    T+ D + M   
Sbjct: 3   IPEVGDVVIGRVTEVGF---KRWKVDINSPYD-----AVLPLSEVNLPRTDEDELNMRSY 54

Query: 125 YRPGDIILAR 134
              GD+I+A 
Sbjct: 55  LDEGDLIVAE 64


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 31.6 bits (72), Expect = 0.45
 Identities = 27/94 (28%), Positives = 48/94 (51%)

Query: 163 SLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEF 222
              + +E +   I+  ++E  ++  E++    E   LR+    LE  + ++ E+K E+E 
Sbjct: 183 KFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEE 242

Query: 223 KIKELNKKNQSESKLSDTIRELNEENIGLKQKVE 256
             KEL     S+ KL + IREL E    LK+++E
Sbjct: 243 LEKELESLEGSKRKLEEKIRELEERIEELKKEIE 276



 Score = 28.9 bits (65), Expect = 3.7
 Identities = 22/101 (21%), Positives = 51/101 (50%), Gaps = 1/101 (0%)

Query: 169 EKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELN 228
           E+L  +IE L+ EK+K  +EL    +E   L +   +    L ++ +    +E +I  + 
Sbjct: 269 EELKKEIEELE-EKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIE 327

Query: 229 KKNQSESKLSDTIRELNEENIGLKQKVELQEAENQLIQTTK 269
           ++ +   +  + + EL ++   L++++E  E  ++L +  K
Sbjct: 328 ERIKELEEKEERLEELKKKLKELEKRLEELEERHELYEEAK 368



 Score = 27.7 bits (62), Expect = 7.1
 Identities = 30/125 (24%), Positives = 61/125 (48%), Gaps = 9/125 (7%)

Query: 144 QLSTAENELGVVIA-LSEAASLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQT 202
            +   E EL  V+  ++E +S      +    +E   +E  +L EE+   +KE E L  +
Sbjct: 194 LIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGS 253

Query: 203 TGDLEVRLMQ----MNEVKG---EMEFKIKELNKKNQSESKLSDTIRELNEENIGLKQKV 255
              LE ++ +    + E+K    E+E K+KEL K+ + +++    + E  EE +   +++
Sbjct: 254 KRKLEEKIRELEERIEELKKEIEELEEKVKEL-KELKEKAEEYIKLSEFYEEYLDELREI 312

Query: 256 ELQEA 260
           E + +
Sbjct: 313 EKRLS 317


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 30.9 bits (71), Expect = 0.81
 Identities = 35/156 (22%), Positives = 69/156 (44%), Gaps = 21/156 (13%)

Query: 130 IILARLPLKELHSYQLSTAENELGVVIALSEAASLFRSSEKLTNDIEFLKQEKLKLYEEL 189
            ++     K++   ++  AE E   ++  ++  +     E L    E + + + +  +EL
Sbjct: 18  AVIGYFVRKKIAEAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKEL 77

Query: 190 SSSQKENEI------LRQTTGDLEVRLMQMNEVKGEMEFKIKELNKKNQSESKLSDTIRE 243
              ++ NE+      L Q   +L+ +L  + + + E+E K KEL +K Q   K  + + E
Sbjct: 78  R--ERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEE 135

Query: 244 LNEENIGLKQKVELQ--------EAENQLIQTTKNE 271
           L EE     Q  EL+        EA+  L++  + E
Sbjct: 136 LIEE-----QLQELERISGLTAEEAKEILLEKVEEE 166


>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6.  In yeast, 15 Apg proteins
           coordinate the formation of autophagosomes. Autophagy is
           a bulk degradation process induced by starvation in
           eukaryotic cells. Apg6/Vps30p has two distinct functions
           in the autophagic process, either associated with the
           membrane or in a retrieval step of the carboxypeptidase
           Y sorting pathway.
          Length = 356

 Score = 30.6 bits (69), Expect = 0.81
 Identities = 22/97 (22%), Positives = 44/97 (45%), Gaps = 12/97 (12%)

Query: 169 EKLTNDIEFLKQEK--LKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKE 226
           E+L +++   ++E+   K Y     SQ           + E    +++E+K E E  + E
Sbjct: 32  ERLDSELRDAEKERDTYKQYLSKLESQNVEI------SNYEALDSELDELKKEEERLLDE 85

Query: 227 LNKKNQSESKLSDTIRELNEENIGLKQKVELQEAENQ 263
           L +  + +  L   + EL EE    K+++E +E +  
Sbjct: 86  LEELEKEDDDLDGELVELQEE----KEQLENEELQYL 118


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 30.8 bits (70), Expect = 0.85
 Identities = 20/98 (20%), Positives = 40/98 (40%), Gaps = 11/98 (11%)

Query: 166 RSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIK 225
           +S EKL   I+ LK +  +L + +   +  +++ R+     E           +++ ++K
Sbjct: 277 KSMEKLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFE-------RDNEKLDAEVK 329

Query: 226 ELNKKNQSESKLSDTIRELNEE----NIGLKQKVELQE 259
           E  K+ + E K    I  L E      +    K E + 
Sbjct: 330 EKKKEKKKEEKKKKQIERLEERIEKLEVQATDKEENKT 367


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 30.9 bits (70), Expect = 0.88
 Identities = 29/119 (24%), Positives = 49/119 (41%), Gaps = 13/119 (10%)

Query: 150 NELGVVIALSEAA-----SLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTG 204
            E+ V+I    A            EKL  + E LK+E  +  EEL   ++  +       
Sbjct: 511 EEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKK----LEL 566

Query: 205 DLEVRLMQMNEVKGEMEFKIKELNKKN---QSESKLSDTIRELNEENIGLKQKVELQEA 260
           + E +   +  +K E+E  I+EL +K      E K  + + +L E    + QK    +A
Sbjct: 567 EKEAQ-EALKALKKEVESIIRELKEKKIHKAKEIKSIEDLVKLKETKQKIPQKPTNFQA 624


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 30.5 bits (69), Expect = 1.1
 Identities = 37/126 (29%), Positives = 59/126 (46%), Gaps = 7/126 (5%)

Query: 144 QLSTAENELGVVIALSEAA--SLFRSSEKLTNDIEFLKQEKLKL-YEELSSSQKENEILR 200
           +L   E EL  ++ L EA    L    EKL N +E L++ K KL  ++L    ++ E   
Sbjct: 509 ELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRL 568

Query: 201 QTTGDLEVRLMQMNEVKGEMEFKIKELNKKNQSESKLSDTIRELNEENIGLKQKVELQEA 260
           Q   +L   L  +   K E    ++EL ++ +   K    + E   +   L Q +EL EA
Sbjct: 569 QELKELLEELRLLRTRKEE----LEELRERLKELKKKLKELEERLSQLEELLQSLELSEA 624

Query: 261 ENQLIQ 266
           EN+L +
Sbjct: 625 ENELEE 630


>gnl|CDD|220625 pfam10191, COG7, Golgi complex component 7 (COG7).  COG7 is a
           component of the conserved oligomeric Golgi complex
           which is required for normal Golgi morphology and
           localisation. Mutation in COG7 causes a congenital
           disorder of glycosylation.
          Length = 768

 Score = 30.5 bits (69), Expect = 1.1
 Identities = 18/79 (22%), Positives = 37/79 (46%), Gaps = 1/79 (1%)

Query: 170 KLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELNK 229
           ++  D+E L+QE   L E++   +++ +   Q TG     L +++ VK  M+   + L +
Sbjct: 67  RVLRDVEALRQEASFLKEQMILVKQDIKKFEQDTGQSMQVLERLDTVKQRMQAAKESL-Q 125

Query: 230 KNQSESKLSDTIRELNEEN 248
           +      LS  + +  E  
Sbjct: 126 EADGWGTLSAELEDTFETQ 144


>gnl|CDD|215061 PLN00113, PLN00113, leucine-rich repeat receptor-like protein
           kinase; Provisional.
          Length = 968

 Score = 30.2 bits (68), Expect = 1.3
 Identities = 20/62 (32%), Positives = 32/62 (51%), Gaps = 6/62 (9%)

Query: 189 LSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELNKKNQ-SESKLSDTIRELNEE 247
           LSS ++EN I R   G       +   +K  M+F +KE+N  N    S+++D + +L   
Sbjct: 689 LSSLKEENVISRGKKG----ASYKGKSIKNGMQFVVKEINDVNSIPSSEIAD-MGKLQHP 743

Query: 248 NI 249
           NI
Sbjct: 744 NI 745


>gnl|CDD|218661 pfam05622, HOOK, HOOK protein.  This family consists of several
           HOOK1, 2 and 3 proteins from different eukaryotic
           organisms. The different members of the human gene
           family are HOOK1, HOOK2 and HOOK3. Different domains
           have been identified in the three human HOOK proteins,
           and it was demonstrated that the highly conserved
           NH2-domain mediates attachment to microtubules, whereas
           the central coiled-coil motif mediates homodimerisation
           and the more divergent C-terminal domains are involved
           in binding to specific organelles (organelle-binding
           domains). It has been demonstrated that endogenous HOOK3
           binds to Golgi membranes, whereas both HOOK1 and HOOK2
           are localised to discrete but unidentified cellular
           structures. In mice the Hook1 gene is predominantly
           expressed in the testis. Hook1 function is necessary for
           the correct positioning of microtubular structures
           within the haploid germ cell. Disruption of Hook1
           function in mice causes abnormal sperm head shape and
           fragile attachment of the flagellum to the sperm head.
          Length = 713

 Score = 30.2 bits (68), Expect = 1.3
 Identities = 28/137 (20%), Positives = 66/137 (48%), Gaps = 3/137 (2%)

Query: 138 KELHSYQLSTAENELGVVI--ALSEAASLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKE 195
           K+ +  Q+ T E  +  V+  A+ E  S  + S         L Q+  K  E+L  +Q+E
Sbjct: 128 KQEYIQQIMTLEESVQHVVMTAIQELMSKEQGSSPSRESAGNLDQQLKKALEDLKEAQEE 187

Query: 196 NEILRQTTGDLEVRLMQMNEVKGEMEFKIKELNKK-NQSESKLSDTIRELNEENIGLKQK 254
            + L Q   +L+ +++ + E K  ++ + ++L ++  Q E       +  +++   L+ +
Sbjct: 188 KDELAQRCHELDKQVLLLQEEKNSLQQENEKLQERLAQLEGSSLGPNQLGSKKYNLLQSQ 247

Query: 255 VELQEAENQLIQTTKNE 271
           +E  + EN  ++  +++
Sbjct: 248 LEQLQEENFRLEAARDD 264


>gnl|CDD|238571 cd01166, KdgK, 2-keto-3-deoxygluconate kinase (KdgK) phosphorylates
           2-keto-3-deoxygluconate (KDG) to form
           2-keto-3-deoxy-6-phosphogluconate (KDGP). KDG is the
           common intermediate product, that allows organisms to
           channel D-glucuronate and/or D-galacturinate into the
           glycolysis and therefore use polymers, like pectin and
           xylan as carbon sources.
          Length = 294

 Score = 29.9 bits (68), Expect = 1.3
 Identities = 11/47 (23%), Positives = 20/47 (42%), Gaps = 3/47 (6%)

Query: 75  TARIMAVQQHLCKAHVIS-VGNTALSRTFRALLRRENVRATEIDRIE 120
            A  +A  +   +  +++ VG+    R   A LRRE V  + +    
Sbjct: 37  VAVGLA--RLGHRVALVTAVGDDPFGRFILAELRREGVDTSHVRVDP 81


>gnl|CDD|185235 PRK15335, PRK15335, type III secretion system protein SpaM;
           Provisional.
          Length = 147

 Score = 29.3 bits (65), Expect = 1.4
 Identities = 19/81 (23%), Positives = 49/81 (60%), Gaps = 2/81 (2%)

Query: 187 EELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELNKKNQSESKLSDTIRE--L 244
           EE+ +  ++  I+R+   DLE++++Q+ E + E+E K +E  KK++   +     +   +
Sbjct: 66  EEIYTLLRKQSIVRRQIKDLELQIIQIQEKRSELEKKREEFQKKSKYWLRKEGNYQRWII 125

Query: 245 NEENIGLKQKVELQEAENQLI 265
            ++   ++++++ +EAE++ I
Sbjct: 126 RQKRFYIQREIQQEEAESEEI 146


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 29.9 bits (67), Expect = 1.6
 Identities = 24/134 (17%), Positives = 51/134 (38%), Gaps = 2/134 (1%)

Query: 138 KELHSYQLSTAENELGVVIALSEAASLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENE 197
           K L  YQL          +   +   L      L  ++   +QE+++  ++    +KE E
Sbjct: 209 KALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQEL--EKEEE 266

Query: 198 ILRQTTGDLEVRLMQMNEVKGEMEFKIKELNKKNQSESKLSDTIRELNEENIGLKQKVEL 257
           IL Q   + +    +    + E++   KE  +      KL     +  E+    +++++ 
Sbjct: 267 ILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRKVDDEEKLKESEKELKK 326

Query: 258 QEAENQLIQTTKNE 271
            E E +  +    E
Sbjct: 327 LEKELKKEKEEIEE 340


>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein.  This domain is found
           in cell division proteins which are required for
           kinetochore-spindle association.
          Length = 312

 Score = 29.6 bits (67), Expect = 1.7
 Identities = 26/113 (23%), Positives = 49/113 (43%), Gaps = 10/113 (8%)

Query: 162 ASLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEV---RLMQ-----M 213
             L    + L  ++E L   K KL +   + ++E   L+Q   +LE      +      +
Sbjct: 154 EGLKEDYKLLMKELELLNSIKPKLRDRKDALEEELRQLKQLEDELEDCDPTELDRAKEKL 213

Query: 214 NEVKGEMEFKIKELNKKNQSESKLSDTIRELNEENIGLKQKVELQEAENQLIQ 266
            ++  E+  K+K+L +  +   +L   I +L  +   L    E+ EAE +L Q
Sbjct: 214 KKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNT--EIAEAEKKLEQ 264


>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
           potential nuclease [General function prediction only].
          Length = 290

 Score = 29.5 bits (66), Expect = 1.8
 Identities = 24/84 (28%), Positives = 40/84 (47%), Gaps = 3/84 (3%)

Query: 178 LKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELNKKNQSESKL 237
           LK++  +L E+L   QKE E L +   +LE    ++ E    +E    E ++  +   KL
Sbjct: 133 LKEDYEELKEKLEELQKEKEELLKELEELEAEYEEVQERLKRLE---VENSRLEEMLKKL 189

Query: 238 SDTIRELNEENIGLKQKVELQEAE 261
              + +L +    L+  VEL E E
Sbjct: 190 PGEVYDLKKRWDELEPGVELPEEE 213



 Score = 28.7 bits (64), Expect = 3.1
 Identities = 26/101 (25%), Positives = 47/101 (46%), Gaps = 3/101 (2%)

Query: 160 EAASLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGE 219
           E   L    E+L   +E L++EK +L +EL   + E E +++    LEV   ++ E+  +
Sbjct: 129 EYMDLKEDYEELKEKLEELQKEKEELLKELEELEAEYEEVQERLKRLEVENSRLEEMLKK 188

Query: 220 MEFKIKELNKKNQSESKLSDTIRELNEENIGLKQKVELQEA 260
           +  ++ +L K+     +L   +    EE I    K  L  A
Sbjct: 189 LPGEVYDLKKR---WDELEPGVELPEEELISDLVKETLNLA 226


>gnl|CDD|113356 pfam04582, Reo_sigmaC, Reovirus sigma C capsid protein. 
          Length = 326

 Score = 29.5 bits (66), Expect = 1.8
 Identities = 19/91 (20%), Positives = 34/91 (37%), Gaps = 10/91 (10%)

Query: 185 LYEELSSSQKENEILRQTTG-------DLEVRLMQMNEVKGEMEFKIKELNKKNQSESKL 237
           +YE L++ +     L ++         DL   L  M     + +  +  L    Q+   L
Sbjct: 33  IYERLTNLESATASLSESVSTISSRLSDLSADLQDMTTSLDDAKATLNGLITTVQA---L 89

Query: 238 SDTIRELNEENIGLKQKVELQEAENQLIQTT 268
             ++  L+ E   L   V   +A    +QTT
Sbjct: 90  QTSVDSLSSELADLSLTVSAHDAAISDLQTT 120


>gnl|CDD|237258 PRK12903, secA, preprotein translocase subunit SecA; Reviewed.
          Length = 925

 Score = 29.6 bits (67), Expect = 2.0
 Identities = 19/91 (20%), Positives = 39/91 (42%), Gaps = 6/91 (6%)

Query: 181 EKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELNKKNQSESKLSDT 240
           EK+ +  E+ S    N  +     +L  ++++  E + +     +E       E +L   
Sbjct: 782 EKILIITEILSDGINNSDINDRPQELIDQIIESEEERLKALRIQREEMLMRPEELEL--- 838

Query: 241 IRELNEENIGLKQKVELQEAENQLIQTTKNE 271
              +NEE   LKQ+++L+ +E Q  +     
Sbjct: 839 ---INEEQKNLKQEIKLELSEIQEAEEEIQN 866


>gnl|CDD|227306 COG4972, PilM, Tfp pilus assembly protein, ATPase PilM [Cell
           motility and secretion / Intracellular trafficking and
           secretion].
          Length = 354

 Score = 29.3 bits (66), Expect = 2.2
 Identities = 14/51 (27%), Positives = 24/51 (47%)

Query: 65  VIVPTPGDVVTARIMAVQQHLCKAHVISVGNTALSRTFRALLRRENVRATE 115
           ++V T  +VV +RI A +    +  V+ V + AL R +R L  +       
Sbjct: 142 LLVATRKEVVESRIDAFELAGLEPKVLDVESFALLRAYRLLASQFGPEEAA 192


>gnl|CDD|111032 pfam02090, SPAM, Salmonella surface presentation of antigen gene
           type M protein. 
          Length = 147

 Score = 28.5 bits (63), Expect = 2.5
 Identities = 15/46 (32%), Positives = 30/46 (65%)

Query: 187 EELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELNKKNQ 232
           EE+ S  ++  I+R+   DLE+++ Q+ E + E+E K +E  +K++
Sbjct: 66  EEIYSLLRKQSIVRRQIKDLELQITQIQEKRCELEKKREEFQEKSK 111


>gnl|CDD|191851 pfam07798, DUF1640, Protein of unknown function (DUF1640).  This
           family consists of sequences derived from hypothetical
           eukaryotic proteins. A region approximately 100 residues
           in length is featured.
          Length = 177

 Score = 28.7 bits (65), Expect = 2.5
 Identities = 31/108 (28%), Positives = 51/108 (47%), Gaps = 24/108 (22%)

Query: 159 SEAASLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKG 218
           SE A+L   +EKL ND+E LK    +L +E++                 VRL  +N  KG
Sbjct: 73  SEFANLQAENEKLKNDLEKLKN---RLRDEITKVTA------------GVRL-DLNLEKG 116

Query: 219 EMEFKIKELNKKNQSESKLSDTIRELNEENIGLKQKVELQEAENQLIQ 266
               +I+E     + E K+ +T  +++ E  GLK ++E    +   I+
Sbjct: 117 ----RIREE--SAEQELKIKETDTKIDTEIAGLKTQLE--SVKLDTIK 156


>gnl|CDD|238987 cd02029, PRK_like, Phosphoribulokinase-like (PRK-like) is a family
           of proteins similar to phosphoribulokinase (PRK), the
           enzyme involved in the Benson-Calvin cycle in
           chloroplasts or photosynthetic prokaryotes. PRK
           catalyzes the phosphorylation of D-ribulose 5-phosphate
           to form D-ribulose 1, 5-biphosphate, using ATP and NADPH
           produced by the primary reactions of photosynthesis.
          Length = 277

 Score = 29.0 bits (65), Expect = 2.6
 Identities = 9/44 (20%), Positives = 15/44 (34%), Gaps = 8/44 (18%)

Query: 94  GNTALSRTFRALLRRENVRATEI--------DRIEMYKCYRPGD 129
           G T + R F  +  RE +    +        +R+EM        
Sbjct: 11  GTTTVKRAFEHIFAREGIHPAVVEGDSFHRYERMEMKMAIAEAL 54


>gnl|CDD|234309 TIGR03683, A-tRNA_syn_arch, alanyl-tRNA synthetase.  This family of
           alanyl-tRNA synthetases is limited to the archaea, and
           is a subset of those sequences identified by the model
           pfam07973 covering the second additional domain (SAD) of
           alanyl and threonyl tRNA synthetases.
          Length = 902

 Score = 29.2 bits (66), Expect = 3.0
 Identities = 16/81 (19%), Positives = 38/81 (46%), Gaps = 1/81 (1%)

Query: 164 LFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVK-GEMEF 222
           L  SS+ L    E L +   + +EE    +KE E L++   +L++  +     + G +E 
Sbjct: 749 LRESSDILKVPPEQLPETVKRFFEEWKEQRKEIERLKKKLAELKIYELISEAERIGGVEV 808

Query: 223 KIKELNKKNQSESKLSDTIRE 243
            ++ ++       ++++ + E
Sbjct: 809 IVELVDADMDELQEIAEKLAE 829


>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family.  This family of proteins includes
           MND1 from S. cerevisiae. The mnd1 protein forms a
           complex with hop2 to promote homologous chromosome
           pairing and meiotic double-strand break repair.
          Length = 188

 Score = 28.4 bits (64), Expect = 3.3
 Identities = 21/88 (23%), Positives = 42/88 (47%), Gaps = 7/88 (7%)

Query: 169 EKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELN 228
           EKL  ++E LKQ   +L  ++   +K     R+ T +    L ++ +++ E++    EL 
Sbjct: 72  EKLKKELEELKQRIAELQAQIEKLKKG----REETEERTELLEELKQLEKELKKLKAELE 127

Query: 229 KKNQSESKLSDTIRELNEENIGLKQKVE 256
           K  +++ +    I +L EE    K+   
Sbjct: 128 KYEKNDPE---RIEKLKEETKVAKEAAN 152


>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
          Length = 508

 Score = 28.9 bits (65), Expect = 3.4
 Identities = 17/106 (16%), Positives = 38/106 (35%), Gaps = 13/106 (12%)

Query: 176 EFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELNKKNQSES 235
             L + K  L  E +  ++E    R+     E RL+Q  E       + ++L+       
Sbjct: 52  AALLEAKELLLRERNQQRQEARREREELQREEERLVQKEEQLDA---RAEKLDNLENQLE 108

Query: 236 KLSDTIRELNEENIGLKQKVELQ----------EAENQLIQTTKNE 271
           +    +     E   L+++++ +          +A   L++    E
Sbjct: 109 EREKALSARELELEELEKQLDNELYRVAGLTPEQARKLLLKLLDAE 154


>gnl|CDD|238159 cd00256, VATPase_H, VATPase_H, regulatory vacuolar ATP synthase
           subunit H (Vma13p); activation component of the
           peripheral V1 complex of V-ATPase, a heteromultimeric
           enzyme which uses  ATP to actively transport protons
           into organelles and extracellular compartments. The
           topology is that of a superhelical spiral, in part the
           geometry is similar to superhelices composed of
           armadillo repeat motifs, as found in importins for
           example.
          Length = 429

 Score = 28.5 bits (64), Expect = 3.6
 Identities = 23/89 (25%), Positives = 41/89 (46%), Gaps = 12/89 (13%)

Query: 167 SSEKLTNDIEFLKQEKLKLYEELSSSQK-ENEILRQTTGDLEVRLMQMNEVKGEMEF--- 222
             E LT+D++FL +E     ++LSS  + ++E+    +G L    +  +E      F   
Sbjct: 291 DDEDLTDDLKFLTEELKNSVQDLSSFDEYKSELR---SGRLHWSPVHKSEK-----FWRE 342

Query: 223 KIKELNKKNQSESKLSDTIRELNEENIGL 251
               LN+KN    K+   + E + + I L
Sbjct: 343 NADRLNEKNYELLKILIHLLETSVDPIIL 371


>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
          Length = 562

 Score = 28.8 bits (65), Expect = 3.7
 Identities = 18/121 (14%), Positives = 47/121 (38%), Gaps = 12/121 (9%)

Query: 160 EAASLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILR---------QTTGDLEVRL 210
           E  +L    E  +  +  L     K+  ++   QK  ++           Q   +   R+
Sbjct: 242 ELLNLVMDIEDPSAALNKLNTAAAKIKSKIEQFQKVIKMYEKGGVCPTCTQQISEGPDRI 301

Query: 211 MQMNEVKGEMEFKIKELNKKNQSESKLSDTIRELNEENIGLKQKVELQEAENQLIQTTKN 270
            ++ +   E++  +++L+       ++ D   E +++ + LK K+       Q + T  +
Sbjct: 302 TKIKDKLKELQHSLEKLDTAIDELEEIMDEFNEQSKKLLELKNKI---STNKQSLITLVD 358

Query: 271 E 271
           +
Sbjct: 359 K 359


>gnl|CDD|215507 PLN02939, PLN02939, transferase, transferring glycosyl groups.
          Length = 977

 Score = 28.7 bits (64), Expect = 3.8
 Identities = 31/133 (23%), Positives = 56/133 (42%), Gaps = 21/133 (15%)

Query: 156 IALSEAASLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMN- 214
             L +   + +++EK   +I  L Q +L+  E+L     E E L+     LE+RL + + 
Sbjct: 128 FQLEDLVGMIQNAEK---NILLLNQARLQALEDLEKILTEKEALQGKINILEMRLSETDA 184

Query: 215 ------EVKGEMEFKIKELNK-----------KNQSESKLSDTIRELNEENIGLKQKVEL 257
                 + K  +E   ++L K           +      LS  +  L EEN+ LK  ++ 
Sbjct: 185 RIKLAAQEKIHVEILEEQLEKLRNELLIRGATEGLCVHSLSKELDVLKEENMLLKDDIQF 244

Query: 258 QEAENQLIQTTKN 270
            +AE   +  T+ 
Sbjct: 245 LKAELIEVAETEE 257


>gnl|CDD|226470 COG3961, COG3961, Pyruvate decarboxylase and related thiamine
           pyrophosphate-requiring enzymes [Carbohydrate transport
           and metabolism / Coenzyme metabolism / General function
           prediction only].
          Length = 557

 Score = 28.8 bits (65), Expect = 3.8
 Identities = 23/80 (28%), Positives = 30/80 (37%), Gaps = 12/80 (15%)

Query: 103 RALLRRENVRATEIDRI--EMYKCYRPGDIIL----ARL----PLKELHSYQLSTAENEL 152
           +A+L   N    EIDR+     K  RP  I L    A L    PL  L     ++    L
Sbjct: 136 QAMLTDINTAPREIDRVIRTALKQRRPVYIGLPADVADLPIEAPLTPLDLQLKTSDPEAL 195

Query: 153 GVVIALSEAASLFRSSEKLT 172
             VI     A L   ++K  
Sbjct: 196 SEVI--DTIAELINKAKKPV 213


>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
          Length = 742

 Score = 28.6 bits (64), Expect = 4.2
 Identities = 16/73 (21%), Positives = 33/73 (45%), Gaps = 1/73 (1%)

Query: 178 LKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELNKK-NQSESK 236
             ++KL + ++  +        +Q  G L + LM+  + K E   K   +NK  N+ + K
Sbjct: 82  KIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVNKSTNKKKKK 141

Query: 237 LSDTIRELNEENI 249
           +  +  EL + + 
Sbjct: 142 VLSSKDELIKYDN 154


>gnl|CDD|220368 pfam09730, BicD, Microtubule-associated protein Bicaudal-D.  BicD
           proteins consist of three coiled-coiled domains and are
           involved in dynein-mediated minus end-directed transport
           from the Golgi apparatus to the endoplasmic reticulum
           (ER). For full functioning they bind with GSK-3beta
           pfam05350 to maintain the anchoring of microtubules to
           the centromere. It appears that amino-acid residues
           437-617 of BicD and the kinase activity of GSK-3 are
           necessary for the formation of a complex between BicD
           and GSK-3beta in intact cells.
          Length = 711

 Score = 28.6 bits (64), Expect = 4.4
 Identities = 29/82 (35%), Positives = 46/82 (56%), Gaps = 5/82 (6%)

Query: 175 IEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEV-KGEMEFKIKELNKKNQS 233
           I  L+ E  +L  ELS+ Q ENE L   + +L+    +M E+ +G +  +IKE   +   
Sbjct: 36  ILELQAELKQLRAELSNVQAENERLSSLSQELK-EENEMLELQRGRLRDEIKEYKFR--- 91

Query: 234 ESKLSDTIRELNEENIGLKQKV 255
           E++L     EL EENI L+++V
Sbjct: 92  EARLLQDYSELEEENISLQKQV 113


>gnl|CDD|218806 pfam05911, DUF869, Plant protein of unknown function (DUF869).
           This family consists of a number of sequences found in
           Arabidopsis thaliana, Oryza sativa and Lycopersicon
           esculentum (Tomato). The function of this family is
           unknown.
          Length = 767

 Score = 28.4 bits (63), Expect = 4.4
 Identities = 24/103 (23%), Positives = 42/103 (40%), Gaps = 12/103 (11%)

Query: 167 SSEKLTNDIEFLKQEKLKLYEELSSSQKENEILR-------QTTGDLEVRLMQMNEVKGE 219
                  ++E LK EK  L  ELS  + + E  +       +   +L+  L    E    
Sbjct: 588 VCISTLEELETLKSEKENLDGELSKCKDDLEESKNKLQETEKKLEELKSELDASQESNSL 647

Query: 220 MEFKIKELNKKNQS-ESKLSDTIRELNEENIGLKQKVELQEAE 261
            E ++K + +  +S E +  D    L  E+  L++K+   E E
Sbjct: 648 AETQLKCMVESYESLELRAKD----LEAEHKSLQEKISSLEDE 686


>gnl|CDD|226463 COG3954, PrkB, Phosphoribulokinase [Energy production and
           conversion].
          Length = 289

 Score = 28.3 bits (63), Expect = 4.5
 Identities = 14/47 (29%), Positives = 22/47 (46%), Gaps = 6/47 (12%)

Query: 87  KAHVISV------GNTALSRTFRALLRRENVRATEIDRIEMYKCYRP 127
           K  VI+V      G T  S  FR +  + N+ A E++    ++  RP
Sbjct: 4   KHPVIAVTGSSGAGTTTTSLAFRKIFAQLNIHAAEVEGDSFHRYTRP 50


>gnl|CDD|222160 pfam13476, AAA_23, AAA domain. 
          Length = 204

 Score = 27.9 bits (62), Expect = 4.9
 Identities = 24/121 (19%), Positives = 49/121 (40%), Gaps = 1/121 (0%)

Query: 147 TAENELGVVIALSEAASLFRSSEKLTNDIEFLKQEKLKLYEELSSS-QKENEILRQTTGD 205
                   +  + E+  L +   K          E  +L E +    + + EIL      
Sbjct: 81  ENNKGKLKLRLIEESRELTKKKGKKVKKSILEIVEIDELQEFIDELLKSDKEILPLLLYL 140

Query: 206 LEVRLMQMNEVKGEMEFKIKELNKKNQSESKLSDTIRELNEENIGLKQKVELQEAENQLI 265
            + RL ++   + E + +++EL K+ +      D + +L EE    K+  EL+E   +L+
Sbjct: 141 GQERLEELKFKRKEKKERLEELEKELEELEDEKDLLEKLLEEKEKKKELEELKEELEELL 200

Query: 266 Q 266
           +
Sbjct: 201 E 201


>gnl|CDD|217933 pfam04156, IncA, IncA protein.  Chlamydia trachomatis is an
           obligate intracellular bacterium that develops within a
           parasitophorous vacuole termed an inclusion. The
           inclusion is non-fusogenic with lysosomes but intercepts
           lipids from a host cell exocytic pathway. Initiation of
           chlamydial development is concurrent with modification
           of the inclusion membrane by a set of C.
           trachomatis-encoded proteins collectively designated
           Incs. One of these Incs, IncA, is functionally
           associated with the homotypic fusion of inclusions. This
           family probably includes members of the wider Inc family
           rather than just IncA.
          Length = 186

 Score = 27.4 bits (61), Expect = 5.5
 Identities = 31/137 (22%), Positives = 53/137 (38%), Gaps = 5/137 (3%)

Query: 130 IILARLPLKELHSYQLSTAENELGVVIALSEAASLFRSSEKLTNDIEFLKQEKLKLYEEL 189
           + L  L L      Q    +    +   LSE        ++   D+E    E     E+L
Sbjct: 53  LALGLLCLLLKAPVQSVRPQKLEELQGELSELKQQLSELQEELEDLEERIAELESELEDL 112

Query: 190 SSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELNKKNQSESK-LSDTIRELNEEN 248
               +    L +    LE RL  + E   E+  +++EL +  + E + L + +  L E  
Sbjct: 113 KEDLQLLRELLK---SLEERLESLEESIKELAKELRELRQDLREEVEELREELERLQENL 169

Query: 249 IGLKQKV-ELQEAENQL 264
             L++ + ELQ    QL
Sbjct: 170 QRLQEAIQELQSLLEQL 186


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 28.1 bits (63), Expect = 5.6
 Identities = 25/94 (26%), Positives = 44/94 (46%), Gaps = 11/94 (11%)

Query: 175  IEFLKQEKL-KLYEELSSSQKENEILRQTT------GDLEVRLMQMNEVKGEMEFKIKEL 227
            I  L +EK+ KL  EL   +KE E L+ TT       DL+     + E +   E   KE+
Sbjct: 1096 IWSLTKEKVEKLNAELEKKEKELEKLKNTTPKDMWLEDLDKFEEALEEQEEVEE---KEI 1152

Query: 228  NKKNQSESKLSDTIRELNEENIGLKQKVELQEAE 261
             K+ + +SK      +L  +    K++ + +++ 
Sbjct: 1153 AKEQRLKSKTKGKASKLR-KPKLKKKEKKKKKSS 1185


>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
          Length = 1463

 Score = 28.1 bits (63), Expect = 5.8
 Identities = 26/147 (17%), Positives = 48/147 (32%), Gaps = 16/147 (10%)

Query: 141  HSYQLSTAENELGVVIALSEAASLFRSSEKLTNDIEFLKQEKLK------------LYEE 188
                ++       + +  S ++ L  + E  T  I  LK+                   E
Sbjct: 896  SLKLVNLELESEIIELKKSLSSDLIENLEFKTELIARLKKLLNNIDLEEGPSIEYVKLPE 955

Query: 189  LSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELNKKNQSESKLSDTIRELNEEN 248
            L+   +    L++T+ + E  L +   +  E      EL    +  ++LS     L E  
Sbjct: 956  LNKLHEVESKLKETSEEYEDLLKKSTILVREGNKANSELKNFKKELAELSKQYGALQEST 1015

Query: 249  IGLKQK----VELQEAENQLIQTTKNE 271
              LK+      ELQ A   +   +   
Sbjct: 1016 KQLKELPVEVAELQSASKIISSESTEL 1042


>gnl|CDD|179699 PRK03992, PRK03992, proteasome-activating nucleotidase;
           Provisional.
          Length = 389

 Score = 27.9 bits (63), Expect = 6.0
 Identities = 10/33 (30%), Positives = 15/33 (45%)

Query: 169 EKLTNDIEFLKQEKLKLYEELSSSQKENEILRQ 201
            +L   +  L+ E  KL  EL   + E E L+ 
Sbjct: 18  RQLELKLRDLEAENEKLERELERLKSELEKLKS 50


>gnl|CDD|232931 TIGR00344, alaS, alanine--tRNA ligase.  The model describes
           alanine--tRNA ligase. This enzyme catalyzes the reaction
           (tRNAala + L-alanine + ATP = L-alanyl-tRNAala +
           pyrophosphate + AMP) [Protein synthesis, tRNA
           aminoacylation].
          Length = 851

 Score = 28.1 bits (63), Expect = 6.6
 Identities = 20/101 (19%), Positives = 41/101 (40%), Gaps = 18/101 (17%)

Query: 158 LSEAASLFRS-SEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEV-RLMQMNE 215
           L E A + +    +L   IE       + +EE  + QKE E L+    D ++  L    E
Sbjct: 710 LKELADILKVTPNELPKKIE-------RFFEEWKALQKELESLKLKIADQKIDELKDKAE 762

Query: 216 VKGEMEFKIKELNKKNQSESKLSDTIRELNEENIGLKQKVE 256
               ++  ++ ++ K+         ++ + +    LK K+ 
Sbjct: 763 TINGVKVLVEVVDAKDMKS------LKTMADR---LKSKLG 794


>gnl|CDD|150430 pfam09755, DUF2046, Uncharacterized conserved protein H4 (DUF2046).
            This is the conserved N-terminal 350 residues of a
           family of proteins of unknown function possibly
           containing a coiled-coil domain.
          Length = 308

 Score = 27.6 bits (61), Expect = 6.7
 Identities = 36/118 (30%), Positives = 55/118 (46%), Gaps = 21/118 (17%)

Query: 167 SSEKLTNDIEFLKQEKLKLYEEL-------SSSQKENEILRQTTGDLEVRLMQMNE-VKG 218
           S E+L   I  LKQE   L  EL        S Q+EN  LRQ +  ++ +  Q  E +  
Sbjct: 21  SRERLRGRIHSLKQENKVLKMELDTFKLKCKSLQEENRALRQASVSIQAKAEQEEEFISN 80

Query: 219 EMEFKIKELNKKNQS------------ESKLSDTIRELNEENIGLKQKVELQEAENQL 264
            +  KI+ L K+ ++             + LS  + +L +E I L+Q +E QE E Q+
Sbjct: 81  TLLKKIQALKKEKETLAMNYEQEEEFLTNDLSRKLTQLRQEKIQLEQTLE-QEQEYQV 137


>gnl|CDD|233459 TIGR01541, tape_meas_lam_C, phage tail tape measure protein, lambda
           family.  This model represents a relatively
           well-conserved region near the C-terminus of the tape
           measure protein of a lambda and related phage. This
           protein, which controls phage tail length, is typically
           about 1000 residues in length. Both low-complexity
           sequence and insertion/deletion events appear common in
           this family. Mutational studies suggest a ruler or
           template role in the determination of phage tail length.
           Similar behavior is attributed to proteins from
           distantly related or unrelated families in other phage
           [Mobile and extrachromosomal element functions, Prophage
           functions].
          Length = 332

 Score = 27.9 bits (62), Expect = 6.7
 Identities = 21/94 (22%), Positives = 37/94 (39%), Gaps = 7/94 (7%)

Query: 170 KLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELNK 229
            LT  I   K +KL   +E S   + +EI+      L      + E + +    +K+L +
Sbjct: 5   LLTQQIADRKLKKLNTADEKSLQSRSDEIIA-----LIKLEKLLEEAERKALEALKKLAE 59

Query: 230 KNQSESKLSDTIRELNEENIGLKQKVELQEAENQ 263
              S    +   R+L+   +G KQ+  L      
Sbjct: 60  ATASIR--AQNKRQLDRFGLGDKQRERLDARLQI 91


>gnl|CDD|219816 pfam08388, GIIM, Group II intron, maturase-specific domain.  This
           region is found mainly in various bacterial and archaeal
           species, but a few members of this family are expressed
           by fungal and chlamydomonal species. It has been
           implicated in the binding of intron RNA during reverse
           transcription and splicing.
          Length = 80

 Score = 26.0 bits (58), Expect = 7.5
 Identities = 7/23 (30%), Positives = 14/23 (60%)

Query: 223 KIKELNKKNQSESKLSDTIRELN 245
           KI+EL +++       + I++LN
Sbjct: 7   KIRELTRRSNKGLSQEELIKKLN 29


>gnl|CDD|234342 TIGR03752, conj_TIGR03752, integrating conjugative element protein,
           PFL_4705 family.  Members of this protein family are
           found occasionally on plasmids such as the Pseudomonas
           putida toluene catabolic TOL plasmid pWWO_p085. Usually,
           however, they are found on the bacterial main chromosome
           in regions flanked by markers of conjugative transfer
           and/or transposition [Mobile and extrachromosomal
           element functions, Plasmid functions].
          Length = 472

 Score = 27.7 bits (62), Expect = 8.0
 Identities = 17/75 (22%), Positives = 34/75 (45%), Gaps = 2/75 (2%)

Query: 178 LKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELNKKNQSESKL 237
           L++   KL  E  + + ENE L++    ++ ++ Q   V+ E +   KE+ +      +L
Sbjct: 71  LRKRLAKLISENEALKAENERLQKREQSIDQQIQQ--AVQSETQELTKEIEQLKSERQQL 128

Query: 238 SDTIRELNEENIGLK 252
              I +L     G+ 
Sbjct: 129 QGLIDQLQRRLAGVL 143


>gnl|CDD|227569 COG5244, NIP100, Dynactin complex subunit involved in mitotic
           spindle partitioning in anaphase B [Cell division and
           chromosome partitioning].
          Length = 669

 Score = 27.7 bits (61), Expect = 8.1
 Identities = 27/111 (24%), Positives = 45/111 (40%), Gaps = 13/111 (11%)

Query: 166 RSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIK 225
              +KL    E + ++  +L+   S     + IL     D      Q  E    +   I 
Sbjct: 443 ARIDKL-IGKEEISKQDNRLFLYPSCDITLSSILTILFSDKLEVFFQGIE---SLLENIT 498

Query: 226 ELNKKNQSESKLSDTIRE-------LNEENIGLKQKVELQEAENQLIQTTK 269
              ++   ++  S+ I+E       LNEENI LK+ +   + EN L + TK
Sbjct: 499 IFPEQPSQQTSDSENIKENSLLSDRLNEENIRLKEVLV--QKENMLTEETK 547


>gnl|CDD|233119 TIGR00763, lon, ATP-dependent protease La.  This protein is induced
           by heat shock and other stresses in E. coli, B.
           subtilis, and other species. The yeast member,
           designated PIM1, is located in the mitochondrial matrix,
           required for mitochondrial function, and also induced by
           heat shock [Protein fate, Degradation of proteins,
           peptides, and glycopeptides].
          Length = 775

 Score = 27.6 bits (62), Expect = 8.2
 Identities = 34/151 (22%), Positives = 66/151 (43%), Gaps = 23/151 (15%)

Query: 123 KCYRPGDIILARLPLKELHSYQLSTAENELGVVIALSEAASLFRSSEKLTNDIEFLK-QE 181
           +  R  D + A L LKE    Q       +           L ++ E L  ++E LK Q 
Sbjct: 158 EPGRLADFVAASLQLKEKDELQEVLETVNIEK--------RLKKALELLKKELELLKLQN 209

Query: 182 KL--KLYEELSSSQKE---NEILRQTTGDLEVRLMQMNEVKGEMEFKIKELNKKNQSESK 236
           K+  K+ E++  +Q+E    E L+     ++  L    + K E+E K+KE  ++     K
Sbjct: 210 KITKKVEEKMEKTQREYYLREQLKA----IKKELGIEKDDKDELE-KLKEKLEEL----K 260

Query: 237 LSDTIRELNEENIGLKQKVELQEAENQLIQT 267
           L + ++++ E+ +     +E   +E  + + 
Sbjct: 261 LPEEVKKVIEKELTKLSLLEPSSSEFTVTRN 291


>gnl|CDD|235456 PRK05425, PRK05425, asparagine synthetase AsnA; Provisional.
          Length = 327

 Score = 27.5 bits (62), Expect = 8.4
 Identities = 9/34 (26%), Positives = 17/34 (50%), Gaps = 1/34 (2%)

Query: 168 SEKLTNDIEFL-KQEKLKLYEELSSSQKENEILR 200
              L  +I F+  QE    Y +L+  ++E+ I +
Sbjct: 160 LPFLPEEITFITSQELEDRYPDLTPKEREDAIAK 193


>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
           interacts with SMC proteins [Cell division and
           chromosome partitioning].
          Length = 622

 Score = 27.6 bits (61), Expect = 8.5
 Identities = 25/98 (25%), Positives = 43/98 (43%), Gaps = 8/98 (8%)

Query: 171 LTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELNKK 230
           L +DI  L Q   KL  ELS +  + E+ ++               + E+E   KELN  
Sbjct: 492 LKHDINELTQILEKLELELSEANSKFELSKEENEREL------VAQRIEIEKLEKELNDL 545

Query: 231 N-QSESKLSDTIRELNEENIGLKQ-KVELQEAENQLIQ 266
           N  S++ + D  + +    I L + KV+L     ++ +
Sbjct: 546 NLLSKTSILDAEQLVQSTEIKLDELKVDLNRKRYKIHK 583


>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein.  This family consists
           of several eukaryotic mitotic checkpoint (Mitotic arrest
           deficient or MAD) proteins. The mitotic spindle
           checkpoint monitors proper attachment of the bipolar
           spindle to the kinetochores of aligned sister chromatids
           and causes a cell cycle arrest in prometaphase when
           failures occur. Multiple components of the mitotic
           spindle checkpoint have been identified in yeast and
           higher eukaryotes. In S.cerevisiae, the existence of a
           Mad1-dependent complex containing Mad2, Mad3, Bub3 and
           Cdc20 has been demonstrated.
          Length = 722

 Score = 27.6 bits (61), Expect = 9.1
 Identities = 27/138 (19%), Positives = 51/138 (36%), Gaps = 18/138 (13%)

Query: 144 QLSTAENELGVVIALSEAASLFRSSEKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTT 203
           Q   A+ E  +     +       +  L  +I+ LK++ +     LS  ++    LR   
Sbjct: 453 QSHLAKMENQLSELEEDVGQQKDRNNTLETEIKLLKEQLVSNERLLSFVKEATNALRLKI 512

Query: 204 GDLEVRLMQMNEVKGEMEFKIKELNKK---NQSESKL---------------SDTIRELN 245
             LE    ++ + K  +E K++ L  +   N S +K+                 TI  L 
Sbjct: 513 ETLERERDRLRQEKSLLEMKLEHLCLQGDYNASRTKVLHMSLNPASEAEQIAKQTIEALQ 572

Query: 246 EENIGLKQKVELQEAENQ 263
            E   LK++++  E    
Sbjct: 573 AECEKLKERLQALEEGKS 590


>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein.  This domain is found in
           cell division proteins which are required for
           kinetochore-spindle association.
          Length = 321

 Score = 27.3 bits (61), Expect = 9.2
 Identities = 20/75 (26%), Positives = 38/75 (50%), Gaps = 4/75 (5%)

Query: 160 EAASLFRSSEKLTN----DIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNE 215
           E ASL + +++L      ++E  +QE   L  ++S  +K+ E L+Q   +L + +  +  
Sbjct: 190 EIASLRQLADELNLCDPLELEKARQELRSLSVKISEKRKQLEELQQELQELTIAIEALTN 249

Query: 216 VKGEMEFKIKELNKK 230
            K E+  +I E  K 
Sbjct: 250 KKSELLEEIAEAEKI 264


>gnl|CDD|192987 pfam12325, TMF_TATA_bd, TATA element modulatory factor 1 TATA
           binding.  This is the C-terminal conserved coiled coil
           region of a family of TATA element modulatory factor 1
           proteins conserved in eukaryotes. The proteins bind to
           the TATA element of some RNA polymerase II promoters and
           repress their activity. by competing with the binding of
           TATA binding protein. TMF1_TATA_bd is the most conserved
           part of the TMFs. TMFs are evolutionarily conserved
           golgins that bind Rab6, a ubiquitous ras-like
           GTP-binding Golgi protein, and contribute to Golgi
           organisation in animal and plant cells. The Rab6-binding
           domain appears to be the same region as this C-terminal
           family.
          Length = 121

 Score = 26.4 bits (59), Expect = 9.9
 Identities = 26/91 (28%), Positives = 46/91 (50%), Gaps = 10/91 (10%)

Query: 169 EKLTNDIEFLKQEKLKLYEELSSSQKENEILRQTTGDLEVRLMQMNEVKGEMEFKIKELN 228
           E+L++ +  L+ E   L +EL+  + E +  RQ      V+L + NE   E++ +I+EL 
Sbjct: 20  ERLSSTLRRLEGELASLKDELARLEAERDEARQEI----VKLTEENEELRELKKEIEELE 75

Query: 229 KKNQSESKLSDTIRELNEENIGLKQKVELQE 259
           K+ +   +  +T  EL      L +K E  E
Sbjct: 76  KELEDLEQRYETTLEL------LGEKSERVE 100


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.313    0.130    0.347 

Gapped
Lambda     K      H
   0.267   0.0722    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 13,356,386
Number of extensions: 1287642
Number of successful extensions: 1792
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1641
Number of HSP's successfully gapped: 301
Length of query: 271
Length of database: 10,937,602
Length adjustment: 95
Effective length of query: 176
Effective length of database: 6,723,972
Effective search space: 1183419072
Effective search space used: 1183419072
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 58 (26.1 bits)