RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy10418
         (389 letters)



>gnl|CDD|150420 pfam09744, Jnk-SapK_ap_N, JNK_SAPK-associated protein-1.  This is
           the N-terminal 200 residues of a set of proteins
           conserved from yeasts to humans. Most of the proteins in
           this entry have an RhoGEF pfam00621 domain at their
           C-terminal end.
          Length = 158

 Score = 64.8 bits (158), Expect = 1e-12
 Identities = 39/105 (37%), Positives = 69/105 (65%), Gaps = 5/105 (4%)

Query: 14  VFDIASEIGKEFEKIIDQYGPDAVTSIMPKVIVALEHLECLASKNERENTKLQELGDHIS 73
           V+D+A+ I +EFE++ID YG + V ++MP VI  LE+L+ LAS+N+    +L+ L     
Sbjct: 1   VYDLANSIYQEFERLIDNYGEEVVKNLMPLVIRVLENLDSLASENQELEVELELL----- 55

Query: 74  KLEHEKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEH 118
           + ++E+L  + E  +E  K+  Q+L+E  + L++E K+L K++E 
Sbjct: 56  REDNERLSTQYEREKELRKQAEQKLLEFEDELEQEKKELQKKIED 100


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 42.7 bits (101), Expect = 3e-04
 Identities = 36/174 (20%), Positives = 75/174 (43%), Gaps = 11/174 (6%)

Query: 202 RNTLKTRLNELEEERQCFKMVNRLQEENKKLSKRL-------EHEGSSPMKSQQLSPEID 254
            N L+ +L  LE + +  +    L+ E ++L   L         E    ++ +    E +
Sbjct: 195 LNELERQLKSLERQAEKAERYKELKAELRELELALLVLRLEELREELEELQEELKEAEEE 254

Query: 255 I----SALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLS 310
           +    + LQ+L  +++ LR ++   E+E+     ++  L  +I RL    + LR +    
Sbjct: 255 LEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANL 314

Query: 311 QVQVRGLIDERADFLAALQDHQREVAALKQQLGLAEKENEDLYNSQETTSEEER 364
           + Q+  L  +  +  + L +   E+A L+++L   ++E E L    E    E  
Sbjct: 315 ERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELE 368



 Score = 40.8 bits (96), Expect = 0.001
 Identities = 56/267 (20%), Positives = 120/267 (44%), Gaps = 30/267 (11%)

Query: 54  LASKNERENTKLQELGDHISKLEHE--KLEREVEEIEESWKE----------ETQQLVEM 101
              + E    +LQEL + + +L  E  +LE E+EE+++              + Q L E 
Sbjct: 251 AEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRER 310

Query: 102 VNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISVLQKLRGQVDSLRDQMRLKEKELGY 161
           +  L+ + ++L  +LE   S   +  +   E++   L++L+ +++SL  +  L+E E   
Sbjct: 311 LANLERQLEELEAQLEELESKLDELAEELAELE-EKLEELKEELESLEAE--LEELEAEL 367

Query: 162 RNSDVDNSDIPDLSNKAVYDLDDPNRPRFTTAELKEILHDRNTLKTRLNELEEERQCFKM 221
              +    ++ +        +           ++  + ++   L+ RL  LE+ R+    
Sbjct: 368 EELESRLEELEEQLETLRSKVAQLEL------QIASLNNEIERLEARLERLEDRRE---- 417

Query: 222 VNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISALQKLRGQVDSLRDQMRLKEKELGY 281
             RLQ+E ++L K+LE    + +K  Q   E     L++L+ +++ L + +    +EL  
Sbjct: 418 --RLQQEIEELLKKLE---EAELKELQAELEELEEELEELQEELERLEEALEELREELEE 472

Query: 282 RNSDVDNLKRQIDRLTSTVRELRRKQR 308
               +D  +R++ +L + +  L R Q 
Sbjct: 473 AEQALDAAERELAQLQARLDSLERLQE 499



 Score = 40.0 bits (94), Expect = 0.002
 Identities = 39/191 (20%), Positives = 78/191 (40%), Gaps = 26/191 (13%)

Query: 203 NTLKTRLNELEEERQCFKM--------VNRLQEENKKLSKRLEHEGSSPM---------- 244
             L+  L  L  E    +         +  L+ + ++L  +LE   S             
Sbjct: 284 EELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELE 343

Query: 245 -KSQQLSPEID--ISALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVR 301
            K ++L  E++   + L++L  +++ L  ++   E++L    S V  L+ QI  L + + 
Sbjct: 344 EKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIE 403

Query: 302 ELRRKQRLSQVQVRGLIDERADFL-----AALQDHQREVAALKQQLGLAEKENEDLYNSQ 356
            L  +    + +   L  E  + L     A L++ Q E+  L+++L   ++E E L  + 
Sbjct: 404 RLEARLERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEAL 463

Query: 357 ETTSEEERPVQ 367
           E   EE    +
Sbjct: 464 EELREELEEAE 474



 Score = 35.4 bits (82), Expect = 0.051
 Identities = 53/303 (17%), Positives = 122/303 (40%), Gaps = 59/303 (19%)

Query: 43  KVIVALEHLECLASKNERENTKLQELGDHIS--KLEHEKLEREVEEIEESWKEETQQLVE 100
            +    + LE L  + E+   +L+EL   IS  + +  +LE EVE++EE   + +++L E
Sbjct: 699 ALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTE 758

Query: 101 MVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISVLQKLRGQVDSLRDQMRLKEKELG 160
           +   ++E  ++L +  E    +  + ++L  +I     ++L+ ++ +LR+ +     EL 
Sbjct: 759 LEAEIEELEERLEEAEEELAEAEAEIEELEAQI-----EQLKEELKALREALDELRAEL- 812

Query: 161 YRNSDVDNSDIPDLSNKAVYDLDDPNRPRFTTAELKEILHD-RNTLKTRLNELEEERQCF 219
                                             L E   + R  L++    +    +  
Sbjct: 813 --------------------------------TLLNEEAANLRERLESLERRIAATERRL 840

Query: 220 KMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISALQKLRGQVDSLRDQMRLKEKEL 279
           + +    EE  +  + L  E    ++  +   E       +L  ++++L ++    E+ L
Sbjct: 841 EDLEEQIEELSEDIESLAAE----IEELEELIE-------ELESELEALLNERASLEEAL 889

Query: 280 GYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQVRGLIDERADFLAALQDHQREVAALK 339
               S+++ L  ++  L S   ELRR+    +       ++ A     L+  +  +  L+
Sbjct: 890 ALLRSELEELSEELRELESKRSELRRELEELR-------EKLAQLELRLEGLEVRIDNLQ 942

Query: 340 QQL 342
           ++L
Sbjct: 943 ERL 945



 Score = 33.9 bits (78), Expect = 0.16
 Identities = 55/280 (19%), Positives = 112/280 (40%), Gaps = 43/280 (15%)

Query: 76  EHEKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDI 135
           E E+LE ++EE+EE   E  + L E+   L+E  ++L +  +       +   L  +   
Sbjct: 678 EIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKD--- 734

Query: 136 SVLQKLRGQVDSLRDQMRLKEKELGYRNSDVDNSDIPDLSNKAVYDLDDPNRPRFTTAEL 195
             L +L  +V+ L +++    KEL             +L  +     +          EL
Sbjct: 735 --LARLEAEVEQLEERIAQLSKELT------------ELEAEIEELEERLEE---AEEEL 777

Query: 196 KEILHDRNTLKTRLNELEEERQCFK-MVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEID 254
            E   +   L+ ++ +L+EE +  +  ++ L+ E   L++                    
Sbjct: 778 AEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEA------------------ 819

Query: 255 ISALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQV 314
                 LR +++SL  ++   E+ L      ++ L   I+ L + + EL       + ++
Sbjct: 820 ----ANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESEL 875

Query: 315 RGLIDERADFLAALQDHQREVAALKQQLGLAEKENEDLYN 354
             L++ERA    AL   + E+  L ++L   E +  +L  
Sbjct: 876 EALLNERASLEEALALLRSELEELSEELRELESKRSELRR 915



 Score = 31.6 bits (72), Expect = 0.95
 Identities = 30/154 (19%), Positives = 66/154 (42%), Gaps = 7/154 (4%)

Query: 209 LNELEEERQCFKMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISALQKLRGQVDSL 268
             +LE  R+    +  +  E ++  K LE +     + ++L  E+    L  L  +++ L
Sbjct: 178 ERKLERTRENLDRLEDILNELERQLKSLERQAEKAERYKELKAELRELELALLVLRLEEL 237

Query: 269 RDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQVRGLIDERADFLAAL 328
           R+++   ++EL     +++ L  ++  L   + ELR       ++V  L +E  +    L
Sbjct: 238 REELEELQEELKEAEEELEELTAELQELEEKLEELR-------LEVSELEEEIEELQKEL 290

Query: 329 QDHQREVAALKQQLGLAEKENEDLYNSQETTSEE 362
                E++ L+QQ  +  +   +L    E    +
Sbjct: 291 YALANEISRLEQQKQILRERLANLERQLEELEAQ 324



 Score = 29.6 bits (67), Expect = 3.2
 Identities = 24/107 (22%), Positives = 48/107 (44%)

Query: 256 SALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQVR 315
             + +L   +  LR ++   E+EL     +++ L RQI  L   +  L  +    + ++ 
Sbjct: 691 EKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIA 750

Query: 316 GLIDERADFLAALQDHQREVAALKQQLGLAEKENEDLYNSQETTSEE 362
            L  E  +  A +++ +  +   +++L  AE E E+L    E   EE
Sbjct: 751 QLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEE 797


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 42.4 bits (100), Expect = 3e-04
 Identities = 45/238 (18%), Positives = 101/238 (42%), Gaps = 7/238 (2%)

Query: 120 GSSPMKSQQLSPEIDISVLQK----LRGQVDSLRDQMRLKEKELGYRNSDVDNSDIPDLS 175
           G S  K   L+ + ++  L++    L  Q++ L ++++  + EL      ++        
Sbjct: 654 GGSRNKRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEE 713

Query: 176 NKAVYDLDDPNRPRFTTAELKEILHDRNTLKTRLNELEEER-QCFKMVNRLQEENKKLSK 234
            +   +            EL+++      L+  L ELEEE  +  + +  L+EE + L +
Sbjct: 714 LERQLEELK-RELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEE 772

Query: 235 RLEHEGSSPMKSQQLSPEIDISALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQID 294
            L        + ++    +    L++L  +++    ++   E+EL       + L+++I+
Sbjct: 773 ALAKLKEEIEELEEKRQALQ-EELEELEEELEEAERRLDALERELESLEQRRERLEQEIE 831

Query: 295 RLTSTVRELRRKQRLSQVQVRGLIDERADFLAALQDHQREVAALKQQLGLAEKENEDL 352
            L   + EL  K    + ++  L  E  +    L++ + E   L+ +L   E+E E+L
Sbjct: 832 ELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEEL 889



 Score = 41.6 bits (98), Expect = 6e-04
 Identities = 64/322 (19%), Positives = 137/322 (42%), Gaps = 30/322 (9%)

Query: 59  ERENTKLQELGDHISKLEH--EKLEREVEEIEE----SWKEETQQLVEMVNRLQEENKKL 112
           ER    L+ L D + +LE   EKLER+ E+ E       +    +L  ++ +L+E  K+L
Sbjct: 182 ERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELELALLLAKLKELRKEL 241

Query: 113 SKRLEHEGSSPMKSQQLSPEID--ISVLQKLRGQVDSLRDQMRLKEKELGYRNSDVD--N 168
            +  E       + ++L  E++     +++L+ +++ LR+++   ++EL     +++   
Sbjct: 242 EELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELE 301

Query: 169 SDIPDLSNKAVYDLDDPNRPRFTTAELKEILHDRNTLKTRLNELEEERQCFKMVNRLQEE 228
            +I  L  +     ++         ELKE +            L EE         L++ 
Sbjct: 302 GEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEE---------LEQL 352

Query: 229 NKKLSKRLEHEGSSPMKSQQLSPEIDISALQKLRGQVDSLRDQMRLKEKELGYRNSDVDN 288
             +L +  E              E   + L++L    ++LR+++   E EL    ++++ 
Sbjct: 353 LAELEEAKEEL-----------EEKLSALLEELEELFEALREELAELEAELAEIRNELEE 401

Query: 289 LKRQIDRLTSTVRELRRKQRLSQVQVRGLIDERADFLAALQDHQREVAALKQQLGLAEKE 348
           LKR+I+ L   +  L  +    + +++ L  E  +    L++   E+  L++QL      
Sbjct: 402 LKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELRDR 461

Query: 349 NEDLYNSQETTSEEERPVQGPL 370
            ++L        EE + ++  L
Sbjct: 462 LKELERELAELQEELQRLEKEL 483



 Score = 40.5 bits (95), Expect = 0.001
 Identities = 59/275 (21%), Positives = 122/275 (44%), Gaps = 23/275 (8%)

Query: 51  LECLASKNERENTKLQELGDHISKLEHEKLEREVEEIEESWKEETQQLVEMVNRLQEENK 110
           LE    + E   ++L+EL + + +L+ E LE  ++E  E  + E   L E +  L+ E +
Sbjct: 262 LEEAEKEIEELKSELEELREELEELQEELLE--LKEEIEELEGEISLLRERLEELENELE 319

Query: 111 KLSKRLEHEGSSPMKSQQLSPEIDISVLQKLRGQVDSLRDQMRLKEKELGYRNSDVDNSD 170
           +L +RLE       K +  + + ++   + L  +++ L  ++   ++EL  + S +    
Sbjct: 320 ELEERLEEL-----KEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEEL 374

Query: 171 IPDLSNKAVYDLDDPNRPRFTTAELKEILHDRNTLKTRLNELEEERQCFKM-VNRLQEEN 229
                                  EL E+  +   ++  L EL+ E +  +  + RL E  
Sbjct: 375 EELFEALR--------------EELAELEAELAEIRNELEELKREIESLEERLERLSERL 420

Query: 230 KKLSKRLEHEGSSPMKSQQLSPEIDISALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNL 289
           + L + L+ E  + ++  Q   E     L++L  Q++ LRD+++  E+EL     ++  L
Sbjct: 421 EDLKEELK-ELEAELEELQTELEELNEELEELEEQLEELRDRLKELERELAELQEELQRL 479

Query: 290 KRQIDRLTSTVRELRRKQRLSQVQVRGLIDERADF 324
           ++++  L + +  L  +QR SQ     L    +  
Sbjct: 480 EKELSSLEARLDRLEAEQRASQGVRAVLEALESGL 514



 Score = 40.5 bits (95), Expect = 0.002
 Identities = 64/293 (21%), Positives = 125/293 (42%), Gaps = 38/293 (12%)

Query: 72  ISKLEHEKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEHEGSSPMKSQQLSP 131
           +SK +  K E E +       E T++ +E +  L EE +K  ++LE +     + Q+L  
Sbjct: 167 VSKYKERKEEAERKL------ERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKA 220

Query: 132 EIDISVLQKLRGQVDSLRDQMRLKEKELGYRNSDVDNSDIPDLSNKAVYDLDDPNRPRFT 191
           E+    L  L  ++  LR ++   E+EL     +++                        
Sbjct: 221 ELRELELALLLAKLKELRKELEELEEELSRLEEELEE----------------------L 258

Query: 192 TAELKEILHDRNTLKTRLNELEEERQCFKMVNRLQEENKKLSKRLEHEGSSPMKSQQLSP 251
             EL+E   +   LK+ L EL EE      +  LQEE  +L + +E E    +   +   
Sbjct: 259 QEELEEAEKEIEELKSELEELREE------LEELQEELLELKEEIE-ELEGEISLLRERL 311

Query: 252 EIDISALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQ 311
           E   + L++L  +++ L++++   ++EL  R + ++ L++ +  L     EL  K     
Sbjct: 312 EELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALL 371

Query: 312 VQVRGLIDERADFLAALQDHQREVAALKQQLGLAEKENEDLYNSQETTSEEER 364
            ++  L +   +    L + + E+A ++ +L   ++E E L    E  SE   
Sbjct: 372 EELEELFEALRE---ELAELEAELAEIRNELEELKREIESLEERLERLSERLE 421



 Score = 35.8 bits (83), Expect = 0.047
 Identities = 64/323 (19%), Positives = 140/323 (43%), Gaps = 40/323 (12%)

Query: 35  DAVTSIMPKVIVALEHLECLASKNERENTKLQELGDHISKLEHE-----KLEREVEEIEE 89
           + +  +  ++    E L+ L ++       L+EL   + +LE +     +    +EE  E
Sbjct: 674 EELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELE 733

Query: 90  SWKEETQQLVEMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISVLQKLRGQVDSLR 149
             +   ++L E +  L+EE ++L +RLE      ++ +  S E  ++ L++   +++  R
Sbjct: 734 QLQSRLEELEEELEELEEELEELQERLEE-----LEEELESLEEALAKLKEEIEELEEKR 788

Query: 150 DQMRLKEKELGYRNSDVDNSDIPDLSNKAVYDLDDPNRPRFTTAELKEILHDRNTLKTRL 209
             ++ + +EL     + +      L            R      E++E+  +   L+ +L
Sbjct: 789 QALQEELEELEEELEEAER----RLDALERELESLEQRRERLEQEIEELEEEIEELEEKL 844

Query: 210 NELEEERQCFKMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISALQKLRGQVDSLR 269
           +ELEEE      +  L++E ++L + LE   +   + +          L++L  + + L 
Sbjct: 845 DELEEE------LEELEKELEELKEELEELEAEKEELE--------DELKELEEEKEELE 890

Query: 270 DQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQVRGLIDERADFLAALQ 329
           +++R  E EL         LK +I++L   + EL  K    +V++  L +E  +      
Sbjct: 891 EELRELESEL-------AELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTL 943

Query: 330 DHQREVAALKQQLGLAEKENEDL 352
           + + E     +++   E+E E L
Sbjct: 944 ETELE-----REIERLEEEIEAL 961



 Score = 35.5 bits (82), Expect = 0.051
 Identities = 25/124 (20%), Positives = 60/124 (48%), Gaps = 3/124 (2%)

Query: 240 GSSPMKSQQLSPEIDISALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTST 299
           G S  K   L+ + ++  L++   ++  L  Q+   E+EL    +++ +L+  ++ L   
Sbjct: 654 GGSRNKRSSLAQKRELKELEE---ELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQ 710

Query: 300 VRELRRKQRLSQVQVRGLIDERADFLAALQDHQREVAALKQQLGLAEKENEDLYNSQETT 359
           + EL R+    + ++  L +E     + L++ + E+  L+++L   ++  E+L    E+ 
Sbjct: 711 LEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESL 770

Query: 360 SEEE 363
            E  
Sbjct: 771 EEAL 774



 Score = 29.3 bits (66), Expect = 5.1
 Identities = 51/261 (19%), Positives = 107/261 (40%), Gaps = 35/261 (13%)

Query: 54   LASKNERENTKLQELGDHISKLEHEKLE-----REVEEIEESWKEETQQLVEMVNRLQEE 108
            L  + E    K Q L + + +LE E  E       +E   ES ++  ++L + +  L+EE
Sbjct: 777  LKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEE 836

Query: 109  NKKLSKRLEHEGSSPMKSQQLSPEIDISVLQKLRGQVDSLRDQMRLKEKELGYRNSDVDN 168
             ++L ++L+          +   E     L++L+ +++ L  +    E EL     + + 
Sbjct: 837  IEELEEKLD--------ELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEE 888

Query: 169  SDIPDLSNKAVYDLDDPNRPRFTTAELKEILHDRNTLKTRLNELEEERQCFKMVNRLQEE 228
             +                  R   +EL E+  +   L+ RL ELE + +      RL+ E
Sbjct: 889  LE---------------EELRELESELAELKEEIEKLRERLEELEAKLE------RLEVE 927

Query: 229  NKKLSKRLEHEGSSPMKSQQLSPEIDISALQKLRGQVDSLRDQMRLK-EKELGYRNSDVD 287
              +L + LE E    ++++       +    +  G V+    +   + E+      S  +
Sbjct: 928  LPELEEELEEEYEDTLETELEREIERLEEEIEALGPVNLRAIEEYEEVEERYEELKSQRE 987

Query: 288  NLKRQIDRLTSTVRELRRKQR 308
            +L+   ++L   + EL +++R
Sbjct: 988  DLEEAKEKLLEVIEELDKEKR 1008


>gnl|CDD|221118 pfam11461, RILP, Rab interacting lysosomal protein.  RILP contains
           a domain which contains two coiled-coil regions and is
           found mainly in the cytosol. RILP is recruited onto late
           endosomal and lysosomal membranes by Rab7 and acts as a
           downstream effector of Rab7. This recruitment process is
           important for phagosome maturation and fusion with late
           endosomes and lysosomes.
          Length = 60

 Score = 36.5 bits (85), Expect = 0.001
 Identities = 18/57 (31%), Positives = 30/57 (52%), Gaps = 2/57 (3%)

Query: 191 TTAELKEILHDRNTLKTRLNELEEERQCFKMVNRLQEENKKLSKRLEHEGSSPMKSQ 247
           T  EL+E+L +RN LK ++  L+EE   +K  + L  E ++    L     +P K +
Sbjct: 1   TLQELREVLQERNELKAKVFLLQEELAYYK--SELLPEERRPGLPLPSTKLAPSKQR 55


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 40.8 bits (96), Expect = 0.001
 Identities = 61/330 (18%), Positives = 123/330 (37%), Gaps = 68/330 (20%)

Query: 47  ALEHLECLASKNERENT----KLQELGDHISKLEHE--KLEREVEEIEESWKEETQQLVE 100
           A  +   L  K E E      + + L      +E +   LE E+E++ E   E  ++L E
Sbjct: 210 AERYQALLKEKREYEGYELLKEKEALERQKEAIERQLASLEEELEKLTEEISELEKRLEE 269

Query: 101 MVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISVLQKLRGQVDSLRDQMRLKEKELG 160
           +   L+E NKK+    E E    +K +    E +I+    L   +     ++   E+ L 
Sbjct: 270 IEQLLEELNKKIKDLGEEE-QLRVKEKIGELEAEIA---SLERSIAEKERELEDAEERLA 325

Query: 161 YRNSDVDN--SDIPDLSNKAVYDLDDPNRPRFTTAELKEILHDRNTLKTRLNELEEERQC 218
              +++D   ++I +L                   E++E    R+ L     EL+EE + 
Sbjct: 326 KLEAEIDKLLAEIEELER-----------------EIEEERKRRDKLTEEYAELKEELE- 367

Query: 219 FKMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISALQKLRGQVDSLRDQMRLKEKE 278
             +   L+E +K+                                     RD+++   ++
Sbjct: 368 -DLRAELEEVDKEF---------------------------------AETRDELKDYREK 393

Query: 279 LGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQVRGLIDERADFLAALQDHQREVAAL 338
           L     +++ LKR++DRL   ++ L  +       + G+  +  +     +D   E+   
Sbjct: 394 LEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQ 453

Query: 339 KQQLGLA----EKENEDLYNSQETTSEEER 364
           + +L        K  ++LY+ +E     E+
Sbjct: 454 EWKLEQLAADLSKYEQELYDLKEEYDRVEK 483



 Score = 33.5 bits (77), Expect = 0.22
 Identities = 13/89 (14%), Positives = 45/89 (50%)

Query: 255 ISALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQV 314
            S L+++  ++D L  ++    +++G    +++ L+++ ++L   + EL       + ++
Sbjct: 694 QSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEI 753

Query: 315 RGLIDERADFLAALQDHQREVAALKQQLG 343
             +  E  +  A +++ + ++  L++ L 
Sbjct: 754 ENVKSELKELEARIEELEEDLHKLEEALN 782



 Score = 31.6 bits (72), Expect = 0.78
 Identities = 57/352 (16%), Positives = 136/352 (38%), Gaps = 81/352 (23%)

Query: 60  RENTKLQELGDHISKLEHE---------KLEREVEEIEESWKEETQQLVEMVNRLQEENK 110
            E  +LQ L + +  L+ E         ++E  ++E+ +   + ++++ E+   +++  +
Sbjct: 671 SEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQ 730

Query: 111 KLSKRLEHEGSSPMKSQQLSPEI--DISVLQKLRGQVDSLRDQMRLKEKELGYRNSDVDN 168
           +  K  E           L  EI    S L++L  +++ L + +   E+ L    + + +
Sbjct: 731 EEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSH 790

Query: 169 SDIPDLSNKAVYDLDDPNRPRFTTAELKEILHDRNTLKTRLNELEEERQCFKMVNRLQEE 228
           S IP++  +                               L++LEEE    ++  RL+E 
Sbjct: 791 SRIPEIQAE-------------------------------LSKLEEEVS--RIEARLREI 817

Query: 229 NKKLSKRLEHEGSSPMKSQQLSPEIDISALQKLRGQVDSLRDQMRLKEKELGYRNSDVDN 288
            +KL++         ++ + L  EI     Q+L+ Q   L++Q              + +
Sbjct: 818 EQKLNRLT-------LEKEYLEKEI-----QELQEQRIDLKEQ--------------IKS 851

Query: 289 LKRQIDRLTSTVRELRRKQRLSQVQVRGLIDERADFLAALQDHQREVAALKQQLGLAEKE 348
           ++++I+ L     EL  +    +  +R L     D      + + ++  L++++   E +
Sbjct: 852 IEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQ 911

Query: 349 NED-----------LYNSQETTSEEERPVQGPLPYEPDDAPWKKSESGIRKL 389
            E            L   +E  SE E P         ++   +  ++ ++++
Sbjct: 912 IEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEEELSLEDVQAELQRV 963



 Score = 29.7 bits (67), Expect = 3.9
 Identities = 28/168 (16%), Positives = 64/168 (38%), Gaps = 12/168 (7%)

Query: 205 LKTRLNELEEERQCFKMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISALQK---- 260
            + +L  L  ER+         E  + L K         +  ++ + E    A+++    
Sbjct: 196 KRQQLERLRRERE-------KAERYQALLKEKREYEGYELLKEKEALERQKEAIERQLAS 248

Query: 261 LRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTS-TVRELRRKQRLSQVQVRGLID 319
           L  +++ L +++   EK L      ++ L ++I  L       ++ K    + ++  L  
Sbjct: 249 LEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLER 308

Query: 320 ERADFLAALQDHQREVAALKQQLGLAEKENEDLYNSQETTSEEERPVQ 367
             A+    L+D +  +A L+ ++     E E+L    E   +    + 
Sbjct: 309 SIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLT 356



 Score = 29.3 bits (66), Expect = 4.9
 Identities = 25/150 (16%), Positives = 62/150 (41%), Gaps = 4/150 (2%)

Query: 240 GSSPMKSQQLSPEIDISALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTST 299
           GS   +   L    + + LQ+LR +++ L+ ++   + EL    + +D L +++   +  
Sbjct: 658 GSRAPRGGILFSRSEPAELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRK 717

Query: 300 VRELRRKQRLSQVQVRGLIDERADFLAALQDHQREVAALKQQLGLAEKENEDLYNSQETT 359
           + E+ ++    + +   L +   +    L   ++E+  +K +L   E   E+L    E  
Sbjct: 718 IGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEEL----EED 773

Query: 360 SEEERPVQGPLPYEPDDAPWKKSESGIRKL 389
             +       L      +   + ++ + KL
Sbjct: 774 LHKLEEALNDLEARLSHSRIPEIQAELSKL 803


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 943 to 1234 amino
           acids in length. This family contains a P-loop motif
           suggesting it is a nucleotide binding protein. It may be
           involved in replication.
          Length = 1198

 Score = 39.7 bits (93), Expect = 0.003
 Identities = 47/238 (19%), Positives = 96/238 (40%), Gaps = 28/238 (11%)

Query: 138 LQKLRGQVDSLRDQMRLKEKELGYRNSDVDNSDIPDLSNKAVYDLDDPNRPRFTTA---- 193
           LQ+    + S   + +  E++L   N++++     +   +        +  R        
Sbjct: 609 LQQAEEALQSAVAKQKQAEEQLVQANAELEEQKRAEAEARTALKQARLDLQRLQNEQQSL 668

Query: 194 --ELKEILHDR--------NTLKTRLNELEEERQCFKMV---NRLQEENKKLSKRLEHEG 240
             +L+  + +R          L  +L +L E++Q F      +  +   ++L+K    EG
Sbjct: 669 KDKLELAIAERKQQAETQLRQLDAQLKQLLEQQQAFLEALKDDFRELRTERLAKWQVVEG 728

Query: 241 SSPMKSQQLSPEIDISALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTST- 299
               +  QLS  I+ +A  + + ++  L+ Q   +   L    + V  LKRQI+ L +T 
Sbjct: 729 ELDNQLAQLSAAIE-AARTQAKARLKELKKQYDRELASLDVDPNTVKELKRQIEELETTI 787

Query: 300 ---------VRELRRKQRLSQVQVRGLIDERADFLAALQDHQREVAALKQQLGLAEKE 348
                    VRE R   + + +    L +ER +    L++ +     L+Q+L    K+
Sbjct: 788 ERIAVRRPEVREYRAFMQETWLHRDSLREERPNLAIQLRELESSAEELQQELTRLIKD 845



 Score = 31.6 bits (72), Expect = 0.80
 Identities = 18/110 (16%), Positives = 46/110 (41%), Gaps = 1/110 (0%)

Query: 256 SALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQVR 315
             LQ+    + S   + +  E++L   N++++  KR      + +++ R   +  Q + +
Sbjct: 607 ERLQQAEEALQSAVAKQKQAEEQLVQANAELEEQKRAEAEARTALKQARLDLQRLQNEQQ 666

Query: 316 GLIDERADFLAALQD-HQREVAALKQQLGLAEKENEDLYNSQETTSEEER 364
            L D+    +A  +   + ++  L  QL    ++ +    + +    E R
Sbjct: 667 SLKDKLELAIAERKQQAETQLRQLDAQLKQLLEQQQAFLEALKDDFRELR 716



 Score = 29.3 bits (66), Expect = 4.4
 Identities = 37/184 (20%), Positives = 71/184 (38%), Gaps = 17/184 (9%)

Query: 193 AELKEILHDRNTLKTRLNELEEERQCFKMVNRLQEENKKLSK---RLEHEGSSPMKSQQL 249
                +     TL+ +L E  +E         L   N KL+     LE         +  
Sbjct: 288 EAKNRLRQQLRTLEDQLKEARDELN-----QELSAANAKLAADRSELELLEDQKGAFE-- 340

Query: 250 SPEIDISALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRL 309
             + DI  LQ    Q+ S+R ++   E  L        +++R+ +RL     + + ++ L
Sbjct: 341 --DADIEQLQADLDQLPSIRSELEEVEARLDALTGKHQDVQRKYERLKQ-KIKEQLERDL 397

Query: 310 SQVQVRGLI--DERADFLAALQDH-QREVAALKQQLGLAEKE-NEDLYNSQETTSEEERP 365
            +   R     +E+    AA+++  Q   + L+QQL   + E NE+ Y  +      ++ 
Sbjct: 398 EKNNERLAAIREEKDRQKAAIEEDLQALESQLRQQLEAGKLEFNEEEYELELRLGRLKQR 457

Query: 366 VQGP 369
           +   
Sbjct: 458 LDSA 461


>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein.  Members of this
           protein family have a signal peptide, a strongly
           conserved SH3 domain, a variable region, and then a
           C-terminal hydrophobic transmembrane alpha helix region.
          Length = 198

 Score = 35.4 bits (82), Expect = 0.027
 Identities = 22/101 (21%), Positives = 49/101 (48%), Gaps = 10/101 (9%)

Query: 267 SLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQVRGLIDE------ 320
           S R+++   ++EL     ++  L+ Q+  L    +EL+++    + ++  L  E      
Sbjct: 63  SARERLPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQ 122

Query: 321 -RADFLAALQDHQR---EVAALKQQLGLAEKENEDLYNSQE 357
             A+ +   ++++    E+A LKQ+    E ENE L  +++
Sbjct: 123 LSANAIELDEENRELREELAELKQENEALEAENERLQENEQ 163



 Score = 28.4 bits (64), Expect = 4.3
 Identities = 19/118 (16%), Positives = 46/118 (38%), Gaps = 17/118 (14%)

Query: 204 TLKTRLNELEEERQCFKMVNRLQEENKKLSKRLEHEGSSPMKSQQL--SPEIDISALQKL 261
            L+  L EL+EE         LQE+  +L +  +         +      + +++ +++L
Sbjct: 70  ELQQELAELQEEL------AELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQL 123

Query: 262 RGQVDSLRDQ-MRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQVRGLI 318
                 L ++   L+E+        +  LK++ + L +    L+  ++       G +
Sbjct: 124 SANAIELDEENRELREE--------LAELKQENEALEAENERLQENEQRRWFLYGGGV 173



 Score = 28.4 bits (64), Expect = 5.3
 Identities = 20/87 (22%), Positives = 38/87 (43%), Gaps = 8/87 (9%)

Query: 76  EHEKLEREVEEIEESWKEETQQLVEMVNR---LQEENKKLSKRLEHEGSSPMKSQQLSPE 132
           E  +L+ E+ E++E   E  Q+  E+      L+ E ++L K L         + +L  E
Sbjct: 74  ELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIELDEE 133

Query: 133 IDISVLQKLRGQVDSLRDQMRLKEKEL 159
                 ++LR ++  L+ +    E E 
Sbjct: 134 -----NRELREELAELKQENEALEAEN 155


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 35.8 bits (83), Expect = 0.034
 Identities = 25/97 (25%), Positives = 50/97 (51%), Gaps = 18/97 (18%)

Query: 213 EEERQCFKMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISALQKLRGQVDSLRDQM 272
           +  ++  + V RL+EEN +L + LE          +L  EI     +KL  +++  R ++
Sbjct: 422 KRIKKLEETVERLEEENSELKRELE----------ELKREI-----EKLESELERFRREV 466

Query: 273 R---LKEKELGYRNSDVDNLKRQIDRLTSTVRELRRK 306
           R    K++E+  R+  ++ L+++++     V EL RK
Sbjct: 467 RDKVRKDREIRARDRRIERLEKELEEKKKRVEELERK 503



 Score = 31.2 bits (71), Expect = 0.99
 Identities = 25/98 (25%), Positives = 47/98 (47%), Gaps = 22/98 (22%)

Query: 68  LGDHISKLEHEKLERE-------VEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEHEG 120
           L + +SK++ E+  RE              +++  ++L E V RL+EEN +L + LE   
Sbjct: 390 LAEALSKVKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELE--- 446

Query: 121 SSPMKSQQLSPEIDISVLQKLRGQVDSLRDQMRLKEKE 158
                  +L  EI     +KL  +++  R ++R K ++
Sbjct: 447 -------ELKREI-----EKLESELERFRREVRDKVRK 472


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 35.7 bits (83), Expect = 0.047
 Identities = 52/254 (20%), Positives = 97/254 (38%), Gaps = 40/254 (15%)

Query: 2   DDEYSSTLTVIDVFDIASEIGKEFEKIIDQYGPDAVTSIMPKVIVALEHLECLASKNERE 61
                  L  +    +        E + ++   + +  +   +    E L+ L S   + 
Sbjct: 15  KSYKDEVLEALHELGVV-----HIEDLKEELSNERLRKLRSLLTKLSEALDKLRSYLPKL 69

Query: 62  NTKLQELGDHISKLEHEKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEHEGS 121
           N  L+E    +S    E+L ++VEE  E  ++E ++L E ++ L+ E K+L + +E    
Sbjct: 70  NP-LREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISELENEIKELEQEIE---- 124

Query: 122 SPMKSQQLSP----EIDISVLQKLR------GQV--DSLRDQMRLKEKELGYRNSDVDNS 169
                 +L P    ++D+S+L   +      G V  D L +     + E     S     
Sbjct: 125 ------RLEPWGNFDLDLSLLLGFKYVSVFVGTVPEDKLEELKLESDVENVEYISTDKGY 178

Query: 170 DI------PDLSNKAVYDLDDPNRPRFTTAELKEILHDRNTLKTRLNELEEERQCFKMVN 223
                    +LS++   +L      R    E          +K  L E+E+ER+      
Sbjct: 179 VYVVVVVLKELSDEVEEELKKLGFERLELEEEGTPSELIREIKEELEEIEKERE------ 232

Query: 224 RLQEENKKLSKRLE 237
            L EE K+L+K+  
Sbjct: 233 SLLEELKELAKKYL 246


>gnl|CDD|234701 PRK00252, alaS, alanyl-tRNA synthetase; Reviewed.
          Length = 865

 Score = 34.7 bits (81), Expect = 0.084
 Identities = 22/94 (23%), Positives = 37/94 (39%), Gaps = 25/94 (26%)

Query: 79  KLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEID---- 134
           + E  ++E+    K +  +L E V  L EE K+L K LE      +K++  +        
Sbjct: 698 EQEALLKELAALLKAKPSELPERVEALLEELKELEKELEQ-----LKAKLAAAAAGDLLA 752

Query: 135 -------ISVLQK---------LRGQVDSLRDQM 152
                  + VL           LR   D L+D++
Sbjct: 753 QAKEVNGVKVLAAQVEGVDAKALRTLADDLKDKL 786



 Score = 29.3 bits (67), Expect = 4.2
 Identities = 25/100 (25%), Positives = 36/100 (36%), Gaps = 34/100 (34%)

Query: 193 AELKEILHDRNTLKTRLNELEEERQCFKMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPE 252
           A LKE+      LK + +EL E       V  L EE K+L K LE      +K++  +  
Sbjct: 701 ALLKEL---AALLKAKPSELPER------VEALLEELKELEKELEQ-----LKAKLAAAA 746

Query: 253 IDISALQK--------------------LRGQVDSLRDQM 272
                 Q                     LR   D L+D++
Sbjct: 747 AGDLLAQAKEVNGVKVLAAQVEGVDAKALRTLADDLKDKL 786


>gnl|CDD|217933 pfam04156, IncA, IncA protein.  Chlamydia trachomatis is an
           obligate intracellular bacterium that develops within a
           parasitophorous vacuole termed an inclusion. The
           inclusion is non-fusogenic with lysosomes but intercepts
           lipids from a host cell exocytic pathway. Initiation of
           chlamydial development is concurrent with modification
           of the inclusion membrane by a set of C.
           trachomatis-encoded proteins collectively designated
           Incs. One of these Incs, IncA, is functionally
           associated with the homotypic fusion of inclusions. This
           family probably includes members of the wider Inc family
           rather than just IncA.
          Length = 186

 Score = 33.2 bits (76), Expect = 0.11
 Identities = 20/85 (23%), Positives = 46/85 (54%)

Query: 258 LQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQVRGL 317
           + +L  +++ L++ ++L  + L      +++L+  I  L   +RELR+  R    ++R  
Sbjct: 102 IAELESELEDLKEDLQLLRELLKSLEERLESLEESIKELAKELRELRQDLREEVEELREE 161

Query: 318 IDERADFLAALQDHQREVAALKQQL 342
           ++   + L  LQ+  +E+ +L +QL
Sbjct: 162 LERLQENLQRLQEAIQELQSLLEQL 186



 Score = 29.7 bits (67), Expect = 2.0
 Identities = 16/96 (16%), Positives = 39/96 (40%)

Query: 275 KEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQVRGLIDERADFLAALQDHQRE 334
           + ++L     ++  LK+Q+  L   + +L  +    + ++  L ++       L+  +  
Sbjct: 70  RPQKLEELQGELSELKQQLSELQEELEDLEERIAELESELEDLKEDLQLLRELLKSLEER 129

Query: 335 VAALKQQLGLAEKENEDLYNSQETTSEEERPVQGPL 370
           + +L++ +    KE  +L        EE R     L
Sbjct: 130 LESLEESIKELAKELRELRQDLREEVEELREELERL 165


>gnl|CDD|237551 PRK13902, alaS, alanyl-tRNA synthetase; Provisional.
          Length = 900

 Score = 34.4 bits (80), Expect = 0.12
 Identities = 21/82 (25%), Positives = 37/82 (45%), Gaps = 7/82 (8%)

Query: 79  KLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEH---EGSSPMKSQQLSPEIDI 135
           +LE  ++E  E      +QL + V R  EE K+  K +E    E +  + S+ LS   ++
Sbjct: 740 ELEDLLKEAAEILGVPPEQLPKTVERFFEEWKEQKKEIEKLRKELAELLASELLSKAEEV 799

Query: 136 S----VLQKLRGQVDSLRDQMR 153
                V++++    D LR    
Sbjct: 800 GGVKVVVEEVDADEDELRKIAN 821


>gnl|CDD|148779 pfam07362, CcdA, Post-segregation antitoxin CcdA.  This family
           consists of several Enterobacterial post-segregation
           antitoxin CcdA proteins. The F plasmid-carried bacterial
           toxin, the CcdB protein, is known to act on DNA gyrase
           in two different ways. CcdB poisons the gyrase-DNA
           complex, blocking the passage of polymerases and leading
           to double-strand breakage of the DNA. Alternatively, in
           cells that overexpress CcdB, the A subunit of DNA gyrase
           (GyrA) has been found as an inactive complex with CcdB.
           Both poisoning and inactivation can be prevented and
           reversed in the presence of the F plasmid-encoded
           antidote, the CcdA protein.
          Length = 72

 Score = 31.1 bits (71), Expect = 0.14
 Identities = 16/47 (34%), Positives = 25/47 (53%), Gaps = 1/47 (2%)

Query: 64  KLQELGDHISKLEHEKLEREVEEIE-ESWKEETQQLVEMVNRLQEEN 109
           + + LG ++S      L  E+   + E WKEE ++ +E  NR  EEN
Sbjct: 16  QAKALGINLSATLETALREELRREKAERWKEENREAIEEYNRFIEEN 62


>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein,
           HlyD family.  Type I secretion is an ABC transport
           process that exports proteins, without cleavage of any
           signal sequence, from the cytosol to extracellular
           medium across both inner and outer membranes. The
           secretion signal is found in the C-terminus of the
           transported protein. This model represents the adaptor
           protein between the ATP-binding cassette (ABC) protein
           of the inner membrane and the outer membrane protein,
           and is called the membrane fusion protein. This model
           selects a subfamily closely related to HlyD; it is
           defined narrowly and excludes, for example, colicin V
           secretion protein CvaA and multidrug efflux proteins
           [Protein fate, Protein and peptide secretion and
           trafficking].
          Length = 423

 Score = 33.4 bits (77), Expect = 0.18
 Identities = 23/90 (25%), Positives = 49/90 (54%), Gaps = 3/90 (3%)

Query: 256 SALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTV---RELRRKQRLSQV 312
           S    LR Q++ +  Q++  E EL    + +  L++Q++ ++  +   R+L+ K  +S++
Sbjct: 137 SRKSTLRAQLELILAQIKQLEAELAGLQAQLQALRQQLEVISEELEARRKLKEKGLVSRL 196

Query: 313 QVRGLIDERADFLAALQDHQREVAALKQQL 342
           ++  L  ERA+    L   + E+  LK+Q+
Sbjct: 197 ELLELERERAEAQGELGRLEAELEVLKRQI 226



 Score = 32.3 bits (74), Expect = 0.42
 Identities = 23/98 (23%), Positives = 48/98 (48%), Gaps = 11/98 (11%)

Query: 256 SALQKLRGQVDSLRDQMR----LKEKELGYRNS------DVDNLKRQIDRLTSTVRELRR 305
           + LQ LR Q++ + +++     LKEK L  R        +    + ++ RL + +  L+R
Sbjct: 165 AQLQALRQQLEVISEELEARRKLKEKGLVSRLELLELERERAEAQGELGRLEAELEVLKR 224

Query: 306 KQRLSQVQVRGLIDE-RADFLAALQDHQREVAALKQQL 342
           +    Q++ + +    R + L  L + Q  +A L+++L
Sbjct: 225 QIDELQLERQQIEQTFREEVLEELTEAQARLAELRERL 262


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 33.6 bits (77), Expect = 0.18
 Identities = 64/323 (19%), Positives = 144/323 (44%), Gaps = 43/323 (13%)

Query: 40  IMPKVIVALEHLECLASKNERENTKLQELGDHISKLEH--EKLEREVEEIEESWKEETQQ 97
           +  ++  ALE L+ L    +    +L EL   + +++   E+LE+E+EE+E   +E  ++
Sbjct: 373 LEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELERELEELEEE 432

Query: 98  ---LVEMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISVLQKLRGQVDSLRDQMRL 154
              L E +N+L+ +   +++        P+  Q+L  E +  +L+    +++ L +++  
Sbjct: 433 IKKLEEQINQLESKELMIAELAGAGEKCPVCGQELPEEHEKELLELYELELEELEEELSR 492

Query: 155 KEKELGYRNSDVDNSDIPDLSNKAVYDLDDPNRPRFTTAELKEILHDRNTLKTRLNELEE 214
           +++E   R                               E++E+  +   L+  L EL E
Sbjct: 493 EKEEAELRE------------------------------EIEELEKELRELEEELIELLE 522

Query: 215 ERQCFKMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISALQKLRGQVDSLRDQMRL 274
             +   +   L+E+ +KL   LE       K Q    + ++  L+    ++  L +++RL
Sbjct: 523 LEE--ALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRL 580

Query: 275 KEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQVRGLIDERADFLAALQDHQRE 334
                  R  +++ L+ ++  L   ++EL  ++RLSQ++      E ++    L++ + E
Sbjct: 581 LRT----RKEELEELRERLKELKKKLKEL--EERLSQLEELLQSLELSEAENELEEAEEE 634

Query: 335 VAALKQQLGLAEKENEDLYNSQE 357
           + +  ++L L  +  E L  + E
Sbjct: 635 LESELEKLNLQAELEELLQAALE 657



 Score = 31.3 bits (71), Expect = 1.0
 Identities = 63/306 (20%), Positives = 138/306 (45%), Gaps = 34/306 (11%)

Query: 49  EHLECLASKNERENTKLQELGDHISKLEHEKLEREVEEIEE--SWKEETQQLVEMVNRLQ 106
             +  LA   E+     QEL +   K   E  E E+EE+EE  S ++E  +L E +  L+
Sbjct: 448 LMIAELAGAGEKCPVCGQELPEEHEKELLELYELELEELEEELSREKEEAELREEIEELE 507

Query: 107 EENKKLSKRLEHEGSSPMKSQQLSPEIDISVLQKLRGQVDSLRDQMRLKEKELGYRNSDV 166
           +E ++L + L        ++ +   E  +  L+ L  +++ L+++++L++ +   R  + 
Sbjct: 508 KELRELEEELIELLE-LEEALKEELEEKLEKLENLLEELEELKEKLQLQQLKEELRQLED 566

Query: 167 DNSDIPDLSNKAVYDLDDPNRPRFTTAELKEILHDRNTLKTRLNELEEERQCFKMVNRLQ 226
                                      ELKE+L +   L+TR  ELEE R+  K + +  
Sbjct: 567 RLQ------------------------ELKELLEELRLLRTRKEELEELRERLKELKKKL 602

Query: 227 EENKKLSKRLEHEGSSPMKSQQLSPEIDISALQKLRGQVDSLRDQMRLKEKELGYRNSDV 286
           +E ++   +LE         Q L      + L++   +++S  +++ L+ +      + +
Sbjct: 603 KELEERLSQLEEL------LQSLELSEAENELEEAEEELESELEKLNLQAELEELLQAAL 656

Query: 287 DNLKRQIDRLTSTVRELRRKQRLSQVQVRGLIDERADFLAALQDHQREVAALKQQLGLAE 346
           + L+ +++ L + +R    ++  ++ Q+   ++E       L+  + E+  L ++LG  E
Sbjct: 657 EELEEKVEELEAEIRRE-LQRIENEEQLEEKLEELEQLEEELEQLREELEELLKKLGEIE 715

Query: 347 KENEDL 352
           +  E+L
Sbjct: 716 QLIEEL 721



 Score = 29.3 bits (66), Expect = 4.7
 Identities = 58/342 (16%), Positives = 137/342 (40%), Gaps = 20/342 (5%)

Query: 23  KEFEKIIDQYGPDAVTSIMPKVIVALEHLECLASKNERENTKLQELGDHISKLEHEKLER 82
           +++EK+ +    + +     K+      L  L    E     L+E    + KLE  + E+
Sbjct: 167 EKYEKLSELLK-EVIKEAKAKIEELEGQLSELLEDIEDLLEALEEELKELKKLEEIQEEQ 225

Query: 83  EVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISVLQKLR 142
           E EE+E+  +   ++L E+    +   +  ++ LE E       +    E ++  L++L 
Sbjct: 226 EEEELEQEIEALEERLAELEEEKERLEELKARLLEIESLELEALKIR--EEELRELERLL 283

Query: 143 GQVDSLRDQMRLKEKELGYRNSDVDNSDIPDLSNKAVYDLDDPNRPRFTTAELKEILHDR 202
            +++   +++   E+E+     +++         + + +           +  + +    
Sbjct: 284 EELEEKIERLEELEREIEELEEELEGLRALLEELEELLEK--------LKSLEERLEKLE 335

Query: 203 NTLKTRLNELEEERQCFKMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISALQKLR 262
             L+   +ELEE  +    + +L EE  K  +    E    ++      +    A+Q+L+
Sbjct: 336 EKLEKLESELEELAEEKNELAKLLEERLKELEERLEELEKELEKALERLKQLEEAIQELK 395

Query: 263 GQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQVRGLIDERA 322
            ++  L   +   ++EL     +++ L+R+++ L   +++L  +    +         + 
Sbjct: 396 EELAELSAALEEIQEELEELEKELEELERELEELEEEIKKLEEQINQLE--------SKE 447

Query: 323 DFLAALQDHQREVAALKQQL-GLAEKENEDLYNSQETTSEEE 363
             +A L     +     Q+L    EKE  +LY  +    EEE
Sbjct: 448 LMIAELAGAGEKCPVCGQELPEEHEKELLELYELELEELEEE 489


>gnl|CDD|216831 pfam01988, VIT1, VIT family.  This family includes the vacuolar
           Fe2+/Mn2+ uptake transporter, Ccc1 and the vacuolar iron
           transporter VIT1.
          Length = 209

 Score = 33.0 bits (76), Expect = 0.19
 Identities = 19/68 (27%), Positives = 31/68 (45%), Gaps = 18/68 (26%)

Query: 52  ECLASKNERENTKLQELGDHISKLEHEKLEREVEEIEESWKEETQQLVEMVNRLQEENKK 111
           E L+ K+ER+             LE  +LERE  E+E   + E ++L E+        + 
Sbjct: 52  EYLSVKSERD-------------LEEAELEREKRELENDPEGEREELAEI-----YRARG 93

Query: 112 LSKRLEHE 119
           LS+ L  +
Sbjct: 94  LSEELARQ 101


>gnl|CDD|234252 TIGR03545, TIGR03545, TIGR03545 family protein.  This model
           represents a relatively rare but broadly distributed
           uncharacterized protein family, distributed in 1-2
           percent of bacterial genomes, all of which have outer
           membranes. In many of these genomes, it is part of a
           two-gene pair.
          Length = 555

 Score = 33.2 bits (76), Expect = 0.25
 Identities = 14/77 (18%), Positives = 40/77 (51%), Gaps = 3/77 (3%)

Query: 78  EKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEID--I 135
           E++E+ ++ +++ WK+  + L      L+E  K+L    + +  +P++ Q++  E D   
Sbjct: 167 EEIEKSLKAMQQKWKKRKKDL-PNKQDLEEYKKRLEAIKKKDIKNPLELQKIKEEFDKLK 225

Query: 136 SVLQKLRGQVDSLRDQM 152
              +  + ++ S ++ +
Sbjct: 226 KEGKADKQKIKSAKNDL 242


>gnl|CDD|114908 pfam06216, RTBV_P46, Rice tungro bacilliform virus P46 protein.
           This family consists of several Rice tungro bacilliform
           virus P46 proteins. The function of this family is
           unknown.
          Length = 392

 Score = 32.7 bits (74), Expect = 0.33
 Identities = 20/69 (28%), Positives = 36/69 (52%), Gaps = 9/69 (13%)

Query: 48  LEHLECLASKNERENTKLQELGDHISKLEHEKLEREVEEIEESWKEETQQLVEMVNRLQE 107
           ++HL   +S N     KL  + DHI   E E          ES+K E ++L + V+ L++
Sbjct: 42  IDHLFTFSSDNTERVRKLHIIADHIYLKEFEI---------ESYKLECEKLNDQVSHLKK 92

Query: 108 ENKKLSKRL 116
           +N++L  ++
Sbjct: 93  QNEELKAQI 101


>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
          Length = 211

 Score = 31.6 bits (72), Expect = 0.53
 Identities = 22/70 (31%), Positives = 39/70 (55%), Gaps = 4/70 (5%)

Query: 48  LEHLECLASKNERENTKLQELGDHISKLEHEKLEREVEEIEESWKEETQQLVEMVNRLQE 107
           +E   C  ++N+ E+   +E  +     E  + E  +E+ EES + + ++L +  N+L+E
Sbjct: 13  MEEDCCKENENKEEDKGKEEDLE----FEEIEKEEIIEDSEESNEVKIEELKDENNKLKE 68

Query: 108 ENKKLSKRLE 117
           ENKKL   LE
Sbjct: 69  ENKKLENELE 78


>gnl|CDD|226291 COG3768, COG3768, Predicted membrane protein [Function unknown].
          Length = 350

 Score = 31.6 bits (72), Expect = 0.63
 Identities = 28/90 (31%), Positives = 39/90 (43%), Gaps = 20/90 (22%)

Query: 296 LTSTVRELRRKQRLSQVQVRGLIDERADFLAALQDH-----QREVAAL-KQQLGLAE--- 346
           + S VRE RR  RL Q Q       R D   AL  H     ++ V  L  QQ G+ E   
Sbjct: 113 VGSVVREWRRLVRLRQRQ-----HLRDDARDALHSHGVKAGRKAVCELLAQQAGIPETAP 167

Query: 347 -----KENE-DLYNSQETTSEEERPVQGPL 370
                +E+  ++ +++E     ER V  PL
Sbjct: 168 GRQRWQESLHEIIDARELVELAEREVLPPL 197


>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD.  This
           model describes the DndB protein encoded by an operon
           associated with a sulfur-containing modification to DNA.
           The operon is sporadically distributed in bacteria, much
           like some restriction enzyme operons. DndD is described
           as a putative ATPase. The small number of examples known
           so far include species from among the Firmicutes,
           Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
           metabolism, Restriction/modification].
          Length = 650

 Score = 32.0 bits (73), Expect = 0.64
 Identities = 52/283 (18%), Positives = 109/283 (38%), Gaps = 36/283 (12%)

Query: 51  LECLASKNERENTKLQELGDHISKLEHEKLEREVEEIEESWKEETQQLVEMVNRL----- 105
           LE      E    K +  G  +   E E+LER+++EIE + K    QL E+         
Sbjct: 239 LEEAQRSLESLEKKFRSEGGDL-FEEREQLERQLKEIEAARKANRAQLRELAADPLPLLL 297

Query: 106 ---QEENKKLSKRLEHEGSSPMKSQQLSPEIDISVLQKLRG------QVDSLRDQM---- 152
                ++ K   + E +      +Q+   E D  +L+ L         V  +  ++    
Sbjct: 298 IPNLLDSTKAQLQKEEQSQQNQLTQEELEERDKELLESLPKLALPAEHVKEIAAELAEID 357

Query: 153 --RLKEKELGYRNSDVDNSDIPDLSNKAVYDLDDPNRPRFTTAELKEILHDRNTLKTRLN 210
                + E+ +R S  + + +  L  +   +L D         EL+E+  +   +  +++
Sbjct: 358 KPATTDSEIPHRLSGSELTQLEVLIQQVKRELQD--AKSQLLKELRELEEELAEVDKKIS 415

Query: 211 ELEEERQCFKMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISALQKLRGQVDSLRD 270
            +  E Q  +++  L E   +L +             +   E  +  L+ L+  +++LR 
Sbjct: 416 TIPSEEQIAQLLEELGEAQNELFRS------------EAEIEELLRQLETLKEAIEALRK 463

Query: 271 QMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQ 313
            +  K K+      +++      D+   T++E R K    ++Q
Sbjct: 464 TLDEKTKQKI-NAFELERAITIADKAKKTLKEFREKLLERKLQ 505


>gnl|CDD|113356 pfam04582, Reo_sigmaC, Reovirus sigma C capsid protein. 
          Length = 326

 Score = 31.4 bits (71), Expect = 0.68
 Identities = 18/83 (21%), Positives = 35/83 (42%)

Query: 256 SALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQVR 315
           SA   L   V ++  ++     +L    + +D+ K  ++ L +TV+ L+        ++ 
Sbjct: 42  SATASLSESVSTISSRLSDLSADLQDMTTSLDDAKATLNGLITTVQALQTSVDSLSSELA 101

Query: 316 GLIDERADFLAALQDHQREVAAL 338
            L    +   AA+ D Q  V A 
Sbjct: 102 DLSLTVSAHDAAISDLQTTVHAN 124


>gnl|CDD|179699 PRK03992, PRK03992, proteasome-activating nucleotidase;
           Provisional.
          Length = 389

 Score = 31.7 bits (73), Expect = 0.68
 Identities = 13/48 (27%), Positives = 28/48 (58%)

Query: 258 LQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRR 305
           L+ L  +   L +Q+R  E +L    ++ + L+R+++RL S + +L+ 
Sbjct: 3   LEALEERNSELEEQIRQLELKLRDLEAENEKLERELERLKSELEKLKS 50



 Score = 29.4 bits (67), Expect = 3.7
 Identities = 15/63 (23%), Positives = 29/63 (46%), Gaps = 13/63 (20%)

Query: 64  KLQELGDHISKLEH--EKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEHEGS 121
           +L+ L +  S+LE    +LE ++ ++E               +L+ E ++L   LE   S
Sbjct: 2   RLEALEERNSELEEQIRQLELKLRDLEA-----------ENEKLERELERLKSELEKLKS 50

Query: 122 SPM 124
            P+
Sbjct: 51  PPL 53


>gnl|CDD|235449 PRK05415, PRK05415, hypothetical protein; Provisional.
          Length = 341

 Score = 31.4 bits (72), Expect = 0.83
 Identities = 16/56 (28%), Positives = 21/56 (37%), Gaps = 10/56 (17%)

Query: 296 LTSTVRELRRKQRLSQVQVRGLIDERADFLAALQDHQRE-----VAALKQQLGLAE 346
           L   VRE RR +RL Q         R +  A L  H           L +Q G+ +
Sbjct: 115 LGIVVREWRRLRRLRQRA-----HLRDEARALLHSHDVGEARAFCEKLAKQAGIPQ 165


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 31.6 bits (72), Expect = 0.91
 Identities = 53/253 (20%), Positives = 112/253 (44%), Gaps = 32/253 (12%)

Query: 57  KNERENTKLQELGDHISKLEHEKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRL 116
           K E E  KL+EL + + +LE +  +  +EE+E+   EE ++L E + +L+ E K L K L
Sbjct: 490 KKESELIKLKELAEQLKELEEKLKKYNLEELEKKA-EEYEKLKEKLIKLKGEIKSLKKEL 548

Query: 117 EHEGSSPMKSQQLSPEIDISVLQKLRGQVDSLRDQMRLKEKELGYRNSDVDNSDIPDLSN 176
           E       K  +L  ++D         +++    ++  + +ELG+ + +     + +L  
Sbjct: 549 EKLEELKKKLAELEKKLD---------ELEEELAELLKELEELGFESVEELEERLKEL-- 597

Query: 177 KAVYDLDDPNRPRFTTAELKEILHDRNTLKTRLNELEEE-RQCFKMVNRLQEENKKLSKR 235
                  +P    +   ELK+   +    +  L +LEEE  + F+ +   ++  ++L K 
Sbjct: 598 -------EPFYNEYL--ELKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKE 648

Query: 236 LEHEGSSPMKSQQLSPEIDISALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDR 295
           LE          +L  +      ++LR +   L  ++     EL       + +K+ +++
Sbjct: 649 LE----------ELEKKYSEEEYEELREEYLELSRELAGLRAELEELEKRREEIKKTLEK 698

Query: 296 LTSTVRELRRKQR 308
           L   + E  + ++
Sbjct: 699 LKEELEEREKAKK 711



 Score = 30.0 bits (68), Expect = 2.7
 Identities = 51/227 (22%), Positives = 103/227 (45%), Gaps = 36/227 (15%)

Query: 13  DVFDIASEIGKEFEKIIDQYGPDAVTSIMPKVIVALEHLECLASKNERENTKLQELGDHI 72
           ++ ++  E  KE E+++ +     +  I  ++    E LE L    E+E  +L+EL + I
Sbjct: 190 NIEELIKEKEKELEEVLRE-----INEISSELPELREELEKL----EKEVKELEELKEEI 240

Query: 73  SKLEHEKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPE 132
            +LE E    E  +     +E+ ++L E +  L++E ++L ++++       + ++L  +
Sbjct: 241 EELEKELESLEGSK--RKLEEKIRELEERIEELKKEIEELEEKVK-------ELKELKEK 291

Query: 133 IDISVLQKLRGQVDSLRDQMRLKEKELGYRNSDVDN--SDIPDLSNKAVYDLDDPNRPRF 190
            +  +  KL    +   D++R  EK L     +++     I +L  K             
Sbjct: 292 AEEYI--KLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEK------------- 336

Query: 191 TTAELKEILHDRNTLKTRLNELEEERQCFKMVNRLQEENKKLSKRLE 237
               L+E+      L+ RL ELEE  + ++     +EE ++L KRL 
Sbjct: 337 -EERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLT 382


>gnl|CDD|183876 PRK13169, PRK13169, DNA replication intiation control protein YabA;
           Reviewed.
          Length = 110

 Score = 29.4 bits (67), Expect = 0.91
 Identities = 19/71 (26%), Positives = 33/71 (46%), Gaps = 10/71 (14%)

Query: 67  ELGDHISKLEH--EKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEH----EG 120
           E+ D +  LE     L +E+  +    K++  +L+E    L+ EN KL +RLE     E 
Sbjct: 5   EIFDALDDLEQNLGVLLKELGAL----KKQLAELLEENTALRLENDKLRERLEELEAEEP 60

Query: 121 SSPMKSQQLSP 131
           +   K ++   
Sbjct: 61  AKEKKKKEGEG 71


>gnl|CDD|224643 COG1729, COG1729, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 262

 Score = 30.9 bits (70), Expect = 0.91
 Identities = 21/102 (20%), Positives = 39/102 (38%), Gaps = 2/102 (1%)

Query: 246 SQQLSPEIDISALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRR 305
           +   S      A         S+ D++   E+     +  +  L++Q+ +L   + ELR 
Sbjct: 19  AAPASAPAISGAPLSS-AGNGSVEDRVTPLERVQNAHSYRLTQLEQQLRQLQGKIEELRG 77

Query: 306 KQRLSQVQVRGLIDERADFLAALQDHQREVAALKQQLGLAEK 347
            Q L Q Q    ++ + +  A L   +    AL Q +G    
Sbjct: 78  IQEL-QYQNNQNVERQEENEARLDSLESGRQALAQGIGDQSG 118


>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342).  This
           family of bacterial proteins has no known function. The
           proteins are in the region of 500-600 amino acid
           residues in length.
          Length = 450

 Score = 31.1 bits (71), Expect = 0.97
 Identities = 27/96 (28%), Positives = 46/96 (47%), Gaps = 9/96 (9%)

Query: 197 EILHDRNTLKTRLNELEEERQCFKMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDIS 256
           E+  D   LK  L ELEEE      +  L+EE +K+ K L+     P K++   P     
Sbjct: 324 EVGVDFPELKEELKELEEE------LKELEEELEKIKKLLK---KLPKKARGQLPPEKRE 374

Query: 257 ALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQ 292
            L+KL    + L +++   E+EL     ++++L  +
Sbjct: 375 QLEKLLETKEKLSEELEELEEELKELKEELESLYSE 410



 Score = 30.7 bits (70), Expect = 1.3
 Identities = 31/121 (25%), Positives = 55/121 (45%), Gaps = 11/121 (9%)

Query: 42  PKVIVALEHLECLASKNERENTKLQELGDHISKLEHEKLEREVEEIEESWKEETQQLVEM 101
           P++   L+ LE    + E E  K+++L   + K    +L  E  E  E   E  ++L E 
Sbjct: 330 PELKEELKELEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETKEKLSEE 389

Query: 102 VNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISVLQKLRGQVD-SLRDQMRLKEKELG 160
           +  L+EE K+L + LE           L  E  ISV + +   V   + +++   ++E+G
Sbjct: 390 LEELEEELKELKEELE----------SLYSEGKISVNKTIYPGVKIHIGNKVLRIKREIG 439

Query: 161 Y 161
            
Sbjct: 440 P 440


>gnl|CDD|234342 TIGR03752, conj_TIGR03752, integrating conjugative element protein,
           PFL_4705 family.  Members of this protein family are
           found occasionally on plasmids such as the Pseudomonas
           putida toluene catabolic TOL plasmid pWWO_p085. Usually,
           however, they are found on the bacterial main chromosome
           in regions flanked by markers of conjugative transfer
           and/or transposition [Mobile and extrachromosomal
           element functions, Plasmid functions].
          Length = 472

 Score = 31.2 bits (71), Expect = 1.0
 Identities = 22/81 (27%), Positives = 39/81 (48%), Gaps = 8/81 (9%)

Query: 192 TAELKEILHDRNTLKTRLNELEEERQCFKMVN-RLQEENKKLSKRLEHEGSSPMKSQQLS 250
            AE+KE       L+ RL +L  E +  K  N RLQ+  + + ++++    S  +     
Sbjct: 65  VAEVKE-------LRKRLAKLISENEALKAENERLQKREQSIDQQIQQAVQSETQELTKE 117

Query: 251 PEIDISALQKLRGQVDSLRDQ 271
            E   S  Q+L+G +D L+ +
Sbjct: 118 IEQLKSERQQLQGLIDQLQRR 138



 Score = 28.8 bits (65), Expect = 5.4
 Identities = 21/91 (23%), Positives = 37/91 (40%), Gaps = 5/91 (5%)

Query: 82  REVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISVLQKL 141
           +E+ +       E + L     RLQ+  + + ++++    S  +      E   S  Q+L
Sbjct: 69  KELRKRLAKLISENEALKAENERLQKREQSIDQQIQQAVQSETQELTKEIEQLKSERQQL 128

Query: 142 RGQVDSLRDQMRLKEKELGYRNSDVDNSDIP 172
           +G +D L  Q RL     G        SD+P
Sbjct: 129 QGLIDQL--QRRLAGVLTGPSGG---GSDLP 154


>gnl|CDD|218915 pfam06156, DUF972, Protein of unknown function (DUF972).  This
           family consists of several hypothetical bacterial
           sequences. The function of this family is unknown.
          Length = 106

 Score = 29.6 bits (67), Expect = 1.0
 Identities = 19/54 (35%), Positives = 34/54 (62%), Gaps = 2/54 (3%)

Query: 66  QELGDHISKLEHEKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEHE 119
           +EL D +S+LE ++L + + E+ E  K++  +L+E    L+ EN+ L +RLE  
Sbjct: 4   KELFDALSELE-QQLNQLLAELGE-LKQQLAELLEENAELRIENEHLRERLEEL 55


>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family.  This
           family consists of the 116kDa V-type ATPase (vacuolar
           (H+)-ATPases) subunits, as well as V-type ATP synthase
           subunit i. The V-type ATPases family are proton pumps
           that acidify intracellular compartments in eukaryotic
           cells for example yeast central vacuoles,
           clathrin-coated and synaptic vesicles. They have
           important roles in membrane trafficking processes. The
           116kDa subunit (subunit a) in the V-type ATPase is part
           of the V0 functional domain responsible for proton
           transport. The a subunit is a transmembrane glycoprotein
           with multiple putative transmembrane helices it has a
           hydrophilic amino terminal and a hydrophobic carboxy
           terminal. It has roles in proton transport and assembly
           of the V-type ATPase complex. This subunit is encoded by
           two homologous gene in yeast VPH1 and STV1.
          Length = 707

 Score = 31.2 bits (71), Expect = 1.1
 Identities = 23/77 (29%), Positives = 29/77 (37%), Gaps = 10/77 (12%)

Query: 51  LECLASKNERENTKLQELGDHISKLEHEKLE----------REVEEIEESWKEETQQLVE 100
           LE    K            D     E   LE          +EVEE  ES ++E  +L E
Sbjct: 43  LESKIKKLGIPLKDTGGKPDVPPSKEFLDLEEEILDLEAEIKEVEENLESLEKEINELEE 102

Query: 101 MVNRLQEENKKLSKRLE 117
            +N L EE   L + LE
Sbjct: 103 WLNVLDEEKSFLDENLE 119



 Score = 30.4 bits (69), Expect = 1.8
 Identities = 24/141 (17%), Positives = 51/141 (36%), Gaps = 16/141 (11%)

Query: 197 EILHDRNTLKTR-LNELEEERQCFKMVN---RLQEENKKLSKR---LEHEGSSPMKSQQL 249
           + L +   ++ R LNE     Q  K VN   R  E  +KL K    ++  G     +   
Sbjct: 2   DALGELGLVQIRDLNEDVSAFQ-RKFVNEVRRCDEVERKLRKLESKIKKLGIPLKDTGGK 60

Query: 250 SPEIDISALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRL 309
                      L  ++  L       E E+     ++++L+++I+ L   +  L  ++  
Sbjct: 61  PDVPPSKEFLDLEEEILDL-------EAEIKEVEENLESLEKEINELEEWLNVLDEEKSF 113

Query: 310 SQVQVRGLIDERADFLAALQD 330
               +   + E ++     + 
Sbjct: 114 LDENLEE-LSELSNLDIDFKY 133



 Score = 28.5 bits (64), Expect = 6.8
 Identities = 29/115 (25%), Positives = 43/115 (37%), Gaps = 16/115 (13%)

Query: 130 SPEIDISVLQKLRGQVDSLRDQMRLKEKELGYRNSDVDNSDIPDLSNKAVYDLDDPN--- 186
               D+S  Q  R  V+ +R      E++L    S +    IP        D+       
Sbjct: 14  DLNEDVSAFQ--RKFVNEVRR-CDEVERKLRKLESKIKKLGIPLKDTGGKPDVPPSKEFL 70

Query: 187 --RPRFTT--AELKEILHDRNTLKTRLNELEEERQCFKMVNRLQEENKKLSKRLE 237
                     AE+KE+  +  +L+  +NELEE        N L EE   L + LE
Sbjct: 71  DLEEEILDLEAEIKEVEENLESLEKEINELEEWL------NVLDEEKSFLDENLE 119


>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family.  This family of proteins includes
           MND1 from S. cerevisiae. The mnd1 protein forms a
           complex with hop2 to promote homologous chromosome
           pairing and meiotic double-strand break repair.
          Length = 188

 Score = 30.3 bits (69), Expect = 1.3
 Identities = 21/87 (24%), Positives = 41/87 (47%), Gaps = 13/87 (14%)

Query: 74  KLEHEKLEREVEEIEESWKEETQQLVEM---------VNRLQEENKKLSKRLEHEGSSPM 124
           K   EKL++E+EE+++   E   Q+ ++            L EE K+L K L+   +   
Sbjct: 68  KTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLEELKQLEKELKKLKAELE 127

Query: 125 KSQQLSPEIDISVLQKLRGQVDSLRDQ 151
           K ++  PE     ++KL+ +    ++ 
Sbjct: 128 KYEKNDPE----RIEKLKEETKVAKEA 150


>gnl|CDD|181627 PRK09053, PRK09053, 3-carboxy-cis,cis-muconate cycloisomerase;
           Provisional.
          Length = 452

 Score = 30.8 bits (70), Expect = 1.3
 Identities = 18/47 (38%), Positives = 27/47 (57%), Gaps = 4/47 (8%)

Query: 303 LRRKQRLSQVQVRGLIDE---RADFLAALQDHQREVA-ALKQQLGLA 345
           LR +QRL+ ++ R L+ +    A  LA+L +    VA AL  +L LA
Sbjct: 176 LRHRQRLAALRPRALVLQFGGAAGTLASLGEQALPVAQALAAELQLA 222


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 30.8 bits (70), Expect = 1.4
 Identities = 39/244 (15%), Positives = 79/244 (32%), Gaps = 17/244 (6%)

Query: 79  KLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISVL 138
           ++    ++I E   +   +L + +  L+ E   L  +L        K ++   +++   L
Sbjct: 46  EIAALEKKIREQQDQ-RAKLEKQLKSLETEIASLEAQLIETADDLKKLRKQIADLNAR-L 103

Query: 139 QKLRGQVDSLRDQMRLKEKELGYRNSDVDNSDIPDLSNKAVYDLDDPNRPRFTTAELKEI 198
             L  Q    R+Q R   ++L        N     L +      +D  R          +
Sbjct: 104 NALEVQ---EREQRRRLAEQLAALQRSGRNPPPALLVSP-----EDAQRSVRLAIYYGAL 155

Query: 199 LHDRNTLKTRLNELEEERQCFKMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISAL 258
              R     R++ L+   +    V       +     L  E     ++QQ      +   
Sbjct: 156 NPAR---AERIDALKATLKQLAAVRAEIAAEQAELTTLLSE----QRAQQAKLAQLLEER 208

Query: 259 QKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQVRGLI 318
           +K   Q++S     + K +EL    S + N     +   +  RE       +  + R   
Sbjct: 209 KKTLAQLNSELSADQKKLEELRANESRLKNEIASAEAAAAKAREAAAAAEAAAARARAAE 268

Query: 319 DERA 322
            +R 
Sbjct: 269 AKRT 272


>gnl|CDD|223327 COG0249, MutS, Mismatch repair ATPase (MutS family) [DNA
           replication, recombination, and repair].
          Length = 843

 Score = 30.7 bits (70), Expect = 1.5
 Identities = 14/48 (29%), Positives = 22/48 (45%), Gaps = 5/48 (10%)

Query: 189 RFTTAELKEILHDRNTLKTRLNELEEERQCFKMVNRLQEENKKLSKRL 236
           RFTT ELKE+       + ++  LE E     + + L+E+       L
Sbjct: 500 RFTTPELKELEEKLLDAEEKILALEYE-----LFDELREKILAHINEL 542


>gnl|CDD|233690 TIGR02025, BchH, magnesium chelatase, H subunit.  This model
           represents the H subunit of the magnesium chelatase
           complex responsible for magnesium insertion into the
           protoporphyrin IX ring in the biosynthesis of both
           chlorophyll and bacteriochlorophyll. In
           chlorophyll-utilizing species, this gene is known as
           ChlH, while in bacteriochlorophyll-utilizing spoecies it
           is called BchH. Subunit H is the largest (~140kDa) of
           the three subunits (the others being BchD/ChlD and
           BchI/ChlI), and is known to bind protoporphyrin IX.
           Subunit H is homologous to the CobN subunit of
           cobaltochelatase and by anology with that enzyme,
           subunit H is believed to also bind the magnesium ion
           which is inserted into the ring. In conjunction with the
           hydrolysis of ATP by subunits I and D, a conformation
           change is believed to happen in subunit H causing the
           magnesium ion insertion into the distorted
           protoporphyrin ring [Biosynthesis of cofactors,
           prosthetic groups, and carriers, Chlorophyll and
           bacteriochlorphyll].
          Length = 1224

 Score = 30.9 bits (70), Expect = 1.6
 Identities = 8/23 (34%), Positives = 11/23 (47%)

Query: 70  DHISKLEHEKLEREVEEIEESWK 92
             I   ++E+L    E IEE W 
Sbjct: 490 VRIPVEQYERLYPRTERIEEQWG 512


>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6.  In yeast, 15 Apg proteins
           coordinate the formation of autophagosomes. Autophagy is
           a bulk degradation process induced by starvation in
           eukaryotic cells. Apg6/Vps30p has two distinct functions
           in the autophagic process, either associated with the
           membrane or in a retrieval step of the carboxypeptidase
           Y sorting pathway.
          Length = 356

 Score = 30.2 bits (68), Expect = 1.6
 Identities = 17/99 (17%), Positives = 38/99 (38%)

Query: 264 QVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQVRGLIDERAD 323
           Q  S  +   ++       +S++D LK++ +RL   + EL ++      ++  L +E+  
Sbjct: 50  QYLSKLESQNVEISNYEALDSELDELKKEEERLLDELEELEKEDDDLDGELVELQEEKEQ 109

Query: 324 FLAALQDHQREVAALKQQLGLAEKENEDLYNSQETTSEE 362
                  + RE     +     E   + L    E +  +
Sbjct: 110 LENEELQYLREYNLFDRNNLQLEDNLQSLELQYEYSLNQ 148


>gnl|CDD|205692 pfam13514, AAA_27, AAA domain.  This domain is found in a number of
           double-strand DNA break proteins. This domain contains a
           P-loop motif.
          Length = 1118

 Score = 30.5 bits (69), Expect = 1.9
 Identities = 41/273 (15%), Positives = 96/273 (35%), Gaps = 28/273 (10%)

Query: 74  KLEHEKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEI 133
           +L + + E    E E+   E  ++L+ ++       K L ++L   G          P  
Sbjct: 189 ELANLRKELRQLEKEKQRLERLRRLLPLLAER----KALEQQLAALGEVI-----DLPPD 239

Query: 134 DISVLQKLRGQVDSLRDQMRLKEKELGYRNSDVDNSDIP-DLSNKAVYDLDDPNRPRFTT 192
            +   ++ R ++ + R  + L  + L    +++D   +  +L  +A   ++  ++ R   
Sbjct: 240 AVERYEEARAELRAARRNLELLTERLEALQAELDEISLDEELLAQAAA-IEALHQQRGEY 298

Query: 193 AELKEILHDRNTLKTRLNELEEERQCFKMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPE 252
              ++ L DR                   +   +E    L  ++  +             
Sbjct: 299 RNAEQDLPDRE----------------GEIANAREAAAALLAQIGPDADEEAVESLRPSL 342

Query: 253 IDISALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQV 312
                + +L  + ++L   ++     L  R  ++  ++ Q+  L +       +  L+  
Sbjct: 343 AAKETVTELEKRKEALDQALKSARDALEERERELKQVRAQLAALPTVTVPEPLRAALADA 402

Query: 313 QVRGLIDER-ADFLAALQDHQREVAALKQQLGL 344
              G ID   A     +   +RE+A    +LGL
Sbjct: 403 LRAGDIDAALAALEQEVAVAKRELAQALSRLGL 435



 Score = 29.0 bits (65), Expect = 6.0
 Identities = 18/103 (17%), Positives = 36/103 (34%), Gaps = 1/103 (0%)

Query: 247 QQLSPEIDISALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQID-RLTSTVRELRR 305
           + L+    I AL + RG+  +    +  +E E+         L  QI         E  R
Sbjct: 280 ELLAQAAAIEALHQQRGEYRNAEQDLPDREGEIANAREAAAALLAQIGPDADEEAVESLR 339

Query: 306 KQRLSQVQVRGLIDERADFLAALQDHQREVAALKQQLGLAEKE 348
               ++  V  L   +     AL+  +  +   +++L     +
Sbjct: 340 PSLAAKETVTELEKRKEALDQALKSARDALEERERELKQVRAQ 382


>gnl|CDD|219737 pfam08172, CASP_C, CASP C terminal.  This domain is the C-terminal
           region of the CASP family of proteins. It is a Golgi
           membrane protein which is thought to have a role in
           vesicle transport.
          Length = 245

 Score = 29.7 bits (67), Expect = 2.3
 Identities = 11/34 (32%), Positives = 19/34 (55%), Gaps = 1/34 (2%)

Query: 202 RNTLKTRLNELEEE-RQCFKMVNRLQEENKKLSK 234
           R+  + R  ELEEE R+  + ++ L+ E + L  
Sbjct: 88  RDRFRKRNTELEEELRKQNQTISSLRRELESLKA 121


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 30.0 bits (68), Expect = 2.3
 Identities = 34/165 (20%), Positives = 70/165 (42%), Gaps = 23/165 (13%)

Query: 192 TAELKEILHDRNTLKTRLNELEEERQCFKMVNRLQEENKKLSKRL-EHEGSSPMKSQQLS 250
            AEL+++  +   ++ RL   E+  +    + RL+E  + L + + E   +   K ++  
Sbjct: 481 EAELEDLEEEVEEVEERLERAEDLVEAEDRIERLEERREDLEELIAERRETIEEKRERAE 540

Query: 251 P------EIDISALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVREL- 303
                  E++  A +K R       ++     +E+   NS +  LK +I+ L   +R L 
Sbjct: 541 ELRERAAELEAEAEEK-REAAAEAEEEAEEAREEVAELNSKLAELKERIESL-ERIRTLL 598

Query: 304 -----------RRKQRLSQVQVRGLIDERADFLAALQDHQREVAA 337
                      R +++   +      DER + LA  ++ +RE+ A
Sbjct: 599 AAIADAEDEIERLREKREALAELN--DERRERLAEKRERKRELEA 641



 Score = 29.6 bits (67), Expect = 3.8
 Identities = 64/336 (19%), Positives = 131/336 (38%), Gaps = 53/336 (15%)

Query: 49  EHLECLASKNE--RENTKLQELGDHISKLEHEKLEREVEEIEESWKEE---TQQLVEMVN 103
           E LE L  + +       L +      +   E+LE   EE+ +  +E     Q   E   
Sbjct: 286 ERLEELEEERDDLLAEAGLDDADAEAVEARREELEDRDEELRDRLEECRVAAQAHNEEAE 345

Query: 104 RLQEENKKLSKRLE--HEGSSPMKSQQLSPEIDISVLQKLRGQVDSLRDQMRLKEKELGY 161
            L+E+   L +R E   E ++ ++S+       +      R +++ L +++    +  G 
Sbjct: 346 SLREDADDLEERAEELREEAAELESELEEAREAVE---DRREEIEELEEEIEELRERFGD 402

Query: 162 RNSDVDNSDIPDLSNKAVYDLDDPNRPRFTTAELKEILHDRNTLKTRLNELEEERQCFKM 221
              D+ N++                        L+E+  +R+ L+ R  ELE   +  + 
Sbjct: 403 APVDLGNAE----------------------DFLEELREERDELREREAELEATLRTAR- 439

Query: 222 VNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISALQKLRGQVDSLRDQMRLKEKELGY 281
             R++E     ++ L   G  P   Q +     +  +++ R +V+ L  ++   E+E+  
Sbjct: 440 -ERVEE-----AEALLEAGKCPECGQPVEGSPHVETIEEDRERVEELEAELEDLEEEVEE 493

Query: 282 RNSDVDNLK------RQIDRL---TSTVRELRRKQRLS----QVQVRGLIDERADFLAAL 328
               ++  +       +I+RL      + EL  ++R +    + +   L  ERA  L A 
Sbjct: 494 VEERLERAEDLVEAEDRIERLEERREDLEELIAERRETIEEKRERAEEL-RERAAELEAE 552

Query: 329 QDHQREVAALKQQLGLAEKENEDLYNSQETTSEEER 364
            + +RE AA  ++     +E     NS+    +E  
Sbjct: 553 AEEKREAAAEAEEEAEEAREEVAELNSKLAELKERI 588


>gnl|CDD|131743 TIGR02696, pppGpp_PNP, guanosine pentaphosphate synthetase
           I/polynucleotide phosphorylase.  Sohlberg, et al present
           characterization of two proteins from Streptomyces
           coelicolor. The protein in this family was shown to have
           poly(A) polymerase activity and may be responsible for
           polyadenylating RNA in this species. Reference 2 showed
           that a nearly identical plasmid-encoded protein from
           Streptomyces antibioticus is a bifunctional enzyme that
           acts also as a guanosine pentaphosphate synthetase.
          Length = 719

 Score = 30.2 bits (68), Expect = 2.4
 Identities = 27/103 (26%), Positives = 45/103 (43%), Gaps = 7/103 (6%)

Query: 16  DIASEIGK---EFEKIIDQYGPDAVTSIMPKVIVALEHLECLASKNERENTKLQELGDHI 72
           D+A +  K   EF    D Y  D   ++   V   L     +A K ERE   L E+   +
Sbjct: 242 DLAEKAAKPTGEFPLFPD-YQDDVYEAVEGAVKDELSAALTIAGKQEREE-ALDEVKALV 299

Query: 73  SKLEHEKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKR 115
           +    E+ E   +EI  +++  T++LV    R+  E  ++  R
Sbjct: 300 AAKLAEQFEGREKEISAAYRAVTKKLVR--ERVLTEGVRIDGR 340


>gnl|CDD|218602 pfam05478, Prominin, Prominin.  The prominins are an emerging
           family of proteins that among the multispan membrane
           proteins display a novel topology. Mouse prominin and
           human prominin (mouse)-like 1 (PROML1) are predicted to
           contain five membrane spanning domains, with an
           N-terminal domain exposed to the extracellular space
           followed by four, alternating small cytoplasmic and
           large extracellular, loops and a cytoplasmic C-terminal
           domain. The exact function of prominin is unknown
           although in humans defects in PROM1, the gene coding for
           prominin, cause retinal degeneration.
          Length = 807

 Score = 30.0 bits (68), Expect = 2.4
 Identities = 19/57 (33%), Positives = 29/57 (50%), Gaps = 1/57 (1%)

Query: 65  LQELGDHISKLEHEKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEHEGS 121
           L   G+H+    H++L+  V  + E+  E  Q L EM + L+  NK L + L   GS
Sbjct: 216 LDGAGEHLGGRIHDELKSHVLPVLEAIVELGQVLQEMKDGLESVNKTLKE-LREGGS 271


>gnl|CDD|218661 pfam05622, HOOK, HOOK protein.  This family consists of several
           HOOK1, 2 and 3 proteins from different eukaryotic
           organisms. The different members of the human gene
           family are HOOK1, HOOK2 and HOOK3. Different domains
           have been identified in the three human HOOK proteins,
           and it was demonstrated that the highly conserved
           NH2-domain mediates attachment to microtubules, whereas
           the central coiled-coil motif mediates homodimerisation
           and the more divergent C-terminal domains are involved
           in binding to specific organelles (organelle-binding
           domains). It has been demonstrated that endogenous HOOK3
           binds to Golgi membranes, whereas both HOOK1 and HOOK2
           are localised to discrete but unidentified cellular
           structures. In mice the Hook1 gene is predominantly
           expressed in the testis. Hook1 function is necessary for
           the correct positioning of microtubular structures
           within the haploid germ cell. Disruption of Hook1
           function in mice causes abnormal sperm head shape and
           fragile attachment of the flagellum to the sperm head.
          Length = 713

 Score = 30.2 bits (68), Expect = 2.4
 Identities = 36/133 (27%), Positives = 58/133 (43%), Gaps = 30/133 (22%)

Query: 189 RFTTAELKEILHDRNTLKTRLNELE-------EERQCFKMVNRLQEENKKLSKRLEHEGS 241
           +    +LKE   +++ L  R +EL+       EE+      N LQ+EN+KL +RL     
Sbjct: 175 KKALEDLKEAQEEKDELAQRCHELDKQVLLLQEEK------NSLQQENEKLQERLAQLEG 228

Query: 242 SPMKSQQLSPEIDISALQKLRGQVDSL----------RDQMRLKEKELGYRNSDVDNLKR 291
           S +   QL           L+ Q++ L          RD  R+K +EL     ++  L+ 
Sbjct: 229 SSLGPNQLGS----KKYNLLQSQLEQLQEENFRLEAARDDYRIKCEEL---EKELAELQH 281

Query: 292 QIDRLTSTVRELR 304
           + D LTS   E +
Sbjct: 282 RNDELTSLAAESQ 294


>gnl|CDD|132534 TIGR03495, phage_LysB, phage lysis regulatory protein, LysB family.
            Members of this protein family are phage lysis
           regulatory protein, including the well-studied protein
           LysB (lysis protein B) of Enterobacteria phage P2. For
           members of this family, genes are found in phage or in
           prophage regions of bacterial genomes, typically near a
           phage lysozyme or phage holin.
          Length = 135

 Score = 28.9 bits (65), Expect = 2.4
 Identities = 18/87 (20%), Positives = 36/87 (41%), Gaps = 14/87 (16%)

Query: 266 DSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQVRGLIDERADFL 325
            + R  +    + L  + +++ +   Q+  L      L ++   +Q Q+R    + A   
Sbjct: 22  RNARADLERANRVLKAQQAELASKANQLIVLL----ALAKRNEEAQAQLR---QQLAQAR 74

Query: 326 AALQDHQREVAALKQQLGLAEKENEDL 352
           A L   ++ +  LK       +ENEDL
Sbjct: 75  ALLAQREQRIERLK-------RENEDL 94


>gnl|CDD|147197 pfam04906, Tweety, Tweety.  The tweety (tty) gene has not been
           characterized at the protein level. However, it is
           thought to form a membrane protein with five potential
           membrane-spanning regions. A number of potential
           functions have been suggested in.
          Length = 406

 Score = 30.0 bits (68), Expect = 2.5
 Identities = 17/66 (25%), Positives = 27/66 (40%), Gaps = 1/66 (1%)

Query: 278 ELGYRNSDVDNLKRQIDRLTS-TVRELRRKQRLSQVQVRGLIDERADFLAALQDHQREVA 336
           +L Y   + ++    ID L S T   L         ++  +  +R D+L  L+  Q  V 
Sbjct: 93  QLTYSLRNANHTVSGIDNLVSDTTNSLNETLEQHLTRLEEIFAKRTDYLQTLRFLQGNVG 152

Query: 337 ALKQQL 342
              QQL
Sbjct: 153 NAVQQL 158


>gnl|CDD|220368 pfam09730, BicD, Microtubule-associated protein Bicaudal-D.  BicD
           proteins consist of three coiled-coiled domains and are
           involved in dynein-mediated minus end-directed transport
           from the Golgi apparatus to the endoplasmic reticulum
           (ER). For full functioning they bind with GSK-3beta
           pfam05350 to maintain the anchoring of microtubules to
           the centromere. It appears that amino-acid residues
           437-617 of BicD and the kinase activity of GSK-3 are
           necessary for the formation of a complex between BicD
           and GSK-3beta in intact cells.
          Length = 711

 Score = 29.8 bits (67), Expect = 2.7
 Identities = 61/279 (21%), Positives = 116/279 (41%), Gaps = 31/279 (11%)

Query: 56  SKNERENTKLQELGDHISKLEHEKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKR 115
           ++NER ++  QEL +    LE ++  R  +EI+E +K    +L++  + L+EEN  L K+
Sbjct: 55  AENERLSSLSQELKEENEMLELQRG-RLRDEIKE-YKFREARLLQDYSELEEENISLQKQ 112

Query: 116 LEHEGSSPMKSQQLSPE---IDISVLQK----LRGQVDSLRDQMRLKEKELG------YR 162
           +     S ++  Q+  E    +I  L++    L  Q++       + EK+L         
Sbjct: 113 V-----SVLRQSQVEFEGLKHEIRRLEEETELLNSQLEEAARLKEIAEKQLEEALETLKT 167

Query: 163 NSDVDNSDIPDLSNK-AVYDLDDPNRPRFTTAELKEILHDRNTLKTRLNELEEERQCFKM 221
             +  N+   +LS+   + D    +    +   LK    D        N+ E+E      
Sbjct: 168 EREQKNALRKELSHHLNLEDFYSLSNLSISLDGLK-FSEDPEASTEPNNDGEDENGHLNG 226

Query: 222 VNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISALQKLRGQVDS-------LRDQMRL 274
              L +      K  E    +P  S   S E+++S +QKL+ Q++        L   ++ 
Sbjct: 227 GPGLAKSLGTPRKG-ELFPPAPGVSDLFS-ELNLSEIQKLKQQLEQVEREKASLLANLQE 284

Query: 275 KEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQ 313
            + +L      +     +++RLT  +  LR  Q   + Q
Sbjct: 285 LQTQLEQSEGALSEQHEKVNRLTEHLNALRNLQASKEQQ 323


>gnl|CDD|147523 pfam05384, DegS, Sensor protein DegS.  This is small family of
           Bacillus DegS proteins. The DegS-DegU two-component
           regulatory system of Bacillus subtilis controls various
           processes that characterize the transition from the
           exponential to the stationary growth phase, including
           the induction of extracellular degradative enzymes,
           expression of late competence genes and down-regulation
           of the sigma D regulon. The family also contains one
           sequence from Thermoanaerobacter tengcongensis which is
           described as a sensory transduction histidine kinase.
          Length = 159

 Score = 28.7 bits (65), Expect = 3.0
 Identities = 15/44 (34%), Positives = 33/44 (75%), Gaps = 4/44 (9%)

Query: 73  SKLEHEKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRL 116
           ++ E+E+LE+E+EE+    KEE  ++++ V++L+++ ++  KRL
Sbjct: 25  ARQEYERLEQELEEV----KEEVSEVIKEVDKLEKQERRARKRL 64


>gnl|CDD|236123 PRK07883, PRK07883, hypothetical protein; Validated.
          Length = 557

 Score = 29.9 bits (68), Expect = 3.0
 Identities = 17/55 (30%), Positives = 24/55 (43%), Gaps = 12/55 (21%)

Query: 256 SALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLS 310
           + L  LR ++D L    R +E              R  DRL + +R L R QRL+
Sbjct: 406 AVLAALRARIDRLAAAERFEEAA------------RLRDRLAALLRALARAQRLA 448


>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
           interacts with SMC proteins [Cell division and
           chromosome partitioning].
          Length = 622

 Score = 29.6 bits (66), Expect = 3.3
 Identities = 49/326 (15%), Positives = 114/326 (34%), Gaps = 86/326 (26%)

Query: 59  ERENTKLQELGDHISKLEHEKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEH 118
               T+   L + I   E  K+ ++++ + E W+       +  N +    +K  +    
Sbjct: 274 ANLKTQNDNLYEKIQ--EAMKISQKIKTLREKWRALKSDSNKYENYVNAMKQKSQEW--- 328

Query: 119 EGSSPMKSQQLSPEIDI--SVLQKLRGQVDSLRDQMRLKEKELGYRNSDVDNSDIPDLSN 176
               P K ++L  EI++    ++ L+  +D L  Q+R +                     
Sbjct: 329 ----PGKLEKLKSEIELKEEEIKALQSNIDELHKQLRKQ--------------------- 363

Query: 177 KAVYDLDDPNRPRFTTAELKEILHDRNTLKTRLNELEEERQCFKMVNRLQEENKKLSKRL 236
                         +T + + +  +R  L              + ++++  ++ KL+K +
Sbjct: 364 ------------GISTEQFELMNQEREKL-------------TRELDKINIQSDKLTKSV 398

Query: 237 EHEGSSPMKSQQLSPEIDISALQKLRGQVDSLRDQMRLKEKELGYRNSDV---------- 286
                   KS++L  +    +L+K   Q DSL   +     ++G+  +D           
Sbjct: 399 --------KSRKLEAQGIFKSLEKTLRQYDSLIQNITRSRSQIGHNVNDSSLKINIEQLF 450

Query: 287 -------DNLKRQIDRLTSTVRELRRKQRLSQVQVRGLIDERADFLAALQDHQR----EV 335
                  +++K+ I  L   ++E  + +    + +   I      +  L         E+
Sbjct: 451 PKGSGINESIKKSILELNDEIQERIKTEENKSITLEEDIKNLKHDINELTQILEKLELEL 510

Query: 336 AALKQQLGLAEKENEDLYNSQETTSE 361
           +    +  L+++ENE    +Q    E
Sbjct: 511 SEANSKFELSKEENERELVAQRIEIE 536


>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
           with the myosin-like domain [Function unknown].
          Length = 499

 Score = 29.6 bits (66), Expect = 3.4
 Identities = 32/177 (18%), Positives = 65/177 (36%), Gaps = 11/177 (6%)

Query: 194 ELKEILHDRNTLKTRLNELEEERQCFKMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEI 253
           E +    +R   ++ L +  +ER+  +    L    + L+K  +       ++Q L   +
Sbjct: 96  EKRAAETEREAARSELQKARQEREAVR--QELAAARQNLAKAQQELARLTKQAQDLQTRL 153

Query: 254 DISALQ--KLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQ 311
              A Q  +L  Q  SL+   +  +       S V +LK +  ++    + L  +     
Sbjct: 154 KTLAEQRRQLEAQAQSLQASQKQLQASATQLKSQVLDLKLRSAQIEQEAQNLATRAN--- 210

Query: 312 VQVRGLIDERADFLAALQDHQREVAALKQQLGLAEKENEDLYNSQETTSEEERPVQG 368
              +   +E A   AA Q   + +     Q     ++ + +    E   E ER +Q 
Sbjct: 211 -AAQARTEELARRAAAAQQTAQAIQQRDAQ---ISQKAQQIAARAEQIRERERQLQR 263


>gnl|CDD|226873 COG4467, COG4467, Regulator of replication initiation timing
           [Replication,    recombination, and repair].
          Length = 114

 Score = 28.2 bits (63), Expect = 3.5
 Identities = 17/55 (30%), Positives = 26/55 (47%), Gaps = 6/55 (10%)

Query: 67  ELGDHISKLEHE--KLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEHE 119
           E+ D +  LE +   L  E+  +    K+    LVE    L+ EN+KL +RL   
Sbjct: 5   EIFDQVDNLEEQLGVLLAELGGL----KQHLGSLVEENTALRLENEKLRERLGEP 55


>gnl|CDD|224365 COG1448, TyrB, Aspartate/tyrosine/aromatic aminotransferase [Amino
           acid transport and metabolism].
          Length = 396

 Score = 29.5 bits (67), Expect = 3.6
 Identities = 18/77 (23%), Positives = 28/77 (36%), Gaps = 23/77 (29%)

Query: 85  EEIEESWKEETQQLVEMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDISVLQKLRG- 143
            E+   W++E   L EM  R+ E  + L   L+  G             +   +   RG 
Sbjct: 301 PELRAEWEQE---LEEMRQRILEMRQALVDALKALG----------APRNFDFIISQRGM 347

Query: 144 ---------QVDSLRDQ 151
                    QVD LR++
Sbjct: 348 FSYTGLSPEQVDRLREE 364


>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
           Transcription of the anti-viral guanylate-binding
           protein (GBP) is induced by interferon-gamma during
           macrophage induction. This family contains GBP1 and
           GPB2, both GTPases capable of binding GTP, GDP and GMP.
          Length = 297

 Score = 29.2 bits (66), Expect = 3.6
 Identities = 22/87 (25%), Positives = 41/87 (47%), Gaps = 6/87 (6%)

Query: 73  SKLEHEKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPE 132
            + + E  E E E + E  KEE Q +       QE  K+L +++E E       ++L  E
Sbjct: 209 ERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAE------REKLLAE 262

Query: 133 IDISVLQKLRGQVDSLRDQMRLKEKEL 159
            +  +  KL+ Q + L++  + + + L
Sbjct: 263 QERMLEHKLQEQEELLKEGFKTEAESL 289


>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
          Length = 1463

 Score = 29.7 bits (67), Expect = 3.9
 Identities = 13/62 (20%), Positives = 26/62 (41%), Gaps = 6/62 (9%)

Query: 56  SKNERENTKLQELGDHISKLEHEKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKR 115
           S +  EN + +   + I++L+      ++EE            +  +N+L E   KL + 
Sbjct: 916 SSDLIENLEFKT--ELIARLKKLLNNIDLEEG----PSIEYVKLPELNKLHEVESKLKET 969

Query: 116 LE 117
            E
Sbjct: 970 SE 971



 Score = 28.5 bits (64), Expect = 7.1
 Identities = 40/238 (16%), Positives = 80/238 (33%), Gaps = 45/238 (18%)

Query: 135  ISVLQKLRGQVDSLRDQMRLKEKELGYRNSDVDNSDIPDLSNKAVYDLDDPNRPRFTTAE 194
            +  L+     + SL+      E E+      + +  I +L  K               A 
Sbjct: 884  LQELKIDVKSISSLKLVNLELESEIIELKKSLSSDLIENLEFKTELI-----------AR 932

Query: 195  LKEILHDRNTLKTRLNELEEERQCFKMVNRLQEENKKLSKRLEHEGSSPMKSQQLSPEID 254
            LK++L++ +  +    E  +  +    +N+L E   KL              ++ S E +
Sbjct: 933  LKKLLNNIDLEEGPSIEYVKLPE----LNKLHEVESKL--------------KETSEEYE 974

Query: 255  --ISALQKLRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQV 312
              +     L  + +    +++  +KEL   +     L+    +L     E+   Q  S++
Sbjct: 975  DLLKKSTILVREGNKANSELKNFKKELAELSKQYGALQESTKQLKELPVEVAELQSASKI 1034

Query: 313  ---------QVRGLIDERADFLAALQDHQREVAALKQQLGLAEKENEDLYNSQETTSE 361
                      ++ L   +   L      Q    ALK +     +EN  L + Q    E
Sbjct: 1035 ISSESTELSILKPLQKLKGLLLLENNQLQARYKALKLR-----RENSLLDDKQLYQLE 1087


>gnl|CDD|220597 pfam10139, Virul_Fac, Putative bacterial virulence factor.  Members
           of this family of prokaryotic proteins include various
           putative virulence factor effector proteins. Their exact
           function is, as yet, unknown.
          Length = 852

 Score = 29.3 bits (66), Expect = 4.0
 Identities = 21/91 (23%), Positives = 29/91 (31%), Gaps = 8/91 (8%)

Query: 295 RLTSTVRELRRKQRLSQVQVRGLIDERAD-------FLAALQDHQREVAALKQQLGLAEK 347
           +L    REL      +  Q     DERA+        L ALQ+       L + L     
Sbjct: 560 QLDELQRELGENLLANYYQADD-EDERAEKRRIAEQLLRALQERAHRHGELLEGLLPPRD 618

Query: 348 ENEDLYNSQETTSEEERPVQGPLPYEPDDAP 378
           +   LY  Q+  S     +  P        P
Sbjct: 619 QLRRLYLQQQEFSGGAADLSAPGEVADPPRP 649


>gnl|CDD|236510 PRK09428, pssA, phosphatidylserine synthase; Provisional.
          Length = 451

 Score = 29.4 bits (67), Expect = 4.1
 Identities = 15/36 (41%), Positives = 20/36 (55%), Gaps = 5/36 (13%)

Query: 174 LSNKAVYDLDDPNRPRFTTAELKEILHDRNTLKTRL 209
           L++ AV  LD PNRP     + KEI +D    + RL
Sbjct: 192 LNSPAVNRLDQPNRP-----KTKEIKNDIRQFRQRL 222


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 29.4 bits (67), Expect = 4.2
 Identities = 32/131 (24%), Positives = 60/131 (45%), Gaps = 26/131 (19%)

Query: 15  FDIASEIGKEFEKIIDQ----YGPDAVTSIMPKVIVALEHLECLASKNERENTKLQELGD 70
           F+IA  +G   E II++     G D       K+   +  LE L  + E++  + + L  
Sbjct: 491 FEIAKRLGLP-ENIIEEAKKLIGEDKE-----KLNELIASLEELERELEQKAEEAEAL-- 542

Query: 71  HISKLEHEKLEREVEEIEESWKEETQQLV--------EMVNRLQEENKKLSKRL---EHE 119
                E EKL+ E+EE +E  +EE  +L+        + +   ++E  ++ K L   +  
Sbjct: 543 ---LKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKG 599

Query: 120 GSSPMKSQQLS 130
           G + +K+ +L 
Sbjct: 600 GYASVKAHELI 610


>gnl|CDD|220664 pfam10267, Tmemb_cc2, Predicted transmembrane and coiled-coil 2
           protein.  This family of transmembrane coiled-coil
           containing proteins is conserved from worms to humans.
           Its function is unknown.
          Length = 387

 Score = 29.0 bits (65), Expect = 4.2
 Identities = 23/90 (25%), Positives = 41/90 (45%), Gaps = 15/90 (16%)

Query: 258 LQKLRGQVDSLRD-QMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQVRG 316
           L KL  ++  +++ Q RL+E     +      L+R    +T +++E R +    + Q+  
Sbjct: 207 LDKLLEELREIKEGQSRLEESYERLKEQ----LQRDYQYITQSLQEERYRYERLEEQLND 262

Query: 317 LIDERADFLAALQDHQREVAALKQQLGLAE 346
           L           + HQ E+A LKQ+L   E
Sbjct: 263 L----------TELHQNEIANLKQELASME 282


>gnl|CDD|232931 TIGR00344, alaS, alanine--tRNA ligase.  The model describes
           alanine--tRNA ligase. This enzyme catalyzes the reaction
           (tRNAala + L-alanine + ATP = L-alanyl-tRNAala +
           pyrophosphate + AMP) [Protein synthesis, tRNA
           aminoacylation].
          Length = 851

 Score = 29.3 bits (66), Expect = 4.4
 Identities = 12/41 (29%), Positives = 21/41 (51%)

Query: 78  EKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEH 118
            + E +++E+ +  K    +L + + R  EE K L K LE 
Sbjct: 704 NEQEDKLKELADILKVTPNELPKKIERFFEEWKALQKELES 744


>gnl|CDD|112953 pfam04163, Tht1, Tht1-like nuclear fusion protein. 
          Length = 554

 Score = 29.3 bits (65), Expect = 4.5
 Identities = 24/101 (23%), Positives = 47/101 (46%), Gaps = 2/101 (1%)

Query: 52  ECLASKNERENTKLQELGDHISKLEHEKLEREVEEIEESWKEETQQLVEMVNRLQEENKK 111
           +  AS+N    + + +L D       ++++  +E  E+S KE  Q+ ++ VN   +E + 
Sbjct: 286 DEFASENAIALSAIGDLKDDFLAFLEDRIKNLIEMSEDSLKESVQRNIDFVNSGFQELED 345

Query: 112 LSKRLEHEGSSPMKSQQLSPEIDISVLQKLRGQVDSLRDQM 152
            S  L+ E       + LS + ++   + L+G  D L D  
Sbjct: 346 FSIGLKEELGG--LKKDLSEQQNLEAEEILQGFNDFLHDSK 384


>gnl|CDD|149105 pfam07851, TMPIT, TMPIT-like protein.  A number of members of this
           family are annotated as being transmembrane proteins
           induced by tumour necrosis factor alpha, but no
           literature was found to support this.
          Length = 330

 Score = 28.9 bits (65), Expect = 4.5
 Identities = 26/98 (26%), Positives = 49/98 (50%), Gaps = 14/98 (14%)

Query: 79  KLEREVEEIEESWK--EET----QQLVEMVNRLQE----ENKKLSKRLEHEGSSPMK--S 126
           + E E +E+E+ ++  +ET    +Q +E V +LQE       +  KRL  +    +K   
Sbjct: 1   ECEEEWKELEKEFQQLQETHRLYKQKLEEVEKLQEQCTSAIARQRKRL-KQLIVSLKKLK 59

Query: 127 QQLSPEIDISVLQKLRGQVDSLRDQMRLKEKELGYRNS 164
           + L+PE D  ++++L  Q+   ++Q    E  L  +N 
Sbjct: 60  KSLTPE-DSELVEQLEEQIKERKNQFFDMESFLPKKNG 96


>gnl|CDD|234309 TIGR03683, A-tRNA_syn_arch, alanyl-tRNA synthetase.  This family of
           alanyl-tRNA synthetases is limited to the archaea, and
           is a subset of those sequences identified by the model
           pfam07973 covering the second additional domain (SAD) of
           alanyl and threonyl tRNA synthetases.
          Length = 902

 Score = 29.2 bits (66), Expect = 4.8
 Identities = 14/41 (34%), Positives = 21/41 (51%)

Query: 78  EKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEH 118
           ++LE  + E  +  K   +QL E V R  EE K+  K +E 
Sbjct: 743 QELEDLLRESSDILKVPPEQLPETVKRFFEEWKEQRKEIER 783


>gnl|CDD|191511 pfam06409, NPIP, Nuclear pore complex interacting protein (NPIP).
           This family consists of a series of primate specific
           nuclear pore complex interacting protein (NPIP)
           sequences. The function of this family is unknown but is
           well conserved from African apes to humans.
          Length = 350

 Score = 29.0 bits (64), Expect = 4.8
 Identities = 19/60 (31%), Positives = 30/60 (50%), Gaps = 8/60 (13%)

Query: 56  SKNERENTKLQELGDHISKL-----EHEKLEREVEEIEESWK---EETQQLVEMVNRLQE 107
           + +++ N K +   +H+ KL     EH + ER+V E EE+ K   +E     EM  R Q 
Sbjct: 131 NHHDKINGKRKTAKEHLRKLSMKECEHAEKERQVSEAEENGKLDMKEIHTYKEMFQRAQA 190


>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
           Provisional.
          Length = 1123

 Score = 29.1 bits (66), Expect = 5.0
 Identities = 9/62 (14%), Positives = 23/62 (37%)

Query: 289 LKRQIDRLTSTVRELRRKQRLSQVQVRGLIDERADFLAALQDHQREVAALKQQLGLAEKE 348
           L++++  L   +    R++  SQ        E         + + +   L+ QL   +++
Sbjct: 147 LQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELEAQLEQLQEK 206

Query: 349 NE 350
             
Sbjct: 207 AA 208


>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein.  This family consists
           of several eukaryotic mitotic checkpoint (Mitotic arrest
           deficient or MAD) proteins. The mitotic spindle
           checkpoint monitors proper attachment of the bipolar
           spindle to the kinetochores of aligned sister chromatids
           and causes a cell cycle arrest in prometaphase when
           failures occur. Multiple components of the mitotic
           spindle checkpoint have been identified in yeast and
           higher eukaryotes. In S.cerevisiae, the existence of a
           Mad1-dependent complex containing Mad2, Mad3, Bub3 and
           Cdc20 has been demonstrated.
          Length = 722

 Score = 29.1 bits (65), Expect = 5.0
 Identities = 68/324 (20%), Positives = 123/324 (37%), Gaps = 22/324 (6%)

Query: 54  LASKNERENTKLQELGDHISKLEHE--KLEREVEEIEESWKEETQQLVEMVNRLQEE--N 109
           LA   ERE  +  EL   +  LE    K E E  E EE  K    +L     +LQ E  +
Sbjct: 98  LAENYERELDRNLELEVRLKALEELEKKAENEAAEAEEEAKLLKDKLDAESLKLQNEKED 157

Query: 110 KKLSKRLEHEGSSPMKSQ-QLSPEIDISVLQKLRGQVDSLRDQMRLKEKELGYRNSDVDN 168
           +    +          S+ Q   +   + L+ L  +++ LR+Q+   +KEL         
Sbjct: 158 QLKEAKESISRIKNDLSEMQCRAQNADTELKLLESELEELREQLEECQKELAEA-----E 212

Query: 169 SDIPDLSNKAVYDLDDPNRPRFTTAELKEILHDRNTLKTRLNELEEERQCFKMVNRLQEE 228
             +  L+++     D+  + +    ELK    D   +K+   +L +  +  + +  L+EE
Sbjct: 213 KKLQSLTSEQASSADNSVKIKHLEEELKRYEQDAEVVKSMKEQLLQIPELERELAALREE 272

Query: 229 NKKLSKRLEHEGSSPMKSQQLSPEIDISALQKLRGQVDSLRDQMRLKEKEL--------- 279
           N+KL      E +  +K +    +  +   +K+R ++  L  +    E EL         
Sbjct: 273 NRKLRSM--KEDNELLKEELEDLQSRLERFEKMREKLADLELEKEKLENELKSWKSLLQD 330

Query: 280 -GYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQVRGLIDERADFLAALQDHQREVAAL 338
            G      D+L R+I  L +   +L+ K        + L           Q    E+  L
Sbjct: 331 IGLNLRTPDDLSRRIVVLQNEELQLKEKNGSISSSAKQLETTLQQLQLERQKAVSEILEL 390

Query: 339 KQQLGLAEKENEDLYNSQETTSEE 362
           K++L   +     L       ++E
Sbjct: 391 KKKLEALKALVRRLQRRLTLVTKE 414


>gnl|CDD|233830 TIGR02350, prok_dnaK, chaperone protein DnaK.  Members of this
           family are the chaperone DnaK, of the DnaK-DnaJ-GrpE
           chaperone system. All members of the seed alignment were
           taken from completely sequenced bacterial or archaeal
           genomes and (except for Mycoplasma sequence) found
           clustered with other genes of this systems. This model
           excludes DnaK homologs that are not DnaK itself, such as
           the heat shock cognate protein HscA (TIGR01991).
           However, it is not designed to distinguish among DnaK
           paralogs in eukaryotes. Note that a number of dnaK genes
           have shadow ORFs in the same reverse (relative to dnaK)
           reading frame, a few of which have been assigned
           glutamate dehydrogenase activity. The significance of
           this observation is unclear; lengths of such shadow ORFs
           are highly variable as if the presumptive protein
           product is not conserved [Protein fate, Protein folding
           and stabilization].
          Length = 595

 Score = 28.8 bits (65), Expect = 5.5
 Identities = 17/53 (32%), Positives = 32/53 (60%), Gaps = 1/53 (1%)

Query: 65  LQELGDHISKLEHEKLEREVEEIEESWKEE-TQQLVEMVNRLQEENKKLSKRL 116
           L+E GD +   E EK+E+ V E++E+ K E  +++      LQ+  +KL++ +
Sbjct: 541 LKEAGDKLPAEEKEKIEKAVAELKEALKGEDVEEIKAKTEELQQALQKLAEAM 593


>gnl|CDD|224705 COG1792, MreC, Cell shape-determining protein [Cell envelope
           biogenesis, outer membrane].
          Length = 284

 Score = 28.5 bits (64), Expect = 5.5
 Identities = 25/107 (23%), Positives = 46/107 (42%), Gaps = 9/107 (8%)

Query: 54  LASKNERENTKLQELGDHISK--LEHEKLEREVEEIEESWKEETQQLVEMVNRLQEENKK 111
           + S  +       E  D + +     + L  E EE+++    E +QL+E V  L+EENK+
Sbjct: 43  VLSPVQSVVAAPFEFVDGVLEFLKSLKDLALENEELKKE-LAELEQLLEEVESLEEENKR 101

Query: 112 LSKRL----EHEGSSPMKSQQLSPEIDISVLQKL--RGQVDSLRDQM 152
           L + L          P+ ++ +S   D      +  +G  D ++  M
Sbjct: 102 LKELLDFKESSSDYDPIAARVISRSPDPWSQTIVIDKGSNDGIKKGM 148



 Score = 28.1 bits (63), Expect = 8.7
 Identities = 22/88 (25%), Positives = 39/88 (44%), Gaps = 9/88 (10%)

Query: 191 TTAELKEILHDRNTLKTRLNELEEERQCFKMVNRLQEENKKLSKRL----EHEGSSPMKS 246
               LK++  +   LK  L ELE+  +    V  L+EENK+L + L          P+ +
Sbjct: 64  FLKSLKDLALENEELKKELAELEQLLE---EVESLEEENKRLKELLDFKESSSDYDPIAA 120

Query: 247 QQLSPEIDISALQKL--RGQVDSLRDQM 272
           + +S   D  +   +  +G  D ++  M
Sbjct: 121 RVISRSPDPWSQTIVIDKGSNDGIKKGM 148


>gnl|CDD|215507 PLN02939, PLN02939, transferase, transferring glycosyl groups.
          Length = 977

 Score = 28.7 bits (64), Expect = 7.4
 Identities = 30/140 (21%), Positives = 63/140 (45%), Gaps = 11/140 (7%)

Query: 186 NRPRFTTAE-LKEILHDR-------NTLKTRLNELEEERQCFKMVNRLQEENKKLSKRLE 237
           N+ R    E L++IL ++       N L+ RL+E +   +         E  ++  ++L 
Sbjct: 148 NQARLQALEDLEKILTEKEALQGKINILEMRLSETDARIKLAAQEKIHVEILEEQLEKLR 207

Query: 238 HEGSSPMKSQQLSPEIDISALQKLRGQVDSLRDQMR-LKEKELGYRNSD--VDNLKRQID 294
           +E      ++ L        L  L+ +   L+D ++ LK + +    ++  V  L+++  
Sbjct: 208 NELLIRGATEGLCVHSLSKELDVLKEENMLLKDDIQFLKAELIEVAETEERVFKLEKERS 267

Query: 295 RLTSTVRELRRKQRLSQVQV 314
            L +++REL  K  ++Q  V
Sbjct: 268 LLDASLRELESKFIVAQEDV 287


>gnl|CDD|222347 pfam13729, TraF_2, F plasmid transfer operon, TraF, protein. 
          Length = 267

 Score = 28.0 bits (63), Expect = 7.5
 Identities = 18/67 (26%), Positives = 25/67 (37%), Gaps = 7/67 (10%)

Query: 279 LGYRNSDVDNLKRQIDRLTSTVRELRRKQRLSQVQVRGLIDERADFLAALQDHQREVAAL 338
           +G + SD DNL  + D L+     L      S V         A    AL+D   + A  
Sbjct: 25  VGAQVSDPDNLVDKFDDLSDIYDALSSALDNSDVD-----AAAARLAGALRDLDGDTAYA 79

Query: 339 KQQLGLA 345
             Q G+ 
Sbjct: 80  --QAGVG 84


>gnl|CDD|214710 smart00533, MUTSd, DNA-binding domain of DNA mismatch repair MUTS
           family. 
          Length = 308

 Score = 28.0 bits (63), Expect = 8.0
 Identities = 40/207 (19%), Positives = 83/207 (40%), Gaps = 32/207 (15%)

Query: 51  LECLASKNERENTKLQELGD---------HISKLEHEKLEREVEEIEESWKEETQQLVEM 101
           LE L S+ ER     ++L            I +L        +  + +   E   +L+E+
Sbjct: 64  LERLLSRIERGRASPRDLLRLYDSLEGLKEIRQLLESLDGPLLGLLLKVILEPLLELLEL 123

Query: 102 VNRLQEENKKLSKRLEHEGSSPMKSQQLSPEIDIS--VLQKLRGQVDSLRDQMRLKEKEL 159
           +  L  ++  L   +   G   +      PE+D     L++L  +++ L  + R   +EL
Sbjct: 124 LLELLNDDDPL--EVNDGG---LIKDGFDPELDELREKLEELEEELEELLKKER---EEL 175

Query: 160 GYRNSDVDNSDIP----DLSNKAVYDLDD--------PNRPRFTTAELKEILHDRNTLKT 207
           G  +  +  + +     +++      +           N  RFTT ELKE+ ++    K 
Sbjct: 176 GIDSLKLGYNKVHGYYIEVTKSEAKKVPKDFIRRSSLKNTERFTTPELKELENELLEAKE 235

Query: 208 RLNELEEERQCFKMVNRLQEENKKLSK 234
            +  LE+E    +++ ++ E  ++L  
Sbjct: 236 EIERLEKE-ILRELLEKVLEYLEELRA 261


>gnl|CDD|153271 cd07307, BAR, The Bin/Amphiphysin/Rvs (BAR) domain, a dimerization
           module that binds membranes and detects membrane
           curvature.  BAR domains are dimerization, lipid binding
           and curvature sensing modules found in many different
           proteins with diverse functions including organelle
           biogenesis, membrane trafficking or remodeling, and cell
           division and migration. Mutations in BAR containing
           proteins have been linked to diseases and their
           inactivation in cells leads to altered membrane
           dynamics. A BAR domain with an additional N-terminal
           amphipathic helix (an N-BAR) can drive membrane
           curvature. These N-BAR domains are found in amphiphysins
           and endophilins, among others. BAR domains are also
           frequently found alongside domains that determine lipid
           specificity, such as the Pleckstrin Homology (PH) and
           Phox Homology (PX) domains which are present in beta
           centaurins (ACAPs and ASAPs) and sorting nexins,
           respectively. A FES-CIP4 Homology (FCH) domain together
           with a coiled coil region is called the F-BAR domain and
           is present in Pombe/Cdc15 homology (PCH) family
           proteins, which include Fes/Fes tyrosine kinases, PACSIN
           or syndapin, CIP4-like proteins, and srGAPs, among
           others. The Inverse (I)-BAR or IRSp53/MIM homology
           Domain (IMD) is found in multi-domain proteins, such as
           IRSp53 and MIM, that act as scaffolding proteins and
           transducers of a variety of signaling pathways that link
           membrane dynamics and the underlying actin cytoskeleton.
           BAR domains form dimers that bind to membranes, induce
           membrane bending and curvature, and may also be involved
           in protein-protein interactions. The I-BAR domain
           induces membrane protrusions in the opposite direction
           compared to classical BAR and F-BAR domains, which
           produce membrane invaginations. BAR domains that also
           serve as protein interaction domains include those of
           arfaptin and OPHN1-like proteins, among others, which
           bind to Rac and Rho GAP domains, respectively.
          Length = 194

 Score = 27.8 bits (62), Expect = 8.3
 Identities = 30/165 (18%), Positives = 65/165 (39%), Gaps = 21/165 (12%)

Query: 54  LASKNERENTKLQELGDHISKLEHEKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLS 113
           L    +      ++L + + +L  E  +    ++     E  ++  ++   L+E   +L 
Sbjct: 19  LLDSLKELPAAAEKLSEALQELGKELPDLSNTDL----GEALEKFGKIQKELEEFRDQLE 74

Query: 114 KRLEHEGSSPMKSQQLSPEIDISVLQKLRGQVDSLRDQM-RLKEKELGYRNSDVDNSDIP 172
           ++LE++   P+K        D+  ++K R ++D  R      +EK    R    D+S + 
Sbjct: 75  QKLENKVIEPLKEYLKK---DLKEIKKRRKKLDKARLDYDAAREKLKKLRKKKKDSSKLA 131

Query: 173 DLSNKAVYDLDDPNRPRFTTAELKEILHDRNTLKTRLNELEEERQ 217
           +   +                  ++    R  L   LN+LEE+R+
Sbjct: 132 EAEEE-------------LQEAKEKYEELREELIEDLNKLEEKRK 163


>gnl|CDD|204811 pfam12026, DUF3513, Domain of unknown function (DUF3513).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 192 to 218 amino acids in length. This domain is
           found associated with pfam00018, pfam08824. This domain
           has a conserved QPP sequence motif.
          Length = 208

 Score = 27.8 bits (62), Expect = 8.6
 Identities = 15/57 (26%), Positives = 24/57 (42%), Gaps = 3/57 (5%)

Query: 59  ERENTKLQELGDHISKLEHEKLEREVEEIEE---SWKEETQQLVEMVNRLQEENKKL 112
           +R   +L E G      + E+LEREV +  E   S            + L  E+++L
Sbjct: 30  QRTQKELLEKGPITQLKQFERLEREVAKPAEADLSGWSPGDLTPPPSSSLSPEDRQL 86


>gnl|CDD|220517 pfam10005, DUF2248, Uncharacterized protein conserved in bacteria
           (DUF2248).  Members of this family of hypothetical
           bacterial proteins have no known function.
          Length = 340

 Score = 28.0 bits (63), Expect = 8.7
 Identities = 9/13 (69%), Positives = 10/13 (76%)

Query: 319 DERADFLAALQDH 331
           DER D+ AALQ H
Sbjct: 209 DERQDYGAALQRH 221


>gnl|CDD|131187 TIGR02132, phaR_Bmeg, polyhydroxyalkanoic acid synthase, PhaR
           subunit.  This model describes a protein, PhaR,
           localized to polyhydroxyalkanoic acid (PHA) inclusion
           granules in Bacillus cereus and related species. PhaR is
           required for PHA biosynthesis along with PhaC and may be
           a regulatory subunit.
          Length = 189

 Score = 27.5 bits (60), Expect = 8.7
 Identities = 22/107 (20%), Positives = 42/107 (39%), Gaps = 5/107 (4%)

Query: 261 LRGQVDSLRDQMRLKEKELGYRNSDVDNLKRQIDRLTSTVRELRRK--QRLSQVQVRGLI 318
           L  +VD + +    K  EL  +      LK+ + +L   ++ L +K  + L  ++ +   
Sbjct: 84  LEEKVDLIEEFFDDKFDELEAQQEQAPALKKDVTKLKQDIKSLDKKLDKILELLEGQ--- 140

Query: 319 DERADFLAALQDHQREVAALKQQLGLAEKENEDLYNSQETTSEEERP 365
            +  D L      Q +    + Q  L EK+       +    EE +P
Sbjct: 141 QKTQDELKETIQKQIKTQGEQLQAQLLEKQEALAAKLKAEAKEEAKP 187


>gnl|CDD|152481 pfam12046, DUF3529, Protein of unknown function (DUF3529).  This
           family of proteins is functionally uncharacterized. This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 176 to 190 amino acids
           in length.
          Length = 173

 Score = 27.6 bits (62), Expect = 9.0
 Identities = 12/47 (25%), Positives = 24/47 (51%), Gaps = 3/47 (6%)

Query: 304 RRKQRLSQVQVRGLIDERADFLAAL--QDHQREVAALKQQLGLAEKE 348
           +R  R  QV+++ L++   D  + L  + H+ E+  L ++L L    
Sbjct: 128 QRASRPEQVELK-LVEADDDGGSDLRVRAHRDELIELAKELQLKSDG 173


>gnl|CDD|234005 TIGR02774, rexB_recomb, ATP-dependent nuclease subunit B.  DNA
           repair is accomplished by several different systems in
           prokaryotes. Recombinational repair of double-stranded
           DNA breaks involves the RecBCD pathway in some lineages,
           and AddAB (also called RecAB) in other. The AddA protein
           is conserved between the firmicutes and the
           alphaproteobacteria, while the partner protein is not.
           The partner may be designated AddB, as in Bacillus and
           in alphaproteobacteria, or RexB as in Streptococcus and
           Lactococcus. Note, however, that RexB proteins lack an
           N-terminal GxxGxGK[ST] ATP-binding motif found in
           Bacillus subtilis and related species, and this
           difference may be important; this model represents
           specifically RexB proteins as found in Streptococcus and
           Lactococcus [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 1076

 Score = 28.3 bits (63), Expect = 9.2
 Identities = 17/60 (28%), Positives = 22/60 (36%)

Query: 68  LGDHISKLEHEKLEREVEEIEESWKEETQQLVEMVNRLQEENKKLSKRLEHEGSSPMKSQ 127
           L  ++  L     E E E+ EE WK  T  L +      +E   L   L    S    SQ
Sbjct: 475 LTKNLQDLATTLSEVEQEKQEEVWKTFTDILEQFATIFGQEKLSLDDFLALLHSGMSLSQ 534


>gnl|CDD|129266 TIGR00162, TIGR00162, TIGR00162 family protein.  This model
           represents one out of two closely related ortholgous
           sets of proteins that, so far, are found only in but are
           universal among the Archaea. This ortholog set includes
           MJ1210 from Methanococcus jannaschii and AF0525 from
           Archaeoglobus fulgidus while excluding MJ0106 and AF1251
           [Hypothetical proteins, Conserved].
          Length = 188

 Score = 27.5 bits (61), Expect = 9.7
 Identities = 9/34 (26%), Positives = 19/34 (55%), Gaps = 1/34 (2%)

Query: 75  LEHEKLEREVEEIEESWKEETQQLVEMVNRLQEE 108
            +   LE  VE +EE   +E ++++  +  ++EE
Sbjct: 138 CKMLSLEVSVEALEER-AKEMEKIIAKIKEMEEE 170


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.310    0.129    0.344 

Gapped
Lambda     K      H
   0.267   0.0724    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 19,970,960
Number of extensions: 2013414
Number of successful extensions: 4810
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4108
Number of HSP's successfully gapped: 706
Length of query: 389
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 290
Effective length of database: 6,546,556
Effective search space: 1898501240
Effective search space used: 1898501240
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 60 (26.9 bits)