RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy583
         (293 letters)



>gnl|CDD|128778 smart00502, BBC, B-Box C-terminal domain.  Coiled coil region
           C-terminal to (some) B-Box domains.
          Length = 127

 Score = 73.1 bits (180), Expect = 3e-16
 Identities = 33/113 (29%), Positives = 52/113 (46%)

Query: 126 KHLIKGLLSEVSYKRVLLKSAMKVIDDRQNLISEKKKRLTQEITEMVKLLTNAINLRGKQ 185
           +  ++ LL+++  K   L+ A+K +      + E    +  +I      L NA+N R KQ
Sbjct: 2   REALEELLTKLRKKAAELEDALKQLISIIQEVEENAADVEAQIKAAFDELRNALNKRKKQ 61

Query: 186 LAYKLNEICDSKQKTLNEKKVTLEQLSRLTDHCIEFVNNGLNTGSDMALLSSK 238
           L   L E  ++K K L ++  +L Q      H I F    LN+G    LL SK
Sbjct: 62  LLEDLEEQKENKLKVLEQQLESLTQKQEKLSHAINFTEEALNSGDPTELLLSK 114


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 42.7 bits (101), Expect = 2e-04
 Identities = 40/219 (18%), Positives = 91/219 (41%), Gaps = 11/219 (5%)

Query: 2   AEMSALKAKLKEQNQRLITLNHEKAKLEARTKANPGAHADEQEQTNLLISKKQLALKTLR 61
            E+S+L+++L+    RL  L+ E +    +     G    E EQ      K +  L+ L 
Sbjct: 688 RELSSLQSELRRIENRLDELSQELSDASRKI----GEIEKEIEQLEQEEEKLKERLEELE 743

Query: 62  EKIQDLESQLSTKEKDLEVNNSNLDDLKKTLANMMSDSEKLYNVYAEKRNINSGLQLDKW 121
           E +  LE ++   + +L+   + +++L++ L  +      L    +  R      +L K 
Sbjct: 744 EDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSRIPEIQAELSKL 803

Query: 122 ATETKHL------IKGLLSEVSYKRVLLKSAMKVIDDRQNLISEKKKRLTQEITEMVKLL 175
             E   +      I+  L+ ++ ++  L+  ++ + +++  + E+ K + +EI E +   
Sbjct: 804 EEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEI-ENLNGK 862

Query: 176 TNAINLRGKQLAYKLNEICDSKQKTLNEKKVTLEQLSRL 214
              +    ++L   L ++         E+     QL  L
Sbjct: 863 KEELEEELEELEAALRDLESRLGDLKKERDELEAQLREL 901



 Score = 35.8 bits (83), Expect = 0.028
 Identities = 40/211 (18%), Positives = 81/211 (38%), Gaps = 27/211 (12%)

Query: 7   LKAKLKEQNQRLITLNHEKAKLEARTKANPGAHADEQEQTNLLISKKQLALKTLREKIQD 66
           L   + E+ Q+L  L  E+ K E            E E   LL        + L  + + 
Sbjct: 189 LDLIIDEKRQQLERLRREREKAERYQALL--KEKREYEGYELL-----KEKEALERQKEA 241

Query: 67  LESQLSTKEKDLEVNNSNLDDLKKTLANMMSDSEKLYNVYAEKRNINSGLQLDKWATETK 126
           +E QL++ E++LE     + +L+K L  +    E+L              ++     E +
Sbjct: 242 IERQLASLEEELEKLTEEISELEKRLEEIEQLLEEL------------NKKIKDLGEEEQ 289

Query: 127 HLIKGLLSEVSYKRVLLKSAMKVIDDRQNLISEKKKRLTQEITEMVKLLTNAINLRGKQL 186
             +K  + E+  +   L+ ++   +       + ++RL +   E+ KLL     L  +  
Sbjct: 290 LRVKEKIGELEAEIASLERSIAEKERELE---DAEERLAKLEAEIDKLLAEIEELEREIE 346

Query: 187 AYK-----LNEICDSKQKTLNEKKVTLEQLS 212
             +     L E     ++ L + +  LE++ 
Sbjct: 347 EERKRRDKLTEEYAELKEELEDLRAELEEVD 377



 Score = 30.0 bits (68), Expect = 2.0
 Identities = 48/231 (20%), Positives = 97/231 (41%), Gaps = 21/231 (9%)

Query: 2   AEMSALKAKLKEQNQRLITLNHE-----KAKLE------ARTKANPGAHADEQEQTNLLI 50
             +  ++  L+E N+++  L  E     K K+       A  + +      E E     +
Sbjct: 265 KRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERL 324

Query: 51  SKKQLALKTLREKIQDLESQLSTKEKDLEVNNSNLDDLKKTLANMMSDSEKLYNVYAEKR 110
           +K +  +  L  +I++LE ++  + K  +       +LK+ L ++ ++ E++   +AE R
Sbjct: 325 AKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETR 384

Query: 111 NINSGLQ--LDKWATETKHLIKGL------LSEVSYKRVLLKSAMKVIDDRQNLISEKKK 162
           +     +  L+K   E   L + L      L  +S +   L +A+  I+ + N + E+K+
Sbjct: 385 DELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKE 444

Query: 163 RLTQEITEMVKLLTNAINLRGK--QLAYKLNEICDSKQKTLNEKKVTLEQL 211
               EI +    L        K  Q  Y L E  D  +K L++ +  L + 
Sbjct: 445 DKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEA 495



 Score = 29.3 bits (66), Expect = 2.9
 Identities = 16/88 (18%), Positives = 35/88 (39%), Gaps = 18/88 (20%)

Query: 2   AEMSALKAKLKEQNQRLITLNHEKAKLEARTKANPGAHADEQEQTNLLISKKQLALKTLR 61
            +   LK ++K   + +  LN +K +LE                    + + + AL+ L 
Sbjct: 840 EQRIDLKEQIKSIEKEIENLNGKKEELEEE------------------LEELEAALRDLE 881

Query: 62  EKIQDLESQLSTKEKDLEVNNSNLDDLK 89
            ++ DL+ +    E  L      +++L+
Sbjct: 882 SRLGDLKKERDELEAQLRELERKIEELE 909


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 38.2 bits (89), Expect = 0.005
 Identities = 41/203 (20%), Positives = 81/203 (39%), Gaps = 17/203 (8%)

Query: 2   AEMSALKAKLKEQNQRLITLNHEKAKLEARTKANPGAHADEQEQTNLLISKKQLALKTLR 61
            E+  L+ +L+E  + L +L    AKL+   +         QE            L+ L 
Sbjct: 751 EELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQE-----------ELEELE 799

Query: 62  EKIQDLESQLSTKEKDLEVNNSNLDDLKKTLANMMSDSEKLYNVYAE-KRNINSGLQLDK 120
           E++++ E +L   E++LE      + L++ +  +  + E+L     E +  +     L+K
Sbjct: 800 EELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEE---LEK 856

Query: 121 WATETKHLIKGLLSEVSYKRVLLKSAMKVIDDRQNLISEKKKRLTQEITEMVKLLTNAIN 180
              E K  ++ L +E       LK   +  ++ +  + E +  L +   E+ KL      
Sbjct: 857 ELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEE 916

Query: 181 LRGKQLAYKLNEICDSKQKTLNE 203
           L  K    +L       ++ L E
Sbjct: 917 LEAKLE--RLEVELPELEEELEE 937



 Score = 36.6 bits (85), Expect = 0.015
 Identities = 31/159 (19%), Positives = 66/159 (41%), Gaps = 2/159 (1%)

Query: 6   ALKAKLKEQNQRLITLNHEKAKLEARTKANPGAHADEQEQTNLLISKKQLALKTLREKIQ 65
            LK K++   + L        +LE        A  + +E+ + L+ + +   + LRE++ 
Sbjct: 327 ELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELA 386

Query: 66  DLESQLSTKEKDLEVNNSNLDDLKKTLANMMSDSEKLYNVYAEKRNINSGLQLDKWATET 125
           +LE++L+    +LE     ++ L++ L  +    E L     E       LQ +      
Sbjct: 387 ELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNE 446

Query: 126 KHLIKGLLSEVSYKRVLLKSAMKVIDDRQNLISEKKKRL 164
           +  ++ L  ++   R  LK   + + + Q  +   +K L
Sbjct: 447 E--LEELEEQLEELRDRLKELERELAELQEELQRLEKEL 483



 Score = 34.7 bits (80), Expect = 0.055
 Identities = 38/193 (19%), Positives = 75/193 (38%), Gaps = 20/193 (10%)

Query: 2   AEMSALKAKLKEQNQRLITLNHEKAKLEARTKANPGAHADEQEQTNLLISKKQLALKTLR 61
            E+  L+ K +   + L  L  E  + E R  A        +++   L  +    ++ L 
Sbjct: 779 EEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQE----IEELE 834

Query: 62  EKIQDLESQLSTKEKDLEVNNSNLDDLKK---TLANMMSDSEKLYNVYAEKRNINSGLQL 118
           E+I++LE +L   E++LE     L++LK+    L     + E       E++      +L
Sbjct: 835 EEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKE-----EL 889

Query: 119 DKWATETKHLIKGLLSEVSYKRVLLKSAMKVID--------DRQNLISEKKKRLTQEITE 170
           ++   E +  +  L  E+   R  L+     ++          + L  E +  L  E+  
Sbjct: 890 EEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETELER 949

Query: 171 MVKLLTNAINLRG 183
            ++ L   I   G
Sbjct: 950 EIERLEEEIEALG 962



 Score = 34.3 bits (79), Expect = 0.076
 Identities = 43/222 (19%), Positives = 92/222 (41%), Gaps = 24/222 (10%)

Query: 2   AEMSALKAKLKEQNQRLITLNHEKAKLEARTKANPGAHADEQEQTNLLISKKQLALKTLR 61
            E+  L+ +L+E  + +  L  E  +L                     + + Q  L  L+
Sbjct: 253 EELEELQEELEEAEKEIEELKSELEELREE------------------LEELQEELLELK 294

Query: 62  EKIQDLESQLSTKEKDLEVNNSNLDDLKKTLANMMSDSEKLYNVYAEKRNINSGLQ---- 117
           E+I++LE ++S   + LE   + L++L++ L  +    E L     E+  +   L+    
Sbjct: 295 EEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLA 354

Query: 118 -LDKWATETKHLIKGLLSEVSYKRVLLKSAMKVIDDRQNLISEKKKRLTQEITEMVKLLT 176
            L++   E +  +  LL E+      L+  +  ++     I  + + L +EI  + + L 
Sbjct: 355 ELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLE 414

Query: 177 NAINLRGKQLAYKLNEICDSKQKTLNEKKVTLEQLSRLTDHC 218
             ++ R + L  +L E+    ++   E +   E+L  L +  
Sbjct: 415 R-LSERLEDLKEELKELEAELEELQTELEELNEELEELEEQL 455



 Score = 33.9 bits (78), Expect = 0.11
 Identities = 41/216 (18%), Positives = 89/216 (41%), Gaps = 14/216 (6%)

Query: 6   ALKAKLKEQNQRLITLNHEKAKLEARTKANPGAHA---DEQEQTNLLISKKQLALKTLRE 62
           A K +LKE  + L  L  +  KLE   K+         D  E+    + + +  L+ L+ 
Sbjct: 664 AQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKR 723

Query: 63  KIQDLESQLSTKEKDLEVNNSNLDDLKKTLANMMSDSEKLYNVYAEKRNINSGLQLDKWA 122
           ++  LE +L   +  LE     L++L++ L  +    E+L      +  + S   L++  
Sbjct: 724 ELAALEEELEQLQSRLEELEEELEELEEELEELQERLEEL------EEELES---LEEAL 774

Query: 123 TETKHLIKGLLSEVSYKRVLLKSAMKVIDDRQNLISEKKKRLTQEITEMVKLLT--NAIN 180
            + K  I+ L  +    +  L+   + +++ +  +   ++ L        +L      + 
Sbjct: 775 AKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELE 834

Query: 181 LRGKQLAYKLNEICDSKQKTLNEKKVTLEQLSRLTD 216
              ++L  KL+E+ +  ++   E +   E+L  L  
Sbjct: 835 EEIEELEEKLDELEEELEELEKELEELKEELEELEA 870


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 37.0 bits (86), Expect = 0.012
 Identities = 49/210 (23%), Positives = 87/210 (41%), Gaps = 12/210 (5%)

Query: 3   EMSALKAKLKEQNQRLITLNHEKAKLEARTKANPGAHADEQEQTNLLISKKQLALKTLRE 62
           E+  L A+L+E  ++L  L  E ++LE   +        E       IS+ +   + LRE
Sbjct: 254 ELEELTAELQELEEKLEELRLEVSELEEEIEELQK----ELYALANEISRLEQQKQILRE 309

Query: 63  KIQDLESQLSTKEKDLEVNNSNLDDLKKTLANMMSDSEKLYNVYAEKRNINSGLQLDKWA 122
           ++ +LE QL   E  LE   S LD+L + LA +    E+L             L+ +   
Sbjct: 310 RLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEE 369

Query: 123 TETK-HLIKGLLSEVSYKRVLLKSAMKVIDDRQNLISEKKKRLTQEITEMVKLLTNAINL 181
            E++   ++  L  +  K   L+  +  +++    +  + +RL        +L      L
Sbjct: 370 LESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRE---RLQQEIEEL 426

Query: 182 RGKQLAYKLNEICDSKQKTLNEKKVTLEQL 211
             K    +L E+    Q  L E +  LE+L
Sbjct: 427 LKKLEEAELKEL----QAELEELEEELEEL 452



 Score = 33.1 bits (76), Expect = 0.18
 Identities = 40/172 (23%), Positives = 75/172 (43%), Gaps = 11/172 (6%)

Query: 42  EQEQTNLLISKKQLALKTLREKIQDLESQLSTKEKDLEVNNSNLDDLKKTLANMMSDSEK 101
           + E+    I++ + AL  LR+++++LE +L    K+LE  +  +  L+K LA + ++ E+
Sbjct: 685 KIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQ 744

Query: 102 LYNVYAEKRNINSGLQLDKWATETKHLIKGLLSEVSYKRVLLKSAMKVIDDRQNLISEKK 161
           L    A         QL K  TE +  I+ L   +      L  A   I++ +  I + K
Sbjct: 745 LEERIA---------QLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLK 795

Query: 162 KRLTQEITEMVKLLTNAINLRGKQLAYKLNEICDSKQKTLNEKKVTLEQLSR 213
           + L      + +L      L  +     L E  +S ++ +   +  LE L  
Sbjct: 796 EELKALREALDELRAELTLLNEEAA--NLRERLESLERRIAATERRLEDLEE 845



 Score = 30.4 bits (69), Expect = 1.3
 Identities = 39/221 (17%), Positives = 83/221 (37%), Gaps = 18/221 (8%)

Query: 10  KLKEQNQRLITLNHEKAKLEARTKANPGAHADEQEQTNLL---ISKKQLALKTLREKIQD 66
           ++    + L  L  E  +LE R         + + +   L   + + +  L     +I++
Sbjct: 727 QISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEE 786

Query: 67  LESQLSTKEKDLEVNNSNLDDLKKTLANMMSDSEKLYNVYAEKRNINSGLQLDKWATETK 126
           LE+Q+   +++L+     LD+L+  L  +  +        A  R     L+    ATE  
Sbjct: 787 LEAQIEQLKEELKALREALDELRAELTLLNEE-------AANLRERLESLERRIAATER- 838

Query: 127 HLIKGLLSEVSYKRVLLKSAMKVIDDRQNLISEKKKRLTQEITEMVKLLTNAINLRGKQL 186
             ++ L  ++      ++S    I++ + LI E +  L + +      L  A+ L   +L
Sbjct: 839 -RLEDLEEQIEELSEDIESLAAEIEELEELIEELESEL-EALLNERASLEEALALLRSEL 896

Query: 187 AYKLNEICDSKQKTLNEKKVTLEQLSRLTDHCIEFVNNGLN 227
                E    + + L  K+  L +        +  +   L 
Sbjct: 897 -----EELSEELRELESKRSELRRELEELREKLAQLELRLE 932


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 35.9 bits (84), Expect = 0.020
 Identities = 36/187 (19%), Positives = 73/187 (39%), Gaps = 25/187 (13%)

Query: 11  LKEQNQRLITLNHEKAKL---EARTKANPGAHADEQEQTNLLISKKQLALKTLREK---I 64
            K+  +  I    E+AK    EA+ +A         E    +   +    K LRE+   +
Sbjct: 25  RKKIAEAKIKEAEEEAKRILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNEL 84

Query: 65  QDLESQLSTKEKDLEVNNSNLDDLKKTLANMMSDSEKLYNVYAEKRNINSGLQLDKWATE 124
           Q LE +L  KE++L+     L+  ++ L     + E+               +L+K   E
Sbjct: 85  QKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQ------------ELEKKEEE 132

Query: 125 TKHLIKGLLSEVSYKRVLLK-SAMKVIDDRQNLISEKKKRLTQEITEMVKLLTNAINLRG 183
            + LI+  L E      L + S +   + ++ L+ + ++    E   ++K +        
Sbjct: 133 LEELIEEQLQE------LERISGLTAEEAKEILLEKVEEEARHEAAVLIKEIEEEAKEEA 186

Query: 184 KQLAYKL 190
            + A ++
Sbjct: 187 DKKAKEI 193


>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
          Length = 562

 Score = 36.1 bits (84), Expect = 0.020
 Identities = 39/213 (18%), Positives = 94/213 (44%), Gaps = 28/213 (13%)

Query: 2   AEMSAL-KAKLKEQNQRLITLNHEKAKLEARTKANPGAHADEQEQTNLLISKKQLALKTL 60
           +EM  L K K++E NQ++ TL+ +   ++ + K       +++++    I++KQ     L
Sbjct: 166 SEMDKLNKDKIRELNQQIQTLDMKIDHIQQQIKTYNKNIEEQRKKNGENIARKQNKYDEL 225

Query: 61  REKIQDLESQLSTKEKDLEVNNSNLDDLKKTLANMMSDSEKLYNVYAEKRNINSGLQLDK 120
            E+ + +++++     +L     +++D    L        KL    A+ ++     ++++
Sbjct: 226 VEEAKTIKAEIEELTDELLNLVMDIEDPSAALN-------KLNTAAAKIKS-----KIEQ 273

Query: 121 WATETKHLIKGLLSEVSYKRVLLKSAMKVIDDRQNLISEKKKRLTQEITEMVKLLTNAIN 180
           +    K   KG          +  +  + I +  + I++ K +L  E+   ++ L  AI+
Sbjct: 274 FQKVIKMYEKG---------GVCPTCTQQISEGPDRITKIKDKLK-ELQHSLEKLDTAID 323

Query: 181 LRGKQLAYKLNEICDSKQKTLNEKKVTLEQLSR 213
               +L   ++E  + + K L E K  +    +
Sbjct: 324 ----ELEEIMDEF-NEQSKKLLELKNKISTNKQ 351


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 34.7 bits (80), Expect = 0.066
 Identities = 39/217 (17%), Positives = 87/217 (40%), Gaps = 6/217 (2%)

Query: 2   AEMSALKAKLKEQNQRLITLNHEKAKLEARTKANPGAHADEQEQTNLLISKKQLALKTLR 61
            E+  +  ++ E +  L  L  E  KLE   K       +E E+    +   + + + L 
Sbjct: 200 KELEEVLREINEISSELPELREELEKLEKEVK-ELEELKEEIEELEKELESLEGSKRKLE 258

Query: 62  EKIQDLESQLSTKEKDLEVNNSNLDDLK--KTLANMMSDSEKLYNVYAEKRNINSGLQLD 119
           EKI++LE ++   +K++E     + +LK  K  A       + Y  Y ++       +L 
Sbjct: 259 EKIRELEERIEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELR-EIEKRLS 317

Query: 120 KWATETKHLIKGLLSEVSYKRVLLKSAMKVIDDRQNLISEKKKRLTQEITEMVKLLTNAI 179
           +   E   + +  + E+  K   L+   K + + +  + E ++R  +   E         
Sbjct: 318 RLEEEINGIEE-RIKELEEKEERLEELKKKLKELEKRLEELEERH-ELYEEAKAKKEELE 375

Query: 180 NLRGKQLAYKLNEICDSKQKTLNEKKVTLEQLSRLTD 216
            L+ +       ++    ++    K+   E++S++T 
Sbjct: 376 RLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKITA 412



 Score = 29.3 bits (66), Expect = 2.9
 Identities = 43/193 (22%), Positives = 80/193 (41%), Gaps = 28/193 (14%)

Query: 3   EMSALKAKLKEQNQRLITLNHEKAKLEARTKANPGAHADEQEQT------------NLLI 50
           ++  LK KL E  ++L  L  E A+L    +       +E E+              L  
Sbjct: 550 KLEELKKKLAELEKKLDELEEELAELLKELEELGFESVEELEERLKELEPFYNEYLELKD 609

Query: 51  SKKQLALKTLREKIQDLESQLSTKEKDLEVNNSNLDDLKKTLANMMS-----DSEKLYNV 105
           ++K+L  +   ++++ LE +L    ++L      L++L+K L  +       + E+L   
Sbjct: 610 AEKEL--EREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELREE 667

Query: 106 YAEKRNINSGL-----QLDKWATETKHLIKGL---LSEVSYKRVLLKSAMKVIDDRQNLI 157
           Y E     +GL     +L+K   E K  ++ L   L E    +  L+   K ++  + L 
Sbjct: 668 YLELSRELAGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELEKLEKALERVEELR 727

Query: 158 SEKKKRLTQEITE 170
            EK K+    + E
Sbjct: 728 -EKVKKYKALLKE 739



 Score = 28.1 bits (63), Expect = 7.8
 Identities = 28/129 (21%), Positives = 56/129 (43%), Gaps = 17/129 (13%)

Query: 50  ISKKQLALKTLREKIQDLESQLSTKEKDLEVNNSNLDDLKKTLANMMSDS---------- 99
           +  +  +LK   EK+++L+ +L+  EK L+     L +L K L  +  +S          
Sbjct: 537 LKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLKELEELGFESVEELEERLKE 596

Query: 100 -EKLYNVYAEKRNINSGLQLDKWATETKHLIKGLLSEVSYKRVLLKSAMKVIDDRQNLIS 158
            E  YN Y E ++    L+        +  +K L  E+      L    K +++ +  + 
Sbjct: 597 LEPFYNEYLELKDAEKELE------REEKELKKLEEELDKAFEELAETEKRLEELRKELE 650

Query: 159 EKKKRLTQE 167
           E +K+ ++E
Sbjct: 651 ELEKKYSEE 659


>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
           autophagy-related subunit 14.  The Atg14 or Apg14
           proteins are hydrophilic proteins with a predicted
           molecular mass of 40.5 kDa, and have a coiled-coil motif
           at the N terminus region. Yeast cells with mutant Atg14
           are defective not only in autophagy but also in sorting
           of carboxypeptidase Y (CPY), a vacuolar-soluble
           hydrolase, to the vacuole. Subcellular fractionation
           indicate that Apg14p and Apg6p are peripherally
           associated with a membrane structure(s). Apg14p was
           co-immunoprecipitated with Apg6p, suggesting that they
           form a stable protein complex. These results imply that
           Apg6/Vps30p has two distinct functions: in the
           autophagic process and in the vacuolar protein sorting
           pathway. Apg14p may be a component specifically required
           for the function of Apg6/Vps30p through the autophagic
           pathway. There are 17 auto-phagosomal component proteins
           which are categorized into six functional units, one of
           which is the AS-PI3K complex (Vps30/Atg6 and Atg14). The
           AS-PI3K complex and the Atg2-Atg18 complex are essential
           for nucleation, and the specific function of the AS-PI3K
           apparently is to produce phosphatidylinositol
           3-phosphate (PtdIns(3)P) at the pre-autophagosomal
           structure (PAS). The localisation of this complex at the
           PAS is controlled by Atg14. Autophagy mediates the
           cellular response to nutrient deprivation, protein
           aggregation, and pathogen invasion in humans, and
           malfunction of autophagy has been implicated in multiple
           human diseases including cancer. This effect seems to be
           mediated through direct interaction of the human Atg14
           with Beclin 1 in the human phosphatidylinositol 3-kinase
           class III complex.
          Length = 307

 Score = 33.5 bits (77), Expect = 0.12
 Identities = 26/136 (19%), Positives = 57/136 (41%), Gaps = 9/136 (6%)

Query: 11  LKEQNQRLITLNHEKAKLEARTKANPGAHADEQEQTNLLISKKQLALKTL--REKIQDLE 68
           L E    L  L  E  +L+ + +       +     +  ++   L L+    +E++  + 
Sbjct: 22  LYELRLDLARLLLENEELKQKVEE----ALEGATNEDGKLAADLLKLEVARKKERLNQIR 77

Query: 69  SQLSTKEKDLEVNNSNLDDLKKTLANMMSDSEKLYNVYAEKRN-INSGLQLDKWATETKH 127
           +++S  ++++E     +++LK+ LA   SD         ++R      LQ +     T+ 
Sbjct: 78  ARISQLKEEIEQKRERIEELKRALAQRRSDLSSASYQLEKRRASQLEKLQDE--IKRTRS 135

Query: 128 LIKGLLSEVSYKRVLL 143
            +  L S ++ KR  L
Sbjct: 136 KLNALHSLLAEKRSFL 151


>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein.  Members of this
           protein family have a signal peptide, a strongly
           conserved SH3 domain, a variable region, and then a
           C-terminal hydrophobic transmembrane alpha helix region.
          Length = 198

 Score = 31.5 bits (72), Expect = 0.32
 Identities = 16/88 (18%), Positives = 38/88 (43%), Gaps = 2/88 (2%)

Query: 2   AEMSALKAKLKEQNQRLITLNHEKAKLEARTKANPGAHADEQEQTNLL--ISKKQLALKT 59
            E++ L+ +L E  ++L  L  E  +L+             Q++   +  +S   + L  
Sbjct: 73  QELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIELDE 132

Query: 60  LREKIQDLESQLSTKEKDLEVNNSNLDD 87
              ++++  ++L  + + LE  N  L +
Sbjct: 133 ENRELREELAELKQENEALEAENERLQE 160


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 32.0 bits (73), Expect = 0.39
 Identities = 44/221 (19%), Positives = 93/221 (42%), Gaps = 16/221 (7%)

Query: 3   EMSALKAKLKEQNQRLITLNHEK---AKLEARTKANPGAHADEQEQTNLLISKKQLALKT 59
           E   L+ +++   +RL  L  EK    +L+AR         +  +     + + +  L+ 
Sbjct: 226 EEEELEQEIEALEERLAELEEEKERLEELKARLLEIESLELEALKIREEELRELERLLEE 285

Query: 60  LREKIQDLE------SQLSTKEKDLEVNNSNLDDLKKTLANMMSDSEKLYNVYAEKRNIN 113
           L EKI+ LE       +L  + + L      L++L + L ++    EKL     +  +  
Sbjct: 286 LEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESEL 345

Query: 114 SGLQLDKWATETKHLIKGLLSEVSYKRVLLKSAMKVIDDRQNLISEKKKRLTQEITEMVK 173
             L  +K   E   L++  L E+  +   L+  ++   +R   + E  + L +E+ E   
Sbjct: 346 EELAEEK--NELAKLLEERLKELEERLEELEKELEKALERLKQLEEAIQELKEELAE--- 400

Query: 174 LLTNAINLRGKQLAYKLNEICDSKQKTLNEKKVTLEQLSRL 214
            L+ A+    ++L     E+ + ++  L E +  +++L   
Sbjct: 401 -LSAALEEIQEELEELEKELEELER-ELEELEEEIKKLEEQ 439



 Score = 29.3 bits (66), Expect = 3.0
 Identities = 38/224 (16%), Positives = 92/224 (41%), Gaps = 13/224 (5%)

Query: 1   MAEMSALKAKLKEQNQRLITLNHEKAKLEARTKANPGAHADEQEQTNLLISKKQLALKTL 60
             E+  L+ +L+E  + LI L   +  L+   +        E     L   K++L L+ L
Sbjct: 500 REEIEELEKELRELEEELIELLELEEALKEELEEK--LEKLENLLEELEELKEKLQLQQL 557

Query: 61  REKIQDLESQLSTKEKDLEVNNSN------LDDLKKTLANMMSDSEKLYNVYAEKRNINS 114
           +E+++ LE +L   ++ LE           L++L++ L  +    ++L    ++   +  
Sbjct: 558 KEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKELEERLSQLEELLQ 617

Query: 115 GLQLDKWATETKHLIKGLLSEVSYKRVLLKSAMKVIDDRQNLISE-KKKRLTQEITEMVK 173
            L+L +   E    ++    E+  +   L    ++ +  Q  + E ++K    E     +
Sbjct: 618 SLELSEAENE----LEEAEEELESELEKLNLQAELEELLQAALEELEEKVEELEAEIRRE 673

Query: 174 LLTNAINLRGKQLAYKLNEICDSKQKTLNEKKVTLEQLSRLTDH 217
           L       + ++   +L ++ +  ++   E +  L++L  +   
Sbjct: 674 LQRIENEEQLEEKLEELEQLEEELEQLREELEELLKKLGEIEQL 717


>gnl|CDD|239423 cd03307, Mta_CmuA_like, MtaA_CmuA_like family. MtaA/CmuA, also
           MtsA, or methyltransferase 2 (MT2) MT2-A and MT2-M
           isozymes, are methylcobamide:Coenzyme M
           methyltransferases, which play a role in metabolic
           pathways of methane formation from various substrates,
           such as methylated amines and methanol. Coenzyme M,
           2-mercaptoethylsulfonate or CoM, is methylated during
           methanogenesis in a reaction catalyzed by three
           proteins. A methyltransferase methylates the corrinoid
           cofactor, which is bound to a second polypeptide, a
           corrinoid protein. The methylated corrinoid protein then
           serves as a substrate for MT2-A and related enzymes,
           which methylate CoM.
          Length = 326

 Score = 31.9 bits (73), Expect = 0.39
 Identities = 22/86 (25%), Positives = 37/86 (43%), Gaps = 13/86 (15%)

Query: 186 LAYKLNEICDSKQKTLNEKKVTLEQLSRLTDHCIEFVNNGLNTGSDM----------ALL 235
           LA  L  + +  +  + + +   E L  LT+ CIE+    L  G+D+           L+
Sbjct: 141 LASHLAGVENFLKWLIKKPEKVREFLEFLTEACIEYAKAQLEAGADIITIADPTASPELI 200

Query: 236 SSKAYKRVLL---KSAMKVIDDRPTI 258
           S + Y+   L   K  +K +   PTI
Sbjct: 201 SPEFYEEFALPYHKKIVKELHGCPTI 226


>gnl|CDD|222949 PHA02947, PHA02947, S-S bond formation pathway protein;
           Provisional.
          Length = 215

 Score = 30.9 bits (70), Expect = 0.51
 Identities = 18/67 (26%), Positives = 33/67 (49%), Gaps = 1/67 (1%)

Query: 46  TNLLISKKQLALKTLREKIQDLESQLSTKEKDLEVNNSNLDDLKKTLANMMSDSEKLYNV 105
            N  ++ K+L+   L +  +++ S L  KE+  E+ N    DL     + +S+ E+  N 
Sbjct: 52  INKCLNNKRLSFTLLIKTFKEVISTLPEKERR-ELANEIGIDLDNDDPSYVSELERNCNA 110

Query: 106 YAEKRNI 112
            A+  NI
Sbjct: 111 SADVNNI 117


>gnl|CDD|200553 cd10927, CE4_u3, Putative catalytic domain of uncharacterized
           bacterial proteins from the carbohydrate esterase 4
           superfamily.  This family corresponds to a group of
           uncharacterized bacterial proteins with high sequence
           similarity to the catalytic domain of the six-stranded
           barrel rhizobial NodB-like proteins, which remove
           N-linked or O-linked acetyl groups from cell wall
           polysaccharides and belong to the larger carbohydrate
           esterase 4 (CE4) superfamily.
          Length = 227

 Score = 30.9 bits (70), Expect = 0.56
 Identities = 12/41 (29%), Positives = 18/41 (43%), Gaps = 6/41 (14%)

Query: 249 MKVIDDRPTIPEDTSYKITLKSPSSSTSSKSVTPPSSSVNI 289
            +V+ + P I      K+ L  P S  + K V    + VNI
Sbjct: 130 ARVLAEIPDI------KVWLYKPESDVAGKFVLRRINKVNI 164


>gnl|CDD|220410 pfam09798, LCD1, DNA damage checkpoint protein.  This is a family
           of proteins which regulate checkpoint kinases. In
           Schizosaccharomyces pombe this protein is called Rad26
           and in Saccharomyces cerevisiae it is called LCD1.
          Length = 648

 Score = 31.3 bits (71), Expect = 0.57
 Identities = 52/237 (21%), Positives = 93/237 (39%), Gaps = 30/237 (12%)

Query: 60  LREKIQDLESQLSTKEKDLEVNNSNLDDLK-KTLANMMSDSEKLYNVYAEKRNINSGLQL 118
           LR+K+  L+ Q   K+++     S +++LK K    +    ++L ++  E++ +      
Sbjct: 2   LRDKLDMLQQQ---KQEERNKQKSRVNELKEKHDQELQKLKQELQSLEDERKFLV----- 53

Query: 119 DKWATETKHLIKG-LLSEVSYKRVLLKS--AMKVIDDRQNLISEKKKRLTQEITEMVKLL 175
                E + L    L +E+S    LLK+  A  + D+ Q    ++KKR    + E+V L 
Sbjct: 54  ----LEQRGLSANDLRTELSPPSNLLKTSDASHIADESQPNSIKQKKREISPVKELVPLN 109

Query: 176 TNAINLRGKQLAYKLNEICDSKQKTLNEKKVTLEQLSRL-TDHCIEFVNNGLNTGSDMAL 234
            N I    K L            K    K  TLE L+ +  D   E +       S   L
Sbjct: 110 PNRIIKDDKSLFLD----HILNHKIEGSKLSTLEMLNHIKLDKIDEILGKNKKISSGDPL 165

Query: 235 LSSKAYKRVLLKSAMKVIDDRPTIPEDTSYKITLKSPSSSTSSKSVTPPSSSVNIPL 291
              K+   +LL+        +  +  D      L++ +    S  V P    +++P 
Sbjct: 166 --GKSIVSLLLRL-------KSNLKLDELVDTVLEALADLIKSIIVLPKELKLSVPF 213


>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD.  This
           model describes the DndB protein encoded by an operon
           associated with a sulfur-containing modification to DNA.
           The operon is sporadically distributed in bacteria, much
           like some restriction enzyme operons. DndD is described
           as a putative ATPase. The small number of examples known
           so far include species from among the Firmicutes,
           Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
           metabolism, Restriction/modification].
          Length = 650

 Score = 31.2 bits (71), Expect = 0.61
 Identities = 21/84 (25%), Positives = 37/84 (44%), Gaps = 7/84 (8%)

Query: 2   AEMSALKAKLKEQNQRLITLN--HEKAKLEARTKANPGAHADEQEQTNLLISK-----KQ 54
            E+   +A+++E  ++L TL    E  +     K     +A E E+   +  K     K+
Sbjct: 435 NELFRSEAEIEELLRQLETLKEAIEALRKTLDEKTKQKINAFELERAITIADKAKKTLKE 494

Query: 55  LALKTLREKIQDLESQLSTKEKDL 78
              K L  K+Q LE +++   K L
Sbjct: 495 FREKLLERKLQQLEEEITKSFKKL 518


>gnl|CDD|216366 pfam01208, URO-D, Uroporphyrinogen decarboxylase (URO-D). 
          Length = 338

 Score = 30.2 bits (69), Expect = 1.3
 Identities = 13/51 (25%), Positives = 25/51 (49%), Gaps = 8/51 (15%)

Query: 211 LSRLTDHCIEFVNNGLNTGSDM--------ALLSSKAYKRVLLKSAMKVID 253
           L +LTD  IE++   +  G+D          LLS + ++  +L    +++D
Sbjct: 175 LDKLTDAVIEYLKAQIEAGADAIQIFDSWAGLLSPEDFREFVLPYLKRIVD 225


>gnl|CDD|240411 PTZ00419, PTZ00419, valyl-tRNA synthetase-like protein;
           Provisional.
          Length = 995

 Score = 30.4 bits (69), Expect = 1.4
 Identities = 30/160 (18%), Positives = 56/160 (35%), Gaps = 23/160 (14%)

Query: 25  KAKLEARTKANPGAHADEQEQTNLLISKKQLALKTLREKIQDLESQLSTKEKDLEVNNSN 84
            A L    K  P  +   ++   + + +    L +   KI  +      +E+        
Sbjct: 848 IATLGIPNKTKPDCYVTAKDAELIELIESAENLISTLAKIGSVSVIPPIEEEAEVPKGCG 907

Query: 85  LDDL-KKTLANMMSDSEKLYNVYAEKRNINSGLQLDKWATETKHLIKGLLSEVS---YKR 140
            D +  K +  +  D     ++  E   +    +L K     +  ++  L ++S   Y+ 
Sbjct: 908 FDVVDNKVIIYLNLDEF--IDLKKELAKLEK--KLAK----LQKSLESYLKKISIPNYE- 958

Query: 141 VLLKSAMKVIDDRQNLISEKKKRLTQEITEMVKLLTNAIN 180
                  KV +D + L  EK   L +EI    K L  AI 
Sbjct: 959 ------DKVPEDVRKLNDEKIDELNEEI----KQLEQAIE 988


>gnl|CDD|213361 cd12827, EcCorA_ZntB-like_u2, uncharacterized bacterial subfamily
           of the Escherichia coli CorA-Salmonella typhimurium ZntB
           family.  A uncharacterized subfamily of the Escherichia
           coli CorA-Salmonella typhimurium ZntB (EcCorA-ZntB_like)
           family of the MIT superfamily of essential membrane
           proteins involved in transporting divalent cations
           (uptake or efflux) across membranes.The EcCorA-ZntB-like
           family includes the Mg2+ transporters Escherichia coli
           and Salmonella typhimurium CorAs, which can also
           transport Co2+, and Ni2+. Structures of the
           intracellular domain of EcCorA-ZntB-like family members,
           Vibrio parahaemolyticus and Salmonella typhimurium ZntB,
           form funnel-shaped homopentamers, the tip of the funnel
           is formed from two C-terminal transmembrane (TM) helices
           from each monomer, and the large opening of the funnel
           from the N-terminal cytoplasmic domains. The GMN
           signature motif of the MIT superfamily occurs just after
           TM1, mutation within this motif is known to abolish Mg2+
           transport through Salmonella typhimurium CorA. Natural
           variants such as GVN and GIN, such as occur in some ZntB
           family proteins, may be associated with the transport of
           different divalent cations, such as zinc and cadmium.
           The functional diversity of MIT transporters may also be
           due to minor structural differences regulating gating,
           substrate selection, and transport.
          Length = 289

 Score = 29.8 bits (68), Expect = 1.4
 Identities = 24/88 (27%), Positives = 39/88 (44%), Gaps = 13/88 (14%)

Query: 14  QNQRLITLNHEKAKLEARTKANPGAHADEQEQTNLL------ISKKQL-ALKTLREKIQD 66
              RLIT+ + +  L          + D   +T  L      I+   L  L+ + ++  +
Sbjct: 74  TKDRLITICNHETDLLDDFLEGRVKNFDTSSKTRFLLQILYRIATSYLKYLRQINKERDE 133

Query: 67  LESQL--STKEKDLEVNNSNLDDLKKTL 92
           LE +L  STK K+L      L DL+K+L
Sbjct: 134 LEQELRKSTKNKELF----QLLDLEKSL 157


>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
           [General function prediction only].
          Length = 239

 Score = 29.6 bits (67), Expect = 1.5
 Identities = 31/171 (18%), Positives = 63/171 (36%), Gaps = 28/171 (16%)

Query: 17  RLITLNHEKAKLEARTKANPGAHAD---EQEQTNLLISKKQLALKTL------------- 60
            +  L+ EK +LE R K    A      E E  N  +   ++ L+ L             
Sbjct: 11  AIQKLDLEKDRLEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQE 70

Query: 61  -REKIQDLESQLS--TKEKDLEVNNSNLDDLKKTLANMMSDSEKLYNVYAEKRNINSGLQ 117
            RE+I+  E +LS    E++L   N  +   K+ + ++  +  +L     +       L+
Sbjct: 71  IRERIKRAEEKLSAVKDERELRALNIEIQIAKERINSLEDELAELMEEIEKLEKEIEDLK 130

Query: 118 LDKWATETKHLIKGLLSEVSYKRVLLKSAMKVIDDRQNLISEKKKRLTQEI 168
                 E           ++     L+  +  I +    +S K++ L +++
Sbjct: 131 ERLERLEK---------NLAEAEARLEEEVAEIREEGQELSSKREELKEKL 172


>gnl|CDD|217933 pfam04156, IncA, IncA protein.  Chlamydia trachomatis is an
           obligate intracellular bacterium that develops within a
           parasitophorous vacuole termed an inclusion. The
           inclusion is non-fusogenic with lysosomes but intercepts
           lipids from a host cell exocytic pathway. Initiation of
           chlamydial development is concurrent with modification
           of the inclusion membrane by a set of C.
           trachomatis-encoded proteins collectively designated
           Incs. One of these Incs, IncA, is functionally
           associated with the homotypic fusion of inclusions. This
           family probably includes members of the wider Inc family
           rather than just IncA.
          Length = 186

 Score = 29.3 bits (66), Expect = 1.8
 Identities = 21/104 (20%), Positives = 44/104 (42%), Gaps = 3/104 (2%)

Query: 2   AEMSALKAKLKEQNQRLITLNHEKAKLEARTKANPGAHADEQEQTNLLISKKQLA---LK 58
            E+S LK +L E  + L  L    A+LE+  +         +E    L  + +     +K
Sbjct: 79  GELSELKQQLSELQEELEDLEERIAELESELEDLKEDLQLLRELLKSLEERLESLEESIK 138

Query: 59  TLREKIQDLESQLSTKEKDLEVNNSNLDDLKKTLANMMSDSEKL 102
            L +++++L   L  + ++L      L +  + L   + + + L
Sbjct: 139 ELAKELRELRQDLREEVEELREELERLQENLQRLQEAIQELQSL 182


>gnl|CDD|185600 PTZ00410, PTZ00410, NAD-dependent SIR2; Provisional.
          Length = 349

 Score = 29.5 bits (66), Expect = 1.9
 Identities = 17/65 (26%), Positives = 25/65 (38%), Gaps = 6/65 (9%)

Query: 230 SDMALLSSKAYKRVLLK------SAMKVIDDRPTIPEDTSYKITLKSPSSSTSSKSVTPP 283
           + +A +  K   RVL           +   D  T     S     +S SSS+ S S +  
Sbjct: 222 ALLACVVPKDVPRVLFNLERVGGLMFRFPTDPLTTFHADSVAKEGRSSSSSSRSSSDSST 281

Query: 284 SSSVN 288
           SSS +
Sbjct: 282 SSSSD 286


>gnl|CDD|182809 PRK10884, PRK10884, SH3 domain-containing protein; Provisional.
          Length = 206

 Score = 29.2 bits (66), Expect = 2.0
 Identities = 12/37 (32%), Positives = 23/37 (62%)

Query: 56  ALKTLREKIQDLESQLSTKEKDLEVNNSNLDDLKKTL 92
            +  L+E+ Q L++QL   +K ++  N  LDD ++T+
Sbjct: 133 VINGLKEENQKLKNQLIVAQKKVDAANLQLDDKQRTI 169


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 29.2 bits (66), Expect = 2.7
 Identities = 7/42 (16%), Positives = 21/42 (50%)

Query: 53  KQLALKTLREKIQDLESQLSTKEKDLEVNNSNLDDLKKTLAN 94
            +  ++ L+++++ LE ++   E ++E     L+  K    +
Sbjct: 93  IEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLD 134



 Score = 28.8 bits (65), Expect = 4.1
 Identities = 21/92 (22%), Positives = 40/92 (43%), Gaps = 6/92 (6%)

Query: 3   EMSALKAKLKEQNQRLITLNHEKAKLEARTKANPGAHADEQEQTNLLISKKQLALKTLRE 62
            + A+   ++E N+++  L  EK K + +   +  A   E       I   Q   K L +
Sbjct: 369 IIDAINELIREHNEKIDNLKKEKNKAKKKLWLHLVAELKE------DIDAYQKEKKGLEK 422

Query: 63  KIQDLESQLSTKEKDLEVNNSNLDDLKKTLAN 94
            I  LE ++   E +++     + +L+K L N
Sbjct: 423 AINSLEKEIKQLEAEIKALEKEIKELEKQLTN 454


>gnl|CDD|235519 PRK05588, PRK05588, histidinol-phosphatase; Provisional.
          Length = 255

 Score = 28.8 bits (65), Expect = 2.8
 Identities = 18/56 (32%), Positives = 31/56 (55%), Gaps = 5/56 (8%)

Query: 53  KQLALKTLREKIQDLESQLSTKEKDLEVNNSNLDDLKKTLANMMSDSEKLYNVYAE 108
           K++     +E I ++   L  KEK LE+N   LDD K+++ N++    K+Y  + E
Sbjct: 157 KEIYYDEFKEIIDEILKVLIEKEKVLEINTRRLDD-KRSVENLV----KIYKRFYE 207


>gnl|CDD|224000 COG1074, RecB, ATP-dependent exoDNAse (exonuclease V) beta subunit
           (contains helicase and exonuclease domains) [DNA
           replication, recombination, and repair].
          Length = 1139

 Score = 29.4 bits (66), Expect = 2.8
 Identities = 28/170 (16%), Positives = 64/170 (37%), Gaps = 11/170 (6%)

Query: 24  EKAKLEARTKANPGAHADEQEQTNLLISKKQLALKTLREKIQDLESQLSTKEKDLEVNNS 83
           +KA +  + K    A   E E   L    K+ +     E +     +LS  +    ++ +
Sbjct: 218 QKALIILQLKQAWLASLLELEALILGGLDKRASSL---ENLLKWIEKLSAWDNFERLDKA 274

Query: 84  NLDDLKKTLANMMSDSEKLYNVYAEKRNINSGLQLDKWATETKHLIKGLLSEVSYKRVLL 143
               L+K         +K        R         ++      ++  L  E+      L
Sbjct: 275 LPTKLEKIFFQKK-LKKKTKEAKLLARPP----LHAEFEEIENRIL--LADELLRLAPRL 327

Query: 144 KSAMKVIDD-RQNLISEKKKRLTQEITEMVKLLTNAINLRGKQLAYKLNE 192
           ++ +K++++  +   +EK++R   +  +++  L   +   G++LA  L E
Sbjct: 328 RALLKLLEELIERYDAEKRQRNALDFDDLIHRLAKLLTSEGEELAESLRE 377


>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
          Length = 1113

 Score = 29.5 bits (67), Expect = 2.8
 Identities = 43/222 (19%), Positives = 82/222 (36%), Gaps = 48/222 (21%)

Query: 6   ALKAKLKEQNQRLITLNHEKAKLEARTK------ANPGAHADEQEQTNLLISKKQLALKT 59
           AL  K+  Q +    L  + A+  A+ +             DE+ +  L      L+L+ 
Sbjct: 70  ALLDKIDRQKEETEQLKQQLAQAPAKLRQAQAELEALKDDNDEETRETL----STLSLRQ 125

Query: 60  LREKIQDLESQLSTKEKDLEVNNSNLDDLK----KTLANMMSDSE-------KLYNVYAE 108
           L  ++     QL   + DL   NS L  L+    +  A + ++S+        L      
Sbjct: 126 LESRLAQTLDQLQNAQNDLAEYNSQLVSLQTQPERAQAALYANSQRLQQIRNLLKGGKVG 185

Query: 109 KRNINSGLQLDKWATETKHLIKGLLSEVSYKRVLLKSAMKVID---DRQNLISEKKKRLT 165
            + +    +      E   L     ++   +R  L+   ++ D    +++ ++ + +RL 
Sbjct: 186 GKALRPSQR-VLLQAEQALLN----AQNDLQRKSLEGNTQLQDLLQKQRDYLTARIQRLE 240

Query: 166 QEITEMVKLLTNAINLRGKQLAYKLNEICDSKQKTLNEKKVT 207
            ++     LL  AIN               SK+ TL+EK V 
Sbjct: 241 HQLQ----LLQEAIN---------------SKRLTLSEKTVQ 263


>gnl|CDD|152058 pfam11622, DUF3251, Protein of unknown function (DUF3251).  This
          family of proteins with unknown function appears to be
          restricted to Enterobacteriaceae. Some members if this
          family are annotated as putative lipoprotein YajI
          however this cannot be confirmed.
          Length = 156

 Score = 28.3 bits (63), Expect = 2.8
 Identities = 11/37 (29%), Positives = 17/37 (45%)

Query: 3  EMSALKAKLKEQNQRLITLNHEKAKLEARTKANPGAH 39
          E+  L   +   NQ L TLN +  K+E +   N  + 
Sbjct: 5  EVPQLHQSVSTLNQELTTLNQQTVKIEQQNALNAKST 41


>gnl|CDD|115307 pfam06637, PV-1, PV-1 protein (PLVAP).  This family consists of
           several PV-1 (PLVAP) proteins which seem to be specific
           to mammals. PV-1 is a novel protein component of the
           endothelial fenestral and stomatal diaphragms. The
           function of this family is unknown.
          Length = 442

 Score = 29.2 bits (65), Expect = 3.1
 Identities = 25/88 (28%), Positives = 38/88 (43%), Gaps = 10/88 (11%)

Query: 14  QNQRLITLNHEKAKLEARTKANPGAHADEQEQTNLLISKKQLALK---TLREKIQDLESQ 70
           Q Q+L      +A  EA+ KA   A A E +       + QLAL+    LR +  +L  +
Sbjct: 305 QRQKLEAEQAAQAAQEAKAKAEKEAQAREAKLQAECARQTQLALEEKAALRAERDNLAKE 364

Query: 71  LSTKEKDLE-------VNNSNLDDLKKT 91
           L  K+++ E       +  S LD   K 
Sbjct: 365 LEAKKREAEQLRMELAIRISALDTCIKA 392


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 29.1 bits (66), Expect = 3.2
 Identities = 29/117 (24%), Positives = 53/117 (45%), Gaps = 17/117 (14%)

Query: 57  LKTLREKIQDLESQLSTKEKDLEVNNSNLDDLKKTLANMMSDSEKLYNVYAEKRNINSGL 116
           LK  R ++Q LE +L  +E+ L+    +LD  ++ L       EK      EK   N   
Sbjct: 71  LKERRNELQRLERRLLQREETLDRKMESLDKKEENL-------EK-----KEKELSNKEK 118

Query: 117 QLDKWATETKHLIKGLLSEVSYKRVLLKSAMKVIDDRQNLISEKKKRLTQEITEMVK 173
            LD+   E + LI     E+  +R+   S +   + ++ L+ E ++    E  +++K
Sbjct: 119 NLDEKEEELEELIAEQREEL--ERI---SGLTQEEAKEILLEEVEEEARHEAAKLIK 170


>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
           interacts with SMC proteins [Cell division and
           chromosome partitioning].
          Length = 622

 Score = 29.2 bits (65), Expect = 3.3
 Identities = 45/214 (21%), Positives = 86/214 (40%), Gaps = 12/214 (5%)

Query: 9   AKLKEQNQRLITLNHEKAKLEARTKANPGAHADEQEQTNLLISKKQLALKTLRE---KIQ 65
           A LK QN  L     E  K+  + K         +  +N   +      +  +E   K++
Sbjct: 274 ANLKTQNDNLYEKIQEAMKISQKIKTLREKWRALKSDSNKYENYVNAMKQKSQEWPGKLE 333

Query: 66  DLESQLSTKEKDLEVNNSNLDDLKKTLANMMSDSEKLYNVYAEKRNINSGL--------Q 117
            L+S++  KE++++   SN+D+L K L      +E+   +  E+  +   L        +
Sbjct: 334 KLKSEIELKEEEIKALQSNIDELHKQLRKQGISTEQFELMNQEREKLTRELDKINIQSDK 393

Query: 118 LDKWATETKHLIKGLLSEVSYKRVLLKSAMKVIDDRQNLISEKKKRLTQEIT-EMVKLLT 176
           L K     K   +G+   +        S ++ I   ++ I       + +I  E +    
Sbjct: 394 LTKSVKSRKLEAQGIFKSLEKTLRQYDSLIQNITRSRSQIGHNVNDSSLKINIEQLFPKG 453

Query: 177 NAINLRGKQLAYKLNEICDSKQKTLNEKKVTLEQ 210
           + IN   K+   +LN+    + KT   K +TLE+
Sbjct: 454 SGINESIKKSILELNDEIQERIKTEENKSITLEE 487


>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein.  This domain is found
           in cell division proteins which are required for
           kinetochore-spindle association.
          Length = 312

 Score = 28.8 bits (65), Expect = 3.6
 Identities = 22/87 (25%), Positives = 34/87 (39%)

Query: 7   LKAKLKEQNQRLITLNHEKAKLEARTKANPGAHADEQEQTNLLISKKQLALKTLREKIQD 66
           LK   K   + L  LN  K KL  R  A        ++  + L       L   +EK++ 
Sbjct: 156 LKEDYKLLMKELELLNSIKPKLRDRKDALEEELRQLKQLEDELEDCDPTELDRAKEKLKK 215

Query: 67  LESQLSTKEKDLEVNNSNLDDLKKTLA 93
           L  ++  K K LE     L +L+  + 
Sbjct: 216 LLQEIMIKVKKLEELEEELQELESKIE 242



 Score = 27.3 bits (61), Expect = 9.5
 Identities = 27/138 (19%), Positives = 59/138 (42%), Gaps = 6/138 (4%)

Query: 79  EVNNSNLDDLKKTLANMMSDSEKLYNVYAEKRNINSGLQLDKWATETKHLIKGLLSEVSY 138
           E  + NL+ LK+    +M + E L ++  + R+    L+      E + L +        
Sbjct: 147 EGLDENLEGLKEDYKLLMKELELLNSIKPKLRDRKDALE-----EELRQLKQLEDELEDC 201

Query: 139 KRVLLKSAMKVIDDRQNLISEKKKRLTQEITEMVKLLTNAINLRGKQLAYKLNEICDSKQ 198
               L  A + +      I  K K+L +E+ E ++ L + I     + +    EI ++++
Sbjct: 202 DPTELDRAKEKLKKLLQEIMIKVKKL-EELEEELQELESKIEDLTNKKSELNTEIAEAEK 260

Query: 199 KTLNEKKVTLEQLSRLTD 216
           K    +  T +++ +L +
Sbjct: 261 KLEQCRGFTFKEIEKLKE 278


>gnl|CDD|100094 cd05801, PGM_like3, This bacterial PGM-like
           (phosphoglucomutase-like) protein of unknown function
           belongs to the alpha-D-phosphohexomutase superfamily.
           The alpha-D-phosphohexomutases include several related
           enzymes that catalyze a reversible intramolecular
           phosphoryl transfer on their sugar substrates. Other
           members of this superfamily include phosphoglucosamine
           mutase (PNGM), phosphoacetylglucosamine mutase (PAGM),
           the bacterial phosphomannomutase ManB, the bacterial
           phosphoglucosamine mutase GlmM, and the bifunctional
           phosphomannomutase/phosphoglucomutase (PMM/PGM). Each of
           these enzymes has four domains with a centrally located
           active site formed by four loops, one from each domain.
           All four domains are included in this alignment model.
          Length = 522

 Score = 28.8 bits (65), Expect = 3.6
 Identities = 24/104 (23%), Positives = 41/104 (39%), Gaps = 23/104 (22%)

Query: 26  AKLEARTKANPGAHADEQEQ---------TNLLISKKQLA-LKTLREKIQDLESQLSTKE 75
           A++ A T  +PG    E  +          +   + +Q A LK L    + + +     +
Sbjct: 402 AEILAVTGKDPGQLYQELTERFGEPYYARIDAPATPEQKARLKKLS--PEQVTATELAGD 459

Query: 76  KDLEV------NNSNLDDLKKTLAN-----MMSDSEKLYNVYAE 108
             L        N +++  LK T AN       S +E +Y +YAE
Sbjct: 460 PILAKLTRAPGNGASIGGLKVTTANGWFAARPSGTEDVYKIYAE 503


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 943 to 1234 amino
           acids in length. This family contains a P-loop motif
           suggesting it is a nucleotide binding protein. It may be
           involved in replication.
          Length = 1198

 Score = 28.9 bits (65), Expect = 3.6
 Identities = 17/73 (23%), Positives = 31/73 (42%), Gaps = 2/73 (2%)

Query: 2   AEMSALKAKLKEQNQRLITLNHEKAKL--EARTKANPGAHADEQEQTNLLISKKQLALKT 59
            +   LK K+KEQ +R +  N+E+     E + +       D Q   + L  + +     
Sbjct: 380 RKYERLKQKIKEQLERDLEKNNERLAAIREEKDRQKAAIEEDLQALESQLRQQLEAGKLE 439

Query: 60  LREKIQDLESQLS 72
             E+  +LE +L 
Sbjct: 440 FNEEEYELELRLG 452


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 28.5 bits (64), Expect = 4.3
 Identities = 14/43 (32%), Positives = 21/43 (48%)

Query: 51  SKKQLALKTLREKIQDLESQLSTKEKDLEVNNSNLDDLKKTLA 93
           +K +  LK+L  +I  LE+QL     DL+     + DL   L 
Sbjct: 62  AKLEKQLKSLETEIASLEAQLIETADDLKKLRKQIADLNARLN 104



 Score = 27.8 bits (62), Expect = 7.2
 Identities = 21/91 (23%), Positives = 37/91 (40%), Gaps = 10/91 (10%)

Query: 6   ALKAKLKEQNQRLITLNHEKAKLEA---RTKANPGAHADEQEQTNLLISKKQLALKTLRE 62
           A   ++      L  L   +A++ A              +Q +   L+ +++   KTL +
Sbjct: 158 ARAERIDALKATLKQLAAVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEERK---KTLAQ 214

Query: 63  KIQDLESQLSTKEKDLEVNNSNLDDLKKTLA 93
               L S+LS  +K LE   +N   LK  +A
Sbjct: 215 ----LNSELSADQKKLEELRANESRLKNEIA 241


>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
          Length = 508

 Score = 28.5 bits (64), Expect = 4.5
 Identities = 11/32 (34%), Positives = 17/32 (53%)

Query: 5   SALKAKLKEQNQRLITLNHEKAKLEARTKANP 36
             L A+L+E+  + +    E+A LEA  KA  
Sbjct: 149 KLLDAELEEEKAQRVKKIEEEADLEAERKAQN 180


>gnl|CDD|191367 pfam05761, 5_nucleotid, 5' nucleotidase family.  This family of
           eukaryotic proteins includes 5' nucleotidase enzymes,
           such as purine 5'-nucleotidase EC:3.1.3.5.
          Length = 448

 Score = 28.4 bits (64), Expect = 5.0
 Identities = 13/67 (19%), Positives = 25/67 (37%), Gaps = 9/67 (13%)

Query: 40  ADEQEQTNLLISKKQLALKTLREKIQDLESQLSTKEKDLEVNNSNLDDLKKTLANMMSDS 99
             EQ +   L       L  L E++ D     +    +L     +L +L   +  +    
Sbjct: 334 NTEQYRFEELQ-----RLDILLERLYDHLDVHA----ELSTLRPDLSELDAEIRELRRAM 384

Query: 100 EKLYNVY 106
           ++L+N Y
Sbjct: 385 KELFNPY 391


>gnl|CDD|218681 pfam05667, DUF812, Protein of unknown function (DUF812).  This
           family consists of several eukaryotic proteins of
           unknown function.
          Length = 536

 Score = 28.6 bits (64), Expect = 5.2
 Identities = 18/87 (20%), Positives = 40/87 (45%), Gaps = 2/87 (2%)

Query: 24  EKAKLEARTKANPGAHADEQEQTNLLISKKQLALKTLREKIQDLESQLSTKEKDLEVNNS 83
           EK +  A   A       + E+      +++  L+ L+E+I +LES +     +++   S
Sbjct: 241 EKVEQAATAAAEVPPAEMDTEED--RTKEREAELEALQEQIDELESSIEEVLSEIKALAS 298

Query: 84  NLDDLKKTLANMMSDSEKLYNVYAEKR 110
            +  + + L  +  ++E+L   Y  K+
Sbjct: 299 KIKQVNEELTTVRQENEELEEEYKIKK 325


>gnl|CDD|236427 PRK09246, PRK09246, amidophosphoribosyltransferase; Provisional.
          Length = 501

 Score = 28.2 bits (64), Expect = 5.3
 Identities = 18/50 (36%), Positives = 24/50 (48%), Gaps = 10/50 (20%)

Query: 80  VNNSNL---DDLKKTLA-------NMMSDSEKLYNVYAEKRNINSGLQLD 119
            +N NL   ++L+K L        N  SDSE L NV+A +     GL L 
Sbjct: 99  AHNGNLTNAEELRKELFEKDRRHINTTSDSEVLLNVFAHELQKFRGLPLT 148


>gnl|CDD|237255 PRK12900, secA, preprotein translocase subunit SecA; Reviewed.
          Length = 1025

 Score = 28.6 bits (64), Expect = 5.4
 Identities = 18/56 (32%), Positives = 29/56 (51%), Gaps = 5/56 (8%)

Query: 40 ADEQ--EQTNLLISKKQLALKTLREKIQDLESQLSTKEKDLEVN---NSNLDDLKK 90
          +D+   E+   L S+ + AL+ + +K +DLE +L   +  LE     N  LD L K
Sbjct: 38 SDDALREKGMELKSRVRGALEPIEQKKKDLEKKLDNPDISLEEAESINEELDTLAK 93


>gnl|CDD|235008 PRK02135, PRK02135, hypothetical protein; Provisional.
          Length = 201

 Score = 27.9 bits (63), Expect = 5.5
 Identities = 10/30 (33%), Positives = 16/30 (53%)

Query: 199 KTLNEKKVTLEQLSRLTDHCIEFVNNGLNT 228
           K L  +K++L       DHCI  ++N L+ 
Sbjct: 167 KRLGAEKISLGPKMLHADHCITLIHNELDR 196


>gnl|CDD|237961 PRK15408, PRK15408, autoinducer 2-binding protein lsrB;
           Provisional.
          Length = 336

 Score = 28.2 bits (63), Expect = 5.5
 Identities = 18/61 (29%), Positives = 31/61 (50%), Gaps = 9/61 (14%)

Query: 219 IEFVNNGLNTGSDMALLSS-------KAYKRVLLKSAMKVID-DRPTIPEDTSYKITLKS 270
           ++ +NN +N G +  ++S+        A KR  ++  +KV+  D  T PE  SY I   +
Sbjct: 70  VQLINNFVNQGYNAIIVSAVSPDGLCPALKRA-MQRGVKVLTWDSDTKPECRSYYINQGT 128

Query: 271 P 271
           P
Sbjct: 129 P 129


>gnl|CDD|221591 pfam12470, SUFU_C, Suppressor of Fused Gli/Ci N terminal binding
           domain.  This domain family is found in eukaryotes, and
           is typically between 192 and 219 amino acids in length.
           The family is found in association with pfam05076. There
           is a conserved HGRHFT sequence motif. This family is the
           C terminal domain of the Suppressor of Fused protein
           (Su(fu)). Su(fu) is a repressor of the Gli and Ci
           transcription factors of the Hedgehog signalling
           cascade. It functions by binding these proteins and
           preventing their translocation to the nucleus. The C
           terminal domain is only found in eukaryotic Su(fu)
           proteins; it is not present in bacterial homologues. The
           C terminal domain binds to the N terminal of Gli/Ci
           while the N terminal of Su(fu) binds to the C terminal
           of Gli/Ci. This dual binding mechanism is likely an
           evolutionary advancement in this signalling cascade
           which is not present in bacterial homologues.
          Length = 201

 Score = 27.5 bits (61), Expect = 6.6
 Identities = 16/68 (23%), Positives = 27/68 (39%), Gaps = 1/68 (1%)

Query: 220 EFVNNGLNT-GSDMALLSSKAYKRVLLKSAMKVIDDRPTIPEDTSYKITLKSPSSSTSSK 278
           E +   L   GSD+A +++    + L K A+   D      +DT    +  S       +
Sbjct: 2   EQLEKDLEKDGSDLAGVNADFTFKELSKPAIDKEDVDSIEDKDTEQIRSPLSEGLEIKEE 61

Query: 279 SVTPPSSS 286
              PP+ S
Sbjct: 62  PSEPPTLS 69


>gnl|CDD|233791 TIGR02231, TIGR02231, conserved hypothetical protein.  This family
           consists of proteins over 500 amino acids long in
           Caenorhabditis elegans and several bacteria (Pseudomonas
           aeruginosa, Nostoc sp. PCC 7120, Leptospira interrogans,
           etc.). The function is unknown.
          Length = 525

 Score = 28.2 bits (63), Expect = 6.9
 Identities = 25/127 (19%), Positives = 44/127 (34%), Gaps = 18/127 (14%)

Query: 54  QLALKTLREKIQDLESQLSTKEK---DLEVNNSNLDDLKKTLANMMSDSEKLYNVYAEKR 110
           +   +   E++ +L  Q+   E    DLE     L  L K L ++    E L     +  
Sbjct: 63  ERTSRPDPERLAELRKQIRELEAELRDLEDRGDALKALAKFLEDI---REGLTEPIKDSA 119

Query: 111 NINSGLQLDKWATETKHLIKGLLSEVSYKRVLLKSAMKVIDDRQNLISEKKKRLTQEITE 170
             N    L +W                 +   L +  +  + R     E +K+L++   E
Sbjct: 120 KRNEPD-LKEWFQ--------AFDFNGSEIERLLTEDREAERRI---RELEKQLSELQNE 167

Query: 171 MVKLLTN 177
           +  LLT 
Sbjct: 168 LNALLTG 174


>gnl|CDD|113356 pfam04582, Reo_sigmaC, Reovirus sigma C capsid protein. 
          Length = 326

 Score = 28.0 bits (62), Expect = 7.3
 Identities = 15/49 (30%), Positives = 26/49 (53%)

Query: 56  ALKTLREKIQDLESQLSTKEKDLEVNNSNLDDLKKTLANMMSDSEKLYN 104
           A  +L E +  + S+LS    DL+   ++LDD K TL  +++  + L  
Sbjct: 43  ATASLSESVSTISSRLSDLSADLQDMTTSLDDAKATLNGLITTVQALQT 91


>gnl|CDD|225519 COG2972, COG2972, Predicted signal transduction protein with a
           C-terminal ATPase domain [Signal transduction
           mechanisms].
          Length = 456

 Score = 27.8 bits (62), Expect = 7.3
 Identities = 22/120 (18%), Positives = 45/120 (37%), Gaps = 9/120 (7%)

Query: 131 GLLSEVSYKRVLLKSAMKVIDDRQNLISEKKKRLTQEITEMVKLLTNAINLRGKQLAYKL 190
            L+  +       +S +  I      +     R+       +K ++  IN   ++L Y +
Sbjct: 188 FLVVILLLSSFSSRSILLPILG----LILLLSRIALGYLSELKEISETINEMEERLEYLI 243

Query: 191 NEICDSKQKTLNEKKVTLEQLSRLTDHCIEFVNNGLNTGSDMALLSSKAYKRVLLKSAMK 250
            E   S ++   E ++   Q S++  H   F+ N L T   +A          ++K+  K
Sbjct: 244 EEN-YSLEQEQLEAELRALQ-SQINPH---FLYNTLETIRMLAEEDDPEEAAKVVKALSK 298


>gnl|CDD|227822 COG5535, RAD4, DNA repair protein RAD4 [DNA replication,
           recombination, and repair].
          Length = 650

 Score = 27.9 bits (62), Expect = 7.4
 Identities = 28/121 (23%), Positives = 44/121 (36%), Gaps = 16/121 (13%)

Query: 33  KANPGAHADEQEQTNLLISKKQLALKTLREKIQDLESQLSTKEKDL-EVNNSNLDDLKKT 91
           +    ++ D+  + N  I KK ++ K   E     +S    +EKD+ E       DL  T
Sbjct: 3   RDLENSYNDQARRKNSKIEKKNVSFKEEEEVDSSFDSFYQDEEKDIDEEPVELDGDLTVT 62

Query: 92  LANMMSDSEKLYNVYAEKRNINSGLQLDKWATETKHLIKGLLSEVSYKRVLLKSAMKVID 151
           + N+  D  K+ + Y++  N                              LLKS  K ID
Sbjct: 63  VNNIRRDRSKV-SKYSDDHNEKLQSSQLHLIMIPFM--------------LLKSRNKWID 107

Query: 152 D 152
           D
Sbjct: 108 D 108


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 27.7 bits (62), Expect = 8.0
 Identities = 18/76 (23%), Positives = 38/76 (50%), Gaps = 3/76 (3%)

Query: 8   KAKLKEQNQRLITLNHEKAKLEARTKANPG---AHADEQEQTNLLISKKQLALKTLREKI 64
             +LK+       ++  K KL+++ + +     A   E+++      KK+  ++ L E+I
Sbjct: 293 LKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERI 352

Query: 65  QDLESQLSTKEKDLEV 80
           + LE Q + KE++  V
Sbjct: 353 EKLEVQATDKEENKTV 368


>gnl|CDD|181619 PRK09039, PRK09039, hypothetical protein; Validated.
          Length = 343

 Score = 27.6 bits (62), Expect = 9.2
 Identities = 18/63 (28%), Positives = 30/63 (47%), Gaps = 8/63 (12%)

Query: 50  ISKKQLALKTLREKIQDLESQLSTKEKDLEVNNSNLDD----LKKTLANMMSDSEKLYNV 105
           IS K  AL  L  +I +L   LS + +     N +L D    L+ +L+   ++  +L  +
Sbjct: 48  ISGKDSALDRLNSQIAELADLLSLERQ----GNQDLQDSVANLRASLSAAEAERSRLQAL 103

Query: 106 YAE 108
            AE
Sbjct: 104 LAE 106


>gnl|CDD|215507 PLN02939, PLN02939, transferase, transferring glycosyl groups.
          Length = 977

 Score = 27.6 bits (61), Expect = 9.6
 Identities = 40/212 (18%), Positives = 83/212 (39%), Gaps = 23/212 (10%)

Query: 11  LKEQNQRLITLNHEKAKLEARTKANPGAHADEQEQTNLLISKKQLALKTLREKIQDLESQ 70
           L+EQ ++L      +   E     +     D  ++ N+L+      LK    ++ + E +
Sbjct: 199 LEEQLEKLRNELLIRGATEGLCVHSLSKELDVLKEENMLLKDDIQFLKAELIEVAETEER 258

Query: 71  LSTKEKDLEVNNSNLDDLKKTLANMMSDSEKL----YNVYAEKRNINSGLQLDKWATETK 126
           +   EK+  + +++L +L+        D  KL    Y+ + EK   N    LD+   + +
Sbjct: 259 VFKLEKERSLLDASLRELESKFIVAQEDVSKLSPLQYDCWWEKVE-NLQDLLDRATNQVE 317

Query: 127 HLI------KGLLSEVSYKRVLLKSA---------MKVIDDRQNLISEKKKRLTQEITEM 171
                    + L  +V      LK A         ++++  +  L+ E+ +    EI   
Sbjct: 318 KAALVLDQNQDLRDKVDKLEASLKEANVSKFSSYKVELLQQKLKLLEERLQASDHEIHSY 377

Query: 172 VKLLTNAINLRGKQLAYKLNEICDSKQKTLNE 203
           ++L   +I      L+    E   SK+++L  
Sbjct: 378 IQLYQESIKEFQDTLSKLKEE---SKKRSLEH 406


>gnl|CDD|225707 COG3166, PilN, Tfp pilus assembly protein PilN [Cell motility and
          secretion / Intracellular trafficking and secretion].
          Length = 206

 Score = 27.0 bits (60), Expect = 9.6
 Identities = 12/40 (30%), Positives = 20/40 (50%)

Query: 41 DEQEQTNLLISKKQLALKTLREKIQDLESQLSTKEKDLEV 80
           EQ+Q N L++ +   L     +IQ L+ Q     + L+V
Sbjct: 51 AEQQQRNALLTTEIALLDAEIAEIQQLKEQTQALLQRLQV 90


>gnl|CDD|233392 TIGR01391, dnaG, DNA primase, catalytic core.  Members of this
           family are DNA primase, a ubiquitous bacteria protein.
           Most members of this family contain nearly two hundred
           additional residues C-terminal to the region represented
           here, but conservation between species is poor and the
           C-terminal region was not included in the seed
           alignment. This protein contains a CHC2 zinc finger
           (pfam01807) and a Toprim domain (pfam01751) [DNA
           metabolism, DNA replication, recombination, and repair].
          Length = 415

 Score = 27.6 bits (62), Expect = 9.9
 Identities = 17/64 (26%), Positives = 25/64 (39%)

Query: 86  DDLKKTLANMMSDSEKLYNVYAEKRNINSGLQLDKWATETKHLIKGLLSEVSYKRVLLKS 145
           + LKK L N  S  E L        N+++  +  K   E   LIK +   +     L K 
Sbjct: 352 EALKKLLENSKSLIEFLIARLLSNYNLDTPEEKAKLVEELLPLIKKIPDPILRDYYLQKL 411

Query: 146 AMKV 149
           A  +
Sbjct: 412 AQLL 415


>gnl|CDD|204831 pfam12137, RapA_C, RNA polymerase recycling family C-terminal.
           This domain is found in bacteria. This domain is about
           360 amino acids in length. This domain is found
           associated with pfam00271, pfam00176. The function of
           this domain is not known, but structurally it forms an
           alpha-beta fold in nature with a central beta-sheet
           flanked by helices and loops, the beta-sheet being
           mainly antiparallel and flanked by four alpha helices,
           among which the two longer helices exhibit a coiled-coil
           arrangement.
          Length = 362

 Score = 27.5 bits (62), Expect = 10.0
 Identities = 17/67 (25%), Positives = 31/67 (46%), Gaps = 9/67 (13%)

Query: 2   AEMSALKAKLKEQNQRLITLNHEKAKLEARTKANPGAHADE-------QEQTNLLISKKQ 54
            ++  +  + K+Q Q+   L+ E A+L+A    NP    +E       +E+    + K Q
Sbjct: 293 EQLEEIIEQAKQQMQQ--QLSAELARLKALKAVNPNIRDEEIEALEQQREELLTALDKAQ 350

Query: 55  LALKTLR 61
           L L  +R
Sbjct: 351 LRLDAIR 357


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.308    0.123    0.319 

Gapped
Lambda     K      H
   0.267   0.0724    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 13,557,554
Number of extensions: 1262121
Number of successful extensions: 2173
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1996
Number of HSP's successfully gapped: 349
Length of query: 293
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 197
Effective length of database: 6,679,618
Effective search space: 1315884746
Effective search space used: 1315884746
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 59 (26.5 bits)