RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy9160
         (297 letters)



>gnl|CDD|152645 pfam12210, Hrs_helical, Hepatocyte growth factor-regulated tyrosine
           kinase substrate.  This domain family is found in
           eukaryotes, and is approximately 100 amino acids in
           length. The family is found in association with
           pfam00790, pfam01363, pfam02809. This domain is the
           helical region of Hrs which forms the core complex of
           ESCRT with STAM.
          Length = 96

 Score =  128 bits (322), Expect = 2e-37
 Identities = 65/95 (68%), Positives = 78/95 (82%)

Query: 143 DDFIRALKSQVEIFVNRMKSNSSRGRSIANDSSVQTLFVNITTMHSRLLKYIQQKDDKRV 202
           ++F+  L++ VEIFVNRMKSNS RGRSIANDS+VQ+LF  I  MH +LL  + Q D+KR 
Sbjct: 1   EEFLNNLRTSVEIFVNRMKSNSMRGRSIANDSAVQSLFQTINAMHPQLLSLMNQLDEKRG 60

Query: 203 YYEGLQDKLTQVKDARAALNALREEHNEKLRREAQ 237
           YYEGLQDKL Q+KDARAAL+ALREEH EKLRR A+
Sbjct: 61  YYEGLQDKLAQIKDARAALDALREEHREKLRRAAE 95


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 40.9 bits (97), Expect = 5e-04
 Identities = 18/89 (20%), Positives = 45/89 (50%), Gaps = 9/89 (10%)

Query: 208 QDKLTQVKDARAALNALREEHNEKLRREAQIAERQKQIQ-----MAHKLDLMRKKKQEYL 262
           ++ L + K+         E   E   R  ++ + +K++      +  KL+L+ K+++E  
Sbjct: 56  KEALLEAKEE--IHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELE 113

Query: 263 QYQRNLA--LQKIQEQEYEMQRRQEQQKQ 289
           + ++ L    Q+++++E E++   E+Q Q
Sbjct: 114 KKEKELEQKQQELEKKEEELEELIEEQLQ 142


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 40.0 bits (94), Expect = 0.001
 Identities = 30/156 (19%), Positives = 69/156 (44%), Gaps = 12/156 (7%)

Query: 141 ELDDFIRALKSQVEIFVNRMKS--NSSRGRSIANDSSVQTLFVNITTMHSRLLKYIQQKD 198
           +L++ I   +S +E  +  ++S  ++ +  S      V+ L   +  +   L K +Q+ +
Sbjct: 286 QLEELIDKYESHIEKALEELESILDTEKENSEFKLD-VEELKALLEALEEILEKNLQKLE 344

Query: 199 DKR------VYYEGLQDKLTQVKDARAALNALREEHNEKLRREAQIAERQKQIQMAHKLD 252
           +K       +  E + D +  + D   A+N L  EHNEK+     + + + + +    L 
Sbjct: 345 EKLKDPSTSIELESITDLIESINDIIDAINELIREHNEKIDN---LKKEKNKAKKKLWLH 401

Query: 253 LMRKKKQEYLQYQRNLALQKIQEQEYEMQRRQEQQK 288
           L+ + K++   YQ+     +      E + +Q + +
Sbjct: 402 LVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAE 437


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 35.7 bits (83), Expect = 0.029
 Identities = 19/88 (21%), Positives = 45/88 (51%), Gaps = 7/88 (7%)

Query: 208 QDKLTQVKD-ARAALNALREEHNEKLRREAQIAER---QKQIQMAHKLDLMRKKKQEYLQ 263
           ++ L + K+        L  E  E+ R E Q  ER   Q++  +  K++ + KK++   +
Sbjct: 50  KEALLEAKEEVHKLRAELERELKER-RNELQRLERRLLQREETLDRKMESLDKKEENLEK 108

Query: 264 YQRNLALQK--IQEQEYEMQRRQEQQKQ 289
            ++ L+ ++  + E+E E++    +Q++
Sbjct: 109 KEKELSNKEKNLDEKEEELEELIAEQRE 136


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 35.5 bits (82), Expect = 0.029
 Identities = 20/95 (21%), Positives = 39/95 (41%), Gaps = 4/95 (4%)

Query: 195 QQKDDKRVYYEGLQDKLTQVKDARAALNALR---EEHNEKLRREAQIAERQKQIQMAHKL 251
            Q    +     L+D+L ++K+    L  LR   EE  E   R  ++ ++ K+++     
Sbjct: 552 LQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKELE-ERLS 610

Query: 252 DLMRKKKQEYLQYQRNLALQKIQEQEYEMQRRQEQ 286
            L    +   L    N   +  +E E E+++   Q
Sbjct: 611 QLEELLQSLELSEAENELEEAEEELESELEKLNLQ 645



 Score = 32.4 bits (74), Expect = 0.33
 Identities = 16/115 (13%), Positives = 47/115 (40%), Gaps = 6/115 (5%)

Query: 187 HSRLLKYIQQKDDKRVYYEGLQDKLT----QVKDARAALNALREEHNEKLRREAQI--AE 240
              L + +++ ++K    E L+ ++     +++  RA L  L E   +    E ++   E
Sbjct: 276 LRELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLE 335

Query: 241 RQKQIQMAHKLDLMRKKKQEYLQYQRNLALQKIQEQEYEMQRRQEQQKQYYMQNN 295
            + +   +   +L  +K +     +  L   + + +E E +  +  ++   ++  
Sbjct: 336 EKLEKLESELEELAEEKNELAKLLEERLKELEERLEELEKELEKALERLKQLEEA 390


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 35.0 bits (81), Expect = 0.045
 Identities = 27/148 (18%), Positives = 62/148 (41%), Gaps = 16/148 (10%)

Query: 147 RALKSQVEIFVNRMKSNSSRGRSIANDSSVQTLFVNITTMHSRLLKYIQQKDDKRVYYEG 206
             ++  V IF + + S          +S + +    +  +  +L +   +++++      
Sbjct: 136 PVVQELVSIFNDLIDSIKEDNLKDDLESLIASAKEELDQLSKKLAELKAEEEEE------ 189

Query: 207 LQDKLTQVKDARAALNALREEHNEKLRREAQIAERQKQIQMAHKLDLMRKKKQE----YL 262
           L+  L + ++    L+ L EE   +L  +    E+Q +++   + + +RKK +E     L
Sbjct: 190 LERALKEKREE--LLSKLEEELLARLESKEAALEKQLRLEFEREKEELRKKYEEKLRQEL 247

Query: 263 QYQRNLALQK----IQEQEYEMQRRQEQ 286
           + Q     QK    +  Q  E+QR   +
Sbjct: 248 ERQAEAHEQKLKNELALQAIELQREFNK 275


>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552).  This
           presumed domain is functionally uncharacterized. This
           domain is found in bacteria, archaea and eukaryotes.
           This domain is about 200 amino acids in length. This
           domain is found associated with pfam00013, pfam01966.
           This domain has a single completely conserved residue A
           that may be functionally important.
          Length = 201

 Score = 33.7 bits (78), Expect = 0.064
 Identities = 24/88 (27%), Positives = 47/88 (53%), Gaps = 11/88 (12%)

Query: 210 KLTQVKDARAALNALREEHNEKL---RREAQIAER---QKQIQMAHKLDLMRKKKQEYLQ 263
            L + K+    ++ LR E   +L   R E Q  E+   QK+  +  K + + KK++   +
Sbjct: 54  ALLEAKEE---IHKLRAEAERELKERRNELQRQEKRLLQKEETLDRKDESLEKKEESLEE 110

Query: 264 YQRNLA--LQKIQEQEYEMQRRQEQQKQ 289
            ++ LA   Q+++E+E E++   E+Q+Q
Sbjct: 111 KEKELAARQQQLEEKEEELEELIEEQQQ 138


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 34.3 bits (79), Expect = 0.087
 Identities = 21/85 (24%), Positives = 47/85 (55%), Gaps = 1/85 (1%)

Query: 205 EGLQDKLTQVKDARAALNALREEHNEKLRREAQ-IAERQKQIQMAHKLDLMRKKKQEYLQ 263
           E L+++L  +++A A L    EE  EK +   + + E +++++ A +     +++ E L+
Sbjct: 761 EELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLE 820

Query: 264 YQRNLALQKIQEQEYEMQRRQEQQK 288
            +R    Q+I+E E E++  +E+  
Sbjct: 821 QRRERLEQEIEELEEEIEELEEKLD 845



 Score = 33.1 bits (76), Expect = 0.21
 Identities = 28/162 (17%), Positives = 75/162 (46%), Gaps = 21/162 (12%)

Query: 136 EASDTELDDFIRALKSQVEIFVNRMKSNSSRGRSIANDSSVQTLFVNITTMHSRLLKYIQ 195
           +    EL++  +AL+ ++E      +   +  R  A +  ++    ++     RL + I+
Sbjct: 778 KEEIEELEEKRQALQEELEEL--EEELEEAERRLDALERELE----SLEQRRERLEQEIE 831

Query: 196 QKDDKRVYYEGLQDKLTQVKDARAALNALREEHNEKL-RREAQIAERQKQIQMAHKLDLM 254
           + +++    E L++KL ++++    L    EE  E+L   EA+  E +         D +
Sbjct: 832 ELEEEI---EELEEKLDELEEELEELEKELEELKEELEELEAEKEELE---------DEL 879

Query: 255 RKKKQEYLQYQRNL--ALQKIQEQEYEMQRRQEQQKQYYMQN 294
           ++ ++E  + +  L     ++ E + E+++ +E+ ++   + 
Sbjct: 880 KELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKL 921



 Score = 31.2 bits (71), Expect = 0.78
 Identities = 18/90 (20%), Positives = 38/90 (42%), Gaps = 9/90 (10%)

Query: 204 YEGLQDKLTQVKDARAALNA----LREEHNEKLRREAQIAERQKQIQMAHKLDLMRKKKQ 259
            E LQ++L ++K+    L      LRE   E      ++ ER ++++   +       K+
Sbjct: 283 LEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIE-----ALKE 337

Query: 260 EYLQYQRNLALQKIQEQEYEMQRRQEQQKQ 289
           E  + +  L   +    E E  + + ++K 
Sbjct: 338 ELEERETLLEELEQLLAELEEAKEELEEKL 367



 Score = 30.8 bits (70), Expect = 1.1
 Identities = 16/106 (15%), Positives = 49/106 (46%), Gaps = 2/106 (1%)

Query: 187 HSRLLKYIQQKDDKR-VYYEGLQDKLTQVKDARAALNALREEHNEKLRR-EAQIAERQKQ 244
            + L +  ++ ++K     E L++    +++  A L A   E   +L   + +I   +++
Sbjct: 353 LAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEER 412

Query: 245 IQMAHKLDLMRKKKQEYLQYQRNLALQKIQEQEYEMQRRQEQQKQY 290
           ++   +     K++ + L+ +      +++E   E++  +EQ ++ 
Sbjct: 413 LERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEEL 458


>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 157

 Score = 32.2 bits (74), Expect = 0.13
 Identities = 18/81 (22%), Positives = 42/81 (51%), Gaps = 3/81 (3%)

Query: 209 DKLTQVKDARAALNALREEHNEKL-RREAQIAERQKQIQMAHKL--DLMRKKKQEYLQYQ 265
            +    K A+  L    ++   +L ++E ++ + ++++Q       +  RK KQ+ LQ +
Sbjct: 28  SESPAGKAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQELQQK 87

Query: 266 RNLALQKIQEQEYEMQRRQEQ 286
           +    QK Q  + E+Q++Q++
Sbjct: 88  QQELQQKQQAAQQELQQKQQE 108



 Score = 29.1 bits (66), Expect = 1.6
 Identities = 11/72 (15%), Positives = 31/72 (43%), Gaps = 3/72 (4%)

Query: 208 QDKLTQVKDARAALNALREEHNEKLR--REAQIAERQKQIQ-MAHKLDLMRKKKQEYLQY 264
           +    +++     L    ++  ++     E     +Q+++Q    +L   ++  Q+ LQ 
Sbjct: 45  KKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAKQQELQQKQQELQQKQQAAQQELQQ 104

Query: 265 QRNLALQKIQEQ 276
           ++   LQ I ++
Sbjct: 105 KQQELLQPIYDK 116


>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
           Transcription of the anti-viral guanylate-binding
           protein (GBP) is induced by interferon-gamma during
           macrophage induction. This family contains GBP1 and
           GPB2, both GTPases capable of binding GTP, GDP and GMP.
          Length = 297

 Score = 32.6 bits (75), Expect = 0.18
 Identities = 20/62 (32%), Positives = 32/62 (51%), Gaps = 3/62 (4%)

Query: 217 ARAALNALREEHNEKLRR-EAQIAERQKQIQMAHKLDLMRKKKQEYLQYQRNLALQKIQE 275
           A A    LRE+  E+ +  EAQ  ER  Q  +   ++ M  ++++ L  Q  +   K+QE
Sbjct: 216 AEAEQELLREKQKEEEQMMEAQ--ERSYQEHVKQLIEKMEAEREKLLAEQERMLEHKLQE 273

Query: 276 QE 277
           QE
Sbjct: 274 QE 275



 Score = 28.0 bits (63), Expect = 6.4
 Identities = 14/65 (21%), Positives = 32/65 (49%), Gaps = 2/65 (3%)

Query: 225 REEHNEKLRREAQIAERQKQIQMAHKLDLMRKKKQEYLQYQRNLA-LQKIQEQEYEMQRR 283
           +E+  E  R +A+ AE ++++    + +  +  + +   YQ ++  L +  E E E    
Sbjct: 202 KEKAIEAERAKAEAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREKL-L 260

Query: 284 QEQQK 288
            EQ++
Sbjct: 261 AEQER 265


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 32.7 bits (75), Expect = 0.24
 Identities = 21/86 (24%), Positives = 41/86 (47%), Gaps = 5/86 (5%)

Query: 207 LQDKLTQVKDARAALNALREEHNEKL---RREAQIAERQKQIQMAHKLDLMRKKKQEYLQ 263
           L++KL +++   + L    EE  ++L     E    E+QKQI      +L  +++ E L+
Sbjct: 265 LEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLANL--ERQLEELE 322

Query: 264 YQRNLALQKIQEQEYEMQRRQEQQKQ 289
            Q      K+ E   E+   +E+ ++
Sbjct: 323 AQLEELESKLDELAEELAELEEKLEE 348



 Score = 32.7 bits (75), Expect = 0.25
 Identities = 16/109 (14%), Positives = 49/109 (44%), Gaps = 10/109 (9%)

Query: 192 KYIQQKDDKR-----VYYEGLQDKLTQVKDARAALNALREEHNEKLRREAQIAERQKQIQ 246
           +Y + K + R     +    L++   ++++ +  L    EE  E     A++ E +++++
Sbjct: 214 RYKELKAELRELELALLVLRLEELREELEELQEELKEAEEELEEL---TAELQELEEKLE 270

Query: 247 MAHKLDLMRKKKQEYLQYQRNLALQKIQEQEYEMQRRQEQQKQYYMQNN 295
              +L  + + ++E  + Q+ L     +    E Q++  +++   ++  
Sbjct: 271 -ELRL-EVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQ 317



 Score = 31.6 bits (72), Expect = 0.58
 Identities = 17/93 (18%), Positives = 42/93 (45%), Gaps = 4/93 (4%)

Query: 202 VYYEGLQDKLTQVKDARAALNALREEHNE----KLRREAQIAERQKQIQMAHKLDLMRKK 257
              + L++   +++  R  L  L  + +       R EA++ + +++I    K     + 
Sbjct: 702 ELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEA 761

Query: 258 KQEYLQYQRNLALQKIQEQEYEMQRRQEQQKQY 290
           + E L+ +   A +++ E E E++  + Q +Q 
Sbjct: 762 EIEELEERLEEAEEELAEAEAEIEELEAQIEQL 794



 Score = 29.6 bits (67), Expect = 2.7
 Identities = 19/83 (22%), Positives = 39/83 (46%), Gaps = 16/83 (19%)

Query: 208 QDKLTQVKDARAALNALREEHNEKLRREAQIAERQKQIQMAHKLDLMRKKKQEYLQYQRN 267
           ++ L +++D    L    E   + L R+A+ AER K++            K E  + +  
Sbjct: 185 RENLDRLEDILNEL----ERQLKSLERQAEKAERYKEL------------KAELRELELA 228

Query: 268 LALQKIQEQEYEMQRRQEQQKQY 290
           L + +++E   E++  QE+ K+ 
Sbjct: 229 LLVLRLEELREELEELQEELKEA 251



 Score = 28.9 bits (65), Expect = 4.5
 Identities = 13/78 (16%), Positives = 34/78 (43%), Gaps = 3/78 (3%)

Query: 213 QVKDARAALNALREEHNEKLRREAQIAERQKQIQMAHKLDLMRKKKQEYLQYQRNLALQK 272
           ++ +   AL  LR+E  E      Q+  R++  +++ ++  +RK           L  ++
Sbjct: 692 KIAELEKALAELRKELEELEEELEQL--RKELEELSRQISALRKDLARLEAEVEQLE-ER 748

Query: 273 IQEQEYEMQRRQEQQKQY 290
           I +   E+   + + ++ 
Sbjct: 749 IAQLSKELTELEAEIEEL 766



 Score = 28.1 bits (63), Expect = 6.9
 Identities = 13/94 (13%), Positives = 45/94 (47%), Gaps = 7/94 (7%)

Query: 195 QQKDDKRVYYEGLQDKLT----QVKDARAALNALREEHNEKLRREAQIAERQKQIQMAHK 250
           ++ ++ +   E L+ +L     ++++  + L  L E+      + AQ+  +   +   ++
Sbjct: 344 EKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLN--NE 401

Query: 251 LDLMRKKKQEYLQYQRNLALQKIQEQEYEMQRRQ 284
           ++ +  + +  L+ +R    Q+I+E   +++  +
Sbjct: 402 IERLEARLER-LEDRRERLQQEIEELLKKLEEAE 434



 Score = 27.7 bits (62), Expect = 8.6
 Identities = 21/105 (20%), Positives = 45/105 (42%), Gaps = 2/105 (1%)

Query: 195 QQKDDKRVYYEGLQDKLTQVKDARAALNALREEHNEKLRREAQIAERQKQIQMAH--KLD 252
            + ++     E L+ K+ Q++   A+LN   E    +L R     ER +Q       KL+
Sbjct: 372 SRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLE 431

Query: 253 LMRKKKQEYLQYQRNLALQKIQEQEYEMQRRQEQQKQYYMQNNVA 297
               K+ +    +    L+++QE+   ++   E+ ++   +   A
Sbjct: 432 EAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQA 476


>gnl|CDD|216807 pfam01956, DUF106, Integral membrane protein DUF106.  This
           archaebacterial protein family has no known function.
           Members are predicted to be integral membrane proteins.
          Length = 168

 Score = 31.5 bits (72), Expect = 0.33
 Identities = 9/49 (18%), Positives = 23/49 (46%), Gaps = 4/49 (8%)

Query: 247 MAHKLDLMRKKKQEYLQYQRNLALQKIQEQEYEMQRRQEQQKQYYMQNN 295
           +  K++  +K+++E  +  R L     +    + ++RQE      M++ 
Sbjct: 38  IDRKMEKYQKREKEIQKRARELRKNGDKLSPKKFEKRQE----ELMEDQ 82


>gnl|CDD|233365 TIGR01347, sucB, 2-oxoglutarate dehydrogenase complex
           dihydrolipoamide succinyltransferase (E2 component).
           This model describes the TCA cycle 2-oxoglutarate system
           E2 component, dihydrolipoamide succinyltransferase. It
           is closely related to the pyruvate dehydrogenase E2
           component, dihydrolipoamide acetyltransferase. The seed
           for this model includes mitochondrial and Gram-negative
           bacterial forms. Mycobacterial candidates are highly
           derived, differ in having and extra copy of the
           lipoyl-binding domain at the N-terminus. They score
           below the trusted cutoff, but above the noise cutoff and
           above all examples of dihydrolipoamide acetyltransferase
           [Energy metabolism, TCA cycle].
          Length = 403

 Score = 32.0 bits (73), Expect = 0.38
 Identities = 19/69 (27%), Positives = 33/69 (47%), Gaps = 13/69 (18%)

Query: 215 KDARAALNALREEHNEKL-RREAQIAERQKQIQ--MA-----HKLDL-----MRKKKQEY 261
             A+A  N  R E   K+ R   +IAER K+ Q   A     +++D+     +RK+ +E 
Sbjct: 162 AAAKAPANFTRPEERVKMTRLRQRIAERLKEAQNSTAMLTTFNEVDMSAVMELRKRYKEE 221

Query: 262 LQYQRNLAL 270
            + +  + L
Sbjct: 222 FEKKHGVKL 230


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 32.0 bits (73), Expect = 0.45
 Identities = 17/95 (17%), Positives = 42/95 (44%), Gaps = 12/95 (12%)

Query: 196 QKDDKRVYYEGLQDKLTQVKDARAALNALREEHNEKLRREAQIAERQKQIQMAHKLDLMR 255
           Q        E L +++++++     +  L EE N+K++   +  E Q +++         
Sbjct: 245 QLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGE--EEQLRVK--------- 293

Query: 256 KKKQEYLQYQRNLALQKIQEQEYEMQRRQEQQKQY 290
            +K   L+ +     + I E+E E++  +E+  + 
Sbjct: 294 -EKIGELEAEIASLERSIAEKERELEDAEERLAKL 327



 Score = 30.4 bits (69), Expect = 1.3
 Identities = 29/170 (17%), Positives = 64/170 (37%), Gaps = 18/170 (10%)

Query: 140 TELDDFIRALKSQVEIFVNRMKS-----NSSRGRSIANDSSVQTLFVNITTMHSRLLKYI 194
            EL+  I   + + +                R      D         +     +L K  
Sbjct: 339 EELEREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLK 398

Query: 195 QQKDDKRVYYEGLQDKLTQ----VKDARAALNALREEHNE-KLRREAQIAE----RQKQI 245
           ++ ++ +   + LQ++L +    + D  AA+  +  + NE +  +E +  E      K  
Sbjct: 399 REINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLE 458

Query: 246 QMAHKLDLMRKK----KQEYLQYQRNLALQKIQEQEYEMQRRQEQQKQYY 291
           Q+A  L    ++    K+EY + ++ L+  + +  E E Q R  +++   
Sbjct: 459 QLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQARASEERVRG 508



 Score = 29.3 bits (66), Expect = 3.2
 Identities = 19/104 (18%), Positives = 52/104 (50%), Gaps = 6/104 (5%)

Query: 188 SRLLKYIQQKDDKRVYYEGLQDKLTQVKDARAALNALREEHNEKLRR-EAQIAERQKQI- 245
           +RL    +  + +    + LQ++   +K+   ++    E  N K    E ++ E +  + 
Sbjct: 822 NRLTLEKEYLEKEI---QELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALR 878

Query: 246 QMAHKLDLMRKKKQEYLQYQRNLALQKIQEQEYEMQRRQEQQKQ 289
            +  +L  ++K++ E L+ Q     +KI+E E ++++++++  +
Sbjct: 879 DLESRLGDLKKERDE-LEAQLRELERKIEELEAQIEKKRKRLSE 921


>gnl|CDD|179798 PRK04239, PRK04239, hypothetical protein; Provisional.
          Length = 110

 Score = 30.2 bits (69), Expect = 0.45
 Identities = 15/39 (38%), Positives = 25/39 (64%), Gaps = 1/39 (2%)

Query: 251 LDLMRKKKQEYLQYQRNLALQKIQEQEYEMQRRQEQQKQ 289
           L+ +R++K E LQ ++    Q+ QE++ E Q + E QKQ
Sbjct: 3   LEEIRRRKLEELQ-KQAQEQQQAQEEQEEAQAQAEAQKQ 40


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 31.4 bits (71), Expect = 0.53
 Identities = 25/141 (17%), Positives = 60/141 (42%), Gaps = 6/141 (4%)

Query: 158 NRMKSNSSRGRSIANDSSVQTLFVNITTMHSRLLKYIQQKDDKRVYYEGLQDKLTQVKDA 217
           N++   +S  RS+ +D  +  L      + ++L   +  K   +   + ++ +L    D+
Sbjct: 434 NQISLLTSLERSLKSD--LGQLKKENDMLQTKLNSMVSAKQKDKQSMQSMEKRLKSEADS 491

Query: 218 RAALNALREEHNEKLRREAQIAER---QKQIQMAHKLDLMRKKKQEYLQYQRNLALQ-KI 273
           R        E  ++ + E + A R   Q         + +++ KQ+     + L    K+
Sbjct: 492 RVNAEKQLAEEKKRKKEEEETAARAAAQAAASREECAESLKQAKQDLEMEIKKLEHDLKL 551

Query: 274 QEQEYEMQRRQEQQKQYYMQN 294
           +E+E  M  ++ Q+ + Y ++
Sbjct: 552 KEEECRMLEKEAQELRKYQES 572


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 31.1 bits (71), Expect = 0.70
 Identities = 23/81 (28%), Positives = 39/81 (48%), Gaps = 11/81 (13%)

Query: 209 DKLTQV---KDARAALNALREEHNEKLRREAQIAERQKQIQMAHKLDLMRKKKQEYLQYQ 265
           DKL  +    +    ++  REE  EK+ + A+  ERQ++ Q   +     KKK+E     
Sbjct: 246 DKLANLKLSPEVLRKVDKTREEEEEKILKAAE-EERQEEAQEKKE----EKKKEEREAKL 300

Query: 266 RNL---ALQKIQEQEYEMQRR 283
             L     +K++E+E + Q R
Sbjct: 301 AKLSPEEQRKLEEKERKKQAR 321


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 30.9 bits (70), Expect = 0.82
 Identities = 12/58 (20%), Positives = 28/58 (48%), Gaps = 4/58 (6%)

Query: 236 AQIAERQKQIQMAHKLDLMRKKKQEYLQYQRNLALQKIQEQEY----EMQRRQEQQKQ 289
           A+ AE Q++ +   + + +++K+    +  + L  +++  QE     E   +Q   KQ
Sbjct: 74  AKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQ 131


>gnl|CDD|216028 pfam00621, RhoGEF, RhoGEF domain.  Guanine nucleotide exchange
           factor for Rho/Rac/Cdc42-like GTPases Also called
           Dbl-homologous (DH) domain. It appears that pfam00169
           domains invariably occur C-terminal to RhoGEF/DH
           domains.
          Length = 179

 Score = 30.3 bits (69), Expect = 0.85
 Identities = 24/132 (18%), Positives = 53/132 (40%), Gaps = 27/132 (20%)

Query: 139 DTELDDFIRALKSQVEIFVNRMKSNSSRGRSIANDSSVQTLFVNITTM---HSRLLKYIQ 195
            TE   ++R LK  VE+F+  ++        I ++  ++T+F NI  +   H   L+ ++
Sbjct: 7   QTERS-YVRDLKILVEVFLKPLRE-----SPILSEEEIKTIFSNIEEILELHQEFLEELE 60

Query: 196 QKDDKRVYYEGLQDKLTQVKD--------------ARAALNALREEHNEKLRREAQIAER 241
           ++ ++    + + D   +                 A   L  LR++ N +     +  + 
Sbjct: 61  ERLEEWPDIQRIGDIFLKFAPFFKVYSTYCSNYPRALELLKKLRKK-NPRFA---KFLKE 116

Query: 242 QKQIQMAHKLDL 253
            +   +   LDL
Sbjct: 117 CEASPLCRGLDL 128


>gnl|CDD|236842 PRK11091, PRK11091, aerobic respiration control sensor protein
           ArcB; Provisional.
          Length = 779

 Score = 30.7 bits (70), Expect = 0.94
 Identities = 14/66 (21%), Positives = 31/66 (46%), Gaps = 17/66 (25%)

Query: 241 RQKQIQMAHKLDLMRKKKQEY-LQYQRNLA----------------LQKIQEQEYEMQRR 283
           RQ+  ++  KL+ MR++  E  +Q + N+A                 +  ++ + E++ R
Sbjct: 84  RQRLSRLVAKLEEMRERDLELNVQLKDNIAQLNQEIAEREKAEEARQEAFEQLKNEIKER 143

Query: 284 QEQQKQ 289
           +E Q +
Sbjct: 144 EETQIE 149


>gnl|CDD|215974 pfam00529, HlyD, HlyD family secretion protein. 
          Length = 304

 Score = 30.5 bits (69), Expect = 0.98
 Identities = 12/56 (21%), Positives = 23/56 (41%), Gaps = 1/56 (1%)

Query: 235 EAQIAERQKQIQMAHKLDLMRKKKQEYLQYQRNLALQKIQ-EQEYEMQRRQEQQKQ 289
           +A +   + ++  A      R  + E  + Q  +A   IQ EQE +  R   ++  
Sbjct: 39  QAALDRAEAELAAAQADAQARLARAERDRLQALVAQAAIQSEQEADAARAALREAL 94


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 30.6 bits (70), Expect = 1.2
 Identities = 15/91 (16%), Positives = 38/91 (41%), Gaps = 11/91 (12%)

Query: 205 EGLQDKLTQVKDARAALNALREEHNEKLRRE-AQIAERQKQIQMAHKLDLMRK------- 256
             L++   +++       AL +E  EKL+ E  +  E+ ++ +     +  ++       
Sbjct: 523 ASLEELERELEQKAEEAEALLKEA-EKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKE 581

Query: 257 KKQEYLQYQRNLALQKIQEQEYEMQRRQEQQ 287
            K+E  +  +   L+++Q+  Y   +  E  
Sbjct: 582 AKKEADEIIKE--LRQLQKGGYASVKAHELI 610



 Score = 29.8 bits (68), Expect = 2.2
 Identities = 16/65 (24%), Positives = 27/65 (41%), Gaps = 11/65 (16%)

Query: 227 EHNEKLRREAQIAERQKQIQMAHKLDLMRKKKQEYLQYQRNLALQKIQEQEYEMQRRQEQ 286
              E+L RE +    QK  +    L    K K+E  +       +K + QE E +  +E 
Sbjct: 523 ASLEELERELE----QKAEEAEALLKEAEKLKEELEE-------KKEKLQEEEDKLLEEA 571

Query: 287 QKQYY 291
           +K+  
Sbjct: 572 EKEAQ 576


>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
          Length = 1486

 Score = 30.7 bits (70), Expect = 1.2
 Identities = 16/66 (24%), Positives = 34/66 (51%), Gaps = 4/66 (6%)

Query: 213 QVKDARAALNALREEHNEKLR--REAQIAERQKQIQMAHKLDLMRKKKQEYLQYQRNLAL 270
           +++  +  L A  E  +E +   RE ++A RQ+  Q+  ++  +  +   +L  Q   AL
Sbjct: 555 ELEQLQEELEARLESLSESVSEARERRMALRQQLEQLQARIQRLAARAPAWLAAQD--AL 612

Query: 271 QKIQEQ 276
            +++EQ
Sbjct: 613 ARLREQ 618


>gnl|CDD|236995 PRK11824, PRK11824, polynucleotide phosphorylase/polyadenylase;
           Provisional.
          Length = 693

 Score = 30.4 bits (70), Expect = 1.2
 Identities = 19/59 (32%), Positives = 31/59 (52%), Gaps = 5/59 (8%)

Query: 203 YYEGLQDKLTQ-VKDAR-AALNALREEHNEKLRREAQIAERQKQIQMA-HKL--DLMRK 256
               L++      K  R AAL+A++EE  E L  E +  E +K+I+ A  KL   ++R+
Sbjct: 249 AEAKLKEAYQITDKQEREAALDAIKEEVLEALAAEEEEEEDEKEIKEAFKKLEKKIVRR 307


>gnl|CDD|219342 pfam07227, DUF1423, Protein of unknown function (DUF1423).  This
           family represents a conserved region approximately 500
           residues long within a number of Arabidopsis thaliana
           proteins of unknown function.
          Length = 446

 Score = 30.5 bits (69), Expect = 1.2
 Identities = 21/80 (26%), Positives = 37/80 (46%), Gaps = 13/80 (16%)

Query: 209 DKLTQVKDARAALNALREEHNEKLRREAQIAERQKQIQMAHKLDLMRKKKQEYLQYQRNL 268
           + + ++K A A +  L+   +E  RREA   ER ++I +A       K ++   +Y    
Sbjct: 356 ESIVRLKQAEADMFQLKA--DEA-RREA---ERLQRIALA-------KTEKSEEEYASKY 402

Query: 269 ALQKIQEQEYEMQRRQEQQK 288
              ++ E E E Q   E+ K
Sbjct: 403 LKLRLSEAEEERQYLFEELK 422


>gnl|CDD|227038 COG4694, COG4694, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 758

 Score = 30.3 bits (68), Expect = 1.5
 Identities = 13/61 (21%), Positives = 25/61 (40%), Gaps = 4/61 (6%)

Query: 193 YIQQKDDKRVYYEGLQDK----LTQVKDARAALNALREEHNEKLRREAQIAERQKQIQMA 248
           Y + ++  + Y E  + K        +   A +NA      EK++ + +I  R  Q  +A
Sbjct: 164 YKKNEERFKEYLENFKRKKFKRKILKEFENAKINASEIVGLEKVKEKIEIVFRHNQTPLA 223

Query: 249 H 249
            
Sbjct: 224 L 224


>gnl|CDD|221626 pfam12537, DUF3735, Protein of unknown function (DUF3735).  This
           domain family is found in eukaryotes, and is
           approximately 70 amino acids in length. There is a
           conserved LSG sequence motif. There is a single
           completely conserved residue G that may be functionally
           important.
          Length = 71

 Score = 27.9 bits (63), Expect = 1.5
 Identities = 10/29 (34%), Positives = 17/29 (58%), Gaps = 2/29 (6%)

Query: 233 RREAQIAERQKQIQMAHKLDLMRKKKQEY 261
             E+ I   +K++   H LDL+ +KK+E 
Sbjct: 44  VSESDINNLEKRLW--HTLDLLAEKKKEL 70


>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.  Trichoplein
           or mitostatin, was first defined as a meiosis-specific
           nuclear structural protein. It has since been linked
           with mitochondrial movement. It is associated with the
           mitochondrial outer membrane, and over-expression leads
           to reduction in mitochondrial motility whereas lack of
           it enhances mitochondrial movement. The activity appears
           to be mediated through binding the mitochondria to the
           actin intermediate filaments (IFs).
          Length = 349

 Score = 29.9 bits (68), Expect = 1.5
 Identities = 24/116 (20%), Positives = 54/116 (46%), Gaps = 19/116 (16%)

Query: 195 QQKDDKRVYYEGLQDKLTQVKDARAALNALR----EEHNEKLRREAQIAERQKQIQMAHK 250
           ++K++K      L+ +  + +D R  L+ LR    +E  E+  R+ +  E +K+ +   +
Sbjct: 175 ERKEEKEREVARLRAQQEEAEDEREELDELRADLYQEEYERKERQKEKEEAEKRRRQKQE 234

Query: 251 LDLMRKK----KQEYLQ---------YQRNLALQKIQEQEYEM--QRRQEQQKQYY 291
           L   R++    K+E LQ          +R L  Q   E+  +   ++R+ ++ ++ 
Sbjct: 235 LQRAREEQIEEKEERLQEERAEEEAERERMLEKQAEDEELEQENAEKRRMKRLEHR 290



 Score = 28.7 bits (65), Expect = 3.9
 Identities = 17/77 (22%), Positives = 40/77 (51%), Gaps = 5/77 (6%)

Query: 222 NALREEHNEKLRREAQIAERQKQIQMAHKLDLMRKKKQEYLQYQRNLALQ-----KIQEQ 276
             ++ E  E+ RR  ++ E ++   +A + +  RK+K+E  + +  L  Q     K +++
Sbjct: 30  KRIKAEEKEEERRIDEMMEEERLKALAEEEERERKRKEERREGRAVLQEQIEEREKRRQE 89

Query: 277 EYEMQRRQEQQKQYYMQ 293
           EYE + ++ +Q    ++
Sbjct: 90  EYEERLQEREQMDEIIE 106



 Score = 28.3 bits (64), Expect = 4.5
 Identities = 17/82 (20%), Positives = 43/82 (52%), Gaps = 5/82 (6%)

Query: 213 QVKDARAALNALREEHNEKLRREAQIAERQKQIQMAHKLDLMR----KKKQEYLQYQRNL 268
           +  +  A    + E+  E    E + AE+++  ++ H+ +L +    K+++   + +  L
Sbjct: 253 ERAEEEAERERMLEKQAEDEELEQENAEKRRMKRLEHRRELEQQIEEKEERRAAEREEEL 312

Query: 269 A-LQKIQEQEYEMQRRQEQQKQ 289
              ++++E+E E Q R E+++Q
Sbjct: 313 EEGERLREEEAERQARIEEERQ 334



 Score = 28.0 bits (63), Expect = 5.7
 Identities = 14/76 (18%), Positives = 40/76 (52%)

Query: 213 QVKDARAALNALREEHNEKLRREAQIAERQKQIQMAHKLDLMRKKKQEYLQYQRNLALQK 272
           ++++   A    + E  +KLR E      ++  +   + +  R+++ + L+YQR  A ++
Sbjct: 107 RIQEEDEAEAQEKREKQKKLREEIDEFNEERIERKEEEKEREREEELKILEYQREKAERE 166

Query: 273 IQEQEYEMQRRQEQQK 288
            + +    +R++E+++
Sbjct: 167 EEREAERRERKEEKER 182


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 943 to 1234 amino
           acids in length. This family contains a P-loop motif
           suggesting it is a nucleotide binding protein. It may be
           involved in replication.
          Length = 1198

 Score = 30.0 bits (68), Expect = 1.6
 Identities = 24/84 (28%), Positives = 37/84 (44%), Gaps = 2/84 (2%)

Query: 205 EGLQDKLTQVKDARAALNALREEH-NEKLRREAQIAERQKQIQMAHKLDLMRKKKQEYLQ 263
               +  T +K AR  L  L+ E  + K + E  IAER++Q +   +  L  + KQ   Q
Sbjct: 642 RAEAEARTALKQARLDLQRLQNEQQSLKDKLELAIAERKQQAETQLR-QLDAQLKQLLEQ 700

Query: 264 YQRNLALQKIQEQEYEMQRRQEQQ 287
            Q  L   K   +E   +R  + Q
Sbjct: 701 QQAFLEALKDDFRELRTERLAKWQ 724


>gnl|CDD|216828 pfam01984, dsDNA_bind, Double-stranded DNA-binding domain.  This
           domain is believed to bind double-stranded DNA of 20
           bases length.
          Length = 105

 Score = 28.3 bits (64), Expect = 1.7
 Identities = 10/36 (27%), Positives = 21/36 (58%), Gaps = 2/36 (5%)

Query: 254 MRKKKQEYLQYQRNLALQKIQEQEYEMQRRQEQQKQ 289
           +R+ +    + Q+     + Q+Q+ E Q+++E QKQ
Sbjct: 1   IRRARLA--ELQQQQGGLEEQQQQEEQQQQEEAQKQ 34


>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 140

 Score = 28.7 bits (65), Expect = 1.8
 Identities = 15/68 (22%), Positives = 34/68 (50%), Gaps = 3/68 (4%)

Query: 212 TQVKDARAALNALREEHNEK--LRREAQIAERQKQIQ-MAHKLDLMRKKKQEYLQYQRNL 268
            +++     L  L+E+  +      EA   +++K++Q    +    ++K Q+ LQ ++  
Sbjct: 32  AELEKLEKELQKLKEKLQKDAATLSEAAREKKEKELQKKVQEFQRKQQKLQQDLQKRQQE 91

Query: 269 ALQKIQEQ 276
            LQKI ++
Sbjct: 92  ELQKILDK 99



 Score = 28.7 bits (65), Expect = 2.0
 Identities = 19/83 (22%), Positives = 43/83 (51%), Gaps = 15/83 (18%)

Query: 213 QVKDARAALNALREEHNEKLRREAQIAERQKQIQ-MAHKLD----LM----RKKKQEYLQ 263
             K A+  L    ++      R+A++ + +K++Q +  KL      +    R+KK++ LQ
Sbjct: 15  AGKAAQKQLEKEFKK------RQAELEKLEKELQKLKEKLQKDAATLSEAAREKKEKELQ 68

Query: 264 YQRNLALQKIQEQEYEMQRRQEQ 286
            +     +K Q+ + ++Q+RQ++
Sbjct: 69  KKVQEFQRKQQKLQQDLQKRQQE 91


>gnl|CDD|216872 pfam02096, 60KD_IMP, 60Kd inner membrane protein. 
          Length = 193

 Score = 29.0 bits (66), Expect = 1.9
 Identities = 11/42 (26%), Positives = 24/42 (57%), Gaps = 7/42 (16%)

Query: 257 KKQEYLQYQRNLALQKIQEQEY--EMQRRQEQQKQYYMQNNV 296
            K + LQ +    +++IQE +Y  + Q+ Q++  + Y ++ V
Sbjct: 30  AKMQELQPE----IKEIQE-KYKDDPQKLQQEMMKLYKEHGV 66


>gnl|CDD|182330 PRK10246, PRK10246, exonuclease subunit SbcC; Provisional.
          Length = 1047

 Score = 29.8 bits (67), Expect = 2.0
 Identities = 23/89 (25%), Positives = 40/89 (44%), Gaps = 8/89 (8%)

Query: 209 DKLTQVKDARAALNALREEHNEKLRREAQIAE-----RQKQIQMAHKLDLMR---KKKQE 260
            +  + +  R  L AL  +   + +R AQ+        Q+Q Q    L+ MR   K+K +
Sbjct: 416 AQHAEQRPLRQRLVALHGQIVPQQKRLAQLQVAIQNVTQEQTQRNAALNEMRQRYKEKTQ 475

Query: 261 YLQYQRNLALQKIQEQEYEMQRRQEQQKQ 289
            L   + +  Q+ + ++ E QR Q Q  Q
Sbjct: 476 QLADVKTICEQEARIKDLEAQRAQLQAGQ 504


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 29.4 bits (66), Expect = 2.1
 Identities = 19/82 (23%), Positives = 37/82 (45%)

Query: 208 QDKLTQVKDARAALNALREEHNEKLRREAQIAERQKQIQMAHKLDLMRKKKQEYLQYQRN 267
           Q KL Q  +      A  +   ++L + A   +  KQ + A K    ++K+ E  + ++ 
Sbjct: 70  QKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQA 129

Query: 268 LALQKIQEQEYEMQRRQEQQKQ 289
              +   E E E + ++E +KQ
Sbjct: 130 AEAKAKAEAEAEKKAKEEAKKQ 151


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 29.5 bits (66), Expect = 2.2
 Identities = 20/96 (20%), Positives = 43/96 (44%), Gaps = 3/96 (3%)

Query: 195 QQKDDKRVYYEGLQDKLTQVKDARAALNALREEHNEKLRRE-AQIAERQKQIQMAHKLDL 253
            Q+   +   +  + K  QV +      A  +E  ++L +E  +  E+QKQ + A K   
Sbjct: 69  SQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQ 128

Query: 254 MRKKKQEYLQYQRNLALQKIQEQEYEMQRRQEQQKQ 289
           + +K+QE  +  R  A ++ ++ E    +   +  +
Sbjct: 129 LEQKQQE--EQARKAAAEQKKKAEAAKAKAAAEAAK 162



 Score = 28.0 bits (62), Expect = 6.1
 Identities = 20/87 (22%), Positives = 40/87 (45%), Gaps = 3/87 (3%)

Query: 204 YEGLQDKLTQVKDARAALNALREEHNEKLR-REAQIAERQKQIQMAHKLDLMRKKKQEYL 262
           Y  +Q + +  K          E+  E+L+ ++A   ER KQ++        ++K+ E  
Sbjct: 64  YGRIQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEA 123

Query: 263 QYQRNLALQKIQEQEYEMQRRQEQQKQ 289
           + Q  L  ++ +EQ    +   EQ+K+
Sbjct: 124 EKQAQLEQKQQEEQ--ARKAAAEQKKK 148


>gnl|CDD|217089 pfam02535, Zip, ZIP Zinc transporter.  The ZIP family consists of
           zinc transport proteins and many putative metal
           transporters. The main contribution to this family is
           from the Arabidopsis thaliana ZIP protein family these
           proteins are responsible for zinc uptake in the plant.
           Also found within this family are C. elegans proteins of
           unknown function which are annotated as being similar to
           human growth arrest inducible gene product, although
           this protein in not found within this family.
          Length = 314

 Score = 29.2 bits (66), Expect = 2.3
 Identities = 12/61 (19%), Positives = 16/61 (26%), Gaps = 3/61 (4%)

Query: 34  HGPHHGQSGYHHQEEDTGNSNPELAKYLDRSYWEQKKVPPNHGSHHGQSGYHHQEEDTGN 93
            G  HG    H  E      + E       S     +       HH   G+   E D  +
Sbjct: 104 RGHSHGHDHGHDGEHSHTVPDEESGA--VESNVVHHEHGEGP-HHHHHEGHEKGESDKES 160

Query: 94  S 94
            
Sbjct: 161 G 161


>gnl|CDD|176480 cd08643, DNA_pol_A_pol_I_B, Polymerase I functions primarily to
           fill DNA gaps that arise during DNA repair,
           recombination and replication.  Family A polymerase
           functions primarily to fill DNA gaps that arise during
           DNA repair, recombination and replication. DNA-dependent
           DNA polymerases can be classified in six main groups
           based upon phylogenetic relationships with E. coli
           polymerase I (classA), E. coli polymerase II (class B),
           E.coli polymerase III (class C), euryarchaaeota
           polymerase II (class D), human polymerase  beta (class
           x), E. coli UmuC/DinB and eukaryotic RAP 30/Xeroderma
           pigmentosum variant (class Y). Family A polymerase are
           found primarily in organisms related to prokaryotes and
           include prokaryotic DNA polymerase I ,mitochondrial
           polymerase delta, and several bacteriphage polymerases
           including those from odd-numbered phage (T3, T5, and
           T7). Prokaryotic Pol Is have two functional domains
           located on the same polypeptide; a 5'-3' polymerase and
           5'-3' exonuclease. Pol I uses its 5' nuclease activity
           to remove the ribonucleotide portion of newly
           synthesized Okazaki fragments and DNA polymerase
           activity to fill in the resulting gap. A combination of
           phylogenomic and signature sequence-based (or phonetic)
           approaches is used to understand the evolutionary
           relationships among bacteria. DNA polymerase I is one of
           the conserved proteins that is used to search for
           protein signatures. The structure of these polymerases
           resembles in overall morphology a cupped human right
           hand, with fingers (which bind an incoming nucleotide
           and interact with the single-stranded template), palm
           (which harbors the catalytic amino acid residues and
           also binds an incoming dNTP) and thumb (which binds
           double-stranded DNA) subdomains.
          Length = 429

 Score = 29.3 bits (66), Expect = 2.5
 Identities = 13/36 (36%), Positives = 22/36 (61%)

Query: 189 RLLKYIQQKDDKRVYYEGLQDKLTQVKDARAALNAL 224
           +L+K +++   KR +  GL  +  +V+ A AALN L
Sbjct: 306 KLIKKVKEAAKKRGHLVGLDGRRIRVRSAHAALNTL 341


>gnl|CDD|184281 PRK13729, PRK13729, conjugal transfer pilus assembly protein TraB;
           Provisional.
          Length = 475

 Score = 29.4 bits (66), Expect = 2.5
 Identities = 11/55 (20%), Positives = 26/55 (47%), Gaps = 9/55 (16%)

Query: 235 EAQIAERQKQI-QMAHKLDLMRKKKQEYLQYQRNLALQKIQEQEYEMQRRQEQQK 288
           +   A+ QKQ  ++  +LD++ K++ +  +        +I++   +     EQ K
Sbjct: 75  QVTAAQMQKQYEEIRRELDVLNKQRGDDQR--------RIEKLGQDNAALAEQVK 121


>gnl|CDD|220964 pfam11068, DUF2869, Protein of unknown function (DUF2869).  This
           bacterial family of proteins has no known function.
          Length = 131

 Score = 28.3 bits (64), Expect = 2.6
 Identities = 19/69 (27%), Positives = 35/69 (50%), Gaps = 2/69 (2%)

Query: 226 EEHNEKLRREAQIAERQKQIQMAHKLDLMRKKKQEYLQYQRNLALQKIQEQ-EYEMQRRQ 284
           E   E+L+ E Q    Q + ++   L+   +K  + ++ Q    +++IQ+Q E E  +R 
Sbjct: 15  ERWKEELQAELQEQLTQLEQELQQ-LEFQGQKAIKEIRKQSAQQIEQIQQQVEQERAKRL 73

Query: 285 EQQKQYYMQ 293
           EQ+ Q   Q
Sbjct: 74  EQKNQLLFQ 82


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 29.4 bits (66), Expect = 2.7
 Identities = 23/106 (21%), Positives = 48/106 (45%), Gaps = 10/106 (9%)

Query: 192 KYIQQKDDKRVYYEGLQDKLTQVKDARAALNALREEHNEKLRREAQIAERQKQIQMAHKL 251
            Y + K++  V  E L       K+        + EH EKL +E +  +++ + +M  +L
Sbjct: 505 FYGEFKEEINVLIEKLSA---LEKELE-----QKNEHLEKLLKEQEKLKKELEQEME-EL 555

Query: 252 DLMRKKKQEYLQYQRNLALQKIQEQEYEMQRRQEQQKQYYMQNNVA 297
               + K+  L+ +   AL K  ++E E   R+ ++K+ +    + 
Sbjct: 556 KERERNKKLELEKEAQEAL-KALKKEVESIIRELKEKKIHKAKEIK 600


>gnl|CDD|222689 pfam14335, DUF4391, Domain of unknown function (DUF4391).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria and archaea.
           Proteins in this family are typically between 220 and
           257 amino acids in length.
          Length = 221

 Score = 28.8 bits (65), Expect = 2.7
 Identities = 17/73 (23%), Positives = 41/73 (56%), Gaps = 2/73 (2%)

Query: 204 YEGLQDKLTQVKDARAALNALREEHNEKLRREAQIAERQKQIQMAHKLDLMRKKKQEYLQ 263
           YE + +++ ++  +       +E+  E++ R  QI E +K+I    K   ++K+KQ   +
Sbjct: 150 YESIVNQIIKLNASSNTGVYEKEDLKERVERLEQIEELEKEIAKLKK--KLKKEKQFNRK 207

Query: 264 YQRNLALQKIQEQ 276
            + NL ++K++++
Sbjct: 208 VELNLEIKKLKKE 220


>gnl|CDD|220494 pfam09969, DUF2203, Uncharacterized conserved protein (DUF2203).
           This domain, found in various hypothetical bacterial
           proteins, has no known function.
          Length = 120

 Score = 28.0 bits (63), Expect = 2.8
 Identities = 13/64 (20%), Positives = 29/64 (45%), Gaps = 2/64 (3%)

Query: 214 VKDARAALNALREEHNEKLRREAQIAERQKQIQMAHKLDLMRKKKQEYLQYQRNL--ALQ 271
           +++A   L  LR    E    +A++ E +  +    +    R  K+E  + +  +  A+ 
Sbjct: 5   LEEANELLPELRRLLEEVREAKAELEELEAALSEGERSLEARGLKEELRELEEQIRAAIA 64

Query: 272 KIQE 275
           +I+E
Sbjct: 65  EIEE 68


>gnl|CDD|220777 pfam10486, PI3K_1B_p101, Phosphoinositide 3-kinase gamma adapter
           protein p101 subunit.  Class I PI3Ks are dual-specific
           lipid and protein kinases involved in numerous
           intracellular signaling pathways. Class IB PI3K,
           p110gamma, is mainly activated by seven-transmembrane
           G-protein-coupled receptors (GPCRs), through its
           regulatory subunit p101 and G-protein beta-gamma
           subunits.
          Length = 856

 Score = 29.2 bits (65), Expect = 2.9
 Identities = 17/67 (25%), Positives = 28/67 (41%)

Query: 137 ASDTELDDFIRALKSQVEIFVNRMKSNSSRGRSIANDSSVQTLFVNITTMHSRLLKYIQQ 196
           A   ELD  +  L   V   V R  S S +  +  + S ++   V I    SRL      
Sbjct: 736 AKYEELDSSMECLNLNVTEVVKRQNSKSKKSFNQISTSQIKVDKVQIIGRDSRLFALCLD 795

Query: 197 KDDKRVY 203
           +D++++ 
Sbjct: 796 QDEQKIL 802


>gnl|CDD|225747 COG3206, GumC, Uncharacterized protein involved in
           exopolysaccharide biosynthesis [Cell envelope
           biogenesis, outer membrane].
          Length = 458

 Score = 28.9 bits (65), Expect = 3.2
 Identities = 28/170 (16%), Positives = 67/170 (39%), Gaps = 25/170 (14%)

Query: 140 TELDDFIRALKSQVEIFVNRMK-SNSSRGRSIAND--SSVQTLFVNITTMHS----RLLK 192
            EL   ++  ++QVE F  +   ++++RG+ ++    S++ T   +     +    RL  
Sbjct: 205 EELRARLQEAEAQVEDFRAQHGLTDAARGQLLSEQQLSALNTQLQSARARLAQAEARLAS 264

Query: 193 YIQQKDDKRVYYEGLQDKLTQVKDARAALNALREEHNEKLRREAQIAER------QKQIQ 246
            +Q     R         L +V ++   +  LR+++ +  ++ A ++        Q    
Sbjct: 265 LLQLLPLGRE-----AAALREVLESPT-IQDLRQQYAQVRQQIADLSTELGAKHPQLVAL 318

Query: 247 MAHKLDLMRKKKQEYLQY------QRNLALQKIQEQEYEMQRRQEQQKQY 290
            A   +L ++   E  Q       +  L  Q+    E E+ + + +  + 
Sbjct: 319 EAQLAELRQQIAAELRQILASLPNELALLEQQEAALEKELAQLKGRLSKL 368


>gnl|CDD|218661 pfam05622, HOOK, HOOK protein.  This family consists of several
           HOOK1, 2 and 3 proteins from different eukaryotic
           organisms. The different members of the human gene
           family are HOOK1, HOOK2 and HOOK3. Different domains
           have been identified in the three human HOOK proteins,
           and it was demonstrated that the highly conserved
           NH2-domain mediates attachment to microtubules, whereas
           the central coiled-coil motif mediates homodimerisation
           and the more divergent C-terminal domains are involved
           in binding to specific organelles (organelle-binding
           domains). It has been demonstrated that endogenous HOOK3
           binds to Golgi membranes, whereas both HOOK1 and HOOK2
           are localised to discrete but unidentified cellular
           structures. In mice the Hook1 gene is predominantly
           expressed in the testis. Hook1 function is necessary for
           the correct positioning of microtubular structures
           within the haploid germ cell. Disruption of Hook1
           function in mice causes abnormal sperm head shape and
           fragile attachment of the flagellum to the sperm head.
          Length = 713

 Score = 29.0 bits (65), Expect = 3.3
 Identities = 20/116 (17%), Positives = 45/116 (38%), Gaps = 12/116 (10%)

Query: 187 HSRLLKYIQQKDDKRVYYEGLQDKLTQVKDARAALNALREEHNEKLRREAQIAER----- 241
           + R+ +  Q  +D       L+ +L         L    E+  + L+ +   +E      
Sbjct: 480 NERITELQQLLEDANRRNNELETQLRLANQRILELQQQVEDLQKALQEQGSKSEDSSLLK 539

Query: 242 ---QKQIQMAHKLDLMRKKKQEYL----QYQRNLALQKIQEQEYEMQRRQEQQKQY 290
              ++ ++  H+ +   +KK+E +      Q     +KI E E  +Q++ E  +  
Sbjct: 540 SKLEEHLEQLHEANEELQKKREQIEELEPDQDQNLSRKIAELEAALQKKDEDMRAM 595


>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200).  This
           family is found in eukaryotes. It is a coiled-coil
           domain of unknwon function.
          Length = 126

 Score = 28.0 bits (63), Expect = 3.5
 Identities = 18/87 (20%), Positives = 49/87 (56%), Gaps = 11/87 (12%)

Query: 209 DKLTQVKDARAALNALREEHNEKLRREAQIAERQKQIQMAHKLDLMRKKKQEYLQY---- 264
           +K  ++++ + AL+A REE     RRE  + +R++  ++  K + +++   ++ ++    
Sbjct: 4   EKRREMEEVQLALDAKREEFE---RREELLKQREE--ELEKKEEELQESLIKFDKFLKEN 58

Query: 265 --QRNLALQKIQEQEYEMQRRQEQQKQ 289
             +R  A +K +E++   + ++E+ K+
Sbjct: 59  EAKRRRAEKKAEEEKKLRKEKEEEIKE 85


>gnl|CDD|237560 PRK13922, PRK13922, rod shape-determining protein MreC;
           Provisional.
          Length = 276

 Score = 28.8 bits (65), Expect = 3.5
 Identities = 18/67 (26%), Positives = 35/67 (52%), Gaps = 3/67 (4%)

Query: 207 LQDKLTQVKDARAALNALREEHNEKLRREAQIAERQKQIQMAHKLDLMRKKKQEYLQYQR 266
            ++ ++ V ++ A+L  LREE NE+L++E    E + Q     + +  R ++   L  + 
Sbjct: 57  PREFVSGVFESLASLFDLREE-NEELKKELLELESRLQELEQLEAENARLREL--LNLKE 113

Query: 267 NLALQKI 273
           +L  Q I
Sbjct: 114 SLDYQFI 120


>gnl|CDD|132062 TIGR03017, EpsF, chain length determinant protein EpsF.  Sequences
           in this family of proteins are members of the chain
           length determinant family (pfam02706) which includes the
           wzc protein from E.coli. This family of proteins are
           homologous to the EpsF protein of the methanolan
           biosynthesis operon of Methylobacillus species strain
           12S. The distribution of this protein appears to be
           restricted to a subset of exopolysaccharide operons
           containing a syntenic grouping of genes including a
           variant of the EpsH exosortase protein. Exosortase has
           been proposed to be involved in the targetting and
           processing of proteins containing the PEP-CTERM domain
           to the exopolysaccharide layer.
          Length = 444

 Score = 28.6 bits (64), Expect = 3.8
 Identities = 35/179 (19%), Positives = 71/179 (39%), Gaps = 22/179 (12%)

Query: 130 GVTSPDEASDTE---LDDFIRALK-SQVEIFVNRMKSNSSRGRS----IANDSSVQTLFV 181
           G+ S DE  D E   L++    L  +Q ++     K   S G+     +  +  +Q L  
Sbjct: 202 GIVSSDERLDVERARLNELSAQLVAAQAQVMDASSKEGGSSGKDALPEVIANPIIQNLKT 261

Query: 182 NITTMHSRLLKYIQQKDDKRVYYEGLQDKLTQVKDARAALNALREEHNEKLRREAQIAER 241
           +I    S+L +  Q+       Y+  Q          A +N+L+ + N ++++       
Sbjct: 262 DIARAESKLAELSQRLGPNHPQYKRAQ----------AEINSLKSQLNAEIKKVTSSVGT 311

Query: 242 QKQIQMAHKLDL---MRKKKQEYLQYQRNLALQKIQEQEYEM-QRRQEQQKQYYMQNNV 296
             +I    + +L   +  +K + L+  R      + +++ E  QR  +   Q Y Q  +
Sbjct: 312 NSRILKQREAELREALENQKAKVLELNRQRDEMSVLQRDVENAQRAYDAAMQRYTQTRI 370


>gnl|CDD|153312 cd07628, BAR_Atg24p, The Bin/Amphiphysin/Rvs (BAR) domain of yeast
           Sorting Nexin Atg24p.  BAR domains are dimerization,
           lipid binding and curvature sensing modules found in
           many different proteins with diverse functions. Sorting
           nexins (SNXs) are Phox homology (PX) domain containing
           proteins that are involved in regulating membrane
           traffic and protein sorting in the endosomal system.
           SNXs differ from each other in their lipid-binding
           specificity, subcellular localization and specific
           function in the endocytic pathway. A subset of SNXs also
           contain BAR domains. The PX-BAR structural unit
           determines the specific membrane targeting of SNXs.
           Atg24p is involved in membrane fusion events at the
           vacuolar surface during pexophagy. BAR domains form
           dimers that bind to membranes, induce membrane bending
           and curvature, and may also be involved in
           protein-protein interactions.
          Length = 185

 Score = 28.4 bits (64), Expect = 3.8
 Identities = 20/74 (27%), Positives = 37/74 (50%), Gaps = 14/74 (18%)

Query: 183 ITTMHSRLLKYIQQ-------KDDKRVYYEGLQDKLTQ--VKDARAALNALREEHNEKLR 233
           +T++   LL YI         +D K++ YE L D L    V++A+   +A     N+++ 
Sbjct: 89  LTSLKD-LLHYILSLKNLIKLRDQKQLDYEELSDYLLTDEVENAKETSDAF----NKEVL 143

Query: 234 REAQIAERQKQIQM 247
           +E    ER K+ ++
Sbjct: 144 KEYPNFERIKKQEI 157


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 28.9 bits (65), Expect = 3.9
 Identities = 21/101 (20%), Positives = 45/101 (44%), Gaps = 2/101 (1%)

Query: 195 QQKDDKRVYYEGLQDKLTQVKDARAALNALREEHNEKLRREAQIAE-RQKQIQMAHKLDL 253
           ++ ++ R     L  KL ++K+   +L  +R         E +I   R+K+  +A   D 
Sbjct: 565 EEAEEAREEVAELNSKLAELKERIESLERIRTLLAAIADAEDEIERLREKREALAELNDE 624

Query: 254 MRKKKQEYLQYQRNLALQKIQEQEYEMQRRQEQQKQYYMQN 294
            R++  E  + +R L   +  E   E  R  +++ + Y++ 
Sbjct: 625 RRERLAEKRERKRELE-AEFDEARIEEAREDKERAEEYLEQ 664


>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
          Length = 1113

 Score = 28.7 bits (65), Expect = 4.0
 Identities = 21/79 (26%), Positives = 35/79 (44%), Gaps = 12/79 (15%)

Query: 212 TQVKDARAALNALREEHNEKLRRE-----AQIAERQKQIQMAHKLDLMRKKKQEYLQYQR 266
             +K  +    ALR      L+ E     AQ   ++K ++   +L       Q+ LQ QR
Sbjct: 177 NLLKGGKVGGKALRPSQRVLLQAEQALLNAQNDLQRKSLEGNTQL-------QDLLQKQR 229

Query: 267 NLALQKIQEQEYEMQRRQE 285
           +    +IQ  E+++Q  QE
Sbjct: 230 DYLTARIQRLEHQLQLLQE 248


>gnl|CDD|238945 cd01987, USP_OKCHK, USP domain is located between the N-terminal
           sensor domain and C-terminal catalytic domain of this
           Osmosensitive K+ channel histidine kinase family. The
           family of KdpD sensor kinase proteins regulates the
           kdpFABC operon responsible for potassium transport. The
           USP domain is homologous to the universal stress protein
           Usp Usp is a small cytoplasmic bacterial protein whose
           expression is enhanced when the cell is exposed to
           stress agents. Usp enhances the rate of cell survival
           during prolonged exposure to such conditions, and may
           provide a general "stress endurance" activity.
          Length = 124

 Score = 27.5 bits (62), Expect = 4.2
 Identities = 12/49 (24%), Positives = 24/49 (48%), Gaps = 8/49 (16%)

Query: 228 HNEKL-RREAQIAERQK------QIQMAHKLDLMRKKKQEYLQYQRNLA 269
           + E+L RR A++A+R K       ++    L+ + + ++  L     LA
Sbjct: 11  NAERLIRRAARLADRLKAPWYVVYVETPR-LNRLSEAERRRLAEALRLA 58


>gnl|CDD|237666 PRK14296, PRK14296, chaperone protein DnaJ; Provisional.
          Length = 372

 Score = 28.8 bits (64), Expect = 4.3
 Identities = 16/69 (23%), Positives = 33/69 (47%), Gaps = 1/69 (1%)

Query: 200 KRVYYEGLQ-DKLTQVKDARAALNALREEHNEKLRREAQIAERQKQIQMAHKLDLMRKKK 258
           K+ YYE L   K    ++ R A   L ++++  L +     ++  +I  A  + L + K+
Sbjct: 3   KKDYYEVLGVSKTASEQEIRQAYRKLAKQYHPDLNKSPDAHDKMVEINEAADVLLDKDKR 62

Query: 259 QEYLQYQRN 267
           ++Y Q+   
Sbjct: 63  KQYDQFGHA 71


>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 257 to 277 amino acids in length. This domain is
           found associated with pfam00004. This domain has a
           conserved LER sequence motif.
          Length = 276

 Score = 28.2 bits (63), Expect = 4.5
 Identities = 19/88 (21%), Positives = 44/88 (50%), Gaps = 1/88 (1%)

Query: 207 LQDKLTQVKDARAALNALREEHNEKLRREAQIAERQKQIQMA-HKLDLMRKKKQEYLQYQ 265
           L+ K+ + +  +A     R     + RR+    + Q++ Q A ++ +L RK+ Q+ L+ Q
Sbjct: 68  LEAKIKEYEAQQAQAKLERARVEAEERRKTLQEQTQQEQQRAQYQDELARKRYQKELEQQ 127

Query: 266 RNLALQKIQEQEYEMQRRQEQQKQYYMQ 293
           R    + ++ QE  + R++  ++    +
Sbjct: 128 RRQNEELLKMQEESVLRQEAMRRATEEE 155


>gnl|CDD|224340 COG1422, COG1422, Predicted membrane protein [Function unknown].
          Length = 201

 Score = 28.1 bits (63), Expect = 4.7
 Identities = 13/50 (26%), Positives = 27/50 (54%), Gaps = 8/50 (16%)

Query: 240 ERQKQIQMAHKLDLMR--KKKQEYLQYQRNL-ALQKIQEQEYEMQRRQEQ 286
           E+ K++Q      +M+  +K+    Q   ++  L+K+QE++ EM   Q +
Sbjct: 72  EKMKELQ-----KMMKEFQKEFREAQESGDMKKLKKLQEKQMEMMDDQRE 116


>gnl|CDD|185267 PRK15369, PRK15369, two component system sensor kinase SsrB;
           Provisional.
          Length = 211

 Score = 27.7 bits (62), Expect = 6.0
 Identities = 22/74 (29%), Positives = 29/74 (39%), Gaps = 13/74 (17%)

Query: 209 DKLTQVKDARAALNALREEHNEKL----RREAQIAER----QKQIQMAHKLDLMRK---- 256
             L    D    L   RE    KL         IAE+     K ++  H+L++MRK    
Sbjct: 137 LALLNADDTNPPLLTPRERQILKLITEGYTNRDIAEQLSISIKTVE-THRLNMMRKLDVH 195

Query: 257 KKQEYLQYQRNLAL 270
           K  E L + R L L
Sbjct: 196 KVAELLNWARRLGL 209


>gnl|CDD|222504 pfam14027, DUF4243, Protein of unknown function (DUF4243).  This
          family of proteins is functionally uncharacterized.
          This family of proteins is found in bacteria and
          eukaryotes. Proteins in this family are typically
          between 348 and 477 amino acids in length.
          Length = 322

 Score = 28.1 bits (63), Expect = 6.2
 Identities = 11/45 (24%), Positives = 15/45 (33%), Gaps = 5/45 (11%)

Query: 33 NHGPHHGQSGYHHQEEDTGNSNPELAKYLDRSYWEQKKVPPNHGS 77
          NH PH   + Y       G S  +L +  DR     +   P    
Sbjct: 4  NHLPHALLALYAL-----GASPEQLQRAYDRYSSYLEPAVPPDED 43


>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3.  This family
           consists of several radial spoke protein 3 (RSP3)
           sequences. Eukaryotic cilia and flagella present in
           diverse types of cells perform motile, sensory, and
           developmental functions in organisms from protists to
           humans. They are centred by precisely organised,
           microtubule-based structures, the axonemes. The axoneme
           consists of two central singlet microtubules, called the
           central pair, and nine outer doublet microtubules. These
           structures are well-conserved during evolution. The
           outer doublet microtubules, each composed of A and B
           sub-fibres, are connected to each other by nexin links,
           while the central pair is held at the centre of the
           axoneme by radial spokes. The radial spokes are T-shaped
           structures extending from the A-tubule of each outer
           doublet microtubule to the centre of the axoneme. Radial
           spoke protein 3 (RSP3), is present at the proximal end
           of the spoke stalk and helps in anchoring the radial
           spoke to the outer doublet. It is thought that radial
           spokes regulate the activity of inner arm dynein through
           protein phosphorylation and dephosphorylation.
          Length = 288

 Score = 27.7 bits (62), Expect = 6.5
 Identities = 11/38 (28%), Positives = 25/38 (65%), Gaps = 1/38 (2%)

Query: 251 LDLMRKKKQEYLQYQRNLALQKIQEQEYEMQRRQEQQK 288
           L  +R++++++ Q +RN  L + Q  E   +RR+E+++
Sbjct: 154 LAELRQQQRQFEQ-RRNAELAETQRLEEAERRRREEKE 190


>gnl|CDD|169691 PRK09173, PRK09173, F0F1 ATP synthase subunit B; Validated.
          Length = 159

 Score = 27.4 bits (61), Expect = 6.5
 Identities = 25/82 (30%), Positives = 38/82 (46%), Gaps = 22/82 (26%)

Query: 216 DARAALNALREEHNE--KLRREAQ--IAERQKQIQMAHK----------------LDLMR 255
           DARA    ++ E  E  +LR EAQ  +AE Q++ + A K                    +
Sbjct: 32  DARAD--RIKNELAEARRLREEAQQLLAEYQRKRKEAEKEAADIVAAAEREAEALTAEAK 89

Query: 256 KKKQEYLQYQRNLALQKIQEQE 277
           +K +EY+  +  LA QKI + E
Sbjct: 90  RKTEEYVARRNKLAEQKIAQAE 111


>gnl|CDD|223683 COG0610, COG0610, Type I site-specific restriction-modification
           system, R (restriction) subunit and related helicases
           [Defense mechanisms].
          Length = 962

 Score = 28.1 bits (63), Expect = 6.7
 Identities = 15/83 (18%), Positives = 33/83 (39%), Gaps = 13/83 (15%)

Query: 189 RLLKYIQQKDDKRVYYEGLQDKL-----------TQVKDARAALNALREEHNEKLRREAQ 237
            +L+ +Q  +DK   YE L++ +            Q K     L     ++  K    A+
Sbjct: 817 YILELLQTFNDKNGAYESLKELIERIIKEWIEDLRQKKKLIERLIEAINQYRAKKLDTAE 876

Query: 238 IAERQKQIQMAHKLDLMRKKKQE 260
             + ++   +A K +  ++  +E
Sbjct: 877 --KLEELYILAKKEEEFKQFAEE 897


>gnl|CDD|219139 pfam06691, DUF1189, Protein of unknown function (DUF1189).  This
           family consists of several hypothetical bacterial
           proteins of around 260 residues in length. The function
           of this family is unknown.
          Length = 250

 Score = 27.7 bits (62), Expect = 6.8
 Identities = 7/37 (18%), Positives = 14/37 (37%)

Query: 121 TDHHMNHTSGVTSPDEASDTELDDFIRALKSQVEIFV 157
            +  +   S        +  +L +FI  L+S   I +
Sbjct: 124 ANGQVQEYSYSLLGLSLTKKDLQNFINQLQSLFGIII 160


>gnl|CDD|182745 PRK10803, PRK10803, tol-pal system protein YbgF; Provisional.
          Length = 263

 Score = 27.8 bits (62), Expect = 6.9
 Identities = 19/63 (30%), Positives = 31/63 (49%), Gaps = 16/63 (25%)

Query: 183 ITTMHSRLLKYIQQKDDKRVYYEGLQDKLTQVKDARAALNALREEHNEKLRREAQIAERQ 242
           I+  HS+LL  +QQ                Q+ D ++ +++LR +  E   +  Q+ ERQ
Sbjct: 48  ISNAHSQLLTQLQQ----------------QLSDNQSDIDSLRGQIQENQYQLNQVVERQ 91

Query: 243 KQI 245
           KQI
Sbjct: 92  KQI 94


>gnl|CDD|200475 cd11336, AmyAc_MTSase, Alpha amylase catalytic domain found in
           maltooligosyl trehalose synthase (MTSase).
           Maltooligosyl trehalose synthase (MTSase) domain. MTSase
           and maltooligosyl trehalose trehalohydrolase (MTHase)
           work together to produce trehalose. MTSase is
           responsible for converting the alpha-1,4-glucosidic
           linkage to an alpha,alpha-1,1-glucosidic linkage at the
           reducing end of the maltooligosaccharide through an
           intramolecular transglucosylation reaction, while MTHase
           hydrolyzes the penultimate alpha-1,4 linkage of the
           reducing end, resulting in the release of trehalose. The
           Alpha-amylase family comprises the largest family of
           glycoside hydrolases (GH), with the majority of enzymes
           acting on starch, glycogen, and related oligo- and
           polysaccharides. These proteins catalyze the
           transformation of alpha-1,4 and alpha-1,6 glucosidic
           linkages with retention of the anomeric center. The
           protein is described as having 3 domains: A, B, C. A is
           a (beta/alpha) 8-barrel; B is a loop between the beta 3
           strand and alpha 3 helix of A; C is the C-terminal
           extension characterized by a Greek key. The majority of
           the enzymes have an active site cleft found between
           domains A and B where a triad of catalytic residues
           (Asp, Glu and Asp) performs catalysis. Other members of
           this family have lost the catalytic activity as in the
           case of the human 4F2hc, or only have 2 residues that
           serve as the catalytic nucleophile and the acid/base,
           such as Thermus A4 beta-galactosidase with 2 Glu
           residues (GH42) and human alpha-galactosidase with 2 Asp
           residues (GH31). The family members are quite extensive
           and include: alpha amylase, maltosyltransferase,
           cyclodextrin glycotransferase, maltogenic amylase,
           neopullulanase, isoamylase, 1,4-alpha-D-glucan
           maltotetrahydrolase, 4-alpha-glucotransferase,
           oligo-1,6-glucosidase, amylosucrase, sucrose
           phosphorylase, and amylomaltase.
          Length = 660

 Score = 27.8 bits (63), Expect = 7.1
 Identities = 16/53 (30%), Positives = 20/53 (37%), Gaps = 10/53 (18%)

Query: 216 DARAALNALRE---EHNEKLRREAQIAERQKQIQMAHKLDLMRKKKQEYLQYQ 265
           D RA L  L E   E  E +RR  ++    +        D       EYL YQ
Sbjct: 453 DVRARLAVLSELPEEWAEAVRRWRRLNAPLRTGPAPDPND-------EYLLYQ 498


>gnl|CDD|184604 PRK14283, PRK14283, chaperone protein DnaJ; Provisional.
          Length = 378

 Score = 27.9 bits (62), Expect = 7.1
 Identities = 19/67 (28%), Positives = 34/67 (50%), Gaps = 1/67 (1%)

Query: 199 DKRVYYEGLQ-DKLTQVKDARAALNALREEHNEKLRREAQIAERQKQIQMAHKLDLMRKK 257
           +KR YYE L  D+    K+ + A   L  +++  +  E    E+ K+I  A+ +    +K
Sbjct: 3   EKRDYYEVLGVDRNADKKEIKKAYRKLARKYHPDVSEEEGAEEKFKEISEAYAVLSDDEK 62

Query: 258 KQEYLQY 264
           +Q Y Q+
Sbjct: 63  RQRYDQF 69


>gnl|CDD|153331 cd07647, F-BAR_PSTPIP, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Proline-Serine-Threonine
           Phosphatase-Interacting Proteins.  F-BAR domains are
           dimerization modules that bind and bend membranes and
           are found in proteins involved in membrane dynamics and
           actin reorganization. Vetebrates contain two
           Proline-Serine-Threonine Phosphatase-Interacting
           Proteins (PSTPIPs), PSTPIP1 and PSTPIP2. PSTPIPs are
           mainly expressed in hematopoietic cells and are involved
           in the regulation of cell adhesion and motility.
           Mutations in PSTPIPs have been shown to cause
           autoinflammatory disorders. PSTPIP1 contains an
           N-terminal F-BAR domain, PEST motifs, and a C-terminal
           SH3 domain, while PSTPIP2 contains only the N-terminal
           F-BAR domain. F-BAR domains form banana-shaped dimers
           with a positively-charged concave surface that binds to
           negatively-charged lipid membranes. They can induce
           membrane deformation in the form of long tubules.
          Length = 239

 Score = 27.4 bits (61), Expect = 7.3
 Identities = 10/48 (20%), Positives = 24/48 (50%)

Query: 229 NEKLRREAQIAERQKQIQMAHKLDLMRKKKQEYLQYQRNLALQKIQEQ 276
            E+  +  +  E+QK+ +   +  + R +K +   Y++ +  +K  EQ
Sbjct: 84  REEAEKLEEFREKQKEERKKTEDIMKRSQKNKKELYKKTMKAKKSYEQ 131


>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572).  Family of
           eukaryotic proteins with undetermined function.
          Length = 321

 Score = 27.8 bits (62), Expect = 7.9
 Identities = 11/84 (13%), Positives = 39/84 (46%), Gaps = 6/84 (7%)

Query: 215 KDARAALNALREEHNEKL------RREAQIAERQKQIQMAHKLDLMRKKKQEYLQYQRNL 268
           K        + +E  E+L      + E + A+ ++++++  +L+ +++ +        N 
Sbjct: 112 KLDEEQEERVEKEREEELAGDAMKKLENRTADSKREMEVLERLEELKELQSRRADVDVNS 171

Query: 269 ALQKIQEQEYEMQRRQEQQKQYYM 292
            L+ +  +E + +  +E++ +  +
Sbjct: 172 MLEALFRREKKEEEEEEEEDEALI 195


>gnl|CDD|198287 cd03177, GST_C_Delta_Epsilon, C-terminal, alpha helical domain of
           Class Delta and Epsilon Glutathione S-transferases.
           Glutathione S-transferase (GST) C-terminal domain
           family, Class Delta and Epsilon subfamily; GSTs are
           cytosolic dimeric proteins involved in cellular
           detoxification by catalyzing the conjugation of
           glutathione (GSH) with a wide range of endogenous and
           xenobiotic alkylating agents, including carcinogens,
           therapeutic drugs, environmental toxins and products of
           oxidative stress. GSTs also show GSH peroxidase activity
           and are involved in the synthesis of prostaglandins and
           leukotrienes. The GST fold contains an N-terminal
           thioredoxin-fold domain and a C-terminal alpha helical
           domain, with an active site located in a cleft between
           the two domains. GSH binds to the N-terminal domain
           while the hydrophobic substrate occupies a pocket in the
           C-terminal domain. The class Delta and Epsilon subfamily
           is made up primarily of insect GSTs, which play major
           roles in insecticide resistance by facilitating
           reductive dehydrochlorination of insecticides or
           conjugating them with GSH to produce water-soluble
           metabolites that are easily excreted. They are also
           implicated in protection against cellular damage by
           oxidative stress.
          Length = 117

 Score = 26.7 bits (60), Expect = 8.1
 Identities = 16/64 (25%), Positives = 28/64 (43%), Gaps = 12/64 (18%)

Query: 168 RSIANDSSVQTLFVNITTMHSRLLKYIQQKDDKRVYYEGL---QDKLTQVKDARAALNAL 224
           R+I N    Q LF +  T++ RL  Y        + + G    ++KL ++++A   L   
Sbjct: 3   RAIVN----QRLFFDSGTLYQRLRDYYY-----PILFGGAEPPEEKLDKLEEALEFLETF 53

Query: 225 REEH 228
            E  
Sbjct: 54  LEGS 57


>gnl|CDD|227606 COG5281, COG5281, Phage-related minor tail protein [Function
           unknown].
          Length = 833

 Score = 27.7 bits (61), Expect = 9.9
 Identities = 15/55 (27%), Positives = 27/55 (49%), Gaps = 2/55 (3%)

Query: 236 AQIAERQKQIQMAHKLDLMRKKKQEYLQYQRNLA-LQKIQEQEYEMQRRQEQQKQ 289
           A + E  KQ Q+  +  L+ + K+E L+Y   LA L   Q   +E+  +    ++
Sbjct: 526 ALLKEEAKQRQLQEQKALL-EHKKETLEYTSQLAELLDQQADRFELSAQAAGSQK 579


>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
           and conversion].
          Length = 161

 Score = 26.9 bits (60), Expect = 10.0
 Identities = 15/102 (14%), Positives = 44/102 (43%), Gaps = 20/102 (19%)

Query: 189 RLLKYIQQKDDKRVYYEGLQDKLTQVKDARAALNALREEHNEKL---RREAQ------IA 239
            +LK + ++  K      + D L + +  +    AL  E+ ++L   R +A         
Sbjct: 30  PILKALDERQAK------IADDLAEAERLKEEAQALLAEYEQELEEAREQASEIIEQAKK 83

Query: 240 ERQKQIQMAHK-----LDLMRKKKQEYLQYQRNLALQKIQEQ 276
           E ++  +         L+ +++  +  ++ ++  AL++++ +
Sbjct: 84  EAEQIAEEIKAEAEEELERIKEAAEAEIEAEKERALEELRAE 125


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.312    0.126    0.360 

Gapped
Lambda     K      H
   0.267   0.0788    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 14,758,560
Number of extensions: 1376156
Number of successful extensions: 2176
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1938
Number of HSP's successfully gapped: 314
Length of query: 297
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 201
Effective length of database: 6,679,618
Effective search space: 1342603218
Effective search space used: 1342603218
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 59 (26.4 bits)