RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy13461
         (194 letters)



>gnl|CDD|149648 pfam08662, eIF2A, Eukaryotic translation initiation factor eIF2A.
           This is a family of eukaryotic translation initiation
           factors.
          Length = 194

 Score = 80.0 bits (198), Expect = 6e-19
 Identities = 24/83 (28%), Positives = 43/83 (51%), Gaps = 1/83 (1%)

Query: 6   GTHIVLAELRDT-GSLEFVDTGDFSVMNAAEHHQATDVEWDPTGRYVMSGVSLWKTKADT 64
           G  ++LA   +  G +EF D  +   +  AE   ATD EW P GRY ++  +  + + D 
Sbjct: 112 GRLVLLAGFGNLAGQIEFWDVKNKKKIATAEASNATDCEWSPDGRYFLTATTSPRLRVDN 171

Query: 65  GYWQWSFQGKIIKRFNSPTFCQL 87
           G+  W + GK++ +++     Q+
Sbjct: 172 GFKIWHYSGKLVYKYDFDELYQV 194


>gnl|CDD|227657 COG5354, COG5354, Uncharacterized protein, contains Trp-Asp (WD)
           repeat [General function prediction only].
          Length = 561

 Score = 73.0 bits (179), Expect = 3e-15
 Identities = 38/165 (23%), Positives = 68/165 (41%), Gaps = 5/165 (3%)

Query: 34  AEHHQATDVEWDPTGRYVMSGVSLWKTKADTGYWQWSFQGKIIKRFNSPTFCQLRWRPRP 93
           A+  + T++ WDP+G+YV +  S  K K + GY  +   G +        F    WRP P
Sbjct: 397 AKVFELTNITWDPSGQYVTTSSSCPKHKVEHGYKIFKIAGALYPEEARGGFKNFAWRPEP 456

Query: 94  ASLLSKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETW 153
            S L+ E   K  K  +     FE       + A++    ++++L +Q+ E  ++ +   
Sbjct: 457 PSKLTIESAKKYVKPSRHRFVPFEKAVIMEADSANRSSAPRKKELVEQWPEYSDEDKIR- 515

Query: 154 EAEKEERKY----LRNLVDTDELDSENVEEEVVEFVIKEEITICE 194
              K+ R       R     +    +  + E  E V+ E   + E
Sbjct: 516 SLLKKLRAIEALKERMRSGEELEVIQVNKIETEEEVLSELKELVE 560


>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342).  This
           family of bacterial proteins has no known function. The
           proteins are in the region of 500-600 amino acid
           residues in length.
          Length = 450

 Score = 34.5 bits (80), Expect = 0.025
 Identities = 19/67 (28%), Positives = 33/67 (49%), Gaps = 4/67 (5%)

Query: 99  KEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWEAEKE 158
           +E+++KIKK LKK       +         +E +EK  +  ++  E  E+L E  +  KE
Sbjct: 347 EEELEKIKKLLKKLPKKARGQLPPEK----REQLEKLLETKEKLSEELEELEEELKELKE 402

Query: 159 ERKYLRN 165
           E + L +
Sbjct: 403 ELESLYS 409


>gnl|CDD|200413 TIGR04162, exo_VPEID, exosortase E/protease, VPEID-CTERM system.
           Members of this protein family are fusion proteins of
           exosortase (N-terminal) and a CAAX prenyl protease
           domain (C-terminal). Members are restricted to the alpha
           Proteobacteria. The variant C-terminal protein sequence
           VPEID-CTERM occurs only in these species, often
           adjacent.
          Length = 519

 Score = 33.6 bits (77), Expect = 0.059
 Identities = 20/53 (37%), Positives = 27/53 (50%), Gaps = 6/53 (11%)

Query: 50  YVMSGV---SLWKTKADTGY-WQWSFQGKIIKRFNSPTFCQLRWRPRPASLLS 98
           +++SGV   SLW   A  GY  + +F    +  F  P    L WRP PA LL+
Sbjct: 337 FMLSGVLVQSLWANPAQ-GYPLRAAFMAAGLWLF-WPALRALDWRPGPADLLT 387


>gnl|CDD|147573 pfam05461, ApoL, Apolipoprotein L.  Apo L belongs to the high
           density lipoprotein family that plays a central role in
           cholesterol transport. The cholesterol content of
           membranes is important in cellular processes such as
           modulating gene transcription and signal transduction
           both in the adult brain and during neurodevelopment.
           There are six apo L genes located in close proximity to
           each other on chromosome 22q12 in humans. 22q12 is a
           confirmed high-susceptibility locus for schizophrenia
           and close to the region associated with velocardiofacial
           syndrome that includes symptoms of schizophrenia.
          Length = 313

 Score = 32.7 bits (75), Expect = 0.10
 Identities = 17/58 (29%), Positives = 33/58 (56%), Gaps = 4/58 (6%)

Query: 94  ASLLSKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRE 151
            + LS+E+ D ++++LKK T     +D+ R+ K  ++    R +  K+F  L+ +L E
Sbjct: 23  EAELSREEADALREALKKLTTDLAIEDKDRLQKDQQD----RERFLKEFPRLKAELEE 76


>gnl|CDD|236686 PRK10416, PRK10416, signal recognition particle-docking protein
           FtsY; Provisional.
          Length = 318

 Score = 32.4 bits (75), Expect = 0.11
 Identities = 25/106 (23%), Positives = 41/106 (38%), Gaps = 30/106 (28%)

Query: 103 DKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQ-------FEELREKL------ 149
             +KK  K+    +  + ++ ++K  +   E    LF +        EEL E L      
Sbjct: 3   SWLKKKKKEKKEGWFERLKKGLSKTRENFGEGINGLFAKKKIDEDLLEELEELLIEADVG 62

Query: 150 RETWEAEK-----EERKYLRNLVDTDELDSENVEEEVVEFVIKEEI 190
            ET   E+      ER   +NL D +EL            ++KEE+
Sbjct: 63  VET--TEEIIEELRERVKRKNLKDPEELKE----------LLKEEL 96


>gnl|CDD|188955 cd09556, SAM_VTS1_fungal, SAM domain of VTS1 RNA-binding proteins. 
           SAM (sterile alpha motif) domain of VTS1 subfamily
           proteins is RNA binding domain located in the C-terminal
           region. SAM interacts with stem-loop structures of mRNA.
           Proteins of this subfamily participate in regulation of
           transcript stability and degradation, and also may be
           involved in vacuolar protein transport regulation. VTS1
           protein of S.cerevisiae induces mRNA degradation via the
           major deadenylation-dependent mRNA decay pathway; VTS1
           recruits CCR4/POP2/NOT deadenylase complex to target
           mRNA. The recruitment is the initial step resulting in
           poly(A) tail removal transcripts. Potentially SAM domain
           may be responsible not only for RNA binding but also for
           deadenylase binding.
          Length = 69

 Score = 30.3 bits (69), Expect = 0.12
 Identities = 15/49 (30%), Positives = 21/49 (42%), Gaps = 8/49 (16%)

Query: 109 LKKYTPAFEAKDRQRMNKASKELIEK--------RRKLFKQFEELREKL 149
           L KYT   +    + + +   E +E         RRKL K FE +RE  
Sbjct: 18  LHKYTDNLKDLSWKELIELDDEDLEDKGVSALGARRKLLKAFEIVREYK 66


>gnl|CDD|221069 pfam11304, DUF3106, Protein of unknown function (DUF3106).  Some
           members in this family of proteins are annotated as
           transmembrane proteins however this cannot be confirmed.
           Currently no function is known.
          Length = 107

 Score = 30.8 bits (70), Expect = 0.17
 Identities = 21/72 (29%), Positives = 38/72 (52%), Gaps = 11/72 (15%)

Query: 104 KIKKSLKKYTPAFEAKDRQRMNKASKELIEKR---RKLFKQFEEL----REKLRETWEAE 156
           +I +   K +P  + + ++RM + +K   E+R   R  +++F +L    +E LR  WEA 
Sbjct: 33  RIAERYPKMSPEEQKRLQERMTRWAKLSPEQREQARLNYQRFRQLPPEQKEALRARWEAY 92

Query: 157 K----EERKYLR 164
           +    EER+ L 
Sbjct: 93  QALSPEERQALA 104


>gnl|CDD|131526 TIGR02473, flagell_FliJ, flagellar export protein FliJ.  Members of
           this family are the FliJ protein found, in nearly every
           case, in the midst of other flagellar biosynthesis genes
           in bacgterial genomes. Typically the fliJ gene is found
           adjacent to the gene for the flagellum-specific ATPase
           FliI. Sequence scoring in the gray zone between trusted
           and noise cutoffs include both probable FliJ proteins
           and components of bacterial type III secretion systems.
          Length = 141

 Score = 30.7 bits (70), Expect = 0.21
 Identities = 16/45 (35%), Positives = 28/45 (62%), Gaps = 3/45 (6%)

Query: 121 RQRMNKASKELIEKRRKLFKQFEELREKLRETWEAE--KEERKYL 163
           +Q +    + L+E RR+L K  E+L+EK ++ + AE  K E+K +
Sbjct: 88  QQEVEAKRERLLEARREL-KALEKLKEKKQKEYRAEEAKREQKEM 131


>gnl|CDD|223771 COG0699, COG0699, Predicted GTPases (dynamin-related) [General
           function prediction only].
          Length = 546

 Score = 31.0 bits (70), Expect = 0.38
 Identities = 13/65 (20%), Positives = 25/65 (38%), Gaps = 3/65 (4%)

Query: 100 EQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWEAEKEE 159
           E +  + + ++           + +    K L +K+R      E+L ++L  T E   E 
Sbjct: 470 ESLLILAQKIRDS---VLKAIFELLKNKRKRLAQKQRLKRLYLEQLEDELLRTAEEILEL 526

Query: 160 RKYLR 164
           R  L 
Sbjct: 527 RLLLE 531


>gnl|CDD|221326 pfam11935, DUF3453, Domain of unknown function (DUF3453).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 239 to 261 amino acids in length.
          Length = 233

 Score = 30.3 bits (69), Expect = 0.42
 Identities = 17/72 (23%), Positives = 34/72 (47%), Gaps = 9/72 (12%)

Query: 87  LRWRPRPASLLSKEQVDKIKKSLK-------KYTPAFEAKDRQRMNKASKELIEKRRKLF 139
           L + P     LSK QV  ++KSLK       K+  +     + R+ +A ++L + R ++ 
Sbjct: 159 LNFNPNLPPTLSKLQVRSVEKSLKNFLLHLLKHQLSPPF--QGRIQQALEDLGQSRAEIA 216

Query: 140 KQFEELREKLRE 151
            +  + +  L +
Sbjct: 217 DEAPKEKRILPK 228


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 30.8 bits (70), Expect = 0.42
 Identities = 12/61 (19%), Positives = 32/61 (52%), Gaps = 1/61 (1%)

Query: 97  LSKEQVDKIKKSLK-KYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWEA 155
           L  E+ ++++++LK K        + + + +   +     ++L  +FE  +E+LR+ +E 
Sbjct: 182 LKAEEEEELERALKEKREELLSKLEEELLARLESKEAALEKQLRLEFEREKEELRKKYEE 241

Query: 156 E 156
           +
Sbjct: 242 K 242


>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
           autophagy-related subunit 14.  The Atg14 or Apg14
           proteins are hydrophilic proteins with a predicted
           molecular mass of 40.5 kDa, and have a coiled-coil motif
           at the N terminus region. Yeast cells with mutant Atg14
           are defective not only in autophagy but also in sorting
           of carboxypeptidase Y (CPY), a vacuolar-soluble
           hydrolase, to the vacuole. Subcellular fractionation
           indicate that Apg14p and Apg6p are peripherally
           associated with a membrane structure(s). Apg14p was
           co-immunoprecipitated with Apg6p, suggesting that they
           form a stable protein complex. These results imply that
           Apg6/Vps30p has two distinct functions: in the
           autophagic process and in the vacuolar protein sorting
           pathway. Apg14p may be a component specifically required
           for the function of Apg6/Vps30p through the autophagic
           pathway. There are 17 auto-phagosomal component proteins
           which are categorized into six functional units, one of
           which is the AS-PI3K complex (Vps30/Atg6 and Atg14). The
           AS-PI3K complex and the Atg2-Atg18 complex are essential
           for nucleation, and the specific function of the AS-PI3K
           apparently is to produce phosphatidylinositol
           3-phosphate (PtdIns(3)P) at the pre-autophagosomal
           structure (PAS). The localisation of this complex at the
           PAS is controlled by Atg14. Autophagy mediates the
           cellular response to nutrient deprivation, protein
           aggregation, and pathogen invasion in humans, and
           malfunction of autophagy has been implicated in multiple
           human diseases including cancer. This effect seems to be
           mediated through direct interaction of the human Atg14
           with Beclin 1 in the human phosphatidylinositol 3-kinase
           class III complex.
          Length = 307

 Score = 30.4 bits (69), Expect = 0.44
 Identities = 13/56 (23%), Positives = 21/56 (37%), Gaps = 1/56 (1%)

Query: 86  QLRWRPRPASLLSKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQ 141
            L  R    S  S +   +    L+K     +   R ++N     L EKR  L ++
Sbjct: 100 ALAQRRSDLSSASYQLEKRRASQLEKLQDEIKRT-RSKLNALHSLLAEKRSFLCRE 154


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
           approximately 300 residues, found in plants and
           vertebrates. They contain a highly conserved DDRGK
           motif.
          Length = 189

 Score = 30.1 bits (68), Expect = 0.47
 Identities = 18/74 (24%), Positives = 39/74 (52%), Gaps = 2/74 (2%)

Query: 121 RQRMNKASKELIEKRRKLFKQFEELREKLRETWEAEKEERKYLRNLVDTDEL--DSENVE 178
           R++  +A +E  E+R+KL ++ E  R++  E  E  +++++        ++   + E  E
Sbjct: 17  RRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEEYE 76

Query: 179 EEVVEFVIKEEITI 192
           +    FV++EE T 
Sbjct: 77  KLKSSFVVEEEGTD 90


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 30.3 bits (69), Expect = 0.59
 Identities = 25/95 (26%), Positives = 46/95 (48%), Gaps = 7/95 (7%)

Query: 92  RPASLLSKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKE---LIEKRRKLFKQFEELREK 148
           + A  L KE + + K+ + K     E + ++R N+  +    L+++   L ++ E L +K
Sbjct: 43  KEAETLKKEALLEAKEEVHKLRAELERELKERRNELQRLERRLLQREETLDRKMESLDKK 102

Query: 149 LRETWEAEKEERKYLRNLVDTDELDSENVEEEVVE 183
                + EKE     +NL   DE + E +EE + E
Sbjct: 103 EENLEKKEKELSNKEKNL---DEKEEE-LEELIAE 133


>gnl|CDD|232986 TIGR00464, gltX_bact, glutamyl-tRNA synthetase, bacterial family.
           The glutamyl-tRNA synthetases of the eukaryotic cytosol
           and of the Archaea are more similar to glutaminyl-tRNA
           synthetases than to bacterial glutamyl-tRNA synthetases.
           This model models just the bacterial and mitochondrial
           forms of the enzyme. In many species, the charging of
           tRNA(gln) proceeds first through misacylation with Glu
           and then transamidation. For this reason, glutamyl-tRNA
           synthetases may act on both tRNA(gln) and tRNA(glu).
           This model is highly specific. Proteins with positive
           scores below the trusted cutoff may be fragments rather
           than full-length sequences [Protein synthesis, tRNA
           aminoacylation].
          Length = 470

 Score = 30.4 bits (69), Expect = 0.63
 Identities = 20/78 (25%), Positives = 40/78 (51%), Gaps = 5/78 (6%)

Query: 113 TPAFEAKDRQRMNKASKELIEKRRKLFKQF---EELREKLRETWEAEKEERKYLRNLVDT 169
            P ++++      K +KEL+E+    ++ +   E L E+LRE  +A KE  +Y R+  + 
Sbjct: 70  GPYYQSQRLDIYKKYAKELLEEG-LAYRCYCSKERL-ERLREEQKANKETPRYDRHCRNL 127

Query: 170 DELDSENVEEEVVEFVIK 187
              + EN   + +  V++
Sbjct: 128 HNEEIENALAKGIPPVVR 145


>gnl|CDD|188234 TIGR02591, cas_Csh1, CRISPR-associated protein Cas8b/Csh1, subtype
           I-B/HMARI.  This domain is found in the C-terminal 2/3
           of a family of CRISPR associated proteins of the Hmari
           subtype. Except for the two sequences from halophilic
           archaea this domain contains a pair of CXXC motifs
           [Mobile and extrachromosomal element functions, Other].
          Length = 393

 Score = 30.1 bits (68), Expect = 0.67
 Identities = 17/79 (21%), Positives = 32/79 (40%), Gaps = 6/79 (7%)

Query: 86  QLRWRPRPASLLSKEQVDKI-KKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEE 144
           Q   R +   L++K +  KI KK+++        K             E   K +    E
Sbjct: 313 QKSKRNKSKPLVNKLKGLKITKKTIENLLNEVREK-----LTQYGRQSEGGSKRYTSTSE 367

Query: 145 LREKLRETWEAEKEERKYL 163
           L ++L E+W+  ++E  + 
Sbjct: 368 LLDRLIESWKLSEDENVFY 386


>gnl|CDD|224691 COG1777, COG1777, Predicted transcriptional regulators
           [Transcription].
          Length = 217

 Score = 29.7 bits (67), Expect = 0.68
 Identities = 23/112 (20%), Positives = 42/112 (37%)

Query: 81  SPTFCQLRWRPRPASLLSKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFK 140
           SP F            L  E+ +  K        +        +N+  +EL   + +L K
Sbjct: 87  SPNFFGAERFDLEEDDLESERSEVSKLFKSPEGISELISRLLEINREIEELSRAQTELQK 146

Query: 141 QFEELREKLRETWEAEKEERKYLRNLVDTDELDSENVEEEVVEFVIKEEITI 192
           Q  EL ++++E  E +  +      L    +  + +VEE     V+K E  +
Sbjct: 147 QLNELMDRIKEEIEDKDGDMTERIVLEYLLKNGAADVEETSRRTVLKIEEVL 198


>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
          Length = 449

 Score = 30.1 bits (68), Expect = 0.78
 Identities = 18/72 (25%), Positives = 28/72 (38%)

Query: 120 DRQRMNKASKELIEKRRKLFKQFEELREKLRETWEAEKEERKYLRNLVDTDELDSENVEE 179
           D++  +    ELI +R +      E  E+L        EE   LR L    E + E+ +E
Sbjct: 377 DQEAEDAKVAELISQREEEEALQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDEDE 436

Query: 180 EVVEFVIKEEIT 191
                  +EE  
Sbjct: 437 LEEAQPEEEEEA 448


>gnl|CDD|216996 pfam02364, Glucan_synthase, 1,3-beta-glucan synthase component.
           This family consists of various 1,3-beta-glucan synthase
           components including Gls1, Gls2 and Gls3 from yeast.
           1,3-beta-glucan synthase EC:2.4.1.34 also known as
           callose synthase catalyzes the formation of a
           beta-1,3-glucan polymer that is a major component of the
           fungal cell wall. The reaction catalyzed is:-
           UDP-glucose + {(1,3)-beta-D-glucosyl}(N) <=> UDP +
           {(1,3)-beta-D-glucosyl}(N+1).
          Length = 818

 Score = 30.1 bits (68), Expect = 0.84
 Identities = 22/68 (32%), Positives = 28/68 (41%), Gaps = 3/68 (4%)

Query: 122 QRMNKASKELIEKRRKLFKQFEELREK--LRETWEAEKEERKYLRNLVDTDELDSENVEE 179
           QR  K  KE  E    L + + +L+      E  EAE  E  Y   L+D      EN   
Sbjct: 199 QRYAKFKKEEAENAEFLLRAYPDLQIAYLDEEPPEAEGGEPDYYSALIDGHCELDEN-GR 257

Query: 180 EVVEFVIK 187
            V +F IK
Sbjct: 258 RVPKFRIK 265


>gnl|CDD|223137 COG0059, IlvC, Ketol-acid reductoisomerase [Amino acid transport
           and metabolism / Coenzyme metabolism].
          Length = 338

 Score = 29.9 bits (68), Expect = 0.86
 Identities = 17/71 (23%), Positives = 27/71 (38%), Gaps = 16/71 (22%)

Query: 116 FEAKDRQRMNKA---------SKELIEKRRKLFKQFEELREKLRET--WEAEKEERK--- 161
            +A+ ++ M K          +KE I + +    + E LRE+  E    +  KE RK   
Sbjct: 268 IDAETKEEMKKVLKDIQSGEFAKEWILENQAGRPKLEALREETAEHEIEKVGKELRKMMP 327

Query: 162 --YLRNLVDTD 170
                  VD  
Sbjct: 328 WEKKDKGVDMA 338


>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
           Provisional.
          Length = 1123

 Score = 29.9 bits (68), Expect = 0.90
 Identities = 18/65 (27%), Positives = 35/65 (53%), Gaps = 2/65 (3%)

Query: 99  KEQVDKIKKSLKKYTPAFE--AKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWEAE 156
           K+Q++   +   +     E   ++   +   + EL EK+++L  Q E+L+EK  ET +  
Sbjct: 155 KQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELEAQLEQLQEKAAETSQER 214

Query: 157 KEERK 161
           K++RK
Sbjct: 215 KQKRK 219


>gnl|CDD|221313 pfam11917, DUF3435, Protein of unknown function (DUF3435).  This
           family of proteins are functionally uncharacterized.
           This protein is found in eukaryotes. Proteins in this
           family are typically between 435 to 791 amino acids in
           length. This family is related to pfam00589 suggesting
           it may be an integrase enzyme.
          Length = 418

 Score = 29.6 bits (67), Expect = 0.96
 Identities = 16/64 (25%), Positives = 34/64 (53%), Gaps = 7/64 (10%)

Query: 122 QRMNKASKELIEKRRKLFKQFEELREKLRETWEAEKE----ERKYLRNLVDTDELDSENV 177
           +R+ K  +E+  +R++L +   EL++K+RE ++ E+     ER+     V  +  +    
Sbjct: 321 ERLEKRRREVRNERQRLRR---ELKKKIREEFDEEQPVIDIERQLSGLAVSEEVKEDLPD 377

Query: 178 EEEV 181
           E+ V
Sbjct: 378 EQLV 381


>gnl|CDD|223957 COG1026, COG1026, Predicted Zn-dependent peptidases,
           insulinase-like [General function prediction only].
          Length = 978

 Score = 30.0 bits (68), Expect = 0.99
 Identities = 11/71 (15%), Positives = 33/71 (46%), Gaps = 5/71 (7%)

Query: 122 QRMNKASKELIEKRRKLFKQ--FEELREKLRETWEA-EKEERKYLRNLVDTDELDSENVE 178
           +++ K  +EL++KR         E++ +  ++  E  ++ + +     + T  L   +V 
Sbjct: 463 EKLEKEERELLQKRSSELTDEDLEKIIKDSKKLKERQDQPDSEEDLATLPT--LKLGDVP 520

Query: 179 EEVVEFVIKEE 189
           + + +  ++ E
Sbjct: 521 DPIEKTSLETE 531


>gnl|CDD|222290 pfam13654, AAA_32, AAA domain.  This family includes a wide variety
           of AAA domains including some that have lost essential
           nucleotide binding residues in the P-loop.
          Length = 509

 Score = 29.7 bits (68), Expect = 1.0
 Identities = 16/59 (27%), Positives = 31/59 (52%), Gaps = 13/59 (22%)

Query: 102 VDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWEAEKEER 160
           V+++KK +     AFE          S+E   ++ ++ ++F+E RE+  E  E E +E+
Sbjct: 97  VEELKKEIPA---AFE----------SEEYEARKEEIEEEFQEKREEAFEELEEEAKEK 142


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 29.7 bits (66), Expect = 1.1
 Identities = 25/103 (24%), Positives = 44/103 (42%), Gaps = 6/103 (5%)

Query: 92   RPASLLSKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRE 151
            + A    K + D+ K +      A EAK  + + K   E  +K  +L K  EE + K  E
Sbjct: 1675 KKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEE 1734

Query: 152  TWEAEKEERKYLRNLVDTDELDSENVEEEVVEFVIKEEITICE 194
              +  +E++K        +E   +  E++ +  + KEE    E
Sbjct: 1735 AKKEAEEDKK------KAEEAKKDEEEKKKIAHLKKEEEKKAE 1771



 Score = 29.7 bits (66), Expect = 1.3
 Identities = 22/68 (32%), Positives = 35/68 (51%)

Query: 94   ASLLSKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETW 153
            A  L K + +K K    K   A E K  + + KA +E   K  +  K+ EE ++K  E  
Sbjct: 1622 AEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAK 1681

Query: 154  EAEKEERK 161
            +AE++E+K
Sbjct: 1682 KAEEDEKK 1689



 Score = 28.2 bits (62), Expect = 3.8
 Identities = 16/64 (25%), Positives = 35/64 (54%)

Query: 98   SKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWEAEK 157
            +K++ D  KK  ++   A EAK +   +K   + ++K     K+ +E ++K  E  +A++
Sbjct: 1376 AKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADE 1435

Query: 158  EERK 161
             ++K
Sbjct: 1436 AKKK 1439


>gnl|CDD|181215 PRK08059, PRK08059, general stress protein 13; Validated.
          Length = 123

 Score = 28.5 bits (64), Expect = 1.1
 Identities = 15/60 (25%), Positives = 23/60 (38%), Gaps = 7/60 (11%)

Query: 100 EQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWEAEKEE 159
           E+  KI  S++    A EAK ++          +        F  LR+KL E  E   + 
Sbjct: 67  EEKGKISLSIRATEEAPEAKRKKGKILIPNPSEQG-------FNTLRDKLEEWIEQSNKT 119


>gnl|CDD|233224 TIGR00993, 3a0901s04IAP86, chloroplast protein import component
           Toc86/159, G and M domains.  The long precursor of the
           86K protein originally described is proposed to have
           three domains. The N-terminal A-domain is acidic,
           repetitive, weakly conserved, readily removed by
           proteolysis during chloroplast isolation, and not
           required for protein translocation. The other domains
           are designated G (GTPase) and M (membrane anchor); this
           family includes most of the G domain and all of M
           [Transport and binding proteins, Amino acids, peptides
           and amines].
          Length = 763

 Score = 29.5 bits (66), Expect = 1.1
 Identities = 21/73 (28%), Positives = 37/73 (50%), Gaps = 10/73 (13%)

Query: 114 PAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKL--RETWEAE----KEERKYLRNLV 167
           P F+   + +M K SKE    +RK + +  + R KL  ++ W  E    K  +K+ + + 
Sbjct: 411 PPFKPLTKAQMAKLSKE----QRKAYLEEYDYRVKLLQKKQWREELKRMKMMKKFGKEIG 466

Query: 168 DTDELDSENVEEE 180
           +  +  SE V+EE
Sbjct: 467 ELPDGYSEEVDEE 479


>gnl|CDD|234904 PRK01117, PRK01117, adenylosuccinate synthetase; Provisional.
          Length = 430

 Score = 29.2 bits (67), Expect = 1.4
 Identities = 12/42 (28%), Positives = 18/42 (42%), Gaps = 3/42 (7%)

Query: 143 EELREKLRETWEAEKEERKYLRNLVDTDELDSENVEEEVVEF 184
           E   EKL E  E  K     L      + +D E + +E +E+
Sbjct: 155 ETFAEKLEENLE-YKNF--LLTKYYGAEAVDFEEILDEYLEY 193


>gnl|CDD|234173 TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB.  Members
           of this protein family are the bacterial ATP-dependent
           chaperone ClpB. This protein belongs to the AAA family,
           ATPases associated with various cellular activities
           (pfam00004). This molecular chaperone does not act as a
           protease, but rather serves to disaggregate misfolded
           and aggregated proteins [Protein fate, Protein folding
           and stabilization].
          Length = 852

 Score = 29.2 bits (66), Expect = 1.5
 Identities = 18/57 (31%), Positives = 29/57 (50%), Gaps = 2/57 (3%)

Query: 127 ASKELIEKRRKLFKQFEELREKLRETWEAEKEERKYLRNLVDTDELDSENVEEEVVE 183
           ASKE +E   K   + EE    L E W+AEK   + ++ +   +E++   +E E  E
Sbjct: 431 ASKERLEDLEKELAELEEEYADLEEQWKAEKAAIQGIQQI--KEEIEQVRLELEQAE 485


>gnl|CDD|183017 PRK11177, PRK11177, phosphoenolpyruvate-protein phosphotransferase;
           Provisional.
          Length = 575

 Score = 29.2 bits (66), Expect = 1.5
 Identities = 25/98 (25%), Positives = 44/98 (44%), Gaps = 17/98 (17%)

Query: 96  LLSKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWEA 155
           LL KE  D+I  + KK +     ++ +R        +  R K   Q E ++ K  ET+  
Sbjct: 17  LLLKE--DEIVINRKKISADQVDQEVER-------FLSGRAKASAQLEAIKTKAGETFGE 67

Query: 156 EKEE--RKYLRNLVDTDELDSENVEEEVVEFVIKEEIT 191
           EKE     ++  L D      E +E+E++  +  + +T
Sbjct: 68  EKEAIFEGHIMLLED------EELEQEIIALIKDKHMT 99


>gnl|CDD|236394 PRK09169, PRK09169, hypothetical protein; Validated.
          Length = 2316

 Score = 29.3 bits (66), Expect = 1.6
 Identities = 24/91 (26%), Positives = 43/91 (47%), Gaps = 18/91 (19%)

Query: 107  KSLKKYTPAFEAKDRQRMNKASKELIEKR------------RKLFKQFEELREKLRE--- 151
            K  K  TPAF+ + R +  +A++ L  +R             +   Q + LRE + E   
Sbjct: 1571 KPPKTATPAFDEQGRYQAGRAAEILSAQRGQELVGRASTLQFRFLAQPDALREAMAERAM 1630

Query: 152  --TWEA-EKEERKYLRNLVDTDELDSENVEE 179
              T+E  E+E R+ LR L++ +    + ++E
Sbjct: 1631 FGTYEGTERELRRGLRRLLEGEAEAPQALQE 1661


>gnl|CDD|224062 COG1139, COG1139, Uncharacterized conserved protein containing a
           ferredoxin-like domain [Energy production and
           conversion].
          Length = 459

 Score = 28.8 bits (65), Expect = 1.6
 Identities = 19/62 (30%), Positives = 26/62 (41%), Gaps = 9/62 (14%)

Query: 121 RQRMNKASKELIEKRRKL---FKQFEELREKLRETWEAEKEERKYLRNLVDTDELDSENV 177
           R+ + KA   L   R K+      +EELR+  RE       +   L NL +  E   ENV
Sbjct: 21  REALRKAQDTLRANREKVLDELPDWEELRDLAREI------KLHVLENLDEYLEQLEENV 74

Query: 178 EE 179
             
Sbjct: 75  TR 76


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 29.0 bits (66), Expect = 1.7
 Identities = 28/104 (26%), Positives = 49/104 (47%), Gaps = 12/104 (11%)

Query: 98  SKEQVDKIKKSLKKYTPAFEAKDR--QRMNKASKELIEKRRKLFKQFEELREK---LRET 152
           +K++ + IKK         EAK+   +  N+  KEL E+R +L K  + L +K   L   
Sbjct: 47  AKKEAEAIKKEAL-----LEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRK 101

Query: 153 WEAEKEERKYLRNLVDTDELDSENVE--EEVVEFVIKEEITICE 194
            E  ++  + L       E   + +E  EE +E +I+E++   E
Sbjct: 102 LELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELE 145


>gnl|CDD|179730 PRK04051, rps4p, 30S ribosomal protein S4P; Validated.
          Length = 177

 Score = 28.3 bits (64), Expect = 1.8
 Identities = 19/57 (33%), Positives = 28/57 (49%), Gaps = 2/57 (3%)

Query: 129 KELIEKRRKLFKQFEELREKLRETWEAEKEERKYLRNLVDTDELDSENVEEEVVEFV 185
           KE I + R+L  ++  LR K RE W+A+   RKY R       L  E   +E  + +
Sbjct: 20  KERIAEERELLGKYG-LRNK-RELWKAQSILRKYRRQARSLLALPPEERAKEEEQLL 74


>gnl|CDD|215355 PLN02658, PLN02658, homogentisate 1,2-dioxygenase.
          Length = 435

 Score = 28.9 bits (65), Expect = 1.9
 Identities = 10/26 (38%), Positives = 13/26 (50%), Gaps = 5/26 (19%)

Query: 73 GKIIKRFNSPTFC-----QLRWRPRP 93
           K++  F+    C     QLRWRP P
Sbjct: 74 EKLVGEFDPSNSCETTPTQLRWRPFP 99


>gnl|CDD|99866 cd06113, citrate_synt_like_1_2, Citrate synthase (CS) catalyzes the
           condensation of acetyl coenzyme A (AcCoA) and
           oxalacetate (OAA) to form citrate and coenzyme A (CoA),
           the first step in the oxidative citric acid cycle (TCA
           or Krebs cycle). Peroxisomal CS is involved in the
           glyoxylate cycle. This group also includes CS proteins
           which functions as a 2-methylcitrate synthase (2MCS).
           2MCS catalyzes the condensation of propionyl-CoA (PrCoA)
           and OAA to form 2-methylcitrate and CoA during
           propionate metabolism. This group contains proteins
           which functions exclusively as either a CS or a 2MCS, as
           well as those with relaxed specificity which have dual
           functions as both a CS and a 2MCS. The overall CS
           reaction is thought to proceed through three partial
           reactions and involves both closed and open
           conformational forms of the enzyme: a) a carbanion or
           equivalent is generated from AcCoA by base abstraction
           of a proton, b) nucleophilic attack of this carbanion on
           OAA to generate citryl-CoA, and c) hydrolysis of
           citryl-CoA to produce citrate and CoA. CSs are found in
           two structural types: type I (homodimeric) and type II
           CSs (homohexameric). Type II CSs are unique to
           gram-negative bacteria. Type I CSs are found in eukarya,
           gram-positive bacteria, archaea, and in some
           gram-negative bacteria. Type I CS is active as a
           homodimer, both subunits participating in the active
           site. Type II CS is a hexamer of identical subunits
           (approximated as a trimer of dimers). Some type II CSs
           are strongly and specifically inhibited by NADH through
           an allosteric mechanism. This subgroup includes both
           gram-positive and gram-negative bacteria.
          Length = 406

 Score = 28.8 bits (65), Expect = 2.1
 Identities = 11/37 (29%), Positives = 23/37 (62%), Gaps = 1/37 (2%)

Query: 137 KLFKQFEELREKLRETWEAEKEERKYLRNLVDTDELD 173
           K+ +  E+++E +++ W  E E R YLR +++ +  D
Sbjct: 250 KVMEMLEDIKENVKD-WTDEDEVRAYLRKILNKEAFD 285


>gnl|CDD|235246 PRK04184, PRK04184, DNA topoisomerase VI subunit B; Validated.
          Length = 535

 Score = 28.7 bits (65), Expect = 2.1
 Identities = 14/70 (20%), Positives = 25/70 (35%), Gaps = 10/70 (14%)

Query: 126 KASKELIEKRRKLFKQFEELREKLRETWEAEKE----------ERKYLRNLVDTDELDSE 175
           +  +E  +K +   K   E+  KL E  E E+            +K     V+ +E +  
Sbjct: 463 RKEEEAKKKAKTFEKYIPEIARKLAEILEKEEPDIEEVLAKLIGKKLEEEEVEEEEEEEA 522

Query: 176 NVEEEVVEFV 185
            V E    + 
Sbjct: 523 VVVESAKNYT 532


>gnl|CDD|227000 COG4653, COG4653, Predicted phage phi-C31 gp36 major capsid-like
           protein [General function prediction only].
          Length = 422

 Score = 28.6 bits (64), Expect = 2.3
 Identities = 8/68 (11%), Positives = 16/68 (23%)

Query: 100 EQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWEAEKEE 159
           +++D I  +L K T                   E      ++         +   A    
Sbjct: 57  DKLDAIPGNLDKDTAFLALIACVAARLKLGGAAETADPACREGPPAFNSYIQRGLAATGA 116

Query: 160 RKYLRNLV 167
              L+   
Sbjct: 117 LLELKAFS 124



 Score = 27.9 bits (62), Expect = 4.0
 Identities = 12/37 (32%), Positives = 20/37 (54%), Gaps = 3/37 (8%)

Query: 143 EELREKLRETWE-AEKEERKYLRNL--VDTDELDSEN 176
             +R + R   E AE+E R+ L ++  V  D+LD+  
Sbjct: 27  TAIRAEQRRRGEEAEEENRRLLADIERVGGDKLDAIP 63


>gnl|CDD|145900 pfam02994, Transposase_22, L1 transposable element. 
          Length = 370

 Score = 28.5 bits (63), Expect = 2.3
 Identities = 20/78 (25%), Positives = 35/78 (44%), Gaps = 8/78 (10%)

Query: 99  KEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWEAEKE 158
           +E ++   K ++     F  +  + + K  +  IE   K  K+  EL++K RE     KE
Sbjct: 61  REDIETKGKEVQN----FLKELEECITKQVEAHIENTEKCLKELMELKKKAREL----KE 112

Query: 159 ERKYLRNLVDTDELDSEN 176
           E + ++       LD EN
Sbjct: 113 ECRSIKKEHMETTLDIEN 130


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 28.5 bits (64), Expect = 2.4
 Identities = 17/81 (20%), Positives = 40/81 (49%), Gaps = 7/81 (8%)

Query: 99  KEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWEAEKE 158
             +++++++ +++     E K R ++ +   EL E+   L  + EE+ ++  ET +  K+
Sbjct: 335 LAEIEELEREIEE-----ERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKD 389

Query: 159 ERKYLRNLVDTDELDSENVEE 179
            R+ L  L    E++    E 
Sbjct: 390 YREKLEKL--KREINELKREL 408



 Score = 27.3 bits (61), Expect = 6.7
 Identities = 21/88 (23%), Positives = 40/88 (45%), Gaps = 5/88 (5%)

Query: 99  KEQVDKIKKSLKKYTPAFEAKDRQ--RMNKASKELIEKRRKLFKQFEELREKLRET---W 153
           + ++ +I+  L + +       R+   + K  ++L ++  KL ++ EEL E L       
Sbjct: 694 QSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEI 753

Query: 154 EAEKEERKYLRNLVDTDELDSENVEEEV 181
           E  K E K L   ++  E D   +EE +
Sbjct: 754 ENVKSELKELEARIEELEEDLHKLEEAL 781


>gnl|CDD|184430 PRK13975, PRK13975, thymidylate kinase; Provisional.
          Length = 196

 Score = 28.2 bits (63), Expect = 2.5
 Identities = 20/71 (28%), Positives = 33/71 (46%), Gaps = 4/71 (5%)

Query: 122 QRMNKASKELIEKRRKLFKQFEELREKLRETWEAEKEERKYLRNLVDTDELDSENVEEEV 181
           +RM    KE+ EK+  L    ++++EK  E    EK   KY   ++DT     E V  E+
Sbjct: 129 KRMETRDKEIFEKKEFL----KKVQEKYLELANNEKFMPKYGFIVIDTTNKSIEEVFNEI 184

Query: 182 VEFVIKEEITI 192
           +  +  +   I
Sbjct: 185 LNKIKDKIPDI 195


>gnl|CDD|222259 pfam13608, Potyvirid-P3, Protein P3 of Potyviral polyprotein.  This
           is the P3 protein section of the Potyviridae
           polyproteins. The function is not known except that the
           protein is essential to viral survival.
          Length = 445

 Score = 28.3 bits (64), Expect = 2.6
 Identities = 15/56 (26%), Positives = 24/56 (42%), Gaps = 4/56 (7%)

Query: 124 MNKASKELIEK--RRKLFKQFEEL--REKLRETWEAEKEERKYLRNLVDTDELDSE 175
           +N    EL+EK     L + + EL   EKL   W + K  ++  + L      D +
Sbjct: 135 LNDELYELMEKIYLDLLEESWRELSLLEKLSAIWRSRKFRKRSEKCLKPKAAEDLK 190


>gnl|CDD|227899 COG5612, COG5612, Predicted integral membrane protein [Function
           unknown].
          Length = 148

 Score = 27.6 bits (61), Expect = 2.6
 Identities = 16/70 (22%), Positives = 31/70 (44%), Gaps = 5/70 (7%)

Query: 109 LKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQ-FEELREKLRETWEAEKEERKYLRNLV 167
           ++  TPA           A + L  + R+ F+Q     R+K RE  +  ++ R+    L+
Sbjct: 29  VRLSTPA----PAMTTPAAGEALPPENRRGFRQALRAARQKNREITQRARQARREAAALL 84

Query: 168 DTDELDSENV 177
            +  LD+  +
Sbjct: 85  ASPGLDTAAL 94


>gnl|CDD|206685 cd01898, Obg, Obg GTPase.  The Obg nucleotide binding protein
           subfamily has been implicated in stress response,
           chromosome partitioning, replication initiation,
           mycelium development, and sporulation. Obg proteins are
           among a large group of GTP binding proteins conserved
           from bacteria to humans. The E. coli homolog, ObgE is
           believed to function in ribosomal biogenesis. Members of
           the subfamily contain two equally and highly conserved
           domains, a C-terminal GTP binding domain and an
           N-terminal glycine-rich domain.
          Length = 170

 Score = 27.8 bits (63), Expect = 2.7
 Identities = 11/54 (20%), Positives = 23/54 (42%), Gaps = 5/54 (9%)

Query: 100 EQVDKIKKSLKKYTPAFEAKDRQRM---NKASKELIEKRRKLFKQFEELREKLR 150
           E  + I+  L+ Y P     ++ R+   NK      E+R +  K+  +  +  +
Sbjct: 96  EDYETIRNELEAYNPGLA--EKPRIVVLNKIDLLDAEERFEKLKELLKELKGKK 147


>gnl|CDD|235855 PRK06707, PRK06707, amidase; Provisional.
          Length = 536

 Score = 28.3 bits (63), Expect = 2.9
 Identities = 20/60 (33%), Positives = 31/60 (51%), Gaps = 2/60 (3%)

Query: 107 KSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWE-AEKEERKYLRN 165
           KSL++   AF  KD +R  K  + LIE   K     +E  + ++ + E A+KE  +YL  
Sbjct: 407 KSLEEII-AFNKKDSKRRIKYGQTLIEASEKSAITKDEFEKVVQTSQENAKKELDRYLVE 465


>gnl|CDD|232831 TIGR00115, tig, trigger factor.  Trigger factor is a
           ribosome-associated molecular chaperone and is the first
           chaperone to interact with nascent polypeptide. Trigger
           factor can bind at the same time as the signal
           recognition particle (SRP), but is excluded by the SRP
           receptor (FtsY). The central domain of trigger factor
           has peptidyl-prolyl cis/trans isomerase activity. This
           protein is found in a single copy in virtually every
           bacterial genome [Protein fate, Protein folding and
           stabilization].
          Length = 408

 Score = 28.3 bits (64), Expect = 3.0
 Identities = 24/73 (32%), Positives = 37/73 (50%), Gaps = 11/73 (15%)

Query: 129 KELIEKRRKLFKQF-------EELREKLRETWEAEKEER--KYLRNLVDTDELDSENVEE 179
           KEL E   +  K         EEL+  +R+  E EK+ER    L+  +  D+L  EN E 
Sbjct: 232 KELPELDDEFAKSLGEEFETLEELKADIRKNLEREKKERAKNKLKEQL-LDKLV-ENNEF 289

Query: 180 EVVEFVIKEEITI 192
           E+ E ++++EI  
Sbjct: 290 ELPESLVEQEIDR 302


>gnl|CDD|236798 PRK10929, PRK10929, putative mechanosensitive channel protein;
           Provisional.
          Length = 1109

 Score = 28.5 bits (64), Expect = 3.0
 Identities = 13/50 (26%), Positives = 27/50 (54%)

Query: 126 KASKELIEKRRKLFKQFEELREKLRETWEAEKEERKYLRNLVDTDELDSE 175
           K S E  ++ +++   F +L  +LR+    E++E + +   + TD L+ E
Sbjct: 61  KGSLERAKQYQQVIDNFPKLSAELRQQLNNERDEPRSVPPNMSTDALEQE 110


>gnl|CDD|221049 pfam11262, Tho2, Transcription factor/nuclear export subunit
           protein 2.  THO and TREX form a eukaryotic complex which
           functions in messenger ribonucleoprotein metabolism and
           plays a role in preventing the transcription-associated
           genetic instability. Tho2, along with four other
           subunits forms THO.
          Length = 296

 Score = 28.1 bits (63), Expect = 3.0
 Identities = 19/95 (20%), Positives = 36/95 (37%), Gaps = 15/95 (15%)

Query: 98  SKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWEAEK 157
            KE  DK  + L+K     ++             I+K++K  K+ + L +KL E  +   
Sbjct: 25  PKELYDKEIERLEKQIKELDSS---------SSGIDKKKKEKKRLKSLIKKLEEELKKHI 75

Query: 158 EE----RKYLRNLVDT--DELDSENVEEEVVEFVI 186
           E     +K L     +     D + +    ++  I
Sbjct: 76  EHNEKTKKRLSEEKSSWFPSKDPDEINSAFLQLCI 110


>gnl|CDD|234252 TIGR03545, TIGR03545, TIGR03545 family protein.  This model
           represents a relatively rare but broadly distributed
           uncharacterized protein family, distributed in 1-2
           percent of bacterial genomes, all of which have outer
           membranes. In many of these genomes, it is part of a
           two-gene pair.
          Length = 555

 Score = 28.1 bits (63), Expect = 3.1
 Identities = 24/105 (22%), Positives = 45/105 (42%), Gaps = 13/105 (12%)

Query: 86  QLRWRPRPASLLSKEQVDKIKKSLKKY------TPAFEAKDRQRMNK------ASKELIE 133
           Q +W+ R   L +K+ +++ KK L+         P    K ++  +K      A K+ I+
Sbjct: 177 QQKWKKRKKDLPNKQDLEEYKKRLEAIKKKDIKNPLELQKIKEEFDKLKKEGKADKQKIK 236

Query: 134 K-RRKLFKQFEELREKLRETWEAEKEERKYLRNLVDTDELDSENV 177
             +  L    ++L+  L E  +A + + K L N       D +N 
Sbjct: 237 SAKNDLQNDKKQLKADLAELKKAPQNDLKRLENKYAIKSGDLKNF 281



 Score = 26.6 bits (59), Expect = 8.9
 Identities = 23/88 (26%), Positives = 42/88 (47%), Gaps = 13/88 (14%)

Query: 95  SLLSKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRK------------LFKQF 142
            L + E  ++I+KSLK     ++ + +   NK   E  +KR +            L K  
Sbjct: 159 DLKTVETAEEIEKSLKAMQQKWKKRKKDLPNKQDLEEYKKRLEAIKKKDIKNPLELQKIK 218

Query: 143 EELREKLRETWEAEKEERKYLRNLVDTD 170
           EE  +KL++  +A+K++ K  +N +  D
Sbjct: 219 EEF-DKLKKEGKADKQKIKSAKNDLQND 245


>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3.  This family
           consists of several radial spoke protein 3 (RSP3)
           sequences. Eukaryotic cilia and flagella present in
           diverse types of cells perform motile, sensory, and
           developmental functions in organisms from protists to
           humans. They are centred by precisely organised,
           microtubule-based structures, the axonemes. The axoneme
           consists of two central singlet microtubules, called the
           central pair, and nine outer doublet microtubules. These
           structures are well-conserved during evolution. The
           outer doublet microtubules, each composed of A and B
           sub-fibres, are connected to each other by nexin links,
           while the central pair is held at the centre of the
           axoneme by radial spokes. The radial spokes are T-shaped
           structures extending from the A-tubule of each outer
           doublet microtubule to the centre of the axoneme. Radial
           spoke protein 3 (RSP3), is present at the proximal end
           of the spoke stalk and helps in anchoring the radial
           spoke to the outer doublet. It is thought that radial
           spokes regulate the activity of inner arm dynein through
           protein phosphorylation and dephosphorylation.
          Length = 288

 Score = 28.1 bits (63), Expect = 3.1
 Identities = 16/54 (29%), Positives = 30/54 (55%), Gaps = 3/54 (5%)

Query: 117 EAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWE---AEKEERKYLRNLV 167
           E  + QR+ +A +   E++ +  KQ +E +++ +ET E   A    + YL +LV
Sbjct: 171 ELAETQRLEEAERRRREEKERRKKQDKERKQREKETAEKIAARAFAQGYLSDLV 224


>gnl|CDD|219526 pfam07701, HNOBA, Heme NO binding associated.  The HNOBA domain is
           found associated with the HNOB domain and pfam00211 in
           soluble cyclases and signalling proteins. The HNOB
           domain is predicted to function as a heme-dependent
           sensor for gaseous ligands, and transduce diverse
           downstream signals, in both bacteria and animals.
          Length = 209

 Score = 27.6 bits (62), Expect = 3.2
 Identities = 19/52 (36%), Positives = 30/52 (57%), Gaps = 12/52 (23%)

Query: 127 ASKELI---EKRR---KLFKQFEELREKLRETWEAEKEERKYLRNLVDTDEL 172
           A+++LI   E+ R   KL K+ E+L++KL E     +EE+K       TD+L
Sbjct: 151 ATRDLILLGEQARAQLKLKKRLEKLKDKLEEAHRELEEEKK------KTDDL 196


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 28.2 bits (63), Expect = 3.2
 Identities = 16/74 (21%), Positives = 37/74 (50%), Gaps = 4/74 (5%)

Query: 120 DRQRMNKASKELIEKRRKLFKQFEELREKLRETWEAEKE-ERKYLRNLVDTDELDSENVE 178
            ++ +N   ++L    ++L ++ E L + L+E  + +KE E++             E +E
Sbjct: 509 FKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLE-LE 567

Query: 179 EEVVEFV--IKEEI 190
           +E  E +  +K+E+
Sbjct: 568 KEAQEALKALKKEV 581


>gnl|CDD|237841 PRK14867, PRK14867, DNA topoisomerase VI subunit B; Provisional.
          Length = 659

 Score = 28.2 bits (63), Expect = 3.3
 Identities = 18/73 (24%), Positives = 35/73 (47%), Gaps = 2/73 (2%)

Query: 97  LSKEQVDKIKKSLKKYTPAF-EAKDRQRMNKAS-KELIEKRRKLFKQFEELREKLRETWE 154
           L +E++ +IK   KK    F E  ++  +N +   E  +K + + K  EE  ++L++   
Sbjct: 354 LDEEEIKEIKNHFKKKPENFMEWAEKNYLNASLLDEFRKKVKNITKSPEEFVKELKDKAM 413

Query: 155 AEKEERKYLRNLV 167
              EE +  R  +
Sbjct: 414 VSDEELEKYRKEI 426


>gnl|CDD|220764 pfam10454, DUF2458, Protein of unknown function (DUF2458).  This a
           is family of uncharacterized proteins.
          Length = 155

 Score = 27.5 bits (61), Expect = 3.3
 Identities = 14/48 (29%), Positives = 25/48 (52%)

Query: 129 KELIEKRRKLFKQFEELREKLRETWEAEKEERKYLRNLVDTDELDSEN 176
           + LI ++ +  +Q+ + RE L +  EA KE++K L       +L S  
Sbjct: 30  RRLIIEQHRHERQWWKEREALIQKQEARKEDKKKLLPNGTDSKLSSVG 77


>gnl|CDD|216925 pfam02194, PXA, PXA domain.  This domain is associated with PX
           domains pfam00787.
          Length = 182

 Score = 27.6 bits (62), Expect = 3.5
 Identities = 20/77 (25%), Positives = 35/77 (45%), Gaps = 4/77 (5%)

Query: 92  RPASLLSKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRE 151
             A LL+   +  + K L+ +  A E   R   + +++    +  +L    E    KL  
Sbjct: 56  DLADLLTDRLLPLLTKHLESFREA-EQAVRNLSSSSTES---EELELALALEPESAKLHP 111

Query: 152 TWEAEKEERKYLRNLVD 168
              +E+EE +YLR LV+
Sbjct: 112 ALSSEEEELEYLRKLVE 128


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 28.1 bits (63), Expect = 3.6
 Identities = 22/93 (23%), Positives = 47/93 (50%), Gaps = 9/93 (9%)

Query: 99  KEQVDKIKKSLKKYTPAFEAKDRQRMNKAS--KELIEKRRKLFKQFEELREKLRETWEAE 156
           +E++++++  L++     E +  +   K    KE +E+R  L ++ E+L  +L E   A+
Sbjct: 308 RERLEELENELEEL----EERLEELKEKIEALKEELEERETLLEELEQLLAELEE---AK 360

Query: 157 KEERKYLRNLVDTDELDSENVEEEVVEFVIKEE 189
           +E  + L  L++  E   E + EE+ E   +  
Sbjct: 361 EELEEKLSALLEELEELFEALREELAELEAELA 393


>gnl|CDD|165679 PLN00107, PLN00107, FAD-dependent oxidoreductase; Provisional.
          Length = 257

 Score = 27.6 bits (61), Expect = 3.7
 Identities = 13/36 (36%), Positives = 18/36 (50%)

Query: 70  SFQGKIIKRFNSPTFCQLRWRPRPASLLSKEQVDKI 105
           +F G I K   +  F +++ R  P  L S E  DKI
Sbjct: 167 AFDGAIAKYKKAGEFLKVKERLDPEGLFSSEWSDKI 202


>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 429

 Score = 27.9 bits (63), Expect = 4.0
 Identities = 25/141 (17%), Positives = 53/141 (37%), Gaps = 27/141 (19%)

Query: 80  NSPTFCQLRWRPRPASLLSKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKE--------- 130
            +P   + + + R    L  +++ ++ +  +K     E    +R N+ SKE         
Sbjct: 9   ENPDAVREKLKKRGGDALDVDKLLELDEERRKLLRELEELQAER-NELSKEIGRALKRGE 67

Query: 131 -----LIEKRRKLFKQFEELREKLRETWEAEKEERKYLRNLVDTD----ELDSENVE--- 178
                LI + ++L ++ +EL   L E           + N+        + + +NVE   
Sbjct: 68  DDAEELIAEVKELKEKLKELEAALDELEAELDTLLLTIPNIPHESVPVGKDEDDNVEVRR 127

Query: 179 -----EEVVEFVIKEEITICE 194
                  V +F  K+ + + E
Sbjct: 128 WGEPPVFVFDFEPKDHVELGE 148



 Score = 27.2 bits (61), Expect = 6.0
 Identities = 18/90 (20%), Positives = 40/90 (44%), Gaps = 8/90 (8%)

Query: 99  KEQVDKIKKSLKK---YTPAFEA-----KDRQRMNKASKELIEKRRKLFKQFEELREKLR 150
           +E  D +++ LKK        +      ++R+++ +  +EL  +R +L K+     ++  
Sbjct: 8   RENPDAVREKLKKRGGDALDVDKLLELDEERRKLLRELEELQAERNELSKEIGRALKRGE 67

Query: 151 ETWEAEKEERKYLRNLVDTDELDSENVEEE 180
           +  E    E K L+  +   E   + +E E
Sbjct: 68  DDAEELIAEVKELKEKLKELEAALDELEAE 97


>gnl|CDD|237047 PRK12298, obgE, GTPase CgtA; Reviewed.
          Length = 390

 Score = 27.9 bits (63), Expect = 4.1
 Identities = 11/51 (21%), Positives = 21/51 (41%)

Query: 130 ELIEKRRKLFKQFEELREKLRETWEAEKEERKYLRNLVDTDELDSENVEEE 180
             IE+  +   +  E  EK+   W+    E+       D D+ D +  E++
Sbjct: 330 TFIEENPREEAEEAEAPEKVEFMWDDYHREQLEEVEEEDDDDWDDDWDEDD 380


>gnl|CDD|227545 COG5220, TFB3, Cdk activating kinase (CAK)/RNA polymerase II
           transcription initiation/nucleotide excision repair
           factor TFIIH, subunit TFB3 [Cell division and chromosome
           partitioning / Transcription / DNA replication,
           recombination, and repair].
          Length = 314

 Score = 27.7 bits (61), Expect = 4.3
 Identities = 25/110 (22%), Positives = 44/110 (40%), Gaps = 22/110 (20%)

Query: 96  LLSKEQVDKIKKSLKKY-------------TPAFEAKDRQRMNKASKELIEKRRKLFKQF 142
           LL    V   ++ +KKY              P  E    ++ N   K +  K+R+L +Q 
Sbjct: 120 LLELIDVSLTEEKVKKYEEMNQDSILNNLERPTPEVMPGKQKNVLQKRMKLKKRQLERQI 179

Query: 143 EELREKLRETWEAE---------KEERKYLRNLVDTDELDSENVEEEVVE 183
           EE     +E  + E         ++  K ++  + +D+ D E +   VVE
Sbjct: 180 EEEERMNKEWTKKEIGNRLGTASEDGNKTIKIGIISDKFDPEELPRIVVE 229


>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
          Length = 425

 Score = 27.7 bits (63), Expect = 4.5
 Identities = 16/79 (20%), Positives = 29/79 (36%), Gaps = 17/79 (21%)

Query: 121 RQRMNKASKE-------------LIEKRRKLFKQFEELREKLRETWEAEKEERKYLRNLV 167
           +   N  SKE             LI + ++L ++ + L  +L E     +E    + NL 
Sbjct: 48  QAERNALSKEIGQAKRKGEDAEALIAEVKELKEEIKALEAELDELEAELEELLLRIPNLP 107

Query: 168 DTD----ELDSENVEEEVV 182
                  + + +NVE    
Sbjct: 108 HDSVPVGKDEDDNVEVRRW 126


>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family.  This
           family consists of the 116kDa V-type ATPase (vacuolar
           (H+)-ATPases) subunits, as well as V-type ATP synthase
           subunit i. The V-type ATPases family are proton pumps
           that acidify intracellular compartments in eukaryotic
           cells for example yeast central vacuoles,
           clathrin-coated and synaptic vesicles. They have
           important roles in membrane trafficking processes. The
           116kDa subunit (subunit a) in the V-type ATPase is part
           of the V0 functional domain responsible for proton
           transport. The a subunit is a transmembrane glycoprotein
           with multiple putative transmembrane helices it has a
           hydrophilic amino terminal and a hydrophobic carboxy
           terminal. It has roles in proton transport and assembly
           of the V-type ATPase complex. This subunit is encoded by
           two homologous gene in yeast VPH1 and STV1.
          Length = 707

 Score = 27.7 bits (62), Expect = 4.5
 Identities = 12/73 (16%), Positives = 29/73 (39%), Gaps = 9/73 (12%)

Query: 99  KEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWEAEKE 158
            ++   +++ +       EA+ ++ + +  + L ++  +L    EE    L E      E
Sbjct: 66  SKEFLDLEEEILDL----EAEIKE-VEENLESLEKEINEL----EEWLNVLDEEKSFLDE 116

Query: 159 ERKYLRNLVDTDE 171
             + L  L + D 
Sbjct: 117 NLEELSELSNLDI 129


>gnl|CDD|214710 smart00533, MUTSd, DNA-binding domain of DNA mismatch repair MUTS
           family. 
          Length = 308

 Score = 27.3 bits (61), Expect = 4.7
 Identities = 23/96 (23%), Positives = 39/96 (40%), Gaps = 16/96 (16%)

Query: 98  SKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWEAEK 157
           S E + +I++ L+                  K ++E    L +  E L E L +    E 
Sbjct: 87  SLEGLKEIRQLLESLDGPLLGLLL-------KVILE---PLLELLELLLELLNDDDPLEV 136

Query: 158 EERKYLRNLVDTDELDS-----ENVEEEVVEFVIKE 188
            +   +++  D  ELD      E +EEE+ E + KE
Sbjct: 137 NDGGLIKDGFD-PELDELREKLEELEEELEELLKKE 171


>gnl|CDD|213577 TIGR01018, rpsD_arch, ribosomal protein S4(archaeal
           type)/S9(eukaryote cytosolic type).  This model finds
           eukaryotic ribosomal protein S9 as well as archaeal
           ribosomal protein S4 [Protein synthesis, Ribosomal
           proteins: synthesis and modification].
          Length = 162

 Score = 27.0 bits (60), Expect = 4.8
 Identities = 19/52 (36%), Positives = 27/52 (51%), Gaps = 2/52 (3%)

Query: 129 KELIEKRRKLFKQFEELREKLRETWEAEKEERKYLRNLVDTDELDSENVEEE 180
           KE +++  KL  ++  LR K +E W+AE   RKY R   +   LD E    E
Sbjct: 20  KERLDRELKLVGKYG-LRNK-KEVWKAETTLRKYRRAARELLGLDDEQRLFE 69


>gnl|CDD|227488 COG5159, RPN6, 26S proteasome regulatory complex component
           [Posttranslational modification, protein turnover,
           chaperones].
          Length = 421

 Score = 27.6 bits (61), Expect = 4.8
 Identities = 18/70 (25%), Positives = 32/70 (45%), Gaps = 1/70 (1%)

Query: 96  LLSKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWE- 154
            +SK     +  ++     A E   + ++ K  + LIEK        E+  + L    E 
Sbjct: 55  YVSKGDYCSLGDTITSSREAMEDFTKPKITKIIRTLIEKFPYSSDSLEDQIKVLTALIEW 114

Query: 155 AEKEERKYLR 164
           A++E+RK+LR
Sbjct: 115 ADREKRKFLR 124


>gnl|CDD|226650 COG4187, RocB, Arginine degradation protein (predicted deacylase)
           [Amino acid transport and metabolism].
          Length = 553

 Score = 27.4 bits (61), Expect = 4.8
 Identities = 21/102 (20%), Positives = 39/102 (38%), Gaps = 13/102 (12%)

Query: 95  SLLSKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFK---------QFEEL 145
           S  +KE  D++K+  +  T A EA +  R     +E  +   +             F+EL
Sbjct: 308 SRTAKELFDRLKEEAE--TAAEEAIETLR--DRYEEYGKLVNRPAGPLPAKPRVLTFQEL 363

Query: 146 REKLRETWEAEKEERKYLRNLVDTDELDSENVEEEVVEFVIK 187
            E+ R     + E  +         ELD      ++ E+++ 
Sbjct: 364 LERARVRGHIDAEYAEKEYEFAQNGELDLRLRSTQLTEWLVG 405


>gnl|CDD|216112 pfam00773, RNB, RNB domain.  This domain is the catalytic domain of
           ribonuclease II.
          Length = 322

 Score = 27.3 bits (61), Expect = 4.9
 Identities = 19/78 (24%), Positives = 33/78 (42%), Gaps = 4/78 (5%)

Query: 92  RPASLLSKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRK----LFKQFEELRE 147
           R  + L+ E+VD++ +       A  A D + + + +K L +KR K      +   E + 
Sbjct: 117 RSKARLTYEEVDELLEGKPSEREAPVADDLELLYELAKALRKKRLKRGALELEDLPEAKI 176

Query: 148 KLRETWEAEKEERKYLRN 165
            L E  E     R   R+
Sbjct: 177 DLDEDGEPVDIIRIEDRS 194


>gnl|CDD|226133 COG3605, PtsP, Signal transduction protein containing GAF and PtsI
           domains [Signal transduction mechanisms].
          Length = 756

 Score = 27.8 bits (62), Expect = 4.9
 Identities = 13/56 (23%), Positives = 25/56 (44%), Gaps = 5/56 (8%)

Query: 126 KASKELIEKRRKLFKQFEELREKLRETWEAEKEERKYLRN---LVDTDELDSENVE 178
           +A +EL+ +  +L +   +LR     T + E+ +   L N    VD  +L     +
Sbjct: 411 RAYQELVREEAELSRLARDLRNLPAITKDGERIK--LLMNAGLSVDLPQLLESGAD 464


>gnl|CDD|219289 pfam07083, DUF1351, Protein of unknown function (DUF1351).  This
           family consists of several bacterial and phage proteins
           of around 230 residues in length. The function of this
           family is unknown.
          Length = 220

 Score = 27.3 bits (61), Expect = 5.0
 Identities = 27/122 (22%), Positives = 50/122 (40%), Gaps = 32/122 (26%)

Query: 99  KEQVDKIKKSLKKYTPAFEA-----KDRQRMNKASKELIEKRRKLFKQF----------- 142
           ++ +D++    K Y    E      K+R  +NK  K L +KR+++ KQ+           
Sbjct: 21  EKAIDEVVAKYKGYVVTEENYKDDKKERAELNKLIKALNDKRKEIKKQYSKPADEFEAYM 80

Query: 143 -----------EELREKLRETWEAEKEERK-----YLRNLVDTDELDSENVEEEVVEFVI 186
                      + +   ++E  E +KE R+     YL    +   LD  + EE+  ++V 
Sbjct: 81  KKLIAPIKAVIDLIDSGIKEFEEKQKELRRDTVKEYLSEKAEEYGLDKRDFEEKYDDYVK 140

Query: 187 KE 188
           K 
Sbjct: 141 KS 142


>gnl|CDD|197366 cd09916, CpxP_like, CpxP component of the bacterial
           Cpx-two-component system and related proteins.  This
           family summarizes bacterial proteins related to CpxP, a
           periplasmic protein that forms part of a two-component
           system which acts as a global modulator of cell-envelope
           stress in gram-negative bacteria. CpxP aids in combating
           extracytoplasmic protein-mediated toxicity, and may also
           be involved in the response to alkaline pH. Functioning
           as a dimer, it inhibits activation of the kinase CpxA,
           but also plays a vital role in the quality control
           system of P pili. It has been suggested that CpxP
           directly interacts with CpxA via its concave polar
           surface. Another member of this family, Spy, is also a
           periplasmic protein that may be involved in the response
           to stress. The homology between CpxP and Spy suggests
           similar functions. A characteristic 5-residue sequence
           motif LTXXQ is found repeated twice in many members of
           this family.
          Length = 96

 Score = 26.0 bits (58), Expect = 5.1
 Identities = 10/47 (21%), Positives = 21/47 (44%)

Query: 133 EKRRKLFKQFEELREKLRETWEAEKEERKYLRNLVDTDELDSENVEE 179
           E++ ++    +  R +++   E  +  R+ LR L+  D  D   V  
Sbjct: 7   EQKAQIKAIRQAARAQMKALREQMRAAREELRALLTADTFDEAAVRA 53


>gnl|CDD|235623 PRK05849, PRK05849, hypothetical protein; Provisional.
          Length = 783

 Score = 27.6 bits (62), Expect = 5.1
 Identities = 16/81 (19%), Positives = 28/81 (34%), Gaps = 11/81 (13%)

Query: 81  SPTFCQLRWRPRPASLLSKEQVDKIKKSLKKYT-------PAFEAKDRQRMNK---ASKE 130
             T  +      P    S+ ++ +I+ SL K T            KD +++ K     KE
Sbjct: 367 DFTLDKKLEELLPG-GFSENEIKRIEFSLLKLTNRIISPKSGLFRKDLEKIEKLEERYKE 425

Query: 131 LIEKRRKLFKQFEELREKLRE 151
           L+        +   L E  + 
Sbjct: 426 LLNSNLDPLDKIYWLLEDCKR 446


>gnl|CDD|239570 cd03488, Topoisomer_IB_N_htopoI_like, Topoisomer_IB_N_htopoI_like :
           N-terminal DNA binding fragment found in eukaryotic DNA
           topoisomerase (topo) IB proteins similar to the
           monomeric yeast and human topo I.  Topo I enzymes are
           divided into:  topo type IA (bacterial) and type IB
           (eukaryotic). Topo I relaxes superhelical tension in
           duplex DNA by creating a single-strand nick, the broken
           strand can then rotate around the unbroken strand to
           remove DNA supercoils and, the nick is religated,
           liberating topo I. These enzymes regulate the
           topological changes that accompany DNA replication,
           transcription and other nuclear processes.  Human topo I
           is the target of a diverse set of anticancer drugs
           including camptothecins (CPTs). CPTs bind to the topo
           I-DNA complex and inhibit religation of the
           single-strand nick, resulting in the accumulation of
           topo I-DNA adducts.  This family may represent more than
           one structural domain.
          Length = 215

 Score = 27.3 bits (61), Expect = 5.2
 Identities = 17/67 (25%), Positives = 27/67 (40%), Gaps = 9/67 (13%)

Query: 106 KKSLKKYTPA----FEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWEAEKEERK 161
           KK + K        F   D  +M    K   E+++ + K+     EK     E EK E +
Sbjct: 68  KKVMTKEEKVIIKDFSKCDFTQMFAYFKAQKEEKKAMSKE-----EKKAIKAEKEKLEEE 122

Query: 162 YLRNLVD 168
           Y   ++D
Sbjct: 123 YGFCILD 129


>gnl|CDD|224477 COG1561, COG1561, Uncharacterized stress-induced protein [Function
           unknown].
          Length = 290

 Score = 27.2 bits (61), Expect = 5.7
 Identities = 22/86 (25%), Positives = 35/86 (40%), Gaps = 18/86 (20%)

Query: 124 MNKASKELIEKRRK-------LFKQ----FEELREKLRETWEAEKEER--KYLRNLVD-T 169
           + +A  +LIE R +          Q     EEL EK+        E    + +  L +  
Sbjct: 137 LEEALDDLIEMREREGAALKADLLQRLDAIEELVEKVESLMPEILEWYRERLVARLNEAQ 196

Query: 170 DELDSENVEEEVVEFVIK----EEIT 191
           D+LD + +E+EV     K    EE+ 
Sbjct: 197 DQLDEDRLEQEVALLAQKADIAEELD 222


>gnl|CDD|204467 pfam10376, Mei5, Double-strand recombination repair protein.  Mei5
           is one of a pair of meiosis-specific proteins which
           facilitate the loading of Dmc1 on to Rad51 on DNA at
           double-strand breaks during recombination. Recombination
           is carried out by a large protein complex based around
           the two RecA homologues, Rad51 and Dmc1. This complex
           may play both a catalytic and a structural role in the
           interaction between homologous chromosomes during
           meiosis. Mei5 is seen to contain a coiled-coli region.
          Length = 212

 Score = 27.1 bits (60), Expect = 5.7
 Identities = 14/33 (42%), Positives = 18/33 (54%)

Query: 129 KELIEKRRKLFKQFEELREKLRETWEAEKEERK 161
           KEL  ++RKL  Q +E  + LR     EK E K
Sbjct: 125 KELEAEKRKLEYQVDEESDDLRRLKLVEKYEIK 157


>gnl|CDD|235582 PRK05729, valS, valyl-tRNA synthetase; Reviewed.
          Length = 874

 Score = 27.4 bits (62), Expect = 5.8
 Identities = 17/60 (28%), Positives = 30/60 (50%), Gaps = 13/60 (21%)

Query: 97  LSKEQVDKIKKSLKKYT-----PAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRE 151
           L KE   K++K +++         F       + KA +E++EK R+   ++EE   KL+E
Sbjct: 816 LEKEL-AKLEKEIERVEKKLSNEGF-------VAKAPEEVVEKEREKLAEYEEKLAKLKE 867


>gnl|CDD|222389 pfam13801, Metal_resist, Heavy-metal resistance.  This is a
           metal-binding protein which is involved in resistance to
           heavy-metal ions. The protein forms a four-helix hooked
           hairpin, consisting of two long alpha helices each
           flanked by a shorter alpha helix. It binds a metal ion
           in a type-2 like centre. It contains two copies of an
           LTXXQ motif.
          Length = 125

 Score = 26.4 bits (59), Expect = 5.8
 Identities = 13/72 (18%), Positives = 26/72 (36%)

Query: 115 AFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWEAEKEERKYLRNLVDTDELDS 174
           A   + R+ +  A +    + R L +     R +L     AE  +   LR  +       
Sbjct: 41  ALPPEQRRALRAALRAHRREVRPLRRALRAARRELLALLRAEPFDPAALRAALARQREAR 100

Query: 175 ENVEEEVVEFVI 186
             ++  V E ++
Sbjct: 101 AALQARVQEALL 112


>gnl|CDD|223962 COG1031, COG1031, Uncharacterized Fe-S oxidoreductase [Energy
           production and conversion].
          Length = 560

 Score = 27.3 bits (61), Expect = 6.1
 Identities = 11/30 (36%), Positives = 16/30 (53%), Gaps = 3/30 (10%)

Query: 123 RMNKASKELIEKRRKLFKQFEELREKLRET 152
            M +  K+  EK +KLF  F   + K+RE 
Sbjct: 410 PMWERGKKKAEKHKKLFAAF---KRKVREE 436


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 27.5 bits (62), Expect = 6.2
 Identities = 20/89 (22%), Positives = 42/89 (47%), Gaps = 10/89 (11%)

Query: 99  KEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWEAEKE 158
           K++ D+I K L+      + +     +  + ELIE R++L K  E+  +K ++  E ++E
Sbjct: 583 KKEADEIIKELR------QLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEE 636

Query: 159 ----ERKYLRNLVDTDELDSENVEEEVVE 183
               +     +L    E+ S   ++E + 
Sbjct: 637 LKVGDEVKYLSLGQKGEVLSIPDDKEAIV 665


>gnl|CDD|217372 pfam03105, SPX, SPX domain.  We have named this region the SPX
           domain after (SYG1, Pho81 and XPR1). This 180 residue
           length domain is found at the amino terminus of a
           variety of proteins. In the yeast protein SYG1, the
           N-terminus directly binds to the G- protein beta subunit
           and inhibits transduction of the mating pheromone
           signal. This finding suggests that all the members of
           this family are involved in G-protein associated signal
           transduction. The N-termini of several proteins involved
           in the regulation of phosphate transport, including the
           putative phosphate level sensors PHO81 from
           Saccharomyces cerevisiae and NUC-2 from Neurospora
           crassa, are also members of this family. The SPX domain
           of S. cerevisiae low-affinity phosphate transporters
           Pho87 and Pho90 auto-regulates uptake and prevents
           efflux. This SPX dependent inhibition is mediated by the
           physical interaction with Spl2 NUC-2 contains several
           ankyrin repeats pfam00023. Several members of this
           family are annotated as XPR1 proteins: the xenotropic
           and polytropic retrovirus receptor confers
           susceptibility to infection with murine leukaemia
           viruses (MLV). The similarity between SYG1, phosphate
           regulators and XPR1 sequences has been previously noted,
           as has the additional similarity to several predicted
           proteins, of unknown function, from Drosophila
           melanogaster, Arabidopsis thaliana, Caenorhabditis
           elegans, Schizosaccharomyces pombe, and Saccharomyces
           cerevisiae. In addition, given the similarities between
           XPR1 and SYG1 and phosphate regulatory proteins, it has
           been proposed that XPR1 might be involved in G-protein
           associated signal transduction and may itself function
           as a phosphate sensor.
          Length = 176

 Score = 26.6 bits (59), Expect = 6.3
 Identities = 10/52 (19%), Positives = 27/52 (51%)

Query: 100 EQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRE 151
           E++++++K L++     +     + + + K L++   +L K   EL  +L+ 
Sbjct: 89  ERLEELEKQLEELEERRDETSSAKSSPSDKNLLKAFEELKKALLELYRELQL 140


>gnl|CDD|214682 smart00475, 53EXOc, 5'-3' exonuclease. 
          Length = 259

 Score = 26.8 bits (60), Expect = 6.3
 Identities = 15/50 (30%), Positives = 28/50 (56%), Gaps = 5/50 (10%)

Query: 126 KASKELIEKRR---KLFKQFEELREKLRETWEAEKEERKYLRNLV--DTD 170
           K + +L+++      + +  ++L++KLRE   A KE+ K  R L   +TD
Sbjct: 197 KTAAKLLKEFGSLENILENLDKLKKKLREKLLAHKEDAKLSRKLATIETD 246


>gnl|CDD|233842 TIGR02389, RNA_pol_rpoA2, DNA-directed RNA polymerase, subunit A''.
            This family consists of the archaeal A'' subunit of the
           DNA-directed RNA polymerase. The example from
           Methanocaldococcus jannaschii contains an intein
           [Transcription, DNA-dependent RNA polymerase].
          Length = 367

 Score = 26.9 bits (60), Expect = 6.4
 Identities = 14/42 (33%), Positives = 22/42 (52%), Gaps = 6/42 (14%)

Query: 136 RKLFKQFEELREKLRETWEAEKEE------RKYLRNLVDTDE 171
            KL K+ EE  +K   + + E +E       +YLR+L+D  E
Sbjct: 4   EKLLKELEETVKKREISDKEELDEIIKRVEEEYLRSLIDPGE 45


>gnl|CDD|225281 COG2425, COG2425, Uncharacterized protein containing a von
           Willebrand factor type A (vWA) domain [General function
           prediction only].
          Length = 437

 Score = 27.0 bits (60), Expect = 6.6
 Identities = 16/78 (20%), Positives = 33/78 (42%), Gaps = 9/78 (11%)

Query: 86  QLRWRPRPASLLSKEQVDKIKKSLK---KYTPAFEAKDRQRMNKASKELIEKRRKLFKQF 142
            L   P    LLS +   ++K++ +   K++ +      +R  +  ++L  +        
Sbjct: 71  FLVQLPFILKLLSSDVYRRVKEATRLNEKHSLSAPESFLERWEELLQDLQREGS------ 124

Query: 143 EELREKLRETWEAEKEER 160
           E+  E  RE   +EK+E 
Sbjct: 125 EDFLEGEREGLLSEKQEE 142


>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
           binding fragment.  Topoisomerase I promotes the
           relaxation of DNA superhelical tension by introducing a
           transient single-stranded break in duplex DNA and are
           vital for the processes of replication, transcription,
           and recombination. This family may be more than one
           structural domain.
          Length = 215

 Score = 26.8 bits (60), Expect = 6.6
 Identities = 19/67 (28%), Positives = 30/67 (44%), Gaps = 16/67 (23%)

Query: 104 KIKKSLKK--YTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWEAEKEERK 161
            I K L+K  +TP +E  + ++         EK++ + K+     EK     E +K E  
Sbjct: 78  AIIKDLEKCDFTPIYEYFEAEK---------EKKKAMSKE-----EKKAIKEEKDKLEEP 123

Query: 162 YLRNLVD 168
           Y   LVD
Sbjct: 124 YGYCLVD 130


>gnl|CDD|221794 pfam12825, DUF3818, Domain of unknown function in PX-proteins
           (DUF3818).  This domain is found on proteins carrying a
           PX domain. Its function is unknown.
          Length = 340

 Score = 26.9 bits (60), Expect = 6.8
 Identities = 11/54 (20%), Positives = 27/54 (50%), Gaps = 4/54 (7%)

Query: 140 KQFEELREKLRETWEAEKEERKYLRNLVDTDELDSENVEEEVVEFVIKEEITIC 193
           K+FE+  ++L    +      + L+N V+      E++++ + E  +KE+  + 
Sbjct: 110 KEFEKEIKELEAKIDDYPVICQKLKNFVE----ADEDIKDAIREEAVKEDEDLL 159


>gnl|CDD|184696 PRK14474, PRK14474, F0F1 ATP synthase subunit B; Provisional.
          Length = 250

 Score = 26.7 bits (59), Expect = 6.9
 Identities = 12/52 (23%), Positives = 25/52 (48%), Gaps = 2/52 (3%)

Query: 117 EAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWEA--EKEERKYLRNL 166
           E +    M +A +   E+R+ L  +  E     R+ W    E+E++++ + L
Sbjct: 70  EQQRASFMAQAQEAADEQRQHLLNEAREDVATARDEWLEQLEREKQEFFKAL 121


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 26.8 bits (60), Expect = 7.4
 Identities = 18/73 (24%), Positives = 31/73 (42%), Gaps = 11/73 (15%)

Query: 97  LSKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEE-LREKL------ 149
           LS E + K+ K+ ++     E   +    +  +E  EK+ +  K+  E    KL      
Sbjct: 253 LSPEVLRKVDKTREE---EEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQR 309

Query: 150 -RETWEAEKEERK 161
             E  E +K+ RK
Sbjct: 310 KLEEKERKKQARK 322


>gnl|CDD|179699 PRK03992, PRK03992, proteasome-activating nucleotidase;
           Provisional.
          Length = 389

 Score = 26.7 bits (60), Expect = 7.5
 Identities = 16/48 (33%), Positives = 23/48 (47%), Gaps = 6/48 (12%)

Query: 129 KELIEKRRKLFKQFEELREKLRETWEAEKEERKYLRNLVDTDELDSEN 176
           + L E+  +L +Q  +L  KLR   + E E  K  R L   + L SE 
Sbjct: 4   EALEERNSELEEQIRQLELKLR---DLEAENEKLEREL---ERLKSEL 45


>gnl|CDD|234956 PRK01490, tig, trigger factor; Provisional.
          Length = 435

 Score = 27.0 bits (61), Expect = 7.7
 Identities = 13/72 (18%), Positives = 33/72 (45%), Gaps = 6/72 (8%)

Query: 121 RQRMNKASKELIEKRRKLFKQFEELREKLRETWEAEKEERKYLRNLVDTDEL-DSENVE- 178
            + + +A ++ ++   +  +      E+ RE +  + E R  ++  +  DE+  +E +E 
Sbjct: 311 DRLLRQALQQGLDLEGQFLEDTGTTEEEPREEFREQAERR--VKLGLLLDEIAKAEEIEV 368

Query: 179 --EEVVEFVIKE 188
             EEV   + + 
Sbjct: 369 SDEEVKAEIEEM 380



 Score = 26.7 bits (60), Expect = 8.4
 Identities = 18/50 (36%), Positives = 28/50 (56%), Gaps = 4/50 (8%)

Query: 143 EELREKLRETWEAEKEERKYLRNLVDTDELD--SENVEEEVVEFVIKEEI 190
           EEL+  +R+  E EK+E    R  V    LD   EN E ++ E ++++EI
Sbjct: 263 EELKADIRKNLEREKKEA--QRAKVKEAVLDALVENAEIDLPEALVEQEI 310


>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
           Rpc31.  RNA polymerase III contains seventeen subunits
           in yeasts and in human cells. Twelve of these are akin
           to RNA polymerase I or II and the other five are RNA pol
           III-specific, and form the functionally distinct groups
           (i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
           Rpc34 and Rpc82 form a cluster of enzyme-specific
           subunits that contribute to transcription initiation in
           S.cerevisiae and H.sapiens. There is evidence that these
           subunits are anchored at or near the N-terminal Zn-fold
           of Rpc1, itself prolonged by a highly conserved but RNA
           polymerase III-specific domain.
          Length = 221

 Score = 26.6 bits (59), Expect = 7.8
 Identities = 17/98 (17%), Positives = 40/98 (40%), Gaps = 7/98 (7%)

Query: 87  LRWRPRPASLLSKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELR 146
           ++  P    L  KE    +  + K       +K ++++   ++E  +   KL    ++L+
Sbjct: 102 IKEHPFDLELFPKELYSVMGINKKAGKKLALSKFKRKVGLFTEEEEDIDEKLSMLEKKLK 161

Query: 147 EKLRETWEAEKEERKYLRNLVDTDELDSENVEEEVVEF 184
           E   E  + E E+        + +E + E  +E+  + 
Sbjct: 162 ELEAEDVDEEDEK-------DEEEEEEEEEEDEDFDDD 192


>gnl|CDD|148363 pfam06717, DUF1202, Protein of unknown function (DUF1202).  This
           family consists of several hypothetical bacterial
           proteins of around 335 residues in length. Members of
           this family are found exclusively in Escherichia coli
           and Salmonella species and are often referred to as YggM
           proteins. The function of this family is unknown.
          Length = 308

 Score = 26.8 bits (59), Expect = 8.0
 Identities = 14/48 (29%), Positives = 24/48 (50%), Gaps = 7/48 (14%)

Query: 106 KKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETW 153
           K+S + Y    + K    +N       E+RR+ F + +EL  KL++ W
Sbjct: 231 KQSEECYEVPIQQKRDFDIN-------EQRRQTFLKSQELSRKLQDDW 271


>gnl|CDD|234829 PRK00750, lysK, lysyl-tRNA synthetase; Reviewed.
          Length = 510

 Score = 26.7 bits (60), Expect = 8.0
 Identities = 16/86 (18%), Positives = 32/86 (37%), Gaps = 5/86 (5%)

Query: 94  ASLLSKEQVDKIKKSLKKYTPAFEAKD----RQRMNKASKELIEKRRKLFKQFEELREKL 149
           A + + E  + +   LK+Y P    +      + +  A            K++    EK 
Sbjct: 372 AQIANAEDKEVLWGFLKRYAPGATPETHPRLDRLVEYAIN-WYRDFVAPEKKYRAPTEKE 430

Query: 150 RETWEAEKEERKYLRNLVDTDELDSE 175
           R   E   +  + L +  D +E+ +E
Sbjct: 431 RAALEDLDDALRELADGADAEEIQNE 456


>gnl|CDD|203043 pfam04546, Sigma70_ner, Sigma-70, non-essential region.  The domain
           is found in the primary vegetative sigma factor. The
           function of this domain is unclear and can be removed
           without loss of function.
          Length = 211

 Score = 26.4 bits (59), Expect = 8.2
 Identities = 17/80 (21%), Positives = 38/80 (47%), Gaps = 10/80 (12%)

Query: 97  LSKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLF-------KQFEELREKL 149
            ++E+  ++++  +K   A E   R   +K +++  E   + F       KQF+ L + L
Sbjct: 79  EARERFAELREQYEKTRKAIEKHGRG--SKKAQKAREALAEEFMQFRLVPKQFDRLVDNL 136

Query: 150 RETWEA-EKEERKYLRNLVD 168
           R   +    +ER+ ++  V+
Sbjct: 137 RGMVDRVRAQERQIMKLCVE 156


>gnl|CDD|220404 pfam09789, DUF2353, Uncharacterized coiled-coil protein (DUF2353). 
           Members of this family of uncharacterized proteins have
           no known function.
          Length = 319

 Score = 26.8 bits (59), Expect = 8.3
 Identities = 12/29 (41%), Positives = 19/29 (65%)

Query: 99  KEQVDKIKKSLKKYTPAFEAKDRQRMNKA 127
           +E+ + IK +L KY    EAK+++ M KA
Sbjct: 209 EEESELIKMTLAKYKQMLEAKNKKTMVKA 237


>gnl|CDD|221712 pfam12687, DUF3801, Protein of unknown function (DUF3801).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 158 and 187 amino
           acids in length. This family includes the PcfB protein.
          Length = 137

 Score = 26.1 bits (58), Expect = 8.4
 Identities = 16/58 (27%), Positives = 26/58 (44%), Gaps = 3/58 (5%)

Query: 105 IKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKLRETWEAEKEERKY 162
                 KY   F+AKD   +N A KE      K  K+ ++ +  +R+  +  KEE+  
Sbjct: 69  KNNGPPKYLVFFKAKDADVINAAFKEF---TAKKLKKEQDEKPSVRKKLKKFKEEQAA 123


>gnl|CDD|132364 TIGR03321, alt_F1F0_F0_B, alternate F1F0 ATPase, F0 subunit B.  A
           small number of taxonomically diverse prokaryotic
           species, including Methanosarcina barkeri, have what
           appears to be a second ATP synthase, in addition to the
           normal F1F0 ATPase in bacteria and A1A0 ATPase in
           archaea. These enzymes use ion gradients to synthesize
           ATP, CC and in principle may run in either direction.
           This model represents the F0 subunit B of this apparent
           second ATP synthase.
          Length = 246

 Score = 26.6 bits (59), Expect = 8.4
 Identities = 19/93 (20%), Positives = 40/93 (43%), Gaps = 21/93 (22%)

Query: 99  KEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLF----KQFEELREKLRETWE 154
           K + ++ ++  ++     + +    + KA +E   +R++L     ++ +E+REK +E   
Sbjct: 52  KREAEQERREYEEKNEELDQQREVLLTKAKEEAQAERQRLLDEAREEADEIREKWQEALR 111

Query: 155 AEKEE-----------------RKYLRNLVDTD 170
            E+                   RK L +L DTD
Sbjct: 112 REQAALSDELRRRTGAEVFAIARKVLTDLADTD 144


>gnl|CDD|180485 PRK06245, cofG, FO synthase subunit 1; Reviewed.
          Length = 336

 Score = 26.8 bits (60), Expect = 8.5
 Identities = 11/34 (32%), Positives = 15/34 (44%)

Query: 79  FNSPTFCQLRWRPRPASLLSKEQVDKIKKSLKKY 112
            N   +C  R  P   SLLS E+V +I +     
Sbjct: 23  RNRCGYCTFRRDPGQPSLLSPEEVKEILRRGADA 56


>gnl|CDD|217417 pfam03194, LUC7, LUC7 N_terminus.  This family contains the N
           terminal region of several LUC7 protein homologues and
           only contains eukaryotic proteins. LUC7 has been shown
           to be a U1 snRNA associated protein with a role in
           splice site recognition. The family also contains human
           and mouse LUC7 like (LUC7L) proteins and human cisplatin
           resistance-associated overexpressed protein (CROP).
          Length = 252

 Score = 26.6 bits (59), Expect = 8.6
 Identities = 13/62 (20%), Positives = 29/62 (46%), Gaps = 7/62 (11%)

Query: 90  RPRPASLLSKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREKL 149
           + + A+    E++ ++ + + K     EA   +         +++  KL K+ EEL+ K 
Sbjct: 113 QTKIAADSKAEELAELDEEIGKLLAEAEALGEEGK-------VDEAMKLMKEVEELKAKK 165

Query: 150 RE 151
           +E
Sbjct: 166 KE 167


>gnl|CDD|151490 pfam11043, DUF2856, Protein of unknown function (DUF2856).  Some
           members in this viral family of proteins with unknown
           function are annotated as Abc2 however this cannot be
           confirmed.
          Length = 97

 Score = 25.7 bits (56), Expect = 8.7
 Identities = 17/59 (28%), Positives = 30/59 (50%), Gaps = 8/59 (13%)

Query: 114 PAFEAKDRQRMNKASK-ELIEKRRKLFKQFEEL-------REKLRETWEAEKEERKYLR 164
           P + A D +R +  S  E+++K RK +     L       R+  R+ W AEK+E++ + 
Sbjct: 4   PLYGADDPRRCSGNSVSEVLDKFRKNYDAIMSLPQETKAERKFRRDIWLAEKQEKERIN 62


>gnl|CDD|226967 COG4608, AppF, ABC-type oligopeptide transport system, ATPase
           component [Amino acid transport and metabolism].
          Length = 268

 Score = 26.4 bits (59), Expect = 8.7
 Identities = 11/43 (25%), Positives = 24/43 (55%), Gaps = 1/43 (2%)

Query: 70  SFQGKIIKRFNSPTFCQLRWRPRPASLLSK-EQVDKIKKSLKK 111
           S  G++I     PT  ++ +  +  + LSK E+ +++ + L+K
Sbjct: 53  STLGRLILGLEEPTSGEILFEGKDITKLSKEERRERVLELLEK 95


>gnl|CDD|235751 PRK06241, PRK06241, phosphoenolpyruvate synthase; Validated.
          Length = 871

 Score = 26.8 bits (60), Expect = 8.8
 Identities = 16/50 (32%), Positives = 24/50 (48%)

Query: 123 RMNKASKELIEKRRKLFKQFEELREKLRETWEAEKEERKYLRNLVDTDEL 172
           R NK   ELI KR++ ++ +E+L      T + E    KY R  +    L
Sbjct: 719 RTNKLDYELIAKRKEEYELYEKLTPPRVMTSDGEIITGKYKRENLPAGAL 768


>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain.  Axonemal
           dynein light chain proteins play a dynamic role in
           flagellar and cilia motility. Eukaryotic cilia and
           flagella are complex organelles consisting of a core
           structure, the axoneme, which is composed of nine
           microtubule doublets forming a cylinder that surrounds a
           pair of central singlet microtubules. This
           ultra-structural arrangement seems to be one of the most
           stable micro-tubular assemblies known and is responsible
           for the flagellar and ciliary movement of a large number
           of organisms ranging from protozoan to mammals. This
           light chain interacts directly with the N-terminal half
           of the heavy chains.
          Length = 189

 Score = 26.4 bits (59), Expect = 9.0
 Identities = 13/42 (30%), Positives = 24/42 (57%)

Query: 119 KDRQRMNKASKELIEKRRKLFKQFEELREKLRETWEAEKEER 160
           + +  + +  K+L E++ +L K+  EL  KL    + E+EER
Sbjct: 120 QGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEER 161


>gnl|CDD|225468 COG2916, Hns, DNA-binding protein H-NS [General function prediction
           only].
          Length = 128

 Score = 25.9 bits (57), Expect = 9.0
 Identities = 10/35 (28%), Positives = 18/35 (51%)

Query: 127 ASKELIEKRRKLFKQFEELREKLRETWEAEKEERK 161
              EL+E+  +  +Q  + R++      AE EER+
Sbjct: 15  TYLELLEEMLEKEEQVVQERQEEEAAAIAEIEERQ 49


>gnl|CDD|197875 smart00788, Adenylsucc_synt, Adenylosuccinate synthetase.
           Adenylosuccinate synthetase plays an important role in
           purine biosynthesis, by catalyzing the GTP-dependent
           conversion of IMP and aspartic acid to AMP.
           Adenylosuccinate synthetase has been characterized from
           various sources ranging from Escherichia coli (gene
           purA) to vertebrate tissues. In vertebrates, two
           isozymes are present - one involved in purine
           biosynthesis and the other in the purine nucleotide
           cycle. The crystal structure of adenylosuccinate
           synthetase from E. coli reveals that the dominant
           structural element of each monomer of the homodimer is a
           central beta-sheet of 10 strands. The first nine strands
           of the sheet are mutually parallel with right-handed
           crossover connections between the strands. The 10th
           strand is antiparallel with respect to the first nine
           strands. In addition, the enzyme has two antiparallel
           beta-sheets, comprised of two strands and three strands
           each, 11 alpha-helices and two short 3/10-helices.
           Further, it has been suggested that the similarities in
           the GTP-binding domains of the synthetase and the p21ras
           protein are an example of convergent evolution of two
           distinct families of GTP-binding proteins. Structures of
           adenylosuccinate synthetase from Triticum aestivum and
           Arabidopsis thaliana when compared with the known
           structures from E. coli reveals that the overall fold is
           very similar to that of the E. coli protein.
          Length = 417

 Score = 26.7 bits (60), Expect = 9.3
 Identities = 14/43 (32%), Positives = 23/43 (53%), Gaps = 3/43 (6%)

Query: 142 FEELREKLRETWEAEKEERKYLRNLVDTDELDSENVEEEVVEF 184
            E LREKL E  +    +   L+ L   + +D E + EE++E+
Sbjct: 148 EEVLREKLEELLD---YKNFLLKKLYGAEPVDVEEILEELLEY 187


>gnl|CDD|237215 PRK12815, carB, carbamoyl phosphate synthase large subunit;
           Reviewed.
          Length = 1068

 Score = 26.9 bits (60), Expect = 9.8
 Identities = 10/36 (27%), Positives = 17/36 (47%), Gaps = 3/36 (8%)

Query: 127 ASKELIEK--RRKLFKQF-EELREKLRETWEAEKEE 159
            + E I+K   R+ F+   +EL E + E+      E
Sbjct: 118 TNIEAIQKGEDRERFRALMKELGEPVPESEIVTSVE 153


>gnl|CDD|223610 COG0536, Obg, Predicted GTPase [General function prediction only].
          Length = 369

 Score = 26.4 bits (59), Expect = 9.9
 Identities = 20/87 (22%), Positives = 32/87 (36%), Gaps = 3/87 (3%)

Query: 100 EQVDKIKKSLKKYTPAFEAKDRQRM-NKASKELIEKRRKLFKQFEELREKLRETWEAEKE 158
           E    I+  L+KY+P    K R  + NK    L E+  +  K+           +     
Sbjct: 257 EDYQTIRNELEKYSPKLAEKPRIVVLNKIDLPLDEEELEELKKALAEALGWEVFYLISAL 316

Query: 159 ERKYLRNLVD--TDELDSENVEEEVVE 183
            R+ L  L+    + L+    E E  E
Sbjct: 317 TREGLDELLRALAELLEETKAEAEAAE 343


>gnl|CDD|181574 PRK08881, rpsN, 30S ribosomal protein S14; Reviewed.
          Length = 101

 Score = 25.5 bits (57), Expect = 10.0
 Identities = 10/38 (26%), Positives = 17/38 (44%), Gaps = 3/38 (7%)

Query: 126 KASKELIEKRRKLFKQFEELREKLRETW---EAEKEER 160
           K+     +KR+KL  ++   R +L+          EER
Sbjct: 4   KSMIAREKKRKKLVAKYAAKRAELKAIISDPNLSDEER 41


>gnl|CDD|216510 pfam01450, IlvC, Acetohydroxy acid isomeroreductase, catalytic
           domain.  Acetohydroxy acid isomeroreductase catalyzes
           the conversion of acetohydroxy acids into dihydroxy
           valerates. This reaction is the second in the synthetic
           pathway of the essential branched side chain amino acids
           valine and isoleucine.
          Length = 146

 Score = 26.1 bits (58), Expect = 10.0
 Identities = 16/76 (21%), Positives = 31/76 (40%), Gaps = 15/76 (19%)

Query: 89  WRPRPASLLSKEQVDKIKKSLKKYTPAFEAKDRQRMNKASKELIEKRRKLFKQFEELREK 148
           W PR  +  +K  + +I K ++  T A             KE + +      +  + R++
Sbjct: 79  WGPRIITEATKILMKEILKDIQNGTFA-------------KEWLLENSAGNPRLNKERKE 125

Query: 149 LRET--WEAEKEERKY 162
           +RE    +  K+ RK 
Sbjct: 126 IREHPIEKVGKKLRKL 141


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.316    0.131    0.389 

Gapped
Lambda     K      H
   0.267   0.0677    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 10,163,757
Number of extensions: 958213
Number of successful extensions: 3190
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2933
Number of HSP's successfully gapped: 517
Length of query: 194
Length of database: 10,937,602
Length adjustment: 92
Effective length of query: 102
Effective length of database: 6,857,034
Effective search space: 699417468
Effective search space used: 699417468
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 56 (25.5 bits)