RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy3983
         (318 letters)



>gnl|CDD|241348 cd13194, FERM_C_ERM, ERM family FERM domain C-lobe/F3.  The ERM
           family includes ezrin, radixin, moesin and merlin. They
           are composed of a N-terminal FERM (ERM) domain (also
           called N-ERMAD (N-terminal ERM association domain)), a
           coiled coil region (CRR), and a C-terminal domain CERMAD
           (C-terminal ERM association domain) which has an
           F-actin-binding site (ABD). Two actin-binding sites have
           been identified in the middle and N-terminal domains.
           Merlin is structurally similar to the ERM proteins, but
           instead of an actin-binding domain (ABD), it contains a
           C-terminal domain (CTD), just like the proteins from the
           4.1 family. Activated ezrin, radixin and moesin are
           thought to be involved in the linking of actin filaments
           to CD43, CD44, ICAM1-3 cell adhesion molecules, various
           membrane channels and receptors, such as the Na+/H+
           exchanger-3 (NHE3), cystic fibrosis transmembrane
           conductance regulator (CFTR), and the beta2-adrenergic
           receptor. The ERM proteins exist in two states, a
           dormant state in which the FERM domain binds to its own
           C-terminal tail and thereby precludes binding of some
           partner proteins, and an activated state, in which the
           FERM domain binds to one of many membrane binding
           proteins and the C-terminal tail binds to F-actin. The
           FERM domain has a cloverleaf tripart structure (FERM_N,
           FERM_M, FERM_C/N, alpha-, and C-lobe/A-lobe, B-lobe,
           C-lobe/F1, F2, F3). The C-lobe/F3 within the FERM domain
           of ERM is part of the PH domain family. The FERM domain
           is found in the cytoskeletal-associated proteins such as
           ezrin, moesin, radixin, 4.1R, and merlin. These proteins
           provide a link between the membrane and cytoskeleton and
           are involved in signal transduction pathways. The FERM
           domain is also found in protein tyrosine phosphatases
           (PTPs) , the tyrosine kinases FAK and JAK, in addition
           to other proteins involved in signaling. This domain is
           structurally similar to the PH and PTB domains and
           consequently is capable of binding to both peptides and
           phospholipids at different sites.
          Length = 97

 Score =  145 bits (367), Expect = 9e-44
 Identities = 56/67 (83%), Positives = 62/67 (92%)

Query: 143 PNCLLTPKIGFPWSEIRNISFNDRKFIIKPIDKKAPDFVFFAPRVRVNKRILALCMGNHE 202
               LTPKIGFPWSEIRNISFND+KF+IKPIDKKAPDFVF++PR+R+NKRIL LCMGNHE
Sbjct: 31  LENKLTPKIGFPWSEIRNISFNDKKFVIKPIDKKAPDFVFYSPRLRINKRILDLCMGNHE 90

Query: 203 LYMRRRK 209
           LYMRRRK
Sbjct: 91  LYMRRRK 97



 Score = 81.5 bits (202), Expect = 1e-19
 Identities = 32/40 (80%), Positives = 35/40 (87%)

Query: 51 MYGVNYFEIKNKKGTELWLGVDALGLNIYEKEDNLTNVMS 90
          MYGVNYFEIKNKKGT+LWLGVDALGLNIYE E+ LT  + 
Sbjct: 1  MYGVNYFEIKNKKGTDLWLGVDALGLNIYELENKLTPKIG 40


>gnl|CDD|220216 pfam09380, FERM_C, FERM C-terminal PH-like domain. 
          Length = 90

 Score = 86.2 bits (214), Expect = 3e-21
 Identities = 28/65 (43%), Positives = 34/65 (52%), Gaps = 3/65 (4%)

Query: 151 IGFPWSEIRNISFNDRKFIIKPID---KKAPDFVFFAPRVRVNKRILALCMGNHELYMRR 207
             FPWSEIR ISF  +KF+IK  D    K     F+ P  R  K +  LC+  H  +  R
Sbjct: 26  NTFPWSEIRKISFKRKKFLIKLRDEDESKETTLTFYTPSPRACKYLWKLCVEQHAFFRLR 85

Query: 208 RKPDT 212
           R PDT
Sbjct: 86  RPPDT 90



 Score = 41.9 bits (99), Expect = 2e-05
 Identities = 25/73 (34%), Positives = 40/73 (54%), Gaps = 5/73 (6%)

Query: 61  NKKGTELWLGVDALGLNIYEKEDNLTNVMSQDVKKENPLQFKFRAKFYPEDVVEEIIQDI 120
           +KKGT+LWLGV A G+ +YE  + +      +++K   + FK R KF  +   E+  ++ 
Sbjct: 1   DKKGTDLWLGVSAKGILVYEDNNKINTFPWSEIRK---ISFK-RKKFLIKLRDEDESKET 56

Query: 121 TLRLFYLQVSRSA 133
           TL  FY    R+ 
Sbjct: 57  TLT-FYTPSPRAC 68


>gnl|CDD|241233 cd00836, FERM_C-lobe, FERM domain C-lobe.  The FERM domain has a
           cloverleaf tripart structure (FERM_N, FERM_M, FERM_C/N,
           alpha-, and C-lobe/A-lobe, B-lobe, C-lobe/F1, F2, F3).
           The C-lobe/F3 within the FERM domain is part of the PH
           domain family. The FERM domain is found in the
           cytoskeletal-associated proteins such as ezrin, moesin,
           radixin, 4.1R, and merlin. These proteins provide a link
           between the membrane and cytoskeleton and are involved
           in signal transduction pathways. The FERM domain is also
           found in protein tyrosine phosphatases (PTPs), the
           tyrosine kinases FAK and JAK, in addition to other
           proteins involved in signaling. This domain is
           structurally similar to the PH and PTB domains and
           consequently is capable of binding to both peptides and
           phospholipids at different sites.
          Length = 91

 Score = 62.0 bits (151), Expect = 2e-12
 Identities = 21/58 (36%), Positives = 25/58 (43%)

Query: 148 TPKIGFPWSEIRNISFNDRKFIIKPIDKKAPDFVFFAPRVRVNKRILALCMGNHELYM 205
            P   F W EIR ISF+ + F I+    K     F     R  K I  LC+G H  Y 
Sbjct: 34  NPLTLFRWPEIRKISFSKKTFTIEIRGDKEEKLAFQTSSPREAKYIWKLCVGYHRFYR 91


>gnl|CDD|215882 pfam00373, FERM_M, FERM central domain.  This domain is the central
           structural domain of the FERM domain.
          Length = 113

 Score = 60.4 bits (147), Expect = 1e-11
 Identities = 17/47 (36%), Positives = 24/47 (51%)

Query: 11  QHKMSPSEWESSITTWWKEHKGMLREDAMMEYLKIAQDLEMYGVNYF 57
             K    E E  I    K  +G+   +A ++YL+IAQ L  YGV +F
Sbjct: 67  IKKEKSKELEKRILEAHKNLRGLSPAEAKLKYLQIAQSLPTYGVTFF 113


>gnl|CDD|214604 smart00295, B41, Band 4.1 homologues.  Also known as
           ezrin/radixin/moesin (ERM) protein domains. Present in
           myosins, ezrin, radixin, moesin, protein tyrosine
           phosphatases. Plasma membrane-binding domain. These
           proteins play structural and regulatory roles in the
           assembly and stabilization of specialized plasmamembrane
           domains. Some PDZ domain containing proteins bind one or
           more of this family. Now includes JAKs.
          Length = 201

 Score = 62.3 bits (152), Expect = 1e-11
 Identities = 18/53 (33%), Positives = 28/53 (52%), Gaps = 2/53 (3%)

Query: 5   PLRVMDQHKMSPSEWESSITTWWKEHKGMLREDAMMEYLKIAQDLEMYGVNYF 57
           P +++D       EW   I    KE  G+  E+A ++YL++A+ L  YGV  F
Sbjct: 151 PKQLLDS--RKLKEWRERIVELHKELIGLSPEEAKLKYLELARKLPTYGVELF 201



 Score = 43.4 bits (103), Expect = 4e-05
 Identities = 28/92 (30%), Positives = 37/92 (40%), Gaps = 19/92 (20%)

Query: 44  KIAQDLEMYGVNYF---EIKNKKGTELWLGVDALGLNIYEKEDNLTNVMSQDVKKENPLQ 100
            + + L +    YF        +    WL             D    ++ QDVK E PL 
Sbjct: 28  TVCRKLGIRESEYFGLQFEDPDEDLRHWL-------------DPAKTLLDQDVKSE-PLT 73

Query: 101 FKFRAKFYPEDVVEEIIQDIT-LRLFYLQVSR 131
             FR KFYP D   ++ +D T L L YLQV  
Sbjct: 74  LYFRVKFYPPD-PNQLKEDPTRLNLLYLQVRN 104


>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family.  This family of
           proteins contain a band 4.1 domain (pfam00373), at their
           amino terminus. This family represents the rest of these
           proteins.
          Length = 244

 Score = 55.1 bits (133), Expect = 7e-09
 Identities = 24/71 (33%), Positives = 44/71 (61%)

Query: 244 EKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQ 303
           E+AE++ QE  ER++Q+E +M +  ++L E +E    LEE+LKQ +   + LE +  EL+
Sbjct: 1   EEAEREQQELEERMEQMEEDMRRAQKELEEYEETALELEEKLKQEEEEAQLLEKKADELE 60

Query: 304 LELQPIRNQSN 314
            E + +  ++ 
Sbjct: 61  EENRRLEEEAA 71



 Score = 34.3 bits (79), Expect = 0.058
 Identities = 23/104 (22%), Positives = 49/104 (47%)

Query: 204 YMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVE 263
              R +    D+++ + +  E +    +  +KL+ E    +  EKK  E  E  ++LE E
Sbjct: 10  LEERMEQMEEDMRRAQKELEEYEETALELEEKLKQEEEEAQLLEKKADELEEENRRLEEE 69

Query: 264 MAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
            A  +++    +  +     ++ +L+  +E+ EA   +LQ EL+
Sbjct: 70  AAASEEERERLEAEVDEATAEVAKLEEEREKKEAETRQLQQELR 113



 Score = 28.2 bits (63), Expect = 5.6
 Identities = 17/102 (16%), Positives = 41/102 (40%), Gaps = 10/102 (9%)

Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
           ++   +  EE  A +++R++L+ E+                + +LE E  K++ +  + Q
Sbjct: 60  EEENRRLEEEAAASEEERERLEAEVDEAT----------AEVAKLEEEREKKEAETRQLQ 109

Query: 276 EMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNLNK 317
           + +R  +E  ++ +    E  A  T       P+  +     
Sbjct: 110 QELREAQEAHERARQELLEAAAAPTAPPHVAAPVNGEQLEPD 151


>gnl|CDD|241343 cd13189, FERM_C_PTPN4_PTPN3_like, Protein tyrosine phosphatase
          non-receptor proteins 3 and 4 (PTPN4 and PTPN3) FERM
          domain C-lobe.  PTPN4 (also called PTPMEG, protein
          tyrosine phosphatase, megakaryocyte) is a cytoplasmic
          protein-tyrosine phosphatase (PTP) thought to play a
          role in cerebellar function. PTPMEG-knockout mice have
          impaired memory formation and cerebellar long-term
          depression. PTPN3/PTPH1 is a membrane-associated PTP
          that is implicated in regulating tyrosine
          phosphorylation of growth factor receptors, p97 VCP
          (valosin-containing protein, or Cdc48 in Saccharomyces
          cerevisiae), and HBV (Hepatitis B Virus) gene
          expression; it is mutated in a subset of colon cancers.
          PTPMEG and PTPN3/PTPH1 contains a N-terminal FERM
          domain, a middle PDZ domain, and a C-terminal
          phosphatase domain. PTP1/Tyrosine-protein phosphatase 1
          from nematodes and a FERM_C repeat 1 from Tetraodon
          nigroviridis also included in this cd. The FERM domain
          has a cloverleaf tripart structure (FERM_N, FERM_M,
          FERM_C/N, alpha-, and C-lobe/A-lobe, B-lobe, C-lobe/F1,
          F2, F3). The C-lobe within the FERM domain is part of
          the PH domain family. The FERM domain is found in the
          cytoskeletal-associated proteins such as ezrin, moesin,
          radixin, 4.1R, and merlin. These proteins provide a
          link between the membrane and cytoskeleton and are
          involved in signal transduction pathways. The FERM
          domain is also found in protein tyrosine phosphatases
          (PTPs) , the tyrosine kinases FAK and JAK, in addition
          to other proteins involved in signaling. This domain is
          structurally similar to the PH and PTB domains and
          consequently is capable of binding to both peptides and
          phospholipids at different sites.
          Length = 123

 Score = 50.0 bits (120), Expect = 7e-08
 Identities = 14/50 (28%), Positives = 26/50 (52%)

Query: 32 GMLREDAMMEYLKIAQDLEMYGVNYFEIKNKKGTELWLGVDALGLNIYEK 81
          G    +A   +L  A+ LE+YGV     ++    EL +GV + G+ +++ 
Sbjct: 1  GQSPAEAEFNFLNTARTLELYGVELHSARDASNLELQVGVTSGGILVFQN 50



 Score = 28.4 bits (64), Expect = 2.0
 Identities = 10/18 (55%), Positives = 13/18 (72%)

Query: 153 FPWSEIRNISFNDRKFII 170
           FPWS+I  ISF  ++F I
Sbjct: 57  FPWSKIVKISFKRKQFFI 74


>gnl|CDD|241341 cd13187, FERM_C_PTPH13, Protein tyrosine phosphatase non-receptor
           13 (PTPH13) FERM domain C-lobe.  There are many
           functions of PTPN13 (also called PTPL1, PTP-BAS, hPTP1E,
           FAP1, or PTPL1). Mice lacking PTPN13 activity have
           abnormal regulation of signal transducer and activator
           of transcription signaling in their T cells, mild
           impairment of motor nerve repair, and a significant
           reduction in the growth of retinal glia cultures. It
           also plays a role in adipocyte differentiation. PTPN13
           contains a kinase non-catalytic C-lobe domain (KIND), a
           FERM domain with two potential phosphatidylinositol
           4,5-biphosphate [PtdIns(4,5)P2]-binding motifs, 5 PDZ
           domains, and a carboxy-terminal catalytic domain. There
           is an nteraction between the FERM domain of PTPL1 and
           PtdIns(4,5)P2 which is thought to regulate the membrane
           localization of PTPN13. PDZ are protein/protein
           interaction domains so there is the potential for
           numerous partners that can actively participate in the
           regulation of its phosphatase activity or can permit
           direct or indirect recruitment of tyrosine
           phosphorylated PTPL1 substrates. The FERM domain has a
           cloverleaf tripart structure (FERM_N, FERM_M, FERM_C/N,
           alpha-, and C-lobe/A-lobe, B-lobe, C-lobe/F1, F2, F3).
           The C-lobe/F3 within the FERM domain is part of the PH
           domain family. The FERM domain is found in the
           cytoskeletal-associated proteins such as ezrin, moesin,
           radixin, 4.1R, and merlin. These proteins provide a link
           between the membrane and cytoskeleton and are involved
           in signal transduction pathways. The FERM domain is also
           found in protein tyrosine phosphatases (PTPs), the
           tyrosine kinases FAK and JAK, in addition to other
           proteins involved in signaling. This domain is
           structurally similar to the PH and PTB domains and
           consequently is capable of binding to both peptides and
           phospholipids at different sites.
          Length = 124

 Score = 46.9 bits (112), Expect = 1e-06
 Identities = 25/83 (30%), Positives = 42/83 (50%), Gaps = 11/83 (13%)

Query: 32  GMLREDAMMEYLKIAQDLEMYGVNY---FEIKNKKGTELWLGVDALGLNIYEKEDNLTNV 88
           G+  E+A +E+LK+ Q L  YGV++   +  K      +WLG+ + G+ IYE  +     
Sbjct: 1   GLSEEEAELEFLKVCQKLPEYGVHFHRVYREKKSSTGGIWLGICSRGIIIYEVHNGARTP 60

Query: 89  MS----QDVKKENPLQFKFRAKF 107
           +     ++ KK   + F  R KF
Sbjct: 61  VLRFPWRETKK---ISFD-RKKF 79



 Score = 40.7 bits (96), Expect = 1e-04
 Identities = 20/61 (32%), Positives = 33/61 (54%)

Query: 148 TPKIGFPWSEIRNISFNDRKFIIKPIDKKAPDFVFFAPRVRVNKRILALCMGNHELYMRR 207
           TP + FPW E + ISF+ +KFII+          F+    + ++ +L LC   H+ +M+ 
Sbjct: 59  TPVLRFPWRETKKISFDRKKFIIENRGGSGIKHTFYTDSYKKSQYLLRLCSSQHKFHMKM 118

Query: 208 R 208
           R
Sbjct: 119 R 119


>gnl|CDD|241353 cd13199, FERM_C2_MyoVII, Myosin VII (MyoVII, Myo7) FERM domain
          C-lobe, repeat 2.  MyoVII, a MyTH-FERM myosin, is an
          actin-based motor protein essential for a variety of
          biological processes in the actin cytoskeleton
          function. Mutations in MyoVII leads to problems in
          sensory perception: deafness and blindness in humans
          (Usher Syndrome), retinal defects and deafness in mice
          (shaker 1), and aberrant auditory and vestibular
          function in zebrafish. Myosin VIIAs have plus (barbed)
          end-directed motor activity on actin filaments and a
          characteristic actin-activated ATPase activity. MyoVII
          consists of a conserved spectrin-like, SH3 subdomain
          N-terminal region, a motor/head region, a neck made of
          4-5 IQ motifs, and a tail consisting of a coiled-coil
          domain, followed by a tandem repeat of myosin tail
          homology 4 (MyTH4) domains and partial FERM domains
          that are separated by an SH3 subdomain and are thought
          to mediate dimerization and binding to other proteins
          or cargo. Members include: MyoVIIa, MyoVIIb, and MyoVII
          members that do not have distinct myosin VIIA and
          myosin VIIB genes. The FERM domain has a cloverleaf
          tripart structure (FERM_N, FERM_M, FERM_C/N, alpha-,
          and C-lobe/A-lobe, B-lobe, C-lobe/F1, F2, F3). The
          C-lobe/F3 within the FERM domain is part of the PH
          domain family. The FERM domain is found in the
          cytoskeletal-associated proteins such as ezrin, moesin,
          radixin, 4.1R, and merlin. These proteins provide a
          link between the membrane and cytoskeleton and are
          involved in signal transduction pathways. The FERM
          domain is also found in protein tyrosine phosphatases
          (PTPs) , the tyrosine kinases FAK and JAK, in addition
          to other proteins involved in signaling. This domain is
          structurally similar to the PH and PTB domains and
          consequently is capable of binding to both peptides and
          phospholipids at different sites.
          Length = 176

 Score = 46.9 bits (112), Expect = 2e-06
 Identities = 17/48 (35%), Positives = 26/48 (54%)

Query: 13 KMSPSEWESSITTWWKEHKGMLREDAMMEYLKIAQDLEMYGVNYFEIK 60
            SP EW+  I   + +H GM  E+A + +LKI      +G  +FE+K
Sbjct: 41 LQSPDEWKKLIIAAYNKHAGMSSEEAKLAFLKIIYRWPTFGSAFFEVK 88


>gnl|CDD|241338 cd13184, FERM_C_4_1_family, Protein 4.1 family FERM domain
          C-lobe.  The protein 4.1 family includes four
          well-defined members: erythroid protein 4.1 (4.1R), the
          bestknown and characterized member, 4.1G (general),
          4.1N (neuronal), and 4.1 B (brain).  The less well
          understood 4.1O/FRMD3 is not a true member of this
          family and is not included in this hierarchy. Besides
          three highly conserved domains, FERM, SAB (spectrin and
          actin binding domain) and CTD (C-terminal domain), the
          proteins from this family contain several unique
          domains: U1, U2 and U3. FERM domains like other members
          of the FERM domain superfamily have a cloverleaf
          architecture with three distinct lobes (FERM_N, FERM_M,
          FERM_C/N, alpha-, and C-lobe/A-lobe, B-lobe, C-lobe/F1,
          F2, F3). The C-lobe/F3 within the FERM domain is part
          of the PH domain family. The brain is a particularly
          rich source of protein 4.1 isoforms. The various 4.1R,
          4.1G, 4.1N, and 4.1B mRNAs are all expressed in
          distinct patterns within the brain. It is likely that
          4.1 proteins play important functional roles in the
          brain including motor coordination and spatial
          learning, postmitotic differentiation, and synaptic
          architecture and function. In addition they are found
          in nonerythroid, nonneuronal cells where they may play
          a general structural role in nuclear architecture
          and/or may interact with splicing factors.  The FERM C
          domain is the third structural domain within the FERM
          domain. The FERM domain is found in the
          cytoskeletal-associated proteins such as ezrin, moesin,
          radixin, 4.1R, and merlin. These proteins provide a
          link between the membrane and cytoskeleton and are
          involved in signal transduction pathways. The FERM
          domain is also found in protein tyrosine phosphatases
          (PTPs) , the tyrosine kinases FAK and JAK, in addition
          to other proteins involved in signaling. This domain is
          structurally similar to the PH and PTB domains and
          consequently is capable of binding to both peptides and
          phospholipids at different sites.
          Length = 113

 Score = 44.2 bits (105), Expect = 7e-06
 Identities = 16/49 (32%), Positives = 29/49 (59%)

Query: 32 GMLREDAMMEYLKIAQDLEMYGVNYFEIKNKKGTELWLGVDALGLNIYE 80
          G    +A + +L+ A+ L +YGV+    K+ +G ++ LGV A G+ +Y 
Sbjct: 1  GQTPAEAELHFLENAKKLALYGVDLHPAKDSEGVDIMLGVCAGGILVYR 49



 Score = 32.2 bits (74), Expect = 0.090
 Identities = 15/53 (28%), Positives = 23/53 (43%), Gaps = 4/53 (7%)

Query: 153 FPWSEIRNISFNDRKFIIK----PIDKKAPDFVFFAPRVRVNKRILALCMGNH 201
           F W +I  IS+    F IK      ++      F  P  R  KR+  +C+ +H
Sbjct: 57  FAWPKILKISYKRNNFYIKLRPGEFEQSETTIGFKLPNHRAAKRLWKVCVEHH 109


>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein.  Members of this
           protein family have a signal peptide, a strongly
           conserved SH3 domain, a variable region, and then a
           C-terminal hydrophobic transmembrane alpha helix region.
          Length = 198

 Score = 45.0 bits (107), Expect = 1e-05
 Identities = 14/97 (14%), Positives = 43/97 (44%)

Query: 220 AQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIR 279
            + ++E    Q++  +LQ ++A  ++  ++ ++ +  L+     + K    + +      
Sbjct: 69  PELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAI 128

Query: 280 RLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNLN 316
            L+E+ ++L+    EL+     L+ E + ++      
Sbjct: 129 ELDEENRELREELAELKQENEALEAENERLQENEQRR 165



 Score = 43.8 bits (104), Expect = 3e-05
 Identities = 26/89 (29%), Positives = 44/89 (49%)

Query: 215 VQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEA 274
           +QQ  A+ +EE    Q+Q  +LQ E    ++     +  +ERL++    + +   + +E 
Sbjct: 71  LQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIEL 130

Query: 275 QEMIRRLEEQLKQLQAAKEELEARQTELQ 303
            E  R L E+L +L+   E LEA    LQ
Sbjct: 131 DEENRELREELAELKQENEALEAENERLQ 159



 Score = 37.3 bits (87), Expect = 0.005
 Identities = 21/86 (24%), Positives = 49/86 (56%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
           ++Q+  A+ +E+    QQ+  +L+ E++  E   ++ Q+ + R+KQL     + D++  E
Sbjct: 77  ELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIELDEENRE 136

Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQ 299
            +E +  L+++ + L+A  E L+  +
Sbjct: 137 LREELAELKQENEALEAENERLQENE 162



 Score = 35.0 bits (81), Expect = 0.025
 Identities = 21/59 (35%), Positives = 35/59 (59%)

Query: 255 ERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQS 313
           ERL +L+ E+A+  ++L E QE +  L+++ ++L+     LEA    LQ EL  I+  S
Sbjct: 66  ERLPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLS 124



 Score = 32.3 bits (74), Expect = 0.22
 Identities = 18/83 (21%), Positives = 40/83 (48%), Gaps = 2/83 (2%)

Query: 231 QQRDKLQLEIA-AREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQ 289
           ++  +LQ E+A  +E+  +  ++  E  ++ +    +      E + + + L   +KQL 
Sbjct: 66  ERLPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELAR-IKQLS 124

Query: 290 AAKEELEARQTELQLELQPIRNQ 312
           A   EL+    EL+ EL  ++ +
Sbjct: 125 ANAIELDEENRELREELAELKQE 147



 Score = 30.0 bits (68), Expect = 1.1
 Identities = 16/74 (21%), Positives = 38/74 (51%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
           ++QQ   + ++E +  + + ++LQ E+A  ++      E  E  ++L  E+A+  Q+   
Sbjct: 91  ELQQENQELKQELSTLEAELERLQKELARIKQLSANAIELDEENRELREELAELKQENEA 150

Query: 274 AQEMIRRLEEQLKQ 287
            +    RL+E  ++
Sbjct: 151 LEAENERLQENEQR 164


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 46.2 bits (110), Expect = 2e-05
 Identities = 26/107 (24%), Positives = 52/107 (48%), Gaps = 11/107 (10%)

Query: 219 KAQAREEKNAKQQQRDKLQLEIAARE-----------KAEKKHQESVERLKQLEVEMAKR 267
             + +EE    +++ ++L  E+   E           + E++ +E  + L  L  E+++ 
Sbjct: 241 LEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRL 300

Query: 268 DQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSN 314
           +Q     +E +  LE QL++L+A  EELE++  EL  EL  +  +  
Sbjct: 301 EQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLE 347



 Score = 45.8 bits (109), Expect = 2e-05
 Identities = 22/106 (20%), Positives = 43/106 (40%), Gaps = 4/106 (3%)

Query: 211 DTIDVQQMKAQAREEKNAKQQQRDKLQLEIA----AREKAEKKHQESVERLKQLEVEMAK 266
                ++      E+     +  + L  EI       E+ E + +  +     LE  +A 
Sbjct: 832 RIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALAL 891

Query: 267 RDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
              +L E  E +R LE +  +L+   EEL  +  +L+L L+ +  +
Sbjct: 892 LRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEVR 937



 Score = 43.9 bits (104), Expect = 9e-05
 Identities = 23/98 (23%), Positives = 50/98 (51%), Gaps = 4/98 (4%)

Query: 221 QAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLK----QLEVEMAKRDQDLMEAQE 276
           + REE    Q++  + + E+       ++ +E +E L+    +LE E+ +  ++L     
Sbjct: 236 ELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALAN 295

Query: 277 MIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSN 314
            I RLE+Q + L+     LE +  EL+ +L+ + ++ +
Sbjct: 296 EISRLEQQKQILRERLANLERQLEELEAQLEELESKLD 333



 Score = 43.5 bits (103), Expect = 1e-04
 Identities = 22/103 (21%), Positives = 56/103 (54%), Gaps = 4/103 (3%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIA----AREKAEKKHQESVERLKQLEVEMAKRDQ 269
            ++Q K   RE     ++Q ++L+ ++       ++  ++  E  E+L++L+ E+   + 
Sbjct: 299 RLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEA 358

Query: 270 DLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
           +L E +  +  LE +L++L+   E L ++  +L+L++  + N+
Sbjct: 359 ELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNE 401



 Score = 42.0 bits (99), Expect = 3e-04
 Identities = 27/97 (27%), Positives = 56/97 (57%), Gaps = 3/97 (3%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
           ++++   Q R+E     +Q   L+ ++A   + E + ++  ER+ QL  E+ + + ++ E
Sbjct: 709 ELEEELEQLRKELEELSRQISALRKDLA---RLEAEVEQLEERIAQLSKELTELEAEIEE 765

Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIR 310
            +E +   EE+L + +A  EELEA+  +L+ EL+ +R
Sbjct: 766 LEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALR 802



 Score = 41.6 bits (98), Expect = 5e-04
 Identities = 20/97 (20%), Positives = 46/97 (47%), Gaps = 4/97 (4%)

Query: 220 AQAREEKNAKQQQRDKLQLEIAA----REKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
           A+   E    +++  +L  E+       E+ E++ +E+ E L + E E+ + +  + + +
Sbjct: 736 ARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLK 795

Query: 276 EMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
           E ++ L E L +L+A    L      L+  L+ +  +
Sbjct: 796 EELKALREALDELRAELTLLNEEAANLRERLESLERR 832



 Score = 41.6 bits (98), Expect = 5e-04
 Identities = 19/114 (16%), Positives = 54/114 (47%), Gaps = 4/114 (3%)

Query: 205 MRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREK----AEKKHQESVERLKQL 260
           +   + +  + ++   +   E    +++ ++L+LE++  E+     +K+       + +L
Sbjct: 241 LEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRL 300

Query: 261 EVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSN 314
           E +     + L   +  +  LE QL++L++  +EL     EL+ +L+ ++ +  
Sbjct: 301 EQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELE 354



 Score = 40.8 bits (96), Expect = 7e-04
 Identities = 24/106 (22%), Positives = 53/106 (50%), Gaps = 11/106 (10%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAR----EKAEKKHQESVERLKQLEVEM--AKR 267
           +++    +  EE    +++ ++L+ E+ +     E+ E + +E   RL++LE ++   + 
Sbjct: 327 ELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRS 386

Query: 268 DQDLMEAQEM-----IRRLEEQLKQLQAAKEELEARQTELQLELQP 308
               +E Q       I RLE +L++L+  +E L+    EL  +L+ 
Sbjct: 387 KVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKKLEE 432



 Score = 40.4 bits (95), Expect = 0.001
 Identities = 22/87 (25%), Positives = 41/87 (47%), Gaps = 1/87 (1%)

Query: 222 AREEKNAKQQQRDKLQLEIAAR-EKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRR 280
              E   + ++  K    +A    + E++ Q   ERL  LE ++ + +  L E +  +  
Sbjct: 275 EVSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDE 334

Query: 281 LEEQLKQLQAAKEELEARQTELQLELQ 307
           L E+L +L+   EEL+     L+ EL+
Sbjct: 335 LAEELAELEEKLEELKEELESLEAELE 361



 Score = 40.4 bits (95), Expect = 0.001
 Identities = 26/100 (26%), Positives = 52/100 (52%), Gaps = 7/100 (7%)

Query: 215 VQQMKAQAR--EEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLM 272
           ++ ++ QA   E     + +  +L+L +      E +     E L++L+ E+ + +++L 
Sbjct: 202 LKSLERQAEKAERYKELKAELRELELALLVLRLEELR-----EELEELQEELKEAEEELE 256

Query: 273 EAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
           E    ++ LEE+L++L+    ELE    ELQ EL  + N+
Sbjct: 257 ELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANE 296



 Score = 40.0 bits (94), Expect = 0.001
 Identities = 18/89 (20%), Positives = 42/89 (47%)

Query: 219 KAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMI 278
           +A+           R++L+         E++ ++  E++++L  ++     ++ E +E+I
Sbjct: 809 RAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELI 868

Query: 279 RRLEEQLKQLQAAKEELEARQTELQLELQ 307
             LE +L+ L   +  LE     L+ EL+
Sbjct: 869 EELESELEALLNERASLEEALALLRSELE 897



 Score = 40.0 bits (94), Expect = 0.001
 Identities = 24/122 (19%), Positives = 61/122 (50%), Gaps = 4/122 (3%)

Query: 200 NHELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLK- 258
             E  +   +    ++++  A+ R+E    +++ ++L+ E+    +     ++ + RL+ 
Sbjct: 681 ELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEA 740

Query: 259 ---QLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNL 315
              QLE  +A+  ++L E +  I  LEE+L++ +    E EA   EL+ +++ ++ +   
Sbjct: 741 EVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKA 800

Query: 316 NK 317
            +
Sbjct: 801 LR 802



 Score = 40.0 bits (94), Expect = 0.002
 Identities = 24/103 (23%), Positives = 51/103 (49%), Gaps = 1/103 (0%)

Query: 206 RRRKPD-TIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEM 264
           R+   + +  +  ++      +   +Q  +++        + E + +E  ERL++ E E+
Sbjct: 718 RKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEEL 777

Query: 265 AKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
           A+ + ++ E +  I +L+E+LK L+ A +EL A  T L  E  
Sbjct: 778 AEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAA 820



 Score = 38.9 bits (91), Expect = 0.004
 Identities = 25/92 (27%), Positives = 48/92 (52%), Gaps = 5/92 (5%)

Query: 221 QAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRR 280
           +        + +R++LQ EI    K  ++ +     LK+L+ E+ + +++L E QE + R
Sbjct: 404 RLEARLERLEDRRERLQQEIEELLKKLEEAE-----LKELQAELEELEEELEELQEELER 458

Query: 281 LEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
           LEE L++L+   EE E      + EL  ++ +
Sbjct: 459 LEEALEELREELEEAEQALDAAERELAQLQAR 490



 Score = 37.0 bits (86), Expect = 0.012
 Identities = 20/95 (21%), Positives = 41/95 (43%), Gaps = 4/95 (4%)

Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKA----EKKHQESVERLKQLEVEMAKRDQDL 271
           ++  A+A  E    + Q ++L+ E+ A  +A      +     E    L   +   ++ +
Sbjct: 774 EEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRI 833

Query: 272 MEAQEMIRRLEEQLKQLQAAKEELEARQTELQLEL 306
              +  +  LEEQ+++L    E L A   EL+  +
Sbjct: 834 AATERRLEDLEEQIEELSEDIESLAAEIEELEELI 868



 Score = 37.0 bits (86), Expect = 0.014
 Identities = 22/92 (23%), Positives = 48/92 (52%), Gaps = 2/92 (2%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQ--ESVERLKQLEVEMAKRDQDL 271
            ++  + + ++E     ++ ++ +L+    E  E + +  E  E L++LE  + +  ++L
Sbjct: 411 RLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREEL 470

Query: 272 MEAQEMIRRLEEQLKQLQAAKEELEARQTELQ 303
            EA++ +   E +L QLQA  + LE  Q  L+
Sbjct: 471 EEAEQALDAAERELAQLQARLDSLERLQENLE 502



 Score = 34.6 bits (80), Expect = 0.068
 Identities = 20/88 (22%), Positives = 38/88 (43%), Gaps = 4/88 (4%)

Query: 220 AQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLK----QLEVEMAKRDQDLMEAQ 275
                E    ++  ++L+ E+ A        +E++  L+    +L  E+ + +    E +
Sbjct: 855 ESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELR 914

Query: 276 EMIRRLEEQLKQLQAAKEELEARQTELQ 303
             +  L E+L QL+   E LE R   LQ
Sbjct: 915 RELEELREKLAQLELRLEGLEVRIDNLQ 942



 Score = 34.6 bits (80), Expect = 0.075
 Identities = 24/93 (25%), Positives = 49/93 (52%), Gaps = 8/93 (8%)

Query: 219 KAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERL----KQLEVEMAKRDQDLMEA 274
            ++  EE    Q++   L  EI+  E+ ++  +E +  L    ++LE ++ + +  L E 
Sbjct: 276 VSELEEEIEELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDEL 335

Query: 275 QEMIRRLEEQLKQLQ----AAKEELEARQTELQ 303
            E +  LEE+L++L+    + + ELE  + EL+
Sbjct: 336 AEELAELEEKLEELKEELESLEAELEELEAELE 368



 Score = 34.3 bits (79), Expect = 0.087
 Identities = 24/109 (22%), Positives = 45/109 (41%), Gaps = 12/109 (11%)

Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQ------ 269
           ++++++  E +   +  R K+            + +    RL++LE    +  Q      
Sbjct: 368 EELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELL 427

Query: 270 ------DLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
                 +L E Q  +  LEE+L++LQ   E LE    EL+ EL+     
Sbjct: 428 KKLEEAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQA 476



 Score = 33.9 bits (78), Expect = 0.14
 Identities = 25/113 (22%), Positives = 52/113 (46%), Gaps = 16/113 (14%)

Query: 201 HELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAA----REKAEKKHQESVER 256
            E  +   + +  ++ +   +   +++  +++ ++L+ ++A      E  E +     ER
Sbjct: 885 LEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQER 944

Query: 257 LK---QLEVEMA-----KRDQDLMEAQEMIRRLEEQLKQL----QAAKEELEA 297
           L     L +E A     K + D  EA+  ++RLE ++K+L     AA EE E 
Sbjct: 945 LSEEYSLTLEEAEALENKIEDDEEEARRRLKRLENKIKELGPVNLAAIEEYEE 997



 Score = 28.1 bits (63), Expect = 8.7
 Identities = 16/44 (36%), Positives = 27/44 (61%)

Query: 267 RDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIR 310
           R +++ E +E I  LEE++ +L+ A  EL     EL+ EL+ +R
Sbjct: 675 RRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLR 718


>gnl|CDD|241345 cd13191, FERM_C_FRMD4A_FRMD4B, FERM domain-containing protein 4A
           and 4B (FRMD4A and 4B) FERM domain C-lobe.  FRMD4A is
           part of the Par-3/FRMD4A/cytohesin-1 complex that
           activates Arf6, a central player in actin cytoskeleton
           dynamics and membrane trafficking, during junctional
           remodeling and epithelial polarization. The
           Par-3/Par-6/aPKC/Cdc42 complex regulates the conversion
           of primordial adherens junctions (AJs) into belt-like
           AJs and the formation of linear actin cables. When
           primordial AJs are formed, Par-3 recruits scaffolding
           protein FRMD4A which connects Par-3 and the Arf6
           guanine-nucleotide exchange factor (GEF), cytohesin-1.
           FRMD4B (also called GRP1-binding protein, GRSP1) is a
           novel member of GRP1 signaling complexes that are
           recruited to plasma membrane ruffles in response to
           insulin receptor signaling. The GRSP1/FRMD4B protein
           contains a FERM protein domain as well as two coiled
           coil domains and may function as a scaffolding protein.
           GRP1 and GRSP1 interact through the coiled coil domains
           in the two proteins. The FERM domain has a cloverleaf
           tripart structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
           C-lobe/A-lobe, B-lobe, C-lobe/F1, F2, F3). The C-lobe/F3
           within the FERM domain is part of the PH domain family.
           The FERM domain is found in the cytoskeletal-associated
           proteins such as ezrin, moesin, radixin, 4.1R, and
           merlin. These proteins provide a link between the
           membrane and cytoskeleton and are involved in signal
           transduction pathways. The FERM domain is also found in
           protein tyrosine phosphatases (PTPs) , the tyrosine
           kinases FAK and JAK, in addition to other proteins
           involved in signaling. This domain is structurally
           similar to the PH and PTB domains and consequently is
           capable of binding to both peptides and phospholipids at
           different sites.
          Length = 115

 Score = 43.1 bits (102), Expect = 2e-05
 Identities = 32/173 (18%), Positives = 57/173 (32%), Gaps = 75/173 (43%)

Query: 50  EMYGVNYFEIKNKKGTELWLGVDALGLNIYEKEDNLTNVMSQDVKKENPLQFKFRAKFYP 109
             YGV+Y+E+K+K G   WLG+   G+  Y+ +D           K  P       K + 
Sbjct: 1   PTYGVHYYEVKDKNGIPWWLGLSYKGIGQYDLQD-----------KVKP------RKLFQ 43

Query: 110 EDVVEEIIQDITLRLFYLQVSRSAGSRVRFPPGPNCLLTPKIGFPWSEIRNISFNDRKFI 169
                                                        W ++ N+ F DRKF 
Sbjct: 44  ---------------------------------------------WKQLENLYFRDRKFS 58

Query: 170 IKPID-KKAPD------------FVFFAPRVRVNKRILALCMGNHELYMRRRK 209
           I+  D ++                V++     + K I ++ +  H+ Y+ R++
Sbjct: 59  IEVHDPRRISVSRRTFGQSGVAVHVWYGNTPSLIKSIWSMAISQHQFYLDRKQ 111


>gnl|CDD|224755 COG1842, PspA, Phage shock protein A (IM30), suppresses
           sigma54-dependent transcription [Transcription / Signal
           transduction mechanisms].
          Length = 225

 Score = 44.6 bits (106), Expect = 2e-05
 Identities = 29/126 (23%), Positives = 57/126 (45%), Gaps = 9/126 (7%)

Query: 202 ELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAARE-KAEKKHQ--------E 252
           E  +R  + +    +Q  AQA   +   +++ ++ Q      E KAE   Q        E
Sbjct: 30  EQAIRDMESELAKARQALAQAIARQKQLERKLEEAQARAEKLEEKAELALQAGNEDLARE 89

Query: 253 SVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
           ++E  + LE      + +L +A+E + +L++QL  L+    EL A++  L+      + Q
Sbjct: 90  ALEEKQSLEDLAKALEAELQQAEEQVEKLKKQLAALEQKIAELRAKKEALKARKAAAKAQ 149

Query: 313 SNLNKV 318
             +N+ 
Sbjct: 150 EKVNRS 155


>gnl|CDD|241339 cd13185, FERM_C_FRMD1_FRMD6, FERM domain containing 1 and 6
           proteins FERM domain C-lobe.  FRMD6 (also called willin
           and hEx/human expanded) is localized throughout the
           cytoplasm or along the plasma membrane  The Drosophilla
           protein Ex is a regulator of the Hippo/SWH (Sav/Wts/Hpo)
           signaling pathway, a signaling pathway that plays a
           pivotal role in organ size control and is tumor
           suppression by restricting proliferation and promoting
           apoptosis. Surprisingly, hEx is thought to function
           independently of the Hippo pathway. Instead it is
           hypothesized that hEx inhibits progression through the S
           phase of the cell cycle by upregulating p21(Cip1) and
           downregulating Cyclin A. It is also implicated in the
           progression of Alzheimer disease. Not much is known
           about FRMD1 to date. Both FRMD1 and FRMD6 contains a
           single FERM domain which has a cloverleaf tripart
           structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
           C-lobe/A-lobe, B-lobe, C-lobe/F1, F2, F3). The C-lobe is
           a member of the PH superfamily. The FERM domain is found
           in the cytoskeletal-associated proteins such as ezrin,
           moesin, radixin, 4.1R, and merlin. These proteins
           provide a link between the membrane and cytoskeleton and
           are involved in signal transduction pathways. The FERM
           domain is also found in protein tyrosine phosphatases
           (PTPs) , the tyrosine kinases FAK and JAK, in addition
           to other proteins involved in signaling. This domain is
           structurally similar to the PH and PTB domains and
           consequently is capable of binding to both peptides and
           phospholipids at different sites.
          Length = 109

 Score = 42.7 bits (101), Expect = 2e-05
 Identities = 21/67 (31%), Positives = 31/67 (46%), Gaps = 4/67 (5%)

Query: 141 PGPNCLLTPKIGFPWSEIRNISFNDRKFIIKPIDKKAPD-FVFFAPRVRVNKRILALCMG 199
            G   LL     FPWS I  +SF+ +KF I+P    +     ++      +K +LALC  
Sbjct: 37  DGEKQLLRT---FPWSNIGKLSFDRKKFEIRPEGLPSLRKLTYYTGSDEKSKYLLALCRE 93

Query: 200 NHELYMR 206
            H+  M 
Sbjct: 94  THQFSMA 100


>gnl|CDD|241349 cd13195, FERM_C_MYLIP_IDOL, The E3 ubiquitin ligase myosin
          regulatory light chain-interacting protein (MYLIP; also
          called inducible degrader of the LDL receptor, IDOL)
          FERM domain C-lobe.  MYLIP/IDOL is a regulator of the
          LDL receptor (LDLR) pathway via the nuclear receptor
          liver X receptor (LXR). In response to cellular
          cholesterol loading, the activation of LXR leads to the
          induction of MYLIP expression. MYLIP stimulates
          ubiquitination of the LDLR on its cytoplasmic tail,
          directing its degradation. The LXR-MYLIP-LDLR pathway
          provides a complementary pathway to sterol regulatory
          element-binding proteins for the feedback inhibition of
          cholesterol uptake. MYLIP has an N-terminal FERM domain
          and in some cases a C-terminal RING domain. The FERM
          domain has a cloverleaf tripart structure (FERM_N,
          FERM_M, FERM_C/N, alpha-, and C-lobe/A-lobe, B-lobe,
          C-lobe/F1, F2, F3). The C-lobe/F3 within the FERM
          domain is part of the PH domain family. The FERM domain
          is found in the cytoskeletal-associated proteins such
          as ezrin, moesin, radixin, 4.1R, and merlin. These
          proteins provide a link between the membrane and
          cytoskeleton and are involved in signal transduction
          pathways. The FERM domain is also found in protein
          tyrosine phosphatases (PTPs) , the tyrosine kinases FAK
          and JAK, in addition to other proteins involved in
          signaling. This domain is structurally similar to the
          PH and PTB domains and consequently is capable of
          binding to both peptides and phospholipids at different
          sites.
          Length = 131

 Score = 42.6 bits (101), Expect = 3e-05
 Identities = 18/56 (32%), Positives = 27/56 (48%)

Query: 32 GMLREDAMMEYLKIAQDLEMYGVNYFEIKNKKGTELWLGVDALGLNIYEKEDNLTN 87
          GM    A   +LK   +LE YGV  F ++  +G +L +GV   G+ I   +  L  
Sbjct: 1  GMSPSTAEYLFLKEVSELENYGVETFHVRTIEGQKLVIGVGPDGIRICNDDGELIQ 56


>gnl|CDD|241356 cd13202, FERM_C_MyoX, Myosin X (MyoX, Myo10) FERM domain C-lobe. 
          MyoX, a MyTH-FERM myosin, is a molecular motor that has
          crucial functions in the transport and/or tethering of
          integrins in the actin-based extensions known as
          filopodia, microtubule binding, and in netrin-mediated
          axon guidance. It functions as a dimer. MyoX walks on
          bundles of actin, rather than single filaments, unlike
          the other unconventional myosins. MyoX is present in
          organisms ranging from humans to choanoflagellates, but
          not in Drosophila and Caenorhabditis elegans.MyoX
          consists of a N-terminal motor/head region, a neck made
          of 3 IQ motifs, and a tail consisting of a coiled-coil
          domain, a PEST region, 3 PH domains, a myosin tail
          homology 4 (MyTH4), and a FERM domain at its very
          C-terminus. The MyoX FERM domain binds to the NPXY
          motif of several beta-integrins, a key family of cell
          surface receptors that are involved in cell adhesion
          and migration. In addition the FERM domain binds to the
          cytoplasmic domains of the netrin receptors DCC
          (deleted in colorectal cancer) and neogenin. The FERM
          domain also forms a supramodule with its MyTH4 domain
          which binds to the negatively charged E-hook region in
          the tails of alpha- and beta-tubulin forming a proposed
          motorized link between actin filaments and
          microtubules. The FERM domain has a cloverleaf tripart
          structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
          C-lobe/A-lobe, B-lobe, C-lobe/F1, F2, F3). The
          C-lobe/F3 within the FERM domain is part of the PH
          domain family. The FERM domain is found in the
          cytoskeletal-associated proteins such as ezrin, moesin,
          radixin, 4.1R, and merlin. These proteins provide a
          link between the membrane and cytoskeleton and are
          involved in signal transduction pathways. The FERM
          domain is also found in protein tyrosine phosphatases
          (PTPs) , the tyrosine kinases FAK and JAK, in addition
          to other proteins involved in signaling. This domain is
          structurally similar to the PH and PTB domains and
          consequently is capable of binding to both peptides and
          phospholipids at different sites.
          Length = 111

 Score = 42.3 bits (100), Expect = 3e-05
 Identities = 21/51 (41%), Positives = 31/51 (60%), Gaps = 4/51 (7%)

Query: 33 MLREDAMMEYLKIAQDLEMYGVNYFEIKNKKGT----ELWLGVDALGLNIY 79
          M +EDAM +Y+ I ++   YG   FE++  K      ELWLGV A G+++Y
Sbjct: 1  MSQEDAMAKYMAIIREWPGYGSTLFEVECNKEGGFPQELWLGVSAKGVSLY 51


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 44.7 bits (106), Expect = 5e-05
 Identities = 26/104 (25%), Positives = 55/104 (52%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
            +++++ Q  E +   ++ + +L       E+ + + +E  E L++LE E+ +  + L E
Sbjct: 703 LLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEE 762

Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNLNK 317
            +E +  LEE L +L+   EELE ++  LQ EL+ +  +    +
Sbjct: 763 LEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAE 806



 Score = 44.7 bits (106), Expect = 5e-05
 Identities = 26/107 (24%), Positives = 63/107 (58%), Gaps = 4/107 (3%)

Query: 215 VQQMKAQAREEKNAKQQQRDKLQLEIAAR----EKAEKKHQESVERLKQLEVEMAKRDQD 270
           +++++ QA + +  ++ + +  +LE+A      ++  K+ +E  E L +LE E+ +  ++
Sbjct: 202 LEKLERQAEKAERYQELKAELRELELALLLAKLKELRKELEELEEELSRLEEELEELQEE 261

Query: 271 LMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNLNK 317
           L EA++ I  L+ +L++L+   EEL+    EL+ E++ +  + +L +
Sbjct: 262 LEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLR 308



 Score = 43.2 bits (102), Expect = 2e-04
 Identities = 31/103 (30%), Positives = 57/103 (55%), Gaps = 4/103 (3%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREK----AEKKHQESVERLKQLEVEMAKRDQ 269
           + ++       E  + +Q+R++L+ EI   E+     E+K  E  E L++LE E+ +  +
Sbjct: 804 EAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKE 863

Query: 270 DLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
           +L E +     LE++LK+L+  KEELE    EL+ EL  ++ +
Sbjct: 864 ELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEE 906



 Score = 42.8 bits (101), Expect = 2e-04
 Identities = 22/94 (23%), Positives = 44/94 (46%), Gaps = 4/94 (4%)

Query: 217 QMKAQAREEKNAKQQQRDKLQLEIAARE----KAEKKHQESVERLKQLEVEMAKRDQDLM 272
               + REE    Q++  +L+ EI   E       ++ +E    L++LE  + +  + + 
Sbjct: 274 SELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIE 333

Query: 273 EAQEMIRRLEEQLKQLQAAKEELEARQTELQLEL 306
             +E +   E  L++L+    ELE  + EL+ +L
Sbjct: 334 ALKEELEERETLLEELEQLLAELEEAKEELEEKL 367



 Score = 42.4 bits (100), Expect = 2e-04
 Identities = 29/100 (29%), Positives = 54/100 (54%), Gaps = 3/100 (3%)

Query: 213 IDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLM 272
            +++   A+ R E    +++ + L+      E+  ++ ++  E LK+LE E+ +   +L 
Sbjct: 386 AELEAELAEIRNELEELKREIESLEER---LERLSERLEDLKEELKELEAELEELQTELE 442

Query: 273 EAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
           E  E +  LEEQL++L+   +ELE    ELQ ELQ +  +
Sbjct: 443 ELNEELEELEEQLEELRDRLKELERELAELQEELQRLEKE 482



 Score = 42.4 bits (100), Expect = 3e-04
 Identities = 26/94 (27%), Positives = 52/94 (55%), Gaps = 3/94 (3%)

Query: 219 KAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMI 278
             +  EE +  +++ ++LQ      E+AEK+ +E    L++L  E+ +  ++L+E +E I
Sbjct: 241 LEELEEELSRLEEELEELQ---EELEEAEKEIEELKSELEELREELEELQEELLELKEEI 297

Query: 279 RRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
             LE ++  L+   EELE    EL+  L+ ++ +
Sbjct: 298 EELEGEISLLRERLEELENELEELEERLEELKEK 331



 Score = 42.0 bits (99), Expect = 3e-04
 Identities = 23/101 (22%), Positives = 58/101 (57%), Gaps = 3/101 (2%)

Query: 207 RRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAK 266
           + K +  ++++ +   +EE    +++ ++ +  + A E+  +  ++  ERL+Q   E+ +
Sbjct: 776 KLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQ---EIEE 832

Query: 267 RDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
            ++++ E +E +  LEE+L++L+   EEL+    EL+ E +
Sbjct: 833 LEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKE 873



 Score = 40.9 bits (96), Expect = 8e-04
 Identities = 23/98 (23%), Positives = 52/98 (53%), Gaps = 4/98 (4%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVE----RLKQLEVEMAKRDQ 269
            ++       +     ++Q ++L+ ++   ++     +E +E    RL++LE E+ + ++
Sbjct: 692 SLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEE 751

Query: 270 DLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
           +L E QE +  LEE+L+ L+ A  +L+    EL+ + Q
Sbjct: 752 ELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQ 789



 Score = 40.5 bits (95), Expect = 0.001
 Identities = 28/110 (25%), Positives = 56/110 (50%), Gaps = 11/110 (10%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIA-----------AREKAEKKHQESVERLKQLEV 262
           ++Q+   +  EE  + ++   KL+ EI              E+ E++ +E+  RL  LE 
Sbjct: 755 ELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALER 814

Query: 263 EMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
           E+   +Q     ++ I  LEE++++L+   +ELE    EL+ EL+ ++ +
Sbjct: 815 ELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEE 864



 Score = 40.5 bits (95), Expect = 0.001
 Identities = 27/108 (25%), Positives = 50/108 (46%), Gaps = 7/108 (6%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
           +++   +  RE     + + ++L+  +   ++  +  +E +E  + L  E+ +   +L E
Sbjct: 299 ELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEE 358

Query: 274 AQEMIRRLEEQLKQLQAAKEE----LEARQTELQLELQPIRNQSNLNK 317
           A+E    LEE+L  L    EE    L     EL+ EL  IRN+    K
Sbjct: 359 AKE---ELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELK 403



 Score = 38.5 bits (90), Expect = 0.004
 Identities = 25/99 (25%), Positives = 61/99 (61%), Gaps = 3/99 (3%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
           ++++   +A    +A +++ + L+     RE+ E++ +E  E +++LE ++ + +++L E
Sbjct: 797 ELEEELEEAERRLDALERELESLE---QRRERLEQEIEELEEEIEELEEKLDELEEELEE 853

Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
            ++ +  L+E+L++L+A KEELE    EL+ E + +  +
Sbjct: 854 LEKELEELKEELEELEAEKEELEDELKELEEEKEELEEE 892



 Score = 38.2 bits (89), Expect = 0.006
 Identities = 28/90 (31%), Positives = 50/90 (55%), Gaps = 10/90 (11%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
           D+++   +   E    Q + ++L  E+   E+ E++ +E  +RLK+LE       ++L E
Sbjct: 422 DLKEELKELEAELEELQTELEELNEEL---EELEEQLEELRDRLKELE-------RELAE 471

Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQTELQ 303
            QE ++RLE++L  L+A  + LEA Q   Q
Sbjct: 472 LQEELQRLEKELSSLEARLDRLEAEQRASQ 501



 Score = 38.2 bits (89), Expect = 0.006
 Identities = 28/105 (26%), Positives = 59/105 (56%), Gaps = 11/105 (10%)

Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLK----QLEVEMAKRDQDL 271
           +Q   +  EE    +++ D+L+ E+   EK  ++ +E +E L+    +LE E+ + +++ 
Sbjct: 827 EQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEK 886

Query: 272 MEAQEMIR-------RLEEQLKQLQAAKEELEARQTELQLELQPI 309
            E +E +R        L+E++++L+   EELEA+   L++EL  +
Sbjct: 887 EELEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPEL 931



 Score = 37.8 bits (88), Expect = 0.007
 Identities = 23/112 (20%), Positives = 54/112 (48%), Gaps = 1/112 (0%)

Query: 201 HELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQL 260
            ++   + + +  +    + +    +  + ++  + +L     E  E+  +   E L +L
Sbjct: 330 EKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEEL-EELFEALREELAEL 388

Query: 261 EVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
           E E+A+   +L E +  I  LEE+L++L    E+L+    EL+ EL+ ++ +
Sbjct: 389 EAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTE 440



 Score = 37.8 bits (88), Expect = 0.007
 Identities = 30/122 (24%), Positives = 58/122 (47%)

Query: 191 KRILALCMGNHELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKH 250
           K  L       E   R+ +     ++++K +    +   +Q + +L+      E+ E++ 
Sbjct: 694 KNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEEL 753

Query: 251 QESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIR 310
           +E  ERL++LE E+   ++ L + +E I  LEE+ + LQ   EELE    E +  L  + 
Sbjct: 754 EELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALE 813

Query: 311 NQ 312
            +
Sbjct: 814 RE 815



 Score = 37.4 bits (87), Expect = 0.011
 Identities = 28/101 (27%), Positives = 55/101 (54%), Gaps = 4/101 (3%)

Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVE----RLKQLEVEMAKRDQDL 271
           ++  A+ +EE    +++R  LQ E+   E+  ++ +  ++     L+ LE    + +Q++
Sbjct: 771 EEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEI 830

Query: 272 MEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
            E +E I  LEE+L +L+   EELE    EL+ EL+ +  +
Sbjct: 831 EELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAE 871



 Score = 35.1 bits (81), Expect = 0.046
 Identities = 20/87 (22%), Positives = 43/87 (49%), Gaps = 1/87 (1%)

Query: 223 REEKNAKQQQRDKLQLEIAAREKAEKKHQESVERL-KQLEVEMAKRDQDLMEAQEMIRRL 281
            +E    +++ ++L+ E    E   K+ +E  E L ++L    ++  +   E +++  RL
Sbjct: 855 EKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERL 914

Query: 282 EEQLKQLQAAKEELEARQTELQLELQP 308
           EE   +L+  + EL   + EL+ E + 
Sbjct: 915 EELEAKLERLEVELPELEEELEEEYED 941



 Score = 35.1 bits (81), Expect = 0.056
 Identities = 26/120 (21%), Positives = 59/120 (49%), Gaps = 9/120 (7%)

Query: 202 ELYMRRRKPDTIDVQQMKAQAR-----EEKNAKQQQRDKLQLEIAAREKAEKKHQESVER 256
           +L   +R+   ++ +  + Q+R     EE    +++ ++LQ  +   E+  +  +E++ +
Sbjct: 717 QLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAK 776

Query: 257 LKQLEVEMAKRDQDLMEAQEM----IRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
           LK+   E+ ++ Q L E  E     +   E +L  L+   E LE R+  L+ E++ +  +
Sbjct: 777 LKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEE 836



 Score = 34.7 bits (80), Expect = 0.073
 Identities = 18/99 (18%), Positives = 51/99 (51%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
           +++++K +    +   ++  ++L+      ++ E + +E    L++L  E+ + ++ L E
Sbjct: 398 ELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEE 457

Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
            ++ ++ LE +L +LQ   + LE   + L+  L  +  +
Sbjct: 458 LRDRLKELERELAELQEELQRLEKELSSLEARLDRLEAE 496



 Score = 34.3 bits (79), Expect = 0.10
 Identities = 25/103 (24%), Positives = 56/103 (54%), Gaps = 2/103 (1%)

Query: 212 TIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDL 271
             +++++K +  E +  K++  D+L+     +E+ E++ +E    L +L+ E+ K  + L
Sbjct: 855 EKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERL 914

Query: 272 MEAQEMIRRLEEQLKQLQAAKEELE--ARQTELQLELQPIRNQ 312
            E +  + RLE +L +L+   EE      +TEL+ E++ +  +
Sbjct: 915 EELEAKLERLEVELPELEEELEEEYEDTLETELEREIERLEEE 957



 Score = 33.9 bits (78), Expect = 0.11
 Identities = 24/97 (24%), Positives = 51/97 (52%), Gaps = 4/97 (4%)

Query: 222 AREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLM----EAQEM 277
            + E    +++  +L+ ++   E+  K  +  +  L+ L  E+ ++ ++L     E +  
Sbjct: 665 QKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRE 724

Query: 278 IRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSN 314
           +  LEE+L+QLQ+  EELE    EL+ EL+ ++ +  
Sbjct: 725 LAALEEELEQLQSRLEELEEELEELEEELEELQERLE 761



 Score = 33.9 bits (78), Expect = 0.13
 Identities = 19/97 (19%), Positives = 51/97 (52%), Gaps = 7/97 (7%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAK------- 266
           ++++++ +  E +   ++ + +L+      E+ +++  E  E +++LE E++        
Sbjct: 254 ELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEE 313

Query: 267 RDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQ 303
            + +L E +E +  L+E+++ L+   EE E    EL+
Sbjct: 314 LENELEELEERLEELKEKIEALKEELEERETLLEELE 350



 Score = 32.8 bits (75), Expect = 0.28
 Identities = 18/100 (18%), Positives = 47/100 (47%), Gaps = 6/100 (6%)

Query: 219  KAQAREEKNAKQQQRDKLQLEIAAREKA------EKKHQESVERLKQLEVEMAKRDQDLM 272
              + RE     + + ++L++E+   E+       +    E    +++LE E+       +
Sbjct: 907  IEKLRERLEELEAKLERLEVELPELEEELEEEYEDTLETELEREIERLEEEIEALGPVNL 966

Query: 273  EAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
             A E    +EE+ ++L++ +E+LE  + +L   ++ +  +
Sbjct: 967  RAIEEYEEVEERYEELKSQREDLEEAKEKLLEVIEELDKE 1006



 Score = 32.4 bits (74), Expect = 0.37
 Identities = 24/104 (23%), Positives = 48/104 (46%), Gaps = 5/104 (4%)

Query: 205 MRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAA-----REKAEKKHQESVERLKQ 259
           +  R+    +++Q+ A+  E K   +++   L  E+       RE+  +   E  E   +
Sbjct: 339 LEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNE 398

Query: 260 LEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQ 303
           LE    + +      + +  RLE+  ++L+  + ELE  QTEL+
Sbjct: 399 LEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELE 442



 Score = 29.7 bits (67), Expect = 2.5
 Identities = 17/62 (27%), Positives = 37/62 (59%)

Query: 256 RLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNL 315
            LK+LE E+A+ +  L + +E ++ L+ +L+ L+   EEL  +  EL+ +L+ ++ +   
Sbjct: 668 ELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAA 727

Query: 316 NK 317
            +
Sbjct: 728 LE 729



 Score = 28.1 bits (63), Expect = 7.9
 Identities = 20/76 (26%), Positives = 41/76 (53%)

Query: 239 EIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEAR 298
              +    +++ +E  E L +LE ++ K +++L   +  +R LE+ L++L+   EELE +
Sbjct: 658 NKRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQ 717

Query: 299 QTELQLELQPIRNQSN 314
             EL+ EL  +  +  
Sbjct: 718 LEELKRELAALEEELE 733


>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
           envelope biogenesis, outer membrane].
          Length = 387

 Score = 43.4 bits (102), Expect = 9e-05
 Identities = 25/89 (28%), Positives = 44/89 (49%), Gaps = 2/89 (2%)

Query: 215 VQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEA 274
           VQQ      ++ +AK+ ++ + + E    E+ + K     ERLKQLE E  K  +   +A
Sbjct: 61  VQQYGRIQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQA 120

Query: 275 QEMIRRLEEQLKQL--QAAKEELEARQTE 301
           +E  ++ + + KQ   QA K   E ++  
Sbjct: 121 EEAEKQAQLEQKQQEEQARKAAAEQKKKA 149



 Score = 33.8 bits (77), Expect = 0.10
 Identities = 15/88 (17%), Positives = 37/88 (42%)

Query: 206 RRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMA 265
           ++ +      ++  A+  + K A +Q+R K   +   + + ++K  E  E+  QLE +  
Sbjct: 75  KKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQ 134

Query: 266 KRDQDLMEAQEMIRRLEEQLKQLQAAKE 293
           +       A++  +    + K    A +
Sbjct: 135 EEQARKAAAEQKKKAEAAKAKAAAEAAK 162



 Score = 31.8 bits (72), Expect = 0.41
 Identities = 20/83 (24%), Positives = 41/83 (49%), Gaps = 3/83 (3%)

Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
           Q  K + + ++  KQ +  + Q ++  +++ E+  + + E+ K+ E   AK      EA 
Sbjct: 105 QLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKA---AAEAA 161

Query: 276 EMIRRLEEQLKQLQAAKEELEAR 298
           ++    E + K  +AAK   EA+
Sbjct: 162 KLKAAAEAKKKAEEAAKAAEEAK 184


>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
          Length = 1486

 Score = 43.0 bits (102), Expect = 2e-04
 Identities = 29/96 (30%), Positives = 54/96 (56%), Gaps = 6/96 (6%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEI---AAREKAEKKHQESVERLKQLEVEMAKRDQD 270
            + +  ++ARE + A +QQ ++LQ  I   AAR  A    Q+++ RL++   E  +  QD
Sbjct: 569 SLSESVSEARERRMALRQQLEQLQARIQRLAARAPAWLAAQDALARLREQSGEEFEDSQD 628

Query: 271 LMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLEL 306
           + E    +++L E+ ++L   ++EL AR+  L  E+
Sbjct: 629 VTEY---MQQLLERERELTVERDELAARKQALDEEI 661



 Score = 39.9 bits (94), Expect = 0.002
 Identities = 18/92 (19%), Positives = 46/92 (50%), Gaps = 1/92 (1%)

Query: 221 QAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRR 280
           + RE++    +Q  +L++ ++  E+  ++ Q +   L +    + K   D  E +++   
Sbjct: 504 RLREQR-HLAEQLQQLRMRLSELEQRLRQQQRAERLLAEFCKRLGKNLDDEDELEQLQEE 562

Query: 281 LEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
           LE +L+ L  +  E   R+  L+ +L+ ++ +
Sbjct: 563 LEARLESLSESVSEARERRMALRQQLEQLQAR 594



 Score = 35.7 bits (83), Expect = 0.030
 Identities = 18/88 (20%), Positives = 40/88 (45%), Gaps = 4/88 (4%)

Query: 215 VQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEA 274
           +QQ++ +  E +   +QQ+   +L      +  K+  ++++   +LE    + +  L   
Sbjct: 515 LQQLRMRLSELEQRLRQQQRAERL----LAEFCKRLGKNLDDEDELEQLQEELEARLESL 570

Query: 275 QEMIRRLEEQLKQLQAAKEELEARQTEL 302
            E +    E+   L+   E+L+AR   L
Sbjct: 571 SESVSEARERRMALRQQLEQLQARIQRL 598



 Score = 34.2 bits (79), Expect = 0.090
 Identities = 25/113 (22%), Positives = 50/113 (44%), Gaps = 16/113 (14%)

Query: 216  QQMKAQAREE-KNAKQQQRDKLQLEIA---AREKAEKKHQESVERLKQLEV-------EM 264
            +Q + +ARE+ + A+ Q     Q+  +   + +   +  QE  + L+ L V       E 
Sbjct: 998  EQERTRAREQLRQAQAQLAQYNQVLASLKSSYDAKRQMLQELKQELQDLGVPADSGAEER 1057

Query: 265  AKRDQD-----LMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
            A+  +D     L   +    +LE+QL   +A  + L  +  +L+ +   +R Q
Sbjct: 1058 ARARRDELHARLSANRSRRNQLEKQLTFCEAEMDNLTKKLRKLERDYHEMREQ 1110



 Score = 32.2 bits (74), Expect = 0.41
 Identities = 33/124 (26%), Positives = 59/124 (47%), Gaps = 23/124 (18%)

Query: 206 RRRKPD-TIDVQQMKAQAREEKNAKQQQ----RDKLQLEIAAREKA-EKKHQESVERLKQ 259
           RR   +  +++++    +R +  A+Q +      +L  E+   E   E+ +Q + + L  
Sbjct: 281 RRVHLEEALELRRELYTSRRQLAAEQYRLVEMARELA-ELNEAESDLEQDYQAASDHL-N 338

Query: 260 LEVEMAKRDQ--------DLMEAQEMIRRLEEQLKQLQAAKE---ELEARQTELQLELQP 308
           L V+ A R Q        DL E +E   RLEEQ + ++ A E   E EAR    + E+  
Sbjct: 339 L-VQTALRQQEKIERYQADLEELEE---RLEEQNEVVEEADEQQEENEARAEAAEEEVDE 394

Query: 309 IRNQ 312
           +++Q
Sbjct: 395 LKSQ 398



 Score = 29.5 bits (67), Expect = 3.3
 Identities = 23/89 (25%), Positives = 42/89 (47%), Gaps = 2/89 (2%)

Query: 215  VQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEA 274
            ++Q   QA++ +   +QQ   L  E+  R +A   ++++ E L +      K  Q L +A
Sbjct: 940  LKQDYQQAQQTQRDAKQQAFALT-EVVQR-RAHFSYEDAAEMLAKNSDLNEKLRQRLEQA 997

Query: 275  QEMIRRLEEQLKQLQAAKEELEARQTELQ 303
            ++   R  EQL+Q QA   +       L+
Sbjct: 998  EQERTRAREQLRQAQAQLAQYNQVLASLK 1026



 Score = 28.4 bits (64), Expect = 5.8
 Identities = 24/116 (20%), Positives = 48/116 (41%), Gaps = 16/116 (13%)

Query: 213 IDVQQMKA-QAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDL 271
           +DVQQ +A Q ++   A ++ +    L     + AE   +E   + ++   E+   +Q L
Sbjct: 406 LDVQQTRAIQYQQAVQALERAKQLCGLPDLTADNAEDWLEEFQAKEQEATEELLSLEQKL 465

Query: 272 MEAQEMIRRLEEQLKQLQAAKEELE-------ARQTELQLE--------LQPIRNQ 312
             AQ    + E+  + ++    E+        AR+   +L         LQ +R +
Sbjct: 466 SVAQAAHSQFEQAYQLVRKIAGEVSRSEAWDVARELLRRLREQRHLAEQLQQLRMR 521



 Score = 28.4 bits (64), Expect = 6.8
 Identities = 18/101 (17%), Positives = 41/101 (40%), Gaps = 15/101 (14%)

Query: 222 AREEKNAK-QQQRDKLQLEIAAREKAEKKHQESVERLKQL-------------EVEMAKR 267
           ARE++  + + +R++L    A      +K Q   +   +              E E+ + 
Sbjct: 783 AREKRIEQLRAEREELAERYATLSFDVQKLQRLHQAFSRFIGSHLAVAFEADPEAELRQL 842

Query: 268 DQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQP 308
           ++  +E +  +   E Q +Q ++  E+ +   + L   L P
Sbjct: 843 NRRRVELERALADHESQEQQQRSQLEQAKEGLSALN-RLLP 882



 Score = 28.0 bits (63), Expect = 9.1
 Identities = 24/107 (22%), Positives = 48/107 (44%), Gaps = 17/107 (15%)

Query: 220  AQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIR 279
            A     KN+   ++ + +LE A +E+   +     E+L+Q + ++A+ +Q L   +    
Sbjct: 976  AAEMLAKNSDLNEKLRQRLEQAEQERTRAR-----EQLRQAQAQLAQYNQVLASLKSSYD 1030

Query: 280  RLEEQLKQLQ------------AAKEELEARQTELQLELQPIRNQSN 314
               + L++L+             A+E   AR+ EL   L   R++ N
Sbjct: 1031 AKRQMLQELKQELQDLGVPADSGAEERARARRDELHARLSANRSRRN 1077


>gnl|CDD|241346 cd13192, FERM_C_FRMD3_FRMD5, FERM domain-containing protein 3 and
          5 FERM (FRMD3 and 5) domain C-lobe.  FRMD3 (also called
          Band 4.1-like protein 4O/4.1O though it is not a true
          member of that family) is a novel putative tumor
          suppressor gene that is implicated in the origin and
          progression of lung cancer. In humans there are 5
          isoforms that are produced by alternative splicing.
          Less is known about FRMD5, though there are 2 isoforms
          of the human protein are produced by alternative
          splicing. Both FRMD3 and FRMD5 contain a N-terminal
          FERM domain, followed by a FERM adjacent (FA) domain,
          and 4.1 protein C-terminal domain (CTD). The FERM
          domain has a cloverleaf tripart structure (FERM_N,
          FERM_M, FERM_C/N, alpha-, and C-lobe/A-lobe, B-lobe,
          C-lobe/F1, F2, F3). The C-lobe/F3 within the FERM
          domain is part of the PH domain family. The FERM domain
          is found in the cytoskeletal-associated proteins such
          as ezrin, moesin, radixin, 4.1R, and merlin. These
          proteins provide a link between the membrane and
          cytoskeleton and are involved in signal transduction
          pathways. The FERM domain is also found in protein
          tyrosine phosphatases (PTPs) , the tyrosine kinases FAK
          and JAK, in addition to other proteins involved in
          signaling. This domain is structurally similar to the
          PH and PTB domains and consequently is capable of
          binding to both peptides and phospholipids at different
          sites.
          Length = 105

 Score = 39.7 bits (93), Expect = 2e-04
 Identities = 15/42 (35%), Positives = 26/42 (61%)

Query: 38 AMMEYLKIAQDLEMYGVNYFEIKNKKGTELWLGVDALGLNIY 79
          A + +L+ A  LE YGV+   +K+  GT+L+LG    G+ ++
Sbjct: 1  AELNFLRKAHTLETYGVDPHPVKDHYGTQLYLGFTHTGVVVF 42


>gnl|CDD|234173 TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB.  Members
           of this protein family are the bacterial ATP-dependent
           chaperone ClpB. This protein belongs to the AAA family,
           ATPases associated with various cellular activities
           (pfam00004). This molecular chaperone does not act as a
           protease, but rather serves to disaggregate misfolded
           and aggregated proteins [Protein fate, Protein folding
           and stabilization].
          Length = 852

 Score = 42.6 bits (101), Expect = 2e-04
 Identities = 38/120 (31%), Positives = 68/120 (56%), Gaps = 13/120 (10%)

Query: 207 RRKPD-TID-VQQMKAQAREEKNAKQQQRDKL-----QLEIAAREKAEK-KHQESVERLK 258
           R  PD  ID + +  A+ R E ++K ++ D+L     QLEI  RE  +K K + S ERL+
Sbjct: 379 RFLPDKAIDLIDEAAARIRMEIDSKPEELDELDRRIIQLEIE-REALKKEKDEASKERLE 437

Query: 259 QLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNLNKV 318
            LE E+A+ +++  + +E  +  +  ++ +Q  KEE+E    +++LEL+    + +L K 
Sbjct: 438 DLEKELAELEEEYADLEEQWKAEKAAIQGIQQIKEEIE----QVRLELEQAEREGDLAKA 493



 Score = 39.2 bits (92), Expect = 0.003
 Identities = 34/119 (28%), Positives = 61/119 (51%), Gaps = 28/119 (23%)

Query: 209 KPDTID-----VQQMK----AQAREEKNAKQQQRDKLQLEIAARE----------KAEK- 248
           KP+ +D     + Q++    A  +E+  A +++ + L+ E+A  E          KAEK 
Sbjct: 403 KPEELDELDRRIIQLEIEREALKKEKDEASKERLEDLEKELAELEEEYADLEEQWKAEKA 462

Query: 249 ------KHQESVERLKQLEVEMAKRDQDLMEAQE-MIRRLEEQLKQLQAAKEELEARQT 300
                 + +E +E+++ LE+E A+R+ DL +A E    +L E  K+LQAA+ +L     
Sbjct: 463 AIQGIQQIKEEIEQVR-LELEQAEREGDLAKAAELQYGKLPELEKRLQAAEAKLGEETK 520


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 42.0 bits (99), Expect = 4e-04
 Identities = 24/105 (22%), Positives = 50/105 (47%), Gaps = 10/105 (9%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
            +   K    +E    Q+QR  L+ +I + EK           ++ L  +  + +++L E
Sbjct: 823 RLTLEKEYLEKEIQELQEQRIDLKEQIKSIEK----------EIENLNGKKEELEEELEE 872

Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNLNKV 318
            +  +R LE +L  L+  ++ELEA+  EL+ +++ +  Q    + 
Sbjct: 873 LEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRK 917



 Score = 41.6 bits (98), Expect = 4e-04
 Identities = 24/104 (23%), Positives = 48/104 (46%), Gaps = 11/104 (10%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEI-----------AAREKAEKKHQESVERLKQLEV 262
           D ++   + + E N  +++ D+LQ E+           AA    E K  E  E  +   +
Sbjct: 389 DYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKAL 448

Query: 263 EMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLEL 306
           E+ K++  L +    + + E++L  L+   + +E   ++LQ EL
Sbjct: 449 EIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQREL 492



 Score = 40.1 bits (94), Expect = 0.001
 Identities = 19/99 (19%), Positives = 47/99 (47%), Gaps = 4/99 (4%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREK----AEKKHQESVERLKQLEVEMAKRDQ 269
            +Q    +     +   Q+      +I   EK     E++ ++  ERL++LE +++  +Q
Sbjct: 692 SLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQ 751

Query: 270 DLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQP 308
           ++   +  ++ LE ++++L+    +LE    +L+  L  
Sbjct: 752 EIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSH 790



 Score = 38.9 bits (91), Expect = 0.004
 Identities = 16/109 (14%), Positives = 40/109 (36%), Gaps = 16/109 (14%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAARE----------------KAEKKHQESVERL 257
           +V+    +        ++   KL+  +   E                K E++      RL
Sbjct: 755 NVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARL 814

Query: 258 KQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLEL 306
           +++E ++ +   +    ++ I+ L+EQ   L+   + +E     L  + 
Sbjct: 815 REIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKK 863



 Score = 38.1 bits (89), Expect = 0.006
 Identities = 27/104 (25%), Positives = 54/104 (51%), Gaps = 10/104 (9%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQL--EVEMAKRDQD- 270
           + ++ + +  EE    +++ + L+ E+   E+ +K+  E+ + LK    ++E  KR+ + 
Sbjct: 347 EERKRRDKLTEEYAELKEELEDLRAEL---EEVDKEFAETRDELKDYREKLEKLKREINE 403

Query: 271 ----LMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIR 310
               L   QE ++RL E+L  L AA   +EA+  EL+ E +   
Sbjct: 404 LKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKEDKA 447



 Score = 37.4 bits (87), Expect = 0.011
 Identities = 21/96 (21%), Positives = 48/96 (50%), Gaps = 3/96 (3%)

Query: 217 QMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQE 276
           ++K    EE+   +++  +L+ EI      E+   E    L+  E  +AK + ++ +   
Sbjct: 280 KIKDLGEEEQLRVKEKIGELEAEI---ASLERSIAEKERELEDAEERLAKLEAEIDKLLA 336

Query: 277 MIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
            I  LE ++++ +  +++L     EL+ EL+ +R +
Sbjct: 337 EIEELEREIEEERKRRDKLTEEYAELKEELEDLRAE 372



 Score = 37.0 bits (86), Expect = 0.013
 Identities = 25/101 (24%), Positives = 51/101 (50%), Gaps = 3/101 (2%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
            +++  +  + E    + + D+L         A +K  E  + ++QLE E  K  + L E
Sbjct: 685 GLKRELSSLQSELRRIENRLDELS---QELSDASRKIGEIEKEIEQLEQEEEKLKERLEE 741

Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSN 314
            +E +  LE++++ +++  +ELEAR  EL+ +L  +    N
Sbjct: 742 LEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALN 782



 Score = 36.6 bits (85), Expect = 0.019
 Identities = 24/116 (20%), Positives = 54/116 (46%), Gaps = 2/116 (1%)

Query: 186 RVRVNKRILALCMGNHELYMRRRKPDTI--DVQQMKAQAREEKNAKQQQRDKLQLEIAAR 243
           R  +      L     E+   +R+ D +  ++Q++  +  +   A      K+      +
Sbjct: 384 RDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEK 443

Query: 244 EKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQ 299
           E    + ++   +L+QL  +++K +Q+L + +E   R+E++L +LQ    E EA+ 
Sbjct: 444 EDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAEAEAQA 499



 Score = 34.3 bits (79), Expect = 0.083
 Identities = 18/93 (19%), Positives = 43/93 (46%)

Query: 220 AQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIR 279
           A+  E      + RD+L+      EK +++  E    L +L+ E+ +  ++L +    I 
Sbjct: 371 AELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIA 430

Query: 280 RLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
            +E ++ +L+  KE+      + + +L+ +   
Sbjct: 431 GIEAKINELEEEKEDKALEIKKQEWKLEQLAAD 463



 Score = 32.3 bits (74), Expect = 0.39
 Identities = 20/97 (20%), Positives = 42/97 (43%), Gaps = 1/97 (1%)

Query: 219 KAQAREEKNAKQQQRDKLQLEIA-AREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEM 277
           K +A EE    ++  ++L L I   R++ E+  +E  +  +   +   KR+ +  E  + 
Sbjct: 172 KEKALEELEEVEENIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYEGYELLKE 231

Query: 278 IRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSN 314
              LE Q + ++     LE    +L  E+  +  +  
Sbjct: 232 KEALERQKEAIERQLASLEEELEKLTEEISELEKRLE 268



 Score = 32.3 bits (74), Expect = 0.43
 Identities = 27/123 (21%), Positives = 58/123 (47%), Gaps = 8/123 (6%)

Query: 194 LALCMGNHELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAA----REKAEKK 249
           +A    +     R  +    D ++  A+   E +    + ++L+ EI      R+K  ++
Sbjct: 303 IASLERSIAEKERELE----DAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEE 358

Query: 250 HQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPI 309
           + E  E L+ L  E+ + D++  E ++ ++   E+L++L+    EL+     LQ ELQ +
Sbjct: 359 YAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRL 418

Query: 310 RNQ 312
             +
Sbjct: 419 SEE 421



 Score = 32.0 bits (73), Expect = 0.47
 Identities = 18/84 (21%), Positives = 39/84 (46%), Gaps = 5/84 (5%)

Query: 224 EEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEE 283
           +E    + +  +L+  I   E+   K +E++  L+            + E Q  + +LEE
Sbjct: 751 QEIENVKSELKELEARIEELEEDLHKLEEALNDLEARL-----SHSRIPEIQAELSKLEE 805

Query: 284 QLKQLQAAKEELEARQTELQLELQ 307
           ++ +++A   E+E +   L LE +
Sbjct: 806 EVSRIEARLREIEQKLNRLTLEKE 829



 Score = 30.0 bits (68), Expect = 1.8
 Identities = 15/98 (15%), Positives = 41/98 (41%), Gaps = 5/98 (5%)

Query: 220 AQAREEKNAKQQQRDKLQLEIAAREK-----AEKKHQESVERLKQLEVEMAKRDQDLMEA 274
           A   EE     ++  +L+  +   E+      +K      E   +++ ++ + + ++   
Sbjct: 247 ASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASL 306

Query: 275 QEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
           +  I   E +L+  +    +LEA   +L  E++ +  +
Sbjct: 307 ERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELERE 344



 Score = 30.0 bits (68), Expect = 1.9
 Identities = 21/80 (26%), Positives = 46/80 (57%), Gaps = 4/80 (5%)

Query: 225 EKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRL-EE 283
           EK A ++Q++ ++ ++A+    E++ ++  E + +LE  + + +Q L E  + I+ L EE
Sbjct: 231 EKEALERQKEAIERQLAS---LEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEE 287

Query: 284 QLKQLQAAKEELEARQTELQ 303
           +  +++    ELEA    L+
Sbjct: 288 EQLRVKEKIGELEAEIASLE 307



 Score = 29.3 bits (66), Expect = 3.6
 Identities = 20/103 (19%), Positives = 49/103 (47%), Gaps = 13/103 (12%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLE------VEMAKR 267
           +  +++AQ RE +   ++   +++ +     + + K +   E L ++E       E+ + 
Sbjct: 890 ERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEE 949

Query: 268 DQDLMEAQEMIRRLEEQLKQLQAAK-------EELEARQTELQ 303
           +  L + Q  ++R+EE+++ L+          EE+  R  EL+
Sbjct: 950 ELSLEDVQAELQRVEEEIRALEPVNMLAIQEYEEVLKRLDELK 992



 Score = 28.9 bits (65), Expect = 4.9
 Identities = 18/80 (22%), Positives = 39/80 (48%), Gaps = 8/80 (10%)

Query: 231 QQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQA 290
            +R K+  EIA   + ++K ++++E L+++E  +            +I    +QL++L+ 
Sbjct: 153 VERRKIIDEIAGVAEFDRKKEKALEELEEVEENIE-------RLDLIIDEKRQQLERLRR 205

Query: 291 AKEELEARQTELQLELQPIR 310
            +E+ E R   L  E +   
Sbjct: 206 EREKAE-RYQALLKEKREYE 224



 Score = 28.5 bits (64), Expect = 6.6
 Identities = 16/88 (18%), Positives = 42/88 (47%), Gaps = 3/88 (3%)

Query: 220 AQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIR 279
            + RE     +++   LQ E+   E    +  + +    + ++   +++ + +E +E   
Sbjct: 677 QRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASR-KIGEIEKEIEQLEQEE--E 733

Query: 280 RLEEQLKQLQAAKEELEARQTELQLELQ 307
           +L+E+L++L+     LE     ++ EL+
Sbjct: 734 KLKERLEELEEDLSSLEQEIENVKSELK 761


>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
           [General function prediction only].
          Length = 239

 Score = 40.8 bits (96), Expect = 4e-04
 Identities = 23/120 (19%), Positives = 55/120 (45%), Gaps = 9/120 (7%)

Query: 204 YMRRRKPDTIDVQQMKAQ---AREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQL 260
              R K     +++ KA+     +   A + + + L+ +++  E   ++ +E ++R ++ 
Sbjct: 22  LEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEEK 81

Query: 261 ------EVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSN 314
                 E E+   + ++  A+E I  LE++L +L    E+LE    +L+  L+ +     
Sbjct: 82  LSAVKDERELRALNIEIQIAKERINSLEDELAELMEEIEKLEKEIEDLKERLERLEKNLA 141


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 41.7 bits (98), Expect = 4e-04
 Identities = 24/100 (24%), Positives = 51/100 (51%)

Query: 219 KAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMI 278
              A EE   K ++ +        R + E++ +E +E L+QLE E+ +  ++L E  + +
Sbjct: 652 LQAALEELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLREELEELLKKL 711

Query: 279 RRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNLNKV 318
             +E+ +++L++ K ELE  + EL+   + +     L + 
Sbjct: 712 GEIEQLIEELESRKAELEELKKELEKLEKALELLEELREK 751



 Score = 37.8 bits (88), Expect = 0.006
 Identities = 20/107 (18%), Positives = 56/107 (52%)

Query: 200 NHELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQ 259
             +  +   K   ++++ ++ +A + +  + ++ ++L  E+  + +  ++ +  +E L++
Sbjct: 246 EEKERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEE 305

Query: 260 LEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLEL 306
               +    ++L E  E ++ LEE+L++L+   E+LE+   EL  E 
Sbjct: 306 ELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEK 352



 Score = 37.8 bits (88), Expect = 0.006
 Identities = 27/111 (24%), Positives = 57/111 (51%), Gaps = 6/111 (5%)

Query: 201 HELYMRRRKPDTIDVQQMKAQAREEKNAKQQ----QRDKLQLEIAAREKAEKKHQESVER 256
            EL  R  + ++++++ +K +  E +  ++     +    +LE   RE  E + +   E 
Sbjct: 252 EELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEEEL--EG 309

Query: 257 LKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
           L+ L  E+ +  + L   +E + +LEE+L++L++  EEL   + EL   L+
Sbjct: 310 LRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLE 360



 Score = 36.3 bits (84), Expect = 0.021
 Identities = 27/90 (30%), Positives = 51/90 (56%), Gaps = 1/90 (1%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAR-EKAEKKHQESVERLKQLEVEMAKRDQDLM 272
            ++    +  EEKN   +  ++   E+  R E+ EK+ ++++ERLKQLE  + +  ++L 
Sbjct: 340 KLESELEELAEEKNELAKLLEERLKELEERLEELEKELEKALERLKQLEEAIQELKEELA 399

Query: 273 EAQEMIRRLEEQLKQLQAAKEELEARQTEL 302
           E    +  ++E+L++L+   EELE    EL
Sbjct: 400 ELSAALEEIQEELEELEKELEELERELEEL 429



 Score = 35.9 bits (83), Expect = 0.028
 Identities = 24/108 (22%), Positives = 50/108 (46%), Gaps = 11/108 (10%)

Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVE-----------M 264
           ++     R      ++  +KL+      EK E+K ++    L++L  E           +
Sbjct: 304 EEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERL 363

Query: 265 AKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
            + ++ L E ++ + +  E+LKQL+ A +EL+    EL   L+ I+ +
Sbjct: 364 KELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEE 411



 Score = 35.5 bits (82), Expect = 0.033
 Identities = 24/85 (28%), Positives = 45/85 (52%)

Query: 223 REEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLE 282
            EEK   ++ + +L    +   +A K  +E +  L++L  E+ ++ + L E +  I  LE
Sbjct: 245 EEEKERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELE 304

Query: 283 EQLKQLQAAKEELEARQTELQLELQ 307
           E+L+ L+A  EELE    +L+   +
Sbjct: 305 EELEGLRALLEELEELLEKLKSLEE 329



 Score = 35.5 bits (82), Expect = 0.038
 Identities = 26/102 (25%), Positives = 56/102 (54%), Gaps = 3/102 (2%)

Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMA---KRDQDLM 272
           +++KA+  E ++ + +     + E+   E+  ++ +E +ERL++LE E+    +  + L 
Sbjct: 252 EELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEEELEGLR 311

Query: 273 EAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSN 314
              E +  L E+LK L+   E+LE +  +L+ EL+ +  + N
Sbjct: 312 ALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEKN 353



 Score = 34.7 bits (80), Expect = 0.066
 Identities = 26/95 (27%), Positives = 53/95 (55%), Gaps = 1/95 (1%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVE-RLKQLEVEMAKRDQDLM 272
            +++   +  E+    + + ++L  E     K  ++  + +E RL++LE E+ K  + L 
Sbjct: 326 SLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEERLEELEKELEKALERLK 385

Query: 273 EAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
           + +E I+ L+E+L +L AA EE++    EL+ EL+
Sbjct: 386 QLEEAIQELKEELAELSAALEEIQEELEELEKELE 420



 Score = 34.0 bits (78), Expect = 0.11
 Identities = 23/97 (23%), Positives = 49/97 (50%), Gaps = 2/97 (2%)

Query: 223 REEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLE 282
            ++   + +Q +    E+    +  +  +   E L++L   + +  + L E +E + +LE
Sbjct: 554 LQQLKEELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKELEERLSQLE 613

Query: 283 EQLKQLQA--AKEELEARQTELQLELQPIRNQSNLNK 317
           E L+ L+   A+ ELE  + EL+ EL+ +  Q+ L +
Sbjct: 614 ELLQSLELSEAENELEEAEEELESELEKLNLQAELEE 650



 Score = 32.0 bits (73), Expect = 0.47
 Identities = 23/83 (27%), Positives = 48/83 (57%), Gaps = 4/83 (4%)

Query: 229 KQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEA----QEMIRRLEEQ 284
           ++Q+ ++L+ EI A E+   + +E  ERL++L+  + + +   +EA    +E +R LE  
Sbjct: 223 EEQEEEELEQEIEALEERLAELEEEKERLEELKARLLEIESLELEALKIREEELRELERL 282

Query: 285 LKQLQAAKEELEARQTELQLELQ 307
           L++L+   E LE  + E++   +
Sbjct: 283 LEELEEKIERLEELEREIEELEE 305



 Score = 32.0 bits (73), Expect = 0.47
 Identities = 21/91 (23%), Positives = 46/91 (50%), Gaps = 1/91 (1%)

Query: 222 AREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRL 281
              E      Q +  +L  AA E+ E+K +E    +++    +   +Q   + +E + +L
Sbjct: 635 LESELEKLNLQAELEELLQAALEELEEKVEELEAEIRRELQRIENEEQLEEKLEE-LEQL 693

Query: 282 EEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
           EE+L+QL+   EEL  +  E++  ++ + ++
Sbjct: 694 EEELEQLREELEELLKKLGEIEQLIEELESR 724



 Score = 31.7 bits (72), Expect = 0.68
 Identities = 20/84 (23%), Positives = 44/84 (52%)

Query: 222 AREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRL 281
            +E     + + ++L+ E++  ++  +  +E  E  K+L     +  + L   + +   L
Sbjct: 472 EKELLELYELELEELEEELSREKEEAELREEIEELEKELRELEEELIELLELEEALKEEL 531

Query: 282 EEQLKQLQAAKEELEARQTELQLE 305
           EE+L++L+   EELE  + +LQL+
Sbjct: 532 EEKLEKLENLLEELEELKEKLQLQ 555



 Score = 31.3 bits (71), Expect = 0.87
 Identities = 25/111 (22%), Positives = 57/111 (51%), Gaps = 4/111 (3%)

Query: 207 RRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAK 266
                ++++ + + +  E +   + + +KL L+    E  +   +E  E++++LE E+ +
Sbjct: 613 EELLQSLELSEAENELEEAEEELESELEKLNLQAELEELLQAALEELEEKVEELEAEIRR 672

Query: 267 RDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNLNK 317
             Q +    E   +LEE+L++L+  +EELE  + EL+  L+ +     L +
Sbjct: 673 ELQRI----ENEEQLEEKLEELEQLEEELEQLREELEELLKKLGEIEQLIE 719



 Score = 30.9 bits (70), Expect = 0.94
 Identities = 20/92 (21%), Positives = 47/92 (51%)

Query: 206 RRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMA 265
            R K     +++++ +  +     +Q  + +Q       +     +E  E L++LE E+ 
Sbjct: 361 ERLKELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELE 420

Query: 266 KRDQDLMEAQEMIRRLEEQLKQLQAAKEELEA 297
           + +++L E +E I++LEEQ+ QL++ +  +  
Sbjct: 421 ELERELEELEEEIKKLEEQINQLESKELMIAE 452



 Score = 30.1 bits (68), Expect = 2.0
 Identities = 14/90 (15%), Positives = 43/90 (47%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
           +++ ++ +  E +  K++  +     +       +  +   E L++LE  + + ++ +  
Sbjct: 233 EIEALEERLAELEEEKERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIER 292

Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQTELQ 303
            +E+ R +EE  ++L+  +  LE  +  L+
Sbjct: 293 LEELEREIEELEEELEGLRALLEELEELLE 322



 Score = 29.7 bits (67), Expect = 2.1
 Identities = 19/101 (18%), Positives = 43/101 (42%), Gaps = 1/101 (0%)

Query: 207 RRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAK 266
           R +   +  +  + + R  +  +  Q  +L       E+AE++ +  +E+L         
Sbjct: 592 RERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEEELESELEKLNLQAELEEL 651

Query: 267 RDQDLMEAQEMIRRLEEQLKQ-LQAAKEELEARQTELQLEL 306
               L E +E +  LE ++++ LQ  + E +  +   +LE 
Sbjct: 652 LQAALEELEEKVEELEAEIRRELQRIENEEQLEEKLEELEQ 692



 Score = 29.0 bits (65), Expect = 4.7
 Identities = 19/87 (21%), Positives = 42/87 (48%)

Query: 219 KAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMI 278
                 E    +++  + + E   RE+ E+  +E  E  ++L   +   +    E +E +
Sbjct: 476 LELYELELEELEEELSREKEEAELREEIEELEKELRELEEELIELLELEEALKEELEEKL 535

Query: 279 RRLEEQLKQLQAAKEELEARQTELQLE 305
            +LE  L++L+  KE+L+ +Q + +L 
Sbjct: 536 EKLENLLEELEELKEKLQLQQLKEELR 562



 Score = 28.2 bits (63), Expect = 6.8
 Identities = 25/122 (20%), Positives = 53/122 (43%), Gaps = 9/122 (7%)

Query: 188 RVNKRILALCMGNHELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAA--REK 245
            +  R+L +     E    R +      + ++    + +  ++ +R+  +LE        
Sbjct: 253 ELKARLLEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEEELEGLRA 312

Query: 246 AEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLE 305
             ++ +E +E+LK LE       + L + +E + +LE +L++L   K EL     E   E
Sbjct: 313 LLEELEELLEKLKSLE-------ERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKE 365

Query: 306 LQ 307
           L+
Sbjct: 366 LE 367



 Score = 28.2 bits (63), Expect = 6.9
 Identities = 23/118 (19%), Positives = 49/118 (41%)

Query: 190 NKRILALCMGNHELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKK 249
            + I  L     E     +K + I  +Q + +  +E  A +++  +L+ E    E+ + +
Sbjct: 198 LEDIEDLLEALEEELKELKKLEEIQEEQEEEELEQEIEALEERLAELEEEKERLEELKAR 257

Query: 250 HQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
             E      +      +  ++L    E +    E+L++L+   EELE     L+  L+
Sbjct: 258 LLEIESLELEALKIREEELRELERLLEELEEKIERLEELEREIEELEEELEGLRALLE 315


>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
           with the myosin-like domain [Function unknown].
          Length = 499

 Score = 40.8 bits (95), Expect = 7e-04
 Identities = 23/100 (23%), Positives = 49/100 (49%), Gaps = 3/100 (3%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
           D++      R E    Q ++   + E   RE A  + Q++ +  + +  E+A   Q+L +
Sbjct: 78  DIRPQLRALRTELGTAQGEKRAAETE---REAARSELQKARQEREAVRQELAAARQNLAK 134

Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQS 313
           AQ+ + RL +Q + LQ   + L  ++ +L+ + Q ++   
Sbjct: 135 AQQELARLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQASQ 174



 Score = 39.6 bits (92), Expect = 0.001
 Identities = 25/112 (22%), Positives = 51/112 (45%), Gaps = 6/112 (5%)

Query: 202 ELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLE 261
           +L +R  +     ++Q         NA Q + ++L    AA ++  +  Q+   ++ Q  
Sbjct: 190 DLKLRSAQ-----IEQEAQNLATRANAAQARTEELARRAAAAQQTAQAIQQRDAQISQKA 244

Query: 262 VEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEA-RQTELQLELQPIRNQ 312
            ++A R + + E +  ++RLE    +L+    +LEA  Q  ++L  Q    Q
Sbjct: 245 QQIAARAEQIRERERQLQRLETAQARLEQEVAQLEAYYQAYVRLRQQAAATQ 296



 Score = 36.2 bits (83), Expect = 0.019
 Identities = 24/139 (17%), Positives = 59/139 (42%), Gaps = 3/139 (2%)

Query: 174 DKKAPDFVFFAPRVRVNKRILALCMGNHELYMRRRKPDTIDVQQMKAQAREEKNAKQQQR 233
           ++     VF    +R   R L   +G  +   R  + +    +    +AR+E+ A +Q+ 
Sbjct: 66  NRNLRSGVFQLDDIRPQLRALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQEL 125

Query: 234 DKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKE 293
              +  +A  ++   +  +  + L+     +    +   + +   + L+   KQLQA+  
Sbjct: 126 AAARQNLAKAQQELARLTKQAQDLQT---RLKTLAEQRRQLEAQAQSLQASQKQLQASAT 182

Query: 294 ELEARQTELQLELQPIRNQ 312
           +L+++  +L+L    I  +
Sbjct: 183 QLKSQVLDLKLRSAQIEQE 201



 Score = 34.2 bits (78), Expect = 0.094
 Identities = 16/103 (15%), Positives = 37/103 (35%)

Query: 205 MRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEM 264
           + R      D+Q       E++   + Q   LQ      + +  + +  V  LK    ++
Sbjct: 139 LARLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQLQASATQLKSQVLDLKLRSAQI 198

Query: 265 AKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
            +  Q+L       +   E+L +  AA ++      +   ++ 
Sbjct: 199 EQEAQNLATRANAAQARTEELARRAAAAQQTAQAIQQRDAQIS 241



 Score = 30.8 bits (69), Expect = 1.1
 Identities = 29/113 (25%), Positives = 39/113 (34%), Gaps = 18/113 (15%)

Query: 215 VQQMKAQAREEKNAKQQ-------QRDKLQLEIAA----REKAEKKHQESVERLKQLEVE 263
           V+Q  A AR+     QQ       Q   LQ  +      R + E + Q      KQL+  
Sbjct: 121 VRQELAAARQNLAKAQQELARLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQASQKQLQAS 180

Query: 264 MAK---RDQDLMEAQEMI----RRLEEQLKQLQAAKEELEARQTELQLELQPI 309
             +   +  DL      I    + L  +    QA  EEL  R    Q   Q I
Sbjct: 181 ATQLKSQVLDLKLRSAQIEQEAQNLATRANAAQARTEELARRAAAAQQTAQAI 233



 Score = 30.0 bits (67), Expect = 2.1
 Identities = 14/96 (14%), Positives = 34/96 (35%), Gaps = 3/96 (3%)

Query: 219 KAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMI 278
           + Q   +  + Q  + +LQ      +      +    +++Q    +A R      AQ   
Sbjct: 160 RRQLEAQAQSLQASQKQLQASATQLKSQVLDLKLRSAQIEQEAQNLATRAN---AAQART 216

Query: 279 RRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSN 314
             L  +    Q   + ++ R  ++  + Q I  ++ 
Sbjct: 217 EELARRAAAAQQTAQAIQQRDAQISQKAQQIAARAE 252


>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein,
           HlyD family.  Type I secretion is an ABC transport
           process that exports proteins, without cleavage of any
           signal sequence, from the cytosol to extracellular
           medium across both inner and outer membranes. The
           secretion signal is found in the C-terminus of the
           transported protein. This model represents the adaptor
           protein between the ATP-binding cassette (ABC) protein
           of the inner membrane and the outer membrane protein,
           and is called the membrane fusion protein. This model
           selects a subfamily closely related to HlyD; it is
           defined narrowly and excludes, for example, colicin V
           secretion protein CvaA and multidrug efflux proteins
           [Protein fate, Protein and peptide secretion and
           trafficking].
          Length = 423

 Score = 40.4 bits (95), Expect = 0.001
 Identities = 28/137 (20%), Positives = 56/137 (40%), Gaps = 8/137 (5%)

Query: 176 KAPDFVFFAPRVRVNKRILALCMGNHELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDK 235
           + PD +  A    V + I     G   L+  R+      ++        +    + +   
Sbjct: 111 EFPDDLLSAEDPAVPELIK----GQQSLFESRK----STLRAQLELILAQIKQLEAELAG 162

Query: 236 LQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEEL 295
           LQ ++ A  +  +   E +E  ++L+ +      +L+E +      + +L +L+A  E L
Sbjct: 163 LQAQLQALRQQLEVISEELEARRKLKEKGLVSRLELLELERERAEAQGELGRLEAELEVL 222

Query: 296 EARQTELQLELQPIRNQ 312
           + +  ELQLE Q I   
Sbjct: 223 KRQIDELQLERQQIEQT 239


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 40.6 bits (96), Expect = 0.001
 Identities = 22/111 (19%), Positives = 52/111 (46%), Gaps = 13/111 (11%)

Query: 206 RRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIA-AREKAEKKHQESVERLKQLEVE- 263
           R  +    + + +  +A + K   +++++KLQ E     E+AEK+ Q++++  K+ E + 
Sbjct: 530 RELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKK-EADE 588

Query: 264 ----------MAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQL 304
                              E  E  +RL +  ++ +  K++ + +Q EL++
Sbjct: 589 IIKELRQLQKGGYASVKAHELIEARKRLNKANEKKEKKKKKQKEKQEELKV 639



 Score = 32.5 bits (75), Expect = 0.35
 Identities = 22/85 (25%), Positives = 42/85 (49%), Gaps = 18/85 (21%)

Query: 238 LEIAAR--------EKAEKKHQES-------VERLKQLEVEMAKRDQDLMEAQEMIRRLE 282
            EIA R        E+A+K   E        +  L++LE E+   +Q   EA+ +++  E
Sbjct: 491 FEIAKRLGLPENIIEEAKKLIGEDKEKLNELIASLEELEREL---EQKAEEAEALLKEAE 547

Query: 283 EQLKQLQAAKEELEARQTELQLELQ 307
           +  ++L+  KE+L+  + +L  E +
Sbjct: 548 KLKEELEEKKEKLQEEEDKLLEEAE 572



 Score = 31.7 bits (73), Expect = 0.52
 Identities = 20/98 (20%), Positives = 47/98 (47%), Gaps = 14/98 (14%)

Query: 211 DTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQD 270
           D   + ++ A    E+  ++ ++   + E A  ++AEK  +E  E+ ++L+ E  K    
Sbjct: 514 DKEKLNELIA--SLEELERELEQKAEEAE-ALLKEAEKLKEELEEKKEKLQEEEDK---- 566

Query: 271 LMEAQEMIRRLEEQLKQ-LQAAKEELEARQTELQLELQ 307
                 ++   E++ +Q ++ AK+E +    EL+   +
Sbjct: 567 ------LLEEAEKEAQQAIKEAKKEADEIIKELRQLQK 598


>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
          Length = 508

 Score = 40.1 bits (94), Expect = 0.001
 Identities = 25/117 (21%), Positives = 57/117 (48%), Gaps = 6/117 (5%)

Query: 203 LYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQ--ESVERLKQL 260
           + +++R+    + +++  +A++E   K +       E+  RE+ +++ +     E L++ 
Sbjct: 23  VLLKKRQRLAKEAERILQEAQKEAEEKLEAALLEAKELLLRERNQQRQEARREREELQRE 82

Query: 261 EVEMAKRDQDLMEAQEMIRRLEEQL----KQLQAAKEELEARQTELQLELQPIRNQS 313
           E  + ++++ L    E +  LE QL    K L A + ELE  + +L  EL  +   +
Sbjct: 83  EERLVQKEEQLDARAEKLDNLENQLEEREKALSARELELEELEKQLDNELYRVAGLT 139



 Score = 30.8 bits (70), Expect = 0.91
 Identities = 21/100 (21%), Positives = 49/100 (49%), Gaps = 2/100 (2%)

Query: 215 VQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEA 274
           V  +K + R  K A++  ++  +      E A  + +E + R +  + + A+R+++  E 
Sbjct: 22  VVLLKKRQRLAKEAERILQEAQKEAEEKLEAALLEAKELLLRERNQQRQEARRERE--EL 79

Query: 275 QEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSN 314
           Q    RL ++ +QL A  E+L+  + +L+   + +  +  
Sbjct: 80  QREEERLVQKEEQLDARAEKLDNLENQLEEREKALSAREL 119


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 39.8 bits (94), Expect = 0.001
 Identities = 25/106 (23%), Positives = 53/106 (50%), Gaps = 13/106 (12%)

Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAARE----KAEKKHQESVERLKQLEVEMAKRDQDL 271
           ++   +A+EE     + R++ + E+  R     K EK+  +  E L +    + KR+++L
Sbjct: 56  KEALLEAKEEI---HKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEEL 112

Query: 272 MEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNLNK 317
            + ++ + + +++L++    +EELE    E   EL+ I   S L  
Sbjct: 113 EKKEKELEQKQQELEKK---EEELEELIEEQLQELERI---SGLTA 152



 Score = 34.4 bits (80), Expect = 0.076
 Identities = 24/97 (24%), Positives = 53/97 (54%), Gaps = 16/97 (16%)

Query: 207 RRKPDTIDVQQMKAQAREEK-NAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMA 265
            RK + ++ ++ + + +E++   KQQ+ +K + E+   E+  ++  + +ER+  L  E A
Sbjct: 99  DRKLELLEKREEELEKKEKELEQKQQELEKKEEEL---EELIEEQLQELERISGLTAEEA 155

Query: 266 KRDQDLM---------EAQEMIRRLEEQLKQLQAAKE 293
           K  + L+         EA  +I+ +EE+ K+ +A K+
Sbjct: 156 K--EILLEKVEEEARHEAAVLIKEIEEEAKE-EADKK 189



 Score = 33.2 bits (77), Expect = 0.17
 Identities = 24/114 (21%), Positives = 54/114 (47%), Gaps = 18/114 (15%)

Query: 204 YMRRRKPDTIDVQQMKAQARE-----EKNAKQQQRDKLQLEIAAREKAEKKHQESVERLK 258
           Y  R+K     +++ + +A+      +K A+  +++ L   + A+E+  K   E      
Sbjct: 22  YFVRKKIAEAKIKEAEEEAKRILEEAKKEAEAIKKEAL---LEAKEEIHKLRNE------ 72

Query: 259 QLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
             E E+ +R  +L + ++ + + EE L +     E LE R+ EL+ + + +  +
Sbjct: 73  -FEKELRERRNELQKLEKRLLQKEENLDR---KLELLEKREEELEKKEKELEQK 122



 Score = 30.1 bits (69), Expect = 1.8
 Identities = 23/78 (29%), Positives = 44/78 (56%), Gaps = 13/78 (16%)

Query: 241 AAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQT 300
            A E+A++  +E+     + E E  K++  L+EA+E I +L  + ++      EL  R+ 
Sbjct: 35  EAEEEAKRILEEA-----KKEAEAIKKEA-LLEAKEEIHKLRNEFEK------ELRERRN 82

Query: 301 ELQ-LELQPIRNQSNLNK 317
           ELQ LE + ++ + NL++
Sbjct: 83  ELQKLEKRLLQKEENLDR 100


>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342).  This
           family of bacterial proteins has no known function. The
           proteins are in the region of 500-600 amino acid
           residues in length.
          Length = 450

 Score = 39.5 bits (93), Expect = 0.002
 Identities = 23/81 (28%), Positives = 45/81 (55%), Gaps = 2/81 (2%)

Query: 235 KLQLEIAAREKAEKKHQESVERLKQL--EVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAK 292
           +L+ E+   E+  K+ +E +E++K+L  ++    R Q   E +E + +L E  ++L    
Sbjct: 331 ELKEELKELEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETKEKLSEEL 390

Query: 293 EELEARQTELQLELQPIRNQS 313
           EELE    EL+ EL+ + ++ 
Sbjct: 391 EELEEELKELKEELESLYSEG 411


>gnl|CDD|221776 pfam12795, MscS_porin, Mechanosensitive ion channel porin domain.
           The small mechanosensitive channel, MscS, is a part of
           the turgor-driven solute efflux system that protects
           bacteria from lysis in the event of osmotic shock. The
           MscS protein alone is sufficient to form a functional
           mechanosensitive channel gated directly by tension in
           the lipid bilayer. The MscS proteins are heptamers of
           three transmembrane subunits with seven converging M3
           domains, and this MscS_porin is towards the N-terminal
           of the molecules. The high concentration of negative
           charges at the extracellular entrance of the pore helps
           select the cations for efflux.
          Length = 239

 Score = 38.8 bits (91), Expect = 0.002
 Identities = 24/86 (27%), Positives = 43/86 (50%), Gaps = 2/86 (2%)

Query: 231 QQRDKLQLEIAAREKAEKKHQESVER--LKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQL 288
           ++  +L+ E+ A +K +      +    L QLE  +A+    L E QE +++   QL +L
Sbjct: 52  KELRELRQELEALKKTDAPVFPELANLSLSQLEQRLAQTLSQLQELQEQLQQENSQLIEL 111

Query: 289 QAAKEELEARQTELQLELQPIRNQSN 314
           Q   E  + + +E +  LQ IRN+  
Sbjct: 112 QTRPERAQQQLSEARRRLQEIRNRLQ 137



 Score = 30.3 bits (69), Expect = 1.0
 Identities = 23/93 (24%), Positives = 43/93 (46%), Gaps = 9/93 (9%)

Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAARE---KAEKKHQESVERLKQLEVEMAKRDQDLM 272
            +++A +       Q QR  LQ E+AA +   +  +    S    ++L     +  +DL+
Sbjct: 134 NRLQALSPGGTPLAQAQRTLLQAELAALKAQIEELELELLSNNNRQEL----LRLQRDLL 189

Query: 273 EAQEMIRRLEEQLKQLQAAKEELEARQTELQLE 305
           + +  I RLE +L+ LQ A       ++E  +E
Sbjct: 190 KKR--IERLEAELQALQNAINRKRLAESEQAVE 220


>gnl|CDD|225638 COG3096, MukB, Uncharacterized protein involved in chromosome
           partitioning [Cell division and chromosome
           partitioning].
          Length = 1480

 Score = 39.5 bits (92), Expect = 0.002
 Identities = 25/109 (22%), Positives = 52/109 (47%), Gaps = 6/109 (5%)

Query: 202 ELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEI---AAREKAEKKHQESVERLK 258
           E   +  +     +    + ARE++ A +Q++++LQ  I     R       Q ++E+L 
Sbjct: 556 EALHQELEALIESLSDSVSNAREQRMALRQEQEQLQSRIQSLMQRAPVWLAAQNALEQLS 615

Query: 259 QLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
           +   E     QD+    E +++L E+ ++    ++EL AR+  L  E++
Sbjct: 616 EQSGEEFTDSQDVT---EYMQQLLEREREATVERDELGARKNALDEEIE 661



 Score = 29.9 bits (67), Expect = 2.1
 Identities = 30/125 (24%), Positives = 49/125 (39%), Gaps = 16/125 (12%)

Query: 204 YMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEI-AAREKAEKKHQES-VERLKQLE 261
           YMR      + + Q     RE   ++QQ   +    +  +RE AE    E  +E   Q  
Sbjct: 274 YMRHANERRVHLDQALEFRRELYTSRQQLAAEQYRHVDMSRELAELNGAEGDLEADYQAA 333

Query: 262 VEMAKRDQDLMEAQEMIRRLEEQLKQLQA--------------AKEELEARQTELQLELQ 307
            +     Q  +  QE I R +  L++L                 +EE EAR    +LE+ 
Sbjct: 334 SDHLNLVQTALRQQEKIERYQADLEELTIRLEEQNEVVEEANERQEENEARAEAAELEVD 393

Query: 308 PIRNQ 312
            +++Q
Sbjct: 394 ELKSQ 398



 Score = 29.1 bits (65), Expect = 3.9
 Identities = 21/94 (22%), Positives = 43/94 (45%), Gaps = 2/94 (2%)

Query: 216  QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
            +  + +ARE+    Q Q  +    +A+ + +    +E +  L+Q   ++  R     E +
Sbjct: 997  EAERTRAREQLRQHQAQLSQYNQVLASLKSSYDTKKELLNELQQELQDIGVRADSGAEER 1056

Query: 276  EMIRR--LEEQLKQLQAAKEELEARQTELQLELQ 307
              IRR  L  QL   ++ + +LE + T  + E+ 
Sbjct: 1057 ARIRRDELHAQLSTNRSRRNQLEKQLTFCEAEMD 1090


>gnl|CDD|241342 cd13188, FERM_C_PTPN14_PTPN21, Protein tyrosine phosphatase
          non-receptor proteins 14 and 21 (PTPN14 and 21) FERM
          domain.  This CD contains PTP members: pez/PTPN14 and
          PTPN21. A number of mutations in Pez have been shown to
          be associated with breast and colorectal cancer. The
          PTPN protein family belong to larger family of PTPs.
          PTPs are known to be signaling molecules that regulate
          a variety of cellular processes including cell growth,
          differentiation, mitotic cycle, and oncogenic
          transformation. The members are composed of a
          N-terminal FERM domain and a C-terminal PTP catalytic
          domain. The FERM domain has a cloverleaf tripart
          structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
          C-lobe/A-lobe, B-lobe, C-lobe/F1, F2, F3). The
          C-lobe/F3 within the FERM domain is part of the PH
          domain family. Like most other ERM members they have a
          phosphoinositide-binding site in their FERM domain. The
          FERM C domain is the third structural domain within the
          FERM domain. The FERM domain is found in the
          cytoskeletal-associated proteins such as ezrin, moesin,
          radixin, 4.1R, and merlin. These proteins provide a
          link between the membrane and cytoskeleton and are
          involved in signal transduction pathways. The FERM
          domain is also found in protein tyrosine phosphatases
          (PTPs) , the tyrosine kinases FAK and JAK, in addition
          to other proteins involved in signaling. This domain is
          structurally similar to the PH and PTB domains and
          consequently is capable of binding to both peptides and
          phospholipids at different sites.
          Length = 111

 Score = 36.9 bits (86), Expect = 0.002
 Identities = 13/53 (24%), Positives = 28/53 (52%), Gaps = 2/53 (3%)

Query: 32 GMLREDAMMEYLKIAQDLEMYGVNYFEIKNKKGTELWLGVDALGLNIYEKEDN 84
          G+   +A + Y++  + L+ YG   +  K+ +G ++++G    G  I+ K  N
Sbjct: 1  GLSPAEAELLYIQEVEQLDGYGEESYPAKDNQGNDIFIGASFEG--IFVKHKN 51



 Score = 32.7 bits (75), Expect = 0.069
 Identities = 15/52 (28%), Positives = 25/52 (48%), Gaps = 1/52 (1%)

Query: 153 FPWSEIRNISFNDRKFIIKPIDKKAPDFVFFAPRVRVNKRILALCMGNHELY 204
           F W++I NIS N   F ++ I+K+     F        K +  +C+  H+ Y
Sbjct: 58  FKWNDIGNISHNKSFFSVELINKEET-IQFQMEDAETAKYVWRMCVARHKFY 108


>gnl|CDD|236942 PRK11637, PRK11637, AmiB activator; Provisional.
          Length = 428

 Score = 38.9 bits (91), Expect = 0.002
 Identities = 21/78 (26%), Positives = 46/78 (58%), Gaps = 4/78 (5%)

Query: 230 QQQRDKLQLEIAAREKA--EKKHQES--VERLKQLEVEMAKRDQDLMEAQEMIRRLEEQL 285
           + Q   +Q +IAA+EK+  +++ Q +  + +LK+ E  +++  + L E Q  + +L +Q+
Sbjct: 46  RDQLKSIQQDIAAKEKSVRQQQQQRASLLAQLKKQEEAISQASRKLRETQNTLNQLNKQI 105

Query: 286 KQLQAAKEELEARQTELQ 303
            +L A+  +LE +Q   +
Sbjct: 106 DELNASIAKLEQQQAAQE 123



 Score = 33.9 bits (78), Expect = 0.089
 Identities = 24/89 (26%), Positives = 43/89 (48%), Gaps = 3/89 (3%)

Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
           +++ AQ  E +  + QQ+  L  + A ++K E+   E  + L  LE  + K  Q L E +
Sbjct: 180 EELAAQKAELEEKQSQQKTLLYEQQAQQQKLEQARNERKKTLTGLESSLQKDQQQLSELR 239

Query: 276 EMIRRLEEQLKQLQA---AKEELEARQTE 301
               RL + + + +    A+ E EAR+  
Sbjct: 240 ANESRLRDSIARAEREAKARAEREAREAA 268



 Score = 29.3 bits (66), Expect = 2.8
 Identities = 23/86 (26%), Positives = 44/86 (51%), Gaps = 7/86 (8%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
            +QQ  A   +    +QQQR  L  ++  +E+A     ++  +L++ +  + + ++ + E
Sbjct: 51  SIQQDIAAKEKSVRQQQQQRASLLAQLKKQEEAI---SQASRKLRETQNTLNQLNKQIDE 107

Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQ 299
               I +LE+Q    QAA+E L A Q
Sbjct: 108 LNASIAKLEQQ----QAAQERLLAAQ 129



 Score = 28.5 bits (64), Expect = 4.5
 Identities = 21/67 (31%), Positives = 32/67 (47%), Gaps = 2/67 (2%)

Query: 251 QESVERLKQLEVEMAKRDQDLMEAQEMIRR-LEEQLKQLQAAKEELEARQTEL-QLELQP 308
           QE++  LKQ   E+A +  +L E Q   +  L EQ  Q Q  ++    R+  L  LE   
Sbjct: 169 QETIAELKQTREELAAQKAELEEKQSQQKTLLYEQQAQQQKLEQARNERKKTLTGLESSL 228

Query: 309 IRNQSNL 315
            ++Q  L
Sbjct: 229 QKDQQQL 235


>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
          Length = 1113

 Score = 39.1 bits (92), Expect = 0.003
 Identities = 32/112 (28%), Positives = 46/112 (41%), Gaps = 4/112 (3%)

Query: 207 RRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAK 266
           R+K +T  ++Q  AQA  +    Q + + L+         E +   S   L+QLE  +A+
Sbjct: 77  RQKEETEQLKQQLAQAPAKLRQAQAELEALK----DDNDEETRETLSTLSLRQLESRLAQ 132

Query: 267 RDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNLNKV 318
               L  AQ  +     QL  LQ   E  +A        LQ IRN     KV
Sbjct: 133 TLDQLQNAQNDLAEYNSQLVSLQTQPERAQAALYANSQRLQQIRNLLKGGKV 184



 Score = 32.2 bits (74), Expect = 0.46
 Identities = 23/125 (18%), Positives = 47/125 (37%), Gaps = 33/125 (26%)

Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
            + K    E+K  +Q     L L +   ++ ++         +QL+ ++A+    L +AQ
Sbjct: 49  NKQKLLEAEDKLVQQDLEQTLAL-LDKIDRQKE-------ETEQLKQQLAQAPAKLRQAQ 100

Query: 276 EMIRRLEEQ-------------LKQLQAAKEELEARQTELQLEL------------QPIR 310
             +  L++              L+QL++   +   +    Q +L            QP R
Sbjct: 101 AELEALKDDNDEETRETLSTLSLRQLESRLAQTLDQLQNAQNDLAEYNSQLVSLQTQPER 160

Query: 311 NQSNL 315
            Q+ L
Sbjct: 161 AQAAL 165



 Score = 30.3 bits (69), Expect = 1.7
 Identities = 26/95 (27%), Positives = 38/95 (40%), Gaps = 13/95 (13%)

Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
             +K      K  +  QR  LQ E A         ++S+E   QL        QDL++ Q
Sbjct: 177 NLLKGGKVGGKALRPSQRVLLQAEQALLNAQNDLQRKSLEGNTQL--------QDLLQKQ 228

Query: 276 -----EMIRRLEEQLKQLQAAKEELEARQTELQLE 305
                  I+RLE QL+ LQ A        +E  ++
Sbjct: 229 RDYLTARIQRLEHQLQLLQEAINSKRLTLSEKTVQ 263



 Score = 28.7 bits (65), Expect = 4.9
 Identities = 27/107 (25%), Positives = 44/107 (41%), Gaps = 21/107 (19%)

Query: 219 KAQAREEKNAKQQQRDKLQLEIAAREKAEK--KHQESVERLKQLEVEMAKRDQDLMEAQE 276
           +AQA    N+++ Q      +I    K  K         +   L+ E     Q L+ AQ 
Sbjct: 160 RAQAALYANSQRLQ------QIRNLLKGGKVGGKALRPSQRVLLQAE-----QALLNAQN 208

Query: 277 MIRRLE----EQLKQL-QAAKEELEARQTELQLELQPIR---NQSNL 315
            ++R       QL+ L Q  ++ L AR   L+ +LQ ++   N   L
Sbjct: 209 DLQRKSLEGNTQLQDLLQKQRDYLTARIQRLEHQLQLLQEAINSKRL 255


>gnl|CDD|204032 pfam08703, PLC-beta_C, PLC-beta C terminal.  This domain
           corresponds to the alpha helical C terminal domain of
           phospholipase C beta.
          Length = 181

 Score = 37.5 bits (87), Expect = 0.003
 Identities = 27/99 (27%), Positives = 53/99 (53%), Gaps = 2/99 (2%)

Query: 211 DTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQD 270
           +  ++++   + R E+  + + ++K   E    E      QE V+ +K+LE    +R + 
Sbjct: 64  EKKELKKKLDRKRLERIQEAKTKEKHAQEREKTEINRSHIQEVVQSIKRLEEAQKRRQEK 123

Query: 271 LMEAQ-EMIRRLEEQLKQLQA-AKEELEARQTELQLELQ 307
           L E Q E++++++E+  +LQA A  E EA+   L  E+Q
Sbjct: 124 LEEKQAEILQQIQEEEPKLQAEATAEYEAKLKRLPQEVQ 162


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 38.3 bits (89), Expect = 0.004
 Identities = 20/92 (21%), Positives = 43/92 (46%), Gaps = 2/92 (2%)

Query: 207 RRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAK 266
           +R  +    ++++ +A  EK AKQ ++   Q E   ++  E K +++ E   + E E  K
Sbjct: 83  QRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAKAKQAAEAKAKAEAEAEK 142

Query: 267 RDQDLMEAQEMIRRLEEQLKQLQAAKEELEAR 298
           + ++  + Q       +     +A K+  EA+
Sbjct: 143 KAKEEAKKQAEEEAKAKA--AAEAKKKAAEAK 172



 Score = 35.6 bits (82), Expect = 0.027
 Identities = 23/88 (26%), Positives = 48/88 (54%), Gaps = 5/88 (5%)

Query: 215 VQQMKAQAREEKNAKQQQRDKL-QLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
            QQ     +++K A ++++++  +LE  A E  +++  E   R K+LE + A  ++   +
Sbjct: 49  AQQANRIQQQKKPAAKKEQERQKKLEQQAEEAEKQRAAEQ-ARQKELE-QRAAAEKAAKQ 106

Query: 274 AQEMIRRLEEQLKQL--QAAKEELEARQ 299
           A++  ++ EE+ KQ     AK+  EA+ 
Sbjct: 107 AEQAAKQAEEKQKQAEEAKAKQAAEAKA 134



 Score = 33.7 bits (77), Expect = 0.10
 Identities = 21/96 (21%), Positives = 49/96 (51%), Gaps = 5/96 (5%)

Query: 206 RRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAARE---KAEKKHQESVERLKQLEV 262
           ++ +     ++Q   +A +++ A+Q ++ +L+   AA +   +AE+  +++ E+ KQ E 
Sbjct: 64  KKEQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAKQAEQAAKQAEEKQKQAEE 123

Query: 263 EMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEAR 298
             AK+  +     E     E++ K+    + E EA+
Sbjct: 124 AKAKQAAEAKAKAEAEA--EKKAKEEAKKQAEEEAK 157



 Score = 33.7 bits (77), Expect = 0.12
 Identities = 19/82 (23%), Positives = 38/82 (46%), Gaps = 6/82 (7%)

Query: 226 KNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQL 285
             A  QQ +++Q +     K E+      ER K+LE +  + ++     Q   + LE++ 
Sbjct: 45  PGAVAQQANRIQQQKKPAAKKEQ------ERQKKLEQQAEEAEKQRAAEQARQKELEQRA 98

Query: 286 KQLQAAKEELEARQTELQLELQ 307
              +AAK+  +A +   + + Q
Sbjct: 99  AAEKAAKQAEQAAKQAEEKQKQ 120



 Score = 27.9 bits (62), Expect = 8.5
 Identities = 15/79 (18%), Positives = 32/79 (40%), Gaps = 7/79 (8%)

Query: 219 KAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMI 278
           KA+   E  AK +   + + +  A+++AE+      E   +     AK+     + +   
Sbjct: 125 KAKQAAEAKAKAEAEAEKKAKEEAKKQAEE------EAKAK-AAAEAKKKAAEAKKKAEA 177

Query: 279 RRLEEQLKQLQAAKEELEA 297
               +   + +A  EE +A
Sbjct: 178 EAKAKAEAKAKAKAEEAKA 196


>gnl|CDD|217843 pfam04012, PspA_IM30, PspA/IM30 family.  This family includes PspA
           a protein that suppresses sigma54-dependent
           transcription. The PspA protein, a negative regulator of
           the Escherichia coli phage shock psp operon, is produced
           when virulence factors are exported through secretins in
           many Gram-negative pathogenic bacteria and its homologue
           in plants, VIPP1, plays a critical role in thylakoid
           biogenesis, essential for photosynthesis. Activation of
           transcription by the enhancer-dependent bacterial
           sigma(54) containing RNA polymerase occurs through ATP
           hydrolysis-driven protein conformational changes enabled
           by activator proteins that belong to the large AAA(+)
           mechanochemical protein family. It has been shown that
           PspA directly and specifically acts upon and binds to
           the AAA(+) domain of the PspF transcription activator.
          Length = 220

 Score = 37.7 bits (88), Expect = 0.004
 Identities = 24/108 (22%), Positives = 53/108 (49%), Gaps = 4/108 (3%)

Query: 215 VQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQ----ESVERLKQLEVEMAKRDQD 270
            Q +  Q + E+  ++Q+    +LE  AR    K ++    E++  +  LE +    +  
Sbjct: 47  AQVIARQKQLERKLEEQKEQAKKLENKARAALTKGNEELAREALAEIATLEKQAEALETQ 106

Query: 271 LMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNLNKV 318
           L + +  + +L +QL  L+   ++L+A++T L+  L+  + Q  +N  
Sbjct: 107 LTQQRSAVEQLRKQLAALETKIQQLKAKKTALKARLKAAKAQEAVNTS 154


>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein.  Hsp70 chaperones help to fold
           many proteins. Hsp70 assisted folding involves repeated
           cycles of substrate binding and release. Hsp70 activity
           is ATP dependent. Hsp70 proteins are made up of two
           regions: the amino terminus is the ATPase domain and the
           carboxyl terminus is the substrate binding region.
          Length = 598

 Score = 38.4 bits (90), Expect = 0.004
 Identities = 32/101 (31%), Positives = 50/101 (49%), Gaps = 16/101 (15%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLE-IAAREKAEKKHQESVERL-KQLEVEMAKRDQDL 271
           ++++M   A  E+ A +   DK + E I A+ +AE    E V  L K L+ E  K  +  
Sbjct: 505 EIERMVKDA--EEYAAE---DKKRKERIEAKNEAE----EYVYSLEKSLKEEGDKLPE-- 553

Query: 272 MEAQEMIRRLEEQLKQLQAA--KEELEARQTELQLELQPIR 310
              ++ +    E LK+      KEE+EA+  ELQ  +QPI 
Sbjct: 554 -ADKKKVEEAIEWLKEELEGEDKEEIEAKTEELQKVVQPIG 593


>gnl|CDD|224482 COG1566, EmrA, Multidrug resistance efflux pump [Defense
           mechanisms].
          Length = 352

 Score = 37.7 bits (88), Expect = 0.005
 Identities = 25/110 (22%), Positives = 45/110 (40%), Gaps = 5/110 (4%)

Query: 202 ELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLE 261
            L     +   +  Q   AQA   + A+ Q  D+ Q E+  R +  ++   S E L +  
Sbjct: 99  ALAAAEAQLRNLRAQLASAQALIAQ-AEAQDLDQAQNELERRAELAQRGVVSREELDRAR 157

Query: 262 VEMAKRDQDLMEAQEMIRR----LEEQLKQLQAAKEELEARQTELQLELQ 307
             +   +  L  AQ   ++    LE ++   QA     EA   + +L+L+
Sbjct: 158 AALQAAEAALAAAQAAQKQNLALLESEVSGAQAQVASAEAALDQAKLDLE 207



 Score = 32.3 bits (74), Expect = 0.28
 Identities = 19/113 (16%), Positives = 37/113 (32%), Gaps = 18/113 (15%)

Query: 213 IDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKH----QESVERLKQL-------- 260
             ++Q +A     +   +  R +L    A   +AE +     Q  +ER  +L        
Sbjct: 91  AALEQAEAALAAAEAQLRNLRAQLASAQALIAQAEAQDLDQAQNELERRAELAQRGVVSR 150

Query: 261 -EVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
            E++ A+      EA      L       +     LE+  +  Q ++      
Sbjct: 151 EELDRARAALQAAEAA-----LAAAQAAQKQNLALLESEVSGAQAQVASAEAA 198


>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
           TolA; Provisional.
          Length = 387

 Score = 37.9 bits (88), Expect = 0.005
 Identities = 18/84 (21%), Positives = 36/84 (42%), Gaps = 1/84 (1%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
             +Q + +  E++    Q++ K Q E AA++ A K+ Q      K      AK + +   
Sbjct: 97  AAEQERLKQLEKERLAAQEQKK-QAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKR 155

Query: 274 AQEMIRRLEEQLKQLQAAKEELEA 297
           A    ++   + K+   A+   +A
Sbjct: 156 AAAAAKKAAAEAKKKAEAEAAKKA 179



 Score = 37.5 bits (87), Expect = 0.007
 Identities = 27/96 (28%), Positives = 42/96 (43%), Gaps = 15/96 (15%)

Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
           QQ K+  R      ++QR K + + A  E+ ++K     ERLKQLE E     +   +A+
Sbjct: 69  QQQKSAKR-----AEEQRKKKEQQQA--EELQQKQAAEQERLKQLEKERLAAQEQKKQAE 121

Query: 276 EMIRRLEEQLKQ--------LQAAKEELEARQTELQ 303
           E  ++   + KQ          AAK + EA      
Sbjct: 122 EAAKQAALKQKQAEEAAAKAAAAAKAKAEAEAKRAA 157


>gnl|CDD|241340 cd13186, FERM_C_NBL4_NBL5, Novel band 4.1-like protein 4 and 5
          FERM (NBL4 and 5) domain C-lobe.  NBL4 (also called
          Erythrocyte protein band 4.1-like 4; Epb4 1l4) plays a
          role the beta-catenin/Tcf signaling pathway and is
          thought to be involved in establishing the cell
          polarity or proliferation. NBL4 may be also involved in
          adhesion, in cell motility and/or in cell-to-cell
          communication. No role for NBL5 has been proposed to
          date. Both NBL4 and NBL5 contain a N-terminal FERM
          domain which has a cloverleaf tripart structure
          (FERM_N, FERM_M, FERM_C/N, alpha-, and C-lobe/A-lobe,
          B-lobe, C-lobe/F1, F2, F3).  A single FERM_C repeat 2
          from Tetraodon nigroviridis also included in this cdThe
          C-lobe is a member of the PH superfamily. The FERM
          domain is found in the cytoskeletal-associated proteins
          such as ezrin, moesin, radixin, 4.1R, and merlin. These
          proteins provide a link between the membrane and
          cytoskeleton and are involved in signal transduction
          pathways. The FERM domain is also found in protein
          tyrosine phosphatases (PTPs) , the tyrosine kinases FAK
          and JAK, in addition to other proteins involved in
          signaling. This domain is structurally similar to the
          PH and PTB domains and consequently is capable of
          binding to both peptides and phospholipids at different
          sites.
          Length = 111

 Score = 35.7 bits (83), Expect = 0.006
 Identities = 18/44 (40%), Positives = 28/44 (63%)

Query: 37 DAMMEYLKIAQDLEMYGVNYFEIKNKKGTELWLGVDALGLNIYE 80
          +A + YL  A+ LEMYGV+   +K + G E +LG+   G+ +YE
Sbjct: 4  EAELNYLNKAKWLEMYGVDLHPVKGEDGNEYFLGLTPTGILVYE 47



 Score = 26.5 bits (59), Expect = 9.7
 Identities = 9/27 (33%), Positives = 15/27 (55%), Gaps = 1/27 (3%)

Query: 150 KIG-FPWSEIRNISFNDRKFIIKPIDK 175
           KIG F W +I  + F  +K  +  ++K
Sbjct: 51  KIGLFFWPKITKLDFKGKKLKLVVVEK 77


>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis).  This nucleolar
           family of proteins are involved in 60S ribosomal
           biogenesis. They are specifically involved in the
           processing beyond the 27S stage of 25S rRNA maturation.
           This family contains sequences that bear similarity to
           the glioma tumour suppressor candidate region gene 2
           protein (p60). This protein has been found to interact
           with herpes simplex type 1 regulatory proteins.
          Length = 387

 Score = 37.0 bits (86), Expect = 0.010
 Identities = 26/93 (27%), Positives = 46/93 (49%), Gaps = 10/93 (10%)

Query: 219 KAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEM- 277
           KAQ  +EK  K+ +R+      A  EK  KK    + RLK++  E+A++++     +E  
Sbjct: 280 KAQRNKEKRRKELERE------AKEEKQLKKKLAQLARLKEIAKEVAQKEKARARKKEQR 333

Query: 278 -IRRLEEQLKQLQAAKEELEARQTELQL--ELQ 307
             R  +++LK+ +  K +      E+ L  EL 
Sbjct: 334 KERGEKKKLKRRKLGKHKYPEPPLEVVLPDELS 366


>gnl|CDD|115307 pfam06637, PV-1, PV-1 protein (PLVAP).  This family consists of
           several PV-1 (PLVAP) proteins which seem to be specific
           to mammals. PV-1 is a novel protein component of the
           endothelial fenestral and stomatal diaphragms. The
           function of this family is unknown.
          Length = 442

 Score = 36.9 bits (85), Expect = 0.010
 Identities = 26/84 (30%), Positives = 46/84 (54%), Gaps = 1/84 (1%)

Query: 222 AREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRL 281
           ARE  + ++Q+ +  Q   AA+E   K  +E+  R  +L+ E A++ Q  +E +  +R  
Sbjct: 298 ARENADLQRQKLEAEQAAQAAQEAKAKAEKEAQAREAKLQAECARQTQLALEEKAALRAE 357

Query: 282 EEQL-KQLQAAKEELEARQTELQL 304
            + L K+L+A K E E  + EL +
Sbjct: 358 RDNLAKELEAKKREAEQLRMELAI 381


>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y.  Members of this family are
           RNase Y, an endoribonuclease. The member from Bacillus
           subtilis, YmdA, has been shown to be involved in
           turnover of yitJ riboswitch [Transcription, Degradation
           of RNA].
          Length = 514

 Score = 36.8 bits (86), Expect = 0.011
 Identities = 23/90 (25%), Positives = 42/90 (46%), Gaps = 7/90 (7%)

Query: 224 EEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMI----R 279
           E K    + R +L+ E+  R    ++ +    RL Q E  + ++ + L + +E +    +
Sbjct: 55  EAKEEVHKLRAELERELKERRNELQRLER---RLLQREETLDRKMESLDKKEENLEKKEK 111

Query: 280 RLEEQLKQLQAAKEELEARQTELQLELQPI 309
            L  + K L   +EELE    E + EL+ I
Sbjct: 112 ELSNKEKNLDEKEEELEELIAEQREELERI 141



 Score = 33.4 bits (77), Expect = 0.16
 Identities = 20/112 (17%), Positives = 55/112 (49%), Gaps = 9/112 (8%)

Query: 190 NKRILALCMGNHELYMRRRKPDTIDVQQMKA--QAREEKNAKQQQRDKLQLEIAAREKAE 247
             +  AL     E++  R + +  ++++ +   Q  E +    Q+ + L  ++ + +K E
Sbjct: 47  TLKKEALLEAKEEVHKLRAELER-ELKERRNELQRLERRL--LQREETLDRKMESLDKKE 103

Query: 248 KKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQ 299
           +  ++  + L   E  + ++++   E +E+I    E+L+++    +E EA++
Sbjct: 104 ENLEKKEKELSNKEKNLDEKEE---ELEELIAEQREELERISGLTQE-EAKE 151



 Score = 33.0 bits (76), Expect = 0.18
 Identities = 25/103 (24%), Positives = 55/103 (53%), Gaps = 17/103 (16%)

Query: 208 RKPDTIDVQQMKAQAREEK-NAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAK 266
           RK +++D ++   + +E++ + K++  D+ + E+   E+   + +E +ER+  L  E AK
Sbjct: 94  RKMESLDKKEENLEKKEKELSNKEKNLDEKEEEL---EELIAEQREELERISGLTQEEAK 150

Query: 267 -------RDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTEL 302
                   ++   EA ++I+ +EE+      AKEE + +  E+
Sbjct: 151 EILLEEVEEEARHEAAKLIKEIEEE------AKEEADKKAKEI 187


>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
           Mitofilin controls mitochondrial cristae morphology.
           Mitofilin is enriched in the narrow space between the
           inner boundary and the outer membranes, where it forms a
           homotypic interaction and assembles into a large
           multimeric protein complex. The first 78 amino acids
           contain a typical amino-terminal-cleavable mitochondrial
           presequence rich in positive-charged and hydroxylated
           residues and a membrane anchor domain. In addition, it
           has three centrally located coiled coil domains.
          Length = 493

 Score = 36.9 bits (86), Expect = 0.011
 Identities = 28/106 (26%), Positives = 55/106 (51%), Gaps = 10/106 (9%)

Query: 214 DVQQMKAQAREE-KNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLM 272
            + ++KA+  EE + A +++R++L  ++     A  + +E+    KQL +E  +      
Sbjct: 178 KLAELKAEEEEELERALKEKREELLSKLEEELLARLESKEA-ALEKQLRLEFER------ 230

Query: 273 EAQEMIRRLEEQLK-QLQAAKEELEARQTELQLELQPIRNQSNLNK 317
           E +E+ ++ EE+L+ +L+   E  E +    +L LQ I  Q   NK
Sbjct: 231 EKEELRKKYEEKLRQELERQAEAHEQKLKN-ELALQAIELQREFNK 275



 Score = 30.8 bits (70), Expect = 1.1
 Identities = 19/76 (25%), Positives = 39/76 (51%), Gaps = 2/76 (2%)

Query: 234 DKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQ-LQAAK 292
           D+L  ++A  +  E++  E   + K+ E+     ++ L   +     LE+QL+   +  K
Sbjct: 173 DQLSKKLAELKAEEEEELERALKEKREELLSKLEEELLARLESKEAALEKQLRLEFEREK 232

Query: 293 EELEAR-QTELQLELQ 307
           EEL  + + +L+ EL+
Sbjct: 233 EELRKKYEEKLRQELE 248


>gnl|CDD|234172 TIGR03345, VI_ClpV1, type VI secretion ATPase, ClpV1 family.
           Members of this protein family are homologs of ClpB, an
           ATPase associated with chaperone-related functions.
           These ClpB homologs, designated ClpV1, are a key
           component of the bacterial pathogenicity-associated type
           VI secretion system [Protein fate, Protein and peptide
           secretion and trafficking, Cellular processes,
           Pathogenesis].
          Length = 852

 Score = 37.2 bits (87), Expect = 0.012
 Identities = 18/70 (25%), Positives = 37/70 (52%)

Query: 230 QQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQ 289
           +++   L+LE+ A E+      +  ERL +L  E+A  + +L   +   ++ +E ++ + 
Sbjct: 425 RRRIAALELELDALEREAALGADHDERLAELRAELAALEAELAALEARWQQEKELVEAIL 484

Query: 290 AAKEELEARQ 299
           A + ELEA  
Sbjct: 485 ALRAELEADA 494



 Score = 28.0 bits (63), Expect = 7.6
 Identities = 18/63 (28%), Positives = 31/63 (49%), Gaps = 6/63 (9%)

Query: 256 RLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEE---LEAR-QTELQL--ELQPI 309
           R+  LE+E+   +++     +   RL E   +L A + E   LEAR Q E +L   +  +
Sbjct: 427 RIAALELELDALEREAALGADHDERLAELRAELAALEAELAALEARWQQEKELVEAILAL 486

Query: 310 RNQ 312
           R +
Sbjct: 487 RAE 489


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 36.8 bits (85), Expect = 0.014
 Identities = 32/104 (30%), Positives = 54/104 (51%), Gaps = 15/104 (14%)

Query: 205 MRRRKPDTIDVQQM----KAQAREEKNAKQQ-----QRDKLQLEIAAR--EKAEKKHQES 253
           +  ++ D   +Q M    K++A    NA++Q     +R K + E AAR   +A    +E 
Sbjct: 468 VSAKQKDKQSMQSMEKRLKSEADSRVNAEKQLAEEKKRKKEEEETAARAAAQAAASREEC 527

Query: 254 VERLKQ----LEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKE 293
            E LKQ    LE+E+ K + DL   +E  R LE++ ++L+  +E
Sbjct: 528 AESLKQAKQDLEMEIKKLEHDLKLKEEECRMLEKEAQELRKYQE 571



 Score = 35.3 bits (81), Expect = 0.045
 Identities = 26/92 (28%), Positives = 45/92 (48%), Gaps = 6/92 (6%)

Query: 215 VQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEA 274
           +  M +  +++K + Q    +L+ E  +R  AEK+  E  +R K+ E E A R      A
Sbjct: 464 LNSMVSAKQKDKQSMQSMEKRLKSEADSRVNAEKQLAEEKKRKKEEE-ETAARAAAQAAA 522

Query: 275 QEMIRRLEEQLKQLQAAKEELEARQTELQLEL 306
                  EE  + L+ AK++LE    +L+ +L
Sbjct: 523 SR-----EECAESLKQAKQDLEMEIKKLEHDL 549


>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain.  Axonemal
           dynein light chain proteins play a dynamic role in
           flagellar and cilia motility. Eukaryotic cilia and
           flagella are complex organelles consisting of a core
           structure, the axoneme, which is composed of nine
           microtubule doublets forming a cylinder that surrounds a
           pair of central singlet microtubules. This
           ultra-structural arrangement seems to be one of the most
           stable micro-tubular assemblies known and is responsible
           for the flagellar and ciliary movement of a large number
           of organisms ranging from protozoan to mammals. This
           light chain interacts directly with the N-terminal half
           of the heavy chains.
          Length = 189

 Score = 35.6 bits (83), Expect = 0.015
 Identities = 16/78 (20%), Positives = 39/78 (50%), Gaps = 9/78 (11%)

Query: 211 DTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQD 270
              +++Q   +  EEK   +++  +L+ ++ A EK E++ ++  E+    E+   K+   
Sbjct: 121 GKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHADEIAFLKKQ-- 178

Query: 271 LMEAQEMIRRLEEQLKQL 288
                   ++L+ QL+Q+
Sbjct: 179 -------NQQLKSQLEQI 189



 Score = 29.5 bits (67), Expect = 1.7
 Identities = 14/57 (24%), Positives = 29/57 (50%), Gaps = 2/57 (3%)

Query: 258 KQLEVEMAKRD--QDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
           K L+ E  K +  Q++ + +E    LE+++ +L+A  E +E R+ E +   +     
Sbjct: 114 KALQAEQGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEKREEEERQIEEKRHAD 170


>gnl|CDD|221016 pfam11180, DUF2968, Protein of unknown function (DUF2968).  This
           family of proteins has no known function.
          Length = 192

 Score = 35.5 bits (82), Expect = 0.019
 Identities = 21/84 (25%), Positives = 40/84 (47%), Gaps = 3/84 (3%)

Query: 211 DTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQD 270
               ++  KAQ   +  A + + ++LQ ++   +  E   Q+   R +Q   E    +  
Sbjct: 106 RRAQLEAQKAQLERQIAASEARAERLQADLQLAQAQE---QQVAARQQQARQEAVALEAQ 162

Query: 271 LMEAQEMIRRLEEQLKQLQAAKEE 294
              AQ  +R+L+ Q++QLQA + E
Sbjct: 163 RQAAQAQLRKLQRQIRQLQAQQNE 186



 Score = 27.8 bits (62), Expect = 6.1
 Identities = 15/81 (18%), Positives = 41/81 (50%), Gaps = 1/81 (1%)

Query: 234 DKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKE 293
            + QLE A + + E++   S  R ++L+ ++        +     ++  ++   L+A ++
Sbjct: 106 RRAQLE-AQKAQLERQIAASEARAERLQADLQLAQAQEQQVAARQQQARQEAVALEAQRQ 164

Query: 294 ELEARQTELQLELQPIRNQSN 314
             +A+  +LQ +++ ++ Q N
Sbjct: 165 AAQAQLRKLQRQIRQLQAQQN 185


>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding.  This
           family is the highly conserved central region of a
           number of metazoan proteins referred to as growth-arrest
           proteins. In mouse, Gas8 is predominantly a testicular
           protein, whose expression is developmentally regulated
           during puberty and spermatogenesis. In humans, it is
           absent in infertile males who lack the ability to
           generate gametes. The localisation of Gas8 in the
           motility apparatus of post-meiotic gametocytes and
           mature spermatozoa, together with the detection of Gas8
           also in cilia at the apical surfaces of epithelial cells
           lining the pulmonary bronchi and Fallopian tubes
           suggests that the Gas8 protein may have a role in the
           functioning of motile cellular appendages. Gas8 is a
           microtubule-binding protein localised to regions of
           dynein regulation in mammalian cells.
          Length = 201

 Score = 35.3 bits (82), Expect = 0.020
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 5/105 (4%)

Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREK--AE--KKHQESVERLKQLEVEMAKRDQDL 271
           + +K +   E    ++  +KL  EIA   K   E  KK +E VE L++   +  K  Q L
Sbjct: 30  KSLKEEI-AEMKKNEEHNEKLMAEIAQENKRLVEPLKKAEEEVEELRKKLKDYEKDKQSL 88

Query: 272 MEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNLN 316
              +  ++ LE++LK L+   E LE R  +++ E   + ++    
Sbjct: 89  KNLKARLKELEKELKNLKWESEVLEQRFEKVERERDELYDKFEAA 133



 Score = 29.9 bits (68), Expect = 1.4
 Identities = 18/94 (19%), Positives = 48/94 (51%), Gaps = 7/94 (7%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
           ++ Q   +  E     +++ ++L+ ++   EK ++  +    RLK+LE E+    ++L  
Sbjct: 52  EIAQENKRLVEPLKKAEEEVEELRKKLKDYEKDKQSLKNLKARLKELEKEL----KNLKW 107

Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
             E+   LE++ ++++  ++EL  +      ++Q
Sbjct: 108 ESEV---LEQRFEKVERERDELYDKFEAAIQDVQ 138


>gnl|CDD|240439 PTZ00491, PTZ00491, major vault protein; Provisional.
          Length = 850

 Score = 36.1 bits (84), Expect = 0.021
 Identities = 31/119 (26%), Positives = 54/119 (45%), Gaps = 20/119 (16%)

Query: 212 TIDVQQMKAQAREEKNAKQQQRD-KLQLEI-------AAREKAEKKHQES---VERLK-- 258
            +DVQ +  +  +E+     Q+  +L +EI       AAR +AE   QE+   +ER K  
Sbjct: 632 NVDVQSV--EPVDERTRDSLQKSVQLAIEITTKSQEAAARHQAELLEQEARGRLERQKMH 689

Query: 259 -QLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ--PIRNQSN 314
            + + E  +    L+E Q     +E   +    A  E EAR  E + E++   +R ++ 
Sbjct: 690 DKAKAEEQRT--KLLELQAESAAVESSGQSRAEALAEAEARLIEAEAEVEQAELRAKAL 746


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 36.2 bits (84), Expect = 0.026
 Identities = 27/94 (28%), Positives = 51/94 (54%), Gaps = 1/94 (1%)

Query: 215 VQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEA 274
            ++   +   E N    +  +L+ E+   EK  K+ +E  E +++LE E+   +    + 
Sbjct: 198 KEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKL 257

Query: 275 QEMIRRLEEQLKQLQAAKEELEARQTELQLELQP 308
           +E IR LEE++++L+   EELE +  EL+ EL+ 
Sbjct: 258 EEKIRELEERIEELKKEIEELEEKVKELK-ELKE 290



 Score = 32.3 bits (74), Expect = 0.36
 Identities = 19/92 (20%), Positives = 54/92 (58%)

Query: 212 TIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDL 271
           T +++++  +  +E     ++ +++  E+    +  +K ++ V+ L++L+ E+ + +++L
Sbjct: 188 TENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKEL 247

Query: 272 MEAQEMIRRLEEQLKQLQAAKEELEARQTELQ 303
              +   R+LEE++++L+   EEL+    EL+
Sbjct: 248 ESLEGSKRKLEEKIRELEERIEELKKEIEELE 279



 Score = 30.0 bits (68), Expect = 2.0
 Identities = 20/98 (20%), Positives = 48/98 (48%), Gaps = 2/98 (2%)

Query: 217 QMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQE 276
           + + +  EEK  + ++  K   E+  R +  ++  E  E  K  + E+ +  + L     
Sbjct: 327 EERIKELEEKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRL--TGL 384

Query: 277 MIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSN 314
              +LE++L++L+ AKEE+E   +++   +  ++ +  
Sbjct: 385 TPEKLEKELEELEKAKEEIEEEISKITARIGELKKEIK 422



 Score = 29.6 bits (67), Expect = 2.6
 Identities = 26/102 (25%), Positives = 56/102 (54%), Gaps = 6/102 (5%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
           ++++  ++  EE N  +++  +L+ +    E+ +KK +E  +RL++LE E  +  ++   
Sbjct: 311 EIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELE-ERHELYEEAKA 369

Query: 274 AQEMIRRLEEQLKQLQAAK-----EELEARQTELQLELQPIR 310
            +E + RL+++L  L   K     EELE  + E++ E+  I 
Sbjct: 370 KKEELERLKKRLTGLTPEKLEKELEELEKAKEEIEEEISKIT 411



 Score = 29.6 bits (67), Expect = 2.8
 Identities = 19/73 (26%), Positives = 40/73 (54%), Gaps = 1/73 (1%)

Query: 236 LQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEEL 295
           ++  I   EK  K+ +   E +K+ E E+ +  +++ E    +  L E+L++L+   +EL
Sbjct: 174 IKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKEL 233

Query: 296 EARQTEL-QLELQ 307
           E  + E+ +LE +
Sbjct: 234 EELKEEIEELEKE 246



 Score = 29.3 bits (66), Expect = 3.0
 Identities = 21/98 (21%), Positives = 49/98 (50%), Gaps = 9/98 (9%)

Query: 222 AREEKNAKQQQRDKLQLEIAAREK----AEKKHQESVERLKQLEV-----EMAKRDQDLM 272
           A +E   ++++  KL+ E+    +     EK+ +E  + L++LE      E  +  ++ +
Sbjct: 610 AEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELREEYL 669

Query: 273 EAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIR 310
           E    +  L  +L++L+  +EE++    +L+ EL+   
Sbjct: 670 ELSRELAGLRAELEELEKRREEIKKTLEKLKEELEERE 707



 Score = 29.3 bits (66), Expect = 3.2
 Identities = 23/87 (26%), Positives = 44/87 (50%), Gaps = 1/87 (1%)

Query: 219 KAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMI 278
             +  +E    +++  +L+ EI   E+  K+ +E  ERL++L+ ++ + ++ L E +E  
Sbjct: 302 YEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKLKELEKRLEELEERH 361

Query: 279 RRLEEQLKQLQAAKEELEARQTELQLE 305
              EE  K  +   E L+ R T L  E
Sbjct: 362 ELYEE-AKAKKEELERLKKRLTGLTPE 387



 Score = 28.5 bits (64), Expect = 5.7
 Identities = 21/78 (26%), Positives = 44/78 (56%)

Query: 233 RDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAK 292
            ++L+  I   E  E+  +E  + L+++  E+ +   +L E +E + +LE+++K+L+  K
Sbjct: 178 IERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELK 237

Query: 293 EELEARQTELQLELQPIR 310
           EE+E  + EL+      R
Sbjct: 238 EEIEELEKELESLEGSKR 255



 Score = 28.5 bits (64), Expect = 6.1
 Identities = 17/68 (25%), Positives = 38/68 (55%)

Query: 237 QLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELE 296
           +LE   ++ +E++++E  E   +L  E+A    +L E ++    +++ L++L+   EE E
Sbjct: 648 ELEELEKKYSEEEYEELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELEERE 707

Query: 297 ARQTELQL 304
             + EL+ 
Sbjct: 708 KAKKELEK 715



 Score = 28.1 bits (63), Expect = 8.8
 Identities = 17/59 (28%), Positives = 35/59 (59%)

Query: 244 EKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTEL 302
           EK  +++++  E+L +L+ E+    ++L + +E+ ++L E  K+L   +EEL     EL
Sbjct: 521 EKKAEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLKEL 579


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 35.5 bits (82), Expect = 0.032
 Identities = 23/100 (23%), Positives = 37/100 (37%), Gaps = 16/100 (16%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAA-------REKAEKKHQESV---ERLKQLEVE 263
            +  + A+    +  +++QR +L  ++AA          A     E      RL      
Sbjct: 95  QIADLNARLNALEVQEREQRRRLAEQLAALQRSGRNPPPALLVSPEDAQRSVRLAIYYGA 154

Query: 264 MAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQ 303
           +           E I  L+  LKQL A + E+ A Q EL 
Sbjct: 155 LNP------ARAERIDALKATLKQLAAVRAEIAAEQAELT 188



 Score = 35.5 bits (82), Expect = 0.034
 Identities = 23/106 (21%), Positives = 43/106 (40%), Gaps = 4/106 (3%)

Query: 211 DTIDVQQMKAQAREEKNAKQQQRDKL---QLEIAA-REKAEKKHQESVERLKQLEVEMAK 266
           D     ++         A+ ++ D L     ++AA R +   +  E    L +   + AK
Sbjct: 141 DAQRSVRLAIYYGALNPARAERIDALKATLKQLAAVRAEIAAEQAELTTLLSEQRAQQAK 200

Query: 267 RDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
             Q L E ++ + +L  +L   Q   EEL A ++ L+ E+      
Sbjct: 201 LAQLLEERKKTLAQLNSELSADQKKLEELRANESRLKNEIASAEAA 246



 Score = 35.1 bits (81), Expect = 0.047
 Identities = 15/97 (15%), Positives = 31/97 (31%), Gaps = 4/97 (4%)

Query: 214 DVQQMKAQAREEKNAKQ----QQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQ 269
            V+   A  + E          Q+ KL   +  R+K   +    +   ++   E+   + 
Sbjct: 175 AVRAEIAAEQAELTTLLSEQRAQQAKLAQLLEERKKTLAQLNSELSADQKKLEELRANES 234

Query: 270 DLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLEL 306
            L            + ++  AA E   AR    + + 
Sbjct: 235 RLKNEIASAEAAAAKAREAAAAAEAAAARARAAEAKR 271



 Score = 34.7 bits (80), Expect = 0.054
 Identities = 22/85 (25%), Positives = 39/85 (45%), Gaps = 10/85 (11%)

Query: 228 AKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQ 287
           A  +Q  ++Q EIAA EK   K +E  ++  +LE ++   + +       I  LE QL +
Sbjct: 35  ADDKQLKQIQKEIAALEK---KIREQQDQRAKLEKQLKSLETE-------IASLEAQLIE 84

Query: 288 LQAAKEELEARQTELQLELQPIRNQ 312
                ++L  +  +L   L  +  Q
Sbjct: 85  TADDLKKLRKQIADLNARLNALEVQ 109



 Score = 30.5 bits (69), Expect = 1.1
 Identities = 20/62 (32%), Positives = 36/62 (58%)

Query: 251 QESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIR 310
               ++LKQ++ E+A  ++ + E Q+   +LE+QLK L+     LEA+  E   +L+ +R
Sbjct: 34  AADDKQLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLR 93

Query: 311 NQ 312
            Q
Sbjct: 94  KQ 95



 Score = 30.1 bits (68), Expect = 1.5
 Identities = 19/88 (21%), Positives = 38/88 (43%), Gaps = 10/88 (11%)

Query: 215 VQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEA 274
           +Q+  A   ++   +Q QR KL+ ++       K  +     +  LE ++ +   DL + 
Sbjct: 43  IQKEIAALEKKIREQQDQRAKLEKQL-------KSLETE---IASLEAQLIETADDLKKL 92

Query: 275 QEMIRRLEEQLKQLQAAKEELEARQTEL 302
           ++ I  L  +L  L+  + E   R  E 
Sbjct: 93  RKQIADLNARLNALEVQEREQRRRLAEQ 120


>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
           partitioning / Cytoskeleton].
          Length = 373

 Score = 35.0 bits (81), Expect = 0.038
 Identities = 22/99 (22%), Positives = 45/99 (45%), Gaps = 12/99 (12%)

Query: 200 NHELYMR-RRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLK 258
            + LY   R +  +      +   +E   A+  +      E   ++K  +K +E  +RL+
Sbjct: 280 ENLLYENYRTEKLSGLKNSGEPSLKEIHEARLNEE-----ERELKKKFTEKIREKEKRLE 334

Query: 259 QLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEA 297
           +LE  + +      E +E+  +LEE  K+L+  ++ LE 
Sbjct: 335 ELEQNLIE------ERKELNSKLEEIQKKLEDLEKRLEK 367


>gnl|CDD|178635 PLN03086, PLN03086, PRLI-interacting factor K; Provisional.
          Length = 567

 Score = 35.2 bits (81), Expect = 0.042
 Identities = 28/87 (32%), Positives = 45/87 (51%), Gaps = 10/87 (11%)

Query: 242 AREKAEKKHQESVERLK-QLEVEM-----AKRDQDLMEAQEMIRRL---EEQLKQLQAAK 292
           AREK E++ +E  +R K +LE E      A + ++ +EA +  RRL   E Q+K  Q  +
Sbjct: 8   AREKLEREQRERKQRAKLKLERERKAKEEAAKQREAIEAAQRSRRLDAIEAQIKADQQMQ 67

Query: 293 EELEA-RQTELQLELQPIRNQSNLNKV 318
           E L+A R        + +  Q N +K+
Sbjct: 68  ESLQAGRGIVFSRIFEAVSFQGNGDKI 94



 Score = 31.0 bits (70), Expect = 0.93
 Identities = 19/55 (34%), Positives = 34/55 (61%), Gaps = 4/55 (7%)

Query: 223 REEKNAKQQQRDKLQLEIAAREKAEKKHQ--ESVERLKQLEVEMA--KRDQDLME 273
           RE++  KQ+ + KL+ E  A+E+A K+ +  E+ +R ++L+   A  K DQ + E
Sbjct: 14  REQRERKQRAKLKLERERKAKEEAAKQREAIEAAQRSRRLDAIEAQIKADQQMQE 68


>gnl|CDD|226883 COG4477, EzrA, Negative regulator of septation ring formation [Cell
           division and chromosome partitioning].
          Length = 570

 Score = 35.0 bits (81), Expect = 0.043
 Identities = 26/106 (24%), Positives = 47/106 (44%), Gaps = 17/106 (16%)

Query: 214 DVQQMKAQAREEKNAKQQQR-----DKLQLEIAAREKA--------EKKHQESVERLKQL 260
           D+ ++     EEKN+++        ++L+ ++ A            EKK +   E L Q 
Sbjct: 126 DLNELVES--EEKNSEEIDHVLELYEELRRDVLANRHQYGEAAPELEKKLENIEEELSQF 183

Query: 261 EVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLEL 306
                    D +EA+E++   EE +  L++  E + +   ELQ EL
Sbjct: 184 VEL--TSSGDYIEAREVLEEAEEHMIALRSIMERIPSLLAELQTEL 227


>gnl|CDD|233791 TIGR02231, TIGR02231, conserved hypothetical protein.  This family
           consists of proteins over 500 amino acids long in
           Caenorhabditis elegans and several bacteria (Pseudomonas
           aeruginosa, Nostoc sp. PCC 7120, Leptospira interrogans,
           etc.). The function is unknown.
          Length = 525

 Score = 35.1 bits (81), Expect = 0.045
 Identities = 23/111 (20%), Positives = 44/111 (39%), Gaps = 10/111 (9%)

Query: 210 PDTIDVQQMKAQAREEKNAK---QQQRDKLQLEIA----AREKAEKKHQESVERLKQLEV 262
           PD   + +++ Q RE +      + + D L+         RE   +  ++S +R +    
Sbjct: 68  PDPERLAELRKQIRELEAELRDLEDRGDALKALAKFLEDIREGLTEPIKDSAKRNEPDLK 127

Query: 263 EMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQS 313
           E  +           I RL  + ++ +    ELE + +ELQ EL  +    
Sbjct: 128 EWFQAFDFNGSE---IERLLTEDREAERRIRELEKQLSELQNELNALLTGK 175


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 943 to 1234 amino
           acids in length. This family contains a P-loop motif
           suggesting it is a nucleotide binding protein. It may be
           involved in replication.
          Length = 1198

 Score = 35.4 bits (82), Expect = 0.046
 Identities = 26/97 (26%), Positives = 50/97 (51%), Gaps = 6/97 (6%)

Query: 200 NHELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQ 259
           N E Y    +   +  +   A A  E+  + +  D+      A EKA+++ +++   ++Q
Sbjct: 441 NEEEYELELRLGRLKQRLDSATATPEELEQLEINDE------ALEKAQEEQEQAEANVEQ 494

Query: 260 LEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELE 296
           L+ E+ +  +   EA E ++R E +L QL+ A +ELE
Sbjct: 495 LQSELRQLRKRRDEALEALQRAERRLLQLRQALDELE 531



 Score = 34.7 bits (80), Expect = 0.061
 Identities = 22/114 (19%), Positives = 46/114 (40%), Gaps = 22/114 (19%)

Query: 222 AREEKNAK--------QQQRDKLQLEIAAREKAEKKHQESV----ERLKQLEVEMA---- 265
               K A+        +QQ  K +LE+ A  + +++ +  +     +LKQL  +      
Sbjct: 648 RTALKQARLDLQRLQNEQQSLKDKLEL-AIAERKQQAETQLRQLDAQLKQLLEQQQAFLE 706

Query: 266 ----KRDQDLMEAQEMIRRLEEQLK-QLQAAKEELEARQTELQLELQPIRNQSN 314
                  +   E     + +E +L  QL      +EA +T+ +  L+ ++ Q +
Sbjct: 707 ALKDDFRELRTERLAKWQVVEGELDNQLAQLSAAIEAARTQAKARLKELKKQYD 760



 Score = 34.7 bits (80), Expect = 0.070
 Identities = 28/107 (26%), Positives = 50/107 (46%), Gaps = 15/107 (14%)

Query: 213 IDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLK--------QLEVEM 264
                 + QA E+      + ++ +   A    A K+ +  ++RL+        +LE+ +
Sbjct: 617 QSAVAKQKQAEEQLVQANAELEEQKRAEAEARTALKQARLDLQRLQNEQQSLKDKLELAI 676

Query: 265 AKRDQDLMEAQEMIRRLEEQLKQL----QAAKEELEARQTELQLELQ 307
           A+R Q   +A+  +R+L+ QLKQL    QA  E L+    EL+ E  
Sbjct: 677 AERKQ---QAETQLRQLDAQLKQLLEQQQAFLEALKDDFRELRTERL 720



 Score = 34.7 bits (80), Expect = 0.080
 Identities = 20/114 (17%), Positives = 48/114 (42%), Gaps = 16/114 (14%)

Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQ-------LEVEMAKRD 268
            +++ Q R  ++  ++ RD+L  E++A        +  +E L+         ++E  + D
Sbjct: 291 NRLRQQLRTLEDQLKEARDELNQELSAANAKLAADRSELELLEDQKGAFEDADIEQLQAD 350

Query: 269 QD--------LMEAQEMIRRLEEQLKQLQAAKEELE-ARQTELQLELQPIRNQS 313
            D        L E +  +  L  + + +Q   E L+   + +L+ +L+    + 
Sbjct: 351 LDQLPSIRSELEEVEARLDALTGKHQDVQRKYERLKQKIKEQLERDLEKNNERL 404



 Score = 32.7 bits (75), Expect = 0.25
 Identities = 25/122 (20%), Positives = 46/122 (37%), Gaps = 15/122 (12%)

Query: 201 HELYMRRRKPDTIDVQQMKAQAREEKNAK-----QQQRDKLQLEIAAREKAEKKHQESVE 255
            E  + +       +++ K + +            Q R +L+       + E + +  + 
Sbjct: 393 LERDLEKNNERLAAIREEKDRQKAAIEEDLQALESQLRQQLEAGKLEFNEEEYELELRLG 452

Query: 256 RLKQL--------EVEMAKR--DQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLE 305
           RLKQ         E        D+ L +AQE   + E  ++QLQ+   +L  R+ E    
Sbjct: 453 RLKQRLDSATATPEELEQLEINDEALEKAQEEQEQAEANVEQLQSELRQLRKRRDEALEA 512

Query: 306 LQ 307
           LQ
Sbjct: 513 LQ 514



 Score = 32.4 bits (74), Expect = 0.42
 Identities = 22/108 (20%), Positives = 52/108 (48%), Gaps = 1/108 (0%)

Query: 211 DTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQD 270
             +DV    A   E +   QQ  + LQ  +A +++AE++  ++   L++ +   A+    
Sbjct: 591 QRLDVPDYAANETELRERLQQAEEALQSAVAKQKQAEEQLVQANAELEEQKRAEAEARTA 650

Query: 271 LMEAQEMIRRLEEQLKQLQAAKEE-LEARQTELQLELQPIRNQSNLNK 317
           L +A+  ++RL+ + + L+   E  +  R+ + + +L+ +  Q     
Sbjct: 651 LKQARLDLQRLQNEQQSLKDKLELAIAERKQQAETQLRQLDAQLKQLL 698



 Score = 32.4 bits (74), Expect = 0.42
 Identities = 19/95 (20%), Positives = 42/95 (44%), Gaps = 11/95 (11%)

Query: 216 QQMKAQAREEKNAKQQQR--------DKLQLEIAAREKAEKKHQESVERLKQLEVEMAKR 267
           + +K   RE +  +  +         ++L    AA E A  + +    RLK+L+ +  + 
Sbjct: 706 EALKDDFRELRTERLAKWQVVEGELDNQLAQLSAAIEAARTQAKA---RLKELKKQYDRE 762

Query: 268 DQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTEL 302
              L      ++ L+ Q+++L+   E +  R+ E+
Sbjct: 763 LASLDVDPNTVKELKRQIEELETTIERIAVRRPEV 797



 Score = 29.7 bits (67), Expect = 2.3
 Identities = 28/129 (21%), Positives = 55/129 (42%), Gaps = 18/129 (13%)

Query: 203 LYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEV 262
           L  ++   +  D++Q++A   +  + + +  + ++  + A     +  Q   ERLKQ   
Sbjct: 332 LEDQKGAFEDADIEQLQADLDQLPSIRSELEE-VEARLDALTGKHQDVQRKYERLKQKIK 390

Query: 263 EMAKRDQDLMEAQEMIRR----------------LEEQL-KQLQAAKEELEARQTELQLE 305
           E  +RD +    +    R                LE QL +QL+A K E    + EL+L 
Sbjct: 391 EQLERDLEKNNERLAAIREEKDRQKAAIEEDLQALESQLRQQLEAGKLEFNEEEYELELR 450

Query: 306 LQPIRNQSN 314
           L  ++ + +
Sbjct: 451 LGRLKQRLD 459


>gnl|CDD|198139 smart01071, CDC37_N, Cdc37 N terminal kinase binding.  Cdc37 is a
           molecular chaperone required for the activity of
           numerous eukaryotic protein kinases. This domain
           corresponds to the N terminal domain which binds
           predominantly to protein kinases.and is found N terminal
           to the Hsp (Heat shocked protein) 90-binding domain.
           Expression of a construct consisting of only the
           N-terminal domain of Saccharomyces pombe Cdc37 results
           in cellular viability. This indicates that interactions
           with the cochaperone Hsp90 may not be essential for
           Cdc37 function.
          Length = 154

 Score = 33.9 bits (78), Expect = 0.047
 Identities = 21/99 (21%), Positives = 45/99 (45%), Gaps = 9/99 (9%)

Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
           Q+   QAR E+   + +  K +L +               R ++L  E    ++ L E Q
Sbjct: 34  QRDIHQARVERME-EIKNLKYELIMNDHLNKRIDKLLKGLREEELSPETPTYNEMLAELQ 92

Query: 276 EMIRRLEEQ--------LKQLQAAKEELEARQTELQLEL 306
           + +++  E+        L++L+  +++L+  Q EL+ +L
Sbjct: 93  DQLKKELEEANGDSEGLLEELKKHRDKLKKEQKELRKKL 131


>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein.  This domain is found in
           cell division proteins which are required for
           kinetochore-spindle association.
          Length = 321

 Score = 34.6 bits (80), Expect = 0.052
 Identities = 17/94 (18%), Positives = 41/94 (43%), Gaps = 1/94 (1%)

Query: 226 KNAKQQQRDKLQLEIAAREKAEKKHQE-SVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQ 284
           K   +++   L+ EIA+  +   +        L++   E+      + E ++ +  L+++
Sbjct: 177 KPKLRKKLQALKEEIASLRQLADELNLCDPLELEKARQELRSLSVKISEKRKQLEELQQE 236

Query: 285 LKQLQAAKEELEARQTELQLELQPIRNQSNLNKV 318
           L++L  A E L  +++EL  E+          + 
Sbjct: 237 LQELTIAIEALTNKKSELLEEIAEAEKIREECRG 270


>gnl|CDD|222709 pfam14362, DUF4407, Domain of unknown function (DUF4407).  This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 366 and 597 amino
           acids in length. There is a single completely conserved
           residue R that may be functionally important.
          Length = 297

 Score = 34.5 bits (80), Expect = 0.056
 Identities = 24/103 (23%), Positives = 40/103 (38%), Gaps = 18/103 (17%)

Query: 217 QMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVE------------M 264
           Q+ A  R +  A   +   LQ EI   +       +      Q E E            +
Sbjct: 128 QVAAGFRPKIAALTAEIAALQAEIDEAQAEVNAAYQEA----QCEAEGTGGTGVAGKGPV 183

Query: 265 AKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
            K  ++ ++A +   RLE    +L AA  +LEA++  L+   Q
Sbjct: 184 YKEKREKLDAAQ--ARLETLKARLDAAIAQLEAQKAALERNRQ 224


>gnl|CDD|217902 pfam04111, APG6, Autophagy protein Apg6.  In yeast, 15 Apg proteins
           coordinate the formation of autophagosomes. Autophagy is
           a bulk degradation process induced by starvation in
           eukaryotic cells. Apg6/Vps30p has two distinct functions
           in the autophagic process, either associated with the
           membrane or in a retrieval step of the carboxypeptidase
           Y sorting pathway.
          Length = 356

 Score = 34.1 bits (78), Expect = 0.074
 Identities = 24/115 (20%), Positives = 43/115 (37%), Gaps = 4/115 (3%)

Query: 205 MRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQES--VERLKQLEV 262
           +     D    +    Q   +   + Q  +    E    E  E K +E   ++ L++LE 
Sbjct: 34  LDSELRDAEKERDTYKQYLSK--LESQNVEISNYEALDSELDELKKEEERLLDELEELEK 91

Query: 263 EMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNLNK 317
           E    D +L+E QE   +LE +  Q        +    +L+  LQ +  Q   + 
Sbjct: 92  EDDDLDGELVELQEEKEQLENEELQYLREYNLFDRNNLQLEDNLQSLELQYEYSL 146


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 34.2 bits (78), Expect = 0.078
 Identities = 25/123 (20%), Positives = 56/123 (45%), Gaps = 10/123 (8%)

Query: 189 VNKRILALCMGNHELYMRRRKPDTIDVQQMKAQ--------AREEKNAKQQQRDKLQLEI 240
            +K+++     ++E  +  R+ D  D+++ ++Q         +EE + KQ   DK Q + 
Sbjct: 178 SDKKVVEALREDNEKGVNFRR-DMTDLKERESQEDAKRAQQLKEELDKKQIDADKAQQKA 236

Query: 241 AAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQT 300
              +    K ++ V + KQ E +   +  D    +E  +  E Q ++++ A+ E++    
Sbjct: 237 DFAQDNADKQRDEVRQ-KQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIKKNDE 295

Query: 301 ELQ 303
           E  
Sbjct: 296 EAL 298



 Score = 28.4 bits (63), Expect = 6.4
 Identities = 26/111 (23%), Positives = 46/111 (41%), Gaps = 10/111 (9%)

Query: 208 RKPDTIDVQQMKAQAREEKNAKQQQRD---KLQLEIAAREKA---EKKHQESVERLKQLE 261
           +K    D  Q KA    + NA +Q+ +   K Q      + A     K  + V   ++ E
Sbjct: 224 KKQIDADKAQQKAD-FAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKRE 282

Query: 262 VEMAKRD--QDLMEAQEMIRRLEEQLKQLQAAKEELEARQT-ELQLELQPI 309
           +E A+ +  ++  EA +        LKQ   A E+    +  E Q + +P+
Sbjct: 283 IEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKREPV 333


>gnl|CDD|217933 pfam04156, IncA, IncA protein.  Chlamydia trachomatis is an
           obligate intracellular bacterium that develops within a
           parasitophorous vacuole termed an inclusion. The
           inclusion is non-fusogenic with lysosomes but intercepts
           lipids from a host cell exocytic pathway. Initiation of
           chlamydial development is concurrent with modification
           of the inclusion membrane by a set of C.
           trachomatis-encoded proteins collectively designated
           Incs. One of these Incs, IncA, is functionally
           associated with the homotypic fusion of inclusions. This
           family probably includes members of the wider Inc family
           rather than just IncA.
          Length = 186

 Score = 33.6 bits (77), Expect = 0.081
 Identities = 17/65 (26%), Positives = 32/65 (49%), Gaps = 4/65 (6%)

Query: 248 KKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
            K      R ++LE        +L E ++ +  L+E+L+ L+    ELE+   +L+ +LQ
Sbjct: 62  LKAPVQSVRPQKLEEL----QGELSELKQQLSELQEELEDLEERIAELESELEDLKEDLQ 117

Query: 308 PIRNQ 312
            +R  
Sbjct: 118 LLREL 122



 Score = 32.4 bits (74), Expect = 0.18
 Identities = 17/85 (20%), Positives = 37/85 (43%), Gaps = 6/85 (7%)

Query: 226 KNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQL 285
           K   Q  R +   E+       K      ++L +L+ E+   ++ + E +  +  L+E L
Sbjct: 63  KAPVQSVRPQKLEELQGELSELK------QQLSELQEELEDLEERIAELESELEDLKEDL 116

Query: 286 KQLQAAKEELEARQTELQLELQPIR 310
           + L+   + LE R   L+  ++ + 
Sbjct: 117 QLLRELLKSLEERLESLEESIKELA 141



 Score = 29.3 bits (66), Expect = 1.7
 Identities = 18/76 (23%), Positives = 39/76 (51%), Gaps = 1/76 (1%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREK-AEKKHQESVERLKQLEVEMAKRDQDLM 272
           D+++     RE   + +++ + L+  I    K   +  Q+  E +++L  E+ +  ++L 
Sbjct: 111 DLKEDLQLLRELLKSLEERLESLEESIKELAKELRELRQDLREEVEELREELERLQENLQ 170

Query: 273 EAQEMIRRLEEQLKQL 288
             QE I+ L+  L+QL
Sbjct: 171 RLQEAIQELQSLLEQL 186



 Score = 29.0 bits (65), Expect = 2.4
 Identities = 29/122 (23%), Positives = 50/122 (40%), Gaps = 15/122 (12%)

Query: 203 LYMRRRKPDTIDVQQMKAQA-REEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLE 261
           L     K     V+  K +  + E +  +QQ  +LQ E+   E  E++  E    L+ L+
Sbjct: 57  LLCLLLKAPVQSVRPQKLEELQGELSELKQQLSELQEEL---EDLEERIAELESELEDLK 113

Query: 262 VEMAKRDQDLMEAQEMIRRLEEQLKQL-----------QAAKEELEARQTELQLELQPIR 310
            ++    + L   +E +  LEE +K+L           +   EEL      LQ  LQ ++
Sbjct: 114 EDLQLLRELLKSLEERLESLEESIKELAKELRELRQDLREEVEELREELERLQENLQRLQ 173

Query: 311 NQ 312
             
Sbjct: 174 EA 175


>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD.  This
           model describes the DndB protein encoded by an operon
           associated with a sulfur-containing modification to DNA.
           The operon is sporadically distributed in bacteria, much
           like some restriction enzyme operons. DndD is described
           as a putative ATPase. The small number of examples known
           so far include species from among the Firmicutes,
           Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
           metabolism, Restriction/modification].
          Length = 650

 Score = 34.3 bits (79), Expect = 0.090
 Identities = 33/120 (27%), Positives = 60/120 (50%), Gaps = 10/120 (8%)

Query: 199 GNHELYMRRRKPD---TIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVE 255
           G+    +RRRK     +  + +++A   E K  + ++ + L  EIA      ++ Q S+E
Sbjct: 189 GDLTNVLRRRKKSELPSSILSEIEALEAELK-EQSEKYEDLAQEIAHLRNELEEAQRSLE 247

Query: 256 RLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNL 315
            L++   +      DL E +E   +LE QLK+++AA++   A+  EL  +  P+    NL
Sbjct: 248 SLEK---KFRSEGGDLFEERE---QLERQLKEIEAARKANRAQLRELAADPLPLLLIPNL 301


>gnl|CDD|129694 TIGR00606, rad50, rad50.  All proteins in this family for which
           functions are known are involvedin recombination,
           recombinational repair, and/or non-homologous end
           joining.They are components of an exonuclease complex
           with MRE11 homologs. This family is distantly related to
           the SbcC family of bacterial proteins.This family is
           based on the phylogenomic analysis of JA Eisen (1999,
           Ph.D. Thesis, Stanford University).
          Length = 1311

 Score = 34.3 bits (78), Expect = 0.090
 Identities = 23/93 (24%), Positives = 44/93 (47%), Gaps = 8/93 (8%)

Query: 225 EKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLE----VEMAKRDQDLMEAQEMIRR 280
           +   KQ ++D  +LE     K EK  Q + E+L  L       + +++++L++ Q  + +
Sbjct: 275 KSRKKQMEKDNSELE----LKMEKVFQGTDEQLNDLYHNHQRTVREKERELVDCQRELEK 330

Query: 281 LEEQLKQLQAAKEELEARQTELQLELQPIRNQS 313
           L ++ + L   K EL   Q  LQL+    +   
Sbjct: 331 LNKERRLLNQEKTELLVEQGRLQLQADRHQEHI 363



 Score = 32.3 bits (73), Expect = 0.36
 Identities = 22/97 (22%), Positives = 44/97 (45%), Gaps = 4/97 (4%)

Query: 209 KPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLK----QLEVEM 264
           K    D+ +   Q  +EK  KQ + D +  +I    K  +  QE ++ LK    +L+ E 
Sbjct: 814 KLQGSDLDRTVQQVNQEKQEKQHELDTVVSKIELNRKLIQDQQEQIQHLKSKTNELKSEK 873

Query: 265 AKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTE 301
            +   +L   Q+   +L E   ++Q+   E++  + +
Sbjct: 874 LQIGTNLQRRQQFEEQLVELSTEVQSLIREIKDAKEQ 910



 Score = 29.2 bits (65), Expect = 3.9
 Identities = 19/94 (20%), Positives = 50/94 (53%), Gaps = 2/94 (2%)

Query: 216 QQMKAQAREEKNAKQQQRDKLQL--EIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
           Q+++    E K  KQ +    ++  +I ++E   +  +E V+  +     +  R +++  
Sbjct: 200 QKVQEHQMELKYLKQYKEKACEIRDQITSKEAQLESSREIVKSYENELDPLKNRLKEIEH 259

Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
               I +L+ ++K L++ K+++E   +EL+L+++
Sbjct: 260 NLSKIMKLDNEIKALKSRKKQMEKDNSELELKME 293


>gnl|CDD|241235 cd00900, PH-like, Pleckstrin homology-like domain.  The PH-like
           family includes the PH domain, both the Shc-like and
           IRS-like PTB domains, the ran-binding domain, the EVH1
           domain, a domain in neurobeachin and the third domain of
           FERM. All of these domains have a PH fold, but lack
           significant sequence similarity. They are generally
           involved in targeting to protein to the appropriate
           cellular location or interacting with a binding partner.
           This domain family possesses multiple functions
           including the ability to bind inositol phosphates and to
           other proteins.
          Length = 88

 Score = 31.6 bits (72), Expect = 0.10
 Identities = 11/54 (20%), Positives = 19/54 (35%), Gaps = 6/54 (11%)

Query: 148 TPKIGFPWSEIRNISFN-----DRKFIIKPIDKKAPDFVFFAPRVRVNKRILAL 196
             K+  P S+I  ++ +     DR  +++  D K     F          I A 
Sbjct: 36  RLKLKIPISDIVKVNPSRASDSDRALVLRLKD-KRESLGFSFKSKEEAVLIKAA 88


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 33.9 bits (78), Expect = 0.11
 Identities = 17/68 (25%), Positives = 37/68 (54%)

Query: 243 REKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTEL 302
           + K E++ +E     ++   E+   ++ + + +E + RLEE+  +L+   EEL+    +L
Sbjct: 396 KVKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKREIEKL 455

Query: 303 QLELQPIR 310
           + EL+  R
Sbjct: 456 ESELERFR 463



 Score = 33.1 bits (76), Expect = 0.21
 Identities = 16/68 (23%), Positives = 33/68 (48%), Gaps = 1/68 (1%)

Query: 239 EIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEAR 298
           E   REK   + +E  E +   E  + K ++ +   +E    L+ +L++L+   E+LE+ 
Sbjct: 400 EERPREKEGTEEEERRE-ITVYEKRIKKLEETVERLEEENSELKRELEELKREIEKLESE 458

Query: 299 QTELQLEL 306
               + E+
Sbjct: 459 LERFRREV 466



 Score = 31.6 bits (72), Expect = 0.62
 Identities = 27/100 (27%), Positives = 49/100 (49%), Gaps = 13/100 (13%)

Query: 218 MKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEM 277
            K +  E    K+   ++ + EI   EK  KK +E+VERL++   E ++  ++L E +  
Sbjct: 395 SKVKEEERPREKEGTEEEERREITVYEKRIKKLEETVERLEE---ENSELKRELEELKRE 451

Query: 278 IRRLEEQLKQLQAAKE-------ELEARQTE---LQLELQ 307
           I +LE +L++ +           E+ AR      L+ EL+
Sbjct: 452 IEKLESELERFRREVRDKVRKDREIRARDRRIERLEKELE 491



 Score = 28.1 bits (63), Expect = 7.1
 Identities = 13/72 (18%), Positives = 34/72 (47%), Gaps = 5/72 (6%)

Query: 241 AAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQT 300
            A  K +++ +   +   + E        +     + I++LEE +++L+    EL+    
Sbjct: 392 EALSKVKEEERPREKEGTEEEERREITVYE-----KRIKKLEETVERLEEENSELKRELE 446

Query: 301 ELQLELQPIRNQ 312
           EL+ E++ + ++
Sbjct: 447 ELKREIEKLESE 458


>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
           potential nuclease [General function prediction only].
          Length = 290

 Score = 33.7 bits (77), Expect = 0.11
 Identities = 29/101 (28%), Positives = 56/101 (55%), Gaps = 17/101 (16%)

Query: 207 RRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAK 266
           +R P+ +D+++   + +E+    Q+++++L  E+   E+ E +++E  ERLK+LEVE + 
Sbjct: 125 QRVPEYMDLKEDYEELKEKLEELQKEKEELLKEL---EELEAEYEEVQERLKRLEVENS- 180

Query: 267 RDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
                        RLEE LK+L     +L+ R  EL+  ++
Sbjct: 181 -------------RLEEMLKKLPGEVYDLKKRWDELEPGVE 208



 Score = 29.5 bits (66), Expect = 2.0
 Identities = 15/47 (31%), Positives = 28/47 (59%)

Query: 266 KRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
           +R  + M+ +E    L+E+L++LQ  KEEL     EL+ E + ++ +
Sbjct: 125 QRVPEYMDLKEDYEELKEKLEELQKEKEELLKELEELEAEYEEVQER 171


>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
           (DUF2058).  This domain, found in various prokaryotic
           proteins, has no known function.
          Length = 177

 Score = 33.0 bits (76), Expect = 0.11
 Identities = 20/76 (26%), Positives = 41/76 (53%), Gaps = 3/76 (3%)

Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
           Q +KA   ++K AK+ +++K +    AR+ A+    E  +  ++ + E A+RD++L   +
Sbjct: 5   QLLKAGLVDKKKAKKAKKEKRKQRKQARKGADDGDDELKQAAEEAKAEKAERDRELNRQR 64

Query: 276 EMIRRLEE---QLKQL 288
           +     +    Q+KQL
Sbjct: 65  QAEAEQKAIVAQIKQL 80


>gnl|CDD|233973 TIGR02680, TIGR02680, TIGR02680 family protein.  Members of this
           protein family belong to a conserved gene four-gene
           neighborhood found sporadically in a phylogenetically
           broad range of bacteria: Nocardia farcinica,
           Symbiobacterium thermophilum, and Streptomyces
           avermitilis (Actinobacteria), Geobacillus kaustophilus
           (Firmicutes), Azoarcus sp. EbN1 and Ralstonia
           solanacearum (Betaproteobacteria). Proteins in this
           family average over 1400 amino acids in length
           [Hypothetical proteins, Conserved].
          Length = 1353

 Score = 34.0 bits (78), Expect = 0.12
 Identities = 16/110 (14%), Positives = 36/110 (32%), Gaps = 9/110 (8%)

Query: 201 HELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQL 260
           H L +  R  +        A+ R       +     +       +A  + +E+  RL+ L
Sbjct: 861 HTLEVAVR--ELRHAATRAAEQRAR---AARAESDAREAAEDAAEARAEAEEASLRLRTL 915

Query: 261 EVEMAKRDQDLM----EAQEMIRRLEEQLKQLQAAKEELEARQTELQLEL 306
           E  +     ++     E +  +     +L +L  A    E  +   + + 
Sbjct: 916 EESVGAMVDEIRARLAETRAALASGGRELPRLAEALATAEEARGRAEEKR 965



 Score = 32.9 bits (75), Expect = 0.29
 Identities = 22/100 (22%), Positives = 41/100 (41%), Gaps = 5/100 (5%)

Query: 220 AQAREEKNAKQQQRDKLQLEI-AAREKAEKKHQESV----ERLKQLEVEMAKRDQDLMEA 274
             AREE+     + + L+ E  A R + E           E L++   +         +A
Sbjct: 293 ETAREEERELDARTEALEREADALRTRLEALQGSPAYQDAEELERARADAEALQAAAADA 352

Query: 275 QEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSN 314
           ++ IR  E +L++ +   +E   R  + + EL+  R Q  
Sbjct: 353 RQAIREAESRLEEERRRLDEEAGRLDDAERELRAAREQLA 392



 Score = 31.3 bits (71), Expect = 0.71
 Identities = 21/81 (25%), Positives = 34/81 (41%), Gaps = 6/81 (7%)

Query: 238 LEIAARE--KAEKKHQESVERLKQLEVEMAKRDQDLME----AQEMIRRLEEQLKQLQAA 291
           LE+A RE   A  +  E   R  + E +  +  +D  E    A+E   RL    + + A 
Sbjct: 863 LEVAVRELRHAATRAAEQRARAARAESDAREAAEDAAEARAEAEEASLRLRTLEESVGAM 922

Query: 292 KEELEARQTELQLELQPIRNQ 312
            +E+ AR  E +  L     +
Sbjct: 923 VDEIRARLAETRAALASGGRE 943



 Score = 30.2 bits (68), Expect = 1.6
 Identities = 20/98 (20%), Positives = 42/98 (42%), Gaps = 10/98 (10%)

Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
           ++++A  R  +N  Q+ R         R +A         RL+  + +  +  +DL  A+
Sbjct: 240 ERLEALERALRNFLQRYR--RYARTMLRRRAT--------RLRSAQTQYDQLSRDLGRAR 289

Query: 276 EMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQS 313
           + +    E+ ++L A  E LE     L+  L+ ++   
Sbjct: 290 DELETAREEERELDARTEALEREADALRTRLEALQGSP 327


>gnl|CDD|218661 pfam05622, HOOK, HOOK protein.  This family consists of several
           HOOK1, 2 and 3 proteins from different eukaryotic
           organisms. The different members of the human gene
           family are HOOK1, HOOK2 and HOOK3. Different domains
           have been identified in the three human HOOK proteins,
           and it was demonstrated that the highly conserved
           NH2-domain mediates attachment to microtubules, whereas
           the central coiled-coil motif mediates homodimerisation
           and the more divergent C-terminal domains are involved
           in binding to specific organelles (organelle-binding
           domains). It has been demonstrated that endogenous HOOK3
           binds to Golgi membranes, whereas both HOOK1 and HOOK2
           are localised to discrete but unidentified cellular
           structures. In mice the Hook1 gene is predominantly
           expressed in the testis. Hook1 function is necessary for
           the correct positioning of microtubular structures
           within the haploid germ cell. Disruption of Hook1
           function in mice causes abnormal sperm head shape and
           fragile attachment of the flagellum to the sperm head.
          Length = 713

 Score = 33.6 bits (77), Expect = 0.13
 Identities = 20/95 (21%), Positives = 45/95 (47%), Gaps = 1/95 (1%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
           D  +   +   +     Q+  +LQ ++   +KA ++     E    L+ ++ +  + L E
Sbjct: 492 DANRRNNELETQLRLANQRILELQQQVEDLQKALQEQGSKSEDSSLLKSKLEEHLEQLHE 551

Query: 274 AQEMIRRLEEQLKQLQA-AKEELEARQTELQLELQ 307
           A E +++  EQ+++L+    + L  +  EL+  LQ
Sbjct: 552 ANEELQKKREQIEELEPDQDQNLSRKIAELEAALQ 586



 Score = 28.2 bits (63), Expect = 6.5
 Identities = 27/86 (31%), Positives = 40/86 (46%), Gaps = 13/86 (15%)

Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQ------ESVERLKQLEVEMAKRDQ 269
           +Q+K    EE+NA   Q   +QLE   ++    + Q      +  E   +L  E  K D+
Sbjct: 332 RQVKLL--EERNAMYMQNT-VQLEEELKKANAARGQLETYKRQVQELHAKLSEESKKADK 388

Query: 270 DLMEAQEMIRRLEEQLKQLQAAKEEL 295
              E     +RLEE+L+ LQ  KE L
Sbjct: 389 LEFE----YKRLEEKLEALQKEKERL 410



 Score = 27.8 bits (62), Expect = 9.8
 Identities = 19/96 (19%), Positives = 46/96 (47%), Gaps = 12/96 (12%)

Query: 217 QMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQE 276
           +   +A EE   K++Q ++L+           + Q    ++ +LE  + K+D+D+   +E
Sbjct: 547 EQLHEANEELQKKREQIEELE---------PDQDQNLSRKIAELEAALQKKDEDMRAMEE 597

Query: 277 MIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
             ++  E+ +++      L+ +Q     E+Q ++ Q
Sbjct: 598 RYKKYVEKAREVIKT---LDPKQNPASPEIQLLKKQ 630


>gnl|CDD|181619 PRK09039, PRK09039, hypothetical protein; Validated.
          Length = 343

 Score = 33.4 bits (77), Expect = 0.13
 Identities = 32/135 (23%), Positives = 53/135 (39%), Gaps = 15/135 (11%)

Query: 188 RVNKRILALCMGNHELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAE 247
           R+N +I  L      L + R+     D+Q   A  R   +A + +R +LQ   A   +  
Sbjct: 57  RLNSQIAEL---ADLLSLERQG--NQDLQDSVANLRASLSAAEAERSRLQ---ALLAELA 108

Query: 248 KKHQESVERLKQLEVEMAKRDQDLMEAQ-------EMIRRLEEQLKQLQAAKEELEARQT 300
                +  R  +L  E+    Q    A        + I  L  QL  L+AA +  E R  
Sbjct: 109 GAGAAAEGRAGELAQELDSEKQVSARALAQVELLNQQIAALRRQLAALEAALDASEKRDR 168

Query: 301 ELQLELQPIRNQSNL 315
           E Q ++  +  + N+
Sbjct: 169 ESQAKIADLGRRLNV 183



 Score = 30.3 bits (69), Expect = 1.2
 Identities = 26/93 (27%), Positives = 36/93 (38%), Gaps = 7/93 (7%)

Query: 223 REEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLK--QLEVEMAKRDQDLMEAQEMIRR 280
             E + K    D+L  +IA     E     S+ER     L+  +A     L  A+    R
Sbjct: 45  SREISGKDSALDRLNSQIA-----ELADLLSLERQGNQDLQDSVANLRASLSAAEAERSR 99

Query: 281 LEEQLKQLQAAKEELEARQTELQLELQPIRNQS 313
           L+  L +L  A    E R  EL  EL   +  S
Sbjct: 100 LQALLAELAGAGAAAEGRAGELAQELDSEKQVS 132


>gnl|CDD|241347 cd13193, FERM_C_FARP1-like, FERM, RhoGEF and pleckstrin
           domain-containing protein 1 and related proteins FERM
           domain.  Members here include FARP1 (also called
           Chondrocyte-derived ezrin-like protein; PH
           domain-containing family C member 2), FARP2 (also called
           FIR/FERM domain including RhoGEF; FGD1-related
           Cdc42-GEF/FRG), and FRMD7(FERM domain containing 7).
           FARP1 and FARP2 are members of the Dbl family guanine
           nucleotide exchange factors (GEFs) which are upstream
           positive regulators of Rho GTPases. FARP1 has increased
           expression in differentiated chondrocytes. FARP2 is
           thought to regulate neurite remodeling by mediating the
           signaling pathways from membrane proteins to Rac. It is
           found in brain, lung, and testis, as well as embryonic
           hippocampal and cortical neurons.  These members are
           composed of a N-terminal FERM domain, a proline-rich
           (PR) domain, Dbl-homology (DH), and two C-terminal PH
           domains.  Other members in this family do not contain
           the DH domains such as the Human FERM domain containing
           protein 7 and Caenorhabditis elegans CFRM3, both of
           which have unknown functions. They contain an N-terminal
           FERM domain, a PH domain, followed by a FA (FERM
           adjacent) domain. The FERM domain has a cloverleaf
           tripart structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
           C-lobe/A-lobe, B-lobe, C-lobe/F1, F2, F3). The C-lobe/F3
           within the FERM domain is part of the PH domain family.
           The FERM domain is found in the cytoskeletal-associated
           proteins such as ezrin, moesin, radixin, 4.1R, and
           merlin. These proteins provide a link between the
           membrane and cytoskeleton and are involved in signal
           transduction pathways. The FERM domain is also found in
           protein tyrosine phosphatases (PTPs) , the tyrosine
           kinases FAK and JAK, in addition to other proteins
           involved in signaling. This domain is structurally
           similar to the PH and PTB domains and consequently is
           capable of binding to both peptides and phospholipids at
           different sites.
          Length = 122

 Score = 31.9 bits (73), Expect = 0.14
 Identities = 9/20 (45%), Positives = 15/20 (75%)

Query: 152 GFPWSEIRNISFNDRKFIIK 171
            F W++IR +SF  ++F+IK
Sbjct: 44  TFSWAKIRKLSFKRKRFLIK 63



 Score = 30.4 bits (69), Expect = 0.46
 Identities = 23/79 (29%), Positives = 38/79 (48%), Gaps = 12/79 (15%)

Query: 45  IAQDLEMYGVNYFEIKNKKGTELWLGVDALGLNIYEKEDNLTNVMSQDVKKENPLQFK-- 102
           IA+  E+YG+     K+++G +L L V  +G+ +++     T + +    K   L FK  
Sbjct: 2   IARRCELYGIRLHPAKDREGVDLNLAVAHMGILVFQ---GFTKINTFSWAKIRKLSFKRK 58

Query: 103 -FRAKFYPE------DVVE 114
            F  K +PE      D VE
Sbjct: 59  RFLIKLHPEAYGYYKDTVE 77


>gnl|CDD|235319 PRK04914, PRK04914, ATP-dependent helicase HepA; Validated.
          Length = 956

 Score = 33.7 bits (78), Expect = 0.14
 Identities = 26/98 (26%), Positives = 45/98 (45%), Gaps = 20/98 (20%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQL-EVEMAKRDQDLM 272
           D+ ++  +A  E  A+ Q R   +L   A+++A++K    + RL+ L  V    RD    
Sbjct: 873 DIHKLLQKA--EAQAEAQAR---ELIEQAKQEADEKLSAELSRLEALKAVNPNIRD---- 923

Query: 273 EAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIR 310
              + I  LE Q +++  A +       + QL L  IR
Sbjct: 924 ---DEIEALESQRQEVLEALD-------QAQLRLDAIR 951


>gnl|CDD|182791 PRK10865, PRK10865, protein disaggregation chaperone; Provisional.
          Length = 857

 Score = 33.7 bits (77), Expect = 0.14
 Identities = 29/133 (21%), Positives = 64/133 (48%), Gaps = 11/133 (8%)

Query: 193 ILALCMGNHELYMRRRKPD-TID-VQQMKAQAREEKNAKQQQRDKLQ---LEIAAREKAE 247
           I+A    +H     R+ PD  ID + +  +  R + ++K ++ D+L    +++   ++A 
Sbjct: 370 IVAAATLSHRYIADRQLPDKAIDLIDEAASSIRMQIDSKPEELDRLDRRIIQLKLEQQAL 429

Query: 248 KKHQE--SVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLE 305
            K  +  S +RL  L  E++ +++   E +E  +  +  L   Q  K ELE    + ++ 
Sbjct: 430 MKESDEASKKRLDMLNEELSDKERQYSELEEEWKAEKASLSGTQTIKAELE----QAKIA 485

Query: 306 LQPIRNQSNLNKV 318
           ++  R   +L ++
Sbjct: 486 IEQARRVGDLARM 498


>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein.  Rotavirus particles
           consist of three concentric proteinaceous capsid layers.
           The innermost capsid (core) is made of VP2. The genomic
           RNA and the two minor proteins VP1 and VP3 are
           encapsidated within this layer. The N-terminus of
           rotavirus VP2 is necessary for the encapsidation of VP1
           and VP3.
          Length = 887

 Score = 33.8 bits (77), Expect = 0.15
 Identities = 26/103 (25%), Positives = 49/103 (47%), Gaps = 8/103 (7%)

Query: 204 YMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVE 263
           Y  RR+ +         + +E+ + KQ Q+++++L    +EK   K +E V       V+
Sbjct: 3   YRNRREANI----NNNDRMQEKDDEKQDQKNRMEL----KEKVLDKKEEVVTDNVDSPVK 54

Query: 264 MAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLEL 306
                ++L  A E+ +  +E+ KQL    +  E  Q E+Q E+
Sbjct: 55  EQSSQENLKIADEVKKSTKEESKQLLEVLKTKEEHQKEIQYEI 97


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 33.5 bits (77), Expect = 0.16
 Identities = 20/86 (23%), Positives = 48/86 (55%), Gaps = 2/86 (2%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEI--AAREKAEKKHQESVERLKQLEVEMAKRDQDL 271
            + ++  + RE    K++++ +L+ E   A  E+A +  + + E L+Q+E ++ +  ++ 
Sbjct: 617 ALAELNDERRERLAEKRERKRELEAEFDEARIEEAREDKERAEEYLEQVEEKLDELREER 676

Query: 272 MEAQEMIRRLEEQLKQLQAAKEELEA 297
            + Q  I  +E +L++L+  +E  EA
Sbjct: 677 DDLQAEIGAVENELEELEELRERREA 702



 Score = 33.1 bits (76), Expect = 0.22
 Identities = 23/98 (23%), Positives = 52/98 (53%), Gaps = 9/98 (9%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKL-----QLEIAA---REKAEKKHQESVERLKQLEVEMA 265
           D++++ A+ RE    K+++ ++L     +LE  A   RE A +  +E+ E  +++  E+ 
Sbjct: 520 DLEELIAERRETIEEKRERAEELRERAAELEAEAEEKREAAAEAEEEAEEAREEVA-ELN 578

Query: 266 KRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQ 303
            +  +L E  E + R+   L  +  A++E+E  + + +
Sbjct: 579 SKLAELKERIESLERIRTLLAAIADAEDEIERLREKRE 616



 Score = 32.7 bits (75), Expect = 0.30
 Identities = 22/94 (23%), Positives = 50/94 (53%), Gaps = 4/94 (4%)

Query: 219 KAQAREEKNA-KQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEM 277
            A   EE+    +++  +L+ E+   E+A +  ++  E +++LE E+ +  +   +A   
Sbjct: 350 DADDLEERAEELREEAAELESEL---EEAREAVEDRREEIEELEEEIEELRERFGDAPVD 406

Query: 278 IRRLEEQLKQLQAAKEELEARQTELQLELQPIRN 311
           +   E+ L++L+  ++EL  R+ EL+  L+  R 
Sbjct: 407 LGNAEDFLEELREERDELREREAELEATLRTARE 440



 Score = 31.9 bits (73), Expect = 0.53
 Identities = 19/79 (24%), Positives = 37/79 (46%), Gaps = 7/79 (8%)

Query: 224 EEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEE 283
           E++   ++ RD+    +         H+E  E L+ LE E+    + + E +     L E
Sbjct: 227 EQREQARETRDEADEVLEE-------HEERREELETLEAEIEDLRETIAETEREREELAE 279

Query: 284 QLKQLQAAKEELEARQTEL 302
           +++ L+   EELE  + +L
Sbjct: 280 EVRDLRERLEELEEERDDL 298



 Score = 30.8 bits (70), Expect = 1.3
 Identities = 23/111 (20%), Positives = 49/111 (44%), Gaps = 4/111 (3%)

Query: 206 RRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMA 265
           R  + +  +V +   + REE    + + + L+  IA  E+  ++  E V  L++   E+ 
Sbjct: 233 RETRDEADEVLEEHEERREELETLEAEIEDLRETIAETEREREELAEEVRDLRERLEELE 292

Query: 266 KRDQDLMEAQEM----IRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
           +   DL+    +       +E + ++L+   EEL  R  E ++  Q    +
Sbjct: 293 EERDDLLAEAGLDDADAEAVEARREELEDRDEELRDRLEECRVAAQAHNEE 343



 Score = 30.4 bits (69), Expect = 1.3
 Identities = 22/83 (26%), Positives = 44/83 (53%), Gaps = 5/83 (6%)

Query: 229 KQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME----AQEMIRRLEEQ 284
            +++ ++L+ E+   E+  ++ +E +ER + L VE   R + L E     +E+I    E 
Sbjct: 473 DRERVEELEAELEDLEEEVEEVEERLERAEDL-VEAEDRIERLEERREDLEELIAERRET 531

Query: 285 LKQLQAAKEELEARQTELQLELQ 307
           +++ +   EEL  R  EL+ E +
Sbjct: 532 IEEKRERAEELRERAAELEAEAE 554



 Score = 30.4 bits (69), Expect = 1.5
 Identities = 18/58 (31%), Positives = 31/58 (53%)

Query: 246 AEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQ 303
            EK+ ++  ERL  LE E+A+ D+++   +E   +  E   +     EE E R+ EL+
Sbjct: 197 EEKEEKDLHERLNGLESELAELDEEIERYEEQREQARETRDEADEVLEEHEERREELE 254



 Score = 30.0 bits (68), Expect = 1.8
 Identities = 21/97 (21%), Positives = 44/97 (45%), Gaps = 11/97 (11%)

Query: 221 QAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRR 280
           + RE     +++ ++ + E+A       + +E +E L+++   +A       +A++ I R
Sbjct: 555 EKREAAAEAEEEAEEAREEVAELNSKLAELKERIESLERIRTLLAAIA----DAEDEIER 610

Query: 281 LEEQLKQLQA----AKEELEA---RQTELQLELQPIR 310
           L E+ + L       +E L     R+ EL+ E    R
Sbjct: 611 LREKREALAELNDERRERLAEKRERKRELEAEFDEAR 647



 Score = 30.0 bits (68), Expect = 2.2
 Identities = 27/105 (25%), Positives = 47/105 (44%), Gaps = 8/105 (7%)

Query: 211 DTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLE---VEMAKR 267
           D ++  ++ AQA  E  A+  + D   LE    E+AE+  +E+ E   +LE     +  R
Sbjct: 328 DRLEECRVAAQAHNE-EAESLREDADDLE----ERAEELREEAAELESELEEAREAVEDR 382

Query: 268 DQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
            +++ E +E I  L E+           E    EL+ E   +R +
Sbjct: 383 REEIEELEEEIEELRERFGDAPVDLGNAEDFLEELREERDELRER 427



 Score = 29.6 bits (67), Expect = 2.3
 Identities = 18/81 (22%), Positives = 40/81 (49%), Gaps = 3/81 (3%)

Query: 220 AQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRD---QDLMEAQE 276
            + RE     + + + L+ E+   E+  ++ ++ VE   ++E    +R+   + + E +E
Sbjct: 471 EEDRERVEELEAELEDLEEEVEEVEERLERAEDLVEAEDRIERLEERREDLEELIAERRE 530

Query: 277 MIRRLEEQLKQLQAAKEELEA 297
            I    E+ ++L+    ELEA
Sbjct: 531 TIEEKRERAEELRERAAELEA 551



 Score = 29.2 bits (66), Expect = 3.5
 Identities = 28/94 (29%), Positives = 49/94 (52%), Gaps = 7/94 (7%)

Query: 220 AQAREEKNAKQQQRDKLQ-LEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMI 278
           A A +E    +++R+ L  L    RE+  +K     ER ++LE E    +  + EA+E  
Sbjct: 602 ADAEDEIERLREKREALAELNDERRERLAEKR----ERKRELEAEF--DEARIEEAREDK 655

Query: 279 RRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
            R EE L+Q++   +EL   + +LQ E+  + N+
Sbjct: 656 ERAEEYLEQVEEKLDELREERDDLQAEIGAVENE 689


>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota).  DNA
           Topoisomerase I (eukaryota), DNA topoisomerase V,
           Vaccina virus topoisomerase, Variola virus
           topoisomerase, Shope fibroma virus topoisomeras.
          Length = 391

 Score = 33.1 bits (76), Expect = 0.16
 Identities = 23/114 (20%), Positives = 47/114 (41%), Gaps = 11/114 (9%)

Query: 188 RVNKRILALCMGNHELYMRRRKPDTIDVQQMKAQAR-----EEKNAKQQQRDKLQLEIAA 242
           R N+ +  LC  NH+    R    T +    K Q +      +    ++     ++    
Sbjct: 256 RANREVAILC--NHQ----RTVSKTHEKSMEKLQEKIKALKYQLKRLKKMILLFEMISDL 309

Query: 243 REKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELE 296
           + K + K +   E+L     E  K  +   + ++ I RLEE++++L+    + E
Sbjct: 310 KRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERIEKLEVQATDKE 363


>gnl|CDD|219780 pfam08286, Spc24, Spc24 subunit of Ndc80.  Spc24 is a component of
           the evolutionarily conserved kinetochore-associated
           Ndc80 complex and is involved in chromosome segregation.
          Length = 117

 Score = 31.5 bits (72), Expect = 0.17
 Identities = 11/45 (24%), Positives = 22/45 (48%)

Query: 264 MAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQP 308
           + + + +     + +  LE +L  LQ+  EEL+ R  EL+ +   
Sbjct: 1   IQELENEKFRLAKELNDLESELNNLQSELEELKERLEELEEQEVE 45



 Score = 26.9 bits (60), Expect = 6.9
 Identities = 13/52 (25%), Positives = 27/52 (51%), Gaps = 3/52 (5%)

Query: 254 VERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLE 305
           ++ L+  +  +AK   DL   +  +  L+ +L++L+   EELE ++ E    
Sbjct: 1   IQELENEKFRLAKELNDL---ESELNNLQSELEELKERLEELEEQEVEGDEV 49


>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
          Length = 1021

 Score = 33.2 bits (75), Expect = 0.19
 Identities = 15/72 (20%), Positives = 43/72 (59%)

Query: 205 MRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEM 264
           + R + + ++ ++M+   RE    ++ +R++L+ +   R++ ++  +E V+RL++  +E 
Sbjct: 463 LEREERERLERERMERIERERLERERLERERLERDRLERDRLDRLERERVDRLERDRLEK 522

Query: 265 AKRDQDLMEAQE 276
           A+R+   ++  E
Sbjct: 523 ARRNSYFLKGME 534



 Score = 28.9 bits (64), Expect = 4.8
 Identities = 17/80 (21%), Positives = 43/80 (53%), Gaps = 6/80 (7%)

Query: 221 QAREEKNAKQQQR------DKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEA 274
           +AR EK    ++       +K ++E   RE+ E+  +E +ER+++  +E  + +++ +E 
Sbjct: 437 RARIEKENAHRKALEMKILEKKRIERLEREERERLERERMERIERERLERERLERERLER 496

Query: 275 QEMIRRLEEQLKQLQAAKEE 294
             + R   ++L++ +  + E
Sbjct: 497 DRLERDRLDRLERERVDRLE 516


>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
           Transcription of the anti-viral guanylate-binding
           protein (GBP) is induced by interferon-gamma during
           macrophage induction. This family contains GBP1 and
           GPB2, both GTPases capable of binding GTP, GDP and GMP.
          Length = 297

 Score = 32.6 bits (75), Expect = 0.23
 Identities = 28/96 (29%), Positives = 50/96 (52%), Gaps = 10/96 (10%)

Query: 212 TIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQ-LEVEMAK--RD 268
            I+ ++ KA+A E    ++  R+K + E    E  E+ +QE V++L + +E E  K   +
Sbjct: 205 AIEAERAKAEAAEA--EQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREKLLAE 262

Query: 269 QDLMEAQEMIRRLEEQLKQLQAA-KEELEARQTELQ 303
           Q+ M   ++     EQ + L+   K E E+ Q E+Q
Sbjct: 263 QERMLEHKLQ----EQEELLKEGFKTEAESLQKEIQ 294


>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200).  This
           family is found in eukaryotes. It is a coiled-coil
           domain of unknwon function.
          Length = 126

 Score = 31.4 bits (72), Expect = 0.24
 Identities = 19/84 (22%), Positives = 43/84 (51%), Gaps = 10/84 (11%)

Query: 216 QQMKAQAREEKNAKQQQ----RDKLQLEI------AAREKAEKKHQESVERLKQLEVEMA 265
           ++ +   + E+  ++++       ++ +       A R +AEKK +E  +  K+ E E+ 
Sbjct: 25  RREELLKQREEELEKKEEELQESLIKFDKFLKENEAKRRRAEKKAEEEKKLRKEKEEEIK 84

Query: 266 KRDQDLMEAQEMIRRLEEQLKQLQ 289
           +   +L E +  I +LEE+L++ Q
Sbjct: 85  ELKAELEELKAEIEKLEEKLEEYQ 108



 Score = 29.1 bits (66), Expect = 1.4
 Identities = 22/95 (23%), Positives = 53/95 (55%), Gaps = 1/95 (1%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVER-LKQLEVEMAKRDQDLM 272
           +VQ      REE   +++   + + E+  +E+  ++     ++ LK+ E +  + ++   
Sbjct: 11  EVQLALDAKREEFERREELLKQREEELEKKEEELQESLIKFDKFLKENEAKRRRAEKKAE 70

Query: 273 EAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
           E +++ +  EE++K+L+A  EEL+A   +L+ +L+
Sbjct: 71  EEKKLRKEKEEEIKELKAELEELKAEIEKLEEKLE 105


>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
           Provisional.
          Length = 1123

 Score = 33.0 bits (76), Expect = 0.24
 Identities = 14/78 (17%), Positives = 31/78 (39%)

Query: 224 EEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEE 283
           + +N     + ++       E   ++  +S    +  + E+   +    E +E  + LE 
Sbjct: 139 DPENLLHALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELEA 198

Query: 284 QLKQLQAAKEELEARQTE 301
           QL+QLQ    E    + +
Sbjct: 199 QLEQLQEKAAETSQERKQ 216



 Score = 31.1 bits (71), Expect = 0.90
 Identities = 17/62 (27%), Positives = 37/62 (59%), Gaps = 3/62 (4%)

Query: 251 QESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIR 310
           Q+ V  LKQ ++E+  R++   ++Q +    +++L  L+    ELE +Q EL+ +L+ ++
Sbjct: 148 QQEVLTLKQ-QLELQAREKA--QSQALAEAQQQELVALEGLAAELEEKQQELEAQLEQLQ 204

Query: 311 NQ 312
            +
Sbjct: 205 EK 206



 Score = 30.3 bits (69), Expect = 1.6
 Identities = 17/79 (21%), Positives = 38/79 (48%), Gaps = 5/79 (6%)

Query: 230 QQQRDKLQLEIAA-REKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQL 288
           +     LQ E+   +++ E + +E      Q +     + Q+L+  + +   LEE+ ++L
Sbjct: 141 ENLLHALQQEVLTLKQQLELQARE----KAQSQALAEAQQQELVALEGLAAELEEKQQEL 196

Query: 289 QAAKEELEARQTELQLELQ 307
           +A  E+L+ +  E   E +
Sbjct: 197 EAQLEQLQEKAAETSQERK 215



 Score = 28.8 bits (65), Expect = 4.4
 Identities = 18/87 (20%), Positives = 38/87 (43%), Gaps = 2/87 (2%)

Query: 215 VQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAK-RDQDLME 273
           V  +K Q   +   K Q +   + +       E    E  E+ ++LE ++ + +++    
Sbjct: 151 VLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELEAQLEQLQEKAAET 210

Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQT 300
           +QE  ++ +E   Q  A + EL   +T
Sbjct: 211 SQERKQKRKEITDQ-AAKRLELSEEET 236


>gnl|CDD|227635 COG5325, COG5325, t-SNARE complex subunit, syntaxin [Intracellular
           trafficking and secretion].
          Length = 283

 Score = 32.5 bits (74), Expect = 0.26
 Identities = 18/89 (20%), Positives = 36/89 (40%)

Query: 215 VQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEA 274
           +QQ  AQ R+ +  + +       +    E+ E +   S    +Q   +    +++L   
Sbjct: 127 IQQKSAQFRKYQVLQAKFLRNKNNDQHPLEEEEDEESLSSLGSQQTLQQQGLSNEELEYQ 186

Query: 275 QEMIRRLEEQLKQLQAAKEELEARQTELQ 303
           Q +I   +E++K L     EL     +L 
Sbjct: 187 QILITERDEEIKNLARGIYELNEIFRDLG 215


>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 438

 Score = 32.5 bits (74), Expect = 0.26
 Identities = 25/105 (23%), Positives = 47/105 (44%), Gaps = 3/105 (2%)

Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
            Q K Q        +Q    LQ EIA  E  E  + E  + L+ LE E+ +  ++L +  
Sbjct: 75  NQKKEQKNLFNEQIKQFELALQDEIAKLEALELLNLEKDKELELLEKELDELSKELQKQL 134

Query: 276 EMIRRLEEQLKQLQAAKEEL---EARQTELQLELQPIRNQSNLNK 317
           +    + E+ ++    +E L     ++ E  LEL+  + +  L++
Sbjct: 135 QNTAEIIEKKRENNKNEERLKFENEKKLEESLELEREKFEEQLHE 179


>gnl|CDD|225803 COG3264, COG3264, Small-conductance mechanosensitive channel [Cell
           envelope biogenesis, outer membrane].
          Length = 835

 Score = 32.7 bits (75), Expect = 0.26
 Identities = 18/94 (19%), Positives = 35/94 (37%), Gaps = 7/94 (7%)

Query: 212 TIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDL 271
              +Q+     R+E+     +  +LQ E+  +E          ++L Q   +  +R   L
Sbjct: 31  LQLLQEAVNSKRQEEAEPAAEEAELQAELIQQE------LAINDQLSQALNQQTERLNAL 84

Query: 272 MEAQEMIRRLEEQLKQLQ-AAKEELEARQTELQL 304
                 +  L  QL Q     +E++   +  L L
Sbjct: 85  ASDDRQLANLLLQLLQSSRTIREQIAVLRGSLLL 118



 Score = 32.0 bits (73), Expect = 0.50
 Identities = 18/100 (18%), Positives = 41/100 (41%), Gaps = 6/100 (6%)

Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
           +++ A A +++         LQ     RE+     + S+   + L  ++    +     +
Sbjct: 79  ERLNALASDDRQLANLLLQLLQSSRTIREQIAVL-RGSLLLSRILLQQLGPLPEAGQPQE 137

Query: 276 EMIRRLEEQLKQLQAAKEE---LEARQTELQLELQPIRNQ 312
           +    + ++   LQA K     LE +  +L  E++ I +Q
Sbjct: 138 QFE--VTQERDALQAEKAYINALEGQAEQLTAEVRDILDQ 175



 Score = 27.7 bits (62), Expect = 8.9
 Identities = 21/91 (23%), Positives = 36/91 (39%), Gaps = 9/91 (9%)

Query: 227 NAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMA-----KRDQDLMEAQEMIRRL 281
              Q +R+ L  E A  E A +  QE+V   +Q E E A      + + + +   +  +L
Sbjct: 11  ELLQSRRELLTAESAQLEAALQLLQEAVNSKRQEEAEPAAEEAELQAELIQQELAINDQL 70

Query: 282 EEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
            + L Q Q  +    A       +L  +  Q
Sbjct: 71  SQALNQ-QTERLNALASD---DRQLANLLLQ 97


>gnl|CDD|222629 pfam14257, DUF4349, Domain of unknown function (DUF4349).  This
           family of proteins is found in bacteria and archaea.
           Proteins in this family are typically between 282 and
           353 amino acids in length. There is a single completely
           conserved residue D that may be functionally important.
          Length = 262

 Score = 32.2 bits (74), Expect = 0.29
 Identities = 15/48 (31%), Positives = 28/48 (58%), Gaps = 1/48 (2%)

Query: 251 QESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEAR 298
           + S ERL  L +E A   +DL+E +  + R++ +++ L+A    L+ R
Sbjct: 145 RASEERLLAL-LERAGTVEDLLEVERELSRVQAEIESLEAQLRYLDDR 191


>gnl|CDD|203462 pfam06476, DUF1090, Protein of unknown function (DUF1090).  This
           family consists of several bacterial proteins of unknown
           function and is known as YqjC in E. coli.
          Length = 115

 Score = 30.7 bits (70), Expect = 0.29
 Identities = 23/94 (24%), Positives = 42/94 (44%), Gaps = 17/94 (18%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAARE--------KAEKKHQESVERLKQLEVEMA 265
            ++     A+   N  +       LE A +E            + Q  VE  +Q   E+A
Sbjct: 28  AIETQIEYAKAHGNQHRI----AGLEKALKEVQAHCTDAGLRAERQAKVEEKEQ---EVA 80

Query: 266 KRDQDLMEAQEM--IRRLEEQLKQLQAAKEELEA 297
           +R+Q+L EAQ      ++E++ ++L  A+ EL+ 
Sbjct: 81  EREQELKEAQAKGDADKIEKKQRKLAEAQAELQE 114


>gnl|CDD|219412 pfam07439, DUF1515, Protein of unknown function (DUF1515).  This
           family consists of several hypothetical bacterial
           proteins of around 130 residues in length. Members of
           this family seem to be found exclusively in Rhizobium
           species. The function of this family is unknown.
          Length = 112

 Score = 30.7 bits (69), Expect = 0.30
 Identities = 16/62 (25%), Positives = 29/62 (46%)

Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
           QQ+     E KN ++  R       A+R    ++  E VER+  LE  ++K + D+   +
Sbjct: 8   QQLGTLQAEVKNLREDIRRSEDRSDASRASMYRRMDELVERVTTLEGSVSKLNADITSMK 67

Query: 276 EM 277
            +
Sbjct: 68  PV 69


>gnl|CDD|189476 pfam00261, Tropomyosin, Tropomyosin. 
          Length = 237

 Score = 31.8 bits (73), Expect = 0.33
 Identities = 21/83 (25%), Positives = 45/83 (54%), Gaps = 2/83 (2%)

Query: 219 KAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMI 278
           ++   +EK  + + + K   EIA  E+A++K++E   +L  +E ++ + ++    A+  I
Sbjct: 86  RSLKDDEKMEQLEAQLKEAKEIA--EEADRKYEEVARKLVVVEGDLERAEERAEAAESKI 143

Query: 279 RRLEEQLKQLQAAKEELEARQTE 301
             LEE+LK +    + LE  + +
Sbjct: 144 VELEEELKVVGNNLKSLEVSEEK 166


>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
           autophagy-related subunit 14.  The Atg14 or Apg14
           proteins are hydrophilic proteins with a predicted
           molecular mass of 40.5 kDa, and have a coiled-coil motif
           at the N terminus region. Yeast cells with mutant Atg14
           are defective not only in autophagy but also in sorting
           of carboxypeptidase Y (CPY), a vacuolar-soluble
           hydrolase, to the vacuole. Subcellular fractionation
           indicate that Apg14p and Apg6p are peripherally
           associated with a membrane structure(s). Apg14p was
           co-immunoprecipitated with Apg6p, suggesting that they
           form a stable protein complex. These results imply that
           Apg6/Vps30p has two distinct functions: in the
           autophagic process and in the vacuolar protein sorting
           pathway. Apg14p may be a component specifically required
           for the function of Apg6/Vps30p through the autophagic
           pathway. There are 17 auto-phagosomal component proteins
           which are categorized into six functional units, one of
           which is the AS-PI3K complex (Vps30/Atg6 and Atg14). The
           AS-PI3K complex and the Atg2-Atg18 complex are essential
           for nucleation, and the specific function of the AS-PI3K
           apparently is to produce phosphatidylinositol
           3-phosphate (PtdIns(3)P) at the pre-autophagosomal
           structure (PAS). The localisation of this complex at the
           PAS is controlled by Atg14. Autophagy mediates the
           cellular response to nutrient deprivation, protein
           aggregation, and pathogen invasion in humans, and
           malfunction of autophagy has been implicated in multiple
           human diseases including cancer. This effect seems to be
           mediated through direct interaction of the human Atg14
           with Beclin 1 in the human phosphatidylinositol 3-kinase
           class III complex.
          Length = 307

 Score = 32.0 bits (73), Expect = 0.34
 Identities = 22/127 (17%), Positives = 49/127 (38%), Gaps = 21/127 (16%)

Query: 188 RVNKRILALCMGNHELYMRRRKPDT----IDVQQMKAQA---REEKNAKQQQRDKLQLEI 240
           +V + +      + +L     K +       + Q++A+    +EE   K+++ ++L+  +
Sbjct: 42  KVEEALEGATNEDGKLAADLLKLEVARKKERLNQIRARISQLKEEIEQKRERIEELKRAL 101

Query: 241 AAR-EKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQ 299
           A R         +  +R              L + Q+ I+R   +L  L      L  ++
Sbjct: 102 AQRRSDLSSASYQLEKRRA----------SQLEKLQDEIKRTRSKLNALH---SLLAEKR 148

Query: 300 TELQLEL 306
           + L  EL
Sbjct: 149 SFLCREL 155



 Score = 30.4 bits (69), Expect = 1.1
 Identities = 20/90 (22%), Positives = 45/90 (50%), Gaps = 1/90 (1%)

Query: 219 KAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEA-QEM 277
           K  A   K    +++++L    A   + +++ ++  ER+++L+  +A+R  DL  A  ++
Sbjct: 56  KLAADLLKLEVARKKERLNQIRARISQLKEEIEQKRERIEELKRALAQRRSDLSSASYQL 115

Query: 278 IRRLEEQLKQLQAAKEELEARQTELQLELQ 307
            +R   QL++LQ   +   ++   L   L 
Sbjct: 116 EKRRASQLEKLQDEIKRTRSKLNALHSLLA 145


>gnl|CDD|112890 pfam04094, DUF390, Protein of unknown function (DUF390).  This is a
           family of long proteins currently only found in the rice
           genome. They have no known function. However they may be
           some kind of transposable element.
          Length = 843

 Score = 32.5 bits (73), Expect = 0.34
 Identities = 22/91 (24%), Positives = 42/91 (46%), Gaps = 2/91 (2%)

Query: 219 KAQAREEKNAKQQQRDKLQLEIAAR-EKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEM 277
           +++A +   A+ ++R+  + E A R  +AE+  Q++  R +Q E    +      +A+E 
Sbjct: 224 QSEAEDPAAAEARRREADRREAADRLREAEEAAQDAA-RARQAEEAAREEAARARQAEEA 282

Query: 278 IRRLEEQLKQLQAAKEELEARQTELQLELQP 308
            R  E   +  +AA     AR      +L P
Sbjct: 283 AREAEAAFRADEAAATSEAARDEAAGAQLAP 313



 Score = 29.4 bits (65), Expect = 2.8
 Identities = 28/93 (30%), Positives = 46/93 (49%), Gaps = 11/93 (11%)

Query: 217 QMKAQAREEKNAKQQQRDKLQLE--IAAREKAEKKHQ------ESVERLKQLEVEMAKRD 268
           + +A A EE+ A  ++RD    E  +A RE     H+      E   RL+  E  + +R+
Sbjct: 530 EARAMALEER-ACAKERDLADREAAVAIREATLAAHEAACAEEEFALRLR--EDALTERE 586

Query: 269 QDLMEAQEMIRRLEEQLKQLQAAKEELEARQTE 301
           + L EA+   ++L + L   +AA+EE   R  E
Sbjct: 587 RALEEAEAAAQQLADSLFLREAAQEEQARRNLE 619


>gnl|CDD|240227 PTZ00009, PTZ00009, heat shock 70 kDa protein; Provisional.
          Length = 653

 Score = 32.5 bits (74), Expect = 0.35
 Identities = 23/101 (22%), Positives = 43/101 (42%), Gaps = 13/101 (12%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEK-----KHQESVERLKQLEVEMAKRD 268
           D+ +M  +A + K   +  R++    + A+   E      K+    E++K  ++  + + 
Sbjct: 516 DIDRMVNEAEKYKAEDEANRER----VEAKNGLENYCYSMKNTLQDEKVKG-KLSDSDKA 570

Query: 269 QDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPI 309
                  E +  LE   K   A KEE E +Q E++    PI
Sbjct: 571 TIEKAIDEALEWLE---KNQLAEKEEFEHKQKEVESVCNPI 608


>gnl|CDD|191111 pfam04849, HAP1_N, HAP1 N-terminal conserved region.  This family
           represents an N-terminal conserved region found in
           several huntingtin-associated protein 1 (HAP1)
           homologues. HAP1 binds to huntingtin in a polyglutamine
           repeat-length-dependent manner. However, its possible
           role in the pathogenesis of Huntington's disease is
           unclear. This family also includes a similar N-terminal
           conserved region from hypothetical protein products of
           ALS2CR3 genes found in the human juvenile amyotrophic
           lateral sclerosis critical region 2q33-2q34.
          Length = 307

 Score = 32.1 bits (73), Expect = 0.35
 Identities = 25/109 (22%), Positives = 47/109 (43%), Gaps = 10/109 (9%)

Query: 205 MRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEM 264
           +R        + +  A+  E+   +Q++   L  +I   +K  K +    E L+Q     
Sbjct: 209 LREANDQIASLSEELAKKTEDLERQQEEITHLLSQIVDLQKKCKSYALENEELQQH---- 264

Query: 265 AKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQS 313
                 L  A++  R+L+ +L++LQ    E      E Q EL+ +RN+ 
Sbjct: 265 ------LAAAKDAQRQLQAELQELQDKYAECMEMLHEAQEELKNLRNKG 307



 Score = 29.0 bits (65), Expect = 3.5
 Identities = 33/125 (26%), Positives = 57/125 (45%), Gaps = 23/125 (18%)

Query: 211 DTIDVQQMKAQAREEKNAK-QQQRDKLQLEIAAREKAE--------KKHQESVERLKQLE 261
             ++  Q K +  EE+N   + +   L+ E    E+ E        K+ +E+ +++  L 
Sbjct: 161 FQLEALQEKLKLLEEENEHLRSEASHLKTETVTYEEKEQQLVNDCVKQLREANDQIASLS 220

Query: 262 VEMAKRDQDLMEAQEMIRRLEEQLKQLQ-------AAKEELEA-------RQTELQLELQ 307
            E+AK+ +DL   QE I  L  Q+  LQ          EEL+         Q +LQ ELQ
Sbjct: 221 EELAKKTEDLERQQEEITHLLSQIVDLQKKCKSYALENEELQQHLAAAKDAQRQLQAELQ 280

Query: 308 PIRNQ 312
            ++++
Sbjct: 281 ELQDK 285


>gnl|CDD|222689 pfam14335, DUF4391, Domain of unknown function (DUF4391).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria and archaea.
           Proteins in this family are typically between 220 and
           257 amino acids in length.
          Length = 221

 Score = 31.8 bits (73), Expect = 0.36
 Identities = 13/50 (26%), Positives = 29/50 (58%), Gaps = 2/50 (4%)

Query: 265 AKRDQDLMEAQEMIRRLEEQLKQL--QAAKEELEARQTELQLELQPIRNQ 312
            +  ++ +E  E I  LE+++ +L  +  KE+   R+ EL LE++ ++ +
Sbjct: 171 KEDLKERVERLEQIEELEKEIAKLKKKLKKEKQFNRKVELNLEIKKLKKE 220


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 32.5 bits (74), Expect = 0.36
 Identities = 18/115 (15%), Positives = 44/115 (38%), Gaps = 9/115 (7%)

Query: 198 MGNHELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERL 257
           +       +  +     ++++  +  + K   +Q+ ++L+     R K  +  +E+ E L
Sbjct: 517 IEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKE--RERNKKLELEKEAQEAL 574

Query: 258 KQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
           K L+ E+    ++L E        +    +   + E+L   +   Q   Q   N 
Sbjct: 575 KALKKEVESIIRELKEK-------KIHKAKEIKSIEDLVKLKETKQKIPQKPTNF 622


>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
          Length = 646

 Score = 32.2 bits (74), Expect = 0.37
 Identities = 17/66 (25%), Positives = 31/66 (46%), Gaps = 7/66 (10%)

Query: 238 LEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEA 297
           L     EK +   +   E +K +E E       L + ++ I+ LEE++ +L+   +ELE 
Sbjct: 69  LNPLREEKKKVSVKSLEELIKDVEEE-------LEKIEKEIKELEEEISELENEIKELEQ 121

Query: 298 RQTELQ 303
               L+
Sbjct: 122 EIERLE 127



 Score = 28.4 bits (64), Expect = 5.4
 Identities = 14/54 (25%), Positives = 28/54 (51%), Gaps = 2/54 (3%)

Query: 246 AEKKHQESVER-LKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEAR 298
             K+  + VE  LK+L  E     ++     E+IR ++E+L++++  +E L   
Sbjct: 185 VLKELSDEVEEELKKLGFE-RLELEEEGTPSELIREIKEELEEIEKERESLLEE 237


>gnl|CDD|226203 COG3678, CpxP, P pilus assembly/Cpx signaling pathway, periplasmic
           inhibitor/zinc-resistance associated protein
           [Intracellular trafficking and secretion / Cell motility
           and secretio / Signal transduction mechanisms /
           Inorganic ion transport and metabolism].
          Length = 160

 Score = 31.2 bits (71), Expect = 0.37
 Identities = 14/84 (16%), Positives = 36/84 (42%), Gaps = 3/84 (3%)

Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
            Q +AQ  + ++ ++   + +  +     KA  + ++   + + L     K D  + +  
Sbjct: 72  AQRRAQREQLRSKRRALHELIAADQFDEAKARAQAEKMENQRQALRELRVKSDNQMYQV- 130

Query: 276 EMIRRLEEQLKQLQA-AKEELEAR 298
            +      +L++L A  +EE + R
Sbjct: 131 -LTPEQRAKLQELLAQRREERQQR 153


>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family.  This
           family consists of the 116kDa V-type ATPase (vacuolar
           (H+)-ATPases) subunits, as well as V-type ATP synthase
           subunit i. The V-type ATPases family are proton pumps
           that acidify intracellular compartments in eukaryotic
           cells for example yeast central vacuoles,
           clathrin-coated and synaptic vesicles. They have
           important roles in membrane trafficking processes. The
           116kDa subunit (subunit a) in the V-type ATPase is part
           of the V0 functional domain responsible for proton
           transport. The a subunit is a transmembrane glycoprotein
           with multiple putative transmembrane helices it has a
           hydrophilic amino terminal and a hydrophobic carboxy
           terminal. It has roles in proton transport and assembly
           of the V-type ATPase complex. This subunit is encoded by
           two homologous gene in yeast VPH1 and STV1.
          Length = 707

 Score = 32.4 bits (74), Expect = 0.40
 Identities = 16/72 (22%), Positives = 35/72 (48%), Gaps = 3/72 (4%)

Query: 249 KHQESVERLKQ---LEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLE 305
             +  V   K+   LE E+   + ++ E +E +  LE+++ +L+     L+  ++ L   
Sbjct: 58  GGKPDVPPSKEFLDLEEEILDLEAEIKEVEENLESLEKEINELEEWLNVLDEEKSFLDEN 117

Query: 306 LQPIRNQSNLNK 317
           L+ +   SNL+ 
Sbjct: 118 LEELSELSNLDI 129


>gnl|CDD|223562 COG0488, Uup, ATPase components of ABC transporters with duplicated
           ATPase domains [General function prediction only].
          Length = 530

 Score = 31.8 bits (73), Expect = 0.42
 Identities = 24/109 (22%), Positives = 43/109 (39%), Gaps = 16/109 (14%)

Query: 199 GNHELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLK 258
           GN+  Y+ ++        +   Q       +Q++  K Q  I   + A  K +++  R+K
Sbjct: 233 GNYSSYLEQK-------AERLRQEAAAYEKQQKELAKEQEWIRRGKAAASKAKKAKSRIK 285

Query: 259 QLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
           +LE          +EA+    R  E+ K L         R  +L LE +
Sbjct: 286 RLEK---------LEARLAEERPVEEGKPLAFRFPPPGKRLGKLVLEFE 325


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 32.4 bits (73), Expect = 0.42
 Identities = 21/106 (19%), Positives = 50/106 (47%), Gaps = 15/106 (14%)

Query: 207  RRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQES-----------VE 255
            ++K + +   + + + +  + AK+ + DK + E A + + ++K                E
Sbjct: 1646 KKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAE 1705

Query: 256  RLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTE 301
             LK+ E E  K+ ++L +A+E  +   E+ K+    + E + ++ E
Sbjct: 1706 ELKKKEAEEKKKAEELKKAEEENKIKAEEAKK----EAEEDKKKAE 1747



 Score = 31.3 bits (70), Expect = 0.85
 Identities = 21/91 (23%), Positives = 50/91 (54%), Gaps = 1/91 (1%)

Query: 216  QQMKA-QAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEA 274
            ++MKA +A++ + AK +  +  + E   ++  + K +E+ E+ K  E++ A+ +  +  A
Sbjct: 1605 KKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAA 1664

Query: 275  QEMIRRLEEQLKQLQAAKEELEARQTELQLE 305
            +E  +  E++ K  +A K E + ++    L+
Sbjct: 1665 EEAKKAEEDKKKAEEAKKAEEDEKKAAEALK 1695



 Score = 30.9 bits (69), Expect = 1.3
 Identities = 26/122 (21%), Positives = 58/122 (47%), Gaps = 10/122 (8%)

Query: 206  RRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQES--VERLKQLEVE 263
            +  + + I   +   +A E+K   ++ +   + E  A E  +K+ +E+   E LK+ E E
Sbjct: 1654 KAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAE 1713

Query: 264  MAKRDQDL--------MEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNL 315
              K+ ++L        ++A+E  +  EE  K+ + AK++ E ++    L+ +  +    +
Sbjct: 1714 EKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEI 1773

Query: 316  NK 317
             K
Sbjct: 1774 RK 1775



 Score = 30.5 bits (68), Expect = 1.4
 Identities = 24/113 (21%), Positives = 56/113 (49%), Gaps = 3/113 (2%)

Query: 207  RRKPDTIDVQQMKAQAREEKNAKQQQR-DKLQLEIAAREKAEKKHQESVERLKQLEVEMA 265
                D  +  + KA+A E+K  + +++ D  + +   ++KA++  +++ E  K+ + E+ 
Sbjct: 1353 EAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKAD-ELK 1411

Query: 266  KRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQT-ELQLELQPIRNQSNLNK 317
            K      +A E  ++ EE+ K  +A K+  EA++  E + + +  +      K
Sbjct: 1412 KAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKK 1464



 Score = 30.1 bits (67), Expect = 1.8
 Identities = 19/97 (19%), Positives = 44/97 (45%), Gaps = 1/97 (1%)

Query: 206  RRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQ-ESVERLKQLEVEM 264
             + +      ++ K +A   K   ++++   + +  A E  +K  + +     K+   E 
Sbjct: 1364 EKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEA 1423

Query: 265  AKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTE 301
             K+ ++  +A E  ++ EE  K  +A K+  EA++ E
Sbjct: 1424 KKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAE 1460



 Score = 30.1 bits (67), Expect = 1.8
 Identities = 22/95 (23%), Positives = 46/95 (48%), Gaps = 3/95 (3%)

Query: 207  RRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAK 266
            ++K D +   +   +A E+K A++ ++ +    +A R+  E K     E  +  EV    
Sbjct: 1545 KKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKK---AEEARIEEVMKLY 1601

Query: 267  RDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTE 301
             ++  M+A+E  +  E ++K  +  K E E ++ E
Sbjct: 1602 EEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVE 1636



 Score = 29.7 bits (66), Expect = 2.3
 Identities = 20/86 (23%), Positives = 42/86 (48%)

Query: 216  QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
             + K +A E K A + ++   + + A   K + +  +  +  K+   E  K D+   +A+
Sbjct: 1434 DEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAE 1493

Query: 276  EMIRRLEEQLKQLQAAKEELEARQTE 301
            E  ++ +E  K  +A K+  EA++ E
Sbjct: 1494 EAKKKADEAKKAAEAKKKADEAKKAE 1519



 Score = 29.7 bits (66), Expect = 2.7
 Identities = 21/97 (21%), Positives = 48/97 (49%), Gaps = 2/97 (2%)

Query: 207  RRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEV--EM 264
            ++K D    +  + +  +E   K ++  K   E+     A+KK  E+ ++ ++ +   E 
Sbjct: 1377 KKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEA 1436

Query: 265  AKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTE 301
             K+ ++  +A E  ++ EE  K  +A K+  EA++ +
Sbjct: 1437 KKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKAD 1473



 Score = 29.3 bits (65), Expect = 3.2
 Identities = 17/104 (16%), Positives = 53/104 (50%), Gaps = 1/104 (0%)

Query: 205  MRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEM 264
            +++   +    +++K +  EEK   ++ +   +      E+A+K+ +E  ++ ++ + + 
Sbjct: 1694 LKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDE 1753

Query: 265  AKRDQDLMEAQEMIRRLEEQLKQLQAA-KEELEARQTELQLELQ 307
             ++ +     +E  ++ EE  K+ +A  +EEL+    + ++E+ 
Sbjct: 1754 EEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVD 1797



 Score = 29.3 bits (65), Expect = 3.3
 Identities = 26/120 (21%), Positives = 50/120 (41%), Gaps = 4/120 (3%)

Query: 184  APRVRVNKRILALCMGNHELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAR 243
            A   R N+ I          + RR+     +  +   + ++ +  K+    K   E    
Sbjct: 1245 AEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKKAEEKKKA 1304

Query: 244  EKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQ 303
            ++A+KK +E+    K+ +    K ++   +A    ++ EE  K  +AAK E EA   E +
Sbjct: 1305 DEAKKKAEEA----KKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAE 1360



 Score = 29.0 bits (64), Expect = 4.0
 Identities = 24/98 (24%), Positives = 49/98 (50%), Gaps = 5/98 (5%)

Query: 205  MRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEM 264
             ++ +   I  +++K +A EEK   +Q + K   E    E+ +K  +E+  +  +     
Sbjct: 1612 AKKAEEAKIKAEELK-KAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEA--- 1667

Query: 265  AKRDQDLMEAQEMIRRLEEQLKQLQA-AKEELEARQTE 301
             K ++D  +A+E  +  E++ K  +A  KE  EA++ E
Sbjct: 1668 KKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAE 1705



 Score = 28.6 bits (63), Expect = 5.3
 Identities = 23/81 (28%), Positives = 46/81 (56%), Gaps = 2/81 (2%)

Query: 221  QAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRR 280
            +AR+ + AK++  D  + E A + +  +K +E+ +      VE+A++ +D  +A+E   R
Sbjct: 1113 EARKAEEAKKKAEDARKAEEARKAEDARKAEEARKAEDAKRVEIARKAEDARKAEE--AR 1170

Query: 281  LEEQLKQLQAAKEELEARQTE 301
              E  K+ +AA++  E R+ E
Sbjct: 1171 KAEDAKKAEAARKAEEVRKAE 1191



 Score = 28.6 bits (63), Expect = 6.0
 Identities = 19/86 (22%), Positives = 43/86 (50%), Gaps = 1/86 (1%)

Query: 216  QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
             + K +A E K A++ ++   + + A   K + +  +  +  K+   E  K+  +  +A 
Sbjct: 1447 DEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAA 1506

Query: 276  EMIRRLEEQLKQLQAAKEELEARQTE 301
            E  ++ +E  K+ + AK+  EA++ E
Sbjct: 1507 EAKKKADE-AKKAEEAKKADEAKKAE 1531



 Score = 28.6 bits (63), Expect = 6.1
 Identities = 19/83 (22%), Positives = 47/83 (56%), Gaps = 2/83 (2%)

Query: 219  KAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMI 278
            KA   ++  A +++ D+ + +   ++KA++  +++ E  K  E +  K+ ++  +A+E  
Sbjct: 1406 KADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAK--KKAEEAKKAEEAK 1463

Query: 279  RRLEEQLKQLQAAKEELEARQTE 301
            ++ EE  K  +A K+  EA++ +
Sbjct: 1464 KKAEEAKKADEAKKKAEEAKKAD 1486



 Score = 28.6 bits (63), Expect = 6.3
 Identities = 21/93 (22%), Positives = 43/93 (46%), Gaps = 2/93 (2%)

Query: 207  RRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAK 266
            ++K D    +  +A+   E    + +    + E AA EKAE   ++  E  K+ +    K
Sbjct: 1328 KKKADAAKKKAEEAKKAAEAAKAEAEAAADEAE-AAEEKAEAAEKKKEEAKKKAD-AAKK 1385

Query: 267  RDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQ 299
            + ++  +A E  ++ EE  K+    K+   A++
Sbjct: 1386 KAEEKKKADEAKKKAEEDKKKADELKKAAAAKK 1418


>gnl|CDD|203244 pfam05400, FliT, Flagellar protein FliT.  This family contains
           several bacterial flagellar FliT proteins. The flagellar
           proteins FlgN and FliT have been proposed to act as
           substrate specific export chaperones, facilitating
           incorporation of the enterobacterial hook-associated
           axial proteins (HAPs) FlgK/FlgL and FliD into the
           growing flagellum. In Salmonella typhimurium flgN and
           fliT mutants, the export of target HAPs is reduced,
           concomitant with loss of unincorporated flagellin into
           the surrounding medium.
          Length = 82

 Score = 29.6 bits (67), Expect = 0.42
 Identities = 16/71 (22%), Positives = 33/71 (46%), Gaps = 7/71 (9%)

Query: 247 EKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLEL 306
           E + ++ +ERL++  +   + +    E +E++RR+     ++   +  L+ R  EL   L
Sbjct: 17  EDEREQLLERLREAPLSPPESE----EKRELLRRILANDAEI---RALLQPRLDELSQLL 69

Query: 307 QPIRNQSNLNK 317
              R Q   N 
Sbjct: 70  GQARRQKKANN 80


>gnl|CDD|220555 pfam10079, DUF2317, Uncharacterized protein conserved in bacteria
           (DUF2317).  Members of this family of hypothetical
           bacterial proteins have no known function.
          Length = 541

 Score = 31.8 bits (73), Expect = 0.45
 Identities = 22/98 (22%), Positives = 40/98 (40%), Gaps = 17/98 (17%)

Query: 202 ELYMRRRKPDTID--VQQMKAQAREEKNAKQQQRDKLQ--LEIAAREKAEKKHQESVERL 257
           E  +  +    ID   ++ KA+   +    +++  K+   LE A  E  E K  + +E L
Sbjct: 412 EKVLEEKANSEIDIDFEEAKAELEAQFKRLREEAAKIDPSLEGAL-EANEAKILKQLEFL 470

Query: 258 KQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEEL 295
           ++       +      AQ   R+ EE L+Q    +  L
Sbjct: 471 EK----RLLK------AQR--RKHEELLRQFDRLQTAL 496



 Score = 31.8 bits (73), Expect = 0.56
 Identities = 21/94 (22%), Positives = 41/94 (43%), Gaps = 7/94 (7%)

Query: 221 QAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQE---- 276
                K    +++   +++I   E A+ + +   +RL++   ++    +  +EA E    
Sbjct: 406 GLELLKEKVLEEKANSEIDIDFEE-AKAELEAQFKRLREEAAKIDPSLEGALEANEAKIL 464

Query: 277 -MIRRLEEQLKQLQAAKEELEARQ-TELQLELQP 308
             +  LE++L + Q  K E   RQ   LQ  L P
Sbjct: 465 KQLEFLEKRLLKAQRRKHEELLRQFDRLQTALFP 498


>gnl|CDD|223914 COG0845, AcrA, Membrane-fusion protein [Cell envelope biogenesis,
           outer membrane].
          Length = 372

 Score = 31.6 bits (71), Expect = 0.47
 Identities = 14/82 (17%), Positives = 27/82 (32%), Gaps = 2/82 (2%)

Query: 211 DTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREK--AEKKHQESVERLKQLEVEMAKRD 268
            +  +Q    QA  +    Q      +L    RE   A +K   S   L   +  +   +
Sbjct: 99  PSAVLQAALDQAEAQLARAQALLAPAELGDLQREAKLAAEKAAVSQAELDAAQALLRAAE 158

Query: 269 QDLMEAQEMIRRLEEQLKQLQA 290
             +  AQ  +   +  L+  + 
Sbjct: 159 ALVEAAQAALASAKLNLEYTRI 180


>gnl|CDD|205692 pfam13514, AAA_27, AAA domain.  This domain is found in a number of
           double-strand DNA break proteins. This domain contains a
           P-loop motif.
          Length = 1118

 Score = 32.0 bits (73), Expect = 0.48
 Identities = 22/89 (24%), Positives = 36/89 (40%), Gaps = 1/89 (1%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
           D++Q  A    +    +++ + L  E  A E       E+   LKQ             +
Sbjct: 754 DLRQRIAAMERDLARFEEEVEAL-AEAVAPEMLGTPADETARALKQRLKRARDTAAAAEK 812

Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQTEL 302
             E I   E+++ +  AA +E EAR T L
Sbjct: 813 LAEEIEEAEKEVSEAAAALDEAEARLTAL 841



 Score = 30.5 bits (69), Expect = 1.2
 Identities = 15/88 (17%), Positives = 30/88 (34%), Gaps = 16/88 (18%)

Query: 220 AQAREEKNAKQQQRDKLQLEIAAREKAEK---------KHQESVERLKQLEVEMAKRDQD 270
            +AR E  A ++  + L   + A +                 ++E L Q   E       
Sbjct: 245 EEARAELRAARRNLELLTERLEALQAELDEISLDEELLAQAAAIEALHQQRGEYRN---- 300

Query: 271 LMEAQEMIRRLEEQLKQLQAAKEELEAR 298
              A++ +   E ++   + A   L A+
Sbjct: 301 ---AEQDLPDREGEIANAREAAAALLAQ 325



 Score = 30.5 bits (69), Expect = 1.3
 Identities = 20/96 (20%), Positives = 42/96 (43%), Gaps = 6/96 (6%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQE---SVERLKQLEVEMAK---R 267
           ++ Q    A     +    RD+L+ E     K   +  E   +++ LK+LE E+ +   +
Sbjct: 113 ELGQYLFSAAAGVGSLGSVRDELEKEADELWKPRGRKPEINVALKELKELEAEIREVQLK 172

Query: 268 DQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQ 303
            +   +  + +   EE+L  L+    +LE  +  L+
Sbjct: 173 TRTWKDLVKALDEAEEELANLRKELRQLEKEKQRLE 208



 Score = 29.3 bits (66), Expect = 3.4
 Identities = 23/115 (20%), Positives = 47/115 (40%), Gaps = 14/115 (12%)

Query: 202 ELYMRRRKPDTIDVQQ---MKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLK 258
            L   + + D I + +    +A A E  + ++ +    + ++  RE      +E+   L 
Sbjct: 264 RLEALQAELDEISLDEELLAQAAAIEALHQQRGEYRNAEQDLPDREGEIANAREAAAALL 323

Query: 259 Q-----LEVEMAKRDQDLMEA-----QEMIRRLEEQLKQLQAAKEELEARQTELQ 303
                  + E A        A      E+ +R E   + L++A++ LE R+ EL+
Sbjct: 324 AQIGPDADEE-AVESLRPSLAAKETVTELEKRKEALDQALKSARDALEERERELK 377



 Score = 29.0 bits (65), Expect = 4.5
 Identities = 24/109 (22%), Positives = 41/109 (37%), Gaps = 9/109 (8%)

Query: 213 IDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAE---------KKHQESVERLKQLEVE 263
            D    +A ARE+  A  + R +L   +  R +                  E L +L   
Sbjct: 608 EDWLAERATAREQVRAYFKARAELDALLDRRARLRAALRAALKAVAIVLPGEELAELLEL 667

Query: 264 MAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
             +  ++  +       L+E+L+  + A EE E R  E Q  L+  + Q
Sbjct: 668 ARQLLEEAEKQAARKASLDERLRDAERALEEAEERHDEAQSALEAWQEQ 716



 Score = 28.6 bits (64), Expect = 5.2
 Identities = 16/80 (20%), Positives = 31/80 (38%)

Query: 219 KAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMI 278
           + +    K    Q   +L L     E+       S E+++    E  +  Q   + ++ +
Sbjct: 416 EQEVAVAKRELAQALSRLGLWRGDLEELVALPVPSAEQVQAFLKEAEEIMQAKRQKRDRL 475

Query: 279 RRLEEQLKQLQAAKEELEAR 298
             LEE L+ L+     L+A 
Sbjct: 476 LELEEDLETLELQLRHLDAA 495


>gnl|CDD|223571 COG0497, RecN, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 557

 Score = 31.8 bits (73), Expect = 0.48
 Identities = 20/81 (24%), Positives = 34/81 (41%), Gaps = 6/81 (7%)

Query: 229 KQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQL 288
            + QR  L       E A++ +QE+ +  KQ   E+    +   E  +    L+ QL++L
Sbjct: 138 PELQRQLLDAFAGLEELAQEAYQEAYQAWKQARRELEDLQEKERERAQRADLLQFQLEEL 197

Query: 289 QAAK------EELEARQTELQ 303
           +         EELE  +  L 
Sbjct: 198 EELNLQPGEDEELEEERKRLS 218


>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail.  The myosin molecule is a
           multi-subunit complex made up of two heavy chains and
           four light chains it is a fundamental contractile
           protein found in all eukaryote cell types. This family
           consists of the coiled-coil myosin heavy chain tail
           region. The coiled-coil is composed of the tail from two
           molecules of myosin. These can then assemble into the
           macromolecular thick filament. The coiled-coil region
           provides the structural backbone the thick filament.
          Length = 859

 Score = 31.9 bits (73), Expect = 0.49
 Identities = 25/96 (26%), Positives = 50/96 (52%), Gaps = 15/96 (15%)

Query: 214 DVQQMKAQAREEK-NAKQQQRDKLQLEIAARE------KAE--------KKHQESVERLK 258
           D  ++  + R+E+ +++  +R + QLE   +E      +AE        K  Q+   R++
Sbjct: 694 DAARLAEELRQEQEHSQHLERLRKQLESQVKELQVRLDEAEAAALKGGKKMIQKLEARVR 753

Query: 259 QLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEE 294
           +LE E+    +   E Q+ +R++E ++K+LQ   EE
Sbjct: 754 ELEAELDGEQRRHAETQKNLRKMERRVKELQFQVEE 789



 Score = 31.2 bits (71), Expect = 0.89
 Identities = 25/88 (28%), Positives = 43/88 (48%), Gaps = 10/88 (11%)

Query: 232 QRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAA 291
           + ++L+ ++ A  +  K  Q+ +  L     E  +    + E ++  RRLE +  +LQAA
Sbjct: 420 ELEELKDQVEALRRENKNLQDEIHDLTDQLGEGGRN---VHELEKARRRLEAEKDELQAA 476

Query: 292 KEELEA--RQTE-----LQLELQPIRNQ 312
            EE EA     E      Q+EL  IR++
Sbjct: 477 LEEAEAALELEESKVLRAQVELSQIRSE 504



 Score = 31.2 bits (71), Expect = 0.94
 Identities = 36/134 (26%), Positives = 60/134 (44%), Gaps = 36/134 (26%)

Query: 215 VQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQL-------------- 260
           V Q++ + +E +   ++  ++L+ E AAR KAEK   +    L++L              
Sbjct: 34  VAQLQKKIKELEARIRELEEELEAERAARAKAEKARADLSRELEELSERLEEAGGATAAQ 93

Query: 261 -------EVEMAKRDQDLMEA---------------QEMIRRLEEQLKQLQAAKEELEAR 298
                  E E+AK  +DL EA               Q+ I  L EQ++QLQ  K + E  
Sbjct: 94  IELNKKREAELAKLRKDLEEANLQHEEALATLRKKHQDAINELSEQIEQLQKQKAKAEKE 153

Query: 299 QTELQLELQPIRNQ 312
           +++LQ E+  +  Q
Sbjct: 154 KSQLQAEVDDLLAQ 167



 Score = 30.0 bits (68), Expect = 2.0
 Identities = 27/112 (24%), Positives = 57/112 (50%), Gaps = 19/112 (16%)

Query: 213 IDVQQMKA------QAREE-----KNAKQQQRDKLQ--LEIAAREKAE----KKHQESVE 255
           +++ Q+++        +EE     +   Q+  + LQ  LE  A+ KAE    KK  E   
Sbjct: 496 VELSQIRSEIERRLAEKEEEFENTRKNHQRAIESLQATLEAEAKGKAEASRLKKKLEG-- 553

Query: 256 RLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
            + +LE+ +   ++   EAQ+ +++ ++Q+K+LQ   EE +  + + + +L 
Sbjct: 554 DINELEIALDHANKANAEAQKNVKKYQQQVKELQTQVEEEQRAREDAREQLA 605



 Score = 28.9 bits (65), Expect = 4.3
 Identities = 26/92 (28%), Positives = 45/92 (48%)

Query: 215 VQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEA 274
           V++++ Q  EE+ A++  R++L +        E + +E    L+Q E    + + +L EA
Sbjct: 583 VKELQTQVEEEQRAREDAREQLAVAERRATALEAELEELRSALEQAERARKQAETELAEA 642

Query: 275 QEMIRRLEEQLKQLQAAKEELEARQTELQLEL 306
            E +  L  Q   L A K +LE     LQ +L
Sbjct: 643 SERVNELTAQNSSLIAQKRKLEGELAALQSDL 674



 Score = 28.1 bits (63), Expect = 8.9
 Identities = 31/109 (28%), Positives = 54/109 (49%), Gaps = 10/109 (9%)

Query: 211 DTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQD 270
           + I+ Q  K +A+ EK   Q Q +   L +A  ++  K    + ++ KQLE +       
Sbjct: 138 EQIE-QLQKQKAKAEKEKSQLQAEVDDL-LAQLDQIAKAKLNAEKKAKQLESQ------- 188

Query: 271 LMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ-SNLNKV 318
           L E Q  +  L+ QL  L + K  L++  ++L  +L+    Q SNL+K+
Sbjct: 189 LSELQVKLDELQRQLNDLTSQKSRLQSENSDLTRQLEEAEAQVSNLSKL 237


>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein.  This domain is found
           in cell division proteins which are required for
           kinetochore-spindle association.
          Length = 312

 Score = 31.5 bits (72), Expect = 0.51
 Identities = 22/115 (19%), Positives = 50/115 (43%), Gaps = 12/115 (10%)

Query: 212 TIDVQQMKAQAREEKNAKQQQRDKL--------QLEIAAREKAEKKHQESVERLKQLEVE 263
             D + +  +     + K + RD+         QL+    E  +    E    L + + +
Sbjct: 157 KEDYKLLMKELELLNSIKPKLRDRKDALEEELRQLKQLEDELEDCDPTE----LDRAKEK 212

Query: 264 MAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNLNKV 318
           + K  Q++M   + +  LEE+L++L++  E+L  +++EL  E+     +    + 
Sbjct: 213 LKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTEIAEAEKKLEQCRG 267


>gnl|CDD|234801 PRK00591, prfA, peptide chain release factor 1; Validated.
          Length = 359

 Score = 31.6 bits (73), Expect = 0.53
 Identities = 28/100 (28%), Positives = 47/100 (47%), Gaps = 21/100 (21%)

Query: 210 PDTI-DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRD 268
           P+ I D ++ +  ++E          +L+  I    +  K+ QE +E  K    EM + +
Sbjct: 25  PEVISDQKRFRKLSKEYA--------ELE-PIVEAYREYKQAQEDLEEAK----EMLEEE 71

Query: 269 QDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQP 308
            D     EM    +E+LK+L+   EELE    EL++ L P
Sbjct: 72  SD----PEMREMAKEELKELEERLEELEE---ELKILLLP 104


>gnl|CDD|214360 CHL00094, dnaK, heat shock protein 70.
          Length = 621

 Score = 31.6 bits (72), Expect = 0.55
 Identities = 22/96 (22%), Positives = 56/96 (58%), Gaps = 8/96 (8%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
           +V++M  +A  EKNA + +  + ++++  + +AE    ++ ++LK+L+ ++++  ++ +E
Sbjct: 506 EVERMVKEA--EKNAAEDKEKREKIDL--KNQAESLCYQAEKQLKELKDKISEEKKEKIE 561

Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPI 309
              +I++L + L+      E +++   ELQ  L  I
Sbjct: 562 --NLIKKLRQALQNDNY--ESIKSLLEELQKALMEI 593


>gnl|CDD|233897 TIGR02499, HrpE_YscL_not, type III secretion apparatus protein,
           HrpE/YscL family.  This model is related to pfam06188,
           but is broader. pfam06188 describes HrpE-like proteins,
           components of bacterial type III secretion systems
           primarily in bacteria that infect plants. This model
           includes also the homologous proteins of animal
           pathogens, such as YscL of Yersinia pestis. This model
           excludes the related protein FliH of the bacterial
           flagellar apparatus (see pfam02108) [Protein fate,
           Protein and peptide secretion and trafficking, Cellular
           processes, Pathogenesis].
          Length = 166

 Score = 30.7 bits (70), Expect = 0.57
 Identities = 22/71 (30%), Positives = 38/71 (53%), Gaps = 4/71 (5%)

Query: 220 AQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVER-LKQLEVEMAKRDQD-LMEAQEM 277
           AQA+    A +Q+ + +  +  A E+AE   Q   E+ L+Q   E A +  +   EA+++
Sbjct: 13  AQAQAILAAARQRAEAILAD--AEEEAEASRQLGYEQGLEQFWQEAAAQLAEWQQEAEQL 70

Query: 278 IRRLEEQLKQL 288
              LEE+L +L
Sbjct: 71  EASLEERLAEL 81


>gnl|CDD|234541 TIGR04320, Surf_Exclu_PgrA, SEC10/PgrA surface exclusion domain.
           This model describes a conserved domain found in surface
           proteins of a number of Firmutes. Many members have
           LPXTG C-terminal anchoring motifs and a substantial
           number have the KxYKxGKxW putative sorting signal at the
           N-terminus. The tetracycline resistance plasmid pCF10 in
           Enterococcus faecalis promotes conjugal plasmid transfer
           in response to sex pheromones, but PgrA/Sec10 encoded by
           that plasmid, a member of this family, specifically
           inhibits the ability of cells to receive homologous
           plasmids. The phenomenon is called surface exclusion.
          Length = 356

 Score = 31.2 bits (71), Expect = 0.58
 Identities = 21/94 (22%), Positives = 35/94 (37%), Gaps = 2/94 (2%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
             Q   A A+   N  Q      Q   AA + A    Q+ +   +   ++ A+   +L  
Sbjct: 265 TAQADLAAAQTALNTAQAALTTAQTAYAAAQAALATAQKELANAQAQALQTAQ--NNLAT 322

Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
           AQ  +   E +L + + A   L A   + Q  L 
Sbjct: 323 AQAALANAEARLAKAKEALANLNADLAKKQAALD 356


>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
          Length = 425

 Score = 31.6 bits (73), Expect = 0.60
 Identities = 22/77 (28%), Positives = 41/77 (53%), Gaps = 11/77 (14%)

Query: 239 EIAAREKAEKKHQESVERLKQL------EVEMAKRDQDLMEAQEMI---RRLEEQLKQLQ 289
           E+   ++  ++ Q  +E L+        E+  AKR  +  +A+ +I   + L+E++K L+
Sbjct: 29  ELLELDEERRELQTELEELQAERNALSKEIGQAKRKGE--DAEALIAEVKELKEEIKALE 86

Query: 290 AAKEELEARQTELQLEL 306
           A  +ELEA   EL L +
Sbjct: 87  AELDELEAELEELLLRI 103


>gnl|CDD|227606 COG5281, COG5281, Phage-related minor tail protein [Function
           unknown].
          Length = 833

 Score = 31.5 bits (71), Expect = 0.62
 Identities = 24/101 (23%), Positives = 45/101 (44%), Gaps = 6/101 (5%)

Query: 209 KPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERL--KQLEVEMAK 266
           K    D+ Q +AQ+R +  A ++ ++++     A  KA    Q+ +  L   + +    K
Sbjct: 460 KAYQEDILQREAQSRGKTAAAERSQEQM----TAALKALLAFQQQIADLSGAKEKASDQK 515

Query: 267 RDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
                 E Q  + + E + +QLQ  K  LE ++  L+   Q
Sbjct: 516 SLLWKAEEQYALLKEEAKQRQLQEQKALLEHKKETLEYTSQ 556


>gnl|CDD|236842 PRK11091, PRK11091, aerobic respiration control sensor protein
           ArcB; Provisional.
          Length = 779

 Score = 31.4 bits (72), Expect = 0.62
 Identities = 18/70 (25%), Positives = 38/70 (54%), Gaps = 4/70 (5%)

Query: 234 DKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKE 293
            KL+     RE+  + + +  + + QL  E+A+R++     QE   +L+ ++K+ +  + 
Sbjct: 92  AKLE---EMRERDLELNVQLKDNIAQLNQEIAEREKAEEARQEAFEQLKNEIKEREETQI 148

Query: 294 ELEARQTELQ 303
           ELE +Q+ L 
Sbjct: 149 ELE-QQSSLL 157


>gnl|CDD|220388 pfam09766, FimP, Fms-interacting protein.  This entry carries part
           of the crucial 144 N-terminal residues of the FmiP
           protein, which is essential for the binding of the
           protein to the cytoplasmic domain of activated
           Fms-molecules in M-CSF induced haematopoietic
           differentiation of macrophages. The C-terminus contains
           a putative nuclear localisation sequence and a leucine
           zipper which suggest further, as yet unknown, nuclear
           functions. The level of FMIP expression might form a
           threshold that determines whether cells differentiate
           into macrophages or into granulocytes.
          Length = 352

 Score = 31.2 bits (71), Expect = 0.68
 Identities = 28/131 (21%), Positives = 51/131 (38%), Gaps = 24/131 (18%)

Query: 200 NHELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQ---LEIAAREKAEKKHQE---S 253
           N   ++R  K         + +  EEK+     R +LQ    E +  EK   +  +    
Sbjct: 7   NRLAHIRIEK--------GRDKTHEEKDKVDSLRLQLQNLLYEKSHLEKEINRCLDFKSK 58

Query: 254 VERLKQLEVEM----AKRDQDLMEA------QEMIRRLEEQLKQLQAAKEELEARQTELQ 303
            E ++ + +E     A  D    E         M+ RLE +L+Q +   E+L+  ++E +
Sbjct: 59  DEEIELVPLEEFYAQAPEDISRPEKTPLDEHNLMLARLEWELEQRKELAEKLKELESEKE 118

Query: 304 LELQPIRNQSN 314
              Q I  +  
Sbjct: 119 KLQQDIEKKKE 129


>gnl|CDD|233065 TIGR00634, recN, DNA repair protein RecN.  All proteins in this
           family for which functions are known are ATP binding
           proteins involved in the initiation of recombination and
           recombinational repair [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 563

 Score = 31.2 bits (71), Expect = 0.69
 Identities = 16/70 (22%), Positives = 35/70 (50%), Gaps = 5/70 (7%)

Query: 231 QQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME-----AQEMIRRLEEQL 285
           +  +K++ E+   + +++  +   E + +LE E+ K    L       A+ + +R+E++L
Sbjct: 329 EYAEKIKEELDQLDDSDESLEALEEEVDKLEEELDKAAVALSLIRRKAAERLAKRVEQEL 388

Query: 286 KQLQAAKEEL 295
           K L   K E 
Sbjct: 389 KALAMEKAEF 398



 Score = 30.9 bits (70), Expect = 1.0
 Identities = 21/72 (29%), Positives = 33/72 (45%), Gaps = 3/72 (4%)

Query: 250 HQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELE--ARQTELQLELQ 307
             E V+  ++L     K  Q L + Q+  + L ++L  LQ   EELE    Q      L+
Sbjct: 156 ANEKVKAYRELYQAWLKARQQLKDRQQKEQELAQRLDFLQFQLEELEEADLQPGEDEALE 215

Query: 308 PIRNQ-SNLNKV 318
             + + SNL K+
Sbjct: 216 AEQQRLSNLEKL 227



 Score = 27.8 bits (62), Expect = 9.2
 Identities = 15/86 (17%), Positives = 35/86 (40%), Gaps = 9/86 (10%)

Query: 217 QMKAQAREEKNAKQQQRDKLQLEIAAR-EKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
           +   +   E   +  Q  +L+ +  A  E+  +  ++  E L QL+      D+ L   +
Sbjct: 297 EFDPERLNEIEERLAQIKRLKRKYGASVEEVLEYAEKIKEELDQLDD----SDESL---E 349

Query: 276 EMIRRLEEQLKQLQAAKEELEA-RQT 300
            +   +++  ++L  A   L   R+ 
Sbjct: 350 ALEEEVDKLEEELDKAAVALSLIRRK 375


>gnl|CDD|224241 COG1322, COG1322, Predicted nuclease of restriction
           endonuclease-like fold, RmuC family [General function
           prediction only].
          Length = 448

 Score = 31.2 bits (71), Expect = 0.71
 Identities = 22/113 (19%), Positives = 43/113 (38%), Gaps = 11/113 (9%)

Query: 212 TIDVQQMKAQAREEKNAKQQQRDKLQLEIAAR------EKAEKKHQESVERLKQLEVEMA 265
              +QQ   Q+RE+     +   +L  E          E   +  + + + LKQL  +  
Sbjct: 79  KARLQQQLLQSREQLQLLIESLAQLSSEFQELANEIFEELNRRLAELNQQNLKQLL-KPL 137

Query: 266 KRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNLNKV 318
           +   +    Q + +R+ E  ++     EE++    E+Q   Q  +   NL   
Sbjct: 138 REVLEKFREQ-LEQRIHESAEERSTLLEEIDRLLGEIQ---QLAQEAGNLTAA 186


>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa).  Members of this
           family of proteins are part of the yeast nuclear pore
           complex-associated pre-60S ribosomal subunit. The family
           functions as a highly conserved exonuclease that is
           required for the 5'-end maturation of 5.8S and 25S
           rRNAs, demonstrating that 5'-end processing also has a
           redundant pathway. Nop25 binds late pre-60S ribosomes,
           accompanying them from the nucleolus to the nuclear
           periphery; and there is evidence for both physical and
           functional links between late 60S subunit processing and
           export.
          Length = 134

 Score = 30.0 bits (68), Expect = 0.77
 Identities = 16/59 (27%), Positives = 35/59 (59%), Gaps = 1/59 (1%)

Query: 243 REKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTE 301
           R+KA+++ +E  ER +++E     R++   E ++ ++  +E LK L+   ++ E  +TE
Sbjct: 35  RKKAQEEAKEK-EREERIEERKRIREERKQELEKQLKERKEALKLLEEENDDEEDAETE 92


>gnl|CDD|233547 TIGR01730, RND_mfp, RND family efflux transporter, MFP subunit.
           This model represents the MFP (membrane fusion protein)
           component of the RND family of transporters. RND refers
           to Resistance, Nodulation, and cell Division. It is, in
           part, a subfamily of pfam00529 (Pfam release 7.5) but
           hits substantial numbers of proteins missed by that
           model. The related HlyD secretion protein, for which
           pfam00529 is named, is outside the scope of this model.
           Attributed functions imply outward transport. These
           functions include nodulation, acriflavin resistance,
           heavy metal efflux, and multidrug resistance proteins.
           Most members of this family are found in Gram-negative
           bacteria. The proposed function of MFP proteins is to
           bring the inner and outer membranes together and enable
           transport to the outside of the outer membrane. Note,
           however, that a few members of this family are found in
           Gram-positive bacteria, where there is no outer membrane
           [Transport and binding proteins, Unknown substrate].
          Length = 322

 Score = 31.1 bits (71), Expect = 0.78
 Identities = 21/85 (24%), Positives = 35/85 (41%), Gaps = 7/85 (8%)

Query: 222 AREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRL 281
           AR + +  Q        ++AA E   +  Q S ER ++L     KR+     +Q  +   
Sbjct: 55  ARLDDDDYQLALQAALAQLAAAEAQLELAQRSFERAERL----VKRN---AVSQADLDDA 107

Query: 282 EEQLKQLQAAKEELEARQTELQLEL 306
           +  ++  QA  E  +A     QL L
Sbjct: 108 KAAVEAAQADLEAAKASLASAQLNL 132


>gnl|CDD|220278 pfam09523, DUF2390, Protein of unknown function (DUF2390).  Members
           of this family are bacterial hypothetical proteins,
           about 160 amino acids in length, found in various
           proteobacteria, including members of the genera
           Pseudomonas and Vibrio. The C-terminal region is poorly
           conserved and is not included in the model.
          Length = 109

 Score = 29.5 bits (67), Expect = 0.78
 Identities = 15/55 (27%), Positives = 25/55 (45%), Gaps = 3/55 (5%)

Query: 253 SVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
           S E L+ L+   A   +++++    +R L  QLKQ      EL   +     EL+
Sbjct: 48  SAEDLQALQAAAAPWREEVVQP---LRALRRQLKQAADETPELALYEQLKAAELE 99


>gnl|CDD|216174 pfam00887, ACBP, Acyl CoA binding protein. 
          Length = 87

 Score = 29.1 bits (66), Expect = 0.79
 Identities = 10/23 (43%), Positives = 19/23 (82%)

Query: 27 WKEHKGMLREDAMMEYLKIAQDL 49
          WK+ KGM +E+AM +Y+++ ++L
Sbjct: 60 WKKLKGMSKEEAMKKYVELVEEL 82


>gnl|CDD|235582 PRK05729, valS, valyl-tRNA synthetase; Reviewed.
          Length = 874

 Score = 31.2 bits (72), Expect = 0.87
 Identities = 26/82 (31%), Positives = 40/82 (48%), Gaps = 14/82 (17%)

Query: 211 DTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKK-HQES-VERLKQLEVEMAKRD 268
             IDV+   A  R EK     +  KL+ EI   E+ EKK   E  V +  +  VE  +  
Sbjct: 805 GLIDVEAELA--RLEK-----ELAKLEKEI---ERVEKKLSNEGFVAKAPEEVVEKEREK 854

Query: 269 QDLMEAQEMIRRLEEQLKQLQA 290
               E +E + +L+E+L +L+A
Sbjct: 855 LA--EYEEKLAKLKERLARLKA 874


>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 157

 Score = 29.9 bits (68), Expect = 0.88
 Identities = 23/97 (23%), Positives = 56/97 (57%), Gaps = 1/97 (1%)

Query: 212 TIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDL 271
            +DVQ++ +++   K A++Q   + +   A  +K EK+ Q+  ++L++    +++ +   
Sbjct: 20  VVDVQKVLSESPAGKAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSE-EARK 78

Query: 272 MEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQP 308
            + QE+ ++ +E  ++ QAA++EL+ +Q EL   +  
Sbjct: 79  AKQQELQQKQQELQQKQQAAQQELQQKQQELLQPIYD 115


>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
          Length = 1036

 Score = 31.0 bits (70), Expect = 0.89
 Identities = 19/71 (26%), Positives = 39/71 (54%), Gaps = 5/71 (7%)

Query: 218 MKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEM 277
           M   + E+   ++++R   +LE  A+E+AE++ Q   +R  + E E A  + D  +A+  
Sbjct: 241 MDEHSFEDFLLEEKRR---ELEKLAKEEAERERQAEEQR--RREEEKAAMEADRAQAKAE 295

Query: 278 IRRLEEQLKQL 288
           + +  E+L+ L
Sbjct: 296 VEKRREKLQNL 306


>gnl|CDD|215364 PLN02678, PLN02678, seryl-tRNA synthetase.
          Length = 448

 Score = 30.8 bits (70), Expect = 0.90
 Identities = 25/99 (25%), Positives = 49/99 (49%), Gaps = 12/99 (12%)

Query: 199 GNHELY---MRRRKPDTIDVQQMKA------QAREEKNAKQQQRDKLQLEIAAREKAEKK 249
           G+ EL     RRR      V ++ A      Q + E ++ +++ +KL  E+A  + A++ 
Sbjct: 13  GDPELIRESQRRRFASVELVDEVIALDKEWRQRQFELDSLRKEFNKLNKEVAKLKIAKED 72

Query: 250 HQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQL 288
             E +   K+L+ E+ +++    E QE    L+ +LK +
Sbjct: 73  ATELIAETKELKKEITEKEA---EVQEAKAALDAKLKTI 108


>gnl|CDD|213754 TIGR02971, heterocyst_DevB, ABC exporter membrane fusion protein,
           DevB family.  Members of this protein family are found
           mostly in the Cyanobacteria, but also in the
           Planctomycetes. DevB from Anabaena sp. strain PCC 7120
           is partially characterized as a membrane fusion protein
           of the DevBCA ABC exporter, probably a glycolipid
           exporter, required for heterocyst formation. Most
           Cyanobacteria have one member only, but Nostoc sp. PCC
           7120 has seven members.
          Length = 327

 Score = 31.0 bits (70), Expect = 0.91
 Identities = 17/75 (22%), Positives = 34/75 (45%), Gaps = 6/75 (8%)

Query: 222 AREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRL 281
              E +  + Q D+ +  +A      KK + + +R  +     AK  +D+   Q  + RL
Sbjct: 53  RTAELDVARTQLDEAKARLAQVRAGAKKGEIAAQRAARAA---AKLFKDVAAQQATLNRL 109

Query: 282 EEQLKQLQAAKEELE 296
           E    +L+ A+ E++
Sbjct: 110 E---AELETAQREVD 121


>gnl|CDD|226654 COG4191, COG4191, Signal transduction histidine kinase regulating
           C4-dicarboxylate transport system [Signal transduction
           mechanisms].
          Length = 603

 Score = 31.1 bits (71), Expect = 0.92
 Identities = 17/55 (30%), Positives = 26/55 (47%), Gaps = 9/55 (16%)

Query: 203 LYMRRRKPDTIDVQQMKAQAREEKNAKQQQR--------DKLQLEIAAREKAEKK 249
           L++RRR+           +AR E   + ++R         +LQ EIA RE+AE  
Sbjct: 314 LWLRRRR-RARLRLAELQEARAELERRVEERTADLTRANARLQAEIAEREQAEAA 367


>gnl|CDD|225507 COG2959, HemX, Uncharacterized enzyme of heme biosynthesis
           [Coenzyme metabolism].
          Length = 391

 Score = 30.9 bits (70), Expect = 0.94
 Identities = 23/76 (30%), Positives = 35/76 (46%), Gaps = 10/76 (13%)

Query: 230 QQQRDKLQLEIAARE---KAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLK 286
           QQQ      E+ A +   KA +  QE+ + L QLE         + + Q  + RLE QL+
Sbjct: 55  QQQNVLQTQELQALQQQLKALQLAQENQKLLAQLE-------SLIAQQQAELDRLERQLE 107

Query: 287 QLQAAKEELEARQTEL 302
            LQ    EL+ +   +
Sbjct: 108 TLQKQLSELQKKVATI 123


>gnl|CDD|163506 TIGR03794, NHLM_micro_HlyD, NHLM bacteriocin system secretion
           protein.  Members of this protein family are homologs of
           the HlyD membrane fusion protein of type I secretion
           systems. Their occurrence in prokaryotic genomes is
           associated with the occurrence of a novel class of
           microcin (small bacteriocins) with a leader peptide
           region related to nitrile hydratase. We designate the
           class of bacteriocin as Nitrile Hydratase Leader
           Microcin, or NHLM. This family, therefore, is designated
           as NHLM bacteriocin system secretion protein. Some but
           not all NHLM-class putative microcins belong to the TOMM
           (thiazole/oxazole modified microcin) class as assessed
           by the presence of the scaffolding protein and/or
           cyclodehydratase in the same gene clusters [Transport
           and binding proteins, Amino acids, peptides and amines,
           Cellular processes, Biosynthesis of natural products].
          Length = 421

 Score = 31.0 bits (70), Expect = 0.94
 Identities = 18/91 (19%), Positives = 42/91 (46%), Gaps = 9/91 (9%)

Query: 230 QQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQ 289
             +RD++  +    E+ +    +    +  L+ +  +R+      + +++ L +   QL 
Sbjct: 168 TFKRDRILQQQWREEQEKYDAADKARAIYALQTKADERN-----LETVLQSLSQADFQLA 222

Query: 290 AAKEE----LEARQTELQLELQPIRNQSNLN 316
              E+    +EAR  E + E++ + N+ NLN
Sbjct: 223 GVAEKELETVEARIKEARYEIEELENKLNLN 253


>gnl|CDD|184696 PRK14474, PRK14474, F0F1 ATP synthase subunit B; Provisional.
          Length = 250

 Score = 30.6 bits (69), Expect = 1.0
 Identities = 19/86 (22%), Positives = 40/86 (46%), Gaps = 9/86 (10%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQ-----LEIAAREKAEKKHQESVERLKQLEVEMAKR- 267
           D +Q + +A +E    +Q++  L+         A+E A+++ Q  +   ++ +V  A+  
Sbjct: 47  DAEQRQQEAGQEAERYRQKQQSLEQQRASFMAQAQEAADEQRQHLLNEARE-DVATARDE 105

Query: 268 -DQDL-MEAQEMIRRLEEQLKQLQAA 291
             + L  E QE  + L++Q  Q    
Sbjct: 106 WLEQLEREKQEFFKALQQQTGQQMVK 131


>gnl|CDD|192987 pfam12325, TMF_TATA_bd, TATA element modulatory factor 1 TATA
           binding.  This is the C-terminal conserved coiled coil
           region of a family of TATA element modulatory factor 1
           proteins conserved in eukaryotes. The proteins bind to
           the TATA element of some RNA polymerase II promoters and
           repress their activity. by competing with the binding of
           TATA binding protein. TMF1_TATA_bd is the most conserved
           part of the TMFs. TMFs are evolutionarily conserved
           golgins that bind Rab6, a ubiquitous ras-like
           GTP-binding Golgi protein, and contribute to Golgi
           organisation in animal and plant cells. The Rab6-binding
           domain appears to be the same region as this C-terminal
           family.
          Length = 121

 Score = 29.5 bits (67), Expect = 1.0
 Identities = 16/77 (20%), Positives = 40/77 (51%)

Query: 219 KAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMI 278
            A  ++E    + +RD+ + EI    +  ++ +E  + +++LE E+   +Q      E++
Sbjct: 33  LASLKDELARLEAERDEARQEIVKLTEENEELRELKKEIEELEKELEDLEQRYETTLELL 92

Query: 279 RRLEEQLKQLQAAKEEL 295
               E++++L+A   +L
Sbjct: 93  GEKSERVEELRADVVDL 109



 Score = 26.8 bits (60), Expect = 9.3
 Identities = 18/72 (25%), Positives = 37/72 (51%), Gaps = 1/72 (1%)

Query: 234 DKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKE 293
           ++L   +   E      ++ + RL+    E  +    L E  E +R L++++++L+   E
Sbjct: 20  ERLSSTLRRLEGELASLKDELARLEAERDEARQEIVKLTEENEELRELKKEIEELEKELE 79

Query: 294 ELEAR-QTELQL 304
           +LE R +T L+L
Sbjct: 80  DLEQRYETTLEL 91


>gnl|CDD|153332 cd07648, F-BAR_FCHO, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of FCH domain Only proteins.
            F-BAR domains are dimerization modules that bind and
           bend membranes and are found in proteins involved in
           membrane dynamics and actin reorganization. Proteins in
           this group have been named FCH domain Only (FCHO)
           proteins. Vertebrates have two members, FCHO1 and FCHO2.
           These proteins contain an F-BAR domain and a C-terminal
           domain of unknown function named SAFF which is also
           present in endophilin interacting protein 1. F-BAR
           domains form banana-shaped dimers with a
           positively-charged concave surface that binds to
           negatively-charged lipid membranes. They can induce
           membrane deformation in the form of long tubules.
          Length = 261

 Score = 30.4 bits (69), Expect = 1.1
 Identities = 14/51 (27%), Positives = 29/51 (56%), Gaps = 2/51 (3%)

Query: 253 SVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQ 303
           S E+L +L +++ ++ Q+L   +++ +  EEQ K+ +  KEE       +Q
Sbjct: 68  STEKLSELHLQLVQKLQEL--IKDVQKYGEEQHKKHKKVKEEESGTAEAVQ 116


>gnl|CDD|204935 pfam12474, PKK, Polo kinase kinase.  This domain family is found in
           eukaryotes, and is approximately 140 amino acids in
           length. The family is found in association with
           pfam00069. Polo-like kinase 1 (Plx1) is essential during
           mitosis for the activation of Cdc25C, for spindle
           assembly, and for cyclin B degradation. This family is
           Polo kinase kinase (PKK) which phosphorylates Polo
           kinase and Polo-like kinase to activate them. PKK is a
           serine/threonine kinase.
          Length = 142

 Score = 29.6 bits (67), Expect = 1.1
 Identities = 24/89 (26%), Positives = 44/89 (49%), Gaps = 4/89 (4%)

Query: 228 AKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQ--DLMEAQEMIRRLEEQL 285
            K+Q +D+  LE     K  +K  E +ER ++  +E  ++ Q  +L    + IR   EQ 
Sbjct: 5   QKEQLKDRFFLERTQLLKRHEKELEQLERQQKRTIEKLEQRQTQELRRLPKRIRA--EQK 62

Query: 286 KQLQAAKEELEARQTELQLELQPIRNQSN 314
            +L+  KE L+  + EL+ E++ +     
Sbjct: 63  TRLKMFKESLKIEKKELKQEVEKLPRFQE 91


>gnl|CDD|114855 pfam06160, EzrA, Septation ring formation regulator, EzrA.  During
           the bacterial cell cycle, the tubulin-like cell-division
           protein FtsZ polymerises into a ring structure that
           establishes the location of the nascent division site.
           EzrA modulates the frequency and position of FtsZ ring
           formation.
          Length = 559

 Score = 30.6 bits (70), Expect = 1.2
 Identities = 25/94 (26%), Positives = 46/94 (48%), Gaps = 17/94 (18%)

Query: 239 EIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQA-------- 290
           E+    + EK+ +E   +  QL   +A++     E QE +  +E+QL++++         
Sbjct: 342 ELETVRELEKQLEELEAQYDQLVERIAEKKVAYSELQEELEEIEKQLEEIEKEQEELSES 401

Query: 291 ----AKEELEARQT--ELQLELQPIR---NQSNL 315
                K+ELEAR+   E + +L  I+    +SNL
Sbjct: 402 LQGLRKDELEAREKLQEYRQKLHEIKRYLEKSNL 435



 Score = 28.7 bits (65), Expect = 4.3
 Identities = 24/105 (22%), Positives = 51/105 (48%), Gaps = 15/105 (14%)

Query: 214 DVQQMKAQAREEKNAKQQQRDK---LQLEIAARE----KAEKKHQESVERLKQLEVEMAK 266
           ++ ++K    + +   ++ +DK   L+  + A+      A  + ++   +L +LE E  +
Sbjct: 123 ELNELKESEEKNRKEVEELKDKYRELRKTLLAKSFSYGPALDELEK---QLDELEEEFEQ 179

Query: 267 RDQ-----DLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLEL 306
             +     D +EA+E++ +LEE+   L+   EE+     ELQ E 
Sbjct: 180 FVELTESGDYLEAREVLLKLEEETDALEQKMEEIPPLLKELQNEF 224


>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase.  This model represents the
           seryl-tRNA synthetase found in most organisms. This
           protein is a class II tRNA synthetase, and is recognized
           by the pfam model tRNA-synt_2b. The seryl-tRNA
           synthetases of two archaeal species, Methanococcus
           jannaschii and Methanobacterium thermoautotrophicum,
           differ considerably and are included in a different
           model [Protein synthesis, tRNA aminoacylation].
          Length = 418

 Score = 30.4 bits (69), Expect = 1.2
 Identities = 22/101 (21%), Positives = 44/101 (43%), Gaps = 14/101 (13%)

Query: 206 RRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMA 265
            R     ID++++ A   E K     + ++LQ              +  E  KQ+     
Sbjct: 20  ARGLSVDIDLEKLIALDDERKKL-LSEIEELQ-------------AKRNELSKQIGKAKG 65

Query: 266 KRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLEL 306
           ++   + E ++ ++ L+E+L +L AA + LEA   +  L +
Sbjct: 66  QKKDKIEEIKKELKELKEELTELSAALKALEAELQDKLLSI 106


>gnl|CDD|192773 pfam11559, ADIP, Afadin- and alpha -actinin-Binding.  This family
           is found in mammals where it is localised at cell-cell
           adherens junctions, and in Sch. pombe and other fungi
           where it anchors spindle-pole bodies to spindle
           microtubules. It is a coiled-coil structure, and in
           pombe, it is required for anchoring the minus end of
           spindle microtubules to the centrosome equivalent, the
           spindle-pole body. The name ADIP derives from the family
           being composed of Afadin- and alpha -Actinin-Binding
           Proteins Localised at Cell-Cell Adherens Junctions.
          Length = 149

 Score = 29.6 bits (67), Expect = 1.2
 Identities = 16/70 (22%), Positives = 34/70 (48%), Gaps = 3/70 (4%)

Query: 248 KKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
           ++    +E  + LE  + K + ++   Q  I RL+ QL+ L+     L+A++ +L+ +L 
Sbjct: 43  QQRDRDLEFRESLEETLRKLEAEIERLQNTIERLKTQLEDLERELALLQAKERQLEKKL- 101

Query: 308 PIRNQSNLNK 317
             +      K
Sbjct: 102 --KTLEQKLK 109


>gnl|CDD|225747 COG3206, GumC, Uncharacterized protein involved in
           exopolysaccharide biosynthesis [Cell envelope
           biogenesis, outer membrane].
          Length = 458

 Score = 30.5 bits (69), Expect = 1.3
 Identities = 15/77 (19%), Positives = 41/77 (53%)

Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
           QQ+ A+ R+   +   +   L+ + AA EK   + +  + +L +L+V++ + +++   A+
Sbjct: 327 QQIAAELRQILASLPNELALLEQQEAALEKELAQLKGRLSKLPKLQVQLRELEREAEAAR 386

Query: 276 EMIRRLEEQLKQLQAAK 292
            +   L ++ ++L   +
Sbjct: 387 SLYETLLQRYQELSIQE 403



 Score = 28.6 bits (64), Expect = 4.8
 Identities = 19/99 (19%), Positives = 39/99 (39%), Gaps = 3/99 (3%)

Query: 205 MRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEM 264
                     +  ++AQ  E    +QQ   +L+  +A+        ++    L++   ++
Sbjct: 305 STELGAKHPQLVALEAQLAE---LRQQIAAELRQILASLPNELALLEQQEAALEKELAQL 361

Query: 265 AKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQ 303
             R   L + Q  +R LE + +  ++  E L  R  EL 
Sbjct: 362 KGRLSKLPKLQVQLRELEREAEAARSLYETLLQRYQELS 400



 Score = 28.6 bits (64), Expect = 5.2
 Identities = 19/92 (20%), Positives = 39/92 (42%), Gaps = 16/92 (17%)

Query: 223 REEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLE 282
           R++    +QQ   L  E+ A+            +L  LE ++A+  Q +      +R++ 
Sbjct: 291 RQQYAQVRQQIADLSTELGAKH----------PQLVALEAQLAELRQQI---AAELRQI- 336

Query: 283 EQLKQLQAAKEELEARQTELQLELQPIRNQSN 314
             L  L      LE ++  L+ EL  ++ + +
Sbjct: 337 --LASLPNELALLEQQEAALEKELAQLKGRLS 366


>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 429

 Score = 30.3 bits (69), Expect = 1.3
 Identities = 16/78 (20%), Positives = 37/78 (47%), Gaps = 5/78 (6%)

Query: 239 EIAAREKAEKKHQESVERLKQLEVEMAKR-----DQDLMEAQEMIRRLEEQLKQLQAAKE 293
           ++   ++  +K    +E L+    E++K       +   +A+E+I  ++E  ++L+  + 
Sbjct: 30  KLLELDEERRKLLRELEELQAERNELSKEIGRALKRGEDDAEELIAEVKELKEKLKELEA 89

Query: 294 ELEARQTELQLELQPIRN 311
            L+  + EL   L  I N
Sbjct: 90  ALDELEAELDTLLLTIPN 107


>gnl|CDD|235899 PRK06975, PRK06975, bifunctional uroporphyrinogen-III
           synthetase/uroporphyrin-III C-methyltransferase;
           Reviewed.
          Length = 656

 Score = 30.5 bits (69), Expect = 1.3
 Identities = 17/70 (24%), Positives = 35/70 (50%), Gaps = 1/70 (1%)

Query: 245 KAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQL 304
           K ++  QE V+R +  + + A+      +AQ  + +L+ Q  QL     + ++ Q  L+ 
Sbjct: 347 KVDRLDQELVQRQQANDAQTAELRVKTEQAQASVHQLDSQFAQLDGKLADAQSAQQALEQ 406

Query: 305 ELQPI-RNQS 313
           + Q + RN+ 
Sbjct: 407 QYQDLSRNRD 416


>gnl|CDD|220710 pfam10351, Apt1, Golgi-body localisation protein domain.  This is
           the C-terminus of a family of proteins conserved from
           plants to humans. The plant members are localised to the
           Golgi proteins and appear to regulate membrane
           trafficking, as they are required for rapid vesicle
           accumulation at the tip of the pollen tube. The
           C-terminus probably contains the Golgi localisation
           signal and it is well-conserved.
          Length = 451

 Score = 30.3 bits (69), Expect = 1.4
 Identities = 15/58 (25%), Positives = 27/58 (46%), Gaps = 5/58 (8%)

Query: 242 AREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQ 299
             E  EK  +E +E+L       +    DL      +  L+E++++L   + ELE R+
Sbjct: 68  YVEPKEKSRKERLEKLL-----FSSDFSDLEGLDSRVVALQERIRELAEIRRELEFRE 120


>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3.  This family
           consists of several radial spoke protein 3 (RSP3)
           sequences. Eukaryotic cilia and flagella present in
           diverse types of cells perform motile, sensory, and
           developmental functions in organisms from protists to
           humans. They are centred by precisely organised,
           microtubule-based structures, the axonemes. The axoneme
           consists of two central singlet microtubules, called the
           central pair, and nine outer doublet microtubules. These
           structures are well-conserved during evolution. The
           outer doublet microtubules, each composed of A and B
           sub-fibres, are connected to each other by nexin links,
           while the central pair is held at the centre of the
           axoneme by radial spokes. The radial spokes are T-shaped
           structures extending from the A-tubule of each outer
           doublet microtubule to the centre of the axoneme. Radial
           spoke protein 3 (RSP3), is present at the proximal end
           of the spoke stalk and helps in anchoring the radial
           spoke to the outer doublet. It is thought that radial
           spokes regulate the activity of inner arm dynein through
           protein phosphorylation and dephosphorylation.
          Length = 288

 Score = 30.0 bits (68), Expect = 1.4
 Identities = 19/69 (27%), Positives = 34/69 (49%), Gaps = 7/69 (10%)

Query: 212 TIDVQQMKAQAREE-KNAKQQQRDKLQL------EIAAREKAEKKHQESVERLKQLEVEM 264
           T++   ++    EE    +QQQR   Q       E    E+AE++ +E  ER K+ + E 
Sbjct: 140 TLEQALLEVLEEEELAELRQQQRQFEQRRNAELAETQRLEEAERRRREEKERRKKQDKER 199

Query: 265 AKRDQDLME 273
            +R+++  E
Sbjct: 200 KQREKETAE 208


>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355).  This
           family of proteins is found in bacteria and viruses.
           Proteins in this family are typically between 180 and
           214 amino acids in length.
          Length = 125

 Score = 29.1 bits (66), Expect = 1.5
 Identities = 13/62 (20%), Positives = 30/62 (48%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
           +V +  A+ + +   KQ+++     ++A     EK   E  +  K+LE   A+  +  ++
Sbjct: 12  EVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEELEAELARRELK 71

Query: 274 AQ 275
           A+
Sbjct: 72  AE 73


>gnl|CDD|180941 PRK07352, PRK07352, F0F1 ATP synthase subunit B; Validated.
          Length = 174

 Score = 29.5 bits (67), Expect = 1.5
 Identities = 18/60 (30%), Positives = 31/60 (51%), Gaps = 1/60 (1%)

Query: 243 REKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQ-LQAAKEELEARQTE 301
           RE   +  +E+ ERL+Q    +A+  Q L +AQ+   R+    K   +A + E+E +  E
Sbjct: 52  REAILQALKEAEERLRQAAQALAEAQQKLAQAQQEAERIRADAKARAEAIRAEIEKQAIE 111



 Score = 29.5 bits (67), Expect = 1.6
 Identities = 25/113 (22%), Positives = 51/113 (45%), Gaps = 19/113 (16%)

Query: 206 RRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVER--------L 257
            RR+     +++ + + R+   A  + + KL     A+++AE+   ++  R         
Sbjct: 50  ERREAILQALKEAEERLRQAAQALAEAQQKLA---QAQQEAERIRADAKARAEAIRAEIE 106

Query: 258 KQLEVEMAK----RDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLEL 306
           KQ   +MA+       DL   QE   R+  QL++ +AA+  +   +++L   L
Sbjct: 107 KQAIEDMARLKQTAAADLSAEQE---RVIAQLRR-EAAELAIAKAESQLPGRL 155


>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
           and conversion].
          Length = 161

 Score = 29.2 bits (66), Expect = 1.5
 Identities = 22/90 (24%), Positives = 47/90 (52%), Gaps = 4/90 (4%)

Query: 219 KAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMI 278
           + QA+   +  + +R K + + A   + E++ +E+        +E AK++ +   A+E+ 
Sbjct: 37  ERQAKIADDLAEAERLKEEAQ-ALLAEYEQELEEA-REQASEIIEQAKKEAE-QIAEEIK 93

Query: 279 RRLEEQL-KQLQAAKEELEARQTELQLELQ 307
              EE+L +  +AA+ E+EA +     EL+
Sbjct: 94  AEAEEELERIKEAAEAEIEAEKERALEELR 123



 Score = 27.6 bits (62), Expect = 5.0
 Identities = 19/91 (20%), Positives = 48/91 (52%), Gaps = 11/91 (12%)

Query: 227 NAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLK 286
            A  +++ K+  ++A   +AE+  +E+   L + E E+ +  +   +A E+I + +++ +
Sbjct: 33  KALDERQAKIADDLA---EAERLKEEAQALLAEYEQELEEARE---QASEIIEQAKKEAE 86

Query: 287 QL-----QAAKEELEARQTELQLELQPIRNQ 312
           Q+       A+EELE  +   + E++  + +
Sbjct: 87  QIAEEIKAEAEEELERIKEAAEAEIEAEKER 117


>gnl|CDD|219186 pfam06810, Phage_GP20, Phage minor structural protein GP20.  This
           family consists of several phage minor structural
           protein GP20 sequences of around 180 residues in length.
           The function of this family is unknown.
          Length = 156

 Score = 29.2 bits (66), Expect = 1.6
 Identities = 12/68 (17%), Positives = 35/68 (51%), Gaps = 4/68 (5%)

Query: 240 IAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQ 299
              + K +K ++E       L+ ++  RD+ + + ++ ++  EE  K+++  K++ +  +
Sbjct: 16  QNPKSKLDKANEE----RDSLKKQLKNRDKQIEDLKKKVKDNEELQKKIEKLKQQNKTAK 71

Query: 300 TELQLELQ 307
            E + +L 
Sbjct: 72  EEYEAKLA 79


>gnl|CDD|133403 cd04776, HTH_GnyR, Helix-Turn-Helix DNA binding domain of the
           regulatory protein GnyR.  Putative helix-turn-helix
           (HTH) regulatory protein, GnyR, and other related
           proteins. GnyR belongs to the gnyRDBHAL cluster, which
           is involved in acyclic isoprenoid degradation in
           Pseudomonas aeruginosa. These proteins share the
           N-terminal DNA binding domain with other transcription
           regulators of the MerR superfamily that promote
           transcription by reconfiguring the spacer between the
           -35 and -10 promoter elements. A typical MerR regulator
           is comprised of distinct domains that harbor the
           regulatory (effector-binding) site and the active
           (DNA-binding) site. Their conserved N-terminal domains
           contain predicted winged HTH motifs that mediate DNA
           binding, while the dissimilar C-terminal domains bind
           specific coactivator molecules.
          Length = 118

 Score = 28.7 bits (65), Expect = 1.6
 Identities = 6/28 (21%), Positives = 19/28 (67%)

Query: 275 QEMIRRLEEQLKQLQAAKEELEARQTEL 302
           ++M+ ++E++  +L+  + +++A   EL
Sbjct: 79  EKMLEKIEKRRAELEQQRRDIDAALAEL 106


>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin.  Trichoplein
           or mitostatin, was first defined as a meiosis-specific
           nuclear structural protein. It has since been linked
           with mitochondrial movement. It is associated with the
           mitochondrial outer membrane, and over-expression leads
           to reduction in mitochondrial motility whereas lack of
           it enhances mitochondrial movement. The activity appears
           to be mediated through binding the mitochondria to the
           actin intermediate filaments (IFs).
          Length = 349

 Score = 29.9 bits (68), Expect = 1.6
 Identities = 18/94 (19%), Positives = 46/94 (48%), Gaps = 2/94 (2%)

Query: 216 QQMKAQAREEKNAKQQQ--RDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
           +++KA+ +EE+    +    ++L+      E+  K+ +E  E    L+ ++ +R++   E
Sbjct: 30  KRIKAEEKEEERRIDEMMEEERLKALAEEEERERKRKEERREGRAVLQEQIEEREKRRQE 89

Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
             E   +  EQ+ ++    +E +  + + + E Q
Sbjct: 90  EYEERLQEREQMDEIIERIQEEDEAEAQEKREKQ 123



 Score = 29.5 bits (67), Expect = 2.6
 Identities = 18/106 (16%), Positives = 49/106 (46%)

Query: 202 ELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLE 261
           +  M   +   +  ++ + + R+E+  + +   + Q+E   + + E+  +   ER +  E
Sbjct: 44  DEMMEEERLKALAEEEERERKRKEERREGRAVLQEQIEEREKRRQEEYEERLQEREQMDE 103

Query: 262 VEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
           +    +++D  EAQE   + ++  +++    EE   R+ E +   +
Sbjct: 104 IIERIQEEDEAEAQEKREKQKKLREEIDEFNEERIERKEEEKERER 149



 Score = 29.1 bits (66), Expect = 2.8
 Identities = 19/97 (19%), Positives = 51/97 (52%), Gaps = 1/97 (1%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
           + ++ + +  EE+  +++Q D++   I   ++AE + +   ++  + E++    ++   +
Sbjct: 82  EREKRRQEEYEERLQEREQMDEIIERIQEEDEAEAQEKREKQKKLREEIDEFNEERIERK 141

Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIR 310
            +E  R  EE+LK L+  +E+   R+ E + E +  +
Sbjct: 142 EEEKEREREEELKILEYQREK-AEREEEREAERRERK 177



 Score = 28.3 bits (64), Expect = 5.0
 Identities = 20/106 (18%), Positives = 48/106 (45%), Gaps = 8/106 (7%)

Query: 215 VQQMKAQAREEKNAKQQQRDKLQLEIAA--REKAEKKHQESVERLKQLEV------EMAK 266
           +++++ +   E   K++++ KL+ EI     E+ E+K +E     ++         E A+
Sbjct: 105 IERIQEEDEAEAQEKREKQKKLREEIDEFNEERIERKEEEKEREREEELKILEYQREKAE 164

Query: 267 RDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
           R+++    +   +  +E+      A++E    + E   EL+    Q
Sbjct: 165 REEEREAERRERKEEKEREVARLRAQQEEAEDEREELDELRADLYQ 210


>gnl|CDD|205039 pfam12718, Tropomyosin_1, Tropomyosin like.  This family is a set
           of eukaryotic tropomyosins. Within the yeast Tmp1 and
           Tmp2, biochemical and sequence analyses indicate that
           Tpm2p spans four actin monomers along a filament,
           whereas Tpmlp spans five. Despite its shorter length,
           Tpm2p can compete with Tpm1p for binding to F-actin.
           Over-expression of Tpm2p in vivo alters the axial
           budding of haploids to a bipolar pattern, and this can
           be partially suppressed by co-over-expression of Tpm1p.
           This suggests distinct functions for the two
           tropomyosins, and indicates that the ratio between them
           is important for correct morphogenesis. The family also
           contains higher eukaryote Tmp3 members.
          Length = 143

 Score = 29.1 bits (66), Expect = 1.6
 Identities = 21/65 (32%), Positives = 37/65 (56%), Gaps = 7/65 (10%)

Query: 244 EKAEKKHQESVERLKQLEVEMAKRDQDLME-------AQEMIRRLEEQLKQLQAAKEELE 296
           E A+++ +E  E+LK+LE E  +++Q++          +E + +LEEQLK+ +   EE E
Sbjct: 10  ENAQERAEELEEKLKELEQENLEKEQEITSLQKKNQQLEEEVEKLEEQLKEAKEKLEESE 69

Query: 297 ARQTE 301
              T 
Sbjct: 70  KLATN 74


>gnl|CDD|113902 pfam05149, Flagellar_rod, Paraflagellar rod protein.  This family
           consists of several eukaryotic paraflagellar rod
           component proteins. The eukaryotic flagellum represents
           one of the most complex macromolecular structures found
           in any organism and contains more than 250 proteins. In
           addition to its locomotive role, the flagellum is
           probably involved in nutrient uptake since receptors for
           host low-density lipoproteins are localised on the
           flagellar membrane as well as on the flagellar pocket
           membrane.
          Length = 289

 Score = 30.0 bits (68), Expect = 1.6
 Identities = 19/84 (22%), Positives = 35/84 (41%), Gaps = 8/84 (9%)

Query: 226 KNAKQQQRDKLQLEI----AAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQE----M 277
             AK +Q+   + ++      RE+ +    +  +R      E  K  Q  +E Q+     
Sbjct: 8   IEAKHRQKSACEEDLERCKEKREEEDAADAKQRKRYSAQRRESEKFLQQNVEQQQACWRA 67

Query: 278 IRRLEEQLKQLQAAKEELEARQTE 301
           I+ LE +L+ L   + E   R+ E
Sbjct: 68  IQELERELRDLAEERREEVERRIE 91



 Score = 28.1 bits (63), Expect = 5.9
 Identities = 24/101 (23%), Positives = 48/101 (47%), Gaps = 16/101 (15%)

Query: 206 RRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKA----EKKHQESVERLKQLE 261
           R++     D+++ K +  EE  A  +QR +   +    EK      ++ Q     +++LE
Sbjct: 13  RQKSACEEDLERCKEKREEEDAADAKQRKRYSAQRRESEKFLQQNVEQQQACWRAIQELE 72

Query: 262 VEMAKRDQDLMEAQ-EMIRRLEEQLKQLQAAKEELEARQTE 301
            E+    +DL E + E + R      +++A + E E R+T+
Sbjct: 73  REL----RDLAEERREEVER------RIEAVERE-EQRRTD 102


>gnl|CDD|193472 pfam12999, PRKCSH-like, Glucosidase II beta subunit-like.  The
           sequences found in this family are similar to a region
           found in the beta-subunit of glucosidase II, which is
           also known as protein kinase C substrate 80K-H (PRKCSH).
           The enzyme catalyzes the sequential removal of two
           alpha-1,3-linked glucose residues in the second step of
           N-linked oligosaccharide processing. The beta subunit is
           required for the solubility and stability of the
           heterodimeric enzyme, and is involved in retaining the
           enzyme within the endoplasmic reticulum.
          Length = 176

 Score = 29.4 bits (66), Expect = 1.6
 Identities = 14/59 (23%), Positives = 30/59 (50%), Gaps = 3/59 (5%)

Query: 244 EKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTEL 302
           E A +  +   E    ++  +  ++  L+ AQ+    L+++LK+L   ++ L A+Q  L
Sbjct: 121 EIARQFEEYLTEHNNSVKNGLKIKEGLLLAAQKKRDELKKRLKEL---EDSLPAKQQRL 176


>gnl|CDD|214744 smart00611, SEC63, Domain of unknown function in Sec63p, Brr2p and
           other proteins. 
          Length = 312

 Score = 29.9 bits (68), Expect = 1.6
 Identities = 20/77 (25%), Positives = 36/77 (46%), Gaps = 11/77 (14%)

Query: 212 TIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDL 271
            +++ QM  QA    ++   Q   L  EI   ++ EKK   S+E L +LE E        
Sbjct: 135 ALNLSQMIIQALWPTDSPLLQLPHLPEEIL--KRLEKKKVLSLEDLLELEDE-------- 184

Query: 272 MEAQEMIRRLEEQLKQL 288
            E  E++  L+ + +++
Sbjct: 185 -ERGELLGLLDAEGERV 200


>gnl|CDD|201885 pfam01608, I_LWEQ, I/LWEQ domain.  I/LWEQ domains bind to actin. It
           has been shown that the I/LWEQ domains from mouse talin
           and yeast Sla2p interact with F-actin. I/LWEQ domains
           can be placed into four major groups based on sequence
           similarity: (1) Metazoan talin; (2) Dictyostelium
           TalA/TalB and SLA110; (3) metazoan Hip1p; and (4) yeast
           Sla2p. The domain has four conserved blocks, the name of
           the domain is derived from the initial conserved amino
           acid of each of the four blocks.
          Length = 194

 Score = 29.5 bits (67), Expect = 1.7
 Identities = 17/66 (25%), Positives = 32/66 (48%), Gaps = 6/66 (9%)

Query: 241 AAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEEL-EARQ 299
           A ++  EK+  +    L +L +   K  +  ME Q  I +LE    +L+ A++ L E R+
Sbjct: 134 AVKQGLEKEEDQEDVDLSKLTLHEGKTQE--MEQQVEILKLE---NELEEARKRLGEIRK 188

Query: 300 TELQLE 305
              + +
Sbjct: 189 AHYRDD 194


>gnl|CDD|183211 PRK11578, PRK11578, macrolide transporter subunit MacA;
           Provisional.
          Length = 370

 Score = 30.1 bits (68), Expect = 1.7
 Identities = 22/71 (30%), Positives = 36/71 (50%), Gaps = 11/71 (15%)

Query: 256 RLKQLEVEMA---KRDQ-----DLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
           +LK L V +    K+DQ     D  +A+  I+ +E  L +L+A +++ EA   EL+L   
Sbjct: 71  QLKTLSVAIGDKVKKDQLLGVIDPEQAENQIKEVEATLMELRAQRQQAEA---ELKLARV 127

Query: 308 PIRNQSNLNKV 318
            +  Q  L K 
Sbjct: 128 TLSRQQRLAKT 138


>gnl|CDD|226844 COG4420, COG4420, Predicted membrane protein [Function unknown].
          Length = 191

 Score = 29.3 bits (66), Expect = 1.8
 Identities = 16/83 (19%), Positives = 41/83 (49%), Gaps = 4/83 (4%)

Query: 218 MKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEM 277
           + +Q R+ +  +   R +L  ++    KAE++     E+L +L +++     +L + +E+
Sbjct: 108 LMSQNRQAERDRL--RAELDYQVN--LKAEQEVAALHEKLDELRLDLGYVRDELDDLREL 163

Query: 278 IRRLEEQLKQLQAAKEELEARQT 300
           +  +E +L   +A +  +   Q 
Sbjct: 164 LAEIEPELADEEALRRAVRGGQH 186


>gnl|CDD|133382 cd01107, HTH_BmrR, Helix-Turn-Helix DNA binding domain of the BmrR
           transcription regulator.  Helix-turn-helix (HTH)
           multidrug-efflux transporter transcription regulator,
           BmrR and YdfL of Bacillus subtilis, and related
           proteins; N-terminal domain. Bmr is a membrane protein
           which causes the efflux of a variety of toxic substances
           and antibiotics. BmrR is comprised of two distinct
           domains that harbor a regulatory (effector-binding) site
           and an active (DNA-binding) site. The conserved
           N-terminal domain contains a winged HTH motif  that
           mediates DNA binding, while the C-terminal domain binds
           coactivating, toxic compounds. BmrR shares the
           N-terminal DNA binding domain with other transcription
           regulators of the MerR superfamily that promote
           transcription by reconfiguring the spacer between the
           -35 and -10 promoter elements.
          Length = 108

 Score = 28.3 bits (64), Expect = 1.8
 Identities = 16/68 (23%), Positives = 32/68 (47%), Gaps = 5/68 (7%)

Query: 237 QLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEA--QEMIRRLEEQLKQLQAAKEE 294
           QLE   R K  +     +E +K++   +   + D +    +E +  LE ++++LQ     
Sbjct: 44  QLERLNRIKYLRDLGFPLEEIKEI---LDADNDDELRKLLREKLAELEAEIEELQRILRL 100

Query: 295 LEARQTEL 302
           LE R  ++
Sbjct: 101 LEDRLKQI 108


>gnl|CDD|206243 pfam14073, Cep57_CLD, Centrosome localisation domain of Cep57.  The
           CLD or centrosome localisation domain of Cep57 is found
           at the N-terminus, and lies approximately between
           residues 58 and 239. This region lies within the first
           alpha-helical coiled-coil segment of Cep57, and
           localises to the centrosome internally to gamma-tubulin,
           suggesting that it is either on both centrioles or on a
           centromatrix component. This N-terminal region can also
           multimerise with the N-terminus of other Cep57
           molecules. The C-terminal part, Family Cep57_MT_bd,
           pfam06657, is the microtubule-binding region of Cep57.
          Length = 178

 Score = 29.4 bits (66), Expect = 1.8
 Identities = 29/104 (27%), Positives = 53/104 (50%), Gaps = 13/104 (12%)

Query: 204 YMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVE 263
           YMRR          M   A +E+ A  +Q+  LQ E   +E  +   Q  +E+L+ LE E
Sbjct: 82  YMRR----------MVENAEKERTAVLEQQVSLQRE---KEHDQMDVQAQLEKLEVLEKE 128

Query: 264 MAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
             +  +    A+  I++LEE+L++ +  ++ ++ +  ELQ  L+
Sbjct: 129 YLRLTRTQSLAETKIQQLEEKLQEEEHQRKLVQDKAAELQTGLE 172


>gnl|CDD|238248 cd00435, ACBP, Acyl CoA binding protein (ACBP) binds thiol esters
          of long fatty acids and coenzyme A in a one-to-one
          binding mode with high specificity and affinity.
          Acyl-CoAs are important intermediates in fatty lipid
          synthesis and fatty acid degradation and play a role in
          regulation of intermediary metabolism and gene
          regulation. The suggested role of ACBP is to act as a
          intracellular acyl-CoA transporter and pool former.
          ACBPs are present in a large group of eukaryotic
          species and several tissue-specific isoforms have been
          detected.
          Length = 85

 Score = 28.1 bits (63), Expect = 1.9
 Identities = 10/23 (43%), Positives = 14/23 (60%)

Query: 27 WKEHKGMLREDAMMEYLKIAQDL 49
          W   KGM +EDAM  Y+   ++L
Sbjct: 58 WNSLKGMSKEDAMKAYIAKVEEL 80


>gnl|CDD|153280 cd07596, BAR_SNX, The Bin/Amphiphysin/Rvs (BAR) domain of Sorting
           Nexins.  BAR domains are dimerization, lipid binding and
           curvature sensing modules found in many different
           proteins with diverse functions. Sorting nexins (SNXs)
           are Phox homology (PX) domain containing proteins that
           are involved in regulating membrane traffic and protein
           sorting in the endosomal system. SNXs differ from each
           other in their lipid-binding specificity, subcellular
           localization and specific function in the endocytic
           pathway. A subset of SNXs also contain BAR domains. The
           PX-BAR structural unit determines the specific membrane
           targeting of SNXs. BAR domains form dimers that bind to
           membranes, induce membrane bending and curvature, and
           may also be involved in protein-protein interactions.
          Length = 218

 Score = 29.6 bits (67), Expect = 1.9
 Identities = 16/36 (44%), Positives = 23/36 (63%)

Query: 267 RDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTEL 302
            DQ+  EA++ I +LEEQLK+L    + L  R+ EL
Sbjct: 2   EDQEFEEAKDYILKLEEQLKKLSKQAQRLVKRRREL 37



 Score = 27.3 bits (61), Expect = 9.5
 Identities = 19/72 (26%), Positives = 32/72 (44%), Gaps = 8/72 (11%)

Query: 236 LQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEEL 295
           L L+   ++ A KK Q  +E+LK        +       +E+   LEE    L+ A++  
Sbjct: 117 LTLQSLKKDLASKKAQ--LEKLKAAPGIKPAK------VEELEEELEEAESALEEARKRY 168

Query: 296 EARQTELQLELQ 307
           E     L+ EL+
Sbjct: 169 EEISERLKEELK 180


>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 257 to 277 amino acids in length. This domain is
           found associated with pfam00004. This domain has a
           conserved LER sequence motif.
          Length = 276

 Score = 29.7 bits (67), Expect = 1.9
 Identities = 20/96 (20%), Positives = 45/96 (46%), Gaps = 8/96 (8%)

Query: 222 AREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRL 281
           A++     + Q    Q E+ A+ K  +  Q       Q ++E A+ + +  E ++ ++  
Sbjct: 50  AKKAFELSKMQEKTRQAELEAKIKEYEAQQA------QAKLERARVEAE--ERRKTLQEQ 101

Query: 282 EEQLKQLQAAKEELEARQTELQLELQPIRNQSNLNK 317
            +Q +Q    ++EL  ++ + +LE Q  +N+  L  
Sbjct: 102 TQQEQQRAQYQDELARKRYQKELEQQRRQNEELLKM 137


>gnl|CDD|218797 pfam05890, Ebp2, Eukaryotic rRNA processing protein EBP2.  This
           family consists of several Eukaryotic rRNA processing
           protein EBP2 sequences. Ebp2p is required for the
           maturation of 25S rRNA and 60S subunit assembly. Ebp2p
           may be one of the target proteins of Rrs1p for executing
           the signal to regulate ribosome biogenesis. This family
           also plays a role in chromosome segregation.
          Length = 270

 Score = 29.6 bits (67), Expect = 1.9
 Identities = 24/94 (25%), Positives = 37/94 (39%), Gaps = 28/94 (29%)

Query: 234 DKLQLEIAAREKAEKKHQESVERLKQLEV----------EMAKRDQDL------------ 271
           D  + E+A  ++A     E+ + LK+L V          EM K D+ +            
Sbjct: 70  DDFKRELAFYKQALAAVLEARKLLKKLGVPFKRPDDYFAEMVKSDEHMQKVRKKLLAEAA 129

Query: 272 -MEAQEMIRRLEEQL---KQLQAAKEELEARQTE 301
             +A E  R+  E     KQ+Q  K  L+ R  E
Sbjct: 130 AKKASEEARKQRELKKFGKQVQVEK--LQERAKE 161


>gnl|CDD|177954 PLN02320, PLN02320, seryl-tRNA synthetase.
          Length = 502

 Score = 29.9 bits (67), Expect = 2.0
 Identities = 23/67 (34%), Positives = 35/67 (52%), Gaps = 3/67 (4%)

Query: 251 QESVERLKQLEVEMAKRDQDLMEAQEMIRRLEE--QLKQ-LQAAKEELEARQTELQLELQ 307
           Q+ VERL+     +A + +  +E  E    +EE   LK+ L   +E+L     ELQLE Q
Sbjct: 106 QKEVERLRAERNAVANKMKGKLEPSERQALVEEGKNLKEGLVTLEEDLVKLTDELQLEAQ 165

Query: 308 PIRNQSN 314
            I N ++
Sbjct: 166 SIPNMTH 172


>gnl|CDD|219176 pfam06785, UPF0242, Uncharacterized protein family (UPF0242). 
          Length = 401

 Score = 29.9 bits (67), Expect = 2.1
 Identities = 23/103 (22%), Positives = 40/103 (38%), Gaps = 22/103 (21%)

Query: 221 QAREEKNAKQQQRDKLQL-------------EIAAREKAEKKHQES-VERLK-------- 258
             RE    +Q++ D+L+L              +  + K   +  E     LK        
Sbjct: 89  NIRESIEERQKESDQLKLQNQKLVNQLGHARGVFIKTKGRYQKLEELSRHLKEENQCLQI 148

Query: 259 QLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTE 301
           QL+  + + ++ + E QE+ R L E L   Q   +E +A   E
Sbjct: 149 QLDALVQECNEKIEENQELNRELAETLAYQQELNDEYQATFVE 191


>gnl|CDD|222255 pfam13600, DUF4140, N-terminal domain of unknown function
           (DUF4140).  This family is often found at the N-terminus
           of its member proteins, with DUF4139, pfam13598, at the
           C-terminus.
          Length = 101

 Score = 28.0 bits (63), Expect = 2.1
 Identities = 11/48 (22%), Positives = 22/48 (45%)

Query: 260 LEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
           L V + +           +  LEE+L+ L+     LEA +  L+ +++
Sbjct: 51  LSVSVRRDYLPDEATSPALAELEEELRALEDELAALEAEKEALEAQIK 98



 Score = 27.3 bits (61), Expect = 4.4
 Identities = 15/43 (34%), Positives = 22/43 (51%)

Query: 261 EVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQ 303
            +        L E +E +R LE++L  L+A KE LEA+   L 
Sbjct: 59  YLPDEATSPALAELEEELRALEDELAALEAEKEALEAQIKFLS 101



 Score = 26.5 bits (59), Expect = 7.3
 Identities = 7/35 (20%), Positives = 12/35 (34%)

Query: 281 LEEQLKQLQAAKEELEARQTELQLELQPIRNQSNL 315
               L +L+     LE     L+ E + +  Q   
Sbjct: 65  TSPALAELEEELRALEDELAALEAEKEALEAQIKF 99


>gnl|CDD|218703 pfam05700, BCAS2, Breast carcinoma amplified sequence 2 (BCAS2).
           This family consists of several eukaryotic sequences of
           unknown function. The mammalian members of this family
           are annotated as breast carcinoma amplified sequence 2
           (BCAS2) proteins. BCAS2 is a putative spliceosome
           associated protein.
          Length = 221

 Score = 29.4 bits (66), Expect = 2.1
 Identities = 18/68 (26%), Positives = 34/68 (50%), Gaps = 1/68 (1%)

Query: 230 QQQRDKLQLEIAAREKAEKKHQESV-ERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQL 288
           +++  +L+ EI    +  K  QE   E+L+ LE          +E +     LE++++QL
Sbjct: 149 EKELAELKEEIEEVNRQRKYSQEEAGEKLRSLEQRWVDLVSKNLEIEVACAELEQEIEQL 208

Query: 289 QAAKEELE 296
           +  K EL+
Sbjct: 209 KRKKAELK 216


>gnl|CDD|147845 pfam05914, RIB43A, RIB43A.  This family consists of several
           RIB43A-like eukaryotic proteins. Ciliary and flagellar
           microtubules contain a specialised set of
           protofilaments, termed ribbons, that are composed of
           tubulin and several associated proteins. RIB43A was
           first characterized in the unicellular biflagellate,
           Chlamydomonas reinhardtii although highly related
           sequences are present in several higher eukaryotes
           including humans. The function of this protein is
           unknown although the structure of RIB43A and its
           association with the specialised protofilament ribbons
           and with basal bodies is relevant to the proposed role
           of ribbons in forming and stabilising doublet and
           triplet microtubules and in organising their
           three-dimensional structure. Human RIB43A homologues
           could represent a structural requirement in centriole
           replication in dividing cells.
          Length = 379

 Score = 29.6 bits (67), Expect = 2.2
 Identities = 20/80 (25%), Positives = 33/80 (41%), Gaps = 3/80 (3%)

Query: 240 IAAREKAEKKHQESVERLKQLEVEMAKR-DQDLMEAQEMIRRLEEQLKQLQAAKEELEAR 298
            A R+  E++ QE  ER ++ E    K  D+  +        LE Q ++L+    + +  
Sbjct: 283 AAIRKGQEQQLQEK-ERRREEEQLREKEWDRQAINQARAAVLLERQERRLRKELRK-QLD 340

Query: 299 QTELQLELQPIRNQSNLNKV 318
               QL  +    Q  LNK 
Sbjct: 341 SANQQLAKEQKAQQEYLNKE 360



 Score = 29.2 bits (66), Expect = 2.9
 Identities = 20/94 (21%), Positives = 44/94 (46%), Gaps = 5/94 (5%)

Query: 221 QAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRR 280
           Q ++ +  +QQ +++ Q + AA ++A+  H +    L     E+A  +++    + + + 
Sbjct: 157 QEQQREWLEQQVQERQQAK-AAEKQADTLHDQLRVELDHRAQELASLEEEC--RRAVEQA 213

Query: 281 LEEQLKQL--QAAKEELEARQTELQLELQPIRNQ 312
             +  K L  +  + E   +Q E +  L  I NQ
Sbjct: 214 TADFNKALAEEQRERERREKQEEQEDNLAEIYNQ 247


>gnl|CDD|234252 TIGR03545, TIGR03545, TIGR03545 family protein.  This model
           represents a relatively rare but broadly distributed
           uncharacterized protein family, distributed in 1-2
           percent of bacterial genomes, all of which have outer
           membranes. In many of these genomes, it is part of a
           two-gene pair.
          Length = 555

 Score = 29.7 bits (67), Expect = 2.2
 Identities = 18/87 (20%), Positives = 43/87 (49%), Gaps = 6/87 (6%)

Query: 233 RDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQ----LKQL 288
            + L+  +   E+ EK  +   ++ K+ + ++    QDL E ++ +  ++++      +L
Sbjct: 157 GEDLK-TVETAEEIEKSLKAMQQKWKKRKKDLPN-KQDLEEYKKRLEAIKKKDIKNPLEL 214

Query: 289 QAAKEELEARQTELQLELQPIRNQSNL 315
           Q  KEE +  + E + + Q I++  N 
Sbjct: 215 QKIKEEFDKLKKEGKADKQKIKSAKND 241


>gnl|CDD|215101 PLN00203, PLN00203, glutamyl-tRNA reductase.
          Length = 519

 Score = 29.7 bits (67), Expect = 2.2
 Identities = 21/67 (31%), Positives = 34/67 (50%), Gaps = 7/67 (10%)

Query: 254 VERLKQL-EVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQT--ELQLELQPIR 310
           V+ LK++       R +  MEAQ +IR   E+ K  +A ++ LE   T  +L+   + IR
Sbjct: 389 VDDLKEVVAANKEDRLRKAMEAQTIIR---EESKNFEAWRDSLETVPTIKKLRSYAERIR 445

Query: 311 NQSNLNK 317
             + L K
Sbjct: 446 -AAELEK 451


>gnl|CDD|179877 PRK04778, PRK04778, septation ring formation regulator EzrA;
           Provisional.
          Length = 569

 Score = 29.8 bits (68), Expect = 2.3
 Identities = 26/104 (25%), Positives = 50/104 (48%), Gaps = 13/104 (12%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESV------ERLKQLEVEMAKR 267
           ++Q++     EEKN ++ ++ K       +     +           ++L+ LE E ++ 
Sbjct: 127 ELQELLES--EEKNREEVEQLKDLYRELRKSLLANRFSFGPALDELEKQLENLEEEFSQF 184

Query: 268 DQ-----DLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLEL 306
            +     D +EA+E++ +LEE+L  L+   EE+     ELQ EL
Sbjct: 185 VELTESGDYVEAREILDQLEEELAALEQIMEEIPELLKELQTEL 228



 Score = 29.8 bits (68), Expect = 2.3
 Identities = 20/94 (21%), Positives = 46/94 (48%), Gaps = 17/94 (18%)

Query: 239 EIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQA-------- 290
           E+ +  + EK+ +   ++  ++   +A+++    E QE +  + +QL++++         
Sbjct: 346 ELESVRQLEKQLESLEKQYDEITERIAEQEIAYSELQEELEEILKQLEEIEKEQEKLSEM 405

Query: 291 ----AKEELEARQT--ELQLELQPIR---NQSNL 315
                K+ELEAR+     + +L  I+    +SNL
Sbjct: 406 LQGLRKDELEAREKLERYRNKLHEIKRYLEKSNL 439


>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572).  Family of
           eukaryotic proteins with undetermined function.
          Length = 321

 Score = 29.3 bits (66), Expect = 2.3
 Identities = 20/66 (30%), Positives = 36/66 (54%), Gaps = 5/66 (7%)

Query: 234 DKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKE 293
           DKL  E   R + E++ + + + +K+LE     R  D     E++ RLEE LK+LQ+ + 
Sbjct: 111 DKLDEEQEERVEKEREEELAGDAMKKLE----NRTADSKREMEVLERLEE-LKELQSRRA 165

Query: 294 ELEARQ 299
           +++   
Sbjct: 166 DVDVNS 171


>gnl|CDD|218532 pfam05276, SH3BP5, SH3 domain-binding protein 5 (SH3BP5).  This
           family consists of several eukaryotic SH3 domain-binding
           protein 5 or c-Jun N-terminal kinase (JNK)-interacting
           proteins (SH3BP5 or Sab). Sab binds to and serves as a
           substrate for JNK in vitro, and has been found to
           interact with the Src homology 3 (SH3) domain of
           Bruton's tyrosine kinase (Btk). Inspection of the
           sequence of Sab reveals the presence of two putative
           mitogen-activated protein kinase interaction motifs
           (KIMs) similar to that found in the JNK docking domain
           of the c-Jun transcription factor, and four potential
           serine-proline JNK phosphorylation sites in the
           C-terminal half of the molecule.
          Length = 240

 Score = 29.4 bits (66), Expect = 2.4
 Identities = 14/65 (21%), Positives = 26/65 (40%)

Query: 243 REKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTEL 302
           R +AEK+H      L   E+ + + ++ L  A    R   E  +      E  +    EL
Sbjct: 137 RTRAEKEHASEEAELLVAELRLRQLEKILKRAINKSRPYFELKQVYTQQLEIQKLLVQEL 196

Query: 303 QLELQ 307
           + ++ 
Sbjct: 197 EAKVA 201


>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
          Length = 562

 Score = 29.6 bits (67), Expect = 2.4
 Identities = 19/94 (20%), Positives = 43/94 (45%), Gaps = 4/94 (4%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
           +      + +++    Q   +KL   I   E+   +  E  ++L +L+ +++   Q L+ 
Sbjct: 296 EGPDRITKIKDKLKELQHSLEKLDTAIDELEEIMDEFNEQSKKLLELKNKISTNKQSLIT 355

Query: 274 AQEMIRRLEEQLKQLQAA----KEELEARQTELQ 303
             +  ++++  +++LQA      EEL   Q EL 
Sbjct: 356 LVDKAKKVKAAIEELQAEFVDNAEELAKLQDELD 389



 Score = 29.2 bits (66), Expect = 3.4
 Identities = 25/109 (22%), Positives = 47/109 (43%), Gaps = 19/109 (17%)

Query: 207 RRK--PDTIDVQ---QM----KAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERL 257
           RRK   D +D+    +M    K + RE      QQ   L ++I   ++  K + +++E  
Sbjct: 152 RRKLVEDLLDISVLSEMDKLNKDKIRE----LNQQIQTLDMKIDHIQQQIKTYNKNIEEQ 207

Query: 258 KQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLEL 306
           +       K  +++   Q     L E+ K ++A  EEL      L +++
Sbjct: 208 R------KKNGENIARKQNKYDELVEEAKTIKAEIEELTDELLNLVMDI 250



 Score = 27.7 bits (62), Expect = 8.7
 Identities = 19/119 (15%), Positives = 45/119 (37%), Gaps = 20/119 (16%)

Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKR-------- 267
           Q    +  EE    + + ++L  E+       +    ++ +L     ++  +        
Sbjct: 219 QNKYDELVEEAKTIKAEIEELTDELLNLVMDIEDPSAALNKLNTAAAKIKSKIEQFQKVI 278

Query: 268 ------------DQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSN 314
                        Q + E  + I +++++LK+LQ + E+L+    EL+  +     QS 
Sbjct: 279 KMYEKGGVCPTCTQQISEGPDRITKIKDKLKELQHSLEKLDTAIDELEEIMDEFNEQSK 337


>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like).  This family
           includes outer membrane proteins such as OmpH among
           others. Skp (OmpH) has been characterized as a molecular
           chaperone that interacts with unfolded proteins as they
           emerge in the periplasm from the Sec translocation
           machinery.
          Length = 140

 Score = 28.7 bits (65), Expect = 2.5
 Identities = 22/99 (22%), Positives = 52/99 (52%), Gaps = 7/99 (7%)

Query: 213 IDVQQMKAQAREEKNAKQQ---QRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQ 269
           +DVQ++  ++   K A++Q   +  K Q E+   EK  +K +E +++      E A+  +
Sbjct: 4   VDVQKILQESPAGKAAQKQLEKEFKKRQAELEKLEKELQKLKEKLQKDAATLSEAAREKK 63

Query: 270 DLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQP 308
                +E+ ++++E  ++ Q  +++L+ RQ E   ++  
Sbjct: 64  ----EKELQKKVQEFQRKQQKLQQDLQKRQQEELQKILD 98


>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
           (CALCOCO1) like.  Proteins found in this family are
           similar to the coiled-coil transcriptional coactivator
           protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
           This protein binds to a highly conserved N-terminal
           domain of p160 coactivators, such as GRIP1, and thus
           enhances transcriptional activation by a number of
           nuclear receptors. CALCOCO1 has a central coiled-coil
           region with three leucine zipper motifs, which is
           required for its interaction with GRIP1 and may regulate
           the autonomous transcriptional activation activity of
           the C-terminal region.
          Length = 546

 Score = 29.6 bits (66), Expect = 2.5
 Identities = 28/99 (28%), Positives = 47/99 (47%), Gaps = 13/99 (13%)

Query: 215 VQQMKAQAREEKNAKQQ---QRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKR---- 267
           +Q++ A+  + +   Q+   QR+KL++E+   EK   + Q S  R +  E+  A R    
Sbjct: 373 IQKLSAELLKLEEWLQEERSQREKLEVELG-IEKDCNRVQLSENRRELSELRSALRVLQK 431

Query: 268 --DQDLMEAQEM---IRRLEEQLKQLQAAKEELEARQTE 301
             +Q   E QE+   IR LE +L +    K   +A   E
Sbjct: 432 EKEQLQEEKQELLDYIRVLELRLDKEADEKWAEDAATCE 470



 Score = 27.6 bits (61), Expect = 9.3
 Identities = 28/162 (17%), Positives = 64/162 (39%), Gaps = 28/162 (17%)

Query: 179 DFVFFAPRVRVNKRILALCMGNHELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKL-- 236
           D +   P+  + +  L  C       M++      DV  M+++  + + A +   +K   
Sbjct: 130 DLLLVVPKATILQNQLEGCQRERNDLMKKFLALEGDVSDMRSRIEQLETALRHSTEKTEE 189

Query: 237 ------------QLEIAAREKAEKKHQESVERLKQLEVEMA-----KRDQDLM------- 272
                       +   A R     +  E+ +R+ +LE ++      K++ D +       
Sbjct: 190 LEEQHKEAQSSSESMSAERNALLAQRAENQQRILELEQDIQTLTQKKQENDRVLEGTQDI 249

Query: 273 --EAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
             E + M   L+++LK++   + + E  + +LQLE + +   
Sbjct: 250 EAELERMKGELKQRLKKMTIQRRDEETERIDLQLENEQLHED 291


>gnl|CDD|111811 pfam02963, EcoRI, Restriction endonuclease EcoRI. 
          Length = 257

 Score = 29.1 bits (65), Expect = 2.6
 Identities = 14/44 (31%), Positives = 25/44 (56%), Gaps = 1/44 (2%)

Query: 246 AEKKHQ-ESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQL 288
           AE KHQ + +E ++   +   ++DQDLM A   I R  + + ++
Sbjct: 94  AEAKHQGKDIENIRNGLLVGKRKDQDLMAAGNAIERSHKNISEI 137


>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein.  This family consists
           of several eukaryotic mitotic checkpoint (Mitotic arrest
           deficient or MAD) proteins. The mitotic spindle
           checkpoint monitors proper attachment of the bipolar
           spindle to the kinetochores of aligned sister chromatids
           and causes a cell cycle arrest in prometaphase when
           failures occur. Multiple components of the mitotic
           spindle checkpoint have been identified in yeast and
           higher eukaryotes. In S.cerevisiae, the existence of a
           Mad1-dependent complex containing Mad2, Mad3, Bub3 and
           Cdc20 has been demonstrated.
          Length = 722

 Score = 29.5 bits (66), Expect = 2.6
 Identities = 25/103 (24%), Positives = 42/103 (40%), Gaps = 5/103 (4%)

Query: 220 AQAREEKNAKQQQRD----KLQLEIAAREKAEKKHQESVE-RLKQLEVEMAKRDQDLMEA 274
           A+A EE    + + D    KLQ E   + K  K+    ++  L +++      D +L   
Sbjct: 131 AEAEEEAKLLKDKLDAESLKLQNEKEDQLKEAKESISRIKNDLSEMQCRAQNADTELKLL 190

Query: 275 QEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNLNK 317
           +  +  L EQL++ Q    E E +   L  E     + S   K
Sbjct: 191 ESELEELREQLEECQKELAEAEKKLQSLTSEQASSADNSVKIK 233



 Score = 28.4 bits (63), Expect = 6.8
 Identities = 24/109 (22%), Positives = 46/109 (42%), Gaps = 1/109 (0%)

Query: 200 NHELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQ 259
           N+E  + R     + ++ ++   ++ +N   +  ++ +L     +    K Q   E   +
Sbjct: 101 NYERELDRNLELEVRLKALEELEKKAENEAAEAEEEAKLLKDKLDAESLKLQNEKEDQLK 160

Query: 260 LEVEMAKR-DQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
              E   R   DL E Q   +  + +LK L++  EEL  +  E Q EL 
Sbjct: 161 EAKESISRIKNDLSEMQCRAQNADTELKLLESELEELREQLEECQKELA 209


>gnl|CDD|204194 pfam09304, Cortex-I_coil, Cortexillin I, coiled coil.  Members of
           this family are predominantly found in the
           actin-bundling protein Cortexillin I from Dictyostelium
           discoideum. They adopt a structure consisting of an
           18-heptad-repeat alpha-helical coiled-coil, and are a
           prerequisite for the assembly of Cortexillin I.
          Length = 107

 Score = 28.0 bits (62), Expect = 2.6
 Identities = 26/90 (28%), Positives = 47/90 (52%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
           +  +++A   E  N      + L+ E A++E+  K+  +    L  LE + A+R+Q L E
Sbjct: 3   EKARLEASKNELANKLAGLENSLESEKASQEELAKQKDQLRSLLASLEAQKAEREQRLRE 62

Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQTELQ 303
            Q  I  L + L+  + AK ELE+R ++ +
Sbjct: 63  LQAKIDELLKNLELEKLAKMELESRLSKTE 92


>gnl|CDD|145995 pfam03148, Tektin, Tektin family.  Tektins are cytoskeletal
           proteins. They have been demonstrated in such cellular
           sites as centrioles, basal bodies, and along ciliary and
           flagellar doublet microtubules. Tektins form unique
           protofilaments, organised as longitudinal polymers of
           tektin heterodimers with axial periodicity matching
           tubulin. Tektin polypeptides consist of several
           alpha-helical regions that are predicted to form coiled
           coils. Indeed, tektins share considerable structural
           similarities with intermediate filament proteins.
           Possible functional roles for tektins are: stabilisation
           of tubulin protofilaments; attachment of A and B-tubules
           in ciliary/flagellar microtubule doublets and C-tubules
           in centrioles; binding of axonemal components.
          Length = 384

 Score = 29.5 bits (67), Expect = 2.7
 Identities = 23/109 (21%), Positives = 47/109 (43%), Gaps = 24/109 (22%)

Query: 223 REEKNAKQQ---QRDKLQLEIAAREKAEKKHQESV----ERLKQLEVEMAKRDQ------ 269
            E + AK +   Q  K   EIA +EK  +  ++++      LK  +  +  R Q      
Sbjct: 254 SETEEAKNKLEWQLKKTLEEIAQQEKNIEALEKAIRDKEGPLKVAQTRLENRTQRPNVEL 313

Query: 270 -----------DLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
                      ++ E ++ I  L+E+L + +A+ + LE  +  L+ ++ 
Sbjct: 314 CRDQAQYRLIDEVKELEDTIAALKEKLAEAEASLKALEETRARLEEDIA 362


>gnl|CDD|191382 pfam05837, CENP-H, Centromere protein H (CENP-H).  This family
           consists of several eukaryotic centromere protein H
           (CENP-H) sequences. Macromolecular
           centromere-kinetochore complex plays a critical role in
           sister chromatid separation, but its complete protein
           composition as well as its precise dynamic function
           during mitosis has not yet been clearly determined.
           CENP-H contains a coiled-coil structure and a nuclear
           localisation signal. CENP-H is specifically and
           constitutively localised in kinetochores throughout the
           cell cycle. CENP-H may play a role in kinetochore
           organisation and function throughout the cell cycle.
           This the C-terminus of the region, which is conserved
           from fungi to humans.
          Length = 106

 Score = 28.0 bits (63), Expect = 2.8
 Identities = 24/73 (32%), Positives = 41/73 (56%), Gaps = 5/73 (6%)

Query: 236 LQLEIAAREKAEKKHQESVERL--KQLEVEMAKRDQDLMEAQEMIRR---LEEQLKQLQA 290
           L L+ A+RE  E+      ERL  KQ  VE+A    +L + +E  R    L+EQL++L+A
Sbjct: 6   LNLQQASRETREELSDLEKERLQLKQKNVELALELLELTKKKESWREDMELKEQLEKLEA 65

Query: 291 AKEELEARQTELQ 303
             ++ +A+   ++
Sbjct: 66  DLKKSKAKWEVMK 78


>gnl|CDD|227250 COG4913, COG4913, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 1104

 Score = 29.6 bits (66), Expect = 2.8
 Identities = 24/102 (23%), Positives = 44/102 (43%), Gaps = 7/102 (6%)

Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAA--REKAEKKHQESVERLKQLEVEMAKRDQDLME 273
             +  +      AKQ     LQ   AA   + A ++H + V+  +       ++  D + 
Sbjct: 732 AMLSRKVHSI--AKQGMTGALQALGAAHFPQVAPEQHDDIVDIERIEHRRQLQKRIDAVN 789

Query: 274 AQEMIRRLEEQL-KQLQAAKEELEARQTELQLELQPIRNQSN 314
           A+  +RRL E++  ++  AK+E  A  +E+  EL  I     
Sbjct: 790 AR--LRRLREEIIGRMSDAKKEDTAALSEVGAELDDIPEYLA 829


>gnl|CDD|217835 pfam03999, MAP65_ASE1, Microtubule associated protein (MAP65/ASE1
           family). 
          Length = 619

 Score = 29.4 bits (66), Expect = 2.9
 Identities = 26/127 (20%), Positives = 54/127 (42%), Gaps = 18/127 (14%)

Query: 194 LALCMGNHELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAR-EKAEKKHQE 252
           L   +G   L + R+  D + +  +     EE    +++  +L+ E   R E+ +   Q 
Sbjct: 117 LCNELGEPPLSLLRKDADPLSLPNL-----EELEHFRERLGELREEKVRRLEEVDSLKQS 171

Query: 253 SVERLKQLEVEMAKRD--QDLME----------AQEMIRRLEEQLKQLQAAKEELEARQT 300
                  L    A+ D  QD++            +E I +L++ L+ L A K++   +  
Sbjct: 172 IKSLCSLLGTPPARTDFEQDVLSYGEIPNDHCVTRETIDKLDKMLENLAAQKKQRADKID 231

Query: 301 ELQLELQ 307
           +L+ ++Q
Sbjct: 232 DLREQIQ 238


>gnl|CDD|234342 TIGR03752, conj_TIGR03752, integrating conjugative element protein,
           PFL_4705 family.  Members of this protein family are
           found occasionally on plasmids such as the Pseudomonas
           putida toluene catabolic TOL plasmid pWWO_p085. Usually,
           however, they are found on the bacterial main chromosome
           in regions flanked by markers of conjugative transfer
           and/or transposition [Mobile and extrachromosomal
           element functions, Plasmid functions].
          Length = 472

 Score = 29.2 bits (66), Expect = 2.9
 Identities = 13/78 (16%), Positives = 39/78 (50%), Gaps = 2/78 (2%)

Query: 239 EIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEAR 298
            +A      +  +   ERL++ E  + ++ Q  ++++   + L ++++QL++ +++L+  
Sbjct: 74  RLAKLISENEALKAENERLQKREQSIDQQIQQAVQSET--QELTKEIEQLKSERQQLQGL 131

Query: 299 QTELQLELQPIRNQSNLN 316
             +LQ  L  +    +  
Sbjct: 132 IDQLQRRLAGVLTGPSGG 149


>gnl|CDD|222160 pfam13476, AAA_23, AAA domain. 
          Length = 204

 Score = 28.7 bits (64), Expect = 2.9
 Identities = 24/94 (25%), Positives = 49/94 (52%), Gaps = 2/94 (2%)

Query: 211 DTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAR-EKAEKKHQESVERLKQLEVEMAKRDQ 269
           + +++ +++    E   + ++    L      R E+ + K +E  ERL++LE E+ +  +
Sbjct: 112 EIVEIDELQEFIDELLKSDKEILPLLLYLGQERLEELKFKRKEKKERLEELEKEL-EELE 170

Query: 270 DLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQ 303
           D  +  E +   +E+ K+L+  KEELE    EL+
Sbjct: 171 DEKDLLEKLLEEKEKKKELEELKEELEELLEELE 204


>gnl|CDD|213395 cd12152, F1-ATPase_delta, mitochondrial ATP synthase delta subunit.
            The F-ATPase is found in bacterial plasma membranes,
           mitochondrial inner membranes and in chloroplast
           thylakoid membranes. It has also been found in the
           archaea Methanosarcina barkeri. It uses a proton
           gradient to drive ATP synthesis and hydrolyzes ATP to
           build the proton gradient. The extrinisic membrane
           domain, F1, is composed of alpha, beta, gamma, delta,
           and epsilon subunits with a stoichiometry of 3:3:1:1:1.
           Alpha and beta subunit form the globular catalytic
           moiety, a hexameric ring of alternating subunits. Gamma,
           delta and epsilon subunits form a stalk, connecting F1
           to F0, the integral membrane proton translocating
           domain. In bacteria, which is lacking a eukaryotic
           epsilon subunit homolog, this subunit is called the
           epsilon subunit.
          Length = 123

 Score = 28.2 bits (64), Expect = 2.9
 Identities = 15/36 (41%), Positives = 21/36 (58%), Gaps = 3/36 (8%)

Query: 270 DLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLE 305
           D+  A+E + R EE+L Q   AK+E E  + E  LE
Sbjct: 87  DVERAEEALERAEERLAQ---AKDEREKARAEAALE 119


>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family.  This family of proteins includes
           MND1 from S. cerevisiae. The mnd1 protein forms a
           complex with hop2 to promote homologous chromosome
           pairing and meiotic double-strand break repair.
          Length = 188

 Score = 28.7 bits (65), Expect = 2.9
 Identities = 17/70 (24%), Positives = 35/70 (50%), Gaps = 2/70 (2%)

Query: 229 KQQQRDKLQLEIAAREKAEKKHQESVERL-KQLEVEMAKRDQDLMEAQEMIRRLEEQLKQ 287
             Q  +KL+  +   +K  ++ ++ +  L  Q+E     R++   E  E++  L++  K+
Sbjct: 60  PSQALNKLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREETE-ERTELLEELKQLEKE 118

Query: 288 LQAAKEELEA 297
           L+  K ELE 
Sbjct: 119 LKKLKAELEK 128



 Score = 27.6 bits (62), Expect = 7.5
 Identities = 20/81 (24%), Positives = 39/81 (48%), Gaps = 15/81 (18%)

Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
            ++K +  + K   ++ + ++    A  EK +K  +E+ ER + LE    +  Q      
Sbjct: 65  NKLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLE----ELKQ------ 114

Query: 276 EMIRRLEEQLKQLQAAKEELE 296
                LE++LK+L+A  E+ E
Sbjct: 115 -----LEKELKKLKAELEKYE 130


>gnl|CDD|217011 pfam02388, FemAB, FemAB family.  The femAB operon codes for two
           nearly identical approximately 50-kDa proteins involved
           in the formation of the Staphylococcal pentaglycine
           interpeptide bridge in peptidoglycan. These proteins are
           also considered as a factor influencing the level of
           methicillin resistance.
          Length = 408

 Score = 29.2 bits (66), Expect = 2.9
 Identities = 15/51 (29%), Positives = 33/51 (64%), Gaps = 2/51 (3%)

Query: 230 QQQRDKLQLEIAAREKAEKKHQES--VERLKQLEVEMAKRDQDLMEAQEMI 278
           Q++ +KL+ ++A  E+  +K+  S    +L +LE ++A  ++ + EA+E+I
Sbjct: 248 QEKLEKLEKDLAKLEEKLEKNPNSKKKNKLAELEQQLASLEKRIDEAKELI 298


>gnl|CDD|222259 pfam13608, Potyvirid-P3, Protein P3 of Potyviral polyprotein.  This
           is the P3 protein section of the Potyviridae
           polyproteins. The function is not known except that the
           protein is essential to viral survival.
          Length = 445

 Score = 29.1 bits (66), Expect = 3.0
 Identities = 20/92 (21%), Positives = 36/92 (39%), Gaps = 10/92 (10%)

Query: 215 VQQMKAQAREEKNAKQQ-QRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
           +  +  + R+ K    + + +K + EI       K   +     + LE    K  + L  
Sbjct: 265 LVSIINEHRKLKREAAEEKEEKEEKEIRKLYLYSKLEGKLPTEEEFLEYIEEKNPELLET 324

Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQTELQLE 305
           A+E+I   EE+++  QA          E  LE
Sbjct: 325 AEELIE--EEEVEH-QA------KTADEKNLE 347


>gnl|CDD|234937 PRK01294, PRK01294, lipase chaperone; Provisional.
          Length = 336

 Score = 29.3 bits (66), Expect = 3.0
 Identities = 17/102 (16%), Positives = 38/102 (37%), Gaps = 16/102 (15%)

Query: 206 RRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEI----------AAREKAEKKHQESV- 254
               P  +D+Q ++         + +     +++            A E+       S+ 
Sbjct: 145 EDDGPGKLDLQALQQLLDARLALRARFFSDWEIQAFFGEENQYQRYALERLRIAQDPSLS 204

Query: 255 -----ERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAA 291
                 RL  LE ++ +  +  ++  +  + L +QL QLQA+
Sbjct: 205 DAQKAARLAALEAQLPEDLRAALQESQRQQALLQQLAQLQAS 246


>gnl|CDD|233467 TIGR01554, major_cap_HK97, phage major capsid protein, HK97 family.
            This model family represents the major capsid protein
           component of the heads (capsids) of bacteriophage HK97,
           phi-105, P27, and related phage. This model represents
           one of several analogous families lacking detectable
           sequence similarity. The gene encoding this component is
           typically located in an operon encoding the small and
           large terminase subunits, the portal protein and the
           prohead or maturation protease [Mobile and
           extrachromosomal element functions, Prophage functions].
          Length = 384

 Score = 29.2 bits (66), Expect = 3.1
 Identities = 16/65 (24%), Positives = 27/65 (41%), Gaps = 5/65 (7%)

Query: 255 ERLKQLEVEM--AKRDQDLMEA---QEMIRRLEEQLKQLQAAKEELEARQTELQLELQPI 309
           E L +L  E+     D+ L EA   +     L+E++ +L A  + LE    EL+ +    
Sbjct: 6   EALAELAKELRKLTEDEKLAEAEEEKAEYDALKEEIDKLDAEIDRLEELLDELEAKPAAS 65

Query: 310 RNQSN 314
                
Sbjct: 66  GEGGG 70


>gnl|CDD|219737 pfam08172, CASP_C, CASP C terminal.  This domain is the C-terminal
           region of the CASP family of proteins. It is a Golgi
           membrane protein which is thought to have a role in
           vesicle transport.
          Length = 245

 Score = 28.9 bits (65), Expect = 3.1
 Identities = 11/37 (29%), Positives = 20/37 (54%)

Query: 257 LKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKE 293
           L++L+ E++  +  L E QE+  +LE  L  +Q    
Sbjct: 1   LEELQKELSSLNAKLEEQQELNAKLENDLATVQDEIS 37


>gnl|CDD|220215 pfam09379, FERM_N, FERM N-terminal domain.  This domain is the
           N-terminal ubiquitin-like structural domain of the FERM
           domain.
          Length = 79

 Score = 27.2 bits (61), Expect = 3.1
 Identities = 8/19 (42%), Positives = 9/19 (47%)

Query: 91  QDVKKENPLQFKFRAKFYP 109
           Q    + P    FR KFYP
Sbjct: 61  QGKDGDPPFTLFFRVKFYP 79


>gnl|CDD|235179 PRK03947, PRK03947, prefoldin subunit alpha; Reviewed.
          Length = 140

 Score = 28.4 bits (64), Expect = 3.1
 Identities = 24/125 (19%), Positives = 54/125 (43%), Gaps = 25/125 (20%)

Query: 211 DTIDVQQMKAQAR---EEKNAKQQQRDKLQLEIAAREKAE------KKHQESVERL---- 257
              +++++ AQ +    +  A QQQ ++LQ  I   + A+      K   E  E L    
Sbjct: 4   SEQELEELAAQLQALQAQIEALQQQLEELQASINELDTAKETLEELKSKGEGKETLVPIG 63

Query: 258 --KQLEVEMAKRDQDLM----------EAQEMIRRLEEQLKQLQAAKEELEARQTELQLE 305
               ++ ++  +D+ ++          +  E I  L+++ ++L+ A E+LE    +L   
Sbjct: 64  AGSFVKAKVKDKDKVIVSLGAGYSAEKDLDEAIEILDKRKEELEKALEKLEEALQKLASR 123

Query: 306 LQPIR 310
           +  + 
Sbjct: 124 IAQLA 128


>gnl|CDD|114219 pfam05483, SCP-1, Synaptonemal complex protein 1 (SCP-1).
           Synaptonemal complex protein 1 (SCP-1) is the major
           component of the transverse filaments of the
           synaptonemal complex. Synaptonemal complexes are
           structures that are formed between homologous
           chromosomes during meiotic prophase.
          Length = 787

 Score = 29.3 bits (65), Expect = 3.2
 Identities = 26/103 (25%), Positives = 54/103 (52%), Gaps = 4/103 (3%)

Query: 209 KPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKH----QESVERLKQLEVEM 264
           K D   +Q ++ + ++E N K++Q   L ++I  +E   K      +ES ++  QLE + 
Sbjct: 218 KEDHEKIQHLEEEYKKEINDKEKQVSLLLIQITEKENKMKDLTFLLEESRDKANQLEEKT 277

Query: 265 AKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
             +D++L E  E    L ++L+ ++ + +   + Q  L+ +LQ
Sbjct: 278 KLQDENLKELIEKKDHLTKELEDIKMSLQRSMSTQKALEEDLQ 320


>gnl|CDD|221982 pfam13206, VSG_B, Trypanosomal VSG domain.  This family represents
           the B-type variant surface glycoproteins from
           trypanosomal parasites. This family is related to
           pfam00913.
          Length = 253

 Score = 29.0 bits (65), Expect = 3.2
 Identities = 7/30 (23%), Positives = 19/30 (63%)

Query: 254 VERLKQLEVEMAKRDQDLMEAQEMIRRLEE 283
           +++L++   E+ +R++   EA+ +  +LE 
Sbjct: 223 LKKLEEAATELEEREKAAQEAKALAAQLEA 252


>gnl|CDD|204831 pfam12137, RapA_C, RNA polymerase recycling family C-terminal.
           This domain is found in bacteria. This domain is about
           360 amino acids in length. This domain is found
           associated with pfam00271, pfam00176. The function of
           this domain is not known, but structurally it forms an
           alpha-beta fold in nature with a central beta-sheet
           flanked by helices and loops, the beta-sheet being
           mainly antiparallel and flanked by four alpha helices,
           among which the two longer helices exhibit a coiled-coil
           arrangement.
          Length = 362

 Score = 29.0 bits (66), Expect = 3.2
 Identities = 26/98 (26%), Positives = 46/98 (46%), Gaps = 20/98 (20%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQL-EVEMAKRDQDLM 272
            ++Q+  +A  EK A++Q     ++   A+++ +++    + RLK L  V    RD++  
Sbjct: 279 QIEQLLQKA--EKLAEEQLE---EIIEQAKQQMQQQLSAELARLKALKAVNPNIRDEE-- 331

Query: 273 EAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIR 310
                I  LE+Q       +EEL     + QL L  IR
Sbjct: 332 -----IEALEQQ-------REELLTALDKAQLRLDAIR 357


>gnl|CDD|222588 pfam14198, TnpV, Transposon-encoded protein TnpV.  This family of
           proteins is functionally uncharacterized. This family of
           proteins is found in bacteria and viruses. Proteins in
           this family are typically between 114 and 125 amino
           acids in length.
          Length = 111

 Score = 27.9 bits (63), Expect = 3.2
 Identities = 8/28 (28%), Positives = 17/28 (60%)

Query: 273 EAQEMIRRLEEQLKQLQAAKEELEARQT 300
           +A+E + R+ +Q+ + +   EEL+A   
Sbjct: 61  QAEERLERIVKQMAEKEGVTEELKAEDQ 88


>gnl|CDD|119001 pfam10481, Cenp-F_N, Cenp-F N-terminal domain.  Mitosin or
           centromere-associated protein-F (Cenp-F) is found bound
           across the centromere as one of the proteins of the
           outer layer of the kinetochore. Most of the
           kinetochore/centromere functions appear to depend upon
           binding of the C-terminal par to f the molecule, whereas
           the N-terminal part, here, may be a cytoplasmic player
           in controlling the function of microtubules and dynein.
          Length = 288

 Score = 29.0 bits (64), Expect = 3.3
 Identities = 21/89 (23%), Positives = 49/89 (55%), Gaps = 5/89 (5%)

Query: 215 VQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEA 274
           +Q+++ Q  + K  KQQ+    Q ++ + E A +K ++ VE  K     + + +Q LME 
Sbjct: 20  IQELEGQLDKLKKEKQQR----QFQLDSLEAALQKQKQKVENEKTEGANLKRENQRLMEI 75

Query: 275 QEMIRRLEEQLK-QLQAAKEELEARQTEL 302
            E + + ++++  +LQ  + ++  ++ +L
Sbjct: 76  CEHLEKTKQKISHELQVKESQVNFQEGQL 104


>gnl|CDD|223716 COG0643, CheA, Chemotaxis protein histidine kinase and related
           kinases [Cell motility and secretion / Signal
           transduction mechanisms].
          Length = 716

 Score = 29.2 bits (66), Expect = 3.3
 Identities = 27/107 (25%), Positives = 46/107 (42%), Gaps = 14/107 (13%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
           DV   +A+   EK+A +  +        A + A      S      + V++ K D+ +  
Sbjct: 286 DVLAAQAEPLAEKSAAEAAKLSALEAAPAAKAAAAAAGAS----SSIRVDVEKLDRLMNL 341

Query: 274 AQEMI---RRLEEQLKQLQAAKEELE--ARQ-----TELQLELQPIR 310
             E++    RL +   +L   +EEL+   RQ     T+LQ E+  IR
Sbjct: 342 VGELVITRSRLSQLASELDEVREELDEALRQLSRLTTDLQDEVMKIR 388


>gnl|CDD|150420 pfam09744, Jnk-SapK_ap_N, JNK_SAPK-associated protein-1.  This is
           the N-terminal 200 residues of a set of proteins
           conserved from yeasts to humans. Most of the proteins in
           this entry have an RhoGEF pfam00621 domain at their
           C-terminal end.
          Length = 158

 Score = 28.2 bits (63), Expect = 3.5
 Identities = 24/80 (30%), Positives = 40/80 (50%), Gaps = 3/80 (3%)

Query: 234 DKLQLEIA-AREKAEKKH--QESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQA 290
           D  +L     REK  +K   Q+ +E   +LE E  +  + + + +E +R+LE + K L  
Sbjct: 58  DNERLSTQYEREKELRKQAEQKLLEFEDELEQEKKELQKKIEDLEENVRQLELKAKNLSD 117

Query: 291 AKEELEARQTELQLELQPIR 310
               LE R+TEL+ E   + 
Sbjct: 118 QVSRLEERETELKKEYNSLH 137


>gnl|CDD|224455 COG1538, TolC, Outer membrane protein [Cell envelope biogenesis,
           outer membrane / Intracellular trafficking and
           secretion].
          Length = 457

 Score = 29.0 bits (65), Expect = 3.5
 Identities = 16/80 (20%), Positives = 34/80 (42%), Gaps = 2/80 (2%)

Query: 213 IDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQ-DL 271
              +Q    AR+E        +    ++ A  +A +  QE++E L +   +   R   D+
Sbjct: 352 AQYEQTVLTARQEVADALAALEAALEQLQALRQAVEAAQEALE-LARERYQAGVRTLLDV 410

Query: 272 MEAQEMIRRLEEQLKQLQAA 291
           ++AQ  + +  + L Q +  
Sbjct: 411 LDAQRTLLQARQALLQARYD 430


>gnl|CDD|221952 pfam13166, AAA_13, AAA domain.  This family of domains contain a
           P-loop motif that is characteristic of the AAA
           superfamily. Many of the proteins in this family are
           conjugative transfer proteins. This family includes the
           PrrC protein that is thought to be the active component
           of the anticodon nuclease.
          Length = 713

 Score = 29.2 bits (66), Expect = 3.5
 Identities = 18/75 (24%), Positives = 41/75 (54%), Gaps = 5/75 (6%)

Query: 226 KNAKQQQRDKLQLEIAAREKAE----KKHQESVER-LKQLEVEMAKRDQDLMEAQEMIRR 280
           K  K + + KL L + A  K +    +K ++ +E+ +  LE E+ + + ++   ++ I+ 
Sbjct: 388 KKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAEIKALEKEIKE 447

Query: 281 LEEQLKQLQAAKEEL 295
           LE+QL  ++   +E+
Sbjct: 448 LEKQLTNIEPTADEI 462


>gnl|CDD|224340 COG1422, COG1422, Predicted membrane protein [Function unknown].
          Length = 201

 Score = 28.5 bits (64), Expect = 3.6
 Identities = 16/48 (33%), Positives = 28/48 (58%), Gaps = 2/48 (4%)

Query: 255 ERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTEL 302
           E++K+L+  M +  ++  EAQE      ++LK+LQ  + E+   Q EL
Sbjct: 72  EKMKELQKMMKEFQKEFREAQE--SGDMKKLKKLQEKQMEMMDDQREL 117


>gnl|CDD|234701 PRK00252, alaS, alanyl-tRNA synthetase; Reviewed.
          Length = 865

 Score = 28.9 bits (66), Expect = 3.7
 Identities = 19/68 (27%), Positives = 31/68 (45%), Gaps = 17/68 (25%)

Query: 230 QQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQ 289
            +Q   L+ E+AA  KA+    E  ER++ L               E ++ LE++L+QL+
Sbjct: 697 NEQEALLK-ELAALLKAKPS--ELPERVEAL--------------LEELKELEKELEQLK 739

Query: 290 AAKEELEA 297
           A      A
Sbjct: 740 AKLAAAAA 747


>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
          Length = 663

 Score = 29.2 bits (65), Expect = 3.8
 Identities = 12/58 (20%), Positives = 22/58 (37%), Gaps = 5/58 (8%)

Query: 247 EKKHQESVERLKQLE-----VEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQ 299
            ++H    +R +  E     +E  K  + L E QE + +  E        K+  E+  
Sbjct: 459 PQEHGHERKRKRGGELKEELIETLKLVKKLKEEQESLAKELEATAHKSEIKKIAESEF 516


>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552).  This
           presumed domain is functionally uncharacterized. This
           domain is found in bacteria, archaea and eukaryotes.
           This domain is about 200 amino acids in length. This
           domain is found associated with pfam00013, pfam01966.
           This domain has a single completely conserved residue A
           that may be functionally important.
          Length = 201

 Score = 28.3 bits (64), Expect = 3.9
 Identities = 22/82 (26%), Positives = 37/82 (45%), Gaps = 3/82 (3%)

Query: 224 EEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEE 283
           E K    + R + + E+  R     + Q   +RL Q E  + ++D+ L + +E +   E+
Sbjct: 57  EAKEEIHKLRAEAERELKERRN---ELQRQEKRLLQKEETLDRKDESLEKKEESLEEKEK 113

Query: 284 QLKQLQAAKEELEARQTELQLE 305
           +L   Q   EE E    EL  E
Sbjct: 114 ELAARQQQLEEKEEELEELIEE 135



 Score = 27.5 bits (62), Expect = 7.8
 Identities = 23/88 (26%), Positives = 50/88 (56%), Gaps = 11/88 (12%)

Query: 208 RKPDTIDVQQMKAQAREEK-NAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAK 266
           RK ++++ ++   + +E++  A+QQQ ++ + E+   E+  ++ Q+ +ER+  L  E AK
Sbjct: 96  RKDESLEKKEESLEEKEKELAARQQQLEEKEEEL---EELIEEQQQELERISGLTAEEAK 152

Query: 267 -------RDQDLMEAQEMIRRLEEQLKQ 287
                    +   EA +MI+ +EE+ K+
Sbjct: 153 EILLEEVEAELKHEAAKMIKEIEEEAKE 180


>gnl|CDD|189762 pfam00901, Orbi_VP5, Orbivirus outer capsid protein VP5.
           cryoelectron microscopy indicates that VP5 is a trimer
           implying that there are 360 copies of VP5 per virion.
          Length = 507

 Score = 28.9 bits (65), Expect = 3.9
 Identities = 26/86 (30%), Positives = 46/86 (53%), Gaps = 6/86 (6%)

Query: 218 MKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERL-KQLEVEMAKRDQDLMEAQE 276
           MK +A+ E   +++Q + L+  + +  K  K+  +S++RL K L+ E  +R QD     +
Sbjct: 128 MKGEAKVE-EEEEKQMEILEKALKSYLKIVKEENKSLQRLAKALQKESEERTQD---ETK 183

Query: 277 MIRRLEEQLKQLQAAKE-ELEARQTE 301
           MI    +++  L+ A E E E  Q E
Sbjct: 184 MIEEYRDKIDALKNAIEVEKEGLQEE 209


>gnl|CDD|179699 PRK03992, PRK03992, proteasome-activating nucleotidase;
           Provisional.
          Length = 389

 Score = 28.6 bits (65), Expect = 4.0
 Identities = 13/38 (34%), Positives = 25/38 (65%)

Query: 275 QEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
           +E    LEEQ++QL+    +LEA   +L+ EL+ ++++
Sbjct: 7   EERNSELEEQIRQLELKLRDLEAENEKLERELERLKSE 44


>gnl|CDD|222426 pfam13870, DUF4201, Domain of unknown function (DUF4201).  This is
           a family of coiled-coil proteins from eukaryotes. The
           function is not known.
          Length = 177

 Score = 28.3 bits (64), Expect = 4.1
 Identities = 12/48 (25%), Positives = 30/48 (62%)

Query: 255 ERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTEL 302
           E+L  LE E+ +  ++L E +E + +L ++L +++  +++L  +  +L
Sbjct: 84  EKLHMLEEELERLKKELKEKEERLAKLRKELYKVKLERDKLRKQNKKL 131


>gnl|CDD|182330 PRK10246, PRK10246, exonuclease subunit SbcC; Provisional.
          Length = 1047

 Score = 29.0 bits (65), Expect = 4.1
 Identities = 23/93 (24%), Positives = 37/93 (39%), Gaps = 6/93 (6%)

Query: 215 VQQMKAQAREEKNAKQQQRDKLQLEIAARE-----KAEKKHQESVERLKQLEVEMAKRDQ 269
            QQ   QA   +   Q Q   L L   AR+     +  ++   ++   +Q   E+  R Q
Sbjct: 262 RQQALQQALAAEEKAQPQLAALSLAQPARQLRPHWERIQEQSAALAHTRQQIEEVNTRLQ 321

Query: 270 DLMEAQEMIRR-LEEQLKQLQAAKEELEARQTE 301
             M  +  IR    +Q  +LQA ++ L     E
Sbjct: 322 STMALRARIRHHAAKQSAELQAQQQSLNTWLAE 354



 Score = 29.0 bits (65), Expect = 4.5
 Identities = 15/77 (19%), Positives = 30/77 (38%), Gaps = 11/77 (14%)

Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
           Q+  AQ +       Q++ +    +    +  K+  + +  +K            + E +
Sbjct: 439 QKRLAQLQVAIQNVTQEQTQRNAALNEMRQRYKEKTQQLADVKT-----------ICEQE 487

Query: 276 EMIRRLEEQLKQLQAAK 292
             I+ LE Q  QLQA +
Sbjct: 488 ARIKDLEAQRAQLQAGQ 504


>gnl|CDD|182730 PRK10787, PRK10787, DNA-binding ATP-dependent protease La;
           Provisional.
          Length = 784

 Score = 28.8 bits (64), Expect = 4.2
 Identities = 23/80 (28%), Positives = 39/80 (48%), Gaps = 2/80 (2%)

Query: 234 DKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKE 293
           D LQ+E   R +  KK  E  +R   L  +M    ++L E  +     E   +++ AAK 
Sbjct: 205 DLLQVEKRIRNRV-KKQMEKSQREYYLNEQMKAIQKELGEMDDAPDENEALKRKIDAAKM 263

Query: 294 ELEARQTELQLELQPIRNQS 313
             EA++ + + ELQ ++  S
Sbjct: 264 PKEAKE-KAEAELQKLKMMS 282


>gnl|CDD|148363 pfam06717, DUF1202, Protein of unknown function (DUF1202).  This
           family consists of several hypothetical bacterial
           proteins of around 335 residues in length. Members of
           this family are found exclusively in Escherichia coli
           and Salmonella species and are often referred to as YggM
           proteins. The function of this family is unknown.
          Length = 308

 Score = 28.4 bits (63), Expect = 4.4
 Identities = 14/72 (19%), Positives = 40/72 (55%), Gaps = 1/72 (1%)

Query: 241 AAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQT 300
           A R+++E+ ++  +++ +  ++   +R Q  +++QE+ R+L++    L+  +  L  + +
Sbjct: 228 ACRKQSEECYEVPIQQKRDFDINEQRR-QTFLKSQELSRKLQDDWITLEKGQYPLTMKVS 286

Query: 301 ELQLELQPIRNQ 312
           EL  +   I  +
Sbjct: 287 ELNSQQSTILMK 298


>gnl|CDD|129130 TIGR00019, prfA, peptide chain release factor 1.  This model
           describes peptide chain release factor 1 (PrfA, RF-1),
           and excludes the related peptide chain release factor 2
           (PrfB, RF-2). RF-1 helps recognize and terminate
           translation at UAA and UAG stop codons. The
           mitochondrial release factors are prfA-like, although
           not included above the trusted cutoff for this model.
           RF-1 does not have a translational frameshift [Protein
           synthesis, Translation factors].
          Length = 360

 Score = 28.5 bits (64), Expect = 4.4
 Identities = 18/75 (24%), Positives = 37/75 (49%), Gaps = 6/75 (8%)

Query: 235 KLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEE 294
           KL  E +  E+    ++E  +  + ++      ++   E +EM    +E+L++L+   EE
Sbjct: 37  KLSKEYSQLEEIVDCYREYQQAQEDIKEAKEILEESDPEMREMA---KEELEELEEKIEE 93

Query: 295 LEARQTELQLELQPI 309
           LE    +L++ L P 
Sbjct: 94  LE---EQLKVLLLPK 105


>gnl|CDD|222698 pfam14346, DUF4398, Domain of unknown function (DUF4398).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria and archaea.
           Proteins in this family are typically between 127 and
           269 amino acids in length.
          Length = 105

 Score = 27.3 bits (61), Expect = 4.5
 Identities = 20/89 (22%), Positives = 38/89 (42%), Gaps = 11/89 (12%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIA------AREKAEKKHQESVERLKQLEVEMAKR 267
           D +    +A E   A+Q     ++L++A      A+   ++   E   RL     E A+ 
Sbjct: 21  DAEAALERA-EAAGAEQYAPPYVELKLAREKLAQAKAALDEGKYEEARRL----AEQAEA 75

Query: 268 DQDLMEAQEMIRRLEEQLKQLQAAKEELE 296
           D  L EA+    + +  L++ + + E L 
Sbjct: 76  DARLAEAKARAAKAQAALEEAKDSIERLR 104


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 28.4 bits (64), Expect = 4.5
 Identities = 15/59 (25%), Positives = 27/59 (45%), Gaps = 9/59 (15%)

Query: 219 KAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQE--------SVERLKQLEVEMAKRDQ 269
           K +  EE+   +   ++ + E A  +K EKK +E        S E  ++LE +  K+  
Sbjct: 263 KTREEEEEKILKAA-EEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERKKQA 320



 Score = 28.0 bits (63), Expect = 6.8
 Identities = 14/54 (25%), Positives = 28/54 (51%), Gaps = 2/54 (3%)

Query: 242 AREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRL--EEQLKQLQAAKE 293
            RE+ E+K  ++ E  +Q E +  K ++   E +  + +L  EEQ K  +  ++
Sbjct: 264 TREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAKLAKLSPEEQRKLEEKERK 317


>gnl|CDD|129705 TIGR00618, sbcc, exonuclease SbcC.  All proteins in this family for
           which functions are known are part of an exonuclease
           complex with sbcD homologs. This complex is involved in
           the initiation of recombination to regulate the levels
           of palindromic sequences in DNA. This family is based on
           the phylogenomic analysis of JA Eisen (1999, Ph.D.
           Thesis, Stanford University) [DNA metabolism, DNA
           replication, recombination, and repair].
          Length = 1042

 Score = 28.8 bits (64), Expect = 4.6
 Identities = 21/105 (20%), Positives = 36/105 (34%), Gaps = 5/105 (4%)

Query: 215 VQQMKAQAREEKNAKQQQRDKLQLEIAAREK-AEKKHQESVERLKQLEVEMAKRDQDLME 273
             Q+      +K +   + + L L                 ER + LE E+    + L E
Sbjct: 178 YTQLALMEFAKKKSLHGKAELLTLRSQLLTLCTPCMPDTYHERKQVLEKEL----KHLRE 233

Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNLNKV 318
           A +  ++    L Q + A+EE   +Q  L+     I        V
Sbjct: 234 ALQQTQQSHAYLTQKREAQEEQLKKQQLLKQLRARIEELRAQEAV 278


>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain.  This is a family of proteins of
           approximately 300 residues, found in plants and
           vertebrates. They contain a highly conserved DDRGK
           motif.
          Length = 189

 Score = 28.1 bits (63), Expect = 4.6
 Identities = 23/93 (24%), Positives = 43/93 (46%), Gaps = 8/93 (8%)

Query: 213 IDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLM 272
             +   K    EEK A++QQR   + E   RE+ +K  ++     K+ E    +R++   
Sbjct: 1   KKIGAKKRAKLEEKQARRQQR---EAEEEEREERKKLEEKREGERKEEEELEEEREK--- 54

Query: 273 EAQEMIRRLEEQLKQLQAAKEELEARQTELQLE 305
             +E   R E + +Q +  +EE E  ++   +E
Sbjct: 55  -KKEEEERKERE-EQARKEQEEYEKLKSSFVVE 85


>gnl|CDD|216249 pfam01025, GrpE, GrpE. 
          Length = 165

 Score = 27.9 bits (63), Expect = 4.6
 Identities = 13/46 (28%), Positives = 24/46 (52%)

Query: 266 KRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRN 311
           +  ++  E ++    LEE+L++L+   EEL+ R   L  E +  R 
Sbjct: 1   EEKEEEEELEDEEEALEEELEELEEEIEELKDRLLRLLAEFENYRK 46


>gnl|CDD|236335 PRK08724, fliD, flagellar capping protein; Validated.
          Length = 673

 Score = 28.7 bits (64), Expect = 4.6
 Identities = 55/286 (19%), Positives = 99/286 (34%), Gaps = 64/286 (22%)

Query: 53  GVNYFEIKNKKGTELWLGVDALGLNIYEKEDNLTNVMSQDVKKENPL-QFKFRAKFYPED 111
           GV    I + +G  L L  +  G      +DN    +S D +  NPL +F++        
Sbjct: 159 GVRASIINDVEGPRLILASNVSG------KDNQIK-ISVDAEYGNPLKRFEY-------- 203

Query: 112 VVEEIIQDITLRLFYLQVSRSAGSRVRFPPGPNCLLTPKIGFPWSEIRNISFNDRKFIIK 171
                 + +  R+  L+ +R+A  +V  P      LTP+      E+     ++    I 
Sbjct: 204 ------KTLEDRVRALEKARAAAQQVIAP------LTPEEQKVAPEL----SDEEGNAIP 247

Query: 172 PIDKKAPDFVFFAPRVRVNKRILALCMGNHELYMRRRKPDTIDVQQMKAQAREEKNAKQQ 231
           P D++  + +  A ++   +   A      E             Q     A  +   + +
Sbjct: 248 PADQEVAEEIQDAAQIAQQQEATAALAALEEPISAGGATAAAAGQAAIDAAEAKAYLRPE 307

Query: 232 QR------------------DKLQLEIAAREK----------AEKKHQESVERLK----Q 259
            R                   + +L+ AA  K          A     +S    K    +
Sbjct: 308 DRIPGWTETASGTLLDSYPEPEEELDEAAIAKAPDVPGWSNTASGTLTDSYVTPKEAQAE 367

Query: 260 LEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLE 305
           +E ++A+    L  A E      EQ KQ+  AK E E R+   +++
Sbjct: 368 IEQKLAQEKAQLDAAVEKGELTPEQAKQIARAKLEPEERERLEKID 413


>gnl|CDD|224002 COG1076, DjlA, DnaJ-domain-containing proteins 1 [Posttranslational
           modification, protein turnover, chaperones].
          Length = 174

 Score = 28.1 bits (63), Expect = 4.7
 Identities = 11/51 (21%), Positives = 26/51 (50%), Gaps = 4/51 (7%)

Query: 250 HQESVERLKQLEVEMAKRDQDLMEAQ----EMIRRLEEQLKQLQAAKEELE 296
             +     K     ++++  D   A+    E I +L+E+L+++Q A E+++
Sbjct: 124 KADQDAIKKAYRKLLSEQHPDKAAAKGLKLEFIEKLKEKLQEIQEAYEDIK 174


>gnl|CDD|203922 pfam08377, MAP2_projctn, MAP2/Tau projection domain.  This domain is
            found in the MAP2/Tau family of proteins which includes
            MAP2, MAP4, Tau, and their homologs. All isoforms contain
            a conserved C-terminal domain containing tubulin-binding
            repeats (pfam00418), and a N-terminal projection domain
            of varying size. This domain has a net negative charge
            and exerts a long-range repulsive force. This provides a
            mechanism that can regulate microtubule spacing which
            might facilitate efficient organelle transport.
          Length = 1134

 Score = 29.0 bits (64), Expect = 4.8
 Identities = 23/97 (23%), Positives = 47/97 (48%), Gaps = 5/97 (5%)

Query: 212  TIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDL 271
            +I  +Q++   +EE+  K+ +R  L+     + + EK  +    R+   E ++AK++   
Sbjct: 1027 SILTEQLETIPKEERAEKEARRPSLE-----KHRKEKPFKTGRGRISTPERKVAKKEPST 1081

Query: 272  MEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQP 308
            +   E+ R+     K   A K E++A     +L L+P
Sbjct: 1082 VSRDEVRRKKAVYKKAELAKKSEVQAHSPSRKLILKP 1118


>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1.  This domain
           family is found in eukaryotes, and is approximately 40
           amino acids in length. The family is found in
           association with pfam07719, pfam00515. There is a single
           completely conserved residue L that may be functionally
           important. NARP1 is the mammalian homologue of a yeast
           N-terminal acetyltransferase that regulates entry into
           the G(0) phase of the cell cycle.
          Length = 516

 Score = 28.7 bits (65), Expect = 4.9
 Identities = 15/56 (26%), Positives = 26/56 (46%), Gaps = 3/56 (5%)

Query: 219 KAQAREEKNAKQQQRD---KLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDL 271
                E K  +++QR    K + E A +  A+KK + + ++ K  + E  K D D 
Sbjct: 405 NLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKAKGPDGETKKVDPDP 460


>gnl|CDD|220368 pfam09730, BicD, Microtubule-associated protein Bicaudal-D.  BicD
           proteins consist of three coiled-coiled domains and are
           involved in dynein-mediated minus end-directed transport
           from the Golgi apparatus to the endoplasmic reticulum
           (ER). For full functioning they bind with GSK-3beta
           pfam05350 to maintain the anchoring of microtubules to
           the centromere. It appears that amino-acid residues
           437-617 of BicD and the kinase activity of GSK-3 are
           necessary for the formation of a complex between BicD
           and GSK-3beta in intact cells.
          Length = 711

 Score = 28.6 bits (64), Expect = 4.9
 Identities = 20/91 (21%), Positives = 44/91 (48%), Gaps = 8/91 (8%)

Query: 219 KAQAREEKNAK--QQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQE 276
           +A +   K A   +++ + L  E A++E         ++R+ +L+ E+ +   +L   Q 
Sbjct: 2   QAYSSHRKVAADGEEREESLLQESASKE------AYYLQRILELQAELKQLRAELSNVQA 55

Query: 277 MIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
              RL    ++L+   E LE ++  L+ E++
Sbjct: 56  ENERLSSLSQELKEENEMLELQRGRLRDEIK 86


>gnl|CDD|220785 pfam10498, IFT57, Intra-flagellar transport protein 57.  Eukaryotic
           cilia and flagella are specialised organelles found at
           the periphery of cells of diverse organisms.
           Intra-flagellar transport (IFT) is required for the
           assembly and maintenance of eukaryotic cilia and
           flagella, and consists of the bidirectional movement of
           large protein particles between the base and the distal
           tip of the organelle. IFT particles contain multiple
           copies of two distinct protein complexes, A and B, which
           contain at least 6 and 11 protein subunits. IFT57 is
           part of complex B but is not, however, required for the
           core subunits to stay associated. This protein is known
           as Huntington-interacting protein-1 in humans.
          Length = 355

 Score = 28.5 bits (64), Expect = 5.0
 Identities = 21/104 (20%), Positives = 45/104 (43%), Gaps = 11/104 (10%)

Query: 213 IDVQQMKAQAREEKNA---KQQQRDKLQLEIA-AREKAEKKHQESVERLKQLEVEMAKRD 268
             ++QM    +    +    +Q   KL  +++ A EK   + +    +L+QL        
Sbjct: 216 SHLEQMHQLKKAISKSLPETKQYLKKLSQDLSKALEKISSREKHLNNQLEQLV------- 268

Query: 269 QDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
           Q+  EA+  + +++E+  Q      EL     E+  EL+ ++ +
Sbjct: 269 QEYREARRTLSQVQEKYNQASQGVSELTRELNEISEELEQVKQE 312


>gnl|CDD|222647 pfam14282, FlxA, FlxA-like protein.  This family includes FlxA from
           E. coli. The expression of FlxA is regulated by the FliA
           sigma factor, a transcription factor specific for class
           3 flagellar operons. However FlxA is not required for
           flagellar function or formation.
          Length = 106

 Score = 27.0 bits (60), Expect = 5.0
 Identities = 14/49 (28%), Positives = 29/49 (59%), Gaps = 4/49 (8%)

Query: 268 DQDLMEAQEMIRRLEEQLKQLQAAK----EELEARQTELQLELQPIRNQ 312
           D  +   Q+ I+ L +QLK+L +++    EE + +Q  +Q ++Q ++ Q
Sbjct: 18  DSQIKSLQKQIQSLTKQLKELSSSEDLTAEEKQKQQQLIQQQIQMLQAQ 66


>gnl|CDD|184303 PRK13753, PRK13753, dihydropteroate synthase; Provisional.
          Length = 279

 Score = 28.5 bits (63), Expect = 5.1
 Identities = 14/41 (34%), Positives = 22/41 (53%)

Query: 109 PEDVVEEIIQDITLRLFYLQVSRSAGSRVRFPPGPNCLLTP 149
           PED ++EI++    R+  L+ S  A  R+   PG    L+P
Sbjct: 142 PEDALDEIVRFFEARVSALRRSGVAADRLILDPGMGFFLSP 182


>gnl|CDD|185272 PRK15374, PRK15374, pathogenicity island 1 effector protein SipB;
           Provisional.
          Length = 593

 Score = 28.8 bits (64), Expect = 5.1
 Identities = 15/53 (28%), Positives = 28/53 (52%), Gaps = 1/53 (1%)

Query: 251 QESVERLKQLEVEMAKRDQDLM-EAQEMIRRLEEQLKQLQAAKEELEARQTEL 302
           Q  +E  K++ ++++K  Q  + EAQE     E  +K+   AK   +A + +L
Sbjct: 112 QAMIESQKEMGIQVSKEFQTALGEAQEATDLYEASIKKTDTAKSVYDAAEKKL 164



 Score = 28.0 bits (62), Expect = 8.8
 Identities = 17/95 (17%), Positives = 41/95 (43%), Gaps = 10/95 (10%)

Query: 213 IDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLM 272
           + V Q   ++++E     Q   + Q  +   ++A   ++ S+++    +      ++ L 
Sbjct: 108 LAVWQAMIESQKEMG--IQVSKEFQTALGEAQEATDLYEASIKKTDTAKSVYDAAEKKLT 165

Query: 273 EAQEMIRRL--------EEQLKQLQAAKEELEARQ 299
           +AQ  ++ L        + +    QA KE  EA++
Sbjct: 166 QAQNKLQSLDPADPGYAQAEAAVEQAGKEATEAKE 200


>gnl|CDD|225530 COG2983, COG2983, Uncharacterized conserved protein [Function
           unknown].
          Length = 153

 Score = 27.8 bits (62), Expect = 5.2
 Identities = 16/53 (30%), Positives = 20/53 (37%), Gaps = 3/53 (5%)

Query: 106 KFYPEDVVEEIIQDITLRLFYLQVSRSAGSRVRFPPGPNCL-LTPKI--GFPW 155
           K   ED  E     +   L   +  R      RF   P+C+ LTPK    F W
Sbjct: 39  KLEDEDTDEIYYTRVACELLDPETCRCKDYENRFKTVPDCIKLTPKNLPEFEW 91


>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
          Length = 1068

 Score = 28.9 bits (65), Expect = 5.2
 Identities = 25/101 (24%), Positives = 48/101 (47%), Gaps = 16/101 (15%)

Query: 223 REEKNAKQQQ----RDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMI 278
           R  +   QQQ    R+  Q E+  + + + + Q++  R +Q      +R+ +  +AQ+  
Sbjct: 640 RRNRRQAQQQTAETRESQQAEVTEKARTQDEQQQAPRRERQ-----RRRNDEKRQAQQEA 694

Query: 279 RRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNLN-KV 318
           + L  + + +Q  ++E   +Q      +QP R Q  LN KV
Sbjct: 695 KALNVEEQSVQETEQEERVQQ------VQPRRKQRQLNQKV 729


>gnl|CDD|202204 pfam02321, OEP, Outer membrane efflux protein.  The OEP family
           (Outer membrane efflux protein) form trimeric channels
           that allow export of a variety of substrates in Gram
           negative bacteria. Each member of this family is
           composed of two repeats. The trimeric channel is
           composed of a 12 stranded all beta sheet barrel that
           spans the outer membrane, and a long all helical barrel
           that spans the periplasm.
          Length = 186

 Score = 27.9 bits (62), Expect = 5.2
 Identities = 29/110 (26%), Positives = 43/110 (39%), Gaps = 18/110 (16%)

Query: 215 VQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEA 274
           V+  KAQ    +   +Q R +L+LE+A         +E +E  KQ           L  A
Sbjct: 83  VKAAKAQLEAAEAELEQARRQLRLEVAQAYFNLLAAKEQLELAKQA----------LELA 132

Query: 275 QEMIRRLEEQLKQ--------LQAAKEELEARQTELQLELQPIRNQSNLN 316
           +E +   E + +         LQA  E LEAR   L  E      ++ L 
Sbjct: 133 EEALELAEARYEAGLISLLDVLQAEVELLEARLELLNAEADLELARAQLE 182


>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain.  This domain is
           found at the N terminus of SMC proteins. The SMC
           (structural maintenance of chromosomes) superfamily
           proteins have ATP-binding domains at the N- and
           C-termini, and two extended coiled-coil domains
           separated by a hinge in the middle. The eukaryotic SMC
           proteins form two kind of heterodimers: the SMC1/SMC3
           and the SMC2/SMC4 types. These heterodimers constitute
           an essential part of higher order complexes, which are
           involved in chromatin and DNA dynamics. This family also
           includes the RecF and RecN proteins that are involved in
           DNA metabolism and recombination.
          Length = 1162

 Score = 28.8 bits (64), Expect = 5.2
 Identities = 19/88 (21%), Positives = 39/88 (44%), Gaps = 4/88 (4%)

Query: 224 EEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMI----R 279
           +E    +++  K + EI   EK  K+ +   E  ++ E ++ K  + L + +E +    +
Sbjct: 322 KELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKKK 381

Query: 280 RLEEQLKQLQAAKEELEARQTELQLELQ 307
              E+L      KEE    + E + E +
Sbjct: 382 LESERLSSAAKLKEEELELKNEEEKEAK 409


>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
          Length = 1463

 Score = 28.9 bits (65), Expect = 5.2
 Identities = 24/106 (22%), Positives = 39/106 (36%), Gaps = 4/106 (3%)

Query: 208  RKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKR 267
               D  +   ++     E N   +   KL+      E   KK    V    +   E+   
Sbjct: 938  NNIDLEEGPSIEYVKLPELNKLHEVESKLKETSEEYEDLLKKSTILVREGNKANSELKNF 997

Query: 268  DQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQS 313
             ++L E  +    L+E  KQL    +EL     ELQ   + I ++S
Sbjct: 998  KKELAELSKQYGALQESTKQL----KELPVEVAELQSASKIISSES 1039


>gnl|CDD|234345 TIGR03755, conj_TIGR03755, integrating conjugative element protein,
           PFL_4711 family.  Members of this protein family are
           found in genomic regions associated with conjugative
           transfer and integrated TOL-like plasmids. The specific
           function is unknown [Mobile and extrachromosomal element
           functions, Plasmid functions].
          Length = 418

 Score = 28.4 bits (64), Expect = 5.4
 Identities = 16/70 (22%), Positives = 34/70 (48%), Gaps = 11/70 (15%)

Query: 254 VERLKQLEVEMAKRDQDLMEAQEMIRRL------EEQLKQLQAAKEELEARQTELQLELQ 307
           V+RL   E+ +A    D +E   ++RR+      E  +   + A++E++    +L  E+ 
Sbjct: 313 VQRLAS-EIALA----DTLEKALLMRRMLLTGLQEPNVAANKPAQQEVDKAIDKLDREIN 367

Query: 308 PIRNQSNLNK 317
            ++ +  L K
Sbjct: 368 NLKTELELRK 377


>gnl|CDD|224644 COG1730, GIM5, Predicted prefoldin, molecular chaperone implicated
           in de novo protein folding [Posttranslational
           modification, protein turnover, chaperones].
          Length = 145

 Score = 27.7 bits (62), Expect = 5.5
 Identities = 16/48 (33%), Positives = 27/48 (56%)

Query: 264 MAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRN 311
           MA+  Q+L E    ++ L+ Q++ LQA    L A  +ELQ  ++ + N
Sbjct: 1   MAQTQQELEELAAQLQILQSQIESLQAQIAALNAAISELQTAIETLEN 48



 Score = 27.3 bits (61), Expect = 6.6
 Identities = 17/58 (29%), Positives = 33/58 (56%), Gaps = 3/58 (5%)

Query: 246 AEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQ 303
           AEK   E++E LK+   E+ K  + L   Q+ +  L ++++QL+   ++L+ +Q   Q
Sbjct: 88  AEKSADEAIEFLKKRIEELEKAIEKL---QQALAELAQRIEQLEQEAQQLQQKQAAAQ 142


>gnl|CDD|150931 pfam10335, DUF294_C, Putative nucleotidyltransferase substrate
           binding domain.  This domain is found associated with
           presumed nucleotidyltransferase domains and seems to be
           distantly related to other helical substrate binding
           domains.
          Length = 145

 Score = 27.5 bits (62), Expect = 5.5
 Identities = 16/48 (33%), Positives = 26/48 (54%), Gaps = 11/48 (22%)

Query: 254 VERLKQLE------VEMAKRDQDLMEAQEMI--RRLEEQLKQLQAAKE 293
           +ERL+ L        E+A   +DL+EA E +   RL  QL+ ++A + 
Sbjct: 65  LERLRALAEAGVLSAELA---EDLLEAYEFLLRLRLRHQLEAIEAGRP 109


>gnl|CDD|184254 PRK13698, PRK13698, plasmid-partitioning protein; Provisional.
          Length = 323

 Score = 28.3 bits (63), Expect = 5.5
 Identities = 18/65 (27%), Positives = 32/65 (49%), Gaps = 4/65 (6%)

Query: 239 EIAAR--EKAEKKHQESVERLKQLEVEMAKRDQD--LMEAQEMIRRLEEQLKQLQAAKEE 294
           E++AR  E  +K   +  E LKQ    + ++ +   + E +E+I  L   LK   A++  
Sbjct: 215 ELSARSGEALQKAFTDKEELLKQQTSNLHEQKKAGVIFETEEVITLLTSVLKTSSASRTS 274

Query: 295 LEARQ 299
           L +R 
Sbjct: 275 LSSRH 279


>gnl|CDD|221489 pfam12252, SidE, Dot/Icm substrate protein.  This family of proteins
            is found in bacteria. Proteins in this family are
            typically between 397 and 1543 amino acids in length.
            This family is the SidE protein in the Dot/Icm pathway of
            Legionella pneumophila bacteria. There is little
            literature describing the family.
          Length = 1443

 Score = 28.7 bits (64), Expect = 5.5
 Identities = 27/95 (28%), Positives = 47/95 (49%), Gaps = 6/95 (6%)

Query: 217  QMKAQA--REEKNAKQ--QQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLM 272
            QMK +    +EK  +Q  +   KL+  +   + AEK  +E + +LK L +   K   +++
Sbjct: 1271 QMKPKTFEMQEKEIQQNFELLAKLEKTLDKSDTAEKL-REDIPKLKDLLIAKQKAYPEMV 1329

Query: 273  EAQEMIRRLEEQLKQL-QAAKEELEARQTELQLEL 306
            + Q     L  QL++L QA  ++L   +T    EL
Sbjct: 1330 QLQYKSEALITQLRELCQAHHDDLAKTRTARLQEL 1364


>gnl|CDD|151277 pfam10828, DUF2570, Protein of unknown function (DUF2570).  This is
           a family of proteins with unknown function.
          Length = 110

 Score = 27.1 bits (60), Expect = 5.6
 Identities = 12/59 (20%), Positives = 29/59 (49%)

Query: 254 VERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQ 312
            +++  L  E   + Q + + Q+  +RL + L+Q + A EE +    E++ + +    +
Sbjct: 24  SQKIDSLRAENKAQAQTIEQQQKANQRLTDALEQERQAVEEQQKIANEIERKAEENAEE 82


>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 265

 Score = 28.1 bits (63), Expect = 5.6
 Identities = 21/77 (27%), Positives = 39/77 (50%), Gaps = 4/77 (5%)

Query: 211 DTIDVQQMK-AQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQ 269
           D I  Q  K ++ ++EK   Q + + L  +I   E+ + K  E  + + Q + E+ K  +
Sbjct: 31  DKIQNQDSKLSELQKEKKNIQNEIESLDNQI---EEIQSKIDELQKEIDQSKAEIKKLQK 87

Query: 270 DLMEAQEMIRRLEEQLK 286
           ++ E +E I   +E LK
Sbjct: 88  EIAELKENIVERQELLK 104



 Score = 27.8 bits (62), Expect = 8.1
 Identities = 19/75 (25%), Positives = 35/75 (46%), Gaps = 5/75 (6%)

Query: 230 QQQRDKLQLEIAAREKAEKKHQESVERL-KQLEVEMAKRDQDLMEAQEMIRRLEEQLKQL 288
             +      +++  +K +K  Q  +E L  Q+E   +K D    E Q+ I + + ++K+L
Sbjct: 30  SDKIQNQDSKLSELQKEKKNIQNEIESLDNQIEEIQSKID----ELQKEIDQSKAEIKKL 85

Query: 289 QAAKEELEARQTELQ 303
           Q    EL+    E Q
Sbjct: 86  QKEIAELKENIVERQ 100


>gnl|CDD|235850 PRK06669, fliH, flagellar assembly protein H; Validated.
          Length = 281

 Score = 28.1 bits (63), Expect = 5.6
 Identities = 20/99 (20%), Positives = 46/99 (46%), Gaps = 9/99 (9%)

Query: 215 VQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQL-EVEMAK-----RD 268
           V+  + +A+EE    ++  +   +    + + E++ +E  E L++L E   A+      +
Sbjct: 72  VEAAEEEAKEE--LLKKTDEASSIIEKLQMQIEREQEEWEEELERLIEEAKAEGYEEGYE 129

Query: 269 QDLMEAQEMIRRLEEQLKQ-LQAAKEELEARQTELQLEL 306
           +   E  E +R L EQL + ++   ++ E      + E+
Sbjct: 130 KGREEGLEEVRELIEQLNKIIEKLIKKREEILESSEEEI 168


>gnl|CDD|220605 pfam10156, Med17, Subunit 17 of Mediator complex.  This Mediator
           complex subunit was formerly known as Srb4 in yeasts or
           Trap80 in Drosophila and human. The Med17 subunit is
           located within the head domain and is essential for cell
           viability to the extent that a mutant strain of
           cerevisiae lacking it shows all RNA polymerase
           II-dependent transcription ceasing at non-permissive
           temperatures.
          Length = 454

 Score = 28.5 bits (64), Expect = 5.8
 Identities = 17/88 (19%), Positives = 38/88 (43%), Gaps = 1/88 (1%)

Query: 209 KPDTIDVQQMKAQAREEKNA-KQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKR 267
             D + ++++  +  +E+ + +    + L+ EIA         +ES E     E +    
Sbjct: 26  DEDPLSLEELIERINQERGSFRDLTEESLREEIAKEAAKIDFSEESDEEEDDEEDDNDDS 85

Query: 268 DQDLMEAQEMIRRLEEQLKQLQAAKEEL 295
           +++    +E  +  +E L+QL  A  E 
Sbjct: 86  EENKDTVEEFPKARDEVLEQLNQALNES 113


>gnl|CDD|119241 pfam10721, DUF2514, Protein of unknown function (DUF2514).  This
           family is conserved in bacteria and some viruses. The
           function is not known.
          Length = 162

 Score = 27.8 bits (62), Expect = 5.8
 Identities = 22/82 (26%), Positives = 32/82 (39%), Gaps = 10/82 (12%)

Query: 216 QQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQ 275
            +  A A  E  A+   R + Q   AA+ +A K  QE          E A    D  +A 
Sbjct: 36  DRDAADALAEVIAETAARAEEQRRQAAQNEAAKDAQE----------EAAAARADAADAD 85

Query: 276 EMIRRLEEQLKQLQAAKEELEA 297
               RL ++  +L AAK    +
Sbjct: 86  AAAGRLRQEADKLAAAKSCPTS 107


>gnl|CDD|217443 pfam03234, CDC37_N, Cdc37 N terminal kinase binding.  Cdc37 is a
           molecular chaperone required for the activity of
           numerous eukaryotic protein kinases. This domain
           corresponds to the N terminal domain which binds
           predominantly to protein kinases and is found N terminal
           to the Hsp (Heat shocked protein) 90-binding domain
           pfam08565. Expression of a construct consisting of only
           the N-terminal domain of Saccharomyces pombe Cdc37
           results in cellular viability. This indicates that
           interactions with the cochaperone Hsp90 may not be
           essential for Cdc37 function.
          Length = 172

 Score = 27.8 bits (62), Expect = 6.1
 Identities = 17/92 (18%), Positives = 39/92 (42%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
            V ++ ++ +EE     Q   K   E    ++  +  Q +   + +   +  K + D   
Sbjct: 64  RVDKLLSELKEESLDSSQAVMKSLNENFTDKENVEPEQPTYNEMVEDLFDQVKDEVDEKN 123

Query: 274 AQEMIRRLEEQLKQLQAAKEELEARQTELQLE 305
              +I  L++   +L+  ++EL  +  EL+ E
Sbjct: 124 GAALIEELQKHRDKLKKEQKELLKKLDELEKE 155


>gnl|CDD|192111 pfam08647, BRE1, BRE1 E3 ubiquitin ligase.  BRE1 is an E3 ubiquitin
           ligase that has been shown to act as a transcriptional
           activator through direct activator interactions.
          Length = 96

 Score = 26.9 bits (60), Expect = 6.1
 Identities = 20/83 (24%), Positives = 33/83 (39%), Gaps = 15/83 (18%)

Query: 236 LQLEIAAREKAEKKHQESV-----------ERLKQLEVEMAKRDQDLMEAQEMIRRLEEQ 284
           LQ E+ + EKA K+                ++  +L  E AK DQ    A      L+ +
Sbjct: 1   LQKELVSLEKAFKELSSLALKKVAELTILEQKKLRLTAEKAKADQKYFAAMRSKDALDAE 60

Query: 285 LKQLQAAKEELEARQTELQLELQ 307
            K+L         + +EL  +L+
Sbjct: 61  NKKLNT----QLNKSSELIEQLK 79


>gnl|CDD|181417 PRK08416, PRK08416, 7-alpha-hydroxysteroid dehydrogenase;
           Provisional.
          Length = 260

 Score = 28.2 bits (63), Expect = 6.2
 Identities = 11/43 (25%), Positives = 24/43 (55%), Gaps = 7/43 (16%)

Query: 75  GLNI---YEKEDNLTNVMSQDVKKENPLQFKFRAKFYPEDVVE 114
           G+NI   Y       N +++D++++    +  +AK YP +++E
Sbjct: 32  GVNIAFTYNSNVEEANKIAEDLEQK----YGIKAKAYPLNILE 70


>gnl|CDD|221120 pfam11488, Lge1, Transcriptional regulatory protein LGE1.  This
           family of proteins is conserved from fungi to human. In
           yeasts it is involved in the ubiquitination of histones
           H2A and H2B. This ubiquitination step is a vital one in
           the regulation of the transcriptional activity of RNA
           polymerase II. In S. cerevisiae, Rad6 and Bre1 are
           present in a complex, also containing Lge1, that is
           required for H2B ubiquitination. Bre1 is the H2B
           ubiquitin ligase that interacts with acidic activators,
           such as Gal4, and recruits Rad6 and its binding partner
           Lge1 to target promoters. In S. pombe the equivalent
           protein to Lge1 appears to be Shf1.
          Length = 75

 Score = 26.1 bits (58), Expect = 6.2
 Identities = 15/54 (27%), Positives = 27/54 (50%), Gaps = 2/54 (3%)

Query: 254 VERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQL--QAAKEELEARQTELQLE 305
            E+L+    E+   DQ L E +    +LE +L  L  Q  ++ L  + T+ +L+
Sbjct: 17  QEKLRSRYKELESLDQKLEELKLQKLKLELELALLETQVERDALNVQLTQEKLD 70


>gnl|CDD|221198 pfam11740, KfrA_N, Plasmid replication region DNA-binding N-term.
           The broad host-range plasmid RK2 is able to replicate in
           and be inherited in a stable manner in diverse
           Gram-negative bacterial species. It encodes a number of
           co-ordinately regulated operons including a central
           control korF1 operon that represses the kfrA operon. The
           KfrA polypeptide is a site-specific DNA-binding protein
           whose operator overlaps the kfrA promoter. The
           N-terminus, containing an helix-turn-helix motif, is
           essential for function. Downstream from this family is
           an extended coiled-coil domain containing a heptad
           repeat segment which is probably responsible for
           formation of multimers, and may provide an example of a
           bridge to host structures required for plasmid
           partitioning.
          Length = 120

 Score = 27.3 bits (61), Expect = 6.2
 Identities = 13/53 (24%), Positives = 18/53 (33%), Gaps = 2/53 (3%)

Query: 251 QESVERLKQLEVEMAKR--DQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTE 301
            E V  L +   E A+         A+     LE +L +  A  E LE     
Sbjct: 68  AELVAALWEAAQEEAEERLAAARAAAEAERAELEAELAEAAAEAEALEEELEA 120


>gnl|CDD|114015 pfam05266, DUF724, Protein of unknown function (DUF724).  This
           family contains several uncharacterized proteins found
           in Arabidopsis thaliana and other plants. This region is
           often found associated with Agenet domains and may
           contain coiled-coil.
          Length = 190

 Score = 27.8 bits (62), Expect = 6.4
 Identities = 19/70 (27%), Positives = 30/70 (42%), Gaps = 8/70 (11%)

Query: 248 KKHQ-ESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQ-------AAKEELEARQ 299
           K  Q + +E  K LE E+A+++    E    I  LE ++ +LQ         KE  +   
Sbjct: 102 KDDQTKKLEERKGLEKEIAEKEISRQELDSEIAELERKILELQRQAALLKEKKEAEDKEI 161

Query: 300 TELQLELQPI 309
             L+ E   I
Sbjct: 162 ARLKSEASKI 171


>gnl|CDD|235751 PRK06241, PRK06241, phosphoenolpyruvate synthase; Validated.
          Length = 871

 Score = 28.3 bits (64), Expect = 6.4
 Identities = 14/39 (35%), Positives = 22/39 (56%), Gaps = 4/39 (10%)

Query: 243 REKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRL 281
           R++A +K +E + RL+QL       +Q   E + MI RL
Sbjct: 628 RQEALEKEEELLSRLQQLP----DGEQKAKETKRMISRL 662


>gnl|CDD|234082 TIGR02997, Sig70-cyanoRpoD, RNA polymerase sigma factor,
           cyanobacterial RpoD-like family.  This family includes a
           number of closely related sigma-70 (TIGR02937) factors
           in the cyanobacteria. All appear most closely related to
           the essential sigma-70 factor RpoD, and some score above
           trusted to the RpoD C-terminal domain model (TIGR02393).
          Length = 298

 Score = 28.0 bits (63), Expect = 6.5
 Identities = 15/46 (32%), Positives = 27/46 (58%), Gaps = 1/46 (2%)

Query: 261 EVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQ-TELQLE 305
           E+E+A++ Q +M  +E+   LEEQL +  + +E   A   +E +L 
Sbjct: 21  EIELARQVQQMMVLEELREELEEQLGREPSKEEWAAAAGLSEAELR 66


>gnl|CDD|190336 pfam02520, DUF148, Domain of unknown function DUF148.  This domain
           has no known function nor do any of the proteins that
           possess it. In one member of this family the aligned
           region is repeated twice.
          Length = 114

 Score = 26.8 bits (60), Expect = 6.8
 Identities = 19/81 (23%), Positives = 34/81 (41%), Gaps = 5/81 (6%)

Query: 212 TIDVQQMKAQAREEKNAKQQQRDKLQLEIAAR-EKAEKKHQESVERL----KQLEVEMAK 266
           TI     + +A  EK     Q ++ +  + A   +AEK     +  L     +L   +  
Sbjct: 17  TIAEIDAQLEAWAEKYGVADQYEEFKKNVKALKAEAEKNVTAVISNLSSVDAKLSAILDN 76

Query: 267 RDQDLMEAQEMIRRLEEQLKQ 287
           +D    + +E I  L++QL Q
Sbjct: 77  KDLTRRQKKEAIDALKKQLPQ 97


>gnl|CDD|223618 COG0544, Tig, FKBP-type peptidyl-prolyl cis-trans isomerase
           (trigger factor) [Posttranslational modification,
           protein turnover, chaperones].
          Length = 441

 Score = 28.0 bits (63), Expect = 7.0
 Identities = 13/84 (15%), Positives = 34/84 (40%), Gaps = 1/84 (1%)

Query: 215 VQQMKAQAREE-KNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLME 273
           ++++K + R+  +   ++   + + E       E    +  E L + E++   +      
Sbjct: 262 LEELKEKLRKNLERELKEATLEKRKEQLLDALVEANDFDLPESLVEAEIDNLLKQALQQL 321

Query: 274 AQEMIRRLEEQLKQLQAAKEELEA 297
            Q+ I  LE   +  +  +EE + 
Sbjct: 322 QQQGIDSLEASGESEEELREEFKE 345


>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207).  This
           family is found in eukaryotes; it has several conserved
           tryptophan residues. The function is not known.
          Length = 261

 Score = 27.8 bits (62), Expect = 7.1
 Identities = 21/87 (24%), Positives = 38/87 (43%), Gaps = 8/87 (9%)

Query: 215 VQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQL--EVEMAKRDQDLM 272
            +Q   QA +++  K ++        +    A+ +   S E  K+   E E+ K      
Sbjct: 138 CRQKAQQAAKQRTPKHKKEAAESASSSLSGSAKPERNVSQEEAKKRLQEWELKKL----- 192

Query: 273 EAQEMIRRLEEQLKQLQAAKEELEARQ 299
             Q+  +R EE+ KQ +  +EE E +Q
Sbjct: 193 -KQQQQKREEERRKQRKKQQEEEERKQ 218


>gnl|CDD|115072 pfam06391, MAT1, CDK-activating kinase assembly factor MAT1.  MAT1
           is an assembly/targeting factor for cyclin-dependent
           kinase-activating kinase (CAK), which interacts with the
           transcription factor TFIIH. The domain found to the
           N-terminal side of this domain is a C3HC4 RING finger.
          Length = 200

 Score = 27.8 bits (62), Expect = 7.1
 Identities = 22/101 (21%), Positives = 51/101 (50%), Gaps = 2/101 (1%)

Query: 200 NHELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQ 259
           N +  MR ++  T + ++++    EEK  K+++R  LQ E   +E+   K ++  E + +
Sbjct: 77  NKDSIMRNKRRLTREQEELEQALEEEKEMKEEKRLHLQKE--EQEQKMAKEKDKQEIIDE 134

Query: 260 LEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQT 300
           LE      +  + + ++  ++LE Q+++L+  K    +   
Sbjct: 135 LETSNLPANVIIAQHKKQSKQLESQVEKLERKKRVTFSTGI 175


>gnl|CDD|220441 pfam09849, DUF2076, Uncharacterized protein conserved in bacteria
           (DUF2076).  This domain, found in various hypothetical
           prokaryotic proteins, has no known function. The domain,
           however, is found in various periplasmic ligand-binding
           sensor proteins.
          Length = 234

 Score = 27.7 bits (62), Expect = 7.1
 Identities = 21/61 (34%), Positives = 23/61 (37%), Gaps = 23/61 (37%)

Query: 255 ERLKQLEVEMAKRDQDLMEAQEMIRR------------------LEEQLKQLQAAKEELE 296
            RLKQ   E A RD    EA+ +I                     E  LKQ  A  EELE
Sbjct: 14  SRLKQ--AEGAPRDA---EAEALIAEALRRQPDAPYYLVQTILVQEAALKQANARIEELE 68

Query: 297 A 297
           A
Sbjct: 69  A 69


>gnl|CDD|206361 pfam14193, DUF4315, Domain of unknown function (DUF4315).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are approximately 90 amino acids in length.
          Length = 81

 Score = 26.1 bits (58), Expect = 7.1
 Identities = 18/68 (26%), Positives = 38/68 (55%), Gaps = 5/68 (7%)

Query: 234 DKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKE 293
           +K++ EI   EK ++K  E   RLK+L  E  K + + +E  +++R ++   ++L A  +
Sbjct: 4   EKIEAEI---EKTKEKIAEQQARLKEL--EKQKTELENLEIVQIVRSMDMTPEELAAFLK 58

Query: 294 ELEARQTE 301
            +++ Q  
Sbjct: 59  AMKSGQAA 66


>gnl|CDD|225871 COG3334, COG3334, Uncharacterized conserved protein [Function
           unknown].
          Length = 192

 Score = 27.4 bits (61), Expect = 7.2
 Identities = 9/47 (19%), Positives = 20/47 (42%)

Query: 252 ESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEAR 298
                      ++    ++L+E  + +  + E+LK L+  K EL+  
Sbjct: 60  FCANIADAAADQLYALQKELLEKLKDLAEVNERLKALEKKKAELKDL 106


>gnl|CDD|204614 pfam11221, Med21, Subunit 21 of Mediator complex.  Med21 has been
           known as Srb7 in yeasts, hSrb7 in humans and Trap 19 in
           Drosophila. The heterodimer of the two subunits Med7 and
           Med21 appears to act as a hinge between the middle and
           the tail regions of Mediator.
          Length = 132

 Score = 26.8 bits (60), Expect = 7.4
 Identities = 13/46 (28%), Positives = 28/46 (60%), Gaps = 3/46 (6%)

Query: 251 QESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELE 296
           +E + R+K+LE E+ + + +  EA   ++  E+ LK+++A   E+ 
Sbjct: 88  EEQLRRIKELEEELREVEAEREEA---VKEKEKLLKKVEALLREIA 130


>gnl|CDD|183703 PRK12724, PRK12724, flagellar biosynthesis regulator FlhF;
           Provisional.
          Length = 432

 Score = 28.0 bits (62), Expect = 7.4
 Identities = 21/61 (34%), Positives = 34/61 (55%), Gaps = 4/61 (6%)

Query: 241 AAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQT 300
           A+REK E+K Q+  E LKQ      +R + L   + +  RLEE+    ++A  E+E+ + 
Sbjct: 61  ASREKIERKLQDLKELLKQKSYTEPERKKTLQTLKPLSERLEEK----ESAIYEIESFEL 116

Query: 301 E 301
           E
Sbjct: 117 E 117


>gnl|CDD|220494 pfam09969, DUF2203, Uncharacterized conserved protein (DUF2203).
           This domain, found in various hypothetical bacterial
           proteins, has no known function.
          Length = 120

 Score = 26.9 bits (60), Expect = 7.5
 Identities = 17/51 (33%), Positives = 29/51 (56%), Gaps = 3/51 (5%)

Query: 270 DLMEAQEM---IRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNLNK 317
            L EA E+   +RRL E++++ +A  EELEA  +E +  L+    +  L +
Sbjct: 4   TLEEANELLPELRRLLEEVREAKAELEELEAALSEGERSLEARGLKEELRE 54


>gnl|CDD|132697 TIGR03658, IsdH_HarA, haptoglobin-binding heme uptake protein HarA.
            HarA is a heme-binding NEAT-domain (NEAr Transporter,
           pfam05031) protein which has been shown to bind to the
           haptoglobin-hemoglobin complex in order to extract heme
           from it. HarA has also been reported to bind hemoglobin
           directly. HarA (also known as IsdH) contains three NEAT
           domains as well as a sortase A C-terminal signal for
           localization to the cell wall. The heme bound at the
           third of these NEAT domains has been shown to be
           transferred to the IsdA protein also localized at the
           cell wall, presumably through an additional specific
           protein-protein interaction. Haptoglobin is a hemoglobin
           carrier protein involved in scavenging hemoglobin in the
           blood following red blood cell lysis and targetting it
           to the liver.
          Length = 895

 Score = 28.3 bits (62), Expect = 7.5
 Identities = 23/83 (27%), Positives = 44/83 (53%), Gaps = 7/83 (8%)

Query: 211 DTIDVQQMKAQAREEKNAKQQ--QRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRD 268
           +T ++Q++ A   + K  ++Q  + +KLQ ++  + KAE K     ++L Q  VE+A + 
Sbjct: 473 ETYNLQKLLAPYHKAKTLERQVYELEKLQEKLPEKYKAEYK-----KKLDQTRVELADQV 527

Query: 269 QDLMEAQEMIRRLEEQLKQLQAA 291
           +  +   E +    +QL  LQ A
Sbjct: 528 KSAVTEFENVTPTNDQLTDLQEA 550


>gnl|CDD|223294 COG0216, PrfA, Protein chain release factor A [Translation,
           ribosomal structure and biogenesis].
          Length = 363

 Score = 27.9 bits (63), Expect = 7.5
 Identities = 31/109 (28%), Positives = 54/109 (49%), Gaps = 21/109 (19%)

Query: 201 HELYMRRRKPDTI-DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQ 259
            EL      P+ I D  + +  ++E          +L+  I  + +  KK QE +E  K+
Sbjct: 17  EELEALLSDPEVISDPDEYRKLSKEYA--------ELE-PIVEKYREYKKAQEDLEDAKE 67

Query: 260 LEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQP 308
           +  E  ++D ++   +EM    EE++K+L+A  EELE    EL++ L P
Sbjct: 68  MLAE--EKDPEM---REMA---EEEIKELEAKIEELE---EELKILLLP 105


>gnl|CDD|233242 TIGR01035, hemA, glutamyl-tRNA reductase.  This enzyme, together
           with glutamate-1-semialdehyde-2,1-aminomutase
           (TIGR00713), leads to the production of
           delta-amino-levulinic acid from Glu-tRNA [Biosynthesis
           of cofactors, prosthetic groups, and carriers, Heme,
           porphyrin, and cobalamin].
          Length = 417

 Score = 28.1 bits (63), Expect = 7.6
 Identities = 15/79 (18%), Positives = 32/79 (40%), Gaps = 5/79 (6%)

Query: 234 DKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQL-----KQL 288
           D LQ  +       ++  E  E + + E    K+    +E +  I+ L         K+L
Sbjct: 298 DDLQPVVEENLAERREEAEKAEEIVEEETAEFKQWLRSLEVEPTIKALRSLAEIVREKEL 357

Query: 289 QAAKEELEARQTELQLELQ 307
           + A ++L     +++  L+
Sbjct: 358 EKALKKLPGLSKDVEEVLE 376


>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1.  All
           proteins in this family for which functions are known
           are cyclin dependent protein kinases that are components
           of TFIIH, a complex that is involved in nucleotide
           excision repair and transcription initiation. Also known
           as MAT1 (menage a trois 1). This family is based on the
           phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
           Stanford University) [DNA metabolism, DNA replication,
           recombination, and repair].
          Length = 309

 Score = 27.8 bits (62), Expect = 7.7
 Identities = 19/103 (18%), Positives = 47/103 (45%), Gaps = 2/103 (1%)

Query: 190 NKRILALCMGNHELYMRRRKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKK 249
            K+I      N ++  + ++  T + ++++     EK  ++Q+R  L L+    E+   K
Sbjct: 119 KKKIETYQKENKDVIQKNKEKSTREQEELEEALEFEKEEEEQRR--LLLQKEEEEQQMNK 176

Query: 250 HQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAK 292
            +     L +LE       + + + ++   +LE Q+++ +  K
Sbjct: 177 RKNKQALLDELETSTLPAAELIAQHKKNSVKLEMQVEKPKPEK 219


>gnl|CDD|215487 PLN02900, PLN02900, alanyl-tRNA synthetase.
          Length = 936

 Score = 28.1 bits (63), Expect = 7.8
 Identities = 16/75 (21%), Positives = 34/75 (45%), Gaps = 2/75 (2%)

Query: 231 QQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQA 290
              D L+ E+ +  K E    E   ++  L+  +        + +E+  R+    K+L+A
Sbjct: 744 NAADSLERELDSALKVEGSDLEK--KVASLKSRVDAAVIPAAKKEEIRARVSALQKELRA 801

Query: 291 AKEELEARQTELQLE 305
           A++E  A + +L + 
Sbjct: 802 AQKEAAALRAKLAVA 816


>gnl|CDD|222742 pfam14415, DUF4424, Domain of unknown function (DUF4424).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 310 and 361 amino
           acids in length.
          Length = 248

 Score = 27.8 bits (62), Expect = 7.9
 Identities = 9/24 (37%), Positives = 13/24 (54%), Gaps = 1/24 (4%)

Query: 89  MSQDVKKENPLQFKFRAK-FYPED 111
              ++KK NP  F+F  K F P+ 
Sbjct: 224 FDGNLKKINPTTFEFTKKNFTPKR 247


>gnl|CDD|218152 pfam04568, IATP, Mitochondrial ATPase inhibitor, IATP.  ATP
           synthase inhibitor prevents the enzyme from switching to
           ATP hydrolysis during collapse of the electrochemical
           gradient, for example during oxygen deprivation ATP
           synthase inhibitor forms a one to one complex with the
           F1 ATPase, possibly by binding at the alpha-beta
           interface. It is thought to inhibit ATP synthesis by
           preventing the release of ATP. The minimum inhibitory
           region for bovine inhibitor is from residues 39 to 72.
           The inhibitor has two oligomeric states, dimer (the
           active state) and tetramer. At low pH, the inhibitor
           forms a dimer via antiparallel coiled coil interactions
           between the C terminal regions of two monomers. At high
           pH, the inhibitor forms tetramers and higher oligomers
           by coiled coil interactions involving the N terminus and
           inhibitory region, thus preventing the inhibitory
           activity.
          Length = 90

 Score = 26.2 bits (58), Expect = 7.9
 Identities = 12/41 (29%), Positives = 26/41 (63%), Gaps = 4/41 (9%)

Query: 217 QMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERL 257
             +  A+EE+  +Q+++++L    A +EK E +H++ +E L
Sbjct: 49  GKREAAQEEEYFRQKEKEQLA---ALKEKLE-EHKKEIEEL 85


>gnl|CDD|218186 pfam04636, PA26, PA26 p53-induced protein (sestrin).  PA26 is a
           p53-inducible protein. Its function is unknown. It has
           similarity to pfam04636 in its N-terminus.
          Length = 450

 Score = 27.8 bits (62), Expect = 8.0
 Identities = 12/50 (24%), Positives = 26/50 (52%), Gaps = 4/50 (8%)

Query: 266 KRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQPIRNQSNL 315
           K  +   E + ++    E++K+LQ  +++ EA Q E+    +  + +S L
Sbjct: 221 KVQRPETEVEALM----ERMKKLQEERDDDEASQEEMTTRFEKEKKESLL 266


>gnl|CDD|236861 PRK11147, PRK11147, ABC transporter ATPase component; Reviewed.
          Length = 635

 Score = 28.0 bits (63), Expect = 8.2
 Identities = 19/96 (19%), Positives = 42/96 (43%), Gaps = 12/96 (12%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAA-REKAEKKHQESVER-LKQLEVEMAKRDQDL 271
           D +Q +AQ    K    +++++     A   +++ KK    ++R L+QL   +   + ++
Sbjct: 525 DARQQQAQYLALKQPAVKKKEEAAAPKAETVKRSSKKLSYKLQRELEQLPQLLEDLEAEI 584

Query: 272 MEAQEMI----------RRLEEQLKQLQAAKEELEA 297
              Q  +           + ++ L  L  A++ELE 
Sbjct: 585 EALQAQVADADFFSQPHEQTQKVLADLADAEQELEV 620


>gnl|CDD|148617 pfam07111, HCR, Alpha helical coiled-coil rod protein (HCR).  This
           family consists of several mammalian alpha helical
           coiled-coil rod HCR proteins. The function of HCR is
           unknown but it has been implicated in psoriasis in
           humans and is thought to affect keratinocyte
           proliferation.
          Length = 739

 Score = 28.1 bits (62), Expect = 8.3
 Identities = 24/86 (27%), Positives = 39/86 (45%), Gaps = 7/86 (8%)

Query: 219 KAQAREEKNAKQQQRDKLQLEIAAREKA----EKKHQESVERLKQLEVEMAKRDQDLMEA 274
           +   +E + A++  +++L     A  +A      K +   + L+ LE   A   + L  A
Sbjct: 136 EGGQQELEEAQRLHQEQLSSLTQAHLEALSSLRSKAEGLEKSLQSLETRRAGEAKALAAA 195

Query: 275 QEMIRRLEEQLKQLQAAKEELEARQT 300
           Q     L EQL + Q   EELEA+ T
Sbjct: 196 QAEADTLREQLSKTQ---EELEAQVT 218


>gnl|CDD|227608 COG5283, COG5283, Phage-related tail protein [Function unknown].
          Length = 1213

 Score = 28.0 bits (62), Expect = 8.7
 Identities = 18/93 (19%), Positives = 40/93 (43%), Gaps = 4/93 (4%)

Query: 215 VQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEA 274
            + ++ Q +  K+     + K +    A EK +K +++  + +K++        +   E 
Sbjct: 38  WKMLEKQQKLTKDGLSASKGKYEGLSEAMEKQKKAYEDLKQEVKEVNRATQASKKAYQEY 97

Query: 275 QEMIRRLEEQLK----QLQAAKEELEARQTELQ 303
                + E +L+    Q   A E+L  +Q E+Q
Sbjct: 98  NAQYTQAENKLRSLSGQFGVASEQLMLQQKEIQ 130


>gnl|CDD|236729 PRK10636, PRK10636, putative ABC transporter ATP-binding protein;
           Provisional.
          Length = 638

 Score = 27.8 bits (62), Expect = 8.8
 Identities = 17/68 (25%), Positives = 35/68 (51%), Gaps = 4/68 (5%)

Query: 231 QQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQA 290
            Q+ + Q + A +E      Q   ++ K+ E E+  + Q L   ++ I RLE+++++L A
Sbjct: 522 VQKQENQTDEAPKENNANSAQARKDQ-KRREAELRTQTQPL---RKEIARLEKEMEKLNA 577

Query: 291 AKEELEAR 298
              + E +
Sbjct: 578 QLAQAEEK 585


>gnl|CDD|218684 pfam05672, MAP7, MAP7 (E-MAP-115) family.  The organisation of
           microtubules varies with the cell type and is presumably
           controlled by tissue-specific microtubule-associated
           proteins (MAPs). The 115-kDa epithelial MAP
           (E-MAP-115/MAP7) has been identified as a
           microtubule-stabilising protein predominantly expressed
           in cell lines of epithelial origin. The binding of this
           microtubule associated protein is nucleotide
           independent.
          Length = 171

 Score = 27.4 bits (60), Expect = 8.8
 Identities = 16/76 (21%), Positives = 37/76 (48%)

Query: 243 REKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTEL 302
           R+  E++ QE  ER +Q E +  +R++    A E   R EE+ ++ +  +   +  + + 
Sbjct: 33  RQAREQREQEEQERREQEEQDRLEREELKRRAAEERLRREEEARRQEEERAREKEEKAKR 92

Query: 303 QLELQPIRNQSNLNKV 318
           + E +  + Q    ++
Sbjct: 93  KAEEEEKQEQEEQERI 108


>gnl|CDD|184056 PRK13446, atpC, F0F1 ATP synthase subunit epsilon; Provisional.
          Length = 136

 Score = 26.8 bits (60), Expect = 8.9
 Identities = 9/39 (23%), Positives = 23/39 (58%), Gaps = 2/39 (5%)

Query: 252 ESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQA 290
           E+ ER ++++VE A+   +   A++ +++L  +    +A
Sbjct: 81  ETAERAEEIDVERARAALE--RAEQRLKKLTPEDDSARA 117


>gnl|CDD|235672 PRK06012, flhA, flagellar biosynthesis protein FlhA; Validated.
          Length = 697

 Score = 27.8 bits (63), Expect = 9.0
 Identities = 13/64 (20%), Positives = 29/64 (45%), Gaps = 12/64 (18%)

Query: 229 KQQQRDKLQLEIAAREKAEKKHQESVERLKQ---LEVEMAKR---------DQDLMEAQE 276
           ++++  +L  E A  E+A +  +ES + +     LE+E+              +L++   
Sbjct: 329 REKKAAELAAEEAEEEEAAEPEEESWDDVLPVDPLELEVGYGLIPLVDENQGGELLDRIR 388

Query: 277 MIRR 280
            IR+
Sbjct: 389 SIRK 392


>gnl|CDD|215974 pfam00529, HlyD, HlyD family secretion protein. 
          Length = 304

 Score = 27.5 bits (61), Expect = 9.2
 Identities = 20/94 (21%), Positives = 29/94 (30%), Gaps = 9/94 (9%)

Query: 223 REEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEA-----QEM 277
           R +    Q      Q   AAR    +    S     + ++E A+ +     A        
Sbjct: 67  RLQALVAQAAIQSEQEADAARAALREALAASRGAASKAQLEAARDNLRSALALLPISLGR 126

Query: 278 IRR----LEEQLKQLQAAKEELEARQTELQLELQ 307
           I      L   L  LQA      A+  E  L+L 
Sbjct: 127 IGASQATLGALLNNLQAQGLAARAQLDEAYLQLA 160


>gnl|CDD|202045 pfam01920, Prefoldin_2, Prefoldin subunit.  This family includes
           prefoldin subunits that are not detected by pfam02996.
          Length = 106

 Score = 26.4 bits (59), Expect = 9.2
 Identities = 24/90 (26%), Positives = 44/90 (48%), Gaps = 12/90 (13%)

Query: 230 QQQRDKLQLEIAAREKAEKKHQESVERLKQLEVE----------MAKRDQDLMEAQ--EM 277
           QQQ   L  +I   E   K+ +  +E L+ L+ +          + K+D++ ++ +  E 
Sbjct: 11  QQQLQLLAQQIKQLETQLKEIELVLEELELLDEDTKVYKLIGDVLVKQDKEEVKEELEER 70

Query: 278 IRRLEEQLKQLQAAKEELEARQTELQLELQ 307
              LE+++K L+   E+LE    EL+ EL 
Sbjct: 71  KETLEKEIKTLEKQLEKLEKELEELKEELY 100


>gnl|CDD|148750 pfam07321, YscO, Type III secretion protein YscO.  This family
           contains the bacterial type III secretion protein YscO,
           which is approximately 150 residues long. YscO has been
           shown to be required for high-level expression and
           secretion of the anti-host proteins V antigen and Yops
           in Yersinia pestis.
          Length = 152

 Score = 27.0 bits (60), Expect = 9.3
 Identities = 10/54 (18%), Positives = 28/54 (51%)

Query: 244 EKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEA 297
            +A+++ +   +RL+Q   ++ +  +   +  E+ R+ + + +  +   EELE 
Sbjct: 91  AEAKQRLEAERQRLRQARQQLQEARKAQEKFAELARQEQAEAQAQRQYLEELEQ 144


>gnl|CDD|184048 PRK13428, PRK13428, F0F1 ATP synthase subunit delta; Provisional.
          Length = 445

 Score = 27.8 bits (62), Expect = 9.4
 Identities = 18/74 (24%), Positives = 31/74 (41%), Gaps = 7/74 (9%)

Query: 214 DVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQE--SVERLKQLEVEMAKRDQDL 271
           +  Q   +A E+  A+  +  +   E A R   + + Q     ER+K        R   L
Sbjct: 53  EADQAHTKAVEDAKAEAARVVEEAREDAERIAEQLRAQADAEAERIK----VQGARQVQL 108

Query: 272 MEAQEMIRRLEEQL 285
           + AQ + R+L  +L
Sbjct: 109 LRAQ-LTRQLRLEL 121


>gnl|CDD|213755 TIGR02977, phageshock_pspA, phage shock protein A.  Members of this
           family are the phage shock protein PspA, from the phage
           shock operon. This is a narrower family than the set of
           PspA and its homologs, sometimes several in a genome, as
           described by pfam04012. PspA appears to maintain the
           protonmotive force under stress conditions that include
           overexpression of certain phage secretins, heat shock,
           ethanol, and protein export defects [Cellular processes,
           Adaptations to atypical conditions].
          Length = 219

 Score = 27.2 bits (61), Expect = 9.6
 Identities = 27/100 (27%), Positives = 44/100 (44%), Gaps = 14/100 (14%)

Query: 208 RKPDTIDVQQMKAQAREEKNAKQQQRDKLQLEIAAREKAEKKHQESVERLKQLEVEMAKR 267
           R+   ++ Q    Q + E  A  + R+ L     AR    +K +   E  + LE E+A  
Sbjct: 59  RRVSRLEAQVADWQDKAEL-ALSKGREDL-----ARAALIEKQKAQ-ELAEALERELA-- 109

Query: 268 DQDLMEAQEMIRRLEEQLKQLQAAKEELEARQTELQLELQ 307
                  +E + +L+E + +LQA   E  ARQ  L +  Q
Sbjct: 110 -----AVEETLAKLQEDIAKLQAKLAEARARQKALAIRHQ 144


>gnl|CDD|223092 COG0013, AlaS, Alanyl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 879

 Score = 27.6 bits (62), Expect = 9.8
 Identities = 15/62 (24%), Positives = 32/62 (51%)

Query: 238 LEIAAREKAEKKHQESVERLKQLEVEMAKRDQDLMEAQEMIRRLEEQLKQLQAAKEELEA 297
           +E    E A +  QE  + LK++   +    + L + + ++  L+E  K+L+  K++L A
Sbjct: 698 IEAVTGEAALEYLQEQEDLLKEIASLLKVPPELLPKVERLLEELKELEKELERLKKKLAA 757

Query: 298 RQ 299
            +
Sbjct: 758 AE 759


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.317    0.133    0.377 

Gapped
Lambda     K      H
   0.267   0.0677    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 16,440,727
Number of extensions: 1650056
Number of successful extensions: 7147
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5573
Number of HSP's successfully gapped: 1423
Length of query: 318
Length of database: 10,937,602
Length adjustment: 97
Effective length of query: 221
Effective length of database: 6,635,264
Effective search space: 1466393344
Effective search space used: 1466393344
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 59 (26.6 bits)