RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy2823
         (122 letters)



>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 38.1 bits (89), Expect = 5e-04
 Identities = 24/96 (25%), Positives = 46/96 (47%), Gaps = 3/96 (3%)

Query: 30  KEELERSQATLATL---STSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKT 86
           K EL   +  L  L       EL+ LQE+  +   +L    AEL+ ++   E +R E   
Sbjct: 219 KAELRELELALLVLRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSE 278

Query: 87  MQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERS 122
           ++E++E+ Q E+Y L  ++   + + + ++E     
Sbjct: 279 LEEEIEELQKELYALANEISRLEQQKQILRERLANL 314



 Score = 34.6 bits (80), Expect = 0.007
 Identities = 19/111 (17%), Positives = 51/111 (45%), Gaps = 2/111 (1%)

Query: 11  TLSRLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAEL 70
            L  L          ++  +E++E     + +L+  AE++ L+E  ++  ++L     E 
Sbjct: 825 RLESLERRIAATERRLEDLEEQIEELSEDIESLA--AEIEELEELIEELESELEALLNER 882

Query: 71  RVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYER 121
             ++     +RSE + + E++ + + +   L+ +LE  + ++  ++   E 
Sbjct: 883 ASLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEG 933



 Score = 34.3 bits (79), Expect = 0.009
 Identities = 25/122 (20%), Positives = 54/122 (44%), Gaps = 12/122 (9%)

Query: 12  LSRLLSERDKAVLEMDKSKEELERSQATLATL-----STSAELDRLQEKYDKTCA----- 61
           L  L  E  +A  E+++   EL+  +  L  L         E++ LQ++           
Sbjct: 241 LEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRL 300

Query: 62  --DLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEY 119
               +  +  L  ++   E + ++ + ++ K+++   E+  L+ KLE  + E+ES++ E 
Sbjct: 301 EQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAEL 360

Query: 120 ER 121
           E 
Sbjct: 361 EE 362



 Score = 31.2 bits (71), Expect = 0.13
 Identities = 26/110 (23%), Positives = 57/110 (51%), Gaps = 2/110 (1%)

Query: 11  TLSRLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAEL 70
            L  L ++ ++   ++D+  EEL   +  L  L    EL+ L+ + ++  A+L   ++ L
Sbjct: 317 QLEELEAQLEELESKLDELAEELAELEEKLEELK--EELESLEAELEELEAELEELESRL 374

Query: 71  RVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYE 120
             ++   E +RS+   ++ ++     E+ RL+A+LE  +   E +++E E
Sbjct: 375 EELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIE 424



 Score = 28.9 bits (65), Expect = 0.75
 Identities = 14/87 (16%), Positives = 43/87 (49%)

Query: 35  RSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKS 94
            ++   + L    E++ L+EK ++    +   +  L  ++ + E +  E + +++++E+ 
Sbjct: 665 SAKTNSSILERRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEEL 724

Query: 95  QGEVYRLKAKLENTQGEMESMKEEYER 121
             ++  L+  L   + E+E ++E   +
Sbjct: 725 SRQISALRKDLARLEAEVEQLEERIAQ 751



 Score = 27.7 bits (62), Expect = 1.5
 Identities = 21/117 (17%), Positives = 50/117 (42%), Gaps = 12/117 (10%)

Query: 11  TLSRLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAEL 70
            L  L  + ++   +++    E+E  +  +  L   +EL+ L  +       L   ++EL
Sbjct: 839 RLEDLEEQIEELSEDIESLAAEIEELEELIEELE--SELEALLNERASLEEALALLRSEL 896

Query: 71  RVVQADNERVRSEEKTM-------QEKVEKSQGEVYRLKAKLENTQGEMESMKEEYE 120
             +  +   + S+   +       +EK+ + +  +  L+ +++N Q   E + EEY 
Sbjct: 897 EELSEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNLQ---ERLSEEYS 950


>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
            This is a family of proteins that form part of the CAZ
           (cytomatrix at the active zone) complex which is
           involved in determining the site of synaptic vesicle
           fusion. The C-terminus is a PDZ-binding motif that binds
           directly to RIM (a small G protein Rab-3A effector). The
           family also contains four coiled-coil domains.
          Length = 774

 Score = 36.9 bits (85), Expect = 0.001
 Identities = 28/105 (26%), Positives = 50/105 (47%), Gaps = 2/105 (1%)

Query: 10  GTLSRLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAE 69
             L     E +    E+++ + +L  S  ++ T   S EL R +       A +   + +
Sbjct: 3   AQLENAQRENELLKKELERKQSKLGSSMNSIKTF-WSPELKRERALRKDEAARITVWKEQ 61

Query: 70  LRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMES 114
           LRV Q +N+  +   + +QE++ KSQ E+ RL+ KLE +  E   
Sbjct: 62  LRVTQEENQHAQLTVQALQEEL-KSQRELNRLQQKLEKSGNETGE 105



 Score = 27.3 bits (60), Expect = 2.3
 Identities = 27/103 (26%), Positives = 50/103 (48%), Gaps = 2/103 (1%)

Query: 20  DKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNER 79
           D+A  E+ + K EL   Q  L TL+       +++  DK   DL RA+ E  ++Q + + 
Sbjct: 290 DRAAQELSRKKTELLGLQTELETLANQD--SDMRQHLDKLKEDLTRAEQEKAILQTEVDA 347

Query: 80  VRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERS 122
           +R E +     + K    +   + +     GE+E M++ YE++
Sbjct: 348 LRYELERKHNTLTKKTASLQAAQEEKATYAGEIEDMRDRYEKT 390


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 35.0 bits (81), Expect = 0.006
 Identities = 25/104 (24%), Positives = 50/104 (48%), Gaps = 9/104 (8%)

Query: 18  ERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADN 77
           E      +++K K E+   +          ELDRLQE+  +   +L    A +  ++A  
Sbjct: 386 ELKDYREKLEKLKREINELKR---------ELDRLQEELQRLSEELADLNAAIAGIEAKI 436

Query: 78  ERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYER 121
             +  E++    +++K + ++ +L A L   + E+  +KEEY+R
Sbjct: 437 NELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDR 480



 Score = 32.3 bits (74), Expect = 0.040
 Identities = 24/114 (21%), Positives = 62/114 (54%), Gaps = 6/114 (5%)

Query: 8   PPGTLSRLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQ 67
           P G +    SE  + +  + +  E L+R  ++L      +EL R++ + D+   +L  A 
Sbjct: 662 PRGGILFSRSEPAE-LQRLRERLEGLKRELSSLQ-----SELRRIENRLDELSQELSDAS 715

Query: 68  AELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYER 121
            ++  ++ + E++  EE+ ++E++E+ + ++  L+ ++EN + E++ ++   E 
Sbjct: 716 RKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEE 769



 Score = 31.2 bits (71), Expect = 0.11
 Identities = 32/112 (28%), Positives = 57/112 (50%), Gaps = 9/112 (8%)

Query: 10  GTLSRLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAE 69
             L++L +E DK + E+++ + E+E  +            D+L E+Y +   +L   +AE
Sbjct: 322 ERLAKLEAEIDKLLAEIEELEREIEEERK---------RRDKLTEEYAELKEELEDLRAE 372

Query: 70  LRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYER 121
           L  V  +    R E K  +EK+EK + E+  LK +L+  Q E++ + EE   
Sbjct: 373 LEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELAD 424



 Score = 30.8 bits (70), Expect = 0.15
 Identities = 23/110 (20%), Positives = 46/110 (41%), Gaps = 2/110 (1%)

Query: 12  LSRLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELR 71
           L     + +K   E+++ K EL+R Q  L  LS   EL  L        A +   + E  
Sbjct: 387 LKDYREKLEKLKREINELKRELDRLQEELQRLS--EELADLNAAIAGIEAKINELEEEKE 444

Query: 72  VVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYER 121
               + ++   + + +   + K + E+Y LK + +  + E+  ++ E   
Sbjct: 445 DKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKELSKLQRELAE 494



 Score = 27.7 bits (62), Expect = 1.6
 Identities = 21/96 (21%), Positives = 53/96 (55%), Gaps = 2/96 (2%)

Query: 12  LSRLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELR 71
           LS L SE  +    +D+  +EL  +   +  +    E+++L+++ +K    L   + +L 
Sbjct: 690 LSSLQSELRRIENRLDELSQELSDASRKIGEIE--KEIEQLEQEEEKLKERLEELEEDLS 747

Query: 72  VVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLEN 107
            ++ + E V+SE K ++ ++E+ + ++++L+  L +
Sbjct: 748 SLEQEIENVKSELKELEARIEELEEDLHKLEEALND 783



 Score = 27.3 bits (61), Expect = 2.4
 Identities = 12/98 (12%), Positives = 42/98 (42%)

Query: 24  LEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSE 83
           +E   ++  LE+            +   L+E+      ++     +   ++ + E + + 
Sbjct: 817 IEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAA 876

Query: 84  EKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYER 121
            + ++ ++   + E   L+A+L   + ++E ++ + E+
Sbjct: 877 LRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEK 914



 Score = 25.4 bits (56), Expect = 9.4
 Identities = 22/118 (18%), Positives = 55/118 (46%), Gaps = 9/118 (7%)

Query: 11  TLSRLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAEL 70
            +  L +E       + + + ELE ++  LA L   AE+D+L  + ++   ++   +   
Sbjct: 295 KIGELEAEIASLERSIAEKERELEDAEERLAKLE--AEIDKLLAEIEELEREIEEERKRR 352

Query: 71  RVVQADNERVRSEEKTMQEKVEK-------SQGEVYRLKAKLENTQGEMESMKEEYER 121
             +  +   ++ E + ++ ++E+       ++ E+   + KLE  + E+  +K E +R
Sbjct: 353 DKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDR 410


>gnl|CDD|153339 cd07655, F-BAR_PACSIN, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Protein kinase C and
           Casein kinase Substrate in Neurons (PACSIN) proteins.
           F-BAR domains are dimerization modules that bind and
           bend membranes and are found in proteins involved in
           membrane dynamics and actin reorganization. Protein
           kinase C and Casein kinase Substrate in Neurons (PACSIN)
           proteins, also called Synaptic dynamin-associated
           proteins (Syndapins), act as regulators of cytoskeletal
           and membrane dynamics. They bind both dynamin and
           Wiskott-Aldrich syndrome protein (WASP), and may provide
           direct links between the actin cytoskeletal machinery
           through WASP and dynamin-dependent endocytosis.
           Vetebrates harbor three isoforms with distinct
           expression patterns and specific functions. PACSINs
           contain an N-terminal F-BAR domain and a C-terminal SH3
           domain. F-BAR domains form banana-shaped dimers with a
           positively-charged concave surface that binds to
           negatively-charged lipid membranes. They can induce
           membrane deformation in the form of long tubules.
          Length = 258

 Score = 33.1 bits (76), Expect = 0.018
 Identities = 19/84 (22%), Positives = 40/84 (47%), Gaps = 8/84 (9%)

Query: 47  AELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLE 106
            ++++ ++ Y   C   + AQ +    ++D      + K +Q+KVEK + EV + K K E
Sbjct: 133 KKVEKAKKAYHAACKAEKSAQKQENNAKSDTSLSPDQVKKLQDKVEKCKQEVSKTKDKYE 192

Query: 107 NTQGE--------MESMKEEYERS 122
               +        ME M++ +++ 
Sbjct: 193 KALEDLNKYNPRYMEDMEQVFDKC 216


>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
           potential nuclease [General function prediction only].
          Length = 290

 Score = 33.0 bits (75), Expect = 0.027
 Identities = 21/71 (29%), Positives = 40/71 (56%)

Query: 48  ELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLEN 107
           +L+ LQ++ ++   +L   +AE   VQ   +R+  E   ++E ++K  GEVY LK + + 
Sbjct: 143 KLEELQKEKEELLKELEELEAEYEEVQERLKRLEVENSRLEEMLKKLPGEVYDLKKRWDE 202

Query: 108 TQGEMESMKEE 118
            +  +E  +EE
Sbjct: 203 LEPGVELPEEE 213


>gnl|CDD|233547 TIGR01730, RND_mfp, RND family efflux transporter, MFP subunit.
           This model represents the MFP (membrane fusion protein)
           component of the RND family of transporters. RND refers
           to Resistance, Nodulation, and cell Division. It is, in
           part, a subfamily of pfam00529 (Pfam release 7.5) but
           hits substantial numbers of proteins missed by that
           model. The related HlyD secretion protein, for which
           pfam00529 is named, is outside the scope of this model.
           Attributed functions imply outward transport. These
           functions include nodulation, acriflavin resistance,
           heavy metal efflux, and multidrug resistance proteins.
           Most members of this family are found in Gram-negative
           bacteria. The proposed function of MFP proteins is to
           bring the inner and outer membranes together and enable
           transport to the outside of the outer membrane. Note,
           however, that a few members of this family are found in
           Gram-positive bacteria, where there is no outer membrane
           [Transport and binding proteins, Unknown substrate].
          Length = 322

 Score = 32.7 bits (75), Expect = 0.027
 Identities = 19/79 (24%), Positives = 37/79 (46%), Gaps = 1/79 (1%)

Query: 18  ERDKAVLEMDKSKEELERSQATLATLSTSAE-LDRLQEKYDKTCADLRRAQAELRVVQAD 76
           + D   L +  +  +L  ++A L     S E  +RL ++   + ADL  A+A +   QAD
Sbjct: 58  DDDDYQLALQAALAQLAAAEAQLELAQRSFERAERLVKRNAVSQADLDDAKAAVEAAQAD 117

Query: 77  NERVRSEEKTMQEKVEKSQ 95
            E  ++   + Q  +  ++
Sbjct: 118 LEAAKASLASAQLNLRYTE 136



 Score = 29.6 bits (67), Expect = 0.31
 Identities = 18/67 (26%), Positives = 30/67 (44%), Gaps = 4/67 (5%)

Query: 50  DRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQ 109
           D  Q       A L  A+A+L + Q   ER       + ++   SQ ++   KA +E  Q
Sbjct: 60  DDYQLALQAALAQLAAAEAQLELAQRSFERAE----RLVKRNAVSQADLDDAKAAVEAAQ 115

Query: 110 GEMESMK 116
            ++E+ K
Sbjct: 116 ADLEAAK 122


>gnl|CDD|212132 cd11667, GH38N_Man2A2, N-terminal catalytic domain of Golgi
           alpha-mannosidase IIx, and similar proteins; glycoside
           hydrolase family 38 (GH38).  This subfamily is
           represented by human alpha-mannosidase 2x (MX, also
           known as mannosyl-oligosaccharide 1,3- 1,6-alpha
           mannosidase, EC 3.2.1.114, Man2A2). MX is enzymatically
           and functionally very similar to GMII (found  in another
           subfamily), and as an isoenzyme of GMII. It is thought
           to also function in the N-glycosylation pathway. MX
           specifically hydrolyzes the same oligosaccharide
           substrate as does MII. It specifically removes two
           mannosyl residues from GlcNAc(Man)5(GlcNAc)2 to yield
           GlcNAc(Man)3(GlcNAc)2(GlcNAc, N-acetylglucosmine).
          Length = 344

 Score = 31.1 bits (70), Expect = 0.095
 Identities = 15/49 (30%), Positives = 24/49 (48%)

Query: 6   VPPPGTLSRLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQE 54
           VPP       ++ER + +L+  + K +L RS+  L  L      D+ QE
Sbjct: 222 VPPRAITEANVAERAQLLLDQYRKKSKLYRSKVLLVPLGDDFRYDKPQE 270


>gnl|CDD|171499 PRK12438, PRK12438, hypothetical protein; Provisional.
          Length = 991

 Score = 30.6 bits (69), Expect = 0.17
 Identities = 14/51 (27%), Positives = 23/51 (45%), Gaps = 5/51 (9%)

Query: 7   PPPGTLSRLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYD 57
           PP G           AV E+ ++  +L  +Q +    +  A LDRL++  D
Sbjct: 939 PPRGP-----DVPPAAVAELRETLADLRSAQRSGDFTAYGAALDRLEKAID 984


>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
           partitioning / Cytoskeleton].
          Length = 373

 Score = 30.4 bits (69), Expect = 0.21
 Identities = 13/46 (28%), Positives = 26/46 (56%)

Query: 77  NERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERS 122
            E++R +EK ++E  +    E   L +KLE  Q ++E +++  E+ 
Sbjct: 323 TEKIREKEKRLEELEQNLIEERKELNSKLEEIQKKLEDLEKRLEKL 368


>gnl|CDD|223024 PHA03252, PHA03252, DNA packaging tegument protein UL25;
          Provisional.
          Length = 589

 Score = 30.5 bits (69), Expect = 0.21
 Identities = 21/60 (35%), Positives = 29/60 (48%), Gaps = 5/60 (8%)

Query: 5  LVPPPGTLSRLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLR 64
          L  P   L RL   RD + L + + +E+L R +     L    ELD LQ++    C DLR
Sbjct: 24 LRCPEEDLRRL---RDDSALRLRRYREDLLRDRLLRRRLGE--ELDDLQKRLQTECEDLR 78


>gnl|CDD|222629 pfam14257, DUF4349, Domain of unknown function (DUF4349).  This
           family of proteins is found in bacteria and archaea.
           Proteins in this family are typically between 282 and
           353 amino acids in length. There is a single completely
           conserved residue D that may be functionally important.
          Length = 262

 Score = 29.9 bits (68), Expect = 0.23
 Identities = 17/60 (28%), Positives = 30/60 (50%), Gaps = 4/60 (6%)

Query: 37  QATLATLSTSAELDRLQEKYDK--TCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKS 94
           +A L  L   A  +RL    ++  T  DL   + EL  VQA+ E + ++ + + ++V  S
Sbjct: 138 EARLKALR--ASEERLLALLERAGTVEDLLEVERELSRVQAEIESLEAQLRYLDDRVAYS 195


>gnl|CDD|224482 COG1566, EmrA, Multidrug resistance efflux pump [Defense
           mechanisms].
          Length = 352

 Score = 30.0 bits (68), Expect = 0.28
 Identities = 8/44 (18%), Positives = 20/44 (45%)

Query: 61  ADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAK 104
           A L +A+A L   +A    +R++  + Q  + +++ +       
Sbjct: 91  AALEQAEAALAAAEAQLRNLRAQLASAQALIAQAEAQDLDQAQN 134


>gnl|CDD|217011 pfam02388, FemAB, FemAB family.  The femAB operon codes for two
           nearly identical approximately 50-kDa proteins involved
           in the formation of the Staphylococcal pentaglycine
           interpeptide bridge in peptidoglycan. These proteins are
           also considered as a factor influencing the level of
           methicillin resistance.
          Length = 408

 Score = 29.5 bits (67), Expect = 0.36
 Identities = 22/86 (25%), Positives = 37/86 (43%), Gaps = 13/86 (15%)

Query: 34  ERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEK 93
           ++++  LA L+    LD LQEK +K   DL + + +L             EK    K + 
Sbjct: 229 DKAEFMLAYLNFKEYLDELQEKLEKLEKDLAKLEEKL-------------EKNPNSKKKN 275

Query: 94  SQGEVYRLKAKLENTQGEMESMKEEY 119
              E+ +  A LE    E + +  +Y
Sbjct: 276 KLAELEQQLASLEKRIDEAKELIAKY 301


>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein.  Members of this
           protein family have a signal peptide, a strongly
           conserved SH3 domain, a variable region, and then a
           C-terminal hydrophobic transmembrane alpha helix region.
          Length = 198

 Score = 29.2 bits (66), Expect = 0.37
 Identities = 27/111 (24%), Positives = 47/111 (42%), Gaps = 15/111 (13%)

Query: 12  LSRLLSERDKAVLEMDKSKEELERSQATLATLST-----SAELDRLQEKYDKTCADLRRA 66
           LSR LS+   A   + + ++EL   Q  LA L         E   L+++     A+L R 
Sbjct: 54  LSRYLSDTPSARERLPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERL 113

Query: 67  QAELRVVQ----------ADNERVRSEEKTMQEKVEKSQGEVYRLKAKLEN 107
           Q EL  ++           +N  +R E   ++++ E  + E  RL+   + 
Sbjct: 114 QKELARIKQLSANAIELDEENRELREELAELKQENEALEAENERLQENEQR 164



 Score = 28.8 bits (65), Expect = 0.49
 Identities = 14/59 (23%), Positives = 31/59 (52%)

Query: 63  LRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYER 121
           L   Q EL  +Q +   ++ +   +Q++ ++ + E+  L+A+LE  Q E+  +K+    
Sbjct: 68  LPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSAN 126



 Score = 28.8 bits (65), Expect = 0.58
 Identities = 22/77 (28%), Positives = 37/77 (48%), Gaps = 1/77 (1%)

Query: 10  GTLSRLLSERDKAVLEMDKSKEELERSQATLATL-STSAELDRLQEKYDKTCADLRRAQA 68
             L+ L  E  +   E+   + ELER Q  LA +   SA    L E+  +   +L   + 
Sbjct: 87  EQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIELDEENRELREELAELKQ 146

Query: 69  ELRVVQADNERVRSEEK 85
           E   ++A+NER++  E+
Sbjct: 147 ENEALEAENERLQENEQ 163



 Score = 25.7 bits (57), Expect = 6.4
 Identities = 18/86 (20%), Positives = 39/86 (45%), Gaps = 10/86 (11%)

Query: 46  SAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQG--------- 96
             EL  LQE+  +    L   Q E + ++ +   + +E + +Q+++ + +          
Sbjct: 72  QQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIELD 131

Query: 97  -EVYRLKAKLENTQGEMESMKEEYER 121
            E   L+ +L   + E E+++ E ER
Sbjct: 132 EENRELREELAELKQENEALEAENER 157


>gnl|CDD|224455 COG1538, TolC, Outer membrane protein [Cell envelope biogenesis,
           outer membrane / Intracellular trafficking and
           secretion].
          Length = 457

 Score = 29.4 bits (66), Expect = 0.49
 Identities = 16/113 (14%), Positives = 49/113 (43%), Gaps = 15/113 (13%)

Query: 13  SRLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELRV 72
            R+ +    A      ++ +LE ++ +LA    +A        Y     DL  AQ +L +
Sbjct: 125 GRVRANVRAAEAAAKAARAQLEAARLSLAAEVATA--------Y----FDLLAAQEQLAL 172

Query: 73  VQADNERVRSEEKTMQEKVEK---SQGEVYRLKAKLENTQGEMESMKEEYERS 122
            +        + +  +++ +    ++ +V + +A+L + + ++ + + +  ++
Sbjct: 173 AEETLAAAEEQLELAEKRYDAGLATRLDVLQAEAQLASARAQLAAAQAQLAQA 225


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 29.0 bits (66), Expect = 0.52
 Identities = 18/67 (26%), Positives = 33/67 (49%), Gaps = 9/67 (13%)

Query: 64  RRAQAELRVVQADNERVRSEEKTMQ---------EKVEKSQGEVYRLKAKLENTQGEMES 114
            R + E  + +  NE  + E++ +Q         E +EK + E+ + + +LE  Q E+E 
Sbjct: 69  LRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEELEKKEKELEQKQQELEK 128

Query: 115 MKEEYER 121
            +EE E 
Sbjct: 129 KEEELEE 135


>gnl|CDD|221415 pfam12083, DUF3560, Domain of unknown function (DUF3560).  This
           presumed domain is functionally uncharacterized. This
           domain is found in bacteria. This domain is about 120
           amino acids in length. This domain has a conserved GHHSE
           sequence motif.
          Length = 127

 Score = 28.4 bits (64), Expect = 0.53
 Identities = 14/52 (26%), Positives = 24/52 (46%)

Query: 20  DKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELR 71
            KAV   DK++   +R+ A         + D +  +  K  ADLR +Q  ++
Sbjct: 67  GKAVEAQDKAEYWEQRAAAAARGAIARDDPDAVLRRLKKLLADLRASQRRMK 118


>gnl|CDD|233973 TIGR02680, TIGR02680, TIGR02680 family protein.  Members of this
           protein family belong to a conserved gene four-gene
           neighborhood found sporadically in a phylogenetically
           broad range of bacteria: Nocardia farcinica,
           Symbiobacterium thermophilum, and Streptomyces
           avermitilis (Actinobacteria), Geobacillus kaustophilus
           (Firmicutes), Azoarcus sp. EbN1 and Ralstonia
           solanacearum (Betaproteobacteria). Proteins in this
           family average over 1400 amino acids in length
           [Hypothetical proteins, Conserved].
          Length = 1353

 Score = 29.0 bits (65), Expect = 0.57
 Identities = 16/85 (18%), Positives = 32/85 (37%), Gaps = 3/85 (3%)

Query: 37  QATLATLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQG 96
            A       +      + +  +  +D R A  +    +A+ E      +T++E V     
Sbjct: 865 VAVRELRHAATRAAEQRARAARAESDAREAAEDAAEARAEAEEASLRLRTLEESV---GA 921

Query: 97  EVYRLKAKLENTQGEMESMKEEYER 121
            V  ++A+L  T+  + S   E  R
Sbjct: 922 MVDEIRARLAETRAALASGGRELPR 946


>gnl|CDD|237110 PRK12472, PRK12472, hypothetical protein; Provisional.
          Length = 508

 Score = 29.1 bits (65), Expect = 0.59
 Identities = 25/104 (24%), Positives = 45/104 (43%), Gaps = 10/104 (9%)

Query: 11  TLSRLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAEL 70
           TL+R   +  +A  E   +     R  A L      A L +L+    +  A+L+RA   L
Sbjct: 194 TLAREAEDAARAADEAKTAAAAAAREAAPLK-----ASLRKLERAKARADAELKRADKAL 248

Query: 71  RVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMES 114
              + D  + R+EE+  Q+K  +   E      +L+  + + E+
Sbjct: 249 AAAKTDEAKARAEER--QQKAAQQAAEA---ATQLDTAKADAEA 287


>gnl|CDD|153363 cd07679, F-BAR_PACSIN2, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Protein kinase C and
           Casein kinase Substrate in Neurons 2 (PACSIN2).  F-BAR
           domains are dimerization modules that bind and bend
           membranes and are found in proteins involved in membrane
           dynamics and actin reorganization. Protein kinase C and
           Casein kinase Substrate in Neurons (PACSIN) proteins,
           also called Synaptic dynamin-associated proteins
           (Syndapins), act as regulators of cytoskeletal and
           membrane dynamics. Vetebrates harbor three isoforms with
           distinct expression patterns and specific functions.
           PACSIN 2 or Syndapin II is expressed ubiquitously and is
           involved in the regulation of tubulin polymerization. It
           associates with Golgi membranes and forms a complex with
           dynamin II which is crucial in promoting vesicle
           formation from the trans-Golgi network. PACSIN 2
           contains an N-terminal F-BAR domain and a C-terminal SH3
           domain. F-BAR domains form banana-shaped dimers with a
           positively-charged concave surface that binds to
           negatively-charged lipid membranes. They can induce
           membrane deformation in the form of long tubules.
          Length = 258

 Score = 28.9 bits (64), Expect = 0.64
 Identities = 18/72 (25%), Positives = 36/72 (50%)

Query: 48  ELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLEN 107
           E++  ++ Y   C + + A +     +AD      + K +Q+KVEK + +V + K K E 
Sbjct: 134 EVEAAKKAYHTACKEEKLATSREANSKADPALNPEQLKKLQDKVEKCKQDVLKTKEKYEK 193

Query: 108 TQGEMESMKEEY 119
           +  E++    +Y
Sbjct: 194 SLKELDQTTPQY 205


>gnl|CDD|216941 pfam02233, PNTB, NAD(P) transhydrogenase beta subunit.  This family
           corresponds to the beta subunit of NADP transhydrogenase
           in prokaryotes, and either the protein N- or C terminal
           in eukaryotes. The domain is often found in conjunction
           with pfam01262. Pyridine nucleotide transhydrogenase
           catalyzes the reduction of NAD+ to NADPH. A complete
           loss of activity occurs upon mutation of Gly314 in E.
           coli.
          Length = 464

 Score = 28.9 bits (65), Expect = 0.69
 Identities = 15/50 (30%), Positives = 26/50 (52%), Gaps = 8/50 (16%)

Query: 9   PGTLSRLLSERD---KAVLEMDKSKEELERSQATLA-----TLSTSAELD 50
           PG ++ LL+E +     VLEMD+  ++   +   L      T++ +AE D
Sbjct: 354 PGHMNVLLAEANVPYDIVLEMDEINDDFADTDVVLVIGANDTVNPAAEED 403


>gnl|CDD|240301 PTZ00166, PTZ00166, DNA polymerase delta catalytic subunit;
           Provisional.
          Length = 1054

 Score = 28.8 bits (65), Expect = 0.72
 Identities = 9/28 (32%), Positives = 14/28 (50%)

Query: 9   PGTLSRLLSERDKAVLEMDKSKEELERS 36
           P  +  L++ R KA  EM   K+ L + 
Sbjct: 614 PLIVEELIAARKKAKKEMKDEKDPLLKK 641


>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
          Length = 1113

 Score = 28.7 bits (65), Expect = 0.78
 Identities = 15/54 (27%), Positives = 29/54 (53%), Gaps = 12/54 (22%)

Query: 25  EMDKSKEELER--------SQATLATLSTSAELDRLQEKYDKTCADLRRAQAEL 70
           ++ +++ ELE         ++ TL+TLS    L +L+ +  +T   L+ AQ +L
Sbjct: 95  KLRQAQAELEALKDDNDEETRETLSTLS----LRQLESRLAQTLDQLQNAQNDL 144



 Score = 26.8 bits (60), Expect = 3.1
 Identities = 23/104 (22%), Positives = 52/104 (50%), Gaps = 13/104 (12%)

Query: 15  LLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELRVVQ 74
           LL   DK V      +++LE++ A L  +      DR +E+ ++    L +A A+LR  Q
Sbjct: 53  LLEAEDKLV------QQDLEQTLALLDKI------DRQKEETEQLKQQLAQAPAKLRQAQ 100

Query: 75  ADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEE 118
           A+ E ++ +      +   +   + +L+++L  T  ++++ + +
Sbjct: 101 AELEALKDDNDEETRETLSTL-SLRQLESRLAQTLDQLQNAQND 143


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 28.5 bits (64), Expect = 0.80
 Identities = 26/105 (24%), Positives = 54/105 (51%)

Query: 18  ERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADN 77
           E +   LE   +K + E  +      +   EL+ L+E+ ++    L   + EL  ++   
Sbjct: 764 EEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRR 823

Query: 78  ERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERS 122
           ER+  E + ++E++E+ + ++  L+ +LE  + E+E +KEE E  
Sbjct: 824 ERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEEL 868



 Score = 28.5 bits (64), Expect = 0.99
 Identities = 20/76 (26%), Positives = 43/76 (56%)

Query: 46  SAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKL 105
             EL  L+ + +K   +L+  + ELR ++   E +R + + ++ ++E+ + E+  L+ +L
Sbjct: 673 EEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEEL 732

Query: 106 ENTQGEMESMKEEYER 121
           E  Q  +E ++EE E 
Sbjct: 733 EQLQSRLEELEEELEE 748



 Score = 27.0 bits (60), Expect = 3.0
 Identities = 26/113 (23%), Positives = 59/113 (52%), Gaps = 5/113 (4%)

Query: 12  LSRLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELR 71
           L  L  E  +   E+++ +EELE ++  +  L   +EL+ L+E+ ++   +L   + E+ 
Sbjct: 241 LEELEEELSRLEEELEELQEELEEAEKEIEELK--SELEELREELEELQEELLELKEEIE 298

Query: 72  VVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESM---KEEYER 121
            ++ +   +R   + ++ ++E+ +  +  LK K+E  + E+E      EE E+
Sbjct: 299 ELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEELEQ 351



 Score = 26.6 bits (59), Expect = 3.9
 Identities = 20/97 (20%), Positives = 47/97 (48%), Gaps = 2/97 (2%)

Query: 25  EMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEE 84
             +  +EEL   +A LA +    EL+ L+ + +     L R    L  ++ + + + +E 
Sbjct: 377 LFEALREELAELEAELAEIR--NELEELKREIESLEERLERLSERLEDLKEELKELEAEL 434

Query: 85  KTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYER 121
           + +Q ++E+   E+  L+ +LE  +  ++ ++ E   
Sbjct: 435 EELQTELEELNEELEELEEQLEELRDRLKELERELAE 471



 Score = 26.6 bits (59), Expect = 4.4
 Identities = 18/97 (18%), Positives = 48/97 (49%)

Query: 24  LEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSE 83
           LE    +   +  +          EL  L+E+ ++  + L   + EL  ++ + E ++  
Sbjct: 700 LEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQER 759

Query: 84  EKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYE 120
            + ++E++E  +  + +LK ++E  + + ++++EE E
Sbjct: 760 LEELEEELESLEEALAKLKEEIEELEEKRQALQEELE 796


>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342).  This
           family of bacterial proteins has no known function. The
           proteins are in the region of 500-600 amino acid
           residues in length.
          Length = 450

 Score = 28.4 bits (64), Expect = 0.81
 Identities = 19/75 (25%), Positives = 36/75 (48%), Gaps = 2/75 (2%)

Query: 47  AELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLE 106
            EL  L+E+  +   +L + +  L+ +          EK  +E++EK      +L  +LE
Sbjct: 334 EELKELEEELKELEEELEKIKKLLKKLPKKARGQLPPEK--REQLEKLLETKEKLSEELE 391

Query: 107 NTQGEMESMKEEYER 121
             + E++ +KEE E 
Sbjct: 392 ELEEELKELKEELES 406


>gnl|CDD|222613 pfam14235, DUF4337, Domain of unknown function (DUF4337).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 187 and 201 amino
           acids in length. There is a single completely conserved
           residue Q that may be functionally important.
          Length = 158

 Score = 27.9 bits (63), Expect = 0.84
 Identities = 14/56 (25%), Positives = 21/56 (37%), Gaps = 2/56 (3%)

Query: 32  ELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAEL--RVVQADNERVRSEEK 85
           EL      L   +  AEL     +Y K  A  R    EL  +  +A+ E   +  +
Sbjct: 55  ELAADLLELEAAAPRAELQAKIARYKKEKARYRSEAKELEAKAKEAEAESDHALHQ 110


>gnl|CDD|226809 COG4372, COG4372, Uncharacterized protein conserved in bacteria
           with the myosin-like domain [Function unknown].
          Length = 499

 Score = 28.5 bits (63), Expect = 0.85
 Identities = 18/99 (18%), Positives = 41/99 (41%)

Query: 23  VLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRS 82
             + +K   E ER  A         E + ++++      +L +AQ EL  +    + +++
Sbjct: 92  TAQGEKRAAETEREAARSELQKARQEREAVRQELAAARQNLAKAQQELARLTKQAQDLQT 151

Query: 83  EEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYER 121
             KT+ E+  + + +   L+A  +  Q     +K +   
Sbjct: 152 RLKTLAEQRRQLEAQAQSLQASQKQLQASATQLKSQVLD 190



 Score = 27.7 bits (61), Expect = 1.8
 Identities = 17/78 (21%), Positives = 36/78 (46%)

Query: 44  STSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKA 103
           S   +LD ++ +      +L  AQ E R  + + E  RSE +  +++ E  + E+   + 
Sbjct: 71  SGVFQLDDIRPQLRALRTELGTAQGEKRAAETEREAARSELQKARQEREAVRQELAAARQ 130

Query: 104 KLENTQGEMESMKEEYER 121
            L   Q E+  + ++ + 
Sbjct: 131 NLAKAQQELARLTKQAQD 148


>gnl|CDD|223599 COG0525, ValS, Valyl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 877

 Score = 28.4 bits (64), Expect = 0.91
 Identities = 19/76 (25%), Positives = 34/76 (44%), Gaps = 14/76 (18%)

Query: 46  SAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKL 105
           +AEL RL+++ +K   ++ R + +L      NE   ++     E VEK        K KL
Sbjct: 813 AAELARLEKELEKLEKEIDRIEKKL-----SNEGFVAKAPE--EVVEKE-------KEKL 858

Query: 106 ENTQGEMESMKEEYER 121
              Q ++  ++E    
Sbjct: 859 AEYQVKLAKLEERLAV 874


>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein.  This domain is found
           in cell division proteins which are required for
           kinetochore-spindle association.
          Length = 312

 Score = 28.1 bits (63), Expect = 1.2
 Identities = 17/113 (15%), Positives = 48/113 (42%), Gaps = 2/113 (1%)

Query: 12  LSRLLSERDKAVLEMDKSKEELERSQATLATL-STSAELDRLQ-EKYDKTCADLRRAQAE 69
           L  L  +    + E++       + +     L     +L +L+ E  D    +L RA+ +
Sbjct: 153 LEGLKEDYKLLMKELELLNSIKPKLRDRKDALEEELRQLKQLEDELEDCDPTELDRAKEK 212

Query: 70  LRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERS 122
           L+ +  +      + + ++E++++ + ++  L  K      E+   +++ E+ 
Sbjct: 213 LKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTEIAEAEKKLEQC 265


>gnl|CDD|222630 pfam14258, DUF4350, Domain of unknown function (DUF4350).  This
          domain family is found in bacteria, archaea and
          eukaryotes, and is approximately 70 amino acids in
          length.
          Length = 69

 Score = 26.8 bits (60), Expect = 1.2
 Identities = 13/48 (27%), Positives = 21/48 (43%), Gaps = 5/48 (10%)

Query: 12 LSRLLSERDKAVLEMDKSKEELERSQATL-----ATLSTSAELDRLQE 54
          L+ LL ER   V  +    + L+ S  TL     +   +  E++ L E
Sbjct: 10 LAELLEERGVEVERVTSPYDALDGSDGTLLVIGPSLNLSPEEIEALLE 57


>gnl|CDD|233594 TIGR01844, type_I_sec_TolC, type I secretion outer membrane
           protein, TolC family.  Members of This model are outer
           membrane proteins from the TolC subfamily within the RND
           (Resistance-Nodulation-cell Division) efflux systems.
           These proteins, unlike the NodT subfamily, appear not to
           be lipoproteins. All are believed to participate in type
           I protein secretion, an ABC transporter system for
           protein secretion without cleavage of a signal sequence,
           although they may, like TolC, participate also in the
           efflux of smaller molecules as well. This family
           includes the well-documented examples TolC (E. coli),
           PrtF (Erwinia), and AprF (Pseudomonas aeruginosa)
           [Protein fate, Protein and peptide secretion and
           trafficking, Transport and binding proteins, Porins].
          Length = 415

 Score = 28.1 bits (63), Expect = 1.3
 Identities = 18/79 (22%), Positives = 36/79 (45%), Gaps = 5/79 (6%)

Query: 44  STSAELDRLQEKYD--KTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRL 101
           S S+ L   Q  +D   T   +R A+A      A  E +R+  + +  +  ++  EV R 
Sbjct: 69  SGSSTLTLSQPLFDGGSTWNAVRAAEAA---ALAARETLRATAQDLILRTAEAYMEVLRA 125

Query: 102 KAKLENTQGEMESMKEEYE 120
           +  L   +  + ++KE+ +
Sbjct: 126 QEILALAEANLAALKEQLD 144


>gnl|CDD|215558 PLN03073, PLN03073, ABC transporter F family; Provisional.
          Length = 718

 Score = 27.9 bits (62), Expect = 1.4
 Identities = 13/23 (56%), Positives = 15/23 (65%)

Query: 15  LLSERDKAVLEMDKSKEELERSQ 37
           LLSERD A +E  K KEE +R  
Sbjct: 121 LLSERDLAKIERRKRKEERQREV 143


>gnl|CDD|234342 TIGR03752, conj_TIGR03752, integrating conjugative element protein,
           PFL_4705 family.  Members of this protein family are
           found occasionally on plasmids such as the Pseudomonas
           putida toluene catabolic TOL plasmid pWWO_p085. Usually,
           however, they are found on the bacterial main chromosome
           in regions flanked by markers of conjugative transfer
           and/or transposition [Mobile and extrachromosomal
           element functions, Plasmid functions].
          Length = 472

 Score = 27.7 bits (62), Expect = 1.6
 Identities = 10/58 (17%), Positives = 33/58 (56%), Gaps = 3/58 (5%)

Query: 61  ADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEE 118
           A+++  +  L  + ++NE +++E + +Q++ E+S     +++  +++   E+    E+
Sbjct: 66  AEVKELRKRLAKLISENEALKAENERLQKR-EQSID--QQIQQAVQSETQELTKEIEQ 120


>gnl|CDD|215529 PLN02978, PLN02978, pyridoxal kinase.
          Length = 308

 Score = 27.4 bits (61), Expect = 1.7
 Identities = 17/53 (32%), Positives = 28/53 (52%), Gaps = 1/53 (1%)

Query: 26  MDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNE 78
             K  + L+++ A LA  S  A L R    Y +  AD + +  ELR+VQ+ ++
Sbjct: 243 SHKYPDNLDKA-AELAVSSLQAVLRRTLADYKRAGADPKSSSLELRLVQSQDD 294


>gnl|CDD|179310 PRK01622, PRK01622, OxaA-like protein precursor; Validated.
          Length = 256

 Score = 27.4 bits (61), Expect = 1.7
 Identities = 11/40 (27%), Positives = 23/40 (57%), Gaps = 7/40 (17%)

Query: 84  EKTMQEKVEKSQGEVYRLKAKLENT-------QGEMESMK 116
           ++ MQEK+   + E+ +++AKL+ T       + + E M+
Sbjct: 86  QRGMQEKMAVMKPELDKIQAKLKVTKDLEKQKEYQKEMME 125


>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
           autophagy-related subunit 14.  The Atg14 or Apg14
           proteins are hydrophilic proteins with a predicted
           molecular mass of 40.5 kDa, and have a coiled-coil motif
           at the N terminus region. Yeast cells with mutant Atg14
           are defective not only in autophagy but also in sorting
           of carboxypeptidase Y (CPY), a vacuolar-soluble
           hydrolase, to the vacuole. Subcellular fractionation
           indicate that Apg14p and Apg6p are peripherally
           associated with a membrane structure(s). Apg14p was
           co-immunoprecipitated with Apg6p, suggesting that they
           form a stable protein complex. These results imply that
           Apg6/Vps30p has two distinct functions: in the
           autophagic process and in the vacuolar protein sorting
           pathway. Apg14p may be a component specifically required
           for the function of Apg6/Vps30p through the autophagic
           pathway. There are 17 auto-phagosomal component proteins
           which are categorized into six functional units, one of
           which is the AS-PI3K complex (Vps30/Atg6 and Atg14). The
           AS-PI3K complex and the Atg2-Atg18 complex are essential
           for nucleation, and the specific function of the AS-PI3K
           apparently is to produce phosphatidylinositol
           3-phosphate (PtdIns(3)P) at the pre-autophagosomal
           structure (PAS). The localisation of this complex at the
           PAS is controlled by Atg14. Autophagy mediates the
           cellular response to nutrient deprivation, protein
           aggregation, and pathogen invasion in humans, and
           malfunction of autophagy has been implicated in multiple
           human diseases including cancer. This effect seems to be
           mediated through direct interaction of the human Atg14
           with Beclin 1 in the human phosphatidylinositol 3-kinase
           class III complex.
          Length = 307

 Score = 27.7 bits (62), Expect = 1.7
 Identities = 14/96 (14%), Positives = 40/96 (41%), Gaps = 3/96 (3%)

Query: 25  EMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEE 84
           E+++ +E +E  +  LA   +        +   +  + L + Q E++  ++    + S  
Sbjct: 86  EIEQKRERIEELKRALAQRRSDLSS-ASYQLEKRRASQLEKLQDEIKRTRSKLNALHSLL 144

Query: 85  KTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYE 120
              +  + +   +++ L+  +   +G  +S  E Y 
Sbjct: 145 AEKRSFLCRELAKLFPLRRVIRGRKG--DSSSEPYT 178


>gnl|CDD|216534 pfam01499, Herpes_UL25, Herpesvirus UL25 family.  The herpesvirus
          UL25 gene product is a virion component involved in
          virus penetration and capsid assembly. The product of
          the UL25 gene is required for packaging but not
          cleavage of replicated viral DNA. This family includes
          a number of herpesvirus proteins: EHV-1 36, EBV BVRF1,
          HCMV UL77, ILTV ORF2, and VZV gene 34.
          Length = 538

 Score = 27.7 bits (62), Expect = 1.8
 Identities = 12/78 (15%), Positives = 23/78 (29%), Gaps = 10/78 (12%)

Query: 5  LVPPPGTLSRLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLR 64
          L     +L      R    L     +  L +++         AELD L++ +    A++ 
Sbjct: 19 LRCDEPSL-FETRPRFALRLRTRAEQLRLAKAKKRAL----KAELDNLEQLHQTRSAEID 73

Query: 65 RA-----QAELRVVQADN 77
                      V    +
Sbjct: 74 ADLRPIENQLEEVADPLD 91


>gnl|CDD|223914 COG0845, AcrA, Membrane-fusion protein [Cell envelope biogenesis,
           outer membrane].
          Length = 372

 Score = 27.4 bits (60), Expect = 2.1
 Identities = 17/69 (24%), Positives = 28/69 (40%), Gaps = 4/69 (5%)

Query: 11  TLSRLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAEL 70
            L+R  +    A L   + + +L   +A ++     AELD  Q       A +  AQA L
Sbjct: 113 QLARAQALLAPAELGDLQREAKLAAEKAAVSQ----AELDAAQALLRAAEALVEAAQAAL 168

Query: 71  RVVQADNER 79
              + + E 
Sbjct: 169 ASAKLNLEY 177


>gnl|CDD|235582 PRK05729, valS, valyl-tRNA synthetase; Reviewed.
          Length = 874

 Score = 27.4 bits (62), Expect = 2.2
 Identities = 19/83 (22%), Positives = 39/83 (46%), Gaps = 2/83 (2%)

Query: 27  DKSKEELERSQATLATLS--TSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEE 84
            + +  LE ++A +  L+   S E+    E+  +  A      AEL +       V +E 
Sbjct: 754 AEDRARLEANEAYIKRLARLESLEILADDEEAPEGAASAVVGGAELFLPLEGLIDVEAEL 813

Query: 85  KTMQEKVEKSQGEVYRLKAKLEN 107
             +++++ K + E+ R++ KL N
Sbjct: 814 ARLEKELAKLEKEIERVEKKLSN 836


>gnl|CDD|220640 pfam10234, Cluap1, Clusterin-associated protein-1.  This protein is
           conserved from worms to humans. The protein of 413 amino
           acids contains a central coiled-coil domain, possibly
           the region that binds to clusterin. Cluap1 expression is
           highest in the nucleus and gradually increases during
           late S to G2/M phases of the cell cycle and returns to
           the basal level in the G0/G1 phases. In addition, it is
           upregulated in colon cancer tissues compared to
           corresponding non-cancerous mucosa. It thus plays a
           crucial role in the life of the cell.
          Length = 269

 Score = 26.9 bits (60), Expect = 2.4
 Identities = 16/56 (28%), Positives = 34/56 (60%)

Query: 66  AQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYER 121
             A L+ +QA+ + ++S+E  ++ K+E+ + E+ R + +L+  Q    +  EEYE+
Sbjct: 176 LAARLQQLQAELDNLKSDEANLEAKIERKKQELERTQKRLQALQSVRPAFMEEYEK 231


>gnl|CDD|223841 COG0770, MurF, UDP-N-acetylmuramyl pentapeptide synthase [Cell
           envelope biogenesis, outer membrane].
          Length = 451

 Score = 26.9 bits (60), Expect = 2.7
 Identities = 10/22 (45%), Positives = 14/22 (63%)

Query: 9   PGTLSRLLSERDKAVLEMDKSK 30
           P TL RL ++ + AVLEM  + 
Sbjct: 145 PLTLLRLPADTEYAVLEMGMNH 166


>gnl|CDD|216101 pfam00755, Carn_acyltransf, Choline/Carnitine o-acyltransferase. 
          Length = 590

 Score = 26.9 bits (60), Expect = 3.1
 Identities = 19/51 (37%), Positives = 27/51 (52%), Gaps = 6/51 (11%)

Query: 6  VPP-PGTLSRLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEK 55
          VP    TL R L +  K +L    S+EE ER++A +   S   E ++LQE 
Sbjct: 7  VPALEDTLDRYL-DSVKPLL----SEEEFERTEALVKDFSAGGEGEKLQEY 52


>gnl|CDD|234541 TIGR04320, Surf_Exclu_PgrA, SEC10/PgrA surface exclusion domain.
           This model describes a conserved domain found in surface
           proteins of a number of Firmutes. Many members have
           LPXTG C-terminal anchoring motifs and a substantial
           number have the KxYKxGKxW putative sorting signal at the
           N-terminus. The tetracycline resistance plasmid pCF10 in
           Enterococcus faecalis promotes conjugal plasmid transfer
           in response to sex pheromones, but PgrA/Sec10 encoded by
           that plasmid, a member of this family, specifically
           inhibits the ability of cells to receive homologous
           plasmids. The phenomenon is called surface exclusion.
          Length = 356

 Score = 26.6 bits (59), Expect = 3.3
 Identities = 15/85 (17%), Positives = 30/85 (35%), Gaps = 11/85 (12%)

Query: 27  DKSKEELERSQATLATLSTS-----AELDRLQEKYDKTCADLRRAQAELRVVQADNERVR 81
           + ++  L  +Q   A    +      EL   Q +  +T      AQ  L   QA      
Sbjct: 278 NTAQAALTTAQTAYAAAQAALATAQKELANAQAQALQT------AQNNLATAQAALANAE 331

Query: 82  SEEKTMQEKVEKSQGEVYRLKAKLE 106
           +     +E +     ++ + +A L+
Sbjct: 332 ARLAKAKEALANLNADLAKKQAALD 356


>gnl|CDD|233229 TIGR00999, 8a0102, Membrane Fusion Protein cluster 2 (function with
           RND porters).  [Transport and binding proteins, Other].
          Length = 265

 Score = 26.6 bits (59), Expect = 3.5
 Identities = 17/66 (25%), Positives = 29/66 (43%), Gaps = 6/66 (9%)

Query: 54  EKYDKTCADLRRAQAELRVVQADNERVRSE-EKTMQEKVEKSQGEVYRLKAKLENTQGEM 112
            +  K  A+L+ AQ      + +  R   E EK + E+    + E    +  LE  Q E+
Sbjct: 16  PELAKMAAELKVAQK-----RVELARKTYEREKKLFEQGVIPRQEFESAEYALEEAQAEV 70

Query: 113 ESMKEE 118
           ++ K E
Sbjct: 71  QAAKSE 76


>gnl|CDD|192773 pfam11559, ADIP, Afadin- and alpha -actinin-Binding.  This family
           is found in mammals where it is localised at cell-cell
           adherens junctions, and in Sch. pombe and other fungi
           where it anchors spindle-pole bodies to spindle
           microtubules. It is a coiled-coil structure, and in
           pombe, it is required for anchoring the minus end of
           spindle microtubules to the centrosome equivalent, the
           spindle-pole body. The name ADIP derives from the family
           being composed of Afadin- and alpha -Actinin-Binding
           Proteins Localised at Cell-Cell Adherens Junctions.
          Length = 149

 Score = 26.5 bits (59), Expect = 3.5
 Identities = 24/82 (29%), Positives = 44/82 (53%), Gaps = 2/82 (2%)

Query: 27  DKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKT 86
           D+  E  E  + TL  L   AE++RLQ   ++    L   + EL ++QA   ++  + KT
Sbjct: 46  DRDLEFRESLEETLRKLE--AEIERLQNTIERLKTQLEDLERELALLQAKERQLEKKLKT 103

Query: 87  MQEKVEKSQGEVYRLKAKLENT 108
           +++K++  + EV RLK  ++  
Sbjct: 104 LEQKLKNEKEEVQRLKNIIQQR 125


>gnl|CDD|233228 TIGR00998, 8a0101, efflux pump membrane protein (multidrug
           resistance protein A).  [Transport and binding proteins,
           Other].
          Length = 334

 Score = 26.7 bits (59), Expect = 3.6
 Identities = 16/63 (25%), Positives = 28/63 (44%), Gaps = 2/63 (3%)

Query: 61  ADLRRAQAELRVVQADNERVRSEEKTMQ--EKVEKSQGEVYRLKAKLENTQGEMESMKEE 118
             L    AEL + +A+        +T Q    V++ Q +V  LK KLE  + ++   + +
Sbjct: 71  VRLDPTNAELALAKAEANLAALVRQTKQLEITVQQLQAKVESLKIKLEQAREKLLQAELD 130

Query: 119 YER 121
             R
Sbjct: 131 LRR 133


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 943 to 1234 amino
           acids in length. This family contains a P-loop motif
           suggesting it is a nucleotide binding protein. It may be
           involved in replication.
          Length = 1198

 Score = 27.0 bits (60), Expect = 3.6
 Identities = 14/84 (16%), Positives = 35/84 (41%), Gaps = 8/84 (9%)

Query: 46  SAELDRLQEKYDKTCADLRRAQAELRVVQAD--------NERVRSEEKTMQEKVEKSQGE 97
             +L  L+ +      +L   +  L   Q +         +++R+ E  ++E  ++   E
Sbjct: 255 FEQLLSLELRLQHLHGELVADEERLAEEQEERQEAKNRLRQQLRTLEDQLKEARDELNQE 314

Query: 98  VYRLKAKLENTQGEMESMKEEYER 121
           +    AKL   + E+E ++++   
Sbjct: 315 LSAANAKLAADRSELELLEDQKGA 338


>gnl|CDD|224241 COG1322, COG1322, Predicted nuclease of restriction
           endonuclease-like fold, RmuC family [General function
           prediction only].
          Length = 448

 Score = 26.6 bits (59), Expect = 3.7
 Identities = 20/105 (19%), Positives = 55/105 (52%), Gaps = 5/105 (4%)

Query: 18  ERDKAVLEM--DKSKEELERSQATLATLSTSAE--LDRLQEKYDKTCADLRRAQAELRVV 73
              KA L+    +S+E+L+    +LA LS+  +   + + E+ ++  A+L +   + +++
Sbjct: 76  NELKARLQQQLLQSREQLQLLIESLAQLSSEFQELANEIFEELNRRLAELNQQNLK-QLL 134

Query: 74  QADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEE 118
           +   E +    + +++++ +S  E   L  +++   GE++ + +E
Sbjct: 135 KPLREVLEKFREQLEQRIHESAEERSTLLEEIDRLLGEIQQLAQE 179


>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
          Length = 562

 Score = 26.5 bits (59), Expect = 3.8
 Identities = 12/70 (17%), Positives = 30/70 (42%)

Query: 44  STSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKA 103
                + R Q KYD+   + +  +AE+  +  +   +  + +     + K      ++K+
Sbjct: 210 KNGENIARKQNKYDELVEEAKTIKAEIEELTDELLNLVMDIEDPSAALNKLNTAAAKIKS 269

Query: 104 KLENTQGEME 113
           K+E  Q  ++
Sbjct: 270 KIEQFQKVIK 279


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 26.6 bits (59), Expect = 4.1
 Identities = 30/116 (25%), Positives = 53/116 (45%), Gaps = 8/116 (6%)

Query: 5   LVPPPGTLSRLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLR 64
           L      L R   E  K   E+DK+ EEL  ++  L  L    EL+ L++KY +   +  
Sbjct: 607 LKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEEL--RKELEELEKKYSEE--EYE 662

Query: 65  RAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYE 120
             + E   +  +   +R+E + ++++ E    E+ +   KL+    E E  K+E E
Sbjct: 663 ELREEYLELSRELAGLRAELEELEKRRE----EIKKTLEKLKEELEEREKAKKELE 714


>gnl|CDD|131452 TIGR02399, salt_tol_Pase, glucosylglycerol 3-phosphatase.  Proteins
           in this family are glucosylglycerol-phosphate
           phosphatase, with the gene symbol stpA (Salt Tolerance
           Protein A). A motif characteristic of acid phosphatases
           is found, but otherwise this family shows little
           sequence similarity to other phosphatases. This enzyme
           acts on the glucosylglycerol phosphate, product of
           glucosylglycerol phosphate synthase and immediate
           precursor of the osmoprotectant glucosylglycerol.
          Length = 389

 Score = 26.3 bits (58), Expect = 4.1
 Identities = 10/20 (50%), Positives = 13/20 (65%)

Query: 91  VEKSQGEVYRLKAKLENTQG 110
           V+ S GEVYR   K+E  +G
Sbjct: 337 VDSSSGEVYRPSLKIEGLKG 356


>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
           [General function prediction only].
          Length = 239

 Score = 26.2 bits (58), Expect = 4.1
 Identities = 9/53 (16%), Positives = 29/53 (54%)

Query: 69  ELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYER 121
            ++ +  + +R+    K +++ ++K++ E+  L   LE  + E+E ++ +  +
Sbjct: 11  AIQKLDLEKDRLEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQ 63


>gnl|CDD|226096 COG3566, COG3566, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 379

 Score = 26.4 bits (58), Expect = 4.5
 Identities = 18/74 (24%), Positives = 30/74 (40%), Gaps = 2/74 (2%)

Query: 20  DKAVLEMDKSKEELERSQATLAT--LSTSAELDRLQEKYDKTCADLRRAQAELRVVQADN 77
             A  ++D  +++L  +QA L +    T  ELD  + +  K  A L  A A     Q  +
Sbjct: 215 KTATEKVDALEKDLHAAQAKLDSGQALTKEELDAKKAELSKALAALEAANAADEDPQDRD 274

Query: 78  ERVRSEEKTMQEKV 91
             V +  + M    
Sbjct: 275 AAVEAAARLMGAAR 288


>gnl|CDD|235595 PRK05762, PRK05762, DNA polymerase II; Reviewed.
          Length = 786

 Score = 26.4 bits (59), Expect = 4.5
 Identities = 12/30 (40%), Positives = 15/30 (50%), Gaps = 5/30 (16%)

Query: 9   PGTLSRLLSERDKAVLEMDKSKEELERSQA 38
           P  + RL   RD+A  EM+K       SQA
Sbjct: 468 PEIVERLWEGRDEAKREMNK-----PLSQA 492


>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 265

 Score = 26.2 bits (58), Expect = 4.6
 Identities = 18/73 (24%), Positives = 39/73 (53%)

Query: 45  TSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAK 104
           T+     L +K     + L   Q E + +Q + E + ++ + +Q K+++ Q E+ + KA+
Sbjct: 22  TTVFAALLSDKIQNQDSKLSELQKEKKNIQNEIESLDNQIEEIQSKIDELQKEIDQSKAE 81

Query: 105 LENTQGEMESMKE 117
           ++  Q E+  +KE
Sbjct: 82  IKKLQKEIAELKE 94



 Score = 25.4 bits (56), Expect = 8.3
 Identities = 12/86 (13%), Positives = 39/86 (45%)

Query: 35  RSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKS 94
               T+     S ++     K  +   + +  Q E+  +    E ++S+   +Q+++++S
Sbjct: 19  AFLTTVFAALLSDKIQNQDSKLSELQKEKKNIQNEIESLDNQIEEIQSKIDELQKEIDQS 78

Query: 95  QGEVYRLKAKLENTQGEMESMKEEYE 120
           + E+ +L+ ++   +  +   +E  +
Sbjct: 79  KAEIKKLQKEIAELKENIVERQELLK 104


>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
           Provisional.
          Length = 695

 Score = 26.4 bits (59), Expect = 4.7
 Identities = 17/88 (19%), Positives = 30/88 (34%), Gaps = 2/88 (2%)

Query: 33  LERSQATLATLSTSAELDRLQE--KYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEK 90
           + R++A  A    +      +   K     A + RA+A+    QA +     +   +  K
Sbjct: 554 IARAKAKKAAQQAANAEAEEEVDPKKAAVAAAIARAKAKKAAQQAASAEPEEQVAEVDPK 613

Query: 91  VEKSQGEVYRLKAKLENTQGEMESMKEE 118
                  + R KAK    Q   E  +  
Sbjct: 614 KAAVAAAIARAKAKKAEQQANAEPEEPV 641


>gnl|CDD|153329 cd07645, I-BAR_IMD_BAIAP2L1, Inverse (I)-BAR, also known as the
           IRSp53/MIM homology Domain (IMD), of Brain-specific
           Angiogenesis Inhibitor 1-Associated Protein 2-Like 1.
           The IMD domain, also called Inverse-Bin/Amphiphysin/Rvs
           (I-BAR) domain, is a dimerization and lipid-binding
           module that bends membranes and induces membrane
           protrusions. BAIAP2L1 (Brain-specific Angiogenesis
           Inhibitor 1-Associated Protein 2-Like 1) is also known
           as IRTKS (Insulin Receptor Tyrosine Kinase Substrate).
           It is widely expressed, serves as a substrate for the
           insulin receptor, and binds the small GTPase Rac. It
           plays a role in regulating the actin cytoskeleton and
           colocalizes with F-actin, cortactin, VASP, and vinculin.
           BAIAP2L1 expression leads to the formation of short
           actin bundles, distinct from filopodia-like protrusions
           induced by the expression of the related protein IRSp53.
           It contains an N-terminal IMD, an SH3 domain, and a WASP
           homology 2 (WH2) actin-binding motif at the C-terminus.
           The IMD domain of BAIAP2L1 binds and bundles actin
           filaments, and binds the small GTPase Rac.
          Length = 226

 Score = 26.0 bits (57), Expect = 4.8
 Identities = 14/57 (24%), Positives = 33/57 (57%), Gaps = 3/57 (5%)

Query: 64  RRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYE 120
           R+   +++ + A  +R ++E K   + +EKSQ ++ +++ K   +QG   + K E++
Sbjct: 101 RKTDLDVKYMTATLKRYQTEHKNKLDSLEKSQADLKKIRRK---SQGRRNASKYEHK 154


>gnl|CDD|222709 pfam14362, DUF4407, Domain of unknown function (DUF4407).  This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 366 and 597 amino
           acids in length. There is a single completely conserved
           residue R that may be functionally important.
          Length = 297

 Score = 26.1 bits (58), Expect = 5.3
 Identities = 22/84 (26%), Positives = 39/84 (46%), Gaps = 11/84 (13%)

Query: 47  AELDRLQEKYDKTCADLRRAQAELRVVQADNERVRS--------EEKTMQEKVEKSQGEV 98
           AE+  LQ + D+  A++  A  E    Q + E              K  +EK++ +Q  +
Sbjct: 142 AEIAALQAEIDEAQAEVNAAYQE---AQCEAEGTGGTGVAGKGPVYKEKREKLDAAQARL 198

Query: 99  YRLKAKLENTQGEMESMKEEYERS 122
             LKA+L+    ++E+ K   ER+
Sbjct: 199 ETLKARLDAAIAQLEAQKAALERN 222


>gnl|CDD|213754 TIGR02971, heterocyst_DevB, ABC exporter membrane fusion protein,
           DevB family.  Members of this protein family are found
           mostly in the Cyanobacteria, but also in the
           Planctomycetes. DevB from Anabaena sp. strain PCC 7120
           is partially characterized as a membrane fusion protein
           of the DevBCA ABC exporter, probably a glycolipid
           exporter, required for heterocyst formation. Most
           Cyanobacteria have one member only, but Nostoc sp. PCC
           7120 has seven members.
          Length = 327

 Score = 25.9 bits (57), Expect = 5.3
 Identities = 18/71 (25%), Positives = 34/71 (47%)

Query: 47  AELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLE 106
           AELD  + + D+  A L + +A  +  +   +R       + + V   Q  + RL+A+LE
Sbjct: 55  AELDVARTQLDEAKARLAQVRAGAKKGEIAAQRAARAAAKLFKDVAAQQATLNRLEAELE 114

Query: 107 NTQGEMESMKE 117
             Q E++  + 
Sbjct: 115 TAQREVDRYRS 125


>gnl|CDD|153283 cd07599, BAR_Rvs167p, The Bin/Amphiphysin/Rvs (BAR) domain of
           Saccharomyces cerevisiae Reduced viability upon
           starvation protein 167 and similar proteins.  BAR
           domains are dimerization, lipid binding and curvature
           sensing modules found in many different proteins with
           diverse functions. This subfamily is composed of fungal
           proteins with similarity to Saccharomyces cerevisiae
           Reduced viability upon starvation protein 167 (Rvs167p)
           and Schizosaccharomyces pombe Hob1 (homolog of Bin1). S.
           cerevisiae Rvs167p plays a role in regulation of the
           actin cytoskeleton, endocytosis, and sporulation. It
           forms a heterodimer with another BAR domain protein
           Rvs161p. Rvs161p and Rvs167p share common functions but
           are not interchangeable. Their BAR domains cannot be
           replaced with each other and the overexpression of one
           cannot suppress the mutant phenotypes of the other.
           Rvs167p also interacts with the GTPase activating
           protein (GAP) Gyp5p, which is involved in ER to Golgi
           vesicle trafficking. BAR domains form dimers that bind
           to membranes, induce membrane bending and curvature, and
           may also be involved in protein-protein interactions.
          Length = 216

 Score = 25.7 bits (57), Expect = 6.0
 Identities = 12/60 (20%), Positives = 26/60 (43%), Gaps = 2/60 (3%)

Query: 14  RLLSERDKAVLEMDKSKEELERSQATLATLST--SAELDRLQEKYDKTCADLRRAQAELR 71
           + + +RD   L+ DK + +L +       LS     +L +L+ K ++   +       L+
Sbjct: 114 KTIKKRDHKKLDYDKLQNKLNKLLQKKKELSLKDEKQLAKLERKLEEAKEEYEALNELLK 173


>gnl|CDD|236222 PRK08292, PRK08292, AMP nucleosidase; Provisional.
          Length = 489

 Score = 26.0 bits (58), Expect = 6.2
 Identities = 15/42 (35%), Positives = 25/42 (59%), Gaps = 1/42 (2%)

Query: 43 LSTSAELDRLQEKYDKTCADLRRA-QAELRVVQADNERVRSE 83
          L  +A +DRL+  Y+++ A LR A  A +R  +  +ER R+ 
Sbjct: 10 LDPAAAVDRLEALYERSTAALRDAFAAYIRGGELPDERARAY 51


>gnl|CDD|234172 TIGR03345, VI_ClpV1, type VI secretion ATPase, ClpV1 family.
           Members of this protein family are homologs of ClpB, an
           ATPase associated with chaperone-related functions.
           These ClpB homologs, designated ClpV1, are a key
           component of the bacterial pathogenicity-associated type
           VI secretion system [Protein fate, Protein and peptide
           secretion and trafficking, Cellular processes,
           Pathogenesis].
          Length = 852

 Score = 26.1 bits (58), Expect = 7.1
 Identities = 22/108 (20%), Positives = 40/108 (37%), Gaps = 5/108 (4%)

Query: 8   PPGTLSRLLSERDKAVLEMDKSKEELERS---QATLATLSTSAELDRLQEKYDKTCADLR 64
            P  L  L        LE+D  + E          LA L   AEL  L+ +     A  +
Sbjct: 417 TPAALEDLRRRIAALELELDALEREAALGADHDERLAELR--AELAALEAELAALEARWQ 474

Query: 65  RAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEM 112
           + +  +  + A    + ++     +     + ++  L+A L + QGE 
Sbjct: 475 QEKELVEAILALRAELEADADAPADDDAALRAQLAELEAALASAQGEE 522


>gnl|CDD|220494 pfam09969, DUF2203, Uncharacterized conserved protein (DUF2203). 
          This domain, found in various hypothetical bacterial
          proteins, has no known function.
          Length = 120

 Score = 25.3 bits (56), Expect = 7.2
 Identities = 18/58 (31%), Positives = 30/58 (51%)

Query: 14 RLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELR 71
           LL E  + + E+ ++K ELE  +A L+    S E   L+E+  +    +R A AE+ 
Sbjct: 10 ELLPELRRLLEEVREAKAELEELEAALSEGERSLEARGLKEELRELEEQIRAAIAEIE 67


>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
           interacts with SMC proteins [Cell division and
           chromosome partitioning].
          Length = 622

 Score = 25.7 bits (56), Expect = 7.4
 Identities = 14/74 (18%), Positives = 41/74 (55%), Gaps = 3/74 (4%)

Query: 48  ELDRLQEKYD---KTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAK 104
           + D L EK     K    ++  + + R +++D+ +  +    M++K ++  G++ +LK++
Sbjct: 279 QNDNLYEKIQEAMKISQKIKTLREKWRALKSDSNKYENYVNAMKQKSQEWPGKLEKLKSE 338

Query: 105 LENTQGEMESMKEE 118
           +E  + E+++++  
Sbjct: 339 IELKEEEIKALQSN 352


>gnl|CDD|221016 pfam11180, DUF2968, Protein of unknown function (DUF2968).  This
           family of proteins has no known function.
          Length = 192

 Score = 25.5 bits (56), Expect = 7.5
 Identities = 10/68 (14%), Positives = 37/68 (54%)

Query: 51  RLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQG 110
           +L+ +  +    +  ++A    +QAD +  +++E+ +  + ++++ E   L+A+ +  Q 
Sbjct: 109 QLEAQKAQLERQIAASEARAERLQADLQLAQAQEQQVAARQQQARQEAVALEAQRQAAQA 168

Query: 111 EMESMKEE 118
           ++  ++ +
Sbjct: 169 QLRKLQRQ 176


>gnl|CDD|153364 cd07680, F-BAR_PACSIN1, The F-BAR (FES-CIP4 Homology and
           Bin/Amphiphysin/Rvs) domain of Protein kinase C and
           Casein kinase Substrate in Neurons 1 (PACSIN1).  F-BAR
           domains are dimerization modules that bind and bend
           membranes and are found in proteins involved in membrane
           dynamics and actin reorganization. Protein kinase C and
           Casein kinase Substrate in Neurons (PACSIN) proteins,
           also called Synaptic dynamin-associated proteins
           (Syndapins), act as regulators of cytoskeletal and
           membrane dynamics. Vetebrates harbor three isoforms with
           distinct expression patterns and specific functions.
           PACSIN 1 or Syndapin I is expressed specifically in the
           brain and is localized in neurites and synaptic boutons.
           It binds the brain-specific proteins dynamin I,
           synaptojanin, synapsin I, and neural Wiskott-Aldrich
           syndrome protein (nWASP), and functions as a link
           between the cytoskeletal machinery and synaptic vesicle
           endocytosis. PACSIN 1 interacts with huntingtin and may
           be implicated in the neuropathology of Huntington's
           disease. It contains an N-terminal F-BAR domain and a
           C-terminal SH3 domain. F-BAR domains form banana-shaped
           dimers with a positively-charged concave surface that
           binds to negatively-charged lipid membranes. They can
           induce membrane deformation in the form of long tubules.
          Length = 258

 Score = 25.4 bits (55), Expect = 8.0
 Identities = 15/75 (20%), Positives = 35/75 (46%)

Query: 48  ELDRLQEKYDKTCADLRRAQAELRVVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLEN 107
           EL+  ++ Y   C + + A       +A+      ++K +Q+KV+K + +V + + K E 
Sbjct: 134 ELEAAKKAYHLACKEEKLAMTREANSKAEQSVTPEQQKKLQDKVDKCKQDVQKTQEKYEK 193

Query: 108 TQGEMESMKEEYERS 122
              ++     +Y  +
Sbjct: 194 VLDDVGKTTPQYMEN 208


>gnl|CDD|221380 pfam12017, Tnp_P_element, Transposase protein.  Protein in this
           family are transposases found in insects. This region is
           about 230 amino acids in length and is found associated
           with pfam05485.
          Length = 236

 Score = 25.6 bits (56), Expect = 8.0
 Identities = 10/40 (25%), Positives = 23/40 (57%)

Query: 77  NERVRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMK 116
           N     E K++++K+   + E++ L+ +LE +Q   +S+ 
Sbjct: 13  NHTTLMENKSLRQKIRALEKEMHSLRQQLEESQQLEKSLG 52


>gnl|CDD|131264 TIGR02209, ftsL_broad, cell division protein FtsL.  This model
          represents FtsL, both forms similar to that in E. coli
          and similar to that in B. subtilis. FtsL is one of the
          later proteins active in cell division septum
          formation. FtsL is small, low in complexity, and highly
          divergent. The scope of this model is broader than that
          of the pfam04999.3 for FtsL, as this one includes FtsL
          from Bacillus subtilis and related species [Cellular
          processes, Cell division].
          Length = 85

 Score = 24.5 bits (54), Expect = 8.0
 Identities = 16/63 (25%), Positives = 31/63 (49%), Gaps = 4/63 (6%)

Query: 12 LSRLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELR 71
            +L +E  K  LE+DK ++E    Q  +A LS    ++++ +K       ++   A ++
Sbjct: 26 TRQLNNELQKLQLEIDKLQKEWRDLQLEVAELSRHERIEKIAKKQLG----MKLPDANIK 81

Query: 72 VVQ 74
          VV 
Sbjct: 82 VVV 84


>gnl|CDD|181619 PRK09039, PRK09039, hypothetical protein; Validated.
          Length = 343

 Score = 25.7 bits (57), Expect = 8.4
 Identities = 21/96 (21%), Positives = 41/96 (42%), Gaps = 12/96 (12%)

Query: 12  LSRLLSERDKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELR 71
           LSR +S +D A          L+R  + +A L+    L+R Q   D     +   +A L 
Sbjct: 44  LSREISGKDSA----------LDRLNSQIAELADLLSLER-QGNQDLQ-DSVANLRASLS 91

Query: 72  VVQADNERVRSEEKTMQEKVEKSQGEVYRLKAKLEN 107
             +A+  R+++    +      ++G    L  +L++
Sbjct: 92  AAEAERSRLQALLAELAGAGAAAEGRAGELAQELDS 127


>gnl|CDD|169982 PRK09578, PRK09578, periplasmic multidrug efflux lipoprotein
           precursor; Reviewed.
          Length = 385

 Score = 25.5 bits (56), Expect = 8.8
 Identities = 15/35 (42%), Positives = 19/35 (54%)

Query: 19  RDKAVLEMDKSKEELERSQATLATLSTSAELDRLQ 53
           RD+AV E D ++   +  QA  A  S  AEL R Q
Sbjct: 132 RDRAVSERDYTEAVADERQAKAAVASAKAELARAQ 166


>gnl|CDD|182843 PRK10925, PRK10925, superoxide dismutase; Provisional.
          Length = 206

 Score = 25.3 bits (55), Expect = 9.5
 Identities = 21/103 (20%), Positives = 42/103 (40%), Gaps = 2/103 (1%)

Query: 20  DKAVLEMDKSKEELERSQATLATLSTSAELDRLQEKYDKTCADLRRAQAELRVVQADNER 79
           DK  +E+  +K          A L +  E   L    ++    L +  A+ + V  +N  
Sbjct: 20  DKQTMEIHHTKHHQTYVNNANAALESLPEFANLP--VEELITKLDQLPADKKTVLRNNAG 77

Query: 80  VRSEEKTMQEKVEKSQGEVYRLKAKLENTQGEMESMKEEYERS 122
             +      + ++K       LKA +E   G +++ K E+E++
Sbjct: 78  GHANHSLFWKGLKKGTTLQGDLKAAIERDFGSVDNFKAEFEKA 120


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.306    0.122    0.310 

Gapped
Lambda     K      H
   0.267   0.0677    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 5,731,544
Number of extensions: 481800
Number of successful extensions: 1560
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1375
Number of HSP's successfully gapped: 540
Length of query: 122
Length of database: 10,937,602
Length adjustment: 84
Effective length of query: 38
Effective length of database: 7,211,866
Effective search space: 274050908
Effective search space used: 274050908
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 53 (24.3 bits)