RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy16513
         (156 letters)



>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail.  The myosin molecule is a
           multi-subunit complex made up of two heavy chains and
           four light chains it is a fundamental contractile
           protein found in all eukaryote cell types. This family
           consists of the coiled-coil myosin heavy chain tail
           region. The coiled-coil is composed of the tail from two
           molecules of myosin. These can then assemble into the
           macromolecular thick filament. The coiled-coil region
           provides the structural backbone the thick filament.
          Length = 859

 Score =  121 bits (306), Expect = 9e-33
 Identities = 79/109 (72%), Positives = 94/109 (86%)

Query: 2   QARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEM 61
           +ARI ELEEE+E ER +RAKAEK RADL+RELEEL ERLEEAGGAT+AQIELNKKREAE+
Sbjct: 45  EARIRELEEELEAERAARAKAEKARADLSRELEELSERLEEAGGATAAQIELNKKREAEL 104

Query: 62  SKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEA 110
           +KLR+DLEEAN+QHE  LA LRKKH DA++E+ EQI+QL K K K ++ 
Sbjct: 105 AKLRKDLEEANLQHEEALATLRKKHQDAINELSEQIEQLQKQKAKAEKE 153



 Score = 37.7 bits (88), Expect = 0.001
 Identities = 36/152 (23%), Positives = 67/152 (44%), Gaps = 17/152 (11%)

Query: 1   MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 60
           +  +I ELEE  E         EK ++ L  ELE+L   LE A  A S   +  K  +  
Sbjct: 326 LNQKISELEEAAEAANAKCDSLEKTKSRLQSELEDLQIELERANAAASELEKKQKNFDKI 385

Query: 61  MSKLRRDLEEANIQHESTLANLRKKHNDA------VSEMGEQIDQLNK-----------L 103
           +++ +R ++E   + ++     R    +       + E+ +Q++ L +           L
Sbjct: 386 LAEWKRKVDELQAELDTAQREARNLSTELFRLKNELEELKDQVEALRRENKNLQDEIHDL 445

Query: 104 KTKLDEANRTINDLDVARKKLIIENSDLVRQL 135
             +L E  R +++L+ AR++L  E  +L   L
Sbjct: 446 TDQLGEGGRNVHELEKARRRLEAEKDELQAAL 477



 Score = 31.6 bits (72), Expect = 0.14
 Identities = 37/154 (24%), Positives = 75/154 (48%), Gaps = 3/154 (1%)

Query: 1   MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 60
           +QA + +L  +++   +++  AEK+   L  +L EL  +L+E     +       + ++E
Sbjct: 157 LQAEVDDLLAQLDQIAKAKLNAEKKAKQLESQLSELQVKLDELQRQLNDLTSQKSRLQSE 216

Query: 61  MSKLRRDLEEANIQHESTL---ANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDL 117
            S L R LEEA  Q  +     + L  +  +A   + E+  +   L+ +L +    ++ L
Sbjct: 217 NSDLTRQLEEAEAQVSNLSKLKSQLESQLEEAKRSLEEESRERANLQAQLRQLEHDLDSL 276

Query: 118 DVARKKLIIENSDLVRQLSKVTADARMWKIKYET 151
               ++     ++L RQLSK  A+ + W+ K+E+
Sbjct: 277 REQLEEESEAKAELERQLSKANAEIQQWRSKFES 310



 Score = 31.2 bits (71), Expect = 0.22
 Identities = 37/137 (27%), Positives = 57/137 (41%), Gaps = 10/137 (7%)

Query: 7   ELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRR 66
           E +  +E E + RA  + Q   L  +L+ L E+LEE   A +       K  AE+ + R 
Sbjct: 247 EAKRSLEEESRERANLQAQLRQLEHDLDSLREQLEEESEAKAELERQLSKANAEIQQWRS 306

Query: 67  DLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVARKKLII 126
             E         L  L+KK N  +SE+ E            + AN   + L+  + +L  
Sbjct: 307 KFESEGALRAEELEELKKKLNQKISELEEAA----------EAANAKCDSLEKTKSRLQS 356

Query: 127 ENSDLVRQLSKVTADAR 143
           E  DL  +L +  A A 
Sbjct: 357 ELEDLQIELERANAAAS 373



 Score = 26.9 bits (60), Expect = 5.2
 Identities = 32/143 (22%), Positives = 61/143 (42%), Gaps = 4/143 (2%)

Query: 1   MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 60
            Q  I  L+  +E E + +A+A + +  L  ++ EL   L+ A  A +   +  KK + +
Sbjct: 523 HQRAIESLQATLEAEAKGKAEASRLKKKLEGDINELEIALDHANKANAEAQKNVKKYQQQ 582

Query: 61  MSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVA 120
           + +L+  +EE     E     L      A +   E    L +L++ L++A R     +  
Sbjct: 583 VKELQTQVEEEQRAREDAREQLAVAERRATALEAE----LEELRSALEQAERARKQAETE 638

Query: 121 RKKLIIENSDLVRQLSKVTADAR 143
             +     ++L  Q S + A  R
Sbjct: 639 LAEASERVNELTAQNSSLIAQKR 661



 Score = 26.2 bits (58), Expect = 8.8
 Identities = 25/84 (29%), Positives = 46/84 (54%)

Query: 1   MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 60
           ++AR+ ELE E++ E++  A+ +K    + R ++EL  ++EE         +L  K +A+
Sbjct: 748 LEARVRELEAELDGEQRRHAETQKNLRKMERRVKELQFQVEEDKKNLERLQDLVDKLQAK 807

Query: 61  MSKLRRDLEEANIQHESTLANLRK 84
           +   +R LEEA    +  L+  RK
Sbjct: 808 IKTYKRQLEEAEEVAQINLSKYRK 831


>gnl|CDD|217933 pfam04156, IncA, IncA protein.  Chlamydia trachomatis is an
           obligate intracellular bacterium that develops within a
           parasitophorous vacuole termed an inclusion. The
           inclusion is non-fusogenic with lysosomes but intercepts
           lipids from a host cell exocytic pathway. Initiation of
           chlamydial development is concurrent with modification
           of the inclusion membrane by a set of C.
           trachomatis-encoded proteins collectively designated
           Incs. One of these Incs, IncA, is functionally
           associated with the homotypic fusion of inclusions. This
           family probably includes members of the wider Inc family
           rather than just IncA.
          Length = 186

 Score = 37.8 bits (88), Expect = 7e-04
 Identities = 25/116 (21%), Positives = 49/116 (42%)

Query: 9   EEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDL 68
            +++E  +   ++ ++Q ++L  ELE+L ER+ E         E  +     +  L   L
Sbjct: 71  PQKLEELQGELSELKQQLSELQEELEDLEERIAELESELEDLKEDLQLLRELLKSLEERL 130

Query: 69  EEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVARKKL 124
           E      +     LR+   D   E+ E  ++L +L+  L      I +L    ++L
Sbjct: 131 ESLEESIKELAKELRELRQDLREEVEELREELERLQENLQRLQEAIQELQSLLEQL 186


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 38.1 bits (89), Expect = 0.001
 Identities = 26/117 (22%), Positives = 51/117 (43%), Gaps = 4/117 (3%)

Query: 1   MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 60
           ++ RI   E  +E   +   +  +    LA E+EEL E +EE      +++E      A 
Sbjct: 829 LERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELIEEL----ESELEALLNERAS 884

Query: 61  MSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDL 117
           + +    L     +    L  L  K ++   E+ E  ++L +L+ +L+     I++L
Sbjct: 885 LEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEVRIDNL 941



 Score = 37.3 bits (87), Expect = 0.002
 Identities = 33/144 (22%), Positives = 68/144 (47%), Gaps = 3/144 (2%)

Query: 2   QARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEM 61
              I  LE++ ++ R+  A  E+Q  +L  +LEEL  +L+E     +   E  ++ + E+
Sbjct: 294 ANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEEL 353

Query: 62  SKLRRDLEEANIQH---ESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLD 118
             L  +LEE   +    ES L  L ++     S++ +   Q+  L  +++     +  L+
Sbjct: 354 ESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLE 413

Query: 119 VARKKLIIENSDLVRQLSKVTADA 142
             R++L  E  +L+++L +     
Sbjct: 414 DRRERLQQEIEELLKKLEEAELKE 437



 Score = 36.6 bits (85), Expect = 0.003
 Identities = 28/126 (22%), Positives = 59/126 (46%), Gaps = 3/126 (2%)

Query: 2   QARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEM 61
           +A+I +L+EE++  R++  +   +   L  E   L ERLE      +A     +  E ++
Sbjct: 788 EAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQI 847

Query: 62  SKLRRDLEEANIQHES---TLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLD 118
            +L  D+E    + E     +  L  +    ++E     + L  L+++L+E +  + +L+
Sbjct: 848 EELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELE 907

Query: 119 VARKKL 124
             R +L
Sbjct: 908 SKRSEL 913



 Score = 36.6 bits (85), Expect = 0.003
 Identities = 41/148 (27%), Positives = 74/148 (50%), Gaps = 4/148 (2%)

Query: 2   QARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEM 61
           +  I ELEE++E   +  A+ EK  A+L +ELEEL E LE+       ++E   ++ + +
Sbjct: 676 RREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQL----RKELEELSRQISAL 731

Query: 62  SKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVAR 121
            K    LE    Q E  +A L K+  +  +E+ E  ++L + + +L EA   I +L+   
Sbjct: 732 RKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQI 791

Query: 122 KKLIIENSDLVRQLSKVTADARMWKIKY 149
           ++L  E   L   L ++ A+  +   + 
Sbjct: 792 EQLKEELKALREALDELRAELTLLNEEA 819



 Score = 36.2 bits (84), Expect = 0.004
 Identities = 29/118 (24%), Positives = 55/118 (46%), Gaps = 1/118 (0%)

Query: 1   MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 60
           +++R+ ELEE++E  R   A+ E Q A L  E+E L  RLE          +  ++   +
Sbjct: 370 LESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLKK 429

Query: 61  MSKLRRDLEEANI-QHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDL 117
           + +      +A + + E  L  L+++       + E  ++L + +  LD A R +  L
Sbjct: 430 LEEAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAERELAQL 487



 Score = 35.8 bits (83), Expect = 0.005
 Identities = 27/127 (21%), Positives = 54/127 (42%), Gaps = 3/127 (2%)

Query: 1   MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 60
           +  +I  L +++        + E++ A L++EL EL   +EE         E   + EAE
Sbjct: 724 LSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAE 783

Query: 61  MSKLRRDLEEANIQH---ESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDL 117
           + +L   +E+   +       L  LR +      E     ++L  L+ ++    R + DL
Sbjct: 784 IEELEAQIEQLKEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDL 843

Query: 118 DVARKKL 124
           +   ++L
Sbjct: 844 EEQIEEL 850



 Score = 33.5 bits (77), Expect = 0.032
 Identities = 26/126 (20%), Positives = 55/126 (43%), Gaps = 4/126 (3%)

Query: 2   QARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEM 61
              + ELE E+E   +   +AE++ A+   E+EEL  ++E+      A  E   +  AE+
Sbjct: 753 SKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAEL 812

Query: 62  SKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVAR 121
           + L  +           L +L ++       + +  +Q+ +L   ++     I +L+   
Sbjct: 813 TLLNEEAANL----RERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEIEELEELI 868

Query: 122 KKLIIE 127
           ++L  E
Sbjct: 869 EELESE 874



 Score = 33.5 bits (77), Expect = 0.034
 Identities = 35/136 (25%), Positives = 60/136 (44%), Gaps = 11/136 (8%)

Query: 2   QARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEM 61
           +A +  L EE    R+     E++ A   R LE+L E++EE     S  IE      AE+
Sbjct: 809 RAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEEL----SEDIESL---AAEI 861

Query: 62  SKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVAR 121
            +L   +EE   + E+ L N R    +A   +     +L +L  +L E     ++L    
Sbjct: 862 EELEELIEELESELEA-LLNERASLEEA---LALLRSELEELSEELRELESKRSELRREL 917

Query: 122 KKLIIENSDLVRQLSK 137
           ++L  + + L  +L  
Sbjct: 918 EELREKLAQLELRLEG 933



 Score = 32.3 bits (74), Expect = 0.084
 Identities = 25/139 (17%), Positives = 53/139 (38%), Gaps = 4/139 (2%)

Query: 1   MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGA---TSAQIELNKKR 57
           ++ ++ EL+EE+E       + E +  +L   LEEL E+LE           QI      
Sbjct: 342 LEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNE 401

Query: 58  EAEMSKLRRDLEEA-NIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTIND 116
              +      LE+      +     L+K     + E+  ++++L +   +L E    + +
Sbjct: 402 IERLEARLERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEE 461

Query: 117 LDVARKKLIIENSDLVRQL 135
                ++ + E    +   
Sbjct: 462 ALEELREELEEAEQALDAA 480



 Score = 31.2 bits (71), Expect = 0.22
 Identities = 30/120 (25%), Positives = 49/120 (40%), Gaps = 11/120 (9%)

Query: 1   MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 60
           ++  I ELE E+E     RA  E+  A L  ELEEL E L E     S      ++   +
Sbjct: 864 LEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELREK 923

Query: 61  MSKLRRDLEEANIQHESTLANLR-----------KKHNDAVSEMGEQIDQLNKLKTKLDE 109
           +++L   LE   ++ ++    L               N    +  E   +L +L+ K+ E
Sbjct: 924 LAQLELRLEGLEVRIDNLQERLSEEYSLTLEEAEALENKIEDDEEEARRRLKRLENKIKE 983



 Score = 26.6 bits (59), Expect = 6.9
 Identities = 34/132 (25%), Positives = 65/132 (49%), Gaps = 7/132 (5%)

Query: 13  EVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEAN 72
           E+ERQ +   E+Q A+ A   +EL   L E   A        ++   E+ +L+ +L+EA 
Sbjct: 197 ELERQLK-SLERQ-AEKAERYKELKAELRELELALLVL--RLEELREELEELQEELKEAE 252

Query: 73  IQHESTLANLRK---KHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVARKKLIIENS 129
            + E   A L++   K  +   E+ E  +++ +L+ +L      I+ L+  ++ L    +
Sbjct: 253 EELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRERLA 312

Query: 130 DLVRQLSKVTAD 141
           +L RQL ++ A 
Sbjct: 313 NLERQLEELEAQ 324



 Score = 26.2 bits (58), Expect = 8.8
 Identities = 32/161 (19%), Positives = 69/161 (42%), Gaps = 20/161 (12%)

Query: 3   ARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEE-----------------AGG 45
            R+ EL EE+E  ++   +AE++  +L  EL+EL E+LEE                    
Sbjct: 232 LRLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELY 291

Query: 46  ATSAQIELNKKREAEMSKLRRDLEEANIQHESTLANLRKKHNDA---VSEMGEQIDQLNK 102
           A + +I   ++++  + +   +LE    + E+ L  L  K ++    ++E+ E++++L +
Sbjct: 292 ALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKE 351

Query: 103 LKTKLDEANRTINDLDVARKKLIIENSDLVRQLSKVTADAR 143
               L+     +       +  + E  + +  L    A   
Sbjct: 352 ELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQLE 392


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 37.7 bits (88), Expect = 0.001
 Identities = 26/129 (20%), Positives = 68/129 (52%), Gaps = 4/129 (3%)

Query: 3   ARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMS 62
           AR+ E+E+++      +   EK+  +L  +  +L E+++        +IE    ++ E+ 
Sbjct: 812 ARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIE----KEIENLNGKKEELE 867

Query: 63  KLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVARK 122
           +   +LE A    ES L +L+K+ ++  +++ E   ++ +L+ ++++  + +++L    +
Sbjct: 868 EELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLE 927

Query: 123 KLIIENSDL 131
            L  E S++
Sbjct: 928 ALEEELSEI 936



 Score = 35.0 bits (81), Expect = 0.011
 Identities = 22/118 (18%), Positives = 60/118 (50%), Gaps = 4/118 (3%)

Query: 8   LEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRD 67
           LE+E++  ++ R   ++Q   + +E+E L  + EE       ++E  +    ++     D
Sbjct: 831 LEKEIQELQEQRIDLKEQIKSIEKEIENLNGKKEE----LEEELEELEAALRDLESRLGD 886

Query: 68  LEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVARKKLI 125
           L++   + E+ L  L +K  +  +++ ++  +L++LK KL+     +++++  + +  
Sbjct: 887 LKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDE 944



 Score = 33.9 bits (78), Expect = 0.024
 Identities = 37/141 (26%), Positives = 62/141 (43%)

Query: 2   QARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEM 61
           Q R+ E   E+E E  S  ++  ++     + EE   +LE       A+IE  ++   E 
Sbjct: 289 QLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEE 348

Query: 62  SKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVAR 121
            K R  L E   + +  L +LR +  +   E  E  D+L   + KL++  R IN+L    
Sbjct: 349 RKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKREL 408

Query: 122 KKLIIENSDLVRQLSKVTADA 142
            +L  E   L  +L+ + A  
Sbjct: 409 DRLQEELQRLSEELADLNAAI 429



 Score = 32.0 bits (73), Expect = 0.12
 Identities = 37/165 (22%), Positives = 68/165 (41%), Gaps = 20/165 (12%)

Query: 3   ARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIE---------L 53
              YEL +E E   + +   E+Q A L  ELE+L E + E       +IE         +
Sbjct: 223 YEGYELLKEKEALERQKEAIERQLASLEEELEKLTEEISEL-EKRLEEIEQLLEELNKKI 281

Query: 54  NKKREAEMSKLRRDLEEANIQHESTLANLRKKHN---DAVSEMGEQIDQLNKLKTKLDEA 110
               E E  +++  + E   +  S   ++ +K     DA   + +   +++KL  +++E 
Sbjct: 282 KDLGEEEQLRVKEKIGELEAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEEL 341

Query: 111 NRTINDLDVARKKLIIENS-------DLVRQLSKVTADARMWKIK 148
            R I +    R KL  E +       DL  +L +V  +    + +
Sbjct: 342 EREIEEERKRRDKLTEEYAELKEELEDLRAELEEVDKEFAETRDE 386



 Score = 31.2 bits (71), Expect = 0.23
 Identities = 21/116 (18%), Positives = 56/116 (48%), Gaps = 4/116 (3%)

Query: 1   MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 60
           ++ R+  L+ E+   +    + E +  +L++EL +   ++ E       +IE  ++ E +
Sbjct: 679 LRERLEGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEI----EKEIEQLEQEEEK 734

Query: 61  MSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTIND 116
           + +   +LEE     E  + N++ +  +  + + E  + L+KL+  L++    ++ 
Sbjct: 735 LKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSH 790



 Score = 30.0 bits (68), Expect = 0.50
 Identities = 30/144 (20%), Positives = 61/144 (42%), Gaps = 4/144 (2%)

Query: 2   QARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEM 61
           +  + +L  E+E   +  A+   +  D   +LE+L   + E         E  ++   E+
Sbjct: 363 KEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEEL 422

Query: 62  SKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVAR 121
           + L   +       E+ +  L ++  D   E+ +Q  +L +L   L +  + + DL    
Sbjct: 423 ADLNAAIAGI----EAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEY 478

Query: 122 KKLIIENSDLVRQLSKVTADARMW 145
            ++  E S L R+L++  A AR  
Sbjct: 479 DRVEKELSKLQRELAEAEAQARAS 502



 Score = 26.6 bits (59), Expect = 8.3
 Identities = 30/165 (18%), Positives = 66/165 (40%), Gaps = 15/165 (9%)

Query: 2   QARIYELEEEVEVERQSRAKAEKQRADLARELEEL------------GERLEEAGGATSA 49
           +  +  LE+E+E  +    + E +  +L  +L +L              R+ E     S 
Sbjct: 743 EEDLSSLEQEIENVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSRIPEIQAELSK 802

Query: 50  QIELNKKREAEMSKLRRDLEEANIQH---ESTLANLRKKHNDAVSEMGEQIDQLNKLKTK 106
             E   + EA + ++ + L    ++    E  +  L+++  D   ++     ++  L  K
Sbjct: 803 LEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQIKSIEKEIENLNGK 862

Query: 107 LDEANRTINDLDVARKKLIIENSDLVRQLSKVTADARMWKIKYET 151
            +E    + +L+ A + L     DL ++  ++ A  R  + K E 
Sbjct: 863 KEELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEE 907


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 37.0 bits (86), Expect = 0.003
 Identities = 34/134 (25%), Positives = 69/134 (51%), Gaps = 4/134 (2%)

Query: 2   QARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEM 61
           + R+  LE E+E   Q R + E++  +L  E+EEL E+L+E       ++E  +K   E+
Sbjct: 806 ERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDEL----EEELEELEKELEEL 861

Query: 62  SKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVAR 121
            +   +LE    + E  L  L ++  +   E+ E   +L +LK ++++    + +L+   
Sbjct: 862 KEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKL 921

Query: 122 KKLIIENSDLVRQL 135
           ++L +E  +L  +L
Sbjct: 922 ERLEVELPELEEEL 935



 Score = 35.5 bits (82), Expect = 0.009
 Identities = 32/146 (21%), Positives = 67/146 (45%), Gaps = 4/146 (2%)

Query: 1   MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 60
           ++A++ +LEEE++  +      E    +L R+LEEL  +LEE       ++   ++   +
Sbjct: 679 LEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKR----ELAALEEELEQ 734

Query: 61  MSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVA 120
           +     +LEE   + E  L  L+++  +   E+    + L KLK +++E       L   
Sbjct: 735 LQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEE 794

Query: 121 RKKLIIENSDLVRQLSKVTADARMWK 146
            ++L  E  +  R+L  +  +    +
Sbjct: 795 LEELEEELEEAERRLDALERELESLE 820



 Score = 35.1 bits (81), Expect = 0.010
 Identities = 32/146 (21%), Positives = 75/146 (51%), Gaps = 4/146 (2%)

Query: 1   MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 60
           +Q R+ ELEEE+E   ++ AK +++  +L  + + L E LEE       ++E  ++R   
Sbjct: 756 LQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEE----LEEELEEAERRLDA 811

Query: 61  MSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVA 120
           + +    LE+   + E  +  L ++  +   ++ E  ++L +L+ +L+E    + +L+  
Sbjct: 812 LERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAE 871

Query: 121 RKKLIIENSDLVRQLSKVTADARMWK 146
           +++L  E  +L  +  ++  + R  +
Sbjct: 872 KEELEDELKELEEEKEELEEELRELE 897



 Score = 35.1 bits (81), Expect = 0.012
 Identities = 25/140 (17%), Positives = 63/140 (45%)

Query: 2   QARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEM 61
           + +I  L+EE+E       + E+  A+L    EEL E+L           E  ++  AE+
Sbjct: 329 KEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAEL 388

Query: 62  SKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVAR 121
                ++     + +  + +L ++       + +  ++L +L+ +L+E    + +L+   
Sbjct: 389 EAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEEL 448

Query: 122 KKLIIENSDLVRQLSKVTAD 141
           ++L  +  +L  +L ++  +
Sbjct: 449 EELEEQLEELRDRLKELERE 468



 Score = 33.5 bits (77), Expect = 0.032
 Identities = 30/141 (21%), Positives = 63/141 (44%), Gaps = 7/141 (4%)

Query: 1   MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 60
           ++ R+ ELE E+E   +   + +++   L  ELEE    LEE        +   ++ + E
Sbjct: 307 LRERLEELENELEELEERLEELKEKIEALKEELEERETLLEEL----EQLLAELEEAKEE 362

Query: 61  MSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVA 120
           + +    L E     E     LR++  +  +E+ E  ++L +LK +++     +  L   
Sbjct: 363 LEEKLSALLEE---LEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSER 419

Query: 121 RKKLIIENSDLVRQLSKVTAD 141
            + L  E  +L  +L ++  +
Sbjct: 420 LEDLKEELKELEAELEELQTE 440



 Score = 33.1 bits (76), Expect = 0.053
 Identities = 37/167 (22%), Positives = 76/167 (45%), Gaps = 14/167 (8%)

Query: 1   MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGG----------ATSAQ 50
           ++  + EL+EE+   ++   + E + + L   LEEL   LEE             A   +
Sbjct: 279 LREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEE 338

Query: 51  IELNKKREAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLN----KLKTK 106
           +E  +    E+ +L  +LEEA  + E  L+ L ++  +    + E++ +L     +++ +
Sbjct: 339 LEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNE 398

Query: 107 LDEANRTINDLDVARKKLIIENSDLVRQLSKVTADARMWKIKYETNK 153
           L+E  R I  L+   ++L     DL  +L ++ A+    + + E   
Sbjct: 399 LEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELN 445



 Score = 33.1 bits (76), Expect = 0.054
 Identities = 33/150 (22%), Positives = 70/150 (46%), Gaps = 3/150 (2%)

Query: 7   ELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRR 66
           ELE+ +    +++ + E++ + L  ELEEL E L E      A++   +    E+ +   
Sbjct: 348 ELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIE 407

Query: 67  DLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVARKKL-- 124
            LEE   +    L +L+++  +  +E+ E   +L +L  +L+E    + +L    K+L  
Sbjct: 408 SLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELRDRLKELER 467

Query: 125 -IIENSDLVRQLSKVTADARMWKIKYETNK 153
            + E  + +++L K  +       + E  +
Sbjct: 468 ELAELQEELQRLEKELSSLEARLDRLEAEQ 497



 Score = 32.8 bits (75), Expect = 0.067
 Identities = 32/152 (21%), Positives = 69/152 (45%), Gaps = 10/152 (6%)

Query: 2   QARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEM 61
           +  + +L+EE+E   + R   +++  +L  ELEE   RL+       +  +  ++ E E+
Sbjct: 771 EEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEI 830

Query: 62  SKLRRDLEEANIQ----------HESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEAN 111
            +L  ++EE   +           E  L  L+++  +  +E  E  D+L +L+ + +E  
Sbjct: 831 EELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELE 890

Query: 112 RTINDLDVARKKLIIENSDLVRQLSKVTADAR 143
             + +L+    +L  E   L  +L ++ A   
Sbjct: 891 EELRELESELAELKEEIEKLRERLEELEAKLE 922



 Score = 31.2 bits (71), Expect = 0.20
 Identities = 36/155 (23%), Positives = 75/155 (48%), Gaps = 3/155 (1%)

Query: 2   QARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEM 61
           +  +  LE+ +E  R+   + E+Q  +L REL  L E LE+         E  ++ E E+
Sbjct: 694 KNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEEL 753

Query: 62  SKLRRDLEEANIQHEST---LANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLD 118
            +L+  LEE   + ES    LA L+++  +   +     ++L +L+ +L+EA R ++ L+
Sbjct: 754 EELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALE 813

Query: 119 VARKKLIIENSDLVRQLSKVTADARMWKIKYETNK 153
              + L      L +++ ++  +    + K +  +
Sbjct: 814 RELESLEQRRERLEQEIEELEEEIEELEEKLDELE 848



 Score = 30.1 bits (68), Expect = 0.53
 Identities = 30/122 (24%), Positives = 55/122 (45%), Gaps = 3/122 (2%)

Query: 2   QARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEM 61
           + R   LE+E+E   +   + E++  +L  ELEEL + LEE            ++ E E+
Sbjct: 820 EQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDEL 879

Query: 62  SKLRRDLEEANIQ---HESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLD 118
            +L  + EE   +    ES LA L+++       + E   +L +L+ +L E    + +  
Sbjct: 880 KELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAKLERLEVELPELEEELEEEY 939

Query: 119 VA 120
             
Sbjct: 940 ED 941



 Score = 29.7 bits (67), Expect = 0.67
 Identities = 29/106 (27%), Positives = 49/106 (46%), Gaps = 1/106 (0%)

Query: 2   QARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEM 61
           +     L EE+       A+   +  +L RE+E L ERLE          E  K+ EAE+
Sbjct: 375 EELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAEL 434

Query: 62  SKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKL 107
            +L+ +LEE N + E     L +   D + E+  ++ +L +   +L
Sbjct: 435 EELQTELEELNEELEELEEQLEELR-DRLKELERELAELQEELQRL 479



 Score = 29.7 bits (67), Expect = 0.74
 Identities = 30/129 (23%), Positives = 62/129 (48%), Gaps = 3/129 (2%)

Query: 2   QARIYELEEEVEVERQSRAKAEKQRADLARELEEL---GERLEEAGGATSAQIELNKKRE 58
           Q+R+ ELEEE+E   +   + +++  +L  ELE L     +L+E       + +  ++  
Sbjct: 736 QSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEEL 795

Query: 59  AEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLD 118
            E+ +   + E      E  L +L ++      E+ E  +++ +L+ KLDE    + +L+
Sbjct: 796 EELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELE 855

Query: 119 VARKKLIIE 127
              ++L  E
Sbjct: 856 KELEELKEE 864



 Score = 29.7 bits (67), Expect = 0.76
 Identities = 33/141 (23%), Positives = 64/141 (45%)

Query: 3   ARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMS 62
               ELEE++    +   +  +   +   ELE     +         +IE  ++R   +S
Sbjct: 358 EAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLS 417

Query: 63  KLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVARK 122
           +   DL+E   + E+ L  L+ +  +   E+ E  +QL +L+ +L E  R + +L    +
Sbjct: 418 ERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELRDRLKELERELAELQEELQ 477

Query: 123 KLIIENSDLVRQLSKVTADAR 143
           +L  E S L  +L ++ A+ R
Sbjct: 478 RLEKELSSLEARLDRLEAEQR 498



 Score = 28.9 bits (65), Expect = 1.2
 Identities = 29/124 (23%), Positives = 56/124 (45%), Gaps = 4/124 (3%)

Query: 14  VERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANI 73
             ++S    +++  +L  EL EL  +LE+      +     +  E  + +LRR LEE   
Sbjct: 657 RNKRSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELER 716

Query: 74  QHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVARKKLIIENSDLVR 133
           Q E     L+++      E+ +   +L +L+ +L+E    + +L    ++L  E   L  
Sbjct: 717 QLEE----LKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEE 772

Query: 134 QLSK 137
            L+K
Sbjct: 773 ALAK 776



 Score = 28.1 bits (63), Expect = 2.0
 Identities = 29/136 (21%), Positives = 60/136 (44%), Gaps = 12/136 (8%)

Query: 1   MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 60
           ++  + ELE E E       + E+++ +L  EL EL   L E         E  ++ EA+
Sbjct: 861 LKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEAK 920

Query: 61  MSKLRRDLEEANIQ--------HESTLANLRKKHNDAVSEMG----EQIDQLNKLKTKLD 108
           + +L  +L E   +         E+ L    ++  + +  +G      I++  +++ + +
Sbjct: 921 LERLEVELPELEEELEEEYEDTLETELEREIERLEEEIEALGPVNLRAIEEYEEVEERYE 980

Query: 109 EANRTINDLDVARKKL 124
           E      DL+ A++KL
Sbjct: 981 ELKSQREDLEEAKEKL 996



 Score = 27.4 bits (61), Expect = 4.3
 Identities = 27/137 (19%), Positives = 61/137 (44%), Gaps = 7/137 (5%)

Query: 1   MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 60
           +Q  + E E+E+E  +    +  ++  +L  EL EL E +EE       +I L ++R  E
Sbjct: 258 LQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEE----LEGEISLLRERLEE 313

Query: 61  MSKLRRDLEEANIQHESTLANLRKK---HNDAVSEMGEQIDQLNKLKTKLDEANRTINDL 117
           +     +LEE   + +  +  L+++       + E+ + + +L + K +L+E    + + 
Sbjct: 314 LENELEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEE 373

Query: 118 DVARKKLIIENSDLVRQ 134
                + + E    +  
Sbjct: 374 LEELFEALREELAELEA 390


>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
           Provisional.
          Length = 1123

 Score = 32.2 bits (74), Expect = 0.10
 Identities = 19/104 (18%), Positives = 36/104 (34%), Gaps = 15/104 (14%)

Query: 7   ELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRR 66
            L++EV   +Q       Q    ARE  +     E       A   L  + E +  +L  
Sbjct: 146 ALQQEVLTLKQ-------QLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELEA 198

Query: 67  DLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEA 110
            LE+   +   T    ++K  +   +  ++++        L E 
Sbjct: 199 QLEQLQEKAAETSQERKQKRKEITDQAAKRLE--------LSEE 234


>gnl|CDD|234172 TIGR03345, VI_ClpV1, type VI secretion ATPase, ClpV1 family.
           Members of this protein family are homologs of ClpB, an
           ATPase associated with chaperone-related functions.
           These ClpB homologs, designated ClpV1, are a key
           component of the bacterial pathogenicity-associated type
           VI secretion system [Protein fate, Protein and peptide
           secretion and trafficking, Cellular processes,
           Pathogenesis].
          Length = 852

 Score = 31.8 bits (73), Expect = 0.12
 Identities = 19/82 (23%), Positives = 28/82 (34%), Gaps = 8/82 (9%)

Query: 1   MQARIYELEEEVEVERQSRAKAEK---QRADLARELEELGERLEEAGGATSAQIELNKKR 57
              R+ EL  E+       A  E    Q  +L   +  L   LE    A +         
Sbjct: 448 HDERLAELRAELAALEAELAALEARWQQEKELVEAILALRAELEADADAPADDDAA---L 504

Query: 58  EAEMSKLRRDLEEANIQHESTL 79
            A++++L   L  A  Q E  L
Sbjct: 505 RAQLAELEAALASA--QGEEPL 524


>gnl|CDD|236508 PRK09426, PRK09426, methylmalonyl-CoA mutase; Reviewed.
          Length = 714

 Score = 31.8 bits (73), Expect = 0.13
 Identities = 33/109 (30%), Positives = 49/109 (44%), Gaps = 23/109 (21%)

Query: 14  VERQSRAKAEKQRADLARELEELG------------ERLEEAGGATSAQIELNKKREAEM 61
           VE  +   AEK  A    E+E LG             R+EEA   T A+I+  K+    +
Sbjct: 399 VESLTHELAEKAWA-HIEEVEALGGMAKAIEAGIPKLRIEEAAARTQARIDSGKQVIVGV 457

Query: 62  SKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEA 110
           +K R D E+          ++ +  N AV    EQI +L +L+ + DEA
Sbjct: 458 NKYRLDKEDP--------IDVLEVDNTAVRA--EQIARLERLRAERDEA 496


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 31.3 bits (72), Expect = 0.16
 Identities = 35/149 (23%), Positives = 73/149 (48%), Gaps = 14/149 (9%)

Query: 2   QARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEM 61
           +  + E +EE+    + R + EK+  +   EL++L +RL +       ++EL +KRE E+
Sbjct: 56  KEALLEAKEEIH---KLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREEEL 112

Query: 62  SKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVAR 121
            K  ++LE+   + E     L +   + + E+ E+I  L       +EA   +  L+   
Sbjct: 113 EKKEKELEQKQQELEKKEEELEELIEEQLQEL-ERISGLTA-----EEAKEIL--LEKVE 164

Query: 122 KKLIIENSDLVRQL---SKVTADARMWKI 147
           ++   E + L++++   +K  AD +  +I
Sbjct: 165 EEARHEAAVLIKEIEEEAKEEADKKAKEI 193


>gnl|CDD|184281 PRK13729, PRK13729, conjugal transfer pilus assembly protein TraB;
           Provisional.
          Length = 475

 Score = 31.3 bits (71), Expect = 0.17
 Identities = 13/49 (26%), Positives = 22/49 (44%)

Query: 1   MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSA 49
           MQ    +++++ E  R+      KQR D  R +E+LG+          A
Sbjct: 74  MQVTAAQMQKQYEEIRRELDVLNKQRGDDQRRIEKLGQDNAALAEQVKA 122


>gnl|CDD|182871 PRK10966, PRK10966, exonuclease subunit SbcD; Provisional.
          Length = 407

 Score = 31.4 bits (72), Expect = 0.17
 Identities = 27/94 (28%), Positives = 42/94 (44%), Gaps = 17/94 (18%)

Query: 1   MQARIYELEEEVEVE----RQSRAKAEKQRADLARE-LEELGERLEEAGGATSAQIELNK 55
           +Q RI  L E + VE    R+SR + E+  A   RE L EL   +EE      A  EL++
Sbjct: 326 IQRRIQALTESLPVEVLLVRRSREQRERSLASEQRETLSEL--SVEEVFERRLALEELDE 383

Query: 56  KREAEMSKLRRDLEEANIQHESTLANLRKKHNDA 89
            ++  +++L              L  L ++H  A
Sbjct: 384 PQQQRLTQL----------FTQVLHELAEEHEAA 407


>gnl|CDD|236038 PRK07521, flgK, flagellar hook-associated protein FlgK; Validated.
          Length = 483

 Score = 31.4 bits (72), Expect = 0.18
 Identities = 10/38 (26%), Positives = 20/38 (52%), Gaps = 3/38 (7%)

Query: 77  STLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTI 114
             + + R    DA +E+ + +D LN L  + ++AN  +
Sbjct: 143 DAVQSAR---ADADAEIADSVDTLNDLLAQFEDANNAV 177


>gnl|CDD|205692 pfam13514, AAA_27, AAA domain.  This domain is found in a number of
           double-strand DNA break proteins. This domain contains a
           P-loop motif.
          Length = 1118

 Score = 31.3 bits (71), Expect = 0.21
 Identities = 26/135 (19%), Positives = 48/135 (35%)

Query: 10  EEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLE 69
              E   +   +AEK+ ++ A  L+E   RL     A             E S   R+L 
Sbjct: 808 AAAEKLAEEIEEAEKEVSEAAAALDEAEARLTALLRAARCTTIEELLAAVERSDTYRELR 867

Query: 70  EANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVARKKLIIENS 129
           +     E TL       +             ++L  +L+E  R I +L+    +L  E  
Sbjct: 868 KRIAALERTLVRAGGGLSLEALVAEAAALDPDELPARLEELARDIEELEEELNELAQEVG 927

Query: 130 DLVRQLSKVTADARM 144
              ++L+++   +  
Sbjct: 928 AAKQELARMDGGSTA 942


>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342).  This
           family of bacterial proteins has no known function. The
           proteins are in the region of 500-600 amino acid
           residues in length.
          Length = 450

 Score = 31.1 bits (71), Expect = 0.22
 Identities = 20/87 (22%), Positives = 41/87 (47%), Gaps = 6/87 (6%)

Query: 42  EAGGATSAQIELN----KKREAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQI 97
            AG  T  ++ ++    K+   E+ +  ++LEE   + +  L  L KK    +    E+ 
Sbjct: 316 PAGTKTEIEVGVDFPELKEELKELEEELKELEEELEKIKKLLKKLPKKARGQLPP--EKR 373

Query: 98  DQLNKLKTKLDEANRTINDLDVARKKL 124
           +QL KL    ++ +  + +L+   K+L
Sbjct: 374 EQLEKLLETKEKLSEELEELEEELKEL 400


>gnl|CDD|191367 pfam05761, 5_nucleotid, 5' nucleotidase family.  This family of
           eukaryotic proteins includes 5' nucleotidase enzymes,
           such as purine 5'-nucleotidase EC:3.1.3.5.
          Length = 448

 Score = 30.7 bits (70), Expect = 0.27
 Identities = 19/68 (27%), Positives = 29/68 (42%), Gaps = 16/68 (23%)

Query: 5   IYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKL 64
           + ELE E+E+    + + E+ +      L+ L ERL                  AE+S L
Sbjct: 323 VPELEREIEIWNTEQYRFEELQ-----RLDILLERLY----DHLDVH-------AELSTL 366

Query: 65  RRDLEEAN 72
           R DL E +
Sbjct: 367 RPDLSELD 374


>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
          Length = 880

 Score = 30.8 bits (70), Expect = 0.27
 Identities = 26/117 (22%), Positives = 55/117 (47%), Gaps = 2/117 (1%)

Query: 10  EEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKRE--AEMSKLRRD 67
           E +E +R+   + E +  DL  E+EE+ ERLE A     A+  + +  E   ++ +L  +
Sbjct: 468 ETIEEDRERVEELEAELEDLEEEVEEVEERLERAEDLVEAEDRIERLEERREDLEELIAE 527

Query: 68  LEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVARKKL 124
             E   +       LR++  +  +E  E+ +   + + + +EA   + +L+    +L
Sbjct: 528 RRETIEEKRERAEELRERAAELEAEAEEKREAAAEAEEEAEEAREEVAELNSKLAEL 584



 Score = 30.0 bits (68), Expect = 0.49
 Identities = 31/143 (21%), Positives = 65/143 (45%), Gaps = 3/143 (2%)

Query: 4   RIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEE---AGGATSAQIELNKKREAE 60
              +L  E  ++       E +R +L    EEL +RLEE   A  A + + E  ++   +
Sbjct: 294 ERDDLLAEAGLDDADAEAVEARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADD 353

Query: 61  MSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVA 120
           + +   +L E   + ES L   R+   D   E+ E  +++ +L+ +  +A   + + +  
Sbjct: 354 LEERAEELREEAAELESELEEAREAVEDRREEIEELEEEIEELRERFGDAPVDLGNAEDF 413

Query: 121 RKKLIIENSDLVRQLSKVTADAR 143
            ++L  E  +L  + +++ A  R
Sbjct: 414 LEELREERDELREREAELEATLR 436



 Score = 30.0 bits (68), Expect = 0.53
 Identities = 37/132 (28%), Positives = 62/132 (46%), Gaps = 17/132 (12%)

Query: 2   QARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAG----------GATSAQI 51
           +  +  LE E+E  R++ A+ E++R +LA E+ +L ERLEE            G   A  
Sbjct: 250 REELETLEAEIEDLRETIAETEREREELAEEVRDLRERLEELEEERDDLLAEAGLDDADA 309

Query: 52  ELNKKR----EAEMSKLRRDLEEANIQ---HESTLANLRKKHNDAVSEMGEQIDQLNKLK 104
           E  + R    E    +LR  LEE  +    H     +LR+  +D      E  ++  +L+
Sbjct: 310 EAVEARREELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELREEAAELE 369

Query: 105 TKLDEANRTIND 116
           ++L+EA   + D
Sbjct: 370 SELEEAREAVED 381



 Score = 26.5 bits (59), Expect = 8.6
 Identities = 29/118 (24%), Positives = 53/118 (44%), Gaps = 8/118 (6%)

Query: 7   ELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRR 66
           E E+ +E   + R   E+  A+    +EE  ER EE         E   + EAE  + R 
Sbjct: 506 EAEDRIERLEERREDLEELIAERRETIEEKRERAEELR-------ERAAELEAEAEEKRE 558

Query: 67  DLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVARKKL 124
              EA  + E     +  + N  ++E+ E+I+ L +++T L       ++++  R+K 
Sbjct: 559 AAAEAEEEAEEAREEV-AELNSKLAELKERIESLERIRTLLAAIADAEDEIERLREKR 615


>gnl|CDD|220368 pfam09730, BicD, Microtubule-associated protein Bicaudal-D.  BicD
           proteins consist of three coiled-coiled domains and are
           involved in dynein-mediated minus end-directed transport
           from the Golgi apparatus to the endoplasmic reticulum
           (ER). For full functioning they bind with GSK-3beta
           pfam05350 to maintain the anchoring of microtubules to
           the centromere. It appears that amino-acid residues
           437-617 of BicD and the kinase activity of GSK-3 are
           necessary for the formation of a complex between BicD
           and GSK-3beta in intact cells.
          Length = 711

 Score = 30.6 bits (69), Expect = 0.32
 Identities = 30/86 (34%), Positives = 47/86 (54%), Gaps = 3/86 (3%)

Query: 1   MQARIYELEEEVEVERQ--SRAKAEKQR-ADLARELEELGERLEEAGGATSAQIELNKKR 57
              RI EL+ E++  R   S  +AE +R + L++EL+E  E LE   G    +I+  K R
Sbjct: 32  YLQRILELQAELKQLRAELSNVQAENERLSSLSQELKEENEMLELQRGRLRDEIKEYKFR 91

Query: 58  EAEMSKLRRDLEEANIQHESTLANLR 83
           EA + +   +LEE NI  +  ++ LR
Sbjct: 92  EARLLQDYSELEEENISLQKQVSVLR 117


>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase.  This model represents the
           seryl-tRNA synthetase found in most organisms. This
           protein is a class II tRNA synthetase, and is recognized
           by the pfam model tRNA-synt_2b. The seryl-tRNA
           synthetases of two archaeal species, Methanococcus
           jannaschii and Methanobacterium thermoautotrophicum,
           differ considerably and are included in a different
           model [Protein synthesis, tRNA aminoacylation].
          Length = 418

 Score = 30.4 bits (69), Expect = 0.35
 Identities = 21/116 (18%), Positives = 45/116 (38%)

Query: 26  RADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHESTLANLRKK 85
           R  L    + + E L+  G +    +E     + E  KL  ++EE   +       + K 
Sbjct: 4   RKLLRNNPDLVKESLKARGLSVDIDLEKLIALDDERKKLLSEIEELQAKRNELSKQIGKA 63

Query: 86  HNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVARKKLIIENSDLVRQLSKVTAD 141
                 ++ E   +L +LK +L E +  +  L+   +  ++   ++  +   V  D
Sbjct: 64  KGQKKDKIEEIKKELKELKEELTELSAALKALEAELQDKLLSIPNIPHESVPVGKD 119


>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 908

 Score = 29.7 bits (67), Expect = 0.60
 Identities = 32/120 (26%), Positives = 55/120 (45%)

Query: 5   IYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKL 64
           +  LEE +E   +   K E +  +LA E  EL + LEE       ++E  +K   +  + 
Sbjct: 324 LKSLEERLEKLEEKLEKLESELEELAEEKNELAKLLEERLKELEERLEELEKELEKALER 383

Query: 65  RRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVARKKL 124
            + LEEA  + +  LA L     +   E+ E   +L +L+ +L+E    I  L+    +L
Sbjct: 384 LKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELERELEELEEEIKKLEEQINQL 443


>gnl|CDD|224188 COG1269, NtpI, Archaeal/vacuolar-type H+-ATPase subunit I [Energy
           production and conversion].
          Length = 660

 Score = 29.6 bits (67), Expect = 0.60
 Identities = 15/72 (20%), Positives = 27/72 (37%), Gaps = 5/72 (6%)

Query: 3   ARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEA----GGATSAQIELNKKRE 58
             +YE+ E      +  ++ E+  A++  ELE L   LE           A  E+  + E
Sbjct: 215 FELYEVPEFDGGPSELISELEEVIAEIQDELESLRSELEALAEKIAEELLAVREI-LEIE 273

Query: 59  AEMSKLRRDLEE 70
             +  +   L  
Sbjct: 274 KALGDVLSKLAR 285


>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
          Length = 1113

 Score = 29.9 bits (68), Expect = 0.61
 Identities = 25/112 (22%), Positives = 45/112 (40%), Gaps = 26/112 (23%)

Query: 49  AQIELNKKR---EAEMSKLRRDLEEA-----NIQ-HESTLANLRKKHNDAVSEMGEQIDQ 99
           AQ++   K+   EAE   +++DLE+       I   +     L+++   A +++ +   +
Sbjct: 43  AQLDALNKQKLLEAEDKLVQQDLEQTLALLDKIDRQKEETEQLKQQLAQAPAKLRQAQAE 102

Query: 100 LNKLKTKLDEA-----------------NRTINDLDVARKKLIIENSDLVRQ 134
           L  LK   DE                   +T++ L  A+  L   NS LV  
Sbjct: 103 LEALKDDNDEETRETLSTLSLRQLESRLAQTLDQLQNAQNDLAEYNSQLVSL 154


>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584).  This
           protein is found in bacteria and eukaryotes. Proteins in
           this family are typically between 943 to 1234 amino
           acids in length. This family contains a P-loop motif
           suggesting it is a nucleotide binding protein. It may be
           involved in replication.
          Length = 1198

 Score = 29.7 bits (67), Expect = 0.61
 Identities = 27/140 (19%), Positives = 49/140 (35%), Gaps = 20/140 (14%)

Query: 3   ARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGG----------------- 45
           AR+ EL+++ + E  S         +L R++EEL   +E                     
Sbjct: 750 ARLKELKKQYDRELASLDVDPNTVKELKRQIEELETTIERIAVRRPEVREYRAFMQETWL 809

Query: 46  ATSAQIELNKKREAEMSKLRRDLEEANIQH---ESTLANLRKKHNDAVSEMGEQIDQLNK 102
              +  E       ++ +L    EE   +           RKK       + +Q+DQL++
Sbjct: 810 HRDSLREERPNLAIQLRELESSAEELQQELTRLIKDTKLRRKKLEQERKALEKQLDQLDE 869

Query: 103 LKTKLDEANRTINDLDVARK 122
           L   L +  R + +L     
Sbjct: 870 LLRGLRDEMRQLAELKEPAN 889



 Score = 27.0 bits (60), Expect = 5.9
 Identities = 23/123 (18%), Positives = 52/123 (42%), Gaps = 8/123 (6%)

Query: 1   MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 60
           ++ R+ + EE ++     + +AE+Q      ELEE      EA  A        K+   +
Sbjct: 605 LRERLQQAEEALQSAVAKQKQAEEQLVQANAELEEQKRAEAEARTAL-------KQARLD 657

Query: 61  MSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQL-NKLKTKLDEANRTINDLDV 119
           + +L+ + +    + E  +A  +++    + ++  Q+ QL  + +  L+       +L  
Sbjct: 658 LQRLQNEQQSLKDKLELAIAERKQQAETQLRQLDAQLKQLLEQQQAFLEALKDDFRELRT 717

Query: 120 ARK 122
            R 
Sbjct: 718 ERL 720


>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
           potential nuclease [General function prediction only].
          Length = 290

 Score = 29.5 bits (66), Expect = 0.65
 Identities = 19/70 (27%), Positives = 37/70 (52%)

Query: 1   MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 60
           ++    EL+E++E  ++ + +  K+  +L  E EE+ ERL+      S   E+ KK   E
Sbjct: 133 LKEDYEELKEKLEELQKEKEELLKELEELEAEYEEVQERLKRLEVENSRLEEMLKKLPGE 192

Query: 61  MSKLRRDLEE 70
           +  L++  +E
Sbjct: 193 VYDLKKRWDE 202



 Score = 26.0 bits (57), Expect = 9.7
 Identities = 20/73 (27%), Positives = 38/73 (52%), Gaps = 1/73 (1%)

Query: 28  DLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHESTLANLRKKHN 87
           DL  + EEL E+LEE        ++  ++ EAE  +++  L+   ++  S L  + KK  
Sbjct: 132 DLKEDYEELKEKLEELQKEKEELLKELEELEAEYEEVQERLKRLEVE-NSRLEEMLKKLP 190

Query: 88  DAVSEMGEQIDQL 100
             V ++ ++ D+L
Sbjct: 191 GEVYDLKKRWDEL 203


>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein.  This domain is found in
           cell division proteins which are required for
           kinetochore-spindle association.
          Length = 321

 Score = 29.6 bits (67), Expect = 0.69
 Identities = 19/89 (21%), Positives = 38/89 (42%)

Query: 29  LARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHESTLANLRKKHND 88
           + R+ E L + L           +  +  + E++ LR+  +E N+     L   R++   
Sbjct: 159 MKRDEELLNKDLNLINSIKPKLRKKLQALKEEIASLRQLADELNLCDPLELEKARQELRS 218

Query: 89  AVSEMGEQIDQLNKLKTKLDEANRTINDL 117
              ++ E+  QL +L+ +L E    I  L
Sbjct: 219 LSVKISEKRKQLEELQQELQELTIAIEAL 247


>gnl|CDD|219412 pfam07439, DUF1515, Protein of unknown function (DUF1515).  This
          family consists of several hypothetical bacterial
          proteins of around 130 residues in length. Members of
          this family seem to be found exclusively in Rhizobium
          species. The function of this family is unknown.
          Length = 112

 Score = 28.4 bits (63), Expect = 0.74
 Identities = 15/49 (30%), Positives = 28/49 (57%), Gaps = 3/49 (6%)

Query: 1  MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSA 49
          +QA +  L E++   R+S  +++  RA + R ++EL ER+    G+ S 
Sbjct: 13 LQAEVKNLREDI---RRSEDRSDASRASMYRRMDELVERVTTLEGSVSK 58


>gnl|CDD|148617 pfam07111, HCR, Alpha helical coiled-coil rod protein (HCR).  This
           family consists of several mammalian alpha helical
           coiled-coil rod HCR proteins. The function of HCR is
           unknown but it has been implicated in psoriasis in
           humans and is thought to affect keratinocyte
           proliferation.
          Length = 739

 Score = 29.6 bits (66), Expect = 0.81
 Identities = 32/108 (29%), Positives = 52/108 (48%), Gaps = 21/108 (19%)

Query: 1   MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 60
           ++A+  ELE     E+  RA+AE  RA LA   EE+ + LEE G                
Sbjct: 96  LEAQAEELEALAVAEKAGRAEAEGLRAALAGA-EEVRKNLEEGG---------------- 138

Query: 61  MSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLD 108
               +++LEEA   H+  L++L + H +A+S +  + + L K    L+
Sbjct: 139 ----QQELEEAQRLHQEQLSSLTQAHLEALSSLRSKAEGLEKSLQSLE 182


>gnl|CDD|224202 COG1283, NptA, Na+/phosphate symporter [Inorganic ion transport and
           metabolism].
          Length = 533

 Score = 29.1 bits (66), Expect = 0.84
 Identities = 18/93 (19%), Positives = 33/93 (35%), Gaps = 10/93 (10%)

Query: 55  KKREAEMSKLRRDLEEANIQHESTLANLRKKHN-------DAVSEMGEQIDQLNKLKTKL 107
            +R AE+     +LE      E  L    KK         D + E+        +    L
Sbjct: 399 SRRWAEIIDAAINLEHIGDIIERLLELADKKIANGRAFSEDGLEELDALFALTLEN---L 455

Query: 108 DEANRTINDLDVARKKLIIENSDLVRQLSKVTA 140
             A   +   D+   + ++E    VR+L + ++
Sbjct: 456 RLAISVLVTGDLELARRLVERKKRVRRLERRSS 488


>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein,
           HlyD family.  Type I secretion is an ABC transport
           process that exports proteins, without cleavage of any
           signal sequence, from the cytosol to extracellular
           medium across both inner and outer membranes. The
           secretion signal is found in the C-terminus of the
           transported protein. This model represents the adaptor
           protein between the ATP-binding cassette (ABC) protein
           of the inner membrane and the outer membrane protein,
           and is called the membrane fusion protein. This model
           selects a subfamily closely related to HlyD; it is
           defined narrowly and excludes, for example, colicin V
           secretion protein CvaA and multidrug efflux proteins
           [Protein fate, Protein and peptide secretion and
           trafficking].
          Length = 423

 Score = 28.8 bits (65), Expect = 1.1
 Identities = 25/117 (21%), Positives = 49/117 (41%), Gaps = 3/117 (2%)

Query: 1   MQARIYELEEEVEVERQS-RAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREA 59
           + A    + E ++ ++    ++    RA L   L ++ +   E  G  +    L ++ E 
Sbjct: 117 LSAEDPAVPELIKGQQSLFESRKSTLRAQLELILAQIKQLEAELAGLQAQLQALRQQLEV 176

Query: 60  EMSKL--RRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTI 114
              +L  RR L+E  +     L  L ++  +A  E+G    +L  LK ++DE     
Sbjct: 177 ISEELEARRKLKEKGLVSRLELLELERERAEAQGELGRLEAELEVLKRQIDELQLER 233



 Score = 26.9 bits (60), Expect = 5.1
 Identities = 26/123 (21%), Positives = 61/123 (49%), Gaps = 12/123 (9%)

Query: 1   MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAG-GATSAQIELNKKR-- 57
           + A+I +LE E+    Q++ +A +Q+ ++  E  E   +L+E G  +    +EL ++R  
Sbjct: 149 ILAQIKQLEAELA-GLQAQLQALRQQLEVISEELEARRKLKEKGLVSRLELLELERERAE 207

Query: 58  --------EAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDE 109
                   EAE+  L+R ++E  ++ +      R++  + ++E   ++ +L +   K  +
Sbjct: 208 AQGELGRLEAELEVLKRQIDELQLERQQIEQTFREEVLEELTEAQARLAELRERLNKARD 267

Query: 110 ANR 112
             +
Sbjct: 268 RLQ 270


>gnl|CDD|188998 cd06460, M32_Taq, Peptidase family M32 includes thermostable
           carboxypeptidases TaqCP and PfuCP.  Peptidase family M32
           is a subclass of metallocarboxypeptidases which are
           distributed mainly in bacteria and archaea, and contain
           a HEXXH motif that coordinates a divalent cation such as
           Zn2+ or Co2+, so far only observed in the active site of
           neutral metallopeptidases but not in carboxypeptidases.
           M32 includes the thermostable carboxypeptidases (E.C.
           3.4.17.19) from Thermus aquaticus (TaqCP) and Pyrococcus
           furiosus (PfuCP), which have broad specificities toward
           a wide range of C-terminal substrates that include
           basic, aromatic, neutral and polar amino acids. These
           enzymes have a similar fold to the M3 peptidases such as
           neurolysin and the M2 angiotensin converting enzyme
           (ACE). Novel peptidases from protozoa Trypanosoma cruzi,
           a causative agent of Chagas' disease, and Leishmania
           major, a parasite that causes leishmaniasis, are the
           first eukaryotic M32 enzymes identified so far, thus
           making these enzymes an attractive potential target for
           drug development against these organisms.
          Length = 484

 Score = 29.0 bits (66), Expect = 1.1
 Identities = 18/70 (25%), Positives = 35/70 (50%), Gaps = 8/70 (11%)

Query: 77  STLANLRKKHNDAVS-EMGEQIDQLNKLKTKLDEANRTINDLDVARKKLIIEN---SDLV 132
           +TL+ L   H    S E+GE ++ L + +  LDE       +   R+         ++LV
Sbjct: 36  ATLSGLA--HELLTSPEIGELLEALEEEELALDEVEAAN--VRELRRDYERATKIPAELV 91

Query: 133 RQLSKVTADA 142
            +L+++T++A
Sbjct: 92  EELARLTSEA 101


>gnl|CDD|204614 pfam11221, Med21, Subunit 21 of Mediator complex.  Med21 has been
           known as Srb7 in yeasts, hSrb7 in humans and Trap 19 in
           Drosophila. The heterodimer of the two subunits Med7 and
           Med21 appears to act as a hinge between the middle and
           the tail regions of Mediator.
          Length = 132

 Score = 27.6 bits (62), Expect = 1.8
 Identities = 13/42 (30%), Positives = 21/42 (50%), Gaps = 7/42 (16%)

Query: 2   QARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEA 43
             RI ELEEE+          E +R +  +E E+L +++E  
Sbjct: 91  LRRIKELEEELRE-------VEAEREEAVKEKEKLLKKVEAL 125


>gnl|CDD|109884 pfam00846, Hanta_nucleocap, Hantavirus nucleocapsid protein. 
          Length = 428

 Score = 28.4 bits (63), Expect = 1.9
 Identities = 24/76 (31%), Positives = 33/76 (43%), Gaps = 14/76 (18%)

Query: 61  MSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNK------------LKTKLD 108
           MS L  +L+E    HE  L   R+K  DA   +    D +NK            L+ KL 
Sbjct: 1   MSNLE-ELQEEITAHEQQLVIARQKLKDAEKAVEVDPDDVNKSTLQSRRAAVSALEAKLA 59

Query: 109 EANRTINDLDVARKKL 124
           E  R + D  +A +KL
Sbjct: 60  ELKRQLADR-IATQKL 74


>gnl|CDD|137505 PRK09751, PRK09751, putative ATP-dependent helicase Lhr;
           Provisional.
          Length = 1490

 Score = 28.4 bits (63), Expect = 2.0
 Identities = 17/54 (31%), Positives = 29/54 (53%), Gaps = 6/54 (11%)

Query: 5   IYELEEEVE-VERQSRAKAEKQRADLAREL-----EELGERLEEAGGATSAQIE 52
           I ++EEE++ +    RAK E+   DL REL     E+L +R   +    ++ +E
Sbjct: 871 IRQVEEELQRLAPGRRAKGEEGLFDLLRELGPMTVEDLAQRHTGSSEEVASYLE 924


>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
           unknown].
          Length = 652

 Score = 28.1 bits (63), Expect = 2.2
 Identities = 25/138 (18%), Positives = 57/138 (41%), Gaps = 9/138 (6%)

Query: 9   EEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDL 68
           EE    +  +  +  ++     + +++L E +E      S      ++ + E+ KL  +L
Sbjct: 400 EERPREKEGTEEEERREITVYEKRIKKLEETVERLEEENSELKRELEELKREIEKLESEL 459

Query: 69  EEANIQHESTLANLRKKHNDAVSEMGEQIDQLNK----LKTKLDEANRTINDLDVARKKL 124
           E    +    +   R+     +     +I++L K     K +++E  R + +L   RK  
Sbjct: 460 ERFRREVRDKVRKDRE-----IRARDRRIERLEKELEEKKKRVEELERKLAELRKMRKLE 514

Query: 125 IIENSDLVRQLSKVTADA 142
           +      V+ + K+T +A
Sbjct: 515 LSGKGTPVKVVEKLTLEA 532


>gnl|CDD|129660 TIGR00569, ccl1, cyclin ccl1.  All proteins in this family for
           which functions are known are cyclins that are
           components of TFIIH, a complex that is involved in
           nucleotide excision repair and transcription initiation.
           This family is based on the phylogenomic analysis of JA
           Eisen (1999, Ph.D. Thesis, StanfordUniversity) [DNA
           metabolism, DNA replication, recombination, and repair].
          Length = 305

 Score = 27.9 bits (62), Expect = 2.2
 Identities = 20/82 (24%), Positives = 32/82 (39%), Gaps = 8/82 (9%)

Query: 58  EAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDL 117
           E       R+     I     L  L KK+ +  SE      ++  LK KLDE +    +L
Sbjct: 230 EQLSVPGNREELPQLIDIMRELRILVKKYEEPRSE------KVAALKKKLDECHSIEENL 283

Query: 118 DVA--RKKLIIENSDLVRQLSK 137
                RK+   E+   + + +K
Sbjct: 284 SSVKIRKRKGYEDDTYISKKAK 305


>gnl|CDD|227250 COG4913, COG4913, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 1104

 Score = 28.1 bits (62), Expect = 2.6
 Identities = 24/102 (23%), Positives = 42/102 (41%), Gaps = 12/102 (11%)

Query: 2   QARIYELEEEVEVERQSRAKAE--KQRADLARELEELGERLEEAGGATSAQIELNKKREA 59
            AR+  L EE+ + R S AK E     +++  EL+++ E L            L +    
Sbjct: 789 NARLRRLREEI-IGRMSDAKKEDTAALSEVGAELDDIPEYLARLQ-------TLTEDALP 840

Query: 60  EMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLN 101
           E   L R  E  N   +  +  L    +   + + E+I+ +N
Sbjct: 841 E--FLARFQELLNRSSDDGVTQLLSHLDHERALIEERIEAIN 880


>gnl|CDD|215656 pfam00012, HSP70, Hsp70 protein.  Hsp70 chaperones help to fold
           many proteins. Hsp70 assisted folding involves repeated
           cycles of substrate binding and release. Hsp70 activity
           is ATP dependent. Hsp70 proteins are made up of two
           regions: the amino terminus is the ATPase domain and the
           carboxyl terminus is the substrate binding region.
          Length = 598

 Score = 27.6 bits (62), Expect = 3.0
 Identities = 22/93 (23%), Positives = 40/93 (43%), Gaps = 12/93 (12%)

Query: 38  ERLEEAGGATSAQIE-LNKKRE--AEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMG 94
             +  + G +  +IE + K  E  A   K R++  EA  + E  + +L K     + E G
Sbjct: 493 ITITASSGLSDDEIERMVKDAEEYAAEDKKRKERIEAKNEAEEYVYSLEKS----LKEEG 548

Query: 95  EQIDQLNKLKTKLDEANRTINDLDVARKKLIIE 127
           +++ +    K K++EA   I  L    +    E
Sbjct: 549 DKLPE--ADKKKVEEA---IEWLKEELEGEDKE 576


>gnl|CDD|222709 pfam14362, DUF4407, Domain of unknown function (DUF4407).  This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 366 and 597 amino
           acids in length. There is a single completely conserved
           residue R that may be functionally important.
          Length = 297

 Score = 27.6 bits (62), Expect = 3.0
 Identities = 9/52 (17%), Positives = 19/52 (36%)

Query: 71  ANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVARK 122
             IQ E   A   +       ++     ++  L+ ++DEA   +N      +
Sbjct: 115 LEIQQEEQAAAQAQVAAGFRPKIAALTAEIAALQAEIDEAQAEVNAAYQEAQ 166


>gnl|CDD|225747 COG3206, GumC, Uncharacterized protein involved in
           exopolysaccharide biosynthesis [Cell envelope
           biogenesis, outer membrane].
          Length = 458

 Score = 27.4 bits (61), Expect = 3.2
 Identities = 32/137 (23%), Positives = 59/137 (43%), Gaps = 15/137 (10%)

Query: 12  VEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQI-ELNKKREAEMSKLRRDLE- 69
           ++  RQ  A+  +Q ADL+ EL     +L     A  AQ+ EL ++  AE+ ++   L  
Sbjct: 287 IQDLRQQYAQVRQQIADLSTELGAKHPQLV----ALEAQLAELRQQIAAELRQILASLPN 342

Query: 70  --EANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVARKKLIIE 127
                 Q E+ L     +    +S++ +   QL +L+ + + A      L    ++L I 
Sbjct: 343 ELALLEQQEAALEKELAQLKGRLSKLPKLQVQLRELEREAEAARSLYETLLQRYQELSI- 401

Query: 128 NSDLVRQLSKVTADARM 144
                 Q +    +AR+
Sbjct: 402 ------QEASPIGNARV 412


>gnl|CDD|111809 pfam02961, BAF, Barrier to autointegration factor.  The BAF
          protein has a SAM-domain-like bundle of orthogonally
          packed alpha-hairpins - one classic and one pseudo
          helix-hairpin-helix motif. The protein is involved in
          the prevention of retroviral DNA integration.
          Length = 89

 Score = 26.2 bits (58), Expect = 3.4
 Identities = 19/63 (30%), Positives = 25/63 (39%), Gaps = 4/63 (6%)

Query: 23 EKQRADLARELEELGERLEEAGGATS----AQIELNKKREAEMSKLRRDLEEANIQHEST 78
          +K    LA   E LG RLE+ G   +     Q  L KK E    +  +D   AN +    
Sbjct: 18 DKPVTALAGIGEVLGGRLEDKGFDKAYVVLGQFLLLKKDEELFKEWLKDTCGANAKQARD 77

Query: 79 LAN 81
            N
Sbjct: 78 CYN 80


>gnl|CDD|226654 COG4191, COG4191, Signal transduction histidine kinase regulating
           C4-dicarboxylate transport system [Signal transduction
           mechanisms].
          Length = 603

 Score = 27.3 bits (61), Expect = 3.9
 Identities = 16/59 (27%), Positives = 29/59 (49%)

Query: 14  VERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEAN 72
             R   A+ ++ RA+L R +EE    L  A     A+I   ++ EA + + + +L +A 
Sbjct: 321 RARLRLAELQEARAELERRVEERTADLTRANARLQAEIAEREQAEAALRRAQDELVQAG 379


>gnl|CDD|198091 smart01023, BAF, Barrier to autointegration factor.
          Barrier-to-autointegration factor (BAF) is an essential
          protein that is highly conserved in metazoan evolution,
          and which may act as a DNA-bridging protein. BAF binds
          directly to double-stranded DNA, to transcription
          activators, and to inner nuclear membrane proteins,
          including lamin A filament proteins that anchor
          nuclear-pore complexes in place, and nuclear LEM-domain
          proteins that bind to laminins filaments and chromatin.
          New findings suggest that BAF has structural roles in
          nuclear assembly and chromatin organization, represses
          gene expression and might interlink chromatin
          structure, nuclear architecture and gene regulation in
          metazoans. BAF can be exploited by retroviruses to act
          as a host component of pre-integration complexes, which
          promote the integration of the retroviral DNA into the
          host chromosome by preventing autointegration of
          retroviral DNA. BAF might contribute to the assembly or
          activity of retroviral pre-integration complexes
          through direct binding to the retroviral proteins p55
          Gag and matrix, as well as to DNA.
          Length = 87

 Score = 26.2 bits (58), Expect = 3.9
 Identities = 20/63 (31%), Positives = 24/63 (38%), Gaps = 4/63 (6%)

Query: 23 EKQRADLARELEELGERLEEAGGATS----AQIELNKKREAEMSKLRRDLEEANIQHEST 78
          EK    LA   E LG RLE  G   +     Q  L KK E    +  +D   AN +    
Sbjct: 16 EKPVTALAGIGEVLGGRLETKGFDKAYVVLGQFLLLKKDEELFKEWLKDTCGANAKQARD 75

Query: 79 LAN 81
            N
Sbjct: 76 CYN 78


>gnl|CDD|233459 TIGR01541, tape_meas_lam_C, phage tail tape measure protein, lambda
           family.  This model represents a relatively
           well-conserved region near the C-terminus of the tape
           measure protein of a lambda and related phage. This
           protein, which controls phage tail length, is typically
           about 1000 residues in length. Both low-complexity
           sequence and insertion/deletion events appear common in
           this family. Mutational studies suggest a ruler or
           template role in the determination of phage tail length.
           Similar behavior is attributed to proteins from
           distantly related or unrelated families in other phage
           [Mobile and extrachromosomal element functions, Prophage
           functions].
          Length = 332

 Score = 27.1 bits (60), Expect = 3.9
 Identities = 12/57 (21%), Positives = 24/57 (42%), Gaps = 2/57 (3%)

Query: 46  ATSAQIELNKKRE--AEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQL 100
            T+ +  L  + +    + KL + LEEA  +    L  L +      ++   Q+D+ 
Sbjct: 19  NTADEKSLQSRSDEIIALIKLEKLLEEAERKALEALKKLAEATASIRAQNKRQLDRF 75


>gnl|CDD|168351 PRK06023, PRK06023, enoyl-CoA hydratase; Provisional.
          Length = 251

 Score = 27.1 bits (60), Expect = 3.9
 Identities = 18/73 (24%), Positives = 38/73 (52%), Gaps = 11/73 (15%)

Query: 5   IYELEEEVEVERQSRAKAEK------QRADLAREL-----EELGERLEEAGGATSAQIEL 53
           I+++ +E  VE ++   AE+      Q   +AR+L     E++  R++E     +A+++ 
Sbjct: 179 IWKIVDEEAVEAETLKAAEELAAKPPQALQIARDLMRGPREDILARIDEEAKHFAARLKS 238

Query: 54  NKKREAEMSKLRR 66
            + R A  + +RR
Sbjct: 239 AEARAAFEAFMRR 251


>gnl|CDD|233004 TIGR00513, accA, acetyl-CoA carboxylase, carboxyl transferase,
          alpha subunit.  The enzyme acetyl-CoA carboxylase
          contains a biotin carboxyl carrier protein or domain, a
          biotin carboxylase, and a carboxyl transferase. This
          model represents the alpha chain of the carboxyl
          transferase for cases in which the architecture of the
          protein is as in E. coli, in which the
          carboxyltransferase portion consists of two
          non-identical subnits, alpha and beta [Fatty acid and
          phospholipid metabolism, Biosynthesis].
          Length = 316

 Score = 27.1 bits (60), Expect = 4.0
 Identities = 14/38 (36%), Positives = 22/38 (57%), Gaps = 3/38 (7%)

Query: 5  IYELEEEVEVERQSRAKAEKQRADLARELEELGERLEE 42
          I ELE ++E     RA++  +  DL+ E+E L +R  E
Sbjct: 12 IAELEAKIE---SLRARSRDEDVDLSEEIERLEKRSVE 46


>gnl|CDD|189762 pfam00901, Orbi_VP5, Orbivirus outer capsid protein VP5.
           cryoelectron microscopy indicates that VP5 is a trimer
           implying that there are 360 copies of VP5 per virion.
          Length = 507

 Score = 27.3 bits (61), Expect = 4.0
 Identities = 23/87 (26%), Positives = 42/87 (48%), Gaps = 10/87 (11%)

Query: 1   MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 60
           +Q ++ ELE E + E   R K  K+       +E+ GE LEE       + ++ ++ E +
Sbjct: 89  LQRKLKELEREQKEEEV-REKHNKKI------IEKFGEDLEEVYKFMKGEAKVEEEEEKQ 141

Query: 61  MSKLRRDLEEAN--IQHES-TLANLRK 84
           M  L + L+     ++ E+ +L  L K
Sbjct: 142 MEILEKALKSYLKIVKEENKSLQRLAK 168


>gnl|CDD|234173 TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB.  Members
           of this protein family are the bacterial ATP-dependent
           chaperone ClpB. This protein belongs to the AAA family,
           ATPases associated with various cellular activities
           (pfam00004). This molecular chaperone does not act as a
           protease, but rather serves to disaggregate misfolded
           and aggregated proteins [Protein fate, Protein folding
           and stabilization].
          Length = 852

 Score = 27.2 bits (61), Expect = 4.2
 Identities = 23/72 (31%), Positives = 42/72 (58%), Gaps = 4/72 (5%)

Query: 4   RIYELE-EEVEVERQSRAKAEKQRADLARELEELGER---LEEAGGATSAQIELNKKREA 59
           RI +LE E   ++++    ++++  DL +EL EL E    LEE   A  A I+  ++ + 
Sbjct: 413 RIIQLEIEREALKKEKDEASKERLEDLEKELAELEEEYADLEEQWKAEKAAIQGIQQIKE 472

Query: 60  EMSKLRRDLEEA 71
           E+ ++R +LE+A
Sbjct: 473 EIEQVRLELEQA 484


>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family.  This family of proteins includes
           MND1 from S. cerevisiae. The mnd1 protein forms a
           complex with hop2 to promote homologous chromosome
           pairing and meiotic double-strand break repair.
          Length = 188

 Score = 26.8 bits (60), Expect = 4.3
 Identities = 26/140 (18%), Positives = 58/140 (41%), Gaps = 29/140 (20%)

Query: 1   MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 60
           ++ R+ +L++E+E  +Q  A+ + Q   L +  EE  ER E           L + ++  
Sbjct: 67  LKTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTE----------LLEELKQ-- 114

Query: 61  MSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDV- 119
                  LE+   + ++ L    K   + + ++ E+       K   + ANR  +++   
Sbjct: 115 -------LEKELKKLKAELEKYEKNDPERIEKLKEET------KVAKEAANRWTDNIFSL 161

Query: 120 ---ARKKLIIENSDLVRQLS 136
                KK  ++ + L ++  
Sbjct: 162 KSYLSKKFGLDEAQLRKEFG 181


>gnl|CDD|234342 TIGR03752, conj_TIGR03752, integrating conjugative element protein,
           PFL_4705 family.  Members of this protein family are
           found occasionally on plasmids such as the Pseudomonas
           putida toluene catabolic TOL plasmid pWWO_p085. Usually,
           however, they are found on the bacterial main chromosome
           in regions flanked by markers of conjugative transfer
           and/or transposition [Mobile and extrachromosomal
           element functions, Plasmid functions].
          Length = 472

 Score = 27.3 bits (61), Expect = 4.4
 Identities = 19/95 (20%), Positives = 44/95 (46%), Gaps = 18/95 (18%)

Query: 27  ADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHESTLANLRKKH 86
             L  E++EL +RL +        I  N+  +AE  +L++  +  + Q +  + +  +  
Sbjct: 62  RTLVAEVKELRKRLAKL-------ISENEALKAENERLQKREQSIDQQIQQAVQSETQ-- 112

Query: 87  NDAVSEMGEQIDQLN----KLKTKLDEANRTINDL 117
                E+ ++I+QL     +L+  +D+  R +  +
Sbjct: 113 -----ELTKEIEQLKSERQQLQGLIDQLQRRLAGV 142


>gnl|CDD|181619 PRK09039, PRK09039, hypothetical protein; Validated.
          Length = 343

 Score = 26.9 bits (60), Expect = 4.7
 Identities = 19/70 (27%), Positives = 37/70 (52%)

Query: 2   QARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEM 61
           + R  EL +E++ E+Q  A+A  Q   L +++  L  +L     A  A  + +++ +A++
Sbjct: 115 EGRAGELAQELDSEKQVSARALAQVELLNQQIAALRRQLAALEAALDASEKRDRESQAKI 174

Query: 62  SKLRRDLEEA 71
           + L R L  A
Sbjct: 175 ADLGRRLNVA 184


>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein.  Members of this
           protein family have a signal peptide, a strongly
           conserved SH3 domain, a variable region, and then a
           C-terminal hydrophobic transmembrane alpha helix region.
          Length = 198

 Score = 26.5 bits (59), Expect = 4.8
 Identities = 18/112 (16%), Positives = 49/112 (43%), Gaps = 15/112 (13%)

Query: 4   RIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSK 63
           R+ EL++E+   ++  A+ ++Q A+L +E +EL + L              ++ + E+++
Sbjct: 67  RLPELQQELAELQEELAELQEQLAELQQENQELKQELSTLEAE-------LERLQKELAR 119

Query: 64  LRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTIN 115
           +++    A          L +++ +   E+ E   +   L+ + +       
Sbjct: 120 IKQLSANAI--------ELDEENRELREELAELKQENEALEAENERLQENEQ 163


>gnl|CDD|224176 COG1256, FlgK, Flagellar hook-associated protein [Cell motility and
           secretion].
          Length = 552

 Score = 27.0 bits (60), Expect = 4.9
 Identities = 21/63 (33%), Positives = 29/63 (46%), Gaps = 7/63 (11%)

Query: 77  STLANLRKKHNDA-------VSEMGEQIDQLNKLKTKLDEANRTINDLDVARKKLIIENS 129
             L +LRK  N         V+ + +QI  LNK   K+  A    NDL   R +L+ E S
Sbjct: 152 EQLTDLRKDINAEIAATVDEVNSLLKQIADLNKQIRKVKAAGNDPNDLLDQRDQLVDELS 211

Query: 130 DLV 132
            L+
Sbjct: 212 QLI 214


>gnl|CDD|218602 pfam05478, Prominin, Prominin.  The prominins are an emerging
           family of proteins that among the multispan membrane
           proteins display a novel topology. Mouse prominin and
           human prominin (mouse)-like 1 (PROML1) are predicted to
           contain five membrane spanning domains, with an
           N-terminal domain exposed to the extracellular space
           followed by four, alternating small cytoplasmic and
           large extracellular, loops and a cytoplasmic C-terminal
           domain. The exact function of prominin is unknown
           although in humans defects in PROM1, the gene coding for
           prominin, cause retinal degeneration.
          Length = 807

 Score = 26.9 bits (60), Expect = 5.0
 Identities = 17/89 (19%), Positives = 29/89 (32%), Gaps = 26/89 (29%)

Query: 27  ADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQHESTLANLRKKH 86
              A EL+ L   L          +++  K EA                     +LR   
Sbjct: 626 NSFAEELDALANNLPNGK------LKVALKNEAL--------------------DLRSIQ 659

Query: 87  NDAVSEMGEQIDQLNKLKTKLDEANRTIN 115
            + V  M   +++LN     LDE++  + 
Sbjct: 660 QNFVPPMESLLEKLNSNVKTLDESSNDLQ 688


>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein.  Function of MutS2 is
           unknown. It should not be considered a DNA mismatch
           repair protein. It is likely a DNA mismatch binding
           protein of unknown cellular function [DNA metabolism,
           Other].
          Length = 771

 Score = 27.1 bits (60), Expect = 5.2
 Identities = 27/150 (18%), Positives = 54/150 (36%), Gaps = 12/150 (8%)

Query: 17  QSRAKAEKQRADLAREL-----EELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEA 71
           +S A    QR  +   +        GE  EE          L K+ E +   L + L+E 
Sbjct: 482 ESYAFEIAQRYGIPHFIIEQAKTFYGEFKEEINVLIEKLSALEKELEQKNEHLEKLLKEQ 541

Query: 72  NIQHESTLANLRKKHNDAVSEMGEQIDQ-----LNKLKTKLDEANRTINDLDVARKKLII 126
               +  L    ++  +       ++++     L  LK +++   R + +  + + K I 
Sbjct: 542 EKLKKE-LEQEMEELKERERNKKLELEKEAQEALKALKKEVESIIRELKEKKIHKAKEIK 600

Query: 127 ENSDLVRQLSKVTADARMWKIKYETNKPKD 156
              DLV +L +           ++ +K  D
Sbjct: 601 SIEDLV-KLKETKQKIPQKPTNFQADKIGD 629


>gnl|CDD|218618 pfam05508, Ran-binding, RanGTP-binding protein.  The small Ras-like
           GTPase Ran plays an essential role in the transport of
           macromolecules in and out of the nucleus and has been
           implicated in spindle and nuclear envelope formation
           during mitosis in higher eukaryotes. The S. cerevisiae
           ORF YGL164c encoding a novel RanGTP-binding protein,
           termed Yrb30p was identified. The protein competes with
           yeast RanBP1 (Yrb1p) for binding to the GTP-bound form
           of yeast Ran (Gsp1p) and is, like Yrb1p, able to form
           trimeric complexes with RanGTP and some of the
           karyopherins.
          Length = 302

 Score = 26.5 bits (59), Expect = 6.1
 Identities = 11/43 (25%), Positives = 23/43 (53%)

Query: 28  DLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEE 70
            L RE++E  ERL+EA  +       ++  +  + K+ R +++
Sbjct: 88  SLRREIDEFDERLDEAASSEELSSRTSENEKESIKKIERYMKD 130


>gnl|CDD|239651 cd03679, MM_CoA_mutase_alpha_like, Coenzyme
           B12-dependent-methylmalonyl coenzyme A (CoA) mutase
           (MCM) family, Alpha subunit-like subfamily; contains
           proteins similar to the alpha subunit of
           Propionbacterium shermanni MCM, as well as human and E.
           coli MCM. Members of this subfamily contain an
           N-terminal MCM domain and a C-terminal coenzyme B12
           binding domain. MCM catalyzes the isomerization of
           methylmalonyl-CoA to succinyl-CoA. The reaction proceeds
           via radical intermediates beginning with a
           substrate-induced homolytic cleavage of the Co-C bond of
           coenzyme B12 to produce cob(II)alamin and the
           deoxyadenosyl radical. MCM plays an important role in
           the conversion of propionyl-CoA to succinyl-CoA during
           the degradation of propionate for the Krebs cycle. In
           higher animals, MCM is involved in the breakdown of
           odd-chain fatty acids, several amino acids, and
           cholesterol. Methylobacterium extorquens MCM
           participates in the glyoxylate regeneration pathway. In
           M. extorquens, MCM forms a complex with MeaB; MeaB may
           protect MCM from irreversible inactivation. In some
           bacteria, MCM is involved in the reverse metabolic
           reaction, the rearrangement of succinyl-CoA to
           methylmalonyl-CoA. Examples include P. shermanni MCM
           during propionic acid fermentation, E.coli MCM in a
           pathway for the conversion of succinate to propionate
           and Streptomyces MCM in polyketide biosynthesis.
           Sinorhizobium meliloti strain SU47 MCM plays a role in
           the polyhydroxyalkanoate degradation pathway. P.
           shermanni and Streptomyces cinnamonensis MCMs are
           alpha/beta heterodimers. It has been shown for P.
           shermanni MCM that only the alpha subunit binds coenzyme
           B12 and substrates. Human MCM is a homodimer with two
           active sites. Mouse and E.coli MCMs are also homodimers.
           In humans, impaired activity of MCM results in
           methylmalonic aciduria, a disorder of propionic acid
           metabolism.
          Length = 536

 Score = 26.6 bits (59), Expect = 6.2
 Identities = 32/109 (29%), Positives = 48/109 (44%), Gaps = 23/109 (21%)

Query: 14  VERQSRAKAEKQRADLARELEELG------------ERLEEAGGATSAQIELNKKREAEM 61
           +E  +   AEK  A +  E+EELG             R+EEA     A+I+  ++    +
Sbjct: 393 MESLTDDLAEKAWALIQ-EIEELGGMAKAIESGIPKLRIEEAAARRQARIDSGREVIVGV 451

Query: 62  SKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEA 110
           +K R D EE     +     + K  N AV    EQI +L KL+ + D  
Sbjct: 452 NKYRLDHEE---PLD-----VLKIDNTAVRA--EQIARLKKLRAERDPE 490


>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
           and chromosome partitioning].
          Length = 420

 Score = 26.6 bits (59), Expect = 6.2
 Identities = 21/80 (26%), Positives = 37/80 (46%), Gaps = 4/80 (5%)

Query: 1   MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 60
           +Q  I  LE+++  ++  RAK EKQ   L  E+      LE     T+  ++  +K+ A+
Sbjct: 43  IQKEIAALEKKIREQQDQRAKLEKQLKSLETEI----ASLEAQLIETADDLKKLRKQIAD 98

Query: 61  MSKLRRDLEEANIQHESTLA 80
           ++     LE    +    LA
Sbjct: 99  LNARLNALEVQEREQRRRLA 118


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 26.8 bits (59), Expect = 6.4
 Identities = 31/127 (24%), Positives = 56/127 (44%), Gaps = 10/127 (7%)

Query: 5   IYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKRE-AEMSK 63
           +  +E+ ++ E  SR  AEKQ A+     E+  ++ EE   A +A      + E AE  K
Sbjct: 478 MQSMEKRLKSEADSRVNAEKQLAE-----EKKRKKEEEETAARAAAQAAASREECAESLK 532

Query: 64  LRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVARKK 123
             +   E  I+    L +  K   +    + ++  +L K +    E    ++ L   + K
Sbjct: 533 QAKQDLEMEIKK---LEHDLKLKEEECRMLEKEAQELRKYQESEKETEVLMSALQAMQDK 589

Query: 124 -LIIENS 129
            L++ENS
Sbjct: 590 NLMLENS 596


>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family.  This
           family consists of the 116kDa V-type ATPase (vacuolar
           (H+)-ATPases) subunits, as well as V-type ATP synthase
           subunit i. The V-type ATPases family are proton pumps
           that acidify intracellular compartments in eukaryotic
           cells for example yeast central vacuoles,
           clathrin-coated and synaptic vesicles. They have
           important roles in membrane trafficking processes. The
           116kDa subunit (subunit a) in the V-type ATPase is part
           of the V0 functional domain responsible for proton
           transport. The a subunit is a transmembrane glycoprotein
           with multiple putative transmembrane helices it has a
           hydrophilic amino terminal and a hydrophobic carboxy
           terminal. It has roles in proton transport and assembly
           of the V-type ATPase complex. This subunit is encoded by
           two homologous gene in yeast VPH1 and STV1.
          Length = 707

 Score = 26.6 bits (59), Expect = 6.5
 Identities = 10/42 (23%), Positives = 19/42 (45%)

Query: 1   MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEE 42
               +Y++ E      +  +K  K+  +L R LE+    LE+
Sbjct: 206 FGFELYDVPETEGERSELISKVNKRIEELQRVLEQTESHLEK 247


>gnl|CDD|214396 CHL00206, ycf2, Ycf2; Provisional.
          Length = 2281

 Score = 26.8 bits (59), Expect = 6.6
 Identities = 15/52 (28%), Positives = 23/52 (44%), Gaps = 7/52 (13%)

Query: 87  NDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVARK-KLIIENSDLVRQLSK 137
           N  +  +G QI  L KLK  L      ++D D ++K K +I    +   L  
Sbjct: 920 NQLLESIGLQIVHLKKLKPFL------LDDHDTSQKSKFLINGGTISPFLFN 965


>gnl|CDD|235558 PRK05683, flgK, flagellar hook-associated protein FlgK; Validated.
          Length = 676

 Score = 26.6 bits (59), Expect = 6.6
 Identities = 15/50 (30%), Positives = 22/50 (44%), Gaps = 7/50 (14%)

Query: 88  DAVSEMGEQIDQLNKLKTKLDEANRTINDLDVARKKLIIENSDLVRQLSK 137
           D V+ +   I   NK   +   +  T NDL  AR        + VRQL++
Sbjct: 166 DQVNNLTTSIASYNKQIAQASASGATPNDLLDAR-------DEAVRQLNE 208


>gnl|CDD|218489 pfam05192, MutS_III, MutS domain III.  This domain is found in
           proteins of the MutS family (DNA mismatch repair
           proteins) and is found associated with pfam00488,
           pfam05188, pfam01624 and pfam05190. The MutS family of
           proteins is named after the Salmonella typhimurium MutS
           protein involved in mismatch repair; other members of
           the family included the eukaryotic MSH 1,2,3, 4,5 and 6
           proteins. These have various roles in DNA repair and
           recombination. Human MSH has been implicated in
           non-polyposis colorectal carcinoma (HNPCC) and is a
           mismatch binding protein. The aligned region corresponds
           with domain III, which is central to the structure of
           Thermus aquaticus MutS as characterized in.
          Length = 290

 Score = 26.2 bits (58), Expect = 7.0
 Identities = 22/120 (18%), Positives = 49/120 (40%), Gaps = 8/120 (6%)

Query: 24  KQRADLARELEELGERLEEAGGATSAQIELNKKRE---AEMSKLRRDLEEANIQHESTLA 80
               +L  +L EL ER  E  G  S ++  N+       E+   + D    +    ST  
Sbjct: 166 ALLDELREKLAELLERERERTGIKSLKVGYNRVFGYYVIEVKASKADKVPGDYIRRSTTK 225

Query: 81  NLRKKHNDAVSEMGEQIDQLNKLKTKLDEANRTINDLDVARKKLIIENSDLVRQLSKVTA 140
           N  +     + E+  ++ +    +  L      + +L    ++++ E ++L+++L+   A
Sbjct: 226 NAVRFTTPELKELERKLLEAE--ERALALEKEILEEL---LERVLSEYAELLQELADAIA 280


>gnl|CDD|223562 COG0488, Uup, ATPase components of ABC transporters with duplicated
           ATPase domains [General function prediction only].
          Length = 530

 Score = 26.4 bits (59), Expect = 7.0
 Identities = 21/64 (32%), Positives = 32/64 (50%), Gaps = 5/64 (7%)

Query: 8   LEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRD 67
           LE++ E  RQ  A  EKQ+ +LA+E E +       G A +++ +  K R   + KL   
Sbjct: 239 LEQKAERLRQEAAAYEKQQKELAKEQEWI-----RRGKAAASKAKKAKSRIKRLEKLEAR 293

Query: 68  LEEA 71
           L E 
Sbjct: 294 LAEE 297


>gnl|CDD|206167 pfam13997, YqjK, YqjK-like protein.  The YqjK-like protein family
          includes the E. coli YqjK protein, which is
          functionally uncharacterized. This family of proteins
          is found in bacteria. Proteins in this family are
          approximately 100 amino acids in length. There is a
          single completely conserved residue R that may be
          functionally important.
          Length = 73

 Score = 24.9 bits (55), Expect = 7.1
 Identities = 7/22 (31%), Positives = 10/22 (45%)

Query: 20 AKAEKQRADLARELEELGERLE 41
           + ++QR DLA   E   E   
Sbjct: 3  QRIQQQRLDLAAHAEPWLEATA 24


>gnl|CDD|235580 PRK05724, PRK05724, acetyl-CoA carboxylase carboxyltransferase
          subunit alpha; Validated.
          Length = 319

 Score = 26.3 bits (59), Expect = 7.6
 Identities = 17/39 (43%), Positives = 24/39 (61%), Gaps = 3/39 (7%)

Query: 5  IYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEA 43
          I ELE ++E   + RA AE    DL+ E+E L ++LEE 
Sbjct: 12 IAELEAKIE---ELRAVAEDSDVDLSEEIERLEKKLEEL 47


>gnl|CDD|215590 PLN03123, PLN03123, poly [ADP-ribose] polymerase; Provisional.
          Length = 981

 Score = 26.7 bits (59), Expect = 7.7
 Identities = 18/95 (18%), Positives = 36/95 (37%), Gaps = 9/95 (9%)

Query: 15  ERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAEMSKLRRDLEEANIQ 74
           +       +K  ++   E  E  ++  + G         + K +   +K  RD+  +   
Sbjct: 176 QEAVLPLVKKSPSEAKEEKAEERKQESKKGAKRKKDASGDDKSKK--AKTDRDVSTST-- 231

Query: 75  HESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDE 109
                A  +KK +D  S++  Q  +L  LK  L +
Sbjct: 232 -----AASQKKSSDLESKLEAQSKELWSLKDDLKK 261


>gnl|CDD|227184 COG4847, COG4847, Uncharacterized protein conserved in archaea
           [Function unknown].
          Length = 103

 Score = 25.6 bits (56), Expect = 8.1
 Identities = 16/34 (47%), Positives = 20/34 (58%)

Query: 3   ARIYELEEEVEVERQSRAKAEKQRADLARELEEL 36
           A I E EE  E  RQ R + EK   DL +++EEL
Sbjct: 70  AVIAESEEAKESIRQVRQEVEKLAGDLTKKIEEL 103


>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
           autophagy-related subunit 14.  The Atg14 or Apg14
           proteins are hydrophilic proteins with a predicted
           molecular mass of 40.5 kDa, and have a coiled-coil motif
           at the N terminus region. Yeast cells with mutant Atg14
           are defective not only in autophagy but also in sorting
           of carboxypeptidase Y (CPY), a vacuolar-soluble
           hydrolase, to the vacuole. Subcellular fractionation
           indicate that Apg14p and Apg6p are peripherally
           associated with a membrane structure(s). Apg14p was
           co-immunoprecipitated with Apg6p, suggesting that they
           form a stable protein complex. These results imply that
           Apg6/Vps30p has two distinct functions: in the
           autophagic process and in the vacuolar protein sorting
           pathway. Apg14p may be a component specifically required
           for the function of Apg6/Vps30p through the autophagic
           pathway. There are 17 auto-phagosomal component proteins
           which are categorized into six functional units, one of
           which is the AS-PI3K complex (Vps30/Atg6 and Atg14). The
           AS-PI3K complex and the Atg2-Atg18 complex are essential
           for nucleation, and the specific function of the AS-PI3K
           apparently is to produce phosphatidylinositol
           3-phosphate (PtdIns(3)P) at the pre-autophagosomal
           structure (PAS). The localisation of this complex at the
           PAS is controlled by Atg14. Autophagy mediates the
           cellular response to nutrient deprivation, protein
           aggregation, and pathogen invasion in humans, and
           malfunction of autophagy has been implicated in multiple
           human diseases including cancer. This effect seems to be
           mediated through direct interaction of the human Atg14
           with Beclin 1 in the human phosphatidylinositol 3-kinase
           class III complex.
          Length = 307

 Score = 26.2 bits (58), Expect = 8.4
 Identities = 17/78 (21%), Positives = 39/78 (50%), Gaps = 13/78 (16%)

Query: 1   MQARIYELEEEVEVERQSRAKAEKQRADLARELEELGERLEEAGGATSAQIELNKKREAE 60
           ++ARI +L+EE+E +R+   + ++  A    +L              SA  +L K+R ++
Sbjct: 76  IRARISQLKEEIEQKRERIEELKRALAQRRSDLS-------------SASYQLEKRRASQ 122

Query: 61  MSKLRRDLEEANIQHEST 78
           + KL+ +++    +  + 
Sbjct: 123 LEKLQDEIKRTRSKLNAL 140


>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding.  This
           family is the highly conserved central region of a
           number of metazoan proteins referred to as growth-arrest
           proteins. In mouse, Gas8 is predominantly a testicular
           protein, whose expression is developmentally regulated
           during puberty and spermatogenesis. In humans, it is
           absent in infertile males who lack the ability to
           generate gametes. The localisation of Gas8 in the
           motility apparatus of post-meiotic gametocytes and
           mature spermatozoa, together with the detection of Gas8
           also in cilia at the apical surfaces of epithelial cells
           lining the pulmonary bronchi and Fallopian tubes
           suggests that the Gas8 protein may have a role in the
           functioning of motile cellular appendages. Gas8 is a
           microtubule-binding protein localised to regions of
           dynein regulation in mammalian cells.
          Length = 201

 Score = 26.0 bits (58), Expect = 8.8
 Identities = 20/89 (22%), Positives = 37/89 (41%), Gaps = 4/89 (4%)

Query: 50  QIELNKKREAEMSKLRRDLEEANIQHESTLANLRKKHNDAVSEMGEQIDQLNKLKTKLDE 109
           +I   KK E    KL  ++ + N +    L    ++    V E+ +++    K K  L  
Sbjct: 35  EIAEMKKNEEHNEKLMAEIAQENKRLVEPLKKAEEE----VEELRKKLKDYEKDKQSLKN 90

Query: 110 ANRTINDLDVARKKLIIENSDLVRQLSKV 138
               + +L+   K L  E+  L ++  KV
Sbjct: 91  LKARLKELEKELKNLKWESEVLEQRFEKV 119


>gnl|CDD|187736 cd02255, Peptidase_C12, Cysteine peptidase C12 contains ubiquitin
           carboxyl-terminal hydrolase (UCH) families L1, L3, L5
           and BAP1.  The ubiquitin C-terminal hydrolase (UCH;
           ubiquitinyl hydrolase; ubiquitin thiolesterase) family
           of deubiquitinating enzymes (DUBs) consists of four
           members to date: UCH-L1, UCH-L3, UCH-L5 (UCH37) and
           BRCA1-associated protein-1 (BAP1), all containing a
           conserved catalytic domain with cysteine peptidase
           activity.  UCH-L1 hydrolyzes carboxyl terminal esters
           and amides of ubiquitin (Ub). Dysfunction of this
           hydrolase activity can lead to an accumulation of
           alpha-synuclein, which is linked to Parkinson's disease
           (PD) and neurofibrillary tangles, linked to Alzheimer's
           disease (AD).  UCH-L1, in its dimeric form, has
           additional enzymatic activity as a ubiquitin ligase.
           UCH-L3 hydrolyzes isopeptide bonds at the C-terminal
           glycine of either Ub or Nedd8, a ubiquitin-like protein.
           UCH-L3 can also interact with Lys48-linked Ub dimers to
           protect it from degradation while inhibiting its
           hydrolase activity at the same time.  UCH-L1 and UCH-L3
           are the most closely related of the UCH members. UCH-L5
           (UCH37) is involved in the deubiquitinating activity in
           the 19S proteasome regulatory complex. It is also
           associated with the human Ino80 chromatin-remodeling
           complex (hINO80) in the nucleus. BAP1 binds to the
           wild-type BRCA1 RING finger domain, localized in the
           nucleus.  It consists of the N-terminal UCH domain and
           two predicted nuclear localization signals (NLSs), only
           one of which is functional. The full-length human BRCA1
           is a ubiquitin ligase. However, BAP1 does not appear to
           function in the deubiquitination of autoubiquitinated
           BRCA1. There is growing evidence that UCH enzymes and
           human malignancies are closely correlated. Studies show
           that UCH enzymes play a crucial role in some signaling
           pathways and in cell-cycle regulation.
          Length = 222

 Score = 25.9 bits (56), Expect = 9.4
 Identities = 7/57 (12%), Positives = 18/57 (31%), Gaps = 2/57 (3%)

Query: 100 LNKLKTKLDEANRTINDLDVARKKLIIENSDLVRQLSKVTADARMWKIKYETNKPKD 156
             +  + L +       +    +   +EN D +R   + +A       +  +   K 
Sbjct: 107 HFESGSTLKKFLEESVSMSPEERARYLENYDAIRVTHETSAHEGQT--EAPSIDEKV 161


>gnl|CDD|219526 pfam07701, HNOBA, Heme NO binding associated.  The HNOBA domain is
           found associated with the HNOB domain and pfam00211 in
           soluble cyclases and signalling proteins. The HNOB
           domain is predicted to function as a heme-dependent
           sensor for gaseous ligands, and transduce diverse
           downstream signals, in both bacteria and animals.
          Length = 209

 Score = 25.7 bits (57), Expect = 9.9
 Identities = 17/46 (36%), Positives = 23/46 (50%), Gaps = 10/46 (21%)

Query: 28  DLARELEELGERLEEAGGATSAQIELNKKREAEMSKLR---RDLEE 70
           D  R+L  LGE+         AQ++L K+ E    KL    R+LEE
Sbjct: 150 DATRDLILLGEQ-------ARAQLKLKKRLEKLKDKLEEAHRELEE 188


>gnl|CDD|177432 PHA02607, wac, fibritin; Provisional.
          Length = 454

 Score = 26.1 bits (58), Expect = 9.9
 Identities = 12/48 (25%), Positives = 25/48 (52%), Gaps = 3/48 (6%)

Query: 95  EQIDQ-LNKLKTKLDEANRTINDLDVARKKLIIENSDLVRQLSKVTAD 141
           EQ+D+   K K K++E    +N +       +I +  ++ Q+++  AD
Sbjct: 55  EQLDENTKKTKDKINEVVDDVNTIQENLD--VIGDISVIDQINQNVAD 100


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.307    0.124    0.315 

Gapped
Lambda     K      H
   0.267   0.0709    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 7,826,972
Number of extensions: 743898
Number of successful extensions: 3103
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2480
Number of HSP's successfully gapped: 788
Length of query: 156
Length of database: 10,937,602
Length adjustment: 89
Effective length of query: 67
Effective length of database: 6,990,096
Effective search space: 468336432
Effective search space used: 468336432
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 55 (25.0 bits)