RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy2380
         (733 letters)



>gnl|CDD|234597 PRK00050, PRK00050, 16S rRNA m(4)C1402 methyltranserfase;
           Provisional.
          Length = 296

 Score =  352 bits (907), Expect = e-117
 Identities = 118/253 (46%), Positives = 169/253 (66%), Gaps = 6/253 (2%)

Query: 229 LNEAINWLNIENERINGIYIDATFGQGGHSCKILERLGKKGRLIAIDKDTESVSLGNK-- 286
           L+E ++ L I   + +GIY+D TFG GGHS  ILERLG KGRLIAID+D ++++      
Sbjct: 8   LDEVVDALAI---KPDGIYVDGTFGGGGHSRAILERLGPKGRLIAIDRDPDAIAAAKDRL 64

Query: 287 ITDSRFSIIHNCFTELDIILKKYNIKKIDGILFDLGISSNQINNELRGFSFLLDGPLDMR 346
               RF+++H  F+ L  +L +  + K+DGIL DLG+SS Q+++  RGFSF  DGPLDMR
Sbjct: 65  KPFGRFTLVHGNFSNLKEVLAE-GLGKVDGILLDLGVSSPQLDDAERGFSFRKDGPLDMR 123

Query: 347 MDITRGISASKWLANATEFNIKKVIQDYGEERFAKKIAKEIVHYRSITPITRTKQLVEII 406
           MD ++G++A++ +   +E  + ++ ++YGEERFA++IA+ IV  R   PIT T +L EII
Sbjct: 124 MDQSQGLTAAEVVNTYSEEELARIFKEYGEERFARRIARAIVEARPKKPITTTGELAEII 183

Query: 407 LKSIRGNKRYKNPATRTFQAIRIYINQELKNLSIALKIGFKKLNIKGRIVVISFHSLEDR 466
             ++   ++  +PATRTFQA+RI +N EL+ L  AL+     L   GR+ VISFHSLEDR
Sbjct: 184 KSAVPPRRKGIHPATRTFQALRIEVNDELEELERALEAALDLLKPGGRLAVISFHSLEDR 243

Query: 467 IVKNFINFNTKIP 479
           IVK F    +K  
Sbjct: 244 IVKRFFRELSKGC 256


>gnl|CDD|223352 COG0275, COG0275, Predicted S-adenosylmethionine-dependent
           methyltransferase involved in cell envelope biogenesis
           [Cell envelope biogenesis, outer membrane].
          Length = 314

 Score =  344 bits (885), Expect = e-113
 Identities = 130/281 (46%), Positives = 178/281 (63%), Gaps = 10/281 (3%)

Query: 229 LNEAINWLNIENERINGIYIDATFGQGGHSCKILERLGKKGRLIAIDKDTESVSLGNKIT 288
           LNE +  L     + +GIYID T G GGHS  ILE+L   GRLI ID+D +++++  +  
Sbjct: 12  LNEVVELLAP---KPDGIYIDGTLGAGGHSRAILEKLPDLGRLIGIDRDPQAIAIAKERL 68

Query: 289 ---DSRFSIIHNCFTELDIILKKYNIKKIDGILFDLGISSNQINNELRGFSFLLDGPLDM 345
              D R +++H  F  L   LK+  I K+DGIL DLG+SS Q+++  RGFSF  DGPLDM
Sbjct: 69  KEFDGRVTLVHGNFANLAEALKELGIGKVDGILLDLGVSSPQLDDAERGFSFRKDGPLDM 128

Query: 346 RMDITRGISASKWLANATEFNIKKVIQDYGEERFAKKIAKEIVHYRSITPITRTKQLVEI 405
           RMD T+G+SA++ +   +E ++ ++ ++YGEERFAK+IA+ IV  R   PI  TK+L EI
Sbjct: 129 RMDQTQGLSAAEVVNTYSEEDLARIFKEYGEERFAKRIARAIVERRKKKPIETTKELAEI 188

Query: 406 ILKSIRGNKRYK--NPATRTFQAIRIYINQELKNLSIALKIGFKKLNIKGRIVVISFHSL 463
           I  +I   ++ K  +PATRTFQAIRIY+N EL+ L  AL+     L   GR+ VISFHSL
Sbjct: 189 IKSAIPAKEKRKKIHPATRTFQAIRIYVNDELEELEEALEAALDLLKPGGRLAVISFHSL 248

Query: 464 EDRIVKNFINFNTKIPHIDRRLPIYNYLFEPKMKFLARCKP 504
           EDRIVKNF    +K     + LP+       K K + + KP
Sbjct: 249 EDRIVKNFFKELSKPGV-PKGLPVTEEGPALKFKLITK-KP 287


>gnl|CDD|216706 pfam01795, Methyltransf_5, MraW methylase family.  Members of this
           family are probably SAM dependent methyltransferases
           based on Escherichia coli rsmH. This family appears to
           be related to pfam01596.
          Length = 310

 Score =  315 bits (809), Expect = e-102
 Identities = 129/281 (45%), Positives = 181/281 (64%), Gaps = 9/281 (3%)

Query: 229 LNEAINWLNIENERINGIYIDATFGQGGHSCKILERLGKKGRLIAIDKDTESVSL-GNKI 287
           L E +  LNI+    +GIYID T G GGHS  ILE+L ++GRLI ID+D ++++    ++
Sbjct: 9   LQEVVELLNIKP---DGIYIDCTLGGGGHSEAILEQLPEEGRLIGIDRDPQAIANAKERL 65

Query: 288 T--DSRFSIIHNCFTELDIILKKYNIKKIDGILFDLGISSNQINNELRGFSFLLDGPLDM 345
                R ++IH+ F  L   LK+  + K+DGILFDLG+SS Q++   RGFSF  D PLDM
Sbjct: 66  KPFKGRVTLIHSNFANLFAYLKELGVGKVDGILFDLGVSSPQLDGAERGFSFKHDAPLDM 125

Query: 346 RMDITRGISASKWLANATEFNIKKVIQDYGEERFAKKIAKEIVHYRSITPITRTKQLVEI 405
           RMD + G++A++ L   +E ++ ++ + YGEERF+K+IA+ IV  R   PI  TK+L EI
Sbjct: 126 RMDQSAGLTAAEVLNTYSEEDLARIFKKYGEERFSKRIARAIVERRKKKPIETTKELAEI 185

Query: 406 ILKSIRGNKRYK--NPATRTFQAIRIYINQELKNLSIALKIGFKKLNIKGRIVVISFHSL 463
           I K++    + K  +PATRTFQAIRIY+N EL++L  AL+     L   GR+ VISFHSL
Sbjct: 186 IKKAVPAKAKRKKIHPATRTFQAIRIYVNDELESLKEALEQAPDLLAPGGRLSVISFHSL 245

Query: 464 EDRIVKNFINFNTKIPHIDRRLPIYNYLFEPKMKFLARCKP 504
           EDRIVKNF    +K   +   LP+     EP +K + + KP
Sbjct: 246 EDRIVKNFFREYSKGCQVPAGLPVILEGLEPLLKIITK-KP 285


>gnl|CDD|232778 TIGR00006, TIGR00006, 16S rRNA
           (cytosine(1402)-N(4))-methyltransferase.  This model
           describes RsmH, a 16S rRNA methyltransferase.
           Previously, this gene was designated MraW, known to be
           essential in E. coli and widely conserved in bacteria
           [Cell envelope, Biosynthesis and degradation of murein
           sacculus and peptidoglycan].
          Length = 307

 Score =  283 bits (725), Expect = 1e-89
 Identities = 131/279 (46%), Positives = 184/279 (65%), Gaps = 8/279 (2%)

Query: 229 LNEAINWLNIENERINGIYIDATFGQGGHSCKILERLGKKGRLIAIDKDTESVSLGNKI- 287
           L+E +  LNI   + +GIYID T G+GGHS  IL +LG+ GRLI ID+D ++++   +I 
Sbjct: 9   LDEVVEGLNI---KPDGIYIDGTLGRGGHSKAILSQLGE-GRLIGIDRDPQAIAFAKEIL 64

Query: 288 TDSRFSIIHNCFTELDIILKKYNIKKIDGILFDLGISSNQINNELRGFSFLLDGPLDMRM 347
           +D RF IIH+ F      LK+  + KIDGIL DLG+SS Q+++  RGFSF+ DGPLDMRM
Sbjct: 65  SDLRFVIIHDNFANFFEYLKELLVTKIDGILVDLGVSSPQLDDPERGFSFMHDGPLDMRM 124

Query: 348 DITRGISASKWLANATEFNIKKVIQDYGEERFAKKIAKEIVHYRSITPITRTKQLVEIIL 407
           D ++ +SA++ L   +E +++ +++ YGEERF+K+IA+ IV  R   PI  TK+L E+I 
Sbjct: 125 DQSQKLSAAEILNTYSEEDLEWILKKYGEERFSKRIARAIVERRKKKPIQTTKELAEVIS 184

Query: 408 KSIRGNKRYK--NPATRTFQAIRIYINQELKNLSIALKIGFKKLNIKGRIVVISFHSLED 465
           K++ G  +YK  +PATR FQAIRIY+N EL+ L  AL+     L   GR+ +ISFHSLED
Sbjct: 185 KAVPGFSKYKSIHPATRVFQAIRIYVNDELEELEEALQFAPNLLAPGGRLSIISFHSLED 244

Query: 466 RIVKNFINFNTKIPHIDRRLPIYNYLFEPKMKFLARCKP 504
           RIVKNF    +K P +   LP+         K + + KP
Sbjct: 245 RIVKNFFRELSKGPQVPPGLPVKEEQLYALGKRITK-KP 282


>gnl|CDD|182741 PRK10799, PRK10799, metal-binding protein; Provisional.
          Length = 247

 Score =  237 bits (607), Expect = 2e-73
 Identities = 103/220 (46%), Positives = 139/220 (63%), Gaps = 13/220 (5%)

Query: 507 KLNIKKYEDYCPNGLQVEGRSNINVIVTGVTASLDLIKTAVDMNADAILVHHGYFWKGEN 566
           KLN     DY PNGLQVEGR  +  IVTGVTAS  L+  AV + ADA++VHHGYFWKGE+
Sbjct: 13  KLNSAAISDYAPNGLQVEGRETVQKIVTGVTASQALLDEAVRLQADAVIVHHGYFWKGES 72

Query: 567 SNIVGIKKKRLEQLIINKINLYAYHLPLDMHPKLGNNAQLAKILNFSCTRRFSKNNIGWI 626
             I G+K+ RL+ L+ N INLY +HLPLD HP+LGNNAQLA +L            I  +
Sbjct: 73  PVIRGMKRNRLKTLLANDINLYGWHLPLDAHPELGNNAQLAALL-----------GITVM 121

Query: 627 GKIINLKRYN-FKKIITIKDLFHHITRKIGKKPIVIGDLN-KKIYEIGWCTGAAQNLLTD 684
           G+I  L  +      +   +L   I  ++G+KP+  GD   + +  + WCTG  Q+ +  
Sbjct: 122 GEIEPLVPWGELTMPVPGLELASWIEARLGRKPLWCGDTGPEVVQRVAWCTGGGQSFIDS 181

Query: 685 AINEGVTAYISGEISESTVYISRESGVAYFAAGHHATERY 724
           A   GV A+I+GE+SE T++ +RE G+ ++AAGHHATER 
Sbjct: 182 AARFGVDAFITGEVSEQTIHSAREQGLHFYAAGHHATERG 221


>gnl|CDD|223404 COG0327, COG0327, Uncharacterized conserved protein [Function
           unknown].
          Length = 250

 Score =  193 bits (493), Expect = 9e-57
 Identities = 79/222 (35%), Positives = 126/222 (56%), Gaps = 12/222 (5%)

Query: 506 KKLNIKKYEDYCPNGLQVEG-RSNINVIVTGVTASLDLIKTAVDMNADAILVHHG-YFWK 563
           + L  +  ED+  NGLQV G    +  +   + A+L ++  A+++ AD ++VHH   FW 
Sbjct: 12  EFLPPELAEDWDNNGLQVGGPLEEVKKVAVALDATLAVLDEAIELGADLLIVHHPLIFWP 71

Query: 564 GENSNIVGIKKKRLEQLIINKINLYAYHLPLDMHPKLGNNAQLAKILNFSCTRRFSKNNI 623
            ++  I G K KR++ LI N INLYA H  LD HP+ GN+A  A +L       F    +
Sbjct: 72  VKSLAIDGPKGKRIKALIQNDINLYAAHTNLDAHPEGGNDALAAALLGAEELPPF-GEGL 130

Query: 624 GWIGKIINLKRYNFKKIITIKDLFHHITRKIGKKPI-VIGDLNKKIYEIGWCTGAAQNLL 682
           G +G          K+  T+++L   I  K+G+ P+ V+ D +++I  +  C+G+ Q  L
Sbjct: 131 GRVG--------ELKEPTTLEELAERIKAKLGRPPLRVVKDGSEEIKRVAVCSGSGQGFL 182

Query: 683 TDAINEGVTAYISGEISESTVYISRESGVAYFAAGHHATERY 724
           ++A  EGV AYI+G++S  T + +RE G++   AGH+ATER 
Sbjct: 183 SEAAAEGVDAYITGDLSHHTAHDARELGLSVIDAGHYATERP 224


>gnl|CDD|213534 TIGR00486, YbgI_SA1388, dinuclear metal center protein, YbgI/SA1388
           family.  The characterization of this family of
           uncharacterized proteins as orthologous is tentative.
           Members are found in all three domains of life. Several
           members (from Bacillus subtilis, Listeria monocytogenes,
           and Mycobacterium tuberculosis - all classified as
           Firmicutes within the Eubacteria) share a long insert
           relative to other members [Unknown function, General].
          Length = 249

 Score =  181 bits (462), Expect = 2e-52
 Identities = 73/221 (33%), Positives = 111/221 (50%), Gaps = 11/221 (4%)

Query: 506 KKLNIKKYEDYCPNGLQVE-GRSNINVIVTGVTASLDLIKTAVDMNADAILVHHGYFWKG 564
           + L  +  ED   NGLQV  G   +  +V  V AS  +   AV + AD I+ HH   WK 
Sbjct: 12  RFLPKELAEDGDNNGLQVGSGNEEVKKVVVAVDASESVADEAVRLGADLIITHHPLIWKP 71

Query: 565 ENSNIVGIKKKRLEQLIINKINLYAYHLPLDMHPKLGNNAQLAKILNFSCTRRFSKNNIG 624
               I GIK  RL+ L+ N I+LY+ H  LD H   GNN  LA+ L     + F    +G
Sbjct: 72  LKRLIRGIKPGRLKILLQNDISLYSAHTNLDAHD-GGNNDALARALGLENPKEFEDYGLG 130

Query: 625 WIGKIINLKRYNFKKIITIKDLFHHITRKIGKKPIVIGDLNK-KIYEIGWCTGAAQNLLT 683
            +G+        FK  I   +    I + +  KP+++       + ++   +G+  + + 
Sbjct: 131 RVGE--------FKAPIESLEEVLEIKKVLNVKPLLVVKNGPEYVKKVAVVSGSGLSFIM 182

Query: 684 DAINEGVTAYISGEISESTVYISRESGVAYFAAGHHATERY 724
            A+ EGV AYI+G++S  T +++RE G+    AGH+ATER 
Sbjct: 183 KALREGVDAYITGDLSHHTAHLARELGLNVIDAGHYATERG 223


>gnl|CDD|216697 pfam01784, NIF3, NIF3 (NGG1p interacting factor 3).  This family
           contains several NIF3 (NGG1p interacting factor 3)
           protein homologues. NIF3 interacts with the yeast
           transcriptional coactivator NGG1p which is part of the
           ADA complex, the exact function of this interaction is
           unknown.
          Length = 237

 Score =  180 bits (460), Expect = 3e-52
 Identities = 63/227 (27%), Positives = 113/227 (49%), Gaps = 17/227 (7%)

Query: 506 KKLNIKKYEDYCPNGLQVEGRSN-INVIVTGVTASLDLIKTAVDMNADAILVHHGYFWKG 564
           +       ED+   GLQV      +  I+  + A+ ++I+ A++  AD I+ HH   +K 
Sbjct: 8   ELAPPSLAEDWDNVGLQVGDPEGEVKKILVALDATEEVIEEAIENGADLIITHHPLIFKP 67

Query: 565 ENSNI-VGIKKKRLEQLIINKINLYAYHLPLDMHPKLGNNAQLAKILNFSCTRRFSKN-- 621
                    K +R+ +LI N I+LY+ H  LD  P  G N  LA++L          +  
Sbjct: 68  LKRITGDDPKGRRILKLIKNNISLYSAHTNLDAAPG-GVNDWLAELLGLESIEPLEPSVS 126

Query: 622 ---NIGWIGKIINLKRYNFKKIITIKDLFHHITRKIGKKPI-VIGDLNKKIYEIGWCTGA 677
               +G IG++         + I++++L   +  ++G + + V+GDL+K I ++  C G+
Sbjct: 127 EGEGLGRIGEL--------PEPISLEELAKRVKERLGLEGVRVVGDLDKPIKKVAICGGS 178

Query: 678 AQNLLTDAINEGVTAYISGEISESTVYISRESGVAYFAAGHHATERY 724
             +L+ +A  +G    I+GE+       ++E G+A   AGH+ATER+
Sbjct: 179 GGSLIEEAKAKGADLLITGELKHHDALDAKEKGIALIDAGHYATERF 225


>gnl|CDD|240528 cd13123, MATE_MurJ_like, MurJ/MviN, a subfamily of the multidrug
           and toxic compound extrusion (MATE)-like proteins.
           Escherichia coli MurJ (MviN) has been identified as
           essential for murein biosynthesis. It has been suggested
           that MurJ functions as the peptidoglycan lipid II
           flippase which is involved in translocation of
           lipid-anchored peptidoglycan precursors across the
           cytoplasmic membrane, though results obtained in
           Bacillus subtilis seem to indicate that its MurJ
           homologs are not essential for growth. Some MviN family
           members (e.g. in Mycobacterium tuberculosis) possess an
           extended C-terminal region that contains an
           intracellular pseudo-kinase domain and an extracellular
           domain resembling carbohydrate-binding proteins.
           Proteins from the MATE family are involved in exporting
           metabolites across the cell membrane and are often
           responsible for multidrug resistance (MDR).
          Length = 420

 Score =  122 bits (308), Expect = 1e-29
 Identities = 48/125 (38%), Positives = 79/125 (63%), Gaps = 4/125 (3%)

Query: 1   MAFAVVLGGALQIIIQIPSLIKIGMFPHIKLNPSHGFKNIAVRRILKKMGPSVFSVFAAQ 60
           +A+ V+LGG LQ+++Q+P+L ++G     +L+    F++  VRR+LK M P++  V  +Q
Sbjct: 177 LAWGVLLGGVLQLLVQLPALRRLGFRFRPRLD----FRDPGVRRVLKLMLPALLGVSVSQ 232

Query: 61  ISLMLNTNIASQMREGSLSCLSYADRLMEFPTTLLGVTFNTILLPNLSKARIENNTEEYS 120
           I+L+++T +AS + EGS+S L YADRL + P  + GV   T LLP LS+     + +E+ 
Sbjct: 233 INLLVDTILASFLPEGSISALYYADRLYQLPLGIFGVAIATALLPRLSRLAAAGDLDEFR 292

Query: 121 AILDW 125
             L  
Sbjct: 293 RTLSR 297



 Score = 34.4 bits (80), Expect = 0.21
 Identities = 12/31 (38%), Positives = 16/31 (51%)

Query: 126 IPIFAHSGLALSIGLGACLHASFLYWYLRHK 156
           I    H GLAL+  L A ++A  L   LR +
Sbjct: 390 IKPLGHVGLALATSLSAWVNALLLLVLLRRR 420


>gnl|CDD|223800 COG0728, MviN, Uncharacterized membrane protein, putative virulence
           factor [General function prediction only].
          Length = 518

 Score =  121 bits (306), Expect = 4e-29
 Identities = 49/124 (39%), Positives = 80/124 (64%), Gaps = 4/124 (3%)

Query: 2   AFAVVLGGALQIIIQIPSLIKIGMFPHIKLNPSHGFKNIAVRRILKKMGPSVFSVFAAQI 61
           A+ V++GG LQ+++Q+P+L K G+    +      FK+  ++R LK M P++  V  +QI
Sbjct: 195 AWGVLIGGLLQLLVQLPALRKAGLLIKPRFG----FKDPGLKRFLKLMLPALLGVSISQI 250

Query: 62  SLMLNTNIASQMREGSLSCLSYADRLMEFPTTLLGVTFNTILLPNLSKARIENNTEEYSA 121
           +L+++T IAS + EGS+S L YADRL + P  + GV  +T+LLP+LS+     +  E+  
Sbjct: 251 NLLIDTAIASFLAEGSVSWLYYADRLYQLPLGIFGVALSTVLLPSLSRHAANGDWPEFLK 310

Query: 122 ILDW 125
           +LDW
Sbjct: 311 LLDW 314



 Score = 66.2 bits (162), Expect = 3e-11
 Identities = 34/110 (30%), Positives = 54/110 (49%), Gaps = 2/110 (1%)

Query: 126 IPIFAHSGLALSIGLGACLHASFLYWYLRHKRIYIPCAGWGVFFI-RLVIALLLLVIVAL 184
           IP   H GLAL+  L A ++A  LY+ LR + +Y+P  GWG+F I +L++A  ++    L
Sbjct: 407 IPPLGHVGLALATSLAAWVNALLLYYLLRKRLVYLPGRGWGLFLILKLLLASAIMAAALL 466

Query: 185 WGNSYFNWLGMQAHPIFRIVALLLIL-LFCGITYFLALRIMGFLFLNEAI 233
                     +    +   + +LL+L L     YF  L ++GF     A 
Sbjct: 467 ALLHLAQREWLLGGLLLIRLGVLLLLVLLGAGVYFAMLLLLGFRVRGFAR 516


>gnl|CDD|217323 pfam03023, MVIN, MviN-like protein.  Deletion of the mviN virulence
           gene in Salmonella enterica serovar. Typhimurium greatly
           reduces virulence in a mouse model of typhoid-like
           disease. Open reading frames encoding homologues of MviN
           have since been identified in a variety of bacteria,
           including pathogens and non-pathogens and
           plant-symbionts. In the nitrogen-fixing symbiont
           Rhizobium tropici, mviN is required for motility. The
           MviM protein is predicted to be membrane-associated.
          Length = 452

 Score = 98.5 bits (246), Expect = 9e-22
 Identities = 50/152 (32%), Positives = 87/152 (57%), Gaps = 13/152 (8%)

Query: 2   AFAVVLGGALQIIIQIPSLIKIGMFPHIKLNPSHGFKNIAVRRILKKMGPSVFSVFAAQI 61
           A  V++GG LQ+++Q+P L K G+    + N    F++  V+R L    P++  V  +Q+
Sbjct: 161 AIGVLIGGVLQLLVQLPFLRKAGLLLKPRFN----FRDKGVKRFLALALPTLLGVSVSQL 216

Query: 62  SLMLNTNIASQMREGSLSCLSYADRLMEFPTTLLGVTFNTILLPNLSKARIENNTEEYSA 121
           +L+++T +AS + EGS+S L YADR+ + P  + GV+ +T+LLP LS+   + + +E+  
Sbjct: 217 NLLIDTALASFLAEGSISYLYYADRIYQLPLGIFGVSVSTVLLPRLSRHAKDGDWDEFLD 276

Query: 122 ILDWIPIFAHSGLALSIGLGACLHASFLYWYL 153
           +LD          A+ + L   + ASF    L
Sbjct: 277 LLDQ---------AIRLTLLLMIPASFGLLVL 299



 Score = 51.9 bits (125), Expect = 8e-07
 Identities = 26/87 (29%), Positives = 40/87 (45%), Gaps = 7/87 (8%)

Query: 126 IPIFAHSGLALSIGLGACLHASFLYWYLRHKRIYIPCAGWGVFFIRLVIALLLLVIVALW 185
           +P     GLAL+  L A ++  FLY+ LR +       GW  F   LV+   L+  V L 
Sbjct: 373 LPPLGVVGLALATSLSAWINLVFLYYILRKRLGGHSLRGWKTFLASLVVPTALMGGVILL 432

Query: 186 GNSYFNWLGMQAHPIFRIVALLLILLF 212
            +S           ++ + +LLLIL+ 
Sbjct: 433 LSSLT-------QGLWVVGSLLLILVG 452


>gnl|CDD|233536 TIGR01695, mviN, integral membrane protein MviN.  This model
           represents MviN, a family of integral membrane proteins
           predicted to have ten or more transmembrane regions.
           Although frequently listed as a virulence protein, it is
           not restricted to pathogens and it is an essential
           protein in Sinorhizobium meliloti. In a number of
           species its gene is adjacent to that of the
           uridylyltransferase GlnD , the signal-transducing enzyme
           that performs the key modification to the nitrogen
           regulatory protein PII [Unknown function, General].
          Length = 502

 Score = 96.6 bits (241), Expect = 6e-21
 Identities = 47/142 (33%), Positives = 76/142 (53%), Gaps = 5/142 (3%)

Query: 2   AFAVVLGGALQIIIQIPSLIKIGMFPHIKLNPSHGFKNIAVRRILKKMGPSVFSVFAAQI 61
           A  V++GG  Q++IQ+P L K G     + N         ++R LK   P+     A+QI
Sbjct: 186 AIGVLIGGVAQLLIQLPFLRKAGFLLKPRFNFRDP----GLKRFLKLFLPTTLGSSASQI 241

Query: 62  SLMLNTNIASQMREGSLSCLSYADRLMEFPTTLLGVTFNTILLPNLSKARIENNTEEYSA 121
           +L++NT +AS +  GS+S L YA+R+ + P  + G++ +T+LLP LS+   E N  E   
Sbjct: 242 TLLINTALASFLEIGSVSALYYANRIYQLPLGIFGISLSTVLLPKLSRHASEGNWNELRD 301

Query: 122 ILDW-IPIFAHSGLALSIGLGA 142
           +L+  I +     +  S GL  
Sbjct: 302 LLNQGIRLSLLLTIPSSFGLLI 323



 Score = 65.0 bits (159), Expect = 6e-11
 Identities = 24/108 (22%), Positives = 41/108 (37%), Gaps = 4/108 (3%)

Query: 126 IPIFAHSGLALSIGLGACLHASFLYWYLRHKRIYIPCAGWGVFFIRLVIALLLLVIVALW 185
           I      G+AL+    + + +  LY  L  +   I   G      +LVIA  ++  V   
Sbjct: 398 IFPLGLVGIALATSAASMVSSVLLYLMLNRRLKGILPFGVLKVLAKLVIASAIIGGVLYL 457

Query: 186 GNSYFNWLGMQAHPIFRIVALLLILLFCGITYFLALRIMGFLFLNEAI 233
             S      +  + +  +   L+ LL     YFL L ++    LN  +
Sbjct: 458 IVSVSLGGVLVKNLLGLLAIGLIGLLV----YFLGLALLKIEELNLLL 501


>gnl|CDD|218341 pfam04947, Pox_VLTF3, Poxvirus Late Transcription Factor VLTF3
           like.  Members of this family are approximately 26 KDa,
           and are involved in trans-activator of late
           transcription.
          Length = 171

 Score = 36.1 bits (84), Expect = 0.027
 Identities = 27/137 (19%), Positives = 51/137 (37%), Gaps = 17/137 (12%)

Query: 381 KKIAKEIVHYRSITPITRTKQLVEIILKSIRGNKRYKNPATRTFQAIRIYINQELKNLSI 440
           KK+ K  +    +T        V   LK +  +K Y N     F  + +   +   NLS 
Sbjct: 42  KKLKKYNIDTDDVT-----TTKVLDFLKKLGYSKYYDN----VFLILNMLTGKPPPNLSS 92

Query: 441 ALKIGFKKLNIKGRIVVISFHSLEDRIVKNFINFNTKIPHIDRRLPIYNYLFEPKMKFLA 500
            L+   +++    + +   F  L  R  KNFI+++     + + L +  Y        L 
Sbjct: 93  ELE---ERIKEIFKELQKFFDKLNTR--KNFISYSY---VLYKLLELLGYDENLPDFPLL 144

Query: 501 RCKPKKKLNIKKYEDYC 517
           +   K     + ++  C
Sbjct: 145 KSLTKLNSQDEIWKKLC 161


>gnl|CDD|225795 COG3256, NorB, Nitric oxide reductase large subunit [Inorganic ion
           transport and metabolism].
          Length = 717

 Score = 36.3 bits (84), Expect = 0.064
 Identities = 33/142 (23%), Positives = 44/142 (30%), Gaps = 14/142 (9%)

Query: 97  VTFNTILLPNLSKARIENNTEEYSAILDWIPIFAHSGLALSIGLGACLHASFLYWYLRHK 156
           +     LLP L   +     E  S  L     F   G      LG        +W+    
Sbjct: 315 LAAGLYLLPELGGPK---FQELGSPKLLIALFFVVVGSLAGAWLGYLQLLPAPFWFWFGH 371

Query: 157 RIYIPCAGWGVFFIRLVIALLLLVIVALWGNSYFNWLGMQAHPIFRIVALLLILLFCGIT 216
           + Y    G G    +L++   LLV VAL      N          R   L+ ILL     
Sbjct: 372 QGY-EYLGRG-RLWQLLLIKGLLVWVALMFR--ANV-TATKLKG-RKTTLIAILLLSLTG 425

Query: 217 YFLALRIMGFLFLNEAINWLNI 238
             + L    F   N    W N+
Sbjct: 426 --IGL-FFLFGLYNP--EWTNL 442


>gnl|CDD|225546 COG3000, ERG3, Sterol desaturase [Lipid metabolism].
          Length = 271

 Score = 35.1 bits (81), Expect = 0.093
 Identities = 27/114 (23%), Positives = 43/114 (37%), Gaps = 13/114 (11%)

Query: 123 LDWIPIFAHSGLALSIGLGACLHASFLYWYLRHKRIYIPCAGWGV----------FFIRL 172
           L   P+     LA+++ L A     FL       +       W                 
Sbjct: 5   LWPAPLDFVLALAIALFLLAFGLHLFLLRLYLFPQE-PLRKAWPTRDWLLDLVLLTLSIS 63

Query: 173 VIALLLLVIVALW--GNSYFNWLGMQAHPIFRIVALLLILLFCGITYFLALRIM 224
           V+ L+LL+ + LW     Y   LG+   P+   + LLL  LF  + Y+ A R++
Sbjct: 64  VVGLVLLLALILWALLGGYLPGLGVAPGPLPFALQLLLAFLFLDLGYYWAHRLL 117


>gnl|CDD|220569 pfam10101, DUF2339, Predicted membrane protein (DUF2339).  This
           domain, found in various hypothetical bacterial
           proteins, has no known function.
          Length = 738

 Score = 35.8 bits (83), Expect = 0.098
 Identities = 23/109 (21%), Positives = 33/109 (30%), Gaps = 27/109 (24%)

Query: 119 YSAILDWIPIFAHSGLALSIGLGACLHASFLYWYLRHKRI-YIPCA----GWGVFFIRLV 173
           Y+     +       L    GL   L A      LR +R  Y P A    G    ++ + 
Sbjct: 24  YAIEAGLLGPELRVALGALAGLA--LLA--AGERLRRRRYAYFPAALQGGGLATLYLTVF 79

Query: 174 IALLLLVIVALWGNSYFNWLGMQAHPIFRIVALLLILLFCGITYFLALR 222
            A  L  ++                     VA  L++L    T  LAL 
Sbjct: 80  AAFHLYGLLPPT------------------VAFALLVLVTAGTVVLALL 110



 Score = 33.5 bits (77), Expect = 0.48
 Identities = 22/109 (20%), Positives = 37/109 (33%), Gaps = 7/109 (6%)

Query: 128 IFAHSGLALSIGLGACLHASFLYWYLRHKRIYIPCAGWGVFFIRLVIALLLLVIVALWGN 187
           +     LAL++       A +   Y     +       G+  +  ++ALLLL    L G 
Sbjct: 526 LLTLLALALALVYLYRRRARWPLVYRAAAYLL-----LGLAPLAALLALLLLENPLLSGI 580

Query: 188 SYFNWLGMQAHPIFRIVALLLILLFCGITYFLALRIMGFLFLNEAINWL 236
                LG    PI   + L  +LL   +      R+   + L      +
Sbjct: 581 GGAAPLG--FLPILNPLLLAYLLLAPALLALWLRRLATPIRLKALRPAV 627


>gnl|CDD|223885 COG0815, Lnt, Apolipoprotein N-acyltransferase [Cell envelope
           biogenesis, outer membrane].
          Length = 518

 Score = 33.5 bits (77), Expect = 0.38
 Identities = 27/125 (21%), Positives = 41/125 (32%), Gaps = 28/125 (22%)

Query: 121 AILDWIPIFAHSGLALSIGLGACLHASFLYWYLRHKRIYIPCAGWGVFFIRLVIALLLLV 180
               W+       L L++  GA    +F            P   W       +  LL LV
Sbjct: 5   LARQWVLGRPARRLLLALVFGALFALAF------------PPPDWWWLAWFSLAPLLWLV 52

Query: 181 IVA------------LWGNSYF----NWLGMQAHPIFRIVALLLILLFCGITYFLALRIM 224
             A            L+G  +F     WLG        ++A+ L LL   +  +LAL ++
Sbjct: 53  RGAPTSWEGLAKSGFLFGFGFFLAGFYWLGTSLGVGLGLLAVALPLLVLLLAAWLALFLL 112

Query: 225 GFLFL 229
               L
Sbjct: 113 LVAVL 117


>gnl|CDD|218473 pfam05159, Capsule_synth, Capsule polysaccharide biosynthesis
           protein.  This family includes export proteins involved
           in capsule polysaccharide biosynthesis, such as KpsS and
           LipB.
          Length = 268

 Score = 33.1 bits (76), Expect = 0.45
 Identities = 28/124 (22%), Positives = 45/124 (36%), Gaps = 15/124 (12%)

Query: 531 VIVTGVTASLDLIKTAVDMNADA--ILVHHGYFWKGENSNIVGIKKKRLEQLIINKINLY 588
           V+V G       I  A         I +  G+      S  +G        L+++KI +Y
Sbjct: 3   VVVWGDKRPYHRIAAAAAKKKGIPVIRLEDGFL----RSLGLGSNGYPPYSLVLDKIGIY 58

Query: 589 AY-HLPLDM-----HPKLGNNAQLA---KILNFSCTRRFSKNNIGWIGKIINLKRYNFKK 639
               LP D+        + + A LA    ++N   T + SK N+        L     KK
Sbjct: 59  YDSSLPSDLEQLLSEKSVLDQAGLARAEYLMNRIVTHKLSKYNLRRAAPAFRLLLAKDKK 118

Query: 640 IITI 643
           I+ +
Sbjct: 119 IVLV 122


>gnl|CDD|148050 pfam06212, GRIM-19, GRIM-19 protein.  This family consists of
          several eukaryotic gene associated with
          retinoic-interferon-induced mortality 19 (GRIM-19)
          proteins. GRIM-19, was reported to encode a small
          protein primarily distributed in the nucleus and was
          able to promote cell death induced by IFN-# and RA. A
          bovine homologue of GRIM-19 was co-purified with
          mitochondrial NADH:ubiquinone oxidoreductase (complex
          I) in bovine heart. Therefore, its exact cellular
          localisation and function are unclear. It has now been
          discovered that GRIM-19 is a specific interacting
          protein which negatively regulates Stat3 activity.
          Length = 130

 Score = 31.2 bits (71), Expect = 0.66
 Identities = 10/33 (30%), Positives = 15/33 (45%)

Query: 33 PSHGFKNIAVRRILKKMGPSVFSVFAAQISLML 65
          P  G+  I  +R L K G S +++FA       
Sbjct: 11 PPGGYGPIPFKRNLPKTGFSGYAMFATGAGATA 43


>gnl|CDD|185281 PRK15383, PRK15383, type III secretion system protein; Provisional.
          Length = 335

 Score = 31.9 bits (72), Expect = 1.2
 Identities = 22/69 (31%), Positives = 36/69 (52%), Gaps = 5/69 (7%)

Query: 228 FLNEAINWLNIENERINGIYIDATFGQGGHSCKILERLGKKGRLIAIDKDTESVSLGNKI 287
           +LN  IN   IE ERI GI++D  F +G    K L +L +  R I +  +++   L   +
Sbjct: 75  YLNNIINAAIIEKERIIGIFVDGDFSKGQR--KALGKLEQNYRNIKVIYNSD---LNYSM 129

Query: 288 TDSRFSIIH 296
            D + + I+
Sbjct: 130 YDKKLTTIY 138


>gnl|CDD|220473 pfam09925, DUF2157, Predicted membrane protein (DUF2157).  This
           domain, found in various hypothetical prokaryotic
           proteins, has no known function.
          Length = 144

 Score = 30.7 bits (70), Expect = 1.2
 Identities = 27/107 (25%), Positives = 35/107 (32%), Gaps = 22/107 (20%)

Query: 110 ARIENNTEEYSAILDWIPIFAHSGLALSIGLGACLHASFLYWYLRHKRIYIPCAGWGVF- 168
             I     E  A    + I    G AL +G G      F+               W    
Sbjct: 16  EAILARYGEDPARSSLVRILLWLG-ALLLGAGV---IFFVAAN------------WEEIP 59

Query: 169 -FIRLVIALLLLVIVALWGNSYFNWLGMQAHPIFRIVALLLILLFCG 214
            F++  + L LLV +AL G     WL  +  P      LLL  L  G
Sbjct: 60  RFVKFGLLLALLVGLALGGF----WLWRRGKPRLGEALLLLGALLFG 102


>gnl|CDD|225127 COG2217, ZntA, Cation transport ATPase [Inorganic ion transport and
           metabolism].
          Length = 713

 Score = 32.2 bits (74), Expect = 1.3
 Identities = 13/53 (24%), Positives = 21/53 (39%), Gaps = 2/53 (3%)

Query: 168 FFIRLVIALLLLVIVALWGNSYFNWLGMQAHPIFRIVALLLILLFCGITYFLA 220
              RL+IA LL + + L   S    LG    P    +    +L + G  ++  
Sbjct: 87  LLRRLIIAGLLTLPLLLL--SLGLLLGAFLLPWVSFLLATPVLFYGGWPFYRG 137


>gnl|CDD|234462 TIGR04086, TIGR04086_membr, putative membrane protein, TIGR04086
           family.  Members of this family of strongly hydrophobic
           putative transmembrane protein average about 125 amino
           acids in length and occur mostly, but not exclusively,
           in the Firmicutes. Members are quite diverse in
           sequence. The function is unknown.
          Length = 115

 Score = 29.8 bits (68), Expect = 1.6
 Identities = 19/94 (20%), Positives = 32/94 (34%), Gaps = 22/94 (23%)

Query: 164 GWGVFFIRLVIALLLLVIVALWGNSYFNWLGMQAHPIFRIVALLLILLFCGIT------- 216
             G+     ++ L+LL+I+AL    YF  L   +     ++  +L +   G         
Sbjct: 4   LKGLLVA-YILTLILLLILALL--LYFTDLSEGSLSWLILIIYILSVFIGGFIAGKKAGS 60

Query: 217 ------------YFLALRIMGFLFLNEAINWLNI 238
                       YFL L ++  LF       L  
Sbjct: 61  KGWLWGLLVGLLYFLILLLISLLFGPGPGFTLGQ 94


>gnl|CDD|237913 PRK15119, PRK15119, undecaprenyl-phosphate
           alpha-N-acetylglucosaminyl 1-phosphate transferase;
           Provisional.
          Length = 365

 Score = 31.6 bits (72), Expect = 1.7
 Identities = 20/81 (24%), Positives = 33/81 (40%), Gaps = 17/81 (20%)

Query: 131 HSGL-----ALSIGLGACLHASFLYWYLRHKRIYIPCAGWGVFF------------IRLV 173
           H GL      +S+  G C       +Y+ H  +Y+ CAG  VF             IR  
Sbjct: 43  HQGLIPLVGGISVYAGICFTFGITDYYIPHASLYLACAGVLVFVGALDDRFDISVKIRAT 102

Query: 174 IALLLLVIVALWGNSYFNWLG 194
           I   + +++ ++   Y + LG
Sbjct: 103 IQAAVGIVMMVFAGLYLSSLG 123


>gnl|CDD|240539 cd13134, MATE_like_8, Uncharacterized subfamily of the multidrug
           and toxic compound extrusion (MATE) proteins.  The
           integral membrane proteins from the MATE family are
           involved in exporting metabolites across the cell
           membrane and are responsible for multidrug resistance
           (MDR) in many bacteria and animals. A number of family
           members are involved in the synthesis of peptidoglycan
           components in bacteria.
          Length = 438

 Score = 31.3 bits (72), Expect = 1.7
 Identities = 17/80 (21%), Positives = 33/80 (41%), Gaps = 10/80 (12%)

Query: 169 FIRLVIALLLLVIVALWGNSYFNWLGMQAHPI------FRIVALLLILLFCGITYFLALR 222
              L++ L+  +++  +G      +G+    +       RIV   L L    +T    LR
Sbjct: 87  LFNLLLGLVFSLLLLFFGAELLRLMGLPPELLALGSTYLRIVGGSLFLQALSLTLSAILR 146

Query: 223 IMGF----LFLNEAINWLNI 238
             G     ++++  +N LNI
Sbjct: 147 SHGRTKIPMYVSLIMNILNI 166


>gnl|CDD|212167 cd11653, rap1_RCT, C-terminal domain of RAP1 recruits proteins to
           telomeres.  The RAP1 (repressor activator protein 1)
           C-terminal domain (RCT) mediates interactions with other
           proteins such as TRF2 (human), Rif1, Rif2, Sir3, Sir4
           (Saccharomyces cerevisiae), and Taz1
           (Schizosaccharomyces pombe) at telomeres and other loci.
           RAP1, identified in budding yeast as repressor/activator
           protein 1, is a well-conserved telomere binding protein,
           also found in fission yeast and mammals. In
           Saccharomyces cerevisiae, RAP1 directly binds DNA and is
           involved in transcriptional activation, gene silencing,
           as well as binding at numerous sites at each telemore,
           where it functions in telomere length regulation,
           telomeric position effect gene silencing and telomere
           end protection. Human RAP1 apparently does not bind
           telomeric DNA directly, but binds telomere repeat
           binding factor 2 (TRF2) via the RCT. RAP1 might act by
           suppressing nonhomologous end-joining. Yeast RAP1 has
           two myb-type DNA binding modules, and an RCT domain that
           recruits Sir proteins 3 and 4 (Sir3, Sir4) for gene
           silencing, and Rif1 and Rif2 for telomere length
           maintenance. Schizosaccharomyces pombe RAP1 (spRap1),
           like human RAP1, lacks direct DNA-binding activity and
           is localized to telomeres via Taz1, an ortholog of TRF1
           and TRF2. The S. pompe RCT resembles the first 3-helix
           bundle of the yeast and human RCT forms, but is not
           included in this larger model.
          Length = 100

 Score = 29.2 bits (66), Expect = 1.9
 Identities = 14/60 (23%), Positives = 24/60 (40%), Gaps = 16/60 (26%)

Query: 332 LRGFSFLLDGPLDMRMDITRGISASKW-------LANATEFNIKKVIQDYGEERFAKKIA 384
           L    +L+  P DM      GI    W       L +     I+++I+ +G E  A++  
Sbjct: 46  LNVLRYLVGIPPDMP-----GI----WTREDDECLQSDDPRTIERLIKKHGAENVARRWE 96


>gnl|CDD|224880 COG1969, HyaC, Ni,Fe-hydrogenase I cytochrome b subunit [Energy
           production and conversion].
          Length = 227

 Score = 30.9 bits (70), Expect = 2.0
 Identities = 23/91 (25%), Positives = 30/91 (32%), Gaps = 23/91 (25%)

Query: 151 WYL------RHKRIYIPCAGWGVFFIRLVIALLLLVIVALWGNSY--------------F 190
           WYL        K  + P A    FF  L+I  ++L   AL+G  Y              F
Sbjct: 113 WYLFLGKKPHTKGGHNPIAQVAYFFYFLMIVFMILTGFALYGEHYHYGLGAPLRYVFEWF 172

Query: 191 NWLG---MQAHPIFRIVALLLILLFCGITYF 218
            W G      H   R+   L+I       Y 
Sbjct: 173 EWTGGGSADVHFWHRLGMWLIIAFVIVHVYM 203


>gnl|CDD|172166 PRK13603, PRK13603, fumarate reductase subunit C; Provisional.
          Length = 126

 Score = 29.6 bits (66), Expect = 2.2
 Identities = 26/80 (32%), Positives = 37/80 (46%), Gaps = 17/80 (21%)

Query: 150 YW------YLRHKRIYIPC--AGWGVFFIRLVIALLLLVIVALWGNSYFNWLGMQAHP-- 199
           YW      YLR     I C    W V ++     +L+L  V   GNSY  +L   A+P  
Sbjct: 11  YWWARRRSYLRFMLREISCIFVAWFVLYL-----VLVLRAVGAGGNSYQRFLDFSANPVV 65

Query: 200 -IFRIVALLLILLFCGITYF 218
            +  +VAL  +LL   +T+F
Sbjct: 66  VVLNVVALSFLLLH-AVTWF 84


>gnl|CDD|237526 PRK13837, PRK13837, two-component VirA-like sensor kinase;
           Provisional.
          Length = 828

 Score = 31.2 bits (71), Expect = 2.3
 Identities = 11/55 (20%), Positives = 18/55 (32%)

Query: 225 GFLFLNEAINWLNIENERINGIYIDATFGQGGHSCKILERLGKKGRLIAIDKDTE 279
           GF  L  AI W++   ER + + +D        +   L        +I       
Sbjct: 726 GFSTLAAAIAWISKGPERFDLVLVDDRLLDEEQAAAALHAAAPTLPIILGGNSKT 780


>gnl|CDD|179400 PRK02261, PRK02261, methylaspartate mutase subunit S; Provisional.
          Length = 137

 Score = 29.9 bits (68), Expect = 2.4
 Identities = 13/28 (46%), Positives = 17/28 (60%), Gaps = 1/28 (3%)

Query: 530 NVIVTGVTASL-DLIKTAVDMNADAILV 556
            VI  GV  S  + I  A++ +ADAILV
Sbjct: 33  EVINLGVMTSQEEFIDAAIETDADAILV 60


>gnl|CDD|217153 pfam02628, COX15-CtaA, Cytochrome oxidase assembly protein.  This
           is a family of integral membrane proteins. CtaA is
           required for cytochrome aa3 oxidase assembly in Bacillus
           subtilis. COX15 is required for cytochrome c oxidase
           assembly in yeast.
          Length = 301

 Score = 30.7 bits (70), Expect = 2.6
 Identities = 21/121 (17%), Positives = 29/121 (23%), Gaps = 29/121 (23%)

Query: 130 AHSGLALSIGLGACLHASFLYWYLRHKRIYIPCAGWGVFFIRLVIALLLLVIV------- 182
           AH  LA  + L   L  +                        L + LLLL I        
Sbjct: 133 AHLLLAFLL-LALLLWLALRLREPDPAPAPAAGPRRLRGLALLALVLLLLQIALGALVAG 191

Query: 183 --------------------ALWGNSYFNWLGMQ-AHPIFRIVALLLILLFCGITYFLAL 221
                                 W N + N   +Q  H +   +  LL+L           
Sbjct: 192 LDAGLACNTFPLMGPDFPAEPDWRNFFENPATVQFLHRLGAYLTFLLLLALAVAAARRRR 251

Query: 222 R 222
           R
Sbjct: 252 R 252



 Score = 29.9 bits (68), Expect = 5.1
 Identities = 27/118 (22%), Positives = 36/118 (30%), Gaps = 24/118 (20%)

Query: 125 WIPIFAHSGLALSI------GLGACLHASFLY-WYLRHKRIYIPCAGWGVFFI------- 170
            +   A   L L +       L A L A      +      +     W  FF        
Sbjct: 167 RLRGLALLALVLLLLQIALGALVAGLDAGLACNTFPLMGPDFPAEPDWRNFFENPATVQF 226

Query: 171 --RLVIALLLLVIVALWGNSYFNWLGMQAHPIFRIVALLLILLFC----GI-TYFLAL 221
             RL   L  L+++AL           +   + R   LLL LL      GI T  L L
Sbjct: 227 LHRLGAYLTFLLLLALA---VAAARRRRRPALRRAALLLLALLLLQVALGILTVLLGL 281


>gnl|CDD|222102 pfam13398, Peptidase_M50B, Peptidase M50B-like.  This is a family
           of bacterial and plant peptidases in the same family as
           MEROPS:M50B.
          Length = 201

 Score = 30.2 bits (69), Expect = 2.6
 Identities = 22/112 (19%), Positives = 36/112 (32%), Gaps = 35/112 (31%)

Query: 131 HSGLALSIG---------------LGACLHASFLYWYLRHKRIYIPCAGWGVFFIRLVIA 175
            SGL +S G               LG  L    L             AG       L + 
Sbjct: 52  GSGLTVSRGRPGGLGMFLTLAAGYLGPALWGLLLILLAF--------AGRARAL--LWLG 101

Query: 176 LLLLVIVALWGNSYFNWLGMQAHPIFRIVALLLILLFCGITYFLALRIMGFL 227
           + LL+++ LW     N  G        ++ LL   L   + +    +++ F+
Sbjct: 102 VALLLVLLLWVR---NLFG-------LLILLLTGALLLALWWLAPPQVLAFV 143


>gnl|CDD|205007 pfam12670, DUF3792, Protein of unknown function (DUF3792).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are approximately 130 amino acids in length.
           These proteins are integral membrane proteins.
          Length = 117

 Score = 29.4 bits (67), Expect = 2.8
 Identities = 18/86 (20%), Positives = 30/86 (34%), Gaps = 21/86 (24%)

Query: 172 LVIALLLLVIVALWGNSYFNWLGMQAHPIFRIVALLLILLFC------------------ 213
            ++ L+LL+I+AL    YF  L         ++  LL +                     
Sbjct: 16  YIVTLILLLILALL--LYFTDLSESVLSWLIVIITLLSVFLGGFVAGKKAGSKGWLWGLL 73

Query: 214 -GITYFLALRIMGFLFLNEAINWLNI 238
            G+ YFL L ++  L      + L  
Sbjct: 74  VGLLYFLILLLISLLGGGGFFSGLEQ 99


>gnl|CDD|206279 pfam14110, DUF4282, Domain of unknown function (DUF4282).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria and eukaryotes.
           Proteins in this family are typically between 93 and 155
           amino acids in length. There is a single completely
           conserved residue E that may be functionally important.
          Length = 89

 Score = 28.7 bits (65), Expect = 2.9
 Identities = 11/56 (19%), Positives = 24/56 (42%)

Query: 169 FIRLVIALLLLVIVALWGNSYFNWLGMQAHPIFRIVALLLILLFCGITYFLALRIM 224
            I+++  L L++I     +  F  L   A+     +  LL+     +   L +R++
Sbjct: 13  IIKVLYWLGLILIGLFGLSGLFGGLATMAYSPGGGLLALLLGPLGFLLGLLLVRVV 68


>gnl|CDD|219117 pfam06638, Strabismus, Strabismus protein.  This family consists of
           several strabismus (STB) or Van Gogh-like (VANGL)
           proteins 1 and 2. The exact function of this family is
           unknown. It is thought, however that STB1 gene and STB2
           may be potent tumour suppressor gene candidates.
          Length = 507

 Score = 30.8 bits (70), Expect = 3.0
 Identities = 30/109 (27%), Positives = 46/109 (42%), Gaps = 18/109 (16%)

Query: 128 IFAHSGLALSIGLGACLHA-SF-----LYWYLRHKRIYIPCAGWGVFFIRLVIALLLLVI 181
            +    L   +GL A +   +F     + W  R +     C G     I L   LL+L+I
Sbjct: 92  RYLGLALGGLLGLLAFVTPLAFIVLPQVGWRDRLEPCGTACEG---LLISLAFKLLILLI 148

Query: 182 VALWGNSYFNWLGMQAHP---IFRIVALLLILLFCGITY--FLALRIMG 225
            A W  + F        P   +FR + L+L+ LF   +Y  F  +RI+G
Sbjct: 149 -ASW--ALFLRPPRADMPRIFVFRALLLVLVFLFL-FSYWLFYGVRILG 193


>gnl|CDD|219354 pfam07264, EI24, Etoposide-induced protein 2.4 (EI24).  This family
           contains a number of eukaryotic etoposide-induced 2.4
           (EI24) proteins approximately 350 residues long as well
           as bacterial CysZ proteins (formerly known as DUF540).
           In cells treated with the cytotoxic drug etoposide, EI24
           is induced by p53. It has been suggested to play an
           important role in negative cell growth control.
          Length = 205

 Score = 30.2 bits (69), Expect = 3.2
 Identities = 16/92 (17%), Positives = 30/92 (32%), Gaps = 17/92 (18%)

Query: 152 YLRHKRIYIPCAGWGVFFIRLVIALLLLVIVALWGNSYFNWLGMQAHP---------IFR 202
            L   R+        +  + L++ LLL   +  +  S    L               +  
Sbjct: 7   DLLSPRL------RRLVLLPLLLNLLLFAGLLYFLFSLLGPLLDALLSLLWLGWLSWLLW 60

Query: 203 IVALLLILLFCGIT-YFLALRIMGFLFLNEAI 233
           I+A+LL+L+        +A  I    F    +
Sbjct: 61  ILAVLLLLVLLWFLFVVVASLIAAP-FWTPDL 91


>gnl|CDD|150723 pfam10081, DUF2319, Predicted membrane protein (DUF2319).  Members
           of this family of hypothetical bacterial proteins have
           no known function.
          Length = 534

 Score = 30.7 bits (70), Expect = 3.3
 Identities = 27/115 (23%), Positives = 41/115 (35%), Gaps = 23/115 (20%)

Query: 132 SGLALSIGLGACLHASFLYWYLRHKRIYIPCAGWGVFFIRLVIALLLLVIVAL------- 184
           SG++ +IG G  +   +L+ YL    +            RLV   L   +  L       
Sbjct: 38  SGVSAAIGYGIGVFLRWLWRYLELPEL----RPRPRRGARLVAVALGAAVTVLMLVQAAR 93

Query: 185 WGNSYFNWLGMQ----AHPIFRI-----VALLLILLFCGITY---FLALRIMGFL 227
           W NS    +GM     A  +  +     V   L+ L   +     FLA R+   L
Sbjct: 94  WQNSVRALMGMPPLETAAYLRGVLVALLVFGALVALGRLLRRLIRFLARRLQRVL 148


>gnl|CDD|163613 cd00144, MPP_PPP_family, phosphoprotein phosphatases of the
           metallophosphatase superfamily, metallophosphatase
           domain.  The PPP (phosphoprotein phosphatase) family is
           one of two known protein phosphatase families specific
           for serine and threonine.  This family includes: PP1,
           PP2A, PP2B (calcineurin), PP4, PP5, PP6, PP7, Bsu1,
           RdgC, PrpE, PrpA/PrpB, and ApA4 hydrolase. The PPP
           catalytic domain is defined by three conserved motifs
           (-GDXHG-, -GDXVDRG- and -GNHE-).  The PPP enzyme family
           is ancient with members found in all eukaryotes, and in
           most bacterial and archeal genomes.  Dephosphorylation
           of phosphoserines and phosphothreonines on target
           proteins plays a central role in the regulation of many
           cellular processes.  PPPs belong to the
           metallophosphatase (MPP) superfamily.  MPPs are
           functionally diverse, but all share a conserved domain
           with an active site consisting of two metal ions
           (usually manganese, iron, or zinc) coordinated with
           octahedral geometry by a cage of histidine, aspartate,
           and asparagine residues. The MPP superfamily includes:
           Mre11/SbcD-like exonucleases, Dbr1-like RNA lariat
           debranching enzymes, YfcE-like phosphodiesterases,
           purple acid phosphatases (PAPs), YbbF-like
           UDP-2,3-diacylglucosamine hydrolases, and acid
           sphingomyelinases (ASMases).  The conserved domain is a
           double beta-sheet sandwich with a di-metal active site
           made up of residues located at the C-terminal side of
           the sheets.  This domain is thought to allow for
           productive metal coordination.
          Length = 225

 Score = 30.1 bits (68), Expect = 3.8
 Identities = 10/28 (35%), Positives = 15/28 (53%), Gaps = 1/28 (3%)

Query: 295 IHNCFTELDIILKKYNIKKIDGILFDLG 322
           IH C  +L  +L+K      D ++F LG
Sbjct: 6   IHGCLDDLLRLLEKIGFPPNDKLIF-LG 32


>gnl|CDD|221252 pfam11833, DUF3353, Protein of unknown function (DUF3353).  This
           family of proteins are functionally uncharacterized.
           This protein is found in bacteria and eukaryotes.
           Proteins in this family are typically between 205 to 258
           amino acids in length.
          Length = 193

 Score = 29.5 bits (67), Expect = 4.7
 Identities = 25/85 (29%), Positives = 37/85 (43%), Gaps = 14/85 (16%)

Query: 133 GLALSIGLGACLHASFLYWYLRHKRIYIPCAGWGVFFIRLVIALLL--LVIVALWGNSYF 190
            L L++G GAC     +Y+  R  R +     W +    LV+ LLL  L+ V L      
Sbjct: 117 QLQLALGTGAC-----IYFLNRKGRRFGRALLWSLGG--LVVGLLLGSLLAVLLPPFILP 169

Query: 191 NWLG---MQAHPIFRIVALLLILLF 212
             L    +Q+ P +  + L L  LF
Sbjct: 170 GSLSPEQIQSLPAY--LLLWLGSLF 192


>gnl|CDD|133453 cd05311, NAD_bind_2_malic_enz, NAD(P) binding domain of malic
           enzyme (ME), subgroup 2.  Malic enzyme (ME), a member of
           the amino acid dehydrogenase (DH)-like domain family,
           catalyzes the oxidative decarboxylation of L-malate to
           pyruvate in the presence of cations (typically  Mg++ or
           Mn++) with the concomitant reduction of cofactor NAD+ or
           NADP+.  ME has been found in all organisms, and plays
           important roles in diverse metabolic pathways such as
           photosynthesis and lipogenesis. This enzyme generally
           forms homotetramers. The conversion of malate to
           pyruvate by ME typically involves oxidation of malate to
           produce oxaloacetate, followed by decarboxylation of
           oxaloacetate to produce pyruvate and CO2.  This
           subfamily consists primarily of archaeal and bacterial
           ME.  Amino acid DH-like NAD(P)-binding domains are
           members of the Rossmann fold superfamily and include
           glutamate, leucine, and phenylalanine DHs, methylene
           tetrahydrofolate DH, methylene-tetrahydromethanopterin
           DH, methylene-tetrahydropholate DH/cyclohydrolase,
           Shikimate DH-like proteins, malate oxidoreductases, and
           glutamyl tRNA reductase. Amino acid DHs catalyze the
           deamination of amino acids to keto acids with NAD(P)+ as
           a cofactor. The NAD(P)-binding Rossmann fold superfamily
           includes a wide variety of protein families including
           NAD(P)- binding domains of alcohol DHs,
           tyrosine-dependent oxidoreductases,
           glyceraldehyde-3-phosphate DH, lactate/malate DHs,
           formate/glycerate DHs, siroheme synthases,
           6-phosphogluconate DH, amino acid DHs, repressor rex,
           NAD-binding potassium channel  domain, CoA-binding, and
           ornithine cyclodeaminase-like domains. These domains
           have an alpha-beta-alpha configuration. NAD binding
           involves numerous hydrogen and van der Waals contacts.
          Length = 226

 Score = 29.5 bits (67), Expect = 5.1
 Identities = 21/69 (30%), Positives = 31/69 (44%), Gaps = 10/69 (14%)

Query: 326 NQINNELRGFSFLLDGPLDMRM-DITRG--ISASKWLANATEFNIKKVIQDYGEERFAKK 382
           NQ+NN L GF  +  G LD+R   IT    ++A++ +A+  E       +  GEE     
Sbjct: 156 NQVNNVL-GFPGIFRGALDVRATKITEEMKLAAAEAIADLAE------EEVLGEEYIIPT 208

Query: 383 IAKEIVHYR 391
                V  R
Sbjct: 209 PFDPRVVPR 217


>gnl|CDD|224596 COG1682, TagG, ABC-type polysaccharide/polyol phosphate export
           systems, permease component [Carbohydrate transport and
           metabolism / Cell envelope biogenesis, outer membrane].
          Length = 263

 Score = 29.5 bits (67), Expect = 5.2
 Identities = 23/95 (24%), Positives = 41/95 (43%), Gaps = 8/95 (8%)

Query: 135 ALSIGLGACLHASFLYWYLRHKRIYIPCAGWGVFFIRLVIALLLLVIVALWGNSYFNWLG 194
           A+S G G+ +  + L   +    + +P A         +I L++++I  +      +W  
Sbjct: 85  AISEGAGSVVANAALIKKINFPPLILPVARTLSRLFNFLIHLIIILIFLIILGVEPSW-- 142

Query: 195 MQAHPIFRIVAL-LLILLFCGITYFLALRIMGFLF 228
              H +  + AL LLIL   G+   LA   +G  F
Sbjct: 143 ---HWLLLLPALLLLILFSVGLGLILA--SLGVRF 172


>gnl|CDD|148632 pfam07136, DUF1385, Protein of unknown function (DUF1385).  This
           family contains a number of hypothetical bacterial
           proteins of unknown function approximately 300 residues
           in length. Some family members are predicted to be
           metal-dependent.
          Length = 235

 Score = 29.5 bits (67), Expect = 5.7
 Identities = 11/43 (25%), Positives = 23/43 (53%)

Query: 164 GWGVFFIRLVIALLLLVIVALWGNSYFNWLGMQAHPIFRIVAL 206
           GW   ++R++  +LLL +VA        + G   +P+ +I++ 
Sbjct: 160 GWDPLWLRILSRILLLPVVAGLSYEIIKFAGKSQNPLVKILSA 202


>gnl|CDD|215222 PLN02397, PLN02397, aspartate transaminase.
          Length = 423

 Score = 29.9 bits (68), Expect = 5.8
 Identities = 14/33 (42%), Positives = 16/33 (48%), Gaps = 2/33 (6%)

Query: 135 ALSIGLGAC-LHASFLYWYLRHKRIYIPCAGWG 166
            LS G G+  L A FL  +     IYIP   WG
Sbjct: 122 CLS-GTGSLRLGAEFLARFYPGSTIYIPNPTWG 153


>gnl|CDD|220626 pfam10192, GpcrRhopsn4, Rhodopsin-like GPCR transmembrane domain.
           This region of 270 amino acids is the seven
           transmembrane alpha-helical domains included within five
           GPCRRHODOPSN4 motifs of a G-protein-coupled-receptor
           (GPCR) protein, conserved from nematodes to humans.
           GPCRs are integral membrane receptors whose
           intracellular actions are mediated by signalling
           pathways involving G proteins and downstream secondary
           messengers.
          Length = 256

 Score = 29.5 bits (67), Expect = 6.2
 Identities = 17/97 (17%), Positives = 37/97 (38%), Gaps = 16/97 (16%)

Query: 138 IGLGACLHASFLYWYLRHK-------RIYIPCAGWGVFFIRLVIALLLLVIVALWGNSYF 190
           I L        + WY+++         +Y    G+ +  +R+V+A+       L+G   +
Sbjct: 141 IVLIVVFQVILVIWYIKYFFDPASKFHLYESWPGYLLLALRIVLAIYF-----LYGL--Y 193

Query: 191 NWLGMQAHPIFRIVALLLILLFCGITYFLALRIMGFL 227
           N    +  P  +     L     G  +FL+  ++  +
Sbjct: 194 NTYKKEKSPAKK--QFYLKFGIVGTLWFLSFPVIVII 228


>gnl|CDD|200943 pfam00032, Cytochrom_B_C, Cytochrome b(C-terminal)/b6/petD. 
          Length = 102

 Score = 27.8 bits (63), Expect = 6.6
 Identities = 19/72 (26%), Positives = 37/72 (51%), Gaps = 10/72 (13%)

Query: 148 FLYWY--LRHKRIYIPCAGWGVFFIRLVIALLLLVIVALWGNSYFNWLGMQAHPIFRIVA 205
           FL+ Y  LR     IP    GV  I +V+++L+L ++     S      ++  P+F+I+ 
Sbjct: 17  FLFAYAILRS----IPNKLGGV--IAMVLSILILFLLPFLDRSKVR--SLRFRPLFKILF 68

Query: 206 LLLILLFCGITY 217
            + ++ F  +T+
Sbjct: 69  WIFVVNFLLLTW 80


>gnl|CDD|222372 pfam13777, DUF4173, Domain of unknown function (DUF4173).  This
           domain of unknown function contains multiple predicted
           transmembrane domains.
          Length = 188

 Score = 29.0 bits (66), Expect = 7.0
 Identities = 12/66 (18%), Positives = 24/66 (36%), Gaps = 13/66 (19%)

Query: 153 LRHKRIYIPCAGWGVFFIRLVIALLLLVIVALWGNSYFNWLGMQAHPIFRIVALLLILLF 212
           L + RI          F+ L+   LLLV+  ++      WL        R      +++ 
Sbjct: 97  LTYLRIL---VLA---FMLLLAIGLLLVLAKIFRGKSNRWL-------LRRNLAAGVVVL 143

Query: 213 CGITYF 218
             +++ 
Sbjct: 144 YILSFV 149


>gnl|CDD|239023 cd02072, Glm_B12_BD, B12 binding domain of glutamate mutase (Glm).
           Glutamate mutase catalysis the conversion of
           (S)-glutamate with (2S,3S)-3-methylaspartate. The
           rearrangement reaction is initiated by the extraction of
           a hydrogen from the protein-bound substrate by a
           5'-desoxyadenosyl radical, which is generated by the
           homolytic cleavage of the organometallic bond of the
           cofactor B12. Glm is a heterotetrameric molecule
           consisting of two alpha and two epsilon polypeptide
           chains.
          Length = 128

 Score = 28.2 bits (63), Expect = 7.6
 Identities = 13/35 (37%), Positives = 20/35 (57%), Gaps = 1/35 (2%)

Query: 527 SNINVIVTGV-TASLDLIKTAVDMNADAILVHHGY 560
           +  NV+  GV +   + I  A++ +ADAILV   Y
Sbjct: 26  AGFNVVNLGVLSPQEEFIDAAIETDADAILVSSLY 60


>gnl|CDD|216972 pfam02322, Cyto_ox_2, Cytochrome oxidase subunit II.  This Family
           consists of cytochrome bd type terminal oxidases that
           catalyzes Quinol dependent, Na+ independent oxygen
           uptake. Members of this family are integral membrane
           proteins andi contain a protohaem IX centre B558. One
           member of the family,cydB from Klebsiella pneumoniae, is
           implicated in having an important role in micro-aerobic
           nitrogen fixation in the enteric bacterium Klebsiella
           pneumoniae.
          Length = 327

 Score = 29.4 bits (67), Expect = 7.9
 Identities = 27/144 (18%), Positives = 46/144 (31%), Gaps = 16/144 (11%)

Query: 91  PTTLLGVTFNTILLPNLSKARIENNTEEYSA-ILDWIPIFA-HSGLALSIGLGACLHASF 148
              L GV    ++        +  +   Y+    D +  FA   GLA+       L  + 
Sbjct: 125 AAFLQGVALGNLVQG------VPFDAGNYAGGFFDLLNPFALLGGLAVV--ALYALLGAT 176

Query: 149 LYWYLRHK---RIYIPCAGWGVFFIRLVIALLLLVIVALWGNSYFNWLGMQAHPIFRIVA 205
              +L  K    +             +++ L  LV V +   +   +    + P   I+ 
Sbjct: 177 ---WLILKTEGELQERARRLAKKLGLVLLVLFALVSVWVPLAAPALFANWFSSPWLLILP 233

Query: 206 LLLILLFCGITYFLALRIMGFLFL 229
            L +L        L LR  G  FL
Sbjct: 234 ALPVLAALLAVLLLRLRREGLAFL 257


>gnl|CDD|226960 COG4597, BatB, ABC-type amino acid transport system, permease
           component [Amino acid transport and metabolism].
          Length = 397

 Score = 29.4 bits (66), Expect = 8.1
 Identities = 18/76 (23%), Positives = 32/76 (42%), Gaps = 6/76 (7%)

Query: 148 FLYWYLRHKRIYIPCAGWG----VFFIRLVIALLLLVIVALWGNSYFNWLGMQAHPIFRI 203
               +L ++ +Y P   WG     F + LV+A++  V +A W  +     G Q       
Sbjct: 166 PDLIFLSNRGLYFPSPQWGEGFIAFILALVMAIVASVFLARWAKTRQIATGQQFRTW--P 223

Query: 204 VALLLILLFCGITYFL 219
            A +LI+    +  +L
Sbjct: 224 TAAVLIIGLPLLAQWL 239


>gnl|CDD|224748 COG1835, COG1835, Predicted acyltransferases [Lipid metabolism].
          Length = 386

 Score = 29.4 bits (66), Expect = 8.2
 Identities = 20/112 (17%), Positives = 32/112 (28%), Gaps = 4/112 (3%)

Query: 119 YSAILDWIPIFAHSGLAL-SIGLGACLHASFLYWYLRHKRIYIPCAGWGVFFIRLVIALL 177
           Y      + +      AL  + L   L + FL               +G      +   L
Sbjct: 162 YLLFPLLLLLLRRVRFALALLLLLLALASFFLL-LALGPFATGAALYYGTA--TRLWEFL 218

Query: 178 LLVIVALWGNSYFNWLGMQAHPIFRIVALLLILLFCGITYFLALRIMGFLFL 229
           L  + ALW      W  + A  +  +   LL+L    I           L +
Sbjct: 219 LGALAALWRPKRRPWPALLAAALALLGLALLLLAVALIPGSSPFFPGDLLLV 270


>gnl|CDD|238227 cd00386, Heme_Cu_Oxidase_III_like, Heme-copper oxidase subunit III.
            Heme-copper oxidases are transmembrane protein
           complexes in the respiratory chains of prokaryotes and
           mitochondria which couple the reduction of molecular
           oxygen to water to, proton pumping across the membrane.
           The heme-copper oxidase superfamily is diverse in terms
           of electron donors, subunit composition, and heme types.
            This superfamily includes cytochrome c and ubiquinol
           oxidases.  Bacterial oxidases typically contain 3 or 4
           subunits in contrast to the 13 subunit bovine cytochrome
           c oxidase (CcO). Subunits I, II, and III of mammalian
           CcO are encoded within the mitochondrial genome and the
           remaining 10 subunits are encoded within the nuclear
           genome. Subunits I, II and III of ubiquinol oxidase are
           homologous to the corresponding subunits in CcO.  This
           group additionally contains proteins which are fusions
           between subunits I and III, such as Sulfolobus
           acidocaldarius SoxM, a subunit of the SoxM terminal
           oxidase complex. It also includes NorE which has been
           speculated to be a subunit of nitric oxide reductase.
           Some archaebacterial cytochrome oxidases lack subunit
           III.   Although not required for catalytic activity,
           subunit III is believed to play a role in assembly of
           the multimer complex. Rhodobacter CcO subunit III
           stabilizes the integrity of the binuclear center in
           subunit I.  It has been proposed that archaea acquired
           heme-copper oxidases through gene transfer from
           gram-positive bacteria.
          Length = 183

 Score = 28.7 bits (65), Expect = 8.2
 Identities = 19/85 (22%), Positives = 28/85 (32%), Gaps = 8/85 (9%)

Query: 146 ASFL--YWYLRHKRIYIPCAGWGVFFIRLVIALLLLVIVALWGNSYFNWLGMQAHPIFRI 203
            SF   Y++ R      P   +G     L + LL    + L G+S        A      
Sbjct: 25  GSFFWAYFHSRL----SPPVEFGAGLDPLDLPLLNTNTLLLSGSSVTWAHASLAARRGNR 80

Query: 204 VALLLILLFCGI--TYFLALRIMGF 226
               L LL   +    FL L+   +
Sbjct: 81  KKARLWLLLTILLGLAFLGLQAYEY 105


>gnl|CDD|223674 COG0601, DppB, ABC-type dipeptide/oligopeptide/nickel transport
           systems, permease components [Amino acid transport and
           metabolism / Inorganic ion transport and metabolism].
          Length = 317

 Score = 29.0 bits (66), Expect = 8.6
 Identities = 14/51 (27%), Positives = 25/51 (49%), Gaps = 7/51 (13%)

Query: 168 FFIRLVIALLLLVIVALWGNSYFNWLGM------QAHPIFRIVALLLILLF 212
             + +++ + L ++ AL   S+ + L         + P F  + LLLILLF
Sbjct: 106 LILAVLLGIPLGILAALKRGSWIDRLLRVLSLLGISIPSF-WLGLLLILLF 155


>gnl|CDD|224055 COG1132, MdlB, ABC-type multidrug transport system, ATPase and
           permease components [Defense mechanisms].
          Length = 567

 Score = 29.3 bits (66), Expect = 8.6
 Identities = 13/85 (15%), Positives = 31/85 (36%), Gaps = 5/85 (5%)

Query: 148 FLYWYLRHKRIYI-PCAGWGVFFIRLVIALLLLVIVALWGNSYFNWLGMQAHPIFRIVAL 206
            L  YL++K + +          + L++ LL+  I+    ++    LG     +  ++ L
Sbjct: 6   RLLKYLKYKLLLLAILLLLLSALLSLLLPLLIGRII----DALLADLGELLELLLLLLLL 61

Query: 207 LLILLFCGITYFLALRIMGFLFLNE 231
            L+              +G   + +
Sbjct: 62  ALLGGVLRALQSYLGSRLGQKIVAD 86


>gnl|CDD|223905 COG0835, CheW, Chemotaxis signal transduction protein [Cell
           motility and secretion / Signal transduction
           mechanisms].
          Length = 165

 Score = 28.4 bits (64), Expect = 9.2
 Identities = 14/41 (34%), Positives = 21/41 (51%), Gaps = 3/41 (7%)

Query: 375 GEERFAKKIAK--EIVHYRSITPITRTKQLVEIILKSIRGN 413
           G+E +   I K  EIV Y  ITP+      V  ++ ++RG 
Sbjct: 26  GDEEYGIDILKVREIVGYPPITPVPNAPPYVLGVI-NLRGE 65


>gnl|CDD|224287 COG1368, MdoB, Phosphoglycerol transferase and related proteins,
           alkaline phosphatase superfamily [Cell envelope
           biogenesis, outer membrane].
          Length = 650

 Score = 29.3 bits (66), Expect = 9.9
 Identities = 19/70 (27%), Positives = 25/70 (35%), Gaps = 7/70 (10%)

Query: 167 VFFIRLVIALLLL------VIVALWGNSYFNWLGMQAHPIFR-IVALLLILLFCGITYFL 219
            F I LV  L LL      +   L     F  L +        IV+LLL+LL       L
Sbjct: 20  GFGILLVFLLWLLTLLIYFLGFVLPILLLFQGLRLLFSLPILFIVSLLLLLLLFKGVDAL 79

Query: 220 ALRIMGFLFL 229
            +  +    L
Sbjct: 80  NIFRLILALL 89


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.325    0.141    0.422 

Gapped
Lambda     K      H
   0.267   0.0765    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 38,698,822
Number of extensions: 4044010
Number of successful extensions: 6597
Number of sequences better than 10.0: 1
Number of HSP's gapped: 6486
Number of HSP's successfully gapped: 278
Length of query: 733
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 629
Effective length of database: 6,324,786
Effective search space: 3978290394
Effective search space used: 3978290394
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 63 (28.0 bits)