RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy2376
         (896 letters)



>gnl|CDD|234597 PRK00050, PRK00050, 16S rRNA m(4)C1402 methyltranserfase;
           Provisional.
          Length = 296

 Score =  352 bits (907), Expect = e-115
 Identities = 118/253 (46%), Positives = 169/253 (66%), Gaps = 6/253 (2%)

Query: 392 LNEAINWLNIENERINGIYIDATFGQGGHSCKILERLGKKGRLIAIDKDTESVSLGNK-- 449
           L+E ++ L I   + +GIY+D TFG GGHS  ILERLG KGRLIAID+D ++++      
Sbjct: 8   LDEVVDALAI---KPDGIYVDGTFGGGGHSRAILERLGPKGRLIAIDRDPDAIAAAKDRL 64

Query: 450 ITDSRFSIIHNCFTELDIILKKYNIKKIDGILFDLGISSNQINNELRGFSFLLDGPLDMR 509
               RF+++H  F+ L  +L +  + K+DGIL DLG+SS Q+++  RGFSF  DGPLDMR
Sbjct: 65  KPFGRFTLVHGNFSNLKEVLAE-GLGKVDGILLDLGVSSPQLDDAERGFSFRKDGPLDMR 123

Query: 510 MDITRGISASKWLANATEFNIKKVIQDYGEERFAKKIAKEIVHYRSITPITRTKQLVEII 569
           MD ++G++A++ +   +E  + ++ ++YGEERFA++IA+ IV  R   PIT T +L EII
Sbjct: 124 MDQSQGLTAAEVVNTYSEEELARIFKEYGEERFARRIARAIVEARPKKPITTTGELAEII 183

Query: 570 LKSIRGNKRYKNPATRTFQAIRIYINQELKNLSIALKIGFKKLNIKGRIVVISFHSLEDR 629
             ++   ++  +PATRTFQA+RI +N EL+ L  AL+     L   GR+ VISFHSLEDR
Sbjct: 184 KSAVPPRRKGIHPATRTFQALRIEVNDELEELERALEAALDLLKPGGRLAVISFHSLEDR 243

Query: 630 IVKNFINFNTKIP 642
           IVK F    +K  
Sbjct: 244 IVKRFFRELSKGC 256


>gnl|CDD|223352 COG0275, COG0275, Predicted S-adenosylmethionine-dependent
           methyltransferase involved in cell envelope biogenesis
           [Cell envelope biogenesis, outer membrane].
          Length = 314

 Score =  344 bits (885), Expect = e-111
 Identities = 130/281 (46%), Positives = 178/281 (63%), Gaps = 10/281 (3%)

Query: 392 LNEAINWLNIENERINGIYIDATFGQGGHSCKILERLGKKGRLIAIDKDTESVSLGNKIT 451
           LNE +  L     + +GIYID T G GGHS  ILE+L   GRLI ID+D +++++  +  
Sbjct: 12  LNEVVELLAP---KPDGIYIDGTLGAGGHSRAILEKLPDLGRLIGIDRDPQAIAIAKERL 68

Query: 452 ---DSRFSIIHNCFTELDIILKKYNIKKIDGILFDLGISSNQINNELRGFSFLLDGPLDM 508
              D R +++H  F  L   LK+  I K+DGIL DLG+SS Q+++  RGFSF  DGPLDM
Sbjct: 69  KEFDGRVTLVHGNFANLAEALKELGIGKVDGILLDLGVSSPQLDDAERGFSFRKDGPLDM 128

Query: 509 RMDITRGISASKWLANATEFNIKKVIQDYGEERFAKKIAKEIVHYRSITPITRTKQLVEI 568
           RMD T+G+SA++ +   +E ++ ++ ++YGEERFAK+IA+ IV  R   PI  TK+L EI
Sbjct: 129 RMDQTQGLSAAEVVNTYSEEDLARIFKEYGEERFAKRIARAIVERRKKKPIETTKELAEI 188

Query: 569 ILKSIRGNKRYK--NPATRTFQAIRIYINQELKNLSIALKIGFKKLNIKGRIVVISFHSL 626
           I  +I   ++ K  +PATRTFQAIRIY+N EL+ L  AL+     L   GR+ VISFHSL
Sbjct: 189 IKSAIPAKEKRKKIHPATRTFQAIRIYVNDELEELEEALEAALDLLKPGGRLAVISFHSL 248

Query: 627 EDRIVKNFINFNTKIPHIDRRLPIYNYLFEPKMKFLARCKP 667
           EDRIVKNF    +K     + LP+       K K + + KP
Sbjct: 249 EDRIVKNFFKELSKPGV-PKGLPVTEEGPALKFKLITK-KP 287


>gnl|CDD|216706 pfam01795, Methyltransf_5, MraW methylase family.  Members of this
           family are probably SAM dependent methyltransferases
           based on Escherichia coli rsmH. This family appears to
           be related to pfam01596.
          Length = 310

 Score =  315 bits (809), Expect = e-100
 Identities = 129/281 (45%), Positives = 181/281 (64%), Gaps = 9/281 (3%)

Query: 392 LNEAINWLNIENERINGIYIDATFGQGGHSCKILERLGKKGRLIAIDKDTESVSL-GNKI 450
           L E +  LNI+    +GIYID T G GGHS  ILE+L ++GRLI ID+D ++++    ++
Sbjct: 9   LQEVVELLNIKP---DGIYIDCTLGGGGHSEAILEQLPEEGRLIGIDRDPQAIANAKERL 65

Query: 451 T--DSRFSIIHNCFTELDIILKKYNIKKIDGILFDLGISSNQINNELRGFSFLLDGPLDM 508
                R ++IH+ F  L   LK+  + K+DGILFDLG+SS Q++   RGFSF  D PLDM
Sbjct: 66  KPFKGRVTLIHSNFANLFAYLKELGVGKVDGILFDLGVSSPQLDGAERGFSFKHDAPLDM 125

Query: 509 RMDITRGISASKWLANATEFNIKKVIQDYGEERFAKKIAKEIVHYRSITPITRTKQLVEI 568
           RMD + G++A++ L   +E ++ ++ + YGEERF+K+IA+ IV  R   PI  TK+L EI
Sbjct: 126 RMDQSAGLTAAEVLNTYSEEDLARIFKKYGEERFSKRIARAIVERRKKKPIETTKELAEI 185

Query: 569 ILKSIRGNKRYK--NPATRTFQAIRIYINQELKNLSIALKIGFKKLNIKGRIVVISFHSL 626
           I K++    + K  +PATRTFQAIRIY+N EL++L  AL+     L   GR+ VISFHSL
Sbjct: 186 IKKAVPAKAKRKKIHPATRTFQAIRIYVNDELESLKEALEQAPDLLAPGGRLSVISFHSL 245

Query: 627 EDRIVKNFINFNTKIPHIDRRLPIYNYLFEPKMKFLARCKP 667
           EDRIVKNF    +K   +   LP+     EP +K + + KP
Sbjct: 246 EDRIVKNFFREYSKGCQVPAGLPVILEGLEPLLKIITK-KP 285


>gnl|CDD|240528 cd13123, MATE_MurJ_like, MurJ/MviN, a subfamily of the multidrug
           and toxic compound extrusion (MATE)-like proteins.
           Escherichia coli MurJ (MviN) has been identified as
           essential for murein biosynthesis. It has been suggested
           that MurJ functions as the peptidoglycan lipid II
           flippase which is involved in translocation of
           lipid-anchored peptidoglycan precursors across the
           cytoplasmic membrane, though results obtained in
           Bacillus subtilis seem to indicate that its MurJ
           homologs are not essential for growth. Some MviN family
           members (e.g. in Mycobacterium tuberculosis) possess an
           extended C-terminal region that contains an
           intracellular pseudo-kinase domain and an extracellular
           domain resembling carbohydrate-binding proteins.
           Proteins from the MATE family are involved in exporting
           metabolites across the cell membrane and are often
           responsible for multidrug resistance (MDR).
          Length = 420

 Score =  287 bits (737), Expect = 2e-88
 Identities = 118/303 (38%), Positives = 172/303 (56%), Gaps = 31/303 (10%)

Query: 11  SSITLFSRITGLFREILFARVFGASIYTDAFNIAFRIPNLLRRLFAEGAFSQAFVPILTE 70
           +  TL SRI G  R++L A  FGA    DAF +AFRIPNLLRRLFAEGA S AFVP+ +E
Sbjct: 1   AIGTLLSRILGFVRDVLIAAAFGAGPLADAFFVAFRIPNLLRRLFAEGALSAAFVPVFSE 60

Query: 71  YKNKRGNIATKKLIDHVATVLIWFMFFICVFGIIMAPIIVYLVATGFIDNYKIFNITVSL 130
           Y  K G    ++    V T+L+  +  + + GI+ AP++V L+A GF  +   F + V+L
Sbjct: 61  YLEKEGEEEARRFASRVLTLLLLVLLLLTLLGILFAPLLVKLLAPGFSGDK--FELAVAL 118

Query: 131 TRIMFPYVGFMAFITLASSILNIWSQFKIPAFTPILLNISC------------------- 171
           TRIMFPY+ F++   L   ILN   +F  PA  P+LLN+                     
Sbjct: 119 TRIMFPYLLFISLSALLGGILNAHGRFFAPALAPVLLNLVIIAGLLLLAPLFDLGIYALA 178

Query: 172 ------GALQIIIQIPSLIKIGMFPHIKLNPSHGFKNIAVRRILKKMGPSVFSVFAAQIS 225
                 G LQ+++Q+P+L ++G     +L+    F++  VRR+LK M P++  V  +QI+
Sbjct: 179 WGVLLGGVLQLLVQLPALRRLGFRFRPRLD----FRDPGVRRVLKLMLPALLGVSVSQIN 234

Query: 226 LMLNTNIASQMREGSLSCLSYADRLMEFPTTLLGVTFNTILLPNLSKARIENNTEEYSAI 285
           L+++T +AS + EGS+S L YADRL + P  + GV   T LLP LS+     + +E+   
Sbjct: 235 LLVDTILASFLPEGSISALYYADRLYQLPLGIFGVAIATALLPRLSRLAAAGDLDEFRRT 294

Query: 286 LDW 288
           L  
Sbjct: 295 LSR 297



 Score = 33.6 bits (78), Expect = 0.53
 Identities = 12/31 (38%), Positives = 16/31 (51%)

Query: 289 IPIFAHSGLALSIGLGACLHASFLYWYLRHK 319
           I    H GLAL+  L A ++A  L   LR +
Sbjct: 390 IKPLGHVGLALATSLSAWVNALLLLVLLRRR 420


>gnl|CDD|232778 TIGR00006, TIGR00006, 16S rRNA
           (cytosine(1402)-N(4))-methyltransferase.  This model
           describes RsmH, a 16S rRNA methyltransferase.
           Previously, this gene was designated MraW, known to be
           essential in E. coli and widely conserved in bacteria
           [Cell envelope, Biosynthesis and degradation of murein
           sacculus and peptidoglycan].
          Length = 307

 Score =  283 bits (725), Expect = 2e-88
 Identities = 131/279 (46%), Positives = 184/279 (65%), Gaps = 8/279 (2%)

Query: 392 LNEAINWLNIENERINGIYIDATFGQGGHSCKILERLGKKGRLIAIDKDTESVSLGNKI- 450
           L+E +  LNI   + +GIYID T G+GGHS  IL +LG+ GRLI ID+D ++++   +I 
Sbjct: 9   LDEVVEGLNI---KPDGIYIDGTLGRGGHSKAILSQLGE-GRLIGIDRDPQAIAFAKEIL 64

Query: 451 TDSRFSIIHNCFTELDIILKKYNIKKIDGILFDLGISSNQINNELRGFSFLLDGPLDMRM 510
           +D RF IIH+ F      LK+  + KIDGIL DLG+SS Q+++  RGFSF+ DGPLDMRM
Sbjct: 65  SDLRFVIIHDNFANFFEYLKELLVTKIDGILVDLGVSSPQLDDPERGFSFMHDGPLDMRM 124

Query: 511 DITRGISASKWLANATEFNIKKVIQDYGEERFAKKIAKEIVHYRSITPITRTKQLVEIIL 570
           D ++ +SA++ L   +E +++ +++ YGEERF+K+IA+ IV  R   PI  TK+L E+I 
Sbjct: 125 DQSQKLSAAEILNTYSEEDLEWILKKYGEERFSKRIARAIVERRKKKPIQTTKELAEVIS 184

Query: 571 KSIRGNKRYK--NPATRTFQAIRIYINQELKNLSIALKIGFKKLNIKGRIVVISFHSLED 628
           K++ G  +YK  +PATR FQAIRIY+N EL+ L  AL+     L   GR+ +ISFHSLED
Sbjct: 185 KAVPGFSKYKSIHPATRVFQAIRIYVNDELEELEEALQFAPNLLAPGGRLSIISFHSLED 244

Query: 629 RIVKNFINFNTKIPHIDRRLPIYNYLFEPKMKFLARCKP 667
           RIVKNF    +K P +   LP+         K + + KP
Sbjct: 245 RIVKNFFRELSKGPQVPPGLPVKEEQLYALGKRITK-KP 282


>gnl|CDD|223800 COG0728, MviN, Uncharacterized membrane protein, putative virulence
           factor [General function prediction only].
          Length = 518

 Score =  283 bits (727), Expect = 7e-86
 Identities = 129/314 (41%), Positives = 184/314 (58%), Gaps = 32/314 (10%)

Query: 1   MSLHKTFITISSITLFSRITGLFREILFARVFGASIYTDAFNIAFRIPNLLRRLFAEGAF 60
           MSL K+ I +SS TL SRI G  R++L A  FGA    DAF +AF++PNLLRRLFAEGAF
Sbjct: 7   MSLLKSLIIVSSATLLSRILGFVRDVLIAAAFGAGAAADAFFVAFKLPNLLRRLFAEGAF 66

Query: 61  SQAFVPILTEYKNKRGNIATKKLIDHVATVLIWFMFFICVFGIIMAPIIV-YLVATGFID 119
           S AFVP+L E K K G  A +     V  +L   +  + + GI+ AP +V  L+A GF +
Sbjct: 67  SSAFVPVLAEAKKKEGEEAARFFSRLVTGLLTLVLLLVTLLGILFAPWLVRLLLAPGFDE 126

Query: 120 NYKIFNITVSLTRIMFPYVGFMAFITLASSILNIWSQFKIPAFTPILLNISC-------- 171
             K   + V LTRI+FPY+ F++   L  +ILN  ++F IPAF P+LLN+S         
Sbjct: 127 TDKF--LAVLLTRILFPYLLFISLSALFGAILNSRNRFFIPAFAPVLLNVSVIGLALFLG 184

Query: 172 -----------------GALQIIIQIPSLIKIGMFPHIKLNPSHGFKNIAVRRILKKMGP 214
                            G LQ+++Q+P+L K G+    +      FK+  ++R LK M P
Sbjct: 185 PYFDPPLLALAWGVLIGGLLQLLVQLPALRKAGLLIKPRFG----FKDPGLKRFLKLMLP 240

Query: 215 SVFSVFAAQISLMLNTNIASQMREGSLSCLSYADRLMEFPTTLLGVTFNTILLPNLSKAR 274
           ++  V  +QI+L+++T IAS + EGS+S L YADRL + P  + GV  +T+LLP+LS+  
Sbjct: 241 ALLGVSISQINLLIDTAIASFLAEGSVSWLYYADRLYQLPLGIFGVALSTVLLPSLSRHA 300

Query: 275 IENNTEEYSAILDW 288
              +  E+  +LDW
Sbjct: 301 ANGDWPEFLKLLDW 314



 Score = 63.5 bits (155), Expect = 3e-10
 Identities = 34/110 (30%), Positives = 54/110 (49%), Gaps = 2/110 (1%)

Query: 289 IPIFAHSGLALSIGLGACLHASFLYWYLRHKRIYIPCAGWGVFFI-RLVIALLLLVIVAL 347
           IP   H GLAL+  L A ++A  LY+ LR + +Y+P  GWG+F I +L++A  ++    L
Sbjct: 407 IPPLGHVGLALATSLAAWVNALLLYYLLRKRLVYLPGRGWGLFLILKLLLASAIMAAALL 466

Query: 348 WGNSYFNWLGMQAHPIFRIVALLLIL-LFCGITYFLALRIMGFLFLNEAI 396
                     +    +   + +LL+L L     YF  L ++GF     A 
Sbjct: 467 ALLHLAQREWLLGGLLLIRLGVLLLLVLLGAGVYFAMLLLLGFRVRGFAR 516


>gnl|CDD|182741 PRK10799, PRK10799, metal-binding protein; Provisional.
          Length = 247

 Score =  239 bits (611), Expect = 8e-73
 Identities = 103/220 (46%), Positives = 139/220 (63%), Gaps = 13/220 (5%)

Query: 670 KLNIKKYEDYCPNGLQVEGRSNINVIVTGVTASLDLIKTAVDMNADAILVHHGYFWKGEN 729
           KLN     DY PNGLQVEGR  +  IVTGVTAS  L+  AV + ADA++VHHGYFWKGE+
Sbjct: 13  KLNSAAISDYAPNGLQVEGRETVQKIVTGVTASQALLDEAVRLQADAVIVHHGYFWKGES 72

Query: 730 SNIVGIKKKRLEQLIINKINLYAYHLPLDMHPKLGNNAQLAKILNFSCTRRFSKNNIGWI 789
             I G+K+ RL+ L+ N INLY +HLPLD HP+LGNNAQLA +L            I  +
Sbjct: 73  PVIRGMKRNRLKTLLANDINLYGWHLPLDAHPELGNNAQLAALL-----------GITVM 121

Query: 790 GKIINLKRYN-FKKIITIKDLFHHITRKIGKKPIVIGDLN-KKIYEIGWCTGAAQNLLTD 847
           G+I  L  +      +   +L   I  ++G+KP+  GD   + +  + WCTG  Q+ +  
Sbjct: 122 GEIEPLVPWGELTMPVPGLELASWIEARLGRKPLWCGDTGPEVVQRVAWCTGGGQSFIDS 181

Query: 848 AINEGVTAYISGEISESTVYISRESGVAYFAAGHHATERY 887
           A   GV A+I+GE+SE T++ +RE G+ ++AAGHHATER 
Sbjct: 182 AARFGVDAFITGEVSEQTIHSAREQGLHFYAAGHHATERG 221


>gnl|CDD|233536 TIGR01695, mviN, integral membrane protein MviN.  This model
           represents MviN, a family of integral membrane proteins
           predicted to have ten or more transmembrane regions.
           Although frequently listed as a virulence protein, it is
           not restricted to pathogens and it is an essential
           protein in Sinorhizobium meliloti. In a number of
           species its gene is adjacent to that of the
           uridylyltransferase GlnD , the signal-transducing enzyme
           that performs the key modification to the nitrogen
           regulatory protein PII [Unknown function, General].
          Length = 502

 Score =  227 bits (582), Expect = 2e-65
 Identities = 118/312 (37%), Positives = 170/312 (54%), Gaps = 32/312 (10%)

Query: 2   SLHKTFITISSITLFSRITGLFREILFARVFGASIYTDAFNIAFRIPNLLRRLFAEGAFS 61
           SL K+ + +S  TLFSRITG  R+ + A  FGA +  DAFN+AF IPN  RRLFAEGAF+
Sbjct: 1   SLLKSTLIVSLGTLFSRITGFVRDAIIASAFGAGLTADAFNVAFVIPNFFRRLFAEGAFN 60

Query: 62  QAFVPILTEYKNKRGNIATKKLIDHVATVLIWFMFFICVFGIIMAPIIVYLVATGFIDNY 121
            AFVP+ T+ K K    A +   + V T+LI  +  + + GI  AP ++ L+A GF D  
Sbjct: 61  SAFVPVFTKAKKK-EKEARRAFANTVTTLLILSLLLVVLIGIFFAPFVISLLAPGFADET 119

Query: 122 KIFNITVSLTRIMFPYVGFMAFITLASSILNIWSQFKIPAFTPILLNISC---------- 171
           +     VSLTRIMFPY+  ++   +   ILN   +F IP+F+PIL NI            
Sbjct: 120 RSL--AVSLTRIMFPYLLLISLAAVFGGILNARKRFFIPSFSPILFNIGVILSLLFFDWN 177

Query: 172 ---------------GALQIIIQIPSLIKIGMFPHIKLNPSHGFKNIAVRRILKKMGPSV 216
                          G  Q++IQ+P L K G     + N         ++R LK   P+ 
Sbjct: 178 YGQYSLALAIGVLIGGVAQLLIQLPFLRKAGFLLKPRFNFRDP----GLKRFLKLFLPTT 233

Query: 217 FSVFAAQISLMLNTNIASQMREGSLSCLSYADRLMEFPTTLLGVTFNTILLPNLSKARIE 276
               A+QI+L++NT +AS +  GS+S L YA+R+ + P  + G++ +T+LLP LS+   E
Sbjct: 234 LGSSASQITLLINTALASFLEIGSVSALYYANRIYQLPLGIFGISLSTVLLPKLSRHASE 293

Query: 277 NNTEEYSAILDW 288
            N  E   +L+ 
Sbjct: 294 GNWNELRDLLNQ 305



 Score = 62.7 bits (153), Expect = 5e-10
 Identities = 24/108 (22%), Positives = 41/108 (37%), Gaps = 4/108 (3%)

Query: 289 IPIFAHSGLALSIGLGACLHASFLYWYLRHKRIYIPCAGWGVFFIRLVIALLLLVIVALW 348
           I      G+AL+    + + +  LY  L  +   I   G      +LVIA  ++  V   
Sbjct: 398 IFPLGLVGIALATSAASMVSSVLLYLMLNRRLKGILPFGVLKVLAKLVIASAIIGGVLYL 457

Query: 349 GNSYFNWLGMQAHPIFRIVALLLILLFCGITYFLALRIMGFLFLNEAI 396
             S      +  + +  +   L+ LL     YFL L ++    LN  +
Sbjct: 458 IVSVSLGGVLVKNLLGLLAIGLIGLLV----YFLGLALLKIEELNLLL 501


>gnl|CDD|217323 pfam03023, MVIN, MviN-like protein.  Deletion of the mviN virulence
           gene in Salmonella enterica serovar. Typhimurium greatly
           reduces virulence in a mouse model of typhoid-like
           disease. Open reading frames encoding homologues of MviN
           have since been identified in a variety of bacteria,
           including pathogens and non-pathogens and
           plant-symbionts. In the nitrogen-fixing symbiont
           Rhizobium tropici, mviN is required for motility. The
           MviM protein is predicted to be membrane-associated.
          Length = 452

 Score =  203 bits (519), Expect = 2e-57
 Identities = 112/315 (35%), Positives = 174/315 (55%), Gaps = 42/315 (13%)

Query: 28  FARVFGASIYTDAFNIAFRIPNLLRRLFAEGAFSQAFVPILTEYKNKRGNIATKKLIDHV 87
            A  FGA    DAFN+AFRIPNLLRRLFAEGAFS AFVP+  E K    +    + +  V
Sbjct: 1   IAAYFGAGPLADAFNVAFRIPNLLRRLFAEGAFSSAFVPVFAELKQADKD-EAAEFVRRV 59

Query: 88  ATVLIWFMFFICVFGIIMAPIIVYLVATGFIDNYKIFNITVSLTRIMFPYVGFMAFITLA 147
           +T+LI  +  + + GI+ AP ++ L+A GF  + + F++TVSL RI FPY+  ++   + 
Sbjct: 60  STLLIGVLLLVTLLGILAAPWVIRLLAPGF--DAETFSLTVSLLRITFPYLLLVSLSAVF 117

Query: 148 SSILNIWSQFKIPAFTPILLNISC--------------------------GALQIIIQIP 181
            ++LN   +F  PAF+P+LLNI                            G LQ+++Q+P
Sbjct: 118 GAVLNARKKFFAPAFSPVLLNIGVILTLLFLANYFGREPIYGLAIGVLIGGVLQLLVQLP 177

Query: 182 SLIKIGMFPHIKLNPSHGFKNIAVRRILKKMGPSVFSVFAAQISLMLNTNIASQMREGSL 241
            L K G+    + N    F++  V+R L    P++  V  +Q++L+++T +AS + EGS+
Sbjct: 178 FLRKAGLLLKPRFN----FRDKGVKRFLALALPTLLGVSVSQLNLLIDTALASFLAEGSI 233

Query: 242 SCLSYADRLMEFPTTLLGVTFNTILLPNLSKARIENNTEEYSAILDWIPIFAHSGLALSI 301
           S L YADR+ + P  + GV+ +T+LLP LS+   + + +E+  +LD          A+ +
Sbjct: 234 SYLYYADRIYQLPLGIFGVSVSTVLLPRLSRHAKDGDWDEFLDLLDQ---------AIRL 284

Query: 302 GLGACLHASFLYWYL 316
            L   + ASF    L
Sbjct: 285 TLLLMIPASFGLLVL 299



 Score = 50.0 bits (120), Expect = 4e-06
 Identities = 26/87 (29%), Positives = 40/87 (45%), Gaps = 7/87 (8%)

Query: 289 IPIFAHSGLALSIGLGACLHASFLYWYLRHKRIYIPCAGWGVFFIRLVIALLLLVIVALW 348
           +P     GLAL+  L A ++  FLY+ LR +       GW  F   LV+   L+  V L 
Sbjct: 373 LPPLGVVGLALATSLSAWINLVFLYYILRKRLGGHSLRGWKTFLASLVVPTALMGGVILL 432

Query: 349 GNSYFNWLGMQAHPIFRIVALLLILLF 375
            +S           ++ + +LLLIL+ 
Sbjct: 433 LSSLT-------QGLWVVGSLLLILVG 452


>gnl|CDD|223404 COG0327, COG0327, Uncharacterized conserved protein [Function
           unknown].
          Length = 250

 Score =  194 bits (494), Expect = 2e-56
 Identities = 79/222 (35%), Positives = 126/222 (56%), Gaps = 12/222 (5%)

Query: 669 KKLNIKKYEDYCPNGLQVEG-RSNINVIVTGVTASLDLIKTAVDMNADAILVHHG-YFWK 726
           + L  +  ED+  NGLQV G    +  +   + A+L ++  A+++ AD ++VHH   FW 
Sbjct: 12  EFLPPELAEDWDNNGLQVGGPLEEVKKVAVALDATLAVLDEAIELGADLLIVHHPLIFWP 71

Query: 727 GENSNIVGIKKKRLEQLIINKINLYAYHLPLDMHPKLGNNAQLAKILNFSCTRRFSKNNI 786
            ++  I G K KR++ LI N INLYA H  LD HP+ GN+A  A +L       F    +
Sbjct: 72  VKSLAIDGPKGKRIKALIQNDINLYAAHTNLDAHPEGGNDALAAALLGAEELPPF-GEGL 130

Query: 787 GWIGKIINLKRYNFKKIITIKDLFHHITRKIGKKPI-VIGDLNKKIYEIGWCTGAAQNLL 845
           G +G          K+  T+++L   I  K+G+ P+ V+ D +++I  +  C+G+ Q  L
Sbjct: 131 GRVG--------ELKEPTTLEELAERIKAKLGRPPLRVVKDGSEEIKRVAVCSGSGQGFL 182

Query: 846 TDAINEGVTAYISGEISESTVYISRESGVAYFAAGHHATERY 887
           ++A  EGV AYI+G++S  T + +RE G++   AGH+ATER 
Sbjct: 183 SEAAAEGVDAYITGDLSHHTAHDARELGLSVIDAGHYATERP 224


>gnl|CDD|213534 TIGR00486, YbgI_SA1388, dinuclear metal center protein, YbgI/SA1388
           family.  The characterization of this family of
           uncharacterized proteins as orthologous is tentative.
           Members are found in all three domains of life. Several
           members (from Bacillus subtilis, Listeria monocytogenes,
           and Mycobacterium tuberculosis - all classified as
           Firmicutes within the Eubacteria) share a long insert
           relative to other members [Unknown function, General].
          Length = 249

 Score =  182 bits (465), Expect = 2e-52
 Identities = 73/221 (33%), Positives = 111/221 (50%), Gaps = 11/221 (4%)

Query: 669 KKLNIKKYEDYCPNGLQVE-GRSNINVIVTGVTASLDLIKTAVDMNADAILVHHGYFWKG 727
           + L  +  ED   NGLQV  G   +  +V  V AS  +   AV + AD I+ HH   WK 
Sbjct: 12  RFLPKELAEDGDNNGLQVGSGNEEVKKVVVAVDASESVADEAVRLGADLIITHHPLIWKP 71

Query: 728 ENSNIVGIKKKRLEQLIINKINLYAYHLPLDMHPKLGNNAQLAKILNFSCTRRFSKNNIG 787
               I GIK  RL+ L+ N I+LY+ H  LD H   GNN  LA+ L     + F    +G
Sbjct: 72  LKRLIRGIKPGRLKILLQNDISLYSAHTNLDAHD-GGNNDALARALGLENPKEFEDYGLG 130

Query: 788 WIGKIINLKRYNFKKIITIKDLFHHITRKIGKKPIVIGDLNK-KIYEIGWCTGAAQNLLT 846
            +G+        FK  I   +    I + +  KP+++       + ++   +G+  + + 
Sbjct: 131 RVGE--------FKAPIESLEEVLEIKKVLNVKPLLVVKNGPEYVKKVAVVSGSGLSFIM 182

Query: 847 DAINEGVTAYISGEISESTVYISRESGVAYFAAGHHATERY 887
            A+ EGV AYI+G++S  T +++RE G+    AGH+ATER 
Sbjct: 183 KALREGVDAYITGDLSHHTAHLARELGLNVIDAGHYATERG 223


>gnl|CDD|216697 pfam01784, NIF3, NIF3 (NGG1p interacting factor 3).  This family
           contains several NIF3 (NGG1p interacting factor 3)
           protein homologues. NIF3 interacts with the yeast
           transcriptional coactivator NGG1p which is part of the
           ADA complex, the exact function of this interaction is
           unknown.
          Length = 237

 Score =  181 bits (461), Expect = 6e-52
 Identities = 63/227 (27%), Positives = 113/227 (49%), Gaps = 17/227 (7%)

Query: 669 KKLNIKKYEDYCPNGLQVEGRSN-INVIVTGVTASLDLIKTAVDMNADAILVHHGYFWKG 727
           +       ED+   GLQV      +  I+  + A+ ++I+ A++  AD I+ HH   +K 
Sbjct: 8   ELAPPSLAEDWDNVGLQVGDPEGEVKKILVALDATEEVIEEAIENGADLIITHHPLIFKP 67

Query: 728 ENSNI-VGIKKKRLEQLIINKINLYAYHLPLDMHPKLGNNAQLAKILNFSCTRRFSKN-- 784
                    K +R+ +LI N I+LY+ H  LD  P  G N  LA++L          +  
Sbjct: 68  LKRITGDDPKGRRILKLIKNNISLYSAHTNLDAAPG-GVNDWLAELLGLESIEPLEPSVS 126

Query: 785 ---NIGWIGKIINLKRYNFKKIITIKDLFHHITRKIGKKPI-VIGDLNKKIYEIGWCTGA 840
               +G IG++         + I++++L   +  ++G + + V+GDL+K I ++  C G+
Sbjct: 127 EGEGLGRIGEL--------PEPISLEELAKRVKERLGLEGVRVVGDLDKPIKKVAICGGS 178

Query: 841 AQNLLTDAINEGVTAYISGEISESTVYISRESGVAYFAAGHHATERY 887
             +L+ +A  +G    I+GE+       ++E G+A   AGH+ATER+
Sbjct: 179 GGSLIEEAKAKGADLLITGELKHHDALDAKEKGIALIDAGHYATERF 225


>gnl|CDD|225153 COG2244, RfbX, Membrane protein involved in the export of O-antigen
           and teichoic acid [General function prediction only].
          Length = 480

 Score = 42.0 bits (99), Expect = 0.001
 Identities = 47/305 (15%), Positives = 101/305 (33%), Gaps = 36/305 (11%)

Query: 1   MSLHKTFITISSITLFSRITGLFREILFARVFGASIYTDAFNIAFRIPNLLRRLFAEGAF 60
             L K  + +   TL S + GL    L AR+ G   +   + +A  I  L     A+   
Sbjct: 6   KKLIKGALWLLLGTLISALLGLITIPLLARLLGPEGF-GLYALALAIIGLFSI-LADFGL 63

Query: 61  SQAFVPILTEYKNKRGNIATKKLIDHVATVLIWFMFFICVFGIIMAPIIVYLVATGFIDN 120
             A    + EY+ K   +        +  +L   +  +    +++  +++  +       
Sbjct: 64  PAAITREIAEYREKGEYL--------LLILLSVLLLLLLALILLLLLLLIAYLLAPIDPV 115

Query: 121 YKIFNITVSLTRIMFPYVGFMAFITLASSILNIWSQFKIPAFTPILLNISCGALQIIIQI 180
             +      L RI+   +  +   ++   +   + +F   A + +       A+  ++  
Sbjct: 116 LAL------LLRILSLALLLLPLSSVLRGLFQGFGRFGPLALSIVSSIFLLAAVFALLFA 169

Query: 181 PS------------------LIKIGMFPHIKLNPSHGFKNIAVRRILKKMGPSVFSVFAA 222
                               LI +G        P   F    ++ +L+   P + S    
Sbjct: 170 ALGLAVWALVLGAVVSLLVLLILLGKKKRGLKRPILRFSLALLKELLRFGLPLLLSSLLN 229

Query: 223 QISLMLNTNIASQMR-EGSLSCLSYADRLMEFPTTLLGVTFNTILLPNLSKARIENNTEE 281
            +   ++T +         +   S A RL+     +     N +L P LS+A  E + + 
Sbjct: 230 FLFTNIDTLLLGLFLGPAQVGIYSAAQRLVSLLLIVAS-ALNRVLFPALSRAYAEGDRKA 288

Query: 282 YSAIL 286
              +L
Sbjct: 289 LKKLL 293



 Score = 34.0 bits (78), Expect = 0.37
 Identities = 29/192 (15%), Positives = 73/192 (38%), Gaps = 23/192 (11%)

Query: 5   KTFITISSITLFSRITGLFRE----ILFARVFGAS---IYTDAFNIAFRIPNLLRRLFAE 57
           K  +      L S +          +L     G +   IY+ A  +   +  +       
Sbjct: 213 KELLRFGLPLLLSSLLNFLFTNIDTLLLGLFLGPAQVGIYSAAQRLVSLLLIVA------ 266

Query: 58  GAFSQAFVPILTEYKNKRGNIATKKLIDHVATVLIWFMFFICVFGIIMAPIIVYLVATGF 117
            A ++   P L+    +    A KKL+     +L+       +  +++AP I+ L+    
Sbjct: 267 SALNRVLFPALSRAYAEGDRKALKKLLRQSLKLLLLISIPALLGLLLLAPPIITLLFG-- 324

Query: 118 IDNYKIFNITVSLTRIMFPYVGFMAFITLASSILNIWSQFKIPAFTPILLNISCGALQII 177
               + +     + +++     F++ ++L SS+L    + ++      L+++    L +I
Sbjct: 325 ----EKYASAAPILQLLALAGLFLSLVSLTSSLLQALGKQRLLL----LISLISALLNLI 376

Query: 178 IQIPSLIKIGMF 189
           + +  + + G+ 
Sbjct: 377 LNLLLIPRFGLI 388


>gnl|CDD|240527 cd12082, MATE_like, Multidrug and toxic compound extrusion family
           and similar proteins.  The integral membrane proteins
           from the MATE family are involved in exporting
           metabolites across the cell membrane and are responsible
           for multidrug resistance (MDR) in many bacteria and
           animals. MATE has also been identified as a large
           multigene family in plants, where the proteins are
           linked to disease resistance. A number of family members
           are involved in the synthesis of peptidoglycan
           components in bacteria.
          Length = 420

 Score = 38.5 bits (90), Expect = 0.014
 Identities = 26/146 (17%), Positives = 50/146 (34%), Gaps = 14/146 (9%)

Query: 11  SSITLFSRITGLFREILFARVFGASIYTDAFNIAFRIPNLLRRLFAEGAFSQAFVPILTE 70
           S+I       GL   +     FG +    A+ +AFRI +L          +QA +P++ +
Sbjct: 228 SAIQNSLLSLGLLIIVAIVAAFGGAAALAAYTVAFRIASLAFMPA--LGLAQAALPVVGQ 285

Query: 71  Y----KNKRGNIATKKLIDHVATVLIWFMFFICVFGIIMAPIIVYLVATGFIDNYKIFNI 126
                   R    T   +     +       I +F   +  +        F ++ +   +
Sbjct: 286 NLGAGNFDRARRITWVALLIGLLIGAVLGLLILLFPEPILSL--------FTNDPEFLEL 337

Query: 127 TVSLTRIMFPYVGFMAFITLASSILN 152
            VS  RI+  Y  F+    +  +   
Sbjct: 338 AVSYLRILAIYYLFVGIGYVLQAGFQ 363



 Score = 35.4 bits (82), Expect = 0.12
 Identities = 17/130 (13%), Positives = 43/130 (33%), Gaps = 6/130 (4%)

Query: 40  AFNIAFRIPNLLRRLFAEGAFSQAFVPILTEYKNKRGNIATKKLIDHVATVLIWFMFFIC 99
           A  +AF +  LL  L      S     ++++          ++++     + I     + 
Sbjct: 32  AVGLAFPLIALLIALGV--GLSVGTSALISQAIGAGDEEKARRVLVQSIVLAILLGLLLA 89

Query: 100 VFGIIMAPIIVYLVATGFIDNYKIFNITVSLTRIMFPYVGFMAFITLASSILNIWSQFKI 159
              +  +P+I+ L         ++  +  +   I+   +       + S IL      + 
Sbjct: 90  ALLLFFSPLILSL----LGAEEEVIELAATYLTILILGLPITFLGAVLSGILQGEGDTRT 145

Query: 160 PAFTPILLNI 169
                +L N+
Sbjct: 146 AMIISVLSNL 155


>gnl|CDD|218341 pfam04947, Pox_VLTF3, Poxvirus Late Transcription Factor VLTF3
           like.  Members of this family are approximately 26 KDa,
           and are involved in trans-activator of late
           transcription.
          Length = 171

 Score = 36.1 bits (84), Expect = 0.027
 Identities = 27/137 (19%), Positives = 51/137 (37%), Gaps = 17/137 (12%)

Query: 544 KKIAKEIVHYRSITPITRTKQLVEIILKSIRGNKRYKNPATRTFQAIRIYINQELKNLSI 603
           KK+ K  +    +T        V   LK +  +K Y N     F  + +   +   NLS 
Sbjct: 42  KKLKKYNIDTDDVT-----TTKVLDFLKKLGYSKYYDN----VFLILNMLTGKPPPNLSS 92

Query: 604 ALKIGFKKLNIKGRIVVISFHSLEDRIVKNFINFNTKIPHIDRRLPIYNYLFEPKMKFLA 663
            L+   +++    + +   F  L  R  KNFI+++     + + L +  Y        L 
Sbjct: 93  ELE---ERIKEIFKELQKFFDKLNTR--KNFISYSY---VLYKLLELLGYDENLPDFPLL 144

Query: 664 RCKPKKKLNIKKYEDYC 680
           +   K     + ++  C
Sbjct: 145 KSLTKLNSQDEIWKKLC 161


>gnl|CDD|240532 cd13127, MATE_tuaB_like, Uncharacterized subfamily of the multidrug
           and toxic compound extrusion (MATE) proteins.  This
           family might function as a translocase for
           lipopolysaccharides and participate in the biosynthesis
           of cell wall components such as teichuronic acid. The
           integral membrane proteins from the MATE family are
           involved in exporting metabolites across the cell
           membrane and are responsible for multidrug resistance
           (MDR) in many bacteria and animals. A number of family
           members are involved in the synthesis of peptidoglycan
           components in bacteria.
          Length = 406

 Score = 37.1 bits (87), Expect = 0.039
 Identities = 38/254 (14%), Positives = 89/254 (35%), Gaps = 48/254 (18%)

Query: 54  LFAEGAFSQAFVPILTEYKNKRGNIATKKLIDHVATVLIWFMFFICVFGIIMAPIIVYLV 113
           LF++   S A +         +    T++ +  V  + +     + +   ++AP+I    
Sbjct: 50  LFSDLGLSSALI---------QRKDLTEEDLSTVFWLNLLLGLLLYLLLFLLAPLIA--- 97

Query: 114 ATGFIDNYKIFNITVSLTRIMFPYVGFMAFITLASSILNIWSQFKIPAFTPILLNISCGA 173
              F +        + L R++          ++  ++L    +FK  A   ++  +  G 
Sbjct: 98  --AFYNE----PELIPLLRVLSLSFLLSGLGSVPRALLQRELRFKKLAIIELISTLVSGV 151

Query: 174 LQI------------IIQ--IPSLIK-IGMFPHIKLNPSHGFKNIAVRRILKKMGPSVFS 218
           + I            + Q  + +L+  + ++   +  P   F   +++ +L       F 
Sbjct: 152 VAIVLALLGFGVWALVAQQLVGALVSTVLLWILSRWRPRLSFSLASLKELLS------FG 205

Query: 219 --VFAAQISLMLNTN-----IASQMREGSLSCLSYADRLMEFPTTLLGVTFNTILLPNLS 271
             +  + +    + N     I   +   +L   S A RL   P   +      +L P LS
Sbjct: 206 SKLLGSNLLNYFSRNLDNLIIGKFLGAAALGYYSRAYRLALLPVQNITGPITRVLFPALS 265

Query: 272 KARIENNTEEYSAI 285
             R++++ E     
Sbjct: 266 --RLQDDPERLRRA 277



 Score = 30.5 bits (70), Expect = 3.8
 Identities = 15/90 (16%), Positives = 38/90 (42%), Gaps = 8/90 (8%)

Query: 26  ILFARVFGASIYTDAFNIAFRIPNLLRRLFAEGAFSQAFVPILTEYKNKRGNI--ATKKL 83
           ++  +  GA+     ++ A+R+  L  +    G  ++   P L+  ++    +  A  K 
Sbjct: 224 LIIGKFLGAA-ALGYYSRAYRLALLPVQNIT-GPITRVLFPALSRLQDDPERLRRAYLKA 281

Query: 84  IDHVATVLIWFMFFICVFGIIMAPIIVYLV 113
           +  +A +    M  +     ++A  +V L+
Sbjct: 282 LRLLALITFPLMVGL----ALLAEPLVLLL 307


>gnl|CDD|225795 COG3256, NorB, Nitric oxide reductase large subunit [Inorganic ion
           transport and metabolism].
          Length = 717

 Score = 34.7 bits (80), Expect = 0.22
 Identities = 33/142 (23%), Positives = 43/142 (30%), Gaps = 14/142 (9%)

Query: 260 VTFNTILLPNLSKARIENNTEEYSAILDWIPIFAHSGLALSIGLGACLHASFLYWYLRHK 319
           +     LLP L   +     E  S  L     F   G      LG        +W+    
Sbjct: 315 LAAGLYLLPELGGPK---FQELGSPKLLIALFFVVVGSLAGAWLGYLQLLPAPFWFWFGH 371

Query: 320 RIYIPCAGWGVFFIRLVIALLLLVIVALWGNSYFNWLGMQAHPIFRIVALLLILLFCGIT 379
           + Y    G G    +L++   LLV VAL      N          R   L+ ILL     
Sbjct: 372 QGY-EYLGRG-RLWQLLLIKGLLVWVALMFR--ANV-TATKLKG-RKTTLIAILLLSLTG 425

Query: 380 YFLALRIMGFLFLNEAINWLNI 401
             L      F   N    W N+
Sbjct: 426 IGL---FFLFGLYNP--EWTNL 442


>gnl|CDD|225546 COG3000, ERG3, Sterol desaturase [Lipid metabolism].
          Length = 271

 Score = 34.0 bits (78), Expect = 0.27
 Identities = 27/114 (23%), Positives = 43/114 (37%), Gaps = 13/114 (11%)

Query: 286 LDWIPIFAHSGLALSIGLGACLHASFLYWYLRHKRIYIPCAGWGV----------FFIRL 335
           L   P+     LA+++ L A     FL       +       W                 
Sbjct: 5   LWPAPLDFVLALAIALFLLAFGLHLFLLRLYLFPQE-PLRKAWPTRDWLLDLVLLTLSIS 63

Query: 336 VIALLLLVIVALW--GNSYFNWLGMQAHPIFRIVALLLILLFCGITYFLALRIM 387
           V+ L+LL+ + LW     Y   LG+   P+   + LLL  LF  + Y+ A R++
Sbjct: 64  VVGLVLLLALILWALLGGYLPGLGVAPGPLPFALQLLLAFLFLDLGYYWAHRLL 117


>gnl|CDD|220569 pfam10101, DUF2339, Predicted membrane protein (DUF2339).  This
           domain, found in various hypothetical bacterial
           proteins, has no known function.
          Length = 738

 Score = 33.9 bits (78), Expect = 0.40
 Identities = 23/109 (21%), Positives = 33/109 (30%), Gaps = 27/109 (24%)

Query: 282 YSAILDWIPIFAHSGLALSIGLGACLHASFLYWYLRHKRI-YIPCA----GWGVFFIRLV 336
           Y+     +       L    GL   L A      LR +R  Y P A    G    ++ + 
Sbjct: 24  YAIEAGLLGPELRVALGALAGLA--LLA--AGERLRRRRYAYFPAALQGGGLATLYLTVF 79

Query: 337 IALLLLVIVALWGNSYFNWLGMQAHPIFRIVALLLILLFCGITYFLALR 385
            A  L  ++                     VA  L++L    T  LAL 
Sbjct: 80  AAFHLYGLLPPT------------------VAFALLVLVTAGTVVLALL 110



 Score = 32.3 bits (74), Expect = 1.3
 Identities = 22/109 (20%), Positives = 37/109 (33%), Gaps = 7/109 (6%)

Query: 291 IFAHSGLALSIGLGACLHASFLYWYLRHKRIYIPCAGWGVFFIRLVIALLLLVIVALWGN 350
           +     LAL++       A +   Y     +       G+  +  ++ALLLL    L G 
Sbjct: 526 LLTLLALALALVYLYRRRARWPLVYRAAAYLL-----LGLAPLAALLALLLLENPLLSGI 580

Query: 351 SYFNWLGMQAHPIFRIVALLLILLFCGITYFLALRIMGFLFLNEAINWL 399
                LG    PI   + L  +LL   +      R+   + L      +
Sbjct: 581 GGAAPLG--FLPILNPLLLAYLLLAPALLALWLRRLATPIRLKALRPAV 627


>gnl|CDD|240533 cd13128, MATE_Wzx_like, Wzx, a subfamily of the multidrug and toxic
           compound extrusion (MATE)-like proteins.  Escherichia
           coli Wzx and related proteins from other gram-negative
           bacteria are thought to act as flippases, assisting in
           the membrane translocation of lipopolysaccharides
           including those containing O-antigens. Proteins from the
           MATE family are involved in exporting metabolites across
           the cell membrane and are often responsible for
           multidrug resistance (MDR).
          Length = 402

 Score = 33.7 bits (78), Expect = 0.49
 Identities = 48/309 (15%), Positives = 102/309 (33%), Gaps = 50/309 (16%)

Query: 3   LHKTFITISSITLFSRITGLFREILFARVFGASIYTDAFNIAFRIPNLLRRLFAEGAFSQ 62
              +        + S++ G    +  AR  G   +    ++A     L   +FA+     
Sbjct: 2   AKNSGWLFVG-NIISKLLGFLVRVYLARYLGPEGF-GILSLALAFVGLFS-IFADLGLPT 58

Query: 63  AFVPILTEYKNKRGNIATKKLIDHVATVLIWFMFFICVFGIIMAPIIVYLVATGFIDNYK 122
           A V  +  Y+ ++              ++   +    + GI+   ++   +    I  Y 
Sbjct: 59  ALVREIARYRKEKIR-----------EIISTSLVLKLILGILALLLL--FLFAFLIAFYD 105

Query: 123 IFNITVSLTRIMFPYVGFMAFITLASSILNIWSQFKIPAFTPILLNISCGALQII----- 177
                V L  I+   + F A  +L   I   + + K      I+ ++    L +I     
Sbjct: 106 --PELVLLLYILALSLPFSALNSLFRGIFQGFEKMKYIVIARIIESVLSLILALILVLLG 163

Query: 178 ---------------IQIPSLIKIGMFPHIKLNPSHGFKNIAVRRILKKMGPSVFSVFAA 222
                          +    L+ I     I L     F     + +L+   P    +  +
Sbjct: 164 GGLIGFALAYLIASVLSAILLLYI-YRKKILLPKFFSFDLKLAKELLRFSIP----LALS 218

Query: 223 QISLMLNTNIASQM-----REGSLSCLSYADRLMEFPTTLLGVTFNTILLPNLSKARIEN 277
            I  ++ T I + M      + ++   + A R+ E    +     +T L P +S+   +N
Sbjct: 219 GIFSLIYTRIDTIMLGYFLGDAAVGIYNAAYRIAELLLFIPSA-ISTALFPRISELY-KN 276

Query: 278 NTEEYSAIL 286
           + +E   +L
Sbjct: 277 DKDELKKLL 285


>gnl|CDD|218473 pfam05159, Capsule_synth, Capsule polysaccharide biosynthesis
           protein.  This family includes export proteins involved
           in capsule polysaccharide biosynthesis, such as KpsS and
           LipB.
          Length = 268

 Score = 32.7 bits (75), Expect = 0.65
 Identities = 28/124 (22%), Positives = 45/124 (36%), Gaps = 15/124 (12%)

Query: 694 VIVTGVTASLDLIKTAVDMNADA--ILVHHGYFWKGENSNIVGIKKKRLEQLIINKINLY 751
           V+V G       I  A         I +  G+      S  +G        L+++KI +Y
Sbjct: 3   VVVWGDKRPYHRIAAAAAKKKGIPVIRLEDGFL----RSLGLGSNGYPPYSLVLDKIGIY 58

Query: 752 AY-HLPLDM-----HPKLGNNAQLA---KILNFSCTRRFSKNNIGWIGKIINLKRYNFKK 802
               LP D+        + + A LA    ++N   T + SK N+        L     KK
Sbjct: 59  YDSSLPSDLEQLLSEKSVLDQAGLARAEYLMNRIVTHKLSKYNLRRAAPAFRLLLAKDKK 118

Query: 803 IITI 806
           I+ +
Sbjct: 119 IVLV 122


>gnl|CDD|148050 pfam06212, GRIM-19, GRIM-19 protein.  This family consists of
           several eukaryotic gene associated with
           retinoic-interferon-induced mortality 19 (GRIM-19)
           proteins. GRIM-19, was reported to encode a small
           protein primarily distributed in the nucleus and was
           able to promote cell death induced by IFN-# and RA. A
           bovine homologue of GRIM-19 was co-purified with
           mitochondrial NADH:ubiquinone oxidoreductase (complex I)
           in bovine heart. Therefore, its exact cellular
           localisation and function are unclear. It has now been
           discovered that GRIM-19 is a specific interacting
           protein which negatively regulates Stat3 activity.
          Length = 130

 Score = 31.2 bits (71), Expect = 0.78
 Identities = 10/33 (30%), Positives = 15/33 (45%)

Query: 196 PSHGFKNIAVRRILKKMGPSVFSVFAAQISLML 228
           P  G+  I  +R L K G S +++FA       
Sbjct: 11  PPGGYGPIPFKRNLPKTGFSGYAMFATGAGATA 43


>gnl|CDD|223885 COG0815, Lnt, Apolipoprotein N-acyltransferase [Cell envelope
           biogenesis, outer membrane].
          Length = 518

 Score = 32.4 bits (74), Expect = 1.1
 Identities = 27/125 (21%), Positives = 41/125 (32%), Gaps = 28/125 (22%)

Query: 284 AILDWIPIFAHSGLALSIGLGACLHASFLYWYLRHKRIYIPCAGWGVFFIRLVIALLLLV 343
               W+       L L++  GA    +F            P   W       +  LL LV
Sbjct: 5   LARQWVLGRPARRLLLALVFGALFALAF------------PPPDWWWLAWFSLAPLLWLV 52

Query: 344 IVA------------LWGNSYF----NWLGMQAHPIFRIVALLLILLFCGITYFLALRIM 387
             A            L+G  +F     WLG        ++A+ L LL   +  +LAL ++
Sbjct: 53  RGAPTSWEGLAKSGFLFGFGFFLAGFYWLGTSLGVGLGLLAVALPLLVLLLAAWLALFLL 112

Query: 388 GFLFL 392
               L
Sbjct: 113 LVAVL 117


>gnl|CDD|185281 PRK15383, PRK15383, type III secretion system protein; Provisional.
          Length = 335

 Score = 31.9 bits (72), Expect = 1.3
 Identities = 22/69 (31%), Positives = 36/69 (52%), Gaps = 5/69 (7%)

Query: 391 FLNEAINWLNIENERINGIYIDATFGQGGHSCKILERLGKKGRLIAIDKDTESVSLGNKI 450
           +LN  IN   IE ERI GI++D  F +G    K L +L +  R I +  +++   L   +
Sbjct: 75  YLNNIINAAIIEKERIIGIFVDGDFSKGQR--KALGKLEQNYRNIKVIYNSD---LNYSM 129

Query: 451 TDSRFSIIH 459
            D + + I+
Sbjct: 130 YDKKLTTIY 138


>gnl|CDD|225127 COG2217, ZntA, Cation transport ATPase [Inorganic ion transport and
           metabolism].
          Length = 713

 Score = 31.8 bits (73), Expect = 1.9
 Identities = 13/53 (24%), Positives = 21/53 (39%), Gaps = 2/53 (3%)

Query: 331 FFIRLVIALLLLVIVALWGNSYFNWLGMQAHPIFRIVALLLILLFCGITYFLA 383
              RL+IA LL + + L   S    LG    P    +    +L + G  ++  
Sbjct: 87  LLRRLIIAGLLTLPLLLL--SLGLLLGAFLLPWVSFLLATPVLFYGGWPFYRG 137


>gnl|CDD|179400 PRK02261, PRK02261, methylaspartate mutase subunit S; Provisional.
          Length = 137

 Score = 29.9 bits (68), Expect = 2.9
 Identities = 13/28 (46%), Positives = 17/28 (60%), Gaps = 1/28 (3%)

Query: 693 NVIVTGVTASL-DLIKTAVDMNADAILV 719
            VI  GV  S  + I  A++ +ADAILV
Sbjct: 33  EVINLGVMTSQEEFIDAAIETDADAILV 60


>gnl|CDD|237526 PRK13837, PRK13837, two-component VirA-like sensor kinase;
           Provisional.
          Length = 828

 Score = 31.2 bits (71), Expect = 3.0
 Identities = 11/55 (20%), Positives = 18/55 (32%)

Query: 388 GFLFLNEAINWLNIENERINGIYIDATFGQGGHSCKILERLGKKGRLIAIDKDTE 442
           GF  L  AI W++   ER + + +D        +   L        +I       
Sbjct: 726 GFSTLAAAIAWISKGPERFDLVLVDDRLLDEEQAAAALHAAAPTLPIILGGNSKT 780


>gnl|CDD|220473 pfam09925, DUF2157, Predicted membrane protein (DUF2157).  This
           domain, found in various hypothetical prokaryotic
           proteins, has no known function.
          Length = 144

 Score = 29.9 bits (68), Expect = 3.0
 Identities = 27/107 (25%), Positives = 35/107 (32%), Gaps = 22/107 (20%)

Query: 273 ARIENNTEEYSAILDWIPIFAHSGLALSIGLGACLHASFLYWYLRHKRIYIPCAGWGVF- 331
             I     E  A    + I    G AL +G G      F+               W    
Sbjct: 16  EAILARYGEDPARSSLVRILLWLG-ALLLGAGV---IFFVAAN------------WEEIP 59

Query: 332 -FIRLVIALLLLVIVALWGNSYFNWLGMQAHPIFRIVALLLILLFCG 377
            F++  + L LLV +AL G     WL  +  P      LLL  L  G
Sbjct: 60  RFVKFGLLLALLVGLALGGF----WLWRRGKPRLGEALLLLGALLFG 102


>gnl|CDD|240529 cd13124, MATE_SpoVB_like, Stage V sporulation protein B, also known
           as Stage III sporulation protein F, and related
           proteins.  The integral membrane protein SpoVB has been
           implicated in the biosynthesis of the peptidoglycan
           component of the spore cortex in Bacillus subtilis. This
           model represents a subfamily of the multidrug and toxic
           compound extrusion (MATE)-like proteins. Proteins from
           the MATE family are involved in exporting metabolites
           across the cell membrane and are often responsible for
           multidrug resistance (MDR).
          Length = 434

 Score = 30.9 bits (71), Expect = 3.1
 Identities = 22/131 (16%), Positives = 50/131 (38%), Gaps = 14/131 (10%)

Query: 59  AFSQAFVPILTEYKNKRGNIATKKLIDHVATVLIWFMFFICV-FGIIMAPIIVYLVATGF 117
           A S + VP ++E   K      ++ I+    + +       V   ++  PI   L     
Sbjct: 289 ALSTSLVPAISEAYAKGDKKELRRRINQALRLTLLIGLPAAVGLAVLAEPINTLL----- 343

Query: 118 IDNYKIFNITVSLTRIMFPYVGFMAFITLASSILNIWSQFKIPAFTPILLNISCGALQII 177
              +      + L +I+ P + F++   + ++IL    + KIP        +    ++I+
Sbjct: 344 ---FGDSEGGLIL-QILAPSILFLSLQQVTAAILQGLGKVKIPVIN----LLIGLLVKIV 395

Query: 178 IQIPSLIKIGM 188
           +    +   G+
Sbjct: 396 LNYVLIPIFGI 406


>gnl|CDD|237913 PRK15119, PRK15119, undecaprenyl-phosphate
           alpha-N-acetylglucosaminyl 1-phosphate transferase;
           Provisional.
          Length = 365

 Score = 30.8 bits (70), Expect = 3.6
 Identities = 20/81 (24%), Positives = 33/81 (40%), Gaps = 17/81 (20%)

Query: 294 HSGL-----ALSIGLGACLHASFLYWYLRHKRIYIPCAGWGVFF------------IRLV 336
           H GL      +S+  G C       +Y+ H  +Y+ CAG  VF             IR  
Sbjct: 43  HQGLIPLVGGISVYAGICFTFGITDYYIPHASLYLACAGVLVFVGALDDRFDISVKIRAT 102

Query: 337 IALLLLVIVALWGNSYFNWLG 357
           I   + +++ ++   Y + LG
Sbjct: 103 IQAAVGIVMMVFAGLYLSSLG 123


>gnl|CDD|223608 COG0534, NorM, Na+-driven multidrug efflux pump [Defense
           mechanisms].
          Length = 455

 Score = 30.7 bits (70), Expect = 4.2
 Identities = 41/349 (11%), Positives = 94/349 (26%), Gaps = 59/349 (16%)

Query: 81  KKLIDHVATVLIWFMFFICVFGIIMA-----PIIVYLVATGFIDNYKIFNITVSLTRIMF 135
           +K    V    +     + +   I+      P++  L A       ++  +     RI+ 
Sbjct: 89  RKKAKRVLGQGLLLALLLGLLLAILLLFFAEPLLRLLGAPA-----EVLELAAEYLRIIL 143

Query: 136 PYVGFMAFITLASSILNIWSQFKIPAFTPILLNISCGAL--------------------- 174
               F     + S IL      K P +  +L N+    L                     
Sbjct: 144 LGAPFALLSFVLSGILRGLGDTKTPMYILLLGNLLNIVLNYLLIFGLFGGLGVAGAALAT 203

Query: 175 ------------QIIIQIPSLIKIGMFPHIKLNPSHGFKNIAVRRILKKMGPSVFSVFAA 222
                         +++   L+ +     +K +         ++ IL+   P      + 
Sbjct: 204 VIARWIGALLLLIYLLRKKRLLSLFKKKLLKPDRK------LLKEILRLGLPIFLESLSE 257

Query: 223 QISLMLNTNIASQMREGSLSCLSYADRLMEFPTTLLGVTFNTILLPNLSKARIENNTEEY 282
            +  +L T   +++   +L+    A R+  F   +        +   + +     N +  
Sbjct: 258 SLGFLLLTLFVARLGTVALAAYGIALRIASF-IFMPPFGIAQAVTILVGQNLGAGNYKRA 316

Query: 283 SAILDWIPIFAHSGLALSIGLGACLHASFL-YWYLRHKRIYIPCAGWGVFFIRLVIALLL 341
                 + +     +AL I L   L    +   +     +        V  + +      
Sbjct: 317 RRAA-RLALKLSLLIALLIALLLLLFREPIISLFTTDPEVIA----LAVILLLIAALFQP 371

Query: 342 LVIVALWGNSYFNWLGMQAHPIF-RIVALLLILLFCGITYFLALRIMGF 389
              +    +      G    P    +++     L     Y L    +G 
Sbjct: 372 FDGIQFVLSGVLRGAGDAKIPFIISLLSYWGFRLPLA--YLLGFFFLGL 418


>gnl|CDD|163613 cd00144, MPP_PPP_family, phosphoprotein phosphatases of the
           metallophosphatase superfamily, metallophosphatase
           domain.  The PPP (phosphoprotein phosphatase) family is
           one of two known protein phosphatase families specific
           for serine and threonine.  This family includes: PP1,
           PP2A, PP2B (calcineurin), PP4, PP5, PP6, PP7, Bsu1,
           RdgC, PrpE, PrpA/PrpB, and ApA4 hydrolase. The PPP
           catalytic domain is defined by three conserved motifs
           (-GDXHG-, -GDXVDRG- and -GNHE-).  The PPP enzyme family
           is ancient with members found in all eukaryotes, and in
           most bacterial and archeal genomes.  Dephosphorylation
           of phosphoserines and phosphothreonines on target
           proteins plays a central role in the regulation of many
           cellular processes.  PPPs belong to the
           metallophosphatase (MPP) superfamily.  MPPs are
           functionally diverse, but all share a conserved domain
           with an active site consisting of two metal ions
           (usually manganese, iron, or zinc) coordinated with
           octahedral geometry by a cage of histidine, aspartate,
           and asparagine residues. The MPP superfamily includes:
           Mre11/SbcD-like exonucleases, Dbr1-like RNA lariat
           debranching enzymes, YfcE-like phosphodiesterases,
           purple acid phosphatases (PAPs), YbbF-like
           UDP-2,3-diacylglucosamine hydrolases, and acid
           sphingomyelinases (ASMases).  The conserved domain is a
           double beta-sheet sandwich with a di-metal active site
           made up of residues located at the C-terminal side of
           the sheets.  This domain is thought to allow for
           productive metal coordination.
          Length = 225

 Score = 30.1 bits (68), Expect = 4.7
 Identities = 10/28 (35%), Positives = 15/28 (53%), Gaps = 1/28 (3%)

Query: 458 IHNCFTELDIILKKYNIKKIDGILFDLG 485
           IH C  +L  +L+K      D ++F LG
Sbjct: 6   IHGCLDDLLRLLEKIGFPPNDKLIF-LG 32


>gnl|CDD|219117 pfam06638, Strabismus, Strabismus protein.  This family consists of
           several strabismus (STB) or Van Gogh-like (VANGL)
           proteins 1 and 2. The exact function of this family is
           unknown. It is thought, however that STB1 gene and STB2
           may be potent tumour suppressor gene candidates.
          Length = 507

 Score = 30.5 bits (69), Expect = 5.0
 Identities = 30/109 (27%), Positives = 46/109 (42%), Gaps = 18/109 (16%)

Query: 291 IFAHSGLALSIGLGACLHA-SF-----LYWYLRHKRIYIPCAGWGVFFIRLVIALLLLVI 344
            +    L   +GL A +   +F     + W  R +     C G     I L   LL+L+I
Sbjct: 92  RYLGLALGGLLGLLAFVTPLAFIVLPQVGWRDRLEPCGTACEG---LLISLAFKLLILLI 148

Query: 345 VALWGNSYFNWLGMQAHP---IFRIVALLLILLFCGITY--FLALRIMG 388
            A W  + F        P   +FR + L+L+ LF   +Y  F  +RI+G
Sbjct: 149 -ASW--ALFLRPPRADMPRIFVFRALLLVLVFLFL-FSYWLFYGVRILG 193


>gnl|CDD|240539 cd13134, MATE_like_8, Uncharacterized subfamily of the multidrug
           and toxic compound extrusion (MATE) proteins.  The
           integral membrane proteins from the MATE family are
           involved in exporting metabolites across the cell
           membrane and are responsible for multidrug resistance
           (MDR) in many bacteria and animals. A number of family
           members are involved in the synthesis of peptidoglycan
           components in bacteria.
          Length = 438

 Score = 30.2 bits (69), Expect = 5.0
 Identities = 17/80 (21%), Positives = 33/80 (41%), Gaps = 10/80 (12%)

Query: 332 FIRLVIALLLLVIVALWGNSYFNWLGMQAHPI------FRIVALLLILLFCGITYFLALR 385
              L++ L+  +++  +G      +G+    +       RIV   L L    +T    LR
Sbjct: 87  LFNLLLGLVFSLLLLFFGAELLRLMGLPPELLALGSTYLRIVGGSLFLQALSLTLSAILR 146

Query: 386 IMGF----LFLNEAINWLNI 401
             G     ++++  +N LNI
Sbjct: 147 SHGRTKIPMYVSLIMNILNI 166


>gnl|CDD|172166 PRK13603, PRK13603, fumarate reductase subunit C; Provisional.
          Length = 126

 Score = 28.9 bits (64), Expect = 5.4
 Identities = 26/80 (32%), Positives = 37/80 (46%), Gaps = 17/80 (21%)

Query: 313 YW------YLRHKRIYIPC--AGWGVFFIRLVIALLLLVIVALWGNSYFNWLGMQAHP-- 362
           YW      YLR     I C    W V ++     +L+L  V   GNSY  +L   A+P  
Sbjct: 11  YWWARRRSYLRFMLREISCIFVAWFVLYL-----VLVLRAVGAGGNSYQRFLDFSANPVV 65

Query: 363 -IFRIVALLLILLFCGITYF 381
            +  +VAL  +LL   +T+F
Sbjct: 66  VVLNVVALSFLLLH-AVTWF 84


>gnl|CDD|224880 COG1969, HyaC, Ni,Fe-hydrogenase I cytochrome b subunit [Energy
           production and conversion].
          Length = 227

 Score = 29.7 bits (67), Expect = 5.8
 Identities = 20/80 (25%), Positives = 27/80 (33%), Gaps = 17/80 (21%)

Query: 319 KRIYIPCAGWGVFFIRLVIALLLLVIVALWGNSY--------------FNWLG---MQAH 361
           K  + P A    FF  L+I  ++L   AL+G  Y              F W G      H
Sbjct: 124 KGGHNPIAQVAYFFYFLMIVFMILTGFALYGEHYHYGLGAPLRYVFEWFEWTGGGSADVH 183

Query: 362 PIFRIVALLLILLFCGITYF 381
              R+   L+I       Y 
Sbjct: 184 FWHRLGMWLIIAFVIVHVYM 203


>gnl|CDD|234525 TIGR04259, oxa_formateAnti, oxalate/formate antiporter.  This model
           represents a subgroup of the more broadly defined model
           TIGR00890, which in turn belongs to the Major
           Facilitator transporter family. Seed members for this
           family include the known oxalate/formate antiporter of
           Oxalobacter formigenes, as well as transporter subunits
           co-clustered with the two genes of a system that
           decarboxylates oxalate into formate. In many of these
           cassettes, two subunits are found rather than one,
           suggesting the antiporter is sometimes homodimeric,
           sometimes heterodimeric.
          Length = 405

 Score = 30.1 bits (68), Expect = 5.9
 Identities = 18/72 (25%), Positives = 34/72 (47%), Gaps = 10/72 (13%)

Query: 91  LIWFMFFICVFGIIMAPIIVYLVATGFIDNYKIFNITVSLTRIMFPYVGFMAFITLASSI 150
           +++ MF +   G +MA   +  +A     ++ + ++ V+L  +       MA +T A SI
Sbjct: 211 VMYTMFVMVAAGGLMATAQLGPIA----KDFGVADVPVTLLGLT------MAALTFALSI 260

Query: 151 LNIWSQFKIPAF 162
             I + F  P F
Sbjct: 261 DRILNGFTRPFF 272


>gnl|CDD|190033 pfam01554, MatE, MatE.  The MatE domain.
          Length = 161

 Score = 29.1 bits (66), Expect = 6.0
 Identities = 13/98 (13%), Positives = 33/98 (33%), Gaps = 4/98 (4%)

Query: 81  KKLIDHVATVLIWFMFFICVFGIIMAPIIVYLVATGFIDNYKIFNITVSLTRIMFPYVGF 140
            K         +     I +   ++  +    + + F  + ++  +     RI+   + F
Sbjct: 65  FKRAGVALRQGLILSLLISLPIALLIILFREPILSLFTQDEEVLELASKYLRILALGIPF 124

Query: 141 MAFITLASSILNIWSQFKIPAFTPILLNISCGALQIII 178
           ++   + S  L      K P +    +N+    L I +
Sbjct: 125 LSLQFVLSGFLRGAGDTKPPLY----INLISLLLNIPL 158


>gnl|CDD|133453 cd05311, NAD_bind_2_malic_enz, NAD(P) binding domain of malic
           enzyme (ME), subgroup 2.  Malic enzyme (ME), a member of
           the amino acid dehydrogenase (DH)-like domain family,
           catalyzes the oxidative decarboxylation of L-malate to
           pyruvate in the presence of cations (typically  Mg++ or
           Mn++) with the concomitant reduction of cofactor NAD+ or
           NADP+.  ME has been found in all organisms, and plays
           important roles in diverse metabolic pathways such as
           photosynthesis and lipogenesis. This enzyme generally
           forms homotetramers. The conversion of malate to
           pyruvate by ME typically involves oxidation of malate to
           produce oxaloacetate, followed by decarboxylation of
           oxaloacetate to produce pyruvate and CO2.  This
           subfamily consists primarily of archaeal and bacterial
           ME.  Amino acid DH-like NAD(P)-binding domains are
           members of the Rossmann fold superfamily and include
           glutamate, leucine, and phenylalanine DHs, methylene
           tetrahydrofolate DH, methylene-tetrahydromethanopterin
           DH, methylene-tetrahydropholate DH/cyclohydrolase,
           Shikimate DH-like proteins, malate oxidoreductases, and
           glutamyl tRNA reductase. Amino acid DHs catalyze the
           deamination of amino acids to keto acids with NAD(P)+ as
           a cofactor. The NAD(P)-binding Rossmann fold superfamily
           includes a wide variety of protein families including
           NAD(P)- binding domains of alcohol DHs,
           tyrosine-dependent oxidoreductases,
           glyceraldehyde-3-phosphate DH, lactate/malate DHs,
           formate/glycerate DHs, siroheme synthases,
           6-phosphogluconate DH, amino acid DHs, repressor rex,
           NAD-binding potassium channel  domain, CoA-binding, and
           ornithine cyclodeaminase-like domains. These domains
           have an alpha-beta-alpha configuration. NAD binding
           involves numerous hydrogen and van der Waals contacts.
          Length = 226

 Score = 29.5 bits (67), Expect = 6.7
 Identities = 21/69 (30%), Positives = 31/69 (44%), Gaps = 10/69 (14%)

Query: 489 NQINNELRGFSFLLDGPLDMRM-DITRG--ISASKWLANATEFNIKKVIQDYGEERFAKK 545
           NQ+NN L GF  +  G LD+R   IT    ++A++ +A+  E       +  GEE     
Sbjct: 156 NQVNNVL-GFPGIFRGALDVRATKITEEMKLAAAEAIADLAE------EEVLGEEYIIPT 208

Query: 546 IAKEIVHYR 554
                V  R
Sbjct: 209 PFDPRVVPR 217


>gnl|CDD|217153 pfam02628, COX15-CtaA, Cytochrome oxidase assembly protein.  This
           is a family of integral membrane proteins. CtaA is
           required for cytochrome aa3 oxidase assembly in Bacillus
           subtilis. COX15 is required for cytochrome c oxidase
           assembly in yeast.
          Length = 301

 Score = 29.5 bits (67), Expect = 6.9
 Identities = 21/121 (17%), Positives = 29/121 (23%), Gaps = 29/121 (23%)

Query: 293 AHSGLALSIGLGACLHASFLYWYLRHKRIYIPCAGWGVFFIRLVIALLLLVIV------- 345
           AH  LA  + L   L  +                        L + LLLL I        
Sbjct: 133 AHLLLAFLL-LALLLWLALRLREPDPAPAPAAGPRRLRGLALLALVLLLLQIALGALVAG 191

Query: 346 --------------------ALWGNSYFNWLGMQ-AHPIFRIVALLLILLFCGITYFLAL 384
                                 W N + N   +Q  H +   +  LL+L           
Sbjct: 192 LDAGLACNTFPLMGPDFPAEPDWRNFFENPATVQFLHRLGAYLTFLLLLALAVAAARRRR 251

Query: 385 R 385
           R
Sbjct: 252 R 252


>gnl|CDD|234462 TIGR04086, TIGR04086_membr, putative membrane protein, TIGR04086
           family.  Members of this family of strongly hydrophobic
           putative transmembrane protein average about 125 amino
           acids in length and occur mostly, but not exclusively,
           in the Firmicutes. Members are quite diverse in
           sequence. The function is unknown.
          Length = 115

 Score = 28.2 bits (64), Expect = 7.1
 Identities = 19/94 (20%), Positives = 32/94 (34%), Gaps = 22/94 (23%)

Query: 327 GWGVFFIRLVIALLLLVIVALWGNSYFNWLGMQAHPIFRIVALLLILLFCGIT------- 379
             G+     ++ L+LL+I+AL    YF  L   +     ++  +L +   G         
Sbjct: 4   LKGLLVA-YILTLILLLILALL--LYFTDLSEGSLSWLILIIYILSVFIGGFIAGKKAGS 60

Query: 380 ------------YFLALRIMGFLFLNEAINWLNI 401
                       YFL L ++  LF       L  
Sbjct: 61  KGWLWGLLVGLLYFLILLLISLLFGPGPGFTLGQ 94


>gnl|CDD|215222 PLN02397, PLN02397, aspartate transaminase.
          Length = 423

 Score = 29.9 bits (68), Expect = 7.3
 Identities = 14/33 (42%), Positives = 16/33 (48%), Gaps = 2/33 (6%)

Query: 298 ALSIGLGAC-LHASFLYWYLRHKRIYIPCAGWG 329
            LS G G+  L A FL  +     IYIP   WG
Sbjct: 122 CLS-GTGSLRLGAEFLARFYPGSTIYIPNPTWG 153


>gnl|CDD|221252 pfam11833, DUF3353, Protein of unknown function (DUF3353).  This
           family of proteins are functionally uncharacterized.
           This protein is found in bacteria and eukaryotes.
           Proteins in this family are typically between 205 to 258
           amino acids in length.
          Length = 193

 Score = 28.8 bits (65), Expect = 8.6
 Identities = 25/85 (29%), Positives = 37/85 (43%), Gaps = 14/85 (16%)

Query: 296 GLALSIGLGACLHASFLYWYLRHKRIYIPCAGWGVFFIRLVIALLL--LVIVALWGNSYF 353
            L L++G GAC     +Y+  R  R +     W +    LV+ LLL  L+ V L      
Sbjct: 117 QLQLALGTGAC-----IYFLNRKGRRFGRALLWSLGG--LVVGLLLGSLLAVLLPPFILP 169

Query: 354 NWLG---MQAHPIFRIVALLLILLF 375
             L    +Q+ P +  + L L  LF
Sbjct: 170 GSLSPEQIQSLPAY--LLLWLGSLF 192


>gnl|CDD|206279 pfam14110, DUF4282, Domain of unknown function (DUF4282).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria and eukaryotes.
           Proteins in this family are typically between 93 and 155
           amino acids in length. There is a single completely
           conserved residue E that may be functionally important.
          Length = 89

 Score = 27.5 bits (62), Expect = 8.9
 Identities = 11/56 (19%), Positives = 24/56 (42%)

Query: 332 FIRLVIALLLLVIVALWGNSYFNWLGMQAHPIFRIVALLLILLFCGITYFLALRIM 387
            I+++  L L++I     +  F  L   A+     +  LL+     +   L +R++
Sbjct: 13  IIKVLYWLGLILIGLFGLSGLFGGLATMAYSPGGGLLALLLGPLGFLLGLLLVRVV 68


>gnl|CDD|129428 TIGR00328, flhB, flagellar biosynthetic protein FlhB.  FlhB and its
           functionally equivalent orthologs, from among a larger
           superfamily of proteins involved in type III protein
           export systems, are specifically involved in flagellar
           protein export. The seed members are restricted and the
           trusted cutoff is set high such that the proteins
           gathered by this model play roles specifically related
           to flagellar structures. Full-length homologs scoring
           below the trusted cutoff are involved in peptide export
           but not necessarily in the creation of flagella
           [Cellular processes, Chemotaxis and motility].
          Length = 347

 Score = 29.5 bits (67), Expect = 9.1
 Identities = 26/159 (16%), Positives = 54/159 (33%), Gaps = 39/159 (24%)

Query: 74  KRGNIATKKLIDHVATVLIWFMFFICVFGIIMAPIIVYLVATGF-------IDNYK---- 122
           + GN+   + ++  A +L      +  FG  +A ++V L++          ++       
Sbjct: 16  EEGNVPKSREVNSAAVLLAGV-LSLLFFGDGLADLLVVLMSFFLSFETSDILNLGSLVGI 74

Query: 123 IFNITVSLTRIMFPYVGFMAFITLASSILNIWSQFKI-PAFTPILLNISCGALQIIIQIP 181
            F     +  ++ P    +  + + S+I     QF       P+    S           
Sbjct: 75  FFQSIREVLLLLLPIFVLLLVVGVLSNIA----QFGFLFTTKPLKPKFS----------- 119

Query: 182 SLIKIGMFPHIKLNPSHGFKNIAVRRILKKMGPSVFSVF 220
                      K+NP  G K +   + L ++  S+  VF
Sbjct: 120 -----------KINPIKGLKRLFSLQSLVELLKSLLKVF 147


>gnl|CDD|216554 pfam01529, zf-DHHC, DHHC palmitoyltransferase.  This family
           includes the well known DHHC zinc binding domain as well
           as three of the four conserved transmembrane regions
           found in this family of palmitoyltransferase enzymes.
          Length = 167

 Score = 28.6 bits (64), Expect = 9.4
 Identities = 10/60 (16%), Positives = 25/60 (41%), Gaps = 1/60 (1%)

Query: 92  IWFMFFICVFGIIMAPIIVYLVATGFIDNYKIFNITVSLTRIMFPYVGFMAFITLASSIL 151
           + F+ ++ ++ I++  +  Y +     +    F + +SL   +   +    F  L  S L
Sbjct: 88  LLFLLYLTLYLILLLVLSFYYLVYLIRNIELFFFLILSLFSSII-LLVLSLFFLLFLSFL 146


>gnl|CDD|222102 pfam13398, Peptidase_M50B, Peptidase M50B-like.  This is a family
           of bacterial and plant peptidases in the same family as
           MEROPS:M50B.
          Length = 201

 Score = 28.7 bits (65), Expect = 9.8
 Identities = 21/112 (18%), Positives = 35/112 (31%), Gaps = 35/112 (31%)

Query: 294 HSGLALSIG---------------LGACLHASFLYWYLRHKRIYIPCAGWGVFFIRLVIA 338
            SGL +S G               LG  L    L       R              L + 
Sbjct: 52  GSGLTVSRGRPGGLGMFLTLAAGYLGPALWGLLLILLAFAGRARA----------LLWLG 101

Query: 339 LLLLVIVALWGNSYFNWLGMQAHPIFRIVALLLILLFCGITYFLALRIMGFL 390
           + LL+++ LW     N  G        ++ LL   L   + +    +++ F+
Sbjct: 102 VALLLVLLLWVR---NLFG-------LLILLLTGALLLALWWLAPPQVLAFV 143


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.327    0.142    0.426 

Gapped
Lambda     K      H
   0.267   0.0765    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 47,399,606
Number of extensions: 4994844
Number of successful extensions: 8525
Number of sequences better than 10.0: 1
Number of HSP's gapped: 8334
Number of HSP's successfully gapped: 370
Length of query: 896
Length of database: 10,937,602
Length adjustment: 106
Effective length of query: 790
Effective length of database: 6,236,078
Effective search space: 4926501620
Effective search space used: 4926501620
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.7 bits)
S2: 64 (28.4 bits)