RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy12992
         (319 letters)



>gnl|CDD|116627 pfam08017, Fibrinogen_BP, Fibrinogen binding protein.  Proteins in
           this family bind to fibrinogen. Members of this family
           includes the fibrinogen receptor, FbsA, which mediates
           platelet aggregation.
          Length = 393

 Score = 41.8 bits (97), Expect = 3e-04
 Identities = 43/140 (30%), Positives = 69/140 (49%)

Query: 175 QRRKNDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQMG 234
           ++R  +N+SQ      + R  +NKSQ      + R  +NKSQ      + R ++N+SQ  
Sbjct: 162 RQRDAENRSQGNVLERRQRDVENKSQGNVLERRQRDVENKSQGNVLERRQRDAENRSQGN 221

Query: 235 SGLEQSRRSDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSDNK 294
               + R  +NKSQ      + R  +NKSQ      + R ++N+SQ      + R  +NK
Sbjct: 222 VLERRQRDVENKSQGNVLERRQRDVENKSQGNVLERRQRDAENRSQGNVLERRQRDVENK 281

Query: 295 SQMGSGLEQSRKSDNKSQMG 314
           SQ      + R ++NKSQ+G
Sbjct: 282 SQGNVLERRQRDAENKSQVG 301



 Score = 41.4 bits (96), Expect = 4e-04
 Identities = 39/141 (27%), Positives = 69/141 (48%), Gaps = 1/141 (0%)

Query: 173 LEQRRKNDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQ 232
           LE+R+++      G+ LE+ +R       G+ LE+ +R       G+ LE+ +R      
Sbjct: 95  LERRQRDVENKSQGNVLERRQRDVENKSQGNVLERRQRDAENRSQGNVLERRQRDAENKS 154

Query: 233 MGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRR-S 291
            G+ LE+ +R       G+ LE+ +R       G+ LE+ +R       G+ LE+ +R +
Sbjct: 155 QGNVLERRQRDAENRSQGNVLERRQRDVENKSQGNVLERRQRDVENKSQGNVLERRQRDA 214

Query: 292 DNKSQMGSGLEQSRKSDNKSQ 312
           +N+SQ      + R  +NKSQ
Sbjct: 215 ENRSQGNVLERRQRDVENKSQ 235



 Score = 40.6 bits (94), Expect = 7e-04
 Identities = 40/141 (28%), Positives = 70/141 (49%), Gaps = 1/141 (0%)

Query: 173 LEQRRKNDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQ 232
           LE+R+++      G+ LE+ +R       G+ LE+ +R       G+ LE+ +R      
Sbjct: 127 LERRQRDAENRSQGNVLERRQRDAENKSQGNVLERRQRDAENRSQGNVLERRQRDVENKS 186

Query: 233 MGSGLEQSRRSDNKSQMGSGLEQSRR-SDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRRS 291
            G+ LE+ +R       G+ LE+ +R ++N+SQ      + R  +NKSQ      + R  
Sbjct: 187 QGNVLERRQRDVENKSQGNVLERRQRDAENRSQGNVLERRQRDVENKSQGNVLERRQRDV 246

Query: 292 DNKSQMGSGLEQSRKSDNKSQ 312
           +NKSQ      + R ++N+SQ
Sbjct: 247 ENKSQGNVLERRQRDAENRSQ 267



 Score = 40.6 bits (94), Expect = 7e-04
 Identities = 39/141 (27%), Positives = 69/141 (48%), Gaps = 1/141 (0%)

Query: 173 LEQRRKNDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQ 232
           LE+R+++      G+ LE+ +R       G+ LE+ +R       G+ LE+ +R      
Sbjct: 79  LERRQRDAENRSQGNVLERRQRDVENKSQGNVLERRQRDVENKSQGNVLERRQRDAENRS 138

Query: 233 MGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRR-S 291
            G+ LE+ +R       G+ LE+ +R       G+ LE+ +R       G+ LE+ +R  
Sbjct: 139 QGNVLERRQRDAENKSQGNVLERRQRDAENRSQGNVLERRQRDVENKSQGNVLERRQRDV 198

Query: 292 DNKSQMGSGLEQSRKSDNKSQ 312
           +NKSQ      + R ++N+SQ
Sbjct: 199 ENKSQGNVLERRQRDAENRSQ 219



 Score = 40.2 bits (93), Expect = 9e-04
 Identities = 41/141 (29%), Positives = 70/141 (49%), Gaps = 1/141 (0%)

Query: 173 LEQRRKNDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQ 232
           LE+R+++      G+ LE+ +R       G+ LE+ +R       G+ LE+ +R      
Sbjct: 143 LERRQRDAENKSQGNVLERRQRDAENRSQGNVLERRQRDVENKSQGNVLERRQRDVENKS 202

Query: 233 MGSGLEQSRR-SDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRRS 291
            G+ LE+ +R ++N+SQ      + R  +NKSQ      + R  +NKSQ      + R +
Sbjct: 203 QGNVLERRQRDAENRSQGNVLERRQRDVENKSQGNVLERRQRDVENKSQGNVLERRQRDA 262

Query: 292 DNKSQMGSGLEQSRKSDNKSQ 312
           +N+SQ      + R  +NKSQ
Sbjct: 263 ENRSQGNVLERRQRDVENKSQ 283



 Score = 40.2 bits (93), Expect = 0.001
 Identities = 39/141 (27%), Positives = 69/141 (48%), Gaps = 1/141 (0%)

Query: 173 LEQRRKNDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQ 232
           LE+R+++      G+ LE+ +R       G+ LE+ +R       G+ LE+ +R      
Sbjct: 47  LERRQRDAENRSQGNVLERRQRDAENRSQGNVLERRQRDAENRSQGNVLERRQRDVENKS 106

Query: 233 MGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRR-S 291
            G+ LE+ +R       G+ LE+ +R       G+ LE+ +R       G+ LE+ +R +
Sbjct: 107 QGNVLERRQRDVENKSQGNVLERRQRDAENRSQGNVLERRQRDAENKSQGNVLERRQRDA 166

Query: 292 DNKSQMGSGLEQSRKSDNKSQ 312
           +N+SQ      + R  +NKSQ
Sbjct: 167 ENRSQGNVLERRQRDVENKSQ 187



 Score = 40.2 bits (93), Expect = 0.001
 Identities = 39/141 (27%), Positives = 71/141 (50%), Gaps = 1/141 (0%)

Query: 173 LEQRRKNDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQ 232
           LE+R+++      G+ LE+ +R       G+ LE+ +R       G+ LE+ +R      
Sbjct: 31  LERRQRDAENRSQGNVLERRQRDAENRSQGNVLERRQRDAENRSQGNVLERRQRDAENRS 90

Query: 233 MGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRR-SDNKSQMGSGLEQSRRS 291
            G+ LE+ +R       G+ LE+ +R       G+ LE+ +R ++N+SQ      + R +
Sbjct: 91  QGNVLERRQRDVENKSQGNVLERRQRDVENKSQGNVLERRQRDAENRSQGNVLERRQRDA 150

Query: 292 DNKSQMGSGLEQSRKSDNKSQ 312
           +NKSQ      + R ++N+SQ
Sbjct: 151 ENKSQGNVLERRQRDAENRSQ 171



 Score = 39.8 bits (92), Expect = 0.001
 Identities = 41/139 (29%), Positives = 68/139 (48%)

Query: 174 EQRRKNDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQM 233
            ++R  +N+SQ      + R ++N+SQ      + R  +NKSQ      + R  +NKSQ 
Sbjct: 65  RRQRDAENRSQGNVLERRQRDAENRSQGNVLERRQRDVENKSQGNVLERRQRDVENKSQG 124

Query: 234 GSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSDN 293
                + R ++N+SQ      + R ++NKSQ      + R ++N+SQ      + R  +N
Sbjct: 125 NVLERRQRDAENRSQGNVLERRQRDAENKSQGNVLERRQRDAENRSQGNVLERRQRDVEN 184

Query: 294 KSQMGSGLEQSRKSDNKSQ 312
           KSQ      + R  +NKSQ
Sbjct: 185 KSQGNVLERRQRDVENKSQ 203



 Score = 39.8 bits (92), Expect = 0.001
 Identities = 40/141 (28%), Positives = 69/141 (48%), Gaps = 1/141 (0%)

Query: 173 LEQRRKNDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQ 232
           LE+R+++      G+ LE+ +R       G+ LE+ +R       G+ LE+ +R      
Sbjct: 111 LERRQRDVENKSQGNVLERRQRDAENRSQGNVLERRQRDAENKSQGNVLERRQRDAENRS 170

Query: 233 MGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRR-SDNKSQMGSGLEQSRRS 291
            G+ LE+ +R       G+ LE+ +R       G+ LE+ +R ++N+SQ      + R  
Sbjct: 171 QGNVLERRQRDVENKSQGNVLERRQRDVENKSQGNVLERRQRDAENRSQGNVLERRQRDV 230

Query: 292 DNKSQMGSGLEQSRKSDNKSQ 312
           +NKSQ      + R  +NKSQ
Sbjct: 231 ENKSQGNVLERRQRDVENKSQ 251



 Score = 37.5 bits (86), Expect = 0.007
 Identities = 38/132 (28%), Positives = 66/132 (50%)

Query: 181 NKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQMGSGLEQS 240
           N+SQ      + R ++N+SQ      + R ++N+SQ      + R ++N+SQ      + 
Sbjct: 24  NQSQGNVLERRQRDAENRSQGNVLERRQRDAENRSQGNVLERRQRDAENRSQGNVLERRQ 83

Query: 241 RRSDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQMGSG 300
           R ++N+SQ      + R  +NKSQ      + R  +NKSQ      + R ++N+SQ    
Sbjct: 84  RDAENRSQGNVLERRQRDVENKSQGNVLERRQRDVENKSQGNVLERRQRDAENRSQGNVL 143

Query: 301 LEQSRKSDNKSQ 312
             + R ++NKSQ
Sbjct: 144 ERRQRDAENKSQ 155



 Score = 37.1 bits (85), Expect = 0.009
 Identities = 68/292 (23%), Positives = 121/292 (41%), Gaps = 34/292 (11%)

Query: 7   GSGLEQRRKNNNKSQMGSGLEQSRKNNNKSQMGSGLEQSRRSDNKSQMGSGLEQRRKNNN 66
           G+ LE+R+++      G+ LE+ +++      G+ LE+ +R       G+ LE+R+++  
Sbjct: 44  GNVLERRQRDAENRSQGNVLERRQRDAENRSQGNVLERRQRDAENRSQGNVLERRQRDVE 103

Query: 67  KSQMGSDLEQREGKVITKVRWEVNLLGRAWQCNSPKQKWMWHKRDHSSSSSSLVLSIVHT 126
               G+ LE+R+  V  K +  V L  R     +  Q  +  +R   + + S    +   
Sbjct: 104 NKSQGNVLERRQRDVENKSQGNV-LERRQRDAENRSQGNVLERRQRDAENKSQGNVLERR 162

Query: 127 DIRVVIKSHSWVIRVNFLVFIFLMRRVFFLFIVIFLLLILLFFFIGLEQRRKNDNKSQMG 186
                 +S   V+                                   ++R  +NKSQ  
Sbjct: 163 QRDAENRSQGNVLE---------------------------------RRQRDVENKSQGN 189

Query: 187 SGLEQSRRSDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSDNK 246
               + R  +NKSQ      + R ++N+SQ      + R  +NKSQ      + R  +NK
Sbjct: 190 VLERRQRDVENKSQGNVLERRQRDAENRSQGNVLERRQRDVENKSQGNVLERRQRDVENK 249

Query: 247 SQMGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQMG 298
           SQ      + R ++N+SQ      + R  +NKSQ      + R ++NKSQ+G
Sbjct: 250 SQGNVLERRQRDAENRSQGNVLERRQRDVENKSQGNVLERRQRDAENKSQVG 301



 Score = 32.9 bits (74), Expect = 0.19
 Identities = 35/129 (27%), Positives = 60/129 (46%), Gaps = 1/129 (0%)

Query: 185 MGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSD 244
           +GS    +  S      G+ LE+ +R       G+ LE+ +R       G+ LE+ +R  
Sbjct: 11  LGSSPVSAMDSVGNQSQGNVLERRQRDAENRSQGNVLERRQRDAENRSQGNVLERRQRDA 70

Query: 245 NKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRR-SDNKSQMGSGLEQ 303
                G+ LE+ +R       G+ LE+ +R       G+ LE+ +R  +NKSQ      +
Sbjct: 71  ENRSQGNVLERRQRDAENRSQGNVLERRQRDVENKSQGNVLERRQRDVENKSQGNVLERR 130

Query: 304 SRKSDNKSQ 312
            R ++N+SQ
Sbjct: 131 QRDAENRSQ 139



 Score = 32.5 bits (73), Expect = 0.30
 Identities = 66/281 (23%), Positives = 118/281 (41%), Gaps = 35/281 (12%)

Query: 2   NKSQMGSGLEQRRKNNNKSQMGSGLEQSRKNNNKSQMGSGLEQSRRSDNKSQMGSGLEQR 61
           N+SQ G+ LE+R+++      G+ LE+ +++      G+ LE+ +R       G+ LE+R
Sbjct: 56  NRSQ-GNVLERRQRDAENRSQGNVLERRQRDAENRSQGNVLERRQRDVENKSQGNVLERR 114

Query: 62  RKNNNKSQMGSDLEQREGKVITKVRWEVNLLGRAWQCNSPKQKWMWHKRDHSSSSSSLVL 121
           +++      G+ LE+R+     + +  V L  R     +  Q  +  +R   + + S   
Sbjct: 115 QRDVENKSQGNVLERRQRDAENRSQGNV-LERRQRDAENKSQGNVLERRQRDAENRSQGN 173

Query: 122 SIVHTDIRVVIKSHSWVIRVNFLVFIFLMRRVFFLFIVIFLLLILLFFFIGLEQRRKNDN 181
            +      V  KS   V+                                   ++R  +N
Sbjct: 174 VLERRQRDVENKSQGNVLE---------------------------------RRQRDVEN 200

Query: 182 KSQMGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQMGSGLEQSR 241
           KSQ      + R ++N+SQ      + R  +NKSQ      + R  +NKSQ      + R
Sbjct: 201 KSQGNVLERRQRDAENRSQGNVLERRQRDVENKSQGNVLERRQRDVENKSQGNVLERRQR 260

Query: 242 RSDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQMG 282
            ++N+SQ      + R  +NKSQ      + R ++NKSQ+G
Sbjct: 261 DAENRSQGNVLERRQRDVENKSQGNVLERRQRDAENKSQVG 301


>gnl|CDD|165227 PHA02909, PHA02909, hypothetical protein; Provisional.
          Length = 72

 Score = 34.5 bits (79), Expect = 0.007
 Identities = 17/58 (29%), Positives = 35/58 (60%), Gaps = 7/58 (12%)

Query: 119 LVLSIVHTD-IRVVIKSHSWVIRVNFLVFIFLMRRVF------FLFIVIFLLLILLFF 169
           L+LS+ + +  +V    +++ I V+F++F+ +   +F      +++I I + LILLFF
Sbjct: 13  LMLSVDYGNGKKVYYTENTFCIMVSFILFVIIFLSMFTILACSYVYIAIIISLILLFF 70


>gnl|CDD|192859 pfam11872, DUF3392, Protein of unknown function (DUF3392).  This
           family of proteins are functionally uncharacterized.
           This protein is found in bacteria. Proteins in this
           family are about 110 amino acids in length.
          Length = 106

 Score = 34.5 bits (80), Expect = 0.013
 Identities = 13/69 (18%), Positives = 30/69 (43%), Gaps = 22/69 (31%)

Query: 128 IRVVIKSHSWVIRVNFLVFIFLMRRVFFLFIVIFL------------------LLILLFF 169
           +R ++    +++R   L FI L+    +  ++++L                  +++ +F 
Sbjct: 41  LRRLLSGRHFLLRT--LAFI-LVCAFGYGLLIVWLSPWLARGLASIPNYWLAPVVVAVFI 97

Query: 170 FIG-LEQRR 177
            IG L QR+
Sbjct: 98  LIGVLAQRK 106


>gnl|CDD|223674 COG0601, DppB, ABC-type dipeptide/oligopeptide/nickel transport
           systems, permease components [Amino acid transport and
           metabolism / Inorganic ion transport and metabolism].
          Length = 317

 Score = 36.3 bits (85), Expect = 0.014
 Identities = 6/28 (21%), Positives = 15/28 (53%)

Query: 146 FIFLMRRVFFLFIVIFLLLILLFFFIGL 173
             ++++R+      +F++  L FF + L
Sbjct: 2   LKYIIKRLLQAIPTLFIVSTLTFFLLRL 29


>gnl|CDD|216682 pfam01757, Acyl_transf_3, Acyltransferase family.  This family
           includes a range of acyltransferase enzymes. This domain
           is found in many as yet uncharacterized C. elegans
           proteins and it is approximately 300 amino acids long.
          Length = 326

 Score = 36.0 bits (83), Expect = 0.018
 Identities = 13/74 (17%), Positives = 29/74 (39%), Gaps = 6/74 (8%)

Query: 106 MWHKRDHSSSSSSLVLSIVHTDIRVVIKSHSWVIRVNFLVFIF------LMRRVFFLFIV 159
                  S     L+L ++          H W +   F+ ++       L+R++  L ++
Sbjct: 95  GLLVGGLSVYGLLLLLLLLGLLFGGGPNGHLWFLPALFVFYLLLPLLLRLLRKLKKLLLL 154

Query: 160 IFLLLILLFFFIGL 173
           + L L+LL   + +
Sbjct: 155 LLLALLLLLSLLYI 168


>gnl|CDD|204928 pfam12459, DUF3687, D-Ala-teichoic acid biosynthesis protein.  This
           family of proteins is found in bacteria. Proteins in
           this family are approximately 50 amino acids in length.
           There are two completely conserved residues (L and Y)
           that may be functionally important.
          Length = 42

 Score = 32.5 bits (75), Expect = 0.018
 Identities = 9/31 (29%), Positives = 18/31 (58%), Gaps = 2/31 (6%)

Query: 145 VFIFLMRRVFFLFIVIFLLLILLFFFIGLEQ 175
              F+++ +F+   +I L LI L+ + G+ Q
Sbjct: 7   AVKFILKTLFYF--LILLALIYLYGYSGIGQ 35


>gnl|CDD|238536 cd01116, P_permease, Permease P (pink-eyed dilution). Mutations in
           the human melanosomal P gene were responsible for
           classic phenotype of oculocutaneous albinism type 2
           (OCA2). Although the precise function of the P protein
           is unknown, it was predicted to regulate the
           intraorganelle pH, together with the ATP-driven proton
           pump. It shows significant sequence similarity to the
           Na+/H+ antiporter NhaD from Vibrio parahaemolyticus.
           Both proteins belong to ArsB/NhaD superfamily of
           permeases that translocate sodium, arsenate, sulfate,
           and organic anions across biological membranes in all
           three kingdoms of life.  A typical ArsB/NhaD permease
           contains 8-13 transmembrane domains.
          Length = 413

 Score = 35.7 bits (83), Expect = 0.024
 Identities = 11/60 (18%), Positives = 22/60 (36%), Gaps = 3/60 (5%)

Query: 140 RVNFLVFIFLMRRVFFLFIVIFLLLILLFFFIGLEQRRKNDNKSQMGSGLEQSRRSDNKS 199
            + F  F+  M     L +V+  +L+   +   L+ R ++         LE      +K 
Sbjct: 159 GLTFNDFLLHMLPSVVLALVVTFILLYFLYRNILKAREEDVLALA---ELEPKYPIKDKV 215


>gnl|CDD|172260 PRK13718, PRK13718, conjugal transfer protein TrbE; Provisional.
          Length = 84

 Score = 33.2 bits (76), Expect = 0.026
 Identities = 12/34 (35%), Positives = 18/34 (52%)

Query: 150 MRRVFFLFIVIFLLLILLFFFIGLEQRRKNDNKS 183
           +  VF +     LL IL FFF  L + +K+D + 
Sbjct: 44  LAAVFVILYSGVLLFILYFFFSALTKLQKHDERK 77


>gnl|CDD|233631 TIGR01906, integ_TIGR01906, integral membrane protein TIGR01906.
           This model represents a family of highly hydrophobic,
           uncharacterized predicted integral membrane proteins
           found almost entirely in low-GC Gram-positive bacteria,
           although a member is also found in the early-branching
           bacterium Aquifex aeolicus.
          Length = 207

 Score = 35.0 bits (81), Expect = 0.029
 Identities = 8/39 (20%), Positives = 19/39 (48%), Gaps = 1/39 (2%)

Query: 137 WVIRVNFLVFIFLMRRVFFLFIVIFLLLILLFFFIGLEQ 175
           W+     L  + L+ + F  F+++  L+I +   +  E+
Sbjct: 109 WIALTFKLRSLSLLIKSFI-FLLLLPLIIGILILLNFEK 146


>gnl|CDD|214793 smart00730, PSN, Presenilin, signal peptide peptidase, family.
           Presenilin 1 and presenilin 2 are polytopic membrane
           proteins, whose genes are mutated in some individuals
           with Alzheimer's disease. Distant homologues, present in
           eukaryotes and archaea, also contain conserved aspartic
           acid residues which are predicted to contribute to
           catalysis. At least one member of this family has been
           shown to possess signal peptide peptidase activity.
          Length = 249

 Score = 34.9 bits (81), Expect = 0.036
 Identities = 7/42 (16%), Positives = 13/42 (30%), Gaps = 4/42 (9%)

Query: 143 FLVFIFLMRRVFFLF----IVIFLLLILLFFFIGLEQRRKND 180
            LV ++   +   +          +L L      LE  R + 
Sbjct: 20  VLVLLYKFFKYLVIVLVIYFSSLGVLFLYSLLYPLEVFRVDY 61


>gnl|CDD|227829 COG5542, COG5542, Predicted integral membrane protein [Function
           unknown].
          Length = 420

 Score = 35.1 bits (81), Expect = 0.042
 Identities = 7/38 (18%), Positives = 14/38 (36%), Gaps = 2/38 (5%)

Query: 132 IKSHSWVIRVNFLVFIFLMRRVFFLFIVIFLLLILLFF 169
           + +   ++R   L+ I  M   +    V F  L   + 
Sbjct: 328 LAALIVLLRP--LILIVCMHERYLNRAVSFSPLAYWYL 363


>gnl|CDD|217315 pfam03006, HlyIII, Haemolysin-III related.  Members of this family
           are integral membrane proteins. This family includes a
           protein with hemolytic activity from Bacillus cereus. It
           has been proposed that YOL002c encodes a Saccharomyces
           cerevisiae protein that plays a key role in metabolic
           pathways that regulate lipid and phosphate metabolism.
           In eukaryotes, members are seven-transmembrane pass
           molecules found to encode functional receptors with a
           broad range of apparent ligand specificities, including
           progestin and adipoQ receptors, and hence have been
           named PAQR proteins. The mammalian members include
           progesterone binding proteins. Unlike the case with GPCR
           receptor proteins, the evolutionary ancestry of the
           members of this family can be traced back to the
           Archaea.
          Length = 207

 Score = 34.5 bits (80), Expect = 0.045
 Identities = 15/91 (16%), Positives = 37/91 (40%), Gaps = 13/91 (14%)

Query: 96  WQCNSP-KQKWMWHKRDHSS-------SSSSLVLSIVHTDIRVVIKSHSWVIRVNFLVFI 147
           + C+S  + K++  K DHS        S +  +L  +   +  ++    W + +  ++  
Sbjct: 63  FSCHSEGRAKYVLRKLDHSGIYLLIAGSYTPFLLYALCGPLGWILLIFIWGLALLGILLK 122

Query: 148 FLMRRVF-----FLFIVIFLLLILLFFFIGL 173
               + F      L++++  L I+    + L
Sbjct: 123 LFWLKRFRWLRTVLYLLMGWLGIIPIKHLIL 153


>gnl|CDD|148697 pfam07242, DUF1430, Protein of unknown function (DUF1430).  This
           family represents the C-terminus (approximately 120
           residues) of a number of hypothetical bacterial proteins
           of unknown function. These are possibly membrane
           proteins involved in immunity.
          Length = 100

 Score = 32.2 bits (74), Expect = 0.073
 Identities = 9/40 (22%), Positives = 18/40 (45%)

Query: 132 IKSHSWVIRVNFLVFIFLMRRVFFLFIVIFLLLILLFFFI 171
            K H   + +  +V++F +   F L     +LL +L   +
Sbjct: 37  FKRHKTYLIIQLIVWVFGLILAFILTKSNIILLFVLVLLL 76


>gnl|CDD|218858 pfam06024, DUF912, Nucleopolyhedrovirus protein of unknown function
           (DUF912).  This family consists of several
           Nucleopolyhedrovirus proteins of unknown function.
          Length = 101

 Score = 32.3 bits (74), Expect = 0.078
 Identities = 12/42 (28%), Positives = 22/42 (52%), Gaps = 4/42 (9%)

Query: 146 FIFLMRRVFFLFIVIFLLLILLFFFIGLEQRRKN--DNKSQM 185
            I L+      F  + +LL  +++F+ L +RRK   +N S +
Sbjct: 62  NIILI--GLLAFFCVLVLLYAIYYFVILRERRKYSTNNPSYV 101


>gnl|CDD|119392 cd06174, MFS, The Major Facilitator Superfamily (MFS) is a large
           and diverse group of secondary transporters that
           includes uniporters, symporters, and antiporters. MFS
           proteins facilitate the transport across cytoplasmic or
           internal membranes of a variety of substrates including
           ions, sugar phosphates, drugs, neurotransmitters,
           nucleosides, amino acids, and peptides. They do so using
           the electrochemical potential of the transported
           substrates. Uniporters transport a single substrate,
           while symporters and antiporters transport two
           substrates in the same or in opposite directions,
           respectively, across membranes. MFS proteins are
           typically 400 to 600 amino acids in length, and the
           majority contain 12 transmembrane alpha helices (TMs)
           connected by hydrophilic loops. The N- and C-terminal
           halves of these proteins display weak similarity and may
           be the result of a gene duplication/fusion event. Based
           on kinetic studies and the structures of a few bacterial
           superfamily members, GlpT (glycerol-3-phosphate
           transporter), LacY (lactose permease), and EmrD
           (multidrug transporter), MFS proteins are thought to
           function through a single substrate binding site,
           alternating-access mechanism involving a rocker-switch
           type of movement. Bacterial members function primarily
           for nutrient uptake, and as drug-efflux pumps to confer
           antibiotic resistance. Some MFS proteins have medical
           significance in humans such as the glucose transporter
           Glut4, which is impaired in type II diabetes, and
           glucose-6-phosphate transporter (G6PT), which causes
           glycogen storage disease when mutated.
          Length = 352

 Score = 33.8 bits (78), Expect = 0.099
 Identities = 8/31 (25%), Positives = 16/31 (51%)

Query: 141 VNFLVFIFLMRRVFFLFIVIFLLLILLFFFI 171
           +  L+ +FL+R +  L +  FLL    +  +
Sbjct: 165 LLALLLLFLLRLLLLLALAFFLLSFGYYGLL 195



 Score = 31.9 bits (73), Expect = 0.43
 Identities = 9/29 (31%), Positives = 14/29 (48%)

Query: 143 FLVFIFLMRRVFFLFIVIFLLLILLFFFI 171
            L      R +F +  ++ LLL LL  F+
Sbjct: 145 LLAESLGWRWLFLILAILGLLLALLLLFL 173


>gnl|CDD|177125 MTH00050, ATP6, ATP synthase F0 subunit 6; Validated.
          Length = 170

 Score = 32.9 bits (76), Expect = 0.10
 Identities = 17/64 (26%), Positives = 35/64 (54%), Gaps = 6/64 (9%)

Query: 115 SSSSLVLSIVHTDIRVVIKSHSWVIRVNFLVFIFLMRRV-----FFLFIVIFLLLILLFF 169
           +  S + S+++  I     S+ + + +  ++F+FL+ R+      FLF+V FL +++   
Sbjct: 4   NDFSSLFSLIYKLILGGSVSYYYSVVLFIVLFLFLLYRLPYIYSPFLFVV-FLFVVVFPL 62

Query: 170 FIGL 173
           FI L
Sbjct: 63  FISL 66


>gnl|CDD|224970 COG2059, ChrA, Chromate transport protein ChrA [Inorganic ion
           transport and metabolism].
          Length = 195

 Score = 33.0 bits (76), Expect = 0.11
 Identities = 13/48 (27%), Positives = 22/48 (45%), Gaps = 1/48 (2%)

Query: 135 HSWVIRVNFLVFIFLMRRVFFLFIVIFLLLILLFFFIGLEQRRKNDNK 182
               + V  ++ I L+   F   +VI L   LL + +GL+ + K   K
Sbjct: 142 GWVGLAVLTVIAILLLPTKFVHPLVIILAG-LLGWLLGLKPKSKRIRK 188


>gnl|CDD|232943 TIGR00367, TIGR00367, K+-dependent Na+/Ca+ exchanger
           related-protein.  This model models a family of
           bacterial and archaeal proteins that is homologous,
           except for lacking a central region of ~ 250 amino acids
           and an N-terminal region of > 100 residues, to a
           functionally proven potassium-dependent sodium-calcium
           exchanger of the rat [Unknown function, General].
          Length = 307

 Score = 33.5 bits (77), Expect = 0.13
 Identities = 10/68 (14%), Positives = 23/68 (33%), Gaps = 11/68 (16%)

Query: 132 IKSHSWVIRVNF---LVFIFLMRRVFF--------LFIVIFLLLILLFFFIGLEQRRKND 180
           I      +R +    L+   L+               +++ L ++ L F +  E+  K D
Sbjct: 93  IIVDKDWLRRDILFYLLVSILLLFFGLDGQISRIDGVVLLILYIVYLLFLVKNERWVKYD 152

Query: 181 NKSQMGSG 188
             ++    
Sbjct: 153 TYTEENLD 160


>gnl|CDD|219521 pfam07695, 7TMR-DISM_7TM, 7TM diverse intracellular signalling.
           This entry represents the transmembrane region of the
           7TM-DISM (7TM Receptors with Diverse Intracellular
           Signalling Modules).
          Length = 207

 Score = 33.0 bits (76), Expect = 0.13
 Identities = 9/30 (30%), Positives = 16/30 (53%), Gaps = 1/30 (3%)

Query: 142 NFLVFIFLMRRVFFLFIVIFLLLILLFFFI 171
           N  +F  L R   +L+ V+++L  LL+   
Sbjct: 19  NLFLFFSL-RDRSYLYYVLYILSFLLYQLS 47



 Score = 32.2 bits (74), Expect = 0.21
 Identities = 9/54 (16%), Positives = 17/54 (31%), Gaps = 16/54 (29%)

Query: 134 SHSWVIRVNFLVFIFL----------------MRRVFFLFIVIFLLLILLFFFI 171
           +    + +  LV  F                 + R+     ++ LLL+LL    
Sbjct: 66  NKLLYLFLALLVIFFALLFARSFLELKKYLPRLDRLLLGLALLLLLLLLLAPLF 119


>gnl|CDD|222700 pfam14348, DUF4400, Domain of unknown function (DUF4400).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria. Proteins in
           this family are typically between 209 and 249 amino
           acids in length. There is a single completely conserved
           residue P that may be functionally important.
          Length = 189

 Score = 32.6 bits (75), Expect = 0.14
 Identities = 12/44 (27%), Positives = 24/44 (54%), Gaps = 1/44 (2%)

Query: 135 HSWVIRVNFLVFIFLMR-RVFFLFIVIFLLLILLFFFIGLEQRR 177
            S+++    ++++FL R  +  L++ +FLL  L     GL +R 
Sbjct: 82  RSYLLAFWLVLYVFLYRLAILLLWLPLFLLAALAGLVDGLVRRD 125


>gnl|CDD|233951 TIGR02614, ftsW, cell division protein FtsW.  This family consists
           of FtsW, an integral membrane protein with ten
           transmembrane segments. In general, it is one of two
           paralogs involved in peptidoglycan biosynthesis, the
           other being RodA, and is essential for cell division.
           All members of the seed alignment for this model are
           encoded in operons for the biosynthesis of
           UDP-N-acetylmuramoyl-pentapeptide, a precursor of murein
           (peptidoglycan). The FtsW designation is not used in
           endospore-forming bacterial (e.g. Bacillus subtilis),
           where the member of this family is designated SpoVE and
           three or more RodA/FtsW/SpoVE family paralogs are
           present. SpoVE acts in spore cortex formation and is
           dispensible for growth. Biological rolls for FtsW in
           cell division include recruitment of penicillin-binding
           protein 3 to the division site [Cell envelope,
           Biosynthesis and degradation of murein sacculus and
           peptidoglycan, Cellular processes, Cell division].
          Length = 356

 Score = 33.3 bits (77), Expect = 0.14
 Identities = 16/70 (22%), Positives = 31/70 (44%), Gaps = 8/70 (11%)

Query: 113 SSSSSSLVLSIVHTDIRVVIKSHSWVIRVNFLVFIFLM------RR--VFFLFIVIFLLL 164
            S+S+++ L +       + +   + +    L+F+         R+  V  L I I LL+
Sbjct: 20  YSASAAVALRLGGDPFYFLKRQLFYALLGLILMFVASRLPLRFWRKLSVPILLIAIVLLV 79

Query: 165 ILLFFFIGLE 174
           ++L   IG E
Sbjct: 80  LVLIPGIGKE 89


>gnl|CDD|223839 COG0768, FtsI, Cell division protein FtsI/penicillin-binding
           protein 2 [Cell envelope biogenesis, outer membrane].
          Length = 599

 Score = 33.5 bits (77), Expect = 0.15
 Identities = 13/48 (27%), Positives = 26/48 (54%), Gaps = 2/48 (4%)

Query: 150 MRRVFFLFIVIFLLLILLFFFIGLEQRRKNDNKSQMGSGLEQSRRSDN 197
            RR+  LF++IFLL  LL   +   Q  + D+ ++  +   +++R+  
Sbjct: 14  RRRLNLLFLIIFLLFALLILRLFYLQIVQGDDYAKEAA--NRNQRTVP 59


>gnl|CDD|217354 pfam03073, TspO_MBR, TspO/MBR family.  Tryptophan-rich sensory
           protein (TspO) is an integral membrane protein that acts
           as a negative regulator of the expression of specific
           photosynthesis genes in response to oxygen/light. It is
           involved in the efflux of porphyrin intermediates from
           the cell. This reduces the activity of
           coproporphyrinogen III oxidase, which is thought to lead
           to the accumulation of a putative repressor molecule
           that inhibits the expression of specific photosynthesis
           genes. Several conserved aromatic residues are necessary
           for TspO function: they are thought to be involved in
           binding porphyrin intermediates. In, the rat
           mitochondrial peripheral benzodiazepine receptor (MBR)
           was shown to not only retain its structure within a
           bacterial outer membrane, but also to be able to
           functionally substitute for TspO in TspO- mutants, and
           to act in a similar manner to TspO in its in situ
           location: the outer mitochondrial membrane. The
           biological significance of MBR remains unclear, however.
           It is thought to be involved in a variety of cellular
           functions, including cholesterol transport in
           steroidogenic tissues.
          Length = 147

 Score = 32.2 bits (74), Expect = 0.16
 Identities = 11/34 (32%), Positives = 16/34 (47%), Gaps = 4/34 (11%)

Query: 141 VNFL---VFIFLMRRVFFLFIVIFLLLILLFFFI 171
           +N L   +F F +       IVI LL +L+   I
Sbjct: 81  LNALWSPLF-FGLHSPGLALIVILLLWLLVLATI 113


>gnl|CDD|222119 pfam13425, O-antigen_lig, O-antigen ligase like membrane protein. 
          Length = 136

 Score = 31.6 bits (72), Expect = 0.19
 Identities = 6/35 (17%), Positives = 17/35 (48%)

Query: 143 FLVFIFLMRRVFFLFIVIFLLLILLFFFIGLEQRR 177
            L+ + L+++   L I++ +L ++    +     R
Sbjct: 30  ALLLLLLLKKRRKLLILLAILGVIALLVLSGRFSR 64


>gnl|CDD|216575 pfam01566, Nramp, Natural resistance-associated macrophage protein.
            The natural resistance-associated macrophage protein
           (NRAMP) family consists of Nramp1, Nramp2, and yeast
           proteins Smf1 and Smf2. The NRAMP family is a novel
           family of functional related proteins defined by a
           conserved hydrophobic core of ten transmembrane domains.
           This family of membrane proteins are divalent cation
           transporters. Nramp1 is an integral membrane protein
           expressed exclusively in cells of the immune system and
           is recruited to the membrane of a phagosome upon
           phagocytosis. By controlling divalent cation
           concentrations Nramp1 may regulate the interphagosomal
           replication of bacteria. Mutations in Nramp1 may
           genetically predispose an individual to susceptibility
           to diseases including leprosy and tuberculosis
           conversely this might however provide protection form
           rheumatoid arthritis. Nramp2 is a multiple divalent
           cation transporter for Fe2+, Mn2+ and Zn2+ amongst
           others it is expressed at high levels in the intestine;
           and is major transferrin-independent iron uptake system
           in mammals. The yeast proteins Smf1 and Smf2 may also
           transport divalent cations.
          Length = 364

 Score = 32.9 bits (76), Expect = 0.19
 Identities = 10/44 (22%), Positives = 16/44 (36%), Gaps = 5/44 (11%)

Query: 135 HSWVIRVNFLVFIFLMR-----RVFFLFIVIFLLLILLFFFIGL 173
           +  V+    LV + L       R     I+  + +I L F   L
Sbjct: 92  YIGVVITAVLVLLLLFLQRGGYRRLERVIIALVAVIALSFIAEL 135


>gnl|CDD|131651 TIGR02602, 8TM_EpsH, exosortase.  This family is designated
           exosortase, and it is the predicted protein-sorting
           transpeptidase for the PEP-CTERM protein-sorting signal
           of many biofilm-producing Gram-negative bacteria. This
           system is analogous to the sortase/LPXTG system found
           mostly in Gram-positive bacteria. Members of this family
           are integral membrane proteins with eight predicted
           transmembrane helices in common, and with a triad of
           invariant residues that matches the catalytic triad of
           sortases. Some members of this family have long trailing
           sequences past the region described by this model, which
           in other species is a separate protein EpsI. This model
           does not include the region of the first predicted
           transmembrane region. The only partially characterized
           member is EpsH of Methylobacillus sp. 12S, part of a
           locus associated with biosynthesis of the
           exopolysaccharide methanolan but itself not involved in
           polysaccharide biosynthesis [Protein fate, Protein and
           peptide secretion and trafficking].
          Length = 241

 Score = 32.7 bits (75), Expect = 0.20
 Identities = 18/81 (22%), Positives = 31/81 (38%), Gaps = 16/81 (19%)

Query: 105 WM-WHKRD---------HSSSSSSLVLS----IVHTDIRVVIKSHSWVIRVNFLVFIFLM 150
           ++ W KR+          S   + LVL     ++ + + V   +   +  V  L    L 
Sbjct: 22  YLAWSKRERLSELPSRPSSLGLALLVLGLAAYLLASVLSVTWMASLSL--VALLAGSVLA 79

Query: 151 RRVFFLFIVIFLLLILLFFFI 171
              F  F ++   L+LL F I
Sbjct: 80  LFGFRAFRLLLFPLLLLLFAI 100


>gnl|CDD|222228 pfam13567, DUF4131, Domain of unknown function (DUF4131).  This
           domain is frequently found to the N-terminus of the
           Competence domain, pfam03772.
          Length = 173

 Score = 32.0 bits (73), Expect = 0.20
 Identities = 11/38 (28%), Positives = 17/38 (44%)

Query: 143 FLVFIFLMRRVFFLFIVIFLLLILLFFFIGLEQRRKND 180
            L+     RR   L +++ LLL  L   +   +R  ND
Sbjct: 28  LLLLYRRKRRRTLLLLLLLLLLGGLGLGLRAYRRAAND 65



 Score = 31.6 bits (72), Expect = 0.30
 Identities = 10/39 (25%), Positives = 19/39 (48%)

Query: 143 FLVFIFLMRRVFFLFIVIFLLLILLFFFIGLEQRRKNDN 181
            L+ +   R+     +++ LLL+L    +GL   R+  N
Sbjct: 26  LLLLLLYRRKRRRTLLLLLLLLLLGGLGLGLRAYRRAAN 64


>gnl|CDD|223843 COG0772, FtsW, Bacterial cell division membrane protein [Cell
           division and chromosome partitioning].
          Length = 381

 Score = 33.0 bits (76), Expect = 0.21
 Identities = 13/67 (19%), Positives = 30/67 (44%), Gaps = 12/67 (17%)

Query: 113 SSSSSSLVLSIVHTDIRVVIKSHSWVIRVNFLVFIFLMRRV---------FFLFIVIFLL 163
           +SS+ +      +  IR ++     ++ +   + + L   +           L+++I LL
Sbjct: 36  ASSAVAERRGPFYFLIRQLV---WLLLGLILALLVILRLPLRFLRKRSFAILLYVIILLL 92

Query: 164 LILLFFF 170
           L+L+ FF
Sbjct: 93  LVLVLFF 99



 Score = 32.6 bits (75), Expect = 0.26
 Identities = 17/75 (22%), Positives = 34/75 (45%), Gaps = 4/75 (5%)

Query: 104 KWMWHKRDHSSSSSSLVLSIVHTDIRVVIKSHSWVIRVNFLVFIFLMRRVFFLFIVIFLL 163
           KW+    D       L+L+ +     V++ S S  +      F FL+R++ +  ++  +L
Sbjct: 8   KWLKRLIDRLLLLLLLLLAAIGL---VMVYSASSAVAERRGPFYFLIRQLVW-LLLGLIL 63

Query: 164 LILLFFFIGLEQRRK 178
            +L+   + L   RK
Sbjct: 64  ALLVILRLPLRFLRK 78


>gnl|CDD|223866 COG0795, COG0795, Predicted permeases [General function prediction
           only].
          Length = 364

 Score = 32.7 bits (75), Expect = 0.22
 Identities = 9/30 (30%), Positives = 16/30 (53%)

Query: 144 LVFIFLMRRVFFLFIVIFLLLILLFFFIGL 173
           L+  +L+R     F+ I L+L+ +   I L
Sbjct: 3   LLDRYLLREYLKTFLAILLVLLAILLIIDL 32


>gnl|CDD|100082 cd06161, S2P-M50_SpoIVFB, SpoIVFB Site-2 protease (S2P), a zinc
           metalloprotease (MEROPS family M50B), regulates
           intramembrane proteolysis (RIP), and is involved in the
           pro-sigmaK pathway of bacterial spore formation. SpoIVFB
           (sporulation protein, stage IV cell wall formation, F
           locus, promoter-distal B) is one of 4 proteins involved
           in endospore formation; the others are SpoIVFA
           (sporulation protein, stage IV cell wall formation, F
           locus, promoter-proximal A), BofA (bypass-of-forespore
           A), and SpoIVB (sporulation protein, stage IV cell wall
           formation, B locus). SpoIVFB is negatively regulated by
           SpoIVFA and BofA and activated by SpoIVB. It is thought
           that SpoIVFB, SpoIVFA, and BofA are located in the
           mother-cell membrane that surrounds the forespore and
           that SpoIVB is secreted from the forespore into the
           space between the two where it activates SpoIVFB.
          Length = 208

 Score = 32.1 bits (74), Expect = 0.23
 Identities = 13/48 (27%), Positives = 22/48 (45%), Gaps = 4/48 (8%)

Query: 133 KSHSWVIRVNFLVFIFLMRRVFFLFIV----IFLLLILLFFFIGLEQR 176
           ++     R+  L  I L+    FL  +    ++LLLI LF ++  E  
Sbjct: 157 RATRIAARIGQLFAILLVVLGLFLLFLGLGNLWLLLIALFIYLAAEAE 204


>gnl|CDD|222381 pfam13789, DUF4181, Domain of unknown function (DUF4181). 
          Length = 109

 Score = 31.1 bits (71), Expect = 0.23
 Identities = 12/46 (26%), Positives = 24/46 (52%), Gaps = 5/46 (10%)

Query: 137 WVIRVNFLVFIFLMRRVFF-----LFIVIFLLLILLFFFIGLEQRR 177
            ++ + FL+ + +   +F      ++ +IFL LILLF F    + +
Sbjct: 35  IILLIIFLIILIIFIYIFITEYLYMYYLIFLFLILLFAFRAFMEWK 80


>gnl|CDD|205037 pfam12716, Apq12, Nuclear pore assembly and biogenesis.  This is a
           family of conserved fungal proteins involved in nuclear
           pore assembly. Apq12 is an integral membrane protein of
           the nuclear envelope (NE) and endoplasmic reticulum. Its
           absence leads to a partial block in mRNA export and
           cold-sensitive defects in the growth and localisation of
           a subset of nucleoporins, particularly those
           asymmetrically localised to the cytoplasmic fibrils. The
           defects in nuclear pore assembly appear to be due to
           defects in regulating membrane fluidity.
          Length = 54

 Score = 29.5 bits (67), Expect = 0.25
 Identities = 10/38 (26%), Positives = 19/38 (50%)

Query: 130 VVIKSHSWVIRVNFLVFIFLMRRVFFLFIVIFLLLILL 167
           + +K   +  RV     I ++R +F+L IV   + + L
Sbjct: 16  ISLKILDYARRVVMFWLILVVRLLFWLAIVGLGVYVYL 53


>gnl|CDD|177152 MTH00087, ATP6, ATP synthase F0 subunit 6; Provisional.
          Length = 195

 Score = 31.9 bits (73), Expect = 0.25
 Identities = 10/30 (33%), Positives = 14/30 (46%), Gaps = 2/30 (6%)

Query: 143 FLVFIFLMRRVFFLFI--VIFLLLILLFFF 170
           +L  + +M   F  FI   IF  LI L+  
Sbjct: 165 WLSILAIMMECFVAFIQSYIFSRLIYLYLN 194


>gnl|CDD|110122 pfam01098, FTSW_RODA_SPOVE, Cell cycle protein.  This entry
           includes the following members; FtsW, RodA, SpoVE.
          Length = 356

 Score = 32.6 bits (75), Expect = 0.26
 Identities = 18/69 (26%), Positives = 33/69 (47%), Gaps = 10/69 (14%)

Query: 116 SSSLVLSIV--HTDIRVVIKSHSWVIRVNFLVFIFLMR-------RVFFLFIVIFLLLIL 166
           S+S V S+V          +   +++ + F++F+ L+R       +  F   +I LLL++
Sbjct: 21  SASGVTSLVLFGDPFFFFKRQLVYLL-LGFIIFVVLLRIPLRFLRKWAFYLFIIGLLLLV 79

Query: 167 LFFFIGLEQ 175
           L F IG   
Sbjct: 80  LVFVIGPSV 88


>gnl|CDD|224209 COG1290, QcrB, Cytochrome b subunit of the bc complex [Energy
           production and conversion].
          Length = 381

 Score = 32.7 bits (75), Expect = 0.26
 Identities = 6/30 (20%), Positives = 13/30 (43%)

Query: 143 FLVFIFLMRRVFFLFIVIFLLLILLFFFIG 172
           FL F  ++R +      +  +  +L  F+ 
Sbjct: 282 FLPFYAILRAIPSKLGGVLAMFPILILFLL 311


>gnl|CDD|217759 pfam03845, Spore_permease, Spore germination protein. 
          Length = 320

 Score = 32.2 bits (74), Expect = 0.27
 Identities = 14/49 (28%), Positives = 23/49 (46%), Gaps = 8/49 (16%)

Query: 135 HSWVIRVNFL-VFIFLMR-------RVFFLFIVIFLLLILLFFFIGLEQ 175
            +W+I +  L V I+L+        R F   I I L+++L      L+Q
Sbjct: 112 PTWIIILTLLLVSIYLISKGIEVLAREFCFIIPILLIILLFILIFSLKQ 160


>gnl|CDD|224900 COG1989, PulO, Type II secretory pathway, prepilin signal peptidase
           PulO and related peptidases [Cell motility and secretion
           / Posttranslational modification, protein turnover,
           chaperones / Intracellular trafficking and secretion].
          Length = 254

 Score = 32.4 bits (74), Expect = 0.28
 Identities = 9/74 (12%), Positives = 19/74 (25%), Gaps = 15/74 (20%)

Query: 100 SPKQKWMWHKRDHSSSSSSLVLSIVHTDIRVVIKSHSWVIRVNFLVFIFLMRRVFFLFIV 159
            P +  +   R       SL                  +  +  L ++ L  +       
Sbjct: 25  LPDKLSISLPR-------SLCPFCGRGL--------KLLDLIPLLSYLLLRGKCRICKSK 69

Query: 160 IFLLLILLFFFIGL 173
           I +   L+    G+
Sbjct: 70  ISIRYPLVELLTGV 83


>gnl|CDD|217705 pfam03739, YjgP_YjgQ, Predicted permease YjgP/YjgQ family.  Members
           of this family are predicted integral membrane proteins
           of unknown function. They are about 350 amino acids long
           and contain about 6 transmembrane regions. They are
           predicted to be permeases although there is no
           verification of this.
          Length = 354

 Score = 32.3 bits (74), Expect = 0.28
 Identities = 8/26 (30%), Positives = 17/26 (65%)

Query: 148 FLMRRVFFLFIVIFLLLILLFFFIGL 173
           +++R     F+++ L+L+LLF  + L
Sbjct: 2   YILREFLKPFLLVLLVLLLLFLLVDL 27


>gnl|CDD|226574 COG4089, COG4089, Predicted membrane protein [Function unknown].
          Length = 235

 Score = 32.0 bits (73), Expect = 0.28
 Identities = 8/28 (28%), Positives = 14/28 (50%)

Query: 145 VFIFLMRRVFFLFIVIFLLLILLFFFIG 172
           + +      F L +   LLL+++F F G
Sbjct: 8   IILSPFYGPFRLLLAFLLLLLMMFIFTG 35


>gnl|CDD|179518 PRK02975, PRK02975, putative common antigen polymerase;
           Provisional.
          Length = 450

 Score = 32.3 bits (74), Expect = 0.29
 Identities = 16/38 (42%), Positives = 21/38 (55%), Gaps = 2/38 (5%)

Query: 137 WVIRVNFLVFI--FLMRRVFFLFIVIFLLLILLFFFIG 172
           W++   F+  +  F  RRV F F V F LL LL F+ G
Sbjct: 13  WLLSTLFIATLTWFEFRRVRFNFNVFFSLLYLLTFYFG 50


>gnl|CDD|240530 cd13125, MATE_like_10, Uncharacterized subfamily of the multidrug
           and toxic compound extrusion (MATE) proteins.  This
           family might function as a translocase for
           lipopolysaccharides, such as O-antigen. The integral
           membrane proteins from the MATE family are involved in
           exporting metabolites across the cell membrane and are
           responsible for multidrug resistance (MDR) in many
           bacteria and animals. A number of family members are
           involved in the synthesis of peptidoglycan components in
           bacteria.
          Length = 409

 Score = 32.5 bits (75), Expect = 0.29
 Identities = 19/52 (36%), Positives = 27/52 (51%), Gaps = 13/52 (25%)

Query: 131 VIKSHSWVIRVNFLVFIFL---MRRVFF----LFIVIFLLLILLFFFI-GLE 174
             K  SW+     L +I L   M ++F     LF ++F+LL LLF  + GLE
Sbjct: 338 FFKIASWL-----LGYILLAKGMTKLFIITEILFALLFVLLSLLFIPLFGLE 384


>gnl|CDD|217647 pfam03631, Virul_fac_BrkB, Virulence factor BrkB.  This family acts
           as a virulence factor. In Bordetella pertussis, brkB is
           essential for resistance to complement-dependent killing
           by serum. This family was originally predicted to be
           ribonuclease BN, but this prediction has since been
           shown to be incorrect.
          Length = 255

 Score = 32.2 bits (74), Expect = 0.29
 Identities = 11/53 (20%), Positives = 23/53 (43%), Gaps = 2/53 (3%)

Query: 118 SLVLSIVHTDIRVVIKSHSWVIRVNFLVFIFLMRRVFFLFIVIFLLLILLFFF 170
           SL+LS+    +   +     +     +  +  + R     +++FLL  LL+ F
Sbjct: 128 SLLLSVFLPSLLGFLILLLGLG--LEIAVLLQLLRWLLSLLLLFLLFALLYRF 178


>gnl|CDD|130787 TIGR01726, HEQRo_perm_3TM, amine acid ABC transporter, permease
           protein, 3-TM region, His/Glu/Gln/Arg/opine family.
           This model represents one of several classes of multiple
           membrane spanning regions found immediately N-terminal
           to the domain described by pfam00528,
           binding-protein-dependent transport systems inner
           membrane component. The region covered by This model
           generally is predicted to contain three transmembrane
           helices. Substrate specificities attributed to members
           of this family include histidine, arginine, glutamine,
           glutamate, and (in Agrobacterium) the opines octopine
           and nopaline [Transport and binding proteins, Amino
           acids, peptides and amines].
          Length = 99

 Score = 30.6 bits (70), Expect = 0.31
 Identities = 14/57 (24%), Positives = 24/57 (42%), Gaps = 3/57 (5%)

Query: 118 SLVLSIVHTDIRVVIKSHSWVIRVNFLVFIFLMRRVFFLFIVIFLLLILLFFFIGLE 174
            LVL ++   +R+   S +  +R    V++ L R    L  + F+   L    I L 
Sbjct: 23  GLVLGLLLALLRL---SGNRPLRWIATVYVELFRGTPLLVQLFFIYFGLPLIGIRLS 76


>gnl|CDD|224191 COG1272, COG1272, Predicted membrane protein, hemolysin III homolog
           [General function prediction only].
          Length = 226

 Score = 31.8 bits (73), Expect = 0.33
 Identities = 17/88 (19%), Positives = 37/88 (42%), Gaps = 11/88 (12%)

Query: 99  NSPKQKWMWHKRDHSS-------SSSSLVLSIVHTDIRVVIKSHSWVIRVNFLVFIFLMR 151
           N  K K +  K DHS        S +  +L  ++  +  ++    W + +  ++F    +
Sbjct: 80  NGQKAKAILRKFDHSGIYVLIAGSYTPFLLVGLYGPLGWILLGLIWGLALAGILFKLFFK 139

Query: 152 RVF-FLFIVIFLL---LILLFFFIGLEQ 175
           + F  L +V++L    L L+     + +
Sbjct: 140 KRFRKLSLVLYLAMGWLGLIVIKPLIAK 167


>gnl|CDD|148492 pfam06899, WzyE, WzyE protein.  This family consists of several
           WzyE proteins which appear to be specific to
           Enterobacteria. Members of this family are described as
           putative ECA polymerases this has been found to be
           incorrect. The function of this family is unknown.
          Length = 449

 Score = 32.1 bits (73), Expect = 0.36
 Identities = 14/39 (35%), Positives = 21/39 (53%), Gaps = 2/39 (5%)

Query: 137 WVIRVNFLVFI--FLMRRVFFLFIVIFLLLILLFFFIGL 173
           +++ + F+  +     RRV F F V F LL LL F+ G 
Sbjct: 13  YLLSLTFIATLTYREFRRVRFNFNVFFSLLYLLTFYFGF 51


>gnl|CDD|218602 pfam05478, Prominin, Prominin.  The prominins are an emerging
           family of proteins that among the multispan membrane
           proteins display a novel topology. Mouse prominin and
           human prominin (mouse)-like 1 (PROML1) are predicted to
           contain five membrane spanning domains, with an
           N-terminal domain exposed to the extracellular space
           followed by four, alternating small cytoplasmic and
           large extracellular, loops and a cytoplasmic C-terminal
           domain. The exact function of prominin is unknown
           although in humans defects in PROM1, the gene coding for
           prominin, cause retinal degeneration.
          Length = 807

 Score = 32.3 bits (74), Expect = 0.36
 Identities = 11/28 (39%), Positives = 15/28 (53%), Gaps = 2/28 (7%)

Query: 147 IFLMRRV--FFLFIVIFLLLILLFFFIG 172
            FLM  V   FLF    +L++L  F +G
Sbjct: 461 RFLMAGVGFSFLFSWFLMLVVLALFLVG 488


>gnl|CDD|222537 pfam14093, DUF4271, Domain of unknown function (DUF4271).  This
           family of integral membrane proteins is functionally
           uncharacterized. This family of proteins is found in
           bacteria. Proteins in this family are typically between
           221 and 326 amino acids in length.
          Length = 207

 Score = 31.4 bits (72), Expect = 0.40
 Identities = 19/85 (22%), Positives = 32/85 (37%), Gaps = 16/85 (18%)

Query: 105 WMWHKRDHSSSSS----SLVLSIVHTDIRVVIKSHSWVIRVNFLVFIFLMR---RVFFLF 157
           W++ K   SS        LVL +V+ D      +   ++ +  L  I L+    ++FF  
Sbjct: 121 WLFSKLSLSSYFGLLLFPLVLLLVYFDSLSFYIAVIILLIILLLGKILLLYKYYKIFFRK 180

Query: 158 IVIFLLLIL---------LFFFIGL 173
           +  FL  IL         L     +
Sbjct: 181 LFSFLYFILYLCALEIAPLLILYKV 205



 Score = 29.9 bits (68), Expect = 1.3
 Identities = 12/34 (35%), Positives = 16/34 (47%), Gaps = 7/34 (20%)

Query: 145 VFIFLMRRVFFL-------FIVIFLLLILLFFFI 171
           +FI+L  R F L        IV  L+   +F FI
Sbjct: 65  LFIYLYLRDFGLELFFSVQLIVFGLIFGGVFLFI 98


>gnl|CDD|224214 COG1295, Rbn, Ribonuclease BN family enzyme [Replication,
           recombination, and repair].
          Length = 303

 Score = 31.9 bits (73), Expect = 0.40
 Identities = 9/43 (20%), Positives = 15/43 (34%), Gaps = 5/43 (11%)

Query: 132 IKSHSWVIRVNFLVFIFLMRRVFFLFIVI-FLLLILLFFFIGL 173
           +K     IR   L   +         + + F LL+ +   I L
Sbjct: 131 VKPRRSFIRRRLLSLFW----TLGTLLPLLFALLLSVVGPIAL 169


>gnl|CDD|222658 pfam14296, O-ag_pol_Wzy, O-antigen polysaccharide polymerase Wzy.
           This family includes O-antigen polysaccharide
           polymerases. These enzymes link O-units via a glycosidic
           linkage to form a long O-antigen. These enzymes vary in
           specificity and sequence.
          Length = 440

 Score = 31.9 bits (73), Expect = 0.43
 Identities = 22/108 (20%), Positives = 35/108 (32%), Gaps = 24/108 (22%)

Query: 111 DHSSSSSSLVLSI-VHTDIRVVIKSHSWVIRVNFLV--FIFLMRRVF------------- 154
           +        +L I +   +  + K     +   FL+  F+FL  R F             
Sbjct: 13  NDLLFLLVALLLIWIANFVYALKKLKKRSLYFLFLITFFVFLFGRPFLDYLFGYNDSIFA 72

Query: 155 --------FLFIVIFLLLILLFFFIGLEQRRKNDNKSQMGSGLEQSRR 194
                       +I L+L LLF F+G    RK   K +      +  R
Sbjct: 73  NDFYNEDIIFEALILLILSLLFLFLGYILGRKKKRKKEKKKNQLEYER 120



 Score = 29.6 bits (67), Expect = 2.4
 Identities = 8/43 (18%), Positives = 18/43 (41%), Gaps = 2/43 (4%)

Query: 132 IKSHSWVIRVNFLVFIFLM--RRVFFLFIVIFLLLILLFFFIG 172
           I        + F +++     ++     ++I+LL +LL    G
Sbjct: 171 IILIGSFFPIAFCLYLATKPSKKKILFILIIYLLYLLLSLLTG 213



 Score = 29.6 bits (67), Expect = 2.5
 Identities = 9/29 (31%), Positives = 12/29 (41%)

Query: 145 VFIFLMRRVFFLFIVIFLLLILLFFFIGL 173
             +F  R  F  FI   L LI +   I +
Sbjct: 411 NILFAPRSSFSGFISPILSLIFILLLILI 439



 Score = 28.0 bits (63), Expect = 7.9
 Identities = 12/40 (30%), Positives = 18/40 (45%), Gaps = 6/40 (15%)

Query: 137 WVIRVNFLVFIFLM------RRVFFLFIVIFLLLILLFFF 170
            V   +FL+ I L       +R  FL ++  LLL  + F 
Sbjct: 376 GVFIGSFLLGILLRKLDKKFKRNTFLLVLSLLLLPNILFA 415


>gnl|CDD|224748 COG1835, COG1835, Predicted acyltransferases [Lipid metabolism].
          Length = 386

 Score = 31.7 bits (72), Expect = 0.43
 Identities = 15/44 (34%), Positives = 22/44 (50%), Gaps = 6/44 (13%)

Query: 134 SHSWVIRVN------FLVFIFLMRRVFFLFIVIFLLLILLFFFI 171
            H W + V       F + + L+RRV F   ++ LLL L  FF+
Sbjct: 150 LHLWSLSVEEQFYLLFPLLLLLLRRVRFALALLLLLLALASFFL 193


>gnl|CDD|234638 PRK00108, mraY, phospho-N-acetylmuramoyl-pentapeptide-transferase;
           Provisional.
          Length = 344

 Score = 31.6 bits (73), Expect = 0.44
 Identities = 12/37 (32%), Positives = 17/37 (45%), Gaps = 1/37 (2%)

Query: 143 FLVFIFLMRRVFFLFIVIFLL-LILLFFFIGLEQRRK 178
           F VF +L  R     +   L+ LIL   FI   ++ K
Sbjct: 1   FNVFRYLTFRALLALLTALLISLILGPRFIRFLRKLK 37


>gnl|CDD|216181 pfam00902, TatC, Sec-independent protein translocase protein
           (TatC).  The bacterial Tat system has a remarkable
           ability to transport folded proteins even enzyme
           complexes across the cytoplasmic membrane. It is
           structurally and mechanistically similar to the Delta
           pH-driven thylakoidal protein import pathway. A
           functional Tat system or Delta pH-dependent pathway
           requires three integral membrane proteins: TatA/Tha4,
           TatB/Hcf106 and TatC/cpTatC. The TatC protein is
           essential for the function of both pathways. It might be
           involved in twin-arginine signal peptide recognition,
           protein translocation and proton translocation. Sequence
           analysis predicts that TatC contains six transmembrane
           helices (TMHs), and experimental data confirmed that N-
           and C-termini of TatC or cpTatC are exposed to the
           cytoplasmic or stromal face of the membrane. The
           cytoplasmic N-terminus and the first cytoplasmic loop
           region of the Escherichia coli TatC protein are
           essential for protein export. At least two TatC
           molecules co-exist within each Tat translocon.
          Length = 212

 Score = 31.3 bits (72), Expect = 0.46
 Identities = 7/22 (31%), Positives = 12/22 (54%)

Query: 149 LMRRVFFLFIVIFLLLILLFFF 170
           L +R+ +  I   L  I+ +FF
Sbjct: 7   LRKRLIYSLIAFILGFIICYFF 28


>gnl|CDD|226780 COG4330, COG4330, Predicted membrane protein [Function unknown].
          Length = 211

 Score = 31.3 bits (71), Expect = 0.48
 Identities = 21/106 (19%), Positives = 42/106 (39%), Gaps = 28/106 (26%)

Query: 87  WEVNLLGRAWQCNSPKQKWMWHKRDHSSSSSSLVLSIVH-----------TDIRVVIKSH 135
           +E+++     +  S ++ W++             L +V            TD+  + K  
Sbjct: 35  FELSVWLFRKR-RSWRRSWLF-----------WPLFLVFLLFLPNAPYLLTDLIHLQKLE 82

Query: 136 SWVIRVNFLVFIFLMRRVFFLFIVIFLLL---ILLFFFIGLEQRRK 178
            +   ++  + I L R   +LF+ +F  L    LL     LE+RR+
Sbjct: 83  RYGAELS--LIILLWRPFTYLFLGVFFGLYVMSLLNAGYLLERRRR 126


>gnl|CDD|214413 MTH00062, ND4, NADH dehydrogenase subunit 4; Provisional.
          Length = 417

 Score = 31.8 bits (73), Expect = 0.49
 Identities = 22/89 (24%), Positives = 31/89 (34%), Gaps = 24/89 (26%)

Query: 106 MWHKRDHSSSSSSLVLSIVH-TDIRVVIKSHSWVIRV----NFLVFIFL----M------ 150
            W   D  S     VLS +  T + +    H+W+  V    +  V IFL    M      
Sbjct: 156 DWGSDDSVSLIVYYVLSFMFITKVPLF-PFHTWLPIVHAEASSPVSIFLSGYIMKLGLLG 214

Query: 151 --RRVFFLF------IVIFLLLILLFFFI 171
             R   F+F       +    L  +FF I
Sbjct: 215 VYRFCGFIFSGSFLGYLFLCCLFSVFFLI 243



 Score = 28.4 bits (64), Expect = 5.5
 Identities = 12/41 (29%), Positives = 18/41 (43%), Gaps = 3/41 (7%)

Query: 136 SWVIRVNFLVFIFLMRRVF---FLFIVIFLLLILLFFFIGL 173
             +        I ++  VF   F+F  +   LILL  F+GL
Sbjct: 19  CLLYSSGVGCCILVVNNVFNGLFIFDSLSFYLILLVLFLGL 59


>gnl|CDD|216416 pfam01292, Ni_hydr_CYTB, Prokaryotic cytochrome b561.  This family
           includes cytochrome b561 and related proteins, in
           addition to the nickel-dependent hydrogenases b-type
           cytochrome subunit. Cytochrome b561 is a secretory
           vesicle-specific electron transport protein. It is an
           integral membrane protein, that binds two heme groups
           non-covalently. This is a prokaryotic family. Members of
           the 'eukaryotic cytochrome b561' family can be found in
           Pfam: PF03188.
          Length = 140

 Score = 30.4 bits (69), Expect = 0.49
 Identities = 7/25 (28%), Positives = 18/25 (72%)

Query: 143 FLVFIFLMRRVFFLFIVIFLLLILL 167
             + + L+R ++ + ++++LLL+LL
Sbjct: 50  VFLALLLLRLLWAVHLLLYLLLLLL 74


>gnl|CDD|219137 pfam06687, SUR7, SUR7/PalI family.  This family consists of several
           fungal-specific SUR7 proteins. Its activity regulates
           expression of RVS161, a homologue of human endophilin,
           suggesting a function for both in endocytosis. The
           protein carries four transmembrane domains and is thus
           likely to act as an anchoring protein for the eisosome
           to the plasma membrane. Eisosomes are the immobile
           protein complexes, that include the proteins Pil1 and
           Lsp1, which co-localise with sites of protein and lipid
           endocytosis at the plasma membrane. SUR7 protein may
           play a role in sporulation. This family also includes
           PalI which is part of a pH signal transduction cascade.
           Based on the similarity of PalI to the yeast Rim9
           meiotic signal transduction component it has been
           suggested that PalI might be a membrane sensor for
           ambient pH.
          Length = 205

 Score = 31.2 bits (71), Expect = 0.50
 Identities = 13/68 (19%), Positives = 23/68 (33%), Gaps = 5/68 (7%)

Query: 120 VLSIVHTDIRVVIKSHSWVIRVNFLVFIFLMRRVFFLFIVIFLLLILLFFFIGLEQRRKN 179
           V++++   I     S      VNFL+ +      F   ++   L   LF          N
Sbjct: 124 VIALILAGILAHFSSPRRGSLVNFLLSLL----AFLFTLLAAALDTALFVKA-RNAFNSN 178

Query: 180 DNKSQMGS 187
              + +G 
Sbjct: 179 GISASLGP 186


>gnl|CDD|221501 pfam12273, RCR, Chitin synthesis regulation, resistance to Congo
           red.  RCR proteins are ER membrane proteins that
           regulate chitin deposition in fungal cell walls.
           Although chitin, a linear polymer of beta-1,4-linked
           N-acetylglucosamine, constitutes only 2% of the cell
           wall it plays a vital role in the overall protection of
           the cell wall against stress, noxious chemicals and
           osmotic pressure changes. Congo red is a cell
           wall-disrupting benzidine-type dye extensively used in
           many cell wall mutant studies that specifically targets
           chitin in yeast cells and inhibits growth. RCR proteins
           render the yeasts resistant to Congo red by diminishing
           the content of chitin in the cell wall. RCR proteins are
           probably regulating chitin synthase III interact
           directly with ubiquitin ligase Rsp5, and the VPEY motif
           is necessary for this, via interaction with the WW
           domains of Rsp5.
          Length = 124

 Score = 30.1 bits (68), Expect = 0.52
 Identities = 11/30 (36%), Positives = 16/30 (53%), Gaps = 1/30 (3%)

Query: 151 RRVFF-LFIVIFLLLILLFFFIGLEQRRKN 179
           R V F +FI+  L+L  L   I   +RR+ 
Sbjct: 1   RWVLFAIFIIALLILFFLTARINRRRRRRG 30


>gnl|CDD|133466 cd06856, GT_GPT_archaea, UDP-GlcNAc:dolichol-P GlcNAc-1-P
           transferase (GPT)-like proteins in archaea. Eukaryotic
           GPT catalyzes the transfer of GlcNAc-1-P from UDP-GlcNAc
           to dolichol-P to form GlcNAc-P-P-dolichol. The reaction
           is the first step in the assembly of dolichol-linked
           oligosaccharide intermediates and is essential for
           eukaryotic N-linked glycosylation. Evidence for the
           existence of the N-glycosylation pathway in archaea has
           emerged and genes responsible for the pathway have been
           identified. A glycosyl transferase gene Mv1751 in M.
           voltae encodes for the enzyme that carries out the first
           step in the pathway, the attachment of GlcNAc to a
           dolichol lipid carrier in the membrane. A lethal
           mutation in the alg7 (GPT) gene in Saccharomyces
           cerevisiae was successfully complemented with Mv1751,
           the archaea gene, indicating that eukaryotic and
           archaeal enzymes may use the same substrates and are
           evolutionarily closer than the bacterial enzyme, which
           uses a different substrate.
          Length = 280

 Score = 31.4 bits (72), Expect = 0.55
 Identities = 10/37 (27%), Positives = 15/37 (40%)

Query: 137 WVIRVNFLVFIFLMRRVFFLFIVIFLLLILLFFFIGL 173
            V+    L  +FL      +  +  L+  LL   IGL
Sbjct: 20  AVLLGFSLGLLFLSALTHSVEALALLITSLLAGLIGL 56


>gnl|CDD|221739 pfam12730, ABC2_membrane_4, ABC-2 family transporter protein.  This
           family is related to the ABC-2 membrane transporter
           family pfam01061.
          Length = 230

 Score = 31.2 bits (71), Expect = 0.55
 Identities = 17/61 (27%), Positives = 31/61 (50%), Gaps = 3/61 (4%)

Query: 112 HSSSSSSLVLSIVHTDIRVVIKSHSWVIRVNFLVFIFLMRRVFFLFIVIFLLLILLFFFI 171
            S  S SL+L  +   + + + +   ++    ++F+ L+ R   + I I LLLILL   +
Sbjct: 123 GSGFSLSLLLGALLLLLLLSLLALLLIL---LILFLSLLLRNSAVAIGIGLLLILLGLLL 179

Query: 172 G 172
           G
Sbjct: 180 G 180



 Score = 30.8 bits (70), Expect = 0.79
 Identities = 11/56 (19%), Positives = 25/56 (44%), Gaps = 1/56 (1%)

Query: 118 SLVLSIVHTDIRVVIKSHSWVIRVNFLVFIFLMRRVFFLFIVIFLLLILLFFFIGL 173
           SL+ S++   + ++          +  + +  +  +  L ++  LL ILL  F+ L
Sbjct: 103 SLLASLLLLLLSLIAGLLLGGSGFSLSLLLGALLLLLLLSLLALLL-ILLILFLSL 157



 Score = 29.3 bits (66), Expect = 2.5
 Identities = 8/26 (30%), Positives = 13/26 (50%)

Query: 146 FIFLMRRVFFLFIVIFLLLILLFFFI 171
            + L R   FL ++I  LL+ L   +
Sbjct: 4   LLKLKRTKIFLILLILPLLLALLALL 29


>gnl|CDD|220321 pfam09624, DUF2393, Protein of unknown function (DUF2393).  The
           function of this protein is unknown. It is always found
           as part of a two-gene operon with IPR013416, a protein
           that appears to span the membrane seven times. It has so
           far been found in the bacteria Nostoc sp. PCC 7120,
           Agrobacterium tumefaciens, Rhizobium meliloti, and
           Gloeobacter violaceus.
          Length = 149

 Score = 30.4 bits (69), Expect = 0.57
 Identities = 12/35 (34%), Positives = 16/35 (45%), Gaps = 6/35 (17%)

Query: 143 FLVFIFL------MRRVFFLFIVIFLLLILLFFFI 171
           FL+ +FL            L I   +LLIL+ FF 
Sbjct: 3   FLLILFLGIKLKKKSLALSLIIASIILLILIPFFG 37


>gnl|CDD|221995 pfam13231, PMT_2, Dolichyl-phosphate-mannose-protein
           mannosyltransferase.  This family contains members that
           are not captured by pfam02366.
          Length = 158

 Score = 30.6 bits (70), Expect = 0.58
 Identities = 9/37 (24%), Positives = 16/37 (43%)

Query: 136 SWVIRVNFLVFIFLMRRVFFLFIVIFLLLILLFFFIG 172
           +W+I +   +F      V     +  LL ILL + + 
Sbjct: 7   AWLIALFTKLFGDSEFAVRLPSALAGLLSILLLYLLA 43


>gnl|CDD|236913 PRK11463, fxsA, phage T7 F exclusion suppressor FxsA; Reviewed.
          Length = 148

 Score = 30.6 bits (70), Expect = 0.60
 Identities = 7/23 (30%), Positives = 12/23 (52%)

Query: 150 MRRVFFLFIVIFLLLILLFFFIG 172
           MR    LF++  L+ I +F  + 
Sbjct: 1   MRPFLLLFLLYPLIEIAVFIAVA 23


>gnl|CDD|177154 MTH00091, ND2, NADH dehydrogenase subunit 2; Provisional.
          Length = 273

 Score = 31.0 bits (71), Expect = 0.60
 Identities = 9/31 (29%), Positives = 13/31 (41%)

Query: 141 VNFLVFIFLMRRVFFLFIVIFLLLILLFFFI 171
           V F V IF +  +  +     L L+ L F  
Sbjct: 204 VTFFVKIFSLSEILKVLGFFLLFLLFLMFLS 234



 Score = 29.8 bits (68), Expect = 1.6
 Identities = 15/38 (39%), Positives = 19/38 (50%), Gaps = 4/38 (10%)

Query: 134 SHSWVIRVNFLVFIFLMRRVFFLFIVIFLLLILLFFFI 171
           S +W+I    L F F    VFFLF   FLL++ L    
Sbjct: 147 SFNWLI----LSFFFSFFNVFFLFFYYFLLMLFLIPKF 180



 Score = 27.5 bits (62), Expect = 8.4
 Identities = 7/28 (25%), Positives = 12/28 (42%)

Query: 143 FLVFIFLMRRVFFLFIVIFLLLILLFFF 170
            LV +    + FF  ++  LL+     F
Sbjct: 104 LLVLLQFFSKNFFFLLLFGLLICFFQMF 131


>gnl|CDD|222479 pfam13967, RSN1_TM, Late exocytosis, associated with Golgi
           transport.  This family represents the first three
           transmembrane regions of 11-TM proteins involved in
           vesicle transport. In S. cerevisiae these proteins are
           members of the yeast facilitator superfamily and are
           integral membrane proteins localised to the cell
           periphery, in particular to the bud-neck region. The
           distribution is consistent with a role in late
           exocytosis which is in agreement with the proteins'
           ability to substitute for the function of Sro7p,
           required for the sorting of the protein Enap1 into
           Golgi-derived vesicles destined for the cell surface.
          Length = 151

 Score = 30.6 bits (70), Expect = 0.61
 Identities = 11/47 (23%), Positives = 20/47 (42%), Gaps = 8/47 (17%)

Query: 128 IRVVIK-SHSWVIRVN------FLVFIFLMRRVFFLFIVIFLLLILL 167
           I  ++K S   +++        FL F+ +  +  FL   +    ILL
Sbjct: 57  IPPLLKISDEEILQHAGLDAYVFLRFLKMGIK-IFLVGCVIGWPILL 102


>gnl|CDD|217991 pfam04258, Peptidase_A22B, Signal peptide peptidase.  The members
           of this family are membrane proteins. In some proteins
           this region is found associated with pfam02225. This
           family corresponds with Merops subfamily A22B, the type
           example of which is signal peptide peptidase. There is a
           sequence-similarity relationship with pfam01080.
          Length = 291

 Score = 31.2 bits (71), Expect = 0.63
 Identities = 20/82 (24%), Positives = 34/82 (41%), Gaps = 14/82 (17%)

Query: 99  NSPKQKWMWHKRDHSSSSSSLV---LSIVHTDIRVVIKSHSW----VIRVNFLV-FIFLM 150
             PK+K     ++   + S LV   L IV      V++   W    ++ +   +  I ++
Sbjct: 77  RFPKKKSKLELKNGRFTYSELVALLLCIVFAVWWYVLRKEHWILQDILGIALCINVIKIL 136

Query: 151 RRVFFL--FIVIFLLLILLFFF 170
           R    L    V  +LL  LFF+
Sbjct: 137 R----LPNLKVGTILLSGLFFY 154


>gnl|CDD|215864 pfam00335, Tetraspannin, Tetraspanin family. 
          Length = 221

 Score = 30.8 bits (70), Expect = 0.63
 Identities = 13/57 (22%), Positives = 22/57 (38%), Gaps = 10/57 (17%)

Query: 144 LVFIFLMRRVFFLFIVIFLLLILLFFFIGLEQRRKNDNKSQMGSGLEQSRRSDNKSQ 200
           L +  L+     LFI+     IL F +       ++  +S +  GL    +S N   
Sbjct: 77  LTYFILL---LILFILEIAAGILAFVY-------RDKLESSLKEGLNYKDKSYNDDP 123


>gnl|CDD|217165 pfam02653, BPD_transp_2, Branched-chain amino acid transport system
           / permease component.  This is a large family mainly
           comprising high-affinity branched-chain amino acid
           transporter proteins such as E. coli LivH and LivM, both
           of which are form the LIV-I transport system. Also found
           with in this family are proteins from the galactose
           transport system permease and a ribose transport system.
          Length = 267

 Score = 31.0 bits (71), Expect = 0.66
 Identities = 11/60 (18%), Positives = 23/60 (38%), Gaps = 6/60 (10%)

Query: 119 LVLSIVHTDIRVVIKSHSWVIRV-----NFLVFIFLMRRVFFLFIVIFLLLILLFFFIGL 173
           L L ++   +       S ++ +      FL F F +  +  L + + + L+L     G 
Sbjct: 97  LTLFLLTGILGGP-AGTSTIVDLLGFPGLFLSFGFGVIFLLALLLALLVWLLLRRTRFGR 155


>gnl|CDD|192487 pfam10225, DUF2215, Uncharacterized conserved protein (DUF2215).
           This entry is the central 200 residues of a family of
           proteins conserved from worms to humans. The function is
           unknown.
          Length = 248

 Score = 30.8 bits (70), Expect = 0.67
 Identities = 21/106 (19%), Positives = 34/106 (32%), Gaps = 8/106 (7%)

Query: 135 HSWVIRVNFLVFIF---LMRRVFFLFIVIFLLLILLFFFIG--LEQRRKNDNKSQMGSG- 188
             W +++  L  I    L   +    +++ LL   L   I       RK   + Q GS  
Sbjct: 130 VKWTLQLIALTLILQSSLDPLLATGALILALLTKFLLTPISWLKGLYRKYRRRFQPGSRR 189

Query: 189 --LEQSRRSDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQ 232
              E+      + +  S LE+ R   N          SR    +  
Sbjct: 190 LLTEEEYEEQGEIETDSSLEELREYCNSPDCSPWKTFSRLRSPRRF 235


>gnl|CDD|223913 COG0843, CyoB, Heme/copper-type cytochrome/quinol oxidases, subunit
           1 [Energy production and conversion].
          Length = 566

 Score = 31.5 bits (72), Expect = 0.70
 Identities = 12/49 (24%), Positives = 18/49 (36%), Gaps = 14/49 (28%)

Query: 136 SWVIRVNFLVFIFLMRR-----------VFFLFIVIFLLLILLFFFIGL 173
           S +  +NF+V I  MR             + + I   L   +L  F  L
Sbjct: 161 SLLGAINFIVTILNMRAPGMTMMKMPLFTWSILITAIL---ILLAFPVL 206


>gnl|CDD|223425 COG0348, NapH, Polyferredoxin [Energy production and conversion].
          Length = 386

 Score = 31.3 bits (71), Expect = 0.71
 Identities = 13/57 (22%), Positives = 21/57 (36%), Gaps = 6/57 (10%)

Query: 123 IVHTDIRVVIKSHSWVIRV--NFLVFIFLMRRVFFLFIVI----FLLLILLFFFIGL 173
           +V     ++  S  W+  +    ++  FL R   F  +      FLLL LL      
Sbjct: 14  LVLLATLILFLSTPWLRVLGSQAVLLDFLGRPFGFFGLQFWAQEFLLLALLLIGALF 70


>gnl|CDD|177215 MTH00158, ATP8, ATP synthase F0 subunit 8; Provisional.
          Length = 32

 Score = 27.8 bits (63), Expect = 0.74
 Identities = 8/18 (44%), Positives = 13/18 (72%)

Query: 154 FFLFIVIFLLLILLFFFI 171
           F LF++ F+L  +L +FI
Sbjct: 13  FILFLITFILFNILNYFI 30


>gnl|CDD|233180 TIGR00912, 2A0309, spore germination protein (amino acid permease).
            This model describes spore germination protein GerKB
           and paralogs from Bacillus subtilis, Clostridium tetani,
           and other known or predicted endospore-forming members
           of the Firmicutes (low-GC Gram positive bacteria).
           Members show some similarity to amino acid permeases
           [Transport and binding proteins, Amino acids, peptides
           and amines].
          Length = 359

 Score = 31.1 bits (71), Expect = 0.78
 Identities = 13/57 (22%), Positives = 26/57 (45%), Gaps = 1/57 (1%)

Query: 119 LVLSIVHTDIRVVIKSHSWVIRVN-FLVFIFLMRRVFFLFIVIFLLLILLFFFIGLE 174
           +++ I+   I +V K    ++R    L+ IFL+  +  L ++   L  +      LE
Sbjct: 118 IIILIIIVSIYIVRKGIEVLLRTAEILLIIFLILFILVLILLAPKLGNIKNLLPVLE 174


>gnl|CDD|223912 COG0842, COG0842, ABC-type multidrug transport system, permease
           component [Defense mechanisms].
          Length = 286

 Score = 31.1 bits (70), Expect = 0.79
 Identities = 12/65 (18%), Positives = 28/65 (43%), Gaps = 10/65 (15%)

Query: 119 LVLSIVHTDIRVVIKSHSWVIRVNFLVFIFLM----------RRVFFLFIVIFLLLILLF 168
           L++S V     ++ K   +++  + +  + L+               L +++ LLL+L  
Sbjct: 123 LLVSPVSRLFILLGKIVPYLVVASLIAGLVLLVIAFLLGVPFLGSLLLLLLLLLLLLLAT 182

Query: 169 FFIGL 173
             +GL
Sbjct: 183 VALGL 187


>gnl|CDD|225764 COG3223, COG3223, Predicted membrane protein [Function unknown].
          Length = 138

 Score = 30.0 bits (68), Expect = 0.79
 Identities = 13/50 (26%), Positives = 21/50 (42%), Gaps = 15/50 (30%)

Query: 139 IRVNFLVFIFLMRRVFFLFIVIFLLL-----------ILLFF----FIGL 173
           I +  ++ +FL++    L +V F              IL+FF    FI L
Sbjct: 25  IELGLILVVFLIKETVHLALVAFFSGSVVSYYELLEGILIFFLYFEFIAL 74


>gnl|CDD|216329 pfam01148, CTP_transf_1, Cytidylyltransferase family.  The members
           of this family are integral membrane protein
           cytidylyltransferases. The family includes phosphatidate
           cytidylyltransferase EC:2.7.7.41 as well as Sec59 from
           yeast. Sec59 is a dolichol kinase EC:2.7.1.108.
          Length = 259

 Score = 30.8 bits (70), Expect = 0.80
 Identities = 14/46 (30%), Positives = 23/46 (50%), Gaps = 10/46 (21%)

Query: 128 IRVVIKSHSWVIRVNFLVFIFLMRRVFFLFIVIFLLLILLFFFIGL 173
            R++  +H  +I    L+F+ L     FL  +IFLLL+     + L
Sbjct: 2   QRII--THIVLI----LIFLIL----LFLGPLIFLLLVAAIIILAL 37


>gnl|CDD|219503 pfam07666, MpPF26, M penetrans paralogue family 26.  These proteins
           include those ascribed to M penetrans paralogue family
           26 in.
          Length = 133

 Score = 29.7 bits (67), Expect = 0.83
 Identities = 12/55 (21%), Positives = 24/55 (43%), Gaps = 4/55 (7%)

Query: 136 SWVIRVNFLVFIFLMRRVFFLFIVIFLLLILLFFFI-GLEQRRKNDNKSQMGSGL 189
           + +I V  ++   +   + F +I  F+L I+L   I  L+ +     K  +   L
Sbjct: 57  NLLIGVLLILLDDI---IVFFYIANFILEIILTIKISLLKGKHPEFKKVSLHKIL 108


>gnl|CDD|224596 COG1682, TagG, ABC-type polysaccharide/polyol phosphate export
           systems, permease component [Carbohydrate transport and
           metabolism / Cell envelope biogenesis, outer membrane].
          Length = 263

 Score = 30.6 bits (70), Expect = 0.85
 Identities = 10/34 (29%), Positives = 14/34 (41%), Gaps = 3/34 (8%)

Query: 143 FLVFIFLMRRV---FFLFIVIFLLLILLFFFIGL 173
            +  I L       + L +   LLLIL    +GL
Sbjct: 130 LIFLIILGVEPSWHWLLLLPALLLLILFSVGLGL 163



 Score = 28.7 bits (65), Expect = 3.3
 Identities = 7/36 (19%), Positives = 18/36 (50%), Gaps = 3/36 (8%)

Query: 140 RVNFLVFIFLMRRV---FFLFIVIFLLLILLFFFIG 172
           ++NF   I  + R     F F++  +++++    +G
Sbjct: 102 KINFPPLILPVARTLSRLFNFLIHLIIILIFLIILG 137



 Score = 28.7 bits (65), Expect = 4.1
 Identities = 12/32 (37%), Positives = 18/32 (56%), Gaps = 2/32 (6%)

Query: 141 VNFLVFIFLMRRVFFLFIVIFLLL-ILLFFFI 171
           V  LVF  L+R    L  + +LL  ++L+FF 
Sbjct: 52  VYTLVFGALLRS-PGLNFLAYLLAGLILWFFF 82


>gnl|CDD|233283 TIGR01131, ATP_synt_6_or_A, ATP synthase subunit 6
           (eukaryotes),also subunit A (prokaryotes).  Bacterial
           forms should be designated ATP synthase, F0 subunit A;
           eukaryotic (chloroplast and mitochondrial) forms should
           be designated ATP synthase, F0 subunit 6. The F1/F0 ATP
           synthase is a multisubunit, membrane associated enzyme
           found in bacteria and mitochondria and chloroplast. This
           enzyme is principally involved in the synthesis of ATP
           from ADP and inorganic phosphate by coupling the energy
           derived from the proton electrochemical gradient across
           the biological membrane. A brief description of this
           multisubunit enzyme complex: F1 and F0 represent two
           major clusters of subunits. Individual subunits in each
           of these clusters are named differently in prokaryotes
           and in organelles e.g., mitochondria and chloroplast.
           The bacterial equivalent of subunit 6 is named subunit
           'A'. It has been shown that proton is conducted though
           this subunit. Typically, deprotonation and reprotonation
           of the acidic amino acid side-chains are implicated in
           the process [Energy metabolism, ATP-proton motive force
           interconversion].
          Length = 226

 Score = 30.6 bits (70), Expect = 0.88
 Identities = 15/68 (22%), Positives = 28/68 (41%), Gaps = 10/68 (14%)

Query: 103 QKWMWHKRDHSSSSSSLVLSIVHTDIRVVIKSHSWVIRVNFLVFIFLMRRVFFLFIVIFL 162
            +W+        S    ++  ++  +  ++KS     +  F   IF      FLFI+I  
Sbjct: 36  SRWLI------PSRWQNLMESIYEFVLSIVKSQIGGKKGKFFPLIF----TLFLFILISN 85

Query: 163 LLILLFFF 170
           LL L+ + 
Sbjct: 86  LLGLIPYS 93


>gnl|CDD|216594 pfam01594, UPF0118, Domain of unknown function DUF20.  This
           transmembrane region is found in putative permeases and
           predicted transmembrane proteins it has no known
           function. It is not clear what source suggested that
           these proteins may be permeases and this information
           should be treated with caution.
          Length = 327

 Score = 30.7 bits (70), Expect = 0.89
 Identities = 9/33 (27%), Positives = 14/33 (42%), Gaps = 1/33 (3%)

Query: 141 VNFLVFIFLMRRVFFLFIVIFLLLILLFFFIGL 173
           V FL     + R   + +V+ L L+ L     L
Sbjct: 39  VRFLK-RRGIPRSLAILLVLLLFLVALVLLGLL 70


>gnl|CDD|234919 PRK01209, cobD, cobalamin biosynthesis protein; Provisional.
          Length = 312

 Score = 30.5 bits (70), Expect = 0.96
 Identities = 8/22 (36%), Positives = 14/22 (63%)

Query: 147 IFLMRRVFFLFIVIFLLLILLF 168
           + L+RR   L++ +F  L+LL 
Sbjct: 291 LKLVRRAVLLWLALFAALLLLL 312


>gnl|CDD|216273 pfam01061, ABC2_membrane, ABC-2 type transporter. 
          Length = 210

 Score = 30.3 bits (69), Expect = 1.0
 Identities = 12/35 (34%), Positives = 19/35 (54%), Gaps = 4/35 (11%)

Query: 143 FLVFIFLM----RRVFFLFIVIFLLLILLFFFIGL 173
           FL+ ++ M       FFLF+++ LL  L    +GL
Sbjct: 115 FLLIVYFMVGLPVSRFFLFLLVLLLTALAASGLGL 149


>gnl|CDD|234471 TIGR04104, cxxc_20_cxxc, cxxc_20_cxxc protein.  This small,
           uncommon, poorly conserved protein is found primarily in
           the Firmicutes. It features are pair of CxxC motifs
           separated by about 20 amino acids, followed by a highly
           hydrophobic region of about 45 amino acids. It has no
           conserved gene neighborhood, and its function is
           unknown.
          Length = 94

 Score = 28.9 bits (65), Expect = 1.0
 Identities = 9/30 (30%), Positives = 16/30 (53%)

Query: 141 VNFLVFIFLMRRVFFLFIVIFLLLILLFFF 170
           + FL  IF +  +  + I I LLL+++   
Sbjct: 56  ILFLSNIFGLPFILIILIYIILLLLIVLLT 85


>gnl|CDD|205807 pfam13630, SdpI, SdpI/YhfL protein family.  This family of proteins
           includes the SdpI and YhfL proteins from B. subtilis.
           The SdpI protein is a multipass integral membrane
           protein that protects toxin-producing cells from being
           killed. Killing is mediated by the exported toxic
           protein SdpC an extracellular protein that induces the
           synthesis of an immunity protein.
          Length = 72

 Score = 28.3 bits (64), Expect = 1.0
 Identities = 5/29 (17%), Positives = 12/29 (41%)

Query: 143 FLVFIFLMRRVFFLFIVIFLLLILLFFFI 171
            L  + L      L ++I + +I+   + 
Sbjct: 43  LLGLLILDILFLILLLLILVAVIIPIIYS 71


>gnl|CDD|200378 TIGR04127, flavo_near_exo, exosortase F-associated protein.
           Members of this protein family are always found next to
           an exosortase/archaeosortase-like protein, and occur so
           far only in the flavobacteria, within the Bacteroidetes.
           Members do not have an obvious PEP-CTERM-like C-terminal
           protein sorting domain.
          Length = 136

 Score = 29.5 bits (67), Expect = 1.0
 Identities = 11/51 (21%), Positives = 28/51 (54%), Gaps = 10/51 (19%)

Query: 131 VIKSHSWVIRVNFLVFIFLMRRVF----------FLFIVIFLLLILLFFFI 171
           +  + S+   +N L+ + ++  +F           L++V+F++LI+LF+ +
Sbjct: 48  LFANLSFRYLLNTLLSLLIIYILFKDKGIIKFSAILYLVVFVILIILFYIL 98


>gnl|CDD|220691 pfam10323, 7TM_GPCR_Srv, Serpentine type 7TM GPCR chemoreceptor
           Srv.  Chemoreception is mediated in Caenorhabditis
           elegans by members of the seven-transmembrane
           G-protein-coupled receptor class (7TM GPCRs) of proteins
           which are of the serpentine type. Srv is a member of the
           Srg superfamily of chemoreceptors. Chemoperception is
           one of the central senses of soil nematodes like C.
           elegans which are otherwise 'blind' and 'deaf'.
          Length = 283

 Score = 30.3 bits (69), Expect = 1.1
 Identities = 17/62 (27%), Positives = 25/62 (40%), Gaps = 12/62 (19%)

Query: 138 VIRVNFL-VFIFLMRRVFFLFIVIFLLLILLFFFIGLEQRRKNDNKSQMGSGLEQSRRSD 196
           VI+   L   IF    VF   I   +    LFF I    R+K +  SQ      +S + +
Sbjct: 160 VIQRFTLIALIF----VFVTCIYCIIAYGALFFVI----RKKFNKNSQ---TKSKSLKRE 208

Query: 197 NK 198
            +
Sbjct: 209 VR 210



 Score = 29.9 bits (68), Expect = 1.7
 Identities = 10/33 (30%), Positives = 17/33 (51%), Gaps = 5/33 (15%)

Query: 139 IRVNFLVFIFLMRRVFFLFIVIFLLLILLFFFI 171
           +R+ F VFI     +   F++I +  IL  +F 
Sbjct: 209 VRLAFQVFI-----LVLAFLIILIFYILSNYFS 236


>gnl|CDD|220690 pfam10321, 7TM_GPCR_Srt, Serpentine type 7TM GPCR chemoreceptor
           Srt.  Chemoreception is mediated in Caenorhabditis
           elegans by members of the seven-transmembrane
           G-protein-coupled receptor class (7TM GPCRs) of proteins
           which are of the serpentine type. Srt is a member of the
           Srg superfamily of chemoreceptors. Chemoperception is
           one of the central senses of soil nematodes like C.
           elegans which are otherwise 'blind' and 'deaf'.
          Length = 313

 Score = 30.6 bits (70), Expect = 1.1
 Identities = 16/67 (23%), Positives = 28/67 (41%), Gaps = 17/67 (25%)

Query: 105 WMWHKRDHSSSSSSLVLSIVHTDIRVV-IKSHSWVIRVNFLVFIFLMRRVFFLFIVIFLL 163
           WM       S ++S++L+I     R + I           L FIF  ++ + + I+  + 
Sbjct: 117 WM------GSCATSILLAI----NRCLDINPPFP------LAFIFEGKKTYIVLILPIIY 160

Query: 164 LILLFFF 170
            I   FF
Sbjct: 161 GIYFLFF 167


>gnl|CDD|223833 COG0762, COG0762, Predicted integral membrane protein [Function
           unknown].
          Length = 96

 Score = 28.7 bits (65), Expect = 1.1
 Identities = 5/31 (16%), Positives = 15/31 (48%), Gaps = 1/31 (3%)

Query: 141 VNFLVFIFLMRRVFFLFIVIFLLLILLFFFI 171
              L+ +  +  V  + I +++ L+L+   +
Sbjct: 2   FLMLILLLSLL-VLLILIGVYIFLLLIRALL 31


>gnl|CDD|217009 pfam02386, TrkH, Cation transport protein.  This family consists of
           various cation transport proteins (Trk) and V-type
           sodium ATP synthase subunit J or translocating ATPase J
           EC:3.6.1.34. These proteins are involved in active
           sodium up-take utilising ATP in the process. TrkH a
           member of the family from E. coli is a hydrophobic
           membrane protein and determines the specificity and
           kinetics of cation transport by the TrK system in E.
           coli.
          Length = 334

 Score = 30.3 bits (69), Expect = 1.1
 Identities = 9/46 (19%), Positives = 20/46 (43%), Gaps = 2/46 (4%)

Query: 130 VVIKSHSWV--IRVNFLVFIFLMRRVFFLFIVIFLLLILLFFFIGL 173
           V ++   W+   +    +F+    R  FL ++   + ILL   + +
Sbjct: 103 VHLRLLIWIRKKKKELSLFLKDHPRRTFLLLLPGFITILLLEVLVV 148


>gnl|CDD|164750 MTH00204, ND4, NADH dehydrogenase subunit 4; Provisional.
          Length = 485

 Score = 30.8 bits (70), Expect = 1.1
 Identities = 19/58 (32%), Positives = 31/58 (53%), Gaps = 6/58 (10%)

Query: 119 LVLSIVHTDIRVVIKSHSWVIRVNFLVFIFLMRRVF--FLFIVIFLLLILLFFFIGLE 174
           ++L+ +   I ++I   SW   + FL   FL+  +F   L I +F +L LL F+I  E
Sbjct: 89  ILLTTLLIPICILI---SWK-SIKFLFKEFLLCLLFLEVLLIGVFSVLDLLLFYILFE 142


>gnl|CDD|219354 pfam07264, EI24, Etoposide-induced protein 2.4 (EI24).  This family
           contains a number of eukaryotic etoposide-induced 2.4
           (EI24) proteins approximately 350 residues long as well
           as bacterial CysZ proteins (formerly known as DUF540).
           In cells treated with the cytotoxic drug etoposide, EI24
           is induced by p53. It has been suggested to play an
           important role in negative cell growth control.
          Length = 205

 Score = 30.2 bits (69), Expect = 1.1
 Identities = 8/35 (22%), Positives = 22/35 (62%), Gaps = 2/35 (5%)

Query: 136 SWVIRVNFLVFIFLMRRVFFLFIVIFLLLILLFFF 170
           SW++ +  ++ + ++  ++FLF+V+  L+   F+ 
Sbjct: 56  SWLLWILAVLLLLVL--LWFLFVVVASLIAAPFWT 88


>gnl|CDD|221690 pfam12650, DUF3784, Domain of unknown function (DUF3784).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in bacteria and archaea.
           Proteins in this family are typically between 96 and 110
           amino acids in length.
          Length = 96

 Score = 28.7 bits (65), Expect = 1.2
 Identities = 10/40 (25%), Positives = 23/40 (57%), Gaps = 4/40 (10%)

Query: 141 VNFLVFIFLMRRVFFLFIVIFLLLILLFFFIGL--EQRRK 178
           +  L+ +FL +  +F+  +I  +++L+F  I L    R++
Sbjct: 59  IGALLSLFLSK--YFIIAIIAFIVLLIFLVIYLAKGNRKR 96


>gnl|CDD|216554 pfam01529, zf-DHHC, DHHC palmitoyltransferase.  This family
           includes the well known DHHC zinc binding domain as well
           as three of the four conserved transmembrane regions
           found in this family of palmitoyltransferase enzymes.
          Length = 167

 Score = 29.8 bits (67), Expect = 1.2
 Identities = 9/36 (25%), Positives = 20/36 (55%), Gaps = 3/36 (8%)

Query: 136 SWVIRVNFLVFIFLMRRVFFLFIVIFLLLILLFFFI 171
           + + R N   F+     + +L + + LLL+L F+++
Sbjct: 77  NCIGRRNHKYFLLF---LLYLTLYLILLLVLSFYYL 109



 Score = 28.6 bits (64), Expect = 2.6
 Identities = 7/57 (12%), Positives = 16/57 (28%)

Query: 117 SSLVLSIVHTDIRVVIKSHSWVIRVNFLVFIFLMRRVFFLFIVIFLLLILLFFFIGL 173
              +  I+   +      +       F   I  +     L ++    L+ L F +  
Sbjct: 93  YLTLYLILLLVLSFYYLVYLIRNIELFFFLILSLFSSIILLVLSLFFLLFLSFLLFF 149


>gnl|CDD|222193 pfam13520, AA_permease_2, Amino acid permease. 
          Length = 425

 Score = 30.4 bits (69), Expect = 1.2
 Identities = 10/54 (18%), Positives = 22/54 (40%), Gaps = 3/54 (5%)

Query: 117 SSLVLSIVHTDIRVVIKSHSWVIRVNFLVFIFLMRRVFFLFIVIFLLLILLFFF 170
           +  +L I      + I+      ++  ++ I  +     L I++ L+L L   F
Sbjct: 122 AIAILIIFAL---INIRGIKESAKIQNILGIVKLLLPLILIILLGLVLALGGGF 172


>gnl|CDD|217187 pfam02687, FtsX, FtsX-like permease family.  This is a family of
           predicted permeases and hypothetical transmembrane
           proteins. Buchnera aphidicola lolC has been shown to
           transport lipids targeted to the outer membrane across
           the inner membrane. Both lolC and Streptococcus
           cristatus tptD have been shown to require ATP. This
           region contains three transmembrane helices.
          Length = 121

 Score = 29.1 bits (66), Expect = 1.2
 Identities = 7/26 (26%), Positives = 16/26 (61%)

Query: 153 VFFLFIVIFLLLILLFFFIGLEQRRK 178
           +  L +++ LL++L    I + +RR+
Sbjct: 3   LSVLILLLALLILLNLLSISISERRR 28


>gnl|CDD|177156 MTH00093, ND4L, NADH dehydrogenase subunit 4L; Provisional.
          Length = 77

 Score = 28.3 bits (64), Expect = 1.2
 Identities = 9/34 (26%), Positives = 19/34 (55%), Gaps = 1/34 (2%)

Query: 141 VNFLVFIFLMRR-VFFLFIVIFLLLILLFFFIGL 173
           ++ L+  F  +R +F L  + FL++ L  +F+  
Sbjct: 6   ISLLMLFFKYKRLIFILISLEFLMMSLFIYFMYF 39



 Score = 26.4 bits (59), Expect = 5.6
 Identities = 10/26 (38%), Positives = 14/26 (53%), Gaps = 4/26 (15%)

Query: 153 VFFLFIVIFLLLI----LLFFFIGLE 174
           + FLFI + +L      L+F  I LE
Sbjct: 1   MIFLFISLLMLFFKYKRLIFILISLE 26


>gnl|CDD|237140 PRK12567, PRK12567, putative monovalent cation/H+ antiporter
           subunit B; Reviewed.
          Length = 218

 Score = 29.9 bits (68), Expect = 1.2
 Identities = 8/26 (30%), Positives = 14/26 (53%)

Query: 150 MRRVFFLFIVIFLLLILLFFFIGLEQ 175
           M+ V+ L I++  + +LL     LE 
Sbjct: 1   MKWVYLLAILLLFIKVLLSPSFALEP 26


>gnl|CDD|177124 MTH00049, COX3, cytochrome c oxidase subunit III; Validated.
          Length = 215

 Score = 29.9 bits (68), Expect = 1.3
 Identities = 9/38 (23%), Positives = 16/38 (42%), Gaps = 1/38 (2%)

Query: 136 SWVIRVNFLVFIFLMRRVFFLFIVIFLLLILLFFFIGL 173
           SW    N   F+       FL+    LL+ L+ + + +
Sbjct: 2   SWFPLFN-ASFVGFGLVGLFLWKPFILLVFLILWVLLI 38


>gnl|CDD|214443 MTH00163, ND4, NADH dehydrogenase subunit 4; Provisional.
          Length = 445

 Score = 30.5 bits (70), Expect = 1.3
 Identities = 9/17 (52%), Positives = 13/17 (76%)

Query: 154 FFLFIVIFLLLILLFFF 170
            FLF+++FLL+ LL  F
Sbjct: 88  LFLFLILFLLIFLLLTF 104


>gnl|CDD|223548 COG0472, Rfe, UDP-N-acetylmuramyl pentapeptide
           phosphotransferase/UDP-N- acetylglucosamine-1-phosphate
           transferase [Cell envelope biogenesis, outer membrane].
          Length = 319

 Score = 30.3 bits (69), Expect = 1.3
 Identities = 8/33 (24%), Positives = 14/33 (42%), Gaps = 1/33 (3%)

Query: 143 FLVFIFLMRRVF-FLFIVIFLLLILLFFFIGLE 174
            L+   L   +    ++ + LL +L F  IG  
Sbjct: 59  ILLASLLAANLLTNPYVWLVLLGLLGFGLIGFL 91


>gnl|CDD|181988 PRK09609, PRK09609, hypothetical protein; Provisional.
          Length = 312

 Score = 30.0 bits (68), Expect = 1.4
 Identities = 15/63 (23%), Positives = 27/63 (42%), Gaps = 4/63 (6%)

Query: 113 SSSSSSLVLSIVHTDIRVVIKSHSWVIRVNFLVFI----FLMRRVFFLFIVIFLLLILLF 168
           S  ++ ++L I+   I  V+        +    FI     LM  +   FI++ + +I   
Sbjct: 170 SWIAALIILVIIILFIYFVVGFLDPDFFIQSFSFIKNKRSLMILIISGFILMIIFVIWAR 229

Query: 169 FFI 171
           FFI
Sbjct: 230 FFI 232


>gnl|CDD|218853 pfam06011, TRP, Transient receptor potential (TRP) ion channel.
           This family of proteins are transient receptor potential
           (TRP) ion channels. They are essential for cellular
           viability and are involved in cell growth and cell wall
           synthesis. The genes for these proteins are homologous
           to polycystic kidney disease related ion channel genes.
          Length = 531

 Score = 30.3 bits (69), Expect = 1.4
 Identities = 6/22 (27%), Positives = 15/22 (68%)

Query: 153 VFFLFIVIFLLLILLFFFIGLE 174
           +FFL ++  ++++++ F   LE
Sbjct: 254 IFFLIVIALIVVLVIQFKFILE 275


>gnl|CDD|214404 MTH00025, ATP8, ATP synthase F0 subunit 8; Validated.
          Length = 70

 Score = 28.1 bits (63), Expect = 1.4
 Identities = 16/42 (38%), Positives = 21/42 (50%), Gaps = 3/42 (7%)

Query: 157 FIVIFLLLILLFFFIGLEQRRKN--DNKSQMGSGLEQSRRSD 196
            IV+FLLL  L FF+ L   ++N    KS M  G   S+   
Sbjct: 17  LIVLFLLLFFLVFFV-LPTIKRNWLIRKSLMKKGAAFSKGLG 57


>gnl|CDD|225694 COG3152, COG3152, Predicted membrane protein [Function unknown].
          Length = 125

 Score = 29.2 bits (66), Expect = 1.4
 Identities = 11/37 (29%), Positives = 18/37 (48%)

Query: 137 WVIRVNFLVFIFLMRRVFFLFIVIFLLLILLFFFIGL 173
           +VI  N ++ + L    F L + I  +L+LL     L
Sbjct: 25  YVILFNLIIGVVLFVLAFILGLSIAGVLVLLTALYLL 61


>gnl|CDD|177155 MTH00092, ND3, NADH dehydrogenase subunit 3; Provisional.
          Length = 111

 Score = 28.7 bits (65), Expect = 1.5
 Identities = 10/28 (35%), Positives = 15/28 (53%), Gaps = 3/28 (10%)

Query: 143 FLVFIFLMRRVFFLFIVIFLLLILLFFF 170
            LV IF+     F  +++ LL +L FF 
Sbjct: 1   MLVLIFV---FLFTLLLLMLLYLLNFFL 25



 Score = 27.2 bits (61), Expect = 6.1
 Identities = 13/47 (27%), Positives = 22/47 (46%), Gaps = 14/47 (29%)

Query: 139 IRVNFLVFIFLMRRVFFLF---IVIFLLLI-----------LLFFFI 171
           I+ +F +  F +  +F +F   IV+FL L+           +L  FI
Sbjct: 48  IQNSFSIHFFKIMLMFVIFDLEIVMFLGLLVSDFSSLISFFMLMIFI 94


>gnl|CDD|146748 pfam04276, DUF443, Protein of unknown function (DUF443).  Family of
           uncharacterized proteins.
          Length = 197

 Score = 29.5 bits (67), Expect = 1.5
 Identities = 6/30 (20%), Positives = 14/30 (46%), Gaps = 4/30 (13%)

Query: 153 VFFLFIVIFLLLILLFFFIGLEQRRKNDNK 182
              + I++ L +++L  +I     +K   K
Sbjct: 91  NIIICIIVILGVLILRLYI----NKKLKLK 116


>gnl|CDD|222059 pfam13346, ABC2_membrane_5, ABC-2 family transporter protein.  This
           family is related to the ABC-2 membrane transporter
           family pfam01061.
          Length = 206

 Score = 29.5 bits (67), Expect = 1.6
 Identities = 7/26 (26%), Positives = 17/26 (65%)

Query: 147 IFLMRRVFFLFIVIFLLLILLFFFIG 172
            +L+++   + ++IFLLL+++ F   
Sbjct: 7   FYLIKKYLKILLIIFLLLLIVIFIFL 32



 Score = 28.8 bits (65), Expect = 3.0
 Identities = 12/57 (21%), Positives = 24/57 (42%), Gaps = 3/57 (5%)

Query: 128 IRVVIKSHSWVIRVNFLVFIFLMRRVFFLFIVIFLLLILLFFFIGLEQRRKNDNKSQ 184
           I+  +K    +  +  +V    +     +F +I L+LIL+   +        D KS+
Sbjct: 10  IKKYLKILLIIFLLLLIVIFIFLNEDGSIFFIILLILILIMLVLTT---FSYDEKSK 63



 Score = 28.4 bits (64), Expect = 3.9
 Identities = 9/26 (34%), Positives = 16/26 (61%)

Query: 146 FIFLMRRVFFLFIVIFLLLILLFFFI 171
           F  + + +  L I+  LLLI++F F+
Sbjct: 7   FYLIKKYLKILLIIFLLLLIVIFIFL 32


>gnl|CDD|224136 COG1215, COG1215, Glycosyltransferases, probably involved in cell
           wall biogenesis [Cell envelope biogenesis, outer
           membrane].
          Length = 439

 Score = 30.3 bits (68), Expect = 1.6
 Identities = 8/43 (18%), Positives = 14/43 (32%)

Query: 129 RVVIKSHSWVIRVNFLVFIFLMRRVFFLFIVIFLLLILLFFFI 171
           R  ++       +  L       R   L   + L L+L   F+
Sbjct: 282 RGGLQVLLLHPYLPALFRPLSGYRFILLLFSLLLPLLLTLLFL 324


>gnl|CDD|220693 pfam10325, 7TM_GPCR_Srz, Serpentine type 7TM GPCR chemoreceptor
           Srz.  Chemoreception is mediated in Caenorhabditis
           elegans by members of the seven-transmembrane
           G-protein-coupled receptor class (7TM GPCRs) of proteins
           which are of the serpentine type. Srz is a solo families
           amongst the superfamilies of chemoreceptors.
           Chemoperception is one of the central senses of soil
           nematodes like C. elegans which are otherwise 'blind'
           and 'deaf'. The genes encoding Srz appear to be under
           strong adaptive evolutionary pressure.
          Length = 266

 Score = 29.9 bits (68), Expect = 1.6
 Identities = 11/43 (25%), Positives = 23/43 (53%), Gaps = 8/43 (18%)

Query: 137 WVIRVNFLVFIFLMRRVFFLF--------IVIFLLLILLFFFI 171
            ++++ + +FIFL+  +  +F          +FLLL+ L  +I
Sbjct: 45  KMVKITYFLFIFLIIFLILIFFFTKNSIIGFLFLLLLFLILYI 87



 Score = 28.7 bits (65), Expect = 4.1
 Identities = 6/35 (17%), Positives = 16/35 (45%)

Query: 141 VNFLVFIFLMRRVFFLFIVIFLLLILLFFFIGLEQ 175
           +  ++  F  +     F+ + LL ++L+    + Q
Sbjct: 59  IFLILIFFFTKNSIIGFLFLLLLFLILYILYIITQ 93


>gnl|CDD|220496 pfam09972, DUF2207, Predicted membrane protein (DUF2207).  This
           domain, found in various hypothetical bacterial
           proteins, has no known function.
          Length = 503

 Score = 30.0 bits (68), Expect = 1.7
 Identities = 11/43 (25%), Positives = 22/43 (51%), Gaps = 2/43 (4%)

Query: 137 WVIRVNFLVFIFLMRRVFFLFIVIFLLLILLFFFIGLEQRRKN 179
           +    N    + L+  +  L +V+ +LLIL+ + I L+  R+ 
Sbjct: 216 YAREANIRKNLLLI--LLPLLLVLLILLILIPYLIWLKYGREP 256


>gnl|CDD|116129 pfam07509, DUF1523, Protein of unknown function (DUF1523). 
          Length = 174

 Score = 29.2 bits (66), Expect = 1.7
 Identities = 15/62 (24%), Positives = 27/62 (43%), Gaps = 13/62 (20%)

Query: 121 LSIVHTDIRVVIKSHSWVIRVNFL-VF---IFL-------MRRVFFLFIVIFLLLILLFF 169
           LS       V I  + W  R+N   +F   + +       +R + +  I+ ++LL+LL F
Sbjct: 107 LSTADDPQWVEITYYGW--RINMFSIFPNAVSIKPVTEPDVRSIPWFNIIFYVLLLLLLF 164

Query: 170 FI 171
             
Sbjct: 165 LS 166


>gnl|CDD|222635 pfam14264, Glucos_trans_II, Glucosyl transferase GtrII.  This
           family includes glucosyl transferase II from the
           Shigella phage SfII, which mediates seroconversion of S.
           flexneri when the phage is integrated into the host
           chromosome.
          Length = 317

 Score = 29.9 bits (68), Expect = 1.7
 Identities = 18/77 (23%), Positives = 32/77 (41%), Gaps = 9/77 (11%)

Query: 109 KRDHSSSSSSLVLSIVH--TDIRVVIKSHSWVIRVNFLVFIFLMRRVFFLFIVIFLLLIL 166
           +       SSL+LS+      I   IK        +FL         ++ +++I LLL+ 
Sbjct: 201 ELSDYQGISSLILSLKDLLKSILNNIKEF-----FSFLKSQS--GYPYYTYLLIVLLLLF 253

Query: 167 LFFFIGLEQRRKNDNKS 183
           L   + + ++RK   K 
Sbjct: 254 LLLILLVVRKRKKRKKP 270


>gnl|CDD|216595 pfam01595, DUF21, Domain of unknown function DUF21.  This
           transmembrane region has no known function. Many of the
           sequences in this family are annotated as hemolysins,
           however this is due to a similarity to Treponema
           hyodysenteriae hemolysin C that does not contain this
           domain. This domain is found in the N-terminus of the
           proteins adjacent to two intracellular CBS domains
           pfam00571.
          Length = 182

 Score = 29.5 bits (67), Expect = 1.8
 Identities = 9/20 (45%), Positives = 12/20 (60%)

Query: 155 FLFIVIFLLLILLFFFIGLE 174
            L ++I LLL+L  FF   E
Sbjct: 1   LLLLIIVLLLLLSAFFSASE 20


>gnl|CDD|215968 pfam00520, Ion_trans, Ion transport protein.  This family contains
           Sodium, Potassium, Calcium ion channels. This family is
           6 transmembrane helices in which the last two helices
           flank a loop which determines ion selectivity. In some
           sub-families (e.g. Na channels) the domain is repeated
           four times, whereas in others (e.g. K channels) the
           protein forms as a tetramer in the membrane. A bacterial
           structure of the protein is known for the last two
           helices but is not the Pfam family due to it lacking the
           first four helices.
          Length = 194

 Score = 29.6 bits (67), Expect = 1.8
 Identities = 10/71 (14%), Positives = 26/71 (36%), Gaps = 19/71 (26%)

Query: 120 VLSIVHTDIRVVIKSHSWVIRVNFLVFIFLMRRVFFLF-------------------IVI 160
            L ++ + + +++        +  ++ +  + R+  L                    ++ 
Sbjct: 37  FLVVLPSLVSLILFLLGEDSGLLRVLRLLRLLRLLRLLRRFPGLRTLLQSLGRSLKSLLN 96

Query: 161 FLLLILLFFFI 171
            LLL+LL  FI
Sbjct: 97  LLLLLLLLLFI 107


>gnl|CDD|204446 pfam10322, 7TM_GPCR_Sru, Serpentine type 7TM GPCR chemoreceptor
           Sru.  Chemoreception is mediated in Caenorhabditis
           elegans by members of the seven-transmembrane
           G-protein-coupled receptor class (7TM GPCRs) of proteins
           which are of the serpentine type. Sru is a member of the
           Srg superfamily of chemoreceptors. Chemoperception is
           one of the central senses of soil nematodes like C.
           elegans which are otherwise 'blind' and 'deaf'.
          Length = 307

 Score = 29.5 bits (67), Expect = 1.9
 Identities = 14/49 (28%), Positives = 24/49 (48%), Gaps = 5/49 (10%)

Query: 147 IFLMRRVFFLFIVIFLLLILLFFFIGLEQRRKNDNKSQMGSGLEQSRRS 195
           IF +    F  I+  +L ILLF  +   ++ K    S   S  ++SR++
Sbjct: 198 IFHLLNSVFWLILCLILNILLFVKL---RKAKLKLSSNRTS--KKSRKA 241


>gnl|CDD|224011 COG1086, COG1086, Predicted nucleoside-diphosphate sugar epimerases
           [Cell envelope biogenesis, outer membrane / Carbohydrate
           transport and metabolism].
          Length = 588

 Score = 29.9 bits (68), Expect = 1.9
 Identities = 19/86 (22%), Positives = 36/86 (41%), Gaps = 14/86 (16%)

Query: 126 TDIRVVIKSHSWVIRVNFLVFIF---LMRRVFFLFIVIFLLLILL--FFFIGLEQRRKND 180
           +D+ +++K+      V+ L F     L R V  ++ ++  +LI      F  L    + D
Sbjct: 55  SDLFIILKAVLTSALVSLLFFSTRTDLPRSVVLVYWLLLFVLIGASRLLFRALRDALQKD 114

Query: 181 NKSQ---MGSG------LEQSRRSDN 197
           N+ +   +G+G      L   RR   
Sbjct: 115 NRIRLLIIGAGSAGDLLLRALRRDPE 140


>gnl|CDD|227100 COG4758, COG4758, Predicted membrane protein [Function unknown].
          Length = 235

 Score = 29.6 bits (67), Expect = 2.0
 Identities = 9/27 (33%), Positives = 16/27 (59%)

Query: 144 LVFIFLMRRVFFLFIVIFLLLILLFFF 170
           L+ I ++    F + ++FLL +LLF  
Sbjct: 17  LMIIQIVIETLFFWPLVFLLALLLFRI 43


>gnl|CDD|225574 COG3030, FxsA, Protein affecting phage T7 exclusion by the F
           plasmid [General function prediction only].
          Length = 158

 Score = 28.8 bits (65), Expect = 2.0
 Identities = 10/24 (41%), Positives = 17/24 (70%)

Query: 149 LMRRVFFLFIVIFLLLILLFFFIG 172
           +MR +F LF++  L+ I LF ++G
Sbjct: 1   MMRLLFLLFLLYPLIEIALFIWVG 24


>gnl|CDD|233174 TIGR00893, 2A0114, D-galactonate transporter.  [Transport and
           binding proteins, Carbohydrates, organic alcohols, and
           acids].
          Length = 399

 Score = 29.6 bits (67), Expect = 2.1
 Identities = 13/69 (18%), Positives = 26/69 (37%), Gaps = 1/69 (1%)

Query: 104 KWMWHKRDHSSSSSSLVLSIVHTDIRVVIKSHSWV-IRVNFLVFIFLMRRVFFLFIVIFL 162
           K++      +   +     IV   +    +       +   +  +   RRV+ L +  FL
Sbjct: 167 KFIPDPPQKAKWLTEEEKYIVVGGLLAEQQGKGPSTPKKYQIKELLKDRRVWGLALGQFL 226

Query: 163 LLILLFFFI 171
           + I L FF+
Sbjct: 227 VNIGLGFFL 235


>gnl|CDD|225844 COG3307, RfaL, Lipid A core - O-antigen ligase and related enzymes
           [Cell envelope biogenesis, outer membrane].
          Length = 424

 Score = 29.8 bits (67), Expect = 2.1
 Identities = 7/34 (20%), Positives = 14/34 (41%)

Query: 143 FLVFIFLMRRVFFLFIVIFLLLILLFFFIGLEQR 176
            L   FL+ ++  L+I   + L +   F+     
Sbjct: 7   ILALSFLIPKLGALWIAALVFLFISLAFLAGLGS 40


>gnl|CDD|216251 pfam01027, Bax1-I, Inhibitor of apoptosis-promoting Bax1.
           Programmed cell-death involves a set of Bcl-2 family
           proteins, some of which inhibit apoptosis (Bcl-2 and
           Bcl-XL) and some of which promote it (Bax and Bak).
           Human Bax inhibitor, BI-1, is an evolutionarily
           conserved integral membrane protein containing multiple
           membrane-spanning segments predominantly localised to
           intracellular membranes. It has 6-7 membrane-spanning
           domains. The C termini of the mammalian BI-1 proteins
           are comprised of basic amino acids resembling some
           nuclear targeting sequences, but otherwise the predicted
           proteins lack motifs that suggest a function. As plant
           BI-1 appears to localise predominantly to the ER, we
           hypothesized that plant BI-1 could also regulate cell
           death triggered by ER stress. BI-1 appears to exert its
           effect through an interaction with calmodulin.
          Length = 205

 Score = 29.1 bits (66), Expect = 2.2
 Identities = 5/42 (11%), Positives = 17/42 (40%)

Query: 136 SWVIRVNFLVFIFLMRRVFFLFIVIFLLLILLFFFIGLEQRR 177
           ++++     +   L    +   ++   LLI +F      +++
Sbjct: 22  AYLVLSTPALLQLLFPLFWVGLLLPLALLIFVFLLFSRIKKK 63


>gnl|CDD|223777 COG0705, COG0705, Membrane associated serine protease [Amino acid
           transport and metabolism].
          Length = 228

 Score = 29.4 bits (66), Expect = 2.4
 Identities = 12/47 (25%), Positives = 22/47 (46%), Gaps = 3/47 (6%)

Query: 130 VVIKSHSW--VIRVNFLVFIFLMRRVFFLFI-VIFLLLILLFFFIGL 173
            +  S +   ++   FL+F F    + FL +    L+LIL++    L
Sbjct: 133 SLGASGAIFGLLGAYFLLFPFARILLLFLSLPRPALILILIWLLYSL 179


>gnl|CDD|219516 pfam07690, MFS_1, Major Facilitator Superfamily. 
          Length = 346

 Score = 29.3 bits (66), Expect = 2.4
 Identities = 7/41 (17%), Positives = 15/41 (36%)

Query: 143 FLVFIFLMRRVFFLFIVIFLLLILLFFFIGLEQRRKNDNKS 183
            L  +F  R  F +  ++ LL  +L   +      ++    
Sbjct: 143 LLASLFGWRAAFLILAILALLAAVLAALLLPRPPPESKRPK 183


>gnl|CDD|221085 pfam11337, DUF3139, Protein of unknown function (DUF3139).  This
           family of proteins with unknown function appears to be
           restricted to Firmicutes.
          Length = 85

 Score = 27.8 bits (62), Expect = 2.4
 Identities = 12/53 (22%), Positives = 25/53 (47%), Gaps = 4/53 (7%)

Query: 150 MRRVFFLFIVIFLLLILLF----FFIGLEQRRKNDNKSQMGSGLEQSRRSDNK 198
           M++   +FIVI L++I LF     F    Q +++     +   L +    +++
Sbjct: 1   MKKYKVIFIVIILIVISLFVGIVLFFAGNQYQRHKALKAIDWYLAKQGIKEDQ 53


>gnl|CDD|217749 pfam03824, NicO, High-affinity nickel-transport protein.  High
           affinity nickel transporters involved in the
           incorporation of nickel into H2-uptake hydrogenase and
           urease enzymes. Essential for the expression of
           catalytically active hydrogenase and urease. Ion uptake
           is dependent on proton motive force. HoxN in Alcaligenes
           eutrophus is thought to be an integral membrane protein
           with seven transmembrane helices. The family also
           includes a cobalt transporter.
          Length = 278

 Score = 29.3 bits (66), Expect = 2.5
 Identities = 14/79 (17%), Positives = 22/79 (27%), Gaps = 7/79 (8%)

Query: 118 SLVLSIVHTDIRVVIKSHSWVI-----RVNFLVFIFLMRRVFFLFIVIFLLLILLFFFIG 172
           SL  S V   + +++     ++            I  +    FL I+      LL     
Sbjct: 45  SLGHSSVVGLVALLLALGVKLVLRLPSLQEIGSTIGTLVSGSFLLIIGL--GNLLLLREL 102

Query: 173 LEQRRKNDNKSQMGSGLEQ 191
               RK          LE 
Sbjct: 103 RHLFRKLRRGRAPEPDLEA 121


>gnl|CDD|226960 COG4597, BatB, ABC-type amino acid transport system, permease
           component [Amino acid transport and metabolism].
          Length = 397

 Score = 29.4 bits (66), Expect = 2.5
 Identities = 13/55 (23%), Positives = 24/55 (43%), Gaps = 12/55 (21%)

Query: 134 SHSWVIRVNFLVFIFLMR------RVFFLFIVIFLLL------ILLFFFIGLEQR 176
           S +W+IR    V++ + R      ++FF +  +  +L      + L   I L  R
Sbjct: 120 SDNWLIRKLSTVYVEIFRNIPPLLQIFFWYFGVLSVLPGARQSVSLPDLIFLSNR 174


>gnl|CDD|223731 COG0658, ComEC, Predicted membrane metal-binding protein [General
           function prediction only].
          Length = 453

 Score = 29.4 bits (66), Expect = 2.6
 Identities = 16/74 (21%), Positives = 29/74 (39%), Gaps = 5/74 (6%)

Query: 116 SSSLVLSIVHTDIRVVIKSH---SW--VIRVNFLVFIFLMRRVFFLFIVIFLLLILLFFF 170
            S L+L  V   + + I      SW  ++    L+ + L+         + L+L+L FF 
Sbjct: 6   LSRLLLRFVAALLLLGILLALLLSWLPLLLAALLLLLALLLARALPLSKLLLVLLLAFFL 65

Query: 171 IGLEQRRKNDNKSQ 184
           +G+         S 
Sbjct: 66  LGIFAALFLLLLSL 79


>gnl|CDD|227696 COG5409, COG5409, EXS domain-containing protein [Signal
           transduction mechanisms].
          Length = 384

 Score = 29.4 bits (66), Expect = 2.7
 Identities = 16/80 (20%), Positives = 26/80 (32%), Gaps = 28/80 (35%)

Query: 96  WQCNSPKQKWMWHKRDHSSSSSSLVLSIVHTDIRVVIKSHSWVIRVNFLVFIFLMRRVFF 155
           W  +S      W KR                    ++K H  +I +NFL+      R  +
Sbjct: 289 WSLDSLTSLRSWSKRAV-----------------TLLKYHIAMI-INFLL------RFSW 324

Query: 156 LFIVIFLLLIL----LFFFI 171
           +   +    I     +F FI
Sbjct: 325 IVYYLPPNHIQHSADIFIFI 344


>gnl|CDD|223692 COG0619, CbiQ, ABC-type cobalt transport system, permease component
           CbiQ and related transporters [Inorganic ion transport
           and metabolism].
          Length = 252

 Score = 29.3 bits (66), Expect = 2.8
 Identities = 10/29 (34%), Positives = 14/29 (48%)

Query: 143 FLVFIFLMRRVFFLFIVIFLLLILLFFFI 171
            LV +  +   F L ++ FLLL LL    
Sbjct: 50  LLVVLAKIPLKFLLLLLGFLLLGLLLIAF 78


>gnl|CDD|234534 TIGR04286, MSEP-CTERM, MSEP-CTERM protein.  Members of this protein
           family average over 900 residues in length and appear to
           have multiple membrane-spanning helices in the
           N-terminal half. The extreme C-terminal region consists
           of a motif with consensus sequence MSEP, then a
           transmembrane alpha helix, then a short region with
           several basic residues. This region, hereby dubbed
           MSEP-CTERM, resembles other putative sorting signals
           associated with the archaeosortase/exosortase protein
           family (see TIGR04178). Genes for all members of this
           family are found next to a gene for exosortase K.
          Length = 920

 Score = 29.3 bits (66), Expect = 3.0
 Identities = 11/47 (23%), Positives = 20/47 (42%)

Query: 139 IRVNFLVFIFLMRRVFFLFIVIFLLLILLFFFIGLEQRRKNDNKSQM 185
           I  +F+    L+          FL LIL F +I + +R  +    ++
Sbjct: 165 INGSFIGLNHLIMIFAIGATASFLFLILRFLYILITRRNLSFKNPEL 211


>gnl|CDD|165187 PHA02845, PHA02845, hypothetical protein; Provisional.
          Length = 91

 Score = 27.4 bits (61), Expect = 3.0
 Identities = 14/42 (33%), Positives = 20/42 (47%)

Query: 128 IRVVIKSHSWVIRVNFLVFIFLMRRVFFLFIVIFLLLILLFF 169
           IR+ I     +I + FL+   L R  F LFI+    L + F 
Sbjct: 42  IRLTIDLILLIIVMIFLMIRILKRNYFALFIIFLFYLFISFR 83



 Score = 27.0 bits (60), Expect = 4.2
 Identities = 16/63 (25%), Positives = 28/63 (44%), Gaps = 14/63 (22%)

Query: 122 SIVHTDIRVVIKSHSWVIR----------VNFLVFIFLM----RRVFFLFIVIFLLLILL 167
           SI+   I  +  +H ++ R          +  +V IFLM    +R +F   +IFL  + +
Sbjct: 21  SILDCIINEIENNHHYLFRPFIRLTIDLILLIIVMIFLMIRILKRNYFALFIIFLFYLFI 80

Query: 168 FFF 170
            F 
Sbjct: 81  SFR 83


>gnl|CDD|233109 TIGR00739, yajC, preprotein translocase, YajC subunit.  While this
           protein is part of the preprotein translocase in
           Escherichia coli, it is not essential for viability or
           protein secretion. The N-terminus region contains a
           predicted membrane-spanning region followed by a region
           consisting almost entirely of residues with charged
           (acidic, basic, or zwitterionic) side chains. This small
           protein is about 100 residues in length, and is
           restricted to bacteria; however, this protein is absent
           from some lineages, including spirochetes and
           Mycoplasmas [Protein fate, Protein and peptide secretion
           and trafficking].
          Length = 84

 Score = 27.3 bits (61), Expect = 3.1
 Identities = 8/37 (21%), Positives = 20/37 (54%), Gaps = 1/37 (2%)

Query: 154 FFLFIVIFLLLILLFFFIGLE-QRRKNDNKSQMGSGL 189
           F   ++  +L+ L+F+F+ +  QR++     ++   L
Sbjct: 2   FLTTLLPLVLIFLIFYFLIIRPQRKRRKAHKKLIESL 38


>gnl|CDD|221116 pfam11457, DUF3021, Protein of unknown function (DUF3021).  This is
           a bacterial family of uncharacterized proteins.
          Length = 134

 Score = 28.0 bits (63), Expect = 3.1
 Identities = 12/69 (17%), Positives = 27/69 (39%), Gaps = 7/69 (10%)

Query: 117 SSLVLSIVHTDIRVVIKSHSWVIRVNFLVFIFL------MRRVFFLFIVIFLLL-ILLFF 169
           +SL+       +      H  +  + FL    L            LF +IF+L+ ++++ 
Sbjct: 58  ASLIFETERWSLLKQTLIHFILTLLTFLPLAILNGWFPLSGIALLLFTIIFILIYLIIWL 117

Query: 170 FIGLEQRRK 178
            + L  ++ 
Sbjct: 118 ILYLRIKKD 126


>gnl|CDD|224096 COG1175, UgpA, ABC-type sugar transport systems, permease
           components [Carbohydrate transport and metabolism].
          Length = 295

 Score = 29.1 bits (66), Expect = 3.2
 Identities = 9/27 (33%), Positives = 14/27 (51%)

Query: 147 IFLMRRVFFLFIVIFLLLILLFFFIGL 173
               R + +L +   LLL+L+FF   L
Sbjct: 5   RRSRRLLPYLLLAPALLLLLVFFLYPL 31


>gnl|CDD|220687 pfam10316, 7TM_GPCR_Srbc, Serpentine type 7TM GPCR chemoreceptor
           Srbc.  Chemoreception is mediated in Caenorhabditis
           elegans by members of the seven-transmembrane
           G-protein-coupled receptor class (7TM GPCRs) of proteins
           which are of the serpentine type. Srbc is a solo family
           amongst the superfamilies of chemoreceptors.
           Chemoperception is one of the central senses of soil
           nematodes like C. elegans which are otherwise 'blind'
           and 'deaf'.
          Length = 272

 Score = 29.0 bits (66), Expect = 3.3
 Identities = 8/39 (20%), Positives = 17/39 (43%)

Query: 142 NFLVFIFLMRRVFFLFIVIFLLLILLFFFIGLEQRRKND 180
           + +  I  +  + F     FL + LL      ++++K D
Sbjct: 2   SLIAVIVTLIGILFSIFTCFLNIYLLKKIFYSKKKKKPD 40


>gnl|CDD|214789 smart00724, TLC, TRAM, LAG1 and CLN8 homology domains.  Protein
           domain with at least 5 transmembrane alpha-helices.
           Lag1p and Lac1p are essential for acyl-CoA-dependent
           ceramide synthesis, TRAM is a subunit of the translocon
           and the CLN8 gene is mutated in Northern epilepsy
           syndrome. The family may possess multiple functions such
           as lipid trafficking, metabolism, or sensing. Trh
           homologues possess additional homeobox domains.
          Length = 205

 Score = 28.4 bits (64), Expect = 3.5
 Identities = 9/33 (27%), Positives = 19/33 (57%), Gaps = 3/33 (9%)

Query: 133 KSHSWVIRVNFLVFI---FLMRRVFFLFIVIFL 162
           +  S +  VNF++F    F+ R + F F+++ +
Sbjct: 126 RKKSLLYDVNFVLFAVVFFVFRLILFPFLILTV 158


>gnl|CDD|215958 pfam00507, Oxidored_q4, NADH-ubiquinone/plastoquinone
           oxidoreductase, chain 3. 
          Length = 102

 Score = 27.5 bits (62), Expect = 3.5
 Identities = 10/40 (25%), Positives = 15/40 (37%), Gaps = 9/40 (22%)

Query: 143 FLVF----IFLM-----RRVFFLFIVIFLLLILLFFFIGL 173
           FL+F    I L+              + + L LL   +GL
Sbjct: 52  FLIFDVEIILLLPWAVVFNGLGFLSFVEMFLFLLILLLGL 91


>gnl|CDD|214456 MTH00203, ND3, NADH dehydrogenase subunit 3; Provisional.
          Length = 112

 Score = 27.6 bits (62), Expect = 3.7
 Identities = 9/21 (42%), Positives = 12/21 (57%)

Query: 153 VFFLFIVIFLLLILLFFFIGL 173
           +  LF +I LLL L+  FI  
Sbjct: 3   LIMLFFLIALLLSLILLFIAF 23


>gnl|CDD|221295 pfam11893, DUF3413, Domain of unknown function (DUF3413).  This
           presumed domain is functionally uncharacterized. This
           domain is found in bacteria. This domain is about 250
           amino acids in length. This domain is found associated
           with pfam00884.
          Length = 253

 Score = 28.8 bits (65), Expect = 3.7
 Identities = 10/28 (35%), Positives = 11/28 (39%), Gaps = 2/28 (7%)

Query: 144 LVFIFLMRRVFFLFIVIF--LLLILLFF 169
           L FI   RR+      I   L L LL  
Sbjct: 76  LSFIIPPRRLLRGLAAIIATLGLTLLLI 103


>gnl|CDD|149223 pfam08019, DUF1705, Domain of unknown function (DUF1705).  Some
           members of this family are putative bacterial membrane
           proteins. This domain is found immediately N terminal to
           the sulfatase domain in many sulfatases.
          Length = 156

 Score = 28.2 bits (64), Expect = 3.7
 Identities = 5/35 (14%), Positives = 20/35 (57%), Gaps = 1/35 (2%)

Query: 137 WVIRVNFLVFI-FLMRRVFFLFIVIFLLLILLFFF 170
             +++ +  ++  L++R+  +   + ++L++ F F
Sbjct: 84  LRVKIRYRPWLRELLQRLLSILASLLVILLIAFLF 118


>gnl|CDD|233511 TIGR01654, bact_immun_7tm, bacteriocin-associated integral membrane
           (putative immunity) protein.  This model represents a
           family of integral membrane proteins, most of which are
           about 650 residues in size and predicted to span the
           membrane seven times. Nearly half of the members of this
           family are found in association with a member of the
           lactococcin 972 family of bacteriocins (TIGR01653).
           Others may be associated with uncharacterized proteins
           that may also act as bacteriocins. Although this protein
           is suggested to be an immunity protein, and the
           bacteriocin is suggested to be exported by a
           Sec-dependent process, the role of this protein is
           unclear [Cellular processes, Toxin production and
           resistance].
          Length = 679

 Score = 28.9 bits (65), Expect = 3.7
 Identities = 17/62 (27%), Positives = 27/62 (43%), Gaps = 10/62 (16%)

Query: 133 KSHSWVIRVNFLVFIFLMRRVFF---------LFIVIFLLLILLFFFIGLEQRRKNDNKS 183
           +++   +  N  V++F++              LF VIFLL ILL   I L +  K    S
Sbjct: 617 RTYKQYLVSNLAVYVFILILALIFSHAFYILLLFAVIFLLEILLILGI-LSKLEKRFFVS 675

Query: 184 QM 185
            +
Sbjct: 676 IL 677


>gnl|CDD|214457 MTH00205, ND4, NADH dehydrogenase subunit 4; Provisional.
          Length = 448

 Score = 28.8 bits (65), Expect = 3.9
 Identities = 12/39 (30%), Positives = 17/39 (43%), Gaps = 4/39 (10%)

Query: 137 WVIRVNFLV--FIFLMRRVF--FLFIVIFLLLILLFFFI 171
           W+  +  L    +F   +    F   VI LLLIL+  F 
Sbjct: 68  WISLLMVLASYKVFKSNKNSKLFTVSVIMLLLILILSFS 106


>gnl|CDD|220299 pfam09586, YfhO, Bacterial membrane protein YfhO.  This protein is
           a conserved membrane protein. The yfhO gene is
           transcribed in Difco sporulation medium and the
           transcription is affected by the YvrGHb two-component
           system. Some members of this family have been annotated
           as glycosyl transferases of the PMT family.
          Length = 835

 Score = 29.1 bits (66), Expect = 3.9
 Identities = 9/32 (28%), Positives = 15/32 (46%), Gaps = 4/32 (12%)

Query: 143 FLVFIFLMR----RVFFLFIVIFLLLILLFFF 170
            L+  F  +    +   LF ++ L LIL F+ 
Sbjct: 290 LLILYFFNKLIKLKEKLLFALLLLFLILSFYL 321


>gnl|CDD|215869 pfam00344, SecY, SecY translocase. 
          Length = 340

 Score = 28.6 bits (65), Expect = 4.1
 Identities = 11/56 (19%), Positives = 25/56 (44%), Gaps = 2/56 (3%)

Query: 116 SSSLVLSIVHTDIRVVIKSHSWVIRVNFLVFIFLMRRVFFLFIVIFLLLILLFFFI 171
           +SSL L+   T  +  + S   +  V+ L +   +    ++   I L++   +F+ 
Sbjct: 206 ASSL-LANPQTIAQF-LGSSFPLWPVSGLAYYLPIGSPVYILFYIVLIIFFSYFYT 259


>gnl|CDD|212036 cd10326, SLC5sbd_NIS-like, Na(+)/iodide (NIS) and
           Na(+)/multivitamin (SMVT) cotransporters, and related
           proteins; solute binding domain.  NIS (product of the
           SLC5A5 gene) transports I-, and other anions including
           ClO4-, SCN-, and Br-. SMVT (product of the SLC5A6 gene)
           transports biotin, pantothenic acid and lipoate. This
           subfamily also includes SMCT1 and 2. SMCT1(the product
           of the SLC5A8 gene) is a high-affinity transporter of
           various monocarboxylates including lactate and pyruvate,
           short-chain fatty acids, ketone bodies, nicotinate and
           its structural analogs, pyroglutamate, benzoate and its
           derivatives, and iodide. SMCT2 (product of the SLC5A12
           gene) is a low-affinity transporter for short-chain
           fatty acids, lactate, pyruvate, and nicotinate. This
           subfamily belongs to the solute carrier 5 (SLC5)
           transporter family.
          Length = 477

 Score = 28.7 bits (65), Expect = 4.2
 Identities = 6/23 (26%), Positives = 12/23 (52%)

Query: 151 RRVFFLFIVIFLLLILLFFFIGL 173
           ++  +   V  + + LLF  +GL
Sbjct: 266 QKNMYWNGVGLVPVNLLFLLLGL 288


>gnl|CDD|237811 PRK14767, PRK14767, lipoprotein signal peptidase; Provisional.
          Length = 174

 Score = 28.1 bits (63), Expect = 4.2
 Identities = 10/32 (31%), Positives = 18/32 (56%)

Query: 147 IFLMRRVFFLFIVIFLLLILLFFFIGLEQRRK 178
           +   + VFFL I + +++I+ F F  L   +K
Sbjct: 62  MLSGKAVFFLVITVAVVIIMAFVFFKLPATKK 93


>gnl|CDD|237560 PRK13922, PRK13922, rod shape-determining protein MreC;
           Provisional.
          Length = 276

 Score = 28.4 bits (64), Expect = 4.4
 Identities = 7/44 (15%), Positives = 16/44 (36%)

Query: 146 FIFLMRRVFFLFIVIFLLLILLFFFIGLEQRRKNDNKSQMGSGL 189
           F    + +  L +++ LLL L            +  +  +G  +
Sbjct: 4   FSRGKKLLLLLLLILLLLLALALLLADRRLGSLSPVRQVVGDVV 47



 Score = 27.6 bits (62), Expect = 9.2
 Identities = 6/27 (22%), Positives = 10/27 (37%)

Query: 144 LVFIFLMRRVFFLFIVIFLLLILLFFF 170
           L        +  L I++ LL + L   
Sbjct: 3   LFSRGKKLLLLLLLILLLLLALALLLA 29


>gnl|CDD|223742 COG0670, COG0670, Integral membrane protein, interacts with FtsH
           [General function prediction only].
          Length = 233

 Score = 28.4 bits (64), Expect = 4.5
 Identities = 8/50 (16%), Positives = 17/50 (34%), Gaps = 1/50 (2%)

Query: 130 VVIKSHSWVIRVNFLVFIFLMRRVFFLFIVIFLLLILLFFFIGLEQRRKN 179
           +V    + +     L  + L+      F++I   L  +FF      +   
Sbjct: 36  LVTAVGAALA-AYLLFNLGLLSLSPPGFVLIIAGLAGVFFLSRKINKSSP 84


>gnl|CDD|212037 cd10327, SLC5sbd_PanF, Na(+)/pantothenate cotransporters: PanF of
           Escherichia coli and related proteins; solute binding
           domain.  PanF catalyzes the Na+-coupled uptake of
           extracellular pantothenate for coenzyme A biosynthesis
           in cells. This subfamily belongs to the solute carrier 5
           (SLC5) transporter family.
          Length = 472

 Score = 28.7 bits (65), Expect = 4.5
 Identities = 11/33 (33%), Positives = 19/33 (57%), Gaps = 1/33 (3%)

Query: 141 VNFLVFIFLMRRVFFLF-IVIFLLLILLFFFIG 172
           V++++  +L  ++  L  IV  LLL L+ F I 
Sbjct: 432 VSYILITYLKIKILGLHPIVPSLLLSLIAFIIV 464


>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein. 
          Length = 582

 Score = 28.7 bits (64), Expect = 4.5
 Identities = 22/117 (18%), Positives = 33/117 (28%), Gaps = 4/117 (3%)

Query: 191 QSRRSDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQMG 250
           Q +    +   G    Q  R        +   QS+  +   +   G     R+    + G
Sbjct: 461 QPQNQRRQQNRGRQSSQPPRQQQNRSNQNNQRQSQGPNQGPRGQGGYNLRPRTYQPQRYG 520

Query: 251 SGLEQSRRSDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRKS 307
            G  Q RR            Q R    + Q  +   QSR        G    Q+R  
Sbjct: 521 GG--QGRR--WNPNPYRQSGQGRSQQQQPQPEARGNQSRTPGPGRGQGGRGNQNRNQ 573


>gnl|CDD|217379 pfam03125, Sre, C. elegans Sre G protein-coupled chemoreceptor.
           Caenorhabditis elegans Sre proteins are candidate
           chemosensory receptors. There are four main recognised
           groups of such receptors: Odr-10, Sra, Sro, and Srg. Sre
           (this family), Sra pfam02117 and Srb pfam02175 comprise
           the Sra group. All of the above receptors are thought to
           be G protein-coupled seven transmembrane domain
           proteins. The existence of several different
           chemosensory receptors underlies the fact that in spite
           of having only 20-30 chemosensory neurones, C. elegans
           detects hundreds of different chemicals, with the
           ability to discern individual chemicals among
           combinations.
          Length = 363

 Score = 28.4 bits (64), Expect = 5.0
 Identities = 13/58 (22%), Positives = 21/58 (36%), Gaps = 8/58 (13%)

Query: 118 SLVLSIVHTDIRVVIKSHSWVIRVNFLVFIFLMRRVF---FLFIVIFLLLILLFFFIG 172
            L+LSI      ++         V   +++ L  R+F      + I L  I     IG
Sbjct: 25  QLLLSIFELLFYILC-----AYIVFVSLYVILKIRLFHKNLTILFIPLFGIWFELIIG 77


>gnl|CDD|227725 COG5438, COG5438, Predicted multitransmembrane protein [Function
           unknown].
          Length = 385

 Score = 28.5 bits (64), Expect = 5.1
 Identities = 17/59 (28%), Positives = 28/59 (47%), Gaps = 5/59 (8%)

Query: 144 LVFIFLM--RRVFFLFIVIFLLLILLFFFIGLEQRRKNDNKSQMGSGLEQSRRSDNKSQ 200
           L  I L+  +  F +F+VIF L++L+ F         N  K ++      S  S++KS 
Sbjct: 6   LKNIPLVKEKLFFLIFLVIFTLVLLVKFIAESA---YNAPKGEITDVKIVSNNSEDKSS 61


>gnl|CDD|233317 TIGR01218, Gpos_tandem_5TM, tandem five-transmembrane protein.
           Members of this family of proteins, with average length
           of 210, have no invariant residues but five predicted
           transmembrane segments. Strangely, most members occur in
           groups of consecutive paralogous genes. A striking
           example is a set of eleven encoded consecutively,
           head-to-tail, in Staphylococcus aureus strain COL.
          Length = 208

 Score = 28.0 bits (63), Expect = 5.7
 Identities = 15/49 (30%), Positives = 26/49 (53%), Gaps = 7/49 (14%)

Query: 126 TDIRVVIKSHSWVIRVNFLVFIFLMRRVFFLFIVIFLLLILLFFFIGLE 174
              ++ I+  S+  R+ F++F+++    FFL + IFL    LF   G E
Sbjct: 133 KRYKIKIRPTSFKYRI-FILFLYI----FFLGLSIFL--FYLFINNGGE 174



 Score = 27.6 bits (62), Expect = 7.4
 Identities = 12/57 (21%), Positives = 20/57 (35%), Gaps = 18/57 (31%)

Query: 143 FLVFIFLMRR--------------VFFLFIVIFLLLILLFFFIGLEQRRKNDNKSQM 185
             V ++ + R                 LF++  LL+ LL  +I     +KN  K   
Sbjct: 74  ISVLLYRLLRPLFDYFDISQSIFIKIILFLISILLVYLLRIYI----NKKNKKKLYN 126



 Score = 27.2 bits (61), Expect = 9.6
 Identities = 11/41 (26%), Positives = 19/41 (46%), Gaps = 11/41 (26%)

Query: 139 IRVNFLVFIFLMRRVFFLF--------IVIFLLLILLFFFI 171
           + + FL     +   F+LF        I++   +I LFFF+
Sbjct: 153 LYIFFLGLSIFL---FYLFINNGGENIILVINGIIFLFFFL 190


>gnl|CDD|234350 TIGR03766, TIGR03766, conserved hypothetical integral membrane
           protein.  Models TIGR03110, TIGR03111, and TIGR03112
           describe a three-gene system found in several
           Gram-positive bacteria, where TIGR03110 (XrtG) is
           distantly related to a putative transpeptidase,
           exosortase (TIGR02602). This model describes a small
           clade that correlates by both gene clustering and
           phyletic pattern, although imperfectly, to the three
           gene system. Both this narrow clade, and the larger set
           of full-length homologous integral membrane proteins,
           have an especially well-conserved region near the
           C-terminus with an invariant tyrosine. The function is
           unknown.
          Length = 483

 Score = 28.4 bits (64), Expect = 5.7
 Identities = 13/72 (18%), Positives = 25/72 (34%), Gaps = 5/72 (6%)

Query: 103 QKWMWHKRDHSSSSSSLVLSIVHTDIRVVIKSHSWVIRVNFLVFIFLMRRVFFLFIVIFL 162
              +W           L++ I+     ++I  + W        F+   + V  + I I L
Sbjct: 21  NLNLWGISTTLIVL--LLILII---FLILIIIYFWKKVKKRYRFLMRHKGVTIIVIFILL 75

Query: 163 LLILLFFFIGLE 174
           L++ L     L 
Sbjct: 76  LILQLILVTALH 87


>gnl|CDD|219625 pfam07895, DUF1673, Protein of unknown function (DUF1673).  This
           family contains hypothetical proteins of unknown
           function expressed by two archaeal species.
          Length = 204

 Score = 28.1 bits (63), Expect = 5.9
 Identities = 12/43 (27%), Positives = 21/43 (48%), Gaps = 1/43 (2%)

Query: 132 IKSHSWVIRVNFLVFIFLMRRVFFLFIVIFLLLI-LLFFFIGL 173
           +K+  W+ + +  + +  +      F+VI  L I LLF   GL
Sbjct: 46  LKNPGWLRKRSNRLLLASIFLTLAYFLVISQLGINLLFLLAGL 88


>gnl|CDD|222242 pfam13584, BatD, Oxygen tolerance.  This family of proteins carries
           up to three membrane spanning regions and is involved in
           tolerance to oxygen in in Bacteroides spp.
          Length = 488

 Score = 28.4 bits (64), Expect = 6.0
 Identities = 9/34 (26%), Positives = 20/34 (58%)

Query: 148 FLMRRVFFLFIVIFLLLILLFFFIGLEQRRKNDN 181
           F    +F+L +++ LLL +L   +  ++R +N +
Sbjct: 425 FFGSTLFWLLLLLPLLLFVLLLILYRKRRARNAD 458


>gnl|CDD|177157 MTH00094, ND4, NADH dehydrogenase subunit 4; Provisional.
          Length = 403

 Score = 28.3 bits (64), Expect = 6.2
 Identities = 12/39 (30%), Positives = 18/39 (46%), Gaps = 6/39 (15%)

Query: 136 SWVIRVNFLVFIFLMRRVFFLFIVIFL--LLILLFFFIG 172
           S     N   F F+    FF F+V F   L +++  F+G
Sbjct: 338 SNSFMKNKFSFFFI----FFYFLVSFYYSLFLIVSSFMG 372


>gnl|CDD|236412 PRK09204, secY, preprotein translocase subunit SecY; Reviewed.
          Length = 426

 Score = 28.2 bits (64), Expect = 6.4
 Identities = 12/25 (48%), Positives = 15/25 (60%)

Query: 153 VFFLFIVIFLLLILLFFFIGLEQRR 177
           VF L IV+FL +I    F+   QRR
Sbjct: 215 VFLLLIVLFLAVIAFVVFVEQAQRR 239


>gnl|CDD|226638 COG4168, SapB, ABC-type antimicrobial peptide transport system,
           permease component [Defense mechanisms].
          Length = 321

 Score = 28.2 bits (63), Expect = 6.5
 Identities = 7/23 (30%), Positives = 14/23 (60%)

Query: 146 FIFLMRRVFFLFIVIFLLLILLF 168
            IF +RR+  L + + +L ++ F
Sbjct: 2   IIFTLRRILLLILTLLILTLVGF 24


>gnl|CDD|220958 pfam11055, Gsf2, Glucose signalling factor 2.  Gsf2 is localised to
           the ER and functions to promote the secretion of certain
           hexose transporters.
          Length = 376

 Score = 28.1 bits (63), Expect = 6.5
 Identities = 8/21 (38%), Positives = 15/21 (71%)

Query: 153 VFFLFIVIFLLLILLFFFIGL 173
           +FF+F ++ +L I LF + G+
Sbjct: 180 LFFVFHIVKILFIFLFLYSGM 200


>gnl|CDD|223964 COG1033, COG1033, Predicted exporters of the RND superfamily
           [General function prediction only].
          Length = 727

 Score = 28.0 bits (63), Expect = 6.9
 Identities = 8/25 (32%), Positives = 13/25 (52%)

Query: 148 FLMRRVFFLFIVIFLLLILLFFFIG 172
           F++RR   + ++  LL IL  F   
Sbjct: 8   FIIRRRKSIILIFLLLTILSAFGAQ 32


>gnl|CDD|177159 MTH00097, ND6, NADH dehydrogenase subunit 6; Provisional.
          Length = 121

 Score = 26.8 bits (60), Expect = 7.5
 Identities = 9/16 (56%), Positives = 11/16 (68%)

Query: 154 FFLFIVIFLLLILLFF 169
            F  IV F+L +LLFF
Sbjct: 89  IFWLIVFFILFVLLFF 104


>gnl|CDD|216994 pfam02361, CbiQ, Cobalt transport protein.  This family consists of
           various cobalt transport proteins Most of which are
           found in Cobalamin (Vitamin B12) biosynthesis operons.
           In Salmonella the cbiN cbiQ (product CbiQ in this
           family) and cbiO are likely to form an active cobalt
           transport system.
          Length = 217

 Score = 27.7 bits (62), Expect = 7.7
 Identities = 11/36 (30%), Positives = 18/36 (50%), Gaps = 1/36 (2%)

Query: 138 VIRVNFLVFIFLMRRVFFLFIVIFLLLILLFFFIGL 173
           +I +  L+   L+  +   F+  FL L LLF  +G 
Sbjct: 34  LIIIALLLLTLLLVGLP-RFLARFLRLPLLFPLLGF 68


>gnl|CDD|177203 MTH00145, CYTB, cytochrome b; Provisional.
          Length = 379

 Score = 27.9 bits (63), Expect = 8.4
 Identities = 6/19 (31%), Positives = 12/19 (63%)

Query: 146 FIFLMRRVFFLFIVIFLLL 164
           F  L + +F++F+  F +L
Sbjct: 318 FYPLNQVLFWIFVGNFFIL 336


>gnl|CDD|233155 TIGR00860, LIC, Cation transporter family protein.  The
           Ligand-gated Ion Channel (LIC) Family of
           Neurotransmitter Receptors TC 1.A.9)Members of the LIC
           family of ionotropic neurotransmitter receptors are
           found only in vertebrate and invertebrate animals. They
           exhibit receptor specificity for (1)acetylcholine, (2)
           serotonin, (3) glycine, (4) glutamate and (5)
           g-aminobutyric acid (GABA). All of these receptor
           channels are probably hetero- orhomopentameric. The best
           characterized are the nicotinic acetyl-choline receptors
           which are pentameric channels of a2bgd subunit
           composition. All subunits arehomologous. The three
           dimensional structures of the protein complex in both
           the open and closed configurations have been solved at
           0.9 nm resolution.The channel protein complexes of the
           LIC family preferentially transport cations or anions
           depending on the channel (e.g., the acetylcholine
           receptors are cationselective while glycine receptors
           are anion selective) [Transport and binding proteins,
           Cations and iron carrying compounds].
          Length = 459

 Score = 27.8 bits (62), Expect = 8.4
 Identities = 20/116 (17%), Positives = 33/116 (28%), Gaps = 8/116 (6%)

Query: 153 VFFLFIVIFLLLILLFFFIGLEQRRKNDNKSQMGSGLEQSRRSDNKSQMGSG-----LEQ 207
           V+F   + F+ L LL          K+  + +    LE+      K + G       L+ 
Sbjct: 305 VYFAVCMAFVFLALLETAFVNYVHHKDPAQGKRHLLLERCAWRLCKQEPGEDYRRPALDH 364

Query: 208 SRRSD---NKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSDNKSQMGSGLEQSRRSD 260
           +  S         G GLE               E         +      + R+ D
Sbjct: 365 ASLSSVEMRAKDGGRGLESPTERQRLLHSPPPAEGDLDLAGILEEVRIAHRFRKRD 420


>gnl|CDD|233780 TIGR02210, rodA_shape, rod shape-determining protein RodA.  This
           protein is a member of the FtsW/RodA/SpoVE family
           (pfam01098). It is found only in species with rod (or
           spiral) shapes. In many species, mutation of rodA has
           been shown to correlate with loss of the normal rod
           shape. Note that RodA homologs are found, scoring below
           the cutoffs for this model, in a number of both
           rod-shaped and coccoid bacteria, including four proteins
           in Bacillus anthracis, for example [Cell envelope,
           Biosynthesis and degradation of murein sacculus and
           peptidoglycan, Cellular processes, Cell division].
          Length = 352

 Score = 27.9 bits (63), Expect = 8.7
 Identities = 11/43 (25%), Positives = 18/43 (41%), Gaps = 11/43 (25%)

Query: 141 VNFLVFIFLM-----------RRVFFLFIVIFLLLILLFFFIG 172
           V F + + LM           RR+ +   V+ LLL++     G
Sbjct: 36  VWFGIGLVLMIIVALIDYRFLRRLAYPLYVLGLLLLVAVLLFG 78


>gnl|CDD|224055 COG1132, MdlB, ABC-type multidrug transport system, ATPase and
           permease components [Defense mechanisms].
          Length = 567

 Score = 27.8 bits (62), Expect = 8.7
 Identities = 10/62 (16%), Positives = 21/62 (33%), Gaps = 8/62 (12%)

Query: 112 HSSSSSSLVLSIVHTDIRVVIKSHSWVIRVNFLVFIFLMRRVFFLFIVIFLLLILLFFFI 171
              + S  ++S +  D+          +       + L+     L I   +LL  L + +
Sbjct: 103 FDKAKSGDLISRLTNDV--------EAVSNLVSTVLVLVFTSILLLIGSLVLLFSLSWRL 154

Query: 172 GL 173
            L
Sbjct: 155 AL 156


>gnl|CDD|234609 PRK00068, PRK00068, hypothetical protein; Validated.
          Length = 970

 Score = 27.6 bits (62), Expect = 9.7
 Identities = 9/22 (40%), Positives = 14/22 (63%)

Query: 149 LMRRVFFLFIVIFLLLILLFFF 170
           LMRR   L I+  ++++LL F 
Sbjct: 10  LMRRSKILIIIALIIILLLLFG 31


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.311    0.125    0.341 

Gapped
Lambda     K      H
   0.267   0.0760    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 14,605,800
Number of extensions: 1339436
Number of successful extensions: 4137
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3848
Number of HSP's successfully gapped: 268
Length of query: 319
Length of database: 10,937,602
Length adjustment: 97
Effective length of query: 222
Effective length of database: 6,635,264
Effective search space: 1473028608
Effective search space used: 1473028608
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 59 (26.4 bits)