RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy929
         (483 letters)



>gnl|CDD|215646 pfam00001, 7tm_1, 7 transmembrane receptor (rhodopsin family).
           This family contains, amongst other G-protein-coupled
           receptors (GCPRs), members of the opsin family, which
           have been considered to be typical members of the
           rhodopsin superfamily. They share several motifs, mainly
           the seven transmembrane helices, GCPRs of the rhodopsin
           superfamily. All opsins bind a chromophore, such as
           11-cis-retinal. The function of most opsins other than
           the photoisomerases is split into two steps: light
           absorption and G-protein activation. Photoisomerases, on
           the other hand, are not coupled to G-proteins - they are
           thought to generate and supply the chromophore that is
           used by visual opsins.
          Length = 251

 Score =  173 bits (441), Expect = 4e-51
 Identities = 64/182 (35%), Positives = 91/182 (50%), Gaps = 5/182 (2%)

Query: 93  IAVCIVRKLRRPCNYLLVSLAVSDLCVAILVMPMALLYEVSGKWNFGFMMCNLWVSFDVL 152
           + +   +KLR P N  L++LAV+DL   + + P AL Y V G W FG  +C L     V+
Sbjct: 1   LVILRTKKLRTPTNIFLLNLAVADLLFLLTLPPWALYYLVGGDWPFGDALCKLVGFLFVV 60

Query: 153 SCTASILNLCMISVDRYYAITKPLEYGVKRTPKRMILCVVFVWLGAACISLPPLIVLGND 212
           +  ASIL L  IS+DRY AI  PL Y   RTP+R  + ++ VW+ A  +SLPPL+     
Sbjct: 61  NGYASILLLTAISIDRYLAIVHPLRYRRIRTPRRAKVLILVVWVLALLLSLPPLLFSWLR 120

Query: 213 YREDNGVSFCSVC-----QNYGYQIYATLGSFYIPLTVMITVYYKIFCAARKIVAEEKRA 267
             E+  V+ C +          Y + +TL  F +PL V++  Y  I    RK        
Sbjct: 121 TVEEGNVTTCLIDFPEESTKRSYTLLSTLLGFVLPLLVILVCYTLILRTLRKRARSGASQ 180

Query: 268 QS 269
             
Sbjct: 181 AR 182


>gnl|CDD|222976 PHA03087, PHA03087, G protein-coupled chemokine receptor-like
           protein; Provisional.
          Length = 335

 Score = 74.0 bits (182), Expect = 2e-14
 Identities = 51/197 (25%), Positives = 93/197 (47%), Gaps = 11/197 (5%)

Query: 71  LIIGLILLIIIIVTIVGNILVCIAVCIVRKLRRPCNYLLVSLAVSDLCVAILVMPMALLY 130
            I+ ++   I    +VGNI+V I V    K++ P +  L++LAVSDL + ++ +P  + Y
Sbjct: 41  TILIVVYSTIFFFGLVGNIIV-IYVLTKTKIKTPMDIYLLNLAVSDL-LFVMTLPFQIYY 98

Query: 131 EVSGKWNFGFMMCNLWVSFDVLSCTASILNLCMISVDRYYAITKPLEYGVKRTPKRMILC 190
            +  +W+FG   C +      +    S+  + ++SVDRY AI  P++     T K   + 
Sbjct: 99  YILFQWSFGEFACKIVSGLYYIGFYNSMNFITVMSVDRYIAIVHPVKSNKINTVKYGYIV 158

Query: 191 VVFVWLGAACISLPPLIVLGNDYREDNGVSFCSVCQNYGYQIYATLGSF-------YIPL 243
            + +W+ +   + P L V     ++D+    C +  N     +    +F        IPL
Sbjct: 159 SLVIWIISIIETTPILFV--YTTKKDHETLICCMFYNNKTMNWKLFINFEINIIGMLIPL 216

Query: 244 TVMITVYYKIFCAARKI 260
           T+++  Y KI    + I
Sbjct: 217 TILLYCYSKILITLKGI 233


>gnl|CDD|165021 PHA02638, PHA02638, CC chemokine receptor-like protein;
           Provisional.
          Length = 417

 Score = 50.4 bits (120), Expect = 1e-06
 Identities = 45/185 (24%), Positives = 91/185 (49%), Gaps = 4/185 (2%)

Query: 20  PQTHSPEVCMDSFSPHNISRFSLTNDTSPASFHNYTRLGLLSHSPYSPLQALIIGLILLI 79
           P + SP    +S S  + S  S+T D    +   Y  + + +   Y P  +  I +  +I
Sbjct: 49  PTSESPSPNSNSSSSSSSSSSSITYDYEYENNITYELINIKNKCMY-PSISEYIKIFYII 107

Query: 80  IIIVTIVGNILVCIAVCIVRKLRRPCNYLLVSLAVSDLCVAILVMPMALLYEVSGKWNFG 139
           I I+ + GN  + I +   +K++   +  + +LA+SDL + ++  P  ++Y    +W FG
Sbjct: 108 IFILGLFGNAAI-IMILFCKKIKTITDIYIFNLAISDL-IFVIDFPF-IIYNEFDQWIFG 164

Query: 140 FMMCNLWVSFDVLSCTASILNLCMISVDRYYAITKPLEYGVKRTPKRMILCVVFVWLGAA 199
             MC +  +   +   +++  + ++S+DRY+AI  P+ +   RT    I+  +  W+ + 
Sbjct: 165 DFMCKVISASYYIGFFSNMFLITLMSIDRYFAILYPISFQKYRTFNIGIILCIISWILSL 224

Query: 200 CISLP 204
            I+ P
Sbjct: 225 IITSP 229


>gnl|CDD|165177 PHA02834, PHA02834, chemokine receptor-like protein; Provisional.
          Length = 323

 Score = 49.5 bits (118), Expect = 2e-06
 Identities = 46/187 (24%), Positives = 88/187 (47%), Gaps = 19/187 (10%)

Query: 78  LIIIIVTIVGNILVCIAVCIVRKLRRPCNYLLVSLAVSDLCVAILVMPMALLYEVSGKWN 137
           +++ I  ++GN+LV IAV IV++     +  L ++A+SDL + +   P  +++    +W 
Sbjct: 36  ILLFIFGLIGNVLV-IAVLIVKRFMFVVDVYLFNIAMSDLML-VFSFPF-IIHNDLNEWI 92

Query: 138 FGFMMCNLWVSFDVLSCTASILNLCMISVDRYYAITKPLEYGVKRTPKRMILCVVFVWLG 197
           FG  MC L +    +   +++  + +IS+DRY  +    +   K     ++L V   W+ 
Sbjct: 93  FGEFMCKLVLGVYFVGFFSNMFFVTLISIDRYILVVNATKIKNKSISLSVLLSVA-AWVC 151

Query: 198 AACISLPPLIVLGNDYREDNGVSFCSVCQNYGYQIYATLGSFY----------IPLTVMI 247
           +  +S+P +++    Y  DN       C    Y    +  +F+          IPL ++I
Sbjct: 152 SVILSMPAMVL----YYVDN-TDNLKQCIFNDYHENFSWSAFFNFEINIFGIVIPLIILI 206

Query: 248 TVYYKIF 254
             Y KI 
Sbjct: 207 YCYSKIL 213


>gnl|CDD|192535 pfam10320, 7TM_GPCR_Srsx, Serpentine type 7TM GPCR chemoreceptor
           Srsx.  Chemoreception is mediated in Caenorhabditis
           elegans by members of the seven-transmembrane
           G-protein-coupled receptor class (7TM GPCRs) of proteins
           which are of the serpentine type. Srsx is a solo family
           amongst the superfamilies of chemoreceptors.
           Chemoperception is one of the central senses of soil
           nematodes like C. elegans which are otherwise 'blind'
           and 'deaf'.
          Length = 257

 Score = 45.7 bits (109), Expect = 2e-05
 Identities = 32/152 (21%), Positives = 60/152 (39%), Gaps = 23/152 (15%)

Query: 81  IIVTIVGNILVCIAVCIVRKLRRPCNYLLVSLAVSDL--CVAILVMPMALLYEVSGKWNF 138
            ++ I GN+++ I     +KLR  C+YL+    ++DL      ++  + L        N 
Sbjct: 1   SVIGIFGNVIMIILTFKKKKLRSKCSYLICVQCLADLLCLSGEIIFVVLLFTGTQLTRNE 60

Query: 139 GFMMCNLWVSFDVLSCTASILNLCMISVDRYYAITKPLEYGVKRTPKRMILCVVFVWLGA 198
            F++    +   +   TA    + MI +DR  A+  P+ Y  +       L +  +    
Sbjct: 61  CFLI----IIPYIFGQTAQSPLMLMIGIDRLIAVKFPIFY--RLLSSSKYLFIQLI---- 110

Query: 199 ACISLPPLI------VLGNDYREDNGVSFCSV 224
                 P+I      V G   R+D  +  C+ 
Sbjct: 111 -----FPVIYSSFILVYGFLERDDETIIVCAP 137


>gnl|CDD|223017 PHA03235, PHA03235, DNA packaging protein UL33; Provisional.
          Length = 409

 Score = 38.6 bits (90), Expect = 0.005
 Identities = 26/129 (20%), Positives = 50/129 (38%), Gaps = 13/129 (10%)

Query: 153 SCTASILNLCMISVDRYYAITKPLEYGVKRTPKRMILCVVFVWLGAACISLPPLI---VL 209
           SCT     + +I+ DRY  I +          +     +   W  +   S P  +   V+
Sbjct: 115 SCTVGFATVALIAADRYRVIHQR-TRARSSAYRSTYKILGLTWFASLICSGPAPVYTTVV 173

Query: 210 GNDYREDNGVSFCSVCQNYGY-QIYATLGSFYIPLT--------VMITVYYKIFCAARKI 260
            +D  +     + +    +   Q+   L +F + LT        VM+T +Y  F    K 
Sbjct: 174 AHDDVDPEAPGYETCVIYFRADQVKTVLSTFKVLLTLVWGIAPVVMMTWFYTFFYRTLKR 233

Query: 261 VAEEKRAQS 269
            + +KR+++
Sbjct: 234 ASYKKRSRT 242


>gnl|CDD|217323 pfam03023, MVIN, MviN-like protein.  Deletion of the mviN virulence
           gene in Salmonella enterica serovar. Typhimurium greatly
           reduces virulence in a mouse model of typhoid-like
           disease. Open reading frames encoding homologues of MviN
           have since been identified in a variety of bacteria,
           including pathogens and non-pathogens and
           plant-symbionts. In the nitrogen-fixing symbiont
           Rhizobium tropici, mviN is required for motility. The
           MviM protein is predicted to be membrane-associated.
          Length = 452

 Score = 36.1 bits (84), Expect = 0.040
 Identities = 19/93 (20%), Positives = 40/93 (43%), Gaps = 9/93 (9%)

Query: 72  IIGLILLIIIIVTIVGNILVCIAVCIVRKLRRPCNYLLVSLAVSDLCVAILVMPMALLYE 131
           +  L++ ++++VT++G +    A  ++R L    +    SL VS   +  +  P  LL  
Sbjct: 59  VSTLLIGVLLLVTLLGILA---APWVIRLLAPGFDAETFSLTVS---LLRITFPYLLLV- 111

Query: 132 VSGKWNFGFMMCNLWVSFDVLSCTASILNLCMI 164
                     + N    F   + +  +LN+ +I
Sbjct: 112 --SLSAVFGAVLNARKKFFAPAFSPVLLNIGVI 142


>gnl|CDD|220695 pfam10328, 7TM_GPCR_Srx, Serpentine type 7TM GPCR chemoreceptor
           Srx.  Chemoreception is mediated in Caenorhabditis
           elegans by members of the seven-transmembrane
           G-protein-coupled receptor class (7TM GPCRs) of proteins
           which are of the serpentine type. Srx is part of the Srg
           superfamily of chemoreceptors. Chemoperception is one of
           the central senses of soil nematodes like C. elegans
           which are otherwise 'blind' and 'deaf'.
          Length = 275

 Score = 34.9 bits (81), Expect = 0.061
 Identities = 35/208 (16%), Positives = 73/208 (35%), Gaps = 24/208 (11%)

Query: 78  LIIIIVTIVGNILVCIAVCIVRKLRRPCNYLLVSLAVSD----LCVAILVMPMALLYEVS 133
            +I +  I+ N L+      +  L+     L  + A+S+          V+PM LL    
Sbjct: 1   FLISLFGIILNWLIFYIFLKLPSLKNSFGILCANKAISNAIICTIFLFYVVPMTLLDLSF 60

Query: 134 GK-----WNFGFMMCNLWVSFDVLSCTASILNLCMISVDRYYAITKPLEYGVKRTPKRMI 188
                     G ++  L+          S L   +IS++R+ A+  P +Y    + K   
Sbjct: 61  LPELLNSHIGGLILLGLY--------EISPLTHLLISLNRFCAVFFPFKYKKIFSIKNTK 112

Query: 189 LCVVFVWLGAACISLPPLIVLGNDYR-EDNGVSF------CSVCQNYGYQIYATLGSFYI 241
           + + F+W+ A  IS      LG  +      +++            +       L    I
Sbjct: 113 IIITFIWIIAIIISTLFYFPLGCHFYYSPESLTWSFDEDPPCAEIGWYGDFLKNLVLVII 172

Query: 242 PLTVMITVYYKIFCAARKIVAEEKRAQS 269
              + +  + K+   ++K     + ++ 
Sbjct: 173 TNIINVITFIKLRKFSKKSSLSSEESKK 200


>gnl|CDD|177562 PHA03234, PHA03234, DNA packaging protein UL33; Provisional.
          Length = 338

 Score = 32.6 bits (74), Expect = 0.39
 Identities = 45/205 (21%), Positives = 91/205 (44%), Gaps = 14/205 (6%)

Query: 68  LQALIIGLILLIIIIVTIVGNILVCIAVCIVRKLRRPC-NYLLVSLAVSDLCVAILVMPM 126
           L++ I G++L +II + I   I++C  +   +  +    ++ L++L  SD    + V  +
Sbjct: 34  LESAINGIMLTLIIPMII---IVICTLIIYHKVAKHNATSFYLITLFASDFLHMLCVFFL 90

Query: 127 ALLYEVSGKWNFGFMMCNLWVSFDVLSCTASILNLCMISVDRYYAITKPLEYGVKRTPKR 186
            L  E    +N  F  C L++     SC+ SI  L +I+  RY  + +  +   K     
Sbjct: 91  TLNREALFNFNQAFCQCVLFIYH--ASCSYSICMLAIIATIRYKTLHRRKKNDKKNNHIG 148

Query: 187 MILCVVFVWLGAACISLPP-LIVLGNDYREDNGVSFCSVCQNYGYQIYATLGSFYIPL-- 243
             + ++F  L +A  ++P  L V     + + G     +     Y ++  +   +  +  
Sbjct: 149 RNIGILF--LASAMCAIPAALFVKTEGKKGNYGKCNIHISSKKAYDLFIAIKIVFCFIWG 206

Query: 244 ---TVMITVYYKIFCAARKIVAEEK 265
              T++ + +Y IFC A   + E+K
Sbjct: 207 IFPTMIFSFFYVIFCKALHALTEKK 231


>gnl|CDD|223800 COG0728, MviN, Uncharacterized membrane protein, putative virulence
           factor [General function prediction only].
          Length = 518

 Score = 32.6 bits (75), Expect = 0.43
 Identities = 22/96 (22%), Positives = 43/96 (44%), Gaps = 10/96 (10%)

Query: 70  ALIIGLILLIIIIVTIVGNILVCIAVCIVRKLRRP-CNYLLVSLAVSDLCVAILVMPMAL 128
            L+ GL+ L++++VT++G +       +VR L  P  +     LAV    +  ++ P  L
Sbjct: 91  RLVTGLLTLVLLLVTLLGILFAPW---LVRLLLAPGFDETDKFLAVL---LTRILFPYLL 144

Query: 129 LYEVSGKWNFGFMMCNLWVSFDVLSCTASILNLCMI 164
              +S        + N    F + +    +LN+ +I
Sbjct: 145 FISLSA---LFGAILNSRNRFFIPAFAPVLLNVSVI 177


>gnl|CDD|226122 COG3594, NolL, Fucose 4-O-acetylase and related acetyltransferases
           [Carbohydrate transport and metabolism].
          Length = 343

 Score = 32.3 bits (74), Expect = 0.49
 Identities = 15/54 (27%), Positives = 22/54 (40%)

Query: 51  FHNYTRLGLLSHSPYSPLQALIIGLILLIIIIVTIVGNILVCIAVCIVRKLRRP 104
           F        + +   S L  L I LI+ +III  +    L  +    V +LRR 
Sbjct: 282 FKVLRLQFGILNLSTSYLAFLPILLIVSLIIIAVLSRVFLKTLLYLPVIELRRK 335


>gnl|CDD|216273 pfam01061, ABC2_membrane, ABC-2 type transporter. 
          Length = 210

 Score = 31.5 bits (72), Expect = 0.61
 Identities = 23/102 (22%), Positives = 39/102 (38%), Gaps = 16/102 (15%)

Query: 63  SPYSPLQALIIGLILLIIIIVTIVGNILVCIAVCIVRKLRRPCNYLLVSLAVSDLCVAIL 122
           SP     A ++  IL+ + I  +   I + I             Y +V L VS   + +L
Sbjct: 88  SPLYSPSAYVLAKILVELPISLLQAIIFLLIV------------YFMVGLPVSRFFLFLL 135

Query: 123 VMPMALLYEVSGKWNFGFMMCNLWVSFDVLSCTASILNLCMI 164
           V+ +  L         G  +  L  SF+  S    +L L ++
Sbjct: 136 VLLLTALAASG----LGLFIAALAPSFEDASQIGPLLLLPLL 173


>gnl|CDD|222432 pfam13882, Bravo_FIGEY, C-terminal domain of Fibronectin type III. 
           This is the very C-terminal region of neural adhesion
           molecule L1 proteins that are also known as Bravo or
           NrCAM. It lies upstream of the IG and Fn3 domains and
           has the highly conserved motif FIGEY. The function is
           not known.
          Length = 118

 Score = 30.4 bits (69), Expect = 0.70
 Identities = 5/24 (20%), Positives = 14/24 (58%)

Query: 79  IIIIVTIVGNILVCIAVCIVRKLR 102
           I ++  I   +L+ + VC +++ +
Sbjct: 9   IGLMCAIALLLLILLIVCFIKRNK 32


>gnl|CDD|216260 pfam01040, UbiA, UbiA prenyltransferase family. 
          Length = 259

 Score = 31.3 bits (71), Expect = 0.80
 Identities = 34/156 (21%), Positives = 64/156 (41%), Gaps = 18/156 (11%)

Query: 66  SPLQALIIGLILLII--IIVTIVG-NILVCIAVCIV--------RKLRRPCNYLLVSLAV 114
           SP +AL++GL LL++  ++  ++G  +L+   + ++         +L+R     L+   V
Sbjct: 68  SPREALLLGLALLLLGLLLALLLGPLLLLLGLLGLLLGLLYSPPLRLKR---RPLLGELV 124

Query: 115 SDLCVAILVMPMALLYEVSGKWNFGFMMCNLWVSFDVLSCTASILNLCMISVDRYYAI-T 173
             L   +L++  A    V G      ++  L V    L+      ++  +  DR   I T
Sbjct: 125 VGLAFGLLILLGAYA--VGGDIPSPLLLLALPVFLLSLA-ILLTNDIRDVEGDRKAGIRT 181

Query: 174 KPLEYGVKRTPKRMILCVVFVWLGAACISLPPLIVL 209
            P+  G KR      L +    L    + L    +L
Sbjct: 182 LPVRLGRKRALALYALLLAAALLLLLLLLLLLAPLL 217


>gnl|CDD|220495 pfam09971, DUF2206, Predicted membrane protein (DUF2206).  This
           domain, found in various hypothetical archaeal proteins,
           has no known function.
          Length = 367

 Score = 31.6 bits (72), Expect = 0.91
 Identities = 24/95 (25%), Positives = 39/95 (41%), Gaps = 10/95 (10%)

Query: 36  NISRFSLTNDTSPASFHNYTRLGLLSHSPYSPLQALIIGLILLIIIIVTIVGNILVCIAV 95
            I     T   +P      ++   L  +  S L  L+  LI L I  + ++G +      
Sbjct: 85  KILSSIFTESLNP----TNSQGLELLTTSSSFLH-LLAKLINLFIQGLIVIGLL-----Y 134

Query: 96  CIVRKLRRPCNYLLVSLAVSDLCVAILVMPMALLY 130
            + + +RR     L  LA S L +AILV  + L +
Sbjct: 135 VLYKLIRRRSKKSLEYLAFSFLFLAILVASIVLPF 169


>gnl|CDD|225154 COG2245, COG2245, Predicted membrane protein [Function unknown].
          Length = 182

 Score = 30.4 bits (69), Expect = 1.4
 Identities = 27/139 (19%), Positives = 57/139 (41%), Gaps = 14/139 (10%)

Query: 73  IGLILLII--------IIVTIVGNILVCIAVCIVRKL---RRPCNYLLVSLAVSDLCVAI 121
           IG IL +I         I+++VG +LV +A+ ++ K     R  N  L+++ +  +   +
Sbjct: 12  IGAILQLIGFFAGWPLWILSLVGLVLVLVALYMISKQVGEDRIFNNYLIAVVLGLVGFVV 71

Query: 122 LVMPMALLYEVSGKWNFGFMMCNLWVSFDVLSCTASILNLCMISVDRYYAITKPLEYGVK 181
           ++   A    V+       +  +      + S  A  + L ++ +   Y   K LE   +
Sbjct: 72  VIFATAGFSLVALLLGTFMLPAHGL--SALGSFLAGFVILWILYIISAYFQKKALEALAQ 129

Query: 182 RTPKRMI-LCVVFVWLGAA 199
            T   +     +  ++GA 
Sbjct: 130 YTGVDLFRTAGLLYFIGAI 148


>gnl|CDD|221239 pfam11812, DUF3333, Domain of unknown function (DUF3333).  This
          family of proteins are functionally uncharacterized.
          This family is only found in bacteria. This presumed
          domain is typically between 116 to 159 amino acids in
          length.
          Length = 155

 Score = 29.9 bits (68), Expect = 1.7
 Identities = 9/19 (47%), Positives = 15/19 (78%)

Query: 70 ALIIGLILLIIIIVTIVGN 88
          A++IGL  L+I++ +IV N
Sbjct: 22 AILIGLAFLVILLGSIVSN 40


>gnl|CDD|198415 cd10433, YccA_like, YccA-like proteins.  A prokaryotic member of
           the BAX inhibitor (BI)-1 like family of small
           transmembrane proteins, Escherichia coli YccA, has been
           shown to interact with ATP-dependent protease FtsH,
           which degrades abnormal membrane proteins as part of a
           quality control mechanism to keep the integrity of
           biological membranes.
          Length = 205

 Score = 30.2 bits (69), Expect = 1.7
 Identities = 25/100 (25%), Positives = 41/100 (41%), Gaps = 26/100 (26%)

Query: 71  LIIGLILLIIIIVTIVGNILVCIAVCIVRKLRRPCNYLLVSLAVSDLCVAILVMPMALLY 130
           L +G I   ++++  V NI + I                +SLA+S     +L     +LY
Sbjct: 125 LFVGAI---VLLLAAVANIFLQIPA--------------LSLAIS--AAFVLFSSGFILY 165

Query: 131 EVS-----GKWNFGFMMCNLWVSFDVLSCTASILNLCMIS 165
           + S     G+ N+     +L+VS   L    S+LNL    
Sbjct: 166 DTSRIIHGGETNYIMATVSLYVSIYNL--FVSLLNLLGAF 203


>gnl|CDD|226685 COG4232, COG4232, Thiol:disulfide interchange protein
           [Posttranslational modification, protein turnover,
           chaperones / Energy production and conversion].
          Length = 569

 Score = 30.8 bits (70), Expect = 1.8
 Identities = 18/94 (19%), Positives = 31/94 (32%), Gaps = 15/94 (15%)

Query: 60  LSHSPYSPLQALIIGLILLIIIIVTIVGNILVCIAVCIVRKLRRPCNYLLVSLAVSDLCV 119
            +   +S L A + GL+L     V  +  +L    + +    R         L+      
Sbjct: 163 AATLKWSLLLAFLGGLLLNFTPCVLPMLPLLS--GIVLGSAKRASKARAFG-LSF----- 214

Query: 120 AILVMPMALLYEVSG------KWNFGFMMCNLWV 147
            + V  MAL Y + G         +   +   WV
Sbjct: 215 -VYVQGMALAYTLLGLVAAAAGLGWQAQLQQPWV 247


>gnl|CDD|204597 pfam11151, DUF2929, Protein of unknown function (DUF2929).  This
          family of proteins with unknown function appears to be
          restricted to Firmicutes.
          Length = 57

 Score = 27.6 bits (62), Expect = 2.2
 Identities = 9/29 (31%), Positives = 17/29 (58%)

Query: 57 LGLLSHSPYSPLQALIIGLILLIIIIVTI 85
          +  L+   Y+ L A+II +I  I++ + I
Sbjct: 21 VSSLTGGTYNFLTAIIIAVIFGILVFLLI 49


>gnl|CDD|100079 cd06158, S2P-M50_like_1, Uncharacterized homologs of Site-2
           protease (S2P), zinc metalloproteases (MEROPS family
           M50) which cleave transmembrane domains of substrate
           proteins, regulating intramembrane proteolysis (RIP) of
           diverse signal transduction mechanisms. Members of the
           S2P/M50 family of RIP proteases use proteolytic activity
           within the membrane to transfer information across
           membranes to integrate gene expression with physiologic
           stresses occurring in another cellular compartment. In
           eukaryotic cells they regulate such processes as sterol
           and lipid metabolism, and endoplasmic reticulum stress
           responses. In prokaryotes they regulate such processes
           as sporulation, cell division, stress response, and cell
           differentiation. This group includes bacterial,
           eukaryotic, and Archaeal S2P/M50s homologs with a
           minimal core protein and no PDZ domains.
          Length = 181

 Score = 29.1 bits (66), Expect = 3.2
 Identities = 28/99 (28%), Positives = 47/99 (47%), Gaps = 16/99 (16%)

Query: 72  IIGLILLIIIIVTIVGNILVCIA--VCIV-RKLRRPCN-YLLVSLA--VSDLCVAIL-VM 124
            IG I+L +++  + G      A  V +  R  + P    LLVSLA  +S+L +A+L  +
Sbjct: 47  PIGTIILPLLLPFLFG-----WAKPVPVNPRNFKNPRRGMLLVSLAGPLSNLLLALLFAL 101

Query: 125 PMALLYEVSGKWN-FGFMMCNLWVSFDVLSCTASILNLC 162
            + LL    G    F F+M    V   +++   ++ NL 
Sbjct: 102 LLRLLPAFGGVVASFLFLMLAYGV---LINLVLAVFNLL 137


>gnl|CDD|215647 pfam00002, 7tm_2, 7 transmembrane receptor (Secretin family).  This
           family is known as Family B, the secretin-receptor
           family or family 2 of the G-protein-coupled receptors
           (GCPRs).They have been described in many animal species,
           but not in plants, fungi or prokaryotes. Three distinct
           sub-families are recognised. Subfamily B1 contains
           classical hormone receptors, such as receptors for
           secretin and glucagon, that are all involved in
           cAMP-mediated signalling pathways. Subfamily B2 contains
           receptors with long extracellular N-termini, such as the
           leukocyte cell-surface antigen CD97 ;
           calcium-independent receptors for latrotoxin, and
           brain-specific angiogenesis inhibitors amongst others.
           Subfamily B3 includes Methuselah and other Drosophila
           proteins. Other than the typical seven-transmembrane
           region, characteristic structural features include an
           amino-terminal extracellular domain involved in ligand
           binding, and an intracellular loop (IC3) required for
           specific G-protein coupling.
          Length = 239

 Score = 29.1 bits (66), Expect = 3.9
 Identities = 16/50 (32%), Positives = 25/50 (50%), Gaps = 7/50 (14%)

Query: 82  IVTIVGNI--LVCIAVCIV-----RKLRRPCNYLLVSLAVSDLCVAILVM 124
           ++  VG    LV + + I      RKLR   NY+ ++L  S +  AIL +
Sbjct: 6   VIYTVGYSLSLVALLLAIAIFLLFRKLRCTRNYIHLNLFASFILRAILFL 55


>gnl|CDD|219227 pfam06912, DUF1275, Protein of unknown function (DUF1275).  This
           family consists of several hypothetical bacterial
           proteins of around 200 residues in length. The function
           of this family is unknown although a few members are
           thought to be membrane proteins.
          Length = 206

 Score = 28.3 bits (64), Expect = 6.4
 Identities = 15/74 (20%), Positives = 34/74 (45%), Gaps = 7/74 (9%)

Query: 53  NYTRLGL-LSHSPYSPLQALIIGLILLIIIIVTIVGNILVCIAVCIVRKLRRPCNYLLVS 111
           N   LG+ L+   ++        L  L+ ++  ++G  L  + + ++R+ R      L+ 
Sbjct: 33  NTVLLGVALAGGNWALA------LHKLLALLAFVLGAALAGLLIRLLRRRRLRRWLALLL 86

Query: 112 LAVSDLCVAILVMP 125
           L  + L +A  ++P
Sbjct: 87  LLEAVLLLAAALLP 100


>gnl|CDD|119392 cd06174, MFS, The Major Facilitator Superfamily (MFS) is a large
           and diverse group of secondary transporters that
           includes uniporters, symporters, and antiporters. MFS
           proteins facilitate the transport across cytoplasmic or
           internal membranes of a variety of substrates including
           ions, sugar phosphates, drugs, neurotransmitters,
           nucleosides, amino acids, and peptides. They do so using
           the electrochemical potential of the transported
           substrates. Uniporters transport a single substrate,
           while symporters and antiporters transport two
           substrates in the same or in opposite directions,
           respectively, across membranes. MFS proteins are
           typically 400 to 600 amino acids in length, and the
           majority contain 12 transmembrane alpha helices (TMs)
           connected by hydrophilic loops. The N- and C-terminal
           halves of these proteins display weak similarity and may
           be the result of a gene duplication/fusion event. Based
           on kinetic studies and the structures of a few bacterial
           superfamily members, GlpT (glycerol-3-phosphate
           transporter), LacY (lactose permease), and EmrD
           (multidrug transporter), MFS proteins are thought to
           function through a single substrate binding site,
           alternating-access mechanism involving a rocker-switch
           type of movement. Bacterial members function primarily
           for nutrient uptake, and as drug-efflux pumps to confer
           antibiotic resistance. Some MFS proteins have medical
           significance in humans such as the glucose transporter
           Glut4, which is impaired in type II diabetes, and
           glucose-6-phosphate transporter (G6PT), which causes
           glycogen storage disease when mutated.
          Length = 352

 Score = 28.8 bits (65), Expect = 6.4
 Identities = 13/55 (23%), Positives = 28/55 (50%), Gaps = 5/55 (9%)

Query: 73  IGLILLIIIIVTIVGNILVCIAVCIVRKLRRPCNYLLVSLAVSDLCVAILVMPMA 127
            GL+L +  +  I+G +L  +   +  +L R    LL+ L +    + +L++ +A
Sbjct: 213 AGLLLSLFGLGGILGALLGGL---LSDRLGRRRLLLLIGLLL--AALGLLLLALA 262


>gnl|CDD|215733 pfam00122, E1-E2_ATPase, E1-E2 ATPase. 
          Length = 222

 Score = 28.3 bits (64), Expect = 7.8
 Identities = 22/77 (28%), Positives = 35/77 (45%), Gaps = 14/77 (18%)

Query: 61  SHSPYSPLQALIIGLILLIIIIVTIVGNILVCIAVCIVRKLRR--PCNYLLVSLAVSDLC 118
           + S  +PLQ L+  L  +++ IV     + + I V ++   R       LL +LAV  L 
Sbjct: 133 AKSAKTPLQRLLDKLAKILVPIV-----LALAILVFLIWFFRGGDFLEALLRALAV--LV 185

Query: 119 VAI-----LVMPMALLY 130
            A      L +P+AL  
Sbjct: 186 AACPEALPLAVPLALAV 202


>gnl|CDD|219559 pfam07760, DUF1616, Protein of unknown function (DUF1616).  This is
           a family of sequences from hypothetical archaeal
           proteins. The region in question is approximately 330
           amino acid residues long.
          Length = 290

 Score = 28.5 bits (64), Expect = 9.1
 Identities = 14/59 (23%), Positives = 21/59 (35%)

Query: 37  ISRFSLTNDTSPASFHNYTRLGLLSHSPYSPLQALIIGLILLIIIIVTIVGNILVCIAV 95
           + R  L ND             L          +  + L+LLII I+ +VG +   I  
Sbjct: 106 LRRRRLPNDERFIVLFRALDSALKIGLYLPNSYSDNVLLVLLIISILALVGLLGYVILF 164


>gnl|CDD|217547 pfam03419, Peptidase_U4, Sporulation factor SpoIIGA. 
          Length = 291

 Score = 28.3 bits (64), Expect = 9.3
 Identities = 25/163 (15%), Positives = 52/163 (31%), Gaps = 36/163 (22%)

Query: 70  ALIIGLILLIIII------VTIVGNILV-CIAVCIVRKLRRPCNYL-------LVSLAVS 115
           AL+  L +L++ I         +  +L   + V I    +R   +L        V+    
Sbjct: 40  ALLGSLYVLLLFIPFLPIFFHPLFKLLFSVLMVLIAFGPKRFRFFLKTLLIFYFVTFLFG 99

Query: 116 DLCVAILVMPMALLYEVSGKWNFGFMMCNLWVSFDVLSCTASILNL---CMISVDRYYAI 172
               A+                 G ++ N      +     S   +     I    Y   
Sbjct: 100 GGLFALHYF-----------TGTGSLVQNGIFGDPI-----SWKFVLIGFPIL---YIFS 140

Query: 173 TKPLEYGVKRTPKRMILCVVFVWLGAACISLPPLIVLGNDYRE 215
            +  EY  +R  +  ++  V + +    +++  L+  GN  R+
Sbjct: 141 KRRWEYIKRRKIQYELIVEVEIVINGKKLNVKGLVDTGNQLRD 183


>gnl|CDD|224397 COG1480, COG1480, Predicted membrane-associated HD superfamily
           hydrolase [General function prediction only].
          Length = 700

 Score = 28.5 bits (64), Expect = 9.3
 Identities = 21/118 (17%), Positives = 41/118 (34%), Gaps = 20/118 (16%)

Query: 56  RLGLLSHS-PYSPLQALIIGLILLIIIIV---------------TIVGNILVCIAVCIVR 99
            LGLLS S    PL  L+I +I LI++                  ++  + + I    + 
Sbjct: 264 LLGLLSLSVNILPLLGLLILVIFLILLFALYERRTKSPLKLRNSLLLLYLSLAILTLSLL 323

Query: 100 KLRRPCNYLLVSLAV-SDLCVAILVMPMALLYEVSGKWNF---GFMMCNLWVSFDVLS 153
           ++    NY    L V   L   +L++ + L   +            +     + ++  
Sbjct: 324 RIVGYFNYSASGLLVPPALGPMLLILLVFLRIAIFSSSMIAIALLYLFGGSYNSEIAL 381


>gnl|CDD|113196 pfam04415, DUF515, Protein of unknown function (DUF515).  Family
          of hypothetical Archaeal proteins.
          Length = 416

 Score = 28.4 bits (63), Expect = 9.7
 Identities = 9/21 (42%), Positives = 15/21 (71%)

Query: 69 QALIIGLILLIIIIVTIVGNI 89
          + LIIG ++L +II+ +V  I
Sbjct: 30 KRLIIGALVLSVIILLLVFAI 50


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.325    0.137    0.431 

Gapped
Lambda     K      H
   0.267   0.0635    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 24,212,103
Number of extensions: 2320532
Number of successful extensions: 3766
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3734
Number of HSP's successfully gapped: 73
Length of query: 483
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 382
Effective length of database: 6,457,848
Effective search space: 2466897936
Effective search space used: 2466897936
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 61 (27.5 bits)