RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy980
         (587 letters)



>gnl|CDD|217991 pfam04258, Peptidase_A22B, Signal peptide peptidase.  The members
           of this family are membrane proteins. In some proteins
           this region is found associated with pfam02225. This
           family corresponds with Merops subfamily A22B, the type
           example of which is signal peptide peptidase. There is a
           sequence-similarity relationship with pfam01080.
          Length = 291

 Score =  231 bits (590), Expect = 2e-71
 Identities = 110/251 (43%), Positives = 151/251 (60%), Gaps = 29/251 (11%)

Query: 4   LLLTGYFFLLGVLALCYLLSPVISPLVPAAIPNIPFHLKFDRGATNEEKKDGSEALLVIF 63
           L+LTGYF +LG +AL +L+ P                     GA    KK      L + 
Sbjct: 44  LVLTGYFCILGAIALAFLILPPC------------IRRLSFMGAYRFPKKKSK---LELK 88

Query: 64  DYKFSSHDIVCFIVCSVFG-SWYLVKK-HWIANNLFGLAFAVNGIELLHLNNVMIGVILL 121
           + +F+  ++V  ++C VF   WY+++K HWI  ++ G+A  +N I++L L N+ +G ILL
Sbjct: 89  NGRFTYSELVALLLCIVFAVWWYVLRKEHWILQDILGIALCINVIKILRLPNLKVGTILL 148

Query: 122 CGLFIYDIFWVF------GTNVMVTVAKS-FEA----PIKLVFPQDLLEHGVSANNFAML 170
            GLF YDIFWVF      GT+VMVTVA   F+A    P+KLVFP+    +  +   F+ML
Sbjct: 149 SGLFFYDIFWVFISPYIFGTSVMVTVATGPFDAAEKLPMKLVFPRLND-YPGNWGPFSML 207

Query: 171 GLGDIVVPGIFIALLLRFDLSLNRRSNTYFNTAFLAYFLGLMATIFVMHVFKHAQPALLY 230
           GLGDIV+PG+ IA  LRFD+S N+ S TYF +  +AY LGL+ T   +++FK AQPALLY
Sbjct: 208 GLGDIVMPGLLIAFCLRFDISKNKSSRTYFISTMIAYGLGLLITFVALNLFKAAQPALLY 267

Query: 231 LVPACLGLPLL 241
           LVP  LG  LL
Sbjct: 268 LVPCTLGTLLL 278



 Score =  165 bits (419), Expect = 1e-46
 Identities = 98/317 (30%), Positives = 145/317 (45%), Gaps = 87/317 (27%)

Query: 301 TAFLAYFLGLMATIFVMHVFKHAQPALLYLFFSKDMIN--LLLTGYFFLLGVLALCYLLS 358
            +  A    + A+  ++ ++      L +   SK+ IN  L+LTGYF +LG +AL +L+ 
Sbjct: 10  DSRHAIIFPISASCTLLGLY------LFFKSLSKEYINSVLVLTGYFCILGAIALAFLIL 63

Query: 359 PVISPLVPAAIPNIPFHLKFDRGATNEEKKDGSFDRGATNEEKKDGSEALLVIFDYKFSS 418
           P                     GA    KK    +                 + + +F+ 
Sbjct: 64  PPC------------IRRLSFMGAYRFPKKKSKLE-----------------LKNGRFTY 94

Query: 419 HDIVCFIVCSVFG-SWYLVKK-HWIANNLFGLAFAVNGIELLHLNNVMIGVILLCGLFIY 476
            ++V  ++C VF   WY+++K HWI  ++ G+A  +N I++L L N+ +G ILL GLF Y
Sbjct: 95  SELVALLLCIVFAVWWYVLRKEHWILQDILGIALCINVIKILRLPNLKVGTILLSGLFFY 154

Query: 477 DIFWVF------GTNVMVTVAKS-FEA----PIKYV------------------------ 501
           DIFWVF      GT+VMVTVA   F+A    P+K V                        
Sbjct: 155 DIFWVFISPYIFGTSVMVTVATGPFDAAEKLPMKLVFPRLNDYPGNWGPFSMLGLGDIVM 214

Query: 502 -------------HESFKGLTQWFSNFFAWHLGLMATIFVMHVFKHAQPALLYLVPACLG 548
                         ++    T + S   A+ LGL+ T   +++FK AQPALLYLVP  LG
Sbjct: 215 PGLLIAFCLRFDISKNKSSRTYFISTMIAYGLGLLITFVALNLFKAAQPALLYLVPCTLG 274

Query: 549 LPLLVALVKGDLSALIN 565
             LLVAL +G+L  L N
Sbjct: 275 TLLLVALWRGELKKLWN 291



 Score = 60.4 bits (147), Expect = 5e-10
 Identities = 22/41 (53%), Positives = 29/41 (70%)

Query: 290 SLNRRSNTYFNTAFLAYFLGLMATIFVMHVFKHAQPALLYL 330
           S N+ S TYF +  +AY LGL+ T   +++FK AQPALLYL
Sbjct: 228 SKNKSSRTYFISTMIAYGLGLLITFVALNLFKAAQPALLYL 268


>gnl|CDD|214793 smart00730, PSN, Presenilin, signal peptide peptidase, family.
           Presenilin 1 and presenilin 2 are polytopic membrane
           proteins, whose genes are mutated in some individuals
           with Alzheimer's disease. Distant homologues, present in
           eukaryotes and archaea, also contain conserved aspartic
           acid residues which are predicted to contribute to
           catalysis. At least one member of this family has been
           shown to possess signal peptide peptidase activity.
          Length = 249

 Score =  156 bits (397), Expect = 5e-44
 Identities = 76/199 (38%), Positives = 104/199 (52%), Gaps = 15/199 (7%)

Query: 55  GSEALLVIFDYKFSSHDIVCFIVCSVFGSWYLVKK-HWIANNLFGLAFAVNGIELLHLNN 113
                L +F        ++  +  +V G W + +K  WI  +L G++  +  + +L L +
Sbjct: 49  SLLYPLEVFRV-DYPTLLILLLNFAVVGFWCIHRKGAWIQQDLIGISLCMAILFILRLPS 107

Query: 114 VMIGVILLCGLFIYDIFWVFGT----NVMVTVAKSFEAPIKLVFPQDLLEHGVS------ 163
                ILL  LFIYDIF VFGT     VMV VA   + PI  VFP  L    +       
Sbjct: 108 EWTAWILLGALFIYDIFAVFGTPGPLRVMVEVATGRDEPI-KVFPALLYVPRLVVSFEDD 166

Query: 164 -ANNFAMLGLGDIVVPGIFIALLLRFDLSLNRRSNTYFNTAFLAYFLGLMATIFVMHVFK 222
               F+MLGLGDIV PGI +A   RFD+S+   SN +    F+AY +GL+ T+ ++ +FK
Sbjct: 167 EEERFSMLGLGDIVFPGILVASAARFDVSVRSDSNYFLAC-FVAYGIGLILTLVLLALFK 225

Query: 223 HAQPALLYLVPACLGLPLL 241
            AQPAL YLVP  L   LL
Sbjct: 226 KAQPALPYLVPFTLVFYLL 244



 Score =  101 bits (254), Expect = 3e-24
 Identities = 66/279 (23%), Positives = 99/279 (35%), Gaps = 98/279 (35%)

Query: 327 LLYLFFSKDMINLLLTGYFFLLGVLALCYLLSPVISPLVPAAIPNIPFHLKFDRGATNEE 386
           L+ L+     + ++L  YF  LGVL L  LL P+    V                     
Sbjct: 21  LVLLYKFFKYLVIVLVIYFSSLGVLFLYSLLYPLEVFRV--------------------- 59

Query: 387 KKDGSFDRGATNEEKKDGSEALLVIFDYKFSSHDIVCFIVCSVFGSWYLVKK-HWIANNL 445
                                             ++  +  +V G W + +K  WI  +L
Sbjct: 60  -----------------------------DYPTLLILLLNFAVVGFWCIHRKGAWIQQDL 90

Query: 446 FGLAFAVNGIELLHLNNVMIGVILLCGLFIYDIFWVFGT----NVMVTVAKSFEAPI--- 498
            G++  +  + +L L +     ILL  LFIYDIF VFGT     VMV VA   + PI   
Sbjct: 91  IGISLCMAILFILRLPSEWTAWILLGALFIYDIFAVFGTPGPLRVMVEVATGRDEPIKVF 150

Query: 499 ----------------------------------------KYVHESFKGLTQWFSNFFAW 518
                                                   ++          + + F A+
Sbjct: 151 PALLYVPRLVVSFEDDEEERFSMLGLGDIVFPGILVASAARFDVSVRSDSNYFLACFVAY 210

Query: 519 HLGLMATIFVMHVFKHAQPALLYLVPACLGLPLLVALVK 557
            +GL+ T+ ++ +FK AQPAL YLVP  L   LL AL++
Sbjct: 211 GIGLILTLVLLALFKKAQPALPYLVPFTLVFYLLTALLR 249



 Score = 45.7 bits (109), Expect = 3e-05
 Identities = 18/38 (47%), Positives = 26/38 (68%)

Query: 293 RRSNTYFNTAFLAYFLGLMATIFVMHVFKHAQPALLYL 330
           R  + YF   F+AY +GL+ T+ ++ +FK AQPAL YL
Sbjct: 197 RSDSNYFLACFVAYGIGLILTLVLLALFKKAQPALPYL 234


>gnl|CDD|219084 pfam06550, DUF1119, Protein of unknown function (DUF1119).  This
           family consists of several hypothetical archaeal
           proteins of unknown function.
          Length = 283

 Score = 38.2 bits (89), Expect = 0.009
 Identities = 47/188 (25%), Positives = 73/188 (38%), Gaps = 14/188 (7%)

Query: 58  ALLVIFDYKFSSHDIVCFIVCSVFGSWYLVKKHWIANNLFGLAFAVNGIELLHLN-NVMI 116
            LL IF     + +I   ++             W   +L G   AV    L  ++  V  
Sbjct: 79  ILLSIFLVNAGAINIASILLAIGLFYLLYFYPEWYVIDLAGFFLAVGISALWGISFGVWP 138

Query: 117 GVILLCGLFIYDIFWVFGTNVMVTVAK---SFEAPIKLVFPQDL--------LEHGVSAN 165
            V+LL  L +YD   V+ T  M+T+A+     + PI  V P++L         E+     
Sbjct: 139 AVVLLIALAVYDAISVYKTKHMITLARGVMDLDLPILFVIPENLDYSFIEESFENRGERE 198

Query: 166 NFA--MLGLGDIVVPGIFIALLLRFDLSLNRRSNTYFNTAFLAYFLGLMATIFVMHVFKH 223
                 +GLGD V+P I +     F +S      +YF    +A  L  MA ++ +     
Sbjct: 199 AGNALFIGLGDAVMPSILVVSAAVFLISPILGGISYFTLLAMAGTLVGMAVLYFLVNRGR 258

Query: 224 AQPALLYL 231
               L YL
Sbjct: 259 PAAGLPYL 266


>gnl|CDD|225924 COG3389, COG3389, Uncharacterized protein conserved in archaea
           [Function unknown].
          Length = 277

 Score = 36.7 bits (85), Expect = 0.025
 Identities = 44/195 (22%), Positives = 74/195 (37%), Gaps = 35/195 (17%)

Query: 58  ALLVIFDYKFSSHDIVCFIVCSVFGSWYLVKKHWIANNLFGLAFAVNGIELLHLN-NVMI 116
            LL IF     + +I    +             W   +L G   AV    +  ++  V+ 
Sbjct: 79  ILLSIFLVLVYAINIASIGLAIGLVYLLYKYPEWYVIDLAGFFLAVGIAAIFGISFGVLP 138

Query: 117 GVILLCGLFIYDIFWVFGTNVMVTVAKS---FEAPIKLVFPQDL--------LEHGVSAN 165
            V+LL  L +YD   V+ T  M+++A+     + P+  V P++L         E+    N
Sbjct: 139 AVVLLIALAVYDAISVYKTRHMISLAEGVMDLDLPMLFVIPENLAYSFVEDAFENRGDGN 198

Query: 166 NFAMLGLGDIVVPGIFIALLLRFDLSLNRRSNTYFNTAFLAY---------FLGLMATIF 216
            + M+GLGD V+P I +             +  +  +  LA+          +GL    F
Sbjct: 199 AY-MIGLGDAVMPSILVV-----------SAAFFLISPILAFIVLPALAGGLVGLAVLYF 246

Query: 217 VMHVFKHAQPALLYL 231
           V        P L +L
Sbjct: 247 VNR--GRPHPGLPFL 259


>gnl|CDD|225514 COG2966, COG2966, Uncharacterized conserved protein [Function
           unknown].
          Length = 250

 Score = 33.8 bits (78), Expect = 0.18
 Identities = 29/106 (27%), Positives = 46/106 (43%), Gaps = 12/106 (11%)

Query: 145 FEAPIKLVFPQDLLEHGVSANNFAML-GLGDIVVPGIFIALLL--RFDLSLNRRSNTYFN 201
            + P++      LL  G++A  FA+L G G +     F A LL       L+R+ N  F 
Sbjct: 117 QKQPLRYSRWLVLLMAGLAAAAFALLFGGGWLDFLIAFFAGLLGFLLRQYLSRKGNPDFF 176

Query: 202 TAFLAYFL-GLMATIFVMHVFKH-----AQPALLYLVPACLGLPLL 241
              LA F+  ++A +F               +++ LVP   G+PL 
Sbjct: 177 FEVLASFIASIVAVLFGSLGLGITPDALIIGSIMLLVP---GVPLT 219


>gnl|CDD|133463 cd06853, GT_WecA_like, This subfamily contains Escherichia coli
           WecA, Bacillus subtilis TagO and related proteins. WecA
           is an UDP-N-acetylglucosamine
           (GlcNAc):undecaprenyl-phosphate (Und-P)
           GlcNAc-1-phosphate transferase that catalyzes the
           formation of a phosphodiester bond between a
           membrane-associated undecaprenyl-phosphate molecule and
           N-acetylglucosamine 1-phosphate, which is usually
           donated by a soluble UDP-N-acetylglucosamine precursor.
           WecA participates in the biosynthesis of O antigen LPS
           in many enteric bacteria and is also involved in the
           biosynthesis of enterobacterial common antigen. A
           conserved short sequence motif and a conserved arginine
           at a cytosolic loop of this integral membrane protein
           were shown to be critical in recognition of substrate
           UDP-N-acetylglucosamine.
          Length = 249

 Score = 30.1 bits (69), Expect = 2.9
 Identities = 18/64 (28%), Positives = 29/64 (45%), Gaps = 8/64 (12%)

Query: 206 AYFLGLMATIFVMHVFKHAQPALLYLVP-ACLGLPLL-----IIARISLMDNRYPTAGQR 259
           + FLG +  +  +   + +  A+  +VP   L +PL      II R+  +  R P    R
Sbjct: 180 SLFLGFLLAVLSILGTQKSSTAISPVVPLLILAVPLFDTLFVIIRRL--LRGRSPFQADR 237

Query: 260 SHLH 263
            HLH
Sbjct: 238 DHLH 241


>gnl|CDD|235412 PRK05329, PRK05329, anaerobic glycerol-3-phosphate dehydrogenase
           subunit B; Validated.
          Length = 422

 Score = 29.0 bits (66), Expect = 7.6
 Identities = 12/41 (29%), Positives = 18/41 (43%), Gaps = 1/41 (2%)

Query: 534 HAQPALLYLVPACLGLPLLVALVKGDLSALINVVVSEVDMG 574
            A  A   L+PA LGL    A++  +L   +   V E+   
Sbjct: 213 LAGDAEAVLLPAVLGLDDDAAVLA-ELEEALGCPVFELPTL 252


>gnl|CDD|227583 COG5258, GTPBP1, GTPase [General function prediction only].
          Length = 527

 Score = 29.0 bits (65), Expect = 8.5
 Identities = 18/77 (23%), Positives = 25/77 (32%), Gaps = 10/77 (12%)

Query: 141 VAKSFEAPIKLV--FPQDLLEHGVSANNFAMLGLGDIVVPGIFI--------ALLLRFDL 190
           V +   A +K V   P  + +          +  G  VVP  +          LL  F L
Sbjct: 276 VVEEISALLKRVGRIPLIVKDTDDVVLAAKAMKAGRGVVPIFYTSSVTGEGLDLLDEFFL 335

Query: 191 SLNRRSNTYFNTAFLAY 207
            L +R        FL Y
Sbjct: 336 LLPKRRRWDDEGPFLMY 352


>gnl|CDD|224397 COG1480, COG1480, Predicted membrane-associated HD superfamily
           hydrolase [General function prediction only].
          Length = 700

 Score = 28.9 bits (65), Expect = 8.8
 Identities = 25/134 (18%), Positives = 53/134 (39%), Gaps = 21/134 (15%)

Query: 455 IELLHLNNVMIGVILLCGLFIYDIF---WVFGTNVMVTVAKSFEAPIKYVHESFKGLTQW 511
           ++LL L ++ + ++ L GL I  IF               +  ++P+K            
Sbjct: 262 LDLLGLLSLSVNILPLLGLLILVIFLILLFAL------YERRTKSPLKL----------R 305

Query: 512 FSNFFAWHLGLMATIFVMH-VFKHAQPALLYLVPACLGLPLLVALVKGDLSALINVVVSE 570
            S    +    + T+ ++  V      A   LVP  LG  LL+ LV   ++   + +++ 
Sbjct: 306 NSLLLLYLSLAILTLSLLRIVGYFNYSASGLLVPPALGPMLLILLVFLRIAIFSSSMIA- 364

Query: 571 VDMGLAVGWWFDTM 584
           + +    G  +++ 
Sbjct: 365 IALLYLFGGSYNSE 378


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.331    0.145    0.458 

Gapped
Lambda     K      H
   0.267   0.0760    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 31,113,429
Number of extensions: 3209166
Number of successful extensions: 4832
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4795
Number of HSP's successfully gapped: 131
Length of query: 587
Length of database: 10,937,602
Length adjustment: 102
Effective length of query: 485
Effective length of database: 6,413,494
Effective search space: 3110544590
Effective search space used: 3110544590
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.9 bits)
S2: 62 (27.6 bits)