RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy980
(587 letters)
>gnl|CDD|217991 pfam04258, Peptidase_A22B, Signal peptide peptidase. The members
of this family are membrane proteins. In some proteins
this region is found associated with pfam02225. This
family corresponds with Merops subfamily A22B, the type
example of which is signal peptide peptidase. There is a
sequence-similarity relationship with pfam01080.
Length = 291
Score = 231 bits (590), Expect = 2e-71
Identities = 110/251 (43%), Positives = 151/251 (60%), Gaps = 29/251 (11%)
Query: 4 LLLTGYFFLLGVLALCYLLSPVISPLVPAAIPNIPFHLKFDRGATNEEKKDGSEALLVIF 63
L+LTGYF +LG +AL +L+ P GA KK L +
Sbjct: 44 LVLTGYFCILGAIALAFLILPPC------------IRRLSFMGAYRFPKKKSK---LELK 88
Query: 64 DYKFSSHDIVCFIVCSVFG-SWYLVKK-HWIANNLFGLAFAVNGIELLHLNNVMIGVILL 121
+ +F+ ++V ++C VF WY+++K HWI ++ G+A +N I++L L N+ +G ILL
Sbjct: 89 NGRFTYSELVALLLCIVFAVWWYVLRKEHWILQDILGIALCINVIKILRLPNLKVGTILL 148
Query: 122 CGLFIYDIFWVF------GTNVMVTVAKS-FEA----PIKLVFPQDLLEHGVSANNFAML 170
GLF YDIFWVF GT+VMVTVA F+A P+KLVFP+ + + F+ML
Sbjct: 149 SGLFFYDIFWVFISPYIFGTSVMVTVATGPFDAAEKLPMKLVFPRLND-YPGNWGPFSML 207
Query: 171 GLGDIVVPGIFIALLLRFDLSLNRRSNTYFNTAFLAYFLGLMATIFVMHVFKHAQPALLY 230
GLGDIV+PG+ IA LRFD+S N+ S TYF + +AY LGL+ T +++FK AQPALLY
Sbjct: 208 GLGDIVMPGLLIAFCLRFDISKNKSSRTYFISTMIAYGLGLLITFVALNLFKAAQPALLY 267
Query: 231 LVPACLGLPLL 241
LVP LG LL
Sbjct: 268 LVPCTLGTLLL 278
Score = 165 bits (419), Expect = 1e-46
Identities = 98/317 (30%), Positives = 145/317 (45%), Gaps = 87/317 (27%)
Query: 301 TAFLAYFLGLMATIFVMHVFKHAQPALLYLFFSKDMIN--LLLTGYFFLLGVLALCYLLS 358
+ A + A+ ++ ++ L + SK+ IN L+LTGYF +LG +AL +L+
Sbjct: 10 DSRHAIIFPISASCTLLGLY------LFFKSLSKEYINSVLVLTGYFCILGAIALAFLIL 63
Query: 359 PVISPLVPAAIPNIPFHLKFDRGATNEEKKDGSFDRGATNEEKKDGSEALLVIFDYKFSS 418
P GA KK + + + +F+
Sbjct: 64 PPC------------IRRLSFMGAYRFPKKKSKLE-----------------LKNGRFTY 94
Query: 419 HDIVCFIVCSVFG-SWYLVKK-HWIANNLFGLAFAVNGIELLHLNNVMIGVILLCGLFIY 476
++V ++C VF WY+++K HWI ++ G+A +N I++L L N+ +G ILL GLF Y
Sbjct: 95 SELVALLLCIVFAVWWYVLRKEHWILQDILGIALCINVIKILRLPNLKVGTILLSGLFFY 154
Query: 477 DIFWVF------GTNVMVTVAKS-FEA----PIKYV------------------------ 501
DIFWVF GT+VMVTVA F+A P+K V
Sbjct: 155 DIFWVFISPYIFGTSVMVTVATGPFDAAEKLPMKLVFPRLNDYPGNWGPFSMLGLGDIVM 214
Query: 502 -------------HESFKGLTQWFSNFFAWHLGLMATIFVMHVFKHAQPALLYLVPACLG 548
++ T + S A+ LGL+ T +++FK AQPALLYLVP LG
Sbjct: 215 PGLLIAFCLRFDISKNKSSRTYFISTMIAYGLGLLITFVALNLFKAAQPALLYLVPCTLG 274
Query: 549 LPLLVALVKGDLSALIN 565
LLVAL +G+L L N
Sbjct: 275 TLLLVALWRGELKKLWN 291
Score = 60.4 bits (147), Expect = 5e-10
Identities = 22/41 (53%), Positives = 29/41 (70%)
Query: 290 SLNRRSNTYFNTAFLAYFLGLMATIFVMHVFKHAQPALLYL 330
S N+ S TYF + +AY LGL+ T +++FK AQPALLYL
Sbjct: 228 SKNKSSRTYFISTMIAYGLGLLITFVALNLFKAAQPALLYL 268
>gnl|CDD|214793 smart00730, PSN, Presenilin, signal peptide peptidase, family.
Presenilin 1 and presenilin 2 are polytopic membrane
proteins, whose genes are mutated in some individuals
with Alzheimer's disease. Distant homologues, present in
eukaryotes and archaea, also contain conserved aspartic
acid residues which are predicted to contribute to
catalysis. At least one member of this family has been
shown to possess signal peptide peptidase activity.
Length = 249
Score = 156 bits (397), Expect = 5e-44
Identities = 76/199 (38%), Positives = 104/199 (52%), Gaps = 15/199 (7%)
Query: 55 GSEALLVIFDYKFSSHDIVCFIVCSVFGSWYLVKK-HWIANNLFGLAFAVNGIELLHLNN 113
L +F ++ + +V G W + +K WI +L G++ + + +L L +
Sbjct: 49 SLLYPLEVFRV-DYPTLLILLLNFAVVGFWCIHRKGAWIQQDLIGISLCMAILFILRLPS 107
Query: 114 VMIGVILLCGLFIYDIFWVFGT----NVMVTVAKSFEAPIKLVFPQDLLEHGVS------ 163
ILL LFIYDIF VFGT VMV VA + PI VFP L +
Sbjct: 108 EWTAWILLGALFIYDIFAVFGTPGPLRVMVEVATGRDEPI-KVFPALLYVPRLVVSFEDD 166
Query: 164 -ANNFAMLGLGDIVVPGIFIALLLRFDLSLNRRSNTYFNTAFLAYFLGLMATIFVMHVFK 222
F+MLGLGDIV PGI +A RFD+S+ SN + F+AY +GL+ T+ ++ +FK
Sbjct: 167 EEERFSMLGLGDIVFPGILVASAARFDVSVRSDSNYFLAC-FVAYGIGLILTLVLLALFK 225
Query: 223 HAQPALLYLVPACLGLPLL 241
AQPAL YLVP L LL
Sbjct: 226 KAQPALPYLVPFTLVFYLL 244
Score = 101 bits (254), Expect = 3e-24
Identities = 66/279 (23%), Positives = 99/279 (35%), Gaps = 98/279 (35%)
Query: 327 LLYLFFSKDMINLLLTGYFFLLGVLALCYLLSPVISPLVPAAIPNIPFHLKFDRGATNEE 386
L+ L+ + ++L YF LGVL L LL P+ V
Sbjct: 21 LVLLYKFFKYLVIVLVIYFSSLGVLFLYSLLYPLEVFRV--------------------- 59
Query: 387 KKDGSFDRGATNEEKKDGSEALLVIFDYKFSSHDIVCFIVCSVFGSWYLVKK-HWIANNL 445
++ + +V G W + +K WI +L
Sbjct: 60 -----------------------------DYPTLLILLLNFAVVGFWCIHRKGAWIQQDL 90
Query: 446 FGLAFAVNGIELLHLNNVMIGVILLCGLFIYDIFWVFGT----NVMVTVAKSFEAPI--- 498
G++ + + +L L + ILL LFIYDIF VFGT VMV VA + PI
Sbjct: 91 IGISLCMAILFILRLPSEWTAWILLGALFIYDIFAVFGTPGPLRVMVEVATGRDEPIKVF 150
Query: 499 ----------------------------------------KYVHESFKGLTQWFSNFFAW 518
++ + + F A+
Sbjct: 151 PALLYVPRLVVSFEDDEEERFSMLGLGDIVFPGILVASAARFDVSVRSDSNYFLACFVAY 210
Query: 519 HLGLMATIFVMHVFKHAQPALLYLVPACLGLPLLVALVK 557
+GL+ T+ ++ +FK AQPAL YLVP L LL AL++
Sbjct: 211 GIGLILTLVLLALFKKAQPALPYLVPFTLVFYLLTALLR 249
Score = 45.7 bits (109), Expect = 3e-05
Identities = 18/38 (47%), Positives = 26/38 (68%)
Query: 293 RRSNTYFNTAFLAYFLGLMATIFVMHVFKHAQPALLYL 330
R + YF F+AY +GL+ T+ ++ +FK AQPAL YL
Sbjct: 197 RSDSNYFLACFVAYGIGLILTLVLLALFKKAQPALPYL 234
>gnl|CDD|219084 pfam06550, DUF1119, Protein of unknown function (DUF1119). This
family consists of several hypothetical archaeal
proteins of unknown function.
Length = 283
Score = 38.2 bits (89), Expect = 0.009
Identities = 47/188 (25%), Positives = 73/188 (38%), Gaps = 14/188 (7%)
Query: 58 ALLVIFDYKFSSHDIVCFIVCSVFGSWYLVKKHWIANNLFGLAFAVNGIELLHLN-NVMI 116
LL IF + +I ++ W +L G AV L ++ V
Sbjct: 79 ILLSIFLVNAGAINIASILLAIGLFYLLYFYPEWYVIDLAGFFLAVGISALWGISFGVWP 138
Query: 117 GVILLCGLFIYDIFWVFGTNVMVTVAK---SFEAPIKLVFPQDL--------LEHGVSAN 165
V+LL L +YD V+ T M+T+A+ + PI V P++L E+
Sbjct: 139 AVVLLIALAVYDAISVYKTKHMITLARGVMDLDLPILFVIPENLDYSFIEESFENRGERE 198
Query: 166 NFA--MLGLGDIVVPGIFIALLLRFDLSLNRRSNTYFNTAFLAYFLGLMATIFVMHVFKH 223
+GLGD V+P I + F +S +YF +A L MA ++ +
Sbjct: 199 AGNALFIGLGDAVMPSILVVSAAVFLISPILGGISYFTLLAMAGTLVGMAVLYFLVNRGR 258
Query: 224 AQPALLYL 231
L YL
Sbjct: 259 PAAGLPYL 266
>gnl|CDD|225924 COG3389, COG3389, Uncharacterized protein conserved in archaea
[Function unknown].
Length = 277
Score = 36.7 bits (85), Expect = 0.025
Identities = 44/195 (22%), Positives = 74/195 (37%), Gaps = 35/195 (17%)
Query: 58 ALLVIFDYKFSSHDIVCFIVCSVFGSWYLVKKHWIANNLFGLAFAVNGIELLHLN-NVMI 116
LL IF + +I + W +L G AV + ++ V+
Sbjct: 79 ILLSIFLVLVYAINIASIGLAIGLVYLLYKYPEWYVIDLAGFFLAVGIAAIFGISFGVLP 138
Query: 117 GVILLCGLFIYDIFWVFGTNVMVTVAKS---FEAPIKLVFPQDL--------LEHGVSAN 165
V+LL L +YD V+ T M+++A+ + P+ V P++L E+ N
Sbjct: 139 AVVLLIALAVYDAISVYKTRHMISLAEGVMDLDLPMLFVIPENLAYSFVEDAFENRGDGN 198
Query: 166 NFAMLGLGDIVVPGIFIALLLRFDLSLNRRSNTYFNTAFLAY---------FLGLMATIF 216
+ M+GLGD V+P I + + + + LA+ +GL F
Sbjct: 199 AY-MIGLGDAVMPSILVV-----------SAAFFLISPILAFIVLPALAGGLVGLAVLYF 246
Query: 217 VMHVFKHAQPALLYL 231
V P L +L
Sbjct: 247 VNR--GRPHPGLPFL 259
>gnl|CDD|225514 COG2966, COG2966, Uncharacterized conserved protein [Function
unknown].
Length = 250
Score = 33.8 bits (78), Expect = 0.18
Identities = 29/106 (27%), Positives = 46/106 (43%), Gaps = 12/106 (11%)
Query: 145 FEAPIKLVFPQDLLEHGVSANNFAML-GLGDIVVPGIFIALLL--RFDLSLNRRSNTYFN 201
+ P++ LL G++A FA+L G G + F A LL L+R+ N F
Sbjct: 117 QKQPLRYSRWLVLLMAGLAAAAFALLFGGGWLDFLIAFFAGLLGFLLRQYLSRKGNPDFF 176
Query: 202 TAFLAYFL-GLMATIFVMHVFKH-----AQPALLYLVPACLGLPLL 241
LA F+ ++A +F +++ LVP G+PL
Sbjct: 177 FEVLASFIASIVAVLFGSLGLGITPDALIIGSIMLLVP---GVPLT 219
>gnl|CDD|133463 cd06853, GT_WecA_like, This subfamily contains Escherichia coli
WecA, Bacillus subtilis TagO and related proteins. WecA
is an UDP-N-acetylglucosamine
(GlcNAc):undecaprenyl-phosphate (Und-P)
GlcNAc-1-phosphate transferase that catalyzes the
formation of a phosphodiester bond between a
membrane-associated undecaprenyl-phosphate molecule and
N-acetylglucosamine 1-phosphate, which is usually
donated by a soluble UDP-N-acetylglucosamine precursor.
WecA participates in the biosynthesis of O antigen LPS
in many enteric bacteria and is also involved in the
biosynthesis of enterobacterial common antigen. A
conserved short sequence motif and a conserved arginine
at a cytosolic loop of this integral membrane protein
were shown to be critical in recognition of substrate
UDP-N-acetylglucosamine.
Length = 249
Score = 30.1 bits (69), Expect = 2.9
Identities = 18/64 (28%), Positives = 29/64 (45%), Gaps = 8/64 (12%)
Query: 206 AYFLGLMATIFVMHVFKHAQPALLYLVP-ACLGLPLL-----IIARISLMDNRYPTAGQR 259
+ FLG + + + + + A+ +VP L +PL II R+ + R P R
Sbjct: 180 SLFLGFLLAVLSILGTQKSSTAISPVVPLLILAVPLFDTLFVIIRRL--LRGRSPFQADR 237
Query: 260 SHLH 263
HLH
Sbjct: 238 DHLH 241
>gnl|CDD|235412 PRK05329, PRK05329, anaerobic glycerol-3-phosphate dehydrogenase
subunit B; Validated.
Length = 422
Score = 29.0 bits (66), Expect = 7.6
Identities = 12/41 (29%), Positives = 18/41 (43%), Gaps = 1/41 (2%)
Query: 534 HAQPALLYLVPACLGLPLLVALVKGDLSALINVVVSEVDMG 574
A A L+PA LGL A++ +L + V E+
Sbjct: 213 LAGDAEAVLLPAVLGLDDDAAVLA-ELEEALGCPVFELPTL 252
>gnl|CDD|227583 COG5258, GTPBP1, GTPase [General function prediction only].
Length = 527
Score = 29.0 bits (65), Expect = 8.5
Identities = 18/77 (23%), Positives = 25/77 (32%), Gaps = 10/77 (12%)
Query: 141 VAKSFEAPIKLV--FPQDLLEHGVSANNFAMLGLGDIVVPGIFI--------ALLLRFDL 190
V + A +K V P + + + G VVP + LL F L
Sbjct: 276 VVEEISALLKRVGRIPLIVKDTDDVVLAAKAMKAGRGVVPIFYTSSVTGEGLDLLDEFFL 335
Query: 191 SLNRRSNTYFNTAFLAY 207
L +R FL Y
Sbjct: 336 LLPKRRRWDDEGPFLMY 352
>gnl|CDD|224397 COG1480, COG1480, Predicted membrane-associated HD superfamily
hydrolase [General function prediction only].
Length = 700
Score = 28.9 bits (65), Expect = 8.8
Identities = 25/134 (18%), Positives = 53/134 (39%), Gaps = 21/134 (15%)
Query: 455 IELLHLNNVMIGVILLCGLFIYDIF---WVFGTNVMVTVAKSFEAPIKYVHESFKGLTQW 511
++LL L ++ + ++ L GL I IF + ++P+K
Sbjct: 262 LDLLGLLSLSVNILPLLGLLILVIFLILLFAL------YERRTKSPLKL----------R 305
Query: 512 FSNFFAWHLGLMATIFVMH-VFKHAQPALLYLVPACLGLPLLVALVKGDLSALINVVVSE 570
S + + T+ ++ V A LVP LG LL+ LV ++ + +++
Sbjct: 306 NSLLLLYLSLAILTLSLLRIVGYFNYSASGLLVPPALGPMLLILLVFLRIAIFSSSMIA- 364
Query: 571 VDMGLAVGWWFDTM 584
+ + G +++
Sbjct: 365 IALLYLFGGSYNSE 378
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.331 0.145 0.458
Gapped
Lambda K H
0.267 0.0760 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 31,113,429
Number of extensions: 3209166
Number of successful extensions: 4832
Number of sequences better than 10.0: 1
Number of HSP's gapped: 4795
Number of HSP's successfully gapped: 131
Length of query: 587
Length of database: 10,937,602
Length adjustment: 102
Effective length of query: 485
Effective length of database: 6,413,494
Effective search space: 3110544590
Effective search space used: 3110544590
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.9 bits)
S2: 62 (27.6 bits)