Query 023272
Match_columns 284
No_of_seqs 34 out of 36
Neff 2.7
Searched_HMMs 46136
Date Fri Mar 29 02:46:35 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/023272.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/023272hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF03703 bPH_2: Bacterial PH d 98.2 7.8E-07 1.7E-11 63.8 1.8 75 100-185 3-78 (80)
2 COG3402 Uncharacterized conser 97.4 0.00031 6.7E-09 61.9 6.3 87 84-192 65-155 (161)
3 COG3428 Predicted membrane pro 93.9 0.17 3.6E-06 51.4 7.0 107 68-195 237-343 (494)
4 COG3428 Predicted membrane pro 93.9 0.099 2.2E-06 53.0 5.4 79 66-166 45-124 (494)
5 PF04281 Tom22: Mitochondrial 68.9 6.6 0.00014 34.0 3.7 34 61-94 76-113 (137)
6 PF06305 DUF1049: Protein of u 61.5 5.9 0.00013 28.5 1.8 35 66-100 17-52 (68)
7 KOG3156 Uncharacterized membra 51.1 15 0.00033 34.4 3.1 32 50-82 185-216 (220)
8 PF07798 DUF1640: Protein of u 42.9 27 0.00059 30.1 3.2 24 59-82 150-173 (177)
9 PRK07353 F0F1 ATP synthase sub 35.9 56 0.0012 26.5 3.9 30 63-95 7-36 (140)
10 CHL00118 atpG ATP synthase CF0 35.1 59 0.0013 27.4 4.0 39 53-94 13-52 (156)
11 PF06459 RR_TM4-6: Ryanodine R 32.6 47 0.001 31.6 3.3 21 66-86 171-191 (274)
12 PF12955 DUF3844: Domain of un 31.5 67 0.0015 26.9 3.7 31 53-83 60-90 (103)
13 PF07782 DC_STAMP: DC-STAMP-li 29.4 79 0.0017 27.5 3.9 40 62-101 143-182 (191)
14 PF05529 Bap31: B-cell recepto 29.4 46 0.00099 28.7 2.5 35 63-99 2-36 (192)
15 KOG0158 Cytochrome P450 CYP3/C 29.3 3E+02 0.0066 28.3 8.5 59 117-194 36-98 (499)
16 PRK14475 F0F1 ATP synthase sub 28.9 85 0.0018 26.8 4.0 32 62-94 9-40 (167)
17 PHA00022 VII minor coat protei 28.9 35 0.00076 23.0 1.3 22 61-82 2-23 (28)
18 PF07263 DMP1: Dentin matrix p 28.2 26 0.00056 36.3 0.9 22 69-94 3-24 (514)
19 PF15128 T_cell_tran_alt: T-ce 27.1 53 0.0012 27.3 2.3 14 66-79 34-48 (92)
20 KOG4551 GPI-GlcNAc transferase 26.4 95 0.0021 28.5 4.0 55 72-145 64-128 (184)
21 PF05569 Peptidase_M56: BlaR1 23.9 1.4E+02 0.0029 27.2 4.6 26 62-87 94-120 (299)
22 PF00430 ATP-synt_B: ATP synth 23.8 1.1E+02 0.0023 24.1 3.5 29 63-94 1-29 (132)
23 PRK14472 F0F1 ATP synthase sub 22.1 1.3E+02 0.0028 25.7 3.8 34 59-95 16-49 (175)
24 KOG3249 Uncharacterized conser 21.3 1.7E+02 0.0037 26.9 4.6 33 55-89 88-125 (181)
25 PF02893 GRAM: GRAM domain; I 21.2 1.1E+02 0.0023 21.9 2.8 20 97-116 28-47 (69)
26 PF06866 DUF1256: Protein of u 20.2 27 0.00059 31.1 -0.6 44 143-187 92-143 (163)
27 PF11014 DUF2852: Protein of u 20.1 1.3E+02 0.0028 25.8 3.4 25 66-90 11-36 (115)
28 PF06679 DUF1180: Protein of u 20.1 1E+02 0.0022 27.5 2.9 25 72-96 97-121 (163)
No 1
>PF03703 bPH_2: Bacterial PH domain; InterPro: IPR005182 A domain that is found in uncharacterised family of membrane proteins. 1-3 copies found in each protein, with each copy flanked by transmembrane helices.
Probab=98.18 E-value=7.8e-07 Score=63.78 Aligned_cols=75 Identities=16% Similarity=0.298 Sum_probs=59.0
Q ss_pred cceeEeecCceEEeecCCcccccccc-ccccccccCchhhhhhhhhchhhhhhceeEEEEeeeecCCCCCCCceeEeeec
Q 023272 100 SRKLYVTPSEIVYKVSRPSFIPFWGK-TTNERHVPLSLVIDVIIEQGCLQSIYGIHTFRVESIARGKAAPVDELQVQGVD 178 (284)
Q Consensus 100 srkLyvT~~~IVYKvsRPs~~P~~Gv-~r~Ek~vpL~~V~DiiieQG~Lqs~fGihs~riE~ig~~k~~~~D~lqi~GV~ 178 (284)
+.+..+|++.++ --.|+ .+++..+|++.|.+|-++||++|+.||+.++.+++++.++..+ .+-..++.
T Consensus 3 ~~~y~i~~~~l~---------i~~G~~~~~~~~i~~~~Iq~v~~~q~~~~r~~g~~~i~i~~~~~~~~~~--~i~~~~~~ 71 (80)
T PF03703_consen 3 NTGYTITDDRLI---------IRSGLFSKRTTIIPLDRIQSVSIKQNPLQRLFGLGTIKIDTAGGSGEKI--EIPFLSIE 71 (80)
T ss_pred EEEEEEECCEEE---------EEECeEEEEEEEEEhhHeEEEEEEcCHHHHhCccEEEEEEECCCCCcee--EEecCCHH
Confidence 345677777777 34465 8899999999999999999999999999999999999998642 35555555
Q ss_pred ChhhHHH
Q 023272 179 NPVLLRK 185 (284)
Q Consensus 179 ~p~~fRk 185 (284)
+|..+++
T Consensus 72 ~a~~i~~ 78 (80)
T PF03703_consen 72 DAEEIYD 78 (80)
T ss_pred HHHHHHh
Confidence 6665554
No 2
>COG3402 Uncharacterized conserved protein [Function unknown]
Probab=97.44 E-value=0.00031 Score=61.91 Aligned_cols=87 Identities=22% Similarity=0.363 Sum_probs=69.0
Q ss_pred HHHhh--hhhhhhhhccccceeEeecCceEEeecCCccccccccccccccccCchhhhhhhhhchhhhhhceeEEEEeee
Q 023272 84 LLYLP--IRRYVLKKDISSRKLYVTPSEIVYKVSRPSFIPFWGKTTNERHVPLSLVIDVIIEQGCLQSIYGIHTFRVESI 161 (284)
Q Consensus 84 LLylP--irryv~rkdi~srkLyvT~~~IVYKvsRPs~~P~~Gv~r~Ek~vpL~~V~DiiieQG~Lqs~fGihs~riE~i 161 (284)
|.-.| +|.-++|-+++--.++++.+.++ ++++-+|..-|--|=.+||||++.||+-+++|-||
T Consensus 65 l~iiP~~~Ryr~wry~v~~~el~iq~GiLv---------------~~r~viP~~RVQhVDt~~GPL~R~~~La~vti~TA 129 (161)
T COG3402 65 LFIIPQLVRYRVWRYEVEEDELDIQHGILV---------------RTRTVIPYVRVQHVDTEQGPLLRRYGLATVTITTA 129 (161)
T ss_pred hhhhhHHHhhhhheeecccceEEeeccEEE---------------EEEEEeeEEEEEeeecccChHHHHhCcceEEEEec
Confidence 34468 67778899999999999998554 77888888888888899999999999999999999
Q ss_pred ecCCCCCCCceeEeee--cChhhHHHHHHHHHH
Q 023272 162 ARGKAAPVDELQVQGV--DNPVLLRKVIVTEAA 192 (284)
Q Consensus 162 g~~k~~~~D~lqi~GV--~~p~~fRk~Vl~~~s 192 (284)
|. +..|.|+ ..++.+|..+...+.
T Consensus 130 ss-------~~~IeaL~~~eAdrlr~~l~~la~ 155 (161)
T COG3402 130 SS-------DHTIEALDREEADRLRERLANLAR 155 (161)
T ss_pred cc-------cceecccCHHHHHHHHHHHHHHHH
Confidence 86 3566665 456777776655443
No 3
>COG3428 Predicted membrane protein [Function unknown]
Probab=93.90 E-value=0.17 Score=51.42 Aligned_cols=107 Identities=14% Similarity=0.247 Sum_probs=71.2
Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhhhhhccccceeEeecCceEEeecCCccccccccccccccccCchhhhhhhhhchh
Q 023272 68 SISLLLVLAWGVGVFMLLYLPIRRYVLKKDISSRKLYVTPSEIVYKVSRPSFIPFWGKTTNERHVPLSLVIDVIIEQGCL 147 (284)
Q Consensus 68 liSlLLvLAWGvGilmLLylPirryv~rkdi~srkLyvT~~~IVYKvsRPs~~P~~Gv~r~Ek~vpL~~V~DiiieQG~L 147 (284)
++=++++.||++++...++ +-|..|.---++.|.++.+ .| -|++-++||..+.-|-++|-+|
T Consensus 237 ~~vl~llvaW~~s~~~tm~---rf~~fq~~~~~~~L~ierG----------Ll-----err~~Tipl~RlqaV~l~esll 298 (494)
T COG3428 237 LAVLLLLVAWLLSIALTMF---RFYGFQVMKENGVLHIERG----------LL-----ERRKVTIPLRRLQAVTLRESLL 298 (494)
T ss_pred HHHHHHHHHHHHHHHHHhh---hhhceEEEecCCEEEEEee----------ee-----ecccceeehhheEEEEeechHH
Confidence 4456777899999844332 2233322222555555544 22 7889999999999999999999
Q ss_pred hhhhceeEEEEeeeecCCCCCCCceeEeeecChhhHHHHHHHHHHHhh
Q 023272 148 QSIYGIHTFRVESIARGKAAPVDELQVQGVDNPVLLRKVIVTEAAKVI 195 (284)
Q Consensus 148 qs~fGihs~riE~ig~~k~~~~D~lqi~GV~~p~~fRk~Vl~~~s~~~ 195 (284)
.++||.-++.|+++|..+-.+. +-.=..=|-.=|+.++.-+.++.
T Consensus 299 rrl~G~~~v~v~~aG~~gd~~~---sgs~~llP~~~k~~v~~~L~~~~ 343 (494)
T COG3428 299 RRLFGYAAVDVVTAGVHGDSQS---SGSTPLLPFIKKEAVLKLLRENT 343 (494)
T ss_pred HHhhccEEEEEEEecCCCCCcC---CCceeccccccHHHHHHHHHHhC
Confidence 9999999999999993222221 11112345555777777777664
No 4
>COG3428 Predicted membrane protein [Function unknown]
Probab=93.87 E-value=0.099 Score=53.02 Aligned_cols=79 Identities=16% Similarity=0.356 Sum_probs=58.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhccccceeEeecCceEEeecCCcccccccc-ccccccccCchhhhhhhhh
Q 023272 66 WFSISLLLVLAWGVGVFMLLYLPIRRYVLKKDISSRKLYVTPSEIVYKVSRPSFIPFWGK-TTNERHVPLSLVIDVIIEQ 144 (284)
Q Consensus 66 W~liSlLLvLAWGvGilmLLylPirryv~rkdi~srkLyvT~~~IVYKvsRPs~~P~~Gv-~r~Ek~vpL~~V~DiiieQ 144 (284)
|..+-+..+++|+|-= |-.|.. =+-.+|++++. --.|+ .|+++.+|.+-|--|=..|
T Consensus 45 w~~~~~vv~vi~~i~~-------ii~w~~------~~Yried~~~~---------l~~Gi~~rk~r~i~~~RIQsVd~t~ 102 (494)
T COG3428 45 WGGAALVVLVIFLILQ-------IIKWIT------FTYRIEDEEVR---------LQTGIFSRKKRYIPIDRIQSVDTTA 102 (494)
T ss_pred eeeehhhHHHHHHHHh-------hhEEEE------EEEEEecceEE---------EEeeEEeecceecchHhhhhhHHHH
Confidence 8888888888887643 122222 12234444443 34455 8999999999999999999
Q ss_pred chhhhhhceeEEEEeeeecCCC
Q 023272 145 GCLQSIYGIHTFRVESIARGKA 166 (284)
Q Consensus 145 G~Lqs~fGihs~riE~ig~~k~ 166 (284)
|-+|++||+-.++|||+|-++-
T Consensus 103 ~lv~rlfgl~~l~IeTaGg~~e 124 (494)
T COG3428 103 GLVARLFGLVVLRIETAGGGGE 124 (494)
T ss_pred HHHHHhhccEEEEEEcCCCCCc
Confidence 9999999999999999986544
No 5
>PF04281 Tom22: Mitochondrial import receptor subunit Tom22 ; InterPro: IPR005683 The mitochondrial protein translocase family, which is responsible for movement of nuclear encoded pre-proteins into mitochondria, is very complex with at least 19 components. These proteins include several chaperone proteins, four proteins of the outer membrane translocase (Tom) import receptor, five proteins of the Tom channel complex, five proteins of the inner membrane translocase (Tim) and three "motor" proteins. This family represents the Tom22 proteins []. The N-terminal region of Tom22 has been shown to have chaperone-like activity, and the C-terminal region faces the intermembrane face []. ; GO: 0006886 intracellular protein transport, 0005741 mitochondrial outer membrane
Probab=68.87 E-value=6.6 Score=33.98 Aligned_cols=34 Identities=24% Similarity=0.179 Sum_probs=25.6
Q ss_pred chhHHHHHHHHHHH---HHHHHHH-HHHHHhhhhhhhh
Q 023272 61 YDTIIWFSISLLLV---LAWGVGV-FMLLYLPIRRYVL 94 (284)
Q Consensus 61 Ydt~~W~liSlLLv---LAWGvGi-lmLLylPirryv~ 94 (284)
+.++-|..-++.-+ .+|+++- .|||.+|+-.-+.
T Consensus 76 ~~~~~~~~k~~~~~~g~a~Wi~tTSallLgvPl~l~ie 113 (137)
T PF04281_consen 76 VSTTSSAVKSLFSFSGKALWIVTTSALLLGVPLALEIE 113 (137)
T ss_pred HHHHHHHHHHHHHhcchhHHHHHHHHHHHHHHHHHHHH
Confidence 55666666666555 7999999 9999999976554
No 6
>PF06305 DUF1049: Protein of unknown function (DUF1049); InterPro: IPR010445 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=61.53 E-value=5.9 Score=28.48 Aligned_cols=35 Identities=23% Similarity=0.345 Sum_probs=25.8
Q ss_pred HHHHHHHHHHHHHHHH-HHHHHhhhhhhhhhhcccc
Q 023272 66 WFSISLLLVLAWGVGV-FMLLYLPIRRYVLKKDISS 100 (284)
Q Consensus 66 W~liSlLLvLAWGvGi-lmLLylPirryv~rkdi~s 100 (284)
|+-.+++++.+-++|+ ++.|+.-...+-.|+.+++
T Consensus 17 ~~pl~l~il~~f~~G~llg~l~~~~~~~~~r~~~~~ 52 (68)
T PF06305_consen 17 PLPLGLLILIAFLLGALLGWLLSLPSRLRLRRRIRR 52 (68)
T ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4778888999999998 6777766666666665544
No 7
>KOG3156 consensus Uncharacterized membrane protein [Function unknown]
Probab=51.12 E-value=15 Score=34.36 Aligned_cols=32 Identities=22% Similarity=0.197 Sum_probs=21.7
Q ss_pred cHHHHhcccccchhHHHHHHHHHHHHHHHHHHH
Q 023272 50 SFEELASNSVKYDTIIWFSISLLLVLAWGVGVF 82 (284)
Q Consensus 50 SFeE~~~~~vkYdt~~W~liSlLLvLAWGvGil 82 (284)
+|....+ .+||||+||+.-.+.-++|=+.|++
T Consensus 185 ~lk~qi~-s~K~qt~qw~~g~v~~~~Al~La~~ 216 (220)
T KOG3156|consen 185 NLKTQIE-SVKTQTIQWLIGVVTGTSALVLAYL 216 (220)
T ss_pred HHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444443 4699999999888776666555543
No 8
>PF07798 DUF1640: Protein of unknown function (DUF1640); InterPro: IPR024461 This family consists of uncharacterised proteins.
Probab=42.93 E-value=27 Score=30.13 Aligned_cols=24 Identities=21% Similarity=0.479 Sum_probs=18.6
Q ss_pred ccchhHHHHHHHHHHHHHHHHHHH
Q 023272 59 VKYDTIIWFSISLLLVLAWGVGVF 82 (284)
Q Consensus 59 vkYdt~~W~liSlLLvLAWGvGil 82 (284)
+|+||++|+...++.++|=++|++
T Consensus 150 ~K~~~lr~~~g~i~~~~a~~la~~ 173 (177)
T PF07798_consen 150 LKWDTLRWLVGVIFGCVALVLAIL 173 (177)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 899999998766666667666663
No 9
>PRK07353 F0F1 ATP synthase subunit B'; Validated
Probab=35.86 E-value=56 Score=26.47 Aligned_cols=30 Identities=20% Similarity=0.474 Sum_probs=20.8
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhh
Q 023272 63 TIIWFSISLLLVLAWGVGVFMLLYLPIRRYVLK 95 (284)
Q Consensus 63 t~~W~liSlLLvLAWGvGilmLLylPirryv~r 95 (284)
|+.|.++ .+++|+|.++. +||-|+..++-.
T Consensus 7 t~~~~~i-~flil~~ll~~--~l~~pi~~~l~~ 36 (140)
T PRK07353 7 TLPLMAV-QFVLLTFILNA--LFYKPVGKVVEE 36 (140)
T ss_pred hHHHHHH-HHHHHHHHHHH--HHHHHHHHHHHH
Confidence 5555555 55688887776 579998877663
No 10
>CHL00118 atpG ATP synthase CF0 B' subunit; Validated
Probab=35.05 E-value=59 Score=27.41 Aligned_cols=39 Identities=21% Similarity=0.384 Sum_probs=25.2
Q ss_pred HHhcccccch-hHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhh
Q 023272 53 ELASNSVKYD-TIIWFSISLLLVLAWGVGVFMLLYLPIRRYVL 94 (284)
Q Consensus 53 E~~~~~vkYd-t~~W~liSlLLvLAWGvGilmLLylPirryv~ 94 (284)
++.++.+.+| |+.|.++. .++|.|.++ -++|-|+...+-
T Consensus 13 ~~~~~~~~~n~t~~~~~in-FliL~~lL~--k~l~~Pi~~~l~ 52 (156)
T CHL00118 13 EGAGGLFDFNATLPLMALQ-FLLLMVLLN--IILYKPLLKVLD 52 (156)
T ss_pred cccCCccccchHHHHHHHH-HHHHHHHHH--HHHHHHHHHHHH
Confidence 4555556665 55566664 456777665 457999887765
No 11
>PF06459 RR_TM4-6: Ryanodine Receptor TM 4-6; InterPro: IPR009460 The release of Ca2+ ions from intracellular stores is a key step in a wide variety of cellular functions. In striated muscle, the release of Ca2+ from the sarcoplasmic reticulum (SR) leads to muscle contraction. Ca2+ release occurs through large, high-conductance Ca2+ release channels, also known as ryanodine receptors (RyRs) because they bind the plant alkaloid ryanodine with high affinity and specificity []. This region covers TM regions 4-6 of the ryanodine receptor 1 family.; GO: 0005219 ryanodine-sensitive calcium-release channel activity, 0006874 cellular calcium ion homeostasis, 0016021 integral to membrane
Probab=32.57 E-value=47 Score=31.59 Aligned_cols=21 Identities=29% Similarity=0.513 Sum_probs=17.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q 023272 66 WFSISLLLVLAWGVGVFMLLY 86 (284)
Q Consensus 66 W~liSlLLvLAWGvGilmLLy 86 (284)
.-+=-|+||||..|=+++|+|
T Consensus 171 YNlr~lALflAFaINFILLFY 191 (274)
T PF06459_consen 171 YNLRFLALFLAFAINFILLFY 191 (274)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 445568999999999999987
No 12
>PF12955 DUF3844: Domain of unknown function (DUF3844); InterPro: IPR024382 This presumed domain is found in fungal species. It contains 8 largely conserved cysteine residues. This domain is found in proteins thought to be located in the endoplasmic reticulum.
Probab=31.50 E-value=67 Score=26.88 Aligned_cols=31 Identities=23% Similarity=0.389 Sum_probs=25.7
Q ss_pred HHhcccccchhHHHHHHHHHHHHHHHHHHHH
Q 023272 53 ELASNSVKYDTIIWFSISLLLVLAWGVGVFM 83 (284)
Q Consensus 53 E~~~~~vkYdt~~W~liSlLLvLAWGvGilm 83 (284)
+-.|-.+.+.-+-|..+.|+++++++||+|.
T Consensus 60 qKkDvS~~F~L~~~~ti~lv~~~~~~I~lL~ 90 (103)
T PF12955_consen 60 QKKDVSVPFWLFAGFTIALVVLVAGAIGLLF 90 (103)
T ss_pred ccccccchhhHHHHHHHHHHHHHHHHHHHHH
Confidence 4566677888888999999999999999864
No 13
>PF07782 DC_STAMP: DC-STAMP-like protein; InterPro: IPR012858 This group of sequences is similar to a region of the dendritic cell-specific transmembrane protein (DC-STAMP, Q9H295 from SWISSPROT). This is thought to be a novel receptor protein that shares no identity with other multimembrane-spanning proteins []. It is thought to have seven putative transmembrane regions [], two of which are found in the region featured in this family. DC-STAMP is also described as having potential N-linked glycosylation sites and a potential phosphorylation site for PKC [], but these are not conserved. ; GO: 0016021 integral to membrane
Probab=29.41 E-value=79 Score=27.52 Aligned_cols=40 Identities=25% Similarity=0.398 Sum_probs=29.6
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhccccc
Q 023272 62 DTIIWFSISLLLVLAWGVGVFMLLYLPIRRYVLKKDISSR 101 (284)
Q Consensus 62 dt~~W~liSlLLvLAWGvGilmLLylPirryv~rkdi~sr 101 (284)
+...|+.+++++.++|..+++=---+=+||.|+-..+..|
T Consensus 143 ~~~~~~~i~~l~~l~~ll~~le~Y~~RLR~~I~a~fyP~r 182 (191)
T PF07782_consen 143 DYSVYIQIGLLYLLLWLLVLLEPYALRLRRVICASFYPER 182 (191)
T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhChHH
Confidence 3456888999999999998876666667777776555443
No 14
>PF05529 Bap31: B-cell receptor-associated protein 31-like ; InterPro: IPR008417 Bap31 is a polytopic integral protein of the endoplasmic reticulum membrane and a substrate of caspase-8. Bap31 is cleaved within its cytosolic domain, generating pro-apoptotic p20 Bap31 [].; GO: 0006886 intracellular protein transport, 0005783 endoplasmic reticulum, 0016021 integral to membrane
Probab=29.39 E-value=46 Score=28.72 Aligned_cols=35 Identities=26% Similarity=0.434 Sum_probs=21.9
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhccc
Q 023272 63 TIIWFSISLLLVLAWGVGVFMLLYLPIRRYVLKKDIS 99 (284)
Q Consensus 63 t~~W~liSlLLvLAWGvGilmLLylPirryv~rkdi~ 99 (284)
+++|.+.-.+|+. -+++++||.+|+-..+-|+=++
T Consensus 2 sl~~~lvf~~L~~--Ei~~~~lL~lPlp~~~R~~i~~ 36 (192)
T PF05529_consen 2 SLQWSLVFGLLYA--EIAVLLLLVLPLPSPIRRKIFK 36 (192)
T ss_pred ccHHHHHHHHHHH--HHHHHHHHHHhCCcHHHHHHHH
Confidence 4678887666554 4567778887775544444333
No 15
>KOG0158 consensus Cytochrome P450 CYP3/CYP5/CYP6/CYP9 subfamilies [Secondary metabolites biosynthesis, transport and catabolism]
Probab=29.29 E-value=3e+02 Score=28.29 Aligned_cols=59 Identities=25% Similarity=0.526 Sum_probs=38.1
Q ss_pred Ccccccccc----ccccccccCchhhhhhhhhchhhhhhceeEEEEeeeecCCCCCCCceeEeeecChhhHHHHHHHHHH
Q 023272 117 PSFIPFWGK----TTNERHVPLSLVIDVIIEQGCLQSIYGIHTFRVESIARGKAAPVDELQVQGVDNPVLLRKVIVTEAA 192 (284)
Q Consensus 117 Ps~~P~~Gv----~r~Ek~vpL~~V~DiiieQG~Lqs~fGihs~riE~ig~~k~~~~D~lqi~GV~~p~~fRk~Vl~~~s 192 (284)
|-..|+||. .+.|++ =....|+.-.+ -.+||+|-++ +|+-+ |.||...|++.+...+
T Consensus 36 ~~p~p~~Gn~~~~~~~~~~--~~~~~~~~~~~---~~~~G~y~~~-------~p~l~-------v~D~elik~I~ik~F~ 96 (499)
T KOG0158|consen 36 PKPLPFLGNLPGMLKRERP--GDLLLDIYTKY---RPVVGIYEGR-------QPALL-------VSDPELIKEILIKDFD 96 (499)
T ss_pred CCCCCcEecHHHHHhccCc--HHHHHHHHhcC---CCEEEEEecC-------CcceE-------ecCHHHHHHHHHHhCc
Confidence 334599997 344444 33334444444 5789999765 34333 7899999999887777
Q ss_pred Hh
Q 023272 193 KV 194 (284)
Q Consensus 193 ~~ 194 (284)
+-
T Consensus 97 ~F 98 (499)
T KOG0158|consen 97 NF 98 (499)
T ss_pred cC
Confidence 54
No 16
>PRK14475 F0F1 ATP synthase subunit B; Provisional
Probab=28.91 E-value=85 Score=26.75 Aligned_cols=32 Identities=13% Similarity=0.158 Sum_probs=20.0
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhh
Q 023272 62 DTIIWFSISLLLVLAWGVGVFMLLYLPIRRYVL 94 (284)
Q Consensus 62 dt~~W~liSlLLvLAWGvGilmLLylPirryv~ 94 (284)
++..|.++++++++ +++-.+-.||=||...+-
T Consensus 9 ~~~~w~~i~f~il~-~iL~~~k~l~~pi~~~le 40 (167)
T PRK14475 9 NPEFWVGAGLLIFF-GILIALKVLPKALAGALD 40 (167)
T ss_pred chHHHHHHHHHHHH-HHHHHHHHhHHHHHHHHH
Confidence 56679887766443 222223557888887765
No 17
>PHA00022 VII minor coat protein
Probab=28.87 E-value=35 Score=22.98 Aligned_cols=22 Identities=41% Similarity=0.714 Sum_probs=19.5
Q ss_pred chhHHHHHHHHHHHHHHHHHHH
Q 023272 61 YDTIIWFSISLLLVLAWGVGVF 82 (284)
Q Consensus 61 Ydt~~W~liSlLLvLAWGvGil 82 (284)
+||++-+.+.+-||+..|.|++
T Consensus 2 ~D~i~q~v~~lglVi~FgLG~I 23 (28)
T PHA00022 2 FDTIYQVVIALGLVICFGLGAI 23 (28)
T ss_pred hHhHHHHHHHHhHHHhhcccee
Confidence 6889999999999999999874
No 18
>PF07263 DMP1: Dentin matrix protein 1 (DMP1); InterPro: IPR009889 This family consists of several mammalian dentin matrix protein 1 (DMP1) sequences. The dentin matrix acidic phosphoprotein 1 (DMP1) gene has been mapped to human chromosome 4q21 []. DMP1 is a bone and teeth specific protein initially identified from mineralised dentin. DMP1 is primarily localised in the nuclear compartment of undifferentiated osteoblasts. In the nucleus, DMP1 acts as a transcriptional component for activation of osteoblast-specific genes like osteocalcin. During the early phase of osteoblast maturation, Ca2+ surges into the nucleus from the cytoplasm, triggering the phosphorylation of DMP1 by a nuclear isoform of casein kinase II. This phosphorylated DMP1 is then exported out into the extracellular matrix, where it regulates nucleation of hydroxyapatite. DMP1 is a unique molecule that initiates osteoblast differentiation by transcription in the nucleus and orchestrates mineralised matrix formation extracellularly, at later stages of osteoblast maturation []. The DMP1 gene has been found to be ectopically expressed in lung cancer although the reason for this is unknown [].; GO: 0001503 ossification, 0030198 extracellular matrix organization
Probab=28.19 E-value=26 Score=36.33 Aligned_cols=22 Identities=41% Similarity=0.851 Sum_probs=17.0
Q ss_pred HHHHHHHHHHHHHHHHHHhhhhhhhh
Q 023272 69 ISLLLVLAWGVGVFMLLYLPIRRYVL 94 (284)
Q Consensus 69 iSlLLvLAWGvGilmLLylPirryv~ 94 (284)
.++||+++||.. .-||+.||--
T Consensus 3 tsill~~lwgls----~alpvary~n 24 (514)
T PF07263_consen 3 TSILLMFLWGLS----CALPVARYQN 24 (514)
T ss_pred ceeHHHHHHHhh----cccchhhhcC
Confidence 367899999944 4589999964
No 19
>PF15128 T_cell_tran_alt: T-cell leukemia translocation-altered
Probab=27.12 E-value=53 Score=27.25 Aligned_cols=14 Identities=57% Similarity=1.260 Sum_probs=8.9
Q ss_pred HHHHHHHHH-HHHHH
Q 023272 66 WFSISLLLV-LAWGV 79 (284)
Q Consensus 66 W~liSlLLv-LAWGv 79 (284)
|++.||+.| +||.+
T Consensus 34 WlvlsLl~I~lAWk~ 48 (92)
T PF15128_consen 34 WLVLSLLAIHLAWKV 48 (92)
T ss_pred HHHHHHHHHHHHHHH
Confidence 777776655 46654
No 20
>KOG4551 consensus GPI-GlcNAc transferase complex, PIG-H component, involved in glycosylphosphatidylinositol anchor biosynthesis [Cell wall/membrane/envelope biogenesis; Posttranslational modification, protein turnover, chaperones]
Probab=26.36 E-value=95 Score=28.49 Aligned_cols=55 Identities=29% Similarity=0.587 Sum_probs=36.1
Q ss_pred HHHHHHHHHH-HHHHHhhhhhhhhhhccccceeEeecCceEEeecCCcccccccc-cc--------ccccccCchhhhhh
Q 023272 72 LLVLAWGVGV-FMLLYLPIRRYVLKKDISSRKLYVTPSEIVYKVSRPSFIPFWGK-TT--------NERHVPLSLVIDVI 141 (284)
Q Consensus 72 LLvLAWGvGi-lmLLylPirryv~rkdi~srkLyvT~~~IVYKvsRPs~~P~~Gv-~r--------~Ek~vpL~~V~Dii 141 (284)
...++||+-+ .+|-||-++..|-+ -+|. ++|-+|| +. .-+-.|+++|-|++
T Consensus 64 ~~~~~~~~~LsglL~~lH~~k~V~~-----Esl~--------------Ii~~lGIQ~~~sy~sgK~ss~fIpidkI~div 124 (184)
T KOG4551|consen 64 ARTLSWGCLLSGLLVMLHSRKFVKK-----ESLI--------------IIPTLGIQLETSYLSGKTSSRFIPIDKILDIV 124 (184)
T ss_pred hHHHHHHHHHHHHHHHHHHHhhcCc-----ceEE--------------EEcccceEEEeeeccCcccceeeEcchhHHhH
Confidence 5788999988 66667766655432 2333 3355555 21 12457999999999
Q ss_pred hhhc
Q 023272 142 IEQG 145 (284)
Q Consensus 142 ieQG 145 (284)
|-.|
T Consensus 125 inE~ 128 (184)
T KOG4551|consen 125 INEC 128 (184)
T ss_pred Hhhc
Confidence 9876
No 21
>PF05569 Peptidase_M56: BlaR1 peptidase M56; InterPro: IPR008756 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M56 (clan M-). The predicted active site residues for members of this family occur in the motif HEXXH. The type example is BlaR1 peptidase from Bacillus licheniformis. Production of beta-Lactamase and penicillin-binding protein 2a (which mediate staphylococcal resistance to beta-lactam antibiotics) is regulated by a signal-transducing integral membrane protein and a transcriptional repressor. The signal transducer is a fusion protein with penicillin-binding and zinc metalloprotease domains. The signal for protein expression is transmitted by site-specific proteolytic cleavage of both the transducer, which auto-activates, and the repressor, which is inactivated, unblocking gene transcription.
Probab=23.93 E-value=1.4e+02 Score=27.15 Aligned_cols=26 Identities=31% Similarity=0.641 Sum_probs=15.7
Q ss_pred hhHHHHHHHHHHHHHHHHHHHH-HHHh
Q 023272 62 DTIIWFSISLLLVLAWGVGVFM-LLYL 87 (284)
Q Consensus 62 dt~~W~liSlLLvLAWGvGilm-LLyl 87 (284)
.+..|..+.-.+...|.+|+++ ++++
T Consensus 94 ~~~~~~~~~~~l~~iw~~g~~~~~~~~ 120 (299)
T PF05569_consen 94 ESFNWSSILDILFAIWLIGAIVLLIRL 120 (299)
T ss_pred ccccchhHHHHHHHHHHHHHHHHHHHH
Confidence 3334555555668889888844 4444
No 22
>PF00430 ATP-synt_B: ATP synthase B/B' CF(0); InterPro: IPR002146 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include: F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. F-ATPases (also known as F1F0-ATPase, or H(+)-transporting two-sector ATPase) (3.6.3.14 from EC) are composed of two linked complexes: the F1 ATPase complex is the catalytic core and is composed of 5 subunits (alpha, beta, gamma, delta, epsilon), while the F0 ATPase complex is the membrane-embedded proton channel that is composed of at least 3 subunits (A-C), nine in mitochondria (A-G, F6, F8). Both the F1 and F0 complexes are rotary motors that are coupled back-to-back. In the F1 complex, the central gamma subunit forms the rotor inside the cylinder made of the alpha(3)beta(3) subunits, while in the F0 complex, the ring-shaped C subunits forms the rotor. The two rotors rotate in opposite directions, but the F0 rotor is usually stronger, using the force from the proton gradient to push the F1 rotor in reverse in order to drive ATP synthesis []. These ATPases can also work in reverse to hydrolyse ATP to create a proton gradient. This entry represents subunits B and B' from the F0 complex in F-ATPases found in chloroplasts and in bacterial plasma membranes. The B subunits are part of the peripheral stalk that links the F1 and F0 complexes together, and which acts as a stator to prevent certain subunits from rotating with the central rotary element. The peripheral stalk differs in subunit composition between mitochondrial, chloroplast and bacterial F-ATPases. In bacterial and chloroplast F-ATPases, the peripheral stalk is composed of one copy of the delta subunit (homologous to OSCP in mitochondria), and two copies of subunit B in bacteria, or one copy each of subunits B and B' in chloroplasts and photosynthetic bacteria []. More information about this protein can be found at Protein of the Month: ATP Synthases [].; GO: 0015078 hydrogen ion transmembrane transporter activity, 0015986 ATP synthesis coupled proton transport, 0045263 proton-transporting ATP synthase complex, coupling factor F(o); PDB: 1L2P_A 2KHK_A 1B9U_A.
Probab=23.83 E-value=1.1e+02 Score=24.06 Aligned_cols=29 Identities=31% Similarity=0.722 Sum_probs=17.2
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhh
Q 023272 63 TIIWFSISLLLVLAWGVGVFMLLYLPIRRYVL 94 (284)
Q Consensus 63 t~~W~liSlLLvLAWGvGilmLLylPirryv~ 94 (284)
|+.|.++..+ ||.|.+.. ++|=||..++-
T Consensus 1 tl~~~~i~Fl-il~~~l~~--~~~~pi~~~l~ 29 (132)
T PF00430_consen 1 TLFWQLINFL-ILFFLLNK--FLYKPIKKFLD 29 (132)
T ss_dssp HHHHHHHHHH-HHHHHHHH--HTHHHHHHHCS
T ss_pred ChHHHHHHHH-HHHHHHHH--HHHHHHHHHHH
Confidence 5667776654 34444443 36778877765
No 23
>PRK14472 F0F1 ATP synthase subunit B; Provisional
Probab=22.05 E-value=1.3e+02 Score=25.74 Aligned_cols=34 Identities=12% Similarity=0.250 Sum_probs=23.0
Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhh
Q 023272 59 VKYDTIIWFSISLLLVLAWGVGVFMLLYLPIRRYVLK 95 (284)
Q Consensus 59 vkYdt~~W~liSlLLvLAWGvGilmLLylPirryv~r 95 (284)
+.+.++.|.++..+ ||.|.+ --++|=||...+-.
T Consensus 16 ~~~~~~~~~~i~Fl-il~~lL--~~~l~kpi~~~l~~ 49 (175)
T PRK14472 16 PNPGLIFWTAVTFV-IVLLIL--KKIAWGPILSALEE 49 (175)
T ss_pred CCHHHHHHHHHHHH-HHHHHH--HHHhHHHHHHHHHH
Confidence 44668889888865 445553 34578899877663
No 24
>KOG3249 consensus Uncharacterized conserved protein [Function unknown]
Probab=21.28 E-value=1.7e+02 Score=26.91 Aligned_cols=33 Identities=30% Similarity=0.397 Sum_probs=18.6
Q ss_pred hcccccchhHHHHHHHH-----HHHHHHHHHHHHHHHhhh
Q 023272 55 ASNSVKYDTIIWFSISL-----LLVLAWGVGVFMLLYLPI 89 (284)
Q Consensus 55 ~~~~vkYdt~~W~liSl-----LLvLAWGvGilmLLylPi 89 (284)
+.+..+|+ +|+.+|| +.|+.|.|+.-+.-++-+
T Consensus 88 ~n~~~lp~--~w~~~~ls~~~~v~vllW~vL~~ia~~l~f 125 (181)
T KOG3249|consen 88 LNKLKLPD--IWLMFSLSLKMWVIVLLWFVLAPIAHRLDF 125 (181)
T ss_pred cccCCCch--HHHHhhhhhhHHHHHHHHHHHHHHHHHHHh
Confidence 35555665 4766665 567777776544333333
No 25
>PF02893 GRAM: GRAM domain; InterPro: IPR004182 The GRAM domain is found in glucosyltransferases, myotubularins and other putative membrane-associated proteins. It is normally about 70 amino acids in length. It is thought to be an intracellular protein-binding or lipid-binding signalling domain, which has an important function in membrane-associated processes. Mutations in the GRAM domain of myotubularins cause a muscle disease, which suggests that the domain is essential for the full function of the enzyme []. Myotubularin-related proteins are a large subfamily of protein tyrosine phosphatases (PTPs) that dephosphorylate D3-phosphorylated inositol lipids [].; PDB: 1M7R_B 1LW3_A 1ZVR_A 1ZSQ_A.
Probab=21.24 E-value=1.1e+02 Score=21.91 Aligned_cols=20 Identities=15% Similarity=0.295 Sum_probs=14.3
Q ss_pred ccccceeEeecCceEEeecC
Q 023272 97 DISSRKLYVTPSEIVYKVSR 116 (284)
Q Consensus 97 di~srkLyvT~~~IVYKvsR 116 (284)
-...=+||||++-+.|+-+.
T Consensus 28 ~~~~G~LyiT~~~lcF~s~~ 47 (69)
T PF02893_consen 28 IPVQGRLYITNNYLCFYSNK 47 (69)
T ss_dssp --EEEEEEEESSEEEEEESS
T ss_pred cceeeEEEECCCEEEEEECC
Confidence 34566899999999986643
No 26
>PF06866 DUF1256: Protein of unknown function (DUF1256); InterPro: IPR009665 This family consists of several uncharacterised bacterial proteins, which seem to be specific to the orders Clostridia and Bacillales. Family members are typically around 180 residues in length. The function of this family is unknown.
Probab=20.23 E-value=27 Score=31.13 Aligned_cols=44 Identities=18% Similarity=0.327 Sum_probs=33.0
Q ss_pred hhchhhhhhceeEEEEee------eecCC--CCCCCceeEeeecChhhHHHHH
Q 023272 143 EQGCLQSIYGIHTFRVES------IARGK--AAPVDELQVQGVDNPVLLRKVI 187 (284)
Q Consensus 143 eQG~Lqs~fGihs~riE~------ig~~k--~~~~D~lqi~GV~~p~~fRk~V 187 (284)
.+-||-+.=.+-.+.+.+ +|++| | |+-|++|+||+|...|....
T Consensus 92 IDAcLG~~~~vG~I~~~~gpl~PG~gv~K~LP-~VGd~sItGiVn~~g~~e~~ 143 (163)
T PF06866_consen 92 IDACLGRPDNVGYITLGNGPLKPGAGVGKKLP-PVGDISITGIVNVSGFMEYF 143 (163)
T ss_pred EECCCCCcccceEEEEcCCCCCCchhhCCCCC-CCcCEEEEEEEcCCCcccee
Confidence 345666666666666665 56677 6 89999999999999987654
No 27
>PF11014 DUF2852: Protein of unknown function (DUF2852); InterPro: IPR021273 This bacterial family of proteins has no known function.
Probab=20.09 E-value=1.3e+02 Score=25.77 Aligned_cols=25 Identities=24% Similarity=0.711 Sum_probs=18.5
Q ss_pred HHH-HHHHHHHHHHHHHHHHHHhhhh
Q 023272 66 WFS-ISLLLVLAWGVGVFMLLYLPIR 90 (284)
Q Consensus 66 W~l-iSlLLvLAWGvGilmLLylPir 90 (284)
|+. .-|-+|+-|-+|+.||.|.=-+
T Consensus 11 ~Ia~mVlGFi~fWPlGla~Lay~iw~ 36 (115)
T PF11014_consen 11 WIAAMVLGFIVFWPLGLALLAYMIWG 36 (115)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 443 3455778899999999998555
No 28
>PF06679 DUF1180: Protein of unknown function (DUF1180); InterPro: IPR009565 This entry consists of several hypothetical eukaryotic proteins thought to be membrane proteins. Their function is unknown.
Probab=20.07 E-value=1e+02 Score=27.48 Aligned_cols=25 Identities=28% Similarity=0.610 Sum_probs=21.1
Q ss_pred HHHHHHHHHHHHHHHhhhhhhhhhh
Q 023272 72 LLVLAWGVGVFMLLYLPIRRYVLKK 96 (284)
Q Consensus 72 LLvLAWGvGilmLLylPirryv~rk 96 (284)
.++.++|+..++++||=||.|=+||
T Consensus 97 ~~~Vl~g~s~l~i~yfvir~~R~r~ 121 (163)
T PF06679_consen 97 ALYVLVGLSALAILYFVIRTFRLRR 121 (163)
T ss_pred hHHHHHHHHHHHHHHHHHHHHhhcc
Confidence 3466788888999999999888887
Done!