Query psy10100
Match_columns 65
No_of_seqs 78 out of 80
Neff 3.8
Searched_HMMs 46136
Date Fri Aug 16 17:54:04 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy10100.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/10100hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF10853 DUF2650: Protein of u 96.6 0.003 6.4E-08 35.2 3.0 28 2-29 11-38 (38)
2 PF01788 PsbJ: PsbJ; InterPro 63.2 14 0.00031 20.8 3.1 21 13-33 12-32 (40)
3 PF02419 PsbL: PsbL protein; 59.8 12 0.00025 20.9 2.4 17 17-33 18-34 (37)
4 PF05082 Rop-like: Rop-like; 55.3 8.4 0.00018 23.6 1.5 31 31-64 22-52 (66)
5 PRK00753 psbL photosystem II r 55.2 14 0.00031 20.7 2.3 16 17-32 20-35 (39)
6 PF10958 DUF2759: Protein of u 54.0 27 0.00059 20.6 3.4 27 13-39 6-36 (52)
7 CHL00038 psbL photosystem II p 53.0 17 0.00037 20.3 2.3 16 17-32 19-34 (38)
8 PF11084 DUF2621: Protein of u 52.1 33 0.00071 23.9 4.1 33 11-43 6-38 (141)
9 PF08362 TetR_C_3: YcdC-like p 48.5 22 0.00047 23.9 2.8 43 11-56 91-133 (143)
10 PF08285 DPM3: Dolichol-phosph 45.7 36 0.00078 21.5 3.3 52 9-63 34-85 (91)
11 PF07709 SRR: Seven Residue Re 45.4 14 0.0003 16.0 1.0 10 56-65 4-13 (14)
12 PF06196 DUF997: Protein of un 44.8 68 0.0015 19.8 4.4 32 14-47 47-78 (80)
13 PF01484 Col_cuticle_N: Nemato 38.3 65 0.0014 17.1 3.9 17 49-65 37-53 (53)
14 PF12537 DUF3735: Protein of u 37.5 40 0.00086 20.0 2.4 26 10-35 8-33 (72)
15 CHL00108 psbJ photosystem II p 37.0 5.7 0.00012 22.4 -1.2 17 17-33 16-32 (40)
16 PRK02565 photosystem II reacti 36.4 7.8 0.00017 21.8 -0.7 17 17-33 15-31 (39)
17 COG5420 Uncharacterized conser 35.9 26 0.00056 21.9 1.4 32 30-64 25-56 (71)
18 PF05545 FixQ: Cbb3-type cytoc 33.9 84 0.0018 17.1 3.3 6 27-32 24-29 (49)
19 KOG1513|consensus 33.8 21 0.00045 31.9 1.1 20 10-29 1237-1256(1300)
20 PF07589 VPEP: PEP-CTERM motif 33.5 63 0.0014 15.9 2.4 15 16-30 6-20 (25)
21 PF06480 FtsH_ext: FtsH Extrac 31.6 16 0.00034 21.1 0.0 17 17-33 3-19 (110)
22 KOG3202|consensus 31.0 49 0.0011 24.3 2.5 21 6-26 211-231 (235)
23 PF04995 CcmD: Heme exporter p 30.4 91 0.002 17.0 3.0 20 13-32 6-25 (46)
24 COG5336 Uncharacterized protei 29.6 59 0.0013 22.0 2.5 18 16-33 73-90 (116)
25 PF11346 DUF3149: Protein of u 28.8 95 0.0021 17.3 2.9 18 14-31 13-30 (42)
26 PF11833 DUF3353: Protein of u 27.5 27 0.00059 24.6 0.6 19 47-65 19-37 (194)
27 PF07503 zf-HYPF: HypF finger; 25.5 24 0.00051 18.9 0.0 10 1-10 24-33 (35)
28 COG0109 CyoE Polyprenyltransfe 24.9 1.1E+02 0.0024 23.4 3.5 26 8-33 101-137 (304)
29 PF00746 Gram_pos_anchor: Gram 23.2 27 0.00059 18.0 0.0 18 13-30 18-35 (39)
30 COG3790 Predicted membrane pro 22.2 66 0.0014 21.2 1.6 25 13-37 51-75 (97)
31 PF10270 MMgT: Membrane magnes 21.9 85 0.0018 19.9 2.1 16 9-24 42-57 (106)
32 PF05814 DUF843: Baculovirus p 21.0 1.5E+02 0.0033 18.9 3.1 44 16-63 31-74 (83)
33 PF03530 SK_channel: Calcium-a 20.9 96 0.0021 20.7 2.3 19 7-25 14-32 (119)
34 PRK13747 putative mercury resi 20.9 2.3E+02 0.0049 18.0 3.8 26 12-37 46-71 (78)
35 PF10661 EssA: WXG100 protein 20.4 1.7E+02 0.0036 19.8 3.4 20 13-32 121-140 (145)
36 PLN03160 uncharacterized prote 20.2 70 0.0015 22.6 1.6 25 7-31 35-59 (219)
No 1
>PF10853 DUF2650: Protein of unknown function (DUF2650); InterPro: IPR022559 This region is found in proteins with unknown function in metazoa.
Probab=96.59 E-value=0.003 Score=35.15 Aligned_cols=28 Identities=39% Similarity=0.691 Sum_probs=26.1
Q ss_pred cccCCcchhHHHHHHHHHHHHHHHHHHH
Q psy10100 2 KICGPKLSLCGVIISAWGIVQLALMGVF 29 (65)
Q Consensus 2 ~~cGpk~s~cc~vlSvWGii~L~llGi~ 29 (65)
+.||.+..-||.-+..|.+++|+++|+.
T Consensus 11 ~CCg~~~~eCCf~lq~Wv~v~l~v~~v~ 38 (38)
T PF10853_consen 11 KCCGDLNKECCFRLQIWVIVLLAVLGVC 38 (38)
T ss_pred ECCCCChHhHHHHHHHHHHHHHHHHhcC
Confidence 5799999999999999999999999973
No 2
>PF01788 PsbJ: PsbJ; InterPro: IPR002682 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection []. This family represents the low molecular weight transmembrane protein PsbJ found in PSII. PsbJ is one of the most hydrophobic proteins in the thylakoid membrane, and is located in a gene cluster with PsbE, PsbF and PsbL (PsbEFJL). Both PsbJ and PsbL (IPR003372 from INTERPRO) are essential for proper assembly of the OEC. Mutations in PsbJ cause the light-harvesting antenna to remain detached from the PSII dimers []. In addition, both PsbJ and PsbL are involved in the unidirectional flow of electrons, where PsbJ regulates the forward electron flow from D2 (Qa) to the plastoquinone pool, and PsbL prevents the reduction of PSII by back electron flow from plastoquinol protecting PSII from photo-inactivation [].; GO: 0015979 photosynthesis, 0009523 photosystem II, 0009539 photosystem II reaction center, 0016020 membrane; PDB: 3A0H_J 3ARC_J 3A0B_J 3KZI_J 2AXT_J 3PRQ_J 4FBY_b 3BZ2_J 1S5L_j 3PRR_J ....
Probab=63.21 E-value=14 Score=20.83 Aligned_cols=21 Identities=24% Similarity=0.479 Sum_probs=16.2
Q ss_pred HHHHHHHHHHHHHHHHHHhhh
Q psy10100 13 VIISAWGIVQLALMGVFYHIR 33 (65)
Q Consensus 13 ~vlSvWGii~L~llGi~F~~~ 33 (65)
+|--+=|+.-+.++|+|||-.
T Consensus 12 lVgtv~G~~vi~lvglFfYGs 32 (40)
T PF01788_consen 12 LVGTVAGIAVIGLVGLFFYGS 32 (40)
T ss_dssp HHHHHHHHHHHHHHHHHHHCT
T ss_pred HHHHHHHHHHHHHHHHheecc
Confidence 344566889999999999853
No 3
>PF02419 PsbL: PsbL protein; InterPro: IPR003372 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection []. This family represents the low molecular weight transmembrane protein PsbL found in PSII. PsbL is located in a gene cluster with PsbE, PsbF and PsbJ (PsbEFJL). Both PsbL and PsbJ (IPR002682 from INTERPRO) are essential for proper assembly of the OEC. Mutations in PsbL prevent the formation of both PSII core dimers and PSII-light harvesting complex []. In addition, both PsbL and PsbJ are involved in the unidirectional flow of electrons, where PsbJ regulates the forward electron flow from D2 (Qa) to the plastoquinone pool, and PsbL prevents the reduction of PSII by back electron flow from plastoquinol protecting PSII from photo-inactivation [].; GO: 0015979 photosynthesis, 0009523 photosystem II, 0009539 photosystem II reaction center, 0016020 membrane; PDB: 3A0H_L 3A0B_l 3ARC_l 1S5L_l 2AXT_l 3BZ2_L 4FBY_L 3PRQ_L 3PRR_L 3KZI_L ....
Probab=59.78 E-value=12 Score=20.86 Aligned_cols=17 Identities=18% Similarity=0.665 Sum_probs=14.8
Q ss_pred HHHHHHHHHHHHHHhhh
Q psy10100 17 AWGIVQLALMGVFYHIR 33 (65)
Q Consensus 17 vWGii~L~llGi~F~~~ 33 (65)
-||.+...++|++|+..
T Consensus 18 Y~GLllifvl~vLFssy 34 (37)
T PF02419_consen 18 YWGLLLIFVLAVLFSSY 34 (37)
T ss_dssp HHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHhhhh
Confidence 49999999999999764
No 4
>PF05082 Rop-like: Rop-like; InterPro: IPR007774 This family contains several uncharacterised bacterial proteins. These proteins are found in nitrogen fixation operons, so are likely to play a role in this process.; PDB: 3CSX_A 2JS5_B.
Probab=55.27 E-value=8.4 Score=23.56 Aligned_cols=31 Identities=26% Similarity=0.380 Sum_probs=21.6
Q ss_pred hhhhhhhhcCCCCCCcccCchhhHHHHHHHhhhc
Q psy10100 31 HIRSVALIEDLPGEETHFASIDDFYQTVERGYTL 64 (65)
Q Consensus 31 ~~~s~~L~edlp~~~~~~~s~~~f~~~~~~~Y~~ 64 (65)
..+=.-|.||||. +|.++.+--++.+++|..
T Consensus 22 kmdLHDLaEdLP~---~w~~i~~vA~~ty~a~~~ 52 (66)
T PF05082_consen 22 KMDLHDLAEDLPT---NWEEIPEVAQKTYDAYAE 52 (66)
T ss_dssp HHHHHHHHHCTTT---TGGGHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHccch---hHHHHHHHHHHHHHHHHH
Confidence 3445678999997 777776666777777653
No 5
>PRK00753 psbL photosystem II reaction center L; Provisional
Probab=55.16 E-value=14 Score=20.70 Aligned_cols=16 Identities=13% Similarity=0.534 Sum_probs=14.3
Q ss_pred HHHHHHHHHHHHHHhh
Q psy10100 17 AWGIVQLALMGVFYHI 32 (65)
Q Consensus 17 vWGii~L~llGi~F~~ 32 (65)
-||.+...++|++|+.
T Consensus 20 y~GlLlifvl~vLFss 35 (39)
T PRK00753 20 YLGLLLVFVLGILFSS 35 (39)
T ss_pred HHHHHHHHHHHHHHHh
Confidence 4999999999999975
No 6
>PF10958 DUF2759: Protein of unknown function (DUF2759); InterPro: IPR024490 This family of proteins with unknown function appear to be restricted to Bacillales.
Probab=53.96 E-value=27 Score=20.61 Aligned_cols=27 Identities=19% Similarity=0.570 Sum_probs=21.8
Q ss_pred HHHHHHHHHHHH----HHHHHHhhhhhhhhc
Q psy10100 13 VIISAWGIVQLA----LMGVFYHIRSVALIE 39 (65)
Q Consensus 13 ~vlSvWGii~L~----llGi~F~~~s~~L~e 39 (65)
.++++||++.-+ ++|+.|..-|++.++
T Consensus 6 tlla~~g~~rslK~KN~l~i~F~~~t~~VFG 36 (52)
T PF10958_consen 6 TLLAAFGVLRSLKNKNFLGIGFALVTVAVFG 36 (52)
T ss_pred HHHHHHHHHHHHHHhhHHHHHHHHHHHHHHH
Confidence 678999999864 889998888877654
No 7
>CHL00038 psbL photosystem II protein L
Probab=52.97 E-value=17 Score=20.32 Aligned_cols=16 Identities=19% Similarity=0.723 Sum_probs=14.2
Q ss_pred HHHHHHHHHHHHHHhh
Q psy10100 17 AWGIVQLALMGVFYHI 32 (65)
Q Consensus 17 vWGii~L~llGi~F~~ 32 (65)
-||.+...++|++|+.
T Consensus 19 y~GLLlifvl~vlfss 34 (38)
T CHL00038 19 YWGLLLIFVLAVLFSN 34 (38)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 4999999999999975
No 8
>PF11084 DUF2621: Protein of unknown function (DUF2621); InterPro: IPR020203 This entry represents a group of uncharacterised proteins.
Probab=52.11 E-value=33 Score=23.91 Aligned_cols=33 Identities=21% Similarity=0.261 Sum_probs=24.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhhhhhcCCCC
Q psy10100 11 CGVIISAWGIVQLALMGVFYHIRSVALIEDLPG 43 (65)
Q Consensus 11 cc~vlSvWGii~L~llGi~F~~~s~~L~edlp~ 43 (65)
+-.++-.|+++|+.+|+|.=+--=.-++.-+|-
T Consensus 6 Fm~fI~~W~~vli~l~~IGGfFMFRKFLK~lPK 38 (141)
T PF11084_consen 6 FMWFILFWVVVLIGLMAIGGFFMFRKFLKRLPK 38 (141)
T ss_pred HHHHHHHHHHHHHHHHHHhHHHHHHHHHHhCCc
Confidence 456788999999998866544444567788885
No 9
>PF08362 TetR_C_3: YcdC-like protein, C-terminal region; InterPro: IPR013573 This entry represents the C-terminal domain found in the hypothetical transcriptional regulators RutR and YcdC (P75899 from SWISSPROT) from Escherichia coli. Both of these proteins are member of the TetR (tetracycline resistance) transcriptional regulator family of proteins. RutR negatively controls the transcription of the rut operon involved in pyrimidine utilization. The C-terminal domains of RutR, YsiA and TetR share a multi-helical, interlocking structure. These proteins also contain helix-turn-helix (HTH) DNA-binding domains.; GO: 0045892 negative regulation of transcription, DNA-dependent; PDB: 3LOC_B.
Probab=48.53 E-value=22 Score=23.87 Aligned_cols=43 Identities=14% Similarity=0.221 Sum_probs=30.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHhhhhhhhhcCCCCCCcccCchhhHHH
Q psy10100 11 CGVIISAWGIVQLALMGVFYHIRSVALIEDLPGEETHFASIDDFYQ 56 (65)
Q Consensus 11 cc~vlSvWGii~L~llGi~F~~~s~~L~edlp~~~~~~~s~~~f~~ 56 (65)
--+++|+|+.-|-= .=|..+..++.+.-++++++|++..++..
T Consensus 91 ~hL~f~IWa~TQ~Y---ADf~~Qi~~~~g~~~~~~~d~e~a~~~v~ 133 (143)
T PF08362_consen 91 EHLFFMIWAMTQHY---ADFAAQIRAVLGKSELSEEDFEQAAEFVT 133 (143)
T ss_dssp HHHHHHHHHHHHHH---HHTHHHHHHHHS--TTSHHHHHHHHHHHH
T ss_pred HHHHHHHHHhhhhh---hhHHHHHHHHhCCCCCCHHHHHHHHHHHH
Confidence 35789999999843 45778888899888887777766555443
No 10
>PF08285 DPM3: Dolichol-phosphate mannosyltransferase subunit 3 (DPM3); InterPro: IPR013174 This family corresponds to subunit 3 of dolichol-phosphate mannosyltransferase, an enzyme which generates mannosyl donors for glycosylphosphatidylinositols, N-glycan and protein O- and C-mannosylation. DPM3 is an integral membrane protein and plays a role in stabilising the dolichol-phosphate mannosyl transferase complex [].
Probab=45.68 E-value=36 Score=21.50 Aligned_cols=52 Identities=12% Similarity=0.110 Sum_probs=36.0
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHhhhhhhhhcCCCCCCcccCchhhHHHHHHHhhh
Q psy10100 9 SLCGVIISAWGIVQLALMGVFYHIRSVALIEDLPGEETHFASIDDFYQTVERGYT 63 (65)
Q Consensus 9 s~cc~vlSvWGii~L~llGi~F~~~s~~L~edlp~~~~~~~s~~~f~~~~~~~Y~ 63 (65)
...-..+-+|.++.+++-..+=-.-.++-+.|-| |.+.+++...+++.+-++
T Consensus 34 ~~ii~~lP~~~Lv~fG~Ysl~~lgy~v~tFnDcp---eA~~eL~~eI~eAK~dLr 85 (91)
T PF08285_consen 34 QEIIPYLPFYALVSFGCYSLFTLGYGVATFNDCP---EAAKELQKEIKEAKADLR 85 (91)
T ss_pred HHHHHHhhHHHHHHHHHHHHHHHHHhhhccCCCH---HHHHHHHHHHHHHHHHHH
Confidence 3456677788888888777777677788888877 455666655566655443
No 11
>PF07709 SRR: Seven Residue Repeat; InterPro: IPR011714 This repeat is found in some Plasmodium and Theileria proteins.
Probab=45.39 E-value=14 Score=15.97 Aligned_cols=10 Identities=30% Similarity=0.577 Sum_probs=8.1
Q ss_pred HHHHHhhhcC
Q psy10100 56 QTVERGYTLL 65 (65)
Q Consensus 56 ~~~~~~Y~~~ 65 (65)
+++..+|+++
T Consensus 4 ~~V~~aY~~l 13 (14)
T PF07709_consen 4 EKVKNAYEQL 13 (14)
T ss_pred HHHHHHHHhc
Confidence 7888888875
No 12
>PF06196 DUF997: Protein of unknown function (DUF997); InterPro: IPR010398 This is a family of predicted bacterial membrane protein with unknown function.
Probab=44.78 E-value=68 Score=19.77 Aligned_cols=32 Identities=19% Similarity=0.335 Sum_probs=18.8
Q ss_pred HHHHHHHHHHHHHHHHHhhhhhhhhcCCCCCCcc
Q psy10100 14 IISAWGIVQLALMGVFYHIRSVALIEDLPGEETH 47 (65)
Q Consensus 14 vlSvWGii~L~llGi~F~~~s~~L~edlp~~~~~ 47 (65)
++|.-+.-.+.+..+.+-++ -++.|+|+|+|+
T Consensus 47 ~~SCi~~~il~~~l~~~~vk--~~Fkd~~Ld~~~ 78 (80)
T PF06196_consen 47 FYSCIGGPILFIILVWLMVK--FFFKDIPLDDEE 78 (80)
T ss_pred HHHHHHHHHHHHHHHHHHHH--HHhhcCCCCccc
Confidence 45555444555555555544 356999996653
No 13
>PF01484 Col_cuticle_N: Nematode cuticle collagen N-terminal domain; InterPro: IPR002486 The function of this domain is unknown. It is found in the N-terminal region of nematode cuticle collagens (see IPR008160 from INTERPRO). Cuticle is a tough elastic structure secreted by hypodermal cells and is primarily composed of collagen proteins [, ].; GO: 0042302 structural constituent of cuticle
Probab=38.27 E-value=65 Score=17.11 Aligned_cols=17 Identities=6% Similarity=0.399 Sum_probs=12.8
Q ss_pred CchhhHHHHHHHhhhcC
Q psy10100 49 ASIDDFYQTVERGYTLL 65 (65)
Q Consensus 49 ~s~~~f~~~~~~~Y~~~ 65 (65)
.++++|+...+++++++
T Consensus 37 ~em~~fk~~s~d~W~~m 53 (53)
T PF01484_consen 37 DEMEEFKEISDDAWNEM 53 (53)
T ss_pred HHHHHHHHHHHHHHhhC
Confidence 45678888888888763
No 14
>PF12537 DUF3735: Protein of unknown function (DUF3735); InterPro: IPR022535 This conserved domain is found in a subunit of a voltage dependent anion channel required for acidification and functions of the Golgi apparatus; it may function in counter-ion conductance. It belongs to the Golgi pH regulator (1.A.38 from TC) family
Probab=37.46 E-value=40 Score=19.99 Aligned_cols=26 Identities=15% Similarity=0.165 Sum_probs=20.7
Q ss_pred hHHHHHHHHHHHHHHHHHHHHhhhhh
Q psy10100 10 LCGVIISAWGIVQLALMGVFYHIRSV 35 (65)
Q Consensus 10 ~cc~vlSvWGii~L~llGi~F~~~s~ 35 (65)
.|-.=+|+-||.+|++|-.|=.++++
T Consensus 8 ~~l~ri~ViGVt~mAiLSG~gaVstp 33 (72)
T PF12537_consen 8 NVLSRIGVIGVTLMAILSGFGAVSTP 33 (72)
T ss_pred HHHHHHHHHHHHHHHHHhhhhHHccH
Confidence 34456899999999999988777765
No 15
>CHL00108 psbJ photosystem II protein J
Probab=37.03 E-value=5.7 Score=22.44 Aligned_cols=17 Identities=29% Similarity=0.602 Sum_probs=13.0
Q ss_pred HHHHHHHHHHHHHHhhh
Q psy10100 17 AWGIVQLALMGVFYHIR 33 (65)
Q Consensus 17 vWGii~L~llGi~F~~~ 33 (65)
+=|+.-+.++|+|||-.
T Consensus 16 v~G~~vi~~vgiFfyGs 32 (40)
T CHL00108 16 VAGIAVIGLLGIFFYGS 32 (40)
T ss_pred hhhHhHHheeeeEEeec
Confidence 34677889999999853
No 16
>PRK02565 photosystem II reaction center protein J; Provisional
Probab=36.37 E-value=7.8 Score=21.78 Aligned_cols=17 Identities=18% Similarity=0.360 Sum_probs=13.1
Q ss_pred HHHHHHHHHHHHHHhhh
Q psy10100 17 AWGIVQLALMGVFYHIR 33 (65)
Q Consensus 17 vWGii~L~llGi~F~~~ 33 (65)
+=|+.-+.++|+|||-.
T Consensus 15 v~G~~vi~~vgiFfyGs 31 (39)
T PRK02565 15 VAGMGVIFVVGLFFYGS 31 (39)
T ss_pred hhHHHHHhheeeEEeec
Confidence 44777889999999843
No 17
>COG5420 Uncharacterized conserved small protein containing a coiled-coil domain [Function unknown]
Probab=35.86 E-value=26 Score=21.88 Aligned_cols=32 Identities=19% Similarity=0.211 Sum_probs=21.0
Q ss_pred HhhhhhhhhcCCCCCCcccCchhhHHHHHHHhhhc
Q psy10100 30 YHIRSVALIEDLPGEETHFASIDDFYQTVERGYTL 64 (65)
Q Consensus 30 F~~~s~~L~edlp~~~~~~~s~~~f~~~~~~~Y~~ 64 (65)
-.++-..|.||+|. +|....+--.+..++|.+
T Consensus 25 akm~LhDLAEgLP~---~wtei~~VA~kt~~~yae 56 (71)
T COG5420 25 AKMELHDLAEGLPV---KWTEIMAVAEKTFEAYAE 56 (71)
T ss_pred HHhhHHHHhccCCc---cHHHHHHHHHHHHHHHHH
Confidence 34556778999997 555555555666666654
No 18
>PF05545 FixQ: Cbb3-type cytochrome oxidase component FixQ; InterPro: IPR008621 This family consists of several Cbb3-type cytochrome oxidase components (FixQ/CcoQ). FixQ is found in nitrogen fixing bacteria. Since nitrogen fixation is an energy-consuming process, effective symbioses depend on operation of a respiratory chain with a high affinity for O2, closely coupled to ATP production. This requirement is fulfilled by a special three-subunit terminal oxidase (cytochrome terminal oxidase cbb3), which was first identified in Bradyrhizobium japonicum as the product of the fixNOQP operon [].
Probab=33.91 E-value=84 Score=17.08 Aligned_cols=6 Identities=17% Similarity=0.418 Sum_probs=2.9
Q ss_pred HHHHhh
Q psy10100 27 GVFYHI 32 (65)
Q Consensus 27 Gi~F~~ 32 (65)
|+.++.
T Consensus 24 gi~~w~ 29 (49)
T PF05545_consen 24 GIVIWA 29 (49)
T ss_pred HHHHHH
Confidence 554443
No 19
>KOG1513|consensus
Probab=33.84 E-value=21 Score=31.91 Aligned_cols=20 Identities=35% Similarity=0.665 Sum_probs=16.8
Q ss_pred hHHHHHHHHHHHHHHHHHHH
Q psy10100 10 LCGVIISAWGIVQLALMGVF 29 (65)
Q Consensus 10 ~cc~vlSvWGii~L~llGi~ 29 (65)
+|+.||||||.|=.++-+.-
T Consensus 1237 LsGsvLsVW~rVE~Vla~~s 1256 (1300)
T KOG1513|consen 1237 LSGSVLSVWGRVEAVLADSS 1256 (1300)
T ss_pred eccchhhhhHHHHHHHhcCC
Confidence 68999999999988876654
No 20
>PF07589 VPEP: PEP-CTERM motif; InterPro: IPR013424 This entry describes a 25-residue region including an invariant Pro-Glu-Pro (PEP) motif, a thirteen residue strongly hydrophobic sequence likely to span the membrane, and a five-residue strongly basic motif that often contains four Arg residues. In most cases, this motif is found within nine residues of the C-terminal end of the protein. Proteins containing this motif typically have signal sequences at the N terminus [].
Probab=33.46 E-value=63 Score=15.90 Aligned_cols=15 Identities=27% Similarity=0.638 Sum_probs=10.8
Q ss_pred HHHHHHHHHHHHHHH
Q psy10100 16 SAWGIVQLALMGVFY 30 (65)
Q Consensus 16 SvWGii~L~llGi~F 30 (65)
|.|+.+.++++|+.+
T Consensus 6 st~~l~~~gl~~l~~ 20 (25)
T PF07589_consen 6 STLALLGLGLLGLAF 20 (25)
T ss_pred HHHHHHHHHHHHHHH
Confidence 447777777777776
No 21
>PF06480 FtsH_ext: FtsH Extracellular; InterPro: IPR011546 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This domain is found in the FtsH family of proteins that include FtsH a membrane-bound ATP-dependent protease universally conserved in prokaryotes []. The FtsH peptidases, which belong to MEROPS peptidase family M41 (clan MA(E)), efficiently degrade proteins that have a low thermodynamic stability - e.g. they lack robust unfoldase activity. This feature may be key and implies that this could be a criterion for degrading a protein. In Oenococcus oeni (Leuconostoc oenos) FtsH is involved in protection against environmental stress [], and shows increased expression under heat or osmotic stress. These two lines of evidence suggest that it is a fundamental prokaryotic self-protection mechanism that checks if proteins are correctly folded. The precise function of this N-terminal region is unclear. ; GO: 0004222 metalloendopeptidase activity, 0005524 ATP binding, 0008270 zinc ion binding, 0016021 integral to membrane; PDB: 2LNA_A.
Probab=31.59 E-value=16 Score=21.11 Aligned_cols=17 Identities=12% Similarity=0.252 Sum_probs=0.4
Q ss_pred HHHHHHHHHHHHHHhhh
Q psy10100 17 AWGIVQLALMGVFYHIR 33 (65)
Q Consensus 17 vWGii~L~llGi~F~~~ 33 (65)
+|-+|++++++++++..
T Consensus 3 ~~ili~~vi~~l~~~~~ 19 (110)
T PF06480_consen 3 LYILIILVILLLFNFFF 19 (110)
T ss_dssp ----------------S
T ss_pred ceehhHHHHHHHHHHHH
Confidence 57888888888887754
No 22
>KOG3202|consensus
Probab=30.96 E-value=49 Score=24.30 Aligned_cols=21 Identities=19% Similarity=0.271 Sum_probs=17.7
Q ss_pred CcchhHHHHHHHHHHHHHHHH
Q psy10100 6 PKLSLCGVIISAWGIVQLALM 26 (65)
Q Consensus 6 pk~s~cc~vlSvWGii~L~ll 26 (65)
.+|+-||+++++-|++.+++.
T Consensus 211 s~~~~~~~il~l~~~~~lvv~ 231 (235)
T KOG3202|consen 211 SQCSQWCAILLLVGLLLLVVI 231 (235)
T ss_pred ccccchhHHHHHHHHHHHHHH
Confidence 367889999999999988764
No 23
>PF04995 CcmD: Heme exporter protein D (CcmD); InterPro: IPR007078 The CcmD protein is part of a C-type cytochrome biogenesis operon []. The exact function of this protein is uncertain. It has been proposed that CcmC, CcmD and CcmE interact directly with each other, establishing a cytoplasm to periplasm haem delivery pathway for cytochrome c maturation []. This protein is found fused to CcmE in P52224 from SWISSPROT. These proteins contain a predicted transmembrane helix.; GO: 0006810 transport, 0016021 integral to membrane
Probab=30.44 E-value=91 Score=16.95 Aligned_cols=20 Identities=25% Similarity=0.504 Sum_probs=16.5
Q ss_pred HHHHHHHHHHHHHHHHHHhh
Q psy10100 13 VIISAWGIVQLALMGVFYHI 32 (65)
Q Consensus 13 ~vlSvWGii~L~llGi~F~~ 32 (65)
.|.+.+|+-++++.+.....
T Consensus 6 yVW~sYg~t~~~l~~l~~~~ 25 (46)
T PF04995_consen 6 YVWSSYGVTALVLAGLIVWS 25 (46)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 57889999999998887764
No 24
>COG5336 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=29.59 E-value=59 Score=22.02 Aligned_cols=18 Identities=28% Similarity=0.632 Sum_probs=13.4
Q ss_pred HHHHHHHHHHHHHHHhhh
Q psy10100 16 SAWGIVQLALMGVFYHIR 33 (65)
Q Consensus 16 SvWGii~L~llGi~F~~~ 33 (65)
|=||.|.++++|..--++
T Consensus 73 sPwglIv~lllGf~AG~l 90 (116)
T COG5336 73 SPWGLIVFLLLGFGAGVL 90 (116)
T ss_pred CcHHHHHHHHHHHHHHHH
Confidence 459999999988654443
No 25
>PF11346 DUF3149: Protein of unknown function (DUF3149); InterPro: IPR021494 This bacterial family of proteins has no known function.
Probab=28.85 E-value=95 Score=17.31 Aligned_cols=18 Identities=39% Similarity=0.440 Sum_probs=13.4
Q ss_pred HHHHHHHHHHHHHHHHHh
Q psy10100 14 IISAWGIVQLALMGVFYH 31 (65)
Q Consensus 14 vlSvWGii~L~llGi~F~ 31 (65)
++|.-.|+..+.||.+|.
T Consensus 13 L~Sl~vI~~~igm~~~~~ 30 (42)
T PF11346_consen 13 LMSLIVIVFTIGMGVFFI 30 (42)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 567778888888887763
No 26
>PF11833 DUF3353: Protein of unknown function (DUF3353); InterPro: IPR021788 This family of proteins are functionally uncharacterised. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 205 to 258 amino acids in length.
Probab=27.47 E-value=27 Score=24.61 Aligned_cols=19 Identities=16% Similarity=0.391 Sum_probs=14.9
Q ss_pred ccCchhhHHHHHHHhhhcC
Q psy10100 47 HFASIDDFYQTVERGYTLL 65 (65)
Q Consensus 47 ~~~s~~~f~~~~~~~Y~~~ 65 (65)
+|...+.-++++++||+.|
T Consensus 19 ~y~gd~~~~~~IEaAYD~I 37 (194)
T PF11833_consen 19 QYAGDEKSREAIEAAYDAI 37 (194)
T ss_pred HhcCCHHHHHHHHHHHHHH
Confidence 5666677789999999864
No 27
>PF07503 zf-HYPF: HypF finger; InterPro: IPR011125 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Proteins of the HypF family are involved in the maturation and regulation of hydrogenase []. In the N terminus they appear to have two zinc finger domains that are similar to those found in the DnaJ chaperone []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 3TTD_A 3TSQ_A 3TTC_A 3TSP_A 3TTF_A 3TSU_A.
Probab=25.54 E-value=24 Score=18.85 Aligned_cols=10 Identities=40% Similarity=0.992 Sum_probs=6.5
Q ss_pred CcccCCcchh
Q psy10100 1 MKICGPKLSL 10 (65)
Q Consensus 1 ~~~cGpk~s~ 10 (65)
|+-|||..+.
T Consensus 24 C~~CGPr~~i 33 (35)
T PF07503_consen 24 CTNCGPRYSI 33 (35)
T ss_dssp BTTCC-SCCC
T ss_pred CCCCCCCEEE
Confidence 4679998764
No 28
>COG0109 CyoE Polyprenyltransferase (cytochrome oxidase assembly factor) [Posttranslational modification, protein turnover, chaperones]
Probab=24.90 E-value=1.1e+02 Score=23.44 Aligned_cols=26 Identities=23% Similarity=0.535 Sum_probs=20.0
Q ss_pred chhHHHHHHHHHHHHH-----------HHHHHHHhhh
Q psy10100 8 LSLCGVIISAWGIVQL-----------ALMGVFYHIR 33 (65)
Q Consensus 8 ~s~cc~vlSvWGii~L-----------~llGi~F~~~ 33 (65)
.-..++++++-|+++| ++.|+|||+-
T Consensus 101 al~fgl~L~~~g~~~l~~~vn~laa~l~~~gi~~Yv~ 137 (304)
T COG0109 101 ALAFGLVLGVAGFSLLWFLVNLLAAVLGLFGIFFYVV 137 (304)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 3467889999999888 5678888763
No 29
>PF00746 Gram_pos_anchor: Gram positive anchor; InterPro: IPR019948 Viruses, parasites and bacteria are covered in protein and sugar molecules that help them gain entry into a host by counteracting the host's defences. One such molecule is the M protein produced by certain streptococcal bacteria. M proteins embody a motif that is now known to be shared by many Gram-positive bacterial surface proteins. The motif includes a conserved hexapeptide, which precedes a hydrophobic C-terminal membrane anchor, which itself precedes a cluster of basic residues [, ]. This structure is represented in the following schematic representation: +--------------------------------------------+-+--------+-+ | Variable length extracellular domain |H| Anchor |B| +--------------------------------------------+-+--------+-+ 'H': conserved hexapeptide. 'B': cluster of basic residues. It has been proposed that this hexapeptide sequence is responsible for a post- translational modification necessary for the proper anchoring of the proteins which bear it, to the cell wall.; PDB: 2XTL_B 3QDH_A 2Y1V_C 2X9X_A 3RPK_A 2X9W_A 2X9Y_A.
Probab=23.21 E-value=27 Score=17.98 Aligned_cols=18 Identities=22% Similarity=0.318 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHH
Q psy10100 13 VIISAWGIVQLALMGVFY 30 (65)
Q Consensus 13 ~vlSvWGii~L~llGi~F 30 (65)
.++.+.|++++++.++++
T Consensus 18 ~~~~~~G~l~~~~~~~~~ 35 (39)
T PF00746_consen 18 SILTILGALLLLGGGLLL 35 (39)
T ss_dssp ------------------
T ss_pred HHHHHHHHHHHHHHHHHh
Confidence 456778885555555444
No 30
>COG3790 Predicted membrane protein [Function unknown]
Probab=22.21 E-value=66 Score=21.18 Aligned_cols=25 Identities=24% Similarity=0.417 Sum_probs=21.6
Q ss_pred HHHHHHHHHHHHHHHHHHhhhhhhh
Q psy10100 13 VIISAWGIVQLALMGVFYHIRSVAL 37 (65)
Q Consensus 13 ~vlSvWGii~L~llGi~F~~~s~~L 37 (65)
-++-+||+--..+-|+.|..+|+..
T Consensus 51 ~llmiWavca~~IhGVGFrpr~~~w 75 (97)
T COG3790 51 GLLMIWAVCAGVIHGVGFRPRSVLW 75 (97)
T ss_pred HHHHHHHHHHHHHhcccCchHHHHH
Confidence 3577999999999999999998754
No 31
>PF10270 MMgT: Membrane magnesium transporter; InterPro: IPR018937 This entry represents a novel family of membrane magnesium transporters (MMgT) []. The proteins, MMgT1 and MMgT2, are localised to the Golgi complex and post-Golgi vesicles, including the early endosomes, suggesting that they may provide regulated pathways for Mg2+ transport in the Golgi and post-Golgi organelles of epithelium-derived cells [].
Probab=21.92 E-value=85 Score=19.88 Aligned_cols=16 Identities=38% Similarity=0.586 Sum_probs=11.9
Q ss_pred hhHHHHHHHHHHHHHH
Q psy10100 9 SLCGVIISAWGIVQLA 24 (65)
Q Consensus 9 s~cc~vlSvWGii~L~ 24 (65)
+.+++++.+||+++..
T Consensus 42 ~lv~l~l~~~G~v~~~ 57 (106)
T PF10270_consen 42 TLVSLILFVFGIVLSA 57 (106)
T ss_pred HHHHHHHHHHHHHHcc
Confidence 3578888888887654
No 32
>PF05814 DUF843: Baculovirus protein of unknown function (DUF843); InterPro: IPR008561 This family consists of several unidentified baculovirus proteins of around 85 residues long with no known function.
Probab=20.98 E-value=1.5e+02 Score=18.92 Aligned_cols=44 Identities=14% Similarity=0.246 Sum_probs=27.2
Q ss_pred HHHHHHHHHHHHHHHhhhhhhhhcCCCCCCcccCchhhHHHHHHHhhh
Q psy10100 16 SAWGIVQLALMGVFYHIRSVALIEDLPGEETHFASIDDFYQTVERGYT 63 (65)
Q Consensus 16 SvWGii~L~llGi~F~~~s~~L~edlp~~~~~~~s~~~f~~~~~~~Y~ 63 (65)
=+-=+++..++++. |++...-..|++- ++-++... .++.+++++
T Consensus 31 LilfviF~~~L~~y-y~kteS~~~dL~t--~k~K~~KK-K~~ln~afD 74 (83)
T PF05814_consen 31 LILFVIFFCVLQVY-YIKTESTPQDLQT--EKAKSIKK-KRDLNDAFD 74 (83)
T ss_pred HHHHHHHHHHHHHH-HcCCCCcHHHHhh--hhhhhHHH-HHHHHHHHH
Confidence 33446677777776 4477777888874 34455533 366777665
No 33
>PF03530 SK_channel: Calcium-activated SK potassium channel; InterPro: IPR011996 Potassium channels are the most diverse group of the ion channel family [, ]. They are important in shaping the action potential, and in neuronal excitability and plasticity []. The potassium channel family is composed of several functionally distinct isoforms, which can be broadly separated into 2 groups []: the practically non-inactivating 'delayed' group and the rapidly inactivating 'transient' group. These are all highly similar proteins, with only small amino acid changes causing the diversity of the voltage-dependent gating mechanism, channel conductance and toxin binding properties. Each type of K+ channel is activated by different signals and conditions depending on their type of regulation: some open in response to depolarisation of the plasma membrane; others in response to hyperpolarisation or an increase in intracellular calcium concentration; some can be regulated by binding of a transmitter, together with intracellular kinases; while others are regulated by GTP-binding proteins or other second messengers []. In eukaryotic cells, K+ channels are involved in neural signalling and generation of the cardiac rhythm, act as effectors in signal transduction pathways involving G protein-coupled receptors (GPCRs) and may have a role in target cell lysis by cytotoxic T-lymphocytes []. In prokaryotic cells, they play a role in the maintenance of ionic homeostasis []. All K+ channels discovered so far possess a core of alpha subunits, each comprising either one or two copies of a highly conserved pore loop domain (P-domain). The P-domain contains the sequence (T/SxxTxGxG), which has been termed the K+ selectivity sequence. In families that contain one P-domain, four subunits assemble to form a selective pathway for K+ across the membrane. However, it remains unclear how the 2 P-domain subunits assemble to form a selective pore. The functional diversity of these families can arise through homo- or hetero-associations of alpha subunits or association with auxiliary cytoplasmic beta subunits. K+ channel subunits containing one pore domain can be assigned into one of two superfamilies: those that possess six transmembrane (TM) domains and those that possess only two TM domains. The six TM domain superfamily can be further subdivided into conserved gene families: the voltage-gated (Kv) channels; the KCNQ channels (originally known as KvLQT channels); the EAG-like K+ channels; and three types of calcium (Ca)-activated K+ channels (BK, IK and SK) []. The 2TM domain family comprises inward-rectifying K+ channels. In addition, there are K+ channel alpha-subunits that possess two P-domains. These are usually highly regulated K+ selective leak channels. Ca2+-activated K+ channels are a diverse group of channels that are activated by an increase in intracellular Ca2+ concentration. They are found in the majority of nerve cells, where they modulate cell excitability and action potential. Three types of Ca2+-activated K+ channel have been characterised, termed small-conductance (SK), intermediate conductance (IK) and large conductance (BK) respectively []. SK channels are thought to play an important role in the functioning of all excitable tissues. To date, 3 subtypes (designated SK1-SK3) have been cloned, each of which possesses a different tissue expression profile: SK1 channels are expressed in the heart; SK2 channels are found in the adrenal gland; and SK3 channels are known to be present in skeletal muscle []. SK channels have a single-channel conductance of 2-20 pS and are activated by rises in cytosolic calcium with half maximal activation in the 400-800 nM range [, ]. Unlike BK channels, they are voltage insensitive and unaffected by low concentrations of TEA, charybdotoxin, or iberiotoxin. However, they are potently blocked by the bee venom apamin [, ], tubocurarine, and quaternary salts of bicuculline [, ]. A new series of compounds that block SK channels include dequalinium Synonym(s): SK Channel This entry represents a conserved region, found in proteins of SK channels family.
Probab=20.93 E-value=96 Score=20.68 Aligned_cols=19 Identities=26% Similarity=0.541 Sum_probs=13.4
Q ss_pred cchhHHHHHHHHHHHHHHH
Q psy10100 7 KLSLCGVIISAWGIVQLAL 25 (65)
Q Consensus 7 k~s~cc~vlSvWGii~L~l 25 (65)
+.+.+.+++++.||+.|++
T Consensus 14 rlsD~aL~~a~~GIvlMvi 32 (119)
T PF03530_consen 14 RLSDYALFFAMFGIVLMVI 32 (119)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 4566777888888777654
No 34
>PRK13747 putative mercury resistance protein; Provisional
Probab=20.92 E-value=2.3e+02 Score=18.00 Aligned_cols=26 Identities=31% Similarity=0.538 Sum_probs=20.0
Q ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhh
Q psy10100 12 GVIISAWGIVQLALMGVFYHIRSVAL 37 (65)
Q Consensus 12 c~vlSvWGii~L~llGi~F~~~s~~L 37 (65)
..+-+-|+|+.+.+.|.|+..-+.+|
T Consensus 46 afl~e~w~iaal~lt~LFvlsl~~~l 71 (78)
T PRK13747 46 AFLGEHWGIAALTLTGLFVLSVTRLL 71 (78)
T ss_pred HHHHHhHHHHHHHHHHHHHHHHHHHH
Confidence 45677899999999998876655544
No 35
>PF10661 EssA: WXG100 protein secretion system (Wss), protein EssA; InterPro: IPR018920 The Wss (WXG100 protein secretion system) in Staphylococcus aureus seems to be encoded by a locus of eight ORFs, called ess (eSAT-6 secretion system) []. This locus encodes, amongst several other proteins, EssA, a protein predicted to possess one transmembrane domain. Due to its predicted membrane location and its absolute requirement for WXG100 protein secretion, it has been speculated that EssA could form a secretion apparatus in conjunction with YukC and YukAB. Proteins homologous to EssA, YukC, EsaA and YukD were absent from mycobacteria []. Members of this family are associated with type VII secretion of WXG100 family targets in the Firmicutes, but not in the Actinobacteria. This highly divergent protein family consists largely of a central region of highly polar low-complexity sequence containing occasional LF motifs in weak repeats about 17 residues in length, flanked by hydrophobic N- and C-terminal regions.
Probab=20.41 E-value=1.7e+02 Score=19.81 Aligned_cols=20 Identities=20% Similarity=0.408 Sum_probs=10.7
Q ss_pred HHHHHHHHHHHHHHHHHHhh
Q psy10100 13 VIISAWGIVQLALMGVFYHI 32 (65)
Q Consensus 13 ~vlSvWGii~L~llGi~F~~ 32 (65)
+++++-|+|+++.-||..-.
T Consensus 121 i~~~i~g~ll~i~~giy~~~ 140 (145)
T PF10661_consen 121 ILLSIGGILLAICGGIYVVL 140 (145)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 44555566655555555443
No 36
>PLN03160 uncharacterized protein; Provisional
Probab=20.23 E-value=70 Score=22.58 Aligned_cols=25 Identities=16% Similarity=0.072 Sum_probs=12.5
Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHh
Q psy10100 7 KLSLCGVIISAWGIVQLALMGVFYH 31 (65)
Q Consensus 7 k~s~cc~vlSvWGii~L~llGi~F~ 31 (65)
+|..||.++-.+-++..++.+++++
T Consensus 35 ~~~~c~~~~~a~~l~l~~v~~~l~~ 59 (219)
T PLN03160 35 NCIKCCGCITATLLILATTILVLVF 59 (219)
T ss_pred cceEEHHHHHHHHHHHHHHHHheee
Confidence 4444555554444444555555544
Done!