Query psy18049
Match_columns 204
No_of_seqs 245 out of 1398
Neff 5.8
Searched_HMMs 46136
Date Fri Aug 16 21:10:15 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy18049.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/18049hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4568|consensus 99.9 6.7E-28 1.4E-32 232.3 9.3 174 27-203 7-201 (664)
2 PF01302 CAP_GLY: CAP-Gly doma 99.9 3.9E-26 8.4E-31 162.9 8.7 66 40-105 1-69 (69)
3 KOG3206|consensus 99.8 2.3E-21 4.9E-26 163.3 7.0 73 36-108 147-223 (234)
4 KOG0971|consensus 99.8 1.6E-19 3.4E-24 176.2 9.3 76 35-110 3-79 (1243)
5 COG5244 NIP100 Dynactin comple 99.8 3.7E-19 8.1E-24 164.7 7.2 67 37-104 3-69 (669)
6 PF01302 CAP_GLY: CAP-Gly doma 99.6 1.4E-15 3E-20 108.5 6.4 47 156-203 1-48 (69)
7 KOG3207|consensus 99.6 3.6E-15 7.7E-20 138.0 7.2 70 37-107 2-73 (505)
8 KOG3206|consensus 99.6 3.9E-15 8.5E-20 125.8 5.8 53 151-203 146-199 (234)
9 KOG4568|consensus 99.2 4.2E-12 9.1E-17 123.4 2.1 76 32-107 149-224 (664)
10 KOG3556|consensus 99.1 2.5E-10 5.5E-15 107.5 6.8 73 36-108 234-316 (724)
11 KOG0241|consensus 99.0 2.2E-10 4.9E-15 113.6 5.2 71 34-108 1627-1697(1714)
12 KOG3556|consensus 97.1 0.00059 1.3E-08 65.3 4.9 50 153-202 235-291 (724)
13 KOG0241|consensus 96.9 0.0017 3.8E-08 65.9 6.1 47 150-203 1627-1673(1714)
14 PF10781 DSRB: Dextransucrase 92.3 0.54 1.2E-05 32.6 5.6 55 38-102 1-61 (62)
15 PRK10708 hypothetical protein; 92.2 0.56 1.2E-05 32.5 5.5 55 38-102 1-61 (62)
16 PTZ00243 ABC transporter; Prov 92.0 0.16 3.5E-06 54.7 4.2 59 37-95 124-200 (1560)
17 PRK10708 hypothetical protein; 88.6 1.4 3E-05 30.6 5.0 41 154-196 1-43 (62)
18 PF10781 DSRB: Dextransucrase 87.4 1.7 3.7E-05 30.1 4.8 41 154-196 1-43 (62)
19 PF09926 DUF2158: Uncharacteri 83.8 4.5 9.7E-05 27.4 5.5 47 154-202 1-47 (53)
20 PTZ00243 ABC transporter; Prov 78.8 2.4 5.2E-05 46.0 4.3 44 150-194 121-166 (1560)
21 PF09926 DUF2158: Uncharacteri 67.8 23 0.0005 23.9 5.6 46 38-83 1-47 (53)
22 PF14801 GCD14_N: tRNA methylt 51.1 5.9 0.00013 27.1 0.3 48 152-202 4-52 (54)
23 TIGR02266 gmx_TIGR02266 Myxoco 50.3 75 0.0016 22.5 6.2 64 11-75 11-83 (96)
24 PF07238 PilZ: PilZ domain; I 48.4 26 0.00055 24.4 3.4 61 10-74 19-87 (102)
25 TIGR00739 yajC preprotein tran 43.8 61 0.0013 23.7 4.8 32 36-73 36-67 (84)
26 PRK05886 yajC preprotein trans 40.9 72 0.0016 24.7 5.0 32 36-73 37-68 (109)
27 PRK12281 rplX 50S ribosomal pr 40.4 42 0.00091 24.2 3.4 27 153-179 6-33 (76)
28 CHL00141 rpl24 ribosomal prote 39.5 43 0.00094 24.5 3.4 26 153-178 8-34 (83)
29 PF00924 MS_channel: Mechanose 36.0 32 0.00069 28.0 2.5 24 36-60 59-82 (206)
30 PF02470 MCE: mce related prot 33.9 1.6E+02 0.0035 20.4 5.8 43 34-76 11-53 (81)
31 PRK05585 yajC preprotein trans 33.1 98 0.0021 23.7 4.6 32 36-73 51-82 (106)
32 PF13403 Hint_2: Hint domain 32.4 58 0.0013 26.0 3.4 28 35-62 18-45 (147)
33 PF13437 HlyD_3: HlyD family s 32.2 1.1E+02 0.0025 21.9 4.8 43 149-191 45-91 (105)
34 COG3350 Uncharacterized conser 29.8 20 0.00044 24.4 0.3 29 67-95 7-35 (53)
35 PRK00004 rplX 50S ribosomal pr 29.5 76 0.0016 24.2 3.5 25 153-177 4-29 (105)
36 TIGR01079 rplX_bact ribosomal 29.2 79 0.0017 24.1 3.5 26 153-178 3-29 (104)
37 PF00924 MS_channel: Mechanose 28.3 47 0.001 26.9 2.3 26 151-180 58-83 (206)
38 PRK06531 yajC preprotein trans 28.0 75 0.0016 24.8 3.2 23 36-58 35-57 (113)
39 smart00739 KOW KOW (Kyprides, 25.6 1.2E+02 0.0027 16.4 3.6 22 153-174 1-23 (28)
40 PRK09816 thrL thr operon leade 25.5 59 0.0013 18.6 1.6 11 3-13 9-19 (26)
41 COG4466 Veg Uncharacterized pr 25.3 1.1E+02 0.0024 22.5 3.5 42 153-196 15-57 (80)
42 PF07154 DUF1392: Protein of u 24.8 1E+02 0.0022 25.3 3.5 45 32-76 82-128 (150)
43 PRK10334 mechanosensitive chan 21.1 1.1E+02 0.0025 27.1 3.5 25 151-179 127-151 (286)
44 PF11141 DUF2914: Protein of u 20.3 1.6E+02 0.0035 20.4 3.5 23 154-176 41-64 (66)
45 PF04246 RseC_MucC: Positive r 20.3 1.8E+02 0.004 22.4 4.2 43 5-47 8-61 (135)
46 PRK10334 mechanosensitive chan 20.2 1.2E+02 0.0026 26.9 3.5 24 36-60 128-151 (286)
No 1
>KOG4568|consensus
Probab=99.95 E-value=6.7e-28 Score=232.34 Aligned_cols=174 Identities=47% Similarity=0.755 Sum_probs=130.2
Q ss_pred CcccccCCCCCcccccEEEECCCceEEEEEEeecCCCCCcEEEEEEcCCCCCCCceeCCEEeeEcCCCCeeeeecCccee
Q psy18049 27 SSQVLTEDTDSFIIGDRVYVGGTKSGRIAFIGETKFAPGDWAGVVLDDPVGKNDGQVGQARYFQCEPKHGIFSRLTRLTR 106 (204)
Q Consensus 27 ~~~~~~~~~~~l~vG~rV~v~g~~~G~vryiG~~~~~~g~w~GVELd~p~Gk~dGs~~G~rYF~C~~~~GiFv~~~kv~~ 106 (204)
++.+...+...+.||++|+|.+...|.|||+|+++|+.|+|+||+|++|.|||||+|+|+|||.|.+.+|+|++..+...
T Consensus 7 ~~~~q~~~~~~~~ig~~v~v~~~~~G~v~y~G~t~f~~G~w~GveL~~p~GKNdGsv~G~rYF~ce~~kgif~~~~~~t~ 86 (664)
T KOG4568|consen 7 SADVQLDDLGQFIIGRRVWVNNVELGFVRYAGETDFAKGIWAGVELLEPKGKNDGSVHGKRYFQCEPKKGIFFRQGRGTA 86 (664)
T ss_pred cccchhhhhhhhhhcceEEecCCcceeeeeccCcccccceeeceeeccccCCCCcccchhhhhccccccceeeccccccc
Confidence 55667778889999999999999999999999999999999999999999999999999999999999999998877766
Q ss_pred cCCCCCC--CCCCCCCCCCCC-c-ccCCC----CCCccc-----------cCCCC--CCCCCCCCCCCCCcCCEEEEecC
Q psy18049 107 TPLDLPP--PATPTPGARGDG-F-ISPTG----SARSLS-----------FSTPG--PGLSTPAKGGDIRLGDRVIVMSA 165 (204)
Q Consensus 107 ~~~~~~~--~~ts~pd~~~~s-~-lS~s~----S~~sl~-----------~s~~~--~S~s~~~~~~~~~vG~RV~v~~~ 165 (204)
...+... .++..|+..-.. + .+.+. +.+... .+.+. ++...+.....+.+|+||.+.
T Consensus 87 ~~~~~~q~~~~~~~~~~~~~~~~~~~~sp~~~~~ip~~~s~~~~~~kss~~s~s~~~~~~~s~~~~~~l~v~dr~l~~-- 164 (664)
T KOG4568|consen 87 SKNEKGQSSPPPSVPNSRLSSAAAASTSPNLQSSIPLLASLSNTIVKSSLESYSTLSSTGKSPTSMLGLRVGDRVLVG-- 164 (664)
T ss_pred ccccccccCCCCCCCchhhccccccccCccccccccccccccccccccccCCCCCcccccccccCccccccccceeec--
Confidence 5433221 111111111000 0 00000 000000 00001 112234567789999999985
Q ss_pred CCCeeeeEEEeccCCCCCceEEEEEeCCCCcCCCcccC
Q psy18049 166 QGSKTGVLRFKGTTQFAQGEWCGVELDEPIGKNDGNLG 203 (204)
Q Consensus 166 ~~~~~GtVryiG~~~~~~g~w~GVElD~p~GkndGsv~ 203 (204)
++..|++||+|.+.|+.|-|+|||||+|.|+|||+|.
T Consensus 165 -gq~~~~~r~~~~t~fa~g~w~~~elde~~~~~d~sv~ 201 (664)
T KOG4568|consen 165 -GQKSGTLRYSGHTKFASGSWLGVELDEPVGKNDGSVG 201 (664)
T ss_pred -CcccchhhhhccccccCCcccchhccccccccccccc
Confidence 6899999999999999999999999999999999985
No 2
>PF01302 CAP_GLY: CAP-Gly domain; InterPro: IPR000938 Cytoskeleton-associated proteins (CAP) are made of three distinct parts, an N-terminal section that is most probably globular and contains the CAP-Gly domain, a large central region predicted to be in an alpha-helical coiled-coil conformation and, finally, a short C-terminal globular domain. The CAP-Gly domain is a conserved, glycine-rich domain of about 42 residues found in some CAPs []. Proteins known to contain this domain include restin (also known as cytoplasmic linker protein-170 or CLIP-170), a 160 kDa protein associated with intermediate filaments and that links endocytic vesicles to microtubules; vertebrate dynactin (150 kDa dynein-associated polypeptide; DAP) and Drosophila glued, a major component of activator I; yeast protein BIK1, which seems to be required for the formation or stabilisation of microtubules during mitosis and for spindle pole body fusion during conjugation; yeast protein NIP100 (NIP80); human protein CKAP1/TFCB; Schizosaccharomyces pombe protein alp11 and Caenorhabditis elegans hypothetical protein F53F4.3. The latter proteins contain a N-terminal ubiquitin domain and a C-terminal CAP-Gly domain. The crystal structure of the CAP-Gly domain of C. elegans F53F4.3 protein, solved by single wavelength sulphur-anomalous phasing, revealed a novel protein fold containing three beta-sheets. The most conserved sequence, GKNDG, is located in two consecutive sharp turns on the surface, forming the entrance to a groove. Residues in the groove are highly conserved as measured from the information content of the aligned sequences. The C-terminal tail of another molecule in the crystal is bound in this groove []. ; PDB: 2CP6_A 2E4H_A 2E3H_A 3E2U_B 2HL3_B 3TQ7_Q 2HKQ_B 2HQH_A 2HKN_B 2HL5_C ....
Probab=99.93 E-value=3.9e-26 Score=162.94 Aligned_cols=66 Identities=50% Similarity=1.012 Sum_probs=59.8
Q ss_pred cccEEEE--CCCceEEEEEEeecC-CCCCcEEEEEEcCCCCCCCceeCCEEeeEcCCCCeeeeecCcce
Q psy18049 40 IGDRVYV--GGTKSGRIAFIGETK-FAPGDWAGVVLDDPVGKNDGQVGQARYFQCEPKHGIFSRLTRLT 105 (204)
Q Consensus 40 vG~rV~v--~g~~~G~vryiG~~~-~~~g~w~GVELd~p~Gk~dGs~~G~rYF~C~~~~GiFv~~~kv~ 105 (204)
|||||.| .....|+|||+|+++ +.+|+|+|||||++.|+|||+++|+|||+|+|+||+||++++|+
T Consensus 1 VG~rV~v~~~~~~~G~vryiG~~~~~~~g~~vGVEld~~~G~~dGt~~G~rYF~c~~~~G~Fv~~~~v~ 69 (69)
T PF01302_consen 1 VGDRVRVDDPGGRRGTVRYIGPVPGFKSGIWVGVELDEPRGKNDGTVKGKRYFECPPNHGIFVRPSKVE 69 (69)
T ss_dssp TTSEEEESSTTTEEEEEEEEEE-SSSSSSEEEEEEESSSTSSBSSEETTEESS-SSTTTEEEEEGGGE-
T ss_pred CCCEEEEeeCCCCEEEEEEEeeCCCCCCCEEEEEEEcCCCCCCCcEECCEEEeeeCCCCEEEecHHHCC
Confidence 7999999 456899999999999 78899999999999999999999999999999999999999974
No 3
>KOG3206|consensus
Probab=99.84 E-value=2.3e-21 Score=163.34 Aligned_cols=73 Identities=41% Similarity=0.833 Sum_probs=68.1
Q ss_pred CCcccccEEEEC--CC--ceEEEEEEeecCCCCCcEEEEEEcCCCCCCCceeCCEEeeEcCCCCeeeeecCcceecC
Q psy18049 36 DSFIIGDRVYVG--GT--KSGRIAFIGETKFAPGDWAGVVLDDPVGKNDGQVGQARYFQCEPKHGIFSRLTRLTRTP 108 (204)
Q Consensus 36 ~~l~vG~rV~v~--g~--~~G~vryiG~~~~~~g~w~GVELd~p~Gk~dGs~~G~rYF~C~~~~GiFv~~~kv~~~~ 108 (204)
..+.||+||+|. |+ ++|+|||+|++++.+|.|+|||.|||.|||||+++|+|||.|+|+||-||+|..++-..
T Consensus 147 ~~i~vG~rCeVtv~G~~~Rrg~vrYvG~~~~k~G~wiGVeydEplGKnDGsv~G~ryF~c~p~yGgfVrP~~V~Vgd 223 (234)
T KOG3206|consen 147 ATIAVGRRCEVTVPGQAPRRGTVRYVGPLEFKPGYWIGVEYDEPLGKNDGSVNGKRYFECAPKYGGFVRPRAVTVGD 223 (234)
T ss_pred hccccCCeeEEecCCCCCcceEEEEecccCCCCceEEEEecCCccccCCCcccceEeeecCCccCCccccceeeecC
Confidence 348999999886 54 89999999999999999999999999999999999999999999999999999998753
No 4
>KOG0971|consensus
Probab=99.79 E-value=1.6e-19 Score=176.16 Aligned_cols=76 Identities=41% Similarity=0.835 Sum_probs=70.5
Q ss_pred CCCcccccEEEECC-CceEEEEEEeecCCCCCcEEEEEEcCCCCCCCceeCCEEeeEcCCCCeeeeecCcceecCCC
Q psy18049 35 TDSFIIGDRVYVGG-TKSGRIAFIGETKFAPGDWAGVVLDDPVGKNDGQVGQARYFQCEPKHGIFSRLTRLTRTPLD 110 (204)
Q Consensus 35 ~~~l~vG~rV~v~g-~~~G~vryiG~~~~~~g~w~GVELd~p~Gk~dGs~~G~rYF~C~~~~GiFv~~~kv~~~~~~ 110 (204)
...|+||.||.|.| +..|+|.|+|.+.|+.|.|+||-||+|.|||||+++|++||+|++|||+||+.+.|....+.
T Consensus 3 ~r~lkvG~RVevtgknl~G~VayvG~T~FA~G~WvGVvLDep~GKNnGsVqg~qYF~Cd~ncG~FVr~sq~r~lEda 79 (1243)
T KOG0971|consen 3 PRSLKVGTRVEVTGKNLQGTVAYVGQTQFAEGKWVGVVLDEPKGKNNGSVQGVQYFECDENCGVFVRSSQVRELEDA 79 (1243)
T ss_pred CccccccceEEeccCCccceEEEecccccccCceEEEEeccccCCCCCcccceeeEecCCCcceEeehhhhHHhhcc
Confidence 46799999999997 57899999999999999999999999999999999999999999999999999997665544
No 5
>COG5244 NIP100 Dynactin complex subunit involved in mitotic spindle partitioning in anaphase B [Cell division and chromosome partitioning]
Probab=99.77 E-value=3.7e-19 Score=164.74 Aligned_cols=67 Identities=52% Similarity=0.960 Sum_probs=62.8
Q ss_pred CcccccEEEECCCceEEEEEEeecCCCCCcEEEEEEcCCCCCCCceeCCEEeeEcCCCCeeeeecCcc
Q psy18049 37 SFIIGDRVYVGGTKSGRIAFIGETKFAPGDWAGVVLDDPVGKNDGQVGQARYFQCEPKHGIFSRLTRL 104 (204)
Q Consensus 37 ~l~vG~rV~v~g~~~G~vryiG~~~~~~g~w~GVELd~p~Gk~dGs~~G~rYF~C~~~~GiFv~~~kv 104 (204)
-+.|||+|.+.+ ..|+|||||.+.|+.|+|+|||||+|.|||||+++|+|||.|..+||+|+++..=
T Consensus 3 ~lSv~D~Vll~~-~~g~VrfiG~t~f~~GiW~GlELd~p~GkNDGSvnGvRYF~Ckk~~giFir~~~~ 69 (669)
T COG5244 3 LLSVNDRVLLGD-KFGTVRFIGKTKFKDGIWIGLELDDPVGKNDGSVNGVRYFHCKKRHGIFIRPDDD 69 (669)
T ss_pred eeecCCEEEecc-ccceEEEeeecccccceEEEEEecCCCCCCCCccCceEEEEecCCcceEEecCch
Confidence 478999999987 5899999999999999999999999999999999999999999999999996543
No 6
>PF01302 CAP_GLY: CAP-Gly domain; InterPro: IPR000938 Cytoskeleton-associated proteins (CAP) are made of three distinct parts, an N-terminal section that is most probably globular and contains the CAP-Gly domain, a large central region predicted to be in an alpha-helical coiled-coil conformation and, finally, a short C-terminal globular domain. The CAP-Gly domain is a conserved, glycine-rich domain of about 42 residues found in some CAPs []. Proteins known to contain this domain include restin (also known as cytoplasmic linker protein-170 or CLIP-170), a 160 kDa protein associated with intermediate filaments and that links endocytic vesicles to microtubules; vertebrate dynactin (150 kDa dynein-associated polypeptide; DAP) and Drosophila glued, a major component of activator I; yeast protein BIK1, which seems to be required for the formation or stabilisation of microtubules during mitosis and for spindle pole body fusion during conjugation; yeast protein NIP100 (NIP80); human protein CKAP1/TFCB; Schizosaccharomyces pombe protein alp11 and Caenorhabditis elegans hypothetical protein F53F4.3. The latter proteins contain a N-terminal ubiquitin domain and a C-terminal CAP-Gly domain. The crystal structure of the CAP-Gly domain of C. elegans F53F4.3 protein, solved by single wavelength sulphur-anomalous phasing, revealed a novel protein fold containing three beta-sheets. The most conserved sequence, GKNDG, is located in two consecutive sharp turns on the surface, forming the entrance to a groove. Residues in the groove are highly conserved as measured from the information content of the aligned sequences. The C-terminal tail of another molecule in the crystal is bound in this groove []. ; PDB: 2CP6_A 2E4H_A 2E3H_A 3E2U_B 2HL3_B 3TQ7_Q 2HKQ_B 2HQH_A 2HKN_B 2HL5_C ....
Probab=99.61 E-value=1.4e-15 Score=108.47 Aligned_cols=47 Identities=51% Similarity=0.967 Sum_probs=42.1
Q ss_pred cCCEEEEecCCCCeeeeEEEeccCC-CCCceEEEEEeCCCCcCCCcccC
Q psy18049 156 LGDRVIVMSAQGSKTGVLRFKGTTQ-FAQGEWCGVELDEPIGKNDGNLG 203 (204)
Q Consensus 156 vG~RV~v~~~~~~~~GtVryiG~~~-~~~g~w~GVElD~p~GkndGsv~ 203 (204)
||+||.| .....+.|+|||+|+++ ++.|+|+|||||+|.|||||+++
T Consensus 1 VG~rV~v-~~~~~~~G~vryiG~~~~~~~g~~vGVEld~~~G~~dGt~~ 48 (69)
T PF01302_consen 1 VGDRVRV-DDPGGRRGTVRYIGPVPGFKSGIWVGVELDEPRGKNDGTVK 48 (69)
T ss_dssp TTSEEEE-SSTTTEEEEEEEEEE-SSSSSSEEEEEEESSSTSSBSSEET
T ss_pred CCCEEEE-eeCCCCEEEEEEEeeCCCCCCCEEEEEEEcCCCCCCCcEEC
Confidence 7999999 33468999999999999 88899999999999999999986
No 7
>KOG3207|consensus
Probab=99.57 E-value=3.6e-15 Score=138.02 Aligned_cols=70 Identities=40% Similarity=0.727 Sum_probs=65.6
Q ss_pred CcccccEEEECCCceEEEEEEeecCCCCCcEEEEEEcCCC-CCCCceeCCEEeeEcC-CCCeeeeecCcceec
Q psy18049 37 SFIIGDRVYVGGTKSGRIAFIGETKFAPGDWAGVVLDDPV-GKNDGQVGQARYFQCE-PKHGIFSRLTRLTRT 107 (204)
Q Consensus 37 ~l~vG~rV~v~g~~~G~vryiG~~~~~~g~w~GVELd~p~-Gk~dGs~~G~rYF~C~-~~~GiFv~~~kv~~~ 107 (204)
.+++|+||.|.+. .+||||+|.+++..+.|+|||||+|. |||||+++|++||.|. |+.|+|+++.+|...
T Consensus 2 ~~~IG~RvkI~~~-~~Tvr~iG~V~g~~~~w~GvEWDd~~RGKH~G~vdgk~YF~~q~P~GGSFik~~kV~~p 73 (505)
T KOG3207|consen 2 TMEIGTRVKIGGE-IATVRYIGEVEGNNSKWYGVEWDDPVRGKHDGIVDGKRYFQTQHPNGGSFIKPGKVKFP 73 (505)
T ss_pred ceeccceEEEcCE-EEEEEEEEEEcCCCCcceeeEecCCCccccCceecceeeeeeecCCCccccCCccCCCC
Confidence 4789999999984 99999999999999999999999996 9999999999999998 999999999999763
No 8
>KOG3206|consensus
Probab=99.56 E-value=3.9e-15 Score=125.77 Aligned_cols=53 Identities=42% Similarity=0.809 Sum_probs=47.7
Q ss_pred CCCCCcCCEEEEec-CCCCeeeeEEEeccCCCCCceEEEEEeCCCCcCCCcccC
Q psy18049 151 GGDIRLGDRVIVMS-AQGSKTGVLRFKGTTQFAQGEWCGVELDEPIGKNDGNLG 203 (204)
Q Consensus 151 ~~~~~vG~RV~v~~-~~~~~~GtVryiG~~~~~~g~w~GVElD~p~GkndGsv~ 203 (204)
+..+.||.||+|+. .+..|+|+|||+|+++|.+|+|||||+|||.|||||+|+
T Consensus 146 ~~~i~vG~rCeVtv~G~~~Rrg~vrYvG~~~~k~G~wiGVeydEplGKnDGsv~ 199 (234)
T KOG3206|consen 146 QATIAVGRRCEVTVPGQAPRRGTVRYVGPLEFKPGYWIGVEYDEPLGKNDGSVN 199 (234)
T ss_pred hhccccCCeeEEecCCCCCcceEEEEecccCCCCceEEEEecCCccccCCCccc
Confidence 34499999999984 344699999999999999999999999999999999986
No 9
>KOG4568|consensus
Probab=99.21 E-value=4.2e-12 Score=123.36 Aligned_cols=76 Identities=51% Similarity=0.979 Sum_probs=71.8
Q ss_pred cCCCCCcccccEEEECCCceEEEEEEeecCCCCCcEEEEEEcCCCCCCCceeCCEEeeEcCCCCeeeeecCcceec
Q psy18049 32 TEDTDSFIIGDRVYVGGTKSGRIAFIGETKFAPGDWAGVVLDDPVGKNDGQVGQARYFQCEPKHGIFSRLTRLTRT 107 (204)
Q Consensus 32 ~~~~~~l~vG~rV~v~g~~~G~vryiG~~~~~~g~w~GVELd~p~Gk~dGs~~G~rYF~C~~~~GiFv~~~kv~~~ 107 (204)
......+.+||||.+.++..|++||.|.+.|++|-|+|||||+|.|++||++.+++||.|++.+|+|.+..++.+.
T Consensus 149 ~~~~~~l~v~dr~l~~gq~~~~~r~~~~t~fa~g~w~~~elde~~~~~d~sv~k~~~~~~k~ky~~f~~v~kv~~~ 224 (664)
T KOG4568|consen 149 PTSMLGLRVGDRVLVGGQKSGTLRYSGHTKFASGSWLGVELDEPVGKNDGSVGKVRYFQCKPKYGKFAPVQKVFKI 224 (664)
T ss_pred ccCccccccccceeecCcccchhhhhccccccCCcccchhcccccccccccccccccccCcchhcchhHHHHHhhc
Confidence 3456689999999999999999999999999999999999999999999999999999999999999999998776
No 10
>KOG3556|consensus
Probab=99.06 E-value=2.5e-10 Score=107.52 Aligned_cols=73 Identities=27% Similarity=0.377 Sum_probs=66.6
Q ss_pred CCcccccEEEECC-----CceEEEEEEeecCCCCCcEEEEEEcCCC-----CCCCceeCCEEeeEcCCCCeeeeecCcce
Q psy18049 36 DSFIIGDRVYVGG-----TKSGRIAFIGETKFAPGDWAGVVLDDPV-----GKNDGQVGQARYFQCEPKHGIFSRLTRLT 105 (204)
Q Consensus 36 ~~l~vG~rV~v~g-----~~~G~vryiG~~~~~~g~w~GVELd~p~-----Gk~dGs~~G~rYF~C~~~~GiFv~~~kv~ 105 (204)
..|+||+.|+|.. ..+|+|||||..+.++|+|+|||+++.. +..||++.|.|||+|-.+.++||+...++
T Consensus 234 ~~L~IGslveV~np~~~~~~ygvVrWIG~pP~~~~Vl~gve~edet~~~~~~~~dgs~~g~Rlftc~d~~a~Fv~~~scs 313 (724)
T KOG3556|consen 234 LNLKIGSLVEVENPETMRRVYGVVRWIGEPPEASGVLYGVEFEDETSQWPSSNQDGSSSGDRLFTCFDTNANFVMSGSCS 313 (724)
T ss_pred ccceecceEEecCCccccccceeeEecccCCcccCceEeeeecCccccCccccccCcccCceeeeeccccceeeEcCCCC
Confidence 3599999999974 3899999999999999999999999864 88999999999999999999999999987
Q ss_pred ecC
Q psy18049 106 RTP 108 (204)
Q Consensus 106 ~~~ 108 (204)
..+
T Consensus 314 ~d~ 316 (724)
T KOG3556|consen 314 VDP 316 (724)
T ss_pred cCc
Confidence 754
No 11
>KOG0241|consensus
Probab=99.03 E-value=2.2e-10 Score=113.57 Aligned_cols=71 Identities=35% Similarity=0.539 Sum_probs=65.2
Q ss_pred CCCCcccccEEEECCCceEEEEEEeecCCCCCcEEEEEEcCCCCCCCceeCCEEeeEcCCCCeeeeecCcceecC
Q psy18049 34 DTDSFIIGDRVYVGGTKSGRIAFIGETKFAPGDWAGVVLDDPVGKNDGQVGQARYFQCEPKHGIFSRLTRLTRTP 108 (204)
Q Consensus 34 ~~~~l~vG~rV~v~g~~~G~vryiG~~~~~~g~w~GVELd~p~Gk~dGs~~G~rYF~C~~~~GiFv~~~kv~~~~ 108 (204)
.+..+..|.+|.+.....++|||+|.++|.+ |||||.|.|||||+..|++||+|.|+||.||+++++.+.+
T Consensus 1627 ~pew~~~ge~v~~~~h~t~v~r~vg~tefq~----gveld~p~gkndgs~gg~qyfrc~p~~g~~vr~s~~~~~~ 1697 (1714)
T KOG0241|consen 1627 SPEWRGFGERVVTVEHTTNVLRDVGFTEFQG----GVELDDPDGKNDGSAGGLQYFRCLPNKGGGVRVSDGLHHP 1697 (1714)
T ss_pred CchhhhcCceeEEeecccceeeecchhhccC----cccccCCCCCcCccccceeEEeecCCCCCcccchhhccCc
Confidence 4457888999999877889999999999987 9999999999999999999999999999999999998764
No 12
>KOG3556|consensus
Probab=97.13 E-value=0.00059 Score=65.29 Aligned_cols=50 Identities=30% Similarity=0.541 Sum_probs=42.1
Q ss_pred CCCcCCEEEEecCC--CCeeeeEEEeccCCCCCceEEEEEeCC-----CCcCCCccc
Q psy18049 153 DIRLGDRVIVMSAQ--GSKTGVLRFKGTTQFAQGEWCGVELDE-----PIGKNDGNL 202 (204)
Q Consensus 153 ~~~vG~RV~v~~~~--~~~~GtVryiG~~~~~~g~w~GVElD~-----p~GkndGsv 202 (204)
.|+||+-|.|.... ...+|.|||||.-|.+.|+|+|||+|. |.+..||+.
T Consensus 235 ~L~IGslveV~np~~~~~~ygvVrWIG~pP~~~~Vl~gve~edet~~~~~~~~dgs~ 291 (724)
T KOG3556|consen 235 NLKIGSLVEVENPETMRRVYGVVRWIGEPPEASGVLYGVEFEDETSQWPSSNQDGSS 291 (724)
T ss_pred cceecceEEecCCccccccceeeEecccCCcccCceEeeeecCccccCccccccCcc
Confidence 48999999998531 257999999999999999999999996 457777775
No 13
>KOG0241|consensus
Probab=96.90 E-value=0.0017 Score=65.89 Aligned_cols=47 Identities=47% Similarity=0.685 Sum_probs=40.2
Q ss_pred CCCCCCcCCEEEEecCCCCeeeeEEEeccCCCCCceEEEEEeCCCCcCCCcccC
Q psy18049 150 KGGDIRLGDRVIVMSAQGSKTGVLRFKGTTQFAQGEWCGVELDEPIGKNDGNLG 203 (204)
Q Consensus 150 ~~~~~~vG~RV~v~~~~~~~~GtVryiG~~~~~~g~w~GVElD~p~GkndGsv~ 203 (204)
....+..|.||.+.. -..++|||+|.++|-+ |||||.|.|||||+++
T Consensus 1627 ~pew~~~ge~v~~~~---h~t~v~r~vg~tefq~----gveld~p~gkndgs~g 1673 (1714)
T KOG0241|consen 1627 SPEWRGFGERVVTVE---HTTNVLRDVGFTEFQG----GVELDDPDGKNDGSAG 1673 (1714)
T ss_pred CchhhhcCceeEEee---cccceeeecchhhccC----cccccCCCCCcCcccc
Confidence 445677899998863 3577999999999987 9999999999999975
No 14
>PF10781 DSRB: Dextransucrase DSRB; InterPro: IPR019717 DSRB is a novel dextransucrase which produces a dextran different from the typical dextran, as it contains (1-6) and (1-2) linkages, when this strain is grown in the presence of sucrose [].
Probab=92.32 E-value=0.54 Score=32.56 Aligned_cols=55 Identities=29% Similarity=0.477 Sum_probs=42.3
Q ss_pred cccccEEEEC--C--CceEEEEEEeecCCCCCcEEEEEEcC-CCCCCCceeCCEEeeEcC-CCCeeeeecC
Q psy18049 38 FIIGDRVYVG--G--TKSGRIAFIGETKFAPGDWAGVVLDD-PVGKNDGQVGQARYFQCE-PKHGIFSRLT 102 (204)
Q Consensus 38 l~vG~rV~v~--g--~~~G~vryiG~~~~~~g~w~GVELd~-p~Gk~dGs~~G~rYF~C~-~~~GiFv~~~ 102 (204)
++|.|||.|. | .+.|+|.-+ =+|..|+.+=|.|++ |. |+-||.=. ...|+||.+.
T Consensus 1 MkvnD~VtVKTDG~~rR~G~ilav--E~F~EG~MYLvaL~dYP~--------GiWFFNE~~~~dG~FVep~ 61 (62)
T PF10781_consen 1 MKVNDRVTVKTDGGPRREGVILAV--EPFNEGTMYLVALEDYPA--------GIWFFNEKDSPDGTFVEPR 61 (62)
T ss_pred CccccEEEEecCCcccccceEEEE--eeccCcEEEEEEcCcCCc--------ceEEEecCCCCCcEEeeec
Confidence 4789999996 3 378998877 457889999999987 44 44588754 7789999764
No 15
>PRK10708 hypothetical protein; Provisional
Probab=92.21 E-value=0.56 Score=32.50 Aligned_cols=55 Identities=29% Similarity=0.526 Sum_probs=42.3
Q ss_pred cccccEEEEC--C--CceEEEEEEeecCCCCCcEEEEEEcC-CCCCCCceeCCEEeeEcC-CCCeeeeecC
Q psy18049 38 FIIGDRVYVG--G--TKSGRIAFIGETKFAPGDWAGVVLDD-PVGKNDGQVGQARYFQCE-PKHGIFSRLT 102 (204)
Q Consensus 38 l~vG~rV~v~--g--~~~G~vryiG~~~~~~g~w~GVELd~-p~Gk~dGs~~G~rYF~C~-~~~GiFv~~~ 102 (204)
++|+|||.|. | .+.|+|.-+ =+|..|+.+=|.|++ |. |+-||.=. ..-|+||.+.
T Consensus 1 MkvnD~VtVKTDG~~rR~G~iLav--E~F~EG~MyLvaL~dYP~--------GiWFFNE~~~~~G~FVep~ 61 (62)
T PRK10708 1 MKVNDRVTVKTDGGPRRPGVVLAV--EEFSEGTMYLVSLEDYPL--------GIWFFNEAGHQDGIFVEKA 61 (62)
T ss_pred CccccEEEEecCCCccccceEEEE--eeccCcEEEEEEcCcCCC--------ceEEEeccCCCCceEeccc
Confidence 4789999996 3 378999887 457889999999987 44 44588754 6789999763
No 16
>PTZ00243 ABC transporter; Provisional
Probab=92.03 E-value=0.16 Score=54.70 Aligned_cols=59 Identities=19% Similarity=0.213 Sum_probs=47.7
Q ss_pred CcccccEEEECC--CceEEEEEEeecC--CCCCcEEEEEEcCC--------------CCCCCceeCCEEeeEcCCCC
Q psy18049 37 SFIIGDRVYVGG--TKSGRIAFIGETK--FAPGDWAGVVLDDP--------------VGKNDGQVGQARYFQCEPKH 95 (204)
Q Consensus 37 ~l~vG~rV~v~g--~~~G~vryiG~~~--~~~g~w~GVELd~p--------------~Gk~dGs~~G~rYF~C~~~~ 95 (204)
+-.||.+|-+.. .-.|++||+|.+. ..++.++||||.-| ..-|||.++|.++|.--...
T Consensus 124 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 200 (1560)
T PTZ00243 124 NCMVGAEVVSTLDASSRGVLRWVGVPQQGGYTRMMAGVEWSVPPALRTAARSDDSGASPFFNGVAHGEHLFTPEQSN 200 (1560)
T ss_pred ccccccEEEeeccccccceEEeecccccCCCcceeeeeeeccCchhhhcccCCCCCCccccccccccccccCcccCC
Confidence 467999998864 4799999999997 46789999999965 12499999999999865433
No 17
>PRK10708 hypothetical protein; Provisional
Probab=88.61 E-value=1.4 Score=30.59 Aligned_cols=41 Identities=29% Similarity=0.567 Sum_probs=30.0
Q ss_pred CCcCCEEEEecCC-CCeeeeEEEeccCCCCCceEEEEEeCC-CCc
Q psy18049 154 IRLGDRVIVMSAQ-GSKTGVLRFKGTTQFAQGEWCGVELDE-PIG 196 (204)
Q Consensus 154 ~~vG~RV~v~~~~-~~~~GtVryiG~~~~~~g~w~GVElD~-p~G 196 (204)
++++|||.|+++. ..|.|+|.-+- +|.+|+.+=|.|++ |.|
T Consensus 1 MkvnD~VtVKTDG~~rR~G~iLavE--~F~EG~MyLvaL~dYP~G 43 (62)
T PRK10708 1 MKVNDRVTVKTDGGPRRPGVVLAVE--EFSEGTMYLVSLEDYPLG 43 (62)
T ss_pred CccccEEEEecCCCccccceEEEEe--eccCcEEEEEEcCcCCCc
Confidence 4789999999864 36789998775 46777777777765 544
No 18
>PF10781 DSRB: Dextransucrase DSRB; InterPro: IPR019717 DSRB is a novel dextransucrase which produces a dextran different from the typical dextran, as it contains (1-6) and (1-2) linkages, when this strain is grown in the presence of sucrose [].
Probab=87.37 E-value=1.7 Score=30.14 Aligned_cols=41 Identities=29% Similarity=0.516 Sum_probs=30.0
Q ss_pred CCcCCEEEEecCC-CCeeeeEEEeccCCCCCceEEEEEeCC-CCc
Q psy18049 154 IRLGDRVIVMSAQ-GSKTGVLRFKGTTQFAQGEWCGVELDE-PIG 196 (204)
Q Consensus 154 ~~vG~RV~v~~~~-~~~~GtVryiG~~~~~~g~w~GVElD~-p~G 196 (204)
++|+|||.|+++. ..|.|+|.-+- +|.+|+.+=|.|++ |.|
T Consensus 1 MkvnD~VtVKTDG~~rR~G~ilavE--~F~EG~MYLvaL~dYP~G 43 (62)
T PF10781_consen 1 MKVNDRVTVKTDGGPRREGVILAVE--PFNEGTMYLVALEDYPAG 43 (62)
T ss_pred CccccEEEEecCCcccccceEEEEe--eccCcEEEEEEcCcCCcc
Confidence 4789999999864 36789998775 46777777777765 544
No 19
>PF09926 DUF2158: Uncharacterized small protein (DUF2158); InterPro: IPR019226 This entry represents a family of predominantly prokaryotic proteins with no known function.
Probab=83.83 E-value=4.5 Score=27.42 Aligned_cols=47 Identities=21% Similarity=0.338 Sum_probs=38.6
Q ss_pred CCcCCEEEEecCCCCeeeeEEEeccCCCCCceEEEEEeCCCCcCCCccc
Q psy18049 154 IRLGDRVIVMSAQGSKTGVLRFKGTTQFAQGEWCGVELDEPIGKNDGNL 202 (204)
Q Consensus 154 ~~vG~RV~v~~~~~~~~GtVryiG~~~~~~g~w~GVElD~p~GkndGsv 202 (204)
|++||.|++++ ++..=||.++|+..+..+.|+=-.|=+..|.+.+.+
T Consensus 1 f~~GDvV~LKS--GGp~MTV~~v~~~~~~~~~~v~C~WFd~~~~~~~~F 47 (53)
T PF09926_consen 1 FKIGDVVQLKS--GGPRMTVTEVGPNAGASGGWVECQWFDGHGEQRETF 47 (53)
T ss_pred CCCCCEEEEcc--CCCCeEEEEccccccCCCCeEEEEeCCCCCcccccc
Confidence 57999999997 466779999999999999998888887766655544
No 20
>PTZ00243 ABC transporter; Provisional
Probab=78.78 E-value=2.4 Score=45.99 Aligned_cols=44 Identities=30% Similarity=0.369 Sum_probs=36.0
Q ss_pred CCCCCCcCCEEEEecCCCCeeeeEEEeccCC--CCCceEEEEEeCCC
Q psy18049 150 KGGDIRLGDRVIVMSAQGSKTGVLRFKGTTQ--FAQGEWCGVELDEP 194 (204)
Q Consensus 150 ~~~~~~vG~RV~v~~~~~~~~GtVryiG~~~--~~~g~w~GVElD~p 194 (204)
+.=+-.||.||-+.-+ ..-+|++||+|.+. ..++.++||||+-|
T Consensus 121 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 166 (1560)
T PTZ00243 121 NAWNCMVGAEVVSTLD-ASSRGVLRWVGVPQQGGYTRMMAGVEWSVP 166 (1560)
T ss_pred cccccccccEEEeecc-ccccceEEeecccccCCCcceeeeeeeccC
Confidence 3446678999988753 47799999999997 56889999999965
No 21
>PF09926 DUF2158: Uncharacterized small protein (DUF2158); InterPro: IPR019226 This entry represents a family of predominantly prokaryotic proteins with no known function.
Probab=67.83 E-value=23 Score=23.91 Aligned_cols=46 Identities=26% Similarity=0.367 Sum_probs=36.3
Q ss_pred cccccEEEEC-CCceEEEEEEeecCCCCCcEEEEEEcCCCCCCCcee
Q psy18049 38 FIIGDRVYVG-GTKSGRIAFIGETKFAPGDWAGVVLDDPVGKNDGQV 83 (204)
Q Consensus 38 l~vG~rV~v~-g~~~G~vryiG~~~~~~g~w~GVELd~p~Gk~dGs~ 83 (204)
|++||.|++. |...=||.++|+.....+.|+=-.|-+..+...+.|
T Consensus 1 f~~GDvV~LKSGGp~MTV~~v~~~~~~~~~~v~C~WFd~~~~~~~~F 47 (53)
T PF09926_consen 1 FKIGDVVQLKSGGPRMTVTEVGPNAGASGGWVECQWFDGHGEQRETF 47 (53)
T ss_pred CCCCCEEEEccCCCCeEEEEccccccCCCCeEEEEeCCCCCcccccc
Confidence 5799999886 556889999999988888888888877766554444
No 22
>PF14801 GCD14_N: tRNA methyltransferase complex GCD14 subunit N-term; PDB: 1I9G_A.
Probab=51.08 E-value=5.9 Score=27.09 Aligned_cols=48 Identities=25% Similarity=0.543 Sum_probs=21.6
Q ss_pred CCCCcCCEEEEecCCCCeeeeEEE-eccCCCCCceEEEEEeCCCCcCCCccc
Q psy18049 152 GDIRLGDRVIVMSAQGSKTGVLRF-KGTTQFAQGEWCGVELDEPIGKNDGNL 202 (204)
Q Consensus 152 ~~~~vG~RV~v~~~~~~~~GtVry-iG~~~~~~g~w~GVElD~p~GkndGsv 202 (204)
.+|+.|+||+++... ++.=||.- -|.. | --.+=.|.-|+=+|+.+|+|
T Consensus 4 Gpf~~GdrVQlTD~K-gr~~Ti~L~~G~~-f-hThrG~i~HDdlIG~~eGsV 52 (54)
T PF14801_consen 4 GPFRAGDRVQLTDPK-GRKHTITLEPGGE-F-HTHRGAIRHDDLIGRPEGSV 52 (54)
T ss_dssp -S--TT-EEEEEETT---EEEEE--TT-E-E-EETTEEEEHHHHTT--TTEE
T ss_pred CCCCCCCEEEEccCC-CCeeeEEECCCCe-E-EcCccccchhheecCCCcEE
Confidence 579999999998643 33333321 1111 0 01223566777778877776
No 23
>TIGR02266 gmx_TIGR02266 Myxococcus xanthus paralogous domain TIGR02266. This domain is related to Type IV pilus assembly protein PilZ (Pfam model pfam07238). It is found in at least 12 copies in Myxococcus xanthus DK 1622.
Probab=50.32 E-value=75 Score=22.50 Aligned_cols=64 Identities=17% Similarity=0.303 Sum_probs=43.7
Q ss_pred cCCCccccccccccCCCcccccCCCCCcccccEEEEC----C-----CceEEEEEEeecCCCCCcEEEEEEcCC
Q psy18049 11 ENRNPFQNCFLVSISDSSQVLTEDTDSFIIGDRVYVG----G-----TKSGRIAFIGETKFAPGDWAGVVLDDP 75 (204)
Q Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vG~rV~v~----g-----~~~G~vryiG~~~~~~g~w~GVELd~p 75 (204)
+..+.+..|...+.|-.++-+..+ ..+.+|++|.+. + ...|.|+|+-+.......++||++.+.
T Consensus 11 ~~~~~~~~~~~~diS~gG~~~~~~-~~~~~g~~v~l~l~l~~~~~~i~~~g~Vv~~~~~~~~~~~~~Gv~F~~~ 83 (96)
T TIGR02266 11 RTDSEFLRDYSINLSKGGLFIRTR-KPLAVGTRVELKLTLPGGERPVELKGVVAWVRPAADGGPPGMGVRFEDL 83 (96)
T ss_pred CChhhHHHHHhhhcCCceEEEecC-CCcCCCCEEEEEEEcCCCCeEEEEEEEEEEeCCCCCCCCCeeEEEeccC
Confidence 445667778777888888877654 568999988774 2 257888887443222225789998764
No 24
>PF07238 PilZ: PilZ domain; InterPro: IPR009875 The ubiquitous bacterial second messenger cyclic-di-GMP (c-di-GMP) is associated with the regulation of biofilm formation, the control of exopolysaccharide synthesis, flagellar- and pili-based motility, gene expression, interactions of bacteria with eukaryotic hosts and multicellular behaviour in diverse bacteria. With the exception of bacterial cellulose synthases, the identities of c-di-GMP receptors and end targets of the proteins having one or more PilZ domains are mostly uncharacterised. However it was suggested that the PilZ domains present in the BcsA subunits of bacterial cellulose synthases function in c-di-GMP binding []. More recently YcgR (see IPR023787 from INTERPRO) was found to bind c-di-GMP tightly and specifically; also isolated PilZ domains from YcgR and BcsA bound c-di-GMP indicating that the PilZ domain was sufficient for binding of c-di-GMP and significantly that site-directed mutagenesis performed on YcgR implicated the most conserved residues in the PilZ domain directly in c-di-GMP binding []. It was suggested that c-di-GMP binding to PilZ brings about conformational changes in the protein that stabilise the bound ligand and probability initiates the downstream signal transduction cascade. In the case of YcgR, c-di-GMP binding regulates flagellum-based motility in a c-di-GMP-dependent manner (see IPR023787 from INTERPRO) []. The association of the PilZ domain with a variety of other domains, including likely components of bacterial multidrug secretion system, could provide clues to multiple functions of the c-di-GMP in bacterial pathogenesis and cell development. Binding and mutagenesis studies of several PilZ domain proteins have confirmed this observation and demonstrated that c-di-GMP binding depends on residues in RxxxR and D/NxSxxG sequence motifs. The crystal structure, at 1.7 A, of a PilZ domain::c-di-GMP complex from Vibrio cholerae shows c-di-GMP contacting seven of nine strongly conserved residues. Binding of c-di-GMP causes a conformational switch whereby the C- and N-terminal domains are brought into close opposition forming a new allosteric interaction surface that spans these domains and the c-di-GMP at their interface []. ; GO: 0035438 cyclic-di-GMP binding; PDB: 2RDE_B 1YLN_A 3KYG_A 3DSG_B 2GJG_A 3KYF_A 1YWU_A 2L74_A 2L1T_A 3CNR_A ....
Probab=48.43 E-value=26 Score=24.38 Aligned_cols=61 Identities=26% Similarity=0.399 Sum_probs=42.3
Q ss_pred ecCCCccccccccccCCCcccccCCCCCcccccEEEEC-----C--C-ceEEEEEEeecCCCCCcEEEEEEcC
Q psy18049 10 TENRNPFQNCFLVSISDSSQVLTEDTDSFIIGDRVYVG-----G--T-KSGRIAFIGETKFAPGDWAGVVLDD 74 (204)
Q Consensus 10 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~vG~rV~v~-----g--~-~~G~vryiG~~~~~~g~w~GVELd~ 74 (204)
..+..++ .|...+.|.+++.+..+. .+.+|+.|.+. . . ..|.|+++-+. ....++|+++++
T Consensus 19 ~~~~~~~-~~~~~diS~~G~~~~~~~-~~~~~~~v~l~~~~~~~~~~~~~~~V~~~~~~--~~~~~~g~~f~~ 87 (102)
T PF07238_consen 19 DPGGSSF-QGTIVDISEGGCAFRSPK-PLEPGDRVRLSFSLPGGGFPIVTGRVVRIQKD--SDGYRVGVEFVD 87 (102)
T ss_dssp EETTEEE-EEEEEEETTSEEEEEECT-G--TTSEEEEEEECTTTSCEEEEEEEEEEEEE--SSEEEEEEEECH
T ss_pred ecCCcEE-EEEEEEECccceEEEECC-CCCCCCEEEEEEEeCCCCeeEEEEEEEEEECC--CCceEEEEEEee
Confidence 3344444 465667888888877755 89999988764 1 2 78899998777 667889999854
No 25
>TIGR00739 yajC preprotein translocase, YajC subunit. While this protein is part of the preprotein translocase in Escherichia coli, it is not essential for viability or protein secretion. The N-terminus region contains a predicted membrane-spanning region followed by a region consisting almost entirely of residues with charged (acidic, basic, or zwitterionic) side chains. This small protein is about 100 residues in length, and is restricted to bacteria; however, this protein is absent from some lineages, including spirochetes and Mycoplasmas.
Probab=43.75 E-value=61 Score=23.69 Aligned_cols=32 Identities=28% Similarity=0.357 Sum_probs=25.1
Q ss_pred CCcccccEEEECCCceEEEEEEeecCCCCCcEEEEEEc
Q psy18049 36 DSFIIGDRVYVGGTKSGRIAFIGETKFAPGDWAGVVLD 73 (204)
Q Consensus 36 ~~l~vG~rV~v~g~~~G~vryiG~~~~~~g~w~GVELd 73 (204)
.+|++||+|.-.+...|+|.-++. .++-||+.
T Consensus 36 ~~L~~Gd~VvT~gGi~G~V~~i~d------~~v~vei~ 67 (84)
T TIGR00739 36 ESLKKGDKVLTIGGIIGTVTKIAE------NTIVIELN 67 (84)
T ss_pred HhCCCCCEEEECCCeEEEEEEEeC------CEEEEEEC
Confidence 479999999999889999998863 34555554
No 26
>PRK05886 yajC preprotein translocase subunit YajC; Validated
Probab=40.87 E-value=72 Score=24.70 Aligned_cols=32 Identities=25% Similarity=0.230 Sum_probs=26.1
Q ss_pred CCcccccEEEECCCceEEEEEEeecCCCCCcEEEEEEc
Q psy18049 36 DSFIIGDRVYVGGTKSGRIAFIGETKFAPGDWAGVVLD 73 (204)
Q Consensus 36 ~~l~vG~rV~v~g~~~G~vryiG~~~~~~g~w~GVELd 73 (204)
.+|++||+|.-.|...|+|.-+. +.++-||+.
T Consensus 37 ~~Lk~GD~VvT~gGi~G~V~~I~------d~~v~leia 68 (109)
T PRK05886 37 ESLQPGDRVHTTSGLQATIVGIT------DDTVDLEIA 68 (109)
T ss_pred HhcCCCCEEEECCCeEEEEEEEe------CCEEEEEEC
Confidence 47999999999988999999885 246677764
No 27
>PRK12281 rplX 50S ribosomal protein L24; Reviewed
Probab=40.36 E-value=42 Score=24.22 Aligned_cols=27 Identities=26% Similarity=0.308 Sum_probs=21.3
Q ss_pred CCCcCCEEEEecC-CCCeeeeEEEeccC
Q psy18049 153 DIRLGDRVIVMSA-QGSKTGVLRFKGTT 179 (204)
Q Consensus 153 ~~~vG~RV~v~~~-~~~~~GtVryiG~~ 179 (204)
.|..||+|.|.++ +.++.|+|..+=.-
T Consensus 6 ~I~kGD~V~Vi~G~dKGK~G~V~~V~~~ 33 (76)
T PRK12281 6 KVKKGDMVKVIAGDDKGKTGKVLAVLPK 33 (76)
T ss_pred cccCCCEEEEeEcCCCCcEEEEEEEEcC
Confidence 6899999999874 45788999887543
No 28
>CHL00141 rpl24 ribosomal protein L24; Validated
Probab=39.48 E-value=43 Score=24.51 Aligned_cols=26 Identities=23% Similarity=0.348 Sum_probs=20.5
Q ss_pred CCCcCCEEEEecC-CCCeeeeEEEecc
Q psy18049 153 DIRLGDRVIVMSA-QGSKTGVLRFKGT 178 (204)
Q Consensus 153 ~~~vG~RV~v~~~-~~~~~GtVryiG~ 178 (204)
.|..||+|.|..+ +.++.|+|..+=.
T Consensus 8 ~I~~GD~V~Vi~G~dKGK~G~V~~V~~ 34 (83)
T CHL00141 8 HVKIGDTVKIISGSDKGKIGEVLKIIK 34 (83)
T ss_pred cccCCCEEEEeEcCCCCcEEEEEEEEc
Confidence 6889999999874 4577888888743
No 29
>PF00924 MS_channel: Mechanosensitive ion channel; InterPro: IPR006685 Mechanosensitive (MS) channels provide protection against hypo-osmotic shock, responding both to stretching of the cell membrane and to membrane depolarisation. They are present in the membranes of organisms from the three domains of life: bacteria, archaea, and eukarya []. There are two families of MS channels: large-conductance MS channels (MscL) and small-conductance MS channels (MscS or YGGB). The pressure threshold for MscS opening is 50% that of MscL []. The MscS family is much larger and more variable in size and sequence than the MscL family. Much of the diversity in MscS proteins occurs in the size of the transmembrane regions, which ranges from three to eleven transmembrane helices, although the three C-terminal helices are conserved. This family contains sequences form the MscS family of proteins. MscS folds as a homo-heptamer with a cylindrical shape, and can be divided into transmembrane and extramembrane regions: an N-terminal periplasmic region, a transmembrane region, and a C-terminal cytoplasmic region (middle and C-terminal domains). The transmembrane region forms a channel through the membrane that opens into a chamber enclosed by the extramembrane portion, the latter connecting to the cytoplasm through distinct portals [].; GO: 0055085 transmembrane transport, 0016020 membrane; PDB: 2OAU_E 2VV5_F.
Probab=35.98 E-value=32 Score=27.98 Aligned_cols=24 Identities=42% Similarity=0.797 Sum_probs=17.3
Q ss_pred CCcccccEEEECCCceEEEEEEeec
Q psy18049 36 DSFIIGDRVYVGGTKSGRIAFIGET 60 (204)
Q Consensus 36 ~~l~vG~rV~v~g~~~G~vryiG~~ 60 (204)
..|++||+|.+.+ ..|.|.-+|-.
T Consensus 59 ~pf~vGD~I~i~~-~~G~V~~I~l~ 82 (206)
T PF00924_consen 59 RPFKVGDRIEIGG-VEGRVEEIGLR 82 (206)
T ss_dssp -SS-TT-EEESSS--EEEEEEE-SS
T ss_pred CCccCCCEEEEEE-eehHHHhcCcc
Confidence 4799999999996 69999999776
No 30
>PF02470 MCE: mce related protein; InterPro: IPR003399 This domain is found in all 24 mce genes associated with the four mammalian cell entry (mce) operons of Mycobacterium tuberculosis and their homologs in other Actinomycetales [, ]. The archetype (mce1A, Rv0169), was isolated as being necessary for colonisation of, and survival within, the macrophage []. The domain is also found in: Chloroplast Ycf22 and related cyanobacterial homologs, the majority of which have an N-terminal transmembrane domain and are putative ABC transporters. Proteobacterial homologs, which include YrbD, YebT, VpsC and Ttg2C, the latter being annotated as a toluene tolerance proteins, belong to the periplasmic substrate-binding ABC transporter superfamily.
Probab=33.85 E-value=1.6e+02 Score=20.38 Aligned_cols=43 Identities=16% Similarity=0.196 Sum_probs=32.1
Q ss_pred CCCCcccccEEEECCCceEEEEEEeecCCCCCcEEEEEEcCCC
Q psy18049 34 DTDSFIIGDRVYVGGTKSGRIAFIGETKFAPGDWAGVVLDDPV 76 (204)
Q Consensus 34 ~~~~l~vG~rV~v~g~~~G~vryiG~~~~~~g~w~GVELd~p~ 76 (204)
+...|.+|..|.+.|...|.|.=+.--......-+-+++++..
T Consensus 11 ~~~GL~~gs~V~~~Gv~VG~V~~i~l~~~~~~v~v~~~i~~~~ 53 (81)
T PF02470_consen 11 DAGGLSVGSPVRYRGVEVGKVTSIELDPDGNRVRVTLRIDPDY 53 (81)
T ss_pred CcCCCCCcCEEEECCEEEEEEEEEEEcCCCCEEEEEEEEcCCc
Confidence 3468999999999998899999884323344566777777655
No 31
>PRK05585 yajC preprotein translocase subunit YajC; Validated
Probab=33.06 E-value=98 Score=23.66 Aligned_cols=32 Identities=28% Similarity=0.479 Sum_probs=24.9
Q ss_pred CCcccccEEEECCCceEEEEEEeecCCCCCcEEEEEEc
Q psy18049 36 DSFIIGDRVYVGGTKSGRIAFIGETKFAPGDWAGVVLD 73 (204)
Q Consensus 36 ~~l~vG~rV~v~g~~~G~vryiG~~~~~~g~w~GVELd 73 (204)
..|++||+|...+..+|+|.-+.. ..+=||+.
T Consensus 51 ~~Lk~Gd~VvT~gGi~G~Vv~i~~------~~v~lei~ 82 (106)
T PRK05585 51 SSLAKGDEVVTNGGIIGKVTKVSE------DFVIIELN 82 (106)
T ss_pred HhcCCCCEEEECCCeEEEEEEEeC------CEEEEEEC
Confidence 579999999998889999998853 35555554
No 32
>PF13403 Hint_2: Hint domain
Probab=32.43 E-value=58 Score=26.03 Aligned_cols=28 Identities=21% Similarity=0.345 Sum_probs=23.0
Q ss_pred CCCcccccEEEECCCceEEEEEEeecCC
Q psy18049 35 TDSFIIGDRVYVGGTKSGRIAFIGETKF 62 (204)
Q Consensus 35 ~~~l~vG~rV~v~g~~~G~vryiG~~~~ 62 (204)
..+|++||+|.-.....-.|+|||....
T Consensus 18 Ve~L~~GD~V~T~dgg~~~V~wig~~~~ 45 (147)
T PF13403_consen 18 VEDLRPGDRVLTRDGGFQPVRWIGRRTV 45 (147)
T ss_pred eeccCCCCEEEecCCCEEEEEEEEEEEe
Confidence 4579999999887667889999998643
No 33
>PF13437 HlyD_3: HlyD family secretion protein
Probab=32.19 E-value=1.1e+02 Score=21.89 Aligned_cols=43 Identities=21% Similarity=0.180 Sum_probs=30.4
Q ss_pred CCCCCCC-cCCEEEEecCC---CCeeeeEEEeccCCCCCceEEEEEe
Q psy18049 149 AKGGDIR-LGDRVIVMSAQ---GSKTGVLRFKGTTQFAQGEWCGVEL 191 (204)
Q Consensus 149 ~~~~~~~-vG~RV~v~~~~---~~~~GtVryiG~~~~~~g~w~GVEl 191 (204)
.....++ +|++|.+...+ ....|+|.+|.+.+-..+..+-|++
T Consensus 45 ~~~~~i~~~g~~v~v~~~~~~~~~~~g~V~~I~~~~~~~~~~~~v~~ 91 (105)
T PF13437_consen 45 KDIARIKDPGQKVTVRLDPGPEKTIEGKVSSISPSPDPQGGTYRVEI 91 (105)
T ss_pred HhhcceEeCCCEEEEEECCCCCcEEEEEEEEEeCcccCCCcEEEEEE
Confidence 3456787 99999987542 3679999999998655544455444
No 34
>COG3350 Uncharacterized conserved protein [Function unknown]
Probab=29.78 E-value=20 Score=24.42 Aligned_cols=29 Identities=17% Similarity=0.428 Sum_probs=22.1
Q ss_pred EEEEEEcCCCCCCCceeCCEEeeEcCCCC
Q psy18049 67 WAGVVLDDPVGKNDGQVGQARYFQCEPKH 95 (204)
Q Consensus 67 w~GVELd~p~Gk~dGs~~G~rYF~C~~~~ 95 (204)
.+|..+|...-..-=+|+|+-||-|.+.+
T Consensus 7 Vcgm~v~~~~a~~k~~Y~GktYYFcse~~ 35 (53)
T COG3350 7 VCGMKVDNENAEYKSSYGGKTYYFCSEEC 35 (53)
T ss_pred CcCccccccccceeEEeCCEEEEEeCHHH
Confidence 46777776666666689999999998654
No 35
>PRK00004 rplX 50S ribosomal protein L24; Reviewed
Probab=29.51 E-value=76 Score=24.17 Aligned_cols=25 Identities=32% Similarity=0.393 Sum_probs=20.2
Q ss_pred CCCcCCEEEEecC-CCCeeeeEEEec
Q psy18049 153 DIRLGDRVIVMSA-QGSKTGVLRFKG 177 (204)
Q Consensus 153 ~~~vG~RV~v~~~-~~~~~GtVryiG 177 (204)
.|..||+|.|.++ +.++.|+|..+=
T Consensus 4 ~i~kGD~V~Vi~G~dKGk~G~V~~V~ 29 (105)
T PRK00004 4 KIKKGDTVIVIAGKDKGKRGKVLKVL 29 (105)
T ss_pred cccCCCEEEEeEcCCCCcEEEEEEEE
Confidence 6889999999874 457888888874
No 36
>TIGR01079 rplX_bact ribosomal protein L24, bacterial/organelle. This model recognizes bacterial and organellar forms of ribosomal protein L24. It excludes eukaryotic and archaeal forms, designated L26 in eukaryotes.
Probab=29.19 E-value=79 Score=24.13 Aligned_cols=26 Identities=31% Similarity=0.316 Sum_probs=20.6
Q ss_pred CCCcCCEEEEecC-CCCeeeeEEEecc
Q psy18049 153 DIRLGDRVIVMSA-QGSKTGVLRFKGT 178 (204)
Q Consensus 153 ~~~vG~RV~v~~~-~~~~~GtVryiG~ 178 (204)
.|..||+|.|.++ +.+..|+|..+=.
T Consensus 3 ~ikkGD~V~Vi~G~dKGK~G~V~~V~~ 29 (104)
T TIGR01079 3 KIKKGDTVKVISGKDKGKRGKVLKVLP 29 (104)
T ss_pred cccCCCEEEEeEcCCCCcEEEEEEEEc
Confidence 5889999999874 4578899988843
No 37
>PF00924 MS_channel: Mechanosensitive ion channel; InterPro: IPR006685 Mechanosensitive (MS) channels provide protection against hypo-osmotic shock, responding both to stretching of the cell membrane and to membrane depolarisation. They are present in the membranes of organisms from the three domains of life: bacteria, archaea, and eukarya []. There are two families of MS channels: large-conductance MS channels (MscL) and small-conductance MS channels (MscS or YGGB). The pressure threshold for MscS opening is 50% that of MscL []. The MscS family is much larger and more variable in size and sequence than the MscL family. Much of the diversity in MscS proteins occurs in the size of the transmembrane regions, which ranges from three to eleven transmembrane helices, although the three C-terminal helices are conserved. This family contains sequences form the MscS family of proteins. MscS folds as a homo-heptamer with a cylindrical shape, and can be divided into transmembrane and extramembrane regions: an N-terminal periplasmic region, a transmembrane region, and a C-terminal cytoplasmic region (middle and C-terminal domains). The transmembrane region forms a channel through the membrane that opens into a chamber enclosed by the extramembrane portion, the latter connecting to the cytoplasm through distinct portals [].; GO: 0055085 transmembrane transport, 0016020 membrane; PDB: 2OAU_E 2VV5_F.
Probab=28.30 E-value=47 Score=26.94 Aligned_cols=26 Identities=19% Similarity=0.305 Sum_probs=17.7
Q ss_pred CCCCCcCCEEEEecCCCCeeeeEEEeccCC
Q psy18049 151 GGDIRLGDRVIVMSAQGSKTGVLRFKGTTQ 180 (204)
Q Consensus 151 ~~~~~vG~RV~v~~~~~~~~GtVryiG~~~ 180 (204)
..++++||+|.+. +..|.|..+|...
T Consensus 58 ~~pf~vGD~I~i~----~~~G~V~~I~l~~ 83 (206)
T PF00924_consen 58 ERPFKVGDRIEIG----GVEGRVEEIGLRS 83 (206)
T ss_dssp C-SS-TT-EEESS----S-EEEEEEE-SSE
T ss_pred cCCccCCCEEEEE----EeehHHHhcCcce
Confidence 3589999999994 6899999998764
No 38
>PRK06531 yajC preprotein translocase subunit YajC; Validated
Probab=27.96 E-value=75 Score=24.76 Aligned_cols=23 Identities=22% Similarity=0.289 Sum_probs=20.3
Q ss_pred CCcccccEEEECCCceEEEEEEe
Q psy18049 36 DSFIIGDRVYVGGTKSGRIAFIG 58 (204)
Q Consensus 36 ~~l~vG~rV~v~g~~~G~vryiG 58 (204)
..|+.||+|.-.|...|+|..+.
T Consensus 35 ~sLk~GD~VvT~GGi~G~V~~I~ 57 (113)
T PRK06531 35 NAIQKGDEVVTIGGLYGTVDEVD 57 (113)
T ss_pred HhcCCCCEEEECCCcEEEEEEEe
Confidence 47999999999988999998885
No 39
>smart00739 KOW KOW (Kyprides, Ouzounis, Woese) motif. Motif in ribosomal proteins, NusG, Spt5p, KIN17 and T54.
Probab=25.58 E-value=1.2e+02 Score=16.37 Aligned_cols=22 Identities=27% Similarity=0.421 Sum_probs=14.2
Q ss_pred CCCcCCEEEEecC-CCCeeeeEE
Q psy18049 153 DIRLGDRVIVMSA-QGSKTGVLR 174 (204)
Q Consensus 153 ~~~vG~RV~v~~~-~~~~~GtVr 174 (204)
.+++|++|.|... ..++.|+|.
T Consensus 1 ~~~~G~~V~I~~G~~~g~~g~i~ 23 (28)
T smart00739 1 KFEVGDTVRVIAGPFKGKVGKVL 23 (28)
T ss_pred CCCCCCEEEEeECCCCCcEEEEE
Confidence 3689999999863 224455554
No 40
>PRK09816 thrL thr operon leader peptide; Provisional
Probab=25.47 E-value=59 Score=18.64 Aligned_cols=11 Identities=73% Similarity=0.712 Sum_probs=8.4
Q ss_pred eeeEEEEecCC
Q psy18049 3 TTTMTITTENR 13 (204)
Q Consensus 3 ~~~~~~~~~~~ 13 (204)
+||+||++-|-
T Consensus 9 ~ttitittgng 19 (26)
T PRK09816 9 TTTITITTGNG 19 (26)
T ss_pred EEEEEEEeCCc
Confidence 57888888774
No 41
>COG4466 Veg Uncharacterized protein conserved in bacteria [Function unknown]
Probab=25.35 E-value=1.1e+02 Score=22.46 Aligned_cols=42 Identities=26% Similarity=0.431 Sum_probs=31.4
Q ss_pred CCCcCCEEEEecCCCCeeeeEEEeccCCC-CCceEEEEEeCCCCc
Q psy18049 153 DIRLGDRVIVMSAQGSKTGVLRFKGTTQF-AQGEWCGVELDEPIG 196 (204)
Q Consensus 153 ~~~vG~RV~v~~~~~~~~GtVryiG~~~~-~~g~w~GVElD~p~G 196 (204)
+-++|+||.+++ +++|.=+++-.|.+.- .|-.++ ||||+..|
T Consensus 15 ~ah~G~~v~lk~-ngGRKk~~~r~G~L~EtYpSvFI-iel~~d~~ 57 (80)
T COG4466 15 DAHLGERVTLKA-NGGRKKTIERSGILIETYPSVFI-IELDQDEG 57 (80)
T ss_pred HhccCcEEEEEe-cCCceeeehhceEEeeecCcEEE-EEecccCC
Confidence 467999999986 5678888888888854 455544 88887654
No 42
>PF07154 DUF1392: Protein of unknown function (DUF1392); InterPro: IPR009824 This family consists of several hypothetical cyanobacterial proteins of around 150 residues in length, which seem to be specific to Anabaena species. The function of this family is unknown.
Probab=24.78 E-value=1e+02 Score=25.26 Aligned_cols=45 Identities=22% Similarity=0.310 Sum_probs=27.0
Q ss_pred cCCCCCcccccEEEECC-CceEEEEEEeecCCCCCcE-EEEEEcCCC
Q psy18049 32 TEDTDSFIIGDRVYVGG-TKSGRIAFIGETKFAPGDW-AGVVLDDPV 76 (204)
Q Consensus 32 ~~~~~~l~vG~rV~v~g-~~~G~vryiG~~~~~~g~w-~GVELd~p~ 76 (204)
+-..+.|.+||||.+.- ..--..|-|=-+.--.+.| +.|||-.|.
T Consensus 82 tv~kp~F~LGd~V~~~f~~~~pkqRlIlGv~lv~~~W~Y~VE~~SPt 128 (150)
T PF07154_consen 82 TVQKPAFRLGDRVEFRFYSDGPKQRLILGVFLVNNSWFYAVEWRSPT 128 (150)
T ss_pred eccCCceecCCEEEEEecCCCCceEEEEEEEEecCceEEEEEEeCCc
Confidence 34567899999999863 1222334332233334444 678888875
No 43
>PRK10334 mechanosensitive channel MscS; Provisional
Probab=21.12 E-value=1.1e+02 Score=27.09 Aligned_cols=25 Identities=16% Similarity=0.005 Sum_probs=20.9
Q ss_pred CCCCCcCCEEEEecCCCCeeeeEEEeccC
Q psy18049 151 GGDIRLGDRVIVMSAQGSKTGVLRFKGTT 179 (204)
Q Consensus 151 ~~~~~vG~RV~v~~~~~~~~GtVryiG~~ 179 (204)
..++++||+|.+. +..|+|.-+|--
T Consensus 127 ~rpf~vGD~I~i~----~~~G~V~~I~~r 151 (286)
T PRK10334 127 FRPFRAGEYVDLG----GVAGTVLSVQIF 151 (286)
T ss_pred cCCCCCCCEEEEC----CEEEEEEEEEeE
Confidence 4789999999994 689999988753
No 44
>PF11141 DUF2914: Protein of unknown function (DUF2914); InterPro: IPR022606 This bacterial family of proteins has no known function.
Probab=20.35 E-value=1.6e+02 Score=20.38 Aligned_cols=23 Identities=43% Similarity=0.634 Sum_probs=19.2
Q ss_pred CCcCC-EEEEecCCCCeeeeEEEe
Q psy18049 154 IRLGD-RVIVMSAQGSKTGVLRFK 176 (204)
Q Consensus 154 ~~vG~-RV~v~~~~~~~~GtVryi 176 (204)
-..|+ ||.|...++...+.++|-
T Consensus 41 ~~~G~WrV~V~~~~G~~l~~~~F~ 64 (66)
T PF11141_consen 41 DQPGDWRVEVVDEDGQVLGSLRFS 64 (66)
T ss_pred CCCcCEEEEEEcCCCCEEEEEEEE
Confidence 57788 999988778889999883
No 45
>PF04246 RseC_MucC: Positive regulator of sigma(E), RseC/MucC; InterPro: IPR007359 This bacterial family of integral membrane proteins represents a positive regulator of the sigma(E) transcription factor, namely RseC/MucC. The sigma(E) transcription factor is up-regulated by cell envelope protein misfolding, and regulates the expression of genes that are collectively termed ECF (devoted to Extra-Cellular Functions) []. In Pseudomonas aeruginosa, derepression of sigma(E) is associated with the alginate-overproducing phenotype characteristic of chronic respiratory tract colonization in cystic fibrosis patients. The mechanism by which RseC/MucC positively regulates the sigma(E) transcription factor is unknown. RseC is also thought to have a role in thiamine biosynthesis in Salmonella typhimurium []. In addition, this family also includes an N-terminal part of RnfF, a Rhodobacter capsulatus protein, of unknown function, that is essential for nitrogen fixation. This protein also contains a domain found in ApbE protein IPR003374 from INTERPRO, which is itself involved in thiamine biosynthesis.
Probab=20.28 E-value=1.8e+02 Score=22.42 Aligned_cols=43 Identities=14% Similarity=0.139 Sum_probs=24.0
Q ss_pred eEEEEecCCCcccccc---ccc--------cCCCcccccCCCCCcccccEEEEC
Q psy18049 5 TMTITTENRNPFQNCF---LVS--------ISDSSQVLTEDTDSFIIGDRVYVG 47 (204)
Q Consensus 5 ~~~~~~~~~~~~~~~~---~~~--------~~~~~~~~~~~~~~l~vG~rV~v~ 47 (204)
..++..++..--.+|. .|. ..............+++||+|.|.
T Consensus 8 ~~~V~~~r~saC~~C~~~~~Cg~~~~~~~~~~~~~~~~~~~~~~~~~GD~V~v~ 61 (135)
T PF04246_consen 8 IAWVEVQRSSACGSCSASGGCGTGLLAKLFSGKPITFRAPNPIGAKVGDRVEVE 61 (135)
T ss_pred EEEEEEccCCcCcccCCCCCCCcchhhhhcCCCcEEEEecCCCCCCCCCEEEEE
Confidence 4566666666655554 222 112222233455678889988875
No 46
>PRK10334 mechanosensitive channel MscS; Provisional
Probab=20.19 E-value=1.2e+02 Score=26.90 Aligned_cols=24 Identities=25% Similarity=0.405 Sum_probs=20.4
Q ss_pred CCcccccEEEECCCceEEEEEEeec
Q psy18049 36 DSFIIGDRVYVGGTKSGRIAFIGET 60 (204)
Q Consensus 36 ~~l~vG~rV~v~g~~~G~vryiG~~ 60 (204)
..+++||.|++.+ ..|+|.-||-.
T Consensus 128 rpf~vGD~I~i~~-~~G~V~~I~~r 151 (286)
T PRK10334 128 RPFRAGEYVDLGG-VAGTVLSVQIF 151 (286)
T ss_pred CCCCCCCEEEECC-EEEEEEEEEeE
Confidence 4699999999986 69999999654
Done!