Query psy18108
Match_columns 102
No_of_seqs 138 out of 1031
Neff 7.3
Searched_HMMs 46136
Date Fri Aug 16 22:43:41 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy18108.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/18108hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF08246 Inhibitor_I29: Cathep 99.9 6.3E-22 1.4E-26 115.3 7.3 58 34-91 1-58 (58)
2 PTZ00203 cathepsin L protease; 99.8 6.4E-19 1.4E-23 134.1 11.0 68 27-95 31-98 (348)
3 smart00848 Inhibitor_I29 Cathe 99.8 1.6E-19 3.6E-24 104.1 5.0 57 34-90 1-57 (57)
4 PTZ00021 falcipain-2; Provisio 99.7 3E-17 6.6E-22 129.4 8.9 67 30-96 165-231 (489)
5 PTZ00200 cysteine proteinase; 99.7 9.9E-17 2.1E-21 125.5 10.5 69 26-96 118-186 (448)
6 KOG1542|consensus 99.6 8.8E-15 1.9E-19 110.6 7.4 72 29-100 66-137 (372)
7 KOG1543|consensus 98.8 2.5E-08 5.4E-13 75.6 8.0 60 38-97 30-89 (325)
8 PF08127 Propeptide_C1: Peptid 96.4 0.0041 8.9E-08 33.7 2.8 37 63-101 4-40 (41)
9 PF07172 GRP: Glycine rich pro 51.3 9.6 0.00021 24.1 1.3 13 1-13 1-13 (95)
10 TIGR02792 PCA_ligA protocatech 49.3 17 0.00036 24.0 2.2 19 82-100 93-113 (117)
11 PF13986 DUF4224: Domain of un 45.2 19 0.00041 19.7 1.7 15 86-100 3-17 (47)
12 PRK13377 protocatechuate 4,5-d 43.2 23 0.0005 23.7 2.2 19 82-100 99-119 (129)
13 PF05443 ROS_MUCR: ROS/MUCR tr 38.1 21 0.00045 24.0 1.4 14 85-98 94-107 (132)
14 cd07924 PCA_45_Doxase_A The A 35.9 37 0.0008 22.5 2.3 19 82-100 96-116 (121)
15 COG1678 Putative transcription 32.6 8.1 0.00018 27.7 -1.3 42 57-100 28-70 (194)
16 PF03032 Brevenin: Brevenin/es 29.7 37 0.0008 18.7 1.3 15 8-22 7-21 (46)
17 PRK12702 mannosyl-3-phosphogly 29.5 32 0.0007 26.3 1.4 24 76-99 114-137 (302)
18 PF02209 VHP: Villin headpiece 28.6 38 0.00082 17.6 1.1 13 86-98 2-14 (36)
19 PF03047 ComC: COMC family; I 27.3 21 0.00045 18.2 0.0 20 78-97 6-25 (32)
20 COG4957 Predicted transcriptio 27.0 51 0.0011 22.5 1.8 14 85-98 98-111 (148)
21 PF08194 DIM: DIM protein; In 26.6 91 0.002 16.3 2.4 15 6-20 5-19 (36)
22 PF05391 Lsm_interact: Lsm int 25.8 55 0.0012 15.1 1.3 11 85-95 9-19 (21)
23 KOG2245|consensus 25.1 2.9E+02 0.0064 23.0 6.1 27 30-57 33-59 (562)
24 smart00153 VHP Villin headpiec 24.9 56 0.0012 16.9 1.4 12 86-97 2-13 (36)
25 PF04021 Class_IIIsignal: Clas 23.9 88 0.0019 15.2 2.0 14 4-17 4-17 (28)
26 PF14940 TMEM219: Transmembran 21.2 1.7E+02 0.0036 21.4 3.7 18 26-43 38-55 (223)
No 1
>PF08246 Inhibitor_I29: Cathepsin propeptide inhibitor domain (I29); InterPro: IPR013201 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. This entry represents a peptidase inhibitor domain, which belongs to MEROPS peptidase inhibitor family I29. The domain is also found at the N terminus of a variety of peptidase precursors that belong to MEROPS peptidase subfamily C1A; these include cathepsin L, papain, and procaricain (P10056 from SWISSPROT) []. It forms an alpha-helical domain that runs through the substrate-binding site, preventing access. Removal of this region by proteolytic cleavage results in activation of the enzyme. This domain is also found, in one or more copies, in a variety of cysteine peptidase inhibitors such as salarin [].; PDB: 3QT4_A 3QJ3_A 2C0Y_A 2L95_A 1CJL_A 1CS8_A 7PCK_A 1BY8_A 1PCI_A 2O6X_A ....
Probab=99.87 E-value=6.3e-22 Score=115.30 Aligned_cols=58 Identities=45% Similarity=0.782 Sum_probs=51.6
Q ss_pred HHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcCCCcceeeecccCCCCCHHHH
Q psy18108 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM 91 (102)
Q Consensus 34 f~~f~~~y~K~Y~~~~e~~~R~~~F~~N~~~I~~hN~~~~~s~~lglN~faD~T~eEf 91 (102)
|++|+++|+|.|.+.+|...|+.+|++|++.|.+||..+..+|++|+|+|||||++||
T Consensus 1 F~~~~~~~~k~Y~~~~e~~~R~~~F~~N~~~I~~~N~~~~~~~~~~~N~fsD~t~eEf 58 (58)
T PF08246_consen 1 FEQFKKKYGKSYKSAEEEARRFAIFKENLRRIEEHNANGNNTYKLGLNQFSDMTPEEF 58 (58)
T ss_dssp HHHHHHHCT---SSHHHHHHHHHHHHHHHHHHHHHHHTTSSSEEE-SSTTTTSSHHHH
T ss_pred CHHHHHHcCCCCCCHHHHHHHHHHHHHHHHHHHHHhcCCCCCeEEeCccccCcChhhC
Confidence 7899999999999999999999999999999999997777789999999999999997
No 2
>PTZ00203 cathepsin L protease; Provisional
Probab=99.80 E-value=6.4e-19 Score=134.09 Aligned_cols=68 Identities=31% Similarity=0.531 Sum_probs=61.9
Q ss_pred ChHHHHHHHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcCCCcceeeecccCCCCCHHHHHHhh
Q psy18108 27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSRL 95 (102)
Q Consensus 27 ~~~~~~~f~~f~~~y~K~Y~~~~e~~~R~~~F~~N~~~I~~hN~~~~~s~~lglN~faD~T~eEf~~~~ 95 (102)
..+++..|++|+.+|+|.|.+..|+..|+.+|++|+++|++||+++. +|++|+|+|+|||+|||.+++
T Consensus 31 ~~~~~~~f~~~~~~~~K~Y~~~~E~~~R~~iF~~N~~~I~~~N~~~~-~~~lg~N~FaDlT~eEf~~~~ 98 (348)
T PTZ00203 31 GTPAAALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARNP-HARFGITKFFDLSEAEFAARY 98 (348)
T ss_pred ccHHHHHHHHHHHHhCCCCCChHHHHHHHHHHHHHHHHHHHHhccCC-CeEEeccccccCCHHHHHHHh
Confidence 33578899999999999999988888999999999999999998754 799999999999999999875
No 3
>smart00848 Inhibitor_I29 Cathepsin propeptide inhibitor domain (I29). This domain is found at the N-terminus of some C1 peptidases such as Cathepsin L where it acts as a propeptide. There are also a number of proteins that are composed solely of multiple copies of this domain such as the peptidase inhibitor salarin. This family is classified as I29 by MEROPS. Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a s
Probab=99.79 E-value=1.6e-19 Score=104.06 Aligned_cols=57 Identities=44% Similarity=0.797 Sum_probs=53.8
Q ss_pred HHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcCCCcceeeecccCCCCCHHH
Q psy18108 34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE 90 (102)
Q Consensus 34 f~~f~~~y~K~Y~~~~e~~~R~~~F~~N~~~I~~hN~~~~~s~~lglN~faD~T~eE 90 (102)
|+.|+.+|+|.|.+.+|...|+.+|++|++.|..||..+..+|++|+|+|||||++|
T Consensus 1 f~~~~~~~~k~y~~~~e~~~r~~~f~~n~~~i~~~N~~~~~~~~~~~N~fsDlt~eE 57 (57)
T smart00848 1 FEQWKKKYGKSYSSEEEELRRFEIFKENLKFIEEHNKKNDHSYTLGLNQFADLTNEE 57 (57)
T ss_pred ChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHHHhcCCCCeEecCcccccCCCCC
Confidence 688999999999999999999999999999999999887778999999999999876
No 4
>PTZ00021 falcipain-2; Provisional
Probab=99.71 E-value=3e-17 Score=129.36 Aligned_cols=67 Identities=31% Similarity=0.504 Sum_probs=62.0
Q ss_pred HHHHHHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcCCCcceeeecccCCCCCHHHHHHhhC
Q psy18108 30 HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSRLG 96 (102)
Q Consensus 30 ~~~~f~~f~~~y~K~Y~~~~e~~~R~~~F~~N~~~I~~hN~~~~~s~~lglN~faD~T~eEf~~~~g 96 (102)
....|++|+.+|+|.|.+.+|+..|+.+|++|+++|++||+.+..+|++|+|+|+|||+|||++++.
T Consensus 165 ~~~~F~~wk~ky~K~Y~~~eE~~~R~~iF~~Nl~~Ie~hN~~~~~ty~lgiNqFsDlT~EEF~~~~l 231 (489)
T PTZ00021 165 NVNSFYLFIKEHGKKYQTPDEMQQRYLSFVENLAKINAHNNKENVLYKKGMNRFGDLSFEEFKKKYL 231 (489)
T ss_pred HHHHHHHHHHHhCCcCCCHHHHHHHHHHHHHHHHHHHHhhccCCCCEEEeccccccCCHHHHHHHhc
Confidence 5678999999999999998899999999999999999999876668999999999999999998764
No 5
>PTZ00200 cysteine proteinase; Provisional
Probab=99.70 E-value=9.9e-17 Score=125.51 Aligned_cols=69 Identities=26% Similarity=0.375 Sum_probs=62.0
Q ss_pred CChHHHHHHHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcCCCcceeeecccCCCCCHHHHHHhhC
Q psy18108 26 ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSRLG 96 (102)
Q Consensus 26 ~~~~~~~~f~~f~~~y~K~Y~~~~e~~~R~~~F~~N~~~I~~hN~~~~~s~~lglN~faD~T~eEf~~~~g 96 (102)
.+.++...|++|+.+|+|.|.+..|+..|+.+|++|++.|++||.. .+|++|+|+|||||+|||.+++.
T Consensus 118 ~e~e~~~~F~~f~~ky~K~Y~~~~E~~~R~~iF~~Nl~~I~~hN~~--~~y~lgiN~FsDlT~eEF~~~~~ 186 (448)
T PTZ00200 118 LEFEVYLEFEEFNKKYNRKHATHAERLNRFLTFRNNYLEVKSHKGD--EPYSKEINKFSDLTEEEFRKLFP 186 (448)
T ss_pred chHHHHHHHHHHHHHhCCcCCCHHHHHHHHHHHHHHHHHHHHhcCc--CCeEEeccccccCCHHHHHHHhc
Confidence 3445778899999999999999889999999999999999999963 47999999999999999998763
No 6
>KOG1542|consensus
Probab=99.56 E-value=8.8e-15 Score=110.62 Aligned_cols=72 Identities=35% Similarity=0.528 Sum_probs=65.9
Q ss_pred HHHHHHHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcCCCcceeeecccCCCCCHHHHHHhhCCCCC
Q psy18108 29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSRLGLNLS 100 (102)
Q Consensus 29 ~~~~~f~~f~~~y~K~Y~~~~e~~~R~~~F~~N~~~I~~hN~~~~~s~~lglN~faD~T~eEf~~~~g~~~~ 100 (102)
.....|..|+.+|+|+|.+.+|..+|+.+|+.|+..+++++.....|..+|+|+|||||+|||+++++..++
T Consensus 66 ~~~~~F~~F~~kf~r~Y~s~eE~~~Rl~iF~~N~~~a~~~q~~d~gsA~yGvtqFSDlT~eEFkk~~l~~~~ 137 (372)
T KOG1542|consen 66 GLEDSFKLFTIKFGRSYASREEHAHRLSIFKHNLLRAERLQENDPGSAEYGVTQFSDLTEEEFKKIYLGVKR 137 (372)
T ss_pred chHHHHHHHHHhcCcccCcHHHHHHHHHHHHHHHHHHHHhhhcCccccccCccchhhcCHHHHHHHhhcccc
Confidence 347889999999999999999999999999999999999998877789999999999999999999875543
No 7
>KOG1543|consensus
Probab=98.80 E-value=2.5e-08 Score=75.57 Aligned_cols=60 Identities=33% Similarity=0.416 Sum_probs=52.0
Q ss_pred HHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcCCCcceeeecccCCCCCHHHHHHhhCC
Q psy18108 38 IRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSRLGL 97 (102)
Q Consensus 38 ~~~y~K~Y~~~~e~~~R~~~F~~N~~~I~~hN~~~~~s~~lglN~faD~T~eEf~~~~g~ 97 (102)
+.++.+.|.+..+...|+.+|.+|++.|..||.....+|++++|+|+|+|.+|+++.+..
T Consensus 30 ~~~~~~~y~~~~~~~~r~~~f~~n~~~~~~~n~~~~~~~~~g~n~~~d~~~ee~~~~~~~ 89 (325)
T KOG1543|consen 30 LVKFLKRYEDRVEKKARRAIFKENLQKIESHNLKYVLSFLMGVNQFADLTTEEFKRKKTG 89 (325)
T ss_pred hhhhccccccHHHHHHHHHHHHHHHHHHHhhhhhhceeeeeccccccccchHHHHHhhcc
Confidence 666666676667888899999999999999999756689999999999999999998754
No 8
>PF08127 Propeptide_C1: Peptidase family C1 propeptide; InterPro: IPR012599 This domain is found at the N-terminal of cathepsin B and cathepsin B-like peptidases that belong to MEROPS peptidase subfamily C1A. Cathepsin B are lysosomal cysteine proteinases belonging to the papain superfamily and are unique in their ability to act as both an endo- and an exopeptidases. They are synthesized as inactive zymogens. Activation of the peptidases occurs with the removal of the propeptide [, ]. ; GO: 0004197 cysteine-type endopeptidase activity, 0050790 regulation of catalytic activity; PDB: 1MIR_A 1PBH_A 2PBH_A 3PBH_A.
Probab=96.41 E-value=0.0041 Score=33.72 Aligned_cols=37 Identities=24% Similarity=0.281 Sum_probs=24.0
Q ss_pred HHHHHhhcCCCcceeeecccCCCCCHHHHHHhhCCCCCC
Q psy18108 63 KLIEDLNKGEHGTATYGINHLSDLTREEMKSRLGLNLSK 101 (102)
Q Consensus 63 ~~I~~hN~~~~~s~~lglN~faD~T~eEf~~~~g~~~~~ 101 (102)
++|+..|.. ..+++-|.| |.+.|.+.+++++|..+-+
T Consensus 4 e~I~~IN~~-~~tWkAG~N-F~~~~~~~ik~LlGv~~~~ 40 (41)
T PF08127_consen 4 EFIDYINSK-NTTWKAGRN-FENTSIEYIKRLLGVLPDP 40 (41)
T ss_dssp HHHHHHHHC-T-SEEE-----SSB-HHHHHHCS-B-TTS
T ss_pred HHHHHHHcC-CCcccCCCC-CCCCCHHHHHHHcCCCCCC
Confidence 568888887 558999999 8999999999999976544
No 9
>PF07172 GRP: Glycine rich protein family; InterPro: IPR010800 This family consists of glycine rich proteins. Some of them may be involved in resistance to environmental stress [].
Probab=51.30 E-value=9.6 Score=24.08 Aligned_cols=13 Identities=38% Similarity=0.194 Sum_probs=6.5
Q ss_pred CcchhhHHHHHHH
Q psy18108 1 MAEDASAEATLAL 13 (102)
Q Consensus 1 ~~~~~~~~~~~~~ 13 (102)
||..+-++++|+|
T Consensus 1 MaSK~~llL~l~L 13 (95)
T PF07172_consen 1 MASKAFLLLGLLL 13 (95)
T ss_pred CchhHHHHHHHHH
Confidence 6655544444443
No 10
>TIGR02792 PCA_ligA protocatechuate 4,5-dioxygenase, alpha subunit. Protocatechuate (PCA) 4,5-dioxygenase is the first enzyme in the PCA 4,5-cleavage pathway that is an alternative to PCA 3,4-cleavage and PCA 2,3 cleavage pathways. PCA is an intermediate in the breakdown of lignin (hence the gene symbol ligA) and other compounds. Members of this family are the alpha chain of PCA 4,5-dioxygenase, or the equivalent domain of a fusion protein.
Probab=49.34 E-value=17 Score=24.01 Aligned_cols=19 Identities=21% Similarity=0.382 Sum_probs=14.7
Q ss_pred cCCCCCHHHHHHhh--CCCCC
Q psy18108 82 HLSDLTREEMKSRL--GLNLS 100 (102)
Q Consensus 82 ~faD~T~eEf~~~~--g~~~~ 100 (102)
.+..||.|||++++ |.+.|
T Consensus 93 ~mtG~t~eef~~mm~~ggr~~ 113 (117)
T TIGR02792 93 SMTGMTEEEYRQMMIGGGRSP 113 (117)
T ss_pred HhcCCCHHHHHHHHHhCCCCC
Confidence 46789999999987 45544
No 11
>PF13986 DUF4224: Domain of unknown function (DUF4224)
Probab=45.16 E-value=19 Score=19.72 Aligned_cols=15 Identities=33% Similarity=0.445 Sum_probs=13.1
Q ss_pred CCHHHHHHhhCCCCC
Q psy18108 86 LTREEMKSRLGLNLS 100 (102)
Q Consensus 86 ~T~eEf~~~~g~~~~ 100 (102)
||++|...+-|+++|
T Consensus 3 LT~~El~elTG~k~~ 17 (47)
T PF13986_consen 3 LTDEELQELTGYKRP 17 (47)
T ss_pred CCHHHHHHHHCCCCH
Confidence 799999999998875
No 12
>PRK13377 protocatechuate 4,5-dioxygenase subunit alpha; Provisional
Probab=43.15 E-value=23 Score=23.74 Aligned_cols=19 Identities=21% Similarity=0.382 Sum_probs=14.6
Q ss_pred cCCCCCHHHHHHhh--CCCCC
Q psy18108 82 HLSDLTREEMKSRL--GLNLS 100 (102)
Q Consensus 82 ~faD~T~eEf~~~~--g~~~~ 100 (102)
.+..||.|||++++ |.|.|
T Consensus 99 ~mtG~t~eef~~mm~~ggr~~ 119 (129)
T PRK13377 99 SMTGMTEEEYRQMMLGGGRSP 119 (129)
T ss_pred HhcCCCHHHHHHHHHhCCCCC
Confidence 46789999999987 45544
No 13
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=38.09 E-value=21 Score=24.00 Aligned_cols=14 Identities=43% Similarity=0.672 Sum_probs=10.2
Q ss_pred CCCHHHHHHhhCCC
Q psy18108 85 DLTREEMKSRLGLN 98 (102)
Q Consensus 85 D~T~eEf~~~~g~~ 98 (102)
+||++||++.+|+.
T Consensus 94 gltp~eYR~kwGlp 107 (132)
T PF05443_consen 94 GLTPEEYRAKWGLP 107 (132)
T ss_dssp -S-HHHHHHHTT-G
T ss_pred CCCHHHHHHHhCcC
Confidence 79999999999975
No 14
>cd07924 PCA_45_Doxase_A The A subunit of Protocatechuate 4,5-dioxygenase (LigAB) is the smaller, non-catalytic subunit. The A subunit is the non-catalytic subunit of Protocatechuate (PCA) 4,5-dioxygenase (LigAB), which is composed of A and B subunits that form a tetramer. PCA 4,5-dioxygenase catalyzes the oxidization and subsequent ring-opening of PCA (or 3,4-dihydroxybenzoic acid), which is an intermediate in the breakdown of lignin and other compounds. PCA 4,5-dioxygenase is one of the aromatic ring opening dioxygenases which play key roles in the degradation of aromatic compounds. As a member of the Class III extradiol dioxygenase family, LigAB uses a non-heme Fe(II) to cleave aromatic rings between a hydroxylated carbon and an adjacent non-hydroxylated carbon.
Probab=35.85 E-value=37 Score=22.54 Aligned_cols=19 Identities=16% Similarity=0.328 Sum_probs=14.7
Q ss_pred cCCCCCHHHHHHhh--CCCCC
Q psy18108 82 HLSDLTREEMKSRL--GLNLS 100 (102)
Q Consensus 82 ~faD~T~eEf~~~~--g~~~~ 100 (102)
.+..+|.|||++++ |.|.|
T Consensus 96 ~mtG~s~eef~~mm~~ggr~~ 116 (121)
T cd07924 96 SMTGMSMEEYRQMMVDGGRSP 116 (121)
T ss_pred HhcCCCHHHHHHHHHhCCCCC
Confidence 57789999999987 45544
No 15
>COG1678 Putative transcriptional regulator [Transcription]
Probab=32.58 E-value=8.1 Score=27.66 Aligned_cols=42 Identities=26% Similarity=0.356 Sum_probs=34.9
Q ss_pred HHHHHHHHHHHhhcCCCcceeeecccCCC-CCHHHHHHhhCCCCC
Q psy18108 57 VFEDNLKLIEDLNKGEHGTATYGINHLSD-LTREEMKSRLGLNLS 100 (102)
Q Consensus 57 ~F~~N~~~I~~hN~~~~~s~~lglN~faD-~T~eEf~~~~g~~~~ 100 (102)
+|.+.+-+|-+||.+| ++-+-+|+-++ +|-+|+-...+...+
T Consensus 28 ~F~RSVvyiceH~e~G--amGliiNrp~~~~~~~~l~~~~~~~~~ 70 (194)
T COG1678 28 IFARSVVYICEHNENG--AMGLIINRPSETLTLAELLEKLKLEIK 70 (194)
T ss_pred ccceeEEEEEEeCCCC--cEEEEecCCccccCHHHHHHHhccccc
Confidence 5777888899999877 58899999999 999999888775543
No 16
>PF03032 Brevenin: Brevenin/esculentin/gaegurin/rugosin family; InterPro: IPR004275 In addition to the highly specific cell-mediated immune system, vertebrates possess an efficient host-defence mechanism against invading microorganisms which involves the synthesis of highly potent antimicrobial peptides with a large spectrum of activity. This entry represents a number of these defence peptides secreted from the skin of amphibians, including the opiate-like dermorphins and deltorphins, and the antimicrobial dermoseptins and temporins.; GO: 0006952 defense response, 0042742 defense response to bacterium, 0005576 extracellular region
Probab=29.68 E-value=37 Score=18.65 Aligned_cols=15 Identities=13% Similarity=0.056 Sum_probs=9.9
Q ss_pred HHHHHHHHHhhcccc
Q psy18108 8 EATLALFGQMKSNNE 22 (102)
Q Consensus 8 ~~~~~~~~~~~~~~~ 22 (102)
++.++.||++|.+.+
T Consensus 7 llLlfflG~ISlSlC 21 (46)
T PF03032_consen 7 LLLLFFLGTISLSLC 21 (46)
T ss_pred HHHHHHHHHcccchH
Confidence 456677787766655
No 17
>PRK12702 mannosyl-3-phosphoglycerate phosphatase; Reviewed
Probab=29.51 E-value=32 Score=26.26 Aligned_cols=24 Identities=21% Similarity=0.358 Sum_probs=19.7
Q ss_pred eeeecccCCCCCHHHHHHhhCCCC
Q psy18108 76 ATYGINHLSDLTREEMKSRLGLNL 99 (102)
Q Consensus 76 ~~lglN~faD~T~eEf~~~~g~~~ 99 (102)
+...+..|+|||.+|+.+..|+..
T Consensus 114 ~~~~f~gF~d~t~~ei~~~TGL~~ 137 (302)
T PRK12702 114 SHLDLIGFGDWTASELAAATGIPL 137 (302)
T ss_pred hCCCceehhhCCHHHHHHHhCcCH
Confidence 445678999999999999988753
No 18
>PF02209 VHP: Villin headpiece domain; InterPro: IPR003128 Villin is an F-actin bundling protein involved in the maintenance of the microvilli of the absorptive epithelia. The villin-type "headpiece" domain is a modular motif found at the extreme C terminus of larger "core" domains in over 25 cytoskeletal proteins in plants and animals, often in assocation with the Gelsolin repeat. Although the headpiece is classified as an F-actin-binding domain, it has been shown that not all headpiece domains are intrinsically F-actin-binding motifs, surface charge distribution may be an important element for F-actin recognition []. An autonomously folding, 35 residue, thermostable subdomain (HP36) of the full-length 76 amino acid residue villin headpiece, is the smallest known example of a cooperatively folded domain of a naturally occurring protein. The structure of HP36, as determined by NMR spectroscopy, consists of three short helices surrounding a tightly packed hydrophobic core []. ; GO: 0003779 actin binding, 0007010 cytoskeleton organization; PDB: 1ZV6_A 1QZP_A 1UND_A 2PPZ_A 3TJW_B 1YU8_X 2JM0_A 1WY4_A 3MYC_A 1YU5_X ....
Probab=28.57 E-value=38 Score=17.61 Aligned_cols=13 Identities=23% Similarity=0.634 Sum_probs=8.3
Q ss_pred CCHHHHHHhhCCC
Q psy18108 86 LTREEMKSRLGLN 98 (102)
Q Consensus 86 ~T~eEf~~~~g~~ 98 (102)
+|++||.+.+|..
T Consensus 2 Lsd~dF~~vFgm~ 14 (36)
T PF02209_consen 2 LSDEDFEKVFGMS 14 (36)
T ss_dssp S-HHHHHHHHSS-
T ss_pred cCHHHHHHHHCCC
Confidence 5778888877754
No 19
>PF03047 ComC: COMC family; InterPro: IPR004288 Competence is the ability of a cell to take up exogenous DNA from its environment, resulting in transformation. It is widespread among bacteria and is probably an important mechanism for the horizontal transfer of genes. Cells that take up DNA inevitably acquire the nucleotides the DNA consists of, and, because nucleotides are needed for DNA and RNA synthesis and are expensive to synthesise, these may make a significant contribution to the cell's energy budget []. The lateral gene transfer caused by competence also contributes to the genetic diversity that makes evolution possible. DNA usually becomes available by the death and lysis of other cells. Competent bacteria use components of extracellular filaments called type 4 pili to create pores in their membranes and pull DNA through the pores into the cytoplasm. This process, including the development of competence and the expression of the uptake machinery, is regulated in response to cell-cell signalling and/or nutritional conditions []. This family consists of streptococcal competence stimulating peptide precursors, which are generally up to 50 amino acid residues long. In all the members of this family, the leader sequence is cleaved after two conserved glycine residues; thus the leader sequence is of the double- glycine type []. Competence stimulating peptides (CSP) are small (less than 25 amino acid residues) cationic peptides. The N-terminal amino acid residue is negatively charged, either glutamate or aspartate. The C-terminal end is positively charged. The third residue is also positively charged: a highly conserved arginine []. Some COMC proteins and their precursors (not included in this family) do not fully follow the above description. Functionally, CSP act as pheromones, stimulating competence for genetic transformation in streptococci. In streptococci, the (CSP mediated) competence response requires exponential cell growth at a critical density, a relatively simple requirement when compared to the stationary-phase requirement of Haemophilus, or the late-logarithmic- phase of Bacillus []. All bacteria induced to competence by a particular CSP are said to belong to the same pherotype, because each CSP is recognised by a specific receptor (the signalling domain of a histidine kinase ComD). Pherotypes are not necessarily species-specific. In addition, an organism may change pherotype. There are two possible mechanisms for pherotype switching: horizontal gene transfer, and accumulation of point mutations. The biological significance of pherotypes and pherotype switching is not definitively determined. Pherotype switching occurs frequently enough in naturally competent streptococci to suggest that it may be an important contributor to genetic exchange between different bacterial species []. This entry also includes proteins that form bacteriocin-like propetides with a glycine-glycine cleavage site. The bacteriocin is initially formed as a pre-propeptide and upon cleavage at the glycine-glycine cleavage site, a leader peptide and the propeptide would be formed. The propeptide then undergoes posttranslational modification before becoming functional [].; GO: 0005186 pheromone activity; PDB: 2I2J_A 2I2H_A 2A1C_A.
Probab=27.30 E-value=21 Score=18.16 Aligned_cols=20 Identities=25% Similarity=0.391 Sum_probs=0.0
Q ss_pred eecccCCCCCHHHHHHhhCC
Q psy18108 78 YGINHLSDLTREEMKSRLGL 97 (102)
Q Consensus 78 lglN~faD~T~eEf~~~~g~ 97 (102)
..+++|.-+|++|....-|.
T Consensus 6 ~~L~~F~~lt~~eL~~I~GG 25 (32)
T PF03047_consen 6 KTLEQFEELTEEELQEIQGG 25 (32)
T ss_dssp --------------------
T ss_pred hhHHHHhcCCHHHHhhccCC
Confidence 36789999999999888764
No 20
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=27.00 E-value=51 Score=22.50 Aligned_cols=14 Identities=36% Similarity=0.650 Sum_probs=12.5
Q ss_pred CCCHHHHHHhhCCC
Q psy18108 85 DLTREEMKSRLGLN 98 (102)
Q Consensus 85 D~T~eEf~~~~g~~ 98 (102)
+||++||++..|+.
T Consensus 98 gmTPd~YR~KW~LP 111 (148)
T COG4957 98 GLTPDEYRAKWGLP 111 (148)
T ss_pred CCCHHHHHHhcCCC
Confidence 79999999998875
No 21
>PF08194 DIM: DIM protein; InterPro: IPR013172 Drosophila immune-induced molecules (DIMs) are short proteins induced during the immune response of Drosophila []. This entry includes DIMs 1 to 4 and DIM23.
Probab=26.61 E-value=91 Score=16.28 Aligned_cols=15 Identities=20% Similarity=0.109 Sum_probs=8.4
Q ss_pred hHHHHHHHHHHhhcc
Q psy18108 6 SAEATLALFGQMKSN 20 (102)
Q Consensus 6 ~~~~~~~~~~~~~~~ 20 (102)
|+-+++++++++++.
T Consensus 5 ~~a~~l~lLal~~a~ 19 (36)
T PF08194_consen 5 SLAFALLLLALAAAV 19 (36)
T ss_pred HHHHHHHHHHHHhcc
Confidence 444566666655544
No 22
>PF05391 Lsm_interact: Lsm interaction motif; InterPro: IPR008669 This short motif is found at the C terminus of Prp24 proteins and probably interacts with the Lsm proteins to promote U4/U6 formation [].
Probab=25.83 E-value=55 Score=15.07 Aligned_cols=11 Identities=0% Similarity=0.277 Sum_probs=8.3
Q ss_pred CCCHHHHHHhh
Q psy18108 85 DLTREEMKSRL 95 (102)
Q Consensus 85 D~T~eEf~~~~ 95 (102)
-+++++|++++
T Consensus 9 p~SNddFrkmf 19 (21)
T PF05391_consen 9 PKSNDDFRKMF 19 (21)
T ss_pred ccchHHHHHHH
Confidence 46788888875
No 23
>KOG2245|consensus
Probab=25.09 E-value=2.9e+02 Score=22.96 Aligned_cols=27 Identities=22% Similarity=0.317 Sum_probs=19.2
Q ss_pred HHHHHHHHHHHhCCCCCCHHHHHHHHHH
Q psy18108 30 HLKQFEKFIRDFSKSYPTKEEVAKRFAV 57 (102)
Q Consensus 30 ~~~~f~~f~~~y~K~Y~~~~e~~~R~~~ 57 (102)
.-..-.+.++.+| .|.+.+|..+|..+
T Consensus 33 lt~~L~~~L~~~g-~fEs~eEt~~R~~V 59 (562)
T KOG2245|consen 33 LTQELIKTLKNEG-LFESKEETQRREEV 59 (562)
T ss_pred HHHHHHHHHHhcC-CCCCHHHHHHHHHH
Confidence 3445677777776 78888888888654
No 24
>smart00153 VHP Villin headpiece domain.
Probab=24.88 E-value=56 Score=16.90 Aligned_cols=12 Identities=25% Similarity=0.664 Sum_probs=8.3
Q ss_pred CCHHHHHHhhCC
Q psy18108 86 LTREEMKSRLGL 97 (102)
Q Consensus 86 ~T~eEf~~~~g~ 97 (102)
++++||...+|.
T Consensus 2 LsdeeF~~vfgm 13 (36)
T smart00153 2 LSDEDFEEVFGM 13 (36)
T ss_pred CCHHHHHHHHCC
Confidence 567777777764
No 25
>PF04021 Class_IIIsignal: Class III signal peptide; InterPro: IPR007166 This entry represents an amino terminal motif QXSXEXXXL thought to be part of a class III signal sequence for a family of archaeal proteins. The Q residue is the +1 residue of the signal peptidase cleavage site []. Two proteins containing this motif are cleaved by a type IV pilin-like signal peptidase.
Probab=23.94 E-value=88 Score=15.19 Aligned_cols=14 Identities=14% Similarity=0.138 Sum_probs=10.1
Q ss_pred hhhHHHHHHHHHHh
Q psy18108 4 DASAEATLALFGQM 17 (102)
Q Consensus 4 ~~~~~~~~~~~~~~ 17 (102)
..|+++.+++++++
T Consensus 4 Q~SlE~~ili~~vl 17 (28)
T PF04021_consen 4 QISLEFIILIAAVL 17 (28)
T ss_pred HHhHHHHHHHHHHH
Confidence 35788888777765
No 26
>PF14940 TMEM219: Transmembrane 219
Probab=21.22 E-value=1.7e+02 Score=21.40 Aligned_cols=18 Identities=22% Similarity=0.586 Sum_probs=14.8
Q ss_pred CChHHHHHHHHHHHHhCC
Q psy18108 26 ENPEHLKQFEKFIRDFSK 43 (102)
Q Consensus 26 ~~~~~~~~f~~f~~~y~K 43 (102)
.++++.++|..|....++
T Consensus 38 ~nPDi~~DWN~fL~~ls~ 55 (223)
T PF14940_consen 38 KNPDIPQDWNTFLLSLSQ 55 (223)
T ss_pred CcccchhhHHHHHHhhcC
Confidence 344688999999999987
Done!