Query         psy18108
Match_columns 102
No_of_seqs    138 out of 1031
Neff          7.3 
Searched_HMMs 46136
Date          Fri Aug 16 22:43:41 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy18108.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/18108hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF08246 Inhibitor_I29:  Cathep  99.9 6.3E-22 1.4E-26  115.3   7.3   58   34-91      1-58  (58)
  2 PTZ00203 cathepsin L protease;  99.8 6.4E-19 1.4E-23  134.1  11.0   68   27-95     31-98  (348)
  3 smart00848 Inhibitor_I29 Cathe  99.8 1.6E-19 3.6E-24  104.1   5.0   57   34-90      1-57  (57)
  4 PTZ00021 falcipain-2; Provisio  99.7   3E-17 6.6E-22  129.4   8.9   67   30-96    165-231 (489)
  5 PTZ00200 cysteine proteinase;   99.7 9.9E-17 2.1E-21  125.5  10.5   69   26-96    118-186 (448)
  6 KOG1542|consensus               99.6 8.8E-15 1.9E-19  110.6   7.4   72   29-100    66-137 (372)
  7 KOG1543|consensus               98.8 2.5E-08 5.4E-13   75.6   8.0   60   38-97     30-89  (325)
  8 PF08127 Propeptide_C1:  Peptid  96.4  0.0041 8.9E-08   33.7   2.8   37   63-101     4-40  (41)
  9 PF07172 GRP:  Glycine rich pro  51.3     9.6 0.00021   24.1   1.3   13    1-13      1-13  (95)
 10 TIGR02792 PCA_ligA protocatech  49.3      17 0.00036   24.0   2.2   19   82-100    93-113 (117)
 11 PF13986 DUF4224:  Domain of un  45.2      19 0.00041   19.7   1.7   15   86-100     3-17  (47)
 12 PRK13377 protocatechuate 4,5-d  43.2      23  0.0005   23.7   2.2   19   82-100    99-119 (129)
 13 PF05443 ROS_MUCR:  ROS/MUCR tr  38.1      21 0.00045   24.0   1.4   14   85-98     94-107 (132)
 14 cd07924 PCA_45_Doxase_A The A   35.9      37  0.0008   22.5   2.3   19   82-100    96-116 (121)
 15 COG1678 Putative transcription  32.6     8.1 0.00018   27.7  -1.3   42   57-100    28-70  (194)
 16 PF03032 Brevenin:  Brevenin/es  29.7      37  0.0008   18.7   1.3   15    8-22      7-21  (46)
 17 PRK12702 mannosyl-3-phosphogly  29.5      32  0.0007   26.3   1.4   24   76-99    114-137 (302)
 18 PF02209 VHP:  Villin headpiece  28.6      38 0.00082   17.6   1.1   13   86-98      2-14  (36)
 19 PF03047 ComC:  COMC family;  I  27.3      21 0.00045   18.2   0.0   20   78-97      6-25  (32)
 20 COG4957 Predicted transcriptio  27.0      51  0.0011   22.5   1.8   14   85-98     98-111 (148)
 21 PF08194 DIM:  DIM protein;  In  26.6      91   0.002   16.3   2.4   15    6-20      5-19  (36)
 22 PF05391 Lsm_interact:  Lsm int  25.8      55  0.0012   15.1   1.3   11   85-95      9-19  (21)
 23 KOG2245|consensus               25.1 2.9E+02  0.0064   23.0   6.1   27   30-57     33-59  (562)
 24 smart00153 VHP Villin headpiec  24.9      56  0.0012   16.9   1.4   12   86-97      2-13  (36)
 25 PF04021 Class_IIIsignal:  Clas  23.9      88  0.0019   15.2   2.0   14    4-17      4-17  (28)
 26 PF14940 TMEM219:  Transmembran  21.2 1.7E+02  0.0036   21.4   3.7   18   26-43     38-55  (223)

No 1  
>PF08246 Inhibitor_I29:  Cathepsin propeptide inhibitor domain (I29);  InterPro: IPR013201 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties.  This entry represents a peptidase inhibitor domain, which belongs to MEROPS peptidase inhibitor family I29. The domain is also found at the N terminus of a variety of peptidase precursors that belong to MEROPS peptidase subfamily C1A; these include cathepsin L, papain, and procaricain (P10056 from SWISSPROT) []. It forms an alpha-helical domain that runs through the substrate-binding site, preventing access. Removal of this region by proteolytic cleavage results in activation of the enzyme. This domain is also found, in one or more copies, in a variety of cysteine peptidase inhibitors such as salarin [].; PDB: 3QT4_A 3QJ3_A 2C0Y_A 2L95_A 1CJL_A 1CS8_A 7PCK_A 1BY8_A 1PCI_A 2O6X_A ....
Probab=99.87  E-value=6.3e-22  Score=115.30  Aligned_cols=58  Identities=45%  Similarity=0.782  Sum_probs=51.6

Q ss_pred             HHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcCCCcceeeecccCCCCCHHHH
Q psy18108         34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEM   91 (102)
Q Consensus        34 f~~f~~~y~K~Y~~~~e~~~R~~~F~~N~~~I~~hN~~~~~s~~lglN~faD~T~eEf   91 (102)
                      |++|+++|+|.|.+.+|...|+.+|++|++.|.+||..+..+|++|+|+|||||++||
T Consensus         1 F~~~~~~~~k~Y~~~~e~~~R~~~F~~N~~~I~~~N~~~~~~~~~~~N~fsD~t~eEf   58 (58)
T PF08246_consen    1 FEQFKKKYGKSYKSAEEEARRFAIFKENLRRIEEHNANGNNTYKLGLNQFSDMTPEEF   58 (58)
T ss_dssp             HHHHHHHCT---SSHHHHHHHHHHHHHHHHHHHHHHHTTSSSEEE-SSTTTTSSHHHH
T ss_pred             CHHHHHHcCCCCCCHHHHHHHHHHHHHHHHHHHHHhcCCCCCeEEeCccccCcChhhC
Confidence            7899999999999999999999999999999999997777789999999999999997


No 2  
>PTZ00203 cathepsin L protease; Provisional
Probab=99.80  E-value=6.4e-19  Score=134.09  Aligned_cols=68  Identities=31%  Similarity=0.531  Sum_probs=61.9

Q ss_pred             ChHHHHHHHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcCCCcceeeecccCCCCCHHHHHHhh
Q psy18108         27 NPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSRL   95 (102)
Q Consensus        27 ~~~~~~~f~~f~~~y~K~Y~~~~e~~~R~~~F~~N~~~I~~hN~~~~~s~~lglN~faD~T~eEf~~~~   95 (102)
                      ..+++..|++|+.+|+|.|.+..|+..|+.+|++|+++|++||+++. +|++|+|+|+|||+|||.+++
T Consensus        31 ~~~~~~~f~~~~~~~~K~Y~~~~E~~~R~~iF~~N~~~I~~~N~~~~-~~~lg~N~FaDlT~eEf~~~~   98 (348)
T PTZ00203         31 GTPAAALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQARNP-HARFGITKFFDLSEAEFAARY   98 (348)
T ss_pred             ccHHHHHHHHHHHHhCCCCCChHHHHHHHHHHHHHHHHHHHHhccCC-CeEEeccccccCCHHHHHHHh
Confidence            33578899999999999999988888999999999999999998754 799999999999999999875


No 3  
>smart00848 Inhibitor_I29 Cathepsin propeptide inhibitor domain (I29). This domain is found at the N-terminus of some C1 peptidases such as Cathepsin L where it acts as a propeptide. There are also a number of proteins that are composed solely of multiple copies of this domain such as the peptidase inhibitor salarin. This family is classified as I29 by MEROPS. Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a s
Probab=99.79  E-value=1.6e-19  Score=104.06  Aligned_cols=57  Identities=44%  Similarity=0.797  Sum_probs=53.8

Q ss_pred             HHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcCCCcceeeecccCCCCCHHH
Q psy18108         34 FEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREE   90 (102)
Q Consensus        34 f~~f~~~y~K~Y~~~~e~~~R~~~F~~N~~~I~~hN~~~~~s~~lglN~faD~T~eE   90 (102)
                      |+.|+.+|+|.|.+.+|...|+.+|++|++.|..||..+..+|++|+|+|||||++|
T Consensus         1 f~~~~~~~~k~y~~~~e~~~r~~~f~~n~~~i~~~N~~~~~~~~~~~N~fsDlt~eE   57 (57)
T smart00848        1 FEQWKKKYGKSYSSEEEELRRFEIFKENLKFIEEHNKKNDHSYTLGLNQFADLTNEE   57 (57)
T ss_pred             ChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHHHhcCCCCeEecCcccccCCCCC
Confidence            688999999999999999999999999999999999887778999999999999876


No 4  
>PTZ00021 falcipain-2; Provisional
Probab=99.71  E-value=3e-17  Score=129.36  Aligned_cols=67  Identities=31%  Similarity=0.504  Sum_probs=62.0

Q ss_pred             HHHHHHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcCCCcceeeecccCCCCCHHHHHHhhC
Q psy18108         30 HLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSRLG   96 (102)
Q Consensus        30 ~~~~f~~f~~~y~K~Y~~~~e~~~R~~~F~~N~~~I~~hN~~~~~s~~lglN~faD~T~eEf~~~~g   96 (102)
                      ....|++|+.+|+|.|.+.+|+..|+.+|++|+++|++||+.+..+|++|+|+|+|||+|||++++.
T Consensus       165 ~~~~F~~wk~ky~K~Y~~~eE~~~R~~iF~~Nl~~Ie~hN~~~~~ty~lgiNqFsDlT~EEF~~~~l  231 (489)
T PTZ00021        165 NVNSFYLFIKEHGKKYQTPDEMQQRYLSFVENLAKINAHNNKENVLYKKGMNRFGDLSFEEFKKKYL  231 (489)
T ss_pred             HHHHHHHHHHHhCCcCCCHHHHHHHHHHHHHHHHHHHHhhccCCCCEEEeccccccCCHHHHHHHhc
Confidence            5678999999999999998899999999999999999999876668999999999999999998764


No 5  
>PTZ00200 cysteine proteinase; Provisional
Probab=99.70  E-value=9.9e-17  Score=125.51  Aligned_cols=69  Identities=26%  Similarity=0.375  Sum_probs=62.0

Q ss_pred             CChHHHHHHHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcCCCcceeeecccCCCCCHHHHHHhhC
Q psy18108         26 ENPEHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSRLG   96 (102)
Q Consensus        26 ~~~~~~~~f~~f~~~y~K~Y~~~~e~~~R~~~F~~N~~~I~~hN~~~~~s~~lglN~faD~T~eEf~~~~g   96 (102)
                      .+.++...|++|+.+|+|.|.+..|+..|+.+|++|++.|++||..  .+|++|+|+|||||+|||.+++.
T Consensus       118 ~e~e~~~~F~~f~~ky~K~Y~~~~E~~~R~~iF~~Nl~~I~~hN~~--~~y~lgiN~FsDlT~eEF~~~~~  186 (448)
T PTZ00200        118 LEFEVYLEFEEFNKKYNRKHATHAERLNRFLTFRNNYLEVKSHKGD--EPYSKEINKFSDLTEEEFRKLFP  186 (448)
T ss_pred             chHHHHHHHHHHHHHhCCcCCCHHHHHHHHHHHHHHHHHHHHhcCc--CCeEEeccccccCCHHHHHHHhc
Confidence            3445778899999999999999889999999999999999999963  47999999999999999998763


No 6  
>KOG1542|consensus
Probab=99.56  E-value=8.8e-15  Score=110.62  Aligned_cols=72  Identities=35%  Similarity=0.528  Sum_probs=65.9

Q ss_pred             HHHHHHHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcCCCcceeeecccCCCCCHHHHHHhhCCCCC
Q psy18108         29 EHLKQFEKFIRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSRLGLNLS  100 (102)
Q Consensus        29 ~~~~~f~~f~~~y~K~Y~~~~e~~~R~~~F~~N~~~I~~hN~~~~~s~~lglN~faD~T~eEf~~~~g~~~~  100 (102)
                      .....|..|+.+|+|+|.+.+|..+|+.+|+.|+..+++++.....|..+|+|+|||||+|||+++++..++
T Consensus        66 ~~~~~F~~F~~kf~r~Y~s~eE~~~Rl~iF~~N~~~a~~~q~~d~gsA~yGvtqFSDlT~eEFkk~~l~~~~  137 (372)
T KOG1542|consen   66 GLEDSFKLFTIKFGRSYASREEHAHRLSIFKHNLLRAERLQENDPGSAEYGVTQFSDLTEEEFKKIYLGVKR  137 (372)
T ss_pred             chHHHHHHHHHhcCcccCcHHHHHHHHHHHHHHHHHHHHhhhcCccccccCccchhhcCHHHHHHHhhcccc
Confidence            347889999999999999999999999999999999999998877789999999999999999999875543


No 7  
>KOG1543|consensus
Probab=98.80  E-value=2.5e-08  Score=75.57  Aligned_cols=60  Identities=33%  Similarity=0.416  Sum_probs=52.0

Q ss_pred             HHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhcCCCcceeeecccCCCCCHHHHHHhhCC
Q psy18108         38 IRDFSKSYPTKEEVAKRFAVFEDNLKLIEDLNKGEHGTATYGINHLSDLTREEMKSRLGL   97 (102)
Q Consensus        38 ~~~y~K~Y~~~~e~~~R~~~F~~N~~~I~~hN~~~~~s~~lglN~faD~T~eEf~~~~g~   97 (102)
                      +.++.+.|.+..+...|+.+|.+|++.|..||.....+|++++|+|+|+|.+|+++.+..
T Consensus        30 ~~~~~~~y~~~~~~~~r~~~f~~n~~~~~~~n~~~~~~~~~g~n~~~d~~~ee~~~~~~~   89 (325)
T KOG1543|consen   30 LVKFLKRYEDRVEKKARRAIFKENLQKIESHNLKYVLSFLMGVNQFADLTTEEFKRKKTG   89 (325)
T ss_pred             hhhhccccccHHHHHHHHHHHHHHHHHHHhhhhhhceeeeeccccccccchHHHHHhhcc
Confidence            666666676667888899999999999999999756689999999999999999998754


No 8  
>PF08127 Propeptide_C1:  Peptidase family C1 propeptide;  InterPro: IPR012599 This domain is found at the N-terminal of cathepsin B and cathepsin B-like peptidases that belong to MEROPS peptidase subfamily C1A. Cathepsin B are lysosomal cysteine proteinases belonging to the papain superfamily and are unique in their ability to act as both an endo- and an exopeptidases. They are synthesized as inactive zymogens. Activation of the peptidases occurs with the removal of the propeptide [, ]. ; GO: 0004197 cysteine-type endopeptidase activity, 0050790 regulation of catalytic activity; PDB: 1MIR_A 1PBH_A 2PBH_A 3PBH_A.
Probab=96.41  E-value=0.0041  Score=33.72  Aligned_cols=37  Identities=24%  Similarity=0.281  Sum_probs=24.0

Q ss_pred             HHHHHhhcCCCcceeeecccCCCCCHHHHHHhhCCCCCC
Q psy18108         63 KLIEDLNKGEHGTATYGINHLSDLTREEMKSRLGLNLSK  101 (102)
Q Consensus        63 ~~I~~hN~~~~~s~~lglN~faD~T~eEf~~~~g~~~~~  101 (102)
                      ++|+..|.. ..+++-|.| |.+.|.+.+++++|..+-+
T Consensus         4 e~I~~IN~~-~~tWkAG~N-F~~~~~~~ik~LlGv~~~~   40 (41)
T PF08127_consen    4 EFIDYINSK-NTTWKAGRN-FENTSIEYIKRLLGVLPDP   40 (41)
T ss_dssp             HHHHHHHHC-T-SEEE-----SSB-HHHHHHCS-B-TTS
T ss_pred             HHHHHHHcC-CCcccCCCC-CCCCCHHHHHHHcCCCCCC
Confidence            568888887 558999999 8999999999999976544


No 9  
>PF07172 GRP:  Glycine rich protein family;  InterPro: IPR010800 This family consists of glycine rich proteins. Some of them may be involved in resistance to environmental stress [].
Probab=51.30  E-value=9.6  Score=24.08  Aligned_cols=13  Identities=38%  Similarity=0.194  Sum_probs=6.5

Q ss_pred             CcchhhHHHHHHH
Q psy18108          1 MAEDASAEATLAL   13 (102)
Q Consensus         1 ~~~~~~~~~~~~~   13 (102)
                      ||..+-++++|+|
T Consensus         1 MaSK~~llL~l~L   13 (95)
T PF07172_consen    1 MASKAFLLLGLLL   13 (95)
T ss_pred             CchhHHHHHHHHH
Confidence            6655544444443


No 10 
>TIGR02792 PCA_ligA protocatechuate 4,5-dioxygenase, alpha subunit. Protocatechuate (PCA) 4,5-dioxygenase is the first enzyme in the PCA 4,5-cleavage pathway that is an alternative to PCA 3,4-cleavage and PCA 2,3 cleavage pathways. PCA is an intermediate in the breakdown of lignin (hence the gene symbol ligA) and other compounds. Members of this family are the alpha chain of PCA 4,5-dioxygenase, or the equivalent domain of a fusion protein.
Probab=49.34  E-value=17  Score=24.01  Aligned_cols=19  Identities=21%  Similarity=0.382  Sum_probs=14.7

Q ss_pred             cCCCCCHHHHHHhh--CCCCC
Q psy18108         82 HLSDLTREEMKSRL--GLNLS  100 (102)
Q Consensus        82 ~faD~T~eEf~~~~--g~~~~  100 (102)
                      .+..||.|||++++  |.+.|
T Consensus        93 ~mtG~t~eef~~mm~~ggr~~  113 (117)
T TIGR02792        93 SMTGMTEEEYRQMMIGGGRSP  113 (117)
T ss_pred             HhcCCCHHHHHHHHHhCCCCC
Confidence            46789999999987  45544


No 11 
>PF13986 DUF4224:  Domain of unknown function (DUF4224)
Probab=45.16  E-value=19  Score=19.72  Aligned_cols=15  Identities=33%  Similarity=0.445  Sum_probs=13.1

Q ss_pred             CCHHHHHHhhCCCCC
Q psy18108         86 LTREEMKSRLGLNLS  100 (102)
Q Consensus        86 ~T~eEf~~~~g~~~~  100 (102)
                      ||++|...+-|+++|
T Consensus         3 LT~~El~elTG~k~~   17 (47)
T PF13986_consen    3 LTDEELQELTGYKRP   17 (47)
T ss_pred             CCHHHHHHHHCCCCH
Confidence            799999999998875


No 12 
>PRK13377 protocatechuate 4,5-dioxygenase subunit alpha; Provisional
Probab=43.15  E-value=23  Score=23.74  Aligned_cols=19  Identities=21%  Similarity=0.382  Sum_probs=14.6

Q ss_pred             cCCCCCHHHHHHhh--CCCCC
Q psy18108         82 HLSDLTREEMKSRL--GLNLS  100 (102)
Q Consensus        82 ~faD~T~eEf~~~~--g~~~~  100 (102)
                      .+..||.|||++++  |.|.|
T Consensus        99 ~mtG~t~eef~~mm~~ggr~~  119 (129)
T PRK13377         99 SMTGMTEEEYRQMMLGGGRSP  119 (129)
T ss_pred             HhcCCCHHHHHHHHHhCCCCC
Confidence            46789999999987  45544


No 13 
>PF05443 ROS_MUCR:  ROS/MUCR transcriptional regulator protein;  InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=38.09  E-value=21  Score=24.00  Aligned_cols=14  Identities=43%  Similarity=0.672  Sum_probs=10.2

Q ss_pred             CCCHHHHHHhhCCC
Q psy18108         85 DLTREEMKSRLGLN   98 (102)
Q Consensus        85 D~T~eEf~~~~g~~   98 (102)
                      +||++||++.+|+.
T Consensus        94 gltp~eYR~kwGlp  107 (132)
T PF05443_consen   94 GLTPEEYRAKWGLP  107 (132)
T ss_dssp             -S-HHHHHHHTT-G
T ss_pred             CCCHHHHHHHhCcC
Confidence            79999999999975


No 14 
>cd07924 PCA_45_Doxase_A The A subunit of Protocatechuate 4,5-dioxygenase (LigAB) is the smaller, non-catalytic subunit. The A subunit is the non-catalytic subunit of Protocatechuate (PCA) 4,5-dioxygenase (LigAB), which is composed of A and B subunits that form a tetramer. PCA 4,5-dioxygenase catalyzes the oxidization and subsequent ring-opening of PCA (or 3,4-dihydroxybenzoic acid), which is an intermediate in the breakdown of lignin and other compounds. PCA 4,5-dioxygenase is one of the aromatic ring opening dioxygenases which  play key roles in the degradation of aromatic compounds. As a member of the Class III extradiol dioxygenase family, LigAB uses a non-heme Fe(II) to cleave aromatic rings between a hydroxylated carbon and an adjacent non-hydroxylated carbon.
Probab=35.85  E-value=37  Score=22.54  Aligned_cols=19  Identities=16%  Similarity=0.328  Sum_probs=14.7

Q ss_pred             cCCCCCHHHHHHhh--CCCCC
Q psy18108         82 HLSDLTREEMKSRL--GLNLS  100 (102)
Q Consensus        82 ~faD~T~eEf~~~~--g~~~~  100 (102)
                      .+..+|.|||++++  |.|.|
T Consensus        96 ~mtG~s~eef~~mm~~ggr~~  116 (121)
T cd07924          96 SMTGMSMEEYRQMMVDGGRSP  116 (121)
T ss_pred             HhcCCCHHHHHHHHHhCCCCC
Confidence            57789999999987  45544


No 15 
>COG1678 Putative transcriptional regulator [Transcription]
Probab=32.58  E-value=8.1  Score=27.66  Aligned_cols=42  Identities=26%  Similarity=0.356  Sum_probs=34.9

Q ss_pred             HHHHHHHHHHHhhcCCCcceeeecccCCC-CCHHHHHHhhCCCCC
Q psy18108         57 VFEDNLKLIEDLNKGEHGTATYGINHLSD-LTREEMKSRLGLNLS  100 (102)
Q Consensus        57 ~F~~N~~~I~~hN~~~~~s~~lglN~faD-~T~eEf~~~~g~~~~  100 (102)
                      +|.+.+-+|-+||.+|  ++-+-+|+-++ +|-+|+-...+...+
T Consensus        28 ~F~RSVvyiceH~e~G--amGliiNrp~~~~~~~~l~~~~~~~~~   70 (194)
T COG1678          28 IFARSVVYICEHNENG--AMGLIINRPSETLTLAELLEKLKLEIK   70 (194)
T ss_pred             ccceeEEEEEEeCCCC--cEEEEecCCccccCHHHHHHHhccccc
Confidence            5777888899999877  58899999999 999999888775543


No 16 
>PF03032 Brevenin:  Brevenin/esculentin/gaegurin/rugosin family;  InterPro: IPR004275 In addition to the highly specific cell-mediated immune system, vertebrates possess an efficient host-defence mechanism against invading microorganisms which involves the synthesis of highly potent antimicrobial peptides with a large spectrum of activity. This entry represents a number of these defence peptides secreted from the skin of amphibians, including the opiate-like dermorphins and deltorphins, and the antimicrobial dermoseptins and temporins.; GO: 0006952 defense response, 0042742 defense response to bacterium, 0005576 extracellular region
Probab=29.68  E-value=37  Score=18.65  Aligned_cols=15  Identities=13%  Similarity=0.056  Sum_probs=9.9

Q ss_pred             HHHHHHHHHhhcccc
Q psy18108          8 EATLALFGQMKSNNE   22 (102)
Q Consensus         8 ~~~~~~~~~~~~~~~   22 (102)
                      ++.++.||++|.+.+
T Consensus         7 llLlfflG~ISlSlC   21 (46)
T PF03032_consen    7 LLLLFFLGTISLSLC   21 (46)
T ss_pred             HHHHHHHHHcccchH
Confidence            456677787766655


No 17 
>PRK12702 mannosyl-3-phosphoglycerate phosphatase; Reviewed
Probab=29.51  E-value=32  Score=26.26  Aligned_cols=24  Identities=21%  Similarity=0.358  Sum_probs=19.7

Q ss_pred             eeeecccCCCCCHHHHHHhhCCCC
Q psy18108         76 ATYGINHLSDLTREEMKSRLGLNL   99 (102)
Q Consensus        76 ~~lglN~faD~T~eEf~~~~g~~~   99 (102)
                      +...+..|+|||.+|+.+..|+..
T Consensus       114 ~~~~f~gF~d~t~~ei~~~TGL~~  137 (302)
T PRK12702        114 SHLDLIGFGDWTASELAAATGIPL  137 (302)
T ss_pred             hCCCceehhhCCHHHHHHHhCcCH
Confidence            445678999999999999988753


No 18 
>PF02209 VHP:  Villin headpiece domain;  InterPro: IPR003128 Villin is an F-actin bundling protein involved in the maintenance of the microvilli of the absorptive epithelia. The villin-type "headpiece" domain is a modular motif found at the extreme C terminus of larger "core" domains in over 25 cytoskeletal proteins in plants and animals, often in assocation with the Gelsolin repeat. Although the headpiece is classified as an F-actin-binding domain, it has been shown that not all headpiece domains are intrinsically F-actin-binding motifs, surface charge distribution may be an important element for F-actin recognition []. An autonomously folding, 35 residue, thermostable subdomain (HP36) of the full-length 76 amino acid residue villin headpiece, is the smallest known example of a cooperatively folded domain of a naturally occurring protein. The structure of HP36, as determined by NMR spectroscopy, consists of three short helices surrounding a tightly packed hydrophobic core []. ; GO: 0003779 actin binding, 0007010 cytoskeleton organization; PDB: 1ZV6_A 1QZP_A 1UND_A 2PPZ_A 3TJW_B 1YU8_X 2JM0_A 1WY4_A 3MYC_A 1YU5_X ....
Probab=28.57  E-value=38  Score=17.61  Aligned_cols=13  Identities=23%  Similarity=0.634  Sum_probs=8.3

Q ss_pred             CCHHHHHHhhCCC
Q psy18108         86 LTREEMKSRLGLN   98 (102)
Q Consensus        86 ~T~eEf~~~~g~~   98 (102)
                      +|++||.+.+|..
T Consensus         2 Lsd~dF~~vFgm~   14 (36)
T PF02209_consen    2 LSDEDFEKVFGMS   14 (36)
T ss_dssp             S-HHHHHHHHSS-
T ss_pred             cCHHHHHHHHCCC
Confidence            5778888877754


No 19 
>PF03047 ComC:  COMC family;  InterPro: IPR004288 Competence is the ability of a cell to take up exogenous DNA from its environment, resulting in transformation. It is widespread among bacteria and is probably an important mechanism for the horizontal transfer of genes. Cells that take up DNA inevitably acquire the nucleotides the DNA consists of, and, because nucleotides are needed for DNA and RNA synthesis and are expensive to synthesise, these may make a significant contribution to the cell's energy budget []. The lateral gene transfer caused by competence also contributes to the genetic diversity that makes evolution possible.  DNA usually becomes available by the death and lysis of other cells. Competent bacteria use components of extracellular filaments called type 4 pili to create pores in their membranes and pull DNA through the pores into the cytoplasm. This process, including the development of competence and the expression of the uptake machinery, is regulated in response to cell-cell signalling and/or nutritional conditions []. This family consists of streptococcal competence stimulating peptide precursors, which are generally up to 50 amino acid residues long. In all the members of this family, the leader sequence is cleaved after two conserved glycine residues; thus the leader sequence is of the double- glycine type []. Competence stimulating peptides (CSP) are small (less than 25 amino acid residues) cationic peptides. The N-terminal amino acid residue is negatively charged, either glutamate or aspartate. The C-terminal end is positively charged. The third residue is also positively charged: a highly conserved arginine []. Some COMC proteins and their precursors (not included in this family) do not fully follow the above description.  Functionally, CSP act as pheromones, stimulating competence for genetic transformation in streptococci. In streptococci, the (CSP mediated) competence response requires exponential cell growth at a critical density, a relatively simple requirement when compared to the stationary-phase requirement of Haemophilus, or the late-logarithmic- phase of Bacillus []. All bacteria induced to competence by a particular CSP are said to belong to the same pherotype, because each CSP is recognised by a specific receptor (the signalling domain of a histidine kinase ComD). Pherotypes are not necessarily species-specific. In addition, an organism may change pherotype. There are two possible mechanisms for pherotype switching: horizontal gene transfer, and accumulation of point mutations. The biological significance of pherotypes and pherotype switching is not definitively determined. Pherotype switching occurs frequently enough in naturally competent streptococci to suggest that it may be an important contributor to genetic exchange between different bacterial species []. This entry also includes proteins that form bacteriocin-like propetides with a glycine-glycine cleavage site. The bacteriocin is initially formed as a pre-propeptide and upon cleavage at the glycine-glycine cleavage site, a leader peptide and the propeptide would be formed. The propeptide then undergoes posttranslational modification before becoming functional [].; GO: 0005186 pheromone activity; PDB: 2I2J_A 2I2H_A 2A1C_A.
Probab=27.30  E-value=21  Score=18.16  Aligned_cols=20  Identities=25%  Similarity=0.391  Sum_probs=0.0

Q ss_pred             eecccCCCCCHHHHHHhhCC
Q psy18108         78 YGINHLSDLTREEMKSRLGL   97 (102)
Q Consensus        78 lglN~faD~T~eEf~~~~g~   97 (102)
                      ..+++|.-+|++|....-|.
T Consensus         6 ~~L~~F~~lt~~eL~~I~GG   25 (32)
T PF03047_consen    6 KTLEQFEELTEEELQEIQGG   25 (32)
T ss_dssp             --------------------
T ss_pred             hhHHHHhcCCHHHHhhccCC
Confidence            36789999999999888764


No 20 
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=27.00  E-value=51  Score=22.50  Aligned_cols=14  Identities=36%  Similarity=0.650  Sum_probs=12.5

Q ss_pred             CCCHHHHHHhhCCC
Q psy18108         85 DLTREEMKSRLGLN   98 (102)
Q Consensus        85 D~T~eEf~~~~g~~   98 (102)
                      +||++||++..|+.
T Consensus        98 gmTPd~YR~KW~LP  111 (148)
T COG4957          98 GLTPDEYRAKWGLP  111 (148)
T ss_pred             CCCHHHHHHhcCCC
Confidence            79999999998875


No 21 
>PF08194 DIM:  DIM protein;  InterPro: IPR013172 Drosophila immune-induced molecules (DIMs) are short proteins induced during the immune response of Drosophila []. This entry includes DIMs 1 to 4 and DIM23.
Probab=26.61  E-value=91  Score=16.28  Aligned_cols=15  Identities=20%  Similarity=0.109  Sum_probs=8.4

Q ss_pred             hHHHHHHHHHHhhcc
Q psy18108          6 SAEATLALFGQMKSN   20 (102)
Q Consensus         6 ~~~~~~~~~~~~~~~   20 (102)
                      |+-+++++++++++.
T Consensus         5 ~~a~~l~lLal~~a~   19 (36)
T PF08194_consen    5 SLAFALLLLALAAAV   19 (36)
T ss_pred             HHHHHHHHHHHHhcc
Confidence            444566666655544


No 22 
>PF05391 Lsm_interact:  Lsm interaction motif;  InterPro: IPR008669 This short motif is found at the C terminus of Prp24 proteins and probably interacts with the Lsm proteins to promote U4/U6 formation [].
Probab=25.83  E-value=55  Score=15.07  Aligned_cols=11  Identities=0%  Similarity=0.277  Sum_probs=8.3

Q ss_pred             CCCHHHHHHhh
Q psy18108         85 DLTREEMKSRL   95 (102)
Q Consensus        85 D~T~eEf~~~~   95 (102)
                      -+++++|++++
T Consensus         9 p~SNddFrkmf   19 (21)
T PF05391_consen    9 PKSNDDFRKMF   19 (21)
T ss_pred             ccchHHHHHHH
Confidence            46788888875


No 23 
>KOG2245|consensus
Probab=25.09  E-value=2.9e+02  Score=22.96  Aligned_cols=27  Identities=22%  Similarity=0.317  Sum_probs=19.2

Q ss_pred             HHHHHHHHHHHhCCCCCCHHHHHHHHHH
Q psy18108         30 HLKQFEKFIRDFSKSYPTKEEVAKRFAV   57 (102)
Q Consensus        30 ~~~~f~~f~~~y~K~Y~~~~e~~~R~~~   57 (102)
                      .-..-.+.++.+| .|.+.+|..+|..+
T Consensus        33 lt~~L~~~L~~~g-~fEs~eEt~~R~~V   59 (562)
T KOG2245|consen   33 LTQELIKTLKNEG-LFESKEETQRREEV   59 (562)
T ss_pred             HHHHHHHHHHhcC-CCCCHHHHHHHHHH
Confidence            3445677777776 78888888888654


No 24 
>smart00153 VHP Villin headpiece domain.
Probab=24.88  E-value=56  Score=16.90  Aligned_cols=12  Identities=25%  Similarity=0.664  Sum_probs=8.3

Q ss_pred             CCHHHHHHhhCC
Q psy18108         86 LTREEMKSRLGL   97 (102)
Q Consensus        86 ~T~eEf~~~~g~   97 (102)
                      ++++||...+|.
T Consensus         2 LsdeeF~~vfgm   13 (36)
T smart00153        2 LSDEDFEEVFGM   13 (36)
T ss_pred             CCHHHHHHHHCC
Confidence            567777777764


No 25 
>PF04021 Class_IIIsignal:  Class III signal peptide;  InterPro: IPR007166 This entry represents an amino terminal motif QXSXEXXXL thought to be part of a class III signal sequence for a family of archaeal proteins. The Q residue is the +1 residue of the signal peptidase cleavage site []. Two proteins containing this motif are cleaved by a type IV pilin-like signal peptidase. 
Probab=23.94  E-value=88  Score=15.19  Aligned_cols=14  Identities=14%  Similarity=0.138  Sum_probs=10.1

Q ss_pred             hhhHHHHHHHHHHh
Q psy18108          4 DASAEATLALFGQM   17 (102)
Q Consensus         4 ~~~~~~~~~~~~~~   17 (102)
                      ..|+++.+++++++
T Consensus         4 Q~SlE~~ili~~vl   17 (28)
T PF04021_consen    4 QISLEFIILIAAVL   17 (28)
T ss_pred             HHhHHHHHHHHHHH
Confidence            35788888777765


No 26 
>PF14940 TMEM219:  Transmembrane 219
Probab=21.22  E-value=1.7e+02  Score=21.40  Aligned_cols=18  Identities=22%  Similarity=0.586  Sum_probs=14.8

Q ss_pred             CChHHHHHHHHHHHHhCC
Q psy18108         26 ENPEHLKQFEKFIRDFSK   43 (102)
Q Consensus        26 ~~~~~~~~f~~f~~~y~K   43 (102)
                      .++++.++|..|....++
T Consensus        38 ~nPDi~~DWN~fL~~ls~   55 (223)
T PF14940_consen   38 KNPDIPQDWNTFLLSLSQ   55 (223)
T ss_pred             CcccchhhHHHHHHhhcC
Confidence            344688999999999987


Done!