Query         psy9049
Match_columns 64
No_of_seqs    52 out of 54
Neff          3.8 
Searched_HMMs 46136
Date          Fri Aug 16 17:40:07 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy9049.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/9049hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG2556|consensus               98.5 1.6E-08 3.6E-13   81.1  -1.4   52    2-53     93-147 (666)
  2 PTZ00257 Glycoprotein GP63 (le  97.0 0.00043 9.4E-09   56.2   2.3   40    3-42    144-183 (622)
  3 KOG2556|consensus               96.9 0.00017 3.8E-09   58.4  -0.8   34   31-64    100-133 (666)
  4 PTZ00337 surface protease GP63  96.8 0.00064 1.4E-08   54.3   1.7   40    3-42    120-159 (567)
  5 PF01457 Peptidase_M8:  Leishma  94.1   0.021 4.6E-07   44.4   1.0   40    3-42     98-137 (521)
  6 PTZ00337 surface protease GP63  84.0    0.54 1.2E-05   38.0   1.2   32   32-63    127-158 (567)
  7 PTZ00257 Glycoprotein GP63 (le  79.9     1.2 2.6E-05   36.7   1.8   32   32-63    151-182 (622)
  8 PF01457 Peptidase_M8:  Leishma  73.6    0.18 3.9E-06   39.3  -4.2   59    4-62     62-135 (521)
  9 PF13424 TPR_12:  Tetratricopep  72.0     8.8 0.00019   21.2   3.6   23   15-37     18-40  (78)
 10 PF00515 TPR_1:  Tetratricopept  66.8     5.6 0.00012   19.0   1.8   18   16-33     15-32  (34)
 11 PF10265 DUF2217:  Uncharacteri  66.5     6.8 0.00015   31.8   3.1   36    3-38    128-163 (514)
 12 PF13176 TPR_7:  Tetratricopept  65.3       7 0.00015   19.6   2.0   16   18-33     15-30  (36)
 13 PF03538 VRP1:  Salmonella viru  51.7      17 0.00036   27.7   2.8   48    2-50    273-320 (325)
 14 KOG2997|consensus               51.0      17 0.00037   28.6   2.9   50    3-55      1-50  (366)
 15 COG4492 PheB ACT domain-contai  47.8     5.8 0.00012   27.8  -0.1   30    2-34      2-31  (150)
 16 PF13414 TPR_11:  TPR repeat; P  45.6      22 0.00049   19.0   2.1   33   12-44     13-45  (69)
 17 PF13374 TPR_10:  Tetratricopep  44.3      25 0.00055   16.7   2.0   19   17-35     17-35  (42)
 18 PF07719 TPR_2:  Tetratricopept  40.0      29 0.00062   16.0   1.8   18   14-31     13-30  (34)
 19 PF13181 TPR_8:  Tetratricopept  36.1      42 0.00091   15.6   2.0   16   18-33     17-32  (34)
 20 PF04761 Phage_Treg:  Lactococc  32.8      42 0.00092   20.0   2.0   16    5-20     26-41  (57)
 21 PRK04435 hypothetical protein;  32.5      13 0.00028   24.6  -0.3   25    8-35      5-29  (147)
 22 cd03074 PDI_b'_Calsequestrin_C  32.3      22 0.00047   24.2   0.8   31   17-47     64-97  (120)
 23 PF00956 NAP:  Nucleosome assem  31.4      31 0.00067   24.2   1.5   18    9-26    224-241 (244)
 24 PF02071 NSF:  Aromatic-di-Alan  31.3      26 0.00057   14.9   0.7    8   21-28      1-8   (12)
 25 PHA00687 hypothetical protein   31.1      28  0.0006   20.6   1.0   21   10-30      4-24  (56)
 26 KOG1173|consensus               30.4      72  0.0016   26.8   3.5   40   16-55    428-486 (611)
 27 PF14559 TPR_19:  Tetratricopep  29.2      79  0.0017   16.6   2.6   30   14-43      3-32  (68)
 28 cd06588 PhnB_like Escherichia   29.2      38 0.00082   20.6   1.4   13   18-30     11-23  (128)
 29 KOG1717|consensus               28.5      33 0.00072   26.8   1.3   18   15-32    234-251 (343)
 30 PF13013 F-box-like_2:  F-box-l  27.6      76  0.0016   20.6   2.7   30    7-36     66-95  (109)
 31 PTZ00008 (NAP-S) nucleosome as  27.4      33 0.00071   23.9   1.0   20    7-26    158-177 (185)
 32 PTZ00007 (NAP-L) nucleosome as  25.4      33 0.00072   26.3   0.8   18    9-26    259-276 (337)
 33 PF09533 DUF2380:  Predicted li  23.3      34 0.00074   24.7   0.5   38    4-45     92-133 (188)
 34 PF10579 Rapsyn_N:  Rapsyn N-te  23.0 1.1E+02  0.0024   19.2   2.7   32   19-52     23-54  (80)
 35 PF01216 Calsequestrin:  Calseq  22.4      58  0.0013   25.9   1.6   15   17-31    312-326 (383)
 36 smart00671 SEL1 Sel1-like repe  21.9      74  0.0016   14.8   1.5   11   19-29     22-32  (36)
 37 PF10373 EST1_DNA_bind:  Est1 D  20.7 1.2E+02  0.0026   20.3   2.7   36   21-56      1-48  (278)
 38 PF05794 Tcp11:  T-complex prot  20.6 1.9E+02  0.0041   21.6   4.0   45   11-55    163-207 (441)

No 1  
>KOG2556|consensus
Probab=98.47  E-value=1.6e-08  Score=81.09  Aligned_cols=52  Identities=17%  Similarity=0.077  Sum_probs=46.8

Q ss_pred             cccchHHHHHHHhhcchhhHHHHHHHHHhhhhhcccccchhhH---HHHHHHHhh
Q psy9049           2 KVRLDAEKFQLINNTVLPEAVHFWEQALMVRKTEKNTVLPEAV---HFWEQALMV   53 (64)
Q Consensus         2 ~~~l~~ek~~lv~~~llPeav~y~e~~l~VR~~v~~iLLpRac---~y~~~~~~V   53 (64)
                      |+.|.+|+..+|+++|+|||+.|||.+|+||+...+|+|.|.|   .|+.+..|-
T Consensus        93 Ve~l~~e~~n~v~s~l~pea~~y~e~~~~vr~~a~~i~L~R~C~~~~y~~K~g~p  147 (666)
T KOG2556|consen   93 VEKLFHENGNVVLSVLDPEADVYQEIRIYVRYDAVGISLDRDCQRVGYIVKLGEP  147 (666)
T ss_pred             HHHHhHhhchhhhhhcChhhhhcceeeEEEecCCcceeeehhhhhhhhhhhcCCC
Confidence            5789999999999999999999999999999999999999999   566665554


No 2  
>PTZ00257 Glycoprotein GP63 (leishmanolysin); Provisional
Probab=96.99  E-value=0.00043  Score=56.21  Aligned_cols=40  Identities=20%  Similarity=0.287  Sum_probs=37.2

Q ss_pred             ccchHHHHHHHhhcchhhHHHHHHHHHhhhhhcccccchh
Q psy9049           3 VRLDAEKFQLINNTVLPEAVHFWEQALMVRKTEKNTVLPE   42 (64)
Q Consensus         3 ~~l~~ek~~lv~~~llPeav~y~e~~l~VR~~v~~iLLpR   42 (64)
                      |=|++||+++++++|+|+|+.+|.+.|.||++.+++.+|.
T Consensus       144 DILT~eK~~il~~tllP~avk~~~erL~Vrp~~~~i~v~~  183 (622)
T PTZ00257        144 DILTEEKRDILVSYLIPQALQLHVERLRVRQVQGSWRVTG  183 (622)
T ss_pred             cccCHHHHHHHHHHhHHHHHHHHHHhheeecCCCCeEeec
Confidence            5689999999999999999999999999999999998874


No 3  
>KOG2556|consensus
Probab=96.88  E-value=0.00017  Score=58.44  Aligned_cols=34  Identities=26%  Similarity=0.225  Sum_probs=32.1

Q ss_pred             hhhhcccccchhhHHHHHHHHhhhhccceeeecC
Q psy9049          31 VRKTEKNTVLPEAVHFWEQALMVRKTEKVIRLNR   64 (64)
Q Consensus        31 VR~~v~~iLLpRac~y~~~~~~VR~~~~~ilL~R   64 (64)
                      +-..++++++|+|..||+.+++||..+++|+|+|
T Consensus       100 ~~n~v~s~l~pea~~y~e~~~~vr~~a~~i~L~R  133 (666)
T KOG2556|consen  100 NGNVVLSVLDPEADVYQEIRIYVRYDAVGISLDR  133 (666)
T ss_pred             hchhhhhhcChhhhhcceeeEEEecCCcceeeeh
Confidence            4578999999999999999999999999999998


No 4  
>PTZ00337 surface protease GP63; Provisional
Probab=96.78  E-value=0.00064  Score=54.34  Aligned_cols=40  Identities=25%  Similarity=0.430  Sum_probs=37.2

Q ss_pred             ccchHHHHHHHhhcchhhHHHHHHHHHhhhhhcccccchh
Q psy9049           3 VRLDAEKFQLINNTVLPEAVHFWEQALMVRKTEKNTVLPE   42 (64)
Q Consensus         3 ~~l~~ek~~lv~~~llPeav~y~e~~l~VR~~v~~iLLpR   42 (64)
                      |-|++||++++.+.+||.|+.|+++.|.|+++.+++.+|+
T Consensus       120 DILT~eK~~~i~~~~lp~a~~~~~~~L~V~p~~~~i~v~~  159 (567)
T PTZ00337        120 DVLTEERRSIILRQTLPAAIQLHAERLSVRPVTRPVLIPR  159 (567)
T ss_pred             cccCHHHHHHHHHHHHHHHHHHHHHheEEecCCCCeeecc
Confidence            4589999999989999999999999999999999999885


No 5  
>PF01457 Peptidase_M8:  Leishmanolysin This Prosite motif covers only the active site. This is family M8 in the peptidase classification. ;  InterPro: IPR001577 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M8 (leishmanolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA.   Leishmanolysin is an enzyme found in the eukaryotes including Leishmania and related parasitic protozoa []. The endopeptidase is the most abundant protein on the cell surface during the promastigote stage of the parasite, and is attached to the membrane by a glycosylphosphatidylinositol anchor []. In the amastigote form, the parasite lives in lysosomes of host macrophages, producing a form of the protease that has an acidic pH optimum []. This differs from most other metalloproteases and may be an adaptation to the environment in which the organism survives [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0007155 cell adhesion, 0016020 membrane; PDB: 1LML_A.
Probab=94.15  E-value=0.021  Score=44.43  Aligned_cols=40  Identities=30%  Similarity=0.376  Sum_probs=29.1

Q ss_pred             ccchHHHHHHHhhcchhhHHHHHHHHHhhhhhcccccchh
Q psy9049           3 VRLDAEKFQLINNTVLPEAVHFWEQALMVRKTEKNTVLPE   42 (64)
Q Consensus         3 ~~l~~ek~~lv~~~llPeav~y~e~~l~VR~~v~~iLLpR   42 (64)
                      |=|+++|++.+.+.++|.|++|+++.|.|+++.+++.+|.
T Consensus        98 dIlt~~k~~~l~~~~~~~~~~~~~~~l~V~~~~~~~~v~~  137 (521)
T PF01457_consen   98 DILTEEKRDTLIKTLIPNAVKLHSERLKVRPVCGNIKVPN  137 (521)
T ss_dssp             G---HHHHHHHHHTHHHHHHHHHHTTEEE--EESEE----
T ss_pred             ccccHHHHHHHHHHHhhhHHHHHHhhhcccccCCCcCccc
Confidence            3478889999999999999999999999999999998876


No 6  
>PTZ00337 surface protease GP63; Provisional
Probab=83.98  E-value=0.54  Score=37.98  Aligned_cols=32  Identities=22%  Similarity=0.356  Sum_probs=28.2

Q ss_pred             hhhcccccchhhHHHHHHHHhhhhccceeeec
Q psy9049          32 RKTEKNTVLPEAVHFWEQALMVRKTEKVIRLN   63 (64)
Q Consensus        32 R~~v~~iLLpRac~y~~~~~~VR~~~~~ilL~   63 (64)
                      +..+.+.+||.+..|+++.+.|+++.+||.+.
T Consensus       127 ~~~i~~~~lp~a~~~~~~~L~V~p~~~~i~v~  158 (567)
T PTZ00337        127 RSIILRQTLPAAIQLHAERLSVRPVTRPVLIP  158 (567)
T ss_pred             HHHHHHHHHHHHHHHHHHheEEecCCCCeeec
Confidence            56666788999999999999999999999764


No 7  
>PTZ00257 Glycoprotein GP63 (leishmanolysin); Provisional
Probab=79.88  E-value=1.2  Score=36.73  Aligned_cols=32  Identities=22%  Similarity=0.371  Sum_probs=28.9

Q ss_pred             hhhcccccchhhHHHHHHHHhhhhccceeeec
Q psy9049          32 RKTEKNTVLPEAVHFWEQALMVRKTEKVIRLN   63 (64)
Q Consensus        32 R~~v~~iLLpRac~y~~~~~~VR~~~~~ilL~   63 (64)
                      |....+.+||.++..|.+.+.|||+.++|.+.
T Consensus       151 ~~il~~tllP~avk~~~erL~Vrp~~~~i~v~  182 (622)
T PTZ00257        151 RDILVSYLIPQALQLHVERLRVRQVQGSWRVT  182 (622)
T ss_pred             HHHHHHHhHHHHHHHHHHhheeecCCCCeEee
Confidence            88889999999999999999999999998763


No 8  
>PF01457 Peptidase_M8:  Leishmanolysin This Prosite motif covers only the active site. This is family M8 in the peptidase classification. ;  InterPro: IPR001577 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M8 (leishmanolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA.   Leishmanolysin is an enzyme found in the eukaryotes including Leishmania and related parasitic protozoa []. The endopeptidase is the most abundant protein on the cell surface during the promastigote stage of the parasite, and is attached to the membrane by a glycosylphosphatidylinositol anchor []. In the amastigote form, the parasite lives in lysosomes of host macrophages, producing a form of the protease that has an acidic pH optimum []. This differs from most other metalloproteases and may be an adaptation to the environment in which the organism survives [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0007155 cell adhesion, 0016020 membrane; PDB: 1LML_A.
Probab=73.57  E-value=0.18  Score=39.30  Aligned_cols=59  Identities=24%  Similarity=0.223  Sum_probs=36.1

Q ss_pred             cchHHHHHHHhhcchhhHHHHHHHHHh---------------hhhhcccccchhhHHHHHHHHhhhhccceeee
Q psy9049           4 RLDAEKFQLINNTVLPEAVHFWEQALM---------------VRKTEKNTVLPEAVHFWEQALMVRKTEKVIRL   62 (64)
Q Consensus         4 ~l~~ek~~lv~~~llPeav~y~e~~l~---------------VR~~v~~iLLpRac~y~~~~~~VR~~~~~ilL   62 (64)
                      |+..++++|++++..+++++.+...+.               ++......++|.+..|+++.+.|++..++|.+
T Consensus        62 RI~~~~~dl~d~~~~C~~vGq~~~~~~g~~~~C~~~dIlt~~k~~~l~~~~~~~~~~~~~~~l~V~~~~~~~~v  135 (521)
T PF01457_consen   62 RIKVSTDDLNDPKYYCTAVGQVISNFQGTVYTCTEEDILTEEKRDTLIKTLIPNAVKLHSERLKVRPVCGNIKV  135 (521)
T ss_dssp             -EEEE-HHHHSTTSS--STT-EEE-SSS-EEE--GGG---HHHHHHHHHTHHHHHHHHHHTTEEE--EESEE--
T ss_pred             EEEeehhhccCCCccccCCCEEEEcccCceEEccccccccHHHHHHHHHHHhhhHHHHHHhhhcccccCCCcCc
Confidence            445566788888999999988876655               22234445567777899999999999998865


No 9  
>PF13424 TPR_12:  Tetratricopeptide repeat; PDB: 3RO2_A 3Q15_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 4A1S_B 3CEQ_B 3EDT_H ....
Probab=72.05  E-value=8.8  Score=21.20  Aligned_cols=23  Identities=22%  Similarity=0.322  Sum_probs=16.8

Q ss_pred             hcchhhHHHHHHHHHhhhhhccc
Q psy9049          15 NTVLPEAVHFWEQALMVRKTEKN   37 (64)
Q Consensus        15 ~~llPeav~y~e~~l~VR~~v~~   37 (64)
                      ..=+.+|+.++++++.+....++
T Consensus        18 ~~~~~~A~~~~~~al~~~~~~~~   40 (78)
T PF13424_consen   18 LGRYDEALDYYEKALDIEEQLGD   40 (78)
T ss_dssp             TT-HHHHHHHHHHHHHHHHHTTT
T ss_pred             cCCHHHHHHHHHHHHHHHHHHCC
Confidence            33457899999999988766654


No 10 
>PF00515 TPR_1:  Tetratricopeptide repeat;  InterPro: IPR001440 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. The X-ray structure of a domain containing three TPRs from protein phosphatase 5 revealed that TPR adopts a helix-turn-helix arrangement, with adjacent TPR motifs packing in a parallel fashion, resulting in a spiral of repeating anti-parallel alpha-helices []. The two helices are denoted helix A and helix B. The packing angle between helix A and helix B is ~24 degrees; within a single TPR and generates a right-handed superhelical shape. Helix A interacts with helix B and with helix A' of the next TPR. Two protein surfaces are generated: the inner concave surface is contributed to mainly by residue on helices A, and the other surface presents residues from both helices A and B. ; GO: 0005515 protein binding; PDB: 3SF4_C 2LNI_A 1ELW_A 2C0M_A 1FCH_B 3R9A_B 2J9Q_A 2C0L_A 1KT1_A 3FWV_A ....
Probab=66.81  E-value=5.6  Score=19.01  Aligned_cols=18  Identities=22%  Similarity=0.397  Sum_probs=12.8

Q ss_pred             cchhhHHHHHHHHHhhhh
Q psy9049          16 TVLPEAVHFWEQALMVRK   33 (64)
Q Consensus        16 ~llPeav~y~e~~l~VR~   33 (64)
                      .=+++|+.+|++|+...+
T Consensus        15 ~~~~~A~~~~~~al~~~p   32 (34)
T PF00515_consen   15 GDYEEALEYYQRALELDP   32 (34)
T ss_dssp             T-HHHHHHHHHHHHHHST
T ss_pred             CCchHHHHHHHHHHHHCc
Confidence            346888888888887643


No 11 
>PF10265 DUF2217:  Uncharacterized conserved protein (DUF2217);  InterPro: IPR019392  This is a family of conserved proteins varying in length from 500-600 residues. Their function is not known. 
Probab=66.45  E-value=6.8  Score=31.79  Aligned_cols=36  Identities=28%  Similarity=0.542  Sum_probs=28.5

Q ss_pred             ccchHHHHHHHhhcchhhHHHHHHHHHhhhhhcccc
Q psy9049           3 VRLDAEKFQLINNTVLPEAVHFWEQALMVRKTEKNT   38 (64)
Q Consensus         3 ~~l~~ek~~lv~~~llPeav~y~e~~l~VR~~v~~i   38 (64)
                      +-+++|.-.+.--.+|-+|++|||+||-.|....+.
T Consensus       128 ~~~t~EnLy~mGMEafEeal~~WE~ALt~~~~~~~~  163 (514)
T PF10265_consen  128 PDTTPENLYLMGMEAFEEALRYWEQALTARQSQDED  163 (514)
T ss_pred             CcCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCC
Confidence            346677766677778999999999999998776653


No 12 
>PF13176 TPR_7:  Tetratricopeptide repeat; PDB: 3SF4_C 3RO3_A 3RO2_A.
Probab=65.31  E-value=7  Score=19.61  Aligned_cols=16  Identities=31%  Similarity=0.661  Sum_probs=10.5

Q ss_pred             hhhHHHHHHHHHhhhh
Q psy9049          18 LPEAVHFWEQALMVRK   33 (64)
Q Consensus        18 lPeav~y~e~~l~VR~   33 (64)
                      ..+|+.|++++|..+.
T Consensus        15 ~~~Ai~~y~~aL~l~~   30 (36)
T PF13176_consen   15 YEKAIEYYEQALALAR   30 (36)
T ss_dssp             HHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHhcc
Confidence            3667777777665554


No 13 
>PF03538 VRP1:  Salmonella virulence plasmid 28.1kDa A protein;  InterPro: IPR018003 This entry included insecticidal toxin complex proteins (TcaA, TccA, TcbA, TcdA) from Photorhabdus luminescens subsp. laumondii and Xenorhabdus nematophilus (Achromobacter nematophilus) [], and virulence proteins from Salmonella typhimurium that are encoded on a 90kb plasmid. P. luminescens and X. nematophilus are Gram-negative bacteria that form entomopathogenic symbioses with soil nematodes. The bacteria are found in the gut of entomopathogenic nematodes that invade and kill insects. When the nematode invades an insect host the bacteria are released into the insect haemocoel (the open circulatory system), both the bacteria and the nematode undergo multiple rounds of replication which kills the insect host. Mapping of the insecticidal toxin loci and studies on knockout mutants in P. luminescens showed that deletion of either tca or tcd loci dramatically reduced toxicity, while the double mutant tca/tcd abolished toxicity []. However the biology of toxin action is unclear as is the species range of insects the toxins are active against. S. typhimurium contains a 90kb plasmid that is associated with virulence. This plasmid encodes at least 6 genes needed by the bacterium for invading host macrophages during infection. These include the 70kDa mkaA protein [], a recognised virulence factor, and more recently described, four spv genes under the control of a regulator []. Deletion studies on the virulence plasmid have shown that an open reading frame encoding a 28kDa protein was needed for successful invasion of the host. This protein, designated mkfA [], VRP4 [] or VirA [] by different groups, is utilised by the microbe upon entry into macrophages, although the exact mechanism is unclear.
Probab=51.74  E-value=17  Score=27.69  Aligned_cols=48  Identities=25%  Similarity=0.380  Sum_probs=36.5

Q ss_pred             cccchHHHHHHHhhcchhhHHHHHHHHHhhhhhcccccchhhHHHHHHH
Q psy9049           2 KVRLDAEKFQLINNTVLPEAVHFWEQALMVRKTEKNTVLPEAVHFWEQA   50 (64)
Q Consensus         2 ~~~l~~ek~~lv~~~llPeav~y~e~~l~VR~~v~~iLLpRac~y~~~~   50 (64)
                      -++|++|.+.++.+.+.|.+-.||...|-+. ...-.-|...-.|-++|
T Consensus       273 ~~~l~p~q~~~~~e~l~~~~~~Fy~~nyG~~-~~~~~~l~~~~~F~~~T  320 (325)
T PF03538_consen  273 ASRLSPEQQAIITEPLTPSADDFYKTNYGVS-TDDSQPLEDLDVFCQQT  320 (325)
T ss_pred             hccCCHHHHHHHhhhcchhHHHHHHHhcCCc-cchhhhHHHHHHHHHHh
Confidence            4789999999999999999999999988885 22223455555666655


No 14 
>KOG2997|consensus
Probab=50.96  E-value=17  Score=28.60  Aligned_cols=50  Identities=20%  Similarity=0.359  Sum_probs=39.8

Q ss_pred             ccchHHHHHHHhhcchhhHHHHHHHHHhhhhhcccccchhhHHHHHHHHhhhh
Q psy9049           3 VRLDAEKFQLINNTVLPEAVHFWEQALMVRKTEKNTVLPEAVHFWEQALMVRK   55 (64)
Q Consensus         3 ~~l~~ek~~lv~~~llPeav~y~e~~l~VR~~v~~iLLpRac~y~~~~~~VR~   55 (64)
                      |+|+++....=++-+..+|+..|++|...-+-   =-+..|+.||.+|+.+=+
T Consensus         1 d~ls~~~~~~ekd~~~kkA~~l~~~av~~Eq~---G~l~dai~fYR~AlqI~~   50 (366)
T KOG2997|consen    1 DQLSAMIVDYEKDPLAKKAIALYEKAVLKEQD---GSLYDAINFYRDALQIVP   50 (366)
T ss_pred             CccchhhcccccchHHHHHHHHHHHHHHHhhc---CcHHHHHHHHHhhhcCCc
Confidence            57788877777888889999999999876543   335899999999987643


No 15 
>COG4492 PheB ACT domain-containing protein [General function prediction only]
Probab=47.82  E-value=5.8  Score=27.78  Aligned_cols=30  Identities=30%  Similarity=0.583  Sum_probs=23.8

Q ss_pred             cccchHHHHHHHhhcchhhHHHHHHHHHhhhhh
Q psy9049           2 KVRLDAEKFQLINNTVLPEAVHFWEQALMVRKT   34 (64)
Q Consensus         2 ~~~l~~ek~~lv~~~llPeav~y~e~~l~VR~~   34 (64)
                      ++--+.+|+.||.+.+||+|+   |+++.|..+
T Consensus         2 ~~~~e~KkfyLvredvLp~s~---~ktlkvk~l   31 (150)
T COG4492           2 KKDTENKKFYLVREDVLPDSM---QKTLKVKDL   31 (150)
T ss_pred             CCccccceEEEEehhhchHHH---HHHHHHHHH
Confidence            444567799999999999998   788877644


No 16 
>PF13414 TPR_11:  TPR repeat; PDB: 2HO1_B 2FI7_B 2DBA_A 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2PL2_B 3IEG_B 2FBN_A ....
Probab=45.61  E-value=22  Score=18.96  Aligned_cols=33  Identities=12%  Similarity=0.257  Sum_probs=19.7

Q ss_pred             HHhhcchhhHHHHHHHHHhhhhhcccccchhhH
Q psy9049          12 LINNTVLPEAVHFWEQALMVRKTEKNTVLPEAV   44 (64)
Q Consensus        12 lv~~~llPeav~y~e~~l~VR~~v~~iLLpRac   44 (64)
                      +....=+.+|+.++++++..-+.-......++.
T Consensus        13 ~~~~~~~~~A~~~~~~ai~~~p~~~~~~~~~g~   45 (69)
T PF13414_consen   13 YFQQGDYEEAIEYFEKAIELDPNNAEAYYNLGL   45 (69)
T ss_dssp             HHHTTHHHHHHHHHHHHHHHSTTHHHHHHHHHH
T ss_pred             HHHcCCHHHHHHHHHHHHHcCCCCHHHHHHHHH
Confidence            344444678888888888776544443333333


No 17 
>PF13374 TPR_10:  Tetratricopeptide repeat; PDB: 3CEQ_B 3EDT_H 3NF1_A.
Probab=44.31  E-value=25  Score=16.71  Aligned_cols=19  Identities=32%  Similarity=0.399  Sum_probs=12.4

Q ss_pred             chhhHHHHHHHHHhhhhhc
Q psy9049          17 VLPEAVHFWEQALMVRKTE   35 (64)
Q Consensus        17 llPeav~y~e~~l~VR~~v   35 (64)
                      =+.+|..++++++..+.-+
T Consensus        17 ~~~~A~~~~~~al~~~~~~   35 (42)
T PF13374_consen   17 RYEEALELLEEALEIRERL   35 (42)
T ss_dssp             -HHHHHHHHHHHHHHH---
T ss_pred             hcchhhHHHHHHHHHHHHH
Confidence            3578888888888877544


No 18 
>PF07719 TPR_2:  Tetratricopeptide repeat;  InterPro: IPR013105 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. This repeat includes outlying Tetratricopeptide-like repeats (TPR) that are not matched by IPR001440 from INTERPRO.; PDB: 1XNF_B 3Q15_A 4ABN_A 1OUV_A 3U4T_A 3MA5_C 2KCV_A 2KCL_A 2XEV_A 3NF1_A ....
Probab=40.01  E-value=29  Score=16.03  Aligned_cols=18  Identities=28%  Similarity=0.434  Sum_probs=12.4

Q ss_pred             hhcchhhHHHHHHHHHhh
Q psy9049          14 NNTVLPEAVHFWEQALMV   31 (64)
Q Consensus        14 ~~~llPeav~y~e~~l~V   31 (64)
                      ...=+.+|+.+|++++..
T Consensus        13 ~~~~~~~A~~~~~~al~l   30 (34)
T PF07719_consen   13 QLGNYEEAIEYFEKALEL   30 (34)
T ss_dssp             HTT-HHHHHHHHHHHHHH
T ss_pred             HhCCHHHHHHHHHHHHHH
Confidence            334467888888888765


No 19 
>PF13181 TPR_8:  Tetratricopeptide repeat; PDB: 3GW4_B 3MA5_C 2KCV_A 2KCL_A 3FP3_A 3LCA_A 3FP4_A 3FP2_A 1W3B_B 1ELW_A ....
Probab=36.10  E-value=42  Score=15.63  Aligned_cols=16  Identities=31%  Similarity=0.571  Sum_probs=11.8

Q ss_pred             hhhHHHHHHHHHhhhh
Q psy9049          18 LPEAVHFWEQALMVRK   33 (64)
Q Consensus        18 lPeav~y~e~~l~VR~   33 (64)
                      +.+|+.+|++++...+
T Consensus        17 ~~~A~~~~~~a~~~~~   32 (34)
T PF13181_consen   17 YEEALEYFEKALELNP   32 (34)
T ss_dssp             HHHHHHHHHHHHHHHT
T ss_pred             HHHHHHHHHHHHhhCC
Confidence            4678888888877644


No 20 
>PF04761 Phage_Treg:  Lactococcus bacteriophage putative transcription regulator;  InterPro: IPR006848 This family represents a number of putative transcription repressor proteins found in several Lactococcus bacteriophages. Horizontal transfer may account for the presence of similar proteins in Lactococcus species [].
Probab=32.84  E-value=42  Score=20.02  Aligned_cols=16  Identities=19%  Similarity=0.619  Sum_probs=8.0

Q ss_pred             chHHHHHHHhhcchhh
Q psy9049           5 LDAEKFQLINNTVLPE   20 (64)
Q Consensus         5 l~~ek~~lv~~~llPe   20 (64)
                      |+-.-.+++.|.|+||
T Consensus        26 l~~~s~~flq~slipq   41 (57)
T PF04761_consen   26 LSVDSKDFLQNSLIPQ   41 (57)
T ss_pred             HHHHHHHHHHHhhHHH
Confidence            3334445555555555


No 21 
>PRK04435 hypothetical protein; Provisional
Probab=32.53  E-value=13  Score=24.60  Aligned_cols=25  Identities=44%  Similarity=0.675  Sum_probs=19.9

Q ss_pred             HHHHHHhhcchhhHHHHHHHHHhhhhhc
Q psy9049           8 EKFQLINNTVLPEAVHFWEQALMVRKTE   35 (64)
Q Consensus         8 ek~~lv~~~llPeav~y~e~~l~VR~~v   35 (64)
                      .|+.+|.+.+|||++   ++++.++.+.
T Consensus         5 ~~~~~v~~~~lp~~~---~k~~~~k~ll   29 (147)
T PRK04435          5 KKFYLVREDVLPEAV---EKTLKAKELL   29 (147)
T ss_pred             CeEEEEecccChHHH---HHHHHHHHHH
Confidence            477899999999986   7888777554


No 22 
>cd03074 PDI_b'_Calsequestrin_C Protein Disulfide Isomerase (PDIb') family, Calsequestrin subfamily, C-terminal TRX-fold domain; Calsequestrin is the major calcium storage protein in the sarcoplasmic reticulum (SR) of skeletal and cardiac muscle. It stores calcium ions in sufficient quantities (up to 20 mM) to allow repetitive contractions and is essential to maintain movement, respiration and heart beat. A missense mutation in human cardiac calsequestrin is associated with catecholamine-induced polymorphic ventricular tachycardia (CPVT), a rare disease characterized by seizures or sudden death in response to physiologic or emotional stress. Calsequestrin is a highly acidic protein with up to 50 calcium binding sites formed simply by the clustering of two or more acidic residues. The monomer contains three redox inactive TRX-fold domains. Calsequestrin is condensed as a linear polymer in the SR lumen and is membrane-anchored through binding with intra-membrane proteins triadin, junctin 
Probab=32.31  E-value=22  Score=24.17  Aligned_cols=31  Identities=23%  Similarity=0.492  Sum_probs=20.6

Q ss_pred             chhhHHHHHHHHHhh---hhhcccccchhhHHHH
Q psy9049          17 VLPEAVHFWEQALMV---RKTEKNTVLPEAVHFW   47 (64)
Q Consensus        17 llPeav~y~e~~l~V---R~~v~~iLLpRac~y~   47 (64)
                      =||--+.|||++|.+   ++.+|-+-...|-+.|
T Consensus        64 ~FPllv~yWektF~IDl~~PqIGVV~vtdadSvW   97 (120)
T cd03074          64 DFPLLVPYWEKTFGIDLFRPQIGVVNVTDADSVW   97 (120)
T ss_pred             cCchhhHHHHhhcCcccCCCceeeEeccccccee
Confidence            478999999999976   3555544444444444


No 23 
>PF00956 NAP:  Nucleosome assembly protein (NAP);  InterPro: IPR002164 It is thought that NAPs act as histone chaperones, shuttling both core and linker histones from their site of synthesis in the cytoplasm to the nucleus. The proteins may be involved in regulating gene expression and therefore cellular differentiation [, ].  The centrosomal protein c-Nap1, also known as Cep250, has been implicated in the cell-cycle-regulated cohesion of microtubule-organizing centres. This 281 kDa protein consists mainly of domains predicted to form coiled coil structures. The C-terminal region defines a novel histone-binding domain that is responsible for targeting CNAP1, and possibly condensin, to mitotic chromosomes []. During interphase, C-Nap1 localizes to the proximal ends of both parental centrioles, but it dissociates from these structures at the onset of mitosis. Re-association with centrioles then occurs in late telophase or at the very beginning of G1 phase, when daughter cells are still connected by post-mitotic bridges. Electron microscopic studies performed on isolated centrosomes suggest that a proteinaceous linker connects parental centrioles and C-Nap1 may be part of a linker structure that assures the cohesion of duplicated centrosomes during interphase, but that is dismantled upon centrosome separation at the onset of mitosis []. ; GO: 0006334 nucleosome assembly, 0005634 nucleus; PDB: 2E50_Q 2Z2R_A 2AYU_A 3Q66_A 3C9B_A 3Q68_B 3Q33_B 2ZD7_B 3DM7_A 3C9D_A ....
Probab=31.42  E-value=31  Score=24.19  Aligned_cols=18  Identities=22%  Similarity=0.632  Sum_probs=15.4

Q ss_pred             HHHHHhhcchhhHHHHHH
Q psy9049           9 KFQLINNTVLPEAVHFWE   26 (64)
Q Consensus         9 k~~lv~~~llPeav~y~e   26 (64)
                      --..|++.|.|.|+.||-
T Consensus       224 i~~~i~d~i~P~av~yy~  241 (244)
T PF00956_consen  224 IGEIIKDDIIPNAVKYYT  241 (244)
T ss_dssp             HHHHHHHTCCCHHHHHHH
T ss_pred             HHHHHHhheechHHHHhC
Confidence            447899999999999984


No 24 
>PF02071 NSF:  Aromatic-di-Alanine (AdAR) repeat ;  InterPro: IPR000744 Regulated exocytosis of neurotransmitters and hormones, as well as intracellular traffic, requires fusion of two lipid bilayers. SNARE proteins are thought to form a protein bridge, the SNARE complex, between an incoming vesicle and the acceptor compartment. SNARE proteins contribute to the specificity of membrane fusion, implying that the mechanisms by which SNAREs are targeted to subcellular compartments are important for specific docking and fusion of vesicles. This mechanism involves a family of conserved proteins, members of which appear to function at all sites of constitutive and regulated secretion in eukaryotes []. Among them are 2 types of cytosolic protein, NSF (N-ethyl-maleimide-sensitive protein) and the SNAPs (alpha-, beta- and gamma-soluble NSF attachment proteins). The yeast vesicular fusion protein, sec17, a cytoplasmic peripheral membrane protein involved in vesicular transport between the endoplasmic reticulum and the golgi apparatus, shows a high degree of sequence similarity to the alpha-SNAP family.  SNAP-25 and its non-neuronal homologue Syndet/SNAP-23 are synthesized as soluble proteins in the cytosol. Both SNAP-25 and Syndet/SNAP-23 are palmitoylated at cysteine residues clustered in a loop between two N- and C-terminal coils and palmitoylation is essential for membrane binding and plasma membrane targeting. The C-terminal and the N-terminal helices of SNAP-25, are each targeted to the plasma membrane by two distinct cysteine-rich domains and appear to regulate the availability of SNAP to form complexes with SNARE [].; GO: 0006886 intracellular protein transport
Probab=31.31  E-value=26  Score=14.86  Aligned_cols=8  Identities=38%  Similarity=0.704  Sum_probs=4.1

Q ss_pred             HHHHHHHH
Q psy9049          21 AVHFWEQA   28 (64)
Q Consensus        21 av~y~e~~   28 (64)
                      |+.+|++|
T Consensus         1 A~~~y~~A    8 (12)
T PF02071_consen    1 AIKCYEKA    8 (12)
T ss_pred             CcHHHHHH
Confidence            34555554


No 25 
>PHA00687 hypothetical protein
Probab=31.11  E-value=28  Score=20.58  Aligned_cols=21  Identities=33%  Similarity=0.401  Sum_probs=17.6

Q ss_pred             HHHHhhcchhhHHHHHHHHHh
Q psy9049          10 FQLINNTVLPEAVHFWEQALM   30 (64)
Q Consensus        10 ~~lv~~~llPeav~y~e~~l~   30 (64)
                      ++-|..+|-|||...+++|-+
T Consensus         4 rqpvqttlppeamrllqqaaq   24 (56)
T PHA00687          4 RQPVQTTLPPEAMRLLQQAAQ   24 (56)
T ss_pred             cCcccccCCHHHHHHHHHHhc
Confidence            466888999999999998865


No 26 
>KOG1173|consensus
Probab=30.40  E-value=72  Score=26.80  Aligned_cols=40  Identities=30%  Similarity=0.547  Sum_probs=26.2

Q ss_pred             cchhhHHHHHHHHHhhhhhccccc-------------------chhhHHHHHHHHhhhh
Q psy9049          16 TVLPEAVHFWEQALMVRKTEKNTV-------------------LPEAVHFWEQALMVRK   55 (64)
Q Consensus        16 ~llPeav~y~e~~l~VR~~v~~iL-------------------LpRac~y~~~~~~VR~   55 (64)
                      +-+|+|+.|+++|+.+=+.+.+..                   +..|+.||++++..-+
T Consensus       428 ~~y~~A~~~f~~~l~~ik~~~~e~~~w~p~~~NLGH~~Rkl~~~~eAI~~~q~aL~l~~  486 (611)
T KOG1173|consen  428 EEYPEALKYFQKALEVIKSVLNEKIFWEPTLNNLGHAYRKLNKYEEAIDYYQKALLLSP  486 (611)
T ss_pred             hhhHHHHHHHHHHHHHhhhccccccchhHHHHhHHHHHHHHhhHHHHHHHHHHHHHcCC
Confidence            446888888888885554444432                   4567777777776544


No 27 
>PF14559 TPR_19:  Tetratricopeptide repeat; PDB: 2R5S_A 3QDN_B 3QOU_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 3FP3_A 3LCA_A ....
Probab=29.22  E-value=79  Score=16.62  Aligned_cols=30  Identities=23%  Similarity=0.190  Sum_probs=19.7

Q ss_pred             hhcchhhHHHHHHHHHhhhhhcccccchhh
Q psy9049          14 NNTVLPEAVHFWEQALMVRKTEKNTVLPEA   43 (64)
Q Consensus        14 ~~~llPeav~y~e~~l~VR~~v~~iLLpRa   43 (64)
                      .+.=+.+|+.++++++..-+--..+.+.-+
T Consensus         3 ~~~~~~~A~~~~~~~l~~~p~~~~~~~~la   32 (68)
T PF14559_consen    3 KQGDYDEAIELLEKALQRNPDNPEARLLLA   32 (68)
T ss_dssp             HTTHHHHHHHHHHHHHHHTTTSHHHHHHHH
T ss_pred             hccCHHHHHHHHHHHHHHCCCCHHHHHHHH
Confidence            334467888888888887766555554333


No 28 
>cd06588 PhnB_like Escherichia coli PhnB and similar proteins; the E. coli phnB gene is found next to an operon involved in the cleavage of carbon-phosphorus bonds in unactivated alkylphosphonates. The Escherichia coli phnB gene is found next to an operon of fourteen genes (phnC-to-phnP) related to the cleavage of carbon-phosphorus (C-P) bonds in unactivated alkylphosphonates, supporting bacterial growth on alkylphosphonates as the sole phosphorus source. It was originally considered part of that operon. PhnB appears to play no direct catalytic role in the usage of alkylphosphonate. Although many of the proteins in this family have been annotated as 3-demethylubiquinone-9 3-methyltransferase enzymes by automatic annotation programs, the experimental evidence for this assignment is lacking. In Escherichia coli, the gene coding 3-demethylubiquinone-9 3-methyltransferase enzyme is ubiG, which belongs to the AdoMet-MTase protein family. PhnB-like proteins adopt a structural fold similar to 
Probab=29.17  E-value=38  Score=20.63  Aligned_cols=13  Identities=23%  Similarity=0.465  Sum_probs=10.7

Q ss_pred             hhhHHHHHHHHHh
Q psy9049          18 LPEAVHFWEQALM   30 (64)
Q Consensus        18 lPeav~y~e~~l~   30 (64)
                      ..+|+.||+++|-
T Consensus        11 ~~eAi~FY~~~fg   23 (128)
T cd06588          11 AEEALEFYQSVFG   23 (128)
T ss_pred             HHHHHHHHHHHhC
Confidence            3689999999884


No 29 
>KOG1717|consensus
Probab=28.53  E-value=33  Score=26.76  Aligned_cols=18  Identities=28%  Similarity=0.492  Sum_probs=15.1

Q ss_pred             hcchhhHHHHHHHHHhhh
Q psy9049          15 NTVLPEAVHFWEQALMVR   32 (64)
Q Consensus        15 ~~llPeav~y~e~~l~VR   32 (64)
                      ...|||||+|-.+|.++-
T Consensus       234 s~ffpEAIsfIdeArsk~  251 (343)
T KOG1717|consen  234 SQFFPEAISFIDEARSKN  251 (343)
T ss_pred             hhhhHHHHHHHHHhhccC
Confidence            468999999999998754


No 30 
>PF13013 F-box-like_2:  F-box-like domain
Probab=27.58  E-value=76  Score=20.59  Aligned_cols=30  Identities=17%  Similarity=0.287  Sum_probs=15.6

Q ss_pred             HHHHHHHhhcchhhHHHHHHHHHhhhhhcc
Q psy9049           7 AEKFQLINNTVLPEAVHFWEQALMVRKTEK   36 (64)
Q Consensus         7 ~ek~~lv~~~llPeav~y~e~~l~VR~~v~   36 (64)
                      ++....+++-+=|-++.||.+.++++..--
T Consensus        66 p~~c~~lr~lls~~s~~~w~~~l~~~~~~~   95 (109)
T PF13013_consen   66 PQSCTRLRSLLSWTSKMFWKSFLRVRDDYR   95 (109)
T ss_pred             HHHHHHHHHHcCcccHHHHHHHHHHHhccH
Confidence            344444554444445666666666655443


No 31 
>PTZ00008 (NAP-S) nucleosome assembly protein-S; Provisional
Probab=27.39  E-value=33  Score=23.93  Aligned_cols=20  Identities=5%  Similarity=0.219  Sum_probs=16.6

Q ss_pred             HHHHHHHhhcchhhHHHHHH
Q psy9049           7 AEKFQLINNTVLPEAVHFWE   26 (64)
Q Consensus         7 ~ek~~lv~~~llPeav~y~e   26 (64)
                      -|--++|++.|+|.|+.|+.
T Consensus       158 ~eIg~~i~e~i~P~av~yy~  177 (185)
T PTZ00008        158 PDVGEIIRREIWHAPLLYYL  177 (185)
T ss_pred             HHHHHHHHHhhccchHHhhC
Confidence            34567899999999999985


No 32 
>PTZ00007 (NAP-L) nucleosome assembly protein -L; Provisional
Probab=25.44  E-value=33  Score=26.28  Aligned_cols=18  Identities=22%  Similarity=0.253  Sum_probs=15.2

Q ss_pred             HHHHHhhcchhhHHHHHH
Q psy9049           9 KFQLINNTVLPEAVHFWE   26 (64)
Q Consensus         9 k~~lv~~~llPeav~y~e   26 (64)
                      .=..+++.|+|.||.||.
T Consensus       259 iG~~ikd~IIP~AV~yft  276 (337)
T PTZ00007        259 IGITIRDKLIPYAVYWFL  276 (337)
T ss_pred             HHHHHHHhcccccHHhhC
Confidence            446799999999999984


No 33 
>PF09533 DUF2380:  Predicted lipoprotein of unknown function (DUF2380);  InterPro: IPR011755 This family consists of at least 9 paralogs in Myxococcus xanthus, a member of the Deltaproteobacteria. One appears truncated toward the N terminus; the others are predicted lipoproteins. The function is unknown.
Probab=23.33  E-value=34  Score=24.74  Aligned_cols=38  Identities=21%  Similarity=0.487  Sum_probs=23.0

Q ss_pred             cchHHHHHHHhhcchhhH---HHHHH-HHHhhhhhcccccchhhHH
Q psy9049           4 RLDAEKFQLINNTVLPEA---VHFWE-QALMVRKTEKNTVLPEAVH   45 (64)
Q Consensus         4 ~l~~ek~~lv~~~llPea---v~y~e-~~l~VR~~v~~iLLpRac~   45 (64)
                      +++++  .+++..|||||   -.||+ +-+-+...+.  .+|+.++
T Consensus        92 ~~~~g--~~ekHHIFPQa~~la~wF~~~Gi~IHd~ti--~Ip~~vH  133 (188)
T PF09533_consen   92 RLPPG--RWEKHHIFPQAEELAEWFERRGIDIHDYTI--PIPRDVH  133 (188)
T ss_pred             CCCCC--cchhhccCCCcHHHHHHHHHcCCChhheeE--ecCHHHH
Confidence            34444  56999999996   34444 4444445444  4488775


No 34 
>PF10579 Rapsyn_N:  Rapsyn N-terminal myristoylation and linker region;  InterPro: IPR019568  Neuromuscular junction formation relies upon the clustering of acetylcholine receptors and other proteins in the muscle membrane. Rapsyn is a peripheral membrane protein that is selectively concentrated at the neuromuscular junction and is essential for the formation of synaptic acetylcholine receptor aggregates. Acetylcholine receptors fail to aggregate beneath nerve terminals in mice where rapsyn has been knocked out. The N-terminal six amino acids of rapsyn are its myristoylation site, and myristoylation is necessary for the targeting of the protein to the membrane []. ; GO: 0008270 zinc ion binding, 0033130 acetylcholine receptor binding, 0007268 synaptic transmission, 0005856 cytoskeleton, 0030054 cell junction, 0045211 postsynaptic membrane
Probab=22.97  E-value=1.1e+02  Score=19.24  Aligned_cols=32  Identities=22%  Similarity=0.293  Sum_probs=20.3

Q ss_pred             hhHHHHHHHHHhhhhhcccccchhhHHHHHHHHh
Q psy9049          19 PEAVHFWEQALMVRKTEKNTVLPEAVHFWEQALM   52 (64)
Q Consensus        19 Peav~y~e~~l~VR~~v~~iLLpRac~y~~~~~~   52 (64)
                      -+|++-|.+||.+..--..  -=++..|+.+|++
T Consensus        23 ~~Al~~W~~aL~k~~~~~~--rf~~lG~l~qA~~   54 (80)
T PF10579_consen   23 QQALQKWRKALEKITDRED--RFRVLGYLIQAHM   54 (80)
T ss_pred             HHHHHHHHHHHhhcCChHH--HHHHHHHHHHHHH
Confidence            5799999999987644221  2345566666654


No 35 
>PF01216 Calsequestrin:  Calsequestrin;  InterPro: IPR001393 Calsequestrin is the principal calcium-binding protein present in the sarcoplasmic reticulum of cardiac and skeletal muscle []. It is a highly acidic protein that is able to bind over 40 calcium ions and acts as an internal calcium store in muscle. Sequence analysis has suggested that calcium is not bound in distinct pockets via EF-hand motifs, but rather via presentation of a charged protein surface. Two forms of calsequestrin have been identified. The cardiac form is present in cardiac and slow skeletal muscle and the fast skeletal form is found in fast skeletal muscle. The release of calsequestrin-bound calcium (through a a calcium release channel) triggers muscle contraction. The active protein is not highly structured, more than 50% of it adopting a random coil conformation []. When calcium binds there is a structural change whereby the alpha-helical content of the protein increases from 3 to 11% []. Both forms of calsequestrin are phosphorylated by casein kinase II, but the cardiac form is phosphorylated more rapidly and to a higher degree [].; GO: 0005509 calcium ion binding; PDB: 1SJI_B 2VAF_A 3UOM_C 1A8Y_A 3TRQ_A 3TRP_A 3US3_A.
Probab=22.38  E-value=58  Score=25.88  Aligned_cols=15  Identities=27%  Similarity=0.702  Sum_probs=12.0

Q ss_pred             chhhHHHHHHHHHhh
Q psy9049          17 VLPEAVHFWEQALMV   31 (64)
Q Consensus        17 llPeav~y~e~~l~V   31 (64)
                      =||-.+.|||++|.+
T Consensus       312 ~fPllv~yWE~tF~I  326 (383)
T PF01216_consen  312 DFPLLVPYWEKTFGI  326 (383)
T ss_dssp             G-HHHHHHHHHHHTT
T ss_pred             CCchhHHHHHhhcCc
Confidence            379999999999865


No 36 
>smart00671 SEL1 Sel1-like repeats. These represent a subfamily of TPR (tetratricopeptide repeat) sequences.
Probab=21.87  E-value=74  Score=14.79  Aligned_cols=11  Identities=18%  Similarity=0.558  Sum_probs=8.2

Q ss_pred             hhHHHHHHHHH
Q psy9049          19 PEAVHFWEQAL   29 (64)
Q Consensus        19 Peav~y~e~~l   29 (64)
                      .+|+.||++|-
T Consensus        22 ~~A~~~~~~Aa   32 (36)
T smart00671       22 EKALEYYKKAA   32 (36)
T ss_pred             HHHHHHHHHHH
Confidence            57888888764


No 37 
>PF10373 EST1_DNA_bind:  Est1 DNA/RNA binding domain;  InterPro: IPR018834  Est1 is a protein which recruits or activates telomerase at the site of polymerisation [, ]. This is the DNA/RNA binding domain of EST1 []. ; PDB: 1YA0_B.
Probab=20.71  E-value=1.2e+02  Score=20.30  Aligned_cols=36  Identities=22%  Similarity=0.330  Sum_probs=18.9

Q ss_pred             HHHHHHHHHhhhhhcccc------------cchhhHHHHHHHHhhhhc
Q psy9049          21 AVHFWEQALMVRKTEKNT------------VLPEAVHFWEQALMVRKT   56 (64)
Q Consensus        21 av~y~e~~l~VR~~v~~i------------LLpRac~y~~~~~~VR~~   56 (64)
                      |.+||.+|..+-+..|.-            --=+|+-||.|++.++.+
T Consensus         1 A~~~Y~~A~~l~P~~G~p~nQLAvl~~~~~~~l~avy~y~Rsl~~~~P   48 (278)
T PF10373_consen    1 AERYYRKAIRLLPSNGNPYNQLAVLASYQGDDLDAVYYYIRSLAVRIP   48 (278)
T ss_dssp             HHHHHHHHHHH-TTBSHHHHHHHHHHHHTT-HHHHHHHHHHHHSSSB-
T ss_pred             CHHHHHHHHHhCCCCCCcccchhhhhccccchHHHHHHHHHHHhcCCC
Confidence            455666666665554432            112556677777766654


No 38 
>PF05794 Tcp11:  T-complex protein 11;  InterPro: IPR008862 This family consists of several eukaryotic T-complex protein 11 (Tcp11) related sequences. Tcp11 is only expressed in fertile adult mammalian testes and is thought to be important in sperm function and fertility. The family also contains the Saccharomyces cerevisiae Sok1 protein which is known to suppress cyclic AMP-dependent protein kinase mutants [].
Probab=20.62  E-value=1.9e+02  Score=21.59  Aligned_cols=45  Identities=11%  Similarity=0.189  Sum_probs=38.1

Q ss_pred             HHHhhcchhhHHHHHHHHHhhhhhcccccchhhHHHHHHHHhhhh
Q psy9049          11 QLINNTVLPEAVHFWEQALMVRKTEKNTVLPEAVHFWEQALMVRK   55 (64)
Q Consensus        11 ~lv~~~llPeav~y~e~~l~VR~~v~~iLLpRac~y~~~~~~VR~   55 (64)
                      +...-.|...||.|-.+.|+-+--.++.-|+.....|.++..--.
T Consensus       163 ~~~rp~L~~~sv~yEr~~F~~~l~~~~~~l~~T~~Wl~~~~~~~~  207 (441)
T PF05794_consen  163 RSLRPQLIEHSVEYERKKFQERLEKGPNSLPRTKAWLKRARSELE  207 (441)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHhCcCChHHHHHHHHHHHHHHh
Confidence            344556788899999999999999999999999999999876543


Done!