Query psy9049
Match_columns 64
No_of_seqs 52 out of 54
Neff 3.8
Searched_HMMs 46136
Date Fri Aug 16 17:40:07 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy9049.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/9049hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2556|consensus 98.5 1.6E-08 3.6E-13 81.1 -1.4 52 2-53 93-147 (666)
2 PTZ00257 Glycoprotein GP63 (le 97.0 0.00043 9.4E-09 56.2 2.3 40 3-42 144-183 (622)
3 KOG2556|consensus 96.9 0.00017 3.8E-09 58.4 -0.8 34 31-64 100-133 (666)
4 PTZ00337 surface protease GP63 96.8 0.00064 1.4E-08 54.3 1.7 40 3-42 120-159 (567)
5 PF01457 Peptidase_M8: Leishma 94.1 0.021 4.6E-07 44.4 1.0 40 3-42 98-137 (521)
6 PTZ00337 surface protease GP63 84.0 0.54 1.2E-05 38.0 1.2 32 32-63 127-158 (567)
7 PTZ00257 Glycoprotein GP63 (le 79.9 1.2 2.6E-05 36.7 1.8 32 32-63 151-182 (622)
8 PF01457 Peptidase_M8: Leishma 73.6 0.18 3.9E-06 39.3 -4.2 59 4-62 62-135 (521)
9 PF13424 TPR_12: Tetratricopep 72.0 8.8 0.00019 21.2 3.6 23 15-37 18-40 (78)
10 PF00515 TPR_1: Tetratricopept 66.8 5.6 0.00012 19.0 1.8 18 16-33 15-32 (34)
11 PF10265 DUF2217: Uncharacteri 66.5 6.8 0.00015 31.8 3.1 36 3-38 128-163 (514)
12 PF13176 TPR_7: Tetratricopept 65.3 7 0.00015 19.6 2.0 16 18-33 15-30 (36)
13 PF03538 VRP1: Salmonella viru 51.7 17 0.00036 27.7 2.8 48 2-50 273-320 (325)
14 KOG2997|consensus 51.0 17 0.00037 28.6 2.9 50 3-55 1-50 (366)
15 COG4492 PheB ACT domain-contai 47.8 5.8 0.00012 27.8 -0.1 30 2-34 2-31 (150)
16 PF13414 TPR_11: TPR repeat; P 45.6 22 0.00049 19.0 2.1 33 12-44 13-45 (69)
17 PF13374 TPR_10: Tetratricopep 44.3 25 0.00055 16.7 2.0 19 17-35 17-35 (42)
18 PF07719 TPR_2: Tetratricopept 40.0 29 0.00062 16.0 1.8 18 14-31 13-30 (34)
19 PF13181 TPR_8: Tetratricopept 36.1 42 0.00091 15.6 2.0 16 18-33 17-32 (34)
20 PF04761 Phage_Treg: Lactococc 32.8 42 0.00092 20.0 2.0 16 5-20 26-41 (57)
21 PRK04435 hypothetical protein; 32.5 13 0.00028 24.6 -0.3 25 8-35 5-29 (147)
22 cd03074 PDI_b'_Calsequestrin_C 32.3 22 0.00047 24.2 0.8 31 17-47 64-97 (120)
23 PF00956 NAP: Nucleosome assem 31.4 31 0.00067 24.2 1.5 18 9-26 224-241 (244)
24 PF02071 NSF: Aromatic-di-Alan 31.3 26 0.00057 14.9 0.7 8 21-28 1-8 (12)
25 PHA00687 hypothetical protein 31.1 28 0.0006 20.6 1.0 21 10-30 4-24 (56)
26 KOG1173|consensus 30.4 72 0.0016 26.8 3.5 40 16-55 428-486 (611)
27 PF14559 TPR_19: Tetratricopep 29.2 79 0.0017 16.6 2.6 30 14-43 3-32 (68)
28 cd06588 PhnB_like Escherichia 29.2 38 0.00082 20.6 1.4 13 18-30 11-23 (128)
29 KOG1717|consensus 28.5 33 0.00072 26.8 1.3 18 15-32 234-251 (343)
30 PF13013 F-box-like_2: F-box-l 27.6 76 0.0016 20.6 2.7 30 7-36 66-95 (109)
31 PTZ00008 (NAP-S) nucleosome as 27.4 33 0.00071 23.9 1.0 20 7-26 158-177 (185)
32 PTZ00007 (NAP-L) nucleosome as 25.4 33 0.00072 26.3 0.8 18 9-26 259-276 (337)
33 PF09533 DUF2380: Predicted li 23.3 34 0.00074 24.7 0.5 38 4-45 92-133 (188)
34 PF10579 Rapsyn_N: Rapsyn N-te 23.0 1.1E+02 0.0024 19.2 2.7 32 19-52 23-54 (80)
35 PF01216 Calsequestrin: Calseq 22.4 58 0.0013 25.9 1.6 15 17-31 312-326 (383)
36 smart00671 SEL1 Sel1-like repe 21.9 74 0.0016 14.8 1.5 11 19-29 22-32 (36)
37 PF10373 EST1_DNA_bind: Est1 D 20.7 1.2E+02 0.0026 20.3 2.7 36 21-56 1-48 (278)
38 PF05794 Tcp11: T-complex prot 20.6 1.9E+02 0.0041 21.6 4.0 45 11-55 163-207 (441)
No 1
>KOG2556|consensus
Probab=98.47 E-value=1.6e-08 Score=81.09 Aligned_cols=52 Identities=17% Similarity=0.077 Sum_probs=46.8
Q ss_pred cccchHHHHHHHhhcchhhHHHHHHHHHhhhhhcccccchhhH---HHHHHHHhh
Q psy9049 2 KVRLDAEKFQLINNTVLPEAVHFWEQALMVRKTEKNTVLPEAV---HFWEQALMV 53 (64)
Q Consensus 2 ~~~l~~ek~~lv~~~llPeav~y~e~~l~VR~~v~~iLLpRac---~y~~~~~~V 53 (64)
|+.|.+|+..+|+++|+|||+.|||.+|+||+...+|+|.|.| .|+.+..|-
T Consensus 93 Ve~l~~e~~n~v~s~l~pea~~y~e~~~~vr~~a~~i~L~R~C~~~~y~~K~g~p 147 (666)
T KOG2556|consen 93 VEKLFHENGNVVLSVLDPEADVYQEIRIYVRYDAVGISLDRDCQRVGYIVKLGEP 147 (666)
T ss_pred HHHHhHhhchhhhhhcChhhhhcceeeEEEecCCcceeeehhhhhhhhhhhcCCC
Confidence 5789999999999999999999999999999999999999999 566665554
No 2
>PTZ00257 Glycoprotein GP63 (leishmanolysin); Provisional
Probab=96.99 E-value=0.00043 Score=56.21 Aligned_cols=40 Identities=20% Similarity=0.287 Sum_probs=37.2
Q ss_pred ccchHHHHHHHhhcchhhHHHHHHHHHhhhhhcccccchh
Q psy9049 3 VRLDAEKFQLINNTVLPEAVHFWEQALMVRKTEKNTVLPE 42 (64)
Q Consensus 3 ~~l~~ek~~lv~~~llPeav~y~e~~l~VR~~v~~iLLpR 42 (64)
|=|++||+++++++|+|+|+.+|.+.|.||++.+++.+|.
T Consensus 144 DILT~eK~~il~~tllP~avk~~~erL~Vrp~~~~i~v~~ 183 (622)
T PTZ00257 144 DILTEEKRDILVSYLIPQALQLHVERLRVRQVQGSWRVTG 183 (622)
T ss_pred cccCHHHHHHHHHHhHHHHHHHHHHhheeecCCCCeEeec
Confidence 5689999999999999999999999999999999998874
No 3
>KOG2556|consensus
Probab=96.88 E-value=0.00017 Score=58.44 Aligned_cols=34 Identities=26% Similarity=0.225 Sum_probs=32.1
Q ss_pred hhhhcccccchhhHHHHHHHHhhhhccceeeecC
Q psy9049 31 VRKTEKNTVLPEAVHFWEQALMVRKTEKVIRLNR 64 (64)
Q Consensus 31 VR~~v~~iLLpRac~y~~~~~~VR~~~~~ilL~R 64 (64)
+-..++++++|+|..||+.+++||..+++|+|+|
T Consensus 100 ~~n~v~s~l~pea~~y~e~~~~vr~~a~~i~L~R 133 (666)
T KOG2556|consen 100 NGNVVLSVLDPEADVYQEIRIYVRYDAVGISLDR 133 (666)
T ss_pred hchhhhhhcChhhhhcceeeEEEecCCcceeeeh
Confidence 4578999999999999999999999999999998
No 4
>PTZ00337 surface protease GP63; Provisional
Probab=96.78 E-value=0.00064 Score=54.34 Aligned_cols=40 Identities=25% Similarity=0.430 Sum_probs=37.2
Q ss_pred ccchHHHHHHHhhcchhhHHHHHHHHHhhhhhcccccchh
Q psy9049 3 VRLDAEKFQLINNTVLPEAVHFWEQALMVRKTEKNTVLPE 42 (64)
Q Consensus 3 ~~l~~ek~~lv~~~llPeav~y~e~~l~VR~~v~~iLLpR 42 (64)
|-|++||++++.+.+||.|+.|+++.|.|+++.+++.+|+
T Consensus 120 DILT~eK~~~i~~~~lp~a~~~~~~~L~V~p~~~~i~v~~ 159 (567)
T PTZ00337 120 DVLTEERRSIILRQTLPAAIQLHAERLSVRPVTRPVLIPR 159 (567)
T ss_pred cccCHHHHHHHHHHHHHHHHHHHHHheEEecCCCCeeecc
Confidence 4589999999989999999999999999999999999885
No 5
>PF01457 Peptidase_M8: Leishmanolysin This Prosite motif covers only the active site. This is family M8 in the peptidase classification. ; InterPro: IPR001577 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M8 (leishmanolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Leishmanolysin is an enzyme found in the eukaryotes including Leishmania and related parasitic protozoa []. The endopeptidase is the most abundant protein on the cell surface during the promastigote stage of the parasite, and is attached to the membrane by a glycosylphosphatidylinositol anchor []. In the amastigote form, the parasite lives in lysosomes of host macrophages, producing a form of the protease that has an acidic pH optimum []. This differs from most other metalloproteases and may be an adaptation to the environment in which the organism survives [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0007155 cell adhesion, 0016020 membrane; PDB: 1LML_A.
Probab=94.15 E-value=0.021 Score=44.43 Aligned_cols=40 Identities=30% Similarity=0.376 Sum_probs=29.1
Q ss_pred ccchHHHHHHHhhcchhhHHHHHHHHHhhhhhcccccchh
Q psy9049 3 VRLDAEKFQLINNTVLPEAVHFWEQALMVRKTEKNTVLPE 42 (64)
Q Consensus 3 ~~l~~ek~~lv~~~llPeav~y~e~~l~VR~~v~~iLLpR 42 (64)
|=|+++|++.+.+.++|.|++|+++.|.|+++.+++.+|.
T Consensus 98 dIlt~~k~~~l~~~~~~~~~~~~~~~l~V~~~~~~~~v~~ 137 (521)
T PF01457_consen 98 DILTEEKRDTLIKTLIPNAVKLHSERLKVRPVCGNIKVPN 137 (521)
T ss_dssp G---HHHHHHHHHTHHHHHHHHHHTTEEE--EESEE----
T ss_pred ccccHHHHHHHHHHHhhhHHHHHHhhhcccccCCCcCccc
Confidence 3478889999999999999999999999999999998876
No 6
>PTZ00337 surface protease GP63; Provisional
Probab=83.98 E-value=0.54 Score=37.98 Aligned_cols=32 Identities=22% Similarity=0.356 Sum_probs=28.2
Q ss_pred hhhcccccchhhHHHHHHHHhhhhccceeeec
Q psy9049 32 RKTEKNTVLPEAVHFWEQALMVRKTEKVIRLN 63 (64)
Q Consensus 32 R~~v~~iLLpRac~y~~~~~~VR~~~~~ilL~ 63 (64)
+..+.+.+||.+..|+++.+.|+++.+||.+.
T Consensus 127 ~~~i~~~~lp~a~~~~~~~L~V~p~~~~i~v~ 158 (567)
T PTZ00337 127 RSIILRQTLPAAIQLHAERLSVRPVTRPVLIP 158 (567)
T ss_pred HHHHHHHHHHHHHHHHHHheEEecCCCCeeec
Confidence 56666788999999999999999999999764
No 7
>PTZ00257 Glycoprotein GP63 (leishmanolysin); Provisional
Probab=79.88 E-value=1.2 Score=36.73 Aligned_cols=32 Identities=22% Similarity=0.371 Sum_probs=28.9
Q ss_pred hhhcccccchhhHHHHHHHHhhhhccceeeec
Q psy9049 32 RKTEKNTVLPEAVHFWEQALMVRKTEKVIRLN 63 (64)
Q Consensus 32 R~~v~~iLLpRac~y~~~~~~VR~~~~~ilL~ 63 (64)
|....+.+||.++..|.+.+.|||+.++|.+.
T Consensus 151 ~~il~~tllP~avk~~~erL~Vrp~~~~i~v~ 182 (622)
T PTZ00257 151 RDILVSYLIPQALQLHVERLRVRQVQGSWRVT 182 (622)
T ss_pred HHHHHHHhHHHHHHHHHHhheeecCCCCeEee
Confidence 88889999999999999999999999998763
No 8
>PF01457 Peptidase_M8: Leishmanolysin This Prosite motif covers only the active site. This is family M8 in the peptidase classification. ; InterPro: IPR001577 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to the MEROPS peptidase family M8 (leishmanolysin family, clan MA(M)). The protein fold of the peptidase domain for members of this family resembles that of thermolysin, the type example for clan MA. Leishmanolysin is an enzyme found in the eukaryotes including Leishmania and related parasitic protozoa []. The endopeptidase is the most abundant protein on the cell surface during the promastigote stage of the parasite, and is attached to the membrane by a glycosylphosphatidylinositol anchor []. In the amastigote form, the parasite lives in lysosomes of host macrophages, producing a form of the protease that has an acidic pH optimum []. This differs from most other metalloproteases and may be an adaptation to the environment in which the organism survives [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis, 0007155 cell adhesion, 0016020 membrane; PDB: 1LML_A.
Probab=73.57 E-value=0.18 Score=39.30 Aligned_cols=59 Identities=24% Similarity=0.223 Sum_probs=36.1
Q ss_pred cchHHHHHHHhhcchhhHHHHHHHHHh---------------hhhhcccccchhhHHHHHHHHhhhhccceeee
Q psy9049 4 RLDAEKFQLINNTVLPEAVHFWEQALM---------------VRKTEKNTVLPEAVHFWEQALMVRKTEKVIRL 62 (64)
Q Consensus 4 ~l~~ek~~lv~~~llPeav~y~e~~l~---------------VR~~v~~iLLpRac~y~~~~~~VR~~~~~ilL 62 (64)
|+..++++|++++..+++++.+...+. ++......++|.+..|+++.+.|++..++|.+
T Consensus 62 RI~~~~~dl~d~~~~C~~vGq~~~~~~g~~~~C~~~dIlt~~k~~~l~~~~~~~~~~~~~~~l~V~~~~~~~~v 135 (521)
T PF01457_consen 62 RIKVSTDDLNDPKYYCTAVGQVISNFQGTVYTCTEEDILTEEKRDTLIKTLIPNAVKLHSERLKVRPVCGNIKV 135 (521)
T ss_dssp -EEEE-HHHHSTTSS--STT-EEE-SSS-EEE--GGG---HHHHHHHHHTHHHHHHHHHHTTEEE--EESEE--
T ss_pred EEEeehhhccCCCccccCCCEEEEcccCceEEccccccccHHHHHHHHHHHhhhHHHHHHhhhcccccCCCcCc
Confidence 445566788888999999988876655 22234445567777899999999999998865
No 9
>PF13424 TPR_12: Tetratricopeptide repeat; PDB: 3RO2_A 3Q15_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 4A1S_B 3CEQ_B 3EDT_H ....
Probab=72.05 E-value=8.8 Score=21.20 Aligned_cols=23 Identities=22% Similarity=0.322 Sum_probs=16.8
Q ss_pred hcchhhHHHHHHHHHhhhhhccc
Q psy9049 15 NTVLPEAVHFWEQALMVRKTEKN 37 (64)
Q Consensus 15 ~~llPeav~y~e~~l~VR~~v~~ 37 (64)
..=+.+|+.++++++.+....++
T Consensus 18 ~~~~~~A~~~~~~al~~~~~~~~ 40 (78)
T PF13424_consen 18 LGRYDEALDYYEKALDIEEQLGD 40 (78)
T ss_dssp TT-HHHHHHHHHHHHHHHHHTTT
T ss_pred cCCHHHHHHHHHHHHHHHHHHCC
Confidence 33457899999999988766654
No 10
>PF00515 TPR_1: Tetratricopeptide repeat; InterPro: IPR001440 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. The X-ray structure of a domain containing three TPRs from protein phosphatase 5 revealed that TPR adopts a helix-turn-helix arrangement, with adjacent TPR motifs packing in a parallel fashion, resulting in a spiral of repeating anti-parallel alpha-helices []. The two helices are denoted helix A and helix B. The packing angle between helix A and helix B is ~24 degrees; within a single TPR and generates a right-handed superhelical shape. Helix A interacts with helix B and with helix A' of the next TPR. Two protein surfaces are generated: the inner concave surface is contributed to mainly by residue on helices A, and the other surface presents residues from both helices A and B. ; GO: 0005515 protein binding; PDB: 3SF4_C 2LNI_A 1ELW_A 2C0M_A 1FCH_B 3R9A_B 2J9Q_A 2C0L_A 1KT1_A 3FWV_A ....
Probab=66.81 E-value=5.6 Score=19.01 Aligned_cols=18 Identities=22% Similarity=0.397 Sum_probs=12.8
Q ss_pred cchhhHHHHHHHHHhhhh
Q psy9049 16 TVLPEAVHFWEQALMVRK 33 (64)
Q Consensus 16 ~llPeav~y~e~~l~VR~ 33 (64)
.=+++|+.+|++|+...+
T Consensus 15 ~~~~~A~~~~~~al~~~p 32 (34)
T PF00515_consen 15 GDYEEALEYYQRALELDP 32 (34)
T ss_dssp T-HHHHHHHHHHHHHHST
T ss_pred CCchHHHHHHHHHHHHCc
Confidence 346888888888887643
No 11
>PF10265 DUF2217: Uncharacterized conserved protein (DUF2217); InterPro: IPR019392 This is a family of conserved proteins varying in length from 500-600 residues. Their function is not known.
Probab=66.45 E-value=6.8 Score=31.79 Aligned_cols=36 Identities=28% Similarity=0.542 Sum_probs=28.5
Q ss_pred ccchHHHHHHHhhcchhhHHHHHHHHHhhhhhcccc
Q psy9049 3 VRLDAEKFQLINNTVLPEAVHFWEQALMVRKTEKNT 38 (64)
Q Consensus 3 ~~l~~ek~~lv~~~llPeav~y~e~~l~VR~~v~~i 38 (64)
+-+++|.-.+.--.+|-+|++|||+||-.|....+.
T Consensus 128 ~~~t~EnLy~mGMEafEeal~~WE~ALt~~~~~~~~ 163 (514)
T PF10265_consen 128 PDTTPENLYLMGMEAFEEALRYWEQALTARQSQDED 163 (514)
T ss_pred CcCCHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCC
Confidence 346677766677778999999999999998776653
No 12
>PF13176 TPR_7: Tetratricopeptide repeat; PDB: 3SF4_C 3RO3_A 3RO2_A.
Probab=65.31 E-value=7 Score=19.61 Aligned_cols=16 Identities=31% Similarity=0.661 Sum_probs=10.5
Q ss_pred hhhHHHHHHHHHhhhh
Q psy9049 18 LPEAVHFWEQALMVRK 33 (64)
Q Consensus 18 lPeav~y~e~~l~VR~ 33 (64)
..+|+.|++++|..+.
T Consensus 15 ~~~Ai~~y~~aL~l~~ 30 (36)
T PF13176_consen 15 YEKAIEYYEQALALAR 30 (36)
T ss_dssp HHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHhcc
Confidence 3667777777665554
No 13
>PF03538 VRP1: Salmonella virulence plasmid 28.1kDa A protein; InterPro: IPR018003 This entry included insecticidal toxin complex proteins (TcaA, TccA, TcbA, TcdA) from Photorhabdus luminescens subsp. laumondii and Xenorhabdus nematophilus (Achromobacter nematophilus) [], and virulence proteins from Salmonella typhimurium that are encoded on a 90kb plasmid. P. luminescens and X. nematophilus are Gram-negative bacteria that form entomopathogenic symbioses with soil nematodes. The bacteria are found in the gut of entomopathogenic nematodes that invade and kill insects. When the nematode invades an insect host the bacteria are released into the insect haemocoel (the open circulatory system), both the bacteria and the nematode undergo multiple rounds of replication which kills the insect host. Mapping of the insecticidal toxin loci and studies on knockout mutants in P. luminescens showed that deletion of either tca or tcd loci dramatically reduced toxicity, while the double mutant tca/tcd abolished toxicity []. However the biology of toxin action is unclear as is the species range of insects the toxins are active against. S. typhimurium contains a 90kb plasmid that is associated with virulence. This plasmid encodes at least 6 genes needed by the bacterium for invading host macrophages during infection. These include the 70kDa mkaA protein [], a recognised virulence factor, and more recently described, four spv genes under the control of a regulator []. Deletion studies on the virulence plasmid have shown that an open reading frame encoding a 28kDa protein was needed for successful invasion of the host. This protein, designated mkfA [], VRP4 [] or VirA [] by different groups, is utilised by the microbe upon entry into macrophages, although the exact mechanism is unclear.
Probab=51.74 E-value=17 Score=27.69 Aligned_cols=48 Identities=25% Similarity=0.380 Sum_probs=36.5
Q ss_pred cccchHHHHHHHhhcchhhHHHHHHHHHhhhhhcccccchhhHHHHHHH
Q psy9049 2 KVRLDAEKFQLINNTVLPEAVHFWEQALMVRKTEKNTVLPEAVHFWEQA 50 (64)
Q Consensus 2 ~~~l~~ek~~lv~~~llPeav~y~e~~l~VR~~v~~iLLpRac~y~~~~ 50 (64)
-++|++|.+.++.+.+.|.+-.||...|-+. ...-.-|...-.|-++|
T Consensus 273 ~~~l~p~q~~~~~e~l~~~~~~Fy~~nyG~~-~~~~~~l~~~~~F~~~T 320 (325)
T PF03538_consen 273 ASRLSPEQQAIITEPLTPSADDFYKTNYGVS-TDDSQPLEDLDVFCQQT 320 (325)
T ss_pred hccCCHHHHHHHhhhcchhHHHHHHHhcCCc-cchhhhHHHHHHHHHHh
Confidence 4789999999999999999999999988885 22223455555666655
No 14
>KOG2997|consensus
Probab=50.96 E-value=17 Score=28.60 Aligned_cols=50 Identities=20% Similarity=0.359 Sum_probs=39.8
Q ss_pred ccchHHHHHHHhhcchhhHHHHHHHHHhhhhhcccccchhhHHHHHHHHhhhh
Q psy9049 3 VRLDAEKFQLINNTVLPEAVHFWEQALMVRKTEKNTVLPEAVHFWEQALMVRK 55 (64)
Q Consensus 3 ~~l~~ek~~lv~~~llPeav~y~e~~l~VR~~v~~iLLpRac~y~~~~~~VR~ 55 (64)
|+|+++....=++-+..+|+..|++|...-+- =-+..|+.||.+|+.+=+
T Consensus 1 d~ls~~~~~~ekd~~~kkA~~l~~~av~~Eq~---G~l~dai~fYR~AlqI~~ 50 (366)
T KOG2997|consen 1 DQLSAMIVDYEKDPLAKKAIALYEKAVLKEQD---GSLYDAINFYRDALQIVP 50 (366)
T ss_pred CccchhhcccccchHHHHHHHHHHHHHHHhhc---CcHHHHHHHHHhhhcCCc
Confidence 57788877777888889999999999876543 335899999999987643
No 15
>COG4492 PheB ACT domain-containing protein [General function prediction only]
Probab=47.82 E-value=5.8 Score=27.78 Aligned_cols=30 Identities=30% Similarity=0.583 Sum_probs=23.8
Q ss_pred cccchHHHHHHHhhcchhhHHHHHHHHHhhhhh
Q psy9049 2 KVRLDAEKFQLINNTVLPEAVHFWEQALMVRKT 34 (64)
Q Consensus 2 ~~~l~~ek~~lv~~~llPeav~y~e~~l~VR~~ 34 (64)
++--+.+|+.||.+.+||+|+ |+++.|..+
T Consensus 2 ~~~~e~KkfyLvredvLp~s~---~ktlkvk~l 31 (150)
T COG4492 2 KKDTENKKFYLVREDVLPDSM---QKTLKVKDL 31 (150)
T ss_pred CCccccceEEEEehhhchHHH---HHHHHHHHH
Confidence 444567799999999999998 788877644
No 16
>PF13414 TPR_11: TPR repeat; PDB: 2HO1_B 2FI7_B 2DBA_A 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2PL2_B 3IEG_B 2FBN_A ....
Probab=45.61 E-value=22 Score=18.96 Aligned_cols=33 Identities=12% Similarity=0.257 Sum_probs=19.7
Q ss_pred HHhhcchhhHHHHHHHHHhhhhhcccccchhhH
Q psy9049 12 LINNTVLPEAVHFWEQALMVRKTEKNTVLPEAV 44 (64)
Q Consensus 12 lv~~~llPeav~y~e~~l~VR~~v~~iLLpRac 44 (64)
+....=+.+|+.++++++..-+.-......++.
T Consensus 13 ~~~~~~~~~A~~~~~~ai~~~p~~~~~~~~~g~ 45 (69)
T PF13414_consen 13 YFQQGDYEEAIEYFEKAIELDPNNAEAYYNLGL 45 (69)
T ss_dssp HHHTTHHHHHHHHHHHHHHHSTTHHHHHHHHHH
T ss_pred HHHcCCHHHHHHHHHHHHHcCCCCHHHHHHHHH
Confidence 344444678888888888776544443333333
No 17
>PF13374 TPR_10: Tetratricopeptide repeat; PDB: 3CEQ_B 3EDT_H 3NF1_A.
Probab=44.31 E-value=25 Score=16.71 Aligned_cols=19 Identities=32% Similarity=0.399 Sum_probs=12.4
Q ss_pred chhhHHHHHHHHHhhhhhc
Q psy9049 17 VLPEAVHFWEQALMVRKTE 35 (64)
Q Consensus 17 llPeav~y~e~~l~VR~~v 35 (64)
=+.+|..++++++..+.-+
T Consensus 17 ~~~~A~~~~~~al~~~~~~ 35 (42)
T PF13374_consen 17 RYEEALELLEEALEIRERL 35 (42)
T ss_dssp -HHHHHHHHHHHHHHH---
T ss_pred hcchhhHHHHHHHHHHHHH
Confidence 3578888888888877544
No 18
>PF07719 TPR_2: Tetratricopeptide repeat; InterPro: IPR013105 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. This repeat includes outlying Tetratricopeptide-like repeats (TPR) that are not matched by IPR001440 from INTERPRO.; PDB: 1XNF_B 3Q15_A 4ABN_A 1OUV_A 3U4T_A 3MA5_C 2KCV_A 2KCL_A 2XEV_A 3NF1_A ....
Probab=40.01 E-value=29 Score=16.03 Aligned_cols=18 Identities=28% Similarity=0.434 Sum_probs=12.4
Q ss_pred hhcchhhHHHHHHHHHhh
Q psy9049 14 NNTVLPEAVHFWEQALMV 31 (64)
Q Consensus 14 ~~~llPeav~y~e~~l~V 31 (64)
...=+.+|+.+|++++..
T Consensus 13 ~~~~~~~A~~~~~~al~l 30 (34)
T PF07719_consen 13 QLGNYEEAIEYFEKALEL 30 (34)
T ss_dssp HTT-HHHHHHHHHHHHHH
T ss_pred HhCCHHHHHHHHHHHHHH
Confidence 334467888888888765
No 19
>PF13181 TPR_8: Tetratricopeptide repeat; PDB: 3GW4_B 3MA5_C 2KCV_A 2KCL_A 3FP3_A 3LCA_A 3FP4_A 3FP2_A 1W3B_B 1ELW_A ....
Probab=36.10 E-value=42 Score=15.63 Aligned_cols=16 Identities=31% Similarity=0.571 Sum_probs=11.8
Q ss_pred hhhHHHHHHHHHhhhh
Q psy9049 18 LPEAVHFWEQALMVRK 33 (64)
Q Consensus 18 lPeav~y~e~~l~VR~ 33 (64)
+.+|+.+|++++...+
T Consensus 17 ~~~A~~~~~~a~~~~~ 32 (34)
T PF13181_consen 17 YEEALEYFEKALELNP 32 (34)
T ss_dssp HHHHHHHHHHHHHHHT
T ss_pred HHHHHHHHHHHHhhCC
Confidence 4678888888877644
No 20
>PF04761 Phage_Treg: Lactococcus bacteriophage putative transcription regulator; InterPro: IPR006848 This family represents a number of putative transcription repressor proteins found in several Lactococcus bacteriophages. Horizontal transfer may account for the presence of similar proteins in Lactococcus species [].
Probab=32.84 E-value=42 Score=20.02 Aligned_cols=16 Identities=19% Similarity=0.619 Sum_probs=8.0
Q ss_pred chHHHHHHHhhcchhh
Q psy9049 5 LDAEKFQLINNTVLPE 20 (64)
Q Consensus 5 l~~ek~~lv~~~llPe 20 (64)
|+-.-.+++.|.|+||
T Consensus 26 l~~~s~~flq~slipq 41 (57)
T PF04761_consen 26 LSVDSKDFLQNSLIPQ 41 (57)
T ss_pred HHHHHHHHHHHhhHHH
Confidence 3334445555555555
No 21
>PRK04435 hypothetical protein; Provisional
Probab=32.53 E-value=13 Score=24.60 Aligned_cols=25 Identities=44% Similarity=0.675 Sum_probs=19.9
Q ss_pred HHHHHHhhcchhhHHHHHHHHHhhhhhc
Q psy9049 8 EKFQLINNTVLPEAVHFWEQALMVRKTE 35 (64)
Q Consensus 8 ek~~lv~~~llPeav~y~e~~l~VR~~v 35 (64)
.|+.+|.+.+|||++ ++++.++.+.
T Consensus 5 ~~~~~v~~~~lp~~~---~k~~~~k~ll 29 (147)
T PRK04435 5 KKFYLVREDVLPEAV---EKTLKAKELL 29 (147)
T ss_pred CeEEEEecccChHHH---HHHHHHHHHH
Confidence 477899999999986 7888777554
No 22
>cd03074 PDI_b'_Calsequestrin_C Protein Disulfide Isomerase (PDIb') family, Calsequestrin subfamily, C-terminal TRX-fold domain; Calsequestrin is the major calcium storage protein in the sarcoplasmic reticulum (SR) of skeletal and cardiac muscle. It stores calcium ions in sufficient quantities (up to 20 mM) to allow repetitive contractions and is essential to maintain movement, respiration and heart beat. A missense mutation in human cardiac calsequestrin is associated with catecholamine-induced polymorphic ventricular tachycardia (CPVT), a rare disease characterized by seizures or sudden death in response to physiologic or emotional stress. Calsequestrin is a highly acidic protein with up to 50 calcium binding sites formed simply by the clustering of two or more acidic residues. The monomer contains three redox inactive TRX-fold domains. Calsequestrin is condensed as a linear polymer in the SR lumen and is membrane-anchored through binding with intra-membrane proteins triadin, junctin
Probab=32.31 E-value=22 Score=24.17 Aligned_cols=31 Identities=23% Similarity=0.492 Sum_probs=20.6
Q ss_pred chhhHHHHHHHHHhh---hhhcccccchhhHHHH
Q psy9049 17 VLPEAVHFWEQALMV---RKTEKNTVLPEAVHFW 47 (64)
Q Consensus 17 llPeav~y~e~~l~V---R~~v~~iLLpRac~y~ 47 (64)
=||--+.|||++|.+ ++.+|-+-...|-+.|
T Consensus 64 ~FPllv~yWektF~IDl~~PqIGVV~vtdadSvW 97 (120)
T cd03074 64 DFPLLVPYWEKTFGIDLFRPQIGVVNVTDADSVW 97 (120)
T ss_pred cCchhhHHHHhhcCcccCCCceeeEeccccccee
Confidence 478999999999976 3555544444444444
No 23
>PF00956 NAP: Nucleosome assembly protein (NAP); InterPro: IPR002164 It is thought that NAPs act as histone chaperones, shuttling both core and linker histones from their site of synthesis in the cytoplasm to the nucleus. The proteins may be involved in regulating gene expression and therefore cellular differentiation [, ]. The centrosomal protein c-Nap1, also known as Cep250, has been implicated in the cell-cycle-regulated cohesion of microtubule-organizing centres. This 281 kDa protein consists mainly of domains predicted to form coiled coil structures. The C-terminal region defines a novel histone-binding domain that is responsible for targeting CNAP1, and possibly condensin, to mitotic chromosomes []. During interphase, C-Nap1 localizes to the proximal ends of both parental centrioles, but it dissociates from these structures at the onset of mitosis. Re-association with centrioles then occurs in late telophase or at the very beginning of G1 phase, when daughter cells are still connected by post-mitotic bridges. Electron microscopic studies performed on isolated centrosomes suggest that a proteinaceous linker connects parental centrioles and C-Nap1 may be part of a linker structure that assures the cohesion of duplicated centrosomes during interphase, but that is dismantled upon centrosome separation at the onset of mitosis []. ; GO: 0006334 nucleosome assembly, 0005634 nucleus; PDB: 2E50_Q 2Z2R_A 2AYU_A 3Q66_A 3C9B_A 3Q68_B 3Q33_B 2ZD7_B 3DM7_A 3C9D_A ....
Probab=31.42 E-value=31 Score=24.19 Aligned_cols=18 Identities=22% Similarity=0.632 Sum_probs=15.4
Q ss_pred HHHHHhhcchhhHHHHHH
Q psy9049 9 KFQLINNTVLPEAVHFWE 26 (64)
Q Consensus 9 k~~lv~~~llPeav~y~e 26 (64)
--..|++.|.|.|+.||-
T Consensus 224 i~~~i~d~i~P~av~yy~ 241 (244)
T PF00956_consen 224 IGEIIKDDIIPNAVKYYT 241 (244)
T ss_dssp HHHHHHHTCCCHHHHHHH
T ss_pred HHHHHHhheechHHHHhC
Confidence 447899999999999984
No 24
>PF02071 NSF: Aromatic-di-Alanine (AdAR) repeat ; InterPro: IPR000744 Regulated exocytosis of neurotransmitters and hormones, as well as intracellular traffic, requires fusion of two lipid bilayers. SNARE proteins are thought to form a protein bridge, the SNARE complex, between an incoming vesicle and the acceptor compartment. SNARE proteins contribute to the specificity of membrane fusion, implying that the mechanisms by which SNAREs are targeted to subcellular compartments are important for specific docking and fusion of vesicles. This mechanism involves a family of conserved proteins, members of which appear to function at all sites of constitutive and regulated secretion in eukaryotes []. Among them are 2 types of cytosolic protein, NSF (N-ethyl-maleimide-sensitive protein) and the SNAPs (alpha-, beta- and gamma-soluble NSF attachment proteins). The yeast vesicular fusion protein, sec17, a cytoplasmic peripheral membrane protein involved in vesicular transport between the endoplasmic reticulum and the golgi apparatus, shows a high degree of sequence similarity to the alpha-SNAP family. SNAP-25 and its non-neuronal homologue Syndet/SNAP-23 are synthesized as soluble proteins in the cytosol. Both SNAP-25 and Syndet/SNAP-23 are palmitoylated at cysteine residues clustered in a loop between two N- and C-terminal coils and palmitoylation is essential for membrane binding and plasma membrane targeting. The C-terminal and the N-terminal helices of SNAP-25, are each targeted to the plasma membrane by two distinct cysteine-rich domains and appear to regulate the availability of SNAP to form complexes with SNARE [].; GO: 0006886 intracellular protein transport
Probab=31.31 E-value=26 Score=14.86 Aligned_cols=8 Identities=38% Similarity=0.704 Sum_probs=4.1
Q ss_pred HHHHHHHH
Q psy9049 21 AVHFWEQA 28 (64)
Q Consensus 21 av~y~e~~ 28 (64)
|+.+|++|
T Consensus 1 A~~~y~~A 8 (12)
T PF02071_consen 1 AIKCYEKA 8 (12)
T ss_pred CcHHHHHH
Confidence 34555554
No 25
>PHA00687 hypothetical protein
Probab=31.11 E-value=28 Score=20.58 Aligned_cols=21 Identities=33% Similarity=0.401 Sum_probs=17.6
Q ss_pred HHHHhhcchhhHHHHHHHHHh
Q psy9049 10 FQLINNTVLPEAVHFWEQALM 30 (64)
Q Consensus 10 ~~lv~~~llPeav~y~e~~l~ 30 (64)
++-|..+|-|||...+++|-+
T Consensus 4 rqpvqttlppeamrllqqaaq 24 (56)
T PHA00687 4 RQPVQTTLPPEAMRLLQQAAQ 24 (56)
T ss_pred cCcccccCCHHHHHHHHHHhc
Confidence 466888999999999998865
No 26
>KOG1173|consensus
Probab=30.40 E-value=72 Score=26.80 Aligned_cols=40 Identities=30% Similarity=0.547 Sum_probs=26.2
Q ss_pred cchhhHHHHHHHHHhhhhhccccc-------------------chhhHHHHHHHHhhhh
Q psy9049 16 TVLPEAVHFWEQALMVRKTEKNTV-------------------LPEAVHFWEQALMVRK 55 (64)
Q Consensus 16 ~llPeav~y~e~~l~VR~~v~~iL-------------------LpRac~y~~~~~~VR~ 55 (64)
+-+|+|+.|+++|+.+=+.+.+.. +..|+.||++++..-+
T Consensus 428 ~~y~~A~~~f~~~l~~ik~~~~e~~~w~p~~~NLGH~~Rkl~~~~eAI~~~q~aL~l~~ 486 (611)
T KOG1173|consen 428 EEYPEALKYFQKALEVIKSVLNEKIFWEPTLNNLGHAYRKLNKYEEAIDYYQKALLLSP 486 (611)
T ss_pred hhhHHHHHHHHHHHHHhhhccccccchhHHHHhHHHHHHHHhhHHHHHHHHHHHHHcCC
Confidence 446888888888885554444432 4567777777776544
No 27
>PF14559 TPR_19: Tetratricopeptide repeat; PDB: 2R5S_A 3QDN_B 3QOU_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 3FP3_A 3LCA_A ....
Probab=29.22 E-value=79 Score=16.62 Aligned_cols=30 Identities=23% Similarity=0.190 Sum_probs=19.7
Q ss_pred hhcchhhHHHHHHHHHhhhhhcccccchhh
Q psy9049 14 NNTVLPEAVHFWEQALMVRKTEKNTVLPEA 43 (64)
Q Consensus 14 ~~~llPeav~y~e~~l~VR~~v~~iLLpRa 43 (64)
.+.=+.+|+.++++++..-+--..+.+.-+
T Consensus 3 ~~~~~~~A~~~~~~~l~~~p~~~~~~~~la 32 (68)
T PF14559_consen 3 KQGDYDEAIELLEKALQRNPDNPEARLLLA 32 (68)
T ss_dssp HTTHHHHHHHHHHHHHHHTTTSHHHHHHHH
T ss_pred hccCHHHHHHHHHHHHHHCCCCHHHHHHHH
Confidence 334467888888888887766555554333
No 28
>cd06588 PhnB_like Escherichia coli PhnB and similar proteins; the E. coli phnB gene is found next to an operon involved in the cleavage of carbon-phosphorus bonds in unactivated alkylphosphonates. The Escherichia coli phnB gene is found next to an operon of fourteen genes (phnC-to-phnP) related to the cleavage of carbon-phosphorus (C-P) bonds in unactivated alkylphosphonates, supporting bacterial growth on alkylphosphonates as the sole phosphorus source. It was originally considered part of that operon. PhnB appears to play no direct catalytic role in the usage of alkylphosphonate. Although many of the proteins in this family have been annotated as 3-demethylubiquinone-9 3-methyltransferase enzymes by automatic annotation programs, the experimental evidence for this assignment is lacking. In Escherichia coli, the gene coding 3-demethylubiquinone-9 3-methyltransferase enzyme is ubiG, which belongs to the AdoMet-MTase protein family. PhnB-like proteins adopt a structural fold similar to
Probab=29.17 E-value=38 Score=20.63 Aligned_cols=13 Identities=23% Similarity=0.465 Sum_probs=10.7
Q ss_pred hhhHHHHHHHHHh
Q psy9049 18 LPEAVHFWEQALM 30 (64)
Q Consensus 18 lPeav~y~e~~l~ 30 (64)
..+|+.||+++|-
T Consensus 11 ~~eAi~FY~~~fg 23 (128)
T cd06588 11 AEEALEFYQSVFG 23 (128)
T ss_pred HHHHHHHHHHHhC
Confidence 3689999999884
No 29
>KOG1717|consensus
Probab=28.53 E-value=33 Score=26.76 Aligned_cols=18 Identities=28% Similarity=0.492 Sum_probs=15.1
Q ss_pred hcchhhHHHHHHHHHhhh
Q psy9049 15 NTVLPEAVHFWEQALMVR 32 (64)
Q Consensus 15 ~~llPeav~y~e~~l~VR 32 (64)
...|||||+|-.+|.++-
T Consensus 234 s~ffpEAIsfIdeArsk~ 251 (343)
T KOG1717|consen 234 SQFFPEAISFIDEARSKN 251 (343)
T ss_pred hhhhHHHHHHHHHhhccC
Confidence 468999999999998754
No 30
>PF13013 F-box-like_2: F-box-like domain
Probab=27.58 E-value=76 Score=20.59 Aligned_cols=30 Identities=17% Similarity=0.287 Sum_probs=15.6
Q ss_pred HHHHHHHhhcchhhHHHHHHHHHhhhhhcc
Q psy9049 7 AEKFQLINNTVLPEAVHFWEQALMVRKTEK 36 (64)
Q Consensus 7 ~ek~~lv~~~llPeav~y~e~~l~VR~~v~ 36 (64)
++....+++-+=|-++.||.+.++++..--
T Consensus 66 p~~c~~lr~lls~~s~~~w~~~l~~~~~~~ 95 (109)
T PF13013_consen 66 PQSCTRLRSLLSWTSKMFWKSFLRVRDDYR 95 (109)
T ss_pred HHHHHHHHHHcCcccHHHHHHHHHHHhccH
Confidence 344444554444445666666666655443
No 31
>PTZ00008 (NAP-S) nucleosome assembly protein-S; Provisional
Probab=27.39 E-value=33 Score=23.93 Aligned_cols=20 Identities=5% Similarity=0.219 Sum_probs=16.6
Q ss_pred HHHHHHHhhcchhhHHHHHH
Q psy9049 7 AEKFQLINNTVLPEAVHFWE 26 (64)
Q Consensus 7 ~ek~~lv~~~llPeav~y~e 26 (64)
-|--++|++.|+|.|+.|+.
T Consensus 158 ~eIg~~i~e~i~P~av~yy~ 177 (185)
T PTZ00008 158 PDVGEIIRREIWHAPLLYYL 177 (185)
T ss_pred HHHHHHHHHhhccchHHhhC
Confidence 34567899999999999985
No 32
>PTZ00007 (NAP-L) nucleosome assembly protein -L; Provisional
Probab=25.44 E-value=33 Score=26.28 Aligned_cols=18 Identities=22% Similarity=0.253 Sum_probs=15.2
Q ss_pred HHHHHhhcchhhHHHHHH
Q psy9049 9 KFQLINNTVLPEAVHFWE 26 (64)
Q Consensus 9 k~~lv~~~llPeav~y~e 26 (64)
.=..+++.|+|.||.||.
T Consensus 259 iG~~ikd~IIP~AV~yft 276 (337)
T PTZ00007 259 IGITIRDKLIPYAVYWFL 276 (337)
T ss_pred HHHHHHHhcccccHHhhC
Confidence 446799999999999984
No 33
>PF09533 DUF2380: Predicted lipoprotein of unknown function (DUF2380); InterPro: IPR011755 This family consists of at least 9 paralogs in Myxococcus xanthus, a member of the Deltaproteobacteria. One appears truncated toward the N terminus; the others are predicted lipoproteins. The function is unknown.
Probab=23.33 E-value=34 Score=24.74 Aligned_cols=38 Identities=21% Similarity=0.487 Sum_probs=23.0
Q ss_pred cchHHHHHHHhhcchhhH---HHHHH-HHHhhhhhcccccchhhHH
Q psy9049 4 RLDAEKFQLINNTVLPEA---VHFWE-QALMVRKTEKNTVLPEAVH 45 (64)
Q Consensus 4 ~l~~ek~~lv~~~llPea---v~y~e-~~l~VR~~v~~iLLpRac~ 45 (64)
+++++ .+++..||||| -.||+ +-+-+...+. .+|+.++
T Consensus 92 ~~~~g--~~ekHHIFPQa~~la~wF~~~Gi~IHd~ti--~Ip~~vH 133 (188)
T PF09533_consen 92 RLPPG--RWEKHHIFPQAEELAEWFERRGIDIHDYTI--PIPRDVH 133 (188)
T ss_pred CCCCC--cchhhccCCCcHHHHHHHHHcCCChhheeE--ecCHHHH
Confidence 34444 56999999996 34444 4444445444 4488775
No 34
>PF10579 Rapsyn_N: Rapsyn N-terminal myristoylation and linker region; InterPro: IPR019568 Neuromuscular junction formation relies upon the clustering of acetylcholine receptors and other proteins in the muscle membrane. Rapsyn is a peripheral membrane protein that is selectively concentrated at the neuromuscular junction and is essential for the formation of synaptic acetylcholine receptor aggregates. Acetylcholine receptors fail to aggregate beneath nerve terminals in mice where rapsyn has been knocked out. The N-terminal six amino acids of rapsyn are its myristoylation site, and myristoylation is necessary for the targeting of the protein to the membrane []. ; GO: 0008270 zinc ion binding, 0033130 acetylcholine receptor binding, 0007268 synaptic transmission, 0005856 cytoskeleton, 0030054 cell junction, 0045211 postsynaptic membrane
Probab=22.97 E-value=1.1e+02 Score=19.24 Aligned_cols=32 Identities=22% Similarity=0.293 Sum_probs=20.3
Q ss_pred hhHHHHHHHHHhhhhhcccccchhhHHHHHHHHh
Q psy9049 19 PEAVHFWEQALMVRKTEKNTVLPEAVHFWEQALM 52 (64)
Q Consensus 19 Peav~y~e~~l~VR~~v~~iLLpRac~y~~~~~~ 52 (64)
-+|++-|.+||.+..--.. -=++..|+.+|++
T Consensus 23 ~~Al~~W~~aL~k~~~~~~--rf~~lG~l~qA~~ 54 (80)
T PF10579_consen 23 QQALQKWRKALEKITDRED--RFRVLGYLIQAHM 54 (80)
T ss_pred HHHHHHHHHHHhhcCChHH--HHHHHHHHHHHHH
Confidence 5799999999987644221 2345566666654
No 35
>PF01216 Calsequestrin: Calsequestrin; InterPro: IPR001393 Calsequestrin is the principal calcium-binding protein present in the sarcoplasmic reticulum of cardiac and skeletal muscle []. It is a highly acidic protein that is able to bind over 40 calcium ions and acts as an internal calcium store in muscle. Sequence analysis has suggested that calcium is not bound in distinct pockets via EF-hand motifs, but rather via presentation of a charged protein surface. Two forms of calsequestrin have been identified. The cardiac form is present in cardiac and slow skeletal muscle and the fast skeletal form is found in fast skeletal muscle. The release of calsequestrin-bound calcium (through a a calcium release channel) triggers muscle contraction. The active protein is not highly structured, more than 50% of it adopting a random coil conformation []. When calcium binds there is a structural change whereby the alpha-helical content of the protein increases from 3 to 11% []. Both forms of calsequestrin are phosphorylated by casein kinase II, but the cardiac form is phosphorylated more rapidly and to a higher degree [].; GO: 0005509 calcium ion binding; PDB: 1SJI_B 2VAF_A 3UOM_C 1A8Y_A 3TRQ_A 3TRP_A 3US3_A.
Probab=22.38 E-value=58 Score=25.88 Aligned_cols=15 Identities=27% Similarity=0.702 Sum_probs=12.0
Q ss_pred chhhHHHHHHHHHhh
Q psy9049 17 VLPEAVHFWEQALMV 31 (64)
Q Consensus 17 llPeav~y~e~~l~V 31 (64)
=||-.+.|||++|.+
T Consensus 312 ~fPllv~yWE~tF~I 326 (383)
T PF01216_consen 312 DFPLLVPYWEKTFGI 326 (383)
T ss_dssp G-HHHHHHHHHHHTT
T ss_pred CCchhHHHHHhhcCc
Confidence 379999999999865
No 36
>smart00671 SEL1 Sel1-like repeats. These represent a subfamily of TPR (tetratricopeptide repeat) sequences.
Probab=21.87 E-value=74 Score=14.79 Aligned_cols=11 Identities=18% Similarity=0.558 Sum_probs=8.2
Q ss_pred hhHHHHHHHHH
Q psy9049 19 PEAVHFWEQAL 29 (64)
Q Consensus 19 Peav~y~e~~l 29 (64)
.+|+.||++|-
T Consensus 22 ~~A~~~~~~Aa 32 (36)
T smart00671 22 EKALEYYKKAA 32 (36)
T ss_pred HHHHHHHHHHH
Confidence 57888888764
No 37
>PF10373 EST1_DNA_bind: Est1 DNA/RNA binding domain; InterPro: IPR018834 Est1 is a protein which recruits or activates telomerase at the site of polymerisation [, ]. This is the DNA/RNA binding domain of EST1 []. ; PDB: 1YA0_B.
Probab=20.71 E-value=1.2e+02 Score=20.30 Aligned_cols=36 Identities=22% Similarity=0.330 Sum_probs=18.9
Q ss_pred HHHHHHHHHhhhhhcccc------------cchhhHHHHHHHHhhhhc
Q psy9049 21 AVHFWEQALMVRKTEKNT------------VLPEAVHFWEQALMVRKT 56 (64)
Q Consensus 21 av~y~e~~l~VR~~v~~i------------LLpRac~y~~~~~~VR~~ 56 (64)
|.+||.+|..+-+..|.- --=+|+-||.|++.++.+
T Consensus 1 A~~~Y~~A~~l~P~~G~p~nQLAvl~~~~~~~l~avy~y~Rsl~~~~P 48 (278)
T PF10373_consen 1 AERYYRKAIRLLPSNGNPYNQLAVLASYQGDDLDAVYYYIRSLAVRIP 48 (278)
T ss_dssp HHHHHHHHHHH-TTBSHHHHHHHHHHHHTT-HHHHHHHHHHHHSSSB-
T ss_pred CHHHHHHHHHhCCCCCCcccchhhhhccccchHHHHHHHHHHHhcCCC
Confidence 455666666665554432 112556677777766654
No 38
>PF05794 Tcp11: T-complex protein 11; InterPro: IPR008862 This family consists of several eukaryotic T-complex protein 11 (Tcp11) related sequences. Tcp11 is only expressed in fertile adult mammalian testes and is thought to be important in sperm function and fertility. The family also contains the Saccharomyces cerevisiae Sok1 protein which is known to suppress cyclic AMP-dependent protein kinase mutants [].
Probab=20.62 E-value=1.9e+02 Score=21.59 Aligned_cols=45 Identities=11% Similarity=0.189 Sum_probs=38.1
Q ss_pred HHHhhcchhhHHHHHHHHHhhhhhcccccchhhHHHHHHHHhhhh
Q psy9049 11 QLINNTVLPEAVHFWEQALMVRKTEKNTVLPEAVHFWEQALMVRK 55 (64)
Q Consensus 11 ~lv~~~llPeav~y~e~~l~VR~~v~~iLLpRac~y~~~~~~VR~ 55 (64)
+...-.|...||.|-.+.|+-+--.++.-|+.....|.++..--.
T Consensus 163 ~~~rp~L~~~sv~yEr~~F~~~l~~~~~~l~~T~~Wl~~~~~~~~ 207 (441)
T PF05794_consen 163 RSLRPQLIEHSVEYERKKFQERLEKGPNSLPRTKAWLKRARSELE 207 (441)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhCcCChHHHHHHHHHHHHHHh
Confidence 344556788899999999999999999999999999999876543
Done!