Query 008936
Match_columns 548
No_of_seqs 46 out of 48
Neff 2.8
Searched_HMMs 46136
Date Thu Mar 28 18:24:37 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/008936.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/008936hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PRK11637 AmiB activator; Provi 78.1 39 0.00084 35.8 12.7 78 54-131 45-135 (428)
2 PF10186 Atg14: UV radiation r 69.7 61 0.0013 31.4 10.9 90 57-147 28-125 (302)
3 PLN03144 Carbon catabolite rep 69.6 4.7 0.0001 45.6 3.8 35 349-383 171-205 (606)
4 PF14988 DUF4515: Domain of un 68.8 23 0.00049 35.0 7.8 76 59-134 81-174 (206)
5 PF12128 DUF3584: Protein of u 64.5 1.2E+02 0.0026 36.8 14.1 106 57-163 789-894 (1201)
6 PF04012 PspA_IM30: PspA/IM30 64.2 37 0.00081 32.5 8.2 117 44-191 19-139 (221)
7 PRK09039 hypothetical protein; 60.0 68 0.0015 33.8 9.8 122 58-191 48-171 (343)
8 COG1704 LemA Uncharacterized c 56.7 23 0.00051 35.1 5.5 47 133-191 91-138 (185)
9 TIGR03185 DNA_S_dndD DNA sulfu 55.7 57 0.0012 36.6 8.9 59 51-109 393-453 (650)
10 PRK10698 phage shock protein P 55.6 87 0.0019 31.1 9.3 76 45-125 21-101 (222)
11 TIGR01005 eps_transp_fam exopo 54.4 1.1E+02 0.0024 34.7 10.9 109 62-170 200-329 (754)
12 KOG0977 Nuclear envelope prote 52.1 79 0.0017 35.9 9.3 40 62-101 41-80 (546)
13 PRK09239 chorismate mutase; Pr 49.3 2E+02 0.0042 25.8 9.9 94 88-191 7-100 (104)
14 PRK14153 heat shock protein Gr 47.9 2E+02 0.0042 28.7 10.2 65 139-211 82-150 (194)
15 TIGR02977 phageshock_pspA phag 45.8 1.9E+02 0.0042 28.3 9.9 54 44-106 20-74 (219)
16 COG2900 SlyX Uncharacterized p 43.8 48 0.001 28.8 4.7 37 86-125 3-41 (72)
17 PF15294 Leu_zip: Leucine zipp 42.1 2.6E+02 0.0056 29.5 10.5 41 58-98 134-174 (278)
18 KOG0978 E3 ubiquitin ligase in 41.4 2.3E+02 0.0049 33.4 10.9 109 80-191 499-621 (698)
19 KOG0250 DNA repair protein RAD 40.7 3.3E+02 0.0072 33.6 12.4 139 49-191 734-891 (1074)
20 COG1579 Zn-ribbon protein, pos 40.3 1.6E+02 0.0035 30.2 8.6 92 61-161 36-128 (239)
21 PRK14148 heat shock protein Gr 39.9 3.1E+02 0.0068 27.3 10.3 64 139-210 89-156 (195)
22 PF11464 Rbsn: Rabenosyn Rab b 39.0 69 0.0015 25.2 4.5 40 63-104 3-42 (42)
23 KOG3091 Nuclear pore complex, 39.0 75 0.0016 35.9 6.5 80 47-128 346-446 (508)
24 PRK14154 heat shock protein Gr 37.7 3.4E+02 0.0075 27.4 10.3 152 19-211 11-171 (208)
25 COG4942 Membrane-bound metallo 37.7 5.7E+02 0.012 28.6 12.7 93 58-158 40-132 (420)
26 PRK14140 heat shock protein Gr 37.0 4.1E+02 0.0089 26.4 10.6 65 139-211 86-154 (191)
27 PF10186 Atg14: UV radiation r 36.6 3.7E+02 0.008 26.1 10.2 51 57-107 57-107 (302)
28 PF01895 PhoU: PhoU domain; I 36.6 1.2E+02 0.0027 23.5 5.8 50 52-101 24-73 (88)
29 KOG1222 Kinesin associated pro 36.5 1.4E+02 0.0031 34.4 8.2 128 23-178 231-359 (791)
30 PRK10361 DNA recombination pro 36.2 3.6E+02 0.0078 30.4 11.1 57 59-116 56-112 (475)
31 TIGR03185 DNA_S_dndD DNA sulfu 35.9 1.3E+02 0.0027 33.9 7.8 23 128-150 274-296 (650)
32 PLN02939 transferase, transfer 35.4 2.5E+02 0.0055 34.2 10.4 100 50-153 129-280 (977)
33 TIGR01076 sortase_fam LPXTG-si 34.6 39 0.00085 30.5 3.0 43 322-365 42-85 (136)
34 PRK14154 heat shock protein Gr 33.0 2.2E+02 0.0048 28.7 8.2 26 47-72 50-75 (208)
35 PRK14162 heat shock protein Gr 32.9 4.7E+02 0.01 26.1 10.3 64 140-211 89-156 (194)
36 PF04156 IncA: IncA protein; 32.7 2.4E+02 0.0052 26.3 8.0 27 162-188 156-182 (191)
37 PRK02119 hypothetical protein; 32.6 98 0.0021 26.2 4.9 37 84-123 2-40 (73)
38 PRK02793 phi X174 lysis protei 32.0 1E+02 0.0022 25.9 4.9 33 88-123 5-39 (72)
39 TIGR01795 CM_mono_cladeE monof 31.5 3.6E+02 0.0077 23.7 8.3 84 91-187 3-89 (94)
40 TIGR02169 SMC_prok_A chromosom 31.2 4.1E+02 0.0089 30.8 10.9 18 398-415 605-622 (1164)
41 PRK11020 hypothetical protein; 30.5 77 0.0017 29.7 4.2 40 67-109 16-60 (118)
42 PRK14139 heat shock protein Gr 30.1 4.1E+02 0.0088 26.3 9.3 64 140-211 82-147 (185)
43 cd05827 Sortase_C_3 Sortase C 29.4 53 0.0011 29.3 2.9 44 321-365 44-88 (131)
44 COG1196 Smc Chromosome segrega 29.4 4E+02 0.0088 32.3 10.9 61 59-119 372-432 (1163)
45 PF11932 DUF3450: Protein of u 29.2 5.3E+02 0.011 25.6 10.1 84 68-152 40-127 (251)
46 PF14048 MBD_C: C-terminal dom 29.2 40 0.00087 30.1 2.1 20 56-75 76-95 (96)
47 PF01025 GrpE: GrpE; InterPro 28.5 2.3E+02 0.0051 25.9 7.0 53 137-191 58-114 (165)
48 PF14389 Lzipper-MIP1: Leucine 27.9 83 0.0018 27.3 3.8 36 43-78 48-83 (88)
49 PRK14146 heat shock protein Gr 27.6 6.4E+02 0.014 25.5 10.4 64 140-211 104-171 (215)
50 PRK06342 transcription elongat 27.6 80 0.0017 30.2 4.0 42 51-97 32-77 (160)
51 PF00038 Filament: Intermediat 27.4 3.8E+02 0.0082 26.8 8.8 50 58-107 211-260 (312)
52 PRK00295 hypothetical protein; 27.2 1.4E+02 0.0031 24.9 4.9 31 89-122 3-35 (68)
53 PF07989 Microtub_assoc: Micro 26.8 1.5E+02 0.0032 25.3 5.0 63 58-144 2-64 (75)
54 PF04849 HAP1_N: HAP1 N-termin 26.6 1.4E+02 0.003 31.8 5.8 48 59-106 230-277 (306)
55 PRK14149 heat shock protein Gr 26.3 3.8E+02 0.0082 26.7 8.4 64 139-211 85-152 (191)
56 PF04012 PspA_IM30: PspA/IM30 25.7 1.2E+02 0.0026 29.1 4.9 48 51-98 86-133 (221)
57 PF09726 Macoilin: Transmembra 25.6 1.8E+02 0.0039 33.8 7.0 67 54-127 543-611 (697)
58 PRK11281 hypothetical protein; 25.6 1.9E+02 0.0041 35.6 7.3 61 49-110 66-147 (1113)
59 PRK14151 heat shock protein Gr 25.5 6.3E+02 0.014 24.7 9.6 64 140-211 70-138 (176)
60 PRK11281 hypothetical protein; 25.4 3.4E+02 0.0075 33.4 9.4 71 88-158 357-459 (1113)
61 COG1579 Zn-ribbon protein, pos 25.2 3.5E+02 0.0077 27.9 8.2 47 56-102 59-114 (239)
62 PRK04406 hypothetical protein; 24.7 1.7E+02 0.0037 25.0 5.0 32 88-122 8-41 (75)
63 PRK14160 heat shock protein Gr 23.6 6.2E+02 0.013 25.6 9.4 63 139-211 110-174 (211)
64 PLN02524 S-adenosylmethionine 23.4 1.6E+02 0.0035 31.9 5.7 87 324-448 228-315 (355)
65 KOG3083 Prohibitin [Posttransl 22.9 77 0.0017 33.0 3.1 41 103-143 141-199 (271)
66 PLN02542 fructose-1,6-bisphosp 22.9 1E+02 0.0022 33.9 4.2 80 319-416 234-314 (412)
67 PF08838 DUF1811: Protein of u 22.4 87 0.0019 28.7 3.0 33 60-94 11-43 (102)
68 PRK14141 heat shock protein Gr 22.1 7.8E+02 0.017 24.9 9.8 64 139-210 80-153 (209)
69 PF07926 TPR_MLP1_2: TPR/MLP1/ 22.0 5E+02 0.011 23.6 7.8 67 62-128 58-124 (132)
70 KOG3990 Uncharacterized conser 21.9 2.7E+02 0.0059 29.6 6.7 29 46-74 215-243 (305)
71 PF13600 DUF4140: N-terminal d 21.5 1.3E+02 0.0027 25.8 3.7 36 61-96 68-103 (104)
72 KOG3620 Uncharacterized conser 21.4 1.7E+02 0.0037 36.7 5.8 75 413-489 40-131 (1626)
73 cd05829 Sortase_E Sortase E (S 21.3 1.9E+02 0.0042 26.5 5.1 51 316-366 42-96 (144)
74 TIGR02168 SMC_prok_B chromosom 21.3 1.1E+03 0.023 27.4 11.8 8 310-317 1071-1078(1179)
75 PRK09293 fructose-1,6-bisphosp 21.3 1.1E+02 0.0024 32.3 3.9 76 319-413 145-221 (327)
76 PRK14143 heat shock protein Gr 21.2 8.9E+02 0.019 24.9 10.1 64 139-210 116-184 (238)
77 PRK02224 chromosome segregatio 21.0 8.3E+02 0.018 28.2 10.9 35 475-509 767-801 (880)
78 PF13874 Nup54: Nucleoporin co 20.8 4.3E+02 0.0092 24.4 7.2 79 52-140 33-113 (141)
79 PF06034 DUF919: Nucleopolyhed 20.7 1.9E+02 0.0042 24.4 4.4 43 58-100 6-54 (62)
80 PRK04325 hypothetical protein; 20.7 2.2E+02 0.0048 24.1 4.9 32 89-123 7-40 (74)
81 PF04201 TPD52: Tumour protein 20.6 2.1E+02 0.0045 28.2 5.3 51 44-108 24-79 (162)
82 PF11932 DUF3450: Protein of u 20.6 5E+02 0.011 25.8 8.1 38 135-180 125-162 (251)
83 KOG2896 UV radiation resistanc 20.6 3.5E+02 0.0075 29.9 7.4 44 92-135 142-192 (377)
84 PF00038 Filament: Intermediat 20.2 3.6E+02 0.0079 27.0 7.1 43 62-104 53-95 (312)
85 PF04011 LemA: LemA family; I 20.1 5.2E+02 0.011 24.5 7.8 48 132-191 90-138 (186)
No 1
>PRK11637 AmiB activator; Provisional
Probab=78.13 E-value=39 Score=35.81 Aligned_cols=78 Identities=15% Similarity=0.159 Sum_probs=47.1
Q ss_pred HHHHhhccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHHHHhHHhhHH-------------HHHHHHHHHHhhh
Q 008936 54 LYSRARMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMAIDEKQNEA-------------ITSALNELARRKG 120 (548)
Q Consensus 54 L~~k~r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRmAfDqqQqdl-------------VdAASKaLSyRQd 120 (548)
+-.+.++.+++|.++.+.|.+..-+..++.++...|+++|+.+.-.++..|+++ |+.+-+.|..+++
T Consensus 45 ~~~~l~~l~~qi~~~~~~i~~~~~~~~~~~~~l~~l~~qi~~~~~~i~~~~~~i~~~~~ei~~l~~eI~~~q~~l~~~~~ 124 (428)
T PRK11637 45 NRDQLKSIQQDIAAKEKSVRQQQQQRASLLAQLKKQEEAISQASRKLRETQNTLNQLNKQIDELNASIAKLEQQQAAQER 124 (428)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334555666666666666666666666677777777777766666665555443 2333445555556
Q ss_pred hHhhhhhhhhh
Q 008936 121 VLEENLKLAHD 131 (548)
Q Consensus 121 iiEENiRLtya 131 (548)
.+.+=+|-.|.
T Consensus 125 ~l~~rlra~Y~ 135 (428)
T PRK11637 125 LLAAQLDAAFR 135 (428)
T ss_pred HHHHHHHHHHH
Confidence 66666666665
No 2
>PF10186 Atg14: UV radiation resistance protein and autophagy-related subunit 14; InterPro: IPR018791 Class III phosphatidylinositol 3-kinase (PI3-kinase) regulates multiple membrane trafficking. In yeast, two distinct PI3-kinase complexes are known: complex I (Vps34, Vps15, Vps30/Atg6, and Atg14) is involved in autophagy, and complex II (Vps34, Vps15, Vps30/Atg6, and Vps38) functions in the vacuolar protein sorting pathway. In mammals, the counterparts of Vps34, Vps15, and Vps30/Atg6 are Vps34, p150, and Beclin 1, respectively. Mammalian UV irradiation resistance-associated gene (UVRAG) has been identified as identical to yeast Vps38 []. The Atg14 (autophagy-related protein 14) proteins are hydrophilic proteins and have a coiled-coil motif at the N terminus region. Yeast cells with mutant Atg14 are defective not only in autophagy but also in sorting of carboxypeptidase Y (CPY), a vacuolar-soluble hydrolase, to the vacuole []. This entry represents Atg14 and UVRAG, which bind Beclin 1 to forms two distinct PI3-kinase complexes. This entry also includes Bakor (beclin-1-associated autophagy-related key regulator), also known as autophagy-related protein 14-like protein, which share sequence similarity to the yeast Atg14 protein []. Barkor positively regulates autophagy through its interaction with Beclin-1, with decreased levels of autophagosome formation observed when Barkor expression is eliminated []. Autophagy mediates the cellular response to nutrient deprivation, protein aggregation, and pathogen invasion in humans, and malfunction of autophagy has been implicated in multiple human diseases including cancer. ; GO: 0010508 positive regulation of autophagy
Probab=69.75 E-value=61 Score=31.43 Aligned_cols=90 Identities=18% Similarity=0.146 Sum_probs=49.8
Q ss_pred HhhccHHHHHHHHHHHHHHhh--------hhhhhhcchhhhHHhHhHHHHHHhHHhhHHHHHHHHHHHHhhhhHhhhhhh
Q 008936 57 RARMQKEEIHSLRQQIAVACL--------KELQLQNEKYTLERKVSELRMAIDEKQNEAITSALNELARRKGVLEENLKL 128 (548)
Q Consensus 57 k~r~~eeEI~~LRk~Ladasv--------KE~QllnEKyvLErRiA~mRmAfDqqQqdlVdAASKaLSyRQdiiEENiRL 128 (548)
+...-.++...|+.++.+.-- .-.++..|...++.|++.+|....++++.+-+.=-+.-..|+.+-...-.|
T Consensus 28 ~l~~~~~~~~~l~~~i~~~l~~~~~~~~~~~~~~~~~~~~~~~r~~~l~~~i~~~~~~i~~~r~~l~~~~~~l~~~~~~l 107 (302)
T PF10186_consen 28 ELQQLKEENEELRRRIEEILESDSNGQLLEIQQLKREIEELRERLERLRERIERLRKRIEQKRERLEELRESLEQRRSRL 107 (302)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344445555666666666433 334556677778888888888887777665444333333344443334444
Q ss_pred hhhhhhhhhhhHHHHHHhH
Q 008936 129 AHDLKVAEDERYFFMSSML 147 (548)
Q Consensus 129 tyalq~aeQERstFvSSLL 147 (548)
. +.+...+++..++..+.
T Consensus 108 ~-~~~~~~~~~~~~~~~~~ 125 (302)
T PF10186_consen 108 S-ASQDLVESRQEQLEELQ 125 (302)
T ss_pred H-HHHHHHHHHHHHHHHHH
Confidence 4 34444455555554443
No 3
>PLN03144 Carbon catabolite repressor protein 4 homolog; Provisional
Probab=69.57 E-value=4.7 Score=45.57 Aligned_cols=35 Identities=31% Similarity=0.568 Sum_probs=29.0
Q ss_pred ccccccCCCCCcceeccccccceeEEEeEecCCCC
Q 008936 349 TRHYIEGATNPEYVVTADDVDKLIAVECIPMDDQG 383 (548)
Q Consensus 349 s~~~IeGA~~p~Y~vTADDvd~llAiec~PmDdr~ 383 (548)
....|+=.....|+.|+|||+-.|-+||+|+|...
T Consensus 171 ~~~w~~vg~~~~y~p~~~d~g~~lk~~~~~~~~~~ 205 (606)
T PLN03144 171 GETWIEVGRSKTYTPTADDVGHVLKFECVVVDAET 205 (606)
T ss_pred CCceEEeCCCccccCChhhCCceEEEEEEEccccc
Confidence 34445555788999999999999999999998774
No 4
>PF14988 DUF4515: Domain of unknown function (DUF4515)
Probab=68.76 E-value=23 Score=35.01 Aligned_cols=76 Identities=26% Similarity=0.328 Sum_probs=52.4
Q ss_pred hccHHHHHHHHHHHHHH----hh----hhhhhhcchhhhHHhHhHHHHHH---------hHHhhHHHHHHHHHHH-Hhhh
Q 008936 59 RMQKEEIHSLRQQIAVA----CL----KELQLQNEKYTLERKVSELRMAI---------DEKQNEAITSALNELA-RRKG 120 (548)
Q Consensus 59 r~~eeEI~~LRk~Lada----sv----KE~QllnEKyvLErRiA~mRmAf---------DqqQqdlVdAASKaLS-yRQd 120 (548)
..+|.||..|++.++.. +. =+.|.+.||.-||+-..+++++- .++=|-|.-||-++|. +=++
T Consensus 81 ~~qe~eI~~Le~e~~~~~~e~~~~l~~~~~qfl~EK~~LEke~~e~~i~~l~e~a~~el~~k~~ale~~A~~~l~e~~~~ 160 (206)
T PF14988_consen 81 EQQEREIQTLEEELEKMRAEHAEKLQEAESQFLQEKARLEKEASELKILQLGERAHKELKKKAQALELAAKKSLDEFTRS 160 (206)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhHHHhhHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 46788998888887652 22 25789999999999887777651 2223334444555443 5678
Q ss_pred hHhhhhhhhhhhhh
Q 008936 121 VLEENLKLAHDLKV 134 (548)
Q Consensus 121 iiEENiRLtyalq~ 134 (548)
|-.||.+|.-+|.-
T Consensus 161 i~~EN~~L~k~L~~ 174 (206)
T PF14988_consen 161 IKRENQQLRKELLQ 174 (206)
T ss_pred HHHHHHHHHHHHHH
Confidence 99999999877654
No 5
>PF12128 DUF3584: Protein of unknown function (DUF3584); InterPro: IPR021979 This family consist of uncharacterised bacterial proteins.
Probab=64.53 E-value=1.2e+02 Score=36.75 Aligned_cols=106 Identities=23% Similarity=0.260 Sum_probs=68.2
Q ss_pred HhhccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHHHHhHHhhHHHHHHHHHHHHhhhhHhhhhhhhhhhhhhh
Q 008936 57 RARMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMAIDEKQNEAITSALNELARRKGVLEENLKLAHDLKVAE 136 (548)
Q Consensus 57 k~r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRmAfDqqQqdlVdAASKaLSyRQdiiEENiRLtyalq~ae 136 (548)
+++.++.++..-++-+...=.+.-.+..+|-.|+.+++.++-++.+.++++-.. .+++..+..=+|+.+.-.-+.+..-
T Consensus 789 ~ie~~r~~V~eY~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~l~~~~~~~-~~~~~~~~~~le~~~~~~~~~~~~~ 867 (1201)
T PF12128_consen 789 RIEERRAEVIEYEDWLQEEWDKVDELREEKPELEEQLRDLEQELQELEQELNQL-QKEVKQRRKELEEELKALEEQLEQL 867 (1201)
T ss_pred HHHHhHHHHHHHHHHHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334444444444444444444555677788888888888888888777776543 4455555566666666555555544
Q ss_pred hhhHHHHHHhHHHhhhcCCCCCcccch
Q 008936 137 DERYFFMSSMLGLLADYGLWPHVTNAS 163 (548)
Q Consensus 137 QERstFvSSLLpLLaEY~LqP~V~DAq 163 (548)
++.-.-+..++.-|+++.+.|+..+|+
T Consensus 868 ~~~l~~l~~~~~~l~~~~~~~~~~~~~ 894 (1201)
T PF12128_consen 868 EEQLRRLRDLLEKLAELSEPPNAEDAE 894 (1201)
T ss_pred HHHHHHHHHHHhhhhhcCCCCCchhhh
Confidence 555555566666678888888888888
No 6
>PF04012 PspA_IM30: PspA/IM30 family; InterPro: IPR007157 This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins in many Gram-negative pathogenic bacteria and its homologue in plants, VIPP1, plays a critical role in thylakoid biogenesis, essential for photosynthesis. Activation of transcription by the enhancer-dependent bacterial sigma54-containing RNA polymerase occurs through ATP hydrolysis-driven protein conformational changes enabled by activator proteins that belong to the large AAA(+) mechanochemical protein family. It has been shown that PspA directly and specifically acts upon and binds to the AAA(+) domain of the PspF transcription activator [].
Probab=64.24 E-value=37 Score=32.53 Aligned_cols=117 Identities=18% Similarity=0.239 Sum_probs=72.8
Q ss_pred ccccchhhhhHHHHhhccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHH----HHHHhHHhhHHHHHHHHHHHHhh
Q 008936 44 ISFQDREAMELYSRARMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSEL----RMAIDEKQNEAITSALNELARRK 119 (548)
Q Consensus 44 ~~~~d~~~~~L~~k~r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~m----RmAfDqqQqdlVdAASKaLSyRQ 119 (548)
..+.||+ .-|=.-+|+-+++|..+|+.+|.+...+.++-.+...+++.++.+ +.|...-..||.-.| |..++
T Consensus 19 d~~EDP~-~~l~q~ird~e~~l~~a~~~~a~~~a~~~~le~~~~~~~~~~~~~~~~A~~Al~~g~edLAr~a---l~~k~ 94 (221)
T PF04012_consen 19 DKAEDPE-KMLEQAIRDMEEQLRKARQALARVMANQKRLERKLDEAEEEAEKWEKQAELALAAGREDLAREA---LQRKA 94 (221)
T ss_pred HhhcCHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCHHHHHHH---HHHHH
Confidence 3467888 445567899999999999999998877777666666666655543 456655555554332 22222
Q ss_pred hhHhhhhhhhhhhhhhhhhhHHHHHHhHHHhhhcCCCCCcccchhhhhhHHHHHHHHHHHHHhhhhhhcccc
Q 008936 120 GVLEENLKLAHDLKVAEDERYFFMSSMLGLLADYGLWPHVTNASAISNTVKHLYDQLQSQIRTSYDRIRDLT 191 (548)
Q Consensus 120 diiEENiRLtyalq~aeQERstFvSSLLpLLaEY~LqP~V~DAqSIvsnvKvLf~hLq~kL~~~e~klke~q 191 (548)
+.-+ +. .. |+..+..+...|..+|.-++.|..||..+..|.....
T Consensus 95 ~~e~--------------~~----~~---------l~~~~~~~~~~~~~l~~~l~~l~~kl~e~k~k~~~l~ 139 (221)
T PF04012_consen 95 DLEE--------------QA----ER---------LEQQLDQAEAQVEKLKEQLEELEAKLEELKSKREELK 139 (221)
T ss_pred HHHH--------------HH----HH---------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 1111 00 01 1223345666777888888888888877777655444
No 7
>PRK09039 hypothetical protein; Validated
Probab=60.04 E-value=68 Score=33.75 Aligned_cols=122 Identities=14% Similarity=0.064 Sum_probs=65.4
Q ss_pred hhccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHHHHhHHhhHHHHHHHHHHHHhhhhHhhhhhhhhh-hhhhh
Q 008936 58 ARMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMAIDEKQNEAITSALNELARRKGVLEENLKLAHD-LKVAE 136 (548)
Q Consensus 58 ~r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRmAfDqqQqdlVdAASKaLSyRQdiiEENiRLtya-lq~ae 136 (548)
+...++|+..|..+|++.+.--.==.+....|+.+|+.||--|+.-+. +.+-+|.-+--.++ ...++
T Consensus 48 i~~~~~eL~~L~~qIa~L~e~L~le~~~~~~l~~~l~~l~~~l~~a~~------------~r~~Le~~~~~~~~~~~~~~ 115 (343)
T PRK09039 48 ISGKDSALDRLNSQIAELADLLSLERQGNQDLQDSVANLRASLSAAEA------------ERSRLQALLAELAGAGAAAE 115 (343)
T ss_pred HhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHH------------HHHHHHHHHhhhhhhcchHH
Confidence 456788999999999885432222234456677777777776662111 11122221110111 00011
Q ss_pred hhhHHHHHHhHHHhhhc-CCCCCcccchhhhhhHHHHHHHHHHHHHhhhhhhcccc
Q 008936 137 DERYFFMSSMLGLLADY-GLWPHVTNASAISNTVKHLYDQLQSQIRTSYDRIRDLT 191 (548)
Q Consensus 137 QERstFvSSLLpLLaEY-~LqP~V~DAqSIvsnvKvLf~hLq~kL~~~e~klke~q 191 (548)
..-...-.-|=-+.++| .-+|.|.-.++=+..+|.=-..|+..|...+.+.++.+
T Consensus 116 ~~~~~l~~~L~~~k~~~se~~~~V~~L~~qI~aLr~Qla~le~~L~~ae~~~~~~~ 171 (343)
T PRK09039 116 GRAGELAQELDSEKQVSARALAQVELLNQQIAALRRQLAALEAALDASEKRDRESQ 171 (343)
T ss_pred HHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 11111123344455666 47777777777777777766667777766666655555
No 8
>COG1704 LemA Uncharacterized conserved protein [Function unknown]
Probab=56.66 E-value=23 Score=35.10 Aligned_cols=47 Identities=21% Similarity=0.250 Sum_probs=40.1
Q ss_pred hhhhhhhHHHHHHhHHHhhhc-CCCCCcccchhhhhhHHHHHHHHHHHHHhhhhhhcccc
Q 008936 133 KVAEDERYFFMSSMLGLLADY-GLWPHVTNASAISNTVKHLYDQLQSQIRTSYDRIRDLT 191 (548)
Q Consensus 133 q~aeQERstFvSSLLpLLaEY-~LqP~V~DAqSIvsnvKvLf~hLq~kL~~~e~klke~q 191 (548)
|.||.|.+-+.+.|+=+-.-| +|.-. ..|..||++|..+|.++.-++
T Consensus 91 ~~aq~~Ls~~L~rl~a~~E~YPdLKAn------------~~f~~Lq~ql~~tEn~Ia~aR 138 (185)
T COG1704 91 QEAQAELSSALGRLFAVAEAYPDLKAN------------ENFLELQSQLEGTENRIAVAR 138 (185)
T ss_pred HHHHHHHHHHHHHHHHHHHhCcchhhh------------hHHHHHHHHHHhHHHHHHHHH
Confidence 788899999999999988889 76543 469999999999999987666
No 9
>TIGR03185 DNA_S_dndD DNA sulfur modification protein DndD. This model describes the DndB protein encoded by an operon associated with a sulfur-containing modification to DNA. The operon is sporadically distributed in bacteria, much like some restriction enzyme operons. DndD is described as a putative ATPase. The small number of examples known so far include species from among the Firmicutes, Actinomycetes, Proteobacteria, and Cyanobacteria.
Probab=55.69 E-value=57 Score=36.55 Aligned_cols=59 Identities=20% Similarity=0.181 Sum_probs=46.6
Q ss_pred hhhHHHHhhccHHHHHHHHHHHHHHhhhh--hhhhcchhhhHHhHhHHHHHHhHHhhHHHH
Q 008936 51 AMELYSRARMQKEEIHSLRQQIAVACLKE--LQLQNEKYTLERKVSELRMAIDEKQNEAIT 109 (548)
Q Consensus 51 ~~~L~~k~r~~eeEI~~LRk~LadasvKE--~QllnEKyvLErRiA~mRmAfDqqQqdlVd 109 (548)
...|..+.++.|+||..|.++|+.+.-+| .++..++..++++++.++-.+...++.+-.
T Consensus 393 ~~~~~~~~~~~e~el~~l~~~l~~~~~~e~i~~l~e~l~~l~~~l~~~~~~~~~~~~~~~~ 453 (650)
T TIGR03185 393 KSQLLKELRELEEELAEVDKKISTIPSEEQIAQLLEELGEAQNELFRSEAEIEELLRQLET 453 (650)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34555667788999999999999875533 356677888999999999999998887753
No 10
>PRK10698 phage shock protein PspA; Provisional
Probab=55.60 E-value=87 Score=31.09 Aligned_cols=76 Identities=16% Similarity=0.181 Sum_probs=46.5
Q ss_pred cccchh-hhhHHHHhhccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHH----HHHHhHHhhHHHHHHHHHHHHhh
Q 008936 45 SFQDRE-AMELYSRARMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSEL----RMAIDEKQNEAITSALNELARRK 119 (548)
Q Consensus 45 ~~~d~~-~~~L~~k~r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~m----RmAfDqqQqdlVdAASKaLSyRQ 119 (548)
...||+ ..++ -+++-++.+..+|+-+|.+-..+.++-.+...++.++++. ++|...-..|| |-+||..++
T Consensus 21 kaEDP~k~l~q--~i~em~~~l~~~r~alA~~~A~~k~~er~~~~~~~~~~~~e~kA~~Al~~G~EdL---Ar~AL~~K~ 95 (222)
T PRK10698 21 KAEDPQKLVRL--MIQEMEDTLVEVRSTSARALAEKKQLTRRIEQAEAQQVEWQEKAELALRKEKEDL---ARAALIEKQ 95 (222)
T ss_pred hhcCHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHH---HHHHHHHHH
Confidence 456998 5565 4788899999999999887666544444444444433332 44554444444 456677776
Q ss_pred hhHhhh
Q 008936 120 GVLEEN 125 (548)
Q Consensus 120 diiEEN 125 (548)
..-+.-
T Consensus 96 ~~~~~~ 101 (222)
T PRK10698 96 KLTDLI 101 (222)
T ss_pred HHHHHH
Confidence 665443
No 11
>TIGR01005 eps_transp_fam exopolysaccharide transport protein family. The model describes the exopolysaccharide transport protein family in bacteria. The transport protein is part of a large genetic locus which is associated with exopolysaccharide (EPS) biosynthesis. Detailed molecular characterization and gene fusion analysis revealed atleast seven gene products are involved in the overall regulation, which among other things, include exopolysaccharide biosynthesis, property of conferring virulence and exopolysaccharide export.
Probab=54.39 E-value=1.1e+02 Score=34.66 Aligned_cols=109 Identities=13% Similarity=0.113 Sum_probs=57.4
Q ss_pred HHHHHHHHHHHHHHhhhhhhhhcc--------hhhhHHhHhHHHHHHhHHhhHHHHHHHHHHHHhhhhHhhh--h-----
Q 008936 62 KEEIHSLRQQIAVACLKELQLQNE--------KYTLERKVSELRMAIDEKQNEAITSALNELARRKGVLEEN--L----- 126 (548)
Q Consensus 62 eeEI~~LRk~LadasvKE~QllnE--------KyvLErRiA~mRmAfDqqQqdlVdAASKaLSyRQdiiEEN--i----- 126 (548)
++++..||++|.++-.+=.+...+ .-+.+.+++++.-..-.-|.++++|.++.-..|..+-.-+ .
T Consensus 200 ~~ql~~l~~~l~~aE~~l~~fk~~~~l~~~~~~~~~~~~L~~l~~ql~~a~~~~~~a~a~~~~l~~~l~~~~~~~~~~~~ 279 (754)
T TIGR01005 200 APEIADLSKQSRDAEAEVAAYRAQSDLLMGNNATLATQQLAELNTELSRARANRAAAEGTADSVKKALQNGGSLDVLPEV 279 (754)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCcccCCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccchhhh
Confidence 455667777776664443333222 1134588888887777777777776666555444332111 0
Q ss_pred -----hhhhhhhhhhhhhHHHHHHhHHHhhhc-CCCCCcccchhhhhhHH
Q 008936 127 -----KLAHDLKVAEDERYFFMSSMLGLLADY-GLWPHVTNASAISNTVK 170 (548)
Q Consensus 127 -----RLtyalq~aeQERstFvSSLLpLLaEY-~LqP~V~DAqSIvsnvK 170 (548)
.+.=.++...++....-+-+--|...| .-+|.|.+++.=+..++
T Consensus 280 ~~~~~~~~~~i~~L~~~l~~l~~~~~~l~~~y~~~hP~v~~l~~qi~~l~ 329 (754)
T TIGR01005 280 LSSQLKLEDLIQRLRERQAELRATIADLSTTMLANHPRVVAAKSSLADLD 329 (754)
T ss_pred hcCcccccHHHHHHHHHHHHHHHHHHHHHHhhCCCCHHHHHHHHHHHHHH
Confidence 000112222333333344444566778 77888877554444443
No 12
>KOG0977 consensus Nuclear envelope protein lamin, intermediate filament superfamily [Cell cycle control, cell division, chromosome partitioning; Nuclear structure]
Probab=52.15 E-value=79 Score=35.95 Aligned_cols=40 Identities=28% Similarity=0.356 Sum_probs=36.4
Q ss_pred HHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHHHHh
Q 008936 62 KEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMAID 101 (548)
Q Consensus 62 eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRmAfD 101 (548)
++||.+|=.|||+|.=|=-+|=.|.+.||-.|-.+|-.|-
T Consensus 41 K~El~~LNDRLA~YIekVR~LEaqN~~L~~di~~lr~~~~ 80 (546)
T KOG0977|consen 41 KKELQELNDRLAVYIEKVRFLEAQNRKLEHDINLLRGVVG 80 (546)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc
Confidence 4789999999999999999999999999999998887664
No 13
>PRK09239 chorismate mutase; Provisional
Probab=49.26 E-value=2e+02 Score=25.78 Aligned_cols=94 Identities=15% Similarity=0.142 Sum_probs=58.1
Q ss_pred hhHHhHhHHHHHHhHHhhHHHHHHHHHHHHhhhhHhhhhhhhhhhhhhhhhhHHHHHHhHHHhhhcCCCCCcccchhhhh
Q 008936 88 TLERKVSELRMAIDEKQNEAITSALNELARRKGVLEENLKLAHDLKVAEDERYFFMSSMLGLLADYGLWPHVTNASAISN 167 (548)
Q Consensus 88 vLErRiA~mRmAfDqqQqdlVdAASKaLSyRQdiiEENiRLtyalq~aeQERstFvSSLLpLLaEY~LqP~V~DAqSIvs 167 (548)
-+...++++|-.+|+-=..||+.-+|-+++=++|.+-...-...+-..++|... +..+--+.++++|.|.+
T Consensus 7 ~~~~~L~~lR~~ID~ID~eIv~LLa~R~~l~~~Ia~~K~~~~~~i~dp~RE~~v-l~~~~~~a~~~gl~p~~-------- 77 (104)
T PRK09239 7 RAPAELAALRQSIDNIDAALIHMLAERFKCTQAVGVLKAEHGLPPADPAREAYQ-IERLRQLAKDANLDPDF-------- 77 (104)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHH-HHHHHHHHHHCCCCHHH--------
Confidence 344557777777777666777654444444444433333222222333444433 35566777889999974
Q ss_pred hHHHHHHHHHHHHHhhhhhhcccc
Q 008936 168 TVKHLYDQLQSQIRTSYDRIRDLT 191 (548)
Q Consensus 168 nvKvLf~hLq~kL~~~e~klke~q 191 (548)
++.+|+.+-+.....++++...+
T Consensus 78 -~~~i~~~ii~esir~q~~i~~~~ 100 (104)
T PRK09239 78 -AEKFLNFIIKEVIRHHERIAAEH 100 (104)
T ss_pred -HHHHHHHHHHHHHHHHHHHHhcc
Confidence 47889999888888888776655
No 14
>PRK14153 heat shock protein GrpE; Provisional
Probab=47.91 E-value=2e+02 Score=28.73 Aligned_cols=65 Identities=14% Similarity=0.193 Sum_probs=43.9
Q ss_pred hHHHHHHhHHHhhhc--CCC--CCcccchhhhhhHHHHHHHHHHHHHhhhhhhcccccCCCCCCCCCCcCCCCCCCC
Q 008936 139 RYFFMSSMLGLLADY--GLW--PHVTNASAISNTVKHLYDQLQSQIRTSYDRIRDLTREGGTDAGAGSIDTDRHGVP 211 (548)
Q Consensus 139 RstFvSSLLpLLaEY--~Lq--P~V~DAqSIvsnvKvLf~hLq~kL~~~e~klke~q~~pg~~~~~~~~~~~~~q~p 211 (548)
...|+..|||.+-.+ -|. +.-.+..+|+.+|+..+++|..-|. ...|+++. +.| ..++|+.|+..
T Consensus 82 ~~~~~~~LLpv~DnLerAl~~~~~~~~~~~l~~Gvemi~k~~~~vL~--k~Gv~~I~-~~G-----~~FDP~~HEAv 150 (194)
T PRK14153 82 LEQVLLDLLEVTDNFERALESARTAEDMNSIVEGIEMVSKQFFSILE--KYGLERIE-CEG-----EEFDPHRHEAM 150 (194)
T ss_pred HHHHHHHHhhHHhHHHHHHhcccccchHHHHHHHHHHHHHHHHHHHH--HCCCeeeC-CCC-----CCCChhHhcee
Confidence 346889999998877 232 2234568899999999999998885 55556655 123 23667666543
No 15
>TIGR02977 phageshock_pspA phage shock protein A. Members of this family are the phage shock protein PspA, from the phage shock operon. This is a narrower family than the set of PspA and its homologs, sometimes several in a genome, as described by PFAM model pfam04012. PspA appears to maintain the protonmotive force under stress conditions that include overexpression of certain phage secretins, heat shock, ethanol, and protein export defects.
Probab=45.80 E-value=1.9e+02 Score=28.25 Aligned_cols=54 Identities=22% Similarity=0.271 Sum_probs=35.8
Q ss_pred ccccchh-hhhHHHHhhccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHHHHhHHhhH
Q 008936 44 ISFQDRE-AMELYSRARMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMAIDEKQNE 106 (548)
Q Consensus 44 ~~~~d~~-~~~L~~k~r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRmAfDqqQqd 106 (548)
....||+ ..+. -+|+-+++|...|+-||.+-..+ ..|||+++.++...+..++.
T Consensus 20 dk~EDP~~~l~q--~irem~~~l~~ar~~lA~~~a~~-------k~~e~~~~~~~~~~~~~~~~ 74 (219)
T TIGR02977 20 DKAEDPEKMIRL--IIQEMEDTLVEVRTTSARTIADK-------KELERRVSRLEAQVADWQEK 74 (219)
T ss_pred HhccCHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHH
Confidence 3457898 5555 46888889999999998876654 45566665555444444433
No 16
>COG2900 SlyX Uncharacterized protein conserved in bacteria [Function unknown]
Probab=43.81 E-value=48 Score=28.76 Aligned_cols=37 Identities=27% Similarity=0.443 Sum_probs=29.4
Q ss_pred hhhhHHhHhH--HHHHHhHHhhHHHHHHHHHHHHhhhhHhhh
Q 008936 86 KYTLERKVSE--LRMAIDEKQNEAITSALNELARRKGVLEEN 125 (548)
Q Consensus 86 KyvLErRiA~--mRmAfDqqQqdlVdAASKaLSyRQdiiEEN 125 (548)
...||+||.+ +|+|| |.+.|+.-|-+|..-|-+|+.+
T Consensus 3 ~~~lE~Ri~eLE~r~Af---QE~tieeLn~~laEq~~~i~k~ 41 (72)
T COG2900 3 DMELEARIIELEIRLAF---QEQTIEELNDALAEQQLVIDKL 41 (72)
T ss_pred hhhHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHH
Confidence 3468999986 58898 7778999888888888777754
No 17
>PF15294 Leu_zip: Leucine zipper
Probab=42.11 E-value=2.6e+02 Score=29.53 Aligned_cols=41 Identities=22% Similarity=0.232 Sum_probs=36.1
Q ss_pred hhccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHH
Q 008936 58 ARMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRM 98 (548)
Q Consensus 58 ~r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRm 98 (548)
++.-.+|...||.||...--.=...+.||..|++.+.+||.
T Consensus 134 i~rLq~EN~kLk~rl~~le~~at~~l~Ek~kl~~~L~~lq~ 174 (278)
T PF15294_consen 134 IDRLQEENEKLKERLKSLEKQATSALDEKSKLEAQLKELQD 174 (278)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 35556788999999999888888899999999999999998
No 18
>KOG0978 consensus E3 ubiquitin ligase involved in syntaxin degradation [Posttranslational modification, protein turnover, chaperones]
Probab=41.41 E-value=2.3e+02 Score=33.43 Aligned_cols=109 Identities=17% Similarity=0.159 Sum_probs=76.2
Q ss_pred hhhhcchhhhHHhHhHHHHHHhHHhhHH-------------HHHHHHHHHHhhhhHhhhhhhhhhhhhhhhhhHHHHHHh
Q 008936 80 LQLQNEKYTLERKVSELRMAIDEKQNEA-------------ITSALNELARRKGVLEENLKLAHDLKVAEDERYFFMSSM 146 (548)
Q Consensus 80 ~QllnEKyvLErRiA~mRmAfDqqQqdl-------------VdAASKaLSyRQdiiEENiRLtyalq~aeQERstFvSSL 146 (548)
--|++||++|+.-|+.|--+.|-.++.+ +..-+|.|.-++..+|.+=|.+=++.---...-.++..
T Consensus 499 k~L~~ek~~l~~~i~~l~~~~~~~~~~i~~leeq~~~lt~~~~~l~~el~~~~~~le~~kk~~~e~~~~~~~Lq~~~ek- 577 (698)
T KOG0978|consen 499 KLLREEKSKLEEQILTLKASVDKLELKIGKLEEQERGLTSNESKLIKELTTLTQSLEMLKKKAQEAKQSLEDLQIELEK- 577 (698)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-
Confidence 3578899999999999887777666543 33456777778888887766654433222222222222
Q ss_pred HHHhhhc-CCCCCcccchhhhhhHHHHHHHHHHHHHhhhhhhcccc
Q 008936 147 LGLLADY-GLWPHVTNASAISNTVKHLYDQLQSQIRTSYDRIRDLT 191 (548)
Q Consensus 147 LpLLaEY-~LqP~V~DAqSIvsnvKvLf~hLq~kL~~~e~klke~q 191 (548)
.=+.+ .+++.+.|+.+=+...+--.+||++++..+..||....
T Consensus 578 --~~~~le~i~~~~~e~~~ele~~~~k~~rleEE~e~L~~kle~~k 621 (698)
T KOG0978|consen 578 --SEAKLEQIQEQYAELELELEIEKFKRKRLEEELERLKRKLERLK 621 (698)
T ss_pred --HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 22344 68888888888888888889999999999999988777
No 19
>KOG0250 consensus DNA repair protein RAD18 (SMC family protein) [Replication, recombination and repair]
Probab=40.71 E-value=3.3e+02 Score=33.58 Aligned_cols=139 Identities=17% Similarity=0.138 Sum_probs=90.9
Q ss_pred hhhhhHHHHhhccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHHHHhHHhhHHHHHHHHHHHHhhhhH------
Q 008936 49 REAMELYSRARMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMAIDEKQNEAITSALNELARRKGVL------ 122 (548)
Q Consensus 49 ~~~~~L~~k~r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRmAfDqqQqdlVdAASKaLSyRQdii------ 122 (548)
+.-.+|+...+..++||......+-+..-+.-+|..|+.-++.-.+.+|.++- ++++..|++--+|..|++-+
T Consensus 734 ~~~~~l~~ei~~~~~eIe~~~~~~e~l~~e~e~~~~e~~e~~~~~~~~~~~l~-~e~~~l~~l~~el~~r~dk~~s~e~~ 812 (1074)
T KOG0250|consen 734 SKLEDLAREIKKKEKEIEEKEAPLEKLKEELEHIELEAQELEEYYAAGREKLQ-GEISKLDALKEELKLREDKLRSAEDE 812 (1074)
T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHhhHHHHHHHHHHHHHhhhhhh
Confidence 44567788888899999999888888889999999999999999999999995 55566778878888887755
Q ss_pred ----hhhhh-hhhhhhhhhhhhHHHHHHhHHHh-------hhcCCCC-CcccchhhhhhHHHHHHHHHHHHHhhhhhhcc
Q 008936 123 ----EENLK-LAHDLKVAEDERYFFMSSMLGLL-------ADYGLWP-HVTNASAISNTVKHLYDQLQSQIRTSYDRIRD 189 (548)
Q Consensus 123 ----EENiR-Ltyalq~aeQERstFvSSLLpLL-------aEY~LqP-~V~DAqSIvsnvKvLf~hLq~kL~~~e~klke 189 (548)
|++++ -.-++.-+|-| .+..=.|+. +++-=-| -+.+-+--|.-+|.--+.|+.+++-.++.+.+
T Consensus 813 ~~HyE~~~K~~l~~l~~~E~~---~~~~e~~~~e~~~ka~~~cp~~~~ei~~~~~~~~eik~ei~rlk~~i~~~ee~~~~ 889 (1074)
T KOG0250|consen 813 KRHYEDKLKSRLEELKQKEVE---KVNLEEPRAEEDQKARTECPEEGIEIEALGKTVAEIKREIKRLKRQIQMCEESLGE 889 (1074)
T ss_pred hhhHHHHHHHhhHHHHHHHHH---HHhhhcchhhhCchhhhhCccccchhhcccchHHHHHHHHHHHHHHHHHHHHhcch
Confidence 44555 12222222222 222222222 2221000 11111113566777788888888888877776
Q ss_pred cc
Q 008936 190 LT 191 (548)
Q Consensus 190 ~q 191 (548)
++
T Consensus 890 ~~ 891 (1074)
T KOG0250|consen 890 LE 891 (1074)
T ss_pred HH
Confidence 66
No 20
>COG1579 Zn-ribbon protein, possibly nucleic acid-binding [General function prediction only]
Probab=40.32 E-value=1.6e+02 Score=30.25 Aligned_cols=92 Identities=17% Similarity=0.297 Sum_probs=64.4
Q ss_pred cHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHHHHhHHhhHHHHHHHHHHHHhhhhHhhhhhhhhhhhhhhhhhH
Q 008936 61 QKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMAIDEKQNEAITSALNELARRKGVLEENLKLAHDLKVAEDERY 140 (548)
Q Consensus 61 ~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRmAfDqqQqdlVdAASKaLSyRQdiiEENiRLtyalq~aeQERs 140 (548)
.+.|+..+++.+.+.=+.=--+-+++.-+|-.|..+|=-.+--+..+ ++++++=-|+. |..+++.|++.+-
T Consensus 36 ~~~e~e~~~~~~~~~~~e~e~le~qv~~~e~ei~~~r~r~~~~e~kl-~~v~~~~e~~a--------L~~E~~~ak~r~~ 106 (239)
T COG1579 36 AKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEEKL-SAVKDERELRA--------LNIEIQIAKERIN 106 (239)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-hccccHHHHHH--------HHHHHHHHHHHHH
Confidence 34566777777777766666677888888888888875555555555 77766655553 6777888888888
Q ss_pred HHHHHhHHHhhhc-CCCCCccc
Q 008936 141 FFMSSMLGLLADY-GLWPHVTN 161 (548)
Q Consensus 141 tFvSSLLpLLaEY-~LqP~V~D 161 (548)
..-+.|.+|+-++ .|+=-..+
T Consensus 107 ~le~el~~l~~~~~~l~~~i~~ 128 (239)
T COG1579 107 SLEDELAELMEEIEKLEKEIED 128 (239)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 8888888888777 44443333
No 21
>PRK14148 heat shock protein GrpE; Provisional
Probab=39.88 E-value=3.1e+02 Score=27.30 Aligned_cols=64 Identities=14% Similarity=0.193 Sum_probs=43.3
Q ss_pred hHHHHHHhHHHhhhc----CCCCCcccchhhhhhHHHHHHHHHHHHHhhhhhhcccccCCCCCCCCCCcCCCCCCC
Q 008936 139 RYFFMSSMLGLLADY----GLWPHVTNASAISNTVKHLYDQLQSQIRTSYDRIRDLTREGGTDAGAGSIDTDRHGV 210 (548)
Q Consensus 139 RstFvSSLLpLLaEY----~LqP~V~DAqSIvsnvKvLf~hLq~kL~~~e~klke~q~~pg~~~~~~~~~~~~~q~ 210 (548)
..-|+..|||.+--+ +.-+.-.++.+++.+|+-.+++|..=|. ..-|+++. +.| ..++|+.|+-
T Consensus 89 ~~~~~~~LLpV~DnlerAl~~~~~~~~~~~l~~Gv~mi~k~l~~vL~--k~Gv~~I~-~~G-----~~FDP~~HEA 156 (195)
T PRK14148 89 IEKFAKELLPVIDSIEQALKHEVKLEEAIAMKEGIELTAKMLVDILK--KNGVEELD-PKG-----EKFDPNLHEA 156 (195)
T ss_pred HHHHHHHHhhHHhHHHHHHhccccchhHHHHHHHHHHHHHHHHHHHH--HCCCEEeC-CCC-----CCCChhHhhe
Confidence 346889999998877 2234334567899999999999988885 44555554 112 2366776754
No 22
>PF11464 Rbsn: Rabenosyn Rab binding domain; InterPro: IPR021565 Rabenosyn-5 (Rbsn) is a multivalent effector with interacts with the Rab family.Rsbn contains distinct Rab4 and Rab5 binding sites within residues 264-500 and 627-784 respectively []. Rab proteins are GTPases involved in the regulation of all stages of membrane trafficking []. ; PDB: 1Z0K_B 1YZM_A 1Z0J_B.
Probab=39.05 E-value=69 Score=25.19 Aligned_cols=40 Identities=38% Similarity=0.426 Sum_probs=34.1
Q ss_pred HHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHHHHhHHh
Q 008936 63 EEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMAIDEKQ 104 (548)
Q Consensus 63 eEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRmAfDqqQ 104 (548)
|.|..+|..|-+| |.+.=..|-.+||+-+.+|+-.|++||
T Consensus 3 eQi~~I~~~I~qA--k~~~r~dEV~~L~~NL~EL~~e~~~qq 42 (42)
T PF11464_consen 3 EQINIIESYIKQA--KAARRFDEVATLEENLRELQDEIDEQQ 42 (42)
T ss_dssp HHHHHHHHHHHHH--HHTT-HHHHHHHHHHHHHHHHHHHHHC
T ss_pred HHHHHHHHHHHHH--HHhcCcHHHHHHHHHHHHHHHHHHhcC
Confidence 5678888888876 677788999999999999999999987
No 23
>KOG3091 consensus Nuclear pore complex, p54 component (sc Nup57) [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=39.03 E-value=75 Score=35.86 Aligned_cols=80 Identities=26% Similarity=0.283 Sum_probs=54.9
Q ss_pred cchhhhhHHHHhhccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHh---------------------HHHHHHhHHhh
Q 008936 47 QDREAMELYSRARMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVS---------------------ELRMAIDEKQN 105 (548)
Q Consensus 47 ~d~~~~~L~~k~r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA---------------------~mRmAfDqqQq 105 (548)
||-+++.---|++.-.|++..|.||=||+++|=+++.+---.|++||= +||.-+|--++
T Consensus 346 Q~q~~~~~r~ri~~i~e~v~eLqk~~ad~~~KI~~~k~r~~~Ls~RiLRv~ikqeilr~~G~~L~~~EE~Lr~Kldtll~ 425 (508)
T KOG3091|consen 346 QDQEVKQHRIRINAIGERVTELQKHHADAVAKIEEAKNRHVELSHRILRVMIKQEILRKRGYALTPDEEELRAKLDTLLA 425 (508)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcCCccHHHHHHHHHHHHH
Confidence 455555444566777788888999999999999999998889999982 34444444333
Q ss_pred HHHHHHHHHHHHhhhhHhhhhhh
Q 008936 106 EAITSALNELARRKGVLEENLKL 128 (548)
Q Consensus 106 dlVdAASKaLSyRQdiiEENiRL 128 (548)
.+=+= .+|.-|=+.|-|=+|.
T Consensus 426 ~ln~P--nq~k~Rl~~L~e~~r~ 446 (508)
T KOG3091|consen 426 QLNAP--NQLKARLDELYEILRM 446 (508)
T ss_pred HhcCh--HHHHHHHHHHHHHHHh
Confidence 33222 5677777777666654
No 24
>PRK14154 heat shock protein GrpE; Provisional
Probab=37.72 E-value=3.4e+02 Score=27.39 Aligned_cols=152 Identities=13% Similarity=0.149 Sum_probs=82.2
Q ss_pred CChhhhhhccccc-ccccccccCCCC--ccccchhhhhH-HHHhhccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHh
Q 008936 19 KNSDFVNRHKIET-HLAPTKQKEDNF--ISFQDREAMEL-YSRARMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVS 94 (548)
Q Consensus 19 ~~s~~l~r~~~~~-~l~~~~~~~~~~--~~~~d~~~~~L-~~k~r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA 94 (548)
|-|++-..++.++ .-+..-+.++.- ..-|.|+-..+ +..+.+.+++|..|+++++++- ++
T Consensus 11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~el~~le~e~~elk--------d~-------- 74 (208)
T PRK14154 11 PESETKAKNKWEKVMEAEEEQEEGGGDGSQEMEPHREGLEFPSREKLEGQLTRMERKVDEYK--------TQ-------- 74 (208)
T ss_pred chhhhhhhhHHHHHHHHHHHHhhcCCCcccccCcccccccCcchhhHHHHHHHHHHHHHHHH--------HH--------
Confidence 5666666666665 222223333322 33467776666 4455556677777777776542 11
Q ss_pred HHHHHHhHHhhHHHHHHHHHHHHhhhhHhhhhhhhhhhhhhhhhhHHHHHHhHHHhhhc--CCC--C-CcccchhhhhhH
Q 008936 95 ELRMAIDEKQNEAITSALNELARRKGVLEENLKLAHDLKVAEDERYFFMSSMLGLLADY--GLW--P-HVTNASAISNTV 169 (548)
Q Consensus 95 ~mRmAfDqqQqdlVdAASKaLSyRQdiiEENiRLtyalq~aeQERstFvSSLLpLLaEY--~Lq--P-~V~DAqSIvsnv 169 (548)
+||+.=|- =.||+..-.|=- ++-..-..-|+..|||.+-.+ -|. + .-.+..+|+..|
T Consensus 75 ~lRl~ADf------------eNyRKR~~kE~e------~~~~~a~e~~~~~LLpVlDnLeRAL~~~~~~~~~~~~l~eGv 136 (208)
T PRK14154 75 YLRAQAEM------------DNLRKRIEREKA------DIIKFGSKQLITDLLPVADSLIHGLESPASEDPQVKSMRDGM 136 (208)
T ss_pred HHHHHHHH------------HHHHHHHHHHHH------HHHHHHHHHHHHHHhhHHhHHHHHHhcccccchhHHHHHHHH
Confidence 23332211 123333322210 111122346888999988777 232 1 223468899999
Q ss_pred HHHHHHHHHHHHhhhhhhcccccCCCCCCCCCCcCCCCCCCC
Q 008936 170 KHLYDQLQSQIRTSYDRIRDLTREGGTDAGAGSIDTDRHGVP 211 (548)
Q Consensus 170 KvLf~hLq~kL~~~e~klke~q~~pg~~~~~~~~~~~~~q~p 211 (548)
+-.+++|..-|. ...|+++.-.+| ..++|+.|+-.
T Consensus 137 emi~k~l~~vL~--k~GVe~I~~~~G-----~~FDP~~HEAv 171 (208)
T PRK14154 137 SLTLDLLHNTLA--KHGVQVINPNPG-----DPFDPALHEAM 171 (208)
T ss_pred HHHHHHHHHHHH--HCCCEEecCCCC-----CCCChhHhhee
Confidence 999999988885 555666652234 24666666543
No 25
>COG4942 Membrane-bound metallopeptidase [Cell division and chromosome partitioning]
Probab=37.68 E-value=5.7e+02 Score=28.56 Aligned_cols=93 Identities=24% Similarity=0.282 Sum_probs=52.0
Q ss_pred hhccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHHHHhHHhhHHHHHHHHHHHHhhhhHhhhhhhhhhhhhhhh
Q 008936 58 ARMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMAIDEKQNEAITSALNELARRKGVLEENLKLAHDLKVAED 137 (548)
Q Consensus 58 ~r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRmAfDqqQqdlVdAASKaLSyRQdiiEENiRLtyalq~aeQ 137 (548)
.+.-..||..+.+.|++..=+=.+|..+--.||+.|+.++ ..|+..+..-=.-+++|-+=|=+| -+|++.+.
T Consensus 40 l~q~q~ei~~~~~~i~~~~~~~~kL~~~lk~~e~~i~~~~-------~ql~~s~~~l~~~~~~I~~~~~~l-~~l~~q~r 111 (420)
T COG4942 40 LKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLE-------AQLIETADDLKKLRKQIADLNARL-NALEVQER 111 (420)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHhHHHHHHhhHHHHHHHH-HHHHHHHH
Confidence 3344455556666665555444445445445555554443 334433332222233333333333 36777778
Q ss_pred hhHHHHHHhHHHhhhcCCCCC
Q 008936 138 ERYFFMSSMLGLLADYGLWPH 158 (548)
Q Consensus 138 ERstFvSSLLpLLaEY~LqP~ 158 (548)
||.-+++-+|--+--.+..||
T Consensus 112 ~qr~~La~~L~A~~r~g~~p~ 132 (420)
T COG4942 112 EQRRRLAEQLAALQRSGRNPP 132 (420)
T ss_pred HHHHHHHHHHHHHHhccCCCC
Confidence 888899988888877788885
No 26
>PRK14140 heat shock protein GrpE; Provisional
Probab=37.01 E-value=4.1e+02 Score=26.42 Aligned_cols=65 Identities=15% Similarity=0.220 Sum_probs=41.8
Q ss_pred hHHHHHHhHHHhhhc--CC--CCCcccchhhhhhHHHHHHHHHHHHHhhhhhhcccccCCCCCCCCCCcCCCCCCCC
Q 008936 139 RYFFMSSMLGLLADY--GL--WPHVTNASAISNTVKHLYDQLQSQIRTSYDRIRDLTREGGTDAGAGSIDTDRHGVP 211 (548)
Q Consensus 139 RstFvSSLLpLLaEY--~L--qP~V~DAqSIvsnvKvLf~hLq~kL~~~e~klke~q~~pg~~~~~~~~~~~~~q~p 211 (548)
..-|+..|||.+--+ -| -|.-.+..+|+.+|+-.+++|..=|. .-.++.+. ..| ..++|+.|+.-
T Consensus 86 ~~~~~~~LLpvlDnLerAl~~~~~~~~~~~i~~Gv~mi~k~l~~~L~--k~GV~~i~-~~G-----e~FDP~~HEAv 154 (191)
T PRK14140 86 AQSLASDLLPALDNFERALQIEADDEQTKSLLKGVEMVHRQLLEALK--KEGVEVIE-AVG-----EQFDPNLHQAV 154 (191)
T ss_pred HHHHHHHHHHHHHHHHHHHhccCccchHHHHHHHHHHHHHHHHHHHH--HCCCEeeC-CCC-----CCCChHHhccc
Confidence 456899999998877 22 24445567888999988888877774 33333332 122 24677777643
No 27
>PF10186 Atg14: UV radiation resistance protein and autophagy-related subunit 14; InterPro: IPR018791 Class III phosphatidylinositol 3-kinase (PI3-kinase) regulates multiple membrane trafficking. In yeast, two distinct PI3-kinase complexes are known: complex I (Vps34, Vps15, Vps30/Atg6, and Atg14) is involved in autophagy, and complex II (Vps34, Vps15, Vps30/Atg6, and Vps38) functions in the vacuolar protein sorting pathway. In mammals, the counterparts of Vps34, Vps15, and Vps30/Atg6 are Vps34, p150, and Beclin 1, respectively. Mammalian UV irradiation resistance-associated gene (UVRAG) has been identified as identical to yeast Vps38 []. The Atg14 (autophagy-related protein 14) proteins are hydrophilic proteins and have a coiled-coil motif at the N terminus region. Yeast cells with mutant Atg14 are defective not only in autophagy but also in sorting of carboxypeptidase Y (CPY), a vacuolar-soluble hydrolase, to the vacuole []. This entry represents Atg14 and UVRAG, which bind Beclin 1 to forms two distinct PI3-kinase complexes. This entry also includes Bakor (beclin-1-associated autophagy-related key regulator), also known as autophagy-related protein 14-like protein, which share sequence similarity to the yeast Atg14 protein []. Barkor positively regulates autophagy through its interaction with Beclin-1, with decreased levels of autophagosome formation observed when Barkor expression is eliminated []. Autophagy mediates the cellular response to nutrient deprivation, protein aggregation, and pathogen invasion in humans, and malfunction of autophagy has been implicated in multiple human diseases including cancer. ; GO: 0010508 positive regulation of autophagy
Probab=36.63 E-value=3.7e+02 Score=26.14 Aligned_cols=51 Identities=18% Similarity=0.287 Sum_probs=21.6
Q ss_pred HhhccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHHHHhHHhhHH
Q 008936 57 RARMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMAIDEKQNEA 107 (548)
Q Consensus 57 k~r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRmAfDqqQqdl 107 (548)
.++..+.++..++.+++...-+=.++..+-..+-+|++++|-+.....+.+
T Consensus 57 ~~~~~~~~~~~~~~r~~~l~~~i~~~~~~i~~~r~~l~~~~~~l~~~~~~l 107 (302)
T PF10186_consen 57 EIQQLKREIEELRERLERLRERIERLRKRIEQKRERLEELRESLEQRRSRL 107 (302)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334444444444444443332223333333344444444444444444433
No 28
>PF01895 PhoU: PhoU domain; InterPro: IPR008170 This family contains phosphate regulatory proteins including PhoU. PhoU proteins are known to play a role in the regulation of phosphate uptake. The PhoU domain is composed of a three helix bundle []. The PhoU protein contains two copies of this domain. The domain binds to an iron cluster via its conserved E/DXXXD motif. Deletion of PhoU activates constitutive expression of the phosphate ABC transporter and allows phosphate transport, but causes a growth defect; suggesting that the protein has some secondary function []. ; PDB: 2I0M_A 1T72_B 1T8B_A 1SUM_B 1VCT_A 2BKN_A 2BKP_A 2BKO_A.
Probab=36.59 E-value=1.2e+02 Score=23.51 Aligned_cols=50 Identities=36% Similarity=0.302 Sum_probs=36.0
Q ss_pred hhHHHHhhccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHHHHh
Q 008936 52 MELYSRARMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMAID 101 (548)
Q Consensus 52 ~~L~~k~r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRmAfD 101 (548)
.++..++...|++|..+...+.++.++..+-.+....++.=+.+|+++.+
T Consensus 24 ~~~a~~i~~~e~~id~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~ 73 (88)
T PF01895_consen 24 SELAQEIIQLEEEIDELYREIRRQILKILKNQNPLEELRELVGLLRIARD 73 (88)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCGHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHhHHHHHHHHHHHHHHHHHHHhhCCCHHHHHHHHHHHHHHHH
Confidence 46668888999999999999999988877543433335555666666544
No 29
>KOG1222 consensus Kinesin associated protein KAP [Intracellular trafficking, secretion, and vesicular transport]
Probab=36.49 E-value=1.4e+02 Score=34.36 Aligned_cols=128 Identities=22% Similarity=0.209 Sum_probs=83.9
Q ss_pred hhhhccccc-ccccccccCCCCccccchhhhhHHHHhhccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHHHHh
Q 008936 23 FVNRHKIET-HLAPTKQKEDNFISFQDREAMELYSRARMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMAID 101 (548)
Q Consensus 23 ~l~r~~~~~-~l~~~~~~~~~~~~~~d~~~~~L~~k~r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRmAfD 101 (548)
.|+|+.+-- -|..++-.-| .+|++..| +|||..|-|+|--+..|..||| |+||.
T Consensus 231 ElkRye~w~~El~k~krs~d-----e~p~netL-------k~e~dr~~kklk~~~~KQeqLL-------------rva~y 285 (791)
T KOG1222|consen 231 ELKRYEFWIAELKKTKRSTD-----EKPKNETL-------KEEIDRLNKKLKTAIRKQEQLL-------------RVAVY 285 (791)
T ss_pred HHHHHHHHHHHHhhhhcccc-----cCcchhhH-------HHHHHHHHHHHHHHHHHHHHHH-------------HHHHH
Confidence 567776543 3433332222 68888777 8999999999999999999985 44544
Q ss_pred HHhhHHHHHHHHHHHHhhhhHhhhhhhhhhhhhhhhhhHHHHHHhHHHhhhcCCCCCcccchhhhhhHHHHHHHHHH
Q 008936 102 EKQNEAITSALNELARRKGVLEENLKLAHDLKVAEDERYFFMSSMLGLLADYGLWPHVTNASAISNTVKHLYDQLQS 178 (548)
Q Consensus 102 qqQqdlVdAASKaLSyRQdiiEENiRLtyalq~aeQERstFvSSLLpLLaEY~LqP~V~DAqSIvsnvKvLf~hLq~ 178 (548)
--=+ |..-.+-+|.-|...|-. -|--||+..--+..+.|++.|-=|+=|+=---|.-+--||.-+-.||+-=..
T Consensus 286 lLlN-lAed~~~ElKMrrkniV~--mLVKaLdr~n~~Ll~lv~~FLkKLSIf~eNK~~M~~~~iveKL~klfp~~h~ 359 (791)
T KOG1222|consen 286 LLLN-LAEDISVELKMRRKNIVA--MLVKALDRSNSSLLTLVIKFLKKLSIFDENKIVMEQNGIVEKLLKLFPIQHP 359 (791)
T ss_pred HHHH-HhhhhhHHHHHHHHhHHH--HHHHHHcccchHHHHHHHHHHHHhhhhccchHHHHhccHHHHHHHhcCCCCH
Confidence 3222 223334444443322211 3556677777788899999999898887777777777888777777654333
No 30
>PRK10361 DNA recombination protein RmuC; Provisional
Probab=36.20 E-value=3.6e+02 Score=30.38 Aligned_cols=57 Identities=11% Similarity=0.154 Sum_probs=33.1
Q ss_pred hccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHHHHhHHhhHHHHHHHHHHH
Q 008936 59 RMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMAIDEKQNEAITSALNELA 116 (548)
Q Consensus 59 r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRmAfDqqQqdlVdAASKaLS 116 (548)
.+.+++...++.+|...--....+..+...|+.|+...|-+..+|++. ...|.+.|+
T Consensus 56 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~l~~~le~~~~~~~ek~~~-l~~~~~~L~ 112 (475)
T PRK10361 56 EHWRAECELLNNEVRSLQSINTSLEADLREVTTRMEAAQQHADDKIRQ-MINSEQRLS 112 (475)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHH
Confidence 344445555555555555555555666667778888888877665543 444444443
No 31
>TIGR03185 DNA_S_dndD DNA sulfur modification protein DndD. This model describes the DndB protein encoded by an operon associated with a sulfur-containing modification to DNA. The operon is sporadically distributed in bacteria, much like some restriction enzyme operons. DndD is described as a putative ATPase. The small number of examples known so far include species from among the Firmicutes, Actinomycetes, Proteobacteria, and Cyanobacteria.
Probab=35.93 E-value=1.3e+02 Score=33.93 Aligned_cols=23 Identities=17% Similarity=0.108 Sum_probs=15.1
Q ss_pred hhhhhhhhhhhhHHHHHHhHHHh
Q 008936 128 LAHDLKVAEDERYFFMSSMLGLL 150 (548)
Q Consensus 128 Ltyalq~aeQERstFvSSLLpLL 150 (548)
+--.++.++++..-|.+..+||+
T Consensus 274 le~e~~e~~~~l~~l~~~~~p~~ 296 (650)
T TIGR03185 274 IEAARKANRAQLRELAADPLPLL 296 (650)
T ss_pred HHHHHHHHHHHHHHHhcccCCHh
Confidence 34455556677777887777774
No 32
>PLN02939 transferase, transferring glycosyl groups
Probab=35.41 E-value=2.5e+02 Score=34.18 Aligned_cols=100 Identities=25% Similarity=0.372 Sum_probs=61.6
Q ss_pred hhhhHHHHhhccHHHHHHHHHHHHHHhhhhh-hhhcchhhhHHhHh--HHHHHHhHHhhH--------------------
Q 008936 50 EAMELYSRARMQKEEIHSLRQQIAVACLKEL-QLQNEKYTLERKVS--ELRMAIDEKQNE-------------------- 106 (548)
Q Consensus 50 ~~~~L~~k~r~~eeEI~~LRk~LadasvKE~-QllnEKyvLErRiA--~mRmAfDqqQqd-------------------- 106 (548)
.-.+|..-++.-|.-|+.|-+ ...-.+.++ .||+||..|.++|. +||+|=-..+-.
T Consensus 129 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 207 (977)
T PLN02939 129 QLEDLVGMIQNAEKNILLLNQ-ARLQALEDLEKILTEKEALQGKINILEMRLSETDARIKLAAQEKIHVEILEEQLEKLR 207 (977)
T ss_pred cHHHHHHHHHHHHhhhHhHHH-HHHHHHHHHHHHHHHHHHHHhhHHHHHHHhhhhhhhhhhhhhccccchhhHHHHHHHh
Confidence 345666667777777766543 233334444 48999999988865 666664211110
Q ss_pred ------------HHHHHHHHHHHhhhhHhhhhhhhhhhhh-----------------hhhhhHHHHHHhHHHhhhc
Q 008936 107 ------------AITSALNELARRKGVLEENLKLAHDLKV-----------------AEDERYFFMSSMLGLLADY 153 (548)
Q Consensus 107 ------------lVdAASKaLSyRQdiiEENiRLtyalq~-----------------aeQERstFvSSLLpLLaEY 153 (548)
-|.+-+++|. .+-|||+-|--+++. .+.||+..=+||-.|=+.+
T Consensus 208 ~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 280 (977)
T PLN02939 208 NELLIRGATEGLCVHSLSKELD---VLKEENMLLKDDIQFLKAELIEVAETEERVFKLEKERSLLDASLRELESKF 280 (977)
T ss_pred hhhhccccccccccccHHHHHH---HHHHHhHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 1112355554 245899999988876 4567777777777765555
No 33
>TIGR01076 sortase_fam LPXTG-site transpeptidase (sortase) family protein. of an LPXTG motif to the cell wall. It also includes a protein required for correct assembly of an LPXTG-containing fimbrial protein, a set of homologous proteins from Streptococcus pneumoniae, in which LPXTG proteins are common. However, related proteins are found in Bacillus subtilis and Methanobacterium thermoautotrophicum, in which LPXTG-mediated cell wall attachment is not known.
Probab=34.56 E-value=39 Score=30.50 Aligned_cols=43 Identities=23% Similarity=0.286 Sum_probs=30.7
Q ss_pred cceeeeccccCCccceeeEEEEeccCCccccccCC-CCCcceecc
Q 008936 322 EKLLGCGYPVRGTTLCMFQWVRHLQDGTRHYIEGA-TNPEYVVTA 365 (548)
Q Consensus 322 ~~L~acGy~inGTt~C~FQWVRhleDGs~~~IeGA-~~p~Y~vTA 365 (548)
+-..-.||... ..-.+|..+..|+.|..-++... ...+|.|++
T Consensus 42 gN~vIaGH~~~-~~~~~F~~L~~l~~GD~i~v~~~~~~~~Y~V~~ 85 (136)
T TIGR01076 42 TRIVITGHRGL-PTATMFTNLDKLKKGDMLYLHVGNEVLTYQVTS 85 (136)
T ss_pred CeEEEEecCCC-CCCCccCCHHHCCCCCEEEEEECCcEEEEEEEE
Confidence 45666778754 33569999999999999887743 345677654
No 34
>PRK14154 heat shock protein GrpE; Provisional
Probab=32.98 E-value=2.2e+02 Score=28.69 Aligned_cols=26 Identities=8% Similarity=0.072 Sum_probs=17.2
Q ss_pred cchhhhhHHHHhhccHHHHHHHHHHH
Q 008936 47 QDREAMELYSRARMQKEEIHSLRQQI 72 (548)
Q Consensus 47 ~d~~~~~L~~k~r~~eeEI~~LRk~L 72 (548)
.-|+-.+|-.++.+-++++..|+.++
T Consensus 50 ~~~~~~~l~~el~~le~e~~elkd~~ 75 (208)
T PRK14154 50 EFPSREKLEGQLTRMERKVDEYKTQY 75 (208)
T ss_pred cCcchhhHHHHHHHHHHHHHHHHHHH
Confidence 44556666666677777777776665
No 35
>PRK14162 heat shock protein GrpE; Provisional
Probab=32.86 E-value=4.7e+02 Score=26.08 Aligned_cols=64 Identities=14% Similarity=0.215 Sum_probs=42.5
Q ss_pred HHHHHHhHHHhhhcC--CC--CCcccchhhhhhHHHHHHHHHHHHHhhhhhhcccccCCCCCCCCCCcCCCCCCCC
Q 008936 140 YFFMSSMLGLLADYG--LW--PHVTNASAISNTVKHLYDQLQSQIRTSYDRIRDLTREGGTDAGAGSIDTDRHGVP 211 (548)
Q Consensus 140 stFvSSLLpLLaEY~--Lq--P~V~DAqSIvsnvKvLf~hLq~kL~~~e~klke~q~~pg~~~~~~~~~~~~~q~p 211 (548)
.-|+..|||.+--+. |. |.-.+..+|+.+|+-.+++|..=|. ..-|+++. ..| ..++|+.|+--
T Consensus 89 ~~~~~~LLpV~DnLerAl~~~~~~~~~~~l~~Gvemi~k~l~~vL~--~~GV~~I~-~~G-----~~FDP~~HEAv 156 (194)
T PRK14162 89 QSLAKDVLPAMDNLERALAVKADDEAAKQLKKGVQMTLDHLVKALK--DHGVTEIK-ADG-----EKFDPTLHQAV 156 (194)
T ss_pred HHHHHHHhhHHhHHHHHHhccccchhHHHHHHHHHHHHHHHHHHHH--HCCCEEeC-CCC-----CCCChhHhhhh
Confidence 458899999988773 22 3323457899999999999988875 44555554 223 23667666543
No 36
>PF04156 IncA: IncA protein; InterPro: IPR007285 Chlamydia trachomatis is an obligate intracellular bacterium that develops within a parasitophorous vacuole termed an inclusion. The inclusion is nonfusogenic with lysosomes but intercepts lipids from a host cell exocytic pathway. Initiation of chlamydial development is concurrent with modification of the inclusion membrane by a set of C. trachomatis-encoded proteins collectively designated Incs. One of these Incs, IncA (Inclusion membrane protein A), is functionally associated with the homotypic fusion of inclusions [].
Probab=32.72 E-value=2.4e+02 Score=26.34 Aligned_cols=27 Identities=11% Similarity=0.282 Sum_probs=11.3
Q ss_pred chhhhhhHHHHHHHHHHHHHhhhhhhc
Q 008936 162 ASAISNTVKHLYDQLQSQIRTSYDRIR 188 (548)
Q Consensus 162 AqSIvsnvKvLf~hLq~kL~~~e~klk 188 (548)
...-+...+.-+++++..+....+++.
T Consensus 156 ~~~~~~~~~~~~~~~~~~~~~l~~~~~ 182 (191)
T PF04156_consen 156 SREEVQELRSQLERLQENLQQLEEKIQ 182 (191)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333444444444444444444444333
No 37
>PRK02119 hypothetical protein; Provisional
Probab=32.61 E-value=98 Score=26.17 Aligned_cols=37 Identities=22% Similarity=0.382 Sum_probs=27.7
Q ss_pred cchhhhHHhHhHH--HHHHhHHhhHHHHHHHHHHHHhhhhHh
Q 008936 84 NEKYTLERKVSEL--RMAIDEKQNEAITSALNELARRKGVLE 123 (548)
Q Consensus 84 nEKyvLErRiA~m--RmAfDqqQqdlVdAASKaLSyRQdiiE 123 (548)
+|-..+|.||.+| |+|| |.+.|+.=++++..=|..|.
T Consensus 2 ~~~~~~e~Ri~~LE~rla~---QE~tie~LN~~v~~Qq~~id 40 (73)
T PRK02119 2 QIQQNLENRIAELEMKIAF---QENLLEELNQALIEQQFVID 40 (73)
T ss_pred cchHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHH
Confidence 3455688888764 7777 88999999999887776553
No 38
>PRK02793 phi X174 lysis protein; Provisional
Probab=32.04 E-value=1e+02 Score=25.92 Aligned_cols=33 Identities=24% Similarity=0.332 Sum_probs=25.2
Q ss_pred hhHHhHhHH--HHHHhHHhhHHHHHHHHHHHHhhhhHh
Q 008936 88 TLERKVSEL--RMAIDEKQNEAITSALNELARRKGVLE 123 (548)
Q Consensus 88 vLErRiA~m--RmAfDqqQqdlVdAASKaLSyRQdiiE 123 (548)
.+|.||.+| |+|| |++.|+.=++++..=|..|.
T Consensus 5 ~~e~Ri~~LE~~laf---Qe~tIe~Ln~~v~~Qq~~I~ 39 (72)
T PRK02793 5 SLEARLAELESRLAF---QEITIEELNVTVTAHEMEMA 39 (72)
T ss_pred hHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHH
Confidence 377777754 7777 88999999999988775553
No 39
>TIGR01795 CM_mono_cladeE monofunctional chorismate mutase, alpha proteobacterial type. The alpha proteobacterial members are trusted because the pathways of CM are evident and there is only one plausible CM in the genome. In S. coelicolor, however, there is another aparrent monofunctional CM.
Probab=31.50 E-value=3.6e+02 Score=23.70 Aligned_cols=84 Identities=23% Similarity=0.286 Sum_probs=53.7
Q ss_pred HhHhHHHHHHhHHhhHHHHHHHHHHHHhhhhHhhhhhhh--hhhhhhhhhhHHH-HHHhHHHhhhcCCCCCcccchhhhh
Q 008936 91 RKVSELRMAIDEKQNEAITSALNELARRKGVLEENLKLA--HDLKVAEDERYFF-MSSMLGLLADYGLWPHVTNASAISN 167 (548)
Q Consensus 91 rRiA~mRmAfDqqQqdlVdAASKaLSyRQdiiEENiRLt--yalq~aeQERstF-vSSLLpLLaEY~LqP~V~DAqSIvs 167 (548)
++++++|-.+|+-=..||+- |+.|..+..+--++- ..+-+-+.+|-.- +..+.-+..++||.|.+
T Consensus 3 ~~L~~lR~~ID~ID~qLv~L----L~~R~~~~~~ia~~K~~~~~~v~dp~Re~~vl~~~~~~a~~~gl~p~~-------- 70 (94)
T TIGR01795 3 AELKALRQSIDNIDAAVIHM----LAERFKCTSQVGVLKANAGLAPADPAREDYQIARLRRLAIDAGLDPEF-------- 70 (94)
T ss_pred hhHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHhCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHH--------
Confidence 35677787777766666654 555555555544333 2344444555433 35678888999999985
Q ss_pred hHHHHHHHHHHHHHhhhhhh
Q 008936 168 TVKHLYDQLQSQIRTSYDRI 187 (548)
Q Consensus 168 nvKvLf~hLq~kL~~~e~kl 187 (548)
+..||+.+.+.....++++
T Consensus 71 -~e~i~~~i~~esir~q~~~ 89 (94)
T TIGR01795 71 -AEKFLNFIVTEVIKHHERI 89 (94)
T ss_pred -HHHHHHHHHHHHHHHHHHH
Confidence 3677888888777666553
No 40
>TIGR02169 SMC_prok_A chromosome segregation protein SMC, primarily archaeal type. SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
Probab=31.18 E-value=4.1e+02 Score=30.79 Aligned_cols=18 Identities=6% Similarity=0.075 Sum_probs=14.0
Q ss_pred ccccCccchHHHHHHhhc
Q 008936 398 KIKCDLGMQSEIDAYISR 415 (548)
Q Consensus 398 kI~cdpeMq~~I~~~l~~ 415 (548)
-|.||++....|+..+..
T Consensus 605 ~i~~~~~~~~~~~~~lg~ 622 (1164)
T TIGR02169 605 LVEFDPKYEPAFKYVFGD 622 (1164)
T ss_pred HccCcHHHHHHHHHHCCC
Confidence 388999988888877654
No 41
>PRK11020 hypothetical protein; Provisional
Probab=30.54 E-value=77 Score=29.74 Aligned_cols=40 Identities=28% Similarity=0.439 Sum_probs=27.5
Q ss_pred HHHHHHHHHhhh-----hhhhhcchhhhHHhHhHHHHHHhHHhhHHHH
Q 008936 67 SLRQQIAVACLK-----ELQLQNEKYTLERKVSELRMAIDEKQNEAIT 109 (548)
Q Consensus 67 ~LRk~LadasvK-----E~QllnEKyvLErRiA~mRmAfDqqQqdlVd 109 (548)
.+|.+||.+... =+|+..|+..|+++|+.|+= +|.++|-.
T Consensus 16 ~~~~Klaaa~~rgd~~~i~qf~~E~~~l~k~I~~lk~---~~~~~lsk 60 (118)
T PRK11020 16 AIRHKLAAASLRGDAEKYAQFEKEKATLEAEIARLKE---VQSQKLSK 60 (118)
T ss_pred HHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHH
Confidence 345566655443 37899999999999999983 44444433
No 42
>PRK14139 heat shock protein GrpE; Provisional
Probab=30.05 E-value=4.1e+02 Score=26.34 Aligned_cols=64 Identities=19% Similarity=0.231 Sum_probs=41.4
Q ss_pred HHHHHHhHHHhhhc--CCCCCcccchhhhhhHHHHHHHHHHHHHhhhhhhcccccCCCCCCCCCCcCCCCCCCC
Q 008936 140 YFFMSSMLGLLADY--GLWPHVTNASAISNTVKHLYDQLQSQIRTSYDRIRDLTREGGTDAGAGSIDTDRHGVP 211 (548)
Q Consensus 140 stFvSSLLpLLaEY--~LqP~V~DAqSIvsnvKvLf~hLq~kL~~~e~klke~q~~pg~~~~~~~~~~~~~q~p 211 (548)
.-|+..|||.+-.+ -|.=.-.+..+|+.+|+-.+++|..-|. ...|+++. ..| ..++|+.|+--
T Consensus 82 ~~~~~~LLpv~DnLerAl~~~~~~~~~l~~Gv~mi~k~l~~vL~--k~Gv~~I~-~~G-----~~FDP~~HEAv 147 (185)
T PRK14139 82 ESFAESLLPVKDSLEAALADESGDLEKLREGVELTLKQLTSAFE--KGRVVEIN-PVG-----EKFDPHQHQAI 147 (185)
T ss_pred HHHHHHHhhHHhHHHHHHhcccchHHHHHHHHHHHHHHHHHHHH--HCCCceeC-CCC-----CCCChHHhhee
Confidence 45888899988777 2221113567899999999999888774 44555555 223 24667666543
No 43
>cd05827 Sortase_C_3 Sortase C (SrtC) or subfamily-3 sortases are membrane cysteine transpeptidases found in Gram-positive bacteria that anchor surface proteins to peptidoglycans of the bacterial cell wall envelope. This involves a transpeptidation reaction in which the surface protein substrate is cleaved at a conserved cell wall sorting signal and covalently linked to peptidoglycan for display on the bacterial surface. Sortases are grouped into different classes and subfamilies based on sequence, membrane topology, genomic positioning, and cleavage site preference. Sortase C cleaves surface proteins at the LPXTG motif between Thr and Gly and catalyzes the formation of an amide bond between the carboxyl group of Thr and the amino group of cell-wall cross bridges. It is a narrow-range enzyme required for anchoring a few substrates. It plays a role in Streptococcus pneumoniae pathogenesis and is required in the assembly of pili on the surface of Corynebacterium diphtheriae. Sortase C is
Probab=29.40 E-value=53 Score=29.28 Aligned_cols=44 Identities=20% Similarity=0.256 Sum_probs=29.8
Q ss_pred CcceeeeccccCCccceeeEEEEeccCCccccccCC-CCCcceecc
Q 008936 321 GEKLLGCGYPVRGTTLCMFQWVRHLQDGTRHYIEGA-TNPEYVVTA 365 (548)
Q Consensus 321 G~~L~acGy~inGTt~C~FQWVRhleDGs~~~IeGA-~~p~Y~vTA 365 (548)
|+-..-.||... ....+|.++..|+.|..-+|.-. ....|.|++
T Consensus 44 ~~N~viaGH~~~-~~~~~F~~L~~l~~gd~i~l~t~~~~~~Y~V~~ 88 (131)
T cd05827 44 GTHSVITGHRGL-PNAKLFTDLDKLKKGDKFYIHVLGETLAYQVDQ 88 (131)
T ss_pred CCEEEEEeCCCC-CCCcccCCHHHCCCCCEEEEEECCeEEEEEEEE
Confidence 345777888731 23459999999999998777733 334565554
No 44
>COG1196 Smc Chromosome segregation ATPases [Cell division and chromosome partitioning]
Probab=29.40 E-value=4e+02 Score=32.32 Aligned_cols=61 Identities=15% Similarity=0.130 Sum_probs=44.2
Q ss_pred hccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHHHHhHHhhHHHHHHHHHHHHhh
Q 008936 59 RMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMAIDEKQNEAITSALNELARRK 119 (548)
Q Consensus 59 r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRmAfDqqQqdlVdAASKaLSyRQ 119 (548)
.+.++.+..+|..+++.--..+++.++...|++.+.+++...+.....+-+.-.+-..-+.
T Consensus 372 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~~~ 432 (1163)
T COG1196 372 EELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEA 432 (1163)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4456666788888888888888889999999999988888777776666554444333333
No 45
>PF11932 DUF3450: Protein of unknown function (DUF3450); InterPro: IPR016866 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function. However, they are found in an operon along with components of a TonB transport system (typified by Vibrio cholerae TonB2 [], and are predicted to be localized to the periplasmic space. Caution: the low-complexity nature of these sequences produces spurious BLAST hits to chromosome segregation ATPases (which are much longer in length and contain canonical Walker motifs). Accordingly, some members are misidentified as such.
Probab=29.22 E-value=5.3e+02 Score=25.59 Aligned_cols=84 Identities=23% Similarity=0.233 Sum_probs=40.1
Q ss_pred HHHHHHHHhhhhhhhhcchhhhHHhHhHHHHHHhHHhhHHHHHHHHHHHHhhhhHhh----hhhhhhhhhhhhhhhHHHH
Q 008936 68 LRQQIAVACLKELQLQNEKYTLERKVSELRMAIDEKQNEAITSALNELARRKGVLEE----NLKLAHDLKVAEDERYFFM 143 (548)
Q Consensus 68 LRk~LadasvKE~QllnEKyvLErRiA~mRmAfDqqQqdlVdAASKaLSyRQdiiEE----NiRLtyalq~aeQERstFv 143 (548)
..+++.+..-...+++.|...|++.+.-|+. |-++.+..|++.-+.+..-+.=|++ .-.|.=-+...-++...||
T Consensus 40 sQ~~id~~~~e~~~L~~e~~~l~~e~e~L~~-~~~~l~~~v~~q~~el~~L~~qi~~~~~~~~~l~p~m~~m~~~L~~~v 118 (251)
T PF11932_consen 40 SQKRIDQWDDEKQELLAEYRQLEREIENLEV-YNEQLERQVASQEQELASLEQQIEQIEETRQELVPLMEQMIDELEQFV 118 (251)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344444444555566666666666666664 4555555555555444433222221 1112222333344444555
Q ss_pred HHhHHHhhh
Q 008936 144 SSMLGLLAD 152 (548)
Q Consensus 144 SSLLpLLaE 152 (548)
..-+|++.+
T Consensus 119 ~~d~Pf~~~ 127 (251)
T PF11932_consen 119 ELDLPFLLE 127 (251)
T ss_pred hcCCCCChH
Confidence 555555544
No 46
>PF14048 MBD_C: C-terminal domain of methyl-CpG binding protein 2 and 3; PDB: 2L2L_B.
Probab=29.22 E-value=40 Score=30.07 Aligned_cols=20 Identities=25% Similarity=0.381 Sum_probs=17.6
Q ss_pred HHhhccHHHHHHHHHHHHHH
Q 008936 56 SRARMQKEEIHSLRQQIAVA 75 (548)
Q Consensus 56 ~k~r~~eeEI~~LRk~Lada 75 (548)
.-+|.||+.....|+|||+|
T Consensus 76 eDIr~QE~rVk~aR~RLaeA 95 (96)
T PF14048_consen 76 EDIRRQERRVKKARKRLAEA 95 (96)
T ss_dssp HHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 34689999999999999987
No 47
>PF01025 GrpE: GrpE; InterPro: IPR000740 Molecular chaperones are a diverse family of proteins that function to protect proteins in the intracellular milieu from irreversible aggregation during synthesis and in times of cellular stress. The bacterial molecular chaperone DnaK is an enzyme that couples cycles of ATP binding, hydrolysis, and ADP release by an N-terminal ATP-hydrolysing domain to cycles of sequestration and release of unfolded proteins by a C-terminal substrate binding domain. In prokaryotes the grpE protein. Dimeric GrpE is the co-chaperone for DnaK, and acts as a nucleotide exchange factor, stimulating the rate of ADP release 5000-fold []. DnaK is itself a weak ATPase; ATP hydrolysis by DnaK is stimulated by its interaction with another co-chaperone, DnaJ. Thus the co-chaperones DnaJ and GrpE are capable of tightly regulating the nucleotide-bound and substrate-bound state of DnaK in ways that are necessary for the normal housekeeping functions and stress-related functions of the DnaK molecular chaperone cycle. The X-ray crystal structure of GrpE in complex with the ATPase domain of DnaK revealed that GrpE is an asymmetric homodimer, bent in a manner that favours extensive contacts with only one DnaKATPase monomer []. GrpE does not actively compete for the atomic positions occupied by the nucleotide. GrpE and ADP mutually reduce one another's affinity for DnaK 200-fold, and ATP instantly dissociates GrpE from DnaK.; GO: 0000774 adenyl-nucleotide exchange factor activity, 0042803 protein homodimerization activity, 0051087 chaperone binding, 0006457 protein folding; PDB: 3A6M_A 4ANI_A 1DKG_B.
Probab=28.47 E-value=2.3e+02 Score=25.88 Aligned_cols=53 Identities=9% Similarity=0.187 Sum_probs=33.3
Q ss_pred hhhHHHHHHhHHHhhhc----CCCCCcccchhhhhhHHHHHHHHHHHHHhhhhhhcccc
Q 008936 137 DERYFFMSSMLGLLADY----GLWPHVTNASAISNTVKHLYDQLQSQIRTSYDRIRDLT 191 (548)
Q Consensus 137 QERstFvSSLLpLLaEY----~LqP~V~DAqSIvsnvKvLf~hLq~kL~~~e~klke~q 191 (548)
....-|+..|||++.-+ .--+.-.+..++..+++.++++|..-|. .-.+.++.
T Consensus 58 ~~~~~~~~~ll~v~D~l~~a~~~~~~~~~~~~~~~g~~~~~~~l~~~L~--~~Gv~~i~ 114 (165)
T PF01025_consen 58 YALEKFLKDLLPVLDNLERALEAAKSNEEEESLLEGLEMILKQLEDILE--KNGVEEIE 114 (165)
T ss_dssp CCHHHHHHHHHHHHHHHHHHHCC-SHHCTCHHHHHHHHHHHHHHHHHHH--TTTEEEE-
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhccchHHHHHHHHHHHHHHHHHHHH--HCCCEecC
Confidence 34556788899988766 2222334567888888888888876663 33444444
No 48
>PF14389 Lzipper-MIP1: Leucine-zipper of ternary complex factor MIP1
Probab=27.88 E-value=83 Score=27.31 Aligned_cols=36 Identities=19% Similarity=0.177 Sum_probs=26.8
Q ss_pred CccccchhhhhHHHHhhccHHHHHHHHHHHHHHhhh
Q 008936 43 FISFQDREAMELYSRARMQKEEIHSLRQQIAVACLK 78 (548)
Q Consensus 43 ~~~~~d~~~~~L~~k~r~~eeEI~~LRk~LadasvK 78 (548)
.....++.+++|+.-+-.-|+||..|-+++-+.-.+
T Consensus 48 ~~~~lp~~~keLL~EIA~lE~eV~~LE~~v~~L~~~ 83 (88)
T PF14389_consen 48 SPSSLPKKAKELLEEIALLEAEVAKLEQKVLSLYRQ 83 (88)
T ss_pred ccccCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345788999999888888888888777776654433
No 49
>PRK14146 heat shock protein GrpE; Provisional
Probab=27.64 E-value=6.4e+02 Score=25.46 Aligned_cols=64 Identities=11% Similarity=0.103 Sum_probs=42.5
Q ss_pred HHHHHHhHHHhhhcC--C--CCCcccchhhhhhHHHHHHHHHHHHHhhhhhhcccccCCCCCCCCCCcCCCCCCCC
Q 008936 140 YFFMSSMLGLLADYG--L--WPHVTNASAISNTVKHLYDQLQSQIRTSYDRIRDLTREGGTDAGAGSIDTDRHGVP 211 (548)
Q Consensus 140 stFvSSLLpLLaEY~--L--qP~V~DAqSIvsnvKvLf~hLq~kL~~~e~klke~q~~pg~~~~~~~~~~~~~q~p 211 (548)
..|+..|||.|--+. | -+.-.+..+|+.+|+-.+++|..=|. ..-|+++. +-| ..++|+.|+-.
T Consensus 104 e~~~~~lLpv~DnlerAl~~~~~~~~~~~l~~Gv~mi~k~l~~~L~--k~Gv~~i~-~~G-----~~FDP~~HeAv 171 (215)
T PRK14146 104 KSLVSGFLNPIDNLERVGATQNQSEELKPFVEGVKMILKEFYSVLE--KSNVIRFD-PKG-----EPFDPMSMEAL 171 (215)
T ss_pred HHHHHHHhhHHhHHHHHHhcccccchhhHHHHHHHHHHHHHHHHHH--HCcCeeeC-CCC-----CCCChhHhcee
Confidence 468999999998873 2 22234568899999999999988874 34444443 112 23667777543
No 50
>PRK06342 transcription elongation factor regulatory protein; Validated
Probab=27.62 E-value=80 Score=30.17 Aligned_cols=42 Identities=12% Similarity=0.073 Sum_probs=24.5
Q ss_pred hhhHHHHhhccHHHHHHHHHHHHHHh----hhhhhhhcchhhhHHhHhHHH
Q 008936 51 AMELYSRARMQKEEIHSLRQQIAVAC----LKELQLQNEKYTLERKVSELR 97 (548)
Q Consensus 51 ~~~L~~k~r~~eeEI~~LRk~Ladas----vKE~QllnEKyvLErRiA~mR 97 (548)
|.+.|.+. ++|+..|++.|++|. +.| =..+...+|+||+||.
T Consensus 32 T~~G~~~L---~~El~~L~~~i~~Ar~~GDlsE--ak~~~~~~e~rI~~L~ 77 (160)
T PRK06342 32 TEAGLKAL---EDQLAQARAAYEAAQAIEDVNE--RRRQMARPLRDLRYLA 77 (160)
T ss_pred CHHHHHHH---HHHHHHHHHHHHHHHHCCChhH--HHHHHHHHHHHHHHHH
Confidence 55555554 678889988877773 334 1223344555555553
No 51
>PF00038 Filament: Intermediate filament protein; InterPro: IPR016044 Intermediate filaments (IF) [, , ] are proteins which are primordial components of the cytoskeleton and the nuclear envelope. They generally form filamentous structures 8 to 14 nm wide. IF proteins are members of a very large multigene family of proteins which has been subdivided in five major subgroups: Type I: Acidic cytokeratins. Type II: Basic cytokeratins. Type III: Vimentin, desmin, glial fibrillary acidic protein (GFAP), peripherin, and plasticin. Type IV: Neurofilaments L, H and M, alpha-internexin and nestin. Type V: Nuclear lamins A, B1, B2 and C. All IF proteins are structurally similar in that they consist of: a central rod domain comprising some 300 to 350 residues which is arranged in coiled-coiled alpha-helices, with at least two short characteristic interruptions; a N-terminal non-helical domain (head) of variable length; and a C-terminal domain (tail) which is also non-helical, and which shows extreme length variation between different IF proteins. While IF proteins are evolutionary and structurally related, they have limited sequence homologies except in several regions of the rod domain. This entry represents the central rod domain found in IF proteins.; PDB: 3TNU_B 3KLT_D 1GK4_F 3TRT_A 3G1E_A 3UF1_C 1GK6_B 1GK7_A 3TYY_B 3V4W_A ....
Probab=27.41 E-value=3.8e+02 Score=26.83 Aligned_cols=50 Identities=32% Similarity=0.419 Sum_probs=41.0
Q ss_pred hhccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHHHHhHHhhHH
Q 008936 58 ARMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMAIDEKQNEA 107 (548)
Q Consensus 58 ~r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRmAfDqqQqdl 107 (548)
+....+|+..+|+.+.....+=..|.+.+..||++|++|.--++.+.++.
T Consensus 211 ~~~~~~E~~~~r~~~~~l~~el~~l~~~~~~Le~~l~~le~~~~~~~~~~ 260 (312)
T PF00038_consen 211 LESAKEELKELRRQIQSLQAELESLRAKNASLERQLRELEQRLDEEREEY 260 (312)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred cchhHhHHHHHHhhhhHhhhhhhccccchhhhhhhHHHHHHHHHHHHHHH
Confidence 35567888899999998888888899999999999999987777665543
No 52
>PRK00295 hypothetical protein; Provisional
Probab=27.23 E-value=1.4e+02 Score=24.86 Aligned_cols=31 Identities=32% Similarity=0.499 Sum_probs=21.5
Q ss_pred hHHhHhH--HHHHHhHHhhHHHHHHHHHHHHhhhhH
Q 008936 89 LERKVSE--LRMAIDEKQNEAITSALNELARRKGVL 122 (548)
Q Consensus 89 LErRiA~--mRmAfDqqQqdlVdAASKaLSyRQdii 122 (548)
+|.||.+ +|+|| |++.|+.=++++..-|..|
T Consensus 3 ~e~Ri~~LE~kla~---qE~tie~Ln~~v~~Qq~~I 35 (68)
T PRK00295 3 LEERVTELESRQAF---QDDTIQALNDVLVEQQRVI 35 (68)
T ss_pred HHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHH
Confidence 5566664 36666 7788888888887777444
No 53
>PF07989 Microtub_assoc: Microtubule associated; InterPro: IPR012943 Proteins with this domain associate with the spindle body during cell division [].
Probab=26.84 E-value=1.5e+02 Score=25.35 Aligned_cols=63 Identities=21% Similarity=0.186 Sum_probs=32.9
Q ss_pred hhccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHHHHhHHhhHHHHHHHHHHHHhhhhHhhhhhhhhhhhhhhh
Q 008936 58 ARMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMAIDEKQNEAITSALNELARRKGVLEENLKLAHDLKVAED 137 (548)
Q Consensus 58 ~r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRmAfDqqQqdlVdAASKaLSyRQdiiEENiRLtyalq~aeQ 137 (548)
.|+||+.|..|+|- +.++| =||-.|. ++.. ...=..-++++.|||-|--.+...+.
T Consensus 2 lrEqe~~i~~L~KE--NF~LK------------LrI~fLe-----e~l~-----~~~~~~~~~~~keNieLKve~~~L~~ 57 (75)
T PF07989_consen 2 LREQEEQIDKLKKE--NFNLK------------LRIYFLE-----ERLQ-----KLGPESIEELLKENIELKVEVESLKR 57 (75)
T ss_pred HHHHHHHHHHHHHh--hhhHH------------HHHHHHH-----HHHH-----hcccccHHHHHHHHHHHHHHHHHHHH
Confidence 57888888888873 23333 2333322 2222 11123345667777777666665555
Q ss_pred hhHHHHH
Q 008936 138 ERYFFMS 144 (548)
Q Consensus 138 ERstFvS 144 (548)
|-.-.-.
T Consensus 58 el~~~~~ 64 (75)
T PF07989_consen 58 ELQEKKK 64 (75)
T ss_pred HHHHHHH
Confidence 5444333
No 54
>PF04849 HAP1_N: HAP1 N-terminal conserved region; InterPro: IPR006933 This family is defined by an N-terminal conserved region found in several huntingtin-associated protein 1 (HAP1) homologues. HAP1 binds to huntingtin in a polyglutamine repeat-length-dependent manner. However, its possible role in the pathogenesis of Huntingtons disease is unclear. This family also includes a similar N-terminal conserved region from hypothetical protein products of ALS2CR3 genes found in the human juvenile amyotrophic lateral sclerosis critical region 2q33-2q34 [].
Probab=26.58 E-value=1.4e+02 Score=31.83 Aligned_cols=48 Identities=31% Similarity=0.275 Sum_probs=38.9
Q ss_pred hccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHHHHhHHhhH
Q 008936 59 RMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMAIDEKQNE 106 (548)
Q Consensus 59 r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRmAfDqqQqd 106 (548)
..|.|||.+|..+|+|.--|.-|+-.|+..|=..++..+-+=.+-|-+
T Consensus 230 ~rQQEEIt~LlsqivdlQ~r~k~~~~EnEeL~q~L~~ske~Q~~L~aE 277 (306)
T PF04849_consen 230 RRQQEEITSLLSQIVDLQQRCKQLAAENEELQQHLQASKESQRQLQAE 277 (306)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHH
Confidence 567899999999999999999999999999988887776554333333
No 55
>PRK14149 heat shock protein GrpE; Provisional
Probab=26.27 E-value=3.8e+02 Score=26.74 Aligned_cols=64 Identities=13% Similarity=0.122 Sum_probs=43.5
Q ss_pred hHHHHHHhHHHhhhcC----CCCCcccchhhhhhHHHHHHHHHHHHHhhhhhhcccccCCCCCCCCCCcCCCCCCCC
Q 008936 139 RYFFMSSMLGLLADYG----LWPHVTNASAISNTVKHLYDQLQSQIRTSYDRIRDLTREGGTDAGAGSIDTDRHGVP 211 (548)
Q Consensus 139 RstFvSSLLpLLaEY~----LqP~V~DAqSIvsnvKvLf~hLq~kL~~~e~klke~q~~pg~~~~~~~~~~~~~q~p 211 (548)
..-|+..|||.|-.+. .-+...+..+|+.+|+-.+++|..-|. ..-|+++. | .| .++|+.|+.-
T Consensus 85 ~~~~~~~LLpVlDnLerAl~~~~~~~~~~~l~~Gv~mi~k~l~~vL~--k~GV~~I~--~---~G--~FDP~~HEAv 152 (191)
T PRK14149 85 YEKIALDLLPVIDALLGALKSAAEVDKESALTKGLELTMEKLHEVLA--RHGIEGIE--C---LE--EFDPNFHNAI 152 (191)
T ss_pred HHHHHHHHhhHHhHHHHHHhccccccchHHHHHHHHHHHHHHHHHHH--HCCCEEeC--C---CC--CCChHHhhee
Confidence 3468999999998872 223345667899999999999988775 33444443 2 23 3888777643
No 56
>PF04012 PspA_IM30: PspA/IM30 family; InterPro: IPR007157 This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins in many Gram-negative pathogenic bacteria and its homologue in plants, VIPP1, plays a critical role in thylakoid biogenesis, essential for photosynthesis. Activation of transcription by the enhancer-dependent bacterial sigma54-containing RNA polymerase occurs through ATP hydrolysis-driven protein conformational changes enabled by activator proteins that belong to the large AAA(+) mechanochemical protein family. It has been shown that PspA directly and specifically acts upon and binds to the AAA(+) domain of the PspF transcription activator [].
Probab=25.67 E-value=1.2e+02 Score=29.07 Aligned_cols=48 Identities=29% Similarity=0.279 Sum_probs=34.9
Q ss_pred hhhHHHHhhccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHH
Q 008936 51 AMELYSRARMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRM 98 (548)
Q Consensus 51 ~~~L~~k~r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRm 98 (548)
..+...+....++++..|...+..+.-...+|.+....||.||.+|+-
T Consensus 86 Ar~al~~k~~~e~~~~~l~~~~~~~~~~~~~l~~~l~~l~~kl~e~k~ 133 (221)
T PF04012_consen 86 AREALQRKADLEEQAERLEQQLDQAEAQVEKLKEQLEELEAKLEELKS 133 (221)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344455667777777778888888777777777777777777777763
No 57
>PF09726 Macoilin: Transmembrane protein; InterPro: IPR019130 This entry represents the multi-pass transmembrane protein Macoilin, which is highly conserved in eukaryotes. ; GO: 0016021 integral to membrane
Probab=25.59 E-value=1.8e+02 Score=33.83 Aligned_cols=67 Identities=33% Similarity=0.419 Sum_probs=46.8
Q ss_pred HHHHhhccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHHHH--hHHhhHHHHHHHHHHHHhhhhHhhhhh
Q 008936 54 LYSRARMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMAI--DEKQNEAITSALNELARRKGVLEENLK 127 (548)
Q Consensus 54 L~~k~r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRmAf--DqqQqdlVdAASKaLSyRQdiiEENiR 127 (548)
+=.|.++.|.||.+||.-|. .||-| +..||..+.+||..- +++-.+..=+|+-||--+-.+||+|+.
T Consensus 543 ~r~r~~~lE~E~~~lr~elk---~kee~----~~~~e~~~~~lr~~~~e~~~~~e~L~~aL~amqdk~~~LE~sLs 611 (697)
T PF09726_consen 543 CRQRRRQLESELKKLRRELK---QKEEQ----IRELESELQELRKYEKESEKDTEVLMSALSAMQDKNQHLENSLS 611 (697)
T ss_pred HHHHHHHHHHHHHHHHHHHH---HHHHH----HHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHhhh
Confidence 55677888999999987553 56654 467899899999852 244456666777777766667776654
No 58
>PRK11281 hypothetical protein; Provisional
Probab=25.58 E-value=1.9e+02 Score=35.57 Aligned_cols=61 Identities=26% Similarity=0.317 Sum_probs=44.9
Q ss_pred hhhhhHHHHhhccHHHHHHHHHHHHHHhh------------hh---------hhhhcchhhhHHhHhHHHHHHhHHhhHH
Q 008936 49 REAMELYSRARMQKEEIHSLRQQIAVACL------------KE---------LQLQNEKYTLERKVSELRMAIDEKQNEA 107 (548)
Q Consensus 49 ~~~~~L~~k~r~~eeEI~~LRk~Ladasv------------KE---------~QllnEKyvLErRiA~mRmAfDqqQqdl 107 (548)
..+.++|.++..++++...|+++|++|-- |+ ..-++. ..||.++++..-.-.+.|.+|
T Consensus 66 ~~tL~~L~qi~~~~~~~~~L~k~l~~Ap~~l~~a~~~Le~Lk~~~~~~~~~~~~~~Sl-~qLEq~L~q~~~~Lq~~Q~~L 144 (1113)
T PRK11281 66 EQTLALLDKIDRQKEETEQLKQQLAQAPAKLRQAQAELEALKDDNDEETRETLSTLSL-RQLESRLAQTLDQLQNAQNDL 144 (1113)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHhhccccccccccccccCH-HHHHHHHHHHHHHHHHHHHHH
Confidence 35678899999999999999999888531 21 111122 349999999988888888888
Q ss_pred HHH
Q 008936 108 ITS 110 (548)
Q Consensus 108 VdA 110 (548)
-++
T Consensus 145 a~~ 147 (1113)
T PRK11281 145 AEY 147 (1113)
T ss_pred HHH
Confidence 776
No 59
>PRK14151 heat shock protein GrpE; Provisional
Probab=25.52 E-value=6.3e+02 Score=24.69 Aligned_cols=64 Identities=11% Similarity=0.179 Sum_probs=42.1
Q ss_pred HHHHHHhHHHhhhcC--CC--CCcc-cchhhhhhHHHHHHHHHHHHHhhhhhhcccccCCCCCCCCCCcCCCCCCCC
Q 008936 140 YFFMSSMLGLLADYG--LW--PHVT-NASAISNTVKHLYDQLQSQIRTSYDRIRDLTREGGTDAGAGSIDTDRHGVP 211 (548)
Q Consensus 140 stFvSSLLpLLaEY~--Lq--P~V~-DAqSIvsnvKvLf~hLq~kL~~~e~klke~q~~pg~~~~~~~~~~~~~q~p 211 (548)
.-|+..|||.+-.+. |. ++-. ...+|+.+|+-.+++|..-|. ..-|+++. ..| ..++|+.|+--
T Consensus 70 ~~~~~~LLpv~DnlerAl~~~~~~~~~~~~~~~Gv~mi~k~l~~~L~--k~Gv~~i~-~~G-----~~FDP~~HEAv 138 (176)
T PRK14151 70 EKFAGDLLPVVDSLERGLELSSADDEAIKPMREGVELTLKMFQDTLK--RYQLEAVD-PHG-----EPFNPEHHQAM 138 (176)
T ss_pred HHHHHHHhhHHhHHHHHHhcccccchhHHHHHHHHHHHHHHHHHHHH--HCCCEEeC-CCC-----CCCCHHHhhcc
Confidence 468899999988772 22 2222 247899999999999988774 44556555 223 24667666543
No 60
>PRK11281 hypothetical protein; Provisional
Probab=25.45 E-value=3.4e+02 Score=33.44 Aligned_cols=71 Identities=21% Similarity=0.390 Sum_probs=51.1
Q ss_pred hhHHhHhHHHHHH---hHHhhHH------H----------------HHHHHHHHHhhhhHhhhhh-------hhhhhhhh
Q 008936 88 TLERKVSELRMAI---DEKQNEA------I----------------TSALNELARRKGVLEENLK-------LAHDLKVA 135 (548)
Q Consensus 88 vLErRiA~mRmAf---DqqQqdl------V----------------dAASKaLSyRQdiiEENiR-------Ltyalq~a 135 (548)
.|.++||++|+.- .||+..| + ++..+-|.-|+++|++=++ +.=.||.+
T Consensus 357 ~l~~~iAdlrl~~f~~~q~~~~l~~~~~y~~~l~~~~~~~~~~~~~~~l~~ll~~r~~LL~~l~~~~~~~l~~~~~l~~~ 436 (1113)
T PRK11281 357 GLADRIADLRLEQFEINQQRDALFQPDAYIDKLEAGHKSEVTDEVRDALLQLLDERRELLDQLNKQLNNQLNLAINLQLN 436 (1113)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHhcChhhHHHHHHhcccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3889999999763 2222111 1 2344678889999988333 45577888
Q ss_pred hhhhHHHHHHhHHHhhhcCCCCC
Q 008936 136 EDERYFFMSSMLGLLADYGLWPH 158 (548)
Q Consensus 136 eQERstFvSSLLpLLaEY~LqP~ 158 (548)
+|+-..-+.+|=-.|.|.-+|=+
T Consensus 437 q~Ql~~~~~~l~~~L~~~lfWv~ 459 (1113)
T PRK11281 437 QQQLLSVSDSLQSTLTQQIFWVN 459 (1113)
T ss_pred HHHHHHHHHHHHHHHHHhhhccC
Confidence 99998889999999999988844
No 61
>COG1579 Zn-ribbon protein, possibly nucleic acid-binding [General function prediction only]
Probab=25.20 E-value=3.5e+02 Score=27.88 Aligned_cols=47 Identities=21% Similarity=0.235 Sum_probs=30.3
Q ss_pred HHhhccHHHHHHHHHHHHHHhhhh---------hhhhcchhhhHHhHhHHHHHHhH
Q 008936 56 SRARMQKEEIHSLRQQIAVACLKE---------LQLQNEKYTLERKVSELRMAIDE 102 (548)
Q Consensus 56 ~k~r~~eeEI~~LRk~LadasvKE---------~QllnEKyvLErRiA~mRmAfDq 102 (548)
..++..+.||.-+|+|++++..|= ..|-.|.+++++|++.|+=...+
T Consensus 59 ~qv~~~e~ei~~~r~r~~~~e~kl~~v~~~~e~~aL~~E~~~ak~r~~~le~el~~ 114 (239)
T COG1579 59 NQVSQLESEIQEIRERIKRAEEKLSAVKDERELRALNIEIQIAKERINSLEDELAE 114 (239)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345667788888888887775542 13555667777777666655443
No 62
>PRK04406 hypothetical protein; Provisional
Probab=24.73 E-value=1.7e+02 Score=24.98 Aligned_cols=32 Identities=22% Similarity=0.489 Sum_probs=24.0
Q ss_pred hhHHhHhHH--HHHHhHHhhHHHHHHHHHHHHhhhhH
Q 008936 88 TLERKVSEL--RMAIDEKQNEAITSALNELARRKGVL 122 (548)
Q Consensus 88 vLErRiA~m--RmAfDqqQqdlVdAASKaLSyRQdii 122 (548)
.||.||.+| |+|| |++.|+.=++++..-|..|
T Consensus 8 ~le~Ri~~LE~~lAf---QE~tIe~LN~~v~~Qq~~I 41 (75)
T PRK04406 8 QLEERINDLECQLAF---QEQTIEELNDALSQQQLLI 41 (75)
T ss_pred HHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHH
Confidence 467777754 6776 7889999888888777444
No 63
>PRK14160 heat shock protein GrpE; Provisional
Probab=23.64 E-value=6.2e+02 Score=25.64 Aligned_cols=63 Identities=10% Similarity=0.176 Sum_probs=42.4
Q ss_pred hHHHHHHhHHHhhhc--CCCCCcccchhhhhhHHHHHHHHHHHHHhhhhhhcccccCCCCCCCCCCcCCCCCCCC
Q 008936 139 RYFFMSSMLGLLADY--GLWPHVTNASAISNTVKHLYDQLQSQIRTSYDRIRDLTREGGTDAGAGSIDTDRHGVP 211 (548)
Q Consensus 139 RstFvSSLLpLLaEY--~LqP~V~DAqSIvsnvKvLf~hLq~kL~~~e~klke~q~~pg~~~~~~~~~~~~~q~p 211 (548)
..-|+..|||.+--+ -|.- ..+..+|+.+|+-.+++|..=|. ..-|+++. | .| .++|+.|+.-
T Consensus 110 ~e~~~~~LLpVlDnLerAl~~-~~~~~~l~~Gv~mi~kql~~vL~--k~GVe~I~--~---~G--~FDP~~HEAv 174 (211)
T PRK14160 110 CEDVLKELLPVLDNLERAAAV-EGSVEDLKKGIEMTVKQFKTSLE--KLGVEEIS--T---EG--EFDPNLHNAV 174 (211)
T ss_pred HHHHHHHHhhHHhHHHHHHhc-ccchhHHHHHHHHHHHHHHHHHH--HCCCEEeC--C---CC--CCChHHhcee
Confidence 356899999998877 2321 24557899999999999888774 33455543 2 23 4788777643
No 64
>PLN02524 S-adenosylmethionine decarboxylase
Probab=23.37 E-value=1.6e+02 Score=31.92 Aligned_cols=87 Identities=23% Similarity=0.269 Sum_probs=50.7
Q ss_pred eeeeccccCCccceeeEEEEeccCCccccccCCCCCcceeccccccceeEEEeEecCCCCcccceeeeeecCCCccccCc
Q 008936 324 LLGCGYPVRGTTLCMFQWVRHLQDGTRHYIEGATNPEYVVTADDVDKLIAVECIPMDDQGRQGELVRRFANDQNKIKCDL 403 (548)
Q Consensus 324 L~acGy~inGTt~C~FQWVRhleDGs~~~IeGA~~p~Y~vTADDvd~llAiec~PmDdr~RkGelVk~~ANd~~kI~cdp 403 (548)
-.=||||.|| |+|..=-+--||..|==.+.+.|+.=-|. +..
T Consensus 228 F~PCGYSmN~-------------------i~g~~y~TIHVTPE~~~SYaSFEtn~~~~-------------------~~~ 269 (355)
T PLN02524 228 FDPCGYSMNG-------------------IEGDAISTIHVTPEDGFSYASFEAMGYDP-------------------GDL 269 (355)
T ss_pred cCCCcccccc-------------------cCCCceEEEEECCCCCCeEEEEEeccCCc-------------------ccc
Confidence 4569999996 44554445557777777788887753111 112
Q ss_pred cchHHHHHHhhc-cceEEEEeeeccccccccceEEEEeecceEEEe
Q 008936 404 GMQSEIDAYISR-GHATFSVLMLMDSSENWEQATLILRRSIYRIKI 448 (548)
Q Consensus 404 eMq~~I~~~l~~-G~AsF~V~ll~~~~d~wEpATLvirR~gY~IK~ 448 (548)
.+.+.|++-+.. .=..|.|.+-.+............+-.||.-+-
T Consensus 270 ~~~~lv~rVl~~F~P~~fsvt~~~~~~~~~~~~~~~~~~~gY~~~~ 315 (355)
T PLN02524 270 DLSQLVERVLACFKPKEFSVAVHANVGGEAGSWGCSLDPDGYSCKG 315 (355)
T ss_pred CHHHHHHHHHhhcCCceEEEEEEecCccchhccccccCcCCceecC
Confidence 366666666654 334577777654333333334445557876543
No 65
>KOG3083 consensus Prohibitin [Posttranslational modification, protein turnover, chaperones]
Probab=22.92 E-value=77 Score=33.03 Aligned_cols=41 Identities=29% Similarity=0.524 Sum_probs=28.5
Q ss_pred HhhHHHHH-HHHHHHHh---hhhHhhhhhhhh------------hhhhhhhh--hHHHH
Q 008936 103 KQNEAITS-ALNELARR---KGVLEENLKLAH------------DLKVAEDE--RYFFM 143 (548)
Q Consensus 103 qQqdlVdA-ASKaLSyR---QdiiEENiRLty------------alq~aeQE--RstFv 143 (548)
-|.++|++ .+++|..| -+++-+++-+|| +-|+|||| |.-||
T Consensus 141 TqRe~vS~~v~~~lt~rA~~Fgl~LddvsiThltfGkEFt~AvE~KQVAQQEAErarFv 199 (271)
T KOG3083|consen 141 TQRELVSRQVSNDLTERAATFGLILDDVSITHLTFGKEFTEAVEAKQVAQQEAERARFV 199 (271)
T ss_pred hhhHHHHHHHHHHHHHHHHhhCeeechhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHH
Confidence 36677776 67777777 456677777765 56888885 67775
No 66
>PLN02542 fructose-1,6-bisphosphatase
Probab=22.90 E-value=1e+02 Score=33.92 Aligned_cols=80 Identities=24% Similarity=0.424 Sum_probs=56.7
Q ss_pred CCCcceeeeccccCCccceeeEEEEeccCCccccccCCCCCcceeccccccceeEEEeEecCCCCcccceeeeee-cCCC
Q 008936 319 TPGEKLLGCGYPVRGTTLCMFQWVRHLQDGTRHYIEGATNPEYVVTADDVDKLIAVECIPMDDQGRQGELVRRFA-NDQN 397 (548)
Q Consensus 319 ~pG~~L~acGy~inGTt~C~FQWVRhleDGs~~~IeGA~~p~Y~vTADDvd~llAiec~PmDdr~RkGelVk~~A-Nd~~ 397 (548)
-||+++.|.||-+=|.+.-. |=.+.+|...|.=....=+|++|.+++- +| ++ =++|+ |++|
T Consensus 234 qpG~~qvAAGY~lYGpsT~L---Vlt~G~GV~~FtLDp~~geFvLt~~~i~-------IP--~~------g~iySiN~~N 295 (412)
T PLN02542 234 QPGSNLLAAGYCMYSSSVIF---VLTIGTGVFSFTLDPMYGEFVLTQENIQ-------IP--KA------GKIYSFNEGN 295 (412)
T ss_pred CcchhhhEEEEEEEccceEE---EEEECCCEEEEEEcCCCCeEEEeCCCee-------eC--CC------CcEeeeCccc
Confidence 39999999999998865332 2245777766664555557888877651 22 22 26788 7788
Q ss_pred ccccCccchHHHHHHhhcc
Q 008936 398 KIKCDLGMQSEIDAYISRG 416 (548)
Q Consensus 398 kI~cdpeMq~~I~~~l~~G 416 (548)
.=.+|+.|+.-|+.....|
T Consensus 296 ~~~W~~~~~~yi~~~~~~~ 314 (412)
T PLN02542 296 YQLWDDKLKKYIDDLKDPG 314 (412)
T ss_pred ccccCHHHHHHHHHHhhCC
Confidence 8889999999999887543
No 67
>PF08838 DUF1811: Protein of unknown function (DUF1811); InterPro: IPR014938 This entry consists uncharacterised bacterial proteins. Some of the proteins are annotated as being transcriptional regulators (see Q4MQL7 from SWISSPROT, Q65MA2 from SWISSPROT). The structure of one of the proteins has revealed a beta-barrel like structure with helix-turn-helix like motif. ; PDB: 2YXY_A 1SF9_A.
Probab=22.36 E-value=87 Score=28.72 Aligned_cols=33 Identities=30% Similarity=0.402 Sum_probs=24.9
Q ss_pred ccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHh
Q 008936 60 MQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVS 94 (548)
Q Consensus 60 ~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA 94 (548)
|-++||..|+.+-..| ...-+.||--|||||++
T Consensus 11 EL~~Ei~~L~ekarKA--Eq~G~~nE~aV~erK~~ 43 (102)
T PF08838_consen 11 ELRQEIARLKEKARKA--EQLGIVNEYAVYERKII 43 (102)
T ss_dssp HHHHHHHHHHHHHHHH--HHCT-HHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHH--HHcCCccHHHHHHHHHH
Confidence 4467888888886544 45568899999999985
No 68
>PRK14141 heat shock protein GrpE; Provisional
Probab=22.14 E-value=7.8e+02 Score=24.90 Aligned_cols=64 Identities=17% Similarity=0.200 Sum_probs=41.1
Q ss_pred hHHHHHHhHHHhhhcC--CC--CC------cccchhhhhhHHHHHHHHHHHHHhhhhhhcccccCCCCCCCCCCcCCCCC
Q 008936 139 RYFFMSSMLGLLADYG--LW--PH------VTNASAISNTVKHLYDQLQSQIRTSYDRIRDLTREGGTDAGAGSIDTDRH 208 (548)
Q Consensus 139 RstFvSSLLpLLaEY~--Lq--P~------V~DAqSIvsnvKvLf~hLq~kL~~~e~klke~q~~pg~~~~~~~~~~~~~ 208 (548)
..-|+..|||.+--+. |. +. -.+..+|+.+|+-.+++|..=|. ...|+++. +.| ..++|..|
T Consensus 80 ~~~~~~dLLpViDnLerAl~~~~~~~~~~~~~~~~~l~eGv~mi~k~l~~vLe--k~GV~~I~-~~G-----e~FDP~~H 151 (209)
T PRK14141 80 IAGFARDMLSVSDNLRRALDAIPAEARAAADAGLKALIEGVEMTERAMLNALE--RHGVKKLD-PEG-----QKFDPNFH 151 (209)
T ss_pred HHHHHHHHhhhHhHHHHHHhccccccccccchhHHHHHHHHHHHHHHHHHHHH--HCCCEEEC-CCC-----CCCChHHh
Confidence 3468899999987662 21 21 12357899999999999988775 44556555 223 24666666
Q ss_pred CC
Q 008936 209 GV 210 (548)
Q Consensus 209 q~ 210 (548)
+.
T Consensus 152 EA 153 (209)
T PRK14141 152 QA 153 (209)
T ss_pred ce
Confidence 53
No 69
>PF07926 TPR_MLP1_2: TPR/MLP1/MLP2-like protein; InterPro: IPR012929 This domain is found in a number of proteins, including TPR protein (P12270 from SWISSPROT) and yeast myosin-like proteins 1 (MLP1, Q02455 from SWISSPROT) and 2 (MLP2, P40457 from SWISSPROT). These proteins share a number of features; for example, they all have coiled-coil regions and all three are associated with nuclear pores [, , ]. TPR is thought to be a component of nuclear pore complex- attached intranuclear filaments [], and is implicated in nuclear protein import []. Moreover, its N-terminal region is involved in the activation of oncogenic kinases, possibly by mediating the dimerisation of kinase domains or by targeting these kinases to the nuclear pore complex []. MLP1 and MLP2 are involved in the process of telomere length regulation, where they are thought to interact with proteins such as Tel1p and modulate their activity []. ; GO: 0006606 protein import into nucleus, 0005643 nuclear pore
Probab=22.03 E-value=5e+02 Score=23.57 Aligned_cols=67 Identities=16% Similarity=0.168 Sum_probs=44.6
Q ss_pred HHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHHHHhHHhhHHHHHHHHHHHHhhhhHhhhhhh
Q 008936 62 KEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMAIDEKQNEAITSALNELARRKGVLEENLKL 128 (548)
Q Consensus 62 eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRmAfDqqQqdlVdAASKaLSyRQdiiEENiRL 128 (548)
=++|..||+.+..+-..=.++..+.......+...+-.+..+...|-..-+.+=..+.|+-+.|=-|
T Consensus 58 ~~~L~~lr~e~~~~~~~~~~l~~~~~~a~~~l~~~e~sw~~qk~~le~e~~~~~~r~~dL~~QN~lL 124 (132)
T PF07926_consen 58 IKELQQLREELQELQQEINELKAEAESAKAELEESEASWEEQKEQLEKELSELEQRIEDLNEQNKLL 124 (132)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3456677777777666666677777777777777777777776666666666666666666666433
No 70
>KOG3990 consensus Uncharacterized conserved protein [Function unknown]
Probab=21.88 E-value=2.7e+02 Score=29.55 Aligned_cols=29 Identities=28% Similarity=0.408 Sum_probs=25.7
Q ss_pred ccchhhhhHHHHhhccHHHHHHHHHHHHH
Q 008936 46 FQDREAMELYSRARMQKEEIHSLRQQIAV 74 (548)
Q Consensus 46 ~~d~~~~~L~~k~r~~eeEI~~LRk~Lad 74 (548)
|-|+.--+-.++..--+|||..|++.|+.
T Consensus 215 ~~D~~~~dh~V~i~~lkeeia~Lkk~L~q 243 (305)
T KOG3990|consen 215 FGDRDPGDHMVKIQKLKEEIARLKKLLHQ 243 (305)
T ss_pred CCCCCCcchHHHHHHHHHHHHHHHHHHhh
Confidence 66888888889999999999999999985
No 71
>PF13600 DUF4140: N-terminal domain of unknown function (DUF4140)
Probab=21.50 E-value=1.3e+02 Score=25.81 Aligned_cols=36 Identities=22% Similarity=0.273 Sum_probs=26.4
Q ss_pred cHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHH
Q 008936 61 QKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSEL 96 (548)
Q Consensus 61 ~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~m 96 (548)
..+++..|+++|.++--+-..+.+++.+++.+++.|
T Consensus 68 ~~~~~~~l~~~l~~l~~~~~~~~~~~~~~~~~~~~L 103 (104)
T PF13600_consen 68 DSPELKELEEELEALEDELAALQDEIQALEAQIAFL 103 (104)
T ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 455777777777777777777778888888887755
No 72
>KOG3620 consensus Uncharacterized conserved protein [Function unknown]
Probab=21.40 E-value=1.7e+02 Score=36.72 Aligned_cols=75 Identities=21% Similarity=0.316 Sum_probs=56.1
Q ss_pred hhccceEEEEeeeccccccccceEEEEeecc----------------eEEEec-CCcceeeeccCCcceeEecCCCcceE
Q 008936 413 ISRGHATFSVLMLMDSSENWEQATLILRRSI----------------YRIKID-STEAIIEERFPKEVSIKVPCGLSTQF 475 (548)
Q Consensus 413 l~~G~AsF~V~ll~~~~d~wEpATLvirR~g----------------Y~IK~~-~~~~~I~EKfS~~~~IkIP~G~~~qf 475 (548)
.--|..+|+|-.+-+-.-.-+ |++|--.. |+||.= |=.+..+..|||.+.|-=|+.++-|.
T Consensus 40 ~P~GnTsf~VVFLpR~~G~~~--t~~~~~ts~Gvfty~vqG~g~~nPYRl~Pf~G~r~PmNss~sp~I~i~NPh~~plqi 117 (1626)
T KOG3620|consen 40 PPQGNTSFNVVFLPRQLGAIA--TDLLIHTSFGVFTYAVQGEGSENPYRLKPFVGIRAPMNSSLSPEIHIYNPHERPLQI 117 (1626)
T ss_pred CCCCcceEEEEEehhhhccce--eeEEEeeccceEEEEEeccCCCCccccccccceecccccccCCceEecCCccCceEE
Confidence 346899999998865444444 55554433 777765 44488999999999999999999999
Q ss_pred EEEecCCCeeeecc
Q 008936 476 VLTFSDGSSYPFST 489 (548)
Q Consensus 476 vi~~sdG~e~~L~~ 489 (548)
+=|.+.|.+-.|++
T Consensus 118 ~EiysSGG~~hlEL 131 (1626)
T KOG3620|consen 118 LEIYSSGGEFHLEL 131 (1626)
T ss_pred EEEeccCccEEEec
Confidence 98877777766664
No 73
>cd05829 Sortase_E Sortase E (SrtE) is a membrane transpeptidase found in gram-positive bacteria that cleaves surface proteins at a cell sorting motif and catalyzes a transpeptidation reaction in which the surface protein substrate is covalently linked to peptidoglycan for display on the bacterial surface. Sortases are grouped into different classes and subfamilies based on sequence, membrane topology, genomic positioning, and cleavage site preference. The function of Sortase E is unknown. In two different sortase families, the N-terminus either functions as both a signal peptide for secretion and a stop-transfer signal for membrane anchoring, or it contains a signal peptide only and the C-terminus serves as a membrane anchor. Most gram-positive bacteria contain more than one sortase and it is thought that the different sortases anchor different surface protein classes. The sortase domain is a modified beta-barrel flanked by two (SrtA) or three (SrtB) short alpha-helices.
Probab=21.29 E-value=1.9e+02 Score=26.52 Aligned_cols=51 Identities=18% Similarity=0.232 Sum_probs=35.8
Q ss_pred cccCCC--cceeeeccccCCccceeeEEEEeccCCccccccCC--CCCcceeccc
Q 008936 316 GEATPG--EKLLGCGYPVRGTTLCMFQWVRHLQDGTRHYIEGA--TNPEYVVTAD 366 (548)
Q Consensus 316 Gda~pG--~~L~acGy~inGTt~C~FQWVRhleDGs~~~IeGA--~~p~Y~vTAD 366 (548)
+.+.|| +...-.||....-....|-.+..|+-|..-+|..+ ....|.||.-
T Consensus 42 ~~~~pG~~Gn~viaGH~~~~g~~~~F~~L~~l~~GD~I~v~~~~g~~~~Y~V~~~ 96 (144)
T cd05829 42 GGPKPGEKGTAVLAGHVDSRGGPAVFFRLGDLRKGDKVEVTRADGQTATFRVDRV 96 (144)
T ss_pred CCCCCCCCCCEEEEEecCCCCCChhhcchhcCCCCCEEEEEECCCCEEEEEEeEE
Confidence 344444 35556788754224678999999999999999884 3478888753
No 74
>TIGR02168 SMC_prok_B chromosome segregation protein SMC, common bacterial type. SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
Probab=21.28 E-value=1.1e+03 Score=27.37 Aligned_cols=8 Identities=25% Similarity=0.688 Sum_probs=4.0
Q ss_pred ccceeecc
Q 008936 310 EGFQIIGE 317 (548)
Q Consensus 310 eglqI~Gd 317 (548)
.|+.|.-+
T Consensus 1071 ~~~~~~~~ 1078 (1179)
T TIGR02168 1071 AGIEIFAQ 1078 (1179)
T ss_pred cCceEEEe
Confidence 45555544
No 75
>PRK09293 fructose-1,6-bisphosphatase; Provisional
Probab=21.28 E-value=1.1e+02 Score=32.31 Aligned_cols=76 Identities=25% Similarity=0.547 Sum_probs=55.3
Q ss_pred CCCcceeeeccccCCccceeeEEEEeccCCccccccCCCCCcceeccccccceeEEEeEecCCCCcccceeeeee-cCCC
Q 008936 319 TPGEKLLGCGYPVRGTTLCMFQWVRHLQDGTRHYIEGATNPEYVVTADDVDKLIAVECIPMDDQGRQGELVRRFA-NDQN 397 (548)
Q Consensus 319 ~pG~~L~acGy~inGTt~C~FQWVRhleDGs~~~IeGA~~p~Y~vTADDvd~llAiec~PmDdr~RkGelVk~~A-Nd~~ 397 (548)
-||+++.|.||-+=|.+.-+ |-.+.+|...|.=....-+|++|-+++- +| +++ ++|+ |.+|
T Consensus 145 q~G~~~vaAgy~~YG~~t~l---v~t~g~gv~~ftld~~~g~f~l~~~~i~-------ip--~~~------~~ys~n~~n 206 (327)
T PRK09293 145 QPGNNQVAAGYVLYGPSTML---VLTTGDGVHGFTLDPSLGEFVLTHENIR-------IP--EDG------KEYAINEGN 206 (327)
T ss_pred CCchhheeEEEEEEcCceEE---EEEeCCCEEEEEEeCCCCeEEEecCCce-------eC--CCC------CEEccCchh
Confidence 39999999999998876544 3344677666665555667888876653 22 222 6899 8888
Q ss_pred ccccCccchHHHHHHh
Q 008936 398 KIKCDLGMQSEIDAYI 413 (548)
Q Consensus 398 kI~cdpeMq~~I~~~l 413 (548)
-=.++|.++.-| .++
T Consensus 207 ~~~w~~~~~~yi-~~~ 221 (327)
T PRK09293 207 QRHWEPGVKKYI-ELL 221 (327)
T ss_pred hhhcCHHHHHHH-HHh
Confidence 889999999999 554
No 76
>PRK14143 heat shock protein GrpE; Provisional
Probab=21.21 E-value=8.9e+02 Score=24.90 Aligned_cols=64 Identities=20% Similarity=0.328 Sum_probs=41.5
Q ss_pred hHHHHHHhHHHhhhc-----CCCCCcccchhhhhhHHHHHHHHHHHHHhhhhhhcccccCCCCCCCCCCcCCCCCCC
Q 008936 139 RYFFMSSMLGLLADY-----GLWPHVTNASAISNTVKHLYDQLQSQIRTSYDRIRDLTREGGTDAGAGSIDTDRHGV 210 (548)
Q Consensus 139 RstFvSSLLpLLaEY-----~LqP~V~DAqSIvsnvKvLf~hLq~kL~~~e~klke~q~~pg~~~~~~~~~~~~~q~ 210 (548)
..-|+..|||.+--+ .+.|.-.++.+|+.+++-++++|..-|. ..-|+++. ..| ..++|+.|.-
T Consensus 116 ~~~~~~~lLpV~DnLerAl~~~~~~~~~~~~l~~Gve~i~k~l~~~L~--k~GV~~i~-~~G-----~~FDP~~HEA 184 (238)
T PRK14143 116 KCNTLSEILPVVDNFERARQQLKPEGEEAQALHRSYQGLYKQLVDVLK--RLGVSPMR-VVG-----QEFDPNLHEA 184 (238)
T ss_pred HHHHHHHHHHHHhHHHHHHhcccccchhHHHHHHHHHHHHHHHHHHHH--HCCCeeeC-CCC-----CCCChHHhhe
Confidence 456899999998776 2335445568899999999998877664 33444444 122 2356666643
No 77
>PRK02224 chromosome segregation protein; Provisional
Probab=20.96 E-value=8.3e+02 Score=28.19 Aligned_cols=35 Identities=14% Similarity=0.137 Sum_probs=20.2
Q ss_pred EEEEecCCCeeeeccCCCCCchhhHHHHHHHHHHH
Q 008936 475 FVLTFSDGSSYPFSTYNVRMRDTLVLTMRMLQGKV 509 (548)
Q Consensus 475 fvi~~sdG~e~~L~~~~~~~RD~iVLtlR~F~~~~ 509 (548)
..++..+|.......-..+.|..+.|.||+-.-++
T Consensus 767 i~~~~~~g~~~~~~~lS~G~~~~~~lalr~a~~~~ 801 (880)
T PRK02224 767 LTVYQKDGEPLEPEQLSGGERALFNLSLRCAIYRL 801 (880)
T ss_pred eeeeCCCCCccChhhcCccHHHHHHHHHHHHHHHH
Confidence 33455566655555555666766666666655443
No 78
>PF13874 Nup54: Nucleoporin complex subunit 54; PDB: 3T97_B.
Probab=20.82 E-value=4.3e+02 Score=24.41 Aligned_cols=79 Identities=24% Similarity=0.242 Sum_probs=30.2
Q ss_pred hhHHHHhhccHHHHHHHHHHHHHHhhhhhhhhcchhhhH--HhHhHHHHHHhHHhhHHHHHHHHHHHHhhhhHhhhhhhh
Q 008936 52 MELYSRARMQKEEIHSLRQQIAVACLKELQLQNEKYTLE--RKVSELRMAIDEKQNEAITSALNELARRKGVLEENLKLA 129 (548)
Q Consensus 52 ~~L~~k~r~~eeEI~~LRk~LadasvKE~QllnEKyvLE--rRiA~mRmAfDqqQqdlVdAASKaLSyRQdiiEENiRLt 129 (548)
.+|-.|++.|++++.+++.+|.+..-+=.+| .|+.++ .||+++|--.-+.++-++.-++| ++|+. +..
T Consensus 33 ~dL~~R~~~Q~~~~~~~~~~l~~i~~~l~~L--~~~~~~~~~rl~~~r~r~~~L~hR~l~v~~~-----~eilr---~~g 102 (141)
T PF13874_consen 33 EDLKKRVEAQEEEIAQHRERLKEINDKLEEL--QKHDLETSARLEEARRRHQELSHRLLRVLRK-----QEILR---NRG 102 (141)
T ss_dssp -------------HHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHH---H--
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHhHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHH---HcC
Confidence 4678899999999999999998776553344 444443 56666665555555555444433 22322 234
Q ss_pred hhhhhhhhhhH
Q 008936 130 HDLKVAEDERY 140 (548)
Q Consensus 130 yalq~aeQERs 140 (548)
|+|+..|.+..
T Consensus 103 ~~l~~eEe~L~ 113 (141)
T PF13874_consen 103 YALSPEEEELR 113 (141)
T ss_dssp -----------
T ss_pred CCCCHHHHHHH
Confidence 56655555433
No 79
>PF06034 DUF919: Nucleopolyhedrovirus protein of unknown function (DUF919); InterPro: IPR009265 This entry is represented by Autographa californica nuclear polyhedrosis virus (AcMNPV), Orf29. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several short baculovirus proteins of unknown function.
Probab=20.73 E-value=1.9e+02 Score=24.37 Aligned_cols=43 Identities=26% Similarity=0.387 Sum_probs=29.6
Q ss_pred hhccHHHHHHHHHHHHHHhhhhhhh------hcchhhhHHhHhHHHHHH
Q 008936 58 ARMQKEEIHSLRQQIAVACLKELQL------QNEKYTLERKVSELRMAI 100 (548)
Q Consensus 58 ~r~~eeEI~~LRk~LadasvKE~Ql------lnEKyvLErRiA~mRmAf 100 (548)
++.|=.+|.+.+++|.-.+--+-.| -+|-+.+++|+.+||+-|
T Consensus 6 L~~QLd~I~~~K~~l~ik~~H~Ekl~kitK~p~El~~i~~kl~~~R~~F 54 (62)
T PF06034_consen 6 LTQQLDEINQMKRQLTIKSQHWEKLKKITKNPKELQEIEKKLQELRQNF 54 (62)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHH
Confidence 3556677777777776655443333 245577999999999977
No 80
>PRK04325 hypothetical protein; Provisional
Probab=20.67 E-value=2.2e+02 Score=24.08 Aligned_cols=32 Identities=25% Similarity=0.522 Sum_probs=22.8
Q ss_pred hHHhHhH--HHHHHhHHhhHHHHHHHHHHHHhhhhHh
Q 008936 89 LERKVSE--LRMAIDEKQNEAITSALNELARRKGVLE 123 (548)
Q Consensus 89 LErRiA~--mRmAfDqqQqdlVdAASKaLSyRQdiiE 123 (548)
+|.||.+ +|+|| |++.|+.=++++..-|..|.
T Consensus 7 ~e~Ri~~LE~klAf---QE~tIe~LN~vv~~Qq~~I~ 40 (74)
T PRK04325 7 MEDRITELEIQLAF---QEDLIDGLNATVARQQQTLD 40 (74)
T ss_pred HHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHH
Confidence 4555554 46676 78899999998887775553
No 81
>PF04201 TPD52: Tumour protein D52 family; InterPro: IPR007327 The hD52 gene was originally identified through its elevated expression level in human breast carcinoma. Cloning of D52 homologues from other species has indicated that D52 may play roles in calcium-mediated signal transduction and cell proliferation. Two human homologues of hD52, hD53 and hD54, have also been identified, demonstrating the existence of a novel gene/protein family []. These proteins have an N-terminal coiled-coil that allows members to form homo- and heterodimers with each other [].
Probab=20.62 E-value=2.1e+02 Score=28.18 Aligned_cols=51 Identities=33% Similarity=0.421 Sum_probs=35.1
Q ss_pred ccccchhhhhHHHHhhccHHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHHH-----HhHHhhHHH
Q 008936 44 ISFQDREAMELYSRARMQKEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMA-----IDEKQNEAI 108 (548)
Q Consensus 44 ~~~~d~~~~~L~~k~r~~eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRmA-----fDqqQqdlV 108 (548)
.-|.+.|-.+|-.-...-||||..||+=|+. | ||+.++||=. +.+-+|++-
T Consensus 24 ~~LsEeE~eeLr~EL~KvEeEI~TLrqvL~a---K-----------er~~~eLKrkLGit~l~elkqnls 79 (162)
T PF04201_consen 24 EGLSEEEREELRSELAKVEEEIQTLRQVLAA---K-----------ERHCAELKRKLGITPLSELKQNLS 79 (162)
T ss_pred ccCCHHHHHHHHHHHHHHHHHHHHHHHHHHH---H-----------HHhHHHHHHHHCCchHHHHHHHHH
Confidence 4567777777877777789999999998874 3 5666666532 444555553
No 82
>PF11932 DUF3450: Protein of unknown function (DUF3450); InterPro: IPR016866 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function. However, they are found in an operon along with components of a TonB transport system (typified by Vibrio cholerae TonB2 [], and are predicted to be localized to the periplasmic space. Caution: the low-complexity nature of these sequences produces spurious BLAST hits to chromosome segregation ATPases (which are much longer in length and contain canonical Walker motifs). Accordingly, some members are misidentified as such.
Probab=20.58 E-value=5e+02 Score=25.76 Aligned_cols=38 Identities=13% Similarity=0.189 Sum_probs=25.5
Q ss_pred hhhhhHHHHHHhHHHhhhcCCCCCcccchhhhhhHHHHHHHHHHHH
Q 008936 135 AEDERYFFMSSMLGLLADYGLWPHVTNASAISNTVKHLYDQLQSQI 180 (548)
Q Consensus 135 aeQERstFvSSLLpLLaEY~LqP~V~DAqSIvsnvKvLf~hLq~kL 180 (548)
-.+||..-+-.|=-+|+..++-+. .-+..++.-++-+.
T Consensus 125 ~~~eR~~Rl~~L~~~l~~~dv~~~--------ek~r~vlea~~~E~ 162 (251)
T PF11932_consen 125 LLEERQERLARLRAMLDDADVSLA--------EKFRRVLEAYQIEM 162 (251)
T ss_pred ChHHHHHHHHHHHHhhhccCCCHH--------HHHHHHHHHHHHHH
Confidence 556888888888888888877544 33445555555544
No 83
>KOG2896 consensus UV radiation resistance associated protein [General function prediction only]
Probab=20.57 E-value=3.5e+02 Score=29.87 Aligned_cols=44 Identities=18% Similarity=0.248 Sum_probs=30.1
Q ss_pred hHhHHHHHHhHHhhHH-------HHHHHHHHHHhhhhHhhhhhhhhhhhhh
Q 008936 92 KVSELRMAIDEKQNEA-------ITSALNELARRKGVLEENLKLAHDLKVA 135 (548)
Q Consensus 92 RiA~mRmAfDqqQqdl-------VdAASKaLSyRQdiiEENiRLtyalq~a 135 (548)
++..||.+|+-+|-.+ +..+----.||+-.||-++-.|+.+.--
T Consensus 142 kL~~kr~q~~~~q~~l~k~~k~l~e~~~~l~a~re~fL~~~~~~~~~irq~ 192 (377)
T KOG2896|consen 142 KLEDKRQQFNASQVKLQKQLKSLIELRNELVAKRELFLEQRIQDTFKIRQD 192 (377)
T ss_pred HHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHhhHhHHHhhhhhhhcc
Confidence 4566888888776543 4444455578888888888887776543
No 84
>PF00038 Filament: Intermediate filament protein; InterPro: IPR016044 Intermediate filaments (IF) [, , ] are proteins which are primordial components of the cytoskeleton and the nuclear envelope. They generally form filamentous structures 8 to 14 nm wide. IF proteins are members of a very large multigene family of proteins which has been subdivided in five major subgroups: Type I: Acidic cytokeratins. Type II: Basic cytokeratins. Type III: Vimentin, desmin, glial fibrillary acidic protein (GFAP), peripherin, and plasticin. Type IV: Neurofilaments L, H and M, alpha-internexin and nestin. Type V: Nuclear lamins A, B1, B2 and C. All IF proteins are structurally similar in that they consist of: a central rod domain comprising some 300 to 350 residues which is arranged in coiled-coiled alpha-helices, with at least two short characteristic interruptions; a N-terminal non-helical domain (head) of variable length; and a C-terminal domain (tail) which is also non-helical, and which shows extreme length variation between different IF proteins. While IF proteins are evolutionary and structurally related, they have limited sequence homologies except in several regions of the rod domain. This entry represents the central rod domain found in IF proteins.; PDB: 3TNU_B 3KLT_D 1GK4_F 3TRT_A 3G1E_A 3UF1_C 1GK6_B 1GK7_A 3TYY_B 3V4W_A ....
Probab=20.23 E-value=3.6e+02 Score=26.96 Aligned_cols=43 Identities=28% Similarity=0.392 Sum_probs=34.7
Q ss_pred HHHHHHHHHHHHHHhhhhhhhhcchhhhHHhHhHHHHHHhHHh
Q 008936 62 KEEIHSLRQQIAVACLKELQLQNEKYTLERKVSELRMAIDEKQ 104 (548)
Q Consensus 62 eeEI~~LRk~LadasvKE~QllnEKyvLErRiA~mRmAfDqqQ 104 (548)
+.||..||+.|.+.+..=+++.-|.-.|..-+.++|.-|....
T Consensus 53 e~el~~lr~~id~~~~eka~l~~e~~~l~~e~~~~r~k~e~e~ 95 (312)
T PF00038_consen 53 EEELRELRRQIDDLSKEKARLELEIDNLKEELEDLRRKYEEEL 95 (312)
T ss_dssp HHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred hhHHHHhHHhhhhHHHHhhHHhhhhhhHHHHHHHHHHHHHHHH
Confidence 6788888888888888878888888888888888887777653
No 85
>PF04011 LemA: LemA family; InterPro: IPR007156 The members of this family are related to the LemA protein P71452 from SWISSPROT. The exact molecular function of this protein is uncertain. It is predicted to be a transmembrane protein with an extracellular N terminus [].; PDB: 2ETD_A.
Probab=20.05 E-value=5.2e+02 Score=24.46 Aligned_cols=48 Identities=25% Similarity=0.361 Sum_probs=31.6
Q ss_pred hhhhhhhhHHHHHHhHHHhhhc-CCCCCcccchhhhhhHHHHHHHHHHHHHhhhhhhcccc
Q 008936 132 LKVAEDERYFFMSSMLGLLADY-GLWPHVTNASAISNTVKHLYDQLQSQIRTSYDRIRDLT 191 (548)
Q Consensus 132 lq~aeQERstFvSSLLpLLaEY-~LqP~V~DAqSIvsnvKvLf~hLq~kL~~~e~klke~q 191 (548)
++.++.+-+.-++.|+=....| .|+.. ..|++|+.+|..+|++|....
T Consensus 90 ~~~~~~~l~~al~~l~~~~e~yP~Lka~------------~~~~~l~~~l~~~E~~I~~aR 138 (186)
T PF04011_consen 90 FQQAEAELSQALSRLLAVVENYPELKAD------------ENFQQLMAQLEETENRIAAAR 138 (186)
T ss_dssp HHHHHHHHHHHHHHHHHHHTT-HHHHH-------------HHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHcCCccchh------------HHHHHHHHHHHHHHHHHHHHH
Confidence 3456666666677776555555 33221 248999999999999987665
Done!