Query psy4512
Match_columns 101
No_of_seqs 215 out of 1122
Neff 7.5
Searched_HMMs 46136
Date Fri Aug 16 19:05:28 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy4512.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4512hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1379|consensus 99.9 8.6E-27 1.9E-31 171.9 11.2 98 1-99 233-330 (330)
2 PTZ00224 protein phosphatase 2 99.6 1.5E-15 3.2E-20 115.9 9.2 73 3-100 198-271 (381)
3 COG0631 PTC1 Serine/threonine 99.6 2.9E-15 6.4E-20 109.1 8.5 69 3-100 184-252 (262)
4 PLN03145 Protein phosphatase 2 99.6 6.1E-15 1.3E-19 112.0 9.1 73 3-100 257-330 (365)
5 cd00143 PP2Cc Serine/threonine 99.6 1.8E-14 3.8E-19 101.5 8.7 71 3-99 183-254 (254)
6 smart00332 PP2Cc Serine/threon 99.5 6.9E-14 1.5E-18 99.0 8.0 69 3-97 186-255 (255)
7 KOG0698|consensus 99.4 1.5E-12 3.3E-17 97.5 9.0 74 2-100 230-304 (330)
8 PRK14559 putative protein seri 99.4 2.2E-12 4.7E-17 103.9 9.1 69 3-100 562-635 (645)
9 KOG1323|consensus 99.3 3.9E-12 8.5E-17 95.8 6.6 84 6-99 400-486 (493)
10 PF00481 PP2C: Protein phospha 99.3 1.2E-12 2.7E-17 94.0 3.2 57 3-61 189-246 (254)
11 TIGR02865 spore_II_E stage II 99.1 2.2E-10 4.9E-15 93.9 8.7 73 3-99 686-763 (764)
12 PF07228 SpoIIE: Stage II spor 99.1 4E-10 8.7E-15 77.1 7.7 73 4-100 114-193 (193)
13 KOG0697|consensus 99.1 4E-10 8.6E-15 83.3 7.5 72 3-99 218-290 (379)
14 KOG0700|consensus 98.9 2.6E-09 5.7E-14 81.5 4.9 60 3-64 318-378 (390)
15 smart00331 PP2C_SIG Sigma fact 98.7 5.1E-08 1.1E-12 66.8 7.0 55 3-61 138-192 (193)
16 KOG0699|consensus 98.6 2.5E-07 5.3E-12 70.9 7.7 78 4-100 425-503 (542)
17 COG2208 RsbU Serine phosphatas 98.5 9.7E-07 2.1E-11 66.8 8.8 74 4-100 286-366 (367)
18 PF13672 PP2C_2: Protein phosp 97.9 6.5E-06 1.4E-10 57.2 2.2 37 4-40 157-194 (212)
19 KOG0618|consensus 97.4 0.0005 1.1E-08 58.0 6.7 69 3-100 703-772 (1081)
20 PF09436 DUF2016: Domain of un 89.3 0.27 5.9E-06 29.5 1.7 23 12-34 25-47 (72)
21 COG2168 DsrH Uncharacterized c 76.9 2.2 4.8E-05 27.0 2.0 29 10-38 21-49 (96)
22 TIGR03735 PRTRC_A PRTRC system 74.6 11 0.00025 26.6 5.4 51 12-62 24-89 (192)
23 PF06972 DUF1296: Protein of u 62.2 15 0.00032 21.3 3.1 26 28-58 19-44 (60)
24 COG3700 AphA Acid phosphatase 58.1 31 0.00068 24.7 4.8 48 12-61 69-127 (237)
25 PRK03982 heat shock protein Ht 54.6 13 0.00027 27.5 2.6 27 13-39 105-131 (288)
26 PF05762 VWA_CoxE: VWA domain 52.8 24 0.00052 24.9 3.7 26 13-39 149-174 (222)
27 PF05708 DUF830: Orthopoxvirus 49.3 14 0.0003 24.2 1.9 15 10-24 1-15 (158)
28 PF04077 DsrH: DsrH like prote 48.0 8.2 0.00018 23.5 0.6 28 11-38 16-43 (88)
29 TIGR03011 sulf_tusB_dsrH sulfu 46.6 9.7 0.00021 23.1 0.8 26 12-37 21-46 (94)
30 COG0501 HtpX Zn-dependent prot 45.1 14 0.0003 26.7 1.5 25 15-39 139-163 (302)
31 PRK03072 heat shock protein Ht 45.1 17 0.00036 27.0 1.9 27 13-39 107-133 (288)
32 PRK05457 heat shock protein Ht 43.6 25 0.00054 26.0 2.7 29 11-39 112-140 (284)
33 PRK02391 heat shock protein Ht 43.0 19 0.00041 26.9 2.0 27 13-39 113-139 (296)
34 COG0054 RibH Riboflavin syntha 40.7 48 0.001 22.7 3.5 30 50-97 24-53 (152)
35 cd06409 PB1_MUG70 The MUG70 pr 39.7 21 0.00046 22.0 1.5 11 13-23 52-62 (86)
36 cd01234 PH_CADPS CADPS (Ca2+-d 38.4 22 0.00048 23.1 1.5 14 10-23 82-95 (117)
37 PRK02870 heat shock protein Ht 36.9 33 0.00072 26.2 2.5 26 14-39 154-179 (336)
38 PRK03001 M48 family peptidase; 36.4 21 0.00046 26.1 1.4 25 15-39 106-130 (283)
39 PRK00050 16S rRNA m(4)C1402 me 36.1 96 0.0021 23.3 4.8 53 4-59 98-167 (296)
40 COG2144 Selenophosphate synthe 36.1 39 0.00085 25.7 2.7 19 9-28 49-67 (324)
41 cd00534 DHNA_DHNTPE Dihydroneo 36.0 1.1E+02 0.0023 19.2 4.5 43 20-62 43-85 (118)
42 PRK04897 heat shock protein Ht 35.4 44 0.00094 24.8 2.9 27 13-39 117-143 (298)
43 PF02080 TrkA_C: TrkA-C domain 35.1 23 0.00049 19.8 1.1 15 8-22 45-59 (71)
44 COG0853 PanD Aspartate 1-decar 35.0 30 0.00066 22.9 1.8 21 10-33 77-97 (126)
45 COG3411 Ferredoxin [Energy pro 34.7 79 0.0017 18.5 3.3 30 12-41 15-45 (64)
46 PF13649 Methyltransf_25: Meth 34.6 49 0.0011 19.7 2.6 28 13-40 65-93 (101)
47 TIGR02219 phage_NlpC_fam putat 34.6 33 0.00071 22.3 1.9 14 9-22 75-88 (134)
48 PRK13510 sulfur transfer compl 34.4 22 0.00047 22.1 1.0 21 10-30 20-40 (95)
49 TIGR00074 hypC_hupF hydrogenas 33.9 73 0.0016 19.1 3.2 30 10-39 35-64 (76)
50 cd04436 DEP_fRgd2 DEP (Disheve 33.2 1.2E+02 0.0026 18.6 4.2 39 23-61 24-63 (84)
51 PRK01345 heat shock protein Ht 33.1 35 0.00076 25.7 2.1 25 15-39 106-130 (317)
52 PF10610 Tafi-CsgC: Thin aggre 32.7 32 0.00069 22.1 1.5 22 4-25 68-92 (106)
53 KOG3571|consensus 32.6 53 0.0012 27.0 3.1 34 8-41 293-326 (626)
54 PHA02248 hypothetical protein 29.8 65 0.0014 22.3 2.8 29 10-38 107-135 (204)
55 PF08541 ACP_syn_III_C: 3-Oxoa 29.4 28 0.00061 20.6 0.9 16 9-24 64-79 (90)
56 cd03703 aeIF5B_II aeIF5B_II: T 29.4 34 0.00075 22.0 1.3 18 6-23 22-40 (110)
57 PHA02754 hypothetical protein; 28.4 39 0.00085 19.5 1.3 17 12-28 42-59 (67)
58 PF09992 DUF2233: Predicted pe 28.0 75 0.0016 21.0 2.9 28 12-40 99-128 (170)
59 PRK01265 heat shock protein Ht 26.9 40 0.00086 25.6 1.5 25 15-39 122-146 (324)
60 PRK11593 folB bifunctional dih 26.7 1.7E+02 0.0036 18.4 4.2 43 20-62 43-85 (119)
61 KOG2862|consensus 26.7 43 0.00093 26.0 1.6 20 10-29 88-107 (385)
62 KOG0301|consensus 26.5 37 0.0008 28.6 1.3 20 9-28 266-285 (745)
63 COG1935 Uncharacterized conser 26.4 34 0.00073 22.4 0.9 18 9-26 37-54 (122)
64 PF14604 SH3_9: Variant SH3 do 26.3 67 0.0014 17.1 2.0 15 8-22 12-26 (49)
65 PRK02268 hypothetical protein; 26.1 52 0.0011 22.1 1.8 12 10-21 35-46 (141)
66 PF00595 PDZ: PDZ domain (Also 25.9 86 0.0019 17.9 2.6 29 11-40 43-72 (81)
67 PF13052 DUF3913: Protein of u 25.4 29 0.00063 19.2 0.4 19 7-30 5-23 (57)
68 COG0150 PurM Phosphoribosylami 25.3 55 0.0012 25.3 1.9 22 8-29 169-191 (345)
69 PRK10413 hydrogenase 2 accesso 25.2 1.4E+02 0.003 18.2 3.4 31 10-40 42-72 (82)
70 cd01480 vWA_collagen_alpha_1-V 24.7 1.2E+02 0.0025 20.4 3.4 12 15-26 111-122 (186)
71 TIGR00006 S-adenosyl-methyltra 23.8 1.4E+02 0.0031 22.5 3.9 52 6-60 102-170 (305)
72 PF01455 HupF_HypC: HupF/HypC 23.5 58 0.0013 19.0 1.5 28 11-38 38-65 (68)
73 COG0275 Predicted S-adenosylme 23.0 2.2E+02 0.0047 21.8 4.7 52 6-60 106-174 (314)
74 cd06919 Asp_decarbox Aspartate 22.9 78 0.0017 20.5 2.1 17 10-26 77-93 (111)
75 TIGR00253 RNA_bind_YhbY putati 22.7 2E+02 0.0044 17.8 3.9 45 16-61 19-69 (95)
76 PF08148 DSHCT: DSHCT (NUC185) 22.6 82 0.0018 21.5 2.3 22 20-41 29-50 (180)
77 cd03062 TRX_Fd_Sucrase TRX-lik 22.6 1.4E+02 0.0031 18.2 3.2 28 14-41 53-83 (97)
78 cd00136 PDZ PDZ domain, also c 22.6 1.4E+02 0.0031 16.1 3.3 30 9-39 29-59 (70)
79 PF00717 Peptidase_S24: Peptid 21.5 82 0.0018 17.3 1.8 14 10-23 24-37 (70)
80 cd05829 Sortase_E Sortase E (S 21.2 81 0.0017 20.8 2.0 16 9-24 71-87 (144)
81 PF13580 SIS_2: SIS domain; PD 20.6 60 0.0013 21.0 1.2 13 10-22 100-112 (138)
82 COG0361 InfA Translation initi 20.6 96 0.0021 18.6 2.0 13 7-19 43-55 (75)
83 COG3552 CoxE Protein containin 20.3 1.5E+02 0.0033 23.3 3.5 25 9-34 307-331 (395)
84 PF13768 VWA_3: von Willebrand 20.1 1.5E+02 0.0033 18.9 3.1 25 15-40 98-122 (155)
85 cd01461 vWA_interalpha_trypsin 20.1 1.8E+02 0.0039 18.6 3.5 23 15-39 101-123 (171)
86 PRK11479 hypothetical protein; 20.0 77 0.0017 23.7 1.8 17 7-23 61-77 (274)
No 1
>KOG1379|consensus
Probab=99.94 E-value=8.6e-27 Score=171.94 Aligned_cols=98 Identities=48% Similarity=0.767 Sum_probs=90.9
Q ss_pred CCCccceeEeccCCCEEEEeecCCCCCCChHHHHHHHHHhhCCCCchhHHHHHHHHHHHHHHHhcCCCCCCHHHHHHHHc
Q psy4512 1 PECADTSQFQVEDGDVILLATDGVFDNVPDSLLLAELVRAQGSKDPMQLQLVANTIALMARTLAFDETYMSPFAIQARAN 80 (101)
Q Consensus 1 p~~~d~~~~~l~~gD~lvl~SDGl~d~l~~~ei~~~l~~~~~~~~~~~~~~~a~~Lv~~A~~~~~d~~~~~p~~~~a~~~ 80 (101)
|+.++.+++.|++||+|||+|||||||+++++|..++...... ...++|.+|+.|++.|+.++.|+++.|||+..|+++
T Consensus 233 p~~ad~~~~~v~~GDvIilATDGlfDNl~e~~Il~il~~~~~~-~~~~lq~~A~~ia~~Ar~ls~d~~~~SPFA~~Ar~~ 311 (330)
T KOG1379|consen 233 PDSADVTSFDVQKGDVIILATDGLFDNLPEKEILSILKGLDAR-GNLDLQVTAQKIAEKARELSRDPKFQSPFAQAAREH 311 (330)
T ss_pred ccccceEEEeccCCCEEEEecccccccccHHHHHHHHHHhhcc-ccccHHHHHHHHHHHHHHhccCcCcCChHHHHHHHh
Confidence 7899999999999999999999999999999999999876421 356899999999999999999999999999999999
Q ss_pred CCCCCCCCCCceEEEEEEE
Q psy4512 81 GISTQGGKPDDITVLLAIV 99 (101)
Q Consensus 81 ~~~~~gg~~DdiTvvvv~v 99 (101)
|+.++|||+||||||+.++
T Consensus 312 g~~~~gGK~DdITvvls~v 330 (330)
T KOG1379|consen 312 GFKAYGGKPDDITVVLSSV 330 (330)
T ss_pred CcccCCCCcccEEEEEecC
Confidence 9999999999999999864
No 2
>PTZ00224 protein phosphatase 2C; Provisional
Probab=99.63 E-value=1.5e-15 Score=115.93 Aligned_cols=73 Identities=26% Similarity=0.410 Sum_probs=61.7
Q ss_pred CccceeEeccCCCEEEEeecCCCC-CCChHHHHHHHHHhhCCCCchhHHHHHHHHHHHHHHHhcCCCCCCHHHHHHHHcC
Q psy4512 3 CADTSQFQVEDGDVILLATDGVFD-NVPDSLLLAELVRAQGSKDPMQLQLVANTIALMARTLAFDETYMSPFAIQARANG 81 (101)
Q Consensus 3 ~~d~~~~~l~~gD~lvl~SDGl~d-~l~~~ei~~~l~~~~~~~~~~~~~~~a~~Lv~~A~~~~~d~~~~~p~~~~a~~~~ 81 (101)
++++..+++++||+|||||||||| .++++++.+++.+.. ....+++.+|+.|++.|+
T Consensus 198 ~Pdi~~~~l~~~D~llLaSDGL~d~~ls~eEi~~iv~~~l--~~~~~~~~aA~~Lv~~A~-------------------- 255 (381)
T PTZ00224 198 VPDVTHLTCQSNDFIILACDGVFEGNFSNEEVVAFVKEQL--ETCDDLAVVAGRVCDEAI-------------------- 255 (381)
T ss_pred eeEEEEEECCCCCEEEEECCCcCcCccCHHHHHHHHHHHH--hcCCCHHHHHHHHHHHHH--------------------
Confidence 578889999999999999999999 799999999987532 112578999999999998
Q ss_pred CCCCCCCCCceEEEEEEEe
Q psy4512 82 ISTQGGKPDDITVLLAIVA 100 (101)
Q Consensus 82 ~~~~gg~~DdiTvvvv~v~ 100 (101)
..|..|||||||+++.
T Consensus 256 ---~rGs~DNITvIvV~~~ 271 (381)
T PTZ00224 256 ---RRGSKDNISCLIVQLK 271 (381)
T ss_pred ---hcCCCCCEEEEEEEee
Confidence 2445999999999875
No 3
>COG0631 PTC1 Serine/threonine protein phosphatase [Signal transduction mechanisms]
Probab=99.61 E-value=2.9e-15 Score=109.08 Aligned_cols=69 Identities=33% Similarity=0.550 Sum_probs=62.0
Q ss_pred CccceeEeccCCCEEEEeecCCCCCCChHHHHHHHHHhhCCCCchhHHHHHHHHHHHHHHHhcCCCCCCHHHHHHHHcCC
Q psy4512 3 CADTSQFQVEDGDVILLATDGVFDNVPDSLLLAELVRAQGSKDPMQLQLVANTIALMARTLAFDETYMSPFAIQARANGI 82 (101)
Q Consensus 3 ~~d~~~~~l~~gD~lvl~SDGl~d~l~~~ei~~~l~~~~~~~~~~~~~~~a~~Lv~~A~~~~~d~~~~~p~~~~a~~~~~ 82 (101)
.+++....+.+||++|||||||||.+++++|.++++.. .+++.+++.|++.|+
T Consensus 184 ~p~~~~~~~~~~d~llL~SDGl~d~v~~~~i~~il~~~------~~~~~~~~~li~~a~--------------------- 236 (262)
T COG0631 184 EPDITELELEPGDFLLLCSDGLWDVVSDDEIVDILKNS------ETPQEAADKLIELAL--------------------- 236 (262)
T ss_pred ceeEEEEEcCCCCEEEEECCCCccCcCHHHHHHHHhcC------CCHHHHHHHHHHHHH---------------------
Confidence 47888999999999999999999999999999999863 689999999999998
Q ss_pred CCCCCCCCceEEEEEEEe
Q psy4512 83 STQGGKPDDITVLLAIVA 100 (101)
Q Consensus 83 ~~~gg~~DdiTvvvv~v~ 100 (101)
.+|..||||++++++.
T Consensus 237 --~~g~~DNiT~ilv~~~ 252 (262)
T COG0631 237 --EGGGPDNITVVLVRLN 252 (262)
T ss_pred --hcCCCCceEEEEEEee
Confidence 3555999999999865
No 4
>PLN03145 Protein phosphatase 2c; Provisional
Probab=99.59 E-value=6.1e-15 Score=111.98 Aligned_cols=73 Identities=12% Similarity=0.163 Sum_probs=60.5
Q ss_pred CccceeEeccCCCEE-EEeecCCCCCCChHHHHHHHHHhhCCCCchhHHHHHHHHHHHHHHHhcCCCCCCHHHHHHHHcC
Q psy4512 3 CADTSQFQVEDGDVI-LLATDGVFDNVPDSLLLAELVRAQGSKDPMQLQLVANTIALMARTLAFDETYMSPFAIQARANG 81 (101)
Q Consensus 3 ~~d~~~~~l~~gD~l-vl~SDGl~d~l~~~ei~~~l~~~~~~~~~~~~~~~a~~Lv~~A~~~~~d~~~~~p~~~~a~~~~ 81 (101)
+|++..++|.++|.+ |||||||||.++++++.+++.+... ...+|+.+|+.|++.|+
T Consensus 257 ePdv~~~~l~~~D~fLILaSDGLwdvls~ee~v~~i~~~l~--~~~~p~~aa~~Lv~~Al-------------------- 314 (365)
T PLN03145 257 EPELMTTQLTEEDEFLIIGCDGIWDVFRSQNAVDFARRRLQ--EHNDPVMCSKELVDEAL-------------------- 314 (365)
T ss_pred EEEEEEEECCCCCEEEEEeCCccccCcCHHHHHHHHHHHHh--cCCCHHHHHHHHHHHHH--------------------
Confidence 578889999999975 6999999999999999888765431 12478999999999998
Q ss_pred CCCCCCCCCceEEEEEEEe
Q psy4512 82 ISTQGGKPDDITVLLAIVA 100 (101)
Q Consensus 82 ~~~~gg~~DdiTvvvv~v~ 100 (101)
.+|..||||||||++.
T Consensus 315 ---~rgs~DNITvIVV~l~ 330 (365)
T PLN03145 315 ---KRKSGDNLAVVVVCFQ 330 (365)
T ss_pred ---hCCCCCCEEEEEEEee
Confidence 2455999999999874
No 5
>cd00143 PP2Cc Serine/threonine phosphatases, family 2C, catalytic domain; The protein architecture and deduced catalytic mechanism of PP2C phosphatases are similar to the PP1, PP2A, PP2B family of protein Ser/Thr phosphatases, with which PP2C shares no sequence similarity.
Probab=99.56 E-value=1.8e-14 Score=101.53 Aligned_cols=71 Identities=25% Similarity=0.384 Sum_probs=60.0
Q ss_pred CccceeEec-cCCCEEEEeecCCCCCCChHHHHHHHHHhhCCCCchhHHHHHHHHHHHHHHHhcCCCCCCHHHHHHHHcC
Q psy4512 3 CADTSQFQV-EDGDVILLATDGVFDNVPDSLLLAELVRAQGSKDPMQLQLVANTIALMARTLAFDETYMSPFAIQARANG 81 (101)
Q Consensus 3 ~~d~~~~~l-~~gD~lvl~SDGl~d~l~~~ei~~~l~~~~~~~~~~~~~~~a~~Lv~~A~~~~~d~~~~~p~~~~a~~~~ 81 (101)
++++....+ ++||+|+|||||||+++++++|.+++.... ...+++.+|+.|++.|.
T Consensus 183 ~~~~~~~~l~~~~d~ill~SDG~~~~l~~~~i~~~~~~~~---~~~~~~~~a~~l~~~a~-------------------- 239 (254)
T cd00143 183 EPDVTVVKLTEDDDFLILASDGLWDVLSNQEAVDIVRSEL---AKEDLQEAAQELVDLAL-------------------- 239 (254)
T ss_pred CCeEEEEEeCCCCcEEEEECCCCeeccChHHHHHHHHHHh---cccCHHHHHHHHHHHHH--------------------
Confidence 567788899 999999999999999999999999998752 00279999999999998
Q ss_pred CCCCCCCCCceEEEEEEE
Q psy4512 82 ISTQGGKPDDITVLLAIV 99 (101)
Q Consensus 82 ~~~~gg~~DdiTvvvv~v 99 (101)
..+..||+|++++++
T Consensus 240 ---~~~~~Dn~t~i~~~~ 254 (254)
T cd00143 240 ---RRGSHDNITVVVVRL 254 (254)
T ss_pred ---hCCCCCCEEEEEEeC
Confidence 234589999999874
No 6
>smart00332 PP2Cc Serine/threonine phosphatases, family 2C, catalytic domain. The protein architecture and deduced catalytic mechanism of PP2C phosphatases are similar to the PP1, PP2A, PP2B family of protein Ser/Thr phosphatases, with which PP2C shares no sequence similarity.
Probab=99.51 E-value=6.9e-14 Score=99.02 Aligned_cols=69 Identities=22% Similarity=0.369 Sum_probs=57.5
Q ss_pred CccceeEec-cCCCEEEEeecCCCCCCChHHHHHHHHHhhCCCCchhHHHHHHHHHHHHHHHhcCCCCCCHHHHHHHHcC
Q psy4512 3 CADTSQFQV-EDGDVILLATDGVFDNVPDSLLLAELVRAQGSKDPMQLQLVANTIALMARTLAFDETYMSPFAIQARANG 81 (101)
Q Consensus 3 ~~d~~~~~l-~~gD~lvl~SDGl~d~l~~~ei~~~l~~~~~~~~~~~~~~~a~~Lv~~A~~~~~d~~~~~p~~~~a~~~~ 81 (101)
++++...++ ++||+|||||||||++++++++.+++..... ..++..+|+.|++.|.
T Consensus 186 ~p~~~~~~~~~~~d~ill~SDGv~~~l~~~~i~~~~~~~~~---~~~~~~~~~~l~~~a~-------------------- 242 (255)
T smart00332 186 EPDVTVVELTEKDDFLILASDGLWDVLSNQEVVDIVRKHLS---KSDPEEAAKRLIDLAL-------------------- 242 (255)
T ss_pred eeEEEEEEecCCCcEEEEECCccccCCCHHHHHHHHHHHhh---cCCHHHHHHHHHHHHH--------------------
Confidence 456777785 9999999999999999999999999987630 0258999999999997
Q ss_pred CCCCCCCCCceEEEEE
Q psy4512 82 ISTQGGKPDDITVLLA 97 (101)
Q Consensus 82 ~~~~gg~~DdiTvvvv 97 (101)
.++..||+|++++
T Consensus 243 ---~~~~~Dn~T~ivv 255 (255)
T smart00332 243 ---ARGSKDNITVIVV 255 (255)
T ss_pred ---HcCCCCCeEEEEC
Confidence 3456999999985
No 7
>KOG0698|consensus
Probab=99.40 E-value=1.5e-12 Score=97.53 Aligned_cols=74 Identities=20% Similarity=0.291 Sum_probs=61.8
Q ss_pred CCccceeEeccC-CCEEEEeecCCCCCCChHHHHHHHHHhhCCCCchhHHHHHHHHHHHHHHHhcCCCCCCHHHHHHHHc
Q psy4512 2 ECADTSQFQVED-GDVILLATDGVFDNVPDSLLLAELVRAQGSKDPMQLQLVANTIALMARTLAFDETYMSPFAIQARAN 80 (101)
Q Consensus 2 ~~~d~~~~~l~~-gD~lvl~SDGl~d~l~~~ei~~~l~~~~~~~~~~~~~~~a~~Lv~~A~~~~~d~~~~~p~~~~a~~~ 80 (101)
.+|++....+.. .++|||+||||||.++++|++++++... .....+..++..|...|.
T Consensus 230 a~Pei~~~~~~~~deFLiLasDGiwDv~s~qeav~~V~~~~--~~~~~~~~a~~~l~~~a~------------------- 288 (330)
T KOG0698|consen 230 AEPEIQQVKINSDDEFLILASDGIWDVVSNQEAVDLVRDEL--ASISSPLAAAKLLATEAL------------------- 288 (330)
T ss_pred cCCceEEEEcCCCCcEEEEeCCchhcccChHHHHHHHHHHh--hccccHHHHHHHHHHHHh-------------------
Confidence 367888888888 5589999999999999999999999742 123578889999999997
Q ss_pred CCCCCCCCCCceEEEEEEEe
Q psy4512 81 GISTQGGKPDDITVLLAIVA 100 (101)
Q Consensus 81 ~~~~~gg~~DdiTvvvv~v~ 100 (101)
.+|+.||||||||.+.
T Consensus 289 ----~~~s~DnitvvvV~l~ 304 (330)
T KOG0698|consen 289 ----SRGSKDNITVVVVRLK 304 (330)
T ss_pred ----hcCCCCCeEEEEEEec
Confidence 4667999999999874
No 8
>PRK14559 putative protein serine/threonine phosphatase; Provisional
Probab=99.38 E-value=2.2e-12 Score=103.95 Aligned_cols=69 Identities=28% Similarity=0.419 Sum_probs=53.9
Q ss_pred CccceeEeccCCCEEEEeecCCCCC-CChH----HHHHHHHHhhCCCCchhHHHHHHHHHHHHHHHhcCCCCCCHHHHHH
Q psy4512 3 CADTSQFQVEDGDVILLATDGVFDN-VPDS----LLLAELVRAQGSKDPMQLQLVANTIALMARTLAFDETYMSPFAIQA 77 (101)
Q Consensus 3 ~~d~~~~~l~~gD~lvl~SDGl~d~-l~~~----ei~~~l~~~~~~~~~~~~~~~a~~Lv~~A~~~~~d~~~~~p~~~~a 77 (101)
++++..+.+++||+||||||||||+ +.+. ++..++.. ..+++.++++|++.|+
T Consensus 562 ~Pdi~~~~L~~gD~lLLCSDGL~D~~~ve~~~~~~l~~il~~------~~~l~~aa~~Li~~Al---------------- 619 (645)
T PRK14559 562 QPDIQFLEIEEDTLLLLCSDGLSDNDLLETHWQTHLLPLLSS------SANLDQGLNKLIDLAN---------------- 619 (645)
T ss_pred cceEEEEEcCCCCEEEEECCCCCCCcccchHHHHHHHHHHhc------CCCHHHHHHHHHHHHH----------------
Confidence 5788899999999999999999996 3332 23334332 2578999999999998
Q ss_pred HHcCCCCCCCCCCceEEEEEEEe
Q psy4512 78 RANGISTQGGKPDDITVLLAIVA 100 (101)
Q Consensus 78 ~~~~~~~~gg~~DdiTvvvv~v~ 100 (101)
.+|| +|||||+|+++.
T Consensus 620 ------~~gg-~DNITvIvV~l~ 635 (645)
T PRK14559 620 ------QYNG-HDNITAILVRLK 635 (645)
T ss_pred ------HcCC-CCcEEEEEEEec
Confidence 2454 899999999875
No 9
>KOG1323|consensus
Probab=99.32 E-value=3.9e-12 Score=95.81 Aligned_cols=84 Identities=25% Similarity=0.311 Sum_probs=65.7
Q ss_pred ceeEeccCCCEEEEeecCCCCCCChHHHHHHHHHhhCCCCchhH---HHHHHHHHHHHHHHhcCCCCCCHHHHHHHHcCC
Q psy4512 6 TSQFQVEDGDVILLATDGVFDNVPDSLLLAELVRAQGSKDPMQL---QLVANTIALMARTLAFDETYMSPFAIQARANGI 82 (101)
Q Consensus 6 ~~~~~l~~gD~lvl~SDGl~d~l~~~ei~~~l~~~~~~~~~~~~---~~~a~~Lv~~A~~~~~d~~~~~p~~~~a~~~~~ 82 (101)
....+...+|++||+||||||.++++|+..++++..+.....+| -.+|+.|+..|+....|++|..|
T Consensus 400 l~q~e~~~DdVvilatDGLWDVlSneeva~~Vrs~L~~~dp~Dp~RYt~aaqdlva~arg~~k~rgWr~~---------- 469 (493)
T KOG1323|consen 400 LRQYEHLTDDVVILATDGLWDVLSNEEVALIVRSFLPSTDPADPSRYTQAAQDLVAAARGQQKDRGWRMN---------- 469 (493)
T ss_pred hhhhccCCCcEEEEecCchhhhcccHHHHHHHHHhcCCCCCCChhHHHHHHHHHHHHhcCccCCCceecc----------
Confidence 44556678999999999999999999999999988643333334 36899999999988888877665
Q ss_pred CCCCCCCCceEEEEEEE
Q psy4512 83 STQGGKPDDITVLLAIV 99 (101)
Q Consensus 83 ~~~gg~~DdiTvvvv~v 99 (101)
+..-|+.|||||.|+-+
T Consensus 470 n~~lgSgDDIsVfVIPL 486 (493)
T KOG1323|consen 470 NGGLGSGDDISVFVIPL 486 (493)
T ss_pred CCCcCCCCceEEEEEec
Confidence 12335689999999854
No 10
>PF00481 PP2C: Protein phosphatase 2C; InterPro: IPR001932 This domain is found in protein phosphatase 2C, as well as other proteins eg. pyruvate dehydrogenase (lipoamide)-phosphatase (3.1.3.43 from EC), adenylate cyclase (4.6.1.1 from EC) and some bacterial stage II sporulation E proteins (3.1.3.16 from EC). Protein phosphatase 2C (PP2C) is one of the four major classes of mammalian serine/threonine specific protein phosphatases (3.1.3.16 from EC). PP2C [] is a monomeric enzyme of about 42 Kd which shows broad substrate specificity and is dependent on divalent cations (mainly manganese and magnesium) for its activity. Its exact physiological role is still unclear. Three isozymes are currently known in mammals: PP2C-alpha, -beta and -gamma. In yeast, there are at least four PP2C homologs: phosphatase PTC1 [], which has weak tyrosine phosphatase activity in addition to its activity on serines, phosphatases PTC2 and PTC3, and hypothetical protein YBR125c. Isozymes of PP2C are also known from Arabidopsis thaliana (ABI1, PPH1), Caenorhabditis elegans (FEM-2, F42G9.1, T23F11.1), Leishmania chagasi and Paramecium tetraurelia. In A. thaliana, the kinase associated protein phosphatase (KAPP) [] is an enzyme that dephosphorylates the Ser/Thr receptor-like kinase RLK5 and which contains a C-terminal PP2C domain. PP2C does not seem to be evolutionary related to the main family of serine/ threonine phosphatases: PP1, PP2A and PP2B. However, it is significantly similar to the catalytic subunit of pyruvate dehydrogenase phosphatase 3.1.3.43 from EC (PDPC) [], which catalyzes dephosphorylation and concomitant reactivation of the alpha subunit of the E1 component of the pyruvate dehydrogenase complex. PDPC is a mitochondrial enzyme and, like PP2C, is magnesium-dependent.; GO: 0003824 catalytic activity; PDB: 2I0O_A 2POP_C 2POM_A 2J4O_A 2I44_B 3MQ3_A 3N3C_A 2PNQ_B 2P8E_A 2IQ1_A ....
Probab=99.30 E-value=1.2e-12 Score=94.03 Aligned_cols=57 Identities=19% Similarity=0.252 Sum_probs=46.8
Q ss_pred CccceeEeccCCC-EEEEeecCCCCCCChHHHHHHHHHhhCCCCchhHHHHHHHHHHHHH
Q psy4512 3 CADTSQFQVEDGD-VILLATDGVFDNVPDSLLLAELVRAQGSKDPMQLQLVANTIALMAR 61 (101)
Q Consensus 3 ~~d~~~~~l~~gD-~lvl~SDGl~d~l~~~ei~~~l~~~~~~~~~~~~~~~a~~Lv~~A~ 61 (101)
+|++..+++.++| +||||||||||.++++++.+++.+... ....|+.+|+.|++.|+
T Consensus 189 ~P~i~~~~l~~~d~flvlaSDGlwd~l~~~ei~~~v~~~~~--~~~~~~~~a~~L~~~A~ 246 (254)
T PF00481_consen 189 EPDISEVDLTPDDEFLVLASDGLWDVLSNEEIVDIVRESLN--SGRSPQEAAEKLVDEAI 246 (254)
T ss_dssp --EEEEEEEBTTEEEEEEE-HHHHTTSHHHHHHHHHHHHHH--HHSHHHHHHHHHHHHHH
T ss_pred ecccccccccccceEEEEEcccccccCCHHHHHHHHHHHHh--cCCcHHHHHHHHHHHHH
Confidence 6889999999997 689999999999999999999988740 00149999999999998
No 11
>TIGR02865 spore_II_E stage II sporulation protein E. Stage II sporulation protein E (SpoIIE) is a multiple membrane spanning protein with two separable functions. It plays a role in the switch to polar cell division during sporulation. By means of it protein phosphatase activity, located in the C-terminal region, it activates sigma-F. All proteins that score above the trusted cutoff to this model are found in endospore-forming Gram-positive bacteria. Surprisingly, a sequence from the Cyanobacterium-like (and presumably non-spore-forming) photosynthesizer Heliobacillus mobilis is homologous, and scores between the trusted and noise cutoffs.
Probab=99.14 E-value=2.2e-10 Score=93.93 Aligned_cols=73 Identities=25% Similarity=0.416 Sum_probs=56.3
Q ss_pred CccceeEeccCCCEEEEeecCCCCCCChH-----HHHHHHHHhhCCCCchhHHHHHHHHHHHHHHHhcCCCCCCHHHHHH
Q psy4512 3 CADTSQFQVEDGDVILLATDGVFDNVPDS-----LLLAELVRAQGSKDPMQLQLVANTIALMARTLAFDETYMSPFAIQA 77 (101)
Q Consensus 3 ~~d~~~~~l~~gD~lvl~SDGl~d~l~~~-----ei~~~l~~~~~~~~~~~~~~~a~~Lv~~A~~~~~d~~~~~p~~~~a 77 (101)
+++..+.++++||+++|+|||++|...+. .+.++++.. ...+|+++++.|++.++..
T Consensus 686 ~~~~~~~~L~~GD~Lll~SDGv~E~~~~~~~~~~~l~~~l~~~----~~~~p~ela~~Il~~a~~~-------------- 747 (764)
T TIGR02865 686 DVELVRKKLKNGDLIVMVSDGVLEGEKEVEGKVLWLVRKLKET----NTNDPEEIAEYLLEKAKEL-------------- 747 (764)
T ss_pred ccceEEEEeCCCCEEEEECCCCCcCCcccccHHHHHHHHHHhc----CCCCHHHHHHHHHHHHHHh--------------
Confidence 45677889999999999999999977542 266666543 2367999999999999842
Q ss_pred HHcCCCCCCCCCCceEEEEEEE
Q psy4512 78 RANGISTQGGKPDDITVLLAIV 99 (101)
Q Consensus 78 ~~~~~~~~gg~~DdiTvvvv~v 99 (101)
..+...||+|++++++
T Consensus 748 ------~~~~~~DD~Tvlvirv 763 (764)
T TIGR02865 748 ------RSGKIKDDMTVIVAKV 763 (764)
T ss_pred ------cCCCCCCCeEEEEEEe
Confidence 1233589999999986
No 12
>PF07228 SpoIIE: Stage II sporulation protein E (SpoIIE); InterPro: IPR001932 This domain is found in protein phosphatase 2C, as well as other proteins eg. pyruvate dehydrogenase (lipoamide)-phosphatase (3.1.3.43 from EC), adenylate cyclase (4.6.1.1 from EC) and some bacterial stage II sporulation E proteins (3.1.3.16 from EC). Protein phosphatase 2C (PP2C) is one of the four major classes of mammalian serine/threonine specific protein phosphatases (3.1.3.16 from EC). PP2C [] is a monomeric enzyme of about 42 Kd which shows broad substrate specificity and is dependent on divalent cations (mainly manganese and magnesium) for its activity. Its exact physiological role is still unclear. Three isozymes are currently known in mammals: PP2C-alpha, -beta and -gamma. In yeast, there are at least four PP2C homologs: phosphatase PTC1 [], which has weak tyrosine phosphatase activity in addition to its activity on serines, phosphatases PTC2 and PTC3, and hypothetical protein YBR125c. Isozymes of PP2C are also known from Arabidopsis thaliana (ABI1, PPH1), Caenorhabditis elegans (FEM-2, F42G9.1, T23F11.1), Leishmania chagasi and Paramecium tetraurelia. In A. thaliana, the kinase associated protein phosphatase (KAPP) [] is an enzyme that dephosphorylates the Ser/Thr receptor-like kinase RLK5 and which contains a C-terminal PP2C domain. PP2C does not seem to be evolutionary related to the main family of serine/ threonine phosphatases: PP1, PP2A and PP2B. However, it is significantly similar to the catalytic subunit of pyruvate dehydrogenase phosphatase 3.1.3.43 from EC (PDPC) [], which catalyzes dephosphorylation and concomitant reactivation of the alpha subunit of the E1 component of the pyruvate dehydrogenase complex. PDPC is a mitochondrial enzyme and, like PP2C, is magnesium-dependent.; GO: 0003824 catalytic activity; PDB: 3KE6_B 3ZT9_A 3RNR_A 3EQ2_A 3F7A_B 3F79_A 3ES2_B 3PU9_B 3T91_B 3T9Q_B ....
Probab=99.11 E-value=4e-10 Score=77.05 Aligned_cols=73 Identities=29% Similarity=0.388 Sum_probs=51.4
Q ss_pred ccceeEeccCCCEEEEeecCCCCCCChH-------HHHHHHHHhhCCCCchhHHHHHHHHHHHHHHHhcCCCCCCHHHHH
Q psy4512 4 ADTSQFQVEDGDVILLATDGVFDNVPDS-------LLLAELVRAQGSKDPMQLQLVANTIALMARTLAFDETYMSPFAIQ 76 (101)
Q Consensus 4 ~d~~~~~l~~gD~lvl~SDGl~d~l~~~-------ei~~~l~~~~~~~~~~~~~~~a~~Lv~~A~~~~~d~~~~~p~~~~ 76 (101)
+...++++++||+|+|+||||+|+...+ .+.+++.+. ...+++++++.|++.+...
T Consensus 114 ~~~~~~~l~~gd~l~l~TDGl~e~~~~~~~~~~~~~~~~~l~~~----~~~~~~~~~~~l~~~~~~~------------- 176 (193)
T PF07228_consen 114 YQEQEIQLEPGDRLLLYTDGLFEALNEDGEFFGEERLLELLDEN----RGLSPQEIIDALLEAIDRF------------- 176 (193)
T ss_dssp EEEEEEE--TTEEEEEECHHHCTTTCHHCHHCCCHHHHHHHHCH----TTS-HHHHHHHHHHHHHHH-------------
T ss_pred ccceEEEeccccEEEEeCCChhhccCCccchhHHHHHHHHHhhc----cCCCHHHHHHHHHHHHHHh-------------
Confidence 4567899999999999999999998433 334445433 2367999999999999741
Q ss_pred HHHcCCCCCCCCCCceEEEEEEEe
Q psy4512 77 ARANGISTQGGKPDDITVLLAIVA 100 (101)
Q Consensus 77 a~~~~~~~~gg~~DdiTvvvv~v~ 100 (101)
..+...||+|++++++.
T Consensus 177 -------~~~~~~DD~tvl~~~~~ 193 (193)
T PF07228_consen 177 -------GKGPLRDDITVLVIRRQ 193 (193)
T ss_dssp -------TTSSTSS-EEEEEEEE-
T ss_pred -------cCCCCCCceEEEEEEEC
Confidence 24567899999999873
No 13
>KOG0697|consensus
Probab=99.09 E-value=4e-10 Score=83.28 Aligned_cols=72 Identities=19% Similarity=0.308 Sum_probs=58.0
Q ss_pred CccceeEec-cCCCEEEEeecCCCCCCChHHHHHHHHHhhCCCCchhHHHHHHHHHHHHHHHhcCCCCCCHHHHHHHHcC
Q psy4512 3 CADTSQFQV-EDGDVILLATDGVFDNVPDSLLLAELVRAQGSKDPMQLQLVANTIALMARTLAFDETYMSPFAIQARANG 81 (101)
Q Consensus 3 ~~d~~~~~l-~~gD~lvl~SDGl~d~l~~~ei~~~l~~~~~~~~~~~~~~~a~~Lv~~A~~~~~d~~~~~p~~~~a~~~~ 81 (101)
+|+++...- .++|++||++||+||.++++|++++++... .-..++..+|+..+..++
T Consensus 218 EPev~~~~R~eedeFivlACDGIwDVMtneelcefv~sRl--~Vt~dL~~vcn~VvDtCL-------------------- 275 (379)
T KOG0697|consen 218 EPEVYIIERSEEDEFIVLACDGIWDVMTNEELCEFVKSRL--EVTSDLEEVCNDVVDTCL-------------------- 275 (379)
T ss_pred CCceEEeeccccCcEEEEEccchhhhcccHHHHHHHHhhh--eecccHHHHHHHHHHHHH--------------------
Confidence 455554433 356789999999999999999999998653 224688999999999998
Q ss_pred CCCCCCCCCceEEEEEEE
Q psy4512 82 ISTQGGKPDDITVLLAIV 99 (101)
Q Consensus 82 ~~~~gg~~DdiTvvvv~v 99 (101)
..|++||+|++++.+
T Consensus 276 ---hKGSRDNMsivlvcf 290 (379)
T KOG0697|consen 276 ---HKGSRDNMSIVLVCF 290 (379)
T ss_pred ---hccCccCceEEEEec
Confidence 356799999999876
No 14
>KOG0700|consensus
Probab=98.87 E-value=2.6e-09 Score=81.48 Aligned_cols=60 Identities=15% Similarity=0.168 Sum_probs=49.0
Q ss_pred CccceeEeccCCCE-EEEeecCCCCCCChHHHHHHHHHhhCCCCchhHHHHHHHHHHHHHHHh
Q psy4512 3 CADTSQFQVEDGDV-ILLATDGVFDNVPDSLLLAELVRAQGSKDPMQLQLVANTIALMARTLA 64 (101)
Q Consensus 3 ~~d~~~~~l~~gD~-lvl~SDGl~d~l~~~ei~~~l~~~~~~~~~~~~~~~a~~Lv~~A~~~~ 64 (101)
+|+++.++|.+.|. +||+|||||+.++++|++.+|..++... ..-+.+|++|+++|+..+
T Consensus 318 eP~i~~HrL~p~DkFLIlASDGLwE~lsNeeaV~lV~~~i~~~--~pd~~~A~hLIr~aL~~a 378 (390)
T KOG0700|consen 318 EPSITHHKLTPNDKFLILASDGLWEYLSNEEAVSLVHEFISGK--FPDGNPATHLIRHALGRA 378 (390)
T ss_pred cceEEEEEcCCCCeEEEEeccchhhhcChHHHHHHHHHhhccC--CCCCCHHHHHHHHHHhhh
Confidence 57889999999997 6999999999999999999999875210 112458999999998643
No 15
>smart00331 PP2C_SIG Sigma factor PP2C-like phosphatases.
Probab=98.72 E-value=5.1e-08 Score=66.85 Aligned_cols=55 Identities=24% Similarity=0.196 Sum_probs=46.3
Q ss_pred CccceeEeccCCCEEEEeecCCCCCCChHHHHHHHHHhhCCCCchhHHHHHHHHHHHHH
Q psy4512 3 CADTSQFQVEDGDVILLATDGVFDNVPDSLLLAELVRAQGSKDPMQLQLVANTIALMAR 61 (101)
Q Consensus 3 ~~d~~~~~l~~gD~lvl~SDGl~d~l~~~ei~~~l~~~~~~~~~~~~~~~a~~Lv~~A~ 61 (101)
+++..++++++||+|+|+||||++++..+++.+++++.. ..++++++++|++...
T Consensus 138 ~~~~~~~~l~~gd~l~l~TDGl~e~~~~~~l~~~l~~~~----~~~~~~~~~~i~~~~~ 192 (193)
T smart00331 138 EVDVRELTLEPGDLLLLYTDGLTEARNPERLEELLEELL----GSPPAEIAQRILEELL 192 (193)
T ss_pred cceeEEEeeCCCCEEEEECCCccccCChHHHHHHHHHhc----CCCHHHHHHHHHHHHh
Confidence 356778899999999999999999999999999988763 3568888998887653
No 16
>KOG0699|consensus
Probab=98.58 E-value=2.5e-07 Score=70.85 Aligned_cols=78 Identities=13% Similarity=0.237 Sum_probs=60.0
Q ss_pred ccceeEeccCCC-EEEEeecCCCCCCChHHHHHHHHHhhCCCCchhHHHHHHHHHHHHHHHhcCCCCCCHHHHHHHHcCC
Q psy4512 4 ADTSQFQVEDGD-VILLATDGVFDNVPDSLLLAELVRAQGSKDPMQLQLVANTIALMARTLAFDETYMSPFAIQARANGI 82 (101)
Q Consensus 4 ~d~~~~~l~~gD-~lvl~SDGl~d~l~~~ei~~~l~~~~~~~~~~~~~~~a~~Lv~~A~~~~~d~~~~~p~~~~a~~~~~ 82 (101)
||+....|.+.| ++|+++||+|+.++.++++++++..+. ....+..+|+.|+..++. .++.
T Consensus 425 PDiK~l~lTpedEFmVvACDGIWN~MsSqeVVdFvr~~l~--~n~~ls~iceeL~D~CLA--p~T~-------------- 486 (542)
T KOG0699|consen 425 PDIKILALTPEDEFMVVACDGIWNSMSSQEVVDFVRDLLA--KNSSLSEICEELCDACLA--PSTD-------------- 486 (542)
T ss_pred ccceeEeecCcccEEEEEccchhhhccHHHHHHHHHHHHh--cCchHHHHHHHHHHhhcC--CCCC--------------
Confidence 567778888876 579999999999999999999997652 346788999999999973 1110
Q ss_pred CCCCCCCCceEEEEEEEe
Q psy4512 83 STQGGKPDDITVLLAIVA 100 (101)
Q Consensus 83 ~~~gg~~DdiTvvvv~v~ 100 (101)
-.|-.-||||||++.+.
T Consensus 487 -GDGTGCDNMT~ii~~Fk 503 (542)
T KOG0699|consen 487 -GDGTGCDNMTVIITTFK 503 (542)
T ss_pred -CCCcCCCcceEEEEEec
Confidence 12333699999999874
No 17
>COG2208 RsbU Serine phosphatase RsbU, regulator of sigma subunit [Signal transduction mechanisms / Transcription]
Probab=98.49 E-value=9.7e-07 Score=66.77 Aligned_cols=74 Identities=28% Similarity=0.269 Sum_probs=54.7
Q ss_pred ccceeEeccCCCEEEEeecCCCC-------CCChHHHHHHHHHhhCCCCchhHHHHHHHHHHHHHHHhcCCCCCCHHHHH
Q psy4512 4 ADTSQFQVEDGDVILLATDGVFD-------NVPDSLLLAELVRAQGSKDPMQLQLVANTIALMARTLAFDETYMSPFAIQ 76 (101)
Q Consensus 4 ~d~~~~~l~~gD~lvl~SDGl~d-------~l~~~ei~~~l~~~~~~~~~~~~~~~a~~Lv~~A~~~~~d~~~~~p~~~~ 76 (101)
....+.++++||.+++.|||+++ .+..+...+.+.+.. ..+++++++.+.+.......
T Consensus 286 ~~~~~~~l~~gd~lvl~tDGv~Ea~~~~~~~~~~~~~~~~~~~~~----~~~~~e~~~~i~~~l~~~~~----------- 350 (367)
T COG2208 286 YEVASLQLEPGDLLVLYTDGVTEARNSDGEFFGLERLLKILGRLL----GQPAEEILEAILESLEELQG----------- 350 (367)
T ss_pred chheeEEecCCCEEEEEcCCeeeeecCCccEecHHHHHHHHHHHh----CCCHHHHHHHHHHHHHHhhC-----------
Confidence 45678899999999999999998 334456666666542 36789999999988875321
Q ss_pred HHHcCCCCCCCCCCceEEEEEEEe
Q psy4512 77 ARANGISTQGGKPDDITVLLAIVA 100 (101)
Q Consensus 77 a~~~~~~~~gg~~DdiTvvvv~v~ 100 (101)
.-...||||++++++.
T Consensus 351 --------~~~~~DDiTll~lk~~ 366 (367)
T COG2208 351 --------DQIQDDDITLLVLKVK 366 (367)
T ss_pred --------CccccCceEEEEEEec
Confidence 1124688999999975
No 18
>PF13672 PP2C_2: Protein phosphatase 2C; PDB: 2JFT_A 2JFS_A 2V06_A 2JFR_A 2J86_A 2J82_A 2Y09_A 2XZV_A 2CM1_A 1TXO_B ....
Probab=97.90 E-value=6.5e-06 Score=57.15 Aligned_cols=37 Identities=38% Similarity=0.519 Sum_probs=22.9
Q ss_pred ccceeEeccCCCEEEEeecCCCCCCChH-HHHHHHHHh
Q psy4512 4 ADTSQFQVEDGDVILLATDGVFDNVPDS-LLLAELVRA 40 (101)
Q Consensus 4 ~d~~~~~l~~gD~lvl~SDGl~d~l~~~-ei~~~l~~~ 40 (101)
.++..+++++||.|+|||||||+++... .+..++...
T Consensus 157 ~~~~~~~~~~~d~ilL~SDG~~~~l~~~~~~~~~l~~~ 194 (212)
T PF13672_consen 157 VQYGSIPLEEGDVILLCSDGVWDNLRSYEDLEQFLKDL 194 (212)
T ss_dssp TEEEEEE--TT-EEEEE-HHHHTTS-HHHHHHHH----
T ss_pred CeEEEEEcCCCCEEEEECcCccccCCCHHHHHHHhhhc
Confidence 4566788999999999999999999764 466777654
No 19
>KOG0618|consensus
Probab=97.41 E-value=0.0005 Score=57.98 Aligned_cols=69 Identities=12% Similarity=0.221 Sum_probs=58.2
Q ss_pred CccceeEeccCCC-EEEEeecCCCCCCChHHHHHHHHHhhCCCCchhHHHHHHHHHHHHHHHhcCCCCCCHHHHHHHHcC
Q psy4512 3 CADTSQFQVEDGD-VILLATDGVFDNVPDSLLLAELVRAQGSKDPMQLQLVANTIALMARTLAFDETYMSPFAIQARANG 81 (101)
Q Consensus 3 ~~d~~~~~l~~gD-~lvl~SDGl~d~l~~~ei~~~l~~~~~~~~~~~~~~~a~~Lv~~A~~~~~d~~~~~p~~~~a~~~~ 81 (101)
.+.+....|.+-| ++|+++-+||++++-+++++.++.. .+|-.+|.+|+..|..
T Consensus 703 ~Phv~~~~Lt~qdE~LIvgn~~lW~~Lsid~a~~~vRn~------~dpL~AAkKL~d~AqS------------------- 757 (1081)
T KOG0618|consen 703 DPHVSVVILTEQDEFLIVGNKQLWSVLSIDTAVDAVRNV------EDPLLAAKKLCDLAQS------------------- 757 (1081)
T ss_pred CCceeeEecccCceEEEEcchHHhhhccHHHHHHHHhcC------CchHHHHHHHHHHHHh-------------------
Confidence 3567778888765 5899999999999999999999853 7899999999999982
Q ss_pred CCCCCCCCCceEEEEEEEe
Q psy4512 82 ISTQGGKPDDITVLLAIVA 100 (101)
Q Consensus 82 ~~~~gg~~DdiTvvvv~v~ 100 (101)
.|. .||++|+|+++.
T Consensus 758 ---Ygc-~~nv~vlVv~l~ 772 (1081)
T KOG0618|consen 758 ---YGC-AENVSVLVVRLN 772 (1081)
T ss_pred ---ccc-ccCeeEEEEEee
Confidence 344 899999999874
No 20
>PF09436 DUF2016: Domain of unknown function (DUF2016); InterPro: IPR018560 This entry represents the N-terminal of proteins that contain a ubiquitin domain.
Probab=89.29 E-value=0.27 Score=29.49 Aligned_cols=23 Identities=35% Similarity=0.541 Sum_probs=18.0
Q ss_pred cCCCEEEEeecCCCCCCChHHHH
Q psy4512 12 EDGDVILLATDGVFDNVPDSLLL 34 (101)
Q Consensus 12 ~~gD~lvl~SDGl~d~l~~~ei~ 34 (101)
++|.++++++||+|=.+....+.
T Consensus 25 ~~G~Rllva~nGv~lEv~r~WL~ 47 (72)
T PF09436_consen 25 RPGHRLLVASNGVFLEVRRPWLH 47 (72)
T ss_pred cCCcEEEEecCcEEEEEechHHH
Confidence 48999999999999766554443
No 21
>COG2168 DsrH Uncharacterized conserved protein involved in oxidation of intracellular sulfur [Inorganic ion transport and metabolism]
Probab=76.89 E-value=2.2 Score=26.97 Aligned_cols=29 Identities=34% Similarity=0.597 Sum_probs=22.1
Q ss_pred eccCCCEEEEeecCCCCCCChHHHHHHHH
Q psy4512 10 QVEDGDVILLATDGVFDNVPDSLLLAELV 38 (101)
Q Consensus 10 ~l~~gD~lvl~SDGl~d~l~~~ei~~~l~ 38 (101)
-+.+||.++|+.|||+=.+...+..+-++
T Consensus 21 ~l~~~D~vlL~qdGV~aAl~~~~~~~sl~ 49 (96)
T COG2168 21 LLTEGDAVLLLQDGVYAALKGNRYLASLR 49 (96)
T ss_pred HhcccCeEEEEcccchhhhcCcHHHHHHh
Confidence 46799999999999988776555544444
No 22
>TIGR03735 PRTRC_A PRTRC system protein A. A novel genetic system characterized by six major proteins, included a ParB homolog and a ThiF homolog, is designated PRTRC, or ParB-Related,ThiF-Related Cassette. It is often found on plasmids. This protein family is designated protein A.
Probab=74.64 E-value=11 Score=26.60 Aligned_cols=51 Identities=25% Similarity=0.228 Sum_probs=33.3
Q ss_pred cCCCEEEEeecCCCCCCChHHHHHHHHHh--h------CC-------CCchhHHHHHHHHHHHHHH
Q psy4512 12 EDGDVILLATDGVFDNVPDSLLLAELVRA--Q------GS-------KDPMQLQLVANTIALMART 62 (101)
Q Consensus 12 ~~gD~lvl~SDGl~d~l~~~ei~~~l~~~--~------~~-------~~~~~~~~~a~~Lv~~A~~ 62 (101)
+.|.++++++||||=.+-..++.-+..-. . |. ..+.=+..+.++++..|++
T Consensus 24 ~~g~r~~~a~~G~~lev~r~wl~~~~~~~~~~~~~~PYg~~~~~~~~~~g~Ip~~l~~~ii~hAr~ 89 (192)
T TIGR03735 24 KPGHRFIVAADGVWREVRRPWLHAIQRVAPASPITVPYGAVEETLEFLCGPIPASLLEEFAEAARA 89 (192)
T ss_pred cCCcEEEEecCcEEEEEecHHHHHHHHhcccccccccceeeeeeEEEecCCCCHHHHHHHHHHHHh
Confidence 57999999999999877655554332211 1 10 1234466788888888873
No 23
>PF06972 DUF1296: Protein of unknown function (DUF1296); InterPro: IPR009719 This family represents a conserved region approximately 60 residues long within a number of plant proteins of unknown function.
Probab=62.18 E-value=15 Score=21.26 Aligned_cols=26 Identities=12% Similarity=0.066 Sum_probs=21.8
Q ss_pred CChHHHHHHHHHhhCCCCchhHHHHHHHHHH
Q psy4512 28 VPDSLLLAELVRAQGSKDPMQLQLVANTIAL 58 (101)
Q Consensus 28 l~~~ei~~~l~~~~~~~~~~~~~~~a~~Lv~ 58 (101)
-+++||...+... ..+|++++++|+.
T Consensus 19 hse~eIya~L~ec-----nMDpnea~qrLL~ 44 (60)
T PF06972_consen 19 HSEEEIYAMLKEC-----NMDPNEAVQRLLS 44 (60)
T ss_pred CCHHHHHHHHHHh-----CCCHHHHHHHHHh
Confidence 5788998888876 4899999999975
No 24
>COG3700 AphA Acid phosphatase (class B) [General function prediction only]
Probab=58.14 E-value=31 Score=24.68 Aligned_cols=48 Identities=15% Similarity=0.240 Sum_probs=30.1
Q ss_pred cCCCEEEEeecCCCC---CC--------ChHHHHHHHHHhhCCCCchhHHHHHHHHHHHHH
Q psy4512 12 EDGDVILLATDGVFD---NV--------PDSLLLAELVRAQGSKDPMQLQLVANTIALMAR 61 (101)
Q Consensus 12 ~~gD~lvl~SDGl~d---~l--------~~~ei~~~l~~~~~~~~~~~~~~~a~~Lv~~A~ 61 (101)
.-+|.+++.|.|+|. ++ .++..=+.+.. +-....-|.++|+.|+..-.
T Consensus 69 DIDDTvLFsSp~F~~Gk~~~sPgs~DyLknq~FW~~vn~--g~D~~SIPKevA~qLI~MHq 127 (237)
T COG3700 69 DIDDTVLFSSPGFWRGKKYFSPGSEDYLKNQVFWEKVNN--GWDEFSIPKEVARQLIDMHQ 127 (237)
T ss_pred ccCCeeEecccccccCccccCCChHHhhcCHHHHHHHhc--CCccccchHHHHHHHHHHHH
Confidence 457899999999995 23 22222222221 11223458999999998876
No 25
>PRK03982 heat shock protein HtpX; Provisional
Probab=54.59 E-value=13 Score=27.45 Aligned_cols=27 Identities=11% Similarity=0.288 Sum_probs=22.7
Q ss_pred CCCEEEEeecCCCCCCChHHHHHHHHH
Q psy4512 13 DGDVILLATDGVFDNVPDSLLLAELVR 39 (101)
Q Consensus 13 ~gD~lvl~SDGl~d~l~~~ei~~~l~~ 39 (101)
+...+|..|||+.+.++++|+..++..
T Consensus 105 ~~~~~V~vt~gLl~~l~~~El~AVlAH 131 (288)
T PRK03982 105 PKHAVVAVTEGILNLLNEDELEGVIAH 131 (288)
T ss_pred CCCeEEEeehHHHhhCCHHHHHHHHHH
Confidence 446788899999999999999887764
No 26
>PF05762 VWA_CoxE: VWA domain containing CoxE-like protein; InterPro: IPR008912 This group of proteins contains a VWA type domain and the function of this family is unknown. It is found as part of a CO oxidising (Cox) system operon in several bacteria [].
Probab=52.81 E-value=24 Score=24.91 Aligned_cols=26 Identities=27% Similarity=0.445 Sum_probs=17.6
Q ss_pred CCCEEEEeecCCCCCCChHHHHHHHHH
Q psy4512 13 DGDVILLATDGVFDNVPDSLLLAELVR 39 (101)
Q Consensus 13 ~gD~lvl~SDGl~d~l~~~ei~~~l~~ 39 (101)
.+-.||+.||| |++-..+...+.+.+
T Consensus 149 ~~t~vvIiSDg-~~~~~~~~~~~~l~~ 174 (222)
T PF05762_consen 149 RRTTVVIISDG-WDTNDPEPLAEELRR 174 (222)
T ss_pred cCcEEEEEecc-cccCChHHHHHHHHH
Confidence 45679999999 466655555555544
No 27
>PF05708 DUF830: Orthopoxvirus protein of unknown function (DUF830); PDB: 2IF6_B 3KW0_C.
Probab=49.28 E-value=14 Score=24.21 Aligned_cols=15 Identities=27% Similarity=0.541 Sum_probs=7.2
Q ss_pred eccCCCEEEEeecCC
Q psy4512 10 QVEDGDVILLATDGV 24 (101)
Q Consensus 10 ~l~~gD~lvl~SDGl 24 (101)
+|++||+|+.-++|.
T Consensus 1 ~l~~GDIil~~~~~~ 15 (158)
T PF05708_consen 1 KLQTGDIILTRGKSS 15 (158)
T ss_dssp ---TT-EEEEEE-SC
T ss_pred CCCCeeEEEEECCch
Confidence 477888877776644
No 28
>PF04077 DsrH: DsrH like protein; InterPro: IPR007215 The three proteins TusB, TusC, and TusD form a heterohexamer responsible for a sulphur relay reaction. In large numbers of proteobacterial species, this complex acts on a Cys-derived persulphide moiety, delivered by the cysteine desulphurase IscS to TusA, then to TusBCD. The activated sulphur group is then transferred to TusE (DsrC), then by MnmA (TrmU) for modification of an anticodon nucleotide in tRNAs for Glu, Lys, and Gln. The sulphur relay complex TusBCD is also found, under the designation DsrEFH, in phototrophic and chemotrophic sulphur bacteria, such as Chromatium vinosum. In these organisms, it seems the primary purpose is related to sulphur flux, such as oxidation from sulphide to molecular sulphur to sulphate [].; GO: 0002143 tRNA wobble position uridine thiolation, 0005737 cytoplasm; PDB: 2HYB_O 2HY5_C 1X9A_A 1RHX_A 2D1P_C.
Probab=48.03 E-value=8.2 Score=23.54 Aligned_cols=28 Identities=36% Similarity=0.584 Sum_probs=18.2
Q ss_pred ccCCCEEEEeecCCCCCCChHHHHHHHH
Q psy4512 11 VEDGDVILLATDGVFDNVPDSLLLAELV 38 (101)
Q Consensus 11 l~~gD~lvl~SDGl~d~l~~~ei~~~l~ 38 (101)
+++||.|+|.-|||+-.+........+.
T Consensus 16 ~~~~D~ilLiqDgV~~a~~~~~~~~~l~ 43 (88)
T PF04077_consen 16 LSEGDAILLIQDGVYAALKGSPYFKLLQ 43 (88)
T ss_dssp --TT-EEEE-GGGGGGGBTTSTTHHHHH
T ss_pred cCCCCEEEeeHHHHHHHhcCCHHHHHHh
Confidence 5789999999999998776655444443
No 29
>TIGR03011 sulf_tusB_dsrH sulfur relay protein TusB/DsrH. The three proteins TusB, TusC, and TusD form a heterohexamer responsible for a sulfur relay reaction. In large numbers of proteobacterial species, this complex acts on a Cys-derived persulfide moiety, delivered by the cysteine desulfurase IscS to TusA, then to TusBCD. The activated sulfur group is then transferred to TusE (DsrC), then by MnmA (TrmU) for modification of an anticodon nucleotide in tRNAs for Glu, Lys, and Gln. The sulfur relay complex TusBCD is also found, under the designation DsrEFH, in phototrophic and chemotrophic sulfur bacteria, such as Chromatium vinosum. In these organisms, it seems the primary purpose is related to sulfur flux, such as oxidation from sulfide to molecular sulfur to sulfate.
Probab=46.61 E-value=9.7 Score=23.09 Aligned_cols=26 Identities=38% Similarity=0.630 Sum_probs=18.7
Q ss_pred cCCCEEEEeecCCCCCCChHHHHHHH
Q psy4512 12 EDGDVILLATDGVFDNVPDSLLLAEL 37 (101)
Q Consensus 12 ~~gD~lvl~SDGl~d~l~~~ei~~~l 37 (101)
.++|-|+|..|||+--+..++..+.+
T Consensus 21 ~~~d~vll~~DgV~~l~~~~~~~~~l 46 (94)
T TIGR03011 21 GPGDAILLLQDGVYAALEGNRYLSAL 46 (94)
T ss_pred CCCCeEEEehhhHHHHhcCChHHHHH
Confidence 56899999999998766554444333
No 30
>COG0501 HtpX Zn-dependent protease with chaperone function [Posttranslational modification, protein turnover, chaperones]
Probab=45.15 E-value=14 Score=26.74 Aligned_cols=25 Identities=28% Similarity=0.328 Sum_probs=21.5
Q ss_pred CEEEEeecCCCCCCChHHHHHHHHH
Q psy4512 15 DVILLATDGVFDNVPDSLLLAELVR 39 (101)
Q Consensus 15 D~lvl~SDGl~d~l~~~ei~~~l~~ 39 (101)
--.|..|+|+.+.++++|+..++..
T Consensus 139 ~~~V~vt~gLl~~l~~dEl~aVlaH 163 (302)
T COG0501 139 NGRVVVTTGLLDLLNDDELEAVLAH 163 (302)
T ss_pred CeeEEecHHHHhhCCHHHHHHHHHH
Confidence 3588999999999999999988764
No 31
>PRK03072 heat shock protein HtpX; Provisional
Probab=45.10 E-value=17 Score=26.97 Aligned_cols=27 Identities=15% Similarity=0.223 Sum_probs=22.1
Q ss_pred CCCEEEEeecCCCCCCChHHHHHHHHH
Q psy4512 13 DGDVILLATDGVFDNVPDSLLLAELVR 39 (101)
Q Consensus 13 ~gD~lvl~SDGl~d~l~~~ei~~~l~~ 39 (101)
++..+|.+|||+.+.++++|+..++..
T Consensus 107 ~~~~~v~vt~gLl~~l~~~El~aVlAH 133 (288)
T PRK03072 107 PRNAAVCCTEGILQILNERELRGVLGH 133 (288)
T ss_pred CCCcEEEecHHHHHhCCHHHHHHHHHH
Confidence 344578899999999999999887764
No 32
>PRK05457 heat shock protein HtpX; Provisional
Probab=43.60 E-value=25 Score=26.03 Aligned_cols=29 Identities=14% Similarity=0.302 Sum_probs=24.2
Q ss_pred ccCCCEEEEeecCCCCCCChHHHHHHHHH
Q psy4512 11 VEDGDVILLATDGVFDNVPDSLLLAELVR 39 (101)
Q Consensus 11 l~~gD~lvl~SDGl~d~l~~~ei~~~l~~ 39 (101)
..+...+|+.|.|+.+.++++|+..++..
T Consensus 112 ~~~~~~~V~vt~gLl~~L~~~El~aVlAH 140 (284)
T PRK05457 112 ASKNNSLVAVSTGLLQNMSRDEVEAVLAH 140 (284)
T ss_pred CCCCCeEEEeehHHhhhCCHHHHHHHHHH
Confidence 34456799999999999999999888764
No 33
>PRK02391 heat shock protein HtpX; Provisional
Probab=42.99 E-value=19 Score=26.88 Aligned_cols=27 Identities=19% Similarity=0.261 Sum_probs=22.5
Q ss_pred CCCEEEEeecCCCCCCChHHHHHHHHH
Q psy4512 13 DGDVILLATDGVFDNVPDSLLLAELVR 39 (101)
Q Consensus 13 ~gD~lvl~SDGl~d~l~~~ei~~~l~~ 39 (101)
+...+|+.|||+.+.++++|+..++..
T Consensus 113 ~~~~~V~vt~gLl~~L~~~El~aVlaH 139 (296)
T PRK02391 113 PKNAVVCVTTGLMRRLDPDELEAVLAH 139 (296)
T ss_pred CCCcEEEecHHHHhhCCHHHHHHHHHH
Confidence 345688999999999999999887753
No 34
>COG0054 RibH Riboflavin synthase beta-chain [Coenzyme metabolism]
Probab=40.73 E-value=48 Score=22.66 Aligned_cols=30 Identities=17% Similarity=0.160 Sum_probs=18.7
Q ss_pred HHHHHHHHHHHHHHhcCCCCCCHHHHHHHHcCCCCCCCCCCceEEEEE
Q psy4512 50 QLVANTIALMARTLAFDETYMSPFAIQARANGISTQGGKPDDITVLLA 97 (101)
Q Consensus 50 ~~~a~~Lv~~A~~~~~d~~~~~p~~~~a~~~~~~~~gg~~DdiTvvvv 97 (101)
..++++|++-|...+. ..|+..|||+++-+
T Consensus 24 ~~I~d~ll~gA~~~l~------------------~~G~~~~~i~vv~V 53 (152)
T COG0054 24 DDITDALLEGAVDALK------------------RHGADVDNIDVVRV 53 (152)
T ss_pred HHHHHHHHHHHHHHHH------------------HcCCCcccceEEEe
Confidence 4577777777764332 23566777877654
No 35
>cd06409 PB1_MUG70 The MUG70 protein is a product of the meiotically up-regulated gene 70 which has a role in meiosis and harbors a PB1 domain. The PB1 domain is a modular domain mediating specific protein-protein interactions which play a role in many critical cell processes such as osteoclastogenesis, angiogenesis, early cardiovascular development, and cell polarity. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domains, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domains depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic amino acid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. Interactions of PB1 domains with other protein domains have been described as noncanonical PB1-interactions. The PB1 domain module is
Probab=39.71 E-value=21 Score=21.97 Aligned_cols=11 Identities=36% Similarity=1.035 Sum_probs=10.2
Q ss_pred CCCEEEEeecC
Q psy4512 13 DGDVILLATDG 23 (101)
Q Consensus 13 ~gD~lvl~SDG 23 (101)
+||.|+|+||.
T Consensus 52 EgD~VllT~D~ 62 (86)
T cd06409 52 EGDIVLITSDS 62 (86)
T ss_pred CCCEEEEeccc
Confidence 79999999997
No 36
>cd01234 PH_CADPS CADPS (Ca2+-dependent activator protein) Pleckstrin homology (PH) domain. CADPS (Ca2+-dependent activator protein) Pleckstrin homology (PH) domain. CADPS is a calcium-dependent activator involved in secretion. It contains a central PH domain that binds to phosphoinositide 4,5 bisphosphate containing liposomes. However, membrane association may also be mediated by binding to phosphatidlyserine via general electrostatic interactions. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
Probab=38.37 E-value=22 Score=23.06 Aligned_cols=14 Identities=43% Similarity=0.688 Sum_probs=12.1
Q ss_pred eccCCCEEEEeecC
Q psy4512 10 QVEDGDVILLATDG 23 (101)
Q Consensus 10 ~l~~gD~lvl~SDG 23 (101)
.+++||.++++||-
T Consensus 82 avkegd~~~fa~~d 95 (117)
T cd01234 82 AVKEGDELKFATDD 95 (117)
T ss_pred eeccCcEEEEeccc
Confidence 57899999999985
No 37
>PRK02870 heat shock protein HtpX; Provisional
Probab=36.92 E-value=33 Score=26.19 Aligned_cols=26 Identities=15% Similarity=0.298 Sum_probs=22.2
Q ss_pred CCEEEEeecCCCCCCChHHHHHHHHH
Q psy4512 14 GDVILLATDGVFDNVPDSLLLAELVR 39 (101)
Q Consensus 14 gD~lvl~SDGl~d~l~~~ei~~~l~~ 39 (101)
.+..|..|||+.+.++++|+..++..
T Consensus 154 ~~~~Ivvt~GLL~~L~~dEL~aVlAH 179 (336)
T PRK02870 154 KSAMVAITTGLLEKLDRDELQAVMAH 179 (336)
T ss_pred CCcEEEEehHHhhhCCHHHHHHHHHH
Confidence 45689999999999999999888764
No 38
>PRK03001 M48 family peptidase; Provisional
Probab=36.36 E-value=21 Score=26.15 Aligned_cols=25 Identities=12% Similarity=0.255 Sum_probs=21.2
Q ss_pred CEEEEeecCCCCCCChHHHHHHHHH
Q psy4512 15 DVILLATDGVFDNVPDSLLLAELVR 39 (101)
Q Consensus 15 D~lvl~SDGl~d~l~~~ei~~~l~~ 39 (101)
...|..|||+.+.++++|+..++..
T Consensus 106 ~~~Ivvt~gLl~~l~~~El~aVlAH 130 (283)
T PRK03001 106 HAAVAATTGILRVLSEREIRGVMAH 130 (283)
T ss_pred CeEEEecHHHHhhCCHHHHHHHHHH
Confidence 3468899999999999999888764
No 39
>PRK00050 16S rRNA m(4)C1402 methyltranserfase; Provisional
Probab=36.14 E-value=96 Score=23.25 Aligned_cols=53 Identities=17% Similarity=0.172 Sum_probs=33.9
Q ss_pred ccceeEeccCCCE-EEEeecCCCC----------------CCChHHHHHHHHHhhCCCCchhHHHHHHHHHHH
Q psy4512 4 ADTSQFQVEDGDV-ILLATDGVFD----------------NVPDSLLLAELVRAQGSKDPMQLQLVANTIALM 59 (101)
Q Consensus 4 ~d~~~~~l~~gD~-lvl~SDGl~d----------------~l~~~ei~~~l~~~~~~~~~~~~~~~a~~Lv~~ 59 (101)
..+++++|...++ |=+-.||-.| .++.+++..++..+- .......+|+.|++.
T Consensus 98 LGvSs~Qld~~~RGFSf~~~gpLDMRmd~~~~~sA~~~ln~~~~~~L~~i~~~yG---ee~~a~~iA~~Iv~~ 167 (296)
T PRK00050 98 LGVSSPQLDDAERGFSFRKDGPLDMRMDQSQGLTAAEVVNTYSEEELARIFKEYG---EERFARRIARAIVEA 167 (296)
T ss_pred CCccccccCCCcCCcccCCCCCCcCCcCCCCCCCHHHHHhhCCHHHHHHHHHHhc---CcchHHHHHHHHHHH
Confidence 3456777777776 6667788666 335566666666541 234566788888755
No 40
>COG2144 Selenophosphate synthetase-related proteins [General function prediction only]
Probab=36.06 E-value=39 Score=25.74 Aligned_cols=19 Identities=26% Similarity=0.648 Sum_probs=14.8
Q ss_pred EeccCCCEEEEeecCCCCCC
Q psy4512 9 FQVEDGDVILLATDGVFDNV 28 (101)
Q Consensus 9 ~~l~~gD~lvl~SDGl~d~l 28 (101)
+++. |++++++.||.|..|
T Consensus 49 I~v~-~~~lliaadGi~g~l 67 (324)
T COG2144 49 IRVG-DGKLLIAADGIWGKL 67 (324)
T ss_pred EeeC-CcEEEEecCCccccc
Confidence 3444 789999999999655
No 41
>cd00534 DHNA_DHNTPE Dihydroneopterin aldolase (DHNA) and 7,8-dihydroneopterin triphosphate epimerase domain (DHNTPE); these enzymes have been designated folB and folX, respectively. Folate derivatives are essential cofactors in the biosynthesis of purines, pyrimidines, and amino acids, as well as formyl-tRNA. Mammalian cells are able to utilize pre-formed folates after uptake by a carrier-mediated active transport system. Most microbes and plants lack this system and must synthesize folates de novo from guanosine triphosphate. One enzyme from this pathway is DHNA which catalyses the conversion of 7,8-dihydroneopterin to 6-hydroxymethyl-7,8-dihydropterin in the biosynthetic pathway of tetrahydrofolate. Though it is known that DHNTPE catalyzes the epimerization of dihydroneopterin triphosphate to dihydromonapterin triphosphate, the biological role of this enzyme is still unclear. It is hypothesized that it is not an essential protein since a folX knockout in E. coli has a normal phenoty
Probab=36.01 E-value=1.1e+02 Score=19.22 Aligned_cols=43 Identities=12% Similarity=0.065 Sum_probs=30.3
Q ss_pred eecCCCCCCChHHHHHHHHHhhCCCCchhHHHHHHHHHHHHHH
Q psy4512 20 ATDGVFDNVPDSLLLAELVRAQGSKDPMQLQLVANTIALMART 62 (101)
Q Consensus 20 ~SDGl~d~l~~~ei~~~l~~~~~~~~~~~~~~~a~~Lv~~A~~ 62 (101)
.||-|-+.+.-..+.+.+.+..........+.+|+.|.+....
T Consensus 43 ~~D~l~~tidY~~l~~~i~~~~~~~~~~llE~La~~ia~~i~~ 85 (118)
T cd00534 43 ESDDLADTLNYAEVAKLIKKIVEGSPFKLIETLAEEIADILLE 85 (118)
T ss_pred ccCChhhccCHHHHHHHHHHHHhCCCHhHHHHHHHHHHHHHHH
Confidence 4677777776677777777654333346788899999888874
No 42
>PRK04897 heat shock protein HtpX; Provisional
Probab=35.43 E-value=44 Score=24.84 Aligned_cols=27 Identities=11% Similarity=0.100 Sum_probs=22.8
Q ss_pred CCCEEEEeecCCCCCCChHHHHHHHHH
Q psy4512 13 DGDVILLATDGVFDNVPDSLLLAELVR 39 (101)
Q Consensus 13 ~gD~lvl~SDGl~d~l~~~ei~~~l~~ 39 (101)
+....|+.|+|+.+.++++|+..++..
T Consensus 117 ~~~~~v~vt~gLl~~l~~~El~aVlAH 143 (298)
T PRK04897 117 PKNAAVAVTTGLLAIMNREELEGVIGH 143 (298)
T ss_pred CCCcEEEeehHHHhhCCHHHHHHHHHH
Confidence 345689999999999999999888764
No 43
>PF02080 TrkA_C: TrkA-C domain; InterPro: IPR006037 The regulator of K+ conductance (RCK) domain is found in many ligand-gated K+ channels, most often attached to the intracellular carboxy terminus. The domain is prevalent among prokaryotic K+ channels, and also found in eukaryotic, high-conductance Ca2+-activated K+ channels (BK channels) [, , ]. Largely involved in redox-linked regulation of potassium channels, the N-terminal part of the RCK domain is predicted to be an active dehydrogenase at least in some cases []. Some have a conserved sequence motif (G-x-G-x-x-G-x(n)-[DE]) for NAD+ binding [], but others do not, reflecting the diversity of ligands for RCK domains. The C-terminal part is less conserved, being absent in some channels, such as the kefC antiporter from Escherichia coli. It is predicted to bind unidentified ligands and to regulate sulphate, sodium and other transporters. The X-ray structure of several RCK domains has been solved [, , ]. It reveals an alpha-beta fold similar to dehydrogenase enzymes. The domain forms a homodimer, producing a cleft between two lobes. It has a composite structure, with an N-terminal (RCK-N), and a C-terminal (RCK-C) subdomain. The RCK-N subdomain forms a Rossmann fold with two alpha helices on one side of a six stranded parallel beta sheet and three alpha helices on the other side. The RCK-C subdomain is an all-beta-strand fold. It forms an extention of the dimer interface and further stabilises the RCK homodimer [, , ]. Ca2+ is a ligand that opens the channel in a concentration-dependent manner. Two Ca2+ ions are located at the base of a cleft between two RCK domains, coordinated by the carboxylate groups of two glutamate residues, and by an aspartate residue [, , ]. RCK domains occur in at least five different contexts: As a single domain on the C terminus of some K+ channels (for example, many prokaryotic K+ channels). As two tandem RCK domains on the C terminus of some transporters that form gating rings (for example, eukaryotic BK channels). The gating ring has an arrangement of eight identical RCK domains, one from each of the four pore-forming subunits and four from the intracellular solution. As two domains, one at the N terminus and another at the C terminus of transporter (for example, the prokaryotic trk system potassium uptake protein A). As a soluble protein (not part of a K+ channel) consisting of two tandem RCK domains. As a soluble protein consisting of a single RCK domain. This entry represents the C-terminal subdomain of RCK.; GO: 0008324 cation transmembrane transporter activity, 0006813 potassium ion transport; PDB: 2BKP_A 1VCT_A 2BKO_A 2BKN_A 3L4B_C 2FY8_D 2AEF_A 1LNQ_E 3RBX_C 3KXD_A ....
Probab=35.05 E-value=23 Score=19.78 Aligned_cols=15 Identities=20% Similarity=0.576 Sum_probs=9.6
Q ss_pred eEeccCCCEEEEeec
Q psy4512 8 QFQVEDGDVILLATD 22 (101)
Q Consensus 8 ~~~l~~gD~lvl~SD 22 (101)
...|++||.+++..+
T Consensus 45 ~~~l~~gD~l~v~g~ 59 (71)
T PF02080_consen 45 DTVLQAGDILIVVGD 59 (71)
T ss_dssp T-BE-TTEEEEEEEE
T ss_pred CCEECCCCEEEEEEC
Confidence 567899999766553
No 44
>COG0853 PanD Aspartate 1-decarboxylase [Coenzyme metabolism]
Probab=35.03 E-value=30 Score=22.86 Aligned_cols=21 Identities=14% Similarity=0.324 Sum_probs=15.6
Q ss_pred eccCCCEEEEeecCCCCCCChHHH
Q psy4512 10 QVEDGDVILLATDGVFDNVPDSLL 33 (101)
Q Consensus 10 ~l~~gD~lvl~SDGl~d~l~~~ei 33 (101)
..++||++|+||=+. ++++|+
T Consensus 77 l~~~GD~VII~sy~~---~~e~e~ 97 (126)
T COG0853 77 LVQVGDLVIIMSYAQ---MSEEEA 97 (126)
T ss_pred hCCCCCEEEEEEccc---CCHHHH
Confidence 458999999999765 445555
No 45
>COG3411 Ferredoxin [Energy production and conversion]
Probab=34.71 E-value=79 Score=18.47 Aligned_cols=30 Identities=20% Similarity=0.257 Sum_probs=24.4
Q ss_pred cCCCEEEEeecCCCC-CCChHHHHHHHHHhh
Q psy4512 12 EDGDVILLATDGVFD-NVPDSLLLAELVRAQ 41 (101)
Q Consensus 12 ~~gD~lvl~SDGl~d-~l~~~ei~~~l~~~~ 41 (101)
+.|=+++..-||+|- +++.+.+.++++++.
T Consensus 15 ~~gPvl~vYpegvWY~~V~p~~a~rIv~~hl 45 (64)
T COG3411 15 QDGPVLVVYPEGVWYTRVDPEDARRIVQSHL 45 (64)
T ss_pred ccCCEEEEecCCeeEeccCHHHHHHHHHHHH
Confidence 456689999999996 678888888888774
No 46
>PF13649 Methyltransf_25: Methyltransferase domain; PDB: 3BXO_B 3GGD_A 3PX2_A 3PX3_A 3PFH_D 3PFG_A 1Y8C_A.
Probab=34.64 E-value=49 Score=19.73 Aligned_cols=28 Identities=21% Similarity=0.290 Sum_probs=20.9
Q ss_pred CCCE-EEEeecCCCCCCChHHHHHHHHHh
Q psy4512 13 DGDV-ILLATDGVFDNVPDSLLLAELVRA 40 (101)
Q Consensus 13 ~gD~-lvl~SDGl~d~l~~~ei~~~l~~~ 40 (101)
.+.+ +|+|+-+++++++++++..++++.
T Consensus 65 ~~~~D~v~~~~~~~~~~~~~~~~~ll~~~ 93 (101)
T PF13649_consen 65 DGKFDLVVCSGLSLHHLSPEELEALLRRI 93 (101)
T ss_dssp SSSEEEEEE-TTGGGGSSHHHHHHHHHHH
T ss_pred CCCeeEEEEcCCccCCCCHHHHHHHHHHH
Confidence 3445 788888878889999998888764
No 47
>TIGR02219 phage_NlpC_fam putative phage cell wall peptidase, NlpC/P60 family. Members of this family show sequence similarity to members of the NlpC/P60 family described by Pfam model pfam00877 and by Anantharaman and Aravind (PubMed:12620121). The NlpC/P60 family includes a number of characterized bacterial cell wall hydrolases. Members of this related family are all found in prophage regions of bacterial genomes.
Probab=34.55 E-value=33 Score=22.34 Aligned_cols=14 Identities=21% Similarity=0.318 Sum_probs=11.3
Q ss_pred EeccCCCEEEEeec
Q psy4512 9 FQVEDGDVILLATD 22 (101)
Q Consensus 9 ~~l~~gD~lvl~SD 22 (101)
-++++||+|++.+.
T Consensus 75 ~~~qpGDlvff~~~ 88 (134)
T TIGR02219 75 DAAQPGDVLVFRWR 88 (134)
T ss_pred hcCCCCCEEEEeeC
Confidence 36899999998753
No 48
>PRK13510 sulfur transfer complex subunit TusB; Provisional
Probab=34.36 E-value=22 Score=22.07 Aligned_cols=21 Identities=33% Similarity=0.561 Sum_probs=16.4
Q ss_pred eccCCCEEEEeecCCCCCCCh
Q psy4512 10 QVEDGDVILLATDGVFDNVPD 30 (101)
Q Consensus 10 ~l~~gD~lvl~SDGl~d~l~~ 30 (101)
-+++||.|+|.-|||.-.+.+
T Consensus 20 ~~~~~D~iLLieDgV~aa~~~ 40 (95)
T PRK13510 20 LLKEGDDLLLLQDGVTAALKG 40 (95)
T ss_pred hcCCCCEEEEehHHHHHHhcC
Confidence 357899999999999655543
No 49
>TIGR00074 hypC_hupF hydrogenase assembly chaperone HypC/HupF. An additional proposed function is to shuttle the iron atom that has been liganded at the HypC/HypD complex to the precursor of the large hydrogenase (HycE) subunit. PubMed:12441107.
Probab=33.85 E-value=73 Score=19.07 Aligned_cols=30 Identities=17% Similarity=0.160 Sum_probs=20.3
Q ss_pred eccCCCEEEEeecCCCCCCChHHHHHHHHH
Q psy4512 10 QVEDGDVILLATDGVFDNVPDSLLLAELVR 39 (101)
Q Consensus 10 ~l~~gD~lvl~SDGl~d~l~~~ei~~~l~~ 39 (101)
++++||++++-..=....+++++..+.+..
T Consensus 35 ~~~vGD~VLVH~G~Ai~~ide~eA~e~l~~ 64 (76)
T TIGR00074 35 EVKVGDYVLVHVGFAISVLDEEEARETLDA 64 (76)
T ss_pred CCCCCCEEEEecChhhhhCCHHHHHHHHHH
Confidence 467899988777544556677776665544
No 50
>cd04436 DEP_fRgd2 DEP (Dishevelled, Egl-10, and Pleckstrin) domain found in fungal RhoGAP (GTPase-activator protein) Rgd2-like proteins. Rgd2-like proteins share a common domain architecture, containing, beside the RhoGAP domain, a DEP and a FCH (Fes/CIP4 homology) domain. Yeast Rgd2 is a GAP protein for Cdc42 and Rho5.
Probab=33.18 E-value=1.2e+02 Score=18.65 Aligned_cols=39 Identities=15% Similarity=0.137 Sum_probs=27.8
Q ss_pred CCCCCC-ChHHHHHHHHHhhCCCCchhHHHHHHHHHHHHH
Q psy4512 23 GVFDNV-PDSLLLAELVRAQGSKDPMQLQLVANTIALMAR 61 (101)
Q Consensus 23 Gl~d~l-~~~ei~~~l~~~~~~~~~~~~~~~a~~Lv~~A~ 61 (101)
|-+.|. +.++|++.+.+.++..+-..++.+.+.|++.-.
T Consensus 24 Gty~nt~sG~~Iv~~L~~n~~~~s~~~aE~fGQdLv~~gf 63 (84)
T cd04436 24 GTYQNTSSGSEIVSWLQENMPEKDLDAAEAFGQDLLNQGF 63 (84)
T ss_pred ccccCcccHHHHHHHHHHcCCCCCHHHHHHHHHHHHhCch
Confidence 445566 668999999988755455667778888876543
No 51
>PRK01345 heat shock protein HtpX; Provisional
Probab=33.10 E-value=35 Score=25.67 Aligned_cols=25 Identities=12% Similarity=0.271 Sum_probs=21.5
Q ss_pred CEEEEeecCCCCCCChHHHHHHHHH
Q psy4512 15 DVILLATDGVFDNVPDSLLLAELVR 39 (101)
Q Consensus 15 D~lvl~SDGl~d~l~~~ei~~~l~~ 39 (101)
...|+.|+||.+.++++|+..++..
T Consensus 106 ~~~V~vt~gLL~~L~~dEL~aVlAH 130 (317)
T PRK01345 106 NAAVAATTGLLQRLSPEEVAGVMAH 130 (317)
T ss_pred CeEEEechHHHhhCCHHHHHHHHHH
Confidence 4589999999999999999887764
No 52
>PF10610 Tafi-CsgC: Thin aggregative fimbriae synthesis protein; InterPro: IPR014491 Thin aggressive fibres known as curli fibres or fimbriae (curli; Tafi) are cell-surface protein polymers found in Salmonella typhimurium and Escherichia coli that mediate interactions important for host and environmental persistence, development of biofilms, motility, colonisation and invasion of cells, and conjugation []. Four general assembly pathways for different fimbriae have been proposed, one of which is extracellular nucleation-precipitation (ENP), which differs from the others in that fibre-growth occurs extracellularly. Thin aggregative fimbriae are the only fimbriae dependent on the ENP pathway. Tafi were first identified in Salmonella spp and the controlling operon termed agf; however subsequent isolation of the homologous operon in E coli led to its being called csg. Tafi are known as curli because, in the absence of extracellular polysaccharides, their morphology appears curled; however, when expressed with such polysaccharides their morphology appears as a tangled amorphous matrix. The gene agfC is found to be transcribed at low levels, localised to the periplasm in a mature form, and in combination with AgfE is important for AgfA extracellular assembly, which facilitates the synthesis of Tafi. The genes involved in Tafi production are organised into two adjacent divergently transcribed operons, agfBAC and agfDEFG, both of which are required for biosynthesis and assembly [].; PDB: 2XSK_A 2Y2T_A 2Y2Y_A.
Probab=32.66 E-value=32 Score=22.08 Aligned_cols=22 Identities=27% Similarity=0.456 Sum_probs=11.3
Q ss_pred ccceeEeccCCCE---EEEeecCCC
Q psy4512 4 ADTSQFQVEDGDV---ILLATDGVF 25 (101)
Q Consensus 4 ~d~~~~~l~~gD~---lvl~SDGl~ 25 (101)
.....+.+++||. +|..|||=.
T Consensus 68 L~rls~ni~p~D~v~I~VtvSDG~s 92 (106)
T PF10610_consen 68 LSRLSFNISPGDKVKIIVTVSDGKS 92 (106)
T ss_dssp ---EEEE--TT-EEEEEEEEE-SSS
T ss_pred cceEEEEeCCCCeEEEEEEEcCCCe
Confidence 3456788999996 477899943
No 53
>KOG3571|consensus
Probab=32.60 E-value=53 Score=26.96 Aligned_cols=34 Identities=26% Similarity=0.389 Sum_probs=29.5
Q ss_pred eEeccCCCEEEEeecCCCCCCChHHHHHHHHHhh
Q psy4512 8 QFQVEDGDVILLATDGVFDNVPDSLLLAELVRAQ 41 (101)
Q Consensus 8 ~~~l~~gD~lvl~SDGl~d~l~~~ei~~~l~~~~ 41 (101)
..++++||.||=.-|==++|+++.+.+..|++..
T Consensus 293 DGRIe~GDMiLQVNevsFENmSNd~AVrvLREaV 326 (626)
T KOG3571|consen 293 DGRIEPGDMILQVNEVSFENMSNDQAVRVLREAV 326 (626)
T ss_pred cCccCccceEEEeeecchhhcCchHHHHHHHHHh
Confidence 3578899999999998999999999999998763
No 54
>PHA02248 hypothetical protein
Probab=29.83 E-value=65 Score=22.32 Aligned_cols=29 Identities=17% Similarity=0.322 Sum_probs=20.8
Q ss_pred eccCCCEEEEeecCCCCCCChHHHHHHHH
Q psy4512 10 QVEDGDVILLATDGVFDNVPDSLLLAELV 38 (101)
Q Consensus 10 ~l~~gD~lvl~SDGl~d~l~~~ei~~~l~ 38 (101)
.|-.||++|..|||=.-.-+++-+..-+.
T Consensus 107 ~LP~GDiLI~~~DG~i~~P~neS~i~~V~ 135 (204)
T PHA02248 107 TLPQGDILIKFSDGSIYLPDNESVLDSVN 135 (204)
T ss_pred CCCCCcEEEEeeCCeEecCchHHHHHHhH
Confidence 45679999999999776556555554444
No 55
>PF08541 ACP_syn_III_C: 3-Oxoacyl-[acyl-carrier-protein (ACP)] synthase III C terminal ; InterPro: IPR013747 This domain is found on 3-Oxoacyl-[acyl-carrier-protein (ACP)] synthase III 2.3.1.41 from EC, the enzyme responsible for initiating the chain of reactions of the fatty acid synthase in plants and bacteria. ; GO: 0016747 transferase activity, transferring acyl groups other than amino-acyl groups, 0008610 lipid biosynthetic process; PDB: 3IL3_A 1ZOW_C 3GWE_B 3GWA_B 1UB7_B 3LED_B 2EBD_A 1HNJ_A 2EFT_B 1HN9_B ....
Probab=29.39 E-value=28 Score=20.57 Aligned_cols=16 Identities=31% Similarity=0.449 Sum_probs=12.9
Q ss_pred EeccCCCEEEEeecCC
Q psy4512 9 FQVEDGDVILLATDGV 24 (101)
Q Consensus 9 ~~l~~gD~lvl~SDGl 24 (101)
.++++||.+++++=|.
T Consensus 64 g~~~~Gd~vl~~~~G~ 79 (90)
T PF08541_consen 64 GRIKPGDRVLLVGFGA 79 (90)
T ss_dssp TSSCTTEEEEEEEEET
T ss_pred CCCCCCCEEEEEEEEh
Confidence 3688999999988664
No 56
>cd03703 aeIF5B_II aeIF5B_II: This family represents the domain II of archeal and eukaryotic aeIF5B. aeIF5B is a homologue of prokaryotic Initiation Factor 2 (IF2). Disruption of the eIF5B gene (FUN12) in yeast causes a severe slow-growth phenotype, associated with a defect in translation. eIF5B has a function analogous to prokaryotic IF2 in mediating the joining of joining of 60S subunits. The eIF5B consists of three N-terminal domains (I, II, II) connected by a long helix to domain IV. Domain I is a G domain, domain II and IV are beta-barrels and domain III has a novel alpha-beta-alpha sandwich fold. The G domain and the beta-barrel domain II display a similar structure and arrangement to the homologous domains of EF1A, eEF1A and aeIF2gamma.
Probab=29.36 E-value=34 Score=22.01 Aligned_cols=18 Identities=22% Similarity=0.451 Sum_probs=13.7
Q ss_pred ceeEeccCCCEEEEe-ecC
Q psy4512 6 TSQFQVEDGDVILLA-TDG 23 (101)
Q Consensus 6 ~~~~~l~~gD~lvl~-SDG 23 (101)
++.-.|+.||.|+++ ++|
T Consensus 22 l~~GtL~~GD~Iv~g~~~G 40 (110)
T cd03703 22 LYDGTLREGDTIVVCGLNG 40 (110)
T ss_pred EECCeEecCCEEEEccCCC
Confidence 445679999998885 576
No 57
>PHA02754 hypothetical protein; Provisional
Probab=28.41 E-value=39 Score=19.53 Aligned_cols=17 Identities=18% Similarity=0.542 Sum_probs=12.5
Q ss_pred cCCC-EEEEeecCCCCCC
Q psy4512 12 EDGD-VILLATDGVFDNV 28 (101)
Q Consensus 12 ~~gD-~lvl~SDGl~d~l 28 (101)
..|| ++++.+||+-=.+
T Consensus 42 ~SGdkIVVi~aD~I~i~l 59 (67)
T PHA02754 42 TSGDKIVVITADAIKIEL 59 (67)
T ss_pred ecCCEEEEEEcceEEEEE
Confidence 3688 6789999985444
No 58
>PF09992 DUF2233: Predicted periplasmic protein (DUF2233); InterPro: IPR018711 This entry contains proteins that catalyze the second step in the formation of the mannose 6-phosphate targeting signal on lysosomal enzyme oligosaccharides, this is achieved by removing GlcNAc residues from GlcNAc-alpha-P-mannose moieties, which are formed in the first step.; PDB: 3OHG_A.
Probab=27.95 E-value=75 Score=21.04 Aligned_cols=28 Identities=25% Similarity=0.330 Sum_probs=16.2
Q ss_pred cCCCEEEEeecCCC--CCCChHHHHHHHHHh
Q psy4512 12 EDGDVILLATDGVF--DNVPDSLLLAELVRA 40 (101)
Q Consensus 12 ~~gD~lvl~SDGl~--d~l~~~ei~~~l~~~ 40 (101)
+.|.++++++|| - ..++-.|+.+++++.
T Consensus 99 ~~g~l~l~~vdg-~~~~g~tl~ela~~l~~l 128 (170)
T PF09992_consen 99 ADGKLLLIVVDG-RQSAGMTLDELAQLLKSL 128 (170)
T ss_dssp TTSEEEEEEE-----S--B-HHHHHHHHHHH
T ss_pred CCCcEEEEEEcC-CcCCCCCHHHHHHHHHHc
Confidence 578889999998 2 235667777776553
No 59
>PRK01265 heat shock protein HtpX; Provisional
Probab=26.93 E-value=40 Score=25.64 Aligned_cols=25 Identities=8% Similarity=-0.120 Sum_probs=21.2
Q ss_pred CEEEEeecCCCCCCChHHHHHHHHH
Q psy4512 15 DVILLATDGVFDNVPDSLLLAELVR 39 (101)
Q Consensus 15 D~lvl~SDGl~d~l~~~ei~~~l~~ 39 (101)
..-|..|||+.+.++++|+..++..
T Consensus 122 ~~~Ivvt~gLl~~l~~~El~aVlAH 146 (324)
T PRK01265 122 GKRIAITLPLLKILNRDEIKAVAGH 146 (324)
T ss_pred CCEEEEehHHHhhCCHHHHHHHHHH
Confidence 3568999999999999999887764
No 60
>PRK11593 folB bifunctional dihydroneopterin aldolase/dihydroneopterin triphosphate 2'-epimerase; Provisional
Probab=26.73 E-value=1.7e+02 Score=18.43 Aligned_cols=43 Identities=14% Similarity=0.108 Sum_probs=29.3
Q ss_pred eecCCCCCCChHHHHHHHHHhhCCCCchhHHHHHHHHHHHHHH
Q psy4512 20 ATDGVFDNVPDSLLLAELVRAQGSKDPMQLQLVANTIALMART 62 (101)
Q Consensus 20 ~SDGl~d~l~~~ei~~~l~~~~~~~~~~~~~~~a~~Lv~~A~~ 62 (101)
.||-+-+.+.-..+.+.+............+.+|+.|.+....
T Consensus 43 ~~Ddl~~tidY~~v~~~I~~~~~~~~~~LlE~la~~ia~~i~~ 85 (119)
T PRK11593 43 KSDDVADCLSYADIAETVISHVEGARFALVERVAEEVAELLLA 85 (119)
T ss_pred cccCHhhccCHHHHHHHHHHHHhCCCcccHHHHHHHHHHHHHh
Confidence 3667777777777887777654323345678888888887763
No 61
>KOG2862|consensus
Probab=26.66 E-value=43 Score=25.97 Aligned_cols=20 Identities=25% Similarity=0.534 Sum_probs=16.6
Q ss_pred eccCCCEEEEeecCCCCCCC
Q psy4512 10 QVEDGDVILLATDGVFDNVP 29 (101)
Q Consensus 10 ~l~~gD~lvl~SDGl~d~l~ 29 (101)
-|++||.|++.+-|+|...+
T Consensus 88 ~lePgd~vLv~~~G~wg~ra 107 (385)
T KOG2862|consen 88 LLEPGDNVLVVSTGTWGQRA 107 (385)
T ss_pred hcCCCCeEEEEEechHHHHH
Confidence 36799999999999997544
No 62
>KOG0301|consensus
Probab=26.48 E-value=37 Score=28.64 Aligned_cols=20 Identities=20% Similarity=0.340 Sum_probs=16.6
Q ss_pred EeccCCCEEEEeecCCCCCC
Q psy4512 9 FQVEDGDVILLATDGVFDNV 28 (101)
Q Consensus 9 ~~l~~gD~lvl~SDGl~d~l 28 (101)
..|..||+++-||||+.-.+
T Consensus 266 ~~L~NgDIvvg~SDG~VrVf 285 (745)
T KOG0301|consen 266 KVLLNGDIVVGGSDGRVRVF 285 (745)
T ss_pred EEeeCCCEEEeccCceEEEE
Confidence 35779999999999998654
No 63
>COG1935 Uncharacterized conserved protein [Function unknown]
Probab=26.41 E-value=34 Score=22.45 Aligned_cols=18 Identities=22% Similarity=0.661 Sum_probs=14.5
Q ss_pred EeccCCCEEEEeecCCCC
Q psy4512 9 FQVEDGDVILLATDGVFD 26 (101)
Q Consensus 9 ~~l~~gD~lvl~SDGl~d 26 (101)
.+|++||.++|.|-++-|
T Consensus 37 ~rl~~GDlVFlT~~~~~D 54 (122)
T COG1935 37 LRLHEGDLVFLTSTSLED 54 (122)
T ss_pred hcCCCCCEEEEehhHhhH
Confidence 578999999999876644
No 64
>PF14604 SH3_9: Variant SH3 domain; PDB: 2CRE_A 2E5K_A 2CT3_A 2DE0_X 2D8H_A 2DA9_A 2X3X_E 2X3W_D 2KRN_A 2ED0_A ....
Probab=26.28 E-value=67 Score=17.06 Aligned_cols=15 Identities=27% Similarity=0.536 Sum_probs=9.7
Q ss_pred eEeccCCCEEEEeec
Q psy4512 8 QFQVEDGDVILLATD 22 (101)
Q Consensus 8 ~~~l~~gD~lvl~SD 22 (101)
++.++.||+|.+...
T Consensus 12 ELs~~~Gd~i~v~~~ 26 (49)
T PF14604_consen 12 ELSFKKGDVITVLEK 26 (49)
T ss_dssp B-EB-TTEEEEEEEE
T ss_pred EeeEcCCCEEEEEEe
Confidence 688999999766543
No 65
>PRK02268 hypothetical protein; Provisional
Probab=26.09 E-value=52 Score=22.14 Aligned_cols=12 Identities=25% Similarity=0.512 Sum_probs=10.6
Q ss_pred eccCCCEEEEee
Q psy4512 10 QVEDGDVILLAT 21 (101)
Q Consensus 10 ~l~~gD~lvl~S 21 (101)
++++||.||..|
T Consensus 35 RmkpGD~ivyYs 46 (141)
T PRK02268 35 RMKPGDWIIYYS 46 (141)
T ss_pred cCCCCCEEEEEe
Confidence 568999999988
No 66
>PF00595 PDZ: PDZ domain (Also known as DHR or GLGF) Coordinates are not yet available; InterPro: IPR001478 PDZ domains are found in diverse signalling proteins in bacteria, yeasts, plants, insects and vertebrates [, ]. PDZ domains can occur in one or multiple copies and are nearly always found in cytoplasmic proteins. They bind either the carboxyl-terminal sequences of proteins or internal peptide sequences []. In most cases, interaction between a PDZ domain and its target is constitutive, with a binding affinity of 1 to 10 microns. However, agonist-dependent activation of cell surface receptors is sometimes required to promote interaction with a PDZ protein. PDZ domain proteins are frequently associated with the plasma membrane, a compartment where high concentrations of phosphatidylinositol 4,5-bisphosphate (PIP2) are found. Direct interaction between PIP2 and a subset of class II PDZ domains (syntenin, CASK, Tiam-1) has been demonstrated. PDZ domains consist of 80 to 90 amino acids comprising six beta-strands (beta-A to beta-F) and two alpha-helices, A and B, compactly arranged in a globular structure. Peptide binding of the ligand takes place in an elongated surface groove as an anti-parallel beta-strand interacts with the beta-B strand and the B helix. The structure of PDZ domains allows binding to a free carboxylate group at the end of a peptide through a carboxylate-binding loop between the beta-A and beta-B strands.; GO: 0005515 protein binding; PDB: 3AXA_A 1WF8_A 1QAV_B 1QAU_A 1B8Q_A 1MC7_A 2KAW_A 1I16_A 1VB7_A 1WI4_A ....
Probab=25.88 E-value=86 Score=17.88 Aligned_cols=29 Identities=24% Similarity=0.306 Sum_probs=18.4
Q ss_pred ccCCCEEEEeecCC-CCCCChHHHHHHHHHh
Q psy4512 11 VEDGDVILLATDGV-FDNVPDSLLLAELVRA 40 (101)
Q Consensus 11 l~~gD~lvl~SDGl-~d~l~~~ei~~~l~~~ 40 (101)
|+.||.|+-. +|. ..+++..++..+++..
T Consensus 43 l~~GD~Il~I-Ng~~v~~~~~~~~~~~l~~~ 72 (81)
T PF00595_consen 43 LKVGDRILEI-NGQSVRGMSHDEVVQLLKSA 72 (81)
T ss_dssp SSTTEEEEEE-TTEESTTSBHHHHHHHHHHS
T ss_pred cchhhhhhee-CCEeCCCCCHHHHHHHHHCC
Confidence 8889976654 443 2355667777776653
No 67
>PF13052 DUF3913: Protein of unknown function (DUF3913)
Probab=25.44 E-value=29 Score=19.19 Aligned_cols=19 Identities=32% Similarity=0.758 Sum_probs=12.5
Q ss_pred eeEeccCCCEEEEeecCCCCCCCh
Q psy4512 7 SQFQVEDGDVILLATDGVFDNVPD 30 (101)
Q Consensus 7 ~~~~l~~gD~lvl~SDGl~d~l~~ 30 (101)
++..-|.+|.+ |+|||++.
T Consensus 5 yektaq~ddll-----giwdnvpt 23 (57)
T PF13052_consen 5 YEKTAQEDDLL-----GIWDNVPT 23 (57)
T ss_pred Eecccchhhhh-----hhhccCCC
Confidence 34445566654 99999864
No 68
>COG0150 PurM Phosphoribosylaminoimidazole (AIR) synthetase [Nucleotide transport and metabolism]
Probab=25.25 E-value=55 Score=25.31 Aligned_cols=22 Identities=41% Similarity=0.514 Sum_probs=18.0
Q ss_pred eEeccCCCEE-EEeecCCCCCCC
Q psy4512 8 QFQVEDGDVI-LLATDGVFDNVP 29 (101)
Q Consensus 8 ~~~l~~gD~l-vl~SDGl~d~l~ 29 (101)
--++++||++ -|.|-|++-|=+
T Consensus 169 g~~i~~GDviigl~SSG~HSNGy 191 (345)
T COG0150 169 GSKVKEGDVIIGLASSGLHSNGY 191 (345)
T ss_pred ccccCCCCEEEEecCCCcCCCch
Confidence 3468999985 689999998875
No 69
>PRK10413 hydrogenase 2 accessory protein HypG; Provisional
Probab=25.22 E-value=1.4e+02 Score=18.16 Aligned_cols=31 Identities=13% Similarity=-0.013 Sum_probs=19.1
Q ss_pred eccCCCEEEEeecCCCCCCChHHHHHHHHHh
Q psy4512 10 QVEDGDVILLATDGVFDNVPDSLLLAELVRA 40 (101)
Q Consensus 10 ~l~~gD~lvl~SDGl~d~l~~~ei~~~l~~~ 40 (101)
+++.||++++-.-=-...+++++..+.+..+
T Consensus 42 ~~~vGDyVLVHaGfAi~~ideeeA~etl~~l 72 (82)
T PRK10413 42 ADLLGQWVLVHVGFAMSIIDEDEAKATLDAL 72 (82)
T ss_pred ccccCCEEEEecchhhhhCCHHHHHHHHHHH
Confidence 3567999877663333556777666555443
No 70
>cd01480 vWA_collagen_alpha_1-VI-type VWA_collagen alpha(VI) type: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.
Probab=24.66 E-value=1.2e+02 Score=20.41 Aligned_cols=12 Identities=42% Similarity=0.614 Sum_probs=9.6
Q ss_pred CEEEEeecCCCC
Q psy4512 15 DVILLATDGVFD 26 (101)
Q Consensus 15 D~lvl~SDGl~d 26 (101)
-++||.|||..+
T Consensus 111 ~~iillTDG~~~ 122 (186)
T cd01480 111 KFLLVITDGHSD 122 (186)
T ss_pred eEEEEEeCCCcC
Confidence 368999999864
No 71
>TIGR00006 S-adenosyl-methyltransferase MraW. Genetics paper in 1972 links mra cluster to peptidoglycan biosynthesis in E. coli. Seems to be common in proteobacteria.wn.
Probab=23.82 E-value=1.4e+02 Score=22.53 Aligned_cols=52 Identities=19% Similarity=0.163 Sum_probs=30.4
Q ss_pred ceeEeccCCCE-EEEeecCCCC----------------CCChHHHHHHHHHhhCCCCchhHHHHHHHHHHHH
Q psy4512 6 TSQFQVEDGDV-ILLATDGVFD----------------NVPDSLLLAELVRAQGSKDPMQLQLVANTIALMA 60 (101)
Q Consensus 6 ~~~~~l~~gD~-lvl~SDGl~d----------------~l~~~ei~~~l~~~~~~~~~~~~~~~a~~Lv~~A 60 (101)
+++.+|....+ |=+-.||-.| +.+.+++.+++..+ ........+|+.|++.-
T Consensus 102 vSS~Qld~~~RGFSf~~dgpLDMRMd~~~~~tA~~~ln~~~e~~L~~i~~~y---GEe~~a~~IA~~Iv~~R 170 (305)
T TIGR00006 102 VSSPQLDDPERGFSFKHDGPLDMRMDQSQKLSAAEILNTYSEEDLERILKKY---GEEKFSKRIARAIVERR 170 (305)
T ss_pred CCHhhcCCCCCCCccCCCCCcccccCCCCCCCHHHHHhhCCHHHHHHHHHHh---cCcchHHHHHHHHHHHH
Confidence 34455554444 5555677666 44566777777765 12345667888887443
No 72
>PF01455 HupF_HypC: HupF/HypC family; InterPro: IPR001109 The large subunit of [NiFe]-hydrogenase, as well as other nickel metalloenzymes, is synthesised as a precursor devoid of the metalloenzyme active site. This precursor then undergoes a complex post-translational maturation process that requires a number of accessory proteins. The hydrogenase expression/formation proteins (HupF/HypC) form a family of small proteins that are hydrogenase precursor-specific chaperones required for this maturation process []. They are believed to keep the hydrogenase precursor in a conformation accessible for metal incorporation [, ].; PDB: 3D3R_A 2Z1C_C 2OT2_A.
Probab=23.48 E-value=58 Score=18.96 Aligned_cols=28 Identities=18% Similarity=0.235 Sum_probs=17.0
Q ss_pred ccCCCEEEEeecCCCCCCChHHHHHHHH
Q psy4512 11 VEDGDVILLATDGVFDNVPDSLLLAELV 38 (101)
Q Consensus 11 l~~gD~lvl~SDGl~d~l~~~ei~~~l~ 38 (101)
+++||++++--.=-...+++++..+.+.
T Consensus 38 v~~Gd~VLVHaG~Ai~~ideeeA~e~l~ 65 (68)
T PF01455_consen 38 VKVGDYVLVHAGFAIEKIDEEEAEETLD 65 (68)
T ss_dssp B-TT-EEEEETTEEEEEE-HHHHHHHHH
T ss_pred CCCCCEEEEecChhheeCCHHHHHHHHH
Confidence 6789988776665556667777666554
No 73
>COG0275 Predicted S-adenosylmethionine-dependent methyltransferase involved in cell envelope biogenesis [Cell envelope biogenesis, outer membrane]
Probab=23.02 E-value=2.2e+02 Score=21.84 Aligned_cols=52 Identities=17% Similarity=0.164 Sum_probs=33.1
Q ss_pred ceeEeccCCCE-EEEeecCCCC----------------CCChHHHHHHHHHhhCCCCchhHHHHHHHHHHHH
Q psy4512 6 TSQFQVEDGDV-ILLATDGVFD----------------NVPDSLLLAELVRAQGSKDPMQLQLVANTIALMA 60 (101)
Q Consensus 6 ~~~~~l~~gD~-lvl~SDGl~d----------------~l~~~ei~~~l~~~~~~~~~~~~~~~a~~Lv~~A 60 (101)
+++.+|...++ |=...||=.| +++.+++..++..+ .+...+..+|+.|++.-
T Consensus 106 VSS~QLD~~eRGFSf~~d~pLDMRMd~~~~lsA~evvN~~~e~~L~~I~~~y---GEEr~arrIA~aIv~~R 174 (314)
T COG0275 106 VSSPQLDDAERGFSFRKDGPLDMRMDQTQGLSAAEVVNTYSEEDLARIFKEY---GEERFAKRIARAIVERR 174 (314)
T ss_pred CCccccCCCcCCcccCCCCCcccCcCCCCCCCHHHHHhcCCHHHHHHHHHHh---ccHhhHHHHHHHHHHHh
Confidence 55666666655 6667777776 34556777777665 22345677888887644
No 74
>cd06919 Asp_decarbox Aspartate alpha-decarboxylase or L-aspartate 1-decarboxylase, a pyruvoyl group-dependent decarboxylase in beta-alanine production. Decarboxylation of aspartate is the major route of beta-alanine production in bacteria, and is catalyzed by the enzyme L-aspartate decarboxylase (ADC), EC:4.1.1.11 which requires a pyruvoyl group for its activity. The pyruvoyl cofactor is covalently bound to the enzyme. The protein is synthesized as a proenzyme and cleaved via self-processing at Gly23-Ser24 to yield an alpha chain (C-terminal fragment) and beta chain (N-terminal fragment), and the pyruvoyl group. Beta-alanine is required for the biosynthesis of pantothenate, in which the enzyme plays a critical regulatory role. The active site of the tetrameric enzyme is located at the interface of two subunits, with a Lysine and a Histidine from the beta chain of one subunit forming the active site with residues from the alpha chain of the adjacent subunit. This alignment
Probab=22.91 E-value=78 Score=20.51 Aligned_cols=17 Identities=18% Similarity=0.390 Sum_probs=13.9
Q ss_pred eccCCCEEEEeecCCCC
Q psy4512 10 QVEDGDVILLATDGVFD 26 (101)
Q Consensus 10 ~l~~gD~lvl~SDGl~d 26 (101)
..++||++|++|=+..+
T Consensus 77 ~~~~GD~vII~sy~~~~ 93 (111)
T cd06919 77 LGQPGDRVIIMAYALMD 93 (111)
T ss_pred cCCCCCEEEEEECccCC
Confidence 46799999999987754
No 75
>TIGR00253 RNA_bind_YhbY putative RNA-binding protein, YhbY family. A combination of crystal structure, molecular modeling, and bioinformatic data together suggest that members of this family, including YhbY of E. coli, are RNA binding proteins.
Probab=22.65 E-value=2e+02 Score=17.84 Aligned_cols=45 Identities=18% Similarity=0.293 Sum_probs=29.5
Q ss_pred EEEEeecCCCCCCChHHHHHHHHHhh------CCCCchhHHHHHHHHHHHHH
Q psy4512 16 VILLATDGVFDNVPDSLLLAELVRAQ------GSKDPMQLQLVANTIALMAR 61 (101)
Q Consensus 16 ~lvl~SDGl~d~l~~~ei~~~l~~~~------~~~~~~~~~~~a~~Lv~~A~ 61 (101)
++.++-+||+|++- +++...+.... ......+..++|+.|.+...
T Consensus 19 ~v~IGK~Glt~~vi-~ei~~aL~~hELIKVkvl~~~~~~~~e~a~~i~~~~~ 69 (95)
T TIGR00253 19 VVLVGKNGLTEGVI-KEIEQALEHRELIKVKVATEDREDKTLIAEALVKETG 69 (95)
T ss_pred eEEECCCCCCHHHH-HHHHHHHHhCCcEEEEecCCChhHHHHHHHHHHHHHC
Confidence 68899999998773 34555555432 12234567788888877663
No 76
>PF08148 DSHCT: DSHCT (NUC185) domain; InterPro: IPR012961 This C-terminal domain is found in DOB1/SK12/helY-like DEAD box helicases [].; GO: 0005524 ATP binding, 0016818 hydrolase activity, acting on acid anhydrides, in phosphorus-containing anhydrides; PDB: 4A4Z_A 2XGJ_B 3L9O_A.
Probab=22.63 E-value=82 Score=21.51 Aligned_cols=22 Identities=32% Similarity=0.291 Sum_probs=17.6
Q ss_pred eecCCCCCCChHHHHHHHHHhh
Q psy4512 20 ATDGVFDNVPDSLLLAELVRAQ 41 (101)
Q Consensus 20 ~SDGl~d~l~~~ei~~~l~~~~ 41 (101)
..+|+|+.+++.+++.+++...
T Consensus 29 l~~g~f~~L~p~elAa~lS~~v 50 (180)
T PF08148_consen 29 LFSGVFDDLDPAELAALLSCFV 50 (180)
T ss_dssp HHCTCCCCS-HHHHHHHHHHHC
T ss_pred HHcCCCCCCCHHHHHHHHHHhh
Confidence 3589999999999999987653
No 77
>cd03062 TRX_Fd_Sucrase TRX-like [2Fe-2S] Ferredoxin (Fd) family, Sucrase subfamily; composed of proteins with similarity to a novel plant enzyme, isolated from potato, which contains a Fd-like domain and exhibits sucrolytic activity. The putative active site of the Fd-like domain of the enzyme contains two cysteines and two histidines for possible binding to iron-sulfur clusters, compared to four cysteines present in the active site of Fd.
Probab=22.60 E-value=1.4e+02 Score=18.22 Aligned_cols=28 Identities=14% Similarity=0.232 Sum_probs=22.8
Q ss_pred CCEEEEee--cCCCCC-CChHHHHHHHHHhh
Q psy4512 14 GDVILLAT--DGVFDN-VPDSLLLAELVRAQ 41 (101)
Q Consensus 14 gD~lvl~S--DGl~d~-l~~~ei~~~l~~~~ 41 (101)
|=.+|+.. ||+|=. ++.+++..++.++.
T Consensus 53 gp~vvvyP~~~g~wy~~v~p~~v~~Iv~~hl 83 (97)
T cd03062 53 AGNVIIYPKGDGIWYGRVTPEHVPPIVDRLI 83 (97)
T ss_pred CCEEEEEeCCCeeEEeecCHHHHHHHHHHHh
Confidence 55788889 999974 68899999998764
No 78
>cd00136 PDZ PDZ domain, also called DHR (Dlg homologous region) or GLGF (after a conserved sequence motif). Many PDZ domains bind C-terminal polypeptides, though binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. Heterodimerization through PDZ-PDZ domain interactions adds to the domain's versatility, and PDZ domain-mediated interactions may be modulated dynamically through target phosphorylation. Some PDZ domains play a role in scaffolding supramolecular complexes. PDZ domains are found in diverse signaling proteins in bacteria, archebacteria, and eurkayotes. This CD contains two distinct structural subgroups with either a N- or C-terminal beta-strand forming the peptide-binding groove base. The circular permutation placing the strand on the N-terminus appears to be found in Eumetazoa only, while the C-terminal variant is found in all three kingdoms of life, and seems to co-occur with protease domains. PDZ domains have been named after PSD95(pos
Probab=22.56 E-value=1.4e+02 Score=16.10 Aligned_cols=30 Identities=27% Similarity=0.302 Sum_probs=16.5
Q ss_pred EeccCCCEEEEeecCCC-CCCChHHHHHHHHH
Q psy4512 9 FQVEDGDVILLATDGVF-DNVPDSLLLAELVR 39 (101)
Q Consensus 9 ~~l~~gD~lvl~SDGl~-d~l~~~ei~~~l~~ 39 (101)
..|++||+|+-. ||.- ...+.+++.++++.
T Consensus 29 ~gl~~GD~I~~I-ng~~v~~~~~~~~~~~l~~ 59 (70)
T cd00136 29 AGLQAGDVILAV-NGTDVKNLTLEDVAELLKK 59 (70)
T ss_pred cCCCCCCEEEEE-CCEECCCCCHHHHHHHHhh
Confidence 358899976543 5541 12222566666554
No 79
>PF00717 Peptidase_S24: Peptidase S24-like peptidase classification. ; InterPro: IPR019759 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ].; PDB: 1KCA_H 3BDN_A 1F39_A 1JHH_A 1JHE_B 3JSP_A 1JHF_B 1JHC_A 3JSO_B 1B12_D ....
Probab=21.46 E-value=82 Score=17.30 Aligned_cols=14 Identities=36% Similarity=0.885 Sum_probs=8.3
Q ss_pred eccCCCEEEEeecC
Q psy4512 10 QVEDGDVILLATDG 23 (101)
Q Consensus 10 ~l~~gD~lvl~SDG 23 (101)
.++.||++++-.+|
T Consensus 24 ~~~~gdivv~~~~~ 37 (70)
T PF00717_consen 24 EPKDGDIVVVKIDG 37 (70)
T ss_dssp ---TTSEEEEEETT
T ss_pred CCccCeEEEEEECC
Confidence 56678887777766
No 80
>cd05829 Sortase_E Sortase E (SrtE) is a membrane transpeptidase found in gram-positive bacteria that cleaves surface proteins at a cell sorting motif and catalyzes a transpeptidation reaction in which the surface protein substrate is covalently linked to peptidoglycan for display on the bacterial surface. Sortases are grouped into different classes and subfamilies based on sequence, membrane topology, genomic positioning, and cleavage site preference. The function of Sortase E is unknown. In two different sortase families, the N-terminus either functions as both a signal peptide for secretion and a stop-transfer signal for membrane anchoring, or it contains a signal peptide only and the C-terminus serves as a membrane anchor. Most gram-positive bacteria contain more than one sortase and it is thought that the different sortases anchor different surface protein classes. The sortase domain is a modified beta-barrel flanked by two (SrtA) or three (SrtB) short alpha-helices.
Probab=21.22 E-value=81 Score=20.76 Aligned_cols=16 Identities=25% Similarity=0.424 Sum_probs=13.3
Q ss_pred EeccCCCEEEEee-cCC
Q psy4512 9 FQVEDGDVILLAT-DGV 24 (101)
Q Consensus 9 ~~l~~gD~lvl~S-DGl 24 (101)
.+|++||.|.+.+ +|=
T Consensus 71 ~~l~~GD~I~v~~~~g~ 87 (144)
T cd05829 71 GDLRKGDKVEVTRADGQ 87 (144)
T ss_pred hcCCCCCEEEEEECCCC
Confidence 3688999999998 774
No 81
>PF13580 SIS_2: SIS domain; PDB: 1TK9_C 2I22_B 2I2W_A 1X92_A 3BJZ_D 2XBL_B 2X3Y_F 2YVA_B 3CVJ_D 3TRJ_D ....
Probab=20.61 E-value=60 Score=20.97 Aligned_cols=13 Identities=23% Similarity=0.736 Sum_probs=7.2
Q ss_pred eccCCCEEEEeec
Q psy4512 10 QVEDGDVILLATD 22 (101)
Q Consensus 10 ~l~~gD~lvl~SD 22 (101)
++++||++|..|-
T Consensus 100 ~~~~gDvli~iS~ 112 (138)
T PF13580_consen 100 DIRPGDVLIVISN 112 (138)
T ss_dssp T--TT-EEEEEES
T ss_pred CCCCCCEEEEECC
Confidence 4789999776664
No 82
>COG0361 InfA Translation initiation factor 1 (IF-1) [Translation, ribosomal structure and biogenesis]
Probab=20.55 E-value=96 Score=18.63 Aligned_cols=13 Identities=31% Similarity=0.721 Sum_probs=10.1
Q ss_pred eeEeccCCCEEEE
Q psy4512 7 SQFQVEDGDVILL 19 (101)
Q Consensus 7 ~~~~l~~gD~lvl 19 (101)
..+.+.+||++++
T Consensus 43 ~~i~I~~GD~V~V 55 (75)
T COG0361 43 NRIRILPGDVVLV 55 (75)
T ss_pred eeEEeCCCCEEEE
Confidence 3578899999765
No 83
>COG3552 CoxE Protein containing von Willebrand factor type A (vWA) domain [General function prediction only]
Probab=20.35 E-value=1.5e+02 Score=23.34 Aligned_cols=25 Identities=28% Similarity=0.421 Sum_probs=16.8
Q ss_pred EeccCCCEEEEeecCCCCCCChHHHH
Q psy4512 9 FQVEDGDVILLATDGVFDNVPDSLLL 34 (101)
Q Consensus 9 ~~l~~gD~lvl~SDGl~d~l~~~ei~ 34 (101)
..|..|-+||+.|||+ |+-...++.
T Consensus 307 ~~L~~gA~VlilsDg~-drd~~~~l~ 331 (395)
T COG3552 307 NVLSGGAVVLILSDGL-DRDDIPELV 331 (395)
T ss_pred cccCCceEEEEEeccc-ccCCchHHH
Confidence 3567788999999997 443333333
No 84
>PF13768 VWA_3: von Willebrand factor type A domain
Probab=20.14 E-value=1.5e+02 Score=18.90 Aligned_cols=25 Identities=32% Similarity=0.335 Sum_probs=17.2
Q ss_pred CEEEEeecCCCCCCChHHHHHHHHHh
Q psy4512 15 DVILLATDGVFDNVPDSLLLAELVRA 40 (101)
Q Consensus 15 D~lvl~SDGl~d~l~~~ei~~~l~~~ 40 (101)
..|++.|||..+. ..+++.+.++..
T Consensus 98 ~~IilltDG~~~~-~~~~i~~~v~~~ 122 (155)
T PF13768_consen 98 RAIILLTDGQPVS-GEEEILDLVRRA 122 (155)
T ss_pred cEEEEEEeccCCC-CHHHHHHHHHhc
Confidence 4589999999622 446777777653
No 85
>cd01461 vWA_interalpha_trypsin_inhibitor vWA_interalpha trypsin inhibitor (ITI): ITI is a glycoprotein composed of three polypeptides- two heavy chains and one light chain (bikunin). Bikunin confers the protease-inhibitor function while the heavy chains are involved in rendering stability to the extracellular matrix by binding to hyaluronic acid. The heavy chains carry the VWA domain with a conserved MIDAS motif. Although the exact role of the VWA domains remains unknown, it has been speculated to be involved in mediating protein-protein interactions with the components of the extracellular matrix.
Probab=20.09 E-value=1.8e+02 Score=18.56 Aligned_cols=23 Identities=35% Similarity=0.494 Sum_probs=15.2
Q ss_pred CEEEEeecCCCCCCChHHHHHHHHH
Q psy4512 15 DVILLATDGVFDNVPDSLLLAELVR 39 (101)
Q Consensus 15 D~lvl~SDGl~d~l~~~ei~~~l~~ 39 (101)
..+|+.|||..+. ..++.+.++.
T Consensus 101 ~~iillTDG~~~~--~~~~~~~~~~ 123 (171)
T cd01461 101 PQIILLTDGEVTN--ESQILKNVRE 123 (171)
T ss_pred cEEEEEeCCCCCC--HHHHHHHHHH
Confidence 5799999998432 3455555554
No 86
>PRK11479 hypothetical protein; Provisional
Probab=20.01 E-value=77 Score=23.67 Aligned_cols=17 Identities=18% Similarity=0.427 Sum_probs=12.6
Q ss_pred eeEeccCCCEEEEeecC
Q psy4512 7 SQFQVEDGDVILLATDG 23 (101)
Q Consensus 7 ~~~~l~~gD~lvl~SDG 23 (101)
..-.|++||+|+..|.+
T Consensus 61 s~~~LqpGDLVFfst~t 77 (274)
T PRK11479 61 TAPDLKPGDLLFSSSLG 77 (274)
T ss_pred ChhhCCCCCEEEEecCC
Confidence 34578999999876653
Done!