Query psy4833
Match_columns 141
No_of_seqs 132 out of 414
Neff 4.2
Searched_HMMs 46136
Date Fri Aug 16 18:55:52 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy4833.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4833hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4659|consensus 99.8 8.9E-19 1.9E-23 169.5 11.1 126 2-129 45-182 (1899)
2 PF13715 DUF4480: Domain of un 99.7 1.6E-16 3.4E-21 108.7 10.4 79 10-93 1-80 (88)
3 PF13620 CarboxypepD_reg: Carb 99.5 7.1E-13 1.5E-17 88.9 9.6 79 10-90 1-82 (82)
4 cd03863 M14_CPD_II The second 99.4 1.6E-12 3.5E-17 112.9 9.0 77 8-89 296-373 (375)
5 cd06245 M14_CPD_III The third 99.3 3.6E-11 7.9E-16 103.9 10.1 76 8-89 286-361 (363)
6 cd03858 M14_CP_N-E_like Carbox 99.2 6.5E-11 1.4E-15 101.2 9.2 72 9-84 298-370 (374)
7 cd03864 M14_CPN Peptidase M14 99.1 2.5E-10 5.4E-15 99.9 9.6 71 9-84 316-387 (392)
8 cd03867 M14_CPZ Peptidase M14- 99.1 3.2E-10 6.9E-15 98.8 9.3 66 9-78 318-383 (395)
9 cd03868 M14_CPD_I The first ca 99.1 4.5E-10 9.8E-15 96.6 8.2 73 8-84 295-368 (372)
10 cd03865 M14_CPE_H Peptidase M1 99.0 6.1E-10 1.3E-14 98.1 8.0 71 10-84 327-397 (402)
11 cd03866 M14_CPM Peptidase M14 98.9 4.8E-09 1E-13 91.1 8.8 69 9-79 295-363 (376)
12 PF08400 phage_tail_N: Prophag 98.1 1.9E-05 4.2E-10 61.0 9.0 74 8-94 2-82 (134)
13 cd03869 M14_CPX_like Peptidase 98.0 2E-05 4.4E-10 69.9 7.2 67 9-79 329-395 (405)
14 PF07210 DUF1416: Protein of u 97.7 0.00037 8.1E-09 50.5 9.1 67 3-71 2-70 (85)
15 KOG2649|consensus 97.6 0.00024 5.3E-09 64.7 8.5 89 9-102 378-466 (500)
16 PRK15036 hydroxyisourate hydro 97.6 0.00012 2.7E-09 56.3 4.9 61 9-69 27-96 (137)
17 cd00421 intradiol_dioxygenase 97.4 0.0016 3.4E-08 49.8 9.4 74 5-78 8-107 (146)
18 cd03461 1,2-HQD Hydroxyquinol 97.4 0.002 4.4E-08 54.8 10.2 75 5-80 117-232 (277)
19 TIGR02423 protocat_alph protoc 97.3 0.0031 6.8E-08 51.0 10.5 77 6-82 37-146 (193)
20 cd03458 Catechol_intradiol_dio 97.2 0.0037 8E-08 52.8 10.0 76 5-80 101-216 (256)
21 TIGR02438 catachol_actin catec 97.2 0.0032 6.8E-08 53.9 9.7 78 4-81 128-245 (281)
22 PF00775 Dioxygenase_C: Dioxyg 97.2 0.0061 1.3E-07 48.7 10.5 48 4-51 25-89 (183)
23 cd03462 1,2-CCD chlorocatechol 97.1 0.0045 9.8E-08 52.0 9.8 77 5-81 96-212 (247)
24 cd03459 3,4-PCD Protocatechuat 97.1 0.0066 1.4E-07 47.6 9.9 74 5-78 12-118 (158)
25 TIGR02439 catechol_proteo cate 97.1 0.0056 1.2E-07 52.4 10.2 79 5-83 125-243 (285)
26 PF02369 Big_1: Bacterial Ig-l 97.1 0.0047 1E-07 44.2 8.3 70 4-73 20-97 (100)
27 TIGR02465 chlorocat_1_2 chloro 97.1 0.0064 1.4E-07 51.0 10.1 77 6-82 96-212 (246)
28 TIGR02422 protocat_beta protoc 97.1 0.006 1.3E-07 50.4 9.7 75 7-81 59-168 (220)
29 cd03460 1,2-CTD Catechol 1,2 d 97.0 0.0076 1.7E-07 51.6 10.2 79 5-83 121-239 (282)
30 cd03464 3,4-PCD_beta Protocate 96.9 0.01 2.3E-07 49.0 10.1 75 6-80 63-172 (220)
31 cd03463 3,4-PCD_alpha Protocat 96.9 0.011 2.3E-07 47.6 9.9 73 6-78 34-138 (185)
32 smart00634 BID_1 Bacterial Ig- 96.9 0.01 2.2E-07 41.5 8.1 69 4-72 15-90 (92)
33 PF10670 DUF4198: Domain of un 96.5 0.012 2.5E-07 45.3 7.0 54 10-64 152-211 (215)
34 COG3485 PcaH Protocatechuate 3 96.5 0.0064 1.4E-07 50.6 5.8 73 7-79 71-177 (226)
35 KOG1948|consensus 96.4 0.014 3E-07 57.2 8.1 64 9-76 316-383 (1165)
36 PRK15310 fimbrial outer membra 95.4 0.056 1.2E-06 52.6 7.7 78 8-92 785-863 (895)
37 PF11974 MG1: Alpha-2-macroglo 95.3 0.024 5.3E-07 40.7 3.7 39 15-53 19-60 (97)
38 PF01190 Pollen_Ole_e_I: Polle 95.2 0.044 9.6E-07 38.6 4.7 36 20-55 18-58 (97)
39 cd03457 intradiol_dioxygenase_ 94.7 0.1 2.2E-06 42.0 6.1 47 5-51 23-92 (188)
40 COG5266 CbiK ABC-type Co2+ tra 92.4 0.22 4.8E-06 42.6 4.7 42 10-53 173-229 (264)
41 PF12985 DUF3869: Domain of un 92.2 0.75 1.6E-05 34.5 6.7 56 7-77 20-77 (104)
42 PF14686 fn3_3: Polysaccharide 91.5 1.9 4.1E-05 31.1 8.0 49 10-60 4-64 (95)
43 PF08308 PEGA: PEGA domain; I 90.8 0.99 2.1E-05 29.6 5.6 52 24-84 11-63 (71)
44 TIGR02962 hdxy_isourate hydrox 90.5 0.43 9.2E-06 35.7 4.0 59 11-69 3-71 (112)
45 PF01060 DUF290: Transthyretin 90.1 0.56 1.2E-05 32.3 4.0 43 12-55 1-48 (80)
46 PF05738 Cna_B: Cna protein B- 89.3 3 6.6E-05 26.9 6.9 47 23-70 1-53 (70)
47 cd05822 TLP_HIUase HIUase (5-h 87.7 1 2.3E-05 33.6 4.3 58 11-68 3-70 (112)
48 KOG1948|consensus 84.3 2.3 4.9E-05 42.4 5.8 48 9-56 119-167 (1165)
49 PF10577 UPF0560: Uncharacteri 82.4 11 0.00023 37.1 9.4 85 10-94 2-105 (807)
50 PF12866 DUF3823: Protein of u 80.5 10 0.00022 31.3 7.6 29 2-31 16-49 (222)
51 cd05469 Transthyretin_like Tra 78.9 2.2 4.9E-05 32.1 2.9 57 11-67 3-69 (113)
52 cd05821 TLP_Transthyretin Tran 78.6 2.7 5.8E-05 32.1 3.3 60 9-68 7-76 (121)
53 PF00576 Transthyretin: HIUase 77.2 2.6 5.5E-05 31.5 2.8 39 11-49 3-47 (112)
54 smart00095 TR_THY Transthyreti 76.4 4.7 0.0001 30.7 4.1 58 10-67 5-72 (121)
55 PF03785 Peptidase_C25_C: Pept 75.9 15 0.00033 26.5 6.4 51 24-75 26-80 (81)
56 PF09430 DUF2012: Protein of u 73.1 23 0.00049 26.2 7.0 54 23-77 7-61 (123)
57 PF13953 PapC_C: PapC C-termin 68.4 32 0.00069 22.8 7.3 47 14-64 2-49 (68)
58 COG2351 Transthyretin-like pro 62.8 18 0.0004 28.0 4.8 56 10-65 10-75 (124)
59 KOG3006|consensus 57.0 65 0.0014 25.2 6.9 54 10-63 22-85 (132)
60 PF14054 DUF4249: Domain of un 56.4 1E+02 0.0023 24.8 10.5 66 22-90 59-132 (298)
61 PF05688 DUF824: Salmonella re 52.4 23 0.00051 22.9 3.3 26 8-33 14-39 (47)
62 COG1470 Predicted membrane pro 51.3 85 0.0018 29.5 7.8 77 8-90 186-263 (513)
63 PRK15314 outer membrane protei 46.7 49 0.0011 36.1 6.1 24 10-33 1609-1632(2435)
64 PF09244 DUF1964: Domain of un 45.5 46 0.001 23.3 4.0 27 44-72 2-28 (68)
65 PF13115 YtkA: YtkA-like 42.9 30 0.00065 23.1 2.9 22 8-32 23-44 (86)
66 PF03717 PBP_dimer: Penicillin 42.2 15 0.00033 27.7 1.4 17 10-26 4-20 (178)
67 PF11184 DUF2969: Protein of u 41.3 93 0.002 21.5 5.1 39 14-55 11-49 (71)
68 PRK15207 long polar fimbrial o 38.2 1E+02 0.0022 30.1 6.5 48 3-51 751-799 (842)
69 PF07492 Trehalase_Ca-bi: Neut 38.0 35 0.00076 20.5 2.2 18 43-60 9-26 (30)
70 PF06115 DUF956: Domain of unk 37.5 68 0.0015 24.6 4.3 33 22-56 54-88 (118)
71 COG3188 FimD P pilus assembly 35.1 1.6E+02 0.0035 28.9 7.4 49 3-53 747-796 (835)
72 PF03983 SHD1: SLA1 homology d 32.9 31 0.00067 24.1 1.7 43 40-92 14-56 (70)
73 KOG4217|consensus 32.2 15 0.00032 34.6 -0.1 15 113-134 277-291 (605)
74 PRK15248 fimbrial outer membra 31.4 1.6E+02 0.0034 28.8 6.7 47 2-51 762-809 (853)
75 COG3422 Uncharacterized conser 31.1 58 0.0013 22.3 2.7 28 42-71 9-37 (59)
76 PF14289 DUF4369: Domain of un 31.0 1.7E+02 0.0036 19.6 10.1 44 7-53 11-55 (106)
77 PF13753 SWM_repeat: Putative 30.3 88 0.0019 26.0 4.2 44 6-55 264-308 (317)
78 COG4594 FecB ABC-type Fe3+-cit 29.5 34 0.00074 30.1 1.7 33 38-70 33-65 (310)
79 PRK15213 fimbrial outer membra 29.4 2.1E+02 0.0044 27.6 7.0 51 3-56 718-770 (797)
80 PRK07121 hypothetical protein; 28.5 28 0.00061 30.7 1.1 15 13-27 439-453 (492)
81 TIGR02485 CobZ_N-term precorri 28.2 27 0.00058 30.1 0.8 16 11-26 375-390 (432)
82 PF06030 DUF916: Bacterial pro 27.2 1.4E+02 0.003 22.2 4.4 38 42-79 12-50 (121)
83 PRK15235 outer membrane fimbri 27.0 1.6E+02 0.0035 28.5 5.9 28 4-31 723-751 (814)
84 PRK15223 pilin outer membrane 24.8 5E+02 0.011 25.3 8.7 58 3-64 749-808 (836)
85 PRK12837 3-ketosteroid-delta-1 24.6 39 0.00084 30.3 1.2 16 11-26 457-472 (513)
86 PF03404 Mo-co_dimer: Mo-co ox 24.3 3.1E+02 0.0067 20.6 8.8 40 7-46 27-66 (131)
87 PRK08274 tricarballylate dehyd 24.2 35 0.00075 29.6 0.8 16 11-26 406-421 (466)
88 PRK15316 RatA-like protein; Pr 22.4 2.2E+02 0.0048 31.7 6.1 55 6-69 2160-2250(2683)
89 PRK15298 fimbrial outer membra 22.3 3.1E+02 0.0067 26.8 6.9 45 6-51 763-808 (848)
90 TIGR01813 flavo_cyto_c flavocy 22.0 45 0.00097 28.6 1.1 17 11-27 391-407 (439)
91 PRK06481 fumarate reductase fl 21.1 47 0.001 29.7 1.0 16 11-26 449-464 (506)
92 PF09624 DUF2393: Protein of u 20.9 1.8E+02 0.0039 21.6 4.1 26 6-31 61-86 (149)
93 PRK15273 putative fimbrial out 20.9 2.5E+02 0.0055 27.6 6.0 36 13-51 746-782 (881)
94 PRK12834 putative FAD-binding 20.9 43 0.00094 30.2 0.8 16 11-26 492-507 (549)
95 COG3549 HigB Plasmid maintenan 20.7 1.4E+02 0.0031 22.1 3.3 23 43-67 60-82 (94)
96 PF00207 A2M: Alpha-2-macroglo 20.7 2.6E+02 0.0057 19.0 4.6 51 40-92 8-61 (92)
97 COG0196 RibF FAD synthase [Coe 20.3 1.6E+02 0.0035 25.6 4.1 57 7-70 184-244 (304)
98 KOG1691|consensus 20.3 5.2E+02 0.011 21.7 8.2 72 10-84 44-124 (210)
99 PRK15304 putative fimbrial out 20.1 2.9E+02 0.0064 26.7 6.2 46 3-51 712-758 (801)
No 1
>KOG4659|consensus
Probab=99.78 E-value=8.9e-19 Score=169.52 Aligned_cols=126 Identities=36% Similarity=0.549 Sum_probs=114.5
Q ss_pred CcCCcceEEEEEEEECCCCCeeeeEEEEeccCCCeeeceEecCCceEEEEecCCCcEEEEEEecCccceEEEEEccCccE
Q psy4833 2 TDDELRVAVVRGQVLSVQGLGITGIRVSVDKDSGRFGFTLTRGGGWFDMLVNGGGAVTLQFQRSPYRPQTRTVFIPWNQI 81 (141)
Q Consensus 2 ~~d~~r~~~IrG~V~D~~GePL~GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~~g~~~tL~FSyiGY~sqe~~V~~~~n~~ 81 (141)
+||++|.++|||+|+-.+|.||.||+|+...++ . ++|+|+.||.|+|.|++|.++||+|-+..|.+|+++|++|||++
T Consensus 45 ~fne~~~~vIrgrvv~~~~~pLVGVrVS~~~~~-~-yfTlTR~DG~FDL~vnGg~svtLqF~R~pF~~qkr~v~vpwnq~ 122 (1899)
T KOG4659|consen 45 QFNENRISVIRGRVVWGGGVPLVGVRVSDAAHP-L-YFTLTREDGYFDLTVNGGRSVTLQFLRTPFQSQKRSVFVPWNQI 122 (1899)
T ss_pred hhccccceEEeccEeecCCcceEEEEeeccccc-c-eEEEEecCceEEEEEcccceEEEEEccCCCcccceeEEeChhhE
Confidence 589999999999999999999999999999999 7 49999999999999999999999999999999999999999999
Q ss_pred EEeCCeEEEecCCC------------ccceeccccccCcCCCCCcccccccCCCCCCcce
Q psy4833 82 VVLPPVKMQLSDAS------------DIFREVASGVTPSYAPKGNSDLQHYGPRSSNKTV 129 (141)
Q Consensus 82 ~~~~~v~m~~~d~~------------~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~ 129 (141)
+++++|+|+..+.. ..+..|.|.|.|+++..|-.+|-+.+++--..-|
T Consensus 123 i~id~vvm~~~~~~~~~~~~~~~C~~~~~~~pdpvvissw~~tf~ga~pdr~~Iv~esqv 182 (1899)
T KOG4659|consen 123 IHIDDVVMYRQEGGSPPAPARAKCSPTLRRIPDPVVISSWQYTFDGACPDRTSIVVESQV 182 (1899)
T ss_pred EEEEeEEEEeecCCCCCCCCCCccChhhccCCCcEEecCcccccCCcCCCCcccccchhh
Confidence 99999999977653 2345678899999999999999999997654433
No 2
>PF13715 DUF4480: Domain of unknown function (DUF4480)
Probab=99.70 E-value=1.6e-16 Score=108.74 Aligned_cols=79 Identities=23% Similarity=0.376 Sum_probs=68.2
Q ss_pred EEEEEEECCC-CCeeeeEEEEeccCCCeeeceEecCCceEEEEecCCCcEEEEEEecCccceEEEEEccCccEEEeCCeE
Q psy4833 10 VVRGQVLSVQ-GLGITGIRVSVDKDSGRFGFTLTRGGGWFDMLVNGGGAVTLQFQRSPYRPQTRTVFIPWNQIVVLPPVK 88 (141)
Q Consensus 10 ~IrG~V~D~~-GePL~GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~~g~~~tL~FSyiGY~sqe~~V~~~~n~~~~~~~v~ 88 (141)
+|+|+|+|++ |+||+||+|.+++.. . + ++||.+|+|.|.++.+. .+|.||++||+++++.+....+....+ .+.
T Consensus 1 ti~G~V~d~~t~~pl~~a~V~~~~~~-~-~-~~Td~~G~F~i~~~~g~-~~l~is~~Gy~~~~~~i~~~~~~~~~~-~i~ 75 (88)
T PF13715_consen 1 TISGKVVDSDTGEPLPGATVYLKNTK-K-G-TVTDENGRFSIKLPEGD-YTLKISYIGYETKTITISVNSNKNTNL-NIY 75 (88)
T ss_pred CEEEEEEECCCCCCccCeEEEEeCCc-c-e-EEECCCeEEEEEEcCCC-eEEEEEEeCEEEEEEEEEecCCCEEEE-EEE
Confidence 5899999987 999999999999987 4 4 99999999999988665 579999999999999999987765555 688
Q ss_pred EEecC
Q psy4833 89 MQLSD 93 (141)
Q Consensus 89 m~~~d 93 (141)
|.+..
T Consensus 76 L~~~~ 80 (88)
T PF13715_consen 76 LEPKS 80 (88)
T ss_pred EeeCc
Confidence 87653
No 3
>PF13620 CarboxypepD_reg: Carboxypeptidase regulatory-like domain; PDB: 3MN8_D 3P0D_I 3KCP_A 2B59_B 1UWY_A 1H8L_A 1QMU_A 2NSM_A.
Probab=99.46 E-value=7.1e-13 Score=88.90 Aligned_cols=79 Identities=24% Similarity=0.409 Sum_probs=58.1
Q ss_pred EEEEEEECCCCCeeeeEEEEeccCCC-eeeceEecCCceEEEE-ecCCCcEEEEEEecCccceEE-EEEccCccEEEeCC
Q psy4833 10 VVRGQVLSVQGLGITGIRVSVDKDSG-RFGFTLTRGGGWFDML-VNGGGAVTLQFQRSPYRPQTR-TVFIPWNQIVVLPP 86 (141)
Q Consensus 10 ~IrG~V~D~~GePL~GVsV~Vkgt~~-~~G~TiTD~DG~FsL~-V~~g~~~tL~FSyiGY~sqe~-~V~~~~n~~~~~~~ 86 (141)
+|+|+|+|++|+||+||+|.+....+ ....+.||.+|+|.|. ++.| .++|.+++.||.+... .+.+..++...+ +
T Consensus 1 tI~G~V~d~~g~pv~~a~V~l~~~~~~~~~~~~Td~~G~f~~~~l~~g-~Y~l~v~~~g~~~~~~~~v~v~~~~~~~~-~ 78 (82)
T PF13620_consen 1 TISGTVTDATGQPVPGATVTLTDQDGGTVYTTTTDSDGRFSFEGLPPG-TYTLRVSAPGYQPQTQENVTVTAGQTTTV-D 78 (82)
T ss_dssp -EEEEEEETTSCBHTT-EEEET--TTTECCEEE--TTSEEEEEEE-SE-EEEEEEEBTTEE-EEEEEEEESSSSEEE---
T ss_pred CEEEEEEcCCCCCcCCEEEEEEEeeCCCEEEEEECCCceEEEEccCCE-eEEEEEEECCcceEEEEEEEEeCCCEEEE-E
Confidence 58999999999999999999983211 1234999999999998 7765 5689999999999998 699998887776 7
Q ss_pred eEEE
Q psy4833 87 VKMQ 90 (141)
Q Consensus 87 v~m~ 90 (141)
+.|.
T Consensus 79 i~L~ 82 (82)
T PF13620_consen 79 ITLE 82 (82)
T ss_dssp EEEE
T ss_pred EEEC
Confidence 7764
No 4
>cd03863 M14_CPD_II The second carboxypeptidase (CP)-like domain of Carboxypeptidase D (CPD; EC 3.4.17.22), domain II. CPD differs from all other metallocarboxypeptidases in that it contains multiple CP-like domains. CPD belongs to the N/E-like subfamily of the M14 family of metallocarboxypeptidases (MCPs).The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPD is a single-chain protein containing a signal peptide, three tandem repeats of CP-like domains separated by short bridge regions, followed by a transmembrane domain, and a C-terminal cytosolic tail. The first two CP-like domains of CPD contain all of the essential active site and substrate-binding residues, while the third CP-like domain lacks critical residues necessary for enzymatic activity and is inactive towards standard CP substrates. Domain I is optimally ac
Probab=99.38 E-value=1.6e-12 Score=112.91 Aligned_cols=77 Identities=26% Similarity=0.260 Sum_probs=66.0
Q ss_pred eEEEEEEEECC-CCCeeeeEEEEeccCCCeeeceEecCCceEEEEecCCCcEEEEEEecCccceEEEEEccCccEEEeCC
Q psy4833 8 VAVVRGQVLSV-QGLGITGIRVSVDKDSGRFGFTLTRGGGWFDMLVNGGGAVTLQFQRSPYRPQTRTVFIPWNQIVVLPP 86 (141)
Q Consensus 8 ~~~IrG~V~D~-~GePL~GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~~g~~~tL~FSyiGY~sqe~~V~~~~n~~~~~~~ 86 (141)
...|+|+|+|+ +|+||+||+|+|+|.. . | ++||.||+|.+.++.|. .+|++|++||+++++.|.+..++...+ +
T Consensus 296 ~~gI~G~V~D~~~g~pl~~AtV~V~g~~-~-~-~~Td~~G~f~~~l~pG~-ytl~vs~~GY~~~~~~v~V~~~~~~~~-~ 370 (375)
T cd03863 296 HRGVRGFVLDATDGRGILNATISVADIN-H-P-VTTYKDGDYWRLLVPGT-YKVTASARGYDPVTKTVEVDSKGAVQV-N 370 (375)
T ss_pred cCeEEEEEEeCCCCCCCCCeEEEEecCc-C-c-eEECCCccEEEccCCee-EEEEEEEcCcccEEEEEEEcCCCcEEE-E
Confidence 45799999996 7999999999999987 4 5 99999999999988775 689999999999999999987766554 4
Q ss_pred eEE
Q psy4833 87 VKM 89 (141)
Q Consensus 87 v~m 89 (141)
+.|
T Consensus 371 ~~L 373 (375)
T cd03863 371 FTL 373 (375)
T ss_pred EEe
Confidence 444
No 5
>cd06245 M14_CPD_III The third carboxypeptidase (CP)-like domain of Carboxypeptidase D (CPD; EC 3.4.17.22), domain III. CPD differs from all other metallocarboxypeptidases in that it contains multiple CP-like domains. CPD belongs to the N/E-like subfamily of the M14 family of metallocarboxypeptidases (MCPs).The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPD is a single-chain protein containing a signal peptide, three tandem repeats of CP-like domains separated by short bridge regions, followed by a transmembrane domain, and a C-terminal cytosolic tail. The first two CP-like domains of CPD contain all of the essential active site and substrate-binding residues, the third CP-like domain lacks critical residues necessary for enzymatic activity and is inactive towards standard CP substrates. Domain I is optimally active a
Probab=99.25 E-value=3.6e-11 Score=103.94 Aligned_cols=76 Identities=18% Similarity=0.359 Sum_probs=66.3
Q ss_pred eEEEEEEEECCCCCeeeeEEEEeccCCCeeeceEecCCceEEEEecCCCcEEEEEEecCccceEEEEEccCccEEEeCCe
Q psy4833 8 VAVVRGQVLSVQGLGITGIRVSVDKDSGRFGFTLTRGGGWFDMLVNGGGAVTLQFQRSPYRPQTRTVFIPWNQIVVLPPV 87 (141)
Q Consensus 8 ~~~IrG~V~D~~GePL~GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~~g~~~tL~FSyiGY~sqe~~V~~~~n~~~~~~~v 87 (141)
-..|+|+|+|.+|+||+||+|.|+|.. + ++||.+|.|.+.++.|. .+|.+|++||++++++|.+..++...+ ++
T Consensus 286 ~~gI~G~V~d~~g~pi~~A~V~v~g~~---~-~~T~~~G~y~~~L~pG~-y~v~vs~~Gy~~~~~~V~v~~~~~~~~-~f 359 (363)
T cd06245 286 HKGVHGVVTDKAGKPISGATIVLNGGH---R-VYTKEGGYFHVLLAPGQ-HNINVIAEGYQQEHLPVVVSHDEASSV-KI 359 (363)
T ss_pred CcEEEEEEEcCCCCCccceEEEEeCCC---c-eEeCCCcEEEEecCCce-EEEEEEEeCceeEEEEEEEcCCCeEEE-EE
Confidence 468999999999999999999999864 4 89999999999987665 589999999999999999988777665 55
Q ss_pred EE
Q psy4833 88 KM 89 (141)
Q Consensus 88 ~m 89 (141)
.|
T Consensus 360 ~L 361 (363)
T cd06245 360 VL 361 (363)
T ss_pred Ee
Confidence 55
No 6
>cd03858 M14_CP_N-E_like Carboxypeptidase (CP) N/E-like subfamily of the M14 family of metallocarboxypeptidases (MCPs). The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. The N/E subfamily includes eight members, of which five (CPN, CPE, CPM, CPD, CPZ) are considered enzymatically active, while the other three are non-active (CPX1, PCX2, ACLP/AEBP1) and lack the critical active site and substrate-binding residues considered necessary for CP activity. These non-active members may function as binding proteins or display catalytic activity towards other substrates. Unlike the A/B CP subfamily, enzymes belonging to the N/E subfamily are not produced as inactive precursors that require proteolysis to produce the active form; rather, they rely on their substrate specificity and subcellular compartmentalization to prevent inappr
Probab=99.21 E-value=6.5e-11 Score=101.22 Aligned_cols=72 Identities=26% Similarity=0.350 Sum_probs=62.3
Q ss_pred EEEEEEEECCCCCeeeeEEEEeccCCCeeeceEecCCceEEEEecCCCcEEEEEEecCccceEEEEEccC-ccEEEe
Q psy4833 9 AVVRGQVLSVQGLGITGIRVSVDKDSGRFGFTLTRGGGWFDMLVNGGGAVTLQFQRSPYRPQTRTVFIPW-NQIVVL 84 (141)
Q Consensus 9 ~~IrG~V~D~~GePL~GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~~g~~~tL~FSyiGY~sqe~~V~~~~-n~~~~~ 84 (141)
..|+|+|+|.+|+||+||+|.++|.. . + ++||.+|+|.+.++.| ..+|.+|++||+++++++.+.. ++...+
T Consensus 298 ~~i~G~V~d~~g~pl~~A~V~i~~~~-~-~-~~Td~~G~f~~~l~~G-~y~l~vs~~Gy~~~~~~v~v~~~g~~~~~ 370 (374)
T cd03858 298 RGIKGFVRDANGNPIANATISVEGIN-H-D-VTTAEDGDYWRLLLPG-TYNVTASAPGYEPQTKSVVVPNDNSAVVV 370 (374)
T ss_pred CceEEEEECCCCCccCCeEEEEecce-e-e-eEECCCceEEEecCCE-eEEEEEEEcCcceEEEEEEEecCCceEEE
Confidence 38999999999999999999999887 4 5 9999999999998755 4689999999999999888876 555544
No 7
>cd03864 M14_CPN Peptidase M14 Carboxypeptidase N (CPN, also known as kininase I, creatine kinase conversion factor, plasma carboxypeptidase B, arginine carboxypeptidase, and protaminase; EC 3.4.17.3) is an extracellular glycoprotein synthesized in the liver and released into the blood, where it is present in high concentrations. CPN belongs to the N/E subfamily of the M14 family of metallocarboxypeptidases (MCPs).The M14 family are zinc-binding carboxypeptidases (CPs) which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPN plays an important role in protecting the body from excessive buildup of potentially deleterious peptides that normally act as local autocrine or paracrine hormones. It specifically removes C-terminal basic residues. As CPN can cleave lysine more avidly than arginine residues it is also called lysine carboxypeptidase. CPN substrates inclu
Probab=99.13 E-value=2.5e-10 Score=99.93 Aligned_cols=71 Identities=21% Similarity=0.227 Sum_probs=60.8
Q ss_pred EEEEEEEECCCCCeeeeEEEEeccCCCeeeceEecCCceEEEE-ecCCCcEEEEEEecCccceEEEEEccCccEEEe
Q psy4833 9 AVVRGQVLSVQGLGITGIRVSVDKDSGRFGFTLTRGGGWFDML-VNGGGAVTLQFQRSPYRPQTRTVFIPWNQIVVL 84 (141)
Q Consensus 9 ~~IrG~V~D~~GePL~GVsV~Vkgt~~~~G~TiTD~DG~FsL~-V~~g~~~tL~FSyiGY~sqe~~V~~~~n~~~~~ 84 (141)
..|+|+|+|.+|+||+||+|.|+|.. . + ++||.+|.| +. ++.| ..+|.+|++||++++++|.+..++...+
T Consensus 316 ~gI~G~V~D~~g~pi~~A~V~v~g~~-~-~-~~T~~~G~y-~r~l~pG-~Y~l~vs~~Gy~~~t~~v~V~~~~~~~~ 387 (392)
T cd03864 316 QGIKGMVTDENNNGIANAVISVSGIS-H-D-VTSGTLGDY-FRLLLPG-TYTVTASAPGYQPSTVTVTVGPAEATLV 387 (392)
T ss_pred CeEEEEEECCCCCccCCeEEEEECCc-c-c-eEECCCCcE-EecCCCe-eEEEEEEEcCceeEEEEEEEcCCCcEEE
Confidence 47999999999999999999999997 4 5 999999999 54 5555 4689999999999999999887665443
No 8
>cd03867 M14_CPZ Peptidase M14-like domain of carboxypeptidase (CP) Z (CPZ), CPZ belongs to the N/E subfamily of the M14 family of metallocarboxypeptidases (MCPs). The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPZ is a secreted Zn-dependent enzyme whose biological function is largely unknown. Unlike other members of the N/E subfamily, CPZ has a bipartite structure, which consists of an N-terminal cysteine-rich domain (CRD) whose sequence is similar to Wnt-binding proteins, and a C-terminal CP catalytic domain that removes C-terminal Arg residues from substrates. CPZ is enriched in the extracellular matrix and is widely distributed during early embryogenesis. That the CRD of CPZ can bind to Wnt4 suggests that CPZ plays a role in Wnt signaling.
Probab=99.11 E-value=3.2e-10 Score=98.76 Aligned_cols=66 Identities=24% Similarity=0.230 Sum_probs=58.8
Q ss_pred EEEEEEEECCCCCeeeeEEEEeccCCCeeeceEecCCceEEEEecCCCcEEEEEEecCccceEEEEEccC
Q psy4833 9 AVVRGQVLSVQGLGITGIRVSVDKDSGRFGFTLTRGGGWFDMLVNGGGAVTLQFQRSPYRPQTRTVFIPW 78 (141)
Q Consensus 9 ~~IrG~V~D~~GePL~GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~~g~~~tL~FSyiGY~sqe~~V~~~~ 78 (141)
..|+|+|+|++|+||+||+|.++|+. . + +.||.+|.|...++.|. .+|.+|++||++++.+|.++.
T Consensus 318 ~~i~G~V~D~~g~pi~~A~V~v~g~~-~-~-~~Td~~G~y~~~l~~G~-y~l~vs~~Gy~~~~~~v~v~~ 383 (395)
T cd03867 318 RGIKGFVKDKDGNPIKGARISVRGIR-H-D-ITTAEDGDYWRLLPPGI-HIVSAQAPGYTKVMKRVTLPA 383 (395)
T ss_pred ceeEEEEEcCCCCccCCeEEEEeccc-c-c-eEECCCceEEEecCCCc-EEEEEEecCeeeEEEEEEeCC
Confidence 47999999999999999999999987 3 5 99999999987777665 579999999999999999864
No 9
>cd03868 M14_CPD_I The first carboxypeptidase (CP)-like domain of Carboxypeptidase D (CPD; EC 3.4.17.22), domain I. CPD differs from all other metallocarboxypeptidases in that it contains multiple CP-like domains. CPD belongs to the N/E-like subfamily of the M14 family of metallocarboxypeptidases (MCPs).The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPD is a single-chain protein containing a signal peptide, three tandem repeats of CP-like domains separated by short bridge regions, followed by a transmembrane domain, and a C-terminal cytosolic tail. The first two CP-like domains of CPD contain all of the essential active site and substrate-binding residues, the third CP-like domain lacks critical residues necessary for enzymatic activity and is inactive towards standard CP substrates. Domain I is optimally active at p
Probab=99.07 E-value=4.5e-10 Score=96.56 Aligned_cols=73 Identities=22% Similarity=0.167 Sum_probs=61.2
Q ss_pred eEEEEEEEECCCCCeeeeEEEEeccCCCeeeceEecCCceEEEEecCCCcEEEEEEecCccceEEE-EEccCccEEEe
Q psy4833 8 VAVVRGQVLSVQGLGITGIRVSVDKDSGRFGFTLTRGGGWFDMLVNGGGAVTLQFQRSPYRPQTRT-VFIPWNQIVVL 84 (141)
Q Consensus 8 ~~~IrG~V~D~~GePL~GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~~g~~~tL~FSyiGY~sqe~~-V~~~~n~~~~~ 84 (141)
...|+|+|+|.+|+||+||+|.+++.. . + ++||.+|.|.+.++.| ..+|.+|+.||++++++ +.+..++...+
T Consensus 295 ~~~i~G~V~d~~g~pv~~A~V~v~~~~-~-~-~~td~~G~y~~~l~~G-~Y~l~vs~~Gf~~~~~~~v~v~~g~~~~~ 368 (372)
T cd03868 295 HIGVKGFVRDASGNPIEDATIMVAGID-H-N-VTTAKFGDYWRLLLPG-TYTITAVAPGYEPSTVTDVVVKEGEATSV 368 (372)
T ss_pred CCceEEEEEcCCCCcCCCcEEEEEecc-c-c-eEeCCCceEEecCCCE-EEEEEEEecCCCceEEeeEEEcCCCeEEE
Confidence 346899999999999999999999987 4 5 9999999998777756 46899999999999886 45766665554
No 10
>cd03865 M14_CPE_H Peptidase M14 Carboxypeptidase (CP) E (CPE, also known as carboxypeptidase H, and enkephalin convertase; EC 3.4.17.10) belongs to the N/E subfamily of the M14 family of metallocarboxypeptidases (MCPs).The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPE is an important enzyme responsible for the proteolytic processing of prohormone intermediates (such as pro-insulin, pro-opiomelanocortin, or pro-gonadotropin-releasing hormone) by specifically removing C-terminal basic residues. In addition, it has been proposed that the regulated secretory pathway (RSP) of the nervous and endocrine systems utilizes membrane-bound CPE as a sorting receptor. A naturally occurring point mutation in CPE reduces the stability of the enzyme and causes its degradation, leading to an accumulation of numerous neuroendocrine pe
Probab=99.04 E-value=6.1e-10 Score=98.09 Aligned_cols=71 Identities=23% Similarity=0.286 Sum_probs=62.1
Q ss_pred EEEEEEECCCCCeeeeEEEEeccCCCeeeceEecCCceEEEEecCCCcEEEEEEecCccceEEEEEccCccEEEe
Q psy4833 10 VVRGQVLSVQGLGITGIRVSVDKDSGRFGFTLTRGGGWFDMLVNGGGAVTLQFQRSPYRPQTRTVFIPWNQIVVL 84 (141)
Q Consensus 10 ~IrG~V~D~~GePL~GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~~g~~~tL~FSyiGY~sqe~~V~~~~n~~~~~ 84 (141)
-|+|+|+|.+|+||+||+|+|+|.. . + +.||.+|.|.+.++.|. .+|.+|+.||++++..|.++.++...+
T Consensus 327 gI~G~V~D~~g~pI~~AtV~V~g~~-~-~-~~T~~~G~Y~~~L~pG~-Ytv~vsa~Gy~~~~~~V~V~~~~~~~v 397 (402)
T cd03865 327 GVKGFVKDLQGNPIANATISVEGID-H-D-ITSAKDGDYWRLLAPGN-YKLTASAPGYLAVVKKVAVPYSPAVRV 397 (402)
T ss_pred ceEEEEECCCCCcCCCeEEEEEcCc-c-c-cEECCCeeEEECCCCEE-EEEEEEecCcccEEEEEEEcCCCcEEE
Confidence 4999999999999999999999987 4 5 99999999999777664 689999999999999999987765443
No 11
>cd03866 M14_CPM Peptidase M14 Carboxypeptidase (CP) M (CPM) belongs to the N/E subfamily of the M14 family of metallocarboxypeptidases (MCPs).The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPM is an extracellular glycoprotein, bound to cell membranes via a glycosyl-phosphatidylinositol on the C-terminus of the protein. It specifically removes C-terminal basic residues such as lysine and arginine from peptides and proteins. The highest levels of CPM have been found in human lung and placenta, but significant amounts are present in kidney, blood vessels, intestine, brain, and peripheral nerves. CPM has also been found in soluble form in various body fluids, including amniotic fluid, seminal plasma and urine. Due to its wide distribution in a variety of tissues, it is believed that it plays an important role in the cont
Probab=98.92 E-value=4.8e-09 Score=91.14 Aligned_cols=69 Identities=23% Similarity=0.292 Sum_probs=58.9
Q ss_pred EEEEEEEECCCCCeeeeEEEEeccCCCeeeceEecCCceEEEEecCCCcEEEEEEecCccceEEEEEccCc
Q psy4833 9 AVVRGQVLSVQGLGITGIRVSVDKDSGRFGFTLTRGGGWFDMLVNGGGAVTLQFQRSPYRPQTRTVFIPWN 79 (141)
Q Consensus 9 ~~IrG~V~D~~GePL~GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~~g~~~tL~FSyiGY~sqe~~V~~~~n 79 (141)
.-|+|+|+|.+|+||+||+|.|.|.. ....+.||.+|.|.+.+..|. .+|.+|..||+++..++.++.+
T Consensus 295 ~gI~G~V~D~~g~pi~~A~V~v~g~~-~~~~~~T~~~G~y~~~l~pG~-Y~v~vsa~Gy~~~~~~v~v~~~ 363 (376)
T cd03866 295 LGVKGQVFDSNGNPIPNAIVEVKGRK-HICPYRTNVNGEYFLLLLPGK-YMINVTAPGFKTVITNVIIPYN 363 (376)
T ss_pred CceEEEEECCCCCccCCeEEEEEcCC-ceeEEEECCCceEEEecCCee-EEEEEEeCCcceEEEEEEeCCC
Confidence 46999999999999999999999886 311268999999999877665 5799999999999999988754
No 12
>PF08400 phage_tail_N: Prophage tail fibre N-terminal; InterPro: IPR013609 This entry represents the N terminus of phage 933W tail fibre protein. The characteristics of the protein distribution suggest prophage matches.
Probab=98.14 E-value=1.9e-05 Score=61.04 Aligned_cols=74 Identities=12% Similarity=0.119 Sum_probs=55.4
Q ss_pred eEEEEEEEECCCCCeeeeEEEEeccCCC-------eeeceEecCCceEEEEecCCCcEEEEEEecCccceEEEEEccCcc
Q psy4833 8 VAVVRGQVLSVQGLGITGIRVSVDKDSG-------RFGFTLTRGGGWFDMLVNGGGAVTLQFQRSPYRPQTRTVFIPWNQ 80 (141)
Q Consensus 8 ~~~IrG~V~D~~GePL~GVsV~Vkgt~~-------~~G~TiTD~DG~FsL~V~~g~~~tL~FSyiGY~sqe~~V~~~~n~ 80 (141)
.++|+|...|..|+|++|++|.++--.+ ......||.+|.|+|.+..|.. .+.+...|..
T Consensus 2 sV~ISGvL~dg~G~pv~g~~I~L~A~~tS~~Vv~~t~as~~t~~~G~Ys~~~epG~Y-------------~V~l~~~g~~ 68 (134)
T PF08400_consen 2 SVKISGVLKDGAGKPVPGCTITLKARRTSSTVVVGTVASVVTGEAGEYSFDVEPGVY-------------RVTLKVEGRP 68 (134)
T ss_pred eEEEEEEEeCCCCCcCCCCEEEEEEccCchheEEEEEEEEEcCCCceEEEEecCCeE-------------EEEEEECCCC
Confidence 3689999999999999999999874321 1244689999999999987753 3445566666
Q ss_pred EEEeCCeEEEecCC
Q psy4833 81 IVVLPPVKMQLSDA 94 (141)
Q Consensus 81 ~~~~~~v~m~~~d~ 94 (141)
..++.+|..+.-+.
T Consensus 69 ~~~vG~I~V~~dS~ 82 (134)
T PF08400_consen 69 PVYVGDITVYEDSK 82 (134)
T ss_pred ceeEEEEEEecCCC
Confidence 67777887775544
No 13
>cd03869 M14_CPX_like Peptidase M14-like domain of carboxypeptidase (CP)-like protein X (CPX), CPX forms a distinct subgroup of the N/E subfamily of the M14 family of metallocarboxypeptidases (MCPs). The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. Proteins belonging to this subgroup include CP-like protein X1 (CPX1), CP-like protein X2 (CPX2), and aortic CP-like protein (ACLP) and its isoform adipocyte enhancer binding protein-1 (AEBP1). AEBP1 is a truncated form of ACLP, which may arise from alternative splicing of the gene. These proteins are inactive towards standard CP substrates because they lack one or more critical active site and substrate-binding residues that are necessary for activity. They may function as binding proteins rather than as active CPs or display catalytic activity toward other substrates. Pro
Probab=97.98 E-value=2e-05 Score=69.89 Aligned_cols=67 Identities=21% Similarity=0.272 Sum_probs=55.8
Q ss_pred EEEEEEEECCCCCeeeeEEEEeccCCCeeeceEecCCceEEEEecCCCcEEEEEEecCccceEEEEEccCc
Q psy4833 9 AVVRGQVLSVQGLGITGIRVSVDKDSGRFGFTLTRGGGWFDMLVNGGGAVTLQFQRSPYRPQTRTVFIPWN 79 (141)
Q Consensus 9 ~~IrG~V~D~~GePL~GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~~g~~~tL~FSyiGY~sqe~~V~~~~n 79 (141)
.=|+|.|+|.+|.||++|+|+|+|.. . . ..|..+|.|-=-+.+|. .++.+|.-||.++.+++.++.+
T Consensus 329 ~GikG~V~d~~g~~i~~a~i~v~g~~-~-~-v~t~~~GdywRll~pG~-y~v~~~a~gy~~~~~~~~v~~~ 395 (405)
T cd03869 329 RGIKGVVRDKTGKGIPNAIISVEGIN-H-D-IRTASDGDYWRLLNPGE-YRVTAHAEGYTSSTKNCEVGYE 395 (405)
T ss_pred cCceEEEECCCCCcCCCcEEEEecCc-c-c-eeeCCCCceEEecCCce-EEEEEEecCCCcccEEEEEcCC
Confidence 35899999999999999999999976 3 3 77788998866655564 5799999999999998888754
No 14
>PF07210 DUF1416: Protein of unknown function (DUF1416); InterPro: IPR010814 This family consists of several hypothetical bacterial proteins of around 100 residues in length. Members of this family appear to be Actinomycete specific. The function of this family is unknown.
Probab=97.74 E-value=0.00037 Score=50.49 Aligned_cols=67 Identities=24% Similarity=0.307 Sum_probs=51.1
Q ss_pred cCCcceEEEEEEEECCCCCeeeeEEEEeccCCCee-eceEecCCceEEEEecCCCcEEEE-EEecCccceE
Q psy4833 3 DDELRVAVVRGQVLSVQGLGITGIRVSVDKDSGRF-GFTLTRGGGWFDMLVNGGGAVTLQ-FQRSPYRPQT 71 (141)
Q Consensus 3 ~d~~r~~~IrG~V~D~~GePL~GVsV~Vkgt~~~~-G~TiTD~DG~FsL~V~~g~~~tL~-FSyiGY~sqe 71 (141)
.|.+...+|+|+|+ .+|+|++|+-|+++..+++| +.+.|.++|.|.+-..+|.= +|+ .+.-|-....
T Consensus 2 vd~~ke~VItG~V~-~~G~Pv~gAyVRLLD~sgEFtaEvvts~~G~FRFfaapG~W-tvRal~~~g~~d~~ 70 (85)
T PF07210_consen 2 VDVEKETVITGRVT-RDGEPVGGAYVRLLDSSGEFTAEVVTSATGDFRFFAAPGSW-TVRALSRGGNGDAM 70 (85)
T ss_pred cCccceEEEEEEEe-cCCcCCCCeEEEEEcCCCCeEEEEEecCCccEEEEeCCCce-EEEEEccCCCCceE
Confidence 57788899999999 99999999999998765433 56899999999998765643 343 3555555333
No 15
>KOG2649|consensus
Probab=97.64 E-value=0.00024 Score=64.74 Aligned_cols=89 Identities=25% Similarity=0.249 Sum_probs=68.5
Q ss_pred EEEEEEEECCCCCeeeeEEEEeccCCCeeeceEecCCceEEEEecCCCcEEEEEEecCccceEEEEEccCccEEEeCCeE
Q psy4833 9 AVVRGQVLSVQGLGITGIRVSVDKDSGRFGFTLTRGGGWFDMLVNGGGAVTLQFQRSPYRPQTRTVFIPWNQIVVLPPVK 88 (141)
Q Consensus 9 ~~IrG~V~D~~GePL~GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~~g~~~tL~FSyiGY~sqe~~V~~~~n~~~~~~~v~ 88 (141)
.-|+|.|+|.+|+||+.|+|+|+|-+ - . +.|-.+|.|=--+++|.. .|.++.-||.+...+|.++. .--..=++.
T Consensus 378 ~GIkG~V~D~~G~~I~NA~IsV~gin-H-d-v~T~~~GDYWRLL~PG~y-~vta~A~Gy~~~tk~v~V~~-~~a~~~df~ 452 (500)
T KOG2649|consen 378 RGIKGLVFDDTGNPIANATISVDGIN-H-D-VTTAKEGDYWRLLPPGKY-IITASAEGYDPVTKTVTVPP-DRAARVNFT 452 (500)
T ss_pred hccceeEEcCCCCccCceEEEEecCc-C-c-eeecCCCceEEeeCCcce-EEEEecCCCcceeeEEEeCC-CCccceeEE
Confidence 46899999999999999999999887 3 3 888889999877776764 69999999999999999886 333333566
Q ss_pred EEecCCCccceecc
Q psy4833 89 MQLSDASDIFREVA 102 (141)
Q Consensus 89 m~~~d~~~~~~~~~ 102 (141)
|.....++.+.+..
T Consensus 453 L~~~~~~~~~r~~~ 466 (500)
T KOG2649|consen 453 LQRSIPQPPFRELL 466 (500)
T ss_pred EecCCCcchhhHHH
Confidence 66555444444333
No 16
>PRK15036 hydroxyisourate hydrolase; Provisional
Probab=97.57 E-value=0.00012 Score=56.27 Aligned_cols=61 Identities=16% Similarity=0.287 Sum_probs=45.7
Q ss_pred EEEEEEEECC-CCCeeeeEEEEeccCCC----eeeceEecCCceEEEEe----cCCCcEEEEEEecCccc
Q psy4833 9 AVVRGQVLSV-QGLGITGIRVSVDKDSG----RFGFTLTRGGGWFDMLV----NGGGAVTLQFQRSPYRP 69 (141)
Q Consensus 9 ~~IrG~V~D~-~GePL~GVsV~Vkgt~~----~~G~TiTD~DG~FsL~V----~~g~~~tL~FSyiGY~s 69 (141)
..|+|.|+|. .|.|++||.|.+.+... .++.+.||+||+|.... -..+...|+|.--+|=.
T Consensus 27 ~~Is~HVLDt~~G~PA~gV~V~L~~~~~~~w~~l~~~~Td~dGR~~~l~~~~~~~~G~Y~L~F~t~~Yf~ 96 (137)
T PRK15036 27 NILSVHILNQQTGKPAADVTVTLEKKADNGWLQLNTAKTDKDGRIKALWPEQTATTGDYRVVFKTGDYFK 96 (137)
T ss_pred CCeEEEEEeCCCCcCCCCCEEEEEEccCCceEEEEEEEECCCCCCccccCcccCCCeeEEEEEEcchhhh
Confidence 3699999996 79999999999976431 12458999999998621 11245689998888765
No 17
>cd00421 intradiol_dioxygenase Intradiol dioxygenases catalyze the critical ring-cleavage step in the conversion of catecholate derivatives to citric acid cycle intermediates. This family contains catechol 1,2-dioxygenases and protocatechuate 3,4-dioxygenases which are mononuclear non-heme iron enzymes that catalyze the oxygenation of catecholates to aliphatic acids via the cleavage of aromatic rings. The members are intradiol-cleaving enzymes which break the catechol C1-C2 bond and utilize Fe3+, as opposed to the extradiol-cleaving enzymes which break the C2-C3 or C1-C6 bond and utilize Fe2+ and Mn+. Catechol 1,2-dioxygenases are mostly homodimers with one catalytic ferric ion per monomer. Protocatechuate 3,4-dioxygenases form more diverse oligomers.
Probab=97.43 E-value=0.0016 Score=49.80 Aligned_cols=74 Identities=18% Similarity=0.260 Sum_probs=51.1
Q ss_pred CcceEEEEEEEECCCCCeeeeEEEEeccCCC-----------------eeeceEecCCceEEEE-ecCCCc-------EE
Q psy4833 5 ELRVAVVRGQVLSVQGLGITGIRVSVDKDSG-----------------RFGFTLTRGGGWFDML-VNGGGA-------VT 59 (141)
Q Consensus 5 ~~r~~~IrG~V~D~~GePL~GVsV~Vkgt~~-----------------~~G~TiTD~DG~FsL~-V~~g~~-------~t 59 (141)
+.....|+|+|+|.+|.|++||.|.+-.... ..|...||++|+|.+. +.++.. +-
T Consensus 8 ~G~~l~l~G~V~D~~g~pv~~A~VeiW~~d~~G~Y~~~~~~~~~~~~~~rg~~~Td~~G~y~f~ti~Pg~Y~~~R~~HiH 87 (146)
T cd00421 8 PGEPLTLTGTVLDGDGCPVPDALVEIWQADADGRYSGQDDSGLDPEFFLRGRQITDADGRYRFRTIKPGPYPIGRPPHIH 87 (146)
T ss_pred CCCEEEEEEEEECCCCCCCCCcEEEEEecCCCCccCCcCccccCCCCCCEEEEEECCCcCEEEEEEcCCCCCCCCCCEEE
Confidence 4567899999999999999999998843221 1367899999999997 333321 23
Q ss_pred EEEEecCc-cceEEEEEccC
Q psy4833 60 LQFQRSPY-RPQTRTVFIPW 78 (141)
Q Consensus 60 L~FSyiGY-~sqe~~V~~~~ 78 (141)
+.+.--|| ....-+++.+.
T Consensus 88 ~~V~~~g~~~~l~Tqlyf~~ 107 (146)
T cd00421 88 FKVFAPGYNRRLTTQLYFPG 107 (146)
T ss_pred EEEECCCccCcEEEEEEeCC
Confidence 55555666 55555555553
No 18
>cd03461 1,2-HQD Hydroxyquinol 1,2-dioxygenase (1,2-HQD) catalyzes the ring cleavage of hydroxyquinol (1,2,4-trihydroxybenzene), a intermediate in the degradation of a large variety of aromatic compounds including some polychloro- and nitroaromatic pollutants, to form 3-hydroxy-cis,cis-muconates. 1,2-HQD blongs to the aromatic dioxygenase family, a family of mononuclear non-heme intradiol-cleaving enzymes.
Probab=97.36 E-value=0.002 Score=54.85 Aligned_cols=75 Identities=17% Similarity=0.194 Sum_probs=55.0
Q ss_pred CcceEEEEEEEECCCCCeeeeEEEEecc----------------CCCeeeceEecCCceEEEE-ecCCC-----------
Q psy4833 5 ELRVAVVRGQVLSVQGLGITGIRVSVDK----------------DSGRFGFTLTRGGGWFDML-VNGGG----------- 56 (141)
Q Consensus 5 ~~r~~~IrG~V~D~~GePL~GVsV~Vkg----------------t~~~~G~TiTD~DG~FsL~-V~~g~----------- 56 (141)
+.....|+|+|+|.+|.||+||.|-+=. .. ..|...||+||+|.+. +.++.
T Consensus 117 ~G~~l~v~G~V~D~~G~Pv~gA~VeiWqad~~G~Y~~~~~~~~~~~-lRGr~~Td~~G~y~F~Ti~Pg~Ypip~dGp~g~ 195 (277)
T cd03461 117 DGEPCFVHGRVTDTDGKPLPGATVDVWQADPNGLYDVQDPDQPEFN-LRGKFRTDEDGRYAFRTLRPTPYPIPTDGPVGK 195 (277)
T ss_pred CCCEEEEEEEEEcCCCCCcCCcEEEEECcCCCCCcCCCCCCCCCCC-CeEEEEeCCCCCEEEEEECCCCcCCCCCCcHHH
Confidence 4556899999999999999999998821 11 2367899999999996 32221
Q ss_pred -------------cEEEEEEecCccceEEEEEccCcc
Q psy4833 57 -------------AVTLQFQRSPYRPQTRTVFIPWNQ 80 (141)
Q Consensus 57 -------------~~tL~FSyiGY~sqe~~V~~~~n~ 80 (141)
.+-+.++.-||+...-+++.+...
T Consensus 196 lL~~~grh~~RpaHIHf~V~a~G~~~L~Tqlyf~~d~ 232 (277)
T cd03461 196 LLKAMGRHPMRPAHIHFMVTAPGYRTLVTQIFDSGDP 232 (277)
T ss_pred HHHhhhccCCCCCeEEEEEEcCCcCceEEeEecCCCc
Confidence 345667778887777777777554
No 19
>TIGR02423 protocat_alph protocatechuate 3,4-dioxygenase, alpha subunit. This model represents the alpha chain of protocatechuate 3,4-dioxygenase. The most closely related family outside this family is that of the beta chain (TIGR02422), typically encoded in an adjacent locus. This enzyme acts in the degradation of aromatic compounds by way of p-hydroxybenzoate to succinate and acetyl-CoA.
Probab=97.34 E-value=0.0031 Score=50.98 Aligned_cols=77 Identities=18% Similarity=0.217 Sum_probs=52.2
Q ss_pred cceEEEEEEEECCCCCeeeeEEEEecc------------------CCC--eeeceEecCCceEEEE-ecCCC--------
Q psy4833 6 LRVAVVRGQVLSVQGLGITGIRVSVDK------------------DSG--RFGFTLTRGGGWFDML-VNGGG-------- 56 (141)
Q Consensus 6 ~r~~~IrG~V~D~~GePL~GVsV~Vkg------------------t~~--~~G~TiTD~DG~FsL~-V~~g~-------- 56 (141)
...+.|+|+|+|.+|+||+||.|-+=. .++ ..|...||++|+|.+. +..+.
T Consensus 37 G~~l~l~G~V~D~~g~Pv~~A~VeiWqad~~G~Y~~~~~~~~~~~~~~f~grGr~~Td~~G~y~f~TI~Pg~Yp~~~g~~ 116 (193)
T TIGR02423 37 GERIRLEGRVLDGDGHPVPDALIEIWQADAAGRYNSPADLRAPATDPGFRGWGRTGTDESGEFTFETVKPGAVPDRDGVL 116 (193)
T ss_pred CCEEEEEEEEECCCCCCCCCCEEEEEccCCCCccCCccCCcccccCCCCCCeEEEEECCCCCEEEEEEcCCCcCCCCCCC
Confidence 456899999999999999999998721 000 1255789999999996 33332
Q ss_pred ---cEEEEEEecCc-cceEEEEEccCccEE
Q psy4833 57 ---AVTLQFQRSPY-RPQTRTVFIPWNQIV 82 (141)
Q Consensus 57 ---~~tL~FSyiGY-~sqe~~V~~~~n~~~ 82 (141)
.+-+.+..-|| +...-+++.+.....
T Consensus 117 R~~HiH~~V~a~G~~~~LtTqlYf~~~~~~ 146 (193)
T TIGR02423 117 QAPHINVSVFARGINRRLYTRLYFDDEAAA 146 (193)
T ss_pred cCCeEEEEEECCCcccceEEEEEcCCCccc
Confidence 23355556676 556556666655444
No 20
>cd03458 Catechol_intradiol_dioxygenases Catechol intradiol dioxygenases can be divided into several subgroups according to their substrate specificity for catechol, chlorocatechols and hydroxyquinols. Almost all members of this family are homodimers containing one ferric ion (Fe3+) per monomer. They belong to the intradiol dioxygenase family, a family of mononuclear non-heme iron intradiol-cleaving enzymes that catalyze the oxygenation of catecholates to aliphatic acids via the cleavage of aromatic rings.
Probab=97.22 E-value=0.0037 Score=52.76 Aligned_cols=76 Identities=17% Similarity=0.220 Sum_probs=53.4
Q ss_pred CcceEEEEEEEECCCCCeeeeEEEEeccCC---------------CeeeceEecCCceEEEE-ecCC-------------
Q psy4833 5 ELRVAVVRGQVLSVQGLGITGIRVSVDKDS---------------GRFGFTLTRGGGWFDML-VNGG------------- 55 (141)
Q Consensus 5 ~~r~~~IrG~V~D~~GePL~GVsV~Vkgt~---------------~~~G~TiTD~DG~FsL~-V~~g------------- 55 (141)
+.....|+|+|+|.+|+||+||.|-+=... +..|...||+||+|.+. +.++
T Consensus 101 ~G~~l~l~G~V~D~~G~Pv~~A~VeiWqad~~G~Y~~~~~~~~~~~lRG~~~Td~~G~y~f~Ti~P~~Ypip~dGp~g~l 180 (256)
T cd03458 101 DGEPLFVHGTVTDTDGKPLAGATVDVWHADPDGFYSQQDPDQPEFNLRGKFRTDEDGRYRFRTIRPVPYPIPPDGPTGEL 180 (256)
T ss_pred CCcEEEEEEEEEcCCCCCCCCcEEEEEccCCCCCcCCCCCCCCCCCCEEEEEeCCCCCEEEEEECCCCccCCCCCcHHHH
Confidence 456789999999999999999999882110 02367899999999996 3332
Q ss_pred -----------CcEEEEEEecCccceEEEEEccCcc
Q psy4833 56 -----------GAVTLQFQRSPYRPQTRTVFIPWNQ 80 (141)
Q Consensus 56 -----------~~~tL~FSyiGY~sqe~~V~~~~n~ 80 (141)
..+-+.++.-||+...-+++.+...
T Consensus 181 L~~~grh~~RpaHIHf~V~a~G~~~LtTqlyf~~d~ 216 (256)
T cd03458 181 LEALGRHPWRPAHIHFMVSAPGYRTLTTQIYFEGDP 216 (256)
T ss_pred HHhcccCCCCCCeEEEEEECCCcceeeEEEecCCCc
Confidence 2234666777776666666666444
No 21
>TIGR02438 catachol_actin catechol 1,2-dioxygenase, Actinobacterial. Members of this family are catechol 1,2-dioxygenases of the Actinobacteria. They are more closely related to actinobacterial chlorocatechol 1,2-dioxygenases than to proteobacterial catechol 1,2-dioxygenases, and so are built in this separate model. The member from Rhodococcus rhodochrous NCIMB 13259 (GB|AAC33003.1) is described as a homodimer with bound Fe, similarly active on catechol, 3-methylcatechol and 4-methylcatechol.
Probab=97.22 E-value=0.0032 Score=53.88 Aligned_cols=78 Identities=19% Similarity=0.322 Sum_probs=53.8
Q ss_pred CCcceEEEEEEEECCCCCeeeeEEEEe--ccCCC-------------eeeceEecCCceEEEE-ecCC------------
Q psy4833 4 DELRVAVVRGQVLSVQGLGITGIRVSV--DKDSG-------------RFGFTLTRGGGWFDML-VNGG------------ 55 (141)
Q Consensus 4 d~~r~~~IrG~V~D~~GePL~GVsV~V--kgt~~-------------~~G~TiTD~DG~FsL~-V~~g------------ 55 (141)
++.....|+|+|+|.+|+||+||.|.+ .+..+ ..|...||+||+|.+. +.++
T Consensus 128 ~~G~pl~v~G~V~D~~G~Pv~gA~VdiWqada~G~Ys~~~~~~~~~~lRGr~~TDadG~y~F~TI~Pg~YpiP~dGp~G~ 207 (281)
T TIGR02438 128 EAGTPLVFSGQVTDLDGNGLAGAKVELWHADDDGFYSQFAPGIPEWNLRGTIIADDEGRFEITTMQPAPYQIPTDGPTGK 207 (281)
T ss_pred CCCCEEEEEEEEEcCCCCCcCCCEEEEEecCCCCCcCCCCCCCCCCCCeEEEEeCCCCCEEEEEECCCCcCCCCCCchHH
Confidence 345568999999999999999999999 11111 1256789999999996 3222
Q ss_pred ------------CcEEEEEEecCccceEEEEEccCccE
Q psy4833 56 ------------GAVTLQFQRSPYRPQTRTVFIPWNQI 81 (141)
Q Consensus 56 ------------~~~tL~FSyiGY~sqe~~V~~~~n~~ 81 (141)
..+-+.++.-||+...-+++.+...+
T Consensus 208 lL~~~Grh~~RpaHIHf~V~a~G~~~LtTqiyf~gd~~ 245 (281)
T TIGR02438 208 FIAAAGGHPWRPAHLHLKVSAPGHELITTQLYFKGGEH 245 (281)
T ss_pred HHHhcccCCCCCCEEEEEEECCCcceEEEeEecCCCCC
Confidence 22345567777776666666664443
No 22
>PF00775 Dioxygenase_C: Dioxygenase; InterPro: IPR000627 This entry represents the C-terminal domain common to several intradiol ring-cleavage dioxygenases. Dioxygenases catalyse the incorporation of both atoms of molecular oxygen into substrates using a variety of reaction mechanisms. Cleavage of aromatic rings is one of the most important functions of dioxygenases, which play key roles in the degradation of aromatic compounds. The substrates of ring-cleavage dioxygenases can be classified into two groups according to the mode of scission of the aromatic ring. Intradiol enzymes use a non-haem Fe(III) to cleave the aromatic ring between two hydroxyl groups (ortho-cleavage), whereas extradiol enzymes (IPR000486 from INTERPRO) use a non-haem Fe(II) to cleave the aromatic ring between a hydroxylated carbon and an adjacent non-hydroxylated carbon (meta-cleavage) []. These two subfamilies differ in sequence, structural fold, iron ligands, and the orientation of second sphere active site amino acid residues. Enzymes that belong to the intradiol family include catechol 1,2-dioxygenase (1,2-CTD) (1.13.11.1 from EC); protocatechuate 3,4-dioxygenase (3,4-PCD) (1.13.11.3 from EC); and chlorocatechol 1,2-dioxygenase (1.13.11.1 from EC) [].; GO: 0003824 catalytic activity, 0008199 ferric iron binding, 0006725 cellular aromatic compound metabolic process, 0055114 oxidation-reduction process; PDB: 2BUV_A 2BUX_A 2BUU_A 2BUR_A 1EO9_A 2BUZ_A 2BV0_A 1EO2_A 1EOC_A 1EOA_A ....
Probab=97.17 E-value=0.0061 Score=48.70 Aligned_cols=48 Identities=19% Similarity=0.193 Sum_probs=36.4
Q ss_pred CCcceEEEEEEEECCCCCeeeeEEEEec-----cC------------CCeeeceEecCCceEEEE
Q psy4833 4 DELRVAVVRGQVLSVQGLGITGIRVSVD-----KD------------SGRFGFTLTRGGGWFDML 51 (141)
Q Consensus 4 d~~r~~~IrG~V~D~~GePL~GVsV~Vk-----gt------------~~~~G~TiTD~DG~FsL~ 51 (141)
.+.....|+|+|+|.+|+||+||.|-|= |. .+..|...||++|+|.+.
T Consensus 25 ~~G~~l~l~G~V~D~~g~Pv~~A~veiWqada~G~Ys~~~~~~~~~~~~~rG~~~Td~~G~y~f~ 89 (183)
T PF00775_consen 25 APGEPLVLHGRVIDTDGKPVPGALVEIWQADADGRYSGQDPGSDQPDFNLRGRFRTDADGRYSFR 89 (183)
T ss_dssp SSS-EEEEEEEEEETTSSB-TTEEEEEEE--TTS--TTTBTTSSSSTTTTEEEEEECTTSEEEEE
T ss_pred CCCCEEEEEEEEECCCCCCCCCcEEEEEecCCCCccccccccccccCCCcceEEecCCCCEEEEE
Confidence 3556889999999999999999999982 10 012377899999999986
No 23
>cd03462 1,2-CCD chlorocatechol 1,2-dioxygenases (1,2-CCDs) (type II enzymes) are homodimeric intradiol dioxygenases that degrade chlorocatechols via the addition of molecular oxygen and the subsequent cleavage between two adjacent hydroxyl groups. This reaction is part of the modified ortho-cleavage pathway which is a central oxidative bacterial pathway that channels chlorocatechols, derived from the degradation of chlorinated benzoic acids, phenoxyacetic acids, phenols, benzenes, and other aromatics into the energy-generating tricarboxylic acid pathway.
Probab=97.14 E-value=0.0045 Score=52.03 Aligned_cols=77 Identities=16% Similarity=0.217 Sum_probs=54.8
Q ss_pred CcceEEEEEEEECCCCCeeeeEEEEeccCC---------------CeeeceEecCCceEEEE-ecCC-------------
Q psy4833 5 ELRVAVVRGQVLSVQGLGITGIRVSVDKDS---------------GRFGFTLTRGGGWFDML-VNGG------------- 55 (141)
Q Consensus 5 ~~r~~~IrG~V~D~~GePL~GVsV~Vkgt~---------------~~~G~TiTD~DG~FsL~-V~~g------------- 55 (141)
+.....|+|+|+|.+|+||+||.|-+=... +..|...||+||+|.+. +.++
T Consensus 96 ~G~~l~l~G~V~D~~G~Pv~~A~VeiWqad~~G~Y~~~~~~~~~~~~RG~~~Td~~G~y~F~Ti~P~~YpiP~dGp~g~l 175 (247)
T cd03462 96 DHKPLLFRGTVKDLAGAPVAGAVIDVWHSTPDGKYSGFHPNIPEDYYRGKIRTDEDGRYEVRTTVPVPYQIPNDGPTGAL 175 (247)
T ss_pred CCCEEEEEEEEEcCCCCCcCCcEEEEECCCCCCCcCCCCCCCCCCCCEEEEEeCCCCCEEEEEECCCCcCCCCCCcHHHH
Confidence 445689999999999999999999883111 01256789999999996 3322
Q ss_pred -----------CcEEEEEEecCccceEEEEEccCccE
Q psy4833 56 -----------GAVTLQFQRSPYRPQTRTVFIPWNQI 81 (141)
Q Consensus 56 -----------~~~tL~FSyiGY~sqe~~V~~~~n~~ 81 (141)
..+-+.++.-||+...-+++.+...+
T Consensus 176 L~~~grh~~RpaHIHf~V~a~G~~~L~Tqlyf~~d~~ 212 (247)
T cd03462 176 LEAMGGHSWRPAHVHFKVRADGYETLTTQLYFEGGEW 212 (247)
T ss_pred HHhcccCCCCCCeEEEEEECCCcCceEEEEecCCCCC
Confidence 23456777788877666777664444
No 24
>cd03459 3,4-PCD Protocatechuate 3,4-dioxygenase (3,4-PCD) catalyzes the oxidative ring cleavage of 3,4-dihydroxybenzoate to produce beta-carboxy-cis,cis-muconate. 3,4-PCDs are large aggregates of 12 protomers, each composed of an alpha- and beta-subunit and an Fe3+ ion bound in the beta-subunit at the alpha-beta-subunit interface. 3,4-PCD is a member of the aromatic dioxygenases which are non-heme iron intradiol-cleaving enzymes that break the C1-C2 bond and utilize Fe3+.
Probab=97.11 E-value=0.0066 Score=47.56 Aligned_cols=74 Identities=16% Similarity=0.267 Sum_probs=52.1
Q ss_pred CcceEEEEEEEECCCCCeeeeEEEEeccC------------------CC--eeeceEecCCceEEEE-ecCCC-------
Q psy4833 5 ELRVAVVRGQVLSVQGLGITGIRVSVDKD------------------SG--RFGFTLTRGGGWFDML-VNGGG------- 56 (141)
Q Consensus 5 ~~r~~~IrG~V~D~~GePL~GVsV~Vkgt------------------~~--~~G~TiTD~DG~FsL~-V~~g~------- 56 (141)
+...+.|+|+|+|.+|+||+||.|.+=.. ++ ..|...||+||+|.+. +.++.
T Consensus 12 ~G~~l~l~g~V~D~~g~Pv~~A~veiWqad~~G~Y~~~~~~~~~~~~~~f~~rG~~~Td~~G~~~f~Ti~Pg~Y~~p~~~ 91 (158)
T cd03459 12 IGERIILEGRVLDGDGRPVPDALVEIWQADAAGRYRHPRDSHRAPLDPNFTGFGRVLTDADGRYRFRTIKPGAYPWRNGA 91 (158)
T ss_pred CCcEEEEEEEEECCCCCCCCCCEEEEEccCCCCccCCccCCcccccCCCCCceeEEEECCCCcEEEEEECCCCcCCCCCC
Confidence 34578999999999999999999987211 00 1255789999999996 33221
Q ss_pred ----cEEEEEEecCc-cceEEEEEccC
Q psy4833 57 ----AVTLQFQRSPY-RPQTRTVFIPW 78 (141)
Q Consensus 57 ----~~tL~FSyiGY-~sqe~~V~~~~ 78 (141)
.+-+.+.--|| +...-+++.+.
T Consensus 92 ~R~~HIH~~V~~~g~~~~L~Tqlyf~~ 118 (158)
T cd03459 92 WRAPHIHVSVFARGLLERLVTRLYFPG 118 (158)
T ss_pred CcCCEEEEEEECCCcccceEEeEecCC
Confidence 13356666788 66666777775
No 25
>TIGR02439 catechol_proteo catechol 1,2-dioxygenase, proteobacterial. Members of this family known so far are catechol 1,2-dioxygenases of the Proteobacteria. They are distinct from catechol 1,2-dioxygenases and chlorocatechol 1,2-dioxygenases of the Actinobacteria, which are quite similar to each other and resolved by separate models. This enzyme catalyzes intradiol cleavage in which catechol + O2 becomes cis,cis-muconate. Catechol is an intermediate in the catabolism of many different aromatic compounds, as is the alternative intermediate protocatechuate. In Acinetobacter lwoffii, two isozymes are present with abilities, differing somewhat, to act on catechol analogs 3-methylcatechol, 4-methylcatechol, 4-methoxycatechol, and 4-chlorocatechol.
Probab=97.11 E-value=0.0056 Score=52.41 Aligned_cols=79 Identities=18% Similarity=0.146 Sum_probs=56.0
Q ss_pred CcceEEEEEEEECCCCCeeeeEEEEeccC-------------C--CeeeceEecCCceEEEE-ecCC-------------
Q psy4833 5 ELRVAVVRGQVLSVQGLGITGIRVSVDKD-------------S--GRFGFTLTRGGGWFDML-VNGG------------- 55 (141)
Q Consensus 5 ~~r~~~IrG~V~D~~GePL~GVsV~Vkgt-------------~--~~~G~TiTD~DG~FsL~-V~~g------------- 55 (141)
+.....|+|+|+|.+|+||+||.|-+=.. + +..|...||+||+|.+. +.++
T Consensus 125 ~G~pl~v~G~V~D~~G~PI~gA~VeIWqad~~G~Ys~~~~~~~~~~lRG~~~TD~~G~y~F~TI~P~~YpiP~dGp~g~l 204 (285)
T TIGR02439 125 PGETLFLHGQVTDADGKPIAGAKVELWHANTKGNYSHFDKSQSEFNLRRTIITDAEGRYRARSIVPSGYGCPPQGPTQQL 204 (285)
T ss_pred CCcEEEEEEEEECCCCCCcCCcEEEEEccCCCCCcCCCCCCCCCCCceEEEEECCCCCEEEEEECCCCCcCCCCCcHHHH
Confidence 35678999999999999999999988211 0 02267899999999996 3222
Q ss_pred -----------CcEEEEEEecCccceEEEEEccCccEEE
Q psy4833 56 -----------GAVTLQFQRSPYRPQTRTVFIPWNQIVV 83 (141)
Q Consensus 56 -----------~~~tL~FSyiGY~sqe~~V~~~~n~~~~ 83 (141)
..+-+.++.-||+...-+++.+...++.
T Consensus 205 L~~~grh~~RpaHIHf~V~a~G~~~LtTQiyf~gd~~l~ 243 (285)
T TIGR02439 205 LNLLGRHGNRPAHVHFFVSAPGHRKLTTQINIEGDPYLW 243 (285)
T ss_pred HHhccCCCCCCCeEEEEEecCCCCeeEEEEecCCCCCcc
Confidence 2345667777887777777766554443
No 26
>PF02369 Big_1: Bacterial Ig-like domain (group 1); InterPro: IPR003344 Proteins that contain this domain are found in a variety of bacterial and phage surface proteins such as intimins. Intimin is a bacterial cell-adhesion molecule that mediates the intimate bacterial host-cell interaction. It contains three domains; two immunoglobulin-like domains and a C-type lectin-like module implying that carbohydrate recognition may be important in intimin-mediated cell adhesion [].; PDB: 1CWV_A 4E9L_A 1F02_I 1F00_I.
Probab=97.11 E-value=0.0047 Score=44.25 Aligned_cols=70 Identities=14% Similarity=0.187 Sum_probs=44.3
Q ss_pred CCcceEEEEEEEECCCCCeeeeEEEEe--ccCCCeee----ceEecCCceEEEEecC--CCcEEEEEEecCccceEEE
Q psy4833 4 DELRVAVVRGQVLSVQGLGITGIRVSV--DKDSGRFG----FTLTRGGGWFDMLVNG--GGAVTLQFQRSPYRPQTRT 73 (141)
Q Consensus 4 d~~r~~~IrG~V~D~~GePL~GVsV~V--kgt~~~~G----~TiTD~DG~FsL~V~~--g~~~tL~FSyiGY~sqe~~ 73 (141)
|.....+|+-+|.|.+|.||+|+.|.+ ......+. ...||.+|.+.+.+.+ .+..++..+.-|.......
T Consensus 20 ~g~~~~tltatV~D~~gnpv~g~~V~f~~~~~~~~l~~~~~~~~Td~~G~a~~tltst~aG~~~VtA~~~~~~~~~~~ 97 (100)
T PF02369_consen 20 DGSDTNTLTATVTDANGNPVPGQPVTFSSSSSGGTLSPTNTSATTDSNGIATVTLTSTKAGTYTVTATVDGGSTSVTS 97 (100)
T ss_dssp SSSS-EEEEEEEEETTSEB-TS-EEEE--EESSSEES-CEE-EEE-TTSEEEEEEE-SS-EEEEEEEEETTEEEEEEE
T ss_pred CCcCcEEEEEEEEcCCCCCCCCCEEEEEEcCCCcEEecCccccEECCCEEEEEEEEecCceEEEEEEEECCcceeEEe
Confidence 456678899999999999999999999 32221111 2589999999998753 2334566666665544433
No 27
>TIGR02465 chlorocat_1_2 chlorocatechol 1,2-dioxygenase. Members of this protein family are chlorocatechol 1,2-dioxygenase. This protein is closely related to catechol 1,2-dioxygenase, TIGR02439, EC 1.13.11.1. Note that annotated database entries have appeared for the present protein family with the EC number that refers to that of family TIGR02439. This protein acts in pathways of the biodegradation of chlorinated aromatic compounds.
Probab=97.08 E-value=0.0064 Score=51.03 Aligned_cols=77 Identities=18% Similarity=0.276 Sum_probs=54.1
Q ss_pred cceEEEEEEEECCCCCeeeeEEEEeccCC---------------CeeeceEecCCceEEEE-ecCC--------------
Q psy4833 6 LRVAVVRGQVLSVQGLGITGIRVSVDKDS---------------GRFGFTLTRGGGWFDML-VNGG-------------- 55 (141)
Q Consensus 6 ~r~~~IrG~V~D~~GePL~GVsV~Vkgt~---------------~~~G~TiTD~DG~FsL~-V~~g-------------- 55 (141)
.-...|+|+|+|.+|+||+||.|-|=... +..|...||+||+|.+. +.++
T Consensus 96 G~~l~v~G~V~D~~G~Pv~~A~VeiWqad~~G~Y~~~~~~~~~~~lRG~~~Td~~G~y~F~Ti~P~~YpiP~dgp~g~lL 175 (246)
T TIGR02465 96 HKPLLIRGTVRDLSGTPVAGAVIDVWHSTPDGKYSGFHDNIPDDYYRGKLVTAADGSYEVRTTMPVPYQIPDAGPTGALL 175 (246)
T ss_pred CcEEEEEEEEEcCCCCCcCCcEEEEECCCCCCCCCCCCCCCCCCCCeEEEEECCCCCEEEEEECCCCCCCCCCCchHHHH
Confidence 45689999999999999999999882110 01266889999999996 3222
Q ss_pred ----------CcEEEEEEecCccceEEEEEccCccEE
Q psy4833 56 ----------GAVTLQFQRSPYRPQTRTVFIPWNQIV 82 (141)
Q Consensus 56 ----------~~~tL~FSyiGY~sqe~~V~~~~n~~~ 82 (141)
..+-+.++.-||+...-+++.+....+
T Consensus 176 ~~~grh~~RpaHIH~~V~a~G~~~L~Tqiyf~~d~~l 212 (246)
T TIGR02465 176 ETMGRHSWRPAHVHYKVRADGYRPLTTQAYFEGGPYI 212 (246)
T ss_pred HhcccCCCCCCeEEEEEECCCcCceEEeEecCCCccc
Confidence 234566677788766666666644433
No 28
>TIGR02422 protocat_beta protocatechuate 3,4-dioxygenase, beta subunit. This model represents the beta chain of protocatechuate 3,4-dioxygenase. The most closely related family outside this family is that of the alpha chain (TIGR02423), typically encoded in an adjacent locus. This enzyme acts in the degradation of aromatic compounds by way of p-hydroxybenzoate to succinate and acetyl-CoA.
Probab=97.07 E-value=0.006 Score=50.41 Aligned_cols=75 Identities=17% Similarity=0.343 Sum_probs=50.5
Q ss_pred ceEEEEEEEECCCCCeeeeEEEEeccC------------------CC--eeeceEecCCceEEEE-ecCCC---------
Q psy4833 7 RVAVVRGQVLSVQGLGITGIRVSVDKD------------------SG--RFGFTLTRGGGWFDML-VNGGG--------- 56 (141)
Q Consensus 7 r~~~IrG~V~D~~GePL~GVsV~Vkgt------------------~~--~~G~TiTD~DG~FsL~-V~~g~--------- 56 (141)
..+.|+|+|+|.+|+||+||.|.|=.. ++ .+|...||.||+|.+. +..+.
T Consensus 59 ~~i~l~G~V~D~~g~PV~~A~VEIWQada~G~Y~~~~d~~~~~~~~~f~grGr~~TD~~G~y~F~TI~PG~Y~~p~~~~~ 138 (220)
T TIGR02422 59 ERIIVHGRVLDEDGRPVPNTLVEVWQANAAGRYRHKNDQYLAPLDPNFGGVGRTLTDSDGYYRFRTIKPGPYPWGNHHNA 138 (220)
T ss_pred CEEEEEEEEECCCCCCCCCCEEEEEecCCCCcccCccCccccccCCCCCCEEEEEECCCccEEEEEECCCCccCCCCCCC
Confidence 458999999999999999999987211 00 1367889999999996 33332
Q ss_pred ----cEEEEEEecCc-cceEEEEEccCccE
Q psy4833 57 ----AVTLQFQRSPY-RPQTRTVFIPWNQI 81 (141)
Q Consensus 57 ----~~tL~FSyiGY-~sqe~~V~~~~n~~ 81 (141)
.+-+.+..-|| ....-+++.+....
T Consensus 139 ~R~pHIH~~V~~~g~~~~L~TqlYF~dd~~ 168 (220)
T TIGR02422 139 WRPAHIHFSLFGTSFAQRLITQMYFEGDPL 168 (220)
T ss_pred CcCCeEEEEEECCCCccceEEeEECCCCcc
Confidence 22344455666 35555566665443
No 29
>cd03460 1,2-CTD Catechol 1,2 dioxygenase (1,2-CTD) catalyzes an intradiol cleavage reaction of catechol to form cis,cis-muconate. 1,2-CTDs is homodimers with one catalytic non-heme ferric ion per monomer. They belong to the aromatic dioxygenase family, a family of mononuclear non-heme iron intradiol-cleaving enzymes that catalyze the oxygenation of catecholates to aliphatic acids via the cleavage of aromatic rings.
Probab=97.02 E-value=0.0076 Score=51.55 Aligned_cols=79 Identities=16% Similarity=0.177 Sum_probs=55.2
Q ss_pred CcceEEEEEEEECCCCCeeeeEEEEeccC-------------C--CeeeceEecCCceEEEE-ecCC-------------
Q psy4833 5 ELRVAVVRGQVLSVQGLGITGIRVSVDKD-------------S--GRFGFTLTRGGGWFDML-VNGG------------- 55 (141)
Q Consensus 5 ~~r~~~IrG~V~D~~GePL~GVsV~Vkgt-------------~--~~~G~TiTD~DG~FsL~-V~~g------------- 55 (141)
+.....|+|+|+|.+|+||+||.|-|=.. + +..|...||++|+|.+. +.++
T Consensus 121 ~Gepl~l~G~V~D~~G~PI~~A~VeiWqad~~G~Ys~~~~~~~~f~~RGr~~TD~~G~y~F~TI~P~~YpiP~dGp~g~l 200 (282)
T cd03460 121 DGETLVMHGTVTDTDGKPVPGAKVEVWHANSKGFYSHFDPTQSPFNLRRSIITDADGRYRFRSIMPSGYGVPPGGPTQQL 200 (282)
T ss_pred CCCEEEEEEEEECCCCCCcCCcEEEEECCCCCCCcCCCCCCCCCCCCceEEEeCCCCCEEEEEECCCCCcCCCCCcHHHH
Confidence 35678999999999999999999988211 0 02266899999999996 3222
Q ss_pred -----------CcEEEEEEecCccceEEEEEccCccEEE
Q psy4833 56 -----------GAVTLQFQRSPYRPQTRTVFIPWNQIVV 83 (141)
Q Consensus 56 -----------~~~tL~FSyiGY~sqe~~V~~~~n~~~~ 83 (141)
..+-+.++.-||+...-+++.+...++.
T Consensus 201 L~~~grh~~RpaHIHf~V~a~G~~~LtTqlyf~gd~~~~ 239 (282)
T cd03460 201 LNALGRHGNRPAHIHFFVSAPGHRKLTTQINIEGDPYIW 239 (282)
T ss_pred HHhhcCCCCCCCeEEEEEecCCcCceEEEEecCCCCccc
Confidence 2244666777777766667766554443
No 30
>cd03464 3,4-PCD_beta Protocatechuate 3,4-dioxygenase (3,4-PCD) , beta subunit. 3,4-PCD catalyzes the oxidative ring cleavage of 3,4-dihydroxybenzoate to produce beta-carboxy-cis,cis-muconate. 3,4-PCDs are large aggregates of 12 protomers, each composed of an alpha- and beta-subunit and an Fe3+ ion bound in the beta-subunit at the alpha-subunit-beta-subunit interface. 3,4-PCD is a member of the aromatic dioxygenases which are non-heme iron intradiol-cleaving enzymes that break the C1-C2 bond and utilize Fe3+.
Probab=96.95 E-value=0.01 Score=49.03 Aligned_cols=75 Identities=17% Similarity=0.331 Sum_probs=50.3
Q ss_pred cceEEEEEEEECCCCCeeeeEEEEeccC------------------CC--eeeceEecCCceEEEE-ecCCC--------
Q psy4833 6 LRVAVVRGQVLSVQGLGITGIRVSVDKD------------------SG--RFGFTLTRGGGWFDML-VNGGG-------- 56 (141)
Q Consensus 6 ~r~~~IrG~V~D~~GePL~GVsV~Vkgt------------------~~--~~G~TiTD~DG~FsL~-V~~g~-------- 56 (141)
...+.|+|+|+|.+|+||+||.|.|=.. ++ ..|...||+||+|.+. +..+.
T Consensus 63 G~~i~l~G~V~D~~G~PV~~A~VEIWQad~~G~Y~~~~d~~~~~~~~~f~grGr~~TD~~G~y~F~TI~Pg~Yp~p~~r~ 142 (220)
T cd03464 63 GERIIVHGRVLDEDGRPVPNTLVEIWQANAAGRYRHKRDQHDAPLDPNFGGAGRTLTDDDGYYRFRTIKPGAYPWGNHPN 142 (220)
T ss_pred CCEEEEEEEEECCCCCCCCCCEEEEEecCCCCcccCccCCcccccCCCCCCEEEEEECCCccEEEEEECCCCccCCCCCC
Confidence 4458999999999999999999987211 00 1356799999999996 33332
Q ss_pred -----cEEEEEEecCcc-ceEEEEEccCcc
Q psy4833 57 -----AVTLQFQRSPYR-PQTRTVFIPWNQ 80 (141)
Q Consensus 57 -----~~tL~FSyiGY~-sqe~~V~~~~n~ 80 (141)
.+-+.++.-||. ...-+++.+...
T Consensus 143 ~~RppHIH~~V~a~G~~~~L~TqlYF~dd~ 172 (220)
T cd03464 143 AWRPAHIHFSLFGPSFATRLVTQMYFPGDP 172 (220)
T ss_pred CCcCCeEEEEEECCCccCceEEeEECCCCc
Confidence 223444566765 455556666444
No 31
>cd03463 3,4-PCD_alpha Protocatechuate 3,4-dioxygenase (3,4-PCD) , alpha subunit. 3,4-PCD catalyzes the oxidative ring cleavage of 3,4-dihydroxybenzoate to produce beta-carboxy-cis,cis-muconate. 3,4-PCDs are large aggregates of 12 protomers, each composed of an alpha- and beta-subunit and an Fe3+ ion bound in the beta-subunit at the alpha-subunit-beta-subunit interface. 3,4-PCD is a member of the aromatic dioxygenases which are non-heme iron intradiol-cleaving enzymes that break the C1-C2 bond and utilize Fe3+.
Probab=96.95 E-value=0.011 Score=47.64 Aligned_cols=73 Identities=16% Similarity=0.263 Sum_probs=51.2
Q ss_pred cceEEEEEEEECCCCCeeeeEEEEeccC-----------------CC--eeeceEecCCceEEEE-ecCCC---------
Q psy4833 6 LRVAVVRGQVLSVQGLGITGIRVSVDKD-----------------SG--RFGFTLTRGGGWFDML-VNGGG--------- 56 (141)
Q Consensus 6 ~r~~~IrG~V~D~~GePL~GVsV~Vkgt-----------------~~--~~G~TiTD~DG~FsL~-V~~g~--------- 56 (141)
.....|+|+|+|.+|+||+||.|-|=.. ++ ..|...||+||+|.+. +..+.
T Consensus 34 G~~l~l~G~V~D~~g~Pi~gA~VeiWqad~~G~Y~~~~~~~~~~~~~f~~rGr~~TD~~G~y~F~Ti~Pg~Y~~~~g~~R 113 (185)
T cd03463 34 GERITLEGRVYDGDGAPVPDAMLEIWQADAAGRYAHPADSRRRLDPGFRGFGRVATDADGRFSFTTVKPGAVPGRDGAGQ 113 (185)
T ss_pred CCEEEEEEEEECCCCCCCCCCEEEEEcCCCCCccCCcCCcccccCCCCCcEEEEEECCCCCEEEEEEcCCCcCCCCCCCc
Confidence 4678999999999999999999987211 10 1145789999999996 32221
Q ss_pred --cEEEEEEecCc-cceEEEEEccC
Q psy4833 57 --AVTLQFQRSPY-RPQTRTVFIPW 78 (141)
Q Consensus 57 --~~tL~FSyiGY-~sqe~~V~~~~ 78 (141)
.+-+.+..-|| +...-+++.+.
T Consensus 114 ~~HIH~~V~~~g~~~~L~Tqlyf~~ 138 (185)
T cd03463 114 APHINVWVFARGLLKHLFTRIYFPD 138 (185)
T ss_pred CCeEEEEEECCCcccceEEeEecCC
Confidence 13356666777 66666777765
No 32
>smart00634 BID_1 Bacterial Ig-like domain (group 1).
Probab=96.86 E-value=0.01 Score=41.47 Aligned_cols=69 Identities=16% Similarity=0.197 Sum_probs=48.9
Q ss_pred CCcceEEEEEEEECCCCCeeeeEEEEeccCCCe-----eeceEecCCceEEEEecC--CCcEEEEEEecCccceEE
Q psy4833 4 DELRVAVVRGQVLSVQGLGITGIRVSVDKDSGR-----FGFTLTRGGGWFDMLVNG--GGAVTLQFQRSPYRPQTR 72 (141)
Q Consensus 4 d~~r~~~IrG~V~D~~GePL~GVsV~Vkgt~~~-----~G~TiTD~DG~FsL~V~~--g~~~tL~FSyiGY~sqe~ 72 (141)
|......|+=+|+|++|.|++|+.|.+.-.... -+...||.+|+..+.+.+ .+.+++..+.-|..+++.
T Consensus 15 dg~d~~~i~v~v~D~~Gnpv~~~~V~f~~~~~~~~~~~~~~~~Td~~G~a~~~l~~~~~G~~~vta~~~~~~~~~~ 90 (92)
T smart00634 15 NGSDAITLTATVTDANGNPVAGQEVTFTTPSGGALTLSKGTATTDANGIATVTLTSTTAGVYTVTASLENGSSSVK 90 (92)
T ss_pred cCcccEEEEEEEECCCCCCcCCCEEEEEECCCceeeccCCeeeeCCCCEEEEEEECCCCcEEEEEEEECCCcceee
Confidence 445677899999999999999988887633200 023579999999998753 345567777777665543
No 33
>PF10670 DUF4198: Domain of unknown function (DUF4198)
Probab=96.52 E-value=0.012 Score=45.31 Aligned_cols=54 Identities=19% Similarity=0.133 Sum_probs=39.3
Q ss_pred EEEEEEECCCCCeeeeEEEEeccCCCe------eeceEecCCceEEEEecCCCcEEEEEEe
Q psy4833 10 VVRGQVLSVQGLGITGIRVSVDKDSGR------FGFTLTRGGGWFDMLVNGGGAVTLQFQR 64 (141)
Q Consensus 10 ~IrG~V~D~~GePL~GVsV~Vkgt~~~------~G~TiTD~DG~FsL~V~~g~~~tL~FSy 64 (141)
.++.+|+ -+|+||+|+.|.+-..... .....||++|++.|.++..+...|..++
T Consensus 152 ~~~~~vl-~~GkPl~~a~V~~~~~~~~~~~~~~~~~~~TD~~G~~~~~~~~~G~wli~a~~ 211 (215)
T PF10670_consen 152 PLPFQVL-FDGKPLAGAEVEAFSPGGWYDVEHEAKTLKTDANGRATFTLPRPGLWLIRASH 211 (215)
T ss_pred EEEEEEE-ECCeEcccEEEEEEECCCccccccceEEEEECCCCEEEEecCCCEEEEEEEEE
Confidence 4677887 7899999999998544210 1238899999999998755554565554
No 34
>COG3485 PcaH Protocatechuate 3,4-dioxygenase beta subunit [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=96.49 E-value=0.0064 Score=50.60 Aligned_cols=73 Identities=21% Similarity=0.311 Sum_probs=49.1
Q ss_pred ceEEEEEEEECCCCCeeeeEEEEec-----------cCC--------CeeeceEecCCceEEEE-ecCC------C----
Q psy4833 7 RVAVVRGQVLSVQGLGITGIRVSVD-----------KDS--------GRFGFTLTRGGGWFDML-VNGG------G---- 56 (141)
Q Consensus 7 r~~~IrG~V~D~~GePL~GVsV~Vk-----------gt~--------~~~G~TiTD~DG~FsL~-V~~g------~---- 56 (141)
..+.|+|+|+|.+|.||+||-|-|= ..+ +..|.++||.+|+|.+. +.++ .
T Consensus 71 e~i~l~G~VlD~~G~Pv~~A~VEiWQAda~GrY~~~~d~~~~~~~~f~g~Gr~~Td~~G~y~F~Ti~Pg~yp~~~~~~~~ 150 (226)
T COG3485 71 ERILLEGRVLDGNGRPVPDALVEIWQADADGRYSHPKDSRLAPLPNFNGRGRTITDEDGEYRFRTIKPGPYPWRNGGPMW 150 (226)
T ss_pred ceEEEEEEEECCCCCCCCCCEEEEEEcCCCCcccCccccccCcCccccceEEEEeCCCceEEEEEeecccccCCCCCCcC
Confidence 5789999999999999999999871 110 01267999999999996 2222 1
Q ss_pred ---cEEEEEEecCccc-eEEEEEccCc
Q psy4833 57 ---AVTLQFQRSPYRP-QTRTVFIPWN 79 (141)
Q Consensus 57 ---~~tL~FSyiGY~s-qe~~V~~~~n 79 (141)
.+.+.++--|+.. .+-.++++..
T Consensus 151 R~aHih~~v~~~g~~~~L~TqlYf~dd 177 (226)
T COG3485 151 RPAHIHFSVFARGINTRLVTQLYFPDD 177 (226)
T ss_pred ccceeEEEEeCCcccceEEEEEecCCC
Confidence 3456666666633 3444555544
No 35
>KOG1948|consensus
Probab=96.39 E-value=0.014 Score=57.21 Aligned_cols=64 Identities=25% Similarity=0.367 Sum_probs=45.4
Q ss_pred EEEEEEEECC-CCCeeeeEEEEeccCCCeeeceEecCCceEEEEe-cCCCcEEEEEE--ecCccceEEEEEc
Q psy4833 9 AVVRGQVLSV-QGLGITGIRVSVDKDSGRFGFTLTRGGGWFDMLV-NGGGAVTLQFQ--RSPYRPQTRTVFI 76 (141)
Q Consensus 9 ~~IrG~V~D~-~GePL~GVsV~Vkgt~~~~G~TiTD~DG~FsL~V-~~g~~~tL~FS--yiGY~sqe~~V~~ 76 (141)
..|+|+|++. +|.||.||-|.|.|.. + .-||++|.|.|.- -..+.++|... ..-|.+...++..
T Consensus 316 fSvtGRVl~g~~g~~l~gvvvlvngk~---~-~kTdaqGyykLen~~t~gtytI~a~kehlqFstv~~kv~p 383 (1165)
T KOG1948|consen 316 FSVTGRVLVGSKGLPLSGVVVLVNGKS---G-GKTDAQGYYKLENLKTDGTYTITAKKEHLQFSTVHAKVKP 383 (1165)
T ss_pred EEeeeeEEeCCCCCCccceEEEEcCcc---c-ceEcccceEEeeeeeccCcEEEEEeccceeeeeEEEEecC
Confidence 3789999996 8999999999999887 4 8999999999974 22333455443 3334444444443
No 36
>PRK15310 fimbrial outer membrane usher protein TcfC; Provisional
Probab=95.44 E-value=0.056 Score=52.65 Aligned_cols=78 Identities=15% Similarity=0.089 Sum_probs=52.2
Q ss_pred eEEEEEEEECCCCCeeeeEEEEeccCCCeeeceEecCCceEEEEecCCCcEEEEEEecCcc-ceEEEEEccCccEEEeCC
Q psy4833 8 VAVVRGQVLSVQGLGITGIRVSVDKDSGRFGFTLTRGGGWFDMLVNGGGAVTLQFQRSPYR-PQTRTVFIPWNQIVVLPP 86 (141)
Q Consensus 8 ~~~IrG~V~D~~GePL~GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~~g~~~tL~FSyiGY~-sqe~~V~~~~n~~~~~~~ 86 (141)
+.++-|++.|++|.||.++. |+++. + +.||+||.|+|.++.... +|-+-.=+=. .=+..|.=..+.+.++.+
T Consensus 785 ~~~yiGr~~~~~G~~l~~a~--IlN~~---~-~~td~~GgF~~d~d~~~~-~L~ll~~~~~~~C~L~v~k~R~~i~~VGd 857 (895)
T PRK15310 785 SRLYVGRVLDKDGRPLLDAQ--PLNYP---F-LSLGPSGRFSLQSEHKES-SLWLLSKNRILRCPMSVHKRRDVMQVVGD 857 (895)
T ss_pred EEEEEEEEECCCCCCcccce--eecCc---c-ceecCCCCEEEEcccCCc-eEEEEeCCcEEEcccchhhhcceEEEEEe
Confidence 34478999999999999987 56676 6 999999999999985543 4533322111 112233333444557788
Q ss_pred eEEEec
Q psy4833 87 VKMQLS 92 (141)
Q Consensus 87 v~m~~~ 92 (141)
|.-.+.
T Consensus 858 V~C~~~ 863 (895)
T PRK15310 858 VRCELS 863 (895)
T ss_pred EEeccC
Confidence 877655
No 37
>PF11974 MG1: Alpha-2-macroglobulin MG1 domain; InterPro: IPR021868 This is the N-terminal MG1 domain from alpha-2-macroglobulin [].
Probab=95.27 E-value=0.024 Score=40.74 Aligned_cols=39 Identities=21% Similarity=0.251 Sum_probs=30.7
Q ss_pred EEC-CCCCeeeeEEEEecc-CCC-eeeceEecCCceEEEEec
Q psy4833 15 VLS-VQGLGITGIRVSVDK-DSG-RFGFTLTRGGGWFDMLVN 53 (141)
Q Consensus 15 V~D-~~GePL~GVsV~Vkg-t~~-~~G~TiTD~DG~FsL~V~ 53 (141)
|.| .+|+|++||.|.+-+ ..+ .++...||++|...|...
T Consensus 19 v~~L~tg~Pv~ga~V~l~~~~~~~~l~~g~TD~~G~a~~~~~ 60 (97)
T PF11974_consen 19 VTSLSTGKPVAGAEVELYDSRNGQVLASGKTDADGFASFDST 60 (97)
T ss_pred EeeCCCCCccCCCEEEEEECCCCcEeeeeeeCCCceEEecCC
Confidence 455 579999999999965 321 346678999999999876
No 38
>PF01190 Pollen_Ole_e_I: Pollen proteins Ole e I like; InterPro: IPR006041 Allergies are hypersensitivity reactions of the immune system to specific substances called allergens (such as pollen, stings, drugs, or food) that, in most people, result in no symptoms. A nomenclature system has been established for antigens (allergens) that cause IgE-mediated atopic allergies in humans [WHO/IUIS Allergen Nomenclature Subcommittee King T.P., Hoffmann D., Loewenstein H., Marsh D.G., Platts-Mills T.A.E., Thomas W. Bull. World Health Organ. 72:797-806(1994)]. This nomenclature system is defined by a designation that is composed of the first three letters of the genus; a space; the first letter of the species name; a space and an arabic number. In the event that two species names have identical designations, they are discriminated from one another by adding one or more letters (as necessary) to each species designation. The allergens in this family include allergens with the following designations: Ole e 1. A number of plant pollen proteins, whose biological function is not yet known, are structurally related []. These proteins are most probably secreted and consist of about 145 residues. There are six cysteines which are conserved in the sequence of these proteins. They seem to be involved in disulphide bonds.
Probab=95.17 E-value=0.044 Score=38.64 Aligned_cols=36 Identities=19% Similarity=0.233 Sum_probs=27.4
Q ss_pred CCeeeeEEEEeccCC-----CeeeceEecCCceEEEEecCC
Q psy4833 20 GLGITGIRVSVDKDS-----GRFGFTLTRGGGWFDMLVNGG 55 (141)
Q Consensus 20 GePL~GVsV~Vkgt~-----~~~G~TiTD~DG~FsL~V~~g 55 (141)
..||+||.|.+.=.. .......||.+|.|.|.++..
T Consensus 18 ~~~l~GA~V~v~C~~~~~~~~~~~~~~Td~~G~F~i~l~~~ 58 (97)
T PF01190_consen 18 AKPLPGAKVSVECKDGNGGVVFSAEAKTDENGYFSIELPSD 58 (97)
T ss_pred CccCCCCEEEEECCCCCCCcEEEEEEEeCCCCEEEEEecCc
Confidence 579999999995111 023558999999999999853
No 39
>cd03457 intradiol_dioxygenase_like Intradiol dioxygenase supgroup. Intradiol dioxygenases catalyze the critical ring-cleavage step in the conversion of catecholate derivatives to citric acid cycle intermediates. They break the catechol C1-C2 bond and utilize Fe3+, as opposed to the extradiol-cleaving enzymes which break the C2-C3 or C1-C6 bond and utilize Fe2+ and Mn+. The family contains catechol 1,2-dioxygenases and protocatechuate 3,4-dioxygenases. The specific function of this subgroup is unknown.
Probab=94.69 E-value=0.1 Score=42.02 Aligned_cols=47 Identities=15% Similarity=-0.053 Sum_probs=35.4
Q ss_pred CcceEEEEEEEECC-CCCeeeeEEEEeccC----------------------CCeeeceEecCCceEEEE
Q psy4833 5 ELRVAVVRGQVLSV-QGLGITGIRVSVDKD----------------------SGRFGFTLTRGGGWFDML 51 (141)
Q Consensus 5 ~~r~~~IrG~V~D~-~GePL~GVsV~Vkgt----------------------~~~~G~TiTD~DG~FsL~ 51 (141)
+--..+|+|+|+|. +++||+||.|-+=.. ....|...||++|+|++.
T Consensus 23 ~G~pl~l~g~V~D~~~c~Pv~~a~VdiWh~da~G~Ys~~~~~~~~~~~~~~~~flRG~~~TD~~G~~~F~ 92 (188)
T cd03457 23 PGVPLTLDLQVVDVATCCPPPNAAVDIWHCDATGVYSGYSAGGGGGEDTDDETFLRGVQPTDADGVVTFT 92 (188)
T ss_pred CCCEEEEEEEEEeCCCCccCCCeEEEEecCCCCCCCCCccCCccccccccCCCcCEEEEEECCCccEEEE
Confidence 34467999999995 699999999987210 002266889999999996
No 40
>COG5266 CbiK ABC-type Co2+ transport system, periplasmic component [Inorganic ion transport and metabolism]
Probab=92.43 E-value=0.22 Score=42.61 Aligned_cols=42 Identities=24% Similarity=0.337 Sum_probs=31.7
Q ss_pred EEEEEEECCCCCeeeeEEEEec-------c----CCC----eeeceEecCCceEEEEec
Q psy4833 10 VVRGQVLSVQGLGITGIRVSVD-------K----DSG----RFGFTLTRGGGWFDMLVN 53 (141)
Q Consensus 10 ~IrG~V~D~~GePL~GVsV~Vk-------g----t~~----~~G~TiTD~DG~FsL~V~ 53 (141)
.++|+|+|. |.||++|+|-+. + +.+ .+- --||.+|.|++...
T Consensus 173 ~f~~~vl~~-GkPv~nA~V~v~~~n~~~~d~~a~~~~~ek~~~~-~~TD~kG~~~fip~ 229 (264)
T COG5266 173 VFRGKVLDN-GKPVPNATVEVEFDNIDTKDNRAKTGNTEKTALV-QFTDDKGEVSFIPL 229 (264)
T ss_pred eEEEEEEEC-CccCCCcEEEEEEecccccccccccCCCCCcceE-EEcCCCceEEEEEc
Confidence 689999998 999999999887 1 110 112 46899999999744
No 41
>PF12985 DUF3869: Domain of unknown function (DUF3869); InterPro: IPR024620 This functionally uncharacterised domain is found in bacterial sequences, including the N-terminal region of a putative pore-forming toxin (YP_001301288.1) from Bacteroides vulgatus ATCC 8482.; PDB: 3G3L_A 3KOG_A.
Probab=92.22 E-value=0.75 Score=34.52 Aligned_cols=56 Identities=23% Similarity=0.323 Sum_probs=39.5
Q ss_pred ceEEEEEEEECC-CCCeee-eEEEEeccCCCeeeceEecCCceEEEEecCCCcEEEEEEecCccceEEEEEcc
Q psy4833 7 RVAVVRGQVLSV-QGLGIT-GIRVSVDKDSGRFGFTLTRGGGWFDMLVNGGGAVTLQFQRSPYRPQTRTVFIP 77 (141)
Q Consensus 7 r~~~IrG~V~D~-~GePL~-GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~~g~~~tL~FSyiGY~sqe~~V~~~ 77 (141)
-..+|.|.|+|. +|++|. |+.+.+.+.. ||.+.-.. ..+.-|.-||.+.+++|.++
T Consensus 20 A~~tI~~tV~da~tG~~vt~~a~vti~~~~----------~~~~A~~a-----~~vtas~~~y~~~~~tV~vp 77 (104)
T PF12985_consen 20 AKYTIAGTVYDAETGEDVTTTATVTISAGS----------DGTLAAKA-----VTVTASKDGYMTGSVTVNVP 77 (104)
T ss_dssp -EEEEEEEEEETTTTEE-CGCSEEEETTCC----------CCCE--EC-----CEEEEEECTCEEEEEEEEE-
T ss_pred CEEEEEEEEEecCCCCeeeccceEEEccCC----------Cccccccc-----EEEEEEccCCeeeEEEEEeh
Confidence 356999999996 899998 8888886443 55555322 23555899999999999988
No 42
>PF14686 fn3_3: Polysaccharide lyase family 4, domain II; PDB: 1NKG_A 2XHN_B 3NJX_A 3NJV_A.
Probab=91.46 E-value=1.9 Score=31.06 Aligned_cols=49 Identities=22% Similarity=0.310 Sum_probs=23.9
Q ss_pred EEEEEEECCCC---Cee-eeEEEEec-------cCCCeeeceEecCCceEEEE-ecCCCcEEE
Q psy4833 10 VVRGQVLSVQG---LGI-TGIRVSVD-------KDSGRFGFTLTRGGGWFDML-VNGGGAVTL 60 (141)
Q Consensus 10 ~IrG~V~D~~G---ePL-~GVsV~Vk-------gt~~~~G~TiTD~DG~FsL~-V~~g~~~tL 60 (141)
.|+|+|...+| .|. ..+.|-+. +..-.| .+-||.+|+|+|. |.+|. ..|
T Consensus 4 ~VsG~l~l~dg~~~~~~~~~~~Vgl~~~~d~~q~~~yqY-wt~td~~G~Fti~~V~pGt-Y~L 64 (95)
T PF14686_consen 4 SVSGRLTLSDGVTNPPAGANAVVGLAPPGDFQQNKGYQY-WTRTDSDGNFTIPNVRPGT-YRL 64 (95)
T ss_dssp EEEEEEE---SS--TT--S-EEEEEE--------SS-EE-EEE--TTSEEE---B-SEE-EEE
T ss_pred EEEEEEEEccCcccCccceeEEEEeeeccccccCCCCcE-EEEeCCCCcEEeCCeeCcE-eEE
Confidence 68999987666 333 34555554 332122 3889999999997 55564 444
No 43
>PF08308 PEGA: PEGA domain; InterPro: IPR013229 This domain is found in both archaea and bacteria and has similarity to S-layer (surface layer) proteins. It is named after the characteristic PEGA sequence motif found in this domain. The secondary structure of this domain is predicted to be beta-strands.
Probab=90.81 E-value=0.99 Score=29.61 Aligned_cols=52 Identities=23% Similarity=0.337 Sum_probs=38.8
Q ss_pred eeEEEEeccCCCeeeceEecCCceEEEE-ecCCCcEEEEEEecCccceEEEEEccCccEEEe
Q psy4833 24 TGIRVSVDKDSGRFGFTLTRGGGWFDML-VNGGGAVTLQFQRSPYRPQTRTVFIPWNQIVVL 84 (141)
Q Consensus 24 ~GVsV~Vkgt~~~~G~TiTD~DG~FsL~-V~~g~~~tL~FSyiGY~sqe~~V~~~~n~~~~~ 84 (141)
.||.|.+.|.. .| ..-..+. ++.|. .+|++..-||++.+..+.+..++...+
T Consensus 11 ~gA~V~vdg~~--~G------~tp~~~~~l~~G~-~~v~v~~~Gy~~~~~~v~v~~~~~~~v 63 (71)
T PF08308_consen 11 SGAEVYVDGKY--IG------TTPLTLKDLPPGE-HTVTVEKPGYEPYTKTVTVKPGETTTV 63 (71)
T ss_pred CCCEEEECCEE--ec------cCcceeeecCCcc-EEEEEEECCCeeEEEEEEECCCCEEEE
Confidence 48899998865 35 2233555 66564 589999999999999999997776654
No 44
>TIGR02962 hdxy_isourate hydroxyisourate hydrolase. Members of this family, hydroxyisourate hydrolase, represent a distinct clade of transthyretin-related proteins. Bacterial members typically are encoded next to ureidoglycolate hydrolase and often near either xanthine dehydrogenase or xanthine/uracil permease genes and have been demonstrated to have hydroxyisourate hydrolase activity. In eukaryotes, a clade separate from the transthyretins (a family of thyroid-hormone binding proteins) has also been shown to have HIU hydrolase activity in urate catabolizing organisms. Transthyretin, then, would appear to be the recently diverged paralog of the more ancient HIUH family.
Probab=90.53 E-value=0.43 Score=35.68 Aligned_cols=59 Identities=22% Similarity=0.351 Sum_probs=40.2
Q ss_pred EEEEEECC-CCCeeeeEEEEeccCCC----eeeceEecCCceEEEEec-----CCCcEEEEEEecCccc
Q psy4833 11 VRGQVLSV-QGLGITGIRVSVDKDSG----RFGFTLTRGGGWFDMLVN-----GGGAVTLQFQRSPYRP 69 (141)
Q Consensus 11 IrG~V~D~-~GePL~GVsV~Vkgt~~----~~G~TiTD~DG~FsL~V~-----~g~~~tL~FSyiGY~s 69 (141)
|+=.|+|. .|.|-+||.|.+..-.+ .++...||+||+..-... ..+...|+|.--.|=.
T Consensus 3 lstHVLDt~~G~PAagv~V~L~~~~~~~~~~i~~~~Tn~DGR~~~~l~~~~~~~~G~Y~l~F~~g~Yf~ 71 (112)
T TIGR02962 3 LSTHVLDTTSGKPAAGVPVTLYRLDGSGWTPLAEGVTNADGRCPDLLPEGETLAAGIYKLRFDTGDYFA 71 (112)
T ss_pred ceEEEEeCCCCccCCCCEEEEEEecCCCeEEEEEEEECCCCCCcCcccCcccCCCeeEEEEEEhhhhhh
Confidence 56679995 79999999999964221 234588999999973111 1245678887655543
No 45
>PF01060 DUF290: Transthyretin-like family; InterPro: IPR001534 This new apparently nematode-specific protein family has been called family 2 []. The proteins show weak similarity to transthyretin (formerly called prealbumin) which transports thyroid hormones. The specific function of this protein is unknown.; GO: 0005615 extracellular space
Probab=90.07 E-value=0.56 Score=32.31 Aligned_cols=43 Identities=21% Similarity=0.281 Sum_probs=30.8
Q ss_pred EEEEECCCCCeeeeEEEEeccCCC-----eeeceEecCCceEEEEecCC
Q psy4833 12 RGQVLSVQGLGITGIRVSVDKDSG-----RFGFTLTRGGGWFDMLVNGG 55 (141)
Q Consensus 12 rG~V~D~~GePL~GVsV~Vkgt~~-----~~G~TiTD~DG~FsL~V~~g 55 (141)
+|+++= +|+|+.+|.|++-.... .++.+.||.+|+|.|.-...
T Consensus 1 ~G~L~C-~~~P~~~~~V~L~e~d~~~~Ddll~~~~Td~~G~F~l~G~~~ 48 (80)
T PF01060_consen 1 KGQLMC-GGKPAKNVKVKLWEDDYFDPDDLLDETKTDSDGNFELSGSTN 48 (80)
T ss_pred CeEEEe-CCccCCCCEEEEEECCCCCCCceeEEEEECCCceEEEEEEcc
Confidence 355542 48999999998843321 34669999999999996533
No 46
>PF05738 Cna_B: Cna protein B-type domain; InterPro: IPR008454 This entry represents a repeated B region domain found in the collagen-binding surface protein Cna in Staphylococcus aureus, as well as other related domains. The B region domain of Cna has a prealbumin-like beta-sandwich fold of seven strands in two sheets with a Greek key topology []. However, this domain does not mediate collagen binding, the IPR008456 from INTERPRO region carries out that function; instead it appears to form a stalk that presents the ligand binding domain away from the bacterial cell surface. Cna is a collagen-binding MSCRAMM (Microbial Surface Component Recognizing Adhesive Matrix Molecules), and is necessary and sufficient for S. aureus cells to adhere to cartilage.; PDB: 2X5P_A 3RKP_A 3KPT_A 1VLF_T 1TI2_F 1TI6_D 1TI4_J 1VLE_V 1VLD_X 3PF2_A ....
Probab=89.27 E-value=3 Score=26.86 Aligned_cols=47 Identities=17% Similarity=0.229 Sum_probs=31.8
Q ss_pred eeeEEEEeccCCCee-ec--eEecCCceEEEE-ecCCCcEEEEEEe--cCccce
Q psy4833 23 ITGIRVSVDKDSGRF-GF--TLTRGGGWFDML-VNGGGAVTLQFQR--SPYRPQ 70 (141)
Q Consensus 23 L~GVsV~Vkgt~~~~-G~--TiTD~DG~FsL~-V~~g~~~tL~FSy--iGY~sq 70 (141)
|+||.+.+....+.. .. ..||.||.|.+. ++.|. ..|.... -||...
T Consensus 1 L~Ga~f~L~~~~~~~~~~~~~~Td~~G~~~f~~L~~G~-Y~l~E~~aP~GY~~~ 53 (70)
T PF05738_consen 1 LAGATFELYDEDGNEVIEVTVTTDENGKYTFKNLPPGT-YTLKETKAPDGYQLD 53 (70)
T ss_dssp -STEEEEEEETTSEEEEEEEEEGGTTSEEEEEEEESEE-EEEEEEETTTTEEEE
T ss_pred CCCeEEEEEECCCCEEEEEEEEECCCCEEEEeecCCeE-EEEEEEECCCCCEEC
Confidence 689999997654221 21 579999999997 55565 5677666 677643
No 47
>cd05822 TLP_HIUase HIUase (5-hydroxyisourate hydrolase) catalyzes the second step in a three-step ureide pathway in which 5-hydroxyisourate (HIU), a product of the uricase (urate oxidase) reaction, is hydrolyzed to 2-oxo-4-hydroxy-4-carboxy-5-ureidoimidazoline (OHCU). HIUase has high sequence similarity with transthyretins and is a member of the transthyretin-like protein (TLP) family. HIUase is distinguished from transthyretins by a conserved signature motif at its C-terminus that forms part of the active site. In HIUase, this motif is YRGS, while transthyretins have a conserved TAVV sequence in the same location. Most HIUases are cytosolic but in plants and slime molds, they are peroxisomal based on the presence of N-terminal periplasmic localization sequences. HIUase forms a homotetramer with each subunit consisting of eight beta-strands arranged in two sheets and a short alpha-helix. The central channel of the tetramer contains two independent binding sites, each located betw
Probab=87.74 E-value=1 Score=33.62 Aligned_cols=58 Identities=24% Similarity=0.382 Sum_probs=39.7
Q ss_pred EEEEEECC-CCCeeeeEEEEeccCCC----eeeceEecCCceEEEEecC-----CCcEEEEEEecCcc
Q psy4833 11 VRGQVLSV-QGLGITGIRVSVDKDSG----RFGFTLTRGGGWFDMLVNG-----GGAVTLQFQRSPYR 68 (141)
Q Consensus 11 IrG~V~D~-~GePL~GVsV~Vkgt~~----~~G~TiTD~DG~FsL~V~~-----g~~~tL~FSyiGY~ 68 (141)
|+=.|+|. .|.|-+|+.|.+-.-.. .++...||+||+..-.... .+...|+|-=-.|=
T Consensus 3 lstHVLDt~~G~PAagv~V~L~~~~~~~~~~i~~~~Td~DGR~~~~~~~~~~~~~G~Y~l~F~~~~Yf 70 (112)
T cd05822 3 LSTHVLDTATGKPAAGVAVTLYRLDGNGWTLLATGVTNADGRCDDLLPPGAQLAAGTYKLTFDTGAYF 70 (112)
T ss_pred ceeEEEeCCCCcccCCCEEEEEEecCCCeEEEEEEEECCCCCccCcccccccCCCeeEEEEEEhhhhh
Confidence 45578995 79999999999854321 2466899999999753321 24457887654553
No 48
>KOG1948|consensus
Probab=84.31 E-value=2.3 Score=42.43 Aligned_cols=48 Identities=29% Similarity=0.443 Sum_probs=38.5
Q ss_pred EEEEEEEECCCCCeeeeEEEEeccCCCeeeceEecCCceEEEE-ecCCC
Q psy4833 9 AVVRGQVLSVQGLGITGIRVSVDKDSGRFGFTLTRGGGWFDML-VNGGG 56 (141)
Q Consensus 9 ~~IrG~V~D~~GePL~GVsV~Vkgt~~~~G~TiTD~DG~FsL~-V~~g~ 56 (141)
..|+|+|+-+-|-..+||.|..+........|.|+.+|.|.+. +.+|.
T Consensus 119 Fsv~GkVlgaaggGpagV~velrs~e~~iast~T~~~Gky~f~~iiPG~ 167 (1165)
T KOG1948|consen 119 FSVRGKVLGAAGGGPAGVLVELRSQEDPIASTKTEDGGKYEFRNIIPGK 167 (1165)
T ss_pred eeEeeEEeeccCCCcccceeecccccCcceeeEecCCCeEEEEecCCCc
Confidence 4699999998888999999999876222457999999999997 54453
No 49
>PF10577 UPF0560: Uncharacterised protein family UPF0560; InterPro: IPR018890 This family of proteins has no known function.
Probab=82.40 E-value=11 Score=37.05 Aligned_cols=85 Identities=12% Similarity=0.055 Sum_probs=62.2
Q ss_pred EEEEEEECC-CCCeeeeEEEEeccCCCeeeceEecCCceEEEEec--CCCcEEEEEEecCccceEE-------------E
Q psy4833 10 VVRGQVLSV-QGLGITGIRVSVDKDSGRFGFTLTRGGGWFDMLVN--GGGAVTLQFQRSPYRPQTR-------------T 73 (141)
Q Consensus 10 ~IrG~V~D~-~GePL~GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~--~g~~~tL~FSyiGY~sqe~-------------~ 73 (141)
+|+=+|.|. +=+||++|.|.|-++........|+.||.=-|.++ -|..+++.-+.-||.+..+ +
T Consensus 2 ~lKV~V~d~~t~qpl~~A~V~V~~N~t~~~s~~t~~dG~~~l~~~yrlg~~l~v~a~k~gyv~ns~Pw~~~rlPv~ssvs 81 (807)
T PF10577_consen 2 TLKVQVSDASTRQPLSDASVEVFGNQTLTASGTTGNDGVVLLKFPYRLGTWLIVTASKDGYVPNSVPWKTSRLPVYSSVS 81 (807)
T ss_pred eEEEEEecccCcccCCCCeEEEEeceeEeecceecCCceEEEEEeccCCCeEEEEEecCCccCCCCCcccccccccccee
Confidence 567788886 68999999999976543456689999999999987 4667788889999975422 2
Q ss_pred EEc---cCccEEEeCCeEEEecCC
Q psy4833 74 VFI---PWNQIVVLPPVKMQLSDA 94 (141)
Q Consensus 74 V~~---~~n~~~~~~~v~m~~~d~ 94 (141)
+++ ....+.+.||+++|....
T Consensus 82 L~Llp~r~a~l~lyeD~v~I~~~~ 105 (807)
T PF10577_consen 82 LSLLPERQATLWLYEDVVQITGGS 105 (807)
T ss_pred eeeeccCcceEEEEeeeeeeeccC
Confidence 222 134566679999986644
No 50
>PF12866 DUF3823: Protein of unknown function (DUF3823); InterPro: IPR024278 This is a family of uncharacterised proteins from Bacteroidetes. These proteins have characteristic DN and DR sequence-motifs but their function is not known.; PDB: 3HN5_B 4EIU_A.
Probab=80.55 E-value=10 Score=31.27 Aligned_cols=29 Identities=21% Similarity=0.401 Sum_probs=17.6
Q ss_pred CcCCcceEEEEEEEEC-CCCCeee----eEEEEec
Q psy4833 2 TDDELRVAVVRGQVLS-VQGLGIT----GIRVSVD 31 (141)
Q Consensus 2 ~~d~~r~~~IrG~V~D-~~GePL~----GVsV~Vk 31 (141)
||| +-...++|+|+| .+|||+. ||.|++-
T Consensus 16 NYD-~P~s~l~G~iiD~~tgE~i~~~~~gv~i~l~ 49 (222)
T PF12866_consen 16 NYD-EPDSTLTGRIIDVYTGEPIQTDIGGVRIQLY 49 (222)
T ss_dssp -------EEEEEEEEECCTTEE----STSSEEEEE
T ss_pred CCc-CCCceEEEEEEEeecCCeeeecCCceEEEEE
Confidence 344 456799999999 5899876 6777774
No 51
>cd05469 Transthyretin_like Transthyretin_like. This domain is present in the transthyretin-like protein (TLP) family which includes transthyretin (TTR) and a transthyretin-related protein called 5-hydroxyisourate hydrolase (HIUase). TTR and HIUase are homotetrameric proteins with each subunit consisting of eight beta-strands arranged in two sheets and a short alpha-helix. The central channel of the tetramer contains two independent binding sites, each located between a pair of subunits. TTR transports thyroid hormones and retinol in the blood serum of vertebrates while HIUase catalyzes the second step in a three-step ureide pathway. TTRs are highly conserved and found only in vertebrates while the HIUases are found in a wide range of bacterial, plant, fungal, slime mold and vertebrate organisms.
Probab=78.85 E-value=2.2 Score=32.11 Aligned_cols=57 Identities=19% Similarity=0.323 Sum_probs=38.4
Q ss_pred EEEEEECC-CCCeeeeEEEEeccCC-----CeeeceEecCCceEEEEe-c---CCCcEEEEEEecCc
Q psy4833 11 VRGQVLSV-QGLGITGIRVSVDKDS-----GRFGFTLTRGGGWFDMLV-N---GGGAVTLQFQRSPY 67 (141)
Q Consensus 11 IrG~V~D~-~GePL~GVsV~Vkgt~-----~~~G~TiTD~DG~FsL~V-~---~g~~~tL~FSyiGY 67 (141)
|+=.|+|. .|.|-+|+.|.+.... ..++...||+||+..-.. . ..+...|+|.=-.|
T Consensus 3 lStHVLDt~~G~PAagv~V~L~~~~~~~~w~~l~~~~Tn~DGR~~~~l~~~~~~~G~Y~l~F~t~~Y 69 (113)
T cd05469 3 LMVKVLDAVRGSPAANVAIKVFRKTADGSWEIFATGKTNEDGELHGLITEEEFXAGVYRVEFDTKSY 69 (113)
T ss_pred ceEEEEeCCCCccCCCCEEEEEEecCCCceEEEEEEEECCCCCccCccccccccceEEEEEEehHHh
Confidence 56679995 7999999999994321 124568999999996321 1 12446788764444
No 52
>cd05821 TLP_Transthyretin Transthyretin (TTR) is a 55 kDa protein responsible for the transport of thyroid hormones and retinol in vertebrates. TTR distributes the two thyroid hormones T3 (3,5,3'-triiodo-L-thyronine) and T4 (Thyroxin, or 3,5,3',5'-tetraiodo-L-thyronine), as well as retinol (vitamin A) through the formation of a macromolecular complex that includes each of these as well as retinol-binding protein. Misfolded forms of TTR are implicated in the amyloid diseases familial amyloidotic polyneuropathy and senile systemic amyloidosis. TTR forms a homotetramer with each subunit consisting of eight beta-strands arranged in two sheets and a short alpha-helix. The central channel of the tetramer contains two independent binding sites, each located between a pair of subunits, which differ in their ligand binding affinity. A negative cooperativity has been observed for the binding of T4 and other TTR ligands. A fraction of plasma TTR is carried in high density lipoproteins by bindi
Probab=78.61 E-value=2.7 Score=32.09 Aligned_cols=60 Identities=22% Similarity=0.250 Sum_probs=40.7
Q ss_pred EEEEEEEECC-CCCeeeeEEEEecc-C-C---CeeeceEecCCceEEEEec----CCCcEEEEEEecCcc
Q psy4833 9 AVVRGQVLSV-QGLGITGIRVSVDK-D-S---GRFGFTLTRGGGWFDMLVN----GGGAVTLQFQRSPYR 68 (141)
Q Consensus 9 ~~IrG~V~D~-~GePL~GVsV~Vkg-t-~---~~~G~TiTD~DG~FsL~V~----~g~~~tL~FSyiGY~ 68 (141)
..|+=-|+|. .|.|-.||.|.+.. . . ..++...||+||+..--.. ..+...|+|-=-.|=
T Consensus 7 ~~ittHVLDt~~G~PAaGV~V~L~~~~~~~~w~~l~~~~Tn~DGR~~~ll~~~~~~~G~Y~l~F~tg~Yf 76 (121)
T cd05821 7 CPLMVKVLDAVRGSPAANVAVKVFKKTADGSWEPFASGKTTETGEIHGLTTDEQFTEGVYKVEFDTKAYW 76 (121)
T ss_pred CCcEEEEEECCCCccCCCCEEEEEEecCCCceEEEEEEEECCCCCCCCccCccccCCeeEEEEEehhHhh
Confidence 4567789995 79999999999942 1 1 1346689999999963111 124567888655553
No 53
>PF00576 Transthyretin: HIUase/Transthyretin family; InterPro: IPR023416 This family includes transthyretin that is a thyroid hormone-binding protein that transports thyroxine from the bloodstream to the brain. However, most of the sequences listed in this family do not bind thyroid hormones. They are actually enzymes of the purine catabolism that catalyse the conversion of 5-hydroxyisourate (HIU) to OHCU [, ]. HIU hydrolysis is the original function of the family and is conserved from bacteria to mammals; transthyretins arose by gene duplications in the vertebrate lineage [, ]. HIUases are distinguished in the alignment from the conserved C-terminal YRGS sequence. Transthyretin (formerly prealbumin) is one of 3 thyroid hormone-binding proteins found in the blood of vertebrates []. It is produced in the liver and circulates in the bloodstream, where it binds retinol and thyroxine (T4) []. It differs from the other 2 hormone-binding proteins (T4-binding globulin and albumin) in 3 distinct ways: (1) the gene is expressed at a high rate in the brain choroid plexus; (2) it is enriched in cerebrospinal fluid; and (3) no genetically caused absence has been observed, suggesting an essential role in brain function, distinct from that played in the bloodstream []. The protein consists of around 130 amino acids, which assemble as a homotetramer that contains an internal channel in which T4 is bound. Within this complex, T4 appears to be transported across the blood-brain barrier, where, in the choroid plexus, the hormone stimulates further synthesis of transthyretin. The protein then diffuses back into the bloodstream, where it binds T4 for transport back to the brain [].; PDB: 1TFP_B 1KGJ_D 1IE4_C 1GKE_C 1KGI_D 2H0J_B 2H0E_B 2H0F_B 1ZD6_A 3DGD_D ....
Probab=77.17 E-value=2.6 Score=31.50 Aligned_cols=39 Identities=18% Similarity=0.265 Sum_probs=27.6
Q ss_pred EEEEEECC-CCCeeeeEEEEeccCC-----CeeeceEecCCceEE
Q psy4833 11 VRGQVLSV-QGLGITGIRVSVDKDS-----GRFGFTLTRGGGWFD 49 (141)
Q Consensus 11 IrG~V~D~-~GePL~GVsV~Vkgt~-----~~~G~TiTD~DG~Fs 49 (141)
|+=.|+|. .|.|-.||.|.+-... ..++...||+||+..
T Consensus 3 iStHVLDt~~G~PA~gv~V~L~~~~~~~~~~~l~~~~Td~DGR~~ 47 (112)
T PF00576_consen 3 ISTHVLDTTTGKPAAGVPVTLYRLDSDGSWTLLAEGVTDADGRIK 47 (112)
T ss_dssp EEEEEEETTTTEE-TT-EEEEEEEETTSCEEEEEEEEBETTSEES
T ss_pred cEEEEeeCCCCCCccCCEEEEEEecCCCCcEEEEEEEECCCCccc
Confidence 56679995 7999999999984221 133568999999993
No 54
>smart00095 TR_THY Transthyretin.
Probab=76.42 E-value=4.7 Score=30.75 Aligned_cols=58 Identities=22% Similarity=0.260 Sum_probs=38.9
Q ss_pred EEEEEEECC-CCCeeeeEEEEecc-C-C---CeeeceEecCCceEEEEec----CCCcEEEEEEecCc
Q psy4833 10 VVRGQVLSV-QGLGITGIRVSVDK-D-S---GRFGFTLTRGGGWFDMLVN----GGGAVTLQFQRSPY 67 (141)
Q Consensus 10 ~IrG~V~D~-~GePL~GVsV~Vkg-t-~---~~~G~TiTD~DG~FsL~V~----~g~~~tL~FSyiGY 67 (141)
.|+=.|+|. .|.|-.|+.|.+-. . . ..++...||+||+-.--.+ ..+...|+|-=-.|
T Consensus 5 plTtHVLDt~~G~PAagv~V~L~~~~~~~~w~~la~~~Tn~DGR~~~ll~~~~~~~G~Y~l~F~tg~Y 72 (121)
T smart00095 5 PLMVKVLDAVRGSPAVNVAVKVFKKTEEGTWEPFASGKTNESGEIHELTTDEKFVEGLYKVEFDTKSY 72 (121)
T ss_pred CeEEEEEECCCCccCCCCEEEEEEeCCCCceEEEEEEecCCCccccCccCcccccceEEEEEEehhHh
Confidence 467789995 79999999999932 1 1 1246689999999963111 12445788854444
No 55
>PF03785 Peptidase_C25_C: Peptidase family C25, C terminal ig-like domain; InterPro: IPR005536 This domain is found in almost all members of MEROPS peptidase family C25, (clan CD). Peptidase family C25 is a protein family found in the bacteria Porphyromonas gingivalis (Bacteroides gingivalis) a Gram-negative anaerobic bacterial species strongly associated with adult periodontitis. One of its distinguishing characteristics and putative virulence properties is the ability to agglutinate erythrocytes []. It is a highly proteolytic organism which metabolises small peptides and amino acids. Indirect evidence suggests that the proteases produced by this microorganism constitute an important virulence factor []. Protease-encoding genes have been shown to contain multiple copies of repeated nucleotide sequences. These conserved sequences have also been found in haemagglutinin genes [].; GO: 0008233 peptidase activity, 0006508 proteolysis; PDB: 1CVR_A.
Probab=75.94 E-value=15 Score=26.51 Aligned_cols=51 Identities=20% Similarity=0.330 Sum_probs=30.2
Q ss_pred eeEEEEeccCCCeeeceEecCCceEEEEec----CCCcEEEEEEecCccceEEEEE
Q psy4833 24 TGIRVSVDKDSGRFGFTLTRGGGWFDMLVN----GGGAVTLQFQRSPYRPQTRTVF 75 (141)
Q Consensus 24 ~GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~----~g~~~tL~FSyiGY~sqe~~V~ 75 (141)
+|+.|.+-.+...+|..++| +|++.|..+ +.+.++|.+...-|++...++.
T Consensus 26 ~gs~ValS~dg~l~G~ai~~-sG~ati~l~~~it~~~~~tlTit~~n~~t~i~~i~ 80 (81)
T PF03785_consen 26 PGSYVALSQDGDLYGKAIVN-SGNATINLTNPITDEGTLTLTITAFNYVTYIKTIQ 80 (81)
T ss_dssp TT-EEEEEETTEEEEEEE-B-TTEEEEE-SS--TT-SEEEEEEE-TTB--EEEEEE
T ss_pred CCcEEEEecCCEEEEEEEec-CceEEEECCcccCCCceEEEEEEEEccEEEEEEee
Confidence 56677775443356888999 999999876 3455677777766666555543
No 56
>PF09430 DUF2012: Protein of unknown function (DUF2012); InterPro: IPR019008 This domain is found in different proteins, including uncharacterised protein family UPF0480 and nodal modulators. A nodal modulator has been identified as part of a protein complex that participates in the nodal signaling pathway during vertebrate development [].
Probab=73.14 E-value=23 Score=26.15 Aligned_cols=54 Identities=20% Similarity=0.189 Sum_probs=36.3
Q ss_pred eeeEEEEeccCCCeeeceEecCCceEEEE-ecCCCcEEEEEEecCccceEEEEEcc
Q psy4833 23 ITGIRVSVDKDSGRFGFTLTRGGGWFDML-VNGGGAVTLQFQRSPYRPQTRTVFIP 77 (141)
Q Consensus 23 L~GVsV~Vkgt~~~~G~TiTD~DG~FsL~-V~~g~~~tL~FSyiGY~sqe~~V~~~ 77 (141)
++..+|.+.+.....-.+..+.||.|.+. |+.|. ..|++....|.=....|.+.
T Consensus 7 ~~~t~V~L~~g~~~~~~~~v~~dG~F~f~~Vp~Gs-Y~L~V~s~~~~F~~~RVdV~ 61 (123)
T PF09430_consen 7 PSSTRVTLNGGQYRPISAFVRSDGSFVFHNVPPGS-YLLEVHSPDYVFPPYRVDVS 61 (123)
T ss_pred CCCEEEEEeCCCccceEEEecCCCEEEeCCCCCce-EEEEEECCCccccCEEEEEe
Confidence 45667777655410013888999999997 77664 57999888876544555544
No 57
>PF13953 PapC_C: PapC C-terminal domain; PDB: 3L48_E 2XET_A 3RFZ_E 2KT6_A.
Probab=68.35 E-value=32 Score=22.77 Aligned_cols=47 Identities=15% Similarity=0.097 Sum_probs=25.8
Q ss_pred EEECCCCCeee-eEEEEeccCCCeeeceEecCCceEEEEecCCCcEEEEEEe
Q psy4833 14 QVLSVQGLGIT-GIRVSVDKDSGRFGFTLTRGGGWFDMLVNGGGAVTLQFQR 64 (141)
Q Consensus 14 ~V~D~~GePL~-GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~~g~~~tL~FSy 64 (141)
++.+.+|+||| |+.|...+.. . - .+...+|+--|.-..... +|.+..
T Consensus 2 ~l~~~~G~~lPfGA~v~~~~g~-~-~-g~Vg~~G~vyl~~~~~~~-~L~V~w 49 (68)
T PF13953_consen 2 TLRDADGKPLPFGASVSDEDGN-N-I-GIVGQDGQVYLSGLPPKG-TLTVKW 49 (68)
T ss_dssp EEEETTSEE--TT-EEEETTSS-E-E-EEB-GCGEEEEEEE-TCE-EEEEES
T ss_pred EEEcCCCCcCCCCcEEEcCCCC-E-E-EEEcCCCEEEEECCCCCc-EEEEEE
Confidence 46788999997 7777765444 3 2 466667777776432221 465555
No 58
>COG2351 Transthyretin-like protein [General function prediction only]
Probab=62.81 E-value=18 Score=27.99 Aligned_cols=56 Identities=21% Similarity=0.340 Sum_probs=36.9
Q ss_pred EEEEEEECC-CCCeeeeEEEEec---cCC-CeeeceEecCCceEEEEecCC-----CcEEEEEEec
Q psy4833 10 VVRGQVLSV-QGLGITGIRVSVD---KDS-GRFGFTLTRGGGWFDMLVNGG-----GAVTLQFQRS 65 (141)
Q Consensus 10 ~IrG~V~D~-~GePL~GVsV~Vk---gt~-~~~G~TiTD~DG~FsL~V~~g-----~~~tL~FSyi 65 (141)
.++=-|+|. .|.|-+||.|.+. ++. ..+..+.||.||+-+...-.+ +...|+|--=
T Consensus 10 ~LTTHVLDta~GkPAagv~V~L~rl~~~~~~~l~t~~Tn~DGR~d~pll~g~~~~~G~Y~l~F~~g 75 (124)
T COG2351 10 RLTTHVLDTASGKPAAGVKVELYRLEGNQWELLKTVVTNADGRIDAPLLAGETLATGIYELVFHTG 75 (124)
T ss_pred eeeeeeeecccCCcCCCCEEEEEEecCCcceeeeEEEecCCCcccccccCccccccceEEEEEEcc
Confidence 455678995 7999999999874 221 012458999999999443211 3456777543
No 59
>KOG3006|consensus
Probab=56.99 E-value=65 Score=25.19 Aligned_cols=54 Identities=24% Similarity=0.394 Sum_probs=37.2
Q ss_pred EEEEEEECC-CCCeeeeEEEEec---cC--CCeeeceEecCCceEEEEecC----CCcEEEEEE
Q psy4833 10 VVRGQVLSV-QGLGITGIRVSVD---KD--SGRFGFTLTRGGGWFDMLVNG----GGAVTLQFQ 63 (141)
Q Consensus 10 ~IrG~V~D~-~GePL~GVsV~Vk---gt--~~~~G~TiTD~DG~FsL~V~~----g~~~tL~FS 63 (141)
-|+-.|+|. .|.|-+||.|.+- ++ ...+|..-|+.||+-....+. .+..+|+|-
T Consensus 22 ~itahVLd~s~GsPA~gVqV~~f~~~~~~~w~~igs~~T~~nGrv~~~~~~~tl~~GtYr~~~d 85 (132)
T KOG3006|consen 22 PITAHVLDISRGSPAAGVQVHLFILANDDTWTPIGSGFTQDNGRVDWVSPDFTLIPGTYRLVFD 85 (132)
T ss_pred CcEeEEeecccCCcccceEEEEEEecCCCcccCccccccccCceeecccchhhhccceEEEEEe
Confidence 467789996 6999999998762 22 113577889999998865331 234566663
No 60
>PF14054 DUF4249: Domain of unknown function (DUF4249)
Probab=56.42 E-value=1e+02 Score=24.76 Aligned_cols=66 Identities=11% Similarity=0.205 Sum_probs=40.4
Q ss_pred eeeeEEEEe-ccCCCeeeceEecCC---ceEE----EEecCCCcEEEEEEecCccceEEEEEccCccEEEeCCeEEE
Q psy4833 22 GITGIRVSV-DKDSGRFGFTLTRGG---GWFD----MLVNGGGAVTLQFQRSPYRPQTRTVFIPWNQIVVLPPVKMQ 90 (141)
Q Consensus 22 PL~GVsV~V-kgt~~~~G~TiTD~D---G~Fs----L~V~~g~~~tL~FSyiGY~sqe~~V~~~~n~~~~~~~v~m~ 90 (141)
++.||+|.+ .+.. .......+.+ |.|. +....|...+|.+++-|.+..+-...+|..- -++.|...
T Consensus 59 ~v~~A~V~i~~~~~-~~~~~~~~~~~~~g~Y~~~~~~~~~~G~~Y~L~V~~~~~~~~sa~~~vp~~~--~i~~v~~~ 132 (298)
T PF14054_consen 59 PVSGATVTIYEDGQ-GNEYLFEESSNNDGVYYSSNSFRGRPGRTYRLEVETPGGKTYSAETTVPPPP--PIDSVSYE 132 (298)
T ss_pred ccCCcEEEEEeCCC-cceEeecccCCCcceEEecccccccCCCEEEEEEEECCCCEEEEEEEECCCC--ceeEEEEE
Confidence 389999999 4443 1122333333 8887 3344577789999998887766665555333 33444444
No 61
>PF05688 DUF824: Salmonella repeat of unknown function (DUF824); InterPro: IPR008542 This family consists of a series of repeated sequences (of around 180 residues) which are found in Salmonella typhimurium, Salmonella typhi and Escherichia coli. These repeats are almost always found with this entry. The repeats are associated with RatA and RatB, the coding sequences of which are found in the pathogeneicity island of Salmonella. The sequences may be determinants of pathogenicity [, ].
Probab=52.41 E-value=23 Score=22.89 Aligned_cols=26 Identities=8% Similarity=0.110 Sum_probs=20.9
Q ss_pred eEEEEEEEECCCCCeeeeEEEEeccC
Q psy4833 8 VAVVRGQVLSVQGLGITGIRVSVDKD 33 (141)
Q Consensus 8 ~~~IrG~V~D~~GePL~GVsV~Vkgt 33 (141)
.+.++=+|.|.+|.|++++.+.++..
T Consensus 14 ~I~ltVt~kda~G~pv~n~~f~l~r~ 39 (47)
T PF05688_consen 14 TIPLTVTVKDANGNPVPNAPFTLTRG 39 (47)
T ss_pred eEEEEEEEECCCCCCcCCceEEEEec
Confidence 35566678999999999999888644
No 62
>COG1470 Predicted membrane protein [Function unknown]
Probab=51.26 E-value=85 Score=29.50 Aligned_cols=77 Identities=16% Similarity=0.157 Sum_probs=60.8
Q ss_pred eEEEEEEEECCCCCeeeeEEEEeccCCCeeeceEecCCceEEEEecCCCcEEEEEEecC-ccceEEEEEccCccEEEeCC
Q psy4833 8 VAVVRGQVLSVQGLGITGIRVSVDKDSGRFGFTLTRGGGWFDMLVNGGGAVTLQFQRSP-YRPQTRTVFIPWNQIVVLPP 86 (141)
Q Consensus 8 ~~~IrG~V~D~~GePL~GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~~g~~~tL~FSyiG-Y~sqe~~V~~~~n~~~~~~~ 86 (141)
....+=+++|++|.|++-+-|...+.. --|+.+|+=.+.+.+|.. .+.+..-| |+...+.+.++.+....++-
T Consensus 186 ~a~~~l~~vd~~G~gv~~~~v~~g~e~-----~ets~~g~~~~e~t~g~y-~~~i~~~g~ye~~~~av~l~d~~t~dLkl 259 (513)
T COG1470 186 KAQSKLRLVDDDGAGVPKALVKDGNES-----FETSSKGNLEVEITPGKY-VVLIAKKGIYEKKKRAVKLNDGETKDLKL 259 (513)
T ss_pred cceEEEEEEccCCCccchheeecCcee-----EEeecccceeEEecCcce-EEEeccccceecceEEEEcCCCcccceeE
Confidence 345566889999999999999887554 457888988888887765 58889999 99999999999887766655
Q ss_pred eEEE
Q psy4833 87 VKMQ 90 (141)
Q Consensus 87 v~m~ 90 (141)
....
T Consensus 260 s~~~ 263 (513)
T COG1470 260 SVTE 263 (513)
T ss_pred EEEe
Confidence 5444
No 63
>PRK15314 outer membrane protein RatB; Provisional
Probab=46.70 E-value=49 Score=36.05 Aligned_cols=24 Identities=13% Similarity=0.221 Sum_probs=19.5
Q ss_pred EEEEEEECCCCCeeeeEEEEeccC
Q psy4833 10 VVRGQVLSVQGLGITGIRVSVDKD 33 (141)
Q Consensus 10 ~IrG~V~D~~GePL~GVsV~Vkgt 33 (141)
-++=+|+|..|+|++|+.|.++..
T Consensus 1609 PmtVTvtdsaGnP~~~a~~~l~Rg 1632 (2435)
T PRK15314 1609 AATIDVKDAAGQPMKNVMVKISRG 1632 (2435)
T ss_pred eEEEEEecCCCCCCCCceEEEEec
Confidence 344478999999999999999743
No 64
>PF09244 DUF1964: Domain of unknown function (DUF1964); InterPro: IPR015325 This domain is C-terminal to the catalytic sucrose phosphorylase beta/alpha barrel domain. It adopts a beta-sandwich fold, with Greek-key topology and is functionally uncharacterised []. ; PDB: 1R7A_B 2GDU_A 2GDV_A.
Probab=45.46 E-value=46 Score=23.27 Aligned_cols=27 Identities=15% Similarity=0.218 Sum_probs=20.0
Q ss_pred CCceEEEEecCCCcEEEEEEecCccceEE
Q psy4833 44 GGGWFDMLVNGGGAVTLQFQRSPYRPQTR 72 (141)
Q Consensus 44 ~DG~FsL~V~~g~~~tL~FSyiGY~sqe~ 72 (141)
.||.|+..+.++. .|.|+..|..++.-
T Consensus 2 FdG~FSy~~dgdt--Sitf~W~g~~t~at 28 (68)
T PF09244_consen 2 FDGEFSYEADGDT--SITFTWTGATTSAT 28 (68)
T ss_dssp GGSEEEEEEETTT--EEEEEEE-SS-EEE
T ss_pred ccceeeEecCCCc--EEEEEEeccccEEE
Confidence 4899999998665 48999999976643
No 65
>PF13115 YtkA: YtkA-like
Probab=42.89 E-value=30 Score=23.12 Aligned_cols=22 Identities=18% Similarity=0.214 Sum_probs=17.4
Q ss_pred eEEEEEEEECCCCCeeeeEEEEecc
Q psy4833 8 VAVVRGQVLSVQGLGITGIRVSVDK 32 (141)
Q Consensus 8 ~~~IrG~V~D~~GePL~GVsV~Vkg 32 (141)
.+.|+ +|.+|+|+.|+.|.+.-
T Consensus 23 ~i~v~---~~~~g~pv~~a~V~~~~ 44 (86)
T PF13115_consen 23 TITVT---VDQGGKPVTDADVQFEI 44 (86)
T ss_pred EEEEE---ECCCCCCCCCCEEEEEE
Confidence 34554 78899999999998863
No 66
>PF03717 PBP_dimer: Penicillin-binding Protein dimerisation domain; InterPro: IPR005311 This domain is found at the N terminus of Class B High Molecular Weight Penicillin-Binding Proteins. Its function has not been precisely defined, but is strongly implicated in PBP polymerisation. The domain forms a largely disordered "sugar tongs" structure.; GO: 0008658 penicillin binding; PDB: 3EQU_B 3EQV_A 3UE3_A 3PBR_A 3PBT_A 1QME_A 1RP5_B 2ZC3_A 2Z2M_D 2ZC4_D ....
Probab=42.19 E-value=15 Score=27.72 Aligned_cols=17 Identities=18% Similarity=0.294 Sum_probs=9.3
Q ss_pred EEEEEEECCCCCeeeeE
Q psy4833 10 VVRGQVLSVQGLGITGI 26 (141)
Q Consensus 10 ~IrG~V~D~~GePL~GV 26 (141)
--||.+.|.+|.||++=
T Consensus 4 a~RG~I~DrnG~~La~n 20 (178)
T PF03717_consen 4 APRGNIYDRNGRPLATN 20 (178)
T ss_dssp E----EE-TTS-EEEEE
T ss_pred CCCCCEEeCCCCEeEEe
Confidence 46899999999999873
No 67
>PF11184 DUF2969: Protein of unknown function (DUF2969); InterPro: IPR021351 This family of proteins with unknown function appears to be restricted to Lactobacillales.
Probab=41.34 E-value=93 Score=21.50 Aligned_cols=39 Identities=15% Similarity=0.323 Sum_probs=27.8
Q ss_pred EEECCCCCeeeeEEEEeccCCCeeeceEecCCceEEEEecCC
Q psy4833 14 QVLSVQGLGITGIRVSVDKDSGRFGFTLTRGGGWFDMLVNGG 55 (141)
Q Consensus 14 ~V~D~~GePL~GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~~g 55 (141)
+|.|.+..-.+|-.|.+ |.. ..| .|.-.||+|.+...++
T Consensus 11 ~i~d~~~~~~~~~~l~I-gkk-~IG-~I~e~d~~fav~~~~~ 49 (71)
T PF11184_consen 11 EIKDTKVNGQPGYELFI-GKK-VIG-EIEEDDGRFAVVKNDN 49 (71)
T ss_pred EEEecccCCeEEEEEEE-CCE-EEE-EEEEcCCcEEEEeCCC
Confidence 56676666677778888 665 567 8888899996665533
No 68
>PRK15207 long polar fimbrial outer membrane usher protein LpfC; Provisional
Probab=38.17 E-value=1e+02 Score=30.08 Aligned_cols=48 Identities=10% Similarity=0.031 Sum_probs=27.9
Q ss_pred cCCcceEEEEEEEECCCCCeee-eEEEEeccCCCeeeceEecCCceEEEE
Q psy4833 3 DDELRVAVVRGQVLSVQGLGIT-GIRVSVDKDSGRFGFTLTRGGGWFDML 51 (141)
Q Consensus 3 ~d~~r~~~IrG~V~D~~GePL~-GVsV~Vkgt~~~~G~TiTD~DG~FsL~ 51 (141)
|+-.+...+-=+|.|++|+||| |+.|.+.+.....- .+-..||.==|.
T Consensus 751 F~~~~g~~~ll~l~~~~G~~lP~Ga~V~~~~~~~~~~-g~Vg~~G~vyl~ 799 (842)
T PRK15207 751 FDTRVGYRVLMSLKQANGNAVPFGATATLIDTTKEAS-SIVGEEGQLYIS 799 (842)
T ss_pred EEEeeeeEEEEEEEcCCCCcCCccEEEEecCCCCceE-EEECCCCEEEEE
Confidence 4444444445567889999998 88877643321111 344556764444
No 69
>PF07492 Trehalase_Ca-bi: Neutral trehalase Ca2+ binding domain; InterPro: IPR011120 Neutral trehalases mobilise trehalose accumulated by fungal cells as a protective and storage carbohydrate. This family represents a calcium-binding domain similar to EF hand. Residues 97 and 108 in O42893 from SWISSPROT have been implicated in this interaction. It is thought that this domain may provide a general mechanism for regulating neutral trehalase activity in yeasts and filamentous fungi [].; GO: 0004555 alpha,alpha-trehalase activity, 0005509 calcium ion binding, 0005993 trehalose catabolic process, 0005737 cytoplasm
Probab=37.98 E-value=35 Score=20.46 Aligned_cols=18 Identities=22% Similarity=0.427 Sum_probs=13.5
Q ss_pred cCCceEEEEecCCCcEEE
Q psy4833 43 RGGGWFDMLVNGGGAVTL 60 (141)
Q Consensus 43 D~DG~FsL~V~~g~~~tL 60 (141)
|-||+|.|.+...+..+|
T Consensus 9 DTDgn~qITIeD~GPKv~ 26 (30)
T PF07492_consen 9 DTDGNFQITIEDTGPKVL 26 (30)
T ss_pred ccCCCcEEEEecCCCeEE
Confidence 779999999975555444
No 70
>PF06115 DUF956: Domain of unknown function (DUF956); InterPro: IPR010360 This is a family of bacterial sequences with undetermined function.
Probab=37.46 E-value=68 Score=24.63 Aligned_cols=33 Identities=21% Similarity=0.272 Sum_probs=25.3
Q ss_pred eeeeEEEEeccC--CCeeeceEecCCceEEEEecCCC
Q psy4833 22 GITGIRVSVDKD--SGRFGFTLTRGGGWFDMLVNGGG 56 (141)
Q Consensus 22 PL~GVsV~Vkgt--~~~~G~TiTD~DG~FsL~V~~g~ 56 (141)
-.++|+|..+|. + +|. ..|+.+|.|.+.....+
T Consensus 54 ~~V~a~V~fkgk~I~-RF~-I~Tk~~G~f~Fsskd~k 88 (118)
T PF06115_consen 54 DYVIASVSFKGKWIP-RFA-IFTKKNGKFTFSSKDSK 88 (118)
T ss_pred eEEEEEEEECCCEEe-eEE-EEECCCCEEEEEECChH
Confidence 356889998876 4 555 88999999999986443
No 71
>COG3188 FimD P pilus assembly protein, porin PapC [Cell motility and secretion / Intracellular trafficking and secretion]
Probab=35.14 E-value=1.6e+02 Score=28.88 Aligned_cols=49 Identities=10% Similarity=-0.069 Sum_probs=34.1
Q ss_pred cCCcceEEEEEEEECCCCCeee-eEEEEeccCCCeeeceEecCCceEEEEec
Q psy4833 3 DDELRVAVVRGQVLSVQGLGIT-GIRVSVDKDSGRFGFTLTRGGGWFDMLVN 53 (141)
Q Consensus 3 ~d~~r~~~IrG~V~D~~GePL~-GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~ 53 (141)
|+......+--++.+++|+||| |+.|...+.. ..- .+.+.||+.=|...
T Consensus 747 F~t~~g~~~li~l~~~dG~~lPfGa~v~~~~~~-~~~-g~Vg~~G~vyl~gl 796 (835)
T COG3188 747 FDTRIGYRALITLRRADGSPLPFGAEVTDEGGG-QNV-GIVGDDGQVYLRGL 796 (835)
T ss_pred EeeeceeEEEEEEECCCCCcCCCceEEEECCCc-eEE-EEEecCCeEEEecC
Confidence 4444445555677889999998 7888877653 212 57788999999753
No 72
>PF03983 SHD1: SLA1 homology domain 1, SHD1 ; InterPro: IPR007131 The SLA1 homology domain is found in the cytoskeleton assembly control protein SLA1, which is responsible for the correct formation of the actin cytoskeleton.; GO: 0008092 cytoskeletal protein binding, 0030674 protein binding, bridging, 0042802 identical protein binding, 0043130 ubiquitin binding; PDB: 2HBP_A.
Probab=32.91 E-value=31 Score=24.11 Aligned_cols=43 Identities=14% Similarity=0.051 Sum_probs=23.7
Q ss_pred eEecCCceEEEEecCCCcEEEEEEecCccceEEEEEccCccEEEeCCeEEEec
Q psy4833 40 TLTRGGGWFDMLVNGGGAVTLQFQRSPYRPQTRTVFIPWNQIVVLPPVKMQLS 92 (141)
Q Consensus 40 TiTD~DG~FsL~V~~g~~~tL~FSyiGY~sqe~~V~~~~n~~~~~~~v~m~~~ 92 (141)
+=||.+|.|.+.. +|+|+..-.+.+.-..+..+-+|--.|...
T Consensus 14 tWtD~tG~f~VeA----------~fv~~~dgkV~L~k~nG~~i~VP~~kLS~~ 56 (70)
T PF03983_consen 14 TWTDRTGKFKVEA----------EFVGVNDGKVHLHKTNGVKIAVPLEKLSDE 56 (70)
T ss_dssp EEEBSSS--EEEE----------EEEEEETTEEEEE-TTS-EEEEETTSB-HH
T ss_pred EEEeCCCCEEEEE----------EEEEeeCCEEEEEecCCeEEEeEhHHcCHH
Confidence 7899999998864 355555555655555555555555444443
No 73
>KOG4217|consensus
Probab=32.16 E-value=15 Score=34.65 Aligned_cols=15 Identities=60% Similarity=1.101 Sum_probs=11.5
Q ss_pred CcccccccCCCCCCcceeeecc
Q psy4833 113 GNSDLQHYGPRSSNKTVRHCES 134 (141)
Q Consensus 113 ~~~~~~~~~~~~~~~~~~~~~~ 134 (141)
-+++||||| ||-||.
T Consensus 277 DnAaCqHYG-------vRTCEG 291 (605)
T KOG4217|consen 277 DNAACQHYG-------VRTCEG 291 (605)
T ss_pred ChHHhhhcC-------cccccc
Confidence 468999999 566764
No 74
>PRK15248 fimbrial outer membrane usher protein StbC; Provisional
Probab=31.41 E-value=1.6e+02 Score=28.76 Aligned_cols=47 Identities=13% Similarity=-0.085 Sum_probs=28.6
Q ss_pred CcCCcceEEEEEEEECCCCCeee-eEEEEeccCCCeeeceEecCCceEEEE
Q psy4833 2 TDDELRVAVVRGQVLSVQGLGIT-GIRVSVDKDSGRFGFTLTRGGGWFDML 51 (141)
Q Consensus 2 ~~d~~r~~~IrG~V~D~~GePL~-GVsV~Vkgt~~~~G~TiTD~DG~FsL~ 51 (141)
.|+-.+...+--+|.+++|+||| |+.|...+.. . - .+-..+|.-=|.
T Consensus 762 ~F~~~~g~~~l~~l~~~dG~~lP~Ga~V~~~~g~-~-~-g~Vg~~G~vyl~ 809 (853)
T PRK15248 762 NFATISGKAVLISVKMPDGGIPPMGADVFNGEGT-N-I-GMVGQSGQIYAR 809 (853)
T ss_pred EEEEeeceEEEEEEEcCCCCcCCCceEEECCCCc-E-E-EEEcCCCEEEEE
Confidence 35555666666778899999986 6666543332 2 1 344556665554
No 75
>COG3422 Uncharacterized conserved protein [Function unknown]
Probab=31.06 E-value=58 Score=22.28 Aligned_cols=28 Identities=11% Similarity=0.359 Sum_probs=20.0
Q ss_pred ecCCceEEEEecC-CCcEEEEEEecCccceE
Q psy4833 42 TRGGGWFDMLVNG-GGAVTLQFQRSPYRPQT 71 (141)
Q Consensus 42 TD~DG~FsL~V~~-g~~~tL~FSyiGY~sqe 71 (141)
-|.+|.|.+.... ++ .+.|+--||+++.
T Consensus 9 kdk~Ge~rfrlkA~N~--eiI~~sEgY~ska 37 (59)
T COG3422 9 KDKAGEYRFRLKAANG--EIILTSEGYKSKA 37 (59)
T ss_pred EcCCCcEEEEEEccCc--cEEEeecccchhH
Confidence 4889998887653 33 3667889998764
No 76
>PF14289 DUF4369: Domain of unknown function (DUF4369)
Probab=31.03 E-value=1.7e+02 Score=19.61 Aligned_cols=44 Identities=11% Similarity=0.145 Sum_probs=25.8
Q ss_pred ceEEEEEEEECC-CCCeeeeEEEEeccCCCeeeceEecCCceEEEEec
Q psy4833 7 RVAVVRGQVLSV-QGLGITGIRVSVDKDSGRFGFTLTRGGGWFDMLVN 53 (141)
Q Consensus 7 r~~~IrG~V~D~-~GePL~GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~ 53 (141)
-..+|+|++.+. +|.-+-=... .+.....-.+..+ ||+|++..+
T Consensus 11 ~~~~I~G~i~~~~~~~~vyL~~~--~~~~~~~ds~~v~-nG~F~f~~~ 55 (106)
T PF14289_consen 11 KQFTIEGKIKGLPDGDKVYLYYY--DNGKVVIDSVVVK-NGKFSFKGP 55 (106)
T ss_pred CcEEEEEEEcCCCCCCEEEEEEe--CCCCEEEEEEEEe-CCEEEEEEe
Confidence 468999999887 6655433322 1111011223334 999999865
No 77
>PF13753 SWM_repeat: Putative flagellar system-associated repeat
Probab=30.29 E-value=88 Score=26.01 Aligned_cols=44 Identities=20% Similarity=0.368 Sum_probs=30.6
Q ss_pred cceEEEEEEEEC-CCCCeeeeEEEEeccCCCeeeceEecCCceEEEEecCC
Q psy4833 6 LRVAVVRGQVLS-VQGLGITGIRVSVDKDSGRFGFTLTRGGGWFDMLVNGG 55 (141)
Q Consensus 6 ~r~~~IrG~V~D-~~GePL~GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~~g 55 (141)
....+|+|.|.+ +.|..+ .|.+.|.. +. +..+.+|.|++.+.+.
T Consensus 264 ~~~~t~sg~v~~~~~g~~v---~v~v~g~~--~~-t~v~~~g~~s~~v~~~ 308 (317)
T PF13753_consen 264 NTTVTISGTVTDAEAGDTV---TVTVNGNT--YT-TTVDADGTWSVTVTGA 308 (317)
T ss_pred cccEEEecccCCCCCCCEE---EEEECCEE--EE-EEEeCCCeEEEEcCHH
Confidence 346789999876 357764 46777655 33 5666799999998743
No 78
>COG4594 FecB ABC-type Fe3+-citrate transport system, periplasmic component [Inorganic ion transport and metabolism]
Probab=29.45 E-value=34 Score=30.06 Aligned_cols=33 Identities=15% Similarity=0.084 Sum_probs=26.9
Q ss_pred eceEecCCceEEEEecCCCcEEEEEEecCccce
Q psy4833 38 GFTLTRGGGWFDMLVNGGGAVTLQFQRSPYRPQ 70 (141)
Q Consensus 38 G~TiTD~DG~FsL~V~~g~~~tL~FSyiGY~sq 70 (141)
..|+.|..|.|.+.-.+...++|+.||+-+..+
T Consensus 33 ~~tVkde~Gt~tv~k~PKRVVVLE~SFaDaLaa 65 (310)
T COG4594 33 AHTVKDELGTFTVPKTPKRVVVLELSFADALAA 65 (310)
T ss_pred eeeeeccCCceecCCCCceEEEEEecHHHHHHH
Confidence 358999999999987666677899999877543
No 79
>PRK15213 fimbrial outer membrane usher protein PefC; Provisional
Probab=29.38 E-value=2.1e+02 Score=27.65 Aligned_cols=51 Identities=16% Similarity=0.100 Sum_probs=31.7
Q ss_pred cCCcceEEEEEEEECCCCCeee-eEEEEeccCCCeeeceEecCCceEEEE-ecCCC
Q psy4833 3 DDELRVAVVRGQVLSVQGLGIT-GIRVSVDKDSGRFGFTLTRGGGWFDML-VNGGG 56 (141)
Q Consensus 3 ~d~~r~~~IrG~V~D~~GePL~-GVsV~Vkgt~~~~G~TiTD~DG~FsL~-V~~g~ 56 (141)
|+-.+...+-=++.|++|+||| |+.|...+.. . - .+...||.==|. ++..+
T Consensus 718 F~~~~g~~~ll~l~~~~G~~lP~Ga~v~~~~g~-~-~-g~Vg~~G~vyl~gl~~~~ 770 (797)
T PRK15213 718 FKALKVLRYILRVKQKDGRFVPGGSWARNEQGT-P-L-GFVANNGVLLMNLLDAPG 770 (797)
T ss_pred EEEEeeeEEEEEEEccCCCcCCCCeEEECCCCc-E-E-EEEcCCCEEEEEcCCCCC
Confidence 5555566666678899999987 7767543322 2 2 466778885554 44444
No 80
>PRK07121 hypothetical protein; Validated
Probab=28.52 E-value=28 Score=30.68 Aligned_cols=15 Identities=40% Similarity=0.711 Sum_probs=13.3
Q ss_pred EEEECCCCCeeeeEE
Q psy4833 13 GQVLSVQGLGITGIR 27 (141)
Q Consensus 13 G~V~D~~GePL~GVs 27 (141)
++|+|.+|+||+|.=
T Consensus 439 ~qVld~~g~pI~GLY 453 (492)
T PRK07121 439 GEVLRADGAPIPGLY 453 (492)
T ss_pred ceEECCCCCCcCceE
Confidence 899999999999953
No 81
>TIGR02485 CobZ_N-term precorrin 3B synthase CobZ. CobZ is essential for cobalamin biosynthesis (by knockout of the R. capsulatus gene ) and is complemented by the characterized precorrin 3B synthase CobG. The enzyme has been shown to contain flavin, heme and Fe-S cluster cofactors and is believed to require dioxygen as a substrate. This model identifies the N-terminal portion of the R. capsulatus gene which, in other species exists as a separate protein. The C-terminal portion is homologous to the 2-component signal transduction system protein CitB (TIGR02484).
Probab=28.24 E-value=27 Score=30.14 Aligned_cols=16 Identities=6% Similarity=0.067 Sum_probs=13.7
Q ss_pred EEEEEECCCCCeeeeE
Q psy4833 11 VRGQVLSVQGLGITGI 26 (141)
Q Consensus 11 IrG~V~D~~GePL~GV 26 (141)
-+++|+|++|+||+|-
T Consensus 375 ~~~~Vl~~~g~~I~GL 390 (432)
T TIGR02485 375 ATARVRLNDAVAPDNL 390 (432)
T ss_pred CCceEECCCCCCCCCe
Confidence 4689999999999984
No 82
>PF06030 DUF916: Bacterial protein of unknown function (DUF916); InterPro: IPR010317 This family consists of putative cell surface proteins, from Firmicutes, of unknown function.
Probab=27.23 E-value=1.4e+02 Score=22.24 Aligned_cols=38 Identities=24% Similarity=0.255 Sum_probs=25.5
Q ss_pred ecC-CceEEEEecCCCcEEEEEEecCccceEEEEEccCc
Q psy4833 42 TRG-GGWFDMLVNGGGAVTLQFQRSPYRPQTRTVFIPWN 79 (141)
Q Consensus 42 TD~-DG~FsL~V~~g~~~tL~FSyiGY~sqe~~V~~~~n 79 (141)
.|. -|.|+|.+.++...+|++.----..+++++.+..+
T Consensus 12 ~~~~~~YFdL~~~P~q~~~l~v~i~N~s~~~~tv~v~~~ 50 (121)
T PF06030_consen 12 IDKNVSYFDLKVKPGQKQTLEVRITNNSDKEITVKVSAN 50 (121)
T ss_pred cCCCCCeEEEEeCCCCEEEEEEEEEeCCCCCEEEEEEEe
Confidence 355 49999999877766676666555566666655533
No 83
>PRK15235 outer membrane fimbrial usher protein SefC; Provisional
Probab=27.02 E-value=1.6e+02 Score=28.55 Aligned_cols=28 Identities=11% Similarity=0.064 Sum_probs=17.1
Q ss_pred CCcceEEEEEEEECCCCCeee-eEEEEec
Q psy4833 4 DELRVAVVRGQVLSVQGLGIT-GIRVSVD 31 (141)
Q Consensus 4 d~~r~~~IrG~V~D~~GePL~-GVsV~Vk 31 (141)
+-.+...+-=++.+++|+||| |+.|.++
T Consensus 723 ~~~~g~~~ll~l~~~~G~~lPfGa~V~~~ 751 (814)
T PRK15235 723 KTSVGTNALIRITRTNGKPLALGTVLSLK 751 (814)
T ss_pred EEeeeeEEEEEEECCCCCcCCCceEEEec
Confidence 333344444467888999887 6656543
No 84
>PRK15223 pilin outer membrane usher protein SafC; Provisional
Probab=24.82 E-value=5e+02 Score=25.28 Aligned_cols=58 Identities=17% Similarity=0.152 Sum_probs=33.8
Q ss_pred cCCcceEEEEEEEECCCCCeee-eEEEEeccCCCeeeceEecCCceEEEE-ecCCCcEEEEEEe
Q psy4833 3 DDELRVAVVRGQVLSVQGLGIT-GIRVSVDKDSGRFGFTLTRGGGWFDML-VNGGGAVTLQFQR 64 (141)
Q Consensus 3 ~d~~r~~~IrG~V~D~~GePL~-GVsV~Vkgt~~~~G~TiTD~DG~FsL~-V~~g~~~tL~FSy 64 (141)
|+-.+...+-=++.+++|+||| |+.|...+.... - .+...+|.-=|. ++..+ +|.+..
T Consensus 749 F~~~~g~~~ll~l~~~~G~~lP~Ga~v~~~~~g~~-~-g~Vg~~G~vyl~g~~~~~--~l~v~w 808 (836)
T PRK15223 749 FHTRIGARALMTLKREDGSAIPFGAQVTVNGQDGS-A-ALVDTDSQVYLTGLADKG--ELTVKW 808 (836)
T ss_pred EEEeeeEEEEEEEECCCCCcCCCCeEEEecCCCce-E-EEECCCCEEEEEccCCCc--eEEEEE
Confidence 4444445555567889999987 777765432102 2 466778886665 34333 354444
No 85
>PRK12837 3-ketosteroid-delta-1-dehydrogenase; Provisional
Probab=24.56 E-value=39 Score=30.28 Aligned_cols=16 Identities=31% Similarity=0.553 Sum_probs=13.7
Q ss_pred EEEEEECCCCCeeeeE
Q psy4833 11 VRGQVLSVQGLGITGI 26 (141)
Q Consensus 11 IrG~V~D~~GePL~GV 26 (141)
-.++|+|.+|+||+|.
T Consensus 457 ~~~qVl~~~g~pIpGL 472 (513)
T PRK12837 457 TAARVLDTDGRPIPGL 472 (513)
T ss_pred CCceEECCCCCEeCCc
Confidence 3579999999999994
No 86
>PF03404 Mo-co_dimer: Mo-co oxidoreductase dimerisation domain; InterPro: IPR005066 The majority of molybdenum-containing enzymes utilise a molybdenum cofactor (MoCF or Moco) consisting of a Mo atom coordinated via a cis-dithiolene moiety to molybdopterin (MPT). MoCF is ubiquitous in nature, and the pathway for MoCF biosynthesis is conserved in all three domains of life. MoCF-containing enzymes function as oxidoreductases in carbon, nitrogen, and sulphur metabolism [, ]. In Escherichia coli, biosynthesis of MoCF is a three stage process. It begins with the MoaA and MoaC conversion of GTP to the meta-stable pterin intermediate precursor Z. The second stage involves MPT synthase (MoaD and MoaE), which converts precursor Z to MPT; MoeB is involved in the recycling of MPT synthase. The final step in MoCF synthesis is the attachment of mononuclear Mo to MPT, a process that requires MoeA and which is enhanced by MogA in an Mg2 ATP-dependent manner []. MoCF is the active co-factor in eukaryotic and some prokaryotic molybdo-enzymes, but the majority of bacterial enzymes requiring MoCF, need a modification of MTP for it to be active; MobA is involved in the attachment of a nucleotide monophosphate to MPT resulting in the MGD co-factor, the active co-factor for most prokaryotic molybdo-enzymes. Bacterial two-hybrid studies have revealed the close interactions between MoeA, MogA, and MobA in the synthesis of MoCF []. Moreover the close functional association of MoeA and MogA in the synthesis of MoCF is supported by fact that the known eukaryotic homologues to MoeA and MogA exist as fusion proteins: CNX1 (Q39054 from SWISSPROT) of Arabidopsis thaliana (Mouse-ear cress), mammalian Gephryin (e.g. Q9NQX3 from SWISSPROT) and Drosophila melanogaster (Fruit fly) Cinnamon (P39205 from SWISSPROT) []. This domain is found in molybdopterin cofactor oxidoreductases, such as in the C-terminal of Mo-containing sulphite oxidase, which catalyses the conversion of sulphite to sulphate, the terminal step in the oxidative degradation of cysteine and methionine []. This domain is involved in dimer formation, and has an Ig-fold structure [].; GO: 0016491 oxidoreductase activity, 0030151 molybdenum ion binding, 0055114 oxidation-reduction process; PDB: 2C9X_A 2CA3_A 2BLF_A 2CA4_A 2BPB_A 2XTS_C 2BII_A 2BIH_A 1OGP_A 2A9A_B ....
Probab=24.30 E-value=3.1e+02 Score=20.55 Aligned_cols=40 Identities=28% Similarity=0.266 Sum_probs=27.8
Q ss_pred ceEEEEEEEECCCCCeeeeEEEEeccCCCeeeceEecCCc
Q psy4833 7 RVAVVRGQVLSVQGLGITGIRVSVDKDSGRFGFTLTRGGG 46 (141)
Q Consensus 7 r~~~IrG~V~D~~GePL~GVsV~Vkgt~~~~G~TiTD~DG 46 (141)
..++|+|.-.+..|.++..|-|++.|..+=.-..+....+
T Consensus 27 ~~v~i~G~A~~g~g~~I~rVEVS~DgG~tW~~A~l~~~~~ 66 (131)
T PF03404_consen 27 GTVTIRGYAWSGGGRGIARVEVSTDGGKTWQEATLDGPES 66 (131)
T ss_dssp EEEEEEEEEE-STT--EEEEEEESSTTSSEEE-EEESTSC
T ss_pred cEEEEEEEEEeCCCcceEEEEEEeCCCCCcEEeEeccCCC
Confidence 5789999999999999999999999765311236666666
No 87
>PRK08274 tricarballylate dehydrogenase; Validated
Probab=24.15 E-value=35 Score=29.61 Aligned_cols=16 Identities=13% Similarity=0.075 Sum_probs=13.4
Q ss_pred EEEEEECCCCCeeeeE
Q psy4833 11 VRGQVLSVQGLGITGI 26 (141)
Q Consensus 11 IrG~V~D~~GePL~GV 26 (141)
.+|+|+|.+|+||+|.
T Consensus 406 ~~~~vl~~~g~~I~GL 421 (466)
T PRK08274 406 EDARVRFADGRPSPNL 421 (466)
T ss_pred CCceEECCCCCCCCCc
Confidence 3588999999999985
No 88
>PRK15316 RatA-like protein; Provisional
Probab=22.44 E-value=2.2e+02 Score=31.67 Aligned_cols=55 Identities=11% Similarity=0.109 Sum_probs=0.0
Q ss_pred cceEEEEEEEECCCCCeeeeEEEEeccCCCe------------------------------------eeceEecCCceEE
Q psy4833 6 LRVAVVRGQVLSVQGLGITGIRVSVDKDSGR------------------------------------FGFTLTRGGGWFD 49 (141)
Q Consensus 6 ~r~~~IrG~V~D~~GePL~GVsV~Vkgt~~~------------------------------------~G~TiTD~DG~Fs 49 (141)
.+.+.|+=+|.|.+|.|++|+.|.++... . +. -.||.||.-.
T Consensus 2160 GEsiPLTVTVKD~~GnPv~nva~tLkRg~-A~nR~g~~~~~~~~ddm~~~~~tP~~at~~L~~~~~t~~-G~TgsDGtat 2237 (2683)
T PRK15316 2160 GEMLPIVVTIKDANGNPVGGEGVTLKRVQ-AKSRSGISVSSNTVDDLILDEVTPTSARISFNQNTSAWS-GFTGSDGTIT 2237 (2683)
T ss_pred CCceeEEEEEEcCCCCCcCCceEEEeecc-ccccCCccccccccccceeccccccceeEEeccCcceEe-eeeCCCceEE
Q ss_pred EEecCCCcEEEEEEecCccc
Q psy4833 50 MLVNGGGAVTLQFQRSPYRP 69 (141)
Q Consensus 50 L~V~~g~~~tL~FSyiGY~s 69 (141)
|.+...+. +|+++
T Consensus 2238 ftvtQpnt-------~GLKT 2250 (2683)
T PRK15316 2238 FNVTQNNT-------VGLVT 2250 (2683)
T ss_pred EEEecCCC-------Ccccc
No 89
>PRK15298 fimbrial outer membrane usher protein StiC; Provisional
Probab=22.34 E-value=3.1e+02 Score=26.83 Aligned_cols=45 Identities=16% Similarity=0.133 Sum_probs=24.2
Q ss_pred cceEEEEEEEECCCCCeee-eEEEEeccCCCeeeceEecCCceEEEE
Q psy4833 6 LRVAVVRGQVLSVQGLGIT-GIRVSVDKDSGRFGFTLTRGGGWFDML 51 (141)
Q Consensus 6 ~r~~~IrG~V~D~~GePL~-GVsV~Vkgt~~~~G~TiTD~DG~FsL~ 51 (141)
.+...+-=+|.|++|+||| |+.|.+.+.....- .+...||.==|.
T Consensus 763 ~~g~~~ll~l~~~~G~~lP~Ga~v~~~~~~~~~~-g~Vg~~G~vyl~ 808 (848)
T PRK15298 763 HKGYRVLLTLTRSNGEPVPFGATASVDGQDANLA-SIVGDKGQVFLS 808 (848)
T ss_pred EeeEEEEEEEECCCCCcCCCCcEEEEcCCCCeeE-EEECCCCEEEEE
Confidence 3333333457788999886 66666532210111 455667774444
No 90
>TIGR01813 flavo_cyto_c flavocytochrome c. This model describes a family of redox proteins related to the succinate dehydrogenases and fumarate reductases of E. coli, mitochondria, and other well-characterized systems. A member of this family from Shewanella frigidimarina NCIMB400 is characterized as a water-soluble periplasmic protein with four heme groups, a non-covalently bound FAD, and essentially unidirectional fumarate reductase activity. At least seven distinct members of this family are found in Shewanella oneidensis, a species able to use a wide variety of pathways for respiraton.
Probab=22.01 E-value=45 Score=28.64 Aligned_cols=17 Identities=35% Similarity=0.565 Sum_probs=13.6
Q ss_pred EEEEEECCCCCeeeeEE
Q psy4833 11 VRGQVLSVQGLGITGIR 27 (141)
Q Consensus 11 IrG~V~D~~GePL~GVs 27 (141)
-.++|+|.+|+||+|.=
T Consensus 391 ~~~~vl~~~g~~IpGLy 407 (439)
T TIGR01813 391 TKAEVLDAQGKPIPGLF 407 (439)
T ss_pred CCCeEECCCCCEecccE
Confidence 45788999999999953
No 91
>PRK06481 fumarate reductase flavoprotein subunit; Validated
Probab=21.05 E-value=47 Score=29.71 Aligned_cols=16 Identities=38% Similarity=0.551 Sum_probs=13.4
Q ss_pred EEEEEECCCCCeeeeE
Q psy4833 11 VRGQVLSVQGLGITGI 26 (141)
Q Consensus 11 IrG~V~D~~GePL~GV 26 (141)
-.++|+|++|+||+|-
T Consensus 449 ~~~qVld~~g~pI~GL 464 (506)
T PRK06481 449 TNTEVLKKDGSPITGL 464 (506)
T ss_pred CCceEEcCCCCEeCCe
Confidence 3578999999999995
No 92
>PF09624 DUF2393: Protein of unknown function (DUF2393); InterPro: IPR013417 The function of this protein is unknown. It is always found as part of a two-gene operon with IPR013416 from INTERPRO, a protein that appears to span the membrane seven times. It has so far been found in the bacteria Anabaena sp. (strain PCC 7120), Agrobacterium tumefaciens, Rhizobium meliloti, and Gloeobacter violaceus.
Probab=20.90 E-value=1.8e+02 Score=21.59 Aligned_cols=26 Identities=19% Similarity=0.239 Sum_probs=22.7
Q ss_pred cceEEEEEEEECCCCCeeeeEEEEec
Q psy4833 6 LRVAVVRGQVLSVQGLGITGIRVSVD 31 (141)
Q Consensus 6 ~r~~~IrG~V~D~~GePL~GVsV~Vk 31 (141)
+-...|.|+|+...+.++-+|.|.++
T Consensus 61 ~~~~~v~g~V~N~g~~~i~~c~i~~~ 86 (149)
T PF09624_consen 61 SESFYVDGTVTNTGKFTIKKCKITVK 86 (149)
T ss_pred ccEEEEEEEEEECCCCEeeEEEEEEE
Confidence 34678999999999999999999886
No 93
>PRK15273 putative fimbrial outer membrane usher protein SteB; Provisional
Probab=20.90 E-value=2.5e+02 Score=27.62 Aligned_cols=36 Identities=14% Similarity=0.016 Sum_probs=22.3
Q ss_pred EEEECCCCCeee-eEEEEeccCCCeeeceEecCCceEEEE
Q psy4833 13 GQVLSVQGLGIT-GIRVSVDKDSGRFGFTLTRGGGWFDML 51 (141)
Q Consensus 13 G~V~D~~GePL~-GVsV~Vkgt~~~~G~TiTD~DG~FsL~ 51 (141)
=+|.+++|+||| |+.|...+.. . - .+...+|.==|.
T Consensus 746 ~~l~~~dG~~lPfGa~V~~~~g~-~-~-giVg~~G~vyl~ 782 (881)
T PRK15273 746 AVFRLADGDFPPFGAEVKNERQQ-Q-L-GLVADDGNAWLA 782 (881)
T ss_pred EEEEccCCCcCCCceEEEcCCCc-E-E-EEEcCCCEEEEe
Confidence 356788999987 7777644333 2 2 355667774444
No 94
>PRK12834 putative FAD-binding dehydrogenase; Reviewed
Probab=20.85 E-value=43 Score=30.18 Aligned_cols=16 Identities=25% Similarity=0.588 Sum_probs=13.6
Q ss_pred EEEEEECCCCCeeeeE
Q psy4833 11 VRGQVLSVQGLGITGI 26 (141)
Q Consensus 11 IrG~V~D~~GePL~GV 26 (141)
-.++|+|.+|+||+|.
T Consensus 492 ~~~qVld~dg~pIpGL 507 (549)
T PRK12834 492 LDSRVLGADGTPLPGL 507 (549)
T ss_pred CCCceeCCCCCEeCCe
Confidence 3478999999999995
No 95
>COG3549 HigB Plasmid maintenance system killer protein [General function prediction only]
Probab=20.73 E-value=1.4e+02 Score=22.12 Aligned_cols=23 Identities=22% Similarity=0.188 Sum_probs=18.5
Q ss_pred cCCceEEEEecCCCcEEEEEEecCc
Q psy4833 43 RGGGWFDMLVNGGGAVTLQFQRSPY 67 (141)
Q Consensus 43 D~DG~FsL~V~~g~~~tL~FSyiGY 67 (141)
|.+|.|+|-|+. ...|.|-+-|=
T Consensus 60 d~~g~ySIrvN~--QwrLiF~~~~~ 82 (94)
T COG3549 60 DLKGLYSIRVNQ--QWRLIFRWEGG 82 (94)
T ss_pred CccCcEEEEECC--EEEEEEEeCCC
Confidence 789999999983 34698988774
No 96
>PF00207 A2M: Alpha-2-macroglobulin family; InterPro: IPR001599 This entry contains serum complement C3 and C4 precursors and alpha-macrogrobulins. The alpha-macroglobulin (aM) family of proteins includes protease inhibitors [], typified by the human tetrameric a2-macroglobulin (a2M); they belong to the MEROPS proteinase inhibitor family I39, clan IL. These protease inhibitors share several defining properties, which include (i) the ability to inhibit proteases from all catalytic classes, (ii) the presence of a 'bait region' and a thiol ester, (iii) a similar protease inhibitory mechanism and (iv) the inactivation of the inhibitory capacity by reaction of the thiol ester with small primary amines. aM protease inhibitors inhibit by steric hindrance []. The mechanism involves protease cleavage of the bait region, a segment of the aM that is particularly susceptible to proteolytic cleavage, which initiates a conformational change such that the aM collapses about the protease. In the resulting aM-protease complex, the active site of the protease is sterically shielded, thus substantially decreasing access to protein substrates. Two additional events occur as a consequence of bait region cleavage, namely (i) the h-cysteinyl-g-glutamyl thiol ester becomes highly reactive and (ii) a major conformational change exposes a conserved COOH-terminal receptor binding domain [] (RBD). RBD exposure allows the aM protease complex to bind to clearance receptors and be removed from circulation []. Tetrameric, dimeric, and, more recently, monomeric aM protease inhibitors have been identified [, ].; GO: 0004866 endopeptidase inhibitor activity; PDB: 3KLS_B 3PRX_C 3KM9_B 3PVM_C 3CU7_A 4E0S_A 4A5W_A 2PN5_A 3FRP_G 3HRZ_B ....
Probab=20.70 E-value=2.6e+02 Score=19.00 Aligned_cols=51 Identities=18% Similarity=0.151 Sum_probs=28.4
Q ss_pred eEecCCceEEEEe--cCCCcEEEEEEecCccceEEEEEccCc-cEEEeCCeEEEec
Q psy4833 40 TLTRGGGWFDMLV--NGGGAVTLQFQRSPYRPQTRTVFIPWN-QIVVLPPVKMQLS 92 (141)
Q Consensus 40 TiTD~DG~FsL~V--~~g~~~tL~FSyiGY~sqe~~V~~~~n-~~~~~~~v~m~~~ 92 (141)
.++|.+|+-.+.. |..- .+.+++.+++.... .+..... ++.+..++.|.+.
T Consensus 8 ~~~d~~G~~~~~~~lPd~i-t~w~v~a~a~s~~~-~~g~~~~~~~~v~~p~~i~~~ 61 (92)
T PF00207_consen 8 VVTDANGKATFSFTLPDSI-TSWRVTAFAVSPTG-GFGIAEPPEITVFKPFFIQLN 61 (92)
T ss_dssp EECCTTEEEEEEEE-SSSS-SEEEEEEEEEETTT-EEEEECCEEEEEB-SEEEEEE
T ss_pred eccCCCCEEEEEEECCCCc-cEEEEEEEEECCCC-cceEecceEEEEEeeEEEEcC
Confidence 5677888877764 4332 24666666665431 3444444 5666666666654
No 97
>COG0196 RibF FAD synthase [Coenzyme metabolism]
Probab=20.31 E-value=1.6e+02 Score=25.60 Aligned_cols=57 Identities=26% Similarity=0.399 Sum_probs=39.8
Q ss_pred ceEEEEEEEECCC--C--CeeeeEEEEeccCCCeeeceEecCCceEEEEecCCCcEEEEEEecCccce
Q psy4833 7 RVAVVRGQVLSVQ--G--LGITGIRVSVDKDSGRFGFTLTRGGGWFDMLVNGGGAVTLQFQRSPYRPQ 70 (141)
Q Consensus 7 r~~~IrG~V~D~~--G--ePL~GVsV~Vkgt~~~~G~TiTD~DG~FsL~V~~g~~~tL~FSyiGY~sq 70 (141)
|...|+|+|+..+ | ...|=|++..+... +. ..|-|...+...+..---++.+||.+.
T Consensus 184 ~py~i~G~Vv~G~~~Gr~lGfPTaNi~~~~~~------~~-~~GVYav~v~~~~~~~~gv~~vG~rpt 244 (304)
T COG0196 184 RPYSIEGKVVHGQKLGRTLGFPTANIYLKDNV------LP-AFGVYAVRVKLDGGVYPGVANVGYRPT 244 (304)
T ss_pred CCeEEEEEEEcccccccccCCCcccccccccc------cc-CCeeEEEEEEECCccccceEEeccccc
Confidence 6678999998854 5 67788888887554 22 799999987632211256688888754
No 98
>KOG1691|consensus
Probab=20.26 E-value=5.2e+02 Score=21.67 Aligned_cols=72 Identities=22% Similarity=0.350 Sum_probs=42.0
Q ss_pred EEEE--EEECCCCCeeeeEEEEeccCCCeeeceEecC----CceEEEEecCCCcEEEEEEe--cCccceE-EEEEccCcc
Q psy4833 10 VVRG--QVLSVQGLGITGIRVSVDKDSGRFGFTLTRG----GGWFDMLVNGGGAVTLQFQR--SPYRPQT-RTVFIPWNQ 80 (141)
Q Consensus 10 ~IrG--~V~D~~GePL~GVsV~Vkgt~~~~G~TiTD~----DG~FsL~V~~g~~~tL~FSy--iGY~sqe-~~V~~~~n~ 80 (141)
.+.| .+.|..|.-.+-.+|.|.... |-+.-.+ +|+|++.+...+.....|.- .|..+.. +.|.+.|-.
T Consensus 44 lv~g~y~i~~~~~~~~~~~~~~Vts~~---G~~~~~~env~~gqFaFta~e~~~y~~Cf~~~~~~~~p~~~~~I~ld~k~ 120 (210)
T KOG1691|consen 44 LVVGDYEIINPNGDHSHKLSVKVTSPY---GNNLHSKENVTKGQFAFTAEESGMYEACFTADVPGHKPETKRSIDLDWKT 120 (210)
T ss_pred EEEEEEEEecCCCCccceEEEEEEcCC---CceeehhhccccceEEEEeccCCcEEEEEecccCCCCCCcceEEEEEeec
Confidence 3445 567765443255666664322 2222111 89999998766666677777 7777654 566666655
Q ss_pred EEEe
Q psy4833 81 IVVL 84 (141)
Q Consensus 81 ~~~~ 84 (141)
-+..
T Consensus 121 Gv~a 124 (210)
T KOG1691|consen 121 GVEA 124 (210)
T ss_pred cccc
Confidence 4443
No 99
>PRK15304 putative fimbrial outer membrane usher protein; Provisional
Probab=20.14 E-value=2.9e+02 Score=26.66 Aligned_cols=46 Identities=17% Similarity=0.069 Sum_probs=23.8
Q ss_pred cCCcceEEEEEEEECCCCCeee-eEEEEeccCCCeeeceEecCCceEEEE
Q psy4833 3 DDELRVAVVRGQVLSVQGLGIT-GIRVSVDKDSGRFGFTLTRGGGWFDML 51 (141)
Q Consensus 3 ~d~~r~~~IrG~V~D~~GePL~-GVsV~Vkgt~~~~G~TiTD~DG~FsL~ 51 (141)
|+-.+...+-=++.|++|+||| |+.|...+.. . - .+...+|.-=|.
T Consensus 712 F~~~~g~~~l~~l~~~~G~~lPfGa~V~~~~g~-~-~-g~Vg~~G~vyl~ 758 (801)
T PRK15304 712 FSVLNQRRVMLNVKRADGSPLPKGVSIVDEKGN-Y-L-TSAVDDGRVFLN 758 (801)
T ss_pred EEEccceEEEEEEEccCCCcCCCcEEEECCCCc-E-E-EEEeCCCEEEEE
Confidence 3444444444467788888886 5555432222 2 1 234445665444
Done!