Query psy5057
Match_columns 87
No_of_seqs 120 out of 1087
Neff 10.6
Searched_HMMs 46136
Date Sat Aug 17 00:37:51 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy5057.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/5057hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF00027 cNMP_binding: Cyclic 99.9 4.4E-21 9.5E-26 96.8 8.2 85 3-87 1-86 (91)
2 PRK10402 DNA-binding transcrip 99.8 1.5E-19 3.3E-24 104.8 9.5 85 2-86 32-117 (226)
3 PRK11753 DNA-binding transcrip 99.8 6E-19 1.3E-23 101.1 11.5 86 1-86 20-107 (211)
4 PRK09392 ftrB transcriptional 99.8 5.8E-19 1.3E-23 102.7 9.7 85 2-86 31-115 (236)
5 COG0664 Crp cAMP-binding prote 99.8 1.8E-18 3.9E-23 98.5 10.0 85 2-86 24-109 (214)
6 cd00038 CAP_ED effector domain 99.8 4.1E-18 8.9E-23 88.5 9.8 85 2-86 18-103 (115)
7 PRK13918 CRP/FNR family transc 99.8 8.4E-18 1.8E-22 95.8 10.5 81 2-83 7-90 (202)
8 PRK09391 fixK transcriptional 99.8 2.9E-17 6.3E-22 95.5 10.7 82 2-86 39-121 (230)
9 TIGR03697 NtcA_cyano global ni 99.7 2.1E-17 4.6E-22 93.5 8.7 78 9-86 1-81 (193)
10 smart00100 cNMP Cyclic nucleot 99.7 9.6E-17 2.1E-21 83.7 10.3 85 2-86 18-105 (120)
11 PRK11161 fumarate/nitrate redu 99.7 4.3E-17 9.4E-22 94.8 9.8 82 4-86 40-122 (235)
12 PLN03192 Voltage-dependent pot 99.7 1E-16 2.2E-21 106.5 9.9 86 1-86 397-482 (823)
13 PLN02868 acyl-CoA thioesterase 99.7 3.8E-15 8.3E-20 92.9 11.8 84 1-86 31-114 (413)
14 KOG1113|consensus 99.6 1.7E-15 3.7E-20 91.1 7.3 83 1-87 145-227 (368)
15 KOG0498|consensus 99.6 1.3E-15 2.8E-20 99.3 5.2 86 1-86 442-529 (727)
16 KOG0614|consensus 99.6 1.4E-15 3.1E-20 95.8 2.5 83 1-87 177-259 (732)
17 KOG0500|consensus 99.5 7.7E-13 1.7E-17 82.7 8.8 84 1-86 330-419 (536)
18 COG2905 Predicted signal-trans 99.4 1.5E-12 3.2E-17 82.7 7.4 82 2-86 31-112 (610)
19 KOG0614|consensus 99.3 1E-12 2.3E-17 83.3 3.4 86 2-87 296-384 (732)
20 KOG1113|consensus 99.3 7E-12 1.5E-16 75.9 5.8 83 2-87 264-346 (368)
21 KOG2968|consensus 99.2 3.6E-12 7.8E-17 84.4 2.7 83 2-84 509-592 (1158)
22 KOG0501|consensus 99.2 1.5E-11 3.2E-16 79.1 2.2 80 3-86 573-654 (971)
23 KOG0499|consensus 99.1 5E-10 1.1E-14 72.1 6.5 84 2-86 551-637 (815)
24 PRK11832 putative DNA-binding 98.9 4.9E-08 1.1E-12 56.1 9.2 82 2-86 23-105 (207)
25 PF04831 Popeye: Popeye protei 98.9 6.4E-08 1.4E-12 52.9 9.1 81 2-86 29-115 (153)
26 KOG2968|consensus 98.7 1.8E-07 3.9E-12 63.0 7.8 80 7-86 392-478 (1158)
27 KOG3542|consensus 97.6 0.00011 2.4E-09 49.2 4.4 76 7-86 311-387 (1283)
28 KOG3542|consensus 96.0 0.019 4.1E-07 39.1 4.8 64 4-79 63-126 (1283)
29 PF07883 Cupin_2: Cupin domain 95.8 0.072 1.6E-06 25.1 6.3 44 4-52 3-47 (71)
30 smart00835 Cupin_1 Cupin. This 95.7 0.15 3.3E-06 27.8 7.5 51 3-53 34-86 (146)
31 COG1917 Uncharacterized conser 94.6 0.21 4.5E-06 26.7 5.2 47 3-54 47-94 (131)
32 PRK13290 ectC L-ectoine syntha 94.5 0.36 7.8E-06 26.0 7.5 67 4-78 40-106 (125)
33 COG3718 IolB Uncharacterized e 94.1 0.47 1E-05 28.5 6.1 76 2-81 32-111 (270)
34 PF05899 Cupin_3: Protein of u 93.7 0.34 7.3E-06 23.5 4.5 30 20-53 26-55 (74)
35 TIGR03404 bicupin_oxalic bicup 93.6 1.2 2.5E-05 28.4 8.1 50 4-53 72-121 (367)
36 PF00190 Cupin_1: Cupin; Inte 92.5 0.98 2.1E-05 24.6 5.7 49 5-53 40-95 (144)
37 TIGR02272 gentisate_1_2 gentis 92.4 1.8 3.8E-05 27.4 7.2 71 5-85 256-326 (335)
38 PF12973 Cupin_7: ChrR Cupin-l 92.0 0.87 1.9E-05 22.8 5.0 61 3-77 28-88 (91)
39 TIGR03037 anthran_nbaC 3-hydro 91.9 1.4 3E-05 24.9 6.8 56 19-80 48-103 (159)
40 TIGR03404 bicupin_oxalic bicup 90.9 2.9 6.2E-05 26.7 8.4 49 4-52 250-299 (367)
41 PRK13264 3-hydroxyanthranilate 88.6 3.2 6.9E-05 23.9 6.9 57 17-79 52-108 (177)
42 PRK04190 glucose-6-phosphate i 87.9 3.7 8E-05 23.9 7.9 41 13-53 91-131 (191)
43 PRK11171 hypothetical protein; 87.7 4.6 9.9E-05 24.6 7.7 68 3-78 65-134 (266)
44 PF04962 KduI: KduI/IolB famil 86.5 2.8 6.1E-05 25.5 4.8 74 3-80 31-107 (261)
45 COG2140 Thermophilic glucose-6 85.7 5.4 0.00012 23.6 5.6 33 20-52 104-136 (209)
46 PLN00212 glutelin; Provisional 84.1 9.7 0.00021 25.5 6.6 51 4-54 353-405 (493)
47 COG0662 {ManC} Mannose-6-phosp 83.9 4.9 0.00011 21.5 6.4 45 3-52 40-85 (127)
48 PRK09943 DNA-binding transcrip 83.3 6.3 0.00014 22.4 6.4 53 18-78 127-179 (185)
49 TIGR02451 anti_sig_ChrR anti-s 82.4 5.2 0.00011 23.6 4.6 42 3-53 131-172 (215)
50 KOG2378|consensus 81.9 1.3 2.9E-05 29.2 2.1 39 48-86 1-40 (573)
51 COG3823 Glutamine cyclotransfe 81.9 9 0.00019 23.1 6.1 81 4-86 38-124 (262)
52 TIGR03214 ura-cupin putative a 80.4 11 0.00023 23.0 7.8 30 19-53 80-109 (260)
53 PF05726 Pirin_C: Pirin C-term 80.3 6.3 0.00014 20.3 5.7 70 4-85 4-75 (104)
54 PF06052 3-HAO: 3-hydroxyanthr 79.7 7.5 0.00016 21.8 4.3 43 18-61 52-94 (151)
55 PF13128 DUF3954: Protein of u 79.0 4.8 0.0001 18.2 3.7 19 18-36 8-26 (50)
56 COG3837 Uncharacterized conser 78.1 10 0.00022 21.5 5.6 38 15-57 60-97 (161)
57 PHA00672 hypothetical protein 74.9 12 0.00025 20.5 6.7 31 3-34 51-81 (152)
58 PRK11171 hypothetical protein; 74.9 16 0.00036 22.2 6.8 45 3-52 188-233 (266)
59 TIGR01479 GMP_PMI mannose-1-ph 68.9 31 0.00068 22.9 7.9 46 3-53 380-426 (468)
60 PF11506 DUF3217: Protein of u 66.6 15 0.00033 18.6 3.2 31 24-54 5-35 (104)
61 PRK15460 cpsB mannose-1-phosph 63.3 43 0.00093 22.5 7.5 69 3-79 389-458 (478)
62 PLN00212 glutelin; Provisional 62.7 45 0.00098 22.6 5.8 34 3-36 84-117 (493)
63 COG3450 Predicted enzyme of th 62.6 23 0.00049 19.0 5.7 29 20-52 64-92 (116)
64 COG4101 Predicted mannose-6-ph 62.5 24 0.00051 19.2 4.9 43 19-63 67-109 (142)
65 COG3364 Zn-ribbon containing p 61.6 22 0.00048 18.7 4.3 41 15-55 57-99 (112)
66 PF11699 CENP-C_C: Mif2/CENP-C 56.1 26 0.00056 17.6 4.6 30 18-52 32-61 (85)
67 PRK15457 ethanolamine utilizat 54.5 47 0.001 20.2 4.7 29 19-52 175-203 (233)
68 PF06249 EutQ: Ethanolamine ut 51.5 43 0.00094 18.9 4.1 50 19-78 95-144 (152)
69 PF06865 DUF1255: Protein of u 50.9 35 0.00076 17.6 6.1 58 7-74 31-88 (94)
70 PF06560 GPI: Glucose-6-phosph 50.3 50 0.0011 19.2 5.2 37 16-52 80-120 (182)
71 PF02041 Auxin_BP: Auxin bindi 49.4 48 0.001 18.8 4.7 48 5-53 50-102 (167)
72 PF12852 Cupin_6: Cupin 45.0 57 0.0012 18.4 4.9 34 16-52 32-65 (186)
73 PF03891 DUF333: Domain of unk 44.6 32 0.0007 15.4 4.0 35 26-60 14-48 (50)
74 KOG2757|consensus 43.8 93 0.002 20.5 5.7 45 4-52 338-382 (411)
75 PF04648 MF_alpha: Yeast matin 43.7 9.8 0.00021 12.1 0.3 8 5-12 5-12 (13)
76 COG5458 Uncharacterized conser 43.0 40 0.00086 18.3 2.6 16 19-34 55-70 (144)
77 PRK10579 hypothetical protein; 40.9 54 0.0012 16.9 7.2 58 7-74 31-88 (94)
78 PF07697 7TMR-HDED: 7TM-HD ext 40.5 19 0.0004 20.8 1.3 15 6-20 193-207 (222)
79 COG3257 GlxB Uncharacterized p 39.0 91 0.002 19.0 6.6 59 14-80 78-136 (264)
80 PF10794 DUF2606: Protein of u 36.3 75 0.0016 17.3 4.2 28 7-34 86-113 (131)
81 TIGR02594 conserved hypothetic 35.5 77 0.0017 17.2 6.3 21 7-27 75-95 (129)
82 PF04622 ERG2_Sigma1R: ERG2 an 33.5 93 0.002 18.7 3.3 31 19-52 119-149 (216)
83 PF14499 DUF4437: Domain of un 33.1 26 0.00057 21.4 1.1 53 18-77 190-242 (251)
84 PF00122 E1-E2_ATPase: E1-E2 A 32.0 67 0.0015 18.7 2.7 42 5-46 51-95 (230)
85 PF05523 FdtA: WxcM-like, C-te 31.5 92 0.002 16.8 6.3 33 17-50 52-84 (131)
86 PF02311 AraC_binding: AraC-li 31.0 81 0.0017 16.0 4.5 30 18-52 22-51 (136)
87 COG4820 EutJ Ethanolamine util 28.9 1.1E+02 0.0024 18.6 3.1 30 5-34 156-187 (277)
88 PF13533 Biotin_lipoyl_2: Biot 28.2 57 0.0012 14.3 1.5 12 6-17 24-35 (50)
89 COG4766 EutQ Ethanolamine util 28.0 1.3E+02 0.0027 17.3 4.3 30 18-52 117-146 (176)
90 PF06449 DUF1082: Mitochondria 27.6 70 0.0015 14.2 2.0 25 9-33 16-40 (51)
91 KOG1758|consensus 27.5 1.3E+02 0.0027 17.2 3.3 24 7-30 72-95 (159)
92 PF00877 NLPC_P60: NlpC/P60 fa 25.1 1.1E+02 0.0023 15.4 3.0 22 5-26 51-73 (105)
93 PF03079 ARD: ARD/ARD' family; 24.2 1.5E+02 0.0032 16.8 6.5 53 20-75 93-145 (157)
94 COG3435 Gentisate 1,2-dioxygen 23.6 2.1E+02 0.0046 18.4 5.7 58 18-85 280-338 (351)
95 PF14157 YmzC: YmzC-like prote 22.8 1.1E+02 0.0023 14.6 4.1 22 20-41 30-53 (63)
96 PF02839 CBM_5_12: Carbohydrat 22.5 78 0.0017 13.0 1.5 12 5-16 9-20 (41)
97 PLN02288 mannose-6-phosphate i 22.1 2.5E+02 0.0054 18.6 6.0 15 18-32 353-367 (394)
98 PF11132 SplA: Transcriptional 22.0 60 0.0013 16.0 1.0 12 2-13 2-13 (75)
99 PRK03999 translation initiatio 22.0 1.5E+02 0.0033 16.1 5.8 16 4-19 9-24 (129)
100 PF00018 SH3_1: SH3 domain; I 21.9 84 0.0018 13.3 1.5 12 5-16 15-26 (48)
101 KOG4379|consensus 20.1 2.4E+02 0.0051 19.4 3.6 62 17-78 420-483 (596)
No 1
>PF00027 cNMP_binding: Cyclic nucleotide-binding domain; InterPro: IPR000595 Proteins that bind cyclic nucleotides (cAMP or cGMP) share a structural domain of about 120 residues [, , ]. The best studied of these proteins is the prokaryotic catabolite gene activator (also known as the cAMP receptor protein) (gene crp) where such a domain is known to be composed of three alpha-helices and a distinctive eight-stranded, antiparallel beta-barrel structure. There are six invariant amino acids in this domain, three of which are glycine residues that are thought to be essential for maintenance of the structural integrity of the beta-barrel. cAMP- and cGMP-dependent protein kinases (cAPK and cGPK) contain two tandem copies of the cyclic nucleotide-binding domain. The cAPK's are composed of two different subunits, a catalytic chain and a regulatory chain, which contains both copies of the domain. The cGPK's are single chain enzymes that include the two copies of the domain in their N-terminal section. Vertebrate cyclic nucleotide-gated ion-channels also contain this domain. Two such cations channels have been fully characterised, one is found in rod cells where it plays a role in visual signal transduction.; PDB: 1O7F_A 2BYV_E 3E97_A 3U10_A 2H6B_A 3SHR_A 2OZ6_A 1WGP_A 3LA2_A 3LA3_B ....
Probab=99.86 E-value=4.4e-21 Score=96.79 Aligned_cols=85 Identities=31% Similarity=0.497 Sum_probs=78.4
Q ss_pred eEEecCCCEEEecCCCCCeEEEEEeceEEEEEee-CCCceEEEEecCCCeEehhhhhcCCCceeEEEEecceeEEEEchh
Q psy5057 3 EKPVEAGDIVIRQGDDGDFFYVIESGRYEASVEI-DGEDKLMHAYEDKGSFGELALLYNMPRAATIKATSTGSLWAMDRK 81 (87)
Q Consensus 3 ~~~~~~g~~l~~~g~~~~~~~~i~~G~v~~~~~~-~~~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~a~~~~~~~~~~~~ 81 (87)
.++|++|++|+++|++.+++|+|++|.++++... +++...+..+.+|++||..+++.+.++..+++|.++|+++.+|++
T Consensus 1 ~~~~~~g~~i~~~g~~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~a~~~~~~~~i~~~ 80 (91)
T PF00027_consen 1 EKTYKKGEVIYRQGDPCDHIYIILSGEVKVSSINEDGKEQIIFFLGPGDIFGEIELLTGKPSPFTVIALTDSEVLRIPRE 80 (91)
T ss_dssp -EEESTTEEEEETTSBESEEEEEEESEEEEEEETTTSEEEEEEEEETTEEESGHHHHHTSBBSSEEEESSSEEEEEEEHH
T ss_pred CeEECCCCEEEeCCCcCCEEEEEEECceEEEeceecceeeeecceeeeccccceeecCCCccEEEEEEccCEEEEEEeHH
Confidence 4789999999999999999999999999999988 566677999999999999999999999999999999999999999
Q ss_pred hhhccC
Q psy5057 82 TFYCNL 87 (87)
Q Consensus 82 ~~~~~~ 87 (87)
.|.+++
T Consensus 81 ~~~~~~ 86 (91)
T PF00027_consen 81 DFLQLL 86 (91)
T ss_dssp HHHHHH
T ss_pred HHHHHH
Confidence 998763
No 2
>PRK10402 DNA-binding transcriptional activator YeiL; Provisional
Probab=99.82 E-value=1.5e-19 Score=104.75 Aligned_cols=85 Identities=14% Similarity=0.211 Sum_probs=79.5
Q ss_pred eeEEecCCCEEEecCCCCCeEEEEEeceEEEEEee-CCCceEEEEecCCCeEehhhhhcCCCceeEEEEecceeEEEEch
Q psy5057 2 FEKPVEAGDIVIRQGDDGDFFYVIESGRYEASVEI-DGEDKLMHAYEDKGSFGELALLYNMPRAATIKATSTGSLWAMDR 80 (87)
Q Consensus 2 ~~~~~~~g~~l~~~g~~~~~~~~i~~G~v~~~~~~-~~~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~a~~~~~~~~~~~ 80 (87)
+.+.|++|+.|+.+|++.+++|+|++|.++++..+ +|++.++..+.||++||+.+++.+.++..++.|.++|+++.+|+
T Consensus 32 ~~~~~~kge~l~~~G~~~~~~y~V~~G~v~v~~~~~~G~e~~~~~~~~g~~~G~~~~~~~~~~~~~~~A~~~~~i~~i~~ 111 (226)
T PRK10402 32 ELFHFLAREYIVQEGQQPSYLFYLTRGRAKLYATLANGKVSLIDFFAAPCFIGEIELIDKDHETKAVQAIEECWCLALPM 111 (226)
T ss_pred hheeeCCCCEEEcCCCCCceEEEEEeCEEEEEEECCCCCEeeeeecCCCCeEEeehhhcCCCCCccEEEeccEEEEEEEH
Confidence 45789999999999999999999999999999887 88899999999999999999888889899999999999999999
Q ss_pred hhhhcc
Q psy5057 81 KTFYCN 86 (87)
Q Consensus 81 ~~~~~~ 86 (87)
+.|.++
T Consensus 112 ~~~~~l 117 (226)
T PRK10402 112 KDCRPL 117 (226)
T ss_pred HHHHHH
Confidence 998775
No 3
>PRK11753 DNA-binding transcriptional dual regulator Crp; Provisional
Probab=99.82 E-value=6e-19 Score=101.06 Aligned_cols=86 Identities=19% Similarity=0.282 Sum_probs=78.3
Q ss_pred CeeEEecCCCEEEecCCCCCeEEEEEeceEEEEEee-CCCceEEEEecCCCeEehhhhhcCC-CceeEEEEecceeEEEE
Q psy5057 1 MFEKPVEAGDIVIRQGDDGDFFYVIESGRYEASVEI-DGEDKLMHAYEDKGSFGELALLYNM-PRAATIKATSTGSLWAM 78 (87)
Q Consensus 1 ~~~~~~~~g~~l~~~g~~~~~~~~i~~G~v~~~~~~-~~~~~~~~~~~~g~~~g~~~~~~~~-~~~~~~~a~~~~~~~~~ 78 (87)
++.+.|++|+.|+.+|++++.+|+|++|.++++... +|++..+..+.+|++||+.+++.+. ++..++.|.++|+++.+
T Consensus 20 ~~~~~~~kg~~l~~~g~~~~~~y~V~~G~v~~~~~~~~g~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~~a~~~~~v~~i 99 (211)
T PRK11753 20 CHIHKYPAKSTLIHAGEKAETLYYIVKGSVAVLIKDEEGKEMILSYLNQGDFIGELGLFEEGQERSAWVRAKTACEVAEI 99 (211)
T ss_pred CeEEEeCCCCEEEeCCCCCCeEEEEEeCEEEEEEECCCCCEEEEEEcCCCCEEeehhhccCCCCceEEEEEcCcEEEEEE
Confidence 357899999999999999999999999999999876 7899999999999999999888754 56789999999999999
Q ss_pred chhhhhcc
Q psy5057 79 DRKTFYCN 86 (87)
Q Consensus 79 ~~~~~~~~ 86 (87)
|++.|.++
T Consensus 100 ~~~~~~~l 107 (211)
T PRK11753 100 SYKKFRQL 107 (211)
T ss_pred cHHHHHHH
Confidence 99999875
No 4
>PRK09392 ftrB transcriptional activator FtrB; Provisional
Probab=99.81 E-value=5.8e-19 Score=102.73 Aligned_cols=85 Identities=15% Similarity=0.293 Sum_probs=79.6
Q ss_pred eeEEecCCCEEEecCCCCCeEEEEEeceEEEEEeeCCCceEEEEecCCCeEehhhhhcCCCceeEEEEecceeEEEEchh
Q psy5057 2 FEKPVEAGDIVIRQGDDGDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSFGELALLYNMPRAATIKATSTGSLWAMDRK 81 (87)
Q Consensus 2 ~~~~~~~g~~l~~~g~~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~a~~~~~~~~~~~~ 81 (87)
+.+.|++|+.|+++|++++.+|+|++|.++++...+|++..+..+.+|++||+.+++.+.++..++.|.++|+++.+|++
T Consensus 31 ~~~~~~~ge~l~~~g~~~~~~~~v~~G~v~~~~~~~~~~~~i~~~~~g~~~g~~~~~~~~~~~~~~~A~~~~~~~~i~~~ 110 (236)
T PRK09392 31 FLQRFPPGTMLITEGEPADFLFVVLDGLVELSASSQDRETTLAILRPVSTFILAAVVLDAPYLMSARTLTRSRVLMIPAE 110 (236)
T ss_pred ceeecCCCCEEEeCCCccceEEEEEeCEEEEEEcCCCceEEEEEeCCCchhhhHHHhCCCCCceEEEEcCceEEEEEeHH
Confidence 46789999999999999999999999999999877778889999999999999999999999999999999999999999
Q ss_pred hhhcc
Q psy5057 82 TFYCN 86 (87)
Q Consensus 82 ~~~~~ 86 (87)
.|.++
T Consensus 111 ~~~~l 115 (236)
T PRK09392 111 LVREA 115 (236)
T ss_pred HHHHH
Confidence 99875
No 5
>COG0664 Crp cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]
Probab=99.79 E-value=1.8e-18 Score=98.53 Aligned_cols=85 Identities=27% Similarity=0.418 Sum_probs=80.3
Q ss_pred eeEEecCCCEEEecCCCCCeEEEEEeceEEEEEee-CCCceEEEEecCCCeEehhhhhcCCCceeEEEEecceeEEEEch
Q psy5057 2 FEKPVEAGDIVIRQGDDGDFFYVIESGRYEASVEI-DGEDKLMHAYEDKGSFGELALLYNMPRAATIKATSTGSLWAMDR 80 (87)
Q Consensus 2 ~~~~~~~g~~l~~~g~~~~~~~~i~~G~v~~~~~~-~~~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~a~~~~~~~~~~~ 80 (87)
+.+.+++|+.|+.+|++.+.+|+|.+|.++++... +|++..+..+++|++||+.+++.+.++..++.|.++++++.+|+
T Consensus 24 ~~~~~~~g~~l~~~g~~~~~~y~v~~G~v~~~~~~~~G~~~~~~~~~~g~~fg~~~l~~~~~~~~~~~a~~~~~~~~~~~ 103 (214)
T COG0664 24 EVRKLPKGEVLFTEGEEADSLYIILSGIVKLYANTEDGREIILGFLGPGDFFGELALLGGDPRSASAVALTDVEVLEIPR 103 (214)
T ss_pred eeEeeCCCCEEEcCCCcCceEEEEEEeEEEEEEECCCCcEEEEEEecCCchhhhHHHhcCCCccceEEEcceEEEEEecH
Confidence 56789999999999999999999999999999987 78899999999999999999999888999999999999999999
Q ss_pred hhhhcc
Q psy5057 81 KTFYCN 86 (87)
Q Consensus 81 ~~~~~~ 86 (87)
+.|.++
T Consensus 104 ~~~~~~ 109 (214)
T COG0664 104 KDFLEL 109 (214)
T ss_pred HHHHHH
Confidence 999875
No 6
>cd00038 CAP_ED effector domain of the CAP family of transcription factors; members include CAP (or cAMP receptor protein (CRP)), which binds cAMP, FNR (fumarate and nitrate reduction), which uses an iron-sulfur cluster to sense oxygen) and CooA, a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. Cyclic nucleotide-binding domain similar to CAP are also present in cAMP- and cGMP-dependent protein kinases (cAPK and cGPK) and vertebrate cyclic nucleotide-gated ion-channels. Cyclic nucleotide-monophosphate binding domain; proteins that bind cyclic nucleotides (cAMP or cGMP) share a structural domain of about 120 residues; the best studied is the prokaryotic catabolite gene activator, CAP, where such a domain is known to be composed of three alpha-helices and a distinctive eight-stranded, antiparallel beta-barrel structure; three conserved glycine residues are thought to be essential for maintenance of
Probab=99.79 E-value=4.1e-18 Score=88.48 Aligned_cols=85 Identities=36% Similarity=0.531 Sum_probs=78.6
Q ss_pred eeEEecCCCEEEecCCCCCeEEEEEeceEEEEEee-CCCceEEEEecCCCeEehhhhhcCCCceeEEEEecceeEEEEch
Q psy5057 2 FEKPVEAGDIVIRQGDDGDFFYVIESGRYEASVEI-DGEDKLMHAYEDKGSFGELALLYNMPRAATIKATSTGSLWAMDR 80 (87)
Q Consensus 2 ~~~~~~~g~~l~~~g~~~~~~~~i~~G~v~~~~~~-~~~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~a~~~~~~~~~~~ 80 (87)
+.+.+++|+.|+.+|++.+++|+|.+|.++++... +|++..+..+.+|+++|..+++.+.++..++.|.++|+++.++.
T Consensus 18 ~~~~~~~g~~l~~~~~~~~~~~~i~~G~v~~~~~~~~g~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~a~~~~~~~~i~~ 97 (115)
T cd00038 18 EERRFPAGEVIIRQGDPADSLYIVLSGSVEVYKLDEDGREQIVGFLGPGDLFGELALLGNGPRSATVRALTDSELLVLPR 97 (115)
T ss_pred eeeeeCCCCEEEcCCCCCCeEEEEEeCEEEEEEECCCCcEEEEEecCCccCcChHHHhcCCCCCceEEEcCceEEEEEeH
Confidence 56789999999999999999999999999999876 67788999999999999998888888889999999999999999
Q ss_pred hhhhcc
Q psy5057 81 KTFYCN 86 (87)
Q Consensus 81 ~~~~~~ 86 (87)
+.|.++
T Consensus 98 ~~~~~~ 103 (115)
T cd00038 98 SDFRRL 103 (115)
T ss_pred HHHHHH
Confidence 999765
No 7
>PRK13918 CRP/FNR family transcriptional regulator; Provisional
Probab=99.78 E-value=8.4e-18 Score=95.81 Aligned_cols=81 Identities=19% Similarity=0.231 Sum_probs=73.0
Q ss_pred eeEEecCCCEEEecCC--CCCeEEEEEeceEEEEEee-CCCceEEEEecCCCeEehhhhhcCCCceeEEEEecceeEEEE
Q psy5057 2 FEKPVEAGDIVIRQGD--DGDFFYVIESGRYEASVEI-DGEDKLMHAYEDKGSFGELALLYNMPRAATIKATSTGSLWAM 78 (87)
Q Consensus 2 ~~~~~~~g~~l~~~g~--~~~~~~~i~~G~v~~~~~~-~~~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~a~~~~~~~~~ 78 (87)
+.+.|++|++|+.+|+ +.+.+|+|++|.++++... +|++.++..+.||++||+.++ ...++..++.|.++|+++.+
T Consensus 7 ~~~~~~kg~~l~~~Gd~~~~~~~y~I~~G~vr~~~~~~~G~e~~l~~~~~Gd~~G~~~~-~~~~~~~~~~A~~~~~v~~i 85 (202)
T PRK13918 7 DTVTYRPGAVILYPGVPGPSDMLYRVRSGLVRLHTVDDEGNALTLRYVRPGEYFGEEAL-AGAERAYFAEAVTDSRIDVL 85 (202)
T ss_pred ceeEecCCCEEEcCCCCCCCCeEEEEEeeEEEEEEECCCCCEEEEEEecCCCeechHHh-cCCCCCceEEEcCceEEEEE
Confidence 5678999999999999 7799999999999999887 899999999999999999655 45677889999999999999
Q ss_pred chhhh
Q psy5057 79 DRKTF 83 (87)
Q Consensus 79 ~~~~~ 83 (87)
|.+.|
T Consensus 86 ~~~~~ 90 (202)
T PRK13918 86 NPALM 90 (202)
T ss_pred EHHHc
Confidence 98765
No 8
>PRK09391 fixK transcriptional regulator FixK; Provisional
Probab=99.76 E-value=2.9e-17 Score=95.53 Aligned_cols=82 Identities=18% Similarity=0.243 Sum_probs=74.0
Q ss_pred eeEEecCCCEEEecCCCCCeEEEEEeceEEEEEee-CCCceEEEEecCCCeEehhhhhcCCCceeEEEEecceeEEEEch
Q psy5057 2 FEKPVEAGDIVIRQGDDGDFFYVIESGRYEASVEI-DGEDKLMHAYEDKGSFGELALLYNMPRAATIKATSTGSLWAMDR 80 (87)
Q Consensus 2 ~~~~~~~g~~l~~~g~~~~~~~~i~~G~v~~~~~~-~~~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~a~~~~~~~~~~~ 80 (87)
+.+.|++|+.|+.+|++++++|+|++|.++++..+ +|++.++..+.+|++||+. .+.++..++.|.++|+++.+|.
T Consensus 39 ~~~~~~kge~l~~~Gd~~~~ly~I~~G~vkl~~~~~~G~e~i~~~~~~Gd~fG~~---~~~~~~~~~~A~~ds~v~~i~~ 115 (230)
T PRK09391 39 SEFSYKKGEEIYGEGEPADYVYQVESGAVRTYRLLSDGRRQIGAFHLPGDVFGLE---SGSTHRFTAEAIVDTTVRLIKR 115 (230)
T ss_pred eeEEECCCCEEECCCCCCCeEEEEEeCEEEEEEECCCCcEEEEEEecCCceeccc---CCCcCCeEEEEcCceEEEEEEH
Confidence 46789999999999999999999999999999876 7888899999999999964 3456678999999999999999
Q ss_pred hhhhcc
Q psy5057 81 KTFYCN 86 (87)
Q Consensus 81 ~~~~~~ 86 (87)
+.|.++
T Consensus 116 ~~f~~l 121 (230)
T PRK09391 116 RSLEQA 121 (230)
T ss_pred HHHHHH
Confidence 999875
No 9
>TIGR03697 NtcA_cyano global nitrogen regulator NtcA, cyanobacterial. Members of this protein family, found in the cyanobacteria, are the global nitrogen regulator NtcA. This DNA-binding transcriptional regulator is required for expressing many different ammonia-repressible genes. The consensus NtcA-binding site is G T A N(8)T A C.
Probab=99.74 E-value=2.1e-17 Score=93.48 Aligned_cols=78 Identities=19% Similarity=0.235 Sum_probs=71.1
Q ss_pred CCEEEecCCCCCeEEEEEeceEEEEEee-CCCceEEEEecCCCeEehhhhhcCCCc--eeEEEEecceeEEEEchhhhhc
Q psy5057 9 GDIVIRQGDDGDFFYVIESGRYEASVEI-DGEDKLMHAYEDKGSFGELALLYNMPR--AATIKATSTGSLWAMDRKTFYC 85 (87)
Q Consensus 9 g~~l~~~g~~~~~~~~i~~G~v~~~~~~-~~~~~~~~~~~~g~~~g~~~~~~~~~~--~~~~~a~~~~~~~~~~~~~~~~ 85 (87)
|+.|+++|++.+++|+|++|.++++... +|++..+..+.||++||+.+++.+.+. ..++.|.++|+++.+|++.|.+
T Consensus 1 g~~l~~~g~~~~~~~~i~~G~v~~~~~~~~G~e~~l~~~~~g~~~G~~~~~~~~~~~~~~~~~A~~~~~v~~i~~~~~~~ 80 (193)
T TIGR03697 1 GKTIFFPGDPAEKVYFLRRGAVKLSRVYESGEEITVALLRENSVFGVLSLITGHRSDRFYHAVAFTRVELLAVPIEQVEK 80 (193)
T ss_pred CCceecCCCCCCcEEEEEecEEEEEEeCCCCcEeeeEEccCCCEeeeeeeccCCCCccceEEEEecceEEEEeeHHHHHH
Confidence 7899999999999999999999999877 788999999999999999988877654 4679999999999999999987
Q ss_pred c
Q psy5057 86 N 86 (87)
Q Consensus 86 ~ 86 (87)
+
T Consensus 81 l 81 (193)
T TIGR03697 81 A 81 (193)
T ss_pred H
Confidence 5
No 10
>smart00100 cNMP Cyclic nucleotide-monophosphate binding domain. Catabolite gene activator protein (CAP) is a prokaryotic homologue of eukaryotic cNMP-binding domains, present in ion channels, and cNMP-dependent kinases.
Probab=99.74 E-value=9.6e-17 Score=83.66 Aligned_cols=85 Identities=34% Similarity=0.476 Sum_probs=75.8
Q ss_pred eeEEecCCCEEEecCCCCCeEEEEEeceEEEEEee-CCCceEEEEecCCCeEehhhhh--cCCCceeEEEEecceeEEEE
Q psy5057 2 FEKPVEAGDIVIRQGDDGDFFYVIESGRYEASVEI-DGEDKLMHAYEDKGSFGELALL--YNMPRAATIKATSTGSLWAM 78 (87)
Q Consensus 2 ~~~~~~~g~~l~~~g~~~~~~~~i~~G~v~~~~~~-~~~~~~~~~~~~g~~~g~~~~~--~~~~~~~~~~a~~~~~~~~~ 78 (87)
+.+.+++|++|+++|++.+.+|+|.+|.++++..+ +|+...+..+.+|++||+.+++ ...+...++.+.++++++.+
T Consensus 18 ~~~~~~~g~~l~~~g~~~~~~y~v~~G~v~~~~~~~~g~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~~~~~~~~~v~~~ 97 (120)
T smart00100 18 EPVRYPAGEVIIRQGDVGDSFYIILSGEVRVYKVLEDGREQILGILGPGDFFGELALLTNSRRAASATAVALELATLLRI 97 (120)
T ss_pred eEEEeCCCCEEEeCCCcCCcEEEEEeeEEEEEEECCCCceEEEEeecCCceechhhhccCCCcccceEEEEEeeEEEEcc
Confidence 46789999999999999999999999999999876 7888899999999999999877 34566788999999999999
Q ss_pred chhhhhcc
Q psy5057 79 DRKTFYCN 86 (87)
Q Consensus 79 ~~~~~~~~ 86 (87)
+.+.+.+.
T Consensus 98 ~~~~~~~~ 105 (120)
T smart00100 98 DFRDFLQL 105 (120)
T ss_pred CHHHHHHH
Confidence 99988754
No 11
>PRK11161 fumarate/nitrate reduction transcriptional regulator; Provisional
Probab=99.74 E-value=4.3e-17 Score=94.81 Aligned_cols=82 Identities=18% Similarity=0.213 Sum_probs=73.1
Q ss_pred EEecCCCEEEecCCCCCeEEEEEeceEEEEEee-CCCceEEEEecCCCeEehhhhhcCCCceeEEEEecceeEEEEchhh
Q psy5057 4 KPVEAGDIVIRQGDDGDFFYVIESGRYEASVEI-DGEDKLMHAYEDKGSFGELALLYNMPRAATIKATSTGSLWAMDRKT 82 (87)
Q Consensus 4 ~~~~~g~~l~~~g~~~~~~~~i~~G~v~~~~~~-~~~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~a~~~~~~~~~~~~~ 82 (87)
+.|++|+.|+.+|++++++|+|.+|.++++..+ +|++.++.++.||++||+.++.. .....++.|.++++++.+|++.
T Consensus 40 ~~~~kge~l~~~Gd~~~~ly~v~~G~v~~~~~~~~G~e~i~~~~~~gd~~g~~~~~~-~~~~~~~~a~~~~~i~~ip~~~ 118 (235)
T PRK11161 40 KPIQKGQTLFKAGDELKSLYAIRSGTIKSYTITEQGDEQITGFHLAGDLVGFDAIGS-GQHPSFAQALETSMVCEIPFET 118 (235)
T ss_pred eeecCCCEeECCCCCcceEEEEeeceEEEEEECCCCCEEEEEeccCCceeccccccC-CCCcceEEEeccEEEEEEEHHH
Confidence 579999999999999999999999999999887 78888999999999999876544 4455689999999999999999
Q ss_pred hhcc
Q psy5057 83 FYCN 86 (87)
Q Consensus 83 ~~~~ 86 (87)
|.++
T Consensus 119 f~~l 122 (235)
T PRK11161 119 LDDL 122 (235)
T ss_pred HHHH
Confidence 9875
No 12
>PLN03192 Voltage-dependent potassium channel; Provisional
Probab=99.71 E-value=1e-16 Score=106.51 Aligned_cols=86 Identities=20% Similarity=0.264 Sum_probs=80.2
Q ss_pred CeeEEecCCCEEEecCCCCCeEEEEEeceEEEEEeeCCCceEEEEecCCCeEehhhhhcCCCceeEEEEecceeEEEEch
Q psy5057 1 MFEKPVEAGDIVIRQGDDGDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSFGELALLYNMPRAATIKATSTGSLWAMDR 80 (87)
Q Consensus 1 ~~~~~~~~g~~l~~~g~~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~a~~~~~~~~~~~ 80 (87)
|+.+.|+||+.|+.+|+.++.+|+|.+|.+++....++++.++..+.+|++||+.+++.+.++..+++|.++|+++.+++
T Consensus 397 ~~~~~~~pge~I~~qge~~~~lY~I~~G~V~i~~~~~~~e~~l~~l~~Gd~FGE~~~l~~~p~~~t~ra~~~s~ll~l~~ 476 (823)
T PLN03192 397 MKAEYIPPREDVIMQNEAPDDVYIVVSGEVEIIDSEGEKERVVGTLGCGDIFGEVGALCCRPQSFTFRTKTLSQLLRLKT 476 (823)
T ss_pred hheeeeCCCCEEEECCCCCceEEEEEecEEEEEEecCCcceeeEEccCCCEecchHHhcCCCCCCeEEEcccEEEEEEEH
Confidence 45778999999999999999999999999999876677888899999999999999999999999999999999999999
Q ss_pred hhhhcc
Q psy5057 81 KTFYCN 86 (87)
Q Consensus 81 ~~~~~~ 86 (87)
+.|.++
T Consensus 477 ~~f~~l 482 (823)
T PLN03192 477 STLIEA 482 (823)
T ss_pred HHHHHH
Confidence 999875
No 13
>PLN02868 acyl-CoA thioesterase family protein
Probab=99.66 E-value=3.8e-15 Score=92.89 Aligned_cols=84 Identities=23% Similarity=0.241 Sum_probs=74.6
Q ss_pred CeeEEecCCCEEEecCCCCCeEEEEEeceEEEEEeeCCCceEEEEecCCCeEehhhhhcCCCceeEEEEecceeEEEEch
Q psy5057 1 MFEKPVEAGDIVIRQGDDGDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSFGELALLYNMPRAATIKATSTGSLWAMDR 80 (87)
Q Consensus 1 ~~~~~~~~g~~l~~~g~~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~a~~~~~~~~~~~ 80 (87)
++.+.|++|++|+++|+..+.+|+|++|.++++....+.+..+..+++|++||+. +.+.++..+++|.++|+++.+|+
T Consensus 31 ~~~~~~~~Ge~I~~~Gd~~~~lyiI~~G~V~v~~~~~~ge~~l~~l~~Gd~fG~~--l~~~~~~~~~~A~~d~~v~~ip~ 108 (413)
T PLN02868 31 VVPKRYGKGEYVVREGEPGDGLYFIWKGEAEVSGPAEEESRPEFLLKRYDYFGYG--LSGSVHSADVVAVSELTCLVLPH 108 (413)
T ss_pred ceEEEECCCCEEEeCCCcCceEEEEEeCEEEEEEECCCCcEEEEEeCCCCEeehh--hCCCCcccEEEECCCEEEEEEcH
Confidence 3578999999999999999999999999999998773337888999999999975 67778889999999999999999
Q ss_pred hhhhcc
Q psy5057 81 KTFYCN 86 (87)
Q Consensus 81 ~~~~~~ 86 (87)
+.|..+
T Consensus 109 ~~~~~~ 114 (413)
T PLN02868 109 EHCHLL 114 (413)
T ss_pred HHHhhh
Confidence 998653
No 14
>KOG1113|consensus
Probab=99.63 E-value=1.7e-15 Score=91.13 Aligned_cols=83 Identities=52% Similarity=0.904 Sum_probs=78.0
Q ss_pred CeeEEecCCCEEEecCCCCCeEEEEEeceEEEEEeeCCCceEEEEecCCCeEehhhhhcCCCceeEEEEecceeEEEEch
Q psy5057 1 MFEKPVEAGDIVIRQGDDGDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSFGELALLYNMPRAATIKATSTGSLWAMDR 80 (87)
Q Consensus 1 ~~~~~~~~g~~l~~~g~~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~a~~~~~~~~~~~ 80 (87)
|.++.+++|+.|+.+|+..+.+|+|-+|.+.++. +...+..+.||..||+++++.+.|+.+++.|.+++.+|.+++
T Consensus 145 m~~~~v~~G~~Vi~qGdeGd~fYvI~kGt~dVyv----~~~~v~~~~~g~sFGElALmyn~PRaATv~a~t~~klWgldr 220 (368)
T KOG1113|consen 145 MFEKRVKAGETVIKQGDEGDNFYVIDKGTFDVYV----NGTYVTTYSPGGSFGELALMYNPPRAATVVAKSLKKLWGLDR 220 (368)
T ss_pred hceeeecCCcEEEecCCcCCcEEEEecceEEEEE----CCeEEeeeCCCCchhhhHhhhCCCcccceeeccccceEEEee
Confidence 5678899999999999999999999999999999 777889999999999999999999999999999999999999
Q ss_pred hhhhccC
Q psy5057 81 KTFYCNL 87 (87)
Q Consensus 81 ~~~~~~~ 87 (87)
..|.+++
T Consensus 221 ~SFrrIi 227 (368)
T KOG1113|consen 221 TSFRRII 227 (368)
T ss_pred ceeEEEe
Confidence 9987753
No 15
>KOG0498|consensus
Probab=99.60 E-value=1.3e-15 Score=99.30 Aligned_cols=86 Identities=21% Similarity=0.281 Sum_probs=78.9
Q ss_pred CeeEEecCCCEEEecCCCCCeEEEEEeceEEEEEeeCCCceEEEEecCCCeEe-hhhhhcC-CCceeEEEEecceeEEEE
Q psy5057 1 MFEKPVEAGDIVIRQGDDGDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSFG-ELALLYN-MPRAATIKATSTGSLWAM 78 (87)
Q Consensus 1 ~~~~~~~~g~~l~~~g~~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~g-~~~~~~~-~~~~~~~~a~~~~~~~~~ 78 (87)
++...+++|+.|+++|++.+.+|||.+|.+++....+|.......+++||+|| ++..... .|...+++|++.|+++.+
T Consensus 442 lk~~~f~pge~iireGd~v~~myFI~rG~le~~~~~~g~~~~~~~L~~Gd~~GeEl~~~~~~~p~t~TVralt~~el~~L 521 (727)
T KOG0498|consen 442 LKPEYFTPGEYIIREGDPVTDMYFIVRGSLESITTDGGGFFVVAILGPGDFFGEELLTWCLDLPQTRTVRALTYCELFRL 521 (727)
T ss_pred hhhhccCCCCeEEecCCccceeEEEEeeeEEEEEccCCceEEEEEecCCCccchHHHHHHhcCCCCceeehhhhhhHHhc
Confidence 35678899999999999999999999999999988888889999999999999 7888777 888999999999999999
Q ss_pred chhhhhcc
Q psy5057 79 DRKTFYCN 86 (87)
Q Consensus 79 ~~~~~~~~ 86 (87)
++++|...
T Consensus 522 ~~~dL~~V 529 (727)
T KOG0498|consen 522 SADDLKEV 529 (727)
T ss_pred cHHHHHHH
Confidence 99999764
No 16
>KOG0614|consensus
Probab=99.56 E-value=1.4e-15 Score=95.85 Aligned_cols=83 Identities=37% Similarity=0.665 Sum_probs=79.1
Q ss_pred CeeEEecCCCEEEecCCCCCeEEEEEeceEEEEEeeCCCceEEEEecCCCeEehhhhhcCCCceeEEEEecceeEEEEch
Q psy5057 1 MFEKPVEAGDIVIRQGDDGDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSFGELALLYNMPRAATIKATSTGSLWAMDR 80 (87)
Q Consensus 1 ~~~~~~~~g~~l~~~g~~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~a~~~~~~~~~~~ 80 (87)
|..+.|++|++|+++|++.+++|++.+|.+.+.+ ....+...++|..||+++++.+.++.++++|++++.+|.|++
T Consensus 177 Myp~~~~~gs~IIrege~Gs~~yV~aeG~~~V~~----~g~ll~~m~~gtvFGELAILynctRtAsV~alt~~~lWaidR 252 (732)
T KOG0614|consen 177 MYPVEYRAGSWIIREGEPGSHLYVSAEGELQVSR----EGKLLGKMGAGTVFGELAILYNCTRTASVRALTDVRLWAIDR 252 (732)
T ss_pred hCcccccCCcEEEecCCCCceEEEeecceEEEee----CCeeeeccCCchhhhHHHHHhCCcchhhhhhhhhhhHHHHHH
Confidence 5678999999999999999999999999999999 888999999999999999999999999999999999999999
Q ss_pred hhhhccC
Q psy5057 81 KTFYCNL 87 (87)
Q Consensus 81 ~~~~~~~ 87 (87)
+.|+.+|
T Consensus 253 ~vFq~IM 259 (732)
T KOG0614|consen 253 EVFQAIM 259 (732)
T ss_pred HHHHHHH
Confidence 9998763
No 17
>KOG0500|consensus
Probab=99.46 E-value=7.7e-13 Score=82.74 Aligned_cols=84 Identities=19% Similarity=0.449 Sum_probs=72.5
Q ss_pred CeeEEecCCCEEEecCCCCCeEEEEEeceEEEEEeeCCCceEEEEecCCCeEehhhhhc------CCCceeEEEEeccee
Q psy5057 1 MFEKPVEAGDIVIRQGDDGDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSFGELALLY------NMPRAATIKATSTGS 74 (87)
Q Consensus 1 ~~~~~~~~g~~l~~~g~~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~g~~~~~~------~~~~~~~~~a~~~~~ 74 (87)
++.+.|.|||.|++.|+.+..+|+|.+|.+.+.. +++......+++|++||+++++. +..|+++++.+..+.
T Consensus 330 lk~qvfSPgDyICrKGdvgkEMyIVk~G~L~Vv~--dDg~t~~~~L~~G~~FGEisIlni~g~~~gNRRtanvrSvGYSD 407 (536)
T KOG0500|consen 330 LKPQVFSPGDYICRKGDVGKEMYIVKEGKLAVVA--DDGVTVFVTLKAGSVFGEISILNIKGNKNGNRRTANVRSVGYSD 407 (536)
T ss_pred hcceeeCCCCeEEecCcccceEEEEEccEEEEEe--cCCcEEEEEecCCceeeeeEEEEEcCcccCCcceeeeeeeccce
Confidence 4678899999999999999999999999998765 33445678899999999998764 456789999999999
Q ss_pred EEEEchhhhhcc
Q psy5057 75 LWAMDRKTFYCN 86 (87)
Q Consensus 75 ~~~~~~~~~~~~ 86 (87)
+++++++++.++
T Consensus 408 lfvLskdDl~~a 419 (536)
T KOG0500|consen 408 LFVLSKDDLWEA 419 (536)
T ss_pred eeEeeHHHHHHH
Confidence 999999998764
No 18
>COG2905 Predicted signal-transduction protein containing cAMP-binding and CBS domains [Signal transduction mechanisms]
Probab=99.41 E-value=1.5e-12 Score=82.74 Aligned_cols=82 Identities=18% Similarity=0.304 Sum_probs=74.7
Q ss_pred eeEEecCCCEEEecCCCCCeEEEEEeceEEEEEeeCCCceEEEEecCCCeEehhhhhcCCCceeEEEEecceeEEEEchh
Q psy5057 2 FEKPVEAGDIVIRQGDDGDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSFGELALLYNMPRAATIKATSTGSLWAMDRK 81 (87)
Q Consensus 2 ~~~~~~~g~~l~~~g~~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~a~~~~~~~~~~~~ 81 (87)
....|++|++|+..+.+.+++|+|++|.+.+... +......+..|+.||..++++..+....+.|++|+.++.+|++
T Consensus 31 ~v~yy~kge~ii~~~~p~~~l~vi~kG~vev~~~---~g~v~~~~~~gdlFg~~~l~~~~~~~~~~~aeedsl~y~lp~s 107 (610)
T COG2905 31 EVKYYRKGEIIIYAGSPVHYLYVIRKGVVEVRSD---GGEVLDRLAAGDLFGFSSLFTELNKQRYMAAEEDSLCYLLPKS 107 (610)
T ss_pred ccccccCCCeeecCCCCcceeEEEEeceeeEEcC---CCeeeeeeccCccccchhhcccCCCcceeEeeccceEEecCHH
Confidence 4578999999999999999999999999998764 4448999999999999999999988899999999999999999
Q ss_pred hhhcc
Q psy5057 82 TFYCN 86 (87)
Q Consensus 82 ~~~~~ 86 (87)
.|.++
T Consensus 108 ~F~ql 112 (610)
T COG2905 108 VFMQL 112 (610)
T ss_pred HHHHH
Confidence 99875
No 19
>KOG0614|consensus
Probab=99.33 E-value=1e-12 Score=83.25 Aligned_cols=86 Identities=29% Similarity=0.438 Sum_probs=77.5
Q ss_pred eeEEecCCCEEEecCCCCCeEEEEEeceEEEEEee--CCCceEEEEecCCCeEehhhhhcCCCceeEEEEecc-eeEEEE
Q psy5057 2 FEKPVEAGDIVIRQGDDGDFFYVIESGRYEASVEI--DGEDKLMHAYEDKGSFGELALLYNMPRAATIKATST-GSLWAM 78 (87)
Q Consensus 2 ~~~~~~~g~~l~~~g~~~~~~~~i~~G~v~~~~~~--~~~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~a~~~-~~~~~~ 78 (87)
+..+|.+|+.|+++|+..+.+|+|.+|.|.+.... ++.+..+..+..||+||+.+++...-++++++|..+ ++++.+
T Consensus 296 e~~~Yd~g~yIirqge~G~~ffii~~G~V~vtq~~e~~~q~~~lr~l~kGd~FGE~al~~edvRtAniia~~~gv~cl~l 375 (732)
T KOG0614|consen 296 EEEYYDAGEYIIRQGEKGDTFFIISKGTVKVTQQDEGSTQPQELRTLNKGDYFGERALLGEDVRTANIIAQAPGVECLTL 375 (732)
T ss_pred HHHhhcCCceEEeecCCCCeEEEEecceEEEeecCCCCCchhHHhhccccchhhHHHhhccCccchhhhccCCCceEEEe
Confidence 34578999999999999999999999999999887 456677899999999999999999999999999888 899999
Q ss_pred chhhhhccC
Q psy5057 79 DRKTFYCNL 87 (87)
Q Consensus 79 ~~~~~~~~~ 87 (87)
+++.|.+++
T Consensus 376 DresF~~li 384 (732)
T KOG0614|consen 376 DRESFKKLI 384 (732)
T ss_pred cHHHHHHhc
Confidence 999998764
No 20
>KOG1113|consensus
Probab=99.31 E-value=7e-12 Score=75.85 Aligned_cols=83 Identities=39% Similarity=0.536 Sum_probs=75.3
Q ss_pred eeEEecCCCEEEecCCCCCeEEEEEeceEEEEEeeCCCceEEEEecCCCeEehhhhhcCCCceeEEEEecceeEEEEchh
Q psy5057 2 FEKPVEAGDIVIRQGDDGDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSFGELALLYNMPRAATIKATSTGSLWAMDRK 81 (87)
Q Consensus 2 ~~~~~~~g~~l~~~g~~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~a~~~~~~~~~~~~ 81 (87)
..+.|++|+.|..+|+..+.+|+|.+|.+.+....+| ..+ .++.+++||+.+++.+.++.+++.|.....+..++++
T Consensus 264 ~~k~y~~G~~Vi~qg~~ge~f~~i~eGEvdv~~~~~~--v~v-kl~~~dyfge~al~~~~pr~Atv~a~~~~kc~~~dk~ 340 (368)
T KOG1113|consen 264 GTKSYKDGERVIVQGDQGEHFYIIEEGEVDVLKKRDG--VEV-KLKKGDYFGELALLKNLPRAATVVAKGRLKCAKLDKP 340 (368)
T ss_pred ceeeccCCceEEeccCCcceEEEecccccchhhccCC--eEE-EechhhhcchHHHHhhchhhceeeccCCceeeeeChH
Confidence 4678999999999999999999999999998876543 445 9999999999999999999999999999999999999
Q ss_pred hhhccC
Q psy5057 82 TFYCNL 87 (87)
Q Consensus 82 ~~~~~~ 87 (87)
.|..++
T Consensus 341 ~ferll 346 (368)
T KOG1113|consen 341 RFERLL 346 (368)
T ss_pred HHHHHh
Confidence 998764
No 21
>KOG2968|consensus
Probab=99.25 E-value=3.6e-12 Score=84.44 Aligned_cols=83 Identities=27% Similarity=0.430 Sum_probs=76.0
Q ss_pred eeEEecCCCEEEecCCCCCeEEEEEeceEEEEEee-CCCceEEEEecCCCeEehhhhhcCCCceeEEEEecceeEEEEch
Q psy5057 2 FEKPVEAGDIVIRQGDDGDFFYVIESGRYEASVEI-DGEDKLMHAYEDKGSFGELALLYNMPRAATIKATSTGSLWAMDR 80 (87)
Q Consensus 2 ~~~~~~~g~~l~~~g~~~~~~~~i~~G~v~~~~~~-~~~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~a~~~~~~~~~~~ 80 (87)
......+|+.++++||..+.+|+|+.|+++..... .|+...+..++.||.+|..+.++++++..++.|+.++++.++|.
T Consensus 509 dWv~l~~g~alyrqgD~Sd~iyvVl~GRlRsv~~~~~~k~~i~~EygrGd~iG~~E~lt~~~R~tTv~AvRdSelariPe 588 (1158)
T KOG2968|consen 509 DWVRLEPGQALYRQGDSSDSIYVVLNGRLRSVIRQSGGKKEIVGEYGRGDLIGEVEMLTKQPRATTVMAVRDSELARIPE 588 (1158)
T ss_pred ceEEeccccHHHhcCCccCcEEEEecCeehhhhhccCccchhhhhccCcceeehhHHhhcCCccceEEEEeehhhhhccH
Confidence 46788999999999999999999999999988765 55666899999999999999999999999999999999999998
Q ss_pred hhhh
Q psy5057 81 KTFY 84 (87)
Q Consensus 81 ~~~~ 84 (87)
..|.
T Consensus 589 ~l~~ 592 (1158)
T KOG2968|consen 589 GLLN 592 (1158)
T ss_pred HHHH
Confidence 8764
No 22
>KOG0501|consensus
Probab=99.16 E-value=1.5e-11 Score=79.07 Aligned_cols=80 Identities=20% Similarity=0.298 Sum_probs=70.3
Q ss_pred eEEecCCCEEEecCCCCCeEEEEEeceEEEEEeeCCCceEEEEecCCCeEehhhhhcCC--CceeEEEEecceeEEEEch
Q psy5057 3 EKPVEAGDIVIRQGDDGDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSFGELALLYNM--PRAATIKATSTGSLWAMDR 80 (87)
Q Consensus 3 ~~~~~~g~~l~~~g~~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~g~~~~~~~~--~~~~~~~a~~~~~~~~~~~ 80 (87)
..+..||+.|++.|+..+.++||.+|.+++.. ...++.+++.||+||...+.... .+.++++|++.|.++.|.+
T Consensus 573 ~~H~APGDLlYHtGESvDaLcFvVsGSLEVIQ----DDEVVAILGKGDVFGD~FWK~~t~~qs~ANVRALTYcDLH~IKr 648 (971)
T KOG0501|consen 573 TNHCAPGDLLYHTGESVDALCFVVSGSLEVIQ----DDEVVAILGKGDVFGDEFWKENTLGQSAANVRALTYCDLHMIKR 648 (971)
T ss_pred hccCCCcceeeecCCccceEEEEEecceEEee----cCcEEEEeecCccchhHHhhhhhhhhhhhhhhhhhhhhhhHHhH
Confidence 35679999999999999999999999999998 78899999999999997666553 3458999999999999999
Q ss_pred hhhhcc
Q psy5057 81 KTFYCN 86 (87)
Q Consensus 81 ~~~~~~ 86 (87)
+.+.+.
T Consensus 649 d~Ll~V 654 (971)
T KOG0501|consen 649 DKLLKV 654 (971)
T ss_pred HHHHHH
Confidence 988653
No 23
>KOG0499|consensus
Probab=99.08 E-value=5e-10 Score=72.10 Aligned_cols=84 Identities=25% Similarity=0.379 Sum_probs=72.4
Q ss_pred eeEEecCCCEEEecCCCCCeEEEEEeceEEEEEeeCCCceEEEEecCCCeEehhhhhc---CCCceeEEEEecceeEEEE
Q psy5057 2 FEKPVEAGDIVIRQGDDGDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSFGELALLY---NMPRAATIKATSTGSLWAM 78 (87)
Q Consensus 2 ~~~~~~~g~~l~~~g~~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~g~~~~~~---~~~~~~~~~a~~~~~~~~~ 78 (87)
..+.|=+||.++..|+....+|||..|.+++.--.+ +..++..+.+|++||+++++. +..+.++++|...|.++.+
T Consensus 551 RsV~yLPgDfVCkKGeiGkEMYIIk~GqvQVlGGp~-~~~Vl~tL~~GsVFGEISLLaigG~nRRTAnV~a~Gf~nLfvL 629 (815)
T KOG0499|consen 551 RSVLYLPGDFVCKKGEIGKEMYIIKHGQVQVLGGPD-GTKVLVTLKAGSVFGEISLLAIGGGNRRTANVVAHGFANLFVL 629 (815)
T ss_pred hceeecCCceeeecccccceeEEeecceEEEecCCC-CCEEEEEecccceeeeeeeeeecCCCccchhhhhcccceeeEe
Confidence 457789999999999999999999999998775433 566888999999999998764 3556799999999999999
Q ss_pred chhhhhcc
Q psy5057 79 DRKTFYCN 86 (87)
Q Consensus 79 ~~~~~~~~ 86 (87)
+++++.++
T Consensus 630 ~KkdLnei 637 (815)
T KOG0499|consen 630 DKKDLNEI 637 (815)
T ss_pred cHhHHHHH
Confidence 99998775
No 24
>PRK11832 putative DNA-binding transcriptional regulator; Provisional
Probab=98.89 E-value=4.9e-08 Score=56.13 Aligned_cols=82 Identities=10% Similarity=-0.004 Sum_probs=66.2
Q ss_pred eeEEecCCCEE-EecCCCCCeEEEEEeceEEEEEeeCCCceEEEEecCCCeEehhhhhcCCCceeEEEEecceeEEEEch
Q psy5057 2 FEKPVEAGDIV-IRQGDDGDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSFGELALLYNMPRAATIKATSTGSLWAMDR 80 (87)
Q Consensus 2 ~~~~~~~g~~l-~~~g~~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~a~~~~~~~~~~~ 80 (87)
+...+++|..+ +..+.....++++++|.+.+ +..+ ...+....++.+||....+......+.++|.++|+++.+|.
T Consensus 23 ~~~~~~~~~~~i~~~~~~~~~~~ll~~G~vsi-rr~d--~ll~~t~~aP~IlGl~~~~~~~~~~~~l~ae~~c~~~~i~~ 99 (207)
T PRK11832 23 TRFEFNNEKQVIFSSDVNNEDTFVILEGVISL-RREE--NVLIGITQAPYIMGLADGLMKNDIPYKLISEGNCTGYHLPA 99 (207)
T ss_pred CeEecCCCcEEeccccCCCceEEEEEeceEEE-EecC--CeEEEeccCCeEeecccccCCCCceEEEEEcCccEEEEeeH
Confidence 35678899987 65555568899999999999 4323 37778889999999877777666678899999999999999
Q ss_pred hhhhcc
Q psy5057 81 KTFYCN 86 (87)
Q Consensus 81 ~~~~~~ 86 (87)
++|.++
T Consensus 100 ~~~~~i 105 (207)
T PRK11832 100 KQTITL 105 (207)
T ss_pred HHHHHH
Confidence 999765
No 25
>PF04831 Popeye: Popeye protein conserved region; InterPro: IPR006916 The Popeye (POP) family of proteins, is restricted to vertebrates and is preferentially expressed in developing and adult striated muscle. It is represented by a conserved region which includes three potential transmembrane domains []. The strong conservation of POP genes during evolution and their preferential expression in heart and skeletal muscle suggest that these novel proteins may have an important function in these tissues in vertebrates.; GO: 0016020 membrane
Probab=98.89 E-value=6.4e-08 Score=52.89 Aligned_cols=81 Identities=17% Similarity=0.172 Sum_probs=68.7
Q ss_pred eeEEecCCCEEEecCC-CCCeEEEEEeceEEEEEeeCCCceEEEEecCCCeEehhhhhcCCC-----ceeEEEEecceeE
Q psy5057 2 FEKPVEAGDIVIRQGD-DGDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSFGELALLYNMP-----RAATIKATSTGSL 75 (87)
Q Consensus 2 ~~~~~~~g~~l~~~g~-~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~g~~~~~~~~~-----~~~~~~a~~~~~~ 75 (87)
+....++|+...-+|. +.+.+.++++|++++.. +...+..+.|.++....++.+-.+ -++++.|.++|..
T Consensus 29 ~i~~L~~~~~YAvE~~T~~drLSlLLsGr~~Vs~----~g~fLH~I~p~qFlDSPEW~s~~~s~~~~FQVTitA~~~Cry 104 (153)
T PF04831_consen 29 EIRTLKKGETYAVEGKTPIDRLSLLLSGRMRVSC----DGRFLHYIYPYQFLDSPEWESLRPSEDDKFQVTITAEEDCRY 104 (153)
T ss_pred eEEEecCCceeeecCCcccceEeEEEcCcEEEEE----CCEeeEeecccccccChhhhccccCCCCeEEEEEEEcCCcEE
Confidence 4577899999999986 56999999999999999 888899999999888776666533 3589999999999
Q ss_pred EEEchhhhhcc
Q psy5057 76 WAMDRKTFYCN 86 (87)
Q Consensus 76 ~~~~~~~~~~~ 86 (87)
+.++++.+..+
T Consensus 105 l~W~R~kL~~~ 115 (153)
T PF04831_consen 105 LCWPREKLYLL 115 (153)
T ss_pred EEEEHHHHHHH
Confidence 99999988654
No 26
>KOG2968|consensus
Probab=98.68 E-value=1.8e-07 Score=63.02 Aligned_cols=80 Identities=23% Similarity=0.211 Sum_probs=69.7
Q ss_pred cCCCEEEecCCCCCeEEEEEeceEEEEEee-------CCCceEEEEecCCCeEehhhhhcCCCceeEEEEecceeEEEEc
Q psy5057 7 EAGDIVIRQGDDGDFFYVIESGRYEASVEI-------DGEDKLMHAYEDKGSFGELALLYNMPRAATIKATSTGSLWAMD 79 (87)
Q Consensus 7 ~~g~~l~~~g~~~~~~~~i~~G~v~~~~~~-------~~~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~a~~~~~~~~~~ 79 (87)
+...+|.++|+....+|++.+|.+.++... .........++||+++|..+++++.++..+++|.+++.+..++
T Consensus 392 keitiiv~q~a~~~gl~~ii~g~l~v~~sm~~~s~~~~~~~~~~f~v~pG~ivgyla~lt~e~S~~tirArsdt~v~~is 471 (1158)
T KOG2968|consen 392 KEITIIVEQGARDVGLYYIIKGSLSVYQSMYDVSGNLVLAGMLLFVVGPGEIVGYLAILTNEPSFITIRARSDTRVLFIS 471 (1158)
T ss_pred cceEEEEecccccceeeEEeecceeeeehhcccccccccccceEEEecCCceechhhhhcCCcceEEEEEecceEEEEee
Confidence 345567899999999999999999777654 1245668899999999999999999999999999999999999
Q ss_pred hhhhhcc
Q psy5057 80 RKTFYCN 86 (87)
Q Consensus 80 ~~~~~~~ 86 (87)
+++|.++
T Consensus 472 rs~l~~~ 478 (1158)
T KOG2968|consen 472 RSDLERF 478 (1158)
T ss_pred HHHHHHH
Confidence 9999875
No 27
>KOG3542|consensus
Probab=97.63 E-value=0.00011 Score=49.23 Aligned_cols=76 Identities=18% Similarity=0.226 Sum_probs=59.4
Q ss_pred cCCCEEEecCCCCCeEEEEEeceEEEEEeeCCCceEEEEecCCCeEehhhhhcCCCceeEEEE-ecceeEEEEchhhhhc
Q psy5057 7 EAGDIVIRQGDDGDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSFGELALLYNMPRAATIKA-TSTGSLWAMDRKTFYC 85 (87)
Q Consensus 7 ~~g~~l~~~g~~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~a-~~~~~~~~~~~~~~~~ 85 (87)
.+|.+++..|+..+.+++|+.|.+++.+. .--...+.-|+.||.-+-+..+.....+++ +.||++.++...+|..
T Consensus 311 ~AGtivL~dgeeLDSWsVIlNG~VEv~~P----dGk~e~l~mGnSFG~~PT~dkqym~G~mRTkVDDCqFVciaqqDycr 386 (1283)
T KOG3542|consen 311 DAGTIVLADGEELDSWSVILNGCVEVVKP----DGKREELKMGNSFGAEPTPDKQYMIGEMRTKVDDCQFVCIAQQDYCR 386 (1283)
T ss_pred hcCeEEecCCcccceeEEEecceEEEecC----CCceEEeecccccCCCCCcchhhhhhhhheecccceEEEeehhhHHH
Confidence 68999999999999999999999999884 223356677899997766665555544444 5679999999888866
Q ss_pred c
Q psy5057 86 N 86 (87)
Q Consensus 86 ~ 86 (87)
+
T Consensus 387 I 387 (1283)
T KOG3542|consen 387 I 387 (1283)
T ss_pred H
Confidence 4
No 28
>KOG3542|consensus
Probab=96.04 E-value=0.019 Score=39.14 Aligned_cols=64 Identities=20% Similarity=0.290 Sum_probs=45.6
Q ss_pred EEecCCCEEEecCCCCCeEEEEEeceEEEEEeeCCCceEEEEecCCCeEehhhhhcCCCceeEEEEecceeEEEEc
Q psy5057 4 KPVEAGDIVIRQGDDGDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSFGELALLYNMPRAATIKATSTGSLWAMD 79 (87)
Q Consensus 4 ~~~~~g~~l~~~g~~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~a~~~~~~~~~~ 79 (87)
.+++...+||+.|+.+..+|++++|.+-+.- ..+-|..+||-. +++.+......++++++.+++
T Consensus 63 er~~g~~ilf~~~~var~wyillsgsv~v~g---------qi~mp~~~fgkr---~g~~r~~nclllq~semivid 126 (1283)
T KOG3542|consen 63 ERHPGQYILFRDGDVARSWYILLSGSVFVEG---------QIYMPYGCFGKR---TGQNRTHNCLLLQESEMIVID 126 (1283)
T ss_pred hcCCCceEEecccchhhheeeeeccceEeec---------ceecCccccccc---cccccccceeeecccceeeee
Confidence 4567889999999999999999999975543 345566666643 344455566667777776663
No 29
>PF07883 Cupin_2: Cupin domain; InterPro: IPR013096 This family represents the conserved barrel domain of the cupin superfamily [] (cupa is the Latin term for a small barrel). ; PDB: 2OPK_C 3BU7_B 2PHD_D 3NVC_A 3NKT_A 3NJZ_A 3NW4_A 3NST_A 3NL1_A 2H0V_A ....
Probab=95.82 E-value=0.072 Score=25.07 Aligned_cols=44 Identities=14% Similarity=0.243 Sum_probs=30.9
Q ss_pred EEecCCCEEEecCCCCC-eEEEEEeceEEEEEeeCCCceEEEEecCCCeE
Q psy5057 4 KPVEAGDIVIRQGDDGD-FFYVIESGRYEASVEIDGEDKLMHAYEDKGSF 52 (87)
Q Consensus 4 ~~~~~g~~l~~~g~~~~-~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~ 52 (87)
..+++|+..-..-.+.. .++++++|.+.+.. ... ...+.+|+.+
T Consensus 3 ~~~~pG~~~~~h~H~~~~e~~~vl~G~~~~~~----~~~-~~~l~~Gd~~ 47 (71)
T PF07883_consen 3 VTLPPGGSIPPHRHPGEDEFFYVLSGEGTLTV----DGE-RVELKPGDAI 47 (71)
T ss_dssp EEEETTEEEEEEEESSEEEEEEEEESEEEEEE----TTE-EEEEETTEEE
T ss_pred EEECCCCCCCCEECCCCCEEEEEEECCEEEEE----ccE-EeEccCCEEE
Confidence 46777876555544444 89999999998874 222 6777888855
No 30
>smart00835 Cupin_1 Cupin. This family represents the conserved barrel domain of the 'cupin' superfamily ('cupa' is the Latin term for a small barrel). This family contains 11S and 7S plant seed storage proteins, and germins. Plant seed storage proteins provide the major nitrogen source for the developing plant.
Probab=95.68 E-value=0.15 Score=27.85 Aligned_cols=51 Identities=12% Similarity=0.115 Sum_probs=36.1
Q ss_pred eEEecCCCEEEecCC-CCCeEEEEEeceEEEEEee-CCCceEEEEecCCCeEe
Q psy5057 3 EKPVEAGDIVIRQGD-DGDFFYVIESGRYEASVEI-DGEDKLMHAYEDKGSFG 53 (87)
Q Consensus 3 ~~~~~~g~~l~~~g~-~~~~~~~i~~G~v~~~~~~-~~~~~~~~~~~~g~~~g 53 (87)
...+++|...-..-- ....+++|++|...+.... ++++.....+.+|+.+-
T Consensus 34 ~~~i~pg~~~~~h~H~~~~e~~~Vl~G~~~~~~~~~~~~~~~~~~l~~GD~~~ 86 (146)
T smart00835 34 RVNLEPGGMLPPHYHPRATELLYVVRGEGRVGVVDPNGNKVYDARLREGDVFV 86 (146)
T ss_pred EEEecCCcCcCCeeCCCCCEEEEEEeCeEEEEEEeCCCCeEEEEEecCCCEEE
Confidence 345567766544432 3568999999999888755 45667788899999764
No 31
>COG1917 Uncharacterized conserved protein, contains double-stranded beta-helix domain [Function unknown]
Probab=94.57 E-value=0.21 Score=26.73 Aligned_cols=47 Identities=11% Similarity=0.056 Sum_probs=37.0
Q ss_pred eEEecCCCEEEecCCC-CCeEEEEEeceEEEEEeeCCCceEEEEecCCCeEeh
Q psy5057 3 EKPVEAGDIVIRQGDD-GDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSFGE 54 (87)
Q Consensus 3 ~~~~~~g~~l~~~g~~-~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~g~ 54 (87)
...+++|+.+-+-..+ .+...+|++|.++... . .....+.+||++-.
T Consensus 47 ~v~~~~G~~~~~H~hp~~~~~~~Vl~G~~~~~~----~-g~~~~l~~Gd~i~i 94 (131)
T COG1917 47 LVTFEPGAVIPWHTHPLGEQTIYVLEGEGTVQL----E-GEKKELKAGDVIII 94 (131)
T ss_pred EEEECCCcccccccCCCcceEEEEEecEEEEEe----c-CCceEecCCCEEEE
Confidence 4677899998888776 7799999999998877 3 44567889997743
No 32
>PRK13290 ectC L-ectoine synthase; Reviewed
Probab=94.54 E-value=0.36 Score=25.97 Aligned_cols=67 Identities=10% Similarity=-0.073 Sum_probs=38.1
Q ss_pred EEecCCCEEEecCCCCCeEEEEEeceEEEEEeeCCCceEEEEecCCCeEehhhhhcCCCceeEEEEecceeEEEE
Q psy5057 4 KPVEAGDIVIRQGDDGDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSFGELALLYNMPRAATIKATSTGSLWAM 78 (87)
Q Consensus 4 ~~~~~g~~l~~~g~~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~a~~~~~~~~~ 78 (87)
..+++|+..-..-.....+++|++|++.+... ..-....+.+|+.+-.. .+.++. +.+.++++++++
T Consensus 40 ~~l~pG~~~~~h~h~~~E~~yVL~G~~~~~~i---~~g~~~~L~aGD~i~~~---~~~~H~--~~N~e~~~~l~v 106 (125)
T PRK13290 40 TTIYAGTETHLHYKNHLEAVYCIEGEGEVEDL---ATGEVHPIRPGTMYALD---KHDRHY--LRAGEDMRLVCV 106 (125)
T ss_pred EEECCCCcccceeCCCEEEEEEEeCEEEEEEc---CCCEEEEeCCCeEEEEC---CCCcEE--EEcCCCEEEEEE
Confidence 46678875543222224689999999987721 21234778999966422 233333 333366655544
No 33
>COG3718 IolB Uncharacterized enzyme involved in inositol metabolism [Carbohydrate transport and metabolism]
Probab=94.05 E-value=0.47 Score=28.46 Aligned_cols=76 Identities=13% Similarity=0.115 Sum_probs=49.3
Q ss_pred eeEEecCCCEEEecCCCCCeEEEEEeceEEEEEeeCCCceEEEEecCC-CeEeh---hhhhcCCCceeEEEEecceeEEE
Q psy5057 2 FEKPVEAGDIVIRQGDDGDFFYVIESGRYEASVEIDGEDKLMHAYEDK-GSFGE---LALLYNMPRAATIKATSTGSLWA 77 (87)
Q Consensus 2 ~~~~~~~g~~l~~~g~~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g-~~~g~---~~~~~~~~~~~~~~a~~~~~~~~ 77 (87)
...++++|+.+-........+.++++|.+.+... ......++.. +.|-. .++.-...+..++.|.+++++..
T Consensus 32 ~~~~L~~Ges~~~~~~~~E~clV~v~Gk~~vs~~----g~~f~~iG~R~SvFe~~p~~~vYvp~g~~~~vtA~t~~~vAv 107 (270)
T COG3718 32 RLLRLAAGESATEETGDRERCLVLVTGKATVSAH----GSTFGEIGTRMSVFERKPPDSVYVPAGSAFSVTATTDLEVAV 107 (270)
T ss_pred EEEEccCCCcccccCCCceEEEEEEeeeEEEeec----cchHhhcccccccccCCCCCeEEecCCceEEEEeecceEEEE
Confidence 3567789999888888888999999999998873 2233333311 12211 12223344568899999988877
Q ss_pred Echh
Q psy5057 78 MDRK 81 (87)
Q Consensus 78 ~~~~ 81 (87)
..+.
T Consensus 108 C~AP 111 (270)
T COG3718 108 CSAP 111 (270)
T ss_pred EeCC
Confidence 6654
No 34
>PF05899 Cupin_3: Protein of unknown function (DUF861); InterPro: IPR008579 The function of the proteins in this entry are unknown. They contain the conserved barrel domain of the 'cupin' superfamily and members are specific to plants and bacteria.; PDB: 1RC6_A 3MYX_A 1O5U_A 2K9Z_A 1LKN_A 3ES4_A 1SFN_B 3BCW_A.
Probab=93.73 E-value=0.34 Score=23.51 Aligned_cols=30 Identities=20% Similarity=0.293 Sum_probs=23.2
Q ss_pred CeEEEEEeceEEEEEeeCCCceEEEEecCCCeEe
Q psy5057 20 DFFYVIESGRYEASVEIDGEDKLMHAYEDKGSFG 53 (87)
Q Consensus 20 ~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~g 53 (87)
+.+.+|++|.+.+.. ..-....+++||.+-
T Consensus 26 ~E~~~vleG~v~it~----~~G~~~~~~aGD~~~ 55 (74)
T PF05899_consen 26 DEFFYVLEGEVTITD----EDGETVTFKAGDAFF 55 (74)
T ss_dssp EEEEEEEEEEEEEEE----TTTEEEEEETTEEEE
T ss_pred CEEEEEEEeEEEEEE----CCCCEEEEcCCcEEE
Confidence 788899999998886 333447789999663
No 35
>TIGR03404 bicupin_oxalic bicupin, oxalate decarboxylase family. Members of this protein family are defined as bicupins as they have two copies of the cupin domain (pfam00190). Two different known activities for members of this family are oxalate decarboxylase (EC 4.1.1.2) and oxalate oxidase (EC 1.2.3.4), although the latter activity has more often been found in distantly related monocupin (germin) proteins.
Probab=93.61 E-value=1.2 Score=28.44 Aligned_cols=50 Identities=8% Similarity=0.001 Sum_probs=32.8
Q ss_pred EEecCCCEEEecCCCCCeEEEEEeceEEEEEeeCCCceEEEEecCCCeEe
Q psy5057 4 KPVEAGDIVIRQGDDGDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSFG 53 (87)
Q Consensus 4 ~~~~~g~~l~~~g~~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~g 53 (87)
..+.+|...-.---....+.++++|.+++...+.++......+.+||++=
T Consensus 72 ~~l~pG~~~~~HwH~~~E~~yVl~G~~~v~~~d~~g~~~~~~L~~GD~~~ 121 (367)
T TIGR03404 72 MRLEPGAIRELHWHKEAEWAYVLYGSCRITAVDENGRNYIDDVGAGDLWY 121 (367)
T ss_pred EEEcCCCCCCcccCCCceEEEEEeeEEEEEEEcCCCcEEEeEECCCCEEE
Confidence 44556654432222346799999999998877643444445899999663
No 36
>PF00190 Cupin_1: Cupin; InterPro: IPR006045 This family represents the conserved barrel domain of the 'cupin' superfamily ('cupa' is the Latin term for a small barrel). This family contains 11S and 7S plant seed storage proteins, and germins. Plant seed storage proteins provide the major nitrogen source for the developing plant. ; GO: 0045735 nutrient reservoir activity; PDB: 2E9Q_A 2EVX_A 1OD5_A 1UCX_A 1UD1_C 1FXZ_C 3KGL_C 3KSC_D 1UIJ_F 1IPK_B ....
Probab=92.50 E-value=0.98 Score=24.59 Aligned_cols=49 Identities=16% Similarity=0.174 Sum_probs=33.6
Q ss_pred EecCCCEEEecCCCCCeEEEEEeceEEEEEeeCCC-----ceEEE--EecCCCeEe
Q psy5057 5 PVEAGDIVIRQGDDGDFFYVIESGRYEASVEIDGE-----DKLMH--AYEDKGSFG 53 (87)
Q Consensus 5 ~~~~g~~l~~~g~~~~~~~~i~~G~v~~~~~~~~~-----~~~~~--~~~~g~~~g 53 (87)
.+++|..+...-..++.+.+|++|+..+.....+. ..... .+.+||++-
T Consensus 40 ~i~pg~~~~Ph~h~a~~i~~V~~G~~~~~~v~~~~~~~~~~~~~~~v~l~~Gdv~~ 95 (144)
T PF00190_consen 40 LIEPGGLRAPHYHNADEIVYVIEGRGRVGVVGPGGPQEEFRDFSQKVRLKAGDVFV 95 (144)
T ss_dssp EEETTEEEEEEEESSEEEEEEEESEEEEEEEETTCSSSEEEEEEEEEEEETTEEEE
T ss_pred ehhcCCccceeEeeeeEEeeeeccceEEEEEecCCccccceeeeceeeeeccccee
Confidence 34677666655447899999999999977665322 12222 499999774
No 37
>TIGR02272 gentisate_1_2 gentisate 1,2-dioxygenase. This family consists of gentisate 1,2-dioxygenases. This ring-opening enzyme acts in salicylate degradation that goes via gentisate rather than via catechol. It converts gentisate to maleylpyruvate. Some putative gentisate 1,2-dioxygenases are excluded by a relatively high trusted cutoff score because they are too closely related to known examples of 1-hydroxy-2-naphthoate dioxygenase. Therefore some homologs may be bona fide gentisate 1,2-dioxygenases even if they score below the given cutoffs.
Probab=92.42 E-value=1.8 Score=27.40 Aligned_cols=71 Identities=11% Similarity=0.156 Sum_probs=48.1
Q ss_pred EecCCCEEEecCCCCCeEEEEEeceEEEEEeeCCCceEEEEecCCCeEehhhhhcCCCceeEEEEecceeEEEEchhhhh
Q psy5057 5 PVEAGDIVIRQGDDGDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSFGELALLYNMPRAATIKATSTGSLWAMDRKTFY 84 (87)
Q Consensus 5 ~~~~g~~l~~~g~~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~ 84 (87)
.+++|..--..-.....+|.|.+|.-.... +.. ....++||+|-..+.-. .+..+.+++.++.++-..+.
T Consensus 256 ~L~~G~~t~~~r~T~s~Vf~VieG~G~s~i----g~~-~~~W~~gD~f~vPsW~~-----~~h~a~~da~Lf~~~D~Pll 325 (335)
T TIGR02272 256 LLPKGFRTATYRSTDATVFCVVEGRGQVRI----GDA-VFRFSPKDVFVVPSWHP-----VRFEASDDAVLFSFSDRPVQ 325 (335)
T ss_pred ccCCCCCCCCccccccEEEEEEeCeEEEEE----CCE-EEEecCCCEEEECCCCc-----EecccCCCeEEEEecCHHHH
Confidence 445666655666677999999999977766 333 35689999997665522 23344567778887766554
Q ss_pred c
Q psy5057 85 C 85 (87)
Q Consensus 85 ~ 85 (87)
+
T Consensus 326 ~ 326 (335)
T TIGR02272 326 Q 326 (335)
T ss_pred H
Confidence 4
No 38
>PF12973 Cupin_7: ChrR Cupin-like domain; PDB: 3O14_B 2Z2S_F 2Q1Z_B 3EBR_A.
Probab=91.98 E-value=0.87 Score=22.80 Aligned_cols=61 Identities=11% Similarity=0.068 Sum_probs=37.7
Q ss_pred eEEecCCCEEEecCCCCCeEEEEEeceEEEEEeeCCCceEEEEecCCCeEehhhhhcCCCceeEEEEecceeEEE
Q psy5057 3 EKPVEAGDIVIRQGDDGDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSFGELALLYNMPRAATIKATSTGSLWA 77 (87)
Q Consensus 3 ~~~~~~g~~l~~~g~~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~a~~~~~~~~ 77 (87)
..++++|..+-...-......+|++|...-- . ..+.+|+++=.. ..+..+..+.++|.++.
T Consensus 28 L~r~~pG~~~p~H~H~g~ee~~VLeG~~~d~-----~----~~~~~G~~~~~p-----~g~~h~~~s~~gc~~~v 88 (91)
T PF12973_consen 28 LLRLEPGASLPRHRHPGGEEILVLEGELSDG-----D----GRYGAGDWLRLP-----PGSSHTPRSDEGCLILV 88 (91)
T ss_dssp EEEE-TTEEEEEEEESS-EEEEEEECEEEET-----T----CEEETTEEEEE------TTEEEEEEESSCEEEEE
T ss_pred EEEECCCCCcCccCCCCcEEEEEEEEEEEEC-----C----ccCCCCeEEEeC-----CCCccccCcCCCEEEEE
Confidence 4567788777776666677888999987621 1 235788766332 22345666788888764
No 39
>TIGR03037 anthran_nbaC 3-hydroxyanthranilate 3,4-dioxygenase. Members of this protein family, from both bacteria and eukaryotes, are the enzyme 3-hydroxyanthranilate 3,4-dioxygenase. This enzyme acts on the tryptophan metabolite 3-hydroxyanthranilate and produces 2-amino-3-carboxymuconate semialdehyde, which can rearrange spontaneously to quinolinic acid and feed into nicotinamide biosynthesis, or undergo further enzymatic degradation.
Probab=91.88 E-value=1.4 Score=24.88 Aligned_cols=56 Identities=14% Similarity=0.164 Sum_probs=37.0
Q ss_pred CCeEEEEEeceEEEEEeeCCCceEEEEecCCCeEehhhhhcCCCceeEEEEecceeEEEEch
Q psy5057 19 GDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSFGELALLYNMPRAATIKATSTGSLWAMDR 80 (87)
Q Consensus 19 ~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~a~~~~~~~~~~~ 80 (87)
.+.++++++|.+.+-..++|+ .....+.+|++|=...-....+. +..++.++.+.+
T Consensus 48 tdE~FyqleG~~~l~v~d~g~-~~~v~L~eGd~flvP~gvpHsP~-----r~~~t~~LvIE~ 103 (159)
T TIGR03037 48 GEEFFYQLKGEMYLKVTEEGK-REDVPIREGDIFLLPPHVPHSPQ-----RPAGSIGLVIER 103 (159)
T ss_pred CceEEEEEcceEEEEEEcCCc-EEEEEECCCCEEEeCCCCCcccc-----cCCCcEEEEEEe
Confidence 689999999998886554443 45678899998865544444442 234555555543
No 40
>TIGR03404 bicupin_oxalic bicupin, oxalate decarboxylase family. Members of this protein family are defined as bicupins as they have two copies of the cupin domain (pfam00190). Two different known activities for members of this family are oxalate decarboxylase (EC 4.1.1.2) and oxalate oxidase (EC 1.2.3.4), although the latter activity has more often been found in distantly related monocupin (germin) proteins.
Probab=90.93 E-value=2.9 Score=26.74 Aligned_cols=49 Identities=16% Similarity=0.148 Sum_probs=32.2
Q ss_pred EEecCCCEEEec-CCCCCeEEEEEeceEEEEEeeCCCceEEEEecCCCeE
Q psy5057 4 KPVEAGDIVIRQ-GDDGDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSF 52 (87)
Q Consensus 4 ~~~~~g~~l~~~-g~~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~ 52 (87)
..+++|...-.- ....+.++++++|..++...++........+++|+++
T Consensus 250 ~~l~PG~~~~~H~H~~~~E~~yvl~G~~~~~v~d~~g~~~~~~l~~GD~~ 299 (367)
T TIGR03404 250 VTVEPGAMRELHWHPNADEWQYFIQGQARMTVFAAGGNARTFDYQAGDVG 299 (367)
T ss_pred EEECCCCccCCeeCcCCCeEEEEEEEEEEEEEEecCCcEEEEEECCCCEE
Confidence 445666554322 3346789999999998876653333344669999955
No 41
>PRK13264 3-hydroxyanthranilate 3,4-dioxygenase; Provisional
Probab=88.59 E-value=3.2 Score=23.88 Aligned_cols=57 Identities=16% Similarity=0.175 Sum_probs=37.3
Q ss_pred CCCCeEEEEEeceEEEEEeeCCCceEEEEecCCCeEehhhhhcCCCceeEEEEecceeEEEEc
Q psy5057 17 DDGDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSFGELALLYNMPRAATIKATSTGSLWAMD 79 (87)
Q Consensus 17 ~~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~a~~~~~~~~~~ 79 (87)
++.+.++++++|.+.+...++|+ .....+.+|++|=...-....+ ++..++..+.+.
T Consensus 52 ~~tdE~FyqleG~~~l~v~d~g~-~~~v~L~eGd~fllP~gvpHsP-----~r~~~tv~LviE 108 (177)
T PRK13264 52 DPGEEFFYQLEGDMYLKVQEDGK-RRDVPIREGEMFLLPPHVPHSP-----QREAGSIGLVIE 108 (177)
T ss_pred CCCceEEEEECCeEEEEEEcCCc-eeeEEECCCCEEEeCCCCCcCC-----ccCCCeEEEEEE
Confidence 45689999999998887765443 3457789999885544443333 224556666553
No 42
>PRK04190 glucose-6-phosphate isomerase; Provisional
Probab=87.93 E-value=3.7 Score=23.85 Aligned_cols=41 Identities=10% Similarity=-0.082 Sum_probs=28.3
Q ss_pred EecCCCCCeEEEEEeceEEEEEeeCCCceEEEEecCCCeEe
Q psy5057 13 IRQGDDGDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSFG 53 (87)
Q Consensus 13 ~~~g~~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~g 53 (87)
+++......+|++++|...+.............+.+|+.+-
T Consensus 91 ~H~~~~~~EiyyvlsG~g~~~l~~~~G~~~~~~v~pGd~v~ 131 (191)
T PRK04190 91 FHAKADRAEIYYGLKGKGLMLLQDPEGEARWIEMEPGTVVY 131 (191)
T ss_pred EcCCCCCCEEEEEEeCEEEEEEecCCCcEEEEEECCCCEEE
Confidence 34444456999999999887764433345667889999653
No 43
>PRK11171 hypothetical protein; Provisional
Probab=87.66 E-value=4.6 Score=24.59 Aligned_cols=68 Identities=15% Similarity=0.100 Sum_probs=38.5
Q ss_pred eEEecCCCEEEecCC--CCCeEEEEEeceEEEEEeeCCCceEEEEecCCCeEehhhhhcCCCceeEEEEecceeEEEE
Q psy5057 3 EKPVEAGDIVIRQGD--DGDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSFGELALLYNMPRAATIKATSTGSLWAM 78 (87)
Q Consensus 3 ~~~~~~g~~l~~~g~--~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~a~~~~~~~~~ 78 (87)
...+++|...-.... ....+++|++|.+.+.. .. ....+.+||.+= +-.+.++...-...++++++.+
T Consensus 65 ~~~l~PG~~~~~~~h~~~~eE~~~VlsG~l~v~~----~g-~~~~L~~GDsi~---~p~~~~H~~~N~g~~~a~~l~v 134 (266)
T PRK11171 65 LVEVEPGGGSDQPEPDEGAETFLFVVEGEITLTL----EG-KTHALSEGGYAY---LPPGSDWTLRNAGAEDARFHWI 134 (266)
T ss_pred EEEECCCCcCCCCCCCCCceEEEEEEeCEEEEEE----CC-EEEEECCCCEEE---ECCCCCEEEEECCCCCEEEEEE
Confidence 345566654322221 23688999999998876 22 247789998653 2223344333333445655554
No 44
>PF04962 KduI: KduI/IolB family; InterPro: IPR021120 The KduI/IolB family of enzymes includes 5-keto 4-deoxyuronate isomerase (KduI) and 5-deoxy-glucuronate isomerase (IolB). KduI is involved in pectin degradation by free-living soil bacteria that use pectin as a carbon source, breaking it down to 2-keto-3-deoxygluconate, which can ultimately be converted to pyruvate. KduI catalyses the fourth step in pectin degradation, namely the interconversion of 5-keto-4-deoxyuronate and 2,5-diketo-3-dexoygluconate []. KduI has a TIM-barrel fold []. IolB is one of several bacterial proteins encoded by the inositol operon (iolABCDEFGHIJ) in Bacillus subtilis that are involved in myo-inositol catabolism. The enzyme is responsible for isomerization of 5-deoxy-D-glucuronic acid by IolB to produce 2-deoxy-5-keto-D-gluconic acid []. IolBs possess a cupin-like structure.; GO: 0016861 intramolecular oxidoreductase activity, interconverting aldoses and ketoses, 0008152 metabolic process; PDB: 1YWK_B 2QJV_B 1X8M_A 1XRU_A.
Probab=86.50 E-value=2.8 Score=25.51 Aligned_cols=74 Identities=15% Similarity=0.106 Sum_probs=42.8
Q ss_pred eEEecCCCEEEecCCCCCeEEEEEeceEEEEEeeCCCc-eEEEEecCCCeEehh--hhhcCCCceeEEEEecceeEEEEc
Q psy5057 3 EKPVEAGDIVIRQGDDGDFFYVIESGRYEASVEIDGED-KLMHAYEDKGSFGEL--ALLYNMPRAATIKATSTGSLWAMD 79 (87)
Q Consensus 3 ~~~~~~g~~l~~~g~~~~~~~~i~~G~v~~~~~~~~~~-~~~~~~~~g~~~g~~--~~~~~~~~~~~~~a~~~~~~~~~~ 79 (87)
...+++|+.+-..-+.-....++++|.+.+.. .. ..-..-...+.|... ++.-...+...+.|.++++++..+
T Consensus 31 ~l~L~~g~~~~~~~~~~E~~vv~l~G~~~v~~----~g~~~~~l~~R~~vF~~~~d~lYvp~g~~~~i~a~~~ae~~~~s 106 (261)
T PF04962_consen 31 VLRLEAGESLEFELERRELGVVNLGGKATVTV----DGEEFYELGGRESVFDGPPDALYVPRGTKVVIFASTDAEFAVCS 106 (261)
T ss_dssp CEEEECCHCCCCCCCSEEEEEEEESSSEEEEE----TTEEEEEE-TTSSGGGS--EEEEE-TT--EEEEESSTEEEEEEE
T ss_pred EEEecCCCEEeccCCCcEEEEEEeCCEEEEEe----CCceEEEecccccccCCCCcEEEeCCCCeEEEEEcCCCEEEEEc
Confidence 34566666665555555677888899988776 32 222333333555322 333344556788888899888765
Q ss_pred h
Q psy5057 80 R 80 (87)
Q Consensus 80 ~ 80 (87)
.
T Consensus 107 a 107 (261)
T PF04962_consen 107 A 107 (261)
T ss_dssp E
T ss_pred c
Confidence 4
No 45
>COG2140 Thermophilic glucose-6-phosphate isomerase and related metalloenzymes [Carbohydrate transport and metabolism / General function prediction only]
Probab=85.75 E-value=5.4 Score=23.58 Aligned_cols=33 Identities=12% Similarity=0.050 Sum_probs=25.4
Q ss_pred CeEEEEEeceEEEEEeeCCCceEEEEecCCCeE
Q psy5057 20 DFFYVIESGRYEASVEIDGEDKLMHAYEDKGSF 52 (87)
Q Consensus 20 ~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~ 52 (87)
+.+|++++|+.+..-.....+.++..+++|+..
T Consensus 104 ~E~y~vi~G~g~m~v~~~~G~~~v~~~~~Gd~i 136 (209)
T COG2140 104 PEIYYVLKGEGRMLVQKPEGEARVIAVRAGDVI 136 (209)
T ss_pred ccEEEEEeccEEEEEEcCCCcEEEEEecCCcEE
Confidence 459999999998877764455777788888844
No 46
>PLN00212 glutelin; Provisional
Probab=84.08 E-value=9.7 Score=25.54 Aligned_cols=51 Identities=10% Similarity=0.016 Sum_probs=38.1
Q ss_pred EEecCCCEEEec-CCCCCeEEEEEeceEEEEEee-CCCceEEEEecCCCeEeh
Q psy5057 4 KPVEAGDIVIRQ-GDDGDFFYVIESGRYEASVEI-DGEDKLMHAYEDKGSFGE 54 (87)
Q Consensus 4 ~~~~~g~~l~~~-g~~~~~~~~i~~G~v~~~~~~-~~~~~~~~~~~~g~~~g~ 54 (87)
..+.+|-++..- ...++.+.++++|...+-... +|+......+.+|++|=.
T Consensus 353 v~L~~gam~~PHwn~nA~eI~yV~rG~g~vqvV~~~g~~vf~~~L~~GdvfVV 405 (493)
T PLN00212 353 VNLYQNALLSPFWNVNAHSVVYITQGRARVQVVSNNGKTVFNGVLRPGQLLII 405 (493)
T ss_pred EEEcCCcccCCeecCCCCEEEEEeecceEEEEEcCCCCEEEEEEEcCCCEEEE
Confidence 345566665543 345799999999999988876 667777789999997743
No 47
>COG0662 {ManC} Mannose-6-phosphate isomerase [Carbohydrate transport and metabolism]
Probab=83.91 E-value=4.9 Score=21.49 Aligned_cols=45 Identities=11% Similarity=0.148 Sum_probs=29.1
Q ss_pred eEEecCCCEE-EecCCCCCeEEEEEeceEEEEEeeCCCceEEEEecCCCeE
Q psy5057 3 EKPVEAGDIV-IRQGDDGDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSF 52 (87)
Q Consensus 3 ~~~~~~g~~l-~~~g~~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~ 52 (87)
...+++|+-+ ++--...+.+|+|++|...+.. +. ....+++|+.+
T Consensus 40 ~~~v~pg~~~~~~~H~~~dE~~~Vl~G~g~v~~----~~-~~~~v~~gd~~ 85 (127)
T COG0662 40 RILVKPGEEISLHHHHHRDEHWYVLEGTGKVTI----GG-EEVEVKAGDSV 85 (127)
T ss_pred EEEECCCcccCcccccCcceEEEEEeeEEEEEE----CC-EEEEecCCCEE
Confidence 3445666664 3333447999999999998887 32 23455777744
No 48
>PRK09943 DNA-binding transcriptional repressor PuuR; Provisional
Probab=83.35 E-value=6.3 Score=22.39 Aligned_cols=53 Identities=8% Similarity=0.067 Sum_probs=32.2
Q ss_pred CCCeEEEEEeceEEEEEeeCCCceEEEEecCCCeEehhhhhcCCCceeEEEEecceeEEEE
Q psy5057 18 DGDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSFGELALLYNMPRAATIKATSTGSLWAM 78 (87)
Q Consensus 18 ~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~a~~~~~~~~~ 78 (87)
+...+.+|++|.+.+.. +. ....+.+||.+=. -.+.++.......+++.++.+
T Consensus 127 ~~~E~~~Vl~G~~~~~~----~~-~~~~l~~Gd~~~~---~~~~~H~~~n~~~~~~~~l~~ 179 (185)
T PRK09943 127 QGEEIGTVLEGEIVLTI----NG-QDYHLVAGQSYAI---NTGIPHSFSNTSAGICRIISA 179 (185)
T ss_pred CCcEEEEEEEeEEEEEE----CC-EEEEecCCCEEEE---cCCCCeeeeCCCCCCeEEEEE
Confidence 34688999999998765 33 3367899996632 233444433333445555554
No 49
>TIGR02451 anti_sig_ChrR anti-sigma factor, putative, ChrR family. The member of this family from Rhodobacter sphaeroides has been shown both to form a complex with sigma(E) and to negatively regulate tetrapyrrole biosynthesis. This protein likely contains (at least) two distinct functional domains; several smaller homologs (excluded by the model) show homology only to the C-terminal, including a motif PxHxHxGxE.
Probab=82.42 E-value=5.2 Score=23.58 Aligned_cols=42 Identities=19% Similarity=0.192 Sum_probs=29.1
Q ss_pred eEEecCCCEEEecCCCCCeEEEEEeceEEEEEeeCCCceEEEEecCCCeEe
Q psy5057 3 EKPVEAGDIVIRQGDDGDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSFG 53 (87)
Q Consensus 3 ~~~~~~g~~l~~~g~~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~g 53 (87)
..++++|..+-...-....+.+|++|..+ + +. ..+.+|+|+=
T Consensus 131 Ll~i~pG~~~p~H~H~G~E~tlVLeG~f~-----d-e~---g~y~~Gd~i~ 172 (215)
T TIGR02451 131 LLYIEAGQSIPQHTHKGFELTLVLHGAFS-----D-ET---GVYGVGDFEE 172 (215)
T ss_pred EEEECCCCccCCCcCCCcEEEEEEEEEEE-----c-CC---CccCCCeEEE
Confidence 35667888887777777778899999963 1 11 3468888764
No 50
>KOG2378|consensus
Probab=81.89 E-value=1.3 Score=29.19 Aligned_cols=39 Identities=33% Similarity=0.503 Sum_probs=33.0
Q ss_pred CCCeEehhhhhcCCCceeEEEEecc-eeEEEEchhhhhcc
Q psy5057 48 DKGSFGELALLYNMPRAATIKATST-GSLWAMDRKTFYCN 86 (87)
Q Consensus 48 ~g~~~g~~~~~~~~~~~~~~~a~~~-~~~~~~~~~~~~~~ 86 (87)
+|+-||-.+.....|+.+++..-++ |.+..+++..|..+
T Consensus 1 eGddfgklalvnd~praativl~ed~~~fl~vDk~~Fn~I 40 (573)
T KOG2378|consen 1 EGDDFGKLALVNDAPRAATIVLREDNCHFLRVDKHDFNRI 40 (573)
T ss_pred CCcccchhccccccccccceeeecCCCcceeecHHHHHHH
Confidence 4888999999999999887666555 99999999999875
No 51
>COG3823 Glutamine cyclotransferase [Posttranslational modification, protein turnover, chaperones]
Probab=81.86 E-value=9 Score=23.08 Aligned_cols=81 Identities=16% Similarity=0.067 Sum_probs=52.2
Q ss_pred EEecCCCEEEecCC-CCCeEEEEEeceEEEEEee-----CCCceEEEEecCCCeEehhhhhcCCCceeEEEEecceeEEE
Q psy5057 4 KPVEAGDIVIRQGD-DGDFFYVIESGRYEASVEI-----DGEDKLMHAYEDKGSFGELALLYNMPRAATIKATSTGSLWA 77 (87)
Q Consensus 4 ~~~~~g~~l~~~g~-~~~~~~~i~~G~v~~~~~~-----~~~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~a~~~~~~~~ 77 (87)
+.|+-++..|.||- -.+..++.-.|...-++.. .|+.+.-..+.+..+||+-. ..........+=++...+.
T Consensus 38 ~~yphDs~sfTQGL~~~~g~i~esTG~yg~S~ir~~~L~~gq~~~s~~l~~~~~FgEGi--t~~gd~~y~LTw~egvaf~ 115 (262)
T COG3823 38 RTYPHDSTSFTQGLEYLDGHILESTGLYGFSKIRVSDLTTGQEIFSEKLAPDTVFGEGI--TKLGDYFYQLTWKEGVAFK 115 (262)
T ss_pred EeccCchhhhhcceeeeCCEEEEeccccccceeEEEeccCceEEEEeecCCccccccce--eeccceEEEEEeccceeEE
Confidence 45666777777774 3355666777765544332 56666666777778999742 2233345556666788888
Q ss_pred Echhhhhcc
Q psy5057 78 MDRKTFYCN 86 (87)
Q Consensus 78 ~~~~~~~~~ 86 (87)
.+++.|+.+
T Consensus 116 ~d~~t~~~l 124 (262)
T COG3823 116 YDADTLEEL 124 (262)
T ss_pred EChHHhhhh
Confidence 888888653
No 52
>TIGR03214 ura-cupin putative allantoin catabolism protein. This model represents a protein containing a tandem arrangement of cupin domains (N-terminal part of pfam07883 and C-terminal more distantly related to pfam00190). This protein is found in the vicinity of genes involved in the catabolism of allantoin, a breakdown product of urate and sometimes of urate iteslf. The distribution of pathway components in the genomes in which this family is observed suggests that the function is linked to the allantoate catabolism to glyoxylate pathway (GenProp0686) since it is sometimes found in genomes lacking any elements of the xanthine-to-allantoin pathways (e.g. in Enterococcus faecalis).
Probab=80.42 E-value=11 Score=22.98 Aligned_cols=30 Identities=20% Similarity=0.278 Sum_probs=22.3
Q ss_pred CCeEEEEEeceEEEEEeeCCCceEEEEecCCCeEe
Q psy5057 19 GDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSFG 53 (87)
Q Consensus 19 ~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~g 53 (87)
.+.+.+|++|.+.+.. ... ...+.+|+++=
T Consensus 80 ~ee~iyVl~G~l~v~~----~g~-~~~L~~Gd~~y 109 (260)
T TIGR03214 80 IETFLFVISGEVNVTA----EGE-THELREGGYAY 109 (260)
T ss_pred eEEEEEEEeCEEEEEE----CCE-EEEECCCCEEE
Confidence 3689999999998875 222 35889999663
No 53
>PF05726 Pirin_C: Pirin C-terminal cupin domain; InterPro: IPR008778 This entry represents C-terminal domain of Pirin proteins from both eukaryotes and prokaryotes. The function of Pirin is unknown but the gene coding for this protein is known to be expressed in all tissues in the human body although it is expressed most strongly in the liver and heart. Pirin is known to be a nuclear protein, exclusively localised within the nucleoplasma and predominantly concentrated within dot-like subnuclear structures []. Pirin is composed of two structurally similar domains arranged face to face. The N-terminal domain additionally features four beta-strands, and the C-terminal domain also includes four additional -strands and a short alpha-helix. Although the two domains are similar, the C-terminal domain of Pirin differs from the N-terminal domain as it does not contain a metal binding site and its sequence does not contain the conserved metal-coordinating residues []. Pirin is confirmed to be a member of the cupin superfamily on the basis of primary sequence and structural similarity. The presence of a metal binding site in the N-terminal beta-barrel of Pirin, may be significant in its role in regulating NFI DNA replication and NF-kappaB transcription factor activity []. Pirin structure has been found to closely resemble members of the cupin superfamily. Pirin contains the two characteristic sequences of the cupin superfamily, namely PG-(X)5-HXH-(X)4-E-(X)6-G and G-(X)5-PXG-(X)2-H-(X)3-N separated by a variable stretch of 15-50 amino acids. These motifs are best conserved in the N-terminal where the conserved histidine and glutamic acid residues correspond to the metal-coordinating residues. The C-terminal domain motifs lack the metal binding residues normally associated with the cupin fold []. Pirin was identified to be a metal-binding protein [], and was found that the metal-binding residues of Pirins are highly conserved across mammals, plants, fungi, and prokaryotic organisms. Pirin acts as a cofactor for the transcription factor NFI, the regulatory mechanism of which is generally believed to require the assistance of a metal ion []. Structural data supports the hypothesis that the bound iron of Pirin may participate in this transcriptional regulation by enhancing and stabilising the formation of the p50,Bcl3,DNA complex []. Metals have been implicated directly or indirectly in the NF-kappaB family of transcription factors that control expression of a number of early response genes associated with inflammatory responses, cell growth, cell cycle progression, and neoplastic transformation []. However, most metal-dependent transcription factors are DNA-binding proteins that bind to specific sequences when the metal binds to the protein. Pirin, on the other hand, appears to function differently and bind to the transcription factor DNA complex [].; PDB: 1J1L_A 3ACL_A 2P17_A.
Probab=80.28 E-value=6.3 Score=20.29 Aligned_cols=70 Identities=13% Similarity=0.101 Sum_probs=39.8
Q ss_pred EEecCCCEEEec-CCCCCeEEEEEeceEEEEEeeCCCceEEEEecCCCeEehhhhhcCCCceeEEEEe-cceeEEEEchh
Q psy5057 4 KPVEAGDIVIRQ-GDDGDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSFGELALLYNMPRAATIKAT-STGSLWAMDRK 81 (87)
Q Consensus 4 ~~~~~g~~l~~~-g~~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~a~-~~~~~~~~~~~ 81 (87)
..+++|..+-.+ ...-+.+.++++|.+.+-. .. ..+.+++.. ++. ......+.+. ++++++.+..+
T Consensus 4 i~l~~g~~~~~~~~~~~~~~iyv~~G~~~v~~----~~---~~~~~~~~~----~l~-~g~~i~~~a~~~~a~~lll~Ge 71 (104)
T PF05726_consen 4 IKLEPGASFTLPLPPGHNAFIYVLEGSVEVGG----EE---DPLEAGQLV----VLE-DGDEIELTAGEEGARFLLLGGE 71 (104)
T ss_dssp EEE-TT-EEEEEEETT-EEEEEEEESEEEETT----TT---EEEETTEEE----EE--SECEEEEEESSSSEEEEEEEE-
T ss_pred EEECCCCEEEeecCCCCEEEEEEEECcEEECC----Cc---ceECCCcEE----EEC-CCceEEEEECCCCcEEEEEEcc
Confidence 456777776544 4556899999999975422 22 445555533 122 3345677777 78899888877
Q ss_pred hhhc
Q psy5057 82 TFYC 85 (87)
Q Consensus 82 ~~~~ 85 (87)
.+.+
T Consensus 72 Pl~E 75 (104)
T PF05726_consen 72 PLNE 75 (104)
T ss_dssp ---S
T ss_pred CCCC
Confidence 7653
No 54
>PF06052 3-HAO: 3-hydroxyanthranilic acid dioxygenase; InterPro: IPR010329 Members of this protein family, from both bacteria and eukaryotes, are the enzyme 3-hydroxyanthranilate 3,4-dioxygenase (1.13.11.6 from EC). It is part of the kynurenine pathway for the degradation of tryptophan and the biosynthesis of nicotinic acid [].The prokaryotic homologue is involved in the 2-nitrobenzoate degradation pathway []. The enzyme acts on the tryptophan metabolite 3-hydroxyanthranilate and produces 2-amino-3-carboxymuconate semialdehyde, which can rearrange spontaneously to quinolinic acid and feed into nicotinamide biosynthesis, or undergo further enzymatic degradation.; GO: 0000334 3-hydroxyanthranilate 3,4-dioxygenase activity, 0005506 iron ion binding, 0008152 metabolic process, 0055114 oxidation-reduction process; PDB: 1ZVF_A 1YFX_A 1YFW_A 1YFY_A 1YFU_A 2QNK_A 3FE5_A.
Probab=79.69 E-value=7.5 Score=21.79 Aligned_cols=43 Identities=19% Similarity=0.067 Sum_probs=26.5
Q ss_pred CCCeEEEEEeceEEEEEeeCCCceEEEEecCCCeEehhhhhcCC
Q psy5057 18 DGDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSFGELALLYNM 61 (87)
Q Consensus 18 ~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~g~~~~~~~~ 61 (87)
+.+.+++-++|.+.+-...+|+.. ...+.+|++|-..+-..+.
T Consensus 52 e~eE~FyQ~kG~m~Lkv~e~g~~k-di~I~EGe~fLLP~~vpHs 94 (151)
T PF06052_consen 52 ETEEFFYQLKGDMCLKVVEDGKFK-DIPIREGEMFLLPANVPHS 94 (151)
T ss_dssp SS-EEEEEEES-EEEEEEETTEEE-EEEE-TTEEEEE-TT--EE
T ss_pred CcceEEEEEeCcEEEEEEeCCceE-EEEeCCCcEEecCCCCCCC
Confidence 345889999999988766666554 5677999988654444333
No 55
>PF13128 DUF3954: Protein of unknown function (DUF3954)
Probab=78.97 E-value=4.8 Score=18.18 Aligned_cols=19 Identities=11% Similarity=0.179 Sum_probs=15.2
Q ss_pred CCCeEEEEEeceEEEEEee
Q psy5057 18 DGDFFYVIESGRYEASVEI 36 (87)
Q Consensus 18 ~~~~~~~i~~G~v~~~~~~ 36 (87)
..|.+|++.+|.+......
T Consensus 8 ~~ngiYiV~~G~v~~i~pP 26 (50)
T PF13128_consen 8 KENGIYIVKDGEVTFIEPP 26 (50)
T ss_pred cCCeEEEEECCeEEEcCCC
Confidence 4589999999999877543
No 56
>COG3837 Uncharacterized conserved protein, contains double-stranded beta-helix domain [Function unknown]
Probab=78.12 E-value=10 Score=21.48 Aligned_cols=38 Identities=24% Similarity=0.028 Sum_probs=26.9
Q ss_pred cCCCCCeEEEEEeceEEEEEeeCCCceEEEEecCCCeEehhhh
Q psy5057 15 QGDDGDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSFGELAL 57 (87)
Q Consensus 15 ~g~~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~g~~~~ 57 (87)
-....+.+.+|++|+..+.. ++ -...++|||+.|..+-
T Consensus 60 ~Hs~edEfv~ILeGE~~l~~--d~---~e~~lrpGD~~gFpAG 97 (161)
T COG3837 60 WHSAEDEFVYILEGEGTLRE--DG---GETRLRPGDSAGFPAG 97 (161)
T ss_pred ccccCceEEEEEcCceEEEE--CC---eeEEecCCceeeccCC
Confidence 34456889999999987664 22 3367899998886543
No 57
>PHA00672 hypothetical protein
Probab=74.90 E-value=12 Score=20.52 Aligned_cols=31 Identities=16% Similarity=0.020 Sum_probs=26.4
Q ss_pred eEEecCCCEEEecCCCCCeEEEEEeceEEEEE
Q psy5057 3 EKPVEAGDIVIRQGDDGDFFYVIESGRYEASV 34 (87)
Q Consensus 3 ~~~~~~g~~l~~~g~~~~~~~~i~~G~v~~~~ 34 (87)
..+.++|..|.-.--..+.+ ++.+|.+.++.
T Consensus 51 ei~IPkGt~LtG~~hkf~~~-ii~sG~itV~t 81 (152)
T PHA00672 51 TIRIPAGVALTGALIKVSTV-LIFSGHATVFI 81 (152)
T ss_pred EEeccCceeeeeeeeEeeEE-EEecccEEEEe
Confidence 56779999998877777888 99999999887
No 58
>PRK11171 hypothetical protein; Provisional
Probab=74.85 E-value=16 Score=22.23 Aligned_cols=45 Identities=9% Similarity=-0.073 Sum_probs=32.9
Q ss_pred eEEecCCCEEEe-cCCCCCeEEEEEeceEEEEEeeCCCceEEEEecCCCeE
Q psy5057 3 EKPVEAGDIVIR-QGDDGDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSF 52 (87)
Q Consensus 3 ~~~~~~g~~l~~-~g~~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~ 52 (87)
...+++|..+-. ......+.++|++|+..+.. + -....+.+||++
T Consensus 188 ~~~l~PG~~~~~~~~~~~ee~i~Vl~G~~~~~~----~-~~~~~l~~GD~i 233 (266)
T PRK11171 188 IVTFEPGASIPFVETHVMEHGLYVLEGKGVYRL----N-NDWVEVEAGDFI 233 (266)
T ss_pred EEEECCCCEEccCcCCCceEEEEEEeCEEEEEE----C-CEEEEeCCCCEE
Confidence 456788877766 35666899999999998765 2 244677899855
No 59
>TIGR01479 GMP_PMI mannose-1-phosphate guanylyltransferase/mannose-6-phosphate isomerase. This enzyme is known to be bifunctional, as both mannose-6-phosphate isomerase (EC 5.3.1.8) (PMI) and mannose-1-phosphate guanylyltransferase (EC 2.7.7.22) in Pseudomonas aeruginosa, Xanthomonas campestris, and Gluconacetobacter xylinus. The literature on the enzyme from E. coli attributes mannose-6-phosphate isomerase activity to an adjacent gene, but the present sequence has not been shown to lack the activity. The PMI domain is C-terminal.
Probab=68.85 E-value=31 Score=22.89 Aligned_cols=46 Identities=13% Similarity=0.127 Sum_probs=28.3
Q ss_pred eEEecCCCEEEe-cCCCCCeEEEEEeceEEEEEeeCCCceEEEEecCCCeEe
Q psy5057 3 EKPVEAGDIVIR-QGDDGDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSFG 53 (87)
Q Consensus 3 ~~~~~~g~~l~~-~g~~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~g 53 (87)
...+++|.-+-. ........++|++|.+.+.. .. ....+.+|+.+-
T Consensus 380 ~~~i~PG~~~~~h~H~~~~E~~~Vl~G~~~v~~----dg-~~~~l~~GDsi~ 426 (468)
T TIGR01479 380 RITVKPGEKLSLQMHHHRAEHWIVVSGTARVTI----GD-ETLLLTENESTY 426 (468)
T ss_pred EEEECCCCccCccccCCCceEEEEEeeEEEEEE----CC-EEEEecCCCEEE
Confidence 345677764321 12223456689999998876 32 235789999663
No 60
>PF11506 DUF3217: Protein of unknown function (DUF3217); InterPro: IPR024506 This family of proteins with unknown function appears to be restricted to Mycoplasma.; PDB: 2HQL_E.
Probab=66.60 E-value=15 Score=18.56 Aligned_cols=31 Identities=23% Similarity=0.291 Sum_probs=19.0
Q ss_pred EEEeceEEEEEeeCCCceEEEEecCCCeEeh
Q psy5057 24 VIESGRYEASVEIDGEDKLMHAYEDKGSFGE 54 (87)
Q Consensus 24 ~i~~G~v~~~~~~~~~~~~~~~~~~g~~~g~ 54 (87)
++++|.+..+.....+.-....+...-.||+
T Consensus 5 V~LEG~IeS~kWS~~KTGF~VTI~QkR~FG~ 35 (104)
T PF11506_consen 5 VFLEGEIESYKWSKKKTGFLVTIKQKRKFGE 35 (104)
T ss_dssp EEEEEEEEEEEE-TTSSEEEEEEEEEEEETT
T ss_pred EEEeceeehhcccccCceEEEEEeehhhhcc
Confidence 5689999999888555444444444445554
No 61
>PRK15460 cpsB mannose-1-phosphate guanyltransferase; Provisional
Probab=63.31 E-value=43 Score=22.50 Aligned_cols=69 Identities=10% Similarity=0.098 Sum_probs=38.0
Q ss_pred eEEecCCCEEEec-CCCCCeEEEEEeceEEEEEeeCCCceEEEEecCCCeEehhhhhcCCCceeEEEEecceeEEEEc
Q psy5057 3 EKPVEAGDIVIRQ-GDDGDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSFGELALLYNMPRAATIKATSTGSLWAMD 79 (87)
Q Consensus 3 ~~~~~~g~~l~~~-g~~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~a~~~~~~~~~~ 79 (87)
...+++|..+-.+ .......++|++|.+.+.. .. ....+.+|+.+= +..+.++...-...++++++.+.
T Consensus 389 ~i~v~PG~~~~~~~H~~~~E~~~VlsG~~~v~i----dg-~~~~L~~GDSi~---ip~g~~H~~~N~g~~~l~iI~V~ 458 (478)
T PRK15460 389 RITVKPGEGLSVQMHHHRAEHWVVVAGTAKVTI----DG-DIKLLGENESIY---IPLGATHCLENPGKIPLDLIEVR 458 (478)
T ss_pred EEEECCCCcCCcCCCCCCceEEEEEeeEEEEEE----CC-EEEEecCCCEEE---ECCCCcEEEEcCCCCCEEEEEEE
Confidence 3455676643111 1233477889999998776 22 347788998553 22334443333334455555443
No 62
>PLN00212 glutelin; Provisional
Probab=62.73 E-value=45 Score=22.58 Aligned_cols=34 Identities=12% Similarity=0.181 Sum_probs=27.4
Q ss_pred eEEecCCCEEEecCCCCCeEEEEEeceEEEEEee
Q psy5057 3 EKPVEAGDIVIRQGDDGDFFYVIESGRYEASVEI 36 (87)
Q Consensus 3 ~~~~~~g~~l~~~g~~~~~~~~i~~G~v~~~~~~ 36 (87)
...++++-++...-..++.+.+|.+|...+-...
T Consensus 84 R~~i~p~gL~lP~y~na~~liyV~qG~G~~G~v~ 117 (493)
T PLN00212 84 RRVIEPQGLLLPRYSNTPGLVYIIQGRGSMGLTF 117 (493)
T ss_pred EEEecCCcccCccccCCCeEEEEEeCeEEEEEEe
Confidence 4566888888888888999999999997766553
No 63
>COG3450 Predicted enzyme of the cupin superfamily [General function prediction only]
Probab=62.58 E-value=23 Score=19.03 Aligned_cols=29 Identities=28% Similarity=0.255 Sum_probs=20.8
Q ss_pred CeEEEEEeceEEEEEeeCCCceEEEEecCCCeE
Q psy5057 20 DFFYVIESGRYEASVEIDGEDKLMHAYEDKGSF 52 (87)
Q Consensus 20 ~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~ 52 (87)
+.++-+++|.+.+.. ..-....+++||.|
T Consensus 64 ~E~chil~G~v~~T~----d~Ge~v~~~aGD~~ 92 (116)
T COG3450 64 DEFCHILEGRVEVTP----DGGEPVEVRAGDSF 92 (116)
T ss_pred ceEEEEEeeEEEEEC----CCCeEEEEcCCCEE
Confidence 778889999988765 22344567888855
No 64
>COG4101 Predicted mannose-6-phosphate isomerase [Carbohydrate transport and metabolism]
Probab=62.45 E-value=24 Score=19.22 Aligned_cols=43 Identities=9% Similarity=-0.117 Sum_probs=29.3
Q ss_pred CCeEEEEEeceEEEEEeeCCCceEEEEecCCCeEehhhhhcCCCc
Q psy5057 19 GDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSFGELALLYNMPR 63 (87)
Q Consensus 19 ~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~g~~~~~~~~~~ 63 (87)
-+...++++|+.....- ++-......++||+|=..+....++.
T Consensus 67 hEtaIYvlsG~ah~w~G--~rLE~ha~~~pGDf~YiPpgVPHqp~ 109 (142)
T COG4101 67 HETAIYVLSGEAHTWYG--NRLEEHAEVGPGDFFYIPPGVPHQPA 109 (142)
T ss_pred ccEEEEEEeceeeeeec--cceeeeEEecCCCeEEcCCCCCCccc
Confidence 35567889999988772 23345678899998865555555544
No 65
>COG3364 Zn-ribbon containing protein [General function prediction only]
Probab=61.64 E-value=22 Score=18.67 Aligned_cols=41 Identities=24% Similarity=0.398 Sum_probs=27.4
Q ss_pred cCCCCCeEEEEEeceEEEEEee--CCCceEEEEecCCCeEehh
Q psy5057 15 QGDDGDFFYVIESGRYEASVEI--DGEDKLMHAYEDKGSFGEL 55 (87)
Q Consensus 15 ~g~~~~~~~~i~~G~v~~~~~~--~~~~~~~~~~~~g~~~g~~ 55 (87)
+|++...+-++.-|...+-... ++.+++++....|.+|=.+
T Consensus 57 ~ge~ietIrI~~pG~YeiNl~~Lld~~~iVval~EeG~Y~I~L 99 (112)
T COG3364 57 QGEPIETIRILRPGVYEINLESLLDRDEIVVALQEEGRYFIHL 99 (112)
T ss_pred ccCcceEEEEecCceEEEehhhhccCCceEEEEccCCeEEEEC
Confidence 4555677777777777664433 6677777777888776544
No 66
>PF11699 CENP-C_C: Mif2/CENP-C like; PDB: 2VPV_B.
Probab=56.08 E-value=26 Score=17.62 Aligned_cols=30 Identities=20% Similarity=0.470 Sum_probs=20.4
Q ss_pred CCCeEEEEEeceEEEEEeeCCCceEEEEecCCCeE
Q psy5057 18 DGDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSF 52 (87)
Q Consensus 18 ~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~ 52 (87)
...-.++|.+|.+++.. ++ ....+.+|+.|
T Consensus 32 ~~~~vF~V~~G~v~Vti----~~-~~f~v~~G~~F 61 (85)
T PF11699_consen 32 DNTMVFYVIKGKVEVTI----HE-TSFVVTKGGSF 61 (85)
T ss_dssp SEEEEEEEEESEEEEEE----TT-EEEEEETT-EE
T ss_pred CcEEEEEEEeCEEEEEE----cC-cEEEEeCCCEE
Confidence 34568899999999887 33 33446777766
No 67
>PRK15457 ethanolamine utilization protein EutQ; Provisional
Probab=54.54 E-value=47 Score=20.17 Aligned_cols=29 Identities=10% Similarity=-0.024 Sum_probs=21.0
Q ss_pred CCeEEEEEeceEEEEEeeCCCceEEEEecCCCeE
Q psy5057 19 GDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSF 52 (87)
Q Consensus 19 ~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~ 52 (87)
.+.+.++++|.+.+.. . -....+++|+++
T Consensus 175 ~dEi~YVLEGe~~l~I----d-G~t~~l~pGDvl 203 (233)
T PRK15457 175 YDEIDMVLEGELHVRH----E-GETMIAKAGDVM 203 (233)
T ss_pred ceEEEEEEEeEEEEEE----C-CEEEEeCCCcEE
Confidence 3678899999998876 2 234667888854
No 68
>PF06249 EutQ: Ethanolamine utilisation protein EutQ; InterPro: IPR010424 The eut operon of Salmonella typhimurium encodes proteins involved in the cobalamin-dependent degradation of ethanolamine. The role of EutQ in this process is unclear [].; PDB: 2PYT_B 3LWC_A.
Probab=51.49 E-value=43 Score=18.88 Aligned_cols=50 Identities=14% Similarity=0.109 Sum_probs=27.8
Q ss_pred CCeEEEEEeceEEEEEeeCCCceEEEEecCCCeEehhhhhcCCCceeEEEEecceeEEEE
Q psy5057 19 GDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSFGELALLYNMPRAATIKATSTGSLWAM 78 (87)
Q Consensus 19 ~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~a~~~~~~~~~ 78 (87)
.+.+.+|++|.+.+.. +| ......+||++= + ...+..+..+...++++..
T Consensus 95 YDEi~~VlEG~L~i~~--~G---~~~~A~~GDvi~----i-PkGs~I~fst~~~a~~~Yv 144 (152)
T PF06249_consen 95 YDEIKYVLEGTLEISI--DG---QTVTAKPGDVIF----I-PKGSTITFSTPDYARFFYV 144 (152)
T ss_dssp SEEEEEEEEEEEEEEE--TT---EEEEEETT-EEE----E--TT-EEEEEEEEEEEEEEE
T ss_pred cceEEEEEEeEEEEEE--CC---EEEEEcCCcEEE----E-CCCCEEEEecCCCEEEEEE
Confidence 4788899999987764 22 334557888552 1 1233445555555555443
No 69
>PF06865 DUF1255: Protein of unknown function (DUF1255); InterPro: IPR009664 This family consists of several conserved hypothetical bacterial proteins of around 95 residues in length. The function of this family is unknown; PDB: 2OYZ_A 3HQX_A.
Probab=50.89 E-value=35 Score=17.62 Aligned_cols=58 Identities=14% Similarity=0.157 Sum_probs=30.5
Q ss_pred cCCCEEEecCCCCCeEEEEEeceEEEEEeeCCCceEEEEecCCCeEehhhhhcCCCceeEEEEeccee
Q psy5057 7 EAGDIVIRQGDDGDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSFGELALLYNMPRAATIKATSTGS 74 (87)
Q Consensus 7 ~~g~~l~~~g~~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~a~~~~~ 74 (87)
.||+.=+.- ......-|++|.+.+... .+.....+.+|+.|-.. ..+...+++.+++.
T Consensus 31 ~pGeY~F~T--~~~E~M~vvsG~l~V~lp---g~~ew~~~~aGesF~Vp-----anssF~v~v~~~~~ 88 (94)
T PF06865_consen 31 LPGEYTFGT--SAPERMEVVSGELEVKLP---GEDEWQTYSAGESFEVP-----ANSSFDVKVKEPTA 88 (94)
T ss_dssp -SECEEEEE--SS-EEEEEEESEEEEEET---T-SS-EEEETT-EEEE------TTEEEEEEESS-EE
T ss_pred eeeEEEEcC--CCCEEEEEEEeEEEEEcC---CCcccEEeCCCCeEEEC-----CCCeEEEEECccee
Confidence 456644433 345677778888777663 33345667777777532 33455666666554
No 70
>PF06560 GPI: Glucose-6-phosphate isomerase (GPI); InterPro: IPR010551 This entry consists of several bacterial and archaeal glucose-6-phosphate isomerase (GPI) proteins (5.3.1.9 from EC), which are involved in glycolysis and in gluconeogenesis and catalyse the conversion of D-glucose 6-phosphate to D-fructose 6-phosphate. The deduced amino acid sequence of the first archaeal PGI isolated from Pyrococcus furiosus revealed that it is not related to its eukaryotic and many of its bacterial counterparts. In contrast, this archaeal PGI shares similarity with the cupin superfamily that consists of a variety of proteins that are generally involved in sugar metabolism in both prokaryotes and eukaryotes [].; GO: 0004347 glucose-6-phosphate isomerase activity, 0006094 gluconeogenesis, 0006096 glycolysis, 0005737 cytoplasm; PDB: 1J3Q_B 1J3R_B 1J3P_A 2GC0_A 1X8E_A 1X82_A 1QY4_B 2GC2_B 1QXJ_A 1QXR_B ....
Probab=50.35 E-value=50 Score=19.21 Aligned_cols=37 Identities=14% Similarity=0.023 Sum_probs=23.2
Q ss_pred CCCCCeEEEEEeceEEEEEeeCCC----ceEEEEecCCCeE
Q psy5057 16 GDDGDFFYVIESGRYEASVEIDGE----DKLMHAYEDKGSF 52 (87)
Q Consensus 16 g~~~~~~~~i~~G~v~~~~~~~~~----~~~~~~~~~g~~~ 52 (87)
|.....+|.+++|..-........ +.......+|+..
T Consensus 80 ~~~~pEvY~vl~G~g~~lLq~~~~~~~~~~~~v~~~~G~~v 120 (182)
T PF06560_consen 80 GLSYPEVYEVLSGEGLILLQKEEGDDVGDVIAVEAKPGDVV 120 (182)
T ss_dssp -TT--EEEEEEESSEEEEEE-TTS-----EEEEEE-TTEEE
T ss_pred CCCCCcEEEEEeCEEEEEEEecCCCcceeEEEEEeCCCCEE
Confidence 466789999999998877766333 4556777888854
No 71
>PF02041 Auxin_BP: Auxin binding protein; InterPro: IPR000526 Auxin binding protein is located in the lumen of the endoplasmic reticulum (ER). The primary structure contains an N-terminal hydrophobic leader sequence of 30-40 amino acids, which could represent a signal for translocation of the protein to the ER [, ]. The mature protein comprises around 165 residues, and contains a number of potential N-glycosylation sites. In vitro transport studies have demonstrated co-translational glycosylation []. Retention within the lumen of the ER correlates with an additional signal located at the C terminus, represented by the sequence Lys-Asp-Glu-Leu, known to be responsible for preventing secretion of proteins from the lumen of the ER in eukaryotic cells [, ].; GO: 0004872 receptor activity, 0005788 endoplasmic reticulum lumen; PDB: 1LR5_D 1LRH_D.
Probab=49.44 E-value=48 Score=18.83 Aligned_cols=48 Identities=10% Similarity=0.115 Sum_probs=21.9
Q ss_pred EecCCCEEEecCCCCCeEEEEEeceEEEEEee-----CCCceEEEEecCCCeEe
Q psy5057 5 PVEAGDIVIRQGDDGDFFYVIESGRYEASVEI-----DGEDKLMHAYEDKGSFG 53 (87)
Q Consensus 5 ~~~~g~~l~~~g~~~~~~~~i~~G~v~~~~~~-----~~~~~~~~~~~~g~~~g 53 (87)
++.+|.---..--.++.+++|++|.-.++... .|+.+. ..+.+++.|-
T Consensus 50 TfAPG~~TPiHRHsCEEVFvVLkG~GTl~l~~~~~~~pG~pqe-f~~~pnSTf~ 102 (167)
T PF02041_consen 50 TFAPGSATPIHRHSCEEVFVVLKGSGTLYLASSHEKYPGKPQE-FPIFPNSTFH 102 (167)
T ss_dssp EE-TT-B--EEEESS-EEEEEEE--EEEEE--SSSSS--S-EE-EEE-TTEEEE
T ss_pred eecCCCCCCCccccccEEEEEEecceEEEEecccccCCCCceE-EEecCCCeEE
Confidence 34444333323335789999999998887664 244433 3456777664
No 72
>PF12852 Cupin_6: Cupin
Probab=44.96 E-value=57 Score=18.39 Aligned_cols=34 Identities=21% Similarity=0.085 Sum_probs=22.1
Q ss_pred CCCCCeEEEEEeceEEEEEeeCCCceEEEEecCCCeE
Q psy5057 16 GDDGDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSF 52 (87)
Q Consensus 16 g~~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~ 52 (87)
+.+.-.+++|.+|...+..... .....+.+||++
T Consensus 32 ~~~~~~fh~V~~G~~~l~~~~~---~~~~~L~~GDiv 65 (186)
T PF12852_consen 32 GSPGASFHVVLRGSCWLRVPGG---GEPIRLEAGDIV 65 (186)
T ss_pred CCCceEEEEEECCeEEEEEcCC---CCeEEecCCCEE
Confidence 3335899999999998885321 233556666644
No 73
>PF03891 DUF333: Domain of unknown function (DUF333); InterPro: IPR005590 This family consists of bacterial proteins whose function has not been characterised.
Probab=44.65 E-value=32 Score=15.43 Aligned_cols=35 Identities=23% Similarity=0.235 Sum_probs=27.1
Q ss_pred EeceEEEEEeeCCCceEEEEecCCCeEehhhhhcC
Q psy5057 26 ESGRYEASVEIDGEDKLMHAYEDKGSFGELALLYN 60 (87)
Q Consensus 26 ~~G~v~~~~~~~~~~~~~~~~~~g~~~g~~~~~~~ 60 (87)
.-|.+.+....+|.+.-+..+..|....+.+++.+
T Consensus 14 ~GG~~~~~~~~~G~~~g~C~lpdG~~~~eW~l~r~ 48 (50)
T PF03891_consen 14 QGGKLEIRKQPDGSQVGYCVLPDGRRCEEWALYRG 48 (50)
T ss_pred hCCEEEEEEcCCCCeEeEEECCCCCEEeHHHHhhc
Confidence 45677777777887888899999998888877654
No 74
>KOG2757|consensus
Probab=43.77 E-value=93 Score=20.48 Aligned_cols=45 Identities=13% Similarity=0.099 Sum_probs=32.2
Q ss_pred EEecCCCEEEecCCCCCeEEEEEeceEEEEEeeCCCceEEEEecCCCeE
Q psy5057 4 KPVEAGDIVIRQGDDGDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSF 52 (87)
Q Consensus 4 ~~~~~g~~l~~~g~~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~ 52 (87)
...+.|+.-.-++-....++++++|...+... ......+.+|+++
T Consensus 338 ~~v~~g~~~~~~~~~~~SIllv~~G~g~l~~~----t~~~~~v~rG~V~ 382 (411)
T KOG2757|consen 338 TKVPTGESYKFPGVDGPSILLVLKGSGILKTD----TDSKILVNRGDVL 382 (411)
T ss_pred eecCCCceEEeecCCCceEEEEEecceEEecC----CCCceeeccCcEE
Confidence 45567888777888889999999999876652 2334556677744
No 75
>PF04648 MF_alpha: Yeast mating factor alpha hormone; InterPro: IPR006742 This repeated sequence,WHWLQLKPGQPMY, characterises the mating factor alpha-1 or alpha-1 mating pheromone [contains: Mating factor alpha].The hormone is excreted into the culture medium by haploid cells of the alpha mating type and acts on cells of the opposite mating type (type A) by binding to a cognate G-protein coupled receptor which is coupled to a downstream signal transduction pathway. It inhibits DNA synthesis in type A cells synchronising them with type alpha, and so mediates the conjugation process.; GO: 0000772 mating pheromone activity, 0019953 sexual reproduction, 0005576 extracellular region
Probab=43.74 E-value=9.8 Score=12.12 Aligned_cols=8 Identities=13% Similarity=0.331 Sum_probs=4.0
Q ss_pred EecCCCEE
Q psy5057 5 PVEAGDIV 12 (87)
Q Consensus 5 ~~~~g~~l 12 (87)
.+++||.+
T Consensus 5 ~~~~GqP~ 12 (13)
T PF04648_consen 5 RLSPGQPM 12 (13)
T ss_pred eccCCCcC
Confidence 44555544
No 76
>COG5458 Uncharacterized conserved protein [Function unknown]
Probab=42.96 E-value=40 Score=18.31 Aligned_cols=16 Identities=19% Similarity=0.399 Sum_probs=12.8
Q ss_pred CCeEEEEEeceEEEEE
Q psy5057 19 GDFFYVIESGRYEASV 34 (87)
Q Consensus 19 ~~~~~~i~~G~v~~~~ 34 (87)
...+|.|.+|.|..-.
T Consensus 55 GgSLYwViKG~VqcRQ 70 (144)
T COG5458 55 GGSLYWVIKGQVQCRQ 70 (144)
T ss_pred CCeEEEEEeceeehhh
Confidence 4679999999987654
No 77
>PRK10579 hypothetical protein; Provisional
Probab=40.92 E-value=54 Score=16.95 Aligned_cols=58 Identities=12% Similarity=0.065 Sum_probs=32.1
Q ss_pred cCCCEEEecCCCCCeEEEEEeceEEEEEeeCCCceEEEEecCCCeEehhhhhcCCCceeEEEEeccee
Q psy5057 7 EAGDIVIRQGDDGDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSFGELALLYNMPRAATIKATSTGS 74 (87)
Q Consensus 7 ~~g~~l~~~g~~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~a~~~~~ 74 (87)
.||+ +.-+.......-|++|.+++... .+.....+.+|+.|-.. ..+...+++.+.+.
T Consensus 31 ~pGe--y~F~T~~~E~MeivsG~l~V~Lp---g~~ew~~~~aG~sF~Vp-----anssF~l~v~~~t~ 88 (94)
T PRK10579 31 AEGE--YTFSTAEPEEMTVISGALNVLLP---GATDWQVYEAGEVFNVP-----GHSEFHLQVAEPTS 88 (94)
T ss_pred eeeE--EEEcCCCcEEEEEEeeEEEEECC---CCcccEEeCCCCEEEEC-----CCCeEEEEECccee
Confidence 3455 33344456677777777776653 33345667777777532 23344555555443
No 78
>PF07697 7TMR-HDED: 7TM-HD extracellular; InterPro: IPR011624 This entry represents the extracellular domain of the 7TM-HD (7TM Receptors with HD hydrolase) protein family []. These proteins are known or predicted, to posses metal-dependent phospohydrolase activity.
Probab=40.45 E-value=19 Score=20.82 Aligned_cols=15 Identities=27% Similarity=0.694 Sum_probs=12.7
Q ss_pred ecCCCEEEecCCCCC
Q psy5057 6 VEAGDIVIRQGDDGD 20 (87)
Q Consensus 6 ~~~g~~l~~~g~~~~ 20 (87)
+++|+.|+++|+..+
T Consensus 193 V~~Ge~IV~kGe~VT 207 (222)
T PF07697_consen 193 VKKGEVIVRKGEIVT 207 (222)
T ss_pred ccCCCEEecCCcEeC
Confidence 789999999998653
No 79
>COG3257 GlxB Uncharacterized protein, possibly involved in glyoxylate utilization [General function prediction only]
Probab=39.01 E-value=91 Score=19.01 Aligned_cols=59 Identities=14% Similarity=0.184 Sum_probs=33.9
Q ss_pred ecCCCCCeEEEEEeceEEEEEeeCCCceEEEEecCCCeEehhhhhcCCCceeEEEEecceeEEEEch
Q psy5057 14 RQGDDGDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSFGELALLYNMPRAATIKATSTGSLWAMDR 80 (87)
Q Consensus 14 ~~g~~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~a~~~~~~~~~~~ 80 (87)
..++..+.+-||++|.+.+.. +|+ ...+.+|++.= +..+......-.+.+++++..+.+
T Consensus 78 e~d~~ae~~lfVv~Ge~tv~~--~G~---th~l~eggyay---lPpgs~~~~~N~~~~~~rfhw~rk 136 (264)
T COG3257 78 EGDEGAETFLFVVSGEITVKA--EGK---THALREGGYAY---LPPGSGWTLRNAQKEDSRFHWIRK 136 (264)
T ss_pred CCCCcceEEEEEEeeeEEEEE--cCe---EEEeccCCeEE---eCCCCcceEeeccCCceEEEEEee
Confidence 345567899999999998765 222 24456666331 112223333334666777666644
No 80
>PF10794 DUF2606: Protein of unknown function (DUF2606); InterPro: IPR019730 This entry represents bacterial proteins with unknown function.
Probab=36.34 E-value=75 Score=17.29 Aligned_cols=28 Identities=21% Similarity=0.571 Sum_probs=21.4
Q ss_pred cCCCEEEecCCCCCeEEEEEeceEEEEE
Q psy5057 7 EAGDIVIRQGDDGDFFYVIESGRYEASV 34 (87)
Q Consensus 7 ~~g~~l~~~g~~~~~~~~i~~G~v~~~~ 34 (87)
+-|.++++.+...++..++.+|..+..+
T Consensus 86 ~~Gki~Wk~~~kG~Y~v~l~n~e~~~~~ 113 (131)
T PF10794_consen 86 EEGKIIWKNGRKGKYIVFLPNGETQETR 113 (131)
T ss_pred CCCcEEEecCCcceEEEEEcCCCceeEE
Confidence 4577888888888888888888877663
No 81
>TIGR02594 conserved hypothetical protein TIGR02594. Members of this protein family known so far are restricted to the bacteria, and for the most to the proteobacteria. The function is unknown.
Probab=35.46 E-value=77 Score=17.16 Aligned_cols=21 Identities=14% Similarity=0.259 Sum_probs=13.9
Q ss_pred cCCCEEEecCCCCCeEEEEEe
Q psy5057 7 EAGDIVIRQGDDGDFFYVIES 27 (87)
Q Consensus 7 ~~g~~l~~~g~~~~~~~~i~~ 27 (87)
++|++++-......|+-|++.
T Consensus 75 ~~GDiv~f~~~~~~HVGi~~g 95 (129)
T TIGR02594 75 AYGCIAVKRRGGGGHVGFVVG 95 (129)
T ss_pred CccEEEEEECCCCCEEEEEEe
Confidence 678888865444667777663
No 82
>PF04622 ERG2_Sigma1R: ERG2 and Sigma1 receptor like protein; InterPro: IPR006716 This family consists of the fungal C-8 sterol isomerase and mammalian sigma1 receptor. C-8 sterol isomerase (delta-8--delta-7 sterol isomerase), catalyses a reaction in ergosterol biosynthesis, which results in unsaturation at C-7 in the B ring of sterols []. Sigma 1 receptor is a low molecular mass mammalian protein located in the endoplasmic reticulum [], which interacts with endogenous steroid hormones, such as progesterone and testosterone []. It also binds the sigma ligands, which are a set of chemically unrelated drugs including haloperidol, pentazocine, and ditolylguanidine []. Sigma1 effectors are not well understood, but sigma1 agonists have been observed to affect NMDA receptor function, the alpha-adrenergic system and opioid analgesia.; GO: 0000247 C-8 sterol isomerase activity, 0006696 ergosterol biosynthetic process, 0005783 endoplasmic reticulum
Probab=33.47 E-value=93 Score=18.69 Aligned_cols=31 Identities=23% Similarity=0.165 Sum_probs=21.4
Q ss_pred CCeEEEEEeceEEEEEeeCCCceEEEEecCCCeE
Q psy5057 19 GDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSF 52 (87)
Q Consensus 19 ~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~ 52 (87)
++..+.|++|++..... .......+.||+..
T Consensus 119 ad~y~tIL~G~~~~~~~---g~~~~evy~pGd~~ 149 (216)
T PF04622_consen 119 ADDYFTILSGEQWAWSP---GSLEPEVYKPGDSH 149 (216)
T ss_pred eeeEEEEEEEEEEEEcC---CCCCceEeccCCEE
Confidence 56778899999877664 22335667777754
No 83
>PF14499 DUF4437: Domain of unknown function (DUF4437); PDB: 2QDR_A.
Probab=33.06 E-value=26 Score=21.42 Aligned_cols=53 Identities=11% Similarity=0.087 Sum_probs=24.0
Q ss_pred CCCeEEEEEeceEEEEEeeCCCceEEEEecCCCeEehhhhhcCCCceeEEEEecceeEEE
Q psy5057 18 DGDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSFGELALLYNMPRAATIKATSTGSLWA 77 (87)
Q Consensus 18 ~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~a~~~~~~~~ 77 (87)
.++.=-++++|.+..... +......+.+|++|+...-. .+... .+.++|.++.
T Consensus 190 ~~~eraVvI~G~~~~~~~---~~~~~~~L~~GSYf~s~~~~---~H~~~-~~e~~~vlyI 242 (251)
T PF14499_consen 190 ASNERAVVISGELDYQSY---GASNFGTLDPGSYFGSPGHI---THGIF-ITEDECVLYI 242 (251)
T ss_dssp -S-EEEEEEEEEEEETTE---EEETTEEEEE-TT-EE--E----------EESS-EEEEE
T ss_pred CCceEEEEEEeEEEEeec---ccCCCccccCCcccccCCcc---ccccc-ccCCCEEEEE
Confidence 345556788888776332 12233567888888754222 22222 5566666554
No 84
>PF00122 E1-E2_ATPase: E1-E2 ATPase p-type cation-transporting ATPase superfamily signature H+-transporting ATPase (proton pump) signature sodium/potassium-transporting ATPase signature; InterPro: IPR008250 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include: F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. P-ATPases (sometime known as E1-E2 ATPases) (3.6.3.- from EC) are found in bacteria and in a number of eukaryotic plasma membranes and organelles []. P-ATPases function to transport a variety of different compounds, including ions and phospholipids, across a membrane using ATP hydrolysis for energy. There are many different classes of P-ATPases, each of which transports a specific type of ion: H+, Na+, K+, Mg2+, Ca2+, Ag+ and Ag2+, Zn2+, Co2+, Pb2+, Ni2+, Cd2+, Cu+ and Cu2+. P-ATPases can be composed of one or two polypeptides, and can usually assume two main conformations called E1 and E2. This entry represents the actuator (A) domain, and some transmembrane helices found in P-type ATPases []. It contains the TGES-loop which is essential for the metal ion binding which results in tight association between the A and P (phosphorylation) domains []. It does not contain the phosphorylation site. It is thought that the large movement of the actuator domain, which is transmitted to the transmembrane helices, is essential to the long distance coupling between formation/decomposition of the acyl phosphate in the cytoplasmic P-domain and the changes in the ion-binding sites buried deep in the membranous region []. This domain has a modulatory effect on the phosphoenzyme processing steps through its nucleotide binding [],[]. P-type (or E1-E2-type) ATPases that form an aspartyl phosphate intermediate in the course of ATP hydrolysis, can be divided into 4 major groups []: (1) Ca2+-transporting ATPases; (2) Na+/K+- and gastric H+/K+-transporting ATPases; (3) plasma membrane H+-transporting ATPases (proton pumps) of plants, fungi and lower eukaryotes; and (4) all bacterial P-type ATPases, except the g2+-ATPase of Salmonella typhimurium, which is more similar to the eukaryotic sequences. However, great variety of sequence analysis methods results in diversity of classification. More information about this protein can be found at Protein of the Month: ATP Synthases [].; GO: 0000166 nucleotide binding, 0046872 metal ion binding; PDB: 2XZB_A 1MHS_B 3TLM_A 3A3Y_A 2ZXE_A 3NAL_A 3NAM_A 3NAN_A 2YJ6_B 2IYE_A ....
Probab=32.02 E-value=67 Score=18.65 Aligned_cols=42 Identities=19% Similarity=0.228 Sum_probs=27.2
Q ss_pred EecCCCEE-EecCCCCCeEEEEEe-ceEEEEEee-CCCceEEEEe
Q psy5057 5 PVEAGDIV-IRQGDDGDFFYVIES-GRYEASVEI-DGEDKLMHAY 46 (87)
Q Consensus 5 ~~~~g~~l-~~~g~~~~~~~~i~~-G~v~~~~~~-~~~~~~~~~~ 46 (87)
...+||++ +++|+..+.=..+++ |.+.+.... +|+.......
T Consensus 51 ~L~~GDiI~l~~g~~vPaD~~ll~~g~~~vd~s~ltGes~pv~k~ 95 (230)
T PF00122_consen 51 ELVPGDIIILKAGDIVPADGILLESGSAYVDESALTGESEPVKKT 95 (230)
T ss_dssp GT-TTSEEEEETTEBESSEEEEEESSEEEEECHHHHSBSSEEEES
T ss_pred hccceeeeecccccccccCccceeccccccccccccccccccccc
Confidence 34788888 577886666667777 887765544 6666555543
No 85
>PF05523 FdtA: WxcM-like, C-terminal ; InterPro: IPR008894 This entry includes FdtA (Q6T1W8 from SWISSPROT) from Aneurinibacillus thermoaerophilus, which has been characterised as a dTDP-6-deoxy-3,4-keto-hexulose isomerase []. It also includes WxcM (Q93S92 from SWISSPROT) from Xanthomonas campestris pv campestris) []. ; PDB: 2PAK_A 2PAE_A 2PA7_B 2PAM_A.
Probab=31.49 E-value=92 Score=16.83 Aligned_cols=33 Identities=9% Similarity=0.350 Sum_probs=17.2
Q ss_pred CCCCeEEEEEeceEEEEEeeCCCceEEEEecCCC
Q psy5057 17 DDGDFFYVIESGRYEASVEIDGEDKLMHAYEDKG 50 (87)
Q Consensus 17 ~~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~ 50 (87)
.....++++++|.+.+...+ ++......+..++
T Consensus 52 k~~~~~~~~l~Gs~~v~~~d-~~~~~~~~L~~~~ 84 (131)
T PF05523_consen 52 KKTTQWFIVLSGSFKVVLDD-GREEEEFILDEPN 84 (131)
T ss_dssp SS--EEEEEEES-EEEEEE--SS-EEEEEE--TT
T ss_pred ccccEEEEEEeCEEEEEEec-CCCcEEEEECCCC
Confidence 35678889999999877543 3443444555554
No 86
>PF02311 AraC_binding: AraC-like ligand binding domain; InterPro: IPR003313 This entry defines the arabinose-binding and dimerisation domain of the bacterial gene regulatory protein AraC. The crystal structure of the arabinose-binding and dimerization domain of the Escherichia coli gene regulatory protein AraC was determined in the presence and absence of L-arabinose. The arabinose-bound molecule shows that the protein adopts an unusual fold, binding sugar within a beta barrel and completely burying the arabinose with the amino-terminal arm of the protein. Dimer contacts in the presence of arabinose are mediated by an antiparallel coiled-coil. In the uncomplexed protein, the amino-terminal arm is disordered, uncovering the sugar-binding pocket and allowing it to serve as an oligomerization interface [].; GO: 0006355 regulation of transcription, DNA-dependent; PDB: 1XJA_B 2ARA_A 2AAC_B 2ARC_A.
Probab=30.97 E-value=81 Score=16.01 Aligned_cols=30 Identities=13% Similarity=0.011 Sum_probs=17.2
Q ss_pred CCCeEEEEEeceEEEEEeeCCCceEEEEecCCCeE
Q psy5057 18 DGDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSF 52 (87)
Q Consensus 18 ~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~ 52 (87)
..-.+.++.+|...... +. ....++||+++
T Consensus 22 ~~~~i~~v~~G~~~~~~----~~-~~~~l~~g~~~ 51 (136)
T PF02311_consen 22 DFYEIIYVLSGEGTLHI----DG-QEYPLKPGDLF 51 (136)
T ss_dssp -SEEEEEEEEE-EEEEE----TT-EEEEE-TT-EE
T ss_pred CCEEEEEEeCCEEEEEE----CC-EEEEEECCEEE
Confidence 34578899999987644 22 23667888865
No 87
>COG4820 EutJ Ethanolamine utilization protein, possible chaperonin [Amino acid transport and metabolism]
Probab=28.90 E-value=1.1e+02 Score=18.57 Aligned_cols=30 Identities=17% Similarity=0.387 Sum_probs=18.0
Q ss_pred EecCCCEEEecCCCC--CeEEEEEeceEEEEE
Q psy5057 5 PVEAGDIVIRQGDDG--DFFYVIESGRYEASV 34 (87)
Q Consensus 5 ~~~~g~~l~~~g~~~--~~~~~i~~G~v~~~~ 34 (87)
.+++|.+++..+++. .++-+++-|...+..
T Consensus 156 i~kkGkViy~ADEpTGGtHmtLvlAG~ygi~~ 187 (277)
T COG4820 156 IVKKGKVIYSADEPTGGTHMTLVLAGNYGISL 187 (277)
T ss_pred EEEcCcEEEeccCCCCceeEEEEEecccCcCH
Confidence 456666776666543 566666666655443
No 88
>PF13533 Biotin_lipoyl_2: Biotin-lipoyl like
Probab=28.24 E-value=57 Score=14.27 Aligned_cols=12 Identities=25% Similarity=0.562 Sum_probs=6.8
Q ss_pred ecCCCEEEecCC
Q psy5057 6 VEAGDIVIRQGD 17 (87)
Q Consensus 6 ~~~g~~l~~~g~ 17 (87)
.++|++|+.-+.
T Consensus 24 VkkGd~L~~ld~ 35 (50)
T PF13533_consen 24 VKKGDVLLVLDS 35 (50)
T ss_pred EcCCCEEEEECc
Confidence 466666665443
No 89
>COG4766 EutQ Ethanolamine utilization protein [Amino acid transport and metabolism]
Probab=28.00 E-value=1.3e+02 Score=17.29 Aligned_cols=30 Identities=13% Similarity=0.150 Sum_probs=20.1
Q ss_pred CCCeEEEEEeceEEEEEeeCCCceEEEEecCCCeE
Q psy5057 18 DGDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSF 52 (87)
Q Consensus 18 ~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~ 52 (87)
..+.+.+|++|++.+... |+ ...-+|||++
T Consensus 117 ~yDe~d~VlEGrL~V~~~--g~---tv~a~aGDvi 146 (176)
T COG4766 117 NYDEIDYVLEGRLHVRID--GR---TVIAGAGDVI 146 (176)
T ss_pred cccceeEEEeeeEEEEEc--CC---eEecCCCcEE
Confidence 457888999999988763 22 2344677643
No 90
>PF06449 DUF1082: Mitochondrial domain of unknown function (DUF1082); InterPro: IPR009455 The domain is found exclusively in plant mitochonchria and is a putative homing endonuclease, though such a function remains to be demonstrated. The domain is found C-terminal to the plant mitochondrial ATPase subunit 8 domain IPR003319 from INTERPRO.; GO: 0016820 hydrolase activity, acting on acid anhydrides, catalyzing transmembrane movement of substances, 0005739 mitochondrion, 0016021 integral to membrane
Probab=27.62 E-value=70 Score=14.25 Aligned_cols=25 Identities=28% Similarity=0.594 Sum_probs=17.8
Q ss_pred CCEEEecCCCCCeEEEEEeceEEEE
Q psy5057 9 GDIVIRQGDDGDFFYVIESGRYEAS 33 (87)
Q Consensus 9 g~~l~~~g~~~~~~~~i~~G~v~~~ 33 (87)
|++.-..|...|-+|+|.+-....+
T Consensus 16 GEIsgSrgmErnI~YlIskssyst~ 40 (51)
T PF06449_consen 16 GEISGSRGMERNILYLISKSSYSTS 40 (51)
T ss_pred ceeecccccccceEEEEeecccccc
Confidence 5666677888888888887665443
No 91
>KOG1758|consensus
Probab=27.53 E-value=1.3e+02 Score=17.16 Aligned_cols=24 Identities=21% Similarity=0.387 Sum_probs=10.3
Q ss_pred cCCCEEEecCCCCCeEEEEEeceE
Q psy5057 7 EAGDIVIRQGDDGDFFYVIESGRY 30 (87)
Q Consensus 7 ~~g~~l~~~g~~~~~~~~i~~G~v 30 (87)
++|=+-..+++....-|||.+|.+
T Consensus 72 kPGvvsV~~~~~~~~k~FvSsGfa 95 (159)
T KOG1758|consen 72 KPGVVSVHEGSGTKSKYFVSSGFA 95 (159)
T ss_pred ccceEEEEeCCCcEEEEEEecceE
Confidence 344444444444333444444443
No 92
>PF00877 NLPC_P60: NlpC/P60 family; InterPro: IPR000064 The Escherichia coli NLPC/Listeria P60 domain occurs at the C terminus of a number of different bacterial and viral proteins. The viral proteins are either described as tail assembly proteins or Gp19. In bacteria, the proteins are variously described as being putative tail component of prophage, invasin, invasion associated protein, putative lipoprotein, cell wall hydrolase, or putative endopeptidase. The E. coli NLPC/Listeria P60 domain is contained within the boundaries of the cysteine peptidase domain that defines the MEROPS peptidase family C40 (clan C-). A type example being dipeptidyl-peptidase VI from Bacillus sphaericus and gamma-glutamyl-diamino acid-endopeptidase precursor from Lactococcus lactis 3.4.19.11 from EC. This group also contains proteins classified as non-peptidase homologues in that they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity of peptidases in the C40 family. ; PDB: 3PVQ_B 3GT2_A 3NPF_B 2K1G_A 3I86_A 3S0Q_A 2XIV_A 3PBC_A 3NE0_A 3M1U_B ....
Probab=25.11 E-value=1.1e+02 Score=15.45 Aligned_cols=22 Identities=14% Similarity=0.277 Sum_probs=13.8
Q ss_pred EecCCCEEEecC-CCCCeEEEEE
Q psy5057 5 PVEAGDIVIRQG-DDGDFFYVIE 26 (87)
Q Consensus 5 ~~~~g~~l~~~g-~~~~~~~~i~ 26 (87)
..++||+|+-.+ ....++-+.+
T Consensus 51 ~~~pGDlif~~~~~~~~Hvgiy~ 73 (105)
T PF00877_consen 51 ELQPGDLIFFKGGGGISHVGIYL 73 (105)
T ss_dssp G-TTTEEEEEEGTGGEEEEEEEE
T ss_pred cCCcccEEEEeCCccCCEeEEEE
Confidence 357888888776 4445555555
No 93
>PF03079 ARD: ARD/ARD' family; InterPro: IPR004313 The two acireductone dioxygenase enzymes (ARD and ARD', previously known as E-2 and E-2') from Klebsiella pneumoniae share the same amino acid sequence Q9ZFE7 from SWISSPROT, but bind different metal ions: ARD binds Ni2+, ARD' binds Fe2+ []. ARD and ARD' can be experimentally interconverted by removal of the bound metal ion and reconstitution with the appropriate metal ion. The two enzymes share the same substrate, 1,2-dihydroxy-3-keto-5-(methylthio)pentene, but yield different products. ARD' yields the alpha-keto precursor of methionine (and formate), thus forming part of the ubiquitous methionine salvage pathway that converts 5'-methylthioadenosine (MTA) to methionine. This pathway is responsible for the tight control of the concentration of MTA, which is a powerful inhibitor of polyamine biosynthesis and transmethylation reactions []. ARD yields methylthiopropanoate, carbon monoxide and formate, and thus prevents the conversion of MTA to methionine. The role of the ARD catalysed reaction is unclear: methylthiopropanoate is cytotoxic, and carbon monoxide can activate guanylyl cyclase, leading to increased intracellular cGMP levels [, ]. This family also contains other proteins, whose functions are not well characterised.; GO: 0010309 acireductone dioxygenase [iron(II)-requiring] activity, 0055114 oxidation-reduction process; PDB: 1VR3_A 1ZRR_A 2HJI_A.
Probab=24.25 E-value=1.5e+02 Score=16.80 Aligned_cols=53 Identities=9% Similarity=-0.151 Sum_probs=27.4
Q ss_pred CeEEEEEeceEEEEEeeCCCceEEEEecCCCeEehhhhhcCCCceeEEEEecceeE
Q psy5057 20 DFFYVIESGRYEASVEIDGEDKLMHAYEDKGSFGELALLYNMPRAATIKATSTGSL 75 (87)
Q Consensus 20 ~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~a~~~~~~ 75 (87)
+.+-++++|..-......+..-..-.+.+||++-. ..+..+..++........
T Consensus 93 eEvR~i~~G~g~Fdvr~~~~~wiri~~e~GDli~v---P~g~~HrF~~~~~~~i~a 145 (157)
T PF03079_consen 93 EEVRYIVDGSGYFDVRDGDDVWIRILCEKGDLIVV---PAGTYHRFTLGESPYIKA 145 (157)
T ss_dssp -EEEEEEECEEEEEEE-TTCEEEEEEEETTCEEEE----TT--EEEEESTTSSEEE
T ss_pred heEEEEeCcEEEEEEEcCCCEEEEEEEcCCCEEec---CCCCceeEEcCCCCcEEE
Confidence 56778899986544433334444467889997643 333444444333333333
No 94
>COG3435 Gentisate 1,2-dioxygenase [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=23.63 E-value=2.1e+02 Score=18.45 Aligned_cols=58 Identities=7% Similarity=0.108 Sum_probs=37.3
Q ss_pred CCCeEEEEEeceEEEEEeeCCCceEEEEecCCCeEehhhhhcCCCceeEEEE-ecceeEEEEchhhhhc
Q psy5057 18 DGDFFYVIESGRYEASVEIDGEDKLMHAYEDKGSFGELALLYNMPRAATIKA-TSTGSLWAMDRKTFYC 85 (87)
Q Consensus 18 ~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~a-~~~~~~~~~~~~~~~~ 85 (87)
..+.+|-|.+|.-.+.. +.++ .-..+||+|-..+.--. ...+ .+++.+++++-....+
T Consensus 280 t~s~iy~V~eGsg~~~I----g~~r-f~~~~~D~fvVPsW~~~-----~~~~gs~da~LFsfsD~PV~e 338 (351)
T COG3435 280 TDSTIYHVVEGSGYTII----GGER-FDWSAGDIFVVPSWAWH-----EHVNGSEDAVLFSFSDRPVME 338 (351)
T ss_pred cCCEEEEEEecceeEEE----CCEE-eeccCCCEEEccCccee-----ecccCCcceEEEecCCcHHHH
Confidence 45788999999876655 3333 45588998876554432 2223 6778888887655544
No 95
>PF14157 YmzC: YmzC-like protein; PDB: 3KVP_E.
Probab=22.76 E-value=1.1e+02 Score=14.62 Aligned_cols=22 Identities=23% Similarity=0.295 Sum_probs=17.0
Q ss_pred CeEEEE-EeceEEEEEee-CCCce
Q psy5057 20 DFFYVI-ESGRYEASVEI-DGEDK 41 (87)
Q Consensus 20 ~~~~~i-~~G~v~~~~~~-~~~~~ 41 (87)
+++-++ .+|.+++++.+ +.+++
T Consensus 30 N~Fav~~e~~~iKIfkyd~~tNei 53 (63)
T PF14157_consen 30 NHFAVVDEDGQIKIFKYDEDTNEI 53 (63)
T ss_dssp TEEEEE-ETTEEEEEEEETTTTEE
T ss_pred CEEEEEecCCeEEEEEeCCCCCeE
Confidence 788888 89999999987 43333
No 96
>PF02839 CBM_5_12: Carbohydrate binding domain; InterPro: IPR003610 A carbohydrate-binding module (CBM) is defined as a contiguous amino acid sequence within a carbohydrate-active enzyme with a discreet fold having carbohydrate-binding activity. A few exceptions are CBMs in cellulosomal scaffolding proteins and rare instances of independent putative CBMs. The requirement of CBMs existing as modules within larger enzymes sets this class of carbohydrate-binding protein apart from other non-catalytic sugar binding proteins such as lectins and sugar transport proteins. CBMs were previously classified as cellulose-binding domains (CBDs) based on the initial discovery of several modules that bound cellulose [, ]. However, additional modules in carbohydrate-active enzymes are continually being found that bind carbohydrates other than cellulose yet otherwise meet the CBM criteria, hence the need to reclassify these polypeptides using more inclusive terminology. Previous classification of cellulose-binding domains were based on amino acid similarity. Groupings of CBDs were called "Types" and numbered with roman numerals (e.g. Type I or Type II CBDs). In keeping with the glycoside hydrolase classification, these groupings are now called families and numbered with Arabic numerals. Families 1 to 13 are the same as Types I to XIII. For a detailed review on the structure and binding modes of CBMs see []. This entry represents CBM5 from CAZY and CBM12 from CAZY. These modules have a core structure consisting of a 3-stranded meander beta-sheet, which contain six aromatic groups that may be important for binding. CBM5/12 is found in proteins such as chitinase A1, chitinase B [], and endoglucanase Z []. The overall topology of the CBM is structurally similar to the C-terminal chitin-binding domains (ChBD) of chitinase A1 and chitinase B, however the binding mechanism for the ChBD may be different from that of the CBM [].; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0030246 carbohydrate binding, 0005975 carbohydrate metabolic process, 0005576 extracellular region; PDB: 1ED7_A 1W1V_A 1E15_B 1UR8_A 1E6Z_B 1E6P_A 1W1T_A 1W1P_B 1UR9_A 1E6R_A ....
Probab=22.54 E-value=78 Score=12.98 Aligned_cols=12 Identities=42% Similarity=0.509 Sum_probs=6.0
Q ss_pred EecCCCEEEecC
Q psy5057 5 PVEAGDIVIRQG 16 (87)
Q Consensus 5 ~~~~g~~l~~~g 16 (87)
.|.+|+++...|
T Consensus 9 ~Y~~Gd~V~~~g 20 (41)
T PF02839_consen 9 TYNAGDRVSYNG 20 (41)
T ss_dssp EE-TT-EEEETT
T ss_pred EEcCCCEEEECC
Confidence 466666666554
No 97
>PLN02288 mannose-6-phosphate isomerase
Probab=22.06 E-value=2.5e+02 Score=18.61 Aligned_cols=15 Identities=13% Similarity=0.222 Sum_probs=10.0
Q ss_pred CCCeEEEEEeceEEE
Q psy5057 18 DGDFFYVIESGRYEA 32 (87)
Q Consensus 18 ~~~~~~~i~~G~v~~ 32 (87)
....+.++++|.+++
T Consensus 353 ~gp~Illv~~G~~~i 367 (394)
T PLN02288 353 PGPSVFLVIEGEGVL 367 (394)
T ss_pred CCCEEEEEEcCEEEE
Confidence 345677777777765
No 98
>PF11132 SplA: Transcriptional regulator protein (SplA); InterPro: IPR022608 The SplA protein functions in trans as a negative regulator of the level of splB-lacZ expression in the developing forespore [].
Probab=21.99 E-value=60 Score=15.96 Aligned_cols=12 Identities=58% Similarity=0.667 Sum_probs=6.8
Q ss_pred eeEEecCCCEEE
Q psy5057 2 FEKPVEAGDIVI 13 (87)
Q Consensus 2 ~~~~~~~g~~l~ 13 (87)
....+++||.++
T Consensus 2 ~~~~~~~GD~Vy 13 (75)
T PF11132_consen 2 DMKPYHAGDIVY 13 (75)
T ss_pred CccccCCCCEEE
Confidence 334566666655
No 99
>PRK03999 translation initiation factor IF-5A; Provisional
Probab=21.98 E-value=1.5e+02 Score=16.10 Aligned_cols=16 Identities=19% Similarity=0.405 Sum_probs=11.5
Q ss_pred EEecCCCEEEecCCCC
Q psy5057 4 KPVEAGDIVIRQGDDG 19 (87)
Q Consensus 4 ~~~~~g~~l~~~g~~~ 19 (87)
-.+++|.+|...|+++
T Consensus 9 ~~lrkG~~i~~~g~p~ 24 (129)
T PRK03999 9 GELKEGSYVVIDGEPC 24 (129)
T ss_pred HHccCCCEEEECCEEE
Confidence 3568899998776644
No 100
>PF00018 SH3_1: SH3 domain; InterPro: IPR001452 SH3 (src Homology-3) domains are small protein modules containing approximately 50 amino acid residues [, ]. They are found in a great variety of intracellular or membrane-associated proteins [, , ] for example, in a variety of proteins with enzymatic activity, in adaptor proteins that lack catalytic sequences and in cytoskeletal proteins, such as fodrin and yeast actin binding protein ABP-1. The SH3 domain has a characteristic fold which consists of five or six beta-strands arranged as two tightly packed anti-parallel beta sheets. The linker regions may contain short helices []. The surface of the SH3-domain bears a flat, hydrophobic ligand-binding pocket which consists of three shallow grooves defined by conservative aromatic residues in which the ligand adopts an extended left-handed helical arrangement. The ligand binds with low affinity but this may be enhanced by multiple interactions. The region bound by the SH3 domain is in all cases proline-rich and contains PXXP as a core-conserved binding motif. The function of the SH3 domain is not well understood but they may mediate many diverse processes such as increasing local concentration of proteins, altering their subcellular location and mediating the assembly of large multiprotein complexes []. The crystal structure of the SH3 domain of the cytoskeletal protein spectrin, and the solution structures of SH3 domains of phospholipase C (PLC-y) and phosphatidylinositol 3-kinase p85 alpha-subunit, have been determined [, , ]. In spite of relatively limited sequence similarity, their overall structures are similar. The domains belong to the alpha+beta structural class, with 5 to 8 beta-strands forming 2 tightly-packed, anti-parallel beta-sheets arranged in a barrel-like structure, and intervening loops sometimes forming helices. Conserved aliphatic and aromatic residues form a hydrophobic core (A11, L23, A29, V34, W42, L52 and V59 in PLC-y []) and a hydrophobic pocket on the molecular surface (L12, F13, W53 and P55 in PLC-y). The conserved core is believed to stabilise the fold, while the pocket is thought to serve as a binding site for target proteins. Conserved carboxylic amino acids located in the loops, on the periphery of the pocket (D14 and E22), may be involved in protein-protein interactions via proline-rich regions. The N- and C-termini are packed in close proximity, indicating that they are independent structural modules.; GO: 0005515 protein binding; PDB: 1UHF_A 1W1F_A 1WA7_A 1SEM_A 1KFZ_A 2SEM_B 1K76_A 3SEM_B 1X2Q_A 2J06_B ....
Probab=21.85 E-value=84 Score=13.31 Aligned_cols=12 Identities=25% Similarity=0.351 Sum_probs=5.0
Q ss_pred EecCCCEEEecC
Q psy5057 5 PVEAGDIVIRQG 16 (87)
Q Consensus 5 ~~~~g~~l~~~g 16 (87)
.+.+|+.|.--.
T Consensus 15 s~~~Gd~i~v~~ 26 (48)
T PF00018_consen 15 SFKKGDIIEVLE 26 (48)
T ss_dssp EB-TTEEEEEEE
T ss_pred eEECCCEEEEEE
Confidence 344555544433
No 101
>KOG4379|consensus
Probab=20.09 E-value=2.4e+02 Score=19.42 Aligned_cols=62 Identities=13% Similarity=0.027 Sum_probs=32.8
Q ss_pred CCCCeEEEEEeceEEEEEee--CCCceEEEEecCCCeEehhhhhcCCCceeEEEEecceeEEEE
Q psy5057 17 DDGDFFYVIESGRYEASVEI--DGEDKLMHAYEDKGSFGELALLYNMPRAATIKATSTGSLWAM 78 (87)
Q Consensus 17 ~~~~~~~~i~~G~v~~~~~~--~~~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~a~~~~~~~~~ 78 (87)
++....-++.+|.++.+..+ +|+.-....++++.+.=..-+-.+.+....++...|+-+|..
T Consensus 420 ~p~EECD~~~eesm~~~rfd~~sgkith~~~lg~~q~lft~~~~p~~pa~fclRhdVDallW~~ 483 (596)
T KOG4379|consen 420 DPVEECDLISEESMKMVRFDLTSGKITHTIFLGPPQLLFTTTLRPGFPAAFCLRHDVDALLWLQ 483 (596)
T ss_pred CchhhhhhccccceeEEEEeccCceEEEEEecCCCcEEEEEEecCCCcceeEeeccCchhhccc
Confidence 34444555568888888876 555555566666653322222233344444555455554443
Done!