Query psy1122
Match_columns 150
No_of_seqs 114 out of 1376
Neff 10.3
Searched_HMMs 46136
Date Fri Aug 16 21:13:34 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy1122.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/1122hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PRK09392 ftrB transcriptional 99.9 2.7E-22 5.8E-27 139.4 14.6 130 3-135 6-138 (236)
2 PRK11753 DNA-binding transcrip 99.9 1.6E-20 3.6E-25 128.3 16.9 117 14-133 7-128 (211)
3 PRK11161 fumarate/nitrate redu 99.8 1.3E-19 2.9E-24 125.8 15.4 124 6-133 15-143 (235)
4 COG2905 Predicted signal-trans 99.8 1.2E-19 2.5E-24 135.0 13.2 135 3-140 6-140 (610)
5 cd00038 CAP_ED effector domain 99.8 2E-19 4.3E-24 110.8 11.8 109 11-122 1-113 (115)
6 PRK10402 DNA-binding transcrip 99.8 1.8E-19 3.8E-24 124.6 12.8 110 16-128 20-134 (226)
7 KOG0500|consensus 99.8 7.1E-19 1.5E-23 128.7 13.7 126 3-128 306-435 (536)
8 PF00027 cNMP_binding: Cyclic 99.8 5.6E-19 1.2E-23 105.1 8.9 86 30-118 2-91 (91)
9 PLN03192 Voltage-dependent pot 99.8 1.3E-18 2.7E-23 139.3 13.4 122 4-128 374-499 (823)
10 COG0664 Crp cAMP-binding prote 99.8 9.4E-18 2E-22 114.2 15.3 131 6-139 2-136 (214)
11 smart00100 cNMP Cyclic nucleot 99.8 5.8E-18 1.3E-22 104.8 12.5 108 11-119 1-112 (120)
12 PRK09391 fixK transcriptional 99.8 5.8E-17 1.3E-21 112.3 14.1 106 25-136 36-145 (230)
13 TIGR03697 NtcA_cyano global ni 99.7 1.1E-16 2.4E-21 107.9 13.3 101 35-136 1-105 (193)
14 KOG0498|consensus 99.7 1.8E-17 4E-22 128.4 8.9 119 4-124 419-542 (727)
15 PLN02868 acyl-CoA thioesterase 99.7 9E-17 1.9E-21 119.9 12.2 110 4-118 8-120 (413)
16 KOG0499|consensus 99.7 9.1E-17 2E-21 120.2 11.3 125 4-128 527-653 (815)
17 PRK13918 CRP/FNR family transc 99.7 1.5E-15 3.3E-20 103.1 12.2 80 26-109 5-90 (202)
18 KOG1113|consensus 99.6 9.3E-16 2E-20 108.6 5.9 111 4-118 122-232 (368)
19 KOG0614|consensus 99.5 3.6E-15 7.9E-20 110.7 3.4 110 4-117 154-263 (732)
20 KOG0614|consensus 99.5 2E-14 4.3E-19 106.9 3.7 115 3-120 271-391 (732)
21 KOG1113|consensus 99.4 1.3E-12 2.7E-17 92.9 5.4 112 1-115 237-348 (368)
22 KOG2968|consensus 99.2 2.7E-11 5.9E-16 94.9 5.7 113 18-133 499-616 (1158)
23 KOG0501|consensus 99.1 4.6E-11 9.9E-16 90.3 3.7 112 5-118 549-660 (971)
24 PF04831 Popeye: Popeye protei 99.0 1.1E-07 2.4E-12 60.6 15.2 114 14-128 14-131 (153)
25 PRK11832 putative DNA-binding 98.9 1.3E-07 2.7E-12 64.0 13.8 105 21-128 16-121 (207)
26 KOG2968|consensus 98.7 1.1E-07 2.3E-12 75.4 8.1 118 22-140 110-236 (1158)
27 KOG3542|consensus 97.9 1.8E-05 3.8E-10 61.8 4.8 109 3-115 280-390 (1283)
28 KOG3542|consensus 97.2 0.00049 1.1E-08 54.1 4.4 89 5-105 38-126 (1283)
29 PF07883 Cupin_2: Cupin domain 96.5 0.058 1.2E-06 29.8 8.5 68 29-104 2-70 (71)
30 PF05899 Cupin_3: Protein of u 95.7 0.062 1.3E-06 30.5 5.7 45 29-76 11-55 (74)
31 PRK13290 ectC L-ectoine syntha 94.4 0.72 1.6E-05 29.0 8.6 67 29-104 39-106 (125)
32 COG3718 IolB Uncharacterized e 91.7 1.1 2.4E-05 31.3 6.3 75 27-105 31-109 (270)
33 TIGR03037 anthran_nbaC 3-hydro 90.7 2.9 6.2E-05 27.5 7.3 60 39-107 43-104 (159)
34 COG0662 {ManC} Mannose-6-phosp 90.4 1.2 2.7E-05 27.9 5.4 48 26-75 37-85 (127)
35 smart00835 Cupin_1 Cupin. This 89.9 3.6 7.8E-05 26.3 8.2 50 27-76 32-86 (146)
36 COG1917 Uncharacterized conser 89.7 1.4 2.9E-05 27.7 5.3 48 27-76 45-93 (131)
37 COG3450 Predicted enzyme of th 88.8 3.2 7E-05 25.8 6.1 45 29-76 49-93 (116)
38 TIGR03404 bicupin_oxalic bicup 88.5 6.5 0.00014 29.6 8.8 49 27-75 247-299 (367)
39 PRK09943 DNA-binding transcrip 87.0 5.5 0.00012 26.6 7.1 66 31-104 113-179 (185)
40 TIGR03404 bicupin_oxalic bicup 86.3 3.1 6.6E-05 31.3 6.0 50 27-76 69-121 (367)
41 PRK04190 glucose-6-phosphate i 86.0 6.4 0.00014 26.8 6.9 47 29-75 72-130 (191)
42 PRK13264 3-hydroxyanthranilate 85.1 5.4 0.00012 26.7 6.1 57 43-107 52-110 (177)
43 PF00190 Cupin_1: Cupin; Inte 84.4 6 0.00013 25.3 6.0 50 27-76 36-95 (144)
44 PRK11171 hypothetical protein; 84.2 11 0.00023 27.0 7.7 69 28-104 64-134 (266)
45 PF12973 Cupin_7: ChrR Cupin-l 81.9 8 0.00017 22.5 7.9 64 26-103 25-88 (91)
46 PRK11171 hypothetical protein; 81.0 18 0.00038 25.9 8.0 49 26-76 185-234 (266)
47 TIGR02451 anti_sig_ChrR anti-s 79.0 9.1 0.0002 26.5 5.7 65 27-105 129-195 (215)
48 PF04962 KduI: KduI/IolB famil 77.1 14 0.00031 26.4 6.4 75 27-105 29-106 (261)
49 COG3837 Uncharacterized conser 76.8 8.8 0.00019 25.2 4.7 40 39-80 58-97 (161)
50 TIGR03214 ura-cupin putative a 73.5 30 0.00066 24.7 7.5 46 28-75 61-108 (260)
51 TIGR01479 GMP_PMI mannose-1-ph 71.6 32 0.00069 26.8 7.5 46 29-76 380-426 (468)
52 COG2140 Thermophilic glucose-6 68.3 15 0.00033 25.3 4.6 29 47-75 105-136 (209)
53 KOG2378|consensus 67.6 6 0.00013 30.6 2.7 45 71-118 1-46 (573)
54 TIGR03214 ura-cupin putative a 67.2 19 0.00042 25.7 5.2 49 25-75 179-228 (260)
55 PRK15460 cpsB mannose-1-phosph 65.0 55 0.0012 25.8 7.6 70 28-105 388-458 (478)
56 PF01050 MannoseP_isomer: Mann 62.1 41 0.00089 21.9 5.6 46 28-75 66-112 (151)
57 PF06249 EutQ: Ethanolamine ut 60.6 23 0.00051 23.1 4.2 33 41-75 91-123 (152)
58 TIGR02272 gentisate_1_2 gentis 60.4 61 0.0013 24.2 6.8 74 30-113 255-328 (335)
59 COG4101 Predicted mannose-6-ph 59.2 42 0.00091 21.1 5.4 73 9-81 30-104 (142)
60 PF05726 Pirin_C: Pirin C-term 59.2 37 0.00079 20.4 6.4 70 30-111 4-75 (104)
61 PHA00672 hypothetical protein 56.3 47 0.001 21.0 4.8 35 28-64 50-84 (152)
62 PF02041 Auxin_BP: Auxin bindi 56.1 50 0.0011 21.6 5.0 49 28-76 47-102 (167)
63 PF11699 CENP-C_C: Mif2/CENP-C 55.4 29 0.00062 20.3 3.6 29 45-75 33-61 (85)
64 PLN00212 glutelin; Provisional 54.0 31 0.00068 27.2 4.6 52 25-76 348-404 (493)
65 PF12852 Cupin_6: Cupin 53.9 44 0.00096 22.1 4.9 35 41-75 31-65 (186)
66 PHA00514 dsDNA binding protein 53.2 45 0.00097 19.5 4.1 54 60-113 21-81 (98)
67 PF14499 DUF4437: Domain of un 53.1 13 0.00028 26.5 2.3 52 29-80 175-226 (251)
68 PF05523 FdtA: WxcM-like, C-te 52.7 57 0.0012 20.6 6.3 66 39-111 48-116 (131)
69 COG5616 Predicted integral mem 50.5 43 0.00092 21.9 4.1 46 13-58 46-99 (152)
70 PLN02288 mannose-6-phosphate i 48.4 94 0.002 23.8 6.3 30 45-74 354-383 (394)
71 TIGR02272 gentisate_1_2 gentis 47.4 46 0.00099 24.9 4.4 49 28-77 84-132 (335)
72 PF07697 7TMR-HDED: 7TM-HD ext 46.0 15 0.00032 25.0 1.7 32 15-46 174-207 (222)
73 PLN00212 glutelin; Provisional 44.3 1.1E+02 0.0024 24.3 6.2 33 26-58 81-113 (493)
74 PF06560 GPI: Glucose-6-phosph 42.2 63 0.0014 21.8 4.1 33 43-75 81-120 (182)
75 PF06052 3-HAO: 3-hydroxyanthr 41.9 99 0.0021 20.2 6.1 56 45-108 53-110 (151)
76 PRK15457 ethanolamine utilizat 40.8 87 0.0019 22.1 4.7 29 45-75 175-203 (233)
77 PF13376 OmdA: Bacteriocin-pro 37.9 67 0.0015 17.2 3.2 22 107-128 8-29 (63)
78 COG4766 EutQ Ethanolamine util 36.5 82 0.0018 20.7 3.8 28 45-74 118-145 (176)
79 COG3257 GlxB Uncharacterized p 34.8 1.6E+02 0.0036 20.7 7.4 62 37-106 75-136 (264)
80 PRK09685 DNA-binding transcrip 30.6 2E+02 0.0044 20.5 6.3 57 48-112 74-130 (302)
81 KOG1758|consensus 28.1 1.1E+02 0.0024 20.0 3.4 61 48-117 68-129 (159)
82 PF02823 ATP-synt_DE_N: ATP sy 27.3 1.2E+02 0.0026 17.0 3.2 29 35-64 27-55 (80)
83 COG3435 Gentisate 1,2-dioxygen 27.0 1.2E+02 0.0026 22.5 3.7 45 32-77 99-143 (351)
84 TIGR01511 ATPase-IB1_Cu copper 26.1 3.5E+02 0.0075 21.8 6.5 50 32-81 110-163 (562)
85 cd00173 SH2 Src homology 2 dom 26.0 1.3E+02 0.0029 17.0 3.9 28 12-44 4-31 (94)
86 cd06929 NR_LBD_F1 Ligand-bindi 25.4 59 0.0013 21.1 1.9 25 3-27 19-43 (174)
87 PF02311 AraC_binding: AraC-li 25.2 1.6E+02 0.0034 17.5 7.4 67 40-114 18-86 (136)
88 PF10330 Stb3: Putative Sin3 b 24.9 1E+02 0.0022 18.3 2.5 41 5-57 30-71 (92)
89 TIGR01525 ATPase-IB_hvy heavy 24.8 3.6E+02 0.0077 21.6 6.4 45 31-75 73-121 (556)
90 PF13128 DUF3954: Protein of u 24.8 1.2E+02 0.0025 15.8 4.0 16 45-60 9-24 (50)
91 PLN02559 chalcone--flavonone i 24.1 1.2E+02 0.0025 21.5 3.1 39 6-44 117-155 (230)
92 cd03702 IF2_mtIF2_II This fami 23.6 1.7E+02 0.0037 17.3 3.5 41 36-76 19-63 (95)
93 KOG1517|consensus 23.4 1.2E+02 0.0027 26.7 3.6 33 94-126 355-388 (1387)
94 KOG1448|consensus 22.6 1.9E+02 0.0041 21.4 4.0 51 7-57 2-62 (316)
95 PF06865 DUF1255: Protein of u 22.3 1.9E+02 0.0041 17.3 7.4 62 31-102 29-90 (94)
96 PF13099 DUF3944: Domain of un 22.2 44 0.00096 16.0 0.6 20 9-28 7-26 (35)
97 KOG2757|consensus 21.8 3.6E+02 0.0079 20.7 5.4 44 30-75 338-382 (411)
98 PLN02804 chalcone isomerase 21.8 89 0.0019 21.6 2.2 36 6-41 111-146 (206)
99 PRK00529 elongation factor P; 21.4 2.7E+02 0.0058 18.8 5.8 75 30-110 5-89 (186)
100 PF02678 Pirin: Pirin; InterP 21.4 2.1E+02 0.0045 17.4 3.9 30 43-73 48-77 (107)
101 PF10794 DUF2606: Protein of u 21.2 1.6E+02 0.0034 18.6 2.9 29 31-59 84-112 (131)
102 PF07742 BTG: BTG family; Int 20.6 2.3E+02 0.0049 17.7 4.4 39 11-50 77-115 (118)
103 PF04648 MF_alpha: Yeast matin 20.1 41 0.0009 12.2 0.2 8 31-38 5-12 (13)
No 1
>PRK09392 ftrB transcriptional activator FtrB; Provisional
Probab=99.90 E-value=2.7e-22 Score=139.38 Aligned_cols=130 Identities=16% Similarity=0.211 Sum_probs=113.0
Q ss_pred ccccccccccccCCHHHHHHHHHhceEEeecCCceEEecCCccceeEEEeeceEEEEe-cCCc--EEEEecCCCeeeeee
Q psy1122 3 VVCVGVVTIFQECQPEFLHDLVLKMKAYIFTPGDLICRKGEVAREMFIIADGILEVIS-ESGR--VLTTMKAGDFFGEIG 79 (150)
Q Consensus 3 ~~~l~~~~~f~~l~~~~~~~l~~~~~~~~~~~g~~i~~~g~~~~~~y~i~~G~v~~~~-~~g~--~~~~~~~g~~~g~~~ 79 (150)
.++++.+++|..+++++++.+...+..+.|++|++|+.+|++++++|||++|.++++. .+|+ ++..+.+|++||+.+
T Consensus 6 ~~~l~~~~~f~~L~~~~~~~l~~~~~~~~~~~ge~l~~~g~~~~~~~~v~~G~v~~~~~~~~~~~~i~~~~~g~~~g~~~ 85 (236)
T PRK09392 6 LIRLRNLPLFADMADATFERLMRGAFLQRFPPGTMLITEGEPADFLFVVLDGLVELSASSQDRETTLAILRPVSTFILAA 85 (236)
T ss_pred HHHHhcCccccCCCHHHHHHHHhhcceeecCCCCEEEeCCCccceEEEEEeCEEEEEEcCCCceEEEEEeCCCchhhhHH
Confidence 3578999999999999999999999999999999999999999999999999999953 3333 889999999999998
Q ss_pred eeccCCCCceeeEEEEcceeEEEEEcHHHHHHHHhhCchHHHHHHHHHHHHHHHHH
Q psy1122 80 ILNLDGLNKRTADVRSVGYAELFSLSREDVLSAMKDYPEAQEILQNLGRKRLMEAK 135 (150)
Q Consensus 80 ~~~~~~~~~~~~~~~a~~~~~~~~i~~~~~~~l~~~~~~~~~~~~~~~~~r~~~~~ 135 (150)
++.+ .++.+++.|.++|.++.+|.+.|.+++.++|.+...+......++....
T Consensus 86 ~~~~---~~~~~~~~A~~~~~~~~i~~~~~~~l~~~~p~l~~~~~~~l~~~~~~~~ 138 (236)
T PRK09392 86 VVLD---APYLMSARTLTRSRVLMIPAELVREAMSEDPGFMRAVVFELAGCYRGLV 138 (236)
T ss_pred HhCC---CCCceEEEEcCceEEEEEeHHHHHHHHHHCHHHHHHHHHHHHHHHHHHH
Confidence 8876 7899999999999999999999999999999965554444433444443
No 2
>PRK11753 DNA-binding transcriptional dual regulator Crp; Provisional
Probab=99.87 E-value=1.6e-20 Score=128.28 Aligned_cols=117 Identities=22% Similarity=0.392 Sum_probs=102.1
Q ss_pred cCCHHHHHHHHHhceEEeecCCceEEecCCccceeEEEeeceEEEE--ecCCc--EEEEecCCCeeeeeeeeccCCCCce
Q psy1122 14 ECQPEFLHDLVLKMKAYIFTPGDLICRKGEVAREMFIIADGILEVI--SESGR--VLTTMKAGDFFGEIGILNLDGLNKR 89 (150)
Q Consensus 14 ~l~~~~~~~l~~~~~~~~~~~g~~i~~~g~~~~~~y~i~~G~v~~~--~~~g~--~~~~~~~g~~~g~~~~~~~~~~~~~ 89 (150)
-+++++++.+.+.+..+.|++|++|+.+|++++.+|||++|.++++ +++|+ ++..+.+|++||+..++.+. .++
T Consensus 7 ~~~~~~~~~l~~~~~~~~~~kg~~l~~~g~~~~~~y~V~~G~v~~~~~~~~g~~~~~~~~~~g~~~g~~~~~~~~--~~~ 84 (211)
T PRK11753 7 PQTDPTLEWFLSHCHIHKYPAKSTLIHAGEKAETLYYIVKGSVAVLIKDEEGKEMILSYLNQGDFIGELGLFEEG--QER 84 (211)
T ss_pred CCCHHHHHHHHhhCeEEEeCCCCEEEeCCCCCCeEEEEEeCEEEEEEECCCCCEEEEEEcCCCCEEeehhhccCC--CCc
Confidence 4799999999999999999999999999999999999999999995 45676 88999999999999888652 468
Q ss_pred eeEEEEcceeEEEEEcHHHHHHHHhhCchHH-HHHHHHHHHHHHH
Q psy1122 90 TADVRSVGYAELFSLSREDVLSAMKDYPEAQ-EILQNLGRKRLME 133 (150)
Q Consensus 90 ~~~~~a~~~~~~~~i~~~~~~~l~~~~~~~~-~~~~~~~~~r~~~ 133 (150)
.+++.|.++|.++.+|.+.|.+++.++|++. .+++.++. ++..
T Consensus 85 ~~~~~a~~~~~v~~i~~~~~~~l~~~~p~~~~~~~~~~~~-~l~~ 128 (211)
T PRK11753 85 SAWVRAKTACEVAEISYKKFRQLIQVNPDILMALSAQMAR-RLQN 128 (211)
T ss_pred eEEEEEcCcEEEEEEcHHHHHHHHHHCHHHHHHHHHHHHH-HHHH
Confidence 8899999999999999999999999999965 44555544 4433
No 3
>PRK11161 fumarate/nitrate reduction transcriptional regulator; Provisional
Probab=99.85 E-value=1.3e-19 Score=125.76 Aligned_cols=124 Identities=14% Similarity=0.186 Sum_probs=102.5
Q ss_pred cccccccccCCHHHHHHHHHhce-EEeecCCceEEecCCccceeEEEeeceEEEE--ecCCc--EEEEecCCCeeeeeee
Q psy1122 6 VGVVTIFQECQPEFLHDLVLKMK-AYIFTPGDLICRKGEVAREMFIIADGILEVI--SESGR--VLTTMKAGDFFGEIGI 80 (150)
Q Consensus 6 l~~~~~f~~l~~~~~~~l~~~~~-~~~~~~g~~i~~~g~~~~~~y~i~~G~v~~~--~~~g~--~~~~~~~g~~~g~~~~ 80 (150)
+++.+.|..+++++++.|..... .+.|+||++|+.+|++++++|||.+|.++++ +++|+ ++.+..||++||...+
T Consensus 15 ~~~~~~~~~l~~~~l~~L~~~~~~~~~~~kge~l~~~Gd~~~~ly~v~~G~v~~~~~~~~G~e~i~~~~~~gd~~g~~~~ 94 (235)
T PRK11161 15 ISQLCIPFTLNEHELDQLDNIIERKKPIQKGQTLFKAGDELKSLYAIRSGTIKSYTITEQGDEQITGFHLAGDLVGFDAI 94 (235)
T ss_pred ccccccccCCCHHHHHHHHHhhhhceeecCCCEeECCCCCcceEEEEeeceEEEEEECCCCCEEEEEeccCCceeccccc
Confidence 55666666799999999998875 5789999999999999999999999999995 45676 7888899999998655
Q ss_pred eccCCCCceeeEEEEcceeEEEEEcHHHHHHHHhhCchHHHHHHHHHHHHHHH
Q psy1122 81 LNLDGLNKRTADVRSVGYAELFSLSREDVLSAMKDYPEAQEILQNLGRKRLME 133 (150)
Q Consensus 81 ~~~~~~~~~~~~~~a~~~~~~~~i~~~~~~~l~~~~~~~~~~~~~~~~~r~~~ 133 (150)
+. .+++.+++|.+++.++.+|.+.|.+++.++|++...+......++..
T Consensus 95 ~~----~~~~~~~~a~~~~~i~~ip~~~f~~l~~~~p~~~~~~~~~~~~~~~~ 143 (235)
T PRK11161 95 GS----GQHPSFAQALETSMVCEIPFETLDDLSGKMPKLRQQIMRLMSGEIKG 143 (235)
T ss_pred cC----CCCcceEEEeccEEEEEEEHHHHHHHHHHChHHHHHHHHHHHHHHHH
Confidence 43 45567899999999999999999999999999655544444434443
No 4
>COG2905 Predicted signal-transduction protein containing cAMP-binding and CBS domains [Signal transduction mechanisms]
Probab=99.83 E-value=1.2e-19 Score=134.96 Aligned_cols=135 Identities=20% Similarity=0.312 Sum_probs=121.4
Q ss_pred ccccccccccccCCHHHHHHHHHhceEEeecCCceEEecCCccceeEEEeeceEEEEecCCcEEEEecCCCeeeeeeeec
Q psy1122 3 VVCVGVVTIFQECQPEFLHDLVLKMKAYIFTPGDLICRKGEVAREMFIIADGILEVISESGRVLTTMKAGDFFGEIGILN 82 (150)
Q Consensus 3 ~~~l~~~~~f~~l~~~~~~~l~~~~~~~~~~~g~~i~~~g~~~~~~y~i~~G~v~~~~~~g~~~~~~~~g~~~g~~~~~~ 82 (150)
.+++.+.|.|+.|+++++..|...+.+..|.+|++|++.|.|.+++|+|.+|.|.+...+|..+..+..|+.||..+++.
T Consensus 6 ~~Fl~~~pPF~~L~~eel~~L~~~l~v~yy~kge~ii~~~~p~~~l~vi~kG~vev~~~~g~v~~~~~~gdlFg~~~l~~ 85 (610)
T COG2905 6 DQFLQQHPPFSQLPAEELEQLMGALEVKYYRKGEIIIYAGSPVHYLYVIRKGVVEVRSDGGEVLDRLAAGDLFGFSSLFT 85 (610)
T ss_pred HHHHhcCCCcccCCHHHHHHHHhhhccccccCCCeeecCCCCcceeEEEEeceeeEEcCCCeeeeeeccCccccchhhcc
Confidence 57899999999999999999999999999999999999999999999999999999989999999999999999999998
Q ss_pred cCCCCceeeEEEEcceeEEEEEcHHHHHHHHhhCchHHHHHHHHHHHHHHHHHhhhhh
Q psy1122 83 LDGLNKRTADVRSVGYAELFSLSREDVLSAMKDYPEAQEILQNLGRKRLMEAKNLNKK 140 (150)
Q Consensus 83 ~~~~~~~~~~~~a~~~~~~~~i~~~~~~~l~~~~~~~~~~~~~~~~~r~~~~~~~~~~ 140 (150)
. .+....+.|.+|+.+|.||++.|.+++.++|.+.+++..-..+|++..-....+
T Consensus 86 ~---~~~~~~~~aeedsl~y~lp~s~F~ql~~~n~~f~~ff~~~~akR~~~~~~~~~e 140 (610)
T COG2905 86 E---LNKQRYMAAEEDSLCYLLPKSVFMQLMEENPEFADFFLRSLAKRLRDIADRLAE 140 (610)
T ss_pred c---CCCcceeEeeccceEEecCHHHHHHHHHhCcHHHHHHHHHHHHHHHHHHHHHHh
Confidence 6 455566778889999999999999999999999998886666687755554444
No 5
>cd00038 CAP_ED effector domain of the CAP family of transcription factors; members include CAP (or cAMP receptor protein (CRP)), which binds cAMP, FNR (fumarate and nitrate reduction), which uses an iron-sulfur cluster to sense oxygen) and CooA, a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. Cyclic nucleotide-binding domain similar to CAP are also present in cAMP- and cGMP-dependent protein kinases (cAPK and cGPK) and vertebrate cyclic nucleotide-gated ion-channels. Cyclic nucleotide-monophosphate binding domain; proteins that bind cyclic nucleotides (cAMP or cGMP) share a structural domain of about 120 residues; the best studied is the prokaryotic catabolite gene activator, CAP, where such a domain is known to be composed of three alpha-helices and a distinctive eight-stranded, antiparallel beta-barrel structure; three conserved glycine residues are thought to be essential for maintenance of
Probab=99.83 E-value=2e-19 Score=110.84 Aligned_cols=109 Identities=32% Similarity=0.594 Sum_probs=98.9
Q ss_pred ccccCCHHHHHHHHHhceEEeecCCceEEecCCccceeEEEeeceEEEEe--cCCc--EEEEecCCCeeeeeeeeccCCC
Q psy1122 11 IFQECQPEFLHDLVLKMKAYIFTPGDLICRKGEVAREMFIIADGILEVIS--ESGR--VLTTMKAGDFFGEIGILNLDGL 86 (150)
Q Consensus 11 ~f~~l~~~~~~~l~~~~~~~~~~~g~~i~~~g~~~~~~y~i~~G~v~~~~--~~g~--~~~~~~~g~~~g~~~~~~~~~~ 86 (150)
+|..++++.+..+...++.+.|++|++|+.+|++++++|+|.+|.++++. ++|+ .+..+.+|++||...++..
T Consensus 1 ~f~~l~~~~~~~l~~~~~~~~~~~g~~l~~~~~~~~~~~~i~~G~v~~~~~~~~g~~~~~~~~~~g~~~g~~~~~~~--- 77 (115)
T cd00038 1 LFSGLDDEELEELADALEERRFPAGEVIIRQGDPADSLYIVLSGSVEVYKLDEDGREQIVGFLGPGDLFGELALLGN--- 77 (115)
T ss_pred CcccCCHHHHHHHHhhceeeeeCCCCEEEcCCCCCCeEEEEEeCEEEEEEECCCCcEEEEEecCCccCcChHHHhcC---
Confidence 47889999999999999999999999999999999999999999999953 4454 8889999999999988855
Q ss_pred CceeeEEEEcceeEEEEEcHHHHHHHHhhCchHHHH
Q psy1122 87 NKRTADVRSVGYAELFSLSREDVLSAMKDYPEAQEI 122 (150)
Q Consensus 87 ~~~~~~~~a~~~~~~~~i~~~~~~~l~~~~~~~~~~ 122 (150)
.++.+++.|.++|.++.+|.+.|.+++.++|.+...
T Consensus 78 ~~~~~~~~a~~~~~~~~i~~~~~~~~~~~~~~~~~~ 113 (115)
T cd00038 78 GPRSATVRALTDSELLVLPRSDFRRLLQEYPELARR 113 (115)
T ss_pred CCCCceEEEcCceEEEEEeHHHHHHHHHHCcHhHHh
Confidence 788999999999999999999999999999986543
No 6
>PRK10402 DNA-binding transcriptional activator YeiL; Provisional
Probab=99.83 E-value=1.8e-19 Score=124.57 Aligned_cols=110 Identities=16% Similarity=0.289 Sum_probs=94.7
Q ss_pred CHHHHHHHHHhceEEeecCCceEEecCCccceeEEEeeceEEEE--ecCCc--EEEEecCCCeeeeeeeeccCCCCceee
Q psy1122 16 QPEFLHDLVLKMKAYIFTPGDLICRKGEVAREMFIIADGILEVI--SESGR--VLTTMKAGDFFGEIGILNLDGLNKRTA 91 (150)
Q Consensus 16 ~~~~~~~l~~~~~~~~~~~g~~i~~~g~~~~~~y~i~~G~v~~~--~~~g~--~~~~~~~g~~~g~~~~~~~~~~~~~~~ 91 (150)
.+-+...+.+.+..+.|++|++|+.+|++++++|||++|.++++ +++|+ ++.++.||++||+.+++.+ .++.+
T Consensus 20 ~~~~~~~i~~~~~~~~~~kge~l~~~G~~~~~~y~V~~G~v~v~~~~~~G~e~~~~~~~~g~~~G~~~~~~~---~~~~~ 96 (226)
T PRK10402 20 KDCFSFDVSADTELFHFLAREYIVQEGQQPSYLFYLTRGRAKLYATLANGKVSLIDFFAAPCFIGEIELIDK---DHETK 96 (226)
T ss_pred hhcCCHHHHhhhhheeeCCCCEEEcCCCCCceEEEEEeCEEEEEEECCCCCEeeeeecCCCCeEEeehhhcC---CCCCc
Confidence 33444567788899999999999999999999999999999995 46777 8899999999999988765 78999
Q ss_pred EEEEcceeEEEEEcHHHHHHHHhhCchHHH-HHHHHHH
Q psy1122 92 DVRSVGYAELFSLSREDVLSAMKDYPEAQE-ILQNLGR 128 (150)
Q Consensus 92 ~~~a~~~~~~~~i~~~~~~~l~~~~~~~~~-~~~~~~~ 128 (150)
++.|.++|.++.+|.+.|..++.++|.+.. +++.++.
T Consensus 97 ~~~A~~~~~i~~i~~~~~~~ll~~~p~~~~~~~~~l~~ 134 (226)
T PRK10402 97 AVQAIEECWCLALPMKDCRPLLLNDALFLRKLCKFLSH 134 (226)
T ss_pred cEEEeccEEEEEEEHHHHHHHHhcCHHHHHHHHHHHHH
Confidence 999999999999999999999999999444 4444444
No 7
>KOG0500|consensus
Probab=99.81 E-value=7.1e-19 Score=128.72 Aligned_cols=126 Identities=50% Similarity=0.942 Sum_probs=114.5
Q ss_pred ccccccccccccCCHHHHHHHHHhceEEeecCCceEEecCCccceeEEEeeceEEEEecCCc-EEEEecCCCeeeeeeee
Q psy1122 3 VVCVGVVTIFQECQPEFLHDLVLKMKAYIFTPGDLICRKGEVAREMFIIADGILEVISESGR-VLTTMKAGDFFGEIGIL 81 (150)
Q Consensus 3 ~~~l~~~~~f~~l~~~~~~~l~~~~~~~~~~~g~~i~~~g~~~~~~y~i~~G~v~~~~~~g~-~~~~~~~g~~~g~~~~~ 81 (150)
.+.|+++++|.++.+..+.++...++...|.|||.|+++||.+..+|+|.+|.++++.+||. .+..+.+|++||+++++
T Consensus 306 ~dTLkkV~iF~~ce~~lL~elVLklk~qvfSPgDyICrKGdvgkEMyIVk~G~L~Vv~dDg~t~~~~L~~G~~FGEisIl 385 (536)
T KOG0500|consen 306 LDTLKKVRIFQDCEAGLLVELVLKLKPQVFSPGDYICRKGDVGKEMYIVKEGKLAVVADDGVTVFVTLKAGSVFGEISIL 385 (536)
T ss_pred HHHHHhhhHHHhcchhHHHHHHHHhcceeeCCCCeEEecCcccceEEEEEccEEEEEecCCcEEEEEecCCceeeeeEEE
Confidence 35689999999999999999999999999999999999999999999999999999988887 89999999999999998
Q ss_pred ccCC---CCceeeEEEEcceeEEEEEcHHHHHHHHhhCchHHHHHHHHHH
Q psy1122 82 NLDG---LNKRTADVRSVGYAELFSLSREDVLSAMKDYPEAQEILQNLGR 128 (150)
Q Consensus 82 ~~~~---~~~~~~~~~a~~~~~~~~i~~~~~~~l~~~~~~~~~~~~~~~~ 128 (150)
.-.+ ..++.++++++..+.+++++++++.+++.++|+....+..-+.
T Consensus 386 ni~g~~~gNRRtanvrSvGYSDlfvLskdDl~~aL~eYP~a~~~L~~kgr 435 (536)
T KOG0500|consen 386 NIKGNKNGNRRTANVRSVGYSDLFVLSKDDLWEALSEYPDARKRLEEKGR 435 (536)
T ss_pred EEcCcccCCcceeeeeeeccceeeEeeHHHHHHHHHhCCHHHHHHHHHHH
Confidence 6543 3678999999999999999999999999999997666654433
No 8
>PF00027 cNMP_binding: Cyclic nucleotide-binding domain; InterPro: IPR000595 Proteins that bind cyclic nucleotides (cAMP or cGMP) share a structural domain of about 120 residues [, , ]. The best studied of these proteins is the prokaryotic catabolite gene activator (also known as the cAMP receptor protein) (gene crp) where such a domain is known to be composed of three alpha-helices and a distinctive eight-stranded, antiparallel beta-barrel structure. There are six invariant amino acids in this domain, three of which are glycine residues that are thought to be essential for maintenance of the structural integrity of the beta-barrel. cAMP- and cGMP-dependent protein kinases (cAPK and cGPK) contain two tandem copies of the cyclic nucleotide-binding domain. The cAPK's are composed of two different subunits, a catalytic chain and a regulatory chain, which contains both copies of the domain. The cGPK's are single chain enzymes that include the two copies of the domain in their N-terminal section. Vertebrate cyclic nucleotide-gated ion-channels also contain this domain. Two such cations channels have been fully characterised, one is found in rod cells where it plays a role in visual signal transduction.; PDB: 1O7F_A 2BYV_E 3E97_A 3U10_A 2H6B_A 3SHR_A 2OZ6_A 1WGP_A 3LA2_A 3LA3_B ....
Probab=99.80 E-value=5.6e-19 Score=105.07 Aligned_cols=86 Identities=29% Similarity=0.569 Sum_probs=79.6
Q ss_pred EeecCCceEEecCCccceeEEEeeceEEEE--ecCCc--EEEEecCCCeeeeeeeeccCCCCceeeEEEEcceeEEEEEc
Q psy1122 30 YIFTPGDLICRKGEVAREMFIIADGILEVI--SESGR--VLTTMKAGDFFGEIGILNLDGLNKRTADVRSVGYAELFSLS 105 (150)
Q Consensus 30 ~~~~~g~~i~~~g~~~~~~y~i~~G~v~~~--~~~g~--~~~~~~~g~~~g~~~~~~~~~~~~~~~~~~a~~~~~~~~i~ 105 (150)
++|++|++|+.+|++++.+|||++|.++++ +.+|+ ++..+.+|++||..+++.. .++.++++|.++|.++.||
T Consensus 2 ~~~~~g~~i~~~g~~~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~~g~~~g~~~~~~~---~~~~~~~~a~~~~~~~~i~ 78 (91)
T PF00027_consen 2 KTYKKGEVIYRQGDPCDHIYIILSGEVKVSSINEDGKEQIIFFLGPGDIFGEIELLTG---KPSPFTVIALTDSEVLRIP 78 (91)
T ss_dssp EEESTTEEEEETTSBESEEEEEEESEEEEEEETTTSEEEEEEEEETTEEESGHHHHHT---SBBSSEEEESSSEEEEEEE
T ss_pred eEECCCCEEEeCCCcCCEEEEEEECceEEEeceecceeeeecceeeeccccceeecCC---CccEEEEEEccCEEEEEEe
Confidence 689999999999999999999999999995 34555 6899999999999999987 6899999999999999999
Q ss_pred HHHHHHHHhhCch
Q psy1122 106 REDVLSAMKDYPE 118 (150)
Q Consensus 106 ~~~~~~l~~~~~~ 118 (150)
.+.|.++++++|+
T Consensus 79 ~~~~~~~~~~~p~ 91 (91)
T PF00027_consen 79 REDFLQLLQQDPE 91 (91)
T ss_dssp HHHHHHHHHHSHH
T ss_pred HHHHHHHHHhCcC
Confidence 9999999999986
No 9
>PLN03192 Voltage-dependent potassium channel; Provisional
Probab=99.80 E-value=1.3e-18 Score=139.28 Aligned_cols=122 Identities=29% Similarity=0.453 Sum_probs=110.7
Q ss_pred cccccccccccCCHHHHHHHHHhceEEeecCCceEEecCCccceeEEEeeceEEEEec-CCc--EEEEecCCCeeeeeee
Q psy1122 4 VCVGVVTIFQECQPEFLHDLVLKMKAYIFTPGDLICRKGEVAREMFIIADGILEVISE-SGR--VLTTMKAGDFFGEIGI 80 (150)
Q Consensus 4 ~~l~~~~~f~~l~~~~~~~l~~~~~~~~~~~g~~i~~~g~~~~~~y~i~~G~v~~~~~-~g~--~~~~~~~g~~~g~~~~ 80 (150)
++++++++|.+++++.+..++..++.+.|+|||.|+.+|++++.+|||++|.|++... +|+ ++..+.+|++||+.++
T Consensus 374 ~~l~~~~lF~~~s~~~l~~L~~~~~~~~~~pge~I~~qge~~~~lY~I~~G~V~i~~~~~~~e~~l~~l~~Gd~FGE~~~ 453 (823)
T PLN03192 374 PVVEKVYLFKGVSREILLLLVTKMKAEYIPPREDVIMQNEAPDDVYIVVSGEVEIIDSEGEKERVVGTLGCGDIFGEVGA 453 (823)
T ss_pred HHHhhCcchhcCCHHHHHHHHHhhheeeeCCCCEEEECCCCCceEEEEEecEEEEEEecCCcceeeEEccCCCEecchHH
Confidence 3578899999999999999999999999999999999999999999999999999643 333 7899999999999999
Q ss_pred eccCCCCceeeEEEEcceeEEEEEcHHHHHHHHhhCch-HHHHHHHHHH
Q psy1122 81 LNLDGLNKRTADVRSVGYAELFSLSREDVLSAMKDYPE-AQEILQNLGR 128 (150)
Q Consensus 81 ~~~~~~~~~~~~~~a~~~~~~~~i~~~~~~~l~~~~~~-~~~~~~~~~~ 128 (150)
+.+ .+++++++|.++|+++.+++++|.++++.+|+ ...+++.+.+
T Consensus 454 l~~---~p~~~t~ra~~~s~ll~l~~~~f~~ll~~~p~d~~~i~~~~l~ 499 (823)
T PLN03192 454 LCC---RPQSFTFRTKTLSQLLRLKTSTLIEAMQTRQEDNVVILKNFLQ 499 (823)
T ss_pred hcC---CCCCCeEEEcccEEEEEEEHHHHHHHHHHhhHHHHHHHHHHHH
Confidence 987 78999999999999999999999999999999 6666666665
No 10
>COG0664 Crp cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]
Probab=99.79 E-value=9.4e-18 Score=114.25 Aligned_cols=131 Identities=27% Similarity=0.471 Sum_probs=109.2
Q ss_pred cccccccccCCHHHHHHHHHhceEEeecCCceEEecCCccceeEEEeeceEEEE--ecCCc--EEEEecCCCeeeeeeee
Q psy1122 6 VGVVTIFQECQPEFLHDLVLKMKAYIFTPGDLICRKGEVAREMFIIADGILEVI--SESGR--VLTTMKAGDFFGEIGIL 81 (150)
Q Consensus 6 l~~~~~f~~l~~~~~~~l~~~~~~~~~~~g~~i~~~g~~~~~~y~i~~G~v~~~--~~~g~--~~~~~~~g~~~g~~~~~ 81 (150)
+...+.|..++.+....+......+.+++|++|+.+|++.+++|+|.+|.++++ .++|+ ++.++.||++||+.+++
T Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~l~~~g~~~~~~y~v~~G~v~~~~~~~~G~~~~~~~~~~g~~fg~~~l~ 81 (214)
T COG0664 2 LKENPLLNLLPSELLELLALKLEVRKLPKGEVLFTEGEEADSLYIILSGIVKLYANTEDGREIILGFLGPGDFFGELALL 81 (214)
T ss_pred cccccccccCCHHHHHHHhhhceeEeeCCCCEEEcCCCcCceEEEEEEeEEEEEEECCCCcEEEEEEecCCchhhhHHHh
Confidence 345567777788888888889999999999999999999999999999999995 45676 88899999999999999
Q ss_pred ccCCCCceeeEEEEcceeEEEEEcHHHHHHHHhhCchHHHHHHHHHHHHHHHHHhhhh
Q psy1122 82 NLDGLNKRTADVRSVGYAELFSLSREDVLSAMKDYPEAQEILQNLGRKRLMEAKNLNK 139 (150)
Q Consensus 82 ~~~~~~~~~~~~~a~~~~~~~~i~~~~~~~l~~~~~~~~~~~~~~~~~r~~~~~~~~~ 139 (150)
.+ .++.+++.|.+|++++.+|.+.|..++...|.+...+.....+++........
T Consensus 82 ~~---~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~p~l~~~l~~~~~~~l~~~~~~~~ 136 (214)
T COG0664 82 GG---DPRSASAVALTDVEVLEIPRKDFLELLAESPKLALALLRLLARRLRQALERLS 136 (214)
T ss_pred cC---CCccceEEEcceEEEEEecHHHHHHHHhhCcHHHHHHHHHHHHHHHHHHHHHH
Confidence 86 58999999999999999999999999877888655555555446555554433
No 11
>smart00100 cNMP Cyclic nucleotide-monophosphate binding domain. Catabolite gene activator protein (CAP) is a prokaryotic homologue of eukaryotic cNMP-binding domains, present in ion channels, and cNMP-dependent kinases.
Probab=99.79 E-value=5.8e-18 Score=104.79 Aligned_cols=108 Identities=28% Similarity=0.552 Sum_probs=96.3
Q ss_pred ccccCCHHHHHHHHHhceEEeecCCceEEecCCccceeEEEeeceEEEEe--cCCc--EEEEecCCCeeeeeeeeccCCC
Q psy1122 11 IFQECQPEFLHDLVLKMKAYIFTPGDLICRKGEVAREMFIIADGILEVIS--ESGR--VLTTMKAGDFFGEIGILNLDGL 86 (150)
Q Consensus 11 ~f~~l~~~~~~~l~~~~~~~~~~~g~~i~~~g~~~~~~y~i~~G~v~~~~--~~g~--~~~~~~~g~~~g~~~~~~~~~~ 86 (150)
+|.+++++.+..+.+.+..+.|++|++|+.+|++++++|||.+|.++++. .+|+ .+..+.+|++||+..++... .
T Consensus 1 ~f~~l~~~~~~~l~~~~~~~~~~~g~~l~~~g~~~~~~y~v~~G~v~~~~~~~~g~~~~~~~~~~g~~~g~~~~~~~~-~ 79 (120)
T smart00100 1 LFKNLDAEELRELADALEPVRYPAGEVIIRQGDVGDSFYIILSGEVRVYKVLEDGREQILGILGPGDFFGELALLTNS-R 79 (120)
T ss_pred CcCCCCHHHHHHHHHhceEEEeCCCCEEEeCCCcCCcEEEEEeeEEEEEEECCCCceEEEEeecCCceechhhhccCC-C
Confidence 47899999999999999999999999999999999999999999999963 3455 89999999999999888221 1
Q ss_pred CceeeEEEEcceeEEEEEcHHHHHHHHhhCchH
Q psy1122 87 NKRTADVRSVGYAELFSLSREDVLSAMKDYPEA 119 (150)
Q Consensus 87 ~~~~~~~~a~~~~~~~~i~~~~~~~l~~~~~~~ 119 (150)
.++..++.+.++|.++.++.+.+...+.+++..
T Consensus 80 ~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~ 112 (120)
T smart00100 80 RAASATAVALELATLLRIDFRDFLQLLQENPQL 112 (120)
T ss_pred cccceEEEEEeeEEEEccCHHHHHHHHHHhHHH
Confidence 577889999999999999999999999999883
No 12
>PRK09391 fixK transcriptional regulator FixK; Provisional
Probab=99.76 E-value=5.8e-17 Score=112.31 Aligned_cols=106 Identities=18% Similarity=0.243 Sum_probs=88.3
Q ss_pred HhceEEeecCCceEEecCCccceeEEEeeceEEEE--ecCCc--EEEEecCCCeeeeeeeeccCCCCceeeEEEEcceeE
Q psy1122 25 LKMKAYIFTPGDLICRKGEVAREMFIIADGILEVI--SESGR--VLTTMKAGDFFGEIGILNLDGLNKRTADVRSVGYAE 100 (150)
Q Consensus 25 ~~~~~~~~~~g~~i~~~g~~~~~~y~i~~G~v~~~--~~~g~--~~~~~~~g~~~g~~~~~~~~~~~~~~~~~~a~~~~~ 100 (150)
.....+.|++|++|+.+|++++++|||++|.++++ +++|+ ++.++.+|++||.. .. .++.+++.|+++|.
T Consensus 36 ~~~~~~~~~kge~l~~~Gd~~~~ly~I~~G~vkl~~~~~~G~e~i~~~~~~Gd~fG~~---~~---~~~~~~~~A~~ds~ 109 (230)
T PRK09391 36 LVASEFSYKKGEEIYGEGEPADYVYQVESGAVRTYRLLSDGRRQIGAFHLPGDVFGLE---SG---STHRFTAEAIVDTT 109 (230)
T ss_pred ceeeeEEECCCCEEECCCCCCCeEEEEEeCEEEEEEECCCCcEEEEEEecCCceeccc---CC---CcCCeEEEEcCceE
Confidence 45678999999999999999999999999999995 46777 88889999999953 22 57889999999999
Q ss_pred EEEEcHHHHHHHHhhCchHHHHHHHHHHHHHHHHHh
Q psy1122 101 LFSLSREDVLSAMKDYPEAQEILQNLGRKRLMEAKN 136 (150)
Q Consensus 101 ~~~i~~~~~~~l~~~~~~~~~~~~~~~~~r~~~~~~ 136 (150)
++.+|.+.|.+++..+|++...+......++.....
T Consensus 110 v~~i~~~~f~~l~~~~p~l~~~l~~~l~~~l~~~~~ 145 (230)
T PRK09391 110 VRLIKRRSLEQAAATDVDVARALLSLTAGGLRHAQD 145 (230)
T ss_pred EEEEEHHHHHHHHhhChHHHHHHHHHHHHHHHHHHH
Confidence 999999999999999999666554444445554433
No 13
>TIGR03697 NtcA_cyano global nitrogen regulator NtcA, cyanobacterial. Members of this protein family, found in the cyanobacteria, are the global nitrogen regulator NtcA. This DNA-binding transcriptional regulator is required for expressing many different ammonia-repressible genes. The consensus NtcA-binding site is G T A N(8)T A C.
Probab=99.74 E-value=1.1e-16 Score=107.90 Aligned_cols=101 Identities=19% Similarity=0.351 Sum_probs=81.7
Q ss_pred CceEEecCCccceeEEEeeceEEEE--ecCCc--EEEEecCCCeeeeeeeeccCCCCceeeEEEEcceeEEEEEcHHHHH
Q psy1122 35 GDLICRKGEVAREMFIIADGILEVI--SESGR--VLTTMKAGDFFGEIGILNLDGLNKRTADVRSVGYAELFSLSREDVL 110 (150)
Q Consensus 35 g~~i~~~g~~~~~~y~i~~G~v~~~--~~~g~--~~~~~~~g~~~g~~~~~~~~~~~~~~~~~~a~~~~~~~~i~~~~~~ 110 (150)
|+.|+.+|++++++|+|++|.|+++ .++|+ ++.++.||++||+.+++.... .++.+++.|.++|.++.+|.+.|.
T Consensus 1 g~~l~~~g~~~~~~~~i~~G~v~~~~~~~~G~e~~l~~~~~g~~~G~~~~~~~~~-~~~~~~~~A~~~~~v~~i~~~~~~ 79 (193)
T TIGR03697 1 GKTIFFPGDPAEKVYFLRRGAVKLSRVYESGEEITVALLRENSVFGVLSLITGHR-SDRFYHAVAFTRVELLAVPIEQVE 79 (193)
T ss_pred CCceecCCCCCCcEEEEEecEEEEEEeCCCCcEeeeEEccCCCEeeeeeeccCCC-CccceEEEEecceEEEEeeHHHHH
Confidence 7899999999999999999999995 46777 789999999999998886520 234578999999999999999999
Q ss_pred HHHhhCchHHHHHHHHHHHHHHHHHh
Q psy1122 111 SAMKDYPEAQEILQNLGRKRLMEAKN 136 (150)
Q Consensus 111 ~l~~~~~~~~~~~~~~~~~r~~~~~~ 136 (150)
+++.++|.+...+......++.....
T Consensus 80 ~l~~~~p~l~~~~~~~l~~~l~~~~~ 105 (193)
T TIGR03697 80 KAIEEDPDLSMLLLQGLSSRILQTEM 105 (193)
T ss_pred HHHHHChHHHHHHHHHHHHHHHHHHH
Confidence 99999999555444433335554443
No 14
>KOG0498|consensus
Probab=99.73 E-value=1.8e-17 Score=128.40 Aligned_cols=119 Identities=34% Similarity=0.602 Sum_probs=106.7
Q ss_pred cccccccccccCCHHHHHHHHHhceEEeecCCceEEecCCccceeEEEeeceEEEEecCC-c--EEEEecCCCeee-eee
Q psy1122 4 VCVGVVTIFQECQPEFLHDLVLKMKAYIFTPGDLICRKGEVAREMFIIADGILEVISESG-R--VLTTMKAGDFFG-EIG 79 (150)
Q Consensus 4 ~~l~~~~~f~~l~~~~~~~l~~~~~~~~~~~g~~i~~~g~~~~~~y~i~~G~v~~~~~~g-~--~~~~~~~g~~~g-~~~ 79 (150)
++++++|+|+++++..++.|+..++...|++|++|+.+||+.+.+|||..|.+.+...+| . ....+++||+|| ++.
T Consensus 419 ~lv~~vpLF~~md~~~L~al~~rlk~~~f~pge~iireGd~v~~myFI~rG~le~~~~~~g~~~~~~~L~~Gd~~GeEl~ 498 (727)
T KOG0498|consen 419 DLVRKVPLFAGMDDGLLDALCSRLKPEYFTPGEYIIREGDPVTDMYFIVRGSLESITTDGGGFFVVAILGPGDFFGEELL 498 (727)
T ss_pred HHHhhCchhhcCCHHHHHHHHHHhhhhccCCCCeEEecCCccceeEEEEeeeEEEEEccCCceEEEEEecCCCccchHHH
Confidence 568999999999999999999999999999999999999999999999999999965443 2 899999999999 777
Q ss_pred eeccCCCCceeeEEEEcceeEEEEEcHHHHHHHHhhCch-HHHHHH
Q psy1122 80 ILNLDGLNKRTADVRSVGYAELFSLSREDVLSAMKDYPE-AQEILQ 124 (150)
Q Consensus 80 ~~~~~~~~~~~~~~~a~~~~~~~~i~~~~~~~l~~~~~~-~~~~~~ 124 (150)
..+.. .|.+.+|+|.+.|+++.+..++|..+++++|. ...+++
T Consensus 499 ~~~~~--~p~t~TVralt~~el~~L~~~dL~~V~~~f~~~~~~~l~ 542 (727)
T KOG0498|consen 499 TWCLD--LPQTRTVRALTYCELFRLSADDLKEVLQQFRRLGSKFLQ 542 (727)
T ss_pred HHHhc--CCCCceeehhhhhhHHhccHHHHHHHHHHhHHHHHHHHH
Confidence 76653 46699999999999999999999999999998 455555
No 15
>PLN02868 acyl-CoA thioesterase family protein
Probab=99.72 E-value=9e-17 Score=119.90 Aligned_cols=110 Identities=22% Similarity=0.231 Sum_probs=98.6
Q ss_pred cccccccccccCCHHHHHHHHHhceEEeecCCceEEecCCccceeEEEeeceEEEEe--cCCc-EEEEecCCCeeeeeee
Q psy1122 4 VCVGVVTIFQECQPEFLHDLVLKMKAYIFTPGDLICRKGEVAREMFIIADGILEVIS--ESGR-VLTTMKAGDFFGEIGI 80 (150)
Q Consensus 4 ~~l~~~~~f~~l~~~~~~~l~~~~~~~~~~~g~~i~~~g~~~~~~y~i~~G~v~~~~--~~g~-~~~~~~~g~~~g~~~~ 80 (150)
++++++++|++|+++++..+...++.+.|++|++|+.+|++++.+|||++|.++++. .+|. ++..+.+|++||+.
T Consensus 8 ~~L~~~~~F~~L~~~~l~~l~~~~~~~~~~~Ge~I~~~Gd~~~~lyiI~~G~V~v~~~~~~ge~~l~~l~~Gd~fG~~-- 85 (413)
T PLN02868 8 EFLGSVPLLQRLPSSSLKKIAEVVVPKRYGKGEYVVREGEPGDGLYFIWKGEAEVSGPAEEESRPEFLLKRYDYFGYG-- 85 (413)
T ss_pred HHHhcCcccccCCHHHHHHHHHhceEEEECCCCEEEeCCCcCceEEEEEeCEEEEEEECCCCcEEEEEeCCCCEeehh--
Confidence 468899999999999999999999999999999999999999999999999999953 4455 78899999999964
Q ss_pred eccCCCCceeeEEEEcceeEEEEEcHHHHHHHHhhCch
Q psy1122 81 LNLDGLNKRTADVRSVGYAELFSLSREDVLSAMKDYPE 118 (150)
Q Consensus 81 ~~~~~~~~~~~~~~a~~~~~~~~i~~~~~~~l~~~~~~ 118 (150)
+.+ .++..+++|.++|+++.||.+.|.-+...++-
T Consensus 86 l~~---~~~~~~~~A~~d~~v~~ip~~~~~~~~~~~~~ 120 (413)
T PLN02868 86 LSG---SVHSADVVAVSELTCLVLPHEHCHLLSPKSIW 120 (413)
T ss_pred hCC---CCcccEEEECCCEEEEEEcHHHHhhhcccccc
Confidence 443 78999999999999999999999998766654
No 16
>KOG0499|consensus
Probab=99.72 E-value=9.1e-17 Score=120.22 Aligned_cols=125 Identities=39% Similarity=0.829 Sum_probs=117.1
Q ss_pred cccccccccccCCHHHHHHHHHhceEEeecCCceEEecCCccceeEEEeeceEEEE-ecCCc-EEEEecCCCeeeeeeee
Q psy1122 4 VCVGVVTIFQECQPEFLHDLVLKMKAYIFTPGDLICRKGEVAREMFIIADGILEVI-SESGR-VLTTMKAGDFFGEIGIL 81 (150)
Q Consensus 4 ~~l~~~~~f~~l~~~~~~~l~~~~~~~~~~~g~~i~~~g~~~~~~y~i~~G~v~~~-~~~g~-~~~~~~~g~~~g~~~~~ 81 (150)
.++.++.+|+++++..+..++..++.+.|-+||.|++.|+.+..+|+|..|.|++. -++|. ++..+.+|++||+++++
T Consensus 527 ~~lSKVqLFq~Cdr~mirDmllrLRsV~yLPgDfVCkKGeiGkEMYIIk~GqvQVlGGp~~~~Vl~tL~~GsVFGEISLL 606 (815)
T KOG0499|consen 527 SILSKVQLFQGCDRQMIRDMLLRLRSVLYLPGDFVCKKGEIGKEMYIIKHGQVQVLGGPDGTKVLVTLKAGSVFGEISLL 606 (815)
T ss_pred hhhhHHHHhhhhHHHHHHHHHHHhhceeecCCceeeecccccceeEEeecceEEEecCCCCCEEEEEecccceeeeeeee
Confidence 46888999999999999999999999999999999999999999999999999996 46666 99999999999999999
Q ss_pred ccCCCCceeeEEEEcceeEEEEEcHHHHHHHHhhCchHHHHHHHHHH
Q psy1122 82 NLDGLNKRTADVRSVGYAELFSLSREDVLSAMKDYPEAQEILQNLGR 128 (150)
Q Consensus 82 ~~~~~~~~~~~~~a~~~~~~~~i~~~~~~~l~~~~~~~~~~~~~~~~ 128 (150)
.-+|..++.++++|..-|.++.++++++++++..||+.+.+++.-+.
T Consensus 607 aigG~nRRTAnV~a~Gf~nLfvL~KkdLneil~~YP~sq~iLrkkAr 653 (815)
T KOG0499|consen 607 AIGGGNRRTANVVAHGFANLFVLDKKDLNEILVHYPDSQRILRKKAR 653 (815)
T ss_pred eecCCCccchhhhhcccceeeEecHhHHHHHHHhCccHHHHHHHHHH
Confidence 87776889999999999999999999999999999999999988886
No 17
>PRK13918 CRP/FNR family transcriptional regulator; Provisional
Probab=99.68 E-value=1.5e-15 Score=103.12 Aligned_cols=80 Identities=18% Similarity=0.359 Sum_probs=70.5
Q ss_pred hceEEeecCCceEEecCC--ccceeEEEeeceEEEE--ecCCc--EEEEecCCCeeeeeeeeccCCCCceeeEEEEccee
Q psy1122 26 KMKAYIFTPGDLICRKGE--VAREMFIIADGILEVI--SESGR--VLTTMKAGDFFGEIGILNLDGLNKRTADVRSVGYA 99 (150)
Q Consensus 26 ~~~~~~~~~g~~i~~~g~--~~~~~y~i~~G~v~~~--~~~g~--~~~~~~~g~~~g~~~~~~~~~~~~~~~~~~a~~~~ 99 (150)
.++.+.|++|++|+.+|+ +++.+|+|++|.++++ +++|+ ++..+.||++||+..++ . .++++++.|.++|
T Consensus 5 ~~~~~~~~kg~~l~~~Gd~~~~~~~y~I~~G~vr~~~~~~~G~e~~l~~~~~Gd~~G~~~~~-~---~~~~~~~~A~~~~ 80 (202)
T PRK13918 5 VVDTVTYRPGAVILYPGVPGPSDMLYRVRSGLVRLHTVDDEGNALTLRYVRPGEYFGEEALA-G---AERAYFAEAVTDS 80 (202)
T ss_pred ccceeEecCCCEEEcCCCCCCCCeEEEEEeeEEEEEEECCCCCEEEEEEecCCCeechHHhc-C---CCCCceEEEcCce
Confidence 467889999999999999 7799999999999995 56777 88999999999986554 3 5788899999999
Q ss_pred EEEEEcHHHH
Q psy1122 100 ELFSLSREDV 109 (150)
Q Consensus 100 ~~~~i~~~~~ 109 (150)
.++.+|.+.|
T Consensus 81 ~v~~i~~~~~ 90 (202)
T PRK13918 81 RIDVLNPALM 90 (202)
T ss_pred EEEEEEHHHc
Confidence 9999999887
No 18
>KOG1113|consensus
Probab=99.61 E-value=9.3e-16 Score=108.62 Aligned_cols=111 Identities=20% Similarity=0.326 Sum_probs=102.3
Q ss_pred cccccccccccCCHHHHHHHHHhceEEeecCCceEEecCCccceeEEEeeceEEEEecCCcEEEEecCCCeeeeeeeecc
Q psy1122 4 VCVGVVTIFQECQPEFLHDLVLKMKAYIFTPGDLICRKGEVAREMFIIADGILEVISESGRVLTTMKAGDFFGEIGILNL 83 (150)
Q Consensus 4 ~~l~~~~~f~~l~~~~~~~l~~~~~~~~~~~g~~i~~~g~~~~~~y~i~~G~v~~~~~~g~~~~~~~~g~~~g~~~~~~~ 83 (150)
++++..-+|.+|+++.+..++..+..+.+++|+.|+++|+.++++|||-+|.+.++. +|+.+..+.||..||++++..+
T Consensus 122 ~a~r~~~LF~~Ld~eq~~~v~dam~~~~v~~G~~Vi~qGdeGd~fYvI~kGt~dVyv-~~~~v~~~~~g~sFGElALmyn 200 (368)
T KOG1113|consen 122 EAFRKNLLFANLDDEQLSQVLDAMFEKRVKAGETVIKQGDEGDNFYVIDKGTFDVYV-NGTYVTTYSPGGSFGELALMYN 200 (368)
T ss_pred HHHHhccccccCCHHHHHHHHHhhceeeecCCcEEEecCCcCCcEEEEecceEEEEE-CCeEEeeeCCCCchhhhHhhhC
Confidence 356777899999999999999999999999999999999999999999999999985 3778899999999999999998
Q ss_pred CCCCceeeEEEEcceeEEEEEcHHHHHHHHhhCch
Q psy1122 84 DGLNKRTADVRSVGYAELFSLSREDVLSAMKDYPE 118 (150)
Q Consensus 84 ~~~~~~~~~~~a~~~~~~~~i~~~~~~~l~~~~~~ 118 (150)
.|+.+|+.|.+++.+|.++...|..++..+..
T Consensus 201 ---~PRaATv~a~t~~klWgldr~SFrrIi~~s~~ 232 (368)
T KOG1113|consen 201 ---PPRAATVVAKSLKKLWGLDRTSFRRIIMKSCI 232 (368)
T ss_pred ---CCcccceeeccccceEEEeeceeEEEeeccch
Confidence 89999999999999999999999887666654
No 19
>KOG0614|consensus
Probab=99.54 E-value=3.6e-15 Score=110.74 Aligned_cols=110 Identities=28% Similarity=0.429 Sum_probs=101.6
Q ss_pred cccccccccccCCHHHHHHHHHhceEEeecCCceEEecCCccceeEEEeeceEEEEecCCcEEEEecCCCeeeeeeeecc
Q psy1122 4 VCVGVVTIFQECQPEFLHDLVLKMKAYIFTPGDLICRKGEVAREMFIIADGILEVISESGRVLTTMKAGDFFGEIGILNL 83 (150)
Q Consensus 4 ~~l~~~~~f~~l~~~~~~~l~~~~~~~~~~~g~~i~~~g~~~~~~y~i~~G~v~~~~~~g~~~~~~~~g~~~g~~~~~~~ 83 (150)
+++..=.++.+|...++.++..++..+.|.+|++|+++|++.+.+|++-+|.+.+. .+|+.+..+++|.+||+++++.+
T Consensus 154 dAi~~NdFLknLd~~Qi~e~v~~Myp~~~~~gs~IIrege~Gs~~yV~aeG~~~V~-~~g~ll~~m~~gtvFGELAILyn 232 (732)
T KOG0614|consen 154 DAIQKNDFLKNLDASQIKELVDCMYPVEYRAGSWIIREGEPGSHLYVSAEGELQVS-REGKLLGKMGAGTVFGELAILYN 232 (732)
T ss_pred HHHHhhHHHHhhhHHHHHHHHHhhCcccccCCcEEEecCCCCceEEEeecceEEEe-eCCeeeeccCCchhhhHHHHHhC
Confidence 34555678899999999999999999999999999999999999999999999987 47789999999999999999998
Q ss_pred CCCCceeeEEEEcceeEEEEEcHHHHHHHHhhCc
Q psy1122 84 DGLNKRTADVRSVGYAELFSLSREDVLSAMKDYP 117 (150)
Q Consensus 84 ~~~~~~~~~~~a~~~~~~~~i~~~~~~~l~~~~~ 117 (150)
.++.++++|++++.+|.|+++.|+.+|+..-
T Consensus 233 ---ctRtAsV~alt~~~lWaidR~vFq~IM~~tg 263 (732)
T KOG0614|consen 233 ---CTRTASVRALTDVRLWAIDREVFQAIMMRTG 263 (732)
T ss_pred ---CcchhhhhhhhhhhHHHHHHHHHHHHHHHHH
Confidence 7999999999999999999999999987653
No 20
>KOG0614|consensus
Probab=99.49 E-value=2e-14 Score=106.90 Aligned_cols=115 Identities=28% Similarity=0.475 Sum_probs=102.6
Q ss_pred ccccccccccccCCHHHHHHHHHhceEEeecCCceEEecCCccceeEEEeeceEEEEecC---Cc--EEEEecCCCeeee
Q psy1122 3 VVCVGVVTIFQECQPEFLHDLVLKMKAYIFTPGDLICRKGEVAREMFIIADGILEVISES---GR--VLTTMKAGDFFGE 77 (150)
Q Consensus 3 ~~~l~~~~~f~~l~~~~~~~l~~~~~~~~~~~g~~i~~~g~~~~~~y~i~~G~v~~~~~~---g~--~~~~~~~g~~~g~ 77 (150)
+.+|+++++|.++|++.+..++..++...|++|+.|+.+|+..+.+|+|.+|.|.+...+ +. .+..+.+||+||+
T Consensus 271 ~~fLrsv~~~q~l~Ee~L~KiaD~le~~~Yd~g~yIirqge~G~~ffii~~G~V~vtq~~e~~~q~~~lr~l~kGd~FGE 350 (732)
T KOG0614|consen 271 MNFLRSVPLFQNLPEELLLKIADVLEEEYYDAGEYIIRQGEKGDTFFIISKGTVKVTQQDEGSTQPQELRTLNKGDYFGE 350 (732)
T ss_pred HHHHHhhhhhccCCHHHHHHHHHHHHHHhhcCCceEEeecCCCCeEEEEecceEEEeecCCCCCchhHHhhccccchhhH
Confidence 468999999999999999999999999999999999999999999999999999996432 22 7899999999999
Q ss_pred eeeeccCCCCceeeEEEEcce-eEEEEEcHHHHHHHHhhCchHH
Q psy1122 78 IGILNLDGLNKRTADVRSVGY-AELFSLSREDVLSAMKDYPEAQ 120 (150)
Q Consensus 78 ~~~~~~~~~~~~~~~~~a~~~-~~~~~i~~~~~~~l~~~~~~~~ 120 (150)
-+++.. ..++++++|..+ ..++.++++.|..++-...++.
T Consensus 351 ~al~~e---dvRtAniia~~~gv~cl~lDresF~~liG~l~~l~ 391 (732)
T KOG0614|consen 351 RALLGE---DVRTANIIAQAPGVECLTLDRESFKKLIGDLEELK 391 (732)
T ss_pred HHhhcc---CccchhhhccCCCceEEEecHHHHHHhcccHHHhh
Confidence 999987 688999999988 9999999999998875554443
No 21
>KOG1113|consensus
Probab=99.35 E-value=1.3e-12 Score=92.94 Aligned_cols=112 Identities=20% Similarity=0.314 Sum_probs=100.4
Q ss_pred CCccccccccccccCCHHHHHHHHHhceEEeecCCceEEecCCccceeEEEeeceEEEEecCCcEEEEecCCCeeeeeee
Q psy1122 1 MGVVCVGVVTIFQECQPEFLHDLVLKMKAYIFTPGDLICRKGEVAREMFIIADGILEVISESGRVLTTMKAGDFFGEIGI 80 (150)
Q Consensus 1 M~~~~l~~~~~f~~l~~~~~~~l~~~~~~~~~~~g~~i~~~g~~~~~~y~i~~G~v~~~~~~g~~~~~~~~g~~~g~~~~ 80 (150)
|-..+|+.+|+++.+...+...+...+..+.|++|+.|..+|++++.+|+|.+|.|.+....+.+...+.+|++||+.++
T Consensus 237 My~~~l~s~pil~~l~k~er~kv~dal~~k~y~~G~~Vi~qg~~ge~f~~i~eGEvdv~~~~~~v~vkl~~~dyfge~al 316 (368)
T KOG1113|consen 237 MYEPFLESVPILESLEKLERAKVADALGTKSYKDGERVIVQGDQGEHFYIIEEGEVDVLKKRDGVEVKLKKGDYFGELAL 316 (368)
T ss_pred hhhhhhhcchhhHHHHHHHHHhhhcccceeeccCCceEEeccCCcceEEEecccccchhhccCCeEEEechhhhcchHHH
Confidence 34578999999999999999999999999999999999999999999999999999985322333339999999999999
Q ss_pred eccCCCCceeeEEEEcceeEEEEEcHHHHHHHHhh
Q psy1122 81 LNLDGLNKRTADVRSVGYAELFSLSREDVLSAMKD 115 (150)
Q Consensus 81 ~~~~~~~~~~~~~~a~~~~~~~~i~~~~~~~l~~~ 115 (150)
+.+ .++.++|.|..+..+..+++..|+.++--
T Consensus 317 ~~~---~pr~Atv~a~~~~kc~~~dk~~ferllgp 348 (368)
T KOG1113|consen 317 LKN---LPRAATVVAKGRLKCAKLDKPRFERLLGP 348 (368)
T ss_pred Hhh---chhhceeeccCCceeeeeChHHHHHHhhH
Confidence 988 89999999999999999999999987754
No 22
>KOG2968|consensus
Probab=99.20 E-value=2.7e-11 Score=94.93 Aligned_cols=113 Identities=22% Similarity=0.426 Sum_probs=97.7
Q ss_pred HHHHHHHHhceEEeecCCceEEecCCccceeEEEeeceEEEEe--cCCc--EEEEecCCCeeeeeeeeccCCCCceeeEE
Q psy1122 18 EFLHDLVLKMKAYIFTPGDLICRKGEVAREMFIIADGILEVIS--ESGR--VLTTMKAGDFFGEIGILNLDGLNKRTADV 93 (150)
Q Consensus 18 ~~~~~l~~~~~~~~~~~g~~i~~~g~~~~~~y~i~~G~v~~~~--~~g~--~~~~~~~g~~~g~~~~~~~~~~~~~~~~~ 93 (150)
..+..+.-.+.+....+|+.++++||.++++|+|++|.++-.. ..|+ ++..+++||.+|..+.+.. .++..++
T Consensus 499 p~lr~~D~AldWv~l~~g~alyrqgD~Sd~iyvVl~GRlRsv~~~~~~k~~i~~EygrGd~iG~~E~lt~---~~R~tTv 575 (1158)
T KOG2968|consen 499 PFLRKLDFALDWVRLEPGQALYRQGDSSDSIYVVLNGRLRSVIRQSGGKKEIVGEYGRGDLIGEVEMLTK---QPRATTV 575 (1158)
T ss_pred HHHhhhhhhcceEEeccccHHHhcCCccCcEEEEecCeehhhhhccCccchhhhhccCcceeehhHHhhc---CCccceE
Confidence 3455566678899999999999999999999999999999742 3344 8899999999999999987 7999999
Q ss_pred EEcceeEEEEEcHHHHHHHHhhCch-HHHHHHHHHHHHHHH
Q psy1122 94 RSVGYAELFSLSREDVLSAMKDYPE-AQEILQNLGRKRLME 133 (150)
Q Consensus 94 ~a~~~~~~~~i~~~~~~~l~~~~~~-~~~~~~~~~~~r~~~ 133 (150)
.|+-++++.+||..-|..+..++|+ ..++.+.++++-+..
T Consensus 576 ~AvRdSelariPe~l~~~ik~ryP~v~~rl~~ll~~~~~g~ 616 (1158)
T KOG2968|consen 576 MAVRDSELARIPEGLLNFIKLRYPQVVTRLIKLLAEKILGS 616 (1158)
T ss_pred EEEeehhhhhccHHHHHHHHHhccHHHHHHHHHHHHHhhhh
Confidence 9999999999999999999999999 777777777744333
No 23
>KOG0501|consensus
Probab=99.12 E-value=4.6e-11 Score=90.29 Aligned_cols=112 Identities=25% Similarity=0.453 Sum_probs=98.7
Q ss_pred ccccccccccCCHHHHHHHHHhceEEeecCCceEEecCCccceeEEEeeceEEEEecCCcEEEEecCCCeeeeeeeeccC
Q psy1122 5 CVGVVTIFQECQPEFLHDLVLKMKAYIFTPGDLICRKGEVAREMFIIADGILEVISESGRVLTTMKAGDFFGEIGILNLD 84 (150)
Q Consensus 5 ~l~~~~~f~~l~~~~~~~l~~~~~~~~~~~g~~i~~~g~~~~~~y~i~~G~v~~~~~~g~~~~~~~~g~~~g~~~~~~~~ 84 (150)
.....|.|.-.++..+..|.-.++..+.+||+.|+..|+..+.++||++|.++++. |+.++.++++||+||+..+-..
T Consensus 549 VFnEHpaFRLASDGCLRaLAm~f~~~H~APGDLlYHtGESvDaLcFvVsGSLEVIQ-DDEVVAILGKGDVFGD~FWK~~- 626 (971)
T KOG0501|consen 549 VFNEHPAFRLASDGCLRALAMEFQTNHCAPGDLLYHTGESVDALCFVVSGSLEVIQ-DDEVVAILGKGDVFGDEFWKEN- 626 (971)
T ss_pred hhccCcceeeccchhHHHHHHHHHhccCCCcceeeecCCccceEEEEEecceEEee-cCcEEEEeecCccchhHHhhhh-
Confidence 45566888889999999999999999999999999999999999999999999984 5568899999999998866544
Q ss_pred CCCceeeEEEEcceeEEEEEcHHHHHHHHhhCch
Q psy1122 85 GLNKRTADVRSVGYAELFSLSREDVLSAMKDYPE 118 (150)
Q Consensus 85 ~~~~~~~~~~a~~~~~~~~i~~~~~~~l~~~~~~ 118 (150)
....+.++++|++.|.+..|.++.+.+.++=|..
T Consensus 627 t~~qs~ANVRALTYcDLH~IKrd~Ll~VLdFYtA 660 (971)
T KOG0501|consen 627 TLGQSAANVRALTYCDLHMIKRDKLLKVLDFYTA 660 (971)
T ss_pred hhhhhhhhhhhhhhhhhhHHhHHHHHHHHHHHHH
Confidence 3367888999999999999999999998877665
No 24
>PF04831 Popeye: Popeye protein conserved region; InterPro: IPR006916 The Popeye (POP) family of proteins, is restricted to vertebrates and is preferentially expressed in developing and adult striated muscle. It is represented by a conserved region which includes three potential transmembrane domains []. The strong conservation of POP genes during evolution and their preferential expression in heart and skeletal muscle suggest that these novel proteins may have an important function in these tissues in vertebrates.; GO: 0016020 membrane
Probab=99.01 E-value=1.1e-07 Score=60.64 Aligned_cols=114 Identities=14% Similarity=0.179 Sum_probs=94.5
Q ss_pred cCCHHHHHHHHHh-ceEEeecCCceEEecCCcc-ceeEEEeeceEEEEecCCcEEEEecCCCeeeeeeeecc--CCCCce
Q psy1122 14 ECQPEFLHDLVLK-MKAYIFTPGDLICRKGEVA-REMFIIADGILEVISESGRVLTTMKAGDFFGEIGILNL--DGLNKR 89 (150)
Q Consensus 14 ~l~~~~~~~l~~~-~~~~~~~~g~~i~~~g~~~-~~~y~i~~G~v~~~~~~g~~~~~~~~g~~~g~~~~~~~--~~~~~~ 89 (150)
++|.....+++.+ .++...++|+..--+|... +.+-++++|.++++ .+|+.+..+.|.+++.--++... ......
T Consensus 14 ~Vs~~~Fk~iv~~~~~i~~L~~~~~YAvE~~T~~drLSlLLsGr~~Vs-~~g~fLH~I~p~qFlDSPEW~s~~~s~~~~F 92 (153)
T PF04831_consen 14 KVSRQQFKKIVGCCCEIRTLKKGETYAVEGKTPIDRLSLLLSGRMRVS-CDGRFLHYIYPYQFLDSPEWESLRPSEDDKF 92 (153)
T ss_pred CCCHHHHHHHHhhhceEEEecCCceeeecCCcccceEeEEEcCcEEEE-ECCEeeEeecccccccChhhhccccCCCCeE
Confidence 4678888899886 7899999999998777654 89999999999987 46888888899888765544432 222577
Q ss_pred eeEEEEcceeEEEEEcHHHHHHHHhhCchHHHHHHHHHH
Q psy1122 90 TADVRSVGYAELFSLSREDVLSAMKDYPEAQEILQNLGR 128 (150)
Q Consensus 90 ~~~~~a~~~~~~~~i~~~~~~~l~~~~~~~~~~~~~~~~ 128 (150)
.+++.|.++|..+.++++.+..++.++|-+...+..+..
T Consensus 93 QVTitA~~~Cryl~W~R~kL~~~l~~~~~L~~vF~~liG 131 (153)
T PF04831_consen 93 QVTITAEEDCRYLCWPREKLYLLLAKDPFLAAVFSNLIG 131 (153)
T ss_pred EEEEEEcCCcEEEEEEHHHHHHHHhhCHHHHHHHHHHHH
Confidence 889999999999999999999999999998888777665
No 25
>PRK11832 putative DNA-binding transcriptional regulator; Provisional
Probab=98.93 E-value=1.3e-07 Score=63.99 Aligned_cols=105 Identities=13% Similarity=0.150 Sum_probs=83.5
Q ss_pred HHHHHhceEEeecCCceE-EecCCccceeEEEeeceEEEEecCCcEEEEecCCCeeeeeeeeccCCCCceeeEEEEccee
Q psy1122 21 HDLVLKMKAYIFTPGDLI-CRKGEVAREMFIIADGILEVISESGRVLTTMKAGDFFGEIGILNLDGLNKRTADVRSVGYA 99 (150)
Q Consensus 21 ~~l~~~~~~~~~~~g~~i-~~~g~~~~~~y~i~~G~v~~~~~~g~~~~~~~~g~~~g~~~~~~~~~~~~~~~~~~a~~~~ 99 (150)
+.+....+++.+++|..+ +........++++++|.+.+...||-.+.+..+..+||-...+.+ ....+...|.++|
T Consensus 16 ~~L~~~g~~~~~~~~~~~i~~~~~~~~~~~ll~~G~vsirr~d~ll~~t~~aP~IlGl~~~~~~---~~~~~~l~ae~~c 92 (207)
T PRK11832 16 KCLSRYGTRFEFNNEKQVIFSSDVNNEDTFVILEGVISLRREENVLIGITQAPYIMGLADGLMK---NDIPYKLISEGNC 92 (207)
T ss_pred HHhhccCCeEecCCCcEEeccccCCCceEEEEEeceEEEEecCCeEEEeccCCeEeecccccCC---CCceEEEEEcCcc
Confidence 444556789999999997 655555578999999999997555557888888899998776665 4456789999999
Q ss_pred EEEEEcHHHHHHHHhhCchHHHHHHHHHH
Q psy1122 100 ELFSLSREDVLSAMKDYPEAQEILQNLGR 128 (150)
Q Consensus 100 ~~~~i~~~~~~~l~~~~~~~~~~~~~~~~ 128 (150)
+++.+|.++|.+++++..-|......++-
T Consensus 93 ~~~~i~~~~~~~iie~~~LW~~~~~~l~~ 121 (207)
T PRK11832 93 TGYHLPAKQTITLIEQNQLWRDAFYWLAW 121 (207)
T ss_pred EEEEeeHHHHHHHHHHhchHHHHHHHHHH
Confidence 99999999999999988766666555544
No 26
>KOG2968|consensus
Probab=98.68 E-value=1.1e-07 Score=75.43 Aligned_cols=118 Identities=19% Similarity=0.286 Sum_probs=90.4
Q ss_pred HHHHhceEEeecCCceEEecCCccceeEEEeeceEEEE--ecCCc--EEEEecCCCeeee-e---eeeccCCCCceeeEE
Q psy1122 22 DLVLKMKAYIFTPGDLICRKGEVAREMFIIADGILEVI--SESGR--VLTTMKAGDFFGE-I---GILNLDGLNKRTADV 93 (150)
Q Consensus 22 ~l~~~~~~~~~~~g~~i~~~g~~~~~~y~i~~G~v~~~--~~~g~--~~~~~~~g~~~g~-~---~~~~~~~~~~~~~~~ 93 (150)
++..+.+...+..|+++++.|++.+.+|++.+|.+.++ +.+|+ .+..+.||+.|.- . ..+...++......+
T Consensus 110 ~L~rh~~t~~l~~Gd~i~~~~~~dd~i~vv~sg~l~v~~~~~~g~~~llk~V~~G~~~tSllSiLd~l~~~ps~~~~i~a 189 (1158)
T KOG2968|consen 110 ELDRHIETLSLDAGDYIFKPGESDDSIYVVISGELTVHIRNGDGKEYLLKTVPPGGSFTSLLSILDSLPGFPSLSRTIAA 189 (1158)
T ss_pred eechhhhhhcccCCceeccCCCCCceEEEEeccceEEEecCCCCceeeEeeccCCCchHhHHHHHHhccCCCcccceeee
Confidence 34467778899999999999999999999999999986 45566 8999999965532 2 223332223345678
Q ss_pred EEcceeEEEEEcHHHHHHHHhhCch-HHHHHHHHHHHHHHHHHhhhhh
Q psy1122 94 RSVGYAELFSLSREDVLSAMKDYPE-AQEILQNLGRKRLMEAKNLNKK 140 (150)
Q Consensus 94 ~a~~~~~~~~i~~~~~~~l~~~~~~-~~~~~~~~~~~r~~~~~~~~~~ 140 (150)
.|.++|.+..+|...|.+....+|+ +.+.++.+.. |+....-....
T Consensus 190 kA~t~~tv~~~p~~sF~~~~~k~P~s~iriiQvvmT-RLq~vTf~t~~ 236 (1158)
T KOG2968|consen 190 KAATDCTVARIPYTSFRESFHKNPESSIRIIQVVMT-RLQRVTLTTLH 236 (1158)
T ss_pred eeecCceEEEeccchhhhhhccChHHHHHHHHHHHH-HHHHhhHHHHH
Confidence 9999999999999999999999999 6666666665 77665554443
No 27
>KOG3542|consensus
Probab=97.91 E-value=1.8e-05 Score=61.78 Aligned_cols=109 Identities=17% Similarity=0.280 Sum_probs=85.9
Q ss_pred ccccccccccccCCHHHHHHHHHhceEE-eecCCceEEecCCccceeEEEeeceEEEEecCCcEEEEecCCCeeeeeeee
Q psy1122 3 VVCVGVVTIFQECQPEFLHDLVLKMKAY-IFTPGDLICRKGEVAREMFIIADGILEVISESGRVLTTMKAGDFFGEIGIL 81 (150)
Q Consensus 3 ~~~l~~~~~f~~l~~~~~~~l~~~~~~~-~~~~g~~i~~~g~~~~~~y~i~~G~v~~~~~~g~~~~~~~~g~~~g~~~~~ 81 (150)
.+|..+++-|..++-....+|+..+... .=.+|.+|...|+..+..++|+.|.|.++.++|+. ..+.-|+-||.....
T Consensus 280 LeFMhqlpAFAnmtMSvrReLC~vMvFaVVe~AGtivL~dgeeLDSWsVIlNG~VEv~~PdGk~-e~l~mGnSFG~~PT~ 358 (1283)
T KOG3542|consen 280 LEFMHQLPAFANMTMSVRRELCLVMVFAVVEDAGTIVLADGEELDSWSVILNGCVEVVKPDGKR-EELKMGNSFGAEPTP 358 (1283)
T ss_pred HHHHHhchHhhcccHHHHHHHHHHHHHHHHhhcCeEEecCCcccceeEEEecceEEEecCCCce-EEeecccccCCCCCc
Confidence 3678899999999999999999866433 44789999999999999999999999999999984 345668889977666
Q ss_pred ccCCCCceeeEE-EEcceeEEEEEcHHHHHHHHhh
Q psy1122 82 NLDGLNKRTADV-RSVGYAELFSLSREDVLSAMKD 115 (150)
Q Consensus 82 ~~~~~~~~~~~~-~a~~~~~~~~i~~~~~~~l~~~ 115 (150)
.. +...--+ ..+.||+..+|...++-.++..
T Consensus 359 dk---qym~G~mRTkVDDCqFVciaqqDycrIln~ 390 (1283)
T KOG3542|consen 359 DK---QYMIGEMRTKVDDCQFVCIAQQDYCRILNT 390 (1283)
T ss_pred ch---hhhhhhhheecccceEEEeehhhHHHHHHH
Confidence 53 2222223 4577899999999888776643
No 28
>KOG3542|consensus
Probab=97.23 E-value=0.00049 Score=54.11 Aligned_cols=89 Identities=18% Similarity=0.185 Sum_probs=73.9
Q ss_pred ccccccccccCCHHHHHHHHHhceEEeecCCceEEecCCccceeEEEeeceEEEEecCCcEEEEecCCCeeeeeeeeccC
Q psy1122 5 CVGVVTIFQECQPEFLHDLVLKMKAYIFTPGDLICRKGEVAREMFIIADGILEVISESGRVLTTMKAGDFFGEIGILNLD 84 (150)
Q Consensus 5 ~l~~~~~f~~l~~~~~~~l~~~~~~~~~~~g~~i~~~g~~~~~~y~i~~G~v~~~~~~g~~~~~~~~g~~~g~~~~~~~~ 84 (150)
-|+++..|++|=...+..++.......++.+.++|..|+...+.|++++|.|-+-. ..+.|...||...-
T Consensus 38 ~lh~ld~~snl~~~~lk~l~~~aryer~~g~~ilf~~~~var~wyillsgsv~v~g------qi~mp~~~fgkr~g---- 107 (1283)
T KOG3542|consen 38 QLHQLDTFSNLFIGPLKALCKTARYERHPGQYILFRDGDVARSWYILLSGSVFVEG------QIYMPYGCFGKRTG---- 107 (1283)
T ss_pred HHhhhhhhhhhhhhhHHHhhhhhhhhcCCCceEEecccchhhheeeeeccceEeec------ceecCccccccccc----
Confidence 46788899999999999999999999999999999999999999999999998643 35666666775432
Q ss_pred CCCceeeEEEEcceeEEEEEc
Q psy1122 85 GLNKRTADVRSVGYAELFSLS 105 (150)
Q Consensus 85 ~~~~~~~~~~a~~~~~~~~i~ 105 (150)
+.+.+++..+++++.++++
T Consensus 108 --~~r~~nclllq~semivid 126 (1283)
T KOG3542|consen 108 --QNRTHNCLLLQESEMIVID 126 (1283)
T ss_pred --cccccceeeecccceeeee
Confidence 5677788888888888873
No 29
>PF07883 Cupin_2: Cupin domain; InterPro: IPR013096 This family represents the conserved barrel domain of the cupin superfamily [] (cupa is the Latin term for a small barrel). ; PDB: 2OPK_C 3BU7_B 2PHD_D 3NVC_A 3NKT_A 3NJZ_A 3NW4_A 3NST_A 3NL1_A 2H0V_A ....
Probab=96.45 E-value=0.058 Score=29.81 Aligned_cols=68 Identities=16% Similarity=0.194 Sum_probs=43.1
Q ss_pred EEeecCCceEEecCCccc-eeEEEeeceEEEEecCCcEEEEecCCCeeeeeeeeccCCCCceeeEEEEcceeEEEEE
Q psy1122 29 AYIFTPGDLICRKGEVAR-EMFIIADGILEVISESGRVLTTMKAGDFFGEIGILNLDGLNKRTADVRSVGYAELFSL 104 (150)
Q Consensus 29 ~~~~~~g~~i~~~g~~~~-~~y~i~~G~v~~~~~~g~~~~~~~~g~~~g~~~~~~~~~~~~~~~~~~a~~~~~~~~i 104 (150)
...++||+..-....+.. .+++|++|.+.+. .+|+ ...+.+|+.+ .+..+ .+..+....-+++.++.+
T Consensus 2 ~~~~~pG~~~~~h~H~~~~e~~~vl~G~~~~~-~~~~-~~~l~~Gd~~----~i~~~--~~H~~~n~~~~~~~~l~V 70 (71)
T PF07883_consen 2 LVTLPPGGSIPPHRHPGEDEFFYVLSGEGTLT-VDGE-RVELKPGDAI----YIPPG--VPHQVRNPGDEPARFLVV 70 (71)
T ss_dssp EEEEETTEEEEEEEESSEEEEEEEEESEEEEE-ETTE-EEEEETTEEE----EEETT--SEEEEEEESSSEEEEEEE
T ss_pred EEEECCCCCCCCEECCCCCEEEEEEECCEEEE-EccE-EeEccCCEEE----EECCC--CeEEEEECCCCCEEEEEE
Confidence 357888887655555555 8999999999987 3444 5688888844 23221 344444444455655543
No 30
>PF05899 Cupin_3: Protein of unknown function (DUF861); InterPro: IPR008579 The function of the proteins in this entry are unknown. They contain the conserved barrel domain of the 'cupin' superfamily and members are specific to plants and bacteria.; PDB: 1RC6_A 3MYX_A 1O5U_A 2K9Z_A 1LKN_A 3ES4_A 1SFN_B 3BCW_A.
Probab=95.74 E-value=0.062 Score=30.47 Aligned_cols=45 Identities=31% Similarity=0.502 Sum_probs=32.8
Q ss_pred EEeecCCceEEecCCccceeEEEeeceEEEEecCCcEEEEecCCCeee
Q psy1122 29 AYIFTPGDLICRKGEVAREMFIIADGILEVISESGRVLTTMKAGDFFG 76 (150)
Q Consensus 29 ~~~~~~g~~i~~~g~~~~~~y~i~~G~v~~~~~~g~~~~~~~~g~~~g 76 (150)
+..-.+|..-+... .+.+.+|++|.+.+.+++|.. ..+.+||.|-
T Consensus 11 ~w~~~pg~~~~~~~--~~E~~~vleG~v~it~~~G~~-~~~~aGD~~~ 55 (74)
T PF05899_consen 11 VWECTPGKFPWPYP--EDEFFYVLEGEVTITDEDGET-VTFKAGDAFF 55 (74)
T ss_dssp EEEEECEEEEEEES--SEEEEEEEEEEEEEEETTTEE-EEEETTEEEE
T ss_pred EEEECCceeEeeCC--CCEEEEEEEeEEEEEECCCCE-EEEcCCcEEE
Confidence 34456666444432 388889999999999887775 7889998653
No 31
>PRK13290 ectC L-ectoine synthase; Reviewed
Probab=94.44 E-value=0.72 Score=29.00 Aligned_cols=67 Identities=15% Similarity=0.168 Sum_probs=41.5
Q ss_pred EEeecCCceEEecCCccceeEEEeeceEEEE-ecCCcEEEEecCCCeeeeeeeeccCCCCceeeEEEEcceeEEEEE
Q psy1122 29 AYIFTPGDLICRKGEVAREMFIIADGILEVI-SESGRVLTTMKAGDFFGEIGILNLDGLNKRTADVRSVGYAELFSL 104 (150)
Q Consensus 29 ~~~~~~g~~i~~~g~~~~~~y~i~~G~v~~~-~~~g~~~~~~~~g~~~g~~~~~~~~~~~~~~~~~~a~~~~~~~~i 104 (150)
...+++|+..-..-.....+++|++|.+.+. -.+|+ ...+.+||.+- +. ...++.+.+.++++++++
T Consensus 39 ~~~l~pG~~~~~h~h~~~E~~yVL~G~~~~~~i~~g~-~~~L~aGD~i~----~~----~~~~H~~~N~e~~~~l~v 106 (125)
T PRK13290 39 ETTIYAGTETHLHYKNHLEAVYCIEGEGEVEDLATGE-VHPIRPGTMYA----LD----KHDRHYLRAGEDMRLVCV 106 (125)
T ss_pred EEEECCCCcccceeCCCEEEEEEEeCEEEEEEcCCCE-EEEeCCCeEEE----EC----CCCcEEEEcCCCEEEEEE
Confidence 4578888755322222247999999999987 33344 47899999654 22 223333344478777765
No 32
>COG3718 IolB Uncharacterized enzyme involved in inositol metabolism [Carbohydrate transport and metabolism]
Probab=91.65 E-value=1.1 Score=31.26 Aligned_cols=75 Identities=11% Similarity=0.115 Sum_probs=49.7
Q ss_pred ceEEeecCCceEEecCCccceeEEEeeceEEEEecCCcEEEEec-CCCeeee---eeeeccCCCCceeeEEEEcceeEEE
Q psy1122 27 MKAYIFTPGDLICRKGEVAREMFIIADGILEVISESGRVLTTMK-AGDFFGE---IGILNLDGLNKRTADVRSVGYAELF 102 (150)
Q Consensus 27 ~~~~~~~~g~~i~~~g~~~~~~y~i~~G~v~~~~~~g~~~~~~~-~g~~~g~---~~~~~~~~~~~~~~~~~a~~~~~~~ 102 (150)
+....+++|+..-.+....+.+.+++.|++.+.. .|..+..++ +-+.|-. .++.-. .-..+++.|.+++++.
T Consensus 31 F~~~~L~~Ges~~~~~~~~E~clV~v~Gk~~vs~-~g~~f~~iG~R~SvFe~~p~~~vYvp---~g~~~~vtA~t~~~vA 106 (270)
T COG3718 31 FRLLRLAAGESATEETGDRERCLVLVTGKATVSA-HGSTFGEIGTRMSVFERKPPDSVYVP---AGSAFSVTATTDLEVA 106 (270)
T ss_pred EEEEEccCCCcccccCCCceEEEEEEeeeEEEee-ccchHhhcccccccccCCCCCeEEec---CCceEEEEeecceEEE
Confidence 4567889999999888888999999999999863 333222222 2233322 223333 4567888899997765
Q ss_pred EEc
Q psy1122 103 SLS 105 (150)
Q Consensus 103 ~i~ 105 (150)
.-.
T Consensus 107 vC~ 109 (270)
T COG3718 107 VCS 109 (270)
T ss_pred EEe
Confidence 543
No 33
>TIGR03037 anthran_nbaC 3-hydroxyanthranilate 3,4-dioxygenase. Members of this protein family, from both bacteria and eukaryotes, are the enzyme 3-hydroxyanthranilate 3,4-dioxygenase. This enzyme acts on the tryptophan metabolite 3-hydroxyanthranilate and produces 2-amino-3-carboxymuconate semialdehyde, which can rearrange spontaneously to quinolinic acid and feed into nicotinamide biosynthesis, or undergo further enzymatic degradation.
Probab=90.68 E-value=2.9 Score=27.52 Aligned_cols=60 Identities=17% Similarity=0.251 Sum_probs=38.3
Q ss_pred EecCCccceeEEEeeceEEE-EecCCc-EEEEecCCCeeeeeeeeccCCCCceeeEEEEcceeEEEEEcHH
Q psy1122 39 CRKGEVAREMFIIADGILEV-ISESGR-VLTTMKAGDFFGEIGILNLDGLNKRTADVRSVGYAELFSLSRE 107 (150)
Q Consensus 39 ~~~g~~~~~~y~i~~G~v~~-~~~~g~-~~~~~~~g~~~g~~~~~~~~~~~~~~~~~~a~~~~~~~~i~~~ 107 (150)
+... +.+.++++++|.+.+ +..+|+ ....+.+|++|- +. ...+++-.+..+|.++.|...
T Consensus 43 ~H~~-~tdE~FyqleG~~~l~v~d~g~~~~v~L~eGd~fl----vP----~gvpHsP~r~~~t~~LvIE~~ 104 (159)
T TIGR03037 43 FHDD-PGEEFFYQLKGEMYLKVTEEGKREDVPIREGDIFL----LP----PHVPHSPQRPAGSIGLVIERK 104 (159)
T ss_pred cccC-CCceEEEEEcceEEEEEEcCCcEEEEEECCCCEEE----eC----CCCCcccccCCCcEEEEEEeC
Confidence 4443 378999999999998 334454 567899998664 22 122333344566777776554
No 34
>COG0662 {ManC} Mannose-6-phosphate isomerase [Carbohydrate transport and metabolism]
Probab=90.41 E-value=1.2 Score=27.89 Aligned_cols=48 Identities=21% Similarity=0.301 Sum_probs=34.7
Q ss_pred hceEEeecCCceE-EecCCccceeEEEeeceEEEEecCCcEEEEecCCCee
Q psy1122 26 KMKAYIFTPGDLI-CRKGEVAREMFIIADGILEVISESGRVLTTMKAGDFF 75 (150)
Q Consensus 26 ~~~~~~~~~g~~i-~~~g~~~~~~y~i~~G~v~~~~~~g~~~~~~~~g~~~ 75 (150)
.+....+++|+-+ ++.....+.+|+|++|...+...+. ...+++|+.+
T Consensus 37 ~~~~~~v~pg~~~~~~~H~~~dE~~~Vl~G~g~v~~~~~--~~~v~~gd~~ 85 (127)
T COG0662 37 SIARILVKPGEEISLHHHHHRDEHWYVLEGTGKVTIGGE--EVEVKAGDSV 85 (127)
T ss_pred EEEEEEECCCcccCcccccCcceEEEEEeeEEEEEECCE--EEEecCCCEE
Confidence 4456778888885 6666668999999999999975433 2466777654
No 35
>smart00835 Cupin_1 Cupin. This family represents the conserved barrel domain of the 'cupin' superfamily ('cupa' is the Latin term for a small barrel). This family contains 11S and 7S plant seed storage proteins, and germins. Plant seed storage proteins provide the major nitrogen source for the developing plant.
Probab=89.95 E-value=3.6 Score=26.34 Aligned_cols=50 Identities=20% Similarity=0.286 Sum_probs=35.0
Q ss_pred ceEEeecCCceEEecCCc-cceeEEEeeceEEEE--ecCCc--EEEEecCCCeee
Q psy1122 27 MKAYIFTPGDLICRKGEV-AREMFIIADGILEVI--SESGR--VLTTMKAGDFFG 76 (150)
Q Consensus 27 ~~~~~~~~g~~i~~~g~~-~~~~y~i~~G~v~~~--~~~g~--~~~~~~~g~~~g 76 (150)
+....+++|...-..-.+ ...+++|++|...+. ..+|+ ....+.+||.+-
T Consensus 32 ~~~~~i~pg~~~~~h~H~~~~e~~~Vl~G~~~~~~~~~~~~~~~~~~l~~GD~~~ 86 (146)
T smart00835 32 AARVNLEPGGMLPPHYHPRATELLYVVRGEGRVGVVDPNGNKVYDARLREGDVFV 86 (146)
T ss_pred EEEEEecCCcCcCCeeCCCCCEEEEEEeCeEEEEEEeCCCCeEEEEEecCCCEEE
Confidence 345677888876544433 568999999999984 23333 578899999654
No 36
>COG1917 Uncharacterized conserved protein, contains double-stranded beta-helix domain [Function unknown]
Probab=89.74 E-value=1.4 Score=27.69 Aligned_cols=48 Identities=25% Similarity=0.331 Sum_probs=37.4
Q ss_pred ceEEeecCCceEEecCCc-cceeEEEeeceEEEEecCCcEEEEecCCCeee
Q psy1122 27 MKAYIFTPGDLICRKGEV-AREMFIIADGILEVISESGRVLTTMKAGDFFG 76 (150)
Q Consensus 27 ~~~~~~~~g~~i~~~g~~-~~~~y~i~~G~v~~~~~~g~~~~~~~~g~~~g 76 (150)
.....|++|+.+-.-..+ .....+|++|.+.+.-+ |. ...+.+||++.
T Consensus 45 ~~~v~~~~G~~~~~H~hp~~~~~~~Vl~G~~~~~~~-g~-~~~l~~Gd~i~ 93 (131)
T COG1917 45 VVLVTFEPGAVIPWHTHPLGEQTIYVLEGEGTVQLE-GE-KKELKAGDVII 93 (131)
T ss_pred EEEEEECCCcccccccCCCcceEEEEEecEEEEEec-CC-ceEecCCCEEE
Confidence 346789999998777776 67899999999998654 43 46889999764
No 37
>COG3450 Predicted enzyme of the cupin superfamily [General function prediction only]
Probab=88.81 E-value=3.2 Score=25.77 Aligned_cols=45 Identities=31% Similarity=0.485 Sum_probs=31.8
Q ss_pred EEeecCCceEEecCCccceeEEEeeceEEEEecCCcEEEEecCCCeee
Q psy1122 29 AYIFTPGDLICRKGEVAREMFIIADGILEVISESGRVLTTMKAGDFFG 76 (150)
Q Consensus 29 ~~~~~~g~~i~~~g~~~~~~y~i~~G~v~~~~~~g~~~~~~~~g~~~g 76 (150)
+..-.+|..=...++ +.++.+++|.+.+..++|+. ..+.+||.|-
T Consensus 49 iWe~TpG~~r~~y~~--~E~chil~G~v~~T~d~Ge~-v~~~aGD~~~ 93 (116)
T COG3450 49 IWECTPGKFRVTYDE--DEFCHILEGRVEVTPDGGEP-VEVRAGDSFV 93 (116)
T ss_pred EEEecCccceEEccc--ceEEEEEeeEEEEECCCCeE-EEEcCCCEEE
Confidence 444556665555555 67788899999998777763 5788998663
No 38
>TIGR03404 bicupin_oxalic bicupin, oxalate decarboxylase family. Members of this protein family are defined as bicupins as they have two copies of the cupin domain (pfam00190). Two different known activities for members of this family are oxalate decarboxylase (EC 4.1.1.2) and oxalate oxidase (EC 1.2.3.4), although the latter activity has more often been found in distantly related monocupin (germin) proteins.
Probab=88.48 E-value=6.5 Score=29.58 Aligned_cols=49 Identities=18% Similarity=0.144 Sum_probs=35.2
Q ss_pred ceEEeecCCceEEecCCc-cceeEEEeeceEEEE--ecCCc-EEEEecCCCee
Q psy1122 27 MKAYIFTPGDLICRKGEV-AREMFIIADGILEVI--SESGR-VLTTMKAGDFF 75 (150)
Q Consensus 27 ~~~~~~~~g~~i~~~g~~-~~~~y~i~~G~v~~~--~~~g~-~~~~~~~g~~~ 75 (150)
+....+++|...-.--.+ .+.++||++|.+++. +.+|+ ....+.+||++
T Consensus 247 ~~~~~l~PG~~~~~H~H~~~~E~~yvl~G~~~~~v~d~~g~~~~~~l~~GD~~ 299 (367)
T TIGR03404 247 AAIVTVEPGAMRELHWHPNADEWQYFIQGQARMTVFAAGGNARTFDYQAGDVG 299 (367)
T ss_pred EEEEEECCCCccCCeeCcCCCeEEEEEEEEEEEEEEecCCcEEEEEECCCCEE
Confidence 456677888876433333 578999999999984 45555 56789999954
No 39
>PRK09943 DNA-binding transcriptional repressor PuuR; Provisional
Probab=87.01 E-value=5.5 Score=26.63 Aligned_cols=66 Identities=18% Similarity=0.282 Sum_probs=38.3
Q ss_pred eecCCceEE-ecCCccceeEEEeeceEEEEecCCcEEEEecCCCeeeeeeeeccCCCCceeeEEEEcceeEEEEE
Q psy1122 31 IFTPGDLIC-RKGEVAREMFIIADGILEVISESGRVLTTMKAGDFFGEIGILNLDGLNKRTADVRSVGYAELFSL 104 (150)
Q Consensus 31 ~~~~g~~i~-~~g~~~~~~y~i~~G~v~~~~~~g~~~~~~~~g~~~g~~~~~~~~~~~~~~~~~~a~~~~~~~~i 104 (150)
.+++|...- ....+...+++|++|.+.+.- +|+ ...+.+||.+- +..+ .++.+....-+++.++++
T Consensus 113 ~~~pg~~~~~~~~h~~~E~~~Vl~G~~~~~~-~~~-~~~l~~Gd~~~----~~~~--~~H~~~n~~~~~~~~l~~ 179 (185)
T PRK09943 113 TYQPGTTTGERIKHQGEEIGTVLEGEIVLTI-NGQ-DYHLVAGQSYA----INTG--IPHSFSNTSAGICRIISA 179 (185)
T ss_pred EccCCCCcccccccCCcEEEEEEEeEEEEEE-CCE-EEEecCCCEEE----EcCC--CCeeeeCCCCCCeEEEEE
Confidence 556665322 112344689999999999854 333 36899999653 2211 344444444555666665
No 40
>TIGR03404 bicupin_oxalic bicupin, oxalate decarboxylase family. Members of this protein family are defined as bicupins as they have two copies of the cupin domain (pfam00190). Two different known activities for members of this family are oxalate decarboxylase (EC 4.1.1.2) and oxalate oxidase (EC 1.2.3.4), although the latter activity has more often been found in distantly related monocupin (germin) proteins.
Probab=86.31 E-value=3.1 Score=31.29 Aligned_cols=50 Identities=20% Similarity=0.244 Sum_probs=34.6
Q ss_pred ceEEeecCCceEEecCCccceeEEEeeceEEEE--ecCCc-EEEEecCCCeee
Q psy1122 27 MKAYIFTPGDLICRKGEVAREMFIIADGILEVI--SESGR-VLTTMKAGDFFG 76 (150)
Q Consensus 27 ~~~~~~~~g~~i~~~g~~~~~~y~i~~G~v~~~--~~~g~-~~~~~~~g~~~g 76 (150)
+....+.+|...-.--.....+.++++|.+++. +.+|+ ....+.+||++-
T Consensus 69 ~~~~~l~pG~~~~~HwH~~~E~~yVl~G~~~v~~~d~~g~~~~~~L~~GD~~~ 121 (367)
T TIGR03404 69 GVNMRLEPGAIRELHWHKEAEWAYVLYGSCRITAVDENGRNYIDDVGAGDLWY 121 (367)
T ss_pred ceEEEEcCCCCCCcccCCCceEEEEEeeEEEEEEEcCCCcEEEeEECCCCEEE
Confidence 345677888766432334567999999999984 44566 445899999763
No 41
>PRK04190 glucose-6-phosphate isomerase; Provisional
Probab=85.97 E-value=6.4 Score=26.76 Aligned_cols=47 Identities=15% Similarity=0.186 Sum_probs=31.7
Q ss_pred EEeecCCceE---------EecCCccceeEEEeeceEEEE--ecCCc-EEEEecCCCee
Q psy1122 29 AYIFTPGDLI---------CRKGEVAREMFIIADGILEVI--SESGR-VLTTMKAGDFF 75 (150)
Q Consensus 29 ~~~~~~g~~i---------~~~g~~~~~~y~i~~G~v~~~--~~~g~-~~~~~~~g~~~ 75 (150)
...+++|.+. +........+|++++|...+. +.+|. ....+.+|+++
T Consensus 72 ~t~l~PG~~g~e~~mt~gH~H~~~~~~EiyyvlsG~g~~~l~~~~G~~~~~~v~pGd~v 130 (191)
T PRK04190 72 TTRLYPGKVGDEYFMTKGHFHAKADRAEIYYGLKGKGLMLLQDPEGEARWIEMEPGTVV 130 (191)
T ss_pred EEEECCCcEecccccCCCeEcCCCCCCEEEEEEeCEEEEEEecCCCcEEEEEECCCCEE
Confidence 4466777643 223333459999999999884 34555 56788999865
No 42
>PRK13264 3-hydroxyanthranilate 3,4-dioxygenase; Provisional
Probab=85.08 E-value=5.4 Score=26.71 Aligned_cols=57 Identities=16% Similarity=0.261 Sum_probs=36.3
Q ss_pred CccceeEEEeeceEEE-EecCCc-EEEEecCCCeeeeeeeeccCCCCceeeEEEEcceeEEEEEcHH
Q psy1122 43 EVAREMFIIADGILEV-ISESGR-VLTTMKAGDFFGEIGILNLDGLNKRTADVRSVGYAELFSLSRE 107 (150)
Q Consensus 43 ~~~~~~y~i~~G~v~~-~~~~g~-~~~~~~~g~~~g~~~~~~~~~~~~~~~~~~a~~~~~~~~i~~~ 107 (150)
++.+.++++++|.+.+ +..+|+ ....+.+|++|- +.. ..+++-.+..+|..+.|...
T Consensus 52 ~~tdE~FyqleG~~~l~v~d~g~~~~v~L~eGd~fl----lP~----gvpHsP~r~~~tv~LviE~~ 110 (177)
T PRK13264 52 DPGEEFFYQLEGDMYLKVQEDGKRRDVPIREGEMFL----LPP----HVPHSPQREAGSIGLVIERK 110 (177)
T ss_pred CCCceEEEEECCeEEEEEEcCCceeeEEECCCCEEE----eCC----CCCcCCccCCCeEEEEEEeC
Confidence 4568899999999998 334555 567899998664 221 22223334566777776443
No 43
>PF00190 Cupin_1: Cupin; InterPro: IPR006045 This family represents the conserved barrel domain of the 'cupin' superfamily ('cupa' is the Latin term for a small barrel). This family contains 11S and 7S plant seed storage proteins, and germins. Plant seed storage proteins provide the major nitrogen source for the developing plant. ; GO: 0045735 nutrient reservoir activity; PDB: 2E9Q_A 2EVX_A 1OD5_A 1UCX_A 1UD1_C 1FXZ_C 3KGL_C 3KSC_D 1UIJ_F 1IPK_B ....
Probab=84.37 E-value=6 Score=25.26 Aligned_cols=50 Identities=26% Similarity=0.426 Sum_probs=36.1
Q ss_pred ceEEeecCCceEEecCCccceeEEEeeceEEE--EecCC------cEEE--EecCCCeee
Q psy1122 27 MKAYIFTPGDLICRKGEVAREMFIIADGILEV--ISESG------RVLT--TMKAGDFFG 76 (150)
Q Consensus 27 ~~~~~~~~g~~i~~~g~~~~~~y~i~~G~v~~--~~~~g------~~~~--~~~~g~~~g 76 (150)
+....+++|..+...-.....+++|++|...+ ..+++ .... .+.+||+|-
T Consensus 36 ~~~~~i~pg~~~~Ph~h~a~~i~~V~~G~~~~~~v~~~~~~~~~~~~~~~v~l~~Gdv~~ 95 (144)
T PF00190_consen 36 VRRVLIEPGGLRAPHYHNADEIVYVIEGRGRVGVVGPGGPQEEFRDFSQKVRLKAGDVFV 95 (144)
T ss_dssp EEEEEEETTEEEEEEEESSEEEEEEEESEEEEEEEETTCSSSEEEEEEEEEEEETTEEEE
T ss_pred EEeeehhcCCccceeEeeeeEEeeeeccceEEEEEecCCccccceeeeceeeeeccccee
Confidence 34556689888876655789999999999996 45554 1333 499999773
No 44
>PRK11171 hypothetical protein; Provisional
Probab=84.17 E-value=11 Score=27.03 Aligned_cols=69 Identities=13% Similarity=0.148 Sum_probs=41.0
Q ss_pred eEEeecCCceEEecCCc--cceeEEEeeceEEEEecCCcEEEEecCCCeeeeeeeeccCCCCceeeEEEEcceeEEEEE
Q psy1122 28 KAYIFTPGDLICRKGEV--AREMFIIADGILEVISESGRVLTTMKAGDFFGEIGILNLDGLNKRTADVRSVGYAELFSL 104 (150)
Q Consensus 28 ~~~~~~~g~~i~~~g~~--~~~~y~i~~G~v~~~~~~g~~~~~~~~g~~~g~~~~~~~~~~~~~~~~~~a~~~~~~~~i 104 (150)
....+++|...-..... ...+++|++|.+.+.- +|+ ...+.+||.+- +..+ .++.+....-+++.++++
T Consensus 64 ~~~~l~PG~~~~~~~h~~~~eE~~~VlsG~l~v~~-~g~-~~~L~~GDsi~----~p~~--~~H~~~N~g~~~a~~l~v 134 (266)
T PRK11171 64 YLVEVEPGGGSDQPEPDEGAETFLFVVEGEITLTL-EGK-THALSEGGYAY----LPPG--SDWTLRNAGAEDARFHWI 134 (266)
T ss_pred EEEEECCCCcCCCCCCCCCceEEEEEEeCEEEEEE-CCE-EEEECCCCEEE----ECCC--CCEEEEECCCCCEEEEEE
Confidence 45577887654333322 3688999999999864 333 47889998553 2211 333333334455666665
No 45
>PF12973 Cupin_7: ChrR Cupin-like domain; PDB: 3O14_B 2Z2S_F 2Q1Z_B 3EBR_A.
Probab=81.90 E-value=8 Score=22.54 Aligned_cols=64 Identities=20% Similarity=0.308 Sum_probs=41.8
Q ss_pred hceEEeecCCceEEecCCccceeEEEeeceEEEEecCCcEEEEecCCCeeeeeeeeccCCCCceeeEEEEcceeEEEE
Q psy1122 26 KMKAYIFTPGDLICRKGEVAREMFIIADGILEVISESGRVLTTMKAGDFFGEIGILNLDGLNKRTADVRSVGYAELFS 103 (150)
Q Consensus 26 ~~~~~~~~~g~~i~~~g~~~~~~y~i~~G~v~~~~~~g~~~~~~~~g~~~g~~~~~~~~~~~~~~~~~~a~~~~~~~~ 103 (150)
.....++++|..+=..........||++|.+. +.+| .+.+|+++- .. ....++..+-++|.++.
T Consensus 25 ~~~L~r~~pG~~~p~H~H~g~ee~~VLeG~~~--d~~~----~~~~G~~~~----~p----~g~~h~~~s~~gc~~~v 88 (91)
T PF12973_consen 25 RVSLLRLEPGASLPRHRHPGGEEILVLEGELS--DGDG----RYGAGDWLR----LP----PGSSHTPRSDEGCLILV 88 (91)
T ss_dssp EEEEEEE-TTEEEEEEEESS-EEEEEEECEEE--ETTC----EEETTEEEE----E-----TTEEEEEEESSCEEEEE
T ss_pred EEEEEEECCCCCcCccCCCCcEEEEEEEEEEE--ECCc----cCCCCeEEE----eC----CCCccccCcCCCEEEEE
Confidence 34577889998877666777888899999998 3444 347777542 11 33455666777788765
No 46
>PRK11171 hypothetical protein; Provisional
Probab=80.96 E-value=18 Score=25.91 Aligned_cols=49 Identities=20% Similarity=0.233 Sum_probs=37.2
Q ss_pred hceEEeecCCceEEe-cCCccceeEEEeeceEEEEecCCcEEEEecCCCeee
Q psy1122 26 KMKAYIFTPGDLICR-KGEVAREMFIIADGILEVISESGRVLTTMKAGDFFG 76 (150)
Q Consensus 26 ~~~~~~~~~g~~i~~-~g~~~~~~y~i~~G~v~~~~~~g~~~~~~~~g~~~g 76 (150)
.+....+++|..+-. ........++|++|...+... | ....+.+||++-
T Consensus 185 ~~~~~~l~PG~~~~~~~~~~~ee~i~Vl~G~~~~~~~-~-~~~~l~~GD~i~ 234 (266)
T PRK11171 185 HVNIVTFEPGASIPFVETHVMEHGLYVLEGKGVYRLN-N-DWVEVEAGDFIW 234 (266)
T ss_pred EEEEEEECCCCEEccCcCCCceEEEEEEeCEEEEEEC-C-EEEEeCCCCEEE
Confidence 445678999988866 466678999999999998642 2 356889999654
No 47
>TIGR02451 anti_sig_ChrR anti-sigma factor, putative, ChrR family. The member of this family from Rhodobacter sphaeroides has been shown both to form a complex with sigma(E) and to negatively regulate tetrapyrrole biosynthesis. This protein likely contains (at least) two distinct functional domains; several smaller homologs (excluded by the model) show homology only to the C-terminal, including a motif PxHxHxGxE.
Probab=78.99 E-value=9.1 Score=26.46 Aligned_cols=65 Identities=18% Similarity=0.304 Sum_probs=39.9
Q ss_pred ceEEeecCCceEEecCCccceeEEEeeceEEEEecCCcEEEEecCCCeeeeeeeeccCCCCceeeEEEEc--ceeEEEEE
Q psy1122 27 MKAYIFTPGDLICRKGEVAREMFIIADGILEVISESGRVLTTMKAGDFFGEIGILNLDGLNKRTADVRSV--GYAELFSL 104 (150)
Q Consensus 27 ~~~~~~~~g~~i~~~g~~~~~~y~i~~G~v~~~~~~g~~~~~~~~g~~~g~~~~~~~~~~~~~~~~~~a~--~~~~~~~i 104 (150)
....++++|..+-........+.+|++|.+. +++| .+.+|+|+- .. ....++..+. ++|.++.+
T Consensus 129 v~Ll~i~pG~~~p~H~H~G~E~tlVLeG~f~--de~g----~y~~Gd~i~----~p----~~~~H~p~a~~~~~Cicl~v 194 (215)
T TIGR02451 129 VRLLYIEAGQSIPQHTHKGFELTLVLHGAFS--DETG----VYGVGDFEE----AD----GSVQHQPRTVSGGDCLCLAV 194 (215)
T ss_pred EEEEEECCCCccCCCcCCCcEEEEEEEEEEE--cCCC----ccCCCeEEE----CC----CCCCcCcccCCCCCeEEEEE
Confidence 3466777777777777777788889999864 3343 467777543 11 2233444454 44777766
Q ss_pred c
Q psy1122 105 S 105 (150)
Q Consensus 105 ~ 105 (150)
-
T Consensus 195 ~ 195 (215)
T TIGR02451 195 L 195 (215)
T ss_pred e
Confidence 3
No 48
>PF04962 KduI: KduI/IolB family; InterPro: IPR021120 The KduI/IolB family of enzymes includes 5-keto 4-deoxyuronate isomerase (KduI) and 5-deoxy-glucuronate isomerase (IolB). KduI is involved in pectin degradation by free-living soil bacteria that use pectin as a carbon source, breaking it down to 2-keto-3-deoxygluconate, which can ultimately be converted to pyruvate. KduI catalyses the fourth step in pectin degradation, namely the interconversion of 5-keto-4-deoxyuronate and 2,5-diketo-3-dexoygluconate []. KduI has a TIM-barrel fold []. IolB is one of several bacterial proteins encoded by the inositol operon (iolABCDEFGHIJ) in Bacillus subtilis that are involved in myo-inositol catabolism. The enzyme is responsible for isomerization of 5-deoxy-D-glucuronic acid by IolB to produce 2-deoxy-5-keto-D-gluconic acid []. IolBs possess a cupin-like structure.; GO: 0016861 intramolecular oxidoreductase activity, interconverting aldoses and ketoses, 0008152 metabolic process; PDB: 1YWK_B 2QJV_B 1X8M_A 1XRU_A.
Probab=77.10 E-value=14 Score=26.37 Aligned_cols=75 Identities=12% Similarity=0.009 Sum_probs=48.5
Q ss_pred ceEEeecCCceEEecCCccceeEEEeeceEEEEecCC-cEEEEecCCCeeeee--eeeccCCCCceeeEEEEcceeEEEE
Q psy1122 27 MKAYIFTPGDLICRKGEVAREMFIIADGILEVISESG-RVLTTMKAGDFFGEI--GILNLDGLNKRTADVRSVGYAELFS 103 (150)
Q Consensus 27 ~~~~~~~~g~~i~~~g~~~~~~y~i~~G~v~~~~~~g-~~~~~~~~g~~~g~~--~~~~~~~~~~~~~~~~a~~~~~~~~ 103 (150)
+....+++|+.+-..-+..+...++++|.+.+.- +| .......+.++|... .+.-+ ......+.|.++++++.
T Consensus 29 ~~~l~L~~g~~~~~~~~~~E~~vv~l~G~~~v~~-~g~~~~~l~~R~~vF~~~~d~lYvp---~g~~~~i~a~~~ae~~~ 104 (261)
T PF04962_consen 29 FGVLRLEAGESLEFELERRELGVVNLGGKATVTV-DGEEFYELGGRESVFDGPPDALYVP---RGTKVVIFASTDAEFAV 104 (261)
T ss_dssp CCCEEEECCHCCCCCCCSEEEEEEEESSSEEEEE-TTEEEEEE-TTSSGGGS--EEEEE----TT--EEEEESSTEEEEE
T ss_pred eEEEEecCCCEEeccCCCcEEEEEEeCCEEEEEe-CCceEEEecccccccCCCCcEEEeC---CCCeEEEEEcCCCEEEE
Confidence 3456778888887666667788899999999864 55 455566666777332 22222 44567888888888887
Q ss_pred Ec
Q psy1122 104 LS 105 (150)
Q Consensus 104 i~ 105 (150)
..
T Consensus 105 ~s 106 (261)
T PF04962_consen 105 CS 106 (261)
T ss_dssp EE
T ss_pred Ec
Confidence 64
No 49
>COG3837 Uncharacterized conserved protein, contains double-stranded beta-helix domain [Function unknown]
Probab=76.84 E-value=8.8 Score=25.17 Aligned_cols=40 Identities=20% Similarity=0.194 Sum_probs=29.1
Q ss_pred EecCCccceeEEEeeceEEEEecCCcEEEEecCCCeeeeeee
Q psy1122 39 CRKGEVAREMFIIADGILEVISESGRVLTTMKAGDFFGEIGI 80 (150)
Q Consensus 39 ~~~g~~~~~~y~i~~G~v~~~~~~g~~~~~~~~g~~~g~~~~ 80 (150)
+.....-+.+.+|++|...+...+|. ..+.|||+.|.-+-
T Consensus 58 ~H~Hs~edEfv~ILeGE~~l~~d~~e--~~lrpGD~~gFpAG 97 (161)
T COG3837 58 RHWHSAEDEFVYILEGEGTLREDGGE--TRLRPGDSAGFPAG 97 (161)
T ss_pred ccccccCceEEEEEcCceEEEECCee--EEecCCceeeccCC
Confidence 34444567899999999998755554 57899998875433
No 50
>TIGR03214 ura-cupin putative allantoin catabolism protein. This model represents a protein containing a tandem arrangement of cupin domains (N-terminal part of pfam07883 and C-terminal more distantly related to pfam00190). This protein is found in the vicinity of genes involved in the catabolism of allantoin, a breakdown product of urate and sometimes of urate iteslf. The distribution of pathway components in the genomes in which this family is observed suggests that the function is linked to the allantoate catabolism to glyoxylate pathway (GenProp0686) since it is sometimes found in genomes lacking any elements of the xanthine-to-allantoin pathways (e.g. in Enterococcus faecalis).
Probab=73.52 E-value=30 Score=24.66 Aligned_cols=46 Identities=15% Similarity=0.277 Sum_probs=29.0
Q ss_pred eEEeecCCceEEe-cCCc-cceeEEEeeceEEEEecCCcEEEEecCCCee
Q psy1122 28 KAYIFTPGDLICR-KGEV-AREMFIIADGILEVISESGRVLTTMKAGDFF 75 (150)
Q Consensus 28 ~~~~~~~g~~i~~-~g~~-~~~~y~i~~G~v~~~~~~g~~~~~~~~g~~~ 75 (150)
....+++|.-.-. .... .+.+.||++|.+.+.. +|+ ...+.+||.+
T Consensus 61 ~~v~l~pgg~~~~~~~~~g~ee~iyVl~G~l~v~~-~g~-~~~L~~Gd~~ 108 (260)
T TIGR03214 61 YIVEVHPGGGNTTGFGGEGIETFLFVISGEVNVTA-EGE-THELREGGYA 108 (260)
T ss_pred EEEEECCCCcCCCCCCCCceEEEEEEEeCEEEEEE-CCE-EEEECCCCEE
Confidence 3556666543211 1122 3689999999999863 333 3588999865
No 51
>TIGR01479 GMP_PMI mannose-1-phosphate guanylyltransferase/mannose-6-phosphate isomerase. This enzyme is known to be bifunctional, as both mannose-6-phosphate isomerase (EC 5.3.1.8) (PMI) and mannose-1-phosphate guanylyltransferase (EC 2.7.7.22) in Pseudomonas aeruginosa, Xanthomonas campestris, and Gluconacetobacter xylinus. The literature on the enzyme from E. coli attributes mannose-6-phosphate isomerase activity to an adjacent gene, but the present sequence has not been shown to lack the activity. The PMI domain is C-terminal.
Probab=71.58 E-value=32 Score=26.83 Aligned_cols=46 Identities=13% Similarity=0.188 Sum_probs=29.5
Q ss_pred EEeecCCceEE-ecCCccceeEEEeeceEEEEecCCcEEEEecCCCeee
Q psy1122 29 AYIFTPGDLIC-RKGEVAREMFIIADGILEVISESGRVLTTMKAGDFFG 76 (150)
Q Consensus 29 ~~~~~~g~~i~-~~g~~~~~~y~i~~G~v~~~~~~g~~~~~~~~g~~~g 76 (150)
...++||.-+- +.....+..++|++|.+.+.- +|+ ...+.+||.+-
T Consensus 380 ~~~i~PG~~~~~h~H~~~~E~~~Vl~G~~~v~~-dg~-~~~l~~GDsi~ 426 (468)
T TIGR01479 380 RITVKPGEKLSLQMHHHRAEHWIVVSGTARVTI-GDE-TLLLTENESTY 426 (468)
T ss_pred EEEECCCCccCccccCCCceEEEEEeeEEEEEE-CCE-EEEecCCCEEE
Confidence 34667776431 223334567799999999864 333 35889998653
No 52
>COG2140 Thermophilic glucose-6-phosphate isomerase and related metalloenzymes [Carbohydrate transport and metabolism / General function prediction only]
Probab=68.33 E-value=15 Score=25.30 Aligned_cols=29 Identities=21% Similarity=0.494 Sum_probs=23.4
Q ss_pred eeEEEeeceEEEE--ecCCc-EEEEecCCCee
Q psy1122 47 EMFIIADGILEVI--SESGR-VLTTMKAGDFF 75 (150)
Q Consensus 47 ~~y~i~~G~v~~~--~~~g~-~~~~~~~g~~~ 75 (150)
.+|++++|..+.. +++|+ ++....+|+.+
T Consensus 105 E~y~vi~G~g~m~v~~~~G~~~v~~~~~Gd~i 136 (209)
T COG2140 105 EIYYVLKGEGRMLVQKPEGEARVIAVRAGDVI 136 (209)
T ss_pred cEEEEEeccEEEEEEcCCCcEEEEEecCCcEE
Confidence 5999999999984 56677 77888888865
No 53
>KOG2378|consensus
Probab=67.63 E-value=6 Score=30.56 Aligned_cols=45 Identities=20% Similarity=0.349 Sum_probs=37.7
Q ss_pred CCCeeeeeeeeccCCCCceeeEEEEcce-eEEEEEcHHHHHHHHhhCch
Q psy1122 71 AGDFFGEIGILNLDGLNKRTADVRSVGY-AELFSLSREDVLSAMKDYPE 118 (150)
Q Consensus 71 ~g~~~g~~~~~~~~~~~~~~~~~~a~~~-~~~~~i~~~~~~~l~~~~~~ 118 (150)
.||-||.+++... .|+.+++..-++ |.++.+++.+|..++++...
T Consensus 1 eGddfgklalvnd---~praativl~ed~~~fl~vDk~~Fn~I~~~vEa 46 (573)
T KOG2378|consen 1 EGDDFGKLALVND---APRAATIVLREDNCHFLRVDKHDFNRILHDVEA 46 (573)
T ss_pred CCcccchhccccc---cccccceeeecCCCcceeecHHHHHHHHHhhhh
Confidence 4888999998877 788888877776 99999999999998876543
No 54
>TIGR03214 ura-cupin putative allantoin catabolism protein. This model represents a protein containing a tandem arrangement of cupin domains (N-terminal part of pfam07883 and C-terminal more distantly related to pfam00190). This protein is found in the vicinity of genes involved in the catabolism of allantoin, a breakdown product of urate and sometimes of urate iteslf. The distribution of pathway components in the genomes in which this family is observed suggests that the function is linked to the allantoate catabolism to glyoxylate pathway (GenProp0686) since it is sometimes found in genomes lacking any elements of the xanthine-to-allantoin pathways (e.g. in Enterococcus faecalis).
Probab=67.20 E-value=19 Score=25.66 Aligned_cols=49 Identities=16% Similarity=0.247 Sum_probs=36.3
Q ss_pred HhceEEeecCCceEE-ecCCccceeEEEeeceEEEEecCCcEEEEecCCCee
Q psy1122 25 LKMKAYIFTPGDLIC-RKGEVAREMFIIADGILEVISESGRVLTTMKAGDFF 75 (150)
Q Consensus 25 ~~~~~~~~~~g~~i~-~~g~~~~~~y~i~~G~v~~~~~~g~~~~~~~~g~~~ 75 (150)
-.+...++++|..+- .+-....+.++|++|...... ||+ ...+.+||++
T Consensus 179 ~~~~~~~~~PG~~~~~~~~H~~eh~~yiL~G~G~~~~-~g~-~~~V~~GD~i 228 (260)
T TIGR03214 179 MNVHILSFEPGASHPYIETHVMEHGLYVLEGKGVYNL-DNN-WVPVEAGDYI 228 (260)
T ss_pred cEEEEEEECCCcccCCcccccceeEEEEEeceEEEEE-CCE-EEEecCCCEE
Confidence 355678999999884 455556789999999999753 333 4678888864
No 55
>PRK15460 cpsB mannose-1-phosphate guanyltransferase; Provisional
Probab=65.03 E-value=55 Score=25.75 Aligned_cols=70 Identities=13% Similarity=0.216 Sum_probs=39.7
Q ss_pred eEEeecCCceE-EecCCccceeEEEeeceEEEEecCCcEEEEecCCCeeeeeeeeccCCCCceeeEEEEcceeEEEEEc
Q psy1122 28 KAYIFTPGDLI-CRKGEVAREMFIIADGILEVISESGRVLTTMKAGDFFGEIGILNLDGLNKRTADVRSVGYAELFSLS 105 (150)
Q Consensus 28 ~~~~~~~g~~i-~~~g~~~~~~y~i~~G~v~~~~~~g~~~~~~~~g~~~g~~~~~~~~~~~~~~~~~~a~~~~~~~~i~ 105 (150)
...+++||... .+........++|++|.+.+.-. |+ ...+.+||.+- +... .++......-++++++++-
T Consensus 388 ~~i~v~PG~~~~~~~H~~~~E~~~VlsG~~~v~id-g~-~~~L~~GDSi~----ip~g--~~H~~~N~g~~~l~iI~V~ 458 (478)
T PRK15460 388 KRITVKPGEGLSVQMHHHRAEHWVVVAGTAKVTID-GD-IKLLGENESIY----IPLG--ATHCLENPGKIPLDLIEVR 458 (478)
T ss_pred EEEEECCCCcCCcCCCCCCceEEEEEeeEEEEEEC-CE-EEEecCCCEEE----ECCC--CcEEEEcCCCCCEEEEEEE
Confidence 34466777543 12233445888899999998642 33 46888988553 2211 3333333344566666553
No 56
>PF01050 MannoseP_isomer: Mannose-6-phosphate isomerase; InterPro: IPR001538 Mannose-6-phosphate isomerase or phosphomannose isomerase (5.3.1.8 from EC) (PMI) is the enzyme that catalyses the interconversion of mannose-6-phosphate and fructose-6-phosphate. In eukaryotes PMI is involved in the synthesis of GDP-mannose, a constituent of N- and O-linked glycans and GPI anchors and in prokaryotes it participates in a variety of pathways, including capsular polysaccharide biosynthesis and D-mannose metabolism. PMI's belong to the cupin superfamily whose functions range from isomerase and epimerase activities involved in the modification of cell wall carbohydrates in bacteria and plants, to non-enzymatic storage proteins in plant seeds, and transcription factors linked to congenital baldness in mammals []. Three classes of PMI have been defined []. The type II phosphomannose isomerases are bifunctional enzymes 5.3.1.8 from EC. This entry covers the isomerase region of the protein []. The guanosine diphospho-D-mannose pyrophosphorylase region is described in another InterPro entry (see IPR005836 from INTERPRO).; GO: 0016779 nucleotidyltransferase activity, 0005976 polysaccharide metabolic process
Probab=62.12 E-value=41 Score=21.90 Aligned_cols=46 Identities=24% Similarity=0.312 Sum_probs=32.1
Q ss_pred eEEeecCCceE-EecCCccceeEEEeeceEEEEecCCcEEEEecCCCee
Q psy1122 28 KAYIFTPGDLI-CRKGEVAREMFIIADGILEVISESGRVLTTMKAGDFF 75 (150)
Q Consensus 28 ~~~~~~~g~~i-~~~g~~~~~~y~i~~G~v~~~~~~g~~~~~~~~g~~~ 75 (150)
+.....||.-+ ++.........+|++|.+.+.-.+ ....+.+|+.+
T Consensus 66 kri~V~pG~~lSlq~H~~R~E~W~Vv~G~a~v~~~~--~~~~~~~g~sv 112 (151)
T PF01050_consen 66 KRITVNPGKRLSLQYHHHRSEHWTVVSGTAEVTLDD--EEFTLKEGDSV 112 (151)
T ss_pred EEEEEcCCCccceeeecccccEEEEEeCeEEEEECC--EEEEEcCCCEE
Confidence 45567788877 455666788999999999997422 23567777643
No 57
>PF06249 EutQ: Ethanolamine utilisation protein EutQ; InterPro: IPR010424 The eut operon of Salmonella typhimurium encodes proteins involved in the cobalamin-dependent degradation of ethanolamine. The role of EutQ in this process is unclear [].; PDB: 2PYT_B 3LWC_A.
Probab=60.60 E-value=23 Score=23.15 Aligned_cols=33 Identities=27% Similarity=0.354 Sum_probs=22.3
Q ss_pred cCCccceeEEEeeceEEEEecCCcEEEEecCCCee
Q psy1122 41 KGEVAREMFIIADGILEVISESGRVLTTMKAGDFF 75 (150)
Q Consensus 41 ~g~~~~~~y~i~~G~v~~~~~~g~~~~~~~~g~~~ 75 (150)
.--..+.+.+|++|.+.+.. +|+ .....+||++
T Consensus 91 wtl~YDEi~~VlEG~L~i~~-~G~-~~~A~~GDvi 123 (152)
T PF06249_consen 91 WTLTYDEIKYVLEGTLEISI-DGQ-TVTAKPGDVI 123 (152)
T ss_dssp EE-SSEEEEEEEEEEEEEEE-TTE-EEEEETT-EE
T ss_pred EEeecceEEEEEEeEEEEEE-CCE-EEEEcCCcEE
Confidence 33446789999999998873 455 3577788854
No 58
>TIGR02272 gentisate_1_2 gentisate 1,2-dioxygenase. This family consists of gentisate 1,2-dioxygenases. This ring-opening enzyme acts in salicylate degradation that goes via gentisate rather than via catechol. It converts gentisate to maleylpyruvate. Some putative gentisate 1,2-dioxygenases are excluded by a relatively high trusted cutoff score because they are too closely related to known examples of 1-hydroxy-2-naphthoate dioxygenase. Therefore some homologs may be bona fide gentisate 1,2-dioxygenases even if they score below the given cutoffs.
Probab=60.39 E-value=61 Score=24.24 Aligned_cols=74 Identities=16% Similarity=0.147 Sum_probs=47.3
Q ss_pred EeecCCceEEecCCccceeEEEeeceEEEEecCCcEEEEecCCCeeeeeeeeccCCCCceeeEEEEcceeEEEEEcHHHH
Q psy1122 30 YIFTPGDLICRKGEVAREMFIIADGILEVISESGRVLTTMKAGDFFGEIGILNLDGLNKRTADVRSVGYAELFSLSREDV 109 (150)
Q Consensus 30 ~~~~~g~~i~~~g~~~~~~y~i~~G~v~~~~~~g~~~~~~~~g~~~g~~~~~~~~~~~~~~~~~~a~~~~~~~~i~~~~~ 109 (150)
+.+++|..--.....+..+|+|++|.-...- +|+ ....++||+|..-+.- . ++-.+.+++.++.++-.-+
T Consensus 255 q~L~~G~~t~~~r~T~s~Vf~VieG~G~s~i-g~~-~~~W~~gD~f~vPsW~------~--~~h~a~~da~Lf~~~D~Pl 324 (335)
T TIGR02272 255 QLLPKGFRTATYRSTDATVFCVVEGRGQVRI-GDA-VFRFSPKDVFVVPSWH------P--VRFEASDDAVLFSFSDRPV 324 (335)
T ss_pred hccCCCCCCCCccccccEEEEEEeCeEEEEE-CCE-EEEecCCCEEEECCCC------c--EecccCCCeEEEEecCHHH
Confidence 4556666666666667899999999888653 233 4578999988744431 1 1222345777777766655
Q ss_pred HHHH
Q psy1122 110 LSAM 113 (150)
Q Consensus 110 ~~l~ 113 (150)
++.+
T Consensus 325 l~~L 328 (335)
T TIGR02272 325 QQKL 328 (335)
T ss_pred HHHh
Confidence 5543
No 59
>COG4101 Predicted mannose-6-phosphate isomerase [Carbohydrate transport and metabolism]
Probab=59.22 E-value=42 Score=21.06 Aligned_cols=73 Identities=15% Similarity=0.073 Sum_probs=40.0
Q ss_pred ccccccCCHHHHHHHHHhceEEeecCCceEE-ecCCccceeEEEeeceEEE-EecCCcEEEEecCCCeeeeeeee
Q psy1122 9 VTIFQECQPEFLHDLVLKMKAYIFTPGDLIC-RKGEVAREMFIIADGILEV-ISESGRVLTTMKAGDFFGEIGIL 81 (150)
Q Consensus 9 ~~~f~~l~~~~~~~l~~~~~~~~~~~g~~i~-~~g~~~~~~y~i~~G~v~~-~~~~g~~~~~~~~g~~~g~~~~~ 81 (150)
+..|.++..+-.-.-.-++...++++|.--- .-.+..+...++++|.++. +-+.-..-....+||+|---...
T Consensus 30 ltyaagIa~etvGas~i~~~~vTi~pgAkakaH~H~~hEtaIYvlsG~ah~w~G~rLE~ha~~~pGDf~YiPpgV 104 (142)
T COG4101 30 LTYAAGIATETVGASGICMHLVTIPPGAKAKAHLHEEHETAIYVLSGEAHTWYGNRLEEHAEVGPGDFFYIPPGV 104 (142)
T ss_pred hhhhhhhhhhhcccceeeEEEEeeCCCccccccccccccEEEEEEeceeeeeeccceeeeEEecCCCeEEcCCCC
Confidence 3444555554444433455667777763220 1112234566779999998 32222245678899988644443
No 60
>PF05726 Pirin_C: Pirin C-terminal cupin domain; InterPro: IPR008778 This entry represents C-terminal domain of Pirin proteins from both eukaryotes and prokaryotes. The function of Pirin is unknown but the gene coding for this protein is known to be expressed in all tissues in the human body although it is expressed most strongly in the liver and heart. Pirin is known to be a nuclear protein, exclusively localised within the nucleoplasma and predominantly concentrated within dot-like subnuclear structures []. Pirin is composed of two structurally similar domains arranged face to face. The N-terminal domain additionally features four beta-strands, and the C-terminal domain also includes four additional -strands and a short alpha-helix. Although the two domains are similar, the C-terminal domain of Pirin differs from the N-terminal domain as it does not contain a metal binding site and its sequence does not contain the conserved metal-coordinating residues []. Pirin is confirmed to be a member of the cupin superfamily on the basis of primary sequence and structural similarity. The presence of a metal binding site in the N-terminal beta-barrel of Pirin, may be significant in its role in regulating NFI DNA replication and NF-kappaB transcription factor activity []. Pirin structure has been found to closely resemble members of the cupin superfamily. Pirin contains the two characteristic sequences of the cupin superfamily, namely PG-(X)5-HXH-(X)4-E-(X)6-G and G-(X)5-PXG-(X)2-H-(X)3-N separated by a variable stretch of 15-50 amino acids. These motifs are best conserved in the N-terminal where the conserved histidine and glutamic acid residues correspond to the metal-coordinating residues. The C-terminal domain motifs lack the metal binding residues normally associated with the cupin fold []. Pirin was identified to be a metal-binding protein [], and was found that the metal-binding residues of Pirins are highly conserved across mammals, plants, fungi, and prokaryotic organisms. Pirin acts as a cofactor for the transcription factor NFI, the regulatory mechanism of which is generally believed to require the assistance of a metal ion []. Structural data supports the hypothesis that the bound iron of Pirin may participate in this transcriptional regulation by enhancing and stabilising the formation of the p50,Bcl3,DNA complex []. Metals have been implicated directly or indirectly in the NF-kappaB family of transcription factors that control expression of a number of early response genes associated with inflammatory responses, cell growth, cell cycle progression, and neoplastic transformation []. However, most metal-dependent transcription factors are DNA-binding proteins that bind to specific sequences when the metal binds to the protein. Pirin, on the other hand, appears to function differently and bind to the transcription factor DNA complex [].; PDB: 1J1L_A 3ACL_A 2P17_A.
Probab=59.21 E-value=37 Score=20.35 Aligned_cols=70 Identities=17% Similarity=0.228 Sum_probs=38.7
Q ss_pred EeecCCceEEec-CCccceeEEEeeceEEEEecCCcEEEEecCCCeeeeeeeeccCCCCceeeEEEEc-ceeEEEEEcHH
Q psy1122 30 YIFTPGDLICRK-GEVAREMFIIADGILEVISESGRVLTTMKAGDFFGEIGILNLDGLNKRTADVRSV-GYAELFSLSRE 107 (150)
Q Consensus 30 ~~~~~g~~i~~~-g~~~~~~y~i~~G~v~~~~~~g~~~~~~~~g~~~g~~~~~~~~~~~~~~~~~~a~-~~~~~~~i~~~ 107 (150)
...++|..+-.. ...-+.+.++++|.+.+-... ..+.+++++- +. ......+.+. ++++++.+..+
T Consensus 4 i~l~~g~~~~~~~~~~~~~~iyv~~G~~~v~~~~----~~~~~~~~~~----l~----~g~~i~~~a~~~~a~~lll~Ge 71 (104)
T PF05726_consen 4 IKLEPGASFTLPLPPGHNAFIYVLEGSVEVGGEE----DPLEAGQLVV----LE----DGDEIELTAGEEGARFLLLGGE 71 (104)
T ss_dssp EEE-TT-EEEEEEETT-EEEEEEEESEEEETTTT----EEEETTEEEE----E-----SECEEEEEESSSSEEEEEEEE-
T ss_pred EEECCCCEEEeecCCCCEEEEEEEECcEEECCCc----ceECCCcEEE----EC----CCceEEEEECCCCcEEEEEEcc
Confidence 356677766433 344578999999998864222 3455555332 21 2344556666 78889988888
Q ss_pred HHHH
Q psy1122 108 DVLS 111 (150)
Q Consensus 108 ~~~~ 111 (150)
.+.+
T Consensus 72 Pl~E 75 (104)
T PF05726_consen 72 PLNE 75 (104)
T ss_dssp ---S
T ss_pred CCCC
Confidence 8875
No 61
>PHA00672 hypothetical protein
Probab=56.31 E-value=47 Score=21.03 Aligned_cols=35 Identities=14% Similarity=0.140 Sum_probs=27.8
Q ss_pred eEEeecCCceEEecCCccceeEEEeeceEEEEecCCc
Q psy1122 28 KAYIFTPGDLICRKGEVAREMFIIADGILEVISESGR 64 (150)
Q Consensus 28 ~~~~~~~g~~i~~~g~~~~~~y~i~~G~v~~~~~~g~ 64 (150)
+..++++|..+...-..-+.+ ++.+|.+.++.. |.
T Consensus 50 Rei~IPkGt~LtG~~hkf~~~-ii~sG~itV~td-ge 84 (152)
T PHA00672 50 RTIRIPAGVALTGALIKVSTV-LIFSGHATVFIG-GE 84 (152)
T ss_pred EEEeccCceeeeeeeeEeeEE-EEecccEEEEeC-Cc
Confidence 566789998888777777788 999999998743 44
No 62
>PF02041 Auxin_BP: Auxin binding protein; InterPro: IPR000526 Auxin binding protein is located in the lumen of the endoplasmic reticulum (ER). The primary structure contains an N-terminal hydrophobic leader sequence of 30-40 amino acids, which could represent a signal for translocation of the protein to the ER [, ]. The mature protein comprises around 165 residues, and contains a number of potential N-glycosylation sites. In vitro transport studies have demonstrated co-translational glycosylation []. Retention within the lumen of the ER correlates with an additional signal located at the C terminus, represented by the sequence Lys-Asp-Glu-Leu, known to be responsible for preventing secretion of proteins from the lumen of the ER in eukaryotic cells [, ].; GO: 0004872 receptor activity, 0005788 endoplasmic reticulum lumen; PDB: 1LR5_D 1LRH_D.
Probab=56.06 E-value=50 Score=21.64 Aligned_cols=49 Identities=18% Similarity=0.250 Sum_probs=26.7
Q ss_pred eEEeecCCceEEecCCccceeEEEeeceEEEE--ec----CCc-EEEEecCCCeee
Q psy1122 28 KAYIFTPGDLICRKGEVAREMFIIADGILEVI--SE----SGR-VLTTMKAGDFFG 76 (150)
Q Consensus 28 ~~~~~~~g~~i~~~g~~~~~~y~i~~G~v~~~--~~----~g~-~~~~~~~g~~~g 76 (150)
...+|++|.--=-....|+.+++|++|.-.++ .. .|+ .-..+.+++.|-
T Consensus 47 wlQTfAPG~~TPiHRHsCEEVFvVLkG~GTl~l~~~~~~~pG~pqef~~~pnSTf~ 102 (167)
T PF02041_consen 47 WLQTFAPGSATPIHRHSCEEVFVVLKGSGTLYLASSHEKYPGKPQEFPIFPNSTFH 102 (167)
T ss_dssp EEEEE-TT-B--EEEESS-EEEEEEE--EEEEE--SSSSS--S-EEEEE-TTEEEE
T ss_pred EeeeecCCCCCCCccccccEEEEEEecceEEEEecccccCCCCceEEEecCCCeEE
Confidence 45678888655444577999999999988874 22 244 456677777663
No 63
>PF11699 CENP-C_C: Mif2/CENP-C like; PDB: 2VPV_B.
Probab=55.42 E-value=29 Score=20.25 Aligned_cols=29 Identities=21% Similarity=0.328 Sum_probs=19.4
Q ss_pred cceeEEEeeceEEEEecCCcEEEEecCCCee
Q psy1122 45 AREMFIIADGILEVISESGRVLTTMKAGDFF 75 (150)
Q Consensus 45 ~~~~y~i~~G~v~~~~~~g~~~~~~~~g~~~ 75 (150)
...+++|++|.+++.-.+.+ ..+.+|+.|
T Consensus 33 ~~~vF~V~~G~v~Vti~~~~--f~v~~G~~F 61 (85)
T PF11699_consen 33 NTMVFYVIKGKVEVTIHETS--FVVTKGGSF 61 (85)
T ss_dssp EEEEEEEEESEEEEEETTEE--EEEETT-EE
T ss_pred cEEEEEEEeCEEEEEEcCcE--EEEeCCCEE
Confidence 45789999999998643322 456677655
No 64
>PLN00212 glutelin; Provisional
Probab=54.03 E-value=31 Score=27.19 Aligned_cols=52 Identities=15% Similarity=0.220 Sum_probs=37.5
Q ss_pred HhceEEeecCCceEEecC-CccceeEEEeeceEEE--EecCCc--EEEEecCCCeee
Q psy1122 25 LKMKAYIFTPGDLICRKG-EVAREMFIIADGILEV--ISESGR--VLTTMKAGDFFG 76 (150)
Q Consensus 25 ~~~~~~~~~~g~~i~~~g-~~~~~~y~i~~G~v~~--~~~~g~--~~~~~~~g~~~g 76 (150)
-.+....+.+|.++-+-- -..+.+.+|++|.+++ ++.+|+ .-..+.+|++|-
T Consensus 348 LSa~rv~L~~gam~~PHwn~nA~eI~yV~rG~g~vqvV~~~g~~vf~~~L~~GdvfV 404 (493)
T PLN00212 348 MSATRVNLYQNALLSPFWNVNAHSVVYITQGRARVQVVSNNGKTVFNGVLRPGQLLI 404 (493)
T ss_pred eeEEEEEEcCCcccCCeecCCCCEEEEEeecceEEEEEcCCCCEEEEEEEcCCCEEE
Confidence 345667778887775433 3368999999999987 566766 446899998763
No 65
>PF12852 Cupin_6: Cupin
Probab=53.91 E-value=44 Score=22.15 Aligned_cols=35 Identities=17% Similarity=0.263 Sum_probs=24.0
Q ss_pred cCCccceeEEEeeceEEEEecCCcEEEEecCCCee
Q psy1122 41 KGEVAREMFIIADGILEVISESGRVLTTMKAGDFF 75 (150)
Q Consensus 41 ~g~~~~~~y~i~~G~v~~~~~~g~~~~~~~~g~~~ 75 (150)
++.+.-.+|+|.+|.+.+.-+++.....+.+||++
T Consensus 31 ~~~~~~~fh~V~~G~~~l~~~~~~~~~~L~~GDiv 65 (186)
T PF12852_consen 31 PGSPGASFHVVLRGSCWLRVPGGGEPIRLEAGDIV 65 (186)
T ss_pred cCCCceEEEEEECCeEEEEEcCCCCeEEecCCCEE
Confidence 33336899999999999974442333567777765
No 66
>PHA00514 dsDNA binding protein
Probab=53.22 E-value=45 Score=19.53 Aligned_cols=54 Identities=19% Similarity=0.281 Sum_probs=34.9
Q ss_pred ecCCc-EEEEecCCCeeeeeeeeccC-----CCCceeeEEEEcce-eEEEEEcHHHHHHHH
Q psy1122 60 SESGR-VLTTMKAGDFFGEIGILNLD-----GLNKRTADVRSVGY-AELFSLSREDVLSAM 113 (150)
Q Consensus 60 ~~~g~-~~~~~~~g~~~g~~~~~~~~-----~~~~~~~~~~a~~~-~~~~~i~~~~~~~l~ 113 (150)
..||. .+..+.+-.++|-+..-... .-...++.+..+++ +.+|.+|..+|.++.
T Consensus 21 mvDGqv~~evl~~~Tl~GNLtiEqAQ~e~~k~~k~~pvqVvsVEpnt~lYElpv~~Fle~A 81 (98)
T PHA00514 21 MVDGQVQIEVLNEQTLLGNLTIEQAQKELSKQYKHGPVQVVSVEPNTKLYELPVLDFLELA 81 (98)
T ss_pred EecceEEEEEcCcceeecceeHHHHHHHHhhcccCCCeeEEEecCCCEeeeccHHHHhhhh
Confidence 34566 46666666677765543210 00234667777777 999999999999865
No 67
>PF14499 DUF4437: Domain of unknown function (DUF4437); PDB: 2QDR_A.
Probab=53.12 E-value=13 Score=26.46 Aligned_cols=52 Identities=19% Similarity=0.246 Sum_probs=27.9
Q ss_pred EEeecCCceEEecCCccceeEEEeeceEEEEecCCcEEEEecCCCeeeeeee
Q psy1122 29 AYIFTPGDLICRKGEVAREMFIIADGILEVISESGRVLTTMKAGDFFGEIGI 80 (150)
Q Consensus 29 ~~~~~~g~~i~~~g~~~~~~y~i~~G~v~~~~~~g~~~~~~~~g~~~g~~~~ 80 (150)
...+++|..--.....++.=.++++|.+......+.-...+.+|++|+.-.-
T Consensus 175 l~kLPagf~g~i~~h~~~eraVvI~G~~~~~~~~~~~~~~L~~GSYf~s~~~ 226 (251)
T PF14499_consen 175 LLKLPAGFTGRIHTHASNERAVVISGELDYQSYGASNFGTLDPGSYFGSPGH 226 (251)
T ss_dssp EEE-SSEE--SEEE--S-EEEEEEEEEEEETTEEEETTEEEEE-TT-EE--E
T ss_pred EEEcCCCCcCceeccCCceEEEEEEeEEEEeecccCCCccccCCcccccCCc
Confidence 4556666666666667777788999999983222222357889998875433
No 68
>PF05523 FdtA: WxcM-like, C-terminal ; InterPro: IPR008894 This entry includes FdtA (Q6T1W8 from SWISSPROT) from Aneurinibacillus thermoaerophilus, which has been characterised as a dTDP-6-deoxy-3,4-keto-hexulose isomerase []. It also includes WxcM (Q93S92 from SWISSPROT) from Xanthomonas campestris pv campestris) []. ; PDB: 2PAK_A 2PAE_A 2PA7_B 2PAM_A.
Probab=52.72 E-value=57 Score=20.58 Aligned_cols=66 Identities=15% Similarity=0.189 Sum_probs=30.7
Q ss_pred EecCCccceeEEEeeceEEEEecCCc--EEEEecCCCeeeeeeeeccCCCCceeeEEEE-cceeEEEEEcHHHHHH
Q psy1122 39 CRKGEVAREMFIIADGILEVISESGR--VLTTMKAGDFFGEIGILNLDGLNKRTADVRS-VGYAELFSLSREDVLS 111 (150)
Q Consensus 39 ~~~g~~~~~~y~i~~G~v~~~~~~g~--~~~~~~~g~~~g~~~~~~~~~~~~~~~~~~a-~~~~~~~~i~~~~~~~ 111 (150)
+......+..+++++|.+.+.-.+|+ ....+...+ -| +.-+ +..-..+.. .+++.++.+..+.+.+
T Consensus 48 ~H~Hk~~~~~~~~l~Gs~~v~~~d~~~~~~~~L~~~~-~~---L~Ip---pg~w~~~~~~s~~svlLv~as~~yd~ 116 (131)
T PF05523_consen 48 WHAHKKTTQWFIVLSGSFKVVLDDGREEEEFILDEPN-KG---LYIP---PGVWHGIKNFSEDSVLLVLASEPYDE 116 (131)
T ss_dssp EEEESS--EEEEEEES-EEEEEE-SS-EEEEEE--TT-EE---EEE----TT-EEEEE---TT-EEEEEESS---G
T ss_pred ccccccccEEEEEEeCEEEEEEecCCCcEEEEECCCC-eE---EEEC---CchhhHhhccCCCcEEEEEcCCCCCh
Confidence 44445678899999999999644444 344455544 22 2221 223333433 4448888887776655
No 69
>COG5616 Predicted integral membrane protein [Function unknown]
Probab=50.53 E-value=43 Score=21.92 Aligned_cols=46 Identities=11% Similarity=0.209 Sum_probs=36.9
Q ss_pred ccCCHHHHHHHHHhceEEeecCCceEEecCCccc--------eeEEEeeceEEE
Q psy1122 13 QECQPEFLHDLVLKMKAYIFTPGDLICRKGEVAR--------EMFIIADGILEV 58 (150)
Q Consensus 13 ~~l~~~~~~~l~~~~~~~~~~~g~~i~~~g~~~~--------~~y~i~~G~v~~ 58 (150)
.+++++.+..|...-...-++.+...-.+|.+.+ ..-||++|.++.
T Consensus 46 DGlTEdiIt~Lsr~~~l~VIArnssft~kgka~dv~~v~~~Lgv~YvleGsvRr 99 (152)
T COG5616 46 DGLTEDIITDLSRFRELFVIARNSSFTYKGKAVDVREVGEELGVRYVLEGSVRR 99 (152)
T ss_pred ccchHHHHHHhhhccCceEEEccceeeccCCCCCHHHHHHHhCCcEEEeeeEee
Confidence 3488999999988888888999999888886654 367889999884
No 70
>PLN02288 mannose-6-phosphate isomerase
Probab=48.38 E-value=94 Score=23.85 Aligned_cols=30 Identities=23% Similarity=0.297 Sum_probs=14.9
Q ss_pred cceeEEEeeceEEEEecCCcEEEEecCCCe
Q psy1122 45 AREMFIIADGILEVISESGRVLTTMKAGDF 74 (150)
Q Consensus 45 ~~~~y~i~~G~v~~~~~~g~~~~~~~~g~~ 74 (150)
...+.++++|.+.+....+.....+.+|+.
T Consensus 354 gp~Illv~~G~~~i~~~~~~~~~~l~~G~~ 383 (394)
T PLN02288 354 GPSVFLVIEGEGVLSTGSSEDGTAAKRGDV 383 (394)
T ss_pred CCEEEEEEcCEEEEecCCccceEEEeceeE
Confidence 445666677776664333222223455543
No 71
>TIGR02272 gentisate_1_2 gentisate 1,2-dioxygenase. This family consists of gentisate 1,2-dioxygenases. This ring-opening enzyme acts in salicylate degradation that goes via gentisate rather than via catechol. It converts gentisate to maleylpyruvate. Some putative gentisate 1,2-dioxygenases are excluded by a relatively high trusted cutoff score because they are too closely related to known examples of 1-hydroxy-2-naphthoate dioxygenase. Therefore some homologs may be bona fide gentisate 1,2-dioxygenases even if they score below the given cutoffs.
Probab=47.36 E-value=46 Score=24.88 Aligned_cols=49 Identities=22% Similarity=0.335 Sum_probs=34.5
Q ss_pred eEEeecCCceEEecCCccceeEEEeeceEEEEecCCcEEEEecCCCeeee
Q psy1122 28 KAYIFTPGDLICRKGEVAREMFIIADGILEVISESGRVLTTMKAGDFFGE 77 (150)
Q Consensus 28 ~~~~~~~g~~i~~~g~~~~~~y~i~~G~v~~~~~~g~~~~~~~~g~~~g~ 77 (150)
....+++|++--........++||++|.-..+.-+|+ .....+||+|-.
T Consensus 84 ~~q~l~pGe~~~~HRht~sAl~~vveG~G~~t~V~g~-~~~~~~gD~~~t 132 (335)
T TIGR02272 84 GLQLILPGEVAPSHRHTQSALRFIVEGKGAFTAVDGE-RTTMHPGDFIIT 132 (335)
T ss_pred hhEEeCCCCCCCccccccceEEEEEEcCceEEEECCE-EEeeeCCCEEEe
Confidence 3556778877777777788999999998643333444 357788887754
No 72
>PF07697 7TMR-HDED: 7TM-HD extracellular; InterPro: IPR011624 This entry represents the extracellular domain of the 7TM-HD (7TM Receptors with HD hydrolase) protein family []. These proteins are known or predicted, to posses metal-dependent phospohydrolase activity.
Probab=45.98 E-value=15 Score=25.02 Aligned_cols=32 Identities=19% Similarity=0.351 Sum_probs=24.1
Q ss_pred CCHHHHHHHHHhceEEe--ecCCceEEecCCccc
Q psy1122 15 CQPEFLHDLVLKMKAYI--FTPGDLICRKGEVAR 46 (150)
Q Consensus 15 l~~~~~~~l~~~~~~~~--~~~g~~i~~~g~~~~ 46 (150)
.+........+...+.. +.+|+.|+.+|+..+
T Consensus 174 ~T~~~~~~a~~~V~pv~~~V~~Ge~IV~kGe~VT 207 (222)
T PF07697_consen 174 ATEKAREEALASVSPVRGMVKKGEVIVRKGEIVT 207 (222)
T ss_pred HHHHHHHHHHhcCCchHhhccCCCEEecCCcEeC
Confidence 34555566666676666 999999999999865
No 73
>PLN00212 glutelin; Provisional
Probab=44.33 E-value=1.1e+02 Score=24.26 Aligned_cols=33 Identities=12% Similarity=0.127 Sum_probs=28.2
Q ss_pred hceEEeecCCceEEecCCccceeEEEeeceEEE
Q psy1122 26 KMKAYIFTPGDLICRKGEVAREMFIIADGILEV 58 (150)
Q Consensus 26 ~~~~~~~~~g~~i~~~g~~~~~~y~i~~G~v~~ 58 (150)
.+....+.++..+.+.-.....+.||+.|...+
T Consensus 81 ~~~R~~i~p~gL~lP~y~na~~liyV~qG~G~~ 113 (493)
T PLN00212 81 FVIRRVIEPQGLLLPRYSNTPGLVYIIQGRGSM 113 (493)
T ss_pred EEEEEEecCCcccCccccCCCeEEEEEeCeEEE
Confidence 456678899999998888889999999999876
No 74
>PF06560 GPI: Glucose-6-phosphate isomerase (GPI); InterPro: IPR010551 This entry consists of several bacterial and archaeal glucose-6-phosphate isomerase (GPI) proteins (5.3.1.9 from EC), which are involved in glycolysis and in gluconeogenesis and catalyse the conversion of D-glucose 6-phosphate to D-fructose 6-phosphate. The deduced amino acid sequence of the first archaeal PGI isolated from Pyrococcus furiosus revealed that it is not related to its eukaryotic and many of its bacterial counterparts. In contrast, this archaeal PGI shares similarity with the cupin superfamily that consists of a variety of proteins that are generally involved in sugar metabolism in both prokaryotes and eukaryotes [].; GO: 0004347 glucose-6-phosphate isomerase activity, 0006094 gluconeogenesis, 0006096 glycolysis, 0005737 cytoplasm; PDB: 1J3Q_B 1J3R_B 1J3P_A 2GC0_A 1X8E_A 1X82_A 1QY4_B 2GC2_B 1QXJ_A 1QXR_B ....
Probab=42.17 E-value=63 Score=21.84 Aligned_cols=33 Identities=21% Similarity=0.325 Sum_probs=21.4
Q ss_pred CccceeEEEeeceEEEE--ecCC----c-EEEEecCCCee
Q psy1122 43 EVAREMFIIADGILEVI--SESG----R-VLTTMKAGDFF 75 (150)
Q Consensus 43 ~~~~~~y~i~~G~v~~~--~~~g----~-~~~~~~~g~~~ 75 (150)
-....+|++++|..-+. +.+| + .+....+|+.+
T Consensus 81 ~~~pEvY~vl~G~g~~lLq~~~~~~~~~~~~v~~~~G~~v 120 (182)
T PF06560_consen 81 LSYPEVYEVLSGEGLILLQKEEGDDVGDVIAVEAKPGDVV 120 (182)
T ss_dssp TT--EEEEEEESSEEEEEE-TTS-----EEEEEE-TTEEE
T ss_pred CCCCcEEEEEeCEEEEEEEecCCCcceeEEEEEeCCCCEE
Confidence 45678999999999874 4555 4 56788888754
No 75
>PF06052 3-HAO: 3-hydroxyanthranilic acid dioxygenase; InterPro: IPR010329 Members of this protein family, from both bacteria and eukaryotes, are the enzyme 3-hydroxyanthranilate 3,4-dioxygenase (1.13.11.6 from EC). It is part of the kynurenine pathway for the degradation of tryptophan and the biosynthesis of nicotinic acid [].The prokaryotic homologue is involved in the 2-nitrobenzoate degradation pathway []. The enzyme acts on the tryptophan metabolite 3-hydroxyanthranilate and produces 2-amino-3-carboxymuconate semialdehyde, which can rearrange spontaneously to quinolinic acid and feed into nicotinamide biosynthesis, or undergo further enzymatic degradation.; GO: 0000334 3-hydroxyanthranilate 3,4-dioxygenase activity, 0005506 iron ion binding, 0008152 metabolic process, 0055114 oxidation-reduction process; PDB: 1ZVF_A 1YFX_A 1YFW_A 1YFY_A 1YFU_A 2QNK_A 3FE5_A.
Probab=41.94 E-value=99 Score=20.25 Aligned_cols=56 Identities=16% Similarity=0.253 Sum_probs=30.5
Q ss_pred cceeEEEeeceEEE-EecCCc-EEEEecCCCeeeeeeeeccCCCCceeeEEEEcceeEEEEEcHHH
Q psy1122 45 AREMFIIADGILEV-ISESGR-VLTTMKAGDFFGEIGILNLDGLNKRTADVRSVGYAELFSLSRED 108 (150)
Q Consensus 45 ~~~~y~i~~G~v~~-~~~~g~-~~~~~~~g~~~g~~~~~~~~~~~~~~~~~~a~~~~~~~~i~~~~ 108 (150)
...+++-++|.+.+ +-++|+ .-..+.+|++|- +. ...+++-+--.+|.-+++.++.
T Consensus 53 ~eE~FyQ~kG~m~Lkv~e~g~~kdi~I~EGe~fL----LP----~~vpHsP~R~~~tiGLViEr~R 110 (151)
T PF06052_consen 53 TEEFFYQLKGDMCLKVVEDGKFKDIPIREGEMFL----LP----ANVPHSPQRPADTIGLVIERKR 110 (151)
T ss_dssp S-EEEEEEES-EEEEEEETTEEEEEEE-TTEEEE----E-----TT--EEEEE-TT-EEEEEEE--
T ss_pred cceEEEEEeCcEEEEEEeCCceEEEEeCCCcEEe----cC----CCCCCCCcCCCCcEEEEEEecc
Confidence 45788999999987 445676 667888988664 32 2233333334577777776553
No 76
>PRK15457 ethanolamine utilization protein EutQ; Provisional
Probab=40.80 E-value=87 Score=22.12 Aligned_cols=29 Identities=31% Similarity=0.437 Sum_probs=21.0
Q ss_pred cceeEEEeeceEEEEecCCcEEEEecCCCee
Q psy1122 45 AREMFIIADGILEVISESGRVLTTMKAGDFF 75 (150)
Q Consensus 45 ~~~~y~i~~G~v~~~~~~g~~~~~~~~g~~~ 75 (150)
.+.+.+|++|.+.+.- +|+ ...+.|||++
T Consensus 175 ~dEi~YVLEGe~~l~I-dG~-t~~l~pGDvl 203 (233)
T PRK15457 175 YDEIDMVLEGELHVRH-EGE-TMIAKAGDVM 203 (233)
T ss_pred ceEEEEEEEeEEEEEE-CCE-EEEeCCCcEE
Confidence 4568899999999864 343 3577888754
No 77
>PF13376 OmdA: Bacteriocin-protection, YdeI or OmpD-Associated
Probab=37.87 E-value=67 Score=17.24 Aligned_cols=22 Identities=36% Similarity=0.598 Sum_probs=17.3
Q ss_pred HHHHHHHhhCchHHHHHHHHHH
Q psy1122 107 EDVLSAMKDYPEAQEILQNLGR 128 (150)
Q Consensus 107 ~~~~~l~~~~~~~~~~~~~~~~ 128 (150)
++|...+..+|....++..+..
T Consensus 8 ~dl~~aL~~~p~a~~~f~~l~~ 29 (63)
T PF13376_consen 8 EDLEAALEANPEAKEFFESLTP 29 (63)
T ss_pred HHHHHHHHCCHHHHHHHHHCCH
Confidence 6788889999997777776555
No 78
>COG4766 EutQ Ethanolamine utilization protein [Amino acid transport and metabolism]
Probab=36.46 E-value=82 Score=20.75 Aligned_cols=28 Identities=25% Similarity=0.288 Sum_probs=18.3
Q ss_pred cceeEEEeeceEEEEecCCcEEEEecCCCe
Q psy1122 45 AREMFIIADGILEVISESGRVLTTMKAGDF 74 (150)
Q Consensus 45 ~~~~y~i~~G~v~~~~~~g~~~~~~~~g~~ 74 (150)
-+.+-+|++|.+.+..++++ ...+|||+
T Consensus 118 yDe~d~VlEGrL~V~~~g~t--v~a~aGDv 145 (176)
T COG4766 118 YDEIDYVLEGRLHVRIDGRT--VIAGAGDV 145 (176)
T ss_pred ccceeEEEeeeEEEEEcCCe--EecCCCcE
Confidence 46677889999988644443 24556654
No 79
>COG3257 GlxB Uncharacterized protein, possibly involved in glyoxylate utilization [General function prediction only]
Probab=34.76 E-value=1.6e+02 Score=20.71 Aligned_cols=62 Identities=13% Similarity=0.194 Sum_probs=34.4
Q ss_pred eEEecCCccceeEEEeeceEEEEecCCcEEEEecCCCeeeeeeeeccCCCCceeeEEEEcceeEEEEEcH
Q psy1122 37 LICRKGEVAREMFIIADGILEVISESGRVLTTMKAGDFFGEIGILNLDGLNKRTADVRSVGYAELFSLSR 106 (150)
Q Consensus 37 ~i~~~g~~~~~~y~i~~G~v~~~~~~g~~~~~~~~g~~~g~~~~~~~~~~~~~~~~~~a~~~~~~~~i~~ 106 (150)
.--..++..+.+.||++|.+.+.- +|+ ...+.+|++ .++... ......-.+.+++++.++.+
T Consensus 75 ~~~e~d~~ae~~lfVv~Ge~tv~~-~G~-th~l~eggy----aylPpg--s~~~~~N~~~~~~rfhw~rk 136 (264)
T COG3257 75 QRPEGDEGAETFLFVVSGEITVKA-EGK-THALREGGY----AYLPPG--SGWTLRNAQKEDSRFHWIRK 136 (264)
T ss_pred CCCCCCCcceEEEEEEeeeEEEEE-cCe-EEEeccCCe----EEeCCC--CcceEeeccCCceEEEEEee
Confidence 333444556889999999999863 333 245666653 233221 22222223666666666543
No 80
>PRK09685 DNA-binding transcriptional activator FeaR; Provisional
Probab=30.58 E-value=2e+02 Score=20.51 Aligned_cols=57 Identities=16% Similarity=0.130 Sum_probs=30.2
Q ss_pred eEEEeeceEEEEecCCcEEEEecCCCeeeeeeeeccCCCCceeeEEEEcceeEEEEEcHHHHHHH
Q psy1122 48 MFIIADGILEVISESGRVLTTMKAGDFFGEIGILNLDGLNKRTADVRSVGYAELFSLSREDVLSA 112 (150)
Q Consensus 48 ~y~i~~G~v~~~~~~g~~~~~~~~g~~~g~~~~~~~~~~~~~~~~~~a~~~~~~~~i~~~~~~~l 112 (150)
+.++++|.+.+.. +|+ ...+.||+++ ++... .++........+..++.+|...+...
T Consensus 74 l~~~~~G~~~~~~-~g~-~~~l~~G~~~----l~~~~--~p~~~~~~~~~~~~~l~i~~~~l~~~ 130 (302)
T PRK09685 74 TVFQLSGHAIIEQ-DDR-QVQLAAGDIT----LIDAS--RPCSIYPQGLSEQISLLLPRELVEQY 130 (302)
T ss_pred EEEEecceEEEEE-CCe-EEEEcCCCEE----EEECC--CCcEeecCCCceeEEEEccHHHhhcc
Confidence 5566899888753 343 2467888764 22211 22322222222345567777766654
No 81
>KOG1758|consensus
Probab=28.14 E-value=1.1e+02 Score=20.05 Aligned_cols=61 Identities=11% Similarity=0.168 Sum_probs=39.8
Q ss_pred eEEEeeceEEEEecCCcE-EEEecCCCeeeeeeeeccCCCCceeeEEEEcceeEEEEEcHHHHHHHHhhCc
Q psy1122 48 MFIIADGILEVISESGRV-LTTMKAGDFFGEIGILNLDGLNKRTADVRSVGYAELFSLSREDVLSAMKDYP 117 (150)
Q Consensus 48 ~y~i~~G~v~~~~~~g~~-~~~~~~g~~~g~~~~~~~~~~~~~~~~~~a~~~~~~~~i~~~~~~~l~~~~~ 117 (150)
+-.+.=|.+.+...+|.. ..+++-|.. ... ..+..++.+.+...+=.|+....++++...-
T Consensus 68 i~~LkPGvvsV~~~~~~~~k~FvSsGfa-----~v~----~ds~~~ila~EA~~ledid~~~a~q~Le~aq 129 (159)
T KOG1758|consen 68 IQVLKPGVVSVHEGSGTKSKYFVSSGFA-----TVN----ADSSLQILAEEAVKLEDIDPSEAQQLLEKAQ 129 (159)
T ss_pred hheeccceEEEEeCCCcEEEEEEecceE-----EEc----CCCeEEEEehhccccccCCHHHHHHHHHHHH
Confidence 445566777777777763 566665532 121 4567777888887777888877777766543
No 82
>PF02823 ATP-synt_DE_N: ATP synthase, Delta/Epsilon chain, beta-sandwich domain; InterPro: IPR020546 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include: F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. F-ATPases (also known as F1F0-ATPase, or H(+)-transporting two-sector ATPase) (3.6.3.14 from EC) are composed of two linked complexes: the F1 ATPase complex is the catalytic core and is composed of 5 subunits (alpha, beta, gamma, delta, epsilon), while the F0 ATPase complex is the membrane-embedded proton channel that is composed of at least 3 subunits (A-C), nine in mitochondria (A-G, F6, F8). Both the F1 and F0 complexes are rotary motors that are coupled back-to-back. In the F1 complex, the central gamma subunit forms the rotor inside the cylinder made of the alpha(3)beta(3) subunits, while in the F0 complex, the ring-shaped C subunits forms the rotor. The two rotors rotate in opposite directions, but the F0 rotor is usually stronger, using the force from the proton gradient to push the F1 rotor in reverse in order to drive ATP synthesis []. These ATPases can also work in reverse to hydrolyse ATP to create a proton gradient. This family represents subunits called delta (in mitochondrial ATPase) or epsilon (in bacteria or chloroplast ATPase). The interaction site of subunit C of the F0 complex with the delta or epsilon subunit of the F1 complex may be important for connecting the rotor of F1 (gamma subunit) to the rotor of F0 (C subunit) []. In bacterial species, the delta subunit is the equivalent of the Oligomycin sensitive subunit (OSCP, IPR000711 from INTERPRO) in metazoans. The C-terminal domain of the epsilon subunit appears to act as an inhibitor of ATPase activity []. More information about this protein can be found at Protein of the Month: ATP Synthases [].; GO: 0046933 hydrogen ion transporting ATP synthase activity, rotational mechanism, 0046961 proton-transporting ATPase activity, rotational mechanism, 0015986 ATP synthesis coupled proton transport, 0045261 proton-transporting ATP synthase complex, catalytic core F(1); PDB: 2WSS_H 1E79_H 2JDI_H 2W6I_H 2W6H_H 2W6J_H 2CK3_H 2V7Q_H 1H8E_H 2XND_H ....
Probab=27.30 E-value=1.2e+02 Score=17.03 Aligned_cols=29 Identities=17% Similarity=0.229 Sum_probs=17.4
Q ss_pred CceEEecCCccceeEEEeeceEEEEecCCc
Q psy1122 35 GDLICRKGEVAREMFIIADGILEVISESGR 64 (150)
Q Consensus 35 g~~i~~~g~~~~~~y~i~~G~v~~~~~~g~ 64 (150)
|+.=+.+|. ...+.-+..|.+++...+|+
T Consensus 27 G~~gIl~~H-~p~i~~l~~G~~~i~~~~~~ 55 (80)
T PF02823_consen 27 GEFGILPNH-APFITALKPGELRIKDADGE 55 (80)
T ss_dssp SEEEEETTS--SEEEEEESEEEEEEESSSE
T ss_pred cChhhccCC-chhheeccceEEEEEEcCCC
Confidence 444444443 55667777788887666665
No 83
>COG3435 Gentisate 1,2-dioxygenase [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=26.96 E-value=1.2e+02 Score=22.51 Aligned_cols=45 Identities=22% Similarity=0.331 Sum_probs=29.3
Q ss_pred ecCCceEEecCCccceeEEEeeceEEEEecCCcEEEEecCCCeeee
Q psy1122 32 FTPGDLICRKGEVAREMFIIADGILEVISESGRVLTTMKAGDFFGE 77 (150)
Q Consensus 32 ~~~g~~i~~~g~~~~~~y~i~~G~v~~~~~~g~~~~~~~~g~~~g~ 77 (150)
+.+|++==.-....+.+.||++|.-..+.-||.. ....+||++-.
T Consensus 99 ilPGEvApsHrHsqsAlRFvveG~Ga~T~VdGer-~~M~~GDfilT 143 (351)
T COG3435 99 ILPGEVAPSHRHNQSALRFVVEGKGAYTVVDGER-TPMEAGDFILT 143 (351)
T ss_pred ecCcccCCcccccccceEEEEeccceeEeecCce-eeccCCCEEEc
Confidence 4455544444444567999999998775556552 36778887643
No 84
>TIGR01511 ATPase-IB1_Cu copper-(or silver)-translocating P-type ATPase. One member from Halobacterium is annotated as "molybdenum-binding protein" although no evidence can be found for this classification.
Probab=26.05 E-value=3.5e+02 Score=21.81 Aligned_cols=50 Identities=24% Similarity=0.226 Sum_probs=31.6
Q ss_pred ecCCceE-EecCCccceeEEEeeceEEEE--ecCCc-EEEEecCCCeeeeeeee
Q psy1122 32 FTPGDLI-CRKGEVAREMFIIADGILEVI--SESGR-VLTTMKAGDFFGEIGIL 81 (150)
Q Consensus 32 ~~~g~~i-~~~g~~~~~~y~i~~G~v~~~--~~~g~-~~~~~~~g~~~g~~~~~ 81 (150)
..+||+| +++|+....=..|++|...+. .-.|. ......+|+.+-..+..
T Consensus 110 l~~GDii~v~~Ge~iP~Dg~v~~g~~~vdes~lTGEs~pv~k~~gd~V~aGt~~ 163 (562)
T TIGR01511 110 LQPGDIVKVLPGEKIPVDGTVIEGESEVDESLVTGESLPVPKKVGDPVIAGTVN 163 (562)
T ss_pred CCCCCEEEECCCCEecCceEEEECceEEehHhhcCCCCcEEcCCCCEEEeeeEE
Confidence 4677777 588888777778889987653 23465 33445567655434433
No 85
>cd00173 SH2 Src homology 2 domains; Signal transduction, involved in recognition of phosphorylated tyrosine (pTyr). SH2 domains typically bind pTyr-containing ligands via two surface pockets, a pTyr and hydrophobic binding pocket, allowing proteins with SH2 domains to localize to tyrosine phosphorylated sites.
Probab=26.02 E-value=1.3e+02 Score=16.96 Aligned_cols=28 Identities=21% Similarity=0.249 Sum_probs=21.8
Q ss_pred cccCCHHHHHHHHHhceEEeecCCceEEecCCc
Q psy1122 12 FQECQPEFLHDLVLKMKAYIFTPGDLICRKGEV 44 (150)
Q Consensus 12 f~~l~~~~~~~l~~~~~~~~~~~g~~i~~~g~~ 44 (150)
+..++.++...++.. -+.|+.+++..+.
T Consensus 4 ~g~i~r~~Ae~~L~~-----~~~G~FLiR~s~~ 31 (94)
T cd00173 4 HGPISREEAEELLKK-----KPDGTFLVRDSES 31 (94)
T ss_pred ccCCCHHHHHHHHhc-----CCCceEEEEecCC
Confidence 456788888888765 6889999988874
No 86
>cd06929 NR_LBD_F1 Ligand-binding domain of nuclear receptor family 1. Ligand-binding domain (LBD) of nuclear receptor (NR) family 1: This is one of the major subfamily of nuclear receptors, including thyroid receptor, retinoid acid receptor, ecdysone receptor, farnesoid X receptor, vitamin D receptor, and other related receptors. Nuclear receptors form a superfamily of ligand-activated transcription regulators, which regulate various physiological functions, from development, reproduction, to homeostasis and metabolism in animals (metazoans). The family contains not only receptors for known ligands but also orphan receptors for which ligands do not exist or have not been identified. NRs share a common structural organization with a central well conserved DNA binding domain (DBD), a variable N-terminal domain, a flexible hinge and a C-terminal ligand binding domain (LBD).
Probab=25.38 E-value=59 Score=21.15 Aligned_cols=25 Identities=20% Similarity=0.017 Sum_probs=21.2
Q ss_pred ccccccccccccCCHHHHHHHHHhc
Q psy1122 3 VVCVGVVTIFQECQPEFLHDLVLKM 27 (150)
Q Consensus 3 ~~~l~~~~~f~~l~~~~~~~l~~~~ 27 (150)
.++.+++|.|..|+.++...|+...
T Consensus 19 v~~ak~ip~F~~L~~~Dq~~Llk~~ 43 (174)
T cd06929 19 VEFAKRIPGFRELSQEDQIALLKGG 43 (174)
T ss_pred HhhccCCcCcccCChhHHHHHHHhc
Confidence 3678999999999999999888644
No 87
>PF02311 AraC_binding: AraC-like ligand binding domain; InterPro: IPR003313 This entry defines the arabinose-binding and dimerisation domain of the bacterial gene regulatory protein AraC. The crystal structure of the arabinose-binding and dimerization domain of the Escherichia coli gene regulatory protein AraC was determined in the presence and absence of L-arabinose. The arabinose-bound molecule shows that the protein adopts an unusual fold, binding sugar within a beta barrel and completely burying the arabinose with the amino-terminal arm of the protein. Dimer contacts in the presence of arabinose are mediated by an antiparallel coiled-coil. In the uncomplexed protein, the amino-terminal arm is disordered, uncovering the sugar-binding pocket and allowing it to serve as an oligomerization interface [].; GO: 0006355 regulation of transcription, DNA-dependent; PDB: 1XJA_B 2ARA_A 2AAC_B 2ARC_A.
Probab=25.20 E-value=1.6e+02 Score=17.52 Aligned_cols=67 Identities=13% Similarity=0.084 Sum_probs=31.1
Q ss_pred ecCCccceeEEEeeceEEEEecCCcEEEEecCCCeeeeeeeeccCCCCceeeEEEEcce--eEEEEEcHHHHHHHHh
Q psy1122 40 RKGEVAREMFIIADGILEVISESGRVLTTMKAGDFFGEIGILNLDGLNKRTADVRSVGY--AELFSLSREDVLSAMK 114 (150)
Q Consensus 40 ~~g~~~~~~y~i~~G~v~~~~~~g~~~~~~~~g~~~g~~~~~~~~~~~~~~~~~~a~~~--~~~~~i~~~~~~~l~~ 114 (150)
......-.+.++.+|...+.- +|+ ...+.||++| ++.++ ..........++ ...+.++.+.+..+..
T Consensus 18 ~h~h~~~~i~~v~~G~~~~~~-~~~-~~~l~~g~~~----li~p~--~~H~~~~~~~~~~~~~~i~~~~~~~~~~~~ 86 (136)
T PF02311_consen 18 PHWHDFYEIIYVLSGEGTLHI-DGQ-EYPLKPGDLF----LIPPG--QPHSYYPDSNEPWEYYWIYFSPDFLEELLE 86 (136)
T ss_dssp EETT-SEEEEEEEEE-EEEEE-TTE-EEEE-TT-EE----EE-TT--S-EEEEE-TTSEEEEEEEEE---GGGGGGG
T ss_pred CEECCCEEEEEEeCCEEEEEE-CCE-EEEEECCEEE----EecCC--ccEEEecCCCCCEEEEEEEECHHHHHHHHH
Confidence 334445678999999999853 332 3678888865 23321 333333333224 3355556655555543
No 88
>PF10330 Stb3: Putative Sin3 binding protein; InterPro: IPR018818 This entry represents Sin3 binding proteins conserved in fungi. Sin3p does not bind DNA directly even though the yeast SIN3 gene functions as a transcriptional repressor. Sin3p is part of a large multiprotein complex []. Stb3 appears to bind directly to ribosomal RNA Processing Elements (RRPE) although there are no obvious domains which would accord with this, implying that Stb3 may be a novel RNA-binding protein [].
Probab=24.86 E-value=1e+02 Score=18.33 Aligned_cols=41 Identities=15% Similarity=0.182 Sum_probs=28.0
Q ss_pred ccccccccccCCHHHHHHHH-HhceEEeecCCceEEecCCccceeEEEeeceEE
Q psy1122 5 CVGVVTIFQECQPEFLHDLV-LKMKAYIFTPGDLICRKGEVAREMFIIADGILE 57 (150)
Q Consensus 5 ~l~~~~~f~~l~~~~~~~l~-~~~~~~~~~~g~~i~~~g~~~~~~y~i~~G~v~ 57 (150)
+-..+|-|++++..-...|+ ..++ .|+..+++-|=.=|+-+
T Consensus 30 Lt~~vPgF~~ls~sKqRRLi~~ALE------------~gd~~~~VvFEKvGWGr 71 (92)
T PF10330_consen 30 LTTSVPGFSDLSPSKQRRLIMAALE------------GGDKDGDVVFEKVGWGR 71 (92)
T ss_pred HhccCCCcccCCHHHHHHHHHHHHh------------cCCCCCCEEEEEeccce
Confidence 34678999999987776665 4555 56666666666666554
No 89
>TIGR01525 ATPase-IB_hvy heavy metal translocating P-type ATPase. This alignment encompasses two equivalog models for the copper and cadmium-type heavy metal transporting P-type ATPases (TIGR01511 and TIGR01512) as well as those species which score ambiguously between both models. For more comments and references, see the files on TIGR01511 and 01512.
Probab=24.83 E-value=3.6e+02 Score=21.65 Aligned_cols=45 Identities=27% Similarity=0.302 Sum_probs=29.5
Q ss_pred eecCCceE-EecCCccceeEEEeeceEEEEe--cCCc-EEEEecCCCee
Q psy1122 31 IFTPGDLI-CRKGEVAREMFIIADGILEVIS--ESGR-VLTTMKAGDFF 75 (150)
Q Consensus 31 ~~~~g~~i-~~~g~~~~~~y~i~~G~v~~~~--~~g~-~~~~~~~g~~~ 75 (150)
...+||+| +++|+....=..|++|...+.. -.|+ ......+|+.+
T Consensus 73 ~l~~GDiv~v~~G~~iP~Dg~vi~g~~~vdes~lTGEs~pv~k~~g~~v 121 (556)
T TIGR01525 73 ELQVGDIVIVRPGERIPVDGVVISGESEVDESALTGESMPVEKKEGDEV 121 (556)
T ss_pred HCCCCCEEEECCCCEeccceEEEecceEEeehhccCCCCCEecCCcCEE
Confidence 45778887 5788888777788999876532 2455 33344566644
No 90
>PF13128 DUF3954: Protein of unknown function (DUF3954)
Probab=24.79 E-value=1.2e+02 Score=15.84 Aligned_cols=16 Identities=25% Similarity=0.592 Sum_probs=12.7
Q ss_pred cceeEEEeeceEEEEe
Q psy1122 45 AREMFIIADGILEVIS 60 (150)
Q Consensus 45 ~~~~y~i~~G~v~~~~ 60 (150)
.+.+|+|.+|.+...+
T Consensus 9 ~ngiYiV~~G~v~~i~ 24 (50)
T PF13128_consen 9 ENGIYIVKDGEVTFIE 24 (50)
T ss_pred CCeEEEEECCeEEEcC
Confidence 4778999999888764
No 91
>PLN02559 chalcone--flavonone isomerase
Probab=24.10 E-value=1.2e+02 Score=21.45 Aligned_cols=39 Identities=15% Similarity=0.229 Sum_probs=29.9
Q ss_pred cccccccccCCHHHHHHHHHhceEEeecCCceEEecCCc
Q psy1122 6 VGVVTIFQECQPEFLHDLVLKMKAYIFTPGDLICRKGEV 44 (150)
Q Consensus 6 l~~~~~f~~l~~~~~~~l~~~~~~~~~~~g~~i~~~g~~ 44 (150)
++.+..+.+-..+.++.+.+.++-+.|+||..|...-.+
T Consensus 117 ~ks~g~y~daE~~aLekF~~~Fk~~~fp~Gs~I~ft~sp 155 (230)
T PLN02559 117 WKSLGIYTDAEAKAVEKFKEAFKEETFPPGSSILFTHSP 155 (230)
T ss_pred HHhcCCcchhHHHHHHHHHHHhcCCCCCCCCEEEEEECC
Confidence 455556666678899999999999999999988654333
No 92
>cd03702 IF2_mtIF2_II This family represents the domain II of bacterial Initiation Factor 2 (IF2) and its eukaryotic mitochondrial homologue mtIF2. IF2, the largest initiation factor is an essential GTP binding protein. In E. coli three natural forms of IF2 exist in the cell, IF2alpha, IF2beta1, and IF2beta2. Bacterial IF-2 is structurally and functionally related to eukaryotic mitochondrial mtIF-2.
Probab=23.59 E-value=1.7e+02 Score=17.34 Aligned_cols=41 Identities=12% Similarity=0.253 Sum_probs=24.5
Q ss_pred ceEEecCCccceeEEEe---eceEEE-EecCCcEEEEecCCCeee
Q psy1122 36 DLICRKGEVAREMFIIA---DGILEV-ISESGRVLTTMKAGDFFG 76 (150)
Q Consensus 36 ~~i~~~g~~~~~~y~i~---~G~v~~-~~~~g~~~~~~~~g~~~g 76 (150)
++|++.|.-...-+|+. .|.++. .++.|+.+....|++.+-
T Consensus 19 tviV~~GtL~~Gd~iv~G~~~gkVr~l~d~~g~~v~~a~Ps~~V~ 63 (95)
T cd03702 19 TVLVQNGTLKVGDVLVAGTTYGKVRAMFDENGKRVKEAGPSTPVE 63 (95)
T ss_pred EEEEEcCeEeCCCEEEEcccccEEEEEECCCCCCCCEECCCCcEE
Confidence 44555555544444443 356665 466677778888887654
No 93
>KOG1517|consensus
Probab=23.44 E-value=1.2e+02 Score=26.67 Aligned_cols=33 Identities=15% Similarity=0.183 Sum_probs=27.4
Q ss_pred EEcceeEEE-EEcHHHHHHHHhhCchHHHHHHHH
Q psy1122 94 RSVGYAELF-SLSREDVLSAMKDYPEAQEILQNL 126 (150)
Q Consensus 94 ~a~~~~~~~-~i~~~~~~~l~~~~~~~~~~~~~~ 126 (150)
.|++|+..| .+|++-|+++++++--+..+++++
T Consensus 355 TAITDTIAWn~lPR~LFQrLFRQDLLvAsLFRNF 388 (1387)
T KOG1517|consen 355 TAITDTIAWNSLPRELFQRLFRQDLLVASLFRNF 388 (1387)
T ss_pred eeeeceehhhcCCHHHHHHHHHHHHHHHHHHHHH
Confidence 588999888 689999999999998866666653
No 94
>KOG1448|consensus
Probab=22.56 E-value=1.9e+02 Score=21.43 Aligned_cols=51 Identities=18% Similarity=0.305 Sum_probs=36.4
Q ss_pred ccccccccCCHHHHHHHH-Hh-------ceEEeecCCceEEecCCcc--ceeEEEeeceEE
Q psy1122 7 GVVTIFQECQPEFLHDLV-LK-------MKAYIFTPGDLICRKGEVA--REMFIIADGILE 57 (150)
Q Consensus 7 ~~~~~f~~l~~~~~~~l~-~~-------~~~~~~~~g~~i~~~g~~~--~~~y~i~~G~v~ 57 (150)
+++.+|.+-+-.++.+.+ .. ..++.|+-|++-++-++.. .++|++.+|.-.
T Consensus 2 ~~i~lf~g~shp~La~~I~~~lgi~l~~v~~kkf~nge~~v~i~esvR~~dV~iiqsgsg~ 62 (316)
T KOG1448|consen 2 KNIKLFSGDSHPELAERIAARLGIELGKVNLKKFSNGETSVQIGESVRGEDVYIIQSGSGP 62 (316)
T ss_pred CceEEEcCCCCHHHHHHHHHHhCCCcceeeeEEccCCcEEEecccccccCcEEEeccCCCc
Confidence 345677776655555443 32 3688999999888777765 689999888776
No 95
>PF06865 DUF1255: Protein of unknown function (DUF1255); InterPro: IPR009664 This family consists of several conserved hypothetical bacterial proteins of around 95 residues in length. The function of this family is unknown; PDB: 2OYZ_A 3HQX_A.
Probab=22.32 E-value=1.9e+02 Score=17.32 Aligned_cols=62 Identities=23% Similarity=0.301 Sum_probs=35.2
Q ss_pred eecCCceEEecCCccceeEEEeeceEEEEecCCcEEEEecCCCeeeeeeeeccCCCCceeeEEEEcceeEEE
Q psy1122 31 IFTPGDLICRKGEVAREMFIIADGILEVISESGRVLTTMKAGDFFGEIGILNLDGLNKRTADVRSVGYAELF 102 (150)
Q Consensus 31 ~~~~g~~i~~~g~~~~~~y~i~~G~v~~~~~~g~~~~~~~~g~~~g~~~~~~~~~~~~~~~~~~a~~~~~~~ 102 (150)
...||+.-|. ......--|++|.+.+.-++..-...+.+|+.|. . . ..+.+.+++.+++.-+
T Consensus 29 Vm~pGeY~F~--T~~~E~M~vvsG~l~V~lpg~~ew~~~~aGesF~---V-p----anssF~v~v~~~~~Y~ 90 (94)
T PF06865_consen 29 VMLPGEYTFG--TSAPERMEVVSGELEVKLPGEDEWQTYSAGESFE---V-P----ANSSFDVKVKEPTAYL 90 (94)
T ss_dssp EE-SECEEEE--ESS-EEEEEEESEEEEEETT-SS-EEEETT-EEE---E------TTEEEEEEESS-EEEE
T ss_pred EEeeeEEEEc--CCCCEEEEEEEeEEEEEcCCCcccEEeCCCCeEE---E-C----CCCeEEEEECcceeeE
Confidence 3466774443 5556777788999988644433567788888664 1 1 3566677776665433
No 96
>PF13099 DUF3944: Domain of unknown function (DUF3944)
Probab=22.19 E-value=44 Score=15.97 Aligned_cols=20 Identities=35% Similarity=0.526 Sum_probs=16.3
Q ss_pred ccccccCCHHHHHHHHHhce
Q psy1122 9 VTIFQECQPEFLHDLVLKMK 28 (150)
Q Consensus 9 ~~~f~~l~~~~~~~l~~~~~ 28 (150)
+.++..++.+++..+...+.
T Consensus 7 LeFL~~cs~edL~~L~~~Lt 26 (35)
T PF13099_consen 7 LEFLAECSNEDLKDLVDILT 26 (35)
T ss_pred hHHHHHCCHHHHHHHHHHHh
Confidence 56788899999999988664
No 97
>KOG2757|consensus
Probab=21.82 E-value=3.6e+02 Score=20.69 Aligned_cols=44 Identities=11% Similarity=0.199 Sum_probs=0.0
Q ss_pred EeecCCceEEecCCccceeEEEeeceEEEEec-CCcEEEEecCCCee
Q psy1122 30 YIFTPGDLICRKGEVAREMFIIADGILEVISE-SGRVLTTMKAGDFF 75 (150)
Q Consensus 30 ~~~~~g~~i~~~g~~~~~~y~i~~G~v~~~~~-~g~~~~~~~~g~~~ 75 (150)
..++.|+.-.-++-++-++++|++|...+... ++. ..+.+|+++
T Consensus 338 ~~v~~g~~~~~~~~~~~SIllv~~G~g~l~~~t~~~--~~v~rG~V~ 382 (411)
T KOG2757|consen 338 TKVPTGESYKFPGVDGPSILLVLKGSGILKTDTDSK--ILVNRGDVL 382 (411)
T ss_pred eecCCCceEEeecCCCceEEEEEecceEEecCCCCc--eeeccCcEE
No 98
>PLN02804 chalcone isomerase
Probab=21.79 E-value=89 Score=21.62 Aligned_cols=36 Identities=17% Similarity=0.217 Sum_probs=27.6
Q ss_pred cccccccccCCHHHHHHHHHhceEEeecCCceEEec
Q psy1122 6 VGVVTIFQECQPEFLHDLVLKMKAYIFTPGDLICRK 41 (150)
Q Consensus 6 l~~~~~f~~l~~~~~~~l~~~~~~~~~~~g~~i~~~ 41 (150)
+++.....+-..+.++++.+.++-+.|+||..|.-.
T Consensus 111 lk~~~~y~d~e~~aL~kf~~~Fk~~~fp~Gs~I~ft 146 (206)
T PLN02804 111 LAEDDKYEEEEEEALEKVVEFFQSKYFKKNSIITYH 146 (206)
T ss_pred HhcCCCCcchHHHHHHHHHHHhCCCcCCCCCEEEEE
Confidence 344444555567899999999999999999988654
No 99
>PRK00529 elongation factor P; Validated
Probab=21.41 E-value=2.7e+02 Score=18.79 Aligned_cols=75 Identities=19% Similarity=0.123 Sum_probs=38.6
Q ss_pred EeecCCceEEecCCcccee--EEEeece------EEEEe-cCCc-EEEEecCCCeeeeeeeeccCCCCceeeEEEEccee
Q psy1122 30 YIFTPGDLICRKGEVAREM--FIIADGI------LEVIS-ESGR-VLTTMKAGDFFGEIGILNLDGLNKRTADVRSVGYA 99 (150)
Q Consensus 30 ~~~~~g~~i~~~g~~~~~~--y~i~~G~------v~~~~-~~g~-~~~~~~~g~~~g~~~~~~~~~~~~~~~~~~a~~~~ 99 (150)
-.+++|.+|...|+++.-+ -+..-|. +++.+ .+|+ .-..+..++-+-...+ ....+.+....+.
T Consensus 5 ~~ik~G~~I~~~g~~~~V~~~~~~kpGkg~A~vrvk~knL~tG~~~e~~f~~~e~ve~~~v------e~~~~q~ly~dgd 78 (186)
T PRK00529 5 NDLRKGLVIEIDGEPYVVLEFEHVKPGKGQAFVRTKLKNLLTGSVVEKTFKAGDKVERADV------ERREMQYLYNDGD 78 (186)
T ss_pred hhcCCCCEEEECCEEEEEEEEEEeeCCCCceEEEEEEEECCCCCeEEEEeCCCCEEEeccE------EeEEEEEEEECCC
Confidence 4678999998888766422 2333353 33322 3455 3345566665433322 2344444444444
Q ss_pred EEEEEcHHHHH
Q psy1122 100 ELFSLSREDVL 110 (150)
Q Consensus 100 ~~~~i~~~~~~ 110 (150)
.+.+++.+.++
T Consensus 79 ~~~fMD~etye 89 (186)
T PRK00529 79 GYVFMDTETYE 89 (186)
T ss_pred EEEEecCCCce
Confidence 45555554444
No 100
>PF02678 Pirin: Pirin; InterPro: IPR003829 This entry represents N-terminal domain of Pirin proteins from both eukaryotes and prokaryotes. The function of Pirin is unknown but the gene coding for this protein is known to be expressed in all tissues in the human body although it is expressed most strongly in the liver and heart. Pirin is known to be a nuclear protein, exclusively localised within the nucleoplasma and predominantly concentrated within dot-like subnuclear structures []. Pirin is composed of two structurally similar domains arranged face to face. The N-terminal domain additionally features four beta-strands, and the C-terminal domain also includes four additional -strands and a short alpha-helix. Although the two domains are similar, the C-terminal domain of Pirin differs from the N-terminal domain as it does not contain a metal binding site and its sequence does not contain the conserved metal-coordinating residues []. Pirin is confirmed to be a member of the cupin superfamily on the basis of primary sequence and structural similarity. The presence of a metal binding site in the N-terminal beta-barrel of Pirin, may be significant in its role in regulating NFI DNA replication and NF-kappaB transcription factor activity []. Pirin structure has been found to closely resemble members of the cupin superfamily. Pirin contains the two characteristic sequences of the cupin superfamily, namely PG-(X)5-HXH-(X)4-E-(X)6-G and G-(X)5-PXG-(X)2-H-(X)3-N separated by a variable stretch of 15-50 amino acids. These motifs are best conserved in the N-terminal where the conserved histidine and glutamic acid residues correspond to the metal-coordinating residues. The C-terminal domain motifs lack the metal binding residues normally associated with the cupin fold []. Pirin was identified to be a metal-binding protein [], and was found that the metal-binding residues of Pirins are highly conserved across mammals, plants, fungi, and prokaryotic organisms. Pirin acts as a cofactor for the transcription factor NFI, the regulatory mechanism of which is generally believed to require the assistance of a metal ion []. Structural data supports the hypothesis that the bound iron of Pirin may participate in this transcriptional regulation by enhancing and stabilising the formation of the p50,Bcl3,DNA complex []. Metals have been implicated directly or indirectly in the NF-kappaB family of transcription factors that control expression of a number of early response genes associated with inflammatory responses, cell growth, cell cycle progression, and neoplastic transformation []. However, most metal-dependent transcription factors are DNA-binding proteins that bind to specific sequences when the metal binds to the protein. Pirin, on the other hand, appears to function differently and bind to the transcription factor DNA complex [].; PDB: 2VEC_A 1J1L_A 3ACL_A 2P17_A 1TQ5_A.
Probab=21.39 E-value=2.1e+02 Score=17.44 Aligned_cols=30 Identities=20% Similarity=0.238 Sum_probs=20.9
Q ss_pred CccceeEEEeeceEEEEecCCcEEEEecCCC
Q psy1122 43 EVAREMFIIADGILEVISESGRVLTTMKAGD 73 (150)
Q Consensus 43 ~~~~~~y~i~~G~v~~~~~~g~~~~~~~~g~ 73 (150)
...+-+-++++|.+...++.|.. ..+.+|+
T Consensus 48 ~g~eivTyv~~G~~~H~Ds~G~~-~~l~~G~ 77 (107)
T PF02678_consen 48 RGFEIVTYVLEGELRHRDSLGNR-GVLRAGD 77 (107)
T ss_dssp CSEEEEEEEEESEEEEEETTSEE-EEEETTE
T ss_pred CCceEEEEEecCEEEEECCCCCe-eEeCCCe
Confidence 33566888899999887776662 3466664
No 101
>PF10794 DUF2606: Protein of unknown function (DUF2606); InterPro: IPR019730 This entry represents bacterial proteins with unknown function.
Probab=21.24 E-value=1.6e+02 Score=18.57 Aligned_cols=29 Identities=14% Similarity=0.314 Sum_probs=22.0
Q ss_pred eecCCceEEecCCccceeEEEeeceEEEE
Q psy1122 31 IFTPGDLICRKGEVAREMFIIADGILEVI 59 (150)
Q Consensus 31 ~~~~g~~i~~~g~~~~~~y~i~~G~v~~~ 59 (150)
+=..|+++|+.|...++.-++-.|....+
T Consensus 84 TD~~Gki~Wk~~~kG~Y~v~l~n~e~~~~ 112 (131)
T PF10794_consen 84 TDEEGKIIWKNGRKGKYIVFLPNGETQET 112 (131)
T ss_pred cCCCCcEEEecCCcceEEEEEcCCCceeE
Confidence 34568888888888888888888877654
No 102
>PF07742 BTG: BTG family; InterPro: IPR002087 Anti-proliferative proteins have been shown to include mammalian and avian protein BTG1 (which appears to be involved in negative regulation of cell proliferation) and rat/mouse NGF-inducible protein PC3/TIS21 (BTG2) [, , ]. These proteins have from 158 to 363 amino acid residues, that are highly similar and include 3 conserved cysteine residues. BTG2 seems to have a signal sequence; while the other proteins may lack such a domain. The sequence of the N-terminal half of these proteins is well conserved.; PDB: 3DJU_B 3E9V_A 2Z15_D 2D5R_B 3DJN_B.
Probab=20.64 E-value=2.3e+02 Score=17.65 Aligned_cols=39 Identities=21% Similarity=0.234 Sum_probs=26.4
Q ss_pred ccccCCHHHHHHHHHhceEEeecCCceEEecCCccceeEE
Q psy1122 11 IFQECQPEFLHDLVLKMKAYIFTPGDLICRKGEVAREMFI 50 (150)
Q Consensus 11 ~f~~l~~~~~~~l~~~~~~~~~~~g~~i~~~g~~~~~~y~ 50 (150)
.-++++.+++..++..--..-.+||++-++-|+.+ .+.+
T Consensus 77 ~~sgl~~~~l~~~LP~eltlWvDPgeVs~R~ge~g-~~~v 115 (118)
T PF07742_consen 77 KESGLSYEDLRSLLPRELTLWVDPGEVSYRIGEKG-SICV 115 (118)
T ss_dssp HHTT--HHHHHHHS-TTEEEEEETTEEEEEESTTS--EEE
T ss_pred HHhCCCHHHHHHhcchhcEEEECCCEEEEEEcCCC-ceEE
Confidence 34567777777777766677889999999999977 4443
No 103
>PF04648 MF_alpha: Yeast mating factor alpha hormone; InterPro: IPR006742 This repeated sequence,WHWLQLKPGQPMY, characterises the mating factor alpha-1 or alpha-1 mating pheromone [contains: Mating factor alpha].The hormone is excreted into the culture medium by haploid cells of the alpha mating type and acts on cells of the opposite mating type (type A) by binding to a cognate G-protein coupled receptor which is coupled to a downstream signal transduction pathway. It inhibits DNA synthesis in type A cells synchronising them with type alpha, and so mediates the conjugation process.; GO: 0000772 mating pheromone activity, 0019953 sexual reproduction, 0005576 extracellular region
Probab=20.13 E-value=41 Score=12.17 Aligned_cols=8 Identities=25% Similarity=0.683 Sum_probs=4.0
Q ss_pred eecCCceE
Q psy1122 31 IFTPGDLI 38 (150)
Q Consensus 31 ~~~~g~~i 38 (150)
.+.+|+.+
T Consensus 5 ~~~~GqP~ 12 (13)
T PF04648_consen 5 RLSPGQPM 12 (13)
T ss_pred eccCCCcC
Confidence 44555544
Done!