Query psy5204
Match_columns 209
No_of_seqs 133 out of 683
Neff 7.0
Searched_HMMs 46136
Date Fri Aug 16 20:00:23 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy5204.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/5204hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4064|consensus 100.0 1E-47 2.2E-52 299.5 8.7 183 19-208 9-191 (196)
2 PF05995 CDO_I: Cysteine dioxy 100.0 2.5E-45 5.5E-50 298.5 16.1 165 17-189 11-175 (175)
3 COG5553 Predicted metal-depend 99.9 2.7E-24 5.8E-29 169.5 13.8 156 22-189 8-170 (191)
4 PF07847 DUF1637: Protein of u 99.5 2.9E-13 6.2E-18 112.2 10.8 114 72-189 34-159 (200)
5 KOG4281|consensus 99.3 3.6E-11 7.8E-16 99.2 10.3 160 23-187 3-188 (236)
6 TIGR01479 GMP_PMI mannose-1-ph 98.3 1.3E-05 2.7E-10 74.6 14.4 116 43-180 336-453 (468)
7 PF07883 Cupin_2: Cupin domain 98.3 2.8E-06 6.1E-11 57.8 7.5 71 86-176 1-71 (71)
8 PRK13290 ectC L-ectoine syntha 98.3 2.5E-05 5.5E-10 60.3 12.2 97 70-187 21-118 (125)
9 TIGR03404 bicupin_oxalic bicup 98.2 2.4E-05 5.3E-10 70.7 12.2 86 80-180 242-327 (367)
10 COG1917 Uncharacterized conser 98.1 4.5E-05 9.8E-10 58.6 10.3 80 79-178 39-118 (131)
11 PRK15460 cpsB mannose-1-phosph 98.1 0.00012 2.7E-09 68.2 14.5 115 43-181 345-463 (478)
12 PRK09943 DNA-binding transcrip 98.0 0.0006 1.3E-08 55.5 16.7 120 39-179 45-183 (185)
13 COG0662 {ManC} Mannose-6-phosp 97.9 0.00025 5.4E-09 54.6 11.8 83 80-182 33-115 (127)
14 PF02041 Auxin_BP: Auxin bindi 97.8 0.00022 4.7E-09 56.4 9.9 99 79-188 40-138 (167)
15 COG2140 Thermophilic glucose-6 97.8 0.00026 5.5E-09 59.0 9.8 79 88-181 85-165 (209)
16 PF01050 MannoseP_isomer: Mann 97.7 0.0011 2.4E-08 52.8 13.0 117 42-180 22-140 (151)
17 TIGR02451 anti_sig_ChrR anti-s 97.7 0.00046 9.9E-09 58.0 10.4 86 71-181 115-200 (215)
18 smart00835 Cupin_1 Cupin. This 97.7 0.0008 1.7E-08 52.7 10.9 80 80-173 27-106 (146)
19 PRK11171 hypothetical protein; 97.5 0.0025 5.4E-08 55.1 13.3 87 69-175 45-134 (266)
20 PRK11171 hypothetical protein; 97.4 0.0026 5.5E-08 55.1 11.7 103 53-177 150-257 (266)
21 PRK04190 glucose-6-phosphate i 97.4 0.0079 1.7E-07 49.7 13.6 114 53-181 35-159 (191)
22 PF00190 Cupin_1: Cupin; Inte 97.3 0.0024 5.2E-08 49.9 9.8 89 81-180 32-122 (144)
23 TIGR03214 ura-cupin putative a 97.2 0.0059 1.3E-07 52.7 11.4 78 81-179 56-134 (260)
24 TIGR03214 ura-cupin putative a 97.0 0.012 2.6E-07 50.8 11.8 96 55-171 149-247 (260)
25 COG4101 Predicted mannose-6-ph 96.8 0.014 3.1E-07 44.6 9.3 86 80-182 43-128 (142)
26 TIGR03404 bicupin_oxalic bicup 96.7 0.019 4.2E-07 52.0 11.0 82 81-179 65-146 (367)
27 PF12973 Cupin_7: ChrR Cupin-l 96.6 0.0088 1.9E-07 43.1 6.7 69 71-163 10-80 (91)
28 COG3837 Uncharacterized conser 96.1 0.057 1.2E-06 43.2 8.9 86 72-176 29-118 (161)
29 PF02311 AraC_binding: AraC-li 95.9 0.1 2.2E-06 38.5 9.3 75 84-179 3-78 (136)
30 PRK12335 tellurite resistance 95.2 0.078 1.7E-06 46.0 7.3 79 91-182 19-98 (287)
31 COG3615 TehB Uncharacterized p 94.6 0.22 4.8E-06 36.6 7.1 65 102-179 34-98 (99)
32 PF09313 DUF1971: Domain of un 94.4 0.51 1.1E-05 33.8 8.6 52 103-166 24-75 (82)
33 TIGR03037 anthran_nbaC 3-hydro 93.9 0.61 1.3E-05 37.5 8.9 96 74-189 21-119 (159)
34 PRK13264 3-hydroxyanthranilate 93.4 0.96 2.1E-05 37.0 9.5 97 74-189 27-125 (177)
35 PF02678 Pirin: Pirin; InterP 93.0 0.59 1.3E-05 35.0 7.2 66 93-175 39-105 (107)
36 COG1741 Pirin-related protein 92.8 0.61 1.3E-05 40.8 8.0 73 90-179 51-125 (276)
37 PF13621 Cupin_8: Cupin-like d 92.4 0.88 1.9E-05 37.6 8.3 89 86-176 134-244 (251)
38 PF06560 GPI: Glucose-6-phosph 89.6 5.7 0.00012 32.6 10.2 101 83-194 50-172 (182)
39 TIGR02272 gentisate_1_2 gentis 89.5 1.3 2.8E-05 39.8 6.9 80 82-181 80-159 (335)
40 PRK10296 DNA-binding transcrip 89.4 3.7 8.1E-05 34.9 9.5 52 93-164 33-84 (278)
41 PF14499 DUF4437: Domain of un 87.9 2.8 6.1E-05 36.1 7.7 59 58-117 8-69 (251)
42 PLN00212 glutelin; Provisional 87.7 8.6 0.00019 36.4 11.4 97 80-179 77-185 (493)
43 PRK13500 transcriptional activ 85.5 7.7 0.00017 33.9 9.4 66 92-179 57-122 (312)
44 PF06339 Ectoine_synth: Ectoin 84.7 16 0.00036 28.2 11.6 96 71-189 23-120 (126)
45 PRK13501 transcriptional activ 83.7 8 0.00017 33.1 8.6 53 93-165 28-80 (290)
46 COG4766 EutQ Ethanolamine util 81.9 25 0.00054 28.3 11.1 80 71-175 87-167 (176)
47 PF13759 2OG-FeII_Oxy_5: Putat 80.9 11 0.00025 27.2 7.3 88 84-173 1-98 (101)
48 PRK10371 DNA-binding transcrip 80.9 11 0.00023 33.0 8.4 65 81-165 24-88 (302)
49 PLN00212 glutelin; Provisional 80.6 8.1 0.00018 36.5 7.9 70 81-163 346-415 (493)
50 PRK15457 ethanolamine utilizat 79.5 22 0.00047 30.4 9.4 73 71-165 144-217 (233)
51 COG3257 GlxB Uncharacterized p 78.6 19 0.00042 30.6 8.7 80 81-182 59-140 (264)
52 TIGR02297 HpaA 4-hydroxyphenyl 73.7 17 0.00038 30.7 7.5 55 92-165 32-86 (287)
53 TIGR02466 conserved hypothetic 72.9 25 0.00055 29.2 8.1 23 83-105 96-118 (201)
54 COG3806 ChrR Transcriptional a 72.6 14 0.00031 30.8 6.3 80 74-179 120-199 (216)
55 PRK13502 transcriptional activ 70.8 39 0.00085 28.6 9.1 54 92-165 27-80 (282)
56 PRK13503 transcriptional activ 68.2 22 0.00048 29.9 6.9 63 82-164 14-76 (278)
57 COG3822 ABC-type sugar transpo 67.9 18 0.0004 30.1 5.9 78 86-164 89-176 (225)
58 PF05523 FdtA: WxcM-like, C-te 67.2 32 0.0007 26.4 7.1 75 90-181 40-115 (131)
59 PF07385 DUF1498: Protein of u 65.8 17 0.00038 30.8 5.6 82 83-165 87-178 (225)
60 COG3435 Gentisate 1,2-dioxygen 65.3 11 0.00024 33.6 4.4 96 90-205 99-200 (351)
61 PF08007 Cupin_4: Cupin superf 65.3 79 0.0017 27.9 10.1 93 83-178 114-212 (319)
62 TIGR02272 gentisate_1_2 gentis 64.5 40 0.00087 30.3 8.0 59 86-164 253-311 (335)
63 TIGR03853 matur_matur probable 63.8 15 0.00033 25.9 4.1 40 13-52 9-56 (77)
64 PF11699 CENP-C_C: Mif2/CENP-C 59.5 62 0.0013 23.1 7.5 71 81-171 10-80 (85)
65 PF02373 JmjC: JmjC domain, hy 57.0 25 0.00055 25.2 4.6 33 140-174 81-113 (114)
66 PF05899 Cupin_3: Protein of u 56.5 61 0.0013 22.1 6.7 55 80-154 4-58 (74)
67 PF10678 DUF2492: Protein of u 55.5 27 0.00058 24.8 4.2 39 13-51 11-57 (78)
68 COG4297 Uncharacterized protei 55.4 77 0.0017 25.1 7.2 54 86-154 46-99 (163)
69 KOG2757|consensus 53.7 1.2E+02 0.0026 27.8 9.1 68 74-159 323-391 (411)
70 PLN02288 mannose-6-phosphate i 47.6 99 0.0022 28.4 7.8 62 76-154 326-388 (394)
71 COG3257 GlxB Uncharacterized p 38.9 1.4E+02 0.0031 25.5 6.8 83 53-154 150-235 (264)
72 PF11142 DUF2917: Protein of u 35.9 92 0.002 20.8 4.4 30 88-119 2-31 (63)
73 PF05118 Asp_Arg_Hydrox: Aspar 35.6 54 0.0012 26.0 3.7 75 82-172 79-154 (163)
74 COG3450 Predicted enzyme of th 34.9 2E+02 0.0044 21.8 8.1 70 65-154 27-96 (116)
75 PF03991 Prion_octapep: Copper 34.3 16 0.00034 14.8 0.2 6 101-106 2-7 (8)
76 KOG3706|consensus 33.4 28 0.00061 33.0 2.0 80 94-178 329-415 (629)
77 PF10902 DUF2693: Protein of u 32.9 1.5E+02 0.0033 21.2 5.2 27 181-207 49-76 (83)
78 COG3435 Gentisate 1,2-dioxygen 32.1 1.9E+02 0.0041 26.0 6.7 57 89-165 267-323 (351)
79 PF03392 OS-D: Insect pheromon 30.0 72 0.0016 23.2 3.3 63 8-70 20-89 (95)
80 PRK09685 DNA-binding transcrip 29.9 2.8E+02 0.0062 23.4 7.6 84 71-175 34-122 (302)
81 PF06249 EutQ: Ethanolamine ut 28.1 2.2E+02 0.0047 22.7 6.0 71 63-154 54-127 (152)
82 TIGR03367 queuosine_QueD queuo 26.6 87 0.0019 22.3 3.2 24 90-115 16-39 (92)
83 PRK12787 fliX flagellar assemb 26.0 1.8E+02 0.0039 22.8 5.1 46 3-48 76-122 (138)
84 PF03487 IL13: Interleukin-13; 25.9 70 0.0015 19.7 2.1 18 17-34 22-39 (43)
85 PF15590 Imm15: Immunity prote 25.0 67 0.0015 22.2 2.2 27 46-72 11-38 (69)
86 PF02671 PAH: Paired amphipath 23.9 1.8E+02 0.0039 17.7 5.0 43 22-66 2-45 (47)
87 PF10264 Stork_head: Winged he 23.2 2.2E+02 0.0047 20.3 4.6 40 17-56 25-64 (80)
88 COG2000 Predicted Fe-S protein 22.7 1.2E+02 0.0026 25.3 3.6 71 105-179 74-147 (226)
89 PHA02091 hypothetical protein 21.6 85 0.0018 21.3 2.1 15 167-181 43-57 (72)
90 PF14499 DUF4437: Domain of un 21.6 48 0.001 28.6 1.2 99 46-162 132-232 (251)
91 PRK10572 DNA-binding transcrip 21.5 4.9E+02 0.011 21.8 8.3 49 97-165 43-91 (290)
92 TIGR02837 spore_II_R stage II 21.2 4.5E+02 0.0098 21.3 6.9 36 66-109 118-153 (168)
93 PF06972 DUF1296: Protein of u 20.6 2.5E+02 0.0054 18.9 4.1 37 21-60 2-38 (60)
94 cd01786 STE50_RA Ubiquitin-lik 20.5 70 0.0015 23.5 1.7 27 40-66 35-61 (98)
No 1
>KOG4064|consensus
Probab=100.00 E-value=1e-47 Score=299.48 Aligned_cols=183 Identities=48% Similarity=0.907 Sum_probs=172.5
Q ss_pred CchhHHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhhcCccchhhhhccCCccceEEeeecCCCceEEEEEEccCCCccCC
Q psy5204 19 TRIDTLSELIHRVYAAFEGRSSVDVAHVKFLMESYVSNADDWKRYAQFDENRYTRNLVDEGDGKFNLILLCWSQGQASSI 98 (209)
Q Consensus 19 ~~~~~l~~li~~l~~~~~~~~~~~~~~v~~ll~~l~~~~~~w~~~~~~~~~~Y~R~ll~~~~~~~~l~~~~W~pG~~spi 98 (209)
|.--+|-+||.+|+++|.+. -++++++..+|+.+.++|.+|..|+.||..+|+|||+..+|++|+++++||.+|..|.+
T Consensus 9 p~~~sl~dLv~~lh~~F~~~-~vnveeV~~lM~sYkSnp~EWr~yAkFD~y~YTRNLVD~GNGKfNLmILCWGeGhgSSv 87 (196)
T KOG4064|consen 9 PRMISLVDLVVQLHEIFQQK-LVNVEEVMKLMASYKSNPNEWRRYAKFDMYKYTRNLVDVGNGKFNLMILCWGEGHGSSV 87 (196)
T ss_pred chhhhHHHHHHHHHHHHHhc-ccCHHHHHHHHHHhhcCHHHHHHHHhhhHHHHhhhhhhcCCCeEeEEEEEecCCCCccc
Confidence 55567999999999999997 79999999999999999999999999999999999999889999999999999999999
Q ss_pred CCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEEEEcCCCCeEEEecCCCCCceEEEEEeCC
Q psy5204 99 HDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVCYINDSMGLHKVGNSGPHPAAVSLHLYSP 178 (209)
Q Consensus 99 HDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~~~~iH~v~n~~~~~~avSLHvY~p 178 (209)
|||.+++||+++|+|.|+|+.|.|++..+.+ +....+.+++.++|.|++|..|+|||+|+|..++|||||+|.|
T Consensus 88 HDHtdsHCF~KmL~G~L~Et~yawPd~ks~e------~v~isE~~~~~N~vaYiND~lGLHRvEN~SHs~~aVSLHLY~P 161 (196)
T KOG4064|consen 88 HDHTDSHCFVKMLDGELTETKYAWPDRKSHE------PVDISEKTYGMNGVAYINDELGLHRVENLSHSNGAVSLHLYIP 161 (196)
T ss_pred cccccchhHHHHhcCcchhhcccCCCcccCc------cccccceeeeccceEEecccccceeccccccCCCceEEEEecC
Confidence 9999999999999999999999999766543 5556789999999999999999999999999999999999999
Q ss_pred CCCcceEEECCCCceEEeeeeEEeccCeec
Q psy5204 179 PFNICSSFDAQTGVKSRARLTYWSKFGHKV 208 (209)
Q Consensus 179 p~~~~~~yd~~~g~~~~~~~~~~s~~g~~~ 208 (209)
|+..|+.||..||+...+.++|+|+||.+.
T Consensus 162 PfdTC~aFd~rtgkKt~~tvTF~SKyG~rt 191 (196)
T KOG4064|consen 162 PFDTCNAFDERTGKKTQCTVTFYSKYGKRT 191 (196)
T ss_pred CcchhhhhhhccCCcceEEEEEehhcCccc
Confidence 999999999999999999999999999874
No 2
>PF05995 CDO_I: Cysteine dioxygenase type I; InterPro: IPR010300 Cysteine dioxygenase type I (1.13.11.20 from EC) converts cysteine to cysteinesulphinic acid and is the rate-limiting step in sulphate production.; GO: 0005506 iron ion binding, 0017172 cysteine dioxygenase activity, 0046439 L-cysteine metabolic process, 0055114 oxidation-reduction process; PDB: 2IC1_A 3EQE_B 3ELN_A 2B5H_A 2GH2_A 2Q4S_A 2ATF_A 2GM6_A 3USS_B.
Probab=100.00 E-value=2.5e-45 Score=298.52 Aligned_cols=165 Identities=40% Similarity=0.736 Sum_probs=136.5
Q ss_pred cCCchhHHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhhcCccchhhhhccCCccceEEeeecCCCceEEEEEEccCCCcc
Q psy5204 17 RLTRIDTLSELIHRVYAAFEGRSSVDVAHVKFLMESYVSNADDWKRYAQFDENRYTRNLVDEGDGKFNLILLCWSQGQAS 96 (209)
Q Consensus 17 ~~~~~~~l~~li~~l~~~~~~~~~~~~~~v~~ll~~l~~~~~~w~~~~~~~~~~Y~R~ll~~~~~~~~l~~~~W~pG~~s 96 (209)
..+++++|++||++|+++++++ ..++++++.+++.+..++++|.+|+.||+++|+||+|+. ++.|+|++|||+|||.|
T Consensus 11 ~~~~~~~~~~li~~L~~~~~~~-~~~v~~i~~l~~~~~~~~~~w~~~~~~d~~~Y~r~ll~~-~~~~el~ll~W~pGq~S 88 (175)
T PF05995_consen 11 ISPSPSSLADLIADLRQIFDQE-PDDVEEIAALLRSLASDPSDWLKYARFDPDRYTRNLLYR-DERFELWLLCWPPGQRS 88 (175)
T ss_dssp TT-SSSSHHHHHHHHHHHTSSS-SHHHHHHHHHHHHSCSSCHHHGGGC--SSSSSEEEEEEG-GCT-EEEEEEE-TT-B-
T ss_pred cccCCcCHHHHHHHHHHhhccc-ccCHHHHHHHHHHHhcChHHHHHhhcccCCCCeEEEEec-CCCeEEEEEEeCCCCcC
Confidence 5678899999999999999887 778999999999999999999999999999999999986 79999999999999999
Q ss_pred CCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEEEEcCCCCeEEEecCCCCCceEEEEEe
Q psy5204 97 SIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVCYINDSMGLHKVGNSGPHPAAVSLHLY 176 (209)
Q Consensus 97 piHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~~~~iH~v~n~~~~~~avSLHvY 176 (209)
|||||+++||+++||+|+++|++|++++++. .++.......+.+|+++++.++.+||+|+|++.++++||||||
T Consensus 89 ~IHDH~~s~g~~~vl~G~l~e~~y~~~~~~~------~~~~~~~~~~~~~~g~~~~~~~~~iH~v~n~s~~~~avSLHvY 162 (175)
T PF05995_consen 89 PIHDHGGSWGWVKVLSGELEETRYRRPDDGG------APLELVGRERLLPGGVTYIFDPHGIHRVENPSGDEPAVSLHVY 162 (175)
T ss_dssp -EEE-TTSEEEEEEEESEEEEEEEEESTSSS-------EEEECEEEEEETTTEEEEBTTTBEEEEEES-SSS-EEEEEEE
T ss_pred CCCCCCCceEEEEEecceEEEEEeccCCccc------CcccccCceEecCCCeEEecCCCCeEEeccCCCCCCEEEEEEc
Confidence 9999999999999999999999999987633 2366666777778888888777899999999867999999999
Q ss_pred CCCCCcceEEECC
Q psy5204 177 SPPFNICSSFDAQ 189 (209)
Q Consensus 177 ~pp~~~~~~yd~~ 189 (209)
+||+.+|++||++
T Consensus 163 spPl~~~~~y~~~ 175 (175)
T PF05995_consen 163 SPPLEQCRIYDES 175 (175)
T ss_dssp ES--SEEEEE-TT
T ss_pred CCChhhcccccCC
Confidence 9999999999974
No 3
>COG5553 Predicted metal-dependent enzyme of the double-stranded beta helix superfamily [General function prediction only]
Probab=99.92 E-value=2.7e-24 Score=169.49 Aligned_cols=156 Identities=19% Similarity=0.241 Sum_probs=130.3
Q ss_pred hHHHHHHHHHHHHhcCC--CCCCHHHHHHHHHHhhcCccchh--hhhccCCccceEEeeec-CCCceEEEEEEccCCCcc
Q psy5204 22 DTLSELIHRVYAAFEGR--SSVDVAHVKFLMESYVSNADDWK--RYAQFDENRYTRNLVDE-GDGKFNLILLCWSQGQAS 96 (209)
Q Consensus 22 ~~l~~li~~l~~~~~~~--~~~~~~~v~~ll~~l~~~~~~w~--~~~~~~~~~Y~R~ll~~-~~~~~~l~~~~W~pG~~s 96 (209)
.+|..|+.+|..++++. +..-+++++.+|.+|.. ..+|. .+..+||.+|+.+||.. +.+.|+|+.+.|.||+.+
T Consensus 8 ~rl~~f~~e~~diln~~~dEst~l~ev~~ll~~l~~-~~~wl~~~~~~pdp~tgt~~LLh~d~~gfltV~~~t~~PG~~~ 86 (191)
T COG5553 8 SRLSLFYREIMDILNAAADESTPLEEVKALLGDLTN-RFHWLGGLVHEPDPDTGTELLLHADPQGFLTVYHITLSPGVQY 86 (191)
T ss_pred hHHHHHHHHHHHHHhhccCcCcchHHHHHHHHHHhh-hhhhhhhhhcCCCCCccceEEEEEcccccEEEEEEEeCCCccc
Confidence 37889999999999875 34568999999999998 58898 57889999999999985 456799999999999999
Q ss_pred CCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEEEEcCCCCeEEEecCCCCCceEEEEEe
Q psy5204 97 SIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVCYINDSMGLHKVGNSGPHPAAVSLHLY 176 (209)
Q Consensus 97 piHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~~~~iH~v~n~~~~~~avSLHvY 176 (209)
|||||. +||++++|.|.++++.|..+..+.++|.+ +.+...+|+|. +.| ++||++.|.+ .+.+++||||
T Consensus 87 p~HnH~-~wglVgil~G~E~n~~y~~~~~~~~~P~~-------qdk~~apgeV~-lSp-gdihsv~n~~-sdrs~aiHvy 155 (191)
T COG5553 87 PPHNHL-MWGLVGILWGGETNFIYPLAGEEVDEPER-------QDKFAAPGEVH-LSP-GDIHSVANTG-SDRSGAIHVY 155 (191)
T ss_pred CCcccc-hheeeeeeecccccceecccCCCCCCcch-------hhhhcCcceEe-eCC-CCeeeecccC-CCccceEEEE
Confidence 999999 69999999999999999887666554332 34467799987 776 6799999998 4678999999
Q ss_pred CCCCCc--ceEEECC
Q psy5204 177 SPPFNI--CSSFDAQ 189 (209)
Q Consensus 177 ~pp~~~--~~~yd~~ 189 (209)
+..+.+ ..+|...
T Consensus 156 ~a~ig~~~r~~fsi~ 170 (191)
T COG5553 156 LADIGGTDRQLFSIL 170 (191)
T ss_pred ecccCCCcceeeeec
Confidence 998875 5667644
No 4
>PF07847 DUF1637: Protein of unknown function (DUF1637); InterPro: IPR012864 This entry represents cysteamine dioxygenase, which is a non-heme iron protein that is involved in the biosynthesis of taurine. Requires catalytic amounts of a cofactor-like compound, such as sulphur, sulphide, selenium or methylene blue for maximal activity. 3-Aminopropanethiol (homocysteamine) and 2-mercaptoethanol can also act as substrates, but glutathione, cysteine, and cysteine ethyl- and methyl esters are not good substrates [, ]. ; GO: 0047800 cysteamine dioxygenase activity, 0055114 oxidation-reduction process
Probab=99.48 E-value=2.9e-13 Score=112.22 Aligned_cols=114 Identities=24% Similarity=0.428 Sum_probs=86.8
Q ss_pred eEEeeecCCCceEEEEEEccCCCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCC-----CCCccccccceEEeeC
Q psy5204 72 TRNLVDEGDGKFNLILLCWSQGQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEI-----GQDRPLSEISRTELAE 146 (209)
Q Consensus 72 ~R~ll~~~~~~~~l~~~~W~pG~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~-----~~~~~l~~~~~~~~~~ 146 (209)
+-.-||+ ++.|+|-+++.+||...|.|||+++.++.+||.|.++.+.|.|.+...... ...+.+. .....-.+
T Consensus 34 ~y~~iyE-~~~fsi~iF~lp~g~~IPLHDHP~M~v~sKvL~Gs~~v~Syd~~~~~~~~~~~~~~~~~a~~~-~d~~~~a~ 111 (200)
T PF07847_consen 34 TYMHIYE-DEDFSIGIFCLPPGAVIPLHDHPGMTVLSKVLYGSLHVKSYDWVDEPSDSIEGQRQPRLARLV-VDGEMTAP 111 (200)
T ss_pred EEEEEEE-CCCcEEEEEEeCCCCEeCCCCCCchHhhHhhEeeeEEEEEccccccccccccccccceeeEEE-ecceecCC
Confidence 3344688 789999999999999999999999999999999999999999976422110 0001111 11123345
Q ss_pred CcEEEEcCCC-C-eEEEecCCCCCceEEEEEeCCCCC-----cceEEECC
Q psy5204 147 NDVCYINDSM-G-LHKVGNSGPHPAAVSLHLYSPPFN-----ICSSFDAQ 189 (209)
Q Consensus 147 G~v~~~~~~~-~-iH~v~n~~~~~~avSLHvY~pp~~-----~~~~yd~~ 189 (209)
++.+.+.|.. + ||++.+.+ +++..|.|.+||+. .|++|.+.
T Consensus 112 ~~~~vL~P~~ggNiH~f~a~~--~p~AflDIL~PPY~~~~gR~C~YY~~~ 159 (200)
T PF07847_consen 112 SDTCVLYPTSGGNIHEFTALT--GPCAFLDILAPPYDPDDGRDCTYYRPV 159 (200)
T ss_pred CCCeEEccCCCCeeEEEEeCC--CCeEEEEEccCCCCCCCCCCcEEEeec
Confidence 6777777766 4 99999996 79999999999986 49999865
No 5
>KOG4281|consensus
Probab=99.27 E-value=3.6e-11 Score=99.25 Aligned_cols=160 Identities=16% Similarity=0.278 Sum_probs=105.3
Q ss_pred HHHHHHHHHHHHhcCCC----CCCHHHHHHHHHHhhcCccchh--h-hhccCCc--------cceEEeeecCCCceEEEE
Q psy5204 23 TLSELIHRVYAAFEGRS----SVDVAHVKFLMESYVSNADDWK--R-YAQFDEN--------RYTRNLVDEGDGKFNLIL 87 (209)
Q Consensus 23 ~l~~li~~l~~~~~~~~----~~~~~~v~~ll~~l~~~~~~w~--~-~~~~~~~--------~Y~R~ll~~~~~~~~l~~ 87 (209)
-+++|...+++.|...+ ..+++.|+++|.++.- +|.- + ...+.+. ..+=.-|++ +++|+|-+
T Consensus 3 ~vq~l~ntck~~fs~~~pg~~~~nlqkLr~vLd~lkp--~Dvgl~~~~~~~~~~~~~~rn~ppitYlhi~E-cD~FSigi 79 (236)
T KOG4281|consen 3 FVQRLFNTCKEVFSTGGPGPFEENLQKLRSVLDELKP--EDVGLEPEMQYFRPSGVSERNLPPITYLHIHE-CDRFSIGI 79 (236)
T ss_pred hhHHHHHHHHHHhcCCCCCccHHHHHHHHHHHhhCCH--hhcCcChhhhhccCCCCCcCCCCCceEEEEEe-cCceeEEE
Confidence 35778888888887652 4567888888888774 4442 2 2223321 112224677 78999999
Q ss_pred EEccCCCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCC---CCCCcc-ccccceEEee-CCcEEEEcC-CCCeEEE
Q psy5204 88 LCWSQGQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDE---IGQDRP-LSEISRTELA-ENDVCYIND-SMGLHKV 161 (209)
Q Consensus 88 ~~W~pG~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~---~~~~~~-l~~~~~~~~~-~G~v~~~~~-~~~iH~v 161 (209)
+|.+|+...|.|||+++.++.++|-|++....|.|.+..... ++...+ ........+. +-.++.+.| +++-|.+
T Consensus 80 Fclp~ss~IPLHdHPgM~v~sKllyGtmhVksyDw~ePd~~~~~dp~q~~r~akl~~d~~~T~~s~~~~LyP~~ggn~h~ 159 (236)
T KOG4281|consen 80 FCLPPSSVIPLHDHPGMTVLSKLLYGTMHVKSYDWVEPDEPQTEDPHQPVRPAKLVSDKEFTAASPASTLYPKTGGNHHC 159 (236)
T ss_pred EEcCCCCeeecCCCcchHHHHHhhhceeEeeeccccCCCCcccCCCCcceeeeeEeccceecCCCCCcEeeecCCCcEee
Confidence 999999999999999999999999999999999997422211 110011 1111223333 233344444 3455555
Q ss_pred ecCCCCCceEEEEEeCCCCC-----cceEEE
Q psy5204 162 GNSGPHPAAVSLHLYSPPFN-----ICSSFD 187 (209)
Q Consensus 162 ~n~~~~~~avSLHvY~pp~~-----~~~~yd 187 (209)
-.+. .++.-|.|.|||+. .|++|-
T Consensus 160 f~a~--t~cAvlDILsPPY~~~~gR~C~Yyr 188 (236)
T KOG4281|consen 160 FTAI--TPCAVLDILSPPYDSDHGRHCTYYR 188 (236)
T ss_pred eeec--cceeEEeeccCCCCCCCCcCceEEe
Confidence 5554 38999999999986 387775
No 6
>TIGR01479 GMP_PMI mannose-1-phosphate guanylyltransferase/mannose-6-phosphate isomerase. This enzyme is known to be bifunctional, as both mannose-6-phosphate isomerase (EC 5.3.1.8) (PMI) and mannose-1-phosphate guanylyltransferase (EC 2.7.7.22) in Pseudomonas aeruginosa, Xanthomonas campestris, and Gluconacetobacter xylinus. The literature on the enzyme from E. coli attributes mannose-6-phosphate isomerase activity to an adjacent gene, but the present sequence has not been shown to lack the activity. The PMI domain is C-terminal.
Probab=98.35 E-value=1.3e-05 Score=74.64 Aligned_cols=116 Identities=16% Similarity=0.170 Sum_probs=82.8
Q ss_pred HHHHHHHHHHhhcCccchhhhhcc--CCccceEEeeecCCCceEEEEEEccCCCccCCCCCCCceEEEEEEEeEEEEEEe
Q psy5204 43 VAHVKFLMESYVSNADDWKRYAQF--DENRYTRNLVDEGDGKFNLILLCWSQGQASSIHDHAQAHCFMKMLQGCLTEVRY 120 (209)
Q Consensus 43 ~~~v~~ll~~l~~~~~~w~~~~~~--~~~~Y~R~ll~~~~~~~~l~~~~W~pG~~spiHDH~~s~~~~~vl~G~l~e~~y 120 (209)
.++|+++++.+... ....+... .-..++++.+-..++.+.+..+...||+.++.|-|...+-...|++|+++.+.
T Consensus 336 ~q~vk~~v~~l~~~--~~~~~~~~~~~~~pWG~~~~~~~~~~~~~~~~~i~PG~~~~~h~H~~~~E~~~Vl~G~~~v~~- 412 (468)
T TIGR01479 336 VQDVKKIVEQLKAD--GRTETEQHREVYRPWGKYDSIDQGDRYQVKRITVKPGEKLSLQMHHHRAEHWIVVSGTARVTI- 412 (468)
T ss_pred hhhHHHHHHHHHhc--CchhhhcCceEECCCCceEEEecCCCEEEEEEEECCCCccCccccCCCceEEEEEeeEEEEEE-
Confidence 46788888888643 12222211 11224333333236899999999999999887766544555589999999875
Q ss_pred eCCCCCCCCCCCCccccccceEEeeCCcEEEEcCCCCeEEEecCCCCCceEEEEEeCCCC
Q psy5204 121 QEPNHTSDEIGQDRPLSEISRTELAENDVCYINDSMGLHKVGNSGPHPAAVSLHLYSPPF 180 (209)
Q Consensus 121 ~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~~~~iH~v~n~~~~~~avSLHvY~pp~ 180 (209)
++ .+..+++||++++++ ...|++.|.+ ++++.-|-|..|++
T Consensus 413 ----dg-------------~~~~l~~GDsi~ip~-~~~H~~~N~g-~~~~~~i~v~~~~~ 453 (468)
T TIGR01479 413 ----GD-------------ETLLLTENESTYIPL-GVIHRLENPG-KIPLELIEVQSGSY 453 (468)
T ss_pred ----CC-------------EEEEecCCCEEEECC-CCcEEEEcCC-CCCEEEEEEEcCCC
Confidence 11 367899999999976 5699999998 68999999998874
No 7
>PF07883 Cupin_2: Cupin domain; InterPro: IPR013096 This family represents the conserved barrel domain of the cupin superfamily [] (cupa is the Latin term for a small barrel). ; PDB: 2OPK_C 3BU7_B 2PHD_D 3NVC_A 3NKT_A 3NJZ_A 3NW4_A 3NST_A 3NL1_A 2H0V_A ....
Probab=98.34 E-value=2.8e-06 Score=57.80 Aligned_cols=71 Identities=30% Similarity=0.315 Sum_probs=59.7
Q ss_pred EEEEccCCCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEEEEcCCCCeEEEecCC
Q psy5204 86 ILLCWSQGQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVCYINDSMGLHKVGNSG 165 (209)
Q Consensus 86 ~~~~W~pG~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~~~~iH~v~n~~ 165 (209)
+++.++||..+|.|-|....-++.|++|+++... ++ ....+++|+..++.+ ...|++.|.+
T Consensus 1 ~~~~~~pG~~~~~h~H~~~~e~~~vl~G~~~~~~-----~~-------------~~~~l~~Gd~~~i~~-~~~H~~~n~~ 61 (71)
T PF07883_consen 1 GLVTLPPGGSIPPHRHPGEDEFFYVLSGEGTLTV-----DG-------------ERVELKPGDAIYIPP-GVPHQVRNPG 61 (71)
T ss_dssp EEEEEETTEEEEEEEESSEEEEEEEEESEEEEEE-----TT-------------EEEEEETTEEEEEET-TSEEEEEEES
T ss_pred CEEEECCCCCCCCEECCCCCEEEEEEECCEEEEE-----cc-------------EEeEccCCEEEEECC-CCeEEEEECC
Confidence 4678999999999999975488999999998773 11 367899999999886 5799999998
Q ss_pred CCCceEEEEEe
Q psy5204 166 PHPAAVSLHLY 176 (209)
Q Consensus 166 ~~~~avSLHvY 176 (209)
+++++-|-||
T Consensus 62 -~~~~~~l~V~ 71 (71)
T PF07883_consen 62 -DEPARFLVVY 71 (71)
T ss_dssp -SSEEEEEEEE
T ss_pred -CCCEEEEEEC
Confidence 5899988887
No 8
>PRK13290 ectC L-ectoine synthase; Reviewed
Probab=98.27 E-value=2.5e-05 Score=60.28 Aligned_cols=97 Identities=10% Similarity=0.099 Sum_probs=76.0
Q ss_pred cceEEeeec-CCCceEEEEEEccCCCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCc
Q psy5204 70 RYTRNLVDE-GDGKFNLILLCWSQGQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAEND 148 (209)
Q Consensus 70 ~Y~R~ll~~-~~~~~~l~~~~W~pG~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~ 148 (209)
.-+|.++.. ++..|++..++..||+.++.|-|.. +-++.||+|+++.... +++ .+..+.+||
T Consensus 21 ~~~krll~~~~~~~~~~~~~~l~pG~~~~~h~h~~-~E~~yVL~G~~~~~~i---~~g-------------~~~~L~aGD 83 (125)
T PRK13290 21 WTSRRLLLKDDGMGFSFHETTIYAGTETHLHYKNH-LEAVYCIEGEGEVEDL---ATG-------------EVHPIRPGT 83 (125)
T ss_pred ceEEEEEEecCCCCEEEEEEEECCCCcccceeCCC-EEEEEEEeCEEEEEEc---CCC-------------EEEEeCCCe
Confidence 357777764 4578899999999999999999875 4589999999997621 011 368899999
Q ss_pred EEEEcCCCCeEEEecCCCCCceEEEEEeCCCCCcceEEE
Q psy5204 149 VCYINDSMGLHKVGNSGPHPAAVSLHLYSPPFNICSSFD 187 (209)
Q Consensus 149 v~~~~~~~~iH~v~n~~~~~~avSLHvY~pp~~~~~~yd 187 (209)
++++++ ..-|++.|. +++.-|=+.+||..+...-|
T Consensus 84 ~i~~~~-~~~H~~~N~---e~~~~l~v~tP~~~~~~~~~ 118 (125)
T PRK13290 84 MYALDK-HDRHYLRAG---EDMRLVCVFNPPLTGREVHD 118 (125)
T ss_pred EEEECC-CCcEEEEcC---CCEEEEEEECCCCCCccccC
Confidence 999876 579999997 57888889999987765444
No 9
>TIGR03404 bicupin_oxalic bicupin, oxalate decarboxylase family. Members of this protein family are defined as bicupins as they have two copies of the cupin domain (pfam00190). Two different known activities for members of this family are oxalate decarboxylase (EC 4.1.1.2) and oxalate oxidase (EC 1.2.3.4), although the latter activity has more often been found in distantly related monocupin (germin) proteins.
Probab=98.19 E-value=2.4e-05 Score=70.68 Aligned_cols=86 Identities=17% Similarity=0.228 Sum_probs=71.9
Q ss_pred CCceEEEEEEccCCCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEEEEcCCCCeE
Q psy5204 80 DGKFNLILLCWSQGQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVCYINDSMGLH 159 (209)
Q Consensus 80 ~~~~~l~~~~W~pG~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~~~~iH 159 (209)
...+++..+...||...+.|-|.++.=|+.|++|+.+.+....... ..+..+++||+.++.. ...|
T Consensus 242 ~~~~s~~~~~l~PG~~~~~H~H~~~~E~~yvl~G~~~~~v~d~~g~-------------~~~~~l~~GD~~~iP~-g~~H 307 (367)
T TIGR03404 242 SKTIAAAIVTVEPGAMRELHWHPNADEWQYFIQGQARMTVFAAGGN-------------ARTFDYQAGDVGYVPR-NMGH 307 (367)
T ss_pred cceEEEEEEEECCCCccCCeeCcCCCeEEEEEEEEEEEEEEecCCc-------------EEEEEECCCCEEEECC-CCeE
Confidence 3568899999999999999999998889999999999887643211 1357799999988875 5699
Q ss_pred EEecCCCCCceEEEEEeCCCC
Q psy5204 160 KVGNSGPHPAAVSLHLYSPPF 180 (209)
Q Consensus 160 ~v~n~~~~~~avSLHvY~pp~ 180 (209)
.+.|.+ ++++.-|-+|..+.
T Consensus 308 ~i~N~G-~e~l~fL~if~s~~ 327 (367)
T TIGR03404 308 YVENTG-DETLVFLEVFKADR 327 (367)
T ss_pred EEEECC-CCCEEEEEEECCCC
Confidence 999998 68999999998764
No 10
>COG1917 Uncharacterized conserved protein, contains double-stranded beta-helix domain [Function unknown]
Probab=98.09 E-value=4.5e-05 Score=58.59 Aligned_cols=80 Identities=19% Similarity=0.219 Sum_probs=64.0
Q ss_pred CCCceEEEEEEccCCCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEEEEcCCCCe
Q psy5204 79 GDGKFNLILLCWSQGQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVCYINDSMGL 158 (209)
Q Consensus 79 ~~~~~~l~~~~W~pG~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~~~~i 158 (209)
....+.+..++..||+.++.|.|+...-.+.||+|+++.+.- + ....+.+||+.++.+ +..
T Consensus 39 ~~~~~~~~~v~~~~G~~~~~H~hp~~~~~~~Vl~G~~~~~~~-----g-------------~~~~l~~Gd~i~ip~-g~~ 99 (131)
T COG1917 39 EGENLSVVLVTFEPGAVIPWHTHPLGEQTIYVLEGEGTVQLE-----G-------------EKKELKAGDVIIIPP-GVV 99 (131)
T ss_pred CCceEEEEEEEECCCcccccccCCCcceEEEEEecEEEEEec-----C-------------CceEecCCCEEEECC-CCe
Confidence 468888999999999999999998334678999999998751 1 367799999999876 579
Q ss_pred EEEecCCCCCceEEEEEeCC
Q psy5204 159 HKVGNSGPHPAAVSLHLYSP 178 (209)
Q Consensus 159 H~v~n~~~~~~avSLHvY~p 178 (209)
|.+.|.+ +.+.+-|-+..+
T Consensus 100 H~~~a~~-~~~~~~l~v~~~ 118 (131)
T COG1917 100 HGLKAVE-DEPMVLLLVFPL 118 (131)
T ss_pred eeeccCC-CCceeEEEEeee
Confidence 9999987 454566666644
No 11
>PRK15460 cpsB mannose-1-phosphate guanyltransferase; Provisional
Probab=98.06 E-value=0.00012 Score=68.25 Aligned_cols=115 Identities=15% Similarity=0.163 Sum_probs=81.9
Q ss_pred HHHHHHHHHHhhcCccchhhhh----ccCCccceEEeeecCCCceEEEEEEccCCCccCCCCCCCceEEEEEEEeEEEEE
Q psy5204 43 VAHVKFLMESYVSNADDWKRYA----QFDENRYTRNLVDEGDGKFNLILLCWSQGQASSIHDHAQAHCFMKMLQGCLTEV 118 (209)
Q Consensus 43 ~~~v~~ll~~l~~~~~~w~~~~----~~~~~~Y~R~ll~~~~~~~~l~~~~W~pG~~spiHDH~~s~~~~~vl~G~l~e~ 118 (209)
.++|+.+++.+... ...++. .+.| ++++.....+.+|++..+.-.||..++.|-|....=+..|++|+++.+
T Consensus 345 ~q~vk~~v~~l~~~--~~~~~~~~~~~~rp--WG~~~~l~~g~~~~v~~i~v~PG~~~~~~~H~~~~E~~~VlsG~~~v~ 420 (478)
T PRK15460 345 VQDVKKVVEQIKAD--GRHEHRVHREVYRP--WGKYDSIDAGDRYQVKRITVKPGEGLSVQMHHHRAEHWVVVAGTAKVT 420 (478)
T ss_pred hhhHHHHHHHHHhc--CCchhhccccEECC--CCceEeecCCCcEEEEEEEECCCCcCCcCCCCCCceEEEEEeeEEEEE
Confidence 46788888888643 111111 1233 333333333689999999999999875555544444566999999987
Q ss_pred EeeCCCCCCCCCCCCccccccceEEeeCCcEEEEcCCCCeEEEecCCCCCceEEEEEeCCCCC
Q psy5204 119 RYQEPNHTSDEIGQDRPLSEISRTELAENDVCYINDSMGLHKVGNSGPHPAAVSLHLYSPPFN 181 (209)
Q Consensus 119 ~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~~~~iH~v~n~~~~~~avSLHvY~pp~~ 181 (209)
. ++ .+..+.+||.++++. ...|+++|.+ ++++..|=|-.|++.
T Consensus 421 i-----dg-------------~~~~L~~GDSi~ip~-g~~H~~~N~g-~~~l~iI~V~~g~yl 463 (478)
T PRK15460 421 I-----DG-------------DIKLLGENESIYIPL-GATHCLENPG-KIPLDLIEVRSGSYL 463 (478)
T ss_pred E-----CC-------------EEEEecCCCEEEECC-CCcEEEEcCC-CCCEEEEEEEcCCCC
Confidence 6 11 367899999999976 5699999998 689999999988864
No 12
>PRK09943 DNA-binding transcriptional repressor PuuR; Provisional
Probab=98.05 E-value=0.0006 Score=55.55 Aligned_cols=120 Identities=13% Similarity=0.171 Sum_probs=83.9
Q ss_pred CCCCHHHHHHHHHHhhcCccchhhhhc----------------cCCccceEEeeec--CCCceEEEEEEccCCCccC-CC
Q psy5204 39 SSVDVAHVKFLMESYVSNADDWKRYAQ----------------FDENRYTRNLVDE--GDGKFNLILLCWSQGQASS-IH 99 (209)
Q Consensus 39 ~~~~~~~v~~ll~~l~~~~~~w~~~~~----------------~~~~~Y~R~ll~~--~~~~~~l~~~~W~pG~~sp-iH 99 (209)
..++++.+..+.+.|..++++|..-.. .....+...+|.. +...+++++....||..++ .|
T Consensus 45 ~~p~~~~l~~ia~~l~v~~~~l~~~~~~~~~~~~v~~~~e~~~~~~~g~~~~~l~~~~~~~~~~~~~~~~~pg~~~~~~~ 124 (185)
T PRK09943 45 VSPAISTLQKLLKVYGLSLSEFFSEPEKPDEPQVVINQDDLIEMGSQGVSMKLVHNGNPNRTLAMIFETYQPGTTTGERI 124 (185)
T ss_pred CCCCHHHHHHHHHHhCCCHHHHccCccccCCcceEEchhHhhccccCCceEEEeccCCCCCeeEEEEEEccCCCCccccc
Confidence 356778888888877777665542100 0011243344442 2356778888889999764 66
Q ss_pred CCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEEEEcCCCCeEEEecCCCCCceEEEEEeCCC
Q psy5204 100 DHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVCYINDSMGLHKVGNSGPHPAAVSLHLYSPP 179 (209)
Q Consensus 100 DH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~~~~iH~v~n~~~~~~avSLHvY~pp 179 (209)
.|.+ .=++.|++|+++... ++ ....+.+||..+++. .-.|+..|.+ ++++..|-+..|+
T Consensus 125 ~h~~-~E~~~Vl~G~~~~~~-----~~-------------~~~~l~~Gd~~~~~~-~~~H~~~n~~-~~~~~~l~~~~p~ 183 (185)
T PRK09943 125 KHQG-EEIGTVLEGEIVLTI-----NG-------------QDYHLVAGQSYAINT-GIPHSFSNTS-AGICRIISAHTPT 183 (185)
T ss_pred ccCC-cEEEEEEEeEEEEEE-----CC-------------EEEEecCCCEEEEcC-CCCeeeeCCC-CCCeEEEEEeCCC
Confidence 6775 678899999999765 11 367799999999986 5699999997 6889888888776
No 13
>COG0662 {ManC} Mannose-6-phosphate isomerase [Carbohydrate transport and metabolism]
Probab=97.94 E-value=0.00025 Score=54.61 Aligned_cols=83 Identities=22% Similarity=0.325 Sum_probs=70.8
Q ss_pred CCceEEEEEEccCCCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEEEEcCCCCeE
Q psy5204 80 DGKFNLILLCWSQGQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVCYINDSMGLH 159 (209)
Q Consensus 80 ~~~~~l~~~~W~pG~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~~~~iH 159 (209)
++.+++..+.-.||...+.|-|....=+..|++|+...+. ++ ....+++||++++++ +..|
T Consensus 33 ~~~~~~~~~~v~pg~~~~~~~H~~~dE~~~Vl~G~g~v~~-----~~-------------~~~~v~~gd~~~iP~-g~~H 93 (127)
T COG0662 33 GDRYSIARILVKPGEEISLHHHHHRDEHWYVLEGTGKVTI-----GG-------------EEVEVKAGDSVYIPA-GTPH 93 (127)
T ss_pred CCcEEEEEEEECCCcccCcccccCcceEEEEEeeEEEEEE-----CC-------------EEEEecCCCEEEECC-CCcE
Confidence 5799999999999999888887776778899999999876 11 367899999999876 6799
Q ss_pred EEecCCCCCceEEEEEeCCCCCc
Q psy5204 160 KVGNSGPHPAAVSLHLYSPPFNI 182 (209)
Q Consensus 160 ~v~n~~~~~~avSLHvY~pp~~~ 182 (209)
++.|++ ..+..-|.+++|+...
T Consensus 94 ~~~N~G-~~~L~liei~~p~~~~ 115 (127)
T COG0662 94 RVRNTG-KIPLVLIEVQSPPYLG 115 (127)
T ss_pred EEEcCC-CcceEEEEEecCCcCC
Confidence 999998 5899999999999763
No 14
>PF02041 Auxin_BP: Auxin binding protein; InterPro: IPR000526 Auxin binding protein is located in the lumen of the endoplasmic reticulum (ER). The primary structure contains an N-terminal hydrophobic leader sequence of 30-40 amino acids, which could represent a signal for translocation of the protein to the ER [, ]. The mature protein comprises around 165 residues, and contains a number of potential N-glycosylation sites. In vitro transport studies have demonstrated co-translational glycosylation []. Retention within the lumen of the ER correlates with an additional signal located at the C terminus, represented by the sequence Lys-Asp-Glu-Leu, known to be responsible for preventing secretion of proteins from the lumen of the ER in eukaryotic cells [, ].; GO: 0004872 receptor activity, 0005788 endoplasmic reticulum lumen; PDB: 1LR5_D 1LRH_D.
Probab=97.83 E-value=0.00022 Score=56.43 Aligned_cols=99 Identities=21% Similarity=0.204 Sum_probs=58.7
Q ss_pred CCCceEEEEEEccCCCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEEEEcCCCCe
Q psy5204 79 GDGKFNLILLCWSQGQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVCYINDSMGL 158 (209)
Q Consensus 79 ~~~~~~l~~~~W~pG~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~~~~i 158 (209)
+-..++||+-...||+.||||-|.. .-++.||+|.=+-..+.. .... +| ...+..+.+|+...+.. ++.
T Consensus 40 GmkevEVwlQTfAPG~~TPiHRHsC-EEVFvVLkG~GTl~l~~~-~~~~--pG------~pqef~~~pnSTf~IPv-n~~ 108 (167)
T PF02041_consen 40 GMKEVEVWLQTFAPGSATPIHRHSC-EEVFVVLKGSGTLYLASS-HEKY--PG------KPQEFPIFPNSTFHIPV-NDA 108 (167)
T ss_dssp --SSEEEEEEEE-TT-B--EEEESS--EEEEEEE--EEEEE--S-SSSS----------S-EEEEE-TTEEEEE-T-T--
T ss_pred CceeeeEEeeeecCCCCCCCccccc-cEEEEEEecceEEEEecc-cccC--CC------CceEEEecCCCeEEeCC-CCc
Confidence 4578999999999999999999995 678899999877666522 1111 11 12467788999888875 689
Q ss_pred EEEecCCCCCceEEEEEeCCCCCcceEEEC
Q psy5204 159 HKVGNSGPHPAAVSLHLYSPPFNICSSFDA 188 (209)
Q Consensus 159 H~v~n~~~~~~avSLHvY~pp~~~~~~yd~ 188 (209)
|++-|.+..+..--|=+-+.|.-+..+||-
T Consensus 109 HQv~NT~e~eDlqvlViiSrpPvkvf~y~d 138 (167)
T PF02041_consen 109 HQVWNTNEHEDLQVLVIISRPPVKVFIYDD 138 (167)
T ss_dssp EEEE---SSS-EEEEEEEESSS--EEEESS
T ss_pred ceeecCCCCcceEEEEEecCCCeEEEEecc
Confidence 999999844677777788888888888884
No 15
>COG2140 Thermophilic glucose-6-phosphate isomerase and related metalloenzymes [Carbohydrate transport and metabolism / General function prediction only]
Probab=97.76 E-value=0.00026 Score=59.03 Aligned_cols=79 Identities=19% Similarity=0.188 Sum_probs=65.9
Q ss_pred EEccCCCccCCCCCCCceE--EEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEEEEcCCCCeEEEecCC
Q psy5204 88 LCWSQGQASSIHDHAQAHC--FMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVCYINDSMGLHKVGNSG 165 (209)
Q Consensus 88 ~~W~pG~~spiHDH~~s~~--~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~~~~iH~v~n~~ 165 (209)
....||+....|.|+++.- |..|++|+-+-.++.-. +. ..+..+++|+++++++ .-.|.+.|.+
T Consensus 85 ~~~t~G~~~~~H~Hp~ade~E~y~vi~G~g~m~v~~~~--G~-----------~~v~~~~~Gd~iyVPp-~~gH~t~N~G 150 (209)
T COG2140 85 VFKTPGAMRELHYHPNADEPEIYYVLKGEGRMLVQKPE--GE-----------ARVIAVRAGDVIYVPP-GYGHYTINTG 150 (209)
T ss_pred EEecCCcccccccCCCCCcccEEEEEeccEEEEEEcCC--Cc-----------EEEEEecCCcEEEeCC-CcceEeecCC
Confidence 3578888899999999988 99999999998887432 21 2467789999999987 5699999998
Q ss_pred CCCceEEEEEeCCCCC
Q psy5204 166 PHPAAVSLHLYSPPFN 181 (209)
Q Consensus 166 ~~~~avSLHvY~pp~~ 181 (209)
++|.+.+-+|.++-.
T Consensus 151 -d~pLvf~~v~~~~~~ 165 (209)
T COG2140 151 -DEPLVFLNVYPADAG 165 (209)
T ss_pred -CCCEEEEEEEeCCCC
Confidence 799999999988854
No 16
>PF01050 MannoseP_isomer: Mannose-6-phosphate isomerase; InterPro: IPR001538 Mannose-6-phosphate isomerase or phosphomannose isomerase (5.3.1.8 from EC) (PMI) is the enzyme that catalyses the interconversion of mannose-6-phosphate and fructose-6-phosphate. In eukaryotes PMI is involved in the synthesis of GDP-mannose, a constituent of N- and O-linked glycans and GPI anchors and in prokaryotes it participates in a variety of pathways, including capsular polysaccharide biosynthesis and D-mannose metabolism. PMI's belong to the cupin superfamily whose functions range from isomerase and epimerase activities involved in the modification of cell wall carbohydrates in bacteria and plants, to non-enzymatic storage proteins in plant seeds, and transcription factors linked to congenital baldness in mammals []. Three classes of PMI have been defined []. The type II phosphomannose isomerases are bifunctional enzymes 5.3.1.8 from EC. This entry covers the isomerase region of the protein []. The guanosine diphospho-D-mannose pyrophosphorylase region is described in another InterPro entry (see IPR005836 from INTERPRO).; GO: 0016779 nucleotidyltransferase activity, 0005976 polysaccharide metabolic process
Probab=97.75 E-value=0.0011 Score=52.78 Aligned_cols=117 Identities=14% Similarity=0.179 Sum_probs=82.4
Q ss_pred CHHHHHHHHHHhhcCccchh-hhhc-cCCccceEEeeecCCCceEEEEEEccCCCccCCCCCCCceEEEEEEEeEEEEEE
Q psy5204 42 DVAHVKFLMESYVSNADDWK-RYAQ-FDENRYTRNLVDEGDGKFNLILLCWSQGQASSIHDHAQAHCFMKMLQGCLTEVR 119 (209)
Q Consensus 42 ~~~~v~~ll~~l~~~~~~w~-~~~~-~~~~~Y~R~ll~~~~~~~~l~~~~W~pG~~spiHDH~~s~~~~~vl~G~l~e~~ 119 (209)
..+.|+.+++.|...+..-. .+.. +.| ++.+.+...++.|.+-.+.-.||+..+.|-|..-.-.-.|++|+...+.
T Consensus 22 ~~q~vK~~v~~lk~~~~~E~~~~~~~~rp--WG~~~~l~~~~~~~vkri~V~pG~~lSlq~H~~R~E~W~Vv~G~a~v~~ 99 (151)
T PF01050_consen 22 RSQDVKEVVEQLKQKGRYEAKEHRRVYRP--WGSYEVLDEGEGYKVKRITVNPGKRLSLQYHHHRSEHWTVVSGTAEVTL 99 (151)
T ss_pred HhhhhHHHHHhhhcccccccccceeEecC--CcEEEEEEccCCEEEEEEEEcCCCccceeeecccccEEEEEeCeEEEEE
Confidence 35678888888876422111 1111 233 3333332226789999999999999999888755445578999999886
Q ss_pred eeCCCCCCCCCCCCccccccceEEeeCCcEEEEcCCCCeEEEecCCCCCceEEEEEeCCCC
Q psy5204 120 YQEPNHTSDEIGQDRPLSEISRTELAENDVCYINDSMGLHKVGNSGPHPAAVSLHLYSPPF 180 (209)
Q Consensus 120 y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~~~~iH~v~n~~~~~~avSLHvY~pp~ 180 (209)
++ ....+.+|++++++. +..|++.|++ +.+..-|.|=.+++
T Consensus 100 -----~~-------------~~~~~~~g~sv~Ip~-g~~H~i~n~g-~~~L~~IEVq~G~~ 140 (151)
T PF01050_consen 100 -----DD-------------EEFTLKEGDSVYIPR-GAKHRIENPG-KTPLEIIEVQTGEY 140 (151)
T ss_pred -----CC-------------EEEEEcCCCEEEECC-CCEEEEECCC-CcCcEEEEEecCCC
Confidence 11 367799999999875 5699999998 57888888866654
No 17
>TIGR02451 anti_sig_ChrR anti-sigma factor, putative, ChrR family. The member of this family from Rhodobacter sphaeroides has been shown both to form a complex with sigma(E) and to negatively regulate tetrapyrrole biosynthesis. This protein likely contains (at least) two distinct functional domains; several smaller homologs (excluded by the model) show homology only to the C-terminal, including a motif PxHxHxGxE.
Probab=97.69 E-value=0.00046 Score=57.99 Aligned_cols=86 Identities=17% Similarity=0.223 Sum_probs=65.5
Q ss_pred ceEEeeecCCCceEEEEEEccCCCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEE
Q psy5204 71 YTRNLVDEGDGKFNLILLCWSQGQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVC 150 (209)
Q Consensus 71 Y~R~ll~~~~~~~~l~~~~W~pG~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~ 150 (209)
..+..|+.+++.-.+.++.|.||+..|.|.|.|.. +.-||+|.+. + + ...+.+|+..
T Consensus 115 v~~~~L~~~~~~~~v~Ll~i~pG~~~p~H~H~G~E-~tlVLeG~f~------d-e---------------~g~y~~Gd~i 171 (215)
T TIGR02451 115 VSRVTLPIDDGNARVRLLYIEAGQSIPQHTHKGFE-LTLVLHGAFS------D-E---------------TGVYGVGDFE 171 (215)
T ss_pred eEEEeccCCCCCcEEEEEEECCCCccCCCcCCCcE-EEEEEEEEEE------c-C---------------CCccCCCeEE
Confidence 34555654234678899999999999999999865 6789999974 1 1 2347899987
Q ss_pred EEcCCCCeEEEecCCCCCceEEEEEeCCCCC
Q psy5204 151 YINDSMGLHKVGNSGPHPAAVSLHLYSPPFN 181 (209)
Q Consensus 151 ~~~~~~~iH~v~n~~~~~~avSLHvY~pp~~ 181 (209)
..++ ...|+..+.+ ++.+|-|=|.-.|+.
T Consensus 172 ~~p~-~~~H~p~a~~-~~~Cicl~v~dapl~ 200 (215)
T TIGR02451 172 EADG-SVQHQPRTVS-GGDCLCLAVLDAPLR 200 (215)
T ss_pred ECCC-CCCcCcccCC-CCCeEEEEEecCCcc
Confidence 7765 5689999886 578999888877764
No 18
>smart00835 Cupin_1 Cupin. This family represents the conserved barrel domain of the 'cupin' superfamily ('cupa' is the Latin term for a small barrel). This family contains 11S and 7S plant seed storage proteins, and germins. Plant seed storage proteins provide the major nitrogen source for the developing plant.
Probab=97.66 E-value=0.0008 Score=52.71 Aligned_cols=80 Identities=16% Similarity=0.088 Sum_probs=62.8
Q ss_pred CCceEEEEEEccCCCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEEEEcCCCCeE
Q psy5204 80 DGKFNLILLCWSQGQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVCYINDSMGLH 159 (209)
Q Consensus 80 ~~~~~l~~~~W~pG~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~~~~iH 159 (209)
...+.+..+...||..++.|-|.++.-++.|++|+.....-.. ++. ......+.+||+.++++ ...|
T Consensus 27 ~~~~~~~~~~i~pg~~~~~h~H~~~~e~~~Vl~G~~~~~~~~~--~~~----------~~~~~~l~~GD~~~ip~-g~~H 93 (146)
T smart00835 27 GLGISAARVNLEPGGMLPPHYHPRATELLYVVRGEGRVGVVDP--NGN----------KVYDARLREGDVFVVPQ-GHPH 93 (146)
T ss_pred cCceEEEEEEecCCcCcCCeeCCCCCEEEEEEeCeEEEEEEeC--CCC----------eEEEEEecCCCEEEECC-CCEE
Confidence 3579999999999999999999766778999999998875322 111 11367799999999876 5699
Q ss_pred EEecCCCCCceEEE
Q psy5204 160 KVGNSGPHPAAVSL 173 (209)
Q Consensus 160 ~v~n~~~~~~avSL 173 (209)
.+.|.+ +++++-+
T Consensus 94 ~~~n~~-~~~~~~l 106 (146)
T smart00835 94 FQVNSG-DENLEFV 106 (146)
T ss_pred EEEcCC-CCCEEEE
Confidence 999997 5777666
No 19
>PRK11171 hypothetical protein; Provisional
Probab=97.54 E-value=0.0025 Score=55.15 Aligned_cols=87 Identities=16% Similarity=0.162 Sum_probs=65.3
Q ss_pred ccceEEeeec--CCCceEEEEEEccCCCccCCCCCC-CceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEee
Q psy5204 69 NRYTRNLVDE--GDGKFNLILLCWSQGQASSIHDHA-QAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELA 145 (209)
Q Consensus 69 ~~Y~R~ll~~--~~~~~~l~~~~W~pG~~spiHDH~-~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~ 145 (209)
..+.+..|.. ....|+++.+...||..+..|-|. ++.-++.|++|+++... ++ .+..+.
T Consensus 45 ~~~~~~~L~~~~~~~~~~~~~~~l~PG~~~~~~~h~~~~eE~~~VlsG~l~v~~-----~g-------------~~~~L~ 106 (266)
T PRK11171 45 ENTRAWVLARPGLGATFSQYLVEVEPGGGSDQPEPDEGAETFLFVVEGEITLTL-----EG-------------KTHALS 106 (266)
T ss_pred CCeEEEEEeCCCCCCcEEEEEEEECCCCcCCCCCCCCCceEEEEEEeCEEEEEE-----CC-------------EEEEEC
Confidence 3455555553 357899999999999987766654 45677899999999874 11 367899
Q ss_pred CCcEEEEcCCCCeEEEecCCCCCceEEEEE
Q psy5204 146 ENDVCYINDSMGLHKVGNSGPHPAAVSLHL 175 (209)
Q Consensus 146 ~G~v~~~~~~~~iH~v~n~~~~~~avSLHv 175 (209)
+||.+++.+ ..-|++.|.+ ++++.-|=|
T Consensus 107 ~GDsi~~p~-~~~H~~~N~g-~~~a~~l~v 134 (266)
T PRK11171 107 EGGYAYLPP-GSDWTLRNAG-AEDARFHWI 134 (266)
T ss_pred CCCEEEECC-CCCEEEEECC-CCCEEEEEE
Confidence 999999976 5699999998 567655543
No 20
>PRK11171 hypothetical protein; Provisional
Probab=97.42 E-value=0.0026 Score=55.09 Aligned_cols=103 Identities=15% Similarity=0.187 Sum_probs=70.5
Q ss_pred hhcCccchhhhhccC-CccceEE-eeec--CCCceEEEEEEccCCCccCCC-CCCCceEEEEEEEeEEEEEEeeCCCCCC
Q psy5204 53 YVSNADDWKRYAQFD-ENRYTRN-LVDE--GDGKFNLILLCWSQGQASSIH-DHAQAHCFMKMLQGCLTEVRYQEPNHTS 127 (209)
Q Consensus 53 l~~~~~~w~~~~~~~-~~~Y~R~-ll~~--~~~~~~l~~~~W~pG~~spiH-DH~~s~~~~~vl~G~l~e~~y~~~~~~~ 127 (209)
+..+-.+|....... .+...|. ++.. ....++++.+..+||..++.| .|+ ..=.+.||+|+++... ++
T Consensus 150 ~~~~~~d~~~~~~~g~~g~~~~~~~~~p~~~~~~~~~~~~~l~PG~~~~~~~~~~-~ee~i~Vl~G~~~~~~-----~~- 222 (266)
T PRK11171 150 FVGNESDIEPIPMPGTDGVWATTRLVDPEDLRFDMHVNIVTFEPGASIPFVETHV-MEHGLYVLEGKGVYRL-----NN- 222 (266)
T ss_pred EecchhcccccccCCCCCeEEEEEeeCchhcCCCcEEEEEEECCCCEEccCcCCC-ceEEEEEEeCEEEEEE-----CC-
Confidence 344455665443322 2335555 4432 234579999999999999995 555 4556899999999764 11
Q ss_pred CCCCCCccccccceEEeeCCcEEEEcCCCCeEEEecCCCCCceEEEEEeC
Q psy5204 128 DEIGQDRPLSEISRTELAENDVCYINDSMGLHKVGNSGPHPAAVSLHLYS 177 (209)
Q Consensus 128 ~~~~~~~~l~~~~~~~~~~G~v~~~~~~~~iH~v~n~~~~~~avSLHvY~ 177 (209)
....+.+||+++|.+ ..-|.+.|++ +++ ..+-+|=
T Consensus 223 ------------~~~~l~~GD~i~~~~-~~~h~~~N~g-~~~-~~yl~~k 257 (266)
T PRK11171 223 ------------DWVEVEAGDFIWMRA-YCPQACYAGG-PGP-FRYLLYK 257 (266)
T ss_pred ------------EEEEeCCCCEEEECC-CCCEEEECCC-CCc-EEEEEEc
Confidence 367799999999986 5799999998 455 4455553
No 21
>PRK04190 glucose-6-phosphate isomerase; Provisional
Probab=97.38 E-value=0.0079 Score=49.74 Aligned_cols=114 Identities=18% Similarity=0.197 Sum_probs=77.3
Q ss_pred hhcCccchhhhhc-cCCccceEEeee--cCCCceEEEEEEccCCCc------cCCCCCCCc--eEEEEEEEeEEEEEEee
Q psy5204 53 YVSNADDWKRYAQ-FDENRYTRNLVD--EGDGKFNLILLCWSQGQA------SSIHDHAQA--HCFMKMLQGCLTEVRYQ 121 (209)
Q Consensus 53 l~~~~~~w~~~~~-~~~~~Y~R~ll~--~~~~~~~l~~~~W~pG~~------spiHDH~~s--~~~~~vl~G~l~e~~y~ 121 (209)
+..+++.|.+... .|+--|.-+... ...+.+.+-+....||+. |+.|-|... .=++.||+|+-....=
T Consensus 35 ~~~d~~~~~~~~~~~d~~~Y~v~~~~~~~~~~~L~~g~t~l~PG~~g~e~~mt~gH~H~~~~~~EiyyvlsG~g~~~l~- 113 (191)
T PRK04190 35 LFADEEAFRAMPATEDTVVYEVYAIEPEETEGDLNFGTTRLYPGKVGDEYFMTKGHFHAKADRAEIYYGLKGKGLMLLQ- 113 (191)
T ss_pred HHcCccccccccccCCceEEEEEEecCCCcCCceEEEEEEECCCcEecccccCCCeEcCCCCCCEEEEEEeCEEEEEEe-
Confidence 3344554544332 333335443332 134689999999999997 888999632 3678899999776541
Q ss_pred CCCCCCCCCCCCccccccceEEeeCCcEEEEcCCCCeEEEecCCCCCceEEEEEeCCCCC
Q psy5204 122 EPNHTSDEIGQDRPLSEISRTELAENDVCYINDSMGLHKVGNSGPHPAAVSLHLYSPPFN 181 (209)
Q Consensus 122 ~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~~~~iH~v~n~~~~~~avSLHvY~pp~~ 181 (209)
+.++. .....+.+|+++++++ ...|++.|.+ +++.+-|=+|.....
T Consensus 114 -~~~G~-----------~~~~~v~pGd~v~IPp-g~~H~~iN~G-~epl~fl~v~p~~~~ 159 (191)
T PRK04190 114 -DPEGE-----------ARWIEMEPGTVVYVPP-YWAHRSVNTG-DEPLVFLACYPADAG 159 (191)
T ss_pred -cCCCc-----------EEEEEECCCCEEEECC-CCcEEeEECC-CCCEEEEEEEcCCcc
Confidence 11111 1367799999999886 5699999998 689999999977643
No 22
>PF00190 Cupin_1: Cupin; InterPro: IPR006045 This family represents the conserved barrel domain of the 'cupin' superfamily ('cupa' is the Latin term for a small barrel). This family contains 11S and 7S plant seed storage proteins, and germins. Plant seed storage proteins provide the major nitrogen source for the developing plant. ; GO: 0045735 nutrient reservoir activity; PDB: 2E9Q_A 2EVX_A 1OD5_A 1UCX_A 1UD1_C 1FXZ_C 3KGL_C 3KSC_D 1UIJ_F 1IPK_B ....
Probab=97.35 E-value=0.0024 Score=49.89 Aligned_cols=89 Identities=15% Similarity=0.063 Sum_probs=64.5
Q ss_pred CceEEEEEEccCCCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceE--EeeCCcEEEEcCCCCe
Q psy5204 81 GKFNLILLCWSQGQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRT--ELAENDVCYINDSMGL 158 (209)
Q Consensus 81 ~~~~l~~~~W~pG~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~--~~~~G~v~~~~~~~~i 158 (209)
..+.+.+....||....+|-| .++.++.|++|+.+........... -...... .+++||+..++. ..+
T Consensus 32 ~~~~~~~~~i~pg~~~~Ph~h-~a~~i~~V~~G~~~~~~v~~~~~~~--------~~~~~~~~v~l~~Gdv~~vP~-G~~ 101 (144)
T PF00190_consen 32 NGVAVRRVLIEPGGLRAPHYH-NADEIVYVIEGRGRVGVVGPGGPQE--------EFRDFSQKVRLKAGDVFVVPA-GHP 101 (144)
T ss_dssp TTEEEEEEEEETTEEEEEEEE-SSEEEEEEEESEEEEEEEETTCSSS--------EEEEEEEEEEEETTEEEEE-T-T-E
T ss_pred cceEEEeeehhcCCccceeEe-eeeEEeeeeccceEEEEEecCCccc--------cceeeeceeeeecccceeecc-cee
Confidence 456777788899999999999 7899999999999977764322100 0011123 599999988875 569
Q ss_pred EEEecCCCCCceEEEEEeCCCC
Q psy5204 159 HKVGNSGPHPAAVSLHLYSPPF 180 (209)
Q Consensus 159 H~v~n~~~~~~avSLHvY~pp~ 180 (209)
|-+.|.+ +++..-|.++.-.-
T Consensus 102 h~~~n~~-~~~~~~~~~f~~~~ 122 (144)
T PF00190_consen 102 HWIINDG-DDEALVLIIFDTNN 122 (144)
T ss_dssp EEEEECS-SSSEEEEEEEEESS
T ss_pred EEEEcCC-CCCCEEEEEEECCC
Confidence 9999997 57788887775443
No 23
>TIGR03214 ura-cupin putative allantoin catabolism protein. This model represents a protein containing a tandem arrangement of cupin domains (N-terminal part of pfam07883 and C-terminal more distantly related to pfam00190). This protein is found in the vicinity of genes involved in the catabolism of allantoin, a breakdown product of urate and sometimes of urate iteslf. The distribution of pathway components in the genomes in which this family is observed suggests that the function is linked to the allantoate catabolism to glyoxylate pathway (GenProp0686) since it is sometimes found in genomes lacking any elements of the xanthine-to-allantoin pathways (e.g. in Enterococcus faecalis).
Probab=97.19 E-value=0.0059 Score=52.69 Aligned_cols=78 Identities=14% Similarity=0.200 Sum_probs=59.0
Q ss_pred CceEEEEEEccCCCccC-CCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEEEEcCCCCeE
Q psy5204 81 GKFNLILLCWSQGQASS-IHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVCYINDSMGLH 159 (209)
Q Consensus 81 ~~~~l~~~~W~pG~~sp-iHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~~~~iH 159 (209)
..|+++++...||..++ .|.|.+..-++.|++|+++.+. ++ ....+.+||..++++ +.-|
T Consensus 56 ~~f~~~~v~l~pgg~~~~~~~~~g~ee~iyVl~G~l~v~~-----~g-------------~~~~L~~Gd~~y~pa-~~~H 116 (260)
T TIGR03214 56 ATFVQYIVEVHPGGGNTTGFGGEGIETFLFVISGEVNVTA-----EG-------------ETHELREGGYAYLPP-GSKW 116 (260)
T ss_pred CcEEEEEEEECCCCcCCCCCCCCceEEEEEEEeCEEEEEE-----CC-------------EEEEECCCCEEEECC-CCCE
Confidence 69999999999987764 4456665789999999999763 11 357899999999976 5699
Q ss_pred EEecCCCCCceEEEEEeCCC
Q psy5204 160 KVGNSGPHPAAVSLHLYSPP 179 (209)
Q Consensus 160 ~v~n~~~~~~avSLHvY~pp 179 (209)
++.|.+ +++ ..++++-.+
T Consensus 117 ~~~N~~-~~~-a~~l~v~k~ 134 (260)
T TIGR03214 117 TLANAQ-AED-ARFFLYKKR 134 (260)
T ss_pred EEEECC-CCC-EEEEEEEee
Confidence 999998 455 445554433
No 24
>TIGR03214 ura-cupin putative allantoin catabolism protein. This model represents a protein containing a tandem arrangement of cupin domains (N-terminal part of pfam07883 and C-terminal more distantly related to pfam00190). This protein is found in the vicinity of genes involved in the catabolism of allantoin, a breakdown product of urate and sometimes of urate iteslf. The distribution of pathway components in the genomes in which this family is observed suggests that the function is linked to the allantoate catabolism to glyoxylate pathway (GenProp0686) since it is sometimes found in genomes lacking any elements of the xanthine-to-allantoin pathways (e.g. in Enterococcus faecalis).
Probab=97.03 E-value=0.012 Score=50.79 Aligned_cols=96 Identities=13% Similarity=0.114 Sum_probs=65.9
Q ss_pred cCccchhhhhccCCcc-ceEEeeec-CCCceEEEEEEccCCCccCC-CCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCC
Q psy5204 55 SNADDWKRYAQFDENR-YTRNLVDE-GDGKFNLILLCWSQGQASSI-HDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIG 131 (209)
Q Consensus 55 ~~~~~w~~~~~~~~~~-Y~R~ll~~-~~~~~~l~~~~W~pG~~spi-HDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~ 131 (209)
.+.+++........+. ..|.+|-+ ....+.+-++...||..+|. |-|....+ +.||+|+-.... ++
T Consensus 149 g~~~dv~~~~~~g~~~~~~~~llp~~~~~~~~~~~~~~~PG~~~~~~~~H~~eh~-~yiL~G~G~~~~-----~g----- 217 (260)
T TIGR03214 149 GNEKDIEPEPYEGMDDVILTTLLPKELAFDMNVHILSFEPGASHPYIETHVMEHG-LYVLEGKGVYNL-----DN----- 217 (260)
T ss_pred cCHHHCCccccCCCCcEEEEEeCchhcCCCcEEEEEEECCCcccCCcccccceeE-EEEEeceEEEEE-----CC-----
Confidence 3444554333333333 66777732 23478899999999999996 55553344 799999987653 11
Q ss_pred CCccccccceEEeeCCcEEEEcCCCCeEEEecCCCCCceE
Q psy5204 132 QDRPLSEISRTELAENDVCYINDSMGLHKVGNSGPHPAAV 171 (209)
Q Consensus 132 ~~~~l~~~~~~~~~~G~v~~~~~~~~iH~v~n~~~~~~av 171 (209)
....+.+||+++|.+ ...|.+.|.+ +++..
T Consensus 218 --------~~~~V~~GD~i~i~~-~~~h~~~~~G-~~~~~ 247 (260)
T TIGR03214 218 --------NWVPVEAGDYIWMGA-YCPQACYAGG-RGEFR 247 (260)
T ss_pred --------EEEEecCCCEEEECC-CCCEEEEecC-CCcEE
Confidence 367799999999987 6799999998 45543
No 25
>COG4101 Predicted mannose-6-phosphate isomerase [Carbohydrate transport and metabolism]
Probab=96.84 E-value=0.014 Score=44.64 Aligned_cols=86 Identities=19% Similarity=0.196 Sum_probs=69.2
Q ss_pred CCceEEEEEEccCCCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEEEEcCCCCeE
Q psy5204 80 DGKFNLILLCWSQGQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVCYINDSMGLH 159 (209)
Q Consensus 80 ~~~~~l~~~~W~pG~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~~~~iH 159 (209)
.....+-++..+||..+.-|-|++-..+++||+|+.+.- |. ..|+ ....+++||-.|+.+ .--|
T Consensus 43 as~i~~~~vTi~pgAkakaH~H~~hEtaIYvlsG~ah~w-~G------------~rLE--~ha~~~pGDf~YiPp-gVPH 106 (142)
T COG4101 43 ASGICMHLVTIPPGAKAKAHLHEEHETAIYVLSGEAHTW-YG------------NRLE--EHAEVGPGDFFYIPP-GVPH 106 (142)
T ss_pred cceeeEEEEeeCCCccccccccccccEEEEEEeceeeee-ec------------ccee--eeEEecCCCeEEcCC-CCCC
Confidence 456778888999999999999999889999999987643 31 1144 356789999999986 4599
Q ss_pred EEecCCCCCceEEEEEeCCCCCc
Q psy5204 160 KVGNSGPHPAAVSLHLYSPPFNI 182 (209)
Q Consensus 160 ~v~n~~~~~~avSLHvY~pp~~~ 182 (209)
+-.|.| ++|+.-+=+-+.|..+
T Consensus 107 qp~N~S-~ep~s~vIaRsDp~~~ 128 (142)
T COG4101 107 QPANLS-TEPLSAVIARSDPNPQ 128 (142)
T ss_pred cccccC-CCCeEEEEEccCCCCC
Confidence 999999 7999988888877654
No 26
>TIGR03404 bicupin_oxalic bicupin, oxalate decarboxylase family. Members of this protein family are defined as bicupins as they have two copies of the cupin domain (pfam00190). Two different known activities for members of this family are oxalate decarboxylase (EC 4.1.1.2) and oxalate oxidase (EC 1.2.3.4), although the latter activity has more often been found in distantly related monocupin (germin) proteins.
Probab=96.72 E-value=0.019 Score=51.97 Aligned_cols=82 Identities=13% Similarity=0.121 Sum_probs=62.3
Q ss_pred CceEEEEEEccCCCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEEEEcCCCCeEE
Q psy5204 81 GKFNLILLCWSQGQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVCYINDSMGLHK 160 (209)
Q Consensus 81 ~~~~l~~~~W~pG~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~~~~iH~ 160 (209)
..+++..+...||...++|.|.+. =++.|++|+++...... ++. . -...+++||+.+|+. ..+|.
T Consensus 65 ~~ls~~~~~l~pG~~~~~HwH~~~-E~~yVl~G~~~v~~~d~--~g~--------~---~~~~L~~GD~~~fP~-g~~H~ 129 (367)
T TIGR03404 65 TAIAGVNMRLEPGAIRELHWHKEA-EWAYVLYGSCRITAVDE--NGR--------N---YIDDVGAGDLWYFPP-GIPHS 129 (367)
T ss_pred ccccceEEEEcCCCCCCcccCCCc-eEEEEEeeEEEEEEEcC--CCc--------E---EEeEECCCCEEEECC-CCeEE
Confidence 468899999999999999999875 58999999999887531 111 0 123699999999975 56999
Q ss_pred EecCCCCCceEEEEEeCCC
Q psy5204 161 VGNSGPHPAAVSLHLYSPP 179 (209)
Q Consensus 161 v~n~~~~~~avSLHvY~pp 179 (209)
+.|.+ +.+..|=++.+.
T Consensus 130 ~~n~~--~~~~~l~vf~~~ 146 (367)
T TIGR03404 130 LQGLD--EGCEFLLVFDDG 146 (367)
T ss_pred EEECC--CCeEEEEEeCCc
Confidence 99985 345556566554
No 27
>PF12973 Cupin_7: ChrR Cupin-like domain; PDB: 3O14_B 2Z2S_F 2Q1Z_B 3EBR_A.
Probab=96.63 E-value=0.0088 Score=43.10 Aligned_cols=69 Identities=14% Similarity=0.112 Sum_probs=50.3
Q ss_pred ceEEeeecC--CCceEEEEEEccCCCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCc
Q psy5204 71 YTRNLVDEG--DGKFNLILLCWSQGQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAEND 148 (209)
Q Consensus 71 Y~R~ll~~~--~~~~~l~~~~W~pG~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~ 148 (209)
..+..|+.. .....+.++-|.||...|.|.|.+ .-.+.||+|++... ...+.+|+
T Consensus 10 v~~~~L~~~~~~~g~~~~L~r~~pG~~~p~H~H~g-~ee~~VLeG~~~d~----------------------~~~~~~G~ 66 (91)
T PF12973_consen 10 VSVKPLHRDEGETGERVSLLRLEPGASLPRHRHPG-GEEILVLEGELSDG----------------------DGRYGAGD 66 (91)
T ss_dssp EEEEEEEECSSSTTEEEEEEEE-TTEEEEEEEESS--EEEEEEECEEEET----------------------TCEEETTE
T ss_pred EEEEEeccCCCcccCEEEEEEECCCCCcCccCCCC-cEEEEEEEEEEEEC----------------------CccCCCCe
Confidence 455666642 236789999999999999999996 56789999999821 12368999
Q ss_pred EEEEcCCCCeEEEec
Q psy5204 149 VCYINDSMGLHKVGN 163 (209)
Q Consensus 149 v~~~~~~~~iH~v~n 163 (209)
..+.++ ...|+...
T Consensus 67 ~~~~p~-g~~h~~~s 80 (91)
T PF12973_consen 67 WLRLPP-GSSHTPRS 80 (91)
T ss_dssp EEEE-T-TEEEEEEE
T ss_pred EEEeCC-CCccccCc
Confidence 877765 56999984
No 28
>COG3837 Uncharacterized conserved protein, contains double-stranded beta-helix domain [Function unknown]
Probab=96.08 E-value=0.057 Score=43.18 Aligned_cols=86 Identities=22% Similarity=0.172 Sum_probs=63.3
Q ss_pred eEEeeec--CCCceEEEEEEccCCCcc-CCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCc
Q psy5204 72 TRNLVDE--GDGKFNLILLCWSQGQAS-SIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAEND 148 (209)
Q Consensus 72 ~R~ll~~--~~~~~~l~~~~W~pG~~s-piHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~ 148 (209)
+|..|-+ +-++|=|-+..-+||..| ..|.|....-|+.||+|+..-.. ++ .+..|+|||
T Consensus 29 ~~~~lG~~~Gl~~fGvn~~~v~PG~~Ss~~H~Hs~edEfv~ILeGE~~l~~-----d~-------------~e~~lrpGD 90 (161)
T COG3837 29 TRTRLGDALGLKRFGVNLEIVEPGGESSLRHWHSAEDEFVYILEGEGTLRE-----DG-------------GETRLRPGD 90 (161)
T ss_pred hhhhhhhhcChhhcccceEEeCCCCccccccccccCceEEEEEcCceEEEE-----CC-------------eeEEecCCc
Confidence 3344443 567888888888998865 56999888899999999988653 11 367799999
Q ss_pred EEEEcCC-CCeEEEecCCCCCceEEEEEe
Q psy5204 149 VCYINDS-MGLHKVGNSGPHPAAVSLHLY 176 (209)
Q Consensus 149 v~~~~~~-~~iH~v~n~~~~~~avSLHvY 176 (209)
+..|... ..-|.+.|.| +...+=|-|=
T Consensus 91 ~~gFpAG~~~aHhliN~s-~~~~~yL~vG 118 (161)
T COG3837 91 SAGFPAGVGNAHHLINRS-DVILRYLEVG 118 (161)
T ss_pred eeeccCCCcceeEEeecC-CceEEEEEec
Confidence 9888642 2389999998 4566666553
No 29
>PF02311 AraC_binding: AraC-like ligand binding domain; InterPro: IPR003313 This entry defines the arabinose-binding and dimerisation domain of the bacterial gene regulatory protein AraC. The crystal structure of the arabinose-binding and dimerization domain of the Escherichia coli gene regulatory protein AraC was determined in the presence and absence of L-arabinose. The arabinose-bound molecule shows that the protein adopts an unusual fold, binding sugar within a beta barrel and completely burying the arabinose with the amino-terminal arm of the protein. Dimer contacts in the presence of arabinose are mediated by an antiparallel coiled-coil. In the uncomplexed protein, the amino-terminal arm is disordered, uncovering the sugar-binding pocket and allowing it to serve as an oligomerization interface [].; GO: 0006355 regulation of transcription, DNA-dependent; PDB: 1XJA_B 2ARA_A 2AAC_B 2ARC_A.
Probab=95.90 E-value=0.1 Score=38.50 Aligned_cols=75 Identities=15% Similarity=0.006 Sum_probs=50.0
Q ss_pred EEEEEEcc-CCCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEEEEcCCCCeEEEe
Q psy5204 84 NLILLCWS-QGQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVCYINDSMGLHKVG 162 (209)
Q Consensus 84 ~l~~~~W~-pG~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~~~~iH~v~ 162 (209)
.+....|. ++...+.|-|. ..-++.|++|+..... + .....+++|+++.+.| +.+|+..
T Consensus 3 ~~~~~~~~~~~~~~~~h~h~-~~~i~~v~~G~~~~~~-----~-------------~~~~~l~~g~~~li~p-~~~H~~~ 62 (136)
T PF02311_consen 3 DFYGISQRSPNFEFPPHWHD-FYEIIYVLSGEGTLHI-----D-------------GQEYPLKPGDLFLIPP-GQPHSYY 62 (136)
T ss_dssp EEEEEETTSTT-SEEEETT--SEEEEEEEEE-EEEEE-----T-------------TEEEEE-TT-EEEE-T-TS-EEEE
T ss_pred EEEEEEecCCCCccCCEECC-CEEEEEEeCCEEEEEE-----C-------------CEEEEEECCEEEEecC-CccEEEe
Confidence 45566677 78888999999 5889999999998654 1 1367899999999887 5799999
Q ss_pred cCCCCCceEEEEEeCCC
Q psy5204 163 NSGPHPAAVSLHLYSPP 179 (209)
Q Consensus 163 n~~~~~~avSLHvY~pp 179 (209)
..+ +++.....||-++
T Consensus 63 ~~~-~~~~~~~~i~~~~ 78 (136)
T PF02311_consen 63 PDS-NEPWEYYWIYFSP 78 (136)
T ss_dssp E-T-TSEEEEEEEEE--
T ss_pred cCC-CCCEEEEEEEECH
Confidence 776 3577777777766
No 30
>PRK12335 tellurite resistance protein TehB; Provisional
Probab=95.20 E-value=0.078 Score=46.05 Aligned_cols=79 Identities=13% Similarity=0.178 Sum_probs=57.6
Q ss_pred cCCCccCCCCC-CCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEEEEcCCCCeEEEecCCCCCc
Q psy5204 91 SQGQASSIHDH-AQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVCYINDSMGLHKVGNSGPHPA 169 (209)
Q Consensus 91 ~pG~~spiHDH-~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~~~~iH~v~n~~~~~~ 169 (209)
-|+..-..|+| .|.|+.+.||+|++.-..|. +++. ......+.+.....+.++...|++...+ ++.
T Consensus 19 ~p~~~~~~H~t~~g~~~~~~vl~G~l~~~~~d--e~g~----------~~~~~~l~~~~~~~~i~p~~wh~v~~~s-~d~ 85 (287)
T PRK12335 19 LPEMFQEKHNTKEGTWAKLTVLKGELKFYELT--EDGE----------ELSEHIFDAENQPPFIEPQAWHRIEAAS-DDL 85 (287)
T ss_pred chHHHHhccCCCCCcceEEEEEeeeEEEEEEC--CCCC----------eeeEEEEecCCCCceeCCcceEEEEEcC-CCc
Confidence 57888899999 68899999999999999883 2321 1134556655433333345799999887 578
Q ss_pred eEEEEEeCCCCCc
Q psy5204 170 AVSLHLYSPPFNI 182 (209)
Q Consensus 170 avSLHvY~pp~~~ 182 (209)
...|.+|..|...
T Consensus 86 ~~~l~fy~~~~~~ 98 (287)
T PRK12335 86 ECQLSFYCKPEDY 98 (287)
T ss_pred EEEEEEEEcchhh
Confidence 8899999988653
No 31
>COG3615 TehB Uncharacterized protein/domain, possibly involved in tellurite resistance [Inorganic ion transport and metabolism]
Probab=94.60 E-value=0.22 Score=36.60 Aligned_cols=65 Identities=12% Similarity=0.171 Sum_probs=43.8
Q ss_pred CCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEEEEcCCCCeEEEecCCCCCceEEEEEeCCC
Q psy5204 102 AQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVCYINDSMGLHKVGNSGPHPAAVSLHLYSPP 179 (209)
Q Consensus 102 ~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~~~~iH~v~n~~~~~~avSLHvY~pp 179 (209)
.|.|+-+.||+|+++-.-|. +.+.. . .+.++.++.-..+-++...|+|+..+ |+.-.-|++|..|
T Consensus 34 ~G~w~kLsVl~G~vk~~~~~--ee~~~-------~---~e~~~~~ea~~~~~~PQ~WHrVea~t-DD~e~~l~Fyc~~ 98 (99)
T COG3615 34 PGTWGKLSVLKGAVKFLGLA--EEGET-------E---PEHVFSIEAQFPVFPPQAWHRVEAMT-DDAEFNLSFYCAP 98 (99)
T ss_pred CCceeEEEEEeceeEEEEEc--CCCCc-------c---ceEEEeecCCCCccChhHeeeeeecc-cccEEEEEEEEcC
Confidence 45799999999999876663 33321 1 24445444433333446799999887 5788899998755
No 32
>PF09313 DUF1971: Domain of unknown function (DUF1971); InterPro: IPR015392 This uncharacterised domain is predominantly found in bacterial Tellurite resistance proteins. ; PDB: 3BB6_C 3M70_A 3DL3_I.
Probab=94.44 E-value=0.51 Score=33.76 Aligned_cols=52 Identities=13% Similarity=0.089 Sum_probs=37.4
Q ss_pred CceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEEEEcCCCCeEEEecCCC
Q psy5204 103 QAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVCYINDSMGLHKVGNSGP 166 (209)
Q Consensus 103 ~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~~~~iH~v~n~~~ 166 (209)
|.||.+.||+|+|+-..+.- ++. .........+|+...+.| ...|+|+-.|+
T Consensus 24 GtWg~l~Vl~G~L~f~~~~~--~~~---------~~~~~~~~~~~~~~~i~P-q~wH~V~p~s~ 75 (82)
T PF09313_consen 24 GTWGKLRVLEGELKFYGLDE--EGE---------EPEEEVFIPAGQPPVIEP-QQWHRVEPLSD 75 (82)
T ss_dssp TEEEEEEEEESEEEEEEESS--TT----------SESEEEEEETTEEEEE-T-T-EEEEEESST
T ss_pred CeEEEEEEEeeEEEEEEECC--CCC---------ceeEEEEeCCCCCceeCC-CceEEEEECCC
Confidence 56999999999999887742 221 122466778999888877 57999998874
No 33
>TIGR03037 anthran_nbaC 3-hydroxyanthranilate 3,4-dioxygenase. Members of this protein family, from both bacteria and eukaryotes, are the enzyme 3-hydroxyanthranilate 3,4-dioxygenase. This enzyme acts on the tryptophan metabolite 3-hydroxyanthranilate and produces 2-amino-3-carboxymuconate semialdehyde, which can rearrange spontaneously to quinolinic acid and feed into nicotinamide biosynthesis, or undergo further enzymatic degradation.
Probab=93.87 E-value=0.61 Score=37.51 Aligned_cols=96 Identities=13% Similarity=0.127 Sum_probs=65.1
Q ss_pred EeeecCCCceEEEEEEc-cCCCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEEEE
Q psy5204 74 NLVDEGDGKFNLILLCW-SQGQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVCYI 152 (209)
Q Consensus 74 ~ll~~~~~~~~l~~~~W-~pG~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~ 152 (209)
..|+. +. ++.++.. .||+++..|.|. +.-++.+|+|++....-. ++. .....+++||...+
T Consensus 21 ~~l~~-~~--~~~v~~vgGpn~R~d~H~~~-tdE~FyqleG~~~l~v~d---~g~-----------~~~v~L~eGd~flv 82 (159)
T TIGR03037 21 QQIWQ-DS--EFMVTVVGGPNARTDFHDDP-GEEFFYQLKGEMYLKVTE---EGK-----------REDVPIREGDIFLL 82 (159)
T ss_pred eEeec-CC--cEEEEEeCCCCCCcccccCC-CceEEEEEcceEEEEEEc---CCc-----------EEEEEECCCCEEEe
Confidence 55664 34 4555556 999999999999 588999999999887531 110 13688999998777
Q ss_pred cCCCCeEEEecCCCCCceEEEEEeCCCCC--cceEEECC
Q psy5204 153 NDSMGLHKVGNSGPHPAAVSLHLYSPPFN--ICSSFDAQ 189 (209)
Q Consensus 153 ~~~~~iH~v~n~~~~~~avSLHvY~pp~~--~~~~yd~~ 189 (209)
++ +--|+..... +...+.+=--.++.. +++-|-+.
T Consensus 83 P~-gvpHsP~r~~-~t~~LvIE~~r~~~~~d~~~wyc~~ 119 (159)
T TIGR03037 83 PP-HVPHSPQRPA-GSIGLVIERKRPQGELDGFQWFCPQ 119 (159)
T ss_pred CC-CCCcccccCC-CcEEEEEEeCCCCCCCcceEEECCC
Confidence 64 5588887654 345555555555433 35667665
No 34
>PRK13264 3-hydroxyanthranilate 3,4-dioxygenase; Provisional
Probab=93.40 E-value=0.96 Score=37.00 Aligned_cols=97 Identities=9% Similarity=0.086 Sum_probs=63.4
Q ss_pred EeeecCCCceEEEEEEccCCCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEEEEc
Q psy5204 74 NLVDEGDGKFNLILLCWSQGQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVCYIN 153 (209)
Q Consensus 74 ~ll~~~~~~~~l~~~~W~pG~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~ 153 (209)
++|+. ++.|.+++ +=.||+++..|.|.+ --++.+|+|.+....-+ ++. .....+++||...++
T Consensus 27 ~~l~~-~~d~~Vmv-vgGpn~r~d~H~~~t-dE~FyqleG~~~l~v~d---~g~-----------~~~v~L~eGd~fllP 89 (177)
T PRK13264 27 KQIWQ-DSDFIVMV-VGGPNARTDFHYDPG-EEFFYQLEGDMYLKVQE---DGK-----------RRDVPIREGEMFLLP 89 (177)
T ss_pred eeeEc-CCCEEEEE-EccCCcccccccCCC-ceEEEEECCeEEEEEEc---CCc-----------eeeEEECCCCEEEeC
Confidence 56775 45555553 349999999999995 78999999999888743 211 136789999987765
Q ss_pred CCCCeEEEecCCCCCceEEEEEeCCCC--CcceEEECC
Q psy5204 154 DSMGLHKVGNSGPHPAAVSLHLYSPPF--NICSSFDAQ 189 (209)
Q Consensus 154 ~~~~iH~v~n~~~~~~avSLHvY~pp~--~~~~~yd~~ 189 (209)
+ +--|+..... +...+.+=-=+++- .+++-|...
T Consensus 90 ~-gvpHsP~r~~-~tv~LviE~~r~~~~~d~~~wyc~~ 125 (177)
T PRK13264 90 P-HVPHSPQREA-GSIGLVIERKRPEGELDGFQWYCDE 125 (177)
T ss_pred C-CCCcCCccCC-CeEEEEEEeCCCCCCccceEEECCC
Confidence 4 5689886642 33344443333332 245667665
No 35
>PF02678 Pirin: Pirin; InterPro: IPR003829 This entry represents N-terminal domain of Pirin proteins from both eukaryotes and prokaryotes. The function of Pirin is unknown but the gene coding for this protein is known to be expressed in all tissues in the human body although it is expressed most strongly in the liver and heart. Pirin is known to be a nuclear protein, exclusively localised within the nucleoplasma and predominantly concentrated within dot-like subnuclear structures []. Pirin is composed of two structurally similar domains arranged face to face. The N-terminal domain additionally features four beta-strands, and the C-terminal domain also includes four additional -strands and a short alpha-helix. Although the two domains are similar, the C-terminal domain of Pirin differs from the N-terminal domain as it does not contain a metal binding site and its sequence does not contain the conserved metal-coordinating residues []. Pirin is confirmed to be a member of the cupin superfamily on the basis of primary sequence and structural similarity. The presence of a metal binding site in the N-terminal beta-barrel of Pirin, may be significant in its role in regulating NFI DNA replication and NF-kappaB transcription factor activity []. Pirin structure has been found to closely resemble members of the cupin superfamily. Pirin contains the two characteristic sequences of the cupin superfamily, namely PG-(X)5-HXH-(X)4-E-(X)6-G and G-(X)5-PXG-(X)2-H-(X)3-N separated by a variable stretch of 15-50 amino acids. These motifs are best conserved in the N-terminal where the conserved histidine and glutamic acid residues correspond to the metal-coordinating residues. The C-terminal domain motifs lack the metal binding residues normally associated with the cupin fold []. Pirin was identified to be a metal-binding protein [], and was found that the metal-binding residues of Pirins are highly conserved across mammals, plants, fungi, and prokaryotic organisms. Pirin acts as a cofactor for the transcription factor NFI, the regulatory mechanism of which is generally believed to require the assistance of a metal ion []. Structural data supports the hypothesis that the bound iron of Pirin may participate in this transcriptional regulation by enhancing and stabilising the formation of the p50,Bcl3,DNA complex []. Metals have been implicated directly or indirectly in the NF-kappaB family of transcription factors that control expression of a number of early response genes associated with inflammatory responses, cell growth, cell cycle progression, and neoplastic transformation []. However, most metal-dependent transcription factors are DNA-binding proteins that bind to specific sequences when the metal binds to the protein. Pirin, on the other hand, appears to function differently and bind to the transcription factor DNA complex [].; PDB: 2VEC_A 1J1L_A 3ACL_A 2P17_A 1TQ5_A.
Probab=92.97 E-value=0.59 Score=35.04 Aligned_cols=66 Identities=23% Similarity=0.303 Sum_probs=47.9
Q ss_pred CCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEEEEcCCCC-eEEEecCCCCCceE
Q psy5204 93 GQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVCYINDSMG-LHKVGNSGPHPAAV 171 (209)
Q Consensus 93 G~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~~~~-iH~v~n~~~~~~av 171 (209)
+..-+.|=|.+...+-.|++|+++..- +.+ ...++.+|++-+|....| .|+-.|.+.+++.-
T Consensus 39 ~~gf~~HPH~g~eivTyv~~G~~~H~D----s~G-------------~~~~l~~G~vq~m~AG~Gi~H~E~~~~~~~~~~ 101 (107)
T PF02678_consen 39 GAGFPMHPHRGFEIVTYVLEGELRHRD----SLG-------------NRGVLRAGDVQWMTAGSGIVHSERNASDGGPLH 101 (107)
T ss_dssp TTEEEEEEECSEEEEEEEEESEEEEEE----TTS-------------EEEEEETTEEEEEE-TTTEEEEEEE-TSSS-EE
T ss_pred CCCCCCcCCCCceEEEEEecCEEEEEC----CCC-------------CeeEeCCCeEEEEeCCCCceEEEecCCCCCeEE
Confidence 677799999998888889999998662 111 357799999999975456 99999997435665
Q ss_pred EEEE
Q psy5204 172 SLHL 175 (209)
Q Consensus 172 SLHv 175 (209)
.|.+
T Consensus 102 ~lQl 105 (107)
T PF02678_consen 102 GLQL 105 (107)
T ss_dssp EEEE
T ss_pred EEEE
Confidence 5544
No 36
>COG1741 Pirin-related protein [General function prediction only]
Probab=92.81 E-value=0.61 Score=40.78 Aligned_cols=73 Identities=16% Similarity=0.235 Sum_probs=55.9
Q ss_pred ccCCCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEEEEcCCCC-eEEEecC-CCC
Q psy5204 90 WSQGQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVCYINDSMG-LHKVGNS-GPH 167 (209)
Q Consensus 90 W~pG~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~~~~-iH~v~n~-~~~ 167 (209)
..||+.-++|=|.+..-+-.||+|+++.+- +.+ ...+++||+|-+|+...| +|+=.|+ +.+
T Consensus 51 ~~pG~~f~pHPHrg~etvTyvl~G~i~HrD----S~G-------------n~~~i~pGdvqwMTAG~GI~HSE~~~~~~~ 113 (276)
T COG1741 51 LAPGRGFPPHPHRGLETVTYVLDGEIEHRD----SLG-------------NKGVIRPGDVQWMTAGSGIVHSEMNPPSTG 113 (276)
T ss_pred ccCCCcCCCCCCCCcEEEEEEEccEEEEee----cCC-------------ceeeecccceeEEcCCCceeecccCCccCC
Confidence 579999999999999999999999988762 222 367789999999986556 8888887 444
Q ss_pred CceEEEEEeCCC
Q psy5204 168 PAAVSLHLYSPP 179 (209)
Q Consensus 168 ~~avSLHvY~pp 179 (209)
++.-.+.+.-.+
T Consensus 114 ~~l~~~QlWv~l 125 (276)
T COG1741 114 KPLHGLQLWVNL 125 (276)
T ss_pred CccceeeeecCC
Confidence 455566666544
No 37
>PF13621 Cupin_8: Cupin-like domain; PDB: 3AL6_C 3AL5_C 2XUM_A 2Y0I_A 1MZE_A 3KCY_A 1MZF_A 1YCI_A 2ILM_A 1H2L_A ....
Probab=92.42 E-value=0.88 Score=37.61 Aligned_cols=89 Identities=15% Similarity=0.182 Sum_probs=54.4
Q ss_pred EEEEccCCCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCC--C----------------CC--CCCcccc--ccceEE
Q psy5204 86 ILLCWSQGQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTS--D----------------EI--GQDRPLS--EISRTE 143 (209)
Q Consensus 86 ~~~~W~pG~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~--~----------------~~--~~~~~l~--~~~~~~ 143 (209)
.+..=.+|..|+.|-.. +..++.++.|.=+-..|....... . .+ .....+. ..-+.+
T Consensus 134 ~l~ig~~gs~t~lH~D~-~~n~~~~i~G~K~~~L~pP~~~~~l~~~~~~~~~~~~~~~d~~~~d~~~~p~~~~~~~~~~~ 212 (251)
T PF13621_consen 134 NLWIGPPGSFTPLHYDP-SHNLLAQIRGRKRWILFPPDDSPNLYPRPDSHGGTVFSWVDPDNPDLERFPKFRKAPPYEVV 212 (251)
T ss_dssp EEEEE-TTEEEEEEE-S-SEEEEEEEESEEEEEEE-GGGGGGCTBETTTST-TCBBSS-TTS--TTT-CGGGG--EEEEE
T ss_pred EEEEeCCCceeeeeECc-hhhhhhccCCCEEEEEECCccccccccceecccccceeeeeccChhhhhhhhhccCceeEEE
Confidence 34445778999999877 589999999998888875221100 0 00 0000011 234788
Q ss_pred eeCCcEEEEcCCCCeEEEecCCCCCceEEEEEe
Q psy5204 144 LAENDVCYINDSMGLHKVGNSGPHPAAVSLHLY 176 (209)
Q Consensus 144 ~~~G~v~~~~~~~~iH~v~n~~~~~~avSLHvY 176 (209)
+.|||+.++++ .=.|+|.|.++++.+||+.+.
T Consensus 213 l~pGD~LfiP~-gWwH~V~~~~~~~~sisvn~w 244 (251)
T PF13621_consen 213 LEPGDVLFIPP-GWWHQVENLSDDDLSISVNYW 244 (251)
T ss_dssp EETT-EEEE-T-T-EEEEEESTTSSCEEEEEEE
T ss_pred ECCCeEEEECC-CCeEEEEEcCCCCeEEEEEEE
Confidence 99999999976 349999999422349998764
No 38
>PF06560 GPI: Glucose-6-phosphate isomerase (GPI); InterPro: IPR010551 This entry consists of several bacterial and archaeal glucose-6-phosphate isomerase (GPI) proteins (5.3.1.9 from EC), which are involved in glycolysis and in gluconeogenesis and catalyse the conversion of D-glucose 6-phosphate to D-fructose 6-phosphate. The deduced amino acid sequence of the first archaeal PGI isolated from Pyrococcus furiosus revealed that it is not related to its eukaryotic and many of its bacterial counterparts. In contrast, this archaeal PGI shares similarity with the cupin superfamily that consists of a variety of proteins that are generally involved in sugar metabolism in both prokaryotes and eukaryotes [].; GO: 0004347 glucose-6-phosphate isomerase activity, 0006094 gluconeogenesis, 0006096 glycolysis, 0005737 cytoplasm; PDB: 1J3Q_B 1J3R_B 1J3P_A 2GC0_A 1X8E_A 1X82_A 1QY4_B 2GC2_B 1QXJ_A 1QXR_B ....
Probab=89.57 E-value=5.7 Score=32.64 Aligned_cols=101 Identities=16% Similarity=0.120 Sum_probs=55.7
Q ss_pred eEEEEEEccCCC------ccCCCCCCC------ceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEE
Q psy5204 83 FNLILLCWSQGQ------ASSIHDHAQ------AHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVC 150 (209)
Q Consensus 83 ~~l~~~~W~pG~------~spiHDH~~------s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~ 150 (209)
...-+....||. .|.-|-|+. .-=++-||+|+-.-..-+...+. ....-...+.+|+++
T Consensus 50 L~ygiTvi~Pg~vG~E~~~T~GH~H~~~~~~~~~pEvY~vl~G~g~~lLq~~~~~~---------~~~~~~v~~~~G~~v 120 (182)
T PF06560_consen 50 LRYGITVIPPGKVGGEYFMTKGHYHPISPCGLSYPEVYEVLSGEGLILLQKEEGDD---------VGDVIAVEAKPGDVV 120 (182)
T ss_dssp EEEEEEEE---EETTEE-B---BB-SS----TT--EEEEEEESSEEEEEE-TTS--------------EEEEEE-TTEEE
T ss_pred EEeeeEEEcCcccCCccccCCCccCCccccCCCCCcEEEEEeCEEEEEEEecCCCc---------ceeEEEEEeCCCCEE
Confidence 445556667775 577899986 34567799998665554322211 222335668899999
Q ss_pred EEcCCCCeEEEecCCCCCceEEEEEeCCCCCc----------ceEEECCCCceE
Q psy5204 151 YINDSMGLHKVGNSGPHPAAVSLHLYSPPFNI----------CSSFDAQTGVKS 194 (209)
Q Consensus 151 ~~~~~~~iH~v~n~~~~~~avSLHvY~pp~~~----------~~~yd~~~g~~~ 194 (209)
+++| .-.|+..|++ +++.+..-++..+... ...|-..+|+..
T Consensus 121 ~IPp-~yaH~tIN~g-~~~L~~~~~~~~~~g~dY~~i~~~gg~a~~~~~~G~~~ 172 (182)
T PF06560_consen 121 YIPP-GYAHRTINTG-DEPLVFAAWVPRDAGHDYGPIEEHGGAAYYVLKDGKII 172 (182)
T ss_dssp EE-T-T-EEEEEE-S-SS-EEEEEEEETT---BCHHHHHH--SEEEEECTTEEE
T ss_pred EECC-CceEEEEECC-CCcEEEEEEEecCCCCCcHHHHhCCCeEEEEEeCCcEE
Confidence 9987 4599999998 7999999999877641 345555667654
No 39
>TIGR02272 gentisate_1_2 gentisate 1,2-dioxygenase. This family consists of gentisate 1,2-dioxygenases. This ring-opening enzyme acts in salicylate degradation that goes via gentisate rather than via catechol. It converts gentisate to maleylpyruvate. Some putative gentisate 1,2-dioxygenases are excluded by a relatively high trusted cutoff score because they are too closely related to known examples of 1-hydroxy-2-naphthoate dioxygenase. Therefore some homologs may be bona fide gentisate 1,2-dioxygenases even if they score below the given cutoffs.
Probab=89.53 E-value=1.3 Score=39.80 Aligned_cols=80 Identities=20% Similarity=0.188 Sum_probs=58.8
Q ss_pred ceEEEEEEccCCCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEEEEcCCCCeEEE
Q psy5204 82 KFNLILLCWSQGQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVCYINDSMGLHKV 161 (209)
Q Consensus 82 ~~~l~~~~W~pG~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~~~~iH~v 161 (209)
.+...+-..+||..++.|=|.. .++..|++|+=.-+. . + ..+..+++||++.. |+...|+-
T Consensus 80 tl~a~~q~l~pGe~~~~HRht~-sAl~~vveG~G~~t~---V-~-------------g~~~~~~~gD~~~t-P~w~wH~H 140 (335)
T TIGR02272 80 SLYAGLQLILPGEVAPSHRHTQ-SALRFIVEGKGAFTA---V-D-------------GERTTMHPGDFIIT-PSWTWHDH 140 (335)
T ss_pred hHHhhhEEeCCCCCCCcccccc-ceEEEEEEcCceEEE---E-C-------------CEEEeeeCCCEEEe-CCCeeEec
Confidence 3344556789999999999994 688889998843222 1 1 13677899998666 55779999
Q ss_pred ecCCCCCceEEEEEeCCCCC
Q psy5204 162 GNSGPHPAAVSLHLYSPPFN 181 (209)
Q Consensus 162 ~n~~~~~~avSLHvY~pp~~ 181 (209)
.|.+ +++.+.|.+-=-|+-
T Consensus 141 ~n~~-d~~~~wld~lD~Pl~ 159 (335)
T TIGR02272 141 GNPG-DEPMIWLDGLDIPLV 159 (335)
T ss_pred ccCC-CCcEEEEecCCHHHH
Confidence 9997 689999877666643
No 40
>PRK10296 DNA-binding transcriptional regulator ChbR; Provisional
Probab=89.40 E-value=3.7 Score=34.89 Aligned_cols=52 Identities=21% Similarity=0.200 Sum_probs=40.3
Q ss_pred CCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEEEEcCCCCeEEEecC
Q psy5204 93 GQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVCYINDSMGLHKVGNS 164 (209)
Q Consensus 93 G~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~~~~iH~v~n~ 164 (209)
+...+.|-|. ..-++.|++|...... ++ ....+.+|++..+++ +.+|.....
T Consensus 33 ~~~~~~H~H~-~~ei~~v~~G~~~~~i-----~~-------------~~~~l~~g~l~~i~p-~~~H~~~~~ 84 (278)
T PRK10296 33 ESVSGLHQHD-YYEFTLVLTGRYYQEI-----NG-------------KRVLLERGDFVFIPL-GSHHQSFYE 84 (278)
T ss_pred hcCCCCcccc-cEEEEEEEeceEEEEE-----CC-------------EEEEECCCcEEEeCC-CCccceeee
Confidence 5567999998 5889999999998654 11 356789999999876 569987544
No 41
>PF14499 DUF4437: Domain of unknown function (DUF4437); PDB: 2QDR_A.
Probab=87.91 E-value=2.8 Score=36.15 Aligned_cols=59 Identities=15% Similarity=0.112 Sum_probs=36.3
Q ss_pred cchhhhhc--cCCccceEEeeecC-CCceEEEEEEccCCCccCCCCCCCceEEEEEEEeEEEE
Q psy5204 58 DDWKRYAQ--FDENRYTRNLVDEG-DGKFNLILLCWSQGQASSIHDHAQAHCFMKMLQGCLTE 117 (209)
Q Consensus 58 ~~w~~~~~--~~~~~Y~R~ll~~~-~~~~~l~~~~W~pG~~spiHDH~~s~~~~~vl~G~l~e 117 (209)
..|.+... .+...-.|.|..++ ...-..+++..++|..+|+|-|.-. =-+.|++|.+.+
T Consensus 8 ~~w~~~~p~~~~~~~~~~~L~gd~~~~g~~~~~vkf~~g~~~pph~H~~~-~~~~Vi~G~~~~ 69 (251)
T PF14499_consen 8 VKWGPLNPARGDKGPGAAVLWGDPTKDGPSGMRVKFPAGFSSPPHIHNAD-YRGTVISGELHN 69 (251)
T ss_dssp EEEE--TTS-TTS--EEEEEEEE--TTS-EEEEEEE-TT-EE--BEESS--EEEEEEESEEEE
T ss_pred ccccccCCCCCCCCcceeeeecCcccCCcceEEEEcCCCccCCCcceeee-EEEEEEEeEEEc
Confidence 44554332 33344677777754 6788889999999999999999964 457899999876
No 42
>PLN00212 glutelin; Provisional
Probab=87.73 E-value=8.6 Score=36.38 Aligned_cols=97 Identities=14% Similarity=-0.003 Sum_probs=64.1
Q ss_pred CCceEEEEEEccCCCccCCCCCCCceEEEEEEEeEEEEEEeeCC-CCCCCCC-------C----CCccccccceEEeeCC
Q psy5204 80 DGKFNLILLCWSQGQASSIHDHAQAHCFMKMLQGCLTEVRYQEP-NHTSDEI-------G----QDRPLSEISRTELAEN 147 (209)
Q Consensus 80 ~~~~~l~~~~W~pG~~spiHDH~~s~~~~~vl~G~l~e~~y~~~-~~~~~~~-------~----~~~~l~~~~~~~~~~G 147 (209)
+-.+++.-....|+...++|-|. +..++.|++|+-....-... .....+. + ....-.......++.|
T Consensus 77 caGv~~~R~~i~p~gL~lP~y~n-a~~liyV~qG~G~~G~v~pGcpeT~~~~~~~~~~~~~~~~~~~~d~hqkv~~lr~G 155 (493)
T PLN00212 77 CTGVFVIRRVIEPQGLLLPRYSN-TPGLVYIIQGRGSMGLTFPGCPATYQQQFQQFLTEGQSQSQKFRDEHQKIHQFRQG 155 (493)
T ss_pred ccceEEEEEEecCCcccCccccC-CCeEEEEEeCeEEEEEEeCCCcchhhhhcccccccccccccccccccccceEeccC
Confidence 45777888889999999999996 67889999999665543110 0000000 0 0000112234689999
Q ss_pred cEEEEcCCCCeEEEecCCCCCceEEEEEeCCC
Q psy5204 148 DVCYINDSMGLHKVGNSGPHPAAVSLHLYSPP 179 (209)
Q Consensus 148 ~v~~~~~~~~iH~v~n~~~~~~avSLHvY~pp 179 (209)
||..++. +-+|-+.|.+ +++.|.|-++...
T Consensus 156 DViaiPa-G~~hw~yN~G-d~~~v~v~~~d~~ 185 (493)
T PLN00212 156 DVVALPA-GVAHWFYNDG-DAPVVALYVYDIN 185 (493)
T ss_pred CEEEECC-CCeEEEEeCC-CCcEEEEEEEecc
Confidence 9999875 5699999998 6787777666543
No 43
>PRK13500 transcriptional activator RhaR; Provisional
Probab=85.46 E-value=7.7 Score=33.87 Aligned_cols=66 Identities=15% Similarity=0.203 Sum_probs=46.7
Q ss_pred CCCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEEEEcCCCCeEEEecCCCCCceE
Q psy5204 92 QGQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVCYINDSMGLHKVGNSGPHPAAV 171 (209)
Q Consensus 92 pG~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~~~~iH~v~n~~~~~~av 171 (209)
|....+.|.|. ..-++.|++|...... ++ ....+.+|+++++++ +.+|.....+ +..+
T Consensus 57 ~~~~~~~H~H~-~~el~~v~~G~g~~~v-----~~-------------~~~~l~~Gdl~~I~~-~~~H~~~~~~--~~~~ 114 (312)
T PRK13500 57 PQDVFAEHTHD-FCELVIVWRGNGLHVL-----ND-------------RPYRITRGDLFYIHA-DDKHSYASVN--DLVL 114 (312)
T ss_pred CCCCCCccccc-eEEEEEEEcCeEEEEE-----CC-------------EEEeecCCeEEEECC-CCeecccccC--CceE
Confidence 44456899999 5889999999988654 11 356788999999986 6799987654 2344
Q ss_pred EEEEeCCC
Q psy5204 172 SLHLYSPP 179 (209)
Q Consensus 172 SLHvY~pp 179 (209)
..=+|.|.
T Consensus 115 ~~i~~~~~ 122 (312)
T PRK13500 115 QNIIYCPE 122 (312)
T ss_pred EEEEEcch
Confidence 44455544
No 44
>PF06339 Ectoine_synth: Ectoine synthase; InterPro: IPR010462 This family consists of several bacterial ectoine synthase proteins. The ectABC genes encode the diaminobutyric acid acetyltransferase (EctA), the diaminobutyric acid aminotransferase (EctB), and the ectoine synthase (EctC). Together these proteins constitute the ectoine biosynthetic pathway [].; GO: 0016836 hydro-lyase activity, 0006596 polyamine biosynthetic process
Probab=84.68 E-value=16 Score=28.18 Aligned_cols=96 Identities=13% Similarity=0.156 Sum_probs=70.6
Q ss_pred ceEEeeecCCCceEEEEEEccCCCccCCC--CCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCc
Q psy5204 71 YTRNLVDEGDGKFNLILLCWSQGQASSIH--DHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAEND 148 (209)
Q Consensus 71 Y~R~ll~~~~~~~~l~~~~W~pG~~spiH--DH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~ 148 (209)
-.|.||..+.=.|++-.-...+|..+.+| +|- -.+.+++|+=+.+.- +++ ....+.||.
T Consensus 23 SrRlll~~DgmGFS~h~T~i~aGtet~~~YknHl---EAvyci~G~Gev~~~---~~G-------------~~~~i~pGt 83 (126)
T PF06339_consen 23 SRRLLLKDDGMGFSFHETTIYAGTETHIHYKNHL---EAVYCIEGEGEVEDL---DTG-------------EVHPIKPGT 83 (126)
T ss_pred EEEEEEccCCCCEEEEEEEEeCCCeeEEEecCce---EEEEEEeceEEEEEc---cCC-------------cEEEcCCCe
Confidence 46777766567999999999999999999 554 346788887665542 122 367788998
Q ss_pred EEEEcCCCCeEEEecCCCCCceEEEEEeCCCCCcceEEECC
Q psy5204 149 VCYINDSMGLHKVGNSGPHPAAVSLHLYSPPFNICSSFDAQ 189 (209)
Q Consensus 149 v~~~~~~~~iH~v~n~~~~~~avSLHvY~pp~~~~~~yd~~ 189 (209)
...++ ..+=|.++.-. ...-+=|+.||+.+--+-|+.
T Consensus 84 ~YaLd-~hD~H~lra~~---dm~~vCVFnPpltG~E~Hd~~ 120 (126)
T PF06339_consen 84 MYALD-KHDRHYLRAKT---DMRLVCVFNPPLTGREVHDEN 120 (126)
T ss_pred EEecC-CCccEEEEecC---CEEEEEEcCCCCcCceecCCC
Confidence 76665 47799998663 577778899999876665544
No 45
>PRK13501 transcriptional activator RhaR; Provisional
Probab=83.71 E-value=8 Score=33.12 Aligned_cols=53 Identities=17% Similarity=0.145 Sum_probs=40.5
Q ss_pred CCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEEEEcCCCCeEEEecCC
Q psy5204 93 GQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVCYINDSMGLHKVGNSG 165 (209)
Q Consensus 93 G~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~~~~iH~v~n~~ 165 (209)
....++|.|. ..-++.+++|..+... ++ ....+.+|+++++++ +.+|.+...+
T Consensus 28 ~~~~~~H~H~-~~ei~~i~~G~~~~~i-----~~-------------~~~~l~~g~~~~I~p-~~~H~~~~~~ 80 (290)
T PRK13501 28 QETFVEHTHQ-FCEIVIVWRGNGLHVL-----ND-------------HPYRITCGDVFYIQA-ADHHSYESVH 80 (290)
T ss_pred CCCCcccccc-ceeEEEEecCceEEEE-----CC-------------eeeeecCCeEEEEcC-CCcccccccC
Confidence 3456789998 5788899999988665 11 367789999999987 5799987553
No 46
>COG4766 EutQ Ethanolamine utilization protein [Amino acid transport and metabolism]
Probab=81.91 E-value=25 Score=28.25 Aligned_cols=80 Identities=19% Similarity=0.148 Sum_probs=51.9
Q ss_pred ceEEeeec-CCCceEEEEEEccCCCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcE
Q psy5204 71 YTRNLVDE-GDGKFNLILLCWSQGQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDV 149 (209)
Q Consensus 71 Y~R~ll~~-~~~~~~l~~~~W~pG~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v 149 (209)
|+-+|+.+ ..++..+-++-|.+-...--=+- .-+..||+|+|.... ++ +..+-+|||+
T Consensus 87 ~~tdLvt~~~g~~l~aG~m~~~~~tf~wtl~y---De~d~VlEGrL~V~~-----~g-------------~tv~a~aGDv 145 (176)
T COG4766 87 YTTDLVTEQEGSRLGAGLMEMKNTTFPWTLNY---DEIDYVLEGRLHVRI-----DG-------------RTVIAGAGDV 145 (176)
T ss_pred EeeceeecccCCccccceeeeccccCcceecc---cceeEEEeeeEEEEE-----cC-------------CeEecCCCcE
Confidence 89999985 45889999999999333222222 246789999999875 11 3566789999
Q ss_pred EEEcCCCCeEEEecCCCCCceEEEEE
Q psy5204 150 CYINDSMGLHKVGNSGPHPAAVSLHL 175 (209)
Q Consensus 150 ~~~~~~~~iH~v~n~~~~~~avSLHv 175 (209)
.+++- +.-=.+..++ + +-.|-|
T Consensus 146 ifiPK-gssIefst~g--e-a~flyv 167 (176)
T COG4766 146 IFIPK-GSSIEFSTTG--E-AKFLYV 167 (176)
T ss_pred EEecC-CCeEEEeccc--e-EEEEEE
Confidence 99864 3323344443 2 555444
No 47
>PF13759 2OG-FeII_Oxy_5: Putative 2OG-Fe(II) oxygenase; PDB: 3BVC_B 2RG4_A.
Probab=80.93 E-value=11 Score=27.16 Aligned_cols=88 Identities=18% Similarity=0.184 Sum_probs=34.6
Q ss_pred EEEEEEccCCCccCCCCCCCce-EEEEEEE-eE-EEEEEeeCCCCCCC--CC----CCCccccccceEEeeCCcEEEEcC
Q psy5204 84 NLILLCWSQGQASSIHDHAQAH-CFMKMLQ-GC-LTEVRYQEPNHTSD--EI----GQDRPLSEISRTELAENDVCYIND 154 (209)
Q Consensus 84 ~l~~~~W~pG~~spiHDH~~s~-~~~~vl~-G~-l~e~~y~~~~~~~~--~~----~~~~~l~~~~~~~~~~G~v~~~~~ 154 (209)
+.|...-.+|...++|+|.+++ +.+.-++ +. .....|..+..... .+ ..............++|+++.|+.
T Consensus 1 ~~W~ni~~~g~~~~~H~H~~s~~SgVyYv~~p~~~~~l~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~G~lvlFPs 80 (101)
T PF13759_consen 1 NSWANIYRKGGYNEPHNHPNSWLSGVYYVQVPEGSGPLRFHDPRGSFSFGAPFDNYDQNDLNSPYYIVEPEEGDLVLFPS 80 (101)
T ss_dssp -EEEEEE-TT--EEEE--TT-SEEEEEECE--TTS-SEEEE-TTCCCGTTS----TTTTCCC-SEEEE---TTEEEEEET
T ss_pred CeeEEEeCCCCccCceECCCcCEEEEEEEECCCCCCceeeeCCCccceecccccccccCcccCceEEeCCCCCEEEEeCC
Confidence 3578888999999999999863 3333221 11 11122221111100 00 000112222344456899888874
Q ss_pred CCCeEEEe-cCCCCCceEEE
Q psy5204 155 SMGLHKVG-NSGPHPAAVSL 173 (209)
Q Consensus 155 ~~~iH~v~-n~~~~~~avSL 173 (209)
.-.|.+. |.+ +++=|||
T Consensus 81 -~l~H~v~p~~~-~~~Risi 98 (101)
T PF13759_consen 81 -WLWHGVPPNNS-DEERISI 98 (101)
T ss_dssp -TSEEEE----S-SS-EEEE
T ss_pred -CCEEeccCcCC-CCCEEEE
Confidence 5699988 554 3444443
No 48
>PRK10371 DNA-binding transcriptional regulator MelR; Provisional
Probab=80.89 E-value=11 Score=32.96 Aligned_cols=65 Identities=9% Similarity=0.078 Sum_probs=49.9
Q ss_pred CceEEEEEEccCCCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEEEEcCCCCeEE
Q psy5204 81 GKFNLILLCWSQGQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVCYINDSMGLHK 160 (209)
Q Consensus 81 ~~~~l~~~~W~pG~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~~~~iH~ 160 (209)
...-+-+..|+|-...++|-|. ..-++.+++|...... ++ ....+.+|+++.+++ ..+|.
T Consensus 24 ~~~~~~~~~~~~~~m~~~HwH~-e~Ei~yv~~G~~~~~i-----~g-------------~~~~l~~Gd~ili~s-~~~H~ 83 (302)
T PRK10371 24 EYQRLEIEFRPPHIMPTSHWHG-QVEVNVPFDGDVEYLI-----NN-------------EKVQINQGHITLFWA-CTPHQ 83 (302)
T ss_pred CCceeEEEeeCCCCCCCCCccc-cEEEEEecCCcEEEEE-----CC-------------EEEEEcCCcEEEEec-CCccc
Confidence 4556666779998889999999 4778899999876443 11 367799999999876 56999
Q ss_pred EecCC
Q psy5204 161 VGNSG 165 (209)
Q Consensus 161 v~n~~ 165 (209)
+...+
T Consensus 84 ~~~~~ 88 (302)
T PRK10371 84 LTDPG 88 (302)
T ss_pred ccccC
Confidence 87654
No 49
>PLN00212 glutelin; Provisional
Probab=80.64 E-value=8.1 Score=36.54 Aligned_cols=70 Identities=13% Similarity=-0.030 Sum_probs=53.3
Q ss_pred CceEEEEEEccCCCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEEEEcCCCCeEE
Q psy5204 81 GKFNLILLCWSQGQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVCYINDSMGLHK 160 (209)
Q Consensus 81 ~~~~l~~~~W~pG~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~~~~iH~ 160 (209)
=..++......||...++|-|.++.-++.|++|..+...-. ..+. ..-...|.+|++..++- ..+|.
T Consensus 346 L~LSa~rv~L~~gam~~PHwn~nA~eI~yV~rG~g~vqvV~--~~g~----------~vf~~~L~~GdvfVVPq-g~~v~ 412 (493)
T PLN00212 346 IQMSATRVNLYQNALLSPFWNVNAHSVVYITQGRARVQVVS--NNGK----------TVFNGVLRPGQLLIIPQ-HYAVL 412 (493)
T ss_pred cCeeEEEEEEcCCcccCCeecCCCCEEEEEeecceEEEEEc--CCCC----------EEEEEEEcCCCEEEECC-CCeEE
Confidence 35777788899999999999999999999999999998753 1211 11356799999988864 45775
Q ss_pred Eec
Q psy5204 161 VGN 163 (209)
Q Consensus 161 v~n 163 (209)
...
T Consensus 413 ~~A 415 (493)
T PLN00212 413 KKA 415 (493)
T ss_pred Eee
Confidence 543
No 50
>PRK15457 ethanolamine utilization protein EutQ; Provisional
Probab=79.46 E-value=22 Score=30.38 Aligned_cols=73 Identities=14% Similarity=0.108 Sum_probs=49.6
Q ss_pred ceEEeee-cCCCceEEEEEEccCCCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcE
Q psy5204 71 YTRNLVD-EGDGKFNLILLCWSQGQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDV 149 (209)
Q Consensus 71 Y~R~ll~-~~~~~~~l~~~~W~pG~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v 149 (209)
|..-++. +...+.....|.|...... -|-...=+..|++|+++.+.- + .+..+++||+
T Consensus 144 ~~~d~~~~~d~s~m~aGf~~~~~~sf~---wtl~~dEi~YVLEGe~~l~Id-----G-------------~t~~l~pGDv 202 (233)
T PRK15457 144 GLTDLVTGDDGSSMAAGFMQWENAFFP---WTLNYDEIDMVLEGELHVRHE-----G-------------ETMIAKAGDV 202 (233)
T ss_pred EeeeeeccCCCCceeeEEEEEecCccc---eeccceEEEEEEEeEEEEEEC-----C-------------EEEEeCCCcE
Confidence 4444443 3468999999999974433 333345688999999998861 1 3788999999
Q ss_pred EEEcCCCCeEEEecCC
Q psy5204 150 CYINDSMGLHKVGNSG 165 (209)
Q Consensus 150 ~~~~~~~~iH~v~n~~ 165 (209)
.+++. +..|...+++
T Consensus 203 lfIPk-Gs~~hf~tp~ 217 (233)
T PRK15457 203 MFIPK-GSSIEFGTPS 217 (233)
T ss_pred EEECC-CCeEEecCCC
Confidence 99876 3454445553
No 51
>COG3257 GlxB Uncharacterized protein, possibly involved in glyoxylate utilization [General function prediction only]
Probab=78.56 E-value=19 Score=30.62 Aligned_cols=80 Identities=14% Similarity=0.164 Sum_probs=58.4
Q ss_pred CceEEEEEEccCCC--ccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEEEEcCCCCe
Q psy5204 81 GKFNLILLCWSQGQ--ASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVCYINDSMGL 158 (209)
Q Consensus 81 ~~~~l~~~~W~pG~--~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~~~~i 158 (209)
+.|+=+++...|+- .-|--|-+ ..+++.|++|++.... ++ .+..+++|+-.+++| ..-
T Consensus 59 ~tF~qyive~~p~GGs~~~e~d~~-ae~~lfVv~Ge~tv~~-----~G-------------~th~l~eggyaylPp-gs~ 118 (264)
T COG3257 59 ATFVQYIVELHPNGGSQRPEGDEG-AETFLFVVSGEITVKA-----EG-------------KTHALREGGYAYLPP-GSG 118 (264)
T ss_pred hhhhhheEEECCCCCCCCCCCCCc-ceEEEEEEeeeEEEEE-----cC-------------eEEEeccCCeEEeCC-CCc
Confidence 56777777775544 33444444 6899999999999875 11 367899999888776 569
Q ss_pred EEEecCCCCCceEEEEEeCCCCCc
Q psy5204 159 HKVGNSGPHPAAVSLHLYSPPFNI 182 (209)
Q Consensus 159 H~v~n~~~~~~avSLHvY~pp~~~ 182 (209)
|++.|.+ ....-+|+|-.++..
T Consensus 119 ~~~~N~~--~~~~rfhw~rk~Y~~ 140 (264)
T COG3257 119 WTLRNAQ--KEDSRFHWIRKRYQP 140 (264)
T ss_pred ceEeecc--CCceEEEEEeeccee
Confidence 9999997 467788999777653
No 52
>TIGR02297 HpaA 4-hydroxyphenylacetate catabolism regulatory protein HpaA. This putative transcriptional regulator, which contains both the substrate-binding, dimerization domain (pfam02311) and the helix-turn-helix DNA-binding domain (pfam00165) of the AraC famil, is located proximal to genes of the 4-hydroxyphenylacetate catabolism pathway.
Probab=73.69 E-value=17 Score=30.74 Aligned_cols=55 Identities=9% Similarity=-0.042 Sum_probs=39.6
Q ss_pred CCCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEEEEcCCCCeEEEecCC
Q psy5204 92 QGQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVCYINDSMGLHKVGNSG 165 (209)
Q Consensus 92 pG~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~~~~iH~v~n~~ 165 (209)
-|..+++|-|...--++.+++|...... ++ ....+.+|+++.+++ ..+|++....
T Consensus 32 ~~~~~~~H~H~~~~~l~~~~~G~~~~~~-----~~-------------~~~~l~~g~~~ii~~-~~~H~~~~~~ 86 (287)
T TIGR02297 32 FGRNMPVHFHDRYYQLHYLTEGSIALQL-----DE-------------HEYSEYAPCFFLTPP-SVPHGFVTDL 86 (287)
T ss_pred cCCCCCCcccccceeEEEEeeCceEEEE-----CC-------------EEEEecCCeEEEeCC-CCccccccCC
Confidence 3467899999733567789999886443 11 256788999988886 5699987654
No 53
>TIGR02466 conserved hypothetical protein. This family consists of uncharacterized proteins in Caulobacter crescentus CB15, Bdellovibrio bacteriovorus HD100, Synechococcus sp. WH 8102 (2), Silicibacter pomeroyi DSS-3 (2), and Hyphomonas neptunium ATCC 15444. The context of nearby genes differs substantially between members and does point to any specific biological role.
Probab=72.89 E-value=25 Score=29.16 Aligned_cols=23 Identities=22% Similarity=0.108 Sum_probs=19.6
Q ss_pred eEEEEEEccCCCccCCCCCCCce
Q psy5204 83 FNLILLCWSQGQASSIHDHAQAH 105 (209)
Q Consensus 83 ~~l~~~~W~pG~~spiHDH~~s~ 105 (209)
-+.|...-.+|..-..|.|+++|
T Consensus 96 ~~~W~ni~~~Gg~h~~H~Hp~~~ 118 (201)
T TIGR02466 96 QKAWVNILPQGGTHSPHLHPGSV 118 (201)
T ss_pred eeEeEEEcCCCCccCceECCCce
Confidence 36788888999999999999863
No 54
>COG3806 ChrR Transcriptional activator [Transcription]
Probab=72.64 E-value=14 Score=30.81 Aligned_cols=80 Identities=16% Similarity=0.123 Sum_probs=53.1
Q ss_pred EeeecCCCceEEEEEEccCCCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEEEEc
Q psy5204 74 NLVDEGDGKFNLILLCWSQGQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVCYIN 153 (209)
Q Consensus 74 ~ll~~~~~~~~l~~~~W~pG~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~ 153 (209)
..|-. +++..|+++--.||+..|-|.|.| .--+.|++|.+.-.+ -.+.+||..--+
T Consensus 120 v~l~~-dds~~V~llki~~g~s~P~HtH~G-~E~t~vl~G~~sde~----------------------G~y~vgD~~~~d 175 (216)
T COG3806 120 VRLPT-DDSRRVALLKIEPGRSFPDHTHVG-IERTAVLEGAFSDEN----------------------GEYLVGDFTLAD 175 (216)
T ss_pred cccCC-CCCceeEEEEeccCcccccccccc-eEEEEEEeeccccCC----------------------CccccCceeecC
Confidence 34443 689999999999999999999996 556778888765221 124455543322
Q ss_pred CCCCeEEEecCCCCCceEEEEEeCCC
Q psy5204 154 DSMGLHKVGNSGPHPAAVSLHLYSPP 179 (209)
Q Consensus 154 ~~~~iH~v~n~~~~~~avSLHvY~pp 179 (209)
+ .--|+-.-.. +..++.|=+...|
T Consensus 176 ~-~v~H~piv~~-~~eClcl~al~~~ 199 (216)
T COG3806 176 G-TVQHSPIVLP-PGECLCLAALDGP 199 (216)
T ss_pred C-ccccccccCC-CCCceEEEEcCCC
Confidence 2 2367654443 4678887776655
No 55
>PRK13502 transcriptional activator RhaR; Provisional
Probab=70.84 E-value=39 Score=28.56 Aligned_cols=54 Identities=15% Similarity=0.150 Sum_probs=40.8
Q ss_pred CCCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEEEEcCCCCeEEEecCC
Q psy5204 92 QGQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVCYINDSMGLHKVGNSG 165 (209)
Q Consensus 92 pG~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~~~~iH~v~n~~ 165 (209)
|.+..+.|.|. ..-++.+++|......- + ....+.+|++..+++ +.+|.+...+
T Consensus 27 ~~~~~~~H~h~-~~~l~~v~~G~~~~~i~-----~-------------~~~~l~~g~l~li~~-~~~H~~~~~~ 80 (282)
T PRK13502 27 PQDVFAEHTHE-FCELVMVWRGNGLHVLN-----E-------------RPYRITRGDLFYIRA-EDKHSYTSVN 80 (282)
T ss_pred CCCCCCccccc-eEEEEEEecCcEEEEEC-----C-------------EEEeecCCcEEEECC-CCcccccccC
Confidence 44557899998 57788899999886641 1 357788999988876 5799987544
No 56
>PRK13503 transcriptional activator RhaS; Provisional
Probab=68.15 E-value=22 Score=29.88 Aligned_cols=63 Identities=17% Similarity=0.183 Sum_probs=43.2
Q ss_pred ceEEEEEEccCCCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEEEEcCCCCeEEE
Q psy5204 82 KFNLILLCWSQGQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVCYINDSMGLHKV 161 (209)
Q Consensus 82 ~~~l~~~~W~pG~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~~~~iH~v 161 (209)
.+-+++..=.+....+.|-|. ..-++.|++|..+... ++ ....+.+|+++.+++ ...|..
T Consensus 14 ~~~~~~~~~~~~~~~~~H~H~-~~ei~~v~~G~~~~~i-----~~-------------~~~~l~~g~~~~i~~-~~~h~~ 73 (278)
T PRK13503 14 NAPVAIEPRLPQAAFPEHHHD-FHEIVIVEHGTGIHVF-----NG-------------QPYTLSGGTVCFVRD-HDRHLY 73 (278)
T ss_pred CCCeeeecCCccccccccccC-ceeEEEEecCceeeEe-----cC-------------CcccccCCcEEEECC-Cccchh
Confidence 333443322344667899998 4788999999987554 11 245578999999886 568987
Q ss_pred ecC
Q psy5204 162 GNS 164 (209)
Q Consensus 162 ~n~ 164 (209)
.+.
T Consensus 74 ~~~ 76 (278)
T PRK13503 74 EHT 76 (278)
T ss_pred hhc
Confidence 765
No 57
>COG3822 ABC-type sugar transport system, auxiliary component [General function prediction only]
Probab=67.89 E-value=18 Score=30.09 Aligned_cols=78 Identities=15% Similarity=0.118 Sum_probs=46.8
Q ss_pred EEEEccCCCccCCCCCCCc-eEEEEEEEeEEEEEEeeCCC-CCCCC--------CCCCccccccceEEeeCCcEEEEcCC
Q psy5204 86 ILLCWSQGQASSIHDHAQA-HCFMKMLQGCLTEVRYQEPN-HTSDE--------IGQDRPLSEISRTELAENDVCYINDS 155 (209)
Q Consensus 86 ~~~~W~pG~~spiHDH~~s-~~~~~vl~G~l~e~~y~~~~-~~~~~--------~~~~~~l~~~~~~~~~~G~v~~~~~~ 155 (209)
=+|.-.+||.||.|-|.-- .-++--=.|++..+.|..+. ++..+ .|......+.....+.||+.+.+.|
T Consensus 89 KiM~vr~gQvtPmHrH~~k~eDiinrgggtlv~el~~~d~~~~~~~ks~vtv~~dg~r~~~~ag~~lkL~PGesitL~P- 167 (225)
T COG3822 89 KIMHVRPGQVTPMHRHWRKPEDIINRGGGTLVVELWNVDLVEGQDEKSDVTVPVDGCRQTHTAGSQLKLSPGESITLPP- 167 (225)
T ss_pred eeEEeccCCcCcccccccchhhhhhcCCceEEEEEeccccccCcCCCCCeEecCCCcEEEeccceeEEECCCCcEecCC-
Confidence 4677899999999999721 01111123778888887652 22211 1222224455567788888877776
Q ss_pred CCeEEEecC
Q psy5204 156 MGLHKVGNS 164 (209)
Q Consensus 156 ~~iH~v~n~ 164 (209)
.--|..-..
T Consensus 168 g~~HsFwae 176 (225)
T COG3822 168 GLYHSFWAE 176 (225)
T ss_pred Cceeeeeec
Confidence 447766543
No 58
>PF05523 FdtA: WxcM-like, C-terminal ; InterPro: IPR008894 This entry includes FdtA (Q6T1W8 from SWISSPROT) from Aneurinibacillus thermoaerophilus, which has been characterised as a dTDP-6-deoxy-3,4-keto-hexulose isomerase []. It also includes WxcM (Q93S92 from SWISSPROT) from Xanthomonas campestris pv campestris) []. ; PDB: 2PAK_A 2PAE_A 2PA7_B 2PAM_A.
Probab=67.24 E-value=32 Score=26.36 Aligned_cols=75 Identities=24% Similarity=0.257 Sum_probs=40.5
Q ss_pred ccCCCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCc-EEEEcCCCCeEEEecCCCCC
Q psy5204 90 WSQGQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAEND-VCYINDSMGLHKVGNSGPHP 168 (209)
Q Consensus 90 W~pG~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~-v~~~~~~~~iH~v~n~~~~~ 168 (209)
=++|..=..|-|..++=++.|++|..+....... ...+..+...+ +.+++| +-.|.+.|-+. +
T Consensus 40 ~~~~~~RG~H~Hk~~~~~~~~l~Gs~~v~~~d~~--------------~~~~~~L~~~~~~L~Ipp-g~w~~~~~~s~-~ 103 (131)
T PF05523_consen 40 VPPGVIRGWHAHKKTTQWFIVLSGSFKVVLDDGR--------------EEEEFILDEPNKGLYIPP-GVWHGIKNFSE-D 103 (131)
T ss_dssp --SS--EEEEEESS--EEEEEEES-EEEEEE-SS---------------EEEEEE--TTEEEEE-T-T-EEEEE---T-T
T ss_pred CCCCCcccccccccccEEEEEEeCEEEEEEecCC--------------CcEEEEECCCCeEEEECC-chhhHhhccCC-C
Confidence 3466667899999889999999999999975321 11455666554 445554 66999999983 4
Q ss_pred ceEEEEEeCCCCC
Q psy5204 169 AAVSLHLYSPPFN 181 (209)
Q Consensus 169 ~avSLHvY~pp~~ 181 (209)
+|-|-+-+.++.
T Consensus 104 -svlLv~as~~yd 115 (131)
T PF05523_consen 104 -SVLLVLASEPYD 115 (131)
T ss_dssp --EEEEEESS---
T ss_pred -cEEEEEcCCCCC
Confidence 888877666654
No 59
>PF07385 DUF1498: Protein of unknown function (DUF1498); InterPro: IPR010864 This family consists of several hypothetical bacterial proteins of around 225 residues in length. The function of this family is unknown.; PDB: 3MPB_B 3KMH_A.
Probab=65.76 E-value=17 Score=30.78 Aligned_cols=82 Identities=15% Similarity=0.118 Sum_probs=44.7
Q ss_pred eEEEEEEccCCCccCCCCCCCce-EEEEEEEeEEEEEEeeCCCCCCC--C-------CCCCccccccceEEeeCCcEEEE
Q psy5204 83 FNLILLCWSQGQASSIHDHAQAH-CFMKMLQGCLTEVRYQEPNHTSD--E-------IGQDRPLSEISRTELAENDVCYI 152 (209)
Q Consensus 83 ~~l~~~~W~pG~~spiHDH~~s~-~~~~vl~G~l~e~~y~~~~~~~~--~-------~~~~~~l~~~~~~~~~~G~v~~~ 152 (209)
|.==+|.-.+||.||.|-|..-. =++-==-|.|..+.|..+.++.. + .|....+.+.....|.||+.+.+
T Consensus 87 YAEKim~~~~~Q~tP~H~H~~K~EDIINRGGG~L~i~l~~s~~~~~~~~~~~v~V~~DG~~~t~~aG~~l~L~PGESiTL 166 (225)
T PF07385_consen 87 YAEKIMIVREGQVTPMHFHWKKMEDIINRGGGNLVIELYNSDPDGELDADTDVTVPVDGIRRTVPAGTQLRLNPGESITL 166 (225)
T ss_dssp EEEEEEEE-BT-EEEEEEESS--EEEEEEEES-EEEEEEEB--TTSSB-SS-EEEEETTEEEEE-TT-EEEE-TT-EEEE
T ss_pred chhhheeccCCCcCCcccCcchhhheeecCCceEEEEEEeccCCCccccCCCeEEecCCcEEEecCCceEEeCCCCeEee
Confidence 55456778999999999997532 11111126899999987643321 0 12223355666788889998888
Q ss_pred cCCCCeEEEecCC
Q psy5204 153 NDSMGLHKVGNSG 165 (209)
Q Consensus 153 ~~~~~iH~v~n~~ 165 (209)
.| ...|+.-..+
T Consensus 167 ~P-g~yH~Fw~e~ 178 (225)
T PF07385_consen 167 PP-GIYHWFWGEG 178 (225)
T ss_dssp -T-TEEEEEEE-T
T ss_pred CC-CCeeeEEecC
Confidence 76 4589887654
No 60
>COG3435 Gentisate 1,2-dioxygenase [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=65.29 E-value=11 Score=33.58 Aligned_cols=96 Identities=17% Similarity=0.237 Sum_probs=63.6
Q ss_pred ccCCCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEEEEcCCCCeEEEecCCCCCc
Q psy5204 90 WSQGQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVCYINDSMGLHKVGNSGPHPA 169 (209)
Q Consensus 90 W~pG~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~~~~iH~v~n~~~~~~ 169 (209)
..||...|-|-|. +.++-.|++|.=-=+.- ......+.+||. .+.|....|.-.|.+ ++|
T Consensus 99 ilPGEvApsHrHs-qsAlRFvveG~Ga~T~V-----------------dGer~~M~~GDf-ilTP~w~wHdHgn~g-~eP 158 (351)
T COG3435 99 ILPGEVAPSHRHN-QSALRFVVEGKGAYTVV-----------------DGERTPMEAGDF-ILTPAWTWHDHGNEG-TEP 158 (351)
T ss_pred ecCcccCCccccc-ccceEEEEeccceeEee-----------------cCceeeccCCCE-EEccCceeccCCCCC-CCc
Confidence 4799999999999 56788888876332221 113566889995 446666799999997 699
Q ss_pred eEEEEEeCCCCC---cceEEEC---CCCceEEeeeeEEeccC
Q psy5204 170 AVSLHLYSPPFN---ICSSFDA---QTGVKSRARLTYWSKFG 205 (209)
Q Consensus 170 avSLHvY~pp~~---~~~~yd~---~~g~~~~~~~~~~s~~g 205 (209)
+|=|...--|+- .|..|+. +..-.++..-.+...||
T Consensus 159 ~iWlDgLDiplv~~l~~gFfe~~~e~~q~v~~~~~d~~ar~~ 200 (351)
T COG3435 159 CIWLDGLDIPLVNSLGAGFFEEHPEEQQPVTRPEGDSLARYG 200 (351)
T ss_pred eEEEcccchHHHHhhcccccccCchhcCcccCCCCCchhhcC
Confidence 999988776764 3455543 33333333334544444
No 61
>PF08007 Cupin_4: Cupin superfamily protein; InterPro: IPR022777 This signature represents primarily the cupin fold found in JmjC transcription factors. The fold is also found in lysine-specific demethylase NO66.; PDB: 2XDV_A 1VRB_B 4DIQ_B.
Probab=65.29 E-value=79 Score=27.86 Aligned_cols=93 Identities=16% Similarity=0.161 Sum_probs=50.5
Q ss_pred eEEEEEEccCC-CccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCC----C-CCCccccccceEEeeCCcEEEEcCCC
Q psy5204 83 FNLILLCWSQG-QASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDE----I-GQDRPLSEISRTELAENDVCYINDSM 156 (209)
Q Consensus 83 ~~l~~~~W~pG-~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~----~-~~~~~l~~~~~~~~~~G~v~~~~~~~ 156 (209)
+.+.+.+=++| +..++|-=. ...++-=+.|+=+=+.|......... + .......+..+.+++|||+.|++. +
T Consensus 114 ~~~n~Y~tp~g~~g~~~H~D~-~dvfvlQ~~G~K~W~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~pGD~LYlPr-G 191 (319)
T PF08007_consen 114 VGANAYLTPPGSQGFGPHYDD-HDVFVLQLEGRKRWRLYPPPDEPAPLYSDQPFKQLEEFEPVEEVVLEPGDVLYLPR-G 191 (319)
T ss_dssp EEEEEEEETSSBEESECEE-S-SEEEEEEEES-EEEEEE-SCCCTTTSSCE--TTTCG--STSEEEEE-TT-EEEE-T-T
T ss_pred cceEEEecCCCCCCccCEECC-cccEEEECCceeEEEECCCCcccccccCCCCccccccCceeEEEEECCCCEEEECC-C
Confidence 34444444444 567777444 35677778888777777633221110 0 001112445589999999999975 5
Q ss_pred CeEEEecCCCCCceEEEEEeCC
Q psy5204 157 GLHKVGNSGPHPAAVSLHLYSP 178 (209)
Q Consensus 157 ~iH~v~n~~~~~~avSLHvY~p 178 (209)
-+|...+.+ ....+|+++..+
T Consensus 192 ~~H~~~~~~-~S~hltv~~~~~ 212 (319)
T PF08007_consen 192 WWHQAVTTD-PSLHLTVGFRAP 212 (319)
T ss_dssp -EEEEEESS--EEEEEEEECCE
T ss_pred ccCCCCCCC-CceEEEEeeeCC
Confidence 599999886 466777775544
No 62
>TIGR02272 gentisate_1_2 gentisate 1,2-dioxygenase. This family consists of gentisate 1,2-dioxygenases. This ring-opening enzyme acts in salicylate degradation that goes via gentisate rather than via catechol. It converts gentisate to maleylpyruvate. Some putative gentisate 1,2-dioxygenases are excluded by a relatively high trusted cutoff score because they are too closely related to known examples of 1-hydroxy-2-naphthoate dioxygenase. Therefore some homologs may be bona fide gentisate 1,2-dioxygenases even if they score below the given cutoffs.
Probab=64.46 E-value=40 Score=30.32 Aligned_cols=59 Identities=8% Similarity=0.047 Sum_probs=44.2
Q ss_pred EEEEccCCCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEEEEcCCCCeEEEecC
Q psy5204 86 ILLCWSQGQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVCYINDSMGLHKVGNS 164 (209)
Q Consensus 86 ~~~~W~pG~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~~~~iH~v~n~ 164 (209)
.+...++|+.+..|-|.. +.++.|++|+=.-+. ++ .....++||+++++. ...|+..|.
T Consensus 253 ~~q~L~~G~~t~~~r~T~-s~Vf~VieG~G~s~i-----g~-------------~~~~W~~gD~f~vPs-W~~~~h~a~ 311 (335)
T TIGR02272 253 FIQLLPKGFRTATYRSTD-ATVFCVVEGRGQVRI-----GD-------------AVFRFSPKDVFVVPS-WHPVRFEAS 311 (335)
T ss_pred HHhccCCCCCCCCccccc-cEEEEEEeCeEEEEE-----CC-------------EEEEecCCCEEEECC-CCcEecccC
Confidence 345689999999999985 688999999876665 11 256788999877764 446776664
No 63
>TIGR03853 matur_matur probable metal-binding protein. This protein family was identified by searching with a phylogenetic profile based on an anaerobic sulfatase-maturase enzyme, which contains multiple 4Fe-4S clusters. The linkages by phylogenetic profiling and by iron-sulfur cluster-related motifs together suggest this protein may be an accessory protein to certain maturases in sulfatase/maturase systems.
Probab=63.81 E-value=15 Score=25.95 Aligned_cols=40 Identities=10% Similarity=0.060 Sum_probs=31.3
Q ss_pred hhhccCCchhHHHHHHHHHHHHhcCC--------CCCCHHHHHHHHHH
Q psy5204 13 HYLLRLTRIDTLSELIHRVYAAFEGR--------SSVDVAHVKFLMES 52 (209)
Q Consensus 13 ~~~~~~~~~~~l~~li~~l~~~~~~~--------~~~~~~~v~~ll~~ 52 (209)
|||+....+++-.+|++.|.+-|+.+ +.+++++|-+.|.+
T Consensus 9 ~mml~~~~~~t~~~L~~~i~~~FG~~arFhTCSa~~m~a~~Li~FL~~ 56 (77)
T TIGR03853 9 NLMLASGEPYTRESLKAAIEQKFGEDARFHTCSAEGMTADELLQFLLK 56 (77)
T ss_pred HHHHHcCCCcCHHHHHHHHHHHhCCCceEeecccccCCHHHHHHHHHH
Confidence 56777778889999999999999875 36777777666643
No 64
>PF11699 CENP-C_C: Mif2/CENP-C like; PDB: 2VPV_B.
Probab=59.55 E-value=62 Score=23.11 Aligned_cols=71 Identities=10% Similarity=0.078 Sum_probs=47.4
Q ss_pred CceEEEEEEccCCCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEEEEcCCCCeEE
Q psy5204 81 GKFNLILLCWSQGQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVCYINDSMGLHK 160 (209)
Q Consensus 81 ~~~~l~~~~W~pG~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~~~~iH~ 160 (209)
+.|..-.+..+||+.=+.-+=....=+..|++|.++.+.-+ .+.++..|+...++. ...=.
T Consensus 10 ~~fa~G~l~Lpp~~~K~~k~s~~~~~vF~V~~G~v~Vti~~------------------~~f~v~~G~~F~VP~-gN~Y~ 70 (85)
T PF11699_consen 10 PFFASGMLELPPGGEKPPKNSRDNTMVFYVIKGKVEVTIHE------------------TSFVVTKGGSFQVPR-GNYYS 70 (85)
T ss_dssp TS-EEEEEEE-TCCCEEEEE--SEEEEEEEEESEEEEEETT------------------EEEEEETT-EEEE-T-T-EEE
T ss_pred CCceeEEEEeCCCCccCCcccCCcEEEEEEEeCEEEEEEcC------------------cEEEEeCCCEEEECC-CCEEE
Confidence 48888899999988766655554566778999999999731 367789999877765 45889
Q ss_pred EecCCCCCceE
Q psy5204 161 VGNSGPHPAAV 171 (209)
Q Consensus 161 v~n~~~~~~av 171 (209)
+.|.+ +++|.
T Consensus 71 i~N~~-~~~a~ 80 (85)
T PF11699_consen 71 IKNIG-NEEAK 80 (85)
T ss_dssp EEE-S-SS-EE
T ss_pred EEECC-CCcEE
Confidence 99998 45543
No 65
>PF02373 JmjC: JmjC domain, hydroxylase; InterPro: IPR013129 Jumonji protein is required for neural tube formation in mice [].There is evidence of domain swapping within the jumonji family of transcription factors []. This domain is often associated with jmjN (see IPR003349 from INTERPRO) and belongs to the Cupin superfamily [].; PDB: 2YU2_A 2YU1_A 3AVR_A 3AVS_A 2OX0_B 2OQ6_B 2WWJ_A 2Q8D_A 3PDQ_A 2YBK_A ....
Probab=57.01 E-value=25 Score=25.24 Aligned_cols=33 Identities=18% Similarity=0.203 Sum_probs=22.3
Q ss_pred ceEEeeCCcEEEEcCCCCeEEEecCCCCCceEEEE
Q psy5204 140 SRTELAENDVCYINDSMGLHKVGNSGPHPAAVSLH 174 (209)
Q Consensus 140 ~~~~~~~G~v~~~~~~~~iH~v~n~~~~~~avSLH 174 (209)
-..+..+||.+.+.| ...|++-|.+ +.-+++.+
T Consensus 81 ~~~~Q~~Ge~V~i~p-g~~H~v~n~g-~~i~~a~N 113 (114)
T PF02373_consen 81 YRFVQKPGEFVFIPP-GAYHQVFNLG-DNISEAVN 113 (114)
T ss_dssp EEEEEETT-EEEE-T-T-EEEEEESS-SEEEEEEE
T ss_pred ccceECCCCEEEECC-CceEEEEeCC-ceEEEEec
Confidence 367778999888876 5799999997 45555544
No 66
>PF05899 Cupin_3: Protein of unknown function (DUF861); InterPro: IPR008579 The function of the proteins in this entry are unknown. They contain the conserved barrel domain of the 'cupin' superfamily and members are specific to plants and bacteria.; PDB: 1RC6_A 3MYX_A 1O5U_A 2K9Z_A 1LKN_A 3ES4_A 1SFN_B 3BCW_A.
Probab=56.52 E-value=61 Score=22.09 Aligned_cols=55 Identities=16% Similarity=0.230 Sum_probs=36.4
Q ss_pred CCceEEEEEEccCCCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEEEEcC
Q psy5204 80 DGKFNLILLCWSQGQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVCYIND 154 (209)
Q Consensus 80 ~~~~~l~~~~W~pG~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~ 154 (209)
++.++..+..=.||...-.... .=++.||+|+.+.+. +++ ....++|||+.+++.
T Consensus 4 ~g~~~~g~w~~~pg~~~~~~~~---~E~~~vleG~v~it~----~~G-------------~~~~~~aGD~~~~p~ 58 (74)
T PF05899_consen 4 DGVFSAGVWECTPGKFPWPYPE---DEFFYVLEGEVTITD----EDG-------------ETVTFKAGDAFFLPK 58 (74)
T ss_dssp CTSEEEEEEEEECEEEEEEESS---EEEEEEEEEEEEEEE----TTT-------------EEEEEETTEEEEE-T
T ss_pred CCCEEEEEEEECCceeEeeCCC---CEEEEEEEeEEEEEE----CCC-------------CEEEEcCCcEEEECC
Confidence 5666776666677765433322 345689999999884 222 367899999888864
No 67
>PF10678 DUF2492: Protein of unknown function (DUF2492); InterPro: IPR019620 This entry describes a family of small cytosolic proteins, about 80 amino acids in length, in which the eight invariant residues include three His residues and two Cys residues. Two pairs of these invariant residues occur in motifs HxH (where x is A or G) and CxH, both of which suggest metal-binding activity. This protein family was identified by searching with a phylogenetic profile based on an anaerobic sulphatase-maturase enzyme, which contains multiple 4Fe-4S clusters. The linkages by phylogenetic profiling and by iron-sulphur cluster-related motifs together suggest this protein may be an accessory protein to certain maturases in sulphatase/maturase systems.
Probab=55.51 E-value=27 Score=24.77 Aligned_cols=39 Identities=13% Similarity=0.146 Sum_probs=30.4
Q ss_pred hhhccCCchhHHHHHHHHHHHHhcCC--------CCCCHHHHHHHHH
Q psy5204 13 HYLLRLTRIDTLSELIHRVYAAFEGR--------SSVDVAHVKFLME 51 (209)
Q Consensus 13 ~~~~~~~~~~~l~~li~~l~~~~~~~--------~~~~~~~v~~ll~ 51 (209)
|||+....+.+..+|+++|.+-|+.+ +..+.++|-+.|.
T Consensus 11 ~mmi~~~~~~t~~~L~~ai~~~FG~~arFhTCSae~m~a~eLv~FL~ 57 (78)
T PF10678_consen 11 NMMIESGNPYTKEELKAAIIEKFGEDARFHTCSAEGMTADELVDFLE 57 (78)
T ss_pred HHHHHcCCCcCHHHHHHHHHHHhCCCceEEecCCCCCCHHHHHHHHH
Confidence 56667777889999999999999865 3677777666654
No 68
>COG4297 Uncharacterized protein containing double-stranded beta helix domain [Function unknown]
Probab=55.45 E-value=77 Score=25.11 Aligned_cols=54 Identities=24% Similarity=0.230 Sum_probs=41.8
Q ss_pred EEEEccCCCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEEEEcC
Q psy5204 86 ILLCWSQGQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVCYIND 154 (209)
Q Consensus 86 ~~~~W~pG~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~ 154 (209)
|.-+|..|-+--=|=|.+++-++.|++|+.....= ..+ ..+..++.||+..++.
T Consensus 46 W~gsW~g~Vf~yHHYHs~aHEVl~vlrgqA~l~iG---G~~------------G~el~v~~GDvlliPA 99 (163)
T COG4297 46 WFGSWRGGVFNYHHYHSGAHEVLGVLRGQAGLQIG---GAD------------GQELEVGEGDVLLIPA 99 (163)
T ss_pred CcccccccccccccccCCcceEEEEecceeEEEec---CCC------------CceeeecCCCEEEEec
Confidence 45578888888889999999999999999887761 111 1356789999888764
No 69
>KOG2757|consensus
Probab=53.70 E-value=1.2e+02 Score=27.79 Aligned_cols=68 Identities=15% Similarity=0.164 Sum_probs=50.2
Q ss_pred Eeeec-CCCceEEEEEEccCCCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEEEE
Q psy5204 74 NLVDE-GDGKFNLILLCWSQGQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVCYI 152 (209)
Q Consensus 74 ~ll~~-~~~~~~l~~~~W~pG~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~ 152 (209)
.++|. |-+.|+|.-.--++|+..-+--=.+ -.++-|++|.-.-.. +. .....+.+|+|.++
T Consensus 323 ~~~Y~Ppi~eF~v~~~~v~~g~~~~~~~~~~-~SIllv~~G~g~l~~----~t-------------~~~~~v~rG~V~fI 384 (411)
T KOG2757|consen 323 VLLYDPPIEEFAVLETKVPTGESYKFPGVDG-PSILLVLKGSGILKT----DT-------------DSKILVNRGDVLFI 384 (411)
T ss_pred eeEeCCCCcceeEEEeecCCCceEEeecCCC-ceEEEEEecceEEec----CC-------------CCceeeccCcEEEE
Confidence 34665 5789999999999999987777764 467889998765442 10 13567889999999
Q ss_pred cCCCCeE
Q psy5204 153 NDSMGLH 159 (209)
Q Consensus 153 ~~~~~iH 159 (209)
++...||
T Consensus 385 ~a~~~i~ 391 (411)
T KOG2757|consen 385 PANHPIH 391 (411)
T ss_pred cCCCCce
Confidence 8766674
No 70
>PLN02288 mannose-6-phosphate isomerase
Probab=47.56 E-value=99 Score=28.44 Aligned_cols=62 Identities=11% Similarity=0.050 Sum_probs=42.7
Q ss_pred eec-CCCceEEEEEEccCCCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEEEEcC
Q psy5204 76 VDE-GDGKFNLILLCWSQGQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVCYIND 154 (209)
Q Consensus 76 l~~-~~~~~~l~~~~W~pG~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~ 154 (209)
.|. +.+.|+|..+.-.+|+...+-... +.+++-|++|+.+... .+.. ....+++|+++++.+
T Consensus 326 ~y~~P~~eF~v~~~~l~~~~~~~~~~~~-gp~Illv~~G~~~i~~-----~~~~-----------~~~~l~~G~~~fv~a 388 (394)
T PLN02288 326 RYLPPFDEFEVDHCDVPPGASVVFPAVP-GPSVFLVIEGEGVLST-----GSSE-----------DGTAAKRGDVFFVPA 388 (394)
T ss_pred EECCCCcceEEEEEEeCCCCeEeecCCC-CCEEEEEEcCEEEEec-----CCcc-----------ceEEEeceeEEEEeC
Confidence 454 578999999988888765444344 3688899999998631 1100 124588999988875
No 71
>COG3257 GlxB Uncharacterized protein, possibly involved in glyoxylate utilization [General function prediction only]
Probab=38.87 E-value=1.4e+02 Score=25.47 Aligned_cols=83 Identities=11% Similarity=0.175 Sum_probs=52.7
Q ss_pred hhcCccchhhhhccCCcc-ceEEeeec-CCCceEEEEEEccCCCccCCC-CCCCceEEEEEEEeEEEEEEeeCCCCCCCC
Q psy5204 53 YVSNADDWKRYAQFDENR-YTRNLVDE-GDGKFNLILLCWSQGQASSIH-DHAQAHCFMKMLQGCLTEVRYQEPNHTSDE 129 (209)
Q Consensus 53 l~~~~~~w~~~~~~~~~~-Y~R~ll~~-~~~~~~l~~~~W~pG~~spiH-DH~~s~~~~~vl~G~l~e~~y~~~~~~~~~ 129 (209)
+..|.++..++..++.+. +.+.++=+ ..-.+.+-++++.||.+.|-= .|--.+| +.||+|+- .|+...+
T Consensus 150 ~~~Ne~ei~~~~m~gtdg~~attv~P~d~r~Dmhv~ivsFePGa~ip~aEtHvmEHG-lyvLeGk~---vYrLn~d---- 221 (264)
T COG3257 150 VSGNESEIEPSPMEGTDGVIATTVLPKELRFDMHVHIVSFEPGASIPYAETHVMEHG-LYVLEGKG---VYRLNNN---- 221 (264)
T ss_pred eecChhhCCCCCCCCCCCeEEEeeCccccCcceEEEEEEecCCcccchhhhhhhhcc-eEEEecce---EEeecCc----
Confidence 344556666666666554 77765433 345677788899999998843 3443345 68999974 4543322
Q ss_pred CCCCccccccceEEeeCCcEEEEcC
Q psy5204 130 IGQDRPLSEISRTELAENDVCYIND 154 (209)
Q Consensus 130 ~~~~~~l~~~~~~~~~~G~v~~~~~ 154 (209)
=..+.+||..++..
T Consensus 222 -----------wv~V~aGD~mwm~A 235 (264)
T COG3257 222 -----------WVPVEAGDYIWMGA 235 (264)
T ss_pred -----------eEEeecccEEEeec
Confidence 34577888888753
No 72
>PF11142 DUF2917: Protein of unknown function (DUF2917); InterPro: IPR021317 This bacterial family of proteins appears to be restricted to Proteobacteria.
Probab=35.94 E-value=92 Score=20.79 Aligned_cols=30 Identities=17% Similarity=0.198 Sum_probs=22.8
Q ss_pred EEccCCCccCCCCCCCceEEEEEEEeEEEEEE
Q psy5204 88 LCWSQGQASSIHDHAQAHCFMKMLQGCLTEVR 119 (209)
Q Consensus 88 ~~W~pG~~spiHDH~~s~~~~~vl~G~l~e~~ 119 (209)
+...||+..+++...+. .+.|.+|.+=.++
T Consensus 2 ~~L~~g~~~~lr~~~~~--~l~v~~G~vWlT~ 31 (63)
T PF11142_consen 2 FELAPGETLSLRAAAGQ--RLRVESGRVWLTR 31 (63)
T ss_pred EEeCCCceEEeEcCCCc--EEEEccccEEEEC
Confidence 34568888888888863 4899999877765
No 73
>PF05118 Asp_Arg_Hydrox: Aspartyl/Asparaginyl beta-hydroxylase; InterPro: IPR007803 The alpha-ketoglutarate-dependent dioxygenase aspartyl (asparaginyl) beta-hydroxylase (1.14.11.16 from EC) specifically hydroxylates one aspartic or asparagine residue in certain epidermal growth factor-like domains of a number of proteins. Its action may be due to histidine-675, which, when mutated to an alanine residue, causes the loss of enzymatic activity in the protein []. An invertebrate alpha-ketoglutarate-dependent aspartyl/asparaginyl beta-hydroxylase, which posttranslationally hydroxylates specific aspartyl or asparaginyl residues within epidermal growth factor-like modules [], activity was found to be similar to that of the purified mammalian aspartyl/asparaginyl beta-hydroxylase with respect to cofactor requirements, stereochemistry and substrate sequence specificity []. This enzyme requires Fe2+ as a cofactor. Some vitamin K-dependent coagulation factors, as well as synthetic peptides based on the structure of the first epidermal growth factor domain of human coagulation factor IX or X, can act as acceptors.; GO: 0018193 peptidyl-amino acid modification, 0030176 integral to endoplasmic reticulum membrane; PDB: 3RCQ_A 1E5S_A 1E5R_B.
Probab=35.60 E-value=54 Score=25.97 Aligned_cols=75 Identities=15% Similarity=0.091 Sum_probs=39.8
Q ss_pred ceEEEEEEccCCCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCcccccc-ceEEeeCCcEEEEcCCCCeEE
Q psy5204 82 KFNLILLCWSQGQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEI-SRTELAENDVCYINDSMGLHK 160 (209)
Q Consensus 82 ~~~l~~~~W~pG~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~-~~~~~~~G~v~~~~~~~~iH~ 160 (209)
...+.+....||...++|--.. ...+.+.-|-.. +.++ +.+... .....++|++..|++ .-.|.
T Consensus 79 ~~~~~~s~l~pg~~I~pH~d~~-~~~lR~Hl~L~~------p~~~-------~~~~v~~~~~~w~~G~~~~fD~-s~~H~ 143 (163)
T PF05118_consen 79 LGRVRFSRLPPGTHIKPHRDPT-NLRLRLHLPLIV------PNPG-------CYIRVGGETRHWREGECWVFDD-SFEHE 143 (163)
T ss_dssp CEEEEEEEEECTEEEEEE-SS--TTEEEEEEEEC--------STT-------EEEEETTEEEB--CTEEEEE-T-TS-EE
T ss_pred hhhEEEEEECCCCEECCeeCCC-CcceEEEEEEEc------CCCC-------eEEEECCeEEEeccCcEEEEeC-CEEEE
Confidence 3457777889999999998763 332222222211 1011 112222 255668999988887 46999
Q ss_pred EecCCCCCceEE
Q psy5204 161 VGNSGPHPAAVS 172 (209)
Q Consensus 161 v~n~~~~~~avS 172 (209)
+.|.+ +++-|.
T Consensus 144 ~~N~~-~~~Rv~ 154 (163)
T PF05118_consen 144 VWNNG-DEDRVV 154 (163)
T ss_dssp EEESS-SS-EEE
T ss_pred EEeCC-CCCEEE
Confidence 99998 454443
No 74
>COG3450 Predicted enzyme of the cupin superfamily [General function prediction only]
Probab=34.86 E-value=2e+02 Score=21.81 Aligned_cols=70 Identities=20% Similarity=0.220 Sum_probs=50.0
Q ss_pred ccCCccceEEeeecCCCceEEEEEEccCCCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEe
Q psy5204 65 QFDENRYTRNLVDEGDGKFNLILLCWSQGQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTEL 144 (209)
Q Consensus 65 ~~~~~~Y~R~ll~~~~~~~~l~~~~W~pG~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~ 144 (209)
.-+|..-++++...+|+.++.-+..=-||+..-.=+= +-+..+|+|..+.+ ++++ ....+
T Consensus 27 ~G~p~~~t~~~~~~~dg~~~~GiWe~TpG~~r~~y~~---~E~chil~G~v~~T----~d~G-------------e~v~~ 86 (116)
T COG3450 27 LGDPSAATWNLYGAPDGQVETGIWECTPGKFRVTYDE---DEFCHILEGRVEVT----PDGG-------------EPVEV 86 (116)
T ss_pred cCCccceehheeeCCCCCeeEeEEEecCccceEEccc---ceEEEEEeeEEEEE----CCCC-------------eEEEE
Confidence 3456667777777667888888888788887665544 45567999998866 2333 35678
Q ss_pred eCCcEEEEcC
Q psy5204 145 AENDVCYIND 154 (209)
Q Consensus 145 ~~G~v~~~~~ 154 (209)
++||+.+|.+
T Consensus 87 ~aGD~~~~~~ 96 (116)
T COG3450 87 RAGDSFVFPA 96 (116)
T ss_pred cCCCEEEECC
Confidence 8999888865
No 75
>PF03991 Prion_octapep: Copper binding octapeptide repeat; InterPro: IPR020949 Prion protein (PrP-c) [, , ] is a small glycoprotein found in high quantity in the brain of animals infected with certain degenerative neurological diseases, such as sheep scrapie and bovine spongiform encephalopathy (BSE), and the human dementias Creutzfeldt-Jacob disease (CJD) and Gerstmann-Straussler syndrome (GSS). PrP-c is encoded in the host genome and is expressed both in normal and infected cells. During infection, however, the PrP-c molecule become altered (conformationally rather than at the amino acid level) to an abnormal isoform, PrP-sc. In detergent-treated brain extracts from infected individuals, fibrils composed of polymers of PrP-sc, namely scrapie-associated fibrils or prion rods, can be evidenced by electron microscopy. The precise function of the normal PrP isoform in healthy individuals remains unknown. Several results, mainly obtained in transgenic animals, indicate that PrP-c might play a role in long-term potentiation, in sleep physiology, in oxidative burst compensation (PrP can fix four Cu2+ through its octarepeat domain), in interactions with the extracellular matrix (PrP-c can bind to the precursor of the laminin receptor, LRP), in apoptosis and in signal transduction (costimulation of PrP-c induces a modulation of Fyn kinase phosphorylation) []. The normal isoform, PrP-c, is anchored at the cell membrane, in rafts, through a glycosyl phosphatidyl inositol (GPI); its half-life at the cell surface is 5 h, after which the protein is internalised through a caveolae-dependent mechanism and degraded in the endolysosome compartment. Conversion between PrP-c and PrP-sc occurs likely during the internalisation process. This repeat is found at the amino terminus of mammalian prion proteins. It has been shown to bind to copper [].
Probab=34.25 E-value=16 Score=14.78 Aligned_cols=6 Identities=17% Similarity=0.180 Sum_probs=3.7
Q ss_pred CCCceE
Q psy5204 101 HAQAHC 106 (209)
Q Consensus 101 H~~s~~ 106 (209)
|+|.||
T Consensus 2 hgG~Wg 7 (8)
T PF03991_consen 2 HGGGWG 7 (8)
T ss_pred CCCcCC
Confidence 666665
No 76
>KOG3706|consensus
Probab=33.43 E-value=28 Score=32.99 Aligned_cols=80 Identities=15% Similarity=0.147 Sum_probs=49.8
Q ss_pred CccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCC--C-----CCccccccceEEeeCCcEEEEcCCCCeEEEecCCC
Q psy5204 94 QASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEI--G-----QDRPLSEISRTELAENDVCYINDSMGLHKVGNSGP 166 (209)
Q Consensus 94 ~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~--~-----~~~~l~~~~~~~~~~G~v~~~~~~~~iH~v~n~~~ 166 (209)
|.=++|--.- .+|+--++|.=.=..|.........+ . ...--++.-+.++.+||+.||+- +.||+-.+++
T Consensus 329 qGfaPHyDdI-eaFvlQvEGrK~Wrly~P~~~~eel~l~sS~Nf~eedlgePV~e~vle~GDllYfPR-G~IHQA~t~~- 405 (629)
T KOG3706|consen 329 QGFAPHYDDI-EAFVLQVEGRKHWRLYHPTVPLEELALVSSDNFTEEDLGEPVHEFVLEPGDLLYFPR-GTIHQADTPA- 405 (629)
T ss_pred CCCCCchhhh-hhhhheeccceeeEeecCCCcHhhhhhccCCCCChhHhCCchHHhhcCCCcEEEecC-cceeeccccc-
Confidence 4456665553 57888888988878886432211000 0 00011334478899999999963 5699998774
Q ss_pred CCceEEEEEeCC
Q psy5204 167 HPAAVSLHLYSP 178 (209)
Q Consensus 167 ~~~avSLHvY~p 178 (209)
-.-||||=--
T Consensus 406 --~vHSlHvTlS 415 (629)
T KOG3706|consen 406 --LVHSLHVTLS 415 (629)
T ss_pred --hhceeEEEee
Confidence 6778887533
No 77
>PF10902 DUF2693: Protein of unknown function (DUF2693); InterPro: IPR024401 This family of proteins is found in bacteria and bacteriophages. Its function is unknown.
Probab=32.93 E-value=1.5e+02 Score=21.17 Aligned_cols=27 Identities=19% Similarity=0.108 Sum_probs=22.1
Q ss_pred CcceEEECCCCceEEeeee-EEeccCee
Q psy5204 181 NICSSFDAQTGVKSRARLT-YWSKFGHK 207 (209)
Q Consensus 181 ~~~~~yd~~~g~~~~~~~~-~~s~~g~~ 207 (209)
..+.+||.+.+.++..++. +.|..|-+
T Consensus 49 ~s~~yfDve~~~WRSFk~dnLIsV~gl~ 76 (83)
T PF10902_consen 49 TSVRYFDVEKKGWRSFKIDNLISVNGLK 76 (83)
T ss_pred ceEEEEEeccCceeeeeheeEEEECCcC
Confidence 3489999999999988884 88887754
No 78
>COG3435 Gentisate 1,2-dioxygenase [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=32.11 E-value=1.9e+02 Score=26.00 Aligned_cols=57 Identities=16% Similarity=0.226 Sum_probs=42.0
Q ss_pred EccCCCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEEEEcCCCCeEEEecCC
Q psy5204 89 CWSQGQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVCYINDSMGLHKVGNSG 165 (209)
Q Consensus 89 ~W~pG~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~~~~iH~v~n~~ 165 (209)
..+||..+-.|-|.++. ++.|.+|.=.... + ..+....+||+..++ +-.-|+..|.+
T Consensus 267 lL~~Gf~~~~~r~t~s~-iy~V~eGsg~~~I-----g-------------~~rf~~~~~D~fvVP-sW~~~~~~~gs 323 (351)
T COG3435 267 LLPPGFHGKAHRHTDST-IYHVVEGSGYTII-----G-------------GERFDWSAGDIFVVP-SWAWHEHVNGS 323 (351)
T ss_pred hcCCcccCCceeccCCE-EEEEEecceeEEE-----C-------------CEEeeccCCCEEEcc-CcceeecccCC
Confidence 37999999999999865 5889998755443 0 125566789976665 46788888875
No 79
>PF03392 OS-D: Insect pheromone-binding family, A10/OS-D; InterPro: IPR005055 A class of small (14-20 Kd) water-soluble proteins, called odorant binding proteins (OBPs), first discovered in the insect sensillar lymph but also in the mucus of vertebrates, is postulated to mediate the solubilisation of hydrophobic odorant molecules, and thereby to facilitate their transport to the receptor neurons. The product of a gene expressed in the olfactory system of Drosophila melanogaster (Fruit fly), OS-D, shares features common to vertebrate odorant-binding proteins, but has a primary structure unlike odorant-binding proteins []. OS-D derivatives have subsequently been found in chemosensory organs of phylogenetically distinct insects, including cockroaches, phasmids and moths, suggesting that OS-D-like proteins seem to be conserved in the insect phylum.; PDB: 1KX9_A 1N8U_A 1KX8_A 1K19_A 1N8V_A 2GVS_A 2JNT_A.
Probab=30.04 E-value=72 Score=23.25 Aligned_cols=63 Identities=14% Similarity=0.242 Sum_probs=36.1
Q ss_pred hhhhhhhhccC-CchhHHHHHHHHHHHHhcCC----CCCCHHHHHHHHHHhhcC-ccchhhhhc-cCCcc
Q psy5204 8 INRVEHYLLRL-TRIDTLSELIHRVYAAFEGR----SSVDVAHVKFLMESYVSN-ADDWKRYAQ-FDENR 70 (209)
Q Consensus 8 ~~r~~~~~~~~-~~~~~l~~li~~l~~~~~~~----~~~~~~~v~~ll~~l~~~-~~~w~~~~~-~~~~~ 70 (209)
++++-.|++.- |=...-.+|=+.|.+++... .+-.-+.++..+.-+..+ |++|..+.. +||++
T Consensus 20 ~~~y~~ClldkgpCt~~~~~lK~~ipeal~t~C~KCt~kQK~~~~kv~~~l~~~~P~~w~~l~~KyDp~~ 89 (95)
T PF03392_consen 20 LKSYIDCLLDKGPCTPEGKELKKVIPEALKTNCAKCTPKQKENARKVIKFLKKNYPDEWEELVKKYDPEG 89 (95)
T ss_dssp HHHHHHHHTSSSTSHHHHHHHHHHHHHHHHCTTTTS-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHTTT-
T ss_pred HHHHHHHHhcCCCCCHHHHHHHHHHHHHHHccccCCCHHHHHHHHHHHHHHHHcCHHHHHHHHHHHCCCc
Confidence 45555666642 22223455555567777654 223345566666666663 999998775 88875
No 80
>PRK09685 DNA-binding transcriptional activator FeaR; Provisional
Probab=29.88 E-value=2.8e+02 Score=23.41 Aligned_cols=84 Identities=15% Similarity=0.137 Sum_probs=46.7
Q ss_pred ceEEeeecCCCceEEEEEEccCCC-----ccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEee
Q psy5204 71 YTRNLVDEGDGKFNLILLCWSQGQ-----ASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELA 145 (209)
Q Consensus 71 Y~R~ll~~~~~~~~l~~~~W~pG~-----~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~ 145 (209)
+++..... -+.+.|.-+.-.+.. ...-|++....-++.+++|...... ++ ....+.
T Consensus 34 ~~~~~~~~-~~~~~l~~~~~~~~~~~R~~~~i~~~~~~~~~l~~~~~G~~~~~~-----~g-------------~~~~l~ 94 (302)
T PRK09685 34 IGELSTHD-AGGLKLSTVTTNAVNLSRTWQEIKHSDDAHFFTVFQLSGHAIIEQ-----DD-------------RQVQLA 94 (302)
T ss_pred eEEEeeee-eCCEEEEEEecCCceEEeChHHhccCCCCcEEEEEEecceEEEEE-----CC-------------eEEEEc
Confidence 34443433 345555555444431 1224555554445567888777553 11 256789
Q ss_pred CCcEEEEcCCCCeEEEecCCCCCceEEEEE
Q psy5204 146 ENDVCYINDSMGLHKVGNSGPHPAAVSLHL 175 (209)
Q Consensus 146 ~G~v~~~~~~~~iH~v~n~~~~~~avSLHv 175 (209)
+|+++.+++ ...|.+...+ +...++|++
T Consensus 95 ~G~~~l~~~-~~p~~~~~~~-~~~~~~l~i 122 (302)
T PRK09685 95 AGDITLIDA-SRPCSIYPQG-LSEQISLLL 122 (302)
T ss_pred CCCEEEEEC-CCCcEeecCC-CceeEEEEc
Confidence 999988875 4577776543 234555554
No 81
>PF06249 EutQ: Ethanolamine utilisation protein EutQ; InterPro: IPR010424 The eut operon of Salmonella typhimurium encodes proteins involved in the cobalamin-dependent degradation of ethanolamine. The role of EutQ in this process is unclear [].; PDB: 2PYT_B 3LWC_A.
Probab=28.09 E-value=2.2e+02 Score=22.71 Aligned_cols=71 Identities=11% Similarity=0.127 Sum_probs=41.8
Q ss_pred hhccCCc--cceEEeeec-CCCceEEEEEEccCCCccCCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCcccccc
Q psy5204 63 YAQFDEN--RYTRNLVDE-GDGKFNLILLCWSQGQASSIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEI 139 (209)
Q Consensus 63 ~~~~~~~--~Y~R~ll~~-~~~~~~l~~~~W~pG~~spiHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~ 139 (209)
+...+|. -|.+-++.. .+.+.+.-.|.|... +.--.-.-.=+..||+|+|.... ++
T Consensus 54 ~dtg~p~~~~~~~dv~~~~e~~~l~~Gf~~le~~---~f~wtl~YDEi~~VlEG~L~i~~-----~G------------- 112 (152)
T PF06249_consen 54 FDTGNPGDKVYIKDVFSSDESPRLSAGFMELEKT---SFPWTLTYDEIKYVLEGTLEISI-----DG------------- 112 (152)
T ss_dssp -GGGTTS--EEE-EEE-GGGT-SSEEEEEEEEEE---EEEEE-SSEEEEEEEEEEEEEEE-----TT-------------
T ss_pred ccCCCCCccEEEEEeccCCCCCceeeEEEEEeCC---CccEEeecceEEEEEEeEEEEEE-----CC-------------
Confidence 3334443 377777763 346889999999863 22222222356779999998763 12
Q ss_pred ceEEeeCCcEEEEcC
Q psy5204 140 SRTELAENDVCYIND 154 (209)
Q Consensus 140 ~~~~~~~G~v~~~~~ 154 (209)
+....++||+.+|+.
T Consensus 113 ~~~~A~~GDvi~iPk 127 (152)
T PF06249_consen 113 QTVTAKPGDVIFIPK 127 (152)
T ss_dssp EEEEEETT-EEEE-T
T ss_pred EEEEEcCCcEEEECC
Confidence 367788999999975
No 82
>TIGR03367 queuosine_QueD queuosine biosynthesis protein QueD. Members of this protein family, closely related to eukaryotic 6-pyruvoyl tetrahydrobiopterin synthase enzymes, are the QueD protein of queuosine biosynthesis. Queuosine is a hypermodified base in the wobble position of tRNAs for Tyr, His, Asp, and Asn in many species. This modification, although widespread, appears not to be important for viability. The queuosine precursor made by this enzyme may be converted instead to archeaosine as in some Archaea.
Probab=26.58 E-value=87 Score=22.35 Aligned_cols=24 Identities=17% Similarity=0.254 Sum_probs=19.6
Q ss_pred ccCCCccCCCCCCCceEEEEEEEeEE
Q psy5204 90 WSQGQASSIHDHAQAHCFMKMLQGCL 115 (209)
Q Consensus 90 W~pG~~spiHDH~~s~~~~~vl~G~l 115 (209)
+.+|....+|-|. |-+...+.|..
T Consensus 16 ~~~g~c~~lHGH~--y~v~v~~~~~~ 39 (92)
T TIGR03367 16 GYPGKCANLHGHT--YKVEVTVSGEV 39 (92)
T ss_pred CCCCCccCcCCcc--EEEEEEEEEee
Confidence 5678999999996 88888888875
No 83
>PRK12787 fliX flagellar assembly regulator FliX; Reviewed
Probab=26.05 E-value=1.8e+02 Score=22.83 Aligned_cols=46 Identities=17% Similarity=0.081 Sum_probs=32.2
Q ss_pred chhhhhhhhhhhhcc-CCchhHHHHHHHHHHHHhcCCCCCCHHHHHH
Q psy5204 3 QLLTKINRVEHYLLR-LTRIDTLSELIHRVYAAFEGRSSVDVAHVKF 48 (209)
Q Consensus 3 ~~~~~~~r~~~~~~~-~~~~~~l~~li~~l~~~~~~~~~~~~~~v~~ 48 (209)
.++..|+++++-++. .-+...|.+|...+++.-+...++-++.|-.
T Consensus 76 ~~LD~Ld~Lk~aLL~G~~~~~~L~~L~~~~~e~r~~s~DP~L~~vL~ 122 (138)
T PRK12787 76 TALDVLDELKIGLLSGTLDAATLARLRRAVRELRAASGDPGLDAVLD 122 (138)
T ss_pred HHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHhccCCCChhHHHHHH
Confidence 467889999988875 5567888999888887555444555554433
No 84
>PF03487 IL13: Interleukin-13; InterPro: IPR020470 Interleukin-13 (IL-13) is a pleiotropic cytokine which may be important in the regulation of the inflammatory and immune responses []. It inhibits inflammatory cytokine production and synergises with IL-2 in regulating interferon-gamma synthesis. The sequences of IL-4 and IL-13 are distantly related.; PDB: 3G6D_A 3L5W_J 3BPO_A 1GA3_A 1IK0_A 3L5X_A 3L5Y_A 1IJZ_A 3LB6_B.
Probab=25.86 E-value=70 Score=19.74 Aligned_cols=18 Identities=22% Similarity=0.292 Sum_probs=13.4
Q ss_pred cCCchhHHHHHHHHHHHH
Q psy5204 17 RLTRIDTLSELIHRVYAA 34 (209)
Q Consensus 17 ~~~~~~~l~~li~~l~~~ 34 (209)
+.|.+.+|++||+++-.+
T Consensus 22 Pvp~~~alkELIeELvNI 39 (43)
T PF03487_consen 22 PVPSSTALKELIEELVNI 39 (43)
T ss_dssp -S-HHHHHHHHHHHHHHH
T ss_pred CCCchHHHHHHHHHHHhh
Confidence 457788999999988765
No 85
>PF15590 Imm15: Immunity protein 15
Probab=24.97 E-value=67 Score=22.16 Aligned_cols=27 Identities=22% Similarity=0.487 Sum_probs=18.4
Q ss_pred HHHHHHHhhcCccchh-hhhccCCccce
Q psy5204 46 VKFLMESYVSNADDWK-RYAQFDENRYT 72 (209)
Q Consensus 46 v~~ll~~l~~~~~~w~-~~~~~~~~~Y~ 72 (209)
|...|+++...+++|+ .|..++.++|.
T Consensus 11 i~n~Lqkv~~~~d~We~~y~DP~D~r~W 38 (69)
T PF15590_consen 11 ITNRLQKVASSPDGWETLYQDPRDGRYW 38 (69)
T ss_pred HHHHHHHHhcCCcchhhhccCCCCCcee
Confidence 4455777788888997 45557666643
No 86
>PF02671 PAH: Paired amphipathic helix repeat; InterPro: IPR003822 This family contains the paired amphipathic helix (PAH) repeat. The family contains the eukaryotic Sin3 proteins, which have at least three PAH domains (PAH1, PAH2, and PAH3). Sin3 proteins are components of a co-repressor complex that silences transcription, playing important roles in the transition between proliferation and differentiation. Sin3 proteins are recruited to the DNA by various DNA-binding transcription factors such as the Mad family of repressors, Mnt/Rox, PLZF, MeCP2, p53, REST/NRSF, MNFbeta, Sp1, TGIF and Ume6 []. Sin3 acts as a scaffold protein that in turn recruits histone-binding proteins RbAp46/RbAp48 and histone deacetylases HDAC1/HDAC2, which deacetylate the core histones resulting in a repressed state of the chromatin []. The PAH domains are protein-protein interaction domains through which Sin3 fulfils its role as a scaffold. The PAH2 domain of Sin3 can interact with a wide range of unrelated and structurally diverse transcription factors that bind using different interaction motifs. For example, the Sin3 PAH2 domain can interact with the unrelated Mad and HBP1 factors using alternative interaction motifs that involve binding in opposite helical orientations [].; GO: 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus; PDB: 1S5Q_B 2L9S_B 1G1E_B 1S5R_B 2CR7_A 2CZY_A 2LD7_B 2RMR_A 2RMS_A 1PD7_A ....
Probab=23.86 E-value=1.8e+02 Score=17.70 Aligned_cols=43 Identities=12% Similarity=0.246 Sum_probs=29.4
Q ss_pred hHHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhhcCccch-hhhhcc
Q psy5204 22 DTLSELIHRVYAAFEGRSSVDVAHVKFLMESYVSNADDW-KRYAQF 66 (209)
Q Consensus 22 ~~l~~li~~l~~~~~~~~~~~~~~v~~ll~~l~~~~~~w-~~~~~~ 66 (209)
..-++|++.++.--.+ ..+..++...+..+..+..++ ..+..|
T Consensus 2 ~~Y~~FL~il~~y~~~--~~~~~~v~~~v~~Ll~~hpdLl~~F~~F 45 (47)
T PF02671_consen 2 EVYNEFLKILNDYKKG--RISRSEVIEEVSELLRGHPDLLEEFNRF 45 (47)
T ss_dssp HHHHHHHHHHHHHHCT--CSCHHHHHHHHHHHTTT-HHHHHHHHHH
T ss_pred hHHHHHHHHHHHHHhc--CCCHHHHHHHHHHHHccCHHHHHHHHhh
Confidence 3457888888775443 688899999999998864444 455443
No 87
>PF10264 Stork_head: Winged helix Storkhead-box1 domain; InterPro: IPR019391 In humans the Storkhead-box protein controls polyploidization of extravillus trophoblast and is implicated in pre-eclampsia []. This entry represents the conserved N-terminal winged-helix domain, which is likely to bind DNA.
Probab=23.23 E-value=2.2e+02 Score=20.27 Aligned_cols=40 Identities=8% Similarity=0.038 Sum_probs=32.8
Q ss_pred cCCchhHHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhhcC
Q psy5204 17 RLTRIDTLSELIHRVYAAFEGRSSVDVAHVKFLMESYVSN 56 (209)
Q Consensus 17 ~~~~~~~l~~li~~l~~~~~~~~~~~~~~v~~ll~~l~~~ 56 (209)
..-.+.+++.+++.|.+.+.+-..++.+.|...|..|...
T Consensus 25 ~~~~~at~E~l~~~L~~~yp~i~~Ps~e~l~~~L~~Li~e 64 (80)
T PF10264_consen 25 AAGQPATQETLREHLRKHYPGIAIPSQEVLYNTLGTLIKE 64 (80)
T ss_pred ccCCcchHHHHHHHHHHhCCCCCCCCHHHHHHHHHHHHHc
Confidence 4556788889999999988877678888899999888874
No 88
>COG2000 Predicted Fe-S protein [General function prediction only]
Probab=22.75 E-value=1.2e+02 Score=25.30 Aligned_cols=71 Identities=17% Similarity=0.143 Sum_probs=40.6
Q ss_pred eEEEEEEEeEEEEEEeeCC-CCCCCCCCCCccccccceEEeeCCcEEEEcCCCC--eEEEecCCCCCceEEEEEeCCC
Q psy5204 105 HCFMKMLQGCLTEVRYQEP-NHTSDEIGQDRPLSEISRTELAENDVCYINDSMG--LHKVGNSGPHPAAVSLHLYSPP 179 (209)
Q Consensus 105 ~~~~~vl~G~l~e~~y~~~-~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~~~~--iH~v~n~~~~~~avSLHvY~pp 179 (209)
.|+++||+|-..-...... +..+.. ..+-+.....+..|+++..-|-+. -|--+........+++|+-||.
T Consensus 74 ~~~~gvlDgyead~vl~plPne~scr----e~lf~~~~~~lekgdiiRyRp~Gcpi~hfaeIld~~kG~~tvhIvGpr 147 (226)
T COG2000 74 MKIVGVLDGYEADFVLKPLPNECSCR----EILFPFKSKELEKGDIIRYRPLGCPITHFAEILDEAKGLVTVHIVGPR 147 (226)
T ss_pred eeEEEEecceeeceeeccCCCccccc----eeEEeecccccccccEEEeccCCCcchhHHHHHHhhcCeEEEEEeccc
Confidence 5888888887766444322 222210 123444455578888776544222 3433333334578999999998
No 89
>PHA02091 hypothetical protein
Probab=21.64 E-value=85 Score=21.29 Aligned_cols=15 Identities=20% Similarity=0.319 Sum_probs=12.4
Q ss_pred CCceEEEEEeCCCCC
Q psy5204 167 HPAAVSLHLYSPPFN 181 (209)
Q Consensus 167 ~~~avSLHvY~pp~~ 181 (209)
|.+-.+|.||||--.
T Consensus 43 d~~~w~lnvygp~~~ 57 (72)
T PHA02091 43 DRGMWTLNVYGPLGK 57 (72)
T ss_pred CCceEEEEeecccch
Confidence 468999999998754
No 90
>PF14499 DUF4437: Domain of unknown function (DUF4437); PDB: 2QDR_A.
Probab=21.62 E-value=48 Score=28.60 Aligned_cols=99 Identities=16% Similarity=0.078 Sum_probs=44.9
Q ss_pred HHHHHHHhhcCccchhhhhccCCcc-ceEEeeecC-CCceEEEEEEccCCCccCCCCCCCceEEEEEEEeEEEEEEeeCC
Q psy5204 46 VKFLMESYVSNADDWKRYAQFDENR-YTRNLVDEG-DGKFNLILLCWSQGQASSIHDHAQAHCFMKMLQGCLTEVRYQEP 123 (209)
Q Consensus 46 v~~ll~~l~~~~~~w~~~~~~~~~~-Y~R~ll~~~-~~~~~l~~~~W~pG~~spiHDH~~s~~~~~vl~G~l~e~~y~~~ 123 (209)
-+..+.-+..+.-.|.......+.. ..-.|--.+ ++.+.=+++-.+.|....||.|....= ..|++|++....++.
T Consensus 132 ~~~~ivwld~~dl~W~~~~~~~~~g~~~a~Lwgd~~~g~~~gll~kLPagf~g~i~~h~~~er-aVvI~G~~~~~~~~~- 209 (251)
T PF14499_consen 132 DKDNIVWLDASDLEWISAPPGPPPGAQIAFLWGDPNTGQYTGLLLKLPAGFTGRIHTHASNER-AVVISGELDYQSYGA- 209 (251)
T ss_dssp GGG-EEEEECCCS--EE-SSSTT-SEEEEEEEE-TTS-EE-EEEEE-SSEE--SEEE--S-EE-EEEEEEEEEETTEEE-
T ss_pred ccccceEeccccCCccccCCCCCCcceEEEEecCCCCCceeeEEEEcCCCCcCceeccCCceE-EEEEEeEEEEeeccc-
Confidence 3344444444444554333222223 344444343 456778888899999999999996543 458999999876542
Q ss_pred CCCCCCCCCCccccccceEEeeCCcEEEEcCCCCeEEEe
Q psy5204 124 NHTSDEIGQDRPLSEISRTELAENDVCYINDSMGLHKVG 162 (209)
Q Consensus 124 ~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~~~~iH~v~ 162 (209)
.....|.+|.-+.. +..-.|++.
T Consensus 210 ---------------~~~~~L~~GSYf~s-~~~~~H~~~ 232 (251)
T PF14499_consen 210 ---------------SNFGTLDPGSYFGS-PGHITHGIF 232 (251)
T ss_dssp ---------------ETTEEEEE-TT-EE---E------
T ss_pred ---------------CCCccccCCccccc-CCccccccc
Confidence 23566888884443 334599997
No 91
>PRK10572 DNA-binding transcriptional regulator AraC; Provisional
Probab=21.48 E-value=4.9e+02 Score=21.84 Aligned_cols=49 Identities=8% Similarity=-0.050 Sum_probs=33.8
Q ss_pred CCCCCCCceEEEEEEEeEEEEEEeeCCCCCCCCCCCCccccccceEEeeCCcEEEEcCCCCeEEEecCC
Q psy5204 97 SIHDHAQAHCFMKMLQGCLTEVRYQEPNHTSDEIGQDRPLSEISRTELAENDVCYINDSMGLHKVGNSG 165 (209)
Q Consensus 97 piHDH~~s~~~~~vl~G~l~e~~y~~~~~~~~~~~~~~~l~~~~~~~~~~G~v~~~~~~~~iH~v~n~~ 165 (209)
.+|...+ +.+..+++|...... ++ ....+++|++..+++ +.+|.+....
T Consensus 43 r~~~~~~-~~i~~~~~G~~~~~~-----~~-------------~~~~~~~g~~i~i~p-~~~h~~~~~~ 91 (290)
T PRK10572 43 RPLGMKG-YILNLTIRGQGVIFN-----GG-------------RAFVCRPGDLLLFPP-GEIHHYGRHP 91 (290)
T ss_pred cCCCccc-eEEEEEEeccEEEec-----CC-------------eeEecCCCCEEEECC-CCceeeccCC
Confidence 3455553 667778999887432 11 356788999998887 5699887643
No 92
>TIGR02837 spore_II_R stage II sporulation protein R. A comparative genome analysis of all sequenced genomes of shows a number of proteins conserved strictly among the endospore-forming subset of the Firmicutes. This protein, a member of this panel, is designated stage II sporulation protein R.
Probab=21.17 E-value=4.5e+02 Score=21.32 Aligned_cols=36 Identities=14% Similarity=0.342 Sum_probs=27.1
Q ss_pred cCCccceEEeeecCCCceEEEEEEccCCCccCCCCCCCceEEEE
Q psy5204 66 FDENRYTRNLVDEGDGKFNLILLCWSQGQASSIHDHAQAHCFMK 109 (209)
Q Consensus 66 ~~~~~Y~R~ll~~~~~~~~l~~~~W~pG~~spiHDH~~s~~~~~ 109 (209)
|....|+-..+ |-+.|+.+.+.-..|+. |. +||++.
T Consensus 118 FPtK~YG~~~~--PaG~YeAlrI~IG~g~G-----~N-WWCVlf 153 (168)
T TIGR02837 118 FPTKLYGNIVL--PAGEYEALRILIGEGAG-----AN-WWCVVF 153 (168)
T ss_pred CCCcccCCEec--cCCceEEEEEEecCcCC-----cc-eEEEec
Confidence 44455877666 56899999999999984 54 899754
No 93
>PF06972 DUF1296: Protein of unknown function (DUF1296); InterPro: IPR009719 This family represents a conserved region approximately 60 residues long within a number of plant proteins of unknown function.
Probab=20.64 E-value=2.5e+02 Score=18.88 Aligned_cols=37 Identities=0% Similarity=0.020 Sum_probs=28.0
Q ss_pred hhHHHHHHHHHHHHhcCCCCCCHHHHHHHHHHhhcCccch
Q psy5204 21 IDTLSELIHRVYAAFEGRSSVDVAHVKFLMESYVSNADDW 60 (209)
Q Consensus 21 ~~~l~~li~~l~~~~~~~~~~~~~~v~~ll~~l~~~~~~w 60 (209)
|++.+..||.|+++.+.- +=++|-..|....-||++=
T Consensus 2 P~~~rk~VQ~iKEiv~~h---se~eIya~L~ecnMDpnea 38 (60)
T PF06972_consen 2 PAASRKTVQSIKEIVGCH---SEEEIYAMLKECNMDPNEA 38 (60)
T ss_pred ChHHHHHHHHHHHHhcCC---CHHHHHHHHHHhCCCHHHH
Confidence 567899999999998762 3467888888877776644
No 94
>cd01786 STE50_RA Ubiquitin-like domain of STE50_RA. STE50_RA The fungal adaptor protein STE50 is an essential component of three MAPK-mediated signalling pathways, which control the mating response, invasive/filamentous growth and osmotolerance (HOG pathway), respectively. STE50 functions in cell signalling between the activated G protein and STE11. The domain architecture of STE50 includes an amino-terminal SAM (sterile alpha motif) domain in addition to the carboxy-terminal ubiquitin-like RA (RAS-associated) domain. While the SAM domain interacts with STE11, the RA domain interacts with CDC42 and RAS. Modulation of signal transduction by STE50 specifically affects the pheromone-response pathway in yeast.
Probab=20.55 E-value=70 Score=23.48 Aligned_cols=27 Identities=22% Similarity=0.313 Sum_probs=22.7
Q ss_pred CCCHHHHHHHHHHhhcCccchhhhhcc
Q psy5204 40 SVDVAHVKFLMESYVSNADDWKRYAQF 66 (209)
Q Consensus 40 ~~~~~~v~~ll~~l~~~~~~w~~~~~~ 66 (209)
++...-+...|+++..+.++|..|+-+
T Consensus 35 DpC~kVL~~Alkry~I~~~dW~~Y~L~ 61 (98)
T cd01786 35 DSCEKILKNAMKRHNLNDQDWRQYVLV 61 (98)
T ss_pred CcHHHHHHHHHHHcCCChhhhhheEEE
Confidence 456788999999999998899887764
Done!