Query psy14742
Match_columns 66
No_of_seqs 139 out of 1176
Neff 8.0
Searched_HMMs 46136
Date Fri Aug 16 19:29:06 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy14742.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/14742hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1113|consensus 99.1 1E-10 2.3E-15 77.6 5.0 60 3-64 166-225 (368)
2 PF00027 cNMP_binding: Cyclic 98.9 3.5E-09 7.5E-14 56.2 5.5 56 3-58 20-80 (91)
3 COG0664 Crp cAMP-binding prote 98.8 1.2E-08 2.6E-13 61.2 6.0 54 3-56 44-102 (214)
4 PRK10402 DNA-binding transcrip 98.8 2E-08 4.3E-13 62.4 6.6 55 3-57 52-111 (226)
5 KOG0614|consensus 98.8 8.4E-10 1.8E-14 77.0 -1.2 59 3-63 198-256 (732)
6 PLN03192 Voltage-dependent pot 98.7 3.8E-08 8.2E-13 70.8 6.4 55 3-57 418-476 (823)
7 PRK09392 ftrB transcriptional 98.7 7.3E-08 1.6E-12 59.9 6.8 55 3-57 51-109 (236)
8 PRK13918 CRP/FNR family transc 98.7 9.3E-08 2E-12 58.0 6.9 55 3-58 29-88 (202)
9 TIGR03697 NtcA_cyano global ni 98.7 1.3E-07 2.9E-12 56.7 7.2 56 2-57 13-75 (193)
10 PRK11753 DNA-binding transcrip 98.7 1.6E-07 3.5E-12 57.2 7.6 55 3-57 41-101 (211)
11 cd00038 CAP_ED effector domain 98.7 1.2E-07 2.7E-12 51.3 5.7 55 3-57 38-97 (115)
12 KOG1113|consensus 98.4 6.2E-08 1.3E-12 64.6 1.1 59 4-65 285-345 (368)
13 smart00100 cNMP Cyclic nucleot 98.4 1.2E-06 2.5E-11 47.5 5.6 54 3-56 38-98 (120)
14 PLN02868 acyl-CoA thioesterase 98.4 1.5E-06 3.2E-11 58.6 6.2 56 2-59 51-110 (413)
15 KOG0614|consensus 98.3 1.1E-07 2.4E-12 66.7 0.3 58 4-63 317-381 (732)
16 KOG0500|consensus 98.3 2.7E-06 5.9E-11 58.9 6.6 56 3-58 351-414 (536)
17 KOG0499|consensus 98.3 1.4E-06 3.1E-11 61.8 5.0 60 3-64 571-636 (815)
18 KOG0498|consensus 98.3 4.3E-07 9.4E-12 65.2 2.3 55 3-57 463-523 (727)
19 PRK09391 fixK transcriptional 98.2 7.2E-06 1.6E-10 51.2 7.0 53 3-58 59-116 (230)
20 PRK11161 fumarate/nitrate redu 98.2 8.5E-06 1.9E-10 50.5 6.4 54 3-57 58-116 (235)
21 COG2905 Predicted signal-trans 97.6 0.00014 3.1E-09 51.4 5.3 53 4-56 52-105 (610)
22 KOG0501|consensus 97.5 2.8E-05 6.1E-10 55.6 0.0 56 3-58 592-649 (971)
23 KOG2968|consensus 97.2 0.00015 3.3E-09 53.7 1.0 54 3-56 529-587 (1158)
24 PF04831 Popeye: Popeye protei 97.1 0.0027 5.9E-08 38.3 5.9 58 3-60 50-112 (153)
25 KOG2968|consensus 95.5 0.069 1.5E-06 40.3 6.4 54 4-57 408-472 (1158)
26 KOG2378|consensus 95.3 0.0046 1E-07 43.2 -0.1 40 25-65 1-40 (573)
27 PRK11832 putative DNA-binding 93.3 0.74 1.6E-05 29.1 6.7 55 3-57 44-99 (207)
28 PF05899 Cupin_3: Protein of u 92.7 0.24 5.1E-06 26.1 3.3 26 4-29 29-54 (74)
29 PF07883 Cupin_2: Cupin domain 86.8 2.6 5.6E-05 20.9 6.1 26 3-29 22-47 (71)
30 PRK09943 DNA-binding transcrip 86.3 5.4 0.00012 24.1 6.4 49 4-56 132-180 (185)
31 PRK13290 ectC L-ectoine syntha 85.7 5.1 0.00011 23.2 6.1 48 3-55 58-106 (125)
32 COG3450 Predicted enzyme of th 84.2 2.6 5.6E-05 24.4 3.8 26 4-29 67-92 (116)
33 TIGR03214 ura-cupin putative a 82.6 11 0.00023 24.4 7.2 50 3-56 83-132 (260)
34 PRK11171 hypothetical protein; 81.0 12 0.00027 24.1 7.0 48 4-55 87-134 (266)
35 PF11699 CENP-C_C: Mif2/CENP-C 80.6 5.7 0.00012 21.6 4.1 25 4-29 37-61 (85)
36 COG0662 {ManC} Mannose-6-phosp 75.8 10 0.00022 21.6 4.5 52 3-60 60-111 (127)
37 KOG3542|consensus 75.6 3.9 8.4E-05 30.9 3.2 53 3-55 326-379 (1283)
38 COG3718 IolB Uncharacterized e 71.6 19 0.00041 23.7 5.3 53 3-60 52-113 (270)
39 COG1917 Uncharacterized conser 70.3 8.9 0.00019 21.7 3.4 28 3-31 67-94 (131)
40 TIGR01479 GMP_PMI mannose-1-ph 66.6 41 0.00089 23.5 6.6 49 4-56 401-449 (468)
41 PRK15460 cpsB mannose-1-phosph 64.3 49 0.0011 23.5 6.5 47 4-54 410-456 (478)
42 PF04962 KduI: KduI/IolB famil 63.8 38 0.00082 22.0 6.6 58 3-60 50-110 (261)
43 PF02311 AraC_binding: AraC-li 60.5 25 0.00054 18.9 6.5 56 4-63 27-82 (136)
44 PF12852 Cupin_6: Cupin 58.8 23 0.00049 21.1 3.7 27 3-29 38-65 (186)
45 smart00835 Cupin_1 Cupin. This 55.8 38 0.00083 19.5 6.3 27 3-29 54-85 (146)
46 PRK11171 hypothetical protein; 55.4 54 0.0012 21.2 6.3 26 3-29 208-233 (266)
47 PRK15457 ethanolamine utilizat 54.9 40 0.00087 21.9 4.5 25 4-29 179-203 (233)
48 PF06249 EutQ: Ethanolamine ut 54.3 33 0.00071 20.8 3.8 25 4-29 99-123 (152)
49 COG3257 GlxB Uncharacterized p 51.8 66 0.0014 21.1 6.6 51 3-57 86-136 (264)
50 PRK13264 3-hydroxyanthranilate 51.8 56 0.0012 20.3 6.2 53 3-60 57-112 (177)
51 COG4766 EutQ Ethanolamine util 47.1 44 0.00095 20.6 3.6 23 5-28 123-145 (176)
52 COG3837 Uncharacterized conser 43.3 69 0.0015 19.7 4.0 28 4-32 68-95 (161)
53 TIGR03037 anthran_nbaC 3-hydro 42.9 77 0.0017 19.3 6.6 53 3-60 51-106 (159)
54 PF14525 AraC_binding_2: AraC- 40.3 70 0.0015 18.1 6.5 51 4-60 59-109 (172)
55 PRK10296 DNA-binding transcrip 36.9 1.1E+02 0.0023 19.2 4.5 24 4-28 47-70 (278)
56 PF05726 Pirin_C: Pirin C-term 34.4 82 0.0018 17.2 4.3 20 38-57 50-70 (104)
57 PF01050 MannoseP_isomer: Mann 34.3 1E+02 0.0023 18.3 4.6 24 4-28 88-111 (151)
58 PHA02123 hypothetical protein 32.7 44 0.00096 19.5 2.0 17 50-66 114-130 (146)
59 PRK09685 DNA-binding transcrip 32.3 1.3E+02 0.0029 19.0 5.5 23 5-28 76-98 (302)
60 COG4101 Predicted mannose-6-ph 31.4 1.2E+02 0.0026 18.0 5.7 39 3-41 70-110 (142)
61 TIGR02272 gentisate_1_2 gentis 31.0 1.5E+02 0.0032 20.2 4.5 29 3-32 273-301 (335)
62 PF07943 PBP5_C: Penicillin-bi 29.1 92 0.002 16.1 4.8 34 20-53 4-37 (91)
63 TIGR03404 bicupin_oxalic bicup 28.6 2E+02 0.0042 19.7 8.2 55 4-63 91-149 (367)
64 KOG2500|consensus 28.1 1.8E+02 0.0039 19.1 4.4 51 6-58 41-96 (253)
65 PRK10579 hypothetical protein; 27.5 1.2E+02 0.0026 17.0 3.1 39 6-49 47-86 (94)
66 PF05409 Peptidase_C30: Corona 25.5 2E+02 0.0043 19.4 4.3 30 18-47 73-102 (293)
67 PF06719 AraC_N: AraC-type tra 23.4 1.7E+02 0.0036 17.1 3.8 53 4-60 27-82 (155)
68 PRK13502 transcriptional activ 22.1 2.1E+02 0.0046 17.9 4.1 24 4-28 42-65 (282)
69 PF01043 SecA_PP_bind: SecA pr 22.0 93 0.002 17.7 2.1 14 2-15 79-92 (113)
70 PF13421 Band_7_1: SPFH domain 21.9 1.7E+02 0.0036 18.4 3.3 25 5-29 18-43 (211)
71 cd04467 S1_aIF5A S1_aIF5A: Arc 21.2 1.3E+02 0.0028 15.1 2.8 17 40-56 4-20 (57)
No 1
>KOG1113|consensus
Probab=99.12 E-value=1e-10 Score=77.57 Aligned_cols=60 Identities=33% Similarity=0.558 Sum_probs=54.4
Q ss_pred eEEEEccEEEEEECCEEEEEeCCCCEEeccccccCCceeEEEEEccceEEEEEEeecCCCcc
Q psy14742 3 DRILSKGRVEVSRENKYLSTLAPGKVFGELAILYNCKRTATIKVHWTLNLKSWGSRTSGDNE 64 (66)
Q Consensus 3 ~~iI~~G~v~v~~~~~~v~~l~~G~~fGe~~l~~~~~r~~tv~A~~~~~v~~l~~~~~~~~~ 64 (66)
-|||.+|.+++++.+..+..+.||..|||++++.+.||.+|+.|.++ ..+|.++..-|.+
T Consensus 166 fYvI~kGt~dVyv~~~~v~~~~~g~sFGElALmyn~PRaATv~a~t~--~klWgldr~SFrr 225 (368)
T KOG1113|consen 166 FYVIDKGTFDVYVNGTYVTTYSPGGSFGELALMYNPPRAATVVAKSL--KKLWGLDRTSFRR 225 (368)
T ss_pred EEEEecceEEEEECCeEEeeeCCCCchhhhHhhhCCCcccceeeccc--cceEEEeeceeEE
Confidence 38999999999999999999999999999999999999999999996 6777777766654
No 2
>PF00027 cNMP_binding: Cyclic nucleotide-binding domain; InterPro: IPR000595 Proteins that bind cyclic nucleotides (cAMP or cGMP) share a structural domain of about 120 residues [, , ]. The best studied of these proteins is the prokaryotic catabolite gene activator (also known as the cAMP receptor protein) (gene crp) where such a domain is known to be composed of three alpha-helices and a distinctive eight-stranded, antiparallel beta-barrel structure. There are six invariant amino acids in this domain, three of which are glycine residues that are thought to be essential for maintenance of the structural integrity of the beta-barrel. cAMP- and cGMP-dependent protein kinases (cAPK and cGPK) contain two tandem copies of the cyclic nucleotide-binding domain. The cAPK's are composed of two different subunits, a catalytic chain and a regulatory chain, which contains both copies of the domain. The cGPK's are single chain enzymes that include the two copies of the domain in their N-terminal section. Vertebrate cyclic nucleotide-gated ion-channels also contain this domain. Two such cations channels have been fully characterised, one is found in rod cells where it plays a role in visual signal transduction.; PDB: 1O7F_A 2BYV_E 3E97_A 3U10_A 2H6B_A 3SHR_A 2OZ6_A 1WGP_A 3LA2_A 3LA3_B ....
Probab=98.94 E-value=3.5e-09 Score=56.23 Aligned_cols=56 Identities=25% Similarity=0.305 Sum_probs=48.1
Q ss_pred eEEEEccEEEEEECC-----EEEEEeCCCCEEeccccccCCceeEEEEEccceEEEEEEee
Q psy14742 3 DRILSKGRVEVSREN-----KYLSTLAPGKVFGELAILYNCKRTATIKVHWTLNLKSWGSR 58 (66)
Q Consensus 3 ~~iI~~G~v~v~~~~-----~~v~~l~~G~~fGe~~l~~~~~r~~tv~A~~~~~v~~l~~~ 58 (66)
-|+|.+|.+++.... ..+..+.+|++||+.+++.+.++..+++|.++|+++.++..
T Consensus 20 ~~~i~~G~v~~~~~~~~~~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~a~~~~~~~~i~~~ 80 (91)
T PF00027_consen 20 IYIILSGEVKVSSINEDGKEQIIFFLGPGDIFGEIELLTGKPSPFTVIALTDSEVLRIPRE 80 (91)
T ss_dssp EEEEEESEEEEEEETTTSEEEEEEEEETTEEESGHHHHHTSBBSSEEEESSSEEEEEEEHH
T ss_pred EEEEEECceEEEeceecceeeeecceeeeccccceeecCCCccEEEEEEccCEEEEEEeHH
Confidence 489999999998752 35789999999999999999999999999999777776643
No 3
>COG0664 Crp cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]
Probab=98.84 E-value=1.2e-08 Score=61.17 Aligned_cols=54 Identities=28% Similarity=0.321 Sum_probs=46.3
Q ss_pred eEEEEccEEEEEECC-----EEEEEeCCCCEEeccccccCCceeEEEEEccceEEEEEE
Q psy14742 3 DRILSKGRVEVSREN-----KYLSTLAPGKVFGELAILYNCKRTATIKVHWTLNLKSWG 56 (66)
Q Consensus 3 ~~iI~~G~v~v~~~~-----~~v~~l~~G~~fGe~~l~~~~~r~~tv~A~~~~~v~~l~ 56 (66)
-|+|.+|.+++++.. ..+..++||++||+.+++.+.+++++++|.++++++.++
T Consensus 44 ~y~v~~G~v~~~~~~~~G~~~~~~~~~~g~~fg~~~l~~~~~~~~~~~a~~~~~~~~~~ 102 (214)
T COG0664 44 LYIILSGIVKLYANTEDGREIILGFLGPGDFFGELALLGGDPRSASAVALTDVEVLEIP 102 (214)
T ss_pred EEEEEEeEEEEEEECCCCcEEEEEEecCCchhhhHHHhcCCCccceEEEcceEEEEEec
Confidence 489999999998752 467899999999999999988999999999995555544
No 4
>PRK10402 DNA-binding transcriptional activator YeiL; Provisional
Probab=98.82 E-value=2e-08 Score=62.43 Aligned_cols=55 Identities=13% Similarity=0.117 Sum_probs=47.7
Q ss_pred eEEEEccEEEEEEC---C--EEEEEeCCCCEEeccccccCCceeEEEEEccceEEEEEEe
Q psy14742 3 DRILSKGRVEVSRE---N--KYLSTLAPGKVFGELAILYNCKRTATIKVHWTLNLKSWGS 57 (66)
Q Consensus 3 ~~iI~~G~v~v~~~---~--~~v~~l~~G~~fGe~~l~~~~~r~~tv~A~~~~~v~~l~~ 57 (66)
-|+|.+|.++++.. + ..+..+.||++||+.+++.+.+++++++|.++|.++.++.
T Consensus 52 ~y~V~~G~v~v~~~~~~G~e~~~~~~~~g~~~G~~~~~~~~~~~~~~~A~~~~~i~~i~~ 111 (226)
T PRK10402 52 LFYLTRGRAKLYATLANGKVSLIDFFAAPCFIGEIELIDKDHETKAVQAIEECWCLALPM 111 (226)
T ss_pred EEEEEeCEEEEEEECCCCCEeeeeecCCCCeEEeehhhcCCCCCccEEEeccEEEEEEEH
Confidence 48999999999864 2 4678899999999999999999999999999988777664
No 5
>KOG0614|consensus
Probab=98.75 E-value=8.4e-10 Score=76.97 Aligned_cols=59 Identities=39% Similarity=0.667 Sum_probs=53.6
Q ss_pred eEEEEccEEEEEECCEEEEEeCCCCEEeccccccCCceeEEEEEccceEEEEEEeecCCCc
Q psy14742 3 DRILSKGRVEVSRENKYLSTLAPGKVFGELAILYNCKRTATIKVHWTLNLKSWGSRTSGDN 63 (66)
Q Consensus 3 ~~iI~~G~v~v~~~~~~v~~l~~G~~fGe~~l~~~~~r~~tv~A~~~~~v~~l~~~~~~~~ 63 (66)
-|++.+|++.|++.++.+...++|.+|||++++.+++|+|+|+|+++ +.+|.+++..|.
T Consensus 198 ~yV~aeG~~~V~~~g~ll~~m~~gtvFGELAILynctRtAsV~alt~--~~lWaidR~vFq 256 (732)
T KOG0614|consen 198 LYVSAEGELQVSREGKLLGKMGAGTVFGELAILYNCTRTASVRALTD--VRLWAIDREVFQ 256 (732)
T ss_pred EEEeecceEEEeeCCeeeeccCCchhhhHHHHHhCCcchhhhhhhhh--hhHHHHHHHHHH
Confidence 48999999999999999999999999999999999999999999998 677777766553
No 6
>PLN03192 Voltage-dependent potassium channel; Provisional
Probab=98.72 E-value=3.8e-08 Score=70.79 Aligned_cols=55 Identities=24% Similarity=0.332 Sum_probs=48.0
Q ss_pred eEEEEccEEEEEEC----CEEEEEeCCCCEEeccccccCCceeEEEEEccceEEEEEEe
Q psy14742 3 DRILSKGRVEVSRE----NKYLSTLAPGKVFGELAILYNCKRTATIKVHWTLNLKSWGS 57 (66)
Q Consensus 3 ~~iI~~G~v~v~~~----~~~v~~l~~G~~fGe~~l~~~~~r~~tv~A~~~~~v~~l~~ 57 (66)
-|+|.+|.+++... ++.+..+++|++|||.+++.+.|++++++|.++|+++.++.
T Consensus 418 lY~I~~G~V~i~~~~~~~e~~l~~l~~Gd~FGE~~~l~~~p~~~t~ra~~~s~ll~l~~ 476 (823)
T PLN03192 418 VYIVVSGEVEIIDSEGEKERVVGTLGCGDIFGEVGALCCRPQSFTFRTKTLSQLLRLKT 476 (823)
T ss_pred EEEEEecEEEEEEecCCcceeeEEccCCCEecchHHhcCCCCCCeEEEcccEEEEEEEH
Confidence 48999999999753 25688999999999999999999999999999987777654
No 7
>PRK09392 ftrB transcriptional activator FtrB; Provisional
Probab=98.71 E-value=7.3e-08 Score=59.88 Aligned_cols=55 Identities=16% Similarity=0.115 Sum_probs=47.5
Q ss_pred eEEEEccEEEEEEC----CEEEEEeCCCCEEeccccccCCceeEEEEEccceEEEEEEe
Q psy14742 3 DRILSKGRVEVSRE----NKYLSTLAPGKVFGELAILYNCKRTATIKVHWTLNLKSWGS 57 (66)
Q Consensus 3 ~~iI~~G~v~v~~~----~~~v~~l~~G~~fGe~~l~~~~~r~~tv~A~~~~~v~~l~~ 57 (66)
-|+|.+|.++++.. ...+..+.+|++||+.+++.+.++.++++|.++|.++.++.
T Consensus 51 ~~~v~~G~v~~~~~~~~~~~~i~~~~~g~~~g~~~~~~~~~~~~~~~A~~~~~~~~i~~ 109 (236)
T PRK09392 51 LFVVLDGLVELSASSQDRETTLAILRPVSTFILAAVVLDAPYLMSARTLTRSRVLMIPA 109 (236)
T ss_pred EEEEEeCEEEEEEcCCCceEEEEEeCCCchhhhHHHhCCCCCceEEEEcCceEEEEEeH
Confidence 58999999999764 24678999999999999999999999999999977776654
No 8
>PRK13918 CRP/FNR family transcriptional regulator; Provisional
Probab=98.70 E-value=9.3e-08 Score=57.97 Aligned_cols=55 Identities=18% Similarity=0.192 Sum_probs=45.6
Q ss_pred eEEEEccEEEEEEC---C--EEEEEeCCCCEEeccccccCCceeEEEEEccceEEEEEEee
Q psy14742 3 DRILSKGRVEVSRE---N--KYLSTLAPGKVFGELAILYNCKRTATIKVHWTLNLKSWGSR 58 (66)
Q Consensus 3 ~~iI~~G~v~v~~~---~--~~v~~l~~G~~fGe~~l~~~~~r~~tv~A~~~~~v~~l~~~ 58 (66)
-|+|.+|.+++++. + ..+..+.||++||+.+++ +.+++.++.|.++|.++.++..
T Consensus 29 ~y~I~~G~vr~~~~~~~G~e~~l~~~~~Gd~~G~~~~~-~~~~~~~~~A~~~~~v~~i~~~ 88 (202)
T PRK13918 29 LYRVRSGLVRLHTVDDEGNALTLRYVRPGEYFGEEALA-GAERAYFAEAVTDSRIDVLNPA 88 (202)
T ss_pred EEEEEeeEEEEEEECCCCCEEEEEEecCCCeechHHhc-CCCCCceEEEcCceEEEEEEHH
Confidence 58999999999874 2 467889999999997664 5788999999999888877654
No 9
>TIGR03697 NtcA_cyano global nitrogen regulator NtcA, cyanobacterial. Members of this protein family, found in the cyanobacteria, are the global nitrogen regulator NtcA. This DNA-binding transcriptional regulator is required for expressing many different ammonia-repressible genes. The consensus NtcA-binding site is G T A N(8)T A C.
Probab=98.69 E-value=1.3e-07 Score=56.75 Aligned_cols=56 Identities=21% Similarity=0.261 Sum_probs=44.9
Q ss_pred ceEEEEccEEEEEEC---C--EEEEEeCCCCEEeccccccCCc--eeEEEEEccceEEEEEEe
Q psy14742 2 RDRILSKGRVEVSRE---N--KYLSTLAPGKVFGELAILYNCK--RTATIKVHWTLNLKSWGS 57 (66)
Q Consensus 2 ~~~iI~~G~v~v~~~---~--~~v~~l~~G~~fGe~~l~~~~~--r~~tv~A~~~~~v~~l~~ 57 (66)
.-|+|.+|.+++++. + ..+..+.||++||+.+++.+.+ +..+++|.++|.++.++.
T Consensus 13 ~~~~i~~G~v~~~~~~~~G~e~~l~~~~~g~~~G~~~~~~~~~~~~~~~~~A~~~~~v~~i~~ 75 (193)
T TIGR03697 13 KVYFLRRGAVKLSRVYESGEEITVALLRENSVFGVLSLITGHRSDRFYHAVAFTRVELLAVPI 75 (193)
T ss_pred cEEEEEecEEEEEEeCCCCcEeeeEEccCCCEeeeeeeccCCCCccceEEEEecceEEEEeeH
Confidence 359999999999873 3 3578999999999999988754 457899999977666554
No 10
>PRK11753 DNA-binding transcriptional dual regulator Crp; Provisional
Probab=98.69 E-value=1.6e-07 Score=57.18 Aligned_cols=55 Identities=24% Similarity=0.278 Sum_probs=45.5
Q ss_pred eEEEEccEEEEEEC---C--EEEEEeCCCCEEeccccccC-CceeEEEEEccceEEEEEEe
Q psy14742 3 DRILSKGRVEVSRE---N--KYLSTLAPGKVFGELAILYN-CKRTATIKVHWTLNLKSWGS 57 (66)
Q Consensus 3 ~~iI~~G~v~v~~~---~--~~v~~l~~G~~fGe~~l~~~-~~r~~tv~A~~~~~v~~l~~ 57 (66)
-|+|.+|.++++.. + ..+..+++|++||+.+++.+ .+++++++|.++|.++.++.
T Consensus 41 ~y~V~~G~v~~~~~~~~g~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~~a~~~~~v~~i~~ 101 (211)
T PRK11753 41 LYYIVKGSVAVLIKDEEGKEMILSYLNQGDFIGELGLFEEGQERSAWVRAKTACEVAEISY 101 (211)
T ss_pred EEEEEeCEEEEEEECCCCCEEEEEEcCCCCEEeehhhccCCCCceEEEEEcCcEEEEEEcH
Confidence 58999999999753 2 46788999999999999886 46888999999977776654
No 11
>cd00038 CAP_ED effector domain of the CAP family of transcription factors; members include CAP (or cAMP receptor protein (CRP)), which binds cAMP, FNR (fumarate and nitrate reduction), which uses an iron-sulfur cluster to sense oxygen) and CooA, a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. Cyclic nucleotide-binding domain similar to CAP are also present in cAMP- and cGMP-dependent protein kinases (cAPK and cGPK) and vertebrate cyclic nucleotide-gated ion-channels. Cyclic nucleotide-monophosphate binding domain; proteins that bind cyclic nucleotides (cAMP or cGMP) share a structural domain of about 120 residues; the best studied is the prokaryotic catabolite gene activator, CAP, where such a domain is known to be composed of three alpha-helices and a distinctive eight-stranded, antiparallel beta-barrel structure; three conserved glycine residues are thought to be essential for maintenance of
Probab=98.65 E-value=1.2e-07 Score=51.30 Aligned_cols=55 Identities=35% Similarity=0.445 Sum_probs=47.0
Q ss_pred eEEEEccEEEEEECC-----EEEEEeCCCCEEeccccccCCceeEEEEEccceEEEEEEe
Q psy14742 3 DRILSKGRVEVSREN-----KYLSTLAPGKVFGELAILYNCKRTATIKVHWTLNLKSWGS 57 (66)
Q Consensus 3 ~~iI~~G~v~v~~~~-----~~v~~l~~G~~fGe~~l~~~~~r~~tv~A~~~~~v~~l~~ 57 (66)
-|+|.+|.++++... ..+..+.+|++||+..++.+.++..+++|.++|.++.++.
T Consensus 38 ~~~i~~G~v~~~~~~~~g~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~a~~~~~~~~i~~ 97 (115)
T cd00038 38 LYIVLSGSVEVYKLDEDGREQIVGFLGPGDLFGELALLGNGPRSATVRALTDSELLVLPR 97 (115)
T ss_pred EEEEEeCEEEEEEECCCCcEEEEEecCCccCcChHHHhcCCCCCceEEEcCceEEEEEeH
Confidence 589999999997753 5678899999999999998889999999999977766654
No 12
>KOG1113|consensus
Probab=98.44 E-value=6.2e-08 Score=64.57 Aligned_cols=59 Identities=29% Similarity=0.386 Sum_probs=48.9
Q ss_pred EEEEccEEEEEEC--CEEEEEeCCCCEEeccccccCCceeEEEEEccceEEEEEEeecCCCccC
Q psy14742 4 RILSKGRVEVSRE--NKYLSTLAPGKVFGELAILYNCKRTATIKVHWTLNLKSWGSRTSGDNEM 65 (66)
Q Consensus 4 ~iI~~G~v~v~~~--~~~v~~l~~G~~fGe~~l~~~~~r~~tv~A~~~~~v~~l~~~~~~~~~~ 65 (66)
|+|.+|.|.+.+. +..+ .+++|++|||.+++.+.||.++|.|..+..|. .++++.|+++
T Consensus 285 ~~i~eGEvdv~~~~~~v~v-kl~~~dyfge~al~~~~pr~Atv~a~~~~kc~--~~dk~~ferl 345 (368)
T KOG1113|consen 285 YIIEEGEVDVLKKRDGVEV-KLKKGDYFGELALLKNLPRAATVVAKGRLKCA--KLDKPRFERL 345 (368)
T ss_pred EEecccccchhhccCCeEE-EechhhhcchHHHHhhchhhceeeccCCceee--eeChHHHHHH
Confidence 8999999999764 4455 99999999999999999999999999985554 4677777653
No 13
>smart00100 cNMP Cyclic nucleotide-monophosphate binding domain. Catabolite gene activator protein (CAP) is a prokaryotic homologue of eukaryotic cNMP-binding domains, present in ion channels, and cNMP-dependent kinases.
Probab=98.42 E-value=1.2e-06 Score=47.51 Aligned_cols=54 Identities=31% Similarity=0.355 Sum_probs=42.5
Q ss_pred eEEEEccEEEEEEC---C--EEEEEeCCCCEEeccccc--cCCceeEEEEEccceEEEEEE
Q psy14742 3 DRILSKGRVEVSRE---N--KYLSTLAPGKVFGELAIL--YNCKRTATIKVHWTLNLKSWG 56 (66)
Q Consensus 3 ~~iI~~G~v~v~~~---~--~~v~~l~~G~~fGe~~l~--~~~~r~~tv~A~~~~~v~~l~ 56 (66)
-|+|.+|.+++... + ..+..+.+|++||+..++ ...+.+.++.+.++|.+..++
T Consensus 38 ~y~v~~G~v~~~~~~~~g~~~~~~~~~~g~~~g~~~~~~~~~~~~~~~~~~~~~~~v~~~~ 98 (120)
T smart00100 38 FYIILSGEVRVYKVLEDGREQILGILGPGDFFGELALLTNSRRAASATAVALELATLLRID 98 (120)
T ss_pred EEEEEeeEEEEEEECCCCceEEEEeecCCceechhhhccCCCcccceEEEEEeeEEEEccC
Confidence 58999999999864 2 467899999999999988 346677888888876665544
No 14
>PLN02868 acyl-CoA thioesterase family protein
Probab=98.36 E-value=1.5e-06 Score=58.60 Aligned_cols=56 Identities=20% Similarity=0.055 Sum_probs=46.6
Q ss_pred ceEEEEccEEEEEEC----CEEEEEeCCCCEEeccccccCCceeEEEEEccceEEEEEEeec
Q psy14742 2 RDRILSKGRVEVSRE----NKYLSTLAPGKVFGELAILYNCKRTATIKVHWTLNLKSWGSRT 59 (66)
Q Consensus 2 ~~~iI~~G~v~v~~~----~~~v~~l~~G~~fGe~~l~~~~~r~~tv~A~~~~~v~~l~~~~ 59 (66)
+-|+|.+|.+++.+. ...+..+++|++||+. +.+.+++++++|.++|+++.++...
T Consensus 51 ~lyiI~~G~V~v~~~~~~ge~~l~~l~~Gd~fG~~--l~~~~~~~~~~A~~d~~v~~ip~~~ 110 (413)
T PLN02868 51 GLYFIWKGEAEVSGPAEEESRPEFLLKRYDYFGYG--LSGSVHSADVVAVSELTCLVLPHEH 110 (413)
T ss_pred eEEEEEeCEEEEEEECCCCcEEEEEeCCCCEeehh--hCCCCcccEEEECCCEEEEEEcHHH
Confidence 358999999999774 2567889999999985 6788899999999998888776554
No 15
>KOG0614|consensus
Probab=98.34 E-value=1.1e-07 Score=66.68 Aligned_cols=58 Identities=33% Similarity=0.385 Sum_probs=47.5
Q ss_pred EEEEccEEEEEECC------EEEEEeCCCCEEeccccccCCceeEEEEEccc-eEEEEEEeecCCCc
Q psy14742 4 RILSKGRVEVSREN------KYLSTLAPGKVFGELAILYNCKRTATIKVHWT-LNLKSWGSRTSGDN 63 (66)
Q Consensus 4 ~iI~~G~v~v~~~~------~~v~~l~~G~~fGe~~l~~~~~r~~tv~A~~~-~~v~~l~~~~~~~~ 63 (66)
|+|.+|.|++.+.+ +.+..+++||+|||-+++....|+|+++|..+ ..|+.+ +++.|+
T Consensus 317 fii~~G~V~vtq~~e~~~q~~~lr~l~kGd~FGE~al~~edvRtAniia~~~gv~cl~l--DresF~ 381 (732)
T KOG0614|consen 317 FIISKGTVKVTQQDEGSTQPQELRTLNKGDYFGERALLGEDVRTANIIAQAPGVECLTL--DRESFK 381 (732)
T ss_pred EEEecceEEEeecCCCCCchhHHhhccccchhhHHHhhccCccchhhhccCCCceEEEe--cHHHHH
Confidence 78899999998752 46889999999999999999999999999998 555554 444443
No 16
>KOG0500|consensus
Probab=98.31 E-value=2.7e-06 Score=58.93 Aligned_cols=56 Identities=29% Similarity=0.429 Sum_probs=48.2
Q ss_pred eEEEEccEEEEEECC--EEEEEeCCCCEEeccccc------cCCceeEEEEEccceEEEEEEee
Q psy14742 3 DRILSKGRVEVSREN--KYLSTLAPGKVFGELAIL------YNCKRTATIKVHWTLNLKSWGSR 58 (66)
Q Consensus 3 ~~iI~~G~v~v~~~~--~~v~~l~~G~~fGe~~l~------~~~~r~~tv~A~~~~~v~~l~~~ 58 (66)
-|||.+|.+.+.-++ .....+++|++|||++++ .+.+|+++++++..+.+++|+.+
T Consensus 351 MyIVk~G~L~Vv~dDg~t~~~~L~~G~~FGEisIlni~g~~~gNRRtanvrSvGYSDlfvLskd 414 (536)
T KOG0500|consen 351 MYIVKEGKLAVVADDGVTVFVTLKAGSVFGEISILNIKGNKNGNRRTANVRSVGYSDLFVLSKD 414 (536)
T ss_pred EEEEEccEEEEEecCCcEEEEEecCCceeeeeEEEEEcCcccCCcceeeeeeeccceeeEeeHH
Confidence 389999999997764 467899999999999987 36789999999999999999853
No 17
>KOG0499|consensus
Probab=98.29 E-value=1.4e-06 Score=61.75 Aligned_cols=60 Identities=37% Similarity=0.443 Sum_probs=48.7
Q ss_pred eEEEEccEEEEEEC---CEEEEEeCCCCEEeccccc---cCCceeEEEEEccceEEEEEEeecCCCcc
Q psy14742 3 DRILSKGRVEVSRE---NKYLSTLAPGKVFGELAIL---YNCKRTATIKVHWTLNLKSWGSRTSGDNE 64 (66)
Q Consensus 3 ~~iI~~G~v~v~~~---~~~v~~l~~G~~fGe~~l~---~~~~r~~tv~A~~~~~v~~l~~~~~~~~~ 64 (66)
-|||..|+|.|.-. .+++..|+.|++|||++++ .+.+|+|+|+|...|.+++|+ +...||
T Consensus 571 MYIIk~GqvQVlGGp~~~~Vl~tL~~GsVFGEISLLaigG~nRRTAnV~a~Gf~nLfvL~--KkdLne 636 (815)
T KOG0499|consen 571 MYIIKHGQVQVLGGPDGTKVLVTLKAGSVFGEISLLAIGGGNRRTANVVAHGFANLFVLD--KKDLNE 636 (815)
T ss_pred eEEeecceEEEecCCCCCEEEEEecccceeeeeeeeeecCCCccchhhhhcccceeeEec--HhHHHH
Confidence 48999999998754 3688999999999999997 347889999999998777765 455554
No 18
>KOG0498|consensus
Probab=98.28 E-value=4.3e-07 Score=65.19 Aligned_cols=55 Identities=22% Similarity=0.287 Sum_probs=46.8
Q ss_pred eEEEEccEEEEEECC----EEEEEeCCCCEEe-ccccccC-CceeEEEEEccceEEEEEEe
Q psy14742 3 DRILSKGRVEVSREN----KYLSTLAPGKVFG-ELAILYN-CKRTATIKVHWTLNLKSWGS 57 (66)
Q Consensus 3 ~~iI~~G~v~v~~~~----~~v~~l~~G~~fG-e~~l~~~-~~r~~tv~A~~~~~v~~l~~ 57 (66)
=|||.+|.+++...+ ..+..|+|||+|| |+..... .|.++||+|.+.|+++.++.
T Consensus 463 myFI~rG~le~~~~~~g~~~~~~~L~~Gd~~GeEl~~~~~~~p~t~TVralt~~el~~L~~ 523 (727)
T KOG0498|consen 463 MYFIVRGSLESITTDGGGFFVVAILGPGDFFGEELLTWCLDLPQTRTVRALTYCELFRLSA 523 (727)
T ss_pred eEEEEeeeEEEEEccCCceEEEEEecCCCccchHHHHHHhcCCCCceeehhhhhhHHhccH
Confidence 489999999998775 6789999999999 7887777 88899999999877766654
No 19
>PRK09391 fixK transcriptional regulator FixK; Provisional
Probab=98.24 E-value=7.2e-06 Score=51.19 Aligned_cols=53 Identities=19% Similarity=0.142 Sum_probs=42.0
Q ss_pred eEEEEccEEEEEEC---C--EEEEEeCCCCEEeccccccCCceeEEEEEccceEEEEEEee
Q psy14742 3 DRILSKGRVEVSRE---N--KYLSTLAPGKVFGELAILYNCKRTATIKVHWTLNLKSWGSR 58 (66)
Q Consensus 3 ~~iI~~G~v~v~~~---~--~~v~~l~~G~~fGe~~l~~~~~r~~tv~A~~~~~v~~l~~~ 58 (66)
-|+|.+|.+++++. + ..+..+.+|++||+. .+.+++.+++|.++|.++.++..
T Consensus 59 ly~I~~G~vkl~~~~~~G~e~i~~~~~~Gd~fG~~---~~~~~~~~~~A~~ds~v~~i~~~ 116 (230)
T PRK09391 59 VYQVESGAVRTYRLLSDGRRQIGAFHLPGDVFGLE---SGSTHRFTAEAIVDTTVRLIKRR 116 (230)
T ss_pred EEEEEeCEEEEEEECCCCcEEEEEEecCCceeccc---CCCcCCeEEEEcCceEEEEEEHH
Confidence 58999999999863 3 367788999999975 35677899999999877766543
No 20
>PRK11161 fumarate/nitrate reduction transcriptional regulator; Provisional
Probab=98.19 E-value=8.5e-06 Score=50.49 Aligned_cols=54 Identities=11% Similarity=-0.065 Sum_probs=42.3
Q ss_pred eEEEEccEEEEEEC---C--EEEEEeCCCCEEeccccccCCceeEEEEEccceEEEEEEe
Q psy14742 3 DRILSKGRVEVSRE---N--KYLSTLAPGKVFGELAILYNCKRTATIKVHWTLNLKSWGS 57 (66)
Q Consensus 3 ~~iI~~G~v~v~~~---~--~~v~~l~~G~~fGe~~l~~~~~r~~tv~A~~~~~v~~l~~ 57 (66)
-|+|.+|.+++++. + ..+..+.||++||+.+++. .+++.+++|.++++++.++.
T Consensus 58 ly~v~~G~v~~~~~~~~G~e~i~~~~~~gd~~g~~~~~~-~~~~~~~~a~~~~~i~~ip~ 116 (235)
T PRK11161 58 LYAIRSGTIKSYTITEQGDEQITGFHLAGDLVGFDAIGS-GQHPSFAQALETSMVCEIPF 116 (235)
T ss_pred EEEEeeceEEEEEECCCCCEEEEEeccCCceeccccccC-CCCcceEEEeccEEEEEEEH
Confidence 48999999999874 2 3677889999999977654 45567899999977776654
No 21
>COG2905 Predicted signal-transduction protein containing cAMP-binding and CBS domains [Signal transduction mechanisms]
Probab=97.63 E-value=0.00014 Score=51.37 Aligned_cols=53 Identities=21% Similarity=0.266 Sum_probs=44.4
Q ss_pred EEEEccEEEEEEC-CEEEEEeCCCCEEeccccccCCceeEEEEEccceEEEEEE
Q psy14742 4 RILSKGRVEVSRE-NKYLSTLAPGKVFGELAILYNCKRTATIKVHWTLNLKSWG 56 (66)
Q Consensus 4 ~iI~~G~v~v~~~-~~~v~~l~~G~~fGe~~l~~~~~r~~tv~A~~~~~v~~l~ 56 (66)
|+|.+|.|++... +..+..+..||.||-.+++...+....+.|.+|+-|+.+.
T Consensus 52 ~vi~kG~vev~~~~g~v~~~~~~gdlFg~~~l~~~~~~~~~~~aeedsl~y~lp 105 (610)
T COG2905 52 YVIRKGVVEVRSDGGEVLDRLAAGDLFGFSSLFTELNKQRYMAAEEDSLCYLLP 105 (610)
T ss_pred EEEEeceeeEEcCCCeeeeeeccCccccchhhcccCCCcceeEeeccceEEecC
Confidence 8999999998766 5678999999999999999987777778888886555543
No 22
>KOG0501|consensus
Probab=97.46 E-value=2.8e-05 Score=55.59 Aligned_cols=56 Identities=20% Similarity=0.255 Sum_probs=45.4
Q ss_pred eEEEEccEEEEEECCEEEEEeCCCCEEeccccccC--CceeEEEEEccceEEEEEEee
Q psy14742 3 DRILSKGRVEVSRENKYLSTLAPGKVFGELAILYN--CKRTATIKVHWTLNLKSWGSR 58 (66)
Q Consensus 3 ~~iI~~G~v~v~~~~~~v~~l~~G~~fGe~~l~~~--~~r~~tv~A~~~~~v~~l~~~ 58 (66)
-+||.+|++++..++..+.+|++||+||....-.. ....++|+|.+.|.+..+.++
T Consensus 592 LcFvVsGSLEVIQDDEVVAILGKGDVFGD~FWK~~t~~qs~ANVRALTYcDLH~IKrd 649 (971)
T KOG0501|consen 592 LCFVVSGSLEVIQDDEVVAILGKGDVFGDEFWKENTLGQSAANVRALTYCDLHMIKRD 649 (971)
T ss_pred EEEEEecceEEeecCcEEEEeecCccchhHHhhhhhhhhhhhhhhhhhhhhhhHHhHH
Confidence 37999999999999999999999999999876554 233468999998877665443
No 23
>KOG2968|consensus
Probab=97.17 E-value=0.00015 Score=53.66 Aligned_cols=54 Identities=17% Similarity=0.250 Sum_probs=46.4
Q ss_pred eEEEEccEEEEEEC----C-EEEEEeCCCCEEeccccccCCceeEEEEEccceEEEEEE
Q psy14742 3 DRILSKGRVEVSRE----N-KYLSTLAPGKVFGELAILYNCKRTATIKVHWTLNLKSWG 56 (66)
Q Consensus 3 ~~iI~~G~v~v~~~----~-~~v~~l~~G~~fGe~~l~~~~~r~~tv~A~~~~~v~~l~ 56 (66)
-|+|+.|+++.... . ..+..++.||.+|+..++.+.+|..|+.|+.++++..+.
T Consensus 529 iyvVl~GRlRsv~~~~~~k~~i~~EygrGd~iG~~E~lt~~~R~tTv~AvRdSelariP 587 (1158)
T KOG2968|consen 529 IYVVLNGRLRSVIRQSGGKKEIVGEYGRGDLIGEVEMLTKQPRATTVMAVRDSELARIP 587 (1158)
T ss_pred EEEEecCeehhhhhccCccchhhhhccCcceeehhHHhhcCCccceEEEEeehhhhhcc
Confidence 48999999987543 2 367899999999999999999999999999998877654
No 24
>PF04831 Popeye: Popeye protein conserved region; InterPro: IPR006916 The Popeye (POP) family of proteins, is restricted to vertebrates and is preferentially expressed in developing and adult striated muscle. It is represented by a conserved region which includes three potential transmembrane domains []. The strong conservation of POP genes during evolution and their preferential expression in heart and skeletal muscle suggest that these novel proteins may have an important function in these tissues in vertebrates.; GO: 0016020 membrane
Probab=97.11 E-value=0.0027 Score=38.32 Aligned_cols=58 Identities=19% Similarity=0.225 Sum_probs=43.3
Q ss_pred eEEEEccEEEEEECCEEEEEeCCCCEEecccc-----ccCCceeEEEEEccceEEEEEEeecC
Q psy14742 3 DRILSKGRVEVSRENKYLSTLAPGKVFGELAI-----LYNCKRTATIKVHWTLNLKSWGSRTS 60 (66)
Q Consensus 3 ~~iI~~G~v~v~~~~~~v~~l~~G~~fGe~~l-----~~~~~r~~tv~A~~~~~v~~l~~~~~ 60 (66)
=-++++|++.|+.+++.++.+.|-.+..--.. -.+..-..|++|.++|+.++|++.+.
T Consensus 50 LSlLLsGr~~Vs~~g~fLH~I~p~qFlDSPEW~s~~~s~~~~FQVTitA~~~Cryl~W~R~kL 112 (153)
T PF04831_consen 50 LSLLLSGRMRVSCDGRFLHYIYPYQFLDSPEWESLRPSEDDKFQVTITAEEDCRYLCWPREKL 112 (153)
T ss_pred EeEEEcCcEEEEECCEeeEeecccccccChhhhccccCCCCeEEEEEEEcCCcEEEEEEHHHH
Confidence 35789999999999988888877655432221 12344568999999999999998753
No 25
>KOG2968|consensus
Probab=95.49 E-value=0.069 Score=40.28 Aligned_cols=54 Identities=26% Similarity=0.302 Sum_probs=43.4
Q ss_pred EEEEccEEEEEEC-----------CEEEEEeCCCCEEeccccccCCceeEEEEEccceEEEEEEe
Q psy14742 4 RILSKGRVEVSRE-----------NKYLSTLAPGKVFGELAILYNCKRTATIKVHWTLNLKSWGS 57 (66)
Q Consensus 4 ~iI~~G~v~v~~~-----------~~~v~~l~~G~~fGe~~l~~~~~r~~tv~A~~~~~v~~l~~ 57 (66)
|++..|.+.+... +..+...+||+++|.++++.+.+...+++|.+++.+..++.
T Consensus 408 ~~ii~g~l~v~~sm~~~s~~~~~~~~~~f~v~pG~ivgyla~lt~e~S~~tirArsdt~v~~isr 472 (1158)
T KOG2968|consen 408 YYIIKGSLSVYQSMYDVSGNLVLAGMLLFVVGPGEIVGYLAILTNEPSFITIRARSDTRVLFISR 472 (1158)
T ss_pred eEEeecceeeeehhcccccccccccceEEEecCCceechhhhhcCCcceEEEEEecceEEEEeeH
Confidence 5677777766432 23466889999999999999999999999999987777664
No 26
>KOG2378|consensus
Probab=95.28 E-value=0.0046 Score=43.21 Aligned_cols=40 Identities=28% Similarity=0.336 Sum_probs=36.2
Q ss_pred CCCEEeccccccCCceeEEEEEccceEEEEEEeecCCCccC
Q psy14742 25 PGKVFGELAILYNCKRTATIKVHWTLNLKSWGSRTSGDNEM 65 (66)
Q Consensus 25 ~G~~fGe~~l~~~~~r~~tv~A~~~~~v~~l~~~~~~~~~~ 65 (66)
.||-||.+++....||.+++..-++ .|.++..++..||.+
T Consensus 1 eGddfgklalvnd~praativl~ed-~~~fl~vDk~~Fn~I 40 (573)
T KOG2378|consen 1 EGDDFGKLALVNDAPRAATIVLRED-NCHFLRVDKHDFNRI 40 (573)
T ss_pred CCcccchhccccccccccceeeecC-CCcceeecHHHHHHH
Confidence 4899999999999999999888887 799999999999875
No 27
>PRK11832 putative DNA-binding transcriptional regulator; Provisional
Probab=93.29 E-value=0.74 Score=29.12 Aligned_cols=55 Identities=9% Similarity=0.036 Sum_probs=41.1
Q ss_pred eEEEEccEEEEEECC-EEEEEeCCCCEEeccccccCCceeEEEEEccceEEEEEEe
Q psy14742 3 DRILSKGRVEVSREN-KYLSTLAPGKVFGELAILYNCKRTATIKVHWTLNLKSWGS 57 (66)
Q Consensus 3 ~~iI~~G~v~v~~~~-~~v~~l~~G~~fGe~~l~~~~~r~~tv~A~~~~~v~~l~~ 57 (66)
-|++.+|.+.+.+.+ -.+....+-.+||-.............+|.++|++..++.
T Consensus 44 ~~ll~~G~vsirr~d~ll~~t~~aP~IlGl~~~~~~~~~~~~l~ae~~c~~~~i~~ 99 (207)
T PRK11832 44 TFVILEGVISLRREENVLIGITQAPYIMGLADGLMKNDIPYKLISEGNCTGYHLPA 99 (207)
T ss_pred EEEEEeceEEEEecCCeEEEeccCCeEeecccccCCCCceEEEEEcCccEEEEeeH
Confidence 378999999996554 4567777788999877666665567899999877765543
No 28
>PF05899 Cupin_3: Protein of unknown function (DUF861); InterPro: IPR008579 The function of the proteins in this entry are unknown. They contain the conserved barrel domain of the 'cupin' superfamily and members are specific to plants and bacteria.; PDB: 1RC6_A 3MYX_A 1O5U_A 2K9Z_A 1LKN_A 3ES4_A 1SFN_B 3BCW_A.
Probab=92.69 E-value=0.24 Score=26.06 Aligned_cols=26 Identities=23% Similarity=0.385 Sum_probs=20.8
Q ss_pred EEEEccEEEEEECCEEEEEeCCCCEE
Q psy14742 4 RILSKGRVEVSRENKYLSTLAPGKVF 29 (66)
Q Consensus 4 ~iI~~G~v~v~~~~~~v~~l~~G~~f 29 (66)
.+|++|++.+...+..-..+++||.|
T Consensus 29 ~~vleG~v~it~~~G~~~~~~aGD~~ 54 (74)
T PF05899_consen 29 FYVLEGEVTITDEDGETVTFKAGDAF 54 (74)
T ss_dssp EEEEEEEEEEEETTTEEEEEETTEEE
T ss_pred EEEEEeEEEEEECCCCEEEEcCCcEE
Confidence 57899999999865444789999875
No 29
>PF07883 Cupin_2: Cupin domain; InterPro: IPR013096 This family represents the conserved barrel domain of the cupin superfamily [] (cupa is the Latin term for a small barrel). ; PDB: 2OPK_C 3BU7_B 2PHD_D 3NVC_A 3NKT_A 3NJZ_A 3NW4_A 3NST_A 3NL1_A 2H0V_A ....
Probab=86.80 E-value=2.6 Score=20.88 Aligned_cols=26 Identities=15% Similarity=0.216 Sum_probs=20.0
Q ss_pred eEEEEccEEEEEECCEEEEEeCCCCEE
Q psy14742 3 DRILSKGRVEVSRENKYLSTLAPGKVF 29 (66)
Q Consensus 3 ~~iI~~G~v~v~~~~~~v~~l~~G~~f 29 (66)
-++|++|.+.+..+++ ...+++||.+
T Consensus 22 ~~~vl~G~~~~~~~~~-~~~l~~Gd~~ 47 (71)
T PF07883_consen 22 FFYVLSGEGTLTVDGE-RVELKPGDAI 47 (71)
T ss_dssp EEEEEESEEEEEETTE-EEEEETTEEE
T ss_pred EEEEEECCEEEEEccE-EeEccCCEEE
Confidence 3689999999997766 3578888753
No 30
>PRK09943 DNA-binding transcriptional repressor PuuR; Provisional
Probab=86.33 E-value=5.4 Score=24.06 Aligned_cols=49 Identities=12% Similarity=0.083 Sum_probs=32.4
Q ss_pred EEEEccEEEEEECCEEEEEeCCCCEEeccccccCCceeEEEEEccceEEEEEE
Q psy14742 4 RILSKGRVEVSRENKYLSTLAPGKVFGELAILYNCKRTATIKVHWTLNLKSWG 56 (66)
Q Consensus 4 ~iI~~G~v~v~~~~~~v~~l~~G~~fGe~~l~~~~~r~~tv~A~~~~~v~~l~ 56 (66)
++|++|.+.+..+++. ..+++||.+ -+-.+.++.......+++.++++.
T Consensus 132 ~~Vl~G~~~~~~~~~~-~~l~~Gd~~---~~~~~~~H~~~n~~~~~~~~l~~~ 180 (185)
T PRK09943 132 GTVLEGEIVLTINGQD-YHLVAGQSY---AINTGIPHSFSNTSAGICRIISAH 180 (185)
T ss_pred EEEEEeEEEEEECCEE-EEecCCCEE---EEcCCCCeeeeCCCCCCeEEEEEe
Confidence 5789999999887755 469999874 223346666555445555555543
No 31
>PRK13290 ectC L-ectoine synthase; Reviewed
Probab=85.70 E-value=5.1 Score=23.18 Aligned_cols=48 Identities=13% Similarity=0.121 Sum_probs=29.9
Q ss_pred eEEEEccEEEEE-ECCEEEEEeCCCCEEeccccccCCceeEEEEEccceEEEEE
Q psy14742 3 DRILSKGRVEVS-RENKYLSTLAPGKVFGELAILYNCKRTATIKVHWTLNLKSW 55 (66)
Q Consensus 3 ~~iI~~G~v~v~-~~~~~v~~l~~G~~fGe~~l~~~~~r~~tv~A~~~~~v~~l 55 (66)
-++|++|+.++. ..+.....+++||.+- +-.+.++..... +++.+++.
T Consensus 58 ~~yVL~G~~~~~~i~~g~~~~L~aGD~i~---~~~~~~H~~~N~--e~~~~l~v 106 (125)
T PRK13290 58 AVYCIEGEGEVEDLATGEVHPIRPGTMYA---LDKHDRHYLRAG--EDMRLVCV 106 (125)
T ss_pred EEEEEeCEEEEEEcCCCEEEEeCCCeEEE---ECCCCcEEEEcC--CCEEEEEE
Confidence 378999999998 5423336799999753 223345544433 66555443
No 32
>COG3450 Predicted enzyme of the cupin superfamily [General function prediction only]
Probab=84.24 E-value=2.6 Score=24.41 Aligned_cols=26 Identities=27% Similarity=0.390 Sum_probs=21.2
Q ss_pred EEEEccEEEEEECCEEEEEeCCCCEE
Q psy14742 4 RILSKGRVEVSRENKYLSTLAPGKVF 29 (66)
Q Consensus 4 ~iI~~G~v~v~~~~~~v~~l~~G~~f 29 (66)
+-|++|.+++..++.....+++||.|
T Consensus 67 chil~G~v~~T~d~Ge~v~~~aGD~~ 92 (116)
T COG3450 67 CHILEGRVEVTPDGGEPVEVRAGDSF 92 (116)
T ss_pred EEEEeeEEEEECCCCeEEEEcCCCEE
Confidence 45889999999887656689999975
No 33
>TIGR03214 ura-cupin putative allantoin catabolism protein. This model represents a protein containing a tandem arrangement of cupin domains (N-terminal part of pfam07883 and C-terminal more distantly related to pfam00190). This protein is found in the vicinity of genes involved in the catabolism of allantoin, a breakdown product of urate and sometimes of urate iteslf. The distribution of pathway components in the genomes in which this family is observed suggests that the function is linked to the allantoate catabolism to glyoxylate pathway (GenProp0686) since it is sometimes found in genomes lacking any elements of the xanthine-to-allantoin pathways (e.g. in Enterococcus faecalis).
Probab=82.61 E-value=11 Score=24.38 Aligned_cols=50 Identities=16% Similarity=0.031 Sum_probs=31.1
Q ss_pred eEEEEccEEEEEECCEEEEEeCCCCEEeccccccCCceeEEEEEccceEEEEEE
Q psy14742 3 DRILSKGRVEVSRENKYLSTLAPGKVFGELAILYNCKRTATIKVHWTLNLKSWG 56 (66)
Q Consensus 3 ~~iI~~G~v~v~~~~~~v~~l~~G~~fGe~~l~~~~~r~~tv~A~~~~~v~~l~ 56 (66)
-.+|++|++++...+.. ..|.+||++= +-.+.++...-...+++++++..
T Consensus 83 ~iyVl~G~l~v~~~g~~-~~L~~Gd~~y---~pa~~~H~~~N~~~~~a~~l~v~ 132 (260)
T TIGR03214 83 FLFVISGEVNVTAEGET-HELREGGYAY---LPPGSKWTLANAQAEDARFFLYK 132 (260)
T ss_pred EEEEEeCEEEEEECCEE-EEECCCCEEE---ECCCCCEEEEECCCCCEEEEEEE
Confidence 36899999999877765 4899999752 22334554433344455554433
No 34
>PRK11171 hypothetical protein; Provisional
Probab=80.97 E-value=12 Score=24.11 Aligned_cols=48 Identities=13% Similarity=0.037 Sum_probs=31.1
Q ss_pred EEEEccEEEEEECCEEEEEeCCCCEEeccccccCCceeEEEEEccceEEEEE
Q psy14742 4 RILSKGRVEVSRENKYLSTLAPGKVFGELAILYNCKRTATIKVHWTLNLKSW 55 (66)
Q Consensus 4 ~iI~~G~v~v~~~~~~v~~l~~G~~fGe~~l~~~~~r~~tv~A~~~~~v~~l 55 (66)
++|++|++++..+++. ..|.+||.+ .+-.+.++.......++++++++
T Consensus 87 ~~VlsG~l~v~~~g~~-~~L~~GDsi---~~p~~~~H~~~N~g~~~a~~l~v 134 (266)
T PRK11171 87 LFVVEGEITLTLEGKT-HALSEGGYA---YLPPGSDWTLRNAGAEDARFHWI 134 (266)
T ss_pred EEEEeCEEEEEECCEE-EEECCCCEE---EECCCCCEEEEECCCCCEEEEEE
Confidence 5899999999887764 579999874 23334555544444455555544
No 35
>PF11699 CENP-C_C: Mif2/CENP-C like; PDB: 2VPV_B.
Probab=80.61 E-value=5.7 Score=21.64 Aligned_cols=25 Identities=28% Similarity=0.440 Sum_probs=17.7
Q ss_pred EEEEccEEEEEECCEEEEEeCCCCEE
Q psy14742 4 RILSKGRVEVSRENKYLSTLAPGKVF 29 (66)
Q Consensus 4 ~iI~~G~v~v~~~~~~v~~l~~G~~f 29 (66)
+.|.+|.+++..++... .+.+|+.|
T Consensus 37 F~V~~G~v~Vti~~~~f-~v~~G~~F 61 (85)
T PF11699_consen 37 FYVIKGKVEVTIHETSF-VVTKGGSF 61 (85)
T ss_dssp EEEEESEEEEEETTEEE-EEETT-EE
T ss_pred EEEEeCEEEEEEcCcEE-EEeCCCEE
Confidence 68999999999887542 35666655
No 36
>COG0662 {ManC} Mannose-6-phosphate isomerase [Carbohydrate transport and metabolism]
Probab=75.84 E-value=10 Score=21.63 Aligned_cols=52 Identities=10% Similarity=0.078 Sum_probs=30.2
Q ss_pred eEEEEccEEEEEECCEEEEEeCCCCEEeccccccCCceeEEEEEccceEEEEEEeecC
Q psy14742 3 DRILSKGRVEVSRENKYLSTLAPGKVFGELAILYNCKRTATIKVHWTLNLKSWGSRTS 60 (66)
Q Consensus 3 ~~iI~~G~v~v~~~~~~v~~l~~G~~fGe~~l~~~~~r~~tv~A~~~~~v~~l~~~~~ 60 (66)
-|+|++|...+...+..+ .+++|+.+ -+-.+.++. +.......+.++.+...
T Consensus 60 ~~~Vl~G~g~v~~~~~~~-~v~~gd~~---~iP~g~~H~--~~N~G~~~L~liei~~p 111 (127)
T COG0662 60 HWYVLEGTGKVTIGGEEV-EVKAGDSV---YIPAGTPHR--VRNTGKIPLVLIEVQSP 111 (127)
T ss_pred EEEEEeeEEEEEECCEEE-EecCCCEE---EECCCCcEE--EEcCCCcceEEEEEecC
Confidence 489999999999887553 57777753 122233332 33333344555555443
No 37
>KOG3542|consensus
Probab=75.58 E-value=3.9 Score=30.86 Aligned_cols=53 Identities=15% Similarity=0.099 Sum_probs=35.3
Q ss_pred eEEEEccEEEEEECCEEEEEeCCCCEEeccccccCCceeEEEEE-ccceEEEEE
Q psy14742 3 DRILSKGRVEVSRENKYLSTLAPGKVFGELAILYNCKRTATIKV-HWTLNLKSW 55 (66)
Q Consensus 3 ~~iI~~G~v~v~~~~~~v~~l~~G~~fGe~~l~~~~~r~~tv~A-~~~~~v~~l 55 (66)
-++|+.|.|++.+-+.....+.-|+.||.....+.....-.++. +.||+..++
T Consensus 326 WsVIlNG~VEv~~PdGk~e~l~mGnSFG~~PT~dkqym~G~mRTkVDDCqFVci 379 (1283)
T KOG3542|consen 326 WSVILNGCVEVVKPDGKREELKMGNSFGAEPTPDKQYMIGEMRTKVDDCQFVCI 379 (1283)
T ss_pred eEEEecceEEEecCCCceEEeecccccCCCCCcchhhhhhhhheecccceEEEe
Confidence 37999999999988766678999999998766544332222222 334555544
No 38
>COG3718 IolB Uncharacterized enzyme involved in inositol metabolism [Carbohydrate transport and metabolism]
Probab=71.63 E-value=19 Score=23.65 Aligned_cols=53 Identities=9% Similarity=0.177 Sum_probs=32.4
Q ss_pred eEEEEccEEEEEECCEEEEEeCCCCEEeccccccC---------CceeEEEEEccceEEEEEEeecC
Q psy14742 3 DRILSKGRVEVSRENKYLSTLAPGKVFGELAILYN---------CKRTATIKVHWTLNLKSWGSRTS 60 (66)
Q Consensus 3 ~~iI~~G~v~v~~~~~~v~~l~~G~~fGe~~l~~~---------~~r~~tv~A~~~~~v~~l~~~~~ 60 (66)
-.++++|.++++..++....++.. ++.|.+ .-+.+++.|.+++++-+.+-...
T Consensus 52 clV~v~Gk~~vs~~g~~f~~iG~R-----~SvFe~~p~~~vYvp~g~~~~vtA~t~~~vAvC~AP~~ 113 (270)
T COG3718 52 CLVLVTGKATVSAHGSTFGEIGTR-----MSVFERKPPDSVYVPAGSAFSVTATTDLEVAVCSAPGK 113 (270)
T ss_pred EEEEEeeeEEEeeccchHhhcccc-----cccccCCCCCeEEecCCceEEEEeecceEEEEEeCCCC
Confidence 356777877777665443333311 234444 23568999999988877665543
No 39
>COG1917 Uncharacterized conserved protein, contains double-stranded beta-helix domain [Function unknown]
Probab=70.35 E-value=8.9 Score=21.65 Aligned_cols=28 Identities=21% Similarity=0.106 Sum_probs=21.5
Q ss_pred eEEEEccEEEEEECCEEEEEeCCCCEEec
Q psy14742 3 DRILSKGRVEVSRENKYLSTLAPGKVFGE 31 (66)
Q Consensus 3 ~~iI~~G~v~v~~~~~~v~~l~~G~~fGe 31 (66)
-.++++|.+++... .....+.+||++-.
T Consensus 67 ~~~Vl~G~~~~~~~-g~~~~l~~Gd~i~i 94 (131)
T COG1917 67 TIYVLEGEGTVQLE-GEKKELKAGDVIII 94 (131)
T ss_pred EEEEEecEEEEEec-CCceEecCCCEEEE
Confidence 35789999999887 44467999998654
No 40
>TIGR01479 GMP_PMI mannose-1-phosphate guanylyltransferase/mannose-6-phosphate isomerase. This enzyme is known to be bifunctional, as both mannose-6-phosphate isomerase (EC 5.3.1.8) (PMI) and mannose-1-phosphate guanylyltransferase (EC 2.7.7.22) in Pseudomonas aeruginosa, Xanthomonas campestris, and Gluconacetobacter xylinus. The literature on the enzyme from E. coli attributes mannose-6-phosphate isomerase activity to an adjacent gene, but the present sequence has not been shown to lack the activity. The PMI domain is C-terminal.
Probab=66.63 E-value=41 Score=23.47 Aligned_cols=49 Identities=14% Similarity=0.024 Sum_probs=31.6
Q ss_pred EEEEccEEEEEECCEEEEEeCCCCEEeccccccCCceeEEEEEccceEEEEEE
Q psy14742 4 RILSKGRVEVSRENKYLSTLAPGKVFGELAILYNCKRTATIKVHWTLNLKSWG 56 (66)
Q Consensus 4 ~iI~~G~v~v~~~~~~v~~l~~G~~fGe~~l~~~~~r~~tv~A~~~~~v~~l~ 56 (66)
++|++|.+++..+++. ..+++||.+= +-.+.++...-...++++++...
T Consensus 401 ~~Vl~G~~~v~~dg~~-~~l~~GDsi~---ip~~~~H~~~N~g~~~~~~i~v~ 449 (468)
T TIGR01479 401 WIVVSGTARVTIGDET-LLLTENESTY---IPLGVIHRLENPGKIPLELIEVQ 449 (468)
T ss_pred EEEEeeEEEEEECCEE-EEecCCCEEE---ECCCCcEEEEcCCCCCEEEEEEE
Confidence 5799999999888865 4799998752 22345555444444555444443
No 41
>PRK15460 cpsB mannose-1-phosphate guanyltransferase; Provisional
Probab=64.27 E-value=49 Score=23.46 Aligned_cols=47 Identities=13% Similarity=0.138 Sum_probs=30.6
Q ss_pred EEEEccEEEEEECCEEEEEeCCCCEEeccccccCCceeEEEEEccceEEEE
Q psy14742 4 RILSKGRVEVSRENKYLSTLAPGKVFGELAILYNCKRTATIKVHWTLNLKS 54 (66)
Q Consensus 4 ~iI~~G~v~v~~~~~~v~~l~~G~~fGe~~l~~~~~r~~tv~A~~~~~v~~ 54 (66)
++|++|.+++..+++. ..+.+||.+ .+-.+.++...-...++++++.
T Consensus 410 ~~VlsG~~~v~idg~~-~~L~~GDSi---~ip~g~~H~~~N~g~~~l~iI~ 456 (478)
T PRK15460 410 WVVVAGTAKVTIDGDI-KLLGENESI---YIPLGATHCLENPGKIPLDLIE 456 (478)
T ss_pred EEEEeeEEEEEECCEE-EEecCCCEE---EECCCCcEEEEcCCCCCEEEEE
Confidence 6799999999988865 568999874 3334456554444445544443
No 42
>PF04962 KduI: KduI/IolB family; InterPro: IPR021120 The KduI/IolB family of enzymes includes 5-keto 4-deoxyuronate isomerase (KduI) and 5-deoxy-glucuronate isomerase (IolB). KduI is involved in pectin degradation by free-living soil bacteria that use pectin as a carbon source, breaking it down to 2-keto-3-deoxygluconate, which can ultimately be converted to pyruvate. KduI catalyses the fourth step in pectin degradation, namely the interconversion of 5-keto-4-deoxyuronate and 2,5-diketo-3-dexoygluconate []. KduI has a TIM-barrel fold []. IolB is one of several bacterial proteins encoded by the inositol operon (iolABCDEFGHIJ) in Bacillus subtilis that are involved in myo-inositol catabolism. The enzyme is responsible for isomerization of 5-deoxy-D-glucuronic acid by IolB to produce 2-deoxy-5-keto-D-gluconic acid []. IolBs possess a cupin-like structure.; GO: 0016861 intramolecular oxidoreductase activity, interconverting aldoses and ketoses, 0008152 metabolic process; PDB: 1YWK_B 2QJV_B 1X8M_A 1XRU_A.
Probab=63.81 E-value=38 Score=22.04 Aligned_cols=58 Identities=9% Similarity=0.075 Sum_probs=34.2
Q ss_pred eEEEEccEEEEEECC-EEEEEeCCCCEEeccc--cccCCceeEEEEEccceEEEEEEeecC
Q psy14742 3 DRILSKGRVEVSREN-KYLSTLAPGKVFGELA--ILYNCKRTATIKVHWTLNLKSWGSRTS 60 (66)
Q Consensus 3 ~~iI~~G~v~v~~~~-~~v~~l~~G~~fGe~~--l~~~~~r~~tv~A~~~~~v~~l~~~~~ 60 (66)
+.|+++|.+.+..++ .....-+.-++|.+.+ +--..-....+.|.+++++++.+-...
T Consensus 50 ~vv~l~G~~~v~~~g~~~~~l~~R~~vF~~~~d~lYvp~g~~~~i~a~~~ae~~~~sapa~ 110 (261)
T PF04962_consen 50 GVVNLGGKATVTVDGEEFYELGGRESVFDGPPDALYVPRGTKVVIFASTDAEFAVCSAPAH 110 (261)
T ss_dssp EEEEESSSEEEEETTEEEEEE-TTSSGGGS--EEEEE-TT--EEEEESSTEEEEEEEEE-S
T ss_pred EEEEeCCEEEEEeCCceEEEecccccccCCCCcEEEeCCCCeEEEEEcCCCEEEEEccccC
Confidence 578899999999887 3333344445664422 222233457788888888887765443
No 43
>PF02311 AraC_binding: AraC-like ligand binding domain; InterPro: IPR003313 This entry defines the arabinose-binding and dimerisation domain of the bacterial gene regulatory protein AraC. The crystal structure of the arabinose-binding and dimerization domain of the Escherichia coli gene regulatory protein AraC was determined in the presence and absence of L-arabinose. The arabinose-bound molecule shows that the protein adopts an unusual fold, binding sugar within a beta barrel and completely burying the arabinose with the amino-terminal arm of the protein. Dimer contacts in the presence of arabinose are mediated by an antiparallel coiled-coil. In the uncomplexed protein, the amino-terminal arm is disordered, uncovering the sugar-binding pocket and allowing it to serve as an oligomerization interface [].; GO: 0006355 regulation of transcription, DNA-dependent; PDB: 1XJA_B 2ARA_A 2AAC_B 2ARC_A.
Probab=60.53 E-value=25 Score=18.87 Aligned_cols=56 Identities=11% Similarity=-0.047 Sum_probs=28.5
Q ss_pred EEEEccEEEEEECCEEEEEeCCCCEEeccccccCCceeEEEEEccceEEEEEEeecCCCc
Q psy14742 4 RILSKGRVEVSRENKYLSTLAPGKVFGELAILYNCKRTATIKVHWTLNLKSWGSRTSGDN 63 (66)
Q Consensus 4 ~iI~~G~v~v~~~~~~v~~l~~G~~fGe~~l~~~~~r~~tv~A~~~~~v~~l~~~~~~~~ 63 (66)
.++.+|...+..+++. -.++||++| -+-.+.++.......++.....+.++.+.+.
T Consensus 27 ~~v~~G~~~~~~~~~~-~~l~~g~~~---li~p~~~H~~~~~~~~~~~~~~i~~~~~~~~ 82 (136)
T PF02311_consen 27 IYVLSGEGTLHIDGQE-YPLKPGDLF---LIPPGQPHSYYPDSNEPWEYYWIYFSPDFLE 82 (136)
T ss_dssp EEEEEE-EEEEETTEE-EEE-TT-EE---EE-TTS-EEEEE-TTSEEEEEEEEE---GGG
T ss_pred EEEeCCEEEEEECCEE-EEEECCEEE---EecCCccEEEecCCCCCEEEEEEEECHHHHH
Confidence 5788999988777765 468888875 1222344443333333666666666665543
No 44
>PF12852 Cupin_6: Cupin
Probab=58.79 E-value=23 Score=21.15 Aligned_cols=27 Identities=11% Similarity=0.122 Sum_probs=19.3
Q ss_pred eEEEEccEEEEEECC-EEEEEeCCCCEE
Q psy14742 3 DRILSKGRVEVSREN-KYLSTLAPGKVF 29 (66)
Q Consensus 3 ~~iI~~G~v~v~~~~-~~v~~l~~G~~f 29 (66)
-|+|.+|.+.+...+ ..--.+.+||++
T Consensus 38 fh~V~~G~~~l~~~~~~~~~~L~~GDiv 65 (186)
T PF12852_consen 38 FHVVLRGSCWLRVPGGGEPIRLEAGDIV 65 (186)
T ss_pred EEEEECCeEEEEEcCCCCeEEecCCCEE
Confidence 488999999998654 333467777764
No 45
>smart00835 Cupin_1 Cupin. This family represents the conserved barrel domain of the 'cupin' superfamily ('cupa' is the Latin term for a small barrel). This family contains 11S and 7S plant seed storage proteins, and germins. Plant seed storage proteins provide the major nitrogen source for the developing plant.
Probab=55.76 E-value=38 Score=19.50 Aligned_cols=27 Identities=26% Similarity=0.325 Sum_probs=20.0
Q ss_pred eEEEEccEEEEEECC-----EEEEEeCCCCEE
Q psy14742 3 DRILSKGRVEVSREN-----KYLSTLAPGKVF 29 (66)
Q Consensus 3 ~~iI~~G~v~v~~~~-----~~v~~l~~G~~f 29 (66)
-++|++|...+...+ +....+.+||.|
T Consensus 54 ~~~Vl~G~~~~~~~~~~~~~~~~~~l~~GD~~ 85 (146)
T smart00835 54 LLYVVRGEGRVGVVDPNGNKVYDARLREGDVF 85 (146)
T ss_pred EEEEEeCeEEEEEEeCCCCeEEEEEecCCCEE
Confidence 468899999887543 235689999975
No 46
>PRK11171 hypothetical protein; Provisional
Probab=55.45 E-value=54 Score=21.16 Aligned_cols=26 Identities=12% Similarity=0.222 Sum_probs=20.5
Q ss_pred eEEEEccEEEEEECCEEEEEeCCCCEE
Q psy14742 3 DRILSKGRVEVSRENKYLSTLAPGKVF 29 (66)
Q Consensus 3 ~~iI~~G~v~v~~~~~~v~~l~~G~~f 29 (66)
.++|++|+..+..+++. ..+.+||..
T Consensus 208 ~i~Vl~G~~~~~~~~~~-~~l~~GD~i 233 (266)
T PRK11171 208 GLYVLEGKGVYRLNNDW-VEVEAGDFI 233 (266)
T ss_pred EEEEEeCEEEEEECCEE-EEeCCCCEE
Confidence 57899999999877654 468999874
No 47
>PRK15457 ethanolamine utilization protein EutQ; Provisional
Probab=54.94 E-value=40 Score=21.86 Aligned_cols=25 Identities=20% Similarity=0.377 Sum_probs=19.5
Q ss_pred EEEEccEEEEEECCEEEEEeCCCCEE
Q psy14742 4 RILSKGRVEVSRENKYLSTLAPGKVF 29 (66)
Q Consensus 4 ~iI~~G~v~v~~~~~~v~~l~~G~~f 29 (66)
.++++|.+.+..+++. ..++|||++
T Consensus 179 ~YVLEGe~~l~IdG~t-~~l~pGDvl 203 (233)
T PRK15457 179 DMVLEGELHVRHEGET-MIAKAGDVM 203 (233)
T ss_pred EEEEEeEEEEEECCEE-EEeCCCcEE
Confidence 5788999999887654 468888864
No 48
>PF06249 EutQ: Ethanolamine utilisation protein EutQ; InterPro: IPR010424 The eut operon of Salmonella typhimurium encodes proteins involved in the cobalamin-dependent degradation of ethanolamine. The role of EutQ in this process is unclear [].; PDB: 2PYT_B 3LWC_A.
Probab=54.31 E-value=33 Score=20.76 Aligned_cols=25 Identities=28% Similarity=0.562 Sum_probs=18.1
Q ss_pred EEEEccEEEEEECCEEEEEeCCCCEE
Q psy14742 4 RILSKGRVEVSRENKYLSTLAPGKVF 29 (66)
Q Consensus 4 ~iI~~G~v~v~~~~~~v~~l~~G~~f 29 (66)
.+|++|.+.+..+++.+ ..+|||++
T Consensus 99 ~~VlEG~L~i~~~G~~~-~A~~GDvi 123 (152)
T PF06249_consen 99 KYVLEGTLEISIDGQTV-TAKPGDVI 123 (152)
T ss_dssp EEEEEEEEEEEETTEEE-EEETT-EE
T ss_pred EEEEEeEEEEEECCEEE-EEcCCcEE
Confidence 46789999998877653 57788864
No 49
>COG3257 GlxB Uncharacterized protein, possibly involved in glyoxylate utilization [General function prediction only]
Probab=51.82 E-value=66 Score=21.06 Aligned_cols=51 Identities=14% Similarity=0.005 Sum_probs=32.8
Q ss_pred eEEEEccEEEEEECCEEEEEeCCCCEEeccccccCCceeEEEEEccceEEEEEEe
Q psy14742 3 DRILSKGRVEVSRENKYLSTLAPGKVFGELAILYNCKRTATIKVHWTLNLKSWGS 57 (66)
Q Consensus 3 ~~iI~~G~v~v~~~~~~v~~l~~G~~fGe~~l~~~~~r~~tv~A~~~~~v~~l~~ 57 (66)
.-+|++|.+.+...++. ..+.+|++ .-+-.++.++..-.+.+++++..+..
T Consensus 86 ~lfVv~Ge~tv~~~G~t-h~l~eggy---aylPpgs~~~~~N~~~~~~rfhw~rk 136 (264)
T COG3257 86 FLFVVSGEITVKAEGKT-HALREGGY---AYLPPGSGWTLRNAQKEDSRFHWIRK 136 (264)
T ss_pred EEEEEeeeEEEEEcCeE-EEeccCCe---EEeCCCCcceEeeccCCceEEEEEee
Confidence 45789999998877764 45777765 23334455555545677777766654
No 50
>PRK13264 3-hydroxyanthranilate 3,4-dioxygenase; Provisional
Probab=51.80 E-value=56 Score=20.28 Aligned_cols=53 Identities=8% Similarity=-0.090 Sum_probs=32.3
Q ss_pred eEEEEccEEEEEE--CCE-EEEEeCCCCEEeccccccCCceeEEEEEccceEEEEEEeecC
Q psy14742 3 DRILSKGRVEVSR--ENK-YLSTLAPGKVFGELAILYNCKRTATIKVHWTLNLKSWGSRTS 60 (66)
Q Consensus 3 ~~iI~~G~v~v~~--~~~-~v~~l~~G~~fGe~~l~~~~~r~~tv~A~~~~~v~~l~~~~~ 60 (66)
-|.+++|.+.+.. .++ ....+++||.|=-- .+.+++- ++.++|.++++...+.
T Consensus 57 ~FyqleG~~~l~v~d~g~~~~v~L~eGd~fllP---~gvpHsP--~r~~~tv~LviE~~r~ 112 (177)
T PRK13264 57 FFYQLEGDMYLKVQEDGKRRDVPIREGEMFLLP---PHVPHSP--QREAGSIGLVIERKRP 112 (177)
T ss_pred EEEEECCeEEEEEEcCCceeeEEECCCCEEEeC---CCCCcCC--ccCCCeEEEEEEeCCC
Confidence 3678999988865 333 45679999876322 2233332 2346667777765543
No 51
>COG4766 EutQ Ethanolamine utilization protein [Amino acid transport and metabolism]
Probab=47.10 E-value=44 Score=20.62 Aligned_cols=23 Identities=22% Similarity=0.432 Sum_probs=17.3
Q ss_pred EEEccEEEEEECCEEEEEeCCCCE
Q psy14742 5 ILSKGRVEVSRENKYLSTLAPGKV 28 (66)
Q Consensus 5 iI~~G~v~v~~~~~~v~~l~~G~~ 28 (66)
+|++|++.+..++..+ .-+|||+
T Consensus 123 ~VlEGrL~V~~~g~tv-~a~aGDv 145 (176)
T COG4766 123 YVLEGRLHVRIDGRTV-IAGAGDV 145 (176)
T ss_pred EEEeeeEEEEEcCCeE-ecCCCcE
Confidence 6789999998887543 4677775
No 52
>COG3837 Uncharacterized conserved protein, contains double-stranded beta-helix domain [Function unknown]
Probab=43.31 E-value=69 Score=19.66 Aligned_cols=28 Identities=18% Similarity=0.247 Sum_probs=20.0
Q ss_pred EEEEccEEEEEECCEEEEEeCCCCEEecc
Q psy14742 4 RILSKGRVEVSRENKYLSTLAPGKVFGEL 32 (66)
Q Consensus 4 ~iI~~G~v~v~~~~~~v~~l~~G~~fGe~ 32 (66)
..|++|...+..++.. ..+.|||..|--
T Consensus 68 v~ILeGE~~l~~d~~e-~~lrpGD~~gFp 95 (161)
T COG3837 68 VYILEGEGTLREDGGE-TRLRPGDSAGFP 95 (161)
T ss_pred EEEEcCceEEEECCee-EEecCCceeecc
Confidence 5688888877666543 468999987743
No 53
>TIGR03037 anthran_nbaC 3-hydroxyanthranilate 3,4-dioxygenase. Members of this protein family, from both bacteria and eukaryotes, are the enzyme 3-hydroxyanthranilate 3,4-dioxygenase. This enzyme acts on the tryptophan metabolite 3-hydroxyanthranilate and produces 2-amino-3-carboxymuconate semialdehyde, which can rearrange spontaneously to quinolinic acid and feed into nicotinamide biosynthesis, or undergo further enzymatic degradation.
Probab=42.91 E-value=77 Score=19.31 Aligned_cols=53 Identities=9% Similarity=-0.119 Sum_probs=31.9
Q ss_pred eEEEEccEEEEEEC--CE-EEEEeCCCCEEeccccccCCceeEEEEEccceEEEEEEeecC
Q psy14742 3 DRILSKGRVEVSRE--NK-YLSTLAPGKVFGELAILYNCKRTATIKVHWTLNLKSWGSRTS 60 (66)
Q Consensus 3 ~~iI~~G~v~v~~~--~~-~v~~l~~G~~fGe~~l~~~~~r~~tv~A~~~~~v~~l~~~~~ 60 (66)
-|.+++|.+.+... ++ ..-.+++||+|=--+= .+++ -++.++|.++++...+.
T Consensus 51 ~FyqleG~~~l~v~d~g~~~~v~L~eGd~flvP~g---vpHs--P~r~~~t~~LvIE~~r~ 106 (159)
T TIGR03037 51 FFYQLKGEMYLKVTEEGKREDVPIREGDIFLLPPH---VPHS--PQRPAGSIGLVIERKRP 106 (159)
T ss_pred EEEEEcceEEEEEEcCCcEEEEEECCCCEEEeCCC---CCcc--cccCCCcEEEEEEeCCC
Confidence 37889999888442 32 4568999987633222 3332 22346667777766543
No 54
>PF14525 AraC_binding_2: AraC-binding-like domain
Probab=40.30 E-value=70 Score=18.06 Aligned_cols=51 Identities=16% Similarity=0.169 Sum_probs=32.2
Q ss_pred EEEEccEEEEEECCEEEEEeCCCCEEeccccccCCceeEEEEEccceEEEEEEeecC
Q psy14742 4 RILSKGRVEVSRENKYLSTLAPGKVFGELAILYNCKRTATIKVHWTLNLKSWGSRTS 60 (66)
Q Consensus 4 ~iI~~G~v~v~~~~~~v~~l~~G~~fGe~~l~~~~~r~~tv~A~~~~~v~~l~~~~~ 60 (66)
.+..+|.+.+...+.. ..+.||+.+ +.+..++.+....++++.+.+.++..
T Consensus 59 ~~~~~G~~~~~~~g~~-~~~~pg~~~-----l~d~~~~~~~~~~~~~~~~~l~ip~~ 109 (172)
T PF14525_consen 59 VLPLSGSARIEQGGRE-VELAPGDVV-----LLDPGQPYRLEFSAGCRQLSLRIPRA 109 (172)
T ss_pred EEEccCCEEEEECCEE-EEEcCCeEE-----EEcCCCCEEEEECCCccEEEEEECHH
Confidence 4567788877766553 457887632 34444566666666777777776643
No 55
>PRK10296 DNA-binding transcriptional regulator ChbR; Provisional
Probab=36.85 E-value=1.1e+02 Score=19.24 Aligned_cols=24 Identities=21% Similarity=0.275 Sum_probs=17.6
Q ss_pred EEEEccEEEEEECCEEEEEeCCCCE
Q psy14742 4 RILSKGRVEVSRENKYLSTLAPGKV 28 (66)
Q Consensus 4 ~iI~~G~v~v~~~~~~v~~l~~G~~ 28 (66)
.++.+|.+....+++.. .+++|++
T Consensus 47 ~~v~~G~~~~~i~~~~~-~l~~g~l 70 (278)
T PRK10296 47 TLVLTGRYYQEINGKRV-LLERGDF 70 (278)
T ss_pred EEEEeceEEEEECCEEE-EECCCcE
Confidence 57889999888877654 5777664
No 56
>PF05726 Pirin_C: Pirin C-terminal cupin domain; InterPro: IPR008778 This entry represents C-terminal domain of Pirin proteins from both eukaryotes and prokaryotes. The function of Pirin is unknown but the gene coding for this protein is known to be expressed in all tissues in the human body although it is expressed most strongly in the liver and heart. Pirin is known to be a nuclear protein, exclusively localised within the nucleoplasma and predominantly concentrated within dot-like subnuclear structures []. Pirin is composed of two structurally similar domains arranged face to face. The N-terminal domain additionally features four beta-strands, and the C-terminal domain also includes four additional -strands and a short alpha-helix. Although the two domains are similar, the C-terminal domain of Pirin differs from the N-terminal domain as it does not contain a metal binding site and its sequence does not contain the conserved metal-coordinating residues []. Pirin is confirmed to be a member of the cupin superfamily on the basis of primary sequence and structural similarity. The presence of a metal binding site in the N-terminal beta-barrel of Pirin, may be significant in its role in regulating NFI DNA replication and NF-kappaB transcription factor activity []. Pirin structure has been found to closely resemble members of the cupin superfamily. Pirin contains the two characteristic sequences of the cupin superfamily, namely PG-(X)5-HXH-(X)4-E-(X)6-G and G-(X)5-PXG-(X)2-H-(X)3-N separated by a variable stretch of 15-50 amino acids. These motifs are best conserved in the N-terminal where the conserved histidine and glutamic acid residues correspond to the metal-coordinating residues. The C-terminal domain motifs lack the metal binding residues normally associated with the cupin fold []. Pirin was identified to be a metal-binding protein [], and was found that the metal-binding residues of Pirins are highly conserved across mammals, plants, fungi, and prokaryotic organisms. Pirin acts as a cofactor for the transcription factor NFI, the regulatory mechanism of which is generally believed to require the assistance of a metal ion []. Structural data supports the hypothesis that the bound iron of Pirin may participate in this transcriptional regulation by enhancing and stabilising the formation of the p50,Bcl3,DNA complex []. Metals have been implicated directly or indirectly in the NF-kappaB family of transcription factors that control expression of a number of early response genes associated with inflammatory responses, cell growth, cell cycle progression, and neoplastic transformation []. However, most metal-dependent transcription factors are DNA-binding proteins that bind to specific sequences when the metal binds to the protein. Pirin, on the other hand, appears to function differently and bind to the transcription factor DNA complex [].; PDB: 1J1L_A 3ACL_A 2P17_A.
Probab=34.42 E-value=82 Score=17.17 Aligned_cols=20 Identities=5% Similarity=-0.100 Sum_probs=14.3
Q ss_pred CceeEEEEEc-cceEEEEEEe
Q psy14742 38 CKRTATIKVH-WTLNLKSWGS 57 (66)
Q Consensus 38 ~~r~~tv~A~-~~~~v~~l~~ 57 (66)
.....++.+. +++++++++-
T Consensus 50 ~g~~i~~~a~~~~a~~lll~G 70 (104)
T PF05726_consen 50 DGDEIELTAGEEGARFLLLGG 70 (104)
T ss_dssp SECEEEEEESSSSEEEEEEEE
T ss_pred CCceEEEEECCCCcEEEEEEc
Confidence 4455678888 7788888764
No 57
>PF01050 MannoseP_isomer: Mannose-6-phosphate isomerase; InterPro: IPR001538 Mannose-6-phosphate isomerase or phosphomannose isomerase (5.3.1.8 from EC) (PMI) is the enzyme that catalyses the interconversion of mannose-6-phosphate and fructose-6-phosphate. In eukaryotes PMI is involved in the synthesis of GDP-mannose, a constituent of N- and O-linked glycans and GPI anchors and in prokaryotes it participates in a variety of pathways, including capsular polysaccharide biosynthesis and D-mannose metabolism. PMI's belong to the cupin superfamily whose functions range from isomerase and epimerase activities involved in the modification of cell wall carbohydrates in bacteria and plants, to non-enzymatic storage proteins in plant seeds, and transcription factors linked to congenital baldness in mammals []. Three classes of PMI have been defined []. The type II phosphomannose isomerases are bifunctional enzymes 5.3.1.8 from EC. This entry covers the isomerase region of the protein []. The guanosine diphospho-D-mannose pyrophosphorylase region is described in another InterPro entry (see IPR005836 from INTERPRO).; GO: 0016779 nucleotidyltransferase activity, 0005976 polysaccharide metabolic process
Probab=34.35 E-value=1e+02 Score=18.33 Aligned_cols=24 Identities=25% Similarity=0.332 Sum_probs=18.1
Q ss_pred EEEEccEEEEEECCEEEEEeCCCCE
Q psy14742 4 RILSKGRVEVSRENKYLSTLAPGKV 28 (66)
Q Consensus 4 ~iI~~G~v~v~~~~~~v~~l~~G~~ 28 (66)
.+|++|.+++..+++. ..+.+|+.
T Consensus 88 W~Vv~G~a~v~~~~~~-~~~~~g~s 111 (151)
T PF01050_consen 88 WTVVSGTAEVTLDDEE-FTLKEGDS 111 (151)
T ss_pred EEEEeCeEEEEECCEE-EEEcCCCE
Confidence 5789999999887654 45777765
No 58
>PHA02123 hypothetical protein
Probab=32.66 E-value=44 Score=19.47 Aligned_cols=17 Identities=12% Similarity=0.009 Sum_probs=15.0
Q ss_pred eEEEEEEeecCCCccCC
Q psy14742 50 LNLKSWGSRTSGDNEMG 66 (66)
Q Consensus 50 ~~v~~l~~~~~~~~~~~ 66 (66)
+.+..|+.+.+.||-||
T Consensus 114 saivi~r~dnp~fnp~g 130 (146)
T PHA02123 114 SAIVIRRMDNPTFNPIG 130 (146)
T ss_pred cEEEEEecCCCCCCccc
Confidence 45889999999999987
No 59
>PRK09685 DNA-binding transcriptional activator FeaR; Provisional
Probab=32.28 E-value=1.3e+02 Score=18.98 Aligned_cols=23 Identities=17% Similarity=0.416 Sum_probs=16.6
Q ss_pred EEEccEEEEEECCEEEEEeCCCCE
Q psy14742 5 ILSKGRVEVSRENKYLSTLAPGKV 28 (66)
Q Consensus 5 iI~~G~v~v~~~~~~v~~l~~G~~ 28 (66)
++.+|...+..+++. ..+.||++
T Consensus 76 ~~~~G~~~~~~~g~~-~~l~~G~~ 98 (302)
T PRK09685 76 FQLSGHAIIEQDDRQ-VQLAAGDI 98 (302)
T ss_pred EEecceEEEEECCeE-EEEcCCCE
Confidence 567898888777654 35788875
No 60
>COG4101 Predicted mannose-6-phosphate isomerase [Carbohydrate transport and metabolism]
Probab=31.37 E-value=1.2e+02 Score=18.04 Aligned_cols=39 Identities=13% Similarity=0.076 Sum_probs=25.1
Q ss_pred eEEEEccEEEEEECCE--EEEEeCCCCEEeccccccCCcee
Q psy14742 3 DRILSKGRVEVSRENK--YLSTLAPGKVFGELAILYNCKRT 41 (66)
Q Consensus 3 ~~iI~~G~v~v~~~~~--~v~~l~~G~~fGe~~l~~~~~r~ 41 (66)
+..+++|++.+--.+. .-...++||+|=--+.+-+.+.+
T Consensus 70 aIYvlsG~ah~w~G~rLE~ha~~~pGDf~YiPpgVPHqp~N 110 (142)
T COG4101 70 AIYVLSGEAHTWYGNRLEEHAEVGPGDFFYIPPGVPHQPAN 110 (142)
T ss_pred EEEEEeceeeeeeccceeeeEEecCCCeEEcCCCCCCcccc
Confidence 4567888888765542 34578999998655554444433
No 61
>TIGR02272 gentisate_1_2 gentisate 1,2-dioxygenase. This family consists of gentisate 1,2-dioxygenases. This ring-opening enzyme acts in salicylate degradation that goes via gentisate rather than via catechol. It converts gentisate to maleylpyruvate. Some putative gentisate 1,2-dioxygenases are excluded by a relatively high trusted cutoff score because they are too closely related to known examples of 1-hydroxy-2-naphthoate dioxygenase. Therefore some homologs may be bona fide gentisate 1,2-dioxygenases even if they score below the given cutoffs.
Probab=30.95 E-value=1.5e+02 Score=20.20 Aligned_cols=29 Identities=21% Similarity=0.232 Sum_probs=22.7
Q ss_pred eEEEEccEEEEEECCEEEEEeCCCCEEecc
Q psy14742 3 DRILSKGRVEVSRENKYLSTLAPGKVFGEL 32 (66)
Q Consensus 3 ~~iI~~G~v~v~~~~~~v~~l~~G~~fGe~ 32 (66)
-|.|.+|+-.+...++.+ ..++||+|---
T Consensus 273 Vf~VieG~G~s~ig~~~~-~W~~gD~f~vP 301 (335)
T TIGR02272 273 VFCVVEGRGQVRIGDAVF-RFSPKDVFVVP 301 (335)
T ss_pred EEEEEeCeEEEEECCEEE-EecCCCEEEEC
Confidence 478899998888877754 69999998543
No 62
>PF07943 PBP5_C: Penicillin-binding protein 5, C-terminal domain; InterPro: IPR012907 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry contains proteins that are annotated as penicillin-binding protein 5 and 6. These belong to MEROPS peptidase family S11 (D-Ala-D-Ala carboxypeptidase A family, clan SE). Penicillin-binding protein 5 expressed by Escherichia coli functions as a D-alanyl-D-alanine carboxypeptidase. It is composed of two domains that are oriented at approximately right angles to each other. The N-terminal domain (IPR001967 from INTERPRO) is the catalytic domain. The C-terminal domain, this entry, is organised into a sandwich of two anti-parallel beta-sheets, and has a relatively hydrophobic surface as compared to the N-terminal domain. Its precise function is unknown; it may mediate interactions with other cell wall-synthesising enzymes, thus allowing the protein to be recruited to areas of active cell wall synthesis. It may also function as a linker domain that positions the active site in the catalytic domain closer to the peptidoglycan layer, to allow it to interact with cell wall peptides []. ; GO: 0009002 serine-type D-Ala-D-Ala carboxypeptidase activity, 0006508 proteolysis; PDB: 3A3J_A 3MFD_B 1XP4_D 3MZD_A 1NZU_A 1NJ4_A 1Z6F_A 3MZF_A 1NZO_A 3MZE_A ....
Probab=29.09 E-value=92 Score=16.09 Aligned_cols=34 Identities=12% Similarity=0.192 Sum_probs=26.1
Q ss_pred EEEeCCCCEEeccccccCCceeEEEEEccceEEE
Q psy14742 20 LSTLAPGKVFGELAILYNCKRTATIKVHWTLNLK 53 (66)
Q Consensus 20 v~~l~~G~~fGe~~l~~~~~r~~tv~A~~~~~v~ 53 (66)
...+.+|+.++...+..+......+.+.++..+.
T Consensus 4 ~~l~~kg~~~~~~~v~~g~~~~v~l~~~~d~~~~ 37 (91)
T PF07943_consen 4 VKLLKKGQPVATVPVKKGKKKTVPLVAKEDIYIT 37 (91)
T ss_dssp EEEE-TTCESEEEEEESCSSSEEEEEESSGEEEE
T ss_pred EEEeCCCCEEEEEEECCCCcCEEEEEECCCEEEE
Confidence 4568889999889988888888888888884333
No 63
>TIGR03404 bicupin_oxalic bicupin, oxalate decarboxylase family. Members of this protein family are defined as bicupins as they have two copies of the cupin domain (pfam00190). Two different known activities for members of this family are oxalate decarboxylase (EC 4.1.1.2) and oxalate oxidase (EC 1.2.3.4), although the latter activity has more often been found in distantly related monocupin (germin) proteins.
Probab=28.64 E-value=2e+02 Score=19.69 Aligned_cols=55 Identities=4% Similarity=-0.064 Sum_probs=31.2
Q ss_pred EEEEccEEEEEEC---CEE-EEEeCCCCEEeccccccCCceeEEEEEccceEEEEEEeecCCCc
Q psy14742 4 RILSKGRVEVSRE---NKY-LSTLAPGKVFGELAILYNCKRTATIKVHWTLNLKSWGSRTSGDN 63 (66)
Q Consensus 4 ~iI~~G~v~v~~~---~~~-v~~l~~G~~fGe~~l~~~~~r~~tv~A~~~~~v~~l~~~~~~~~ 63 (66)
.++++|++++... ++. ...+++||+|= +-.+.++..... .++++ +++-.+...|+
T Consensus 91 ~yVl~G~~~v~~~d~~g~~~~~~L~~GD~~~---fP~g~~H~~~n~-~~~~~-~l~vf~~~~f~ 149 (367)
T TIGR03404 91 AYVLYGSCRITAVDENGRNYIDDVGAGDLWY---FPPGIPHSLQGL-DEGCE-FLLVFDDGNFS 149 (367)
T ss_pred EEEEeeEEEEEEEcCCCcEEEeEECCCCEEE---ECCCCeEEEEEC-CCCeE-EEEEeCCcccC
Confidence 5788999999763 343 45899999752 223345544433 23333 44444444444
No 64
>KOG2500|consensus
Probab=28.13 E-value=1.8e+02 Score=19.09 Aligned_cols=51 Identities=14% Similarity=0.137 Sum_probs=31.6
Q ss_pred EEccEEEEEECCE----EEEEeCCCCEEeccccccCCceeEEEEEccc-eEEEEEEee
Q psy14742 6 LSKGRVEVSRENK----YLSTLAPGKVFGELAILYNCKRTATIKVHWT-LNLKSWGSR 58 (66)
Q Consensus 6 I~~G~v~v~~~~~----~v~~l~~G~~fGe~~l~~~~~r~~tv~A~~~-~~v~~l~~~ 58 (66)
..+|++++.-.+. .+..-..|+.|....+ +..+. ..|.|+.| ++..++++.
T Consensus 41 ~WtGrlrvvakg~~~~ikLeD~tsg~LfA~c~i-d~~~~-~avEav~DSSRYFViRv~ 96 (253)
T KOG2500|consen 41 AWTGRLRVVAKGERCEIKLEDKTSGELFAQCPI-DEGPG-NAVEAVSDSSRYFVIRVE 96 (253)
T ss_pred cccceeEEEEcCcEEEEEeccCCchhhhhhCcc-cCCCC-ccceeecccceEEEEEEe
Confidence 3578888865542 3445567888876555 33444 45777777 556666655
No 65
>PRK10579 hypothetical protein; Provisional
Probab=27.51 E-value=1.2e+02 Score=16.98 Aligned_cols=39 Identities=15% Similarity=0.149 Sum_probs=22.3
Q ss_pred EEccEEEEEECCE-EEEEeCCCCEEeccccccCCceeEEEEEccc
Q psy14742 6 LSKGRVEVSRENK-YLSTLAPGKVFGELAILYNCKRTATIKVHWT 49 (66)
Q Consensus 6 I~~G~v~v~~~~~-~v~~l~~G~~fGe~~l~~~~~r~~tv~A~~~ 49 (66)
|++|.+++...+. .-..+.+|+.|--- ...+..+++.++
T Consensus 47 ivsG~l~V~Lpg~~ew~~~~aG~sF~Vp-----anssF~l~v~~~ 86 (94)
T PRK10579 47 VISGALNVLLPGATDWQVYEAGEVFNVP-----GHSEFHLQVAEP 86 (94)
T ss_pred EEeeEEEEECCCCcccEEeCCCCEEEEC-----CCCeEEEEECcc
Confidence 6677777765543 34567888777421 223345555554
No 66
>PF05409 Peptidase_C30: Coronavirus endopeptidase C30; InterPro: IPR008740 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of cysteine peptidases correspond to MEROPS peptidase family C30 (clan PA(C)). These peptidases are related to serine endopeptidases of family S1 and are restricted to RNA viruses, where they are involved in viral polyprotein processing during replication [, , ]. This entry represents the CoV M-pro which comprises three domains. Domains I and II are six-stranded antiparallel beta barrels and together resemble the architecture of chymotrypsin and of picornaviruses 3C proteinases. The substrate-binding site is located in a cleft between these two domains. The catalytic site is situated at the centre of the cleft. A long loop connects domain II to the C-terminal domain (domain III). This latter domain, a globular cluster of five helices, has been implicated in the proteolytic activity of M-pro. In the active site of M-pro, Cys and His form a catalytic dyad. In contrast to serine proteinases and other cysteine proteinases, which have a catalytic triad, there is no third catalytic residue present [, , , ]. ; GO: 0019082 viral protein processing; PDB: 2VJ1_B 2K7X_A 2Q6D_B 2Q6F_A 1P9U_E 1LVO_B 2AMP_B 1P9S_A 2ZU2_A 3D23_A ....
Probab=25.53 E-value=2e+02 Score=19.39 Aligned_cols=30 Identities=20% Similarity=0.101 Sum_probs=18.6
Q ss_pred EEEEEeCCCCEEeccccccCCceeEEEEEc
Q psy14742 18 KYLSTLAPGKVFGELAILYNCKRTATIKVH 47 (66)
Q Consensus 18 ~~v~~l~~G~~fGe~~l~~~~~r~~tv~A~ 47 (66)
.....++||+.|--++.-++.|+...-..+
T Consensus 73 y~F~~~kpG~sftIla~YdG~~~gvy~V~m 102 (293)
T PF05409_consen 73 YKFVRLKPGDSFTILACYDGVPSGVYHVNM 102 (293)
T ss_dssp EEE----TT-EEEEEEEETTEEEEEEEEEB
T ss_pred eEEEEcCCCCceEEEEEeCCeecceEEEEE
Confidence 357789999999999998888887543333
No 67
>PF06719 AraC_N: AraC-type transcriptional regulator N-terminus; InterPro: IPR009594 This entry represents the N terminus of bacterial ARAC-type transcriptional regulators. In Escherichia coli these regulate the L-arabinose operon through sensing the presence of arabinose, and when the sugar is present, transmitting this information from the arabinose-binding domains to the protein s DNA-binding domains []. This family might represent the N-terminal arm of the protein, which binds to the C-terminal DNA binding domains to hold them in a state where the protein prefers to loop and remain non-activating []. This domain is associated with the IPR000005 from INTERPRO domain.
Probab=23.41 E-value=1.7e+02 Score=17.13 Aligned_cols=53 Identities=9% Similarity=0.012 Sum_probs=29.0
Q ss_pred EEEEccEEEEEECCEEEEEeCCCCEEeccccccCCceeEEE-EE--ccceEEEEEEeecC
Q psy14742 4 RILSKGRVEVSRENKYLSTLAPGKVFGELAILYNCKRTATI-KV--HWTLNLKSWGSRTS 60 (66)
Q Consensus 4 ~iI~~G~v~v~~~~~~v~~l~~G~~fGe~~l~~~~~r~~tv-~A--~~~~~v~~l~~~~~ 60 (66)
++|++|.=++..+++. -.+.+|+++= .--+-|-...+ .| .++...+.+.++.+
T Consensus 27 ~~vlQG~K~~~~g~~~-~~Y~~g~~lv---~~~~lPv~~~v~~AS~~~P~l~l~l~ld~~ 82 (155)
T PF06719_consen 27 CIVLQGSKRVHLGDQV-FEYDAGQYLV---SSVDLPVESEVVEASPEEPYLALSLELDPA 82 (155)
T ss_pred EEEEeeeEEEEECCce-EEecCCcEEE---ecCCCcEEEEEeeccCCCCEEEEEEEcCHH
Confidence 6888998666665543 4588887641 11123333333 33 33456666666543
No 68
>PRK13502 transcriptional activator RhaR; Provisional
Probab=22.15 E-value=2.1e+02 Score=17.90 Aligned_cols=24 Identities=8% Similarity=0.113 Sum_probs=15.8
Q ss_pred EEEEccEEEEEECCEEEEEeCCCCE
Q psy14742 4 RILSKGRVEVSRENKYLSTLAPGKV 28 (66)
Q Consensus 4 ~iI~~G~v~v~~~~~~v~~l~~G~~ 28 (66)
.++.+|......+++. ..+.||++
T Consensus 42 ~~v~~G~~~~~i~~~~-~~l~~g~l 65 (282)
T PRK13502 42 VMVWRGNGLHVLNERP-YRITRGDL 65 (282)
T ss_pred EEEecCcEEEEECCEE-EeecCCcE
Confidence 5788888887776643 34566554
No 69
>PF01043 SecA_PP_bind: SecA preprotein cross-linking domain; InterPro: IPR011130 The SecA ATPase is involved in the insertion and retraction of preproteins through the plasma membrane. This domain has been found to cross-link to preproteins, thought to indicate a role in preprotein binding. The pre-protein cross-linking domain is comprised of two sub domains that are inserted within the ATPase domain [].; GO: 0017038 protein import, 0016020 membrane; PDB: 3DIN_B 3JUX_A 3IQY_A 2IBM_A 3DL8_A 3JV2_B 3IQM_A 1TF2_A 1TF5_A 1M74_A ....
Probab=22.02 E-value=93 Score=17.67 Aligned_cols=14 Identities=29% Similarity=0.413 Sum_probs=11.4
Q ss_pred ceEEEEccEEEEEE
Q psy14742 2 RDRILSKGRVEVSR 15 (66)
Q Consensus 2 ~~~iI~~G~v~v~~ 15 (66)
++|||..|.|.+.-
T Consensus 79 ~dYiV~dg~V~IVD 92 (113)
T PF01043_consen 79 VDYIVRDGEVVIVD 92 (113)
T ss_dssp TSEEEETTEEEEBC
T ss_pred cceEEEcCEEEEEE
Confidence 58999999987754
No 70
>PF13421 Band_7_1: SPFH domain-Band 7 family
Probab=21.93 E-value=1.7e+02 Score=18.38 Aligned_cols=25 Identities=24% Similarity=0.461 Sum_probs=16.8
Q ss_pred EEEccEEEEEE-CCEEEEEeCCCCEE
Q psy14742 5 ILSKGRVEVSR-ENKYLSTLAPGKVF 29 (66)
Q Consensus 5 iI~~G~v~v~~-~~~~v~~l~~G~~f 29 (66)
+|.+|++-+.. +++.+..++||.+-
T Consensus 18 iV~egQ~Avfv~~G~i~d~~~pG~y~ 43 (211)
T PF13421_consen 18 IVREGQCAVFVNDGKIADVFGPGRYT 43 (211)
T ss_pred EECCCCEEEEEECCEEEEEecCceEE
Confidence 56667777654 46777778887764
No 71
>cd04467 S1_aIF5A S1_aIF5A: Archaeal translation Initiation Factor 5A (aIF5A), S1-like RNA-binding domain. aIF5A is a homolog of eukaryotic eIF5A. IF5A is the only protein known to have the unusual amino acid hypusine. Hypusine is a post-translationally modified lysine and is essential for IF5A function. In yeast, eIF5A interacts with components of the 80S ribosome and translation elongation factors 2 (eEF2) in a hypusine-dependent manner. This C-terminal S1 domain resembles the cold-shock domain which binds RNA. Moreover, IF5A prefers binding to the actively translating ribosome. This evidence suggests that IF5A plays a role in translation elongation instead of translation initiation as previously proposed.
Probab=21.23 E-value=1.3e+02 Score=15.12 Aligned_cols=17 Identities=6% Similarity=-0.085 Sum_probs=7.8
Q ss_pred eeEEEEEccceEEEEEE
Q psy14742 40 RTATIKVHWTLNLKSWG 56 (66)
Q Consensus 40 r~~tv~A~~~~~v~~l~ 56 (66)
+.+++.+++...+-+++
T Consensus 4 ~~aqVisi~g~~vQlMD 20 (57)
T cd04467 4 KTGQVLSIMGDVVQLMD 20 (57)
T ss_pred eEEEEEEEcCCEEEEec
Confidence 44455555544444443
Done!