Query psy16824
Match_columns 85
No_of_seqs 106 out of 153
Neff 4.0
Searched_HMMs 46136
Date Fri Aug 16 22:59:29 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy16824.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/16824hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF04062 P21-Arc: ARP2/3 compl 100.0 9.3E-49 2E-53 286.6 2.5 79 1-79 97-175 (175)
2 KOG3155|consensus 100.0 5E-44 1.1E-48 260.6 5.2 79 1-79 95-173 (173)
3 cd00126 PAH Pancreatic Hormone 85.4 0.9 1.9E-05 26.0 2.3 22 26-47 8-29 (36)
4 PF00159 Hormone_3: Pancreatic 83.9 1.3 2.8E-05 25.3 2.5 22 26-47 8-29 (36)
5 smart00309 PAH Pancreatic horm 82.9 1.3 2.9E-05 25.3 2.3 22 26-47 8-29 (36)
6 PF09254 Endonuc-FokI_C: Restr 81.5 2.3 4.9E-05 32.2 3.7 32 29-71 89-120 (189)
7 COG3765 WzzB Chain length dete 57.8 22 0.00048 29.1 4.6 47 9-55 143-190 (347)
8 TIGR03779 Bac_Flav_CT_M Bacter 56.0 17 0.00036 30.2 3.7 36 19-55 133-180 (410)
9 PF08990 Docking: Erythronolid 55.2 11 0.00025 20.1 1.8 17 31-47 3-19 (27)
10 COG1298 FlhA Flagellar biosynt 53.9 27 0.00058 31.1 4.8 46 8-54 445-515 (696)
11 PF08606 Prp19: Prp19/Pso4-lik 49.9 33 0.00071 22.2 3.6 19 33-51 25-43 (70)
12 PF00631 G-gamma: GGL domain; 45.0 27 0.00059 21.2 2.6 19 30-48 2-20 (68)
13 PF02831 gpW: gpW; InterPro: 41.3 22 0.00047 22.6 1.8 19 30-48 38-56 (68)
14 PHA02865 MHC-like TNF binding 39.5 13 0.00028 30.2 0.7 11 11-21 261-271 (338)
15 TIGR03142 cytochro_ccmI cytoch 39.0 28 0.00061 23.2 2.2 16 40-55 63-78 (117)
16 COG2232 Predicted ATP-dependen 38.8 44 0.00095 27.9 3.6 28 8-35 334-364 (389)
17 PRK06553 lipid A biosynthesis 37.5 21 0.00045 27.2 1.5 35 13-51 153-187 (308)
18 PRK08943 lipid A biosynthesis 36.0 23 0.00049 27.0 1.5 34 14-51 152-185 (314)
19 cd05493 Bromo_ALL-1 Bromodomai 35.8 41 0.00088 23.9 2.7 19 48-66 113-131 (131)
20 PF08802 CytB6-F_Fe-S: Cytochr 35.3 7 0.00015 22.7 -1.1 13 69-81 6-18 (39)
21 PF13709 DUF4159: Domain of un 35.0 34 0.00073 25.2 2.2 14 28-41 68-81 (207)
22 PF11300 DUF3102: Protein of u 34.9 60 0.0013 22.9 3.4 21 32-52 7-34 (130)
23 COG4114 FhuF Uncharacterized F 34.8 40 0.00087 26.6 2.7 41 32-72 176-223 (251)
24 COG5500 Predicted integral mem 33.8 29 0.00062 25.5 1.7 26 17-42 108-135 (159)
25 PF00140 Sigma70_r1_2: Sigma-7 33.7 48 0.001 18.1 2.2 13 32-44 1-13 (37)
26 PF13006 Nterm_IS4: Insertion 33.6 39 0.00084 22.8 2.2 21 36-56 77-97 (98)
27 PRK15202 type III secretion ch 33.4 47 0.001 23.5 2.6 34 17-50 77-116 (117)
28 PF10372 YojJ: Bacterial membr 32.8 61 0.0013 20.8 2.9 16 32-47 12-27 (70)
29 PF11464 Rbsn: Rabenosyn Rab b 31.8 58 0.0013 19.1 2.4 15 31-45 4-18 (42)
30 PF04536 TPM: TLP18.3, Psb32 a 31.6 24 0.00052 22.3 0.9 24 28-51 3-26 (119)
31 cd00555 Maf Nucleotide binding 31.2 37 0.00081 24.5 1.9 16 20-35 74-89 (180)
32 PF08938 HBS1_N: HBS1 N-termin 31.0 8.3 0.00018 24.4 -1.4 22 28-49 21-42 (79)
33 KOG1205|consensus 29.6 40 0.00087 26.5 2.0 34 10-48 150-183 (282)
34 PF05438 TRH: Thyrotropin-rele 28.8 44 0.00095 25.8 2.0 47 2-48 7-60 (212)
35 cd07624 BAR_SNX7_30 The Bin/Am 28.3 97 0.0021 22.4 3.7 29 19-47 3-31 (200)
36 PRK06946 lipid A biosynthesis 28.1 39 0.00084 25.5 1.6 32 16-51 133-164 (293)
37 PRK00234 Maf-like protein; Rev 27.9 47 0.001 24.3 2.0 16 20-35 76-91 (192)
38 PF12891 Glyco_hydro_44: Glyco 27.9 26 0.00056 27.2 0.6 20 33-52 206-228 (239)
39 KOG2940|consensus 27.7 39 0.00084 27.3 1.6 17 40-56 52-68 (325)
40 PF11544 Spc42p: Spindle pole 27.6 80 0.0017 20.7 2.8 19 29-47 18-36 (76)
41 TIGR02208 lipid_A_msbB lipid A 27.2 30 0.00066 26.2 0.9 34 14-51 143-176 (305)
42 PRK00078 Maf-like protein; Rev 27.0 49 0.0011 24.2 1.9 16 20-35 78-93 (192)
43 PF03221 HTH_Tnp_Tc5: Tc5 tran 26.4 1.3E+02 0.0028 17.0 3.5 45 29-73 2-60 (66)
44 PF14357 DUF4404: Domain of un 26.1 50 0.0011 21.4 1.7 13 35-47 2-14 (85)
45 PF09396 Thrombin_light: Throm 25.6 73 0.0016 19.4 2.2 28 13-40 18-47 (49)
46 PRK00648 Maf-like protein; Rev 25.5 54 0.0012 23.9 1.9 16 20-35 79-94 (191)
47 PF11035 SnAPC_2_like: Small n 25.4 1.2E+02 0.0027 25.0 4.1 43 29-72 57-109 (344)
48 cd00068 GGL G protein gamma su 25.4 58 0.0013 19.5 1.8 17 33-49 2-18 (57)
49 PRK00884 Maf-like protein; Rev 25.3 54 0.0012 24.1 1.9 16 20-35 76-91 (194)
50 PF09509 Hypoth_Ymh: Protein o 25.3 1.1E+02 0.0024 20.9 3.4 25 33-57 28-58 (125)
51 COG2207 AraC AraC-type DNA-bin 25.3 99 0.0021 18.8 2.9 19 34-52 64-82 (127)
52 TIGR00172 maf MAF protein. Thi 25.1 55 0.0012 23.8 1.9 16 20-35 77-92 (183)
53 PRK00032 Maf-like protein; Rev 24.9 56 0.0012 23.9 1.9 16 20-35 77-92 (190)
54 smart00224 GGL G protein gamma 24.8 59 0.0013 19.8 1.7 16 33-48 2-17 (63)
55 PF06183 DinI: DinI-like famil 24.8 83 0.0018 19.5 2.4 18 38-56 5-22 (65)
56 PRK04056 Maf-like protein; Rev 24.6 58 0.0013 23.6 1.9 16 20-35 75-90 (180)
57 KOG4119|consensus 24.5 95 0.0021 20.0 2.7 21 28-48 5-25 (71)
58 COG1694 MazG Predicted pyropho 24.5 1.1E+02 0.0023 19.6 3.0 23 33-55 44-68 (102)
59 PRK14364 Maf-like protein; Pro 24.4 58 0.0013 23.6 1.9 16 20-35 71-86 (181)
60 PRK00148 Maf-like protein; Rev 24.0 60 0.0013 23.8 1.9 16 20-35 75-90 (194)
61 PRK01441 Maf-like protein; Rev 24.0 61 0.0013 24.0 2.0 16 20-35 86-101 (207)
62 PRK08905 lipid A biosynthesis 23.9 37 0.00079 25.6 0.8 31 17-51 124-154 (289)
63 PF09720 Unstab_antitox: Putat 23.2 92 0.002 17.9 2.3 31 43-74 7-37 (54)
64 PF12019 GspH: Type II transpo 23.1 1.3E+02 0.0028 18.0 3.1 22 31-52 1-22 (114)
65 PRK14366 Maf-like protein; Pro 23.1 64 0.0014 23.8 1.9 16 20-35 80-95 (195)
66 TIGR03418 chol_sulf_TF putativ 23.0 71 0.0015 22.9 2.1 21 34-54 31-51 (291)
67 PRK11716 DNA-binding transcrip 22.5 32 0.0007 23.9 0.3 20 35-54 8-27 (269)
68 PRK10219 DNA-binding transcrip 22.5 1.2E+02 0.0025 19.1 2.9 19 34-52 49-67 (107)
69 KOG3382|consensus 22.4 1E+02 0.0022 22.6 2.8 26 33-58 16-42 (151)
70 PF12174 RST: RCD1-SRO-TAF4 (R 22.4 1.2E+02 0.0025 19.2 2.8 18 38-55 47-64 (70)
71 cd04789 HTH_Cfa Helix-Turn-Hel 22.3 57 0.0012 21.1 1.4 33 15-47 56-88 (102)
72 PF02099 Josephin: Josephin; 22.3 34 0.00074 24.5 0.4 28 17-45 121-148 (157)
73 PF15601 Imm42: Immunity prote 22.3 1E+02 0.0022 22.0 2.7 40 16-55 37-78 (134)
74 PRK14363 Maf-like protein; Pro 22.1 70 0.0015 23.8 2.0 16 20-35 75-90 (204)
75 PF10229 DUF2246: Uncharacteri 21.9 45 0.00097 26.3 1.0 11 8-18 51-61 (278)
76 PRK11546 zraP zinc resistance 21.9 1.3E+02 0.0029 21.6 3.3 29 19-47 77-106 (143)
77 PRK06012 flhA flagellar biosyn 21.8 1.7E+02 0.0036 26.0 4.5 45 9-54 447-516 (697)
78 PRK13348 chromosome replicatio 21.8 42 0.00091 24.1 0.7 20 34-53 32-51 (294)
79 PF02545 Maf: Maf-like protein 21.8 64 0.0014 23.6 1.7 16 20-35 78-93 (195)
80 PF03190 Thioredox_DsbH: Prote 21.8 55 0.0012 23.8 1.3 32 13-44 99-142 (163)
81 PF01231 IDO: Indoleamine 2,3- 21.8 74 0.0016 26.0 2.2 18 34-51 405-422 (422)
82 PRK06628 lipid A biosynthesis 21.7 59 0.0013 24.5 1.6 32 16-51 137-168 (290)
83 COG4047 Uncharacterized protei 21.6 70 0.0015 25.1 1.9 13 59-71 56-68 (243)
84 PLN03143 nudix hydrolase; Prov 21.2 89 0.0019 24.6 2.5 42 7-48 32-79 (291)
85 COG1503 eRF1 Peptide chain rel 21.0 63 0.0014 27.0 1.7 29 14-47 24-52 (411)
86 PF14852 Fis1_TPR_N: Fis1 N-te 20.8 1E+02 0.0023 16.9 2.1 14 44-57 20-33 (35)
87 PRK13480 3'-5' exoribonuclease 20.7 1E+02 0.0022 24.4 2.7 40 16-55 95-136 (314)
88 PRK14368 Maf-like protein; Pro 20.5 76 0.0016 23.3 1.9 16 20-35 79-94 (193)
89 PRK14365 Maf-like protein; Pro 20.4 78 0.0017 23.3 1.9 16 20-35 77-92 (197)
90 PRK13456 DNA protection protei 20.4 1.5E+02 0.0033 22.2 3.5 43 32-75 51-99 (186)
91 PRK13503 transcriptional activ 20.2 1.2E+02 0.0026 21.7 2.9 19 34-52 215-233 (278)
92 PRK14361 Maf-like protein; Pro 20.2 79 0.0017 23.1 1.9 16 20-35 72-87 (187)
93 PF12785 VESA1_N: Variant eryt 20.2 39 0.00084 28.8 0.3 30 19-48 39-68 (462)
94 PRK01526 Maf-like protein; Rev 20.1 79 0.0017 23.5 1.9 15 21-35 85-99 (205)
95 COG3248 Tsx Nucleoside-binding 20.0 20 0.00043 28.6 -1.3 27 8-35 153-180 (284)
No 1
>PF04062 P21-Arc: ARP2/3 complex ARPC3 (21 kDa) subunit; InterPro: IPR007204 The Arp2/3 complex is a seven-protein assembly that is critical for actin nucleation and branching in cells. Arp2/3 nucleates new actin filaments while bound to existing filaments, thus creating a branched network []. The complex consists of Arp2, Arp3, p41, p34, p21, p20 and p16. Subunits p34 and p20 constitute the core of the structure, with the remaining subunits located peripherally []. This entry describes the p21 subunit. Proteins such as WASp and Scar1 may mediate receptor signalling through interactions with p21-Arc, resulting in the activation of Arc2/3 complex activity [].; GO: 0030833 regulation of actin filament polymerization, 0005856 cytoskeleton; PDB: 3DWL_J 2P9P_E 2P9N_E 2P9K_E 1TYQ_E 1U2V_E 3RSE_E 2P9U_E 3DXM_E 2P9S_E ....
Probab=100.00 E-value=9.3e-49 Score=286.58 Aligned_cols=79 Identities=65% Similarity=1.263 Sum_probs=62.6
Q ss_pred CccccccCcCCCCCCCCCCcccccCCCChhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCChhhhhhccchhhhhcc
Q psy16824 1 MYSLALAKFDLPGEAGFPLNSVYAKPQSQNEADLMKNYLTQVRQETGLRVCERVFNTPDGKPSKWWLCFSKKRFMDKSL 79 (85)
Q Consensus 1 l~~lal~~F~iPGd~gFpLn~~y~~P~~~~E~d~lr~Yl~QlRqE~~~RL~e~vy~~~~~~psKwWm~F~KRkFM~KsL 79 (85)
|+++|+++|+||||+|||||++|++|+|++|+|+||+||+|+|||+|.|||||||++++++||||||||+|||||||||
T Consensus 97 l~~lal~~F~iPGd~gFpLn~~y~~P~~~~e~d~lR~Yl~QlRqElg~RL~ervf~~~~~~psKwW~~FaKRkFM~KsL 175 (175)
T PF04062_consen 97 LYNLALDNFPIPGDPGFPLNSLYAKPANRQEADLLRQYLTQLRQELGLRLLERVFDDGDGKPSKWWMCFAKRKFMNKSL 175 (175)
T ss_dssp HHHHHHS----TTSTT-TTTTTS---SSHHHHHHHHHHHHHHHHHHCCCCCHHHB-SSSS-B-HCCCCCCC--GGG--T
T ss_pred HHHHHHhcCCCCCCCCCCccccccCCCChhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCChhhhhhhhhccccccC
Confidence 5899999999999999999999999999999999999999999999999999999988899999999999999999997
No 2
>KOG3155|consensus
Probab=100.00 E-value=5e-44 Score=260.61 Aligned_cols=79 Identities=62% Similarity=1.228 Sum_probs=77.5
Q ss_pred CccccccCcCCCCCCCCCCcccccCCCChhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCChhhhhhccchhhhhcc
Q psy16824 1 MYSLALAKFDLPGEAGFPLNSVYAKPQSQNEADLMKNYLTQVRQETGLRVCERVFNTPDGKPSKWWLCFSKKRFMDKSL 79 (85)
Q Consensus 1 l~~lal~~F~iPGd~gFpLn~~y~~P~~~~E~d~lr~Yl~QlRqE~~~RL~e~vy~~~~~~psKwWm~F~KRkFM~KsL 79 (85)
|++||+++|+||||+|||||+||++|.+++|+|.||+||+|+|||+|.|||++||++.+++|||||+||+|||||||||
T Consensus 95 l~~lal~~~~iPGe~GFPln~ly~~p~s~~~~e~mr~Yl~Q~RqE~~~RL~~~v~~~~~d~PsKwW~cF~kRrFMnksl 173 (173)
T KOG3155|consen 95 LYNLALENFPIPGEPGFPLNALYTLPASKQDAELMRAYLQQFRQETGLRLLEKVYDTPKDKPSKWWLCFAKRRFMNKSL 173 (173)
T ss_pred HHHhhhhcccCCCCCCCCchhhhcCCCCCCchHHHHHHHHHHHHHhhhhHHHHhcCCCCCCcchhHHHHHHHHhhhccC
Confidence 6899999999999999999999999999999999999999999999999999999999999999999999999999997
No 3
>cd00126 PAH Pancreatic Hormone domain, a regulator of pancreatic and gastrointestinal functions; neuropeptide Y (NPY)b, peptide YY (PYY), and pancreatic polypetide (PP) are closely related; propeptide is enzymatically cleaved to yield the mature active peptide with amidated C-terminal ends; receptor binding and activation functions may reside in the N- and C-termini respectively; occurs in neurons, intestinal endocrine cells, and pancreas; exist as monomers and dimers
Probab=85.44 E-value=0.9 Score=26.03 Aligned_cols=22 Identities=18% Similarity=0.483 Sum_probs=17.0
Q ss_pred CCChhHHHHHHHHHHHHHHHHH
Q psy16824 26 PQSQNEADLMKNYLTQVRQETG 47 (85)
Q Consensus 26 P~~~~E~d~lr~Yl~QlRqE~~ 47 (85)
|++-.-.|.|++|+++||+=+.
T Consensus 8 Pg~~a~~eel~~Y~~~L~~Yin 29 (36)
T cd00126 8 PGDDASPEELRQYLAALREYIN 29 (36)
T ss_pred CCCCCCHHHHHHHHHHHHHHHH
Confidence 4555557999999999998653
No 4
>PF00159 Hormone_3: Pancreatic hormone peptide; InterPro: IPR001955 Pancreatic hormone (PP) [] is a peptide synthesized in pancreatic islets of Langherhans, which acts as a regulator of pancreatic and gastrointestinal functions. The hormone is produced as a larger propeptide, which is enzymatically cleaved to yield the mature active peptide: this is 36 amino acids in length [] and has an amidated C terminus []. The hormone has a globular structure, residues 2-8 forming a left-handed poly-proline-II-like helix, residues 9-13 a beta turn, and 14-32 an alpha-helix,held close to the first helix by hydrophobic interactions []. Unlike glucagon, another peptide hormone, the structure of pancreatic peptide is preserved in aqueous solution []. Both N and C termini are required for activity: receptor binding and activation functions may reside in the N and C termini respectively []. Pancreatic hormone is part of a wider family of active peptides that includes: Neuropeptide Y (NPY) [], one of the most abundant peptides in the mammalian nervous system. NPY is implicated in the control of feeding and the secretion of the gonadotrophin-releasing hormone. Peptide YY (PYY) []. PPY is a gut peptide that inhibits exocrine pancreatic secretion, has a vasoconstrictory action and inhibits jejunal and colonic mobility. Various NPY and PYY-like polypeptides from fish and amphibians [, ]. Neuropeptide F (NPF) from invertebrates such as worms and snail. Skin peptide Tyr-Tyr (SPYY) from the frog Phyllomedusa bicolor. SPYY shows a large spectra of antibacterial and antifungal activity. All these peptides are 36 to 39 amino acids long. Like most active peptides, their C-terminal is amidated and they are synthesized as larger protein precursors.; GO: 0005179 hormone activity, 0005576 extracellular region; PDB: 1LJV_A 1BBA_A 1V1D_A 1PPT_A 2H3T_A 2H4B_A 2BF9_A 2H3S_B 1K8V_A 2DF0_A ....
Probab=83.94 E-value=1.3 Score=25.29 Aligned_cols=22 Identities=14% Similarity=0.315 Sum_probs=16.3
Q ss_pred CCChhHHHHHHHHHHHHHHHHH
Q psy16824 26 PQSQNEADLMKNYLTQVRQETG 47 (85)
Q Consensus 26 P~~~~E~d~lr~Yl~QlRqE~~ 47 (85)
|++-.-.|.|++|+.+||+=+.
T Consensus 8 P~~~aspeel~~Y~~~L~~Y~~ 29 (36)
T PF00159_consen 8 PGDFASPEELAQYYAALRHYIN 29 (36)
T ss_dssp SSTTSSHHHHHHHHHHHHHHHH
T ss_pred CCCCCCHHHHHHHHHHHHHHHH
Confidence 4444555789999999998653
No 5
>smart00309 PAH Pancreatic hormones / neuropeptide F / peptide YY family. Pancreatic hormone is a regulator of pancreatic and gastrointestinal functions.
Probab=82.87 E-value=1.3 Score=25.32 Aligned_cols=22 Identities=18% Similarity=0.465 Sum_probs=16.2
Q ss_pred CCChhHHHHHHHHHHHHHHHHH
Q psy16824 26 PQSQNEADLMKNYLTQVRQETG 47 (85)
Q Consensus 26 P~~~~E~d~lr~Yl~QlRqE~~ 47 (85)
|++..-.|.|++|+.+||+=+.
T Consensus 8 Pg~~a~~e~l~~Y~~~L~~Yin 29 (36)
T smart00309 8 PGDDASPEDLRQYLAALREYIN 29 (36)
T ss_pred CCCCCCHHHHHHHHHHHHHHHH
Confidence 4444445779999999998653
No 6
>PF09254 Endonuc-FokI_C: Restriction endonuclease FokI, C terminal; InterPro: IPR015334 There are four classes of restriction endonucleases: types I, II,III and IV. All types of enzymes recognise specific short DNA sequences and carry out the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. They differ in their recognition sequence, subunit composition, cleavage position, and cofactor requirements [, ], as summarised below: Type I enzymes (3.1.21.3 from EC) cleave at sites remote from recognition site; require both ATP and S-adenosyl-L-methionine to function; multifunctional protein with both restriction and methylase (2.1.1.72 from EC) activities. Type II enzymes (3.1.21.4 from EC) cleave within or at short specific distances from recognition site; most require magnesium; single function (restriction) enzymes independent of methylase. Type III enzymes (3.1.21.5 from EC) cleave at sites a short distance from recognition site; require ATP (but doesn't hydrolyse it); S-adenosyl-L-methionine stimulates reaction but is not required; exists as part of a complex with a modification methylase methylase (2.1.1.72 from EC). Type IV enzymes target methylated DNA. Type II restriction endonucleases (3.1.21.4 from EC) are components of prokaryotic DNA restriction-modification mechanisms that protect the organism against invading foreign DNA. These site-specific deoxyribonucleases catalyse the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. Of the 3000 restriction endonucleases that have been characterised, most are homodimeric or tetrameric enzymes that cleave target DNA at sequence-specific sites close to the recognition site. For homodimeric enzymes, the recognition site is usually a palindromic sequence 4-8 bp in length. Most enzymes require magnesium ions as a cofactor for catalysis. Although they can vary in their mode of recognition, many restriction endonucleases share a similar structural core comprising four beta-strands and one alpha-helix, as well as a similar mechanism of cleavage, suggesting a common ancestral origin []. However, there is still considerable diversity amongst restriction endonucleases [, ]. The target site recognition process triggers large conformational changes of the enzyme and the target DNA, leading to the activation of the catalytic centres. Like other DNA binding proteins, restriction enzymes are capable of non-specific DNA binding as well, which is the prerequisite for efficient target site location by facilitated diffusion. Non-specific binding usually does not involve interactions with the bases but only with the DNA backbone []. This entry represents the C-terminal domain of FokI restriction endonucleases, which adopts a structure consisting of an alpha/beta/alpha core containing a five-stranded beta-sheet. FokI recognises the double-stranded DNA sequence 5'-GGATG-3' and cleave DNA phosphodiester groups 9 base pairs away on this strand and 13 base pairs away on the complementary strand [, ].; GO: 0003677 DNA binding, 0009036 Type II site-specific deoxyribonuclease activity, 0009307 DNA restriction-modification system; PDB: 1FOK_A 2FOK_B.
Probab=81.55 E-value=2.3 Score=32.17 Aligned_cols=32 Identities=28% Similarity=0.677 Sum_probs=21.5
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCChhhhhhcc
Q psy16824 29 QNEADLMKNYLTQVRQETGLRVCERVFNTPDGKPSKWWLCFSK 71 (85)
Q Consensus 29 ~~E~d~lr~Yl~QlRqE~~~RL~e~vy~~~~~~psKwWm~F~K 71 (85)
-.++|.|--|+.+-|.- ++.-.|+|||-+|-+
T Consensus 89 i~q~DeM~RYI~~n~~R-----------d~~~npnkWWe~f~~ 120 (189)
T PF09254_consen 89 ISQADEMIRYIEENQER-----------DEKRNPNKWWENFPE 120 (189)
T ss_dssp HHHHHHHHHHHHHHHH-------------TTTSTT-GGGGS-T
T ss_pred cccHHHHHHHHHHHhcc-----------cccCCCchhHHhccc
Confidence 46778888888877642 234569999999976
No 7
>COG3765 WzzB Chain length determinant protein [Cell envelope biogenesis, outer membrane]
Probab=57.81 E-value=22 Score=29.13 Aligned_cols=47 Identities=17% Similarity=0.306 Sum_probs=37.4
Q ss_pred cCCCCCCCCC-CcccccCCCChhHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy16824 9 FDLPGEAGFP-LNSVYAKPQSQNEADLMKNYLTQVRQETGLRVCERVF 55 (85)
Q Consensus 9 F~iPGd~gFp-Ln~~y~~P~~~~E~d~lr~Yl~QlRqE~~~RL~e~vy 55 (85)
|..+|+.+++ ++..|+.+....-.+.+++|++++=|.....+++++=
T Consensus 143 ~~~~~~~~~~~~~vs~~a~t~edAq~~L~gyI~~~s~~v~~el~~~l~ 190 (347)
T COG3765 143 FKPGGFDLATNLTVSFTAETAEDAQDLLRGYIAFVSQKVAQELLDNLK 190 (347)
T ss_pred ccCCccccchhheeeeecCCcHHHHHHHHHHHHHHhHHHHHHHHHHHH
Confidence 4456666665 8888988877667799999999999999888887764
No 8
>TIGR03779 Bac_Flav_CT_M Bacteroides conjugative transposon TraM protein. Members of this protein family are designated TraM and are found in a proposed transfer region of a class of conjugative transposon found in the Bacteroides lineage.
Probab=56.00 E-value=17 Score=30.22 Aligned_cols=36 Identities=19% Similarity=0.456 Sum_probs=28.0
Q ss_pred CcccccCCCChhHHHHHHHHHHHHHHHHH------------HHHHHHhc
Q psy16824 19 LNSVYAKPQSQNEADLMKNYLTQVRQETG------------LRVCERVF 55 (85)
Q Consensus 19 Ln~~y~~P~~~~E~d~lr~Yl~QlRqE~~------------~RL~e~vy 55 (85)
|+++|..|.+..|.+ ++.=+.|++.++. ..|+||-|
T Consensus 133 LgsFYe~~~~~~e~e-l~~~veel~~~l~~~~~~~~~~~eq~almEKsy 180 (410)
T TIGR03779 133 LGSFYEYPKTDEEKE-LLREVEELESRLATEPSPAPELEEQLALMEKSY 180 (410)
T ss_pred hhhhccCCCCchhHH-HHHHHHHHHHHHhhhcccccchhHHHHHHHHhH
Confidence 778999998888888 7777888887664 56777666
No 9
>PF08990 Docking: Erythronolide synthase docking; InterPro: IPR015083 The N-terminal docking domain found in modular polyketide synthase assumes an alpha-helical structure, wherein two alpha-helices are connected by a short loop. Two such N-terminal domains dimerise to form amphipathic parallel alpha-helical coiled coils: dimerisation is essential for protein function []. ; GO: 0016740 transferase activity, 0048037 cofactor binding; PDB: 2HG4_E.
Probab=55.21 E-value=11 Score=20.09 Aligned_cols=17 Identities=29% Similarity=0.567 Sum_probs=12.8
Q ss_pred HHHHHHHHHHHHHHHHH
Q psy16824 31 EADLMKNYLTQVRQETG 47 (85)
Q Consensus 31 E~d~lr~Yl~QlRqE~~ 47 (85)
++|++|.||++.=-||-
T Consensus 3 ~e~kLr~YLkr~t~eL~ 19 (27)
T PF08990_consen 3 NEDKLRDYLKRVTAELR 19 (27)
T ss_dssp -HCHHHHHHHHHHHHHH
T ss_pred cHHHHHHHHHHHHHHHH
Confidence 56889999998776663
No 10
>COG1298 FlhA Flagellar biosynthesis pathway, component FlhA [Cell motility and secretion / Intracellular trafficking and secretion]
Probab=53.89 E-value=27 Score=31.09 Aligned_cols=46 Identities=24% Similarity=0.535 Sum_probs=36.1
Q ss_pred CcCCCC----CCCCCCcccccCCCChhHH-------------------HHHHHHHHHH--HHHHHHHHHHHh
Q psy16824 8 KFDLPG----EAGFPLNSVYAKPQSQNEA-------------------DLMKNYLTQV--RQETGLRVCERV 54 (85)
Q Consensus 8 ~F~iPG----d~gFpLn~~y~~P~~~~E~-------------------d~lr~Yl~Ql--RqE~~~RL~e~v 54 (85)
.=.||| ||-|=|.+++-.+..++|+ |.+|+++.+| |||. ..|+|++
T Consensus 445 ~~~i~G~~t~ePaFGl~a~WI~~~~ke~A~~~GYtVvd~~sVi~THl~evir~~a~eLl~rqev-q~Lld~l 515 (696)
T COG1298 445 TGEIPGEETKDPAFGLPAYWIEEDQKEEAQILGYTVVDPSSVIATHLSEVIRNHAHELLGRQEV-QQLLDRL 515 (696)
T ss_pred CCCCCCccccCCCCCCceeecCchhhhHHHhcCCeeecchHHHHHhHHHHHHHhHHHHhHHHHH-HHHHHHH
Confidence 456888 7889999999999888888 7788888877 7776 5555544
No 11
>PF08606 Prp19: Prp19/Pso4-like; InterPro: IPR013915 This region is found specifically in PRP19-like protein. The region represented by this protein covers the sequence implicated in self-interaction and a coiled-coiled motif []. PRP19-like proteins form an oligomer that is necessary for spliceosome assembly [].
Probab=49.91 E-value=33 Score=22.20 Aligned_cols=19 Identities=26% Similarity=0.254 Sum_probs=15.2
Q ss_pred HHHHHHHHHHHHHHHHHHH
Q psy16824 33 DLMKNYLTQVRQETGLRVC 51 (85)
Q Consensus 33 d~lr~Yl~QlRqE~~~RL~ 51 (85)
=.+|+=|.|+||||+..|-
T Consensus 25 f~LRk~l~~~rqELs~aLY 43 (70)
T PF08606_consen 25 FTLRKQLDQTRQELSHALY 43 (70)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 4589999999999986543
No 12
>PF00631 G-gamma: GGL domain; InterPro: IPR015898 This entry represents the G protein gamma subunit and the GGL (G protein gamma-like) domain, which are related in sequence and are comprised of an extended alpha-helical polypeptide. The G protein gamma subunit forms a stable dimer with the beta subunit, but it does not make any contact with the alpha subunit, which contacts the opposite face of the beta subunit. The GGL domain is found in several RGS (regulators of G protein signaling) proteins. GGL domains can interact with beta subunits to form novel dimers that prevent gamma subunit binding, and may prevent heterotrimer formation by inhibiting alpha subunit binding. The interaction between G protein beta-5 neuro-specific isoforms and RGS GGL domains may represent a general mode of binding between beta-propeller proteins and their partners [].; GO: 0004871 signal transducer activity, 0007186 G-protein coupled receptor protein signaling pathway, 0005834 heterotrimeric G-protein complex; PDB: 3PSC_G 3SN6_G 1OMW_G 2BCJ_G 1GG2_G 3PVW_G 3PVU_G 3AH8_G 3CIK_G 1GP2_G ....
Probab=45.00 E-value=27 Score=21.24 Aligned_cols=19 Identities=37% Similarity=0.431 Sum_probs=15.6
Q ss_pred hHHHHHHHHHHHHHHHHHH
Q psy16824 30 NEADLMKNYLTQVRQETGL 48 (85)
Q Consensus 30 ~E~d~lr~Yl~QlRqE~~~ 48 (85)
++.+.++.-+.|||+|+..
T Consensus 2 ~~~~~l~~ei~~L~~el~~ 20 (68)
T PF00631_consen 2 QEKDQLKREIEQLRQELER 20 (68)
T ss_dssp THHHHHHHHHHHHHHHHTS
T ss_pred hHHHHHHHHHHHHHHHHcc
Confidence 5677888888999999873
No 13
>PF02831 gpW: gpW; InterPro: IPR004174 GpW is a 68 residue protein known to be present in phage particles. Extracts of phage-infected cells lacking GpW contain DNA-filled heads, and active tails, but no infectious virions. GpW is required for the addition of GpFII to the head, which is, in turn, required for the attachment of tails. Since GpFII and tails are known to be attached at the connector, GpW is also likely to assemble at this site. The addition of GpW to filled heads increases the DNase resistance of the packaged DNA, suggesting that GpW either forms a plug at the connector to prevent ejection of the DNA, or binds directly to the DNA. The large number of positively charged residues in GpW (its calculated pI is 10.8) is consistent with a role in DNA interaction [].; GO: 0019067 viral assembly, maturation, egress, and release; PDB: 2L6Q_A 2L6R_A 1HYW_A.
Probab=41.35 E-value=22 Score=22.59 Aligned_cols=19 Identities=16% Similarity=0.461 Sum_probs=15.9
Q ss_pred hHHHHHHHHHHHHHHHHHH
Q psy16824 30 NEADLMKNYLTQVRQETGL 48 (85)
Q Consensus 30 ~E~d~lr~Yl~QlRqE~~~ 48 (85)
.....|+.|+.+|..+||.
T Consensus 38 a~i~~L~~yI~~L~~~Lg~ 56 (68)
T PF02831_consen 38 ANIGDLRAYIQQLEAQLGI 56 (68)
T ss_dssp GGHHHHHHHHHHHHHHTTS
T ss_pred CCHHHHHHHHHHHHHHhCc
Confidence 4557799999999999984
No 14
>PHA02865 MHC-like TNF binding protein; Provisional
Probab=39.54 E-value=13 Score=30.16 Aligned_cols=11 Identities=64% Similarity=1.226 Sum_probs=9.2
Q ss_pred CCCCCCCCCcc
Q psy16824 11 LPGEAGFPLNS 21 (85)
Q Consensus 11 iPGd~gFpLn~ 21 (85)
=|||||||+|.
T Consensus 261 e~ge~~~~~~~ 271 (338)
T PHA02865 261 EPGEPGFPTNT 271 (338)
T ss_pred cccccccccce
Confidence 36999999984
No 15
>TIGR03142 cytochro_ccmI cytochrome c-type biogenesis protein CcmI. This TPR repeat-containing protein is the CcmI protein (also called CycH) of c-type cytochrome biogenesis. CcmI is thought to act as an apo-cytochrome c chaperone. This model describes the N-terminal region of the protein, Members of this protein family
Probab=39.00 E-value=28 Score=23.25 Aligned_cols=16 Identities=19% Similarity=0.156 Sum_probs=12.8
Q ss_pred HHHHHHHHHHHHHHhc
Q psy16824 40 TQVRQETGLRVCERVF 55 (85)
Q Consensus 40 ~QlRqE~~~RL~e~vy 55 (85)
.|+|+|+..||++-+=
T Consensus 63 ~~~~~El~rrLL~d~~ 78 (117)
T TIGR03142 63 EAARAELQRRLLADIP 78 (117)
T ss_pred HHHHHHHHHHHHHCcc
Confidence 6788999999997543
No 16
>COG2232 Predicted ATP-dependent carboligase related to biotin carboxylase [General function prediction only]
Probab=38.84 E-value=44 Score=27.86 Aligned_cols=28 Identities=39% Similarity=0.565 Sum_probs=23.2
Q ss_pred CcCCCC---CCCCCCcccccCCCChhHHHHH
Q psy16824 8 KFDLPG---EAGFPLNSVYAKPQSQNEADLM 35 (85)
Q Consensus 8 ~F~iPG---d~gFpLn~~y~~P~~~~E~d~l 35 (85)
..|-|| +.|-||-+++++-.++++++.|
T Consensus 334 DiP~~GtviekgePl~sviA~~nt~~~a~~~ 364 (389)
T COG2232 334 DIPRPGTVIEKGEPLCSVIASSNTRSGAESM 364 (389)
T ss_pred cCCCCCcccCCCCceeeeeeccCCHHHHHHH
Confidence 456666 8999999999998888888766
No 17
>PRK06553 lipid A biosynthesis lauroyl acyltransferase; Provisional
Probab=37.51 E-value=21 Score=27.18 Aligned_cols=35 Identities=11% Similarity=0.221 Sum_probs=26.3
Q ss_pred CCCCCCCcccccCCCChhHHHHHHHHHHHHHHHHHHHHH
Q psy16824 13 GEAGFPLNSVYAKPQSQNEADLMKNYLTQVRQETGLRVC 51 (85)
Q Consensus 13 Gd~gFpLn~~y~~P~~~~E~d~lr~Yl~QlRqE~~~RL~ 51 (85)
+.-|+|++.+|.+..|+ .+..++.++|...|.+++
T Consensus 153 ~~~~~~~~~vyr~~~n~----~~d~~i~~~R~~~g~~~i 187 (308)
T PRK06553 153 AAFGLDVTVLFRPPNNP----YAARKVLEARRTTMGGLV 187 (308)
T ss_pred HHcCCceEEEEecCCCh----HHHHHHHHHHHHcCCCcc
Confidence 34578999999887764 566778888888776654
No 18
>PRK08943 lipid A biosynthesis (KDO)2-(lauroyl)-lipid IVA acyltransferase; Validated
Probab=35.97 E-value=23 Score=27.02 Aligned_cols=34 Identities=21% Similarity=0.422 Sum_probs=26.5
Q ss_pred CCCCCCcccccCCCChhHHHHHHHHHHHHHHHHHHHHH
Q psy16824 14 EAGFPLNSVYAKPQSQNEADLMKNYLTQVRQETGLRVC 51 (85)
Q Consensus 14 d~gFpLn~~y~~P~~~~E~d~lr~Yl~QlRqE~~~RL~ 51 (85)
.-|+|+..||.+..|+ .+..++.+.|+..|.+++
T Consensus 152 ~~~~~~~~vyr~~~n~----~~d~~~~~~R~~~g~~~i 185 (314)
T PRK08943 152 SQGQPMAAMFHNQRNP----LFDWLWNRVRRRFGGRLH 185 (314)
T ss_pred hcCCCccEEEeCCCCH----HHHHHHHHHHhhcCCeee
Confidence 3478999999987764 567788899998876655
No 19
>cd05493 Bromo_ALL-1 Bromodomain, ALL-1 like proteins. ALL-1 is a vertebrate homologue of Drosophila trithorax and is often affected in chromosomal rearrangements that are linked to acute leukemias, such as acute lymphocytic leukemia (ALL). Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine.
Probab=35.79 E-value=41 Score=23.92 Aligned_cols=19 Identities=26% Similarity=0.566 Sum_probs=16.6
Q ss_pred HHHHHHhcCCCCCCCChhh
Q psy16824 48 LRVCERVFNTPDGKPSKWW 66 (85)
Q Consensus 48 ~RL~e~vy~~~~~~psKwW 66 (85)
.|++|+||.--+.+.+|+|
T Consensus 113 ik~me~vf~Wf~~~d~k~W 131 (131)
T cd05493 113 IKLMESVFPWFNSEDPKLW 131 (131)
T ss_pred HHHHHHhcccccccCCCCC
Confidence 7899999987777888888
No 20
>PF08802 CytB6-F_Fe-S: Cytochrome B6-F complex Fe-S subunit ; InterPro: IPR014909 The cytochrome b6-f complex mediates electron transfer between photosystem II (PSII) and photosystem I (PSI), cyclic electron flow around PSI, and state transitions. The cytochrome b6-f complex has 4 large subunits, these are: cytochrome b6, subunit IV (17 kDa polypeptide, PetD), cytochrome f and the Rieske protein, while the 4 small subunits are: PetG, PetL, PetM and PetN. The complex functions as a dimer. This protein corresponds to the alpha helical transmembrane domain of the cytochrome b6-f complex Rieske iron-sulphur subunit. ; GO: 0009496 plastoquinol-plastocyanin reductase activity, 0051537 2 iron, 2 sulfur cluster binding, 0055114 oxidation-reduction process, 0042651 thylakoid membrane; PDB: 1Q90_R 1VF5_D 2E75_D 2E74_D 2E76_D 2D2C_Q 2ZT9_D.
Probab=35.33 E-value=7 Score=22.68 Aligned_cols=13 Identities=23% Similarity=0.473 Sum_probs=10.1
Q ss_pred hccchhhhhcccC
Q psy16824 69 FSKKRFMDKSLTA 81 (85)
Q Consensus 69 F~KRkFM~KsLs~ 81 (85)
+.||+|||.-+.|
T Consensus 6 m~RR~lmN~ll~G 18 (39)
T PF08802_consen 6 MSRRQLMNLLLGG 18 (39)
T ss_dssp HHHHHHHHHHHHH
T ss_pred hhHHHHHHHHHHh
Confidence 6899999986543
No 21
>PF13709 DUF4159: Domain of unknown function (DUF4159)
Probab=34.98 E-value=34 Score=25.16 Aligned_cols=14 Identities=29% Similarity=0.517 Sum_probs=12.1
Q ss_pred ChhHHHHHHHHHHH
Q psy16824 28 SQNEADLMKNYLTQ 41 (85)
Q Consensus 28 ~~~E~d~lr~Yl~Q 41 (85)
+.+|.+.||.||++
T Consensus 68 s~~e~~~Lr~Yl~~ 81 (207)
T PF13709_consen 68 SDEEIANLRRYLEN 81 (207)
T ss_pred CHHHHHHHHHHHHc
Confidence 67899999999865
No 22
>PF11300 DUF3102: Protein of unknown function (DUF3102); InterPro: IPR021451 This entry is represented by Streptococcus phage 7201, Orf2. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.
Probab=34.87 E-value=60 Score=22.94 Aligned_cols=21 Identities=33% Similarity=0.294 Sum_probs=17.0
Q ss_pred HHHHHHHHHHHHH-------HHHHHHHH
Q psy16824 32 ADLMKNYLTQVRQ-------ETGLRVCE 52 (85)
Q Consensus 32 ~d~lr~Yl~QlRq-------E~~~RL~e 52 (85)
+..++.|.+|.++ |+|.||++
T Consensus 7 a~EIn~~~~q~~~~~l~~~iEIGrrL~e 34 (130)
T PF11300_consen 7 AAEINAIKRQTGQILLESAIEIGRRLKE 34 (130)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3457788888887 89999998
No 23
>COG4114 FhuF Uncharacterized Fe-S protein [General function prediction only]
Probab=34.82 E-value=40 Score=26.60 Aligned_cols=41 Identities=27% Similarity=0.487 Sum_probs=31.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH-----hc--CCCCCCCChhhhhhccc
Q psy16824 32 ADLMKNYLTQVRQETGLRVCER-----VF--NTPDGKPSKWWLCFSKK 72 (85)
Q Consensus 32 ~d~lr~Yl~QlRqE~~~RL~e~-----vy--~~~~~~psKwWm~F~KR 72 (85)
.-.|-=||+.+||++|.++++- +| -=.+|.+|-+|.|...|
T Consensus 176 gyli~wyl~e~k~~lg~~~~~s~r~~lf~e~ll~nG~dnPl~rtv~~r 223 (251)
T COG4114 176 GYLINWYLTEMKQLLGEDLVESLRHALFFEKLLPNGQDNPLWRTVVLR 223 (251)
T ss_pred HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHhccCCCCCcHHHHHHHh
Confidence 3456779999999999999872 22 11458899999998877
No 24
>COG5500 Predicted integral membrane protein [Function unknown]
Probab=33.80 E-value=29 Score=25.50 Aligned_cols=26 Identities=42% Similarity=0.636 Sum_probs=20.3
Q ss_pred CCCccccc--CCCChhHHHHHHHHHHHH
Q psy16824 17 FPLNSVYA--KPQSQNEADLMKNYLTQV 42 (85)
Q Consensus 17 FpLn~~y~--~P~~~~E~d~lr~Yl~Ql 42 (85)
-|||.-.+ .|++.+-++.|.+||+|-
T Consensus 108 VPLNdALAaa~Pas~~ga~lW~~yLt~W 135 (159)
T COG5500 108 VPLNDALAAANPASTDGAKLWTSYLTNW 135 (159)
T ss_pred ccchHHHhhcCCCcccHHHHHHHHHHhH
Confidence 47776543 378899999999999973
No 25
>PF00140 Sigma70_r1_2: Sigma-70 factor, region 1.2; InterPro: IPR009042 The bacterial core RNA polymerase complex, which consists of five subunits, is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme []. RNA polymerase recruits alternative sigma factors as a means of switching on specific regulons. Most bacteria express a multiplicity of sigma factors. Two of these factors, sigma-70 (gene rpoD), generally known as the major or primary sigma factor, and sigma-54 (gene rpoN or ntrA) direct the transcription of a wide variety of genes. The other sigma factors, known as alternative sigma factors, are required for the transcription of specific subsets of genes. With regard to sequence similarity, sigma factors can be grouped into two classes, the sigma-54 and sigma-70 families. Sequence alignments of the sigma70 family members reveal four conserved regions that can be further divided into subregions eg. sub-region 2.2, which may be involved in the binding of the sigma factor to the core RNA polymerase; and sub-region 4.2, which seems to harbor a DNA-binding 'helix-turn-helix' motif involved in binding the conserved -35 region of promoters recognised by the major sigma factors [, ]. ; GO: 0003677 DNA binding, 0003700 sequence-specific DNA binding transcription factor activity, 0016987 sigma factor activity, 0006352 transcription initiation, DNA-dependent, 0006355 regulation of transcription, DNA-dependent; PDB: 1SMY_F 1IW7_P 1SIG_A 3IYD_F 2BE5_F 2A6E_F 2CW0_F 2A69_P 2A6H_P 3DXJ_P ....
Probab=33.67 E-value=48 Score=18.13 Aligned_cols=13 Identities=23% Similarity=0.600 Sum_probs=10.7
Q ss_pred HHHHHHHHHHHHH
Q psy16824 32 ADLMKNYLTQVRQ 44 (85)
Q Consensus 32 ~d~lr~Yl~QlRq 44 (85)
.|.++.||++++.
T Consensus 1 ~D~l~~Yl~ei~~ 13 (37)
T PF00140_consen 1 SDSLRLYLKEIGR 13 (37)
T ss_dssp HHHHHHHHHHHHH
T ss_pred CcHHHHHHHHHcC
Confidence 3789999999974
No 26
>PF13006 Nterm_IS4: Insertion element 4 transposase N-terminal; InterPro: IPR024473 This entry represents the N-terminal domain of Insertion Element 4 transposases [].
Probab=33.56 E-value=39 Score=22.76 Aligned_cols=21 Identities=29% Similarity=0.378 Sum_probs=18.7
Q ss_pred HHHHHHHHHHHHHHHHHHhcC
Q psy16824 36 KNYLTQVRQETGLRVCERVFN 56 (85)
Q Consensus 36 r~Yl~QlRqE~~~RL~e~vy~ 56 (85)
++-++|.||-||..-++.+|.
T Consensus 77 ~SaitqARqRLG~~pl~~LF~ 97 (98)
T PF13006_consen 77 PSAITQARQRLGAEPLRWLFE 97 (98)
T ss_pred hHHHHHHHHHcCHHHHHHHhc
Confidence 567899999999999999994
No 27
>PRK15202 type III secretion chaperone protein SigE; Provisional
Probab=33.44 E-value=47 Score=23.47 Aligned_cols=34 Identities=24% Similarity=0.442 Sum_probs=25.5
Q ss_pred CCCcccccCCCChhHHHHHH------HHHHHHHHHHHHHH
Q psy16824 17 FPLNSVYAKPQSQNEADLMK------NYLTQVRQETGLRV 50 (85)
Q Consensus 17 FpLn~~y~~P~~~~E~d~lr------~Yl~QlRqE~~~RL 50 (85)
=-|-++|.-|+...|+|.+- +-.+|||||...|.
T Consensus 77 t~L~Al~rlp~~s~~ee~~~g~e~fIs~vr~L~~~~a~r~ 116 (117)
T PRK15202 77 TALVALYRLPQTSTEEEALTGFELFISNVKQLKEEYARRM 116 (117)
T ss_pred ceEEEeeecCCCCcHHHHHhhHHHHHHHHHHHHHHHhhcc
Confidence 34778899998777776543 44689999998874
No 28
>PF10372 YojJ: Bacterial membrane-spanning protein N-terminus; InterPro: IPR019457 This entry is found at the N terminus of a family of putative membrane-spanning bacterial proteins. These proteins often contain IPR003390 from INTERPRO towards the C terminus. ; PDB: 2FB5_A.
Probab=32.79 E-value=61 Score=20.76 Aligned_cols=16 Identities=31% Similarity=0.634 Sum_probs=14.4
Q ss_pred HHHHHHHHHHHHHHHH
Q psy16824 32 ADLMKNYLTQVRQETG 47 (85)
Q Consensus 32 ~d~lr~Yl~QlRqE~~ 47 (85)
+..+++||.|+.++++
T Consensus 12 K~~lk~~L~~I~~~~~ 27 (70)
T PF10372_consen 12 KEQLKQYLEQIEEEIS 27 (70)
T ss_dssp HHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHH
Confidence 4789999999999997
No 29
>PF11464 Rbsn: Rabenosyn Rab binding domain; InterPro: IPR021565 Rabenosyn-5 (Rbsn) is a multivalent effector with interacts with the Rab family.Rsbn contains distinct Rab4 and Rab5 binding sites within residues 264-500 and 627-784 respectively []. Rab proteins are GTPases involved in the regulation of all stages of membrane trafficking []. ; PDB: 1Z0K_B 1YZM_A 1Z0J_B.
Probab=31.78 E-value=58 Score=19.09 Aligned_cols=15 Identities=13% Similarity=0.589 Sum_probs=11.9
Q ss_pred HHHHHHHHHHHHHHH
Q psy16824 31 EADLMKNYLTQVRQE 45 (85)
Q Consensus 31 E~d~lr~Yl~QlRqE 45 (85)
....+++|++|.|..
T Consensus 4 Qi~~I~~~I~qAk~~ 18 (42)
T PF11464_consen 4 QINIIESYIKQAKAA 18 (42)
T ss_dssp HHHHHHHHHHHHHHT
T ss_pred HHHHHHHHHHHHHHh
Confidence 457789999998864
No 30
>PF04536 TPM: TLP18.3, Psb32 and MOLO-1 founding proteins of phosphatase; InterPro: IPR007621 This is a family of uncharacterised proteins. They are found in both eukarya and eubacteria. In eubacteria the region is towards the N-terminal of the protein and is accompanied by an N-terminal signal sequence. The C-terminal of eubacterial proteins typically contains one or more putative transmembrane regions. In eukaryotes the region is not accompanied by a signal sequence.; PDB: 3PTJ_A 3PW9_A 3PVH_A 2KPT_A 2KW7_A.
Probab=31.61 E-value=24 Score=22.29 Aligned_cols=24 Identities=25% Similarity=0.527 Sum_probs=18.8
Q ss_pred ChhHHHHHHHHHHHHHHHHHHHHH
Q psy16824 28 SQNEADLMKNYLTQVRQETGLRVC 51 (85)
Q Consensus 28 ~~~E~d~lr~Yl~QlRqE~~~RL~ 51 (85)
+++|.+.+.+.++++.+++|.+++
T Consensus 3 s~~~~~~l~~~l~~~~~~t~~~i~ 26 (119)
T PF04536_consen 3 SQEERERLNQALAKLEKKTGVQIV 26 (119)
T ss_dssp -HHHHHHHHHHHHHHHHHC--EEE
T ss_pred CHHHHHHHHHHHHHHHHhhCCEEE
Confidence 467889999999999999998774
No 31
>cd00555 Maf Nucleotide binding protein Maf. Maf has been implicated in inhibition of septum formation in eukaryotes, bacteria and archaea, but homologs in B.subtilis and S.cerevisiae are nonessential for cell division. Maf has been predicted to be a nucleotide- or nucleic acid-binding protein with structural similarity to the hypoxanthine/xanthine NTP pyrophosphatase Ham1 from Methanococcus jannaschii, RNase H from Escherichia coli, and some other nucleotide or RNA-binding proteins.
Probab=31.20 E-value=37 Score=24.47 Aligned_cols=16 Identities=31% Similarity=0.547 Sum_probs=12.7
Q ss_pred cccccCCCChhHHHHH
Q psy16824 20 NSVYAKPQSQNEADLM 35 (85)
Q Consensus 20 n~~y~~P~~~~E~d~l 35 (85)
|.++.||.|.+|+..|
T Consensus 74 g~il~KP~~~~eA~~~ 89 (180)
T cd00555 74 GRILGKPKDREEAREM 89 (180)
T ss_pred CEEEcCCCCHHHHHHH
Confidence 4689999999888543
No 32
>PF08938 HBS1_N: HBS1 N-terminus; InterPro: IPR015033 This domain is found in various eukaryotic HBS1-like proteins. ; PDB: 1UFZ_A 3IZQ_1.
Probab=30.96 E-value=8.3 Score=24.36 Aligned_cols=22 Identities=27% Similarity=0.286 Sum_probs=18.8
Q ss_pred ChhHHHHHHHHHHHHHHHHHHH
Q psy16824 28 SQNEADLMKNYLTQVRQETGLR 49 (85)
Q Consensus 28 ~~~E~d~lr~Yl~QlRqE~~~R 49 (85)
+..|.+.|++-|-|+|..||.-
T Consensus 21 s~ed~~~L~~~l~~vr~~Lg~~ 42 (79)
T PF08938_consen 21 SPEDQAQLYSCLPQVREVLGDY 42 (79)
T ss_dssp TCHHHHHHCHHCCCHHHHCCCC
T ss_pred CHHHHHHHHHHHHHHHHHHccc
Confidence 4578899999999999999853
No 33
>KOG1205|consensus
Probab=29.59 E-value=40 Score=26.46 Aligned_cols=34 Identities=21% Similarity=0.511 Sum_probs=27.7
Q ss_pred CCCCCCCCCCcccccCCCChhHHHHHHHHHHHHHHHHHH
Q psy16824 10 DLPGEAGFPLNSVYAKPQSQNEADLMKNYLTQVRQETGL 48 (85)
Q Consensus 10 ~iPGd~gFpLn~~y~~P~~~~E~d~lr~Yl~QlRqE~~~ 48 (85)
+|=|--+.|++++|.+.+ =.+.+|..-||+|+..
T Consensus 150 SiaG~~~~P~~~~Y~ASK-----~Al~~f~etLR~El~~ 183 (282)
T KOG1205|consen 150 SIAGKMPLPFRSIYSASK-----HALEGFFETLRQELIP 183 (282)
T ss_pred ccccccCCCcccccchHH-----HHHHHHHHHHHHHhhc
Confidence 455777889999998643 5699999999999976
No 34
>PF05438 TRH: Thyrotropin-releasing hormone (TRH); InterPro: IPR008857 This family consists of several thyrotropin-releasing hormone (TRH) proteins. Thyrotropin-Releasing Hormone (TRH; pyroGlu-His-Pro-NH2), originally isolated as a hypothalamic neuropeptide hormone, most likely acts also as a neuromodulator and/or neurotransmitter in the central nervous system (CNS). This interpretation is supported by the identification of a peptidase localised on the surface of neuronal cells which has been termed TRH-degrading ectoenzyme (TRH-DE) since it selectively inactivates TRH. TRH has been used clinically for the treatment of spinocerebellar degeneration and disturbance of consciousness in humans [].; GO: 0005184 neuropeptide hormone activity, 0009755 hormone-mediated signaling pathway, 0005576 extracellular region
Probab=28.78 E-value=44 Score=25.79 Aligned_cols=47 Identities=19% Similarity=0.312 Sum_probs=32.4
Q ss_pred ccccccCcCCCCCCCCCCccccc---CCCC----hhHHHHHHHHHHHHHHHHHH
Q psy16824 2 YSLALAKFDLPGEAGFPLNSVYA---KPQS----QNEADLMKNYLTQVRQETGL 48 (85)
Q Consensus 2 ~~lal~~F~iPGd~gFpLn~~y~---~P~~----~~E~d~lr~Yl~QlRqE~~~ 48 (85)
.+|++.++.|||-.+.|=-+.-. ++.+ ++|.-.||+-|+.++.|-+.
T Consensus 7 l~L~l~~~~v~gg~~~~~e~~~~~~~~~lDdlLqrAe~llLrs~L~k~~~e~~~ 60 (212)
T PF05438_consen 7 LALTLCNTGVPGGQAAQEEAETGEETPGLDDLLQRAESLLLRSILQKLQEEQGN 60 (212)
T ss_pred HHHHHhhccccCCccCccccccccCCccHHHHHHHHHHHHHHHHHHHHHhhhcc
Confidence 46778899999988877333322 1211 45556788899999988876
No 35
>cd07624 BAR_SNX7_30 The Bin/Amphiphysin/Rvs (BAR) domain of Sorting Nexins 7 and 30. BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions. Sorting nexins (SNXs) are Phox homology (PX) domain containing proteins that are involved in regulating membrane traffic and protein sorting in the endosomal system. SNXs differ from each other in their lipid-binding specificity, subcellular localization and specific function in the endocytic pathway. A subset of SNXs also contain BAR domains. The PX-BAR structural unit determines the specific membrane targeting of SNXs. This subfamily consists of SNX7, SNX30, and similar proteins. The specific functions of SNX7 and SNX30 have not been elucidated. BAR domains form dimers that bind to membranes, induce membrane bending and curvature, and may also be involved in protein-protein interactions.
Probab=28.29 E-value=97 Score=22.38 Aligned_cols=29 Identities=28% Similarity=0.418 Sum_probs=22.1
Q ss_pred CcccccCCCChhHHHHHHHHHHHHHHHHH
Q psy16824 19 LNSVYAKPQSQNEADLMKNYLTQVRQETG 47 (85)
Q Consensus 19 Ln~~y~~P~~~~E~d~lr~Yl~QlRqE~~ 47 (85)
+++.|.-..-..|=+.++.|+.+|-+-|+
T Consensus 3 ~~~~~~~~~~d~eF~e~~eyi~~L~~~l~ 31 (200)
T cd07624 3 TSTMYLLKNRSPEFDKMNEYLTLFGEKLG 31 (200)
T ss_pred ccchhhhcCCCccHHHHHHHHHHHHHHhH
Confidence 46666554445677999999999998886
No 36
>PRK06946 lipid A biosynthesis lauroyl acyltransferase; Provisional
Probab=28.06 E-value=39 Score=25.50 Aligned_cols=32 Identities=16% Similarity=0.229 Sum_probs=25.1
Q ss_pred CCCCcccccCCCChhHHHHHHHHHHHHHHHHHHHHH
Q psy16824 16 GFPLNSVYAKPQSQNEADLMKNYLTQVRQETGLRVC 51 (85)
Q Consensus 16 gFpLn~~y~~P~~~~E~d~lr~Yl~QlRqE~~~RL~ 51 (85)
++|+..||.+..|+ .+-.++.++|...|.+++
T Consensus 133 ~~~~~~vyr~~~n~----~~d~~~~~~R~~~g~~~i 164 (293)
T PRK06946 133 RRRVGSLYTPMSNP----LLDAIAKAARGRFGAEMV 164 (293)
T ss_pred cCCceEEeeCCCCH----HHHHHHHHHHHhcCCCcc
Confidence 57899999887764 566778899998876665
No 37
>PRK00234 Maf-like protein; Reviewed
Probab=27.93 E-value=47 Score=24.26 Aligned_cols=16 Identities=19% Similarity=0.308 Sum_probs=12.7
Q ss_pred cccccCCCChhHHHHH
Q psy16824 20 NSVYAKPQSQNEADLM 35 (85)
Q Consensus 20 n~~y~~P~~~~E~d~l 35 (85)
+.++.||.|.+|+..|
T Consensus 76 g~Il~KP~~~~eA~~m 91 (192)
T PRK00234 76 GQILGKPHTFERAREQ 91 (192)
T ss_pred CEECCCCCCHHHHHHH
Confidence 3689999999888544
No 38
>PF12891 Glyco_hydro_44: Glycoside hydrolase family 44; InterPro: IPR024745 This is a family of putative bacterial glycoside hydrolases.; PDB: 3IK2_A 3ZQ9_A 2YJQ_B 2YKK_A 2YIH_A 2EEX_A 2EQD_A 2E0P_A 2E4T_A 2EO7_A ....
Probab=27.90 E-value=26 Score=27.20 Aligned_cols=20 Identities=35% Similarity=0.675 Sum_probs=15.5
Q ss_pred HHHHHHHHHHHHH---HHHHHHH
Q psy16824 33 DLMKNYLTQVRQE---TGLRVCE 52 (85)
Q Consensus 33 d~lr~Yl~QlRqE---~~~RL~e 52 (85)
+.|.=||+|+|++ .|.|||+
T Consensus 206 ~fl~wyL~qm~~~~~~~G~RLLD 228 (239)
T PF12891_consen 206 DFLPWYLDQMKEAEKSTGKRLLD 228 (239)
T ss_dssp SHHHHHHHHHHHHHHHHTS-S-S
T ss_pred chHHHHHHHHHHhhhhcCceeee
Confidence 4889999999975 7899998
No 39
>KOG2940|consensus
Probab=27.70 E-value=39 Score=27.33 Aligned_cols=17 Identities=35% Similarity=0.728 Sum_probs=14.7
Q ss_pred HHHHHHHHHHHHHHhcC
Q psy16824 40 TQVRQETGLRVCERVFN 56 (85)
Q Consensus 40 ~QlRqE~~~RL~e~vy~ 56 (85)
+-|.+|.|-||.+|||+
T Consensus 52 dylkeeig~rlaDrvfD 68 (325)
T KOG2940|consen 52 DYLKEEIGDRLADRVFD 68 (325)
T ss_pred hHHHHHHHHHHHHHHHH
Confidence 44679999999999995
No 40
>PF11544 Spc42p: Spindle pole body component Spc42p; InterPro: IPR021611 Spc42p is a 42kDa component of the S.cerevisiae spindle body that localises to the electron dense central region of the SPB. Spc42p is a phosphoprotein which forms a polymeric layer at the periphery of the SPB central plaque. This functions during SPB duplication and also facilitates the attachment of the SPB to the nuclear membrane. ; PDB: 2Q6Q_B.
Probab=27.58 E-value=80 Score=20.72 Aligned_cols=19 Identities=21% Similarity=0.204 Sum_probs=16.0
Q ss_pred hhHHHHHHHHHHHHHHHHH
Q psy16824 29 QNEADLMKNYLTQVRQETG 47 (85)
Q Consensus 29 ~~E~d~lr~Yl~QlRqE~~ 47 (85)
.+|-|.+..|+.+||.+|-
T Consensus 18 ~eEI~rLn~lv~sLR~KLi 36 (76)
T PF11544_consen 18 QEEIDRLNILVGSLRGKLI 36 (76)
T ss_dssp HHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 4567999999999999874
No 41
>TIGR02208 lipid_A_msbB lipid A biosynthesis (KDO)2-(lauroyl)-lipid IVA acyltransferase. This family consists of MsbB in E. coli and closely related proteins in other species. MsbB is homologous to HtrB (TIGR02207) and acts immediately after it in the biosynthesis of KDO-2 lipid A (also called Re LPS and Re endotoxin). These two enzymes act after creation of KDO-2 lipid IV-A by addition of the KDO sugars.
Probab=27.17 E-value=30 Score=26.17 Aligned_cols=34 Identities=21% Similarity=0.419 Sum_probs=26.1
Q ss_pred CCCCCCcccccCCCChhHHHHHHHHHHHHHHHHHHHHH
Q psy16824 14 EAGFPLNSVYAKPQSQNEADLMKNYLTQVRQETGLRVC 51 (85)
Q Consensus 14 d~gFpLn~~y~~P~~~~E~d~lr~Yl~QlRqE~~~RL~ 51 (85)
.-|.|+..||.+..|+ .+..++.+.|...|.+++
T Consensus 143 ~~~~~~~~v~r~~~n~----~~d~~~~~~R~~~g~~~i 176 (305)
T TIGR02208 143 SQGLPMVTMFNNHKNP----LFDWLWNRVRSRFGGHVY 176 (305)
T ss_pred hcCCCceEEeeCCCCH----HHHHHHHHHHhcCCCcee
Confidence 3478999999988774 467778889988776655
No 42
>PRK00078 Maf-like protein; Reviewed
Probab=26.99 E-value=49 Score=24.20 Aligned_cols=16 Identities=31% Similarity=0.457 Sum_probs=12.7
Q ss_pred cccccCCCChhHHHHH
Q psy16824 20 NSVYAKPQSQNEADLM 35 (85)
Q Consensus 20 n~~y~~P~~~~E~d~l 35 (85)
+.++.||.|++|+..|
T Consensus 78 g~ilgKP~~~eeA~~~ 93 (192)
T PRK00078 78 GKVLGKPKDEEDAFEM 93 (192)
T ss_pred CEEeCCCCCHHHHHHH
Confidence 4689999999888554
No 43
>PF03221 HTH_Tnp_Tc5: Tc5 transposase DNA-binding domain; InterPro: IPR006600 This entry represents a DNA-binding helix-turn-helix domain found in the pogo family of transposable elements, the centromere protein Cenp-B, and yeast PCD2. There is extensive sequence similarity between Cenp-B and transposase proteins encoded by the pogo superfamily of transposable elements, which includes the human Tigger and Jerky elements []. The HTH domain is composed of three alpha-helices, with the second and third helices connected via a turn comprise the helix-turn-helix motif. Helix 3 is termed the recognition helix as it binds the DNA major groove, as in other HTHs []. This conserved DNA-binding domain is found in the following proteins: Cenp-B (major centromere autoantigen B or centromere protein B), which appears to organise arrays of centromere satellite DNA into a higher order structure that then direct centromere formation and kinetochore assembly in mammalian chromosomes. The N terminus of Cenp-B contains two DNA-binding HTH domains, which bind to adjacent major grooves of DNA: a psq-type HTH domain followed by a CenpB-type HTH domain, which together bind specifically to the Cenp-B box, which occurs in alpha-satellite DNA in human centromeres []. Pogo family transposable elements includes both Tigger and Jerky elements []. Pogo contains two open reading frames flanked by inverted repeats. The N-terminal region of pogo transposase contains a Cenp-B-type HTH DNA-binding domain []. Mammalian jerky protein, involved in epileptic seizures in mice []. PDC2 (Pyruvate DeCarboxylase 2), which is a transcription factor required for the synthesis of the glycolytic enzyme pyruvate decarboxylase, required for high level expression of both the THI and the PDC genes. PDC2 may be important for a high basal level of PDC gene expression or play a positive role in the autoregulation control of PDC1 and PDC5 [, ]. ; PDB: 1HLV_A 1IUF_A.
Probab=26.43 E-value=1.3e+02 Score=16.95 Aligned_cols=45 Identities=18% Similarity=0.367 Sum_probs=21.5
Q ss_pred hhHHHHHHHHHHHHH-----------HHHHHHHHHHhcCC--CCCC-CChhhhhhccch
Q psy16824 29 QNEADLMKNYLTQVR-----------QETGLRVCERVFNT--PDGK-PSKWWLCFSKKR 73 (85)
Q Consensus 29 ~~E~d~lr~Yl~QlR-----------qE~~~RL~e~vy~~--~~~~-psKwWm~F~KRk 73 (85)
++.++.|-.|+.+.+ ++.+..|.+.-... ...+ .+.|+-.|-||-
T Consensus 2 ~~~E~~L~~wi~~~~~~g~~vt~~~i~~~A~~i~~~~~~~~~~~~~~s~~W~~~F~~Rh 60 (66)
T PF03221_consen 2 PELEKALVEWIKRMRRKGFPVTREMIREKAKEIAELAKSPGPPEFKASKGWLDRFKKRH 60 (66)
T ss_dssp HHHHHHHHHHHHHHCGCT---SCHHHHHHHHHHHH-SCCCT-TT-S--CHHHHHHHHHT
T ss_pred HHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhcccccCcCCcccHHHHHHHHHc
Confidence 345566667776653 23334443221211 2223 446777787774
No 44
>PF14357 DUF4404: Domain of unknown function (DUF4404)
Probab=26.14 E-value=50 Score=21.36 Aligned_cols=13 Identities=23% Similarity=0.442 Sum_probs=10.7
Q ss_pred HHHHHHHHHHHHH
Q psy16824 35 MKNYLTQVRQETG 47 (85)
Q Consensus 35 lr~Yl~QlRqE~~ 47 (85)
+++.|.+||+||.
T Consensus 2 L~~~L~~L~~eL~ 14 (85)
T PF14357_consen 2 LQELLEKLHQELE 14 (85)
T ss_pred HHHHHHHHHHHHh
Confidence 6788888888887
No 45
>PF09396 Thrombin_light: Thrombin light chain; InterPro: IPR018992 Thrombin is a Na+-activated, allosteric serine protease that functions in blood homeostasis, inflammation and wound healing []. Thrombin (or coagulation factor II) is an enzyme that cleaves bonds after Arg and Lys, converts fibrinogen to fibrin and activates factors V, VII, VIII, and (in complex with thrombomodulin) protein C []. Sodium binding is the major driving force behind the procoagulant, prothrombotic and signaling functions of the enzyme, but is dispensable for cleavage of the anticoagulant protein C []. Prothrombin is activated on the surface of a phospholipid membrane where factor Xa removes the activation peptide and cleaves the remaining part into light and heavy chains. This domain corresponds to the light chain of thrombin. ; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis, 0007596 blood coagulation, 0005576 extracellular region; PDB: 3HKI_D 2OCV_A 3EDX_A 3HK6_C 2PV9_A 2PUX_A 3HK3_A 1HRT_L 1TOC_G 1ETT_L ....
Probab=25.58 E-value=73 Score=19.43 Aligned_cols=28 Identities=29% Similarity=0.629 Sum_probs=17.2
Q ss_pred CCCCCCCcccccCC--CChhHHHHHHHHHH
Q psy16824 13 GEAGFPLNSVYAKP--QSQNEADLMKNYLT 40 (85)
Q Consensus 13 Gd~gFpLn~~y~~P--~~~~E~d~lr~Yl~ 40 (85)
||+.=-|.-||.+- .|..|.|+|.+|+.
T Consensus 18 GEadCG~RPlFEkk~~~D~~E~ELleSY~~ 47 (49)
T PF09396_consen 18 GEADCGLRPLFEKKSKKDKTEKELLESYIG 47 (49)
T ss_dssp SGTTTT--TTTGGGT---TTHHHHHHHHHC
T ss_pred cccccCcchhhhhhcccccCHHHHHHHHhc
Confidence 45555555566554 67899999999974
No 46
>PRK00648 Maf-like protein; Reviewed
Probab=25.50 E-value=54 Score=23.95 Aligned_cols=16 Identities=38% Similarity=0.509 Sum_probs=12.6
Q ss_pred cccccCCCChhHHHHH
Q psy16824 20 NSVYAKPQSQNEADLM 35 (85)
Q Consensus 20 n~~y~~P~~~~E~d~l 35 (85)
+.++.||.|.+|+..|
T Consensus 79 g~ilgKP~~~~eA~~~ 94 (191)
T PRK00648 79 GKVLGKPKDEEEAVEM 94 (191)
T ss_pred CEEeCCCCCHHHHHHH
Confidence 3689999999888544
No 47
>PF11035 SnAPC_2_like: Small nuclear RNA activating complex subunit 2-like; InterPro: IPR021281 This family of proteins is SnAPC subunit 2-like. SnAPC allows the transcription of human small nuclear RNA genes to occur by recognition of the proximal sequence element [].
Probab=25.38 E-value=1.2e+02 Score=24.95 Aligned_cols=43 Identities=19% Similarity=0.469 Sum_probs=32.1
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHhcCC----------CCCCCChhhhhhccc
Q psy16824 29 QNEADLMKNYLTQVRQETGLRVCERVFNT----------PDGKPSKWWLCFSKK 72 (85)
Q Consensus 29 ~~E~d~lr~Yl~QlRqE~~~RL~e~vy~~----------~~~~psKwWm~F~KR 72 (85)
+.|+| +|.||+||+.-.....+.+|+.. +.-.|..-||-.+++
T Consensus 57 Rs~aE-I~~fl~~LK~rvareaiqkv~~~g~~~~R~~e~q~paPIEvW~dla~k 109 (344)
T PF11035_consen 57 RSEAE-IRDFLQQLKGRVAREAIQKVHPGGLKGPRRREAQPPAPIEVWMDLAEK 109 (344)
T ss_pred cCHHH-HHHHHHHHHHHHHHHHHHHhcccccccccccccCCCccHHHHHHHHHH
Confidence 45544 89999999999999999999842 123477788876653
No 48
>cd00068 GGL G protein gamma subunit-like motifs, the alpha-helical G-gamma chain dimerizes with the G-beta propeller subunit as part of the heterotrimeric G-protein complex; involved in signal transduction via G-protein-coupled receptors
Probab=25.37 E-value=58 Score=19.46 Aligned_cols=17 Identities=29% Similarity=0.403 Sum_probs=13.6
Q ss_pred HHHHHHHHHHHHHHHHH
Q psy16824 33 DLMKNYLTQVRQETGLR 49 (85)
Q Consensus 33 d~lr~Yl~QlRqE~~~R 49 (85)
+.++.=+.|||+|++..
T Consensus 2 ~~~~~~veqLr~el~~~ 18 (57)
T cd00068 2 DQLKKEVEQLRKELSRE 18 (57)
T ss_pred HHHHHHHHHHHHHHCCc
Confidence 56788889999999843
No 49
>PRK00884 Maf-like protein; Reviewed
Probab=25.35 E-value=54 Score=24.12 Aligned_cols=16 Identities=19% Similarity=0.343 Sum_probs=12.8
Q ss_pred cccccCCCChhHHHHH
Q psy16824 20 NSVYAKPQSQNEADLM 35 (85)
Q Consensus 20 n~~y~~P~~~~E~d~l 35 (85)
|.++.||.|.+|+..|
T Consensus 76 g~ilgKP~~~eeA~~~ 91 (194)
T PRK00884 76 GEITGKPLTEENARAQ 91 (194)
T ss_pred CEEecCCCCHHHHHHH
Confidence 3689999999888554
No 50
>PF09509 Hypoth_Ymh: Protein of unknown function (Hypoth_ymh); InterPro: IPR012654 This entry consists of a relatively rare prokaryotic protein family (about 8 occurrences per 200 genomes). Genes for members of this family appear to be associated variously with phage and plasmid regions, restriction system loci, transposons, and housekeeping genes. Their function is unknown.
Probab=25.31 E-value=1.1e+02 Score=20.90 Aligned_cols=25 Identities=24% Similarity=0.578 Sum_probs=22.3
Q ss_pred HHHHHHHHHHHHHHHH------HHHHHhcCC
Q psy16824 33 DLMKNYLTQVRQETGL------RVCERVFNT 57 (85)
Q Consensus 33 d~lr~Yl~QlRqE~~~------RL~e~vy~~ 57 (85)
|..+.+-..+|+-+|. .|++.+|..
T Consensus 28 eA~K~l~~rvR~~sg~~~~dG~~L~~~aFs~ 58 (125)
T PF09509_consen 28 EAVKSLEDRVREKSGLNDLDGADLMDQAFSS 58 (125)
T ss_pred HHHHHHHHHHHHHHCCCCcchHHHHHHHhCC
Confidence 7788999999999998 999999973
No 51
>COG2207 AraC AraC-type DNA-binding domain-containing proteins [Transcription]
Probab=25.30 E-value=99 Score=18.81 Aligned_cols=19 Identities=32% Similarity=0.399 Sum_probs=16.3
Q ss_pred HHHHHHHHHHHHHHHHHHH
Q psy16824 34 LMKNYLTQVRQETGLRVCE 52 (85)
Q Consensus 34 ~lr~Yl~QlRqE~~~RL~e 52 (85)
...+|+.+.|.+.+.+|+.
T Consensus 64 s~~~~~~~~Rl~~A~~lL~ 82 (127)
T COG2207 64 SPSQYLRQLRLEEARRLLR 82 (127)
T ss_pred CHHHHHHHHHHHHHHHHHH
Confidence 3568999999999999885
No 52
>TIGR00172 maf MAF protein. This nonessential gene causes inhibition of septation when overexpressed. A member of the family is found in the Archaeon Pyrococcus horikoshii and another in the round worm Caenorhabditis elegans.
Probab=25.06 E-value=55 Score=23.77 Aligned_cols=16 Identities=31% Similarity=0.661 Sum_probs=12.8
Q ss_pred cccccCCCChhHHHHH
Q psy16824 20 NSVYAKPQSQNEADLM 35 (85)
Q Consensus 20 n~~y~~P~~~~E~d~l 35 (85)
|.++.||.|.+|+..|
T Consensus 77 g~ilgKP~~~~eA~~~ 92 (183)
T TIGR00172 77 GEIYGKPKDKEEAAEF 92 (183)
T ss_pred CEEecCCCCHHHHHHH
Confidence 4689999999888554
No 53
>PRK00032 Maf-like protein; Reviewed
Probab=24.88 E-value=56 Score=23.92 Aligned_cols=16 Identities=31% Similarity=0.505 Sum_probs=12.6
Q ss_pred cccccCCCChhHHHHH
Q psy16824 20 NSVYAKPQSQNEADLM 35 (85)
Q Consensus 20 n~~y~~P~~~~E~d~l 35 (85)
+.++.||.|++|+..|
T Consensus 77 g~IlgKP~~~eeA~~~ 92 (190)
T PRK00032 77 GEVLEKPRDAADAAAM 92 (190)
T ss_pred CEEecCCCCHHHHHHH
Confidence 3689999999888544
No 54
>smart00224 GGL G protein gamma subunit-like motifs.
Probab=24.82 E-value=59 Score=19.83 Aligned_cols=16 Identities=25% Similarity=0.410 Sum_probs=13.3
Q ss_pred HHHHHHHHHHHHHHHH
Q psy16824 33 DLMKNYLTQVRQETGL 48 (85)
Q Consensus 33 d~lr~Yl~QlRqE~~~ 48 (85)
+.+|.=+.|||+|++.
T Consensus 2 ~~~~~~ve~Lr~el~~ 17 (63)
T smart00224 2 DQLRKEVEQLRKELSR 17 (63)
T ss_pred hHHHHHHHHHHHHHCC
Confidence 5678888999999984
No 55
>PF06183 DinI: DinI-like family; InterPro: IPR010391 This family of short proteins includes DNA-damage-inducible protein I (DinI) and related proteins. The SOS response, a set of cellular phenomena exhibited by eubacteria, is initiated by various causes that include DNA damage-induced replication arrest, and is positively regulated by the co- protease activity of RecA. Escherichia coli DinI, a LexA-regulated SOS gene product, shuts off the initiation of the SOS response when overexpressed in vivo. Biochemical and genetic studies indicated that DinI physically interacts with RecA to inhibit its co-protease activity []. The structure of DinI is known [].; PDB: 1GHH_A.
Probab=24.79 E-value=83 Score=19.53 Aligned_cols=18 Identities=17% Similarity=0.211 Sum_probs=13.1
Q ss_pred HHHHHHHHHHHHHHHHhcC
Q psy16824 38 YLTQVRQETGLRVCERVFN 56 (85)
Q Consensus 38 Yl~QlRqE~~~RL~e~vy~ 56 (85)
-+..|++||..||.. +|+
T Consensus 5 a~~AL~~EL~kRl~~-~yP 22 (65)
T PF06183_consen 5 ALEALESELTKRLHR-QYP 22 (65)
T ss_dssp HHHHHHHHHHHHHHH-H-S
T ss_pred HHHHHHHHHHHHHHH-HCC
Confidence 467899999988775 554
No 56
>PRK04056 Maf-like protein; Reviewed
Probab=24.57 E-value=58 Score=23.64 Aligned_cols=16 Identities=31% Similarity=0.449 Sum_probs=12.4
Q ss_pred cccccCCCChhHHHHH
Q psy16824 20 NSVYAKPQSQNEADLM 35 (85)
Q Consensus 20 n~~y~~P~~~~E~d~l 35 (85)
+.++.||.|.+|+..|
T Consensus 75 g~ilgKP~~~~eA~~~ 90 (180)
T PRK04056 75 NKILRKAKDKEEAREM 90 (180)
T ss_pred CEEecCCCCHHHHHHH
Confidence 3689999999887543
No 57
>KOG4119|consensus
Probab=24.51 E-value=95 Score=20.04 Aligned_cols=21 Identities=24% Similarity=0.360 Sum_probs=17.2
Q ss_pred ChhHHHHHHHHHHHHHHHHHH
Q psy16824 28 SQNEADLMKNYLTQVRQETGL 48 (85)
Q Consensus 28 ~~~E~d~lr~Yl~QlRqE~~~ 48 (85)
+.++.+.+|.-.+|||.|++.
T Consensus 5 ~~~~~~q~k~~VeqLk~e~~~ 25 (71)
T KOG4119|consen 5 SNSKKPQMKKEVEQLKLEANI 25 (71)
T ss_pred cccchHHHHHHHHHHHHHHHh
Confidence 345678899999999999974
No 58
>COG1694 MazG Predicted pyrophosphatase [General function prediction only]
Probab=24.47 E-value=1.1e+02 Score=19.60 Aligned_cols=23 Identities=17% Similarity=0.443 Sum_probs=17.1
Q ss_pred HHHHHH--HHHHHHHHHHHHHHHhc
Q psy16824 33 DLMKNY--LTQVRQETGLRVCERVF 55 (85)
Q Consensus 33 d~lr~Y--l~QlRqE~~~RL~e~vy 55 (85)
|.++.+ +..+++|||..|...++
T Consensus 44 eai~~~~~~~~l~eELgDvL~~v~~ 68 (102)
T COG1694 44 EAIRKEEDLEDLKEELGDVLADVLF 68 (102)
T ss_pred HHHHhcCcHHHHHHHHHHHHHHHHH
Confidence 444444 68899999998888665
No 59
>PRK14364 Maf-like protein; Provisional
Probab=24.40 E-value=58 Score=23.62 Aligned_cols=16 Identities=25% Similarity=0.343 Sum_probs=12.4
Q ss_pred cccccCCCChhHHHHH
Q psy16824 20 NSVYAKPQSQNEADLM 35 (85)
Q Consensus 20 n~~y~~P~~~~E~d~l 35 (85)
+.++.||.|.+|+..|
T Consensus 71 g~ilgKP~~~eeA~~~ 86 (181)
T PRK14364 71 GQIIGKPDSKQHAFDI 86 (181)
T ss_pred CEEecCCCCHHHHHHH
Confidence 3589999999888543
No 60
>PRK00148 Maf-like protein; Reviewed
Probab=23.96 E-value=60 Score=23.81 Aligned_cols=16 Identities=25% Similarity=0.302 Sum_probs=12.5
Q ss_pred cccccCCCChhHHHHH
Q psy16824 20 NSVYAKPQSQNEADLM 35 (85)
Q Consensus 20 n~~y~~P~~~~E~d~l 35 (85)
|.++.||.|.+|+..|
T Consensus 75 g~Il~KP~~~eeA~~~ 90 (194)
T PRK00148 75 GRLLGKPHTPEEAIER 90 (194)
T ss_pred CEEecCCCCHHHHHHH
Confidence 3689999999888543
No 61
>PRK01441 Maf-like protein; Reviewed
Probab=23.96 E-value=61 Score=24.02 Aligned_cols=16 Identities=19% Similarity=0.237 Sum_probs=12.7
Q ss_pred cccccCCCChhHHHHH
Q psy16824 20 NSVYAKPQSQNEADLM 35 (85)
Q Consensus 20 n~~y~~P~~~~E~d~l 35 (85)
+.++.||.|++|+..|
T Consensus 86 g~il~KP~~~~eA~~~ 101 (207)
T PRK01441 86 RRILPKAELVDEASQC 101 (207)
T ss_pred CEEcCCCCCHHHHHHH
Confidence 3689999999888554
No 62
>PRK08905 lipid A biosynthesis lauroyl acyltransferase; Validated
Probab=23.90 E-value=37 Score=25.58 Aligned_cols=31 Identities=19% Similarity=0.532 Sum_probs=25.0
Q ss_pred CCCcccccCCCChhHHHHHHHHHHHHHHHHHHHHH
Q psy16824 17 FPLNSVYAKPQSQNEADLMKNYLTQVRQETGLRVC 51 (85)
Q Consensus 17 FpLn~~y~~P~~~~E~d~lr~Yl~QlRqE~~~RL~ 51 (85)
+|+..+|.+..| ..+..++.++|++.|.+++
T Consensus 124 ~~~~~v~r~~~n----~~~~~~~~~~R~~~g~~~i 154 (289)
T PRK08905 124 FPLTAMFRPPRK----AALRPLMEAGRARGNMRTA 154 (289)
T ss_pred CCceEEEECCCC----HHHHHHHHHHhcccCCcee
Confidence 799999987665 4577788999999887776
No 63
>PF09720 Unstab_antitox: Putative addiction module component; InterPro: IPR013406 This entry defines several short bacterial proteins, typically about 75 amino acids long, which are always found as part of a pair (at least) of small genes. The other protein in the pair always belongs to a family of plasmid stabilisation proteins (IPR007712 from INTERPRO). It is likely that this protein and its partner comprise some form of addiction module - a pair of genes consisting of a stable toxin and an unstable antitoxin which mediate programmed cell death [] - although these gene pairs are usually found on the bacterial main chromosome.
Probab=23.18 E-value=92 Score=17.90 Aligned_cols=31 Identities=19% Similarity=0.431 Sum_probs=23.5
Q ss_pred HHHHHHHHHHHhcCCCCCCCChhhhhhccchh
Q psy16824 43 RQETGLRVCERVFNTPDGKPSKWWLCFSKKRF 74 (85)
Q Consensus 43 RqE~~~RL~e~vy~~~~~~psKwWm~F~KRkF 74 (85)
|.+|...|++.+=.+ +..-+.||..-..|++
T Consensus 7 r~~L~e~L~~sl~~~-~~~~~~~w~~el~rR~ 37 (54)
T PF09720_consen 7 RAELAEELWDSLDDP-DSEVEAWWKEELERRL 37 (54)
T ss_pred HHHHHHHHHHHhccc-cccCcHHHHHHHHHHH
Confidence 667777888776654 3367899999988886
No 64
>PF12019 GspH: Type II transport protein GspH; InterPro: IPR022346 GspH is involved in bacterial type II export systems []. Like all pilins, GspH has an N terminus alpha helix []. This helix is followed by nine beta strands forming two beta sheets, one of five antiparallel strands and one of four antiparallel strands []. GspH is a minor pseudopilin; it is expressed much less than other pseudopilins in the type II secretion pilus (major pilins) []. The function and localisation of minor pseudo-pilins are still to be fully unraveled []. It has been suggested that some minor pseudopilins may assemble either into the base or the tip of pili, or both. They function as initiators or regulators of pilus biogenesis and dynamics, and/or as adaptors between various pseudopilin component and other members of the T2SS []. ; GO: 0008565 protein transporter activity, 0015628 protein secretion by the type II secretion system, 0015627 type II protein secretion system complex; PDB: 2QV8_B 2KNQ_A.
Probab=23.13 E-value=1.3e+02 Score=17.96 Aligned_cols=22 Identities=23% Similarity=0.092 Sum_probs=16.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH
Q psy16824 31 EADLMKNYLTQVRQETGLRVCE 52 (85)
Q Consensus 31 E~d~lr~Yl~QlRqE~~~RL~e 52 (85)
|++.|.+.|++.|.|.-.|=-.
T Consensus 1 aA~~l~~~l~~Ar~eAi~~~~~ 22 (114)
T PF12019_consen 1 AANRLAAALQLARSEAIRRGRP 22 (114)
T ss_dssp -HHHHHHHHHHHHHHHHHHSS-
T ss_pred CHHHHHHHHHHHHHHHHHCCCE
Confidence 5788999999999998655444
No 65
>PRK14366 Maf-like protein; Provisional
Probab=23.06 E-value=64 Score=23.83 Aligned_cols=16 Identities=13% Similarity=0.316 Sum_probs=12.9
Q ss_pred cccccCCCChhHHHHH
Q psy16824 20 NSVYAKPQSQNEADLM 35 (85)
Q Consensus 20 n~~y~~P~~~~E~d~l 35 (85)
+.++.||.|.+|+..|
T Consensus 80 g~ilgKP~~~eeA~~m 95 (195)
T PRK14366 80 RRILLKAETEEQAEEY 95 (195)
T ss_pred CEEecCCCCHHHHHHH
Confidence 3689999999888554
No 66
>TIGR03418 chol_sulf_TF putative choline sulfate-utilization transcription factor. Members of this protein family are transcription factors of the LysR family. Their genes typically are divergently transcribed from choline-sulfatase genes. That enzyme makes choline, a precursor to the osmoprotectant glycine-betaine, available by hydrolysis of choline sulfate.
Probab=23.02 E-value=71 Score=22.87 Aligned_cols=21 Identities=19% Similarity=0.311 Sum_probs=17.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHh
Q psy16824 34 LMKNYLTQVRQETGLRVCERV 54 (85)
Q Consensus 34 ~lr~Yl~QlRqE~~~RL~e~v 54 (85)
.+-..|++|..|+|.+|.+|-
T Consensus 31 avS~~Ik~LE~~lg~~LF~R~ 51 (291)
T TIGR03418 31 AVSQQVKRLEEELGTPLFERG 51 (291)
T ss_pred HHHHHHHHHHHHhCcHHhhcC
Confidence 377889999999999999873
No 67
>PRK11716 DNA-binding transcriptional regulator IlvY; Provisional
Probab=22.54 E-value=32 Score=23.87 Aligned_cols=20 Identities=15% Similarity=0.285 Sum_probs=17.0
Q ss_pred HHHHHHHHHHHHHHHHHHHh
Q psy16824 35 MKNYLTQVRQETGLRVCERV 54 (85)
Q Consensus 35 lr~Yl~QlRqE~~~RL~e~v 54 (85)
+-+.|++|++|+|..|.+|.
T Consensus 8 vS~~I~~LE~~lg~~Lf~R~ 27 (269)
T PRK11716 8 LSRQIQRLEEELGQPLFVRD 27 (269)
T ss_pred HHHHHHHHHHHhCCeeEEec
Confidence 77889999999998888763
No 68
>PRK10219 DNA-binding transcriptional regulator SoxS; Provisional
Probab=22.47 E-value=1.2e+02 Score=19.08 Aligned_cols=19 Identities=16% Similarity=0.125 Sum_probs=16.3
Q ss_pred HHHHHHHHHHHHHHHHHHH
Q psy16824 34 LMKNYLTQVRQETGLRVCE 52 (85)
Q Consensus 34 ~lr~Yl~QlRqE~~~RL~e 52 (85)
...+|+.++|-|-+.+++.
T Consensus 49 s~~~~i~~~Rl~~a~~~L~ 67 (107)
T PRK10219 49 TLGDYIRQRRLLLAAVELR 67 (107)
T ss_pred CHHHHHHHHHHHHHHHHHH
Confidence 4789999999999888875
No 69
>KOG3382|consensus
Probab=22.41 E-value=1e+02 Score=22.65 Aligned_cols=26 Identities=23% Similarity=0.586 Sum_probs=19.8
Q ss_pred HHHHHHHHHHHHHHHHHHH-HHhcCCC
Q psy16824 33 DLMKNYLTQVRQETGLRVC-ERVFNTP 58 (85)
Q Consensus 33 d~lr~Yl~QlRqE~~~RL~-e~vy~~~ 58 (85)
+.|...+|++|+|=|.|-+ .+.|..+
T Consensus 16 ~~l~k~~qmire~Gglr~~~~k~yrtd 42 (151)
T KOG3382|consen 16 KGLGKFMQMIREEGGLRCLLDKLYRTD 42 (151)
T ss_pred HHHHHHHHHHHhhccHHHHHHHHHhcc
Confidence 5688899999999998854 4666443
No 70
>PF12174 RST: RCD1-SRO-TAF4 (RST) plant domain; InterPro: IPR022003 This domain is found in many plant proteins including SROs and RCD1s; it is required for interaction with multiple plant transcription factors.
Probab=22.39 E-value=1.2e+02 Score=19.18 Aligned_cols=18 Identities=17% Similarity=0.379 Sum_probs=13.5
Q ss_pred HHHHHHHHHHHHHHHHhc
Q psy16824 38 YLTQVRQETGLRVCERVF 55 (85)
Q Consensus 38 Yl~QlRqE~~~RL~e~vy 55 (85)
.++++|+..|.+|+-.+.
T Consensus 47 fvr~lR~IVGD~lL~s~I 64 (70)
T PF12174_consen 47 FVRKLRQIVGDQLLRSAI 64 (70)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 467888888888876554
No 71
>cd04789 HTH_Cfa Helix-Turn-Helix DNA binding domain of the Cfa transcription regulator. Putative helix-turn-helix (HTH) MerR-like transcription regulator; the N-terminal domain of Cfa, a cyclopropane fatty acid synthase and other related methyltransferases. Based on sequence similarity, these proteins are predicted to function as transcription regulators that mediate responses to stress in eubacteria. They belong to the MerR superfamily of transcription regulators that promote transcription of various stress regulons by reconfiguring the operator sequence located between the -35 and -10 promoter elements. A typical MerR regulator is comprised of distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their N-terminal domains are homologous and contain a DNA-binding winged HTH motif, while the C-terminal domains are often dissimilar and bind specific coactivator molecules such as metal ions, drugs, and organic substrates.
Probab=22.32 E-value=57 Score=21.12 Aligned_cols=33 Identities=18% Similarity=0.276 Sum_probs=17.5
Q ss_pred CCCCCcccccCCCChhHHHHHHHHHHHHHHHHH
Q psy16824 15 AGFPLNSVYAKPQSQNEADLMKNYLTQVRQETG 47 (85)
Q Consensus 15 ~gFpLn~~y~~P~~~~E~d~lr~Yl~QlRqE~~ 47 (85)
.||||..+-.-=......+.+.+.+.++.+++.
T Consensus 56 ~G~~l~ei~~~l~~~~~~~~l~~~~~~l~~~i~ 88 (102)
T cd04789 56 GGLSLKECLACLQGKLTRSLLLERLSSLAEQIA 88 (102)
T ss_pred CCCCHHHHHHHHcCCcHHHHHHHHHHHHHHHHH
Confidence 466665543221222334666777777766664
No 72
>PF02099 Josephin: Josephin; InterPro: IPR006155 Human genes containing triplet repeats can markedly expand in length, leading to neuropsychiatric disease. Expansion of triplet repeats explains the phenomenon of anticipation, i.e. the increasing severity or earlier age of onset in successive generations in a pedigree []. A novel gene containing CAG repeats has been identified and mapped to chromosome 14q32.1, the genetic locus for Machado-Joseph disease (MJD). Normally, the gene contains 13-36 CAG repeats, but most clinically diagnosed patients and all affected members of a family with the clinical and pathological diagnosis of MJD show expansion of the repeat number, from 68-79 []. Similar abnormalities in related genes may give rise to diseases similar to MJD. MJD is a neurodegenerative disorder characterised by cerebellar ataxia, pyramidal and extra-pyramidal signs, peripheral nerve palsy, external ophtalmoplegia, facial and lingual fasciculation and bulging. The disease is autosomal dominant, with late onset of symptoms, generally after the fourth decade.; GO: 0008242 omega peptidase activity; PDB: 3O65_G 1YZB_A 2JRI_A 2DOS_A 2AGA_A.
Probab=22.32 E-value=34 Score=24.47 Aligned_cols=28 Identities=21% Similarity=0.413 Sum_probs=20.9
Q ss_pred CCCcccccCCCChhHHHHHHHHHHHHHHH
Q psy16824 17 FPLNSVYAKPQSQNEADLMKNYLTQVRQE 45 (85)
Q Consensus 17 FpLn~~y~~P~~~~E~d~lr~Yl~QlRqE 45 (85)
|=||+....|.--. .+.++.||+|+..|
T Consensus 121 yNLDS~l~~P~~i~-~~~l~~fL~~l~~~ 148 (157)
T PF02099_consen 121 YNLDSKLKEPELIS-DFYLSAFLQQLQSE 148 (157)
T ss_dssp EEECTTTSS-EEE--HHHHHHHHHHHHCC
T ss_pred EeccCCCCCCcccC-HHHHHHHHHHHHhC
Confidence 55899999996554 47799999998765
No 73
>PF15601 Imm42: Immunity protein 42
Probab=22.29 E-value=1e+02 Score=21.97 Aligned_cols=40 Identities=23% Similarity=0.266 Sum_probs=33.4
Q ss_pred CCC--CcccccCCCChhHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy16824 16 GFP--LNSVYAKPQSQNEADLMKNYLTQVRQETGLRVCERVF 55 (85)
Q Consensus 16 gFp--Ln~~y~~P~~~~E~d~lr~Yl~QlRqE~~~RL~e~vy 55 (85)
-|| +|.+|..==+.+|.+...+=|.|+|+|+...--+.|-
T Consensus 37 kfP~Lm~~LY~g~L~~~~~~~A~~eL~~I~~~l~~~~p~~Vi 78 (134)
T PF15601_consen 37 KFPLLMNELYRGYLRYEELEKALKELEEIRKELKKFPPSEVI 78 (134)
T ss_pred cchHHHHHHHcCCCCHHHHHHHHHHHHHHHHHHhcCChhhhe
Confidence 477 7888988778899999999999999999877666555
No 74
>PRK14363 Maf-like protein; Provisional
Probab=22.07 E-value=70 Score=23.80 Aligned_cols=16 Identities=38% Similarity=0.528 Sum_probs=12.8
Q ss_pred cccccCCCChhHHHHH
Q psy16824 20 NSVYAKPQSQNEADLM 35 (85)
Q Consensus 20 n~~y~~P~~~~E~d~l 35 (85)
+.++.||.|.+|+..|
T Consensus 75 g~IlgKP~~~eeA~~~ 90 (204)
T PRK14363 75 GNILGKPESLEEAKGM 90 (204)
T ss_pred CEEcCCCCCHHHHHHH
Confidence 4689999999888554
No 75
>PF10229 DUF2246: Uncharacterized conserved protein (DUF2246); InterPro: IPR019362 This entry represents conserved proteins found in the metazoa but absent from fungi. They are all approximately 300 residues in length and have no known function.
Probab=21.91 E-value=45 Score=26.26 Aligned_cols=11 Identities=55% Similarity=1.244 Sum_probs=9.9
Q ss_pred CcCCCCCCCCC
Q psy16824 8 KFDLPGEAGFP 18 (85)
Q Consensus 8 ~F~iPGd~gFp 18 (85)
+|++||+.||.
T Consensus 51 rf~lpGnvg~~ 61 (278)
T PF10229_consen 51 RFPLPGNVGFD 61 (278)
T ss_pred CCCCCCcccch
Confidence 69999999986
No 76
>PRK11546 zraP zinc resistance protein; Provisional
Probab=21.89 E-value=1.3e+02 Score=21.59 Aligned_cols=29 Identities=10% Similarity=0.224 Sum_probs=16.2
Q ss_pred CcccccCCCChhH-HHHHHHHHHHHHHHHH
Q psy16824 19 LNSVYAKPQSQNE-ADLMKNYLTQVRQETG 47 (85)
Q Consensus 19 Ln~~y~~P~~~~E-~d~lr~Yl~QlRqE~~ 47 (85)
||+++..|....+ ...+..=|.+||+.|.
T Consensus 77 LnALl~~~~pD~~kI~aL~kEI~~Lr~kL~ 106 (143)
T PRK11546 77 YNALLTANPPDSSKINAVAKEMENLRQSLD 106 (143)
T ss_pred HHHHHcCCCCCHHHHHHHHHHHHHHHHHHH
Confidence 6777777654444 3555555555555444
No 77
>PRK06012 flhA flagellar biosynthesis protein FlhA; Validated
Probab=21.84 E-value=1.7e+02 Score=25.95 Aligned_cols=45 Identities=31% Similarity=0.548 Sum_probs=32.1
Q ss_pred cCCCC----CCCCCCcccccCCCChhHHH-------------------HHHHHHHHH--HHHHHHHHHHHh
Q psy16824 9 FDLPG----EAGFPLNSVYAKPQSQNEAD-------------------LMKNYLTQV--RQETGLRVCERV 54 (85)
Q Consensus 9 F~iPG----d~gFpLn~~y~~P~~~~E~d-------------------~lr~Yl~Ql--RqE~~~RL~e~v 54 (85)
-.+|| ||.|=+.+++-.|..+++++ .+|.|+..+ |||+ .+|++++
T Consensus 447 ~~~~G~~~~eP~fg~~a~WI~~~~~~~a~~~Gytvvdp~svi~THLsevir~~a~ellgrQEv-q~LLD~L 516 (697)
T PRK06012 447 GELPGIPTKEPAFGLPAVWIDEALREQAQLLGYTVVDPSTVVATHLTEVIKNHAAELLGRQEV-QQLLDRL 516 (697)
T ss_pred cCCCCccccCCCCCCCeEEeChhhHHHHHHcCCeeecHHHHHHHHHHHHHHHhHHHHhCHHHH-HHHHHHH
Confidence 45777 77899999999888777766 566676666 7777 5555543
No 78
>PRK13348 chromosome replication initiation inhibitor protein; Provisional
Probab=21.81 E-value=42 Score=24.14 Aligned_cols=20 Identities=10% Similarity=0.204 Sum_probs=17.3
Q ss_pred HHHHHHHHHHHHHHHHHHHH
Q psy16824 34 LMKNYLTQVRQETGLRVCER 53 (85)
Q Consensus 34 ~lr~Yl~QlRqE~~~RL~e~ 53 (85)
.+-+.|++|++|+|.+|.+|
T Consensus 32 avS~~i~~LE~~lg~~Lf~R 51 (294)
T PRK13348 32 AVSQRIKALEESLGQPLLVR 51 (294)
T ss_pred HHHHHHHHHHHHhCceeeec
Confidence 37788999999999999887
No 79
>PF02545 Maf: Maf-like protein; InterPro: IPR003697 Maf is a putative inhibitor of septum formation in eukaryotes, bacteria, and archaea. The Maf protein shares substantial amino acid sequence identity with the Escherichia coli OrfE protein [].; GO: 0005737 cytoplasm; PDB: 2P5X_A 1EXC_B 1EX2_A.
Probab=21.81 E-value=64 Score=23.57 Aligned_cols=16 Identities=31% Similarity=0.559 Sum_probs=10.4
Q ss_pred cccccCCCChhHHHHH
Q psy16824 20 NSVYAKPQSQNEADLM 35 (85)
Q Consensus 20 n~~y~~P~~~~E~d~l 35 (85)
+.++.||.|.+|+..|
T Consensus 78 g~Il~KP~~~eeA~~~ 93 (195)
T PF02545_consen 78 GEILGKPKDEEEAREM 93 (195)
T ss_dssp TEEE-S-SSHHHHHHH
T ss_pred eEEEeCCCCHHHHHHH
Confidence 4689999999888443
No 80
>PF03190 Thioredox_DsbH: Protein of unknown function, DUF255; InterPro: IPR004879 This is a group of uncharacterised proteins.; PDB: 3IRA_A.
Probab=21.76 E-value=55 Score=23.82 Aligned_cols=32 Identities=22% Similarity=0.487 Sum_probs=15.1
Q ss_pred CCCCCCCcc------------cccCCCChhHHHHHHHHHHHHHH
Q psy16824 13 GEAGFPLNS------------VYAKPQSQNEADLMKNYLTQVRQ 44 (85)
Q Consensus 13 Gd~gFpLn~------------~y~~P~~~~E~d~lr~Yl~QlRq 44 (85)
|..|+||+- -|-+|.++.....+.+.|.++++
T Consensus 99 ~~gGwPl~vfltPdg~p~~~~tY~P~~~~~g~~~f~~~l~~i~~ 142 (163)
T PF03190_consen 99 GSGGWPLTVFLTPDGKPFFGGTYFPPEDRYGRPGFLQLLERIAE 142 (163)
T ss_dssp S---SSEEEEE-TTS-EEEEESS--SS-BTTB--HHHHHHHHHH
T ss_pred CCCCCCceEEECCCCCeeeeeeecCCCCCCCCccHHHHHHHHHH
Confidence 667888874 45666655555566666666653
No 81
>PF01231 IDO: Indoleamine 2,3-dioxygenase; InterPro: IPR000898 Indoleamine 2,3-dioxgyenase (IDO, 1.13.11.42 from EC) [] is a cytosolic haem protein which, together with the hepatic enzyme tryptophan 2,3-dioxygenase, catalyzes the conversion of tryptophan and other indole derivatives to kynurenines. The physiological role of IDO is not fully understood but is of great interest, because IDO is widely distributed in human tissues, can be up-regulated via cytokines such as interferon-gamma, and can thereby modulate the levels of tryptophan, which is vital for cell growth. The degradative action of IDO on tryptophan leads to cell death by starvation of this essential and relatively scarce amino acid. IDO is a haem-containing enzyme of about 400 amino acids. Site-directed mutagenesis showed His346 (P14902 from SWISSPROT) to be essential for haem binding, indicating that this histidine residue may be the proximal ligand. Mutation of Asp274 also compromised the ability of IDO to bind haem, suggesting that Asp274 may coordinate to haem directly as the distal ligand or is essential in maintaining the conformation of the haem pocket []. Other proteins that are evolutionarily related to IDO include yeast hypothetical protein YJR078w; and myoglobin from the red muscle of the archaeogastropodic molluscs, Nordotis madaka (Giant abalone) and Sulculus diversicolor [, ]. These unusual globins lack enzymatic activity but have kept the haem group.; GO: 0020037 heme binding; PDB: 2D0U_A 2D0T_A.
Probab=21.76 E-value=74 Score=25.97 Aligned_cols=18 Identities=28% Similarity=0.303 Sum_probs=13.8
Q ss_pred HHHHHHHHHHHHHHHHHH
Q psy16824 34 LMKNYLTQVRQETGLRVC 51 (85)
Q Consensus 34 ~lr~Yl~QlRqE~~~RL~ 51 (85)
-+-.||+|+|.|+...++
T Consensus 405 ~l~~fLK~~Rd~T~~a~~ 422 (422)
T PF01231_consen 405 DLMPFLKQCRDETKEALI 422 (422)
T ss_dssp HHHHHHHHHHHHHHHT--
T ss_pred hHHHHHHHHHHHHHhhcC
Confidence 467899999999987654
No 82
>PRK06628 lipid A biosynthesis lauroyl acyltransferase; Provisional
Probab=21.75 E-value=59 Score=24.47 Aligned_cols=32 Identities=16% Similarity=0.262 Sum_probs=25.1
Q ss_pred CCCCcccccCCCChhHHHHHHHHHHHHHHHHHHHHH
Q psy16824 16 GFPLNSVYAKPQSQNEADLMKNYLTQVRQETGLRVC 51 (85)
Q Consensus 16 gFpLn~~y~~P~~~~E~d~lr~Yl~QlRqE~~~RL~ 51 (85)
|.|+..||.+..| ..+..++.+.|...|.+++
T Consensus 137 ~~~~~~vyr~~~n----~~~d~~~~~~R~~~g~~~i 168 (290)
T PRK06628 137 YPKVAVIYRKANN----PYVNKLVNESRAGDKLRLI 168 (290)
T ss_pred CCCeeEEEecCCC----HHHHHHHHHHHHhcCCcee
Confidence 5589999988766 4577778889998887776
No 83
>COG4047 Uncharacterized protein conserved in archaea [Function unknown]
Probab=21.58 E-value=70 Score=25.11 Aligned_cols=13 Identities=46% Similarity=1.168 Sum_probs=11.3
Q ss_pred CCCCChhhhhhcc
Q psy16824 59 DGKPSKWWLCFSK 71 (85)
Q Consensus 59 ~~~psKwWm~F~K 71 (85)
.++.-.||.||+|
T Consensus 56 tgkGEewW~eF~k 68 (243)
T COG4047 56 TGKGEEWWWEFAK 68 (243)
T ss_pred ccCcHHHHHHHHH
Confidence 3788899999997
No 84
>PLN03143 nudix hydrolase; Provisional
Probab=21.21 E-value=89 Score=24.61 Aligned_cols=42 Identities=29% Similarity=0.552 Sum_probs=29.4
Q ss_pred cCcCCCCCCCCCCcccccCCCChhHH------HHHHHHHHHHHHHHHH
Q psy16824 7 AKFDLPGEAGFPLNSVYAKPQSQNEA------DLMKNYLTQVRQETGL 48 (85)
Q Consensus 7 ~~F~iPGd~gFpLn~~y~~P~~~~E~------d~lr~Yl~QlRqE~~~ 48 (85)
..|.||+.++-|+.-..++--+.++- ..+++||..+.++++.
T Consensus 32 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~ 79 (291)
T PLN03143 32 HSITLPGQPGQPVLVVAAPGISSSDFRKAIDSSLFRQWLKNLQSESGI 79 (291)
T ss_pred eEEEccCCCCCceeEecCCCCCHHHHHhHhcChHHHHHHHHhhhcccc
Confidence 35889998888887665443333333 3469999999998764
No 85
>COG1503 eRF1 Peptide chain release factor 1 (eRF1) [Translation, ribosomal structure and biogenesis]
Probab=20.97 E-value=63 Score=27.01 Aligned_cols=29 Identities=24% Similarity=0.436 Sum_probs=23.8
Q ss_pred CCCCCCcccccCCCChhHHHHHHHHHHHHHHHHH
Q psy16824 14 EAGFPLNSVYAKPQSQNEADLMKNYLTQVRQETG 47 (85)
Q Consensus 14 d~gFpLn~~y~~P~~~~E~d~lr~Yl~QlRqE~~ 47 (85)
-.|+-|.++|-+|.. .+-.|.+++|+|.+
T Consensus 24 g~gt~lislyIPp~~-----qisdv~~~lr~e~s 52 (411)
T COG1503 24 GRGTELISLYIPPDR-----QISDVVNRLRDEYS 52 (411)
T ss_pred CCCceEEEEecCCCC-----chHHHHHHHHhhcc
Confidence 467889999998764 38889999999987
No 86
>PF14852 Fis1_TPR_N: Fis1 N-terminal tetratricopeptide repeat; PDB: 1IYG_A 3UUX_C 1Y8M_A 2PQR_A 2PQN_A 3O48_A 1PC2_A 1NZN_A.
Probab=20.79 E-value=1e+02 Score=16.92 Aligned_cols=14 Identities=21% Similarity=0.586 Sum_probs=11.5
Q ss_pred HHHHHHHHHHhcCC
Q psy16824 44 QETGLRVCERVFNT 57 (85)
Q Consensus 44 qE~~~RL~e~vy~~ 57 (85)
+..|..|++.+|.+
T Consensus 20 ~~~Gi~lLe~l~~~ 33 (35)
T PF14852_consen 20 QQEGIALLEELYRD 33 (35)
T ss_dssp HHHHHHHHHHHCCC
T ss_pred HHHHHHHHHHHHhc
Confidence 45789999999964
No 87
>PRK13480 3'-5' exoribonuclease YhaM; Provisional
Probab=20.68 E-value=1e+02 Score=24.41 Aligned_cols=40 Identities=15% Similarity=0.371 Sum_probs=30.2
Q ss_pred CCCCccccc-CCCChhHH-HHHHHHHHHHHHHHHHHHHHHhc
Q psy16824 16 GFPLNSVYA-KPQSQNEA-DLMKNYLTQVRQETGLRVCERVF 55 (85)
Q Consensus 16 gFpLn~~y~-~P~~~~E~-d~lr~Yl~QlRqE~~~RL~e~vy 55 (85)
.+.+..+.. .|.+.+|. +.+++|+.+++.+.=.+|++.+|
T Consensus 95 ~~d~~dfvp~s~~~~e~l~~el~~~i~~i~~~~l~~l~~~~~ 136 (314)
T PRK13480 95 GVKISDFVEKAPLSKEEMQEEITQYIFEMENPNIQRITRHLL 136 (314)
T ss_pred cCCHHHcCCCCCCCHHHHHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 455555553 45566555 77899999999998899999999
No 88
>PRK14368 Maf-like protein; Provisional
Probab=20.50 E-value=76 Score=23.34 Aligned_cols=16 Identities=25% Similarity=0.470 Sum_probs=12.7
Q ss_pred cccccCCCChhHHHHH
Q psy16824 20 NSVYAKPQSQNEADLM 35 (85)
Q Consensus 20 n~~y~~P~~~~E~d~l 35 (85)
+.++.||.|.+|+..|
T Consensus 79 g~ilgKP~~~~eA~~~ 94 (193)
T PRK14368 79 GEIMGKPKDEADAVRM 94 (193)
T ss_pred CEEecCCCCHHHHHHH
Confidence 3589999999888554
No 89
>PRK14365 Maf-like protein; Provisional
Probab=20.43 E-value=78 Score=23.32 Aligned_cols=16 Identities=38% Similarity=0.532 Sum_probs=12.3
Q ss_pred cccccCCCChhHHHHH
Q psy16824 20 NSVYAKPQSQNEADLM 35 (85)
Q Consensus 20 n~~y~~P~~~~E~d~l 35 (85)
+.++.||.|.+|+..|
T Consensus 77 g~Il~KP~~~~eA~~~ 92 (197)
T PRK14365 77 GEVLGKPASPENAEEM 92 (197)
T ss_pred CEEecCCCCHHHHHHH
Confidence 3589999999888443
No 90
>PRK13456 DNA protection protein DPS; Provisional
Probab=20.41 E-value=1.5e+02 Score=22.20 Aligned_cols=43 Identities=9% Similarity=0.112 Sum_probs=28.1
Q ss_pred HHHHHHHHHHHH-HHHH--HHHHHHhcC---CCCCCCChhhhhhccchhh
Q psy16824 32 ADLMKNYLTQVR-QETG--LRVCERVFN---TPDGKPSKWWLCFSKKRFM 75 (85)
Q Consensus 32 ~d~lr~Yl~QlR-qE~~--~RL~e~vy~---~~~~~psKwWm~F~KRkFM 75 (85)
.+.+...|.+.+ +|++ .+|.+|++. +.+..|.+|| ..+++.+.
T Consensus 51 ~e~V~e~le~a~~EEl~HA~~lAeRI~qLGG~P~~~p~~~~-~ls~~~~~ 99 (186)
T PRK13456 51 GEGLKEIAEDARLEDRNHFEALVPRIYELGGKLPRDIREFH-DISACPDA 99 (186)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCChHHHh-hhhcCccc
Confidence 355678889999 7776 688889983 2333455554 55566553
No 91
>PRK13503 transcriptional activator RhaS; Provisional
Probab=20.21 E-value=1.2e+02 Score=21.67 Aligned_cols=19 Identities=16% Similarity=0.242 Sum_probs=16.7
Q ss_pred HHHHHHHHHHHHHHHHHHH
Q psy16824 34 LMKNYLTQVRQETGLRVCE 52 (85)
Q Consensus 34 ~lr~Yl~QlRqE~~~RL~e 52 (85)
...+|+++.|-+-+.|||.
T Consensus 215 S~~~yi~~~Rl~~A~~LL~ 233 (278)
T PRK13503 215 TPQRYLNRLRLLKARHLLR 233 (278)
T ss_pred CHHHHHHHHHHHHHHHHHH
Confidence 4689999999999999886
No 92
>PRK14361 Maf-like protein; Provisional
Probab=20.20 E-value=79 Score=23.09 Aligned_cols=16 Identities=25% Similarity=0.470 Sum_probs=12.5
Q ss_pred cccccCCCChhHHHHH
Q psy16824 20 NSVYAKPQSQNEADLM 35 (85)
Q Consensus 20 n~~y~~P~~~~E~d~l 35 (85)
+.++.||.|.+|+..|
T Consensus 72 g~ilgKP~~~eeA~~~ 87 (187)
T PRK14361 72 GVLLAKPADEAENEAF 87 (187)
T ss_pred CEEecCCCCHHHHHHH
Confidence 3689999999888544
No 93
>PF12785 VESA1_N: Variant erythrocyte surface antigen-1; InterPro: IPR024751 This entry represents variant erythrocyte surface antigen 1, versions a and b, of Babesia. Babesia bovis is a tick-borne, intra-erythrocytic, protozoal parasite of cattle that shares many lifestyle parallels with the most virulent of the human malarial parasites, Plasmodium falciparum. Babesia uses antigenic variation to establish consistent infections of long duration. The two variants of VESA1, a and b, are expressed from different but closely related genes, and variation is achieved through the involvement of a segmental gene conversion mechanism and low-frequency epigenetic in situ switching of transcriptional activity from the VESA1 gene-pair to a possible other gene pair [].
Probab=20.19 E-value=39 Score=28.79 Aligned_cols=30 Identities=20% Similarity=0.374 Sum_probs=20.1
Q ss_pred CcccccCCCChhHHHHHHHHHHHHHHHHHH
Q psy16824 19 LNSVYAKPQSQNEADLMKNYLTQVRQETGL 48 (85)
Q Consensus 19 Ln~~y~~P~~~~E~d~lr~Yl~QlRqE~~~ 48 (85)
||.+|.--..-.-..+.|.||.||-|-|+.
T Consensus 39 ln~lfslvqglggtavvrtyidqlaqvLsa 68 (462)
T PF12785_consen 39 LNGLFSLVQGLGGTAVVRTYIDQLAQVLSA 68 (462)
T ss_pred HHhHHHHHhccCCceeHHHHHHHHHHHHHH
Confidence 555554433333346789999999999964
No 94
>PRK01526 Maf-like protein; Reviewed
Probab=20.15 E-value=79 Score=23.47 Aligned_cols=15 Identities=13% Similarity=0.173 Sum_probs=12.1
Q ss_pred ccccCCCChhHHHHH
Q psy16824 21 SVYAKPQSQNEADLM 35 (85)
Q Consensus 21 ~~y~~P~~~~E~d~l 35 (85)
.++.||.|.+|+..|
T Consensus 85 ~IlgKP~~~~eA~~m 99 (205)
T PRK01526 85 RILPKATTYEEVKNC 99 (205)
T ss_pred EEecCCCCHHHHHHH
Confidence 589999999888544
No 95
>COG3248 Tsx Nucleoside-binding outer membrane protein [Cell envelope biogenesis, outer membrane]
Probab=20.03 E-value=20 Score=28.60 Aligned_cols=27 Identities=41% Similarity=0.410 Sum_probs=22.0
Q ss_pred CcCCCCCCCCCCcccccCC-CChhHHHHH
Q psy16824 8 KFDLPGEAGFPLNSVYAKP-QSQNEADLM 35 (85)
Q Consensus 8 ~F~iPGd~gFpLn~~y~~P-~~~~E~d~l 35 (85)
...+|+=-+|.|| +|.+| .|++|.+-+
T Consensus 153 Di~~p~f~~~~LN-~Y~kydgNr~~w~g~ 180 (284)
T COG3248 153 DIDTPLFMSLSLN-VYAKYDGNRNEWDGY 180 (284)
T ss_pred ccccCCcceEEEE-EEeccCCCccccCce
Confidence 4788999999999 99999 677776544
Done!