Query psy1421
Match_columns 89
No_of_seqs 119 out of 542
Neff 7.3
Searched_HMMs 46136
Date Fri Aug 16 22:15:16 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy1421.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/1421hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG3430|consensus 100.0 3.3E-44 7.2E-49 217.4 11.6 89 1-89 1-90 (90)
2 PTZ00059 dynein light chain; P 100.0 6.9E-44 1.5E-48 219.0 12.5 89 1-89 2-90 (90)
3 PF01221 Dynein_light: Dynein 100.0 9E-43 2E-47 213.3 10.7 89 1-89 1-89 (89)
4 PLN03058 dynein light chain ty 100.0 6.5E-41 1.4E-45 216.6 12.2 87 3-89 33-122 (128)
5 PF04155 Ground-like: Ground-l 97.8 0.00032 7E-09 41.3 8.5 54 33-87 22-76 (76)
6 PF05075 DUF684: Protein of un 89.0 5.2 0.00011 29.7 8.7 54 35-88 188-251 (345)
7 PF13742 tRNA_anti_2: OB-fold 83.7 4.9 0.00011 24.5 5.2 47 36-88 5-59 (99)
8 PF15650 Tox-REase-9: Restrict 81.3 1.4 3E-05 27.0 2.0 17 42-58 70-86 (89)
9 PF12006 DUF3500: Protein of u 79.1 17 0.00037 26.8 7.6 43 8-50 216-261 (313)
10 PF10703 MoaF: Molybdenum cofa 78.0 4.6 0.0001 29.3 4.2 32 53-87 182-214 (265)
11 PF12652 CotJB: CotJB protein; 77.6 1.9 4.1E-05 25.7 1.8 26 28-53 31-56 (78)
12 PF12362 DUF3646: DNA polymera 73.6 4.8 0.0001 25.7 3.0 29 35-63 48-76 (117)
13 PRK00286 xseA exodeoxyribonucl 69.8 17 0.00037 27.6 5.7 48 35-88 8-61 (438)
14 PF08006 DUF1700: Protein of u 62.4 24 0.00052 23.4 4.8 36 11-46 15-64 (181)
15 COG1570 XseA Exonuclease VII, 60.6 17 0.00036 28.4 4.1 50 35-88 8-61 (440)
16 TIGR02084 leud 3-isopropylmala 60.3 16 0.00035 24.4 3.6 34 34-67 25-62 (156)
17 cd03538 Rieske_RO_Alpha_AntDO 58.8 31 0.00067 22.4 4.7 43 41-88 10-53 (146)
18 PF08958 DUF1871: Domain of un 58.3 13 0.00027 22.2 2.5 34 18-52 21-54 (79)
19 PF11858 DUF3378: Domain of un 57.8 15 0.00034 21.8 2.9 20 70-89 29-48 (81)
20 cd03472 Rieske_RO_Alpha_BPDO_l 57.6 22 0.00048 22.5 3.8 38 46-88 1-39 (128)
21 PF12550 GCR1_C: Transcription 56.8 3.7 8E-05 24.2 0.1 12 44-55 29-40 (81)
22 PF10440 WIYLD: Ubiquitin-bind 56.8 19 0.00042 20.7 3.0 41 22-62 11-51 (65)
23 PF09868 DUF2095: Uncharacteri 54.4 28 0.00061 22.5 3.8 20 34-53 93-116 (128)
24 PF08776 VASP_tetra: VASP tetr 54.3 30 0.00064 18.1 3.2 32 15-48 6-37 (40)
25 PF03398 Ist1: Regulator of Vp 53.2 6.3 0.00014 26.4 0.7 42 11-54 79-120 (165)
26 TIGR00237 xseA exodeoxyribonuc 52.4 68 0.0015 24.6 6.3 47 36-88 3-55 (432)
27 PF14900 DUF4493: Domain of un 48.3 55 0.0012 22.6 4.9 37 44-80 119-160 (235)
28 PF06150 ChaB: ChaB; InterPro 48.2 47 0.001 18.4 4.4 43 11-55 7-54 (57)
29 PF06457 Ectatomin: Ectatomin; 47.5 18 0.00039 17.9 1.7 13 36-48 22-34 (34)
30 PF08594 UPF0300: Uncharacteri 46.8 72 0.0015 22.6 5.2 58 13-70 37-112 (215)
31 PRK14023 homoaconitate hydrata 46.2 28 0.0006 23.5 3.0 34 34-67 27-64 (166)
32 PF00352 TBP: Transcription fa 46.1 29 0.00063 20.4 2.8 25 65-89 30-58 (86)
33 COG0655 WrbA Multimeric flavod 45.9 17 0.00036 24.7 1.9 56 11-66 59-120 (207)
34 PF10045 DUF2280: Uncharacteri 45.2 59 0.0013 20.5 4.1 32 13-48 4-35 (104)
35 cd00197 VHS_ENTH_ANTH VHS, ENT 44.9 11 0.00025 23.0 0.9 36 19-57 19-54 (115)
36 PF02283 CobU: Cobinamide kina 43.8 30 0.00064 23.1 2.9 25 35-60 36-60 (167)
37 TIGR03515 GldC gliding motilit 43.4 71 0.0015 20.2 4.3 42 6-47 53-98 (108)
38 PRK13007 succinyl-diaminopimel 43.4 1.2E+02 0.0026 21.7 6.6 39 13-51 1-42 (352)
39 cd03545 Rieske_RO_Alpha_OHBDO_ 43.1 92 0.002 20.2 5.1 44 40-88 11-56 (150)
40 PF08908 DUF1852: Domain of un 42.3 13 0.00029 27.3 1.0 23 45-67 98-120 (322)
41 PF06057 VirJ: Bacterial virul 42.0 30 0.00066 24.0 2.7 29 37-66 53-81 (192)
42 PF06925 MGDG_synth: Monogalac 42.0 32 0.00069 22.4 2.8 27 37-63 3-29 (169)
43 KOG2451|consensus 41.2 62 0.0013 25.3 4.4 41 6-46 51-93 (503)
44 TIGR03147 cyt_nit_nrfF cytochr 41.1 16 0.00035 23.7 1.2 12 41-52 78-89 (126)
45 PRK09111 DNA polymerase III su 41.0 31 0.00066 27.8 3.0 29 35-63 504-532 (598)
46 KOG3248|consensus 40.3 17 0.00036 27.7 1.3 12 15-26 204-215 (421)
47 PF07742 BTG: BTG family; Int 40.0 55 0.0012 20.8 3.5 19 37-55 29-47 (118)
48 PF13798 PCYCGC: Protein of un 39.0 12 0.00027 25.2 0.4 34 18-52 114-147 (158)
49 PF03564 DUF1759: Protein of u 38.8 60 0.0013 20.4 3.6 43 11-53 23-70 (145)
50 PF11307 DUF3109: Protein of u 38.6 9.6 0.00021 26.3 -0.1 31 24-54 145-176 (183)
51 PF01726 LexA_DNA_bind: LexA D 37.7 77 0.0017 17.8 3.6 28 15-42 5-35 (65)
52 PF05184 SapB_1: Saposin-like 37.7 53 0.0011 15.9 4.5 31 16-46 5-35 (39)
53 PRK10667 Hha toxicity attenuat 37.0 58 0.0013 21.0 3.2 20 31-50 65-84 (122)
54 COG1352 CheR Methylase of chem 36.4 51 0.0011 23.8 3.3 47 4-56 128-178 (268)
55 TIGR00294 conserved hypothetic 35.6 64 0.0014 23.6 3.7 39 36-74 251-293 (308)
56 cd01578 AcnA_Mitochon_Swivel M 35.3 23 0.00049 23.7 1.2 18 50-67 67-84 (149)
57 PF03918 CcmH: Cytochrome C bi 34.7 21 0.00045 23.6 1.0 34 11-52 56-89 (148)
58 PF06884 DUF1264: Protein of u 34.6 32 0.00069 23.5 1.9 19 38-56 98-116 (171)
59 PF01739 CheR: CheR methyltran 33.8 19 0.00041 24.7 0.7 47 5-55 64-111 (196)
60 KOG2130|consensus 33.7 38 0.00082 25.8 2.3 19 51-69 277-295 (407)
61 PF10925 DUF2680: Protein of u 33.3 91 0.002 17.4 3.5 16 35-50 34-49 (59)
62 TIGR01641 phageSPP1_gp7 phage 33.2 65 0.0014 19.4 3.0 13 34-46 14-26 (108)
63 PF08015 Pheromone: Fungal mat 32.9 25 0.00054 19.9 1.0 14 45-58 54-69 (69)
64 PRK10144 formate-dependent nit 32.9 26 0.00057 22.7 1.2 12 41-52 78-89 (126)
65 KOG3165|consensus 32.8 41 0.0009 23.1 2.2 25 38-62 48-72 (195)
66 PF03869 Arc: Arc-like DNA bin 32.7 56 0.0012 17.5 2.3 35 12-49 10-45 (50)
67 PRK06489 hypothetical protein; 32.6 28 0.00062 25.3 1.5 32 36-67 136-168 (360)
68 PF10757 YbaJ: Biofilm formati 32.3 65 0.0014 20.8 2.9 20 31-50 65-84 (122)
69 TIGR03228 anthran_1_2_A anthra 31.7 1.1E+02 0.0023 23.8 4.5 43 42-88 27-69 (438)
70 cd01579 AcnA_Bact_Swivel Bacte 31.0 33 0.00072 21.9 1.4 19 49-67 45-63 (121)
71 KOG4194|consensus 30.4 33 0.00072 28.4 1.6 24 51-74 586-609 (873)
72 PF06983 3-dmu-9_3-mt: 3-demet 30.2 24 0.00053 21.9 0.7 12 46-57 105-116 (116)
73 KOG1573|consensus 28.8 2E+02 0.0043 19.9 5.8 60 12-72 92-162 (204)
74 PRK14964 DNA polymerase III su 28.6 56 0.0012 25.7 2.6 26 36-61 434-459 (491)
75 PF10655 DUF2482: Hypothetical 27.9 38 0.00082 21.0 1.2 31 33-63 22-65 (100)
76 cd05503 Bromo_BAZ2A_B_like Bro 27.7 1.1E+02 0.0025 18.2 3.4 36 15-50 59-97 (97)
77 PF01743 PolyA_pol: Poly A pol 27.1 1.6E+02 0.0036 18.3 4.8 37 39-77 28-64 (126)
78 PF07062 Clc-like: Clc-like; 26.9 56 0.0012 22.9 2.1 24 50-75 23-46 (211)
79 cd00454 Trunc_globin Truncated 26.7 1.5E+02 0.0032 17.7 4.5 36 12-47 72-107 (116)
80 PHA00368 internal virion prote 26.5 94 0.002 27.3 3.6 40 10-49 585-632 (1315)
81 TIGR02087 LEUD_arch 3-isopropy 26.2 99 0.0022 20.5 3.1 34 34-67 25-62 (154)
82 CHL00168 pbsA heme oxygenase; 26.1 1.3E+02 0.0028 21.4 3.9 37 11-47 179-216 (238)
83 PF01545 Cation_efflux: Cation 26.0 84 0.0018 21.8 2.9 22 37-58 255-276 (284)
84 PF15571 Imm25: Immunity prote 25.4 52 0.0011 21.3 1.6 23 36-58 18-40 (124)
85 COG1498 SIK1 Protein implicate 25.3 1.2E+02 0.0026 23.4 3.8 40 9-48 200-239 (395)
86 PF06840 DUF1241: Protein of u 25.1 52 0.0011 22.1 1.6 13 36-48 123-135 (154)
87 KOG1569|consensus 24.8 1.5E+02 0.0032 22.3 4.0 27 37-63 227-261 (323)
88 PF14824 Sirohm_synth_M: Siroh 24.8 35 0.00075 16.7 0.5 16 35-50 15-30 (30)
89 PRK10425 DNase TatD; Provision 24.2 2.3E+02 0.0049 20.1 4.9 46 15-60 103-157 (258)
90 PRK00394 transcription factor; 24.2 1.3E+02 0.0029 20.3 3.5 25 65-89 120-148 (179)
91 COG1999 Uncharacterized protei 24.2 1.4E+02 0.0031 20.4 3.7 26 34-59 114-139 (207)
92 cd06913 beta3GnTL1_like Beta 1 24.2 1.5E+02 0.0032 19.4 3.7 6 53-58 56-61 (219)
93 TIGR01916 F420_cofE F420-0:gam 24.2 1.1E+02 0.0025 21.9 3.3 30 36-65 125-154 (243)
94 cd01580 AcnA_IRP_Swivel Aconit 24.0 48 0.001 22.7 1.3 18 49-66 93-110 (171)
95 PF06793 UPF0262: Uncharacteri 23.9 86 0.0019 21.2 2.4 29 16-49 108-136 (158)
96 COG5556 Uncharacterized conser 23.9 1.9E+02 0.0042 18.1 4.0 31 12-46 3-33 (110)
97 COG4003 Uncharacterized protei 23.5 1.8E+02 0.004 17.8 3.6 30 19-48 48-77 (98)
98 TIGR03849 arch_ComA phosphosul 23.5 2.5E+02 0.0054 20.1 4.9 47 10-59 94-154 (237)
99 PRK13293 F420-0--gamma-glutamy 23.4 1.2E+02 0.0026 21.9 3.3 29 36-64 126-154 (245)
100 PF04622 ERG2_Sigma1R: ERG2 an 23.4 1.8E+02 0.004 20.4 4.2 33 20-52 35-68 (216)
101 PRK13888 conjugal transfer pro 23.4 26 0.00056 24.6 -0.1 21 37-57 88-109 (206)
102 PF02268 TFIIA_gamma_N: Transc 23.4 1.4E+02 0.003 16.1 3.8 22 27-48 21-42 (49)
103 PF02318 FYVE_2: FYVE-type zin 23.2 1.3E+02 0.0028 18.7 3.1 36 11-46 1-36 (118)
104 KOG3938|consensus 23.0 1.5E+02 0.0034 22.0 3.8 22 11-32 250-271 (334)
105 cd00404 Aconitase_swivel Aconi 22.8 64 0.0014 19.5 1.6 15 52-66 15-29 (88)
106 cd03479 Rieske_RO_Alpha_PhDO_l 22.6 1.7E+02 0.0037 18.7 3.7 38 46-88 14-52 (144)
107 PF01152 Bac_globin: Bacterial 22.5 1.9E+02 0.0041 17.4 4.3 35 12-46 76-110 (120)
108 PF10438 Cyc-maltodext_C: Cycl 22.2 1.5E+02 0.0032 17.4 3.1 19 68-86 7-26 (78)
109 PF10302 DUF2407: DUF2407 ubiq 22.0 44 0.00095 20.5 0.8 7 54-60 90-96 (97)
110 PF09477 Type_III_YscG: Bacter 22.0 2.3E+02 0.0049 18.2 4.1 32 13-44 1-34 (116)
111 TIGR01738 bioH putative pimelo 22.0 56 0.0012 20.9 1.3 28 35-67 52-79 (245)
112 PRK00466 acetyl-lysine deacety 21.9 3.1E+02 0.0066 19.7 5.3 40 15-55 6-48 (346)
113 cd01577 IPMI_Swivel Aconatase- 21.7 47 0.001 20.2 0.8 15 53-67 18-32 (91)
114 COG1911 RPL30 Ribosomal protei 21.7 2.1E+02 0.0046 17.8 3.7 27 2-28 34-60 (100)
115 cd04518 TBP_archaea archaeal T 21.6 1.2E+02 0.0025 20.5 2.8 25 65-89 119-147 (174)
116 cd04196 GT_2_like_d Subfamily 21.5 1.6E+02 0.0035 18.7 3.5 8 53-60 27-34 (214)
117 PRK13601 putative L7Ae-like ri 21.5 1.9E+02 0.0041 17.1 3.6 25 4-28 25-49 (82)
118 cd01674 Homoaconitase_Swivel H 21.5 94 0.002 20.3 2.2 34 35-68 24-61 (129)
119 PRK08775 homoserine O-acetyltr 21.4 61 0.0013 23.3 1.5 32 36-67 121-152 (343)
120 cd00652 TBP_TLF TATA box bindi 21.0 1.8E+02 0.004 19.4 3.7 26 64-89 119-148 (174)
121 COG2101 SPT15 TATA-box binding 21.0 1.1E+02 0.0023 21.2 2.5 16 74-89 45-62 (185)
122 cd05392 RasGAP_Neurofibromin_l 20.7 96 0.0021 22.6 2.4 31 29-59 143-174 (323)
123 PF12697 Abhydrolase_6: Alpha/ 20.7 63 0.0014 20.2 1.3 30 36-66 50-79 (228)
124 PRK13675 GTP cyclohydrolase; P 20.6 1E+02 0.0022 22.8 2.5 43 36-78 253-299 (308)
125 PF00271 Helicase_C: Helicase 20.4 1.2E+02 0.0027 16.4 2.4 24 4-27 8-31 (78)
126 PF00817 IMS: impB/mucB/samB f 20.3 1.2E+02 0.0025 19.3 2.6 26 36-61 115-140 (149)
127 PF10047 DUF2281: Protein of u 20.1 1.7E+02 0.0037 16.3 2.9 22 11-32 10-31 (66)
128 TIGR03229 benzo_1_2_benA benzo 20.1 2.5E+02 0.0054 21.7 4.6 43 42-88 27-69 (433)
129 PF02806 Alpha-amylase_C: Alph 20.1 1.5E+02 0.0032 17.1 2.8 38 51-88 46-93 (95)
No 1
>KOG3430|consensus
Probab=100.00 E-value=3.3e-44 Score=217.45 Aligned_cols=89 Identities=79% Similarity=1.265 Sum_probs=86.9
Q ss_pred CCCCccEEEeCCCCHHHHHHHHHHHHHHHhhcC-chHHHHHHHHHHhcccCCCceEEEEeCCceeeEEEcCCcEEEEEeC
Q psy1421 1 MSDRKAVIKNADMSEEMQQDAVDCATQALEKYN-IEKDIAAYIKKEFDKKYNPTWHCIVGRNFGSYVTHETRHFIYFYLG 79 (89)
Q Consensus 1 m~~~~~~v~~~dM~~~mq~~i~~~~~~a~~~~~-~~~~iA~~iK~~lD~~yG~~Wh~vVG~~fg~~~th~~~~~~~f~~~ 79 (89)
|++.+++|+.+|||++||+++++++++|+++|+ +++++|..||++||++|||+||||||++|||++||++++|+||++|
T Consensus 1 ~~~~~~~vk~tDM~~~mq~~a~~~a~~al~~f~~~~k~iA~~iKkefDkkyG~~WhcivG~~FGs~vThe~g~Fiyf~~g 80 (90)
T KOG3430|consen 1 MLERKAVVKATDMPEEMQQEAIELARQALEKFNVIEKDIAAFIKKEFDKKYGPTWHCIVGRNFGSYVTHETGHFIYFYLG 80 (90)
T ss_pred CCCccceEecCCCChHHHHHHHHHHHHHHHHcCCChHHHHHHHHHHHhhhcCCccEEEEcCCcceEEEeecCcEEEEEec
Confidence 778999999999999999999999999999999 7899999999999999999999999999999999999999999999
Q ss_pred CEEEEEEeeC
Q psy1421 80 QVAILLFKSG 89 (89)
Q Consensus 80 ~~~~lv~kt~ 89 (89)
.+++||||++
T Consensus 81 ~l~illfK~~ 90 (90)
T KOG3430|consen 81 VLAILLFKCA 90 (90)
T ss_pred eEEEEEEecC
Confidence 9999999985
No 2
>PTZ00059 dynein light chain; Provisional
Probab=100.00 E-value=6.9e-44 Score=219.05 Aligned_cols=89 Identities=93% Similarity=1.469 Sum_probs=87.4
Q ss_pred CCCCccEEEeCCCCHHHHHHHHHHHHHHHhhcCchHHHHHHHHHHhcccCCCceEEEEeCCceeeEEEcCCcEEEEEeCC
Q psy1421 1 MSDRKAVIKNADMSEEMQQDAVDCATQALEKYNIEKDIAAYIKKEFDKKYNPTWHCIVGRNFGSYVTHETRHFIYFYLGQ 80 (89)
Q Consensus 1 m~~~~~~v~~~dM~~~mq~~i~~~~~~a~~~~~~~~~iA~~iK~~lD~~yG~~Wh~vVG~~fg~~~th~~~~~~~f~~~~ 80 (89)
|++.+++|+.+|||++||++|++++.+|++++++++|+|++||+.||++|||+||||||++|||++||++++|+||++|+
T Consensus 2 m~~~~~~i~~~dM~~emq~~a~~~~~~Al~~~~~~kdiA~~IK~~fD~~yg~~WhciVG~~Fgs~vthe~~~~i~F~~~~ 81 (90)
T PTZ00059 2 MSDRKAVVKNADMSEDMQQDAIDCANQALEKFNIEKDIAAYIKKEFDKKYNPTWHCIVGRNFGSYVTHETKHFIYFYLGQ 81 (90)
T ss_pred CCCCccEEEECCCCHHHHHHHHHHHHHHHHHcCchHHHHHHHHHHHHhhcCCCCEEEEecCeeEEEEEeCCcEEEEEECC
Confidence 78889999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred EEEEEEeeC
Q psy1421 81 VAILLFKSG 89 (89)
Q Consensus 81 ~~~lv~kt~ 89 (89)
++|||||+|
T Consensus 82 ~~vLlfK~~ 90 (90)
T PTZ00059 82 VAILLFKSG 90 (90)
T ss_pred EEEEEEecC
Confidence 999999997
No 3
>PF01221 Dynein_light: Dynein light chain type 1 ; InterPro: IPR001372 Dynein is a multisubunit microtubule-dependent motor enzyme that acts as the force generating protein of eukaryotic cilia and flagella. The cytoplasmic isoform of dynein acts as a motor for the intracellular retrograde motility of vesicles and organelles along microtubules. Dynein is composed of a number of ATP-binding large subunits (see IPR004273 from INTERPRO), intermediate size subunits and small subunits. Among the small subunits, there is a family of highly conserved proteins which make up this family [, ]. Both type 1 (DLC1) and 2 (DLC2) dynein light chains have a similar two-layer alpha-beta core structure consisting of beta-alpha(2)-beta-X-beta(2) [, ].; GO: 0007017 microtubule-based process, 0005875 microtubule associated complex; PDB: 1F95_A 1F96_A 1F3C_A 3P8M_B 2XQQ_C 1RE6_A 1CMI_A 1PWK_A 1PWJ_A 4DS1_C ....
Probab=100.00 E-value=9e-43 Score=213.32 Aligned_cols=89 Identities=58% Similarity=1.039 Sum_probs=85.3
Q ss_pred CCCCccEEEeCCCCHHHHHHHHHHHHHHHhhcCchHHHHHHHHHHhcccCCCceEEEEeCCceeeEEEcCCcEEEEEeCC
Q psy1421 1 MSDRKAVIKNADMSEEMQQDAVDCATQALEKYNIEKDIAAYIKKEFDKKYNPTWHCIVGRNFGSYVTHETRHFIYFYLGQ 80 (89)
Q Consensus 1 m~~~~~~v~~~dM~~~mq~~i~~~~~~a~~~~~~~~~iA~~iK~~lD~~yG~~Wh~vVG~~fg~~~th~~~~~~~f~~~~ 80 (89)
|++.+++|+.+|||++||++|+++|.+|++++++++++|++||++||++|||+||||||++||+++||+++++++|++++
T Consensus 1 m~~~~~~i~~~dM~~~~~~~~~~~~~~a~~~~~~~~eiA~~iK~~lD~~yG~~Wh~IVG~~Fg~~~th~~~~~~~f~~~~ 80 (89)
T PF01221_consen 1 MSENKIVIKSSDMPEEMQEEAIELAKEALKKYQDEKEIAEFIKQELDKKYGPTWHCIVGKSFGSSVTHEPGTFLYFKIGN 80 (89)
T ss_dssp SGSCSEEEEEEES-HHHHHHHHHHHHHHHHHCSSHHHHHHHHHHHHHHHHSS-EEEEEESEEEEEEEEETTEEEEEEETT
T ss_pred CCCCccEEEECCCCHHHHHHHHHHHHHHHHHCCcHHHHHHHHHHHHhcccCCceEEEECCcEEEEEEEcCCcEEEEEECC
Confidence 89999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred EEEEEEeeC
Q psy1421 81 VAILLFKSG 89 (89)
Q Consensus 81 ~~~lv~kt~ 89 (89)
++|||||||
T Consensus 81 ~~~li~kt~ 89 (89)
T PF01221_consen 81 IAFLIFKTQ 89 (89)
T ss_dssp EEEEEEEE-
T ss_pred EEEEEEecC
Confidence 999999996
No 4
>PLN03058 dynein light chain type 1 family protein; Provisional
Probab=100.00 E-value=6.5e-41 Score=216.56 Aligned_cols=87 Identities=45% Similarity=0.815 Sum_probs=83.5
Q ss_pred CCccEEEeCCCCHHHHHHHHHHHHHHHhhc---CchHHHHHHHHHHhcccCCCceEEEEeCCceeeEEEcCCcEEEEEeC
Q psy1421 3 DRKAVIKNADMSEEMQQDAVDCATQALEKY---NIEKDIAAYIKKEFDKKYNPTWHCIVGRNFGSYVTHETRHFIYFYLG 79 (89)
Q Consensus 3 ~~~~~v~~~dM~~~mq~~i~~~~~~a~~~~---~~~~~iA~~iK~~lD~~yG~~Wh~vVG~~fg~~~th~~~~~~~f~~~ 79 (89)
+.+++|+.+|||++||++|+++|..|++++ ++++++|++||+.||++|||+||||||++|||++||++++|+||++|
T Consensus 33 ~~dv~Ik~sDM~~emQ~~ave~a~~Al~k~~~~~~ekdIA~~IKk~fDkkYG~tWHCIVGk~FGs~VTHe~~~fIyF~ig 112 (128)
T PLN03058 33 ELNVRVRASDMPLVLQNRAFSCARDILDAMPGKLDSKRLALALKKEFDSAYGPAWHCIVGTSFGSYVTHSTGGFLYFSID 112 (128)
T ss_pred CCCCEEEECCCCHHHHHHHHHHHHHHHHHccccCCHHHHHHHHHHHHhhhhCCceEEEECCcEEEEEEEcCCcEEEEEEC
Confidence 468999999999999999999999999997 46799999999999999999999999999999999999999999999
Q ss_pred CEEEEEEeeC
Q psy1421 80 QVAILLFKSG 89 (89)
Q Consensus 80 ~~~~lv~kt~ 89 (89)
+++|||||||
T Consensus 113 ~~aiLLfKt~ 122 (128)
T PLN03058 113 KVYILLFKTA 122 (128)
T ss_pred CEEEEEEecc
Confidence 9999999996
No 5
>PF04155 Ground-like: Ground-like domain; InterPro: IPR007284 This group of proteins contain one or more copies of the ground-like domain, which are specific to Caenorhabditis elegans and Caenorhabditis briggsae. It has been proposed that the ground-like domain containing proteins may bind and modulate the activity of Patched-like membrane molecules, reminiscent of the modulating activities of neuropeptides [].
Probab=97.84 E-value=0.00032 Score=41.35 Aligned_cols=54 Identities=15% Similarity=0.173 Sum_probs=45.9
Q ss_pred CchHHHHHHHHHHhcccCCCceEEEEeC-CceeeEEEcCCcEEEEEeCCEEEEEEe
Q psy1421 33 NIEKDIAAYIKKEFDKKYNPTWHCIVGR-NFGSYVTHETRHFIYFYLGQVAILLFK 87 (89)
Q Consensus 33 ~~~~~iA~~iK~~lD~~yG~~Wh~vVG~-~fg~~~th~~~~~~~f~~~~~~~lv~k 87 (89)
.+..++++.|.+.+.++||..+-||+++ +|....... ..|--...++..+++|+
T Consensus 22 ~~~~~s~~~Iq~~~e~~f~~~f~vIcs~~~Fsy~~~~~-~~~C~~~~~g~~c~af~ 76 (76)
T PF04155_consen 22 CNLSISKRAIQKAAEKRFGGSFEVICSEGDFSYSTHTD-DLYCKVEKNGVTCLAFA 76 (76)
T ss_pred CCHHHHHHHHHHHHHHHhCCCEEEEEeCCCceeEEecc-cceeeeeeCCEEEEEEC
Confidence 5567899999999999999999999999 576555544 77888899999999985
No 6
>PF05075 DUF684: Protein of unknown function (DUF684); InterPro: IPR007767 This family contains uncharacterised proteins from Caenorhabditis elegans.
Probab=89.00 E-value=5.2 Score=29.71 Aligned_cols=54 Identities=22% Similarity=0.416 Sum_probs=38.4
Q ss_pred hHHHHHHHHHHhcccCC-CceEEEEeC------C--ceeeEEEcCCcEEE-EEeCCEEEEEEee
Q psy1421 35 EKDIAAYIKKEFDKKYN-PTWHCIVGR------N--FGSYVTHETRHFIY-FYLGQVAILLFKS 88 (89)
Q Consensus 35 ~~~iA~~iK~~lD~~yG-~~Wh~vVG~------~--fg~~~th~~~~~~~-f~~~~~~~lv~kt 88 (89)
..+.|..||+.||+..- -.+-++|-. + |.....+....++. |.-|+..++||||
T Consensus 188 n~eKAd~Ik~~Le~ilTnDsFYIiVfd~~~~~~~~~~y~~~~~~~dq~I~s~~rGgcNv~VYRS 251 (345)
T PF05075_consen 188 NEEKADEIKKKLEKILTNDSFYIIVFDDCSGYDNHYYYGFYDNNEDQYIESFNRGGCNVFVYRS 251 (345)
T ss_pred hHHHHHHHHHHHHhhCCCCcEEEEEecccccCCccceeeeccCcccCEEEEEeCCCeEEEEEee
Confidence 46799999999999654 458888833 1 22222355666665 6679999999998
No 7
>PF13742 tRNA_anti_2: OB-fold nucleic acid binding domain
Probab=83.71 E-value=4.9 Score=24.49 Aligned_cols=47 Identities=26% Similarity=0.687 Sum_probs=34.3
Q ss_pred HHHHHHHHHHhccc--CCCceEEEEeC--CceeeEEEcCCcEEEEEe----CCEEEEEEee
Q psy1421 36 KDIAAYIKKEFDKK--YNPTWHCIVGR--NFGSYVTHETRHFIYFYL----GQVAILLFKS 88 (89)
Q Consensus 36 ~~iA~~iK~~lD~~--yG~~Wh~vVG~--~fg~~~th~~~~~~~f~~----~~~~~lv~kt 88 (89)
.++.+.||+.+++. ++..| |.|+ ++- .| .+..+||.+ ..+.+.+|++
T Consensus 5 s~l~~~ik~~le~~~~~~~vw--V~GEIs~~~---~~-~~gh~YftLkD~~a~i~~~~~~~ 59 (99)
T PF13742_consen 5 SELNNYIKDLLERDPPLPNVW--VEGEISNLK---RH-SSGHVYFTLKDEEASISCVIFRS 59 (99)
T ss_pred HHHHHHHHHHHhcCCCcCCEE--EEEEEeecE---EC-CCceEEEEEEcCCcEEEEEEEHH
Confidence 57889999999998 67888 5666 343 22 667788887 3567777764
No 8
>PF15650 Tox-REase-9: Restriction endonuclease fold toxin 9
Probab=81.25 E-value=1.4 Score=26.97 Aligned_cols=17 Identities=29% Similarity=0.640 Sum_probs=15.3
Q ss_pred HHHHhcccCCCceEEEE
Q psy1421 42 IKKEFDKKYNPTWHCIV 58 (89)
Q Consensus 42 iK~~lD~~yG~~Wh~vV 58 (89)
=|++|...||.+|.+||
T Consensus 70 Y~~el~~~~G~~W~~~l 86 (89)
T PF15650_consen 70 YKQELEKIYGGGWKTRL 86 (89)
T ss_pred HHHHhcCccCCCeeEEe
Confidence 47889999999999997
No 9
>PF12006 DUF3500: Protein of unknown function (DUF3500); InterPro: IPR021889 This family of proteins is functionally uncharacterised. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 335 to 438 amino acids in length. This protein has a conserved GHH sequence motif. This protein has two completely conserved G residues that may be functionally important.
Probab=79.07 E-value=17 Score=26.78 Aligned_cols=43 Identities=16% Similarity=0.254 Sum_probs=30.9
Q ss_pred EEeCCCCHHHHHHHHHHHHHHHhhcCch--HHHHHHH-HHHhcccC
Q psy1421 8 IKNADMSEEMQQDAVDCATQALEKYNIE--KDIAAYI-KKEFDKKY 50 (89)
Q Consensus 8 v~~~dM~~~mq~~i~~~~~~a~~~~~~~--~~iA~~i-K~~lD~~y 50 (89)
|..++|+.+.|+.+..++..-+..++.+ ++.-+.| +..||+.|
T Consensus 216 l~~s~Lt~~Qq~ll~~li~~y~~~~~~~~a~~~~~~i~~~~ld~t~ 261 (313)
T PF12006_consen 216 LAVSELTADQQELLLALIKEYLGRLPEEDAAERMAEIEEAGLDETY 261 (313)
T ss_pred cChhhCCHHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHhccccceE
Confidence 5678999999999999999888877753 1222234 66666655
No 10
>PF10703 MoaF: Molybdenum cofactor biosynthesis protein F; InterPro: IPR024724 Molybdenum cofactor biosynthesis protein F (MoaF) is essential for the production of the monoamine-inducible 30kDa protein in Klebsiella []. It is necessary for reconstituting organoautotrophic growth in Ralstonia eutropha []. MoaF is conserved in proteobacteria and some lower eukaryotes. The operon regulating the Moa genes is responsible for molybdenum cofactor biosynthesis.
Probab=77.97 E-value=4.6 Score=29.34 Aligned_cols=32 Identities=19% Similarity=0.626 Sum_probs=23.8
Q ss_pred ceEEEEeCCceeeEEEcCCcEEEEEe-CCEEEEEEe
Q psy1421 53 TWHCIVGRNFGSYVTHETRHFIYFYL-GQVAILLFK 87 (89)
Q Consensus 53 ~Wh~vVG~~fg~~~th~~~~~~~f~~-~~~~~lv~k 87 (89)
+|||+.|..=|-. +....-++++ .++.+++|+
T Consensus 182 ~W~CL~G~e~Gla---D~D~c~~~Ki~d~lYlf~Wr 214 (265)
T PF10703_consen 182 AWQCLSGVEKGLA---DTDRCHYYKIADNLYLFTWR 214 (265)
T ss_pred EEEEeeccccCCC---CccceEEEEecCCEEEEEEE
Confidence 4999999976632 4577777887 577777776
No 11
>PF12652 CotJB: CotJB protein; InterPro: IPR024207 The cotJ operon proteins affect spore coat composition, and is controlled by sigma E. The genes, which include CotJB, are either required for the normal formation of the inner layers of the coat or are themselves structural components of the coat []. CotJB has been identified as a spore coat protein [].
Probab=77.58 E-value=1.9 Score=25.70 Aligned_cols=26 Identities=42% Similarity=0.655 Sum_probs=16.8
Q ss_pred HHhhcCchHHHHHHHHHHhcccCCCc
Q psy1421 28 ALEKYNIEKDIAAYIKKEFDKKYNPT 53 (89)
Q Consensus 28 a~~~~~~~~~iA~~iK~~lD~~yG~~ 53 (89)
|++.|+.....-+.+++.+.++|||-
T Consensus 31 Al~~y~~~~~~~~~l~~~Ye~~yGPL 56 (78)
T PF12652_consen 31 ALEYYNEYSKQRKQLKKEYEKRYGPL 56 (78)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhCCC
Confidence 33333333344566889999999985
No 12
>PF12362 DUF3646: DNA polymerase III gamma and tau subunits C terminal; InterPro: IPR022107 This domain family is found in bacteria, and is approximately 120 amino acids in length. The family is found in association with PF00004 from PFAM. The proteins in this family are frequently annotated as the gamma and tau subunits of DNA polymerase III, however there is little accompanying literature to back this up.
Probab=73.63 E-value=4.8 Score=25.66 Aligned_cols=29 Identities=17% Similarity=0.375 Sum_probs=26.2
Q ss_pred hHHHHHHHHHHhcccCCCceEEEEeCCce
Q psy1421 35 EKDIAAYIKKEFDKKYNPTWHCIVGRNFG 63 (89)
Q Consensus 35 ~~~iA~~iK~~lD~~yG~~Wh~vVG~~fg 63 (89)
..++|..|.+.|.+--|..|.|.+++.=|
T Consensus 48 p~dl~~~L~~~L~~wTG~rW~V~~s~~~g 76 (117)
T PF12362_consen 48 PKDLAQRLSRKLQEWTGQRWIVSLSNEPG 76 (117)
T ss_pred CHHHHHHHHHHHHHHhCCCeEEEEeCCCC
Confidence 46899999999999999999999998755
No 13
>PRK00286 xseA exodeoxyribonuclease VII large subunit; Reviewed
Probab=69.78 E-value=17 Score=27.58 Aligned_cols=48 Identities=27% Similarity=0.659 Sum_probs=36.2
Q ss_pred hHHHHHHHHHHhcccCCCceEEEEeC--CceeeEEEcCCcEEEEEe----CCEEEEEEee
Q psy1421 35 EKDIAAYIKKEFDKKYNPTWHCIVGR--NFGSYVTHETRHFIYFYL----GQVAILLFKS 88 (89)
Q Consensus 35 ~~~iA~~iK~~lD~~yG~~Wh~vVG~--~fg~~~th~~~~~~~f~~----~~~~~lv~kt 88 (89)
-.++..+||..|+..++..| |.|+ +|-. | ..-.+||.+ ..+.+.+|++
T Consensus 8 vsel~~~ik~~le~~~~~v~--v~gEis~~~~---~-~sGH~Yf~Lkd~~a~i~~~~~~~ 61 (438)
T PRK00286 8 VSELNRYVKSLLERDLGQVW--VRGEISNFTR---H-SSGHWYFTLKDEIAQIRCVMFKG 61 (438)
T ss_pred HHHHHHHHHHHHHhhCCcEE--EEEEeCCCee---C-CCCeEEEEEEcCCcEEEEEEEcC
Confidence 46789999999999988888 6666 4442 2 445688987 4578999985
No 14
>PF08006 DUF1700: Protein of unknown function (DUF1700); InterPro: IPR012963 This family contains many hypothetical bacterial proteins and two putative membrane proteins (Q6GFD0 from SWISSPROT and Q6G806 from SWISSPROT).
Probab=62.43 E-value=24 Score=23.44 Aligned_cols=36 Identities=19% Similarity=0.347 Sum_probs=25.1
Q ss_pred CCCCHHHHHHHHHHHHHHHhh--------------cCchHHHHHHHHHHh
Q psy1421 11 ADMSEEMQQDAVDCATQALEK--------------YNIEKDIAAYIKKEF 46 (89)
Q Consensus 11 ~dM~~~mq~~i~~~~~~a~~~--------------~~~~~~iA~~iK~~l 46 (89)
.+||++.++++++-..+-++. .++.+++|+.|+...
T Consensus 15 ~~lp~~e~~e~l~~Y~e~f~d~~~~G~sEeeii~~LG~P~~iA~~i~~~~ 64 (181)
T PF08006_consen 15 KKLPEEEREEILEYYEEYFDDAGEEGKSEEEIIAELGSPKEIAREILAEY 64 (181)
T ss_pred HcCCHHHHHHHHHHHHHHHHHhhhCCCCHHHHHHHcCCHHHHHHHHHHhh
Confidence 469999999999877655442 334567888876543
No 15
>COG1570 XseA Exonuclease VII, large subunit [DNA replication, recombination, and repair]
Probab=60.62 E-value=17 Score=28.36 Aligned_cols=50 Identities=26% Similarity=0.562 Sum_probs=36.6
Q ss_pred hHHHHHHHHHHhcccCCCceEEEEeCCceeeEEEcCCcEEEEEeC----CEEEEEEee
Q psy1421 35 EKDIAAYIKKEFDKKYNPTWHCIVGRNFGSYVTHETRHFIYFYLG----QVAILLFKS 88 (89)
Q Consensus 35 ~~~iA~~iK~~lD~~yG~~Wh~vVG~~fg~~~th~~~~~~~f~~~----~~~~lv~kt 88 (89)
-.++..+||..||..+|..| |-|+ =|.+|+-+....||.+. .+.+.+||.
T Consensus 8 VSeln~~ik~llE~~~~~V~--v~GE--ISn~t~~~sgH~YFtLKD~~A~i~c~mf~~ 61 (440)
T COG1570 8 VSELNDYIKRLLERDLGQVW--VRGE--ISNFTRPASGHLYFTLKDERAQIRCVMFKG 61 (440)
T ss_pred HHHHHHHHHHHHHhcCCeEE--EEEE--ecCCccCCCccEEEEEccCCceEEEEEEcC
Confidence 35789999999999999998 5566 12345444448899983 567888874
No 16
>TIGR02084 leud 3-isopropylmalate dehydratase, small subunit. Several pairs of archaeal proteins resemble the leuC and leuD pair in length and sequence but even more closely resemble the respective domains of homoaconitase, and their identity is uncertain. The members of the seed for this model are those sequences which are gene clustered with other genes involved in leucine biosynthesis and include some archaea.
Probab=60.28 E-value=16 Score=24.39 Aligned_cols=34 Identities=26% Similarity=0.320 Sum_probs=22.6
Q ss_pred chHHHHHHHHHHhcc----cCCCceEEEEeCCceeeEE
Q psy1421 34 IEKDIAAYIKKEFDK----KYNPTWHCIVGRNFGSYVT 67 (89)
Q Consensus 34 ~~~~iA~~iK~~lD~----~yG~~Wh~vVG~~fg~~~t 67 (89)
+..++++++-+.+|. ++.+..=+|.|+|||+=-|
T Consensus 25 ~~~~l~~~~f~~~~p~f~~~~~~g~iiVaG~NFG~GSS 62 (156)
T TIGR02084 25 DPKELAKHCMEDLDKDFVKKVKEGDIIVAGENFGCGSS 62 (156)
T ss_pred CHHHHHhhhhccCChhHHhhcCCCCEEEccCcccCCCc
Confidence 345677777666663 4556677888889996443
No 17
>cd03538 Rieske_RO_Alpha_AntDO Rieske non-heme iron oxygenase (RO) family, Anthranilate 1,2-dioxygenase (AntDO) subfamily, N-terminal Rieske domain of the oxygenase alpha subunit; ROs comprise a large class of aromatic ring-hydroxylating dioxygenases that enable microorganisms to tolerate and utilize aromatic compounds for growth. The oxygenase alpha subunit contains an N-terminal Rieske domain with an [2Fe-2S] cluster and a C-terminal catalytic domain with a mononuclear Fe(II) binding site. The Rieske [2Fe-2S] cluster accepts electrons from a reductase or ferredoxin component and transfers them to the mononuclear iron for catalysis. AntDO converts anthranilate to catechol, a naturally occurring compound formed through tryptophan degradation and an important intermediate in the metabolism of many N-heterocyclic compounds such as indole, o-nitrobenzoate, carbazole, and quinaldine.
Probab=58.79 E-value=31 Score=22.35 Aligned_cols=43 Identities=9% Similarity=0.342 Sum_probs=31.4
Q ss_pred HHHHHhcccCCCceEEEEeC-CceeeEEEcCCcEEEEEeCCEEEEEEee
Q psy1421 41 YIKKEFDKKYNPTWHCIVGR-NFGSYVTHETRHFIYFYLGQVAILLFKS 88 (89)
Q Consensus 41 ~iK~~lD~~yG~~Wh~vVG~-~fg~~~th~~~~~~~f~~~~~~~lv~kt 88 (89)
....++++-|...|+.|.-. ... +++.++-+.+++..++|+|.
T Consensus 10 ~~~~e~~~i~~~~W~~v~~~~elp-----~~G~~~~~~i~g~~i~v~r~ 53 (146)
T cd03538 10 IFALEMERLFGNAWIYVGHESQVP-----NPGDYITTRIGDQPVVMVRH 53 (146)
T ss_pred HHHHHHHHHhhcCCEEEEEHHHCC-----CCCCEEEEEECCeeEEEEEC
Confidence 34567788889999986543 342 45778888888888888874
No 18
>PF08958 DUF1871: Domain of unknown function (DUF1871); InterPro: IPR015053 This set of hypothetical proteins is produced by prokaryotes pertaining to the Bacillus genus. ; PDB: 1U84_A.
Probab=58.31 E-value=13 Score=22.18 Aligned_cols=34 Identities=18% Similarity=0.301 Sum_probs=23.3
Q ss_pred HHHHHHHHHHHHhhcCchHHHHHHHHHHhcccCCC
Q psy1421 18 QQDAVDCATQALEKYNIEKDIAAYIKKEFDKKYNP 52 (89)
Q Consensus 18 q~~i~~~~~~a~~~~~~~~~iA~~iK~~lD~~yG~ 52 (89)
..++.+ +..++...++..++|+.|..-|...||.
T Consensus 21 ~~Ei~~-Iv~~v~~~~~~~~LA~~Iq~If~~SF~e 54 (79)
T PF08958_consen 21 DTEIND-IVQAVHENDDPEELAKKIQSIFEFSFGE 54 (79)
T ss_dssp HHHHHH-HHHHHTT-S-HHHHHHHHHHHHHHHHSS
T ss_pred HHHHHH-HHHHHHhCCCHHHHHHHHHHHHHHHHcc
Confidence 344433 4456777778889999999998888874
No 19
>PF11858 DUF3378: Domain of unknown function (DUF3378); InterPro: IPR024568 Ribonuclease HIII 3.1.26 from EC cleaves RNA from DNA-RNA hybrids. It catalyses endonucleolytic cleavage to 5'-phospho-monoesters. Two types of ribonuclease H in Bacillus subtilis, RNase HII (rnhB) and RNase HIII (rnhC), are both known experimentally and are quite similar to each other []. The only RNase H homologue in the Mycoplasmas resembles rnhC. Archaeal forms resemble HII more closely than HIII. This entry represents the uncharacterised N-terminal domain of bacterial RNase HIII.; PDB: 3ASM_A 2D0C_A 2D0B_A 2D0A_A.
Probab=57.76 E-value=15 Score=21.78 Aligned_cols=20 Identities=25% Similarity=0.243 Sum_probs=16.0
Q ss_pred CCcEEEEEeCCEEEEEEeeC
Q psy1421 70 TRHFIYFYLGQVAILLFKSG 89 (89)
Q Consensus 70 ~~~~~~f~~~~~~~lv~kt~ 89 (89)
|+..+.++.++..|.+|+||
T Consensus 29 p~~~f~aK~~~~tIt~Y~SG 48 (81)
T PF11858_consen 29 PYAVFQAKYNGVTITAYKSG 48 (81)
T ss_dssp TTEEEEEEETTEEEEEETTS
T ss_pred CCEEEEEeCCCeEEEEEeCC
Confidence 45555678899999999987
No 20
>cd03472 Rieske_RO_Alpha_BPDO_like Rieske non-heme iron oxygenase (RO) family, Biphenyl dioxygenase (BPDO)-like subfamily, N-terminal Rieske domain of the oxygenase alpha subunit; composed of the oxygenase alpha subunits of BPDO and similar proteins including cumene dioxygenase (CumDO), nitrobenzene dioxygenase (NBDO), alkylbenzene dioxygenase (AkbDO) and dibenzofuran 4,4a-dioxygenase (DFDO). ROs comprise a large class of aromatic ring-hydroxylating dioxygenases that enable microorganisms to tolerate and utilize aromatic compounds for growth. The oxygenase alpha subunit contains an N-terminal Rieske domain with an [2Fe-2S] cluster and a C-terminal catalytic domain with a mononuclear Fe(II) binding site. The Rieske [2Fe-2S] cluster accepts electrons from a reductase or ferredoxin component and transfers them to the mononuclear iron for catalysis. BPDO degrades biphenyls and polychlorinated biphenyls (PCB's) while CumDO degrades cumene (isopropylbenzene), an aromatic hydrocarbon that is i
Probab=57.60 E-value=22 Score=22.53 Aligned_cols=38 Identities=8% Similarity=0.466 Sum_probs=26.6
Q ss_pred hcccCCCceEEEEeC-CceeeEEEcCCcEEEEEeCCEEEEEEee
Q psy1421 46 FDKKYNPTWHCIVGR-NFGSYVTHETRHFIYFYLGQVAILLFKS 88 (89)
Q Consensus 46 lD~~yG~~Wh~vVG~-~fg~~~th~~~~~~~f~~~~~~~lv~kt 88 (89)
|++-|.+.|+.|.=. .+. +++.+..+.+++..++|+|.
T Consensus 1 ~~~i~~~~W~~v~~~~el~-----~~g~~~~~~~~~~~i~l~r~ 39 (128)
T cd03472 1 LERVFARSWLLLGHETHIP-----KAGDYLTTYMGEDPVIVVRQ 39 (128)
T ss_pred CcchhhCCCeEeEEHHHCC-----CCCCEEEEEECCceEEEEEC
Confidence 567788999986544 243 44666667788888888873
No 21
>PF12550 GCR1_C: Transcriptional activator of glycolytic enzymes; InterPro: IPR022210 This domain family is found in eukaryotes, and is approximately 80 amino acids in length. This family is activates the transcription of glycolytic enzymes.
Probab=56.82 E-value=3.7 Score=24.17 Aligned_cols=12 Identities=33% Similarity=1.041 Sum_probs=10.1
Q ss_pred HHhcccCCCceE
Q psy1421 44 KEFDKKYNPTWH 55 (89)
Q Consensus 44 ~~lD~~yG~~Wh 55 (89)
+.|+++||..|.
T Consensus 29 ~~le~~yG~~WR 40 (81)
T PF12550_consen 29 RSLEKKYGSKWR 40 (81)
T ss_pred HHHHHHhChhhc
Confidence 358999999996
No 22
>PF10440 WIYLD: Ubiquitin-binding WIYLD domain; InterPro: IPR018848 This entry represents a presumed domain which has been predicted to contain three alpha helices. It was named the WIYLD domain based on the pattern of the ost conserved residues []. This domain appears to be specific to plant SET-domain proteins. ; GO: 0018024 histone-lysine N-methyltransferase activity
Probab=56.77 E-value=19 Score=20.72 Aligned_cols=41 Identities=24% Similarity=0.434 Sum_probs=26.8
Q ss_pred HHHHHHHHhhcCchHHHHHHHHHHhcccCCCceEEEEeCCc
Q psy1421 22 VDCATQALEKYNIEKDIAAYIKKEFDKKYNPTWHCIVGRNF 62 (89)
Q Consensus 22 ~~~~~~a~~~~~~~~~iA~~iK~~lD~~yG~~Wh~vVG~~f 62 (89)
++.|..|+...+..++.+.-.=+.|=+-||+.|--|=-.+|
T Consensus 11 ~daA~dam~~lG~~~~~v~~vl~~LL~lY~~nW~lIEed~Y 51 (65)
T PF10440_consen 11 IDAALDAMRQLGFSKKQVRPVLKNLLKLYDGNWELIEEDNY 51 (65)
T ss_pred HHHHHHHHHHcCCCHHHHHHHHHHHHHHHcCCchhhhcccH
Confidence 45677888888876655544444455688888976644444
No 23
>PF09868 DUF2095: Uncharacterized protein conserved in archaea (DUF2095); InterPro: IPR018662 This domain, found in various hypothetical prokaryotic proteins, has no known function.
Probab=54.44 E-value=28 Score=22.52 Aligned_cols=20 Identities=20% Similarity=0.280 Sum_probs=16.3
Q ss_pred chHHHHHHHHHHhccc----CCCc
Q psy1421 34 IEKDIAAYIKKEFDKK----YNPT 53 (89)
Q Consensus 34 ~~~~iA~~iK~~lD~~----yG~~ 53 (89)
...+.|+.|+..|.++ ||+.
T Consensus 93 It~e~A~eLr~~L~~kGvr~fG~r 116 (128)
T PF09868_consen 93 ITPEEAKELRSILVKKGVRSFGSR 116 (128)
T ss_pred CCHHHHHHHHHHHHHhhHHHhCCc
Confidence 4578999999999998 6654
No 24
>PF08776 VASP_tetra: VASP tetramerisation domain; InterPro: IPR014885 Vasodilator-stimulated phosphoprotein (VASP) is an actin cytoskeletal regulatory protein. This region corresponds to the tetramerisation domain which forms a right handed alpha helical coiled coil structure []. ; PDB: 1USE_A 1USD_A.
Probab=54.30 E-value=30 Score=18.08 Aligned_cols=32 Identities=22% Similarity=0.483 Sum_probs=22.6
Q ss_pred HHHHHHHHHHHHHHHhhcCchHHHHHHHHHHhcc
Q psy1421 15 EEMQQDAVDCATQALEKYNIEKDIAAYIKKEFDK 48 (89)
Q Consensus 15 ~~mq~~i~~~~~~a~~~~~~~~~iA~~iK~~lD~ 48 (89)
+.++++|++..+.-+++-+ .++-..|++.|-+
T Consensus 6 e~~KqEIL~EvrkEl~K~K--~EIIeA~~~eL~r 37 (40)
T PF08776_consen 6 ERLKQEILEEVRKELQKVK--EEIIEAIRQELSR 37 (40)
T ss_dssp HHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHhc
Confidence 3478888888887777655 4566667777754
No 25
>PF03398 Ist1: Regulator of Vps4 activity in the MVB pathway; InterPro: IPR005061 This is a eukaryotic protein family of unknown function.; PDB: 3GGZ_B 3GGY_B 3FRR_A 3FRS_A.
Probab=53.23 E-value=6.3 Score=26.35 Aligned_cols=42 Identities=12% Similarity=0.294 Sum_probs=29.1
Q ss_pred CCCCHHHHHHHHHHHHHHHhhcCchHHHHHHHHHHhcccCCCce
Q psy1421 11 ADMSEEMQQDAVDCATQALEKYNIEKDIAAYIKKEFDKKYNPTW 54 (89)
Q Consensus 11 ~dM~~~mq~~i~~~~~~a~~~~~~~~~iA~~iK~~lD~~yG~~W 54 (89)
.+.|+++++.|..++-.| ..+++.+|+.. |++.|-.+||+.|
T Consensus 79 k~~p~~l~eAi~siiyAa-~r~~elpEL~~-vr~~l~~kyG~~f 120 (165)
T PF03398_consen 79 KECPPELKEAISSIIYAA-PRCGELPELQE-VRKQLAEKYGKEF 120 (165)
T ss_dssp TSSSCCHHHHHHHHHHHH-HHHTTTCCHHH-HHHHHHCCC-HHH
T ss_pred cCCCHHHHHHHHHHHHHh-hhccCChhHHH-HHHHHHHHhCHHH
Confidence 677888888777766655 33444455555 8899999999765
No 26
>TIGR00237 xseA exodeoxyribonuclease VII, large subunit. This family consist of exodeoxyribonuclease VII, large subunit XseA which catalyses exonucleolytic cleavage in either the 5'-3' or 3'-5' direction to yield 5'-phosphomononucleotides. Exonuclease VII consists of one large subunit and four small subunits.
Probab=52.42 E-value=68 Score=24.59 Aligned_cols=47 Identities=30% Similarity=0.579 Sum_probs=35.7
Q ss_pred HHHHHHHHHHhcccCCCceEEEEeC--CceeeEEEcCCcEEEEEe----CCEEEEEEee
Q psy1421 36 KDIAAYIKKEFDKKYNPTWHCIVGR--NFGSYVTHETRHFIYFYL----GQVAILLFKS 88 (89)
Q Consensus 36 ~~iA~~iK~~lD~~yG~~Wh~vVG~--~fg~~~th~~~~~~~f~~----~~~~~lv~kt 88 (89)
.++...||..||..|+..| |.|+ ||- .| ..-.+||.+ ..+.+.+|++
T Consensus 3 sel~~~ik~~le~~~~~v~--V~GEisn~~---~~-~sGH~YFtLkD~~a~i~~vmf~~ 55 (432)
T TIGR00237 3 SELNAQIKALLEATFLQVW--IQGEISNFT---QP-VSGHWYFTLKDENAQVRCVMFRG 55 (432)
T ss_pred HHHHHHHHHHHHhhCCcEE--EEEEecCCe---eC-CCceEEEEEEcCCcEEEEEEEcC
Confidence 4788999999999999888 6676 565 23 334688987 4678899986
No 27
>PF14900 DUF4493: Domain of unknown function (DUF4493)
Probab=48.34 E-value=55 Score=22.62 Aligned_cols=37 Identities=27% Similarity=0.485 Sum_probs=29.1
Q ss_pred HHhcccCCCceEEEEeCCceeeEEE--cCC---cEEEEEeCC
Q psy1421 44 KEFDKKYNPTWHCIVGRNFGSYVTH--ETR---HFIYFYLGQ 80 (89)
Q Consensus 44 ~~lD~~yG~~Wh~vVG~~fg~~~th--~~~---~~~~f~~~~ 80 (89)
..|.+.|+..|++-|..+-+..++. ... .-.||..+.
T Consensus 119 ~~f~~~f~~~y~vtV~~~~~~~~~~~~~~~~~~~~~Yf~~~~ 160 (235)
T PF14900_consen 119 DEFKKYFGSDYSVTVSTGAGGSVTFNKDETTSDRSAYFKAGE 160 (235)
T ss_pred HHHHhhhccceEEEEEccCCccEEEeeccCCCCcceEEECCC
Confidence 4577778888999999976777777 555 889999877
No 28
>PF06150 ChaB: ChaB; InterPro: IPR009317 This family of proteins contain a conserved 60 residue region. This protein is known as ChaB in Escherichia coli and is found next to ChaA, which is a cation transporter protein. ChaB may be regulate ChaA function in some way.; PDB: 1SG7_A.
Probab=48.20 E-value=47 Score=18.38 Aligned_cols=43 Identities=28% Similarity=0.409 Sum_probs=26.9
Q ss_pred CCCCHHHHHHHHHHHHHHHhhcCchHHHH-----HHHHHHhcccCCCceE
Q psy1421 11 ADMSEEMQQDAVDCATQALEKYNIEKDIA-----AYIKKEFDKKYNPTWH 55 (89)
Q Consensus 11 ~dM~~~mq~~i~~~~~~a~~~~~~~~~iA-----~~iK~~lD~~yG~~Wh 55 (89)
..||..-|.--++....|++.+.++. .| ..+|+...+ -|+.|.
T Consensus 7 ~~LP~~Aq~if~~afn~a~~~~~de~-~A~~vAw~AVk~~Y~k-~~g~W~ 54 (57)
T PF06150_consen 7 EHLPEHAQRIFRKAFNSAWEEYGDEE-RAHRVAWAAVKRKYEK-VNGRWV 54 (57)
T ss_dssp TT--SHHHHHHHHHHHHHHHH--SHH-HHHHHHHHHHHHHEEE-SSS-EE
T ss_pred hHCCHHHHHHHHHHHHHHHHhcCCHh-HHHHHHHHHHHHHhee-cCCEee
Confidence 45677777777777778888886643 33 356999988 677785
No 29
>PF06457 Ectatomin: Ectatomin; InterPro: IPR009458 Ectatomin is a toxin from the venom of the ant Ectatomma tuberculatum. Ectatomin can efficiently insert into the plasma membrane, where it can form channels. Ectatomin was shown to inhibit L-type calcium currents in isolated rat cardiac myocytes []. In these cells, ectatomin induces a gradual, irreversible increase in ion leakage across the membrane, which can lead to cell death. Ectatomin is comprised of two subunits, A and B, which are homologous. The structure of ectatomin reveals that each subunit consists of two alpha helices with a connecting hinge region, which form a hairpin structure that is stabilised by disulphide bridges. A disulphide bridge between the hinge regions of the two subunits links the heterodimer together, forming a closed bundle of four helices with a left-handed twist [].; GO: 0005216 ion channel activity, 0009405 pathogenesis, 0005576 extracellular region; PDB: 1ECI_A.
Probab=47.49 E-value=18 Score=17.93 Aligned_cols=13 Identities=46% Similarity=0.626 Sum_probs=10.4
Q ss_pred HHHHHHHHHHhcc
Q psy1421 36 KDIAAYIKKEFDK 48 (89)
Q Consensus 36 ~~iA~~iK~~lD~ 48 (89)
.++|.+||+..|+
T Consensus 22 g~iat~ik~~c~k 34 (34)
T PF06457_consen 22 GSIATMIKRKCDK 34 (34)
T ss_dssp CCHHHHHHHHCH-
T ss_pred ccHHHHHHHHhCC
Confidence 5799999998774
No 30
>PF08594 UPF0300: Uncharacterised protein family (UPF0300); InterPro: IPR013903 This entry of proteins appear to be specific to Schizosaccharomyces pombe (Fission yeast).
Probab=46.82 E-value=72 Score=22.60 Aligned_cols=58 Identities=14% Similarity=0.217 Sum_probs=36.6
Q ss_pred CCHHHHHHHHHHHHHHHhhcCchHHHHHHHHHHhcccCC------------------CceEEEEeCCceeeEEEcC
Q psy1421 13 MSEEMQQDAVDCATQALEKYNIEKDIAAYIKKEFDKKYN------------------PTWHCIVGRNFGSYVTHET 70 (89)
Q Consensus 13 M~~~mq~~i~~~~~~a~~~~~~~~~iA~~iK~~lD~~yG------------------~~Wh~vVG~~fg~~~th~~ 70 (89)
-|+.+|+.+-..+..-+++........+.-|..-.+++| ..|=+|+.++|-|++--+.
T Consensus 37 Yp~~lq~aLW~AV~~yv~~~v~~~~yt~lh~~Aa~kriG~IRmyLV~P~Diy~v~~~~~W~~I~~k~F~c~I~l~~ 112 (215)
T PF08594_consen 37 YPEFLQEALWKAVEHYVNNNVSSGGYTKLHKRAAQKRIGHIRMYLVDPHDIYSVDHSNSWIAICSKNFMCNIHLDQ 112 (215)
T ss_pred CcHHHHHHHHHHHHHHHHhhcccccchHHHHHHHHHhcCCeEEEEecccceEEecCCccEEEEecCcceEEEEecC
Confidence 366777776665555555433333333444555567776 3599999999999885443
No 31
>PRK14023 homoaconitate hydratase small subunit; Provisional
Probab=46.21 E-value=28 Score=23.46 Aligned_cols=34 Identities=24% Similarity=0.275 Sum_probs=21.3
Q ss_pred chHHHHHHHHHHhc----ccCCCceEEEEeCCceeeEE
Q psy1421 34 IEKDIAAYIKKEFD----KKYNPTWHCIVGRNFGSYVT 67 (89)
Q Consensus 34 ~~~~iA~~iK~~lD----~~yG~~Wh~vVG~~fg~~~t 67 (89)
+..++++++-..+| +++.+..=+|.|+|||+=-|
T Consensus 27 ~~~~l~~~~f~~~~p~f~~~~~~g~IIVaG~NFG~GSS 64 (166)
T PRK14023 27 GEDRFHNYAFAHLRPEFASTVRPGDILVAGRNFGLGSS 64 (166)
T ss_pred CHHHHHhhhccCCChhhHhhcCCCCEEEccCcccCCcc
Confidence 34556666655555 44556677788889996444
No 32
>PF00352 TBP: Transcription factor TFIID (or TATA-binding protein, TBP); InterPro: IPR000814 The TATA-box binding protein (TBP) is required for the initiation of transcription by RNA polymerases I, II and III, from promoters with or without a TATA box [, ]. TBP associates with a host of factors, including the general transcription factors TFIIA, -B, -D, -E, and -H, to form huge multi-subunit pre-initiation complexes on the core promoter. Through its association with different transcription factors, TBP can initiate transcription from different RNA polymerases. There are several related TBPs, including TBP-like (TBPL) proteins []. The C-terminal core of TBP (~180 residues) is highly conserved and contains two 77-amino acid repeats that produce a saddle-shaped structure that straddles the DNA; this region binds to the TATA box and interacts with transcription factors and regulatory proteins []. By contrast, the N-terminal region varies in both length and sequence.; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent, 0006367 transcription initiation from RNA polymerase II promoter; PDB: 1D3U_A 1PCZ_B 1AIS_A 1NGM_A 1TBP_A 1TBA_B 1YTB_A 1RM1_A 1YTF_A 1NH2_A ....
Probab=46.11 E-value=29 Score=20.36 Aligned_cols=25 Identities=28% Similarity=0.467 Sum_probs=17.0
Q ss_pred eEEEcCCc--EEEEEe--CCEEEEEEeeC
Q psy1421 65 YVTHETRH--FIYFYL--GQVAILLFKSG 89 (89)
Q Consensus 65 ~~th~~~~--~~~f~~--~~~~~lv~kt~ 89 (89)
...|+|+. .+.++. ....++||+||
T Consensus 30 ~~~YePe~fpgl~~r~~~p~~t~~IF~sG 58 (86)
T PF00352_consen 30 NVEYEPERFPGLIYRLRNPKATVLIFSSG 58 (86)
T ss_dssp TEEEETTTESSEEEEETTTTEEEEEETTS
T ss_pred CcEEeeccCCeEEEeecCCcEEEEEEcCC
Confidence 35777774 345665 47788888887
No 33
>COG0655 WrbA Multimeric flavodoxin WrbA [General function prediction only]
Probab=45.94 E-value=17 Score=24.70 Aligned_cols=56 Identities=14% Similarity=0.261 Sum_probs=33.7
Q ss_pred CCCCHHHHHHHHHHHHHH--Hh-hcC-chHHHHHHHHHHhcccCCCceE--EEEeCCceeeE
Q psy1421 11 ADMSEEMQQDAVDCATQA--LE-KYN-IEKDIAAYIKKEFDKKYNPTWH--CIVGRNFGSYV 66 (89)
Q Consensus 11 ~dM~~~mq~~i~~~~~~a--~~-~~~-~~~~iA~~iK~~lD~~yG~~Wh--~vVG~~fg~~~ 66 (89)
+.+++++-+++.+...+| +- -.+ -...++..+|..||+-+++.|. -.-|+-.+..+
T Consensus 59 c~~~dD~~~~i~~~l~~aD~iI~gsPvy~g~vsa~~K~fiDR~~~~~~~~~~l~~k~~~~~~ 120 (207)
T COG0655 59 CVIKDDDMNEIYEKLLEADGIIFGSPVYFGNVSAQMKAFIDRSTGPLWAPGALRGKVGAAFV 120 (207)
T ss_pred CCCCcccHHHHHHHHHHCCEEEEeCCeecCCchHHHHHHHhhcchhhcccchhccccceEEE
Confidence 555666566666655544 00 001 1346888999999998899988 34444444333
No 34
>PF10045 DUF2280: Uncharacterized conserved protein (DUF2280); InterPro: IPR018738 This entry is represented by Burkholderia phage Bups phi1, Orf2.36. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.
Probab=45.19 E-value=59 Score=20.46 Aligned_cols=32 Identities=22% Similarity=0.404 Sum_probs=23.2
Q ss_pred CCHHHHHHHHHHHHHHHhhcCchHHHHHHHHHHhcc
Q psy1421 13 MSEEMQQDAVDCATQALEKYNIEKDIAAYIKKEFDK 48 (89)
Q Consensus 13 M~~~mq~~i~~~~~~a~~~~~~~~~iA~~iK~~lD~ 48 (89)
++++++..| ++++--|++..++|+.+|++|.-
T Consensus 4 L~~~vK~FI----VQ~LAcfdTPs~v~~aVk~eFgi 35 (104)
T PF10045_consen 4 LKKEVKAFI----VQSLACFDTPSEVAEAVKEEFGI 35 (104)
T ss_pred ccHHHHHHH----HHHHHhhCCHHHHHHHHHHHhCC
Confidence 445555554 45777888888999999998853
No 35
>cd00197 VHS_ENTH_ANTH VHS, ENTH and ANTH domain superfamily; composed of proteins containing a VHS, ENTH or ANTH domain. The VHS domain is present in Vps27 (Vacuolar Protein Sorting), Hrs (Hepatocyte growth factor-regulated tyrosine kinase substrate) and STAM (Signal Transducing Adaptor Molecule). It is located at the N-termini of proteins involved in intracellular membrane trafficking. The epsin N-terminal homology (ENTH) domain is an evolutionarily conserved protein module found primarily in proteins that participate in clathrin-mediated endocytosis. A set of proteins previously designated as harboring an ENTH domain in fact contains a highly similar, yet unique module referred to as an AP180 N-terminal homology (ANTH) domain. VHS, ENTH and ANTH domains are structurally similar and are composed of a superhelix of eight alpha helices. ENTH adnd ANTH (E/ANTH) domains bind both inositol phospholipids and proteins and contribute to the nucleation and formation of clathrin coats on membra
Probab=44.92 E-value=11 Score=23.02 Aligned_cols=36 Identities=19% Similarity=0.129 Sum_probs=21.8
Q ss_pred HHHHHHHHHHHhhcCchHHHHHHHHHHhcccCCCceEEE
Q psy1421 19 QDAVDCATQALEKYNIEKDIAAYIKKEFDKKYNPTWHCI 57 (89)
Q Consensus 19 ~~i~~~~~~a~~~~~~~~~iA~~iK~~lD~~yG~~Wh~v 57 (89)
..+.++|.-.-..-...+++++.|++.|+.+ .|+++
T Consensus 19 ~~i~~i~d~~~~~~~~~~~~~~~l~kRl~~~---~~~~~ 54 (115)
T cd00197 19 PLIMEICDLINETNVGPKEAVDAIKKRINNK---NPHVV 54 (115)
T ss_pred HHHHHHHHHHHCCCccHHHHHHHHHHHhcCC---cHHHH
Confidence 3344444433222233578999999999875 67654
No 36
>PF02283 CobU: Cobinamide kinase / cobinamide phosphate guanyltransferase; InterPro: IPR003203 This family is composed of a group of bifunctional cobalbumin biosynthesis enzymes which display cobinamide kinase and cobinamide phosphate guanyltransferase activity. The crystal structure of the enzyme reveals the molecule to be a trimer with a propeller-like shape [].; GO: 0000166 nucleotide binding, 0043752 adenosylcobinamide kinase activity, 0051188 cofactor biosynthetic process; PDB: 1CBU_C 1C9K_B.
Probab=43.80 E-value=30 Score=23.06 Aligned_cols=25 Identities=12% Similarity=0.345 Sum_probs=21.2
Q ss_pred hHHHHHHHHHHhcccCCCceEEEEeC
Q psy1421 35 EKDIAAYIKKEFDKKYNPTWHCIVGR 60 (89)
Q Consensus 35 ~~~iA~~iK~~lD~~yG~~Wh~vVG~ 60 (89)
+.|+++.|+++-.++ |..|+.+=-.
T Consensus 36 D~em~~RI~~H~~~R-~~~w~tiE~~ 60 (167)
T PF02283_consen 36 DEEMRERIARHRQRR-PKGWITIEEP 60 (167)
T ss_dssp HHHHHHHHHHHHHHS-STCEEEEE-S
T ss_pred CHHHHHHHHHHHHhC-CCCcEEEecc
Confidence 568999999999999 9999998653
No 37
>TIGR03515 GldC gliding motility-associated protein GldC. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldC is a protein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae and Cytophaga hutchinsonii. Knockouts of GldC do not abolish the gliding phenotype but do impair it. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility.
Probab=43.42 E-value=71 Score=20.19 Aligned_cols=42 Identities=19% Similarity=0.179 Sum_probs=24.3
Q ss_pred cEEEeCCCC-HHHHHHHHHHHHHH---HhhcCchHHHHHHHHHHhc
Q psy1421 6 AVIKNADMS-EEMQQDAVDCATQA---LEKYNIEKDIAAYIKKEFD 47 (89)
Q Consensus 6 ~~v~~~dM~-~~mq~~i~~~~~~a---~~~~~~~~~iA~~iK~~lD 47 (89)
+-.=--||| +||+....+..... +.+-+.++.+|..|+...|
T Consensus 53 IDLWTKdMpVdEMK~Ff~qtl~~maDT~~rAT~d~kma~~m~dfcd 98 (108)
T TIGR03515 53 IDLWTKDMPVDEMKRFFIETLGGMADTFLNATGDEKMAEDIKDLCD 98 (108)
T ss_pred EEeecCcCcHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHH
Confidence 334456888 56887776655433 2233445557666666554
No 38
>PRK13007 succinyl-diaminopimelate desuccinylase; Reviewed
Probab=43.37 E-value=1.2e+02 Score=21.68 Aligned_cols=39 Identities=13% Similarity=0.169 Sum_probs=28.8
Q ss_pred CCHHHHHHHHHHHHHHHhhcC---chHHHHHHHHHHhcccCC
Q psy1421 13 MSEEMQQDAVDCATQALEKYN---IEKDIAAYIKKEFDKKYN 51 (89)
Q Consensus 13 M~~~mq~~i~~~~~~a~~~~~---~~~~iA~~iK~~lD~~yG 51 (89)
|+.+...++++...+.++-.+ .+.++|++|++.|++.+|
T Consensus 1 ~~~~~~~~~~~~l~~li~ips~s~~e~~~~~~l~~~l~~~~~ 42 (352)
T PRK13007 1 MTLDLAADLAELTAALVDIPSVSGDEKALADAVEAALRALPH 42 (352)
T ss_pred CccchHHHHHHHHHHHhcCCCCCchHHHHHHHHHHHHHhCcC
Confidence 455566788888888776543 367899999999998634
No 39
>cd03545 Rieske_RO_Alpha_OHBDO_like Rieske non-heme iron oxygenase (RO) family, Ortho-halobenzoate-1,2-dioxygenase (OHBDO)-like subfamily, N-terminal Rieske domain of the oxygenase alpha subunit; composed of the oxygenase alpha subunits of OHBDO, salicylate 5-hydroxylase (S5H), terephthalate 1,2-dioxygenase system (TERDOS) and similar proteins. ROs comprise a large class of aromatic ring-hydroxylating dioxygenases that enable microorganisms to tolerate and utilize aromatic compounds for growth. The oxygenase alpha subunit contains an N-terminal Rieske domain with an [2Fe-2S] cluster and a C-terminal catalytic domain with a mononuclear Fe(II) binding site. The Rieske [2Fe-2S] cluster accepts electrons from a reductase or ferredoxin component and transfers them to the mononuclear iron for catalysis. OHBDO converts 2-chlorobenzoate (2-CBA) to catechol as well as 2,4-dCBA and 2,5-dCBA to 4-chlorocatechol, as part of the chlorobenzoate degradation pathway. Although ortho-substituted chlorobe
Probab=43.07 E-value=92 Score=20.23 Aligned_cols=44 Identities=16% Similarity=0.449 Sum_probs=31.6
Q ss_pred HHHHHHhcccCC-CceEEEEeCC-ceeeEEEcCCcEEEEEeCCEEEEEEee
Q psy1421 40 AYIKKEFDKKYN-PTWHCIVGRN-FGSYVTHETRHFIYFYLGQVAILLFKS 88 (89)
Q Consensus 40 ~~iK~~lD~~yG-~~Wh~vVG~~-fg~~~th~~~~~~~f~~~~~~~lv~kt 88 (89)
+....++++-|. ..|+.+.-.+ .. +++.++-+.+++..++|+|.
T Consensus 11 ~~~~~E~~~if~~~~W~~v~~~~el~-----~~g~~~~~~i~g~~iiv~r~ 56 (150)
T cd03545 11 AYFDREQERIFRGKTWSYVGLEAEIP-----NAGDFKSTFVGDTPVVVTRA 56 (150)
T ss_pred HHHHHHHHhhhCCCceEEEEEHHHCC-----CCCCEEEEEECCceEEEEEC
Confidence 345678888895 9999987653 32 45677778888877888763
No 40
>PF08908 DUF1852: Domain of unknown function (DUF1852); InterPro: IPR015004 This group of proteins are functionally uncharacterised.
Probab=42.25 E-value=13 Score=27.31 Aligned_cols=23 Identities=48% Similarity=0.615 Sum_probs=20.6
Q ss_pred HhcccCCCceEEEEeCCceeeEE
Q psy1421 45 EFDKKYNPTWHCIVGRNFGSYVT 67 (89)
Q Consensus 45 ~lD~~yG~~Wh~vVG~~fg~~~t 67 (89)
-+|++-|..=.-|||.+|.|+|.
T Consensus 98 IvD~kt~~rieGivGNnFSSYVR 120 (322)
T PF08908_consen 98 IVDHKTNERIEGIVGNNFSSYVR 120 (322)
T ss_pred EEecCCCceecceeccccccccc
Confidence 47899999999999999999885
No 41
>PF06057 VirJ: Bacterial virulence protein (VirJ); InterPro: IPR010333 This entry contains several bacterial VirJ virulence proteins. VirJ is thought to be involved in the type IV secretion system. It is thought that the substrate proteins localised to the periplasm may associate with the pilus in a manner that is mediated by VirJ, and suggest a two-step process for type IV secretion in Agrobacterium [].
Probab=42.02 E-value=30 Score=23.97 Aligned_cols=29 Identities=24% Similarity=0.453 Sum_probs=17.3
Q ss_pred HHHHHHHHHhcccCCCceEEEEeCCceeeE
Q psy1421 37 DIAAYIKKEFDKKYNPTWHCIVGRNFGSYV 66 (89)
Q Consensus 37 ~iA~~iK~~lD~~yG~~Wh~vVG~~fg~~~ 66 (89)
++++.|..++ +++|..==+.||.|||+.+
T Consensus 53 Dl~~~i~~y~-~~w~~~~vvLiGYSFGADv 81 (192)
T PF06057_consen 53 DLARIIRHYR-ARWGRKRVVLIGYSFGADV 81 (192)
T ss_pred HHHHHHHHHH-HHhCCceEEEEeecCCchh
Confidence 3444333333 3445444489999999854
No 42
>PF06925 MGDG_synth: Monogalactosyldiacylglycerol (MGDG) synthase; InterPro: IPR009695 This entry represents a conserved region of approximately 180 residues found towirds the N terminus of a number of plant and bacterial diacylglycerol glucosyltransferases, such as monogalactosyldiacylglycerol synthase [].; GO: 0016758 transferase activity, transferring hexosyl groups, 0009247 glycolipid biosynthetic process
Probab=42.00 E-value=32 Score=22.39 Aligned_cols=27 Identities=11% Similarity=0.140 Sum_probs=23.0
Q ss_pred HHHHHHHHHhcccCCCceEEEEeCCce
Q psy1421 37 DIAAYIKKEFDKKYNPTWHCIVGRNFG 63 (89)
Q Consensus 37 ~iA~~iK~~lD~~yG~~Wh~vVG~~fg 63 (89)
..|+.|++.|.+.||+...|.|..-|.
T Consensus 3 ~aA~Al~eal~~~~~~~~~v~v~D~~~ 29 (169)
T PF06925_consen 3 SAARALAEALERRRGPDAEVEVVDFLE 29 (169)
T ss_pred HHHHHHHHHHHhhcCCCCEEEEEehHH
Confidence 578999999999999999999887443
No 43
>KOG2451|consensus
Probab=41.16 E-value=62 Score=25.35 Aligned_cols=41 Identities=32% Similarity=0.439 Sum_probs=30.1
Q ss_pred cEEEeCCCCHHHHHHHHHHHHHHHhhcCc--hHHHHHHHHHHh
Q psy1421 6 AVIKNADMSEEMQQDAVDCATQALEKYNI--EKDIAAYIKKEF 46 (89)
Q Consensus 6 ~~v~~~dM~~~mq~~i~~~~~~a~~~~~~--~~~iA~~iK~~l 46 (89)
++=+-.||+-+.-+++++.|.+|++.+.+ -+|-++.|+++.
T Consensus 51 ii~~V~~~~V~e~~kAI~aA~EaF~s~~~~takeRs~lLrkwy 93 (503)
T KOG2451|consen 51 IIGKVADMTVEEAEKAIDAAYEAFKSYRNLTAKERSALLRKWY 93 (503)
T ss_pred hhhcccCCcHHHHHHHHHHHHHHHHHHhhCCHHHHHHHHHHHH
Confidence 44467899999999999999999998754 344444444443
No 44
>TIGR03147 cyt_nit_nrfF cytochrome c nitrite reductase, accessory protein NrfF.
Probab=41.12 E-value=16 Score=23.69 Aligned_cols=12 Identities=8% Similarity=0.155 Sum_probs=7.8
Q ss_pred HHHHHhcccCCC
Q psy1421 41 YIKKEFDKKYNP 52 (89)
Q Consensus 41 ~iK~~lD~~yG~ 52 (89)
.|+.+|-++||.
T Consensus 78 eI~~~~v~RYG~ 89 (126)
T TIGR03147 78 QIIDFMTARFGD 89 (126)
T ss_pred HHHHHHHHhcCC
Confidence 466667777773
No 45
>PRK09111 DNA polymerase III subunits gamma and tau; Validated
Probab=40.96 E-value=31 Score=27.75 Aligned_cols=29 Identities=21% Similarity=0.431 Sum_probs=25.6
Q ss_pred hHHHHHHHHHHhcccCCCceEEEEeCCce
Q psy1421 35 EKDIAAYIKKEFDKKYNPTWHCIVGRNFG 63 (89)
Q Consensus 35 ~~~iA~~iK~~lD~~yG~~Wh~vVG~~fg 63 (89)
.++++..|++.|.+-.|..|.|.|.+..|
T Consensus 504 ~~~~~~~l~~~l~~~t~~~w~v~~~~~~~ 532 (598)
T PRK09111 504 PRDLAQRLARKLEEWTGRRWVVSVSREGG 532 (598)
T ss_pred ChHHHHHHHHHHHHHhCCceEEEEecCCC
Confidence 36899999999999999999999987655
No 46
>KOG3248|consensus
Probab=40.30 E-value=17 Score=27.70 Aligned_cols=12 Identities=33% Similarity=0.393 Sum_probs=8.2
Q ss_pred HHHHHHHHHHHH
Q psy1421 15 EEMQQDAVDCAT 26 (89)
Q Consensus 15 ~~mq~~i~~~~~ 26 (89)
.||+.++++.|.
T Consensus 204 KEmRa~vvaEct 215 (421)
T KOG3248|consen 204 KEMRAKVVAECT 215 (421)
T ss_pred HHHHHHHHHHhh
Confidence 467777777665
No 47
>PF07742 BTG: BTG family; InterPro: IPR002087 Anti-proliferative proteins have been shown to include mammalian and avian protein BTG1 (which appears to be involved in negative regulation of cell proliferation) and rat/mouse NGF-inducible protein PC3/TIS21 (BTG2) [, , ]. These proteins have from 158 to 363 amino acid residues, that are highly similar and include 3 conserved cysteine residues. BTG2 seems to have a signal sequence; while the other proteins may lack such a domain. The sequence of the N-terminal half of these proteins is well conserved.; PDB: 3DJU_B 3E9V_A 2Z15_D 2D5R_B 3DJN_B.
Probab=40.01 E-value=55 Score=20.76 Aligned_cols=19 Identities=16% Similarity=0.419 Sum_probs=14.9
Q ss_pred HHHHHHHHHhcccCCCceE
Q psy1421 37 DIAAYIKKEFDKKYNPTWH 55 (89)
Q Consensus 37 ~iA~~iK~~lD~~yG~~Wh 55 (89)
..++.|.+.|-++|.+.|+
T Consensus 29 ~F~~~L~~~L~~ry~~HW~ 47 (118)
T PF07742_consen 29 RFAEELENLLCERYKGHWY 47 (118)
T ss_dssp HHHHHHHHHHHHHHTTS--
T ss_pred HHHHHHHHHHHHHHhCCCC
Confidence 4688899999999999997
No 48
>PF13798 PCYCGC: Protein of unknown function with PCYCGC motif
Probab=39.01 E-value=12 Score=25.17 Aligned_cols=34 Identities=29% Similarity=0.382 Sum_probs=22.8
Q ss_pred HHHHHHHHHHHHhhcCchHHHHHHHHHHhcccCCC
Q psy1421 18 QQDAVDCATQALEKYNIEKDIAAYIKKEFDKKYNP 52 (89)
Q Consensus 18 q~~i~~~~~~a~~~~~~~~~iA~~iK~~lD~~yG~ 52 (89)
....++++..++..+..-+. ...|++..|++|..
T Consensus 114 C~vCl~ia~~a~~~~~~Gks-~~eIR~~ID~kYk~ 147 (158)
T PF13798_consen 114 CGVCLDIAVQAVQMYQEGKS-PKEIRQYIDEKYKE 147 (158)
T ss_pred cHHHHHHHHHHHHHHHcCCC-HHHHHHHHHHHHHh
Confidence 34456777777766654443 45589999999864
No 49
>PF03564 DUF1759: Protein of unknown function (DUF1759); InterPro: IPR005312 This is a small family of proteins of unknown function.
Probab=38.83 E-value=60 Score=20.38 Aligned_cols=43 Identities=7% Similarity=0.166 Sum_probs=26.0
Q ss_pred CCCCHHHHHHHHHHH-----HHHHhhcCchHHHHHHHHHHhcccCCCc
Q psy1421 11 ADMSEEMQQDAVDCA-----TQALEKYNIEKDIAAYIKKEFDKKYNPT 53 (89)
Q Consensus 11 ~dM~~~mq~~i~~~~-----~~a~~~~~~~~~iA~~iK~~lD~~yG~~ 53 (89)
.++++..+-..+..+ .+++..+....+--+...+.|+++||..
T Consensus 23 ~~~~d~~K~~~L~~~L~G~A~~~i~~~~~~~~~Y~~a~~~L~~~yg~~ 70 (145)
T PF03564_consen 23 PDLSDIEKLNYLRSCLKGEAKELIRGLPLSEENYEEAWELLEERYGNP 70 (145)
T ss_pred cCCCHHHHHHHHHHHhcchHHHHHHcccccchhhHHHHHHHHHHhCCc
Confidence 678888777766644 3445544433333344566788888864
No 50
>PF11307 DUF3109: Protein of unknown function (DUF3109); InterPro: IPR021458 This bacterial family of proteins has no known function.
Probab=38.64 E-value=9.6 Score=26.28 Aligned_cols=31 Identities=13% Similarity=0.273 Sum_probs=21.5
Q ss_pred HHHHHHhhcCch-HHHHHHHHHHhcccCCCce
Q psy1421 24 CATQALEKYNIE-KDIAAYIKKEFDKKYNPTW 54 (89)
Q Consensus 24 ~~~~a~~~~~~~-~~iA~~iK~~lD~~yG~~W 54 (89)
++..|...-+.. --+.++||.-|-++||.-|
T Consensus 145 iC~~A~~~G~el~vpvy~Flk~pLir~fG~~w 176 (183)
T PF11307_consen 145 ICSDACSLGKELGVPVYKFLKEPLIRKFGEEW 176 (183)
T ss_pred hCHHHHHhchhhCCcHHHHhHHHHHHHHhHHH
Confidence 445554432221 1378999999999999998
No 51
>PF01726 LexA_DNA_bind: LexA DNA binding domain; InterPro: IPR006199 This is the DNA binding domain of the LexA SOS regulon repressor which prevents expression of DNA repair proteins in bacteria. The aligned region contains a variant form of the helix-turn-helix DNA binding motif []. This domain usually at the N terminus is found associated with IPR006198 from INTERPRO the auto-proteolytic domain of LexA 3.4.21.88 from EC.; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 1LEA_A 1JHH_A 3JSP_A 1JHF_A 3JSO_B 1LEB_A 3K2Z_A.
Probab=37.71 E-value=77 Score=17.80 Aligned_cols=28 Identities=18% Similarity=0.329 Sum_probs=19.4
Q ss_pred HHHHHHHHHHHHHHHhhcC---chHHHHHHH
Q psy1421 15 EEMQQDAVDCATQALEKYN---IEKDIAAYI 42 (89)
Q Consensus 15 ~~mq~~i~~~~~~a~~~~~---~~~~iA~~i 42 (89)
.+.|.+|++.+.+-+.++. +..|||+.+
T Consensus 5 T~rQ~~vL~~I~~~~~~~G~~Pt~rEIa~~~ 35 (65)
T PF01726_consen 5 TERQKEVLEFIREYIEENGYPPTVREIAEAL 35 (65)
T ss_dssp -HHHHHHHHHHHHHHHHHSS---HHHHHHHH
T ss_pred CHHHHHHHHHHHHHHHHcCCCCCHHHHHHHh
Confidence 3589999999988877754 356676654
No 52
>PF05184 SapB_1: Saposin-like type B, region 1; InterPro: IPR007856 Synonym(s):cerebroside sulphate activator, CSAct Saposin B is a small non-enzymatic glycoprotein required for the breakdown of cerebroside sulphates (sulphatides) in lysosomes. Saposin B contains three intramolecular disulphide bridges, exists as a dimer and is remarkably heat, protease, and pH stable. The crystal structure of human saposin B reveals an unusual shell-like dimer consisting of a monolayer of alpha-helices enclosing a large hydrophobic cavity. Although the secondary structure of saposin B is similar to that of the known monomeric members of the saposin-like superfamily, the helices are repacked into a different tertiary arrangement to form the homodimer. A comparison of the two forms of the saposin B dimer suggests that extraction of target lipids from membranes involves a conformational change that facilitates access to the inner cavity [].; GO: 0006629 lipid metabolic process; PDB: 1N69_C 1QDM_C 4DDJ_A 2DOB_A 1OF9_A 2Z9A_A 1M12_A 2GTG_A 1SN6_A 2QYP_B ....
Probab=37.71 E-value=53 Score=15.89 Aligned_cols=31 Identities=16% Similarity=0.216 Sum_probs=22.7
Q ss_pred HHHHHHHHHHHHHHhhcCchHHHHHHHHHHh
Q psy1421 16 EMQQDAVDCATQALEKYNIEKDIAAYIKKEF 46 (89)
Q Consensus 16 ~mq~~i~~~~~~a~~~~~~~~~iA~~iK~~l 46 (89)
+..+.++..+...++...++.+|.+.|++.-
T Consensus 5 ~~C~~~v~~i~~~l~~~~t~~~I~~~l~~~C 35 (39)
T PF05184_consen 5 DICKFVVKEIEKLLKNNKTEEEIKKALEKAC 35 (39)
T ss_dssp HHHHHHHHHHHHHHHSTCHHHHHHHHHHHHH
T ss_pred hHHHHHHHHHHHHHHcCccHHHHHHHHHHHH
Confidence 3456778888888887777888888777654
No 53
>PRK10667 Hha toxicity attenuator; Provisional
Probab=37.03 E-value=58 Score=20.97 Aligned_cols=20 Identities=25% Similarity=0.486 Sum_probs=17.9
Q ss_pred hcCchHHHHHHHHHHhcccC
Q psy1421 31 KYNIEKDIAAYIKKEFDKKY 50 (89)
Q Consensus 31 ~~~~~~~iA~~iK~~lD~~y 50 (89)
||+++.+++..|-++||++|
T Consensus 65 KYp~~~~l~~~ideYLDeTy 84 (122)
T PRK10667 65 KYPEDSKLIEQIDEYLDDTY 84 (122)
T ss_pred cCCcHhhHHHHHHHHHHHHH
Confidence 67788899999999999987
No 54
>COG1352 CheR Methylase of chemotaxis methyl-accepting proteins [Cell motility and secretion / Signal transduction mechanisms]
Probab=36.44 E-value=51 Score=23.85 Aligned_cols=47 Identities=11% Similarity=0.254 Sum_probs=31.5
Q ss_pred CccEEEeCCCCHHHHHHHHHHHHHHHhhcCc---hHHH-HHHHHHHhcccCCCceEE
Q psy1421 4 RKAVIKNADMSEEMQQDAVDCATQALEKYNI---EKDI-AAYIKKEFDKKYNPTWHC 56 (89)
Q Consensus 4 ~~~~v~~~dM~~~mq~~i~~~~~~a~~~~~~---~~~i-A~~iK~~lD~~yG~~Wh~ 56 (89)
..+.|..+|+++..-+.| +.+ .|+. .+.+ .+.++++|++.-|+.|.+
T Consensus 128 ~~~~I~AtDId~~~L~~A----~~G--~Y~~~~~~~~~~~~~~~ryF~~~~~~~y~v 178 (268)
T COG1352 128 FRVKILATDIDLSVLEKA----RAG--IYPSRELLRGLPPELLRRYFERGGDGSYRV 178 (268)
T ss_pred CceEEEEEECCHHHHHHH----hcC--CCChhHhhccCCHHHHhhhEeecCCCcEEE
Confidence 478999999999755544 333 2331 1222 466789999998887775
No 55
>TIGR00294 conserved hypothetical protein TIGR00294.
Probab=35.63 E-value=64 Score=23.60 Aligned_cols=39 Identities=15% Similarity=0.068 Sum_probs=31.5
Q ss_pred HHHHHHHHHHhcccCC--CceEEEEeC--CceeeEEEcCCcEE
Q psy1421 36 KDIAAYIKKEFDKKYN--PTWHCIVGR--NFGSYVTHETRHFI 74 (89)
Q Consensus 36 ~~iA~~iK~~lD~~yG--~~Wh~vVG~--~fg~~~th~~~~~~ 74 (89)
.|+++.|+..|.+.++ |.|-.|-.+ ||.|-..|..-..+
T Consensus 251 ED~vR~ia~~l~~~~~~~~~~~~~~v~~~~~ESIH~H~A~A~~ 293 (308)
T TIGR00294 251 EDCVRLMAARLVELFPHLPDDTEVECRQINEESIHRHNAFAEK 293 (308)
T ss_pred HHHHHHHHHHHHhccCCCCCceEEEEEecCcccccCCceeEEE
Confidence 4899999999999976 768888888 99998877655444
No 56
>cd01578 AcnA_Mitochon_Swivel Mitochondrial aconitase A swivel domain. Aconitase (also known as aconitate hydratase and citrate hydro-lyase) catalyzes the reversible isomerization of citrate and isocitrate as part of the TCA cycle. This is the aconitase swivel domain, which undergoes swivelling conformational change in the enzyme mechanism. In eukaryotes two isozymes of aconitase are known to exist: one found in the mitochondrial matrix and the other found in the cytoplasm. This is the mitochondrial form. The mitochondrial product is coded by a nuclear gene. Most members of this subfamily are mitochondrial but there are some bacterial members.
Probab=35.29 E-value=23 Score=23.67 Aligned_cols=18 Identities=22% Similarity=0.396 Sum_probs=13.6
Q ss_pred CCCceEEEEeCCceeeEE
Q psy1421 50 YNPTWHCIVGRNFGSYVT 67 (89)
Q Consensus 50 yG~~Wh~vVG~~fg~~~t 67 (89)
-|..|-+|.|++||+=-|
T Consensus 67 ~g~~~iIVaG~nyG~GSS 84 (149)
T cd01578 67 HGIKWVVIGDENYGEGSS 84 (149)
T ss_pred cCCCeEEEccCccCCCCc
Confidence 466899999999885433
No 57
>PF03918 CcmH: Cytochrome C biogenesis protein; InterPro: IPR005616 Members of this family include NrfF, CcmH, CycL, Ccl2.; PDB: 2KW0_A 2HL7_A.
Probab=34.70 E-value=21 Score=23.58 Aligned_cols=34 Identities=15% Similarity=0.299 Sum_probs=20.3
Q ss_pred CCCCHHHHHHHHHHHHHHHhhcCchHHHHHHHHHHhcccCCC
Q psy1421 11 ADMSEEMQQDAVDCATQALEKYNIEKDIAAYIKKEFDKKYNP 52 (89)
Q Consensus 11 ~dM~~~mq~~i~~~~~~a~~~~~~~~~iA~~iK~~lD~~yG~ 52 (89)
+++..+|+.+|.+...++..+ +.|+++|=++||.
T Consensus 56 a~~A~dmR~~I~~~l~~G~s~--------~eI~~~~v~rYG~ 89 (148)
T PF03918_consen 56 APIARDMRREIREMLAEGKSD--------EEIIDYFVERYGE 89 (148)
T ss_dssp SHHHHHHHHHHHHHHHHT--H--------HHHHHHHHHHHTT
T ss_pred cHHHHHHHHHHHHHHHcCCCH--------HHHHHHHHHhcCc
Confidence 444566666666666554331 4577778888874
No 58
>PF06884 DUF1264: Protein of unknown function (DUF1264); InterPro: IPR010686 This family contains a number of bacterial and eukaryotic proteins of unknown function that are approximately 200 residues long. Some family members are annotated as putative lipoproteins.
Probab=34.58 E-value=32 Score=23.49 Aligned_cols=19 Identities=26% Similarity=0.476 Sum_probs=13.7
Q ss_pred HHHHHHHHhcccCCCceEE
Q psy1421 38 IAAYIKKEFDKKYNPTWHC 56 (89)
Q Consensus 38 iA~~iK~~lD~~yG~~Wh~ 56 (89)
+.+.+-+.+-+.||.+||-
T Consensus 98 ae~~~m~~l~~tYGKt~Ht 116 (171)
T PF06884_consen 98 AEKAEMEKLVKTYGKTWHT 116 (171)
T ss_pred HHHHHHHHHHhhhCCeEEe
Confidence 3445566677899999984
No 59
>PF01739 CheR: CheR methyltransferase, SAM binding domain; InterPro: IPR022642 Methyl transfer from the ubiquitous S-adenosyl-L-methionine (AdoMet) to either nitrogen, oxygen or carbon atoms is frequently employed in diverse organisms ranging from bacteria to plants and mammals. The reaction is catalysed by methyltransferases (Mtases) and modifies DNA, RNA, proteins and small molecules, such as catechol for regulatory purposes. The various aspects of the role of DNA methylation in prokaryotic restriction-modification systems and in a number of cellular processes in eukaryotes including gene regulation and differentiation is well documented. Three classes of DNA Mtases transfer the methyl group from AdoMet to the target base to form either N-6-methyladenine, or N-4-methylcytosine, or C-5- methylcytosine. In C-5-cytosine Mtases, ten conserved motifs are arranged in the same order []. Motif I (a glycine-rich or closely related consensus sequence; FAGxGG in M.HhaI []), shared by other AdoMet-Mtases [], is part of the cofactor binding site and motif IV (PCQ) is part of the catalytic site. In contrast, sequence comparison among N-6-adenine and N-4-cytosine Mtases indicated two of the conserved segments [], although more conserved segments may be present. One of them corresponds to motif I in C-5-cytosine Mtases, and the other is named (D/N/S)PP(Y/F). Crystal structures are known for a number of Mtases [, , , ]. The cofactor binding sites are almost identical and the essential catalytic amino acids coincide. The comparable protein folding and the existence of equivalent amino acids in similar secondary and tertiary positions indicate that many (if not all) AdoMet-Mtases have a common catalytic domain structure. This permits tertiary structure prediction of other DNA, RNA, protein, and small-molecule AdoMet-Mtases from their amino acid sequences []. Flagellated bacteria swim towards favourable chemicals and away from deleterious ones. Sensing of chemoeffector gradients involves chemotaxis receptors, transmembrane (TM) proteins that detect stimuli through their periplasmic domains and transduce the signals via their cytoplasmic domains []. Signalling outputs from these receptors are influenced both by the binding of the chemoeffector ligand to their periplasmic domains and by methylation of specific glutamate residues on their cytoplasmic domains. Methylation is catalysed by CheR, an S-adenosylmethionine-dependent methyltransferase [], which reversibly methylates specific glutamate residues within a coiled coil region, to form gamma-glutamyl methyl ester residues [, ]. The structure of the Salmonella typhimurium chemotaxis receptor methyltransferase CheR, bound to S-adenosylhomocysteine, has been determined to a resolution of 2.0 A []. The structure reveals CheR to be a two-domain protein, with a smaller N-terminal helical domain linked via a single polypeptide connection to a larger C-terminal alpha/beta domain. The C-terminal domain has the characteristics of a nucleotide-binding fold, with an insertion of a small anti-parallel beta-sheet subdomain. The S-adenosylhomocysteine-binding site is formed mainly by the large domain, with contributions from residues within the N-terminal domain and the linker region []. CheR proteins are part of the chemotaxis signaling mechanism which methylates the chemotaxis receptor at specific glutamate residues. This entry refers to the C-terminal SAM-binding domain of the CherR-type MCP methyltransferases, which are found in bacteria, archaea and green plants. This entry is found in association with PF03705 from PFAM. ; PDB: 1AF7_A 1BC5_A.
Probab=33.75 E-value=19 Score=24.70 Aligned_cols=47 Identities=15% Similarity=0.254 Sum_probs=29.0
Q ss_pred ccEEEeCCCCHHHHHHHHHHHHHHHhhcCchHHH-HHHHHHHhcccCCCceE
Q psy1421 5 KAVIKNADMSEEMQQDAVDCATQALEKYNIEKDI-AAYIKKEFDKKYNPTWH 55 (89)
Q Consensus 5 ~~~v~~~dM~~~mq~~i~~~~~~a~~~~~~~~~i-A~~iK~~lD~~yG~~Wh 55 (89)
++.|..+|.++.. ++.|+.+.=.....+.+ ...++++|++.-|+.|.
T Consensus 64 ~~~I~atDi~~~~----L~~Ar~G~Y~~~~~~~~~~~~~~ryf~~~~~~~~~ 111 (196)
T PF01739_consen 64 DFRILATDISPSA----LEKARAGIYPERSLRGLPPAYLRRYFTERDGGGYR 111 (196)
T ss_dssp SEEEEEEES-HHH----HHHHHHTEEEGGGGTTS-HHHHHHHEEEE-CCCTT
T ss_pred ceEEEEEECCHHH----HHHHHhCCCCHHHHhhhHHHHHHHhccccCCCcee
Confidence 6899999999975 45555552111112233 45678999888787777
No 60
>KOG2130|consensus
Probab=33.72 E-value=38 Score=25.76 Aligned_cols=19 Identities=16% Similarity=0.331 Sum_probs=14.2
Q ss_pred CCceEEEEeCCceeeEEEc
Q psy1421 51 NPTWHCIVGRNFGSYVTHE 69 (89)
Q Consensus 51 G~~Wh~vVG~~fg~~~th~ 69 (89)
|+-||||+--.....+|+.
T Consensus 277 ~GWWHvVlNle~TIAiTqN 295 (407)
T KOG2130|consen 277 SGWWHVVLNLEPTIAITQN 295 (407)
T ss_pred CCeEEEEeccCceeeeeec
Confidence 5679999887777666653
No 61
>PF10925 DUF2680: Protein of unknown function (DUF2680); InterPro: IPR024485 Members in this family of proteins are annotated as YckD however currently no function is known.
Probab=33.35 E-value=91 Score=17.36 Aligned_cols=16 Identities=31% Similarity=0.441 Sum_probs=13.2
Q ss_pred hHHHHHHHHHHhcccC
Q psy1421 35 EKDIAAYIKKEFDKKY 50 (89)
Q Consensus 35 ~~~iA~~iK~~lD~~y 50 (89)
-+|-|..||+.+|+++
T Consensus 34 TqeqAd~ik~~id~~~ 49 (59)
T PF10925_consen 34 TQEQADAIKKHIDQRQ 49 (59)
T ss_pred CHHHHHHHHHHHHHHH
Confidence 4678999999999875
No 62
>TIGR01641 phageSPP1_gp7 phage putative head morphogenesis protein, SPP1 gp7 family. This model describes a region of about 110 amino acids found exclusively in phage-related proteins, internally or toward the C-terminus. One member, gp7 of phage SPP1, appears involved in head morphogenesis.
Probab=33.16 E-value=65 Score=19.35 Aligned_cols=13 Identities=23% Similarity=0.406 Sum_probs=5.5
Q ss_pred chHHHHHHHHHHh
Q psy1421 34 IEKDIAAYIKKEF 46 (89)
Q Consensus 34 ~~~~iA~~iK~~l 46 (89)
+.+++|+.|++.|
T Consensus 14 ~~~~iak~i~~~~ 26 (108)
T TIGR01641 14 GPNELAKRLRKEL 26 (108)
T ss_pred CHHHHHHHHHHHH
Confidence 3344444444443
No 63
>PF08015 Pheromone: Fungal mating-type pheromone; InterPro: IPR012597 This family corresponds to mating-type pheromone proteins. The homobasidiomycetes, or mushroom fungi, have arguably the most complex mating system of all known organisms. Many species possess a mating system known as bifactorial incompatibility, where two unlinked loci control the mating-type of an individual incompatibility loci (the A and B mating-type loci). Each A mating-type sublocus encodes a pair of divergently transcribed homeodomain transcription factors while the genes responsible for B mating-type activity encode lipopeptide pheromones and G-protein -coupled pheromone receptors [].; GO: 0000772 mating pheromone activity, 0016020 membrane
Probab=32.89 E-value=25 Score=19.93 Aligned_cols=14 Identities=14% Similarity=0.608 Sum_probs=8.8
Q ss_pred HhcccCCC--ceEEEE
Q psy1421 45 EFDKKYNP--TWHCIV 58 (89)
Q Consensus 45 ~lD~~yG~--~Wh~vV 58 (89)
..|+..|+ +|-|||
T Consensus 54 d~Er~~~g~~~~fCVI 69 (69)
T PF08015_consen 54 DFERRGGGGAGAFCVI 69 (69)
T ss_pred CccccCCCCceEEEeC
Confidence 34555555 788886
No 64
>PRK10144 formate-dependent nitrite reductase complex subunit NrfF; Provisional
Probab=32.88 E-value=26 Score=22.71 Aligned_cols=12 Identities=17% Similarity=0.191 Sum_probs=7.4
Q ss_pred HHHHHhcccCCC
Q psy1421 41 YIKKEFDKKYNP 52 (89)
Q Consensus 41 ~iK~~lD~~yG~ 52 (89)
.|..+|-++||.
T Consensus 78 eI~~~~v~RYG~ 89 (126)
T PRK10144 78 EIIGWMTERYGD 89 (126)
T ss_pred HHHHHHHHhcCC
Confidence 355666667763
No 65
>KOG3165|consensus
Probab=32.81 E-value=41 Score=23.10 Aligned_cols=25 Identities=24% Similarity=0.587 Sum_probs=20.5
Q ss_pred HHHHHHHHhcccCCCceEEEEeCCc
Q psy1421 38 IAAYIKKEFDKKYNPTWHCIVGRNF 62 (89)
Q Consensus 38 iA~~iK~~lD~~yG~~Wh~vVG~~f 62 (89)
+++.+--..+...||+.||+|-.||
T Consensus 48 ~~s~lffqyn~~L~PPy~vivDTNF 72 (195)
T KOG3165|consen 48 VPSALFFQYNTTLGPPYHVIVDTNF 72 (195)
T ss_pred cchhHHHhcccccCCCeEEEEecch
Confidence 4566666778899999999998876
No 66
>PF03869 Arc: Arc-like DNA binding domain; InterPro: IPR005569 Arc repressor act by the cooperative binding of two Arc repressor dimers to a 21-base-pair operator site. Each Arc dimer uses an antiparallel beta-sheet to recognise bases in the major groove [].; GO: 0003677 DNA binding; PDB: 3QOQ_D 1MNT_B 1QTG_B 1BDV_A 1PAR_C 1BDT_C 1ARR_B 1MYL_F 1MYK_A 1NLA_B ....
Probab=32.68 E-value=56 Score=17.45 Aligned_cols=35 Identities=14% Similarity=0.301 Sum_probs=24.3
Q ss_pred CCCHHHHHHHHHHHHHHHhhcCc-hHHHHHHHHHHhccc
Q psy1421 12 DMSEEMQQDAVDCATQALEKYNI-EKDIAAYIKKEFDKK 49 (89)
Q Consensus 12 dM~~~mq~~i~~~~~~a~~~~~~-~~~iA~~iK~~lD~~ 49 (89)
-||+++.+.+-+.|.. ...+ ..+|...|.+.|.+.
T Consensus 10 RlP~~l~~~lk~~A~~---~gRS~NsEIv~~L~~~l~~e 45 (50)
T PF03869_consen 10 RLPEELKEKLKERAEE---NGRSMNSEIVQRLEEALKKE 45 (50)
T ss_dssp ECEHHHHHHHHHHHHH---TTS-HHHHHHHHHHHHHHHC
T ss_pred ECCHHHHHHHHHHHHH---hCCChHHHHHHHHHHHHhcc
Confidence 3899988888776653 2222 467888888888764
No 67
>PRK06489 hypothetical protein; Provisional
Probab=32.62 E-value=28 Score=25.27 Aligned_cols=32 Identities=13% Similarity=0.207 Sum_probs=21.2
Q ss_pred HHHHHHHHHHhcccCC-CceEEEEeCCceeeEE
Q psy1421 36 KDIAAYIKKEFDKKYN-PTWHCIVGRNFGSYVT 67 (89)
Q Consensus 36 ~~iA~~iK~~lD~~yG-~~Wh~vVG~~fg~~~t 67 (89)
.++|+.+...+.+..| ...+++||.|.|+.+.
T Consensus 136 ~~~a~~~~~~l~~~lgi~~~~~lvG~SmGG~vA 168 (360)
T PRK06489 136 DDMVEAQYRLVTEGLGVKHLRLILGTSMGGMHA 168 (360)
T ss_pred HHHHHHHHHHHHHhcCCCceeEEEEECHHHHHH
Confidence 3566666665544444 5678899999997653
No 68
>PF10757 YbaJ: Biofilm formation regulator YbaJ; InterPro: IPR019693 YbaJ regulates biofilm formation. It also has an important role in the regulation of motility in the biofilm. YbaJ functions in increasing conjugation, aggregation and decreasing the motility, resulting in an increase of biofilm [].
Probab=32.30 E-value=65 Score=20.79 Aligned_cols=20 Identities=35% Similarity=0.592 Sum_probs=17.9
Q ss_pred hcCchHHHHHHHHHHhcccC
Q psy1421 31 KYNIEKDIAAYIKKEFDKKY 50 (89)
Q Consensus 31 ~~~~~~~iA~~iK~~lD~~y 50 (89)
||+++.++...|-++||++|
T Consensus 65 KYp~~~~l~~~ideYLDeTy 84 (122)
T PF10757_consen 65 KYPDESDLIELIDEYLDETY 84 (122)
T ss_pred ccCcHhhHHHHHHHHHHHHH
Confidence 67888899999999999987
No 69
>TIGR03228 anthran_1_2_A anthranilate 1,2-dioxygenase, large subunit. Anthranilate (2-aminobenzoate) is an intermediate of tryptophan (Trp) biosynthesis and degradation. Members of this family are the large subunit of anthranilate 1,2-dioxygenase, which acts in Trp degradation by converting anthranilate to catechol. Closely related paralogs typically are the benzoate 1,2-dioxygenase large subunit, among the larger set of ring-hydroxylating dioxygenases.
Probab=31.70 E-value=1.1e+02 Score=23.80 Aligned_cols=43 Identities=9% Similarity=0.314 Sum_probs=32.2
Q ss_pred HHHHhcccCCCceEEEEeCCceeeEEEcCCcEEEEEeCCEEEEEEee
Q psy1421 42 IKKEFDKKYNPTWHCIVGRNFGSYVTHETRHFIYFYLGQVAILLFKS 88 (89)
Q Consensus 42 iK~~lD~~yG~~Wh~vVG~~fg~~~th~~~~~~~f~~~~~~~lv~kt 88 (89)
...++++-|+++|+.|-=.+ .+ -+++.|+-+.+++..++|+|.
T Consensus 27 f~~E~~~IF~~~W~~v~h~s---el-p~~GDy~t~~ig~~pviv~R~ 69 (438)
T TIGR03228 27 FDLEMELIFEKNWIYACHES---EL-PNNHDFVTVRAGRQPMIVTRD 69 (438)
T ss_pred HHHHHHHHHhhCCEEEEEHH---HC-CCCCCeEEEEECCeEEEEEEC
Confidence 35778888999999875332 22 356788989999999999874
No 70
>cd01579 AcnA_Bact_Swivel Bacterial Aconitase-like swivel domain. Aconitase (aconitate hydratase or citrate hydrolyase) catalyzes the reversible isomerization of citrate and isocitrate as part of the TCA cycle. Cis-aconitate is formed as an intermediate product during the course of the reaction. This is the aconitase-like swivel domain, which is believed to undergo swivelling conformational change in the enzyme mechanism. This distinct subfamily is found only in bacteria and archea. Its exact characteristics are not known.
Probab=30.98 E-value=33 Score=21.87 Aligned_cols=19 Identities=21% Similarity=0.230 Sum_probs=12.7
Q ss_pred cCCCceEEEEeCCceeeEE
Q psy1421 49 KYNPTWHCIVGRNFGSYVT 67 (89)
Q Consensus 49 ~yG~~Wh~vVG~~fg~~~t 67 (89)
+.|..+=+|.|++||+=-|
T Consensus 45 ~~~~~~iiVaG~nFG~GSS 63 (121)
T cd01579 45 KAAGPGFIVGGENYGQGSS 63 (121)
T ss_pred ccCCCeEEEcCCcCCCCcc
Confidence 3355567788889996443
No 71
>KOG4194|consensus
Probab=30.40 E-value=33 Score=28.44 Aligned_cols=24 Identities=21% Similarity=0.581 Sum_probs=19.5
Q ss_pred CCceEEEEeCCceeeEEEcCCcEE
Q psy1421 51 NPTWHCIVGRNFGSYVTHETRHFI 74 (89)
Q Consensus 51 G~~Wh~vVG~~fg~~~th~~~~~~ 74 (89)
.+..||||...|||.+++..+--+
T Consensus 586 ~grYQCVvtN~FGStysqk~KltV 609 (873)
T KOG4194|consen 586 EGRYQCVVTNHFGSTYSQKAKLTV 609 (873)
T ss_pred CceEEEEEecccCcchhheeEEEe
Confidence 588999999999999887655433
No 72
>PF06983 3-dmu-9_3-mt: 3-demethylubiquinone-9 3-methyltransferase; PDB: 1U7I_A 1TSJ_A 1U69_D 3L20_B 3OMS_A.
Probab=30.19 E-value=24 Score=21.94 Aligned_cols=12 Identities=17% Similarity=0.733 Sum_probs=7.0
Q ss_pred hcccCCCceEEE
Q psy1421 46 FDKKYNPTWHCI 57 (89)
Q Consensus 46 lD~~yG~~Wh~v 57 (89)
+-.|||=+|++|
T Consensus 105 v~DkFGv~Wqiv 116 (116)
T PF06983_consen 105 VTDKFGVSWQIV 116 (116)
T ss_dssp EE-TTS-EEEEE
T ss_pred EEeCCCCEEEeC
Confidence 344778888875
No 73
>KOG1573|consensus
Probab=28.77 E-value=2e+02 Score=19.86 Aligned_cols=60 Identities=23% Similarity=0.341 Sum_probs=29.1
Q ss_pred CCCHHHHHHHHHHHHHHHhhcCchH-----HHHHHHHHHhcccCC-CceEEE-----EeCCceeeEEEcCCc
Q psy1421 12 DMSEEMQQDAVDCATQALEKYNIEK-----DIAAYIKKEFDKKYN-PTWHCI-----VGRNFGSYVTHETRH 72 (89)
Q Consensus 12 dM~~~mq~~i~~~~~~a~~~~~~~~-----~iA~~iK~~lD~~yG-~~Wh~v-----VG~~fg~~~th~~~~ 72 (89)
|+++..-..+...|-..-+.|+++. -+...|-+-| .-+| |.|-|| ||=.|++++-|-..+
T Consensus 92 DlDepni~Ha~QtAE~iR~~~Pd~dWlHLtaLiHDLGKvl-~f~GepQWAVvGDTfpVGC~~~~s~V~~d~~ 162 (204)
T KOG1573|consen 92 DLDEPNIQHALQTAEAIRKDYPDEDWLHLTALIHDLGKVL-AFGGEPQWAVVGDTFPVGCAFDASNVHHDKY 162 (204)
T ss_pred CCchHHHHHHHHHHHHHHHhCCCccHHHHHHHHHHHHHHH-HhcCCcceeeecCcccccccccccceechhh
Confidence 4455555555555544444555432 2333344445 4444 678764 333455555554433
No 74
>PRK14964 DNA polymerase III subunits gamma and tau; Provisional
Probab=28.55 E-value=56 Score=25.73 Aligned_cols=26 Identities=15% Similarity=0.405 Sum_probs=23.9
Q ss_pred HHHHHHHHHHhcccCCCceEEEEeCC
Q psy1421 36 KDIAAYIKKEFDKKYNPTWHCIVGRN 61 (89)
Q Consensus 36 ~~iA~~iK~~lD~~yG~~Wh~vVG~~ 61 (89)
.++|..+++.|.+--|..|.|.+.+.
T Consensus 434 ~~~~~~~~~~l~~~t~~~w~~~~~~~ 459 (491)
T PRK14964 434 SNFCNDLKNFLNQLTNKKWNICIDKN 459 (491)
T ss_pred hHHHHHHHHHHHHHhCCCeEEEecch
Confidence 68999999999999999999998864
No 75
>PF10655 DUF2482: Hypothetical protein of unknown function (DUF2482); InterPro: IPR018917 This entry is represented by Bacteriophage 80, Orf10. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. All the members of this very small, very short family are derived from bacteriophages, of the SA bacteriophages 11, Mu50B, system, and from the Staphylococcal_phi-Mu50B-like_prophages subsystem. All members are hypothetical proteins.
Probab=27.85 E-value=38 Score=21.01 Aligned_cols=31 Identities=32% Similarity=0.526 Sum_probs=18.7
Q ss_pred CchHHHHHHHHHHhc--c-----------cCCCceEEEEeCCce
Q psy1421 33 NIEKDIAAYIKKEFD--K-----------KYNPTWHCIVGRNFG 63 (89)
Q Consensus 33 ~~~~~iA~~iK~~lD--~-----------~yG~~Wh~vVG~~fg 63 (89)
...-|+|..|+++=| - -|-+.=||++|+-|+
T Consensus 22 ~ELydL~~eI~kETeFdillfS~igv~~GD~~~ss~~alG~~~~ 65 (100)
T PF10655_consen 22 GELYDLANEIDKETEFDILLFSTIGVSNGDFISSSHCALGNPFG 65 (100)
T ss_pred HHHHHHHHHhcccceeeeeeeeeeccccCccccccchhhccHHH
Confidence 334578888866432 2 233557788887554
No 76
>cd05503 Bromo_BAZ2A_B_like Bromodomain, BAZ2A/BAZ2B_like subfamily. Bromo adjacent to zinc finger 2A (BAZ2A) and 2B (BAZ2B) were identified as a novel human bromodomain gene by cDNA library screening. BAZ2A is also known as Tip5 (Transcription termination factor I-interacting protein 5) and hWALp3. The proteins may play roles in transcriptional regulation. Human Tip5 is part of a complex termed NoRC (nucleolar remodeling complex), which induces nucleosome sliding and may play a role in the regulation of the rDNA locus. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
Probab=27.73 E-value=1.1e+02 Score=18.16 Aligned_cols=36 Identities=17% Similarity=0.149 Sum_probs=20.8
Q ss_pred HHHHHHHHHHHHHHHhhcCc---hHHHHHHHHHHhcccC
Q psy1421 15 EEMQQDAVDCATQALEKYNI---EKDIAAYIKKEFDKKY 50 (89)
Q Consensus 15 ~~mq~~i~~~~~~a~~~~~~---~~~iA~~iK~~lD~~y 50 (89)
.+...++.-+...|..-+.. .-..|..|++.|++++
T Consensus 59 ~ef~~D~~li~~Na~~yN~~~s~i~~~a~~l~~~f~~~~ 97 (97)
T cd05503 59 EEFAEDVRLVFDNCETFNEDDSEVGRAGHNMRKFFEKRW 97 (97)
T ss_pred HHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHhC
Confidence 34555555555555432222 2356888888888764
No 77
>PF01743 PolyA_pol: Poly A polymerase head domain; InterPro: IPR002646 This group includes nucleic acid independent RNA polymerases, such as polynucleotide adenylyltransferase (2.7.7.19 from EC), which adds the poly (A) tail to mRNA. This group also includes the tRNA nucleotidyltransferase that adds the CCA to the 3' of the tRNA 2.7.7.25 from EC.; GO: 0003723 RNA binding, 0016779 nucleotidyltransferase activity, 0006396 RNA processing; PDB: 1VFG_A 3H38_A 3H3A_B 3H39_B 3H37_A 1MIY_A 1MIV_B 1MIW_B 1OU5_B 3AQN_A ....
Probab=27.10 E-value=1.6e+02 Score=18.31 Aligned_cols=37 Identities=19% Similarity=0.151 Sum_probs=27.4
Q ss_pred HHHHHHHhcccCCCceEEEEeCCceeeEEEcCCcEEEEE
Q psy1421 39 AAYIKKEFDKKYNPTWHCIVGRNFGSYVTHETRHFIYFY 77 (89)
Q Consensus 39 A~~iK~~lD~~yG~~Wh~vVG~~fg~~~th~~~~~~~f~ 77 (89)
+..+.+.|.++++..+++ |+.|+..--+..+..+.+.
T Consensus 28 ~~~~~~~l~~~~~~~~~~--~~~~~~~~~~~~~~~~di~ 64 (126)
T PF01743_consen 28 PEEFAKLLAKKLGGVFVV--GKRFGTVRVVFGGGSIDIA 64 (126)
T ss_dssp HHHHHHHHCTTCCEEEEE--ETTTTEEEEEETTCEEEEE
T ss_pred HHHHHHHHHhhccccccc--ccccceeeecCCCcccccc
Confidence 455777888888888877 9999987766666655543
No 78
>PF07062 Clc-like: Clc-like; InterPro: IPR010761 Clc proteins are a nine-member gene family of chloride channels that have diverse roles in the plasma membrane and in intracellular organelles, especially membrane excitability and the maintenance of osmotic balance [, ]. This family contains a number of Clc-like proteins that are approximately 250 residues long and their homologues. ; GO: 0016021 integral to membrane
Probab=26.91 E-value=56 Score=22.93 Aligned_cols=24 Identities=25% Similarity=0.771 Sum_probs=17.6
Q ss_pred CCCceEEEEeCCceeeEEEcCCcEEE
Q psy1421 50 YNPTWHCIVGRNFGSYVTHETRHFIY 75 (89)
Q Consensus 50 yG~~Wh~vVG~~fg~~~th~~~~~~~ 75 (89)
+-|.||++--++++.. |+.|-..+
T Consensus 23 ~SPsWQvv~~~e~~~~--h~hGLW~d 46 (211)
T PF07062_consen 23 FSPSWQVVNIREFRAY--HQHGLWLD 46 (211)
T ss_pred cCCceEEEEecccCcE--Eecceeee
Confidence 5799999999999977 44454443
No 79
>cd00454 Trunc_globin Truncated hemoglobins (trHbs) are a family of oxygen-binding heme proteins found in cyanobacteria, eubacteria, unicellular eukaryotes, and plants. The truncated hemoglobins have a characteristic two-over-two alpha helical folding pattern that is distinct from the three-over-three pattern found in other globins. A subset of these have been demonstrated to form homodimers.
Probab=26.67 E-value=1.5e+02 Score=17.71 Aligned_cols=36 Identities=14% Similarity=0.231 Sum_probs=27.6
Q ss_pred CCCHHHHHHHHHHHHHHHhhcCchHHHHHHHHHHhc
Q psy1421 12 DMSEEMQQDAVDCATQALEKYNIEKDIAAYIKKEFD 47 (89)
Q Consensus 12 dM~~~mq~~i~~~~~~a~~~~~~~~~iA~~iK~~lD 47 (89)
.++++.-...++...+|++++....++++.+...+.
T Consensus 72 ~I~~~~f~~~l~~l~~al~~~~~~~~~~~~~~~~~~ 107 (116)
T cd00454 72 PITEEEFDAWLELLRDALDELGVPAELADALLARAE 107 (116)
T ss_pred CCCHHHHHHHHHHHHHHHHHhCCCHHHHHHHHHHHH
Confidence 677888888999999999998876666666655543
No 80
>PHA00368 internal virion protein D
Probab=26.47 E-value=94 Score=27.30 Aligned_cols=40 Identities=30% Similarity=0.429 Sum_probs=30.0
Q ss_pred eCCCC-HHHHHHHHHHHHHHHhhc-------CchHHHHHHHHHHhccc
Q psy1421 10 NADMS-EEMQQDAVDCATQALEKY-------NIEKDIAAYIKKEFDKK 49 (89)
Q Consensus 10 ~~dM~-~~mq~~i~~~~~~a~~~~-------~~~~~iA~~iK~~lD~~ 49 (89)
..+|+ ++++++|-+-+..|++.. +-|+++-..||++||.+
T Consensus 585 ~~~~sr~~~rq~iYrr~a~AIE~~el~a~Ltk~Er~vmd~~k~hfd~K 632 (1315)
T PHA00368 585 GAKMSREAARQEIYRRVALAIERPELQANLTKAERKVMDILKEHFDLK 632 (1315)
T ss_pred cccccHHHHHHHHHHHHHHHhcCHHHHhhccHHHHHHHHHHHHHHHHH
Confidence 45555 568888888888887754 23678999999999985
No 81
>TIGR02087 LEUD_arch 3-isopropylmalate dehydratase, small subunit. This subfamily is most closely related to the 3-isopropylmalate dehydratase, small subunits which form TIGR00171. This subfamily includes the members of TIGR02084 which are gene clustered with other genes of leucine biosynthesis. The rest of the subfamily includes mainly archaeal species which exhibit two hits to this model. In these cases it is possible that one or the other of the hits does not have a 3-isopropylmalate dehydratase activity but rather one of the other related aconitase-like activities.
Probab=26.20 E-value=99 Score=20.52 Aligned_cols=34 Identities=21% Similarity=0.370 Sum_probs=20.4
Q ss_pred chHHHHHHHHHHhcccC----CCceEEEEeCCceeeEE
Q psy1421 34 IEKDIAAYIKKEFDKKY----NPTWHCIVGRNFGSYVT 67 (89)
Q Consensus 34 ~~~~iA~~iK~~lD~~y----G~~Wh~vVG~~fg~~~t 67 (89)
+..++++++-..+|..| .+.-=+|.|+|||+=-|
T Consensus 25 ~~~~l~~~~f~~~~p~f~~~~~~g~iiVaG~NFG~GSS 62 (154)
T TIGR02087 25 DPDELASHAMEGIDPEFAKKVRPGDVIVAGKNFGCGSS 62 (154)
T ss_pred CHHHHHhhccCcCCchhhhcCCCCcEEEcCCcccCCcc
Confidence 34557777666665433 44555666779996443
No 82
>CHL00168 pbsA heme oxygenase; Provisional
Probab=26.05 E-value=1.3e+02 Score=21.39 Aligned_cols=37 Identities=14% Similarity=0.096 Sum_probs=25.6
Q ss_pred CCCCHHHHHHHHHHHHHHHhhcCc-hHHHHHHHHHHhc
Q psy1421 11 ADMSEEMQQDAVDCATQALEKYNI-EKDIAAYIKKEFD 47 (89)
Q Consensus 11 ~dM~~~mq~~i~~~~~~a~~~~~~-~~~iA~~iK~~lD 47 (89)
.+++++.+..+++.|+.|+.-... ..|++..+++.+.
T Consensus 179 l~l~e~e~~~iI~EA~~AF~lN~~vf~eL~~~~~~~~~ 216 (238)
T CHL00168 179 LPLSDDQIQNIIAEANIAFNLNMKMFQELNSSFIKIIT 216 (238)
T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 468999999999999999774432 3455554554443
No 83
>PF01545 Cation_efflux: Cation efflux family; InterPro: IPR002524 Members of this family are integral membrane proteins, that are found to increase tolerance to divalent metal ions such as cadmium, zinc, and cobalt. These proteins are considered to be efflux pumps that remove these ions from cells [, ], however others are implicated in ion uptake []. The family has six predicted transmembrane domains. Members of the family are variable in length because of variably sized inserts, often containing low-complexity sequence.; GO: 0008324 cation transmembrane transporter activity, 0006812 cation transport, 0055085 transmembrane transport, 0016021 integral to membrane; PDB: 3BYP_B 3BYR_A 3H90_B 2QFI_B 2ZZT_A.
Probab=26.04 E-value=84 Score=21.84 Aligned_cols=22 Identities=18% Similarity=0.428 Sum_probs=19.9
Q ss_pred HHHHHHHHHhcccCCCceEEEE
Q psy1421 37 DIAAYIKKEFDKKYNPTWHCIV 58 (89)
Q Consensus 37 ~iA~~iK~~lD~~yG~~Wh~vV 58 (89)
++++.|++.+-++|++-++|.|
T Consensus 255 ~i~~~i~~~l~~~~~~i~~v~I 276 (284)
T PF01545_consen 255 EIRERIEKRLREKFPGIYDVTI 276 (284)
T ss_dssp HHHHHHHHHHHHHSTTCEEEEE
T ss_pred HHHHHHHHHHHHhCCCcEEEEE
Confidence 5899999999999999999776
No 84
>PF15571 Imm25: Immunity protein 25
Probab=25.38 E-value=52 Score=21.34 Aligned_cols=23 Identities=26% Similarity=0.292 Sum_probs=18.6
Q ss_pred HHHHHHHHHHhcccCCCceEEEE
Q psy1421 36 KDIAAYIKKEFDKKYNPTWHCIV 58 (89)
Q Consensus 36 ~~iA~~iK~~lD~~yG~~Wh~vV 58 (89)
.++...||+.+|.-||..|+-|-
T Consensus 18 r~~r~~Ik~~~~~~~g~~~~~I~ 40 (124)
T PF15571_consen 18 REIRNEIKELNDNLYGIEIESIA 40 (124)
T ss_pred HHHHHHHHHHHccccccchhhhc
Confidence 45777889999999999987543
No 85
>COG1498 SIK1 Protein implicated in ribosomal biogenesis, Nop56p homolog [Translation, ribosomal structure and biogenesis]
Probab=25.34 E-value=1.2e+02 Score=23.39 Aligned_cols=40 Identities=28% Similarity=0.454 Sum_probs=31.8
Q ss_pred EeCCCCHHHHHHHHHHHHHHHhhcCchHHHHHHHHHHhcc
Q psy1421 9 KNADMSEEMQQDAVDCATQALEKYNIEKDIAAYIKKEFDK 48 (89)
Q Consensus 9 ~~~dM~~~mq~~i~~~~~~a~~~~~~~~~iA~~iK~~lD~ 48 (89)
...+|++.+-..|...|....+.+..-+++.++|...|+.
T Consensus 200 mG~~~~~~Di~~i~~~ae~i~~L~~~R~~l~~Yi~~~M~~ 239 (395)
T COG1498 200 MGADLSEEDIDNIRELAEIILELYELREQLEEYIESKMSE 239 (395)
T ss_pred cccCCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4578899988889998888877777667788888877764
No 86
>PF06840 DUF1241: Protein of unknown function (DUF1241); InterPro: IPR009652 This family consists of several programmed cell death 10 protein (PDCD10 or TFAR15) sequences. The function of this family is unknown.; PDB: 3L8I_A 3RQG_B 3RQE_B 3L8J_A 3RQF_B 3AJM_B.
Probab=25.09 E-value=52 Score=22.07 Aligned_cols=13 Identities=54% Similarity=0.644 Sum_probs=12.0
Q ss_pred HHHHHHHHHHhcc
Q psy1421 36 KDIAAYIKKEFDK 48 (89)
Q Consensus 36 ~~iA~~iK~~lD~ 48 (89)
++||..||+-||.
T Consensus 123 K~IAsaIK~lLdA 135 (154)
T PF06840_consen 123 KEIASAIKKLLDA 135 (154)
T ss_dssp HHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHH
Confidence 7999999999996
No 87
>KOG1569|consensus
Probab=24.81 E-value=1.5e+02 Score=22.27 Aligned_cols=27 Identities=15% Similarity=0.212 Sum_probs=18.1
Q ss_pred HHHHHHHHHhcc------cCCCce-EEEEeC-Cce
Q psy1421 37 DIAAYIKKEFDK------KYNPTW-HCIVGR-NFG 63 (89)
Q Consensus 37 ~iA~~iK~~lD~------~yG~~W-h~vVG~-~fg 63 (89)
++++.|-+...- .+++.| ||.||+ +|.
T Consensus 227 nipemieeFk~G~~i~~d~~~~~~~~~~vGkl~mt 261 (323)
T KOG1569|consen 227 NIPEMIEEFKNGHEIKFDEERENILQIKVGKLDMT 261 (323)
T ss_pred chHHHHHHhhCCcccccccccCceeeeeeeeecCC
Confidence 566665443332 457889 999999 775
No 88
>PF14824 Sirohm_synth_M: Sirohaem biosynthesis protein central; PDB: 1KYQ_B.
Probab=24.75 E-value=35 Score=16.67 Aligned_cols=16 Identities=25% Similarity=0.453 Sum_probs=11.1
Q ss_pred hHHHHHHHHHHhcccC
Q psy1421 35 EKDIAAYIKKEFDKKY 50 (89)
Q Consensus 35 ~~~iA~~iK~~lD~~y 50 (89)
...+|+.|++.+++.|
T Consensus 15 sP~la~~iR~~ie~~l 30 (30)
T PF14824_consen 15 SPRLARLIRKEIERLL 30 (30)
T ss_dssp -HHHHHHHHHHHHHHH
T ss_pred ChHHHHHHHHHHHHhC
Confidence 3458888888887653
No 89
>PRK10425 DNase TatD; Provisional
Probab=24.25 E-value=2.3e+02 Score=20.08 Aligned_cols=46 Identities=22% Similarity=0.304 Sum_probs=31.6
Q ss_pred HHHHHHHHHHHHHHHhhcCc---------hHHHHHHHHHHhcccCCCceEEEEeC
Q psy1421 15 EEMQQDAVDCATQALEKYNI---------EKDIAAYIKKEFDKKYNPTWHCIVGR 60 (89)
Q Consensus 15 ~~mq~~i~~~~~~a~~~~~~---------~~~iA~~iK~~lD~~yG~~Wh~vVG~ 60 (89)
.+.|.++.+...+...+++. .+++.+.||+...+.-+..+||.-|.
T Consensus 103 ~~~Q~~vF~~ql~lA~~~~~Pv~iH~r~a~~~~l~iL~~~~~~~~~~i~H~fsG~ 157 (258)
T PRK10425 103 PEEQERAFVAQLAIAAELNMPVFMHCRDAHERFMALLEPWLDKLPGAVLHCFTGT 157 (258)
T ss_pred HHHHHHHHHHHHHHHHHhCCCeEEEEeCchHHHHHHHHHhccCCCCeEEEecCCC
Confidence 56788888877777666532 46777777775433336788888776
No 90
>PRK00394 transcription factor; Reviewed
Probab=24.21 E-value=1.3e+02 Score=20.31 Aligned_cols=25 Identities=32% Similarity=0.503 Sum_probs=19.5
Q ss_pred eEEEcCCc--EEEEEe--CCEEEEEEeeC
Q psy1421 65 YVTHETRH--FIYFYL--GQVAILLFKSG 89 (89)
Q Consensus 65 ~~th~~~~--~~~f~~--~~~~~lv~kt~ 89 (89)
..+|+|.. .+-|++ ....+|||.||
T Consensus 120 ~~~YePe~fPglvyR~~~pk~~~lIF~SG 148 (179)
T PRK00394 120 NIEYEPEQFPGLVYRLDDPKVVVLLFGSG 148 (179)
T ss_pred CcEECcccCceEEEEecCCcEEEEEEcCC
Confidence 57888887 466777 36899999987
No 91
>COG1999 Uncharacterized protein SCO1/SenC/PrrC, involved in biogenesis of respiratory and photosynthetic systems [General function prediction only]
Probab=24.21 E-value=1.4e+02 Score=20.43 Aligned_cols=26 Identities=27% Similarity=0.599 Sum_probs=23.1
Q ss_pred chHHHHHHHHHHhcccCCCceEEEEe
Q psy1421 34 IEKDIAAYIKKEFDKKYNPTWHCIVG 59 (89)
Q Consensus 34 ~~~~iA~~iK~~lD~~yG~~Wh~vVG 59 (89)
.++|-++.+|+++...+.+.|+-+-|
T Consensus 114 PerDtp~~lk~Y~~~~~~~~~~~ltg 139 (207)
T COG1999 114 PERDTPEVLKKYAELNFDPRWIGLTG 139 (207)
T ss_pred CCCCCHHHHHHHhcccCCCCeeeeeC
Confidence 46678999999999888999999999
No 92
>cd06913 beta3GnTL1_like Beta 1, 3-N-acetylglucosaminyltransferase is essential for the formation of poly-N-acetyllactosamine . This family includes human Beta3GnTL1 and related eukaryotic proteins. Human Beta3GnTL1 is a putative beta-1,3-N-acetylglucosaminyltransferase. Beta3GnTL1 is expressed at various levels in most of tissues examined. Beta 1, 3-N-acetylglucosaminyltransferase has been found to be essential for the formation of poly-N-acetyllactosamine. Poly-N-acetyllactosamine is a unique carbohydrate composed of N-acetyllactosamine repeats. It is often an important part of cell-type-specific oligosaccharide structures and some functional oligosaccharides. It has been shown that the structure and biosynthesis of poly-N-acetyllactosamine display a dramatic change during development and oncogenesis. Several members of beta-1, 3-N-acetylglucosaminyltransferase have been identified.
Probab=24.18 E-value=1.5e+02 Score=19.44 Aligned_cols=6 Identities=17% Similarity=0.019 Sum_probs=2.8
Q ss_pred ceEEEE
Q psy1421 53 TWHCIV 58 (89)
Q Consensus 53 ~Wh~vV 58 (89)
.+++++
T Consensus 56 ~~~~~~ 61 (219)
T cd06913 56 GVIVLV 61 (219)
T ss_pred CeEEEE
Confidence 444444
No 93
>TIGR01916 F420_cofE F420-0:gamma-glutamyl ligase. This model represents an enzyme of coenzyme F(420) biosynthesis, as catalyzed by MJ0768 of Methanococcus jannaschii and by the N-terminal half of FbiB of Mycobacterium bovis strain BCG. Note that only two glutamates are ligated in M. jannaschii, but five to six in the Mycobacterium lineage. In M. jannaschii, CofE catalyzes the GTP-dependent addition of two L-glutamates.
Probab=24.15 E-value=1.1e+02 Score=21.92 Aligned_cols=30 Identities=13% Similarity=0.082 Sum_probs=26.5
Q ss_pred HHHHHHHHHHhcccCCCceEEEEeCCceee
Q psy1421 36 KDIAAYIKKEFDKKYNPTWHCIVGRNFGSY 65 (89)
Q Consensus 36 ~~iA~~iK~~lD~~yG~~Wh~vVG~~fg~~ 65 (89)
...|+.|++.|-+++|..=-|||..++|--
T Consensus 125 d~sA~~ir~~l~~~~g~~v~VIItDt~gr~ 154 (243)
T TIGR01916 125 DASAEKIRRGLRELTGVDVGVIITDTNGRP 154 (243)
T ss_pred HHHHHHHHHHHHHHHCCCEEEEEECCCCCc
Confidence 457999999999999999999999988854
No 94
>cd01580 AcnA_IRP_Swivel Aconitase A swivel domain. This is the major form of the TCA cycle enzyme aconitate hydratase, also known as aconitase and citrate hydro-lyase. It includes bacterial and archaeal aconitase A, and the eukaryotic cytosolic form of aconitase. This group also includes sequences that have been shown to act as an iron-responsive element (IRE) binding protein in animals and may have the same role in other eukaryotes. This is the aconitase-like swivel domain, which is believed to undergo swivelling conformational change in the enzyme mechanism.
Probab=23.96 E-value=48 Score=22.65 Aligned_cols=18 Identities=17% Similarity=0.287 Sum_probs=14.5
Q ss_pred cCCCceEEEEeCCceeeE
Q psy1421 49 KYNPTWHCIVGRNFGSYV 66 (89)
Q Consensus 49 ~yG~~Wh~vVG~~fg~~~ 66 (89)
+-|..|-+|.|++||+=-
T Consensus 93 ~~g~plIIvaG~nfG~GS 110 (171)
T cd01580 93 EEGVPLVILAGKEYGSGS 110 (171)
T ss_pred HcCCcEEEEccCcccCCC
Confidence 348899999999998643
No 95
>PF06793 UPF0262: Uncharacterised protein family (UPF0262); InterPro: IPR008321 There are currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=23.86 E-value=86 Score=21.15 Aligned_cols=29 Identities=21% Similarity=0.366 Sum_probs=21.9
Q ss_pred HHHHHHHHHHHHHHhhcCchHHHHHHHHHHhccc
Q psy1421 16 EMQQDAVDCATQALEKYNIEKDIAAYIKKEFDKK 49 (89)
Q Consensus 16 ~mq~~i~~~~~~a~~~~~~~~~iA~~iK~~lD~~ 49 (89)
..|.+++++++.++. +|=|+.|++.|+-|
T Consensus 108 p~qIEaIDMgRRGlH-----NEGa~lL~eRL~GK 136 (158)
T PF06793_consen 108 PSQIEAIDMGRRGLH-----NEGAELLQERLEGK 136 (158)
T ss_pred Hhhhhhhhhhhhccc-----hHHHHHHHHHhcCC
Confidence 467777787777765 67888888888754
No 96
>COG5556 Uncharacterized conserved protein [Function unknown]
Probab=23.85 E-value=1.9e+02 Score=18.06 Aligned_cols=31 Identities=35% Similarity=0.490 Sum_probs=20.9
Q ss_pred CCCHHHHHHHHHHHHHHHhhcCchHHHHHHHHHHh
Q psy1421 12 DMSEEMQQDAVDCATQALEKYNIEKDIAAYIKKEF 46 (89)
Q Consensus 12 dM~~~mq~~i~~~~~~a~~~~~~~~~iA~~iK~~l 46 (89)
-+++|++..|+. |+--|++..-+|...|++|
T Consensus 3 kLsdevKtfIvQ----alACFDs~S~Va~aVkkEf 33 (110)
T COG5556 3 KLSDEVKTFIVQ----ALACFDSPSVVAAAVKKEF 33 (110)
T ss_pred cccHHHHHHHHH----HHHHcCcHHHHHHHHHHHh
Confidence 356777776654 6666777666777777665
No 97
>COG4003 Uncharacterized protein conserved in archaea [Function unknown]
Probab=23.54 E-value=1.8e+02 Score=17.78 Aligned_cols=30 Identities=10% Similarity=0.187 Sum_probs=19.4
Q ss_pred HHHHHHHHHHHhhcCchHHHHHHHHHHhcc
Q psy1421 19 QDAVDCATQALEKYNIEKDIAAYIKKEFDK 48 (89)
Q Consensus 19 ~~i~~~~~~a~~~~~~~~~iA~~iK~~lD~ 48 (89)
+++++++.-..++-....++|+.|+..+-.
T Consensus 48 EEAlEii~yleKrGEi~~E~A~~L~~~~~~ 77 (98)
T COG4003 48 EEALEIINYLEKRGEITPEMAKALRVTLVH 77 (98)
T ss_pred HHHHHHHHHHHHhCCCCHHHHHHHHhhHHH
Confidence 445555554444445567899999887763
No 98
>TIGR03849 arch_ComA phosphosulfolactate synthase. This model finds the ComA (Coenzyme M biosynthesis A) protein, phosphosulfolactate synthase, in methanogenic archaea. The ComABC pathway is one of at least two pathways to the intermediate sulfopyruvate. Coenzyme M occurs rarely and sporadically outside of the archaea, as for expoxide metabolism in Xanthobacter autotrophicus Py2, but candidate phosphosulfolactate synthases from that and other species occur fall below the cutoff and outside the scope of this model. This model deliberately is narrower in scope than pfam02679.
Probab=23.53 E-value=2.5e+02 Score=20.11 Aligned_cols=47 Identities=17% Similarity=0.179 Sum_probs=31.5
Q ss_pred eCCCCHHHHHHHHHHHHHH-Hh-------hc------CchHHHHHHHHHHhcccCCCceEEEEe
Q psy1421 10 NADMSEEMQQDAVDCATQA-LE-------KY------NIEKDIAAYIKKEFDKKYNPTWHCIVG 59 (89)
Q Consensus 10 ~~dM~~~mq~~i~~~~~~a-~~-------~~------~~~~~iA~~iK~~lD~~yG~~Wh~vVG 59 (89)
..+||++.+...++.+++. ++ ++ .+..+..+.+++.||. +.|+|+|=
T Consensus 94 ~~~i~~~~~~rlI~~~~~~g~~v~~EvG~K~~~~~~~~~~~~~i~~~~~~LeA---GA~~ViiE 154 (237)
T TIGR03849 94 SMEISLEERCNLIERAKDNGFMVLSEVGKKSPEKDSELTPDDRIKLINKDLEA---GADYVIIE 154 (237)
T ss_pred ccCCCHHHHHHHHHHHHhCCCeEeccccccCCcccccCCHHHHHHHHHHHHHC---CCcEEEEe
Confidence 3589999999999988854 11 01 1124677778888874 56777763
No 99
>PRK13293 F420-0--gamma-glutamyl ligase; Reviewed
Probab=23.42 E-value=1.2e+02 Score=21.86 Aligned_cols=29 Identities=14% Similarity=0.190 Sum_probs=26.1
Q ss_pred HHHHHHHHHHhcccCCCceEEEEeCCcee
Q psy1421 36 KDIAAYIKKEFDKKYNPTWHCIVGRNFGS 64 (89)
Q Consensus 36 ~~iA~~iK~~lD~~yG~~Wh~vVG~~fg~ 64 (89)
..-|+.|++.|-+++|..=-|||..+||-
T Consensus 126 d~SA~~ir~~l~~~~g~~v~VIItDt~gr 154 (245)
T PRK13293 126 DESAERIREGLEELTGKKVGVIITDTNGR 154 (245)
T ss_pred HHHHHHHHHHHHHHHCCCEEEEEEcCCCc
Confidence 45799999999999999999999999884
No 100
>PF04622 ERG2_Sigma1R: ERG2 and Sigma1 receptor like protein; InterPro: IPR006716 This family consists of the fungal C-8 sterol isomerase and mammalian sigma1 receptor. C-8 sterol isomerase (delta-8--delta-7 sterol isomerase), catalyses a reaction in ergosterol biosynthesis, which results in unsaturation at C-7 in the B ring of sterols []. Sigma 1 receptor is a low molecular mass mammalian protein located in the endoplasmic reticulum [], which interacts with endogenous steroid hormones, such as progesterone and testosterone []. It also binds the sigma ligands, which are a set of chemically unrelated drugs including haloperidol, pentazocine, and ditolylguanidine []. Sigma1 effectors are not well understood, but sigma1 agonists have been observed to affect NMDA receptor function, the alpha-adrenergic system and opioid analgesia.; GO: 0000247 C-8 sterol isomerase activity, 0006696 ergosterol biosynthetic process, 0005783 endoplasmic reticulum
Probab=23.40 E-value=1.8e+02 Score=20.44 Aligned_cols=33 Identities=21% Similarity=0.253 Sum_probs=24.3
Q ss_pred HHHHHHHHHHhhcC-chHHHHHHHHHHhcccCCC
Q psy1421 20 DAVDCATQALEKYN-IEKDIAAYIKKEFDKKYNP 52 (89)
Q Consensus 20 ~i~~~~~~a~~~~~-~~~~iA~~iK~~lD~~yG~ 52 (89)
.+-+++++++.+++ +-+++-..|-+.|-++||+
T Consensus 35 ~l~~ia~~~ia~~~~~~~~~~~~l~~~L~~~y~~ 68 (216)
T PF04622_consen 35 VLHEIAKKAIARHPNDTEEILSDLVDELRKKYPD 68 (216)
T ss_pred HHHHHHHHHHhhcCCCHHHHHHHHHHHHHhHCCC
Confidence 44566777777664 3467788888999999987
No 101
>PRK13888 conjugal transfer protein TrbN; Provisional
Probab=23.38 E-value=26 Score=24.63 Aligned_cols=21 Identities=19% Similarity=0.376 Sum_probs=16.7
Q ss_pred HHHH-HHHHHhcccCCCceEEE
Q psy1421 37 DIAA-YIKKEFDKKYNPTWHCI 57 (89)
Q Consensus 37 ~iA~-~iK~~lD~~yG~~Wh~v 57 (89)
.++. .|++.|++++|-.|.+|
T Consensus 88 ~vGAWILr~~i~~~~G~~W~AV 109 (206)
T PRK13888 88 DLAAWRLRMHIRNDKGDLWTKA 109 (206)
T ss_pred HHHHHHHHHHHHHhhCchHHHH
Confidence 3444 56899998999999986
No 102
>PF02268 TFIIA_gamma_N: Transcription initiation factor IIA, gamma subunit, helical domain; InterPro: IPR015872 Transcription factor IIA (TFIIA) is one of several factors that form part of a transcription pre-initiation complex along with RNA polymerase II, the TATA-box-binding protein (TBP) and TBP-associated factors, on the TATA-box sequence upstream of the initiation start site. After initiation, some components of the pre-initiation complex (including TFIIA) remain attached and re-initiate a subsequent round of transcription. TFIIA binds to TBP to stabilise TBP binding to the TATA element. TFIIA also inhibits the cytokine HMGB1 (high mobility group 1 protein) binding to TBP [], and can dissociate HMGB1 already bound to TBP/TATA-box. Human and Drosophila TFIIA have three subunits: two large subunits, LN/alpha and LC/beta, derived from the same gene, and a small subunit, S/gamma. Yeast TFIIA has two subunits: a large TOA1 subunit that shows sequence similarity to the N-terminal of LN/alpha and the C-terminal of LC/beta, and a small subunit, TOA2 that is highly homologous with S/gamma. The conserved regions of the large and small subunits of TFIIA combine to form two domains: a four-helix bundle (helical domain) composed of two helices from each of the N-terminal regions of TOA1 and TOA2 in yeast; and a beta-barrel (beta-barrel domain) composed of beta-sheets from the C-terminal regions of TOA1 and TOA2 []. This entry represents the alpha-helical domain found at the N-terminal of the gamma subunit of transcription factor TFIIA.; GO: 0006367 transcription initiation from RNA polymerase II promoter, 0005672 transcription factor TFIIA complex; PDB: 1NVP_D 1RM1_B 1YTF_D 1NH2_D.
Probab=23.36 E-value=1.4e+02 Score=16.14 Aligned_cols=22 Identities=23% Similarity=0.431 Sum_probs=15.4
Q ss_pred HHHhhcCchHHHHHHHHHHhcc
Q psy1421 27 QALEKYNIEKDIAAYIKKEFDK 48 (89)
Q Consensus 27 ~a~~~~~~~~~iA~~iK~~lD~ 48 (89)
+.+..-....++|..+-..||+
T Consensus 21 eli~~~~I~p~La~kVL~~FDk 42 (49)
T PF02268_consen 21 ELIQEGKITPQLAMKVLEQFDK 42 (49)
T ss_dssp HHHHTTSS-HHHHHHHHHHHHH
T ss_pred HHHHcCCCCHHHHHHHHHHHHH
Confidence 3344445567899999999997
No 103
>PF02318 FYVE_2: FYVE-type zinc finger; InterPro: IPR003315 This entry represents the zinc-binding domain found in rabphilin Rab3A. The small G protein Rab3A plays an important role in the regulation of neurotransmitter release. The crystal structure of the small G protein Rab3A complexed with the effector domain of rabphilin-3A shows that the effector domain of rabphilin-3A contacts Rab3A in two distinct areas. The first interface involves the Rab3A switch I and switch II regions, which are sensitive to the nucleotide-binding state of Rab3A. The second interface consists of a deep pocket in Rab3A that interacts with a SGAWFF structural element of rabphilin-3A. Sequence and structure analysis, and biochemical data suggest that this pocket, or Rab complementarity-determining region (RabCDR), establishes a specific interaction between each Rab protein and its effectors. It has been suggested that RabCDRs could be major determinants of effector specificity during vesicle trafficking and fusion [].; GO: 0008270 zinc ion binding, 0017137 Rab GTPase binding, 0006886 intracellular protein transport; PDB: 2CSZ_A 2ZET_C 1ZBD_B 3BC1_B 2CJS_C 2A20_A.
Probab=23.24 E-value=1.3e+02 Score=18.68 Aligned_cols=36 Identities=17% Similarity=0.211 Sum_probs=22.9
Q ss_pred CCCCHHHHHHHHHHHHHHHhhcCchHHHHHHHHHHh
Q psy1421 11 ADMSEEMQQDAVDCATQALEKYNIEKDIAAYIKKEF 46 (89)
Q Consensus 11 ~dM~~~mq~~i~~~~~~a~~~~~~~~~iA~~iK~~l 46 (89)
|-++++-++.|++....+-+.-..+.+--..||+.|
T Consensus 1 s~Lt~eE~~~I~~Vl~R~~~l~~~E~~Ri~kLk~~L 36 (118)
T PF02318_consen 1 SHLTEEEREIILQVLQRDEELRKKEEERIRKLKQEL 36 (118)
T ss_dssp TTS-CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred CCCCHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHH
Confidence 346788888888888877554444444455566666
No 104
>KOG3938|consensus
Probab=23.05 E-value=1.5e+02 Score=22.01 Aligned_cols=22 Identities=27% Similarity=0.537 Sum_probs=19.4
Q ss_pred CCCCHHHHHHHHHHHHHHHhhc
Q psy1421 11 ADMSEEMQQDAVDCATQALEKY 32 (89)
Q Consensus 11 ~dM~~~mq~~i~~~~~~a~~~~ 32 (89)
-|+|+++.+.+++.+.+.++.|
T Consensus 250 Ed~~~e~~~~~i~kindllesy 271 (334)
T KOG3938|consen 250 EDQPDEFEEAAIEKINDLLESY 271 (334)
T ss_pred ccchhHHHHHHHHHHHHHHHHh
Confidence 4899999999999888888877
No 105
>cd00404 Aconitase_swivel Aconitase swivel domain. Aconitase (aconitate hydratase) catalyzes the reversible isomerization of citrate and isocitrate as part of the TCA cycle. This is the aconitase swivel domain, which undergoes swivelling conformational change in the enzyme mechanism. The aconitase family contains the following proteins: - Iron-responsive element binding protein (IRE-BP). IRE-BP is a cytosolic protein that binds to iron-responsive elements (IREs). IREs are stem-loop structures found in the 5'UTR of ferritin, and delta aminolevulinic acid synthase mRNAs, and in the 3'UTR of transferrin receptor mRNA. IRE-BP also express aconitase activity. - 3-isopropylmalate dehydratase (isopropylmalate isomerase), the enzyme that catalyzes the second step in the biosynthesis of leucine. - Homoaconitase (homoaconitate hydratase), an enzyme that participates in the alpha-aminoadipate pathway of lysine biosynthesis and that converts cis-homoaconitate into homoisocitric acid.
Probab=22.81 E-value=64 Score=19.54 Aligned_cols=15 Identities=20% Similarity=0.129 Sum_probs=10.4
Q ss_pred CceEEEEeCCceeeE
Q psy1421 52 PTWHCIVGRNFGSYV 66 (89)
Q Consensus 52 ~~Wh~vVG~~fg~~~ 66 (89)
+.|-+|.|++||+=-
T Consensus 15 ~~~iiVaG~nfG~GS 29 (88)
T cd00404 15 GPGVVIGDENYGTGS 29 (88)
T ss_pred CCEEEEecCCeecCC
Confidence 367777788888643
No 106
>cd03479 Rieske_RO_Alpha_PhDO_like Rieske non-heme iron oxygenase (RO) family, Phthalate 4,5-dioxygenase (PhDO)-like subfamily, N-terminal Rieske domain of the oxygenase alpha subunit; composed of the oxygenase alpha subunits of PhDO and similar proteins including 3-chlorobenzoate 3,4-dioxygenase (CBDO), phenoxybenzoate dioxygenase (POB-dioxygenase) and 3-nitrobenzoate oxygenase (MnbA). ROs comprise a large class of aromatic ring-hydroxylating dioxygenases that enable microorganisms to tolerate and utilize aromatic compounds for growth. The oxygenase alpha subunit contains an N-terminal Rieske domain with an [2Fe-2S] cluster and a C-terminal catalytic domain with a mononuclear Fe(II) binding site. The Rieske [2Fe-2S] cluster accepts electrons from a reductase or ferredoxin component and transfers them to the mononuclear iron for catalysis. PhDO and CBDO are two-component RO systems, containing oxygenase and reductase components. PhDO catalyzes the dihydroxylation of phthalate to form th
Probab=22.62 E-value=1.7e+02 Score=18.70 Aligned_cols=38 Identities=11% Similarity=0.183 Sum_probs=25.3
Q ss_pred hcccCCCceEEEEeC-CceeeEEEcCCcEEEEEeCCEEEEEEee
Q psy1421 46 FDKKYNPTWHCIVGR-NFGSYVTHETRHFIYFYLGQVAILLFKS 88 (89)
Q Consensus 46 lD~~yG~~Wh~vVG~-~fg~~~th~~~~~~~f~~~~~~~lv~kt 88 (89)
+|+.+-..|+.|.-. +.. +++.++.+.+++..++|||.
T Consensus 14 ~~~~~~~~W~~v~~~~eL~-----~~g~~~~~~~~g~~i~v~r~ 52 (144)
T cd03479 14 MGELLRRYWQPVALSSELT-----EDGQPVRVRLLGEDLVAFRD 52 (144)
T ss_pred hhhHhhCceEEEEEHHHCC-----CCCCEEEEEECCcEEEEEEe
Confidence 355566779986644 232 24677778888888888873
No 107
>PF01152 Bac_globin: Bacterial-like globin; InterPro: IPR001486 Globins are haem-containing proteins involved in binding and/or transporting oxygen. They belong to a very large and well studied family that is widely distributed in many organisms []. Globins have evolved from a common ancestor and can be divided into three groups: single-domain globins, and two types of chimeric globins, flavohaemoglobins and globin-coupled sensors. Bacteria have all three types of globins, while archaea lack flavohaemoglobins, and eukaryotes lack globin-coupled sensors []. Several functionally different haemoglobins can coexist in the same species. The major types of globins include: Haemoglobin (Hb): trimer of two alpha and two beta chains, although embryonic and foetal forms can substitute the alpha or beta chain for ones with higher oxygen affinity, such as gamma, delta, epsilon or zeta chains. Hb transports oxygen from lungs to other tissues in vertebrates []. Hb proteins are also present in unicellular organisms where they act as enzymes or sensors []. Myoglobin (Mb): monomeric protein responsible for oxygen storage in vertebrate muscle []. Neuroglobin: a myoglobin-like haemprotein expressed in vertebrate brain and retina, where it is involved in neuroprotection from damage due to hypoxia or ischemia []. Neuroglobin belongs to a branch of the globin family that diverged early in evolution. Cytoglobin: an oxygen sensor expressed in multiple tissues. Related to neuroglobin []. Erythrocruorin: highly cooperative extracellular respiratory proteins found in annelids and arthropods that are assembled from as many as 180 subunit into hexagonal bilayers []. Leghaemoglobin (legHb or symbiotic Hb): occurs in the root nodules of leguminous plants, where it facilitates the diffusion of oxygen to symbiotic bacteriods in order to promote nitrogen fixation. Non-symbiotic haemoglobin (NsHb): occurs in non-leguminous plants, and can be over-expressed in stressed plants []. Flavohaemoglobins (FHb): chimeric, with an N-terminal globin domain and a C-terminal ferredoxin reductase-like NAD/FAD-binding domain. FHb provides protection against nitric oxide via its C-terminal domain, which transfers electrons to haem in the globin []. Globin-coupled sensors: chimeric, with an N-terminal myoglobin-like domain and a C-terminal domain that resembles the cytoplasmic signalling domain of bacterial chemoreceptors. They bind oxygen, and act to initiate an aerotactic response or regulate gene expression [, ]. Protoglobin: a single domain globin found in archaea that is related to the N-terminal domain of globin-coupled sensors []. Truncated 2/2 globin: lack the first helix, giving them a 2-over-2 instead of the canonical 3-over-3 alpha-helical sandwich fold. Can be divided into three main groups (I, II and II) based on structural features []. This entry represents a group of haemoglobin-like proteins found in eubacteria, cyanobacteria, protozoa, algae and plants, but not in animals or yeast. These proteins have a truncated 2-over-2 rather than the canonical 3-over-3 alpha-helical sandwich fold []. This entry includes: HbN (or GlbN): a truncated haemoglobin-like protein that binds oxygen cooperatively with a very high affinity and a slow dissociation rate, which may exclude it from oxygen transport. It appears to be involved in bacterial nitric oxide detoxification and in nitrosative stress []. Cyanoglobin (or GlbN): a truncated haemoprotein found in cyanobacteria that has high oxygen affinity, and which appears to serve as part of a terminal oxidase, rather than as a respiratory pigment []. HbO (or GlbO): a truncated haemoglobin-like protein with a lower oxygen affinity than HbN. HbO associates with the bacterial cell membrane, where it significantly increases oxygen uptake over membranes lacking this protein. HbO appears to interact with a terminal oxidase, and could participate in an oxygen/electron-transfer process that facilitates oxygen transfer during aerobic metabolism []. Glb3: a nuclear-encoded truncated haemoglobin from plants that appears more closely related to HbO than HbN. Glb3 from Arabidopsis thaliana (Mouse-ear cress) exhibits an unusual concentration-independent binding of oxygen and carbon dioxide []. ; GO: 0019825 oxygen binding, 0015671 oxygen transport; PDB: 2BKM_B 1UVY_A 1DLW_A 2XYK_B 2IG3_A 2GKM_B 1S61_A 1S56_B 1RTE_B 2GLN_A ....
Probab=22.47 E-value=1.9e+02 Score=17.43 Aligned_cols=35 Identities=14% Similarity=0.267 Sum_probs=24.1
Q ss_pred CCCHHHHHHHHHHHHHHHhhcCchHHHHHHHHHHh
Q psy1421 12 DMSEEMQQDAVDCATQALEKYNIEKDIAAYIKKEF 46 (89)
Q Consensus 12 dM~~~mq~~i~~~~~~a~~~~~~~~~iA~~iK~~l 46 (89)
.++++.-+..++...+|+++...+.+..+.+.+.+
T Consensus 76 ~it~~~f~~~~~~~~~al~~~~v~~~~~~~~~~~~ 110 (120)
T PF01152_consen 76 GITEEHFDRWLELLKQALDELGVPEELIDELLARL 110 (120)
T ss_dssp -BBHHHHHHHHHHHHHHHHHTTCTHHHHHHHHHHH
T ss_pred CCCHHHHHHHHHHHHHHHHHhCCCHHHHHHHHHHH
Confidence 46677888889999999999886554444444433
No 108
>PF10438 Cyc-maltodext_C: Cyclo-malto-dextrinase C-terminal domain; InterPro: IPR019492 This domain is at the very C terminus of cyclo-malto-dextrinase proteins and consists of 8 beta strands, is largely globular and appears to help stabilise the active sites created by upstream domains, IPR015171 from INTERPRO, and IPR006047 from INTERPRO. Cyclo-malto-dextrinases hydrolyse cyclodextrans to maltose and glucose and catalyse trans-glycosylation of oligosaccharides to the C3-, C4- or C6-hydroxyl groups of various acceptor sugar molecules. ; PDB: 3EDK_B 3EDD_A 3EDJ_B 3EDE_A 1H3G_B 3EDF_B.
Probab=22.15 E-value=1.5e+02 Score=17.38 Aligned_cols=19 Identities=11% Similarity=0.448 Sum_probs=13.6
Q ss_pred EcCCcEEEEEe-CCEEEEEE
Q psy1421 68 HETRHFIYFYL-GQVAILLF 86 (89)
Q Consensus 68 h~~~~~~~f~~-~~~~~lv~ 86 (89)
=+.|-|+||+. ++..++|.
T Consensus 7 P~~gvYvYfR~~~~~tVmVi 26 (78)
T PF10438_consen 7 PQDGVYVYFRYYDGKTVMVI 26 (78)
T ss_dssp -BTTEEEEEEEESSEEEEEE
T ss_pred ccCCEEEEEEEcCCCEEEEE
Confidence 36788999986 67777663
No 109
>PF10302 DUF2407: DUF2407 ubiquitin-like domain; InterPro: IPR019413 This entry represents a family of proteins of unknown function found in fungi. They contain a characteristic GFDRL sequence motif.
Probab=22.03 E-value=44 Score=20.45 Aligned_cols=7 Identities=43% Similarity=1.108 Sum_probs=5.6
Q ss_pred eEEEEeC
Q psy1421 54 WHCIVGR 60 (89)
Q Consensus 54 Wh~vVG~ 60 (89)
=||+||.
T Consensus 90 IhCsIGd 96 (97)
T PF10302_consen 90 IHCSIGD 96 (97)
T ss_pred EEEeccC
Confidence 4999985
No 110
>PF09477 Type_III_YscG: Bacterial type II secretion system chaperone protein (type_III_yscG); InterPro: IPR013348 YscG is a molecular chaperone for YscE, where both are part of the type III secretion system that in Yersinia is designated Ysc (Yersinia secretion). The secretion system delivers effector proteins, designated Yops (Yersinia outer proteins), in Yersinia. This entry consists of YscG from Yersinia, and functionally equivalent type III secretion proteins in other species: e.g. AscG in Aeromonas and LscG in Photorhabdus luminescens.; GO: 0009405 pathogenesis; PDB: 3PH0_D 2UWJ_G 2P58_C.
Probab=21.97 E-value=2.3e+02 Score=18.16 Aligned_cols=32 Identities=19% Similarity=0.253 Sum_probs=20.5
Q ss_pred CCHHHHHHHHHHHHHHHhhcCch--HHHHHHHHH
Q psy1421 13 MSEEMQQDAVDCATQALEKYNIE--KDIAAYIKK 44 (89)
Q Consensus 13 M~~~mq~~i~~~~~~a~~~~~~~--~~iA~~iK~ 44 (89)
|+.+++....++|.-|-..+-.+ +.||..+++
T Consensus 1 M~~~l~~lLAElAL~atG~HcH~EA~tIa~wL~~ 34 (116)
T PF09477_consen 1 MNRELRRLLAELALMATGHHCHQEANTIADWLEQ 34 (116)
T ss_dssp --HHHHHHHHHHHHHHHTTT-HHHHHHHHHHHHH
T ss_pred CchHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHh
Confidence 78899999999999887766532 345555443
No 111
>TIGR01738 bioH putative pimeloyl-BioC--CoA transferase BioH. This CoA-binding enzyme is required for the production of pimeloyl-coenzyme A, the substrate of the BioF protein early in the biosynthesis of biotin. Its exact function is unknown, but is proposed in ref 2. This enzyme belongs to the alpha/beta hydrolase fold family (pfam model pfam00561). Members of this family are restricted to the Proteobacteria.
Probab=21.96 E-value=56 Score=20.89 Aligned_cols=28 Identities=21% Similarity=0.184 Sum_probs=19.7
Q ss_pred hHHHHHHHHHHhcccCCCceEEEEeCCceeeEE
Q psy1421 35 EKDIAAYIKKEFDKKYNPTWHCIVGRNFGSYVT 67 (89)
Q Consensus 35 ~~~iA~~iK~~lD~~yG~~Wh~vVG~~fg~~~t 67 (89)
..++++.+...+++ --.+||.|+|+.+.
T Consensus 52 ~~~~~~~~~~~~~~-----~~~lvG~S~Gg~~a 79 (245)
T TIGR01738 52 LADAAEAIAAQAPD-----PAIWLGWSLGGLVA 79 (245)
T ss_pred HHHHHHHHHHhCCC-----CeEEEEEcHHHHHH
Confidence 35677778777663 24689999998664
No 112
>PRK00466 acetyl-lysine deacetylase; Validated
Probab=21.87 E-value=3.1e+02 Score=19.72 Aligned_cols=40 Identities=20% Similarity=0.241 Sum_probs=26.6
Q ss_pred HHHHHHHHHHHHHHHhhc---CchHHHHHHHHHHhcccCCCceE
Q psy1421 15 EEMQQDAVDCATQALEKY---NIEKDIAAYIKKEFDKKYNPTWH 55 (89)
Q Consensus 15 ~~mq~~i~~~~~~a~~~~---~~~~~iA~~iK~~lD~~yG~~Wh 55 (89)
++|++++++...+.++-. .++.++|++|+++|+ +.|-.++
T Consensus 6 ~~~~~~~~~~l~~lv~i~s~s~~e~~~~~~l~~~l~-~~g~~~~ 48 (346)
T PRK00466 6 ELVKQKAKELLLDLLSIYTPSGNETNATKFFEKISN-ELNLKLE 48 (346)
T ss_pred HHHHHHHHHHHHHHhcCCCCCCCHHHHHHHHHHHHH-HcCCeEE
Confidence 456667777666665433 236789999999998 4565444
No 113
>cd01577 IPMI_Swivel Aconatase-like swivel domain of 3-isopropylmalate dehydratase and related uncharacterized proteins. 3-isopropylmalate dehydratase catalyzes the isomerization between 2-isopropylmalate and 3-isopropylmalate, via the formation of 2-isopropylmaleate 3-isopropylmalate. IPMI is involved in fungal and bacterial leucine biosynthesis and is also found in eukaryotes. This is the aconitase-like swivel domain, which is believed to undergo swivelling conformational change in the enzyme mechanism.
Probab=21.73 E-value=47 Score=20.24 Aligned_cols=15 Identities=27% Similarity=0.377 Sum_probs=11.0
Q ss_pred ceEEEEeCCceeeEE
Q psy1421 53 TWHCIVGRNFGSYVT 67 (89)
Q Consensus 53 ~Wh~vVG~~fg~~~t 67 (89)
.+=+|.|++||+=-|
T Consensus 18 ~~ilVaG~nfG~GSS 32 (91)
T cd01577 18 GDIIVAGKNFGCGSS 32 (91)
T ss_pred CCEEEecCcccCCCc
Confidence 567888889986433
No 114
>COG1911 RPL30 Ribosomal protein L30E [Translation, ribosomal structure and biogenesis]
Probab=21.71 E-value=2.1e+02 Score=17.83 Aligned_cols=27 Identities=19% Similarity=0.265 Sum_probs=22.4
Q ss_pred CCCccEEEeCCCCHHHHHHHHHHHHHH
Q psy1421 2 SDRKAVIKNADMSEEMQQDAVDCATQA 28 (89)
Q Consensus 2 ~~~~~~v~~~dM~~~mq~~i~~~~~~a 28 (89)
...+..|+.++.|++.++++--.|+-+
T Consensus 34 gkaKliiiAsN~P~~~k~~ieyYAkLs 60 (100)
T COG1911 34 GKAKLIIIASNCPKELKEDIEYYAKLS 60 (100)
T ss_pred CCCcEEEEecCCCHHHHHHHHHHHHHc
Confidence 457889999999999999997776654
No 115
>cd04518 TBP_archaea archaeal TATA box binding protein (TBP): TBPs are transcription factors present in archaea and eukaryotes, that recognize promoters and initiate transcription. TBP has been shown to be an essential component of three different transcription initiation complexes: SL1, TFIID and TFIIIB, directing transcription by RNA polymerases I, II and III, respectively. TBP binds directly to the TATA box promoter element, where it nucleates polymerase assembly, thus defining the transcription start site. TBP's binding in the minor groove induces a dramatic DNA bending while its own structure barely changes. The conserved core domain of TBP, which binds to the TATA box, has a bipartite structure, with intramolecular symmetry generating a saddle-shaped structure that sits astride the DNA.
Probab=21.64 E-value=1.2e+02 Score=20.53 Aligned_cols=25 Identities=32% Similarity=0.475 Sum_probs=19.6
Q ss_pred eEEEcCCc--EEEEEe--CCEEEEEEeeC
Q psy1421 65 YVTHETRH--FIYFYL--GQVAILLFKSG 89 (89)
Q Consensus 65 ~~th~~~~--~~~f~~--~~~~~lv~kt~ 89 (89)
...|+|.. .+-|++ ....+|||.||
T Consensus 119 ~~~YePe~fpglvyR~~~pk~~~lIF~SG 147 (174)
T cd04518 119 NAEYEPEQFPGLVYRLDEPKVVLLLFSSG 147 (174)
T ss_pred CCccCcccCceEEEEecCCcEEEEEeCCC
Confidence 57888886 466777 47899999987
No 116
>cd04196 GT_2_like_d Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=21.54 E-value=1.6e+02 Score=18.71 Aligned_cols=8 Identities=13% Similarity=0.061 Sum_probs=3.5
Q ss_pred ceEEEEeC
Q psy1421 53 TWHCIVGR 60 (89)
Q Consensus 53 ~Wh~vVG~ 60 (89)
.|.+||..
T Consensus 27 ~~eiiVvd 34 (214)
T cd04196 27 NDELIISD 34 (214)
T ss_pred CeEEEEEe
Confidence 34444443
No 117
>PRK13601 putative L7Ae-like ribosomal protein; Provisional
Probab=21.52 E-value=1.9e+02 Score=17.05 Aligned_cols=25 Identities=12% Similarity=0.106 Sum_probs=21.0
Q ss_pred CccEEEeCCCCHHHHHHHHHHHHHH
Q psy1421 4 RKAVIKNADMSEEMQQDAVDCATQA 28 (89)
Q Consensus 4 ~~~~v~~~dM~~~mq~~i~~~~~~a 28 (89)
-+..|+..|-++..+..+.+.|...
T Consensus 25 akLViiA~Da~~~~~k~i~~~c~~~ 49 (82)
T PRK13601 25 VLQVYIAKDAEEHVTKKIKELCEEK 49 (82)
T ss_pred eeEEEEeCCCCHHHHHHHHHHHHhC
Confidence 4578899999999999998887754
No 118
>cd01674 Homoaconitase_Swivel Homoaconitase swivel domain. This family includes homoaconitase and other uncharacterized proteins of the Aconitase family. Homoaconitase is part of an unusual lysine biosynthesis pathway found only in filamentous fungi, in which lysine is synthesized via the alpha-aminoadipate pathway. In this pathway, homoaconitase catalyzes the conversion of cis-homoaconitic acid into homoisocitric acid. The reaction mechanism is believed to be similar to that of other aconitases. This is the swivel domain, which is believed to undergo swivelling conformational change in the enzyme mechanism.
Probab=21.49 E-value=94 Score=20.27 Aligned_cols=34 Identities=18% Similarity=0.324 Sum_probs=18.8
Q ss_pred hHHHHHHHHHHhcccC----CCceEEEEeCCceeeEEE
Q psy1421 35 EKDIAAYIKKEFDKKY----NPTWHCIVGRNFGSYVTH 68 (89)
Q Consensus 35 ~~~iA~~iK~~lD~~y----G~~Wh~vVG~~fg~~~th 68 (89)
..++|++.-+.+|..| .+.=-+|.|+|||+=-|.
T Consensus 24 ~e~la~~~~e~~dp~f~~~v~~gdilVaG~nFG~GSSR 61 (129)
T cd01674 24 PEKMAEVCMENYDSEFSTKTKQGDILVSGFNFGTGSSR 61 (129)
T ss_pred HHHHHHhhcccCCchhhhcCCCCCEEEeCCccCCCCcH
Confidence 3456666666666544 233234455589965443
No 119
>PRK08775 homoserine O-acetyltransferase; Provisional
Probab=21.44 E-value=61 Score=23.30 Aligned_cols=32 Identities=28% Similarity=0.309 Sum_probs=23.2
Q ss_pred HHHHHHHHHHhcccCCCceEEEEeCCceeeEE
Q psy1421 36 KDIAAYIKKEFDKKYNPTWHCIVGRNFGSYVT 67 (89)
Q Consensus 36 ~~iA~~iK~~lD~~yG~~Wh~vVG~~fg~~~t 67 (89)
.+.|+.+...+|.---..-+++||.|+|+.+.
T Consensus 121 ~~~a~dl~~ll~~l~l~~~~~lvG~SmGG~vA 152 (343)
T PRK08775 121 ADQADAIALLLDALGIARLHAFVGYSYGALVG 152 (343)
T ss_pred HHHHHHHHHHHHHcCCCcceEEEEECHHHHHH
Confidence 46788888888865333457899999997653
No 120
>cd00652 TBP_TLF TATA box binding protein (TBP): Present in archaea and eukaryotes, TBPs are transcription factors that recognize promoters and initiate transcription. TBP has been shown to be an essential component of three different transcription initiation complexes: SL1, TFIID and TFIIIB, directing transcription by RNA polymerases I, II and III, respectively. TBP binds directly to the TATA box promoter element, where it nucleates polymerase assembly, thus defining the transcription start site. TBP's binding in the minor groove induces a dramatic DNA bending while its own structure barely changes. The conserved core domain of TBP, which binds to the TATA box, has a bipartite structure, with intramolecular symmetry generating a saddle-shaped structure that sits astride the DNA. New members of the TBP family, called TBP-like proteins (TBLP, TLF, TLP) or TBP-related factors (TRF1, TRF2,TRP), are similar to the core domain of TBPs, with identical or chemically similar amino acids at many
Probab=21.01 E-value=1.8e+02 Score=19.44 Aligned_cols=26 Identities=23% Similarity=0.392 Sum_probs=20.0
Q ss_pred eeEEEcCCc--EEEEEe--CCEEEEEEeeC
Q psy1421 64 SYVTHETRH--FIYFYL--GQVAILLFKSG 89 (89)
Q Consensus 64 ~~~th~~~~--~~~f~~--~~~~~lv~kt~ 89 (89)
..++|+|.. .+-|++ ....+|||.||
T Consensus 119 ~~~~YePe~fpgli~r~~~pk~t~lIF~sG 148 (174)
T cd00652 119 ENASYEPELFPGLIYRMDEPKVVLLIFVSG 148 (174)
T ss_pred cccEECCccCceEEEEecCCcEEEEEEcCC
Confidence 478899986 455666 47899999987
No 121
>COG2101 SPT15 TATA-box binding protein (TBP), component of TFIID and TFIIIB [Transcription]
Probab=20.97 E-value=1.1e+02 Score=21.20 Aligned_cols=16 Identities=38% Similarity=0.746 Sum_probs=11.5
Q ss_pred EEEEe--CCEEEEEEeeC
Q psy1421 74 IYFYL--GQVAILLFKSG 89 (89)
Q Consensus 74 ~~f~~--~~~~~lv~kt~ 89 (89)
+.+++ .+.++|||+||
T Consensus 45 lv~Rl~ePk~a~LIF~SG 62 (185)
T COG2101 45 LVYRLEEPKTAALIFRSG 62 (185)
T ss_pred eEEEecCCcceEEEEecC
Confidence 44555 57789999887
No 122
>cd05392 RasGAP_Neurofibromin_like Neurofibromin-like proteins include the Saccharomyces cerevisiae RasGAP proteins Ira1 and Ira2, the closest homolog of neurofibromin, which is responsible for the human autosomal dominant disease neurofibromatosis type I (NF1). The RasGAP Ira1/2 proteins are negative regulators of the Ras-cAMP signaling pathway and conserved from yeast to human. In yeast Ras proteins are activated by GEFs, and inhibited by two GAPs, Ira1 and Ira2. Ras proteins activate the cAMP/protein kinase A (PKA) pathway, which controls metabolism, stress resistance, growth, and meiosis. Recent studies showed that the kelch proteins Gpb1 and Gpb2 inhibit Ras activity via association with Ira1 and Ira2. Gpb1/2 bind to a conserved C-terminal domain of Ira1/2, and loss of Gpb1/2 results in a destabilization of Ira1 and Ira2, leading to elevated levels of Ras2-GTP and uninhibited cAMP-PKA signaling. Since the Gpb1/2 binding domain on Ira1/2 is conserved in the human neurofibromin prote
Probab=20.73 E-value=96 Score=22.60 Aligned_cols=31 Identities=19% Similarity=0.325 Sum_probs=21.7
Q ss_pred HhhcCc-hHHHHHHHHHHhcccCCCceEEEEe
Q psy1421 29 LEKYNI-EKDIAAYIKKEFDKKYNPTWHCIVG 59 (89)
Q Consensus 29 ~~~~~~-~~~iA~~iK~~lD~~yG~~Wh~vVG 59 (89)
+...+. .+.+|+.|++.+.++|...+.-+||
T Consensus 143 ~~~~P~~lr~i~~~l~~~v~~kfp~~~~~~Vg 174 (323)
T cd05392 143 LDRFPPELREICHHIYEVVSEKFPDSALSAVG 174 (323)
T ss_pred HHhCCHHHHHHHHHHHHHHHHHCCCchHHHHH
Confidence 333443 4689999999999999875544444
No 123
>PF12697 Abhydrolase_6: Alpha/beta hydrolase family; PDB: 3LLC_A 3A2N_E 3A2M_A 3A2L_A 3AFI_F 3C5V_A 3C5W_P 3E0X_A 2ZJF_A 3QYJ_A ....
Probab=20.70 E-value=63 Score=20.20 Aligned_cols=30 Identities=20% Similarity=0.263 Sum_probs=21.3
Q ss_pred HHHHHHHHHHhcccCCCceEEEEeCCceeeE
Q psy1421 36 KDIAAYIKKEFDKKYNPTWHCIVGRNFGSYV 66 (89)
Q Consensus 36 ~~iA~~iK~~lD~~yG~~Wh~vVG~~fg~~~ 66 (89)
.+.|+.|.+.+|+.-.+..+ +||.|+|+.+
T Consensus 50 ~~~~~~l~~~l~~~~~~~~~-lvG~S~Gg~~ 79 (228)
T PF12697_consen 50 EDYAEDLAELLDALGIKKVI-LVGHSMGGMI 79 (228)
T ss_dssp HHHHHHHHHHHHHTTTSSEE-EEEETHHHHH
T ss_pred hhhhhhhhhccccccccccc-cccccccccc
Confidence 45677777777776656654 7899999764
No 124
>PRK13675 GTP cyclohydrolase; Provisional
Probab=20.56 E-value=1e+02 Score=22.79 Aligned_cols=43 Identities=12% Similarity=-0.017 Sum_probs=34.0
Q ss_pred HHHHHHHHHHhcccCC----CceEEEEeCCceeeEEEcCCcEEEEEe
Q psy1421 36 KDIAAYIKKEFDKKYN----PTWHCIVGRNFGSYVTHETRHFIYFYL 78 (89)
Q Consensus 36 ~~iA~~iK~~lD~~yG----~~Wh~vVG~~fg~~~th~~~~~~~f~~ 78 (89)
.|+|+.|+..|.+.|+ -.|--|-.+||-|-..|..-..+...+
T Consensus 253 ED~vR~ia~~l~~~~~~~p~~~~~~V~~en~ESIH~HnA~A~~~~~~ 299 (308)
T PRK13675 253 EDCVREMAKKVVEEFPHLPDDAVVTVRQINEESIHRHNAFAERVATM 299 (308)
T ss_pred HHHHHHHHHHHhhhcccCCCccEEEEEEeccccccCcccEEEEehhH
Confidence 4899999999999876 578888889999988887655554333
No 125
>PF00271 Helicase_C: Helicase conserved C-terminal domain; InterPro: IPR001650 The domain, which defines this group of proteins is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase. The eukaryotic translation initiation factor 4A (eIF4A) is a member of the DEA(D/H)-box RNA helicase family This is a diverse group of proteins that couples an ATPase activity to RNA binding and unwinding. The structure of the carboxyl-terminal domain of eIF4A has been determined to 1.75 A resolution; it has a parallel alpha-beta topology that superimposes, with minor variations, on the structures and conserved motifs of the equivalent domain in other, distantly related helicases [].; GO: 0003676 nucleic acid binding, 0004386 helicase activity, 0005524 ATP binding; PDB: 2Z83_A 2JGN_C 2I4I_A 2BMF_A 2BHR_B 1WP9_E 2WAX_C 2WAY_C 3JUX_A 3DIN_B ....
Probab=20.41 E-value=1.2e+02 Score=16.44 Aligned_cols=24 Identities=25% Similarity=0.289 Sum_probs=18.2
Q ss_pred CccEEEeCCCCHHHHHHHHHHHHH
Q psy1421 4 RKAVIKNADMSEEMQQDAVDCATQ 27 (89)
Q Consensus 4 ~~~~v~~~dM~~~mq~~i~~~~~~ 27 (89)
.++.++.++|+.+.+.++++....
T Consensus 8 ~~~~~i~~~~~~~~r~~~~~~f~~ 31 (78)
T PF00271_consen 8 IKVAIIHGDMSQKERQEILKKFNS 31 (78)
T ss_dssp SSEEEESTTSHHHHHHHHHHHHHT
T ss_pred CcEEEEECCCCHHHHHHHHHHhhc
Confidence 456778889999999888775543
No 126
>PF00817 IMS: impB/mucB/samB family; InterPro: IPR001126 In Escherichia coli, UV and many chemicals appear to cause mutagenesis by a process of translesion synthesis that requires DNA polymerase III and the SOS-regulated proteins UmuD, UmuC and RecA. This machinery allows the replication to continue through DNA lesion, and therefore avoid lethal interruption of DNA replication after DNA damage []. UmuC is a well conserved protein in prokaryotes, with a homologue in yeast species. Proteins currently known to belong to this family are listed below: E. coli MucB protein. Plasmid-born analogue of the UmuC protein. Saccharomyces cerevisiae (Baker's yeast) Rev1 protein. Homologue of UmuC also required for normal induction of mutations by physical and chemical agents. Salmonella typhimurium ImpB protein. Plasmid-born analogue of the UmuC protein. Bacterial UmuC protein. E. coli DNA-damage-inducible protein P (DinP). S. typhimurium SamB homologue of UmuC plasmid associated. ; GO: 0003684 damaged DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006281 DNA repair; PDB: 3BJY_A 2AQ4_A 3OSP_A 4DL7_A 4DL6_A 3TQ1_A 3MR2_A 4EEY_A 3MR3_A 4DL4_A ....
Probab=20.30 E-value=1.2e+02 Score=19.25 Aligned_cols=26 Identities=12% Similarity=0.125 Sum_probs=21.1
Q ss_pred HHHHHHHHHHhcccCCCceEEEEeCC
Q psy1421 36 KDIAAYIKKEFDKKYNPTWHCIVGRN 61 (89)
Q Consensus 36 ~~iA~~iK~~lD~~yG~~Wh~vVG~~ 61 (89)
.++++.|++.+-++.|=++++-||.+
T Consensus 115 ~~~~~~l~~~i~~~~gl~~s~Gia~~ 140 (149)
T PF00817_consen 115 EALARRLRQAIAEETGLTASIGIAPN 140 (149)
T ss_dssp HHHHHHHHHHHHHHHS--EEEEEESS
T ss_pred HHHHHHHHHHHHHHhCCEEEEEEeCC
Confidence 46899999999999998888888876
No 127
>PF10047 DUF2281: Protein of unknown function (DUF2281); InterPro: IPR018739 This domain is found in putative uncharacterised proteins, though some proteins contaning this domain are described as a transcritional regulator of the Xre family.
Probab=20.13 E-value=1.7e+02 Score=16.35 Aligned_cols=22 Identities=36% Similarity=0.604 Sum_probs=18.2
Q ss_pred CCCCHHHHHHHHHHHHHHHhhc
Q psy1421 11 ADMSEEMQQDAVDCATQALEKY 32 (89)
Q Consensus 11 ~dM~~~mq~~i~~~~~~a~~~~ 32 (89)
..+|++.|.+|++-+.-.+.++
T Consensus 10 ~~LP~~~~~Evldfi~fL~~k~ 31 (66)
T PF10047_consen 10 QQLPEELQQEVLDFIEFLLQKY 31 (66)
T ss_pred HHCCHHHHHHHHHHHHHHHHhc
Confidence 4589999999999888777776
No 128
>TIGR03229 benzo_1_2_benA benzoate 1,2-dioxygenase, large subunit. Benzoate 1,2-dioxygenase (EC 1.14.12.10) belongs to the larger family of aromatic ring-hydroxylating dioxygenases. Members of this family all act on benzoate, but may have additional activities on various benozate analogs. This model describes the large subunit. Between the trusted and noise cutoffs are similar enzymes, likely to act on benzoate but perhaps best identified according to some other activity, such as 2-chlorobenzoate 1,2-dioxygenase (1.14.12.13).
Probab=20.11 E-value=2.5e+02 Score=21.73 Aligned_cols=43 Identities=12% Similarity=0.413 Sum_probs=30.5
Q ss_pred HHHHhcccCCCceEEEEeCCceeeEEEcCCcEEEEEeCCEEEEEEee
Q psy1421 42 IKKEFDKKYNPTWHCIVGRNFGSYVTHETRHFIYFYLGQVAILLFKS 88 (89)
Q Consensus 42 iK~~lD~~yG~~Wh~vVG~~fg~~~th~~~~~~~f~~~~~~~lv~kt 88 (89)
...++++-|.+.|+++-=.+ .+ -+++.|+-+.+++..++|+|.
T Consensus 27 f~~E~~~IF~~~W~~v~~~s---el-p~~gd~~t~~~~~~~vvv~R~ 69 (433)
T TIGR03229 27 FDLEMKHIFEGNWIYLAHES---QI-PNNNDYYTTYMGRQPIFIARN 69 (433)
T ss_pred HHHHHHHHhhhCCEEEEEHH---HC-CCCCCeEEEEECCeEEEEEEC
Confidence 45788889999999876432 11 255667777788888888874
No 129
>PF02806 Alpha-amylase_C: Alpha amylase, C-terminal all-beta domain; InterPro: IPR006048 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Alpha-amylase is classified as family 13 of the glycosyl hydrolases and is present in archaea, bacteria, plants and animals. Alpha-amylase is an essential enzyme in alpha-glucan metabolism, acting to catalyse the hydrolysis of alpha-1,4-glucosidic bonds of glycogen, starch and related polysaccharides. Although all alpha-amylases possess the same catalytic function, they can vary with respect to sequence. In general, they are composed of three domains: a TIM barrel containing the active site residues and chloride ion-binding site (domain A), a long loop region inserted between the third beta strand and the alpha-helix of domain A that contains calcium-binding site(s) (domain B), and a C-terminal beta-sheet domain that appears to show some variability in sequence and length between amylases (domain C) []. Amylases have at least one conserved calcium-binding site, as calcium is essential for the stability of the enzyme. The chloride-binding functions to activate the enzyme, which acts by a two-step mechanism involving a catalytic nucleophile base (usually an Asp) and a catalytic proton donor (usually a Glu) that are responsible for the formation of the beta-linked glycosyl-enzyme intermediate. This entry represents the all-beta domain that is found in several alpha-amylases, usually at the C terminus, and which forms a Greek key beta-barrel fold in these enzymes []. More information about this protein can be found at Protein of the Month: alpha-Amylase [].; GO: 0003824 catalytic activity, 0043169 cation binding, 0005975 carbohydrate metabolic process; PDB: 1TCM_A 1CXL_A 1PJ9_A 1OT2_A 2DIJ_A 1CGV_A 1CXK_A 1PEZ_A 1CGX_A 2CXG_A ....
Probab=20.06 E-value=1.5e+02 Score=17.05 Aligned_cols=38 Identities=18% Similarity=0.266 Sum_probs=23.6
Q ss_pred CCceEEEEeC---Cceee-------EEEcCCcEEEEEeCCEEEEEEee
Q psy1421 51 NPTWHCIVGR---NFGSY-------VTHETRHFIYFYLGQVAILLFKS 88 (89)
Q Consensus 51 G~~Wh~vVG~---~fg~~-------~th~~~~~~~f~~~~~~~lv~kt 88 (89)
++.|+.++.. .||.. ++...+.-+.+.+..++.+|||-
T Consensus 46 ~g~y~~vlnsd~~~~~g~~~~~~~~v~~~~~g~~~~~lp~~s~~vl~~ 93 (95)
T PF02806_consen 46 AGRYKEVLNSDDEEYGGSGKGNSGEVTVDSNGRITVTLPPYSALVLKL 93 (95)
T ss_dssp SEEEEETTTTTCEEEEESSCSETSEEEEETTSEEEEEESTTEEEEEEE
T ss_pred cceeeEEeCCCccEECCcccccCceEEEeeCCEEEEEECCCEEEEEEE
Confidence 4566666655 25532 33323334789999999999874
Done!