Query 034366
Match_columns 96
No_of_seqs 115 out of 546
Neff 7.0
Searched_HMMs 46136
Date Fri Mar 29 12:41:01 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/034366.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/034366hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG3430 Dynein light chain typ 100.0 5.8E-43 1.3E-47 217.9 11.4 90 1-91 1-90 (90)
2 PTZ00059 dynein light chain; P 100.0 2.5E-42 5.3E-47 217.9 12.2 89 1-91 2-90 (90)
3 PLN03058 dynein light chain ty 100.0 6.8E-41 1.5E-45 222.5 12.5 93 3-95 33-126 (128)
4 PF01221 Dynein_light: Dynein 100.0 3.4E-41 7.4E-46 211.8 9.9 89 1-91 1-89 (89)
5 PF04155 Ground-like: Ground-l 97.7 0.00068 1.5E-08 41.0 8.3 53 36-89 23-76 (76)
6 PF05075 DUF684: Protein of un 92.8 1.5 3.3E-05 33.4 9.2 68 24-91 175-252 (345)
7 PF15650 Tox-REase-9: Restrict 80.8 1.4 3.1E-05 27.6 2.0 49 13-61 33-87 (89)
8 PF12006 DUF3500: Protein of u 79.5 14 0.00031 27.9 7.4 45 8-52 216-261 (313)
9 TIGR00237 xseA exodeoxyribonuc 75.1 8 0.00017 30.4 5.2 48 38-91 3-56 (432)
10 PF06150 ChaB: ChaB; InterPro 70.1 15 0.00032 21.0 4.3 46 11-57 7-54 (57)
11 KOG1573 Aldehyde reductase [Ge 68.2 31 0.00068 24.4 6.3 61 11-73 91-161 (204)
12 KOG2451 Aldehyde dehydrogenase 66.9 14 0.0003 29.6 4.8 46 4-49 49-94 (503)
13 PRK00286 xseA exodeoxyribonucl 65.4 20 0.00043 27.9 5.5 48 38-91 9-62 (438)
14 PF07742 BTG: BTG family; Int 63.2 7.3 0.00016 25.5 2.3 21 37-57 27-47 (118)
15 PF13742 tRNA_anti_2: OB-fold 62.5 31 0.00068 21.4 5.1 50 38-91 5-60 (99)
16 PF12652 CotJB: CotJB protein; 62.0 4.3 9.4E-05 24.8 1.0 14 42-55 43-56 (78)
17 cd03472 Rieske_RO_Alpha_BPDO_l 61.9 21 0.00044 23.3 4.3 40 48-92 1-41 (128)
18 PF12362 DUF3646: DNA polymera 61.2 11 0.00023 24.7 2.8 28 38-65 49-76 (117)
19 PF08776 VASP_tetra: VASP tetr 60.3 23 0.0005 19.0 3.5 32 15-50 6-37 (40)
20 TIGR02084 leud 3-isopropylmala 57.3 10 0.00022 26.0 2.3 33 38-70 27-63 (156)
21 PF10703 MoaF: Molybdenum cofa 52.3 29 0.00062 25.9 4.1 37 55-94 182-219 (265)
22 cd01579 AcnA_Bact_Swivel Bacte 51.6 13 0.00028 24.5 2.0 18 52-69 46-63 (121)
23 cd03538 Rieske_RO_Alpha_AntDO 51.2 54 0.0012 21.8 5.0 45 43-92 10-55 (146)
24 PF14900 DUF4493: Domain of un 48.5 48 0.001 23.5 4.7 37 46-82 119-160 (235)
25 PRK14023 homoaconitate hydrata 48.2 23 0.0005 24.5 2.9 33 38-70 29-65 (166)
26 PF11858 DUF3378: Domain of un 46.8 30 0.00064 21.1 3.0 21 72-92 29-49 (81)
27 PF12550 GCR1_C: Transcription 45.8 6.7 0.00014 23.7 -0.0 12 46-57 29-40 (81)
28 PF06457 Ectatomin: Ectatomin; 45.1 9.6 0.00021 19.5 0.5 13 38-50 22-34 (34)
29 COG1570 XseA Exonuclease VII, 44.1 35 0.00075 27.3 3.7 50 38-91 9-62 (440)
30 PRK06489 hypothetical protein; 38.5 21 0.00045 26.7 1.6 33 38-70 136-169 (360)
31 smart00099 btg1 tob/btg1 famil 38.1 38 0.00082 22.0 2.6 39 19-57 6-47 (108)
32 PF08958 DUF1871: Domain of un 37.7 31 0.00068 21.0 2.1 29 24-54 26-54 (79)
33 cd05392 RasGAP_Neurofibromin_l 37.4 25 0.00055 26.3 1.9 41 20-61 134-174 (323)
34 PF05184 SapB_1: Saposin-like 37.3 57 0.0012 16.2 3.5 30 16-47 5-34 (39)
35 PF15571 Imm25: Immunity prote 36.5 27 0.00059 23.2 1.7 24 37-60 17-40 (124)
36 PF10655 DUF2482: Hypothetical 35.3 22 0.00048 22.7 1.1 48 10-65 5-65 (100)
37 PF00249 Myb_DNA-binding: Myb- 35.3 51 0.0011 17.4 2.5 23 23-45 10-32 (48)
38 TIGR01916 F420_cofE F420-0:gam 35.2 55 0.0012 24.2 3.3 34 35-68 122-155 (243)
39 PF10925 DUF2680: Protein of u 34.6 91 0.002 17.8 3.6 15 38-52 35-49 (59)
40 cd03545 Rieske_RO_Alpha_OHBDO_ 34.3 1.4E+02 0.0031 19.8 5.4 45 42-91 11-57 (150)
41 PF08015 Pheromone: Fungal mat 34.3 22 0.00049 20.7 1.0 13 48-60 55-69 (69)
42 KOG3252 Uncharacterized conser 34.2 54 0.0012 23.7 3.0 32 14-46 3-34 (217)
43 TIGR03849 arch_ComA phosphosul 34.1 1.5E+02 0.0032 21.8 5.4 48 11-61 95-154 (237)
44 cd00197 VHS_ENTH_ANTH VHS, ENT 33.4 37 0.00081 21.2 2.0 22 35-59 33-54 (115)
45 PRK10425 DNase TatD; Provision 33.3 1.3E+02 0.0028 21.8 5.1 49 14-62 102-157 (258)
46 PF08006 DUF1700: Protein of u 33.2 1.2E+02 0.0026 20.6 4.6 39 11-49 15-65 (181)
47 PRK13293 F420-0--gamma-glutamy 33.1 61 0.0013 23.9 3.3 33 35-67 123-155 (245)
48 KOG2130 Phosphatidylserine-spe 32.9 37 0.00079 26.5 2.2 18 53-70 277-294 (407)
49 PF06925 MGDG_synth: Monogalac 32.8 51 0.0011 22.0 2.7 27 38-64 2-28 (169)
50 PF06057 VirJ: Bacterial virul 32.5 47 0.001 23.6 2.5 29 39-68 53-81 (192)
51 PF09477 Type_III_YscG: Bacter 32.5 66 0.0014 21.2 3.0 34 13-46 1-34 (116)
52 TIGR01738 bioH putative pimelo 32.3 30 0.00066 22.7 1.5 29 36-69 51-79 (245)
53 PRK13007 succinyl-diaminopimel 32.1 2E+02 0.0044 21.0 6.7 39 13-51 1-40 (352)
54 PF09868 DUF2095: Uncharacteri 32.0 1E+02 0.0022 20.5 3.9 17 38-54 95-115 (128)
55 cd05498 Bromo_Brdt_II_like Bro 31.6 1.3E+02 0.0027 18.5 4.7 38 15-52 64-102 (102)
56 PF00352 TBP: Transcription fa 31.3 72 0.0016 19.1 2.9 26 67-92 30-59 (86)
57 PF06840 DUF1241: Protein of u 31.2 35 0.00076 23.5 1.7 14 37-50 122-135 (154)
58 PF01739 CheR: CheR methyltran 31.2 23 0.0005 24.9 0.8 62 5-74 64-126 (196)
59 cd01674 Homoaconitase_Swivel H 30.6 54 0.0012 22.0 2.4 32 38-70 25-61 (129)
60 cd01578 AcnA_Mitochon_Swivel M 30.2 29 0.00064 23.8 1.1 19 52-70 67-85 (149)
61 PF06884 DUF1264: Protein of u 30.1 39 0.00084 23.7 1.7 19 40-58 98-116 (171)
62 COG1999 Uncharacterized protei 29.6 1.1E+02 0.0024 21.5 4.1 56 38-93 116-192 (207)
63 PF01996 F420_ligase: F420-0:G 29.4 40 0.00086 24.3 1.8 32 35-66 130-161 (228)
64 PF06202 GDE_C: Amylo-alpha-1, 29.0 71 0.0015 24.5 3.2 17 36-52 193-209 (370)
65 PF01545 Cation_efflux: Cation 28.9 62 0.0014 23.1 2.8 23 38-60 254-276 (284)
66 cd05135 RasGAP_RASAL Ras GTPas 28.9 96 0.0021 23.8 3.9 35 22-57 159-193 (333)
67 cd05391 RasGAP_p120GAP p120GAP 28.8 39 0.00085 25.7 1.7 37 20-57 134-170 (315)
68 PF01743 PolyA_pol: Poly A pol 28.5 1.6E+02 0.0036 18.8 4.8 37 41-79 28-64 (126)
69 PF10440 WIYLD: Ubiquitin-bind 28.3 64 0.0014 19.1 2.2 42 22-65 11-52 (65)
70 PF14372 DUF4413: Domain of un 27.8 1E+02 0.0022 19.1 3.3 29 25-53 19-49 (101)
71 COG1352 CheR Methylase of chem 27.7 67 0.0015 23.8 2.8 49 4-58 128-178 (268)
72 PF10302 DUF2407: DUF2407 ubiq 27.0 31 0.00067 21.7 0.8 7 56-62 90-96 (97)
73 KOG1569 50S ribosomal protein 26.8 59 0.0013 25.0 2.3 28 38-65 226-261 (323)
74 TIGR02087 LEUD_arch 3-isopropy 26.7 47 0.001 22.7 1.7 32 38-69 27-62 (154)
75 COG1498 SIK1 Protein implicate 26.2 81 0.0018 25.0 3.1 40 9-50 200-239 (395)
76 PRK08775 homoserine O-acetyltr 25.9 45 0.00098 24.6 1.6 33 38-70 121-153 (343)
77 PF08908 DUF1852: Domain of un 25.9 35 0.00076 25.8 1.0 23 47-69 98-120 (322)
78 PRK09111 DNA polymerase III su 25.8 70 0.0015 26.4 2.8 28 38-65 505-532 (598)
79 PF02283 CobU: Cobinamide kina 25.8 77 0.0017 21.6 2.6 23 38-61 37-59 (167)
80 PF12697 Abhydrolase_6: Alpha/ 25.8 43 0.00093 21.5 1.3 32 37-69 49-80 (228)
81 cd01577 IPMI_Swivel Aconatase- 25.5 35 0.00076 21.4 0.8 15 55-69 18-32 (91)
82 PF03869 Arc: Arc-like DNA bin 25.5 84 0.0018 17.2 2.3 35 13-51 11-45 (50)
83 KOG3990 Uncharacterized conser 25.4 57 0.0012 24.6 2.0 41 18-58 53-103 (305)
84 PF07055 Eno-Rase_FAD_bd: Enoy 25.1 1.6E+02 0.0034 17.4 3.5 47 3-53 5-53 (65)
85 PF04622 ERG2_Sigma1R: ERG2 an 25.0 1.2E+02 0.0025 22.0 3.5 34 20-54 35-68 (216)
86 PF13798 PCYCGC: Protein of un 24.9 31 0.00067 23.9 0.5 34 18-54 114-147 (158)
87 cd02860 Pullulanase_N_term Pul 24.8 82 0.0018 19.1 2.4 17 47-63 40-56 (100)
88 PRK13601 putative L7Ae-like ri 24.7 1.4E+02 0.003 18.2 3.3 24 4-27 25-48 (82)
89 cd05128 RasGAP_GAP1_like The G 24.5 61 0.0013 24.6 2.1 34 21-55 141-174 (315)
90 PRK06765 homoserine O-acetyltr 24.5 58 0.0013 25.1 2.0 34 36-69 142-175 (389)
91 PLN00072 3-isopropylmalate iso 24.1 70 0.0015 23.7 2.3 61 4-69 71-144 (246)
92 KOG3938 RGS-GAIP interacting p 23.9 1.5E+02 0.0033 22.6 4.0 23 11-33 250-272 (334)
93 COG0066 LeuD 3-isopropylmalate 23.8 40 0.00087 24.0 0.9 19 51-69 59-77 (191)
94 CHL00168 pbsA heme oxygenase; 23.6 91 0.002 22.7 2.8 38 11-49 179-216 (238)
95 cd05130 RasGAP_Neurofibromin N 23.5 1.4E+02 0.0031 22.8 3.9 31 25-56 148-178 (329)
96 KOG4194 Membrane glycoprotein 23.0 50 0.0011 28.2 1.5 24 53-76 586-609 (873)
97 cd05503 Bromo_BAZ2A_B_like Bro 23.0 1.9E+02 0.0041 17.6 4.8 38 15-52 59-97 (97)
98 cd05137 RasGAP_CLA2_BUD2 CLA2/ 23.0 1.5E+02 0.0032 23.3 4.0 36 20-56 207-242 (395)
99 PRK06683 hypothetical protein; 22.0 1.7E+02 0.0036 17.6 3.4 24 4-27 28-51 (82)
100 TIGR03228 anthran_1_2_A anthra 21.9 2E+02 0.0043 22.9 4.5 45 44-92 27-71 (438)
101 cd00404 Aconitase_swivel Aconi 21.7 66 0.0014 20.0 1.5 15 55-69 16-30 (88)
102 PF11841 DUF3361: Domain of un 21.7 2.2E+02 0.0047 19.7 4.2 34 12-45 113-146 (160)
103 PF01726 LexA_DNA_bind: LexA D 21.6 1.7E+02 0.0036 16.8 3.2 30 15-44 5-35 (65)
104 PF15586 Imm47: Immunity prote 21.4 2.3E+02 0.0049 18.3 4.1 33 20-52 80-112 (116)
105 PRK12414 putative aminotransfe 21.4 3.5E+02 0.0077 20.2 5.7 49 14-62 44-97 (384)
106 PF08594 UPF0300: Uncharacteri 21.3 3.1E+02 0.0068 19.9 5.1 19 55-73 95-113 (215)
107 TIGR00171 leuD 3-isopropylmala 21.3 43 0.00093 23.7 0.7 13 57-69 72-84 (188)
108 smart00764 Citrate_ly_lig Citr 21.3 2.1E+02 0.0046 19.7 4.2 10 56-65 109-120 (182)
109 KOG0760 Mitochondrial carrier 21.2 24 0.00051 26.8 -0.7 19 42-60 133-154 (302)
110 cd05506 Bromo_plant1 Bromodoma 21.0 2.1E+02 0.0045 17.3 4.8 38 15-52 61-99 (99)
111 PRK10667 Hha toxicity attenuat 21.0 1.4E+02 0.0031 19.8 3.0 32 19-52 49-84 (122)
112 PRK13888 conjugal transfer pro 20.6 32 0.00069 24.8 -0.1 19 41-59 91-109 (206)
113 cd02680 MIT_calpain7_2 MIT: do 20.3 2.1E+02 0.0046 17.1 3.8 10 41-50 63-72 (75)
114 PF11307 DUF3109: Protein of u 20.3 31 0.00067 24.4 -0.2 18 39-56 159-176 (183)
115 cd01580 AcnA_IRP_Swivel Aconit 20.2 60 0.0013 22.8 1.2 18 52-69 94-111 (171)
116 PRK01641 leuD isopropylmalate 20.1 47 0.001 23.7 0.7 33 38-70 36-83 (200)
117 PF10045 DUF2280: Uncharacteri 20.1 1.6E+02 0.0035 19.0 3.1 28 20-49 7-34 (104)
118 PF02881 SRP54_N: SRP54-type p 20.1 1.9E+02 0.004 16.4 4.1 21 10-30 36-57 (75)
119 cd03548 Rieske_RO_Alpha_OMO_CA 20.0 2.4E+02 0.0052 18.2 4.1 34 51-90 10-44 (136)
No 1
>KOG3430 consensus Dynein light chain type 1 [Cytoskeleton]
Probab=100.00 E-value=5.8e-43 Score=217.92 Aligned_cols=90 Identities=56% Similarity=0.909 Sum_probs=86.5
Q ss_pred CCCCCceEEeCCCCHHHHHHHHHHHHHHhhhcCCCCHHHHHHHHHHhhcccCCCceEEEEeCccceeEEEccCcEEEEEe
Q 034366 1 MLEGKASIKETDMPVKMQVQAMASASQALDLYDVVDSISIATHIKKEFDKIYGGGWQCVVGSNFGCYFTHKHGTFIYFSL 80 (96)
Q Consensus 1 M~~~~~~I~~~dM~~~~q~~~~~~~~~a~~~~~~~~~~~iA~~IK~~lD~~yg~~WhciVG~~Fgs~vt~~~~~~i~F~~ 80 (96)
|.+.+++|+.+|||++||++|+++|.+|+++|+. +.++||..||++||++||++||||||++|||+|||++++||||++
T Consensus 1 ~~~~~~~vk~tDM~~~mq~~a~~~a~~al~~f~~-~~k~iA~~iKkefDkkyG~~WhcivG~~FGs~vThe~g~Fiyf~~ 79 (90)
T KOG3430|consen 1 MLERKAVVKATDMPEEMQQEAIELARQALEKFNV-IEKDIAAFIKKEFDKKYGPTWHCIVGRNFGSYVTHETGHFIYFYL 79 (90)
T ss_pred CCCccceEecCCCChHHHHHHHHHHHHHHHHcCC-ChHHHHHHHHHHHhhhcCCccEEEEcCCcceEEEeecCcEEEEEe
Confidence 7889999999999999999999999999999995 239999999999999999999999999999999999999999999
Q ss_pred CCEEEEEEecC
Q 034366 81 ETLNFLIFKAA 91 (96)
Q Consensus 81 ~~~~illfkt~ 91 (96)
|.++|||||+.
T Consensus 80 g~l~illfK~~ 90 (90)
T KOG3430|consen 80 GVLAILLFKCA 90 (90)
T ss_pred ceEEEEEEecC
Confidence 99999999984
No 2
>PTZ00059 dynein light chain; Provisional
Probab=100.00 E-value=2.5e-42 Score=217.87 Aligned_cols=89 Identities=46% Similarity=0.801 Sum_probs=86.8
Q ss_pred CCCCCceEEeCCCCHHHHHHHHHHHHHHhhhcCCCCHHHHHHHHHHhhcccCCCceEEEEeCccceeEEEccCcEEEEEe
Q 034366 1 MLEGKASIKETDMPVKMQVQAMASASQALDLYDVVDSISIATHIKKEFDKIYGGGWQCVVGSNFGCYFTHKHGTFIYFSL 80 (96)
Q Consensus 1 M~~~~~~I~~~dM~~~~q~~~~~~~~~a~~~~~~~~~~~iA~~IK~~lD~~yg~~WhciVG~~Fgs~vt~~~~~~i~F~~ 80 (96)
|++++++|+.+|||++||++|+++|.+|+++|+.+ +|||++||+.||++|||+||||||++|||++||++++||||++
T Consensus 2 m~~~~~~i~~~dM~~emq~~a~~~~~~Al~~~~~~--kdiA~~IK~~fD~~yg~~WhciVG~~Fgs~vthe~~~~i~F~~ 79 (90)
T PTZ00059 2 MSDRKAVVKNADMSEDMQQDAIDCANQALEKFNIE--KDIAAYIKKEFDKKYNPTWHCIVGRNFGSYVTHETKHFIYFYL 79 (90)
T ss_pred CCCCccEEEECCCCHHHHHHHHHHHHHHHHHcCch--HHHHHHHHHHHHhhcCCCCEEEEecCeeEEEEEeCCcEEEEEE
Confidence 78889999999999999999999999999999987 9999999999999999999999999999999999999999999
Q ss_pred CCEEEEEEecC
Q 034366 81 ETLNFLIFKAA 91 (96)
Q Consensus 81 ~~~~illfkt~ 91 (96)
|+++|||||++
T Consensus 80 ~~~~vLlfK~~ 90 (90)
T PTZ00059 80 GQVAILLFKSG 90 (90)
T ss_pred CCEEEEEEecC
Confidence 99999999985
No 3
>PLN03058 dynein light chain type 1 family protein; Provisional
Probab=100.00 E-value=6.8e-41 Score=222.49 Aligned_cols=93 Identities=41% Similarity=0.810 Sum_probs=87.5
Q ss_pred CCCceEEeCCCCHHHHHHHHHHHHHHhhhcCC-CCHHHHHHHHHHhhcccCCCceEEEEeCccceeEEEccCcEEEEEeC
Q 034366 3 EGKASIKETDMPVKMQVQAMASASQALDLYDV-VDSISIATHIKKEFDKIYGGGWQCVVGSNFGCYFTHKHGTFIYFSLE 81 (96)
Q Consensus 3 ~~~~~I~~~dM~~~~q~~~~~~~~~a~~~~~~-~~~~~iA~~IK~~lD~~yg~~WhciVG~~Fgs~vt~~~~~~i~F~~~ 81 (96)
+.+++|+.+|||++||++|+++|.+|+++++. .+.++||++||++||++|||+||||||++|||+|||++++||||++|
T Consensus 33 ~~dv~Ik~sDM~~emQ~~ave~a~~Al~k~~~~~~ekdIA~~IKk~fDkkYG~tWHCIVGk~FGs~VTHe~~~fIyF~ig 112 (128)
T PLN03058 33 ELNVRVRASDMPLVLQNRAFSCARDILDAMPGKLDSKRLALALKKEFDSAYGPAWHCIVGTSFGSYVTHSTGGFLYFSID 112 (128)
T ss_pred CCCCEEEECCCCHHHHHHHHHHHHHHHHHccccCCHHHHHHHHHHHHhhhhCCceEEEECCcEEEEEEEcCCcEEEEEEC
Confidence 47899999999999999999999999999863 23389999999999999999999999999999999999999999999
Q ss_pred CEEEEEEecCCCCC
Q 034366 82 TLNFLIFKAAITPS 95 (96)
Q Consensus 82 ~~~illfkt~~~~~ 95 (96)
+++||||||+.+|-
T Consensus 113 ~~aiLLfKt~~~~~ 126 (128)
T PLN03058 113 KVYILLFKTAVEPL 126 (128)
T ss_pred CEEEEEEeccCccC
Confidence 99999999999984
No 4
>PF01221 Dynein_light: Dynein light chain type 1 ; InterPro: IPR001372 Dynein is a multisubunit microtubule-dependent motor enzyme that acts as the force generating protein of eukaryotic cilia and flagella. The cytoplasmic isoform of dynein acts as a motor for the intracellular retrograde motility of vesicles and organelles along microtubules. Dynein is composed of a number of ATP-binding large subunits (see IPR004273 from INTERPRO), intermediate size subunits and small subunits. Among the small subunits, there is a family of highly conserved proteins which make up this family [, ]. Both type 1 (DLC1) and 2 (DLC2) dynein light chains have a similar two-layer alpha-beta core structure consisting of beta-alpha(2)-beta-X-beta(2) [, ].; GO: 0007017 microtubule-based process, 0005875 microtubule associated complex; PDB: 1F95_A 1F96_A 1F3C_A 3P8M_B 2XQQ_C 1RE6_A 1CMI_A 1PWK_A 1PWJ_A 4DS1_C ....
Probab=100.00 E-value=3.4e-41 Score=211.77 Aligned_cols=89 Identities=47% Similarity=0.812 Sum_probs=84.6
Q ss_pred CCCCCceEEeCCCCHHHHHHHHHHHHHHhhhcCCCCHHHHHHHHHHhhcccCCCceEEEEeCccceeEEEccCcEEEEEe
Q 034366 1 MLEGKASIKETDMPVKMQVQAMASASQALDLYDVVDSISIATHIKKEFDKIYGGGWQCVVGSNFGCYFTHKHGTFIYFSL 80 (96)
Q Consensus 1 M~~~~~~I~~~dM~~~~q~~~~~~~~~a~~~~~~~~~~~iA~~IK~~lD~~yg~~WhciVG~~Fgs~vt~~~~~~i~F~~ 80 (96)
|++++++|+.+|||++||++|+++|.+|+++++++ +++|++||++||++|||+||||||++|||++||++++++||++
T Consensus 1 m~~~~~~i~~~dM~~~~~~~~~~~~~~a~~~~~~~--~eiA~~iK~~lD~~yG~~Wh~IVG~~Fg~~~th~~~~~~~f~~ 78 (89)
T PF01221_consen 1 MSENKIVIKSSDMPEEMQEEAIELAKEALKKYQDE--KEIAEFIKQELDKKYGPTWHCIVGKSFGSSVTHEPGTFLYFKI 78 (89)
T ss_dssp SGSCSEEEEEEES-HHHHHHHHHHHHHHHHHCSSH--HHHHHHHHHHHHHHHSS-EEEEEESEEEEEEEEETTEEEEEEE
T ss_pred CCCCccEEEECCCCHHHHHHHHHHHHHHHHHCCcH--HHHHHHHHHHHhcccCCceEEEECCcEEEEEEEcCCcEEEEEE
Confidence 89999999999999999999999999999999976 9999999999999999999999999999999999999999999
Q ss_pred CCEEEEEEecC
Q 034366 81 ETLNFLIFKAA 91 (96)
Q Consensus 81 ~~~~illfkt~ 91 (96)
++++||||||+
T Consensus 79 ~~~~~li~kt~ 89 (89)
T PF01221_consen 79 GNIAFLIFKTQ 89 (89)
T ss_dssp TTEEEEEEEE-
T ss_pred CCEEEEEEecC
Confidence 99999999985
No 5
>PF04155 Ground-like: Ground-like domain; InterPro: IPR007284 This group of proteins contain one or more copies of the ground-like domain, which are specific to Caenorhabditis elegans and Caenorhabditis briggsae. It has been proposed that the ground-like domain containing proteins may bind and modulate the activity of Patched-like membrane molecules, reminiscent of the modulating activities of neuropeptides [].
Probab=97.67 E-value=0.00068 Score=41.01 Aligned_cols=53 Identities=15% Similarity=0.290 Sum_probs=46.3
Q ss_pred CHHHHHHHHHHhhcccCCCceEEEEeC-ccceeEEEccCcEEEEEeCCEEEEEEe
Q 034366 36 DSISIATHIKKEFDKIYGGGWQCVVGS-NFGCYFTHKHGTFIYFSLETLNFLIFK 89 (96)
Q Consensus 36 ~~~~iA~~IK~~lD~~yg~~WhciVG~-~Fgs~vt~~~~~~i~F~~~~~~illfk 89 (96)
+..++++.|.+.++++||+.+-||+++ +|.....+. ..|--...++...++|+
T Consensus 23 ~~~~s~~~Iq~~~e~~f~~~f~vIcs~~~Fsy~~~~~-~~~C~~~~~g~~c~af~ 76 (76)
T PF04155_consen 23 NLSISKRAIQKAAEKRFGGSFEVICSEGDFSYSTHTD-DLYCKVEKNGVTCLAFA 76 (76)
T ss_pred CHHHHHHHHHHHHHHHhCCCEEEEEeCCCceeEEecc-cceeeeeeCCEEEEEEC
Confidence 347999999999999999999999999 777666666 78888899999999985
No 6
>PF05075 DUF684: Protein of unknown function (DUF684); InterPro: IPR007767 This family contains uncharacterised proteins from Caenorhabditis elegans.
Probab=92.83 E-value=1.5 Score=33.39 Aligned_cols=68 Identities=16% Similarity=0.296 Sum_probs=44.0
Q ss_pred HHHHHhhhcCCCCHHHHHHHHHHhhcccCC-CceEEEEeC--------ccceeEEEccCcEEE-EEeCCEEEEEEecC
Q 034366 24 SASQALDLYDVVDSISIATHIKKEFDKIYG-GGWQCVVGS--------NFGCYFTHKHGTFIY-FSLETLNFLIFKAA 91 (96)
Q Consensus 24 ~~~~a~~~~~~~~~~~iA~~IK~~lD~~yg-~~WhciVG~--------~Fgs~vt~~~~~~i~-F~~~~~~illfkt~ 91 (96)
.+.+..+.+....-.+.|..||+.||+-.. -.+-++|-. .|-....+....+|. |.-|+..++||||-
T Consensus 175 ~v~~~~d~~~~~sn~eKAd~Ik~~Le~ilTnDsFYIiVfd~~~~~~~~~~y~~~~~~~dq~I~s~~rGgcNv~VYRS~ 252 (345)
T PF05075_consen 175 FVEEVQDNNSHLSNEEKADEIKKKLEKILTNDSFYIIVFDDCSGYDNHYYYGFYDNNEDQYIESFNRGGCNVFVYRSK 252 (345)
T ss_pred HHHHHHHhcCCCChHHHHHHHHHHHHhhCCCCcEEEEEecccccCCccceeeeccCcccCEEEEEeCCCeEEEEEeeC
Confidence 444444444422227899999999999654 468888822 111222345555665 67899999999994
No 7
>PF15650 Tox-REase-9: Restriction endonuclease fold toxin 9
Probab=80.81 E-value=1.4 Score=27.64 Aligned_cols=49 Identities=22% Similarity=0.462 Sum_probs=27.7
Q ss_pred CCHHHHHHHHHHHHHHhhhcCCCCHHHHH------HHHHHhhcccCCCceEEEEe
Q 034366 13 MPVKMQVQAMASASQALDLYDVVDSISIA------THIKKEFDKIYGGGWQCVVG 61 (96)
Q Consensus 13 M~~~~q~~~~~~~~~a~~~~~~~~~~~iA------~~IK~~lD~~yg~~WhciVG 61 (96)
||.-.+-++++.....+=+.++.+.+.|. +-=|++|...||.+|.|+|-
T Consensus 33 lpsGkR~D~id~~~k~IyELKPnNPr~ik~G~kQl~~Y~~el~~~~G~~W~~~l~ 87 (89)
T PF15650_consen 33 LPSGKRPDFIDFETKIIYELKPNNPRAIKRGLKQLENYKQELEKIYGGGWKTRLE 87 (89)
T ss_pred cCCCCcCccccCCcceEEEecCCCHHHHHHHHHHHHHHHHHhcCccCCCeeEEee
Confidence 33333444555444433334433333333 23357899999999999973
No 8
>PF12006 DUF3500: Protein of unknown function (DUF3500); InterPro: IPR021889 This family of proteins is functionally uncharacterised. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 335 to 438 amino acids in length. This protein has a conserved GHH sequence motif. This protein has two completely conserved G residues that may be functionally important.
Probab=79.52 E-value=14 Score=27.95 Aligned_cols=45 Identities=16% Similarity=0.200 Sum_probs=31.5
Q ss_pred EEeCCCCHHHHHHHHHHHHHHhhhcCCCCHHHHHHHH-HHhhcccC
Q 034366 8 IKETDMPVKMQVQAMASASQALDLYDVVDSISIATHI-KKEFDKIY 52 (96)
Q Consensus 8 I~~~dM~~~~q~~~~~~~~~a~~~~~~~~~~~iA~~I-K~~lD~~y 52 (96)
|..++|+.+.|+.+..++..-+..++.+..++.-+.| +..||+.|
T Consensus 216 l~~s~Lt~~Qq~ll~~li~~y~~~~~~~~a~~~~~~i~~~~ld~t~ 261 (313)
T PF12006_consen 216 LAVSELTADQQELLLALIKEYLGRLPEEDAAERMAEIEEAGLDETY 261 (313)
T ss_pred cChhhCCHHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHhccccceE
Confidence 5678999999999999999999988853222222334 55555543
No 9
>TIGR00237 xseA exodeoxyribonuclease VII, large subunit. This family consist of exodeoxyribonuclease VII, large subunit XseA which catalyses exonucleolytic cleavage in either the 5'-3' or 3'-5' direction to yield 5'-phosphomononucleotides. Exonuclease VII consists of one large subunit and four small subunits.
Probab=75.15 E-value=8 Score=30.41 Aligned_cols=48 Identities=21% Similarity=0.473 Sum_probs=36.9
Q ss_pred HHHHHHHHHhhcccCCCceEEEEeC--ccceeEEEccCcEEEEEeC----CEEEEEEecC
Q 034366 38 ISIATHIKKEFDKIYGGGWQCVVGS--NFGCYFTHKHGTFIYFSLE----TLNFLIFKAA 91 (96)
Q Consensus 38 ~~iA~~IK~~lD~~yg~~WhciVG~--~Fgs~vt~~~~~~i~F~~~----~~~illfkt~ 91 (96)
.++..+||+.||..++..| |.|+ ||-. | ..-.+||.+. .+...+|++.
T Consensus 3 sel~~~ik~~le~~~~~v~--V~GEisn~~~---~-~sGH~YFtLkD~~a~i~~vmf~~~ 56 (432)
T TIGR00237 3 SELNAQIKALLEATFLQVW--IQGEISNFTQ---P-VSGHWYFTLKDENAQVRCVMFRGN 56 (432)
T ss_pred HHHHHHHHHHHHhhCCcEE--EEEEecCCee---C-CCceEEEEEEcCCcEEEEEEEcCh
Confidence 5789999999999999888 6676 6652 3 3446899873 6788999874
No 10
>PF06150 ChaB: ChaB; InterPro: IPR009317 This family of proteins contain a conserved 60 residue region. This protein is known as ChaB in Escherichia coli and is found next to ChaA, which is a cation transporter protein. ChaB may be regulate ChaA function in some way.; PDB: 1SG7_A.
Probab=70.07 E-value=15 Score=21.01 Aligned_cols=46 Identities=20% Similarity=0.387 Sum_probs=30.0
Q ss_pred CCCCHHHHHHHHHHHHHHhhhcCCCC-HHHHH-HHHHHhhcccCCCceE
Q 034366 11 TDMPVKMQVQAMASASQALDLYDVVD-SISIA-THIKKEFDKIYGGGWQ 57 (96)
Q Consensus 11 ~dM~~~~q~~~~~~~~~a~~~~~~~~-~~~iA-~~IK~~lD~~yg~~Wh 57 (96)
..||..-|+--++....|++.|..+. ...+| ..+|+...+ -|+.|.
T Consensus 7 ~~LP~~Aq~if~~afn~a~~~~~de~~A~~vAw~AVk~~Y~k-~~g~W~ 54 (57)
T PF06150_consen 7 EHLPEHAQRIFRKAFNSAWEEYGDEERAHRVAWAAVKRKYEK-VNGRWV 54 (57)
T ss_dssp TT--SHHHHHHHHHHHHHHHH--SHHHHHHHHHHHHHHHEEE-SSS-EE
T ss_pred hHCCHHHHHHHHHHHHHHHHhcCCHhHHHHHHHHHHHHHhee-cCCEee
Confidence 46788888888888899999996531 11233 679999999 688895
No 11
>KOG1573 consensus Aldehyde reductase [General function prediction only]
Probab=68.24 E-value=31 Score=24.36 Aligned_cols=61 Identities=20% Similarity=0.278 Sum_probs=37.9
Q ss_pred CCCCHHHHHHHHHHHHHHhhhcCCCCHHHHH---HHHHHhhcccCC-CceEEEEeC------ccceeEEEccC
Q 034366 11 TDMPVKMQVQAMASASQALDLYDVVDSISIA---THIKKEFDKIYG-GGWQCVVGS------NFGCYFTHKHG 73 (96)
Q Consensus 11 ~dM~~~~q~~~~~~~~~a~~~~~~~~~~~iA---~~IK~~lD~~yg-~~WhciVG~------~Fgs~vt~~~~ 73 (96)
-|+++...+-+.+.|...-+.|+.++.-.++ .-|-+-| .-+| |-|-|| |. -|+.++-|-..
T Consensus 91 PDlDepni~Ha~QtAE~iR~~~Pd~dWlHLtaLiHDLGKvl-~f~GepQWAVv-GDTfpVGC~~~~s~V~~d~ 161 (204)
T KOG1573|consen 91 PDLDEPNIQHALQTAEAIRKDYPDEDWLHLTALIHDLGKVL-AFGGEPQWAVV-GDTFPVGCAFDASNVHHDK 161 (204)
T ss_pred CCCchHHHHHHHHHHHHHHHhCCCccHHHHHHHHHHHHHHH-HhcCCcceeee-cCcccccccccccceechh
Confidence 3677777788888887777888877744444 4444556 3444 688764 54 44444444433
No 12
>KOG2451 consensus Aldehyde dehydrogenase [Energy production and conversion]
Probab=66.92 E-value=14 Score=29.56 Aligned_cols=46 Identities=22% Similarity=0.294 Sum_probs=36.3
Q ss_pred CCceEEeCCCCHHHHHHHHHHHHHHhhhcCCCCHHHHHHHHHHhhc
Q 034366 4 GKASIKETDMPVKMQVQAMASASQALDLYDVVDSISIATHIKKEFD 49 (96)
Q Consensus 4 ~~~~I~~~dM~~~~q~~~~~~~~~a~~~~~~~~~~~iA~~IK~~lD 49 (96)
+.+.=+-.||+.+.-+++++.|.+|++.|.+-..++-++.|++|.|
T Consensus 49 geii~~V~~~~V~e~~kAI~aA~EaF~s~~~~takeRs~lLrkwy~ 94 (503)
T KOG2451|consen 49 GEIIGKVADMTVEEAEKAIDAAYEAFKSYRNLTAKERSALLRKWYE 94 (503)
T ss_pred cchhhcccCCcHHHHHHHHHHHHHHHHHHhhCCHHHHHHHHHHHHH
Confidence 3445577899999999999999999999986444666777776655
No 13
>PRK00286 xseA exodeoxyribonuclease VII large subunit; Reviewed
Probab=65.39 E-value=20 Score=27.94 Aligned_cols=48 Identities=25% Similarity=0.526 Sum_probs=37.0
Q ss_pred HHHHHHHHHhhcccCCCceEEEEeC--ccceeEEEccCcEEEEEeC----CEEEEEEecC
Q 034366 38 ISIATHIKKEFDKIYGGGWQCVVGS--NFGCYFTHKHGTFIYFSLE----TLNFLIFKAA 91 (96)
Q Consensus 38 ~~iA~~IK~~lD~~yg~~WhciVG~--~Fgs~vt~~~~~~i~F~~~----~~~illfkt~ 91 (96)
.++..+||..||..++..| |.|+ +|-. -..-.+||.+. .+...+|++.
T Consensus 9 sel~~~ik~~le~~~~~v~--v~gEis~~~~----~~sGH~Yf~Lkd~~a~i~~~~~~~~ 62 (438)
T PRK00286 9 SELNRYVKSLLERDLGQVW--VRGEISNFTR----HSSGHWYFTLKDEIAQIRCVMFKGS 62 (438)
T ss_pred HHHHHHHHHHHHhhCCcEE--EEEEeCCCee----CCCCeEEEEEEcCCcEEEEEEEcCh
Confidence 6899999999999988888 6676 5432 24456899874 5889999874
No 14
>PF07742 BTG: BTG family; InterPro: IPR002087 Anti-proliferative proteins have been shown to include mammalian and avian protein BTG1 (which appears to be involved in negative regulation of cell proliferation) and rat/mouse NGF-inducible protein PC3/TIS21 (BTG2) [, , ]. These proteins have from 158 to 363 amino acid residues, that are highly similar and include 3 conserved cysteine residues. BTG2 seems to have a signal sequence; while the other proteins may lack such a domain. The sequence of the N-terminal half of these proteins is well conserved.; PDB: 3DJU_B 3E9V_A 2Z15_D 2D5R_B 3DJN_B.
Probab=63.25 E-value=7.3 Score=25.49 Aligned_cols=21 Identities=19% Similarity=0.362 Sum_probs=16.3
Q ss_pred HHHHHHHHHHhhcccCCCceE
Q 034366 37 SISIATHIKKEFDKIYGGGWQ 57 (96)
Q Consensus 37 ~~~iA~~IK~~lD~~yg~~Wh 57 (96)
....|+.|.+.|-.+|.+.|+
T Consensus 27 ~~~F~~~L~~~L~~ry~~HW~ 47 (118)
T PF07742_consen 27 VDRFAEELENLLCERYKGHWY 47 (118)
T ss_dssp HHHHHHHHHHHHHHHHTTS--
T ss_pred HHHHHHHHHHHHHHHHhCCCC
Confidence 356789999999999999997
No 15
>PF13742 tRNA_anti_2: OB-fold nucleic acid binding domain
Probab=62.48 E-value=31 Score=21.41 Aligned_cols=50 Identities=20% Similarity=0.460 Sum_probs=35.2
Q ss_pred HHHHHHHHHhhccc--CCCceEEEEeCccceeEEEccCcEEEEEeC----CEEEEEEecC
Q 034366 38 ISIATHIKKEFDKI--YGGGWQCVVGSNFGCYFTHKHGTFIYFSLE----TLNFLIFKAA 91 (96)
Q Consensus 38 ~~iA~~IK~~lD~~--yg~~WhciVG~~Fgs~vt~~~~~~i~F~~~----~~~illfkt~ 91 (96)
.++.++||+.++.. ++..| |.|+= +.++.-.+..+||.+. .+...+|++.
T Consensus 5 s~l~~~ik~~le~~~~~~~vw--V~GEI--s~~~~~~~gh~YftLkD~~a~i~~~~~~~~ 60 (99)
T PF13742_consen 5 SELNNYIKDLLERDPPLPNVW--VEGEI--SNLKRHSSGHVYFTLKDEEASISCVIFRSR 60 (99)
T ss_pred HHHHHHHHHHHhcCCCcCCEE--EEEEE--eecEECCCceEEEEEEcCCcEEEEEEEHHH
Confidence 68999999999998 67888 56661 1122226677888873 5677888764
No 16
>PF12652 CotJB: CotJB protein; InterPro: IPR024207 The cotJ operon proteins affect spore coat composition, and is controlled by sigma E. The genes, which include CotJB, are either required for the normal formation of the inner layers of the coat or are themselves structural components of the coat []. CotJB has been identified as a spore coat protein [].
Probab=62.02 E-value=4.3 Score=24.77 Aligned_cols=14 Identities=43% Similarity=0.740 Sum_probs=11.8
Q ss_pred HHHHHhhcccCCCc
Q 034366 42 THIKKEFDKIYGGG 55 (96)
Q Consensus 42 ~~IK~~lD~~yg~~ 55 (96)
+.+++.+.++|||-
T Consensus 43 ~~l~~~Ye~~yGPL 56 (78)
T PF12652_consen 43 KQLKKEYEKRYGPL 56 (78)
T ss_pred HHHHHHHHHHhCCC
Confidence 66888999999984
No 17
>cd03472 Rieske_RO_Alpha_BPDO_like Rieske non-heme iron oxygenase (RO) family, Biphenyl dioxygenase (BPDO)-like subfamily, N-terminal Rieske domain of the oxygenase alpha subunit; composed of the oxygenase alpha subunits of BPDO and similar proteins including cumene dioxygenase (CumDO), nitrobenzene dioxygenase (NBDO), alkylbenzene dioxygenase (AkbDO) and dibenzofuran 4,4a-dioxygenase (DFDO). ROs comprise a large class of aromatic ring-hydroxylating dioxygenases that enable microorganisms to tolerate and utilize aromatic compounds for growth. The oxygenase alpha subunit contains an N-terminal Rieske domain with an [2Fe-2S] cluster and a C-terminal catalytic domain with a mononuclear Fe(II) binding site. The Rieske [2Fe-2S] cluster accepts electrons from a reductase or ferredoxin component and transfers them to the mononuclear iron for catalysis. BPDO degrades biphenyls and polychlorinated biphenyls (PCB's) while CumDO degrades cumene (isopropylbenzene), an aromatic hydrocarbon that is i
Probab=61.86 E-value=21 Score=23.29 Aligned_cols=40 Identities=8% Similarity=0.308 Sum_probs=28.2
Q ss_pred hcccCCCceEEEEeC-ccceeEEEccCcEEEEEeCCEEEEEEecCC
Q 034366 48 FDKIYGGGWQCVVGS-NFGCYFTHKHGTFIYFSLETLNFLIFKAAI 92 (96)
Q Consensus 48 lD~~yg~~WhciVG~-~Fgs~vt~~~~~~i~F~~~~~~illfkt~~ 92 (96)
|++-|...|+.|.=. .+. +++.+..+.+++..|+|+|+.+
T Consensus 1 ~~~i~~~~W~~v~~~~el~-----~~g~~~~~~~~~~~i~l~r~~~ 41 (128)
T cd03472 1 LERVFARSWLLLGHETHIP-----KAGDYLTTYMGEDPVIVVRQKD 41 (128)
T ss_pred CcchhhCCCeEeEEHHHCC-----CCCCEEEEEECCceEEEEECCC
Confidence 566788899986543 332 3466667889999999998643
No 18
>PF12362 DUF3646: DNA polymerase III gamma and tau subunits C terminal; InterPro: IPR022107 This domain family is found in bacteria, and is approximately 120 amino acids in length. The family is found in association with PF00004 from PFAM. The proteins in this family are frequently annotated as the gamma and tau subunits of DNA polymerase III, however there is little accompanying literature to back this up.
Probab=61.24 E-value=11 Score=24.67 Aligned_cols=28 Identities=14% Similarity=0.407 Sum_probs=25.7
Q ss_pred HHHHHHHHHhhcccCCCceEEEEeCccc
Q 034366 38 ISIATHIKKEFDKIYGGGWQCVVGSNFG 65 (96)
Q Consensus 38 ~~iA~~IK~~lD~~yg~~WhciVG~~Fg 65 (96)
.++|..|++.|..--|..|.|.+++.=|
T Consensus 49 ~dl~~~L~~~L~~wTG~rW~V~~s~~~g 76 (117)
T PF12362_consen 49 KDLAQRLSRKLQEWTGQRWIVSLSNEPG 76 (117)
T ss_pred HHHHHHHHHHHHHHhCCCeEEEEeCCCC
Confidence 7999999999999999999999998654
No 19
>PF08776 VASP_tetra: VASP tetramerisation domain; InterPro: IPR014885 Vasodilator-stimulated phosphoprotein (VASP) is an actin cytoskeletal regulatory protein. This region corresponds to the tetramerisation domain which forms a right handed alpha helical coiled coil structure []. ; PDB: 1USE_A 1USD_A.
Probab=60.29 E-value=23 Score=19.00 Aligned_cols=32 Identities=13% Similarity=0.292 Sum_probs=25.3
Q ss_pred HHHHHHHHHHHHHHhhhcCCCCHHHHHHHHHHhhcc
Q 034366 15 VKMQVQAMASASQALDLYDVVDSISIATHIKKEFDK 50 (96)
Q Consensus 15 ~~~q~~~~~~~~~a~~~~~~~~~~~iA~~IK~~lD~ 50 (96)
+.+.+++++..+.-+++.+ .+|-..|+++|-+
T Consensus 6 e~~KqEIL~EvrkEl~K~K----~EIIeA~~~eL~r 37 (40)
T PF08776_consen 6 ERLKQEILEEVRKELQKVK----EEIIEAIRQELSR 37 (40)
T ss_dssp HHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHhc
Confidence 3578889999999888777 5688888888764
No 20
>TIGR02084 leud 3-isopropylmalate dehydratase, small subunit. Several pairs of archaeal proteins resemble the leuC and leuD pair in length and sequence but even more closely resemble the respective domains of homoaconitase, and their identity is uncertain. The members of the seed for this model are those sequences which are gene clustered with other genes involved in leucine biosynthesis and include some archaea.
Probab=57.33 E-value=10 Score=25.98 Aligned_cols=33 Identities=33% Similarity=0.491 Sum_probs=22.7
Q ss_pred HHHHHHHHHhhcc----cCCCceEEEEeCccceeEEE
Q 034366 38 ISIATHIKKEFDK----IYGGGWQCVVGSNFGCYFTH 70 (96)
Q Consensus 38 ~~iA~~IK~~lD~----~yg~~WhciVG~~Fgs~vt~ 70 (96)
.++|+++-+.+|. ++.+..=+|.|+||||=-|.
T Consensus 27 ~~l~~~~f~~~~p~f~~~~~~g~iiVaG~NFG~GSSR 63 (156)
T TIGR02084 27 KELAKHCMEDLDKDFVKKVKEGDIIVAGENFGCGSSR 63 (156)
T ss_pred HHHHhhhhccCChhHHhhcCCCCEEEccCcccCCCcH
Confidence 5777777776664 45566677778899975443
No 21
>PF10703 MoaF: Molybdenum cofactor biosynthesis protein F; InterPro: IPR024724 Molybdenum cofactor biosynthesis protein F (MoaF) is essential for the production of the monoamine-inducible 30kDa protein in Klebsiella []. It is necessary for reconstituting organoautotrophic growth in Ralstonia eutropha []. MoaF is conserved in proteobacteria and some lower eukaryotes. The operon regulating the Moa genes is responsible for molybdenum cofactor biosynthesis.
Probab=52.31 E-value=29 Score=25.95 Aligned_cols=37 Identities=24% Similarity=0.650 Sum_probs=26.5
Q ss_pred ceEEEEeCccceeEEEccCcEEEEEeC-CEEEEEEecCCCC
Q 034366 55 GWQCVVGSNFGCYFTHKHGTFIYFSLE-TLNFLIFKAAITP 94 (96)
Q Consensus 55 ~WhciVG~~Fgs~vt~~~~~~i~F~~~-~~~illfkt~~~~ 94 (96)
+|||+.|..=|-. +...+-+++++ ++.++.|+=..-|
T Consensus 182 ~W~CL~G~e~Gla---D~D~c~~~Ki~d~lYlf~WrEkiiP 219 (265)
T PF10703_consen 182 AWQCLSGVEKGLA---DTDRCHYYKIADNLYLFTWREKIIP 219 (265)
T ss_pred EEEEeeccccCCC---CccceEEEEecCCEEEEEEEecccc
Confidence 8999999866533 35777788875 6777788755444
No 22
>cd01579 AcnA_Bact_Swivel Bacterial Aconitase-like swivel domain. Aconitase (aconitate hydratase or citrate hydrolyase) catalyzes the reversible isomerization of citrate and isocitrate as part of the TCA cycle. Cis-aconitate is formed as an intermediate product during the course of the reaction. This is the aconitase-like swivel domain, which is believed to undergo swivelling conformational change in the enzyme mechanism. This distinct subfamily is found only in bacteria and archea. Its exact characteristics are not known.
Probab=51.57 E-value=13 Score=24.46 Aligned_cols=18 Identities=28% Similarity=0.193 Sum_probs=12.5
Q ss_pred CCCceEEEEeCccceeEE
Q 034366 52 YGGGWQCVVGSNFGCYFT 69 (96)
Q Consensus 52 yg~~WhciVG~~Fgs~vt 69 (96)
.|..+=+|.|+||||=-+
T Consensus 46 ~~~~~iiVaG~nFG~GSS 63 (121)
T cd01579 46 AAGPGFIVGGENYGQGSS 63 (121)
T ss_pred cCCCeEEEcCCcCCCCcc
Confidence 355666777889997444
No 23
>cd03538 Rieske_RO_Alpha_AntDO Rieske non-heme iron oxygenase (RO) family, Anthranilate 1,2-dioxygenase (AntDO) subfamily, N-terminal Rieske domain of the oxygenase alpha subunit; ROs comprise a large class of aromatic ring-hydroxylating dioxygenases that enable microorganisms to tolerate and utilize aromatic compounds for growth. The oxygenase alpha subunit contains an N-terminal Rieske domain with an [2Fe-2S] cluster and a C-terminal catalytic domain with a mononuclear Fe(II) binding site. The Rieske [2Fe-2S] cluster accepts electrons from a reductase or ferredoxin component and transfers them to the mononuclear iron for catalysis. AntDO converts anthranilate to catechol, a naturally occurring compound formed through tryptophan degradation and an important intermediate in the metabolism of many N-heterocyclic compounds such as indole, o-nitrobenzoate, carbazole, and quinaldine.
Probab=51.20 E-value=54 Score=21.79 Aligned_cols=45 Identities=13% Similarity=0.346 Sum_probs=32.4
Q ss_pred HHHHhhcccCCCceEEEEeC-ccceeEEEccCcEEEEEeCCEEEEEEecCC
Q 034366 43 HIKKEFDKIYGGGWQCVVGS-NFGCYFTHKHGTFIYFSLETLNFLIFKAAI 92 (96)
Q Consensus 43 ~IK~~lD~~yg~~WhciVG~-~Fgs~vt~~~~~~i~F~~~~~~illfkt~~ 92 (96)
....+++.-|...|+.|.-. .. -+++.++-+.+++..++|+|..+
T Consensus 10 ~~~~e~~~i~~~~W~~v~~~~el-----p~~G~~~~~~i~g~~i~v~r~~~ 55 (146)
T cd03538 10 IFALEMERLFGNAWIYVGHESQV-----PNPGDYITTRIGDQPVVMVRHTD 55 (146)
T ss_pred HHHHHHHHHhhcCCEEEEEHHHC-----CCCCCEEEEEECCeeEEEEECCC
Confidence 44566777788899986542 33 13577888899999999999643
No 24
>PF14900 DUF4493: Domain of unknown function (DUF4493)
Probab=48.51 E-value=48 Score=23.53 Aligned_cols=37 Identities=24% Similarity=0.433 Sum_probs=29.4
Q ss_pred HhhcccCCCceEEEEeCccceeEEE--ccC---cEEEEEeCC
Q 034366 46 KEFDKIYGGGWQCVVGSNFGCYFTH--KHG---TFIYFSLET 82 (96)
Q Consensus 46 ~~lD~~yg~~WhciVG~~Fgs~vt~--~~~---~~i~F~~~~ 82 (96)
..|.+.|+..|++-|...-+..+++ ... .-.||..+.
T Consensus 119 ~~f~~~f~~~y~vtV~~~~~~~~~~~~~~~~~~~~~Yf~~~~ 160 (235)
T PF14900_consen 119 DEFKKYFGSDYSVTVSTGAGGSVTFNKDETTSDRSAYFKAGE 160 (235)
T ss_pred HHHHhhhccceEEEEEccCCccEEEeeccCCCCcceEEECCC
Confidence 3466777777999999865888887 555 889999988
No 25
>PRK14023 homoaconitate hydratase small subunit; Provisional
Probab=48.16 E-value=23 Score=24.49 Aligned_cols=33 Identities=18% Similarity=0.073 Sum_probs=21.0
Q ss_pred HHHHHHHHHhhc----ccCCCceEEEEeCccceeEEE
Q 034366 38 ISIATHIKKEFD----KIYGGGWQCVVGSNFGCYFTH 70 (96)
Q Consensus 38 ~~iA~~IK~~lD----~~yg~~WhciVG~~Fgs~vt~ 70 (96)
.++++++-..+| +++.+..=+|.|+||||=-|.
T Consensus 29 ~~l~~~~f~~~~p~f~~~~~~g~IIVaG~NFG~GSSR 65 (166)
T PRK14023 29 DRFHNYAFAHLRPEFASTVRPGDILVAGRNFGLGSSR 65 (166)
T ss_pred HHHHhhhccCCChhhHhhcCCCCEEEccCcccCCccH
Confidence 466666655555 345566667777899975443
No 26
>PF11858 DUF3378: Domain of unknown function (DUF3378); InterPro: IPR024568 Ribonuclease HIII 3.1.26 from EC cleaves RNA from DNA-RNA hybrids. It catalyses endonucleolytic cleavage to 5'-phospho-monoesters. Two types of ribonuclease H in Bacillus subtilis, RNase HII (rnhB) and RNase HIII (rnhC), are both known experimentally and are quite similar to each other []. The only RNase H homologue in the Mycoplasmas resembles rnhC. Archaeal forms resemble HII more closely than HIII. This entry represents the uncharacterised N-terminal domain of bacterial RNase HIII.; PDB: 3ASM_A 2D0C_A 2D0B_A 2D0A_A.
Probab=46.78 E-value=30 Score=21.11 Aligned_cols=21 Identities=5% Similarity=0.026 Sum_probs=16.2
Q ss_pred cCcEEEEEeCCEEEEEEecCC
Q 034366 72 HGTFIYFSLETLNFLIFKAAI 92 (96)
Q Consensus 72 ~~~~i~F~~~~~~illfkt~~ 92 (96)
++..+.++.++..|.+|+|+.
T Consensus 29 p~~~f~aK~~~~tIt~Y~SGK 49 (81)
T PF11858_consen 29 PYAVFQAKYNGVTITAYKSGK 49 (81)
T ss_dssp TTEEEEEEETTEEEEEETTSE
T ss_pred CCEEEEEeCCCeEEEEEeCCe
Confidence 444455789999999999874
No 27
>PF12550 GCR1_C: Transcriptional activator of glycolytic enzymes; InterPro: IPR022210 This domain family is found in eukaryotes, and is approximately 80 amino acids in length. This family is activates the transcription of glycolytic enzymes.
Probab=45.79 E-value=6.7 Score=23.66 Aligned_cols=12 Identities=33% Similarity=1.057 Sum_probs=10.0
Q ss_pred HhhcccCCCceE
Q 034366 46 KEFDKIYGGGWQ 57 (96)
Q Consensus 46 ~~lD~~yg~~Wh 57 (96)
+.|+++||..|-
T Consensus 29 ~~le~~yG~~WR 40 (81)
T PF12550_consen 29 RSLEKKYGSKWR 40 (81)
T ss_pred HHHHHHhChhhc
Confidence 467899999996
No 28
>PF06457 Ectatomin: Ectatomin; InterPro: IPR009458 Ectatomin is a toxin from the venom of the ant Ectatomma tuberculatum. Ectatomin can efficiently insert into the plasma membrane, where it can form channels. Ectatomin was shown to inhibit L-type calcium currents in isolated rat cardiac myocytes []. In these cells, ectatomin induces a gradual, irreversible increase in ion leakage across the membrane, which can lead to cell death. Ectatomin is comprised of two subunits, A and B, which are homologous. The structure of ectatomin reveals that each subunit consists of two alpha helices with a connecting hinge region, which form a hairpin structure that is stabilised by disulphide bridges. A disulphide bridge between the hinge regions of the two subunits links the heterodimer together, forming a closed bundle of four helices with a left-handed twist [].; GO: 0005216 ion channel activity, 0009405 pathogenesis, 0005576 extracellular region; PDB: 1ECI_A.
Probab=45.14 E-value=9.6 Score=19.46 Aligned_cols=13 Identities=62% Similarity=0.603 Sum_probs=10.3
Q ss_pred HHHHHHHHHhhcc
Q 034366 38 ISIATHIKKEFDK 50 (96)
Q Consensus 38 ~~iA~~IK~~lD~ 50 (96)
.+||.+||+..|+
T Consensus 22 g~iat~ik~~c~k 34 (34)
T PF06457_consen 22 GSIATMIKRKCDK 34 (34)
T ss_dssp CCHHHHHHHHCH-
T ss_pred ccHHHHHHHHhCC
Confidence 4799999998764
No 29
>COG1570 XseA Exonuclease VII, large subunit [DNA replication, recombination, and repair]
Probab=44.10 E-value=35 Score=27.35 Aligned_cols=50 Identities=26% Similarity=0.517 Sum_probs=37.4
Q ss_pred HHHHHHHHHhhcccCCCceEEEEeCccceeEEEccCcEEEEEeC----CEEEEEEecC
Q 034366 38 ISIATHIKKEFDKIYGGGWQCVVGSNFGCYFTHKHGTFIYFSLE----TLNFLIFKAA 91 (96)
Q Consensus 38 ~~iA~~IK~~lD~~yg~~WhciVG~~Fgs~vt~~~~~~i~F~~~----~~~illfkt~ 91 (96)
.++..+||..||..+|..| |-|+ =|.++.-+.-..||.+. .+...+|+..
T Consensus 9 Seln~~ik~llE~~~~~V~--v~GE--ISn~t~~~sgH~YFtLKD~~A~i~c~mf~~~ 62 (440)
T COG1570 9 SELNDYIKRLLERDLGQVW--VRGE--ISNFTRPASGHLYFTLKDERAQIRCVMFKGN 62 (440)
T ss_pred HHHHHHHHHHHHhcCCeEE--EEEE--ecCCccCCCccEEEEEccCCceEEEEEEcCc
Confidence 6899999999999999999 5665 13445434448899985 5678888764
No 30
>PRK06489 hypothetical protein; Provisional
Probab=38.53 E-value=21 Score=26.66 Aligned_cols=33 Identities=9% Similarity=0.145 Sum_probs=22.6
Q ss_pred HHHHHHHHHhhcccCC-CceEEEEeCccceeEEE
Q 034366 38 ISIATHIKKEFDKIYG-GGWQCVVGSNFGCYFTH 70 (96)
Q Consensus 38 ~~iA~~IK~~lD~~yg-~~WhciVG~~Fgs~vt~ 70 (96)
.++|+.+...+..+.| ...+++||.|+|+.+.-
T Consensus 136 ~~~a~~~~~~l~~~lgi~~~~~lvG~SmGG~vAl 169 (360)
T PRK06489 136 DDMVEAQYRLVTEGLGVKHLRLILGTSMGGMHAW 169 (360)
T ss_pred HHHHHHHHHHHHHhcCCCceeEEEEECHHHHHHH
Confidence 5677766665533344 56788999999987653
No 31
>smart00099 btg1 tob/btg1 family. The tob/btg1 is a family of proteins that inhibit cell proliferation.
Probab=38.09 E-value=38 Score=21.95 Aligned_cols=39 Identities=13% Similarity=0.165 Sum_probs=26.6
Q ss_pred HHHHHHHHHHhhhcCC---CCHHHHHHHHHHhhcccCCCceE
Q 034366 19 VQAMASASQALDLYDV---VDSISIATHIKKEFDKIYGGGWQ 57 (96)
Q Consensus 19 ~~~~~~~~~a~~~~~~---~~~~~iA~~IK~~lD~~yg~~Wh 57 (96)
..++......+++++. +.....|+.|.+.|-.+|-+.|+
T Consensus 6 ~~av~Fl~~~l~~~~~l~~~~v~~F~~~L~~~L~~~y~~HWy 47 (108)
T smart00099 6 AAAVNFITSLLRKHNKLSKRRVEIFAEKLTRLLKEKYKNHWY 47 (108)
T ss_pred HHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHHHHHHHhcCCC
Confidence 4566777777777753 22345778888888888877775
No 32
>PF08958 DUF1871: Domain of unknown function (DUF1871); InterPro: IPR015053 This set of hypothetical proteins is produced by prokaryotes pertaining to the Bacillus genus. ; PDB: 1U84_A.
Probab=37.67 E-value=31 Score=21.00 Aligned_cols=29 Identities=28% Similarity=0.385 Sum_probs=19.8
Q ss_pred HHHHHhhhcCCCCHHHHHHHHHHhhcccCCC
Q 034366 24 SASQALDLYDVVDSISIATHIKKEFDKIYGG 54 (96)
Q Consensus 24 ~~~~a~~~~~~~~~~~iA~~IK~~lD~~yg~ 54 (96)
.+..++...+ +..++|+.|+.-|..-||.
T Consensus 26 ~Iv~~v~~~~--~~~~LA~~Iq~If~~SF~e 54 (79)
T PF08958_consen 26 DIVQAVHEND--DPEELAKKIQSIFEFSFGE 54 (79)
T ss_dssp HHHHHHTT-S---HHHHHHHHHHHHHHHHSS
T ss_pred HHHHHHHhCC--CHHHHHHHHHHHHHHHHcc
Confidence 3445555454 3489999999999888874
No 33
>cd05392 RasGAP_Neurofibromin_like Neurofibromin-like proteins include the Saccharomyces cerevisiae RasGAP proteins Ira1 and Ira2, the closest homolog of neurofibromin, which is responsible for the human autosomal dominant disease neurofibromatosis type I (NF1). The RasGAP Ira1/2 proteins are negative regulators of the Ras-cAMP signaling pathway and conserved from yeast to human. In yeast Ras proteins are activated by GEFs, and inhibited by two GAPs, Ira1 and Ira2. Ras proteins activate the cAMP/protein kinase A (PKA) pathway, which controls metabolism, stress resistance, growth, and meiosis. Recent studies showed that the kelch proteins Gpb1 and Gpb2 inhibit Ras activity via association with Ira1 and Ira2. Gpb1/2 bind to a conserved C-terminal domain of Ira1/2, and loss of Gpb1/2 results in a destabilization of Ira1 and Ira2, leading to elevated levels of Ras2-GTP and uninhibited cAMP-PKA signaling. Since the Gpb1/2 binding domain on Ira1/2 is conserved in the human neurofibromin prote
Probab=37.42 E-value=25 Score=26.31 Aligned_cols=41 Identities=17% Similarity=0.199 Sum_probs=27.5
Q ss_pred HHHHHHHHHhhhcCCCCHHHHHHHHHHhhcccCCCceEEEEe
Q 034366 20 QAMASASQALDLYDVVDSISIATHIKKEFDKIYGGGWQCVVG 61 (96)
Q Consensus 20 ~~~~~~~~a~~~~~~~~~~~iA~~IK~~lD~~yg~~WhciVG 61 (96)
.+++.....++.++.+ ++.+|+.|++.+.++|...+.-+||
T Consensus 134 ~~l~~I~~s~~~~P~~-lr~i~~~l~~~v~~kfp~~~~~~Vg 174 (323)
T cd05392 134 KFIDAIISSLDRFPPE-LREICHHIYEVVSEKFPDSALSAVG 174 (323)
T ss_pred HHHHHHHHhHHhCCHH-HHHHHHHHHHHHHHHCCCchHHHHH
Confidence 3334444455555543 5899999999999999875544444
No 34
>PF05184 SapB_1: Saposin-like type B, region 1; InterPro: IPR007856 Synonym(s):cerebroside sulphate activator, CSAct Saposin B is a small non-enzymatic glycoprotein required for the breakdown of cerebroside sulphates (sulphatides) in lysosomes. Saposin B contains three intramolecular disulphide bridges, exists as a dimer and is remarkably heat, protease, and pH stable. The crystal structure of human saposin B reveals an unusual shell-like dimer consisting of a monolayer of alpha-helices enclosing a large hydrophobic cavity. Although the secondary structure of saposin B is similar to that of the known monomeric members of the saposin-like superfamily, the helices are repacked into a different tertiary arrangement to form the homodimer. A comparison of the two forms of the saposin B dimer suggests that extraction of target lipids from membranes involves a conformational change that facilitates access to the inner cavity [].; GO: 0006629 lipid metabolic process; PDB: 1N69_C 1QDM_C 4DDJ_A 2DOB_A 1OF9_A 2Z9A_A 1M12_A 2GTG_A 1SN6_A 2QYP_B ....
Probab=37.25 E-value=57 Score=16.18 Aligned_cols=30 Identities=10% Similarity=0.045 Sum_probs=21.2
Q ss_pred HHHHHHHHHHHHHhhhcCCCCHHHHHHHHHHh
Q 034366 16 KMQVQAMASASQALDLYDVVDSISIATHIKKE 47 (96)
Q Consensus 16 ~~q~~~~~~~~~a~~~~~~~~~~~iA~~IK~~ 47 (96)
+.-+.++..+...++.-.++ .+|.+.|++.
T Consensus 5 ~~C~~~v~~i~~~l~~~~t~--~~I~~~l~~~ 34 (39)
T PF05184_consen 5 DICKFVVKEIEKLLKNNKTE--EEIKKALEKA 34 (39)
T ss_dssp HHHHHHHHHHHHHHHSTCHH--HHHHHHHHHH
T ss_pred hHHHHHHHHHHHHHHcCccH--HHHHHHHHHH
Confidence 34567888888888755555 6777777764
No 35
>PF15571 Imm25: Immunity protein 25
Probab=36.54 E-value=27 Score=23.24 Aligned_cols=24 Identities=25% Similarity=0.313 Sum_probs=20.2
Q ss_pred HHHHHHHHHHhhcccCCCceEEEE
Q 034366 37 SISIATHIKKEFDKIYGGGWQCVV 60 (96)
Q Consensus 37 ~~~iA~~IK~~lD~~yg~~WhciV 60 (96)
+.++.+.||+.+|.-||..|+-|-
T Consensus 17 fr~~r~~Ik~~~~~~~g~~~~~I~ 40 (124)
T PF15571_consen 17 FREIRNEIKELNDNLYGIEIESIA 40 (124)
T ss_pred HHHHHHHHHHHHccccccchhhhc
Confidence 467889999999999999987553
No 36
>PF10655 DUF2482: Hypothetical protein of unknown function (DUF2482); InterPro: IPR018917 This entry is represented by Bacteriophage 80, Orf10. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. All the members of this very small, very short family are derived from bacteriophages, of the SA bacteriophages 11, Mu50B, system, and from the Staphylococcal_phi-Mu50B-like_prophages subsystem. All members are hypothetical proteins.
Probab=35.30 E-value=22 Score=22.65 Aligned_cols=48 Identities=21% Similarity=0.342 Sum_probs=28.8
Q ss_pred eCCCCHHHHHHHHHHHHHHhhhcCCCCHHHHHHHHHHhhc--c-----------cCCCceEEEEeCccc
Q 034366 10 ETDMPVKMQVQAMASASQALDLYDVVDSISIATHIKKEFD--K-----------IYGGGWQCVVGSNFG 65 (96)
Q Consensus 10 ~~dM~~~~q~~~~~~~~~a~~~~~~~~~~~iA~~IK~~lD--~-----------~yg~~WhciVG~~Fg 65 (96)
+.||+++...+++. + -+. ++-|+|..|+++=| - -|-+.=||++|+-|+
T Consensus 5 yKdMTqeelr~lls------e-K~~-ELydL~~eI~kETeFdillfS~igv~~GD~~~ss~~alG~~~~ 65 (100)
T PF10655_consen 5 YKDMTQEELRDLLS------E-KNG-ELYDLANEIDKETEFDILLFSTIGVSNGDFISSSHCALGNPFG 65 (100)
T ss_pred hhhhhHHHHHHHHH------H-hhH-HHHHHHHHhcccceeeeeeeeeeccccCccccccchhhccHHH
Confidence 35777765555443 1 121 24689999987643 2 234567888888654
No 37
>PF00249 Myb_DNA-binding: Myb-like DNA-binding domain; InterPro: IPR014778 The retroviral oncogene v-myb, and its cellular counterpart c-myb, encode nuclear DNA-binding proteins. These belong to the SANT domain family that specifically recognise the sequence YAAC(G/T)G [, ]. In myb, one of the most conserved regions consisting of three tandem repeats has been shown to be involved in DNA-binding [].; PDB: 1X41_A 2XAF_B 2XAG_B 2XAH_B 2UXN_B 2Y48_B 2XAQ_B 2X0L_B 2IW5_B 2XAJ_B ....
Probab=35.28 E-value=51 Score=17.36 Aligned_cols=23 Identities=17% Similarity=0.151 Sum_probs=17.6
Q ss_pred HHHHHHhhhcCCCCHHHHHHHHH
Q 034366 23 ASASQALDLYDVVDSISIATHIK 45 (96)
Q Consensus 23 ~~~~~a~~~~~~~~~~~iA~~IK 45 (96)
+...+|+.+|+..+...||+.+.
T Consensus 10 ~~l~~~v~~~g~~~W~~Ia~~~~ 32 (48)
T PF00249_consen 10 EKLLEAVKKYGKDNWKKIAKRMP 32 (48)
T ss_dssp HHHHHHHHHSTTTHHHHHHHHHS
T ss_pred HHHHHHHHHhCCcHHHHHHHHcC
Confidence 35668888898765688998875
No 38
>TIGR01916 F420_cofE F420-0:gamma-glutamyl ligase. This model represents an enzyme of coenzyme F(420) biosynthesis, as catalyzed by MJ0768 of Methanococcus jannaschii and by the N-terminal half of FbiB of Mycobacterium bovis strain BCG. Note that only two glutamates are ligated in M. jannaschii, but five to six in the Mycobacterium lineage. In M. jannaschii, CofE catalyzes the GTP-dependent addition of two L-glutamates.
Probab=35.16 E-value=55 Score=24.16 Aligned_cols=34 Identities=18% Similarity=0.248 Sum_probs=29.4
Q ss_pred CCHHHHHHHHHHhhcccCCCceEEEEeCccceeE
Q 034366 35 VDSISIATHIKKEFDKIYGGGWQCVVGSNFGCYF 68 (96)
Q Consensus 35 ~~~~~iA~~IK~~lD~~yg~~WhciVG~~Fgs~v 68 (96)
+|...-|+.|++.|.+++|..=-|||..+||--.
T Consensus 122 ~DPd~sA~~ir~~l~~~~g~~v~VIItDt~gr~~ 155 (243)
T TIGR01916 122 EDPDASAEKIRRGLRELTGVDVGVIITDTNGRPF 155 (243)
T ss_pred CChHHHHHHHHHHHHHHHCCCEEEEEECCCCCcc
Confidence 3457889999999999999999999999998654
No 39
>PF10925 DUF2680: Protein of unknown function (DUF2680); InterPro: IPR024485 Members in this family of proteins are annotated as YckD however currently no function is known.
Probab=34.65 E-value=91 Score=17.85 Aligned_cols=15 Identities=33% Similarity=0.266 Sum_probs=13.0
Q ss_pred HHHHHHHHHhhcccC
Q 034366 38 ISIATHIKKEFDKIY 52 (96)
Q Consensus 38 ~~iA~~IK~~lD~~y 52 (96)
++-|+.||+.+|.++
T Consensus 35 qeqAd~ik~~id~~~ 49 (59)
T PF10925_consen 35 QEQADAIKKHIDQRQ 49 (59)
T ss_pred HHHHHHHHHHHHHHH
Confidence 788999999999865
No 40
>cd03545 Rieske_RO_Alpha_OHBDO_like Rieske non-heme iron oxygenase (RO) family, Ortho-halobenzoate-1,2-dioxygenase (OHBDO)-like subfamily, N-terminal Rieske domain of the oxygenase alpha subunit; composed of the oxygenase alpha subunits of OHBDO, salicylate 5-hydroxylase (S5H), terephthalate 1,2-dioxygenase system (TERDOS) and similar proteins. ROs comprise a large class of aromatic ring-hydroxylating dioxygenases that enable microorganisms to tolerate and utilize aromatic compounds for growth. The oxygenase alpha subunit contains an N-terminal Rieske domain with an [2Fe-2S] cluster and a C-terminal catalytic domain with a mononuclear Fe(II) binding site. The Rieske [2Fe-2S] cluster accepts electrons from a reductase or ferredoxin component and transfers them to the mononuclear iron for catalysis. OHBDO converts 2-chlorobenzoate (2-CBA) to catechol as well as 2,4-dCBA and 2,5-dCBA to 4-chlorocatechol, as part of the chlorobenzoate degradation pathway. Although ortho-substituted chlorobe
Probab=34.33 E-value=1.4e+02 Score=19.85 Aligned_cols=45 Identities=16% Similarity=0.274 Sum_probs=32.9
Q ss_pred HHHHHhhcccCC-CceEEEEeC-ccceeEEEccCcEEEEEeCCEEEEEEecC
Q 034366 42 THIKKEFDKIYG-GGWQCVVGS-NFGCYFTHKHGTFIYFSLETLNFLIFKAA 91 (96)
Q Consensus 42 ~~IK~~lD~~yg-~~WhciVG~-~Fgs~vt~~~~~~i~F~~~~~~illfkt~ 91 (96)
+....++++-|. ..|+.+.-. .. -+++.++-+.+++..++|+|..
T Consensus 11 ~~~~~E~~~if~~~~W~~v~~~~el-----~~~g~~~~~~i~g~~iiv~r~~ 57 (150)
T cd03545 11 AYFDREQERIFRGKTWSYVGLEAEI-----PNAGDFKSTFVGDTPVVVTRAE 57 (150)
T ss_pred HHHHHHHHhhhCCCceEEEEEHHHC-----CCCCCEEEEEECCceEEEEECC
Confidence 456677888895 999998764 32 1346777788988888998853
No 41
>PF08015 Pheromone: Fungal mating-type pheromone; InterPro: IPR012597 This family corresponds to mating-type pheromone proteins. The homobasidiomycetes, or mushroom fungi, have arguably the most complex mating system of all known organisms. Many species possess a mating system known as bifactorial incompatibility, where two unlinked loci control the mating-type of an individual incompatibility loci (the A and B mating-type loci). Each A mating-type sublocus encodes a pair of divergently transcribed homeodomain transcription factors while the genes responsible for B mating-type activity encode lipopeptide pheromones and G-protein -coupled pheromone receptors [].; GO: 0000772 mating pheromone activity, 0016020 membrane
Probab=34.27 E-value=22 Score=20.66 Aligned_cols=13 Identities=46% Similarity=0.963 Sum_probs=8.3
Q ss_pred hcccCCC--ceEEEE
Q 034366 48 FDKIYGG--GWQCVV 60 (96)
Q Consensus 48 lD~~yg~--~WhciV 60 (96)
.|++.|+ +|-|||
T Consensus 55 ~Er~~~g~~~~fCVI 69 (69)
T PF08015_consen 55 FERRGGGGAGAFCVI 69 (69)
T ss_pred ccccCCCCceEEEeC
Confidence 4455555 788886
No 42
>KOG3252 consensus Uncharacterized conserved protein [Function unknown]
Probab=34.18 E-value=54 Score=23.66 Aligned_cols=32 Identities=9% Similarity=0.166 Sum_probs=26.0
Q ss_pred CHHHHHHHHHHHHHHhhhcCCCCHHHHHHHHHH
Q 034366 14 PVKMQVQAMASASQALDLYDVVDSISIATHIKK 46 (96)
Q Consensus 14 ~~~~q~~~~~~~~~a~~~~~~~~~~~iA~~IK~ 46 (96)
.+.+|.. +.....+++.|++|++.+++.++..
T Consensus 3 feq~qs~-~~e~l~ginryNPE~latLe~yVq~ 34 (217)
T KOG3252|consen 3 FEQMQSN-VGELLVGINRYNPENLATLENYVQA 34 (217)
T ss_pred chHHHHH-HHHHHHhcccCChhHHHHHHHHHHH
Confidence 4677777 7778899999999888888888763
No 43
>TIGR03849 arch_ComA phosphosulfolactate synthase. This model finds the ComA (Coenzyme M biosynthesis A) protein, phosphosulfolactate synthase, in methanogenic archaea. The ComABC pathway is one of at least two pathways to the intermediate sulfopyruvate. Coenzyme M occurs rarely and sporadically outside of the archaea, as for expoxide metabolism in Xanthobacter autotrophicus Py2, but candidate phosphosulfolactate synthases from that and other species occur fall below the cutoff and outside the scope of this model. This model deliberately is narrower in scope than pfam02679.
Probab=34.07 E-value=1.5e+02 Score=21.78 Aligned_cols=48 Identities=8% Similarity=0.148 Sum_probs=33.1
Q ss_pred CCCCHHHHHHHHHHHHHH------------hhhcCCCCHHHHHHHHHHhhcccCCCceEEEEe
Q 034366 11 TDMPVKMQVQAMASASQA------------LDLYDVVDSISIATHIKKEFDKIYGGGWQCVVG 61 (96)
Q Consensus 11 ~dM~~~~q~~~~~~~~~a------------~~~~~~~~~~~iA~~IK~~lD~~yg~~WhciVG 61 (96)
.+||++.+..+++.+.+. -......+..+..+.+++.||. |.|+|||=
T Consensus 95 ~~i~~~~~~rlI~~~~~~g~~v~~EvG~K~~~~~~~~~~~~~i~~~~~~LeA---GA~~ViiE 154 (237)
T TIGR03849 95 MEISLEERCNLIERAKDNGFMVLSEVGKKSPEKDSELTPDDRIKLINKDLEA---GADYVIIE 154 (237)
T ss_pred cCCCHHHHHHHHHHHHhCCCeEeccccccCCcccccCCHHHHHHHHHHHHHC---CCcEEEEe
Confidence 579999999999988844 1111112235778888888874 67888774
No 44
>cd00197 VHS_ENTH_ANTH VHS, ENTH and ANTH domain superfamily; composed of proteins containing a VHS, ENTH or ANTH domain. The VHS domain is present in Vps27 (Vacuolar Protein Sorting), Hrs (Hepatocyte growth factor-regulated tyrosine kinase substrate) and STAM (Signal Transducing Adaptor Molecule). It is located at the N-termini of proteins involved in intracellular membrane trafficking. The epsin N-terminal homology (ENTH) domain is an evolutionarily conserved protein module found primarily in proteins that participate in clathrin-mediated endocytosis. A set of proteins previously designated as harboring an ENTH domain in fact contains a highly similar, yet unique module referred to as an AP180 N-terminal homology (ANTH) domain. VHS, ENTH and ANTH domains are structurally similar and are composed of a superhelix of eight alpha helices. ENTH adnd ANTH (E/ANTH) domains bind both inositol phospholipids and proteins and contribute to the nucleation and formation of clathrin coats on membra
Probab=33.45 E-value=37 Score=21.19 Aligned_cols=22 Identities=23% Similarity=0.202 Sum_probs=17.2
Q ss_pred CCHHHHHHHHHHhhcccCCCceEEE
Q 034366 35 VDSISIATHIKKEFDKIYGGGWQCV 59 (96)
Q Consensus 35 ~~~~~iA~~IK~~lD~~yg~~Whci 59 (96)
+..+++++.|++.|+.+ .|+++
T Consensus 33 ~~~~~~~~~l~kRl~~~---~~~~~ 54 (115)
T cd00197 33 VGPKEAVDAIKKRINNK---NPHVV 54 (115)
T ss_pred ccHHHHHHHHHHHhcCC---cHHHH
Confidence 34589999999999875 67654
No 45
>PRK10425 DNase TatD; Provisional
Probab=33.33 E-value=1.3e+02 Score=21.84 Aligned_cols=49 Identities=14% Similarity=0.183 Sum_probs=35.2
Q ss_pred CHHHHHHHHHHHHHHhhhcCC-------CCHHHHHHHHHHhhcccCCCceEEEEeC
Q 034366 14 PVKMQVQAMASASQALDLYDV-------VDSISIATHIKKEFDKIYGGGWQCVVGS 62 (96)
Q Consensus 14 ~~~~q~~~~~~~~~a~~~~~~-------~~~~~iA~~IK~~lD~~yg~~WhciVG~ 62 (96)
+.+.|.++.+.-.+...+++. +..+++.+-||+...+.-++.+||.-|.
T Consensus 102 ~~~~Q~~vF~~ql~lA~~~~~Pv~iH~r~a~~~~l~iL~~~~~~~~~~i~H~fsG~ 157 (258)
T PRK10425 102 TPEEQERAFVAQLAIAAELNMPVFMHCRDAHERFMALLEPWLDKLPGAVLHCFTGT 157 (258)
T ss_pred CHHHHHHHHHHHHHHHHHhCCCeEEEEeCchHHHHHHHHHhccCCCCeEEEecCCC
Confidence 356788888888888777752 2237788888875433337899999887
No 46
>PF08006 DUF1700: Protein of unknown function (DUF1700); InterPro: IPR012963 This family contains many hypothetical bacterial proteins and two putative membrane proteins (Q6GFD0 from SWISSPROT and Q6G806 from SWISSPROT).
Probab=33.22 E-value=1.2e+02 Score=20.58 Aligned_cols=39 Identities=15% Similarity=0.183 Sum_probs=27.6
Q ss_pred CCCCHHHHHHHHHHHHHHhhhcC----CC--------CHHHHHHHHHHhhc
Q 034366 11 TDMPVKMQVQAMASASQALDLYD----VV--------DSISIATHIKKEFD 49 (96)
Q Consensus 11 ~dM~~~~q~~~~~~~~~a~~~~~----~~--------~~~~iA~~IK~~lD 49 (96)
.+||++.++++++...+-++.-. +| +.+++|+.|+....
T Consensus 15 ~~lp~~e~~e~l~~Y~e~f~d~~~~G~sEeeii~~LG~P~~iA~~i~~~~~ 65 (181)
T PF08006_consen 15 KKLPEEEREEILEYYEEYFDDAGEEGKSEEEIIAELGSPKEIAREILAEYS 65 (181)
T ss_pred HcCCHHHHHHHHHHHHHHHHHhhhCCCCHHHHHHHcCCHHHHHHHHHHhhh
Confidence 36999999999998887776532 11 24788888876543
No 47
>PRK13293 F420-0--gamma-glutamyl ligase; Reviewed
Probab=33.11 E-value=61 Score=23.93 Aligned_cols=33 Identities=12% Similarity=0.176 Sum_probs=29.0
Q ss_pred CCHHHHHHHHHHhhcccCCCceEEEEeCcccee
Q 034366 35 VDSISIATHIKKEFDKIYGGGWQCVVGSNFGCY 67 (96)
Q Consensus 35 ~~~~~iA~~IK~~lD~~yg~~WhciVG~~Fgs~ 67 (96)
+|...-|+.|++.|.+++|..=-|||..+||--
T Consensus 123 ~DPd~SA~~ir~~l~~~~g~~v~VIItDt~gr~ 155 (245)
T PRK13293 123 ENPDESAERIREGLEELTGKKVGVIITDTNGRP 155 (245)
T ss_pred CCHHHHHHHHHHHHHHHHCCCEEEEEEcCCCcc
Confidence 445788999999999999999999999999843
No 48
>KOG2130 consensus Phosphatidylserine-specific receptor PtdSerR, contains JmjC domain [Chromatin structure and dynamics; Signal transduction mechanisms]
Probab=32.93 E-value=37 Score=26.51 Aligned_cols=18 Identities=22% Similarity=0.359 Sum_probs=14.6
Q ss_pred CCceEEEEeCccceeEEE
Q 034366 53 GGGWQCVVGSNFGCYFTH 70 (96)
Q Consensus 53 g~~WhciVG~~Fgs~vt~ 70 (96)
+|.||||+--...-.+|+
T Consensus 277 ~GWWHvVlNle~TIAiTq 294 (407)
T KOG2130|consen 277 SGWWHVVLNLEPTIAITQ 294 (407)
T ss_pred CCeEEEEeccCceeeeee
Confidence 779999998777777775
No 49
>PF06925 MGDG_synth: Monogalactosyldiacylglycerol (MGDG) synthase; InterPro: IPR009695 This entry represents a conserved region of approximately 180 residues found towirds the N terminus of a number of plant and bacterial diacylglycerol glucosyltransferases, such as monogalactosyldiacylglycerol synthase [].; GO: 0016758 transferase activity, transferring hexosyl groups, 0009247 glycolipid biosynthetic process
Probab=32.76 E-value=51 Score=21.97 Aligned_cols=27 Identities=15% Similarity=0.096 Sum_probs=22.9
Q ss_pred HHHHHHHHHhhcccCCCceEEEEeCcc
Q 034366 38 ISIATHIKKEFDKIYGGGWQCVVGSNF 64 (96)
Q Consensus 38 ~~iA~~IK~~lD~~yg~~WhciVG~~F 64 (96)
...|+.|++.|.+++|+...|.|..-|
T Consensus 2 ~~aA~Al~eal~~~~~~~~~v~v~D~~ 28 (169)
T PF06925_consen 2 NSAARALAEALERRRGPDAEVEVVDFL 28 (169)
T ss_pred HHHHHHHHHHHHhhcCCCCEEEEEehH
Confidence 357899999999999999999988744
No 50
>PF06057 VirJ: Bacterial virulence protein (VirJ); InterPro: IPR010333 This entry contains several bacterial VirJ virulence proteins. VirJ is thought to be involved in the type IV secretion system. It is thought that the substrate proteins localised to the periplasm may associate with the pilus in a manner that is mediated by VirJ, and suggest a two-step process for type IV secretion in Agrobacterium [].
Probab=32.47 E-value=47 Score=23.64 Aligned_cols=29 Identities=21% Similarity=0.339 Sum_probs=17.1
Q ss_pred HHHHHHHHhhcccCCCceEEEEeCccceeE
Q 034366 39 SIATHIKKEFDKIYGGGWQCVVGSNFGCYF 68 (96)
Q Consensus 39 ~iA~~IK~~lD~~yg~~WhciVG~~Fgs~v 68 (96)
++++.|..++. ++|..==+.||-|||+.|
T Consensus 53 Dl~~~i~~y~~-~w~~~~vvLiGYSFGADv 81 (192)
T PF06057_consen 53 DLARIIRHYRA-RWGRKRVVLIGYSFGADV 81 (192)
T ss_pred HHHHHHHHHHH-HhCCceEEEEeecCCchh
Confidence 34444443333 444444489999999865
No 51
>PF09477 Type_III_YscG: Bacterial type II secretion system chaperone protein (type_III_yscG); InterPro: IPR013348 YscG is a molecular chaperone for YscE, where both are part of the type III secretion system that in Yersinia is designated Ysc (Yersinia secretion). The secretion system delivers effector proteins, designated Yops (Yersinia outer proteins), in Yersinia. This entry consists of YscG from Yersinia, and functionally equivalent type III secretion proteins in other species: e.g. AscG in Aeromonas and LscG in Photorhabdus luminescens.; GO: 0009405 pathogenesis; PDB: 3PH0_D 2UWJ_G 2P58_C.
Probab=32.47 E-value=66 Score=21.17 Aligned_cols=34 Identities=15% Similarity=0.028 Sum_probs=23.2
Q ss_pred CCHHHHHHHHHHHHHHhhhcCCCCHHHHHHHHHH
Q 034366 13 MPVKMQVQAMASASQALDLYDVVDSISIATHIKK 46 (96)
Q Consensus 13 M~~~~q~~~~~~~~~a~~~~~~~~~~~iA~~IK~ 46 (96)
|+.++..-..++|.-|...+--++-..||..+++
T Consensus 1 M~~~l~~lLAElAL~atG~HcH~EA~tIa~wL~~ 34 (116)
T PF09477_consen 1 MNRELRRLLAELALMATGHHCHQEANTIADWLEQ 34 (116)
T ss_dssp --HHHHHHHHHHHHHHHTTT-HHHHHHHHHHHHH
T ss_pred CchHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHh
Confidence 7889999999999999888764333556666654
No 52
>TIGR01738 bioH putative pimeloyl-BioC--CoA transferase BioH. This CoA-binding enzyme is required for the production of pimeloyl-coenzyme A, the substrate of the BioF protein early in the biosynthesis of biotin. Its exact function is unknown, but is proposed in ref 2. This enzyme belongs to the alpha/beta hydrolase fold family (pfam model pfam00561). Members of this family are restricted to the Proteobacteria.
Probab=32.35 E-value=30 Score=22.71 Aligned_cols=29 Identities=14% Similarity=0.094 Sum_probs=21.8
Q ss_pred CHHHHHHHHHHhhcccCCCceEEEEeCccceeEE
Q 034366 36 DSISIATHIKKEFDKIYGGGWQCVVGSNFGCYFT 69 (96)
Q Consensus 36 ~~~~iA~~IK~~lD~~yg~~WhciVG~~Fgs~vt 69 (96)
++.++++.|...++++ ..+||.|+|+.+.
T Consensus 51 ~~~~~~~~~~~~~~~~-----~~lvG~S~Gg~~a 79 (245)
T TIGR01738 51 SLADAAEAIAAQAPDP-----AIWLGWSLGGLVA 79 (245)
T ss_pred CHHHHHHHHHHhCCCC-----eEEEEEcHHHHHH
Confidence 3578888888877632 4689999999775
No 53
>PRK13007 succinyl-diaminopimelate desuccinylase; Reviewed
Probab=32.13 E-value=2e+02 Score=21.01 Aligned_cols=39 Identities=13% Similarity=0.256 Sum_probs=28.8
Q ss_pred CCHHHHHHHHHHHHHHhhhcCC-CCHHHHHHHHHHhhccc
Q 034366 13 MPVKMQVQAMASASQALDLYDV-VDSISIATHIKKEFDKI 51 (96)
Q Consensus 13 M~~~~q~~~~~~~~~a~~~~~~-~~~~~iA~~IK~~lD~~ 51 (96)
|+.++..++++...+.++.-.. .+..++|++|++.|++.
T Consensus 1 ~~~~~~~~~~~~l~~li~ips~s~~e~~~~~~l~~~l~~~ 40 (352)
T PRK13007 1 MTLDLAADLAELTAALVDIPSVSGDEKALADAVEAALRAL 40 (352)
T ss_pred CccchHHHHHHHHHHHhcCCCCCchHHHHHHHHHHHHHhC
Confidence 4455677888888888875442 12378999999999986
No 54
>PF09868 DUF2095: Uncharacterized protein conserved in archaea (DUF2095); InterPro: IPR018662 This domain, found in various hypothetical prokaryotic proteins, has no known function.
Probab=31.98 E-value=1e+02 Score=20.49 Aligned_cols=17 Identities=18% Similarity=0.257 Sum_probs=14.9
Q ss_pred HHHHHHHHHhhccc----CCC
Q 034366 38 ISIATHIKKEFDKI----YGG 54 (96)
Q Consensus 38 ~~iA~~IK~~lD~~----yg~ 54 (96)
.+.|+.|+..|.++ ||+
T Consensus 95 ~e~A~eLr~~L~~kGvr~fG~ 115 (128)
T PF09868_consen 95 PEEAKELRSILVKKGVRSFGS 115 (128)
T ss_pred HHHHHHHHHHHHHhhHHHhCC
Confidence 78999999999987 665
No 55
>cd05498 Bromo_Brdt_II_like Bromodomain, Brdt_like subfamily, repeat II. Human Brdt is a testis-specific member of the BET subfamily of bromodomain proteins; the first bromodomain in Brdt has been shown to be essential for male germ cell differentiation. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
Probab=31.61 E-value=1.3e+02 Score=18.45 Aligned_cols=38 Identities=8% Similarity=0.090 Sum_probs=26.4
Q ss_pred HHHHHHHHHHHHHHhhhcCCC-CHHHHHHHHHHhhcccC
Q 034366 15 VKMQVQAMASASQALDLYDVV-DSISIATHIKKEFDKIY 52 (96)
Q Consensus 15 ~~~q~~~~~~~~~a~~~~~~~-~~~~iA~~IK~~lD~~y 52 (96)
++...++--+...|..-++.. ..-..|..|++.|+++|
T Consensus 64 ~ef~~D~~li~~Na~~yn~~~s~i~~~a~~l~~~fe~~~ 102 (102)
T cd05498 64 QEFAADVRLMFSNCYKYNPPDHPVHAMARKLQDVFEDRW 102 (102)
T ss_pred HHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHhC
Confidence 556777777777776644432 23578999999998875
No 56
>PF00352 TBP: Transcription factor TFIID (or TATA-binding protein, TBP); InterPro: IPR000814 The TATA-box binding protein (TBP) is required for the initiation of transcription by RNA polymerases I, II and III, from promoters with or without a TATA box [, ]. TBP associates with a host of factors, including the general transcription factors TFIIA, -B, -D, -E, and -H, to form huge multi-subunit pre-initiation complexes on the core promoter. Through its association with different transcription factors, TBP can initiate transcription from different RNA polymerases. There are several related TBPs, including TBP-like (TBPL) proteins []. The C-terminal core of TBP (~180 residues) is highly conserved and contains two 77-amino acid repeats that produce a saddle-shaped structure that straddles the DNA; this region binds to the TATA box and interacts with transcription factors and regulatory proteins []. By contrast, the N-terminal region varies in both length and sequence.; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent, 0006367 transcription initiation from RNA polymerase II promoter; PDB: 1D3U_A 1PCZ_B 1AIS_A 1NGM_A 1TBP_A 1TBA_B 1YTB_A 1RM1_A 1YTF_A 1NH2_A ....
Probab=31.27 E-value=72 Score=19.13 Aligned_cols=26 Identities=19% Similarity=0.360 Sum_probs=16.8
Q ss_pred eEEEccCcE--EEEEeC--CEEEEEEecCC
Q 034366 67 YFTHKHGTF--IYFSLE--TLNFLIFKAAI 92 (96)
Q Consensus 67 ~vt~~~~~~--i~F~~~--~~~illfkt~~ 92 (96)
.+.|+|..| +.+++. +..++||+||.
T Consensus 30 ~~~YePe~fpgl~~r~~~p~~t~~IF~sGk 59 (86)
T PF00352_consen 30 NVEYEPERFPGLIYRLRNPKATVLIFSSGK 59 (86)
T ss_dssp TEEEETTTESSEEEEETTTTEEEEEETTSE
T ss_pred CcEEeeccCCeEEEeecCCcEEEEEEcCCE
Confidence 346777633 445553 78888888874
No 57
>PF06840 DUF1241: Protein of unknown function (DUF1241); InterPro: IPR009652 This family consists of several programmed cell death 10 protein (PDCD10 or TFAR15) sequences. The function of this family is unknown.; PDB: 3L8I_A 3RQG_B 3RQE_B 3L8J_A 3RQF_B 3AJM_B.
Probab=31.24 E-value=35 Score=23.47 Aligned_cols=14 Identities=43% Similarity=0.413 Sum_probs=12.6
Q ss_pred HHHHHHHHHHhhcc
Q 034366 37 SISIATHIKKEFDK 50 (96)
Q Consensus 37 ~~~iA~~IK~~lD~ 50 (96)
.|+||..||+-||.
T Consensus 122 IK~IAsaIK~lLdA 135 (154)
T PF06840_consen 122 IKEIASAIKKLLDA 135 (154)
T ss_dssp HHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHH
Confidence 58999999999995
No 58
>PF01739 CheR: CheR methyltransferase, SAM binding domain; InterPro: IPR022642 Methyl transfer from the ubiquitous S-adenosyl-L-methionine (AdoMet) to either nitrogen, oxygen or carbon atoms is frequently employed in diverse organisms ranging from bacteria to plants and mammals. The reaction is catalysed by methyltransferases (Mtases) and modifies DNA, RNA, proteins and small molecules, such as catechol for regulatory purposes. The various aspects of the role of DNA methylation in prokaryotic restriction-modification systems and in a number of cellular processes in eukaryotes including gene regulation and differentiation is well documented. Three classes of DNA Mtases transfer the methyl group from AdoMet to the target base to form either N-6-methyladenine, or N-4-methylcytosine, or C-5- methylcytosine. In C-5-cytosine Mtases, ten conserved motifs are arranged in the same order []. Motif I (a glycine-rich or closely related consensus sequence; FAGxGG in M.HhaI []), shared by other AdoMet-Mtases [], is part of the cofactor binding site and motif IV (PCQ) is part of the catalytic site. In contrast, sequence comparison among N-6-adenine and N-4-cytosine Mtases indicated two of the conserved segments [], although more conserved segments may be present. One of them corresponds to motif I in C-5-cytosine Mtases, and the other is named (D/N/S)PP(Y/F). Crystal structures are known for a number of Mtases [, , , ]. The cofactor binding sites are almost identical and the essential catalytic amino acids coincide. The comparable protein folding and the existence of equivalent amino acids in similar secondary and tertiary positions indicate that many (if not all) AdoMet-Mtases have a common catalytic domain structure. This permits tertiary structure prediction of other DNA, RNA, protein, and small-molecule AdoMet-Mtases from their amino acid sequences []. Flagellated bacteria swim towards favourable chemicals and away from deleterious ones. Sensing of chemoeffector gradients involves chemotaxis receptors, transmembrane (TM) proteins that detect stimuli through their periplasmic domains and transduce the signals via their cytoplasmic domains []. Signalling outputs from these receptors are influenced both by the binding of the chemoeffector ligand to their periplasmic domains and by methylation of specific glutamate residues on their cytoplasmic domains. Methylation is catalysed by CheR, an S-adenosylmethionine-dependent methyltransferase [], which reversibly methylates specific glutamate residues within a coiled coil region, to form gamma-glutamyl methyl ester residues [, ]. The structure of the Salmonella typhimurium chemotaxis receptor methyltransferase CheR, bound to S-adenosylhomocysteine, has been determined to a resolution of 2.0 A []. The structure reveals CheR to be a two-domain protein, with a smaller N-terminal helical domain linked via a single polypeptide connection to a larger C-terminal alpha/beta domain. The C-terminal domain has the characteristics of a nucleotide-binding fold, with an insertion of a small anti-parallel beta-sheet subdomain. The S-adenosylhomocysteine-binding site is formed mainly by the large domain, with contributions from residues within the N-terminal domain and the linker region []. CheR proteins are part of the chemotaxis signaling mechanism which methylates the chemotaxis receptor at specific glutamate residues. This entry refers to the C-terminal SAM-binding domain of the CherR-type MCP methyltransferases, which are found in bacteria, archaea and green plants. This entry is found in association with PF03705 from PFAM. ; PDB: 1AF7_A 1BC5_A.
Probab=31.17 E-value=23 Score=24.88 Aligned_cols=62 Identities=16% Similarity=0.269 Sum_probs=35.5
Q ss_pred CceEEeCCCCHHHHHHHHHHHHHHhhhcCCCCHHHH-HHHHHHhhcccCCCceEEEEeCccceeEEEccCc
Q 034366 5 KASIKETDMPVKMQVQAMASASQALDLYDVVDSISI-ATHIKKEFDKIYGGGWQCVVGSNFGCYFTHKHGT 74 (96)
Q Consensus 5 ~~~I~~~dM~~~~q~~~~~~~~~a~~~~~~~~~~~i-A~~IK~~lD~~yg~~WhciVG~~Fgs~vt~~~~~ 74 (96)
++.|..+|.++...+.| +.++ |+....+.+ ...++++|++.-|+.|. |....=..|+....+
T Consensus 64 ~~~I~atDi~~~~L~~A----r~G~--Y~~~~~~~~~~~~~~ryf~~~~~~~~~--v~~~lr~~V~F~~~N 126 (196)
T PF01739_consen 64 DFRILATDISPSALEKA----RAGI--YPERSLRGLPPAYLRRYFTERDGGGYR--VKPELRKMVRFRRHN 126 (196)
T ss_dssp SEEEEEEES-HHHHHHH----HHTE--EEGGGGTTS-HHHHHHHEEEE-CCCTT--E-HHHHTTEEEEE--
T ss_pred ceEEEEEECCHHHHHHH----HhCC--CCHHHHhhhHHHHHHHhccccCCCcee--EChHHcCceEEEecc
Confidence 68999999999865544 4433 443222344 56788999888788887 444444444443333
No 59
>cd01674 Homoaconitase_Swivel Homoaconitase swivel domain. This family includes homoaconitase and other uncharacterized proteins of the Aconitase family. Homoaconitase is part of an unusual lysine biosynthesis pathway found only in filamentous fungi, in which lysine is synthesized via the alpha-aminoadipate pathway. In this pathway, homoaconitase catalyzes the conversion of cis-homoaconitic acid into homoisocitric acid. The reaction mechanism is believed to be similar to that of other aconitases. This is the swivel domain, which is believed to undergo swivelling conformational change in the enzyme mechanism.
Probab=30.57 E-value=54 Score=21.96 Aligned_cols=32 Identities=25% Similarity=0.393 Sum_probs=19.0
Q ss_pred HHHHHHHHHhhcccC----CCceEEEE-eCccceeEEE
Q 034366 38 ISIATHIKKEFDKIY----GGGWQCVV-GSNFGCYFTH 70 (96)
Q Consensus 38 ~~iA~~IK~~lD~~y----g~~WhciV-G~~Fgs~vt~ 70 (96)
+++|+++-+.+|..| .+. .+|| |+||||=-+.
T Consensus 25 e~la~~~~e~~dp~f~~~v~~g-dilVaG~nFG~GSSR 61 (129)
T cd01674 25 EKMAEVCMENYDSEFSTKTKQG-DILVSGFNFGTGSSR 61 (129)
T ss_pred HHHHHhhcccCCchhhhcCCCC-CEEEeCCccCCCCcH
Confidence 567777766666544 223 4555 4699975443
No 60
>cd01578 AcnA_Mitochon_Swivel Mitochondrial aconitase A swivel domain. Aconitase (also known as aconitate hydratase and citrate hydro-lyase) catalyzes the reversible isomerization of citrate and isocitrate as part of the TCA cycle. This is the aconitase swivel domain, which undergoes swivelling conformational change in the enzyme mechanism. In eukaryotes two isozymes of aconitase are known to exist: one found in the mitochondrial matrix and the other found in the cytoplasm. This is the mitochondrial form. The mitochondrial product is coded by a nuclear gene. Most members of this subfamily are mitochondrial but there are some bacterial members.
Probab=30.16 E-value=29 Score=23.76 Aligned_cols=19 Identities=21% Similarity=0.342 Sum_probs=14.1
Q ss_pred CCCceEEEEeCccceeEEE
Q 034366 52 YGGGWQCVVGSNFGCYFTH 70 (96)
Q Consensus 52 yg~~WhciVG~~Fgs~vt~ 70 (96)
-|-.|-+|.|++||+=-+.
T Consensus 67 ~g~~~iIVaG~nyG~GSSR 85 (149)
T cd01578 67 HGIKWVVIGDENYGEGSSR 85 (149)
T ss_pred cCCCeEEEccCccCCCCch
Confidence 3657989999999975443
No 61
>PF06884 DUF1264: Protein of unknown function (DUF1264); InterPro: IPR010686 This family contains a number of bacterial and eukaryotic proteins of unknown function that are approximately 200 residues long. Some family members are annotated as putative lipoproteins.
Probab=30.06 E-value=39 Score=23.71 Aligned_cols=19 Identities=21% Similarity=0.450 Sum_probs=14.2
Q ss_pred HHHHHHHhhcccCCCceEE
Q 034366 40 IATHIKKEFDKIYGGGWQC 58 (96)
Q Consensus 40 iA~~IK~~lD~~yg~~Whc 58 (96)
+...+-+.+-+.||.+||-
T Consensus 98 ae~~~m~~l~~tYGKt~Ht 116 (171)
T PF06884_consen 98 AEKAEMEKLVKTYGKTWHT 116 (171)
T ss_pred HHHHHHHHHHhhhCCeEEe
Confidence 4455667777899999984
No 62
>COG1999 Uncharacterized protein SCO1/SenC/PrrC, involved in biogenesis of respiratory and photosynthetic systems [General function prediction only]
Probab=29.64 E-value=1.1e+02 Score=21.49 Aligned_cols=56 Identities=13% Similarity=0.197 Sum_probs=37.8
Q ss_pred HHHHHHHHHhhcccCCCceEEEEe---------CccceeE---------EEccCcEEEEEe---CCEEEEEEecCCC
Q 034366 38 ISIATHIKKEFDKIYGGGWQCVVG---------SNFGCYF---------THKHGTFIYFSL---ETLNFLIFKAAIT 93 (96)
Q Consensus 38 ~~iA~~IK~~lD~~yg~~WhciVG---------~~Fgs~v---------t~~~~~~i~F~~---~~~~illfkt~~~ 93 (96)
+|-++.+|+++...+.+.|+=+-| +.|+.+. .|...|...|++ ++.....+..+..
T Consensus 116 rDtp~~lk~Y~~~~~~~~~~~ltg~~~~~~~~~k~~~V~~~~v~~~~~~~y~~~Hs~~~~lid~~G~~~~~~~~~~~ 192 (207)
T COG1999 116 RDTPEVLKKYAELNFDPRWIGLTGTPEQIEEVAKAYGVFYSKVPLDDSQNYTIDHSAGFYLIDADGRFLGTYDYGEP 192 (207)
T ss_pred CCCHHHHHHHhcccCCCCeeeeeCCHHHHHHHHHHhcceeeecccCCCCCceeeeeeEEEEECCCCeEEEEecCCCC
Confidence 677999999999888999999999 3666664 444555555553 3444444444433
No 63
>PF01996 F420_ligase: F420-0:Gamma-glutamyl ligase; InterPro: IPR002847 This entry contains F420-0:gamma-glutamyl ligase and related proteins. F420-0:gamma-glutamyl ligase catalyzes the GTP-dependent successive addition of multiple gamma-linked L-glutamates to the L-lactyl phosphodiester of 7,8-didemethyl-8-hydroxy-5-deazariboflavin (F420-0) to form polyglutamated F420 derivatives [, , , ].; PDB: 2G9I_A 2PHN_A.
Probab=29.37 E-value=40 Score=24.31 Aligned_cols=32 Identities=19% Similarity=0.229 Sum_probs=24.8
Q ss_pred CCHHHHHHHHHHhhcccCCCceEEEEeCccce
Q 034366 35 VDSISIATHIKKEFDKIYGGGWQCVVGSNFGC 66 (96)
Q Consensus 35 ~~~~~iA~~IK~~lD~~yg~~WhciVG~~Fgs 66 (96)
+|....|+.|++.|.+++|..=.|||..++|-
T Consensus 130 ~dPd~sA~~i~~~l~~~~g~~v~ViI~Dt~gr 161 (228)
T PF01996_consen 130 EDPDASARRIREELKERTGKDVGVIITDTNGR 161 (228)
T ss_dssp S-HHHHHHHHHHHHHHHHS---EEEEEEEEEE
T ss_pred CCHHHHHHHHHHHHHHHHCCceEEEEECCCCc
Confidence 45688999999999999999999999887773
No 64
>PF06202 GDE_C: Amylo-alpha-1,6-glucosidase ; InterPro: IPR010401 This family includes human glycogen branching enzyme P35573 from SWISSPROT. This enzyme contains a number of distinct catalytic activities. It has been shown for the yeast homologue O93808 from SWISSPROT that mutations in this region disrupt the enzymes Amylo-alpha-1,6-glucosidase (3.2.1.33 from EC).; GO: 0004135 amylo-alpha-1,6-glucosidase activity, 0005978 glycogen biosynthetic process
Probab=28.97 E-value=71 Score=24.45 Aligned_cols=17 Identities=24% Similarity=0.387 Sum_probs=15.3
Q ss_pred CHHHHHHHHHHhhcccC
Q 034366 36 DSISIATHIKKEFDKIY 52 (96)
Q Consensus 36 ~~~~iA~~IK~~lD~~y 52 (96)
.+++.|+.||+.|+++|
T Consensus 193 ~~~~~A~~lk~~F~~~F 209 (370)
T PF06202_consen 193 RYREWAERLKESFEKRF 209 (370)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 36789999999999998
No 65
>PF01545 Cation_efflux: Cation efflux family; InterPro: IPR002524 Members of this family are integral membrane proteins, that are found to increase tolerance to divalent metal ions such as cadmium, zinc, and cobalt. These proteins are considered to be efflux pumps that remove these ions from cells [, ], however others are implicated in ion uptake []. The family has six predicted transmembrane domains. Members of the family are variable in length because of variably sized inserts, often containing low-complexity sequence.; GO: 0008324 cation transmembrane transporter activity, 0006812 cation transport, 0055085 transmembrane transport, 0016021 integral to membrane; PDB: 3BYP_B 3BYR_A 3H90_B 2QFI_B 2ZZT_A.
Probab=28.88 E-value=62 Score=23.07 Aligned_cols=23 Identities=17% Similarity=0.367 Sum_probs=20.7
Q ss_pred HHHHHHHHHhhcccCCCceEEEE
Q 034366 38 ISIATHIKKEFDKIYGGGWQCVV 60 (96)
Q Consensus 38 ~~iA~~IK~~lD~~yg~~WhciV 60 (96)
.++++.|++.+-+++++.++|.|
T Consensus 254 ~~i~~~i~~~l~~~~~~i~~v~I 276 (284)
T PF01545_consen 254 HEIRERIEKRLREKFPGIYDVTI 276 (284)
T ss_dssp HHHHHHHHHHHHHHSTTCEEEEE
T ss_pred HHHHHHHHHHHHHhCCCcEEEEE
Confidence 56899999999999999999876
No 66
>cd05135 RasGAP_RASAL Ras GTPase activating-like protein (RASAL) or RASAL1 is a member of the GAP1 family, and a Ca2+ sensor responding in-phase to repetitive Ca2+ signals by associating with the plasma membrane and deactivating Ras. It contains a conserved domain structure comprising N-terminal tandem C2 domains, a highly conserved central RasGAP domain, and a C-terminal pleckstrin-homology domain that is associated with a Bruton's tyrosine kinase motif. RASAL, like Ca2+ -promoted Ras inactivator (CAPRI, or RASAL4), is a cytosolic protein that undergoes a rapid translocation to the plasma membrane in response to receptor-mediated elevation in the concentration of intracellular free Ca2+, a translocation that activates its ability to function as a RasGAP. However, unlike RASAL4, RASAL undergoes an oscillatory translocation to the plasma membrane that occurs in synchrony with repetitive Ca2+ spikes.
Probab=28.87 E-value=96 Score=23.75 Aligned_cols=35 Identities=6% Similarity=0.005 Sum_probs=25.3
Q ss_pred HHHHHHHhhhcCCCCHHHHHHHHHHhhcccCCCceE
Q 034366 22 MASASQALDLYDVVDSISIATHIKKEFDKIYGGGWQ 57 (96)
Q Consensus 22 ~~~~~~a~~~~~~~~~~~iA~~IK~~lD~~yg~~Wh 57 (96)
++.+...+++++.. ++.+++.|++.+.++|...++
T Consensus 159 ~~~I~~S~~~~P~~-lR~i~~~l~~~v~~kFp~~~~ 193 (333)
T cd05135 159 TDAIVGSVSQCPPV-MRLTFKQLHKRVEERFPEAEN 193 (333)
T ss_pred HHHHHhhHHhCCHH-HHHHHHHHHHHHHHHCCCCcc
Confidence 33444444555532 589999999999999998875
No 67
>cd05391 RasGAP_p120GAP p120GAP is a negative regulator of Ras that stimulates hydrolysis of bound GTP to GDP. Once the Ras regulator p120GAP, a member of the GAP protein family, is recruited to the membrane, it is transiently immobilized to interact with Ras-GTP. The down regulation of Ras by p120GAP is a critical step in the regulation of many cellular processes, which is disrupted in approximately 30% of human cancers. p120GAP contains SH2, SH3, PH, calcium- and lipid-binding domains, suggesting its involvement in a complex network of cellular interactions in vivo.
Probab=28.76 E-value=39 Score=25.66 Aligned_cols=37 Identities=8% Similarity=-0.004 Sum_probs=26.5
Q ss_pred HHHHHHHHHhhhcCCCCHHHHHHHHHHhhcccCCCceE
Q 034366 20 QAMASASQALDLYDVVDSISIATHIKKEFDKIYGGGWQ 57 (96)
Q Consensus 20 ~~~~~~~~a~~~~~~~~~~~iA~~IK~~lD~~yg~~Wh 57 (96)
++++.+..+++.++.+ .+.+++.|++.+.++|...+.
T Consensus 134 ~~~~~I~~S~~~~P~~-lr~i~~~l~~~v~~kfp~~~~ 170 (315)
T cd05391 134 ELVEKIFMAAEILPPT-LRYIYGCLQKSVQAKWPTNTT 170 (315)
T ss_pred HHHHHHHHhHHhCCHH-HHHHHHHHHHHHHHHCCCchh
Confidence 3444444555556643 589999999999999987664
No 68
>PF01743 PolyA_pol: Poly A polymerase head domain; InterPro: IPR002646 This group includes nucleic acid independent RNA polymerases, such as polynucleotide adenylyltransferase (2.7.7.19 from EC), which adds the poly (A) tail to mRNA. This group also includes the tRNA nucleotidyltransferase that adds the CCA to the 3' of the tRNA 2.7.7.25 from EC.; GO: 0003723 RNA binding, 0016779 nucleotidyltransferase activity, 0006396 RNA processing; PDB: 1VFG_A 3H38_A 3H3A_B 3H39_B 3H37_A 1MIY_A 1MIV_B 1MIW_B 1OU5_B 3AQN_A ....
Probab=28.47 E-value=1.6e+02 Score=18.78 Aligned_cols=37 Identities=24% Similarity=0.285 Sum_probs=27.1
Q ss_pred HHHHHHhhcccCCCceEEEEeCccceeEEEccCcEEEEE
Q 034366 41 ATHIKKEFDKIYGGGWQCVVGSNFGCYFTHKHGTFIYFS 79 (96)
Q Consensus 41 A~~IK~~lD~~yg~~WhciVG~~Fgs~vt~~~~~~i~F~ 79 (96)
+..+.+.|.++++..+++ |++|+..--+..+..+.+.
T Consensus 28 ~~~~~~~l~~~~~~~~~~--~~~~~~~~~~~~~~~~di~ 64 (126)
T PF01743_consen 28 PEEFAKLLAKKLGGVFVV--GKRFGTVRVVFGGGSIDIA 64 (126)
T ss_dssp HHHHHHHHCTTCCEEEEE--ETTTTEEEEEETTCEEEEE
T ss_pred HHHHHHHHHhhccccccc--ccccceeeecCCCcccccc
Confidence 456777788888888877 9999987766666555543
No 69
>PF10440 WIYLD: Ubiquitin-binding WIYLD domain; InterPro: IPR018848 This entry represents a presumed domain which has been predicted to contain three alpha helices. It was named the WIYLD domain based on the pattern of the ost conserved residues []. This domain appears to be specific to plant SET-domain proteins. ; GO: 0018024 histone-lysine N-methyltransferase activity
Probab=28.35 E-value=64 Score=19.05 Aligned_cols=42 Identities=19% Similarity=0.448 Sum_probs=26.7
Q ss_pred HHHHHHHhhhcCCCCHHHHHHHHHHhhcccCCCceEEEEeCccc
Q 034366 22 MASASQALDLYDVVDSISIATHIKKEFDKIYGGGWQCVVGSNFG 65 (96)
Q Consensus 22 ~~~~~~a~~~~~~~~~~~iA~~IK~~lD~~yg~~WhciVG~~Fg 65 (96)
++-|..|++.++-.+ +.+-.-||+-| +-||+.|--|=-.||.
T Consensus 11 ~daA~dam~~lG~~~-~~v~~vl~~LL-~lY~~nW~lIEed~Y~ 52 (65)
T PF10440_consen 11 IDAALDAMRQLGFSK-KQVRPVLKNLL-KLYDGNWELIEEDNYR 52 (65)
T ss_pred HHHHHHHHHHcCCCH-HHHHHHHHHHH-HHHcCCchhhhcccHH
Confidence 466788888888651 34444455444 5788889776555543
No 70
>PF14372 DUF4413: Domain of unknown function (DUF4413)
Probab=27.84 E-value=1e+02 Score=19.11 Aligned_cols=29 Identities=24% Similarity=0.371 Sum_probs=18.2
Q ss_pred HHHHhhhcCCC--CHHHHHHHHHHhhcccCC
Q 034366 25 ASQALDLYDVV--DSISIATHIKKEFDKIYG 53 (96)
Q Consensus 25 ~~~a~~~~~~~--~~~~iA~~IK~~lD~~yg 53 (96)
+..++++...+ .++++|+.+++.+|+-+.
T Consensus 19 i~~~l~~~~~~d~~l~~ma~~M~~KfdKYw~ 49 (101)
T PF14372_consen 19 IKDLLRDWNNDDPDLKNMAKKMKEKFDKYWK 49 (101)
T ss_pred HHHHHHHhccCCHHHHHHHHHHHHHHHHHHH
Confidence 33444544433 257888888888887654
No 71
>COG1352 CheR Methylase of chemotaxis methyl-accepting proteins [Cell motility and secretion / Signal transduction mechanisms]
Probab=27.74 E-value=67 Score=23.84 Aligned_cols=49 Identities=16% Similarity=0.287 Sum_probs=33.1
Q ss_pred CCceEEeCCCCHHHHHHHHHHHHHHhhhcCCCC-HHHH-HHHHHHhhcccCCCceEE
Q 034366 4 GKASIKETDMPVKMQVQAMASASQALDLYDVVD-SISI-ATHIKKEFDKIYGGGWQC 58 (96)
Q Consensus 4 ~~~~I~~~dM~~~~q~~~~~~~~~a~~~~~~~~-~~~i-A~~IK~~lD~~yg~~Whc 58 (96)
..+.|..+|.+....+.|.. -.|+..+ ++.+ .+.++++|++.-|+.|.+
T Consensus 128 ~~~~I~AtDId~~~L~~A~~------G~Y~~~~~~~~~~~~~~~ryF~~~~~~~y~v 178 (268)
T COG1352 128 FRVKILATDIDLSVLEKARA------GIYPSRELLRGLPPELLRRYFERGGDGSYRV 178 (268)
T ss_pred CceEEEEEECCHHHHHHHhc------CCCChhHhhccCCHHHHhhhEeecCCCcEEE
Confidence 47899999999987766643 2345221 1222 467789999988877765
No 72
>PF10302 DUF2407: DUF2407 ubiquitin-like domain; InterPro: IPR019413 This entry represents a family of proteins of unknown function found in fungi. They contain a characteristic GFDRL sequence motif.
Probab=27.03 E-value=31 Score=21.71 Aligned_cols=7 Identities=29% Similarity=0.956 Sum_probs=5.7
Q ss_pred eEEEEeC
Q 034366 56 WQCVVGS 62 (96)
Q Consensus 56 WhciVG~ 62 (96)
=||+||.
T Consensus 90 IhCsIGd 96 (97)
T PF10302_consen 90 IHCSIGD 96 (97)
T ss_pred EEEeccC
Confidence 4999995
No 73
>KOG1569 consensus 50S ribosomal protein L1 [Translation, ribosomal structure and biogenesis]
Probab=26.79 E-value=59 Score=24.97 Aligned_cols=28 Identities=18% Similarity=0.071 Sum_probs=19.2
Q ss_pred HHHHHHHHHhhcc------cCCCce-EEEEeC-ccc
Q 034366 38 ISIATHIKKEFDK------IYGGGW-QCVVGS-NFG 65 (96)
Q Consensus 38 ~~iA~~IK~~lD~------~yg~~W-hciVG~-~Fg 65 (96)
.+|++.|-+...- .++..| ||.||+ +|.
T Consensus 226 ~nipemieeFk~G~~i~~d~~~~~~~~~~vGkl~mt 261 (323)
T KOG1569|consen 226 RNIPEMIEEFKNGHEIKFDEERENILQIKVGKLDMT 261 (323)
T ss_pred cchHHHHHHhhCCcccccccccCceeeeeeeeecCC
Confidence 4566665554443 357889 999999 875
No 74
>TIGR02087 LEUD_arch 3-isopropylmalate dehydratase, small subunit. This subfamily is most closely related to the 3-isopropylmalate dehydratase, small subunits which form TIGR00171. This subfamily includes the members of TIGR02084 which are gene clustered with other genes of leucine biosynthesis. The rest of the subfamily includes mainly archaeal species which exhibit two hits to this model. In these cases it is possible that one or the other of the hits does not have a 3-isopropylmalate dehydratase activity but rather one of the other related aconitase-like activities.
Probab=26.69 E-value=47 Score=22.65 Aligned_cols=32 Identities=31% Similarity=0.468 Sum_probs=20.1
Q ss_pred HHHHHHHHHhhcccC----CCceEEEEeCccceeEE
Q 034366 38 ISIATHIKKEFDKIY----GGGWQCVVGSNFGCYFT 69 (96)
Q Consensus 38 ~~iA~~IK~~lD~~y----g~~WhciVG~~Fgs~vt 69 (96)
.++++++-..+|..| .+.-=+|.|+||||=-|
T Consensus 27 ~~l~~~~f~~~~p~f~~~~~~g~iiVaG~NFG~GSS 62 (154)
T TIGR02087 27 DELASHAMEGIDPEFAKKVRPGDVIVAGKNFGCGSS 62 (154)
T ss_pred HHHHhhccCcCCchhhhcCCCCcEEEcCCcccCCcc
Confidence 567777776666444 44545555679997544
No 75
>COG1498 SIK1 Protein implicated in ribosomal biogenesis, Nop56p homolog [Translation, ribosomal structure and biogenesis]
Probab=26.21 E-value=81 Score=24.98 Aligned_cols=40 Identities=15% Similarity=0.190 Sum_probs=33.8
Q ss_pred EeCCCCHHHHHHHHHHHHHHhhhcCCCCHHHHHHHHHHhhcc
Q 034366 9 KETDMPVKMQVQAMASASQALDLYDVVDSISIATHIKKEFDK 50 (96)
Q Consensus 9 ~~~dM~~~~q~~~~~~~~~a~~~~~~~~~~~iA~~IK~~lD~ 50 (96)
...+|++.+...+..+|....+.+..- +++.+||.+.|+.
T Consensus 200 mG~~~~~~Di~~i~~~ae~i~~L~~~R--~~l~~Yi~~~M~~ 239 (395)
T COG1498 200 MGADLSEEDIDNIRELAEIILELYELR--EQLEEYIESKMSE 239 (395)
T ss_pred cccCCChhhHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHH
Confidence 457899999999999999999988765 7889999888874
No 76
>PRK08775 homoserine O-acetyltransferase; Provisional
Probab=25.94 E-value=45 Score=24.60 Aligned_cols=33 Identities=18% Similarity=0.180 Sum_probs=24.5
Q ss_pred HHHHHHHHHhhcccCCCceEEEEeCccceeEEE
Q 034366 38 ISIATHIKKEFDKIYGGGWQCVVGSNFGCYFTH 70 (96)
Q Consensus 38 ~~iA~~IK~~lD~~yg~~WhciVG~~Fgs~vt~ 70 (96)
.+.|+.|...+|..--..-+++||.|+|+.+..
T Consensus 121 ~~~a~dl~~ll~~l~l~~~~~lvG~SmGG~vA~ 153 (343)
T PRK08775 121 ADQADAIALLLDALGIARLHAFVGYSYGALVGL 153 (343)
T ss_pred HHHHHHHHHHHHHcCCCcceEEEEECHHHHHHH
Confidence 567888888888653334578999999987654
No 77
>PF08908 DUF1852: Domain of unknown function (DUF1852); InterPro: IPR015004 This group of proteins are functionally uncharacterised.
Probab=25.93 E-value=35 Score=25.84 Aligned_cols=23 Identities=26% Similarity=0.410 Sum_probs=20.3
Q ss_pred hhcccCCCceEEEEeCccceeEE
Q 034366 47 EFDKIYGGGWQCVVGSNFGCYFT 69 (96)
Q Consensus 47 ~lD~~yg~~WhciVG~~Fgs~vt 69 (96)
-+|++-|..=.=|||.||.|+|.
T Consensus 98 IvD~kt~~rieGivGNnFSSYVR 120 (322)
T PF08908_consen 98 IVDHKTNERIEGIVGNNFSSYVR 120 (322)
T ss_pred EEecCCCceecceeccccccccc
Confidence 47888888888999999999986
No 78
>PRK09111 DNA polymerase III subunits gamma and tau; Validated
Probab=25.84 E-value=70 Score=26.43 Aligned_cols=28 Identities=18% Similarity=0.413 Sum_probs=25.2
Q ss_pred HHHHHHHHHhhcccCCCceEEEEeCccc
Q 034366 38 ISIATHIKKEFDKIYGGGWQCVVGSNFG 65 (96)
Q Consensus 38 ~~iA~~IK~~lD~~yg~~WhciVG~~Fg 65 (96)
++++..|++.|..-.|..|.|.|.+..|
T Consensus 505 ~~~~~~l~~~l~~~t~~~w~v~~~~~~~ 532 (598)
T PRK09111 505 RDLAQRLARKLEEWTGRRWVVSVSREGG 532 (598)
T ss_pred hHHHHHHHHHHHHHhCCceEEEEecCCC
Confidence 7899999999999999999999987554
No 79
>PF02283 CobU: Cobinamide kinase / cobinamide phosphate guanyltransferase; InterPro: IPR003203 This family is composed of a group of bifunctional cobalbumin biosynthesis enzymes which display cobinamide kinase and cobinamide phosphate guanyltransferase activity. The crystal structure of the enzyme reveals the molecule to be a trimer with a propeller-like shape [].; GO: 0000166 nucleotide binding, 0043752 adenosylcobinamide kinase activity, 0051188 cofactor biosynthetic process; PDB: 1CBU_C 1C9K_B.
Probab=25.79 E-value=77 Score=21.61 Aligned_cols=23 Identities=13% Similarity=0.302 Sum_probs=20.3
Q ss_pred HHHHHHHHHhhcccCCCceEEEEe
Q 034366 38 ISIATHIKKEFDKIYGGGWQCVVG 61 (96)
Q Consensus 38 ~~iA~~IK~~lD~~yg~~WhciVG 61 (96)
.|+++.|+++-.++ |..|+.|=-
T Consensus 37 ~em~~RI~~H~~~R-~~~w~tiE~ 59 (167)
T PF02283_consen 37 EEMRERIARHRQRR-PKGWITIEE 59 (167)
T ss_dssp HHHHHHHHHHHHHS-STCEEEEE-
T ss_pred HHHHHHHHHHHHhC-CCCcEEEec
Confidence 68999999999999 999999865
No 80
>PF12697 Abhydrolase_6: Alpha/beta hydrolase family; PDB: 3LLC_A 3A2N_E 3A2M_A 3A2L_A 3AFI_F 3C5V_A 3C5W_P 3E0X_A 2ZJF_A 3QYJ_A ....
Probab=25.77 E-value=43 Score=21.49 Aligned_cols=32 Identities=16% Similarity=0.135 Sum_probs=23.3
Q ss_pred HHHHHHHHHHhhcccCCCceEEEEeCccceeEE
Q 034366 37 SISIATHIKKEFDKIYGGGWQCVVGSNFGCYFT 69 (96)
Q Consensus 37 ~~~iA~~IK~~lD~~yg~~WhciVG~~Fgs~vt 69 (96)
..+.|+.|.+.+|+.-.+..+ +||.|+|+.+.
T Consensus 49 ~~~~~~~l~~~l~~~~~~~~~-lvG~S~Gg~~a 80 (228)
T PF12697_consen 49 IEDYAEDLAELLDALGIKKVI-LVGHSMGGMIA 80 (228)
T ss_dssp HHHHHHHHHHHHHHTTTSSEE-EEEETHHHHHH
T ss_pred hhhhhhhhhhccccccccccc-ccccccccccc
Confidence 356777788888776556654 78999998764
No 81
>cd01577 IPMI_Swivel Aconatase-like swivel domain of 3-isopropylmalate dehydratase and related uncharacterized proteins. 3-isopropylmalate dehydratase catalyzes the isomerization between 2-isopropylmalate and 3-isopropylmalate, via the formation of 2-isopropylmaleate 3-isopropylmalate. IPMI is involved in fungal and bacterial leucine biosynthesis and is also found in eukaryotes. This is the aconitase-like swivel domain, which is believed to undergo swivelling conformational change in the enzyme mechanism.
Probab=25.54 E-value=35 Score=21.36 Aligned_cols=15 Identities=47% Similarity=0.716 Sum_probs=11.1
Q ss_pred ceEEEEeCccceeEE
Q 034366 55 GWQCVVGSNFGCYFT 69 (96)
Q Consensus 55 ~WhciVG~~Fgs~vt 69 (96)
.+=+|.|++|||==+
T Consensus 18 ~~ilVaG~nfG~GSS 32 (91)
T cd01577 18 GDIIVAGKNFGCGSS 32 (91)
T ss_pred CCEEEecCcccCCCc
Confidence 567788889997544
No 82
>PF03869 Arc: Arc-like DNA binding domain; InterPro: IPR005569 Arc repressor act by the cooperative binding of two Arc repressor dimers to a 21-base-pair operator site. Each Arc dimer uses an antiparallel beta-sheet to recognise bases in the major groove [].; GO: 0003677 DNA binding; PDB: 3QOQ_D 1MNT_B 1QTG_B 1BDV_A 1PAR_C 1BDT_C 1ARR_B 1MYL_F 1MYK_A 1NLA_B ....
Probab=25.52 E-value=84 Score=17.20 Aligned_cols=35 Identities=11% Similarity=0.205 Sum_probs=23.9
Q ss_pred CCHHHHHHHHHHHHHHhhhcCCCCHHHHHHHHHHhhccc
Q 034366 13 MPVKMQVQAMASASQALDLYDVVDSISIATHIKKEFDKI 51 (96)
Q Consensus 13 M~~~~q~~~~~~~~~a~~~~~~~~~~~iA~~IK~~lD~~ 51 (96)
||+++.+.+-+.|..--. +.+ .+|...|.+.|.+.
T Consensus 11 lP~~l~~~lk~~A~~~gR--S~N--sEIv~~L~~~l~~e 45 (50)
T PF03869_consen 11 LPEELKEKLKERAEENGR--SMN--SEIVQRLEEALKKE 45 (50)
T ss_dssp CEHHHHHHHHHHHHHTTS---HH--HHHHHHHHHHHHHC
T ss_pred CCHHHHHHHHHHHHHhCC--ChH--HHHHHHHHHHHhcc
Confidence 888888887776654332 122 68888888888754
No 83
>KOG3990 consensus Uncharacterized conserved protein [Function unknown]
Probab=25.40 E-value=57 Score=24.58 Aligned_cols=41 Identities=15% Similarity=0.242 Sum_probs=32.6
Q ss_pred HHHHHHHHHHHhhhcCC----CCHHHHHHHHHHhhcc------cCCCceEE
Q 034366 18 QVQAMASASQALDLYDV----VDSISIATHIKKEFDK------IYGGGWQC 58 (96)
Q Consensus 18 q~~~~~~~~~a~~~~~~----~~~~~iA~~IK~~lD~------~yg~~Whc 58 (96)
-+.+...+..++++|.+ +..+=+|.+|...-++ +|||.-.|
T Consensus 53 t~t~CkkCah~~~kfG~P~pC~~CkiiaAF~g~kc~rctn~e~kyGpp~~C 103 (305)
T KOG3990|consen 53 TNTICKKCAHNVRKFGTPKPCQYCKIIAAFIGRKCQRCTNSEKKYGPPLLC 103 (305)
T ss_pred hhhHHHHHHHHHHhcCCCCcchhhhhhhhhccchhhhccchhhccCCchhH
Confidence 34567788899999975 3457899999998888 99997665
No 84
>PF07055 Eno-Rase_FAD_bd: Enoyl reductase FAD binding domain; InterPro: IPR010758 This family contains a number of bacterial putative reductases. ; GO: 0016491 oxidoreductase activity, 0055114 oxidation-reduction process; PDB: 3S8M_A 3ZU5_A 3ZU3_A 3ZU4_A 3ZU2_A.
Probab=25.14 E-value=1.6e+02 Score=17.39 Aligned_cols=47 Identities=13% Similarity=0.345 Sum_probs=27.9
Q ss_pred CCCceEEeCCCCHHHHHHHHHHHHHHhhhcCCCCHHHHH--HHHHHhhcccCC
Q 034366 3 EGKASIKETDMPVKMQVQAMASASQALDLYDVVDSISIA--THIKKEFDKIYG 53 (96)
Q Consensus 3 ~~~~~I~~~dM~~~~q~~~~~~~~~a~~~~~~~~~~~iA--~~IK~~lD~~yg 53 (96)
++.+++=.-.|.++.|.++ .+...+-++++.++++ +--|+.|-+-+|
T Consensus 5 ~grIR~DdwEl~~dvQ~~V----~~lw~~it~en~~el~D~~gy~~eFl~L~G 53 (65)
T PF07055_consen 5 EGRIRMDDWELRPDVQAEV----AELWEQITTENFKELGDYDGYRQEFLQLFG 53 (65)
T ss_dssp TS-EES-CCCCSHHHHHHH----HHHHCCT-CCCHHHHS-HHHHHHHHHHHTT
T ss_pred ccCcccchhhcCHHHHHHH----HHHHHHhccccHHHHHhHHHHHHHHHHHcC
Confidence 4566667778888888765 4556666777767765 334555555444
No 85
>PF04622 ERG2_Sigma1R: ERG2 and Sigma1 receptor like protein; InterPro: IPR006716 This family consists of the fungal C-8 sterol isomerase and mammalian sigma1 receptor. C-8 sterol isomerase (delta-8--delta-7 sterol isomerase), catalyses a reaction in ergosterol biosynthesis, which results in unsaturation at C-7 in the B ring of sterols []. Sigma 1 receptor is a low molecular mass mammalian protein located in the endoplasmic reticulum [], which interacts with endogenous steroid hormones, such as progesterone and testosterone []. It also binds the sigma ligands, which are a set of chemically unrelated drugs including haloperidol, pentazocine, and ditolylguanidine []. Sigma1 effectors are not well understood, but sigma1 agonists have been observed to affect NMDA receptor function, the alpha-adrenergic system and opioid analgesia.; GO: 0000247 C-8 sterol isomerase activity, 0006696 ergosterol biosynthetic process, 0005783 endoplasmic reticulum
Probab=25.02 E-value=1.2e+02 Score=22.00 Aligned_cols=34 Identities=21% Similarity=0.183 Sum_probs=25.8
Q ss_pred HHHHHHHHHhhhcCCCCHHHHHHHHHHhhcccCCC
Q 034366 20 QAMASASQALDLYDVVDSISIATHIKKEFDKIYGG 54 (96)
Q Consensus 20 ~~~~~~~~a~~~~~~~~~~~iA~~IK~~lD~~yg~ 54 (96)
.+.++|++++.+++. +.+++-..|-+.|-++|++
T Consensus 35 ~l~~ia~~~ia~~~~-~~~~~~~~l~~~L~~~y~~ 68 (216)
T PF04622_consen 35 VLHEIAKKAIARHPN-DTEEILSDLVDELRKKYPD 68 (216)
T ss_pred HHHHHHHHHHhhcCC-CHHHHHHHHHHHHHhHCCC
Confidence 445677788887764 3477888888889999987
No 86
>PF13798 PCYCGC: Protein of unknown function with PCYCGC motif
Probab=24.89 E-value=31 Score=23.88 Aligned_cols=34 Identities=21% Similarity=0.223 Sum_probs=24.0
Q ss_pred HHHHHHHHHHHhhhcCCCCHHHHHHHHHHhhcccCCC
Q 034366 18 QVQAMASASQALDLYDVVDSISIATHIKKEFDKIYGG 54 (96)
Q Consensus 18 q~~~~~~~~~a~~~~~~~~~~~iA~~IK~~lD~~yg~ 54 (96)
..-.+++|..|+..+..- + -.+.|++..|++|..
T Consensus 114 C~vCl~ia~~a~~~~~~G--k-s~~eIR~~ID~kYk~ 147 (158)
T PF13798_consen 114 CGVCLDIAVQAVQMYQEG--K-SPKEIRQYIDEKYKE 147 (158)
T ss_pred cHHHHHHHHHHHHHHHcC--C-CHHHHHHHHHHHHHh
Confidence 344677888888877643 3 367788888988863
No 87
>cd02860 Pullulanase_N_term Pullulanase domain N-terminus. Pullulanase (AKA dextrinase; alpha-dextrin endo-1,6-alpha glucosidase) is an enzyme with action similar to that of isoamylase; it cleaves 1,6-alpha-glucosidic linkages in pullulan, amylopectin, and glycogen, and in alpha-and beta-amylase limit-dextrins of amylopectin and glycogen. The N-terminus of pullulanase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase.
Probab=24.80 E-value=82 Score=19.09 Aligned_cols=17 Identities=12% Similarity=0.431 Sum_probs=13.5
Q ss_pred hhcccCCCceEEEEeCc
Q 034366 47 EFDKIYGGGWQCVVGSN 63 (96)
Q Consensus 47 ~lD~~yg~~WhciVG~~ 63 (96)
.|+++-|+.||+.|-..
T Consensus 40 ~m~~~~~gvw~~~v~~~ 56 (100)
T cd02860 40 QMKRGENGVWSVTLDGD 56 (100)
T ss_pred eeecCCCCEEEEEeCCc
Confidence 46677899999999753
No 88
>PRK13601 putative L7Ae-like ribosomal protein; Provisional
Probab=24.69 E-value=1.4e+02 Score=18.15 Aligned_cols=24 Identities=4% Similarity=-0.103 Sum_probs=20.1
Q ss_pred CCceEEeCCCCHHHHHHHHHHHHH
Q 034366 4 GKASIKETDMPVKMQVQAMASASQ 27 (96)
Q Consensus 4 ~~~~I~~~dM~~~~q~~~~~~~~~ 27 (96)
-+.+|+..|-+++..+++.+.|..
T Consensus 25 akLViiA~Da~~~~~k~i~~~c~~ 48 (82)
T PRK13601 25 VLQVYIAKDAEEHVTKKIKELCEE 48 (82)
T ss_pred eeEEEEeCCCCHHHHHHHHHHHHh
Confidence 467889999999999998887755
No 89
>cd05128 RasGAP_GAP1_like The GAP1 family of Ras GTPase-activating proteins includes GAP1(m) (or RASA2), GAP1_IP4BP (or RASA3), Ca2+ -promoted Ras inactivator (CAPRI, or RASAL4), and Ras GTPase activating-like proteins (RASAL) or RASAL1. The members are characterized by a conserved domain structure comprising N-terminal tandem C2 domains, a highly conserved central RasGAP domain, and a C-terminal pleckstrin homology domain that is associated with a Bruton's tyrosine kinase motif. While this domain structure is conserved, a small change in the function of each individual domain and the interaction between domains has a marked effect on the regulation of each protein.
Probab=24.50 E-value=61 Score=24.56 Aligned_cols=34 Identities=9% Similarity=0.151 Sum_probs=23.8
Q ss_pred HHHHHHHHhhhcCCCCHHHHHHHHHHhhcccCCCc
Q 034366 21 AMASASQALDLYDVVDSISIATHIKKEFDKIYGGG 55 (96)
Q Consensus 21 ~~~~~~~a~~~~~~~~~~~iA~~IK~~lD~~yg~~ 55 (96)
+++.+...+..++.. ++.+++.|++.+.++|...
T Consensus 141 ~~~~I~~S~~~~P~~-lr~i~~~l~~~v~~kFp~~ 174 (315)
T cd05128 141 LFEAITKSSVSCPTV-MCDIFYQLRERVGERFPGD 174 (315)
T ss_pred HHHHHHHHHHhCCHH-HHHHHHHHHHHHHHHCCCC
Confidence 333444444455543 5899999999999999875
No 90
>PRK06765 homoserine O-acetyltransferase; Provisional
Probab=24.50 E-value=58 Score=25.14 Aligned_cols=34 Identities=12% Similarity=0.134 Sum_probs=27.1
Q ss_pred CHHHHHHHHHHhhcccCCCceEEEEeCccceeEE
Q 034366 36 DSISIATHIKKEFDKIYGGGWQCVVGSNFGCYFT 69 (96)
Q Consensus 36 ~~~~iA~~IK~~lD~~yg~~WhciVG~~Fgs~vt 69 (96)
...++|..+.+-+|+.--...++|||.|.|+.++
T Consensus 142 t~~d~~~~~~~ll~~lgi~~~~~vvG~SmGG~ia 175 (389)
T PRK06765 142 TILDFVRVQKELIKSLGIARLHAVMGPSMGGMQA 175 (389)
T ss_pred cHHHHHHHHHHHHHHcCCCCceEEEEECHHHHHH
Confidence 3578999999999765446688999999998765
No 91
>PLN00072 3-isopropylmalate isomerase/dehydratase small subunit; Provisional
Probab=24.12 E-value=70 Score=23.68 Aligned_cols=61 Identities=15% Similarity=0.295 Sum_probs=33.0
Q ss_pred CCceEEeCCCCHHHHHHHHHHHHHHhhhcCC--CCHHHHHHHHHHhhcc----cC---C---CceEEEEeC-ccceeEE
Q 034366 4 GKASIKETDMPVKMQVQAMASASQALDLYDV--VDSISIATHIKKEFDK----IY---G---GGWQCVVGS-NFGCYFT 69 (96)
Q Consensus 4 ~~~~I~~~dM~~~~q~~~~~~~~~a~~~~~~--~~~~~iA~~IK~~lD~----~y---g---~~WhciVG~-~Fgs~vt 69 (96)
+++.+...|.+.+.-. -.+.+..... .+..++++++-..+|. +| | +.+.+|||. ||||==+
T Consensus 71 Grv~k~gDNIdTD~Ii-----Pa~~l~~~~sn~~~~~~l~~~~F~~l~~~~~~r~v~~Gd~~~~~~IIVaG~NFGcGSS 144 (246)
T PLN00072 71 GLCFVVGDNIDTDQII-----PAEYLTLVPSKPDEYEKLGSYALIGLPAFYKTRFVEPGEMKTKYSIIIGGENFGCGSS 144 (246)
T ss_pred EeEEEeCCCcchhhcc-----cHHHhccccccCCCHHHHHHhhhccCCcchhhcccCCCCCCCCceEEEecCcccCCCc
Confidence 4455566666666322 1223332211 2346788888766652 22 2 247777775 9997444
No 92
>KOG3938 consensus RGS-GAIP interacting protein GIPC, contains PDZ domain [Signal transduction mechanisms; Intracellular trafficking, secretion, and vesicular transport]
Probab=23.92 E-value=1.5e+02 Score=22.63 Aligned_cols=23 Identities=22% Similarity=0.362 Sum_probs=21.0
Q ss_pred CCCCHHHHHHHHHHHHHHhhhcC
Q 034366 11 TDMPVKMQVQAMASASQALDLYD 33 (96)
Q Consensus 11 ~dM~~~~q~~~~~~~~~a~~~~~ 33 (96)
.|+++++.+++++.+.+.++.|-
T Consensus 250 Ed~~~e~~~~~i~kindllesym 272 (334)
T KOG3938|consen 250 EDQPDEFEEAAIEKINDLLESYM 272 (334)
T ss_pred ccchhHHHHHHHHHHHHHHHHhc
Confidence 58999999999999999999984
No 93
>COG0066 LeuD 3-isopropylmalate dehydratase small subunit [Amino acid transport and metabolism]
Probab=23.78 E-value=40 Score=24.04 Aligned_cols=19 Identities=47% Similarity=0.749 Sum_probs=12.5
Q ss_pred cCCCceEEEEeCccceeEE
Q 034366 51 IYGGGWQCVVGSNFGCYFT 69 (96)
Q Consensus 51 ~yg~~WhciVG~~Fgs~vt 69 (96)
.|-+.=-.|+|+||||=-|
T Consensus 59 ~yq~g~IlVag~NFGcGSS 77 (191)
T COG0066 59 PYQGGDILVAGENFGCGSS 77 (191)
T ss_pred ccCCccEEEecCCCCCCcc
Confidence 3433446788889998544
No 94
>CHL00168 pbsA heme oxygenase; Provisional
Probab=23.56 E-value=91 Score=22.75 Aligned_cols=38 Identities=13% Similarity=0.020 Sum_probs=26.3
Q ss_pred CCCCHHHHHHHHHHHHHHhhhcCCCCHHHHHHHHHHhhc
Q 034366 11 TDMPVKMQVQAMASASQALDLYDVVDSISIATHIKKEFD 49 (96)
Q Consensus 11 ~dM~~~~q~~~~~~~~~a~~~~~~~~~~~iA~~IK~~lD 49 (96)
.+++++.+..+++.|+.|+.... +-..|++..+++.+.
T Consensus 179 l~l~e~e~~~iI~EA~~AF~lN~-~vf~eL~~~~~~~~~ 216 (238)
T CHL00168 179 LPLSDDQIQNIIAEANIAFNLNM-KMFQELNSSFIKIIT 216 (238)
T ss_pred CCCCHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHH
Confidence 36899999999999999998433 223455555555443
No 95
>cd05130 RasGAP_Neurofibromin Neurofibromin is the product of the neurofibromatosis type 1 gene (NF1) and shares a region of similarity with catalytic domain of the mammalian p120RasGAP protein and an extended similarity with the Saccharomyces cerevisiae RasGAP proteins Ira1 and Ira2. Neurofibromin has been shown to function as a GAP (GTPase-activating protein) which inhibits low molecular weight G proteins such as Ras by stimulating their intrinsic GTPase activity. NF1 is a common genetic disorder characterized by various symptoms ranging from predisposition for the development of tumors to learning disability or mental retardation. Loss of neurofibromin activity can be correlated to the increase in Ras-GTP concentration in neurofibromas of NF1 of patients, supporting the notion that unregulated Ras signaling may contribute to their development.
Probab=23.55 E-value=1.4e+02 Score=22.78 Aligned_cols=31 Identities=3% Similarity=-0.053 Sum_probs=22.2
Q ss_pred HHHHhhhcCCCCHHHHHHHHHHhhcccCCCce
Q 034366 25 ASQALDLYDVVDSISIATHIKKEFDKIYGGGW 56 (96)
Q Consensus 25 ~~~a~~~~~~~~~~~iA~~IK~~lD~~yg~~W 56 (96)
+...+++++.+ ++.+|+.|++.+.++|...+
T Consensus 148 I~~S~~~~P~~-lR~i~~~l~~~v~~kFP~~~ 178 (329)
T cd05130 148 IINSSSEFPPQ-LRSVCHCLYQVVSQRFPNKA 178 (329)
T ss_pred HHHhHHhCCHH-HHHHHHHHHHHHHHHCCCcc
Confidence 33334444432 58999999999999998755
No 96
>KOG4194 consensus Membrane glycoprotein LIG-1 [Signal transduction mechanisms]
Probab=23.04 E-value=50 Score=28.15 Aligned_cols=24 Identities=33% Similarity=0.714 Sum_probs=19.3
Q ss_pred CCceEEEEeCccceeEEEccCcEE
Q 034366 53 GGGWQCVVGSNFGCYFTHKHGTFI 76 (96)
Q Consensus 53 g~~WhciVG~~Fgs~vt~~~~~~i 76 (96)
.|..||||...|||.+++-.+--+
T Consensus 586 ~grYQCVvtN~FGStysqk~KltV 609 (873)
T KOG4194|consen 586 EGRYQCVVTNHFGSTYSQKAKLTV 609 (873)
T ss_pred CceEEEEEecccCcchhheeEEEe
Confidence 589999999999999887554333
No 97
>cd05503 Bromo_BAZ2A_B_like Bromodomain, BAZ2A/BAZ2B_like subfamily. Bromo adjacent to zinc finger 2A (BAZ2A) and 2B (BAZ2B) were identified as a novel human bromodomain gene by cDNA library screening. BAZ2A is also known as Tip5 (Transcription termination factor I-interacting protein 5) and hWALp3. The proteins may play roles in transcriptional regulation. Human Tip5 is part of a complex termed NoRC (nucleolar remodeling complex), which induces nucleosome sliding and may play a role in the regulation of the rDNA locus. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
Probab=23.03 E-value=1.9e+02 Score=17.64 Aligned_cols=38 Identities=8% Similarity=0.042 Sum_probs=25.3
Q ss_pred HHHHHHHHHHHHHHhhhcCCCC-HHHHHHHHHHhhcccC
Q 034366 15 VKMQVQAMASASQALDLYDVVD-SISIATHIKKEFDKIY 52 (96)
Q Consensus 15 ~~~q~~~~~~~~~a~~~~~~~~-~~~iA~~IK~~lD~~y 52 (96)
++...++.-+...|..-+.... .-..|..|++.|++++
T Consensus 59 ~ef~~D~~li~~Na~~yN~~~s~i~~~a~~l~~~f~~~~ 97 (97)
T cd05503 59 EEFAEDVRLVFDNCETFNEDDSEVGRAGHNMRKFFEKRW 97 (97)
T ss_pred HHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHhC
Confidence 4566777777777766443321 3467899999998764
No 98
>cd05137 RasGAP_CLA2_BUD2 CLA2/BUD2 functions as a GTPase-activating protein (GAP) for BUD1/RSR1 and is necessary for proper bud-site selection in yeast. BUD2 has sequence similarity to the catalytic domain of RasGAPs, and stimulates the hydrolysis of BUD1-GTP to BUD1-GDP. Elimination of Bud2p activity by mutation causes a random budding pattern with no growth defect. Overproduction of Bud2p also alters the budding pattern.
Probab=22.97 E-value=1.5e+02 Score=23.35 Aligned_cols=36 Identities=11% Similarity=0.077 Sum_probs=25.6
Q ss_pred HHHHHHHHHhhhcCCCCHHHHHHHHHHhhcccCCCce
Q 034366 20 QAMASASQALDLYDVVDSISIATHIKKEFDKIYGGGW 56 (96)
Q Consensus 20 ~~~~~~~~a~~~~~~~~~~~iA~~IK~~lD~~yg~~W 56 (96)
++++.+...++.++.+ ++.+++.|++.+.++|+..+
T Consensus 207 ~~~~~I~~S~~~~P~~-lR~i~~~lr~~v~~kfpd~~ 242 (395)
T cd05137 207 EIWKRIANTSNDLPQE-IRHILKYIRAKLEDRYGDFL 242 (395)
T ss_pred HHHHHHHHHHHhCCHH-HHHHHHHHHHHHHHHCCCch
Confidence 3444444455555543 58999999999999999765
No 99
>PRK06683 hypothetical protein; Provisional
Probab=21.98 E-value=1.7e+02 Score=17.65 Aligned_cols=24 Identities=17% Similarity=0.173 Sum_probs=20.2
Q ss_pred CCceEEeCCCCHHHHHHHHHHHHH
Q 034366 4 GKASIKETDMPVKMQVQAMASASQ 27 (96)
Q Consensus 4 ~~~~I~~~dM~~~~q~~~~~~~~~ 27 (96)
.+.+|+..|-++...+++.+.|..
T Consensus 28 aklViiA~Da~~~~~~~i~~~~~~ 51 (82)
T PRK06683 28 VKEVVIAEDADMRLTHVIIRTALQ 51 (82)
T ss_pred eeEEEEECCCCHHHHHHHHHHHHh
Confidence 467889999999999998887765
No 100
>TIGR03228 anthran_1_2_A anthranilate 1,2-dioxygenase, large subunit. Anthranilate (2-aminobenzoate) is an intermediate of tryptophan (Trp) biosynthesis and degradation. Members of this family are the large subunit of anthranilate 1,2-dioxygenase, which acts in Trp degradation by converting anthranilate to catechol. Closely related paralogs typically are the benzoate 1,2-dioxygenase large subunit, among the larger set of ring-hydroxylating dioxygenases.
Probab=21.90 E-value=2e+02 Score=22.94 Aligned_cols=45 Identities=9% Similarity=0.238 Sum_probs=32.9
Q ss_pred HHHhhcccCCCceEEEEeCccceeEEEccCcEEEEEeCCEEEEEEecCC
Q 034366 44 IKKEFDKIYGGGWQCVVGSNFGCYFTHKHGTFIYFSLETLNFLIFKAAI 92 (96)
Q Consensus 44 IK~~lD~~yg~~WhciVG~~Fgs~vt~~~~~~i~F~~~~~~illfkt~~ 92 (96)
...++++-|+++|+.|-=. +.+ -+++-|+-+.+++.-++|+|..+
T Consensus 27 f~~E~~~IF~~~W~~v~h~---sel-p~~GDy~t~~ig~~pviv~R~~d 71 (438)
T TIGR03228 27 FDLEMELIFEKNWIYACHE---SEL-PNNHDFVTVRAGRQPMIVTRDGK 71 (438)
T ss_pred HHHHHHHHHhhCCEEEEEH---HHC-CCCCCeEEEEECCeEEEEEECCC
Confidence 4467777789999987432 222 24578888999999999998654
No 101
>cd00404 Aconitase_swivel Aconitase swivel domain. Aconitase (aconitate hydratase) catalyzes the reversible isomerization of citrate and isocitrate as part of the TCA cycle. This is the aconitase swivel domain, which undergoes swivelling conformational change in the enzyme mechanism. The aconitase family contains the following proteins: - Iron-responsive element binding protein (IRE-BP). IRE-BP is a cytosolic protein that binds to iron-responsive elements (IREs). IREs are stem-loop structures found in the 5'UTR of ferritin, and delta aminolevulinic acid synthase mRNAs, and in the 3'UTR of transferrin receptor mRNA. IRE-BP also express aconitase activity. - 3-isopropylmalate dehydratase (isopropylmalate isomerase), the enzyme that catalyzes the second step in the biosynthesis of leucine. - Homoaconitase (homoaconitate hydratase), an enzyme that participates in the alpha-aminoadipate pathway of lysine biosynthesis and that converts cis-homoaconitate into homoisocitric acid.
Probab=21.68 E-value=66 Score=19.99 Aligned_cols=15 Identities=13% Similarity=0.062 Sum_probs=10.6
Q ss_pred ceEEEEeCccceeEE
Q 034366 55 GWQCVVGSNFGCYFT 69 (96)
Q Consensus 55 ~WhciVG~~Fgs~vt 69 (96)
.|-+|.|++||+==+
T Consensus 16 ~~iiVaG~nfG~GSS 30 (88)
T cd00404 16 PGVVIGDENYGTGSS 30 (88)
T ss_pred CEEEEecCCeecCCC
Confidence 676777779986444
No 102
>PF11841 DUF3361: Domain of unknown function (DUF3361)
Probab=21.65 E-value=2.2e+02 Score=19.66 Aligned_cols=34 Identities=18% Similarity=0.062 Sum_probs=26.6
Q ss_pred CCCHHHHHHHHHHHHHHhhhcCCCCHHHHHHHHH
Q 034366 12 DMPVKMQVQAMASASQALDLYDVVDSISIATHIK 45 (96)
Q Consensus 12 dM~~~~q~~~~~~~~~a~~~~~~~~~~~iA~~IK 45 (96)
+.++++|..++......+.+-+..+-+++|+.|-
T Consensus 113 ~~~~~iq~naiaLinAL~~kA~~~~r~~i~~~l~ 146 (160)
T PF11841_consen 113 VSNQEIQTNAIALINALFLKADDSKRKEIAETLS 146 (160)
T ss_pred cCCHHHHHHHHHHHHHHHhcCChHHHHHHHHHHH
Confidence 3688999999999999999887654457776654
No 103
>PF01726 LexA_DNA_bind: LexA DNA binding domain; InterPro: IPR006199 This is the DNA binding domain of the LexA SOS regulon repressor which prevents expression of DNA repair proteins in bacteria. The aligned region contains a variant form of the helix-turn-helix DNA binding motif []. This domain usually at the N terminus is found associated with IPR006198 from INTERPRO the auto-proteolytic domain of LexA 3.4.21.88 from EC.; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 1LEA_A 1JHH_A 3JSP_A 1JHF_A 3JSO_B 1LEB_A 3K2Z_A.
Probab=21.57 E-value=1.7e+02 Score=16.83 Aligned_cols=30 Identities=10% Similarity=0.051 Sum_probs=20.0
Q ss_pred HHHHHHHHHHHHHHhhhcCC-CCHHHHHHHH
Q 034366 15 VKMQVQAMASASQALDLYDV-VDSISIATHI 44 (96)
Q Consensus 15 ~~~q~~~~~~~~~a~~~~~~-~~~~~iA~~I 44 (96)
.+.|.+|++.+.+-++++.- -..+|||+.+
T Consensus 5 T~rQ~~vL~~I~~~~~~~G~~Pt~rEIa~~~ 35 (65)
T PF01726_consen 5 TERQKEVLEFIREYIEENGYPPTVREIAEAL 35 (65)
T ss_dssp -HHHHHHHHHHHHHHHHHSS---HHHHHHHH
T ss_pred CHHHHHHHHHHHHHHHHcCCCCCHHHHHHHh
Confidence 36788999998888887541 1237887654
No 104
>PF15586 Imm47: Immunity protein 47
Probab=21.41 E-value=2.3e+02 Score=18.33 Aligned_cols=33 Identities=12% Similarity=0.136 Sum_probs=22.9
Q ss_pred HHHHHHHHHhhhcCCCCHHHHHHHHHHhhcccC
Q 034366 20 QAMASASQALDLYDVVDSISIATHIKKEFDKIY 52 (96)
Q Consensus 20 ~~~~~~~~a~~~~~~~~~~~iA~~IK~~lD~~y 52 (96)
.|.+...+-+.++..++..++|+.|.+.+.=.|
T Consensus 80 ~I~~~i~~~i~~c~~~~W~~~~~kLsr~f~WEf 112 (116)
T PF15586_consen 80 EIKKTIERIIESCEGDDWDEIAEKLSRYFAWEF 112 (116)
T ss_pred HHHHHHHHHHHHccCCCHHHHHHHHHHheeeec
Confidence 445555566666766667899999998876444
No 105
>PRK12414 putative aminotransferase; Provisional
Probab=21.36 E-value=3.5e+02 Score=20.21 Aligned_cols=49 Identities=8% Similarity=0.157 Sum_probs=24.6
Q ss_pred CHHHHHHHHHHHHHHhhhcCC-CCHHHHHHHHHHhhcccCC----CceEEEEeC
Q 034366 14 PVKMQVQAMASASQALDLYDV-VDSISIATHIKKEFDKIYG----GGWQCVVGS 62 (96)
Q Consensus 14 ~~~~q~~~~~~~~~a~~~~~~-~~~~~iA~~IK~~lD~~yg----~~WhciVG~ 62 (96)
++++++.+.+.+......|.. ....++-+.|.++|.+.+| +..+++++.
T Consensus 44 ~~~~~~~~~~~~~~~~~~Y~~~~G~~~lr~~ia~~l~~~~g~~~~~~~~i~it~ 97 (384)
T PRK12414 44 DPALVEGVARAMRDGHNQYAPMAGIAALREALAEKTERLYGARYDPASEVTVIA 97 (384)
T ss_pred CHHHHHHHHHHHHhCCCCcCCCCCcHHHHHHHHHHHHHHhCCCCCCCCcEEEEC
Confidence 566666555544433345643 2123443444444444444 457888876
No 106
>PF08594 UPF0300: Uncharacterised protein family (UPF0300); InterPro: IPR013903 This entry of proteins appear to be specific to Schizosaccharomyces pombe (Fission yeast).
Probab=21.34 E-value=3.1e+02 Score=19.92 Aligned_cols=19 Identities=21% Similarity=0.581 Sum_probs=16.1
Q ss_pred ceEEEEeCccceeEEEccC
Q 034366 55 GWQCVVGSNFGCYFTHKHG 73 (96)
Q Consensus 55 ~WhciVG~~Fgs~vt~~~~ 73 (96)
.|=||+-++|-|++.-+..
T Consensus 95 ~W~~I~~k~F~c~I~l~~~ 113 (215)
T PF08594_consen 95 SWIAICSKNFMCNIHLDQP 113 (215)
T ss_pred cEEEEecCcceEEEEecCC
Confidence 8999999999999875544
No 107
>TIGR00171 leuD 3-isopropylmalate dehydratase, small subunit. Several pairs of archaeal proteins resemble the leuC and leuD pair in length and sequence but even more closely resemble the respective domains of homoaconitase, and their identity is uncertain. The candidate archaeal leuD proteins are not included in the seed alignment for this model and score below the trusted cutoff.
Probab=21.29 E-value=43 Score=23.69 Aligned_cols=13 Identities=31% Similarity=0.547 Sum_probs=9.2
Q ss_pred EEEEeCccceeEE
Q 034366 57 QCVVGSNFGCYFT 69 (96)
Q Consensus 57 hciVG~~Fgs~vt 69 (96)
=.|.|+||||=-|
T Consensus 72 IlVaG~NFGcGSS 84 (188)
T TIGR00171 72 ILLARENFGCGSS 84 (188)
T ss_pred EEEcCCcccCCCc
Confidence 3677789998544
No 108
>smart00764 Citrate_ly_lig Citrate lyase ligase C-terminal domain. Proteins of this family contain the C-terminal domain of citrate lyase ligase EC:6.2.1.22.
Probab=21.26 E-value=2.1e+02 Score=19.72 Aligned_cols=10 Identities=20% Similarity=-0.094 Sum_probs=7.4
Q ss_pred eEEEEeC--ccc
Q 034366 56 WQCVVGS--NFG 65 (96)
Q Consensus 56 WhciVG~--~Fg 65 (96)
=++|||. .||
T Consensus 109 ~~ivvG~df~FG 120 (182)
T smart00764 109 THRYVGEEPFSP 120 (182)
T ss_pred eEEEEcCCCCCC
Confidence 3899998 455
No 109
>KOG0760 consensus Mitochondrial carrier protein MRS3/4 [Energy production and conversion]
Probab=21.23 E-value=24 Score=26.77 Aligned_cols=19 Identities=21% Similarity=0.685 Sum_probs=13.7
Q ss_pred HHHHHhh---cccCCCceEEEE
Q 034366 42 THIKKEF---DKIYGGGWQCVV 60 (96)
Q Consensus 42 ~~IK~~l---D~~yg~~WhciV 60 (96)
+-||+.| .+.|++.|+||=
T Consensus 133 dvVKQR~Qm~~~~y~sv~~ci~ 154 (302)
T KOG0760|consen 133 DVVKQRMQMYNSPYKSVWDCIR 154 (302)
T ss_pred HHHHHHHhcccCCCccHHHHHH
Confidence 4466655 467999999983
No 110
>cd05506 Bromo_plant1 Bromodomain, uncharacterized subfamily specific to plants. Might function as a global transcription factor. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
Probab=21.02 E-value=2.1e+02 Score=17.32 Aligned_cols=38 Identities=13% Similarity=0.166 Sum_probs=25.6
Q ss_pred HHHHHHHHHHHHHHhhhcCCC-CHHHHHHHHHHhhcccC
Q 034366 15 VKMQVQAMASASQALDLYDVV-DSISIATHIKKEFDKIY 52 (96)
Q Consensus 15 ~~~q~~~~~~~~~a~~~~~~~-~~~~iA~~IK~~lD~~y 52 (96)
.+...++..+...|..-++.. ..-.+|..|++.|++++
T Consensus 61 ~ef~~D~~li~~Na~~yn~~~s~i~~~a~~l~~~fe~~w 99 (99)
T cd05506 61 EEFAADVRLTFANAMRYNPPGNDVHTMAKELLKIFETRW 99 (99)
T ss_pred HHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHhC
Confidence 456677777777776643332 24678999999998764
No 111
>PRK10667 Hha toxicity attenuator; Provisional
Probab=20.95 E-value=1.4e+02 Score=19.75 Aligned_cols=32 Identities=9% Similarity=0.094 Sum_probs=21.0
Q ss_pred HHHHHHHHHHhhhc----CCCCHHHHHHHHHHhhcccC
Q 034366 19 VQAMASASQALDLY----DVVDSISIATHIKKEFDKIY 52 (96)
Q Consensus 19 ~~~~~~~~~a~~~~----~~~~~~~iA~~IK~~lD~~y 52 (96)
++.++-+..-.-.| +.+ .++++.|-+.||.+|
T Consensus 49 NeLIEHIa~f~~~fKIKYp~~--~~l~~~ideYLDeTy 84 (122)
T PRK10667 49 NELIEHIATFALNFKIKYPED--SKLIEQIDEYLDDTY 84 (122)
T ss_pred HHHHHHHHHHHHHhhccCCcH--hhHHHHHHHHHHHHH
Confidence 44444444333334 444 789999999999986
No 112
>PRK13888 conjugal transfer protein TrbN; Provisional
Probab=20.55 E-value=32 Score=24.84 Aligned_cols=19 Identities=16% Similarity=0.237 Sum_probs=16.1
Q ss_pred HHHHHHhhcccCCCceEEE
Q 034366 41 ATHIKKEFDKIYGGGWQCV 59 (96)
Q Consensus 41 A~~IK~~lD~~yg~~Whci 59 (96)
|-.|++.|++++|-.|.+|
T Consensus 91 AWILr~~i~~~~G~~W~AV 109 (206)
T PRK13888 91 AWRLRMHIRNDKGDLWTKA 109 (206)
T ss_pred HHHHHHHHHHhhCchHHHH
Confidence 4567899998899999986
No 113
>cd02680 MIT_calpain7_2 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in the nuclear thiol protease PalBH. The molecular function of the MIT domain is unclear.
Probab=20.29 E-value=2.1e+02 Score=17.12 Aligned_cols=10 Identities=30% Similarity=0.474 Sum_probs=6.0
Q ss_pred HHHHHHhhcc
Q 034366 41 ATHIKKEFDK 50 (96)
Q Consensus 41 A~~IK~~lD~ 50 (96)
|+.||..|++
T Consensus 63 AE~Lk~~l~~ 72 (75)
T cd02680 63 AEALKESMSK 72 (75)
T ss_pred HHHHHHHhcc
Confidence 5666666653
No 114
>PF11307 DUF3109: Protein of unknown function (DUF3109); InterPro: IPR021458 This bacterial family of proteins has no known function.
Probab=20.25 E-value=31 Score=24.45 Aligned_cols=18 Identities=17% Similarity=0.576 Sum_probs=16.6
Q ss_pred HHHHHHHHhhcccCCCce
Q 034366 39 SIATHIKKEFDKIYGGGW 56 (96)
Q Consensus 39 ~iA~~IK~~lD~~yg~~W 56 (96)
-+-++||.-|-++||..|
T Consensus 159 pvy~Flk~pLir~fG~~w 176 (183)
T PF11307_consen 159 PVYKFLKEPLIRKFGEEW 176 (183)
T ss_pred cHHHHhHHHHHHHHhHHH
Confidence 478999999999999988
No 115
>cd01580 AcnA_IRP_Swivel Aconitase A swivel domain. This is the major form of the TCA cycle enzyme aconitate hydratase, also known as aconitase and citrate hydro-lyase. It includes bacterial and archaeal aconitase A, and the eukaryotic cytosolic form of aconitase. This group also includes sequences that have been shown to act as an iron-responsive element (IRE) binding protein in animals and may have the same role in other eukaryotes. This is the aconitase-like swivel domain, which is believed to undergo swivelling conformational change in the enzyme mechanism.
Probab=20.21 E-value=60 Score=22.76 Aligned_cols=18 Identities=17% Similarity=0.219 Sum_probs=14.5
Q ss_pred CCCceEEEEeCccceeEE
Q 034366 52 YGGGWQCVVGSNFGCYFT 69 (96)
Q Consensus 52 yg~~WhciVG~~Fgs~vt 69 (96)
-|-.|-+|.|++||+=-+
T Consensus 94 ~g~plIIvaG~nfG~GSS 111 (171)
T cd01580 94 EGVPLVILAGKEYGSGSS 111 (171)
T ss_pred cCCcEEEEccCcccCCCc
Confidence 378999999999996443
No 116
>PRK01641 leuD isopropylmalate isomerase small subunit; Provisional
Probab=20.14 E-value=47 Score=23.71 Aligned_cols=33 Identities=21% Similarity=0.439 Sum_probs=19.8
Q ss_pred HHHHHHHHHhhc---------------ccCCCceEEEEeCccceeEEE
Q 034366 38 ISIATHIKKEFD---------------KIYGGGWQCVVGSNFGCYFTH 70 (96)
Q Consensus 38 ~~iA~~IK~~lD---------------~~yg~~WhciVG~~Fgs~vt~ 70 (96)
.++++++-..++ .+|.+.==+|.|+||||=-|.
T Consensus 36 ~~l~~~~f~~~r~~~~~~~~p~F~ln~~~~~~~~IlVaG~NFGcGSSR 83 (200)
T PRK01641 36 TGFGKGLFDDWRYLDDGQPNPDFVLNQPRYQGASILLAGDNFGCGSSR 83 (200)
T ss_pred HHHHHhhhccccccccCCCCCCccccccccCCCeEEEcCCcccCCCcH
Confidence 467766665543 334444456777899975443
No 117
>PF10045 DUF2280: Uncharacterized conserved protein (DUF2280); InterPro: IPR018738 This entry is represented by Burkholderia phage Bups phi1, Orf2.36. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.
Probab=20.10 E-value=1.6e+02 Score=18.98 Aligned_cols=28 Identities=29% Similarity=0.399 Sum_probs=20.4
Q ss_pred HHHHHHHHHhhhcCCCCHHHHHHHHHHhhc
Q 034366 20 QAMASASQALDLYDVVDSISIATHIKKEFD 49 (96)
Q Consensus 20 ~~~~~~~~a~~~~~~~~~~~iA~~IK~~lD 49 (96)
++...+.++|-.|++- .++|+.+|++|.
T Consensus 7 ~vK~FIVQ~LAcfdTP--s~v~~aVk~eFg 34 (104)
T PF10045_consen 7 EVKAFIVQSLACFDTP--SEVAEAVKEEFG 34 (104)
T ss_pred HHHHHHHHHHHhhCCH--HHHHHHHHHHhC
Confidence 3444556777878765 789999998885
No 118
>PF02881 SRP54_N: SRP54-type protein, helical bundle domain; InterPro: IPR013822 The signal recognition particle (SRP) is a multimeric protein, which along with its conjugate receptor (SR), is involved in targeting secretory proteins to the rough endoplasmic reticulum (RER) membrane in eukaryotes, or to the plasma membrane in prokaryotes [, ]. SRP recognises the signal sequence of the nascent polypeptide on the ribosome, retards its elongation, and docks the SRP-ribosome-polypeptide complex to the RER membrane via the SR receptor. Eukaryotic SRP consists of six polypeptides (SRP9, SRP14, SRP19, SRP54, SRP68 and SRP72) and a single 300 nucleotide 7S RNA molecule. The RNA component catalyses the interaction of SRP with its SR receptor []. In higher eukaryotes, the SRP complex consists of the Alu domain and the S domain linked by the SRP RNA. The Alu domain consists of a heterodimer of SRP9 and SRP14 bound to the 5' and 3' terminal sequences of SRP RNA. This domain is necessary for retarding the elongation of the nascent polypeptide chain, which gives SRP time to dock the ribosome-polypeptide complex to the RER membrane. In archaea, the SRP complex contains 7S RNA like its eukaryotic counterpart, yet only includes two of the six protein subunits found in the eukarytic complex: SRP19 and SRP54 []. This entry represents the N-terminal helical bundle domain of the 54 kDa SRP54 component, a GTP-binding protein that interacts with the signal sequence when it emerges from the ribosome. SRP54 of the signal recognition particle has a three-domain structure: an N-terminal helical bundle domain, a GTPase domain, and the M-domain that binds the 7s RNA and also binds the signal sequence. The extreme C-terminal region is glycine-rich and lower in complexity and poorly conserved between species. These proteins include Escherichia coli and Bacillus subtilis ffh protein (P48), which seems to be the prokaryotic counterpart of SRP54; signal recognition particle receptor alpha subunit (docking protein), an integral membrane GTP-binding protein which ensures, in conjunction with SRP, the correct targeting of nascent secretory proteins to the endoplasmic reticulum membrane; bacterial FtsY protein, which is believed to play a similar role to that of the docking protein in eukaryotes; the pilA protein from Neisseria gonorrhoeae, the homologue of ftsY; and bacterial flagellar biosynthesis protein flhF.; GO: 0005525 GTP binding, 0006614 SRP-dependent cotranslational protein targeting to membrane; PDB: 1J8M_F 1J8Y_F 2J37_W 2OG2_A 3B9Q_A 2V3C_C 3NDB_B 1ZU5_B 1ZU4_A 1WGW_A ....
Probab=20.09 E-value=1.9e+02 Score=16.42 Aligned_cols=21 Identities=14% Similarity=0.319 Sum_probs=11.7
Q ss_pred eCCCCHHHHHHHHHHHHH-Hhh
Q 034366 10 ETDMPVKMQVQAMASASQ-ALD 30 (96)
Q Consensus 10 ~~dM~~~~q~~~~~~~~~-a~~ 30 (96)
.+|+..+.-.++++..++ .+.
T Consensus 36 ~aDVg~~~a~~i~~~ik~~~~~ 57 (75)
T PF02881_consen 36 EADVGVEVAEKIIENIKKKLIK 57 (75)
T ss_dssp HTTTSHHHHHHHHHHHHHHHHC
T ss_pred HcCcCHHHHHHHHHHHHHHHhc
Confidence 455556666666655555 443
No 119
>cd03548 Rieske_RO_Alpha_OMO_CARDO Rieske non-heme iron oxygenase (RO) family, 2-Oxoquinoline 8-monooxygenase (OMO) and Carbazole 1,9a-dioxygenase (CARDO) subfamily, N-terminal Rieske domain of the oxygenase alpha subunit; ROs comprise a large class of aromatic ring-hydroxylating dioxygenases that enable microorganisms to tolerate and utilize aromatic compounds for growth. The oxygenase alpha subunit contains an N-terminal Rieske domain with an [2Fe-2S] cluster and a C-terminal catalytic domain with a mononuclear Fe(II) binding site. The Rieske [2Fe-2S] cluster accepts electrons from a reductase or ferredoxin component and transfers them to the mononuclear iron for catalysis. OMO catalyzes the NADH-dependent oxidation of the N-heterocyclic aromatic compound 2-oxoquinoline to 8-hydroxy-2-oxoquinoline, the second step in the bacterial degradation of quinoline. OMO consists of a reductase component (OMR) and an oxygenase component (OMO) that together function to shuttle electrons from the
Probab=20.03 E-value=2.4e+02 Score=18.21 Aligned_cols=34 Identities=15% Similarity=0.127 Sum_probs=25.4
Q ss_pred cCCCceEEEEeC-ccceeEEEccCcEEEEEeCCEEEEEEec
Q 034366 51 IYGGGWQCVVGS-NFGCYFTHKHGTFIYFSLETLNFLIFKA 90 (96)
Q Consensus 51 ~yg~~WhciVG~-~Fgs~vt~~~~~~i~F~~~~~~illfkt 90 (96)
.|...|+.|.-. +. .++.+.-+.+++..++|+|.
T Consensus 10 ~~~~~W~~v~~~~el------~~g~~~~~~~~g~~i~l~r~ 44 (136)
T cd03548 10 GFRNHWYPALFSHEL------EEGEPKGIQLCGEPILLRRV 44 (136)
T ss_pred CcccCcEEEEEHHHC------CCCCeEEEEECCcEEEEEec
Confidence 567889987653 33 24778888899999999985
Done!