Query psy732
Match_columns 119
No_of_seqs 162 out of 1094
Neff 8.1
Searched_HMMs 46136
Date Fri Aug 16 23:47:04 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy732.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/732hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1232|consensus 99.9 7.6E-25 1.7E-29 166.1 8.3 113 3-116 329-443 (511)
2 PLN02805 D-lactate dehydrogena 99.9 1.3E-21 2.8E-26 156.8 12.4 111 3-116 365-478 (555)
3 PF02913 FAD-oxidase_C: FAD li 99.8 3.1E-21 6.7E-26 138.6 4.0 114 3-117 61-179 (248)
4 PRK11230 glycolate oxidase sub 99.8 3.7E-19 8.1E-24 141.4 12.1 111 3-116 290-401 (499)
5 TIGR00387 glcD glycolate oxida 99.8 1.7E-18 3.7E-23 134.8 11.8 111 3-116 233-344 (413)
6 COG0277 GlcD FAD/FMN-containin 99.4 6.2E-12 1.3E-16 98.2 11.4 113 3-116 270-385 (459)
7 KOG1231|consensus 98.4 4.8E-09 1E-13 81.9 -5.1 109 5-115 312-427 (505)
8 KOG4165|consensus 85.2 3.6 7.8E-05 32.1 6.1 40 68-115 192-231 (433)
9 PRK10204 hypothetical protein; 79.4 7.2 0.00016 21.8 4.3 47 41-91 5-53 (55)
10 COG0014 ProA Gamma-glutamyl ph 78.6 12 0.00026 29.8 6.8 37 69-113 198-234 (417)
11 PF09330 Lact-deh-memb: D-lact 67.1 42 0.0009 25.5 7.1 105 2-113 113-225 (291)
12 KOG1262|consensus 65.1 4.1 8.9E-05 32.5 1.6 46 66-111 384-431 (543)
13 PF03871 RNA_pol_Rpb5_N: RNA p 65.1 1.8 3.8E-05 27.1 -0.3 25 41-65 3-27 (93)
14 cd05397 NT_Pol-beta-like Nucle 64.7 14 0.0003 20.1 3.3 31 79-109 2-33 (49)
15 PRK10858 nitrogen regulatory p 62.0 15 0.00033 23.7 3.7 40 64-112 57-96 (112)
16 PF03876 SHS2_Rpb7-N: SHS2 dom 60.2 17 0.00036 20.9 3.3 44 70-113 4-65 (70)
17 PF10832 DUF2559: Protein of u 59.3 30 0.00066 19.5 4.0 18 41-58 4-21 (54)
18 COG0347 GlnK Nitrogen regulato 59.2 18 0.00038 23.6 3.5 40 64-112 57-96 (112)
19 PLN00107 FAD-dependent oxidore 55.0 19 0.00042 26.9 3.6 25 67-91 64-88 (257)
20 PF04030 ALO: D-arabinono-1,4- 54.5 8.9 0.00019 27.9 1.8 26 66-91 126-151 (259)
21 PLN03111 DNA-directed RNA poly 52.3 13 0.00028 26.8 2.3 41 40-80 3-44 (206)
22 KOG3218|consensus 50.7 12 0.00027 26.8 1.9 42 41-82 5-47 (208)
23 cd07399 MPP_YvnB Bacillus subt 49.3 15 0.00032 26.1 2.2 21 82-102 138-158 (214)
24 PF14792 DNA_pol_B_palm: DNA p 48.7 31 0.00066 22.1 3.4 30 72-101 2-32 (112)
25 PRK10665 nitrogen regulatory p 48.5 36 0.00078 22.0 3.7 40 64-112 57-96 (112)
26 PF01764 Lipase_3: Lipase (cla 48.5 34 0.00074 21.8 3.7 27 75-101 45-71 (140)
27 PRK11628 transcriptional regul 47.8 36 0.00079 21.8 3.6 31 82-113 5-43 (105)
28 PF02244 Propep_M14: Carboxype 46.4 30 0.00066 19.9 2.9 20 67-86 35-54 (74)
29 cd01984 AANH_like Adenine nucl 46.3 24 0.00052 20.8 2.5 32 79-111 35-66 (86)
30 PF07597 DUF1560: Protein of u 44.7 15 0.00033 19.6 1.2 31 47-79 15-48 (49)
31 COG2885 OmpA Outer membrane pr 44.5 34 0.00074 23.7 3.4 37 66-102 87-123 (190)
32 PRK11148 cyclic 3',5'-adenosin 41.7 25 0.00055 25.8 2.5 21 82-102 184-204 (275)
33 TIGR02802 Pal_lipo peptidoglyc 41.7 51 0.0011 20.3 3.6 33 72-104 10-42 (104)
34 cd00655 RNAP_Rpb7_N_like RNAP_ 41.1 80 0.0017 18.9 4.6 47 66-113 6-71 (80)
35 PF03992 ABM: Antibiotic biosy 40.2 58 0.0013 18.2 3.5 35 67-101 5-43 (78)
36 PRK11183 D-lactate dehydrogena 40.0 59 0.0013 27.1 4.5 104 2-113 383-494 (564)
37 PF03432 Relaxase: Relaxase/Mo 39.9 56 0.0012 23.1 4.0 42 72-116 69-112 (242)
38 KOG2355|consensus 39.7 1.1E+02 0.0025 22.7 5.5 40 66-106 170-211 (291)
39 TIGR00032 argG argininosuccina 38.7 39 0.00084 26.7 3.2 28 79-107 93-120 (394)
40 PF03568 Peptidase_C50: Peptid 38.3 73 0.0016 24.9 4.7 55 37-91 59-124 (383)
41 PRK10802 peptidoglycan-associa 37.8 57 0.0012 22.6 3.6 38 66-103 73-110 (173)
42 PRK10486 autoinducer-2 (AI-2) 37.3 80 0.0017 19.4 4.0 39 69-116 7-45 (96)
43 PF09186 DUF1949: Domain of un 37.2 62 0.0013 17.3 3.1 24 65-88 30-53 (56)
44 cd07392 MPP_PAE1087 Pyrobaculu 37.0 31 0.00067 23.1 2.2 20 82-102 151-170 (188)
45 TIGR03729 acc_ester putative p 36.7 27 0.00059 25.0 1.9 20 82-102 199-218 (239)
46 KOG4498|consensus 36.7 46 0.00099 23.9 3.0 49 44-100 43-91 (197)
47 COG1768 Predicted phosphohydro 36.3 33 0.0007 24.7 2.2 46 47-102 142-196 (230)
48 TIGR03751 conj_TIGR03751 conju 36.1 44 0.00096 21.9 2.6 28 79-106 61-90 (116)
49 PHA00684 hypothetical protein 36.0 39 0.00084 22.6 2.4 32 72-103 53-88 (128)
50 cd01585 AcnA_Bact Aconitase ca 35.7 72 0.0016 25.2 4.2 42 65-110 31-74 (380)
51 cd07402 MPP_GpdQ Enterobacter 35.6 36 0.00077 24.0 2.4 24 81-104 170-193 (240)
52 PRK13746 aminoglycoside resist 35.3 60 0.0013 24.2 3.6 37 72-109 5-44 (262)
53 cd07404 MPP_MS158 Microscilla 35.1 30 0.00064 23.1 1.8 24 79-103 124-147 (166)
54 cd07379 MPP_239FB Homo sapiens 34.9 32 0.00069 22.2 1.9 20 82-102 94-113 (135)
55 COG1509 KamA Lysine 2,3-aminom 34.8 2.3E+02 0.0049 22.4 6.7 105 2-110 220-337 (369)
56 PF07045 DUF1330: Protein of u 34.8 70 0.0015 18.1 3.2 26 74-100 1-26 (65)
57 PTZ00061 DNA-directed RNA poly 34.7 28 0.0006 25.2 1.7 37 43-79 3-40 (205)
58 PF11343 DUF3145: Protein of u 34.6 35 0.00076 23.6 2.1 41 66-106 99-150 (158)
59 COG4922 Uncharacterized protei 32.5 60 0.0013 21.4 2.8 34 78-111 47-80 (129)
60 PF03389 MobA_MobL: MobA/MobL 32.2 1.7E+02 0.0036 21.0 5.4 68 43-115 47-123 (216)
61 KOG1504|consensus 31.9 65 0.0014 24.5 3.2 36 68-106 165-200 (346)
62 cd05403 NT_KNTase_like Nucleot 31.9 70 0.0015 18.6 3.0 16 93-108 18-33 (93)
63 PF13380 CoA_binding_2: CoA bi 31.7 67 0.0015 20.5 3.0 33 55-87 45-78 (116)
64 cd00839 MPP_PAPs purple acid p 31.5 47 0.001 24.3 2.6 25 77-102 178-202 (294)
65 cd07403 MPP_TTHA0053 Thermus t 31.4 39 0.00085 21.9 1.9 20 82-102 81-100 (129)
66 TIGR03875 RNA_lig_partner RNA 31.3 38 0.00083 24.5 1.9 77 17-95 24-109 (206)
67 smart00206 NTR Tissue inhibito 31.1 43 0.00094 23.5 2.1 19 93-111 77-95 (172)
68 cd07400 MPP_YydB Bacillus subt 30.9 48 0.001 21.3 2.3 22 82-104 103-124 (144)
69 PRK04358 hypothetical protein; 30.6 2.2E+02 0.0047 20.8 5.8 79 16-96 27-114 (217)
70 KOG3179|consensus 30.3 61 0.0013 23.7 2.8 33 69-101 71-103 (245)
71 PF13740 ACT_6: ACT domain; PD 29.8 1.1E+02 0.0024 17.7 3.5 31 66-97 44-74 (76)
72 PF00862 Sucrose_synth: Sucros 29.1 18 0.0004 29.7 0.0 34 75-108 381-415 (550)
73 cd04871 ACT_PSP_2 ACT domains 28.8 1.4E+02 0.003 17.9 4.0 33 65-98 50-82 (84)
74 cd07395 MPP_CSTP1 Homo sapiens 28.5 66 0.0014 23.2 2.9 23 79-102 193-215 (262)
75 cd05402 NT_PAP_TUTase Nucleoti 27.3 83 0.0018 19.5 2.9 28 79-106 4-32 (114)
76 cd07398 MPP_YbbF-LpxH Escheric 27.0 77 0.0017 21.9 2.9 30 75-105 172-201 (217)
77 cd04870 ACT_PSP_1 CT domains f 26.9 1.2E+02 0.0027 17.4 3.4 32 65-97 40-72 (75)
78 TIGR03767 P_acnes_RR metalloph 26.9 61 0.0013 26.6 2.6 21 82-102 369-389 (496)
79 PF00586 AIRS: AIR synthase re 26.6 1.5E+02 0.0032 17.8 3.9 52 48-101 34-95 (96)
80 PF00487 FA_desaturase: Fatty 26.6 59 0.0013 22.4 2.2 30 69-99 227-256 (257)
81 TIGR03759 conj_TIGR03759 integ 26.5 1.2E+02 0.0027 21.8 3.8 38 27-64 130-169 (200)
82 KOG3125|consensus 26.0 1.1E+02 0.0023 22.4 3.4 27 77-104 116-142 (234)
83 PRK10510 putative outer membra 25.9 1.2E+02 0.0025 21.9 3.7 35 69-103 119-153 (219)
84 PF09370 TIM-br_sig_trns: TIM- 25.8 3E+02 0.0064 20.8 5.8 75 23-100 142-221 (268)
85 cd07378 MPP_ACP5 Homo sapiens 25.7 75 0.0016 23.0 2.8 24 78-102 187-210 (277)
86 PF07892 DUF1667: Protein of u 25.4 64 0.0014 19.8 1.9 26 65-90 33-58 (82)
87 PF02769 AIRS_C: AIR synthase 25.3 1.3E+02 0.0029 19.4 3.7 33 70-107 114-147 (153)
88 TIGR01677 pln_FAD_oxido plant- 25.2 1.2E+02 0.0026 25.2 4.1 23 69-91 373-395 (557)
89 TIGR00106 uncharacterized prot 25.1 1.1E+02 0.0023 19.2 3.0 25 72-97 8-36 (97)
90 PF14304 CSTF_C: Transcription 25.0 44 0.00096 18.2 1.0 18 39-56 24-41 (46)
91 PLN02282 phosphoglycerate kina 24.8 1E+02 0.0022 24.6 3.4 36 66-103 22-66 (401)
92 PF01510 Amidase_2: N-acetylmu 24.5 54 0.0012 20.8 1.6 33 69-102 85-119 (132)
93 PRK05731 thiamine monophosphat 24.4 1.4E+02 0.0029 22.4 4.0 33 69-106 267-299 (318)
94 PF13692 Glyco_trans_1_4: Glyc 24.4 1.3E+02 0.0029 18.6 3.5 30 73-102 13-43 (135)
95 cd00519 Lipase_3 Lipase (class 24.3 1.1E+02 0.0024 21.5 3.4 21 81-101 115-135 (229)
96 cd00318 Phosphoglycerate_kinas 24.3 1E+02 0.0023 24.5 3.4 36 66-103 13-56 (397)
97 PLN02934 triacylglycerol lipas 24.2 1E+02 0.0022 25.5 3.4 27 75-101 302-328 (515)
98 PF00162 PGK: Phosphoglycerate 24.2 99 0.0021 24.5 3.2 37 66-104 14-58 (384)
99 PRK13820 argininosuccinate syn 24.0 99 0.0021 24.5 3.2 29 81-110 98-126 (394)
100 PF10113 Fibrillarin_2: Fibril 23.9 83 0.0018 25.5 2.8 34 67-106 198-232 (505)
101 cd01839 SGNH_arylesterase_like 23.8 67 0.0014 22.1 2.1 33 76-110 153-190 (208)
102 KOG3939|consensus 23.8 99 0.0022 23.4 3.0 46 67-115 257-303 (312)
103 KOG4569|consensus 23.7 97 0.0021 23.8 3.1 25 78-102 155-179 (336)
104 COG3427 Carbon monoxide dehydr 23.5 76 0.0017 21.7 2.2 36 67-104 6-41 (146)
105 PF02257 RFX_DNA_binding: RFX 23.4 50 0.0011 20.4 1.2 16 66-81 18-33 (85)
106 COG0850 MinC Septum formation 23.3 52 0.0011 23.9 1.4 19 92-110 128-153 (219)
107 COG0011 Uncharacterized conser 23.2 1.3E+02 0.0028 19.2 3.1 22 76-98 18-39 (100)
108 PF00149 Metallophos: Calcineu 23.1 80 0.0017 19.5 2.3 22 80-102 176-197 (200)
109 PTZ00422 glideosome-associated 23.0 90 0.002 24.8 2.8 23 78-101 234-256 (394)
110 smart00712 PUR DNA/RNA-binding 22.8 1.1E+02 0.0024 17.6 2.5 25 67-91 38-62 (63)
111 cd07396 MPP_Nbla03831 Homo sap 22.5 74 0.0016 23.2 2.2 22 82-103 205-226 (267)
112 PF00543 P-II: Nitrogen regula 22.4 1.3E+02 0.0029 18.6 3.1 36 65-109 55-90 (102)
113 cd07388 MPP_Tt1561 Thermus the 22.3 68 0.0015 23.2 1.9 21 82-103 169-189 (224)
114 PF02579 Nitro_FeMo-Co: Dinitr 22.3 39 0.00085 20.1 0.6 21 85-106 46-66 (94)
115 TIGR03768 RPA4764 metallophosp 22.0 76 0.0016 26.0 2.2 27 82-113 388-414 (492)
116 COG1458 Predicted DNA-binding 21.9 63 0.0014 23.4 1.6 55 34-90 47-110 (221)
117 TIGR00853 pts-lac PTS system, 21.8 1.5E+02 0.0033 18.3 3.2 29 68-97 51-79 (95)
118 PRK05340 UDP-2,3-diacylglucosa 21.8 75 0.0016 22.8 2.1 20 83-103 178-197 (241)
119 cd06583 PGRP Peptidoglycan rec 21.5 1.1E+02 0.0025 19.0 2.7 31 71-102 82-114 (126)
120 TIGR03789 pdsO proteobacterial 21.3 1.6E+02 0.0034 21.7 3.6 39 66-104 139-177 (239)
121 PF13481 AAA_25: AAA domain; P 21.1 1.1E+02 0.0025 20.5 2.8 28 77-105 164-191 (193)
122 PF06183 DinI: DinI-like famil 21.1 1.2E+02 0.0026 17.6 2.5 27 72-98 1-28 (65)
123 COG0622 Predicted phosphoester 20.9 94 0.002 21.6 2.3 19 83-102 100-118 (172)
124 COG5017 Uncharacterized conser 20.8 87 0.0019 21.5 2.0 23 81-107 56-78 (161)
125 COG1938 Archaeal enzymes of AT 20.8 1.4E+02 0.0031 22.1 3.3 34 66-100 81-115 (244)
126 COG5186 PAP1 Poly(A) polymeras 20.6 91 0.002 25.0 2.4 71 43-113 211-288 (552)
127 PRK10897 phosphohistidinoprote 20.6 1.5E+02 0.0033 18.2 3.0 11 23-33 52-62 (90)
128 cd07394 MPP_Vps29 Homo sapiens 20.4 91 0.002 21.5 2.2 19 83-102 97-115 (178)
129 PF02633 Creatininase: Creatin 20.3 90 0.002 22.4 2.2 36 67-103 74-111 (237)
130 PRK13780 phosphocarrier protei 20.1 1.7E+02 0.0036 17.9 3.1 20 2-21 59-78 (88)
No 1
>KOG1232|consensus
Probab=99.91 E-value=7.6e-25 Score=166.15 Aligned_cols=113 Identities=39% Similarity=0.687 Sum_probs=104.7
Q ss_pred ceEEEecCCCchhHHHHHHHHHHHHhhcCcccceeecCCHHHHHHHHHhhhhcHHHhhcCCCceeeEeeeccchHHHHHH
Q psy732 3 PFFCCCLGSCVDHDAEKLNTFLQDGIENNIILDAVMCSEESKIQKIWPLRERIVESCLKDGYIYNYDISLSLKDFYSIIP 82 (119)
Q Consensus 3 ~LlvE~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~e~~~lW~~R~~~~~~~~~~~~~~~~DvaVP~~~l~~~i~ 82 (119)
++|||.+|++++.+++++..+++.+++++.+.|.++|+|+.+.+++|++|+.+++++.+.|..+++|+|+|+..+.+++.
T Consensus 329 yiLiETsGSn~dhD~eKl~afl~d~lek~lIsDGv~a~d~~~~~~lW~~Re~ip~a~~~~g~vyKyDvSLpL~d~Y~lvn 408 (511)
T KOG1232|consen 329 YILIETSGSNKDHDEEKLTAFLEDCLEKGLISDGVLAQDEAEAQKLWKIRESIPEALQKAGGVYKYDVSLPLEDLYNLVN 408 (511)
T ss_pred EEEEEecCCCccccHHHHHHHHHHhhhhcccccceecCCHHHHHHHHHHHhccHHHHHhcCCEEEeeccccHHHHHHHHH
Confidence 46899999999999999999999999999999999999999999999999999999999998999999999999999999
Q ss_pred HHHHHHhcCCCC--eEEeeeeeCCCCcceeeecCCC
Q psy732 83 IMKERLKDQPVV--TVCGYGHLERLKDQPVVTVCGY 116 (119)
Q Consensus 83 ~~~~~~~~~p~~--~~~~~GH~GDGNlH~~i~~~~~ 116 (119)
.+++++.+. ++ +++.|||+||||+|+||....+
T Consensus 409 ~~~eRl~~~-~l~~d~~gyGHlGDgNlHLNia~~ef 443 (511)
T KOG1232|consen 409 VMKERLGEA-ALVGDIVGYGHLGDGNLHLNIAVREF 443 (511)
T ss_pred HHHHhhhhh-hhhhcccccccccCCceeEeeeHHHH
Confidence 999999885 43 4689999999999999986544
No 2
>PLN02805 D-lactate dehydrogenase [cytochrome]
Probab=99.87 E-value=1.3e-21 Score=156.85 Aligned_cols=111 Identities=13% Similarity=0.139 Sum_probs=92.1
Q ss_pred ceEEEecCCCchhHHHHHHHHHHHHhhcCcccceeecCCHHHHHHHHHhhhhcHHHhhcCCC---ceeeEeeeccchHHH
Q psy732 3 PFFCCCLGSCVDHDAEKLNTFLQDGIENNIILDAVMCSEESKIQKIWPLRERIVESCLKDGY---IYNYDISLSLKDFYS 79 (119)
Q Consensus 3 ~LlvE~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~e~~~lW~~R~~~~~~~~~~~~---~~~~DvaVP~~~l~~ 79 (119)
.|++|++|+++ .++++.+.+.+ ++++.+..+..+++++++++++|++|+++++++....+ .+++|++||+++|++
T Consensus 365 ~Ll~e~~g~~~-~~~~~~~~~~~-i~~~~g~~~~~~a~~~~e~~~lW~~R~~~~~~~~~~~~~~~~~~~DvaVP~s~L~e 442 (555)
T PLN02805 365 TLMFEFIGTEA-YAREQTLIVQK-IASKHNGSDFVFAEEPEAKKELWKIRKEALWACFAMEPKYEAMITDVCVPLSHLAE 442 (555)
T ss_pred EEEEEEecCcH-HHHHHHHHHHH-HHHhCCCceEEEeCCHHHHHHHHHHHHHHHHHHhhcCCCCceeEEEEEEEHHHHHH
Confidence 47899999754 35555554444 44444467888999999999999999999988865432 378999999999999
Q ss_pred HHHHHHHHHhcCCCCeEEeeeeeCCCCcceeeecCCC
Q psy732 80 IIPIMKERLKDQPVVTVCGYGHLERLKDQPVVTVCGY 116 (119)
Q Consensus 80 ~i~~~~~~~~~~p~~~~~~~GH~GDGNlH~~i~~~~~ 116 (119)
++++++++++++ ++.+++|||+||||+|++++++.+
T Consensus 443 ~i~~~~~~~~~~-~~~~~~~gHaGdGnlH~~i~~~~~ 478 (555)
T PLN02805 443 LISRSKKELDAS-PLVCTVIAHAGDGNFHTIILFDPS 478 (555)
T ss_pred HHHHHHHHHHHc-CCeEEEEEEcCCCcEEEEeccCCC
Confidence 999999999999 999999999999999999987654
No 3
>PF02913 FAD-oxidase_C: FAD linked oxidases, C-terminal domain; InterPro: IPR004113 Some oxygen-dependent oxidoreductases are flavoproteins that contain a covalently bound FAD group which is attached to a histidine via an 8-alpha-(N3-histidyl)-riboflavin linkage. The region around the histidine that binds the FAD group is conserved in these enzymes (see IPR006093 from INTERPRO).; GO: 0003824 catalytic activity, 0050660 flavin adenine dinucleotide binding; PDB: 1WVE_B 1DII_B 1WVF_A 1DIQ_A 2UUU_B 2UUV_A 1W1M_A 1E8H_B 1E0Y_B 1DZN_B ....
Probab=99.83 E-value=3.1e-21 Score=138.55 Aligned_cols=114 Identities=15% Similarity=0.212 Sum_probs=93.6
Q ss_pred ceEEEecCCCchhHHHHHHHHHHHHhhcCcccceeecCCHHHHHHHHHhhhhcHHHhhcC----C-CceeeEeeeccchH
Q psy732 3 PFFCCCLGSCVDHDAEKLNTFLQDGIENNIILDAVMCSEESKIQKIWPLRERIVESCLKD----G-YIYNYDISLSLKDF 77 (119)
Q Consensus 3 ~LlvE~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~e~~~lW~~R~~~~~~~~~~----~-~~~~~DvaVP~~~l 77 (119)
.|++|++|+.++.++++++..+..+++..+..++.++.+.++.+++|+.|+.+.+.++.. + ..+++|++||++++
T Consensus 61 ~llv~~~g~~~~~~~~~~~~~i~~~~~~~~~~~~~~a~~~~~~~~~W~~R~~~~~~~~~~~~~~~~~~~~~dv~vp~~~l 140 (248)
T PF02913_consen 61 VLLVEFEGSDEEAVEEQLEAEIEEICKKYGGEDVVIADDEEEQERLWAIRRAIMPYLRDAAGRAGPVWDTEDVAVPPSRL 140 (248)
T ss_dssp EEEEECCCHHHCCHHHHHHHHHHHHHCTCTCCEEEEEHCHHCTSTHHHHHHHHCCGGGCSHCTTEEEEEEEEEESCHHHH
T ss_pred EEEEEECCCcHHHHHHHHHHHHHHHHhhcCCceeEEeCCHHHHHhhhhhhhhhcccccccccccCCceeeeeecccchhh
Confidence 578999998655566666524444666666677888899999999999999999844332 2 25899999999999
Q ss_pred HHHHHHHHHHHhcCCCCeEEeeeeeCCCCcceeeecCCCC
Q psy732 78 YSIIPIMKERLKDQPVVTVCGYGHLERLKDQPVVTVCGYG 117 (119)
Q Consensus 78 ~~~i~~~~~~~~~~p~~~~~~~GH~GDGNlH~~i~~~~~~ 117 (119)
+++++.++++++++ ++.+++|||+||||+|++++++..+
T Consensus 141 ~~~~~~~~~~~~~~-~~~~~~~gH~~~g~~h~~~~~~~~~ 179 (248)
T PF02913_consen 141 PEFLREIRALLREY-GLEVCHFGHAGDGNLHLYILFDPRD 179 (248)
T ss_dssp HHHHHHHHHHHHHC-TEEEEEEEEEEECEEEEEEEEETTS
T ss_pred hhHHHhhhhhhhhc-cccccceEEccCCeEEEEeecccch
Confidence 99999999999999 9999999999999999999986543
No 4
>PRK11230 glycolate oxidase subunit GlcD; Provisional
Probab=99.81 E-value=3.7e-19 Score=141.36 Aligned_cols=111 Identities=11% Similarity=0.095 Sum_probs=92.3
Q ss_pred ceEEEecCCCchhHHHHHHHHHHHHhhcCcccceeecCCHHHHHHHHHhhhhcHHHhhcCCC-ceeeEeeeccchHHHHH
Q psy732 3 PFFCCCLGSCVDHDAEKLNTFLQDGIENNIILDAVMCSEESKIQKIWPLRERIVESCLKDGY-IYNYDISLSLKDFYSII 81 (119)
Q Consensus 3 ~LlvE~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~e~~~lW~~R~~~~~~~~~~~~-~~~~DvaVP~~~l~~~i 81 (119)
.|++|++|+.+ .+..+++.+.+ ++...++.+..+++++++++++|+.|+..++++...++ .+.+|++||+++|++++
T Consensus 290 ~ll~e~~g~~~-~v~~~~~~l~~-~~~~~g~~~~~~a~~~~~~~~~W~~R~~~~~~~~~~~~~~~~~dv~vP~~~l~~~~ 367 (499)
T PRK11230 290 ILLCELDGVES-DVQEDCERVND-ILLKAGATDVRLAQDEAERVRFWAGRKNAFPAVGRISPDYYCMDGTIPRRELPGVL 367 (499)
T ss_pred EEEEEecCCch-HHHHHHHHHHH-HHHhcCCceEEEeCCHHHHHHHHHHHHhhHHHHHhhCCCeeEEeecCChHHHHHHH
Confidence 57899999854 45566665555 44554466778889999999999999998888876444 46889999999999999
Q ss_pred HHHHHHHhcCCCCeEEeeeeeCCCCcceeeecCCC
Q psy732 82 PIMKERLKDQPVVTVCGYGHLERLKDQPVVTVCGY 116 (119)
Q Consensus 82 ~~~~~~~~~~p~~~~~~~GH~GDGNlH~~i~~~~~ 116 (119)
+.++++++++ ++.+.+|||+||||+|++++++.+
T Consensus 368 ~~~~~~~~~~-~~~~~~~gH~GdGn~H~~i~~~~~ 401 (499)
T PRK11230 368 EGIARLSQQY-GLRVANVFHAGDGNMHPLILFDAN 401 (499)
T ss_pred HHHHHHHHHc-CCeEEEEEEeCCCcceeeecCCCC
Confidence 9999999999 999999999999999999987654
No 5
>TIGR00387 glcD glycolate oxidase, subunit GlcD. This protein, the glycolate oxidase GlcD subunit, is similar in sequence to that of several D-lactate dehydrogenases, including that of E. coli. The glycolate oxidase has been found to have some D-lactate dehydrogenase activity.
Probab=99.78 E-value=1.7e-18 Score=134.77 Aligned_cols=111 Identities=12% Similarity=0.066 Sum_probs=91.4
Q ss_pred ceEEEecCCCchhHHHHHHHHHHHHhhcCcccceeecCCHHHHHHHHHhhhhcHHHhhcCCC-ceeeEeeeccchHHHHH
Q psy732 3 PFFCCCLGSCVDHDAEKLNTFLQDGIENNIILDAVMCSEESKIQKIWPLRERIVESCLKDGY-IYNYDISLSLKDFYSII 81 (119)
Q Consensus 3 ~LlvE~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~e~~~lW~~R~~~~~~~~~~~~-~~~~DvaVP~~~l~~~i 81 (119)
.|++|++|.++ .++++++.+.+ +++..+..+...+.++++.+++|+.|+...+.+.+..+ .+++|++||+++|++++
T Consensus 233 ~l~v~~~g~~~-~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~W~~r~~~~~~~~~~~~~~~~~d~~vp~~~l~~~~ 310 (413)
T TIGR00387 233 ILLVEIDGVHE-AVERDEEKIEQ-ICRKNGAVDVQIAQDEEERALLWAGRRNAFKAASKLSPLYLIEDGTVPRSKLPEAL 310 (413)
T ss_pred EEEEEecCCcH-HHHHHHHHHHH-HHHhCCCceEEEeCCHHHHHHHHHHHHHhHHHHHhhCCCcceeEEecCHHHHHHHH
Confidence 57899999764 45666665544 44444456677888999999999999998888865433 47899999999999999
Q ss_pred HHHHHHHhcCCCCeEEeeeeeCCCCcceeeecCCC
Q psy732 82 PIMKERLKDQPVVTVCGYGHLERLKDQPVVTVCGY 116 (119)
Q Consensus 82 ~~~~~~~~~~p~~~~~~~GH~GDGNlH~~i~~~~~ 116 (119)
+.++++++++ ++..++|||+||||+|++++++..
T Consensus 311 ~~~~~~~~~~-~~~~~~~gH~g~g~lh~~~~~~~~ 344 (413)
T TIGR00387 311 RGIADIARKY-DFTIANFGHAGDGNLHPTILTDPE 344 (413)
T ss_pred HHHHHHHHHc-CCeEEEEEEecCCccccccCCCCC
Confidence 9999999999 999999999999999999987643
No 6
>COG0277 GlcD FAD/FMN-containing dehydrogenases [Energy production and conversion]
Probab=99.38 E-value=6.2e-12 Score=98.21 Aligned_cols=113 Identities=14% Similarity=0.126 Sum_probs=92.8
Q ss_pred ceEEEecCCCchhHHHHHHHHHHHHhhcCcccceeecCCHHHHHHHHHhhhhcHHHhhcCCC-ceeeEeeeccchHHHHH
Q psy732 3 PFFCCCLGSCVDHDAEKLNTFLQDGIENNIILDAVMCSEESKIQKIWPLRERIVESCLKDGY-IYNYDISLSLKDFYSII 81 (119)
Q Consensus 3 ~LlvE~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~e~~~lW~~R~~~~~~~~~~~~-~~~~DvaVP~~~l~~~i 81 (119)
.+++|++|............+.+.+.+.+.+.+..++.+.++..++|..|+...++....++ .+.+|++||.+++++++
T Consensus 270 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~d~~vp~~~~~~~~ 349 (459)
T COG0277 270 RLLVEVEGSDEAAVDEALEALGELLLEHGLARDLVVAQDLAEAARLWLARKGALAAAGALGPGVIQEDVVVPLEALPEFL 349 (459)
T ss_pred EEEEEEcCCcHHHHHHHHHHHHHHHHhcCCceeEEEeCCHHHHHHHHHHHHHHHHHHHhhCCCccccceeeeHHHHHHHH
Confidence 37889998763445555655555444444367788889999999999999999998887665 58899999999999999
Q ss_pred HHHHHHHhcCCC--CeEEeeeeeCCCCcceeeecCCC
Q psy732 82 PIMKERLKDQPV--VTVCGYGHLERLKDQPVVTVCGY 116 (119)
Q Consensus 82 ~~~~~~~~~~p~--~~~~~~GH~GDGNlH~~i~~~~~ 116 (119)
+.+.+.+.++ + +.+..|||.||||+|+++..+..
T Consensus 350 ~~~~~~~~~~-~~~~~~~~~~~~~dg~~~~~~~~~~~ 385 (459)
T COG0277 350 REILALLDKA-GLALRVALFGHAGDGNLHLNILYDVG 385 (459)
T ss_pred HHHHHHHHhc-CCCceeeeecccCCCcceeeeccCCC
Confidence 9999999998 5 99999999999999999987654
No 7
>KOG1231|consensus
Probab=98.43 E-value=4.8e-09 Score=81.91 Aligned_cols=109 Identities=11% Similarity=-0.000 Sum_probs=78.6
Q ss_pred EEEecCCCchhHHHHHHHHHHHHhhcCcccc-eeecCCHHHHHHHHHhhhhcHHHhhcCC------CceeeEeeeccchH
Q psy732 5 FCCCLGSCVDHDAEKLNTFLQDGIENNIILD-AVMCSEESKIQKIWPLRERIVESCLKDG------YIYNYDISLSLKDF 77 (119)
Q Consensus 5 lvE~~g~~~~~~~~~~~~~~~~~~~~~~~~~-~~~a~~~~e~~~lW~~R~~~~~~~~~~~------~~~~~DvaVP~~~l 77 (119)
..|++|.++.. ...--.+.+..++.....| +.++.++..+..+|++|+-.+-.....+ ..+..||+||.++.
T Consensus 312 ~~e~~~l~~~l-~~~~~~~~~~~v~y~~fldrv~~ae~klrskgLWevphpWlnL~vpks~i~~fa~gv~~dIl~~~s~g 390 (505)
T KOG1231|consen 312 FQEIGGLSEKL-NYAPTFIVEQDVQYHDFLDRVHFAEDKLRSKGLWEVPHPWLNLAVPKSRISDFARGVFTDILVPNSSG 390 (505)
T ss_pred HHHHhccchhh-hccchhhhhhhhHHHHhhhHhhhcccchhhcccccCCCchheeecccccchhhhhhhccceeeccCCC
Confidence 34566644332 1111123333444445667 8889999999999999997443333222 24788999999999
Q ss_pred HHHHHHHHHHHhcCCCCeEEeeeeeCCCCcceeeecCC
Q psy732 78 YSIIPIMKERLKDQPVVTVCGYGHLERLKDQPVVTVCG 115 (119)
Q Consensus 78 ~~~i~~~~~~~~~~p~~~~~~~GH~GDGNlH~~i~~~~ 115 (119)
+.++-.+++-++.. .....+.+|+||||+|+.++..+
T Consensus 391 ~~liyptnk~~kw~-~~~sav~ph~~e~vFy~v~~l~s 427 (505)
T KOG1231|consen 391 PVLIYPTNKDLKWS-NRLSAVTPHAGEGVFYLVILLRS 427 (505)
T ss_pred ceEEeccccCcchh-hhhccccccCCCceEEEEEEecC
Confidence 99999999988777 78888999999999999988764
No 8
>KOG4165|consensus
Probab=85.25 E-value=3.6 Score=32.06 Aligned_cols=40 Identities=13% Similarity=0.107 Sum_probs=29.1
Q ss_pred eEeeeccchHHHHHHHHHHHHhcCCCCeEEeeeeeCCCCcceeeecCC
Q psy732 68 YDISLSLKDFYSIIPIMKERLKDQPVVTVCGYGHLERLKDQPVVTVCG 115 (119)
Q Consensus 68 ~DvaVP~~~l~~~i~~~~~~~~~~p~~~~~~~GH~GDGNlH~~i~~~~ 115 (119)
-|..+|+.. .++++.+++ ..++.+.||+ ||-.|+-+-.+.
T Consensus 192 IDLvIPRGS-s~LVr~Ik~------~tkIPVLGHA-~GichvYvd~da 231 (433)
T KOG4165|consen 192 IDLVIPRGS-SDLVRSIKD------TTKIPVLGHA-EGICHVYVDKDA 231 (433)
T ss_pred eeEEecCCc-HHHHHHHhh------cccCcccccc-cceeEEEecccc
Confidence 399999876 355666654 3456699998 899998776543
No 9
>PRK10204 hypothetical protein; Provisional
Probab=79.43 E-value=7.2 Score=21.83 Aligned_cols=47 Identities=9% Similarity=0.200 Sum_probs=31.5
Q ss_pred CHHHHHHHHHhhhhcHHHhhc--CCCceeeEeeeccchHHHHHHHHHHHHhcC
Q psy732 41 EESKIQKIWPLRERIVESCLK--DGYIYNYDISLSLKDFYSIIPIMKERLKDQ 91 (119)
Q Consensus 41 ~~~e~~~lW~~R~~~~~~~~~--~~~~~~~DvaVP~~~l~~~i~~~~~~~~~~ 91 (119)
+.++...||+.|++...-.+. .| .+++.|.=...+..+.+.++-..|
T Consensus 5 TDkQKsrlWeqrrn~nfqaS~RLeG----~~~p~vtlt~eeA~aRl~~LRr~Y 53 (55)
T PRK10204 5 TDKQKSRLWEQQRNRNFQASRRLEG----VEMPLVTLTAEEALARLEELRRHY 53 (55)
T ss_pred hHHHHHHHHHHHHhhhHHHHhhhcC----cccccccccHHHHHHHHHHHHHHh
Confidence 457788999999987653322 33 366667666777777777665544
No 10
>COG0014 ProA Gamma-glutamyl phosphate reductase [Amino acid transport and metabolism]
Probab=78.56 E-value=12 Score=29.82 Aligned_cols=37 Identities=19% Similarity=-0.004 Sum_probs=27.4
Q ss_pred EeeeccchHHHHHHHHHHHHhcCCCCeEEeeeeeCCCCcceeeec
Q psy732 69 DISLSLKDFYSIIPIMKERLKDQPVVTVCGYGHLERLKDQPVVTV 113 (119)
Q Consensus 69 DvaVP~~~l~~~i~~~~~~~~~~p~~~~~~~GH~GDGNlH~~i~~ 113 (119)
|+.+|+..- .+++.+++ +.++.+.+|. +||.|+-+-.
T Consensus 198 D~iIPRGg~-~Li~~v~~------~a~vPVi~~~-~G~CHiyvd~ 234 (417)
T COG0014 198 DLVIPRGGA-GLIRRVVE------NATVPVIEHG-VGNCHIYVDE 234 (417)
T ss_pred eEEEcCCcH-HHHHHHHh------CCcCCEEecC-cceEEEEecc
Confidence 999999873 45555554 5677788885 9999987654
No 11
>PF09330 Lact-deh-memb: D-lactate dehydrogenase, membrane binding; InterPro: IPR015409 Members of this entry are predominantly found in prokaryotic D-lactate dehydrogenase, forming the cap-membrane-binding domain, which consists of a large seven-stranded antiparallel beta-sheet flanked on both sides by alpha-helices. They allow for membrane association []. ; GO: 0050660 flavin adenine dinucleotide binding, 0055085 transmembrane transport; PDB: 1F0X_A.
Probab=67.07 E-value=42 Score=25.51 Aligned_cols=105 Identities=16% Similarity=0.139 Sum_probs=56.7
Q ss_pred CceEEEecCCCchhHHHHHHHHHHHHhhcCcccceeecCCHHHHHHHHHhhhhcHHHhhcC-----C---CceeeEeeec
Q psy732 2 DPFFCCCLGSCVDHDAEKLNTFLQDGIENNIILDAVMCSEESKIQKIWPLRERIVESCLKD-----G---YIYNYDISLS 73 (119)
Q Consensus 2 ~~LlvE~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~e~~~lW~~R~~~~~~~~~~-----~---~~~~~DvaVP 73 (119)
|-|++.++|.+- +++.++++..+.... ....+.=+++|.++-.--|=++..+..+. . ..+.-||++|
T Consensus 113 HHLilkm~~~gi----~ea~~~L~~~f~~~~-~G~~feCt~~E~~~A~LhRFaaAgAaiRY~avh~~~vedivaLDiALr 187 (291)
T PF09330_consen 113 HHLILKMSGDGI----EEARAYLKEFFAKAE-DGDFFECTPEEGKKAFLHRFAAAGAAIRYRAVHRKEVEDIVALDIALR 187 (291)
T ss_dssp EEEEEEE-TTHH----HHHHHHHHHHCCCSS---EEEE--HHHHHHHHHHHTTHHHHHHHHHHHTCCCEECEEEEEEE-B
T ss_pred hheeeeecCCcH----HHHHHHHHHHhcccC-CCceEeeCHHHHHHHHHHHHHhhhhhHHHHHhccccccceeeeeeeec
Confidence 568899998643 233455555555442 13345556777777666676666644321 1 2478899999
Q ss_pred cchHHHHHHHHHHHHhcCCCCeEEeeeeeCCCCcceeeec
Q psy732 74 LKDFYSIIPIMKERLKDQPVVTVCGYGHLERLKDQPVVTV 113 (119)
Q Consensus 74 ~~~l~~~i~~~~~~~~~~p~~~~~~~GH~GDGNlH~~i~~ 113 (119)
+..- +-++.+-+.+++. -+.-..|||..-=-+|-..+.
T Consensus 188 RNd~-dW~E~LP~ei~~~-i~~klyyGHF~ChVfHQDYiv 225 (291)
T PF09330_consen 188 RNDR-DWFETLPPEIEDQ-IVHKLYYGHFFCHVFHQDYIV 225 (291)
T ss_dssp TT-S---S----HHHHTT-EEEEEEEEECCCTEEEEEEEE
T ss_pred CCch-hhhhhCCHHHHHH-HHHHeeccccchhhchhhhhh
Confidence 8763 2222333333332 345668999998888876654
No 12
>KOG1262|consensus
Probab=65.14 E-value=4.1 Score=32.51 Aligned_cols=46 Identities=11% Similarity=0.140 Sum_probs=40.4
Q ss_pred eeeEeeeccchHHHHHHHHHHHHhcCC--CCeEEeeeeeCCCCcceee
Q psy732 66 YNYDISLSLKDFYSIIPIMKERLKDQP--VVTVCGYGHLERLKDQPVV 111 (119)
Q Consensus 66 ~~~DvaVP~~~l~~~i~~~~~~~~~~p--~~~~~~~GH~GDGNlH~~i 111 (119)
+.-|+-||+.+|.+.++-+.+.++-|| -....+|-|=|.|.+|+..
T Consensus 384 V~QDmlvPl~kl~eald~~hke~evYPiwlcP~~l~~qp~~Gq~~~~p 431 (543)
T KOG1262|consen 384 VFQDMLVPLDKLKEALDTFHKEFEVYPIWLCPFRLYSQPGQGQLRPPP 431 (543)
T ss_pred HHHHhhccHHHHHHHHHHHHhhheeeeeeeeeeeccCCCCCceecCCc
Confidence 456999999999999999999999886 5678899999999999863
No 13
>PF03871 RNA_pol_Rpb5_N: RNA polymerase Rpb5, N-terminal domain; InterPro: IPR005571 Prokaryotes contain a single DNA-dependent RNA polymerase (RNAP; 2.7.7.6 from EC) that is responsible for the transcription of all genes, while eukaryotes have three classes of RNAPs (I-III) that transcribe different sets of genes. Each class of RNA polymerase is an assemblage of ten to twelve different polypeptides. Certain subunits of RNAPs, including RPB5 (POLR2E in mammals), are common to all three eukaryotic polymerases. RPB5 plays a role in the transcription activation process. Eukaryotic RPB5 has a bipartite structure consisting of a unique N-terminal region, plus a C-terminal region that is structurally homologous to the prokaryotic RPB5 homologue, subunit H (gene rpoH) (IPR000783 from INTERPRO) [, , , ]. This entry represents the N-terminal domain of eukaryotic RPB5, which has a core structure consisting of 3 layers alpha/beta/alpha []. The N-terminal domain is involved in DNA binding and is part of the jaw module in the RNA pol II structure []. This module is important for positioning the downstream DNA.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent, 0005634 nucleus; PDB: 3H0G_Q 1DZF_A 3GTG_E 2VUM_E 3GTP_E 3GTO_E 3S17_E 3S1R_E 1I3Q_E 1I50_E ....
Probab=65.08 E-value=1.8 Score=27.13 Aligned_cols=25 Identities=16% Similarity=0.622 Sum_probs=20.9
Q ss_pred CHHHHHHHHHhhhhcHHHhhcCCCc
Q psy732 41 EESKIQKIWPLRERIVESCLKDGYI 65 (119)
Q Consensus 41 ~~~e~~~lW~~R~~~~~~~~~~~~~ 65 (119)
++++..+||++|+.+.+-++..|+.
T Consensus 3 ~~~e~~rl~rirrTv~eMl~DRGY~ 27 (93)
T PF03871_consen 3 DDEEVSRLFRIRRTVMEMLRDRGYL 27 (93)
T ss_dssp TTHHHHHHHHHHCCCCCCCCCCTEE
T ss_pred cHHHHHHHHHHHHHHHHHHhcCCCc
Confidence 4478899999999999988878863
No 14
>cd05397 NT_Pol-beta-like Nucleotidyltransferase (NT) domain of DNA polymerase beta and similar proteins. This superfamily includes the NT domains of DNA polymerase beta and other family X DNA polymerases, as well as the NT domains of Class I and Class II CCA-adding enzymes, RelA- and SpoT-like ppGpp synthetases and hydrolases, 2'5'-oligoadenylate (2-5A)synthetases, Escherichia coli adenylyltransferase (GlnE), Escherichia coli uridylyl transferase (GlnD), poly (A) polymerases, terminal uridylyl transferases, and Staphylococcus aureus kanamycin nucleotidyltransferase, and similar proteins. The Escherichia coli CCA-adding enzyme belongs to this superfamily but is not included as this enzyme lacks the N-terminal helix conserved in the remainder of the superfamily. In the majority of the Pol beta-like superfamily NTs, two carboxylates, Dx[D/E], together with a third more distal carboxylate coordinate two divalent metal cations that are essential for catalysis. These divalent metal ions are
Probab=64.74 E-value=14 Score=20.06 Aligned_cols=31 Identities=35% Similarity=0.469 Sum_probs=24.0
Q ss_pred HHHHHHHHHHhcC-CCCeEEeeeeeCCCCcce
Q psy732 79 SIIPIMKERLKDQ-PVVTVCGYGHLERLKDQP 109 (119)
Q Consensus 79 ~~i~~~~~~~~~~-p~~~~~~~GH~GDGNlH~ 109 (119)
+.++.+++.+.++ +...+++||-.+-|+.+.
T Consensus 2 ~~l~~i~~~l~~~~~~~~v~lfGS~arg~~~~ 33 (49)
T cd05397 2 ELLDIIKERLKKLVPGYEIVVYGSLVRGLLKK 33 (49)
T ss_pred HHHHHHHHHHHhhcCCcEEEEECCcCCCCCCC
Confidence 3456677777776 678999999999998753
No 15
>PRK10858 nitrogen regulatory protein P-II 1; Provisional
Probab=62.03 E-value=15 Score=23.71 Aligned_cols=40 Identities=8% Similarity=-0.084 Sum_probs=32.5
Q ss_pred CceeeEeeeccchHHHHHHHHHHHHhcCCCCeEEeeeeeCCCCcceeee
Q psy732 64 YIYNYDISLSLKDFYSIIPIMKERLKDQPVVTVCGYGHLERLKDQPVVT 112 (119)
Q Consensus 64 ~~~~~DvaVP~~~l~~~i~~~~~~~~~~p~~~~~~~GH~GDGNlH~~i~ 112 (119)
+...-++.||-+...+.++.+.+.+. -|+.|||-+-+..+
T Consensus 57 pK~~ieivv~D~~v~~vv~~I~~~a~---------TG~~GDGkIfV~pV 96 (112)
T PRK10858 57 PKVKIEIVVPDDIVDTCVDTIIRTAQ---------TGKIGDGKIFVFDV 96 (112)
T ss_pred eeEEEEEEEChHhHHHHHHHHHHHhc---------cCCCCCcEEEEEEh
Confidence 35678999999999999988887653 28999999887655
No 16
>PF03876 SHS2_Rpb7-N: SHS2 domain found in N terminus of Rpb7p/Rpc25p/MJ0397; InterPro: IPR005576 The eukaryotic RNA polymerase subunits RPB4 and RPB7 form a heterodimer that reversibly associates with the RNA polymerase II core. Archaeal cells contain a single RNAP made up of about 12 subunits, displaying considerable homology to the eukaryotic RNAPII subunits. The RPB4 and RPB7 homologs are called subunits F and E, respectively, and have been shown to form a stable heterodimer. While the RPB7 homologue is reasonably well conserved, the similarity between the eukaryotic RPB4 and the archaeal F subunit is barely detectable []. This entry represents the N-terminal, heterodimerisation domain of RPB7.; GO: 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2C35_F 3HKZ_E 2PMZ_T 2CKZ_D 2Y0S_E 2RF4_A 2JA7_G 1Y1V_G 2JA5_G 4A3D_G ....
Probab=60.20 E-value=17 Score=20.93 Aligned_cols=44 Identities=16% Similarity=-0.017 Sum_probs=24.8
Q ss_pred eeeccchHH-HHHHHHHHHH-----hcC-CCC--eEEeeee---------eCCCCcceeeec
Q psy732 70 ISLSLKDFY-SIIPIMKERL-----KDQ-PVV--TVCGYGH---------LERLKDQPVVTV 113 (119)
Q Consensus 70 vaVP~~~l~-~~i~~~~~~~-----~~~-p~~--~~~~~GH---------~GDGNlH~~i~~ 113 (119)
+.+|++.|. +.-+.+++.+ .++ |++ -+.++.. -|||.+|+++.+
T Consensus 4 v~l~P~~l~~~~~~~i~~~L~~~~~~k~~~~~G~~i~v~~i~~~~~g~I~~~~g~~~~~V~f 65 (70)
T PF03876_consen 4 VRLPPSYLGPDLKDGIKEQLLDKYEGKYIPELGVVIAVTDIKEISEGKIIPGDGFVYFKVTF 65 (70)
T ss_dssp EEE-GGGTTSTHHHHHHHHHHHHHTTEEETTTEEEEEEEEEEEESCEEE-TTTSSEEEEEEE
T ss_pred EEECHHHhCcCHHHHHHHHHHHHHhCcCcCCceEEEEEeeeeEecCcEEECCCCCEEEEEEE
Confidence 678888888 6666664444 333 333 2222222 477788877654
No 17
>PF10832 DUF2559: Protein of unknown function (DUF2559); InterPro: IPR022541 This family of proteins appear to be restricted to Proteobacteria. The sequences are annotated as YhfG however currently no function is known.
Probab=59.29 E-value=30 Score=19.46 Aligned_cols=18 Identities=11% Similarity=0.346 Sum_probs=14.1
Q ss_pred CHHHHHHHHHhhhhcHHH
Q psy732 41 EESKIQKIWPLRERIVES 58 (119)
Q Consensus 41 ~~~e~~~lW~~R~~~~~~ 58 (119)
+.++...||+.|++...-
T Consensus 4 TdKQK~~lw~q~Rn~nfQ 21 (54)
T PF10832_consen 4 TDKQKSRLWEQRRNANFQ 21 (54)
T ss_pred cHHHHHHHHHHHHHhhHH
Confidence 467889999999986643
No 18
>COG0347 GlnK Nitrogen regulatory protein PII [Amino acid transport and metabolism]
Probab=59.17 E-value=18 Score=23.64 Aligned_cols=40 Identities=18% Similarity=0.112 Sum_probs=31.4
Q ss_pred CceeeEeeeccchHHHHHHHHHHHHhcCCCCeEEeeeeeCCCCcceeee
Q psy732 64 YIYNYDISLSLKDFYSIIPIMKERLKDQPVVTVCGYGHLERLKDQPVVT 112 (119)
Q Consensus 64 ~~~~~DvaVP~~~l~~~i~~~~~~~~~~p~~~~~~~GH~GDGNlH~~i~ 112 (119)
+....-++|+-+.+...++.+.+.++ . ||.|||-+=+..+
T Consensus 57 pK~~ieIvV~de~ve~vie~I~~~a~-t--------G~~GDGkIFV~~V 96 (112)
T COG0347 57 PKVKIEIVVSDEDVDEVIEAIKKAAR-T--------GKIGDGKIFVSPV 96 (112)
T ss_pred cceEEEEEEChHHHHHHHHHHHHHHh-c--------CCCCCeEEEEEEh
Confidence 34677899998889888888888765 3 8999998765443
No 19
>PLN00107 FAD-dependent oxidoreductase; Provisional
Probab=54.97 E-value=19 Score=26.85 Aligned_cols=25 Identities=20% Similarity=0.105 Sum_probs=22.8
Q ss_pred eeEeeeccchHHHHHHHHHHHHhcC
Q psy732 67 NYDISLSLKDFYSIIPIMKERLKDQ 91 (119)
Q Consensus 67 ~~DvaVP~~~l~~~i~~~~~~~~~~ 91 (119)
-.-.+||+++.++++++++++.++.
T Consensus 64 E~EyaVP~e~~~~aL~elr~l~~~~ 88 (257)
T PLN00107 64 QSAISVPLSGAAAFINDIKALRDIE 88 (257)
T ss_pred EEEEEecHHHHHHHHHHHHHHHHhC
Confidence 4589999999999999999999876
No 20
>PF04030 ALO: D-arabinono-1,4-lactone oxidase ; InterPro: IPR007173 This domain is specific to D-arabinono-1,4-lactone oxidase 1.1.3.37 from EC, which is involved in the final step of the D-erythroascorbic acid biosynthesis pathway [].; GO: 0003885 D-arabinono-1,4-lactone oxidase activity, 0055114 oxidation-reduction process, 0016020 membrane; PDB: 2VFU_A 2VFV_A 2VFT_A 2VFS_A 2VFR_A.
Probab=54.47 E-value=8.9 Score=27.94 Aligned_cols=26 Identities=0% Similarity=0.249 Sum_probs=23.6
Q ss_pred eeeEeeeccchHHHHHHHHHHHHhcC
Q psy732 66 YNYDISLSLKDFYSIIPIMKERLKDQ 91 (119)
Q Consensus 66 ~~~DvaVP~~~l~~~i~~~~~~~~~~ 91 (119)
.....+||+++.+++++++++++++.
T Consensus 126 ~~~E~~iP~~~~~~~l~~l~~~~~~~ 151 (259)
T PF04030_consen 126 WEMEYAIPIENAPEALRELRALIDKE 151 (259)
T ss_dssp -EEEEEEEGGGHHHHHHHHHHTHHHH
T ss_pred eeEEEeeCHHHHHHHHHHHHHHHHHc
Confidence 56799999999999999999999887
No 21
>PLN03111 DNA-directed RNA polymerase II subunit family protein; Provisional
Probab=52.34 E-value=13 Score=26.84 Aligned_cols=41 Identities=17% Similarity=0.454 Sum_probs=29.4
Q ss_pred CCHHHHHHHHHhhhhcHHHhhcCCCc-eeeEeeeccchHHHH
Q psy732 40 SEESKIQKIWPLRERIVESCLKDGYI-YNYDISLSLKDFYSI 80 (119)
Q Consensus 40 ~~~~e~~~lW~~R~~~~~~~~~~~~~-~~~DvaVP~~~l~~~ 80 (119)
.+..+..++|.+|+.+.+-++..|+. -.+++...++.+.+-
T Consensus 3 ~~~~e~~rl~r~rrTv~eMl~DRGY~V~~~el~~s~~~F~~~ 44 (206)
T PLN03111 3 TGSEESTRLYLVRRTVLEMLRDRGYLVSDSELNLTLSEFREK 44 (206)
T ss_pred cchHHHHHHHHHHHHHHHHHhccCCccCHHHHcCCHHHHHHH
Confidence 35677889999999999999888874 334555555555433
No 22
>KOG3218|consensus
Probab=50.71 E-value=12 Score=26.82 Aligned_cols=42 Identities=19% Similarity=0.391 Sum_probs=32.6
Q ss_pred CHHHHHHHHHhhhhcHHHhhcCCCcee-eEeeeccchHHHHHH
Q psy732 41 EESKIQKIWPLRERIVESCLKDGYIYN-YDISLSLKDFYSIIP 82 (119)
Q Consensus 41 ~~~e~~~lW~~R~~~~~~~~~~~~~~~-~DvaVP~~~l~~~i~ 82 (119)
++.|..++|.+|+.+.+-++..|+-++ +|+..+++.+-+..-
T Consensus 5 ~e~E~~rl~~ar~T~~qMlrDRGY~vt~~el~ltLe~F~~~yg 47 (208)
T KOG3218|consen 5 KEEEIYRLYLARKTAMQMLRDRGYTVTQEELDLTLEEFKARYG 47 (208)
T ss_pred hHHHHHHHHHHHHHHHHHHHhcCccccHHHhhhhHHHHHHHhc
Confidence 477889999999999999998887543 577787777655443
No 23
>cd07399 MPP_YvnB Bacillus subtilis YvnB and related proteins, metallophosphatase domain. YvnB (BSU35040) is an uncharacterized Bacillus subtilis protein with a metallophosphatase domain. This family includes bacterial and eukaryotic proteins similar to YvnB. YvnB belongs to the metallophosphatase (MPP) superfamily. MPPs are functionally diverse, but all share a conserved domain with an active site consisting of two metal ions (usually manganese, iron, or zinc) coordinated with octahedral geometry by a cage of histidine, aspartate, and asparagine residues. The MPP superfamily includes: Mre11/SbcD-like exonucleases, Dbr1-like RNA lariat debranching enzymes, YfcE-like phosphodiesterases, purple acid phosphatases (PAPs), YbbF-like UDP-2,3-diacylglucosamine hydrolases, and acid sphingomyelinases (ASMases). The conserved domain is a double beta-sheet sandwich with a di-metal active site made up of residues located at the C-terminal side of the sheets. This domain is thought to allow for
Probab=49.25 E-value=15 Score=26.07 Aligned_cols=21 Identities=24% Similarity=0.264 Sum_probs=17.6
Q ss_pred HHHHHHHhcCCCCeEEeeeee
Q psy732 82 PIMKERLKDQPVVTVCGYGHL 102 (119)
Q Consensus 82 ~~~~~~~~~~p~~~~~~~GH~ 102 (119)
+.+.++++++|++..+++||.
T Consensus 138 ~~~~~ll~~~~~V~~v~~GH~ 158 (214)
T cd07399 138 QIWDKLVKKNDNVFMVLSGHV 158 (214)
T ss_pred HHHHHHHhCCCCEEEEEcccc
Confidence 456788888888999999995
No 24
>PF14792 DNA_pol_B_palm: DNA polymerase beta palm ; PDB: 1RZT_A 3PML_A 2PFN_A 3HX0_K 3HWT_A 2GWS_E 2BCQ_A 3UPQ_A 2BCS_A 3UQ2_A ....
Probab=48.72 E-value=31 Score=22.14 Aligned_cols=30 Identities=27% Similarity=0.290 Sum_probs=25.6
Q ss_pred eccchHHHHHHHHHHHHhcC-CCCeEEeeee
Q psy732 72 LSLKDFYSIIPIMKERLKDQ-PVVTVCGYGH 101 (119)
Q Consensus 72 VP~~~l~~~i~~~~~~~~~~-p~~~~~~~GH 101 (119)
+|++...++.+.+++.+.+. |++.+.+.|-
T Consensus 2 IPR~Ev~~i~~~V~~~~~~i~p~~~v~i~GS 32 (112)
T PF14792_consen 2 IPRDEVEEIEEIVKEALEKIDPGLEVEICGS 32 (112)
T ss_dssp EEHHHHHHHHHHHHHHHHCCSTT-EEEEEHH
T ss_pred CCHHHHHHHHHHHHHHHHhcCCCcEEEEccc
Confidence 58888999999999999998 8999999885
No 25
>PRK10665 nitrogen regulatory protein P-II 2; Provisional
Probab=48.53 E-value=36 Score=21.97 Aligned_cols=40 Identities=13% Similarity=0.071 Sum_probs=32.3
Q ss_pred CceeeEeeeccchHHHHHHHHHHHHhcCCCCeEEeeeeeCCCCcceeee
Q psy732 64 YIYNYDISLSLKDFYSIIPIMKERLKDQPVVTVCGYGHLERLKDQPVVT 112 (119)
Q Consensus 64 ~~~~~DvaVP~~~l~~~i~~~~~~~~~~p~~~~~~~GH~GDGNlH~~i~ 112 (119)
+...-++.||-+...+.++.+.+... -|+.|||-+-+...
T Consensus 57 ~k~~ieivv~de~ve~vv~~I~~~a~---------TG~~GDGkIfV~pV 96 (112)
T PRK10665 57 PKVKIDVAIADDQLDEVIDIISKAAY---------TGKIGDGKIFVAEL 96 (112)
T ss_pred ceEEEEEEEChHhHHHHHHHHHHHhc---------cCCCCCcEEEEEEh
Confidence 35677999999999999988887653 28999999887655
No 26
>PF01764 Lipase_3: Lipase (class 3); InterPro: IPR002921 Triglyceride lipases are lipolytic enzymes that hydrolyse ester linkages of triglycerides []. Lipases are widely distributed in animals, plants and prokaryotes. This family of lipases have been called Class 3 as they are not closely related to other lipase families.; GO: 0004806 triglyceride lipase activity, 0006629 lipid metabolic process; PDB: 1LGY_A 1DTE_A 1DT5_F 4DYH_B 1DU4_C 4EA6_B 1GT6_B 1EIN_A 1DT3_A 1TIB_A ....
Probab=48.47 E-value=34 Score=21.81 Aligned_cols=27 Identities=22% Similarity=0.341 Sum_probs=21.5
Q ss_pred chHHHHHHHHHHHHhcCCCCeEEeeee
Q psy732 75 KDFYSIIPIMKERLKDQPVVTVCGYGH 101 (119)
Q Consensus 75 ~~l~~~i~~~~~~~~~~p~~~~~~~GH 101 (119)
....++.+.++++.+++|..++++.||
T Consensus 45 ~~~~~~~~~l~~~~~~~~~~~i~itGH 71 (140)
T PF01764_consen 45 SLYDQILDALKELVEKYPDYSIVITGH 71 (140)
T ss_dssp HHHHHHHHHHHHHHHHSTTSEEEEEEE
T ss_pred HHHHHHHHHHHHHHhcccCccchhhcc
Confidence 344567778888888887799999999
No 27
>PRK11628 transcriptional regulator BolA; Provisional
Probab=47.79 E-value=36 Score=21.80 Aligned_cols=31 Identities=13% Similarity=-0.034 Sum_probs=17.8
Q ss_pred HHHHHHHhc-CCCCeEEee-------eeeCCCCcceeeec
Q psy732 82 PIMKERLKD-QPVVTVCGY-------GHLERLKDQPVVTV 113 (119)
Q Consensus 82 ~~~~~~~~~-~p~~~~~~~-------GH~GDGNlH~~i~~ 113 (119)
+++++.+.. .|.....+. ||.|+|. |+.+..
T Consensus 5 ~~I~~~L~~~l~~~~l~V~d~S~~H~~~~g~~~-Hf~v~I 43 (105)
T PRK11628 5 EQIEEKLRAAFQPVFLEVVDESYRHNVPAGSES-HFKVVL 43 (105)
T ss_pred HHHHHHHHhhCCCcEEEEEECcccccCCCCCCc-eEEEEE
Confidence 345555544 344444444 6777764 888775
No 28
>PF02244 Propep_M14: Carboxypeptidase activation peptide; InterPro: IPR003146 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. The peptidases are synthesised as inactive molecules, zymogens, with propeptides that must be removed by proteolytic cleavage to activate the enzyme. Structural studies of carboxypeptidases A and B reveal the propeptide to exist as a globular domain, followed by an extended alpha-helix; this shields the catalytic site, without specifically binding to it, while the substrate-binding site is blocked by making specific contacts [, ]. Members of this propeptide family are found in the metallocarboxypeptidases: A1, A2 [], A3, A4, A5, A6, U, insect gut carboxypeptidase and B [], and and are associated with peptidases belonging to MEROPS peptidase family M14A. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase.; GO: 0004180 carboxypeptidase activity, 0006508 proteolysis; PDB: 3D68_C 3D67_B 3D66_A 1PBA_A 3GLJ_A 1NSA_A 1KWM_A 2BOA_B 1PYT_A 1JQG_A ....
Probab=46.40 E-value=30 Score=19.94 Aligned_cols=20 Identities=15% Similarity=0.263 Sum_probs=16.5
Q ss_pred eeEeeeccchHHHHHHHHHH
Q psy732 67 NYDISLSLKDFYSIIPIMKE 86 (119)
Q Consensus 67 ~~DvaVP~~~l~~~i~~~~~ 86 (119)
.-||-||++.+..|...+++
T Consensus 35 ~~dv~V~p~~~~~f~~~L~~ 54 (74)
T PF02244_consen 35 PVDVMVPPEKLEEFEELLKE 54 (74)
T ss_dssp EEEEEEEGGGHHHHHHHHHH
T ss_pred eEEEEECHHHHHHHHHHHHH
Confidence 46999999999888777664
No 29
>cd01984 AANH_like Adenine nucleotide alpha hydrolases superfamily including N type ATP PPases, ATP sulphurylases Universal Stress Response protein and electron transfer flavoprotein (ETF). The domain forms a apha/beta/apha fold which binds to Adenosine nucleotide.
Probab=46.28 E-value=24 Score=20.77 Aligned_cols=32 Identities=16% Similarity=0.118 Sum_probs=25.2
Q ss_pred HHHHHHHHHHhcCCCCeEEeeeeeCCCCcceee
Q psy732 79 SIIPIMKERLKDQPVVTVCGYGHLERLKDQPVV 111 (119)
Q Consensus 79 ~~i~~~~~~~~~~p~~~~~~~GH~GDGNlH~~i 111 (119)
.+.+.+.+..++. +...++.||-+|.+.-..+
T Consensus 35 ~~~~~~~~~a~~~-~~~~Iv~G~~~~d~~~~~~ 66 (86)
T cd01984 35 AFVRILKRLAAEE-GADVIILGHNADDVAGRRL 66 (86)
T ss_pred HHHHHHHHHHHHc-CCCEEEEcCCchhhhhhcc
Confidence 6677778888888 9999999998877655443
No 30
>PF07597 DUF1560: Protein of unknown function (DUF1560); InterPro: IPR011454 This is a small family of short hypothetical proteins in Rhodopirellula baltica.
Probab=44.73 E-value=15 Score=19.65 Aligned_cols=31 Identities=19% Similarity=0.251 Sum_probs=17.6
Q ss_pred HHHHhhhhcHHHhhcCCC---ceeeEeeeccchHHH
Q psy732 47 KIWPLRERIVESCLKDGY---IYNYDISLSLKDFYS 79 (119)
Q Consensus 47 ~lW~~R~~~~~~~~~~~~---~~~~DvaVP~~~l~~ 79 (119)
.-|+-|+ +++--...+ .-+-.++||+..|++
T Consensus 15 ~rwecrr--feakhplsknisvrsaeisvplaalae 48 (49)
T PF07597_consen 15 SRWECRR--FEAKHPLSKNISVRSAEISVPLAALAE 48 (49)
T ss_pred chhhhhh--hhhcCccccceeEeeeeecchhhhhhc
Confidence 3477776 332211111 245789999988764
No 31
>COG2885 OmpA Outer membrane protein and related peptidoglycan-associated (lipo)proteins [Cell envelope biogenesis, outer membrane]
Probab=44.55 E-value=34 Score=23.68 Aligned_cols=37 Identities=19% Similarity=0.348 Sum_probs=31.9
Q ss_pred eeeEeeeccchHHHHHHHHHHHHhcCCCCeEEeeeee
Q psy732 66 YNYDISLSLKDFYSIIPIMKERLKDQPVVTVCGYGHL 102 (119)
Q Consensus 66 ~~~DvaVP~~~l~~~i~~~~~~~~~~p~~~~~~~GH~ 102 (119)
|-+|-+.=.....+.+..+-+.++++|..++.+-||.
T Consensus 87 F~~ds~~l~p~~~~~L~~~a~~L~~~p~~~i~V~GHT 123 (190)
T COG2885 87 FDFDSSVLKPKAQATLDELAKYLKKNPITRILVEGHT 123 (190)
T ss_pred ecCCccccCHhHHHHHHHHHHHHHhCCCcEEEEEecC
Confidence 5567777677888999999999999999999999996
No 32
>PRK11148 cyclic 3',5'-adenosine monophosphate phosphodiesterase; Provisional
Probab=41.68 E-value=25 Score=25.76 Aligned_cols=21 Identities=29% Similarity=0.321 Sum_probs=17.0
Q ss_pred HHHHHHHhcCCCCeEEeeeee
Q psy732 82 PIMKERLKDQPVVTVCGYGHL 102 (119)
Q Consensus 82 ~~~~~~~~~~p~~~~~~~GH~ 102 (119)
.++.++++++|+++.+++||.
T Consensus 184 ~~l~~ll~~~~~v~~vl~GH~ 204 (275)
T PRK11148 184 HELAEVLAKFPNVKAILCGHI 204 (275)
T ss_pred HHHHHHHhcCCCceEEEeccc
Confidence 466777788778999999996
No 33
>TIGR02802 Pal_lipo peptidoglycan-associated lipoprotein. Members of this protein are Pal (also called OprL), the Peptidoglycan-Associated Lipoprotein of the Tol-Pal system. The system appears to be involved both in the maintenance of outer membrane integrity and in the import of certain organic molecules as nutrients. Members of this family contain a hydrodrophobic lipoprotein signal sequence, a conserved N-terminal cleavage and modification site, a poorly conserved low-complexity region, together comprising about 65 amino acids, and a well-conserved C-terminal domain. The seed alignment for this model includes only the conserved C-terminal domain.
Probab=41.67 E-value=51 Score=20.27 Aligned_cols=33 Identities=24% Similarity=0.222 Sum_probs=23.5
Q ss_pred eccchHHHHHHHHHHHHhcCCCCeEEeeeeeCC
Q psy732 72 LSLKDFYSIIPIMKERLKDQPVVTVCGYGHLER 104 (119)
Q Consensus 72 VP~~~l~~~i~~~~~~~~~~p~~~~~~~GH~GD 104 (119)
.+...-...+..+-+.+.++|+..+.+-||.-+
T Consensus 10 ~l~~~~~~~L~~~a~~l~~~~~~~i~I~Ghtd~ 42 (104)
T TIGR02802 10 DLKSEAQAILDAHAAYLKKNPSVRVTIEGHTDE 42 (104)
T ss_pred ccCHHHHHHHHHHHHHHHHCCCcEEEEEEecCC
Confidence 344455556777777777777888999999754
No 34
>cd00655 RNAP_Rpb7_N_like RNAP_Rpb7_N_like: This conserved domain represents the N-terminal ribonucleoprotein (RNP) domain of the Rpb7 subunit of eukaryotic RNA polymerase (RNAP) II and its homologs, Rpa43 of eukaryotic RNAP I, Rpc25 of eukaryotic RNAP III, and RpoE (subunit E) of archaeal RNAP. These proteins have, in addition to their N-terminal RNP domain, a C-terminal oligonucleotide-binding (OB) domain. Each of these subunits heterodimerizes with another RNAP subunit (Rpb7 to Rpb4, Rpc25 to Rpc17, RpoE to RpoF, and Rpa43 to Rpa14). The heterodimer is thought to tether the RNAP to a given promoter via its interactions with a promoter-bound transcription factor.The heterodimer is also thought to bind and position nascent RNA as it exits the polymerase complex.
Probab=41.07 E-value=80 Score=18.93 Aligned_cols=47 Identities=13% Similarity=-0.114 Sum_probs=27.4
Q ss_pred eeeEeeeccchHHHHHH-HHHHHHh---------cCCCCeEEeee---------eeCCCCcceeeec
Q psy732 66 YNYDISLSLKDFYSIIP-IMKERLK---------DQPVVTVCGYG---------HLERLKDQPVVTV 113 (119)
Q Consensus 66 ~~~DvaVP~~~l~~~i~-~~~~~~~---------~~p~~~~~~~G---------H~GDGNlH~~i~~ 113 (119)
+...+.+|++.+..-+. .+.+.+. ++ |+-+.++. +-|||..|+++.+
T Consensus 6 l~d~v~i~P~~f~~~~~~~i~~~L~~k~~gkv~~~~-G~~v~v~di~~i~~G~I~~gdG~~~~~V~F 71 (80)
T cd00655 6 IADLVSVPPKYFGDDCKGVKKCLLQEKGEGDRTPVV-GIILAIKDTKDIPEGAIRPGDGSAYVNVSF 71 (80)
T ss_pred EEEEEEECHHHhCccHHHHHHHHHHHHhCCeEeCCc-cEEEEEEEeEEEcCCEEECCCCCEEEEEEE
Confidence 45678999999974443 3333343 22 44444441 3578888877654
No 35
>PF03992 ABM: Antibiotic biosynthesis monooxygenase; InterPro: IPR007138 This domain is found in monooxygenases involved in the biosynthesis of several antibiotics by Streptomyces species, which can carry out oxygenation without the assistance of any of the prosthetic groups, metal ions or cofactors normally associated with activation of molecular oxygen. The structure of ActVA-Orf6 monooxygenase from Streptomyces coelicolor (Q53908 from SWISSPROT), which is involved in actinorhodin biosynthesis, reveals a dimeric alpha+beta barrel topology []. There is also a conserved histidine that is likely to be an active site residue. In S. coelicolor SCO1909 (Q9X9W3 from SWISSPROT) this domain occurs as a repeat. This domain is also found in protein LsrG, involved in the degradation of quorum-sensing molecule autoinducer-2 [], and in several uncharacterised proteins.; PDB: 1X7V_A 3F44_A 4DN9_B 1N5T_B 1LQ9_A 1N5Q_B 1N5V_A 1N5S_A 2GFF_B 3BM7_A ....
Probab=40.17 E-value=58 Score=18.22 Aligned_cols=35 Identities=6% Similarity=-0.020 Sum_probs=24.5
Q ss_pred eeEeeeccchHHHHHHHHHHHHh----cCCCCeEEeeee
Q psy732 67 NYDISLSLKDFYSIIPIMKERLK----DQPVVTVCGYGH 101 (119)
Q Consensus 67 ~~DvaVP~~~l~~~i~~~~~~~~----~~p~~~~~~~GH 101 (119)
..-+-|++++..+|++.+++... +.||+....+..
T Consensus 5 ~~~~~v~~~~~~~f~~~~~~~~~~~~~~~~G~~~~~~~~ 43 (78)
T PF03992_consen 5 IVKFKVKPGKEEEFLAAFQELAEATLRKEPGCLSYELYR 43 (78)
T ss_dssp EEEEEEETTGHHHHHHHHHHHHHHHHHTSTTEEEEEEEE
T ss_pred EEEEEeCcchHHHHHHHHHHHHHHHHhcCCCcEEEEEEE
Confidence 34567899999999998887765 335655555555
No 36
>PRK11183 D-lactate dehydrogenase; Provisional
Probab=39.98 E-value=59 Score=27.12 Aligned_cols=104 Identities=15% Similarity=0.144 Sum_probs=58.4
Q ss_pred CceEEEecCCCchhHHHHHHHHHHHHhhcCcccceeecCCHHHHHHHHHhhhhcHHHhhcC--------CCceeeEeeec
Q psy732 2 DPFFCCCLGSCVDHDAEKLNTFLQDGIENNIILDAVMCSEESKIQKIWPLRERIVESCLKD--------GYIYNYDISLS 73 (119)
Q Consensus 2 ~~LlvE~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~e~~~lW~~R~~~~~~~~~~--------~~~~~~DvaVP 73 (119)
|-|++..+|..-++. .+++++.+... ....+.=+++|.++-.--|=.+..+..++ ...+.-||++|
T Consensus 383 hhl~lk~~~~~~~e~----~~~l~~~f~~~--~g~~f~c~~~e~~~a~lhrf~~a~aa~ry~~~~~~~~~~i~~ldial~ 456 (564)
T PRK11183 383 HHLLLKMSGDGIEEA----RAYLKEYFKSA--EGDFFECTPEEGKKAFLHRFAAAGAAIRYRAVHRDEVEDILALDIALR 456 (564)
T ss_pred hheeeeecCccHHHH----HHHHHHHhhhc--CCCeEeeCHHHHHHHHHHHHHhhhhHHHHHHhhhccccceeeEEeeec
Confidence 668899988754333 33444444433 23345556777777667776665544332 12468899999
Q ss_pred cchHHHHHHHHHHHHhcCCCCeEEeeeeeCCCCcceeeec
Q psy732 74 LKDFYSIIPIMKERLKDQPVVTVCGYGHLERLKDQPVVTV 113 (119)
Q Consensus 74 ~~~l~~~i~~~~~~~~~~p~~~~~~~GH~GDGNlH~~i~~ 113 (119)
+..-. -++.+-+.++.. -+.-..|||+.-=-+|-..+.
T Consensus 457 rn~~~-w~e~lp~~~~~~-~~~~~yyghf~chv~hqdyi~ 494 (564)
T PRK11183 457 RNDRD-WFEHLPPEIDDQ-LVHKLYYGHFFCHVFHQDYIV 494 (564)
T ss_pred CCccc-hHHhCCHHHhhh-hhhheeccccceecchhhhhh
Confidence 87632 122222122221 233347999987777765543
No 37
>PF03432 Relaxase: Relaxase/Mobilisation nuclease domain ; InterPro: IPR005094 Relaxases/mobilisation proteins are required for the horizontal transfer of genetic information contained on plasmids that occurs during bacterial conjugation. The relaxase, in conjunction with several auxiliary proteins, forms the relaxation complex or relaxosome. Relaxases nick duplex DNA in a specific manner by catalysing trans-esterification [].
Probab=39.90 E-value=56 Score=23.12 Aligned_cols=42 Identities=12% Similarity=0.025 Sum_probs=29.2
Q ss_pred eccchHHHHHHHHHHHHhcC-C-CCeEEeeeeeCCCCcceeeecCCC
Q psy732 72 LSLKDFYSIIPIMKERLKDQ-P-VVTVCGYGHLERLKDQPVVTVCGY 116 (119)
Q Consensus 72 VP~~~l~~~i~~~~~~~~~~-p-~~~~~~~GH~GDGNlH~~i~~~~~ 116 (119)
+.+..+.++.. +.+.++ | +..++++-|.--||+|+-|+.+.-
T Consensus 69 ~~~e~~~~~~~---~~~~~~~~~~~~~v~~~H~D~~h~H~Hivin~v 112 (242)
T PF03432_consen 69 LTPEQAHEIAR---EFAEEMGPGNHQYVVVVHTDTDHPHVHIVINRV 112 (242)
T ss_pred cCHHHHHHHHH---HHHHHcCCCCcceEEEECCCcCeeeeeEEEeec
Confidence 44444444444 444555 4 788999999999999998887543
No 38
>KOG2355|consensus
Probab=39.69 E-value=1.1e+02 Score=22.70 Aligned_cols=40 Identities=13% Similarity=0.145 Sum_probs=35.0
Q ss_pred eeeEeeeccchH--HHHHHHHHHHHhcCCCCeEEeeeeeCCCC
Q psy732 66 YNYDISLSLKDF--YSIIPIMKERLKDQPVVTVCGYGHLERLK 106 (119)
Q Consensus 66 ~~~DvaVP~~~l--~~~i~~~~~~~~~~p~~~~~~~GH~GDGN 106 (119)
+-..|.|.++-+ +++++++++.++.. +.+++.--|+=||-
T Consensus 170 LLDEVTVDLDVlARadLLeFlkeEce~R-gatIVYATHIFDGL 211 (291)
T KOG2355|consen 170 LLDEVTVDLDVLARADLLEFLKEECEQR-GATIVYATHIFDGL 211 (291)
T ss_pred EeeeeEeehHHHHHHHHHHHHHHHHhhc-CcEEEEEeeeccch
Confidence 567888988887 57899999999999 99999999999984
No 39
>TIGR00032 argG argininosuccinate synthase. argG in bacteria, ARG1 in Saccharomyces cerevisiae. There is a very unusual clustering in the alignment, with a deep split between one cohort of E. coli, H. influenzae, and Streptomyces, and the other cohort of eukaryotes, archaea, and the rest of the eubacteria.
Probab=38.74 E-value=39 Score=26.73 Aligned_cols=28 Identities=18% Similarity=0.114 Sum_probs=22.0
Q ss_pred HHHHHHHHHHhcCCCCeEEeeeeeCCCCc
Q psy732 79 SIIPIMKERLKDQPVVTVCGYGHLERLKD 107 (119)
Q Consensus 79 ~~i~~~~~~~~~~p~~~~~~~GH~GDGNl 107 (119)
.....+.+.+++. |+.++++||.|.||=
T Consensus 93 li~~~l~~~A~~~-G~~~Ia~G~t~~gnD 120 (394)
T TIGR00032 93 LIAKKLVEAAKKE-GANAVAHGCTGKGND 120 (394)
T ss_pred HHHHHHHHHHHHc-CCCEEEECccCCcch
Confidence 3445566677778 999999999999983
No 40
>PF03568 Peptidase_C50: Peptidase family C50; InterPro: IPR005314 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of cysteine peptidases belong to MEROPS peptidase family C50 (separase family, clan CD). The active site residues for members of this family and family C14 occur in the same order in the sequence: H,C. The separases are caspase-like proteases, which plays a central role in the chromosome segregation. In yeast they cleave the rad21 subunit of the cohesin complex at the onset of anaphase. During most of the cell cycle, separase is inactivated by the securin/cut2 protein, which probably covers its active site. ; GO: 0008233 peptidase activity, 0006508 proteolysis, 0005634 nucleus
Probab=38.32 E-value=73 Score=24.90 Aligned_cols=55 Identities=9% Similarity=0.026 Sum_probs=35.4
Q ss_pred eecCCHHHHHHHHHhhhhcHHHhhc---------CCC-c-eeeEeeeccchHHHHHHHHHHHHhcC
Q psy732 37 VMCSEESKIQKIWPLRERIVESCLK---------DGY-I-YNYDISLSLKDFYSIIPIMKERLKDQ 91 (119)
Q Consensus 37 ~~a~~~~e~~~lW~~R~~~~~~~~~---------~~~-~-~~~DvaVP~~~l~~~i~~~~~~~~~~ 91 (119)
....+.++++++|+.|+++=..++. .|+ . +-.--.++.+.+..+...+.+++.+.
T Consensus 59 ~~~~t~~~kk~WW~~R~~LD~rL~~LL~~iE~~wLG~wkglf~~~~~~~~~~~~~~~~~~~iL~~~ 124 (383)
T PF03568_consen 59 SKIKTKEDKKKWWKERKALDSRLKDLLENIENSWLGGWKGLFLGKRRDSSSLEKFRKSFEKILNKH 124 (383)
T ss_pred cccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcCCccccchhhHHHHHHHHHHHHHh
Confidence 3446899999999999987654422 121 1 11111233677888888888888775
No 41
>PRK10802 peptidoglycan-associated outer membrane lipoprotein; Provisional
Probab=37.81 E-value=57 Score=22.63 Aligned_cols=38 Identities=21% Similarity=0.301 Sum_probs=33.0
Q ss_pred eeeEeeeccchHHHHHHHHHHHHhcCCCCeEEeeeeeC
Q psy732 66 YNYDISLSLKDFYSIIPIMKERLKDQPVVTVCGYGHLE 103 (119)
Q Consensus 66 ~~~DvaVP~~~l~~~i~~~~~~~~~~p~~~~~~~GH~G 103 (119)
|.+|-+.....-...+..+-+.+.++|...+.+-||.-
T Consensus 73 F~~d~~~l~~~~~~~L~~~a~~L~~~p~~~v~I~GhtD 110 (173)
T PRK10802 73 FDLDKYDIRSDFAQMLDAHANFLRSNPSYKVTVEGHAD 110 (173)
T ss_pred ecCCCccCCHHHHHHHHHHHHHHHhCCCceEEEEEecC
Confidence 66777888888888999999999999888999999984
No 42
>PRK10486 autoinducer-2 (AI-2) modifying protein LsrG; Provisional
Probab=37.35 E-value=80 Score=19.37 Aligned_cols=39 Identities=10% Similarity=-0.043 Sum_probs=26.2
Q ss_pred EeeeccchHHHHHHHHHHHHhcCCCCeEEeeeeeCCCCcceeeecCCC
Q psy732 69 DISLSLKDFYSIIPIMKERLKDQPVVTVCGYGHLERLKDQPVVTVCGY 116 (119)
Q Consensus 69 DvaVP~~~l~~~i~~~~~~~~~~p~~~~~~~GH~GDGNlH~~i~~~~~ 116 (119)
-+.+.+++..++.+.++++.... ..-.||+.+.+..+.+
T Consensus 7 ~~~~kpg~~~~~~~~l~~~~~~s---------r~EpGcl~y~~~~~~~ 45 (96)
T PRK10486 7 EINVKEDKVDEFIEVFRQNHLGS---------IQEPGNLRFDVLQDPE 45 (96)
T ss_pred EEEECcchHHHHHHHHHHHHHHH---------hCCCCceEEEEEeCCC
Confidence 46678888889988888877654 3345666655555443
No 43
>PF09186 DUF1949: Domain of unknown function (DUF1949); InterPro: IPR015269 Members of this entry are a set of functionally uncharacterised hypothetical bacterial proteins. They adopt a ferredoxin-like fold, with a beta-alpha-beta-beta-alpha-beta arrangement []. This entry contains the protein Impact, which is a translational regulator that ensures constant high levels of translation under amino acid starvation. It acts by interacting with Gcn1/Gcn1L1, thereby preventing activation of Gcn2 protein kinases (EIF2AK1 to 4) and subsequent down-regulation of protein synthesis. It is evolutionary conserved from eukaryotes to archaea []. ; PDB: 2CVE_A 1VI7_A.
Probab=37.24 E-value=62 Score=17.25 Aligned_cols=24 Identities=0% Similarity=0.004 Sum_probs=21.2
Q ss_pred ceeeEeeeccchHHHHHHHHHHHH
Q psy732 65 IYNYDISLSLKDFYSIIPIMKERL 88 (119)
Q Consensus 65 ~~~~DvaVP~~~l~~~i~~~~~~~ 88 (119)
.+...+.||.+....|.+.+.++.
T Consensus 30 ~V~~~v~v~~~~~~~f~~~l~~~t 53 (56)
T PF09186_consen 30 DVTLTVAVPEEEVEEFKAQLTDLT 53 (56)
T ss_dssp TEEEEEEEECCCHHHHHHHHHHHT
T ss_pred eEEEEEEECHHHHHHHHHHHHHHc
Confidence 478899999999999999998764
No 44
>cd07392 MPP_PAE1087 Pyrobaculum aerophilum PAE1087 and related proteins, metallophosphatase domain. PAE1087 is an uncharacterized Pyrobaculum aerophilum protein with a metallophosphatase domain. The domain present in members of this family belongs to the metallophosphatase (MPP) superfamily. MPPs are functionally diverse, but all share a conserved domain with an active site consisting of two metal ions (usually manganese, iron, or zinc) coordinated with octahedral geometry by a cage of histidine, aspartate, and asparagine residues. The MPP superfamily includes: Mre11/SbcD-like exonucleases, Dbr1-like RNA lariat debranching enzymes, YfcE-like phosphodiesterases, purple acid phosphatases (PAPs), YbbF-like UDP-2,3-diacylglucosamine hydrolases, and acid sphingomyelinases (ASMases). The conserved domain is a double beta-sheet sandwich with a di-metal active site made up of residues located at the C-terminal side of the sheets. This domain is thought to allow for productive metal coordina
Probab=36.97 E-value=31 Score=23.08 Aligned_cols=20 Identities=15% Similarity=0.486 Sum_probs=15.2
Q ss_pred HHHHHHHhcCCCCeEEeeeee
Q psy732 82 PIMKERLKDQPVVTVCGYGHL 102 (119)
Q Consensus 82 ~~~~~~~~~~p~~~~~~~GH~ 102 (119)
+.+.++++++ ..+.+++||+
T Consensus 151 ~~l~~li~~~-~~~~~l~GH~ 170 (188)
T cd07392 151 KAIRKFIEER-QPLLCICGHI 170 (188)
T ss_pred HHHHHHHHHh-CCcEEEEecc
Confidence 4566666777 7889999987
No 45
>TIGR03729 acc_ester putative phosphoesterase. Members of this protein family belong to the larger family pfam00149 (calcineurin-like phosphoesterase), a family largely defined by small motifs of metal-chelating residues. The subfamily in this model shows a good but imperfect co-occurrence in species with domain TIGR03715 that defines a novel class of signal peptide typical of the accessory secretory system.
Probab=36.72 E-value=27 Score=25.01 Aligned_cols=20 Identities=20% Similarity=0.240 Sum_probs=17.2
Q ss_pred HHHHHHHhcCCCCeEEeeeee
Q psy732 82 PIMKERLKDQPVVTVCGYGHL 102 (119)
Q Consensus 82 ~~~~~~~~~~p~~~~~~~GH~ 102 (119)
..+.++++++ ++..+++||+
T Consensus 199 ~~l~~li~~~-~v~~~i~GH~ 218 (239)
T TIGR03729 199 QHFGQLLVKY-EIKDVIFGHL 218 (239)
T ss_pred HHHHHHHHHh-CCCEEEECCc
Confidence 5677888888 8999999997
No 46
>KOG4498|consensus
Probab=36.70 E-value=46 Score=23.85 Aligned_cols=49 Identities=14% Similarity=0.241 Sum_probs=37.0
Q ss_pred HHHHHHHhhhhcHHHhhcCCCceeeEeeeccchHHHHHHHHHHHHhcCCCCeEEeee
Q psy732 44 KIQKIWPLRERIVESCLKDGYIYNYDISLSLKDFYSIIPIMKERLKDQPVVTVCGYG 100 (119)
Q Consensus 44 e~~~lW~~R~~~~~~~~~~~~~~~~DvaVP~~~l~~~i~~~~~~~~~~p~~~~~~~G 100 (119)
...+||+-|.++...+++.+ ||--...++.+..++.++++. |+..+..|
T Consensus 43 p~~~L~~~~~avV~~vRrpg-------CvlCR~~aadLa~l~~~ld~~-Gv~Li~vg 91 (197)
T KOG4498|consen 43 PVTSLFKERSAVVAFVRRPG-------CVLCREEAADLASLKDLLDEL-GVVLIAVG 91 (197)
T ss_pred ehHHhhhcCCeEEEEeccCc-------EEEeHHHHHHHHHHHHHHHHh-CCEEEEEe
Confidence 34789999998877776644 333455667777789999999 99988877
No 47
>COG1768 Predicted phosphohydrolase [General function prediction only]
Probab=36.29 E-value=33 Score=24.72 Aligned_cols=46 Identities=26% Similarity=0.299 Sum_probs=25.8
Q ss_pred HHHHhhhhcHHHhhcCCC---------ceeeEeeeccchHHHHHHHHHHHHhcCCCCeEEeeeee
Q psy732 47 KIWPLRERIVESCLKDGY---------IYNYDISLSLKDFYSIIPIMKERLKDQPVVTVCGYGHL 102 (119)
Q Consensus 47 ~lW~~R~~~~~~~~~~~~---------~~~~DvaVP~~~l~~~i~~~~~~~~~~p~~~~~~~GH~ 102 (119)
++=++|.+...++.+... +++.|- |.+.+.++++ +. .+..++|||+
T Consensus 142 E~~RLrlsa~a~l~k~~~~fivM~HYPP~s~~~--t~~~~sevle-------e~-rv~~~lyGHl 196 (230)
T COG1768 142 EIGRLRLSADAALPKGVSKFIVMTHYPPFSDDG--TPGPFSEVLE-------EG-RVSKCLYGHL 196 (230)
T ss_pred HHHHHHHHHHHhcccCcCeEEEEEecCCCCCCC--CCcchHHHHh-------hc-ceeeEEeeec
Confidence 445667766666655432 122221 3444544443 45 6888899995
No 48
>TIGR03751 conj_TIGR03751 conjugative transfer region lipoprotein, TIGR03751 family. Members of this protein family are found occasionally on plasmids. Usually, however, they are found on the bacterial main chromosome in regions flanked by markers of conjugative transfer and/or transposition.
Probab=36.11 E-value=44 Score=21.92 Aligned_cols=28 Identities=11% Similarity=0.125 Sum_probs=22.4
Q ss_pred HHHHHHHHHHhcC--CCCeEEeeeeeCCCC
Q psy732 79 SIIPIMKERLKDQ--PVVTVCGYGHLERLK 106 (119)
Q Consensus 79 ~~i~~~~~~~~~~--p~~~~~~~GH~GDGN 106 (119)
....+++..+.+. |++-+++|=|+..|.
T Consensus 61 ta~nei~~~F~RLPNPdlvMYVfPHLa~~~ 90 (116)
T TIGR03751 61 TAENEIQQLFPRLPNPDLVMYVFPHLAGGD 90 (116)
T ss_pred HHHhHHHHhccCCCCCCeEEEEeeeecCCC
Confidence 3456788888887 689999999998654
No 49
>PHA00684 hypothetical protein
Probab=35.97 E-value=39 Score=22.55 Aligned_cols=32 Identities=28% Similarity=0.285 Sum_probs=26.6
Q ss_pred eccchHHHHHHHHHHHHhcCCC----CeEEeeeeeC
Q psy732 72 LSLKDFYSIIPIMKERLKDQPV----VTVCGYGHLE 103 (119)
Q Consensus 72 VP~~~l~~~i~~~~~~~~~~p~----~~~~~~GH~G 103 (119)
+|++.+..+++++....+++|+ ++-+.+|.+|
T Consensus 53 ~~l~~I~~~V~~Fi~ya~~hp~~~F~VT~IGCGiAG 88 (128)
T PHA00684 53 LSLPDIGAAVNRFIAYATAHPHLNFQVTRVGCGLAG 88 (128)
T ss_pred ccHHHHHHHHHHHHHHHHhCCCcEEEeeeecccccc
Confidence 6999999999999999988874 4566778877
No 50
>cd01585 AcnA_Bact Aconitase catalyzes the reversible isomerization of citrate and isocitrate as part of the TCA cycle. Bacterial Aconitase-like catalytic domain. Aconitase (aconitate hydratase or citrate hydrolyase) catalyzes the reversible isomerization of citrate and isocitrate as part of the TCA cycle. Cis-aconitate is formed as an intermediate product during the course of the reaction. This distinct subfamily is found only in bacteria and Archaea. Its exact characteristics are not known.
Probab=35.68 E-value=72 Score=25.22 Aligned_cols=42 Identities=7% Similarity=-0.150 Sum_probs=33.0
Q ss_pred ceeeEeeeccch--HHHHHHHHHHHHhcCCCCeEEeeeeeCCCCccee
Q psy732 65 IYNYDISLSLKD--FYSIIPIMKERLKDQPVVTVCGYGHLERLKDQPV 110 (119)
Q Consensus 65 ~~~~DvaVP~~~--l~~~i~~~~~~~~~~p~~~~~~~GH~GDGNlH~~ 110 (119)
.+..|=.+|... -.+....+++..+++ +++. -..|.|=.|..
T Consensus 31 ~~~~DH~~p~~~~~~a~~~~~lr~~a~~~-gi~~---~~~g~GI~H~v 74 (380)
T cd01585 31 VSYVDHNTLQTDFENADDHRFLQTVAARY-GIYF---SRPGNGICHQV 74 (380)
T ss_pred EEEeCCCCCCCChHHHHHHHHHHHHHHHc-CCEE---ECCCCCeeeee
Confidence 356788888774 568899999999999 8752 37889999974
No 51
>cd07402 MPP_GpdQ Enterobacter aerogenes GpdQ and related proteins, metallophosphatase domain. GpdQ (glycerophosphodiesterase Q, also known as Rv0805 in Mycobacterium tuberculosis) is a binuclear metallophosphoesterase from Enterobacter aerogenes that catalyzes the hydrolysis of mono-, di-, and triester substrates, including some organophosphate pesticides and products of the degradation of nerve agents. The GpdQ homolog, Rv0805, has 2',3'-cyclic nucleotide phosphodiesterase activity. GpdQ and Rv0805 belong to the metallophosphatase (MPP) superfamily. MPPs are functionally diverse, but all share a conserved domain with an active site consisting of two metal ions (usually manganese, iron, or zinc) coordinated with octahedral geometry by a cage of histidine, aspartate, and asparagine residues. The MPP superfamily includes: Mre11/SbcD-like exonucleases, Dbr1-like RNA lariat debranching enzymes, YfcE-like phosphodiesterases, purple acid phosphatases (PAPs), YbbF-like UDP-2,3-diacylglucosa
Probab=35.61 E-value=36 Score=24.00 Aligned_cols=24 Identities=25% Similarity=0.349 Sum_probs=18.4
Q ss_pred HHHHHHHHhcCCCCeEEeeeeeCC
Q psy732 81 IPIMKERLKDQPVVTVCGYGHLER 104 (119)
Q Consensus 81 i~~~~~~~~~~p~~~~~~~GH~GD 104 (119)
-..+.+++.++|++..+++||.-.
T Consensus 170 ~~~~~~~l~~~~~v~~v~~GH~H~ 193 (240)
T cd07402 170 AEALAAVLARHPNVRAILCGHVHR 193 (240)
T ss_pred HHHHHHHHhcCCCeeEEEECCcCc
Confidence 455667777777899999999754
No 52
>PRK13746 aminoglycoside resistance protein; Provisional
Probab=35.26 E-value=60 Score=24.25 Aligned_cols=37 Identities=8% Similarity=0.077 Sum_probs=29.2
Q ss_pred eccchHHHHHHHHHHHHhcCCC---CeEEeeeeeCCCCcce
Q psy732 72 LSLKDFYSIIPIMKERLKDQPV---VTVCGYGHLERLKDQP 109 (119)
Q Consensus 72 VP~~~l~~~i~~~~~~~~~~p~---~~~~~~GH~GDGNlH~ 109 (119)
+|. .+++.+..++++++++++ +.++.||-+..|+++.
T Consensus 5 ~~~-~i~~~l~~~~~~l~~~l~~~l~~vyLfGS~~~G~~~p 44 (262)
T PRK13746 5 VPA-EISTQLSEACAVIERHLEPTLLAIHLYGSAVDGGLKP 44 (262)
T ss_pred ccH-HHHHHHHHHHHHHHHhCcccEEEEEEECCcccCCCCC
Confidence 455 778888888899888832 2579999999999885
No 53
>cd07404 MPP_MS158 Microscilla MS158 and related proteins, metallophosphatase domain. MS158 is an uncharacterized Microscilla protein with a metallophosphatase domain. Microscilla proteins MS152, and MS153 are also included in this family. The domain present in members of this family belongs to the metallophosphatase (MPP) superfamily. MPPs are functionally diverse, but all share a conserved domain with an active site consisting of two metal ions (usually manganese, iron, or zinc) coordinated with octahedral geometry by a cage of histidine, aspartate, and asparagine residues. The MPP superfamily includes: Mre11/SbcD-like exonucleases, Dbr1-like RNA lariat debranching enzymes, YfcE-like phosphodiesterases, purple acid phosphatases (PAPs), YbbF-like UDP-2,3-diacylglucosamine hydrolases, and acid sphingomyelinases (ASMases). The conserved domain is a double beta-sheet sandwich with a di-metal active site made up of residues located at the C-terminal side of the sheets. This domain is t
Probab=35.12 E-value=30 Score=23.10 Aligned_cols=24 Identities=8% Similarity=0.112 Sum_probs=18.2
Q ss_pred HHHHHHHHHHhcCCCCeEEeeeeeC
Q psy732 79 SIIPIMKERLKDQPVVTVCGYGHLE 103 (119)
Q Consensus 79 ~~i~~~~~~~~~~p~~~~~~~GH~G 103 (119)
.+...+.+++++. ++..+++||.-
T Consensus 124 ~~~~~l~~~~~~~-~v~~~i~GH~H 147 (166)
T cd07404 124 AFAVDLDDLILAD-PIDLWIHGHTH 147 (166)
T ss_pred hhhhccHhHHhhc-CCCEEEECCcc
Confidence 4555566777776 89999999974
No 54
>cd07379 MPP_239FB Homo sapiens 239FB and related proteins, metallophosphatase domain. 239FB (Fetal brain protein 239) is thought to play a role in central nervous system development, but its specific role in unknown. 239FB is expressed predominantly in human fetal brain from a gene located in the chromosome 11p13 region associated with the mental retardation component of the WAGR (Wilms tumor, Aniridia, Genitourinary anomalies, Mental retardation) syndrome. Orthologous brp-like (brain protein 239-like) proteins have been identified in the invertebrate amphioxus group and in vertebrates. 239FB belongs to the metallophosphatase (MPP) superfamily. MPPs are functionally diverse, but all share a conserved domain with an active site consisting of two metal ions (usually manganese, iron, or zinc) coordinated with octahedral geometry by a cage of histidine, aspartate, and asparagine residues. The MPP superfamily includes: Mre11/SbcD-like exonucleases, Dbr1-like RNA lariat debranching enzyme
Probab=34.89 E-value=32 Score=22.18 Aligned_cols=20 Identities=15% Similarity=0.312 Sum_probs=15.1
Q ss_pred HHHHHHHhcCCCCeEEeeeee
Q psy732 82 PIMKERLKDQPVVTVCGYGHL 102 (119)
Q Consensus 82 ~~~~~~~~~~p~~~~~~~GH~ 102 (119)
..+.+.+++. +.++++|||.
T Consensus 94 ~~~~~~~~~~-~~~~~i~GH~ 113 (135)
T cd07379 94 EELLNRVQRV-RPKLHVFGHI 113 (135)
T ss_pred HHHHHHHHHH-CCcEEEEcCc
Confidence 3456666677 7899999996
No 55
>COG1509 KamA Lysine 2,3-aminomutase [Amino acid transport and metabolism]
Probab=34.83 E-value=2.3e+02 Score=22.43 Aligned_cols=105 Identities=9% Similarity=0.056 Sum_probs=62.4
Q ss_pred CceEEEecCCCchhHHHHHHHHHHHHhhcCcc-cc--eeec---CCHHHHHHHHHhhhhcHHHhhc-------CCCceee
Q psy732 2 DPFFCCCLGSCVDHDAEKLNTFLQDGIENNII-LD--AVMC---SEESKIQKIWPLRERIVESCLK-------DGYIYNY 68 (119)
Q Consensus 2 ~~LlvE~~g~~~~~~~~~~~~~~~~~~~~~~~-~~--~~~a---~~~~e~~~lW~~R~~~~~~~~~-------~~~~~~~ 68 (119)
+++++-.+-+.+.++........+.+...|.. .+ +... +++ +.|=++.++++..--. ....-..
T Consensus 220 ~~v~~~tH~NHp~Eit~e~~~A~~~L~~aGv~l~NQsVLLrGVND~~---evl~~L~~~L~~~gV~PYYl~~~D~~~G~~ 296 (369)
T COG1509 220 KPVWLVTHFNHPNEITPEAREACAKLRDAGVPLLNQSVLLRGVNDDP---EVLKELSRALFDAGVKPYYLHQLDLVQGAA 296 (369)
T ss_pred ceEEEEcccCChhhcCHHHHHHHHHHHHcCceeecchheecccCCCH---HHHHHHHHHHHHcCCcceEEeccCccCCcc
Confidence 45566666555555555554445544444411 11 1111 233 3333455544332111 0112467
Q ss_pred EeeeccchHHHHHHHHHHHHhcCCCCeEEeeeeeCCCCccee
Q psy732 69 DISLSLKDFYSIIPIMKERLKDQPVVTVCGYGHLERLKDQPV 110 (119)
Q Consensus 69 DvaVP~~~l~~~i~~~~~~~~~~p~~~~~~~GH~GDGNlH~~ 110 (119)
+.-||+++-.+.++.++..+.-+ .+...++-|-|.|.=++.
T Consensus 297 hfr~~i~~~~~i~~~lr~~~SG~-~~P~~v~d~pgg~gK~~~ 337 (369)
T COG1509 297 HFRVPIAEGLQIVEELRGRTSGY-AVPTLVVDIPGGGGKTPL 337 (369)
T ss_pred ceeccHHHHHHHHHHHHHhCCCc-ccceeEEecCCCCCcccc
Confidence 99999999999999999999988 788888999988876554
No 56
>PF07045 DUF1330: Protein of unknown function (DUF1330); InterPro: IPR010753 This family consists of several hypothetical bacterial proteins of around 90 residues in length. The function of this family is unknown.; PDB: 2FIU_B 3HHL_A 3DCA_D 3LO3_I.
Probab=34.82 E-value=70 Score=18.14 Aligned_cols=26 Identities=12% Similarity=0.006 Sum_probs=20.5
Q ss_pred cchHHHHHHHHHHHHhcCCCCeEEeee
Q psy732 74 LKDFYSIIPIMKERLKDQPVVTVCGYG 100 (119)
Q Consensus 74 ~~~l~~~i~~~~~~~~~~p~~~~~~~G 100 (119)
++.+.+|.+.+..+++++ |-+++..|
T Consensus 1 p~~~~~Y~~~~~~~l~~~-GG~~l~~~ 26 (65)
T PF07045_consen 1 PEAYQEYREAVPPILEKY-GGRVLARG 26 (65)
T ss_dssp -HHHHHHHHHHHHHHHHT-T-EEEEEC
T ss_pred ChHHHHHHHHHHHHHHHc-CCEEEEEC
Confidence 356889999999999999 88887764
No 57
>PTZ00061 DNA-directed RNA polymerase; Provisional
Probab=34.71 E-value=28 Score=25.17 Aligned_cols=37 Identities=16% Similarity=0.221 Sum_probs=26.4
Q ss_pred HHHHHHHHhhhhcHHHhhcCCCc-eeeEeeeccchHHH
Q psy732 43 SKIQKIWPLRERIVESCLKDGYI-YNYDISLSLKDFYS 79 (119)
Q Consensus 43 ~e~~~lW~~R~~~~~~~~~~~~~-~~~DvaVP~~~l~~ 79 (119)
++..++|.+|+.+.+-++..|+. -.+++...++.+.+
T Consensus 3 ~~~~rl~r~rrTv~eMl~DRGY~V~~~el~~s~~~F~~ 40 (205)
T PTZ00061 3 DSETRFFRCRRTCCEMLEDRGYIITSQEKLETFATFKE 40 (205)
T ss_pred HHHHHHHHHHHHHHHHHhccCCccCHHHHcCCHHHHHH
Confidence 35569999999999999888874 33455555555544
No 58
>PF11343 DUF3145: Protein of unknown function (DUF3145); InterPro: IPR021491 This family of proteins with unknown function appear to be restricted to Actinobacteria.
Probab=34.65 E-value=35 Score=23.58 Aligned_cols=41 Identities=10% Similarity=-0.004 Sum_probs=27.9
Q ss_pred eeeEeeeccchHHHHHHH-----------HHHHHhcCCCCeEEeeeeeCCCC
Q psy732 66 YNYDISLSLKDFYSIIPI-----------MKERLKDQPVVTVCGYGHLERLK 106 (119)
Q Consensus 66 ~~~DvaVP~~~l~~~i~~-----------~~~~~~~~p~~~~~~~GH~GDGN 106 (119)
..-||.||=.+|..+++. +..++.+-.+-..--|-|+|||-
T Consensus 99 a~Gdvvv~E~rlRa~~~~a~~~~~~l~~el~~~LG~aWD~eLEpfR~ag~ga 150 (158)
T PF11343_consen 99 ANGDVVVPEDRLRAAMERARGDGEDLARELDRALGTAWDDELEPFRHAGDGA 150 (158)
T ss_pred CCCCEecCHHHHHHHHHHhcCCHHHHHHHHHHHhCCchhhhhcccccCCCCC
Confidence 356999999999888883 33333332244555688999884
No 59
>COG4922 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=32.55 E-value=60 Score=21.45 Aligned_cols=34 Identities=12% Similarity=0.068 Sum_probs=26.8
Q ss_pred HHHHHHHHHHHhcCCCCeEEeeeeeCCCCcceee
Q psy732 78 YSIIPIMKERLKDQPVVTVCGYGHLERLKDQPVV 111 (119)
Q Consensus 78 ~~~i~~~~~~~~~~p~~~~~~~GH~GDGNlH~~i 111 (119)
..|++++.+.++++|....-+.--+.||+|=..-
T Consensus 47 ~~fv~fFt~ffk~~P~~~~kiVr~iadGdLV~vh 80 (129)
T COG4922 47 DGFVRFFTEFFKEKPRISTKIVRVIADGDLVTVH 80 (129)
T ss_pred HHHHHHHHHHHHhCccccceeeEEeccCCEEEEE
Confidence 3567778888888888888888999999985443
No 60
>PF03389 MobA_MobL: MobA/MobL family; InterPro: IPR005053 This entry represents a domain found at the N terminus of MobA in Escherichia coli, and MobL in Thiobacillus ferrooxidans (Acidithiobacillus ferrooxidans), as well as in conjugal transfer protein TraA. MobA and MobL are mobilisation proteins, which are essential for specific plasmid transfer.; GO: 0009291 unidirectional conjugation; PDB: 2NS6_A.
Probab=32.18 E-value=1.7e+02 Score=21.04 Aligned_cols=68 Identities=12% Similarity=0.152 Sum_probs=34.6
Q ss_pred HHHHHHHHhhhhcHHHhh--cCCC-ceeeEeeeccch-HHH---HHH-HHHHHHhcCCCCeEEeeee-eCCCCcceeeec
Q psy732 43 SKIQKIWPLRERIVESCL--KDGY-IYNYDISLSLKD-FYS---IIP-IMKERLKDQPVVTVCGYGH-LERLKDQPVVTV 113 (119)
Q Consensus 43 ~e~~~lW~~R~~~~~~~~--~~~~-~~~~DvaVP~~~-l~~---~i~-~~~~~~~~~p~~~~~~~GH-~GDGNlH~~i~~ 113 (119)
.+++.||..=+ ... +.+. .--..+++|.+- ..+ +++ ++++.+..+ |+.+-+.=| -|++|.|+=|+.
T Consensus 47 ~dr~~lWn~aE----~~E~Rkna~~are~~iALP~EL~~eq~~~L~~~f~~~~~~~~-G~~~d~aIH~d~~~NpHaHim~ 121 (216)
T PF03389_consen 47 KDREELWNAAE----AAEKRKNARLAREFEIALPRELTLEQNIELVREFAQENFVDY-GMAADVAIHDDGPRNPHAHIMF 121 (216)
T ss_dssp -SSHHHHHHHH----HHS--TTS--EEEEEEE--TTS-HHHHHHHHHHHHHHHHTTT-T--EEEEEEEETTTEEEEEEEE
T ss_pred cCHHHHHHHHH----HhhcCCCCeEeeeeeeeCCccCCHHHHHHHHHHHHHHHhhcc-ceEEEEEEecCCCCCCEEEEEe
Confidence 34577887532 222 1222 234688888652 222 332 233334556 777777778 889999998886
Q ss_pred CC
Q psy732 114 CG 115 (119)
Q Consensus 114 ~~ 115 (119)
..
T Consensus 122 t~ 123 (216)
T PF03389_consen 122 TT 123 (216)
T ss_dssp --
T ss_pred ec
Confidence 44
No 61
>KOG1504|consensus
Probab=31.94 E-value=65 Score=24.52 Aligned_cols=36 Identities=14% Similarity=0.071 Sum_probs=25.8
Q ss_pred eEeeeccchHHHHHHHHHHHHhcCCCCeEEeeeeeCCCC
Q psy732 68 YDISLSLKDFYSIIPIMKERLKDQPVVTVCGYGHLERLK 106 (119)
Q Consensus 68 ~DvaVP~~~l~~~i~~~~~~~~~~p~~~~~~~GH~GDGN 106 (119)
.|.==|+.-|++++..++..-+...|+++ .-+||||
T Consensus 165 ~D~~HPlQ~laD~LTi~E~f~ks~~glkv---awiGD~N 200 (346)
T KOG1504|consen 165 CDHFHPLQILADLLTIIEHFGKSVEGLKV---AWIGDGN 200 (346)
T ss_pred ccccChHHHHHHHHHHHHHHhccccccEE---EEEcccc
Confidence 57777888999999887776543335554 4578998
No 62
>cd05403 NT_KNTase_like Nucleotidyltransferase (NT) domain of Staphylococcus aureus kanamycin nucleotidyltransferase, and similar proteins. S. aureus KNTase is a plasmid encoded enzyme which confers resistance to a wide range of aminoglycoside antibiotics which have a 4'- or 4''-hydroxyl group in the equatorial position, such as kanamycin A. This enzyme transfers a nucleoside monophosphate group from a nucleotide (ATP,GTP, or UTP) to the 4'-hydroxyl group of kanamycin A. This enzyme is a homodimer, having two NT active sites. The nucleotide and antibiotic binding sites of each active site include residues from each monomer. Included in this subgroup is Escherichia coli AadA5 which confers resistance to the antibiotic spectinomycin and is a putative aminoglycoside-3'-adenylyltransferase. It is part of the aadA5 cassette of a class 1 integron. This subgroup also includes Haemophilus influenzae HI0073 which forms a 2:2 heterotetramer with an unrelated protein HI0074. Structurally HI0074 is
Probab=31.92 E-value=70 Score=18.61 Aligned_cols=16 Identities=25% Similarity=0.285 Sum_probs=14.4
Q ss_pred CCeEEeeeeeCCCCcc
Q psy732 93 VVTVCGYGHLERLKDQ 108 (119)
Q Consensus 93 ~~~~~~~GH~GDGNlH 108 (119)
-..+++||-...|+.|
T Consensus 18 i~~i~LfGS~arg~~~ 33 (93)
T cd05403 18 VEKVYLFGSYARGDAR 33 (93)
T ss_pred ccEEEEEeeeecCCCC
Confidence 5889999999999987
No 63
>PF13380 CoA_binding_2: CoA binding domain; PDB: 3FF4_A 2D5A_A 2D59_A 2E6U_X 1IUL_A 1IUK_A 1Y81_A 2DUW_A.
Probab=31.73 E-value=67 Score=20.52 Aligned_cols=33 Identities=9% Similarity=-0.148 Sum_probs=21.3
Q ss_pred cHHHhhcC-CCceeeEeeeccchHHHHHHHHHHH
Q psy732 55 IVESCLKD-GYIYNYDISLSLKDFYSIIPIMKER 87 (119)
Q Consensus 55 ~~~~~~~~-~~~~~~DvaVP~~~l~~~i~~~~~~ 87 (119)
+++++... ++.-.-++++|..+.+++++++.+.
T Consensus 45 ~y~sl~e~p~~iDlavv~~~~~~~~~~v~~~~~~ 78 (116)
T PF13380_consen 45 CYPSLAEIPEPIDLAVVCVPPDKVPEIVDEAAAL 78 (116)
T ss_dssp -BSSGGGCSST-SEEEE-S-HHHHHHHHHHHHHH
T ss_pred eeccccCCCCCCCEEEEEcCHHHHHHHHHHHHHc
Confidence 45555543 2334568999999999999999876
No 64
>cd00839 MPP_PAPs purple acid phosphatases of the metallophosphatase superfamily, metallophosphatase domain. Purple acid phosphatases (PAPs) belong to a diverse family of binuclear metallohydrolases that have been identified and characterized in plants, animals, and fungi. PAPs contain a binuclear metal center and their characteristic pink or purple color derives from a charge-transfer transition between a tyrosine residue and a chromophoric ferric ion within the binuclear center. PAPs catalyze the hydrolysis of a wide range of activated phosphoric acid mono- and di-esters and anhydrides. PAPs are distinguished from the other phosphatases by their insensitivity to L-(+) tartrate inhibition and are therefore also known as tartrate resistant acid phosphatases (TRAPs). While only a few copies of PAP-like genes are present in mammalian and fungal genomes, multiple copies are present in plant genomes. PAPs belong to the metallophosphatase (MPP) superfamily. MPPs are functionally diver
Probab=31.49 E-value=47 Score=24.25 Aligned_cols=25 Identities=12% Similarity=0.069 Sum_probs=21.2
Q ss_pred HHHHHHHHHHHHhcCCCCeEEeeeee
Q psy732 77 FYSIIPIMKERLKDQPVVTVCGYGHL 102 (119)
Q Consensus 77 l~~~i~~~~~~~~~~p~~~~~~~GH~ 102 (119)
..+..+.+.++++++ ++..++.||.
T Consensus 178 ~~~~~~~l~~ll~~~-~v~~vl~GH~ 202 (294)
T cd00839 178 GEKMRAALEDLFYKY-GVDLVLSGHV 202 (294)
T ss_pred hHHHHHHHHHHHHHh-CCCEEEEccc
Confidence 456777888899999 9999999995
No 65
>cd07403 MPP_TTHA0053 Thermus thermophilus TTHA0053 and related proteins, metallophosphatase domain. TTHA0053 is an uncharacterized Thermus thermophilus protein with a domain that belongs to the metallophosphatase (MPP) superfamily. MPPs are functionally diverse, but all share a conserved domain with an active site consisting of two metal ions (usually manganese, iron, or zinc) coordinated with octahedral geometry by a cage of histidine, aspartate, and asparagine residues. The MPP superfamily includes: Mre11/SbcD-like exonucleases, Dbr1-like RNA lariat debranching enzymes, YfcE-like phosphodiesterases, purple acid phosphatases (PAPs), YbbF-like UDP-2,3-diacylglucosamine hydrolases, and acid sphingomyelinases (ASMases). The conserved domain is a double beta-sheet sandwich with a di-metal active site made up of residues located at the C-terminal side of the sheets. This domain is thought to allow for productive metal coordination.
Probab=31.43 E-value=39 Score=21.90 Aligned_cols=20 Identities=10% Similarity=0.003 Sum_probs=15.0
Q ss_pred HHHHHHHhcCCCCeEEeeeee
Q psy732 82 PIMKERLKDQPVVTVCGYGHL 102 (119)
Q Consensus 82 ~~~~~~~~~~p~~~~~~~GH~ 102 (119)
+.+.+.+.++ +...+++||.
T Consensus 81 ~~l~~~l~~~-~~~~vl~GH~ 100 (129)
T cd07403 81 EAFLDFIDRF-RPKLFIHGHT 100 (129)
T ss_pred HHHHHHHHHH-CCcEEEEcCc
Confidence 3455666677 7889999997
No 66
>TIGR03875 RNA_lig_partner RNA ligase partner, MJ_0950 family. This uncharacterized protein family is found almost perfectly in the same set of genomes as the Pab1020 family described by model TIGR01209. These pairs are found mostly in Archaea, but also in a few bacteria (e.g. Alkalilimnicola ehrlichei MLHE-1, Aquifex aeolicus). While the partner protein has been described as homodimeric ligase that has RNA circularization activity, the function of this protein (also called UPF0278) is unknown.
Probab=31.26 E-value=38 Score=24.45 Aligned_cols=77 Identities=17% Similarity=0.183 Sum_probs=43.8
Q ss_pred HHHHHHHHHHHhhcCcccceeecCCHHHHHHHHHhh--hhcHH-HhhcC-C-----CceeeEeeeccchHHHHHHHHHHH
Q psy732 17 AEKLNTFLQDGIENNIILDAVMCSEESKIQKIWPLR--ERIVE-SCLKD-G-----YIYNYDISLSLKDFYSIIPIMKER 87 (119)
Q Consensus 17 ~~~~~~~~~~~~~~~~~~~~~~a~~~~e~~~lW~~R--~~~~~-~~~~~-~-----~~~~~DvaVP~~~l~~~i~~~~~~ 87 (119)
.+.+..++..+.+.....++.....+.-.++|-.+. +.+.+ .+.+. . ++--+++-+|..=+.+|+++++..
T Consensus 24 ~ea~~~~l~Lia~arl~l~iscYmPpsVy~El~~fl~~~~~~~e~~~kl~twv~~KsP~rye~~IPA~i~ye~I~e~R~R 103 (206)
T TIGR03875 24 CEAVRTFLDLIARARLKLGIECYMPPSVYKELRRFLERNGCDPETLAKLDTWVVKKSPNRYEVKIPAEIFYEYIEEVRER 103 (206)
T ss_pred HHHHHHHHHHHHHhhhccCceeecCHHHHHHHHHHHHhcCCCHHHHHhheeEEEEcCCCeeeeeccHHHHHHHHHHHHHH
Confidence 344444554333332222332223355556655553 34443 23221 1 234679999999999999999999
Q ss_pred HhcCCCCe
Q psy732 88 LKDQPVVT 95 (119)
Q Consensus 88 ~~~~p~~~ 95 (119)
.++ |++
T Consensus 104 Ink--GLR 109 (206)
T TIGR03875 104 IDK--GLR 109 (206)
T ss_pred Hhc--chh
Confidence 877 554
No 67
>smart00206 NTR Tissue inhibitor of metalloproteinase family. Form complexes with metalloproteinases, such as collagenases, and irreversibly inactivate them.
Probab=31.09 E-value=43 Score=23.50 Aligned_cols=19 Identities=21% Similarity=-0.038 Sum_probs=16.2
Q ss_pred CCeEEeeeeeCCCCcceee
Q psy732 93 VVTVCGYGHLERLKDQPVV 111 (119)
Q Consensus 93 ~~~~~~~GH~GDGNlH~~i 111 (119)
+-...+.|.+-||++|++.
T Consensus 77 ~~~YLIaG~~~dG~l~i~l 95 (172)
T smart00206 77 KEEYLIAGRLEDGKMHITL 95 (172)
T ss_pred CEEEEEEEEccCCeEEEEe
Confidence 3678899999999999865
No 68
>cd07400 MPP_YydB Bacillus subtilis YydB and related proteins, metallophosphatase domain. YydB (BSU40220) is an uncharacterized Bacillus subtilis protein that belongs to the following Bacillus subtilis gene cluster yydB-yydC-yydD-yydG-yydH-yydI-yydJ. YydB belongs to the metallophosphatase (MPP) superfamily. MPPs are functionally diverse, but all share a conserved domain with an active site consisting of two metal ions (usually manganese, iron, or zinc) coordinated with octahedral geometry by a cage of histidine, aspartate, and asparagine residues. The MPP superfamily includes: Mre11/SbcD-like exonucleases, Dbr1-like RNA lariat debranching enzymes, YfcE-like phosphodiesterases, purple acid phosphatases (PAPs), YbbF-like UDP-2,3-diacylglucosamine hydrolases, and acid sphingomyelinases (ASMases). The conserved domain is a double beta-sheet sandwich with a di-metal active site made up of residues located at the C-terminal side of the sheets. This domain is thought to allow for productiv
Probab=30.88 E-value=48 Score=21.34 Aligned_cols=22 Identities=18% Similarity=0.019 Sum_probs=17.6
Q ss_pred HHHHHHHhcCCCCeEEeeeeeCC
Q psy732 82 PIMKERLKDQPVVTVCGYGHLER 104 (119)
Q Consensus 82 ~~~~~~~~~~p~~~~~~~GH~GD 104 (119)
+.+.++++++ +++.+++||.-.
T Consensus 103 ~~~~~~l~~~-~~~~~l~GH~H~ 124 (144)
T cd07400 103 GDALKLLAEA-GVDLVLHGHKHV 124 (144)
T ss_pred HHHHHHHHHc-CCCEEEECCCCC
Confidence 4567777788 999999999754
No 69
>PRK04358 hypothetical protein; Provisional
Probab=30.65 E-value=2.2e+02 Score=20.84 Aligned_cols=79 Identities=20% Similarity=0.217 Sum_probs=44.4
Q ss_pred HHHHHHHHHHHHhhcCcccceeecCCHHHHHHHHHh--hhhcHHH-hhcCC------CceeeEeeeccchHHHHHHHHHH
Q psy732 16 DAEKLNTFLQDGIENNIILDAVMCSEESKIQKIWPL--RERIVES-CLKDG------YIYNYDISLSLKDFYSIIPIMKE 86 (119)
Q Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~a~~~~e~~~lW~~--R~~~~~~-~~~~~------~~~~~DvaVP~~~l~~~i~~~~~ 86 (119)
..+.+..++..+.+.....++....-+.-.++|-.+ |+.+.+- +.+.. ++--+++-+|..=+.+|+++++.
T Consensus 27 l~ea~~~~l~Lia~arl~l~is~YmPpSVy~El~~f~~~~~~~~e~~~kl~twi~~KsP~ry~v~IPA~i~ye~I~~mR~ 106 (217)
T PRK04358 27 LEEAVEKFLDLIARARLKLGISCYMPPSVYKELRGFLERNGCSPEVIAKLDTWIVKKSPNRYEIKIPAEIFYEYIEDMRE 106 (217)
T ss_pred HHHHHHHHHHHHHHhhhccCceEEcCHHHHHHHHHHHHhcCCCHHHHhhheeEEEEcCCCceeeeccHHHHHHHHHHHHH
Confidence 334455555544333222223222335555555444 3344432 32211 13467999999999999999999
Q ss_pred HHhcCCCCeE
Q psy732 87 RLKDQPVVTV 96 (119)
Q Consensus 87 ~~~~~p~~~~ 96 (119)
..++ |+++
T Consensus 107 RInk--GLRV 114 (217)
T PRK04358 107 RINK--GLRV 114 (217)
T ss_pred HHhc--chHH
Confidence 9887 5543
No 70
>KOG3179|consensus
Probab=30.32 E-value=61 Score=23.74 Aligned_cols=33 Identities=15% Similarity=0.021 Sum_probs=23.7
Q ss_pred EeeeccchHHHHHHHHHHHHhcCCCCeEEeeee
Q psy732 69 DISLSLKDFYSIIPIMKERLKDQPVVTVCGYGH 101 (119)
Q Consensus 69 DvaVP~~~l~~~i~~~~~~~~~~p~~~~~~~GH 101 (119)
|.--..+=+-.+...++++....+.+-.+||||
T Consensus 71 dAf~d~dWI~KLcs~~kkld~mkkkvlGICFGH 103 (245)
T KOG3179|consen 71 DAFSDADWIKKLCSFVKKLDFMKKKVLGICFGH 103 (245)
T ss_pred cccccchHHHHHHHHHHHHHhhccceEEEeccH
Confidence 333345556677777777776666899999999
No 71
>PF13740 ACT_6: ACT domain; PDB: 1ZPV_A 3P96_A 1U8S_A.
Probab=29.80 E-value=1.1e+02 Score=17.74 Aligned_cols=31 Identities=10% Similarity=0.078 Sum_probs=24.7
Q ss_pred eeeEeeeccchHHHHHHHHHHHHhcCCCCeEE
Q psy732 66 YNYDISLSLKDFYSIIPIMKERLKDQPVVTVC 97 (119)
Q Consensus 66 ~~~DvaVP~~~l~~~i~~~~~~~~~~p~~~~~ 97 (119)
+..=+.+|.+.+.++-..+.++.++. ++.+.
T Consensus 44 ~~~~v~~~~~~~~~l~~~L~~l~~~~-~l~v~ 74 (76)
T PF13740_consen 44 LIMLVSIPEDSLERLESALEELAEEL-GLDVS 74 (76)
T ss_dssp EEEEEEESHHHHHHHHHHHHHHHHHT-T-EEE
T ss_pred EEEEEEeCcccHHHHHHHHHHHHHHC-CcEEE
Confidence 55677888889999999999998888 87764
No 72
>PF00862 Sucrose_synth: Sucrose synthase; InterPro: IPR000368 Sucrose synthases catalyse the synthesis of sucrose 2.4.1.13 from EC in the following reaction: UDP-glucose + D-fructose = UDP + sucrose This family includes the bulk of the sucrose synthase protein. However the carboxyl terminal region of the sucrose synthases belongs to the glycosyl transferase family IPR001296 from INTERPRO. This enzyme is found mainly in plants but also appears in bacteria.; GO: 0005985 sucrose metabolic process; PDB: 2R60_A 2R66_A 2R68_A 3S27_G 3S29_A 3S28_A.
Probab=29.08 E-value=18 Score=29.75 Aligned_cols=34 Identities=6% Similarity=-0.176 Sum_probs=21.5
Q ss_pred chHHHHHHHHHHH-HhcCCCCeEEeeeeeCCCCcc
Q psy732 75 KDFYSIIPIMKER-LKDQPVVTVCGYGHLERLKDQ 108 (119)
Q Consensus 75 ~~l~~~i~~~~~~-~~~~p~~~~~~~GH~GDGNlH 108 (119)
..|.+|...+.+. .++..+..-.+.||-+|||+=
T Consensus 381 PyLe~fa~d~~~~i~~e~~~~PdlI~GnYsDgnlv 415 (550)
T PF00862_consen 381 PYLEEFADDAEREILAELQGKPDLIIGNYSDGNLV 415 (550)
T ss_dssp GGHHHHHHHHHHHHHHHHTS--SEEEEEHHHHHHH
T ss_pred hhHHHHHHHHHHHHHHHhCCCCcEEEeccCcchHH
Confidence 3467777776543 343335556789999999973
No 73
>cd04871 ACT_PSP_2 ACT domains found N-terminal of phosphoserine phosphatase (PSP, SerB). The ACT_PSP_2 CD includes the second of the two ACT domains found N-terminal of phosphoserine phosphatase (PSP, SerB). PSPs belong to the L-2-haloacid dehalogenase-like protein superfamily. PSP is involved in serine metabolism; serine is synthesized from phosphoglycerate through sequential reactions catalyzed by 3-phosphoglycerate dehydrogenase (SerA), 3-phosphoserine aminotransferase (SerC), and SerB. Members of this CD belong to the superfamily of ACT regulatory domains
Probab=28.76 E-value=1.4e+02 Score=17.93 Aligned_cols=33 Identities=9% Similarity=0.073 Sum_probs=27.7
Q ss_pred ceeeEeeeccchHHHHHHHHHHHHhcCCCCeEEe
Q psy732 65 IYNYDISLSLKDFYSIIPIMKERLKDQPVVTVCG 98 (119)
Q Consensus 65 ~~~~DvaVP~~~l~~~i~~~~~~~~~~p~~~~~~ 98 (119)
.+.+++..|...+.++-+.+.++.++. |+.+.+
T Consensus 50 ~~e~~v~~~~~~~~~lr~~L~~la~el-gvDIav 82 (84)
T cd04871 50 CVEFSVRGQPADLEALRAALLELASEL-NVDIAF 82 (84)
T ss_pred EEEEEEeCCCCCHHHHHHHHHHHhccc-CceEEE
Confidence 467888888889999999999999888 887754
No 74
>cd07395 MPP_CSTP1 Homo sapiens CSTP1 and related proteins, metallophosphatase domain. CSTP1 (complete S-transactivated protein 1) is an uncharacterized Homo sapiens protein with a metallophosphatase domain, that is transactivated by the complete S protein of hepatitis B virus. CSTP1 belongs to the metallophosphatase (MPP) superfamily. MPPs are functionally diverse, but all share a conserved domain with an active site consisting of two metal ions (usually manganese, iron, or zinc) coordinated with octahedral geometry by a cage of histidine, aspartate, and asparagine residues. The MPP superfamily includes: Mre11/SbcD-like exonucleases, Dbr1-like RNA lariat debranching enzymes, YfcE-like phosphodiesterases, purple acid phosphatases (PAPs), YbbF-like UDP-2,3-diacylglucosamine hydrolases, and acid sphingomyelinases (ASMases). The conserved domain is a double beta-sheet sandwich with a di-metal active site made up of residues located at the C-terminal side of the sheets. This domain is th
Probab=28.54 E-value=66 Score=23.20 Aligned_cols=23 Identities=22% Similarity=0.249 Sum_probs=18.3
Q ss_pred HHHHHHHHHHhcCCCCeEEeeeee
Q psy732 79 SIIPIMKERLKDQPVVTVCGYGHL 102 (119)
Q Consensus 79 ~~i~~~~~~~~~~p~~~~~~~GH~ 102 (119)
+.-.++.++++++ ++..+++||.
T Consensus 193 ~~~~~l~~ll~~~-~V~~v~~GH~ 215 (262)
T cd07395 193 SVRKPLLDKFKKA-GVKAVFSGHY 215 (262)
T ss_pred HHHHHHHHHHHhc-CceEEEECcc
Confidence 3445677788888 9999999995
No 75
>cd05402 NT_PAP_TUTase Nucleotidyltransferase (NT) domain of poly(A) polymerases and terminal uridylyl transferases. Poly(A) polymerases (PAPs) catalyze mRNA poly(A) tail synthesis, and terminal uridylyl transferases (TUTases) uridylate RNA. PAPs in this subgroup include human PAP alpha, mouse testis-specific cytoplasmic PAP beta, human nuclear PAP gamma, Saccharomyces cerevisiae PAP1, TRF4 and-5, Schizosaccharomyces pombe caffeine-induced death proteins -1, and -14, Caenorhabditis elegans Germ Line Development-2, and Chlamydomonas reinhardtii MUT68. This family also includes human U6 snRNA-specific TUTase1, and Trypanosoma brucei 3'-TUTase-1,-2, and 4. This family belongs to the Pol beta-like NT superfamily. In the majority of enzymes in this superfamily, two carboxylates, Dx[D/E], together with a third more distal carboxylate, coordinate two divalent metal cations involved in a two-metal ion mechanism of nucleotide addition. For the majority of proteins in this family, these carboxyla
Probab=27.29 E-value=83 Score=19.55 Aligned_cols=28 Identities=14% Similarity=0.261 Sum_probs=22.9
Q ss_pred HHHHHHHHHHhcC-CCCeEEeeeeeCCCC
Q psy732 79 SIIPIMKERLKDQ-PVVTVCGYGHLERLK 106 (119)
Q Consensus 79 ~~i~~~~~~~~~~-p~~~~~~~GH~GDGN 106 (119)
++++.+++.+++. |+.++..||-..-|-
T Consensus 4 ~i~~~l~~~i~~~~~~~~v~~fGS~~~g~ 32 (114)
T cd05402 4 EVLDRLQELIKEWFPGAKLYPFGSYVTGL 32 (114)
T ss_pred HHHHHHHHHHHHHCCCCEEEEecccccCC
Confidence 5678888888886 899999999877664
No 76
>cd07398 MPP_YbbF-LpxH Escherichia coli YbbF/LpxH and related proteins, metallophosphatase domain. YbbF/LpxH is an Escherichia coli UDP-2,3-diacylglucosamine hydrolase thought to catalyze the fourth step of lipid A biosynthesis, in which a precursor UDP-2,3-diacylglucosamine is hydrolyzed to yield 2,3-diacylglucosamine 1-phosphate and UMP. YbbF belongs to the metallophosphatase (MPP) superfamily. MPPs are functionally diverse, but all share a conserved domain with an active site consisting of two metal ions (usually manganese, iron, or zinc) coordinated with octahedral geometry by a cage of histidine, aspartate, and asparagine residues. The MPP superfamily includes: Mre11/SbcD-like exonucleases, Dbr1-like RNA lariat debranching enzymes, YfcE-like phosphodiesterases, purple acid phosphatases (PAPs), YbbF-like UDP-2,3-diacylglucosamine hydrolases, and acid sphingomyelinases (ASMases). The conserved domain is a double beta-sheet sandwich with a di-metal active site made up of residues l
Probab=26.96 E-value=77 Score=21.86 Aligned_cols=30 Identities=13% Similarity=-0.020 Sum_probs=20.4
Q ss_pred chHHHHHHHHHHHHhcCCCCeEEeeeeeCCC
Q psy732 75 KDFYSIIPIMKERLKDQPVVTVCGYGHLERL 105 (119)
Q Consensus 75 ~~l~~~i~~~~~~~~~~p~~~~~~~GH~GDG 105 (119)
..+..+...+.+.+.+. ++..+++||.=-.
T Consensus 172 ~~~~~~~~~~~~~~~~~-~~~~~i~GH~H~~ 201 (217)
T cd07398 172 AIIDVFEEAVARLARRK-GVDGVICGHTHRP 201 (217)
T ss_pred HHHHHHHHHHHHHHHhc-CCCEEEECCCCCC
Confidence 34444555556666677 9999999997443
No 77
>cd04870 ACT_PSP_1 CT domains found N-terminal of phosphoserine phosphatase (PSP, SerB). The ACT_PSP_1 CD includes the first of the two ACT domains found N-terminal of phosphoserine phosphatase (PSP, SerB). PSPs belong to the L-2-haloacid dehalogenase-like protein superfamily. PSP is involved in serine metabolism; serine is synthesized from phosphoglycerate through sequential reactions catalyzed by 3-phosphoglycerate dehydrogenase (SerA), 3-phosphoserine aminotransferase (SerC), and SerB. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=26.90 E-value=1.2e+02 Score=17.36 Aligned_cols=32 Identities=6% Similarity=-0.021 Sum_probs=25.2
Q ss_pred ceeeEeeeccc-hHHHHHHHHHHHHhcCCCCeEE
Q psy732 65 IYNYDISLSLK-DFYSIIPIMKERLKDQPVVTVC 97 (119)
Q Consensus 65 ~~~~DvaVP~~-~l~~~i~~~~~~~~~~p~~~~~ 97 (119)
....-+.+|.+ .+.++-+.++++.++. ++.+.
T Consensus 40 ~~~~~v~~p~~~~~~~l~~~l~~l~~~l-~l~i~ 72 (75)
T cd04870 40 SLGILVQIPDSADSEALLKDLLFKAHEL-GLQVR 72 (75)
T ss_pred EEEEEEEcCCCCCHHHHHHHHHHHHHHc-CceEE
Confidence 34557778877 8899999999999888 87764
No 78
>TIGR03767 P_acnes_RR metallophosphoesterase, PPA1498 family. This model describes a small collection of probable metallophosphoresterases, related to pfam00149 but with long inserts separating some of the shared motifs such that the homology is apparent only through multiple sequence alignment. Members of this protein family, in general, have a Sec-independent TAT (twin-arginine translocation) signal sequence, N-terminal to the region modeled by this alignment. Members include YP_056203.1 from Propionibacterium acnes KPA171202.
Probab=26.89 E-value=61 Score=26.60 Aligned_cols=21 Identities=24% Similarity=0.248 Sum_probs=17.5
Q ss_pred HHHHHHHhcCCCCeEEeeeee
Q psy732 82 PIMKERLKDQPVVTVCGYGHL 102 (119)
Q Consensus 82 ~~~~~~~~~~p~~~~~~~GH~ 102 (119)
+++.++++++|++..+++||.
T Consensus 369 ~eLldLL~~ypnV~aVfsGHv 389 (496)
T TIGR03767 369 TELVSLLLEHPNVLAWVNGHT 389 (496)
T ss_pred HHHHHHHhcCCCceEEEECCc
Confidence 467777888878999999996
No 79
>PF00586 AIRS: AIR synthase related protein, N-terminal domain; InterPro: IPR000728 This family includes Hydrogen expression/formation protein, HypE, which may be involved in the maturation of NifE hydrogenase; AIR synthase and FGAM synthase, which are involved in de novo purine biosynthesis; and selenide, water dikinase, an enzyme which synthesizes selenophosphate from selenide and ATP.; GO: 0003824 catalytic activity; PDB: 3VIU_A 2Z1T_A 2Z1U_A 3C9U_B 3C9S_A 3C9R_A 1VQV_A 3C9T_B 3M84_A 3QTY_A ....
Probab=26.59 E-value=1.5e+02 Score=17.83 Aligned_cols=52 Identities=12% Similarity=0.064 Sum_probs=33.8
Q ss_pred HHHhhhhcHHHhhc---CC-C--ceeeEeeecc----chHHHHHHHHHHHHhcCCCCeEEeeee
Q psy732 48 IWPLRERIVESCLK---DG-Y--IYNYDISLSL----KDFYSIIPIMKERLKDQPVVTVCGYGH 101 (119)
Q Consensus 48 lW~~R~~~~~~~~~---~~-~--~~~~DvaVP~----~~l~~~i~~~~~~~~~~p~~~~~~~GH 101 (119)
.|-.+.++..+++. .| . .+...+.+|. +.+.++++.+.+.++++ ++.++ -||
T Consensus 34 ~~~g~~av~~~~sDl~a~Ga~P~~~~~~l~~~~~~~~~~l~~~~~Gi~~~~~~~-g~~iv-GG~ 95 (96)
T PF00586_consen 34 YDGGWKAVAEALSDLAAMGAKPLAILDSLGLPNPESPEELKEIVKGIAEACREF-GIPIV-GGD 95 (96)
T ss_dssp HHHHHHHHHHHHHHHHTTTEEEEEEEEEEEESTTSBHHHHHHHHHHHHHHHHHH-T-EEE-EEE
T ss_pred HHHHHHHHHHHHHHHHhcCCeeeEEEEEEEcCCCCCHHHHHHHHHHHHHHHHHh-CCcEe-CcC
Confidence 33445555555443 22 1 3667788874 58999999999999999 88664 455
No 80
>PF00487 FA_desaturase: Fatty acid desaturase This entry is only a subset of the Pfam family.; InterPro: IPR005804 Fatty acid desaturases are enzymes that catalyse the insertion of a double bond at the delta position of fatty acids. There seem to be two distinct families of fatty acid desaturases which do not seem to be evolutionary related. Family 1 is composed of: Stearoyl-CoA desaturase (SCD) (1.14.19.1 from EC) []. Family 2 is composed of: Bacterial fatty acid desaturases. Plant stearoyl-acyl-carrier-protein desaturase (1.14.19.1 from EC) [], this enzyme catalyzes the introduction of a double bond at the delta(9) position of steraoyl-ACP to produce oleoyl-ACP. This enzyme is responsible for the conversion of saturated fatty acids to unsaturated fatty acids in the synthesis of vegetable oils. Cyanobacterial DesA [], an enzyme that can introduce a second cis double bond at the delta(12) position of fatty acid bound to membranes glycerolipids. DesA is involved in chilling tolerance; the phase transition temperature of lipids of cellular membranes being dependent on the degree of unsaturation of fatty acids of the membrane lipids. This entry contains fatty acid desaturases belonging to Family 1. ; GO: 0006629 lipid metabolic process
Probab=26.57 E-value=59 Score=22.39 Aligned_cols=30 Identities=17% Similarity=0.121 Sum_probs=25.7
Q ss_pred EeeeccchHHHHHHHHHHHHhcCCCCeEEee
Q psy732 69 DISLSLKDFYSIIPIMKERLKDQPVVTVCGY 99 (119)
Q Consensus 69 DvaVP~~~l~~~i~~~~~~~~~~p~~~~~~~ 99 (119)
.-.||..+++++-+.+++.++++ +++...|
T Consensus 227 ~P~vp~~~l~~~~~~~~~~~~~~-~~~y~~~ 256 (257)
T PF00487_consen 227 FPGVPWYNLPEAHPILKEVCPEY-GVPYNEF 256 (257)
T ss_pred CCCcCHHHHHHHHHHHHHHHHHc-CCceecC
Confidence 34899999999999999999999 8776554
No 81
>TIGR03759 conj_TIGR03759 integrating conjugative element protein, PFL_4693 family. Members of this protein family, such as model protein PFL_4693 from Pseudomonas fluorescens Pf-5, belong to extended genomic regions that appear to be spread by conjugative transfer. Most members have a predicted N-terminal signal sequence. The function is unknown.
Probab=26.53 E-value=1.2e+02 Score=21.78 Aligned_cols=38 Identities=16% Similarity=0.265 Sum_probs=28.2
Q ss_pred HhhcCcccceeecC--CHHHHHHHHHhhhhcHHHhhcCCC
Q psy732 27 GIENNIILDAVMCS--EESKIQKIWPLRERIVESCLKDGY 64 (119)
Q Consensus 27 ~~~~~~~~~~~~a~--~~~e~~~lW~~R~~~~~~~~~~~~ 64 (119)
+...+...|+++.. ....+=..|+.++.|.+...+.+.
T Consensus 130 l~a~~~~~Diylvgs~~dD~~Ir~WA~~~~Idp~~V~~~~ 169 (200)
T TIGR03759 130 LLADNAPLDLYLVGSQGDDERIRQWANRHQIDPAKVRSRQ 169 (200)
T ss_pred HhcCCCceeEEEecCCCCHHHHHHHHHHcCCCHHHeecCe
Confidence 34455567888776 667777889999999998766554
No 82
>KOG3125|consensus
Probab=26.03 E-value=1.1e+02 Score=22.36 Aligned_cols=27 Identities=19% Similarity=0.015 Sum_probs=20.4
Q ss_pred HHHHHHHHHHHHhcCCCCeEEeeeeeCC
Q psy732 77 FYSIIPIMKERLKDQPVVTVCGYGHLER 104 (119)
Q Consensus 77 l~~~i~~~~~~~~~~p~~~~~~~GH~GD 104 (119)
+++++++.++.++.. |..++++|=-||
T Consensus 116 f~dl~efc~evAd~~-Gk~VivagLdgd 142 (234)
T KOG3125|consen 116 FGDLYEFCREVADVH-GKTVIVAGLDGD 142 (234)
T ss_pred hHHHHHHHHHHHhcc-CCEEEEEecCCc
Confidence 456667777777778 899999987665
No 83
>PRK10510 putative outer membrane lipoprotein; Provisional
Probab=25.85 E-value=1.2e+02 Score=21.90 Aligned_cols=35 Identities=14% Similarity=-0.001 Sum_probs=26.5
Q ss_pred EeeeccchHHHHHHHHHHHHhcCCCCeEEeeeeeC
Q psy732 69 DISLSLKDFYSIIPIMKERLKDQPVVTVCGYGHLE 103 (119)
Q Consensus 69 DvaVP~~~l~~~i~~~~~~~~~~p~~~~~~~GH~G 103 (119)
|-+.........+..+-..++++|...+.+.||..
T Consensus 119 ~sa~L~~~~~~~L~~ia~~L~~~p~~~I~I~GhTD 153 (219)
T PRK10510 119 SSATLKPAGANTLTGVAMVLKEYPKTAVNVVGYTD 153 (219)
T ss_pred CCcccCHHHHHHHHHHHHHHHhCCCceEEEEEecC
Confidence 34444555667778888888888888999999985
No 84
>PF09370 TIM-br_sig_trns: TIM-barrel signal transduction protein; InterPro: IPR009215 Members of this family are predicted to have a TIM barrel fold, based on PSI-BLAST analysis (iteration 4) and on SCOP prediction (using SMART). Interestingly, this novel domain also exists as an N-terminal domain of sigma54-dependent transcriptional activators (enhancer-binding proteins). Because sigma54 dependent activators typically have a three-domain structure: the variable N-terminal regulatory (activation) domain involved in signal recognition/receiving, the central AAA-type ATPase domain, and the DNA-binding domain (see PIRSF003187 from PIRSF, PIRSF005263 from PIRSF, PIRSF003184 from PIRSF, PIRSF005263 from PIRSF, IPR014443 from INTERPRO for details), the proteins of the current entry may be predicted to play a role in signal recognition/receiving and signal transduction.; PDB: 2P10_C.
Probab=25.79 E-value=3e+02 Score=20.82 Aligned_cols=75 Identities=15% Similarity=0.172 Sum_probs=36.3
Q ss_pred HHHHHhhcCcccceeecCCHHHHHHHHHhhhhcHHHhhcCC----CceeeEeeeccchHHHHHHHHHHHHhcC-CCCeEE
Q psy732 23 FLQDGIENNIILDAVMCSEESKIQKIWPLRERIVESCLKDG----YIYNYDISLSLKDFYSIIPIMKERLKDQ-PVVTVC 97 (119)
Q Consensus 23 ~~~~~~~~~~~~~~~~a~~~~e~~~lW~~R~~~~~~~~~~~----~~~~~DvaVP~~~l~~~i~~~~~~~~~~-p~~~~~ 97 (119)
++..+.+.+ .....+..+++|.+++-++--.+. +...| ..+.-+.+..++.-.+.++++.+.+.+. |++-+.
T Consensus 142 mi~~A~~~g-l~T~~yvf~~e~A~~M~~AGaDii--v~H~GlT~gG~~Ga~~~~sl~~a~~~~~~i~~aa~~v~~dii~l 218 (268)
T PF09370_consen 142 MIRKAHEKG-LFTTAYVFNEEQARAMAEAGADII--VAHMGLTTGGSIGAKTALSLEEAAERIQEIFDAARAVNPDIIVL 218 (268)
T ss_dssp HHHHHHHTT--EE--EE-SHHHHHHHHHHT-SEE--EEE-SS----------S--HHHHHHHHHHHHHHHHCC-TT-EEE
T ss_pred HHHHHHHCC-CeeeeeecCHHHHHHHHHcCCCEE--EecCCccCCCCcCccccCCHHHHHHHHHHHHHHHHHhCCCeEEE
Confidence 344445555 344445567888777665444332 11222 1245567788888888888888888888 555554
Q ss_pred eee
Q psy732 98 GYG 100 (119)
Q Consensus 98 ~~G 100 (119)
++|
T Consensus 219 ~hG 221 (268)
T PF09370_consen 219 CHG 221 (268)
T ss_dssp EEC
T ss_pred EeC
Confidence 443
No 85
>cd07378 MPP_ACP5 Homo sapiens acid phosphatase 5 and related proteins, metallophosphatase domain. Acid phosphatase 5 (ACP5) removes the mannose 6-phosphate recognition marker from lysosomal proteins. The exact site of dephosphorylation is not clear. Evidence suggests dephosphorylation may take place in a prelysosomal compartment as well as in the lysosome. ACP5 belongs to the metallophosphatase (MPP) superfamily. MPPs are functionally diverse, but all share a conserved domain with an active site consisting of two metal ions (usually manganese, iron, or zinc) coordinated with octahedral geometry by a cage of histidine, aspartate, and asparagine residues. The MPP superfamily includes: Mre11/SbcD-like exonucleases, Dbr1-like RNA lariat debranching enzymes, YfcE-like phosphodiesterases, purple acid phosphatases (PAPs), YbbF-like UDP-2,3-diacylglucosamine hydrolases, and acid sphingomyelinases (ASMases). The conserved domain is a double beta-sheet sandwich with a di-metal active site ma
Probab=25.74 E-value=75 Score=22.98 Aligned_cols=24 Identities=21% Similarity=0.121 Sum_probs=19.6
Q ss_pred HHHHHHHHHHHhcCCCCeEEeeeee
Q psy732 78 YSIIPIMKERLKDQPVVTVCGYGHL 102 (119)
Q Consensus 78 ~~~i~~~~~~~~~~p~~~~~~~GH~ 102 (119)
..+.+.+.++++++ ++..++.||.
T Consensus 187 ~~~~~~l~~l~~~~-~v~~vl~GH~ 210 (277)
T cd07378 187 SCLVDRLLPLLKKY-KVDAYLSGHD 210 (277)
T ss_pred HHHHHHHHHHHHHc-CCCEEEeCCc
Confidence 34567788888999 8999999994
No 86
>PF07892 DUF1667: Protein of unknown function (DUF1667); InterPro: IPR012460 Hypothetical archaeal and bacterial proteins make up this family. A few proteins are annotated as being potential metal-binding proteins, and in fact the members of this family have four highly conserved cysteine residues, but no further literature evidence was found in this regard. ; PDB: 2JOV_A.
Probab=25.42 E-value=64 Score=19.81 Aligned_cols=26 Identities=4% Similarity=0.079 Sum_probs=20.1
Q ss_pred ceeeEeeeccchHHHHHHHHHHHHhc
Q psy732 65 IYNYDISLSLKDFYSIIPIMKERLKD 90 (119)
Q Consensus 65 ~~~~DvaVP~~~l~~~i~~~~~~~~~ 90 (119)
+..++-.+|.+++.++++.++++--+
T Consensus 33 pVkT~~pIPk~~i~e~m~~i~~i~v~ 58 (82)
T PF07892_consen 33 PVKTSKPIPKEKIFEVMEEIKKIEVK 58 (82)
T ss_dssp EEEEEEEE-HHHHHHHHHHHTT-EE-
T ss_pred EEecCCCCCHHHHHHHHHHHhCCEEc
Confidence 47899999999999999999876433
No 87
>PF02769 AIRS_C: AIR synthase related protein, C-terminal domain; InterPro: IPR010918 This entry includes Hydrogen expression/formation protein, HypE, which may be involved in the maturation of NifE hydrogenase; AIR synthase and FGAM synthase, which are involved in de novo purine biosynthesis; and selenide, water dikinase, an enzyme which synthesizes selenophosphate from selenide and ATP.; PDB: 2Z1F_A 2Z1E_A 2YXZ_D 2V9Y_A 2HS0_A 2HRU_A 2HS3_A 3D54_A 1VK3_A 2HS4_A ....
Probab=25.33 E-value=1.3e+02 Score=19.39 Aligned_cols=33 Identities=12% Similarity=0.167 Sum_probs=24.0
Q ss_pred eeeccchHHHHHHHHHHHHhcCCCC-eEEeeeeeCCCCc
Q psy732 70 ISLSLKDFYSIIPIMKERLKDQPVV-TVCGYGHLERLKD 107 (119)
Q Consensus 70 vaVP~~~l~~~i~~~~~~~~~~p~~-~~~~~GH~GDGNl 107 (119)
++||.++..++++ .++++ ++ .+...|++..++=
T Consensus 114 ~~v~~~~~~~~~~----~~~~~-g~~~~~~IG~V~~~~~ 147 (153)
T PF02769_consen 114 VAVPPEDAEEFLA----ALKKA-GIPNATVIGEVTEGPG 147 (153)
T ss_dssp EEEEGGGHHHHHH----HHHHT-TCTTEEEEEEEESSSS
T ss_pred EEEcHHHHHHHHH----HHHhC-CCCCEEEEEEEEcCCC
Confidence 6778887755554 45556 88 6999999988763
No 88
>TIGR01677 pln_FAD_oxido plant-specific FAD-dependent oxidoreductase. This model represents an uncharacterized plant-specific family of FAD-dependent oxidoreductases. At least seven distinct members are found in Arabidopsis thaliana. The family shows considerable sequence similarity to three different enzymes of ascorbic acid biosynthesis: L-galactono-1,4-lactone dehydrogenase (EC 1.3.2.3) from higher plants, D-arabinono-1,4-lactone oxidase (EC 1.1.3.37 from Saccharomyces cerevisiae, and L-gulonolactone oxidase (EC 1.1.3.8) from mouse, as well as to a bacterial sorbitol oxidase. The class of compound acted on by members of this family is unknown.
Probab=25.23 E-value=1.2e+02 Score=25.17 Aligned_cols=23 Identities=13% Similarity=0.183 Sum_probs=21.5
Q ss_pred EeeeccchHHHHHHHHHHHHhcC
Q psy732 69 DISLSLKDFYSIIPIMKERLKDQ 91 (119)
Q Consensus 69 DvaVP~~~l~~~i~~~~~~~~~~ 91 (119)
..+||++++++.+++++++.++.
T Consensus 373 EyavP~e~~~~al~~vr~~~~~~ 395 (557)
T TIGR01677 373 TLSVPVSRFRDFVLDVKRLRDME 395 (557)
T ss_pred EEEeeHHHHHHHHHHHHHHHHhc
Confidence 67999999999999999999876
No 89
>TIGR00106 uncharacterized protein, MTH1187 family. This protein has been crystallized in both Methanobacterium thermoautotrophicum and yeast, but its function remains unknown. Both crystal structures showed sulfate ions bound at the interface of two dimers to form a tetramer.
Probab=25.09 E-value=1.1e+02 Score=19.24 Aligned_cols=25 Identities=16% Similarity=0.150 Sum_probs=19.4
Q ss_pred eccc----hHHHHHHHHHHHHhcCCCCeEE
Q psy732 72 LSLK----DFYSIIPIMKERLKDQPVVTVC 97 (119)
Q Consensus 72 VP~~----~l~~~i~~~~~~~~~~p~~~~~ 97 (119)
+|+. .+.+|++++.+.+++. |++..
T Consensus 8 ~P~g~~~~s~s~yVa~~i~~l~~s-Gl~y~ 36 (97)
T TIGR00106 8 IPIGTVGASVSSYVAAAIEVLKES-GLKYE 36 (97)
T ss_pred eecCCCCCcHHHHHHHHHHHHHHc-CCCeE
Confidence 4555 7888999999999887 77654
No 90
>PF14304 CSTF_C: Transcription termination and cleavage factor C-terminal; PDB: 2J8P_A.
Probab=24.95 E-value=44 Score=18.25 Aligned_cols=18 Identities=22% Similarity=0.383 Sum_probs=14.0
Q ss_pred cCCHHHHHHHHHhhhhcH
Q psy732 39 CSEESKIQKIWPLRERIV 56 (119)
Q Consensus 39 a~~~~e~~~lW~~R~~~~ 56 (119)
+--++++..+|.+|..+.
T Consensus 24 ~LPp~qR~~I~~Lr~ql~ 41 (46)
T PF14304_consen 24 ALPPDQRQQILQLRQQLM 41 (46)
T ss_dssp TS-HHHHTHHHHHHHHHH
T ss_pred hCCHHHHHHHHHHHHHHH
Confidence 457899999999998654
No 91
>PLN02282 phosphoglycerate kinase
Probab=24.80 E-value=1e+02 Score=24.57 Aligned_cols=36 Identities=19% Similarity=0.266 Sum_probs=29.5
Q ss_pred eeeEeeecc---------chHHHHHHHHHHHHhcCCCCeEEeeeeeC
Q psy732 66 YNYDISLSL---------KDFYSIIPIMKERLKDQPVVTVCGYGHLE 103 (119)
Q Consensus 66 ~~~DvaVP~---------~~l~~~i~~~~~~~~~~p~~~~~~~GH~G 103 (119)
+--|+-||+ .++...+..++.++++ |.++++..|.|
T Consensus 22 vRvD~NvPi~~~g~I~dd~RI~a~lpTI~~l~~~--gakvVl~SHlG 66 (401)
T PLN02282 22 VRVDLNVPLDDNSNITDDTRIRAAVPTIKYLMGH--GARVILCSHLG 66 (401)
T ss_pred EEeecCCccCCCCcccCcHHHHHHHHHHHHHHHC--CCeEEEEecCC
Confidence 556888886 3578888888888877 78999999999
No 92
>PF01510 Amidase_2: N-acetylmuramoyl-L-alanine amidase; InterPro: IPR002502 Proteins families containing this domain include zinc amidases that have N-acetylmuramoyl-L-alanine amidase activity 3.5.1.28 from EC This enzyme domain cleaves the amide bond between N-acetylmuramoyl and L-amino acids in bacterial cell walls (preferentially: D-lactyl-L-Ala). The structure is known for the Bacteriophage T7 structure and shows that two of the conserved histidines are zinc binding.; GO: 0008745 N-acetylmuramoyl-L-alanine amidase activity, 0009253 peptidoglycan catabolic process; PDB: 2RKQ_A 2EAV_A 2EAX_B 2XZ4_A 3TRU_A 3NKW_B 3QS0_B 3NW3_C 2Z9N_C 3NNO_B ....
Probab=24.51 E-value=54 Score=20.77 Aligned_cols=33 Identities=9% Similarity=0.035 Sum_probs=26.9
Q ss_pred EeeeccchHHHHHHHHHHHHhcCCCCeEE--eeeee
Q psy732 69 DISLSLKDFYSIIPIMKERLKDQPVVTVC--GYGHL 102 (119)
Q Consensus 69 DvaVP~~~l~~~i~~~~~~~~~~p~~~~~--~~GH~ 102 (119)
+...+..++..+.+-++.+++++ ++... ++||-
T Consensus 85 ~~~~t~~Q~~a~~~L~~~l~~~~-~i~~~~~I~~H~ 119 (132)
T PF01510_consen 85 DDPPTDAQIEALARLIADLCRRY-GIPPNRGIVGHR 119 (132)
T ss_dssp SSHHHHHHHHHHHHHHHHHHHTT-SEEEEEEEEEHH
T ss_pred CCCCCHHHHHHHHHHHHHHHHHC-CcCCCCEEeeHH
Confidence 67788888888888888888999 77764 88884
No 93
>PRK05731 thiamine monophosphate kinase; Provisional
Probab=24.39 E-value=1.4e+02 Score=22.42 Aligned_cols=33 Identities=9% Similarity=0.008 Sum_probs=24.8
Q ss_pred EeeeccchHHHHHHHHHHHHhcCCCCeEEeeeeeCCCC
Q psy732 69 DISLSLKDFYSIIPIMKERLKDQPVVTVCGYGHLERLK 106 (119)
Q Consensus 69 DvaVP~~~l~~~i~~~~~~~~~~p~~~~~~~GH~GDGN 106 (119)
=++||.++..++.+ .+++. ++.+...|++-+|+
T Consensus 267 l~~v~~~~~~~~~~----~l~~~-g~~~~~IG~v~~~~ 299 (318)
T PRK05731 267 LFTFPPENRGALLA----AAGHL-GVGVTIIGRVTEGE 299 (318)
T ss_pred EEEECHHHHHHHHH----HHHhc-CCCceEEEEEEcCc
Confidence 35778888766544 45557 88999999999887
No 94
>PF13692 Glyco_trans_1_4: Glycosyl transferases group 1; PDB: 3OY2_A 3OY7_B 2Q6V_A 2HY7_A 3CV3_A 3CUY_A.
Probab=24.37 E-value=1.3e+02 Score=18.63 Aligned_cols=30 Identities=13% Similarity=0.161 Sum_probs=23.2
Q ss_pred ccchHHHHHH-HHHHHHhcCCCCeEEeeeee
Q psy732 73 SLKDFYSIIP-IMKERLKDQPVVTVCGYGHL 102 (119)
Q Consensus 73 P~~~l~~~i~-~~~~~~~~~p~~~~~~~GH~ 102 (119)
|...+..+++ -++++.++.|+++..++|..
T Consensus 13 ~~k~~~~li~~~~~~l~~~~p~~~l~i~G~~ 43 (135)
T PF13692_consen 13 PDKGLEELIEAALERLKEKHPDIELIIIGNG 43 (135)
T ss_dssp GGGTHHHHHH-HHHHHHHHSTTEEEEEECES
T ss_pred ccccccchhhhHHHHHHHHCcCEEEEEEeCC
Confidence 5567778888 67777777788999998883
No 95
>cd00519 Lipase_3 Lipase (class 3). Lipases are esterases that can hydrolyze long-chain acyl-triglycerides into di- and monoglycerides, glycerol, and free fatty acids at a water/lipid interface. A typical feature of lipases is "interfacial activation," the process of becoming active at the lipid/water interface, although several examples of lipases have been identified that do not undergo interfacial activation . The active site of a lipase contains a catalytic triad consisting of Ser - His - Asp/Glu, but unlike most serine proteases, the active site is buried inside the structure. A "lid" or "flap" covers the active site, making it inaccessible to solvent and substrates. The lid opens during the process of interfacial activation, allowing the lipid substrate access to the active site.
Probab=24.33 E-value=1.1e+02 Score=21.47 Aligned_cols=21 Identities=33% Similarity=0.521 Sum_probs=15.0
Q ss_pred HHHHHHHHhcCCCCeEEeeee
Q psy732 81 IPIMKERLKDQPVVTVCGYGH 101 (119)
Q Consensus 81 i~~~~~~~~~~p~~~~~~~GH 101 (119)
...+++.++++|+.++.+.||
T Consensus 115 ~~~~~~~~~~~p~~~i~vtGH 135 (229)
T cd00519 115 LPELKSALKQYPDYKIIVTGH 135 (229)
T ss_pred HHHHHHHHhhCCCceEEEEcc
Confidence 344555566677889999999
No 96
>cd00318 Phosphoglycerate_kinase Phosphoglycerate kinase (PGK) is a monomeric enzyme which catalyzes the transfer of the high-energy phosphate group of 1,3-bisphosphoglycerate to ADP, forming ATP and 3-phosphoglycerate. This reaction represents the first of the two substrate-level phosphorylation events in the glycolytic pathway. Substrate-level phosphorylation is defined as production of ATP by a process, which is catalyzed by water-soluble enzymes in the cytosol; not involving membranes and ion gradients.
Probab=24.31 E-value=1e+02 Score=24.50 Aligned_cols=36 Identities=19% Similarity=0.281 Sum_probs=28.8
Q ss_pred eeeEeeecc--------chHHHHHHHHHHHHhcCCCCeEEeeeeeC
Q psy732 66 YNYDISLSL--------KDFYSIIPIMKERLKDQPVVTVCGYGHLE 103 (119)
Q Consensus 66 ~~~DvaVP~--------~~l~~~i~~~~~~~~~~p~~~~~~~GH~G 103 (119)
+--|+=||+ .++...+..++.++++ |.++++..|.|
T Consensus 13 vRvD~NvPi~~g~I~dd~RI~a~lpTI~~l~~~--gakvvl~SHlG 56 (397)
T cd00318 13 VRVDFNVPVDDGKITDDTRIRAALPTIKYLLEQ--GAKVVLLSHLG 56 (397)
T ss_pred EEeccCCCCcCCeECChHHHHHHHHHHHHHHHC--CCeEEEEEecC
Confidence 456777777 3678888888888877 78999999998
No 97
>PLN02934 triacylglycerol lipase
Probab=24.19 E-value=1e+02 Score=25.46 Aligned_cols=27 Identities=22% Similarity=0.406 Sum_probs=22.9
Q ss_pred chHHHHHHHHHHHHhcCCCCeEEeeee
Q psy732 75 KDFYSIIPIMKERLKDQPVVTVCGYGH 101 (119)
Q Consensus 75 ~~l~~~i~~~~~~~~~~p~~~~~~~GH 101 (119)
+...++...++++++++|+.++++.||
T Consensus 302 ~Ay~~v~~~lk~ll~~~p~~kIvVTGH 328 (515)
T PLN02934 302 SAYYAVRSKLKSLLKEHKNAKFVVTGH 328 (515)
T ss_pred hHHHHHHHHHHHHHHHCCCCeEEEecc
Confidence 555678888888888888899999999
No 98
>PF00162 PGK: Phosphoglycerate kinase; InterPro: IPR001576 Phosphoglycerate kinase (2.7.2.3 from EC) (PGK) is an enzyme that catalyses the formation of ATP to ADP and vice versa. In the second step of the second phase in glycolysis, 1,3-diphosphoglycerate is converted to 3-phosphoglycerate, forming one molecule of ATP. If the reverse were to occur, one molecule of ADP would be formed. This reaction is essential in most cells for the generation of ATP in aerobes, for fermentation in anaerobes and for carbon fixation in plants. PGK is found in all living organisms and its sequence has been highly conserved throughout evolution. The enzyme exists as a monomer containing two nearly equal-sized domains that correspond to the N- and C-termini of the protein (the last 15 C-terminal residues loop back into the N-terminal domain). 3-phosphoglycerate (3-PG) binds to the N-terminal, while the nucleotide substrates, MgATP or MgADP, bind to the C-terminal domain of the enzyme. This extended two-domain structure is associated with large-scale 'hinge-bending' conformational changes, similar to those found in hexokinase []. At the core of each domain is a 6-stranded parallel beta-sheet surrounded by alpha helices. Domain 1 has a parallel beta-sheet of six strands with an order of 342156, while domain 2 has a parallel beta-sheet of six strands with an order of 321456. Analysis of the reversible unfolding of yeast phosphoglycerate kinase leads to the conclusion that the two lobes are capable of folding independently, consistent with the presence of intermediates on the folding pathway with a single domain folded []. Phosphoglycerate kinase (PGK) deficiency is associated with haemolytic anaemia and mental disorders in man []. This group represents a phosphoglycerate kinase.; GO: 0004618 phosphoglycerate kinase activity, 0006096 glycolysis; PDB: 1PHP_A 1V6S_A 2IE8_A 1ZMR_A 16PK_A 13PK_B 2P9Q_A 2P9T_A 2PAA_B 3OZA_A ....
Probab=24.15 E-value=99 Score=24.47 Aligned_cols=37 Identities=22% Similarity=0.319 Sum_probs=23.4
Q ss_pred eeeEeeecc--------chHHHHHHHHHHHHhcCCCCeEEeeeeeCC
Q psy732 66 YNYDISLSL--------KDFYSIIPIMKERLKDQPVVTVCGYGHLER 104 (119)
Q Consensus 66 ~~~DvaVP~--------~~l~~~i~~~~~~~~~~p~~~~~~~GH~GD 104 (119)
+--|.=||+ .+|...+..++.++++ |.++++..|.|.
T Consensus 14 vRvD~NvPi~~g~I~Dd~RI~~~lpTI~~l~~~--gakvVl~sH~GR 58 (384)
T PF00162_consen 14 VRVDFNVPIKNGKITDDTRIRAALPTIKYLLEK--GAKVVLMSHLGR 58 (384)
T ss_dssp EEE-----EETTEES-THHHHHHHHHHHHHHHT--TEEEEEE---SS
T ss_pred EEeCCCCCcCCCcCCCcchHHHHHHHHHHHHhc--CCeEEEEeccCC
Confidence 445777777 5788888888888877 899999999983
No 99
>PRK13820 argininosuccinate synthase; Provisional
Probab=24.01 E-value=99 Score=24.54 Aligned_cols=29 Identities=14% Similarity=0.028 Sum_probs=23.4
Q ss_pred HHHHHHHHhcCCCCeEEeeeeeCCCCccee
Q psy732 81 IPIMKERLKDQPVVTVCGYGHLERLKDQPV 110 (119)
Q Consensus 81 i~~~~~~~~~~p~~~~~~~GH~GDGNlH~~ 110 (119)
...+.+.+++. |...+..||-|+||=..+
T Consensus 98 ~~~l~e~A~e~-G~~~IA~G~t~~gnDq~r 126 (394)
T PRK13820 98 AEKIVEVAEKE-GASAIAHGCTGKGNDQLR 126 (394)
T ss_pred HHHHHHHHHHc-CCCEEEECCCCCcchHHH
Confidence 34566667788 999999999999997763
No 100
>PF10113 Fibrillarin_2: Fibrillarin-like archaeal protein; InterPro: IPR016760 Members of this protein family are HmdC, whose gene regularly occurs in the context of genes for HmdA (5,10-methenyltetrahydromethanopterin hydrogenase) and the radical SAM protein HmdB involved in biosynthesis of the HmdA cofactor. Bioinformatics suggests this protein, a homologue of eukaryotic fibrillarin, may be involved in biosynthesis of the guanylyl pyridinol cofactor in HmdA.
Probab=23.94 E-value=83 Score=25.50 Aligned_cols=34 Identities=12% Similarity=0.059 Sum_probs=24.3
Q ss_pred eeEeeeccchHHHHHHHHHHHHhcC-CCCeEEeeeeeCCCC
Q psy732 67 NYDISLSLKDFYSIIPIMKERLKDQ-PVVTVCGYGHLERLK 106 (119)
Q Consensus 67 ~~DvaVP~~~l~~~i~~~~~~~~~~-p~~~~~~~GH~GDGN 106 (119)
.+.=+-|++.+.+..+.. ++| .|+.+++ |+|||-
T Consensus 198 ~e~HaApldE~~~Va~~A----kk~gkGveaI~--~vGDGy 232 (505)
T PF10113_consen 198 GEEHAAPLDEMEEVAELA----KKYGKGVEAIM--HVGDGY 232 (505)
T ss_pred CcccCCCHHHHHHHHHHH----HHhCCCceEEE--EecCCh
Confidence 567778888877655544 445 4788887 999984
No 101
>cd01839 SGNH_arylesterase_like SGNH_hydrolase subfamily, similar to arylesterase (7-aminocephalosporanic acid-deacetylating enzyme) of A. tumefaciens. SGNH hydrolases are a diverse family of lipases and esterases. The tertiary fold of the enzyme is substantially different from that of the alpha/beta hydrolase family and unique among all known hydrolases; its active site closely resembles the Ser-His-Asp(Glu) triad found in other serine hydrolases.
Probab=23.78 E-value=67 Score=22.11 Aligned_cols=33 Identities=3% Similarity=-0.136 Sum_probs=21.3
Q ss_pred hHHHHHHHHHHHHhcCCCCeEEeeeee-----CCCCccee
Q psy732 76 DFYSIIPIMKERLKDQPVVTVCGYGHL-----ERLKDQPV 110 (119)
Q Consensus 76 ~l~~~i~~~~~~~~~~p~~~~~~~GH~-----GDGNlH~~ 110 (119)
.+.+|-+.+++++++. ++..+=+.++ .|| +|++
T Consensus 153 ~~~~~~~~~~~~a~~~-~~~~iD~~~~~~~~~~DG-vH~~ 190 (208)
T cd01839 153 KSKGLADAYRALAEEL-GCHFFDAGSVGSTSPVDG-VHLD 190 (208)
T ss_pred HHHHHHHHHHHHHHHh-CCCEEcHHHHhccCCCCc-cCcC
Confidence 4677778888888887 7665544333 254 6654
No 102
>KOG3939|consensus
Probab=23.78 E-value=99 Score=23.35 Aligned_cols=46 Identities=11% Similarity=-0.018 Sum_probs=36.9
Q ss_pred eeEeeeccchHHHHHHHHHHHHhcC-CCCeEEeeeeeCCCCcceeeecCC
Q psy732 67 NYDISLSLKDFYSIIPIMKERLKDQ-PVVTVCGYGHLERLKDQPVVTVCG 115 (119)
Q Consensus 67 ~~DvaVP~~~l~~~i~~~~~~~~~~-p~~~~~~~GH~GDGNlH~~i~~~~ 115 (119)
.-=+|+|.++-+.|.+++++. ++ +| .+.+.||+--||--.+|+.+.
T Consensus 257 GlLIclP~eqaakfcaei~s~--k~geg-qAWIIGiVekgn~tArIidkp 303 (312)
T KOG3939|consen 257 GLLICLPREQAAKFCAEIKSP--KNGEG-QAWIIGIVEKGNSTARIIDKP 303 (312)
T ss_pred ceEEEccHHHHHHHHHHhcCC--ccCCC-ceEEEEEEecCCcceEecCCc
Confidence 345899999999999998863 44 46 888999999999888877543
No 103
>KOG4569|consensus
Probab=23.68 E-value=97 Score=23.76 Aligned_cols=25 Identities=12% Similarity=0.139 Sum_probs=22.4
Q ss_pred HHHHHHHHHHHhcCCCCeEEeeeee
Q psy732 78 YSIIPIMKERLKDQPVVTVCGYGHL 102 (119)
Q Consensus 78 ~~~i~~~~~~~~~~p~~~~~~~GH~ 102 (119)
..+.+.++.++..+|+.++.+.||=
T Consensus 155 ~~~~~~~~~L~~~~~~~~i~vTGHS 179 (336)
T KOG4569|consen 155 SGLDAELRRLIELYPNYSIWVTGHS 179 (336)
T ss_pred HHHHHHHHHHHHhcCCcEEEEecCC
Confidence 7788889999999999999999994
No 104
>COG3427 Carbon monoxide dehydrogenase subunit G, CoxG [Energy production and conversion]
Probab=23.53 E-value=76 Score=21.66 Aligned_cols=36 Identities=6% Similarity=0.030 Sum_probs=29.7
Q ss_pred eeEeeeccchHHHHHHHHHHHHhcCCCCeEEeeeeeCC
Q psy732 67 NYDISLSLKDFYSIIPIMKERLKDQPVVTVCGYGHLER 104 (119)
Q Consensus 67 ~~DvaVP~~~l~~~i~~~~~~~~~~p~~~~~~~GH~GD 104 (119)
.+.|++|+...=+|+..-+.+.+-.||+.. |.+.||
T Consensus 6 ~f~V~~p~e~Vw~~L~dpe~~a~ciPG~qs--~e~~g~ 41 (146)
T COG3427 6 TFRVAAPPEAVWEFLNDPEQVAACIPGVQS--VETNGD 41 (146)
T ss_pred eEEecCCHHHHHHHhcCHHHHHhhcCCcce--eeecCC
Confidence 589999999999999999998877788765 446665
No 105
>PF02257 RFX_DNA_binding: RFX DNA-binding domain; InterPro: IPR003150 RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer [].; GO: 0003677 DNA binding, 0006355 regulation of transcription, DNA-dependent; PDB: 1DP7_P 2KW3_A.
Probab=23.38 E-value=50 Score=20.37 Aligned_cols=16 Identities=19% Similarity=0.279 Sum_probs=12.4
Q ss_pred eeeEeeeccchHHHHH
Q psy732 66 YNYDISLSLKDFYSII 81 (119)
Q Consensus 66 ~~~DvaVP~~~l~~~i 81 (119)
..+|++||.+.|.+-.
T Consensus 18 ~~~~~svpR~~lY~~Y 33 (85)
T PF02257_consen 18 ESPGVSVPRSDLYAHY 33 (85)
T ss_dssp EEEEEEEEHHHHHHHH
T ss_pred cCCCCccchHHHHHHH
Confidence 5689999999885533
No 106
>COG0850 MinC Septum formation inhibitor [Cell division and chromosome partitioning]
Probab=23.29 E-value=52 Score=23.90 Aligned_cols=19 Identities=11% Similarity=-0.018 Sum_probs=14.0
Q ss_pred CCCeEEeeee-------eCCCCccee
Q psy732 92 PVVTVCGYGH-------LERLKDQPV 110 (119)
Q Consensus 92 p~~~~~~~GH-------~GDGNlH~~ 110 (119)
++-..++.|+ +.|||+|+-
T Consensus 128 ~~gDlivlG~VnpGAeIiA~GnI~Vy 153 (219)
T COG0850 128 PGGDLIVLGDVNPGAEIIADGNIHVY 153 (219)
T ss_pred eCCCEEEEccCCCCCEEEeCCcEEEE
Confidence 3557777787 578999973
No 107
>COG0011 Uncharacterized conserved protein [Function unknown]
Probab=23.20 E-value=1.3e+02 Score=19.21 Aligned_cols=22 Identities=14% Similarity=-0.028 Sum_probs=18.0
Q ss_pred hHHHHHHHHHHHHhcCCCCeEEe
Q psy732 76 DFYSIIPIMKERLKDQPVVTVCG 98 (119)
Q Consensus 76 ~l~~~i~~~~~~~~~~p~~~~~~ 98 (119)
.+.+|++++.+.++++ |++..+
T Consensus 18 svs~yVa~~i~~lk~~-glky~~ 39 (100)
T COG0011 18 SVSKYVAEAIEILKES-GLKYQL 39 (100)
T ss_pred CHHHHHHHHHHHHHHc-CCceee
Confidence 5889999999999998 776543
No 108
>PF00149 Metallophos: Calcineurin-like phosphoesterase; InterPro: IPR004843 This domain is found in a diverse range of phosphoesterases [], including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases, as well as nucleases such as bacterial SbcD or yeast MRE11. The most conserved regions in this domain centre around the metal chelating residues.; GO: 0016787 hydrolase activity; PDB: 2IAE_C 3DW8_F 3FGA_C 2IE4_C 2NYM_C 2NYL_C 3K7V_C 2NPP_C 2IE3_C 3K7W_C ....
Probab=23.11 E-value=80 Score=19.53 Aligned_cols=22 Identities=27% Similarity=0.172 Sum_probs=16.3
Q ss_pred HHHHHHHHHhcCCCCeEEeeeee
Q psy732 80 IIPIMKERLKDQPVVTVCGYGHL 102 (119)
Q Consensus 80 ~i~~~~~~~~~~p~~~~~~~GH~ 102 (119)
....+..+++++ ++..+++||.
T Consensus 176 ~~~~~~~~~~~~-~v~~~~~GH~ 197 (200)
T PF00149_consen 176 GREALEELLKKY-NVDLVLSGHT 197 (200)
T ss_dssp HHHHHHHHHHHT-TCSEEEEESS
T ss_pred cHHHHHHHHhhC-CCCEEEeCce
Confidence 344556666777 9999999995
No 109
>PTZ00422 glideosome-associated protein 50; Provisional
Probab=23.00 E-value=90 Score=24.81 Aligned_cols=23 Identities=22% Similarity=0.064 Sum_probs=20.4
Q ss_pred HHHHHHHHHHHhcCCCCeEEeeee
Q psy732 78 YSIIPIMKERLKDQPVVTVCGYGH 101 (119)
Q Consensus 78 ~~~i~~~~~~~~~~p~~~~~~~GH 101 (119)
+++...+..+++++ ++..++.||
T Consensus 234 ~~L~~~L~PLL~ky-~VdlYisGH 256 (394)
T PTZ00422 234 SYLSYYLLPLLKDA-QVDLYISGY 256 (394)
T ss_pred HHHHHHHHHHHHHc-CcCEEEEcc
Confidence 45777899999999 999999999
No 110
>smart00712 PUR DNA/RNA-binding repeats in PUR-alpha/beta/gamma and in hypothetical proteins from spirochetes and the Bacteroides-Cytophaga-Flexibacter bacteria.
Probab=22.76 E-value=1.1e+02 Score=17.65 Aligned_cols=25 Identities=4% Similarity=0.050 Sum_probs=20.9
Q ss_pred eeEeeeccchHHHHHHHHHHHHhcC
Q psy732 67 NYDISLSLKDFYSIIPIMKERLKDQ 91 (119)
Q Consensus 67 ~~DvaVP~~~l~~~i~~~~~~~~~~ 91 (119)
..=|.||.+.+.+|.+.+.+.++.+
T Consensus 38 r~~I~lp~~~~~~F~~~l~~~~~~~ 62 (63)
T smart00712 38 RSSITVPEQGAAEFRDALNKLIEKY 62 (63)
T ss_pred ceEEEEEHHHHHHHHHHHHHHHHhh
Confidence 4578999999999999998887654
No 111
>cd07396 MPP_Nbla03831 Homo sapiens Nbla03831 and related proteins, metallophosphatase domain. Nbla03831 (also known as LOC56985) is an uncharacterized Homo sapiens protein with a domain that belongs to the metallophosphatase (MPP) superfamily. MPPs are functionally diverse, but all share a conserved domain with an active site consisting of two metal ions (usually manganese, iron, or zinc) coordinated with octahedral geometry by a cage of histidine, aspartate, and asparagine residues. The MPP superfamily includes: Mre11/SbcD-like exonucleases, Dbr1-like RNA lariat debranching enzymes, YfcE-like phosphodiesterases, purple acid phosphatases (PAPs), YbbF-like UDP-2,3-diacylglucosamine hydrolases, and acid sphingomyelinases (ASMases). The conserved domain is a double beta-sheet sandwich with a di-metal active site made up of residues located at the C-terminal side of the sheets. This domain is thought to allow for productive metal coordination.
Probab=22.47 E-value=74 Score=23.21 Aligned_cols=22 Identities=23% Similarity=0.227 Sum_probs=16.2
Q ss_pred HHHHHHHhcCCCCeEEeeeeeC
Q psy732 82 PIMKERLKDQPVVTVCGYGHLE 103 (119)
Q Consensus 82 ~~~~~~~~~~p~~~~~~~GH~G 103 (119)
+.+.+++++++++..+++||.-
T Consensus 205 ~~~~~ll~~~~~V~~v~~GH~H 226 (267)
T cd07396 205 EEVLSILRAYGCVKACISGHDH 226 (267)
T ss_pred HHHHHHHHhCCCEEEEEcCCcC
Confidence 4556777776578899999963
No 112
>PF00543 P-II: Nitrogen regulatory protein P-II members of this family.; InterPro: IPR002187 In Gram-negative bacteria, the activity and concentration of glutamine synthetase (GS) is regulated in response to nitrogen source availability. PII, a tetrameric protein encoded by the glnB gene, is a component of the adenylation cascade involved in the regulation of GS activity []. In nitrogen-limiting conditions, when the ratio of glutamine to 2-ketoglutarate decreases, P-II is uridylylated on a tyrosine residue to form P-II-UMP. P-II-UMP allows the deadenylation of GS, thus activating the enzyme. Conversely, in nitrogen excess, P-II-UMP is deuridylated and then promotes the adenylation of GS. P-II also indirectly controls the transcription of the GS gene (glnA) by preventing NR-II (ntrB) to phosphorylate NR-I (ntrC) which is the transcriptional activator of glnA. Once P-II is uridylylated, these events are reversed. P-II is a protein of about 110 amino acid residues extremely well conserved. The tyrosine which is uridylated is located in the central part of the protein. In cyanobacteria, P-II seems to be phosphorylated on a serine residue rather than being uridylated. In methanogenic archaebacteria, the nitrogenase iron protein gene (nifH) is followed by two open reading frames highly similar to the eubacterial P-II protein []. These proteins could be involved in the regulation of nitrogen fixation. In the red alga, Porphyra purpurea, there is a glnB homologue encoded in the chloroplast genome. Other proteins highly similar to glnB are: Bacillus subtilis protein nrgB []. Escherichia coli hypothetical protein ybaI []. ; GO: 0030234 enzyme regulator activity, 0006808 regulation of nitrogen utilization; PDB: 1V3S_B 1V3R_C 2XZW_A 2XUL_A 2V5H_J 2XG8_C 2JJ4_F 1QY7_C 4AFF_A 2XBP_A ....
Probab=22.41 E-value=1.3e+02 Score=18.55 Aligned_cols=36 Identities=17% Similarity=0.108 Sum_probs=28.4
Q ss_pred ceeeEeeeccchHHHHHHHHHHHHhcCCCCeEEeeeeeCCCCcce
Q psy732 65 IYNYDISLSLKDFYSIIPIMKERLKDQPVVTVCGYGHLERLKDQP 109 (119)
Q Consensus 65 ~~~~DvaVP~~~l~~~i~~~~~~~~~~p~~~~~~~GH~GDGNlH~ 109 (119)
...-.+.||-++..+.++.+.+.+.. |+.|||-+=+
T Consensus 55 k~~i~ivv~d~~v~~iv~~I~~~~~t---------g~~GdGkIfV 90 (102)
T PF00543_consen 55 KVKIEIVVPDEDVEEIVEAISEAART---------GEPGDGKIFV 90 (102)
T ss_dssp EEEEEEEEEGGGHHHHHHHHHHHH-S---------SSTTSEEEEE
T ss_pred cEEEEEEECHHhHHHHHHHHHHhccC---------CCCCCEEEEE
Confidence 45679999999999999999988744 5788886544
No 113
>cd07388 MPP_Tt1561 Thermus thermophilus Tt1561 and related proteins, metallophosphatase domain. This family includes bacterial proteins related to Tt1561 (also known as Aq1956 in Aquifex aeolicus), an uncharacterized Thermus thermophilus protein. The conserved domain present in members of this family belongs to the metallophosphatase (MPP) superfamily. MPPs are functionally diverse, but all share a conserved domain with an active site consisting of two metal ions (usually manganese, iron, or zinc) coordinated with octahedral geometry by a cage of histidine, aspartate, and asparagine residues. The MPP superfamily includes: Mre11/SbcD-like exonucleases, Dbr1-like RNA lariat debranching enzymes, YfcE-like phosphodiesterases, purple acid phosphatases (PAPs), YbbF-like UDP-2,3-diacylglucosamine hydrolases, and acid sphingomyelinases (ASMases). The conserved domain is a double beta-sheet sandwich with a di-metal active site made up of residues located at the C-terminal side of the sheets,
Probab=22.30 E-value=68 Score=23.23 Aligned_cols=21 Identities=14% Similarity=-0.034 Sum_probs=15.6
Q ss_pred HHHHHHHhcCCCCeEEeeeeeC
Q psy732 82 PIMKERLKDQPVVTVCGYGHLE 103 (119)
Q Consensus 82 ~~~~~~~~~~p~~~~~~~GH~G 103 (119)
..++++++++ .-+.+++||++
T Consensus 169 ~alr~~I~~~-~P~l~i~GHih 189 (224)
T cd07388 169 HEVAHLIKTH-NPLVVLVGGKG 189 (224)
T ss_pred HHHHHHHHHh-CCCEEEEcCCc
Confidence 4455666777 66788999998
No 114
>PF02579 Nitro_FeMo-Co: Dinitrogenase iron-molybdenum cofactor; InterPro: IPR003731 This entry represents several Nif (B, X and Y) proteins, which are involved in the biosynthesis of the iron-molybdenum cofactor (FeMo-co) found in the dinitrogenase enzyme of the nitrogenase complex in nitrogen-fixing bacteria. The nitrogenase complex catalyses the reduction of atmospheric dinitrogen to ammonia, and is composed of an iron metalloprotein (dinitrogenase reductase; homodimer of NifH; IPR000392 from INTERPRO) and a Fe-Mo metalloprotein (dinitrogenase; heterotetramer of NifD and NifK; IPR000318 from INTERPRO). The pathway for the synthesis of the Fe-Mo cofactor involves several proteins, including NifB, NifE, NifH, NifN, NifQ, NifV and NifX. NifB appears to be an iron-sulphur source for FeMo-co biosynthesis, while NifX may be associated with the mature FeMo-co, in particular with the addition of homocitrate during the last step of biosynthesis []. The NifX protein shows sequence similarity with the C terminus of NifB [], as well as to the conserved protein MTH1175 from the archaeon Methanobacterium thermoautotrophicum, which displays a ribonuclease H-like motif of three layers, alpha/beta/alpha, with a single mixed beta-sheet [].; PDB: 2QTD_A 2KLA_A 1EO1_A 1P90_A 1RDU_A 2YX6_D 1O13_A 1T3V_A 2RE2_B 2WFB_A.
Probab=22.26 E-value=39 Score=20.09 Aligned_cols=21 Identities=19% Similarity=0.088 Sum_probs=15.5
Q ss_pred HHHHhcCCCCeEEeeeeeCCCC
Q psy732 85 KERLKDQPVVTVCGYGHLERLK 106 (119)
Q Consensus 85 ~~~~~~~p~~~~~~~GH~GDGN 106 (119)
.+.+... ++.+++.|++|.+-
T Consensus 46 ~~~l~~~-~v~~li~~~iG~~~ 66 (94)
T PF02579_consen 46 AKFLAEE-GVDVLICGGIGEGA 66 (94)
T ss_dssp HHHHHHT-TESEEEESCSCHHH
T ss_pred HHHHHHc-CCCEEEEeCCCHHH
Confidence 3444446 99999999999763
No 115
>TIGR03768 RPA4764 metallophosphoesterase, RPA4764 family. This model describes a small collection of probable metallophosphoresterases, related to pfam00149. Members of this protein family usually have a Sec-independent TAT (twin-arginine translocation) signal sequence, N-terminal to the region modeled by this alignment. This model and TIGR03767 divide a narrow clade of pfam00149-related enzymes.
Probab=22.02 E-value=76 Score=26.01 Aligned_cols=27 Identities=15% Similarity=0.075 Sum_probs=21.5
Q ss_pred HHHHHHHhcCCCCeEEeeeeeCCCCcceeeec
Q psy732 82 PIMKERLKDQPVVTVCGYGHLERLKDQPVVTV 113 (119)
Q Consensus 82 ~~~~~~~~~~p~~~~~~~GH~GDGNlH~~i~~ 113 (119)
.++.++++++|++..++.||. |.++..
T Consensus 388 ~eLlaLL~~hPnVla~LsGHv-----Hrn~v~ 414 (492)
T TIGR03768 388 TGLVTTLQKYPNLLMWIAGHR-----HLNTVK 414 (492)
T ss_pred HHHHHHHhcCCCeEEEEcCCc-----cccccc
Confidence 478888899999999999996 655543
No 116
>COG1458 Predicted DNA-binding protein containing PIN domain [General function prediction only]
Probab=21.93 E-value=63 Score=23.40 Aligned_cols=55 Identities=16% Similarity=0.277 Sum_probs=36.1
Q ss_pred cceeecCCHHHHHHHHHh--hhhcHHHh-hcC------CCceeeEeeeccchHHHHHHHHHHHHhc
Q psy732 34 LDAVMCSEESKIQKIWPL--RERIVESC-LKD------GYIYNYDISLSLKDFYSIIPIMKERLKD 90 (119)
Q Consensus 34 ~~~~~a~~~~e~~~lW~~--R~~~~~~~-~~~------~~~~~~DvaVP~~~l~~~i~~~~~~~~~ 90 (119)
..+.+. +.-...+-.. |.++.+-+ .+. .++-.+++-||..-+.+|+..+++.+++
T Consensus 47 iscyiP--PsVY~El~~fm~r~gc~~e~~~ki~twivkKtPnryevkiPa~ifyeyV~diR~Rink 110 (221)
T COG1458 47 ISCYIP--PSVYRELMGFMERNGCPEEVIAKIETWIVKKTPNRYEVKIPAAIFYEYVEDIRERINK 110 (221)
T ss_pred eEEEeC--hHHHHHHHHHHHhCCCcHHHHHhhheeeEecCCCceeecCcHHHHHHHHHHHHHHHHh
Confidence 444443 5555555444 55565422 221 1234689999999999999999999887
No 117
>TIGR00853 pts-lac PTS system, lactose/cellobiose family IIB component. Bacterial PTS transporters transport and concomitantly phosphorylate their sugar substrates, and typically consist of multiple subunits or protein domains.The Lac family includes several sequenced lactose (b-galactoside) permeases of Gram-positive bacteria as well as those in E. coli. While the Lac family usually consists of two polypeptide components IIA and IICB, the Chb permease of E. coli consists of three IIA, IIB and IIC. This family is specific for the IIB subunit of the Lac PTS family.
Probab=21.81 E-value=1.5e+02 Score=18.28 Aligned_cols=29 Identities=17% Similarity=0.229 Sum_probs=24.8
Q ss_pred eEeeeccchHHHHHHHHHHHHhcCCCCeEE
Q psy732 68 YDISLSLKDFYSIIPIMKERLKDQPVVTVC 97 (119)
Q Consensus 68 ~DvaVP~~~l~~~i~~~~~~~~~~p~~~~~ 97 (119)
.|+.+--.++...+.++++.++++ ++.+.
T Consensus 51 ~Dvill~pqi~~~~~~i~~~~~~~-~ipv~ 79 (95)
T TIGR00853 51 ADVVLLAPQVAYMLPDLKKETDKK-GIPVE 79 (95)
T ss_pred CCEEEECchHHHHHHHHHHHhhhc-CCCEE
Confidence 499999999999999999999887 54443
No 118
>PRK05340 UDP-2,3-diacylglucosamine hydrolase; Provisional
Probab=21.76 E-value=75 Score=22.77 Aligned_cols=20 Identities=15% Similarity=0.127 Sum_probs=15.3
Q ss_pred HHHHHHhcCCCCeEEeeeeeC
Q psy732 83 IMKERLKDQPVVTVCGYGHLE 103 (119)
Q Consensus 83 ~~~~~~~~~p~~~~~~~GH~G 103 (119)
.+.+.++++ +...+++||.=
T Consensus 178 ~~~~~~~~~-~~~~~i~GH~H 197 (241)
T PRK05340 178 AVAALMEKH-GVDTLIHGHTH 197 (241)
T ss_pred HHHHHHHHh-CCCEEEECccc
Confidence 355666777 88999999973
No 119
>cd06583 PGRP Peptidoglycan recognition proteins (PGRPs) are pattern recognition receptors that bind, and in certain cases, hydrolyze peptidoglycans (PGNs) of bacterial cell walls. PGRPs have been divided into three classes: short PGRPs (PGRP-S), that are small (20 kDa) extracellular proteins; intermediate PGRPs (PGRP-I) that are 40-45 kDa and are predicted to be transmembrane proteins; and long PGRPs (PGRP-L), up to 90 kDa, which may be either intracellular or transmembrane. Several structures of PGRPs are known in insects and mammals, some bound with substrates like Muramyl Tripeptide (MTP) or Tracheal Cytotoxin (TCT). The substrate binding site is conserved in PGRP-LCx, PGRP-LE, and PGRP-Ialpha proteins. This family includes Zn-dependent N-Acetylmuramoyl-L-alanine Amidase, EC:3.5.1.28. This enzyme cleaves the amide bond between N-acetylmuramoyl and L-amino acids, preferentially D-lactyl-L-Ala, in bacterial cell walls. The structure for the bacteriophage T7 lysozyme shows that two of
Probab=21.48 E-value=1.1e+02 Score=18.98 Aligned_cols=31 Identities=10% Similarity=0.060 Sum_probs=26.7
Q ss_pred eeccchHHHHHHHHHHHHhcCCCC--eEEeeeee
Q psy732 71 SLSLKDFYSIIPIMKERLKDQPVV--TVCGYGHL 102 (119)
Q Consensus 71 aVP~~~l~~~i~~~~~~~~~~p~~--~~~~~GH~ 102 (119)
..+..++..+..-++.+++++ ++ ...++||-
T Consensus 82 ~~~~~q~~~~~~L~~~l~~~~-~i~~~~~i~~H~ 114 (126)
T cd06583 82 PPTAAQLEALAELLAYLVKRY-GIPPDYRIVGHR 114 (126)
T ss_pred CCCHHHHHHHHHHHHHHHHhC-CCCCCCEEEecc
Confidence 568888999999999999999 77 58888996
No 120
>TIGR03789 pdsO proteobacterial sortase system OmpA family protein. A newly defined histidine kinase (TIGR03785) and response regulator (TIGR03787) gene pair occurs exclusively in Proteobacteria, mostly of marine origin, nearly all of which contain a subfamily 6 sortase (TIGR03784) and its single dedicated target protein (TIGR03788) adjacent to to the sortase. This protein family shows up in only in those species with the histidine kinase/response regulator gene pair, and often adjacent to that pair. It belongs to the OmpA protein family (pfam00691). Its function is unknown. We assign the gene symbol pdsO, for Proteobacterial Dedicated Sortase system OmpA family protein.
Probab=21.27 E-value=1.6e+02 Score=21.74 Aligned_cols=39 Identities=15% Similarity=0.129 Sum_probs=30.0
Q ss_pred eeeEeeeccchHHHHHHHHHHHHhcCCCCeEEeeeeeCC
Q psy732 66 YNYDISLSLKDFYSIIPIMKERLKDQPVVTVCGYGHLER 104 (119)
Q Consensus 66 ~~~DvaVP~~~l~~~i~~~~~~~~~~p~~~~~~~GH~GD 104 (119)
|.+|-+-........+..+-..++.+|+..+.+.||...
T Consensus 139 F~~gSa~L~p~~~~~L~~iA~~Lk~~p~~~V~I~GHTD~ 177 (239)
T TIGR03789 139 FRTGSSDIEPHFQPQLDEVATLMKQSPELKLDLSGYADR 177 (239)
T ss_pred eCCCCccCCHHHHHHHHHHHHHHHhCCCCeEEEEEeCCC
Confidence 455556566677777888888888888899999999863
No 121
>PF13481 AAA_25: AAA domain; PDB: 1G8Y_J 1OLO_A 1NLF_C.
Probab=21.11 E-value=1.1e+02 Score=20.55 Aligned_cols=28 Identities=18% Similarity=0.121 Sum_probs=20.6
Q ss_pred HHHHHHHHHHHHhcCCCCeEEeeeeeCCC
Q psy732 77 FYSIIPIMKERLKDQPVVTVCGYGHLERL 105 (119)
Q Consensus 77 l~~~i~~~~~~~~~~p~~~~~~~GH~GDG 105 (119)
...++..++++++++ ++.+++..|.+-+
T Consensus 164 ~~~~~~~l~~la~~~-~~~vi~v~H~~K~ 191 (193)
T PF13481_consen 164 VAQLMQELKRLAKEY-GVAVILVHHTNKS 191 (193)
T ss_dssp HHHHHHHHHHHHHHH---EEEEEEEE---
T ss_pred HHHHHHHHHHHHHHc-CCEEEEEECCCCC
Confidence 467899999999999 9999999998744
No 122
>PF06183 DinI: DinI-like family; InterPro: IPR010391 This family of short proteins includes DNA-damage-inducible protein I (DinI) and related proteins. The SOS response, a set of cellular phenomena exhibited by eubacteria, is initiated by various causes that include DNA damage-induced replication arrest, and is positively regulated by the co- protease activity of RecA. Escherichia coli DinI, a LexA-regulated SOS gene product, shuts off the initiation of the SOS response when overexpressed in vivo. Biochemical and genetic studies indicated that DinI physically interacts with RecA to inhibit its co-protease activity []. The structure of DinI is known [].; PDB: 1GHH_A.
Probab=21.06 E-value=1.2e+02 Score=17.58 Aligned_cols=27 Identities=22% Similarity=0.229 Sum_probs=18.2
Q ss_pred eccchHHHHHHHHHHHHhcC-CCCeEEe
Q psy732 72 LSLKDFYSIIPIMKERLKDQ-PVVTVCG 98 (119)
Q Consensus 72 VP~~~l~~~i~~~~~~~~~~-p~~~~~~ 98 (119)
.|.+.+..+-.++.+.+.+. |++.+.+
T Consensus 1 lp~ga~~AL~~EL~kRl~~~yPd~~v~V 28 (65)
T PF06183_consen 1 LPAGALEALESELTKRLHRQYPDAEVRV 28 (65)
T ss_dssp --TTHHHHHHHHHHHHHHHH-SS-EEEE
T ss_pred CCccHHHHHHHHHHHHHHHHCCCceEee
Confidence 47778888888888887775 7765544
No 123
>COG0622 Predicted phosphoesterase [General function prediction only]
Probab=20.85 E-value=94 Score=21.58 Aligned_cols=19 Identities=21% Similarity=0.275 Sum_probs=13.7
Q ss_pred HHHHHHhcCCCCeEEeeeee
Q psy732 83 IMKERLKDQPVVTVCGYGHL 102 (119)
Q Consensus 83 ~~~~~~~~~p~~~~~~~GH~ 102 (119)
.+..+.++. +..+++|||-
T Consensus 100 ~l~~la~~~-~~Dvli~GHT 118 (172)
T COG0622 100 LLEYLAKEL-GADVLIFGHT 118 (172)
T ss_pred HHHHHHHhc-CCCEEEECCC
Confidence 344455566 8999999993
No 124
>COG5017 Uncharacterized conserved protein [Function unknown]
Probab=20.84 E-value=87 Score=21.52 Aligned_cols=23 Identities=4% Similarity=-0.161 Sum_probs=15.3
Q ss_pred HHHHHHHHhcCCCCeEEeeeeeCCCCc
Q psy732 81 IPIMKERLKDQPVVTVCGYGHLERLKD 107 (119)
Q Consensus 81 i~~~~~~~~~~p~~~~~~~GH~GDGNl 107 (119)
..+++++.. +.+ ++..|+|.|.+
T Consensus 56 ~~kiQsli~---dar-IVISHaG~GSI 78 (161)
T COG5017 56 EEKIQSLIH---DAR-IVISHAGEGSI 78 (161)
T ss_pred HHHHHHHhh---cce-EEEeccCcchH
Confidence 345555553 344 67799999976
No 125
>COG1938 Archaeal enzymes of ATP-grasp superfamily [General function prediction only]
Probab=20.76 E-value=1.4e+02 Score=22.11 Aligned_cols=34 Identities=21% Similarity=0.255 Sum_probs=29.2
Q ss_pred eeeEeeeccchHHHHHHHHHHHHhcCCCCe-EEeee
Q psy732 66 YNYDISLSLKDFYSIIPIMKERLKDQPVVT-VCGYG 100 (119)
Q Consensus 66 ~~~DvaVP~~~l~~~i~~~~~~~~~~p~~~-~~~~G 100 (119)
+..|+..|+..+.+|.+.+-+.+++. +++ ++++|
T Consensus 81 ~~~dv~I~p~~i~e~s~~v~~w~~~~-~v~~ii~~~ 115 (244)
T COG1938 81 LVSDVPIPPAVIYEISNAVVEWAEEN-GVEEVISLG 115 (244)
T ss_pred EEecCCCCHHHHHHHHHHHHHHHHHc-CCeEEEEec
Confidence 67899999999999999999999999 664 45555
No 126
>COG5186 PAP1 Poly(A) polymerase [RNA processing and modification]
Probab=20.64 E-value=91 Score=25.00 Aligned_cols=71 Identities=13% Similarity=-0.041 Sum_probs=41.5
Q ss_pred HHHHHHHHhhhhcHHHhhcCCCceeeEeee-------ccchHHHHHHHHHHHHhcCCCCeEEeeeeeCCCCcceeeec
Q psy732 43 SKIQKIWPLRERIVESCLKDGYIYNYDISL-------SLKDFYSIIPIMKERLKDQPVVTVCGYGHLERLKDQPVVTV 113 (119)
Q Consensus 43 ~e~~~lW~~R~~~~~~~~~~~~~~~~DvaV-------P~~~l~~~i~~~~~~~~~~p~~~~~~~GH~GDGNlH~~i~~ 113 (119)
-..-++|+.|++++.-.......+..-++| |-..=.-.+..+-.++.+.+--..++.--+-||++.+++.-
T Consensus 211 LRaIK~WAqRRavYaN~~GfpGGVAwam~VARiCQLYPNA~S~vIv~kFF~ils~WnWPqPviLkPieDgplqvrvWn 288 (552)
T COG5186 211 LRAIKYWAQRRAVYANPYGFPGGVAWAMCVARICQLYPNASSFVIVCKFFEILSSWNWPQPVILKPIEDGPLQVRVWN 288 (552)
T ss_pred HHHHHHHHHhhhhhccccCCcchHHHHHHHHHHHhhccCcchHhHHHHHHHHHHhcCCCCCeEeeeccCCCeeEEeeC
Confidence 345689999999886543321100000000 22222334555566777763456677888999999999874
No 127
>PRK10897 phosphohistidinoprotein-hexose phosphotransferase component of N-regulated PTS system (Npr); Provisional
Probab=20.56 E-value=1.5e+02 Score=18.17 Aligned_cols=11 Identities=9% Similarity=-0.112 Sum_probs=4.2
Q ss_pred HHHHHhhcCcc
Q psy732 23 FLQDGIENNII 33 (119)
Q Consensus 23 ~~~~~~~~~~~ 33 (119)
++......+.-
T Consensus 52 lm~Lg~~~G~~ 62 (90)
T PRK10897 52 LLMLDSAKGRQ 62 (90)
T ss_pred HHHhCCCCCCE
Confidence 33333334443
No 128
>cd07394 MPP_Vps29 Homo sapiens Vps29 and related proteins, metallophosphatase domain. Vps29 (vacuolar sorting protein 29), also known as vacuolar membrane protein Pep11, is a subunit of the retromer complex which is responsible for the retrieval of mannose-6-phosphate receptors (MPRs) from the endosomes for retrograde transport back to the Golgi. Vps29 has a phosphoesterase fold that acts as a protein interaction scaffold for retromer complex assembly as well as a phosphatase with specificity for the cytoplasmic tail of the MPR. The retromer includes the following 5 subunits: Vps35, Vps26, Vps29, and a dimer of the sorting nexins Vps5 (Snx1), and Vps17 (Snx2). Vps29 belongs to the metallophosphatase (MPP) superfamily. MPPs are functionally diverse, but all share a conserved domain with an active site consisting of two metal ions (usually manganese, iron, or zinc) coordinated with octahedral geometry by a cage of histidine, aspartate, and asparagine residues. The MPP superfamily incl
Probab=20.44 E-value=91 Score=21.46 Aligned_cols=19 Identities=16% Similarity=0.104 Sum_probs=13.1
Q ss_pred HHHHHHhcCCCCeEEeeeee
Q psy732 83 IMKERLKDQPVVTVCGYGHL 102 (119)
Q Consensus 83 ~~~~~~~~~p~~~~~~~GH~ 102 (119)
.+.+.+++. +..+++|||-
T Consensus 97 ~~~~~~~~~-~~dvii~GHT 115 (178)
T cd07394 97 SLAALQRQL-DVDILISGHT 115 (178)
T ss_pred HHHHHHHhc-CCCEEEECCC
Confidence 344444555 7899999994
No 129
>PF02633 Creatininase: Creatinine amidohydrolase; InterPro: IPR003785 This family includes the enzymes creatininase and 2-amino-5-formylamino-6-ribosylaminopyrimidin-4(3H)-one 5'-monophosphate deformylase, also known as formamide hydrolase. Creatinase or creatinine amidohydrolase (3.5.2.10 from EC) catalyses the hydrolysis of creatinine to creatine, which can then be metabolised to urea and sarcosine by creatinase (3.5.3.3 from EC). Creatininase is a member of the urease-related amidohydrolase superfamily []. Formamide hydrolase catalyzes the hydrolysis of the formamide of 2-amino-5-formylamino-6-ribosylamino-4(3H)-pyrimidinone 5'-monophosphate (FAPy) to form 2,5-diamino-6-ribosylamino-4(3H)-pyrimidinone 5'-phosphate (APy) (3.5.1.102 from EC). ; PDB: 3A6K_F 3A6F_A 3A6D_B 1J2U_B 3A6J_C 1J2T_A 3A6G_C 3A6H_F 1Q3K_E 3A6L_C ....
Probab=20.29 E-value=90 Score=22.43 Aligned_cols=36 Identities=22% Similarity=0.154 Sum_probs=26.6
Q ss_pred eeEeeeccchHHHHHHHHHHHHhcCCCCe--EEeeeeeC
Q psy732 67 NYDISLSLKDFYSIIPIMKERLKDQPVVT--VCGYGHLE 103 (119)
Q Consensus 67 ~~DvaVP~~~l~~~i~~~~~~~~~~p~~~--~~~~GH~G 103 (119)
.--++++.+-+..++.++-+-+.++ |++ +++-||-|
T Consensus 74 pGTisl~~~t~~~~l~di~~sl~~~-Gf~~ivivngHgG 111 (237)
T PF02633_consen 74 PGTISLSPETLIALLRDILRSLARH-GFRRIVIVNGHGG 111 (237)
T ss_dssp TT-BBB-HHHHHHHHHHHHHHHHHH-T--EEEEEESSTT
T ss_pred CCeEEeCHHHHHHHHHHHHHHHHHc-CCCEEEEEECCHh
Confidence 3468999999999999999988888 776 45569987
No 130
>PRK13780 phosphocarrier protein HPr; Provisional
Probab=20.11 E-value=1.7e+02 Score=17.89 Aligned_cols=20 Identities=5% Similarity=0.026 Sum_probs=11.6
Q ss_pred CceEEEecCCCchhHHHHHH
Q psy732 2 DPFFCCCLGSCVDHDAEKLN 21 (119)
Q Consensus 2 ~~LlvE~~g~~~~~~~~~~~ 21 (119)
+.+-|+.+|.++++.-+.+.
T Consensus 59 ~~v~i~a~G~De~~Al~~l~ 78 (88)
T PRK13780 59 ADITISAEGADAADAIAAIE 78 (88)
T ss_pred CEEEEEEeCcCHHHHHHHHH
Confidence 34678888876543333333
Done!