Query psy985
Match_columns 191
No_of_seqs 72 out of 74
Neff 4.8
Searched_HMMs 46136
Date Fri Aug 16 21:15:48 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy985.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/985hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF05641 Agenet: Agenet domain 99.0 1.2E-09 2.5E-14 77.4 5.7 59 40-99 1-66 (68)
2 smart00743 Agenet Tudor-like d 98.8 2E-08 4.3E-13 68.8 6.2 56 39-98 2-58 (61)
3 smart00561 MBT Present in Dros 97.5 0.00074 1.6E-08 51.1 7.9 59 36-97 24-84 (96)
4 PF11717 Tudor-knot: RNA bindi 97.4 0.00055 1.2E-08 46.6 6.0 50 40-94 1-54 (55)
5 smart00333 TUDOR Tudor domain. 96.3 0.016 3.4E-07 38.4 6.0 53 39-98 2-55 (57)
6 cd04508 TUDOR Tudor domains ar 96.0 0.02 4.3E-07 36.7 5.1 47 43-95 1-48 (48)
7 cd05162 PWWP The PWWP domain, 95.2 0.045 9.7E-07 39.7 4.8 53 40-99 1-64 (87)
8 PF00855 PWWP: PWWP domain; I 94.8 0.059 1.3E-06 38.4 4.5 51 40-97 1-59 (86)
9 PF02820 MBT: mbt repeat; Int 94.8 0.12 2.5E-06 36.8 6.0 51 44-97 1-53 (73)
10 cd05835 Dnmt3b_related The PWW 94.1 0.059 1.3E-06 39.8 3.3 50 40-96 1-58 (87)
11 PF00018 SH3_1: SH3 domain; I 94.0 0.067 1.4E-06 34.7 3.1 28 35-66 11-38 (48)
12 cd05836 N_Pac_NP60 The PWWP do 93.2 0.34 7.4E-06 35.8 6.1 51 40-97 1-60 (86)
13 PF07653 SH3_2: Variant SH3 do 92.5 0.13 2.8E-06 34.2 2.7 26 36-65 14-40 (55)
14 cd05834 HDGF_related The PWWP 91.2 0.43 9.2E-06 35.1 4.5 53 39-98 2-59 (83)
15 PLN00104 MYST -like histone ac 91.0 0.84 1.8E-05 43.5 7.4 57 37-97 51-115 (450)
16 PF14604 SH3_9: Variant SH3 do 91.0 0.27 5.8E-06 32.4 2.9 30 32-65 7-36 (49)
17 PF06003 SMN: Survival motor n 90.7 0.61 1.3E-05 41.2 5.8 57 37-99 66-124 (264)
18 smart00293 PWWP domain with co 90.6 0.69 1.5E-05 31.9 4.9 50 40-96 1-62 (63)
19 PF15057 DUF4537: Domain of un 90.4 0.88 1.9E-05 35.8 5.9 81 6-96 26-112 (124)
20 KOG0162|consensus 90.2 0.27 5.8E-06 49.8 3.4 26 39-68 1069-1094(1106)
21 cd05840 SPBC215_ISWI_like The 90.1 0.52 1.1E-05 35.4 4.1 51 40-97 1-65 (93)
22 PF00567 TUDOR: Tudor domain; 89.1 1.3 2.9E-05 31.8 5.7 55 38-98 50-105 (121)
23 cd05837 MSH6_like The PWWP dom 87.6 0.82 1.8E-05 35.2 3.8 55 39-99 2-70 (110)
24 smart00326 SH3 Src homology 3 87.3 1.4 3.1E-05 27.6 4.3 27 36-66 17-43 (58)
25 cd00174 SH3 Src homology 3 dom 86.5 1.6 3.5E-05 27.1 4.2 27 36-66 14-40 (54)
26 COG2139 RPL21A Ribosomal prote 83.9 2.8 6.1E-05 32.3 5.1 62 36-97 29-95 (98)
27 cd05841 BS69_related The PWWP 81.6 2.7 5.8E-05 31.3 4.1 54 40-99 7-60 (83)
28 cd06080 MUM1_like Mutated mela 81.2 4.4 9.5E-05 29.9 5.1 51 40-96 1-53 (80)
29 cd02781 MopB_CT_Acetylene-hydr 65.5 1.8 4E-05 32.8 -0.3 30 24-53 31-60 (130)
30 cd05838 WHSC1_related The PWWP 63.7 15 0.00033 27.4 4.5 50 40-96 1-62 (95)
31 PF13509 S1_2: S1 domain; PDB: 62.6 3.8 8.2E-05 28.1 0.9 28 26-54 25-52 (61)
32 KOG4348|consensus 58.5 1.9 4E-05 41.6 -1.6 32 36-71 115-146 (627)
33 COG2336 MazE Growth regulator 58.2 2.6 5.7E-05 31.5 -0.5 24 27-50 13-36 (82)
34 cd05839 BR140_related The PWWP 58.1 16 0.00034 28.7 3.8 58 41-104 2-86 (111)
35 cd02794 MopB_CT_DmsA-EC The Mo 56.8 3.4 7.3E-05 31.2 -0.2 26 28-53 32-57 (121)
36 cd02786 MopB_CT_3 The MopB_CT_ 56.8 28 0.0006 25.7 4.9 29 24-52 29-57 (116)
37 KOG3038|consensus 56.7 46 0.001 29.9 6.9 72 26-98 108-186 (264)
38 KOG2996|consensus 56.1 10 0.00022 38.1 2.9 39 31-71 815-853 (865)
39 PF09465 LBR_tudor: Lamin-B re 54.3 40 0.00087 23.5 4.9 39 38-80 4-44 (55)
40 smart00298 CHROMO Chromatin or 51.2 9.4 0.0002 24.3 1.3 34 61-94 3-41 (55)
41 cd02793 MopB_CT_DMSOR-BSOR-TMA 49.5 4 8.6E-05 31.4 -0.8 25 28-52 35-59 (129)
42 PF01245 Ribosomal_L19: Riboso 47.5 49 0.0011 25.9 5.0 34 37-70 16-51 (113)
43 PRK05338 rplS 50S ribosomal pr 47.2 41 0.00088 26.6 4.5 43 37-84 16-58 (116)
44 PF07039 DUF1325: SGF29 tudor- 46.1 48 0.001 26.3 4.9 49 29-81 61-112 (130)
45 CHL00084 rpl19 ribosomal prote 45.5 44 0.00095 26.5 4.5 43 37-84 20-62 (117)
46 PF12945 YcgR_2: Flagellar pro 44.3 85 0.0019 21.7 5.5 38 40-78 1-39 (87)
47 TIGR01024 rplS_bact ribosomal 43.9 45 0.00098 26.2 4.3 43 37-84 16-58 (113)
48 KOG4348|consensus 43.6 29 0.00064 33.7 3.8 34 37-72 277-310 (627)
49 cd02789 MopB_CT_FmdC-FwdD The 41.9 5.8 0.00013 29.8 -0.9 25 28-52 33-57 (106)
50 PF11302 DUF3104: Protein of u 41.7 45 0.00098 24.6 3.8 33 39-71 5-39 (75)
51 PF10775 ATP_sub_h: ATP syntha 40.8 14 0.0003 26.6 0.9 18 158-175 6-23 (67)
52 PF07039 DUF1325: SGF29 tudor- 40.4 71 0.0015 25.3 5.0 37 41-78 1-39 (130)
53 cd02783 MopB_CT_2 The MopB_CT_ 40.0 18 0.0004 28.9 1.7 38 24-67 30-67 (156)
54 PF13376 OmdA: Bacteriocin-pro 39.6 26 0.00056 24.0 2.1 24 111-134 3-27 (63)
55 cd02790 MopB_CT_Formate-Dh_H F 39.3 49 0.0011 24.2 3.8 62 24-89 33-95 (116)
56 cd02785 MopB_CT_4 The MopB_CT_ 38.6 83 0.0018 23.6 5.0 37 24-66 30-66 (124)
57 PF04014 Antitoxin-MazE: Antid 38.5 6.8 0.00015 25.3 -0.9 27 27-53 8-34 (47)
58 PRK05659 sulfur carrier protei 36.3 25 0.00053 23.8 1.6 16 36-51 47-62 (66)
59 cd02791 MopB_CT_Nitrate-R-NapA 35.8 89 0.0019 23.0 4.7 31 22-52 31-61 (122)
60 smart00739 KOW KOW (Kyprides, 35.7 79 0.0017 17.4 4.2 26 40-70 2-27 (28)
61 cd02778 MopB_CT_Thiosulfate-R- 35.6 13 0.00029 27.7 0.2 32 22-53 26-57 (123)
62 cd02787 MopB_CT_ydeP The MopB_ 35.5 11 0.00024 27.9 -0.2 27 28-54 33-59 (112)
63 PF03454 MoeA_C: MoeA C-termin 35.4 18 0.00038 24.7 0.8 20 28-50 52-71 (72)
64 TIGR01843 type_I_hlyD type I s 32.9 52 0.0011 29.2 3.6 35 37-71 321-355 (423)
65 PRK09798 antitoxin MazE; Provi 32.4 18 0.00038 26.7 0.4 24 27-50 14-37 (82)
66 PF00868 Transglut_N: Transglu 32.3 2.2E+02 0.0048 21.8 6.6 68 3-77 32-99 (118)
67 cd02858 Esterase_N_term Estera 32.1 43 0.00093 24.0 2.4 21 58-78 39-59 (85)
68 cd00508 MopB_CT_Fdh-Nap-like T 32.0 16 0.00034 26.8 0.1 29 24-52 33-61 (120)
69 PRK11347 antitoxin ChpS; Provi 31.9 17 0.00037 26.8 0.2 26 27-52 13-38 (83)
70 PF08239 SH3_3: Bacterial SH3 30.9 1.1E+02 0.0023 19.5 3.9 40 27-71 4-45 (55)
71 PRK06944 sulfur carrier protei 30.8 43 0.00093 22.5 2.1 16 36-51 46-61 (65)
72 KOG3766|consensus 30.8 1.3E+02 0.0028 29.2 5.9 54 36-92 197-252 (478)
73 COG4954 Uncharacterized protei 30.8 53 0.0011 26.2 2.8 63 109-177 5-80 (135)
74 PRK10800 acyl-CoA thioesterase 30.7 2.2E+02 0.0049 21.1 6.3 21 36-56 63-83 (130)
75 KOG2039|consensus 30.7 94 0.002 32.2 5.3 61 32-99 688-750 (875)
76 KOG4225|consensus 30.5 39 0.00084 32.7 2.4 32 36-72 447-478 (489)
77 KOG1118|consensus 30.2 43 0.00094 31.1 2.6 38 36-78 321-358 (366)
78 TIGR01683 thiS thiamine biosyn 29.6 38 0.00083 23.0 1.7 17 36-52 45-61 (64)
79 COG2996 Predicted RNA-bindinin 29.6 55 0.0012 29.7 3.1 68 27-96 31-110 (287)
80 KOG4226|consensus 29.4 58 0.0012 30.0 3.2 69 23-97 91-180 (379)
81 PRK04306 50S ribosomal protein 28.7 1.7E+02 0.0036 22.5 5.2 62 36-97 31-97 (98)
82 cd02775 MopB_CT Molybdopterin- 28.5 20 0.00043 25.4 0.1 38 15-53 13-50 (101)
83 cd02776 MopB_CT_Nitrate-R-NarG 28.4 17 0.00037 28.8 -0.3 29 24-52 29-57 (141)
84 PF13279 4HBT_2: Thioesterase- 28.2 1.2E+02 0.0026 22.0 4.3 17 36-52 53-69 (121)
85 PF01568 Molydop_binding: Moly 27.8 58 0.0013 23.5 2.5 64 27-97 31-101 (110)
86 PF12158 DUF3592: Protein of u 27.7 1.4E+02 0.0029 22.6 4.7 18 36-53 88-105 (148)
87 TIGR02609 doc_partner putative 27.7 22 0.00048 25.3 0.2 24 27-50 11-34 (74)
88 PRK05863 sulfur carrier protei 27.7 36 0.00077 23.4 1.3 15 37-51 47-61 (65)
89 PRK06764 hypothetical protein; 27.6 1.3E+02 0.0028 23.2 4.3 45 43-89 58-104 (105)
90 PF07088 GvpD: GvpD gas vesicl 26.8 35 0.00076 32.9 1.4 25 147-171 421-445 (484)
91 TIGR03794 NHPM_micro_HlyD NHPM 25.7 68 0.0015 29.4 3.1 33 38-70 306-338 (421)
92 cd02792 MopB_CT_Formate-Dh-Na- 25.6 1.9E+02 0.0041 21.2 5.0 31 23-53 32-62 (122)
93 COG4456 VagC Virulence-associa 25.2 93 0.002 22.9 3.1 32 8-53 3-34 (74)
94 PRK06488 sulfur carrier protei 24.6 65 0.0014 21.8 2.1 17 36-52 46-62 (65)
95 cd05698 S1_Rrp5_repeat_hs6_sc5 24.6 58 0.0013 21.7 1.9 44 5-51 13-56 (70)
96 PF03658 Ub-RnfH: RnfH family 24.5 49 0.0011 24.7 1.6 16 36-51 58-73 (84)
97 PF12148 DUF3590: Protein of u 24.4 1.3E+02 0.0027 22.7 3.8 44 54-97 6-57 (85)
98 cd02779 MopB_CT_Arsenite-Ox Th 24.3 27 0.00058 26.1 0.1 27 26-52 33-59 (115)
99 cd04472 S1_PNPase S1_PNPase: P 24.2 95 0.0021 20.1 2.9 44 4-50 12-55 (68)
100 PF14444 S1-like: S1-like 24.0 83 0.0018 22.1 2.5 23 38-60 32-54 (58)
101 PF02922 CBM_48: Carbohydrate- 23.3 2.3E+02 0.005 19.4 4.8 27 55-81 45-74 (85)
102 cd05690 S1_RPS1_repeat_ec5 S1_ 23.1 83 0.0018 20.7 2.4 44 4-50 12-56 (69)
103 cd04461 S1_Rrp5_repeat_hs8_sc7 22.9 66 0.0014 22.5 1.9 43 5-50 27-69 (83)
104 PRK08179 prfH peptide chain re 22.8 2E+02 0.0043 24.7 5.1 46 137-187 130-175 (200)
105 KOG3123|consensus 22.6 47 0.001 29.5 1.4 17 7-23 131-147 (272)
106 PF06152 Phage_min_cap2: Phage 22.5 71 0.0015 29.4 2.6 29 43-71 224-252 (361)
107 PF05258 DUF721: Protein of un 22.4 73 0.0016 21.9 2.1 33 145-177 47-79 (89)
108 KOG3601|consensus 22.2 55 0.0012 28.7 1.7 49 14-72 162-210 (222)
109 PRK03598 putative efflux pump 22.0 2.3E+02 0.0051 24.8 5.7 71 38-114 252-326 (331)
110 PRK07696 sulfur carrier protei 21.7 62 0.0014 22.4 1.6 16 36-51 48-63 (67)
111 PF13437 HlyD_3: HlyD family s 21.7 1.5E+02 0.0033 21.1 3.8 32 37-71 47-79 (105)
112 PF03423 CBM_25: Carbohydrate 21.0 79 0.0017 23.1 2.1 10 58-67 47-56 (87)
113 cd02777 MopB_CT_DMSOR-like The 20.7 28 0.00061 26.3 -0.4 26 27-52 35-60 (127)
114 cd02788 MopB_CT_NDH-1_NuoG2-N7 20.5 36 0.00078 24.8 0.2 27 26-52 29-55 (96)
115 KOG4773|consensus 20.2 27 0.00059 32.8 -0.6 37 37-77 191-230 (386)
116 PRK07440 hypothetical protein; 20.1 83 0.0018 22.1 2.0 16 36-51 51-66 (70)
117 PF15595 Imm31: Immunity prote 20.1 1.8E+02 0.0038 22.4 3.9 27 139-165 60-86 (107)
118 PF13382 Adenine_deam_C: Adeni 20.1 83 0.0018 26.2 2.3 33 131-163 50-82 (171)
119 PRK12335 tellurite resistance 20.0 1.5E+02 0.0033 25.6 4.0 51 58-113 31-81 (287)
No 1
>PF05641 Agenet: Agenet domain; InterPro: IPR008395 This domain is related to the TUDOR domain IPR008191 from INTERPRO []. The function of the agenet domain is unknown. This signature matches one of the two Agenet domains in the FMR proteins [].; GO: 0003723 RNA binding; PDB: 2BKD_N 3O8V_A 3KUF_A 3H8Z_A.
Probab=98.97 E-value=1.2e-09 Score=77.40 Aligned_cols=59 Identities=31% Similarity=0.392 Sum_probs=41.2
Q ss_pred ccCCCeEEEEeecCCCCccceeeEEEEEEecc-EEEEEEeccCCc------ceeeecCCCcccCCCC
Q psy985 40 FVENQEVEVFSKATEGESCGWWKVIVKMIKGD-FHVVEYLGCQST------FTEIVPPERMRQKNTN 99 (191)
Q Consensus 40 ~~egdeVEV~sran~~ep~gWW~a~V~~iKGd-f~vVey~~~d~~------~teIV~~erLR~~n~n 99 (191)
|..|+.|||++..+ +-.++|++|+|..+.++ .|.|+|...... .+|+|+..+|||..|.
T Consensus 1 F~~G~~VEV~s~e~-g~~gaWf~a~V~~~~~~~~~~V~Y~~~~~~~~~~~~l~e~V~~~~iRP~pP~ 66 (68)
T PF05641_consen 1 FKKGDEVEVSSDED-GFRGAWFPATVLKENGDDKYLVEYDDLPDEDGESPPLKEWVDARRIRPCPPP 66 (68)
T ss_dssp --TT-EEEEEE-SB-TT--EEEEEEEEEEETT-EEEEEETT-SS--------EEEEEGGGEEE----
T ss_pred CCCCCEEEEEEcCC-CCCcEEEEEEEEEeCCCcEEEEEECCcccccccccccEEEechheEECcCcC
Confidence 67999999998554 44899999999999999 999999766433 7999999999998753
No 2
>smart00743 Agenet Tudor-like domain present in plant sequences. Domain in plant sequences with possible chromatin-associated functions.
Probab=98.77 E-value=2e-08 Score=68.76 Aligned_cols=56 Identities=30% Similarity=0.532 Sum_probs=49.3
Q ss_pred CccCCCeEEEEeecCCCCccceeeEEEEEEec-cEEEEEEeccCCcceeeecCCCcccCCC
Q psy985 39 GFVENQEVEVFSKATEGESCGWWKVIVKMIKG-DFHVVEYLGCQSTFTEIVPPERMRQKNT 98 (191)
Q Consensus 39 ~~~egdeVEV~sran~~ep~gWW~a~V~~iKG-df~vVey~~~d~~~teIV~~erLR~~n~ 98 (191)
.|.+|+.|||+++. -.+||+|+|.++.| +.|.|.|.+.....++.|+..+|||..+
T Consensus 2 ~~~~G~~Ve~~~~~----~~~W~~a~V~~~~~~~~~~V~~~~~~~~~~e~v~~~~LRp~~~ 58 (61)
T smart00743 2 DFKKGDRVEVFSKE----EDSWWEAVVTKVLGDGKYLVRYLTESEPLKETVDWSDLRPHPP 58 (61)
T ss_pred CcCCCCEEEEEECC----CCEEEEEEEEEECCCCEEEEEECCCCcccEEEEeHHHcccCCC
Confidence 58899999999975 58999999999999 7999999974466899999999999764
No 3
>smart00561 MBT Present in Drosophila Scm, l(3)mbt, and vertebrate SCML2. Present in Drosophila Scm, l(3)mbt, and vertebrate SCML2. These proteins are involved in transcriptional regulation.
Probab=97.46 E-value=0.00074 Score=51.13 Aligned_cols=59 Identities=12% Similarity=0.142 Sum_probs=49.9
Q ss_pred CCCCccCCCeEEEEeecCCCCccceeeEEEEEEeccEEEEEEeccCCcceeeecCC--CcccCC
Q psy985 36 INSGFVENQEVEVFSKATEGESCGWWKVIVKMIKGDFHVVEYLGCQSTFTEIVPPE--RMRQKN 97 (191)
Q Consensus 36 ~~~~~~egdeVEV~sran~~ep~gWW~a~V~~iKGdf~vVey~~~d~~~teIV~~e--rLR~~n 97 (191)
..-.|.+|+.+|+--+. .|..+|.|+|..+.|....|.|.|||..|..++..+ .|+|++
T Consensus 24 ~~~~F~vGmkLEavD~~---~~~~i~vAtV~~v~g~~l~v~~dg~~~~~D~W~~~~S~~I~PvG 84 (96)
T smart00561 24 PPNGFKVGMKLEAVDPR---NPSLICVATVVEVKGYRLLLHFDGWDDKYDFWCDADSPDIHPVG 84 (96)
T ss_pred ccCcccCCCEEEEECCC---CCceEEEEEEEEEECCEEEEEEccCCCcCCEEEECCCCCcccCc
Confidence 35679999999997654 688999999999999999999999999988887654 566654
No 4
>PF11717 Tudor-knot: RNA binding activity-knot of a chromodomain ; PDB: 2EKO_A 2RO0_A 2RNZ_A 1WGS_A 3E9G_A 3E9F_A 2K3X_A 2K3Y_A 2EFI_A 2F5K_F ....
Probab=97.41 E-value=0.00055 Score=46.58 Aligned_cols=50 Identities=24% Similarity=0.387 Sum_probs=41.9
Q ss_pred ccCCCeEEEEeecCCCCccceeeEEEEEEecc----EEEEEEeccCCcceeeecCCCcc
Q psy985 40 FVENQEVEVFSKATEGESCGWWKVIVKMIKGD----FHVVEYLGCQSTFTEIVPPERMR 94 (191)
Q Consensus 40 ~~egdeVEV~sran~~ep~gWW~a~V~~iKGd----f~vVey~~~d~~~teIV~~erLR 94 (191)
|.+|+.|-++. .---|.+|+|..++.+ -|.|-|.|||..+.|.|+.++|+
T Consensus 1 ~~vG~~v~~~~-----~~~~~y~A~I~~~r~~~~~~~YyVHY~g~nkR~DeWV~~~~i~ 54 (55)
T PF11717_consen 1 FEVGEKVLCKY-----KDGQWYEAKILDIREKNGEPEYYVHYQGWNKRLDEWVPESRIR 54 (55)
T ss_dssp --TTEEEEEEE-----TTTEEEEEEEEEEEECTTCEEEEEEETTSTGCC-EEEETTTEE
T ss_pred CCcCCEEEEEE-----CCCcEEEEEEEEEEecCCCEEEEEEcCCCCCCceeeecHHHcc
Confidence 57899999987 3456999999999876 79999999999999999999987
No 5
>smart00333 TUDOR Tudor domain. Domain of unknown function present in several RNA-binding proteins. 10 copies in the Drosophila Tudor protein. Initial proposal that the survival motor neuron gene product contain a Tudor domain are corroborated by more recent database search techniques such as PSI-BLAST (unpublished).
Probab=96.33 E-value=0.016 Score=38.41 Aligned_cols=53 Identities=19% Similarity=0.369 Sum_probs=43.8
Q ss_pred CccCCCeEEEEeecCCCCccceeeEEEEEEec-cEEEEEEeccCCcceeeecCCCcccCCC
Q psy985 39 GFVENQEVEVFSKATEGESCGWWKVIVKMIKG-DFHVVEYLGCQSTFTEIVPPERMRQKNT 98 (191)
Q Consensus 39 ~~~egdeVEV~sran~~ep~gWW~a~V~~iKG-df~vVey~~~d~~~teIV~~erLR~~n~ 98 (191)
.+.+|+.|.|.. .+ -.|++|+|..+.+ ..+.|.|.+. ...+.|+.+.||+..+
T Consensus 2 ~~~~G~~~~a~~-~d----~~wyra~I~~~~~~~~~~V~f~D~--G~~~~v~~~~l~~l~~ 55 (57)
T smart00333 2 TFKVGDKVAARW-ED----GEWYRARIIKVDGEQLYEVFFIDY--GNEEVVPPSDLRPLPE 55 (57)
T ss_pred CCCCCCEEEEEe-CC----CCEEEEEEEEECCCCEEEEEEECC--CccEEEeHHHeecCCC
Confidence 578999999976 32 3699999999999 7899999853 4578999999999754
No 6
>cd04508 TUDOR Tudor domains are found in many eukaryotic organisms and have been implicated in protein-protein interactions in which methylated protein substrates bind to these domains. For example, the Tudor domain of Survival of Motor Neuron (SMN) binds to symmetrically dimethylated arginines of arginine-glycine (RG) rich sequences found in the C-terminal tails of Sm proteins. The SMN protein is linked to spinal muscular atrophy. Another example is the tandem tudor domains of 53BP1, which bind to histone H4 specifically dimethylated at Lys20 (H4-K20me2). 53BP1 is a key transducer of the DNA damage checkpoint signal.
Probab=96.03 E-value=0.02 Score=36.71 Aligned_cols=47 Identities=17% Similarity=0.218 Sum_probs=36.2
Q ss_pred CCeEEEEeecCCCCccceeeEEEEEEe-ccEEEEEEeccCCcceeeecCCCccc
Q psy985 43 NQEVEVFSKATEGESCGWWKVIVKMIK-GDFHVVEYLGCQSTFTEIVPPERMRQ 95 (191)
Q Consensus 43 gdeVEV~sran~~ep~gWW~a~V~~iK-Gdf~vVey~~~d~~~teIV~~erLR~ 95 (191)
|+.+-+.... -..|++|+|..+. ++.+.|.|.+.. ..++|+.+.||+
T Consensus 1 G~~c~a~~~~----d~~wyra~V~~~~~~~~~~V~f~DyG--~~~~v~~~~l~~ 48 (48)
T cd04508 1 GDLCLAKYSD----DGKWYRAKITSILSDGKVEVFFVDYG--NTEVVPLSDLRP 48 (48)
T ss_pred CCEEEEEECC----CCeEEEEEEEEECCCCcEEEEEEcCC--CcEEEeHHHcCC
Confidence 5556665432 3579999999999 678999998654 578999999986
No 7
>cd05162 PWWP The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes. The function of the PWWP domain is still not known precisely; however, based on the fact that other regions of PWWP-domain proteins are responsible for nuclear localization and DNA-binding, is likely that the PWWP domain acts as a site for protein-protein binding interactions, influencing chromatin remodeling and thereby regulating transcriptional processes. Some PWWP-domain proteins have been linked to cancer or other diseases; some are known to function as growth factors.
Probab=95.16 E-value=0.045 Score=39.72 Aligned_cols=53 Identities=25% Similarity=0.365 Sum_probs=41.4
Q ss_pred ccCCCeEEEEeecCCCCccceeeEEEEEEec-----------cEEEEEEeccCCcceeeecCCCcccCCCC
Q psy985 40 FVENQEVEVFSKATEGESCGWWKVIVKMIKG-----------DFHVVEYLGCQSTFTEIVPPERMRQKNTN 99 (191)
Q Consensus 40 ~~egdeVEV~sran~~ep~gWW~a~V~~iKG-----------df~vVey~~~d~~~teIV~~erLR~~n~n 99 (191)
|..||-|-+-.+. ..||||+|..... ..|.|.|.| +.+| -.|..++|+|-..+
T Consensus 1 f~~GdlVwaK~~g-----~pwWPa~V~~~~~~~~~~~~~~~~~~~~V~Ffg-~~~~-~wv~~~~l~pf~~~ 64 (87)
T cd05162 1 FRPGDLVWAKMKG-----YPWWPALVVDPPKDSKKAKKKAKEGKVLVLFFG-DKTF-AWVGAERLKPFTEH 64 (87)
T ss_pred CCCCCEEEEeCCC-----CCCCCEEEccccccchhhhccCCCCEEEEEEeC-CCcE-EEeCccceeeccch
Confidence 5789999987753 3699999987663 589999998 4445 78888889887654
No 8
>PF00855 PWWP: PWWP domain; InterPro: IPR000313 Upon characterisation of WHSC1, a gene mapping to the Wolf-Hirschhornsyndrome critical region and at its C terminus similar to the Drosophila melanogaster ASH1/trithorax group proteins, a novel protein domain designated PWWP domain was identified []. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. It is present in proteins of nuclear origin and plays a role in cell growth and differentiation. Due to its position, the composition of amino acids close to the PWWP motif and the pattern of other domains present it has been suggested that the domain is involved in protein-protein interactions [].; PDB: 3LYI_B 2L89_A 2NLU_A 1RI0_A 1KHC_A 3QKJ_C 2DAQ_A 1N27_A 3PFS_B 3QJ6_A ....
Probab=94.79 E-value=0.059 Score=38.38 Aligned_cols=51 Identities=22% Similarity=0.352 Sum_probs=38.1
Q ss_pred ccCCCeEEEEeecCCCCccceeeEEEEEE--------eccEEEEEEeccCCcceeeecCCCcccCC
Q psy985 40 FVENQEVEVFSKATEGESCGWWKVIVKMI--------KGDFHVVEYLGCQSTFTEIVPPERMRQKN 97 (191)
Q Consensus 40 ~~egdeVEV~sran~~ep~gWW~a~V~~i--------KGdf~vVey~~~d~~~teIV~~erLR~~n 97 (191)
|.+||-|-+-.+ -..||||+|... +++.+.|.|.|.+ +| ..|..++|+|-.
T Consensus 1 f~~GdlVWaK~~-----g~pwWPa~V~~~~~~~~~~~~~~~~~V~Ffg~~-~~-~wv~~~~i~~f~ 59 (86)
T PF00855_consen 1 FRPGDLVWAKLK-----GYPWWPARVCDPDEKSKKKRKDGHVLVRFFGDN-DY-AWVKPSNIKPFS 59 (86)
T ss_dssp -STTEEEEEEET-----TSEEEEEEEEECCHCTSCSSSSTEEEEEETTTT-EE-EEEEGGGEEECC
T ss_pred CCCCCEEEEEeC-----CCCCCceEEeecccccccCCCCCEEEEEecCCC-CE-EEECHHHhhChh
Confidence 578999999763 345999999988 3568999999876 55 566677777653
No 9
>PF02820 MBT: mbt repeat; InterPro: IPR004092 The function of the malignant brain tumor (MBT) repeat is unknown, but is found in a number of nuclear proteins involved in transcriptional repression. The repeat contains a completely conserved glutamate at its amino terminus that may be important for function. The crystal structure of the two MBT repeats of human SCM-like 2 protein has been reported. Each repeat consists of an extended "arm" and a globular core. The arm of the first repeat packs against the core of the second repeat and vice versa. The structure of the core-interacting part of each arm consists of an N-terminal alpha-helix and a turn of 310 helix connected by a short beta-strand. The core consists of an Src homology 3-like five-stranded beta-barrel followed by a C-terminal alpha-helix and another short beta-strand. Each arm interacts with its partner core in a similar way, with the orientation of the N-terminal helix relative to the barrel varying slightly. There are also extensive interactions between the two barrels [].; GO: 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus; PDB: 2P0K_A 3F70_B 3CEY_A 2VYT_A 2BIV_A 1OI1_A 3OQ5_A 2RJE_B 2RJD_A 2RI3_A ....
Probab=94.75 E-value=0.12 Score=36.78 Aligned_cols=51 Identities=14% Similarity=0.111 Sum_probs=39.2
Q ss_pred CeEEEEeecCCCCccceeeEEEEEEeccEEEEEEeccCCcceeeecCC--CcccCC
Q psy985 44 QEVEVFSKATEGESCGWWKVIVKMIKGDFHVVEYLGCQSTFTEIVPPE--RMRQKN 97 (191)
Q Consensus 44 deVEV~sran~~ep~gWW~a~V~~iKGdf~vVey~~~d~~~teIV~~e--rLR~~n 97 (191)
+.+|+..+. +|...|.|.|..+.|+-..|.|.|++..+..++..+ +|.|++
T Consensus 1 MkLEa~d~~---~~~~~~vAtV~~v~g~~l~v~~dg~~~~~d~w~~~~S~~i~PvG 53 (73)
T PF02820_consen 1 MKLEAVDPR---NPSLICVATVVKVCGGRLLVRYDGWDDDYDFWCHIDSPRIFPVG 53 (73)
T ss_dssp EEEEEEETT---ECCEEEEEEEEEEETTEEEEEETTSTGGGEEEEETTSTTEEETT
T ss_pred CeEEEECCC---CCCeEEEEEEEEEeCCEEEEEEcCCCCCccEEEECCCCCeeecc
Confidence 356777654 467899999999999999999999998876666543 455543
No 10
>cd05835 Dnmt3b_related The PWWP domain is an essential component of DNA methyltransferase 3 B (Dnmt3b) which is responsible for establishing DNA methylation patterns during embryogenesis and gametogenesis. In tumorigenesis, DNA methylation by Dnmt3b is known to play a role in the inactivation of tumor suppressor genes. In addition, a point mutation in the PWWP domain of Dnmt3b has been identified in patients with ICF syndrome (immunodeficiency, centromeric instability, and facial anomalies), a rare autosomal recessive disorder characterized by hypomethylation of classical satellite DNA. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
Probab=94.10 E-value=0.059 Score=39.81 Aligned_cols=50 Identities=22% Similarity=0.248 Sum_probs=36.3
Q ss_pred ccCCCeEEEEeecCCCCccceeeEEEEEEecc--------EEEEEEeccCCcceeeecCCCcccC
Q psy985 40 FVENQEVEVFSKATEGESCGWWKVIVKMIKGD--------FHVVEYLGCQSTFTEIVPPERMRQK 96 (191)
Q Consensus 40 ~~egdeVEV~sran~~ep~gWW~a~V~~iKGd--------f~vVey~~~d~~~teIV~~erLR~~ 96 (191)
|.+||-|-+-.+. ..||||+|.....+ .+-|.|-|. .+ ...|+.+.|-|-
T Consensus 1 f~vGDlVWaK~kg-----~pwWP~~V~~~~~~~~~~~~~~~~~V~fFGs-~~-~a~v~~~~l~pf 58 (87)
T cd05835 1 FNVGDLVWGKIKG-----FPWWPGRVVSITVTSKRPPVVGMRWVTWFGS-GT-FSEVSVDKLSPF 58 (87)
T ss_pred CCCCCEEEEecCC-----CCCCCeEEechhhcccccCCCCeEEEEEeCC-CC-EeEECHHHCcCh
Confidence 5689999887754 45999999887443 588999883 33 466677776664
No 11
>PF00018 SH3_1: SH3 domain; InterPro: IPR001452 SH3 (src Homology-3) domains are small protein modules containing approximately 50 amino acid residues [, ]. They are found in a great variety of intracellular or membrane-associated proteins [, , ] for example, in a variety of proteins with enzymatic activity, in adaptor proteins that lack catalytic sequences and in cytoskeletal proteins, such as fodrin and yeast actin binding protein ABP-1. The SH3 domain has a characteristic fold which consists of five or six beta-strands arranged as two tightly packed anti-parallel beta sheets. The linker regions may contain short helices []. The surface of the SH3-domain bears a flat, hydrophobic ligand-binding pocket which consists of three shallow grooves defined by conservative aromatic residues in which the ligand adopts an extended left-handed helical arrangement. The ligand binds with low affinity but this may be enhanced by multiple interactions. The region bound by the SH3 domain is in all cases proline-rich and contains PXXP as a core-conserved binding motif. The function of the SH3 domain is not well understood but they may mediate many diverse processes such as increasing local concentration of proteins, altering their subcellular location and mediating the assembly of large multiprotein complexes []. The crystal structure of the SH3 domain of the cytoskeletal protein spectrin, and the solution structures of SH3 domains of phospholipase C (PLC-y) and phosphatidylinositol 3-kinase p85 alpha-subunit, have been determined [, , ]. In spite of relatively limited sequence similarity, their overall structures are similar. The domains belong to the alpha+beta structural class, with 5 to 8 beta-strands forming 2 tightly-packed, anti-parallel beta-sheets arranged in a barrel-like structure, and intervening loops sometimes forming helices. Conserved aliphatic and aromatic residues form a hydrophobic core (A11, L23, A29, V34, W42, L52 and V59 in PLC-y []) and a hydrophobic pocket on the molecular surface (L12, F13, W53 and P55 in PLC-y). The conserved core is believed to stabilise the fold, while the pocket is thought to serve as a binding site for target proteins. Conserved carboxylic amino acids located in the loops, on the periphery of the pocket (D14 and E22), may be involved in protein-protein interactions via proline-rich regions. The N- and C-termini are packed in close proximity, indicating that they are independent structural modules.; GO: 0005515 protein binding; PDB: 1UHF_A 1W1F_A 1WA7_A 1SEM_A 1KFZ_A 2SEM_B 1K76_A 3SEM_B 1X2Q_A 2J06_B ....
Probab=93.98 E-value=0.067 Score=34.65 Aligned_cols=28 Identities=32% Similarity=0.692 Sum_probs=23.1
Q ss_pred CCCCCccCCCeEEEEeecCCCCccceeeEEEE
Q psy985 35 TINSGFVENQEVEVFSKATEGESCGWWKVIVK 66 (191)
Q Consensus 35 ~~~~~~~egdeVEV~sran~~ep~gWW~a~V~ 66 (191)
+....|..||.++|..+.++ +||.|+..
T Consensus 11 ~~eLs~~~Gd~i~v~~~~~~----~Ww~~~~~ 38 (48)
T PF00018_consen 11 PDELSFKKGDIIEVLEKSDD----GWWKVRNE 38 (48)
T ss_dssp TTBSEB-TTEEEEEEEESSS----SEEEEEET
T ss_pred CCEEeEECCCEEEEEEecCC----CEEEEEEC
Confidence 35688999999999998887 99999865
No 12
>cd05836 N_Pac_NP60 The PWWP domain is an essential part of the cytokine-like nuclear factor n-pac protein, or NP60, which enhances the activity of MAP2K4 and MAP2K6 kinases to phosphorylate p38-alpha. In a variety of cell lines, NP60 has been shown to localize to the nucleus. In addition to the PWWP domain, NP60 also contains an AT-hook and a C-terminal NAD-binding domain. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding proteins, that function as transcription factors regulating a variety of developmental processes.
Probab=93.25 E-value=0.34 Score=35.80 Aligned_cols=51 Identities=14% Similarity=0.115 Sum_probs=37.1
Q ss_pred ccCCCeEEEEeecCCCCccceeeEEEEE---------EeccEEEEEEeccCCcceeeecCCCcccCC
Q psy985 40 FVENQEVEVFSKATEGESCGWWKVIVKM---------IKGDFHVVEYLGCQSTFTEIVPPERMRQKN 97 (191)
Q Consensus 40 ~~egdeVEV~sran~~ep~gWW~a~V~~---------iKGdf~vVey~~~d~~~teIV~~erLR~~n 97 (191)
|+.||-|-+-.+. ..||||+|.. -++..|.|.|-|-. +| -.|..+.|.|-.
T Consensus 1 f~~GDlVwaK~~g-----~P~WPa~V~~~~~~~~~~~~~~~~~~V~FFG~~-~~-~wv~~~~l~pF~ 60 (86)
T cd05836 1 LKLGDLVWAKMKG-----FPPWPGRIVKPPKDLKKPRGKAKCFFVFFFGSE-NH-AWIKEENIKPYH 60 (86)
T ss_pred CCCCCEEEEeCCC-----CCCCCEEEechhhhcccccCCCCeEEEEEeCCC-CE-EEECHHhCeech
Confidence 5689998887753 7799999965 23568999998854 34 667777777654
No 13
>PF07653 SH3_2: Variant SH3 domain; InterPro: IPR011511 SH3 (src Homology-3) domains are small protein modules containing approximately 50 amino acid residues [, ]. They are found in a great variety of intracellular or membrane-associated proteins [, , ] for example, in a variety of proteins with enzymatic activity, in adaptor proteins that lack catalytic sequences and in cytoskeletal proteins, such as fodrin and yeast actin binding protein ABP-1. The SH3 domain has a characteristic fold which consists of five or six beta-strands arranged as two tightly packed anti-parallel beta sheets. The linker regions may contain short helices []. The surface of the SH3-domain bears a flat, hydrophobic ligand-binding pocket which consists of three shallow grooves defined by conservative aromatic residues in which the ligand adopts an extended left-handed helical arrangement. The ligand binds with low affinity but this may be enhanced by multiple interactions. The region bound by the SH3 domain is in all cases proline-rich and contains PXXP as a core-conserved binding motif. The function of the SH3 domain is not well understood but they may mediate many diverse processes such as increasing local concentration of proteins, altering their subcellular location and mediating the assembly of large multiprotein complexes []. This entry represents a variant of the SH3 domain.; PDB: 1I1J_B 1K0X_A 1HJD_A 2KEA_A 1KJW_A 1JXM_A 1JXO_B 2EBP_A 2DL3_A 2EYX_A ....
Probab=92.47 E-value=0.13 Score=34.24 Aligned_cols=26 Identities=27% Similarity=0.564 Sum_probs=19.3
Q ss_pred CCCCccCCCeEEEE-eecCCCCccceeeEEE
Q psy985 36 INSGFVENQEVEVF-SKATEGESCGWWKVIV 65 (191)
Q Consensus 36 ~~~~~~egdeVEV~-sran~~ep~gWW~a~V 65 (191)
....|..||.|+|+ .+.++ |||.|+.
T Consensus 14 ~~Ls~~~Gd~i~v~~~~~~~----~ww~~~~ 40 (55)
T PF07653_consen 14 DELSFKKGDVIEVLGEKDDD----GWWLGEN 40 (55)
T ss_dssp TB-EB-TTEEEEEEEEECST----SEEEEEE
T ss_pred CceEEecCCEEEEEEeecCC----CEEEEEE
Confidence 35889999999999 44333 9999988
No 14
>cd05834 HDGF_related The PWWP domain is an essential part of the Hepatoma Derived Growth Factor (HDGF) family of proteins, and is necessary for DNA binding by HDGF. This family of endogenous nuclear-targeted mitogens includes HRP (HDGF-related proteins 1, 2, 3, 4, or HPR1, HPR2, HPR3, HPR4, respectively) and lens epithelium-derived growth factor, LEDGF. Members of the HDGF family have been linked to human diseases, and HDGF is a prognostic factor in several types of cancer. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
Probab=91.22 E-value=0.43 Score=35.12 Aligned_cols=53 Identities=19% Similarity=0.199 Sum_probs=40.3
Q ss_pred CccCCCeEEEEeecCCCCccceeeEEEEEEe-----ccEEEEEEeccCCcceeeecCCCcccCCC
Q psy985 39 GFVENQEVEVFSKATEGESCGWWKVIVKMIK-----GDFHVVEYLGCQSTFTEIVPPERMRQKNT 98 (191)
Q Consensus 39 ~~~egdeVEV~sran~~ep~gWW~a~V~~iK-----Gdf~vVey~~~d~~~teIV~~erLR~~n~ 98 (191)
+|+.||-|-+-.+. .-||||+|.... +.-|.|.|.|.. +| -.|+...|-|-..
T Consensus 2 ~f~~GdlVwaK~kG-----yp~WPa~I~~~~~~~~~~~~~~V~FfGt~-~~-a~v~~~~l~pf~~ 59 (83)
T cd05834 2 QFKAGDLVFAKVKG-----YPAWPARVDEPEDWKPPGKKYPVYFFGTH-ET-AFLKPEDLFPYTE 59 (83)
T ss_pred CCCCCCEEEEecCC-----CCCCCEEEecccccCCCCCEEEEEEeCCC-CE-eEECHHHceeccc
Confidence 68899999987754 569999998887 468999999853 33 5577777766544
No 15
>PLN00104 MYST -like histone acetyltransferase; Provisional
Probab=91.04 E-value=0.84 Score=43.54 Aligned_cols=57 Identities=16% Similarity=0.043 Sum_probs=47.4
Q ss_pred CCCccCCCeEEEEeecCCCCccceeeEEEEEEec-------c-EEEEEEeccCCcceeeecCCCcccCC
Q psy985 37 NSGFVENQEVEVFSKATEGESCGWWKVIVKMIKG-------D-FHVVEYLGCQSTFTEIVPPERMRQKN 97 (191)
Q Consensus 37 ~~~~~egdeVEV~sran~~ep~gWW~a~V~~iKG-------d-f~vVey~~~d~~~teIV~~erLR~~n 97 (191)
...|.+|+.|-||...+. .|..|+|..++- + -|+|-|.|||....|+|+.+||+..+
T Consensus 51 ~~~~~VGekVla~~~~Dg----~~~~A~VI~~R~~~~~~~~~~~YYVHY~g~nrRlDEWV~~~rLdls~ 115 (450)
T PLN00104 51 MLPLEVGTRVMCRWRFDG----KYHPVKVIERRRGGSGGPNDYEYYVHYTEFNRRLDEWVKLEQLDLDT 115 (450)
T ss_pred cceeccCCEEEEEECCCC----CEEEEEEEEEeccCCCCCCCceEEEEEecCCccHhhccCHhhccccc
Confidence 456899999999985444 477899998884 2 49999999999999999999998755
No 16
>PF14604 SH3_9: Variant SH3 domain; PDB: 2CRE_A 2E5K_A 2CT3_A 2DE0_X 2D8H_A 2DA9_A 2X3X_E 2X3W_D 2KRN_A 2ED0_A ....
Probab=90.99 E-value=0.27 Score=32.41 Aligned_cols=30 Identities=20% Similarity=0.458 Sum_probs=21.8
Q ss_pred CCCCCCCCccCCCeEEEEeecCCCCccceeeEEE
Q psy985 32 PRETINSGFVENQEVEVFSKATEGESCGWWKVIV 65 (191)
Q Consensus 32 P~~~~~~~~~egdeVEV~sran~~ep~gWW~a~V 65 (191)
|..+....|..||.|.|..+. ..|||.|+.
T Consensus 7 ~~~~dELs~~~Gd~i~v~~~~----~~~W~~g~~ 36 (49)
T PF14604_consen 7 AQDPDELSFKKGDVITVLEKS----DDGWWYGRN 36 (49)
T ss_dssp SSSTTB-EB-TTEEEEEEEES----STSEEEEEE
T ss_pred CCCcCEeeEcCCCEEEEEEeC----CCCEEEEEE
Confidence 333457889999999999774 469999984
No 17
>PF06003 SMN: Survival motor neuron protein (SMN); InterPro: IPR010304 This family consists of several eukaryotic survival motor neuron (SMN) proteins. The Survival of Motor Neurons (SMN) protein, the product of the spinal muscular atrophy-determining gene, is part of a large macromolecular complex (SMN complex) that functions in the assembly of spliceosomal small nuclear ribonucleoproteins (snRNPs). The SMN complex functions as a specificity factor essential for the efficient assembly of Sm proteins on U snRNAs and likely protects cells from illicit, and potentially deleterious, non-specific binding of Sm proteins to RNAs.; GO: 0003723 RNA binding, 0006397 mRNA processing, 0005634 nucleus, 0005737 cytoplasm; PDB: 1MHN_A 4A4G_A 3S6N_M 4A4E_A 1G5V_A 4A4H_A 4A4F_A 2D9T_A.
Probab=90.70 E-value=0.61 Score=41.16 Aligned_cols=57 Identities=14% Similarity=0.275 Sum_probs=42.5
Q ss_pred CCCccCCCeEEEEeecCCCCccceeeEEEEEEecc--EEEEEEeccCCcceeeecCCCcccCCCC
Q psy985 37 NSGFVENQEVEVFSKATEGESCGWWKVIVKMIKGD--FHVVEYLGCQSTFTEIVPPERMRQKNTN 99 (191)
Q Consensus 37 ~~~~~egdeVEV~sran~~ep~gWW~a~V~~iKGd--f~vVey~~~d~~~teIV~~erLR~~n~n 99 (191)
.....+||...+.. +.|+ .|+.|+|..|.++ -++|.|.|.++ +|+|.+..|+|...+
T Consensus 66 ~~~WkvGd~C~A~~-s~Dg---~~Y~A~I~~i~~~~~~~~V~f~gYgn--~e~v~l~dL~~~~~~ 124 (264)
T PF06003_consen 66 NKKWKVGDKCMAVY-SEDG---QYYPATIESIDEEDGTCVVVFTGYGN--EEEVNLSDLKPSEGD 124 (264)
T ss_dssp TT---TT-EEEEE--TTTS---SEEEEEEEEEETTTTEEEEEETTTTE--EEEEEGGGEEETT--
T ss_pred ccCCCCCCEEEEEE-CCCC---CEEEEEEEEEcCCCCEEEEEEcccCC--eEeeehhhhcccccc
Confidence 46889999999965 3444 3999999999975 89999999975 799999999998754
No 18
>smart00293 PWWP domain with conserved PWWP motif. conservation of Pro-Trp-Trp-Pro residues
Probab=90.61 E-value=0.69 Score=31.93 Aligned_cols=50 Identities=18% Similarity=0.286 Sum_probs=37.2
Q ss_pred ccCCCeEEEEeecCCCCccceeeEEEEEEe------------ccEEEEEEeccCCcceeeecCCCcccC
Q psy985 40 FVENQEVEVFSKATEGESCGWWKVIVKMIK------------GDFHVVEYLGCQSTFTEIVPPERMRQK 96 (191)
Q Consensus 40 ~~egdeVEV~sran~~ep~gWW~a~V~~iK------------Gdf~vVey~~~d~~~teIV~~erLR~~ 96 (191)
|..||-|-+-++ -..||+|+|..-. ...|.|.|.|-. +| -.|+.++|.|.
T Consensus 1 f~~GdlVwaK~~-----G~p~WPa~V~~~~~~~~~~~~~~~~~~~~~V~Ffg~~-~~-awv~~~~l~p~ 62 (63)
T smart00293 1 FKPGDLVWAKMK-----GFPWWPALVVSPKETPDNIRKRKRFENLYPVLFFGDK-DT-AWISSSKLFPL 62 (63)
T ss_pred CCCCCEEEEECC-----CCCCCCeEEcCcccCChhHhhccCCCCEEEEEEeCCC-CE-EEECccceeeC
Confidence 467899888775 3559999997553 237888998754 35 88999888874
No 19
>PF15057 DUF4537: Domain of unknown function (DUF4537)
Probab=90.37 E-value=0.88 Score=35.80 Aligned_cols=81 Identities=12% Similarity=0.004 Sum_probs=53.4
Q ss_pred eEEEEeecCCCCcccccccCcccccCCCCCCCCCccCCCeEEEEeecCCCCccceeeEEEE------EEeccEEEEEEec
Q psy985 6 FNVSNPFSLLRWQPESKFQFSMVRLPPRETINSGFVENQEVEVFSKATEGESCGWWKVIVK------MIKGDFHVVEYLG 79 (191)
Q Consensus 6 ~~v~V~f~eN~Wqpe~~vPf~~VRlPP~~~~~~~~~egdeVEV~sran~~ep~gWW~a~V~------~iKGdf~vVey~~ 79 (191)
-...|.| ++... +.+|.+++-.=-.. ....+.+||.|-+- .+...+.|-||+|. ..++++|.|.|-
T Consensus 26 ~~~lV~f-~~~~~--~~v~~~~iI~~~~~-~~~~L~~GD~VLA~---~~~~~~~Y~Pg~V~~~~~~~~~~~~~~~V~f~- 97 (124)
T PF15057_consen 26 GQFLVEF-DDGDT--QEVPISDIIALSDA-MRHSLQVGDKVLAP---WEPDDCRYGPGTVIAGPERRASEDKEYTVRFY- 97 (124)
T ss_pred CEEEEEE-CCCCE--EEeChHHeEEccCc-ccCcCCCCCEEEEe---cCcCCCEEeCEEEEECccccccCCceEEEEEE-
Confidence 3567788 44433 35666666322222 26788999999998 44445669999999 668889999995
Q ss_pred cCCcceeeecCCCcccC
Q psy985 80 CQSTFTEIVPPERMRQK 96 (191)
Q Consensus 80 ~d~~~teIV~~erLR~~ 96 (191)
|.+ +.-|+++.++..
T Consensus 98 -ng~-~~~vp~~~~~~I 112 (124)
T PF15057_consen 98 -NGK-TAKVPRGEVIWI 112 (124)
T ss_pred -CCC-CCccchhhEEEC
Confidence 443 455666655543
No 20
>KOG0162|consensus
Probab=90.19 E-value=0.27 Score=49.78 Aligned_cols=26 Identities=19% Similarity=0.560 Sum_probs=19.0
Q ss_pred CccCCCeEEEEeecCCCCccceeeEEEEEE
Q psy985 39 GFVENQEVEVFSKATEGESCGWWKVIVKMI 68 (191)
Q Consensus 39 ~~~egdeVEV~sran~~ep~gWW~a~V~~i 68 (191)
.|+.||+||+-. .+|.|||+|++..-
T Consensus 1069 s~~~~diIei~~----edpSGWw~gk~~~k 1094 (1106)
T KOG0162|consen 1069 SFKKGDIIEIMR----EDPSGWWLGKLNGK 1094 (1106)
T ss_pred cccCCCEEEEec----cCCCcchhhccCCc
Confidence 456678877743 57999999995443
No 21
>cd05840 SPBC215_ISWI_like The PWWP domain is a component of the S. pombe hypothetical protein SPBC215, as well as ISWI complex protein 4. The ISWI (imitation switch) proteins are ATPases responsible for chromatin remodeling in eukaryotes, and SPBC215 is proposed to also bind chromatin. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
Probab=90.06 E-value=0.52 Score=35.42 Aligned_cols=51 Identities=18% Similarity=0.187 Sum_probs=36.2
Q ss_pred ccCCCeEEEEeecCCCCccceeeEEEEE--------------EeccEEEEEEeccCCcceeeecCCCcccCC
Q psy985 40 FVENQEVEVFSKATEGESCGWWKVIVKM--------------IKGDFHVVEYLGCQSTFTEIVPPERMRQKN 97 (191)
Q Consensus 40 ~~egdeVEV~sran~~ep~gWW~a~V~~--------------iKGdf~vVey~~~d~~~teIV~~erLR~~n 97 (191)
|..||-|-+-.+. +-||+|+|.. .++..|-|.|.|. .+| -.|....|.|..
T Consensus 1 f~~GDlVwaK~~G-----yPwWPA~V~~~~~~p~~~l~~~~~~~~~~~~V~FFg~-~~~-~Wv~~~~l~pl~ 65 (93)
T cd05840 1 FQPGDRVLAKVKG-----FPAWPAIVVPEEMLPDSVLKGKKKKNKRTYPVMFFPD-GDY-YWVPNKDLKPLT 65 (93)
T ss_pred CCCCCEEEEeCCC-----CCCCCEEECChHHCCHHHHhcccCCCCCeEEEEEeCC-CcE-EEEChhhcccCC
Confidence 5679999998864 5699999953 1234788898874 344 566777777653
No 22
>PF00567 TUDOR: Tudor domain; InterPro: IPR008191 There are multiple copies of this domain in the Drosophila melanogaster tudor protein and it has been identified in several RNA-binding proteins []. Although the function of this domain is unknown, in Drosophila melanogaster the tudor protein is required during oogenesis for the formation of primordial germ cells and for normal abdominal segmentation [].; PDB: 3NTI_A 3NTK_B 3NTH_A 2DIQ_A 3FDR_A 3PNW_O 3S6W_A 3PMT_A 2WAC_A 2O4X_A ....
Probab=89.12 E-value=1.3 Score=31.83 Aligned_cols=55 Identities=13% Similarity=0.132 Sum_probs=39.9
Q ss_pred CCccCCCeEEEEeecCCCCccceeeEEE-EEEeccEEEEEEeccCCcceeeecCCCcccCCC
Q psy985 38 SGFVENQEVEVFSKATEGESCGWWKVIV-KMIKGDFHVVEYLGCQSTFTEIVPPERMRQKNT 98 (191)
Q Consensus 38 ~~~~egdeVEV~sran~~ep~gWW~a~V-~~iKGdf~vVey~~~d~~~teIV~~erLR~~n~ 98 (191)
.....|+-+-++...++ +|.+|+| ....++.+.|.|. |-..++.|+...||+..+
T Consensus 50 ~~~~~~~~~~~~~~~~~----~w~Ra~I~~~~~~~~~~V~~i--D~G~~~~v~~~~l~~l~~ 105 (121)
T PF00567_consen 50 PESNPGEGCLCVVSEDG----RWYRAVITVDIDENQYKVFLI--DYGNTEKVSASDLRPLPP 105 (121)
T ss_dssp ST--TTEEEEEEETTTS----EEEEEEEEEEECTTEEEEEET--TTTEEEEEEGGGEEE--H
T ss_pred cccccCCEEEEEEecCC----ceeeEEEEEecccceeEEEEE--ecCceEEEcHHHhhhhCH
Confidence 34556666655554444 9999999 7777779999998 667789999999999874
No 23
>cd05837 MSH6_like The PWWP domain is present in MSH6, a mismatch repair protein homologous to bacterial MutS. The PWWP domain of histone-lysine N-methyltransferase, also known as Nuclear SET domain-containing protein 3, is also included. Mutations in MSH6 have been linked to increased cancer susceptibility, particularly in hereditary nonpolyposis colorectal cancer in humans. The role of the PWWP domain in MSH6 is not clear; MSH6 orthologs found in S. cerevisiae, Caenorhabditis elegans and Arabidopsis thaliana lack the PWWP domain. Histone methyltransferases (HMTases) induce the posttranslational methylation of lysine residues in histones and play a role in apoptosis. In the HMTase Whistle, the PWWP domain is necessary for HMTase activity. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain pro
Probab=87.59 E-value=0.82 Score=35.16 Aligned_cols=55 Identities=13% Similarity=0.198 Sum_probs=41.1
Q ss_pred CccCCCeEEEEeecCCCCccceeeEEEEE--------------EeccEEEEEEeccCCcceeeecCCCcccCCCC
Q psy985 39 GFVENQEVEVFSKATEGESCGWWKVIVKM--------------IKGDFHVVEYLGCQSTFTEIVPPERMRQKNTN 99 (191)
Q Consensus 39 ~~~egdeVEV~sran~~ep~gWW~a~V~~--------------iKGdf~vVey~~~d~~~teIV~~erLR~~n~n 99 (191)
+|..||-|-+-.+. .-||||.|.. -+...|-|+|.|-..+| -.|..+.|.|-...
T Consensus 2 ~~~~GdlVWaK~~g-----~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~~V~FFG~~~~~-aWv~~~~l~pf~~~ 70 (110)
T cd05837 2 KYQVGDLVWAKVSG-----YPWWPCMVCSDPLLGTYTKTKRNKRKPRQYHVQFFGDNPER-AWISEKSLKPFKGS 70 (110)
T ss_pred CCCCCCEEEEeCCC-----CCCCCEEEecccccchhhhhhhccCCCCeEEEEEcCCCCCE-EEecHHHccccCCc
Confidence 57889999987754 5699999983 12248999999876555 58888888886644
No 24
>smart00326 SH3 Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
Probab=87.27 E-value=1.4 Score=27.56 Aligned_cols=27 Identities=26% Similarity=0.543 Sum_probs=22.4
Q ss_pred CCCCccCCCeEEEEeecCCCCccceeeEEEE
Q psy985 36 INSGFVENQEVEVFSKATEGESCGWWKVIVK 66 (191)
Q Consensus 36 ~~~~~~egdeVEV~sran~~ep~gWW~a~V~ 66 (191)
....+..||.|+|..+. ..|||.++..
T Consensus 17 ~~l~~~~Gd~v~v~~~~----~~~w~~~~~~ 43 (58)
T smart00326 17 DELSFKKGDIITVLEKS----DDGWWKGRLG 43 (58)
T ss_pred CCCCCCCCCEEEEEEcC----CCCeEEEEeC
Confidence 45788999999999876 4699999765
No 25
>cd00174 SH3 Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
Probab=86.53 E-value=1.6 Score=27.07 Aligned_cols=27 Identities=26% Similarity=0.540 Sum_probs=21.7
Q ss_pred CCCCccCCCeEEEEeecCCCCccceeeEEEE
Q psy985 36 INSGFVENQEVEVFSKATEGESCGWWKVIVK 66 (191)
Q Consensus 36 ~~~~~~egdeVEV~sran~~ep~gWW~a~V~ 66 (191)
....+..||.|.|... ...|||.++..
T Consensus 14 ~~l~~~~Gd~v~v~~~----~~~~w~~~~~~ 40 (54)
T cd00174 14 DELSFKKGDIIEVLEK----SDDGWWEGRLL 40 (54)
T ss_pred CCCCCCCCCEEEEEEc----CCCCeEEEEEC
Confidence 4678899999999987 34689999754
No 26
>COG2139 RPL21A Ribosomal protein L21E [Translation, ribosomal structure and biogenesis]
Probab=83.95 E-value=2.8 Score=32.35 Aligned_cols=62 Identities=18% Similarity=0.143 Sum_probs=49.1
Q ss_pred CCCCccCCCeEEEEeecC--CCCcc---ceeeEEEEEEeccEEEEEEeccCCcceeeecCCCcccCC
Q psy985 36 INSGFVENQEVEVFSKAT--EGESC---GWWKVIVKMIKGDFHVVEYLGCQSTFTEIVPPERMRQKN 97 (191)
Q Consensus 36 ~~~~~~egdeVEV~sran--~~ep~---gWW~a~V~~iKGdf~vVey~~~d~~~teIV~~erLR~~n 97 (191)
+-.+|.+||.|.+-.-.. ..-|. -=-.|+|.-++|+-|+|+-...+..-+-||..+.|||..
T Consensus 29 ~l~ey~~Gd~V~I~IdpSv~kGmPh~rf~G~TG~Vvg~~g~ay~V~v~~G~k~K~liv~peHLk~~~ 95 (98)
T COG2139 29 YLQEYKVGDKVHIDIDPSVHKGMPHPRFQGKTGTVVGVRGRAYKVEVYDGNKEKTLIVRPEHLKPQK 95 (98)
T ss_pred HHhhccCCCEEEEEeCcccccCCCCccccCcceEEEeccCCEEEEEEecCCceEEEEeCHHHccccc
Confidence 556899999999966432 22332 334679999999999999998888899999999999864
No 27
>cd05841 BS69_related The PWWP domain is part of BS69 protein, a nuclear protein that specifically binds adenoviral E1A and Epstein-Barr viral EBNA2 proteins, suppressing their transactivation functions. BS69 is a multi-domain protein, containing bromo, PHD, PWWP, and MYND domains. The specific role of the PWWP domain within BS69 is not clearly identified, but BS69 functions in chromatin remodeling, consistent with other PWWP-containing proteins. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
Probab=81.61 E-value=2.7 Score=31.30 Aligned_cols=54 Identities=17% Similarity=0.215 Sum_probs=39.6
Q ss_pred ccCCCeEEEEeecCCCCccceeeEEEEEEeccEEEEEEeccCCcceeeecCCCcccCCCC
Q psy985 40 FVENQEVEVFSKATEGESCGWWKVIVKMIKGDFHVVEYLGCQSTFTEIVPPERMRQKNTN 99 (191)
Q Consensus 40 ~~egdeVEV~sran~~ep~gWW~a~V~~iKGdf~vVey~~~d~~~teIV~~erLR~~n~n 99 (191)
...||-|-+-.+. ..||+|+|....+.-|-|.|-|... -.-.|+...|.|-+.+
T Consensus 7 ~~p~dLVwAK~kG-----yp~WPAkV~~~~~~~~~V~FFG~t~-~~a~v~~~~i~~~~~~ 60 (83)
T cd05841 7 RPPHELVWAKLKG-----FPYWPAKVMRVEDNQVDVRFFGGQH-DRAWIPSNNIQPISTE 60 (83)
T ss_pred CCCCCEEEEeCCC-----CCCCCEEEeecCCCeEEEEEcCCCC-CeEEEehHHeeehhhh
Confidence 4567777665543 6799999999999999999998443 3467777777765543
No 28
>cd06080 MUM1_like Mutated melanoma-associated antigen 1 (MUM-1) is a melanoma-associated antigen (MAA). MUM-1 belongs to the mutated or aberrantly expressed type of MAAs, along with antigens such as CDK4, beta-catenin, gp100-in4, p15, and N-acetylglucosaminyltransferase V. It is highly expressed in several types of human cancers. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
Probab=81.17 E-value=4.4 Score=29.95 Aligned_cols=51 Identities=20% Similarity=0.319 Sum_probs=33.7
Q ss_pred ccCCCeEEEEeecCCCCccceeeEEEEEEec--cEEEEEEeccCCcceeeecCCCcccC
Q psy985 40 FVENQEVEVFSKATEGESCGWWKVIVKMIKG--DFHVVEYLGCQSTFTEIVPPERMRQK 96 (191)
Q Consensus 40 ~~egdeVEV~sran~~ep~gWW~a~V~~iKG--df~vVey~~~d~~~teIV~~erLR~~ 96 (191)
|..||-|-.-.+ -+-||||+|..+.. .-|-|.|.|.+.+ ...+..+.|-|-
T Consensus 1 f~~gdlVWaK~~-----g~P~WPa~I~~~~~~~~k~~V~FfG~~~~-~a~~~~~~l~p~ 53 (80)
T cd06080 1 FEKNDLVWAKIQ-----GYPWWPAVIKSISRKKQKARVNFIGDNMQ-SEKKGIRVVKRW 53 (80)
T ss_pred CCCCCEEEEeCC-----CCCCCCEEEeeecCCCCEEEEEEeCCCCc-eeccchhhcccc
Confidence 467787777554 36699999988853 4788999988632 333444444443
No 29
>cd02781 MopB_CT_Acetylene-hydratase The MopB_CT_Acetylene-hydratase CD contains acetylene hydratase (Ahy) and other related proteins. The acetylene hydratase of Pelobacter acetylenicus is a tungsten iron-sulfur protein involved in the fermentation of acetylene to ethanol and acetate. This CD is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=65.45 E-value=1.8 Score=32.79 Aligned_cols=30 Identities=13% Similarity=0.128 Sum_probs=23.9
Q ss_pred cCcccccCCCCCCCCCccCCCeEEEEeecC
Q psy985 24 QFSMVRLPPRETINSGFVENQEVEVFSKAT 53 (191)
Q Consensus 24 Pf~~VRlPP~~~~~~~~~egdeVEV~sran 53 (191)
|..-|.+-|.+..+..+.+||.|+|++...
T Consensus 31 ~~~~v~inp~dA~~~gi~~Gd~V~v~s~~G 60 (130)
T cd02781 31 PDPVAEINPETAAKLGIADGDWVWVETPRG 60 (130)
T ss_pred CCCEEEECHHHHHHcCCCCCCEEEEECCCC
Confidence 444577777766889999999999999763
No 30
>cd05838 WHSC1_related The PWWP domain was first identified in the WHSC1 (Wolf-Hirschhorn syndrome candidate 1) protein, a protein implicated in Wolf-Hirschhorn syndrome (WHS). When translocated, WHSC1 plays a role in lymphoid multiple myeloma (MM) disease, also known as plasmacytoma. WHCS1 proteins typically contain two copies of the PWWP domain. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
Probab=63.72 E-value=15 Score=27.40 Aligned_cols=50 Identities=16% Similarity=0.157 Sum_probs=31.9
Q ss_pred ccCCCeEEEEeecCCCCccceeeEEEEEE----------e--ccEEEEEEeccCCcceeeecCCCcccC
Q psy985 40 FVENQEVEVFSKATEGESCGWWKVIVKMI----------K--GDFHVVEYLGCQSTFTEIVPPERMRQK 96 (191)
Q Consensus 40 ~~egdeVEV~sran~~ep~gWW~a~V~~i----------K--Gdf~vVey~~~d~~~teIV~~erLR~~ 96 (191)
+..||-|-+-.+. .-||||.|..- + ..-+.|.|-|.. +| -.|+..+|-|-
T Consensus 1 ~~~GdlVWaK~~g-----~pwWPa~V~~~~~~p~~~~~~~~~~~~~~V~Ffgs~-~y-~Wv~~~~l~pf 62 (95)
T cd05838 1 PLYGDIVWAKLGN-----FRWWPAIICDPREVPPNIQVLRHCIGEFCVMFFGTH-DY-YWVHRGRVFPY 62 (95)
T ss_pred CCcCCEEEEECCC-----CCCCCeEEcChhhcChhHhhccCCCCeEEEEEeCCC-CE-EEeccccccch
Confidence 3569999998765 45999999751 1 124777887642 23 45555555543
No 31
>PF13509 S1_2: S1 domain; PDB: 3GO5_A.
Probab=62.62 E-value=3.8 Score=28.12 Aligned_cols=28 Identities=32% Similarity=0.347 Sum_probs=16.1
Q ss_pred cccccCCCCCCCCCccCCCeEEEEeecCC
Q psy985 26 SMVRLPPRETINSGFVENQEVEVFSKATE 54 (191)
Q Consensus 26 ~~VRlPP~~~~~~~~~egdeVEV~sran~ 54 (191)
.++.||-... ..++.+||+|+||..-|.
T Consensus 25 ~~vlLp~~e~-~~~~~~Gd~v~VFvY~D~ 52 (61)
T PF13509_consen 25 KEVLLPKSEV-PEPLKVGDEVEVFVYLDK 52 (61)
T ss_dssp EEEEEEGGG-------TTSEEEEEEEE-T
T ss_pred CEEEechHHc-CCCCCCCCEEEEEEEECC
Confidence 5677775533 367999999999987664
No 32
>KOG4348|consensus
Probab=58.49 E-value=1.9 Score=41.64 Aligned_cols=32 Identities=25% Similarity=0.412 Sum_probs=23.3
Q ss_pred CCCCccCCCeEEEEeecCCCCccceeeEEEEEEecc
Q psy985 36 INSGFVENQEVEVFSKATEGESCGWWKVIVKMIKGD 71 (191)
Q Consensus 36 ~~~~~~egdeVEV~sran~~ep~gWW~a~V~~iKGd 71 (191)
...++.+||.+|+-+-..+ |||.|++.---|=
T Consensus 115 DELelkVGDiIeli~eVEe----GWw~G~Lngk~Gm 146 (627)
T KOG4348|consen 115 DELELKVGDIIELISEVEE----GWWKGKLNGKVGM 146 (627)
T ss_pred ceeeeeeccHHHhhhHhhh----hhhhceecCcccc
Confidence 4568888888888775544 9999988744443
No 33
>COG2336 MazE Growth regulator [Signal transduction mechanisms]
Probab=58.15 E-value=2.6 Score=31.54 Aligned_cols=24 Identities=29% Similarity=0.279 Sum_probs=20.8
Q ss_pred ccccCCCCCCCCCccCCCeEEEEe
Q psy985 27 MVRLPPRETINSGFVENQEVEVFS 50 (191)
Q Consensus 27 ~VRlPP~~~~~~~~~egdeVEV~s 50 (191)
-||+|+.--+...+++||+|||-.
T Consensus 13 avrIP~~l~kql~l~~g~~v~v~v 36 (82)
T COG2336 13 AVRIPAALLKQLNLTIGDEVEVEV 36 (82)
T ss_pred eeeccHHHHHHhCCCcCceEEEEE
Confidence 489999877889999999999854
No 34
>cd05839 BR140_related The PWWP domain is found in the BR140 family, which includes peregrin and BR140-like proteins 1 and 2. BR140 is the only family to contain the PWWP domain at the C terminus, with PHD and bromo domains in the N-terminal region. In myeloid leukemias, BR140 is disrupted by chromosomal translocations, similar to translocations of WHSC1 in lymphoid multiple myeloma. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding proteins, that function as transcription factors regulating a variety of developmental processes.
Probab=58.06 E-value=16 Score=28.68 Aligned_cols=58 Identities=10% Similarity=0.184 Sum_probs=41.1
Q ss_pred cCCCeEEEEeecCCCCccceeeEEEEEEe--------------------------cc-EEEEEEeccCCcceeeecCCCc
Q psy985 41 VENQEVEVFSKATEGESCGWWKVIVKMIK--------------------------GD-FHVVEYLGCQSTFTEIVPPERM 93 (191)
Q Consensus 41 ~egdeVEV~sran~~ep~gWW~a~V~~iK--------------------------Gd-f~vVey~~~d~~~teIV~~erL 93 (191)
..||-|.+-++ -+-||+|.|.--. ++ .|.|.|.+-..+| -.|+++.|
T Consensus 2 ~pg~lVwaK~~-----g~P~wPa~iidp~~~~~~~~~~~~p~~~l~~~~~~~~~~~~~~~lV~FFd~~~s~-~Wv~~~~l 75 (111)
T cd05839 2 EPLTLVWAKCR-----GYPSYPALIIDPKMPRDGVFHNGVPPDVLTLGEARAQNADERLYLVLFFDNKRTW-QWLPGDKL 75 (111)
T ss_pred CCcCEeeeeec-----CCCCCCeEeeCCCCCCcccccCCCCchhhhHHHHHhccCCCcEEEEEEecCCCcc-eecCHHHC
Confidence 56888888773 4569999887532 22 7889998554455 67999999
Q ss_pred ccCCCCCCCCc
Q psy985 94 RQKNTNPPIDA 104 (191)
Q Consensus 94 R~~n~n~pi~~ 104 (191)
.|...+..++.
T Consensus 76 ~pl~~~~~~D~ 86 (111)
T cd05839 76 EPLGVDETLDK 86 (111)
T ss_pred cccccchhhhh
Confidence 99886655544
No 35
>cd02794 MopB_CT_DmsA-EC The MopB_CT_DmsA-EC CD includes the DmsA enzyme of the dmsABC operon encoding the anaerobic dimethylsulfoxide reductase (DMSOR) of Escherichia coli and other related DMSOR-like enzymes. Unlike other DMSOR-like enzymes, this group has a predicted N-terminal iron-sulfur [4Fe-4S] cluster binding site. This CD is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=56.78 E-value=3.4 Score=31.22 Aligned_cols=26 Identities=23% Similarity=0.413 Sum_probs=21.6
Q ss_pred cccCCCCCCCCCccCCCeEEEEeecC
Q psy985 28 VRLPPRETINSGFVENQEVEVFSKAT 53 (191)
Q Consensus 28 VRlPP~~~~~~~~~egdeVEV~sran 53 (191)
|.+-|.+.....+.+||.|+|++...
T Consensus 32 v~i~p~~A~~~gi~~Gd~V~v~s~~g 57 (121)
T cd02794 32 VWINPLDAAARGIKDGDRVLVFNDRG 57 (121)
T ss_pred EEECHHHHHHcCCCCCCEEEEEcCCc
Confidence 67777666889999999999998763
No 36
>cd02786 MopB_CT_3 The MopB_CT_3 CD includes a group of related uncharacterized bacterial molybdopterin-binding oxidoreductase-like domains with a putative N-terminal iron-sulfur [4Fe-4S] cluster binding site and molybdopterin cofactor binding site. This CD is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=56.76 E-value=28 Score=25.67 Aligned_cols=29 Identities=17% Similarity=0.310 Sum_probs=23.4
Q ss_pred cCcccccCCCCCCCCCccCCCeEEEEeec
Q psy985 24 QFSMVRLPPRETINSGFVENQEVEVFSKA 52 (191)
Q Consensus 24 Pf~~VRlPP~~~~~~~~~egdeVEV~sra 52 (191)
|-..|.+-|.+..+..+.+||.|+|+++.
T Consensus 29 ~~~~v~i~p~dA~~lgi~~Gd~V~v~s~~ 57 (116)
T cd02786 29 GEPTLLIHPADAAARGIADGDLVVVFNDR 57 (116)
T ss_pred CCCEEEECHHHHHHcCCCCCCEEEEEcCC
Confidence 44557777777788999999999999865
No 37
>KOG3038|consensus
Probab=56.66 E-value=46 Score=29.89 Aligned_cols=72 Identities=14% Similarity=0.102 Sum_probs=48.8
Q ss_pred cccccCCCCC------CCCCccCCCeEEEEeecCCCCccceeeEEEEEEeccE-EEEEEeccCCcceeeecCCCcccCCC
Q psy985 26 SMVRLPPRET------INSGFVENQEVEVFSKATEGESCGWWKVIVKMIKGDF-HVVEYLGCQSTFTEIVPPERMRQKNT 98 (191)
Q Consensus 26 ~~VRlPP~~~------~~~~~~egdeVEV~sran~~ep~gWW~a~V~~iKGdf-~vVey~~~d~~~teIV~~erLR~~n~ 98 (191)
...|+||+-. .++.+..||+|-.++++.+..-- |-+|.|+.+-++- |-++-..=+..--+..++.+|=+.+.
T Consensus 108 ~~~~~~p~~~gaip~~~~~~~~~gd~VAa~v~~~~~dg~-WIlaeVv~~~~~~~ye~ev~D~Epk~d~~g~r~~~yklp~ 186 (264)
T KOG3038|consen 108 GKPGKPPPLCGAIPAQGDYVLLKGDEVAARVKAVSEDGD-WILAEVVKVSSETRYEFEVVDPEPKKDEVGNRGQLYKLPR 186 (264)
T ss_pred CCCCCCCcccccccccCCccccCCceeeeeeeeccCCCC-EEEEEEEEEecCCceEeEecCCCccccccccccceecccH
Confidence 3456555532 35778899999999987655444 9999999999996 55554433322356777777766553
No 38
>KOG2996|consensus
Probab=56.05 E-value=10 Score=38.07 Aligned_cols=39 Identities=28% Similarity=0.441 Sum_probs=28.5
Q ss_pred CCCCCCCCCccCCCeEEEEeecCCCCccceeeEEEEEEecc
Q psy985 31 PPRETINSGFVENQEVEVFSKATEGESCGWWKVIVKMIKGD 71 (191)
Q Consensus 31 PP~~~~~~~~~egdeVEV~sran~~ep~gWW~a~V~~iKGd 71 (191)
|.++.....+.+||.|-+|++..+ --|||+|.|----|=
T Consensus 815 ~ard~~eLSlk~GDvV~i~~k~g~--d~GWWkGevngrvGw 853 (865)
T KOG2996|consen 815 CARDMRELSLKEGDVVKIYDKVGE--DQGWWKGEVNGRVGW 853 (865)
T ss_pred CCCchhhcccccCCEEEEehhccc--cCceecceecCcccc
Confidence 333334567899999999997654 569999988765543
No 39
>PF09465 LBR_tudor: Lamin-B receptor of TUDOR domain; InterPro: IPR019023 The Lamin-B receptor is a chromatin and lamin binding protein in the inner nuclear membrane. It is one of the integral inner nuclear envelope membrane proteins responsible for targeting nuclear membranes to chromatin, being a downstream effector of Ran, a small Ras-like nuclear GTPase which regulates NE assembly. Lamin-B receptor interacts with importin beta, a Ran-binding protein, thereby directly contributing to the fusion of membrane vesicles and the formation of the nuclear envelope []. ; PDB: 2L8D_A 2DIG_A.
Probab=54.33 E-value=40 Score=23.48 Aligned_cols=39 Identities=21% Similarity=0.066 Sum_probs=29.5
Q ss_pred CCccCCCeEEEEeecCCCCccceeeEEEEE--EeccEEEEEEecc
Q psy985 38 SGFVENQEVEVFSKATEGESCGWWKVIVKM--IKGDFHVVEYLGC 80 (191)
Q Consensus 38 ~~~~egdeVEV~sran~~ep~gWW~a~V~~--iKGdf~vVey~~~ 80 (191)
..+..|+.|.|.=+.+. .|+.|+|.. .+.+.|-|.|...
T Consensus 4 ~k~~~Ge~V~~rWP~s~----lYYe~kV~~~d~~~~~y~V~Y~DG 44 (55)
T PF09465_consen 4 RKFAIGEVVMVRWPGSS----LYYEGKVLSYDSKSDRYTVLYEDG 44 (55)
T ss_dssp SSS-SS-EEEEE-TTTS-----EEEEEEEEEETTTTEEEEEETTS
T ss_pred ccccCCCEEEEECCCCC----cEEEEEEEEecccCceEEEEEcCC
Confidence 67899999999876543 689999999 7788999999855
No 40
>smart00298 CHROMO Chromatin organization modifier domain.
Probab=51.23 E-value=9.4 Score=24.29 Aligned_cols=34 Identities=12% Similarity=-0.038 Sum_probs=24.7
Q ss_pred eeEEEEEEe-----ccEEEEEEeccCCcceeeecCCCcc
Q psy985 61 WKVIVKMIK-----GDFHVVEYLGCQSTFTEIVPPERMR 94 (191)
Q Consensus 61 W~a~V~~iK-----Gdf~vVey~~~d~~~teIV~~erLR 94 (191)
...+|...+ .-.|.|.+.|++..+++.++.+.|+
T Consensus 3 ~v~~Il~~r~~~~~~~~ylVkW~g~~~~~~tW~~~~~l~ 41 (55)
T smart00298 3 EVEKILDHRWKKKGELEYLVKWKGYSYSEDTWEPEENLL 41 (55)
T ss_pred chheeeeeeecCCCcEEEEEEECCCCCccCceeeHHHHH
Confidence 345565555 2378999999988778888777766
No 41
>cd02793 MopB_CT_DMSOR-BSOR-TMAOR The MopB_DMSOR-BSOR-TMAOR CD contains dimethylsulfoxide reductase (DMSOR), biotin sulfoxide reductase (BSOR), trimethylamine N-oxide reductase (TMAOR) and other related proteins. DMSOR always catalyzes the reduction of DMSO to dimethylsulfide, but its cellular location and oligomerization state are organism-dependent. For example, in Rhodobacter sphaeriodes and Rhodobacter capsulatus, it is an 82-kDa monomeric soluble protein found in the periplasmic space; in E. coli, it is membrane-bound and exists as a heterotrimer. BSOR catalyzes the reduction of biotin sulfixode to biotin, and is unique among Mo enzymes because no additional auxiliary proteins or cofactors are required. TMAOR is similar to DMSOR, but its only natural substrate is TMAO.This CD is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=49.54 E-value=4 Score=31.37 Aligned_cols=25 Identities=24% Similarity=0.570 Sum_probs=19.6
Q ss_pred cccCCCCCCCCCccCCCeEEEEeec
Q psy985 28 VRLPPRETINSGFVENQEVEVFSKA 52 (191)
Q Consensus 28 VRlPP~~~~~~~~~egdeVEV~sra 52 (191)
|.+=|.+.....+.+||.|+|+++.
T Consensus 35 v~i~p~dA~~~gi~~Gd~V~v~s~~ 59 (129)
T cd02793 35 IRINPADAAARGIADGDIVRVFNDR 59 (129)
T ss_pred EEECHHHHHHcCCCCCCEEEEEcCC
Confidence 5555555578899999999999864
No 42
>PF01245 Ribosomal_L19: Ribosomal protein L19; InterPro: IPR001857 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. Ribosomal protein L19 is one of the proteins from the large ribosomal subunit [, ]. In Escherichia coli, L19 is known to be located at the 30S-50S ribosomal subunit interface [] and may play a role in the structure and function of the aminoacyl-tRNA binding site. It belongs to a family of ribosomal proteins, including L19 from bacteria and the chloroplasts of red algae. L19 is a protein of 120 to 130 amino-acid residues.; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 3HUZ_T 3V2D_T 3I8I_R 2XG2_T 2V49_T 2XUX_T 3HUX_T 3I9C_R 3V25_T 3UZ2_R ....
Probab=47.49 E-value=49 Score=25.86 Aligned_cols=34 Identities=21% Similarity=0.186 Sum_probs=21.4
Q ss_pred CCCccCCCeEEEEeecCCCC--ccceeeEEEEEEec
Q psy985 37 NSGFVENQEVEVFSKATEGE--SCGWWKVIVKMIKG 70 (191)
Q Consensus 37 ~~~~~egdeVEV~sran~~e--p~gWW~a~V~~iKG 70 (191)
-++|.+||.|+|-+...+.+ ---=..|.|..++|
T Consensus 16 ~p~f~~GD~v~V~~~i~e~~k~r~q~f~GvvIa~~~ 51 (113)
T PF01245_consen 16 IPEFRVGDTVRVTYKISEGNKERIQVFEGVVIARRR 51 (113)
T ss_dssp SSSSSSSSEEEEEEEEESSSSEEEEEEEEEEEEEEB
T ss_pred CCCcCCCCEEEEEEEEecCCCceeEEEEEEEEEEEC
Confidence 36899999999977765443 22234455555544
No 43
>PRK05338 rplS 50S ribosomal protein L19; Provisional
Probab=47.20 E-value=41 Score=26.60 Aligned_cols=43 Identities=26% Similarity=0.291 Sum_probs=26.4
Q ss_pred CCCccCCCeEEEEeecCCCCccceeeEEEEEEeccEEEEEEeccCCcc
Q psy985 37 NSGFVENQEVEVFSKATEGESCGWWKVIVKMIKGDFHVVEYLGCQSTF 84 (191)
Q Consensus 37 ~~~~~egdeVEV~sran~~ep~gWW~a~V~~iKGdf~vVey~~~d~~~ 84 (191)
-++|.+||.|.|.++-.+++ .-++....|=...+.-.|.++++
T Consensus 16 ~p~f~~GD~V~V~~~i~eg~-----k~R~q~f~GvvI~~~~~G~~~tf 58 (116)
T PRK05338 16 IPEFRPGDTVRVHVKVVEGN-----KERIQAFEGVVIARRGRGLNETF 58 (116)
T ss_pred CCCcCCCCEEEEEEEEccCC-----ceEeccEEEEEEEEeCCCCCceE
Confidence 46899999999988754432 23344444444444445666555
No 44
>PF07039 DUF1325: SGF29 tudor-like domain; InterPro: IPR010750 SAGA-associated factor 29 is involved in transcriptional regulation, probably through association with histone acetyltransferase (HAT) complexes like the TFTC-HAT or STAGA complexes. It also may be involved in MYC-mediated oncogenic transformation. It is a component of the ATAC complex, which is a complex with histone acetyltransferase activity on histones H3 and H4 []. This entry represents a domain found in yeast and human SAGA-associated factor 29 proteins that is related to the tudor domain. ; PDB: 3MP6_A 3MP1_A 3MP8_A 3MET_B 3ME9_A 3MEU_B 3MEA_A 3MEV_B 3LX7_A 3MEW_A.
Probab=46.12 E-value=48 Score=26.27 Aligned_cols=49 Identities=12% Similarity=0.152 Sum_probs=33.8
Q ss_pred ccCCCCCCCCCccCCCeEEEEeecCCCCccceeeEEEEEE---eccEEEEEEeccC
Q psy985 29 RLPPRETINSGFVENQEVEVFSKATEGESCGWWKVIVKMI---KGDFHVVEYLGCQ 81 (191)
Q Consensus 29 RlPP~~~~~~~~~egdeVEV~sran~~ep~gWW~a~V~~i---KGdf~vVey~~~d 81 (191)
+.+|......+|..|..|-+..+. .-+-++|+|... +.+.|.++|.|-+
T Consensus 61 ~~~~~~~~~~~f~~g~~VLAlYP~----TT~FY~A~V~~~p~~~~~~y~l~Fedd~ 112 (130)
T PF07039_consen 61 KAPPDTDPLAEFPKGTKVLALYPD----TTCFYPATVVSPPKKKSGEYKLKFEDDE 112 (130)
T ss_dssp B--TTT-GGGS--TT-EEEEE-TT----SSEEEEEEEEEE-SSTTS-EEEEECTTT
T ss_pred ccCCCCCchhhCCCCCEEEEECCC----CceEEEEEEEeCCCCCCCcEEEEEeCCC
Confidence 444455567799999999998876 779999999999 6779999998664
No 45
>CHL00084 rpl19 ribosomal protein L19
Probab=45.50 E-value=44 Score=26.49 Aligned_cols=43 Identities=21% Similarity=0.172 Sum_probs=26.3
Q ss_pred CCCccCCCeEEEEeecCCCCccceeeEEEEEEeccEEEEEEeccCCcc
Q psy985 37 NSGFVENQEVEVFSKATEGESCGWWKVIVKMIKGDFHVVEYLGCQSTF 84 (191)
Q Consensus 37 ~~~~~egdeVEV~sran~~ep~gWW~a~V~~iKGdf~vVey~~~d~~~ 84 (191)
-++|.+||.|.|..+-.+++ .-++-...|=...+.-.|.++++
T Consensus 20 ~p~f~~GDtV~V~~~i~eg~-----k~R~q~F~GvvI~~r~~G~~~tf 62 (117)
T CHL00084 20 LPKIRVGDTVKVGVLIQEGN-----KERVQFYEGTVIAKKNSGLNTTI 62 (117)
T ss_pred CCccCCCCEEEEEEEEecCC-----eeEeceEEEEEEEEeCCCCCeeE
Confidence 45899999999988766543 23344444444444445666555
No 46
>PF12945 YcgR_2: Flagellar protein YcgR; PDB: 2RDE_B 1YLN_A 3KYG_A.
Probab=44.28 E-value=85 Score=21.69 Aligned_cols=38 Identities=18% Similarity=0.226 Sum_probs=25.6
Q ss_pred ccCCCeEEEEeecCCCCccceeeEEEEEEecc-EEEEEEe
Q psy985 40 FVENQEVEVFSKATEGESCGWWKVIVKMIKGD-FHVVEYL 78 (191)
Q Consensus 40 ~~egdeVEV~sran~~ep~gWW~a~V~~iKGd-f~vVey~ 78 (191)
+..||.|++.....+ +..++.+++|.-+..+ ..+|..-
T Consensus 1 L~iG~~i~i~i~~~~-~~~~~y~S~v~g~~~~~~l~i~~P 39 (87)
T PF12945_consen 1 LKIGQKIEIEITNPT-GEKGRYKSRVIGIDDDRYLIISMP 39 (87)
T ss_dssp --TT-EEEEEEE-TT-S-EEEEEEEEEEEETTTEEEEE--
T ss_pred CCCCCEEEEEEECCC-CceEEEEEEEEEECCCCEEEEEcC
Confidence 467999998663333 4559999999999999 9988875
No 47
>TIGR01024 rplS_bact ribosomal protein L19, bacterial type. This model describes bacterial ribosomoal protein L19 and its chloroplast equivalent. Putative mitochondrial L19 are found in several species (but not Saccharomyces cerevisiae) and score between trusted and noise cutoffs.
Probab=43.90 E-value=45 Score=26.23 Aligned_cols=43 Identities=26% Similarity=0.272 Sum_probs=28.2
Q ss_pred CCCccCCCeEEEEeecCCCCccceeeEEEEEEeccEEEEEEeccCCcc
Q psy985 37 NSGFVENQEVEVFSKATEGESCGWWKVIVKMIKGDFHVVEYLGCQSTF 84 (191)
Q Consensus 37 ~~~~~egdeVEV~sran~~ep~gWW~a~V~~iKGdf~vVey~~~d~~~ 84 (191)
-++|.+||.|.|.+.-.+++ .-++....|=...+.-.|.++++
T Consensus 16 ip~f~~GD~v~V~~~i~eg~-----k~R~q~f~GvvI~~~~~G~~~tf 58 (113)
T TIGR01024 16 LPDFRVGDTVRVHVKIVEGK-----KERIQVFEGVVIARRGGGIGETF 58 (113)
T ss_pred CCccCCCCEEEEEEEEccCC-----ceEcccEEEEEEEEeCCCCceEE
Confidence 46899999999988765543 22444555555555555666665
No 48
>KOG4348|consensus
Probab=43.57 E-value=29 Score=33.75 Aligned_cols=34 Identities=21% Similarity=0.432 Sum_probs=25.3
Q ss_pred CCCccCCCeEEEEeecCCCCccceeeEEEEEEeccE
Q psy985 37 NSGFVENQEVEVFSKATEGESCGWWKVIVKMIKGDF 72 (191)
Q Consensus 37 ~~~~~egdeVEV~sran~~ep~gWW~a~V~~iKGdf 72 (191)
...|.|||.|-.-+ .+-.-.|||.|.+.--||=|
T Consensus 277 ELt~KEgdil~lIt--K~cgdaGWweGELnGk~GvF 310 (627)
T KOG4348|consen 277 ELTLKEGDILILIT--KNCGDAGWWEGELNGKKGVF 310 (627)
T ss_pred ceeeccccEEEEec--ccccccceeeeeecCccccC
Confidence 44677788777766 33345799999999999875
No 49
>cd02789 MopB_CT_FmdC-FwdD The MopB_FmdC-FwdD CD includes the C-terminus of subunit C of molybdenum formylmethanofuran dehydrogenase (FmdC) and subunit D of tungsten formylmethanofuran dehydrogenase (FwdD), and other related proteins. Formylmethanofuran dehydrogenase catalyzes the first step in methane formation from CO2 in methanogenic archaea and some eubacteria. Members of this CD belong to the molybdopterin_binding superfamily of proteins. This CD is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=41.95 E-value=5.8 Score=29.76 Aligned_cols=25 Identities=24% Similarity=0.247 Sum_probs=18.9
Q ss_pred cccCCCCCCCCCccCCCeEEEEeec
Q psy985 28 VRLPPRETINSGFVENQEVEVFSKA 52 (191)
Q Consensus 28 VRlPP~~~~~~~~~egdeVEV~sra 52 (191)
|.+-|.+.....+.+||.|+|+++.
T Consensus 33 v~i~p~dA~~lgi~~Gd~V~v~~~~ 57 (106)
T cd02789 33 CEINPEDYKLLGKPEGDKVKVTSEF 57 (106)
T ss_pred EEECHHHHHHcCCCCCCEEEEEcCC
Confidence 4444444578899999999999854
No 50
>PF11302 DUF3104: Protein of unknown function (DUF3104); InterPro: IPR021453 This family of proteins with unknown function appears to be restricted to Cyanobacteria.
Probab=41.73 E-value=45 Score=24.59 Aligned_cols=33 Identities=18% Similarity=0.306 Sum_probs=25.1
Q ss_pred CccCCCeEEEEeec--CCCCccceeeEEEEEEecc
Q psy985 39 GFVENQEVEVFSKA--TEGESCGWWKVIVKMIKGD 71 (191)
Q Consensus 39 ~~~egdeVEV~sra--n~~ep~gWW~a~V~~iKGd 71 (191)
..+.||.|-|-.-. .....-.||.|.|..+.|.
T Consensus 5 ~Vk~Gd~ViV~~~~~~~~~~~~dWWmg~Vi~~~gg 39 (75)
T PF11302_consen 5 SVKPGDTVIVQDEQEVGQKQDKDWWMGQVIHCEGG 39 (75)
T ss_pred ccCCCCEEEEecCccccccCCCCcEEEEEEEEecc
Confidence 46789999885543 2345569999999999986
No 51
>PF10775 ATP_sub_h: ATP synthase complex subunit h; InterPro: IPR019711 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include: F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. F-ATPases (also known as F1F0-ATPase, or H(+)-transporting two-sector ATPase) (3.6.3.14 from EC) are composed of two linked complexes: the F1 ATPase complex is the catalytic core and is composed of 5 subunits (alpha, beta, gamma, delta, epsilon), while the F0 ATPase complex is the membrane-embedded proton channel that is composed of at least 3 subunits (A-C), nine in mitochondria (A-G, F6, F8). Both the F1 and F0 complexes are rotary motors that are coupled back-to-back. In the F1 complex, the central gamma subunit forms the rotor inside the cylinder made of the alpha(3)beta(3) subunits, while in the F0 complex, the ring-shaped C subunits forms the rotor. The two rotors rotate in opposite directions, but the F0 rotor is usually stronger, using the force from the proton gradient to push the F1 rotor in reverse in order to drive ATP synthesis []. These ATPases can also work in reverse to hydrolyse ATP to create a proton gradient. This entry represents subunit H found in the F0 complex of F-ATPases from fungal mitochondria. Subunit H is homologous to the mammalian factor F6, and is essential for the correct assembly and/or functioning of F-ATPases, since yeast cells lacking it are not able to grow on non-fermentable carbon sources. Subunit H occupies a central place in the peripheral stalk between the F1 sector and the membrane [].
Probab=40.81 E-value=14 Score=26.64 Aligned_cols=18 Identities=22% Similarity=0.602 Sum_probs=16.5
Q ss_pred hHhhhhhhhhhhhhhHHH
Q psy985 158 CQKRASMIKDMHFRNLNQ 175 (191)
Q Consensus 158 ~~kRA~mL~dmHfr~Lrq 175 (191)
+.+|+.+++|+++|.|+.
T Consensus 6 ~~~r~d~VQDLYLkELKa 23 (67)
T PF10775_consen 6 TPRRADLVQDLYLKELKA 23 (67)
T ss_pred ccccccHHHHHHHHHHHh
Confidence 678999999999999985
No 52
>PF07039 DUF1325: SGF29 tudor-like domain; InterPro: IPR010750 SAGA-associated factor 29 is involved in transcriptional regulation, probably through association with histone acetyltransferase (HAT) complexes like the TFTC-HAT or STAGA complexes. It also may be involved in MYC-mediated oncogenic transformation. It is a component of the ATAC complex, which is a complex with histone acetyltransferase activity on histones H3 and H4 []. This entry represents a domain found in yeast and human SAGA-associated factor 29 proteins that is related to the tudor domain. ; PDB: 3MP6_A 3MP1_A 3MP8_A 3MET_B 3ME9_A 3MEU_B 3MEA_A 3MEV_B 3LX7_A 3MEW_A.
Probab=40.37 E-value=71 Score=25.32 Aligned_cols=37 Identities=22% Similarity=0.082 Sum_probs=27.9
Q ss_pred cCCCeEEEEeecCCCCccceeeEEEEEEeccE--EEEEEe
Q psy985 41 VENQEVEVFSKATEGESCGWWKVIVKMIKGDF--HVVEYL 78 (191)
Q Consensus 41 ~egdeVEV~sran~~ep~gWW~a~V~~iKGdf--~vVey~ 78 (191)
+.||+|=++.+ ..++--.|-+|+|....++- |-|+=.
T Consensus 1 q~G~~VAak~~-~~~~~~~WIla~Vv~~~~~~~rYeV~D~ 39 (130)
T PF07039_consen 1 QPGDQVAAKVK-QGNEEEEWILAEVVKYNSDGNRYEVEDP 39 (130)
T ss_dssp -TT-EEEEEEC-TTTTTCEEEEEEEEEEETTTTEEEEEET
T ss_pred CCCCEEEEEcC-CCCCCCCEEEEEEEEEeCCCCEEEEecC
Confidence 47999999888 44456899999999999994 666644
No 53
>cd02783 MopB_CT_2 The MopB_CT_2 CD includes a group of related uncharacterized bacterial and archaeal molybdopterin-binding oxidoreductase-like domains with a putative N-terminal iron-sulfur [4Fe-4S] cluster binding site and molybdopterin cofactor binding site. This CD is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=40.05 E-value=18 Score=28.88 Aligned_cols=38 Identities=18% Similarity=0.135 Sum_probs=27.6
Q ss_pred cCcccccCCCCCCCCCccCCCeEEEEeecCCCCccceeeEEEEE
Q psy985 24 QFSMVRLPPRETINSGFVENQEVEVFSKATEGESCGWWKVIVKM 67 (191)
Q Consensus 24 Pf~~VRlPP~~~~~~~~~egdeVEV~sran~~ep~gWW~a~V~~ 67 (191)
|-.-|.+-|.+.....|.+||.|+|++ +.|-..++++.
T Consensus 30 ~~~~v~inp~dA~~~GI~dGd~V~v~s------~~G~~~~~a~v 67 (156)
T cd02783 30 TRNYLYMHPKTAKELGIKDGDWVWVES------VNGRVKGQARF 67 (156)
T ss_pred CCCEEEECHHHHHHcCCCCCCEEEEEc------CCeeEEEEEEE
Confidence 333466666666889999999999998 44666666653
No 54
>PF13376 OmdA: Bacteriocin-protection, YdeI or OmpD-Associated
Probab=39.59 E-value=26 Score=24.02 Aligned_cols=24 Identities=25% Similarity=0.401 Sum_probs=18.8
Q ss_pred eecCChhHHHHHHh-hhchHHHHHh
Q psy985 111 DIEVPLDVQEFAKM-EGAHKDFQKA 134 (191)
Q Consensus 111 ~i~VPedLre~~~~-~~~h~dF~ka 134 (191)
+++||+||.+..+. +++..-|...
T Consensus 3 ~~~vP~dl~~aL~~~p~a~~~f~~l 27 (63)
T PF13376_consen 3 EVEVPEDLEAALEANPEAKEFFESL 27 (63)
T ss_pred CCCCCHHHHHHHHCCHHHHHHHHHC
Confidence 47899999999888 5676677653
No 55
>cd02790 MopB_CT_Formate-Dh_H Formate dehydrogenase H (Formate-Dh-H) catalyzes the reversible oxidation of formate to CO2 with the release of a proton and two electrons. It is a component of the anaerobic formate hydrogen lyase complex. The E. coli formate dehydrogenase H (Fdh-H) is a monomer composed of a single polypeptide chain with a Mo active site region and a [4Fe-4S] center. This CD (MopB_CT_Formate-Dh_H) is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=39.33 E-value=49 Score=24.15 Aligned_cols=62 Identities=11% Similarity=0.133 Sum_probs=36.2
Q ss_pred cCcccccCCCCCCCCCccCCCeEEEEeecCCCCccceeeEEEE-EEeccEEEEEEeccCCcceeeec
Q psy985 24 QFSMVRLPPRETINSGFVENQEVEVFSKATEGESCGWWKVIVK-MIKGDFHVVEYLGCQSTFTEIVP 89 (191)
Q Consensus 24 Pf~~VRlPP~~~~~~~~~egdeVEV~sran~~ep~gWW~a~V~-~iKGdf~vVey~~~d~~~teIV~ 89 (191)
|-..+.+-|.+..+..+.+||.|+|++.... .. =+++|. .+..+...+.+-.++...|.+++
T Consensus 33 ~~~~v~in~~dA~~lgi~~Gd~V~v~~~~G~--~~--~~v~i~~~i~~g~v~~~~g~~~~~~n~l~~ 95 (116)
T cd02790 33 PEEYVEINPEDAKRLGIEDGEKVRVSSRRGS--VE--VRARVTDRVPEGVVFMPFHFAEAAANLLTN 95 (116)
T ss_pred CCcEEEECHHHHHHcCCCCCCEEEEEcCCEE--EE--EEEEECCCcCCCEEEEecCcCCCcceECcC
Confidence 4455777777778899999999999985421 11 122221 24444555554334444565544
No 56
>cd02785 MopB_CT_4 The MopB_CT_4 CD includes a group of related uncharacterized bacterial and archaeal molybdopterin-binding oxidoreductase-like domains with a putative N-terminal iron-sulfur [4Fe-4S] cluster binding site and molybdopterin cofactor binding site. This CD is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=38.59 E-value=83 Score=23.60 Aligned_cols=37 Identities=22% Similarity=0.298 Sum_probs=26.5
Q ss_pred cCcccccCCCCCCCCCccCCCeEEEEeecCCCCccceeeEEEE
Q psy985 24 QFSMVRLPPRETINSGFVENQEVEVFSKATEGESCGWWKVIVK 66 (191)
Q Consensus 24 Pf~~VRlPP~~~~~~~~~egdeVEV~sran~~ep~gWW~a~V~ 66 (191)
|-.-|.+-|.+..+..+.+||.|+|+++. |.-.++++
T Consensus 30 ~~~~v~i~p~dA~~~gi~~Gd~V~v~s~~------G~i~~~a~ 66 (124)
T cd02785 30 PEPRVKINPIDAAARGIAHGDLVEVYNDR------GSVVCKAK 66 (124)
T ss_pred CCCeEEECHHHHHHcCCCCCCEEEEEeCC------CEEEEEEE
Confidence 33446777766688999999999999954 44455554
No 57
>PF04014 Antitoxin-MazE: Antidote-toxin recognition MazE; InterPro: IPR007159 This domain is found in AbrB from Bacillus subtilis. The product of the abrB gene is an ambiactive repressor and activator of the transcription of genes expressed during the transition state between vegetative growth and the onset of stationary phase and sporulation []. AbrB is thought to interact directly with the transcription initiation regions of genes under its control []. AbrB contains a helix-turn-helix structure, but this domain ends before the helix-turn-helix begins []. The product of the B. subtilis gene spoVT is another member of this family and is also a transcriptional regulator []. DNA-binding activity in this AbrB homologue requires hexamerisation []. Another family member has been isolated from the Sulfolobus solfataricus and has been identified as a homologue of bacterial repressor-like proteins. The Escherichia coli family member SohA or Prl1F appears to be bifunctional and is able to regulate its own expression as well as relieve the export block imposed by high-level synthesis of beta-galactosidase hybrid proteins [].; PDB: 2L66_A 2GLW_A 3TND_D 2W1T_B 2RO5_B 2FY9_A 2RO3_B 1UB4_C 3ZVK_G 1YFB_B ....
Probab=38.52 E-value=6.8 Score=25.34 Aligned_cols=27 Identities=22% Similarity=0.294 Sum_probs=18.8
Q ss_pred ccccCCCCCCCCCccCCCeEEEEeecC
Q psy985 27 MVRLPPRETINSGFVENQEVEVFSKAT 53 (191)
Q Consensus 27 ~VRlPP~~~~~~~~~egdeVEV~sran 53 (191)
.|+||..-.....+.+||+|++....+
T Consensus 8 ~v~iPk~~~~~l~l~~Gd~v~i~~~~~ 34 (47)
T PF04014_consen 8 QVTIPKEIREKLGLKPGDEVEIEVEGD 34 (47)
T ss_dssp EEEE-HHHHHHTTSSTTTEEEEEEETT
T ss_pred eEECCHHHHHHcCCCCCCEEEEEEeCC
Confidence 466776544456889999999987654
No 58
>PRK05659 sulfur carrier protein ThiS; Validated
Probab=36.29 E-value=25 Score=23.80 Aligned_cols=16 Identities=19% Similarity=0.308 Sum_probs=13.5
Q ss_pred CCCCccCCCeEEEEee
Q psy985 36 INSGFVENQEVEVFSK 51 (191)
Q Consensus 36 ~~~~~~egdeVEV~sr 51 (191)
.+..+++||.||+++-
T Consensus 47 ~~~~l~~gD~vei~~~ 62 (66)
T PRK05659 47 ASTALREGDVVEIVHA 62 (66)
T ss_pred CcccCCCCCEEEEEEE
Confidence 5677899999999874
No 59
>cd02791 MopB_CT_Nitrate-R-NapA-like Nitrate reductases, NapA (Nitrate-R-NapA), NasA, and NarB catalyze the reduction of nitrate to nitrite. Monomeric Nas is located in the cytoplasm and participates in nitrogen assimilation. Dimeric Nap is located in the periplasm and is coupled to quinol oxidation via a membrane-anchored tetraheme cytochrome. This CD (MopB_CT_Nitrate-R-Nap) is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs
Probab=35.84 E-value=89 Score=23.01 Aligned_cols=31 Identities=23% Similarity=0.317 Sum_probs=24.5
Q ss_pred cccCcccccCCCCCCCCCccCCCeEEEEeec
Q psy985 22 KFQFSMVRLPPRETINSGFVENQEVEVFSKA 52 (191)
Q Consensus 22 ~vPf~~VRlPP~~~~~~~~~egdeVEV~sra 52 (191)
..|-..+++-|.+..+..+.+||.|+|++..
T Consensus 31 ~~~~~~v~in~~dA~~lgi~~Gd~V~v~~~~ 61 (122)
T cd02791 31 HVPEPYVEIHPEDAARLGLKEGDLVRVTSRR 61 (122)
T ss_pred hCCCCEEEECHHHHHHcCCCCCCEEEEEcCC
Confidence 3444557888877788999999999999854
No 60
>smart00739 KOW KOW (Kyprides, Ouzounis, Woese) motif. Motif in ribosomal proteins, NusG, Spt5p, KIN17 and T54.
Probab=35.72 E-value=79 Score=17.35 Aligned_cols=26 Identities=23% Similarity=0.285 Sum_probs=16.6
Q ss_pred ccCCCeEEEEeecCCCCccceeeEEEEEEec
Q psy985 40 FVENQEVEVFSKATEGESCGWWKVIVKMIKG 70 (191)
Q Consensus 40 ~~egdeVEV~sran~~ep~gWW~a~V~~iKG 70 (191)
|.+||.|.|..=..... .|.|..+.+
T Consensus 2 ~~~G~~V~I~~G~~~g~-----~g~i~~i~~ 27 (28)
T smart00739 2 FEVGDTVRVIAGPFKGK-----VGKVLEVDG 27 (28)
T ss_pred CCCCCEEEEeECCCCCc-----EEEEEEEcC
Confidence 67899999987444333 455655543
No 61
>cd02778 MopB_CT_Thiosulfate-R-like The MopB_CT_Thiosulfate-R-like CD contains thiosulfate-, sulfur-, and polysulfide-reductases, and other related proteins. Thiosulfate reductase catalyzes the cleavage of sulfur-sulfur bonds in thiosulfate. Polysulfide reductase is a membrane-bound enzyme that catalyzes the reduction of polysulfide using either hydrogen or formate as the electron donor. Also included in this CD is the phenylacetyl-CoA:acceptor oxidoreductase, large subunit (PadB2), which has been characterized as a membrane-bound molybdenum-iron-sulfur enzyme involved in anaerobic metabolism of phenylalanine in the denitrifying bacterium Thauera aromatica. The MopB_CT_Thiosulfate-R-like CD is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=35.62 E-value=13 Score=27.67 Aligned_cols=32 Identities=19% Similarity=0.188 Sum_probs=25.8
Q ss_pred cccCcccccCCCCCCCCCccCCCeEEEEeecC
Q psy985 22 KFQFSMVRLPPRETINSGFVENQEVEVFSKAT 53 (191)
Q Consensus 22 ~vPf~~VRlPP~~~~~~~~~egdeVEV~sran 53 (191)
..|-..|.+-|.+.....+.+||.|+|++...
T Consensus 26 ~~~~~~v~i~p~dA~~~gi~~Gd~V~v~s~~G 57 (123)
T cd02778 26 LTPENTLWINPETAARLGIKDGDRVEVSSARG 57 (123)
T ss_pred cCCCCeEEECHHHHHHcCCCCCCEEEEEeCCC
Confidence 44556688888777889999999999999654
No 62
>cd02787 MopB_CT_ydeP The MopB_CT_ydeP CD includes a group of related uncharacterized bacterial molybdopterin-binding oxidoreductase-like domains with a putative molybdopterin cofactor binding site. This CD is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=35.53 E-value=11 Score=27.92 Aligned_cols=27 Identities=19% Similarity=0.281 Sum_probs=21.1
Q ss_pred cccCCCCCCCCCccCCCeEEEEeecCC
Q psy985 28 VRLPPRETINSGFVENQEVEVFSKATE 54 (191)
Q Consensus 28 VRlPP~~~~~~~~~egdeVEV~sran~ 54 (191)
|.+-|.+..+..+.+||.|+|+++...
T Consensus 33 v~i~p~dA~~lgI~dGd~V~v~s~~G~ 59 (112)
T cd02787 33 VFMNPDDIARLGLKAGDRVDLESAFGD 59 (112)
T ss_pred EEECHHHHHHhCCCCCCEEEEEecCCC
Confidence 555555557899999999999997644
No 63
>PF03454 MoeA_C: MoeA C-terminal region (domain IV); InterPro: IPR005111 The majority of molybdenum-containing enzymes utilise a molybdenum cofactor (MoCF or Moco) consisting of a Mo atom coordinated via a cis-dithiolene moiety to molybdopterin (MPT). MoCF is ubiquitous in nature, and the pathway for MoCF biosynthesis is conserved in all three domains of life. MoCF-containing enzymes function as oxidoreductases in carbon, nitrogen, and sulphur metabolism [, ]. In Escherichia coli, biosynthesis of MoCF is a three stage process. It begins with the MoaA and MoaC conversion of GTP to the meta-stable pterin intermediate precursor Z. The second stage involves MPT synthase (MoaD and MoaE), which converts precursor Z to MPT; MoeB is involved in the recycling of MPT synthase. The final step in MoCF synthesis is the attachment of mononuclear Mo to MPT, a process that requires MoeA and which is enhanced by MogA in an Mg2 ATP-dependent manner []. MoCF is the active co-factor in eukaryotic and some prokaryotic molybdo-enzymes, but the majority of bacterial enzymes requiring MoCF, need a modification of MTP for it to be active; MobA is involved in the attachment of a nucleotide monophosphate to MPT resulting in the MGD co-factor, the active co-factor for most prokaryotic molybdo-enzymes. Bacterial two-hybrid studies have revealed the close interactions between MoeA, MogA, and MobA in the synthesis of MoCF []. Moreover the close functional association of MoeA and MogA in the synthesis of MoCF is supported by fact that the known eukaryotic homologues to MoeA and MogA exist as fusion proteins: CNX1 (Q39054 from SWISSPROT) of Arabidopsis thaliana (Mouse-ear cress), mammalian Gephryin (e.g. Q9NQX3 from SWISSPROT) and Drosophila melanogaster (Fruit fly) Cinnamon (P39205 from SWISSPROT) []. This domain is found in proteins involved in biosynthesis of molybdopterin cofactor however the exact molecular function of this domain is uncertain. The structure of this domain is known [] and forms an incomplete beta barrel.; GO: 0032324 molybdopterin cofactor biosynthetic process; PDB: 1T3E_A 2FU3_A 2FTS_A 1WU2_A 1XI8_A 2NRS_A 2NRP_B 2NRO_A 2NQV_A 2NQM_B ....
Probab=35.44 E-value=18 Score=24.65 Aligned_cols=20 Identities=30% Similarity=0.438 Sum_probs=12.6
Q ss_pred cccCCCCCCCCCccCCCeEEEEe
Q psy985 28 VRLPPRETINSGFVENQEVEVFS 50 (191)
Q Consensus 28 VRlPP~~~~~~~~~egdeVEV~s 50 (191)
+.+|+ ....+..||+||||-
T Consensus 52 ~~ip~---~~~~~~~G~~V~v~l 71 (72)
T PF03454_consen 52 IVIPE---GVEGLEAGEEVEVIL 71 (72)
T ss_dssp EEEET---T-SEE-TTEEEEEEE
T ss_pred EEeCC---CCCccCCCCEEEEEE
Confidence 34554 344599999999974
No 64
>TIGR01843 type_I_hlyD type I secretion membrane fusion protein, HlyD family. Type I secretion is an ABC transport process that exports proteins, without cleavage of any signal sequence, from the cytosol to extracellular medium across both inner and outer membranes. The secretion signal is found in the C-terminus of the transported protein. This model represents the adaptor protein between the ATP-binding cassette (ABC) protein of the inner membrane and the outer membrane protein, and is called the membrane fusion protein. This model selects a subfamily closely related to HlyD; it is defined narrowly and excludes, for example, colicin V secretion protein CvaA and multidrug efflux proteins.
Probab=32.91 E-value=52 Score=29.25 Aligned_cols=35 Identities=23% Similarity=-0.004 Sum_probs=29.7
Q ss_pred CCCccCCCeEEEEeecCCCCccceeeEEEEEEecc
Q psy985 37 NSGFVENQEVEVFSKATEGESCGWWKVIVKMIKGD 71 (191)
Q Consensus 37 ~~~~~egdeVEV~sran~~ep~gWW~a~V~~iKGd 71 (191)
-..+..||.|.|...+.+....|.+.|+|..|..+
T Consensus 321 ~~~i~~G~~v~v~~~~~~~~~~~~~~g~V~~i~~~ 355 (423)
T TIGR01843 321 IGFVHVGQPAEIKFSAFPYRRYGILNGKVKSISPD 355 (423)
T ss_pred hhhhCCCCceEEEEecCCCcccCCccEEEEEECCC
Confidence 45788999999988887778888999999999654
No 65
>PRK09798 antitoxin MazE; Provisional
Probab=32.38 E-value=18 Score=26.68 Aligned_cols=24 Identities=21% Similarity=0.217 Sum_probs=19.9
Q ss_pred ccccCCCCCCCCCccCCCeEEEEe
Q psy985 27 MVRLPPRETINSGFVENQEVEVFS 50 (191)
Q Consensus 27 ~VRlPP~~~~~~~~~egdeVEV~s 50 (191)
-||+|..--....+.+||+||+-.
T Consensus 14 ~vRIPk~~l~~l~l~~g~~vei~v 37 (82)
T PRK09798 14 AVRIPATLMQALNLNIDDEVKIDL 37 (82)
T ss_pred eEEcCHHHHHHcCCCCCCEEEEEE
Confidence 489998766678899999999876
No 66
>PF00868 Transglut_N: Transglutaminase family; InterPro: IPR001102 Synonym(s): Protein-glutamine gamma-glutamyltransferase, Fibrinoligase, TGase Protein-glutamine gamma-glutamyltransferases (2.3.2.13 from EC) (TGase) are calcium-dependent enzymes that catalyse the cross-linking of proteins by promoting the formation of isopeptide bonds between the gamma-carboxyl group of a glutamine in one polypeptide chain and the epsilon-amino group of a lysine in a second polypeptide chain. TGases also catalyse the conjugation of polyamines to proteins [, ]. Transglutaminases are widely distributed in various organs, tissues and body fluids. The best known transglutaminase is blood coagulation factor XIII, a plasma tetrameric protein composed of two catalytic A subunits and two non-catalytic B subunits. Factor XIII is responsible for cross-linking fibrin chains, thus stabilising the fibrin clot. There are commonly three domains: N-terminal, middle (IPR013808 from INTERPRO) and C-terminal (IPR013807 from INTERPRO). This entry represents the N-terminal domain found in transglutaminases.; GO: 0018149 peptide cross-linking; PDB: 1L9N_B 1NUF_A 1NUD_A 1NUG_B 1L9M_A 1KV3_C 3S3S_A 2Q3Z_A 3LY6_A 3S3P_A ....
Probab=32.28 E-value=2.2e+02 Score=21.84 Aligned_cols=68 Identities=16% Similarity=0.216 Sum_probs=41.8
Q ss_pred cceeEEEEeecCCCCcccccccCcccccCCCCCCCCCccCCCeEEEEeecCCCCccceeeEEEEEEeccEEEEEE
Q psy985 3 GHSFNVSNPFSLLRWQPESKFQFSMVRLPPRETINSGFVENQEVEVFSKATEGESCGWWKVIVKMIKGDFHVVEY 77 (191)
Q Consensus 3 ~~~~~v~V~f~eN~Wqpe~~vPf~~VRlPP~~~~~~~~~egdeVEV~sran~~ep~gWW~a~V~~iKGdf~vVey 77 (191)
|.-|.+++.| ...+.|+. +.++|--.-+..+...-|-.|-+-. ......+=|.|+|....|+..-|+-
T Consensus 32 GQ~F~i~l~f-~r~~~~~~----d~l~l~~~~G~~P~~~~gT~~~~~~--~~~~~~~~W~a~v~~~~~~~~tv~V 99 (118)
T PF00868_consen 32 GQPFTITLRF-NRPFDPSK----DQLSLEFETGPNPSESKGTKVVFPV--SSSLDSSSWSARVESQDGNSVTVSV 99 (118)
T ss_dssp TSEEEEEEEE-SSS--TTT----EEEEEEEEESSS--TTTTSEEEEEE--CSSS-TSSSEEEEEEEETTEEEEEE
T ss_pred CCEEEEEEEE-cCCcCCCC----cEEEEEEEEecccccCCCcEEEEEE--ccCCCCCCEEEEEEecCCCEEEEEE
Confidence 5679999999 66666553 3444443334444556677776654 4445566678999999999766554
No 67
>cd02858 Esterase_N_term Esterase N-terminal domain. Esterases catalyze the hydrolysis of organic esters to release an alcohol or thiol and acid. The term can be applied to enzymes that hydrolyze carboxylate, phosphate and sulphate esters, but is more often restricted to the first class of substrate. The N-terminus of esterase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase.
Probab=32.07 E-value=43 Score=24.02 Aligned_cols=21 Identities=19% Similarity=0.414 Sum_probs=17.3
Q ss_pred cceeeEEEEEEeccEEEEEEe
Q psy985 58 CGWWKVIVKMIKGDFHVVEYL 78 (191)
Q Consensus 58 ~gWW~a~V~~iKGdf~vVey~ 78 (191)
-|||.++|..++|.+|.-+|.
T Consensus 39 ~G~W~~~v~~l~~g~Y~Y~~~ 59 (85)
T cd02858 39 AGVWSVTTGPLAPGIYTYSFL 59 (85)
T ss_pred CeEEEEEECCCCCcEEEEEEE
Confidence 599999997788888777765
No 68
>cd00508 MopB_CT_Fdh-Nap-like This CD includes formate dehydrogenases (Fdh) H and N; nitrate reductases, Nap and Nas; and other related proteins. Formate dehydrogenase H is a component of the anaerobic formate hydrogen lyase complex and catalyzes the reversible oxidation of formate to CO2 with the release of a proton and two electrons. Formate dehydrogenase N (alpha subunit) is the major electron donor to the bacterial nitrate respiratory chain and nitrate reductases, Nap and Nas, catalyze the reduction of nitrate to nitrite. This CD (MopB_CT_Fdh-Nap-like) is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=31.96 E-value=16 Score=26.82 Aligned_cols=29 Identities=21% Similarity=0.322 Sum_probs=23.2
Q ss_pred cCcccccCCCCCCCCCccCCCeEEEEeec
Q psy985 24 QFSMVRLPPRETINSGFVENQEVEVFSKA 52 (191)
Q Consensus 24 Pf~~VRlPP~~~~~~~~~egdeVEV~sra 52 (191)
|..-+.+-|.+..+..+.+||.|+|++..
T Consensus 33 ~~~~v~inp~dA~~lgi~~Gd~V~v~~~~ 61 (120)
T cd00508 33 PEPFVEIHPEDAARLGIKDGDLVRVSSRR 61 (120)
T ss_pred CCCEEEECHHHHHHcCCCCCCEEEEEeCC
Confidence 44457777777788999999999999844
No 69
>PRK11347 antitoxin ChpS; Provisional
Probab=31.94 E-value=17 Score=26.83 Aligned_cols=26 Identities=15% Similarity=0.059 Sum_probs=21.2
Q ss_pred ccccCCCCCCCCCccCCCeEEEEeec
Q psy985 27 MVRLPPRETINSGFVENQEVEVFSKA 52 (191)
Q Consensus 27 ~VRlPP~~~~~~~~~egdeVEV~sra 52 (191)
-||||..--....+.+||+|+|-...
T Consensus 13 ~vriPk~il~~l~l~~G~~v~i~v~~ 38 (83)
T PRK11347 13 GMVIPNIVMKELNLQPGQSVEAQVSN 38 (83)
T ss_pred eEEeCHHHHHHcCCCCCCEEEEEEEC
Confidence 48999876678899999999997643
No 70
>PF08239 SH3_3: Bacterial SH3 domain; InterPro: IPR013247 SH3 (src Homology-3) domains are small protein modules containing approximately 50 amino acid residues [, ]. They are found in a great variety of intracellular or membrane-associated proteins [, , ] for example, in a variety of proteins with enzymatic activity, in adaptor proteins that lack catalytic sequences and in cytoskeletal proteins, such as fodrin and yeast actin binding protein ABP-1. The SH3 domain has a characteristic fold which consists of five or six beta-strands arranged as two tightly packed anti-parallel beta sheets. The linker regions may contain short helices []. The surface of the SH3-domain bears a flat, hydrophobic ligand-binding pocket which consists of three shallow grooves defined by conservative aromatic residues in which the ligand adopts an extended left-handed helical arrangement. The ligand binds with low affinity but this may be enhanced by multiple interactions. The region bound by the SH3 domain is in all cases proline-rich and contains PXXP as a core-conserved binding motif. The function of the SH3 domain is not well understood but they may mediate many diverse processes such as increasing local concentration of proteins, altering their subcellular location and mediating the assembly of large multiprotein complexes []. A homologue of the SH3 domain has been found in a number of different bacterial proteins including glycyl-glycine endopeptidase, bacteriocin and some hypothetical proteins.; PDB: 3PVQ_B 3NPF_B 3H41_A 2KQ8_A 2KRS_A 2KYB_A 2KT8_A.
Probab=30.88 E-value=1.1e+02 Score=19.54 Aligned_cols=40 Identities=23% Similarity=0.487 Sum_probs=25.3
Q ss_pred ccccCCCCC--CCCCccCCCeEEEEeecCCCCccceeeEEEEEEecc
Q psy985 27 MVRLPPRET--INSGFVENQEVEVFSKATEGESCGWWKVIVKMIKGD 71 (191)
Q Consensus 27 ~VRlPP~~~--~~~~~~egdeVEV~sran~~ep~gWW~a~V~~iKGd 71 (191)
+||--|... .-..+..|+.|+|+...++.. | .+|+.-+|.
T Consensus 4 nvR~~p~~~s~~i~~l~~g~~v~v~~~~~~~~---W--~~V~~~~g~ 45 (55)
T PF08239_consen 4 NVRSGPSTNSPVIGQLPKGEKVTVLGESGDGN---W--YKVRTYDGK 45 (55)
T ss_dssp EEESSSSTTSTEEEEEETTSEEEEEEEETT-----E--EEEEEETTE
T ss_pred EEEeCCCCCChhhEEEeCCCEEEEEEEcCCcE---E--EEEECcCCc
Confidence 466666654 233678999999999888766 5 444444443
No 71
>PRK06944 sulfur carrier protein ThiS; Provisional
Probab=30.84 E-value=43 Score=22.46 Aligned_cols=16 Identities=0% Similarity=0.189 Sum_probs=13.3
Q ss_pred CCCCccCCCeEEEEee
Q psy985 36 INSGFVENQEVEVFSK 51 (191)
Q Consensus 36 ~~~~~~egdeVEV~sr 51 (191)
.+..+++||+||++..
T Consensus 46 ~~~~L~~gD~vei~~~ 61 (65)
T PRK06944 46 AARALAAGDRLDLVQP 61 (65)
T ss_pred ccccCCCCCEEEEEee
Confidence 4567999999999875
No 72
>KOG3766|consensus
Probab=30.80 E-value=1.3e+02 Score=29.19 Aligned_cols=54 Identities=13% Similarity=0.139 Sum_probs=42.8
Q ss_pred CCCCccCCCeEEEEeecCCCCccceeeEEEEEEecc-EEEEEEeccCCc-ceeeecCCC
Q psy985 36 INSGFVENQEVEVFSKATEGESCGWWKVIVKMIKGD-FHVVEYLGCQST-FTEIVPPER 92 (191)
Q Consensus 36 ~~~~~~egdeVEV~sran~~ep~gWW~a~V~~iKGd-f~vVey~~~d~~-~teIV~~er 92 (191)
..--|++||.+|+.-+.|..+.| +|+|..+.|+ +.-|.+.+|+.+ |...+-.+.
T Consensus 197 ~~~~F~vgmkLEavd~~np~~Ic---vATV~~V~~~~~i~v~~d~~~~~~~d~~~~~~s 252 (478)
T KOG3766|consen 197 PPSRFQVGMKLEAVDDLNPSAIC---VATVVEVFDSREILVHFDGWDKSELDYWCDHDS 252 (478)
T ss_pred CCCcceeccEEEEeccCCCccee---eeehheecccceEEEEeccCCCcccceeEecCC
Confidence 34578999999998777766666 7899999999 699999999877 665555544
No 73
>COG4954 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=30.79 E-value=53 Score=26.22 Aligned_cols=63 Identities=30% Similarity=0.364 Sum_probs=40.8
Q ss_pred EEeecCChhHHHHHHh-------hhchHHHHHhhh-----ccEEEEcC-CCCeeEEEeCchhhHhhhhhhhhhhhhhHHH
Q psy985 109 KFDIEVPLDVQEFAKM-------EGAHKDFQKAIA-----AAIVRYIP-DKGALSVISRDEACQKRASMIKDMHFRNLNQ 175 (191)
Q Consensus 109 k~~i~VPedLre~~~~-------~~~h~dF~kaig-----a~~v~y~p-e~~~Lvvls~~e~~~kRA~mL~dmHfr~Lrq 175 (191)
|+.|.+-+||.-|-+- -++-..|-.-|| ++-=.|+| .-.+++|+|+++++. ...|=|.|-.
T Consensus 5 kiaivlrddlavwqklnvtaflmsgivaq~peiigepyrd~agn~ynplsiqpvivls~dqetl------~aihrraler 78 (135)
T COG4954 5 KIAIVLRDDLAVWQKLNVTAFLMSGIVAQSPEIIGEPYRDAAGNTYNPLSIQPVIVLSGDQETL------KAIHRRALER 78 (135)
T ss_pred eEEEEEecchHHHHHhhHHHHHHhhhhhcCchhcCcccccccCCccCccccceEEEEeCCHHHH------HHHHHHHHhc
Confidence 5677788888777542 233333334444 33446777 467889999999984 4578787766
Q ss_pred HH
Q psy985 176 KV 177 (191)
Q Consensus 176 K~ 177 (191)
|+
T Consensus 79 ~v 80 (135)
T COG4954 79 KV 80 (135)
T ss_pred CC
Confidence 64
No 74
>PRK10800 acyl-CoA thioesterase YbgC; Provisional
Probab=30.70 E-value=2.2e+02 Score=21.11 Aligned_cols=21 Identities=19% Similarity=-0.001 Sum_probs=16.0
Q ss_pred CCCCccCCCeEEEEeecCCCC
Q psy985 36 INSGFVENQEVEVFSKATEGE 56 (191)
Q Consensus 36 ~~~~~~egdeVEV~sran~~e 56 (191)
+...+.-||+|+|-+....-.
T Consensus 63 y~~~~~~~d~i~v~t~v~~~~ 83 (130)
T PRK10800 63 YYAPARLDDMLEVQSEITSMR 83 (130)
T ss_pred EcCcccCCCEEEEEEEEEeeC
Confidence 667788899999988655443
No 75
>KOG2039|consensus
Probab=30.69 E-value=94 Score=32.18 Aligned_cols=61 Identities=13% Similarity=0.218 Sum_probs=44.5
Q ss_pred CCCCCCCCccCCCeEEE-EeecCCCCccceeeEEEEEEec-cEEEEEEeccCCcceeeecCCCcccCCCC
Q psy985 32 PRETINSGFVENQEVEV-FSKATEGESCGWWKVIVKMIKG-DFHVVEYLGCQSTFTEIVPPERMRQKNTN 99 (191)
Q Consensus 32 P~~~~~~~~~egdeVEV-~sran~~ep~gWW~a~V~~iKG-df~vVey~~~d~~~teIV~~erLR~~n~n 99 (191)
|+....+++.-||-+-. |+ .| -.|.+|+|+.+.. +=+.|-|. |-.-+|.+++-+|++.++.
T Consensus 688 ~~~~~~~~p~~gd~c~A~y~--~D---~qwyRa~i~~V~~~~~~~V~yi--Dygn~E~lp~~~l~~lp~~ 750 (875)
T KOG2039|consen 688 PPSSGSYTPKRGDLCVAKYS--LD---GQWYRALIVEVLDPESMEVFYI--DYGNIETLPFVRLKPLPPH 750 (875)
T ss_pred ccccCCCCCCCCCeeeeeec--cc---cceeeeeeeeeccCcceeEEEE--ecCcccccccccccCCChH
Confidence 33345788999998877 55 22 2599999999988 54444444 3345899999999999865
No 76
>KOG4225|consensus
Probab=30.50 E-value=39 Score=32.68 Aligned_cols=32 Identities=22% Similarity=0.481 Sum_probs=25.0
Q ss_pred CCCCccCCCeEEEEeecCCCCccceeeEEEEEEeccE
Q psy985 36 INSGFVENQEVEVFSKATEGESCGWWKVIVKMIKGDF 72 (191)
Q Consensus 36 ~~~~~~egdeVEV~sran~~ep~gWW~a~V~~iKGdf 72 (191)
.+.++.|||.|.|..+-|| ||.-|.=. .+|.|
T Consensus 447 deLEl~egDii~VmeKcdd----gWfvGts~-rtg~f 478 (489)
T KOG4225|consen 447 DELELREGDIIDVMEKCDD----GWFVGTSR-RTGKF 478 (489)
T ss_pred hhheeccCCEEeeeecccC----cceeccce-ecccc
Confidence 4678999999999999998 89988533 34443
No 77
>KOG1118|consensus
Probab=30.20 E-value=43 Score=31.12 Aligned_cols=38 Identities=16% Similarity=0.264 Sum_probs=27.5
Q ss_pred CCCCccCCCeEEEEeecCCCCccceeeEEEEEEeccEEEEEEe
Q psy985 36 INSGFVENQEVEVFSKATEGESCGWWKVIVKMIKGDFHVVEYL 78 (191)
Q Consensus 36 ~~~~~~egdeVEV~sran~~ep~gWW~a~V~~iKGdf~vVey~ 78 (191)
...+|++||-++|-..-+ .+||-|.+.--.| ||-|.|.
T Consensus 321 gEL~fkeGDlI~l~~QId----enWyeG~~~g~sG-~FPvnYv 358 (366)
T KOG1118|consen 321 GELDFKEGDLITLTNQID----ENWYEGEKHGESG-MFPVNYV 358 (366)
T ss_pred CccCcccCceeeehhhcC----cchhhheecCccC-cccccee
Confidence 567889999999965444 5899997765533 6666664
No 78
>TIGR01683 thiS thiamine biosynthesis protein ThiS. This model represents ThiS, a small thiamine-biosynthesis protein related to MoaD, a molybdenum cofactor biosynthesis protein. Both proteins are involved in sulfur transfer. ThiS has a conserved Gly-Gly C-terminus that is modified, in reactions requiring ThiI, ThiF, IscS, and a sulfur atom from Cys, into the thiocarboxylate that provides the sulfur for thiazole biosynthesis.
Probab=29.57 E-value=38 Score=22.97 Aligned_cols=17 Identities=12% Similarity=0.182 Sum_probs=13.7
Q ss_pred CCCCccCCCeEEEEeec
Q psy985 36 INSGFVENQEVEVFSKA 52 (191)
Q Consensus 36 ~~~~~~egdeVEV~sra 52 (191)
....+++||+||++...
T Consensus 45 ~~~~L~~gD~veii~~V 61 (64)
T TIGR01683 45 DDTILKEGDRIEIVTFV 61 (64)
T ss_pred CceecCCCCEEEEEEec
Confidence 34679999999999753
No 79
>COG2996 Predicted RNA-bindining protein (contains S1 and HTH domains) [General function prediction only]
Probab=29.55 E-value=55 Score=29.75 Aligned_cols=68 Identities=19% Similarity=0.217 Sum_probs=42.4
Q ss_pred ccccCCCCCCCCCccCCCeEEEEeecCCC--------Cc----cceeeEEEEEEeccEEEEEEeccCCcceeeecCCCcc
Q psy985 27 MVRLPPRETINSGFVENQEVEVFSKATEG--------ES----CGWWKVIVKMIKGDFHVVEYLGCQSTFTEIVPPERMR 94 (191)
Q Consensus 27 ~VRlPP~~~~~~~~~egdeVEV~sran~~--------ep----~gWW~a~V~~iKGdf~vVey~~~d~~~teIV~~erLR 94 (191)
++.||-......++.+||+|+||...|.. +| -.|=-++|..++-+.=+ |..|--+-.-.|+.+.|.
T Consensus 31 ~ilL~k~~~~~~e~evGdev~vFiY~D~~~rl~aTt~~p~~tvg~~g~~~Vv~v~~~lGa--FlD~Gl~KDl~vp~~elp 108 (287)
T COG2996 31 TILLPKSEPEEDELEVGDEVTVFIYVDSEDRLIATTREPKATVGEYGWLKVVEVNKDLGA--FLDWGLPKDLLVPLDELP 108 (287)
T ss_pred EEeccccCCcCCccccCcEEEEEEEECCCCceeheeecceEeecceeEEEEEEEcCCcce--EEecCCCcceeeehhhcc
Confidence 67777665678899999999998876522 33 22333777777733211 123333445567777777
Q ss_pred cC
Q psy985 95 QK 96 (191)
Q Consensus 95 ~~ 96 (191)
+.
T Consensus 109 ~~ 110 (287)
T COG2996 109 TL 110 (287)
T ss_pred cc
Confidence 64
No 80
>KOG4226|consensus
Probab=29.40 E-value=58 Score=30.03 Aligned_cols=69 Identities=20% Similarity=0.307 Sum_probs=41.5
Q ss_pred ccCcccccCCCCC------------------CCCCccCCCeEEEEeecCCCCccceeeEEEEEEeccE---EEEEEeccC
Q psy985 23 FQFSMVRLPPRET------------------INSGFVENQEVEVFSKATEGESCGWWKVIVKMIKGDF---HVVEYLGCQ 81 (191)
Q Consensus 23 vPf~~VRlPP~~~------------------~~~~~~egdeVEV~sran~~ep~gWW~a~V~~iKGdf---~vVey~~~d 81 (191)
-|-.+.|+||..- ....+.-|+.|-|.-+.+| |||+|.-----|=| ||-| .+|
T Consensus 91 sp~~d~~~ppd~~~~~~t~AvVKf~Y~a~~eDELsLtKGtrv~vmEKssD----GWWrG~~ng~VGWFPSNYv~E--~~d 164 (379)
T KOG4226|consen 91 SPSTDAEYPPDRIYDLNTPAVVKFNYVAEREDELSLTKGTRVTVMEKSSD----GWWRGSYNGQVGWFPSNYVTE--EVD 164 (379)
T ss_pred CCCccccCCcchhhhcCCceEEEEeeccccccccccccCcEEEEEEeccC----cceecccCCeeccccccceeh--hcc
Confidence 4666778887531 1234567999999888877 89998752221212 3333 235
Q ss_pred CcceeeecCCCcccCC
Q psy985 82 STFTEIVPPERMRQKN 97 (191)
Q Consensus 82 ~~~teIV~~erLR~~n 97 (191)
++--+=.+...+|+..
T Consensus 165 s~~gd~~s~~~~~~~A 180 (379)
T KOG4226|consen 165 SAAGDSPSFLSLRKAA 180 (379)
T ss_pred ccccCCccceecchhh
Confidence 5555556666777643
No 81
>PRK04306 50S ribosomal protein L21e; Reviewed
Probab=28.66 E-value=1.7e+02 Score=22.54 Aligned_cols=62 Identities=23% Similarity=0.204 Sum_probs=46.3
Q ss_pred CCCCccCCCeEEEEeecC--CCCccce---eeEEEEEEeccEEEEEEeccCCcceeeecCCCcccCC
Q psy985 36 INSGFVENQEVEVFSKAT--EGESCGW---WKVIVKMIKGDFHVVEYLGCQSTFTEIVPPERMRQKN 97 (191)
Q Consensus 36 ~~~~~~egdeVEV~sran--~~ep~gW---W~a~V~~iKGdf~vVey~~~d~~~teIV~~erLR~~n 97 (191)
+-.+|.+||.|.+-.-.. ..-|.-- =.|+|.-+.|.-|.|+-...+..-.-||..|.||+..
T Consensus 31 ~l~~y~~Gd~V~I~~d~sv~kGmPh~~yhGkTG~V~~v~~~A~~V~v~vg~k~Kri~vr~eHlk~~~ 97 (98)
T PRK04306 31 ALQEFEEGDKVHIVIDPSVHKGMPHPRFHGKTGTVVGKRGRAYIVEVKDGGKEKTLIVRPEHLRPQK 97 (98)
T ss_pred HHHhccCCCEEEEEecCceecCCccccccCCCEEEEeecCeEEEEEEEECCceeEEEcCHHHcCccC
Confidence 556899999999976443 2223222 2578999999988888777777788999999999864
No 82
>cd02775 MopB_CT Molybdopterin-Binding, C-terminal (MopB_CT) domain of the MopB superfamily of proteins, a large, diverse, heterogeneous superfamily of enzymes that, in general, bind molybdopterin as a cofactor. The MopB domain is found in a wide variety of molybdenum- and tungsten-containing enzymes, including formate dehydrogenase-H (Fdh-H) and -N (Fdh-N), several forms of nitrate reductase (Nap, Nas, NarG), dimethylsulfoxide reductase (DMSOR), thiosulfate reductase, formylmethanofuran dehydrogenase, and arsenite oxidase. Molybdenum is present in most of these enzymes in the form of molybdopterin, a modified pterin ring with a dithiolene side chain, which is responsible for ligating the Mo. In many bacterial and archaeal species, molybdopterin is in the form of a dinucleotide, with two molybdopterin dinucleotide units per molybdenum. These proteins can function as monomers, heterodimers, or heterotrimers, depending on the protein and organism. Also included in the MopB superfamily is
Probab=28.51 E-value=20 Score=25.40 Aligned_cols=38 Identities=21% Similarity=0.285 Sum_probs=28.3
Q ss_pred CCCcccccccCcccccCCCCCCCCCccCCCeEEEEeecC
Q psy985 15 LRWQPESKFQFSMVRLPPRETINSGFVENQEVEVFSKAT 53 (191)
Q Consensus 15 N~Wqpe~~vPf~~VRlPP~~~~~~~~~egdeVEV~sran 53 (191)
+.|-.+ ..|...|.+.|.+.....+.+||.|+|++...
T Consensus 13 ~~~l~~-~~~~~~v~~~~~da~~lgl~~Gd~v~v~~~~g 50 (101)
T cd02775 13 NPWLRE-LAPEPVVEINPEDAAALGIKDGDLVRVESRRG 50 (101)
T ss_pred CHHHHh-cCCCCEEEECHHHHHHcCCCCCCEEEEEcCCc
Confidence 344433 44557788888877889999999999998543
No 83
>cd02776 MopB_CT_Nitrate-R-NarG-like Respiratory nitrate reductase A (NarGHI), alpha chain (NarG) and related proteins. Under anaerobic conditions in the presence of nitrate, E. coli synthesizes the cytoplasmic membrane-bound quinol-nitrate oxidoreductase (NarGHI), which reduces nitrate to nitrite and forms part of a redox loop generating a proton-motive force. Found in prokaryotes and some archaea, NarGHI usually functions as a heterotrimer. The alpha chain contains the molybdenum cofactor-containing Mo-bisMGD catalytic subunit. This CD (MopB_CT_Nitrate-R-NarG-like) is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=28.41 E-value=17 Score=28.77 Aligned_cols=29 Identities=28% Similarity=0.445 Sum_probs=23.2
Q ss_pred cCcccccCCCCCCCCCccCCCeEEEEeec
Q psy985 24 QFSMVRLPPRETINSGFVENQEVEVFSKA 52 (191)
Q Consensus 24 Pf~~VRlPP~~~~~~~~~egdeVEV~sra 52 (191)
|-..|.+-|.+.....+.+||.|+|++..
T Consensus 29 ~~~~v~inp~dA~~lgI~dGd~V~v~~~~ 57 (141)
T cd02776 29 GGPVVWMNPKDAAELGIKDNDWVEVFNDN 57 (141)
T ss_pred CCCEEEECHHHHHHcCCCCCCEEEEEeCC
Confidence 34457777777788999999999999864
No 84
>PF13279 4HBT_2: Thioesterase-like superfamily; PDB: 2W3X_E 3CK1_A 2GF6_C 2NUJ_A 2HLJ_A 2XFL_B 2XEM_B 2OIW_B 2HX5_A 2FUJ_A ....
Probab=28.18 E-value=1.2e+02 Score=22.03 Aligned_cols=17 Identities=12% Similarity=0.032 Sum_probs=11.3
Q ss_pred CCCCccCCCeEEEEeec
Q psy985 36 INSGFVENQEVEVFSKA 52 (191)
Q Consensus 36 ~~~~~~egdeVEV~sra 52 (191)
+..++.-||+++|-++-
T Consensus 53 y~~~~~~~d~~~v~~~~ 69 (121)
T PF13279_consen 53 YLRPLRFGDRLEVETRV 69 (121)
T ss_dssp E-S--BTTSEEEEEEEE
T ss_pred EcccccCCCEEEEEEEE
Confidence 56778889999998754
No 85
>PF01568 Molydop_binding: Molydopterin dinucleotide binding domain; InterPro: IPR006657 A domain in this entry corresponds to the C-terminal domain IV in dimethyl sulphoxide (DMSO)reductase which interacts with the 2-amino pyrimidone ring of both molybdopterin guanine dinucleotide molecules [].; GO: 0016491 oxidoreductase activity, 0030151 molybdenum ion binding, 0055114 oxidation-reduction process; PDB: 2IVF_A 1OGY_G 3ML1_A 3O5A_A 1TI2_G 1VLE_M 1VLD_U 1VLF_O 1TI4_I 1TI6_E ....
Probab=27.85 E-value=58 Score=23.55 Aligned_cols=64 Identities=17% Similarity=0.199 Sum_probs=34.6
Q ss_pred ccccCCCCCCCCCccCCCeEEEEeecCCCCccceeeEEEEE---EeccEEEEEEeccC----CcceeeecCCCcccCC
Q psy985 27 MVRLPPRETINSGFVENQEVEVFSKATEGESCGWWKVIVKM---IKGDFHVVEYLGCQ----STFTEIVPPERMRQKN 97 (191)
Q Consensus 27 ~VRlPP~~~~~~~~~egdeVEV~sran~~ep~gWW~a~V~~---iKGdf~vVey~~~d----~~~teIV~~erLR~~n 97 (191)
-|++-|.+..+..+.+||.|+|+|+.. ==.++|+- |+-+...+.+..+. ...|.|+ .+++=|..
T Consensus 31 ~v~inp~dA~~~Gi~~Gd~V~v~s~~G------~v~~~v~~~~~v~~g~v~~~~~~~~~~~g~~~N~L~-~~~~d~~~ 101 (110)
T PF01568_consen 31 FVEINPEDAAKLGIKDGDWVRVSSPRG------SVEVRVKVTDGVPPGVVFMPHGWGGAAKGGSVNALT-PDRVDPIS 101 (110)
T ss_dssp EEEEEHHHHHHCT--TTCEEEEEETTE------EEEEEEEEETTS-TTEEEEESTHTTSBSSS-GGGSS--SSSSTTT
T ss_pred EEEEcHHHHHHhcCcCCCEEEEEeccc------eEeeeeEEecCCcCCEEEEeccccccccCCcceECC-CCCCCccc
Confidence 677777777889999999999999543 22233322 22234444444333 4467776 44444433
No 86
>PF12158 DUF3592: Protein of unknown function (DUF3592); InterPro: IPR021994 This family of proteins is functionally uncharacterised.This family of proteins is found in bacteria, archaea, eukaryotes and viruses. Proteins in this family are typically between 150 and 242 amino acids in length.
Probab=27.71 E-value=1.4e+02 Score=22.56 Aligned_cols=18 Identities=11% Similarity=0.156 Sum_probs=13.1
Q ss_pred CCCCccCCCeEEEEeecC
Q psy985 36 INSGFVENQEVEVFSKAT 53 (191)
Q Consensus 36 ~~~~~~egdeVEV~sran 53 (191)
....+..||+|+||.-.+
T Consensus 88 ~~~~~~~G~~V~V~Y~P~ 105 (148)
T PF12158_consen 88 YWPKYPIGDTVTVYYNPN 105 (148)
T ss_pred CCccCCCcCEEEEEECCc
Confidence 445678999999966443
No 87
>TIGR02609 doc_partner putative addiction module antidote. Members of this protein family are putative addiction module antidote proteins that appear recurringly in two-gene operons with members of the Doc (death-on-curing) family TIGR01550. Members of this family contain a SpoVT/AbrB-like domain (pfam04014). Note that the gene pairs with a member of this family tend to be found on bacterial chromosomes, not on plasmids.
Probab=27.70 E-value=22 Score=25.34 Aligned_cols=24 Identities=25% Similarity=0.266 Sum_probs=18.3
Q ss_pred ccccCCCCCCCCCccCCCeEEEEe
Q psy985 27 MVRLPPRETINSGFVENQEVEVFS 50 (191)
Q Consensus 27 ~VRlPP~~~~~~~~~egdeVEV~s 50 (191)
-||||..--....+.+||+|++..
T Consensus 11 ~vtIPk~i~~~lgl~~Gd~v~v~~ 34 (74)
T TIGR02609 11 VVTLPKEVLESLGLKEGDTLYVDE 34 (74)
T ss_pred EEEECHHHHHHcCcCCCCEEEEEE
Confidence 378887654568899999998755
No 88
>PRK05863 sulfur carrier protein ThiS; Provisional
Probab=27.66 E-value=36 Score=23.41 Aligned_cols=15 Identities=13% Similarity=0.299 Sum_probs=12.3
Q ss_pred CCCccCCCeEEEEee
Q psy985 37 NSGFVENQEVEVFSK 51 (191)
Q Consensus 37 ~~~~~egdeVEV~sr 51 (191)
+..+++||.||+++-
T Consensus 47 ~~~L~~gD~ieIv~~ 61 (65)
T PRK05863 47 ATKLRDGARLEVVTA 61 (65)
T ss_pred hhhcCCCCEEEEEee
Confidence 345999999999874
No 89
>PRK06764 hypothetical protein; Provisional
Probab=27.56 E-value=1.3e+02 Score=23.16 Aligned_cols=45 Identities=24% Similarity=0.371 Sum_probs=28.6
Q ss_pred CCeEEEEeecCCCCccceeeEEEEEEeccEEEEEEecc--CCcceeeec
Q psy985 43 NQEVEVFSKATEGESCGWWKVIVKMIKGDFHVVEYLGC--QSTFTEIVP 89 (191)
Q Consensus 43 gdeVEV~sran~~ep~gWW~a~V~~iKGdf~vVey~~~--d~~~teIV~ 89 (191)
|-.++|+| .|.|.---=.-.|.--|-.-|||+..|| |.-||=||+
T Consensus 58 grsidils--gdkeaiqlnkyti~f~kpg~yvirvngciy~dvytfiv~ 104 (105)
T PRK06764 58 GRSIDVLS--GDKEAIQLNKYTIRFSKPGKYVIRVNGCIYNDVYTFIVN 104 (105)
T ss_pred Cceeeeec--CChhheEeeeeEEEecCCccEEEEEccEEeeeeEEEEec
Confidence 44445544 4555555555567777777799999999 444665553
No 90
>PF07088 GvpD: GvpD gas vesicle protein; InterPro: IPR009788 This family consists of several archaeal GvpD gas vesicle proteins. GvpD is thought to be involved in the regulation of gas vesicle formation [,].; GO: 0005524 ATP binding
Probab=26.76 E-value=35 Score=32.90 Aligned_cols=25 Identities=28% Similarity=0.448 Sum_probs=17.7
Q ss_pred CeeEEEeCchhhHhhhhhhhhhhhh
Q psy985 147 GALSVISRDEACQKRASMIKDMHFR 171 (191)
Q Consensus 147 ~~Lvvls~~e~~~kRA~mL~dmHfr 171 (191)
+.+.+.-..-+..+||.|..|||||
T Consensus 421 ~~i~i~~~~t~~~~~~~~~~~~h~~ 445 (484)
T PF07088_consen 421 LVILITKPGTELRTRADRVADMHFR 445 (484)
T ss_pred eEEEEEcCCchhhhHHHhhhhheEE
Confidence 3444444444557899999999998
No 91
>TIGR03794 NHPM_micro_HlyD NHPM bacteriocin system secretion protein. Members of this protein family are homologs of the HlyD membrane fusion protein of type I secretion systems. Their occurrence in prokaryotic genomes is associated with the occurrence of a novel class of microcin (small bacteriocins) with a propeptide region related to nitrile hydratase. We designate the class of bacteriocin as Nitrile Hydratase Propeptide Microcin, or NHPM. This family, therefore, is designated as NHPM bacteriocin system secretion protein. Some but not all NHPM-class putative microcins belong to the TOMM (thiazole/oxazole modified microcin) class as assessed by the presence of the scaffolding protein and/or cyclodehydratase in the same gene clusters.
Probab=25.73 E-value=68 Score=29.40 Aligned_cols=33 Identities=12% Similarity=0.187 Sum_probs=28.2
Q ss_pred CCccCCCeEEEEeecCCCCccceeeEEEEEEec
Q psy985 38 SGFVENQEVEVFSKATEGESCGWWKVIVKMIKG 70 (191)
Q Consensus 38 ~~~~egdeVEV~sran~~ep~gWW~a~V~~iKG 70 (191)
..+.+||+|.|..-+.+..+.|-..|+|..|-.
T Consensus 306 ~~v~~G~~v~v~~~~~~~~~~~~~~g~V~~Is~ 338 (421)
T TIGR03794 306 KKIRPGMSVQITPSTVKAERDGYIRGTVTSVSE 338 (421)
T ss_pred hhCCCCCEEEEEEcccccceeeeEEEEEEEecC
Confidence 467899999999887788889999999998854
No 92
>cd02792 MopB_CT_Formate-Dh-Na-like Formate dehydrogenase N, alpha subunit (Formate-Dh-Na) is a major component of nitrate respiration in bacteria such as in the E. coli formate dehydrogenase N (Fdh-N). Fdh-N is a membrane protein that is a complex of three different subunits and is the major electron donor to the nitrate respiratory chain. Also included in this CD is the Desulfovibrio gigas tungsten formate dehydrogenase, DgW-FDH. In contrast to Fdh-N, which is a functional heterotrimer, DgW-FDH is a heterodimer. The DgW-FDH complex is composed of a large subunit carrying the W active site and one [4Fe-4S] center, and a small subunit that harbors a series of three [4Fe-4S] clusters as well as a putative vacant binding site for a fourth cluster. The smaller subunit is not included in this alignment. This CD (MopB_CT_Formate-Dh-Na-like) is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=25.63 E-value=1.9e+02 Score=21.20 Aligned_cols=31 Identities=19% Similarity=0.130 Sum_probs=24.6
Q ss_pred ccCcccccCCCCCCCCCccCCCeEEEEeecC
Q psy985 23 FQFSMVRLPPRETINSGFVENQEVEVFSKAT 53 (191)
Q Consensus 23 vPf~~VRlPP~~~~~~~~~egdeVEV~sran 53 (191)
.|-.-|.+-|.+..+..+.+||.|+|++...
T Consensus 32 ~~~~~v~i~p~dA~~lgi~~Gd~V~v~s~~G 62 (122)
T cd02792 32 QPEMFVEISPELAAERGIKNGDMVWVSSPRG 62 (122)
T ss_pred CCCcEEEECHHHHHHcCCCCCCEEEEEcCCc
Confidence 3555578877777889999999999998653
No 93
>COG4456 VagC Virulence-associated protein and related proteins [Function unknown]
Probab=25.22 E-value=93 Score=22.91 Aligned_cols=32 Identities=31% Similarity=0.271 Sum_probs=22.0
Q ss_pred EEEeecCCCCcccccccCcccccCCCCCCCCCccCCCeEEEEeecC
Q psy985 8 VSNPFSLLRWQPESKFQFSMVRLPPRETINSGFVENQEVEVFSKAT 53 (191)
Q Consensus 8 v~V~f~eN~Wqpe~~vPf~~VRlPP~~~~~~~~~egdeVEV~sran 53 (191)
+++.. .|+-| -|||| .+..|.+ ++||++-..+
T Consensus 3 ~klF~-snrSQ--------AVRLP----~e~~f~~-~~VeI~r~G~ 34 (74)
T COG4456 3 AKLFR-SNRSQ--------AVRLP----KEFRFPE-DRVEIIREGD 34 (74)
T ss_pred eeEee-cCCee--------eEecc----hheecCC-cEEEEEEeCC
Confidence 34444 66666 47888 3567777 9999987665
No 94
>PRK06488 sulfur carrier protein ThiS; Validated
Probab=24.62 E-value=65 Score=21.83 Aligned_cols=17 Identities=18% Similarity=0.233 Sum_probs=13.7
Q ss_pred CCCCccCCCeEEEEeec
Q psy985 36 INSGFVENQEVEVFSKA 52 (191)
Q Consensus 36 ~~~~~~egdeVEV~sra 52 (191)
.+..+.+||+||++...
T Consensus 46 ~~~~L~dgD~Ieiv~~V 62 (65)
T PRK06488 46 AQFVLHEGDRIEILSPM 62 (65)
T ss_pred CccccCCCCEEEEEEec
Confidence 35678999999998754
No 95
>cd05698 S1_Rrp5_repeat_hs6_sc5 S1_Rrp5_repeat_hs6_sc5: Rrp5 is a trans-acting factor important for biogenesis of both the 40S and 60S eukaryotic ribosomal subunits. Rrp5 has two distinct regions, an N-terminal region containing tandemly repeated S1 RNA-binding domains (12 S1 repeats in Saccharomyces cerevisiae Rrp5 and 14 S1 repeats in Homo sapiens Rrp5) and a C-terminal region containing tetratricopeptide repeat (TPR) motifs thought to be involved in protein-protein interactions. Mutational studies have shown that each region represents a specific functional domain. Deletions within the S1-containing region inhibit pre-rRNA processing at either site A3 or A2, whereas deletions within the TPR region confer an inability to support cleavage of A0-A2. This CD includes H. sapiens S1 repeat 6 (hs6) and S. cerevisiae S1 repeat 5 (sc5). Rrp5 is found in eukaryotes but not in prokaryotes or archaea.
Probab=24.60 E-value=58 Score=21.70 Aligned_cols=44 Identities=14% Similarity=-0.071 Sum_probs=27.5
Q ss_pred eeEEEEeecCCCCcccccccCcccccCCCCCCCCCccCCCeEEEEee
Q psy985 5 SFNVSNPFSLLRWQPESKFQFSMVRLPPRETINSGFVENQEVEVFSK 51 (191)
Q Consensus 5 ~~~v~V~f~eN~Wqpe~~vPf~~VRlPP~~~~~~~~~egdeVEV~sr 51 (191)
.|-+.|.+ .+.+. --+|.+++.-..-......+.+||+|+|..-
T Consensus 13 ~~G~~V~l-~~~~~--gli~~s~l~~~~~~~~~~~~~~G~~i~v~v~ 56 (70)
T cd05698 13 PNGCIVSF-YNNVK--GFLPKSELSEAFIKDPEEHFRVGQVVKVKVL 56 (70)
T ss_pred cCcEEEEE-CCCCE--EEEEHHHcChhhcCCHHHcccCCCEEEEEEE
Confidence 46677888 76644 4677776643321122345899999999543
No 96
>PF03658 Ub-RnfH: RnfH family Ubiquitin; InterPro: IPR005346 This is a small family of proteins of unknown function.; PDB: 2HJ1_B.
Probab=24.54 E-value=49 Score=24.72 Aligned_cols=16 Identities=13% Similarity=0.459 Sum_probs=8.9
Q ss_pred CCCCccCCCeEEEEee
Q psy985 36 INSGFVENQEVEVFSK 51 (191)
Q Consensus 36 ~~~~~~egdeVEV~sr 51 (191)
.+..+.+||.||+|-+
T Consensus 58 ~d~~L~~GDRVEIYRP 73 (84)
T PF03658_consen 58 LDTVLRDGDRVEIYRP 73 (84)
T ss_dssp TT-B--TT-EEEEE-S
T ss_pred CCCcCCCCCEEEEecc
Confidence 3567889999999964
No 97
>PF12148 DUF3590: Protein of unknown function (DUF3590); InterPro: IPR021991 This domain is found in eukaryotes, and is typically between 83 and 97 amino acids in length. It is found in association with PF00097 from PFAM, PF02182 from PFAM, PF00628 from PFAM, PF00240 from PFAM. There are two conserved sequence motifs: RAR and NYN. The domain is part of the protein NIRF which has zinc finger and ubiquitinating domains. The function of this domain is likely to be mainly structural, however this has not been confirmed. ; PDB: 3DB4_A 3ASK_A 3DB3_A 2L3R_A.
Probab=24.42 E-value=1.3e+02 Score=22.70 Aligned_cols=44 Identities=11% Similarity=0.244 Sum_probs=30.1
Q ss_pred CCCccceeeEEEEEEec-------c-EEEEEEeccCCcceeeecCCCcccCC
Q psy985 54 EGESCGWWKVIVKMIKG-------D-FHVVEYLGCQSTFTEIVPPERMRQKN 97 (191)
Q Consensus 54 ~~ep~gWW~a~V~~iKG-------d-f~vVey~~~d~~~teIV~~erLR~~n 97 (191)
+...-+|.-|+|..+-- + +|-|+|-+.+..-...+....|||..
T Consensus 6 d~~~gAWfEa~i~~i~~~~~~~~e~viYhIkyddype~gvv~~~~~~iRpRA 57 (85)
T PF12148_consen 6 DRNMGAWFEAQIVTITKKCMSDDEDVIYHIKYDDYPENGVVEMRSKDIRPRA 57 (85)
T ss_dssp -TTT-EEEEEEEEEEEES-SSSSTTEEEEEEETT-GGG-EEEEEGGGEEE--
T ss_pred cCCCcceEEEEEEEeeccCCCCCCCEEEEEEeccCCCcCceeccccccccee
Confidence 44566899998876633 2 99999987776667788999999984
No 98
>cd02779 MopB_CT_Arsenite-Ox This CD contains the molybdopterin_binding C-terminal (MopB_CT) region of Arsenite oxidase (Arsenite-Ox) and related proteins. Arsenite oxidase oxidizes arsenite to the less toxic arsenate; it transfers the electrons obtained from the oxidation of arsenite towards the soluble periplasmic electron carriers cytochrome c and/or amicyanin.
Probab=24.33 E-value=27 Score=26.12 Aligned_cols=27 Identities=19% Similarity=0.437 Sum_probs=21.1
Q ss_pred cccccCCCCCCCCCccCCCeEEEEeec
Q psy985 26 SMVRLPPRETINSGFVENQEVEVFSKA 52 (191)
Q Consensus 26 ~~VRlPP~~~~~~~~~egdeVEV~sra 52 (191)
.-|.+-|.+..+..+.+||.|+|++..
T Consensus 33 ~~v~in~~dA~~lgi~~Gd~V~v~s~~ 59 (115)
T cd02779 33 PYIEVNPEDAKREGLKNGDLVEVYNDY 59 (115)
T ss_pred CEEEECHHHHHHcCCCCCCEEEEEeCC
Confidence 335666666688999999999999965
No 99
>cd04472 S1_PNPase S1_PNPase: Polynucleotide phosphorylase (PNPase), ), S1-like RNA-binding domain. PNPase is a polyribonucleotide nucleotidyl transferase that degrades mRNA. It is a trimeric multidomain protein. The C-terminus contains the S1 domain which binds ssRNA. This family is classified based on the S1 domain. PNPase nonspecifically removes the 3' nucleotides from mRNA, but is stalled by double-stranded RNA structures such as a stem-loop. Evidence shows that a minimum of 7-10 unpaired nucleotides at the 3' end, is required for PNPase degradation. It is suggested that PNPase also dephosphorylates the RNA 5' end. This additional activity may regulate the 5'-dependent activity of RNaseE in vivo.
Probab=24.24 E-value=95 Score=20.11 Aligned_cols=44 Identities=16% Similarity=0.107 Sum_probs=26.1
Q ss_pred ceeEEEEeecCCCCcccccccCcccccCCCCCCCCCccCCCeEEEEe
Q psy985 4 HSFNVSNPFSLLRWQPESKFQFSMVRLPPRETINSGFVENQEVEVFS 50 (191)
Q Consensus 4 ~~~~v~V~f~eN~Wqpe~~vPf~~VRlPP~~~~~~~~~egdeVEV~s 50 (191)
+.|-+.|.+ +++ -+--+|+++++-.+-......+..||+|+|-.
T Consensus 12 ~~~G~~v~l-~~~--~~g~l~~~~l~~~~~~~~~~~~~~Gd~v~v~v 55 (68)
T cd04472 12 KDFGAFVEI-LPG--KDGLVHISELSDERVEKVEDVLKVGDEVKVKV 55 (68)
T ss_pred EEeEEEEEe-CCC--CEEEEEhHHcCCccccCHHHccCCCCEEEEEE
Confidence 346667777 544 23456777665433222234689999999953
No 100
>PF14444 S1-like: S1-like
Probab=23.99 E-value=83 Score=22.13 Aligned_cols=23 Identities=13% Similarity=0.001 Sum_probs=19.1
Q ss_pred CCccCCCeEEEEeecCCCCccce
Q psy985 38 SGFVENQEVEVFSKATEGESCGW 60 (191)
Q Consensus 38 ~~~~egdeVEV~sran~~ep~gW 60 (191)
.-+++||.|.|-.--|.+-|+-|
T Consensus 32 ~~P~vGdrV~v~A~~n~~~~~kW 54 (58)
T PF14444_consen 32 NVPKVGDRVLVEAIYNPNMPFKW 54 (58)
T ss_pred CCCccCCEEEEEEEeCCCCCccc
Confidence 45689999999888888888877
No 101
>PF02922 CBM_48: Carbohydrate-binding module 48 (Isoamylase N-terminal domain); InterPro: IPR004193 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Enzymes containing this domain belong to family 13 (GH13 from CAZY) of the glycosyl hydrolases. This domain is found in a range of enzymes that act on branched substrates ie. isoamylase, pullulanase and branching enzyme. Isoamylase hydrolyses 1,6-alpha-D-glucosidic branch linkages in glycogen, amylopectin and dextrin; 1,4-alpha-glucan branching enzyme functions in the formation of 1,6-glucosidic linkages of glycogen; and pullulanase is a starch-debranching enzyme.; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 2BHZ_A 2BY2_A 2BY3_A 2BXY_A 2BY1_A 2BHY_A 2BHU_A 2BXZ_A 2BY0_A 2FHB_A ....
Probab=23.30 E-value=2.3e+02 Score=19.41 Aligned_cols=27 Identities=22% Similarity=0.370 Sum_probs=18.1
Q ss_pred CCccceeeEEEE-EEe-cc-EEEEEEeccC
Q psy985 55 GESCGWWKVIVK-MIK-GD-FHVVEYLGCQ 81 (191)
Q Consensus 55 ~ep~gWW~a~V~-~iK-Gd-f~vVey~~~d 81 (191)
...-|||.+.|. .++ |+ .|.....+-+
T Consensus 45 ~~~~G~w~~~~~~~~~~g~~~Y~y~i~~~~ 74 (85)
T PF02922_consen 45 KDDDGVWEVTVPGDLPPGGYYYKYRIDGDD 74 (85)
T ss_dssp ECTTTEEEEEEEGCGTTTT-EEEEEEEETT
T ss_pred cCCCCEEEEEEcCCcCCCCEEEEEEEEeCC
Confidence 346788988888 677 44 6666665444
No 102
>cd05690 S1_RPS1_repeat_ec5 S1_RPS1_repeat_ec5: Ribosomal protein S1 (RPS1) domain. RPS1 is a component of the small ribosomal subunit thought to be involved in the recognition and binding of mRNA's during translation initiation. The bacterial RPS1 domain architecture consists of 4-6 tandem S1 domains. In some bacteria, the tandem S1 array is located C-terminal to a 4-hydroxy-3-methylbut-2-enyl diphosphate reductase (HMBPP reductase) domain. While RPS1 is found primarily in bacteria, proteins with tandem RPS1-like domains have been identified in plants and humans, however these lack the N-terminal HMBPP reductase domain. This CD includes S1 repeat 5 (ec5) of the Escherichia coli RPS1. Autoantibodies to double-stranded DNA from patients with systemic lupus erythematosus cross-react with the human RPS1 homolog.
Probab=23.08 E-value=83 Score=20.74 Aligned_cols=44 Identities=16% Similarity=0.122 Sum_probs=27.4
Q ss_pred ceeEEEEeecCCCCcccccccCcccccCCC-CCCCCCccCCCeEEEEe
Q psy985 4 HSFNVSNPFSLLRWQPESKFQFSMVRLPPR-ETINSGFVENQEVEVFS 50 (191)
Q Consensus 4 ~~~~v~V~f~eN~Wqpe~~vPf~~VRlPP~-~~~~~~~~egdeVEV~s 50 (191)
+.|-+.|.+ ++++ +--+|+++++-+.. ......++.||+|+|-.
T Consensus 12 ~~~G~fv~l-~~~~--~Glv~~~~l~~~~~~~~~~~~~~~G~~v~v~v 56 (69)
T cd05690 12 TDFGIFVGL-DGGI--DGLVHISDISWTQRVRHPSEIYKKGQEVEAVV 56 (69)
T ss_pred EeeeEEEEe-CCCC--EEEEEHHHCCCccccCChhhEECCCCEEEEEE
Confidence 356677777 6653 44677777753211 12234689999999953
No 103
>cd04461 S1_Rrp5_repeat_hs8_sc7 S1_Rrp5_repeat_hs8_sc7: Rrp5 Homo sapiens S1 repeat 8 (hs8) and Saccharomyces cerevisiae S1 repeat 7 (sc7)-like domains. Rrp5 is a trans-acting factor important for biogenesis of both the 40S and 60S eukaryotic ribosomal subunits. Rrp5 has two distinct regions, an N-terminal region containing tandemly repeated S1 RNA-binding domains (12 S1 repeats in S. cerevisiae Rrp5 and 14 S1 repeats in H. sapiens Rrp5) and a C-terminal region containing tetratricopeptide repeat (TPR) motifs thought to be involved in protein-protein interactions. Mutational studies have shown that each region represents a specific functional domain. Deletions within the S1-containing region inhibit pre-rRNA processing at either site A3 or A2, whereas deletions within the TPR region confer an inability to support cleavage of A0-A2. This CD includes H. sapiens S1 repeat 8 and S. cerevisiae S1 repeat 7. Rrp5 is found in eukaryotes but not in prokaryotes or archaea.
Probab=22.93 E-value=66 Score=22.50 Aligned_cols=43 Identities=19% Similarity=0.054 Sum_probs=27.2
Q ss_pred eeEEEEeecCCCCcccccccCcccccCCCCCCCCCccCCCeEEEEe
Q psy985 5 SFNVSNPFSLLRWQPESKFQFSMVRLPPRETINSGFVENQEVEVFS 50 (191)
Q Consensus 5 ~~~v~V~f~eN~Wqpe~~vPf~~VRlPP~~~~~~~~~egdeVEV~s 50 (191)
.|-+.|.+ ++++. --+|.+++....-......+.+||+|+|..
T Consensus 27 ~~G~fv~l-~~~~~--g~v~~~el~~~~~~~~~~~~~~Gd~v~vkV 69 (83)
T cd04461 27 PYGVFVEF-LGGLT--GLAPKSYISDEFVTDPSFGFKKGQSVTAKV 69 (83)
T ss_pred eceEEEEc-CCCCE--EEEEHHHCCcccccCHHHhcCCCCEEEEEE
Confidence 45667777 66643 356666664443323346789999999954
No 104
>PRK08179 prfH peptide chain release factor-like protein; Reviewed
Probab=22.78 E-value=2e+02 Score=24.70 Aligned_cols=46 Identities=13% Similarity=0.087 Sum_probs=29.8
Q ss_pred ccEEEEcCCCCeeEEEeCchhhHhhhhhhhhhhhhhHHHHHHHhhhhhhhh
Q psy985 137 AAIVRYIPDKGALSVISRDEACQKRASMIKDMHFRNLNQKVVLLKRTEDID 187 (191)
Q Consensus 137 a~~v~y~pe~~~Lvvls~~e~~~kRA~mL~dmHfr~LrqK~~l~~r~eea~ 187 (191)
|+.++.-| +-|+|-+.++.++.+=. +.-++.|++||.-+.+.++++
T Consensus 130 aVrl~h~p--tgi~v~~~~~RSQ~~Nk---~~A~~~L~~~L~~~~~~~~~~ 175 (200)
T PRK08179 130 AVRATHLA--SGISVKVQSERSQHANK---RLARLLIAWKLEQQQQEQSAA 175 (200)
T ss_pred eEEEEEcC--CcEEEEECCCCCHHHHH---HHHHHHHHHHHHHHHHHHHHH
Confidence 56667666 66777777665544332 345678888888777665543
No 105
>KOG3123|consensus
Probab=22.63 E-value=47 Score=29.50 Aligned_cols=17 Identities=41% Similarity=0.620 Sum_probs=13.1
Q ss_pred EEEEeecCCCCcccccc
Q psy985 7 NVSNPFSLLRWQPESKF 23 (191)
Q Consensus 7 ~v~V~f~eN~Wqpe~~v 23 (191)
.|++.|+.++|+|++.+
T Consensus 131 tVSiv~ftd~wrP~Sfy 147 (272)
T KOG3123|consen 131 TVSIVFFTDNWRPESFY 147 (272)
T ss_pred EEEEEEEccCcCchhHH
Confidence 56777779999998543
No 106
>PF06152 Phage_min_cap2: Phage minor capsid protein 2; InterPro: IPR009319 This entry is represented by Bacteriophage A118, Gp4, the minor capsid protein. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.; GO: 0005198 structural molecule activity, 0019028 viral capsid
Probab=22.48 E-value=71 Score=29.39 Aligned_cols=29 Identities=31% Similarity=0.374 Sum_probs=25.3
Q ss_pred CCeEEEEeecCCCCccceeeEEEEEEecc
Q psy985 43 NQEVEVFSKATEGESCGWWKVIVKMIKGD 71 (191)
Q Consensus 43 gdeVEV~sran~~ep~gWW~a~V~~iKGd 71 (191)
-|-|+|=+-++.-..|.+|-|+|..++|+
T Consensus 224 ~dlv~vS~H~garp~cap~QGkV~s~~~~ 252 (361)
T PF06152_consen 224 IDLVEVSSHPGARPSCAPWQGKVYSLSGG 252 (361)
T ss_pred CCEEEEcCCCCCCCCCcCcCCEEEEeccC
Confidence 36788888888999999999999988874
No 107
>PF05258 DUF721: Protein of unknown function (DUF721); InterPro: IPR007922 This family contains several actinomycete proteins of unknown function, and related sequences from other species.
Probab=22.37 E-value=73 Score=21.94 Aligned_cols=33 Identities=18% Similarity=0.282 Sum_probs=22.5
Q ss_pred CCCeeEEEeCchhhHhhhhhhhhhhhhhHHHHH
Q psy985 145 DKGALSVISRDEACQKRASMIKDMHFRNLNQKV 177 (191)
Q Consensus 145 e~~~Lvvls~~e~~~kRA~mL~dmHfr~LrqK~ 177 (191)
+.|.|+|.+.+.....+-+|+...-...|+++.
T Consensus 47 ~~g~L~i~v~~~~~~~~L~~~~~~il~~l~~~~ 79 (89)
T PF05258_consen 47 KDGTLVIEVDSSAWAQELRYMKPQILKKLNEFL 79 (89)
T ss_pred ECCEEEEEECCHHHHHHHHHHHHHHHHHHHHHc
Confidence 479999999988866666666555555555543
No 108
>KOG3601|consensus
Probab=22.20 E-value=55 Score=28.73 Aligned_cols=49 Identities=20% Similarity=0.341 Sum_probs=31.7
Q ss_pred CCCCcccccccCcccccCCCCCCCCCccCCCeEEEEeecCCCCccceeeEEEEEEeccE
Q psy985 14 LLRWQPESKFQFSMVRLPPRETINSGFVENQEVEVFSKATEGESCGWWKVIVKMIKGDF 72 (191)
Q Consensus 14 eN~Wqpe~~vPf~~VRlPP~~~~~~~~~egdeVEV~sran~~ep~gWW~a~V~~iKGdf 72 (191)
.|-||.. +++|+ |.++....|..||-++|..+.+ .-||.|...--.|=|
T Consensus 162 ~~~yqQa-~~df~-----~~pp~ql~f~~gq~~~v~~~ss----~~ww~Gs~lg~agiF 210 (222)
T KOG3601|consen 162 TNYYQQA-LYDFQ-----PQPPGQLAFRRGQQIQVLDSSS----PFWWFGSKLGRAGIF 210 (222)
T ss_pred cchhhhh-cCCCC-----CCCchhhccccCCcceeecCCC----cchhhccccCceeee
Confidence 4455543 45555 2223467788999999998755 469999876555443
No 109
>PRK03598 putative efflux pump membrane fusion protein; Provisional
Probab=22.04 E-value=2.3e+02 Score=24.82 Aligned_cols=71 Identities=7% Similarity=0.073 Sum_probs=41.6
Q ss_pred CCccCCCeEEEEeecCCCCccceeeEEEEEEeccEEEEEEecc--C--CcceeeecCCCcccCCCCCCCCcCceeEEeec
Q psy985 38 SGFVENQEVEVFSKATEGESCGWWKVIVKMIKGDFHVVEYLGC--Q--STFTEIVPPERMRQKNTNPPIDAKTFHKFDIE 113 (191)
Q Consensus 38 ~~~~egdeVEV~sran~~ep~gWW~a~V~~iKGdf~vVey~~~--d--~~~teIV~~erLR~~n~n~pi~~~~f~k~~i~ 113 (191)
..+.+||.|.|+..+.... .+.|+|..|.-. .++..- . ...+.++-.=++.+.|++..+-..++-+.+|.
T Consensus 252 ~~i~~Gq~v~v~~~~~~~~---~~~g~V~~Is~~---~~~~~~~~~~~~~~~~~~~~v~i~~~~~~~~l~pGm~~~v~i~ 325 (331)
T PRK03598 252 GQAQPGRKVLLYTDGRPDK---PYHGQIGFVSPT---AEFTPKTVETPDLRTDLVYRLRIVVTDADDALRQGMPVTVRFA 325 (331)
T ss_pred hhCCCCCEEEEEEcCCCCc---EEEEEEEEEcCc---cccCCccccCcccceeEEEEEEEEecCcccccCCCCeEEEEEe
Confidence 4678999999988664333 578999988653 122100 0 01223333335555565666777777776665
Q ss_pred C
Q psy985 114 V 114 (191)
Q Consensus 114 V 114 (191)
.
T Consensus 326 ~ 326 (331)
T PRK03598 326 D 326 (331)
T ss_pred c
Confidence 4
No 110
>PRK07696 sulfur carrier protein ThiS; Provisional
Probab=21.74 E-value=62 Score=22.44 Aligned_cols=16 Identities=6% Similarity=0.366 Sum_probs=13.0
Q ss_pred CCCCccCCCeEEEEee
Q psy985 36 INSGFVENQEVEVFSK 51 (191)
Q Consensus 36 ~~~~~~egdeVEV~sr 51 (191)
....+++||.||+++-
T Consensus 48 ~~~~L~~gD~iEIv~~ 63 (67)
T PRK07696 48 TDTSVFDGDQIEIVTF 63 (67)
T ss_pred CceecCCCCEEEEEEE
Confidence 4567899999999863
No 111
>PF13437 HlyD_3: HlyD family secretion protein
Probab=21.73 E-value=1.5e+02 Score=21.14 Aligned_cols=32 Identities=16% Similarity=-0.063 Sum_probs=23.8
Q ss_pred CCCcc-CCCeEEEEeecCCCCccceeeEEEEEEecc
Q psy985 37 NSGFV-ENQEVEVFSKATEGESCGWWKVIVKMIKGD 71 (191)
Q Consensus 37 ~~~~~-egdeVEV~sran~~ep~gWW~a~V~~iKGd 71 (191)
-..+. .||.|.++... .+-..|.|+|..+...
T Consensus 47 ~~~i~~~g~~v~v~~~~---~~~~~~~g~V~~I~~~ 79 (105)
T PF13437_consen 47 IARIKDPGQKVTVRLDP---GPEKTIEGKVSSISPS 79 (105)
T ss_pred hcceEeCCCEEEEEECC---CCCcEEEEEEEEEeCc
Confidence 35665 99999999872 2222999999999773
No 112
>PF03423 CBM_25: Carbohydrate binding domain (family 25); InterPro: IPR005085 A carbohydrate-binding module (CBM) is defined as a contiguous amino acid sequence within a carbohydrate-active enzyme with a discreet fold having carbohydrate-binding activity. A few exceptions are CBMs in cellulosomal scaffolding proteins and rare instances of independent putative CBMs. The requirement of CBMs existing as modules within larger enzymes sets this class of carbohydrate-binding protein apart from other non-catalytic sugar binding proteins such as lectins and sugar transport proteins. CBMs were previously classified as cellulose-binding domains (CBDs) based on the initial discovery of several modules that bound cellulose [, ]. However, additional modules in carbohydrate-active enzymes are continually being found that bind carbohydrates other than cellulose yet otherwise meet the CBM criteria, hence the need to reclassify these polypeptides using more inclusive terminology. Previous classification of cellulose-binding domains were based on amino acid similarity. Groupings of CBDs were called "Types" and numbered with roman numerals (e.g. Type I or Type II CBDs). In keeping with the glycoside hydrolase classification, these groupings are now called families and numbered with Arabic numerals. Families 1 to 13 are the same as Types I to XIII. For a detailed review on the structure and binding modes of CBMs see []. This entry represents CBM25 from CAZY which has a starch-binding function as has been demonstrated in one case.; PDB: 2LAB_A 2C3X_B 2C3V_A 2C3W_C 2LAA_A.
Probab=21.04 E-value=79 Score=23.07 Aligned_cols=10 Identities=50% Similarity=1.371 Sum_probs=7.8
Q ss_pred cceeeEEEEE
Q psy985 58 CGWWKVIVKM 67 (191)
Q Consensus 58 ~gWW~a~V~~ 67 (191)
-|||.++|.+
T Consensus 47 ~~~~~~tv~v 56 (87)
T PF03423_consen 47 GGWWKATVDV 56 (87)
T ss_dssp TTEEEEEEE-
T ss_pred CCEEEEEEEE
Confidence 5899999865
No 113
>cd02777 MopB_CT_DMSOR-like The MopB_CT_DMSOR-like CD contains dimethylsulfoxide reductase (DMSOR), biotin sulfoxide reductase (BSOR), trimethylamine N-oxide reductase (TMAOR) and other related proteins. DMSOR always catalyzes the reduction of DMSO to dimethylsulfide, but its cellular location and oligomerization state are organism-dependent. For example, in Rhodobacter sphaeriodes and Rhodobacter capsulatus, it is an 82-kDa monomeric soluble protein found in the periplasmic space; in E. coli, it is membrane-bound and exists as a heterotrimer. BSOR catalyzes the reduction of biotin sulfixode to biotin, and is unique among Mo enzymes because no additional auxiliary proteins or cofactors are required. TMAOR is similar to DMSOR, but its only natural substrate is TMAO. Also included in this group is the pyrogallol-phloroglucinol transhydroxylase from Pelobacter acidigallici. This CD is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB hom
Probab=20.71 E-value=28 Score=26.32 Aligned_cols=26 Identities=23% Similarity=0.397 Sum_probs=21.7
Q ss_pred ccccCCCCCCCCCccCCCeEEEEeec
Q psy985 27 MVRLPPRETINSGFVENQEVEVFSKA 52 (191)
Q Consensus 27 ~VRlPP~~~~~~~~~egdeVEV~sra 52 (191)
-|.+-|.+..+..+.+||.|+|++..
T Consensus 35 ~v~i~p~dA~~lgi~~Gd~V~v~s~~ 60 (127)
T cd02777 35 PVWINPLDAAARGIKDGDIVRVFNDR 60 (127)
T ss_pred eEEECHHHHHHcCCCCCCEEEEEcCC
Confidence 37777776788999999999999865
No 114
>cd02788 MopB_CT_NDH-1_NuoG2-N7 MopB_CT_NDH-1_NuoG2-N7: C-terminal region of the NuoG-like subunit (of the variant with a [4Fe-4S] cluster, N7) of the NADH-quinone oxidoreductase/NADH dehydrogenase-1 (NDH-1) found in various bacteria. The NDH-1 is the first energy-transducting complex in the respiratory chain and functions as a redox pump that uses the redox energy to translocate H+ ions across the membrane, resulting in a significant contribution to energy production. In Escherichia coli NDH-1, the largest subunit is encoded by the nuoG gene, and is part of the 14 distinct subunits constituting the functional enzyme. The NuoG subunit is made of two domains: the first contains three binding sites for FeS clusters (the fer2 domain), the second domain, is of unknown function or, as postulated, has lost an ancestral formate dehydrogenase activity that became redundant during the evolution of the complex I enzyme. Unique to this group, compared to the other prokaryotic and eukaryotic groups
Probab=20.52 E-value=36 Score=24.77 Aligned_cols=27 Identities=22% Similarity=0.345 Sum_probs=21.0
Q ss_pred cccccCCCCCCCCCccCCCeEEEEeec
Q psy985 26 SMVRLPPRETINSGFVENQEVEVFSKA 52 (191)
Q Consensus 26 ~~VRlPP~~~~~~~~~egdeVEV~sra 52 (191)
.-+.+-|.+..+..+.+||.|+|+++.
T Consensus 29 ~~v~inp~dA~~lGi~~Gd~V~v~s~~ 55 (96)
T cd02788 29 PYARLSPADAARLGLADGDLVEFSLGD 55 (96)
T ss_pred CEEEECHHHHHHcCCCCCCEEEEEECC
Confidence 346666666678899999999999854
No 115
>KOG4773|consensus
Probab=20.22 E-value=27 Score=32.77 Aligned_cols=37 Identities=16% Similarity=0.142 Sum_probs=26.9
Q ss_pred CCCccCCCeEEEEeecCCCCccceeeEEEEEEecc---EEEEEE
Q psy985 37 NSGFVENQEVEVFSKATEGESCGWWKVIVKMIKGD---FHVVEY 77 (191)
Q Consensus 37 ~~~~~egdeVEV~sran~~ep~gWW~a~V~~iKGd---f~vVey 77 (191)
...++.||.+++.+|.+ -+||.|+++-.+|= ||+-..
T Consensus 191 EL~l~agdV~~~~~r~e----k~W~~gk~R~~~g~yp~sF~~~l 230 (386)
T KOG4773|consen 191 ELNLVAGDVEFLLSRDE----KYWLLGKVRGLTGYYPDSFVKQL 230 (386)
T ss_pred eeeeehhhHHHHHhhcc----cceeeeeeccccccccHHhhhhh
Confidence 34567788888777765 48999999999985 444443
No 116
>PRK07440 hypothetical protein; Provisional
Probab=20.12 E-value=83 Score=22.10 Aligned_cols=16 Identities=6% Similarity=0.133 Sum_probs=13.3
Q ss_pred CCCCccCCCeEEEEee
Q psy985 36 INSGFVENQEVEVFSK 51 (191)
Q Consensus 36 ~~~~~~egdeVEV~sr 51 (191)
.+..+++||.||+++-
T Consensus 51 ~~~~L~~gD~IEIv~~ 66 (70)
T PRK07440 51 EQTQVQPGDRLEIVTI 66 (70)
T ss_pred CceecCCCCEEEEEEE
Confidence 4677899999999874
No 117
>PF15595 Imm31: Immunity protein 31
Probab=20.06 E-value=1.8e+02 Score=22.40 Aligned_cols=27 Identities=22% Similarity=0.427 Sum_probs=21.2
Q ss_pred EEEEcCCCCeeEEEeCchhhHhhhhhh
Q psy985 139 IVRYIPDKGALSVISRDEACQKRASMI 165 (191)
Q Consensus 139 ~v~y~pe~~~Lvvls~~e~~~kRA~mL 165 (191)
.+.||||.|...+.|.+..+.++-..+
T Consensus 60 ~i~fDpEagmF~ays~~~eal~~l~~~ 86 (107)
T PF15595_consen 60 KIDFDPEAGMFCAYSEDKEALKKLAEI 86 (107)
T ss_pred hcCcCCCCCEEEEecCCHHHHHHHHHH
Confidence 478999999999998887766654433
No 118
>PF13382 Adenine_deam_C: Adenine deaminase C-terminal domain; PDB: 3T8L_B 3T81_A 3NQB_A.
Probab=20.05 E-value=83 Score=26.18 Aligned_cols=33 Identities=18% Similarity=0.118 Sum_probs=22.9
Q ss_pred HHHhhhccEEEEcCCCCeeEEEeCchhhHhhhh
Q psy985 131 FQKAIAAAIVRYIPDKGALSVISRDEACQKRAS 163 (191)
Q Consensus 131 F~kaiga~~v~y~pe~~~Lvvls~~e~~~kRA~ 163 (191)
|--..||+.-+|..|++.++|+.+|.+.+.+|.
T Consensus 50 fg~~~GAiAsS~ahDshniiviG~~~~dm~~A~ 82 (171)
T PF13382_consen 50 FGLKRGAIASSVAHDSHNIIVIGTNDEDMALAA 82 (171)
T ss_dssp S--BSSEEEES--TTT--EEEEESSHHHHHHHH
T ss_pred CcccCCeEEEEcccCCCCEEEEECCHHHHHHHH
Confidence 434458999999999999999999999888763
No 119
>PRK12335 tellurite resistance protein TehB; Provisional
Probab=20.01 E-value=1.5e+02 Score=25.60 Aligned_cols=51 Identities=20% Similarity=0.416 Sum_probs=34.6
Q ss_pred cceeeEEEEEEeccEEEEEEeccCCcceeeecCCCcccCCCCCCCCcCceeEEeec
Q psy985 58 CGWWKVIVKMIKGDFHVVEYLGCQSTFTEIVPPERMRQKNTNPPIDAKTFHKFDIE 113 (191)
Q Consensus 58 ~gWW~a~V~~iKGdf~vVey~~~d~~~teIV~~erLR~~n~n~pi~~~~f~k~~i~ 113 (191)
.|+| |++..++|.+-++.+. . -.++...+.|-|.+..+=+-...+|+....
T Consensus 31 ~g~~-~~~~vl~G~l~~~~~d---e-~g~~~~~~~l~~~~~~~~i~p~~wh~v~~~ 81 (287)
T PRK12335 31 EGTW-AKLTVLKGELKFYELT---E-DGEELSEHIFDAENQPPFIEPQAWHRIEAA 81 (287)
T ss_pred CCcc-eEEEEEeeeEEEEEEC---C-CCCeeeEEEEecCCCCceeCCcceEEEEEc
Confidence 7899 9999999998777773 1 235555666666654444555667666654
Done!