Query psy3080
Match_columns 118
No_of_seqs 99 out of 113
Neff 4.6
Searched_HMMs 46136
Date Sat Aug 17 01:11:17 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy3080.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/3080hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1789|consensus 100.0 1.6E-50 3.5E-55 370.1 7.8 118 1-118 1188-1307(2235)
2 PRK09430 djlA Dna-J like membr 98.5 1.7E-07 3.7E-12 75.7 6.1 59 50-117 152-221 (267)
3 PTZ00100 DnaJ chaperone protei 97.6 6.9E-05 1.5E-09 54.8 3.1 28 88-117 59-86 (116)
4 PF00226 DnaJ: DnaJ domain; I 96.8 0.00099 2.1E-08 41.8 2.4 21 95-117 1-21 (64)
5 smart00271 DnaJ DnaJ molecular 96.7 0.0011 2.3E-08 40.8 2.0 22 95-118 2-23 (60)
6 PF13446 RPT: A repeated domai 96.5 0.0031 6.6E-08 40.2 3.0 26 90-117 1-26 (62)
7 cd06257 DnaJ DnaJ domain or J- 96.4 0.0022 4.8E-08 38.6 1.9 22 95-118 1-22 (55)
8 KOG0721|consensus 96.4 0.0018 3.8E-08 52.4 1.6 23 93-117 98-120 (230)
9 PTZ00037 DnaJ_C chaperone prot 96.0 0.0042 9.1E-08 53.4 2.5 27 90-118 24-50 (421)
10 KOG1789|consensus 95.8 0.0097 2.1E-07 57.8 4.0 54 28-82 794-847 (2235)
11 KOG0715|consensus 95.4 0.0065 1.4E-07 49.8 1.1 29 88-118 37-65 (288)
12 PF03656 Pam16: Pam16; InterP 95.3 0.01 2.3E-07 43.9 1.8 26 90-117 54-79 (127)
13 KOG0723|consensus 93.8 0.057 1.2E-06 39.5 2.7 27 88-116 50-76 (112)
14 KOG0722|consensus 93.0 0.045 9.8E-07 45.8 1.2 24 93-118 32-55 (329)
15 KOG3442|consensus 93.0 0.23 5E-06 37.2 4.8 27 89-117 54-80 (132)
16 KOG0720|consensus 91.4 0.1 2.2E-06 46.3 1.5 23 93-117 234-256 (490)
17 KOG0716|consensus 91.3 0.1 2.2E-06 43.4 1.4 27 90-118 27-53 (279)
18 PTZ00341 Ring-infected erythro 90.4 0.14 3.1E-06 49.2 1.6 24 93-118 572-595 (1136)
19 COG5407 SEC63 Preprotein trans 87.1 0.33 7.1E-06 43.6 1.5 22 94-117 98-119 (610)
20 COG1076 DjlA DnaJ-domain-conta 78.4 1.5 3.3E-05 33.1 1.9 22 94-117 113-134 (174)
21 KOG0624|consensus 73.7 1.9 4.2E-05 37.9 1.6 22 94-117 394-415 (504)
22 PF04433 SWIRM: SWIRM domain; 72.7 9.1 0.0002 25.4 4.4 43 67-117 30-75 (86)
23 KOG0568|consensus 71.9 2.6 5.7E-05 35.2 1.9 23 93-117 46-68 (342)
24 KOG0550|consensus 64.1 2.9 6.2E-05 37.2 0.6 23 93-117 372-394 (486)
25 PF12728 HTH_17: Helix-turn-he 61.4 6 0.00013 23.4 1.6 16 91-106 2-17 (51)
26 PF02042 RWP-RK: RWP-RK domain 57.6 6.9 0.00015 24.9 1.4 27 41-67 21-47 (52)
27 TIGR01764 excise DNA binding d 57.6 6.5 0.00014 22.2 1.2 16 91-106 2-17 (49)
28 PF07455 Psu: Phage polarity s 52.4 20 0.00043 28.5 3.5 42 65-106 119-160 (188)
29 PF07099 DUF1361: Protein of u 50.9 18 0.0004 27.3 3.0 32 48-82 119-151 (168)
30 cd00872 PI3Ka_I Phosphoinositi 49.4 36 0.00077 26.1 4.4 61 40-115 22-89 (171)
31 PF08006 DUF1700: Protein of u 48.1 32 0.00069 25.7 3.9 47 51-104 6-54 (181)
32 cd04762 HTH_MerR-trunc Helix-T 47.2 12 0.00026 20.8 1.2 15 91-105 1-15 (49)
33 PF09824 ArsR: ArsR transcript 47.2 35 0.00077 26.5 4.1 60 36-104 76-135 (160)
34 PF06844 DUF1244: Protein of u 46.9 28 0.0006 23.5 3.0 32 72-103 12-45 (68)
35 KOG2905|consensus 46.1 16 0.00036 30.2 2.2 33 60-92 197-236 (254)
36 PF12833 HTH_18: Helix-turn-he 46.0 31 0.00068 21.7 3.2 56 41-106 1-61 (81)
37 KOG1150|consensus 41.9 18 0.00039 29.6 1.8 21 95-117 54-74 (250)
38 COG3492 Uncharacterized protei 40.8 53 0.0012 23.7 3.9 36 68-103 39-76 (104)
39 PF10376 Mei5: Double-strand r 39.1 56 0.0012 26.2 4.2 40 66-105 165-216 (221)
40 PF09862 DUF2089: Protein of u 38.9 65 0.0014 23.5 4.2 53 35-100 60-112 (113)
41 PF08281 Sigma70_r4_2: Sigma-7 38.5 10 0.00022 22.6 -0.1 26 88-115 24-49 (54)
42 cd04761 HTH_MerR-SF Helix-Turn 37.6 20 0.00043 20.5 1.1 14 91-104 1-14 (49)
43 PRK12527 RNA polymerase sigma 36.1 48 0.001 23.4 3.2 27 88-116 119-145 (159)
44 PRK09685 DNA-binding transcrip 35.2 1.4E+02 0.003 23.2 5.9 67 34-106 213-280 (302)
45 PF00690 Cation_ATPase_N: Cati 34.9 48 0.001 20.8 2.7 26 90-115 3-28 (69)
46 smart00342 HTH_ARAC helix_turn 34.4 78 0.0017 19.0 3.6 58 39-103 5-63 (84)
47 TIGR01268 Phe4hydrox_tetr phen 33.8 85 0.0018 27.8 4.9 66 31-100 156-242 (436)
48 cd06459 M3B_Oligoendopeptidase 33.0 75 0.0016 26.1 4.2 38 75-116 385-422 (427)
49 PRK13500 transcriptional activ 32.9 1.2E+02 0.0026 24.3 5.3 62 35-106 222-287 (312)
50 cd00870 PI3Ka_III Phosphoinosi 32.8 1.2E+02 0.0026 23.0 5.0 64 40-115 29-96 (166)
51 cd03347 eu_PheOH Eukaryotic ph 32.7 99 0.0021 26.2 4.9 67 31-101 54-141 (306)
52 PF01893 UPF0058: Uncharacteri 32.1 87 0.0019 21.9 3.8 47 70-116 6-53 (89)
53 KOG0113|consensus 31.4 30 0.00065 29.6 1.6 96 17-117 2-122 (335)
54 COG2975 Uncharacterized protei 30.5 74 0.0016 21.2 3.0 32 53-84 32-63 (64)
55 PRK13503 transcriptional activ 30.3 1E+02 0.0022 23.6 4.3 73 34-116 186-272 (278)
56 PRK12518 RNA polymerase sigma 30.0 34 0.00073 24.4 1.5 26 88-115 134-159 (175)
57 PRK12529 RNA polymerase sigma 29.9 26 0.00055 25.6 0.9 26 89-116 142-167 (178)
58 PRK09642 RNA polymerase sigma 29.7 29 0.00063 24.5 1.1 26 88-115 120-145 (160)
59 TIGR02980 SigBFG RNA polymeras 29.7 92 0.002 23.5 4.0 37 64-103 77-119 (227)
60 PF06244 DUF1014: Protein of u 29.4 54 0.0012 24.2 2.5 28 90-117 51-80 (122)
61 PRK09645 RNA polymerase sigma 29.3 35 0.00077 24.4 1.5 27 88-116 132-158 (173)
62 PRK09978 DNA-binding transcrip 28.9 1.6E+02 0.0034 24.2 5.4 62 35-106 158-222 (274)
63 PF08235 LNS2: LNS2 (Lipin/Ned 28.8 1.8E+02 0.0039 22.1 5.4 57 50-106 62-144 (157)
64 cd06170 LuxR_C_like C-terminal 28.7 31 0.00067 19.7 1.0 18 89-106 14-31 (57)
65 KOG3786|consensus 28.7 34 0.00075 30.0 1.6 18 60-77 321-338 (395)
66 PF04157 EAP30: EAP30/Vps36 fa 28.2 94 0.002 24.1 3.9 37 80-116 103-141 (223)
67 PF07962 Swi3: Replication For 28.0 1.9E+02 0.0042 19.4 6.0 51 48-100 12-62 (83)
68 PF01418 HTH_6: Helix-turn-hel 27.8 58 0.0013 21.1 2.3 55 5-63 3-62 (77)
69 COG5451 Predicted secreted pro 27.6 20 0.00044 26.8 -0.0 21 44-64 41-61 (128)
70 PF01466 Skp1: Skp1 family, di 27.2 65 0.0014 21.1 2.4 28 89-116 44-72 (78)
71 PRK12547 RNA polymerase sigma 27.1 33 0.00071 24.6 1.0 27 88-116 126-152 (164)
72 COG4741 Predicted secreted end 27.1 48 0.001 26.0 2.0 40 4-43 86-126 (175)
73 PF04967 HTH_10: HTH DNA bindi 26.9 71 0.0015 20.1 2.4 42 73-116 6-47 (53)
74 TIGR02954 Sig70_famx3 RNA poly 26.9 43 0.00093 23.9 1.6 19 88-106 133-151 (169)
75 TIGR01270 Trp_5_monoox tryptop 26.8 2E+02 0.0044 25.7 6.0 17 31-47 183-199 (464)
76 PRK11511 DNA-binding transcrip 26.8 1.3E+02 0.0029 21.1 4.1 62 35-106 25-90 (127)
77 PF03683 UPF0175: Uncharacteri 26.7 1E+02 0.0022 20.3 3.3 18 87-104 31-48 (76)
78 COG5509 Uncharacterized small 26.4 65 0.0014 21.5 2.2 31 67-97 31-61 (65)
79 PRK12702 mannosyl-3-phosphogly 26.0 69 0.0015 27.0 2.9 43 57-106 94-139 (302)
80 PF09539 DUF2385: Protein of u 25.9 66 0.0014 22.9 2.4 22 44-65 11-32 (96)
81 TIGR02950 SigM_subfam RNA poly 25.9 1.2E+02 0.0025 21.0 3.7 19 88-106 119-137 (154)
82 PRK00430 fis global DNA-bindin 25.6 1E+02 0.0022 21.3 3.3 45 51-106 35-84 (95)
83 smart00422 HTH_MERR helix_turn 25.5 41 0.00089 20.6 1.2 15 91-105 1-15 (70)
84 COG4746 Uncharacterized protei 25.0 54 0.0012 22.8 1.7 19 58-76 15-33 (80)
85 PHA01083 hypothetical protein 24.9 32 0.0007 26.4 0.7 39 53-105 20-61 (149)
86 KOG3547|consensus 24.7 51 0.0011 29.2 2.0 47 5-55 207-253 (450)
87 cd03569 VHS_Hrs_Vps27p VHS dom 24.7 1.4E+02 0.0031 21.8 4.1 32 69-100 101-132 (142)
88 smart00814 Alpha_TIF Alpha tra 24.4 48 0.001 28.7 1.7 47 52-104 36-94 (356)
89 smart00421 HTH_LUXR helix_turn 24.4 41 0.00089 18.9 1.0 18 89-106 17-34 (58)
90 PRK12532 RNA polymerase sigma 24.3 41 0.00089 24.7 1.1 26 88-115 150-175 (195)
91 TIGR01537 portal_HK97 phage po 24.1 1.3E+02 0.0028 24.1 4.0 19 87-105 321-339 (341)
92 cd08306 Death_FADD Fas-associa 24.0 1.3E+02 0.0028 20.2 3.5 51 53-103 17-75 (86)
93 PRK12542 RNA polymerase sigma 24.0 39 0.00085 24.6 1.0 26 89-116 137-162 (185)
94 PRK12523 RNA polymerase sigma 23.9 40 0.00086 24.3 1.0 27 88-116 133-159 (172)
95 PRK09047 RNA polymerase factor 23.8 40 0.00087 23.6 1.0 18 89-106 121-138 (161)
96 cd08803 Death_ank3 Death domai 23.8 75 0.0016 21.6 2.3 18 67-84 41-58 (84)
97 TIGR02999 Sig-70_X6 RNA polyme 23.7 40 0.00088 24.2 1.0 27 88-116 148-174 (183)
98 PF07766 LETM1: LETM1-like pro 23.6 1.9E+02 0.004 23.4 4.9 63 50-116 174-247 (268)
99 cd00864 PI3Ka Phosphoinositide 23.3 1.4E+02 0.0031 22.1 3.9 53 43-103 25-81 (152)
100 TIGR03220 catechol_dmpE 2-oxop 23.3 1.1E+02 0.0023 24.4 3.4 29 71-99 4-37 (255)
101 TIGR02952 Sig70_famx2 RNA poly 23.2 42 0.00091 23.6 1.0 27 88-116 136-162 (170)
102 smart00345 HTH_GNTR helix_turn 23.2 71 0.0015 18.4 1.9 31 76-106 5-36 (60)
103 PF00784 MyTH4: MyTH4 domain; 23.1 20 0.00043 24.9 -0.7 13 36-48 14-26 (114)
104 PRK12525 RNA polymerase sigma 23.0 43 0.00093 24.1 1.0 26 89-116 133-158 (168)
105 TIGR02947 SigH_actino RNA poly 23.0 53 0.0012 24.0 1.5 19 88-106 145-163 (193)
106 PRK12530 RNA polymerase sigma 22.9 43 0.00092 24.7 1.0 27 88-116 148-174 (189)
107 PF04148 Erv26: Transmembrane 22.6 1.1E+02 0.0023 24.6 3.2 31 44-77 73-103 (211)
108 cd03565 VHS_Tom1 VHS domain fa 22.5 1.3E+02 0.0028 22.0 3.5 31 70-101 103-134 (141)
109 smart00581 PSP proline-rich do 22.5 88 0.0019 20.1 2.3 21 85-106 3-23 (54)
110 COG1745 Predicted metal-bindin 22.5 1.9E+02 0.0042 20.6 4.2 50 68-117 4-55 (94)
111 PF05179 CDC73: RNA pol II acc 22.1 47 0.001 27.2 1.1 19 59-77 204-222 (273)
112 cd03567 VHS_GGA VHS domain fam 22.0 1.1E+02 0.0024 22.5 3.1 31 68-101 102-132 (139)
113 PRK13502 transcriptional activ 21.8 1.5E+02 0.0033 22.8 4.0 61 36-106 193-257 (282)
114 PF04405 ScdA_N: Domain of Unk 21.8 49 0.0011 21.0 1.0 19 88-106 29-47 (56)
115 TIGR02689 ars_reduc_gluta arse 21.7 1.1E+02 0.0023 21.5 2.8 11 60-70 94-104 (126)
116 cd08804 Death_ank2 Death domai 21.5 72 0.0016 21.4 1.8 46 53-101 19-75 (84)
117 PRK12531 RNA polymerase sigma 21.5 54 0.0012 24.2 1.3 26 89-116 156-181 (194)
118 cd05782 DNA_polB_like1_exo Unc 21.2 1.5E+02 0.0032 22.8 3.8 41 75-117 143-183 (208)
119 PF09536 DUF2378: Protein of u 21.2 1.7E+02 0.0037 22.5 4.0 37 60-103 40-76 (178)
120 cd07923 Gallate_dioxygenase_C 21.1 50 0.0011 23.4 1.0 18 46-63 57-74 (94)
121 PF04355 SmpA_OmlA: SmpA / Oml 21.1 63 0.0014 20.3 1.4 17 87-104 13-29 (71)
122 PRK15121 right oriC-binding tr 21.0 2.9E+02 0.0063 21.8 5.4 62 35-106 21-86 (289)
123 PRK00423 tfb transcription ini 21.0 39 0.00084 27.7 0.5 15 49-63 220-234 (310)
124 PRK12537 RNA polymerase sigma 21.0 73 0.0016 23.2 1.9 19 88-106 147-165 (182)
125 PRK15044 transcriptional regul 21.0 2.2E+02 0.0049 23.9 5.0 62 35-106 208-272 (295)
126 PRK09393 ftrA transcriptional 20.9 2.4E+02 0.0053 22.5 5.0 62 35-106 234-299 (322)
127 PF14475 Mso1_Sec1_bdg: Sec1-b 20.7 70 0.0015 19.6 1.4 13 49-64 28-40 (41)
128 PRK12522 RNA polymerase sigma 20.6 64 0.0014 23.1 1.5 19 88-106 133-151 (173)
129 PF08891 YfcL: YfcL protein; 20.3 2.2E+02 0.0047 19.8 4.0 56 42-101 21-77 (85)
130 TIGR03377 glycerol3P_GlpA glyc 20.2 1E+02 0.0022 26.5 2.9 36 45-83 449-485 (516)
131 PRK10219 DNA-binding transcrip 20.2 2.7E+02 0.0059 18.4 6.3 61 36-106 22-86 (107)
132 PF14769 CLAMP: Flagellar C1a 20.2 1.5E+02 0.0033 20.2 3.2 43 53-101 4-46 (101)
133 cd00093 HTH_XRE Helix-turn-hel 20.1 72 0.0016 17.0 1.4 13 92-104 43-55 (58)
134 PF00382 TFIIB: Transcription 20.1 2.4E+02 0.0051 17.6 4.8 12 52-63 1-12 (71)
135 PRK09637 RNA polymerase sigma 20.0 78 0.0017 23.3 1.9 19 88-106 120-138 (181)
No 1
>KOG1789|consensus
Probab=100.00 E-value=1.6e-50 Score=370.08 Aligned_cols=118 Identities=50% Similarity=0.994 Sum_probs=114.5
Q ss_pred CHHHHHHHHHhHhhccchhhccCccccccccccCccCccccccchhhhHHHHHHHhccCCCCCCCCccHHHHHHHHHHHH
Q psy3080 1 MRRLLIGKIAAHIADFTPRLRGNNRAVYQFCGIPAVRYPQLEQEMFVNIFYLRHLCDTAKFPDWPINQPIMLLKDVLEAW 80 (118)
Q Consensus 1 MR~~li~~i~~Hl~df~~rL~~~~~a~y~y~pip~i~YpeL~~elfC~~yYLr~lcd~~rfp~wpI~dpv~lL~~lL~~W 80 (118)
|||+||++|++||+||++||.+|++|+|||||||+|.||||++|||||+||||||||++|||||||+|||.||+++|.+|
T Consensus 1188 MRr~lIe~ia~HLaDf~~rL~sn~raLYqYcPiP~i~YPeL~~ElfCh~YYLr~LCD~~rFPdWPI~dpV~fL~~~L~~W 1267 (2235)
T KOG1789|consen 1188 MRRHLIERIAVHLADFSHRLTSNVRALYQYCPIPLIDYPELAQELFCHVYYLRHLCDKQRFPDWPIRDPVPFLRCCLATW 1267 (2235)
T ss_pred HHHHHHHHHHHHHhccCHhHHHhHHHHHHcCCCCCCCcHHHHHHHHHHHHHHHHHhccccCCCCcccCchHHHHHHHHHH
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHhhcCCCCCCHHHHHHHcCCCCCCC-CCh-HHHHhhcC
Q psy3080 81 KAEVERKPPEMSVDDAYEALGLTRGSH-HEE-NIVRKAYY 118 (118)
Q Consensus 81 r~E~~kkp~~mS~edAy~vLGL~~~a~-~se-~~IRrAY~ 118 (118)
++|++|||+.||+++||+||.++-+.. +|+ +.|||+||
T Consensus 1268 ~~ElekKP~~mS~d~A~eiL~i~l~n~~hD~~~KirrqY~ 1307 (2235)
T KOG1789|consen 1268 YNELEKKPATMSVDLAREILSVDLTNEEHDKPAKIRRQYY 1307 (2235)
T ss_pred HHHHhcCCCccchHHHHHHhccccCCCCcccHHHHHHHHH
Confidence 999999999999999999999998774 355 99999997
No 2
>PRK09430 djlA Dna-J like membrane chaperone protein; Provisional
Probab=98.53 E-value=1.7e-07 Score=75.69 Aligned_cols=59 Identities=19% Similarity=0.409 Sum_probs=52.8
Q ss_pred HHHHHHhccCCCCCCCCccHHHHHHHHHHHHHHHhhcC-----------CCCCCHHHHHHHcCCCCCCCCChHHHHhhc
Q psy3080 50 FYLRHLCDTAKFPDWPINQPIMLLKDVLEAWKAEVERK-----------PPEMSVDDAYEALGLTRGSHHEENIVRKAY 117 (118)
Q Consensus 50 yYLr~lcd~~rfp~wpI~dpv~lL~~lL~~Wr~E~~kk-----------p~~mS~edAy~vLGL~~~a~~se~~IRrAY 117 (118)
-.|+++|+.+|+++ ..+..++.++..+.+++ +..++.++||++|||++++ |+++|||||
T Consensus 152 ~~L~~Ia~~Lgis~-------~df~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~~~~ay~vLgv~~~a--s~~eIk~aY 221 (267)
T PRK09430 152 QVLYVIAEELGFSR-------FQFDQLLRMMQAGFRFQQQQGGGGYQQAQRGPTLEDAYKVLGVSESD--DDQEIKRAY 221 (267)
T ss_pred HHHHHHHHHcCCCH-------HHHHHHHHHHHHHHhhcccccccccccccCCCcHHhHHHHcCCCCCC--CHHHHHHHH
Confidence 48999999999999 99999999999875552 2368999999999999999 999999999
No 3
>PTZ00100 DnaJ chaperone protein; Provisional
Probab=97.55 E-value=6.9e-05 Score=54.77 Aligned_cols=28 Identities=25% Similarity=0.401 Sum_probs=26.2
Q ss_pred CCCCCHHHHHHHcCCCCCCCCChHHHHhhc
Q psy3080 88 PPEMSVDDAYEALGLTRGSHHEENIVRKAY 117 (118)
Q Consensus 88 p~~mS~edAy~vLGL~~~a~~se~~IRrAY 117 (118)
...||.+|||++|||++++ |.++||+||
T Consensus 59 ~~~Ms~~eAy~ILGv~~~A--s~~eIkkaY 86 (116)
T PTZ00100 59 ENPMSKSEAYKILNISPTA--SKERIREAH 86 (116)
T ss_pred cCCCCHHHHHHHcCCCCCC--CHHHHHHHH
Confidence 4589999999999999999 999999998
No 4
>PF00226 DnaJ: DnaJ domain; InterPro: IPR001623 The prokaryotic heat shock protein DnaJ interacts with the chaperone hsp70-like DnaK protein []. Structurally, the DnaJ protein consists of an N-terminal conserved domain (called 'J' domain) of about 70 amino acids, a glycine-rich region ('G' domain') of about 30 residues, a central domain containing four repeats of a CXXCXGXG motif ('CRR' domain) and a C-terminal region of 120 to 170 residues. Such a structure is shown in the following schematic representation: +------------+-+-------+-----+-----------+--------------------------------+ | N-terminal | | Gly-R | | CXXCXGXG | C-terminal | +------------+-+-------+-----+-----------+--------------------------------+ It is thought that the 'J' domain of DnaJ mediates the interaction with the dnaK protein and consists of four helices, the second of which has a charged surface that includes at least one pair of basic residues that are essential for interaction with the ATPase domain of Hsp70. The J- and CRR-domains are found in many prokaryotic and eukaryotic proteins [], either together or separately. In yeast, J-domains have been classified into 3 groups; the class III proteins are functionally distinct and do not appear to act as molecular chaperones []. ; GO: 0031072 heat shock protein binding; PDB: 2GUZ_C 2L6L_A 1HDJ_A 2EJ7_A 1FPO_C 2CUG_A 2QSA_A 2OCH_A 3BVO_B 3APQ_A ....
Probab=96.81 E-value=0.00099 Score=41.77 Aligned_cols=21 Identities=38% Similarity=0.768 Sum_probs=19.9
Q ss_pred HHHHHcCCCCCCCCChHHHHhhc
Q psy3080 95 DAYEALGLTRGSHHEENIVRKAY 117 (118)
Q Consensus 95 dAy~vLGL~~~a~~se~~IRrAY 117 (118)
+.|++|||++++ +.++||+||
T Consensus 1 ~~y~iLgl~~~~--~~~eik~~y 21 (64)
T PF00226_consen 1 NPYEILGLPPDA--SDEEIKKAY 21 (64)
T ss_dssp HHHHHCTSTTTS--SHHHHHHHH
T ss_pred ChHHHCCCCCCC--CHHHHHHHH
Confidence 579999999999 999999998
No 5
>smart00271 DnaJ DnaJ molecular chaperone homology domain.
Probab=96.71 E-value=0.0011 Score=40.83 Aligned_cols=22 Identities=41% Similarity=0.694 Sum_probs=20.2
Q ss_pred HHHHHcCCCCCCCCChHHHHhhcC
Q psy3080 95 DAYEALGLTRGSHHEENIVRKAYY 118 (118)
Q Consensus 95 dAy~vLGL~~~a~~se~~IRrAY~ 118 (118)
+.|++|||++++ +.++||+||.
T Consensus 2 ~~y~vLgl~~~~--~~~~ik~ay~ 23 (60)
T smart00271 2 DYYEILGVPRDA--SLDEIKKAYR 23 (60)
T ss_pred CHHHHcCCCCCC--CHHHHHHHHH
Confidence 689999999998 9999999983
No 6
>PF13446 RPT: A repeated domain in UCH-protein
Probab=96.49 E-value=0.0031 Score=40.20 Aligned_cols=26 Identities=31% Similarity=0.721 Sum_probs=23.1
Q ss_pred CCCHHHHHHHcCCCCCCCCChHHHHhhc
Q psy3080 90 EMSVDDAYEALGLTRGSHHEENIVRKAY 117 (118)
Q Consensus 90 ~mS~edAy~vLGL~~~a~~se~~IRrAY 117 (118)
.||.++||+.||++++. +|..|-.+|
T Consensus 1 ~~~~~~Ay~~Lgi~~~~--~Dd~Ii~~f 26 (62)
T PF13446_consen 1 YMDVEEAYEILGIDEDT--DDDFIISAF 26 (62)
T ss_pred CCCHHHHHHHhCcCCCC--CHHHHHHHH
Confidence 48999999999998888 999988776
No 7
>cd06257 DnaJ DnaJ domain or J-domain. DnaJ/Hsp40 (heat shock protein 40) proteins are highly conserved and play crucial roles in protein translation, folding, unfolding, translocation, and degradation. They act primarily by stimulating the ATPase activity of Hsp70s, an important chaperonine family. Hsp40 proteins are characterized by the presence of a J domain, which mediates the interaction with Hsp70. They may contain other domains as well, and the architectures provide a means of classification.
Probab=96.40 E-value=0.0022 Score=38.64 Aligned_cols=22 Identities=32% Similarity=0.659 Sum_probs=19.9
Q ss_pred HHHHHcCCCCCCCCChHHHHhhcC
Q psy3080 95 DAYEALGLTRGSHHEENIVRKAYY 118 (118)
Q Consensus 95 dAy~vLGL~~~a~~se~~IRrAY~ 118 (118)
+.|++|||++++ +.++||+||.
T Consensus 1 ~~y~vLgl~~~~--~~~~ik~~y~ 22 (55)
T cd06257 1 DYYDILGVPPDA--SDEEIKKAYR 22 (55)
T ss_pred ChHHHcCCCCCC--CHHHHHHHHH
Confidence 479999999999 9999999983
No 8
>KOG0721|consensus
Probab=96.36 E-value=0.0018 Score=52.39 Aligned_cols=23 Identities=48% Similarity=0.752 Sum_probs=21.1
Q ss_pred HHHHHHHcCCCCCCCCChHHHHhhc
Q psy3080 93 VDDAYEALGLTRGSHHEENIVRKAY 117 (118)
Q Consensus 93 ~edAy~vLGL~~~a~~se~~IRrAY 117 (118)
.=|-|+||||++|+ |++||||||
T Consensus 98 ~fDPyEILGl~pga--s~~eIKkaY 120 (230)
T KOG0721|consen 98 KFDPYEILGLDPGA--SEKEIKKAY 120 (230)
T ss_pred cCCcHHhhCCCCCC--CHHHHHHHH
Confidence 34779999999999 999999999
No 9
>PTZ00037 DnaJ_C chaperone protein; Provisional
Probab=96.04 E-value=0.0042 Score=53.42 Aligned_cols=27 Identities=30% Similarity=0.559 Sum_probs=24.0
Q ss_pred CCCHHHHHHHcCCCCCCCCChHHHHhhcC
Q psy3080 90 EMSVDDAYEALGLTRGSHHEENIVRKAYY 118 (118)
Q Consensus 90 ~mS~edAy~vLGL~~~a~~se~~IRrAY~ 118 (118)
.|...+.|++|||+++| |+++|||||.
T Consensus 24 ~~~~~d~Y~vLGV~~~A--s~~eIKkAYr 50 (421)
T PTZ00037 24 EVDNEKLYEVLNLSKDC--TTSEIKKAYR 50 (421)
T ss_pred cccchhHHHHcCCCCCC--CHHHHHHHHH
Confidence 44568999999999999 9999999993
No 10
>KOG1789|consensus
Probab=95.82 E-value=0.0097 Score=57.84 Aligned_cols=54 Identities=26% Similarity=0.482 Sum_probs=47.8
Q ss_pred ccccccCccCccccccchhhhHHHHHHHhccCCCCCCCCccHHHHHHHHHHHHHH
Q psy3080 28 YQFCGIPAVRYPQLEQEMFVNIFYLRHLCDTAKFPDWPINQPIMLLKDVLEAWKA 82 (118)
Q Consensus 28 y~y~pip~i~YpeL~~elfC~~yYLr~lcd~~rfp~wpI~dpv~lL~~lL~~Wr~ 82 (118)
|.|.-. .|+||.|..|+--+.||||-|..+-.=.+.||.||++|++++-.+.+-
T Consensus 794 wn~sef-~i~y~sl~~ei~ig~yylrlll~~~~~~~~p~~~p~~ff~~~yhrf~~ 847 (2235)
T KOG1789|consen 794 WNHTEF-QIRYPSLLEEIKIGDYYLRLLLIEADENATPIHNPLEFFNNVYHRFLL 847 (2235)
T ss_pred ccccee-eeechhhHhhhhHhHHHHHHHHHhccCCCCcccCHHHHHHHHHHHHhc
Confidence 555554 799999999999999999999999888899999999999999887654
No 11
>KOG0715|consensus
Probab=95.39 E-value=0.0065 Score=49.85 Aligned_cols=29 Identities=24% Similarity=0.473 Sum_probs=25.3
Q ss_pred CCCCCHHHHHHHcCCCCCCCCChHHHHhhcC
Q psy3080 88 PPEMSVDDAYEALGLTRGSHHEENIVRKAYY 118 (118)
Q Consensus 88 p~~mS~edAy~vLGL~~~a~~se~~IRrAY~ 118 (118)
...++.++.|+||||+.++ +.+|||+||+
T Consensus 37 s~~~~~~d~Y~vLgv~~~A--t~~EIK~Af~ 65 (288)
T KOG0715|consen 37 SRIISKEDYYKVLGVSRNA--TLSEIKSAFR 65 (288)
T ss_pred cccCCCcchhhhhCcCCCC--CHHHHHHHHH
Confidence 4455666999999999999 9999999996
No 12
>PF03656 Pam16: Pam16; InterPro: IPR005341 The Pam16 protein is the fifth essential subunit of the pre-sequence translocase-associated protein import motor (PAM) []. In Saccharomyces cerevisiae (Baker's yeast), Pam16 is required for preprotein translocation into the matrix, but not for protein insertion into the inner membrane [].; PDB: 2GUZ_J.
Probab=95.30 E-value=0.01 Score=43.90 Aligned_cols=26 Identities=23% Similarity=0.428 Sum_probs=17.6
Q ss_pred CCCHHHHHHHcCCCCCCCCChHHHHhhc
Q psy3080 90 EMSVDDAYEALGLTRGSHHEENIVRKAY 117 (118)
Q Consensus 90 ~mS~edAy~vLGL~~~a~~se~~IRrAY 117 (118)
+||.+||++||||++.. +.++|.+-|
T Consensus 54 ~Mtl~EA~~ILnv~~~~--~~eeI~k~y 79 (127)
T PF03656_consen 54 GMTLDEARQILNVKEEL--SREEIQKRY 79 (127)
T ss_dssp ---HHHHHHHHT--G----SHHHHHHHH
T ss_pred CCCHHHHHHHcCCCCcc--CHHHHHHHH
Confidence 69999999999999977 889998876
No 13
>KOG0723|consensus
Probab=93.84 E-value=0.057 Score=39.52 Aligned_cols=27 Identities=30% Similarity=0.464 Sum_probs=24.8
Q ss_pred CCCCCHHHHHHHcCCCCCCCCChHHHHhh
Q psy3080 88 PPEMSVDDAYEALGLTRGSHHEENIVRKA 116 (118)
Q Consensus 88 p~~mS~edAy~vLGL~~~a~~se~~IRrA 116 (118)
.+.||.-||-.||||+++. +.+.||.|
T Consensus 50 ~~kMsr~EA~lIL~v~~s~--~k~Kikea 76 (112)
T KOG0723|consen 50 EPKMSRREAALILGVTPSL--DKDKIKEA 76 (112)
T ss_pred ccccchHHHHHHhCCCccc--cHHHHHHH
Confidence 4589999999999999998 99999987
No 14
>KOG0722|consensus
Probab=92.96 E-value=0.045 Score=45.85 Aligned_cols=24 Identities=29% Similarity=0.674 Sum_probs=22.7
Q ss_pred HHHHHHHcCCCCCCCCChHHHHhhcC
Q psy3080 93 VDDAYEALGLTRGSHHEENIVRKAYY 118 (118)
Q Consensus 93 ~edAy~vLGL~~~a~~se~~IRrAY~ 118 (118)
.+++|+||||+.++ +.++|.|||+
T Consensus 32 ~enCYdVLgV~Rea--~KseIakAYR 55 (329)
T KOG0722|consen 32 AENCYDVLGVAREA--NKSEIAKAYR 55 (329)
T ss_pred chhHHHHhhhhhhc--cHHHHHHHHH
Confidence 69999999999999 9999999994
No 15
>KOG3442|consensus
Probab=92.96 E-value=0.23 Score=37.25 Aligned_cols=27 Identities=15% Similarity=0.335 Sum_probs=23.7
Q ss_pred CCCCHHHHHHHcCCCCCCCCChHHHHhhc
Q psy3080 89 PEMSVDDAYEALGLTRGSHHEENIVRKAY 117 (118)
Q Consensus 89 ~~mS~edAy~vLGL~~~a~~se~~IRrAY 117 (118)
++||++||.+||+|++.. +.++|-+-|
T Consensus 54 ~~iTlqEa~qILnV~~~l--n~eei~k~y 80 (132)
T KOG3442|consen 54 GKITLQEAQQILNVKEPL--NREEIEKRY 80 (132)
T ss_pred ccccHHHHhhHhCCCCCC--CHHHHHHHH
Confidence 579999999999999977 888887766
No 16
>KOG0720|consensus
Probab=91.40 E-value=0.1 Score=46.30 Aligned_cols=23 Identities=39% Similarity=0.657 Sum_probs=21.1
Q ss_pred HHHHHHHcCCCCCCCCChHHHHhhc
Q psy3080 93 VDDAYEALGLTRGSHHEENIVRKAY 117 (118)
Q Consensus 93 ~edAy~vLGL~~~a~~se~~IRrAY 117 (118)
.-|||.+|||.++. |+++|||.|
T Consensus 234 ~~daYsvlGl~~d~--sd~~lKk~Y 256 (490)
T KOG0720|consen 234 ILDAYSALGLPSDC--SDADLKKNY 256 (490)
T ss_pred CCCchhhcCCCCCC--CHHHHHHHH
Confidence 45899999999888 999999998
No 17
>KOG0716|consensus
Probab=91.30 E-value=0.1 Score=43.37 Aligned_cols=27 Identities=30% Similarity=0.569 Sum_probs=23.4
Q ss_pred CCCHHHHHHHcCCCCCCCCChHHHHhhcC
Q psy3080 90 EMSVDDAYEALGLTRGSHHEENIVRKAYY 118 (118)
Q Consensus 90 ~mS~edAy~vLGL~~~a~~se~~IRrAY~ 118 (118)
+....+=|.||||++++ ++++|||||+
T Consensus 27 ~~~~~~LYdVLgl~k~a--t~d~IKKaYR 53 (279)
T KOG0716|consen 27 DVIRLDLYDVLGLPKTA--TKDEIKKAYR 53 (279)
T ss_pred ccchhHHHHHhCCCccc--chHHHHHHHH
Confidence 34477899999999999 8899999995
No 18
>PTZ00341 Ring-infected erythrocyte surface antigen; Provisional
Probab=90.42 E-value=0.14 Score=49.15 Aligned_cols=24 Identities=17% Similarity=0.369 Sum_probs=22.1
Q ss_pred HHHHHHHcCCCCCCCCChHHHHhhcC
Q psy3080 93 VDDAYEALGLTRGSHHEENIVRKAYY 118 (118)
Q Consensus 93 ~edAy~vLGL~~~a~~se~~IRrAY~ 118 (118)
..+-|++|||+++| |.++|||||+
T Consensus 572 d~dYYdILGVs~dA--S~~EIKKAYR 595 (1136)
T PTZ00341 572 DTLFYDILGVGVNA--DMKEISERYF 595 (1136)
T ss_pred CCChHHHcCCCCCC--CHHHHHHHHH
Confidence 45889999999999 9999999995
No 19
>COG5407 SEC63 Preprotein translocase subunit Sec63 [Intracellular trafficking and secretion]
Probab=87.14 E-value=0.33 Score=43.60 Aligned_cols=22 Identities=36% Similarity=0.668 Sum_probs=20.5
Q ss_pred HHHHHHcCCCCCCCCChHHHHhhc
Q psy3080 94 DDAYEALGLTRGSHHEENIVRKAY 117 (118)
Q Consensus 94 edAy~vLGL~~~a~~se~~IRrAY 117 (118)
=|-|+|||++.++ |+.+|||+|
T Consensus 98 fDPyEILGI~~~t--s~rdik~~y 119 (610)
T COG5407 98 FDPYEILGIDQDT--SERDIKKRY 119 (610)
T ss_pred CChHHhhcccCCC--cHHHHHHHH
Confidence 4779999999999 999999998
No 20
>COG1076 DjlA DnaJ-domain-containing proteins 1 [Posttranslational modification, protein turnover, chaperones]
Probab=78.37 E-value=1.5 Score=33.10 Aligned_cols=22 Identities=32% Similarity=0.697 Sum_probs=21.0
Q ss_pred HHHHHHcCCCCCCCCChHHHHhhc
Q psy3080 94 DDAYEALGLTRGSHHEENIVRKAY 117 (118)
Q Consensus 94 edAy~vLGL~~~a~~se~~IRrAY 117 (118)
.++|.+||++.++ +-++|+++|
T Consensus 113 ~~~l~~l~~~~~~--~~~~i~~~~ 134 (174)
T COG1076 113 EDALKVLGVEIKA--DQDAIKKAY 134 (174)
T ss_pred hhHHHHhcCchhh--hHHHHHHHH
Confidence 8999999999999 999999987
No 21
>KOG0624|consensus
Probab=73.70 E-value=1.9 Score=37.95 Aligned_cols=22 Identities=36% Similarity=0.679 Sum_probs=20.6
Q ss_pred HHHHHHcCCCCCCCCChHHHHhhc
Q psy3080 94 DDAYEALGLTRGSHHEENIVRKAY 117 (118)
Q Consensus 94 edAy~vLGL~~~a~~se~~IRrAY 117 (118)
-|=|+||||..++ +.++|-|||
T Consensus 394 RDYYKILGVkRnA--sKqEI~KAY 415 (504)
T KOG0624|consen 394 RDYYKILGVKRNA--SKQEITKAY 415 (504)
T ss_pred chHHHHhhhcccc--cHHHHHHHH
Confidence 5779999999999 999999998
No 22
>PF04433 SWIRM: SWIRM domain; InterPro: IPR007526 The SWIRM domain is a small alpha-helical domain of about 85 amino acid residues found in eukaryotic chromosomal proteins. It is named after the proteins SWI3, RSC8 and MOIRA in which it was first recognised. This domain is predicted to mediate protein-protein interactions in the assembly of chromatin-protein complexes. The SWIRM domain can be linked to different domains, such as the ZZ-type zinc finger (IPR000433 from INTERPRO), the Myb DNA-binding domain (IPR001005 from INTERPRO), the HORMA domain (IPR003511 from INTERPRO), the amino-oxidase domain, the chromo domain (IPR000953 from INTERPRO), and the JAB1/PAD1 domain.; GO: 0005515 protein binding; PDB: 2Z3Y_A 2UXN_A 2Y48_A 2HKO_A 2XAF_A 2X0L_A 2XAJ_A 2UXX_A 2V1D_A 2L3D_A ....
Probab=72.72 E-value=9.1 Score=25.42 Aligned_cols=43 Identities=23% Similarity=0.494 Sum_probs=29.6
Q ss_pred ccHHHHH---HHHHHHHHHHhhcCCCCCCHHHHHHHcCCCCCCCCChHHHHhhc
Q psy3080 67 NQPIMLL---KDVLEAWKAEVERKPPEMSVDDAYEALGLTRGSHHEENIVRKAY 117 (118)
Q Consensus 67 ~dpv~lL---~~lL~~Wr~E~~kkp~~mS~edAy~vLGL~~~a~~se~~IRrAY 117 (118)
+.|...| ..++..|+.. ....+|.+||++.+. +. +...+.|.|
T Consensus 30 ~~p~~Yl~iRn~il~~w~~n---~~~~lt~~~~~~~i~---~~--d~~~~~ri~ 75 (86)
T PF04433_consen 30 KTPEQYLKIRNTILAEWRKN---PNKYLTKTDARKLIK---GI--DVNKIRRIY 75 (86)
T ss_dssp CHHHHHHHHHHHHHHHHHHH---TTS---HHHHHHHTT---SS--SHHHHHHHH
T ss_pred CChHHHHHHHHHHHHHHHHC---CCCcccHHHHHHHcc---cc--CHHHHHHHH
Confidence 4566655 3678999776 677899999999998 43 667777765
No 23
>KOG0568|consensus
Probab=71.91 E-value=2.6 Score=35.21 Aligned_cols=23 Identities=26% Similarity=0.687 Sum_probs=21.5
Q ss_pred HHHHHHHcCCCCCCCCChHHHHhhc
Q psy3080 93 VDDAYEALGLTRGSHHEENIVRKAY 117 (118)
Q Consensus 93 ~edAy~vLGL~~~a~~se~~IRrAY 117 (118)
+-|+|.+|||.+|+ |..+||.||
T Consensus 46 ~~e~fril~v~e~~--~adevr~af 68 (342)
T KOG0568|consen 46 IMECFRILGVEEGA--DADEVREAF 68 (342)
T ss_pred HHHHHHHhcccccC--chhHHHHHH
Confidence 67999999999999 999999987
No 24
>KOG0550|consensus
Probab=64.07 E-value=2.9 Score=37.25 Aligned_cols=23 Identities=35% Similarity=0.732 Sum_probs=21.2
Q ss_pred HHHHHHHcCCCCCCCCChHHHHhhc
Q psy3080 93 VDDAYEALGLTRGSHHEENIVRKAY 117 (118)
Q Consensus 93 ~edAy~vLGL~~~a~~se~~IRrAY 117 (118)
..|=|+|||++..+ ++.+|+|||
T Consensus 372 Rkd~ykilGi~~~a--s~~eikkay 394 (486)
T KOG0550|consen 372 RKDWYKILGISRNA--SDDEIKKAY 394 (486)
T ss_pred hhhHHHHhhhhhhc--ccchhhhHH
Confidence 56779999999999 999999998
No 25
>PF12728 HTH_17: Helix-turn-helix domain
Probab=61.40 E-value=6 Score=23.42 Aligned_cols=16 Identities=38% Similarity=0.729 Sum_probs=13.7
Q ss_pred CCHHHHHHHcCCCCCC
Q psy3080 91 MSVDDAYEALGLTRGS 106 (118)
Q Consensus 91 mS~edAy~vLGL~~~a 106 (118)
||.+||.+.||++.+.
T Consensus 2 lt~~e~a~~l~is~~t 17 (51)
T PF12728_consen 2 LTVKEAAELLGISRST 17 (51)
T ss_pred CCHHHHHHHHCcCHHH
Confidence 7899999999998644
No 26
>PF02042 RWP-RK: RWP-RK domain; InterPro: IPR003035 This domain is named RWP-RK after a conserved motif at the C terminus of the domain. The domain is found in algal minus dominance proteins as well as plant proteins involved in nitrogen-controlled development [].
Probab=57.62 E-value=6.9 Score=24.90 Aligned_cols=27 Identities=26% Similarity=0.597 Sum_probs=22.2
Q ss_pred cccchhhhHHHHHHHhccCCCCCCCCc
Q psy3080 41 LEQEMFVNIFYLRHLCDTAKFPDWPIN 67 (118)
Q Consensus 41 L~~elfC~~yYLr~lcd~~rfp~wpI~ 67 (118)
-..+|=++.=-|..+|.++|.+.||-+
T Consensus 21 AA~~Lgv~~T~LKr~CR~~GI~RWP~R 47 (52)
T PF02042_consen 21 AAKELGVSVTTLKRRCRRLGIPRWPYR 47 (52)
T ss_pred HHHHhCCCHHHHHHHHHHcCCCCCCch
Confidence 345666778889999999999999963
No 27
>TIGR01764 excise DNA binding domain, excisionase family. An excisionase, or Xis protein, is a small protein that binds and promotes excisive recombination; it is not enzymatically active. This model represents a number of putative excisionases and related proteins from temperate phage, plasmids, and transposons, as well as DNA binding domains of other proteins, such as a DNA modification methylase. This model identifies mostly small proteins and N-terminal regions of large proteins, but some proteins appear to have two copies. This domain appears similar, in both sequence and predicted secondary structure (PSIPRED) to the MerR family of transcriptional regulators (pfam00376).
Probab=57.59 E-value=6.5 Score=22.19 Aligned_cols=16 Identities=31% Similarity=0.686 Sum_probs=13.8
Q ss_pred CCHHHHHHHcCCCCCC
Q psy3080 91 MSVDDAYEALGLTRGS 106 (118)
Q Consensus 91 mS~edAy~vLGL~~~a 106 (118)
||.+||.+.||++++.
T Consensus 2 lt~~e~a~~lgis~~t 17 (49)
T TIGR01764 2 LTVEEAAEYLGVSKDT 17 (49)
T ss_pred CCHHHHHHHHCCCHHH
Confidence 7899999999998754
No 28
>PF07455 Psu: Phage polarity suppression protein (Psu); InterPro: IPR010006 This entry is represented by Bacteriophage P4, Psu, the polarity suppression protein. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family contains a number of phage polarity suppression proteins (Psu) (approximately 190 residues long). The Psu protein of Bacteriophage P4 causes suppression of transcriptional polarity in Escherichia coli by overcoming Rho termination factor activity [].
Probab=52.35 E-value=20 Score=28.49 Aligned_cols=42 Identities=17% Similarity=0.283 Sum_probs=37.0
Q ss_pred CCccHHHHHHHHHHHHHHHhhcCCCCCCHHHHHHHcCCCCCC
Q psy3080 65 PINQPIMLLKDVLEAWKAEVERKPPEMSVDDAYEALGLTRGS 106 (118)
Q Consensus 65 pI~dpv~lL~~lL~~Wr~E~~kkp~~mS~edAy~vLGL~~~a 106 (118)
-++||...|++.|..|-...+.-..+|...|-...-|+.|++
T Consensus 119 av~~a~~~LReaL~~~lt~~~~inya~~d~DIL~~iG~rPD~ 160 (188)
T PF07455_consen 119 AVNRADTYLREALSRWLTAGAEINYAAQDRDILTAIGFRPDA 160 (188)
T ss_pred ccccHHHHHHHHHHHHHccCCcCCcccchhhHHHhccCCCCh
Confidence 588999999999999999776667788888899999999977
No 29
>PF07099 DUF1361: Protein of unknown function (DUF1361); InterPro: IPR009793 This family consists of several hypothetical bacterial proteins of around 200 residues in length. The function of this family is unknown although some members are annotated as being putative integral membrane proteins.
Probab=50.94 E-value=18 Score=27.33 Aligned_cols=32 Identities=25% Similarity=0.576 Sum_probs=27.5
Q ss_pred hHHHHHHHhccCCCCCCCC-ccHHHHHHHHHHHHHH
Q psy3080 48 NIFYLRHLCDTAKFPDWPI-NQPIMLLKDVLEAWKA 82 (118)
Q Consensus 48 ~~yYLr~lcd~~rfp~wpI-~dpv~lL~~lL~~Wr~ 82 (118)
=+.||.++ +|+..|.| .+|..+++++.+.|..
T Consensus 119 ~GIYlGRf---lR~NSWDi~~~P~~l~~~i~~~l~~ 151 (168)
T PF07099_consen 119 FGIYLGRF---LRLNSWDILTNPQSLIRDILSSLSP 151 (168)
T ss_pred HHHHHHhh---cccchhHHhCCHHHHHHHHHHHhhh
Confidence 36788877 79999999 7999999999999943
No 30
>cd00872 PI3Ka_I Phosphoinositide 3-kinase (PI3K) class I, accessory domain ; PIK domain is conserved in all PI3 and PI4-kinases. Its role is unclear but it has been suggested to be involved in substrate presentation. In general, PI3K class I prefer phosphoinositol (4,5)-bisphosphate as a substrate. Mammalian members interact with active Ras. They form heterodimers with adapter molecules linking them to different signaling pathways.
Probab=49.40 E-value=36 Score=26.13 Aligned_cols=61 Identities=18% Similarity=0.356 Sum_probs=42.6
Q ss_pred ccccchhhhHHHHHH-------HhccCCCCCCCCccHHHHHHHHHHHHHHHhhcCCCCCCHHHHHHHcCCCCCCCCChHH
Q psy3080 40 QLEQEMFVNIFYLRH-------LCDTAKFPDWPINQPIMLLKDVLEAWKAEVERKPPEMSVDDAYEALGLTRGSHHEENI 112 (118)
Q Consensus 40 eL~~elfC~~yYLr~-------lcd~~rfp~wpI~dpv~lL~~lL~~Wr~E~~kkp~~mS~edAy~vLGL~~~a~~se~~ 112 (118)
+-.+-+|=+.||+.+ +.... +|--.+-|....++|..| +.++.+||.+.|+-. +.+..
T Consensus 22 eek~llW~~R~~~~~~p~aL~~~l~sv---~w~~~~~v~e~~~lL~~W--------~~i~~~~aLeLL~~~----f~d~~ 86 (171)
T cd00872 22 EDKELLWKLRHECRKKPQALPKLLLSV---KWNKRDDVAQMYQLLKRW--------PKLKPEQALELLDCN----FPDEH 86 (171)
T ss_pred HHHHHHHHHHHHHhhCcHHHHHHHhhC---CCCCHHHHHHHHHHHHCC--------CCCCHHHHHHHCCCc----CCCHH
Confidence 334557777888753 33333 676677788888899988 557999999999843 24466
Q ss_pred HHh
Q psy3080 113 VRK 115 (118)
Q Consensus 113 IRr 115 (118)
||+
T Consensus 87 VR~ 89 (171)
T cd00872 87 VRE 89 (171)
T ss_pred HHH
Confidence 764
No 31
>PF08006 DUF1700: Protein of unknown function (DUF1700); InterPro: IPR012963 This family contains many hypothetical bacterial proteins and two putative membrane proteins (Q6GFD0 from SWISSPROT and Q6G806 from SWISSPROT).
Probab=48.11 E-value=32 Score=25.67 Aligned_cols=47 Identities=19% Similarity=0.328 Sum_probs=34.3
Q ss_pred HHHHHhccCC-CCCCCCccHHHHHHHHHHHHHHHhhcC-CCCCCHHHHHHHcCCCC
Q psy3080 51 YLRHLCDTAK-FPDWPINQPIMLLKDVLEAWKAEVERK-PPEMSVDDAYEALGLTR 104 (118)
Q Consensus 51 YLr~lcd~~r-fp~wpI~dpv~lL~~lL~~Wr~E~~kk-p~~mS~edAy~vLGL~~ 104 (118)
||..|-..++ .|. ...+++++-.+.-++-. ..+.|++|+.+.||=+.
T Consensus 6 fL~~L~~~L~~lp~-------~e~~e~l~~Y~e~f~d~~~~G~sEeeii~~LG~P~ 54 (181)
T PF08006_consen 6 FLNELEKYLKKLPE-------EEREEILEYYEEYFDDAGEEGKSEEEIIAELGSPK 54 (181)
T ss_pred HHHHHHHHHHcCCH-------HHHHHHHHHHHHHHHHhhhCCCCHHHHHHHcCCHH
Confidence 5666666654 344 77888888888777664 55789999999999554
No 32
>cd04762 HTH_MerR-trunc Helix-Turn-Helix DNA binding domain of truncated MerR-like proteins. Proteins in this family mostly have a truncated helix-turn-helix (HTH) MerR-like domain. They lack a portion of the C-terminal region, called Wing 2 and the long dimerization helix that is typically present in MerR-like proteins. These truncated domains are found in response regulator receiver (REC) domain proteins (i.e., CheY), cytosine-C5 specific DNA methylases, IS607 transposase-like proteins, and RacA, a bacterial protein that anchors chromosomes to cell poles.
Probab=47.23 E-value=12 Score=20.83 Aligned_cols=15 Identities=27% Similarity=0.523 Sum_probs=12.6
Q ss_pred CCHHHHHHHcCCCCC
Q psy3080 91 MSVDDAYEALGLTRG 105 (118)
Q Consensus 91 mS~edAy~vLGL~~~ 105 (118)
||.+|+.+.||+++.
T Consensus 1 ~s~~e~a~~lgvs~~ 15 (49)
T cd04762 1 LTTKEAAELLGVSPS 15 (49)
T ss_pred CCHHHHHHHHCcCHH
Confidence 578999999999863
No 33
>PF09824 ArsR: ArsR transcriptional regulator; InterPro: IPR014517 Members of this family of archaeal proteins are conserved transcriptional regulators belonging to the ArsR family.
Probab=47.22 E-value=35 Score=26.46 Aligned_cols=60 Identities=22% Similarity=0.304 Sum_probs=52.4
Q ss_pred cCccccccchhhhHHHHHHHhccCCCCCCCCccHHHHHHHHHHHHHHHhhcCCCCCCHHHHHHHcCCCC
Q psy3080 36 VRYPQLEQEMFVNIFYLRHLCDTAKFPDWPINQPIMLLKDVLEAWKAEVERKPPEMSVDDAYEALGLTR 104 (118)
Q Consensus 36 i~YpeL~~elfC~~yYLr~lcd~~rfp~wpI~dpv~lL~~lL~~Wr~E~~kkp~~mS~edAy~vLGL~~ 104 (118)
.+|-.++...+|+.==|..+-.-.=.|+ ..|+++.+...+|+ +++.+|+.+--+.||++|
T Consensus 76 tsYs~vqaNFqcs~~DLsdii~i~f~~d-------eel~~~~e~i~~~v--~~Gn~Sl~~lsr~l~~sp 135 (160)
T PF09824_consen 76 TSYSKVQANFQCSMEDLSDIIYIAFMSD-------EELRDYVEKIEKEV--EAGNTSLSDLSRKLGISP 135 (160)
T ss_pred hhHhheeeeeEeeHHHHHHHHheeecCH-------HHHHHHHHHHHHHH--HcCCCcHHHHHHHhCCCH
Confidence 6788889999999888888887766667 89999999999999 459999999999999997
No 34
>PF06844 DUF1244: Protein of unknown function (DUF1244); InterPro: IPR009654 This family consists of several short bacterial proteins of around 100 residues in length. The function of this family is unknown.; PDB: 2O35_A 3FYB_B.
Probab=46.91 E-value=28 Score=23.54 Aligned_cols=32 Identities=28% Similarity=0.543 Sum_probs=19.9
Q ss_pred HHHHHHHHHHHHhhc-CCCCCCHHHHHHHc-CCC
Q psy3080 72 LLKDVLEAWKAEVER-KPPEMSVDDAYEAL-GLT 103 (118)
Q Consensus 72 lL~~lL~~Wr~E~~k-kp~~mS~edAy~vL-GL~ 103 (118)
|-+.+|..|-.+... ++-.||.++|.++. |.+
T Consensus 12 FCRNCLskWy~~aA~~~g~~~~~d~ARE~vYGMP 45 (68)
T PF06844_consen 12 FCRNCLSKWYREAAEERGIEMDKDEAREIVYGMP 45 (68)
T ss_dssp --HHHHHHHHHHHHHHCT----HHHHHHHHHSS-
T ss_pred HHHHHHHHHHHHHHHhcCCcCCHHHHHHHHhCCC
Confidence 456999999887665 67899999999864 543
No 35
>KOG2905|consensus
Probab=46.06 E-value=16 Score=30.17 Aligned_cols=33 Identities=21% Similarity=0.471 Sum_probs=25.2
Q ss_pred CCCCCCCcc-------HHHHHHHHHHHHHHHhhcCCCCCC
Q psy3080 60 KFPDWPINQ-------PIMLLKDVLEAWKAEVERKPPEMS 92 (118)
Q Consensus 60 rfp~wpI~d-------pv~lL~~lL~~Wr~E~~kkp~~mS 92 (118)
+++.|.|+| |+.+|++||...---..|.|..-|
T Consensus 197 k~~ywtlK~Lv~~t~QP~~fLKEiL~~icv~NkKg~~k~t 236 (254)
T KOG2905|consen 197 KYQYWTLKDLVEITKQPEAFLKEILKDICVLNKKGPYKNT 236 (254)
T ss_pred cCccccHHHHHHHhcCHHHHHHHHHHHHHHHhccCcccCc
Confidence 688898865 999999999998776555555443
No 36
>PF12833 HTH_18: Helix-turn-helix domain; PDB: 2K9S_A 3LSG_C 3OIO_A 1D5Y_B 3GBG_A 3OOU_A 1BL0_A 1XS9_A 3MN2_B 3MKL_B ....
Probab=46.01 E-value=31 Score=21.74 Aligned_cols=56 Identities=18% Similarity=0.331 Sum_probs=34.2
Q ss_pred cccchhhhHHHHHHHhcc-CCCCCCCCccHHHHHHHHHHHHHHHhhcC----CCCCCHHHHHHHcCCCCCC
Q psy3080 41 LEQEMFVNIFYLRHLCDT-AKFPDWPINQPIMLLKDVLEAWKAEVERK----PPEMSVDDAYEALGLTRGS 106 (118)
Q Consensus 41 L~~elfC~~yYLr~lcd~-~rfp~wpI~dpv~lL~~lL~~Wr~E~~kk----p~~mS~edAy~vLGL~~~a 106 (118)
|.+++-++..||.+++.. .|.+ ..+.+..+|-+..+. .+.+|+.|.-..+|.+..+
T Consensus 1 lA~~~~~s~~~l~~~f~~~~g~s----------~~~~~~~~R~~~a~~~L~~~~~~~i~~ia~~~Gf~~~~ 61 (81)
T PF12833_consen 1 LADELGMSERYLSRIFKKETGMS----------FKQYLRELRLQRAKELLRQNTDLSIAEIAEECGFSSQS 61 (81)
T ss_dssp HHHHCTS-HHHHHHHHHHHHSS-----------HHHHHHHHHHHHHHHHHHHHTT--HHHHHHHTT-SSHH
T ss_pred ChHHhCcCHHHHHHHHHHHHCcC----------HHHHHHHHHHHHHHHHHHHhhcccHHHHHHHcCCCCHH
Confidence 356677888899998888 4322 334455555443332 4789999999999999643
No 37
>KOG1150|consensus
Probab=41.89 E-value=18 Score=29.65 Aligned_cols=21 Identities=19% Similarity=0.482 Sum_probs=19.4
Q ss_pred HHHHHcCCCCCCCCChHHHHhhc
Q psy3080 95 DAYEALGLTRGSHHEENIVRKAY 117 (118)
Q Consensus 95 dAy~vLGL~~~a~~se~~IRrAY 117 (118)
.+|+||-++|+. ++++||+-|
T Consensus 54 NpfeVLqIdpev--~~edikkry 74 (250)
T KOG1150|consen 54 NPFEVLQIDPEV--TDEDIKKRY 74 (250)
T ss_pred ChHHHHhcCCCC--CHHHHHHHH
Confidence 589999999999 999999987
No 38
>COG3492 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=40.80 E-value=53 Score=23.69 Aligned_cols=36 Identities=22% Similarity=0.403 Sum_probs=27.3
Q ss_pred cHHHHHHHHHHHHHHHhhc-CCCCCCHHHHHHHc-CCC
Q psy3080 68 QPIMLLKDVLEAWKAEVER-KPPEMSVDDAYEAL-GLT 103 (118)
Q Consensus 68 dpv~lL~~lL~~Wr~E~~k-kp~~mS~edAy~vL-GL~ 103 (118)
+=..|-+.+|..|-.|... ++-.+|.|+|.++. |++
T Consensus 39 nLAgFCRNCLs~Wy~eaae~~gv~lskd~aRE~VyGMp 76 (104)
T COG3492 39 NLAGFCRNCLSNWYREAAEAQGVDLSKDQAREIVYGMP 76 (104)
T ss_pred HHHHHHHHHHHHHHHHHHhccCCCccHHHHHHHHhCCC
Confidence 3346778999999877655 47799999999864 554
No 39
>PF10376 Mei5: Double-strand recombination repair protein ; InterPro: IPR018468 Mei5 is one of a pair of meiosis-specific proteins which facilitate the loading of Dmc1 on to Rad51 on DNA at double-strand breaks during recombination. Recombination is carried out by a large protein complex based around the two RecA homologues, Rad51 and Dmc1 []. This complex may play both a catalytic and a structural role in the interaction between homologous chromosomes during meiosis. Mei5 is seen to contain a coiled-coli region.
Probab=39.15 E-value=56 Score=26.18 Aligned_cols=40 Identities=10% Similarity=0.392 Sum_probs=32.4
Q ss_pred CccHHHHHHHHHHHHHHHhhc------------CCCCCCHHHHHHHcCCCCC
Q psy3080 66 INQPIMLLKDVLEAWKAEVER------------KPPEMSVDDAYEALGLTRG 105 (118)
Q Consensus 66 I~dpv~lL~~lL~~Wr~E~~k------------kp~~mS~edAy~vLGL~~~ 105 (118)
+++=...|+++...||.-... .+..||..+=++.|||++.
T Consensus 165 ~Kn~~~~Lq~lI~Kwr~~~q~~l~eL~~~~~~~e~~~~TM~eL~~~l~ID~~ 216 (221)
T PF10376_consen 165 SKNDLEQLQSLIKKWRSASQEALYELQSEMSEEEGEKFTMGELIKRLGIDYD 216 (221)
T ss_pred hhccHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccHHHHHHHhCCCcc
Confidence 455567899999999986554 3679999999999999875
No 40
>PF09862 DUF2089: Protein of unknown function (DUF2089); InterPro: IPR018658 This family consists of various hypothetical prokaryotic proteins.
Probab=38.87 E-value=65 Score=23.45 Aligned_cols=53 Identities=21% Similarity=0.357 Sum_probs=39.7
Q ss_pred ccCccccccchhhhHHHHHHHhccCCCCCCCCccHHHHHHHHHHHHHHHhhcCCCCCCHHHHHHHc
Q psy3080 35 AVRYPQLEQEMFVNIFYLRHLCDTAKFPDWPINQPIMLLKDVLEAWKAEVERKPPEMSVDDAYEAL 100 (118)
Q Consensus 35 ~i~YpeL~~elfC~~yYLr~lcd~~rfp~wpI~dpv~lL~~lL~~Wr~E~~kkp~~mS~edAy~vL 100 (118)
.|+||=..+ -|-.|...+|++.-+-.+....-+++|++.. .+.+|.++|.++|
T Consensus 60 giSYPTvR~-------rLd~ii~~lg~~~~~~~~~~~~~~~IL~~L~------~GeIs~eeA~~~L 112 (113)
T PF09862_consen 60 GISYPTVRN-------RLDKIIEKLGYEEDEEEEEEDERKEILDKLE------KGEISVEEALEIL 112 (113)
T ss_pred CCCcHHHHH-------HHHHHHHHhCCCCCcccccchhHHHHHHHHH------cCCCCHHHHHHHh
Confidence 467776654 4678888999954455666677778888765 4789999999987
No 41
>PF08281 Sigma70_r4_2: Sigma-70, region 4; InterPro: IPR013249 The bacterial core RNA polymerase complex, which consists of five subunits, is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme []. RNA polymerase recruits alternative sigma factors as a means of switching on specific regulons. Most bacteria express a multiplicity of sigma factors. Two of these factors, sigma-70 (gene rpoD), generally known as the major or primary sigma factor, and sigma-54 (gene rpoN or ntrA) direct the transcription of a wide variety of genes. The other sigma factors, known as alternative sigma factors, are required for the transcription of specific subsets of genes. With regard to sequence similarity, sigma factors can be grouped into two classes, the sigma-54 and sigma-70 families. Sequence alignments of the sigma70 family members reveal four conserved regions that can be further divided into subregions eg. sub-region 2.2, which may be involved in the binding of the sigma factor to the core RNA polymerase; and sub-region 4.2, which seems to harbor a DNA-binding 'helix-turn-helix' motif involved in binding the conserved -35 region of promoters recognised by the major sigma factors [, ]. Region 4 of sigma-70 like sigma-factors are involved in binding to the -35 promoter element via a helix-turn-helix motif [].; GO: 0003677 DNA binding, 0003700 sequence-specific DNA binding transcription factor activity, 0016987 sigma factor activity, 0006352 transcription initiation, DNA-dependent, 0006355 regulation of transcription, DNA-dependent; PDB: 2LFW_A 1OR7_B 2H27_D 2O8X_B.
Probab=38.47 E-value=10 Score=22.56 Aligned_cols=26 Identities=23% Similarity=0.340 Sum_probs=15.9
Q ss_pred CCCCCHHHHHHHcCCCCCCCCChHHHHh
Q psy3080 88 PPEMSVDDAYEALGLTRGSHHEENIVRK 115 (118)
Q Consensus 88 p~~mS~edAy~vLGL~~~a~~se~~IRr 115 (118)
-.+||.+|.-+.||+++++ =..-++|
T Consensus 24 ~~g~s~~eIa~~l~~s~~~--v~~~l~r 49 (54)
T PF08281_consen 24 FQGMSYAEIAEILGISEST--VKRRLRR 49 (54)
T ss_dssp TS---HHHHHHHCTS-HHH--HHHHHHH
T ss_pred HHCcCHHHHHHHHCcCHHH--HHHHHHH
Confidence 5689999999999998766 3444443
No 42
>cd04761 HTH_MerR-SF Helix-Turn-Helix DNA binding domain of transcription regulators from the MerR superfamily. Helix-turn-helix (HTH) transcription regulator MerR superfamily, N-terminal domain. The MerR family transcription regulators have been shown to mediate responses to stress including exposure to heavy metals, drugs, or oxygen radicals in eubacterial and some archaeal species. They regulate transcription of multidrug/metal ion transporter genes and oxidative stress regulons by reconfiguring the spacer between the -35 and -10 promoter elements. A typical MerR regulator is comprised of two distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their N-terminal domains are homologous and contain a DNA-binding winged HTH motif, while the C-terminal domains are often dissimilar and bind specific coactivator molecules such as metal ions, drugs, and organic substrates.
Probab=37.63 E-value=20 Score=20.48 Aligned_cols=14 Identities=7% Similarity=0.275 Sum_probs=11.9
Q ss_pred CCHHHHHHHcCCCC
Q psy3080 91 MSVDDAYEALGLTR 104 (118)
Q Consensus 91 mS~edAy~vLGL~~ 104 (118)
||..|+.+.||+++
T Consensus 1 ~~~~e~a~~~gv~~ 14 (49)
T cd04761 1 YTIGELAKLTGVSP 14 (49)
T ss_pred CcHHHHHHHHCcCH
Confidence 67889999999886
No 43
>PRK12527 RNA polymerase sigma factor; Reviewed
Probab=36.14 E-value=48 Score=23.36 Aligned_cols=27 Identities=26% Similarity=0.292 Sum_probs=21.5
Q ss_pred CCCCCHHHHHHHcCCCCCCCCChHHHHhh
Q psy3080 88 PPEMSVDDAYEALGLTRGSHHEENIVRKA 116 (118)
Q Consensus 88 p~~mS~edAy~vLGL~~~a~~se~~IRrA 116 (118)
-.+||.+|.-+.||++.++ -..-+.||
T Consensus 119 ~~~~s~~eIA~~lgis~~t--v~~~l~ra 145 (159)
T PRK12527 119 LEGLSHQQIAEHLGISRSL--VEKHIVNA 145 (159)
T ss_pred HcCCCHHHHHHHhCCCHHH--HHHHHHHH
Confidence 3488999999999999988 55555554
No 44
>PRK09685 DNA-binding transcriptional activator FeaR; Provisional
Probab=35.21 E-value=1.4e+02 Score=23.20 Aligned_cols=67 Identities=10% Similarity=0.142 Sum_probs=46.9
Q ss_pred CccCccccccchhhhHHHHHHHhccCCCCCCCCccHHHHHHHH-HHHHHHHhhcCCCCCCHHHHHHHcCCCCCC
Q psy3080 34 PAVRYPQLEQEMFVNIFYLRHLCDTAKFPDWPINQPIMLLKDV-LEAWKAEVERKPPEMSVDDAYEALGLTRGS 106 (118)
Q Consensus 34 p~i~YpeL~~elfC~~yYLr~lcd~~rfp~wpI~dpv~lL~~l-L~~Wr~E~~kkp~~mS~edAy~vLGL~~~a 106 (118)
|.++=.++..++..+..||.++..+.|.+ |.+.+.+. |+.=+..+.......|+.|-..-+|.+..+
T Consensus 213 ~~ls~~~lA~~~giS~r~L~r~Fk~~G~T------~~~yi~~~RL~~A~~lL~~~~~~~sI~eIA~~~GF~d~s 280 (302)
T PRK09685 213 EILRPEWIAGELGISVRSLYRLFAEQGLV------VAQYIRNRRLDRCADDLRPAADDEKITSIAYKWGFSDSS 280 (302)
T ss_pred CCCCHHHHHHHHCCCHHHHHHHHHHcCCC------HHHHHHHHHHHHHHHHhhhhccCCCHHHHHHHhCCCCHH
Confidence 34566889999999999999999876542 23444443 444455552123578999999999999755
No 45
>PF00690 Cation_ATPase_N: Cation transporter/ATPase, N-terminus; InterPro: IPR004014 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include: F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. P-ATPases (sometime known as E1-E2 ATPases) (3.6.3.- from EC) are found in bacteria and in a number of eukaryotic plasma membranes and organelles []. P-ATPases function to transport a variety of different compounds, including ions and phospholipids, across a membrane using ATP hydrolysis for energy. There are many different classes of P-ATPases, each of which transports a specific type of ion: H+, Na+, K+, Mg2+, Ca2+, Ag+ and Ag2+, Zn2+, Co2+, Pb2+, Ni2+, Cd2+, Cu+ and Cu2+. P-ATPases can be composed of one or two polypeptides, and can usually assume two main conformations called E1 and E2. This entry represents the conserved N-terminal region found in several classes of cation-transporting P-type ATPases, including those that transport H+ (3.6.3.6 from EC), Na+ (3.6.3.7 from EC), Ca2+ (3.6.3.8 from EC), Na+/K+ (3.6.3.9 from EC), and H+/K+ (3.6.3.10 from EC). In the H+/K+- and Na+/K+-exchange P-ATPases, this domain is found in the catalytic alpha chain. In gastric H+/K+-ATPases, this domain undergoes reversible sequential phosphorylation inducing conformational changes that may be important for regulating the function of these ATPases [, ]. More information about this protein can be found at Protein of the Month: ATP Synthases [].; PDB: 3KDP_C 3N2F_A 3B8E_A 3N23_A 2XZB_A 1MHS_B 3A3Y_A 2ZXE_A 3B8C_A 3B9B_A ....
Probab=34.86 E-value=48 Score=20.79 Aligned_cols=26 Identities=15% Similarity=0.277 Sum_probs=19.0
Q ss_pred CCCHHHHHHHcCCCCCCCCChHHHHh
Q psy3080 90 EMSVDDAYEALGLTRGSHHEENIVRK 115 (118)
Q Consensus 90 ~mS~edAy~vLGL~~~a~~se~~IRr 115 (118)
.+|.++.++.|+.+...+.+++++.+
T Consensus 3 ~~~~~~v~~~l~t~~~~GLs~~ev~~ 28 (69)
T PF00690_consen 3 QLSVEEVLKRLNTSSSQGLSSEEVEE 28 (69)
T ss_dssp TSSHHHHHHHHTTBTSSBBTHHHHHH
T ss_pred CCCHHHHHHHHCcCCCCCCCHHHHHH
Confidence 56888888888877777666666543
No 46
>smart00342 HTH_ARAC helix_turn_helix, arabinose operon control protein.
Probab=34.42 E-value=78 Score=19.01 Aligned_cols=58 Identities=19% Similarity=0.237 Sum_probs=36.3
Q ss_pred cccccchhhhHHHHHHHhccCCCCCCCCccHHHHHHH-HHHHHHHHhhcCCCCCCHHHHHHHcCCC
Q psy3080 39 PQLEQEMFVNIFYLRHLCDTAKFPDWPINQPIMLLKD-VLEAWKAEVERKPPEMSVDDAYEALGLT 103 (118)
Q Consensus 39 peL~~elfC~~yYLr~lcd~~rfp~wpI~dpv~lL~~-lL~~Wr~E~~kkp~~mS~edAy~vLGL~ 103 (118)
.++..++.++.-+|.++.....-. .|...++. -++.-...+..+ .+|++|....+|+.
T Consensus 5 ~~la~~~~~s~~~l~~~f~~~~~~-----s~~~~~~~~r~~~a~~~l~~~--~~~~~~ia~~~g~~ 63 (84)
T smart00342 5 EDLAEALGMSPRHLQRLFKKETGT-----TPKQYLRDRRLERARRLLRDT--DLSVTEIALRVGFS 63 (84)
T ss_pred HHHHHHhCCCHHHHHHHHHHHhCc-----CHHHHHHHHHHHHHHHHHHcC--CCCHHHHHHHhCCC
Confidence 567888999999998887764211 12233222 122233333333 89999999999994
No 47
>TIGR01268 Phe4hydrox_tetr phenylalanine-4-hydroxylase, tetrameric form. The member of this family from Drosophila has been described as having both phenylalanine-4-hydroxylase and tryptophan 5-monoxygenase activity (PubMed:1371286). However, a Drosophila member of the tryptophan 5-monoxygenase clade has subsequently been discovered.
Probab=33.80 E-value=85 Score=27.80 Aligned_cols=66 Identities=24% Similarity=0.241 Sum_probs=39.5
Q ss_pred cccCccCccccccchhhhHH--------------HHHH---HhccCCCCCCCCccHHHHHHHHHHHHHHHhhc----CCC
Q psy3080 31 CGIPAVRYPQLEQEMFVNIF--------------YLRH---LCDTAKFPDWPINQPIMLLKDVLEAWKAEVER----KPP 89 (118)
Q Consensus 31 ~pip~i~YpeL~~elfC~~y--------------YLr~---lcd~~rfp~wpI~dpv~lL~~lL~~Wr~E~~k----kp~ 89 (118)
.|||.|.|.+-|+++|=..| ||.. |.+..|++.=-| .+|+++=+..++.-.. -+|
T Consensus 156 ~~ip~v~YT~~e~~~W~~l~~~~~~l~~~~Ac~eyl~~l~~L~~~~g~~~d~I----Pql~dvs~~L~~~TGw~~~pV~G 231 (436)
T TIGR01268 156 QPIPRVEYTDEEIATWRTVFNNLTVLYPTHACQEYNHIFPLLQQNCGFREDNI----PQLEDVSQFLQDCTGFTLRPVAG 231 (436)
T ss_pred CCCCccccCHHHHHHHHHHHHHHHHHhhccccHHHHHHHHHHHHhcCCCccCC----CCHHHHHHHHHhccCCEEEecCC
Confidence 49999999999999994443 6544 466667766333 4455554444443222 255
Q ss_pred CCCHHHHHHHc
Q psy3080 90 EMSVDDAYEAL 100 (118)
Q Consensus 90 ~mS~edAy~vL 100 (118)
-+|..+=+..|
T Consensus 232 ll~~~~F~~~L 242 (436)
T TIGR01268 232 LLSSRDFLAGL 242 (436)
T ss_pred cCCHHHHHHHH
Confidence 55555544444
No 48
>cd06459 M3B_Oligoendopeptidase_F Peptidase family M3B Oligopeptidase F (PepF; Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and includes oligoendopeptidase F from Lactococcus lactis. This enzyme hydrolyzes peptides containing between 7 and 17 amino acids with fairly broad specificity. The PepF gene is duplicated in L. lactis on the plasmid that bears it, while a shortened second copy is found in Bacillus subtilis. Most bacterial PepFs are cytoplasmic endopeptidases; however, the PepF Bacillus amyloliquefaciens oligopeptidase is a secreted protein and may facilitate the process of sporulation. Specifically, the yjbG gene encoding the homolog of the PepF1 and PepF2 oligoendopeptidases of Lactococcus lactis has been identified in Bacillus subtilis as an inhibitor of sporulation initiation when over expressed from a multicopy plasmid.
Probab=33.03 E-value=75 Score=26.07 Aligned_cols=38 Identities=13% Similarity=0.153 Sum_probs=29.4
Q ss_pred HHHHHHHHHhhcCCCCCCHHHHHHHcCCCCCCCCChHHHHhh
Q psy3080 75 DVLEAWKAEVERKPPEMSVDDAYEALGLTRGSHHEENIVRKA 116 (118)
Q Consensus 75 ~lL~~Wr~E~~kkp~~mS~edAy~vLGL~~~a~~se~~IRrA 116 (118)
...+.|+ ++=+.+++++..|.++.+|++. . +++.++++
T Consensus 385 ~~~~~y~-~~L~~Ggs~~~~ell~~~g~d~-~--~~~~~~~~ 422 (427)
T cd06459 385 GAVEKYL-ELLKAGGSKSPLELLKKAGVDL-T--SPDFWEEA 422 (427)
T ss_pred cHHHHHH-HHHHccCCCCHHHHHHHcCcCC-C--ChHHHHHH
Confidence 4567776 5556788899999999999997 5 66766654
No 49
>PRK13500 transcriptional activator RhaR; Provisional
Probab=32.88 E-value=1.2e+02 Score=24.31 Aligned_cols=62 Identities=10% Similarity=0.084 Sum_probs=46.4
Q ss_pred ccCccccccchhhhHHHHHHHhccC-CCCCCCCccHHHHHHHHHHHHHHHhhc---CCCCCCHHHHHHHcCCCCCC
Q psy3080 35 AVRYPQLEQEMFVNIFYLRHLCDTA-KFPDWPINQPIMLLKDVLEAWKAEVER---KPPEMSVDDAYEALGLTRGS 106 (118)
Q Consensus 35 ~i~YpeL~~elfC~~yYLr~lcd~~-rfp~wpI~dpv~lL~~lL~~Wr~E~~k---kp~~mS~edAy~vLGL~~~a 106 (118)
+++=.+|.+.+.++..||.+++.+. |.+- .+.+...|-+-.+ ..+.+|+.|-...+|.+..+
T Consensus 222 ~isl~~lA~~~~iS~~~L~r~FK~~tG~T~----------~~yi~~~RL~~A~~LL~~t~~sI~eIA~~~GF~d~s 287 (312)
T PRK13500 222 PFALDKFCDEASCSERVLRQQFRQQTGMTI----------NQYLRQVRVCHAQYLLQHSRLLISDISTECGFEDSN 287 (312)
T ss_pred CCCHHHHHHHHCcCHHHHHHHHHHHHCcCH----------HHHHHHHHHHHHHHHHHcCCCCHHHHHHHhCCCCHH
Confidence 4677899999999999999999874 5442 3455555555444 36689999999999999754
No 50
>cd00870 PI3Ka_III Phosphoinositide 3-kinase (PI3K) class III, accessory domain (PIK domain); PIK domain is conserved in all PI3 and PI4-kinases. Its role is unclear but it has been suggested to be involved in substrate presentation. In general, PI3Ks class III phosphorylate phosphoinositol (PtdIns) only. The prototypical PI3K class III, yeast Vps34, is involved in trafficking proteins from Golgi to the vacuole.
Probab=32.78 E-value=1.2e+02 Score=22.97 Aligned_cols=64 Identities=22% Similarity=0.384 Sum_probs=41.9
Q ss_pred ccccchhhhHHHHHHHhccC----CCCCCCCccHHHHHHHHHHHHHHHhhcCCCCCCHHHHHHHcCCCCCCCCChHHHHh
Q psy3080 40 QLEQEMFVNIFYLRHLCDTA----KFPDWPINQPIMLLKDVLEAWKAEVERKPPEMSVDDAYEALGLTRGSHHEENIVRK 115 (118)
Q Consensus 40 eL~~elfC~~yYLr~lcd~~----rfp~wpI~dpv~lL~~lL~~Wr~E~~kkp~~mS~edAy~vLGL~~~a~~se~~IRr 115 (118)
+-.+-||=+.||+.+.=+.+ +.=+|.=.+-|....++|..|. .++.++|.+.||-.- .+..||+
T Consensus 29 ~ek~llW~~R~~l~~~p~aL~~~L~sv~W~~~~e~~e~~~lL~~W~--------~i~~~~aLeLL~~~f----~~~~VR~ 96 (166)
T cd00870 29 EEKDLIWKFRFYLTNNKKALTKFLKSVNWSDEQEVKQALELMPKWA--------KIDIEDALELLSPYF----TNPVVRK 96 (166)
T ss_pred HHHHHHHHhHHHHhhCcHHHHHHhhhCCCCCHHHHHHHHHHHhcCC--------CCCHHHHHHHcCccC----CCHHHHH
Confidence 33445777778875322221 2237766666778888999994 578999999998543 3455654
No 51
>cd03347 eu_PheOH Eukaryotic phenylalanine-4-hydroxylase (eu_PheOH); a member of the biopterin-dependent aromatic amino acid hydroxylase family of non-heme, iron(II)-dependent enzymes that also includes prokaryotic phenylalanine-4-hydroxylase (pro_PheOH), eukaryotic tyrosine hydroxylase (TyrOH) and eukaryotic tryptophan hydroxylase (TrpOH). PheOH catalyzes the first and rate-limiting step in the metabolism of the amino acid L-phenylalanine (L-Phe), the hydroxylation of L-Phe to L-tyrosine (L-Tyr). It uses (6R)-L-erythro-5,6,7,8-tetrahydrobiopterin (BH4) as the physiological electron donor. The catalytic activity of the tetrameric enzyme is tightly regulated by the binding of L-Phe and BH4 as well as by phosphorylation. Mutations in the human enzyme are linked to a severe variant of phenylketonuria.
Probab=32.67 E-value=99 Score=26.24 Aligned_cols=67 Identities=24% Similarity=0.238 Sum_probs=39.4
Q ss_pred cccCccCccccccchhhhH--------------HHHH---HHhccCCCCCCCCccHHHHHHHHHHHHHHHhhc----CCC
Q psy3080 31 CGIPAVRYPQLEQEMFVNI--------------FYLR---HLCDTAKFPDWPINQPIMLLKDVLEAWKAEVER----KPP 89 (118)
Q Consensus 31 ~pip~i~YpeL~~elfC~~--------------yYLr---~lcd~~rfp~wpI~dpv~lL~~lL~~Wr~E~~k----kp~ 89 (118)
.|||.|.|.+-|+++|=.. -||. .|.+..|++.=-|.+ |+++=+..++.-.. -++
T Consensus 54 ~~ip~v~YT~eE~~~W~~l~~r~~~l~~~~Ac~eyl~~l~~L~~~~gl~~d~IPq----l~dvn~~L~~~TGw~~~pV~G 129 (306)
T cd03347 54 QPIPRVEYTEEEKKTWGTVFRELKSLYPTHACYEYNHVFPLLEKNCGFSEDNIPQ----LEDVSNFLQTCTGFRLRPVAG 129 (306)
T ss_pred CCCCcCcCCHHHHHHHHHHHHHHHHHhccccCHHHHHHHHHHHHhcCCCcCCCCC----HHHHHHHHHhccCCEEEecCc
Confidence 5999999999999999433 3544 456666777644433 44443333332211 255
Q ss_pred CCCHHHHHHHcC
Q psy3080 90 EMSVDDAYEALG 101 (118)
Q Consensus 90 ~mS~edAy~vLG 101 (118)
-+|..+=+..|.
T Consensus 130 li~~~~Ff~~LA 141 (306)
T cd03347 130 LLSSRDFLAGLA 141 (306)
T ss_pred cCCHHHHHHHHh
Confidence 556555555443
No 52
>PF01893 UPF0058: Uncharacterised protein family UPF0058; InterPro: IPR002753 These archaebacterial proteins have no known function. Members of the family are about 90-105 amino acid residues long.; PDB: 2GF4_B.
Probab=32.12 E-value=87 Score=21.95 Aligned_cols=47 Identities=21% Similarity=0.282 Sum_probs=29.2
Q ss_pred HHHHHHHHHHHHHHhhcCCCCCCHHHHHHHcCCCCCCCC-ChHHHHhh
Q psy3080 70 IMLLKDVLEAWKAEVERKPPEMSVDDAYEALGLTRGSHH-EENIVRKA 116 (118)
Q Consensus 70 v~lL~~lL~~Wr~E~~kkp~~mS~edAy~vLGL~~~a~~-se~~IRrA 116 (118)
.-.|-.+|...+..++.....-..=+.|+.||++|..-+ |.++=|.|
T Consensus 6 Li~LH~lL~~v~~~~e~~~~~~~~~~~Y~~L~i~P~hIHksK~eHK~A 53 (89)
T PF01893_consen 6 LIHLHQLLVEVKKYFEEENNDEEDFEEYEELGISPHHIHKSKTEHKAA 53 (89)
T ss_dssp HHHHHHHHHHHHHHHHTTT--TTTTHHHHHH---TT-TTS-HHHHHHH
T ss_pred HHHHHHHHHHHHHHHHhccCCccchhHHHHcCCCcchhcCCHHHHHHH
Confidence 356788898999988864444445689999999997733 77666655
No 53
>KOG0113|consensus
Probab=31.38 E-value=30 Score=29.64 Aligned_cols=96 Identities=25% Similarity=0.320 Sum_probs=55.8
Q ss_pred chhhccCccccccccccCccCcccc----ccchhhhHH-----HHHHHhccCCCCCCCCccHHHHHHHHHHHHHHHhhcC
Q psy3080 17 TPRLRGNNRAVYQFCGIPAVRYPQL----EQEMFVNIF-----YLRHLCDTAKFPDWPINQPIMLLKDVLEAWKAEVERK 87 (118)
Q Consensus 17 ~~rL~~~~~a~y~y~pip~i~YpeL----~~elfC~~y-----YLr~lcd~~rfp~wpI~dpv~lL~~lL~~Wr~E~~kk 87 (118)
+..|..|..|| |.|-|||.|.-= -++=-|+.| ||+.+.+...-|+|. ||+...+=+++-+.+-..+
T Consensus 2 ~~~lp~nllaL--F~pRpPl~y~pP~d~~p~kr~~~~~tGvA~~~~~~~~~~d~p~~~---p~~t~~e~~er~~~~k~e~ 76 (335)
T KOG0113|consen 2 TQFLPPNLLAL--FAPRPPLPYLPPTDKLPHKRKTNPYTGVAQYLSTFEDPKDAPPKF---PVETPEEPLERGRREKTEK 76 (335)
T ss_pred CccCCccHHHh--cCCCCCcccCCccccChhhccCCCcccHHHHHHhhcCcccCCCcC---cccchhhHHHhhhhhhhhh
Confidence 35666788888 568889988544 223334443 899998887655542 3333333333322221111
Q ss_pred -------------CC--CCCHHHHHHHcCCCCCCC-CChHHHHhhc
Q psy3080 88 -------------PP--EMSVDDAYEALGLTRGSH-HEENIVRKAY 117 (118)
Q Consensus 88 -------------p~--~mS~edAy~vLGL~~~a~-~se~~IRrAY 117 (118)
|. .=-..|.|.+|-|..=.. -+|++||+-+
T Consensus 77 ~~~~~~~~l~~wdP~~dp~a~gDPy~TLFv~RLnydT~EskLrreF 122 (335)
T KOG0113|consen 77 IPHKLERRLKLWDPNNDPNAIGDPYKTLFVARLNYDTSESKLRREF 122 (335)
T ss_pred hHHHHHHHHHhcCCCCCCcccCCccceeeeeeccccccHHHHHHHH
Confidence 11 112569999999776221 1999998753
No 54
>COG2975 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=30.50 E-value=74 Score=21.19 Aligned_cols=32 Identities=25% Similarity=0.510 Sum_probs=26.0
Q ss_pred HHHhccCCCCCCCCccHHHHHHHHHHHHHHHh
Q psy3080 53 RHLCDTAKFPDWPINQPIMLLKDVLEAWKAEV 84 (118)
Q Consensus 53 r~lcd~~rfp~wpI~dpv~lL~~lL~~Wr~E~ 84 (118)
+-+|...+|.+=|=+--...|.+++.+|-.|.
T Consensus 32 ~wi~eL~~FdDDp~rs~EkiLEAi~~~WidE~ 63 (64)
T COG2975 32 QWICELEEFDDDPQRSNEKILEAIQLAWIDEA 63 (64)
T ss_pred HHHHhCcccCCChhhccHHHHHHHHHHHhhhc
Confidence 35788899998666666789999999999885
No 55
>PRK13503 transcriptional activator RhaS; Provisional
Probab=30.34 E-value=1e+02 Score=23.55 Aligned_cols=73 Identities=15% Similarity=0.225 Sum_probs=50.2
Q ss_pred CccCccccccchhhhHHHHHHHhccC-CCCCCCCccHHHHHHHHHHHHHHHhhc---CCCCCCHHHHHHHcCCCCCC---
Q psy3080 34 PAVRYPQLEQEMFVNIFYLRHLCDTA-KFPDWPINQPIMLLKDVLEAWKAEVER---KPPEMSVDDAYEALGLTRGS--- 106 (118)
Q Consensus 34 p~i~YpeL~~elfC~~yYLr~lcd~~-rfp~wpI~dpv~lL~~lL~~Wr~E~~k---kp~~mS~edAy~vLGL~~~a--- 106 (118)
-+++=.+++..+..+..||.+++.+. |.+ | .+.+..-|-+-.+ ..+++|+.|.-..+|.+..+
T Consensus 186 ~~~tl~~lA~~~~lS~~~l~r~Fk~~~G~S------~----~~yi~~~Rl~~A~~LL~~~~~sI~eIA~~~GF~~~s~F~ 255 (278)
T PRK13503 186 EEVNWEALADQFSLSLRTLHRQLKQQTGLT------P----QRYLNRLRLLKARHLLRHSDASVTDIAYRCGFGDSNHFS 255 (278)
T ss_pred CCCCHHHHHHHHCCCHHHHHHHHHHHhCcC------H----HHHHHHHHHHHHHHHHHcCCCCHHHHHHHhCCCCHHHHH
Confidence 46778899999999999999999764 322 2 2333333333333 35778999999999999754
Q ss_pred -------CCChHHHHhh
Q psy3080 107 -------HHEENIVRKA 116 (118)
Q Consensus 107 -------~~se~~IRrA 116 (118)
+.+..+.||+
T Consensus 256 r~FKk~~G~TP~~yRk~ 272 (278)
T PRK13503 256 TLFRREFSWSPRDIRQG 272 (278)
T ss_pred HHHHHHHCcCHHHHHHh
Confidence 2366666653
No 56
>PRK12518 RNA polymerase sigma factor; Provisional
Probab=29.96 E-value=34 Score=24.45 Aligned_cols=26 Identities=12% Similarity=0.160 Sum_probs=20.1
Q ss_pred CCCCCHHHHHHHcCCCCCCCCChHHHHh
Q psy3080 88 PPEMSVDDAYEALGLTRGSHHEENIVRK 115 (118)
Q Consensus 88 p~~mS~edAy~vLGL~~~a~~se~~IRr 115 (118)
-.++|.+|.-+.||+++++ -...+.|
T Consensus 134 ~~g~s~~eIA~~lg~s~~t--v~~~l~R 159 (175)
T PRK12518 134 LEDLPQKEIAEILNIPVGT--VKSRLFY 159 (175)
T ss_pred hcCCCHHHHHHHHCCCHHH--HHHHHHH
Confidence 4588999999999999877 4444444
No 57
>PRK12529 RNA polymerase sigma factor; Provisional
Probab=29.86 E-value=26 Score=25.62 Aligned_cols=26 Identities=19% Similarity=0.257 Sum_probs=21.3
Q ss_pred CCCCHHHHHHHcCCCCCCCCChHHHHhh
Q psy3080 89 PEMSVDDAYEALGLTRGSHHEENIVRKA 116 (118)
Q Consensus 89 ~~mS~edAy~vLGL~~~a~~se~~IRrA 116 (118)
.++|.+|.-++||+++++ -.+-+.||
T Consensus 142 ~g~s~~EIA~~lgis~~t--Vk~~l~rA 167 (178)
T PRK12529 142 DGMKQKDIAQALDIALPT--VKKYIHQA 167 (178)
T ss_pred cCCCHHHHHHHHCCCHHH--HHHHHHHH
Confidence 478999999999999988 56666655
No 58
>PRK09642 RNA polymerase sigma factor SigW; Reviewed
Probab=29.75 E-value=29 Score=24.47 Aligned_cols=26 Identities=12% Similarity=-0.056 Sum_probs=19.7
Q ss_pred CCCCCHHHHHHHcCCCCCCCCChHHHHh
Q psy3080 88 PPEMSVDDAYEALGLTRGSHHEENIVRK 115 (118)
Q Consensus 88 p~~mS~edAy~vLGL~~~a~~se~~IRr 115 (118)
-.+||.+|.-++||+++++ -..-+.|
T Consensus 120 ~~g~s~~EIA~~lgis~~t--V~~~l~R 145 (160)
T PRK09642 120 LEEKSYQEIALQEKIEVKT--VEMKLYR 145 (160)
T ss_pred HhCCCHHHHHHHHCCCHHH--HHHHHHH
Confidence 3488999999999999887 4444443
No 59
>TIGR02980 SigBFG RNA polymerase sigma-70 factor, sigma-B/F/G subfamily. This group of similar sigma-70 factors includes clades found in Bacilli (including the sporulation factors SigF:TIGR02885 and SigG:TIGR02850 as well as SigB:TIGR02941), and the high GC gram positive bacteria (Actinobacteria) where a variable number of them are found depending on the lineage.
Probab=29.69 E-value=92 Score=23.52 Aligned_cols=37 Identities=24% Similarity=0.585 Sum_probs=25.9
Q ss_pred CCCccHHHH------HHHHHHHHHHHhhcCCCCCCHHHHHHHcCCC
Q psy3080 64 WPINQPIML------LKDVLEAWKAEVERKPPEMSVDDAYEALGLT 103 (118)
Q Consensus 64 wpI~dpv~l------L~~lL~~Wr~E~~kkp~~mS~edAy~vLGL~ 103 (118)
|.|+-|..+ ++.+.+.++.+..+.| |.+|..+.||++
T Consensus 77 ~~~ri~~~~~~~~~~~~~~~~~l~~~~~~~p---~~~ela~~l~~~ 119 (227)
T TIGR02980 77 WAVRVPRRLKELGLKINKATEELTQRLGRSP---TIAEIAEELGVS 119 (227)
T ss_pred CceecCHHHHHHHHHHHHHHHHHHHHHCCCC---CHHHHHHHhCCC
Confidence 888888764 6667777777765444 567777777764
No 60
>PF06244 DUF1014: Protein of unknown function (DUF1014); InterPro: IPR010422 This family consists of several hypothetical eukaryotic proteins of unknown function.
Probab=29.36 E-value=54 Score=24.17 Aligned_cols=28 Identities=39% Similarity=0.603 Sum_probs=19.9
Q ss_pred CCCHHHHHHHcCCCCCC--CCChHHHHhhc
Q psy3080 90 EMSVDDAYEALGLTRGS--HHEENIVRKAY 117 (118)
Q Consensus 90 ~mS~edAy~vLGL~~~a--~~se~~IRrAY 117 (118)
.-++|+|..+|+|+.+. .+-+--+|.||
T Consensus 51 A~~Id~Al~aLs~~~~~~drHPErR~KAAy 80 (122)
T PF06244_consen 51 ASGIDDALDALSLTDKPIDRHPERRMKAAY 80 (122)
T ss_pred hccHHHHHHHcccCCCCCCCCcchhHHHHH
Confidence 44799999999999832 23555566665
No 61
>PRK09645 RNA polymerase sigma factor SigL; Provisional
Probab=29.30 E-value=35 Score=24.35 Aligned_cols=27 Identities=19% Similarity=0.202 Sum_probs=20.7
Q ss_pred CCCCCHHHHHHHcCCCCCCCCChHHHHhh
Q psy3080 88 PPEMSVDDAYEALGLTRGSHHEENIVRKA 116 (118)
Q Consensus 88 p~~mS~edAy~vLGL~~~a~~se~~IRrA 116 (118)
-.++|.+|.-++||+++++ -...+.||
T Consensus 132 ~~g~s~~EIA~~lgis~~t--V~~~l~ra 158 (173)
T PRK09645 132 YRGWSTAQIAADLGIPEGT--VKSRLHYA 158 (173)
T ss_pred HcCCCHHHHHHHHCcCHHH--HHHHHHHH
Confidence 4588999999999999887 44444443
No 62
>PRK09978 DNA-binding transcriptional regulator GadX; Provisional
Probab=28.90 E-value=1.6e+02 Score=24.25 Aligned_cols=62 Identities=10% Similarity=0.109 Sum_probs=45.4
Q ss_pred ccCccccccchhhhHHHHHHHhccCCCCCCCCccHHHHHHHHHHHHHHHhhcC---CCCCCHHHHHHHcCCCCCC
Q psy3080 35 AVRYPQLEQEMFVNIFYLRHLCDTAKFPDWPINQPIMLLKDVLEAWKAEVERK---PPEMSVDDAYEALGLTRGS 106 (118)
Q Consensus 35 ~i~YpeL~~elfC~~yYLr~lcd~~rfp~wpI~dpv~lL~~lL~~Wr~E~~kk---p~~mS~edAy~vLGL~~~a 106 (118)
+++=.++.+++-.+..||.++..+.|. - ..+.+..+|-+-.++ .+.+|+++....+|.+..+
T Consensus 158 ~lsl~~lA~~~g~S~~~L~R~Fk~~G~------S----~~~yl~~~Rl~~A~~LL~~t~~sI~eIA~~~GF~s~S 222 (274)
T PRK09978 158 EWTLARIASELLMSPSLLKKKLREEET------S----YSQLLTECRMQRALQLIVIHGFSIKRVAVSCGYHSVS 222 (274)
T ss_pred CCCHHHHHHHHCcCHHHHHHHHHhcCC------C----HHHHHHHHHHHHHHHHHHcCCCCHHHHHHHhCCCCHH
Confidence 456688999999999999999976542 2 345555555554443 4678999999999998644
No 63
>PF08235 LNS2: LNS2 (Lipin/Ned1/Smp2); InterPro: IPR013209 This domain is found in Saccharomyces cerevisiae (Baker's yeast) protein SMP2, proteins with an N-terminal lipin domain (IPR007651 from INTERPRO) and phosphatidylinositol transfer proteins []. SMP2 is involved in plasmid maintenance and respiration []. Lipin proteins are involved in adipose tissue development and insulin resistance [].
Probab=28.84 E-value=1.8e+02 Score=22.11 Aligned_cols=57 Identities=19% Similarity=0.283 Sum_probs=40.0
Q ss_pred HHHHHHhccC-CCCCCCC-----------------ccHHHHHHHHHHHHHHHhhcC--------CCCCCHHHHHHHcCCC
Q psy3080 50 FYLRHLCDTA-KFPDWPI-----------------NQPIMLLKDVLEAWKAEVERK--------PPEMSVDDAYEALGLT 103 (118)
Q Consensus 50 yYLr~lcd~~-rfp~wpI-----------------~dpv~lL~~lL~~Wr~E~~kk--------p~~mS~edAy~vLGL~ 103 (118)
=||....+.- +||+.|+ +||..|=-++|...+..+-.. ++..|-..||+-.||+
T Consensus 62 ~~L~~~~q~~~~lP~Gpv~~sP~~l~~al~rEvi~~~p~~fK~~~L~~l~~~f~~~~~pf~agfGN~~tDv~aY~~vGip 141 (157)
T PF08235_consen 62 SWLAQHQQQGHNLPDGPVLLSPDSLFSALHREVISKDPEEFKIACLRDLRALFPPDGNPFYAGFGNRSTDVIAYKAVGIP 141 (157)
T ss_pred HHHHHHHhCCccCCCCCEEECCcchhhhhhccccccChHHHHHHHHHHHHHhcCCCCCeEEEecCCcHHHHHHHHHcCCC
Confidence 3555553333 8888875 478888778888777764422 5566888999999999
Q ss_pred CCC
Q psy3080 104 RGS 106 (118)
Q Consensus 104 ~~a 106 (118)
++-
T Consensus 142 ~~r 144 (157)
T PF08235_consen 142 KSR 144 (157)
T ss_pred hhh
Confidence 854
No 64
>cd06170 LuxR_C_like C-terminal DNA-binding domain of LuxR-like proteins. This domain contains a helix-turn-helix motif and binds DNA. Proteins belonging to this group are response regulators; some act as transcriptional activators, others as transcriptional repressors. Many are active as homodimers. Many are two domain proteins in which the DNA binding property of the C-terminal DNA binding domain is modulated by modifications of the N-terminal domain. For example in the case of Lux R which participates in the regulation of gene expression in response to fluctuations in cell-population density (quorum-sensing), a signaling molecule, the pheromone Acyl HSL (N-acyl derivatives of homoserine lactone), binds to the N-terminal domain and leads to LuxR dimerization. For others phophorylation of the N-terminal domain leads to multimerization, for example Escherichia coli NarL and Sinorhizobium melilot FixJ. NarL controls gene expression of many respiratory-related operons when environmental
Probab=28.73 E-value=31 Score=19.68 Aligned_cols=18 Identities=11% Similarity=0.268 Sum_probs=14.8
Q ss_pred CCCCHHHHHHHcCCCCCC
Q psy3080 89 PEMSVDDAYEALGLTRGS 106 (118)
Q Consensus 89 ~~mS~edAy~vLGL~~~a 106 (118)
.++|..+..+.||+++++
T Consensus 14 ~~~s~~eia~~l~~s~~t 31 (57)
T cd06170 14 EGKTNKEIADILGISEKT 31 (57)
T ss_pred cCCCHHHHHHHHCCCHHH
Confidence 357999999999998765
No 65
>KOG3786|consensus
Probab=28.73 E-value=34 Score=29.96 Aligned_cols=18 Identities=22% Similarity=0.833 Sum_probs=15.2
Q ss_pred CCCCCCCccHHHHHHHHH
Q psy3080 60 KFPDWPINQPIMLLKDVL 77 (118)
Q Consensus 60 rfp~wpI~dpv~lL~~lL 77 (118)
-|-+||++||++.+..+-
T Consensus 321 QFk~wpw~dp~eIF~ki~ 338 (395)
T KOG3786|consen 321 QFKGWPWRDPVEIFNKIK 338 (395)
T ss_pred ccCCCCCCCHHHHHhhcc
Confidence 488999999999987663
No 66
>PF04157 EAP30: EAP30/Vps36 family; InterPro: IPR007286 EAP30 is a subunit of the ELL complex. The ELL is an 80kDa RNA polymerase II transcription factor. ELL interacts with three other proteins to form the complex known as ELL complex. The ELL complex is capable of increasing that catalytic rate of transcription elongation, but is unable to repress initiation of transcription by RNA polymerase II as is the case of ELL. EAP30 is thought to lead to the derepression of ELL's transcriptional inhibitory activity. ; PDB: 2ZME_A 3CUQ_A 1W7P_D 1U5T_B.
Probab=28.25 E-value=94 Score=24.10 Aligned_cols=37 Identities=22% Similarity=0.209 Sum_probs=26.2
Q ss_pred HHHHhhcCCCCCCHHHHHHHcCCCC-CCC-CChHHHHhh
Q psy3080 80 WKAEVERKPPEMSVDDAYEALGLTR-GSH-HEENIVRKA 116 (118)
Q Consensus 80 Wr~E~~kkp~~mS~edAy~vLGL~~-~a~-~se~~IRrA 116 (118)
-.....+.+|-|+.+|.|..+.=.. +.. +|+++|.+|
T Consensus 103 c~~~~~~~GGii~L~dl~~~~nr~R~g~~lISp~Di~~A 141 (223)
T PF04157_consen 103 CLATRSKNGGIISLSDLYCRYNRARGGSELISPEDILRA 141 (223)
T ss_dssp HHHHCCTTTSEEEHHHHHHHHHHCTTTSST--HHHHHHH
T ss_pred HHHHHhcCCCEEEHHHHHHHHHHhcccCCCcCHHHHHHH
Confidence 3344445577889999999998877 443 499999887
No 67
>PF07962 Swi3: Replication Fork Protection Component Swi3; InterPro: IPR012923 Replication fork pausing is required to initiate recombination events. More specifically, Swi1 is required for recombination near the mat1 locus. Swi3 has been found to co-purify with Swi1. Together they define a fork protection complex that coordinates leading- and lagging-strand synthesis and stabilises stalled replication forks []. This complex is required for accurate replication, fork protection and replication checkpoint signalling [, ].; GO: 0006974 response to DNA damage stimulus, 0007049 cell cycle, 0048478 replication fork protection, 0005634 nucleus
Probab=27.99 E-value=1.9e+02 Score=19.43 Aligned_cols=51 Identities=24% Similarity=0.287 Sum_probs=42.1
Q ss_pred hHHHHHHHhccCCCCCCCCccHHHHHHHHHHHHHHHhhcCCCCCCHHHHHHHc
Q psy3080 48 NIFYLRHLCDTAKFPDWPINQPIMLLKDVLEAWKAEVERKPPEMSVDDAYEAL 100 (118)
Q Consensus 48 ~~yYLr~lcd~~rfp~wpI~dpv~lL~~lL~~Wr~E~~kkp~~mS~edAy~vL 100 (118)
+.-||++.++..+|.. -...++-|..||.+.+.+...==|.++-+|+...+
T Consensus 12 Glp~l~~~~k~~k~kg--kghE~~dL~~ll~~Yq~W~h~LfPk~~F~d~i~~v 62 (83)
T PF07962_consen 12 GLPYLRKNFKKFKFKG--KGHEYEDLRRLLQFYQLWAHRLFPKASFEDFIERV 62 (83)
T ss_pred CHHHHHHHHHHcCcCC--CCCHHHHHHHHHHHHHHHHHhcCCCCCHHHHHHHH
Confidence 4678888889998886 47788889999999999987778888988887654
No 68
>PF01418 HTH_6: Helix-turn-helix domain, rpiR family; InterPro: IPR000281 This domain contains a helix-turn-helix motif []. Every member of this family is N-terminal to a SIS domain IPR001347 from INTERPRO. Members of this family are probably regulators of genes involved in phosphosugar metobolism.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent; PDB: 2O3F_B 3IWF_B.
Probab=27.78 E-value=58 Score=21.09 Aligned_cols=55 Identities=16% Similarity=0.320 Sum_probs=37.7
Q ss_pred HHHHHHhHhhccchhhccCcccccccc-----ccCccCccccccchhhhHHHHHHHhccCCCCC
Q psy3080 5 LIGKIAAHIADFTPRLRGNNRAVYQFC-----GIPAVRYPQLEQEMFVNIFYLRHLCDTAKFPD 63 (118)
Q Consensus 5 li~~i~~Hl~df~~rL~~~~~a~y~y~-----pip~i~YpeL~~elfC~~yYLr~lcd~~rfp~ 63 (118)
++++|..+...+++.-+. +.+|. -++..+=-||....+++.==+.++|..+||.+
T Consensus 3 l~~~i~~~~~~ls~~e~~----Ia~yil~~~~~~~~~si~elA~~~~vS~sti~Rf~kkLG~~g 62 (77)
T PF01418_consen 3 LLEKIRSQYNSLSPTEKK----IADYILENPDEIAFMSISELAEKAGVSPSTIVRFCKKLGFSG 62 (77)
T ss_dssp HHHHHHHHGGGS-HHHHH----HHHHHHH-HHHHCT--HHHHHHHCTS-HHHHHHHHHHCTTTC
T ss_pred HHHHHHHHHhhCCHHHHH----HHHHHHhCHHHHHHccHHHHHHHcCCCHHHHHHHHHHhCCCC
Confidence 567777777777765432 22222 23445556899999999999999999999999
No 69
>COG5451 Predicted secreted protein [Function unknown]
Probab=27.61 E-value=20 Score=26.82 Aligned_cols=21 Identities=33% Similarity=0.697 Sum_probs=16.3
Q ss_pred chhhhHHHHHHHhccCCCCCC
Q psy3080 44 EMFVNIFYLRHLCDTAKFPDW 64 (118)
Q Consensus 44 elfC~~yYLr~lcd~~rfp~w 64 (118)
|+-=+.-|||+||.+.|=-.|
T Consensus 41 e~LGs~hyLRnlC~~~g~q~W 61 (128)
T COG5451 41 EALGSVHYLRNLCGEKGDQYW 61 (128)
T ss_pred HHHhHHHHHHHHccccccHHH
Confidence 455677899999999876555
No 70
>PF01466 Skp1: Skp1 family, dimerisation domain; InterPro: IPR016072 SKP1 (together with SKP2) was identified as an essential component of the cyclin A-CDK2 S phase kinase complex []. It was found to bind several F-box containing proteins (e.g., Cdc4, Skp2, cyclin F) and to be involved in the ubiquitin protein degradation pathway. A yeast homologue of SKP1 (P52286) was identified in the centromere bound kinetochore complex [] and is also involved in the ubiquitin pathway []. In Dictyostelium discoideum (Slime mold) FP21 was shown to be glycosylated in the cytosol and has homology to SKP1 []. This entry represents a dimerisation domain found at the C-terminal of SKP1 proteins [], as well as in subunit D of the centromere DNA-binding protein complex Cbf3 []. This domain is multi-helical in structure, and consists of an interlocked herterodimer in F-box proteins.; GO: 0006511 ubiquitin-dependent protein catabolic process; PDB: 2P1O_A 3OGL_G 3OGM_A 3C6O_A 2P1N_A 2P1Q_A 3OGK_I 3C6N_A 3C6P_A 2P1P_A ....
Probab=27.15 E-value=65 Score=21.07 Aligned_cols=28 Identities=21% Similarity=0.287 Sum_probs=16.8
Q ss_pred CCCCHHHHHHHcCCCCCCCC-ChHHHHhh
Q psy3080 89 PEMSVDDAYEALGLTRGSHH-EENIVRKA 116 (118)
Q Consensus 89 ~~mS~edAy~vLGL~~~a~~-se~~IRrA 116 (118)
.++|.++-.+.||++.+-+. .+.+||+-
T Consensus 44 ~gks~eeir~~fgi~~d~t~eee~~i~~e 72 (78)
T PF01466_consen 44 KGKSPEEIRKYFGIENDLTPEEEEEIRKE 72 (78)
T ss_dssp TTS-HHHHHHHHT---TSSHHHHHHHHHH
T ss_pred cCCCHHHHHHHcCCCCCCCHHHHHHHHHH
Confidence 36899999999999987632 33456653
No 71
>PRK12547 RNA polymerase sigma factor; Provisional
Probab=27.07 E-value=33 Score=24.58 Aligned_cols=27 Identities=26% Similarity=0.310 Sum_probs=20.9
Q ss_pred CCCCCHHHHHHHcCCCCCCCCChHHHHhh
Q psy3080 88 PPEMSVDDAYEALGLTRGSHHEENIVRKA 116 (118)
Q Consensus 88 p~~mS~edAy~vLGL~~~a~~se~~IRrA 116 (118)
-.++|.+|.-++||+++++ -.+-+.||
T Consensus 126 ~~g~s~~eIA~~lgis~~t--V~~~l~Ra 152 (164)
T PRK12547 126 ASGFSYEDAAAICGCAVGT--IKSRVSRA 152 (164)
T ss_pred HcCCCHHHHHHHhCCCHHH--HHHHHHHH
Confidence 4588999999999999877 55555544
No 72
>COG4741 Predicted secreted endonuclease distantly related to archaeal Holliday junction resolvase [Nucleotide transport and metabolism]
Probab=27.07 E-value=48 Score=26.01 Aligned_cols=40 Identities=18% Similarity=0.434 Sum_probs=34.6
Q ss_pred HHHHHHHhHhhccchhhccCcc-ccccccccCccCcccccc
Q psy3080 4 LLIGKIAAHIADFTPRLRGNNR-AVYQFCGIPAVRYPQLEQ 43 (118)
Q Consensus 4 ~li~~i~~Hl~df~~rL~~~~~-a~y~y~pip~i~YpeL~~ 43 (118)
-+..+++.||+.|-|.++=||+ +.|.=+|+--|.|--|++
T Consensus 86 Vi~GrVtEqlaPffp~f~ynPkD~RfIGTPvD~iVFdGLs~ 126 (175)
T COG4741 86 VILGRVTEQLAPFFPEFKYNPKDARFIGTPVDFIVFDGLSE 126 (175)
T ss_pred HHhhhhHhhhcccccCCCcCCccceeeCCCceEEEEcCCCc
Confidence 4677899999999999999997 688899998898877754
No 73
>PF04967 HTH_10: HTH DNA binding domain; InterPro: IPR007050 Numerous bacterial transcription regulatory proteins bind DNA via a helix-turn-helix (HTH) motif. This entry represents the HTH DNA binding domain found in Halobacterium salinarium (Halobacterium halobium) and described as a putative bacterio-opsin activator.
Probab=26.91 E-value=71 Score=20.11 Aligned_cols=42 Identities=17% Similarity=0.321 Sum_probs=28.5
Q ss_pred HHHHHHHHHHHhhcCCCCCCHHHHHHHcCCCCCCCCChHHHHhh
Q psy3080 73 LKDVLEAWKAEVERKPPEMSVDDAYEALGLTRGSHHEENIVRKA 116 (118)
Q Consensus 73 L~~lL~~Wr~E~~kkp~~mS~edAy~vLGL~~~a~~se~~IRrA 116 (118)
.+.|..++..-.=--|-..|.+|=-+.||+++++ =..-||||
T Consensus 6 ~e~L~~A~~~GYfd~PR~~tl~elA~~lgis~st--~~~~LRra 47 (53)
T PF04967_consen 6 REILKAAYELGYFDVPRRITLEELAEELGISKST--VSEHLRRA 47 (53)
T ss_pred HHHHHHHHHcCCCCCCCcCCHHHHHHHhCCCHHH--HHHHHHHH
Confidence 3444444443222237788999999999999988 66677776
No 74
>TIGR02954 Sig70_famx3 RNA polymerase sigma-70 factor, TIGR02954 family. This group of sigma factors are members of the sigma-70 family (TIGR02937). They and appear by homology, tree building, bidirectional best hits and one-to-a-genome distribution, to represent a conserved family. This family is found in certain Bacillus and Clostridium species.
Probab=26.87 E-value=43 Score=23.89 Aligned_cols=19 Identities=11% Similarity=0.344 Sum_probs=16.9
Q ss_pred CCCCCHHHHHHHcCCCCCC
Q psy3080 88 PPEMSVDDAYEALGLTRGS 106 (118)
Q Consensus 88 p~~mS~edAy~vLGL~~~a 106 (118)
-.++|.+|.-+.||+++++
T Consensus 133 ~~g~s~~eiA~~lgis~~t 151 (169)
T TIGR02954 133 YHDLTIKEIAEVMNKPEGT 151 (169)
T ss_pred HcCCCHHHHHHHHCCCHHH
Confidence 4579999999999999877
No 75
>TIGR01270 Trp_5_monoox tryptophan 5-monooxygenase, tetrameric. This model describes tryptophan 5-monooxygenase, a member of the family of tetrameric, biopterin-dependent aromatic amino acid hydroxylases found in metazoans. It is closely related to tetrameric phenylalanine-4-hydroxylase and tyrosine 3-monooxygenase, and more distantly related to the monomeric phenylalanine-4-hydroxylase found in some Gram-negative bacteria.
Probab=26.78 E-value=2e+02 Score=25.74 Aligned_cols=17 Identities=29% Similarity=0.452 Sum_probs=15.4
Q ss_pred cccCccCccccccchhh
Q psy3080 31 CGIPAVRYPQLEQEMFV 47 (118)
Q Consensus 31 ~pip~i~YpeL~~elfC 47 (118)
.|||.|.|.+-|+++|=
T Consensus 183 ~~ip~v~YT~~E~~~W~ 199 (464)
T TIGR01270 183 EPIPRVEYTEEERKTWG 199 (464)
T ss_pred CCCCccccCHHHHHHHH
Confidence 49999999999999993
No 76
>PRK11511 DNA-binding transcriptional activator MarA; Provisional
Probab=26.76 E-value=1.3e+02 Score=21.08 Aligned_cols=62 Identities=10% Similarity=0.087 Sum_probs=44.5
Q ss_pred ccCccccccchhhhHHHHHHHhccC-CCCCCCCccHHHHHHHHHHHHHHHhhc---CCCCCCHHHHHHHcCCCCCC
Q psy3080 35 AVRYPQLEQEMFVNIFYLRHLCDTA-KFPDWPINQPIMLLKDVLEAWKAEVER---KPPEMSVDDAYEALGLTRGS 106 (118)
Q Consensus 35 ~i~YpeL~~elfC~~yYLr~lcd~~-rfp~wpI~dpv~lL~~lL~~Wr~E~~k---kp~~mS~edAy~vLGL~~~a 106 (118)
+++=.+|..++-++..||.++..+. |-+ ..+.+..+|-+..+ ..+++|+.|....+|.+..+
T Consensus 25 ~~sl~~lA~~~g~S~~~l~r~Fk~~~G~s----------~~~~l~~~Rl~~A~~~L~~t~~~i~eIA~~~Gf~s~s 90 (127)
T PRK11511 25 PLSLEKVSERSGYSKWHLQRMFKKETGHS----------LGQYIRSRKMTEIAQKLKESNEPILYLAERYGFESQQ 90 (127)
T ss_pred CCCHHHHHHHHCcCHHHHHHHHHHHHCcC----------HHHHHHHHHHHHHHHHHHcCCCCHHHHHHHhCCCCHH
Confidence 3566899999999999998888764 443 23555555554322 24578999999999998644
No 77
>PF03683 UPF0175: Uncharacterised protein family (UPF0175); InterPro: IPR005368 This entry contains small proteins of unknown function.
Probab=26.75 E-value=1e+02 Score=20.26 Aligned_cols=18 Identities=28% Similarity=0.453 Sum_probs=16.2
Q ss_pred CCCCCCHHHHHHHcCCCC
Q psy3080 87 KPPEMSVDDAYEALGLTR 104 (118)
Q Consensus 87 kp~~mS~edAy~vLGL~~ 104 (118)
+-+.+|...|.+++|++.
T Consensus 31 ~~g~iS~gkAAelag~s~ 48 (76)
T PF03683_consen 31 EEGKISLGKAAELAGMSR 48 (76)
T ss_pred HcCCCCHHHHHHHhCCCH
Confidence 488999999999999986
No 78
>COG5509 Uncharacterized small protein containing a coiled-coil domain [Function unknown]
Probab=26.38 E-value=65 Score=21.51 Aligned_cols=31 Identities=32% Similarity=0.427 Sum_probs=25.0
Q ss_pred ccHHHHHHHHHHHHHHHhhcCCCCCCHHHHH
Q psy3080 67 NQPIMLLKDVLEAWKAEVERKPPEMSVDDAY 97 (118)
Q Consensus 67 ~dpv~lL~~lL~~Wr~E~~kkp~~mS~edAy 97 (118)
+.-+.+|++=.++.++|..||..+-|..+|.
T Consensus 31 ~eRIalLq~EIeRlkAe~~kK~~srsAAeaL 61 (65)
T COG5509 31 EERIALLQAEIERLKAELAKKKASRSAAEAL 61 (65)
T ss_pred HHHHHHHHHHHHHHHHHHHhhhccHHHHHHH
Confidence 3456888888999999999998887777764
No 79
>PRK12702 mannosyl-3-phosphoglycerate phosphatase; Reviewed
Probab=25.99 E-value=69 Score=27.05 Aligned_cols=43 Identities=12% Similarity=0.153 Sum_probs=35.2
Q ss_pred ccCCCCCCCCccHHHHHHHHHHHHHHHhhcC---CCCCCHHHHHHHcCCCCCC
Q psy3080 57 DTAKFPDWPINQPIMLLKDVLEAWKAEVERK---PPEMSVDDAYEALGLTRGS 106 (118)
Q Consensus 57 d~~rfp~wpI~dpv~lL~~lL~~Wr~E~~kk---p~~mS~edAy~vLGL~~~a 106 (118)
-++|-|- ..++++|+..+.+...+ =+.||.+|-.+.-||+...
T Consensus 94 ~~lg~~y-------~~ir~~L~~l~~~~~~~f~gF~d~t~~ei~~~TGL~~~~ 139 (302)
T PRK12702 94 CALGLPY-------PCLRHILQQVRQDSHLDLIGFGDWTASELAAATGIPLEE 139 (302)
T ss_pred EecCCCH-------HHHHHHHHHHHHHhCCCceehhhCCHHHHHHHhCcCHHH
Confidence 3556666 89999999999997643 5689999999999999855
No 80
>PF09539 DUF2385: Protein of unknown function (DUF2385); InterPro: IPR012645 Members of this uncharacterised protein family are found in a number of alphaproteobacteria, including root nodule bacteria, Brucella suis, Caulobacter crescentus (Caulobacter vibrioides), and Rhodopseudomonas palustris. Conserved residues include two well-separated cysteines, suggesting a disulphide bond. The function is unknown.
Probab=25.90 E-value=66 Score=22.88 Aligned_cols=22 Identities=32% Similarity=0.699 Sum_probs=15.9
Q ss_pred chhhhHHHHHHHhccCCCCCCC
Q psy3080 44 EMFVNIFYLRHLCDTAKFPDWP 65 (118)
Q Consensus 44 elfC~~yYLr~lcd~~rfp~wp 65 (118)
|+.=..-|||+||..-.=+.|.
T Consensus 11 eiLGalHyLR~LCg~~~~~~WR 32 (96)
T PF09539_consen 11 EILGALHYLRNLCGGNEDQYWR 32 (96)
T ss_pred HHHHHHHHHHHHhCCCCcchHH
Confidence 4556678999999965555554
No 81
>TIGR02950 SigM_subfam RNA polymerase sigma factor, SigM family. This family of RNA polymerase sigma factors is a member of the Sigma-70 subfamily (TIGR02937) and is restricted to certain lineages of the order Bacillales. This family encompasses at least two distinct sigma factors as two proteins are found in each of B. anthracis, B. subtilis subsp. subtilis str. 168, and B. lichiniformis (although these are not apparently the same two in each). One of these is designated as SigM in B. subtilis (Swiss_Prot: SIGM_BACSU) and is activated by various stressors.
Probab=25.86 E-value=1.2e+02 Score=21.00 Aligned_cols=19 Identities=26% Similarity=0.226 Sum_probs=16.5
Q ss_pred CCCCCHHHHHHHcCCCCCC
Q psy3080 88 PPEMSVDDAYEALGLTRGS 106 (118)
Q Consensus 88 p~~mS~edAy~vLGL~~~a 106 (118)
-.++|.+|.-+.||+++++
T Consensus 119 ~~g~s~~eIA~~lgis~~t 137 (154)
T TIGR02950 119 FKEFSYKEIAELLNLSLAK 137 (154)
T ss_pred hccCcHHHHHHHHCCCHHH
Confidence 3578999999999999876
No 82
>PRK00430 fis global DNA-binding transcriptional dual regulator Fis; Provisional
Probab=25.60 E-value=1e+02 Score=21.26 Aligned_cols=45 Identities=24% Similarity=0.287 Sum_probs=26.5
Q ss_pred HHHHHhccCCCCCCCCccHHHHHHHHHHHHHHH-----hhcCCCCCCHHHHHHHcCCCCCC
Q psy3080 51 YLRHLCDTAKFPDWPINQPIMLLKDVLEAWKAE-----VERKPPEMSVDDAYEALGLTRGS 106 (118)
Q Consensus 51 YLr~lcd~~rfp~wpI~dpv~lL~~lL~~Wr~E-----~~kkp~~mS~edAy~vLGL~~~a 106 (118)
|+.+|.++. |. .+++.+++..... +++.. -+..+|-+.||++.++
T Consensus 35 ~~~~l~~~~--~~-------~~~~~~l~~~Er~~i~~aL~~~~--gn~s~AAr~LGIsRsT 84 (95)
T PRK00430 35 YFAQLNGQD--VN-------DLYELVLAEVEAPLLDMVMQYTR--GNQTRAALMLGINRGT 84 (95)
T ss_pred HHHHhcCCC--ch-------hHHHHHHHHHHHHHHHHHHHHcC--CCHHHHHHHhCCCHHH
Confidence 777777652 23 5666655543222 22223 3478899999998654
No 83
>smart00422 HTH_MERR helix_turn_helix, mercury resistance.
Probab=25.54 E-value=41 Score=20.56 Aligned_cols=15 Identities=7% Similarity=0.242 Sum_probs=11.9
Q ss_pred CCHHHHHHHcCCCCC
Q psy3080 91 MSVDDAYEALGLTRG 105 (118)
Q Consensus 91 mS~edAy~vLGL~~~ 105 (118)
||..|+.+.+|+++.
T Consensus 1 ~s~~eva~~~gvs~~ 15 (70)
T smart00422 1 YTIGEVAKLAGVSVR 15 (70)
T ss_pred CCHHHHHHHHCcCHH
Confidence 577888888888863
No 84
>COG4746 Uncharacterized protein conserved in archaea [Function unknown]
Probab=24.96 E-value=54 Score=22.76 Aligned_cols=19 Identities=37% Similarity=0.560 Sum_probs=14.2
Q ss_pred cCCCCCCCCccHHHHHHHH
Q psy3080 58 TAKFPDWPINQPIMLLKDV 76 (118)
Q Consensus 58 ~~rfp~wpI~dpv~lL~~l 76 (118)
.+.=-++|||+|+.++-+|
T Consensus 15 ~~k~a~fPInn~~eL~~AL 33 (80)
T COG4746 15 ALKGADFPINNPEELVAAL 33 (80)
T ss_pred HHccCCCCCCCHHHHHHhc
Confidence 3333689999999987665
No 85
>PHA01083 hypothetical protein
Probab=24.89 E-value=32 Score=26.37 Aligned_cols=39 Identities=13% Similarity=0.240 Sum_probs=27.0
Q ss_pred HHHhccCCCCCCCCccHHHHHHHHHHHHHHHhhcCCCCCCHHHH---HHHcCCCCC
Q psy3080 53 RHLCDTAKFPDWPINQPIMLLKDVLEAWKAEVERKPPEMSVDDA---YEALGLTRG 105 (118)
Q Consensus 53 r~lcd~~rfp~wpI~dpv~lL~~lL~~Wr~E~~kkp~~mS~edA---y~vLGL~~~ 105 (118)
++|+..+|-+. +.+..||..+.+ .+|.++| .+.+|++++
T Consensus 20 kqLA~~LGVs~-----------q~IS~~R~G~r~---~i~de~A~~LAe~aGiDp~ 61 (149)
T PHA01083 20 KQIAHDLGVSP-----------QKISKMRTGVRT---YISDEEAIFLAESAGIDPE 61 (149)
T ss_pred HHHHHHhCCCH-----------HHHHHHHcCCCC---CCCHHHHHHHHHHhCCCHH
Confidence 46777788777 788899985433 3455555 567888874
No 86
>KOG3547|consensus
Probab=24.73 E-value=51 Score=29.25 Aligned_cols=47 Identities=23% Similarity=0.414 Sum_probs=31.4
Q ss_pred HHHHHHhHhhccchhhccCccccccccccCccCccccccchhhhHHHHHHH
Q psy3080 5 LIGKIAAHIADFTPRLRGNNRAVYQFCGIPAVRYPQLEQEMFVNIFYLRHL 55 (118)
Q Consensus 5 li~~i~~Hl~df~~rL~~~~~a~y~y~pip~i~YpeL~~elfC~~yYLr~l 55 (118)
++..+...|.+|-..+. .--.|+++|| |+.|||..- |=-+.|++--|
T Consensus 207 ~~~~~~~ei~~fR~~l~--~L~~yDwVpI-PL~YpQvV~-lAVr~YF~~cL 253 (450)
T KOG3547|consen 207 LYNSLLGEIKKFRTGLS--LLCNYDWVPI-PLVYPQVVF-LAVRSYFFICL 253 (450)
T ss_pred HHHHHHHHHHHHHHhhc--cccCCCceec-cccccceee-eehHHhHHHHh
Confidence 45666777788765554 4557999999 678999864 33345555433
No 87
>cd03569 VHS_Hrs_Vps27p VHS domain family, Hrs and Vps27p subfamily; composed of Hrs (Hepatocyte growth factor-regulated tyrosine kinase substrate) and its yeast homolog Vps27p (vacuolar protein sorting). The VHS domain, an essential part of Hrs/Vps27p, has a superhelical structure similar to the structure of ARM (Armadillo) repeats and is present at the N-termini of proteins. Hrs also contains a FYVE (Fab1p, YOTB, Vac1p, and EEA1) zinc finger domain C-terminal to VHS, as well as two coiled-coil domains. Hrs has been proposed to play a role in at least three vesicle trafficking events: exocytosis, endocytosis, and endosome to lysosome trafficking. Hrs is involved in promoting rapid recycling of endocytosed signaling receptors to the plasma membrane.
Probab=24.71 E-value=1.4e+02 Score=21.81 Aligned_cols=32 Identities=25% Similarity=0.430 Sum_probs=23.7
Q ss_pred HHHHHHHHHHHHHHHhhcCCCCCCHHHHHHHc
Q psy3080 69 PIMLLKDVLEAWKAEVERKPPEMSVDDAYEAL 100 (118)
Q Consensus 69 pv~lL~~lL~~Wr~E~~kkp~~mS~edAy~vL 100 (118)
-..-+-+++..|...+...|.--...++|+.|
T Consensus 101 Vk~kil~li~~W~~~f~~~~~l~~i~~~y~~L 132 (142)
T cd03569 101 VRQKILELIQAWALAFRNKPQLKYVVDTYQIL 132 (142)
T ss_pred HHHHHHHHHHHHHHHhCCCcccHHHHHHHHHH
Confidence 34577889999999885443333489999887
No 88
>smart00814 Alpha_TIF Alpha trans-inducing protein (Alpha-TIF). Alpha-TIF (VP16) from Herpes Simplex virus is an essential tegument protein involved in the transcriptional activation of viral immediate early (IE) promoters (alpha genes) during the lytic phase of viral infection. VP16 associates with cellular transcription factors to enhance transcription rates, including the general transcription factor TFIIB and the transcriptional coactivator PC4. The N-terminal residues of VP16 confer specificity for the IE genes, while the C-terminal residues are responsible for transcriptional activation. Within the C-terminal region are two activation regions that can independently and cooperatively activate transcription. VP16 forms a transcriptional regulatory complex with two cellular proteins, the POU-domain transcription factor Oct-1 and the cell-proliferation factor HCF-1. VP16 is an alpha/beta protein with an unusual fold. Other transcription factors may have a similar topology.
Probab=24.44 E-value=48 Score=28.71 Aligned_cols=47 Identities=23% Similarity=0.388 Sum_probs=31.3
Q ss_pred HHHHhccCCCCCCCCccHHHHHHHHHHHHHHHhhcC-CC-----------CCCHHHHHHHcCCCC
Q psy3080 52 LRHLCDTAKFPDWPINQPIMLLKDVLEAWKAEVERK-PP-----------EMSVDDAYEALGLTR 104 (118)
Q Consensus 52 Lr~lcd~~rfp~wpI~dpv~lL~~lL~~Wr~E~~kk-p~-----------~mS~edAy~vLGL~~ 104 (118)
..+|+++++||+||= |.+.|+.|-.-+=-- |. +.|.+|-.+.++.+.
T Consensus 36 y~rLl~eL~F~eGp~------lls~le~wNeDLFSc~P~n~dLY~~~~~LSt~~~eVi~~~~~p~ 94 (356)
T smart00814 36 YQRLLTELGFAEGPA------LLSALETWNEDLFSCFPTNEDLYREAKILSTSPSEVIELLSTPE 94 (356)
T ss_pred HHHHHHHcCCCccHH------HHHHHHHhcchhhccCCCChhHhhhhhhhcCCHHHHHhhccCCC
Confidence 367899999999874 778888897654221 21 345666666655554
No 89
>smart00421 HTH_LUXR helix_turn_helix, Lux Regulon. lux regulon (activates the bioluminescence operon
Probab=24.41 E-value=41 Score=18.93 Aligned_cols=18 Identities=17% Similarity=0.366 Sum_probs=14.8
Q ss_pred CCCCHHHHHHHcCCCCCC
Q psy3080 89 PEMSVDDAYEALGLTRGS 106 (118)
Q Consensus 89 ~~mS~edAy~vLGL~~~a 106 (118)
.++|..+.-+.||++.+.
T Consensus 17 ~g~s~~eia~~l~is~~t 34 (58)
T smart00421 17 EGLTNKEIAERLGISEKT 34 (58)
T ss_pred cCCCHHHHHHHHCCCHHH
Confidence 357999999999998765
No 90
>PRK12532 RNA polymerase sigma factor; Provisional
Probab=24.35 E-value=41 Score=24.71 Aligned_cols=26 Identities=15% Similarity=0.266 Sum_probs=20.0
Q ss_pred CCCCCHHHHHHHcCCCCCCCCChHHHHh
Q psy3080 88 PPEMSVDDAYEALGLTRGSHHEENIVRK 115 (118)
Q Consensus 88 p~~mS~edAy~vLGL~~~a~~se~~IRr 115 (118)
-.+||.+|.-++||+++++ -...+.|
T Consensus 150 ~~g~s~~EIA~~lgis~~t--Vk~~l~R 175 (195)
T PRK12532 150 ILGFSSDEIQQMCGISTSN--YHTIMHR 175 (195)
T ss_pred HhCCCHHHHHHHHCCCHHH--HHHHHHH
Confidence 4588999999999999887 4444444
No 91
>TIGR01537 portal_HK97 phage portal protein, HK97 family. This model represents one of several distantly related families of phage portal protein. This protein forms a hole, or portal, that enables DNA passage during packaging and ejection. It also forms the junction between the phage head (capsid) and the tail proteins. It functions as a dodecamer of a single polypeptide of average mol. wt. of 40-90 KDa.
Probab=24.11 E-value=1.3e+02 Score=24.07 Aligned_cols=19 Identities=16% Similarity=0.223 Sum_probs=16.9
Q ss_pred CCCCCCHHHHHHHcCCCCC
Q psy3080 87 KPPEMSVDDAYEALGLTRG 105 (118)
Q Consensus 87 kp~~mS~edAy~vLGL~~~ 105 (118)
+.+.||.+|+.+.||++|-
T Consensus 321 ~~G~~t~NE~R~~~g~~p~ 339 (341)
T TIGR01537 321 QSGFLTINEVREREGLPPL 339 (341)
T ss_pred hCCCCCHHHHHHhcCCCCC
Confidence 5789999999999999874
No 92
>cd08306 Death_FADD Fas-associated Death Domain protein-protein interaction domain. Death domain (DD) found in FAS-associated via death domain (FADD). FADD is a component of the death-inducing signaling complex (DISC) and serves as an adaptor in the signaling pathway of death receptor proteins. It modulates apoptosis as well as non-apoptotic processes such as cell cycle progression, survival, innate immune signaling, and hematopoiesis. FADD contains an N-terminal DED and a C-terminal DD. Its DD interacts with the DD of the activated death receptor, FAS, and its DED recruits the initiator caspases, caspase-8 and -10, to the DISC complex via a homotypic interaction with the N-terminal DED of the caspase. DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members of the DD superfamily including CARD (Caspase activation and recruitment domain),
Probab=23.99 E-value=1.3e+02 Score=20.18 Aligned_cols=51 Identities=16% Similarity=0.212 Sum_probs=29.1
Q ss_pred HHHhccCCCCCCCC--------ccHHHHHHHHHHHHHHHhhcCCCCCCHHHHHHHcCCC
Q psy3080 53 RHLCDTAKFPDWPI--------NQPIMLLKDVLEAWKAEVERKPPEMSVDDAYEALGLT 103 (118)
Q Consensus 53 r~lcd~~rfp~wpI--------~dpv~lL~~lL~~Wr~E~~kkp~~mS~edAy~vLGL~ 103 (118)
+.|+.++||++=.| .|..+...++|.+|+..-.+....-+.-+|+.-.|+.
T Consensus 17 k~laR~LGlse~~Id~i~~~~~~~~~eq~~~mL~~W~~~~g~~At~~~L~~aL~~~~l~ 75 (86)
T cd08306 17 RKLARKLGLSETKIESIEEAHPRNLREQVRQSLREWKKIKKKEAKVADLIKALRDCQLN 75 (86)
T ss_pred HHHHHHcCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHhHCcchHHHHHHHHHHHcCcH
Confidence 45666666664222 3445677788888888532221111366777777774
No 93
>PRK12542 RNA polymerase sigma factor; Provisional
Probab=23.99 E-value=39 Score=24.58 Aligned_cols=26 Identities=12% Similarity=0.280 Sum_probs=19.8
Q ss_pred CCCCHHHHHHHcCCCCCCCCChHHHHhh
Q psy3080 89 PEMSVDDAYEALGLTRGSHHEENIVRKA 116 (118)
Q Consensus 89 ~~mS~edAy~vLGL~~~a~~se~~IRrA 116 (118)
.++|.+|.-++||+++++ -.+-+.||
T Consensus 137 ~g~s~~EIA~~lgis~~t--Vk~~l~Ra 162 (185)
T PRK12542 137 YNLTYQEISSVMGITEAN--VRKQFERA 162 (185)
T ss_pred cCCCHHHHHHHHCCCHHH--HHHHHHHH
Confidence 478999999999999887 44444443
No 94
>PRK12523 RNA polymerase sigma factor; Reviewed
Probab=23.88 E-value=40 Score=24.25 Aligned_cols=27 Identities=15% Similarity=0.188 Sum_probs=21.0
Q ss_pred CCCCCHHHHHHHcCCCCCCCCChHHHHhh
Q psy3080 88 PPEMSVDDAYEALGLTRGSHHEENIVRKA 116 (118)
Q Consensus 88 p~~mS~edAy~vLGL~~~a~~se~~IRrA 116 (118)
-.++|.+|.-++||++.++ -..-+.||
T Consensus 133 ~~g~s~~EIA~~lgis~~t--V~~~l~ra 159 (172)
T PRK12523 133 LDGMGHAEIAERLGVSVSR--VRQYLAQG 159 (172)
T ss_pred HcCCCHHHHHHHHCCCHHH--HHHHHHHH
Confidence 3588999999999999887 55555554
No 95
>PRK09047 RNA polymerase factor sigma-70; Validated
Probab=23.82 E-value=40 Score=23.55 Aligned_cols=18 Identities=33% Similarity=0.597 Sum_probs=16.2
Q ss_pred CCCCHHHHHHHcCCCCCC
Q psy3080 89 PEMSVDDAYEALGLTRGS 106 (118)
Q Consensus 89 ~~mS~edAy~vLGL~~~a 106 (118)
.++|.+|.-++||++.++
T Consensus 121 ~g~s~~EIA~~lgis~~t 138 (161)
T PRK09047 121 EDMDVAETAAAMGCSEGS 138 (161)
T ss_pred hcCCHHHHHHHHCCCHHH
Confidence 488999999999999877
No 96
>cd08803 Death_ank3 Death domain of Ankyrin-3. Death Domain (DD) of the human protein ankyrin-3 (ANK-3) and related proteins. Ankyrins are modular proteins comprising three conserved domains, an N-terminal membrane-binding domain containing ANK repeats, a spectrin-binding domain and a C-terminal DD. ANK-3, also called anykyrin-G (for general or giant), is found in neurons and at least one splice variant has been shown to be essential for propagation of action potentials as a binding partner to neurofascin and voltage-gated sodium channels. It is required for maintaining axo-dendritic polarity, and may be a genetic risk factor associated with bipolar disorder. ANK-3 may also play roles in other cell types. Mutations affecting ANK-3 pathways for Na channel localization are associated with Brugada syndrome, a potentially fata arrythmia. In general, DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by se
Probab=23.81 E-value=75 Score=21.57 Aligned_cols=18 Identities=22% Similarity=0.228 Sum_probs=14.4
Q ss_pred ccHHHHHHHHHHHHHHHh
Q psy3080 67 NQPIMLLKDVLEAWKAEV 84 (118)
Q Consensus 67 ~dpv~lL~~lL~~Wr~E~ 84 (118)
+|+..+-..+|..|+..-
T Consensus 41 ~~~~~q~~~lL~~W~~r~ 58 (84)
T cd08803 41 NSLIAQSFMLLKKWVTRD 58 (84)
T ss_pred CCHHHHHHHHHHHHHHhh
Confidence 467788899999999863
No 97
>TIGR02999 Sig-70_X6 RNA polymerase sigma factor, TIGR02999 family. This group of sigma factors are members of the sigma-70 family (TIGR02937) and are found in a variety of species including Rhodopirellula baltica which encodes a paralogous group of five.
Probab=23.72 E-value=40 Score=24.21 Aligned_cols=27 Identities=26% Similarity=0.249 Sum_probs=20.4
Q ss_pred CCCCCHHHHHHHcCCCCCCCCChHHHHhh
Q psy3080 88 PPEMSVDDAYEALGLTRGSHHEENIVRKA 116 (118)
Q Consensus 88 p~~mS~edAy~vLGL~~~a~~se~~IRrA 116 (118)
-.+||.+|.-++||+++++ =..-+.||
T Consensus 148 ~~g~s~~EIA~~lgis~~t--Vk~~l~Ra 174 (183)
T TIGR02999 148 FAGLTVEEIAELLGVSVRT--VERDWRFA 174 (183)
T ss_pred HcCCCHHHHHHHhCCCHHH--HHHHHHHH
Confidence 3589999999999999877 44444443
No 98
>PF07766 LETM1: LETM1-like protein; InterPro: IPR011685 This is a group of mainly hypothetical eukaryotic proteins. Putative features found in LETM1, such as a transmembrane domain and a CK2 and PKC phosphorylation site [], are relatively conserved throughout the family. Deletion of LETM1 is thought to be involved in the development of Wolf-Hirschhorn syndrome in humans []. A member of this family, P91927 from SWISSPROT, is known to be expressed in the mitochondria of Drosophila melanogaster [], suggesting that this may be a group of mitochondrial proteins.; PDB: 3SKQ_A.
Probab=23.65 E-value=1.9e+02 Score=23.38 Aligned_cols=63 Identities=24% Similarity=0.271 Sum_probs=35.9
Q ss_pred HHHHHHhccCCCCCCCCccHHHHHHHHHHHHHHHhhc--------CCCCCC---HHHHHHHcCCCCCCCCChHHHHhh
Q psy3080 50 FYLRHLCDTAKFPDWPINQPIMLLKDVLEAWKAEVER--------KPPEMS---VDDAYEALGLTRGSHHEENIVRKA 116 (118)
Q Consensus 50 yYLr~lcd~~rfp~wpI~dpv~lL~~lL~~Wr~E~~k--------kp~~mS---~edAy~vLGL~~~a~~se~~IRrA 116 (118)
--|+++|.-.+.+.|+ |-.+++.-|..+-.++.. +..+|| ..+||..=|+.... .+++++|+.
T Consensus 174 ~~L~~L~r~~~l~~~~---~~~~lr~rL~~~~~~l~~dD~~i~~eGv~~Ls~~EL~~Ac~~RGl~~~~-~s~~~lr~~ 247 (268)
T PF07766_consen 174 PHLRALCRLLGLTPFG---PSSLLRRRLRKRLRYLKQDDRLIKREGVDSLSEEELQDACYERGLRSTG-LSEEELREW 247 (268)
T ss_dssp HHHHHHHHHTT----S---SHHHHHHHHHHHHHHHHHHHHHHHHH-GGGS-HHHHHHHHHHTT---TT---HHHHHHH
T ss_pred HHHHHHHHHhccCcCC---chHHHHHHHHHHHHHHHHHHHHHHHhccccCCHHHHHHHHHHhCCCcCC-CCHHHHHHH
Confidence 3499999999988875 456666666555544433 245777 56799999997632 388888764
No 99
>cd00864 PI3Ka Phosphoinositide 3-kinase family, accessory domain (PIK domain); PIK domain is conserved in PI3 and PI4-kinases. Its role is unclear, but it has been suggested to be involved in substrate presentation. Phosphoinositide 3-kinases play an important role in a variety of fundamental cellular processes and can be divided into three main classes, defined by their substrate specificity and domain architecture.
Probab=23.33 E-value=1.4e+02 Score=22.06 Aligned_cols=53 Identities=25% Similarity=0.399 Sum_probs=37.5
Q ss_pred cchhhhHHHHHH----HhccCCCCCCCCccHHHHHHHHHHHHHHHhhcCCCCCCHHHHHHHcCCC
Q psy3080 43 QEMFVNIFYLRH----LCDTAKFPDWPINQPIMLLKDVLEAWKAEVERKPPEMSVDDAYEALGLT 103 (118)
Q Consensus 43 ~elfC~~yYLr~----lcd~~rfp~wpI~dpv~lL~~lL~~Wr~E~~kkp~~mS~edAy~vLGL~ 103 (118)
+-+|=..+|+.. +-.-+++=+|...+-+..+..+|..| ..++..+|.+.|+-.
T Consensus 25 ~llw~~R~~~~~~p~~lp~~L~sv~w~~~~~~~e~~~lL~~W--------~~~~~~~aL~LL~~~ 81 (152)
T cd00864 25 ELLWKFRYYLLNVPKALPKLLKSVNWNDDEEVSELYQLLKWW--------APLSPEDALELLSPK 81 (152)
T ss_pred HHHHHHHHHHhhChHHHHHHHHHccCCCHHHHHHHHHHHhcC--------CCCCHHHHHHHcCCc
Confidence 345556666643 23334455887777788888999988 567899999999854
No 100
>TIGR03220 catechol_dmpE 2-oxopent-4-enoate hydratase. Members of this protein family are 2-oxopent-4-enoate hydratase, which is also called 2-hydroxypent-2,4-dienoate hydratase. It is closely related to another gene found in the same operon, 4-oxalocrotonate decarboxylase, with which it interacts closely.
Probab=23.25 E-value=1.1e+02 Score=24.42 Aligned_cols=29 Identities=24% Similarity=0.484 Sum_probs=18.2
Q ss_pred HHHHHHHHHHHHHhh-----cCCCCCCHHHHHHH
Q psy3080 71 MLLKDVLEAWKAEVE-----RKPPEMSVDDAYEA 99 (118)
Q Consensus 71 ~lL~~lL~~Wr~E~~-----kkp~~mS~edAy~v 99 (118)
.+-+.|+++|+.--. ...+.+|.+|||.|
T Consensus 4 ~~a~~L~~A~~~~~~~~~~~~~~~~~~~~dAyai 37 (255)
T TIGR03220 4 QLGDELYQALVTRTPVAPLTSRGPDISIEDAYRI 37 (255)
T ss_pred HHHHHHHHHHHHCCcCCCCCCCCCCCCHHHHHHH
Confidence 345567777776321 12357789999976
No 101
>TIGR02952 Sig70_famx2 RNA polymerase sigma-70 factor, TIGR02952 family. This group of sigma factors are members of the sigma-70 family (TIGR02937). They and appear by homology, tree building, bidirectional best hits and one-to-a-genome distribution, to represent a conserved family. This family is found in a limited number of Gram-positive bacterial lineages.
Probab=23.22 E-value=42 Score=23.62 Aligned_cols=27 Identities=19% Similarity=0.299 Sum_probs=20.2
Q ss_pred CCCCCHHHHHHHcCCCCCCCCChHHHHhh
Q psy3080 88 PPEMSVDDAYEALGLTRGSHHEENIVRKA 116 (118)
Q Consensus 88 p~~mS~edAy~vLGL~~~a~~se~~IRrA 116 (118)
-.+||.+|.-++||+++++ -...+.||
T Consensus 136 ~~g~s~~eIA~~l~is~~t--v~~~l~ra 162 (170)
T TIGR02952 136 GQNLPIAEVARILGKTEGA--VKILQFRA 162 (170)
T ss_pred hcCCCHHHHHHHHCCCHHH--HHHHHHHH
Confidence 3488999999999999877 44444443
No 102
>smart00345 HTH_GNTR helix_turn_helix gluconate operon transcriptional repressor.
Probab=23.20 E-value=71 Score=18.36 Aligned_cols=31 Identities=19% Similarity=0.205 Sum_probs=18.1
Q ss_pred HHHHHHHHhhcCCCCC-CHHHHHHHcCCCCCC
Q psy3080 76 VLEAWKAEVERKPPEM-SVDDAYEALGLTRGS 106 (118)
Q Consensus 76 lL~~Wr~E~~kkp~~m-S~edAy~vLGL~~~a 106 (118)
+...|....-..+..+ |..+-.+.+|++.++
T Consensus 5 l~~~i~~~~~~~~~~l~s~~~la~~~~vs~~t 36 (60)
T smart00345 5 LREDIVSGELRPGDKLPSERELAAQLGVSRTT 36 (60)
T ss_pred HHHHHHcCCCCCCCcCcCHHHHHHHHCCCHHH
Confidence 3444444322234456 888888999886543
No 103
>PF00784 MyTH4: MyTH4 domain; InterPro: IPR000857 The microtubule-based kinesin motors and actin-based myosin motors generate movements required for intracellular trafficking, cell division, and muscle contraction. In general, these proteins consist of a motor domain that generates movement and a tail region that varies widely from class to class and is thought to mediate many of the regulatory or cargo binding functions specific to each class of motor []. The Myosin Tail Homology 4 (MyTH4) domain has been identified as a conserved domain in the tail domains of several different unconventional myosins [] and a plant kinesin-like protein [], but has more recently been found in several non-motor proteins []. Although the function is not yet fully understood, there is an evidence that the MyTH4 domain of Myosin-X (Myo10) binds to microtubules and thus could provide a link between an actin-based motor protein and the microtubule cytoskeleton []. The MyTH4 domain is found in one or two copies associated with other domains, such as myosin head, kinesin motor, FERM, PH, SH3 and IQ. The domain is predicted to be largely alpha-helical, interrupted by three or four turns. The MyTH4 domain contains four highly conserved regions designated MGD (consensus sequence L(K/R)(F/Y)MGDhP, LRDE (consensus LRDEhYCQhhKQHxxxN), RGW (consensus RGWxLh), and ELEA (RxxPPSxhELEA), where h indicates a hydrophobic residue and x is any residue [].; GO: 0005856 cytoskeleton; PDB: 3AU5_A 3AU4_A 3PZD_A 3PVL_A.
Probab=23.14 E-value=20 Score=24.93 Aligned_cols=13 Identities=23% Similarity=0.700 Sum_probs=10.1
Q ss_pred cCccccccchhhh
Q psy3080 36 VRYPQLEQEMFVN 48 (118)
Q Consensus 36 i~YpeL~~elfC~ 48 (118)
+..|+|.||+||-
T Consensus 14 l~~~~LrDEiy~Q 26 (114)
T PF00784_consen 14 LENPELRDEIYCQ 26 (114)
T ss_dssp HH-CCHHHHHHHH
T ss_pred HcchhhHHHHHHH
Confidence 4678999999995
No 104
>PRK12525 RNA polymerase sigma factor; Provisional
Probab=22.99 E-value=43 Score=24.05 Aligned_cols=26 Identities=15% Similarity=0.165 Sum_probs=21.1
Q ss_pred CCCCHHHHHHHcCCCCCCCCChHHHHhh
Q psy3080 89 PEMSVDDAYEALGLTRGSHHEENIVRKA 116 (118)
Q Consensus 89 ~~mS~edAy~vLGL~~~a~~se~~IRrA 116 (118)
.++|.+|.-++||++.++ -...++||
T Consensus 133 eg~s~~EIA~~l~is~~t--V~~~l~ra 158 (168)
T PRK12525 133 EGLTYVEIGERLGVSLSR--IHQYMVEA 158 (168)
T ss_pred cCCCHHHHHHHHCCCHHH--HHHHHHHH
Confidence 378999999999999987 55666665
No 105
>TIGR02947 SigH_actino RNA polymerase sigma-70 factor, TIGR02947 family. This group of sigma factors are members of the sigma-70 family (TIGR02937). They and appear by homology, tree building, bidirectional best hits and (with the exception of a paralog in Thermobifida fusca YX) one-to-a-genome distribution, to represent a conserved family. This family is restricted to the Actinobacteria and each gene examined is followed by an anti-sigma factor in an apparent operon.
Probab=22.96 E-value=53 Score=24.04 Aligned_cols=19 Identities=16% Similarity=0.272 Sum_probs=17.2
Q ss_pred CCCCCHHHHHHHcCCCCCC
Q psy3080 88 PPEMSVDDAYEALGLTRGS 106 (118)
Q Consensus 88 p~~mS~edAy~vLGL~~~a 106 (118)
-.+||.+|.-++||+++++
T Consensus 145 ~~g~s~~EIA~~lgis~~t 163 (193)
T TIGR02947 145 VEGFAYKEIAEIMGTPIGT 163 (193)
T ss_pred hcCCCHHHHHHHHCCCHHH
Confidence 4589999999999999987
No 106
>PRK12530 RNA polymerase sigma factor; Provisional
Probab=22.93 E-value=43 Score=24.74 Aligned_cols=27 Identities=11% Similarity=0.169 Sum_probs=20.3
Q ss_pred CCCCCHHHHHHHcCCCCCCCCChHHHHhh
Q psy3080 88 PPEMSVDDAYEALGLTRGSHHEENIVRKA 116 (118)
Q Consensus 88 p~~mS~edAy~vLGL~~~a~~se~~IRrA 116 (118)
-.+||.+|.-++||+++++ =..-+.||
T Consensus 148 ~~g~s~~EIA~~lgis~~t--Vk~~l~RA 174 (189)
T PRK12530 148 YLELSSEQICQECDISTSN--LHVLLYRA 174 (189)
T ss_pred HcCCCHHHHHHHHCCCHHH--HHHHHHHH
Confidence 3589999999999999987 44444443
No 107
>PF04148 Erv26: Transmembrane adaptor Erv26; InterPro: IPR007277 Erv26 is an integral membrane protein that is packed into COPII vesicles and cycles between the ER and Golgi compartments. It directs pro-alkaline phosphatase into endoplasmic reticulum-derived COPII transport vesicles [].
Probab=22.65 E-value=1.1e+02 Score=24.64 Aligned_cols=31 Identities=35% Similarity=0.696 Sum_probs=25.4
Q ss_pred chhhhHHHHHHHhccCCCCCCCCccHHHHHHHHH
Q psy3080 44 EMFVNIFYLRHLCDTAKFPDWPINQPIMLLKDVL 77 (118)
Q Consensus 44 elfC~~yYLr~lcd~~rfp~wpI~dpv~lL~~lL 77 (118)
-++||..|+++| .+||-=.+.+|...+--++
T Consensus 73 si~s~~~Y~~~L---~~fP~i~ltsp~Fi~S~~l 103 (211)
T PF04148_consen 73 SIFSHLVYLRNL---RTFPFISLTSPSFILSCVL 103 (211)
T ss_pred HHHHHHHHHHHh---CCCCeeecCCHHHHHHHHH
Confidence 578999999999 5799999999966655544
No 108
>cd03565 VHS_Tom1 VHS domain family, Tom1 subfamily; The VHS domain is an essential part of Tom1 (Target of myb1 - retroviral oncogene) protein. The VHS domain has a superhelical structure similar to the structure of the ARM repeats and is present at the very N-termini of proteins. It is a right-handed superhelix of eight alpha helices. The VHS domain has been found in a number of proteins, some of which have been implicated in intracellular trafficking and sorting. The VHS domain of the Tom1 protein is essential for the negative regulation of Interleukin-1 and Tumor Necrosis Factor-induced signaling pathways.
Probab=22.53 E-value=1.3e+02 Score=22.01 Aligned_cols=31 Identities=23% Similarity=0.511 Sum_probs=23.3
Q ss_pred HHHHHHHHHHHHHHhhcCCCCCC-HHHHHHHcC
Q psy3080 70 IMLLKDVLEAWKAEVERKPPEMS-VDDAYEALG 101 (118)
Q Consensus 70 v~lL~~lL~~Wr~E~~kkp~~mS-~edAy~vLG 101 (118)
...+.+++..|..++...| +++ ..++|+.|-
T Consensus 103 k~kil~li~~W~~~f~~~~-~l~~i~~~y~~L~ 134 (141)
T cd03565 103 QEKVLALIQAWADAFRGSP-DLTGVVEVYEELK 134 (141)
T ss_pred HHHHHHHHHHHHHHhCCCc-cchHHHHHHHHHH
Confidence 4667789999999885444 444 899999874
No 109
>smart00581 PSP proline-rich domain in spliceosome associated proteins.
Probab=22.51 E-value=88 Score=20.14 Aligned_cols=21 Identities=48% Similarity=0.675 Sum_probs=16.2
Q ss_pred hcCCCCCCHHHHHHHcCCCCCC
Q psy3080 85 ERKPPEMSVDDAYEALGLTRGS 106 (118)
Q Consensus 85 ~kkp~~mS~edAy~vLGL~~~a 106 (118)
+++|+.+| ++-.+.||+++++
T Consensus 3 ~~kPG~lS-~~LR~ALG~~~~~ 23 (54)
T smart00581 3 HFKPGRIS-DELREALGLPPGQ 23 (54)
T ss_pred CccCCcCC-HHHHHHcCCCCCC
Confidence 35788888 4567889999875
No 110
>COG1745 Predicted metal-binding protein [General function prediction only]
Probab=22.48 E-value=1.9e+02 Score=20.59 Aligned_cols=50 Identities=16% Similarity=0.120 Sum_probs=35.7
Q ss_pred cHHHHHHHHHHHHHHHhhcCCCCC-CHHHHHHHcCCCCCCCC-ChHHHHhhc
Q psy3080 68 QPIMLLKDVLEAWKAEVERKPPEM-SVDDAYEALGLTRGSHH-EENIVRKAY 117 (118)
Q Consensus 68 dpv~lL~~lL~~Wr~E~~kkp~~m-S~edAy~vLGL~~~a~~-se~~IRrAY 117 (118)
|-.-+|..+|--.++.++...+.- ..=.+|+.||++|..-+ +.++=+.|=
T Consensus 4 eELi~LH~~l~~vkky~e~~~~~~n~~fk~YdeLnI~P~HIHrsK~eHK~AI 55 (94)
T COG1745 4 EELIQLHQLLVYVKKYFENEYGIDNEEFKEYDELNISPVHIHRSKAEHKAAI 55 (94)
T ss_pred HHHHHHHHHHHHHHHHHHhhcCCChHHHHHHHHcCCCchhhhhhHHHHHHHH
Confidence 445678888888888887753332 25579999999997633 777766663
No 111
>PF05179 CDC73: RNA pol II accessory factor, Cdc73 family; InterPro: IPR007852 Paf1 is an RNA polymerase II-associated protein in yeast, which defines a complex that is distinct from the Srb/Mediator holoenzyme. The Paf1 complex, which also contains Cdc73, Ctr9, Hpr1, Ccr4, Rtf1 and Leo1, is required for full expression of a subset of yeast genes, particularly those responsive to signals from the Pkc1/MAP kinase cascade. The complex appears to play an essential role in RNA elongation []. ; PDB: 3V46_A.
Probab=22.11 E-value=47 Score=27.25 Aligned_cols=19 Identities=32% Similarity=0.775 Sum_probs=14.9
Q ss_pred CCCCCCCCccHHHHHHHHH
Q psy3080 59 AKFPDWPINQPIMLLKDVL 77 (118)
Q Consensus 59 ~rfp~wpI~dpv~lL~~lL 77 (118)
--|-+||.+||+.+++.+.
T Consensus 204 WQFk~w~~~~P~~LF~~v~ 222 (273)
T PF05179_consen 204 WQFKGWPWSDPVELFQKVK 222 (273)
T ss_dssp GGGTTSSS-SHHHHHHHSE
T ss_pred hhhCCCCCCCHHHHhhcCc
Confidence 4588999999999988764
No 112
>cd03567 VHS_GGA VHS domain family, GGA subfamily; GGA (Golgi-localized, Gamma-ear-containing, Arf-binding) comprise a subfamily of ubiquitously expressed, monomeric, motif-binding cargo/clathrin adaptor proteins. The VHS domain has a superhelical structure similar to the structure of the ARM (Armadillo) repeats and is present at the N-termini of proteins. GGA proteins have a multidomain structure consisting of an N-terminal VHS domain linked by a short proline-rich linker to a GAT (GGA and TOM) domain, which is followed by a long flexible linker to the C-terminal appendage, GAE (gamma-adaptin ear) domain. The VHS domain of GGA proteins binds to the acidic-cluster dileucine (DxxLL) motif found on the cytoplasmic tails of cargo proteins trafficked between the trans-Golgi network and the endosomal system.
Probab=22.00 E-value=1.1e+02 Score=22.47 Aligned_cols=31 Identities=23% Similarity=0.546 Sum_probs=22.9
Q ss_pred cHHHHHHHHHHHHHHHhhcCCCCCCHHHHHHHcC
Q psy3080 68 QPIMLLKDVLEAWKAEVERKPPEMSVDDAYEALG 101 (118)
Q Consensus 68 dpv~lL~~lL~~Wr~E~~kkp~~mS~edAy~vLG 101 (118)
.-..-+-+++..|..++.+.|. ..+||..|-
T Consensus 102 ~Vk~kil~li~~W~~~f~~~p~---~~~~Y~~Lk 132 (139)
T cd03567 102 KVKTKIIELLYSWTLELPHEPK---IKEAYDMLK 132 (139)
T ss_pred HHHHHHHHHHHHHHHHhcccch---HHHHHHHHH
Confidence 3345677889999998864444 789999884
No 113
>PRK13502 transcriptional activator RhaR; Provisional
Probab=21.79 E-value=1.5e+02 Score=22.78 Aligned_cols=61 Identities=8% Similarity=0.028 Sum_probs=43.0
Q ss_pred cCccccccchhhhHHHHHHHhcc-CCCCCCCCccHHHHHHHHHHHHHHHhhc---CCCCCCHHHHHHHcCCCCCC
Q psy3080 36 VRYPQLEQEMFVNIFYLRHLCDT-AKFPDWPINQPIMLLKDVLEAWKAEVER---KPPEMSVDDAYEALGLTRGS 106 (118)
Q Consensus 36 i~YpeL~~elfC~~yYLr~lcd~-~rfp~wpI~dpv~lL~~lL~~Wr~E~~k---kp~~mS~edAy~vLGL~~~a 106 (118)
++=.+|.+.+-++..||.+++.+ .|.+- . +.+..-|-+-.+ ..+.+|+.|.-..+|.+..+
T Consensus 193 ~~~~~lA~~~~iS~~~L~r~fk~~~G~t~------~----~yi~~~Rl~~A~~lL~~t~~sI~eIA~~~GF~d~s 257 (282)
T PRK13502 193 FALDAFCQQEQCSERVLRQQFRAQTGMTI------N----QYLRQVRICHAQYLLQHSPLMISEISMQCGFEDSN 257 (282)
T ss_pred CCHHHHHHHHCcCHHHHHHHHHHHHCcCH------H----HHHHHHHHHHHHHHHHcCCCCHHHHHHHcCCCCHH
Confidence 45588999999999999999986 55442 2 333333333222 25689999999999999755
No 114
>PF04405 ScdA_N: Domain of Unknown function (DUF542) ; InterPro: IPR007500 This is a domain of unknown function found at the N terminus of genes involved in cell wall development and nitrous oxide protection. ScdA is required for normal cell growth and development; mutants have an increased level of peptidoglycan cross-linking and aberrant cellular morphology suggesting a role for ScdA in cell wall metabolism []. NorA1, NorA2, and YtfE are involved in the nitrous oxide response. NorA1 and NorA2, which are similar to YtfE, are co-transcribed with the membrane-bound nitrous oxide (NO) reductases. The genes appear to be involved in NO protection but their function is unknown [, ].
Probab=21.77 E-value=49 Score=20.96 Aligned_cols=19 Identities=26% Similarity=0.316 Sum_probs=16.8
Q ss_pred CCCCCHHHHHHHcCCCCCC
Q psy3080 88 PPEMSVDDAYEALGLTRGS 106 (118)
Q Consensus 88 p~~mS~edAy~vLGL~~~a 106 (118)
+|.-|+++|++.-||+++.
T Consensus 29 gG~~~L~eA~~~~~ld~~~ 47 (56)
T PF04405_consen 29 GGNRSLEEACEEKGLDPEE 47 (56)
T ss_pred CCCchHHHHHHHcCCCHHH
Confidence 7788999999999998765
No 115
>TIGR02689 ars_reduc_gluta arsenate reductase, glutathione/glutaredoxin type. Members of this protein family represent a novel form of arsenate reductase, using glutathione and glutaredoxin rather than thioredoxin for reducing equivalents as do some homologous arsenate reductases. An example of this type is Synechocystis sp. strain PCC 6803 slr0946, and of latter type (excluded from this model) is Staphylococcus aureus plasmid pI258 ArsC. Both are among the subset of arsenate reductases that belong the the low-molecular-weight protein-tyrosine phosphatase superfamily.
Probab=21.71 E-value=1.1e+02 Score=21.49 Aligned_cols=11 Identities=36% Similarity=0.954 Sum_probs=8.4
Q ss_pred CCCCCCCccHH
Q psy3080 60 KFPDWPINQPI 70 (118)
Q Consensus 60 rfp~wpI~dpv 70 (118)
.+-.|+|.||.
T Consensus 94 ~~~~w~i~DP~ 104 (126)
T TIGR02689 94 IFEDWQLEDPD 104 (126)
T ss_pred eeecCCCCCCC
Confidence 34579999994
No 116
>cd08804 Death_ank2 Death domain of Ankyrin-2. Death Domain (DD) of Ankyrin-2 (ANK-2) and related proteins. Ankyrins are modular proteins comprising three conserved domains, an N-terminal membrane-binding domain containing ANK repeats, a spectrin-binding domain and a C-terminal DD. ANK-2, also called ankyrin-B (for broadly expressed), is required for proper function of the Na/Ca ion exchanger-1 in cardiomyocytes, and is thought to function in linking integral membrane proteins to the underlying cytoskeleton. Human ANK-2 is associated with "Ankyrin-B syndrome", an atypical arrythmia disorder with risk of sudden cardiac death. It also plays key roles in the brain and striated muscle. Loss of ANK-2 is associated with significant nervous system defects and sarcomere disorganization. In general, DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other
Probab=21.51 E-value=72 Score=21.45 Aligned_cols=46 Identities=15% Similarity=0.258 Sum_probs=26.7
Q ss_pred HHHhccCCCCCCCCcc-----H---HHHHHHHHHHHHHHhhcCCCCCC---HHHHHHHcC
Q psy3080 53 RHLCDTAKFPDWPINQ-----P---IMLLKDVLEAWKAEVERKPPEMS---VDDAYEALG 101 (118)
Q Consensus 53 r~lcd~~rfp~wpI~d-----p---v~lL~~lL~~Wr~E~~kkp~~mS---~edAy~vLG 101 (118)
..|+.++||+.=.|.. | ..+-.++|..|+.. ++..-| ...|+..+|
T Consensus 19 k~LAr~Lg~se~dI~~i~~~~~~~~~eq~~~mL~~W~~r---~g~~At~~~L~~aL~~i~ 75 (84)
T cd08804 19 TELARELDFTEEQIHQIRIENPNSLQDQSHALLKYWLER---DGKHATDTNLMKCLTKIN 75 (84)
T ss_pred HHHHHHcCCCHHHHHHHHHHCcccHHHHHHHHHHHHHHc---cCCCchHHHHHHHHHHcC
Confidence 3577778887755554 4 34555666777664 443333 555655544
No 117
>PRK12531 RNA polymerase sigma factor; Provisional
Probab=21.46 E-value=54 Score=24.16 Aligned_cols=26 Identities=19% Similarity=0.289 Sum_probs=19.8
Q ss_pred CCCCHHHHHHHcCCCCCCCCChHHHHhh
Q psy3080 89 PEMSVDDAYEALGLTRGSHHEENIVRKA 116 (118)
Q Consensus 89 ~~mS~edAy~vLGL~~~a~~se~~IRrA 116 (118)
.++|.+|.-++||++.++ -..-++||
T Consensus 156 eg~s~~EIA~~lgis~~t--Vk~rl~ra 181 (194)
T PRK12531 156 EELPHQQVAEMFDIPLGT--VKSRLRLA 181 (194)
T ss_pred cCCCHHHHHHHhCcCHHH--HHHHHHHH
Confidence 478999999999999887 44444443
No 118
>cd05782 DNA_polB_like1_exo Uncharacterized bacterial subgroup of the DEDDy 3'-5' exonuclease domain of family-B DNA polymerases. A subfamily of the 3'-5' exonuclease domain of family-B DNA polymerases. This subfamily is composed of uncharacterized bacterial family-B DNA polymerases. Family-B DNA polymerases contain an N-terminal DEDDy DnaQ-like exonuclease domain in the same polypeptide chain as the polymerase domain, similar to family-A DNA polymerases. This exonuclease domain contains three sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. These motifs are involved in metal binding and catalysis. The exonuclease domain of family-B DNA polymerases has a fundamental role in proofreading activity. It contains a beta hairpin structure that plays an important role in active site switching in the event of a nucleotide misincorporation. Family-B DNA polymerases are predominantly involved in DNA replication and DNA repair.
Probab=21.19 E-value=1.5e+02 Score=22.81 Aligned_cols=41 Identities=17% Similarity=0.167 Sum_probs=26.7
Q ss_pred HHHHHHHHHhhcCCCCCCHHHHHHHcCCCCCCCCChHHHHhhc
Q psy3080 75 DVLEAWKAEVERKPPEMSVDDAYEALGLTRGSHHEENIVRKAY 117 (118)
Q Consensus 75 ~lL~~Wr~E~~kkp~~mS~edAy~vLGL~~~a~~se~~IRrAY 117 (118)
+++..|+... +..++|.+...+.||++.-...+-++|-+.|
T Consensus 143 DL~~~~~~~~--~~~~~~L~~va~~lG~~~K~d~~G~~v~~~y 183 (208)
T cd05782 143 DLMDLLAFYG--ARARASLDLLAKLLGIPGKMDVDGSQVWELY 183 (208)
T ss_pred cHHHHHhccC--ccCCCCHHHHHHHhCCCCCcCCCHHHHHHHH
Confidence 4445565421 2468899999999999543334666776665
No 119
>PF09536 DUF2378: Protein of unknown function (DUF2378); InterPro: IPR011751 This family consists of a set of at least 17 paralogous proteins in Myxococcus xanthus (strain DK 1622). Members are about 200 amino acids in length. No other homologs are known; the function is unknown.
Probab=21.18 E-value=1.7e+02 Score=22.49 Aligned_cols=37 Identities=30% Similarity=0.575 Sum_probs=24.9
Q ss_pred CCCCCCCccHHHHHHHHHHHHHHHhhcCCCCCCHHHHHHHcCCC
Q psy3080 60 KFPDWPINQPIMLLKDVLEAWKAEVERKPPEMSVDDAYEALGLT 103 (118)
Q Consensus 60 rfp~wpI~dpv~lL~~lL~~Wr~E~~kkp~~mS~edAy~vLGL~ 103 (118)
-||++|+ ..+..++..=..++ .|. ++.++|+..||-.
T Consensus 40 ~~~~YP~----~~flr~l~~aa~~l--~p~-~~~e~a~r~lG~~ 76 (178)
T PF09536_consen 40 LFPAYPL----EVFLRLLAAAAREL--YPG-LPREEAYRRLGRR 76 (178)
T ss_pred ccccCCH----HHHHHHHHHHHHHH--cCC-CCHHHHHHHHHHH
Confidence 3556665 66666666666665 344 4999999999854
No 120
>cd07923 Gallate_dioxygenase_C The C-terminal domain of Gallate Dioxygenase, which catalyzes the oxidization and subsequent ring-opening of gallate. Gallate Dioxygenase catalyzes the oxidization and subsequent ring-opening of gallate, an intermediate in the degradation of the aromatic compound, syringate. The reaction product of gallate dioxygenase is 4-oxalomesaconate. The amino acid sequence of the N-terminal and C-terminal regions of gallate dioxygenase exhibits homology with the sequence of the PCA 4,5-dioxygenase B (catalytic) and A subunits, respectively. This model represents the C-terminal domain, which is similar to the A subunit of PCA 4,5-dioxygenase (or LigAB). The enzyme is estimated to be a homodimer according to the Escherichia coli enzyme. Since enzymes in this subfamily have fused A and B subunits, the dimer interface may resemble the tetramer interface of classical LigAB enzymes. This enzyme belongs to the class III extradiol dioxygenase family, composed of enzymes whi
Probab=21.14 E-value=50 Score=23.42 Aligned_cols=18 Identities=22% Similarity=0.246 Sum_probs=16.6
Q ss_pred hhhHHHHHHHhccCCCCC
Q psy3080 46 FVNIFYLRHLCDTAKFPD 63 (118)
Q Consensus 46 fC~~yYLr~lcd~~rfp~ 63 (118)
=+|+|||-+|+-.+|.|+
T Consensus 57 G~n~y~L~K~a~~~G~~~ 74 (94)
T cd07923 57 GVIFFVLEKLAAVVGVSN 74 (94)
T ss_pred cCcHHHHHHHHHHcCCCH
Confidence 479999999999999998
No 121
>PF04355 SmpA_OmlA: SmpA / OmlA family; InterPro: IPR007450 This is a bacterial outer membrane lipoprotein, possibly involved in maintaining the structural integrity of the cell envelope []. The lipid attachment site is a conserved N-terminal cysteine residue sometimes found adjacent to the OmpA domain (IPR006665 from INTERPRO).; GO: 0019867 outer membrane; PDB: 4DM5_C 2PXG_A 2YH9_B 2KXX_A 2KM7_A.
Probab=21.12 E-value=63 Score=20.34 Aligned_cols=17 Identities=35% Similarity=0.479 Sum_probs=13.1
Q ss_pred CCCCCCHHHHHHHcCCCC
Q psy3080 87 KPPEMSVDDAYEALGLTR 104 (118)
Q Consensus 87 kp~~mS~edAy~vLGL~~ 104 (118)
+++ ||.+|-..+||=+.
T Consensus 13 ~~G-mTk~qV~~lLG~P~ 29 (71)
T PF04355_consen 13 KPG-MTKDQVRALLGSPS 29 (71)
T ss_dssp -TT-SBHHHHHHHHTS-S
T ss_pred cCC-CCHHHHHHhcCCCC
Confidence 455 99999999999554
No 122
>PRK15121 right oriC-binding transcriptional activator; Provisional
Probab=21.00 E-value=2.9e+02 Score=21.76 Aligned_cols=62 Identities=5% Similarity=0.038 Sum_probs=44.0
Q ss_pred ccCccccccchhhhHHHHHHHhccC-CCCCCCCccHHHHHHHHHHHHHHHhhc---CCCCCCHHHHHHHcCCCCCC
Q psy3080 35 AVRYPQLEQEMFVNIFYLRHLCDTA-KFPDWPINQPIMLLKDVLEAWKAEVER---KPPEMSVDDAYEALGLTRGS 106 (118)
Q Consensus 35 ~i~YpeL~~elfC~~yYLr~lcd~~-rfp~wpI~dpv~lL~~lL~~Wr~E~~k---kp~~mS~edAy~vLGL~~~a 106 (118)
+++=.+|++.+-++.++|.++-... |.+ +.+.+...|-+..+ ..+++|+.|.-..+|.+..+
T Consensus 21 ~~~l~~lA~~~~~S~~~l~r~F~~~~g~s----------~~~yi~~~Rl~~A~~~L~~~~~~i~~iA~~~Gf~s~~ 86 (289)
T PRK15121 21 PLSLDNVAAKAGYSKWHLQRMFKDVTGHA----------IGAYIRARRLSKAAVALRLTSRPILDIALQYRFDSQQ 86 (289)
T ss_pred CCCHHHHHHHHCcCHHHHHHHHHHHHCcC----------HHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHCCCCHH
Confidence 3566889999999999999887764 433 23455555554433 24679999999999998644
No 123
>PRK00423 tfb transcription initiation factor IIB; Reviewed
Probab=21.00 E-value=39 Score=27.74 Aligned_cols=15 Identities=13% Similarity=0.266 Sum_probs=12.3
Q ss_pred HHHHHHHhccCCCCC
Q psy3080 49 IFYLRHLCDTAKFPD 63 (118)
Q Consensus 49 ~yYLr~lcd~~rfp~ 63 (118)
.-|+..+|+.++.|+
T Consensus 220 ~~~i~r~~~~L~L~~ 234 (310)
T PRK00423 220 IDYVPRFASELGLSG 234 (310)
T ss_pred HHHHHHHHHHcCCCH
Confidence 368889999998887
No 124
>PRK12537 RNA polymerase sigma factor; Provisional
Probab=21.00 E-value=73 Score=23.16 Aligned_cols=19 Identities=21% Similarity=0.211 Sum_probs=17.1
Q ss_pred CCCCCHHHHHHHcCCCCCC
Q psy3080 88 PPEMSVDDAYEALGLTRGS 106 (118)
Q Consensus 88 p~~mS~edAy~vLGL~~~a 106 (118)
-.+||.+|.-+.||++.++
T Consensus 147 ~~~~s~~eIA~~lgis~~t 165 (182)
T PRK12537 147 VDGCSHAEIAQRLGAPLGT 165 (182)
T ss_pred HcCCCHHHHHHHHCCChhh
Confidence 3588999999999999988
No 125
>PRK15044 transcriptional regulator SirC; Provisional
Probab=20.97 E-value=2.2e+02 Score=23.89 Aligned_cols=62 Identities=8% Similarity=0.095 Sum_probs=47.4
Q ss_pred ccCccccccchhhhHHHHHHHhccCCCCCCCCccHHHHHHHHHHHHHHHhhcC---CCCCCHHHHHHHcCCCCCC
Q psy3080 35 AVRYPQLEQEMFVNIFYLRHLCDTAKFPDWPINQPIMLLKDVLEAWKAEVERK---PPEMSVDDAYEALGLTRGS 106 (118)
Q Consensus 35 ~i~YpeL~~elfC~~yYLr~lcd~~rfp~wpI~dpv~lL~~lL~~Wr~E~~kk---p~~mS~edAy~vLGL~~~a 106 (118)
+++-.+|.+++..+..+|++.....+.. ..+.+..+|-+..++ .+.+|+.|.-..+|.+..+
T Consensus 208 ~~SLeeLA~~lgmS~~tL~R~Fk~eg~T----------~~~y~~~~RL~~A~~LL~~t~~sIseIA~~~GFss~S 272 (295)
T PRK15044 208 KWSQAEVAGKLFMSVSSLKRKLAAEEVS----------FSKIYLDARMNQAIKLLRMGAGNISQVATMCGYDTPS 272 (295)
T ss_pred CCCHHHHHHHhCCCHHHHHHHHHHcCCC----------HHHHHHHHHHHHHHHHHHcCCCCHHHHHHHhCCCChH
Confidence 5788899999999999999988775322 224445666665553 6789999999999999754
No 126
>PRK09393 ftrA transcriptional activator FtrA; Provisional
Probab=20.91 E-value=2.4e+02 Score=22.47 Aligned_cols=62 Identities=13% Similarity=0.095 Sum_probs=42.1
Q ss_pred ccCccccccchhhhHHHHHHHhccC-CCCCCCCccHHHHHHHHHHHHHHHhhcC---CCCCCHHHHHHHcCCCCCC
Q psy3080 35 AVRYPQLEQEMFVNIFYLRHLCDTA-KFPDWPINQPIMLLKDVLEAWKAEVERK---PPEMSVDDAYEALGLTRGS 106 (118)
Q Consensus 35 ~i~YpeL~~elfC~~yYLr~lcd~~-rfp~wpI~dpv~lL~~lL~~Wr~E~~kk---p~~mS~edAy~vLGL~~~a 106 (118)
+++=.+|...+-.+.-||.+++.+. |.+- ++.+...|-+..++ .+.+|+.|.-..+|.+..+
T Consensus 234 ~~sl~~lA~~~~~S~~~l~r~fk~~~g~s~----------~~~~~~~Rl~~A~~lL~~~~~~i~~IA~~~Gf~~~s 299 (322)
T PRK09393 234 PHTVASLAARAAMSPRTFLRRFEAATGMTP----------AEWLLRERLARARDLLESSALSIDQIAERAGFGSEE 299 (322)
T ss_pred CCCHHHHHHHHCcCHHHHHHHHHHHHCcCH----------HHHHHHHHHHHHHHHHHcCCCCHHHHHHHhCCCCHH
Confidence 3566788888888888888888763 4432 34444445444332 4578999999999998644
No 127
>PF14475 Mso1_Sec1_bdg: Sec1-binding region of Mso1
Probab=20.70 E-value=70 Score=19.55 Aligned_cols=13 Identities=31% Similarity=0.950 Sum_probs=10.1
Q ss_pred HHHHHHHhccCCCCCC
Q psy3080 49 IFYLRHLCDTAKFPDW 64 (118)
Q Consensus 49 ~yYLr~lcd~~rfp~w 64 (118)
.||-.+ ..+||+|
T Consensus 28 ~yY~~k---~~~~P~W 40 (41)
T PF14475_consen 28 KYYTEK---GRPFPGW 40 (41)
T ss_pred HHHHHc---CCCCCCc
Confidence 477776 6789998
No 128
>PRK12522 RNA polymerase sigma factor; Provisional
Probab=20.63 E-value=64 Score=23.10 Aligned_cols=19 Identities=21% Similarity=0.272 Sum_probs=16.7
Q ss_pred CCCCCHHHHHHHcCCCCCC
Q psy3080 88 PPEMSVDDAYEALGLTRGS 106 (118)
Q Consensus 88 p~~mS~edAy~vLGL~~~a 106 (118)
=.++|.+|.-+.||+++++
T Consensus 133 ~~~~s~~EIA~~lgis~~t 151 (173)
T PRK12522 133 YEQYSYKEMSEILNIPIGT 151 (173)
T ss_pred HcCCCHHHHHHHhCCCHHH
Confidence 3578999999999999877
No 129
>PF08891 YfcL: YfcL protein; InterPro: IPR014987 This group of proteins are functionally uncharacterised. They are related to the short YfcL protein from Escherichia coli.
Probab=20.33 E-value=2.2e+02 Score=19.78 Aligned_cols=56 Identities=16% Similarity=0.187 Sum_probs=30.3
Q ss_pred ccchhhhHHHHHHHhccC-CCCCCCCccHHHHHHHHHHHHHHHhhcCCCCCCHHHHHHHcC
Q psy3080 42 EQEMFVNIFYLRHLCDTA-KFPDWPINQPIMLLKDVLEAWKAEVERKPPEMSVDDAYEALG 101 (118)
Q Consensus 42 ~~elfC~~yYLr~lcd~~-rfp~wpI~dpv~lL~~lL~~Wr~E~~kkp~~mS~edAy~vLG 101 (118)
++|||-++|=..||.=.. ..-.-...+ +.++-.+...-+++-.+.+|..|.--|.+
T Consensus 21 DDeLFA~GYLrGH~~lava~~E~~~~~~----~~~l~~~v~~sL~~A~~ELsp~Dq~lV~~ 77 (85)
T PF08891_consen 21 DDELFASGYLRGHFTLAVAELEQEGEHS----LEALKARVEASLEKAAGELSPADQALVNN 77 (85)
T ss_pred hHHHHHHHHHhHHHHHHHHHHhhcCCCC----HHHHHHHHHHHHHHHHccCCHHHHHHHHH
Confidence 589999999888874221 001111222 34444445554444466667666655544
No 130
>TIGR03377 glycerol3P_GlpA glycerol-3-phosphate dehydrogenase, anaerobic, A subunit. Members of this protein family are the A subunit, product of the glpA gene, of a three-subunit, membrane-anchored, FAD-dependent anaerobic glycerol-3-phosphate dehydrogenase.
Probab=20.25 E-value=1e+02 Score=26.53 Aligned_cols=36 Identities=25% Similarity=0.249 Sum_probs=31.6
Q ss_pred hhhhHHHHHHHhccCCCCCCCCccHHHHHHHHHH-HHHHH
Q psy3080 45 MFVNIFYLRHLCDTAKFPDWPINQPIMLLKDVLE-AWKAE 83 (118)
Q Consensus 45 lfC~~yYLr~lcd~~rfp~wpI~dpv~lL~~lL~-~Wr~E 83 (118)
.||..--...+++++ .|+..+...+|+++|+ +|++.
T Consensus 449 ~~c~~~~~~~~~~~~---~~~~~~~~~~l~~f~~~rw~g~ 485 (516)
T TIGR03377 449 EFCAYRAAGLLSREG---LIDPEQSTELLREFLEERWKGI 485 (516)
T ss_pred chHHHHHHHHHHHhh---CCChhhhHHHHHHHHHhhhcCc
Confidence 489999999999988 7888899999999998 89985
No 131
>PRK10219 DNA-binding transcriptional regulator SoxS; Provisional
Probab=20.23 E-value=2.7e+02 Score=18.41 Aligned_cols=61 Identities=15% Similarity=0.195 Sum_probs=42.6
Q ss_pred cCccccccchhhhHHHHHHHhccC-CCCCCCCccHHHHHHHHHHHHHHHhhc---CCCCCCHHHHHHHcCCCCCC
Q psy3080 36 VRYPQLEQEMFVNIFYLRHLCDTA-KFPDWPINQPIMLLKDVLEAWKAEVER---KPPEMSVDDAYEALGLTRGS 106 (118)
Q Consensus 36 i~YpeL~~elfC~~yYLr~lcd~~-rfp~wpI~dpv~lL~~lL~~Wr~E~~k---kp~~mS~edAy~vLGL~~~a 106 (118)
++=.+|...+-++..||.+++.+. |.+ +.+.+...|-+-.+ ..+++|+.|+-.-+|.+..+
T Consensus 22 ~~~~~lA~~~~~S~~~l~r~f~~~~g~s----------~~~~i~~~Rl~~a~~~L~~~~~~i~~iA~~~Gf~~~s 86 (107)
T PRK10219 22 LNIDVVAKKSGYSKWYLQRMFRTVTHQT----------LGDYIRQRRLLLAAVELRTTERPIFDIAMDLGYVSQQ 86 (107)
T ss_pred CCHHHHHHHHCCCHHHHHHHHHHHHCcC----------HHHHHHHHHHHHHHHHHHccCCCHHHHHHHHCCCCHH
Confidence 455899999999999998887774 432 23444444433333 24568999999999999644
No 132
>PF14769 CLAMP: Flagellar C1a complex subunit C1a-32
Probab=20.16 E-value=1.5e+02 Score=20.22 Aligned_cols=43 Identities=19% Similarity=0.305 Sum_probs=22.2
Q ss_pred HHHhccCCCCCCCCccHHHHHHHHHHHHHHHhhcCCCCCCHHHHHHHcC
Q psy3080 53 RHLCDTAKFPDWPINQPIMLLKDVLEAWKAEVERKPPEMSVDDAYEALG 101 (118)
Q Consensus 53 r~lcd~~rfp~wpI~dpv~lL~~lL~~Wr~E~~kkp~~mS~edAy~vLG 101 (118)
-.+|.+.|||--.+.-=+.+++.+++ +-+ + +.+|.+|+++.|+
T Consensus 4 ~~Fa~~~~fs~~q~s~~~~i~~~ll~---~~i--~-~~~~~~~~~~~fk 46 (101)
T PF14769_consen 4 FLFAKEQGFSWEQTSAFLSILKELLE---KNI--E-KGMSLEDSFKYFK 46 (101)
T ss_pred hHhHhhCCCCHHHHHHHHHHHHHHHH---HHH--H-ccCCHHHHHHHHH
Confidence 34677788877222222222222222 122 2 6778888887664
No 133
>cd00093 HTH_XRE Helix-turn-helix XRE-family like proteins. Prokaryotic DNA binding proteins belonging to the xenobiotic response element family of transcriptional regulators.
Probab=20.14 E-value=72 Score=17.00 Aligned_cols=13 Identities=23% Similarity=0.473 Sum_probs=7.8
Q ss_pred CHHHHHHHcCCCC
Q psy3080 92 SVDDAYEALGLTR 104 (118)
Q Consensus 92 S~edAy~vLGL~~ 104 (118)
....-++.||+++
T Consensus 43 ~~~~i~~~~~~~~ 55 (58)
T cd00093 43 TLEKLAKALGVSL 55 (58)
T ss_pred HHHHHHHHhCCCh
Confidence 4555666667654
No 134
>PF00382 TFIIB: Transcription factor TFIIB repeat; InterPro: IPR013150 Cyclins are eukaryotic proteins that play an active role in controlling nuclear cell division cycles [], and regulate cyclin dependent kinases (CDKs). Cyclins, together with the p34 (cdc2) or cdk2 kinases, form the Maturation Promoting Factor (MPF). There are two main groups of cyclins, G1/S cyclins, which are essential for the control of the cell cycle at the G1/S (start) transition, and G2/M cyclins, which are essential for the control of the cell cycle at the G2/M (mitosis) transition. G2/M cyclins accumulate steadily during G2 and are abruptly destroyed as cells exit from mitosis (at the end of the M-phase). In most species, there are multiple forms of G1 and G2 cyclins. For example, in vertebrates, there are two G2 cyclins, A and B, and at least three G1 cyclins, C, D, and E. Cyclin homologues have been found in various viruses, including Saimiriine herpesvirus 2 (Herpesvirus saimiri) and Human herpesvirus 8 (HHV-8) (Kaposi's sarcoma-associated herpesvirus). These viral homologues differ from their cellular counterparts in that the viral proteins have gained new functions and eliminated others to harness the cell and benefit the virus []. In eukaryotes, transcription initiation of all protein encoding genes involves the polymerase II system. This sytem is modulated by both general and specific transcription factors. The general factors (which include TFIIA, TFIIB, TFIID, TFIIE, TFIIF, TFIIG and TFIIH) operate through common promoter elements, such as the TATA box. Transcription factor IIB (TFIIB) is of central importance in transcription of class II genes. It associates with TFIID-TFIIA bound to DNA (the DA complex) to form a ternary TFIID-IIA-IBB (DAB) complex, which is recognised by RNA polymerase II [, ]. TFIIB comprises ~315-340 residues and contains an imperfect C-terminal repeat of a 75-residue domain that may contribute to the symmetry of the folded protein. The basal archaeal transcription machinery resembles that of the eukaryotic polymerase II system and includes a homologue of TFIIB []. This entry represents a cyclin-like domain which is found repeated in the C-terminal region of a variety of eukaryotic TFIIB's and their archaeal counterparts. These domains individually form the typical cyclin fold, and in the transcription complex they straddle the C-terminal region of the TATA-binding protein - an interaction essential for the formation of the transcription initiation complex [, ].; GO: 0003743 translation initiation factor activity, 0006413 translational initiation; PDB: 2PHG_A 1C9B_Q 1TFB_A 1VOL_A 3K7A_M 1AIS_B 1D3U_B.
Probab=20.07 E-value=2.4e+02 Score=17.61 Aligned_cols=12 Identities=17% Similarity=0.761 Sum_probs=8.3
Q ss_pred HHHHhccCCCCC
Q psy3080 52 LRHLCDTAKFPD 63 (118)
Q Consensus 52 Lr~lcd~~rfp~ 63 (118)
+.++|+.+++|+
T Consensus 1 I~r~~~~L~L~~ 12 (71)
T PF00382_consen 1 IPRICSKLGLPE 12 (71)
T ss_dssp HHHHHHHTT--H
T ss_pred ChHHHhHcCCCH
Confidence 467899999998
No 135
>PRK09637 RNA polymerase sigma factor SigZ; Provisional
Probab=20.01 E-value=78 Score=23.25 Aligned_cols=19 Identities=26% Similarity=0.317 Sum_probs=17.1
Q ss_pred CCCCCHHHHHHHcCCCCCC
Q psy3080 88 PPEMSVDDAYEALGLTRGS 106 (118)
Q Consensus 88 p~~mS~edAy~vLGL~~~a 106 (118)
-.+||.+|.-++||+++++
T Consensus 120 ~~g~~~~EIA~~lgis~~t 138 (181)
T PRK09637 120 LEGLSQKEIAEKLGLSLSG 138 (181)
T ss_pred hcCCCHHHHHHHhCCCHHH
Confidence 5689999999999999876
Done!