Query psy14923
Match_columns 180
No_of_seqs 187 out of 1489
Neff 7.6
Searched_HMMs 46136
Date Fri Aug 16 15:50:08 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy14923.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/14923hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PTZ00199 high mobility group p 99.9 3.8E-23 8.3E-28 145.7 11.3 83 35-117 9-93 (94)
2 cd01389 MATA_HMG-box MATA_HMG- 99.8 1.9E-20 4.2E-25 127.1 8.5 72 48-119 1-72 (77)
3 COG5648 NHP6B Chromatin-associ 99.8 3.4E-20 7.3E-25 145.2 8.6 85 37-121 59-143 (211)
4 cd01388 SOX-TCF_HMG-box SOX-TC 99.8 6.8E-20 1.5E-24 122.9 8.6 69 49-117 2-70 (72)
5 PF00505 HMG_box: HMG (high mo 99.8 1.6E-19 3.6E-24 119.4 9.4 69 49-117 1-69 (69)
6 cd01390 HMGB-UBF_HMG-box HMGB- 99.8 4.3E-19 9.2E-24 116.4 9.4 65 49-113 1-65 (66)
7 smart00398 HMG high mobility g 99.8 7.6E-19 1.7E-23 116.0 9.6 70 48-117 1-70 (70)
8 PF09011 HMG_box_2: HMG-box do 99.8 7.1E-19 1.5E-23 118.3 9.6 72 46-117 1-73 (73)
9 cd00084 HMG-box High Mobility 99.7 3E-17 6.5E-22 107.1 9.4 65 49-113 1-65 (66)
10 KOG0381|consensus 99.7 8.2E-17 1.8E-21 113.3 11.2 76 45-120 17-95 (96)
11 KOG0527|consensus 99.7 1E-16 2.2E-21 135.2 9.7 79 42-120 56-134 (331)
12 KOG0526|consensus 99.6 2E-15 4.4E-20 131.7 7.5 81 35-119 522-602 (615)
13 KOG4715|consensus 99.6 1.8E-14 4E-19 118.8 12.1 111 6-119 19-135 (410)
14 KOG3248|consensus 99.2 1.7E-11 3.7E-16 102.2 6.8 73 47-119 190-262 (421)
15 KOG0528|consensus 99.1 8.6E-11 1.9E-15 102.0 3.9 80 42-121 319-398 (511)
16 PF14887 HMG_box_5: HMG (high 98.5 1.1E-06 2.4E-11 58.8 7.5 74 48-122 3-76 (85)
17 PTZ00199 high mobility group p 98.4 1.5E-07 3.3E-12 66.1 1.9 36 142-177 11-46 (94)
18 KOG2746|consensus 98.2 1.5E-06 3.2E-11 78.6 4.8 69 44-112 177-247 (683)
19 PF09011 HMG_box_2: HMG-box do 97.5 8.1E-05 1.8E-09 49.5 2.6 25 151-175 1-25 (73)
20 cd01390 HMGB-UBF_HMG-box HMGB- 97.4 0.0001 2.3E-09 47.5 2.5 24 154-177 1-24 (66)
21 PF04690 YABBY: YABBY protein; 97.3 0.0005 1.1E-08 53.2 5.8 49 43-91 116-164 (170)
22 PF00505 HMG_box: HMG (high mo 97.3 0.00021 4.5E-09 46.5 2.4 24 154-177 1-24 (69)
23 smart00398 HMG high mobility g 97.2 0.00024 5.3E-09 46.0 2.5 25 153-177 1-25 (70)
24 PF06382 DUF1074: Protein of u 97.2 0.0011 2.3E-08 51.4 5.6 49 53-105 83-131 (183)
25 cd01389 MATA_HMG-box MATA_HMG- 97.1 0.00035 7.6E-09 46.9 2.6 25 153-177 1-25 (77)
26 cd00084 HMG-box High Mobility 97.1 0.00035 7.5E-09 44.7 2.5 24 154-177 1-24 (66)
27 KOG0526|consensus 97.1 0.00029 6.3E-09 62.7 2.3 36 143-178 525-560 (615)
28 COG5648 NHP6B Chromatin-associ 97.1 0.00053 1.1E-08 54.4 3.3 68 47-114 142-209 (211)
29 cd01388 SOX-TCF_HMG-box SOX-TC 96.8 0.0011 2.3E-08 44.0 2.5 24 154-177 2-25 (72)
30 PF08073 CHDNT: CHDNT (NUC034) 96.3 0.0062 1.4E-07 38.4 3.4 40 53-92 13-52 (55)
31 KOG0381|consensus 96.1 0.0047 1E-07 42.9 2.7 28 150-177 17-46 (96)
32 PF06244 DUF1014: Protein of u 92.2 0.21 4.5E-06 36.7 3.7 45 48-92 72-116 (122)
33 KOG3248|consensus 91.1 0.12 2.6E-06 44.1 1.6 23 153-175 191-213 (421)
34 KOG0527|consensus 90.5 0.17 3.7E-06 43.3 2.0 30 148-177 57-86 (331)
35 PF11304 DUF3106: Protein of u 90.0 3 6.4E-05 29.8 7.8 53 77-130 10-69 (107)
36 PF04769 MAT_Alpha1: Mating-ty 89.0 1.4 3.1E-05 35.1 6.1 55 43-103 38-92 (201)
37 TIGR03481 HpnM hopanoid biosyn 86.1 2.1 4.5E-05 33.9 5.5 46 73-118 63-110 (198)
38 cd00225 API3 Ascaris pepsin in 83.3 1.9 4.1E-05 32.7 3.8 29 84-120 25-53 (159)
39 PRK15117 ABC transporter perip 83.0 4 8.7E-05 32.6 5.9 48 72-119 66-115 (211)
40 PF11304 DUF3106: Protein of u 79.4 20 0.00043 25.5 9.5 32 68-99 35-68 (107)
41 PF12881 NUT_N: NUT protein N 77.2 6.8 0.00015 33.3 5.6 53 53-105 229-281 (328)
42 KOG3223|consensus 76.6 4 8.7E-05 32.3 3.8 44 50-93 166-209 (221)
43 PF05494 Tol_Tol_Ttg2: Toluene 75.4 5 0.00011 30.5 4.2 47 72-118 36-84 (170)
44 KOG3838|consensus 65.5 7.6 0.00016 34.2 3.5 37 89-125 268-304 (497)
45 PF04690 YABBY: YABBY protein; 65.3 6 0.00013 30.7 2.6 27 151-177 119-145 (170)
46 PF01352 KRAB: KRAB box; Inte 63.6 5.7 0.00012 23.3 1.7 29 76-104 3-32 (41)
47 PF13875 DUF4202: Domain of un 62.7 19 0.00042 28.3 5.0 39 55-96 131-169 (185)
48 PF06945 DUF1289: Protein of u 56.3 21 0.00045 21.9 3.4 26 75-105 23-48 (51)
49 PRK09706 transcriptional repre 52.9 53 0.0011 23.9 5.8 42 78-119 87-128 (135)
50 KOG0528|consensus 49.2 6.9 0.00015 35.1 0.6 20 153-172 325-344 (511)
51 PF12650 DUF3784: Domain of un 43.1 17 0.00036 25.1 1.6 15 86-100 25-39 (97)
52 COG2854 Ttg2D ABC-type transpo 41.7 37 0.00079 27.2 3.5 41 81-121 78-119 (202)
53 PRK10236 hypothetical protein; 40.3 33 0.00072 28.1 3.1 26 79-104 118-143 (237)
54 KOG1610|consensus 32.8 1.3E+02 0.0029 25.7 5.7 39 76-117 218-256 (322)
55 PRK12751 cpxP periplasmic stre 32.2 1.2E+02 0.0026 23.3 5.0 23 83-105 123-145 (162)
56 PRK12750 cpxP periplasmic repr 31.9 1.4E+02 0.003 23.0 5.4 23 111-133 128-150 (170)
57 cd07081 ALDH_F20_ACDH_EutE-lik 31.5 1.2E+02 0.0027 26.8 5.6 41 78-118 6-46 (439)
58 PF15581 Imm35: Immunity prote 29.1 92 0.002 21.6 3.4 23 75-97 31-53 (93)
59 PRK14296 chaperone protein Dna 26.8 1.3E+02 0.0027 26.2 4.8 38 64-101 26-67 (372)
60 PF02026 RyR: RyR domain; Int 26.3 69 0.0015 22.2 2.5 21 86-106 60-80 (94)
61 PF06628 Catalase-rel: Catalas 26.0 76 0.0016 20.4 2.6 19 82-100 12-30 (68)
62 PF13945 NST1: Salt tolerance 25.7 1.9E+02 0.004 22.9 5.1 37 76-115 100-136 (190)
63 KOG4715|consensus 25.6 53 0.0012 28.2 2.1 29 149-177 60-88 (410)
64 PF05388 Carbpep_Y_N: Carboxyp 25.4 1.4E+02 0.003 21.5 4.0 28 76-103 45-72 (113)
65 PRK14279 chaperone protein Dna 24.2 1.3E+02 0.0027 26.4 4.3 41 62-102 29-74 (392)
66 PF08367 M16C_assoc: Peptidase 24.0 1.7E+02 0.0036 23.6 4.8 30 77-106 13-42 (248)
67 cd07122 ALDH_F20_ACDH Coenzyme 23.7 2.2E+02 0.0048 25.2 5.9 40 79-118 7-46 (436)
68 PRK14283 chaperone protein Dna 23.7 1.8E+02 0.0039 25.2 5.1 41 62-102 25-69 (378)
69 TIGR00787 dctP tripartite ATP- 22.8 1.9E+02 0.0041 23.2 4.9 26 84-109 213-238 (257)
70 PF06394 Pepsin-I3: Pepsin inh 22.8 82 0.0018 21.1 2.2 26 88-121 38-63 (76)
71 PRK14291 chaperone protein Dna 22.2 1.8E+02 0.0039 25.2 4.9 40 62-101 23-66 (382)
72 cd07132 ALDH_F3AB Aldehyde deh 22.2 2.2E+02 0.0047 25.2 5.5 41 78-118 5-45 (443)
73 PTZ00037 DnaJ_C chaperone prot 22.1 1.7E+02 0.0037 25.9 4.8 43 60-102 46-89 (421)
74 cd08317 Death_ank Death domain 21.8 50 0.0011 22.1 1.1 21 72-92 3-23 (84)
75 cd07085 ALDH_F6_MMSDH Methylma 21.2 2.5E+02 0.0055 24.9 5.7 37 80-116 47-83 (478)
76 cd07133 ALDH_CALDH_CalB Conife 21.0 2.4E+02 0.0051 24.8 5.5 40 78-117 5-44 (434)
77 PRK14299 chaperone protein Dna 20.8 2.2E+02 0.0047 23.7 5.0 39 63-101 25-67 (291)
78 KOG1827|consensus 20.6 6.1 0.00013 36.7 -4.7 44 52-95 552-595 (629)
79 PRK10266 curved DNA-binding pr 20.4 2.5E+02 0.0053 23.5 5.2 39 63-101 25-67 (306)
80 PF09164 VitD-bind_III: Vitami 20.1 2.7E+02 0.0058 18.2 4.8 32 54-85 9-40 (68)
81 PRK10363 cpxP periplasmic repr 20.1 2.9E+02 0.0063 21.4 5.1 23 83-105 117-139 (166)
No 1
>PTZ00199 high mobility group protein; Provisional
Probab=99.90 E-value=3.8e-23 Score=145.72 Aligned_cols=83 Identities=29% Similarity=0.555 Sum_probs=77.3
Q ss_pred cccccCCCCCCCCCCCCCCChHHHHHHHHHHHHHHhCCCCC--HHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHH
Q psy14923 35 LKQQTSQPLSIPEKPKKPLTPYFKFQGQIRNKIMERNPNIK--IIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQN 112 (180)
Q Consensus 35 ~k~~~~k~~~~p~~PKrP~say~lF~~e~r~~~~~~~p~~~--~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~ 112 (180)
.+++..++.+||+.|++|+|||++|+.++|..|..+||+++ +++|+++||++|+.||++||.+|+++|++++++|..+
T Consensus 9 ~~k~~~k~~kdp~~PKrP~sAY~~F~~~~R~~i~~~~P~~~~~~~evsk~ige~Wk~ls~eeK~~y~~~A~~dk~rY~~e 88 (94)
T PTZ00199 9 LVRKNKRKKKDPNAPKRALSAYMFFAKEKRAEIIAENPELAKDVAAVGKMVGEAWNKLSEEEKAPYEKKAQEDKVRYEKE 88 (94)
T ss_pred cccccCCCCCCCCCCCCCCcHHHHHHHHHHHHHHHHCcCCcccHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHHH
Confidence 34455567899999999999999999999999999999986 8999999999999999999999999999999999999
Q ss_pred HHhhh
Q psy14923 113 VKKYH 117 (180)
Q Consensus 113 ~~~y~ 117 (180)
|.+|.
T Consensus 89 ~~~Y~ 93 (94)
T PTZ00199 89 KAEYA 93 (94)
T ss_pred HHHHh
Confidence 99995
No 2
>cd01389 MATA_HMG-box MATA_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include the fungal mating type gene products MC, MATA1 and Ste11.
Probab=99.83 E-value=1.9e-20 Score=127.10 Aligned_cols=72 Identities=17% Similarity=0.335 Sum_probs=69.5
Q ss_pred CCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhhhhh
Q psy14923 48 KPKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDE 119 (180)
Q Consensus 48 ~PKrP~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~y~~~ 119 (180)
.|+||+|||+||+++.|..++.+||++++.+|+++||++|+.||+++|++|.++|++++++|..++++|.-.
T Consensus 1 ~~kRP~naf~lf~~~~r~~~~~~~p~~~~~eisk~~g~~Wk~ls~eeK~~y~~~A~~~k~~~~~~~p~Yky~ 72 (77)
T cd01389 1 KIPRPRNAFILYRQDKHAQLKTENPGLTNNEISRIIGRMWRSESPEVKAYYKELAEEEKERHAREYPDYKYT 72 (77)
T ss_pred CCCCCCcHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHhhCCHHHHHHHHHHHHHHHHHHHHHCCCCccc
Confidence 478999999999999999999999999999999999999999999999999999999999999999999863
No 3
>COG5648 NHP6B Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics]
Probab=99.82 E-value=3.4e-20 Score=145.19 Aligned_cols=85 Identities=31% Similarity=0.606 Sum_probs=81.3
Q ss_pred cccCCCCCCCCCCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy14923 37 QQTSQPLSIPEKPKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKY 116 (180)
Q Consensus 37 ~~~~k~~~~p~~PKrP~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~y 116 (180)
+..+++.+||+.||||+|||++|+.++|.++..++|++++.+|.+++|++|++|+|+||++|...|..++++|..++..|
T Consensus 59 k~~~r~k~dpN~PKRp~sayf~y~~~~R~ei~~~~p~l~~~e~~k~~~e~WK~Ltd~eke~y~k~~~~~~erYq~ek~~y 138 (211)
T COG5648 59 KRLVRKKKDPNGPKRPLSAYFLYSAENRDEIRKENPKLTFGEVGKLLSEKWKELTDEEKEPYYKEANSDRERYQREKEEY 138 (211)
T ss_pred HHHHHHhcCCCCCCCchhHHHHHHHHHHHHHHHhCCCCChHHHHHHHHHHHHhccHhhhhhHHHHHhhHHHHHHHHHHhh
Confidence 45567889999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred hhhCC
Q psy14923 117 HDELT 121 (180)
Q Consensus 117 ~~~~~ 121 (180)
...++
T Consensus 139 ~~k~~ 143 (211)
T COG5648 139 NKKLP 143 (211)
T ss_pred hcccC
Confidence 99876
No 4
>cd01388 SOX-TCF_HMG-box SOX-TCF_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include SRY and its homologs in insects and vertebrates, and transcription factor-like proteins, TCF-1, -3, -4, and LEF-1. They appear to bind the minor groove of the A/T C A A A G/C-motif.
Probab=99.82 E-value=6.8e-20 Score=122.93 Aligned_cols=69 Identities=23% Similarity=0.456 Sum_probs=67.1
Q ss_pred CCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy14923 49 PKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYH 117 (180)
Q Consensus 49 PKrP~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~y~ 117 (180)
.|||+|||++|++++|..++.+||++++.+|+++||++|+.||+++|++|.++|+.++++|.+++++|+
T Consensus 2 iKrP~naf~~F~~~~r~~~~~~~p~~~~~eisk~l~~~Wk~ls~~eK~~y~~~a~~~k~~y~~~~p~y~ 70 (72)
T cd01388 2 IKRPMNAFMLFSKRHRRKVLQEYPLKENRAISKILGDRWKALSNEEKQPYYEEAKKLKELHMKLYPDYK 70 (72)
T ss_pred CCCCCcHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHHHCcCCC
Confidence 479999999999999999999999999999999999999999999999999999999999999999885
No 5
>PF00505 HMG_box: HMG (high mobility group) box; InterPro: IPR000910 High mobility group (HMG or HMGB) proteins are a family of relatively low molecular weight non-histone components in chromatin. HMG1 (also called HMG-T in fish) and HMG2 are two highly related proteins that bind single-stranded DNA preferentially and unwind double-stranded DNA. Although they have no sequence specificity, they have a high affinity for bent or distorted DNA, and bend linear DNA. HMG1 and HMG2 contain two DNA-binding HMG-box domains (A and B) that show structural and functional differences, and have a long acidic C-terminal domain rich in aspartic and glutamic acid residues. The acidic tail modulates the affinity of the tandem HMG boxes in HMG1 and 2 for a variety of DNA targets. HMG1 and 2 appear to play important architectural roles in the assembly of nucleoprotein complexes in a variety of biological processes, for example V(D)J recombination, the initiation of transcription, and DNA repair []. The profile in this entry describing the HMG-domains is much more general than the signature. In addition to the HMG1 and HMG2 proteins, HMG-domains occur in single or multiple copies in the following protein classes; the SOX family of transcription factors; SRY sex determining region Y protein and related proteins []; LEF1 lymphoid enhancer binding factor 1 []; SSRP recombination signal recognition protein; MTF1 mitochondrial transcription factor 1; UBF1/2 nucleolar transcription factors; Abf2 yeast ARS-binding factor []; and Saccharomyces cerevisiae transcription factors Ixr1, Rox1, Nhp6a, Nhp6b and Spp41.; GO: 0003677 DNA binding; PDB: 1I11_A 1J3C_A 1J3D_A 1WZ6_A 1WGF_A 2D7L_A 1GT0_D 3U2B_C 2CRJ_A 2CS1_A ....
Probab=99.81 E-value=1.6e-19 Score=119.42 Aligned_cols=69 Identities=32% Similarity=0.641 Sum_probs=65.8
Q ss_pred CCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy14923 49 PKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYH 117 (180)
Q Consensus 49 PKrP~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~y~ 117 (180)
|+||+|||++|+.+.+..++.+||+++..+|+++||++|++||+++|.+|.+.|++++.+|.+++.+|+
T Consensus 1 PkrP~~af~lf~~~~~~~~k~~~p~~~~~~i~~~~~~~W~~l~~~eK~~y~~~a~~~~~~y~~~~~~y~ 69 (69)
T PF00505_consen 1 PKRPPNAFMLFCKEKRAKLKEENPDLSNKEISKILAQMWKNLSEEEKAPYKEEAEEEKERYEKEMPEYK 69 (69)
T ss_dssp SSSS--HHHHHHHHHHHHHHHHSTTSTHHHHHHHHHHHHHCSHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred CcCCCCHHHHHHHHHHHHHHHHhcccccccchhhHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHhcC
Confidence 899999999999999999999999999999999999999999999999999999999999999999985
No 6
>cd01390 HMGB-UBF_HMG-box HMGB-UBF_HMG-box, class II and III members of the HMG-box superfamily of DNA-binding proteins. These proteins bind the minor groove of DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
Probab=99.80 E-value=4.3e-19 Score=116.35 Aligned_cols=65 Identities=42% Similarity=0.806 Sum_probs=63.7
Q ss_pred CCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHH
Q psy14923 49 PKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNV 113 (180)
Q Consensus 49 PKrP~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~ 113 (180)
||+|+|||++|+.+.|..++.+||++++.+|++.||++|++||+++|++|.+.|++++++|..+|
T Consensus 1 Pkrp~saf~~f~~~~r~~~~~~~p~~~~~~i~~~~~~~W~~ls~~eK~~y~~~a~~~~~~y~~e~ 65 (66)
T cd01390 1 PKRPLSAYFLFSQEQRPKLKKENPDASVTEVTKILGEKWKELSEEEKKKYEEKAEKDKERYEKEM 65 (66)
T ss_pred CCCCCcHHHHHHHHHHHHHHHHCcCCCHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHHHHHHhh
Confidence 89999999999999999999999999999999999999999999999999999999999999887
No 7
>smart00398 HMG high mobility group.
Probab=99.79 E-value=7.6e-19 Score=116.04 Aligned_cols=70 Identities=34% Similarity=0.665 Sum_probs=67.9
Q ss_pred CCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy14923 48 KPKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYH 117 (180)
Q Consensus 48 ~PKrP~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~y~ 117 (180)
+|++|+|||++|+++.|..+..+||++++.+|++.||++|+.||+++|++|.+.|++++++|..++..|.
T Consensus 1 ~pkrp~~~y~~f~~~~r~~~~~~~~~~~~~~i~~~~~~~W~~l~~~ek~~y~~~a~~~~~~y~~~~~~y~ 70 (70)
T smart00398 1 KPKRPMSAFMLFSQENRAKIKAENPDLSNAEISKKLGERWKLLSEEEKAPYEEKAKKDKERYEEEMPEYK 70 (70)
T ss_pred CcCCCCcHHHHHHHHHHHHHHHHCcCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHHHHHhcC
Confidence 5899999999999999999999999999999999999999999999999999999999999999999874
No 8
>PF09011 HMG_box_2: HMG-box domain; InterPro: IPR015101 This domain is predominantly found in Maelstrom homologue proteins. It has no known function. ; GO: 0005634 nucleus; PDB: 2EQZ_A 1V64_A 2CTO_A 1H5P_A 3TQ6_A 3FGH_A 3TMM_A 1J3X_A 2YRQ_A 1AAB_A ....
Probab=99.79 E-value=7.1e-19 Score=118.26 Aligned_cols=72 Identities=33% Similarity=0.621 Sum_probs=63.8
Q ss_pred CCCCCCCCChHHHHHHHHHHHHHHh-CCCCCHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy14923 46 PEKPKKPLTPYFKFQGQIRNKIMER-NPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYH 117 (180)
Q Consensus 46 p~~PKrP~say~lF~~e~r~~~~~~-~p~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~y~ 117 (180)
|++||+|+|||+||+.+.+..++.. ++.....|+++.|+..|++||++||.+|+++|++++++|..+|..|.
T Consensus 1 p~kpK~~~say~lF~~~~~~~~k~~G~~~~~~~e~~k~~~~~Wk~Ls~~EK~~Y~~~A~~~k~~y~~e~~~~~ 73 (73)
T PF09011_consen 1 PKKPKRPPSAYNLFMKEMRKEVKEEGGQKQSFREVMKEISERWKSLSEEEKEPYEERAKEDKERYEREMKEWN 73 (73)
T ss_dssp SSS--SSSSHHHHHHHHHHHHHHHHT-T-SSHHHHHHHHHHHHHHS-HHHHHHHHHHHHHHHHHHHHHHHHH-
T ss_pred CcCCCCCCCHHHHHHHHHHHHHHHhcccCCCHHHHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhcC
Confidence 6899999999999999999999888 77788999999999999999999999999999999999999999984
No 9
>cd00084 HMG-box High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III member
Probab=99.73 E-value=3e-17 Score=107.15 Aligned_cols=65 Identities=37% Similarity=0.753 Sum_probs=63.1
Q ss_pred CCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHH
Q psy14923 49 PKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNV 113 (180)
Q Consensus 49 PKrP~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~ 113 (180)
|++|+|||++|+++.+..++.++|+++..+|++.||++|+.|++++|.+|.+.|++++.+|.+++
T Consensus 1 pkrp~~af~~f~~~~~~~~~~~~~~~~~~~i~~~~~~~W~~l~~~~k~~y~~~a~~~~~~y~~~~ 65 (66)
T cd00084 1 PKRPLSAYFLFSQEHRAEVKAENPGLSVGEISKILGEMWKSLSEEEKKKYEEKAEKDKERYEKEM 65 (66)
T ss_pred CCCCCcHHHHHHHHHHHHHHHHCcCCCHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHHHHHHhh
Confidence 78999999999999999999999999999999999999999999999999999999999998775
No 10
>KOG0381|consensus
Probab=99.72 E-value=8.2e-17 Score=113.30 Aligned_cols=76 Identities=37% Similarity=0.681 Sum_probs=72.3
Q ss_pred CC--CCCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHH-hhhhhC
Q psy14923 45 IP--EKPKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVK-KYHDEL 120 (180)
Q Consensus 45 ~p--~~PKrP~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~-~y~~~~ 120 (180)
+| +.|++|+|||++|+.+.+..++.+||++++.+|++++|++|++|++++|.+|+..+..++++|..+|. .|+..+
T Consensus 17 ~p~~~~pkrp~sa~~~f~~~~~~~~k~~~p~~~~~~v~k~~g~~W~~l~~~~k~~y~~ka~~~k~~Y~~~~~~~~~~~~ 95 (96)
T KOG0381|consen 17 DPNAQAPKRPLSAFFLFSSEQRSKIKAENPGLSVGEVAKALGEMWKNLAEEEKQPYEEKASKLKEKYEKELAGEYKASL 95 (96)
T ss_pred CCCCCCCCCCCcHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc
Confidence 56 59999999999999999999999999999999999999999999999999999999999999999999 887754
No 11
>KOG0527|consensus
Probab=99.69 E-value=1e-16 Score=135.21 Aligned_cols=79 Identities=20% Similarity=0.413 Sum_probs=74.2
Q ss_pred CCCCCCCCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhhhhhC
Q psy14923 42 PLSIPEKPKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDEL 120 (180)
Q Consensus 42 ~~~~p~~PKrP~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~y~~~~ 120 (180)
........||||||||+|.++.|.+|..+||++.+.||+|.||.+|+.|+|+||.+|+++|++.+..|++++.+|+-.-
T Consensus 56 ~k~~~~hIKRPMNAFMVWSq~~RRkma~qnP~mHNSEISK~LG~~WK~Lse~EKrPFi~EAeRLR~~HmkehPdYKYRP 134 (331)
T KOG0527|consen 56 DKTSTDRIKRPMNAFMVWSQGQRRKLAKQNPKMHNSEISKRLGAEWKLLSEEEKRPFVDEAERLRAQHMKEYPDYKYRP 134 (331)
T ss_pred CCCCccccCCCcchhhhhhHHHHHHHHHhCcchhhHHHHHHHHHHHhhcCHhhhccHHHHHHHHHHHHHHhCCCccccc
Confidence 3344578999999999999999999999999999999999999999999999999999999999999999999999753
No 12
>KOG0526|consensus
Probab=99.59 E-value=2e-15 Score=131.73 Aligned_cols=81 Identities=26% Similarity=0.490 Sum_probs=75.8
Q ss_pred cccccCCCCCCCCCCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHH
Q psy14923 35 LKQQTSQPLSIPEKPKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVK 114 (180)
Q Consensus 35 ~k~~~~k~~~~p~~PKrP~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~ 114 (180)
.+++..|+.+||++|||++||||+|++..|..|+.+ ++++++|++.+|++|+.||+ |.+|++.|+.++++|+.+|.
T Consensus 522 ~~~k~~kk~kdpnapkra~sa~m~w~~~~r~~ik~d--gi~~~dv~kk~g~~wk~ms~--k~~we~ka~~dk~ry~~em~ 597 (615)
T KOG0526|consen 522 EKKKKGKKKKDPNAPKRATSAYMLWLNASRESIKED--GISVGDVAKKAGEKWKQMSA--KEEWEDKAAVDKQRYEDEMK 597 (615)
T ss_pred ccccCcccCCCCCCCccchhHHHHHHHhhhhhHhhc--CchHHHHHHHHhHHHhhhcc--cchhhHHHHHHHHHHHHHHH
Confidence 344777889999999999999999999999999887 89999999999999999999 99999999999999999999
Q ss_pred hhhhh
Q psy14923 115 KYHDE 119 (180)
Q Consensus 115 ~y~~~ 119 (180)
+|+..
T Consensus 598 ~yk~g 602 (615)
T KOG0526|consen 598 EYKNG 602 (615)
T ss_pred hhcCC
Confidence 99954
No 13
>KOG4715|consensus
Probab=99.59 E-value=1.8e-14 Score=118.81 Aligned_cols=111 Identities=30% Similarity=0.516 Sum_probs=96.5
Q ss_pred cCCCCCCCCccc------ccccCCCCCcccccccccccccCCCCCCCCCCCCCCChHHHHHHHHHHHHHHhCCCCCHHHH
Q psy14923 6 LTTGVFTGGNVF------NLRSCLMPHLNSVTTAGLKQQTSQPLSIPEKPKKPLTPYFKFQGQIRNKIMERNPNIKIIEV 79 (180)
Q Consensus 6 ~~~~~~~~~~~~------~~~~~~~p~~~~~~~~~~k~~~~k~~~~p~~PKrP~say~lF~~e~r~~~~~~~p~~~~~ei 79 (180)
.-|+-|+|++.| |.+.++.+++++++....- ....+-|.+|-+|+-+||.|.+..|++|++.||++...||
T Consensus 19 ~st~g~~g~~~~~h~~y~n~r~~~~~~T~sr~~~~~~---~t~pkpPkppekpl~pymrySrkvWd~VkA~nPe~kLWei 95 (410)
T KOG4715|consen 19 PSTPGFGGYNPYTHLAYSNYRLPGNPATNSRVFQNSS---ITRPKPPKPPEKPLMPYMRYSRKVWDQVKASNPELKLWEI 95 (410)
T ss_pred CCCCCCCCCCcchhhhccCcCCCCCCCCccccccccc---ccCCCCCCCCCcccchhhHHhhhhhhhhhccCcchHHHHH
Confidence 446668888877 7777778887777666533 2344567889999999999999999999999999999999
Q ss_pred HHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhhhhh
Q psy14923 80 TKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDE 119 (180)
Q Consensus 80 ~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~y~~~ 119 (180)
-++||.+|..|+|+||+.|.+.++.++..|++.|..|..+
T Consensus 96 GK~Ig~mW~dLpd~EK~ey~~EYeaEKieY~~smkayh~s 135 (410)
T KOG4715|consen 96 GKIIGGMWLDLPDEEKQEYLNEYEAEKIEYNESMKAYHNS 135 (410)
T ss_pred HHHHHHHHhhCcchHHHHHHHHHHHHHHHHHHHHHHhhCC
Confidence 9999999999999999999999999999999999999884
No 14
>KOG3248|consensus
Probab=99.24 E-value=1.7e-11 Score=102.15 Aligned_cols=73 Identities=15% Similarity=0.337 Sum_probs=67.8
Q ss_pred CCCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhhhhh
Q psy14923 47 EKPKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDE 119 (180)
Q Consensus 47 ~~PKrP~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~y~~~ 119 (180)
.-.|+|+|||+||++|.|++|.+++.-....+|.++||.+|..||.||..+|.++|.++++.+.+-+..|-+.
T Consensus 190 phiKKPLNAFmlyMKEmRa~vvaEctlKeSAaiNqiLGrRWH~LSrEEQAKYyElArKerqlH~qlYP~WSAR 262 (421)
T KOG3248|consen 190 PHIKKPLNAFMLYMKEMRAKVVAECTLKESAAINQILGRRWHALSREEQAKYYELARKERQLHMQLYPGWSAR 262 (421)
T ss_pred ccccccHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHhHHHhhhhHHHHHHHHHHHHHHHHHHHHhcCCcchh
Confidence 3568999999999999999999998756689999999999999999999999999999999999999999874
No 15
>KOG0528|consensus
Probab=99.07 E-value=8.6e-11 Score=101.99 Aligned_cols=80 Identities=19% Similarity=0.380 Sum_probs=72.3
Q ss_pred CCCCCCCCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhhhhhCC
Q psy14923 42 PLSIPEKPKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDELT 121 (180)
Q Consensus 42 ~~~~p~~PKrP~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~y~~~~~ 121 (180)
....+.-.||||||||+|.+|.|.+|...+||+.+..|+++||.+|+.||-.||++|.++-...-..|.+.+++|+-.-.
T Consensus 319 ~~ss~PHIKRPMNAFMVWAkDERRKILqA~PDMHNSnISKILGSRWKaMSN~eKQPYYEEQaRLSk~HlEk~PdYrYkPR 398 (511)
T KOG0528|consen 319 RASSEPHIKRPMNAFMVWAKDERRKILQAFPDMHNSNISKILGSRWKAMSNTEKQPYYEEQARLSKLHLEKYPDYRYKPR 398 (511)
T ss_pred cCCCCccccCCcchhhcccchhhhhhhhcCccccccchhHHhcccccccccccccchHHHHHHHHHhhhccCcccccCCC
Confidence 33445667999999999999999999999999999999999999999999999999999999988899999999986543
No 16
>PF14887 HMG_box_5: HMG (high mobility group) box 5; PDB: 1L8Y_A 1L8Z_A 2HDZ_A.
Probab=98.46 E-value=1.1e-06 Score=58.79 Aligned_cols=74 Identities=15% Similarity=0.277 Sum_probs=61.7
Q ss_pred CCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhhhhhCCH
Q psy14923 48 KPKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDELTP 122 (180)
Q Consensus 48 ~PKrP~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~y~~~~~~ 122 (180)
-|..|-+|--+|.+.....+.+.++.....+ .+.+...|.+|++.+|.+|+..|.++..+|+.+|.+|+..+++
T Consensus 3 lPE~PKt~qe~Wqq~vi~dYla~~~~dr~K~-~kam~~~W~~me~Kekl~WIkKA~EdqKrYE~el~e~r~~~~~ 76 (85)
T PF14887_consen 3 LPETPKTAQEIWQQSVIGDYLAKFRNDRKKA-LKAMEAQWSQMEKKEKLKWIKKAAEDQKRYERELREMRSAPAD 76 (85)
T ss_dssp -S----THHHHHHHHHHHHHHHHTTSTHHHH-HHHHHHHHHTTGGGHHHHHHHHHHHHHHHHHHHHHCCS-CCCT
T ss_pred CCCCCCCHHHHHHHHHHHHHHHHhhHhHHHH-HHHHHHHHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHhcCCCC
Confidence 4778889999999999999999998877666 4589999999999999999999999999999999999987663
No 17
>PTZ00199 high mobility group protein; Provisional
Probab=98.38 E-value=1.5e-07 Score=66.13 Aligned_cols=36 Identities=19% Similarity=0.333 Sum_probs=30.5
Q ss_pred HHHhhhhhccCCCCCCCchHHHHHHHhhhhhhhcCC
Q psy14923 142 REMKKLFRETGKPKKPGNSFYLFIKDNIDKKEYEGK 177 (180)
Q Consensus 142 ~~~kk~~k~~~~PKrP~say~lF~~e~r~~~~~~~~ 177 (180)
++.++..+|++.||+|+|||++||.++|..+..+|.
T Consensus 11 k~~~k~~kdp~~PKrP~sAY~~F~~~~R~~i~~~~P 46 (94)
T PTZ00199 11 RKNKRKKKDPNAPKRALSAYMFFAKEKRAEIIAENP 46 (94)
T ss_pred cccCCCCCCCCCCCCCCcHHHHHHHHHHHHHHHHCc
Confidence 344567899999999999999999999998877663
No 18
>KOG2746|consensus
Probab=98.21 E-value=1.5e-06 Score=78.63 Aligned_cols=69 Identities=20% Similarity=0.409 Sum_probs=63.6
Q ss_pred CCCCCCCCCCChHHHHHHHHH--HHHHHhCCCCCHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHH
Q psy14923 44 SIPEKPKKPLTPYFKFQGQIR--NKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQN 112 (180)
Q Consensus 44 ~~p~~PKrP~say~lF~~e~r--~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~ 112 (180)
.+..-..+|||+|++|++.+| ..+...||+..+.-|+++||++|-.|.+.||+.|.++|.+.++.|-++
T Consensus 177 r~k~HirrPMnaf~ifskrhr~~g~vhq~~pn~DNrtIskiLgewWytL~~~Ekq~yhdLa~Qvk~Ahfka 247 (683)
T KOG2746|consen 177 RDKDHIRRPMNAFHIFSKRHRGEGRVHQRHPNQDNRTISKILGEWWYTLGPNEKQKYHDLAFQVKEAHFKA 247 (683)
T ss_pred CcchhhhhhhHHHHHHHhhcCCccchhccCccccchhHHHHHhhhHhhhCchhhhhHHHHHHHHHHHHhhh
Confidence 344567899999999999999 888899999999999999999999999999999999999999998776
No 19
>PF09011 HMG_box_2: HMG-box domain; InterPro: IPR015101 This domain is predominantly found in Maelstrom homologue proteins. It has no known function. ; GO: 0005634 nucleus; PDB: 2EQZ_A 1V64_A 2CTO_A 1H5P_A 3TQ6_A 3FGH_A 3TMM_A 1J3X_A 2YRQ_A 1AAB_A ....
Probab=97.48 E-value=8.1e-05 Score=49.54 Aligned_cols=25 Identities=32% Similarity=0.591 Sum_probs=20.7
Q ss_pred cCCCCCCCchHHHHHHHhhhhhhhc
Q psy14923 151 TGKPKKPGNSFYLFIKDNIDKKEYE 175 (180)
Q Consensus 151 ~~~PKrP~say~lF~~e~r~~~~~~ 175 (180)
|++||+|+|||+||+.+++..+..+
T Consensus 1 p~kpK~~~say~lF~~~~~~~~k~~ 25 (73)
T PF09011_consen 1 PKKPKRPPSAYNLFMKEMRKEVKEE 25 (73)
T ss_dssp SSS--SSSSHHHHHHHHHHHHHHHH
T ss_pred CcCCCCCCCHHHHHHHHHHHHHHHh
Confidence 5689999999999999999998766
No 20
>cd01390 HMGB-UBF_HMG-box HMGB-UBF_HMG-box, class II and III members of the HMG-box superfamily of DNA-binding proteins. These proteins bind the minor groove of DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
Probab=97.43 E-value=0.0001 Score=47.47 Aligned_cols=24 Identities=29% Similarity=0.605 Sum_probs=22.0
Q ss_pred CCCCCchHHHHHHHhhhhhhhcCC
Q psy14923 154 PKKPGNSFYLFIKDNIDKKEYEGK 177 (180)
Q Consensus 154 PKrP~say~lF~~e~r~~~~~~~~ 177 (180)
||+|+|||++|+++.|..+..+|.
T Consensus 1 Pkrp~saf~~f~~~~r~~~~~~~p 24 (66)
T cd01390 1 PKRPLSAYFLFSQEQRPKLKKENP 24 (66)
T ss_pred CCCCCcHHHHHHHHHHHHHHHHCc
Confidence 899999999999999999987764
No 21
>PF04690 YABBY: YABBY protein; InterPro: IPR006780 YABBY proteins are a group of plant-specific transcription factors involved in the specification of abaxial polarity in lateral organs such as leaves and floral organs [, ].
Probab=97.35 E-value=0.0005 Score=53.21 Aligned_cols=49 Identities=22% Similarity=0.383 Sum_probs=43.5
Q ss_pred CCCCCCCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCC
Q psy14923 43 LSIPEKPKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIG 91 (180)
Q Consensus 43 ~~~p~~PKrP~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls 91 (180)
.+-|.+-.|-+|||..|+++....|++.+|+++..|....++..|...+
T Consensus 116 ~kPPEKRqR~psaYn~f~k~ei~rik~~~p~ishkeaFs~aAknW~h~p 164 (170)
T PF04690_consen 116 NKPPEKRQRVPSAYNRFMKEEIQRIKAENPDISHKEAFSAAAKNWAHFP 164 (170)
T ss_pred cCCccccCCCchhHHHHHHHHHHHHHhcCCCCCHHHHHHHHHHhhhhCc
Confidence 3445566678999999999999999999999999999999999998765
No 22
>PF00505 HMG_box: HMG (high mobility group) box; InterPro: IPR000910 High mobility group (HMG or HMGB) proteins are a family of relatively low molecular weight non-histone components in chromatin. HMG1 (also called HMG-T in fish) and HMG2 are two highly related proteins that bind single-stranded DNA preferentially and unwind double-stranded DNA. Although they have no sequence specificity, they have a high affinity for bent or distorted DNA, and bend linear DNA. HMG1 and HMG2 contain two DNA-binding HMG-box domains (A and B) that show structural and functional differences, and have a long acidic C-terminal domain rich in aspartic and glutamic acid residues. The acidic tail modulates the affinity of the tandem HMG boxes in HMG1 and 2 for a variety of DNA targets. HMG1 and 2 appear to play important architectural roles in the assembly of nucleoprotein complexes in a variety of biological processes, for example V(D)J recombination, the initiation of transcription, and DNA repair []. The profile in this entry describing the HMG-domains is much more general than the signature. In addition to the HMG1 and HMG2 proteins, HMG-domains occur in single or multiple copies in the following protein classes; the SOX family of transcription factors; SRY sex determining region Y protein and related proteins []; LEF1 lymphoid enhancer binding factor 1 []; SSRP recombination signal recognition protein; MTF1 mitochondrial transcription factor 1; UBF1/2 nucleolar transcription factors; Abf2 yeast ARS-binding factor []; and Saccharomyces cerevisiae transcription factors Ixr1, Rox1, Nhp6a, Nhp6b and Spp41.; GO: 0003677 DNA binding; PDB: 1I11_A 1J3C_A 1J3D_A 1WZ6_A 1WGF_A 2D7L_A 1GT0_D 3U2B_C 2CRJ_A 2CS1_A ....
Probab=97.25 E-value=0.00021 Score=46.51 Aligned_cols=24 Identities=42% Similarity=0.668 Sum_probs=20.1
Q ss_pred CCCCCchHHHHHHHhhhhhhhcCC
Q psy14923 154 PKKPGNSFYLFIKDNIDKKEYEGK 177 (180)
Q Consensus 154 PKrP~say~lF~~e~r~~~~~~~~ 177 (180)
|+||+|||+|||.+.+..+..+|.
T Consensus 1 PkrP~~af~lf~~~~~~~~k~~~p 24 (69)
T PF00505_consen 1 PKRPPNAFMLFCKEKRAKLKEENP 24 (69)
T ss_dssp SSSS--HHHHHHHHHHHHHHHHST
T ss_pred CcCCCCHHHHHHHHHHHHHHHHhc
Confidence 899999999999999999987765
No 23
>smart00398 HMG high mobility group.
Probab=97.23 E-value=0.00024 Score=46.03 Aligned_cols=25 Identities=40% Similarity=0.635 Sum_probs=22.2
Q ss_pred CCCCCCchHHHHHHHhhhhhhhcCC
Q psy14923 153 KPKKPGNSFYLFIKDNIDKKEYEGK 177 (180)
Q Consensus 153 ~PKrP~say~lF~~e~r~~~~~~~~ 177 (180)
+|++|+|||++|+++.+..+..+|.
T Consensus 1 ~pkrp~~~y~~f~~~~r~~~~~~~~ 25 (70)
T smart00398 1 KPKRPMSAFMLFSQENRAKIKAENP 25 (70)
T ss_pred CcCCCCcHHHHHHHHHHHHHHHHCc
Confidence 4899999999999999999877663
No 24
>PF06382 DUF1074: Protein of unknown function (DUF1074); InterPro: IPR024460 This family consists of several proteins which appear to be specific to Insecta. The function of this family is unknown.
Probab=97.15 E-value=0.0011 Score=51.41 Aligned_cols=49 Identities=18% Similarity=0.379 Sum_probs=42.7
Q ss_pred CChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCHHHHHHHHHHHHHH
Q psy14923 53 LTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKE 105 (180)
Q Consensus 53 ~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~ 105 (180)
-+||+-|+.+.+. .|.+++..|+....+..|..||+++|..|..++...
T Consensus 83 nnaYLNFLReFRr----kh~~L~p~dlI~~AAraW~rLSe~eK~rYrr~~~~~ 131 (183)
T PF06382_consen 83 NNAYLNFLREFRR----KHCGLSPQDLIQRAARAWCRLSEAEKNRYRRMAPSV 131 (183)
T ss_pred chHHHHHHHHHHH----HccCCCHHHHHHHHHHHHHhCCHHHHHHHHhhcchh
Confidence 4679999999875 567999999999999999999999999998865543
No 25
>cd01389 MATA_HMG-box MATA_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include the fungal mating type gene products MC, MATA1 and Ste11.
Probab=97.14 E-value=0.00035 Score=46.89 Aligned_cols=25 Identities=28% Similarity=0.452 Sum_probs=21.8
Q ss_pred CCCCCCchHHHHHHHhhhhhhhcCC
Q psy14923 153 KPKKPGNSFYLFIKDNIDKKEYEGK 177 (180)
Q Consensus 153 ~PKrP~say~lF~~e~r~~~~~~~~ 177 (180)
.||||+|||+||+++.+..+..++.
T Consensus 1 ~~kRP~naf~lf~~~~r~~~~~~~p 25 (77)
T cd01389 1 KIPRPRNAFILYRQDKHAQLKTENP 25 (77)
T ss_pred CCCCCCcHHHHHHHHHHHHHHHHCC
Confidence 3899999999999999998876654
No 26
>cd00084 HMG-box High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III member
Probab=97.13 E-value=0.00035 Score=44.73 Aligned_cols=24 Identities=25% Similarity=0.593 Sum_probs=21.7
Q ss_pred CCCCCchHHHHHHHhhhhhhhcCC
Q psy14923 154 PKKPGNSFYLFIKDNIDKKEYEGK 177 (180)
Q Consensus 154 PKrP~say~lF~~e~r~~~~~~~~ 177 (180)
||+|+|||++|+++.+..+..+|.
T Consensus 1 pkrp~~af~~f~~~~~~~~~~~~~ 24 (66)
T cd00084 1 PKRPLSAYFLFSQEHRAEVKAENP 24 (66)
T ss_pred CCCCCcHHHHHHHHHHHHHHHHCc
Confidence 799999999999999999877764
No 27
>KOG0526|consensus
Probab=97.09 E-value=0.00029 Score=62.72 Aligned_cols=36 Identities=17% Similarity=0.368 Sum_probs=31.2
Q ss_pred HHhhhhhccCCCCCCCchHHHHHHHhhhhhhhcCCC
Q psy14923 143 EMKKLFRETGKPKKPGNSFYLFIKDNIDKKEYEGKS 178 (180)
Q Consensus 143 ~~kk~~k~~~~PKrP~say~lF~~e~r~~~~~~~~~ 178 (180)
+..++.|||++|||++||||+|....|.+++..+-+
T Consensus 525 k~~kk~kdpnapkra~sa~m~w~~~~r~~ik~dgi~ 560 (615)
T KOG0526|consen 525 KKGKKKKDPNAPKRATSAYMLWLNASRESIKEDGIS 560 (615)
T ss_pred cCcccCCCCCCCccchhHHHHHHHhhhhhHhhcCch
Confidence 345788999999999999999999999999877643
No 28
>COG5648 NHP6B Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics]
Probab=97.06 E-value=0.00053 Score=54.42 Aligned_cols=68 Identities=29% Similarity=0.546 Sum_probs=61.3
Q ss_pred CCCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHH
Q psy14923 47 EKPKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVK 114 (180)
Q Consensus 47 ~~PKrP~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~ 114 (180)
.+++.|..+|+-+-.+.+..+...+|+....+++++++..|++|++.-|.+|.+.+..++++|...++
T Consensus 142 ~~~~~~~~~~~e~~~~~r~~~~~~~~~~~~~e~~k~~~~~w~el~~skK~~~~~~~Kk~k~~~~~~~~ 209 (211)
T COG5648 142 LPNKAPIGPFIENEPKIRPKVEGPSPDKALVEETKIISKAWSELDESKKKKYIDKYKKLKEEYDSFYP 209 (211)
T ss_pred cCCCCCCchhhhccHHhccccCCCCcchhhhHHhhhhhhhhhhhChhhhhHHHHHHHHHHHHHhhhcc
Confidence 57778888888888888888888899999999999999999999999999999999999999877654
No 29
>cd01388 SOX-TCF_HMG-box SOX-TCF_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include SRY and its homologs in insects and vertebrates, and transcription factor-like proteins, TCF-1, -3, -4, and LEF-1. They appear to bind the minor groove of the A/T C A A A G/C-motif.
Probab=96.78 E-value=0.0011 Score=43.98 Aligned_cols=24 Identities=38% Similarity=0.358 Sum_probs=20.8
Q ss_pred CCCCCchHHHHHHHhhhhhhhcCC
Q psy14923 154 PKKPGNSFYLFIKDNIDKKEYEGK 177 (180)
Q Consensus 154 PKrP~say~lF~~e~r~~~~~~~~ 177 (180)
.|||+|||++||++.|..+..+|.
T Consensus 2 iKrP~naf~~F~~~~r~~~~~~~p 25 (72)
T cd01388 2 IKRPMNAFMLFSKRHRRKVLQEYP 25 (72)
T ss_pred CCCCCcHHHHHHHHHHHHHHHHCC
Confidence 589999999999999998876653
No 30
>PF08073 CHDNT: CHDNT (NUC034) domain; InterPro: IPR012958 The CHD N-terminal domain is found in PHD/RING fingers and chromo domain-associated helicases [].; GO: 0003677 DNA binding, 0005524 ATP binding, 0008270 zinc ion binding, 0016818 hydrolase activity, acting on acid anhydrides, in phosphorus-containing anhydrides, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=96.25 E-value=0.0062 Score=38.38 Aligned_cols=40 Identities=25% Similarity=0.551 Sum_probs=36.3
Q ss_pred CChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCH
Q psy14923 53 LTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGE 92 (180)
Q Consensus 53 ~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~ 92 (180)
++.|-+|.+..|+.+.+.||++....+...++.+|++-++
T Consensus 13 lt~yK~Fsq~vRP~l~~~NPk~~~sKl~~l~~AKwrEF~~ 52 (55)
T PF08073_consen 13 LTNYKAFSQHVRPLLAKANPKAPMSKLMMLLQAKWREFQE 52 (55)
T ss_pred HHHHHHHHHHHHHHHHHHCCCCcHHHHHHHHHHHHHHHHh
Confidence 4568899999999999999999999999999999997654
No 31
>KOG0381|consensus
Probab=96.14 E-value=0.0047 Score=42.89 Aligned_cols=28 Identities=29% Similarity=0.554 Sum_probs=24.8
Q ss_pred cc--CCCCCCCchHHHHHHHhhhhhhhcCC
Q psy14923 150 ET--GKPKKPGNSFYLFIKDNIDKKEYEGK 177 (180)
Q Consensus 150 ~~--~~PKrP~say~lF~~e~r~~~~~~~~ 177 (180)
|+ +.||+|+|||++|+.+.+..+..+|.
T Consensus 17 ~p~~~~pkrp~sa~~~f~~~~~~~~k~~~p 46 (96)
T KOG0381|consen 17 DPNAQAPKRPLSAFFLFSSEQRSKIKAENP 46 (96)
T ss_pred CCCCCCCCCCCcHHHHHHHHHHHHHHHhCC
Confidence 55 59999999999999999999888775
No 32
>PF06244 DUF1014: Protein of unknown function (DUF1014); InterPro: IPR010422 This family consists of several hypothetical eukaryotic proteins of unknown function.
Probab=92.24 E-value=0.21 Score=36.71 Aligned_cols=45 Identities=20% Similarity=0.453 Sum_probs=39.9
Q ss_pred CCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCH
Q psy14923 48 KPKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGE 92 (180)
Q Consensus 48 ~PKrP~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~ 92 (180)
|-+|---||.-|...+.+.++.++|++..+++...|-.+|..-++
T Consensus 72 PErR~KAAy~afeE~~Lp~lK~E~PgLrlsQ~kq~l~K~w~KSPe 116 (122)
T PF06244_consen 72 PERRMKAAYKAFEERRLPELKEENPGLRLSQYKQMLWKEWQKSPE 116 (122)
T ss_pred cchhHHHHHHHHHHHHhHHHHhhCCCchHHHHHHHHHHHHhcCCC
Confidence 344445789999999999999999999999999999999988765
No 33
>KOG3248|consensus
Probab=91.13 E-value=0.12 Score=44.08 Aligned_cols=23 Identities=39% Similarity=0.478 Sum_probs=20.0
Q ss_pred CCCCCCchHHHHHHHhhhhhhhc
Q psy14923 153 KPKKPGNSFYLFIKDNIDKKEYE 175 (180)
Q Consensus 153 ~PKrP~say~lF~~e~r~~~~~~ 175 (180)
-.|+|+|||+||++|+|.++-.+
T Consensus 191 hiKKPLNAFmlyMKEmRa~vvaE 213 (421)
T KOG3248|consen 191 HIKKPLNAFMLYMKEMRAKVVAE 213 (421)
T ss_pred cccccHHHHHHHHHHHHHHHHHH
Confidence 46899999999999999988654
No 34
>KOG0527|consensus
Probab=90.49 E-value=0.17 Score=43.32 Aligned_cols=30 Identities=20% Similarity=0.277 Sum_probs=25.5
Q ss_pred hhccCCCCCCCchHHHHHHHhhhhhhhcCC
Q psy14923 148 FRETGKPKKPGNSFYLFIKDNIDKKEYEGK 177 (180)
Q Consensus 148 ~k~~~~PKrP~say~lF~~e~r~~~~~~~~ 177 (180)
.+..+..|||+||||+|++..|.++-.++.
T Consensus 57 k~~~~hIKRPMNAFMVWSq~~RRkma~qnP 86 (331)
T KOG0527|consen 57 KTSTDRIKRPMNAFMVWSQGQRRKLAKQNP 86 (331)
T ss_pred CCCccccCCCcchhhhhhHHHHHHHHHhCc
Confidence 355678999999999999999999877764
No 35
>PF11304 DUF3106: Protein of unknown function (DUF3106); InterPro: IPR021455 Some members in this family of proteins are annotated as transmembrane proteins however this cannot be confirmed. Currently no function is known.
Probab=89.97 E-value=3 Score=29.82 Aligned_cols=53 Identities=11% Similarity=0.349 Sum_probs=38.6
Q ss_pred HHHHHHHHHHhhcCCHHHHHHHHHHHHH-------HHHHHHHHHHhhhhhCCHHHHHHHHH
Q psy14923 77 IEVTKLVSEEWKKIGEAEKKRLENEYAK-------EKEIYAQNVKKYHDELTPEAIEFLRK 130 (180)
Q Consensus 77 ~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~-------~k~~y~~~~~~y~~~~~~~~~~~~~~ 130 (180)
.++..-++..|..|+++.+..+.+.+.. +..+....|..|.. ++|+++..+..
T Consensus 10 q~~L~pl~~~W~~l~~~qr~k~l~~a~r~~~mspeqq~r~~~rm~~W~~-LspeqR~~~R~ 69 (107)
T PF11304_consen 10 QQALAPLAERWNSLPPEQRRKWLQIAERWPSMSPEQQQRLRERMRRWAA-LSPEQRQQARE 69 (107)
T ss_pred HHHHHHHHHHHhcCCHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHh-CCHHHHHHHHH
Confidence 4556678888888888888887777754 67777788887776 67777655443
No 36
>PF04769 MAT_Alpha1: Mating-type protein MAT alpha 1; InterPro: IPR006856 This family includes Saccharomyces cerevisiae (Baker's yeast) mating type protein alpha 1 (P01365 from SWISSPROT). MAT alpha 1 is a transcription activator that activates mating-type alpha-specific genes with the help of the MADS-box containing MCM1 transcription factor, which together bind cooperatively to PQ elements upstream of alpha-specific genes. The MCM1-MATalpha1 complex is required for the proper DNA-bending that is needed for transcriptional activation []. Alpha 1 interacts in vivo with STE12, linking expression of alpha-specific genes to the alpha-pheromone (IPR006742 from INTERPRO) response pathway [].; GO: 0000772 mating pheromone activity, 0003677 DNA binding, 0045895 positive regulation of transcription, mating-type specific, 0005634 nucleus
Probab=88.99 E-value=1.4 Score=35.09 Aligned_cols=55 Identities=22% Similarity=0.345 Sum_probs=41.4
Q ss_pred CCCCCCCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCHHHHHHHHHHHH
Q psy14923 43 LSIPEKPKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYA 103 (180)
Q Consensus 43 ~~~p~~PKrP~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~ 103 (180)
......++||+|+|+.|..=.- ...|+....+++..|+..|..=+- |..|.-.|.
T Consensus 38 ~~~~~~~kr~lN~Fm~FRsyy~----~~~~~~~Qk~~S~~l~~lW~~dp~--k~~W~l~ak 92 (201)
T PF04769_consen 38 KRSPEKAKRPLNGFMAFRSYYS----PIFPPLPQKELSGILTKLWEKDPF--KNKWSLMAK 92 (201)
T ss_pred cccccccccchhHHHHHHHHHH----hhcCCcCHHHHHHHHHHHHhCCcc--HhHHHHHhh
Confidence 3445678999999999976664 456788899999999999987432 566655543
No 37
>TIGR03481 HpnM hopanoid biosynthesis associated membrane protein HpnM. The genomes containing members of this family share the machinery for the biosynthesis of hopanoid lipids. Furthermore, the genes of this family are usually located proximal to other components of this biological process. The proteins are members of the pfam05494 family of putative transporters known as "toluene tolerance protein Ttg2D", although it is unlikely that the members included here have anything to do with toluene per-se.
Probab=86.13 E-value=2.1 Score=33.94 Aligned_cols=46 Identities=13% Similarity=0.443 Sum_probs=38.8
Q ss_pred CCCHHHHHH-HHHHHhhcCCHHHHHHHHHHHHH-HHHHHHHHHHhhhh
Q psy14923 73 NIKIIEVTK-LVSEEWKKIGEAEKKRLENEYAK-EKEIYAQNVKKYHD 118 (180)
Q Consensus 73 ~~~~~ei~k-~l~~~Wk~Ls~~eK~~y~~~a~~-~k~~y~~~~~~y~~ 118 (180)
-.++..+++ .+|.-|+.+|+++++.|.+.... ....|-..+..|..
T Consensus 63 ~~Df~~mar~vLG~~W~~~s~~Qr~~F~~~F~~~l~~tY~~~l~~y~~ 110 (198)
T TIGR03481 63 AFDLPAMARLTLGSSWTSLSPEQRRRFIGAFRELSIATYASQFKSYAG 110 (198)
T ss_pred hCCHHHHHHHHhhhhhhhCCHHHHHHHHHHHHHHHHHHHHHHHHhhcC
Confidence 467777766 58999999999999999999988 66778888888865
No 38
>cd00225 API3 Ascaris pepsin inhibitor-3 (API3); protein inhibitor that reversibly inhibits aspartic proteinase cathepsin E, and gastric enzymes pepsin and gastricsin.
Probab=83.27 E-value=1.9 Score=32.67 Aligned_cols=29 Identities=17% Similarity=0.402 Sum_probs=19.1
Q ss_pred HHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhhhhhC
Q psy14923 84 SEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDEL 120 (180)
Q Consensus 84 ~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~y~~~~ 120 (180)
|-.|+.|+.+|.+. ...|..+..+|+..+
T Consensus 25 g~~lReLt~~Eq~e--------l~~y~~d~~~yK~~~ 53 (159)
T cd00225 25 GFPLRELTPDEQQE--------LAQYVEDVADYKEEV 53 (159)
T ss_pred CceeeeCCHHHHHH--------HHHHHHHHHHHHHHH
Confidence 34699999999443 345666666666543
No 39
>PRK15117 ABC transporter periplasmic binding protein MlaC; Provisional
Probab=82.99 E-value=4 Score=32.63 Aligned_cols=48 Identities=17% Similarity=0.358 Sum_probs=39.1
Q ss_pred CCCCHHHHHH-HHHHHhhcCCHHHHHHHHHHHHHH-HHHHHHHHHhhhhh
Q psy14923 72 PNIKIIEVTK-LVSEEWKKIGEAEKKRLENEYAKE-KEIYAQNVKKYHDE 119 (180)
Q Consensus 72 p~~~~~ei~k-~l~~~Wk~Ls~~eK~~y~~~a~~~-k~~y~~~~~~y~~~ 119 (180)
|-.++..+++ .+|.-|+.+|+++++.|.+....- ...|-..+..|...
T Consensus 66 p~~Df~~~s~~vLG~~wr~as~eQr~~F~~~F~~~Lv~tYa~~l~~y~~q 115 (211)
T PRK15117 66 PYVQVKYAGALVLGRYYKDATPAQREAYFAAFREYLKQAYGQALAMYHGQ 115 (211)
T ss_pred ccCCHHHHHHHHhhhhhhhCCHHHHHHHHHHHHHHHHHHHHHHHHHhCCc
Confidence 4567777766 689999999999999999988874 45788888888753
No 40
>PF11304 DUF3106: Protein of unknown function (DUF3106); InterPro: IPR021455 Some members in this family of proteins are annotated as transmembrane proteins however this cannot be confirmed. Currently no function is known.
Probab=79.41 E-value=20 Score=25.52 Aligned_cols=32 Identities=16% Similarity=0.401 Sum_probs=14.8
Q ss_pred HHhCCCCCHHHHHHHHH--HHhhcCCHHHHHHHH
Q psy14923 68 MERNPNIKIIEVTKLVS--EEWKKIGEAEKKRLE 99 (180)
Q Consensus 68 ~~~~p~~~~~ei~k~l~--~~Wk~Ls~~eK~~y~ 99 (180)
....+.++..+-..... ..|.+||++++..-.
T Consensus 35 a~r~~~mspeqq~r~~~rm~~W~~LspeqR~~~R 68 (107)
T PF11304_consen 35 AERWPSMSPEQQQRLRERMRRWAALSPEQRQQAR 68 (107)
T ss_pred HHHHhcCCHHHHHHHHHHHHHHHhCCHHHHHHHH
Confidence 33444555433332222 356666666655433
No 41
>PF12881 NUT_N: NUT protein N terminus; InterPro: IPR024309 This domain is found in the N-terminal region of Nuclear Testis (NUT) proteins. It is also found in FAM22, which are a family of uncharacterised mammalian proteins.
Probab=77.19 E-value=6.8 Score=33.26 Aligned_cols=53 Identities=11% Similarity=0.107 Sum_probs=44.4
Q ss_pred CChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCHHHHHHHHHHHHHH
Q psy14923 53 LTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKE 105 (180)
Q Consensus 53 ~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~ 105 (180)
..|+.+|..-....+....|.++..|-....-++|...|.-+|..|.++|++=
T Consensus 229 ~EAlSCFLIpvLrsLar~kPtMtlEeGl~ra~qEW~~~SnfdRmifyemaekF 281 (328)
T PF12881_consen 229 AEALSCFLIPVLRSLARLKPTMTLEEGLWRAVQEWQHTSNFDRMIFYEMAEKF 281 (328)
T ss_pred hhhhhhhHHHHHHHHHhcCCCccHHHHHHHHHHHhhccccccHHHHHHHHHHH
Confidence 55677777777667777788999999999999999999999999999999753
No 42
>KOG3223|consensus
Probab=76.58 E-value=4 Score=32.29 Aligned_cols=44 Identities=18% Similarity=0.460 Sum_probs=39.5
Q ss_pred CCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCHH
Q psy14923 50 KKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEA 93 (180)
Q Consensus 50 KrP~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~~ 93 (180)
+|=.-||.-|-....+.++.+||++..++.-.+|-.+|..-+|.
T Consensus 166 kRmrAA~~afEe~~LPrLK~e~P~lrlsQ~Kqll~Kew~KsPDN 209 (221)
T KOG3223|consen 166 KRMRAAFKAFEEARLPRLKKENPGLRLSQYKQLLKKEWQKSPDN 209 (221)
T ss_pred HHHHHHHHHHHHhhchhhhhcCCCccHHHHHHHHHHHHhhCCCC
Confidence 44467799999999999999999999999999999999988874
No 43
>PF05494 Tol_Tol_Ttg2: Toluene tolerance, Ttg2 ; InterPro: IPR008869 Toluene tolerance is mediated by increased cell membrane rigidity resulting from changes in fatty acid and phospholipid compositions, exclusion of toluene from the cell membrane, and removal of intracellular toluene by degradation []. Many proteins are involved in these processes. This family is a transporter which shows similarity to ABC transporters [].; PDB: 2QGU_A.
Probab=75.39 E-value=5 Score=30.55 Aligned_cols=47 Identities=17% Similarity=0.475 Sum_probs=34.4
Q ss_pred CCCCHHHHHHH-HHHHhhcCCHHHHHHHHHHHHHH-HHHHHHHHHhhhh
Q psy14923 72 PNIKIIEVTKL-VSEEWKKIGEAEKKRLENEYAKE-KEIYAQNVKKYHD 118 (180)
Q Consensus 72 p~~~~~ei~k~-l~~~Wk~Ls~~eK~~y~~~a~~~-k~~y~~~~~~y~~ 118 (180)
|-.++..+++. ||.-|+.+|+++++.|.+...+. ...|-..+..|..
T Consensus 36 ~~~D~~~~ar~~LG~~w~~~s~~q~~~F~~~f~~~l~~~Y~~~l~~y~~ 84 (170)
T PF05494_consen 36 PYFDFERMARRVLGRYWRKASPAQRQRFVEAFKQLLVRTYAKRLDEYSG 84 (170)
T ss_dssp GGB-HHHHHHHHHGGGTTTS-HHHHHHHHHHHHHHHHHHHHHHHHT-SS
T ss_pred HhCCHHHHHHHHHHHhHhhCCHHHHHHHHHHHHHHHHHHHHHHHHhhCC
Confidence 55677777665 78899999999999999988774 4556677777765
No 44
>KOG3838|consensus
Probab=65.53 E-value=7.6 Score=34.17 Aligned_cols=37 Identities=11% Similarity=0.173 Sum_probs=32.5
Q ss_pred cCCHHHHHHHHHHHHHHHHHHHHHHHhhhhhCCHHHH
Q psy14923 89 KIGEAEKKRLENEYAKEKEIYAQNVKKYHDELTPEAI 125 (180)
Q Consensus 89 ~Ls~~eK~~y~~~a~~~k~~y~~~~~~y~~~~~~~~~ 125 (180)
++.+.||++|.++.+.....+.++.++|.+.+++++.
T Consensus 268 E~qe~ek~kyqeEfe~~q~elek~k~efkk~hpd~~~ 304 (497)
T KOG3838|consen 268 EMQELEKAKYQEEFEWAQLELEKRKDEFKKSHPDAQG 304 (497)
T ss_pred hhhHHHHHHHHHHHHHHHHHHhhhHhhhccCCchhhc
Confidence 4556789999999999999999999999999998753
No 45
>PF04690 YABBY: YABBY protein; InterPro: IPR006780 YABBY proteins are a group of plant-specific transcription factors involved in the specification of abaxial polarity in lateral organs such as leaves and floral organs [, ].
Probab=65.27 E-value=6 Score=30.72 Aligned_cols=27 Identities=19% Similarity=0.373 Sum_probs=21.6
Q ss_pred cCCCCCCCchHHHHHHHhhhhhhhcCC
Q psy14923 151 TGKPKKPGNSFYLFIKDNIDKKEYEGK 177 (180)
Q Consensus 151 ~~~PKrP~say~lF~~e~r~~~~~~~~ 177 (180)
+.+-.|-.|||+.|++|....++..+.
T Consensus 119 PEKRqR~psaYn~f~k~ei~rik~~~p 145 (170)
T PF04690_consen 119 PEKRQRVPSAYNRFMKEEIQRIKAENP 145 (170)
T ss_pred ccccCCCchhHHHHHHHHHHHHHhcCC
Confidence 334456699999999999999887764
No 46
>PF01352 KRAB: KRAB box; InterPro: IPR001909 The Krueppel-associated box (KRAB) is a domain of around 75 amino acids that is found in the N-terminal part of about one third of eukaryotic Krueppel-type C2H2 zinc finger proteins (ZFPs) []. It is enriched in charged amino acids and can be divided into subregions A and B, which are predicted to fold into two amphipathic alpha-helices. The KRAB A and B boxes can be separated by variable spacer segments and many KRAB proteins contain only the A box []. The functions currently known for members of the KRAB-containing protein family include transcriptional repression of RNA polymerase I, II, and III promoters, binding and splicing of RNA, and control of nucleolus function. The KRAB domain functions as a transcriptional repressor when tethered to the template DNA by a DNA-binding domain. A sequence of 45 amino acids in the KRAB A subdomain has been shown to be necessary and sufficient for transcriptional repression. The B box does not repress by itself but does potentiate the repression exerted by the KRAB A subdomain [, ]. Gene silencing requires the binding of the KRAB domain to the RING-B box-coiled coil (RBCC) domain of the KAP-1/TIF1-beta corepressor. As KAP-1 binds to the heterochromatin proteins HP1, it has been proposed that the KRAB-ZFP-bound target gene could be silenced following recruitment to heterochromatin [, ]. KRAB-ZFPs probably constitute the single largest class of transcription factors within the human genome []. Although the function of KRAB-ZFPs is largely unknown, they appear to play important roles during cell differentiation and development. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B.; GO: 0003676 nucleic acid binding, 0006355 regulation of transcription, DNA-dependent, 0005622 intracellular; PDB: 1V65_A.
Probab=63.64 E-value=5.7 Score=23.34 Aligned_cols=29 Identities=24% Similarity=0.402 Sum_probs=16.6
Q ss_pred HHHHHHHHH-HHhhcCCHHHHHHHHHHHHH
Q psy14923 76 IIEVTKLVS-EEWKKIGEAEKKRLENEYAK 104 (180)
Q Consensus 76 ~~ei~k~l~-~~Wk~Ls~~eK~~y~~~a~~ 104 (180)
+.+|+--++ +.|..|.+.+|.-|.+...+
T Consensus 3 f~Dvav~fs~eEW~~L~~~Qk~ly~dvm~E 32 (41)
T PF01352_consen 3 FEDVAVYFSQEEWELLDPAQKNLYRDVMLE 32 (41)
T ss_dssp ----TT---HHHHHTS-HHHHHHHHHHHHH
T ss_pred EEEEEEEcChhhcccccceecccchhHHHH
Confidence 445555555 56999999999998876544
No 47
>PF13875 DUF4202: Domain of unknown function (DUF4202)
Probab=62.72 E-value=19 Score=28.33 Aligned_cols=39 Identities=10% Similarity=0.319 Sum_probs=33.2
Q ss_pred hHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCHHHHH
Q psy14923 55 PYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKK 96 (180)
Q Consensus 55 ay~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~~eK~ 96 (180)
+=++|...+.+.+...| +-..+..++..-|+.||+..++
T Consensus 131 acLVFL~~~f~~F~~~~---deeK~v~Il~KTw~KMS~~g~~ 169 (185)
T PF13875_consen 131 ACLVFLEYYFEDFAAKH---DEEKIVDILRKTWRKMSERGHE 169 (185)
T ss_pred HHHHhHHHHHHHHHhcC---CHHHHHHHHHHHHHHCCHHHHH
Confidence 46789999999998877 5567888999999999998775
No 48
>PF06945 DUF1289: Protein of unknown function (DUF1289); InterPro: IPR010710 This family consists of a number of hypothetical bacterial proteins. The aligned region spans around 56 residues and contains 4 highly conserved cysteine residues towards the N terminus. The function of this family is unknown.
Probab=56.30 E-value=21 Score=21.85 Aligned_cols=26 Identities=15% Similarity=0.473 Sum_probs=18.8
Q ss_pred CHHHHHHHHHHHhhcCCHHHHHHHHHHHHHH
Q psy14923 75 KIIEVTKLVSEEWKKIGEAEKKRLENEYAKE 105 (180)
Q Consensus 75 ~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~ 105 (180)
+..||.. |..|++++|....+.....
T Consensus 23 T~dEI~~-----W~~~s~~er~~i~~~l~~R 48 (51)
T PF06945_consen 23 TLDEIRD-----WKSMSDDERRAILARLRAR 48 (51)
T ss_pred cHHHHHH-----HhhCCHHHHHHHHHHHHHH
Confidence 4566655 9999999998776655443
No 49
>PRK09706 transcriptional repressor DicA; Reviewed
Probab=52.90 E-value=53 Score=23.87 Aligned_cols=42 Identities=17% Similarity=0.153 Sum_probs=35.9
Q ss_pred HHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhhhhh
Q psy14923 78 EVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDE 119 (180)
Q Consensus 78 ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~y~~~ 119 (180)
+-...+-..|+.|++++++..........+.|.+-+++|...
T Consensus 87 ~~~~~ll~~~~~L~~~~~~~~l~~l~~~~~~~~~~~~~~~~~ 128 (135)
T PRK09706 87 EDQKELLELFDALPESEQDAQLSEMRARVENFNKLFEELLKA 128 (135)
T ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345688999999999999999999999888888888877763
No 50
>KOG0528|consensus
Probab=49.25 E-value=6.9 Score=35.11 Aligned_cols=20 Identities=35% Similarity=0.526 Sum_probs=17.5
Q ss_pred CCCCCCchHHHHHHHhhhhh
Q psy14923 153 KPKKPGNSFYLFIKDNIDKK 172 (180)
Q Consensus 153 ~PKrP~say~lF~~e~r~~~ 172 (180)
--|||+||||++.++.|.++
T Consensus 325 HIKRPMNAFMVWAkDERRKI 344 (511)
T KOG0528|consen 325 HIKRPMNAFMVWAKDERRKI 344 (511)
T ss_pred cccCCcchhhcccchhhhhh
Confidence 45999999999999998765
No 51
>PF12650 DUF3784: Domain of unknown function (DUF3784); InterPro: IPR017259 This group represents an uncharacterised conserved protein.
Probab=43.11 E-value=17 Score=25.10 Aligned_cols=15 Identities=13% Similarity=0.501 Sum_probs=13.0
Q ss_pred HhhcCCHHHHHHHHH
Q psy14923 86 EWKKIGEAEKKRLEN 100 (180)
Q Consensus 86 ~Wk~Ls~~eK~~y~~ 100 (180)
-|+.||+|||+.|.+
T Consensus 25 Gyntms~eEk~~~D~ 39 (97)
T PF12650_consen 25 GYNTMSKEEKEKYDK 39 (97)
T ss_pred hcccCCHHHHHHhhH
Confidence 489999999999965
No 52
>COG2854 Ttg2D ABC-type transport system involved in resistance to organic solvents, auxiliary component [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=41.66 E-value=37 Score=27.16 Aligned_cols=41 Identities=10% Similarity=0.276 Sum_probs=34.1
Q ss_pred HHHHHHhhcCCHHHHHHHHHHHHHH-HHHHHHHHHhhhhhCC
Q psy14923 81 KLVSEEWKKIGEAEKKRLENEYAKE-KEIYAQNVKKYHDELT 121 (180)
Q Consensus 81 k~l~~~Wk~Ls~~eK~~y~~~a~~~-k~~y~~~~~~y~~~~~ 121 (180)
..||.-|+.+|+++++.|.+..... ...|-..+..|..+..
T Consensus 78 ~vLGk~~k~aspeQ~~~F~~aF~~yl~q~Y~~aL~~Y~~q~~ 119 (202)
T COG2854 78 LVLGKYYKTASPEQRQAFFKAFRTYLEQTYGQALLDYKGQTL 119 (202)
T ss_pred HHhccccccCCHHHHHHHHHHHHHHHHHHHHHHHHHccCCCc
Confidence 4588999999999999999888774 4568889999988653
No 53
>PRK10236 hypothetical protein; Provisional
Probab=40.30 E-value=33 Score=28.06 Aligned_cols=26 Identities=19% Similarity=0.494 Sum_probs=21.1
Q ss_pred HHHHHHHHhhcCCHHHHHHHHHHHHH
Q psy14923 79 VTKLVSEEWKKIGEAEKKRLENEYAK 104 (180)
Q Consensus 79 i~k~l~~~Wk~Ls~~eK~~y~~~a~~ 104 (180)
+.+.+...|..||++|++.+.+.-..
T Consensus 118 l~kll~~a~~kms~eE~~~L~~~l~~ 143 (237)
T PRK10236 118 LEQFLRNTWKKMDEEHKQEFLHAVDA 143 (237)
T ss_pred HHHHHHHHHHHCCHHHHHHHHHHHhh
Confidence 47789999999999999888765443
No 54
>KOG1610|consensus
Probab=32.79 E-value=1.3e+02 Score=25.73 Aligned_cols=39 Identities=21% Similarity=0.516 Sum_probs=30.9
Q ss_pred HHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy14923 76 IIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYH 117 (180)
Q Consensus 76 ~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~y~ 117 (180)
...+.+.+.+.|..|+++.|+.|-+.+..+ |...+..|.
T Consensus 218 ~~~~~~~~~~~w~~l~~e~k~~YGedy~~~---~~~~~~~~~ 256 (322)
T KOG1610|consen 218 PEKLEKRMKEIWERLPQETKDEYGEDYFED---YKKSLEKYL 256 (322)
T ss_pred hHHHHHHHHHHHhcCCHHHHHHHHHHHHHH---HHHHHHhhh
Confidence 578889999999999999999998877665 444455444
No 55
>PRK12751 cpxP periplasmic stress adaptor protein CpxP; Reviewed
Probab=32.16 E-value=1.2e+02 Score=23.27 Aligned_cols=23 Identities=9% Similarity=0.208 Sum_probs=12.2
Q ss_pred HHHHhhcCCHHHHHHHHHHHHHH
Q psy14923 83 VSEEWKKIGEAEKKRLENEYAKE 105 (180)
Q Consensus 83 l~~~Wk~Ls~~eK~~y~~~a~~~ 105 (180)
.-+++..|++++|..|.+..++.
T Consensus 123 ~~qmy~lLTPEQra~l~~~~e~r 145 (162)
T PRK12751 123 RNQMYNLLTPEQKEALNKKHQER 145 (162)
T ss_pred HHHHHHcCCHHHHHHHHHHHHHH
Confidence 34455566666666665544433
No 56
>PRK12750 cpxP periplasmic repressor CpxP; Reviewed
Probab=31.94 E-value=1.4e+02 Score=22.99 Aligned_cols=23 Identities=17% Similarity=0.226 Sum_probs=14.1
Q ss_pred HHHHhhhhhCCHHHHHHHHHHHH
Q psy14923 111 QNVKKYHDELTPEAIEFLRKEKE 133 (180)
Q Consensus 111 ~~~~~y~~~~~~~~~~~~~~~~~ 133 (180)
+...+....+||+++..+.....
T Consensus 128 ~~~~~~~~vLTpEQRak~~e~~~ 150 (170)
T PRK12750 128 EKRHQMLSILTPEQKAKFQELQQ 150 (170)
T ss_pred HHHHHHHHhCCHHHHHHHHHHHH
Confidence 33444566788888776665543
No 57
>cd07081 ALDH_F20_ACDH_EutE-like Coenzyme A acylating aldehyde dehydrogenase (ACDH), Ethanolamine utilization protein EutE, and related proteins. Coenzyme A acylating aldehyde dehydrogenase (ACDH), an NAD+ and CoA-dependent acetaldehyde dehydrogenase, acetylating (EC=1.2.1.10), functions as a single enzyme (such as the Ethanolamine utilization protein, EutE, in Salmonella typhimurium) or as part of a multifunctional enzyme to convert acetaldehyde into acetyl-CoA. The E. coli aldehyde-alcohol dehydrogenase includes the functional domains, alcohol dehydrogenase (ADH), ACDH, and pyruvate-formate-lyase deactivase; and the Entamoeba histolytica aldehyde-alcohol dehydrogenase 2 (ALDH20A1) includes the functional domains ADH and ACDH, and may be critical enzymes in the fermentative pathway.
Probab=31.48 E-value=1.2e+02 Score=26.85 Aligned_cols=41 Identities=10% Similarity=-0.026 Sum_probs=34.1
Q ss_pred HHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhhhh
Q psy14923 78 EVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHD 118 (180)
Q Consensus 78 ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~y~~ 118 (180)
+.++..-..|+.++.++|..+...+.+..+++.+++.....
T Consensus 6 ~~A~~A~~~W~~~~~~~R~~iL~~~a~~l~~~~~ela~~~~ 46 (439)
T cd07081 6 AAAKVAQQGLSCKSQEMVDLIFRAAAEAAEDARIDLAKLAV 46 (439)
T ss_pred HHHHHHHHHHhhCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34555667899999999999999999999999888887643
No 58
>PF15581 Imm35: Immunity protein 35
Probab=29.06 E-value=92 Score=21.55 Aligned_cols=23 Identities=22% Similarity=0.471 Sum_probs=18.0
Q ss_pred CHHHHHHHHHHHhhcCCHHHHHH
Q psy14923 75 KIIEVTKLVSEEWKKIGEAEKKR 97 (180)
Q Consensus 75 ~~~ei~k~l~~~Wk~Ls~~eK~~ 97 (180)
+..-+...+...|+.|++++=..
T Consensus 31 ~i~~l~~lIe~eWRGl~~~qV~~ 53 (93)
T PF15581_consen 31 TIRNLESLIEHEWRGLPEEQVLY 53 (93)
T ss_pred HHHHHHHHHHHHHcCCCHHHHHH
Confidence 35667889999999999876543
No 59
>PRK14296 chaperone protein DnaJ; Provisional
Probab=26.85 E-value=1.3e+02 Score=26.18 Aligned_cols=38 Identities=11% Similarity=0.204 Sum_probs=27.2
Q ss_pred HHHHHHhCCCCC----HHHHHHHHHHHhhcCCHHHHHHHHHH
Q psy14923 64 RNKIMERNPNIK----IIEVTKLVSEEWKKIGEAEKKRLENE 101 (180)
Q Consensus 64 r~~~~~~~p~~~----~~ei~k~l~~~Wk~Ls~~eK~~y~~~ 101 (180)
|...+.-|||.+ ..+-.++|.+.|..|||.+|..-.++
T Consensus 26 rkla~~~HPD~n~~~~a~~~F~~i~~AyevLsD~~KR~~YD~ 67 (372)
T PRK14296 26 RKLAKQYHPDLNKSPDAHDKMVEINEAADVLLDKDKRKQYDQ 67 (372)
T ss_pred HHHHHHHCcCCCCCchHHHHHHHHHHHHHHhcCHHHhhhhhh
Confidence 333445677764 45778889999999999987755554
No 60
>PF02026 RyR: RyR domain; InterPro: IPR003032 This domain is called RyR for Ryanodine receptor []. The domain is found in four copies in the ryanodine receptor. The function of this domain is unknown.; PDB: 4ETV_A 3RQR_A 4ETT_A 4ERT_A 4ESU_A 4ETU_A 4ERV_A 3NRT_E.
Probab=26.33 E-value=69 Score=22.18 Aligned_cols=21 Identities=14% Similarity=0.267 Sum_probs=15.8
Q ss_pred HhhcCCHHHHHHHHHHHHHHH
Q psy14923 86 EWKKIGEAEKKRLENEYAKEK 106 (180)
Q Consensus 86 ~Wk~Ls~~eK~~y~~~a~~~k 106 (180)
-|..|++++|+.+.+.+.+-.
T Consensus 60 py~~L~e~eK~~dr~~~~e~l 80 (94)
T PF02026_consen 60 PYDELSEEEKEKDRDMVRETL 80 (94)
T ss_dssp -GGGS-HHHHHHHHHHHHHHH
T ss_pred ChhhCCHHHHHHhHHHHHHHH
Confidence 499999999999988776543
No 61
>PF06628 Catalase-rel: Catalase-related immune-responsive; InterPro: IPR010582 Catalases (1.11.1.6 from EC) are antioxidant enzymes that catalyse the conversion of hydrogen peroxide to water and molecular oxygen, serving to protect cells from its toxic effects []. Hydrogen peroxide is produced as a consequence of oxidative cellular metabolism and can be converted to the highly reactive hydroxyl radical via transition metals, this radical being able to damage a wide variety of molecules within a cell, leading to oxidative stress and cell death. Catalases act to neutralise hydrogen peroxide toxicity, and are produced by all aerobic organisms ranging from bacteria to man. Most catalases are mono-functional, haem-containing enzymes, although there are also bifunctional haem-containing peroxidase/catalases (IPR000763 from INTERPRO) that are closely related to plant peroxidases, and non-haem, manganese-containing catalases (IPR007760 from INTERPRO) that are found in bacteria []. This entry represents a small conserved region within catalase enzymes that carries the immune-responsive amphipathic octa-peptide that is recognised by T cells [].; PDB: 2CAH_A 1NM0_A 1H7K_A 1E93_A 1H6N_A 3HB6_A 2CAG_A 1M85_A 1MQF_A 1A4E_C ....
Probab=26.03 E-value=76 Score=20.39 Aligned_cols=19 Identities=16% Similarity=0.429 Sum_probs=15.8
Q ss_pred HHHHHhhcCCHHHHHHHHH
Q psy14923 82 LVSEEWKKIGEAEKKRLEN 100 (180)
Q Consensus 82 ~l~~~Wk~Ls~~eK~~y~~ 100 (180)
..+..|+.|+++||+.+.+
T Consensus 12 Qa~~ly~~l~~~er~~lv~ 30 (68)
T PF06628_consen 12 QARDLYRVLSDEERERLVE 30 (68)
T ss_dssp HHHHHHHHSSHHHHHHHHH
T ss_pred hHHHHHHHCCHHHHHHHHH
Confidence 4678899999999987764
No 62
>PF13945 NST1: Salt tolerance down-regulator
Probab=25.68 E-value=1.9e+02 Score=22.94 Aligned_cols=37 Identities=19% Similarity=0.300 Sum_probs=26.2
Q ss_pred HHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHh
Q psy14923 76 IIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKK 115 (180)
Q Consensus 76 ~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~ 115 (180)
..+-...|-+-|-.|+++||.....+-++ .-.+.|.+
T Consensus 100 s~eEre~LkeFW~SL~eeERr~LVkIEKe---~VLkkmKe 136 (190)
T PF13945_consen 100 SQEEREKLKEFWESLSEEERRSLVKIEKE---AVLKKMKE 136 (190)
T ss_pred hHHHHHHHHHHHHccCHHHHHHHHHhhHH---HHHHHHHH
Confidence 44666789999999999999977665444 34444444
No 63
>KOG4715|consensus
Probab=25.61 E-value=53 Score=28.17 Aligned_cols=29 Identities=14% Similarity=0.263 Sum_probs=24.1
Q ss_pred hccCCCCCCCchHHHHHHHhhhhhhhcCC
Q psy14923 149 RETGKPKKPGNSFYLFIKDNIDKKEYEGK 177 (180)
Q Consensus 149 k~~~~PKrP~say~lF~~e~r~~~~~~~~ 177 (180)
+-|..|-+|+-.|+-|+.-+|++++..|.
T Consensus 60 kpPkppekpl~pymrySrkvWd~VkA~nP 88 (410)
T KOG4715|consen 60 KPPKPPEKPLMPYMRYSRKVWDQVKASNP 88 (410)
T ss_pred CCCCCCCcccchhhHHhhhhhhhhhccCc
Confidence 34556777899999999999999988775
No 64
>PF05388 Carbpep_Y_N: Carboxypeptidase Y pro-peptide; InterPro: IPR008442 This signature is found at the N terminus of carboxypeptidase Y, which belong to MEROPS peptidase family S10. This region contains the signal peptide and pro-peptide regions [,].; GO: 0004185 serine-type carboxypeptidase activity, 0005773 vacuole
Probab=25.41 E-value=1.4e+02 Score=21.51 Aligned_cols=28 Identities=14% Similarity=0.234 Sum_probs=23.8
Q ss_pred HHHHHHHHHHHhhcCCHHHHHHHHHHHH
Q psy14923 76 IIEVTKLVSEEWKKIGEAEKKRLENEYA 103 (180)
Q Consensus 76 ~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~ 103 (180)
..-+++.+++.++.|+.+-|+.|.|+..
T Consensus 45 ~~~~~~~l~e~l~~Lt~e~k~~W~E~~~ 72 (113)
T PF05388_consen 45 LEKISKYLNEPLKSLTSEAKALWDEMML 72 (113)
T ss_pred HHHHHHHHHHHHhhccHHHHHHHHHHHH
Confidence 4566778999999999999999988764
No 65
>PRK14279 chaperone protein DnaJ; Provisional
Probab=24.23 E-value=1.3e+02 Score=26.39 Aligned_cols=41 Identities=24% Similarity=0.251 Sum_probs=29.2
Q ss_pred HHHHHHHHhCCCCC-----HHHHHHHHHHHhhcCCHHHHHHHHHHH
Q psy14923 62 QIRNKIMERNPNIK-----IIEVTKLVSEEWKKIGEAEKKRLENEY 102 (180)
Q Consensus 62 e~r~~~~~~~p~~~-----~~ei~k~l~~~Wk~Ls~~eK~~y~~~a 102 (180)
..|...+.-|||.. ..+..+.|.+.|..|||++|..-.+++
T Consensus 29 ayr~la~~~HPD~~~~~~~a~~~f~~i~~Ay~vLsD~~KR~~YD~~ 74 (392)
T PRK14279 29 AYRKLARELHPDANPGDPAAEERFKAVSEAHDVLSDPAKRKEYDET 74 (392)
T ss_pred HHHHHHHHHCcCCCCCChHHHHHHHHHHHHHHHhcchhhhhHHHHh
Confidence 34444556688764 357778899999999999887655543
No 66
>PF08367 M16C_assoc: Peptidase M16C associated; InterPro: IPR013578 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This domain appears in eukaryotes as well as bacteria and tends to be found near the C terminus of metalloproteases and related sequences belonging to MEROPS peptidase family M16 (subfamily M16C, clan ME). These include: eupitrilysin, falcilysin, PreP peptidase, CYM1 peptidase and subfamily M16C non-peptidase homologues.; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis; PDB: 2FGE_B 3S5I_A 3S5H_A 3S5M_A 3S5K_A.
Probab=24.02 E-value=1.7e+02 Score=23.59 Aligned_cols=30 Identities=17% Similarity=0.159 Sum_probs=23.6
Q ss_pred HHHHHHHHHHhhcCCHHHHHHHHHHHHHHH
Q psy14923 77 IEVTKLVSEEWKKIGEAEKKRLENEYAKEK 106 (180)
Q Consensus 77 ~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k 106 (180)
.+..+.|.+.+..||+++++...+.+++.+
T Consensus 13 ~~e~~~L~~~k~~Ls~~e~~~i~~~~~~L~ 42 (248)
T PF08367_consen 13 EEEKEKLAAYKASLSEEEKEKIIEQTKELK 42 (248)
T ss_dssp HHHHHHHHHHHHCS-HHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHhhCCHHHHHHHHHHHHHHH
Confidence 456778999999999999998888777654
No 67
>cd07122 ALDH_F20_ACDH Coenzyme A acylating aldehyde dehydrogenase (ACDH), ALDH family 20-like. Coenzyme A acylating aldehyde dehydrogenase (ACDH, EC=1.2.1.10), an NAD+ and CoA-dependent acetaldehyde dehydrogenase, functions as a single enzyme (such as the Ethanolamine utilization protein, EutE, in Salmonella typhimurium) or as part of a multifunctional enzyme to convert acetaldehyde into acetyl-CoA . The E. coli aldehyde-alcohol dehydrogenase includes the functional domains, alcohol dehydrogenase (ADH), ACDH, and pyruvate-formate-lyase deactivase; and the Entamoeba histolytica aldehyde-alcohol dehydrogenase 2 (ALDH20A1) includes the functional domains ADH and ACDH and may be critical enzymes in the fermentative pathway.
Probab=23.73 E-value=2.2e+02 Score=25.22 Aligned_cols=40 Identities=10% Similarity=0.102 Sum_probs=33.1
Q ss_pred HHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhhhh
Q psy14923 79 VTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHD 118 (180)
Q Consensus 79 i~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~y~~ 118 (180)
.++..-..|+.++.++|..+...+.+..+++.+++.....
T Consensus 7 ~A~~A~~~W~~~~~~eR~~~L~~~a~~l~~~~eela~~~~ 46 (436)
T cd07122 7 RARKAQREFATFSQEQVDKIVEAVAWAAADAAEELAKMAV 46 (436)
T ss_pred HHHHHHHHHHhCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3455567899999999999999999998888888877653
No 68
>PRK14283 chaperone protein DnaJ; Provisional
Probab=23.68 E-value=1.8e+02 Score=25.24 Aligned_cols=41 Identities=20% Similarity=0.341 Sum_probs=29.6
Q ss_pred HHHHHHHHhCCCCC----HHHHHHHHHHHhhcCCHHHHHHHHHHH
Q psy14923 62 QIRNKIMERNPNIK----IIEVTKLVSEEWKKIGEAEKKRLENEY 102 (180)
Q Consensus 62 e~r~~~~~~~p~~~----~~ei~k~l~~~Wk~Ls~~eK~~y~~~a 102 (180)
..|...+.-|||.+ ..+..+.|.+.|..|++.+|..-.+.+
T Consensus 25 ayr~la~~~HPD~~~~~~a~~~f~~i~~Ay~~Lsd~~kR~~YD~~ 69 (378)
T PRK14283 25 AYRKLARKYHPDVSEEEGAEEKFKEISEAYAVLSDDEKRQRYDQF 69 (378)
T ss_pred HHHHHHHHHCcCCCCCccHHHHHHHHHHHHHHhchhHHHHHHhhh
Confidence 34444556688864 467888999999999999876655543
No 69
>TIGR00787 dctP tripartite ATP-independent periplasmic transporter solute receptor, DctP family. TRAP-T (Tripartite ATP-independent Periplasmic Transporter) family proteins generally consist of three components, and these systems have so far been found in Gram-negative bacteria, Gram-postive bacteria and archaea. The best characterized example is the DctPQM system of Rhodobacter capsulatus, a C4 dicarboxylate (malate, fumarate, succinate) transporter. This model represents the DctP family, one of at least three major families of extracytoplasmic solute receptor for TRAP family transporters. Other are the SnoM family (see pfam03480) and TAXI (TRAP-associated extracytoplasmic immunogenic) family.
Probab=22.83 E-value=1.9e+02 Score=23.19 Aligned_cols=26 Identities=12% Similarity=0.169 Sum_probs=20.1
Q ss_pred HHHhhcCCHHHHHHHHHHHHHHHHHH
Q psy14923 84 SEEWKKIGEAEKKRLENEYAKEKEIY 109 (180)
Q Consensus 84 ~~~Wk~Ls~~eK~~y~~~a~~~k~~y 109 (180)
...|..||++.|+...+.+.+.-...
T Consensus 213 ~~~~~~L~~e~q~~i~~a~~~~~~~~ 238 (257)
T TIGR00787 213 KAFWKSLPPDLQAVVKEAAKEAGEYQ 238 (257)
T ss_pred HHHHhcCCHHHHHHHHHHHHHHHHHH
Confidence 57799999999999888776654333
No 70
>PF06394 Pepsin-I3: Pepsin inhibitor-3-like repeated domain; InterPro: IPR010480 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. The members of this group of proteins belong to MEROPS inhibitor family I33, clan IR; the nematode aspartyl protease inhibitors or Aspins. They are restricted to parasitic nematode species. Structural features common to the nematode Aspins include the presence of a signal peptide sequence and the conservation of all four cysteine residues in the mature protein. The Y[V.A]RDLT sequence motif has been suggested as being of crucial functional importance in several filarial nematode inhibitors [], this sequence is not conserved in Tco-API-1 from Trichostrongylus colubriformis (Black scour worm) and it has been demonstrated that Tco-API-1, is not an Aspin as it does not inhibit porcine pepsin []. Related inhibitors from Onchocerca volvulus, Ov33 [] and Ascaris suum (Pig roundworm), PI-3 [] inhibit the in vitro activity of aspartyl proteases such as pepsin and cathepsin E (MEROPS peptidase family A1). Aspin may facilitate the safe passage of the eggs of Ascaris through the host stomach without digestion by pepsin [, ]. The other parasitic nematodes known to express homologous proteins do not pass through the stomach of their hosts []. Several proteins in the family are potent allergens in mammals. The three-dimensional structures of pepsin inhibitor-3 (PI-3) from A. suum and of the complex between PI-3 and porcine pepsin at 1. 75 A and 2.45 A resolution, respectively, have revealed the mechanism of aspartic protease inhibition. PI-3 has a new fold consisting of two identical domains, each comprising an antiparallel beta-sheet flanked by an alpha-helix. In the enzyme-inhibitor complex, the N-terminal beta-strand of PI-3 pairs with one strand of the 'active site flap' (residues 70-82) of pepsin, thus forming an eight-stranded beta-sheet that spans the two proteins. PI-3 has a novel mode of inhibition, using its N-terminal residues to occupy and therefore block the first three binding pockets in pepsin for substrate residues C-terminal to the scissile bond (S1'-S3') [].; PDB: 1F32_A 1F34_B.
Probab=22.80 E-value=82 Score=21.13 Aligned_cols=26 Identities=23% Similarity=0.577 Sum_probs=17.1
Q ss_pred hcCCHHHHHHHHHHHHHHHHHHHHHHHhhhhhCC
Q psy14923 88 KKIGEAEKKRLENEYAKEKEIYAQNVKKYHDELT 121 (180)
Q Consensus 88 k~Ls~~eK~~y~~~a~~~k~~y~~~~~~y~~~~~ 121 (180)
+.|+++|++.+ +.|.+++.+|...+.
T Consensus 38 R~Lt~~E~~eL--------~~y~~~v~~y~~~l~ 63 (76)
T PF06394_consen 38 RDLTPDEQQEL--------KTYQKKVAAYKEQLQ 63 (76)
T ss_dssp EE--HHHHHHH--------HHHHHHHHHHHHHHT
T ss_pred ccCCHHHHHHH--------HHHHHHHHHHHHHHH
Confidence 56777777654 568888888887654
No 71
>PRK14291 chaperone protein DnaJ; Provisional
Probab=22.20 E-value=1.8e+02 Score=25.25 Aligned_cols=40 Identities=20% Similarity=0.363 Sum_probs=28.5
Q ss_pred HHHHHHHHhCCCCC----HHHHHHHHHHHhhcCCHHHHHHHHHH
Q psy14923 62 QIRNKIMERNPNIK----IIEVTKLVSEEWKKIGEAEKKRLENE 101 (180)
Q Consensus 62 e~r~~~~~~~p~~~----~~ei~k~l~~~Wk~Ls~~eK~~y~~~ 101 (180)
..|...+.-|||.+ ..+..+.|.+.|..|++.+|..-.+.
T Consensus 23 ayr~la~~~HPD~~~~~~~~~~f~~i~~Ay~vLsd~~kR~~YD~ 66 (382)
T PRK14291 23 AYRRLARKYHPDFNKNPEAEEKFKEINEAYQVLSDPEKRKLYDQ 66 (382)
T ss_pred HHHHHHHHHCCCCCCCccHHHHHHHHHHHHHHhcCHHHHHHHhh
Confidence 34444556678764 45778899999999999887655554
No 72
>cd07132 ALDH_F3AB Aldehyde dehydrogenase family 3 members A1, A2, and B1 and related proteins. NAD(P)+-dependent, aldehyde dehydrogenase, family 3 members A1 and B1 (ALDH3A1, ALDH3B1, EC=1.2.1.5) and fatty aldehyde dehydrogenase, family 3 member A2 (ALDH3A2, EC=1.2.1.3), and similar sequences are included in this CD. Human ALDH3A1 is a homodimer with a critical role in cellular defense against oxidative stress; it catalyzes the oxidation of various cellular membrane lipid-derived aldehydes. Corneal crystalline ALDH3A1 protects the cornea and underlying lens against UV-induced oxidative stress. Human ALDH3A2, a microsomal homodimer, catalyzes the oxidation of long-chain aliphatic aldehydes to fatty acids. Human ALDH3B1 is highly expressed in the kidney and liver and catalyzes the oxidation of various medium- and long-chain saturated and unsaturated aliphatic aldehydes.
Probab=22.15 E-value=2.2e+02 Score=25.16 Aligned_cols=41 Identities=5% Similarity=-0.154 Sum_probs=33.8
Q ss_pred HHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhhhh
Q psy14923 78 EVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHD 118 (180)
Q Consensus 78 ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~y~~ 118 (180)
+.++..-..|+.++..+|..+........+.+.+++.+-..
T Consensus 5 ~~A~~A~~~w~~~~~~~R~~~L~~~a~~l~~~~~~l~~~~~ 45 (443)
T cd07132 5 RRAREAFSSGKTRPLEFRIQQLEALLRMLEENEDEIVEALA 45 (443)
T ss_pred HHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHhHHHHHHHHH
Confidence 34556667899999999999999999998888888876553
No 73
>PTZ00037 DnaJ_C chaperone protein; Provisional
Probab=22.11 E-value=1.7e+02 Score=25.94 Aligned_cols=43 Identities=21% Similarity=0.383 Sum_probs=32.2
Q ss_pred HHHHHHHHHHhCCCCC-HHHHHHHHHHHhhcCCHHHHHHHHHHH
Q psy14923 60 QGQIRNKIMERNPNIK-IIEVTKLVSEEWKKIGEAEKKRLENEY 102 (180)
Q Consensus 60 ~~e~r~~~~~~~p~~~-~~ei~k~l~~~Wk~Ls~~eK~~y~~~a 102 (180)
-+..|...+.-|||.. ..+..+.|.+.|..|++.+|....+.+
T Consensus 46 KkAYrkla~k~HPDk~~~~e~F~~i~~AYevLsD~~kR~~YD~~ 89 (421)
T PTZ00037 46 KKAYRKLAIKHHPDKGGDPEKFKEISRAYEVLSDPEKRKIYDEY 89 (421)
T ss_pred HHHHHHHHHHHCCCCCchHHHHHHHHHHHHHhccHHHHHHHhhh
Confidence 4455555667799975 357888999999999999877666653
No 74
>cd08317 Death_ank Death domain associated with Ankyrins. Death Domain (DD) associated with Ankyrins. Ankyrins are modular proteins comprising three conserved domains, an N-terminal membrane-binding domain containing ANK repeats, a spectrin-binding domain and a C-terminal DD. Ankyrins function as adaptor proteins and they interact, through ANK repeats, with structurally diverse membrane proteins, including ion channels/pumps, calcium release channels, and cell adhesion molecules. They play critical roles in the proper expression and membrane localization of these proteins. In mammals, this family includes ankyrin-R for restricted (or ANK1), ankyrin-B for broadly expressed (or ANK2) and ankyrin-G for general or giant (or ANK3). They are expressed in different combinations in many tissues and play non-overlapping functions. In general, DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-associati
Probab=21.76 E-value=50 Score=22.06 Aligned_cols=21 Identities=14% Similarity=0.749 Sum_probs=17.6
Q ss_pred CCCCHHHHHHHHHHHhhcCCH
Q psy14923 72 PNIKIIEVTKLVSEEWKKIGE 92 (180)
Q Consensus 72 p~~~~~ei~k~l~~~Wk~Ls~ 92 (180)
+++.+..|+..||..|+.|..
T Consensus 3 ~~~~l~~ia~~lG~dW~~LAr 23 (84)
T cd08317 3 ADIRLADISNLLGSDWPQLAR 23 (84)
T ss_pred ccchHHHHHHHHhhHHHHHHH
Confidence 567789999999999998754
No 75
>cd07085 ALDH_F6_MMSDH Methylmalonate semialdehyde dehydrogenase and ALDH family members 6A1 and 6B2. Methylmalonate semialdehyde dehydrogenase (MMSDH, EC=1.2.1.27) [acylating] from Bacillus subtilis is involved in valine metabolism and catalyses the NAD+- and CoA-dependent oxidation of methylmalonate semialdehyde into propionyl-CoA. Mitochondrial human MMSDH ALDH6A1 and Arabidopsis MMSDH ALDH6B2 are also present in this CD.
Probab=21.16 E-value=2.5e+02 Score=24.91 Aligned_cols=37 Identities=8% Similarity=0.165 Sum_probs=30.7
Q ss_pred HHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy14923 80 TKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKY 116 (180)
Q Consensus 80 ~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~y 116 (180)
++.....|+.++.++|..+...+.+..+++.+++..-
T Consensus 47 A~~A~~~w~~~~~~~R~~~L~~~a~~l~~~~~el~~~ 83 (478)
T cd07085 47 AKAAFPAWSATPVLKRQQVMFKFRQLLEENLDELARL 83 (478)
T ss_pred HHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4445567999999999999999999888888777764
No 76
>cd07133 ALDH_CALDH_CalB Coniferyl aldehyde dehydrogenase-like. Coniferyl aldehyde dehydrogenase (CALDH, EC=1.2.1.68) of Pseudomonas sp. strain HR199 (CalB) which catalyzes the NAD+-dependent oxidation of coniferyl aldehyde to ferulic acid, and similar sequences, are present in this CD.
Probab=21.02 E-value=2.4e+02 Score=24.83 Aligned_cols=40 Identities=8% Similarity=-0.168 Sum_probs=33.6
Q ss_pred HHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy14923 78 EVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYH 117 (180)
Q Consensus 78 ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~y~ 117 (180)
+.++..-..|+.++..+|..+.....+..+.+.+++....
T Consensus 5 ~~a~~a~~~w~~~~~~~R~~~L~~~a~~l~~~~~el~~~~ 44 (434)
T cd07133 5 ERQKAAFLANPPPSLEERRDRLDRLKALLLDNQDALAEAI 44 (434)
T ss_pred HHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3455666789999999999999999999999998888754
No 77
>PRK14299 chaperone protein DnaJ; Provisional
Probab=20.82 E-value=2.2e+02 Score=23.70 Aligned_cols=39 Identities=18% Similarity=0.373 Sum_probs=26.3
Q ss_pred HHHHHHHhCCCCC----HHHHHHHHHHHhhcCCHHHHHHHHHH
Q psy14923 63 IRNKIMERNPNIK----IIEVTKLVSEEWKKIGEAEKKRLENE 101 (180)
Q Consensus 63 ~r~~~~~~~p~~~----~~ei~k~l~~~Wk~Ls~~eK~~y~~~ 101 (180)
.|...+.-||+.. ..+..+.+.+.|..|++.+|..-.+.
T Consensus 25 yr~la~~~HPD~~~~~~~~~~f~~i~~Ay~~L~d~~kr~~yD~ 67 (291)
T PRK14299 25 FKKLARKYHPDVNKSPGAEEKFKEINEAYTVLSDPEKRRIYDT 67 (291)
T ss_pred HHHHHHHHCcCCCCChhHHHHHHHHHHHHHHhcCHHHHHHHHh
Confidence 3444445577764 45677889999999998876644443
No 78
>KOG1827|consensus
Probab=20.56 E-value=6.1 Score=36.69 Aligned_cols=44 Identities=27% Similarity=0.420 Sum_probs=39.6
Q ss_pred CCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCHHHH
Q psy14923 52 PLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEK 95 (180)
Q Consensus 52 P~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~~eK 95 (180)
-.++|++|..+.+..+-.++|+..+++++.+.|..|..|+..-|
T Consensus 552 ~~~~~~~~s~~~~~~~~~~np~v~~~~~~~~vg~~~~~lp~~~k 595 (629)
T KOG1827|consen 552 SPEPYILDSIENRTIIWFENPTVGFGEVSIIVGNDWDKLPNINK 595 (629)
T ss_pred CCccccccccccCceeeeeCCCcccceeEEeecCCcccCccccc
Confidence 56789999999999999999999999999999999999994433
No 79
>PRK10266 curved DNA-binding protein CbpA; Provisional
Probab=20.40 E-value=2.5e+02 Score=23.55 Aligned_cols=39 Identities=15% Similarity=0.331 Sum_probs=27.4
Q ss_pred HHHHHHHhCCCCC----HHHHHHHHHHHhhcCCHHHHHHHHHH
Q psy14923 63 IRNKIMERNPNIK----IIEVTKLVSEEWKKIGEAEKKRLENE 101 (180)
Q Consensus 63 ~r~~~~~~~p~~~----~~ei~k~l~~~Wk~Ls~~eK~~y~~~ 101 (180)
.|...+.-|||.. ..+..+.+.+.|..|++.+|..-.+.
T Consensus 25 yr~la~k~HPD~~~~~~~~~~f~~i~~Ay~~L~~~~kr~~yD~ 67 (306)
T PRK10266 25 YRRLARKYHPDVSKEPDAEARFKEVAEAWEVLSDEQRRAEYDQ 67 (306)
T ss_pred HHHHHHHHCcCCCCCccHHHHHHHHHHHHHHhhhHHHHHHHHH
Confidence 3444455677754 56788899999999998776644443
No 80
>PF09164 VitD-bind_III: Vitamin D binding protein, domain III; InterPro: IPR015247 This domain is predominantly found in Vitamin D binding proteins, and adopts a multihelical structure. It is required for formation of an actin 'clamp', allowing the protein to bind to actin []. ; PDB: 1MA9_A 1KW2_A 1KXP_D 1J7E_A 1J78_A 1LOT_A.
Probab=20.12 E-value=2.7e+02 Score=18.23 Aligned_cols=32 Identities=13% Similarity=0.415 Sum_probs=21.3
Q ss_pred ChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHH
Q psy14923 54 TPYFKFQGQIRNKIMERNPNIKIIEVTKLVSE 85 (180)
Q Consensus 54 say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~ 85 (180)
+.|.-|-+.-.+.++...|+.+..++...+.+
T Consensus 9 ~tFtEyKKrL~e~l~~k~P~at~~~l~~lve~ 40 (68)
T PF09164_consen 9 NTFTEYKKRLAERLRAKLPDATPTELKELVEK 40 (68)
T ss_dssp S-HHHHHHHHHHHHHHH-TTS-HHHHHHHHHH
T ss_pred ccHHHHHHHHHHHHHHHCCCCCHHHHHHHHHH
Confidence 34666777777788888888888877766544
No 81
>PRK10363 cpxP periplasmic repressor CpxP; Reviewed
Probab=20.07 E-value=2.9e+02 Score=21.37 Aligned_cols=23 Identities=4% Similarity=0.173 Sum_probs=10.6
Q ss_pred HHHHhhcCCHHHHHHHHHHHHHH
Q psy14923 83 VSEEWKKIGEAEKKRLENEYAKE 105 (180)
Q Consensus 83 l~~~Wk~Ls~~eK~~y~~~a~~~ 105 (180)
--+++.-|++|+|..|.+..++.
T Consensus 117 ~nqmy~lLTPEQKaq~~~~~~~r 139 (166)
T PRK10363 117 RNQMYRLLTPEQQAVLNEKHQQR 139 (166)
T ss_pred HHHHHHhCCHHHHHHHHHHHHHH
Confidence 33444555555555554433333
Done!