Query psy1937
Match_columns 307
No_of_seqs 298 out of 1165
Neff 6.2
Searched_HMMs 46136
Date Fri Aug 16 20:05:50 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy1937.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/1937hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4715|consensus 100.0 2.5E-69 5.5E-74 498.2 18.3 215 79-302 56-279 (410)
2 PTZ00199 high mobility group p 99.8 1.1E-19 2.4E-24 144.6 9.0 73 74-146 9-83 (94)
3 cd01389 MATA_HMG-box MATA_HMG- 99.7 1.5E-17 3.2E-22 127.2 7.6 71 87-157 1-71 (77)
4 KOG4715|consensus 99.7 8.2E-18 1.8E-22 156.7 2.5 58 1-58 181-238 (410)
5 cd01388 SOX-TCF_HMG-box SOX-TC 99.7 1.3E-16 2.9E-21 120.5 7.4 68 88-155 2-69 (72)
6 PF00505 HMG_box: HMG (high mo 99.6 4.2E-16 9.2E-21 115.6 6.9 68 88-155 1-68 (69)
7 cd01390 HMGB-UBF_HMG-box HMGB- 99.6 5.4E-16 1.2E-20 114.0 6.8 59 88-146 1-59 (66)
8 smart00398 HMG high mobility g 99.6 1.3E-15 2.8E-20 112.6 7.8 69 87-155 1-69 (70)
9 COG5648 NHP6B Chromatin-associ 99.6 7.7E-16 1.7E-20 137.0 6.2 83 77-159 60-142 (211)
10 KOG0527|consensus 99.6 2.3E-15 5.1E-20 143.9 7.0 78 81-158 56-133 (331)
11 cd00084 HMG-box High Mobility 99.5 1.3E-14 2.8E-19 106.1 6.8 59 88-146 1-59 (66)
12 PF09011 HMG_box_2: HMG-box do 99.5 1.1E-14 2.5E-19 110.2 6.5 62 85-146 1-63 (73)
13 KOG3248|consensus 99.5 1.7E-14 3.7E-19 135.7 7.5 73 87-159 191-269 (421)
14 KOG0381|consensus 99.5 1.3E-13 2.9E-18 109.0 8.3 63 84-146 17-81 (96)
15 KOG0526|consensus 99.3 2.9E-12 6.2E-17 127.0 5.5 76 77-156 525-600 (615)
16 KOG0528|consensus 98.9 7.7E-10 1.7E-14 108.9 4.1 73 84-156 322-394 (511)
17 KOG2746|consensus 98.3 2.8E-07 6.1E-12 94.1 3.6 66 81-146 175-242 (683)
18 PF08549 SWI-SNF_Ssr4: Fungal 96.7 0.0036 7.9E-08 64.9 7.2 85 206-293 281-406 (669)
19 PF04690 YABBY: YABBY protein; 96.5 0.0038 8.3E-08 54.9 4.5 49 82-130 116-164 (170)
20 PF06382 DUF1074: Protein of u 95.9 0.0095 2.1E-07 52.5 4.0 49 92-144 83-131 (183)
21 COG5648 NHP6B Chromatin-associ 95.9 0.0062 1.3E-07 55.0 2.9 62 85-146 141-202 (211)
22 PF14887 HMG_box_5: HMG (high 95.8 0.031 6.7E-07 43.0 5.8 69 87-158 3-73 (85)
23 PF08073 CHDNT: CHDNT (NUC034) 93.6 0.076 1.6E-06 38.3 3.0 39 92-130 13-51 (55)
24 PF06320 GCN5L1: GCN5-like pro 90.5 1.8 3.9E-05 36.1 8.1 59 239-297 28-89 (121)
25 PF04769 MAT_Alpha1: Mating-ty 90.0 0.39 8.6E-06 43.5 4.1 54 83-142 39-92 (201)
26 PF06244 DUF1014: Protein of u 84.8 0.93 2E-05 37.9 3.0 47 86-132 70-117 (122)
27 PF11068 YlqD: YlqD protein; 56.7 74 0.0016 26.9 7.9 61 243-303 26-94 (131)
28 PF11629 Mst1_SARAH: C termina 55.7 57 0.0012 23.0 5.8 35 247-284 11-45 (49)
29 TIGR03481 HpnM hopanoid biosyn 54.8 14 0.00031 33.1 3.5 39 111-150 62-101 (198)
30 PRK15117 ABC transporter perip 53.2 18 0.00038 32.8 3.9 55 103-158 57-115 (211)
31 PF05494 Tol_Tol_Ttg2: Toluene 52.6 15 0.00032 31.6 3.2 47 110-157 35-84 (170)
32 COG4238 Murein lipoprotein [Ce 51.0 1.1E+02 0.0024 23.5 7.7 57 240-296 21-77 (78)
33 PF04728 LPP: Lipoprotein leuc 49.2 1E+02 0.0022 22.4 7.8 42 243-284 2-43 (56)
34 COG1579 Zn-ribbon protein, pos 48.9 1E+02 0.0023 28.7 8.3 53 243-295 102-154 (239)
35 KOG3223|consensus 46.9 16 0.00035 33.0 2.5 53 86-141 162-215 (221)
36 PF02370 M: M protein repeat; 42.3 39 0.00085 19.7 2.8 17 260-276 3-19 (21)
37 PF02172 KIX: KIX domain; Int 40.3 80 0.0017 24.5 5.2 44 216-270 11-57 (81)
38 PF12958 DUF3847: Protein of u 39.4 8.2 0.00018 30.4 -0.4 24 14-37 39-63 (86)
39 PF12210 Hrs_helical: Hepatocy 38.5 78 0.0017 25.4 4.9 56 228-283 28-95 (96)
40 PF07200 Mod_r: Modifier of ru 33.8 2.1E+02 0.0044 24.0 7.3 70 218-289 12-93 (150)
41 PF07334 IFP_35_N: Interferon- 32.8 98 0.0021 23.8 4.5 30 248-277 4-33 (76)
42 PRK09039 hypothetical protein; 32.7 1.9E+02 0.0041 28.2 7.7 54 237-290 128-183 (343)
43 PF00038 Filament: Intermediat 31.7 2.8E+02 0.006 25.9 8.6 53 243-295 222-274 (312)
44 cd07598 BAR_FAM92 The Bin/Amph 27.4 2.7E+02 0.006 25.2 7.3 50 243-296 3-52 (211)
45 PF12718 Tropomyosin_1: Tropom 27.3 2.9E+02 0.0063 23.4 7.1 46 243-288 27-72 (143)
46 PF07352 Phage_Mu_Gam: Bacteri 26.6 3.2E+02 0.007 23.0 7.3 53 244-299 10-62 (149)
47 COG3937 Uncharacterized conser 26.6 1.1E+02 0.0025 25.0 4.1 37 225-267 70-106 (108)
48 COG3105 Uncharacterized protei 26.4 1.9E+02 0.0041 24.6 5.5 28 258-285 34-61 (138)
49 PF10211 Ax_dynein_light: Axon 24.5 4.6E+02 0.01 23.2 8.1 34 245-278 128-161 (189)
50 PRK06800 fliH flagellar assemb 24.4 3.3E+02 0.0072 24.7 7.0 42 250-291 37-81 (228)
51 PF03915 AIP3: Actin interacti 24.4 2.2E+02 0.0049 28.8 6.7 59 236-297 231-289 (424)
52 PF11304 DUF3106: Protein of u 24.0 2.1E+02 0.0045 23.1 5.3 39 105-143 33-73 (107)
53 PRK15335 type III secretion sy 23.2 1.9E+02 0.0041 24.6 4.9 45 245-289 74-121 (147)
54 PRK10361 DNA recombination pro 23.1 4E+02 0.0086 27.4 8.2 52 245-296 68-119 (475)
55 PF09006 Surfac_D-trimer: Lung 23.0 2.3E+02 0.0049 19.8 4.5 23 247-269 2-24 (46)
56 KOG0241|consensus 22.5 1E+02 0.0022 34.6 3.9 90 202-295 333-426 (1714)
57 PF09340 NuA4: Histone acetylt 22.0 2.1E+02 0.0046 21.9 4.7 32 252-283 3-34 (80)
58 PRK10328 DNA binding protein, 21.4 2.6E+02 0.0056 23.7 5.6 55 225-281 1-58 (134)
59 PF04340 DUF484: Protein of un 20.4 2.2E+02 0.0047 25.7 5.3 63 207-276 5-72 (225)
No 1
>KOG4715|consensus
Probab=100.00 E-value=2.5e-69 Score=498.17 Aligned_cols=215 Identities=62% Similarity=0.921 Sum_probs=206.7
Q ss_pred CCCCCCCCCCCCCccchhchhHHHHHHHHHHcCCCCHHHHHHHHHhhccCCChHHhHHHHHHHHHHHH-----HHHh---
Q psy1937 79 GRGPKAPKPPEKPLMPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKN-----LKAS--- 150 (307)
Q Consensus 79 ~k~~kdp~~PKRPlsAy~lf~~e~r~~v~~e~P~~~~~eisk~lg~~Wk~Ls~eeK~~Y~~~ae~~K~-----~k~y--- 150 (307)
...++.|++|.+|+.+||+|++.+|++|++.||++.+|||+|+||.||..|+++||+.|...|+.+|. |++|
T Consensus 56 ~t~pkpPkppekpl~pymrySrkvWd~VkA~nPe~kLWeiGK~Ig~mW~dLpd~EK~ey~~EYeaEKieY~~smkayh~s 135 (410)
T KOG4715|consen 56 ITRPKPPKPPEKPLMPYMRYSRKVWDQVKASNPELKLWEIGKIIGGMWLDLPDEEKQEYLNEYEAEKIEYNESMKAYHNS 135 (410)
T ss_pred ccCCCCCCCCCcccchhhHHhhhhhhhhhccCcchHHHHHHHHHHHHHhhCcchHHHHHHHHHHHHHHHHHHHHHHhhCC
Confidence 45678899999999999999999999999999999999999999999999999999999999999998 8999
Q ss_pred -hhhhccCccchhhhhhccChhhhhhhhhcCCCCccccccCCccccCCCCcccccCCCCccccccccccccchhhHHhhh
Q psy1937 151 -TKLASISRPTWRAKAKASKPKQLRKSREARPKDSGEKMQIGRIDIQPAEDEDEQEDGYSVKHVAYARYLRNHRLINEIF 229 (307)
Q Consensus 151 -~y~~y~~~k~~~~ka~~~~~~q~~~~~e~~~~~~~~~~~~~~~~~~p~ededd~ddg~S~k~~a~~Rf~rNH~l~~Eif 229 (307)
+|.+||++| ++++++. +.++++..++...|+++|+|||++|+||+|||||+||+|.+||+||||||+|||
T Consensus 136 p~y~ayinaK-sra~a~l--------e~~sr~~~sr~~~ge~~~~IQPaeDeDD~ddg~stkhla~arf~rN~rLIsei~ 206 (410)
T KOG4715|consen 136 PAYLAYINAK-SRAEAAL--------EEESRQRQSRMEKGEPYMSIQPAEDEDDYDDGFSTKHLATARFQRNHRLISEIL 206 (410)
T ss_pred chHHHHhhhh-hhhhhhh--------ccccccccchhhcCCcceecccccCccccccccchhhhhhhhhhhhhHHHHHHh
Confidence 899999999 9999888 667777778888899999999999999999999999999999999999999999
Q ss_pred cccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhhhhHHHHHHHHhhhccCCCC
Q psy1937 230 SDSVVPDVRSVVTTNRMQVLKRQVQSLTMHQKKLEAELQQIEEKFEAKKRKFVESSEQFQEELKKEKASCSEN 302 (307)
Q Consensus 230 s~~~v~d~rs~~~~~r~~~l~~~~~sL~~hq~kle~el~~~e~~~~~kk~~~~~~s~~f~~~~~~~~~~~~~~ 302 (307)
|.+||||+|||||+.||+||+|||+||++||+|||+||++||++|++|||+|+++||.|++|||++|+++|++
T Consensus 207 SesvVpDvrsVVtt~RMqvlkrQv~SL~~HQ~KLEaEL~q~Ee~hq~kKrk~~estdsf~~eLKr~c~~kvev 279 (410)
T KOG4715|consen 207 SESVVPDVRSVVTTARMQVLKRQVQSLMVHQRKLEAELLQIEERHQEKKRKFLESTDSFNNELKRLCGLKVEV 279 (410)
T ss_pred ccccccchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccHHHHHHHHHhcCCcccc
Confidence 9999999999999999999999999999999999999999999999999999999999999999999999954
No 2
>PTZ00199 high mobility group protein; Provisional
Probab=99.81 E-value=1.1e-19 Score=144.61 Aligned_cols=73 Identities=26% Similarity=0.592 Sum_probs=67.7
Q ss_pred ccccCCCCCCCCCCCCCCccchhchhHHHHHHHHHHcCCCC--HHHHHHHHHhhccCCChHHhHHHHHHHHHHHH
Q psy1937 74 TAKTDGRGPKAPKPPEKPLMPYMRYSRKVWDQVKAQNLDLK--LWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKN 146 (307)
Q Consensus 74 ~~k~~~k~~kdp~~PKRPlsAy~lf~~e~r~~v~~e~P~~~--~~eisk~lg~~Wk~Ls~eeK~~Y~~~ae~~K~ 146 (307)
..+++++..+||++||||+||||+||.++|..|..+||+++ +++|+++||++|+.||+++|.+|+++|+.+++
T Consensus 9 ~~k~~~k~~kdp~~PKrP~sAY~~F~~~~R~~i~~~~P~~~~~~~evsk~ige~Wk~ls~eeK~~y~~~A~~dk~ 83 (94)
T PTZ00199 9 LVRKNKRKKKDPNAPKRALSAYMFFAKEKRAEIIAENPELAKDVAAVGKMVGEAWNKLSEEEKAPYEKKAQEDKV 83 (94)
T ss_pred cccccCCCCCCCCCCCCCCcHHHHHHHHHHHHHHHHCcCCcccHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHH
Confidence 44556677899999999999999999999999999999986 89999999999999999999999999999975
No 3
>cd01389 MATA_HMG-box MATA_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include the fungal mating type gene products MC, MATA1 and Ste11.
Probab=99.72 E-value=1.5e-17 Score=127.21 Aligned_cols=71 Identities=30% Similarity=0.377 Sum_probs=64.0
Q ss_pred CCCCCccchhchhHHHHHHHHHHcCCCCHHHHHHHHHhhccCCChHHhHHHHHHHHHHHHHHHhhhhhccC
Q psy1937 87 PPEKPLMPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKNLKASTKLASIS 157 (307)
Q Consensus 87 ~PKRPlsAy~lf~~e~r~~v~~e~P~~~~~eisk~lg~~Wk~Ls~eeK~~Y~~~ae~~K~~k~y~y~~y~~ 157 (307)
.||||+||||+|+++.|..|+.+||++++.||+++||++|+.||+++|++|.++|+.+++...-+|.+|.-
T Consensus 1 ~~kRP~naf~lf~~~~r~~~~~~~p~~~~~eisk~~g~~Wk~ls~eeK~~y~~~A~~~k~~~~~~~p~Yky 71 (77)
T cd01389 1 KIPRPRNAFILYRQDKHAQLKTENPGLTNNEISRIIGRMWRSESPEVKAYYKELAEEEKERHAREYPDYKY 71 (77)
T ss_pred CCCCCCcHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHhhCCHHHHHHHHHHHHHHHHHHHHHCCCCcc
Confidence 48999999999999999999999999999999999999999999999999999999998744435666543
No 4
>KOG4715|consensus
Probab=99.68 E-value=8.2e-18 Score=156.66 Aligned_cols=58 Identities=74% Similarity=1.032 Sum_probs=56.8
Q ss_pred CCCCcccccccccccccchhhHHhhhcCCCCCCCCcccchhhHHHHHHhhhhhhhhhc
Q psy1937 1 QEDGYSVKHVAYARYLRNHRLINEIFSDSVVPDVRSVVTTNRMQVLKRQVQSLTMHQH 58 (307)
Q Consensus 1 ~~~~~~~~~~~~~~~~rnh~Li~~~fs~~~Vpd~rslVt~~r~q~lKkq~QSL~~hq~ 58 (307)
+|||||+||+|.+||+||||||++|||++||||+||+||+.||++|++|++||++||.
T Consensus 181 ~ddg~stkhla~arf~rN~rLIsei~SesvVpDvrsVVtt~RMqvlkrQv~SL~~HQ~ 238 (410)
T KOG4715|consen 181 YDDGFSTKHLATARFQRNHRLISEILSESVVPDVRSVVTTARMQVLKRQVQSLMVHQR 238 (410)
T ss_pred cccccchhhhhhhhhhhhhHHHHHHhccccccchhhhhhHHHHHHHHHHHHHHHHHHH
Confidence 5899999999999999999999999999999999999999999999999999999995
No 5
>cd01388 SOX-TCF_HMG-box SOX-TCF_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include SRY and its homologs in insects and vertebrates, and transcription factor-like proteins, TCF-1, -3, -4, and LEF-1. They appear to bind the minor groove of the A/T C A A A G/C-motif.
Probab=99.67 E-value=1.3e-16 Score=120.46 Aligned_cols=68 Identities=22% Similarity=0.379 Sum_probs=61.5
Q ss_pred CCCCccchhchhHHHHHHHHHHcCCCCHHHHHHHHHhhccCCChHHhHHHHHHHHHHHHHHHhhhhhc
Q psy1937 88 PEKPLMPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKNLKASTKLAS 155 (307)
Q Consensus 88 PKRPlsAy~lf~~e~r~~v~~e~P~~~~~eisk~lg~~Wk~Ls~eeK~~Y~~~ae~~K~~k~y~y~~y 155 (307)
.|||+||||+||+++|..++.+||++++.||+++||++|+.||+++|++|.++|+.+++...-+|.+|
T Consensus 2 iKrP~naf~~F~~~~r~~~~~~~p~~~~~eisk~l~~~Wk~ls~~eK~~y~~~a~~~k~~y~~~~p~y 69 (72)
T cd01388 2 IKRPMNAFMLFSKRHRRKVLQEYPLKENRAISKILGDRWKALSNEEKQPYYEEAKKLKELHMKLYPDY 69 (72)
T ss_pred CCCCCcHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHHHCcCC
Confidence 58999999999999999999999999999999999999999999999999999999976333355555
No 6
>PF00505 HMG_box: HMG (high mobility group) box; InterPro: IPR000910 High mobility group (HMG or HMGB) proteins are a family of relatively low molecular weight non-histone components in chromatin. HMG1 (also called HMG-T in fish) and HMG2 are two highly related proteins that bind single-stranded DNA preferentially and unwind double-stranded DNA. Although they have no sequence specificity, they have a high affinity for bent or distorted DNA, and bend linear DNA. HMG1 and HMG2 contain two DNA-binding HMG-box domains (A and B) that show structural and functional differences, and have a long acidic C-terminal domain rich in aspartic and glutamic acid residues. The acidic tail modulates the affinity of the tandem HMG boxes in HMG1 and 2 for a variety of DNA targets. HMG1 and 2 appear to play important architectural roles in the assembly of nucleoprotein complexes in a variety of biological processes, for example V(D)J recombination, the initiation of transcription, and DNA repair []. The profile in this entry describing the HMG-domains is much more general than the signature. In addition to the HMG1 and HMG2 proteins, HMG-domains occur in single or multiple copies in the following protein classes; the SOX family of transcription factors; SRY sex determining region Y protein and related proteins []; LEF1 lymphoid enhancer binding factor 1 []; SSRP recombination signal recognition protein; MTF1 mitochondrial transcription factor 1; UBF1/2 nucleolar transcription factors; Abf2 yeast ARS-binding factor []; and Saccharomyces cerevisiae transcription factors Ixr1, Rox1, Nhp6a, Nhp6b and Spp41.; GO: 0003677 DNA binding; PDB: 1I11_A 1J3C_A 1J3D_A 1WZ6_A 1WGF_A 2D7L_A 1GT0_D 3U2B_C 2CRJ_A 2CS1_A ....
Probab=99.64 E-value=4.2e-16 Score=115.64 Aligned_cols=68 Identities=32% Similarity=0.556 Sum_probs=58.8
Q ss_pred CCCCccchhchhHHHHHHHHHHcCCCCHHHHHHHHHhhccCCChHHhHHHHHHHHHHHHHHHhhhhhc
Q psy1937 88 PEKPLMPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKNLKASTKLAS 155 (307)
Q Consensus 88 PKRPlsAy~lf~~e~r~~v~~e~P~~~~~eisk~lg~~Wk~Ls~eeK~~Y~~~ae~~K~~k~y~y~~y 155 (307)
||||+|||++||.+.+..++.+||+++..+|+++||++|++||+++|.+|.++|+.++....-++.+|
T Consensus 1 PkrP~~af~lf~~~~~~~~k~~~p~~~~~~i~~~~~~~W~~l~~~eK~~y~~~a~~~~~~y~~~~~~y 68 (69)
T PF00505_consen 1 PKRPPNAFMLFCKEKRAKLKEENPDLSNKEISKILAQMWKNLSEEEKAPYKEEAEEEKERYEKEMPEY 68 (69)
T ss_dssp SSSS--HHHHHHHHHHHHHHHHSTTSTHHHHHHHHHHHHHCSHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred CcCCCCHHHHHHHHHHHHHHHHhcccccccchhhHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 89999999999999999999999999999999999999999999999999999999865222244443
No 7
>cd01390 HMGB-UBF_HMG-box HMGB-UBF_HMG-box, class II and III members of the HMG-box superfamily of DNA-binding proteins. These proteins bind the minor groove of DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
Probab=99.63 E-value=5.4e-16 Score=113.97 Aligned_cols=59 Identities=34% Similarity=0.708 Sum_probs=57.2
Q ss_pred CCCCccchhchhHHHHHHHHHHcCCCCHHHHHHHHHhhccCCChHHhHHHHHHHHHHHH
Q psy1937 88 PEKPLMPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKN 146 (307)
Q Consensus 88 PKRPlsAy~lf~~e~r~~v~~e~P~~~~~eisk~lg~~Wk~Ls~eeK~~Y~~~ae~~K~ 146 (307)
||||+||||+|+++.|..++.+||++++.+|+++||++|++||+++|.+|.+.|+.++.
T Consensus 1 Pkrp~saf~~f~~~~r~~~~~~~p~~~~~~i~~~~~~~W~~ls~~eK~~y~~~a~~~~~ 59 (66)
T cd01390 1 PKRPLSAYFLFSQEQRPKLKKENPDASVTEVTKILGEKWKELSEEEKKKYEEKAEKDKE 59 (66)
T ss_pred CCCCCcHHHHHHHHHHHHHHHHCcCCCHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHH
Confidence 89999999999999999999999999999999999999999999999999999998854
No 8
>smart00398 HMG high mobility group.
Probab=99.62 E-value=1.3e-15 Score=112.59 Aligned_cols=69 Identities=29% Similarity=0.501 Sum_probs=61.4
Q ss_pred CCCCCccchhchhHHHHHHHHHHcCCCCHHHHHHHHHhhccCCChHHhHHHHHHHHHHHHHHHhhhhhc
Q psy1937 87 PPEKPLMPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKNLKASTKLAS 155 (307)
Q Consensus 87 ~PKRPlsAy~lf~~e~r~~v~~e~P~~~~~eisk~lg~~Wk~Ls~eeK~~Y~~~ae~~K~~k~y~y~~y 155 (307)
+||+|+||||+|+.+.|..+..+||++++.+|+++||.+|+.||+++|.+|.++|+.++....-++..|
T Consensus 1 ~pkrp~~~y~~f~~~~r~~~~~~~~~~~~~~i~~~~~~~W~~l~~~ek~~y~~~a~~~~~~y~~~~~~y 69 (70)
T smart00398 1 KPKRPMSAFMLFSQENRAKIKAENPDLSNAEISKKLGERWKLLSEEEKAPYEEKAKKDKERYEEEMPEY 69 (70)
T ss_pred CcCCCCcHHHHHHHHHHHHHHHHCcCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 589999999999999999999999999999999999999999999999999999999865322244444
No 9
>COG5648 NHP6B Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics]
Probab=99.61 E-value=7.7e-16 Score=136.99 Aligned_cols=83 Identities=24% Similarity=0.492 Sum_probs=72.5
Q ss_pred cCCCCCCCCCCCCCCccchhchhHHHHHHHHHHcCCCCHHHHHHHHHhhccCCChHHhHHHHHHHHHHHHHHHhhhhhcc
Q psy1937 77 TDGRGPKAPKPPEKPLMPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKNLKASTKLASI 156 (307)
Q Consensus 77 ~~~k~~kdp~~PKRPlsAy~lf~~e~r~~v~~e~P~~~~~eisk~lg~~Wk~Ls~eeK~~Y~~~ae~~K~~k~y~y~~y~ 156 (307)
...+..+||+.||||+||||+|+.++|++++.++|++++.+|++++|++|++|+++||.+|..+|..+++...-....|.
T Consensus 60 ~~~r~k~dpN~PKRp~sayf~y~~~~R~ei~~~~p~l~~~e~~k~~~e~WK~Ltd~eke~y~k~~~~~~erYq~ek~~y~ 139 (211)
T COG5648 60 RLVRKKKDPNGPKRPLSAYFLYSAENRDEIRKENPKLTFGEVGKLLSEKWKELTDEEKEPYYKEANSDRERYQREKEEYN 139 (211)
T ss_pred HHHHHhcCCCCCCCchhHHHHHHHHHHHHHHHhCCCCChHHHHHHHHHHHHhccHhhhhhHHHHHhhHHHHHHHHHHhhh
Confidence 44667889999999999999999999999999999999999999999999999999999999999999652222555666
Q ss_pred Ccc
Q psy1937 157 SRP 159 (307)
Q Consensus 157 ~~k 159 (307)
.+.
T Consensus 140 ~k~ 142 (211)
T COG5648 140 KKL 142 (211)
T ss_pred ccc
Confidence 555
No 10
>KOG0527|consensus
Probab=99.58 E-value=2.3e-15 Score=143.90 Aligned_cols=78 Identities=21% Similarity=0.334 Sum_probs=69.0
Q ss_pred CCCCCCCCCCCccchhchhHHHHHHHHHHcCCCCHHHHHHHHHhhccCCChHHhHHHHHHHHHHHHHHHhhhhhccCc
Q psy1937 81 GPKAPKPPEKPLMPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKNLKASTKLASISR 158 (307)
Q Consensus 81 ~~kdp~~PKRPlsAy~lf~~e~r~~v~~e~P~~~~~eisk~lg~~Wk~Ls~eeK~~Y~~~ae~~K~~k~y~y~~y~~~ 158 (307)
..+.....||||||||+|++..|.++..+||++.+.||+|.||.+|+.|+++||.||+++|++.|+++.-+|.+|.-.
T Consensus 56 ~k~~~~hIKRPMNAFMVWSq~~RRkma~qnP~mHNSEISK~LG~~WK~Lse~EKrPFi~EAeRLR~~HmkehPdYKYR 133 (331)
T KOG0527|consen 56 DKTSTDRIKRPMNAFMVWSQGQRRKLAKQNPKMHNSEISKRLGAEWKLLSEEEKRPFVDEAERLRAQHMKEYPDYKYR 133 (331)
T ss_pred CCCCccccCCCcchhhhhhHHHHHHHHHhCcchhhHHHHHHHHHHHhhcCHhhhccHHHHHHHHHHHHHHhCCCcccc
Confidence 445567789999999999999999999999999999999999999999999999999999999999555555555543
No 11
>cd00084 HMG-box High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III member
Probab=99.54 E-value=1.3e-14 Score=106.05 Aligned_cols=59 Identities=41% Similarity=0.767 Sum_probs=57.2
Q ss_pred CCCCccchhchhHHHHHHHHHHcCCCCHHHHHHHHHhhccCCChHHhHHHHHHHHHHHH
Q psy1937 88 PEKPLMPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKN 146 (307)
Q Consensus 88 PKRPlsAy~lf~~e~r~~v~~e~P~~~~~eisk~lg~~Wk~Ls~eeK~~Y~~~ae~~K~ 146 (307)
||||+||||+|+++.|..++.++|+++..+|+++||.+|+.|++++|.+|.+.|+.++.
T Consensus 1 pkrp~~af~~f~~~~~~~~~~~~~~~~~~~i~~~~~~~W~~l~~~~k~~y~~~a~~~~~ 59 (66)
T cd00084 1 PKRPLSAYFLFSQEHRAEVKAENPGLSVGEISKILGEMWKSLSEEEKKKYEEKAEKDKE 59 (66)
T ss_pred CCCCCcHHHHHHHHHHHHHHHHCcCCCHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHH
Confidence 79999999999999999999999999999999999999999999999999999999854
No 12
>PF09011 HMG_box_2: HMG-box domain; InterPro: IPR015101 This domain is predominantly found in Maelstrom homologue proteins. It has no known function. ; GO: 0005634 nucleus; PDB: 2EQZ_A 1V64_A 2CTO_A 1H5P_A 3TQ6_A 3FGH_A 3TMM_A 1J3X_A 2YRQ_A 1AAB_A ....
Probab=99.54 E-value=1.1e-14 Score=110.19 Aligned_cols=62 Identities=27% Similarity=0.553 Sum_probs=54.7
Q ss_pred CCCCCCCccchhchhHHHHHHHHHH-cCCCCHHHHHHHHHhhccCCChHHhHHHHHHHHHHHH
Q psy1937 85 PKPPEKPLMPYMRYSRKVWDQVKAQ-NLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKN 146 (307)
Q Consensus 85 p~~PKRPlsAy~lf~~e~r~~v~~e-~P~~~~~eisk~lg~~Wk~Ls~eeK~~Y~~~ae~~K~ 146 (307)
|++||+|+|||++|+.+++..++.. .+...+.|+++.||.+|++||++||.+|.+.|+.+++
T Consensus 1 p~kpK~~~say~lF~~~~~~~~k~~G~~~~~~~e~~k~~~~~Wk~Ls~~EK~~Y~~~A~~~k~ 63 (73)
T PF09011_consen 1 PKKPKRPPSAYNLFMKEMRKEVKEEGGQKQSFREVMKEISERWKSLSEEEKEPYEERAKEDKE 63 (73)
T ss_dssp SSS--SSSSHHHHHHHHHHHHHHHHT-T-SSHHHHHHHHHHHHHHS-HHHHHHHHHHHHHHHH
T ss_pred CcCCCCCCCHHHHHHHHHHHHHHHhcccCCCHHHHHHHHHHHHHhcCHHHHHHHHHHHHHHHH
Confidence 6889999999999999999999998 7888999999999999999999999999999999976
No 13
>KOG3248|consensus
Probab=99.52 E-value=1.7e-14 Score=135.66 Aligned_cols=73 Identities=21% Similarity=0.350 Sum_probs=67.8
Q ss_pred CCCCCccchhchhHHHHHHHHHHcCCCCHHHHHHHHHhhccCCChHHhHHHHHHHHHHHH--HHHh----hhhhccCcc
Q psy1937 87 PPEKPLMPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKN--LKAS----TKLASISRP 159 (307)
Q Consensus 87 ~PKRPlsAy~lf~~e~r~~v~~e~P~~~~~eisk~lg~~Wk~Ls~eeK~~Y~~~ae~~K~--~k~y----~y~~y~~~k 159 (307)
..|+||||||+|++|+|.+|.+++......+|.+|||++|.+||.||..+|+++|.++++ |+.| +.+||.++|
T Consensus 191 hiKKPLNAFmlyMKEmRa~vvaEctlKeSAaiNqiLGrRWH~LSrEEQAKYyElArKerqlH~qlYP~WSARdNYgKKk 269 (421)
T KOG3248|consen 191 HIKKPLNAFMLYMKEMRAKVVAECTLKESAAINQILGRRWHALSREEQAKYYELARKERQLHMQLYPGWSARDNYGKKK 269 (421)
T ss_pred cccccHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHhHHHhhhhHHHHHHHHHHHHHHHHHHHHhcCCcchhhhhhhhh
Confidence 469999999999999999999999988899999999999999999999999999999999 6777 778888766
No 14
>KOG0381|consensus
Probab=99.48 E-value=1.3e-13 Score=109.04 Aligned_cols=63 Identities=32% Similarity=0.670 Sum_probs=59.8
Q ss_pred CC--CCCCCCccchhchhHHHHHHHHHHcCCCCHHHHHHHHHhhccCCChHHhHHHHHHHHHHHH
Q psy1937 84 AP--KPPEKPLMPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKN 146 (307)
Q Consensus 84 dp--~~PKRPlsAy~lf~~e~r~~v~~e~P~~~~~eisk~lg~~Wk~Ls~eeK~~Y~~~ae~~K~ 146 (307)
+| ..||||++||++|+.+.|..++.+||++++.||+++||++|++|++++|.+|+..|..++.
T Consensus 17 ~p~~~~pkrp~sa~~~f~~~~~~~~k~~~p~~~~~~v~k~~g~~W~~l~~~~k~~y~~ka~~~k~ 81 (96)
T KOG0381|consen 17 DPNAQAPKRPLSAFFLFSSEQRSKIKAENPGLSVGEVAKALGEMWKNLAEEEKQPYEEKASKLKE 81 (96)
T ss_pred CCCCCCCCCCCcHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHH
Confidence 56 5999999999999999999999999999999999999999999999999999999988865
No 15
>KOG0526|consensus
Probab=99.28 E-value=2.9e-12 Score=126.97 Aligned_cols=76 Identities=17% Similarity=0.385 Sum_probs=66.1
Q ss_pred cCCCCCCCCCCCCCCccchhchhHHHHHHHHHHcCCCCHHHHHHHHHhhccCCChHHhHHHHHHHHHHHHHHHhhhhhcc
Q psy1937 77 TDGRGPKAPKPPEKPLMPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKNLKASTKLASI 156 (307)
Q Consensus 77 ~~~k~~kdp~~PKRPlsAy~lf~~e~r~~v~~e~P~~~~~eisk~lg~~Wk~Ls~eeK~~Y~~~ae~~K~~k~y~y~~y~ 156 (307)
++.++.|||++||||++|||+|++..|..|+.+ +.+++||+|.+|++|+.|+. |.+|+++|+.+|+...-++..|.
T Consensus 525 k~~kk~kdpnapkra~sa~m~w~~~~r~~ik~d--gi~~~dv~kk~g~~wk~ms~--k~~we~ka~~dk~ry~~em~~yk 600 (615)
T KOG0526|consen 525 KKGKKKKDPNAPKRATSAYMLWLNASRESIKED--GISVGDVAKKAGEKWKQMSA--KEEWEDKAAVDKQRYEDEMKEYK 600 (615)
T ss_pred cCcccCCCCCCCccchhHHHHHHHhhhhhHhhc--CchHHHHHHHHhHHHhhhcc--cchhhHHHHHHHHHHHHHHHhhc
Confidence 556778999999999999999999999999988 89999999999999999999 99999999999862222444444
No 16
>KOG0528|consensus
Probab=98.91 E-value=7.7e-10 Score=108.86 Aligned_cols=73 Identities=16% Similarity=0.322 Sum_probs=61.5
Q ss_pred CCCCCCCCccchhchhHHHHHHHHHHcCCCCHHHHHHHHHhhccCCChHHhHHHHHHHHHHHHHHHhhhhhcc
Q psy1937 84 APKPPEKPLMPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKNLKASTKLASI 156 (307)
Q Consensus 84 dp~~PKRPlsAy~lf~~e~r~~v~~e~P~~~~~eisk~lg~~Wk~Ls~eeK~~Y~~~ae~~K~~k~y~y~~y~ 156 (307)
.+..-||||||||.|.++.|.+|...+||+.+..|+||||.+|+.|+..||+||+++-+..-.++.-.|.+|.
T Consensus 322 s~PHIKRPMNAFMVWAkDERRKILqA~PDMHNSnISKILGSRWKaMSN~eKQPYYEEQaRLSk~HlEk~PdYr 394 (511)
T KOG0528|consen 322 SEPHIKRPMNAFMVWAKDERRKILQAFPDMHNSNISKILGSRWKAMSNTEKQPYYEEQARLSKLHLEKYPDYR 394 (511)
T ss_pred CCccccCCcchhhcccchhhhhhhhcCccccccchhHHhcccccccccccccchHHHHHHHHHhhhccCcccc
Confidence 3445699999999999999999999999999999999999999999999999999887765543333444444
No 17
>KOG2746|consensus
Probab=98.34 E-value=2.8e-07 Score=94.12 Aligned_cols=66 Identities=20% Similarity=0.356 Sum_probs=61.1
Q ss_pred CCCCCCCCCCCccchhchhHHHH--HHHHHHcCCCCHHHHHHHHHhhccCCChHHhHHHHHHHHHHHH
Q psy1937 81 GPKAPKPPEKPLMPYMRYSRKVW--DQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKN 146 (307)
Q Consensus 81 ~~kdp~~PKRPlsAy~lf~~e~r--~~v~~e~P~~~~~eisk~lg~~Wk~Ls~eeK~~Y~~~ae~~K~ 146 (307)
.+.+.....+||||||+||+.+| ..+...||+..++.|++|||++|-.|.+.||+.|.++|.+.|.
T Consensus 175 nkr~k~HirrPMnaf~ifskrhr~~g~vhq~~pn~DNrtIskiLgewWytL~~~Ekq~yhdLa~Qvk~ 242 (683)
T KOG2746|consen 175 NKRDKDHIRRPMNAFHIFSKRHRGEGRVHQRHPNQDNRTISKILGEWWYTLGPNEKQKYHDLAFQVKE 242 (683)
T ss_pred CcCcchhhhhhhHHHHHHHhhcCCccchhccCccccchhHHHHHhhhHhhhCchhhhhHHHHHHHHHH
Confidence 44556677899999999999999 9999999999999999999999999999999999999999876
No 18
>PF08549 SWI-SNF_Ssr4: Fungal domain of unknown function (DUF1750); InterPro: IPR013859 This is a fungal protein of unknown function.
Probab=96.74 E-value=0.0036 Score=64.90 Aligned_cols=85 Identities=19% Similarity=0.400 Sum_probs=61.2
Q ss_pred CCCccccccccccccchhhHHhhhcc-----cCCC-C--------cchhhh---------------------------HH
Q psy1937 206 DGYSVKHVAYARYLRNHRLINEIFSD-----SVVP-D--------VRSVVT---------------------------TN 244 (307)
Q Consensus 206 dg~S~k~~a~~Rf~rNH~l~~Eifs~-----~~v~-d--------~rs~~~---------------------------~~ 244 (307)
|-+|+|.|..+||+-||+.|+||||. -++| | +-||+. ..
T Consensus 281 D~lTPReIS~~RY~qhHEWMEEI~sSPY~i~qI~P~DLGLGrKGEL~sLTeGiF~ap~~~~~~~~~~~~~~~~~gkLdp~ 360 (669)
T PF08549_consen 281 DHLTPREISKMRYQQHHEWMEEILSSPYRISQIEPVDLGLGRKGELESLTEGIFEAPGGQSGDASKEGPKKPYVGKLDPG 360 (669)
T ss_pred ccCCHHHHHHHHHHHHHHHHHHHHhCccccccccccccCCccccchhhhhcccccCCCCCCccccccCCCcccccCCCHH
Confidence 45889999999999999999999997 2222 2 222221 22
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhhhhHHHHHHHH
Q psy1937 245 RMQVLKRQVQSLTMHQKKLEAELQQIEEKFEAKKRKFVESSEQFQEELK 293 (307)
Q Consensus 245 r~~~l~~~~~sL~~hq~kle~el~~~e~~~~~kk~~~~~~s~~f~~~~~ 293 (307)
......+ +...|.....+||+.|+.+|+.+-.+|...|--=..|+.
T Consensus 361 ~aeeF~k---RV~~~ia~~~AEIekmK~~Hak~m~k~k~~s~lk~AE~~ 406 (669)
T PF08549_consen 361 KAEEFRK---RVAKKIADMNAEIEKMKARHAKRMAKFKRNSLLKDAEKE 406 (669)
T ss_pred HHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHhhccHHHHHHHH
Confidence 2233333 345688899999999999999999999998865555543
No 19
>PF04690 YABBY: YABBY protein; InterPro: IPR006780 YABBY proteins are a group of plant-specific transcription factors involved in the specification of abaxial polarity in lateral organs such as leaves and floral organs [, ].
Probab=96.45 E-value=0.0038 Score=54.89 Aligned_cols=49 Identities=22% Similarity=0.405 Sum_probs=44.0
Q ss_pred CCCCCCCCCCccchhchhHHHHHHHHHHcCCCCHHHHHHHHHhhccCCC
Q psy1937 82 PKAPKPPEKPLMPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLP 130 (307)
Q Consensus 82 ~kdp~~PKRPlsAy~lf~~e~r~~v~~e~P~~~~~eisk~lg~~Wk~Ls 130 (307)
.|+|.+..|-++||-.|+++.-..|++.+|+++..|.-+..+..|...+
T Consensus 116 ~kPPEKRqR~psaYn~f~k~ei~rik~~~p~ishkeaFs~aAknW~h~p 164 (170)
T PF04690_consen 116 NKPPEKRQRVPSAYNRFMKEEIQRIKAENPDISHKEAFSAAAKNWAHFP 164 (170)
T ss_pred cCCccccCCCchhHHHHHHHHHHHHHhcCCCCCHHHHHHHHHHhhhhCc
Confidence 4556666788999999999999999999999999999999999998765
No 20
>PF06382 DUF1074: Protein of unknown function (DUF1074); InterPro: IPR024460 This family consists of several proteins which appear to be specific to Insecta. The function of this family is unknown.
Probab=95.88 E-value=0.0095 Score=52.53 Aligned_cols=49 Identities=16% Similarity=0.261 Sum_probs=44.1
Q ss_pred ccchhchhHHHHHHHHHHcCCCCHHHHHHHHHhhccCCChHHhHHHHHHHHHH
Q psy1937 92 LMPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQE 144 (307)
Q Consensus 92 lsAy~lf~~e~r~~v~~e~P~~~~~eisk~lg~~Wk~Ls~eeK~~Y~~~ae~~ 144 (307)
-+||+-|++++|. .|.++...|+....+.+|..||+.+|..|..++...
T Consensus 83 nnaYLNFLReFRr----kh~~L~p~dlI~~AAraW~rLSe~eK~rYrr~~~~~ 131 (183)
T PF06382_consen 83 NNAYLNFLREFRR----KHCGLSPQDLIQRAARAWCRLSEAEKNRYRRMAPSV 131 (183)
T ss_pred chHHHHHHHHHHH----HccCCCHHHHHHHHHHHHHhCCHHHHHHHHhhcchh
Confidence 4689999999887 468999999999999999999999999999987654
No 21
>COG5648 NHP6B Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics]
Probab=95.87 E-value=0.0062 Score=55.01 Aligned_cols=62 Identities=29% Similarity=0.571 Sum_probs=57.1
Q ss_pred CCCCCCCccchhchhHHHHHHHHHHcCCCCHHHHHHHHHhhccCCChHHhHHHHHHHHHHHH
Q psy1937 85 PKPPEKPLMPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKN 146 (307)
Q Consensus 85 p~~PKRPlsAy~lf~~e~r~~v~~e~P~~~~~eisk~lg~~Wk~Ls~eeK~~Y~~~ae~~K~ 146 (307)
..+|+.|..+|+-+-.+.|+.+...+|+....++++++|..|++|++.-|.+|.+.|.+.+.
T Consensus 141 k~~~~~~~~~~~e~~~~~r~~~~~~~~~~~~~e~~k~~~~~w~el~~skK~~~~~~~Kk~k~ 202 (211)
T COG5648 141 KLPNKAPIGPFIENEPKIRPKVEGPSPDKALVEETKIISKAWSELDESKKKKYIDKYKKLKE 202 (211)
T ss_pred ccCCCCCCchhhhccHHhccccCCCCcchhhhHHhhhhhhhhhhhChhhhhHHHHHHHHHHH
Confidence 35678899999999999999999999999999999999999999999999999999998753
No 22
>PF14887 HMG_box_5: HMG (high mobility group) box 5; PDB: 1L8Y_A 1L8Z_A 2HDZ_A.
Probab=95.76 E-value=0.031 Score=42.97 Aligned_cols=69 Identities=14% Similarity=0.162 Sum_probs=52.2
Q ss_pred CCCCCccchhchhHHHHHHHHHHcCCCCHHHHHHHHHhhccCCChHHhHHHHHHHHHHHHHHHh--hhhhccCc
Q psy1937 87 PPEKPLMPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKNLKAS--TKLASISR 158 (307)
Q Consensus 87 ~PKRPlsAy~lf~~e~r~~v~~e~P~~~~~eisk~lg~~Wk~Ls~eeK~~Y~~~ae~~K~~k~y--~y~~y~~~ 158 (307)
-|..|-+|--+|.+.....+.+.+++-...+ -+.+...|++|++.+|-+|...|.++ .+.| .+-.|..+
T Consensus 3 lPE~PKt~qe~Wqq~vi~dYla~~~~dr~K~-~kam~~~W~~me~Kekl~WIkKA~Ed--qKrYE~el~e~r~~ 73 (85)
T PF14887_consen 3 LPETPKTAQEIWQQSVIGDYLAKFRNDRKKA-LKAMEAQWSQMEKKEKLKWIKKAAED--QKRYERELREMRSA 73 (85)
T ss_dssp -S----THHHHHHHHHHHHHHHHTTSTHHHH-HHHHHHHHHTTGGGHHHHHHHHHHHH--HHHHHHHHHCCS-C
T ss_pred CCCCCCCHHHHHHHHHHHHHHHHhhHhHHHH-HHHHHHHHHHhhhhhhhHHHHHHHHH--HHHHHHHHHHHhcC
Confidence 4777888889999999999999998877766 45899999999999999999999988 4666 44445443
No 23
>PF08073 CHDNT: CHDNT (NUC034) domain; InterPro: IPR012958 The CHD N-terminal domain is found in PHD/RING fingers and chromo domain-associated helicases [].; GO: 0003677 DNA binding, 0005524 ATP binding, 0008270 zinc ion binding, 0016818 hydrolase activity, acting on acid anhydrides, in phosphorus-containing anhydrides, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=93.59 E-value=0.076 Score=38.27 Aligned_cols=39 Identities=18% Similarity=0.370 Sum_probs=35.1
Q ss_pred ccchhchhHHHHHHHHHHcCCCCHHHHHHHHHhhccCCC
Q psy1937 92 LMPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLP 130 (307)
Q Consensus 92 lsAy~lf~~e~r~~v~~e~P~~~~~eisk~lg~~Wk~Ls 130 (307)
++.|=.|++-.|+.|.+.||++..+.|..+++.+|+.-+
T Consensus 13 lt~yK~Fsq~vRP~l~~~NPk~~~sKl~~l~~AKwrEF~ 51 (55)
T PF08073_consen 13 LTNYKAFSQHVRPLLAKANPKAPMSKLMMLLQAKWREFQ 51 (55)
T ss_pred HHHHHHHHHHHHHHHHHHCCCCcHHHHHHHHHHHHHHHH
Confidence 456788999999999999999999999999999998543
No 24
>PF06320 GCN5L1: GCN5-like protein 1 (GCN5L1); InterPro: IPR009395 This family consists of several eukaryotic GCN5-like protein 1 (GCN5L1) sequences. The function of this family is unknown [,].
Probab=90.51 E-value=1.8 Score=36.08 Aligned_cols=59 Identities=20% Similarity=0.328 Sum_probs=51.0
Q ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHhHHhhhhHHHHHHHHhhhc
Q psy1937 239 SVVTTNRMQVLKRQVQSLTMHQKKLEAELQQIEEKF---EAKKRKFVESSEQFQEELKKEKA 297 (307)
Q Consensus 239 s~~~~~r~~~l~~~~~sL~~hq~kle~el~~~e~~~---~~kk~~~~~~s~~f~~~~~~~~~ 297 (307)
..++..-+..|...|+.+...|++|+.|..++.... ...-.+|+...+.|+..||.+++
T Consensus 28 ~~l~~~l~d~ln~~v~~~~~Nqk~ie~e~k~L~~~~~~l~kqt~qw~~~~~~~~~~LKEiGD 89 (121)
T PF06320_consen 28 QALTNALVDHLNSRVSEAYENQKKIEKEAKQLQRNTAKLAKQTDQWLKLVDSFNDALKEIGD 89 (121)
T ss_pred HHHHHHHHHHHHHhHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc
Confidence 346677788899999999999999999888887655 66778899999999999999975
No 25
>PF04769 MAT_Alpha1: Mating-type protein MAT alpha 1; InterPro: IPR006856 This family includes Saccharomyces cerevisiae (Baker's yeast) mating type protein alpha 1 (P01365 from SWISSPROT). MAT alpha 1 is a transcription activator that activates mating-type alpha-specific genes with the help of the MADS-box containing MCM1 transcription factor, which together bind cooperatively to PQ elements upstream of alpha-specific genes. The MCM1-MATalpha1 complex is required for the proper DNA-bending that is needed for transcriptional activation []. Alpha 1 interacts in vivo with STE12, linking expression of alpha-specific genes to the alpha-pheromone (IPR006742 from INTERPRO) response pathway [].; GO: 0000772 mating pheromone activity, 0003677 DNA binding, 0045895 positive regulation of transcription, mating-type specific, 0005634 nucleus
Probab=89.97 E-value=0.39 Score=43.46 Aligned_cols=54 Identities=19% Similarity=0.330 Sum_probs=40.3
Q ss_pred CCCCCCCCCccchhchhHHHHHHHHHHcCCCCHHHHHHHHHhhccCCChHHhHHHHHHHH
Q psy1937 83 KAPKPPEKPLMPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYE 142 (307)
Q Consensus 83 kdp~~PKRPlsAy~lf~~e~r~~v~~e~P~~~~~eisk~lg~~Wk~Ls~eeK~~Y~~~ae 142 (307)
..+..++||+|+||.|..-+- ...|+....+++.+|+..|..=+- |..|.-.|.
T Consensus 39 ~~~~~~kr~lN~Fm~FRsyy~----~~~~~~~Qk~~S~~l~~lW~~dp~--k~~W~l~ak 92 (201)
T PF04769_consen 39 RSPEKAKRPLNGFMAFRSYYS----PIFPPLPQKELSGILTKLWEKDPF--KNKWSLMAK 92 (201)
T ss_pred ccccccccchhHHHHHHHHHH----hhcCCcCHHHHHHHHHHHHhCCcc--HhHHHHHhh
Confidence 345568999999999976655 446888999999999999986332 445555554
No 26
>PF06244 DUF1014: Protein of unknown function (DUF1014); InterPro: IPR010422 This family consists of several hypothetical eukaryotic proteins of unknown function.
Probab=84.78 E-value=0.93 Score=37.93 Aligned_cols=47 Identities=21% Similarity=0.497 Sum_probs=40.4
Q ss_pred CCCCCC-ccchhchhHHHHHHHHHHcCCCCHHHHHHHHHhhccCCChH
Q psy1937 86 KPPEKP-LMPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPED 132 (307)
Q Consensus 86 ~~PKRP-lsAy~lf~~e~r~~v~~e~P~~~~~eisk~lg~~Wk~Ls~e 132 (307)
..|-|. --||.-|....-+.++.+||++..+.+-.+|-.+|..-|+.
T Consensus 70 rHPErR~KAAy~afeE~~Lp~lK~E~PgLrlsQ~kq~l~K~w~KSPeN 117 (122)
T PF06244_consen 70 RHPERRMKAAYKAFEERRLPELKEENPGLRLSQYKQMLWKEWQKSPEN 117 (122)
T ss_pred CCcchhHHHHHHHHHHHHhHHHHhhCCCchHHHHHHHHHHHHhcCCCC
Confidence 345444 47899999999999999999999999999999999877653
No 27
>PF11068 YlqD: YlqD protein; InterPro: IPR021297 This bacterial family of proteins has no known function. ; PDB: 4DCI_C.
Probab=56.69 E-value=74 Score=26.89 Aligned_cols=61 Identities=25% Similarity=0.426 Sum_probs=46.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHHhHHhhhhHHHHHHHHhhhccCCCCC
Q psy1937 243 TNRMQVLKRQVQSLTMHQKKLEAEL--------QQIEEKFEAKKRKFVESSEQFQEELKKEKASCSENE 303 (307)
Q Consensus 243 ~~r~~~l~~~~~sL~~hq~kle~el--------~~~e~~~~~kk~~~~~~s~~f~~~~~~~~~~~~~~~ 303 (307)
.+.+.-|-.+.+.|..-.+++..|. ..+...|+..+.++.+--..+...|..+-.+.++.|
T Consensus 26 ~~~i~~~d~el~QLefq~kr~~~e~~~~~~~~~~~i~~q~~~e~~~r~e~k~~l~~ql~qv~~L~lgsE 94 (131)
T PF11068_consen 26 QEQIQQLDQELQQLEFQGKRMIKEIKKQNAQQIQSIQQQFEQEKQERLEQKNQLLQQLEQVQKLELGSE 94 (131)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHTTSSHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHS-TT-E
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCE
Confidence 3456667777888888888877764 577889999999999999999999988877766654
No 28
>PF11629 Mst1_SARAH: C terminal SARAH domain of Mst1; InterPro: IPR024205 The SARAH (Sav/Rassf/Hpo) domain is found at the C terminus in three classes of eukaryotic tumour suppressors that give the domain its name. In the Sav (Salvador) and Hpo (Hippo) families, the SARAH domain mediates signal transduction from Hpo via the Sav scaffolding protein to the downstream component Wts (Warts); the phosphorylation of Wts by Hpo triggers cell cycle arrest and apoptosis by down-regulating cyclin E, Diap 1 and other targets []. The SARAH domain is also involved in dimerisation, as in the human Hpo orthologue, Mst1, which homodimerises via its C-terminal SARAH domain. The SARAH domain is found associated with other domains, such as protein kinase domains, WW/rsp5/WWP domain (IPR001202 from INTERPRO), C1 domain (IPR002219 from INTERPRO), LIM domain (IPR001781 from INTERPRO), or the Ras-associating (RA) domain (IPR000159 from INTERPRO).; GO: 0004674 protein serine/threonine kinase activity; PDB: 2JO8_A.
Probab=55.72 E-value=57 Score=23.01 Aligned_cols=35 Identities=20% Similarity=0.582 Sum_probs=27.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhhh
Q psy1937 247 QVLKRQVQSLTMHQKKLEAELQQIEEKFEAKKRKFVES 284 (307)
Q Consensus 247 ~~l~~~~~sL~~hq~kle~el~~~e~~~~~kk~~~~~~ 284 (307)
..|.+...+|.. ..|.||..+..+|++|++=+++.
T Consensus 11 ~eL~~rl~~LD~---~ME~Eieelr~RY~~KRqPIldA 45 (49)
T PF11629_consen 11 EELQQRLASLDP---EMEQEIEELRQRYQAKRQPILDA 45 (49)
T ss_dssp HHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHhCCH---HHHHHHHHHHHHHHHhhccHHHH
Confidence 456666667766 46889999999999999888765
No 29
>TIGR03481 HpnM hopanoid biosynthesis associated membrane protein HpnM. The genomes containing members of this family share the machinery for the biosynthesis of hopanoid lipids. Furthermore, the genes of this family are usually located proximal to other components of this biological process. The proteins are members of the pfam05494 family of putative transporters known as "toluene tolerance protein Ttg2D", although it is unlikely that the members included here have anything to do with toluene per-se.
Probab=54.76 E-value=14 Score=33.10 Aligned_cols=39 Identities=13% Similarity=0.438 Sum_probs=31.7
Q ss_pred CCCCHHHHHH-HHHhhccCCChHHhHHHHHHHHHHHHHHHh
Q psy1937 111 LDLKLWEIGK-IIGQMWRDLPEDQKTEYVEDYEQEKNLKAS 150 (307)
Q Consensus 111 P~~~~~eisk-~lg~~Wk~Ls~eeK~~Y~~~ae~~K~~k~y 150 (307)
|...+..|++ .||.-|+.+|+++|+.|.+....-- ...|
T Consensus 62 ~~~Df~~mar~vLG~~W~~~s~~Qr~~F~~~F~~~l-~~tY 101 (198)
T TIGR03481 62 EAFDLPAMARLTLGSSWTSLSPEQRRRFIGAFRELS-IATY 101 (198)
T ss_pred HhCCHHHHHHHHhhhhhhhCCHHHHHHHHHHHHHHH-HHHH
Confidence 4678888987 6899999999999999999987631 3555
No 30
>PRK15117 ABC transporter periplasmic binding protein MlaC; Provisional
Probab=53.21 E-value=18 Score=32.82 Aligned_cols=55 Identities=13% Similarity=0.190 Sum_probs=39.4
Q ss_pred HHHHHHH-cCCCCHHHHHH-HHHhhccCCChHHhHHHHHHHHHHHHHHHh--hhhhccCc
Q psy1937 103 WDQVKAQ-NLDLKLWEIGK-IIGQMWRDLPEDQKTEYVEDYEQEKNLKAS--TKLASISR 158 (307)
Q Consensus 103 r~~v~~e-~P~~~~~eisk-~lg~~Wk~Ls~eeK~~Y~~~ae~~K~~k~y--~y~~y~~~ 158 (307)
+..|... .|...+..|++ .||.-|+.+|+++|..|.+....-- ...| +...|-..
T Consensus 57 ~~~v~~~l~p~~Df~~~s~~vLG~~wr~as~eQr~~F~~~F~~~L-v~tYa~~l~~y~~q 115 (211)
T PRK15117 57 RTIVDQELLPYVQVKYAGALVLGRYYKDATPAQREAYFAAFREYL-KQAYGQALAMYHGQ 115 (211)
T ss_pred HHHHHHHccccCCHHHHHHHHhhhhhhhCCHHHHHHHHHHHHHHH-HHHHHHHHHHhCCc
Confidence 3444333 38889999987 6899999999999999999887642 3555 44455443
No 31
>PF05494 Tol_Tol_Ttg2: Toluene tolerance, Ttg2 ; InterPro: IPR008869 Toluene tolerance is mediated by increased cell membrane rigidity resulting from changes in fatty acid and phospholipid compositions, exclusion of toluene from the cell membrane, and removal of intracellular toluene by degradation []. Many proteins are involved in these processes. This family is a transporter which shows similarity to ABC transporters [].; PDB: 2QGU_A.
Probab=52.61 E-value=15 Score=31.65 Aligned_cols=47 Identities=15% Similarity=0.319 Sum_probs=32.3
Q ss_pred cCCCCHHHHHHH-HHhhccCCChHHhHHHHHHHHHHHHHHHh--hhhhccC
Q psy1937 110 NLDLKLWEIGKI-IGQMWRDLPEDQKTEYVEDYEQEKNLKAS--TKLASIS 157 (307)
Q Consensus 110 ~P~~~~~eisk~-lg~~Wk~Ls~eeK~~Y~~~ae~~K~~k~y--~y~~y~~ 157 (307)
.|...+..|++. ||.-|+.+|+++++.|.+....-- ...| ....|..
T Consensus 35 ~~~~D~~~~ar~~LG~~w~~~s~~q~~~F~~~f~~~l-~~~Y~~~l~~y~~ 84 (170)
T PF05494_consen 35 DPYFDFERMARRVLGRYWRKASPAQRQRFVEAFKQLL-VRTYAKRLDEYSG 84 (170)
T ss_dssp GGGB-HHHHHHHHHGGGTTTS-HHHHHHHHHHHHHHH-HHHHHHHHHT-SS
T ss_pred HHhCCHHHHHHHHHHHhHhhCCHHHHHHHHHHHHHHH-HHHHHHHHHhhCC
Confidence 477788888876 788899999999999999987642 3444 3344443
No 32
>COG4238 Murein lipoprotein [Cell envelope biogenesis, outer membrane]
Probab=51.04 E-value=1.1e+02 Score=23.49 Aligned_cols=57 Identities=14% Similarity=0.295 Sum_probs=46.3
Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhhhhHHHHHHHHhhh
Q psy1937 240 VVTTNRMQVLKRQVQSLTMHQKKLEAELQQIEEKFEAKKRKFVESSEQFQEELKKEK 296 (307)
Q Consensus 240 ~~~~~r~~~l~~~~~sL~~hq~kle~el~~~e~~~~~kk~~~~~~s~~f~~~~~~~~ 296 (307)
-+..++|.-|.-+||.|...+..|+....++...-++-|-.-...-+...+.-+..|
T Consensus 21 c~s~aK~dqlss~vq~LnAkv~qLe~dv~a~~~~~qAAk~eaarAn~rldn~a~s~~ 77 (78)
T COG4238 21 CSSNAKIDQLSSDVQTLNAKVDQLENDVNAMRSDVQAAKDEAARANQRLDNQAQSYC 77 (78)
T ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHhc
Confidence 345889999999999999999999999999999988877666666556655555555
No 33
>PF04728 LPP: Lipoprotein leucine-zipper; InterPro: IPR006817 This repeating sequence, NAKVDQLSNDV, is found in the enterobacterial outer membrane lipoprotein LPP. The outer membrane lipoprotein is the most abundant protein in an Escherichia coli cell. The messenger RNA for the lipoprotein of the E. coli outer membrane codes for a putative precursor, prolipoprotein, which has 20 additional amino acid residues extending from the amino terminus of the lipoprotein.; GO: 0019867 outer membrane; PDB: 1JCC_A 2GUV_C 2GUS_A 1JCD_A 1KFM_A 1T8Z_D 1KFN_A 1EQ7_A.
Probab=49.22 E-value=1e+02 Score=22.40 Aligned_cols=42 Identities=17% Similarity=0.345 Sum_probs=34.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhhh
Q psy1937 243 TNRMQVLKRQVQSLTMHQKKLEAELQQIEEKFEAKKRKFVES 284 (307)
Q Consensus 243 ~~r~~~l~~~~~sL~~hq~kle~el~~~e~~~~~kk~~~~~~ 284 (307)
+++|.-|..+|+.|...+.+|..++..+...-+.-|..-...
T Consensus 2 ~akid~Ls~dVq~L~~kvdqLs~dv~~lr~~v~~ak~EAaRA 43 (56)
T PF04728_consen 2 NAKIDQLSSDVQTLNSKVDQLSSDVNALRADVQAAKEEAARA 43 (56)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 467889999999999999999999999988777665544333
No 34
>COG1579 Zn-ribbon protein, possibly nucleic acid-binding [General function prediction only]
Probab=48.93 E-value=1e+02 Score=28.69 Aligned_cols=53 Identities=23% Similarity=0.466 Sum_probs=47.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhhhhHHHHHHHHhh
Q psy1937 243 TNRMQVLKRQVQSLTMHQKKLEAELQQIEEKFEAKKRKFVESSEQFQEELKKE 295 (307)
Q Consensus 243 ~~r~~~l~~~~~sL~~hq~kle~el~~~e~~~~~kk~~~~~~s~~f~~~~~~~ 295 (307)
..|...|..++..|..-+.+|+.++..+.+++..-.+.|.+....+-.++..+
T Consensus 102 k~r~~~le~el~~l~~~~~~l~~~i~~l~~~~~~~e~~~~e~~~~~e~e~~~i 154 (239)
T COG1579 102 KERINSLEDELAELMEEIEKLEKEIEDLKERLERLEKNLAEAEARLEEEVAEI 154 (239)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 56788899999999999999999999999999999999999999998888766
No 35
>KOG3223|consensus
Probab=46.91 E-value=16 Score=32.98 Aligned_cols=53 Identities=13% Similarity=0.401 Sum_probs=43.3
Q ss_pred CCC-CCCccchhchhHHHHHHHHHHcCCCCHHHHHHHHHhhccCCChHHhHHHHHHH
Q psy1937 86 KPP-EKPLMPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDY 141 (307)
Q Consensus 86 ~~P-KRPlsAy~lf~~e~r~~v~~e~P~~~~~eisk~lg~~Wk~Ls~eeK~~Y~~~a 141 (307)
..| ||=.-||.-|-...-+.++.++|++.++..-.+|-.+|..-|+. ||...+
T Consensus 162 rHPEkRmrAA~~afEe~~LPrLK~e~P~lrlsQ~Kqll~Kew~KsPDN---P~Nq~~ 215 (221)
T KOG3223|consen 162 RHPEKRMRAAFKAFEEARLPRLKKENPGLRLSQYKQLLKKEWQKSPDN---PFNQAA 215 (221)
T ss_pred cChHHHHHHHHHHHHHhhchhhhhcCCCccHHHHHHHHHHHHhhCCCC---hhhHHh
Confidence 445 45556788899999999999999999999999999999988875 554443
No 36
>PF02370 M: M protein repeat; InterPro: IPR003345 This short repeat is found in multiple copies in bacterial M proteins. The M proteins bind to IgA and are closely associated with virulence. The M protein has been postulated to be a major group A streptococcal (GAS) virulence factor because of its contribution to the bacterial resistance to opsonophagocytosis [].; PDB: 2KK9_A.
Probab=42.28 E-value=39 Score=19.69 Aligned_cols=17 Identities=35% Similarity=0.518 Sum_probs=13.6
Q ss_pred HHHHHHHHHHHHHHHHH
Q psy1937 260 QKKLEAELQQIEEKFEA 276 (307)
Q Consensus 260 q~kle~el~~~e~~~~~ 276 (307)
.++||++++.+++.++.
T Consensus 3 kk~lEa~~qkLe~e~q~ 19 (21)
T PF02370_consen 3 KKQLEADHQKLEAEKQI 19 (21)
T ss_dssp HHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHhh
Confidence 46789999999888864
No 37
>PF02172 KIX: KIX domain; InterPro: IPR003101 The nuclear factor CREB activates transcription of target genes in part through direct interactions with the KIX domain of the coactivator CBP in a phosphorylation-dependent manner []. This provides a model for activator:coactivator interactions. The KIX domain of CBP also binds to transactivation domains of other nuclear factors including Myb and Jun.; GO: 0003712 transcription cofactor activity, 0006355 regulation of transcription, DNA-dependent; PDB: 2KWF_A 1KDX_A 2LQH_A 2LQI_A 1SB0_A 2AGH_B.
Probab=40.26 E-value=80 Score=24.51 Aligned_cols=44 Identities=32% Similarity=0.574 Sum_probs=28.3
Q ss_pred cccccch---hhHHhhhcccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy1937 216 ARYLRNH---RLINEIFSDSVVPDVRSVVTTNRMQVLKRQVQSLTMHQKKLEAELQQI 270 (307)
Q Consensus 216 ~Rf~rNH---~l~~Eifs~~~v~d~rs~~~~~r~~~l~~~~~sL~~hq~kle~el~~~ 270 (307)
+.=.||| .|+.-||.. ||+... ...||. +|....+++|.++...
T Consensus 11 t~~lR~hlV~KLv~aI~P~---pdp~a~-~d~rm~-------~l~~yarkvE~~~fe~ 57 (81)
T PF02172_consen 11 TPDLRNHLVHKLVQAIFPT---PDPNAM-NDPRMK-------NLIEYARKVEKDMFET 57 (81)
T ss_dssp -HHHHHHHHHHHHHHHS-S---SSCCCC-CSHHHH-------HHHHHHHHHHHHHHHC
T ss_pred CHHHHHHHHHHHHHhhCCC---CChhhh-hhHHHH-------HHHHHHHHHHHHHHHH
Confidence 3456777 777777754 566544 578888 4666777788776543
No 38
>PF12958 DUF3847: Protein of unknown function (DUF3847); InterPro: IPR024215 This entry represents a family of uncharacterised proteins that were found by clustering human gut metagenomic sequences [].
Probab=39.37 E-value=8.2 Score=30.36 Aligned_cols=24 Identities=33% Similarity=0.621 Sum_probs=18.4
Q ss_pred ccccchhhHHh-hhcCCCCCCCCcc
Q psy1937 14 RYLRNHRLINE-IFSDSVVPDVRSV 37 (307)
Q Consensus 14 ~~~rnh~Li~~-~fs~~~Vpd~rsl 37 (307)
|=.||||||.. .+-.+++|++..+
T Consensus 39 RK~RtHRLi~rGa~lEsi~~e~~~l 63 (86)
T PF12958_consen 39 RKERTHRLIERGAILESIFPEPKDL 63 (86)
T ss_pred HHHHHHHHHHhhHHHHHHhhcchhc
Confidence 55799999994 6667888888554
No 39
>PF12210 Hrs_helical: Hepatocyte growth factor-regulated tyrosine kinase substrate; InterPro: IPR024641 This domain comprises the helical region of hepatocyte growth factor-regulated tyrosine kinase substrate (HRS). It is approximately 100 amino acids in length. Hrs, together with signal transducing adaptor molecule (STAM), forms the ESCRT-0 complex, which sorts ubiquitinated cell surface receptors to lysosomes for degradation []. ; PDB: 3F1I_H.
Probab=38.46 E-value=78 Score=25.42 Aligned_cols=56 Identities=25% Similarity=0.317 Sum_probs=26.3
Q ss_pred hhcccCCCCcchhhhHHHHHHHH--HHHHHHHHHHHHHHHH----------HHHHHHHHHHHHhHHhh
Q psy1937 228 IFSDSVVPDVRSVVTTNRMQVLK--RQVQSLTMHQKKLEAE----------LQQIEEKFEAKKRKFVE 283 (307)
Q Consensus 228 ifs~~~v~d~rs~~~~~r~~~l~--~~~~sL~~hq~kle~e----------l~~~e~~~~~kk~~~~~ 283 (307)
|.+|+.|..+-.-+|.-.-++|+ .+.+....|...|..- |..|++.|.+|+++..|
T Consensus 28 IanDsaVqsLF~~lt~mH~~LL~~i~~~ee~R~~~E~lQdkL~qi~eAR~AlDalR~eH~~klrr~aE 95 (96)
T PF12210_consen 28 IANDSAVQSLFQTLTAMHPQLLKYIQEQEEKRVYYEGLQDKLAQIKEARAALDALREEHREKLRRQAE 95 (96)
T ss_dssp GGG-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-
T ss_pred CcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 33444333333334444444443 3334444444444444 44556778777775544
No 40
>PF07200 Mod_r: Modifier of rudimentary (Mod(r)) protein; InterPro: IPR009851 This entry represents a conserved region approximately 150 residues long within a number of eukaryotic proteins that show homology with Drosophila melanogaster Modifier of rudimentary (Mod(r)) proteins. The N-terminal half of Mod(r) proteins is acidic, whereas the C-terminal half is basic [], and both of these regions are represented in this family.; PDB: 2CAZ_F 2P22_C 2F66_F.
Probab=33.77 E-value=2.1e+02 Score=23.97 Aligned_cols=70 Identities=17% Similarity=0.374 Sum_probs=35.2
Q ss_pred cccchhhHHhhhcccCCCCcchhhh------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhhhh
Q psy1937 218 YLRNHRLINEIFSDSVVPDVRSVVT------------TNRMQVLKRQVQSLTMHQKKLEAELQQIEEKFEAKKRKFVESS 285 (307)
Q Consensus 218 f~rNH~l~~Eifs~~~v~d~rs~~~------------~~r~~~l~~~~~sL~~hq~kle~el~~~e~~~~~kk~~~~~~s 285 (307)
..-||.++.+++.. .|.+..+.. ..++-.++.++..+..+...+-.++..++..|..+..++-.-+
T Consensus 12 Ll~d~~~l~~~v~~--l~~~~~~~~~~~~l~~~n~~lAe~nL~~~~~l~~~r~~l~~~~~~~~~L~~~~~~k~~~~~~l~ 89 (150)
T PF07200_consen 12 LLSDEEKLDAFVKS--LPQVQELQQEREELLAENEELAEQNLSLEPELEELRSQLQELYEELKELESEYQEKEQQQDELS 89 (150)
T ss_dssp HHHH-HHHHHHGGG--GS--HHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHcCHHHHHHHHHc--CHHHHHHHHHHHHHHHHHHHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44566667776654 344333222 1122233345555555556666667777777777666666655
Q ss_pred HHHH
Q psy1937 286 EQFQ 289 (307)
Q Consensus 286 ~~f~ 289 (307)
..|.
T Consensus 90 ~~~s 93 (150)
T PF07200_consen 90 SNYS 93 (150)
T ss_dssp HCHH
T ss_pred ccCC
Confidence 5554
No 41
>PF07334 IFP_35_N: Interferon-induced 35 kDa protein (IFP 35) N-terminus; InterPro: IPR009938 This entry represents the N terminus of interferon-induced 35 kDa protein (IFP 35) (approximately 80 residues long), which contains a leucine zipper motif in an alpha helical configuration []. This group of proteins also includes N-myc-interactor (Nmi), a homologous interferon-induced protein.
Probab=32.76 E-value=98 Score=23.82 Aligned_cols=30 Identities=37% Similarity=0.460 Sum_probs=20.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy1937 248 VLKRQVQSLTMHQKKLEAELQQIEEKFEAK 277 (307)
Q Consensus 248 ~l~~~~~sL~~hq~kle~el~~~e~~~~~k 277 (307)
.|...-..|.+-.+|||+||++....|+=+
T Consensus 4 ei~eEn~~Lk~eiqkle~ELq~~~~~~qIk 33 (76)
T PF07334_consen 4 EIQEENARLKEEIQKLEAELQQNKREFQIK 33 (76)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhhc
Confidence 344444567777778888888877776643
No 42
>PRK09039 hypothetical protein; Validated
Probab=32.66 E-value=1.9e+02 Score=28.17 Aligned_cols=54 Identities=19% Similarity=0.306 Sum_probs=37.7
Q ss_pred cchhhhHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhhhhHHHHH
Q psy1937 237 VRSVVTTN--RMQVLKRQVQSLTMHQKKLEAELQQIEEKFEAKKRKFVESSEQFQE 290 (307)
Q Consensus 237 ~rs~~~~~--r~~~l~~~~~sL~~hq~kle~el~~~e~~~~~kk~~~~~~s~~f~~ 290 (307)
.+.+.+.+ .+..|+.|+..|......|+++|...+++-.+...++.+-...+++
T Consensus 128 ~k~~~se~~~~V~~L~~qI~aLr~Qla~le~~L~~ae~~~~~~~~~i~~L~~~L~~ 183 (343)
T PRK09039 128 EKQVSARALAQVELLNQQIAALRRQLAALEAALDASEKRDRESQAKIADLGRRLNV 183 (343)
T ss_pred HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44444433 3777888888888888889988888888877766666554444433
No 43
>PF00038 Filament: Intermediate filament protein; InterPro: IPR016044 Intermediate filaments (IF) [, , ] are proteins which are primordial components of the cytoskeleton and the nuclear envelope. They generally form filamentous structures 8 to 14 nm wide. IF proteins are members of a very large multigene family of proteins which has been subdivided in five major subgroups: Type I: Acidic cytokeratins. Type II: Basic cytokeratins. Type III: Vimentin, desmin, glial fibrillary acidic protein (GFAP), peripherin, and plasticin. Type IV: Neurofilaments L, H and M, alpha-internexin and nestin. Type V: Nuclear lamins A, B1, B2 and C. All IF proteins are structurally similar in that they consist of: a central rod domain comprising some 300 to 350 residues which is arranged in coiled-coiled alpha-helices, with at least two short characteristic interruptions; a N-terminal non-helical domain (head) of variable length; and a C-terminal domain (tail) which is also non-helical, and which shows extreme length variation between different IF proteins. While IF proteins are evolutionary and structurally related, they have limited sequence homologies except in several regions of the rod domain. This entry represents the central rod domain found in IF proteins.; PDB: 3TNU_B 3KLT_D 1GK4_F 3TRT_A 3G1E_A 3UF1_C 1GK6_B 1GK7_A 3TYY_B 3V4W_A ....
Probab=31.73 E-value=2.8e+02 Score=25.94 Aligned_cols=53 Identities=21% Similarity=0.477 Sum_probs=39.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhhhhHHHHHHHHhh
Q psy1937 243 TNRMQVLKRQVQSLTMHQKKLEAELQQIEEKFEAKKRKFVESSEQFQEELKKE 295 (307)
Q Consensus 243 ~~r~~~l~~~~~sL~~hq~kle~el~~~e~~~~~kk~~~~~~s~~f~~~~~~~ 295 (307)
...++.|..++.+|...-..|+..|..++..+......|...-.....+|..+
T Consensus 222 r~~~~~l~~el~~l~~~~~~Le~~l~~le~~~~~~~~~~~~~i~~le~el~~l 274 (312)
T PF00038_consen 222 RRQIQSLQAELESLRAKNASLERQLRELEQRLDEEREEYQAEIAELEEELAEL 274 (312)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HhhhhHhhhhhhccccchhhhhhhHHHHHHHHHHHHHHHHHhhhccchhHHHH
Confidence 33467777777788888888888888888888888777777766666666554
No 44
>cd07598 BAR_FAM92 The Bin/Amphiphysin/Rvs (BAR) domain of Family with sequence similarity 92 (FAM92). BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions including organelle biogenesis, membrane trafficking or remodeling, and cell division and migration. This group is composed of proteins from the family with sequence similarity 92 (FAM92), which were originally identified by the presence of the unknown domain DUF1208. This domain shows similarity to the BAR domains of sorting nexins. Mammals contain at least two member types, FAM92A and FAM92B, which may exist in many variants. The Xenopus homolog of FAM92A1, xVAP019, is essential for embryo survival and cell differentiation. FAM92A1 may be involved in regulating cell proliferation and apoptosis. BAR domains form dimers that bind to membranes, induce membrane bending and curvature, and may also be involved in protein-protein interactions.
Probab=27.36 E-value=2.7e+02 Score=25.22 Aligned_cols=50 Identities=8% Similarity=0.277 Sum_probs=41.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhhhhHHHHHHHHhhh
Q psy1937 243 TNRMQVLKRQVQSLTMHQKKLEAELQQIEEKFEAKKRKFVESSEQFQEELKKEK 296 (307)
Q Consensus 243 ~~r~~~l~~~~~sL~~hq~kle~el~~~e~~~~~kk~~~~~~s~~f~~~~~~~~ 296 (307)
.++.++++.+|.+|..|-.+||..+..+=. +.+++...++.|...+.-+.
T Consensus 3 ~~~~k~i~~~i~~lE~hl~~l~~~~~~lv~----k~~~L~~~~~~fak~~~~la 52 (211)
T cd07598 3 DNQTKFIQERITNVEKHFGELCQDFAAYTR----KTARLRDKGDELAKSINAYA 52 (211)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHhHHHHHHHHHHHH
Confidence 467899999999999999999999887654 44577888888988877664
No 45
>PF12718 Tropomyosin_1: Tropomyosin like; InterPro: IPR000533 Tropomyosins [], are a family of closely related proteins present in muscle and non-muscle cells. In striated muscle, tropomyosin mediate the interactions between the troponin complex and actin so as to regulate muscle contraction []. The role of tropomyosin in smooth muscle and non-muscle tissues is not clear. Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 parallel helices containing 2 sets of 7 alternating actin binding sites []. There are multiple cell-specific isoforms, created by differential splicing of the messenger RNA from one gene, but the proportions of the isoforms vary between different cell types. Muscle isoforms of tropomyosin are characterised by having 284 amino acid residues and a highly conserved N-terminal region, whereas non-muscle forms are generally smaller and are heterogeneous in their N-terminal region. This entry represents tropomyosin (Tmp) 1, 2 and 3. Within the yeast Tmp1 and Tmp2, biochemical and sequence analyses indicate that Tpm2 spans four actin monomers along a filament, whereas Tpm1 spans five. Despite its shorter length, Tpm2 can compete with Tpm1 for binding to F-actin. Over-expression of Tpm2 in vivo alters the axial budding of haploids to a bipolar pattern, and this can be partially suppressed by co-over-expression of Tpm1. This suggests distinct functions for the two tropomyosins, and indicates that the ratio between them is important for correct morphogenesis [].
Probab=27.25 E-value=2.9e+02 Score=23.43 Aligned_cols=46 Identities=26% Similarity=0.419 Sum_probs=34.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhhhhHHH
Q psy1937 243 TNRMQVLKRQVQSLTMHQKKLEAELQQIEEKFEAKKRKFVESSEQF 288 (307)
Q Consensus 243 ~~r~~~l~~~~~sL~~hq~kle~el~~~e~~~~~kk~~~~~~s~~f 288 (307)
..++.-+-.+|.+|...-..||.+|..+++....-+.++.++...+
T Consensus 27 e~~~~~~E~EI~sL~~K~~~lE~eld~~~~~l~~~k~~lee~~~~~ 72 (143)
T PF12718_consen 27 EQENEQKEQEITSLQKKNQQLEEELDKLEEQLKEAKEKLEESEKRK 72 (143)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHH
Confidence 3455556678888888888888888888888888887777765543
No 46
>PF07352 Phage_Mu_Gam: Bacteriophage Mu Gam like protein; InterPro: IPR009951 The Gam protein, originally characterised in Bacteriophage Mu, protects linear double stranded DNA from exonuclease degradation in vitro and in vivo []. This protein is also found in many bacterial species as part of a suspected prophage. Further studies have shown that Gam is a functional counterpart of the eukaryotic Ku protein, which has key roles in DNA repair and in certain transposition events. Gam displays DNA binding characteristics remarkably similar to those of human Ku []. In addition, Gam can interfere with Ty1 retrotransposition in Saccharomyces cerevisiae (Baker's yeast). These data reveal structural and functional parallels between bacteriophage Gam and eukaryotic Ku and suggest that their functions have been evolutionarily conserved [].; GO: 0003690 double-stranded DNA binding, 0042262 DNA protection; PDB: 2P2U_B.
Probab=26.62 E-value=3.2e+02 Score=23.02 Aligned_cols=53 Identities=17% Similarity=0.287 Sum_probs=42.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhhhhHHHHHHHHhhhccC
Q psy1937 244 NRMQVLKRQVQSLTMHQKKLEAELQQIEEKFEAKKRKFVESSEQFQEELKKEKASC 299 (307)
Q Consensus 244 ~r~~~l~~~~~sL~~hq~kle~el~~~e~~~~~kk~~~~~~s~~f~~~~~~~~~~~ 299 (307)
.+|.-|++++..+.. .+..++..|.+.+++....+...-+.|..-|.-.|...
T Consensus 10 ~ki~~l~~~~~~i~~---~~~~~I~~i~~~~~~~~~~l~~~i~~l~~~l~~y~e~~ 62 (149)
T PF07352_consen 10 RKIAELQREIARIEA---EANDEIARIKEWYEAEIAPLQNRIEYLEGLLQAYAEAN 62 (149)
T ss_dssp HHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHCT
T ss_pred HHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC
Confidence 356677777766654 55778999999999999999999999999999888543
No 47
>COG3937 Uncharacterized conserved protein [Function unknown]
Probab=26.61 E-value=1.1e+02 Score=25.03 Aligned_cols=37 Identities=32% Similarity=0.449 Sum_probs=24.0
Q ss_pred HHhhhcccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy1937 225 INEIFSDSVVPDVRSVVTTNRMQVLKRQVQSLTMHQKKLEAEL 267 (307)
Q Consensus 225 ~~Eifs~~~v~d~rs~~~~~r~~~l~~~~~sL~~hq~kle~el 267 (307)
|.+++++.- |+++.=|+.|...|+.|.....+|++++
T Consensus 70 i~~ml~~~~------~~r~~~~~~l~~rvd~Lerqv~~Lenk~ 106 (108)
T COG3937 70 IEEMLSDLE------VARQSEMDELTERVDALERQVADLENKL 106 (108)
T ss_pred HHHHHhhcc------ccccchHHHHHHHHHHHHHHHHHHHHHh
Confidence 888888732 5555556666666666666666666665
No 48
>COG3105 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=26.41 E-value=1.9e+02 Score=24.62 Aligned_cols=28 Identities=21% Similarity=0.464 Sum_probs=22.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhHHhhhh
Q psy1937 258 MHQKKLEAELQQIEEKFEAKKRKFVESS 285 (307)
Q Consensus 258 ~hq~kle~el~~~e~~~~~kk~~~~~~s 285 (307)
-+|.+|+.||+.++.++++-++.++.-+
T Consensus 34 k~q~~~q~ELe~~K~~ld~~rqel~~HF 61 (138)
T COG3105 34 KQQQKLQYELEKVKAQLDEYRQELVKHF 61 (138)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4688999999999999998887765543
No 49
>PF10211 Ax_dynein_light: Axonemal dynein light chain; InterPro: IPR019347 Axonemal dynein light chain proteins play a dynamic role in flagellar and cilial motility. Eukaryotic cilia and flagella are complex organelles consisting of a core structure, the axoneme, which is composed of nine microtubule doublets forming a cylinder that surrounds a pair of central singlet microtubules. This ultra-structural arrangement seems to be one of the most stable micro-tubular assemblies known and is responsible for the flagellar and ciliary movement of a large number of organisms ranging from protozoan to mammals. This light chain interacts directly with the N-terminal half of the heavy chains [].
Probab=24.46 E-value=4.6e+02 Score=23.24 Aligned_cols=34 Identities=18% Similarity=0.342 Sum_probs=25.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy1937 245 RMQVLKRQVQSLTMHQKKLEAELQQIEEKFEAKK 278 (307)
Q Consensus 245 r~~~l~~~~~sL~~hq~kle~el~~~e~~~~~kk 278 (307)
++..|+.....|..+...|+..+..++.+.++.+
T Consensus 128 ~i~~L~~e~~~L~~~~~~l~~~~e~~ek~~~e~~ 161 (189)
T PF10211_consen 128 EIEELEEEKEELEKQVQELKNKCEQLEKREEELR 161 (189)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5666777777788888888888777777776543
No 50
>PRK06800 fliH flagellar assembly protein H; Validated
Probab=24.38 E-value=3.3e+02 Score=24.68 Aligned_cols=42 Identities=33% Similarity=0.582 Sum_probs=24.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH---HHHHHHhHHhhhhHHHHHH
Q psy1937 250 KRQVQSLTMHQKKLEAELQQIEE---KFEAKKRKFVESSEQFQEE 291 (307)
Q Consensus 250 ~~~~~sL~~hq~kle~el~~~e~---~~~~kk~~~~~~s~~f~~~ 291 (307)
...-+.|..||+.|+.|+.+|+. ..+..|..++..-+.|..-
T Consensus 37 ~~d~~~L~~~Q~~L~~e~~~l~~eqQ~l~~er~~l~~er~~~~~~ 81 (228)
T PRK06800 37 QKDHEELLAQQKSLHKELNQLRQEQQKLERERQQLLADREQFQEH 81 (228)
T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34445566666666666665543 3445666666666666543
No 51
>PF03915 AIP3: Actin interacting protein 3; InterPro: IPR022782 This entry represents a domain found in yeast actin interacting protein 3 and bud site selection protein 6. In these proteins it is typically found towards the C terminus. It is also found in metazoan proteins, such as the mouse enhancer trap locus 4 protein. ; PDB: 3ONX_B 3OKQ_A.
Probab=24.36 E-value=2.2e+02 Score=28.75 Aligned_cols=59 Identities=19% Similarity=0.315 Sum_probs=39.2
Q ss_pred CcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhhhhHHHHHHHHhhhc
Q psy1937 236 DVRSVVTTNRMQVLKRQVQSLTMHQKKLEAELQQIEEKFEAKKRKFVESSEQFQEELKKEKA 297 (307)
Q Consensus 236 d~rs~~~~~r~~~l~~~~~sL~~hq~kle~el~~~e~~~~~kk~~~~~~s~~f~~~~~~~~~ 297 (307)
++|-=|++-|.....+|+..+......+..+|..+++-...-|.-|-.- .-.||..||.
T Consensus 231 ~LRkDV~~RgvRp~~~qle~v~kdi~~a~~~L~~m~~~i~~~kp~WkKi---WE~EL~~V~e 289 (424)
T PF03915_consen 231 DLRKDVVQRGVRPSPKQLETVAKDISRASKELKKMKEYIKTEKPIWKKI---WESELQKVCE 289 (424)
T ss_dssp HHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHH
T ss_pred HHHHHHHHcCCcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhCHHHHHH---HHHHHHHHHH
Confidence 4566666666777778888888888888889998888887766666544 4456666663
No 52
>PF11304 DUF3106: Protein of unknown function (DUF3106); InterPro: IPR021455 Some members in this family of proteins are annotated as transmembrane proteins however this cannot be confirmed. Currently no function is known.
Probab=23.99 E-value=2.1e+02 Score=23.13 Aligned_cols=39 Identities=15% Similarity=0.329 Sum_probs=20.8
Q ss_pred HHHHHcCCCCHHHHHHHHHh--hccCCChHHhHHHHHHHHH
Q psy1937 105 QVKAQNLDLKLWEIGKIIGQ--MWRDLPEDQKTEYVEDYEQ 143 (307)
Q Consensus 105 ~v~~e~P~~~~~eisk~lg~--~Wk~Ls~eeK~~Y~~~ae~ 143 (307)
.+...++.++..+-.++... .|.+||++++..--+.+..
T Consensus 33 ~~a~r~~~mspeqq~r~~~rm~~W~~LspeqR~~~R~~~~~ 73 (107)
T PF11304_consen 33 QIAERWPSMSPEQQQRLRERMRRWAALSPEQRQQARENYQR 73 (107)
T ss_pred HHHHHHhcCCHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHH
Confidence 34445566665554444442 3666666666554444443
No 53
>PRK15335 type III secretion system protein SpaM; Provisional
Probab=23.20 E-value=1.9e+02 Score=24.64 Aligned_cols=45 Identities=20% Similarity=0.533 Sum_probs=28.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHhHHhhhhHHHH
Q psy1937 245 RMQVLKRQVQSLTMHQKKLEA---ELQQIEEKFEAKKRKFVESSEQFQ 289 (307)
Q Consensus 245 r~~~l~~~~~sL~~hq~kle~---el~~~e~~~~~kk~~~~~~s~~f~ 289 (307)
|-.|++||.++|.-.--.+++ ||+.-.+.++.+..-|+.-++++.
T Consensus 74 rqaivRRQ~~~L~Lq~~~iqEKr~elqkeke~~~k~~~yWLRKe~kY~ 121 (147)
T PRK15335 74 KQSIVRRQIKDLELQIIQIQEKRSELEKKREEFQKKSKYWLRKEGNYQ 121 (147)
T ss_pred HHHHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHhccchH
Confidence 467888888887655444443 555556666666666666665543
No 54
>PRK10361 DNA recombination protein RmuC; Provisional
Probab=23.11 E-value=4e+02 Score=27.44 Aligned_cols=52 Identities=13% Similarity=0.315 Sum_probs=34.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhhhhHHHHHHHHhhh
Q psy1937 245 RMQVLKRQVQSLTMHQKKLEAELQQIEEKFEAKKRKFVESSEQFQEELKKEK 296 (307)
Q Consensus 245 r~~~l~~~~~sL~~hq~kle~el~~~e~~~~~kk~~~~~~s~~f~~~~~~~~ 296 (307)
+...+..+...+......|++.|...++.+++++.-+.++.+.+..+.+-+.
T Consensus 68 ~l~~~~~~~~~~~~~~~~l~~~le~~~~~~~ek~~~l~~~~~~L~~~F~~LA 119 (475)
T PRK10361 68 EVRSLQSINTSLEADLREVTTRMEAAQQHADDKIRQMINSEQRLSEQFENLA 119 (475)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3445555566666667777777777777777777777766666666665543
No 55
>PF09006 Surfac_D-trimer: Lung surfactant protein D coiled-coil trimerisation; InterPro: IPR015097 This domain is found in the SFTPD family, which includes lung surfactant protein D (SFTPD), conglutinin, collectin-43 and collectin-46. It forms a triple-helical parallel coiled coil, and mediates trimerisation of the protein []. ; PDB: 4DN8_A 3G84_A 2RIE_C 3IKR_B 1B08_A 2GGX_B 2OS9_C 2ORK_B 1PWB_A 2RIA_C ....
Probab=22.96 E-value=2.3e+02 Score=19.78 Aligned_cols=23 Identities=30% Similarity=0.512 Sum_probs=12.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Q psy1937 247 QVLKRQVQSLTMHQKKLEAELQQ 269 (307)
Q Consensus 247 ~~l~~~~~sL~~hq~kle~el~~ 269 (307)
..|+.||..|....+-|+.-+.+
T Consensus 2 ~aLrqQv~aL~~qv~~Lq~~fs~ 24 (46)
T PF09006_consen 2 NALRQQVEALQGQVQRLQAAFSQ 24 (46)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHH
T ss_pred hHHHHHHHHHHHHHHHHHHHHHH
Confidence 45777777655555555544433
No 56
>KOG0241|consensus
Probab=22.53 E-value=1e+02 Score=34.64 Aligned_cols=90 Identities=24% Similarity=0.302 Sum_probs=51.9
Q ss_pred cccCCCCcccccc-ccccccchhhHHhhhcccCCCCcchhhhHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy1937 202 DEQEDGYSVKHVA-YARYLRNHRLINEIFSDSVVPDVRSVVTTNRMQVLKR---QVQSLTMHQKKLEAELQQIEEKFEAK 277 (307)
Q Consensus 202 dd~ddg~S~k~~a-~~Rf~rNH~l~~Eifs~~~v~d~rs~~~~~r~~~l~~---~~~sL~~hq~kle~el~~~e~~~~~k 277 (307)
|+||+.||+=+-| .|+-.-||-.+||==.-.+..|+|--|-+=|+++-+. ++..|.-|-.++|.=+.+|...+++|
T Consensus 333 dnyeeTlStLRYadrAkrIvN~avvNedpnarvirElReEve~lr~qL~~ae~~~~~el~e~l~esekli~ei~~twEEk 412 (1714)
T KOG0241|consen 333 DNYEETLSTLRYADRAKRIVNHAVVNEDPNARVIRELREEVEKLREQLEQAEAMKLPELKEKLEESEKLIKEITVTWEEK 412 (1714)
T ss_pred cchHHHHHHHHHHHHHHHhhccccccCCchHHHHHHHHHHHHHHHHHHhhhhhccchHHHHHHHHHHHHHHHHHhHHHHH
Confidence 6788888874432 3555666665555333333344444455555554442 33445556666666677888888888
Q ss_pred HhHHhhhhHHHHHHHHhh
Q psy1937 278 KRKFVESSEQFQEELKKE 295 (307)
Q Consensus 278 k~~~~~~s~~f~~~~~~~ 295 (307)
.++.. ..+.|+++.
T Consensus 413 l~ktE----~in~erq~~ 426 (1714)
T KOG0241|consen 413 LRKTE----EINQERQAQ 426 (1714)
T ss_pred HHHHH----HHHHHHHHH
Confidence 76654 445555544
No 57
>PF09340 NuA4: Histone acetyltransferase subunit NuA4; InterPro: IPR015418 The NuA4 histone acetyltransferase (HAT) multisubunit complex is responsible for acetylation of histone H4 and H2A N-terminal tails in yeast []. NuA4 complexes are highly conserved in eukaryotes and play primary roles in transcription, cellular response to DNA damage, and cell cycle control [].
Probab=21.99 E-value=2.1e+02 Score=21.92 Aligned_cols=32 Identities=28% Similarity=0.530 Sum_probs=18.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhh
Q psy1937 252 QVQSLTMHQKKLEAELQQIEEKFEAKKRKFVE 283 (307)
Q Consensus 252 ~~~sL~~hq~kle~el~~~e~~~~~kk~~~~~ 283 (307)
++..|....++|+.+|..||..--.+=-.+++
T Consensus 3 ~L~~l~~~k~~Le~~L~~lE~qIy~~Et~YL~ 34 (80)
T PF09340_consen 3 ELKELLQKKKKLEKDLAALEKQIYDKETSYLE 34 (80)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34456666666666666666655444444444
No 58
>PRK10328 DNA binding protein, nucleoid-associated; Provisional
Probab=21.43 E-value=2.6e+02 Score=23.69 Aligned_cols=55 Identities=15% Similarity=0.354 Sum_probs=29.6
Q ss_pred HHhhhcccCCCCcchhhhHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHH
Q psy1937 225 INEIFSDSVVPDVRSVVTTNR---MQVLKRQVQSLTMHQKKLEAELQQIEEKFEAKKRKF 281 (307)
Q Consensus 225 ~~Eifs~~~v~d~rs~~~~~r---~~~l~~~~~sL~~hq~kle~el~~~e~~~~~kk~~~ 281 (307)
|+|++ -+...+||+....| +..|..=.+-|......-.++.......-+++..++
T Consensus 1 ms~~l--k~l~n~R~lra~~re~~~e~Lee~~ekl~~vv~er~~~~~~~~~~~~er~~~l 58 (134)
T PRK10328 1 MSVML--QSLNNIRTLRAMAREFSIDVLEEMLEKFRVVTKERREEEEQQQRELAERQEKI 58 (134)
T ss_pred CcHHH--HHHhhHHHHHHHHHhCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45555 24557888877776 455555555555555554444444444444444433
No 59
>PF04340 DUF484: Protein of unknown function, DUF484; InterPro: IPR007435 This family consists of several proteins of uncharacterised function.; PDB: 3E98_B.
Probab=20.42 E-value=2.2e+02 Score=25.66 Aligned_cols=63 Identities=24% Similarity=0.428 Sum_probs=19.9
Q ss_pred CCccccccccccccchhh----HHhhhcccCCCCcchhhhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy1937 207 GYSVKHVAYARYLRNHRL----INEIFSDSVVPDVRSVVTTNR-MQVLKRQVQSLTMHQKKLEAELQQIEEKFEA 276 (307)
Q Consensus 207 g~S~k~~a~~Rf~rNH~l----~~Eifs~~~v~d~rs~~~~~r-~~~l~~~~~sL~~hq~kle~el~~~e~~~~~ 276 (307)
.++...|+. |-++|.- ..|+|..-.||... .+ +....+|++.|...-+.|+.+|..|-....+
T Consensus 5 ~l~~~~V~~--yL~~~PdFf~~~~~ll~~l~~ph~~-----~~avSL~erQ~~~LR~~~~~L~~~l~~Li~~Ar~ 72 (225)
T PF04340_consen 5 ALDAEDVAA--YLRQHPDFFERHPELLAELRLPHPS-----GGAVSLVERQLERLRERNRQLEEQLEELIENARE 72 (225)
T ss_dssp -------------------------------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred cCCHHHHHH--HHHhCcHHHHhCHHHHHHcCCCCCC-----CCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 455566666 7776631 12333333344321 22 3333477777777777777777777665543
Done!