Query psy11212
Match_columns 448
No_of_seqs 169 out of 323
Neff 5.9
Searched_HMMs 46136
Date Fri Aug 16 20:59:37 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy11212.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/11212hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2549|consensus 100.0 6E-102 1E-106 802.9 28.8 362 4-447 7-371 (576)
2 COG5095 TAF6 Transcription ini 100.0 5.3E-97 1E-101 720.7 21.1 350 6-448 4-361 (450)
3 cd08050 TAF6 TATA Binding Prot 100.0 1.9E-94 4.2E-99 734.4 30.1 341 9-447 1-341 (343)
4 COG5095 TAF6 Transcription ini 100.0 8E-38 1.7E-42 306.0 4.5 121 2-122 16-150 (450)
5 KOG2549|consensus 100.0 5.6E-37 1.2E-41 319.1 7.6 125 2-126 22-159 (576)
6 cd08050 TAF6 TATA Binding Prot 100.0 2.6E-31 5.7E-36 271.1 7.6 124 2-125 10-146 (343)
7 PF07571 DUF1546: Protein of u 99.9 1.3E-26 2.7E-31 194.7 3.9 79 369-447 1-79 (92)
8 PF02969 TAF: TATA box binding 99.7 9.3E-18 2E-22 132.3 5.4 65 6-150 2-66 (66)
9 smart00803 TAF TATA box bindin 99.3 1.2E-12 2.6E-17 103.3 5.7 64 7-150 2-65 (65)
10 PF02969 TAF: TATA box binding 98.7 4.3E-09 9.4E-14 83.2 0.7 41 2-42 14-66 (66)
11 cd00076 H4 Histone H4, one of 98.4 8.6E-07 1.9E-11 73.6 7.0 73 7-160 13-85 (85)
12 PLN00035 histone H4; Provision 98.3 1.7E-06 3.6E-11 74.1 7.2 74 7-161 29-102 (103)
13 PTZ00015 histone H4; Provision 98.3 2.4E-06 5.3E-11 73.1 7.0 73 7-160 30-102 (102)
14 smart00803 TAF TATA box bindin 97.8 1E-05 2.3E-10 63.9 1.6 41 2-42 13-65 (65)
15 PF12755 Vac14_Fab1_bd: Vacuol 97.0 0.0021 4.6E-08 54.7 6.6 82 340-437 7-88 (97)
16 cd07979 TAF9 TATA Binding Prot 96.4 0.012 2.6E-07 51.8 7.1 63 124-205 38-101 (117)
17 cd00076 H4 Histone H4, one of 95.2 0.013 2.8E-07 48.9 2.4 47 5-51 28-85 (85)
18 PLN00035 histone H4; Provision 95.0 0.017 3.8E-07 49.7 2.7 47 6-52 45-102 (103)
19 COG2036 HHT1 Histones H3 and H 94.8 0.037 8.1E-07 46.7 4.1 72 4-155 16-87 (91)
20 PF00125 Histone: Core histone 94.7 0.05 1.1E-06 43.3 4.4 24 128-151 51-74 (75)
21 PTZ00015 histone H4; Provision 94.4 0.028 6E-07 48.4 2.4 46 6-51 46-102 (102)
22 PF02985 HEAT: HEAT repeat; I 93.8 0.12 2.5E-06 34.4 4.1 30 365-404 1-30 (31)
23 smart00417 H4 Histone H4. 93.0 0.15 3.3E-06 41.4 4.2 37 7-43 13-49 (74)
24 smart00576 BTP Bromodomain tra 92.5 0.37 8.1E-06 39.0 6.0 32 124-155 43-74 (77)
25 PF13513 HEAT_EZ: HEAT-like re 91.1 0.064 1.4E-06 39.9 -0.0 55 388-444 1-55 (55)
26 PF12348 CLASP_N: CLASP N term 90.1 0.48 1E-05 45.1 5.0 102 295-406 19-126 (228)
27 KOG3467|consensus 90.0 0.23 5E-06 41.4 2.3 41 120-161 62-102 (103)
28 cd07981 TAF12 TATA Binding Pro 87.1 0.48 1E-05 38.1 2.4 34 119-152 34-67 (72)
29 PF11864 DUF3384: Domain of un 84.4 15 0.00032 39.5 12.9 97 277-373 64-185 (464)
30 smart00428 H3 Histone H3. 83.2 0.86 1.9E-05 39.5 2.3 34 121-154 70-103 (105)
31 PF07524 Bromo_TP: Bromodomain 82.2 1.1 2.4E-05 36.0 2.5 36 119-154 38-73 (77)
32 cd00020 ARM Armadillo/beta-cat 80.9 3.6 7.8E-05 33.9 5.3 108 316-443 8-117 (120)
33 PF00808 CBFD_NFYB_HMF: Histon 80.8 0.94 2E-05 35.2 1.5 31 119-149 35-65 (65)
34 KOG2023|consensus 80.1 16 0.00035 41.2 11.0 138 283-427 215-405 (885)
35 cd00020 ARM Armadillo/beta-cat 78.3 11 0.00023 31.0 7.4 70 315-402 49-119 (120)
36 PF01602 Adaptin_N: Adaptin N 78.0 10 0.00022 40.4 8.8 138 278-444 37-179 (526)
37 PF02291 TFIID-31kDa: Transcri 76.7 1.8 3.9E-05 38.9 2.3 63 120-201 45-108 (129)
38 KOG2171|consensus 73.7 20 0.00043 42.4 10.1 153 282-444 3-186 (1075)
39 PLN00160 histone H3; Provision 70.2 2.9 6.3E-05 35.8 1.9 35 120-154 60-94 (97)
40 KOG1242|consensus 70.0 35 0.00075 37.9 10.4 145 283-445 133-282 (569)
41 cd07979 TAF9 TATA Binding Prot 67.4 6.4 0.00014 34.6 3.5 75 4-96 15-101 (117)
42 KOG2171|consensus 67.4 22 0.00047 42.1 8.6 100 311-428 840-939 (1075)
43 PF06371 Drf_GBD: Diaphanous G 66.7 7.7 0.00017 35.6 4.1 78 311-402 107-186 (187)
44 PRK09687 putative lyase; Provi 66.5 76 0.0016 31.9 11.5 29 415-445 159-187 (280)
45 PTZ00018 histone H3; Provision 66.4 3.9 8.5E-05 37.1 2.0 36 120-155 100-135 (136)
46 PF12348 CLASP_N: CLASP N term 65.9 72 0.0016 30.1 10.8 109 284-409 95-212 (228)
47 PF12460 MMS19_C: RNAPII trans 64.1 78 0.0017 33.4 11.6 116 301-425 23-145 (415)
48 PLN00121 histone H3; Provision 63.4 4.6 0.0001 36.6 1.9 36 120-155 100-135 (136)
49 KOG0212|consensus 63.3 36 0.00078 37.8 8.8 79 338-430 62-140 (675)
50 PF10274 ParcG: Parkin co-regu 62.2 59 0.0013 31.0 9.1 93 310-407 71-168 (183)
51 PF14500 MMS19_N: Dos2-interac 60.4 62 0.0013 32.3 9.5 127 289-430 5-138 (262)
52 PF15630 CENP-S: Kinetochore c 59.4 3.4 7.3E-05 33.8 0.3 22 129-150 50-71 (76)
53 PRK09687 putative lyase; Provi 59.4 1.2E+02 0.0026 30.5 11.4 53 385-447 170-222 (280)
54 KOG2023|consensus 55.6 66 0.0014 36.6 9.3 144 272-423 461-615 (885)
55 smart00427 H2B Histone H2B. 55.4 10 0.00022 32.0 2.5 31 122-152 37-67 (89)
56 KOG3334|consensus 53.4 12 0.00027 34.1 2.8 37 121-157 47-83 (148)
57 PF01602 Adaptin_N: Adaptin N 52.9 49 0.0011 35.2 7.9 83 282-373 113-201 (526)
58 PLN00161 histone H3; Provision 52.1 12 0.00027 33.8 2.6 37 121-157 95-131 (135)
59 PF13513 HEAT_EZ: HEAT-like re 51.2 17 0.00037 26.6 2.9 41 351-401 15-55 (55)
60 PTZ00429 beta-adaptin; Provisi 50.9 1.8E+02 0.0038 33.7 12.1 143 280-445 65-207 (746)
61 KOG1851|consensus 50.1 51 0.0011 40.6 7.8 113 315-444 1522-1636(1710)
62 cd00074 H2A Histone 2A; H2A is 48.4 37 0.00081 29.9 5.0 24 128-151 62-85 (115)
63 KOG1242|consensus 47.2 1.3E+02 0.0029 33.5 10.0 137 275-421 287-463 (569)
64 PF10508 Proteasom_PSMB: Prote 45.7 41 0.00088 36.6 5.9 88 302-403 60-148 (503)
65 PF14911 MMS22L_C: S-phase gen 43.5 97 0.0021 32.8 8.0 75 343-425 231-310 (373)
66 PTZ00463 histone H2B; Provisio 42.9 19 0.00042 31.8 2.3 30 123-152 65-94 (117)
67 PF04648 MF_alpha: Yeast matin 42.5 13 0.00027 20.5 0.7 11 99-109 2-12 (13)
68 PLN00158 histone H2B; Provisio 41.5 22 0.00048 31.4 2.5 30 123-152 64-93 (116)
69 KOG0870|consensus 40.6 17 0.00036 34.0 1.7 40 119-158 45-84 (172)
70 KOG1824|consensus 40.3 78 0.0017 37.3 7.1 107 313-427 514-623 (1233)
71 KOG1967|consensus 40.0 1.5E+02 0.0032 34.9 9.2 141 281-429 861-1009(1030)
72 PF13646 HEAT_2: HEAT repeats; 39.0 28 0.0006 27.4 2.6 50 385-446 11-60 (88)
73 KOG1241|consensus 38.4 1.2E+02 0.0026 34.9 8.1 114 315-442 360-476 (859)
74 PF03224 V-ATPase_H_N: V-ATPas 37.8 1.2E+02 0.0025 30.7 7.5 148 275-444 24-177 (312)
75 PF12717 Cnd1: non-SMC mitotic 36.1 3.2E+02 0.007 25.1 9.6 57 336-405 38-94 (178)
76 KOG1248|consensus 35.2 2.2E+02 0.0047 34.4 9.8 113 278-408 780-903 (1176)
77 PF12074 DUF3554: Domain of un 34.3 1.9E+02 0.0042 29.4 8.5 91 323-426 164-255 (339)
78 PF12460 MMS19_C: RNAPII trans 32.4 1.2E+02 0.0027 31.9 6.9 88 281-372 317-414 (415)
79 KOG0869|consensus 31.9 29 0.00063 32.3 1.8 36 120-155 67-102 (168)
80 PF10363 DUF2435: Protein of u 30.0 87 0.0019 26.3 4.2 56 388-446 17-72 (92)
81 PF08389 Xpo1: Exportin 1-like 29.4 1.6E+02 0.0034 25.3 6.1 81 298-398 67-148 (148)
82 PRK13800 putative oxidoreducta 29.2 3.1E+02 0.0067 32.1 10.0 35 385-428 849-883 (897)
83 PF04286 DUF445: Protein of un 28.6 98 0.0021 31.2 5.2 57 272-328 202-264 (367)
84 PF06025 DUF913: Domain of Unk 28.1 93 0.002 32.8 5.0 96 315-425 151-250 (379)
85 COG2036 HHT1 Histones H3 and H 27.9 46 0.00099 28.2 2.2 44 7-50 36-90 (91)
86 COG5593 Nucleic-acid-binding p 27.3 1.2E+02 0.0027 33.6 5.7 118 308-442 162-292 (821)
87 PRK00034 gatC aspartyl/glutamy 27.0 2.6E+02 0.0057 23.0 6.7 36 7-43 2-37 (95)
88 PF13646 HEAT_2: HEAT repeats; 26.9 3E+02 0.0065 21.3 8.3 57 364-441 31-87 (88)
89 KOG1745|consensus 26.6 21 0.00046 32.4 -0.1 31 124-154 105-135 (137)
90 PF12719 Cnd3: Nuclear condens 26.4 83 0.0018 31.6 4.2 56 386-443 76-140 (298)
91 PF09862 DUF2089: Protein of u 26.0 73 0.0016 28.1 3.2 28 398-425 52-79 (113)
92 PRK07668 hypothetical protein; 25.8 81 0.0018 31.6 3.9 33 276-308 1-35 (254)
93 smart00567 EZ_HEAT E-Z type HE 25.7 87 0.0019 20.0 2.8 29 388-426 1-29 (30)
94 KOG0212|consensus 25.3 4.7E+02 0.01 29.5 9.7 115 280-408 118-242 (675)
95 PF05004 IFRD: Interferon-rela 24.6 3.4E+02 0.0074 27.7 8.3 72 357-438 79-152 (309)
96 KOG1823|consensus 24.3 1.2E+02 0.0026 37.1 5.4 90 343-445 544-638 (1364)
97 PF13251 DUF4042: Domain of un 23.6 1.1E+02 0.0023 29.2 4.0 53 343-404 6-70 (182)
98 smart00544 MA3 Domain in DAP-5 23.3 2.8E+02 0.006 23.2 6.3 43 360-412 32-74 (113)
99 PF14376 Haem_bd: Haem-binding 23.0 80 0.0017 28.4 3.0 77 115-211 30-119 (137)
100 KOG1241|consensus 22.8 4.3E+02 0.0093 30.7 9.0 57 385-442 566-625 (859)
101 PF03810 IBN_N: Importin-beta 22.6 2.6E+02 0.0057 21.3 5.6 38 387-424 29-75 (77)
102 PF03847 TFIID_20kDa: Transcri 22.0 56 0.0012 26.1 1.5 34 118-151 31-64 (68)
103 COG5116 RPN2 26S proteasome re 21.7 3.4E+02 0.0073 30.7 7.8 158 278-440 511-687 (926)
104 PF12170 DNA_pol3_tau_5: DNA p 21.3 2E+02 0.0043 26.2 5.2 54 267-320 56-133 (142)
105 PF04192 Utp21: Utp21 specific 21.0 2.5E+02 0.0053 27.7 6.2 58 285-346 126-185 (237)
106 COG5094 TAF9 Transcription ini 20.8 76 0.0016 28.5 2.2 34 119-152 46-82 (145)
107 PF04793 Herpes_BBRF1: BRRF1-l 20.3 6.3E+02 0.014 25.8 9.0 125 276-414 29-163 (284)
No 1
>KOG2549|consensus
Probab=100.00 E-value=6.3e-102 Score=802.93 Aligned_cols=362 Identities=54% Similarity=0.862 Sum_probs=335.8
Q ss_pred cCCCCC-HHHHHHHHHhcCCCCccHHHHHHhHHhHHHHHHhhcccccCCCCCCCccccccCCCCcceeeccCccccHHHH
Q psy11212 4 YGTSFS-IESMKVIAESIGIPNLPDDAAKELSDNITYCCKVIIQPIYGLTVSQFLPFKHASGGGRELHFTDEKEVDFEDL 82 (448)
Q Consensus 4 ~~~~~~-~~~~~~~a~~~~~~~~~~~~~k~m~~d~~~~l~~~~ep~~g~~~~~~~~f~~~~~~~~~~~~~~d~e~d~~~~ 82 (448)
..+.++ .|..|.+||++||.+|.+|+++.+++|++|++|+.++-
T Consensus 7 ~~~v~s~~Es~k~vAEslGi~nl~deaa~~La~dv~yrikEI~Q~----------------------------------- 51 (576)
T KOG2549|consen 7 SPTVVSPKESVKVVAESLGITNLNDEAALLLAEDVEYRIKEIVQD----------------------------------- 51 (576)
T ss_pred cccccCcHHHHHHHHHHhCccccCHHHHHHHHHHHHHHHHHHHHH-----------------------------------
Confidence 345556 99999999999999999999999999999999997766
Q ss_pred hhccCCCCCcccceeeeEEEeeccCCCCCCCCCCCCcchhhhhhhhhhhhcccCCccCChhHHHHHHHhcCCCCCcCCCC
Q psy11212 83 LTNLNPKSALETHLRTHWLAIEGVQPTVPENPPPVDKSAQKLESISFKCMSHSKRVKLLPVDIDDALNMKNIEPIYGLTV 162 (448)
Q Consensus 83 ~~~~~p~~~~~~~~~~hwlaieg~qp~ip~np~~~~~~~q~l~~~A~Kfm~hskR~kLt~~DIn~AL~~~niEPLyGy~s 162 (448)
|.|||+||||++||++||++|||++|+||+|||.+
T Consensus 52 ---------------------------------------------aaKfm~hskR~kLtv~DV~~ALr~~nVep~yg~~s 86 (576)
T KOG2549|consen 52 ---------------------------------------------AAKFMVHSKRTKLTVDDVDYALRSLNVEPLYGFGA 86 (576)
T ss_pred ---------------------------------------------HHHHhhcCCCCcCcHHHHHHHHhhcccccccCccc
Confidence 99999999999999999999999999999999999
Q ss_pred CCccCceecCCCCceeEeeccccccHHHHHhccCCCCCCccceEEEEEEecCccCCCCCCCCCCCccccccccCCccccc
Q psy11212 163 SQFLPFKHASGGGRELHFTDEKEVDFEDLLTNLNPKSALETHLRTHWLAIEGVQPTVPENPPPVDKSAQKLESIDPISKL 242 (448)
Q Consensus 163 ~~~~~f~~~s~~g~~l~~~~D~evdL~~ii~~~lpkiP~~~~~~aHWLaIeGvQP~IPeNp~~~~~~~~~~~~~~p~~~~ 242 (448)
.+.++|+.++++|+++||.+|+||||++++++||||+|+++++++|||+||||||+|||||++.+++.++.+...+....
T Consensus 87 ~~~i~fr~a~~~~r~ly~~~d~EVD~d~~i~~plpkvP~d~~i~sHWL~iEGVQP~iPeNp~p~~~~~~~~~~~~~~~~~ 166 (576)
T KOG2549|consen 87 QEIIPFRKASGGGRELYYPEDEEVDLDEIINAPLPKVPLDVSIKSHWLAIEGVQPAIPENPPPLEKNVQKKEVIEPAIKA 166 (576)
T ss_pred CceeeccccCCCCceeecCchhhcchhhhhcCCCCCCCccceeeeeeeeeccccccCCCCCCccccchhhhcccchhhhc
Confidence 99999999998889999999999999999999999999999999999999999999999999999877665655554433
Q ss_pred CCCCCCCCCCCc--hhHHhhhccccceeeeccccccchHHHHHHHHHHHHHhcCCCHHHHHHHHHhhhcCCCcccchhhH
Q psy11212 243 GKKDKDTSGKPT--SAKLEKLRNVETVHVKQLATHELSVEQQLYYKEITEACVGSDEGRRFEALQSLATDPGLHEMLPRM 320 (448)
Q Consensus 243 ~~~~~~~~g~~~--~~~~~~~~~~~~v~vkp~~~h~LSkElQlyf~~It~al~~~~e~~r~~aL~sL~tD~gL~qLlPyf 320 (448)
..+. ..++++ +.+...+...+++.+||+++|+||+|||+||++||++|+|+|+.+|++||++|+||+|||||||||
T Consensus 167 ~~~~--~~~~~~~~~~~~~~~k~~e~~~~Kp~~~h~LS~ElQlYy~~It~a~~g~~~~~r~eAL~sL~TDsGL~~LlPyF 244 (576)
T KOG2549|consen 167 PEGN--NPGASTKSSDAVKTLKIGEPVELKPSSKHVLSVELQLYYKEITEACTGSDEPLRQEALQSLETDSGLQQLLPYF 244 (576)
T ss_pred cCCC--CCCccccccccccccccCCcccccccchhhccHHHHHHHHHHHHHHhcCCHHHHHHHHHhhccCccHHHHHHHH
Confidence 3322 222222 133444566789999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHhhhhhccccCCHHHHHHHHHHHHHHhcCCCcccccchhhHHHHHHHHhhccccCCCCCCCcchHHHHHHHHHHHH
Q psy11212 321 CTFIAEGVKVNVVQNNLALLIYLMRMVKALLDNQSLYLEKYLHELLPSILSCIVSKQLCGRPEADNHWALRDFASRLLTQ 400 (448)
Q Consensus 321 v~FI~e~V~~nl~~~nl~~L~~ll~~v~ALl~N~~l~iepYlHqLlp~lLtCll~k~l~~~~~~~~hw~LRd~AA~lL~~ 400 (448)
++||+++|+.|++++|+..|+++|+|++||++||+||+|||||+|||+||||+|+|++|.+|+.+|||+|||+||++|+.
T Consensus 245 v~fIae~vs~ni~~~nL~lL~~lm~m~rSLl~Np~i~lepYlh~L~PSvlTCvVsk~l~~~p~~dnhwaLRDfAA~ll~~ 324 (576)
T KOG2549|consen 245 VTFIAEGVSVNIVQNNLELLIYLMRMVRSLLDNPNIFLEPYLHQLVPSVLTCVVSKNLCLRPELDNHWALRDFAARLLAQ 324 (576)
T ss_pred HHHHhhheeeccccccHHHHHHHHHHHHHHhcCCccchhhHHHHHhhHHHHhhhhhhccCCccccchHHHHHHHHHHHHH
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHhcCCCcchHHHHHHHHHHHhcCCCCCccchhhhhhhhcccCCC
Q psy11212 401 VSKNFNTSTNNIQTRVTRLLSNAINDPKINFPSLYGAIAGEGLVTER 447 (448)
Q Consensus 401 I~~~~~~sy~~L~~RI~~tl~k~l~d~~~~l~t~YGAl~GL~alG~~ 447 (448)
||++|++.|++|++||++||.|+|.|+++++++|||||+||++||.+
T Consensus 325 i~k~f~~~y~~L~~Rit~tl~k~l~D~~~~~st~YGai~gL~~lg~~ 371 (576)
T KOG2549|consen 325 ICKNFSTLYNNLQPRITRTLSKALLDNKKPLSTHYGAIAGLSELGHE 371 (576)
T ss_pred HHHhhhhHHHHHHHHHHHHHHHHhcCCCCCchhhhhHHHHHHHhhhh
Confidence 99999999999999999999999999999999999999999999964
No 2
>COG5095 TAF6 Transcription initiation factor TFIID, subunit TAF6 (also component of histone acetyltransferase SAGA) [Transcription]
Probab=100.00 E-value=5.3e-97 Score=720.73 Aligned_cols=350 Identities=39% Similarity=0.623 Sum_probs=319.0
Q ss_pred CCCCHHHHHHHHHhcCCCCccHHHHHHhHHhHHHHHHhhcccccCCCCCCCccccccCCCCcceeeccCccccHHHHhhc
Q psy11212 6 TSFSIESMKVIAESIGIPNLPDDAAKELSDNITYCCKVIIQPIYGLTVSQFLPFKHASGGGRELHFTDEKEVDFEDLLTN 85 (448)
Q Consensus 6 ~~~~~~~~~~~a~~~~~~~~~~~~~k~m~~d~~~~l~~~~ep~~g~~~~~~~~f~~~~~~~~~~~~~~d~e~d~~~~~~~ 85 (448)
+-++.|++|.+||++||.+|-+|+++.++.|++|+++..||-
T Consensus 4 t~ws~et~KdvAeslGi~Ni~Dd~l~alamDlEYRI~ev~qe-------------------------------------- 45 (450)
T COG5095 4 TLWSKETLKDVAESLGISNIDDDALRALAMDLEYRIKEVCQE-------------------------------------- 45 (450)
T ss_pred eeecHHHHHHHHHHcCCcccccHHHHHHHHhHHHHHHHHHHH--------------------------------------
Confidence 457899999999999999999999999999999999998877
Q ss_pred cCCCCCcccceeeeEEEeeccCCCCCCCCCCCCcchhhhhhhhhhhhcccCCccCChhHHHHHHHhcCCCCCcCCCCCCc
Q psy11212 86 LNPKSALETHLRTHWLAIEGVQPTVPENPPPVDKSAQKLESISFKCMSHSKRVKLLPVDIDDALNMKNIEPIYGLTVSQF 165 (448)
Q Consensus 86 ~~p~~~~~~~~~~hwlaieg~qp~ip~np~~~~~~~q~l~~~A~Kfm~hskR~kLt~~DIn~AL~~~niEPLyGy~s~~~ 165 (448)
|.|||+||||++||++||+.|||.+|+||||||+++.+
T Consensus 46 ------------------------------------------a~KFmvhSKRtvLt~dDis~ALr~lNVePLyGyd~~r~ 83 (450)
T COG5095 46 ------------------------------------------ASKFMVHSKRTVLTIDDISYALRSLNVEPLYGYDPSRP 83 (450)
T ss_pred ------------------------------------------HHHHhhcccceeeeHHhHHHHHHhcCCCcccCCCCCcc
Confidence 99999999999999999999999999999999999999
Q ss_pred cCceec-CCCCceeEeeccccccHHHHHhccCCCCCCccceEEEEEEecCccCCCCCCCCCCCccccccccCCcccccCC
Q psy11212 166 LPFKHA-SGGGRELHFTDEKEVDFEDLLTNLNPKSALETHLRTHWLAIEGVQPTVPENPPPVDKSAQKLESIDPISKLGK 244 (448)
Q Consensus 166 ~~f~~~-s~~g~~l~~~~D~evdL~~ii~~~lpkiP~~~~~~aHWLaIeGvQP~IPeNp~~~~~~~~~~~~~~p~~~~~~ 244 (448)
+.|..+ .|+||.+||++|+||||+++||+||||+|+.+++++||||||||||+|||||.+.++.-.+.
T Consensus 84 l~fs~v~~g~gqsvYYlddEEvDfe~~in~PLPkVPr~~s~~sHWLAiEGVQPaIpqNP~l~d~~V~~~----------- 152 (450)
T COG5095 84 LQFSLVFRGLGQSVYYLDDEEVDFEEYINRPLPKVPRRVSIQSHWLAIEGVQPAIPQNPILLDKPVAKW----------- 152 (450)
T ss_pred hhHHHHHccCCcceEecChhhcCHHHHhcCcCccCCcCcchhhhhhhhcccCccCCCCCCccccccccc-----------
Confidence 999875 47889999999999999999999999999999999999999999999999999877643211
Q ss_pred CCCCCC----CCCchhHHh--hhccccceeeeccccccchHHHHHHHHHHHHHhcC-CCHHHHHHHHHhhhcCCCcccch
Q psy11212 245 KDKDTS----GKPTSAKLE--KLRNVETVHVKQLATHELSVEQQLYYKEITEACVG-SDEGRRFEALQSLATDPGLHEML 317 (448)
Q Consensus 245 ~~~~~~----g~~~~~~~~--~~~~~~~v~vkp~~~h~LSkElQlyf~~It~al~~-~~e~~r~~aL~sL~tD~gL~qLl 317 (448)
..+.+. |.++.+.+. .....+++++||+++|+||||+|+||++|+.++.+ ++++.|.+||+|||+|+|||||+
T Consensus 153 ~~k~T~g~~~~a~t~a~q~~n~v~~~~n~E~KplvkHvLsKELq~YF~kvisal~dEs~~~~r~aAl~sLr~dsGlhQLv 232 (450)
T COG5095 153 ASKDTLGVMPGASTAAYQARNGVTSMENAELKPLVKHVLSKELQMYFDKVISALLDESDEQTRDAALESLRNDSGLHQLV 232 (450)
T ss_pred cccCccccCcCcchHHHHHhcCCCCcccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCccHHHHH
Confidence 112222 222222211 23344789999999999999999999999999975 56778999999999999999999
Q ss_pred hhHHHHHHhhhhhccccCCHHHHHHHHHHHHHHhcCCCcccccchhhHHHHHHHHhhccccCCCCCCCcchHHHHHHHHH
Q psy11212 318 PRMCTFIAEGVKVNVVQNNLALLIYLMRMVKALLDNQSLYLEKYLHELLPSILSCIVSKQLCGRPEADNHWALRDFASRL 397 (448)
Q Consensus 318 Pyfv~FI~e~V~~nl~~~nl~~L~~ll~~v~ALl~N~~l~iepYlHqLlp~lLtCll~k~l~~~~~~~~hw~LRd~AA~l 397 (448)
|||++||+|+|+.|+ +|+..|..+|.|+.||++|++||++||+|||||++|||+++|++|+.|+.-+|++|||+||.+
T Consensus 233 PYFi~f~~eqit~Nl--~nl~~LtTv~~m~~sLL~N~~iFvdPY~hqlmPSilTcliakklg~~p~dhe~~alRd~AA~l 310 (450)
T COG5095 233 PYFIHFFNEQITKNL--KNLEKLTTVVMMYSSLLKNKYIFVDPYLHQLMPSILTCLIAKKLGNVPDDHEHYALRDVAADL 310 (450)
T ss_pred HHHHHHHHHHHHHhh--hhHHHHHHHHHHHHHHhcCCceeecHHHHHHHHHHHHHHHHHHhcCCCcchhHHHHHHHHHHH
Confidence 999999999999999 899999999999999999999999999999999999999999999998877899999999999
Q ss_pred HHHHHHHhcCCCcchHHHHHHHHHHHhcCCCCCccchhhhhhhhcccCCCC
Q psy11212 398 LTQVSKNFNTSTNNIQTRVTRLLSNAINDPKINFPSLYGAIAGEGLVTERS 448 (448)
Q Consensus 398 L~~I~~~~~~sy~~L~~RI~~tl~k~l~d~~~~l~t~YGAl~GL~alG~~~ 448 (448)
|..+|++|+++|.+|+|||+||++|+|+|..++.+|+|||+.||+.||.++
T Consensus 311 l~yV~~~F~~~YktLkPRvtrTllKafLD~~k~~sT~YGalkgls~l~ke~ 361 (450)
T COG5095 311 LKYVFSNFSSSYKTLKPRVTRTLLKAFLDREKTESTQYGALKGLSILSKEV 361 (450)
T ss_pred HHHHHhhhhHhhhhhchHHHHHHHHHHHhcccccchhhhhhhhhhhhchhh
Confidence 999999999999999999999999999999999999999999999998763
No 3
>cd08050 TAF6 TATA Binding Protein (TBP) Associated Factor 6 (TAF6) is one of several TAFs that bind TBP and is involved in forming Transcription Factor IID (TFIID) complex. The TATA Binding Protein (TBP) Associated Factor 6 (TAF6) is one of several TAFs that bind TBP and are involved in forming Transcription Factor IID (TFIID) complex. TFIID is one of seven General Transcription Factors (GTFs) (TFIIA, TFIIB, TFIID, TFIIE, TFIIF, and TFIID) that are involved in accurate initiation of transcription by RNA polymerase II in eukaryotes. TFIID plays an important role in the recognition of promoter DNA and assembly of the pre-initiation complex. TFIID complex is composed of the TBP and at least 13 TAFs. TAFs are named after their electrophoretic mobility in polyacrylamide gels in different species. A new, unified nomenclature has been suggested for the pol II TAFs to show the relationship between TAF orthologs and paralogs. Several hypotheses are proposed for TAFs functions such as serving as
Probab=100.00 E-value=1.9e-94 Score=734.40 Aligned_cols=341 Identities=53% Similarity=0.867 Sum_probs=316.1
Q ss_pred CHHHHHHHHHhcCCCCccHHHHHHhHHhHHHHHHhhcccccCCCCCCCccccccCCCCcceeeccCccccHHHHhhccCC
Q psy11212 9 SIESMKVIAESIGIPNLPDDAAKELSDNITYCCKVIIQPIYGLTVSQFLPFKHASGGGRELHFTDEKEVDFEDLLTNLNP 88 (448)
Q Consensus 9 ~~~~~~~~a~~~~~~~~~~~~~k~m~~d~~~~l~~~~ep~~g~~~~~~~~f~~~~~~~~~~~~~~d~e~d~~~~~~~~~p 88 (448)
+.|.++.+|+++|+..+.+||++.+++|+++.++..++.
T Consensus 1 ~~~~i~~ia~~~Gi~~~~~~a~~~La~~~e~~~~~i~~~----------------------------------------- 39 (343)
T cd08050 1 PQESIKLIAESLGIDSLSDEVAQLLAEDVEYRLREIIQE----------------------------------------- 39 (343)
T ss_pred ChhHHHHHHHHcCCCcCCHHHHHHHHHHHHHHHHHHHHH-----------------------------------------
Confidence 468999999999999999999999999999999997776
Q ss_pred CCCcccceeeeEEEeeccCCCCCCCCCCCCcchhhhhhhhhhhhcccCCccCChhHHHHHHHhcCCCCCcCCCCCCccCc
Q psy11212 89 KSALETHLRTHWLAIEGVQPTVPENPPPVDKSAQKLESISFKCMSHSKRVKLLPVDIDDALNMKNIEPIYGLTVSQFLPF 168 (448)
Q Consensus 89 ~~~~~~~~~~hwlaieg~qp~ip~np~~~~~~~q~l~~~A~Kfm~hskR~kLt~~DIn~AL~~~niEPLyGy~s~~~~~f 168 (448)
|.|||||+||++||++|||.|||++|+||+|||.++++++|
T Consensus 40 ---------------------------------------A~k~~~hskR~~l~~~Di~~Al~~~n~eplyG~~~~~~~~~ 80 (343)
T cd08050 40 ---------------------------------------AAKFMRHSKRRKLTTSDVNHALRLRNVEPLYGFSSSEPLPF 80 (343)
T ss_pred ---------------------------------------HHHHHHHhCCCcCCHHHHHHHHHHhCCCcccCCCCCccccc
Confidence 99999999999999999999999999999999999999999
Q ss_pred eecCCCCceeEeeccccccHHHHHhccCCCCCCccceEEEEEEecCccCCCCCCCCCCCccccccccCCcccccCCCCCC
Q psy11212 169 KHASGGGRELHFTDEKEVDFEDLLTNLNPKSALETHLRTHWLAIEGVQPTVPENPPPVDKSAQKLESIDPISKLGKKDKD 248 (448)
Q Consensus 169 ~~~s~~g~~l~~~~D~evdL~~ii~~~lpkiP~~~~~~aHWLaIeGvQP~IPeNp~~~~~~~~~~~~~~p~~~~~~~~~~ 248 (448)
+...++|+++||++|+||||++++++++|++|+++++++||||||||||.|||||++.+.+...... +
T Consensus 81 ~~~~~~~~~l~~~~D~eidl~~~i~~~lp~~p~~~~~~~hWLaieGvqP~ip~n~~~~~~~~~~~~~------------~ 148 (343)
T cd08050 81 RVSTGGGQELYYVEDKEIDLKDLINTPLPKVPLDVSVKAHWLAIEGVQPLIPENPPPSAIEVQKKES------------L 148 (343)
T ss_pred eeccCCCceEeeCCCCcccHHHhhhcccCCCCCccccccccceecCccCCCCCCCCchhcccccccc------------c
Confidence 7777889999999999999999999999999999999999999999999999999987654321100 0
Q ss_pred CCCCCchhHHhhhccccceeeeccccccchHHHHHHHHHHHHHhcCCCHHHHHHHHHhhhcCCCcccchhhHHHHHHhhh
Q psy11212 249 TSGKPTSAKLEKLRNVETVHVKQLATHELSVEQQLYYKEITEACVGSDEGRRFEALQSLATDPGLHEMLPRMCTFIAEGV 328 (448)
Q Consensus 249 ~~g~~~~~~~~~~~~~~~v~vkp~~~h~LSkElQlyf~~It~al~~~~e~~r~~aL~sL~tD~gL~qLlPyfv~FI~e~V 328 (448)
..+. .....+.+++.+||.++|+||+|||+||++||++|+++++..|.+||++|++|||||||||||++||+++|
T Consensus 149 ~~~~-----~~~~~~~~~~~~k~~~~~~LS~Elq~yf~~It~a~~~~~~~~r~~aL~sL~tD~gl~~LlPyf~~fI~~~v 223 (343)
T cd08050 149 ANAA-----VKRLKKDEQVLLKPLVRHVLSKELQLYFEEITEALVGSNEEKRREALQSLRTDPGLQQLLPYFVRFIAEGV 223 (343)
T ss_pred cccc-----cccccccCcceeeeccccccCHHHHHHHHHHHHHHhCCCHHHHHHHHHHhccCCCchhhhhHHHHHHHHHH
Confidence 0000 11123457889999999999999999999999999999999999999999999999999999999999999
Q ss_pred hhccccCCHHHHHHHHHHHHHHhcCCCcccccchhhHHHHHHHHhhccccCCCCCCCcchHHHHHHHHHHHHHHHHhcCC
Q psy11212 329 KVNVVQNNLALLIYLMRMVKALLDNQSLYLEKYLHELLPSILSCIVSKQLCGRPEADNHWALRDFASRLLTQVSKNFNTS 408 (448)
Q Consensus 329 ~~nl~~~nl~~L~~ll~~v~ALl~N~~l~iepYlHqLlp~lLtCll~k~l~~~~~~~~hw~LRd~AA~lL~~I~~~~~~s 408 (448)
++|+.+ |+..|.++|+|++||++||+|+||+|+|+|||++|||+|+|++|.+++.++||+|||+||++|+.||++|+++
T Consensus 224 ~~n~~~-nl~~L~~lm~~v~ALl~N~~l~le~Ylh~Lip~vltclv~~~l~~~~~~~~h~~LRd~AA~ll~~i~~~f~~~ 302 (343)
T cd08050 224 TVNLDQ-NLALLIYLMRMVRALLDNPNLHLEPYLHQLIPSVLTCLVAKQLCSRPPDDNHWALRDYAARLLAQICRKFSTS 302 (343)
T ss_pred Hhhhcc-cHHHHHHHHHHHHHHhcCCCCchHHhHHHHHHHHHHHhhhHhhcCCCCCchHHHHHHHHHHHHHHHHHHcCCC
Confidence 999944 9999999999999999999999999999999999999999999999888999999999999999999999999
Q ss_pred CcchHHHHHHHHHHHhcCCCCCccchhhhhhhhcccCCC
Q psy11212 409 TNNIQTRVTRLLSNAINDPKINFPSLYGAIAGEGLVTER 447 (448)
Q Consensus 409 y~~L~~RI~~tl~k~l~d~~~~l~t~YGAl~GL~alG~~ 447 (448)
|++|++||++||.|+|+||++++++|||||+||++||++
T Consensus 303 y~~l~~ri~~tl~k~l~d~~~~~~~~YGAi~GL~~lG~~ 341 (343)
T cd08050 303 YNTLQPRITRTLLKALLDPKKPLTTHYGAIVGLSALGPE 341 (343)
T ss_pred CCcHHHHHHHHHHHHHcCCCCCcchhhHHHHHHHHhCcc
Confidence 999999999999999999999999999999999999986
No 4
>COG5095 TAF6 Transcription initiation factor TFIID, subunit TAF6 (also component of histone acetyltransferase SAGA) [Transcription]
Probab=100.00 E-value=8e-38 Score=306.05 Aligned_cols=121 Identities=32% Similarity=0.632 Sum_probs=115.1
Q ss_pred CccCC-CCCHHHHHHHHHhc--CCCCccHHHHHHh---------HHhHHHHHHh-hcccccCCCCCCCcccccc-CCCCc
Q psy11212 2 DIYGT-SFSIESMKVIAESI--GIPNLPDDAAKEL---------SDNITYCCKV-IIQPIYGLTVSQFLPFKHA-SGGGR 67 (448)
Q Consensus 2 ~~~~~-~~~~~~~~~~a~~~--~~~~~~~~~~k~m---------~~d~~~~l~~-~~ep~~g~~~~~~~~f~~~-~~~~~ 67 (448)
||.|. +|.||++++||.|+ ||+|++|+|.||| .+||++|||. ||||||||+++.++.|..+ .|+|+
T Consensus 16 eslGi~Ni~Dd~l~alamDlEYRI~ev~qea~KFmvhSKRtvLt~dDis~ALr~lNVePLyGyd~~r~l~fs~v~~g~gq 95 (450)
T COG5095 16 ESLGISNIDDDALRALAMDLEYRIKEVCQEASKFMVHSKRTVLTIDDISYALRSLNVEPLYGYDPSRPLQFSLVFRGLGQ 95 (450)
T ss_pred HHcCCcccccHHHHHHHHhHHHHHHHHHHHHHHHhhcccceeeeHHhHHHHHHhcCCCcccCCCCCcchhHHHHHccCCc
Confidence 56776 89999999999998 9999999999999 5899999998 8999999999999999974 68899
Q ss_pred ceeeccCccccHHHHhhccCCCCCcccceeeeEEEeeccCCCCCCCCCCCCcchh
Q psy11212 68 ELHFTDEKEVDFEDLLTNLNPKSALETHLRTHWLAIEGVQPTVPENPPPVDKSAQ 122 (448)
Q Consensus 68 ~~~~~~d~e~d~~~~~~~~~p~~~~~~~~~~hwlaieg~qp~ip~np~~~~~~~q 122 (448)
.+||++||||||+++|++||||+|+.+|+++||||||||||+|||||.+.++++.
T Consensus 96 svYYlddEEvDfe~~in~PLPkVPr~~s~~sHWLAiEGVQPaIpqNP~l~d~~V~ 150 (450)
T COG5095 96 SVYYLDDEEVDFEEYINRPLPKVPRRVSIQSHWLAIEGVQPAIPQNPILLDKPVA 150 (450)
T ss_pred ceEecChhhcCHHHHhcCcCccCCcCcchhhhhhhhcccCccCCCCCCccccccc
Confidence 9999999999999999999999999999999999999999999999999999865
No 5
>KOG2549|consensus
Probab=100.00 E-value=5.6e-37 Score=319.13 Aligned_cols=125 Identities=40% Similarity=0.825 Sum_probs=120.3
Q ss_pred CccCC-CCCHHHHHHHHHhc--CCCCccHHHHHHh---------HHhHHHHHHh-hcccccCCCCCCCccccccCCCCcc
Q psy11212 2 DIYGT-SFSIESMKVIAESI--GIPNLPDDAAKEL---------SDNITYCCKV-IIQPIYGLTVSQFLPFKHASGGGRE 68 (448)
Q Consensus 2 ~~~~~-~~~~~~~~~~a~~~--~~~~~~~~~~k~m---------~~d~~~~l~~-~~ep~~g~~~~~~~~f~~~~~~~~~ 68 (448)
||+|. +|+||++++||+|| ||+||+|||+||| ++||++|||+ |+||+|||++.++++||+++|+|++
T Consensus 22 EslGi~nl~deaa~~La~dv~yrikEI~Q~aaKfm~hskR~kLtv~DV~~ALr~~nVep~yg~~s~~~i~fr~a~~~~r~ 101 (576)
T KOG2549|consen 22 ESLGITNLNDEAALLLAEDVEYRIKEIVQDAAKFMVHSKRTKLTVDDVDYALRSLNVEPLYGFGAQEIIPFRKASGGGRE 101 (576)
T ss_pred HHhCccccCHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCcCcHHHHHHHHhhcccccccCcccCceeeccccCCCCce
Confidence 67887 59999999999999 8999999999999 5999999998 8999999999999999999999999
Q ss_pred eeeccCccccHHHHhhccCCCCCcccceeeeEEEeeccCCCCCCCCCCCCcchhhhhh
Q psy11212 69 LHFTDEKEVDFEDLLTNLNPKSALETHLRTHWLAIEGVQPTVPENPPPVDKSAQKLES 126 (448)
Q Consensus 69 ~~~~~d~e~d~~~~~~~~~p~~~~~~~~~~hwlaieg~qp~ip~np~~~~~~~q~l~~ 126 (448)
+||.||+||||++++++||||+|+++++++||||||||||.|||||+|.++.+++.+.
T Consensus 102 ly~~~d~EVD~d~~i~~plpkvP~d~~i~sHWL~iEGVQP~iPeNp~p~~~~~~~~~~ 159 (576)
T KOG2549|consen 102 LYYPEDEEVDLDEIINAPLPKVPLDVSIKSHWLAIEGVQPAIPENPPPLEKNVQKKEV 159 (576)
T ss_pred eecCchhhcchhhhhcCCCCCCCccceeeeeeeeeccccccCCCCCCccccchhhhcc
Confidence 9999999999999999999999999999999999999999999999999999888763
No 6
>cd08050 TAF6 TATA Binding Protein (TBP) Associated Factor 6 (TAF6) is one of several TAFs that bind TBP and is involved in forming Transcription Factor IID (TFIID) complex. The TATA Binding Protein (TBP) Associated Factor 6 (TAF6) is one of several TAFs that bind TBP and are involved in forming Transcription Factor IID (TFIID) complex. TFIID is one of seven General Transcription Factors (GTFs) (TFIIA, TFIIB, TFIID, TFIIE, TFIIF, and TFIID) that are involved in accurate initiation of transcription by RNA polymerase II in eukaryotes. TFIID plays an important role in the recognition of promoter DNA and assembly of the pre-initiation complex. TFIID complex is composed of the TBP and at least 13 TAFs. TAFs are named after their electrophoretic mobility in polyacrylamide gels in different species. A new, unified nomenclature has been suggested for the pol II TAFs to show the relationship between TAF orthologs and paralogs. Several hypotheses are proposed for TAFs functions such as serving as
Probab=99.97 E-value=2.6e-31 Score=271.09 Aligned_cols=124 Identities=39% Similarity=0.759 Sum_probs=116.8
Q ss_pred CccCC-CCCHHHHHHHHHhc--CCCCccHHHHHHh---------HHhHHHHHHh-hcccccCCCCCCCccccccCCCCcc
Q psy11212 2 DIYGT-SFSIESMKVIAESI--GIPNLPDDAAKEL---------SDNITYCCKV-IIQPIYGLTVSQFLPFKHASGGGRE 68 (448)
Q Consensus 2 ~~~~~-~~~~~~~~~~a~~~--~~~~~~~~~~k~m---------~~d~~~~l~~-~~ep~~g~~~~~~~~f~~~~~~~~~ 68 (448)
|+.|. .++||+++.||+++ ++++|+|+|+||| ++||+.|||. |+||+|||.++++++|+...++|++
T Consensus 10 ~~~Gi~~~~~~a~~~La~~~e~~~~~i~~~A~k~~~hskR~~l~~~Di~~Al~~~n~eplyG~~~~~~~~~~~~~~~~~~ 89 (343)
T cd08050 10 ESLGIDSLSDEVAQLLAEDVEYRLREIIQEAAKFMRHSKRRKLTTSDVNHALRLRNVEPLYGFSSSEPLPFRVSTGGGQE 89 (343)
T ss_pred HHcCCCcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCHHHHHHHHHHhCCCcccCCCCCccccceeccCCCce
Confidence 56677 99999999999999 8999999999999 5999999998 8999999999999999888999999
Q ss_pred eeeccCccccHHHHhhccCCCCCcccceeeeEEEeeccCCCCCCCCCCCCcchhhhh
Q psy11212 69 LHFTDEKEVDFEDLLTNLNPKSALETHLRTHWLAIEGVQPTVPENPPPVDKSAQKLE 125 (448)
Q Consensus 69 ~~~~~d~e~d~~~~~~~~~p~~~~~~~~~~hwlaieg~qp~ip~np~~~~~~~q~l~ 125 (448)
+||.+|+||||++++++++|++|+++++++||||||||||.|||||++.....++..
T Consensus 90 l~~~~D~eidl~~~i~~~lp~~p~~~~~~~hWLaieGvqP~ip~n~~~~~~~~~~~~ 146 (343)
T cd08050 90 LYYVEDKEIDLKDLINTPLPKVPLDVSVKAHWLAIEGVQPLIPENPPPSAIEVQKKE 146 (343)
T ss_pred EeeCCCCcccHHHhhhcccCCCCCccccccccceecCccCCCCCCCCchhccccccc
Confidence 999999999999999999999999999999999999999999999999986655543
No 7
>PF07571 DUF1546: Protein of unknown function (DUF1546); InterPro: IPR011442 These proteins are associated with IPR004823 from INTERPRO in transcription initiation factor TFIID subunit 6 (TAF6).; GO: 0051090 regulation of sequence-specific DNA binding transcription factor activity, 0005634 nucleus
Probab=99.93 E-value=1.3e-26 Score=194.69 Aligned_cols=79 Identities=47% Similarity=0.802 Sum_probs=77.1
Q ss_pred HHHHhhccccCCCCCCCcchHHHHHHHHHHHHHHHHhcCCCcchHHHHHHHHHHHhcCCCCCccchhhhhhhhcccCCC
Q psy11212 369 ILSCIVSKQLCGRPEADNHWALRDFASRLLTQVSKNFNTSTNNIQTRVTRLLSNAINDPKINFPSLYGAIAGEGLVTER 447 (448)
Q Consensus 369 lLtCll~k~l~~~~~~~~hw~LRd~AA~lL~~I~~~~~~sy~~L~~RI~~tl~k~l~d~~~~l~t~YGAl~GL~alG~~ 447 (448)
||||+|+|++|++|+.++||+|||+||++|+.||++|+++|++|++||+++|+|+|+||++|+++|||||+||++||++
T Consensus 1 vltClv~k~l~~~~~~~~h~~LRd~AA~lL~~I~~~~~~~~~~L~~Ri~~tl~k~l~d~~~~~~t~YGAi~gL~~lG~~ 79 (92)
T PF07571_consen 1 VLTCLVGKQLGSRPPVDNHWALRDFAASLLAQICRKFSSSYPTLQPRITRTLLKALLDPKKPLGTHYGAIVGLSALGPE 79 (92)
T ss_pred CeEEEeeccccCCCCCcchHHHHHHHHHHHHHHHHHhccccchHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHH
Confidence 5899999999999988999999999999999999999999999999999999999999999999999999999999975
No 8
>PF02969 TAF: TATA box binding protein associated factor (TAF); InterPro: IPR004823 The TATA box binding protein associated factor (TAF) is part of the transcription initiation factor TFIID multimeric protein complex. TFIID plays a central role in mediating promoter responses to various activators and repressors. It binds tightly to TAFII-250 and directly interacts with TAFII-40. TFIID is composed of TATA binding protein (TBP)and a number of TBP-associated factors (TAFS). TAF proteins adopt a histone-like fold.; GO: 0006352 transcription initiation, DNA-dependent, 0005634 nucleus; PDB: 1TAF_B.
Probab=99.71 E-value=9.3e-18 Score=132.30 Aligned_cols=65 Identities=52% Similarity=0.719 Sum_probs=53.1
Q ss_pred CCCCHHHHHHHHHhcCCCCccHHHHHHhHHhHHHHHHhhcccccCCCCCCCccccccCCCCcceeeccCccccHHHHhhc
Q psy11212 6 TSFSIESMKVIAESIGIPNLPDDAAKELSDNITYCCKVIIQPIYGLTVSQFLPFKHASGGGRELHFTDEKEVDFEDLLTN 85 (448)
Q Consensus 6 ~~~~~~~~~~~a~~~~~~~~~~~~~k~m~~d~~~~l~~~~ep~~g~~~~~~~~f~~~~~~~~~~~~~~d~e~d~~~~~~~ 85 (448)
+.++.|++|.+||++||.++.+|+++.+++|++|+||..|+.
T Consensus 2 s~~~~esvk~iAes~Gi~~l~de~a~~La~dveyrlreiiq~-------------------------------------- 43 (66)
T PF02969_consen 2 SVFSQESVKDIAESLGISNLSDEAAKALAEDVEYRLREIIQE-------------------------------------- 43 (66)
T ss_dssp ----HHHHHHHHHHTT---B-HHHHHHHHHHHHHHHHHHHHH--------------------------------------
T ss_pred CcCCHHHHHHHHHHcCCCCCCHHHHHHHHHHHHHHHHHHHHH--------------------------------------
Confidence 458999999999999999999999999999999999998888
Q ss_pred cCCCCCcccceeeeEEEeeccCCCCCCCCCCCCcchhhhhhhhhhhhcccCCccCChhHHHHHHH
Q psy11212 86 LNPKSALETHLRTHWLAIEGVQPTVPENPPPVDKSAQKLESISFKCMSHSKRVKLLPVDIDDALN 150 (448)
Q Consensus 86 ~~p~~~~~~~~~~hwlaieg~qp~ip~np~~~~~~~q~l~~~A~Kfm~hskR~kLt~~DIn~AL~ 150 (448)
|.|||+||||++||++|||+|||
T Consensus 44 ------------------------------------------a~kfm~hskR~~Lt~~Di~~ALr 66 (66)
T PF02969_consen 44 ------------------------------------------ALKFMRHSKRTKLTTDDINSALR 66 (66)
T ss_dssp ------------------------------------------HHHHHHHTT-SSB-HHHHHHHH-
T ss_pred ------------------------------------------HHHHHHHhCCCCCCHHHHHHHhC
Confidence 99999999999999999999997
No 9
>smart00803 TAF TATA box binding protein associated factor. TAFs (TATA box binding protein associated factors) are part of the transcription initiation factor TFIID multimeric protein complex. TFIID is composed of the TATA box binding protein (TBP) and a number of TAFs. The TAFs provide binding sites for many different transcriptional activators and co-activators that modulate transcription initiation by Pol II. TAF proteins adopt a histone-like fold.
Probab=99.35 E-value=1.2e-12 Score=103.27 Aligned_cols=64 Identities=47% Similarity=0.652 Sum_probs=61.2
Q ss_pred CCCHHHHHHHHHhcCCCCccHHHHHHhHHhHHHHHHhhcccccCCCCCCCccccccCCCCcceeeccCccccHHHHhhcc
Q psy11212 7 SFSIESMKVIAESIGIPNLPDDAAKELSDNITYCCKVIIQPIYGLTVSQFLPFKHASGGGRELHFTDEKEVDFEDLLTNL 86 (448)
Q Consensus 7 ~~~~~~~~~~a~~~~~~~~~~~~~k~m~~d~~~~l~~~~ep~~g~~~~~~~~f~~~~~~~~~~~~~~d~e~d~~~~~~~~ 86 (448)
.++.++++.+|++.|++.+.+|+++.|++|++|.++..++.
T Consensus 2 ~~p~~~i~ria~~~Gi~ris~~a~~~l~~~~e~rl~~i~~~--------------------------------------- 42 (65)
T smart00803 2 WLPKETIKDVAESLGIGNLSDEAAKLLAEDVEYRIKEIVQE--------------------------------------- 42 (65)
T ss_pred CCCHHHHHHHHHHCCCccccHHHHHHHHHHHHHHHHHHHHH---------------------------------------
Confidence 47889999999999999999999999999999999997777
Q ss_pred CCCCCcccceeeeEEEeeccCCCCCCCCCCCCcchhhhhhhhhhhhcccCCccCChhHHHHHHH
Q psy11212 87 NPKSALETHLRTHWLAIEGVQPTVPENPPPVDKSAQKLESISFKCMSHSKRVKLLPVDIDDALN 150 (448)
Q Consensus 87 ~p~~~~~~~~~~hwlaieg~qp~ip~np~~~~~~~q~l~~~A~Kfm~hskR~kLt~~DIn~AL~ 150 (448)
|.|||+||||++||++||++|||
T Consensus 43 -----------------------------------------A~k~~~hakRktlt~~DI~~Alk 65 (65)
T smart00803 43 -----------------------------------------ALKFMRHSKRTTLTTSDIDSALR 65 (65)
T ss_pred -----------------------------------------HHHHHHHhCCCeecHHHHHHHhC
Confidence 99999999999999999999996
No 10
>PF02969 TAF: TATA box binding protein associated factor (TAF); InterPro: IPR004823 The TATA box binding protein associated factor (TAF) is part of the transcription initiation factor TFIID multimeric protein complex. TFIID plays a central role in mediating promoter responses to various activators and repressors. It binds tightly to TAFII-250 and directly interacts with TAFII-40. TFIID is composed of TATA binding protein (TBP)and a number of TBP-associated factors (TAFS). TAF proteins adopt a histone-like fold.; GO: 0006352 transcription initiation, DNA-dependent, 0005634 nucleus; PDB: 1TAF_B.
Probab=98.68 E-value=4.3e-09 Score=83.23 Aligned_cols=41 Identities=24% Similarity=0.404 Sum_probs=35.1
Q ss_pred CccCC-CCCHHHHHHHHHhc--CCCCccHHHHHHh---------HHhHHHHHH
Q psy11212 2 DIYGT-SFSIESMKVIAESI--GIPNLPDDAAKEL---------SDNITYCCK 42 (448)
Q Consensus 2 ~~~~~-~~~~~~~~~~a~~~--~~~~~~~~~~k~m---------~~d~~~~l~ 42 (448)
||.|. +++||+++.||+|+ ||++|+|+|.||| ++|||+|||
T Consensus 14 es~Gi~~l~de~a~~La~dveyrlreiiq~a~kfm~hskR~~Lt~~Di~~ALr 66 (66)
T PF02969_consen 14 ESLGISNLSDEAAKALAEDVEYRLREIIQEALKFMRHSKRTKLTTDDINSALR 66 (66)
T ss_dssp HHTT---B-HHHHHHHHHHHHHHHHHHHHHHHHHHHHTT-SSB-HHHHHHHH-
T ss_pred HHcCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCHHHHHHHhC
Confidence 67887 99999999999999 9999999999999 589999997
No 11
>cd00076 H4 Histone H4, one of the four histones, along with H2A, H2B and H3, which forms the eukaryotic nucleosome core; along with H3, it plays a central role in nucleosome formation; histones bind to DNA and wrap the genetic material into "beads on a string" in which DNA (the string) is wrapped around small blobs of histones (the beads) at regular intervals; play a role in the inheritance of specialized chromosome structures and the control of gene activity; defects in the establishment of proper chromosome structure by histones may activate or silence genes aberrantly and thus lead to disease; the sequence of histone H4 has remained almost invariant in more than 2 billion years of evolution
Probab=98.39 E-value=8.6e-07 Score=73.58 Aligned_cols=73 Identities=16% Similarity=0.323 Sum_probs=64.0
Q ss_pred CCCHHHHHHHHHhcCCCCccHHHHHHhHHhHHHHHHhhcccccCCCCCCCccccccCCCCcceeeccCccccHHHHhhcc
Q psy11212 7 SFSIESMKVIAESIGIPNLPDDAAKELSDNITYCCKVIIQPIYGLTVSQFLPFKHASGGGRELHFTDEKEVDFEDLLTNL 86 (448)
Q Consensus 7 ~~~~~~~~~~a~~~~~~~~~~~~~k~m~~d~~~~l~~~~ep~~g~~~~~~~~f~~~~~~~~~~~~~~d~e~d~~~~~~~~ 86 (448)
.|+.-+++-||..-|++-|..|+.+.+.+-++.-|...+.-
T Consensus 13 gi~k~~I~RLarr~GvkRIS~d~y~e~~~~l~~~l~~I~~d--------------------------------------- 53 (85)
T cd00076 13 GITKPAIRRLARRGGVKRISGGVYDEVRNVLKSYLEDVIRD--------------------------------------- 53 (85)
T ss_pred cCCHHHHHHHHHHcCcchhhHHHHHHHHHHHHHHHHHHHHH---------------------------------------
Confidence 48888999999999999999999888877766666554333
Q ss_pred CCCCCcccceeeeEEEeeccCCCCCCCCCCCCcchhhhhhhhhhhhcccCCccCChhHHHHHHHhcCCCCCcCC
Q psy11212 87 NPKSALETHLRTHWLAIEGVQPTVPENPPPVDKSAQKLESISFKCMSHSKRVKLLPVDIDDALNMKNIEPIYGL 160 (448)
Q Consensus 87 ~p~~~~~~~~~~hwlaieg~qp~ip~np~~~~~~~q~l~~~A~Kfm~hskR~kLt~~DIn~AL~~~niEPLyGy 160 (448)
|.+|+.|++|+++|++||..||+..| .|+|||
T Consensus 54 -----------------------------------------av~ya~Ha~RKTVt~~DV~~alkr~g-~~~ygf 85 (85)
T cd00076 54 -----------------------------------------AVTYTEHAKRKTVTAMDVVYALKRQG-RTLYGY 85 (85)
T ss_pred -----------------------------------------HHHHHHhcCCCcCcHHHHHHHHHHCC-CCccCC
Confidence 99999999999999999999999999 999998
No 12
>PLN00035 histone H4; Provisional
Probab=98.32 E-value=1.7e-06 Score=74.14 Aligned_cols=74 Identities=15% Similarity=0.273 Sum_probs=65.3
Q ss_pred CCCHHHHHHHHHhcCCCCccHHHHHHhHHhHHHHHHhhcccccCCCCCCCccccccCCCCcceeeccCccccHHHHhhcc
Q psy11212 7 SFSIESMKVIAESIGIPNLPDDAAKELSDNITYCCKVIIQPIYGLTVSQFLPFKHASGGGRELHFTDEKEVDFEDLLTNL 86 (448)
Q Consensus 7 ~~~~~~~~~~a~~~~~~~~~~~~~k~m~~d~~~~l~~~~ep~~g~~~~~~~~f~~~~~~~~~~~~~~d~e~d~~~~~~~~ 86 (448)
.||.-+++-||..-|++-|..++.+.+.+-++.-|...+.-
T Consensus 29 ~ipk~~IrRLARr~GvkRIS~~ay~elr~vle~~l~~I~~d--------------------------------------- 69 (103)
T PLN00035 29 GITKPAIRRLARRGGVKRISGLIYEETRGVLKIFLENVIRD--------------------------------------- 69 (103)
T ss_pred cCCHHHHHHHHHHcCcccchHHHHHHHHHHHHHHHHHHHHH---------------------------------------
Confidence 38888999999999999999999988887777666654333
Q ss_pred CCCCCcccceeeeEEEeeccCCCCCCCCCCCCcchhhhhhhhhhhhcccCCccCChhHHHHHHHhcCCCCCcCCC
Q psy11212 87 NPKSALETHLRTHWLAIEGVQPTVPENPPPVDKSAQKLESISFKCMSHSKRVKLLPVDIDDALNMKNIEPIYGLT 161 (448)
Q Consensus 87 ~p~~~~~~~~~~hwlaieg~qp~ip~np~~~~~~~q~l~~~A~Kfm~hskR~kLt~~DIn~AL~~~niEPLyGy~ 161 (448)
|.+|+.|++|+++|++||..||+..|- |||||.
T Consensus 70 -----------------------------------------av~ya~HA~RKTV~~~DV~~Alkr~g~-~lyGf~ 102 (103)
T PLN00035 70 -----------------------------------------AVTYTEHARRKTVTAMDVVYALKRQGR-TLYGFG 102 (103)
T ss_pred -----------------------------------------HHHHHHhcCCCcCcHHHHHHHHHHcCC-cCCCCC
Confidence 899999999999999999999999999 999996
No 13
>PTZ00015 histone H4; Provisional
Probab=98.26 E-value=2.4e-06 Score=73.10 Aligned_cols=73 Identities=15% Similarity=0.278 Sum_probs=64.5
Q ss_pred CCCHHHHHHHHHhcCCCCccHHHHHHhHHhHHHHHHhhcccccCCCCCCCccccccCCCCcceeeccCccccHHHHhhcc
Q psy11212 7 SFSIESMKVIAESIGIPNLPDDAAKELSDNITYCCKVIIQPIYGLTVSQFLPFKHASGGGRELHFTDEKEVDFEDLLTNL 86 (448)
Q Consensus 7 ~~~~~~~~~~a~~~~~~~~~~~~~k~m~~d~~~~l~~~~ep~~g~~~~~~~~f~~~~~~~~~~~~~~d~e~d~~~~~~~~ 86 (448)
.|+..+++-||..-|++-|..|+.+.+.+-++.-|...+.-
T Consensus 30 gI~k~~IrRLarr~GvkRIS~d~y~e~r~vle~~l~~I~rd--------------------------------------- 70 (102)
T PTZ00015 30 GITKGAIRRLARRGGVKRISGDIYEEVRGVLKAFLENVVRD--------------------------------------- 70 (102)
T ss_pred CCCHHHHHHHHHHcCCccchHHHHHHHHHHHHHHHHHHHHH---------------------------------------
Confidence 48888999999999999999999988887777666654333
Q ss_pred CCCCCcccceeeeEEEeeccCCCCCCCCCCCCcchhhhhhhhhhhhcccCCccCChhHHHHHHHhcCCCCCcCC
Q psy11212 87 NPKSALETHLRTHWLAIEGVQPTVPENPPPVDKSAQKLESISFKCMSHSKRVKLLPVDIDDALNMKNIEPIYGL 160 (448)
Q Consensus 87 ~p~~~~~~~~~~hwlaieg~qp~ip~np~~~~~~~q~l~~~A~Kfm~hskR~kLt~~DIn~AL~~~niEPLyGy 160 (448)
|..|+.|++|+++|++||..||+..|- |+|||
T Consensus 71 -----------------------------------------av~~aeHA~RKTVt~~DV~~AlKr~g~-~~ygf 102 (102)
T PTZ00015 71 -----------------------------------------STAYTEYARRKTVTAMDVVYALKRQGR-TLYGF 102 (102)
T ss_pred -----------------------------------------HHHHHHhcCCCcccHHHHHHHHHhcCC-CCCCC
Confidence 899999999999999999999999999 99998
No 14
>smart00803 TAF TATA box binding protein associated factor. TAFs (TATA box binding protein associated factors) are part of the transcription initiation factor TFIID multimeric protein complex. TFIID is composed of the TATA box binding protein (TBP) and a number of TAFs. The TAFs provide binding sites for many different transcriptional activators and co-activators that modulate transcription initiation by Pol II. TAF proteins adopt a histone-like fold.
Probab=97.77 E-value=1e-05 Score=63.91 Aligned_cols=41 Identities=24% Similarity=0.398 Sum_probs=36.6
Q ss_pred CccCC-CCCHHHHHHHHHhc--CCCCccHHHHHHh---------HHhHHHHHH
Q psy11212 2 DIYGT-SFSIESMKVIAESI--GIPNLPDDAAKEL---------SDNITYCCK 42 (448)
Q Consensus 2 ~~~~~-~~~~~~~~~~a~~~--~~~~~~~~~~k~m---------~~d~~~~l~ 42 (448)
++.|. .+|+|+++.||+++ ++++|+|+|.||| ++||++|||
T Consensus 13 ~~~Gi~ris~~a~~~l~~~~e~rl~~i~~~A~k~~~hakRktlt~~DI~~Alk 65 (65)
T smart00803 13 ESLGIGNLSDEAAKLLAEDVEYRIKEIVQEALKFMRHSKRTTLTTSDIDSALR 65 (65)
T ss_pred HHCCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCeecHHHHHHHhC
Confidence 34555 59999999999998 8999999999999 589999996
No 15
>PF12755 Vac14_Fab1_bd: Vacuolar 14 Fab1-binding region
Probab=96.99 E-value=0.0021 Score=54.67 Aligned_cols=82 Identities=29% Similarity=0.374 Sum_probs=65.3
Q ss_pred HHHHHHHHHHHhcCCCcccccchhhHHHHHHHHhhccccCCCCCCCcchHHHHHHHHHHHHHHHHhcCCCcchHHHHHHH
Q psy11212 340 LIYLMRMVKALLDNQSLYLEKYLHELLPSILSCIVSKQLCGRPEADNHWALRDFASRLLTQVSKNFNTSTNNIQTRVTRL 419 (448)
Q Consensus 340 L~~ll~~v~ALl~N~~l~iepYlHqLlp~lLtCll~k~l~~~~~~~~hw~LRd~AA~lL~~I~~~~~~sy~~L~~RI~~t 419 (448)
|+.+-.++.||-.. ++.|+-.|+|+||.|+ .+++|++|.+|.+.|..|++.+....-.--..|...
T Consensus 7 li~Laa~ai~l~~~----~~~~l~~Il~pVL~~~----------~D~d~rVRy~AcEaL~ni~k~~~~~~l~~f~~IF~~ 72 (97)
T PF12755_consen 7 LIGLAAVAIALGKD----ISKYLDEILPPVLKCF----------DDQDSRVRYYACEALYNISKVARGEILPYFNEIFDA 72 (97)
T ss_pred HHHHHHHHHHchHh----HHHHHHHHHHHHHHHc----------CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44555566666665 9999999999999998 478999999999999999999876544446778899
Q ss_pred HHHHhcCCCCCccchhhh
Q psy11212 420 LSNAINDPKINFPSLYGA 437 (448)
Q Consensus 420 l~k~l~d~~~~l~t~YGA 437 (448)
|.|.+.|++.+ ...||
T Consensus 73 L~kl~~D~d~~--Vr~~a 88 (97)
T PF12755_consen 73 LCKLSADPDEN--VRSAA 88 (97)
T ss_pred HHHHHcCCchh--HHHHH
Confidence 99999998644 45554
No 16
>cd07979 TAF9 TATA Binding Protein (TBP) Associated Factor 9 (TAF9) is one of several TAFs that bind TBP and is involved in forming Transcription Factor IID (TFIID) complex. The TATA Binding Protein (TBP) Associated Factor 9 (TAF9) is one of several TAFs that bind TBP and are involved in forming the TFIID complex. TFIID is one of seven General Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE, TFIIF, and TFIID) that are involved in accurate initiation of transcription by RNA polymerase II in eukaryotes. TFIID plays an important role in the recognition of promoter DNA and assembly of the pre-initiation complex. The TFIID complex is composed of the TBP and at least 13 TAFs. TAFs from various species were originally named by their predicted molecular weight or their electrophoretic mobility in polyacrylamide gels. A new, unified nomenclature for the pol II TAFs has been suggested to show the relationship between TAFs orthologs and paralogs. Human TAF9 has a paralogue gene (TAF9L) whi
Probab=96.35 E-value=0.012 Score=51.76 Aligned_cols=63 Identities=11% Similarity=0.069 Sum_probs=45.7
Q ss_pred hhhhhhhhhcccCCccCChhHHHHHHHhcCCCCCcCCCCCCccCceecCCCCceeEeeccccccHHHHHhc-cCCCCCCc
Q psy11212 124 LESISFKCMSHSKRVKLLPVDIDDALNMKNIEPIYGLTVSQFLPFKHASGGGRELHFTDEKEVDFEDLLTN-LNPKSALE 202 (448)
Q Consensus 124 l~~~A~Kfm~hskR~kLt~~DIn~AL~~~niEPLyGy~s~~~~~f~~~s~~g~~l~~~~D~evdL~~ii~~-~lpkiP~~ 202 (448)
+..+|..|.+|++|++++.+||..|++.+.- |.|.++. ..|.-+++..-.|+ |||+++..
T Consensus 38 il~dA~~~a~hA~r~tV~~eDV~lAi~~r~~---~~f~~~p----------------~~~~l~~~a~~~N~~pLP~~~~~ 98 (117)
T cd07979 38 VLDDAKVYSEHAGKANIDADDVKLAIQSRVD---YSFTSPP----------------PRDFLLELAREKNSIPLPPIPPS 98 (117)
T ss_pred HHHHHHHHHHHcCCCCCCHHHHHHHHHHHhc---cCCCCCC----------------cHHHHHHHHHHhccCCCCCCCCC
Confidence 3344999999999999999999999999876 5555431 12334555555554 88988877
Q ss_pred cce
Q psy11212 203 THL 205 (448)
Q Consensus 203 ~~~ 205 (448)
.++
T Consensus 99 ~g~ 101 (117)
T cd07979 99 CGL 101 (117)
T ss_pred CCc
Confidence 643
No 17
>cd00076 H4 Histone H4, one of the four histones, along with H2A, H2B and H3, which forms the eukaryotic nucleosome core; along with H3, it plays a central role in nucleosome formation; histones bind to DNA and wrap the genetic material into "beads on a string" in which DNA (the string) is wrapped around small blobs of histones (the beads) at regular intervals; play a role in the inheritance of specialized chromosome structures and the control of gene activity; defects in the establishment of proper chromosome structure by histones may activate or silence genes aberrantly and thus lead to disease; the sequence of histone H4 has remained almost invariant in more than 2 billion years of evolution
Probab=95.22 E-value=0.013 Score=48.86 Aligned_cols=47 Identities=15% Similarity=0.169 Sum_probs=41.0
Q ss_pred CCCCCHHHHHHHHHhc--CCCCccHHHHHHh---------HHhHHHHHHhhcccccCC
Q psy11212 5 GTSFSIESMKVIAESI--GIPNLPDDAAKEL---------SDNITYCCKVIIQPIYGL 51 (448)
Q Consensus 5 ~~~~~~~~~~~~a~~~--~~~~~~~~~~k~m---------~~d~~~~l~~~~ep~~g~ 51 (448)
..-+|+|+...+.+-+ -+.+|+.||+++. ++||.+|||..=.|||||
T Consensus 28 vkRIS~d~y~e~~~~l~~~l~~I~~dav~ya~Ha~RKTVt~~DV~~alkr~g~~~ygf 85 (85)
T cd00076 28 VKRISGGVYDEVRNVLKSYLEDVIRDAVTYTEHAKRKTVTAMDVVYALKRQGRTLYGY 85 (85)
T ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcCcHHHHHHHHHHCCCCccCC
Confidence 4468999999999877 5889999999997 589999999844999998
No 18
>PLN00035 histone H4; Provisional
Probab=95.02 E-value=0.017 Score=49.69 Aligned_cols=47 Identities=17% Similarity=0.149 Sum_probs=40.9
Q ss_pred CCCCHHHHHHHHHhc--CCCCccHHHHHHh---------HHhHHHHHHhhcccccCCC
Q psy11212 6 TSFSIESMKVIAESI--GIPNLPDDAAKEL---------SDNITYCCKVIIQPIYGLT 52 (448)
Q Consensus 6 ~~~~~~~~~~~a~~~--~~~~~~~~~~k~m---------~~d~~~~l~~~~ep~~g~~ 52 (448)
.-+|+++...+.+-+ -+.+|+.||+++. ++||.+|||..=-|||||+
T Consensus 45 kRIS~~ay~elr~vle~~l~~I~~dav~ya~HA~RKTV~~~DV~~Alkr~g~~lyGf~ 102 (103)
T PLN00035 45 KRISGLIYEETRGVLKIFLENVIRDAVTYTEHARRKTVTAMDVVYALKRQGRTLYGFG 102 (103)
T ss_pred ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcCcHHHHHHHHHHcCCcCCCCC
Confidence 468999999999988 5889999999998 5899999998334999997
No 19
>COG2036 HHT1 Histones H3 and H4 [Chromatin structure and dynamics]
Probab=94.83 E-value=0.037 Score=46.69 Aligned_cols=72 Identities=18% Similarity=0.199 Sum_probs=56.8
Q ss_pred cCCCCCHHHHHHHHHhcCCCCccHHHHHHhHHhHHHHHHhhcccccCCCCCCCccccccCCCCcceeeccCccccHHHHh
Q psy11212 4 YGTSFSIESMKVIAESIGIPNLPDDAAKELSDNITYCCKVIIQPIYGLTVSQFLPFKHASGGGRELHFTDEKEVDFEDLL 83 (448)
Q Consensus 4 ~~~~~~~~~~~~~a~~~~~~~~~~~~~k~m~~d~~~~l~~~~ep~~g~~~~~~~~f~~~~~~~~~~~~~~d~e~d~~~~~ 83 (448)
....|+.-.++-++...+...+..+|.+.|++=++.-+....|-
T Consensus 16 ~~~~Lp~apv~Ri~r~~~~~Rvs~~A~~~l~~~~e~~~~~i~~~------------------------------------ 59 (91)
T COG2036 16 TDLLLPKAPVRRILRKAGAERVSSSAIEELQEALEEYLEEIAED------------------------------------ 59 (91)
T ss_pred hhhhcCchHHHHHHHHHhHHHhhHHHHHHHHHHHHHHHHHHHHH------------------------------------
Confidence 33456666677777777777788888888887777666654333
Q ss_pred hccCCCCCcccceeeeEEEeeccCCCCCCCCCCCCcchhhhhhhhhhhhcccCCccCChhHHHHHHHhcCCC
Q psy11212 84 TNLNPKSALETHLRTHWLAIEGVQPTVPENPPPVDKSAQKLESISFKCMSHSKRVKLLPVDIDDALNMKNIE 155 (448)
Q Consensus 84 ~~~~p~~~~~~~~~~hwlaieg~qp~ip~np~~~~~~~q~l~~~A~Kfm~hskR~kLt~~DIn~AL~~~niE 155 (448)
|..+..|.||++.+++||..|++.++..
T Consensus 60 --------------------------------------------A~~~A~ha~RKTV~~~DI~la~~~~~~~ 87 (91)
T COG2036 60 --------------------------------------------AVELAEHAKRKTVKAEDIKLALKRLGRR 87 (91)
T ss_pred --------------------------------------------HHHHHHHcCCCeecHHHHHHHHHHhccc
Confidence 9999999999999999999999998865
No 20
>PF00125 Histone: Core histone H2A/H2B/H3/H4 histone h2a signature histone h2b signature histone h3 signature histone h4 signature; InterPro: IPR007125 The core histones together with some other DNA binding proteins appear to form a superfamily defined by a common fold and distant sequence similarities [, ]. Some proteins contain local homology domains related to the histone fold [].; GO: 0003677 DNA binding; PDB: 2YFW_D 2YFV_B 1U35_H 2F8N_D 2PYO_D 2NQB_D 3AN2_C 3AZJ_C 3AV1_G 3AZM_G ....
Probab=94.67 E-value=0.05 Score=43.34 Aligned_cols=24 Identities=29% Similarity=0.382 Sum_probs=23.0
Q ss_pred hhhhhcccCCccCChhHHHHHHHh
Q psy11212 128 SFKCMSHSKRVKLLPVDIDDALNM 151 (448)
Q Consensus 128 A~Kfm~hskR~kLt~~DIn~AL~~ 151 (448)
|..+++|+||++|++.||+.|++.
T Consensus 51 A~~~a~~~kR~tI~~~DI~~A~r~ 74 (75)
T PF00125_consen 51 AGNLARHAKRKTITPRDIQLAVRI 74 (75)
T ss_dssp HHHHHHHTTBSEEGHHHHHHHHHH
T ss_pred HHHHHhhcCCcEecHHHHHHHHhc
Confidence 999999999999999999999985
No 21
>PTZ00015 histone H4; Provisional
Probab=94.41 E-value=0.028 Score=48.42 Aligned_cols=46 Identities=15% Similarity=0.223 Sum_probs=39.9
Q ss_pred CCCCHHHHHHHHHhc--CCCCccHHHHHHh---------HHhHHHHHHhhcccccCC
Q psy11212 6 TSFSIESMKVIAESI--GIPNLPDDAAKEL---------SDNITYCCKVIIQPIYGL 51 (448)
Q Consensus 6 ~~~~~~~~~~~a~~~--~~~~~~~~~~k~m---------~~d~~~~l~~~~ep~~g~ 51 (448)
.-+|+|+...+.+-+ -+.+|+.||..+. ++||.+|||..=-|+|||
T Consensus 46 kRIS~d~y~e~r~vle~~l~~I~rdav~~aeHA~RKTVt~~DV~~AlKr~g~~~ygf 102 (102)
T PTZ00015 46 KRISGDIYEEVRGVLKAFLENVVRDSTAYTEYARRKTVTAMDVVYALKRQGRTLYGF 102 (102)
T ss_pred ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccHHHHHHHHHhcCCCCCCC
Confidence 368999999998887 5889999999998 589999999844599998
No 22
>PF02985 HEAT: HEAT repeat; InterPro: IPR000357 The HEAT repeat is a tandemly repeated, 37-47 amino acid long module occurring in a number of cytoplasmic proteins, including the four name-giving proteins huntingtin, elongation factor 3 (EF3), the 65 Kd alpha regulatory subunit of protein phosphatase 2A (PP2A) and the yeast PI3-kinase TOR1 []. Arrays of HEAT repeats consists of 3 to 36 units forming a rod-like helical structure and appear to function as protein-protein interaction surfaces. It has been noted that many HEAT repeat-containing proteins are involved in intracellular transport processes. In the crystal structure of PP2A PR65/A [], the HEAT repeats consist of pairs of antiparallel alpha helices [].; GO: 0005515 protein binding; PDB: 3FGA_A 2PF4_C 2IAE_A 2BKU_D 3EA5_B 3ND2_A 2BPT_A 2NYL_A 2NPP_D 2PKG_B ....
Probab=93.81 E-value=0.12 Score=34.45 Aligned_cols=30 Identities=27% Similarity=0.414 Sum_probs=24.9
Q ss_pred HHHHHHHHhhccccCCCCCCCcchHHHHHHHHHHHHHHHH
Q psy11212 365 LLPSILSCIVSKQLCGRPEADNHWALRDFASRLLTQVSKN 404 (448)
Q Consensus 365 Llp~lLtCll~k~l~~~~~~~~hw~LRd~AA~lL~~I~~~ 404 (448)
|+|.++.++ .|.+|.+|+.|+..|+.|++.
T Consensus 1 llp~l~~~l----------~D~~~~VR~~a~~~l~~i~~~ 30 (31)
T PF02985_consen 1 LLPILLQLL----------NDPSPEVRQAAAECLGAIAEH 30 (31)
T ss_dssp HHHHHHHHH----------T-SSHHHHHHHHHHHHHHHHT
T ss_pred CHHHHHHHc----------CCCCHHHHHHHHHHHHHHHhh
Confidence 578888777 368999999999999999875
No 23
>smart00417 H4 Histone H4.
Probab=92.98 E-value=0.15 Score=41.39 Aligned_cols=37 Identities=8% Similarity=0.106 Sum_probs=27.6
Q ss_pred CCCHHHHHHHHHhcCCCCccHHHHHHhHHhHHHHHHh
Q psy11212 7 SFSIESMKVIAESIGIPNLPDDAAKELSDNITYCCKV 43 (448)
Q Consensus 7 ~~~~~~~~~~a~~~~~~~~~~~~~k~m~~d~~~~l~~ 43 (448)
.|+.-+++-||..-|++-|..++...+.+=++.-|..
T Consensus 13 gI~k~~IrRLaRr~GvkRIS~~~y~elr~vle~~l~~ 49 (74)
T smart00417 13 GITKPAIRRLARRGGVKRISGLIYDETRNVLKSFLEN 49 (74)
T ss_pred CCCHHHHHHHHHHcCcchhhHHHHHHHHHHHHHHHHH
Confidence 4777889999999999998888877766555444443
No 24
>smart00576 BTP Bromodomain transcription factors and PHD domain containing proteins. subdomain of archael histone-like transcription factors
Probab=92.54 E-value=0.37 Score=39.03 Aligned_cols=32 Identities=19% Similarity=0.045 Sum_probs=28.0
Q ss_pred hhhhhhhhhcccCCccCChhHHHHHHHhcCCC
Q psy11212 124 LESISFKCMSHSKRVKLLPVDIDDALNMKNIE 155 (448)
Q Consensus 124 l~~~A~Kfm~hskR~kLt~~DIn~AL~~~niE 155 (448)
+-+.+.+++.|++|+..+..||..||+-.|+.
T Consensus 43 l~~~~~~~a~~agR~~~~~~Dv~~Al~~~gi~ 74 (77)
T smart00576 43 LGRTAHSYAELAGRTEPNLGDVVLALENLGIS 74 (77)
T ss_pred HHHHHHHHHHHcCCCCCCHHHHHHHHHHhCcc
Confidence 33448999999999999999999999998873
No 25
>PF13513 HEAT_EZ: HEAT-like repeat; PDB: 2Z5J_A 2OT8_B 2Z5O_A 2H4M_A 2QMR_A 1QBK_B 2Z5M_A 2Z5K_A 2Z5N_A 1GCJ_B ....
Probab=91.06 E-value=0.064 Score=39.89 Aligned_cols=55 Identities=13% Similarity=0.003 Sum_probs=45.6
Q ss_pred hHHHHHHHHHHHHHHHHhcCCCcchHHHHHHHHHHHhcCCCCCccchhhhhhhhccc
Q psy11212 388 WALRDFASRLLTQVSKNFNTSTNNIQTRVTRLLSNAINDPKINFPSLYGAIAGEGLV 444 (448)
Q Consensus 388 w~LRd~AA~lL~~I~~~~~~sy~~L~~RI~~tl~k~l~d~~~~l~t~YGAl~GL~al 444 (448)
|.+|.-|+..|+.++...........+.++..|.+.|.|++. .+.-.|..+|..|
T Consensus 1 p~vR~~A~~aLg~l~~~~~~~~~~~~~~~~~~L~~~L~d~~~--~VR~~A~~aLg~l 55 (55)
T PF13513_consen 1 PRVRRAAAWALGRLAEGCPELLQPYLPELLPALIPLLQDDDD--SVRAAAAWALGNL 55 (55)
T ss_dssp HHHHHHHHHHHHCTTTTTHHHHHHHHHHHHHHHHHHTTSSSH--HHHHHHHHHHHCH
T ss_pred CHHHHHHHHHHhhHhcccHHHHHHHHHHHHHHHHHHHcCCCH--HHHHHHHHHHhcC
Confidence 899999999999887666666666888999999999988654 7788888888654
No 26
>PF12348 CLASP_N: CLASP N terminal; InterPro: IPR024395 This domain is found in the N-terminal region of CLIP-associated proteins (CLASPs), which are widely conserved microtubule plus-end-tracking proteins that regulate the stability of dynamic microtubules [, ]. The domain is also found in other proteins involved in microtubule binding, including STU1, MOR1 and spindle pole body component Alp14.; PDB: 2QK2_A.
Probab=90.09 E-value=0.48 Score=45.12 Aligned_cols=102 Identities=19% Similarity=0.214 Sum_probs=59.8
Q ss_pred CCHHHHHHHHHhhhcCCCcc---cchhhHHHHHH---hhhhhccccCCHHHHHHHHHHHHHHhcCCCcccccchhhHHHH
Q psy11212 295 SDEGRRFEALQSLATDPGLH---EMLPRMCTFIA---EGVKVNVVQNNLALLIYLMRMVKALLDNQSLYLEKYLHELLPS 368 (448)
Q Consensus 295 ~~e~~r~~aL~sL~tD~gL~---qLlPyfv~FI~---e~V~~nl~~~nl~~L~~ll~~v~ALl~N~~l~iepYlHqLlp~ 368 (448)
.+=+.|.+||..|+.--.-+ ...+-|+..+. ..+...+...+..+....+.++..|..+-.-.+++|+..++|+
T Consensus 19 ~~W~~r~~al~~L~~l~~~~~~~~~~~~~~~~l~~~~~~i~~~l~d~Rs~v~~~A~~~l~~l~~~l~~~~~~~~~~~l~~ 98 (228)
T PF12348_consen 19 SDWEERVEALQKLRSLIKGNAPEDFPPDFVECLRQLLDAIIKQLSDLRSKVSKTACQLLSDLARQLGSHFEPYADILLPP 98 (228)
T ss_dssp SSHHHHHHHHHHHHHHHHH-B-----HHHHHHHH---HHHHH-S-HH---HHHHHHHHHHHHHHHHGGGGHHHHHHHHHH
T ss_pred cCHHHHHHHHHHHHHHHHcCCccccHHHHHHHHHHhHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHH
Confidence 34446777777766532211 22333333333 4444444333444555666777777766666799999999999
Q ss_pred HHHHhhccccCCCCCCCcchHHHHHHHHHHHHHHHHhc
Q psy11212 369 ILSCIVSKQLCGRPEADNHWALRDFASRLLTQVSKNFN 406 (448)
Q Consensus 369 lLtCll~k~l~~~~~~~~hw~LRd~AA~lL~~I~~~~~ 406 (448)
++.++-.+ .=-+|+.|..+|..|+..++
T Consensus 99 Ll~~~~~~----------~~~i~~~a~~~L~~i~~~~~ 126 (228)
T PF12348_consen 99 LLKKLGDS----------KKFIREAANNALDAIIESCS 126 (228)
T ss_dssp HHHGGG-------------HHHHHHHHHHHHHHHTTS-
T ss_pred HHHHHccc----------cHHHHHHHHHHHHHHHHHCC
Confidence 99887442 11589999999998888765
No 27
>KOG3467|consensus
Probab=89.96 E-value=0.23 Score=41.36 Aligned_cols=41 Identities=20% Similarity=0.285 Sum_probs=36.2
Q ss_pred chhhhhhhhhhhhcccCCccCChhHHHHHHHhcCCCCCcCCC
Q psy11212 120 SAQKLESISFKCMSHSKRVKLLPVDIDDALNMKNIEPIYGLT 161 (448)
Q Consensus 120 ~~q~l~~~A~Kfm~hskR~kLt~~DIn~AL~~~niEPLyGy~ 161 (448)
+..++.+.|.-+..|.||++.|..||=++|+-.++. +|||.
T Consensus 62 fl~n~i~~A~~yt~HAKRKTvT~~dvv~~LKR~G~~-~~g~~ 102 (103)
T KOG3467|consen 62 FLENVIRDAVTYTEHAKRKTVTAMDVVYALKRQGRT-LYGFG 102 (103)
T ss_pred HHHHHHHHHHHHHhhhhhceeeHHHHHHHHHHcCce-eeccC
Confidence 556788889999999999999999999999998874 78884
No 28
>cd07981 TAF12 TATA Binding Protein (TBP) Associated Factor 12 (TAF12) is one of several TAFs that bind TBP and is involved in forming Transcription Factor IID (TFIID) complex. The TATA Binding Protein (TBP) Associated Factor 12 (TAF12) is one of several TAFs that bind TBP and are involved in forming the TFIID complex. TFIID is one of the seven General Transcription Factors (GTFs) (TFIIA, TFIIB, TFIID, TFIIE, TFIIF, and TFIID) that are involved in accurate initiation of transcription by RNA polymerase II in eukaryotes. TFIID plays an important role in the recognition of promoter DNA and assembly of the pre-initiation complex. TFIID complex is composed of the TBP and at least 13 TAFs. TAFs are named after their electrophoretic mobility in polyacrylamide gels in different species. A new, unified nomenclature has been suggested for the pol II TAFs to show the relationship between TAF orthologs and paralogs. Several hypotheses are proposed for TAFs function such as serving as activator-bind
Probab=87.11 E-value=0.48 Score=38.06 Aligned_cols=34 Identities=15% Similarity=0.081 Sum_probs=31.0
Q ss_pred cchhhhhhhhhhhhcccCCccCChhHHHHHHHhc
Q psy11212 119 KSAQKLESISFKCMSHSKRVKLLPVDIDDALNMK 152 (448)
Q Consensus 119 ~~~q~l~~~A~Kfm~hskR~kLt~~DIn~AL~~~ 152 (448)
.++..+...|.++.+|.+|+++...||..||+..
T Consensus 34 ~fv~~v~~~a~~lAkHr~~~tv~~~Di~l~l~r~ 67 (72)
T cd07981 34 DFVDDVVEDACRLAKHRKSDTLEVKDVQLHLERN 67 (72)
T ss_pred HHHHHHHHHHHHHHHHcCCCCCCHHHHHHHHHHh
Confidence 3788899999999999999999999999999754
No 29
>PF11864 DUF3384: Domain of unknown function (DUF3384); InterPro: IPR024584 This entry represents the N-terminal domain of tuberin which is functionally uncharacterised.
Probab=84.43 E-value=15 Score=39.55 Aligned_cols=97 Identities=16% Similarity=0.267 Sum_probs=73.2
Q ss_pred chHHHHHHHHHHHHHhcCCCHHHHHHHHHhhhcCCC-----cccchhhHHHHHHhhhh-----------------hccc-
Q psy11212 277 LSVEQQLYYKEITEACVGSDEGRRFEALQSLATDPG-----LHEMLPRMCTFIAEGVK-----------------VNVV- 333 (448)
Q Consensus 277 LSkElQlyf~~It~al~~~~e~~r~~aL~sL~tD~g-----L~qLlPyfv~FI~e~V~-----------------~nl~- 333 (448)
+.-+-..||..|++.-..+|-..|-+||..|..+.. ...+.|.+++|+..-.. .+..
T Consensus 64 ~~~~R~~fF~~I~~~~~~~d~~~~l~aL~~LT~~Grdi~~~~~~i~~~L~~wl~~~~~~~~~~r~~~~~~~~~~~~~~~~ 143 (464)
T PF11864_consen 64 SGLMRAEFFRDISDPSNDDDFDLRLEALIALTDNGRDIDFFEYEIGPFLLSWLEPSYQAARSARRKAKKSSSSKSKGLSN 143 (464)
T ss_pred cHHHHHHHHHHHhcCCCchhHHHHHHHHHHHHcCCcCchhcccchHHHHHHHHHHHHHHHHHHHHHhhcccccccccccc
Confidence 567788999999777555666789999999987753 66799999999976660 0110
Q ss_pred -cCCHHHHHHHHHHHHHHhcCCCcccc-cchhhHHHHHHHHh
Q psy11212 334 -QNNLALLIYLMRMVKALLDNQSLYLE-KYLHELLPSILSCI 373 (448)
Q Consensus 334 -~~nl~~L~~ll~~v~ALl~N~~l~ie-pYlHqLlp~lLtCl 373 (448)
..+-..+..+++++.-++..++-+++ ..++.++-.+++.+
T Consensus 144 ~~~~~~~l~~ll~~l~nviKfn~~~l~e~~i~~lv~~i~~iC 185 (464)
T PF11864_consen 144 LDNEESNLSDLLQFLVNVIKFNFNYLDEDEISSLVDQICTIC 185 (464)
T ss_pred ccchhhhHHHHHHHHHHHHhcCCCCCCHHHHHHHHHHHHHHH
Confidence 12455688899999999999998777 78888888776653
No 30
>smart00428 H3 Histone H3.
Probab=83.22 E-value=0.86 Score=39.53 Aligned_cols=34 Identities=26% Similarity=0.237 Sum_probs=29.6
Q ss_pred hhhhhhhhhhhhcccCCccCChhHHHHHHHhcCC
Q psy11212 121 AQKLESISFKCMSHSKRVKLLPVDIDDALNMKNI 154 (448)
Q Consensus 121 ~q~l~~~A~Kfm~hskR~kLt~~DIn~AL~~~ni 154 (448)
.-.+...|..++.|+||.+|++.||..|.+.++.
T Consensus 70 lv~lfeda~~~a~HAkRvTl~~kDi~La~rir~~ 103 (105)
T smart00428 70 LVGLFEDTNLLAIHAKRVTIMPKDIQLARRIRGE 103 (105)
T ss_pred HHHHHHHHHHHHHHhCCccCcHhhHHHHHHHhcc
Confidence 4456677899999999999999999999998875
No 31
>PF07524 Bromo_TP: Bromodomain associated; InterPro: IPR006565 This bromodomain is found in eukaryotic transcription factors and PHD domain containing proteins (IPR001965 from INTERPRO). The tandem PHD finger-bromodomain is found in many chromatin-associated proteins. It is involved in gene silencing by the human co-repressor KRAB-associated protein 1 (KAP1). The tandem PHD finger-bromodomain of KAP1 has a distinct structure that joins the two protein modules. The first helix, alpha(Z), of an atypical bromodomain forms the central hydrophobic core that anchors the other three helices of the bromodomain on one side and the zinc binding PHD finger on the other []. The Rap1 GTPase-activating protein, Sipa1, is modulated by the cellular bromodomain protein, Brd4. Brd4 belongs to the BET family and is a multifunctional protein involved in transcription, replication, the signal transduction pathway, and cell cycle progression. All of these functions are linked to its association with acetylated chromatin. It has tandem bromodomains []. The dysregulation of the Brd4-associated pathways may play an important role in breast cancer progression []. Bovine papillomavirus type 1 E2 also binds to chromosomes in a complex with Brd4. Interaction with Brd4 is additionally important for E2-mediated transcriptional regulation [, ].
Probab=82.16 E-value=1.1 Score=36.02 Aligned_cols=36 Identities=22% Similarity=0.260 Sum_probs=32.9
Q ss_pred cchhhhhhhhhhhhcccCCccCChhHHHHHHHhcCC
Q psy11212 119 KSAQKLESISFKCMSHSKRVKLLPVDIDDALNMKNI 154 (448)
Q Consensus 119 ~~~q~l~~~A~Kfm~hskR~kLt~~DIn~AL~~~ni 154 (448)
+..+++-+.+.+|+.|+.|+.-+..||..||+.+|+
T Consensus 38 ~yl~~l~~~~~~~ae~~gRt~~~~~Dv~~al~~~gi 73 (77)
T PF07524_consen 38 RYLQELGRTAKRYAEHAGRTEPNLQDVEQALEEMGI 73 (77)
T ss_pred HHHHHHHHHHHHHHHHcCCCCCCHHHHHHHHHHhCC
Confidence 466788888999999999999999999999999887
No 32
>cd00020 ARM Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
Probab=80.88 E-value=3.6 Score=33.92 Aligned_cols=108 Identities=10% Similarity=0.057 Sum_probs=66.6
Q ss_pred chhhHHHHHHhhhhhccccCCHHHHHHHHHHHHHHhcCCCcccccchh-hHHHHHHHHhhccccCCCCCCCcchHHHHHH
Q psy11212 316 MLPRMCTFIAEGVKVNVVQNNLALLIYLMRMVKALLDNQSLYLEKYLH-ELLPSILSCIVSKQLCGRPEADNHWALRDFA 394 (448)
Q Consensus 316 LlPyfv~FI~e~V~~nl~~~nl~~L~~ll~~v~ALl~N~~l~iepYlH-qLlp~lLtCll~k~l~~~~~~~~hw~LRd~A 394 (448)
++|+++.++.+. +.......+..+..|..+..-.....+. ..+|.++.++-. ++..+|..|
T Consensus 8 ~i~~l~~~l~~~--------~~~~~~~a~~~l~~l~~~~~~~~~~~~~~~~i~~l~~~l~~----------~~~~v~~~a 69 (120)
T cd00020 8 GLPALVSLLSSS--------DENVQREAAWALSNLSAGNNDNIQAVVEAGGLPALVQLLKS----------EDEEVVKAA 69 (120)
T ss_pred ChHHHHHHHHcC--------CHHHHHHHHHHHHHHhcCCHHHHHHHHHCCChHHHHHHHhC----------CCHHHHHHH
Confidence 677777777533 3556666777777777774333333333 788888888732 346999999
Q ss_pred HHHHHHHHHHhcCCCcch-HHHHHHHHHHHhcCCCCCccchhhhhhhhcc
Q psy11212 395 SRLLTQVSKNFNTSTNNI-QTRVTRLLSNAINDPKINFPSLYGAIAGEGL 443 (448)
Q Consensus 395 A~lL~~I~~~~~~sy~~L-~~RI~~tl~k~l~d~~~~l~t~YGAl~GL~a 443 (448)
+.+|..|+......-..+ +.-++..+.+.+.+++ ..+.--|+..|..
T Consensus 70 ~~~L~~l~~~~~~~~~~~~~~g~l~~l~~~l~~~~--~~~~~~a~~~l~~ 117 (120)
T cd00020 70 LWALRNLAAGPEDNKLIVLEAGGVPKLVNLLDSSN--EDIQKNATGALSN 117 (120)
T ss_pred HHHHHHHccCcHHHHHHHHHCCChHHHHHHHhcCC--HHHHHHHHHHHHH
Confidence 999999987653211111 2235677777777652 3344444444443
No 33
>PF00808 CBFD_NFYB_HMF: Histone-like transcription factor (CBF/NF-Y) and archaeal histone; InterPro: IPR003958 The CCAAT-binding factor (CBF) is a mammalian transcription factor that binds to a CCAAT motif in the promoters of a wide variety of genes, including type I collagen and albumin. The factor is a heteromeric complex of A and B subunits, both of which are required for DNA-binding [, ]. The subunits can interact in the absence of DNA-binding, conserved regions in each being important in mediating this interaction. The A subunit can be split into 3 domains on the basis of sequence similarity, a non-conserved N-terminal 'A domain'; a highly-conserved central 'B domain' involved in DNA-binding; and a C-terminal 'C domain', which contains a number of glutamine and acidic residues involved in protein-protein interactions []. The A subunit shows striking similarity to the HAP3 subunit of the yeast CCAAT-binding heterotrimeric transcription factor [, ]. The Kluyveromyces lactis HAP3 protein has been predicted to contain a 4-cysteine zinc finger, which is thought to be present in similar HAP3 and CBF subunit A proteins, in which the third cysteine is replaced by a serine []. This domain is found in the CCAAT transcription factor and archaeal histones.; GO: 0043565 sequence-specific DNA binding, 0005622 intracellular; PDB: 1F1E_A 2BYM_D 2BYK_D 1HTA_A 1B67_A 1JFI_B 1KU5_B 1N1J_A 1BFM_A 1B6W_A ....
Probab=80.83 E-value=0.94 Score=35.22 Aligned_cols=31 Identities=19% Similarity=0.145 Sum_probs=27.7
Q ss_pred cchhhhhhhhhhhhcccCCccCChhHHHHHH
Q psy11212 119 KSAQKLESISFKCMSHSKRVKLLPVDIDDAL 149 (448)
Q Consensus 119 ~~~q~l~~~A~Kfm~hskR~kLt~~DIn~AL 149 (448)
.|++.+.+.|..+..+.+|++++.+||..||
T Consensus 35 ~Fi~~l~~~A~~~a~~~~rkti~~~Dv~~Av 65 (65)
T PF00808_consen 35 EFIQYLAKEANEIAQRDKRKTITYEDVAKAV 65 (65)
T ss_dssp HHHHHHHHHHHHHHHHTTSSEE-HHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHcCCCccCHHHHHHHC
Confidence 4788899999999999999999999999986
No 34
>KOG2023|consensus
Probab=80.14 E-value=16 Score=41.18 Aligned_cols=138 Identities=16% Similarity=0.259 Sum_probs=81.0
Q ss_pred HHHHHHHHHhcCCCHHHHHHHHHhhh--cCCCcccchhh---HHHHHHhhhhhccccCCHHHHHHHHHHHHHHhcCCCc-
Q psy11212 283 LYYKEITEACVGSDEGRRFEALQSLA--TDPGLHEMLPR---MCTFIAEGVKVNVVQNNLALLIYLMRMVKALLDNQSL- 356 (448)
Q Consensus 283 lyf~~It~al~~~~e~~r~~aL~sL~--tD~gL~qLlPy---fv~FI~e~V~~nl~~~nl~~L~~ll~~v~ALl~N~~l- 356 (448)
.|.+-+..--...+++.|+.+..+|- .+-...-|+|+ .+.|+-..++. +|-.+-.---.+-.|+..+|-.
T Consensus 215 ~Fle~lFalanD~~~eVRk~vC~alv~Llevr~dkl~phl~~IveyML~~tqd----~dE~VALEACEFwla~aeqpi~~ 290 (885)
T KOG2023|consen 215 KFLEILFALANDEDPEVRKNVCRALVFLLEVRPDKLVPHLDNIVEYMLQRTQD----VDENVALEACEFWLALAEQPICK 290 (885)
T ss_pred HHHHHHHHHccCCCHHHHHHHHHHHHHHHHhcHHhcccchHHHHHHHHHHccC----cchhHHHHHHHHHHHHhcCcCcH
Confidence 33333333333456777876555442 12222233333 34444333221 2222222234566689999944
Q ss_pred -ccccchhhHHHHHHHHhhccccC----CCC--------------------------------C----------CCcchH
Q psy11212 357 -YLEKYLHELLPSILSCIVSKQLC----GRP--------------------------------E----------ADNHWA 389 (448)
Q Consensus 357 -~iepYlHqLlp~lLtCll~k~l~----~~~--------------------------------~----------~~~hw~ 389 (448)
-+.|||.+|+|.++..++..... .+. + ....|.
T Consensus 291 ~~L~p~l~kliPvLl~~M~Ysd~D~~LL~~~eeD~~vpDreeDIkPRfhksk~~~~~~~~~~eDdddDe~DDdD~~~dWN 370 (885)
T KOG2023|consen 291 EVLQPYLDKLIPVLLSGMVYSDDDIILLKNNEEDESVPDREEDIKPRFHKSKEHGNGEDADDEDDDDDEDDDDDAFSDWN 370 (885)
T ss_pred HHHHHHHHHHHHHHHccCccccccHHHhcCccccccCCchhhhccchhhhchhccCcccccccccccccccccccccccc
Confidence 38999999999999776554421 000 0 012599
Q ss_pred HHHHHHHHHHHHHHHhcCCCcchHHHHHHHHHHHhcCC
Q psy11212 390 LRDFASRLLTQVSKNFNTSTNNIQTRVTRLLSNAINDP 427 (448)
Q Consensus 390 LRd~AA~lL~~I~~~~~~sy~~L~~RI~~tl~k~l~d~ 427 (448)
||..||..|..+.+-|+. .+-+-++-.|+..|..+
T Consensus 371 LRkCSAAaLDVLanvf~~---elL~~l~PlLk~~L~~~ 405 (885)
T KOG2023|consen 371 LRKCSAAALDVLANVFGD---ELLPILLPLLKEHLSSE 405 (885)
T ss_pred HhhccHHHHHHHHHhhHH---HHHHHHHHHHHHHcCcc
Confidence 999999999999999886 46666777777777654
No 35
>cd00020 ARM Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
Probab=78.34 E-value=11 Score=31.00 Aligned_cols=70 Identities=11% Similarity=0.146 Sum_probs=49.7
Q ss_pred cchhhHHHHHHhhhhhccccCCHHHHHHHHHHHHHHhcCCCcccccch-hhHHHHHHHHhhccccCCCCCCCcchHHHHH
Q psy11212 315 EMLPRMCTFIAEGVKVNVVQNNLALLIYLMRMVKALLDNQSLYLEKYL-HELLPSILSCIVSKQLCGRPEADNHWALRDF 393 (448)
Q Consensus 315 qLlPyfv~FI~e~V~~nl~~~nl~~L~~ll~~v~ALl~N~~l~iepYl-HqLlp~lLtCll~k~l~~~~~~~~hw~LRd~ 393 (448)
..+|.++.++.+ +|.......+.....|..++.-..+.+. +.+++.++.++-. .+...|+.
T Consensus 49 ~~i~~l~~~l~~--------~~~~v~~~a~~~L~~l~~~~~~~~~~~~~~g~l~~l~~~l~~----------~~~~~~~~ 110 (120)
T cd00020 49 GGLPALVQLLKS--------EDEEVVKAALWALRNLAAGPEDNKLIVLEAGGVPKLVNLLDS----------SNEDIQKN 110 (120)
T ss_pred CChHHHHHHHhC--------CCHHHHHHHHHHHHHHccCcHHHHHHHHHCCChHHHHHHHhc----------CCHHHHHH
Confidence 567777777653 3556667777888888888764444433 4578888887622 35789999
Q ss_pred HHHHHHHHH
Q psy11212 394 ASRLLTQVS 402 (448)
Q Consensus 394 AA~lL~~I~ 402 (448)
|..+|..||
T Consensus 111 a~~~l~~l~ 119 (120)
T cd00020 111 ATGALSNLA 119 (120)
T ss_pred HHHHHHHhh
Confidence 999999887
No 36
>PF01602 Adaptin_N: Adaptin N terminal region; InterPro: IPR002553 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. Clathrin coats contain both clathrin and adaptor complexes that link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. The two major types of clathrin adaptor complexes are the heterotetrameric adaptor protein (AP) complexes, and the monomeric GGA (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) adaptors []. All AP complexes are heterotetramers composed of two large subunits (adaptins), a medium subunit (mu) and a small subunit (sigma). Each subunit has a specific function. Adaptin subunits recognise and bind to clathrin through their hinge region (clathrin box), and recruit accessory proteins that modulate AP function through their C-terminal appendage domains. By contrast, GGAs are monomers composed of four domains, which have functions similar to AP subunits: an N-terminal VHS (Vps27p/Hrs/Stam) domain, a GAT (GGA and Tom1) domain, a hinge region, and a C-terminal GAE (gamma-adaptin ear) domain. The GAE domain is similar to the AP gamma-adaptin ear domain, being responsible for the recruitment of accessory proteins that regulate clathrin-mediated endocytosis []. While clathrin mediates endocytic protein transport from ER to Golgi, coatomers (COPI, COPII) primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the N-terminal domain of various adaptins from different AP clathrin adaptor complexes (including AP1, AP2, AP3 and AP4), and from the beta and gamma subunits of various coatomer (COP) adaptors. This domain has a 2-layer alpha/alpha fold that forms a right-handed superhelix, and is a member of the ARM repeat superfamily []. The N-terminal region of the various AP adaptor proteins share strong sequence identity; by contrast, the C-terminal domains of different adaptins share similar structural folds, but have little sequence identity []. It has been proposed that the N-terminal domain interacts with another uniform component of the coated vesicles. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 1W63_C 2JKR_A 2JKT_A 2XA7_A 2VGL_B 3TJZ_E.
Probab=78.02 E-value=10 Score=40.42 Aligned_cols=138 Identities=12% Similarity=0.148 Sum_probs=82.2
Q ss_pred hHHHHHHHHHHHHHhcCCCHHHHHHHH----HhhhcCCCcccchhhHHHHHHhhhhhccccCCHHHHHHHHHHHHHHhcC
Q psy11212 278 SVEQQLYYKEITEACVGSDEGRRFEAL----QSLATDPGLHEMLPRMCTFIAEGVKVNVVQNNLALLIYLMRMVKALLDN 353 (448)
Q Consensus 278 SkElQlyf~~It~al~~~~e~~r~~aL----~sL~tD~gL~qLlPyfv~FI~e~V~~nl~~~nl~~L~~ll~~v~ALl~N 353 (448)
..+...+|-.++..+.+++...++.++ .....|+. ++.-+.+.+.+.+.+.|-..-..-++.+-.+.
T Consensus 37 G~~~~~~~~~vi~l~~s~~~~~Krl~yl~l~~~~~~~~~-------~~~l~~n~l~kdl~~~n~~~~~lAL~~l~~i~-- 107 (526)
T PF01602_consen 37 GYDISFLFMEVIKLISSKDLELKRLGYLYLSLYLHEDPE-------LLILIINSLQKDLNSPNPYIRGLALRTLSNIR-- 107 (526)
T ss_dssp T---GSTHHHHHCTCSSSSHHHHHHHHHHHHHHTTTSHH-------HHHHHHHHHHHHHCSSSHHHHHHHHHHHHHH---
T ss_pred CCCCchHHHHHHHHhCCCCHHHHHHHHHHHHHHhhcchh-------HHHHHHHHHHHhhcCCCHHHHHHHHhhhhhhc--
Confidence 344446677777777766655444322 33455554 23334444444454456544444555554433
Q ss_pred CCcccccchhhHHHHHHHHhhccccCCCCCCCcchHHHHHHHHHHHHHHHHhcCCCcchHHH-HHHHHHHHhcCCCCCcc
Q psy11212 354 QSLYLEKYLHELLPSILSCIVSKQLCGRPEADNHWALRDFASRLLTQVSKNFNTSTNNIQTR-VTRLLSNAINDPKINFP 432 (448)
Q Consensus 354 ~~l~iepYlHqLlp~lLtCll~k~l~~~~~~~~hw~LRd~AA~lL~~I~~~~~~sy~~L~~R-I~~tl~k~l~d~~~~l~ 432 (448)
-..-+..++|.|..|+- ++++.+|..|+-.+.++++.+.. +-+. +...+.+.|.|++ ..
T Consensus 108 ----~~~~~~~l~~~v~~ll~----------~~~~~VRk~A~~~l~~i~~~~p~----~~~~~~~~~l~~lL~d~~--~~ 167 (526)
T PF01602_consen 108 ----TPEMAEPLIPDVIKLLS----------DPSPYVRKKAALALLKIYRKDPD----LVEDELIPKLKQLLSDKD--PS 167 (526)
T ss_dssp ----SHHHHHHHHHHHHHHHH----------SSSHHHHHHHHHHHHHHHHHCHC----CHHGGHHHHHHHHTTHSS--HH
T ss_pred ----ccchhhHHHHHHHHHhc----------CCchHHHHHHHHHHHHHhccCHH----HHHHHHHHHHhhhccCCc--ch
Confidence 23334667777777763 35679999999999999999544 4334 6777777777653 66
Q ss_pred chhhhhhhhccc
Q psy11212 433 SLYGAIAGEGLV 444 (448)
Q Consensus 433 t~YGAl~GL~al 444 (448)
....|+..+..+
T Consensus 168 V~~~a~~~l~~i 179 (526)
T PF01602_consen 168 VVSAALSLLSEI 179 (526)
T ss_dssp HHHHHHHHHHHH
T ss_pred hHHHHHHHHHHH
Confidence 777777766554
No 37
>PF02291 TFIID-31kDa: Transcription initiation factor IID, 31kD subunit; InterPro: IPR003162 Human transcription initiation factor TFIID is composed of the TATA-binding polypeptide (TBP) and at least 13 TBP-associated factors (TAFs) that collectively or individually are involved in activator-dependent transcription []. TAFII-31 protein is a transcriptional coactivator of the p53 protein [].; GO: 0006352 transcription initiation, DNA-dependent; PDB: 1TAF_A.
Probab=76.74 E-value=1.8 Score=38.88 Aligned_cols=63 Identities=11% Similarity=0.020 Sum_probs=29.1
Q ss_pred chhhhhhhhhhhhcccCCccCChhHHHHHHHhcCCCCCcCCCCCCccCceecCCCCceeEeeccccccHHHHHhc-cCCC
Q psy11212 120 SAQKLESISFKCMSHSKRVKLLPVDIDDALNMKNIEPIYGLTVSQFLPFKHASGGGRELHFTDEKEVDFEDLLTN-LNPK 198 (448)
Q Consensus 120 ~~q~l~~~A~Kfm~hskR~kLt~~DIn~AL~~~niEPLyGy~s~~~~~f~~~s~~g~~l~~~~D~evdL~~ii~~-~lpk 198 (448)
....++..|.-|..|++|+..+.+||..|.+++ .+|+-..+. ..|.-.++.+-.|+ |||.
T Consensus 45 Yt~~vL~DA~~ya~hA~~~~i~~~DVrLAi~~r-----~~~~f~~pp--------------pre~llelA~e~N~~PLP~ 105 (129)
T PF02291_consen 45 YTSDVLEDAQVYADHAGRSTIDADDVRLAIQSR-----LDHSFTQPP--------------PREFLLELAREKNSIPLPP 105 (129)
T ss_dssp HHHHHHHHHHHHHHHTT-SSB-HHHHHHHHHHT-----------------------------------------------
T ss_pred HHHHHHHHHHHHHHhcccccCChHHHHHHHHHH-----HhhhccCCC--------------ChHHHHHHHHHhcCCCCCC
Confidence 478899999999999999999999999999988 232211111 22445566666665 7887
Q ss_pred CCC
Q psy11212 199 SAL 201 (448)
Q Consensus 199 iP~ 201 (448)
++.
T Consensus 106 i~~ 108 (129)
T PF02291_consen 106 IPP 108 (129)
T ss_dssp ---
T ss_pred CCC
Confidence 763
No 38
>KOG2171|consensus
Probab=73.70 E-value=20 Score=42.39 Aligned_cols=153 Identities=20% Similarity=0.278 Sum_probs=101.1
Q ss_pred HHHHHHHHHHhcCCCHHHHHHHHHhhhcCCCcccchhhHHHHHHhhhhhccccCCHHHHHHHHHHHHHHhcC--CCcccc
Q psy11212 282 QLYYKEITEACVGSDEGRRFEALQSLATDPGLHEMLPRMCTFIAEGVKVNVVQNNLALLIYLMRMVKALLDN--QSLYLE 359 (448)
Q Consensus 282 Qlyf~~It~al~~~~e~~r~~aL~sL~tD~gL~qLlPyfv~FI~e~V~~nl~~~nl~~L~~ll~~v~ALl~N--~~l~ie 359 (448)
+..|..+...+.+.|.+.|+.|=+.|++..+=++++|-|++.....=.- +..++..+ ..|-|+.- +.|. .
T Consensus 3 ~~~l~qLl~~l~spDn~vr~~Ae~~l~~~~~~~~~l~~L~~i~~~~~~p-----~~Rq~aaV--l~Rkl~~~~w~~l~-~ 74 (1075)
T KOG2171|consen 3 SAPLEQLLQQLLSPDNEVRRQAEEALETLAKTEPLLPALAHILATSADP-----QVRQLAAV--LLRKLLTKHWSRLS-A 74 (1075)
T ss_pred hhHHHHHHHHhcCCCchHHHHHHHHHHHhhcccchHHHHHHHHhcCCCh-----HHHHHHHH--HHHHHHHHHhhcCC-H
Confidence 4567778888888888889999999999999999999999887644211 12222111 11111111 1111 1
Q ss_pred cchhhHHHHHHHHhhcccc-------C-------------CCCC---------CCcchHHHHHHHHHHHHHHHHhcCCCc
Q psy11212 360 KYLHELLPSILSCIVSKQL-------C-------------GRPE---------ADNHWALRDFASRLLTQVSKNFNTSTN 410 (448)
Q Consensus 360 pYlHqLlp~lLtCll~k~l-------~-------------~~~~---------~~~hw~LRd~AA~lL~~I~~~~~~sy~ 410 (448)
---.+|-.++|.|++...- | ..|+ ...+-.+|+-|=.+|..+-.-|++...
T Consensus 75 e~~~siks~lL~~~~~E~~~~vr~k~~dviAeia~~~l~e~WPell~~L~q~~~S~~~~~rE~al~il~s~~~~~~~~~~ 154 (1075)
T KOG2171|consen 75 EVQQSIKSSLLEIIQSETEPSVRHKLADVIAEIARNDLPEKWPELLQFLFQSTKSPNPSLRESALLILSSLPETFGNTLQ 154 (1075)
T ss_pred HHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHhccccchHHHHHHHHHHhcCCCcchhHHHHHHHHhhhhhhccccc
Confidence 1223555666666544321 1 0110 123457899999999999999999887
Q ss_pred chHHHHHHHHHHHhcCCCCCccchhhhhhhhccc
Q psy11212 411 NIQTRVTRLLSNAINDPKINFPSLYGAIAGEGLV 444 (448)
Q Consensus 411 ~L~~RI~~tl~k~l~d~~~~l~t~YGAl~GL~al 444 (448)
..-.-+.+.|.+.+.|+..+ .+.-|+.++.++
T Consensus 155 ~~~~~l~~lf~q~~~d~s~~--vr~~a~rA~~a~ 186 (1075)
T KOG2171|consen 155 PHLDDLLRLFSQTMTDPSSP--VRVAAVRALGAF 186 (1075)
T ss_pred hhHHHHHHHHHHhccCCcch--HHHHHHHHHHHH
Confidence 77778999999999999877 777788777665
No 39
>PLN00160 histone H3; Provisional
Probab=70.23 E-value=2.9 Score=35.79 Aligned_cols=35 Identities=29% Similarity=0.345 Sum_probs=29.5
Q ss_pred chhhhhhhhhhhhcccCCccCChhHHHHHHHhcCC
Q psy11212 120 SAQKLESISFKCMSHSKRVKLLPVDIDDALNMKNI 154 (448)
Q Consensus 120 ~~q~l~~~A~Kfm~hskR~kLt~~DIn~AL~~~ni 154 (448)
..-.+...|..++.|+||-+|++.|+..|.+.++.
T Consensus 60 yLv~lfed~~lca~HakRVTl~~kD~~L~~rirg~ 94 (97)
T PLN00160 60 HLVGLFEDSNLCAIHGKRVTIMPKDMQLARRIRGQ 94 (97)
T ss_pred HHHHHHhhhHHHHHHhcccccchhhHHHHHHhhcc
Confidence 34456677889999999999999999999998863
No 40
>KOG1242|consensus
Probab=69.96 E-value=35 Score=37.87 Aligned_cols=145 Identities=15% Similarity=0.144 Sum_probs=93.0
Q ss_pred HHHHHHHHHhcCCC-HHHH---HHHHHhhhcCCCcccchhhHHH-HHHhhhhhccccCCHHHHHHHHHHHHHHhcCCCcc
Q psy11212 283 LYYKEITEACVGSD-EGRR---FEALQSLATDPGLHEMLPRMCT-FIAEGVKVNVVQNNLALLIYLMRMVKALLDNQSLY 357 (448)
Q Consensus 283 lyf~~It~al~~~~-e~~r---~~aL~sL~tD~gL~qLlPyfv~-FI~e~V~~nl~~~nl~~L~~ll~~v~ALl~N~~l~ 357 (448)
.++.+.+..+.+.. -..| ..++..+-++.|+..+-++-+- =+...+.. . ++-..-...+....++..|=+-.
T Consensus 133 ~~~l~~l~~ll~~~~~~~~~~aa~~~ag~v~g~~i~~~~~~~~l~~l~~ai~d-k--~~~~~re~~~~a~~~~~~~Lg~~ 209 (569)
T KOG1242|consen 133 EYVLELLLELLTSTKIAERAGAAYGLAGLVNGLGIESLKEFGFLDNLSKAIID-K--KSALNREAALLAFEAAQGNLGPP 209 (569)
T ss_pred HHHHHHHHHHhccccHHHHhhhhHHHHHHHcCcHHhhhhhhhHHHHHHHHhcc-c--chhhcHHHHHHHHHHHHHhcCCC
Confidence 45555555555432 2222 3478999999999999887421 11222211 1 11111113455666778888888
Q ss_pred cccchhhHHHHHHHHhhccccCCCCCCCcchHHHHHHHHHHHHHHHHhcCCCcchHHHHHHHHHHHhcCCCCCccchhhh
Q psy11212 358 LEKYLHELLPSILSCIVSKQLCGRPEADNHWALRDFASRLLTQVSKNFNTSTNNIQTRVTRLLSNAINDPKINFPSLYGA 437 (448)
Q Consensus 358 iepYlHqLlp~lLtCll~k~l~~~~~~~~hw~LRd~AA~lL~~I~~~~~~sy~~L~~RI~~tl~k~l~d~~~~l~t~YGA 437 (448)
-|||+-.++|.+++|.=.+ .=.+|+-|......|.+.+.. .+++-.+..++...+.+ ...|.=++
T Consensus 210 ~EPyiv~~lp~il~~~~d~----------~~~Vr~Aa~~a~kai~~~~~~--~aVK~llpsll~~l~~~---kWrtK~as 274 (569)
T KOG1242|consen 210 FEPYIVPILPSILTNFGDK----------INKVREAAVEAAKAIMRCLSA--YAVKLLLPSLLGSLLEA---KWRTKMAS 274 (569)
T ss_pred CCchHHhhHHHHHHHhhcc----------chhhhHHHHHHHHHHHHhcCc--chhhHhhhhhHHHHHHH---hhhhHHHH
Confidence 9999999999999998332 227899999999999888776 55666665555555543 46677777
Q ss_pred hhhhcccC
Q psy11212 438 IAGEGLVT 445 (448)
Q Consensus 438 l~GL~alG 445 (448)
+-+|.+|+
T Consensus 275 lellg~m~ 282 (569)
T KOG1242|consen 275 LELLGAMA 282 (569)
T ss_pred HHHHHHHH
Confidence 77776664
No 41
>cd07979 TAF9 TATA Binding Protein (TBP) Associated Factor 9 (TAF9) is one of several TAFs that bind TBP and is involved in forming Transcription Factor IID (TFIID) complex. The TATA Binding Protein (TBP) Associated Factor 9 (TAF9) is one of several TAFs that bind TBP and are involved in forming the TFIID complex. TFIID is one of seven General Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE, TFIIF, and TFIID) that are involved in accurate initiation of transcription by RNA polymerase II in eukaryotes. TFIID plays an important role in the recognition of promoter DNA and assembly of the pre-initiation complex. The TFIID complex is composed of the TBP and at least 13 TAFs. TAFs from various species were originally named by their predicted molecular weight or their electrophoretic mobility in polyacrylamide gels. A new, unified nomenclature for the pol II TAFs has been suggested to show the relationship between TAFs orthologs and paralogs. Human TAF9 has a paralogue gene (TAF9L) whi
Probab=67.43 E-value=6.4 Score=34.63 Aligned_cols=75 Identities=13% Similarity=0.109 Sum_probs=51.9
Q ss_pred cCCCCCHHHHHHHHHhc-C-CCCccHHHHHHh---------HHhHHHHHHhhcccccCCCCCCCccccccCCCCcceeec
Q psy11212 4 YGTSFSIESMKVIAESI-G-IPNLPDDAAKEL---------SDNITYCCKVIIQPIYGLTVSQFLPFKHASGGGRELHFT 72 (448)
Q Consensus 4 ~~~~~~~~~~~~~a~~~-~-~~~~~~~~~k~m---------~~d~~~~l~~~~ep~~g~~~~~~~~f~~~~~~~~~~~~~ 72 (448)
+...+++++...|.|-+ + ..+|.+||..|. .+||.-|++...+ |.|.++-+ |
T Consensus 15 Gv~~~~~~v~~~Lle~~~ry~~~il~dA~~~a~hA~r~tV~~eDV~lAi~~r~~--~~f~~~p~---~------------ 77 (117)
T cd07979 15 GITEYEPRVINQLLEFAYRYTTDVLDDAKVYSEHAGKANIDADDVKLAIQSRVD--YSFTSPPP---R------------ 77 (117)
T ss_pred CCCccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCHHHHHHHHHHHhc--cCCCCCCc---H------------
Confidence 45689999999999988 4 688999999987 5899999997555 66664421 1
Q ss_pred cCccccHHHHhh-ccCCCCCcccce
Q psy11212 73 DEKEVDFEDLLT-NLNPKSALETHL 96 (448)
Q Consensus 73 ~d~e~d~~~~~~-~~~p~~~~~~~~ 96 (448)
|.=+++..-.| .|||+++....+
T Consensus 78 -~~l~~~a~~~N~~pLP~~~~~~g~ 101 (117)
T cd07979 78 -DFLLELAREKNSIPLPPIPPSCGL 101 (117)
T ss_pred -HHHHHHHHHhccCCCCCCCCCCCc
Confidence 11122333334 788888866554
No 42
>KOG2171|consensus
Probab=67.41 E-value=22 Score=42.08 Aligned_cols=100 Identities=16% Similarity=0.254 Sum_probs=74.3
Q ss_pred CCcccchhhHHHHHHhhhhhccccCCHHHHHHHHHHHHHHhcCCCcccccchhhHHHHHHHHhhccccCCCCCCCcchHH
Q psy11212 311 PGLHEMLPRMCTFIAEGVKVNVVQNNLALLIYLMRMVKALLDNQSLYLEKYLHELLPSILSCIVSKQLCGRPEADNHWAL 390 (448)
Q Consensus 311 ~gL~qLlPyfv~FI~e~V~~nl~~~nl~~L~~ll~~v~ALl~N~~l~iepYlHqLlp~lLtCll~k~l~~~~~~~~hw~L 390 (448)
|.+.++.|+++.|+..+ +-.+ =..-+.++..++.=-.=..-+|.--++|.++.|+ .+..|++
T Consensus 840 p~f~~~~p~iv~~l~~~-------~~~~-r~~av~~~~d~ie~~~~a~~~~~~~~~p~~~~~~----------~d~~pEV 901 (1075)
T KOG2171|consen 840 PFFENFLPLIVKLLKSK-------KTVA-RQWAVCIFDDLIEGCGEASAKYKERFLPLVLEAL----------QDSDPEV 901 (1075)
T ss_pred HHHHHHHHHHHHHHhcC-------CcHH-HHHHHHHHHHHHHhcccccchHHHHHHHHHHHHc----------CCCCHHH
Confidence 44566677777766543 2222 2234556667776555566778889999998887 3578999
Q ss_pred HHHHHHHHHHHHHHhcCCCcchHHHHHHHHHHHhcCCC
Q psy11212 391 RDFASRLLTQVSKNFNTSTNNIQTRVTRLLSNAINDPK 428 (448)
Q Consensus 391 Rd~AA~lL~~I~~~~~~sy~~L~~RI~~tl~k~l~d~~ 428 (448)
|.-||--++..|...+..|..+-.-..-.|.+++..+.
T Consensus 902 RqaAsYGiGvlaq~~g~~y~~v~~~~l~~L~~~iq~~~ 939 (1075)
T KOG2171|consen 902 RQAAAYGMGVLAQFGGEDYAPVCSEALPLLVQVLQPPL 939 (1075)
T ss_pred HHHHHHHHHHHHHHcCcchHHHHHHHHHHHHHHHcChh
Confidence 99999999999999999998888888888888887664
No 43
>PF06371 Drf_GBD: Diaphanous GTPase-binding Domain; InterPro: IPR010473 Diaphanous-related formins (Drfs) are a family of formin homology (FH) proteins that act as effectors of Rho small GTPases during growth factor-induced cytoskeletal remodelling, stress fibre formation, and cell division []. Drf proteins are characterised by a variety of shared domains: an N-terminal GTPase-binding domain (GBD), formin-homology domains FH1, FH2 (IPR003104 from INTERPRO) and FH3 (IPR010472 from INTERPRO), and a C-terminal conserved Dia-autoregulatory domain (DAD) that binds the GBD. This entry represents the GBD, which is a bifunctional autoinhibitory domain that interacts with and is regulated by activated Rho family members. Mammalian Drf3 contains a CRIB-like motif within its GBD for binding to Cdc42, which is required for Cdc42 to activate and guide Drf3 towards the cell cortex where it remodels the actin skeleton [].; GO: 0003779 actin binding, 0017048 Rho GTPase binding, 0030036 actin cytoskeleton organization; PDB: 3OBV_A 2BNX_A 3EG5_D 2BAP_B 3O4X_B 1Z2C_B 2F31_A.
Probab=66.69 E-value=7.7 Score=35.56 Aligned_cols=78 Identities=21% Similarity=0.261 Sum_probs=53.4
Q ss_pred CCcccchhhHHHHHHhhhhhccccCCHHHHHHHHHHHHHHhcCCCcccccch--hhHHHHHHHHhhccccCCCCCCCcch
Q psy11212 311 PGLHEMLPRMCTFIAEGVKVNVVQNNLALLIYLMRMVKALLDNQSLYLEKYL--HELLPSILSCIVSKQLCGRPEADNHW 388 (448)
Q Consensus 311 ~gL~qLlPyfv~FI~e~V~~nl~~~nl~~L~~ll~~v~ALl~N~~l~iepYl--HqLlp~lLtCll~k~l~~~~~~~~hw 388 (448)
-|+..|+-.+.++....... .++.......++.++||+.|.. -++-.+ +..+..+..|+.+ .+.
T Consensus 107 ~G~~~L~~~L~~~~~~~~~~---~~~~~~~~~~l~Clkal~n~~~-G~~~v~~~~~~v~~i~~~L~s----------~~~ 172 (187)
T PF06371_consen 107 GGLEALLNVLSKLNKKKEKS---EEDIDIEHECLRCLKALMNTKY-GLEAVLSHPDSVNLIALSLDS----------PNI 172 (187)
T ss_dssp HHHHHHHHHHHHHHTHHCTC---TTCHHHHHHHHHHHHHHTSSHH-HHHHHHCSSSHHHHHHHT--T----------TSH
T ss_pred CCHHHHHHHHHHhhhhhhhc---chhHHHHHHHHHHHHHHHccHH-HHHHHHcCcHHHHHHHHHHCC----------CCH
Confidence 47777777777665544332 3677888889999999998885 222222 3556666777643 345
Q ss_pred HHHHHHHHHHHHHH
Q psy11212 389 ALRDFASRLLTQVS 402 (448)
Q Consensus 389 ~LRd~AA~lL~~I~ 402 (448)
..|..|..+|+.||
T Consensus 173 ~~r~~~leiL~~lc 186 (187)
T PF06371_consen 173 KTRKLALEILAALC 186 (187)
T ss_dssp HHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHH
Confidence 89999999999998
No 44
>PRK09687 putative lyase; Provisional
Probab=66.51 E-value=76 Score=31.85 Aligned_cols=29 Identities=21% Similarity=0.028 Sum_probs=16.0
Q ss_pred HHHHHHHHHhcCCCCCccchhhhhhhhcccC
Q psy11212 415 RVTRLLSNAINDPKINFPSLYGAIAGEGLVT 445 (448)
Q Consensus 415 RI~~tl~k~l~d~~~~l~t~YGAl~GL~alG 445 (448)
+.+..|.++|.|+. ...++.|+.+|..++
T Consensus 159 ~ai~~L~~~L~d~~--~~VR~~A~~aLg~~~ 187 (280)
T PRK09687 159 AAIPLLINLLKDPN--GDVRNWAAFALNSNK 187 (280)
T ss_pred HHHHHHHHHhcCCC--HHHHHHHHHHHhcCC
Confidence 35555556665543 246666666666553
No 45
>PTZ00018 histone H3; Provisional
Probab=66.41 E-value=3.9 Score=37.07 Aligned_cols=36 Identities=28% Similarity=0.330 Sum_probs=31.0
Q ss_pred chhhhhhhhhhhhcccCCccCChhHHHHHHHhcCCC
Q psy11212 120 SAQKLESISFKCMSHSKRVKLLPVDIDDALNMKNIE 155 (448)
Q Consensus 120 ~~q~l~~~A~Kfm~hskR~kLt~~DIn~AL~~~niE 155 (448)
+.-.+...+..++.|+||-+|++.|+..|.+.++.+
T Consensus 100 yLv~lfed~~lca~HakRVTl~~kD~~L~~rirg~~ 135 (136)
T PTZ00018 100 YLVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGER 135 (136)
T ss_pred HHHHHhhhhHHHHHhhcceecchhhHHHHHHhcccC
Confidence 445566778999999999999999999999998764
No 46
>PF12348 CLASP_N: CLASP N terminal; InterPro: IPR024395 This domain is found in the N-terminal region of CLIP-associated proteins (CLASPs), which are widely conserved microtubule plus-end-tracking proteins that regulate the stability of dynamic microtubules [, ]. The domain is also found in other proteins involved in microtubule binding, including STU1, MOR1 and spindle pole body component Alp14.; PDB: 2QK2_A.
Probab=65.89 E-value=72 Score=30.10 Aligned_cols=109 Identities=15% Similarity=0.203 Sum_probs=56.1
Q ss_pred HHHHHHHHhcCCCHHHHHHH---HHhhhcCCCcccchhhH-HHHHHhhhhhccccCCHHHHHHHHHHHHHHhcCCC---c
Q psy11212 284 YYKEITEACVGSDEGRRFEA---LQSLATDPGLHEMLPRM-CTFIAEGVKVNVVQNNLALLIYLMRMVKALLDNQS---L 356 (448)
Q Consensus 284 yf~~It~al~~~~e~~r~~a---L~sL~tD~gL~qLlPyf-v~FI~e~V~~nl~~~nl~~L~~ll~~v~ALl~N~~---l 356 (448)
++.-+.+.+..+...-|..| |..+-...+ ..|.+ ..++..+.. ++|..+=...++.+..++.+-. -
T Consensus 95 ~l~~Ll~~~~~~~~~i~~~a~~~L~~i~~~~~---~~~~~~~~~l~~~~~----~Kn~~vR~~~~~~l~~~l~~~~~~~~ 167 (228)
T PF12348_consen 95 LLPPLLKKLGDSKKFIREAANNALDAIIESCS---YSPKILLEILSQGLK----SKNPQVREECAEWLAIILEKWGSDSS 167 (228)
T ss_dssp HHHHHHHGGG---HHHHHHHHHHHHHHHTTS----H--HHHHHHHHHHTT-----S-HHHHHHHHHHHHHHHTT-----G
T ss_pred HHHHHHHHHccccHHHHHHHHHHHHHHHHHCC---cHHHHHHHHHHHHHh----CCCHHHHHHHHHHHHHHHHHccchHh
Confidence 33333333344444445544 444444433 22233 334433332 2565544445555555555444 2
Q ss_pred cc--ccchhhHHHHHHHHhhccccCCCCCCCcchHHHHHHHHHHHHHHHHhcCCC
Q psy11212 357 YL--EKYLHELLPSILSCIVSKQLCGRPEADNHWALRDFASRLLTQVSKNFNTST 409 (448)
Q Consensus 357 ~i--epYlHqLlp~lLtCll~k~l~~~~~~~~hw~LRd~AA~lL~~I~~~~~~sy 409 (448)
.+ ...+.++++.+..|+-. ..=.+|+.|-+++..+.+.|+..-
T Consensus 168 ~l~~~~~~~~l~~~l~~~l~D----------~~~~VR~~Ar~~~~~l~~~~~~~a 212 (228)
T PF12348_consen 168 VLQKSAFLKQLVKALVKLLSD----------ADPEVREAARECLWALYSHFPERA 212 (228)
T ss_dssp GG--HHHHHHHHHHHHHHHTS----------S-HHHHHHHHHHHHHHHHHH-HHH
T ss_pred hhcccchHHHHHHHHHHHCCC----------CCHHHHHHHHHHHHHHHHHCCHhh
Confidence 33 33567889988888733 334899999999999999988643
No 47
>PF12460 MMS19_C: RNAPII transcription regulator C-terminal; InterPro: IPR024687 This domain, approximately 60 amino acids in length, is found in the N-terminal region of MMS19 proteins. MMS19 is required for both nucleotide excision repair (NER) and RNA polymerase II (RNAP II) transcription [].
Probab=64.08 E-value=78 Score=33.41 Aligned_cols=116 Identities=16% Similarity=0.223 Sum_probs=76.7
Q ss_pred HHHHHhhhcCCC-cccchhhHHHHHHhhhhhccccCCHHHHHHHHHHHHHHhcCC-----Ccccccchhh-HHHHHHHHh
Q psy11212 301 FEALQSLATDPG-LHEMLPRMCTFIAEGVKVNVVQNNLALLIYLMRMVKALLDNQ-----SLYLEKYLHE-LLPSILSCI 373 (448)
Q Consensus 301 ~~aL~sL~tD~g-L~qLlPyfv~FI~e~V~~nl~~~nl~~L~~ll~~v~ALl~N~-----~l~iepYlHq-Llp~lLtCl 373 (448)
-.+|..+.+++. +..++|+++.-+....... .+......+++.+..++.+. +...+.|.|. ++|.++...
T Consensus 23 L~~l~~ls~~~~i~~~~~~~ll~kl~~~~~~~---~~~~~~~~il~tl~~~~~~~~~~~~~~~~~~y~~~~lv~~l~~~~ 99 (415)
T PF12460_consen 23 LEALAALSTSPQILETLSIRLLNKLSIVCQSE---SSSDYCHAILSTLQSLLEKKQEDKQFEDNSWYFHRILVPRLFELA 99 (415)
T ss_pred HHHHHHHHCChhHHHHHHHHHHHHHHHHhcCC---CChHHHHHHHHHHHHHHHhcccccccchHHHHHHhHHHHHHHHHH
Confidence 456777777777 4566666666665443321 14444445555555554443 5567889997 888888877
Q ss_pred hccccCCCCCCCcchHHHHHHHHHHHHHHHHhcCCCcchHHHHHHHHHHHhc
Q psy11212 374 VSKQLCGRPEADNHWALRDFASRLLTQVSKNFNTSTNNIQTRVTRLLSNAIN 425 (448)
Q Consensus 374 l~k~l~~~~~~~~hw~LRd~AA~lL~~I~~~~~~sy~~L~~RI~~tl~k~l~ 425 (448)
+........ .++.+=+.+++++..|++.... .-+..+...+.+.|.
T Consensus 100 ~~~~~~~~~---~~~~~L~~~~~l~~~iv~~l~~---~~q~~~~~~~~~lf~ 145 (415)
T PF12460_consen 100 LQASDQSSD---LDDRVLELLSRLINLIVRSLSP---EKQQEILDELYSLFL 145 (415)
T ss_pred Hhhcccccc---cchHHHHHHHHHHHHHHHhCCH---HHHHHHHHHHHHHHc
Confidence 776554422 5678889999999999998764 456667777777766
No 48
>PLN00121 histone H3; Provisional
Probab=63.36 E-value=4.6 Score=36.60 Aligned_cols=36 Identities=28% Similarity=0.330 Sum_probs=30.5
Q ss_pred chhhhhhhhhhhhcccCCccCChhHHHHHHHhcCCC
Q psy11212 120 SAQKLESISFKCMSHSKRVKLLPVDIDDALNMKNIE 155 (448)
Q Consensus 120 ~~q~l~~~A~Kfm~hskR~kLt~~DIn~AL~~~niE 155 (448)
+.-.+...|..++.|+||-+|++.||..|.+.++.+
T Consensus 100 yLv~lfed~~lca~HakRVTl~~kD~~L~~rirg~~ 135 (136)
T PLN00121 100 YLVGLFEDTNLCAIHAKRVTIMPKDIQLARRIRGER 135 (136)
T ss_pred HHHHHHhhhHHHHHHhcceecchhhHHHHHHhcccc
Confidence 344566778899999999999999999999988753
No 49
>KOG0212|consensus
Probab=63.32 E-value=36 Score=37.79 Aligned_cols=79 Identities=23% Similarity=0.289 Sum_probs=64.0
Q ss_pred HHHHHHHHHHHHHhcCCCcccccchhhHHHHHHHHhhccccCCCCCCCcchHHHHHHHHHHHHHHHHhcCCCcchHHHHH
Q psy11212 338 ALLIYLMRMVKALLDNQSLYLEKYLHELLPSILSCIVSKQLCGRPEADNHWALRDFASRLLTQVSKNFNTSTNNIQTRVT 417 (448)
Q Consensus 338 ~~L~~ll~~v~ALl~N~~l~iepYlHqLlp~lLtCll~k~l~~~~~~~~hw~LRd~AA~lL~~I~~~~~~sy~~L~~RI~ 417 (448)
.-|+-+.-++-||-.+. .+|+.+++|++++|.= +..-.+|.||..-|..|++-+......--..|.
T Consensus 62 GgLiGlAA~~iaLg~~~----~~Y~~~iv~Pv~~cf~----------D~d~~vRyyACEsLYNiaKv~k~~v~~~Fn~iF 127 (675)
T KOG0212|consen 62 GGLIGLAAVAIALGIKD----AGYLEKIVPPVLNCFS----------DQDSQVRYYACESLYNIAKVAKGEVLVYFNEIF 127 (675)
T ss_pred chHHHHHHHHHHhcccc----HHHHHHhhHHHHHhcc----------CccceeeeHhHHHHHHHHHHhccCcccchHHHH
Confidence 55666666666665443 3499999999999982 334478999999999999999999888899999
Q ss_pred HHHHHHhcCCCCC
Q psy11212 418 RLLSNAINDPKIN 430 (448)
Q Consensus 418 ~tl~k~l~d~~~~ 430 (448)
..+.|...|++.+
T Consensus 128 dvL~klsaDsd~~ 140 (675)
T KOG0212|consen 128 DVLCKLSADSDQN 140 (675)
T ss_pred HHHHHHhcCCccc
Confidence 9999999988643
No 50
>PF10274 ParcG: Parkin co-regulated protein; InterPro: IPR019399 This family of proteins is transcribed anti-sense along the DNA to the Parkin gene product and the two appear to be transcribed under the same promoter. The protein has predicted alpha-helical and beta-sheet domains which suggest its function is in the ubiquitin/proteasome system []. Mutations in parkin are the genetic cause of early-onset and autosomal recessive juvenile parkinsonism.
Probab=62.22 E-value=59 Score=31.01 Aligned_cols=93 Identities=15% Similarity=0.240 Sum_probs=64.3
Q ss_pred CCCcccchhhHHHHHHhhhhhccccCCHHHHHHHHHHHHHH-hcCCCc--ccccchhhHHHHHHHHhhcccc--CCCCCC
Q psy11212 310 DPGLHEMLPRMCTFIAEGVKVNVVQNNLALLIYLMRMVKAL-LDNQSL--YLEKYLHELLPSILSCIVSKQL--CGRPEA 384 (448)
Q Consensus 310 D~gL~qLlPyfv~FI~e~V~~nl~~~nl~~L~~ll~~v~AL-l~N~~l--~iepYlHqLlp~lLtCll~k~l--~~~~~~ 384 (448)
...+-+++|-++.-+-.. +..+|.++....|++++.| ...+.+ .|-||+-||+|+ +....++.. +..-+.
T Consensus 71 ~~kilPvlPqLI~plk~A----L~tr~~~V~~~~L~~Lq~Lv~~~~~vG~aLvPyyrqLLp~-ln~f~~k~~n~gd~i~y 145 (183)
T PF10274_consen 71 GEKILPVLPQLIIPLKRA----LNTRDPEVFCATLKALQQLVTSSDMVGEALVPYYRQLLPV-LNLFKNKNVNLGDGIDY 145 (183)
T ss_pred hhHHHHHHHHHHHHHHHH----HhCCCHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHH-HHHHHhcccCCCccccc
Confidence 334666666666666555 4458999999999999998 556666 599999999996 554444442 211112
Q ss_pred CcchHHHHHHHHHHHHHHHHhcC
Q psy11212 385 DNHWALRDFASRLLTQVSKNFNT 407 (448)
Q Consensus 385 ~~hw~LRd~AA~lL~~I~~~~~~ 407 (448)
...=.++|.-..+|..+.+.-|.
T Consensus 146 ~~~~~~~dlI~etL~~lE~~GG~ 168 (183)
T PF10274_consen 146 RKRKNLGDLIQETLELLERNGGP 168 (183)
T ss_pred ccccchhHHHHHHHHHHHHhcCh
Confidence 34557899999999888777665
No 51
>PF14500 MMS19_N: Dos2-interacting transcription regulator of RNA-Pol-II
Probab=60.36 E-value=62 Score=32.32 Aligned_cols=127 Identities=16% Similarity=0.118 Sum_probs=81.2
Q ss_pred HHHhcCCCHHHHHHHHHhhh-------cCCCcccchhhHHHHHHhhhhhccccCCHHHHHHHHHHHHHHhcCCCcccccc
Q psy11212 289 TEACVGSDEGRRFEALQSLA-------TDPGLHEMLPRMCTFIAEGVKVNVVQNNLALLIYLMRMVKALLDNQSLYLEKY 361 (448)
Q Consensus 289 t~al~~~~e~~r~~aL~sL~-------tD~gL~qLlPyfv~FI~e~V~~nl~~~nl~~L~~ll~~v~ALl~N~~l~iepY 361 (448)
-+.+.++|+..|.+|+..|. .|.=-.+=+-.++.|+.+.+ .|...+...++-+.||+..+++.-+-
T Consensus 5 g~~Ltsed~~~R~ka~~~Ls~vL~~lp~~~L~~~ev~~L~~F~~~rl------~D~~~~~~~l~gl~~L~~~~~~~~~~- 77 (262)
T PF14500_consen 5 GEYLTSEDPIIRAKALELLSEVLERLPPDFLSRQEVQVLLDFFCSRL------DDHACVQPALKGLLALVKMKNFSPES- 77 (262)
T ss_pred hhhhCCCCHHHHHHHHHHHHHHHHhCCHhhccHHHHHHHHHHHHHHh------ccHhhHHHHHHHHHHHHhCcCCChhh-
Confidence 34566778888877754444 33333444667778887764 46666777788999999877754443
Q ss_pred hhhHHHHHHHHhhccccCCCCCCCcchHHHHHHHHHHHHHHHHhcCCCcchHHHHHHHHHHHhcCCCCC
Q psy11212 362 LHELLPSILSCIVSKQLCGRPEADNHWALRDFASRLLTQVSKNFNTSTNNIQTRVTRLLSNAINDPKIN 430 (448)
Q Consensus 362 lHqLlp~lLtCll~k~l~~~~~~~~hw~LRd~AA~lL~~I~~~~~~sy~~L~~RI~~tl~k~l~d~~~~ 430 (448)
...++-.+...+=-+ ..--.-|...=++|..+..+|...-.++-.-.+..+.++.-.++-|
T Consensus 78 ~~~i~~~l~~~~~~q--------~~~q~~R~~~~~ll~~l~~~~~~~l~~~~~~fv~~~i~~~~gEkDP 138 (262)
T PF14500_consen 78 AVKILRSLFQNVDVQ--------SLPQSTRYAVYQLLDSLLENHREALQSMGDDFVYGFIQLIDGEKDP 138 (262)
T ss_pred HHHHHHHHHHhCChh--------hhhHHHHHHHHHHHHHHHHHhHHHHHhchhHHHHHHHHHhccCCCH
Confidence 444444443322111 1223568888889999999887766666677777777777665543
No 52
>PF15630 CENP-S: Kinetochore component CENP-S; PDB: 4DRA_C 4DRB_H 3V9R_C.
Probab=59.43 E-value=3.4 Score=33.77 Aligned_cols=22 Identities=14% Similarity=0.015 Sum_probs=16.7
Q ss_pred hhhhcccCCccCChhHHHHHHH
Q psy11212 129 FKCMSHSKRVKLLPVDIDDALN 150 (448)
Q Consensus 129 ~Kfm~hskR~kLt~~DIn~AL~ 150 (448)
.-|.+|.||++.+++||-...|
T Consensus 50 e~FAkHA~R~tI~~dDV~Ll~R 71 (76)
T PF15630_consen 50 EAFAKHAGRSTINMDDVKLLAR 71 (76)
T ss_dssp HHHHHHTT-SEE-HHHHHHHTT
T ss_pred HHHHHhcCCCeecHHHHHHHhh
Confidence 3599999999999999976554
No 53
>PRK09687 putative lyase; Provisional
Probab=59.37 E-value=1.2e+02 Score=30.48 Aligned_cols=53 Identities=19% Similarity=0.278 Sum_probs=41.6
Q ss_pred CcchHHHHHHHHHHHHHHHHhcCCCcchHHHHHHHHHHHhcCCCCCccchhhhhhhhcccCCC
Q psy11212 385 DNHWALRDFASRLLTQVSKNFNTSTNNIQTRVTRLLSNAINDPKINFPSLYGAIAGEGLVTER 447 (448)
Q Consensus 385 ~~hw~LRd~AA~lL~~I~~~~~~sy~~L~~RI~~tl~k~l~d~~~~l~t~YGAl~GL~alG~~ 447 (448)
++.|.+|..|+.-|+.+ .++. +.+...|.++|.|+ ...+.-+|+.||..+|..
T Consensus 170 d~~~~VR~~A~~aLg~~--~~~~------~~~~~~L~~~L~D~--~~~VR~~A~~aLg~~~~~ 222 (280)
T PRK09687 170 DPNGDVRNWAAFALNSN--KYDN------PDIREAFVAMLQDK--NEEIRIEAIIGLALRKDK 222 (280)
T ss_pred CCCHHHHHHHHHHHhcC--CCCC------HHHHHHHHHHhcCC--ChHHHHHHHHHHHccCCh
Confidence 46778999999999998 2222 36777888888877 567899999999998864
No 54
>KOG2023|consensus
Probab=55.63 E-value=66 Score=36.59 Aligned_cols=144 Identities=18% Similarity=0.305 Sum_probs=92.6
Q ss_pred cccccchHHHHHHHHHHHHHhc----CCCHHHHHH---HHHhhhcCCCcccchhhHHHHHHhhhhhc--cccCCHHHHHH
Q psy11212 272 LATHELSVEQQLYYKEITEACV----GSDEGRRFE---ALQSLATDPGLHEMLPRMCTFIAEGVKVN--VVQNNLALLIY 342 (448)
Q Consensus 272 ~~~h~LSkElQlyf~~It~al~----~~~e~~r~~---aL~sL~tD~gL~qLlPyfv~FI~e~V~~n--l~~~nl~~L~~ 342 (448)
-.+++++.+.+.||+.+.+.+. .++....++ |++.|+.+.| ..|+||+-+-+..-+... +..+|+-+|.
T Consensus 461 ys~wv~~~~~~~~f~pvL~~ll~~llD~NK~VQEAAcsAfAtleE~A~-~eLVp~l~~IL~~l~~af~kYQ~KNLlILY- 538 (885)
T KOG2023|consen 461 YSKWVVQDSRDEYFKPVLEGLLRRLLDSNKKVQEAACSAFATLEEEAG-EELVPYLEYILDQLVFAFGKYQKKNLLILY- 538 (885)
T ss_pred hhhhHhcCChHhhhHHHHHHHHHHHhcccHHHHHHHHHHHHHHHHhcc-chhHHHHHHHHHHHHHHHHHHhhcceehHH-
Confidence 3456788889999999776654 455554554 5678888877 579998776655444331 2235665543
Q ss_pred HHHHHHHHhcCC--CcccccchhhHHHHHHHHhhccccCCCCCCCcchHHHHHHHHHHHHHHHHhcCCCcchHHHHHHHH
Q psy11212 343 LMRMVKALLDNQ--SLYLEKYLHELLPSILSCIVSKQLCGRPEADNHWALRDFASRLLTQVSKNFNTSTNNIQTRVTRLL 420 (448)
Q Consensus 343 ll~~v~ALl~N~--~l~iepYlHqLlp~lLtCll~k~l~~~~~~~~hw~LRd~AA~lL~~I~~~~~~sy~~L~~RI~~tl 420 (448)
..+..|.+-- +|+=--|+.-|||+++.=- ..++ .+..+-|-|=+.=+++-..+-..|-.--.-...|.++.+
T Consensus 539 --DAIgtlAdsvg~~Ln~~~YiqiLmPPLi~KW--~~ls--d~DKdLfPLLEClSsia~AL~~gF~P~~~~Vy~Rc~~il 612 (885)
T KOG2023|consen 539 --DAIGTLADSVGHALNKPAYIQILMPPLIEKW--ELLS--DSDKDLFPLLECLSSIASALGVGFLPYAQPVYQRCFRIL 612 (885)
T ss_pred --HHHHHHHHHHHHhcCcHHHHHHhccHHHHHH--HhcC--cccchHHHHHHHHHHHHHHHhccccccCHHHHHHHHHHH
Confidence 3344444433 4555568999999988543 1122 124678999888777777777777665566777777766
Q ss_pred HHH
Q psy11212 421 SNA 423 (448)
Q Consensus 421 ~k~ 423 (448)
.+.
T Consensus 613 ~~t 615 (885)
T KOG2023|consen 613 QKT 615 (885)
T ss_pred HHH
Confidence 644
No 55
>smart00427 H2B Histone H2B.
Probab=55.43 E-value=10 Score=31.97 Aligned_cols=31 Identities=16% Similarity=0.233 Sum_probs=27.8
Q ss_pred hhhhhhhhhhhcccCCccCChhHHHHHHHhc
Q psy11212 122 QKLESISFKCMSHSKRVKLLPVDIDDALNMK 152 (448)
Q Consensus 122 q~l~~~A~Kfm~hskR~kLt~~DIn~AL~~~ 152 (448)
.++-.+|.+..+..||.+||..+|..|.|.+
T Consensus 37 erIa~EAs~L~~~nkr~TltsreIqtAvrl~ 67 (89)
T smart00427 37 ERIAAEASKLARYNKKSTLSSREIQTAVRLI 67 (89)
T ss_pred HHHHHHHHHHHhccCCCcCCHHHHHHHHHHH
Confidence 3567789999999999999999999999875
No 56
>KOG3334|consensus
Probab=53.42 E-value=12 Score=34.15 Aligned_cols=37 Identities=16% Similarity=0.141 Sum_probs=33.6
Q ss_pred hhhhhhhhhhhhcccCCccCChhHHHHHHHhcCCCCC
Q psy11212 121 AQKLESISFKCMSHSKRVKLLPVDIDDALNMKNIEPI 157 (448)
Q Consensus 121 ~q~l~~~A~Kfm~hskR~kLt~~DIn~AL~~~niEPL 157 (448)
++.++..|+-|-+|+|+..+.++||..|..++..--.
T Consensus 47 tt~vL~DA~vys~HA~ka~i~~eDVrlA~~~~~~~sf 83 (148)
T KOG3334|consen 47 TTTVLDDAKVYSSHAKKATIDAEDVRLAIQMRVDHSF 83 (148)
T ss_pred HHHHHHHHHHHHHhccCCCCcHHHHHHHHHHHhcccc
Confidence 8899999999999999999999999999999865443
No 57
>PF01602 Adaptin_N: Adaptin N terminal region; InterPro: IPR002553 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. Clathrin coats contain both clathrin and adaptor complexes that link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. The two major types of clathrin adaptor complexes are the heterotetrameric adaptor protein (AP) complexes, and the monomeric GGA (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) adaptors []. All AP complexes are heterotetramers composed of two large subunits (adaptins), a medium subunit (mu) and a small subunit (sigma). Each subunit has a specific function. Adaptin subunits recognise and bind to clathrin through their hinge region (clathrin box), and recruit accessory proteins that modulate AP function through their C-terminal appendage domains. By contrast, GGAs are monomers composed of four domains, which have functions similar to AP subunits: an N-terminal VHS (Vps27p/Hrs/Stam) domain, a GAT (GGA and Tom1) domain, a hinge region, and a C-terminal GAE (gamma-adaptin ear) domain. The GAE domain is similar to the AP gamma-adaptin ear domain, being responsible for the recruitment of accessory proteins that regulate clathrin-mediated endocytosis []. While clathrin mediates endocytic protein transport from ER to Golgi, coatomers (COPI, COPII) primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the N-terminal domain of various adaptins from different AP clathrin adaptor complexes (including AP1, AP2, AP3 and AP4), and from the beta and gamma subunits of various coatomer (COP) adaptors. This domain has a 2-layer alpha/alpha fold that forms a right-handed superhelix, and is a member of the ARM repeat superfamily []. The N-terminal region of the various AP adaptor proteins share strong sequence identity; by contrast, the C-terminal domains of different adaptins share similar structural folds, but have little sequence identity []. It has been proposed that the N-terminal domain interacts with another uniform component of the coated vesicles. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 1W63_C 2JKR_A 2JKT_A 2XA7_A 2VGL_B 3TJZ_E.
Probab=52.88 E-value=49 Score=35.18 Aligned_cols=83 Identities=11% Similarity=0.241 Sum_probs=52.2
Q ss_pred HHHHHHHHHHhcCCCHHHHHHHH----HhhhcCCCcc-c-chhhHHHHHHhhhhhccccCCHHHHHHHHHHHHHHhcCCC
Q psy11212 282 QLYYKEITEACVGSDEGRRFEAL----QSLATDPGLH-E-MLPRMCTFIAEGVKVNVVQNNLALLIYLMRMVKALLDNQS 355 (448)
Q Consensus 282 Qlyf~~It~al~~~~e~~r~~aL----~sL~tD~gL~-q-LlPyfv~FI~e~V~~nl~~~nl~~L~~ll~~v~ALl~N~~ 355 (448)
+..+..|.+.+.++++..|++|+ +..+.+|.+- . ++|.+.+.+ ..+|..++...+.++..+-.|+.
T Consensus 113 ~~l~~~v~~ll~~~~~~VRk~A~~~l~~i~~~~p~~~~~~~~~~l~~lL--------~d~~~~V~~~a~~~l~~i~~~~~ 184 (526)
T PF01602_consen 113 EPLIPDVIKLLSDPSPYVRKKAALALLKIYRKDPDLVEDELIPKLKQLL--------SDKDPSVVSAALSLLSEIKCNDD 184 (526)
T ss_dssp HHHHHHHHHHHHSSSHHHHHHHHHHHHHHHHHCHCCHHGGHHHHHHHHT--------THSSHHHHHHHHHHHHHHHCTHH
T ss_pred hHHHHHHHHHhcCCchHHHHHHHHHHHHHhccCHHHHHHHHHHHHhhhc--------cCCcchhHHHHHHHHHHHccCcc
Confidence 44567788888888887777554 4446666652 2 344444444 23567777777777777756666
Q ss_pred cccccchhhHHHHHHHHh
Q psy11212 356 LYLEKYLHELLPSILSCI 373 (448)
Q Consensus 356 l~iepYlHqLlp~lLtCl 373 (448)
.+. ..+|.++..+.+++
T Consensus 185 ~~~-~~~~~~~~~L~~~l 201 (526)
T PF01602_consen 185 SYK-SLIPKLIRILCQLL 201 (526)
T ss_dssp HHT-THHHHHHHHHHHHH
T ss_pred hhh-hhHHHHHHHhhhcc
Confidence 655 66777777666555
No 58
>PLN00161 histone H3; Provisional
Probab=52.06 E-value=12 Score=33.84 Aligned_cols=37 Identities=24% Similarity=0.232 Sum_probs=31.0
Q ss_pred hhhhhhhhhhhhcccCCccCChhHHHHHHHhcCCCCC
Q psy11212 121 AQKLESISFKCMSHSKRVKLLPVDIDDALNMKNIEPI 157 (448)
Q Consensus 121 ~q~l~~~A~Kfm~hskR~kLt~~DIn~AL~~~niEPL 157 (448)
.-.+...|.-+..|+||-+|++.||..|.+.++..+-
T Consensus 95 LV~lFeda~lcaiHAkRVTlm~kDm~La~rirg~~~~ 131 (135)
T PLN00161 95 LVHLFEDCNLCAIHAKRVTIMPKDMQLARRIRGPIYG 131 (135)
T ss_pred HHHHHHHHHHHHHHhcCcccchhhHHHHHHhcccccc
Confidence 3445667888899999999999999999999987643
No 59
>PF13513 HEAT_EZ: HEAT-like repeat; PDB: 2Z5J_A 2OT8_B 2Z5O_A 2H4M_A 2QMR_A 1QBK_B 2Z5M_A 2Z5K_A 2Z5N_A 1GCJ_B ....
Probab=51.16 E-value=17 Score=26.58 Aligned_cols=41 Identities=27% Similarity=0.426 Sum_probs=29.5
Q ss_pred hcCCCcccccchhhHHHHHHHHhhccccCCCCCCCcchHHHHHHHHHHHHH
Q psy11212 351 LDNQSLYLEKYLHELLPSILSCIVSKQLCGRPEADNHWALRDFASRLLTQV 401 (448)
Q Consensus 351 l~N~~l~iepYlHqLlp~lLtCll~k~l~~~~~~~~hw~LRd~AA~lL~~I 401 (448)
..+..-.+++|+.+++|.++.++- +++-.+|..|+..|+.|
T Consensus 15 ~~~~~~~~~~~~~~~~~~L~~~L~----------d~~~~VR~~A~~aLg~l 55 (55)
T PF13513_consen 15 AEGCPELLQPYLPELLPALIPLLQ----------DDDDSVRAAAAWALGNL 55 (55)
T ss_dssp TTTTHHHHHHHHHHHHHHHHHHTT----------SSSHHHHHHHHHHHHCH
T ss_pred hcccHHHHHHHHHHHHHHHHHHHc----------CCCHHHHHHHHHHHhcC
Confidence 344445678888999998887761 22238999999998865
No 60
>PTZ00429 beta-adaptin; Provisional
Probab=50.93 E-value=1.8e+02 Score=33.69 Aligned_cols=143 Identities=13% Similarity=0.073 Sum_probs=76.9
Q ss_pred HHHHHHHHHHHHhcCCCHHHHHHHHHhhhcCCCcccchhhHHHHHHhhhhhccccCCHHHHHHHHHHHHHHhcCCCcccc
Q psy11212 280 EQQLYYKEITEACVGSDEGRRFEALQSLATDPGLHEMLPRMCTFIAEGVKVNVVQNNLALLIYLMRMVKALLDNQSLYLE 359 (448)
Q Consensus 280 ElQlyf~~It~al~~~~e~~r~~aL~sL~tD~gL~qLlPyfv~FI~e~V~~nl~~~nl~~L~~ll~~v~ALl~N~~l~ie 359 (448)
+.-.+|-.|+..+.++|.+.|+-++--|..-... -|-+..-..+.+.+-+..+|- .++.+ .++.|. +|.+.
T Consensus 65 DvS~LF~dVvk~~~S~d~elKKLvYLYL~~ya~~---~pelalLaINtl~KDl~d~Np-~IRaL--ALRtLs---~Ir~~ 135 (746)
T PTZ00429 65 DVSYLFVDVVKLAPSTDLELKKLVYLYVLSTARL---QPEKALLAVNTFLQDTTNSSP-VVRAL--AVRTMM---CIRVS 135 (746)
T ss_pred CchHHHHHHHHHhCCCCHHHHHHHHHHHHHHccc---ChHHHHHHHHHHHHHcCCCCH-HHHHH--HHHHHH---cCCcH
Confidence 4445566666655555555555443222222111 133222223333333322332 22221 222222 25556
Q ss_pred cchhhHHHHHHHHhhccccCCCCCCCcchHHHHHHHHHHHHHHHHhcCCCcchHHHHHHHHHHHhcCCCCCccchhhhhh
Q psy11212 360 KYLHELLPSILSCIVSKQLCGRPEADNHWALRDFASRLLTQVSKNFNTSTNNIQTRVTRLLSNAINDPKINFPSLYGAIA 439 (448)
Q Consensus 360 pYlHqLlp~lLtCll~k~l~~~~~~~~hw~LRd~AA~lL~~I~~~~~~sy~~L~~RI~~tl~k~l~d~~~~l~t~YGAl~ 439 (448)
-.+-.++++|..|+.. .+=.+|.-||-.+.+|.+.+.+... ..++...+.+.|.| ++..+.+.|+.
T Consensus 136 ~i~e~l~~~lkk~L~D----------~~pYVRKtAalai~Kly~~~pelv~--~~~~~~~L~~LL~D--~dp~Vv~nAl~ 201 (746)
T PTZ00429 136 SVLEYTLEPLRRAVAD----------PDPYVRKTAAMGLGKLFHDDMQLFY--QQDFKKDLVELLND--NNPVVASNAAA 201 (746)
T ss_pred HHHHHHHHHHHHHhcC----------CCHHHHHHHHHHHHHHHhhCccccc--ccchHHHHHHHhcC--CCccHHHHHHH
Confidence 6667788888888832 3348999999999999887654321 22344445554554 47788899988
Q ss_pred hhcccC
Q psy11212 440 GEGLVT 445 (448)
Q Consensus 440 GL~alG 445 (448)
.|..+.
T Consensus 202 aL~eI~ 207 (746)
T PTZ00429 202 IVCEVN 207 (746)
T ss_pred HHHHHH
Confidence 776653
No 61
>KOG1851|consensus
Probab=50.06 E-value=51 Score=40.57 Aligned_cols=113 Identities=12% Similarity=0.132 Sum_probs=73.6
Q ss_pred cchhhHHHHHHhhhhhccccCCHHHHHHHHHHHHHHh-cCCCcccccchhhHHHHHHHHhhccccCCCCCCCcchHHHHH
Q psy11212 315 EMLPRMCTFIAEGVKVNVVQNNLALLIYLMRMVKALL-DNQSLYLEKYLHELLPSILSCIVSKQLCGRPEADNHWALRDF 393 (448)
Q Consensus 315 qLlPyfv~FI~e~V~~nl~~~nl~~L~~ll~~v~ALl-~N~~l~iepYlHqLlp~lLtCll~k~l~~~~~~~~hw~LRd~ 393 (448)
-+.|-|++-+...+..+ .+...=...+.++++.+ .|.+++-+.-...+--.+. |++. +++.+.|+.
T Consensus 1522 ~l~~e~l~~l~~~~~~~---~tw~vr~avl~fl~~~vy~n~Fv~~~~~r~dI~~l~~-s~l~---------D~~i~vre~ 1588 (1710)
T KOG1851|consen 1522 HLQPEFLRDLKMLTADS---STWRVRSAVLKFLQTVVYSNIFVSQELRRDDIRKLLE-SLLN---------DDQIEVREE 1588 (1710)
T ss_pred hhHHHHHHHHHHHhccc---chHHHHHHHHHHHHHHHHHHhhcccchhHHHHHHHHH-HHHc---------chHHHHHHH
Confidence 45677777776433322 34444444777777654 4444444444444444333 2222 466789999
Q ss_pred HHHHHHHH-HHHhcCCCcchHHHHHHHHHHHhcCCCCCccchhhhhhhhccc
Q psy11212 394 ASRLLTQV-SKNFNTSTNNIQTRVTRLLSNAINDPKINFPSLYGAIAGEGLV 444 (448)
Q Consensus 394 AA~lL~~I-~~~~~~sy~~L~~RI~~tl~k~l~d~~~~l~t~YGAl~GL~al 444 (448)
||.+|+-+ |..|....+ +..+++.+.+..+.++..-.-||+-||+|+
T Consensus 1589 Aa~~Lsgl~~~s~~~~~~----~k~d~~~~~~~s~s~~~i~~HgavlgLgA~ 1636 (1710)
T KOG1851|consen 1589 AAKCLSGLLQGSKFQFVS----DKRDTTSNILQSKSKDEIKAHGAVLGLGAI 1636 (1710)
T ss_pred HHHHHHHHHhccccccch----HhhhhhhhhhhhcchHHHHhhhhHHHHHHH
Confidence 99999966 444444333 788888899998888999999999999885
No 62
>cd00074 H2A Histone 2A; H2A is a subunit of the nucleosome. The nucleosome is an octamer containing two H2A, H2B, H3, and H4 subunits. The H2A subunit performs essential roles in maintaining structural integrity of the nucleosome, chromatin condensation, and binding of specific chromatin-associated proteins.
Probab=48.38 E-value=37 Score=29.90 Aligned_cols=24 Identities=13% Similarity=0.196 Sum_probs=22.1
Q ss_pred hhhhhcccCCccCChhHHHHHHHh
Q psy11212 128 SFKCMSHSKRVKLLPVDIDDALNM 151 (448)
Q Consensus 128 A~Kfm~hskR~kLt~~DIn~AL~~ 151 (448)
|....++.||++.|+.||+.|.+.
T Consensus 62 A~n~ak~~k~krItp~hi~lAi~n 85 (115)
T cd00074 62 AGNAARDNKKKRITPRHLQLAVRN 85 (115)
T ss_pred HHHHHHHcCCCeEcHHHHHHHHhc
Confidence 888889999999999999999875
No 63
>KOG1242|consensus
Probab=47.17 E-value=1.3e+02 Score=33.50 Aligned_cols=137 Identities=18% Similarity=0.325 Sum_probs=83.3
Q ss_pred ccchHHHHHHHHHHHHHhcCCCHHHHHHHHHh------hhcCCCcccchhhHHHHHHhhhhh---c------------cc
Q psy11212 275 HELSVEQQLYYKEITEACVGSDEGRRFEALQS------LATDPGLHEMLPRMCTFIAEGVKV---N------------VV 333 (448)
Q Consensus 275 h~LSkElQlyf~~It~al~~~~e~~r~~aL~s------L~tD~gL~qLlPyfv~FI~e~V~~---n------------l~ 333 (448)
.+||-=++.---.+++.+.++..+.|+++.++ .-.++-|+.++|-++.=+++-... . +.
T Consensus 287 ~qLs~~lp~iiP~lsevl~DT~~evr~a~~~~l~~~~svidN~dI~~~ip~Lld~l~dp~~~~~e~~~~L~~ttFV~~V~ 366 (569)
T KOG1242|consen 287 KQLSLCLPDLIPVLSEVLWDTKPEVRKAGIETLLKFGSVIDNPDIQKIIPTLLDALADPSCYTPECLDSLGATTFVAEVD 366 (569)
T ss_pred HHHHHHHhHhhHHHHHHHccCCHHHHHHHHHHHHHHHHhhccHHHHHHHHHHHHHhcCcccchHHHHHhhcceeeeeeec
Confidence 34566666666777777777776666654433 345666777777777666544310 0 00
Q ss_pred cCCHHHHHHHH------------HHHHHHhc------CCCcccccchhhHHHHHHHHhhccccCCCCCCCcchHHHHHHH
Q psy11212 334 QNNLALLIYLM------------RMVKALLD------NQSLYLEKYLHELLPSILSCIVSKQLCGRPEADNHWALRDFAS 395 (448)
Q Consensus 334 ~~nl~~L~~ll------------~~v~ALl~------N~~l~iepYlHqLlp~lLtCll~k~l~~~~~~~~hw~LRd~AA 395 (448)
-..++++.-++ |++-.+++ +.-..+.||+-+|+|- +-|.+..- .| +.|..||
T Consensus 367 ~psLalmvpiL~R~l~eRst~~kr~t~~IidNm~~LveDp~~lapfl~~Llp~-lk~~~~d~---~P------EvR~vaa 436 (569)
T KOG1242|consen 367 APSLALMVPILKRGLAERSTSIKRKTAIIIDNMCKLVEDPKDLAPFLPSLLPG-LKENLDDA---VP------EVRAVAA 436 (569)
T ss_pred chhHHHHHHHHHHHHhhccchhhhhHHHHHHHHHHhhcCHHHHhhhHHHHhhH-HHHHhcCC---Ch------hHHHHHH
Confidence 11122211111 22222333 3446789999999998 55655432 23 6899999
Q ss_pred HHHHHHHHHhcCCC-cchHHHHHHHHH
Q psy11212 396 RLLTQVSKNFNTST-NNIQTRVTRLLS 421 (448)
Q Consensus 396 ~lL~~I~~~~~~sy-~~L~~RI~~tl~ 421 (448)
+-|+.+.++-++.+ ..+.|.++.++.
T Consensus 437 rAL~~l~e~~g~~~f~d~~p~l~e~~~ 463 (569)
T KOG1242|consen 437 RALGALLERLGEVSFDDLIPELSETLT 463 (569)
T ss_pred HHHHHHHHHHHhhcccccccHHHHhhc
Confidence 99999999999877 777777766653
No 64
>PF10508 Proteasom_PSMB: Proteasome non-ATPase 26S subunit; InterPro: IPR019538 The 26S proteasome is an enzymatic complex that degrades ubiquitinated proteins in eukaryotic cells. 26S proteasome non-ATPase regulatory subunit 5 is one of a number of chaperones that are involved in the assembly of the proteasome. The chaperones dissociate before 26S proteasome formation is complete [].; GO: 0044183 protein binding involved in protein folding
Probab=45.66 E-value=41 Score=36.64 Aligned_cols=88 Identities=16% Similarity=0.306 Sum_probs=59.8
Q ss_pred HHHHhhhcCCCcccchhhHHHHHHhhhhhccccCCHHHHHHHHHHHHHHhcCCCcccccch-hhHHHHHHHHhhccccCC
Q psy11212 302 EALQSLATDPGLHEMLPRMCTFIAEGVKVNVVQNNLALLIYLMRMVKALLDNQSLYLEKYL-HELLPSILSCIVSKQLCG 380 (448)
Q Consensus 302 ~aL~sL~tD~gL~qLlPyfv~FI~e~V~~nl~~~nl~~L~~ll~~v~ALl~N~~l~iepYl-HqLlp~lLtCll~k~l~~ 380 (448)
.+|..+=.+-....+.|-+..|+..++.+ .+-.+-...++.+..++.++.-..+.-. +.+++.|+.|+-.+
T Consensus 60 ~iL~~~l~~~~~~~l~~~~~~~L~~gL~h----~~~~Vr~l~l~~l~~~~~~~~~~~~~~~~~~l~~~i~~~L~~~---- 131 (503)
T PF10508_consen 60 DILKRLLSALSPDSLLPQYQPFLQRGLTH----PSPKVRRLALKQLGRIARHSEGAAQLLVDNELLPLIIQCLRDP---- 131 (503)
T ss_pred HHHHHHHhccCHHHHHHHHHHHHHHHhcC----CCHHHHHHHHHHHHHHhcCCHHHHHHhcCccHHHHHHHHHcCC----
Confidence 34666666667777788889999888764 3333444446666666666665444433 67899999999443
Q ss_pred CCCCCcchHHHHHHHHHHHHHHH
Q psy11212 381 RPEADNHWALRDFASRLLTQVSK 403 (448)
Q Consensus 381 ~~~~~~hw~LRd~AA~lL~~I~~ 403 (448)
.-.+-..|+++|..|++
T Consensus 132 ------d~~Va~~A~~~L~~l~~ 148 (503)
T PF10508_consen 132 ------DLSVAKAAIKALKKLAS 148 (503)
T ss_pred ------cHHHHHHHHHHHHHHhC
Confidence 23677888888888875
No 65
>PF14911 MMS22L_C: S-phase genomic integrity recombination mediator, C-terminal
Probab=43.47 E-value=97 Score=32.78 Aligned_cols=75 Identities=19% Similarity=0.387 Sum_probs=52.0
Q ss_pred HHHHHHHHhcC----CCcccccchhhHHHHHHHHhhccccCCCCCCCcchHHHHHHHHHHHHHHHHhc-CCCcchHHHHH
Q psy11212 343 LMRMVKALLDN----QSLYLEKYLHELLPSILSCIVSKQLCGRPEADNHWALRDFASRLLTQVSKNFN-TSTNNIQTRVT 417 (448)
Q Consensus 343 ll~~v~ALl~N----~~l~iepYlHqLlp~lLtCll~k~l~~~~~~~~hw~LRd~AA~lL~~I~~~~~-~sy~~L~~RI~ 417 (448)
.+.++.-|+.. +.=.+..-+-.++|++|.|++- ++++-..|..|..++..|.+.+. .+....+..++
T Consensus 231 vL~fl~~Ll~~~~~~~~~~~~~~~~~~lp~lL~c~~~--------v~e~~~~k~~a~e~l~~mv~~~~~~~~~~~~~~l~ 302 (373)
T PF14911_consen 231 VLAFLQQLLKRLQRQNENQILTLLRLVLPSLLECLML--------VNEEPQVKKLATELLQYMVESCQVGSSGEPREQLT 302 (373)
T ss_pred HHHHHHHHHHhcCcccchhHHHHHHHhhHHHHHHHhh--------cCCCcchhHHHHHHHHHHHHcccccCcchHHHHHH
Confidence 34444444444 3444555667889999999976 23456789999999999998888 66677777776
Q ss_pred HHHHHHhc
Q psy11212 418 RLLSNAIN 425 (448)
Q Consensus 418 ~tl~k~l~ 425 (448)
..|.....
T Consensus 303 s~lrsfvq 310 (373)
T PF14911_consen 303 SVLRSFVQ 310 (373)
T ss_pred HHHHHHHH
Confidence 66665443
No 66
>PTZ00463 histone H2B; Provisional
Probab=42.90 E-value=19 Score=31.79 Aligned_cols=30 Identities=17% Similarity=0.245 Sum_probs=27.3
Q ss_pred hhhhhhhhhhcccCCccCChhHHHHHHHhc
Q psy11212 123 KLESISFKCMSHSKRVKLLPVDIDDALNMK 152 (448)
Q Consensus 123 ~l~~~A~Kfm~hskR~kLt~~DIn~AL~~~ 152 (448)
.+-.+|.++++..||.+||..||..|.|++
T Consensus 65 rIA~EAs~La~~nkr~TltsrEIQtAvrLl 94 (117)
T PTZ00463 65 KIATEASRLCKYTRRDTLSSREIQTAIRLV 94 (117)
T ss_pred HHHHHHHHHHHhcCCCcCCHHHHHHHHhhc
Confidence 566789999999999999999999999876
No 67
>PF04648 MF_alpha: Yeast mating factor alpha hormone; InterPro: IPR006742 This repeated sequence,WHWLQLKPGQPMY, characterises the mating factor alpha-1 or alpha-1 mating pheromone [contains: Mating factor alpha].The hormone is excreted into the culture medium by haploid cells of the alpha mating type and acts on cells of the opposite mating type (type A) by binding to a cognate G-protein coupled receptor which is coupled to a downstream signal transduction pathway. It inhibits DNA synthesis in type A cells synchronising them with type alpha, and so mediates the conjugation process.; GO: 0000772 mating pheromone activity, 0019953 sexual reproduction, 0005576 extracellular region
Probab=42.50 E-value=13 Score=20.50 Aligned_cols=11 Identities=45% Similarity=0.951 Sum_probs=9.7
Q ss_pred eEEEeeccCCC
Q psy11212 99 HWLAIEGVQPT 109 (448)
Q Consensus 99 hwlaieg~qp~ 109 (448)
|||.++-=||.
T Consensus 2 hWL~~~~GqP~ 12 (13)
T PF04648_consen 2 HWLRLSPGQPM 12 (13)
T ss_pred cceeccCCCcC
Confidence 99999988885
No 68
>PLN00158 histone H2B; Provisional
Probab=41.46 E-value=22 Score=31.37 Aligned_cols=30 Identities=17% Similarity=0.181 Sum_probs=27.3
Q ss_pred hhhhhhhhhhcccCCccCChhHHHHHHHhc
Q psy11212 123 KLESISFKCMSHSKRVKLLPVDIDDALNMK 152 (448)
Q Consensus 123 ~l~~~A~Kfm~hskR~kLt~~DIn~AL~~~ 152 (448)
.+-.+|.+..+..||.+||..+|..|.+++
T Consensus 64 rIA~EAs~La~~nkr~TltsrEIqtAvrLv 93 (116)
T PLN00158 64 KIATEAGKLARYNKKPTVTSREIQTAVRLI 93 (116)
T ss_pred HHHHHHHHHHhccCCCcCCHHHHHHHHHHh
Confidence 566789999999999999999999999875
No 69
>KOG0870|consensus
Probab=40.63 E-value=17 Score=34.02 Aligned_cols=40 Identities=20% Similarity=0.122 Sum_probs=35.0
Q ss_pred cchhhhhhhhhhhhcccCCccCChhHHHHHHHhcCCCCCc
Q psy11212 119 KSAQKLESISFKCMSHSKRVKLLPVDIDDALNMKNIEPIY 158 (448)
Q Consensus 119 ~~~q~l~~~A~Kfm~hskR~kLt~~DIn~AL~~~niEPLy 158 (448)
-|+..+++.|..|.+..||++++.+||=.||.-+...-+-
T Consensus 45 VFv~~Lts~s~e~A~~q~rKt~sadDVl~aL~Eiefs~f~ 84 (172)
T KOG0870|consen 45 VFVIFLTSVSNEIAKDQKRKTISADDVLKALDEIEFSSFV 84 (172)
T ss_pred HHHHHHHHHHHHHHHhcccCcccHHHHHHHHHHhchHHHh
Confidence 4788889999999999999999999999999988765543
No 70
>KOG1824|consensus
Probab=40.30 E-value=78 Score=37.34 Aligned_cols=107 Identities=16% Similarity=0.226 Sum_probs=74.2
Q ss_pred cccchhhHHHHHHhhhhhccccC---CHHHHHHHHHHHHHHhcCCCcccccchhhHHHHHHHHhhccccCCCCCCCcchH
Q psy11212 313 LHEMLPRMCTFIAEGVKVNVVQN---NLALLIYLMRMVKALLDNQSLYLEKYLHELLPSILSCIVSKQLCGRPEADNHWA 389 (448)
Q Consensus 313 L~qLlPyfv~FI~e~V~~nl~~~---nl~~L~~ll~~v~ALl~N~~l~iepYlHqLlp~lLtCll~k~l~~~~~~~~hw~ 389 (448)
+|+++|-++.-+.-.|..++-+= -+.++-.+.++++-|-.|-.+.-.+|+|.++-+.+.-+-++ +-.-.
T Consensus 514 fhp~~~~Ls~~v~~aV~d~fyKisaEAL~v~~~lvkvirpl~~~~~~d~~~~v~~m~~~tl~rL~a~--------d~Dqe 585 (1233)
T KOG1824|consen 514 FHPHLSALSPPVVAAVGDPFYKISAEALLVCQQLVKVIRPLQPPSSFDASPYVKTMYDCTLQRLKAT--------DSDQE 585 (1233)
T ss_pred cccchhhhhhHHHHHhcCchHhhhHHHHHHHHHHHHHhcccCCCccCCCChhHHHHHHHHHHHHhcc--------cccHH
Confidence 55666655555555544433211 23334446677777777888999999999887777666443 44568
Q ss_pred HHHHHHHHHHHHHHHhcCCCcchHHHHHHHHHHHhcCC
Q psy11212 390 LRDFASRLLTQVSKNFNTSTNNIQTRVTRLLSNAINDP 427 (448)
Q Consensus 390 LRd~AA~lL~~I~~~~~~sy~~L~~RI~~tl~k~l~d~ 427 (448)
+|+.|-.+|++++..|+..-.+=-+|+.-.|.+.|.++
T Consensus 586 VkeraIscmgq~i~~fgD~l~~eL~~~L~il~eRl~nE 623 (1233)
T KOG1824|consen 586 VKERAISCMGQIIANFGDFLGNELPRTLPILLERLGNE 623 (1233)
T ss_pred HHHHHHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhch
Confidence 99999999999999999666666677777777777654
No 71
>KOG1967|consensus
Probab=39.98 E-value=1.5e+02 Score=34.95 Aligned_cols=141 Identities=19% Similarity=0.276 Sum_probs=88.7
Q ss_pred HHHHHHHHHHHhcCC-------CHHHHHHHHHhhhcCCCcccchhhHHHHHHhhhhhccccCCHHHHHHHHHHHHHHhcC
Q psy11212 281 QQLYYKEITEACVGS-------DEGRRFEALQSLATDPGLHEMLPRMCTFIAEGVKVNVVQNNLALLIYLMRMVKALLDN 353 (448)
Q Consensus 281 lQlyf~~It~al~~~-------~e~~r~~aL~sL~tD~gL~qLlPyfv~FI~e~V~~nl~~~nl~~L~~ll~~v~ALl~N 353 (448)
-|.||-.|.-.+++. .....-.||..+=++--.++++|.+-.-+-=-+ ..+...|...=...++.+.-++.=
T Consensus 861 kQRfF~~ivP~l~~~~~t~~~~~K~~yl~~LshVl~~vP~~vllp~~~~LlPLLL-q~Ls~~D~~v~vstl~~i~~~l~~ 939 (1030)
T KOG1967|consen 861 KQRFFCDIVPILVSKFETAPGSQKHNYLEALSHVLTNVPKQVLLPQFPMLLPLLL-QALSMPDVIVRVSTLRTIPMLLTE 939 (1030)
T ss_pred HHHHHHhhHHHHHHHhccCCccchhHHHHHHHHHHhcCCHHhhccchhhHHHHHH-HhcCCCccchhhhHhhhhhHHHHh
Confidence 367788887777642 223456777666666666777777654432111 112234544422344555555544
Q ss_pred CCcccccchhhHHHHHHHHhhccccCCCCCCCcchHHHHHHHHHHHHHHHHhcCCC-cchHHHHHHHHHHHhcCCCC
Q psy11212 354 QSLYLEKYLHELLPSILSCIVSKQLCGRPEADNHWALRDFASRLLTQVSKNFNTST-NNIQTRVTRLLSNAINDPKI 429 (448)
Q Consensus 354 ~~l~iepYlHqLlp~lLtCll~k~l~~~~~~~~hw~LRd~AA~lL~~I~~~~~~sy-~~L~~RI~~tl~k~l~d~~~ 429 (448)
-+=-+..|+.-++|.+|+.- +. + ..+--..|..|=++|..+-++--..| ..-|+.|++.+.+.|-|+++
T Consensus 940 ~~tL~t~~~~Tlvp~lLsls--~~----~-~n~~~~VR~~ALqcL~aL~~~~P~~~l~~fr~~Vl~al~k~LdDkKR 1009 (1030)
T KOG1967|consen 940 SETLQTEHLSTLVPYLLSLS--SD----N-DNNMMVVREDALQCLNALTRRLPTKSLLSFRPLVLRALIKILDDKKR 1009 (1030)
T ss_pred ccccchHHHhHHHHHHHhcC--CC----C-CcchhHHHHHHHHHHHHHhccCCCcccccccHHHHHHhhhccCcHHH
Confidence 44456778889999776532 21 1 12347899999999999999644433 34489999999999999874
No 72
>PF13646 HEAT_2: HEAT repeats; PDB: 1OYZ_A 3FGA_A 2PF4_C 2IAE_A 3B2A_A.
Probab=39.04 E-value=28 Score=27.40 Aligned_cols=50 Identities=18% Similarity=0.155 Sum_probs=33.5
Q ss_pred CcchHHHHHHHHHHHHHHHHhcCCCcchHHHHHHHHHHHhcCCCCCccchhhhhhhhcccCC
Q psy11212 385 DNHWALRDFASRLLTQVSKNFNTSTNNIQTRVTRLLSNAINDPKINFPSLYGAIAGEGLVTE 446 (448)
Q Consensus 385 ~~hw~LRd~AA~lL~~I~~~~~~sy~~L~~RI~~tl~k~l~d~~~~l~t~YGAl~GL~alG~ 446 (448)
+..|.+|..|+.+|+.+. .+++...|.+.+.|+ ...+..=|+..|..+|.
T Consensus 11 ~~~~~vr~~a~~~L~~~~----------~~~~~~~L~~~l~d~--~~~vr~~a~~aL~~i~~ 60 (88)
T PF13646_consen 11 DPDPQVRAEAARALGELG----------DPEAIPALIELLKDE--DPMVRRAAARALGRIGD 60 (88)
T ss_dssp SSSHHHHHHHHHHHHCCT----------HHHHHHHHHHHHTSS--SHHHHHHHHHHHHCCHH
T ss_pred CCCHHHHHHHHHHHHHcC----------CHhHHHHHHHHHcCC--CHHHHHHHHHHHHHhCC
Confidence 456788888888888332 236677777777554 45677777777776653
No 73
>KOG1241|consensus
Probab=38.39 E-value=1.2e+02 Score=34.95 Aligned_cols=114 Identities=15% Similarity=0.190 Sum_probs=77.3
Q ss_pred cchhhHHHHHHhhhhhccccCCHHHHHHHHHHHHHHhcCCC-cccccchhhHHHHHHHHhhccccCCCCCCCcchHHHHH
Q psy11212 315 EMLPRMCTFIAEGVKVNVVQNNLALLIYLMRMVKALLDNQS-LYLEKYLHELLPSILSCIVSKQLCGRPEADNHWALRDF 393 (448)
Q Consensus 315 qLlPyfv~FI~e~V~~nl~~~nl~~L~~ll~~v~ALl~N~~-l~iepYlHqLlp~lLtCll~k~l~~~~~~~~hw~LRd~ 393 (448)
-++|...-||-+.|+. .|+..-..-.-..-+.+.=|. ..+-|++++.+|+++.-+..+ ..-.||-
T Consensus 360 ~Iv~~Vl~Fiee~i~~----pdwr~reaavmAFGSIl~gp~~~~Lt~iV~qalp~ii~lm~D~----------sl~VkdT 425 (859)
T KOG1241|consen 360 DIVPHVLPFIEENIQN----PDWRNREAAVMAFGSILEGPEPDKLTPIVIQALPSIINLMSDP----------SLWVKDT 425 (859)
T ss_pred cchhhhHHHHHHhcCC----cchhhhhHHHHHHHhhhcCCchhhhhHHHhhhhHHHHHHhcCc----------hhhhcch
Confidence 5788888888877653 566655444445567777777 479999999999998877422 1235699
Q ss_pred HHHHHHHHHHHhcCC--CcchHHHHHHHHHHHhcCCCCCccchhhhhhhhc
Q psy11212 394 ASRLLTQVSKNFNTS--TNNIQTRVTRLLSNAINDPKINFPSLYGAIAGEG 442 (448)
Q Consensus 394 AA~lL~~I~~~~~~s--y~~L~~RI~~tl~k~l~d~~~~l~t~YGAl~GL~ 442 (448)
+|-.+++||+-.... ...+-+-++..|..-|.|.-+--..-=+|+.||+
T Consensus 426 aAwtlgrI~d~l~e~~~n~~~l~~~l~~l~~gL~DePrva~N~CWAf~~La 476 (859)
T KOG1241|consen 426 AAWTLGRIADFLPEAIINQELLQSKLSALLEGLNDEPRVASNVCWAFISLA 476 (859)
T ss_pred HHHHHHHHHhhchhhcccHhhhhHHHHHHHHHhhhCchHHHHHHHHHHHHH
Confidence 999999999876532 2334455666777777776544445555555554
No 74
>PF03224 V-ATPase_H_N: V-ATPase subunit H; InterPro: IPR004908 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include: F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. V-ATPases (also known as V1V0-ATPase or vacuolar ATPase) (3.6.3.14 from EC) are found in the eukaryotic endomembrane system, and in the plasma membrane of prokaryotes and certain specialised eukaryotic cells. V-ATPases hydrolyse ATP to drive a proton pump, and are involved in a variety of vital intra- and inter-cellular processes such as receptor mediated endocytosis, protein trafficking, active transport of metabolites, homeostasis and neurotransmitter release []. V-ATPases are composed of two linked complexes: the V1 complex (subunits A-H) contains the catalytic core that hydrolyses ATP, while the V0 complex (subunits a, c, c', c'', d) forms the membrane-spanning pore. V-ATPases may have an additional role in membrane fusion through binding to t-SNARE proteins []. This entry represents subunit H (also known as Vma13p) found in the V1 complex of V-ATPases. This subunit has a regulatory function, being responsible for activating ATPase activity and coupling ATPase activity to proton flow []. The yeast enzyme contains five motifs similar to the HEAT or Armadillo repeats seen in the importins, and can be divided into two distinct domains: a large N-terminal domain consisting of stacked alpha helices, and a smaller C-terminal alpha-helical domain with a similar superhelical topology to an armadillo repeat []. More information about this protein can be found at Protein of the Month: ATP Synthases [].; GO: 0046961 proton-transporting ATPase activity, rotational mechanism, 0015991 ATP hydrolysis coupled proton transport, 0000221 vacuolar proton-transporting V-type ATPase, V1 domain; PDB: 1HO8_A.
Probab=37.79 E-value=1.2e+02 Score=30.66 Aligned_cols=148 Identities=15% Similarity=0.102 Sum_probs=75.0
Q ss_pred ccchHHHHHHHHHHHHHhcCCCHHHHHHHHHhhhcCCCcccchhhHHHHHHhhhhhccccCCHHHHHHHHHHHHHHhcCC
Q psy11212 275 HELSVEQQLYYKEITEACVGSDEGRRFEALQSLATDPGLHEMLPRMCTFIAEGVKVNVVQNNLALLIYLMRMVKALLDNQ 354 (448)
Q Consensus 275 h~LSkElQlyf~~It~al~~~~e~~r~~aL~sL~tD~gL~qLlPyfv~FI~e~V~~nl~~~nl~~L~~ll~~v~ALl~N~ 354 (448)
..+|.|.-.+.+++-...... |...+ ..|.. ...--|+..+..- .++...+.|++.++.-|+...
T Consensus 24 ~~is~~~~~~ik~~~~~~~~~----~~~~~---~~~~~--~~~~~~l~lL~~~------~~~~d~v~yvL~li~dll~~~ 88 (312)
T PF03224_consen 24 GLISEEDLSLIKKLDKQSKEE----RRELL---EEDGD--QYASLFLNLLNKL------SSNDDTVQYVLTLIDDLLSDD 88 (312)
T ss_dssp TSS-HHHHHHHHHHHHHHH----------------------------HHHHHH---------HHHHHHHHHHHHHHHH-S
T ss_pred CCCCHHHHHHHHHHHCCCHHH----HHHHH---Hhchh--hHHHHHHHHHHHc------cCcHHHHHHHHHHHHHHHhcC
Confidence 467888887777766665533 22222 22221 2222334444322 257889999999998888777
Q ss_pred CcccccchhhHHH----HHHHHhhccccCCCCCCCcchHHHHHHHHHHHHHHHHhcCCCcchHHHHHHHHHHHhcCCCCC
Q psy11212 355 SLYLEKYLHELLP----SILSCIVSKQLCGRPEADNHWALRDFASRLLTQVSKNFNTSTNNIQTRVTRLLSNAINDPKIN 430 (448)
Q Consensus 355 ~l~iepYlHqLlp----~lLtCll~k~l~~~~~~~~hw~LRd~AA~lL~~I~~~~~~sy~~L~~RI~~tl~k~l~d~~~~ 430 (448)
. .....++.+-. ...+.++. +.. .++|.+...|+.+|+.++...+..-......+...|.+.+.+-.++
T Consensus 89 ~-~~~~~~~~~~~~~~~~~~~~fl~--ll~----~~D~~i~~~a~~iLt~Ll~~~~~~~~~~~~~~l~~ll~~L~~~l~~ 161 (312)
T PF03224_consen 89 P-SRVELFLELAKQDDSDPYSPFLK--LLD----RNDSFIQLKAAFILTSLLSQGPKRSEKLVKEALPKLLQWLSSQLSS 161 (312)
T ss_dssp S-SSHHHHHHHHH-TTH--HHHHHH--H-S-----SSHHHHHHHHHHHHHHHTSTTT--HHHHHHHHHHHHHHHH-TT-H
T ss_pred H-HHHHHHHHhcccccchhHHHHHH--Hhc----CCCHHHHHHHHHHHHHHHHcCCccccchHHHHHHHHHHHHHHhhcC
Confidence 6 33333443221 11222222 111 2478999999999999998877766654456667776666653222
Q ss_pred --ccchhhhhhhhccc
Q psy11212 431 --FPSLYGAIAGEGLV 444 (448)
Q Consensus 431 --l~t~YGAl~GL~al 444 (448)
....+.|+.+|+.|
T Consensus 162 ~~~~~~~~av~~L~~L 177 (312)
T PF03224_consen 162 SDSELQYIAVQCLQNL 177 (312)
T ss_dssp HHH---HHHHHHHHHH
T ss_pred CCcchHHHHHHHHHHH
Confidence 23448888888765
No 75
>PF12717 Cnd1: non-SMC mitotic condensation complex subunit 1
Probab=36.13 E-value=3.2e+02 Score=25.10 Aligned_cols=57 Identities=14% Similarity=0.222 Sum_probs=40.8
Q ss_pred CHHHHHHHHHHHHHHhcCCCcccccchhhHHHHHHHHhhccccCCCCCCCcchHHHHHHHHHHHHHHHHh
Q psy11212 336 NLALLIYLMRMVKALLDNQSLYLEKYLHELLPSILSCIVSKQLCGRPEADNHWALRDFASRLLTQVSKNF 405 (448)
Q Consensus 336 nl~~L~~ll~~v~ALl~N~~l~iepYlHqLlp~lLtCll~k~l~~~~~~~~hw~LRd~AA~lL~~I~~~~ 405 (448)
+..+=...+.++.-|+.|..+.....+ +.-++.|+ .|+...+|+.|..++..+..+.
T Consensus 38 ~~~VR~~al~~Ls~Li~~d~ik~k~~l---~~~~l~~l----------~D~~~~Ir~~A~~~~~e~~~~~ 94 (178)
T PF12717_consen 38 DPLVRKTALLVLSHLILEDMIKVKGQL---FSRILKLL----------VDENPEIRSLARSFFSELLKKR 94 (178)
T ss_pred CHHHHHHHHHHHHHHHHcCceeehhhh---hHHHHHHH----------cCCCHHHHHHHHHHHHHHHHhc
Confidence 444444466777888999888888655 33344555 2456689999999999998883
No 76
>KOG1248|consensus
Probab=35.21 E-value=2.2e+02 Score=34.37 Aligned_cols=113 Identities=22% Similarity=0.367 Sum_probs=73.8
Q ss_pred hHHHHHHHHHHHHHhcCCCHHHHHH---HH-------HhhhcCCCcccchhhHHHHHHhhhhhccccCCHHHHHHHHHHH
Q psy11212 278 SVEQQLYYKEITEACVGSDEGRRFE---AL-------QSLATDPGLHEMLPRMCTFIAEGVKVNVVQNNLALLIYLMRMV 347 (448)
Q Consensus 278 SkElQlyf~~It~al~~~~e~~r~~---aL-------~sL~tD~gL~qLlPyfv~FI~e~V~~nl~~~nl~~L~~ll~~v 347 (448)
+.-++.|+..|.+.++|+....+.. |+ ...-+|+-+.+++-..+-|| .+++-++...-+..+
T Consensus 780 ~~~lnefl~~Isagl~gd~~~~~as~Ivai~~il~e~~~~ld~~~l~~li~~V~~~L--------~s~sreI~kaAI~fi 851 (1176)
T KOG1248|consen 780 SAILNEFLSIISAGLVGDSTRVVASDIVAITHILQEFKNILDDETLEKLISMVCLYL--------ASNSREIAKAAIGFI 851 (1176)
T ss_pred HHHHHHHHHHHHhhhcccHHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHH--------hcCCHHHHHHHHHHH
Confidence 3457889999999988765443322 22 22233433444443333333 345666777777777
Q ss_pred HHHhcC-CCcccccchhhHHHHHHHHhhccccCCCCCCCcchHHHHHHHHHHHHHHHHhcCC
Q psy11212 348 KALLDN-QSLYLEKYLHELLPSILSCIVSKQLCGRPEADNHWALRDFASRLLTQVSKNFNTS 408 (448)
Q Consensus 348 ~ALl~N-~~l~iepYlHqLlp~lLtCll~k~l~~~~~~~~hw~LRd~AA~lL~~I~~~~~~s 408 (448)
..++.- |-..+-+|+.+|||+++. +.+ +..-.+|-..-.+|..+|++||..
T Consensus 852 kvlv~~~pe~~l~~~~~~LL~sll~--ls~--------d~k~~~r~Kvr~LlekLirkfg~~ 903 (1176)
T KOG1248|consen 852 KVLVYKFPEECLSPHLEELLPSLLA--LSH--------DHKIKVRKKVRLLLEKLIRKFGAE 903 (1176)
T ss_pred HHHHHcCCHHHHhhhHHHHHHHHHH--HHH--------hhhHHHHHHHHHHHHHHHHHhCHH
Confidence 766643 556789999999999887 222 123378999999999999999963
No 77
>PF12074 DUF3554: Domain of unknown function (DUF3554); InterPro: IPR022716 This presumed domain is functionally uncharacterised. This domain is found in eukaryotes. This domain is typically between 287 to 356 amino acids in length. This domain is found associated with PF02985 from PFAM.
Probab=34.26 E-value=1.9e+02 Score=29.37 Aligned_cols=91 Identities=16% Similarity=0.221 Sum_probs=62.0
Q ss_pred HHHhhhhhccccCCHHHHHHHHHHHHHHhcCCCcccccc-hhhHHHHHHHHhhccccCCCCCCCcchHHHHHHHHHHHHH
Q psy11212 323 FIAEGVKVNVVQNNLALLIYLMRMVKALLDNQSLYLEKY-LHELLPSILSCIVSKQLCGRPEADNHWALRDFASRLLTQV 401 (448)
Q Consensus 323 FI~e~V~~nl~~~nl~~L~~ll~~v~ALl~N~~l~iepY-lHqLlp~lLtCll~k~l~~~~~~~~hw~LRd~AA~lL~~I 401 (448)
|++++|...+ .+-+-+..++|..++++.+-.-.+..- ...+--+++.|++++.+ .|..|..|.+.|..+
T Consensus 164 ll~~kvyskl--~~~~d~~w~~~al~~~~~~~~~~~~~~~~~~~~~a~i~ll~s~~~--------~~~vR~~A~~~l~~l 233 (339)
T PF12074_consen 164 LLSEKVYSKL--ASEEDLCWLLRALEALLSDHPSELSSDKSSAWAQAFIYLLCSSNV--------SWKVRRAALSALKKL 233 (339)
T ss_pred hcCHHHHhcc--CCHhHHHHHHHHHHHHHhcchhhhhhhHHHHHHHHHHHHHHcCCC--------CHHHHHHHHHHHHHH
Confidence 3444554443 356667889999999988765444332 35666667778766422 578999999999999
Q ss_pred HHHhcCCCcchHHHHHHHHHHHhcC
Q psy11212 402 SKNFNTSTNNIQTRVTRLLSNAIND 426 (448)
Q Consensus 402 ~~~~~~sy~~L~~RI~~tl~k~l~d 426 (448)
+.+.... +..-|+.-|.+.+.+
T Consensus 234 ~~~~~~~---l~~~li~~l~~~l~~ 255 (339)
T PF12074_consen 234 YASNPEL---LSKSLISGLWKWLSS 255 (339)
T ss_pred HHhChHH---HHHHHHHHHHHHHHh
Confidence 9886554 666677777777653
No 78
>PF12460 MMS19_C: RNAPII transcription regulator C-terminal; InterPro: IPR024687 This domain, approximately 60 amino acids in length, is found in the N-terminal region of MMS19 proteins. MMS19 is required for both nucleotide excision repair (NER) and RNA polymerase II (RNAP II) transcription [].
Probab=32.37 E-value=1.2e+02 Score=31.88 Aligned_cols=88 Identities=17% Similarity=0.360 Sum_probs=53.2
Q ss_pred HHHHHHHHHHHhc----CCCHHHH---HHHHHhhhcCCCcccchhhHHH---HHHhhhhhccccCCHHHHHHHHHHHHHH
Q psy11212 281 QQLYYKEITEACV----GSDEGRR---FEALQSLATDPGLHEMLPRMCT---FIAEGVKVNVVQNNLALLIYLMRMVKAL 350 (448)
Q Consensus 281 lQlyf~~It~al~----~~~e~~r---~~aL~sL~tD~gL~qLlPyfv~---FI~e~V~~nl~~~nl~~L~~ll~~v~AL 350 (448)
-|.||..+.-.++ +.+...+ -.||..|=..---+.++|++-. .+-+.+ ...|..++...+++...+
T Consensus 317 kQR~F~~~~p~L~~~~~~~~~~~k~~yL~ALs~ll~~vP~~vl~~~l~~LlPLLlqsL----~~~~~~v~~s~L~tL~~~ 392 (415)
T PF12460_consen 317 KQRFFTQVLPKLLEGFKEADDEIKSNYLTALSHLLKNVPKSVLLPELPTLLPLLLQSL----SLPDADVLLSSLETLKMI 392 (415)
T ss_pred hHHHHHHHHHHHHHHHhhcChhhHHHHHHHHHHHHhhCCHHHHHHHHHHHHHHHHHHh----CCCCHHHHHHHHHHHHHH
Confidence 4666666655544 3333222 3455443333333333333222 223332 236777888899999999
Q ss_pred hcCCCcccccchhhHHHHHHHH
Q psy11212 351 LDNQSLYLEKYLHELLPSILSC 372 (448)
Q Consensus 351 l~N~~l~iepYlHqLlp~lLtC 372 (448)
+....=-+..|++.|+|.+|.+
T Consensus 393 l~~~~~~i~~hl~sLI~~LL~l 414 (415)
T PF12460_consen 393 LEEAPELISEHLSSLIPRLLKL 414 (415)
T ss_pred HHcCHHHHHHHHHHHHHHHHhc
Confidence 9988777888999999998864
No 79
>KOG0869|consensus
Probab=31.94 E-value=29 Score=32.26 Aligned_cols=36 Identities=14% Similarity=-0.018 Sum_probs=32.0
Q ss_pred chhhhhhhhhhhhcccCCccCChhHHHHHHHhcCCC
Q psy11212 120 SAQKLESISFKCMSHSKRVKLLPVDIDDALNMKNIE 155 (448)
Q Consensus 120 ~~q~l~~~A~Kfm~hskR~kLt~~DIn~AL~~~niE 155 (448)
|++.++++|+.-..+-||++.|-+||=-||-.++.|
T Consensus 67 fISFvT~EAsekC~~EkRKTIngdDllwAm~tLGFe 102 (168)
T KOG0869|consen 67 FISFVTGEASEKCQREKRKTINGDDLLWAMSTLGFE 102 (168)
T ss_pred HHHHHhhHHHHHHHHHhcCcccHHHHHHHHHHcCcH
Confidence 667788888888888999999999999999999875
No 80
>PF10363 DUF2435: Protein of unknown function (DUF2435)
Probab=30.02 E-value=87 Score=26.28 Aligned_cols=56 Identities=14% Similarity=0.151 Sum_probs=45.6
Q ss_pred hHHHHHHHHHHHHHHHHhcCCCcchHHHHHHHHHHHhcCCCCCccchhhhhhhhcccCC
Q psy11212 388 WALRDFASRLLTQVSKNFNTSTNNIQTRVTRLLSNAINDPKINFPSLYGAIAGEGLVTE 446 (448)
Q Consensus 388 w~LRd~AA~lL~~I~~~~~~sy~~L~~RI~~tl~k~l~d~~~~l~t~YGAl~GL~alG~ 446 (448)
-.+|-.+=..|.+++++.+ .-..-.++|...++..|.|+. | -.+..||.||++|+.
T Consensus 17 ~PvRa~gL~~L~~Li~~~~-~~~~~~~~il~l~l~~L~d~D-s-yVYL~aI~~L~~La~ 72 (92)
T PF10363_consen 17 PPVRAHGLVLLRKLIESKS-EPVIDIPKILDLFLSQLKDED-S-YVYLNAIKGLAALAD 72 (92)
T ss_pred cchHHHHHHHHHHHHHcCC-cchhhHHHHHHHHHHHcCCCC-c-hHHHHHHHHHHHHHH
Confidence 3689999999999999977 223447899999999999884 3 378899999999875
No 81
>PF08389 Xpo1: Exportin 1-like protein; InterPro: IPR013598 The exchange of macromolecules between the nucleus and cytoplasm takes place through nuclear pore complexes within the nuclear membrane. Active transport of large molecules through these pore complexes require carrier proteins, called karyopherins (importins and exportins), which shuttle between the two compartments. This domain is found close to the N terminus of yeast exportin 1 (Xpo1, Crm1, P14068 from SWISSPROT), as well as adjacent to the N-terminal domain of importin-beta (IPR001494 from INTERPRO). Exportin 1 is a nuclear export receptor that translocates proteins out of the nucleus; it interacts with leucine-rich nuclear export signal (NES) sequences in proteins to be transported, as well as with RanGTP [, ]. Importin-beta is a nuclear import receptor that translocates proteins into the nucleus; it interacts with RanGTP and importin-alpha, the latter binding with the nuclear localisation signal (NLS) sequences in proteins to be transported []. More information about these proteins can be found at Protein of the Month: Importins [].; PDB: 3IBV_A 3ICQ_U 3M1I_C 3NC1_A 3NBY_D 3NBZ_D 3NC0_A 3GJX_D 2XWU_B 2X19_B ....
Probab=29.43 E-value=1.6e+02 Score=25.31 Aligned_cols=81 Identities=21% Similarity=0.321 Sum_probs=47.7
Q ss_pred HHHHHHHHhhhcCCCcccchhhHHHHHHhhhhhccccCCHHHHHHHHHHHHHHhcCCCcccccchh-hHHHHHHHHhhcc
Q psy11212 298 GRRFEALQSLATDPGLHEMLPRMCTFIAEGVKVNVVQNNLALLIYLMRMVKALLDNQSLYLEKYLH-ELLPSILSCIVSK 376 (448)
Q Consensus 298 ~~r~~aL~sL~tD~gL~qLlPyfv~FI~e~V~~nl~~~nl~~L~~ll~~v~ALl~N~~l~iepYlH-qLlp~lLtCll~k 376 (448)
++|...-+.|+.. +|.++.++.+-+..+....+.......++.+.+.+. .+.++.... .+++.+...+
T Consensus 67 ~r~~~l~~~l~~~------~~~i~~~l~~~l~~~~~~~~~~~~~~~L~~l~s~i~--~~~~~~i~~~~~l~~~~~~l--- 135 (148)
T PF08389_consen 67 ERRRELKDALRSN------SPDILEILSQILSQSSSEANEELVKAALKCLKSWIS--WIPIELIINSNLLNLIFQLL--- 135 (148)
T ss_dssp HHHHHHHHHHHHH------HHHHHHHHHHHHHHHCHCCHHHHHHHHHHHHHHHTT--TS-HHHHHSSSHHHHHHHHT---
T ss_pred HHHHHHHHHHHHH------HHHHHHHHHHHHHhhccccHHHHHHHHHHHHHHHHH--hCCHHHhccHHHHHHHHHHc---
Confidence 4455555666654 455555555444443322236778889999999988 555555544 3666665544
Q ss_pred ccCCCCCCCcchHHHHHHHHHH
Q psy11212 377 QLCGRPEADNHWALRDFASRLL 398 (448)
Q Consensus 377 ~l~~~~~~~~hw~LRd~AA~lL 398 (448)
..+ .+|+.|+++|
T Consensus 136 ---~~~------~~~~~A~~cl 148 (148)
T PF08389_consen 136 ---QSP------ELREAAAECL 148 (148)
T ss_dssp ---TSC------CCHHHHHHHH
T ss_pred ---CCH------HHHHHHHHhC
Confidence 122 2488888775
No 82
>PRK13800 putative oxidoreductase/HEAT repeat-containing protein; Provisional
Probab=29.20 E-value=3.1e+02 Score=32.15 Aligned_cols=35 Identities=26% Similarity=0.265 Sum_probs=21.9
Q ss_pred CcchHHHHHHHHHHHHHHHHhcCCCcchHHHHHHHHHHHhcCCC
Q psy11212 385 DNHWALRDFASRLLTQVSKNFNTSTNNIQTRVTRLLSNAINDPK 428 (448)
Q Consensus 385 ~~hw~LRd~AA~lL~~I~~~~~~sy~~L~~RI~~tl~k~l~d~~ 428 (448)
+.+|.+|..|+..|+.+ ...+.....|.+++.|+.
T Consensus 849 D~~~~VR~~A~~aL~~~---------~~~~~a~~~L~~al~D~d 883 (897)
T PRK13800 849 DPHLDVRKAAVLALTRW---------PGDPAARDALTTALTDSD 883 (897)
T ss_pred CCCHHHHHHHHHHHhcc---------CCCHHHHHHHHHHHhCCC
Confidence 56677777777777775 223445566666666653
No 83
>PF04286 DUF445: Protein of unknown function (DUF445); InterPro: IPR007383 This entry contains proteins of unknown function. They are predicted to be transmembrane proteins with 2 or 3 TM domains.
Probab=28.65 E-value=98 Score=31.22 Aligned_cols=57 Identities=21% Similarity=0.203 Sum_probs=29.2
Q ss_pred cccccchHHHHHHHHHHHHHhcCCCHH------HHHHHHHhhhcCCCcccchhhHHHHHHhhh
Q psy11212 272 LATHELSVEQQLYYKEITEACVGSDEG------RRFEALQSLATDPGLHEMLPRMCTFIAEGV 328 (448)
Q Consensus 272 ~~~h~LSkElQlyf~~It~al~~~~e~------~r~~aL~sL~tD~gL~qLlPyfv~FI~e~V 328 (448)
...|.+..+....+.+....+..+++- .....|..+.....+..+.-.+..++.+.+
T Consensus 202 ~~~~~lr~~~~~~l~~~i~~L~~d~~~~~~i~~~~~~~l~~~~~~~~~~~l~~~l~~~i~~~l 264 (367)
T PF04286_consen 202 DPDHPLRQEIDQKLRELIERLLTDPELREKIEELKDKLLSELILEEFLEELWDSLREWIKEDL 264 (367)
T ss_pred CcccHhHHHHHHHHHHHHHHHhcCHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHhh
Confidence 345667777776666666666543221 122344444444444444445555554444
No 84
>PF06025 DUF913: Domain of Unknown Function (DUF913); InterPro: IPR010314 This is a domain of unknown function found towards the N terminus of a family of E3 ubiquitin protein ligases, including yeast TOM1, many of which appear to play a role in mRNA transcription and processing. This domain is found in association with and immediately C-terminal to another domain of unknown function: IPR010309 from INTERPRO.
Probab=28.08 E-value=93 Score=32.81 Aligned_cols=96 Identities=13% Similarity=0.160 Sum_probs=69.7
Q ss_pred cchhhHHHHHH-hhhhhccccCCHHHHHHHHHHHHHHhcCCCc-c-cccchhhHHHHHHHHhhccccCCCCCCCcchHHH
Q psy11212 315 EMLPRMCTFIA-EGVKVNVVQNNLALLIYLMRMVKALLDNQSL-Y-LEKYLHELLPSILSCIVSKQLCGRPEADNHWALR 391 (448)
Q Consensus 315 qLlPyfv~FI~-e~V~~nl~~~nl~~L~~ll~~v~ALl~N~~l-~-iepYlHqLlp~lLtCll~k~l~~~~~~~~hw~LR 391 (448)
.|.+.|..-|. +++- .+.+.|..+-.++.||.-|..= . +.. ...++.++.+..+++... .+|
T Consensus 151 Gl~~~~L~~i~~~~i~-----~s~e~l~~lP~~l~AicLN~~Gl~~~~~--~~~l~~~f~if~s~~~~~--------~l~ 215 (379)
T PF06025_consen 151 GLIDAFLDAITAKGIL-----PSSEVLTSLPNVLSAICLNNRGLEKVKS--SNPLDKLFEIFTSPDYVK--------ALR 215 (379)
T ss_pred CChHHHHHHHhccCCC-----CcHHHHHHHHHHHhHHhcCHHHHHHHHh--cChHHHHHHHhCCHHHHH--------Hhc
Confidence 56666777776 5542 6788899999999999999873 1 222 266667777766655442 344
Q ss_pred H-HHHHHHHHHHHHhcCCCcchHHHHHHHHHHHhc
Q psy11212 392 D-FASRLLTQVSKNFNTSTNNIQTRVTRLLSNAIN 425 (448)
Q Consensus 392 d-~AA~lL~~I~~~~~~sy~~L~~RI~~tl~k~l~ 425 (448)
+ ..|..|+.-.+....-+|.||+-+++.+.+.+.
T Consensus 216 ~~d~a~~lG~~~DEL~RH~p~Lk~~i~~~ii~~l~ 250 (379)
T PF06025_consen 216 RRDTASNLGNSFDELMRHHPSLKPDIIDAIIKILD 250 (379)
T ss_pred ccchHHHHHHHHHHHHccCHHHHHHHHHHHHHHHH
Confidence 4 778888888888889999999999988887764
No 85
>COG2036 HHT1 Histones H3 and H4 [Chromatin structure and dynamics]
Probab=27.88 E-value=46 Score=28.21 Aligned_cols=44 Identities=20% Similarity=0.408 Sum_probs=34.3
Q ss_pred CCCHHHHHHHHHhcC--CCCccHHHHHHh---------HHhHHHHHHhhcccccC
Q psy11212 7 SFSIESMKVIAESIG--IPNLPDDAAKEL---------SDNITYCCKVIIQPIYG 50 (448)
Q Consensus 7 ~~~~~~~~~~a~~~~--~~~~~~~~~k~m---------~~d~~~~l~~~~ep~~g 50 (448)
.++.++...+++-+. +.+|..+|..+- .+||..|++..-...||
T Consensus 36 Rvs~~A~~~l~~~~e~~~~~i~~~A~~~A~ha~RKTV~~~DI~la~~~~~~~~~~ 90 (91)
T COG2036 36 RVSSSAIEELQEALEEYLEEIAEDAVELAEHAKRKTVKAEDIKLALKRLGRRIYG 90 (91)
T ss_pred HhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCeecHHHHHHHHHHhcccccc
Confidence 478888888888883 678999999885 69999999975444443
No 86
>COG5593 Nucleic-acid-binding protein possibly involved in ribosomal biogenesis [Translation, ribosomal structure and biogenesis]
Probab=27.33 E-value=1.2e+02 Score=33.59 Aligned_cols=118 Identities=19% Similarity=0.204 Sum_probs=74.2
Q ss_pred hcCCCcccch--hhHHHHHHhhhhhccccCCHHHHHHHHHHHHHHhcCCCcccccchhhHHHHHH-----------HHhh
Q psy11212 308 ATDPGLHEML--PRMCTFIAEGVKVNVVQNNLALLIYLMRMVKALLDNQSLYLEKYLHELLPSIL-----------SCIV 374 (448)
Q Consensus 308 ~tD~gL~qLl--Pyfv~FI~e~V~~nl~~~nl~~L~~ll~~v~ALl~N~~l~iepYlHqLlp~lL-----------tCll 374 (448)
...||+.-.| ||+.+|+-+....++- ...++++++|-..|-+++-.-+-.++=-+| ...+
T Consensus 162 k~q~~lsk~v~~k~l~~~~fesflk~l~-------fr~levle~ls~d~i~~Vk~qvv~~VydLL~a~peqe~nLl~L~I 234 (821)
T COG5593 162 KNQPGLSKEVQNKYLKQRIFESFLKNLR-------FRVLEVLEVLSHDPIQYVKKQVVRLVYDLLEARPEQEVNLLHLFI 234 (821)
T ss_pred hcCcchhhhhcchHHHHHHHHHHHHHHH-------HHHHHHHHHhccchHHHHHHHHHHHHHHHHhcChHHHHHHHHHHH
Confidence 4456653333 7888898888776541 224555555555554444322222111111 1111
Q ss_pred ccccCCCCCCCcchHHHHHHHHHHHHHHHHhcCCCcchHHHHHHHHHHHhcCCCCCccchhhhhhhhc
Q psy11212 375 SKQLCGRPEADNHWALRDFASRLLTQVSKNFNTSTNNIQTRVTRLLSNAINDPKINFPSLYGAIAGEG 442 (448)
Q Consensus 375 ~k~l~~~~~~~~hw~LRd~AA~lL~~I~~~~~~sy~~L~~RI~~tl~k~l~d~~~~l~t~YGAl~GL~ 442 (448)
. ++| ..|+.-|+--++++.+....-|+++.=|++-.-...+.|+..+.+.|=|++.|.
T Consensus 235 N-KlG---------Dk~~kvsskasY~ilkLe~~hP~mk~VV~d~Iedf~f~pn~~~~s~yyalitLn 292 (821)
T COG5593 235 N-KLG---------DKRDKVSSKASYVILKLELLHPGMKEVVLDGIEDFYFKPNDLFRSRYYALITLN 292 (821)
T ss_pred H-hhc---------cchhhhhhhhhHHHHHHHhcCCchhHHHHhhhhHheeccccccceeeeeehhhH
Confidence 1 122 246655666667777778888999999999999999999999999999988775
No 87
>PRK00034 gatC aspartyl/glutamyl-tRNA amidotransferase subunit C; Reviewed
Probab=26.98 E-value=2.6e+02 Score=23.01 Aligned_cols=36 Identities=11% Similarity=0.262 Sum_probs=30.7
Q ss_pred CCCHHHHHHHHHhcCCCCccHHHHHHhHHhHHHHHHh
Q psy11212 7 SFSIESMKVIAESIGIPNLPDDAAKELSDNITYCCKV 43 (448)
Q Consensus 7 ~~~~~~~~~~a~~~~~~~~~~~~~k~m~~d~~~~l~~ 43 (448)
.++.+.++.||.-.+| ++.++-..-+..+++..|..
T Consensus 2 ~i~~e~i~~la~La~l-~l~~ee~~~~~~~l~~il~~ 37 (95)
T PRK00034 2 AITREEVKHLAKLARL-ELSEEELEKFAGQLNKILDF 37 (95)
T ss_pred CCCHHHHHHHHHHhCC-CCCHHHHHHHHHHHHHHHHH
Confidence 4789999999999999 58888888888888887764
No 88
>PF13646 HEAT_2: HEAT repeats; PDB: 1OYZ_A 3FGA_A 2PF4_C 2IAE_A 3B2A_A.
Probab=26.85 E-value=3e+02 Score=21.28 Aligned_cols=57 Identities=18% Similarity=0.228 Sum_probs=39.2
Q ss_pred hHHHHHHHHhhccccCCCCCCCcchHHHHHHHHHHHHHHHHhcCCCcchHHHHHHHHHHHhcCCCCCccchhhhhhhh
Q psy11212 364 ELLPSILSCIVSKQLCGRPEADNHWALRDFASRLLTQVSKNFNTSTNNIQTRVTRLLSNAINDPKINFPSLYGAIAGE 441 (448)
Q Consensus 364 qLlp~lLtCll~k~l~~~~~~~~hw~LRd~AA~lL~~I~~~~~~sy~~L~~RI~~tl~k~l~d~~~~l~t~YGAl~GL 441 (448)
+.+|.++.++ .+++|.+|..|+..|+.+- .++....|.+.+.++. ....+.-|+..|
T Consensus 31 ~~~~~L~~~l----------~d~~~~vr~~a~~aL~~i~----------~~~~~~~L~~~l~~~~-~~~vr~~a~~aL 87 (88)
T PF13646_consen 31 EAIPALIELL----------KDEDPMVRRAAARALGRIG----------DPEAIPALIKLLQDDD-DEVVREAAAEAL 87 (88)
T ss_dssp HHHHHHHHHH----------TSSSHHHHHHHHHHHHCCH----------HHHTHHHHHHHHTC-S-SHHHHHHHHHHH
T ss_pred hHHHHHHHHH----------cCCCHHHHHHHHHHHHHhC----------CHHHHHHHHHHHcCCC-cHHHHHHHHhhc
Confidence 5677666655 1467899999999999872 3678888888888764 344455555544
No 89
>KOG1745|consensus
Probab=26.56 E-value=21 Score=32.41 Aligned_cols=31 Identities=32% Similarity=0.440 Sum_probs=26.3
Q ss_pred hhhhhhhhhcccCCccCChhHHHHHHHhcCC
Q psy11212 124 LESISFKCMSHSKRVKLLPVDIDDALNMKNI 154 (448)
Q Consensus 124 l~~~A~Kfm~hskR~kLt~~DIn~AL~~~ni 154 (448)
+...+..+..|+||-++++.||..|.+.++-
T Consensus 105 LfEdtnlcAihAkRVTimpkdiQlArrirg~ 135 (137)
T KOG1745|consen 105 LFEDTNLCAIHAKRVTIMPKDIQLARRIRGE 135 (137)
T ss_pred hccccchhhhccceeEecccceehhhhcccC
Confidence 4445678999999999999999999998763
No 90
>PF12719 Cnd3: Nuclear condensing complex subunits, C-term domain
Probab=26.44 E-value=83 Score=31.57 Aligned_cols=56 Identities=11% Similarity=0.045 Sum_probs=41.1
Q ss_pred cchHHHHHHHHHHHHHHHHhcCCCcchH---------HHHHHHHHHHhcCCCCCccchhhhhhhhcc
Q psy11212 386 NHWALRDFASRLLTQVSKNFNTSTNNIQ---------TRVTRLLSNAINDPKINFPSLYGAIAGEGL 443 (448)
Q Consensus 386 ~hw~LRd~AA~lL~~I~~~~~~sy~~L~---------~RI~~tl~k~l~d~~~~l~t~YGAl~GL~a 443 (448)
++-.+|-.|.+.+-.+...||..+-.-. ..+.+.|.+.|.+. ....+.=|+-|++.
T Consensus 76 ~~~~v~~~al~~l~Dll~~~g~~~~~~~~~~~~~~~~~~l~~~l~~~l~~~--~~~~~~~a~EGl~K 140 (298)
T PF12719_consen 76 DDEEVKITALKALFDLLLTHGIDIFDSESDNDESVDSKSLLKILTKFLDSE--NPELQAIAVEGLCK 140 (298)
T ss_pred CCHHHHHHHHHHHHHHHHHcCchhccchhccCccchHhHHHHHHHHHHhcC--CHHHHHHHHHHHHH
Confidence 4678999999999999999996554433 45888888888866 33466666666654
No 91
>PF09862 DUF2089: Protein of unknown function (DUF2089); InterPro: IPR018658 This family consists of various hypothetical prokaryotic proteins.
Probab=26.05 E-value=73 Score=28.09 Aligned_cols=28 Identities=14% Similarity=0.325 Sum_probs=23.6
Q ss_pred HHHHHHHhcCCCcchHHHHHHHHHHHhc
Q psy11212 398 LTQVSKNFNTSTNNIQTRVTRLLSNAIN 425 (448)
Q Consensus 398 L~~I~~~~~~sy~~L~~RI~~tl~k~l~ 425 (448)
|..+-+.|+-|||++|.|+-+.+.+.=.
T Consensus 52 lKe~e~~lgiSYPTvR~rLd~ii~~lg~ 79 (113)
T PF09862_consen 52 LKEMEKELGISYPTVRNRLDKIIEKLGY 79 (113)
T ss_pred HHHHHHHHCCCcHHHHHHHHHHHHHhCC
Confidence 4578889999999999999888877655
No 92
>PRK07668 hypothetical protein; Validated
Probab=25.82 E-value=81 Score=31.58 Aligned_cols=33 Identities=18% Similarity=0.094 Sum_probs=23.4
Q ss_pred cchHHHHHHHHHHHHHh--cCCCHHHHHHHHHhhh
Q psy11212 276 ELSVEQQLYYKEITEAC--VGSDEGRRFEALQSLA 308 (448)
Q Consensus 276 ~LSkElQlyf~~It~al--~~~~e~~r~~aL~sL~ 308 (448)
.||+|-|.|+.++...+ .+-+++++++.|+.+.
T Consensus 1 mLTkeNeefl~~L~~yL~~~glseeeieeiL~Ei~ 35 (254)
T PRK07668 1 MLSKEGRKFLDDTRVYLIAKGIKEEDIESFLEDAE 35 (254)
T ss_pred CCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHH
Confidence 48899999999988888 3445666666655543
No 93
>smart00567 EZ_HEAT E-Z type HEAT repeats. Present in subunits of cyanobacterial phycocyanin lyase, and other proteins. Probable scaffolding role.
Probab=25.65 E-value=87 Score=20.03 Aligned_cols=29 Identities=24% Similarity=0.189 Sum_probs=20.6
Q ss_pred hHHHHHHHHHHHHHHHHhcCCCcchHHHHHHHHHHHhcC
Q psy11212 388 WALRDFASRLLTQVSKNFNTSTNNIQTRVTRLLSNAIND 426 (448)
Q Consensus 388 w~LRd~AA~lL~~I~~~~~~sy~~L~~RI~~tl~k~l~d 426 (448)
|.+|..|+..|+.|.. ++.+..|.++|.|
T Consensus 1 ~~vR~~aa~aLg~~~~----------~~a~~~L~~~l~d 29 (30)
T smart00567 1 PLVRHEAAFALGQLGD----------EEAVPALIKALED 29 (30)
T ss_pred CHHHHHHHHHHHHcCC----------HhHHHHHHHHhcC
Confidence 7899999999998822 3455556666654
No 94
>KOG0212|consensus
Probab=25.30 E-value=4.7e+02 Score=29.48 Aligned_cols=115 Identities=21% Similarity=0.304 Sum_probs=77.8
Q ss_pred HHHHHHHHHHHHhc----CCCH------HHHHHHHHhhhcCCCcccchhhHHHHHHhhhhhccccCCHHHHHHHHHHHHH
Q psy11212 280 EQQLYYKEITEACV----GSDE------GRRFEALQSLATDPGLHEMLPRMCTFIAEGVKVNVVQNNLALLIYLMRMVKA 349 (448)
Q Consensus 280 ElQlyf~~It~al~----~~~e------~~r~~aL~sL~tD~gL~qLlPyfv~FI~e~V~~nl~~~nl~~L~~ll~~v~A 349 (448)
|--.||.+|..++- ++|. +.+...++.+-+...-.-=+|-|+..+.+.+.+ .|...=.+++.-+.-
T Consensus 118 ~v~~~Fn~iFdvL~klsaDsd~~V~~~aeLLdRLikdIVte~~~tFsL~~~ipLL~eriy~----~n~~tR~flv~Wl~~ 193 (675)
T KOG0212|consen 118 EVLVYFNEIFDVLCKLSADSDQNVRGGAELLDRLIKDIVTESASTFSLPEFIPLLRERIYV----INPMTRQFLVSWLYV 193 (675)
T ss_pred CcccchHHHHHHHHHHhcCCccccccHHHHHHHHHHHhccccccccCHHHHHHHHHHHHhc----CCchHHHHHHHHHHH
Confidence 34457888877753 2222 223333333333333244478888888888764 355555668888888
Q ss_pred HhcCCCcccccchhhHHHHHHHHhhccccCCCCCCCcchHHHHHHHHHHHHHHHHhcCC
Q psy11212 350 LLDNQSLYLEKYLHELLPSILSCIVSKQLCGRPEADNHWALRDFASRLLTQVSKNFNTS 408 (448)
Q Consensus 350 Ll~N~~l~iepYlHqLlp~lLtCll~k~l~~~~~~~~hw~LRd~AA~lL~~I~~~~~~s 408 (448)
|-.=|.+.+=.|+..|++.+++.+ .+++-++|+.+..+|+.....-.++
T Consensus 194 Lds~P~~~m~~yl~~~ldGLf~~L----------sD~s~eVr~~~~t~l~~fL~eI~s~ 242 (675)
T KOG0212|consen 194 LDSVPDLEMISYLPSLLDGLFNML----------SDSSDEVRTLTDTLLSEFLAEIRSS 242 (675)
T ss_pred HhcCCcHHHHhcchHHHHHHHHHh----------cCCcHHHHHHHHHHHHHHHHHHhcC
Confidence 888999999999999999988876 2566689999988887766654443
No 95
>PF05004 IFRD: Interferon-related developmental regulator (IFRD); InterPro: IPR007701 Interferon-related developmental regulator (IFRD1) is the human homologue of the Rattus norvegicus early response protein PC4 and its murine homologue TIS7 []. The exact function of IFRD1 is unknown but it has been shown that PC4 is necessary for muscle differentiation and that it might have a role in signal transduction. This entry also contains IFRD2 and its murine equivalent SKMc15, which are highly expressed soon after gastrulation and in the hepatic primordium, suggesting an involvement in early hematopoiesis [].
Probab=24.62 E-value=3.4e+02 Score=27.67 Aligned_cols=72 Identities=15% Similarity=0.268 Sum_probs=49.3
Q ss_pred ccccchhhHHHHHHHHhhccccCCCCCCCcchHHHHHHHHHHHHHHHHhc--CCCcchHHHHHHHHHHHhcCCCCCccch
Q psy11212 357 YLEKYLHELLPSILSCIVSKQLCGRPEADNHWALRDFASRLLTQVSKNFN--TSTNNIQTRVTRLLSNAINDPKINFPSL 434 (448)
Q Consensus 357 ~iepYlHqLlp~lLtCll~k~l~~~~~~~~hw~LRd~AA~lL~~I~~~~~--~sy~~L~~RI~~tl~k~l~d~~~~l~t~ 434 (448)
+++.+...|+..++.|+ |+ +.. .=+.+|+++++.+|-..| ..-..+-..+...|.+++.|...+...+
T Consensus 79 ~v~~~~~tL~~~~~k~l--kk-g~~-------~E~~lA~~~l~Ll~ltlg~g~~~~ei~~~~~~~L~~~l~d~s~~~~~R 148 (309)
T PF05004_consen 79 FVEDRRETLLDALLKSL--KK-GKS-------EEQALAARALALLALTLGAGEDSEEIFEELKPVLKRILTDSSASPKAR 148 (309)
T ss_pred HHHHHHHHHHHHHHHHh--cc-CCH-------HHHHHHHHHHHHHhhhcCCCccHHHHHHHHHHHHHHHHhCCccchHHH
Confidence 34455556666667666 11 111 237789999999999966 6668888899999999999986655443
Q ss_pred hhhh
Q psy11212 435 YGAI 438 (448)
Q Consensus 435 YGAl 438 (448)
=.++
T Consensus 149 ~~~~ 152 (309)
T PF05004_consen 149 AACL 152 (309)
T ss_pred HHHH
Confidence 3333
No 96
>KOG1823|consensus
Probab=24.27 E-value=1.2e+02 Score=37.12 Aligned_cols=90 Identities=20% Similarity=0.295 Sum_probs=55.9
Q ss_pred HHHHHHHHhcCCCcccccchhhHHHHHH--HHhhccccCCCCCCCcchHHHHHHHHHHHHHHHHhcCCC-cchHHHHHHH
Q psy11212 343 LMRMVKALLDNQSLYLEKYLHELLPSIL--SCIVSKQLCGRPEADNHWALRDFASRLLTQVSKNFNTST-NNIQTRVTRL 419 (448)
Q Consensus 343 ll~~v~ALl~N~~l~iepYlHqLlp~lL--tCll~k~l~~~~~~~~hw~LRd~AA~lL~~I~~~~~~sy-~~L~~RI~~t 419 (448)
.+-++.+.+.|.-+... .+|+|+ .|-+-+. ....+||.|-+++++|.+-.|..| ..+-+.+.-|
T Consensus 544 ~~~~~~~~l~~~~i~~~-----~L~~illkic~~l~~--------~s~e~rd~srktl~~i~k~Lg~~yl~~Vi~el~~T 610 (1364)
T KOG1823|consen 544 ALVFLVLFLGNAEIVLR-----NLPSILLDICYLLRS--------RSAELRDASRKTLAKIIKILGPKYLYFVIKELQTT 610 (1364)
T ss_pred HHHHHHhcccchHHHHh-----cCcHHHHHHHHHHhh--------hhHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHH
Confidence 34455556666555443 455555 3443321 234899999999999999999877 4556667777
Q ss_pred HHHHhcCCCCCccchh--hhhhhhcccC
Q psy11212 420 LSNAINDPKINFPSLY--GAIAGEGLVT 445 (448)
Q Consensus 420 l~k~l~d~~~~l~t~Y--GAl~GL~alG 445 (448)
+.+.+.--...+..|| ||+.+....|
T Consensus 611 l~rg~q~hvl~~tvh~ll~a~~~v~~~g 638 (1364)
T KOG1823|consen 611 LKRGFQVHVLSYTVHYLLGAMKNVLKAG 638 (1364)
T ss_pred HHHhhhhhhhHHHHHHHHHHHhcccccc
Confidence 7777765555555554 5555544333
No 97
>PF13251 DUF4042: Domain of unknown function (DUF4042)
Probab=23.60 E-value=1.1e+02 Score=29.16 Aligned_cols=53 Identities=26% Similarity=0.385 Sum_probs=41.1
Q ss_pred HHHHHHHHhcC-CCcccccchhhHHHHH-----------HHHhhccccCCCCCCCcchHHHHHHHHHHHHHHHH
Q psy11212 343 LMRMVKALLDN-QSLYLEKYLHELLPSI-----------LSCIVSKQLCGRPEADNHWALRDFASRLLTQVSKN 404 (448)
Q Consensus 343 ll~~v~ALl~N-~~l~iepYlHqLlp~l-----------LtCll~k~l~~~~~~~~hw~LRd~AA~lL~~I~~~ 404 (448)
-+.+..||.++ +.=.+-.|.+.|+|.. +||++. |..-..|--|+++|..+.+.
T Consensus 6 Al~~L~al~k~~~~r~l~~yW~~llP~~~~~~~~~~~sLlt~il~---------Dp~~kvR~aA~~~l~~lL~g 70 (182)
T PF13251_consen 6 ALQCLQALAKSTDKRSLFGYWPALLPDSVLQGRPATPSLLTCILK---------DPSPKVRAAAASALAALLEG 70 (182)
T ss_pred HHHHHHHHHHhcCCceeHhhHHHHCCCCCCcCCCCCcchhHHHHc---------CCchhHHHHHHHHHHHHHHc
Confidence 45677889988 7667899999999976 566654 33458999999999988765
No 98
>smart00544 MA3 Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains Ponting (TIBS) "Novel eIF4G domain homologues" in press
Probab=23.31 E-value=2.8e+02 Score=23.21 Aligned_cols=43 Identities=23% Similarity=0.427 Sum_probs=33.0
Q ss_pred cchhhHHHHHHHHhhccccCCCCCCCcchHHHHHHHHHHHHHHHHhcCCCcch
Q psy11212 360 KYLHELLPSILSCIVSKQLCGRPEADNHWALRDFASRLLTQVSKNFNTSTNNI 412 (448)
Q Consensus 360 pYlHqLlp~lLtCll~k~l~~~~~~~~hw~LRd~AA~lL~~I~~~~~~sy~~L 412 (448)
.|.|.++-.+++|.+.+. . .-|++.+.++..+|++-.-+-.++
T Consensus 32 ~~~~~vv~~~i~~~le~~----~------~~~~~~~~Ll~~L~~~~~~~~~~~ 74 (113)
T smart00544 32 EQHHEVVKVLLTCALEEK----R------TYREMYSVLLSRLCQANVISTKQF 74 (113)
T ss_pred cchHHHHHHHHHHHHcCC----c------cHHHHHHHHHHHHHHcCCcCHHHH
Confidence 489999999999999862 1 359999999999997744433333
No 99
>PF14376 Haem_bd: Haem-binding domain
Probab=23.00 E-value=80 Score=28.37 Aligned_cols=77 Identities=13% Similarity=0.128 Sum_probs=52.4
Q ss_pred CCCCcchhhhhhhhhhhhcccCCccC------------ChhHHHHHHHhcCCCCCcCCCCCCccCceecCCCCceeEeec
Q psy11212 115 PPVDKSAQKLESISFKCMSHSKRVKL------------LPVDIDDALNMKNIEPIYGLTVSQFLPFKHASGGGRELHFTD 182 (448)
Q Consensus 115 ~~~~~~~q~l~~~A~Kfm~hskR~kL------------t~~DIn~AL~~~niEPLyGy~s~~~~~f~~~s~~g~~l~~~~ 182 (448)
...+..++.+++.|. |-.||..+++ ...||+.+.|.+|+....-|+ ..
T Consensus 30 ~~~p~~v~~il~~~C-ydCHSn~T~~PwYa~i~p~s~l~~~dI~~Gr~~lNfs~~~~~~-------------------~~ 89 (137)
T PF14376_consen 30 IKAPEEVKIILKNSC-YDCHSNNTRYPWYANIAPASWLMEKDIKEGRRHLNFSEWGSYS-------------------KR 89 (137)
T ss_pred ccchHHHHHHHHccc-cccCCCCCCCccceecCchHHHHHHHHHHHHHHhCcchhhhcC-------------------cc
Confidence 334445666766665 6789999775 569999999999998874331 22
Q ss_pred cccccHHHHHhc-cCCCCCCccceEEEEEE
Q psy11212 183 EKEVDFEDLLTN-LNPKSALETHLRTHWLA 211 (448)
Q Consensus 183 D~evdL~~ii~~-~lpkiP~~~~~~aHWLa 211 (448)
-++.+|+++.+. .--+.|...-+..||=|
T Consensus 90 ~~~~~l~~i~~~I~~g~MP~~~Y~~~H~~a 119 (137)
T PF14376_consen 90 KQEAKLAKIEEVIEDGEMPPPSYTLLHWEA 119 (137)
T ss_pred cCHHHHHHHHHHHHcCCCChHHHhhhCCCC
Confidence 456677777654 23457777777777754
No 100
>KOG1241|consensus
Probab=22.83 E-value=4.3e+02 Score=30.72 Aligned_cols=57 Identities=9% Similarity=0.151 Sum_probs=43.7
Q ss_pred CcchHHHHHHHHHHHHHHHHhcCCCcchHHHHHHHHHHHhcCCCCCcc---chhhhhhhhc
Q psy11212 385 DNHWALRDFASRLLTQVSKNFNTSTNNIQTRVTRLLSNAINDPKINFP---SLYGAIAGEG 442 (448)
Q Consensus 385 ~~hw~LRd~AA~lL~~I~~~~~~sy~~L~~RI~~tl~k~l~d~~~~l~---t~YGAl~GL~ 442 (448)
+.+-.|.+.=..+|..|.+|.+......--.|+..|++.|-. .++-. .-+|||.+|.
T Consensus 566 ~q~~eLQs~Lc~~Lq~i~rk~~~~~~~~~d~iM~lflri~~s-~~s~~v~e~a~laV~tl~ 625 (859)
T KOG1241|consen 566 AQLNELQSLLCNTLQSIIRKVGSDIREVSDQIMGLFLRIFES-KRSAVVHEEAFLAVSTLA 625 (859)
T ss_pred HHHHHHHHHHHHHHHHHHHHccccchhHHHHHHHHHHHHHcC-CccccchHHHHHHHHHHH
Confidence 456667777778888999999998888899999999999987 34443 4566766654
No 101
>PF03810 IBN_N: Importin-beta N-terminal domain; InterPro: IPR001494 Karyopherins are a group of proteins involved in transporting molecules through the pores of the nuclear envelope. Karyopherins, which may act as importins or exportins, are part of the Importin-beta super-family, which all share a similar three-dimensional structure. Members of the importin-beta (karyopherin-beta) family can bind and transport cargo by themselves, or can form heterodimers with importin-alpha. As part of a heterodimer, importin-beta mediates interactions with the pore complex, while importin-alpha acts as an adaptor protein to bind the nuclear localisation signal (NLS) on the cargo through the classical NLS import of proteins. Importin-beta is a helicoidal molecule constructed from 19 HEAT repeats. Many nuclear pore proteins contain FG sequence repeats that can bind to HEAT repeats within importins [, ], which is important for importin-beta mediated transport. Ran GTPase helps to control the unidirectional transfer of cargo. The cytoplasm contains primarily RanGDP and the nucleus RanGTP through the actions of RanGAP and RanGEF, respectively. In the nucleus, RanGTP binds to importin-beta within the importin/cargo complex, causing a conformational change in importin-beta that releases it from importin-alpha-bound cargo. As a result, the N-terminal auto-inhibitory region on importin-alpha is free to loop back and bind to the major NLS-binding site, causing the cargo to be released []. There are additional release factors as well. This entry represents the N-terminal domain of karyopherins that is important for the binding of the Ran protein []. More information about these proteins can be found at Protein of the Month: Importins [].; GO: 0008565 protein transporter activity, 0006886 intracellular protein transport; PDB: 3NC1_A 3NBY_D 3NBZ_D 3NC0_A 3GJX_D 1IBR_D 1QGR_A 3LWW_A 1F59_A 2Q5D_A ....
Probab=22.63 E-value=2.6e+02 Score=21.35 Aligned_cols=38 Identities=16% Similarity=0.196 Sum_probs=28.2
Q ss_pred chHHHHHHHHHHHHHHHHh-c--------CCCcchHHHHHHHHHHHh
Q psy11212 387 HWALRDFASRLLTQVSKNF-N--------TSTNNIQTRVTRLLSNAI 424 (448)
Q Consensus 387 hw~LRd~AA~lL~~I~~~~-~--------~sy~~L~~RI~~tl~k~l 424 (448)
+-.+|-+|+-+|...+.++ . .--..-|..|-..++..|
T Consensus 29 ~~~~R~~A~i~LKn~I~~~W~~~~~~~~~~~~~~~k~~Ik~~ll~~l 75 (77)
T PF03810_consen 29 DPEVRQLAAILLKNLIKKNWSPSKQKGWSQLPEEEKEQIKSQLLQLL 75 (77)
T ss_dssp CHHHHHHHHHHHHHHHHHSGGHHHHHHHHGSSHHHHHHHHHHHHHHH
T ss_pred CHHHHHHHHHHHHHHHHHcCchhhccCCCCCCHHHHHHHHHHHHHHH
Confidence 3489999999999988887 5 444566777777666665
No 102
>PF03847 TFIID_20kDa: Transcription initiation factor TFIID subunit A; InterPro: IPR003228 Human transcription initiation factor TFIID is composed of the TATA-binding polypeptide (TBP) and at least 13 TBP-associated factors (TAFs) that collectively or individually are involved in activator-dependent transcription [].; GO: 0006352 transcription initiation, DNA-dependent, 0005669 transcription factor TFIID complex; PDB: 1H3O_B.
Probab=21.96 E-value=56 Score=26.06 Aligned_cols=34 Identities=21% Similarity=0.204 Sum_probs=28.5
Q ss_pred CcchhhhhhhhhhhhcccCCccCChhHHHHHHHh
Q psy11212 118 DKSAQKLESISFKCMSHSKRVKLLPVDIDDALNM 151 (448)
Q Consensus 118 ~~~~q~l~~~A~Kfm~hskR~kLt~~DIn~AL~~ 151 (448)
+.|+..+...|.+..+|-+-.+|.+.||...|+-
T Consensus 31 ddFv~~v~~~ac~lAKhR~s~tle~~Dv~~~Ler 64 (68)
T PF03847_consen 31 DDFVDDVVSFACRLAKHRKSSTLEVKDVQLHLER 64 (68)
T ss_dssp HHHHHHHHHHHHHHHHHTT-SEE-HHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHhccCCCCCCHHHHHHHHHh
Confidence 3588889999999999999999999999998863
No 103
>COG5116 RPN2 26S proteasome regulatory complex component [Posttranslational modification, protein turnover, chaperones]
Probab=21.71 E-value=3.4e+02 Score=30.75 Aligned_cols=158 Identities=16% Similarity=0.119 Sum_probs=95.5
Q ss_pred hHHHHHHHHHHHHHhcCCCHHHHH-----HHH--HhhhcCCCcccchhhHHHHHHhhhhhcc-------ccCCHHHHHHH
Q psy11212 278 SVEQQLYYKEITEACVGSDEGRRF-----EAL--QSLATDPGLHEMLPRMCTFIAEGVKVNV-------VQNNLALLIYL 343 (448)
Q Consensus 278 SkElQlyf~~It~al~~~~e~~r~-----~aL--~sL~tD~gL~qLlPyfv~FI~e~V~~nl-------~~~nl~~L~~l 343 (448)
=--||++++-|++.+-+.|.-.|- .|| ---....-+.-||||-+.=..+.|++-- ..+|-.. +
T Consensus 511 ygrqe~add~I~ell~d~ds~lRy~G~fs~alAy~GTgn~~vv~~lLh~avsD~nDDVrRAAViAlGfvc~~D~~~---l 587 (926)
T COG5116 511 YGRQEMADDYINELLYDKDSILRYNGVFSLALAYVGTGNLGVVSTLLHYAVSDGNDDVRRAAVIALGFVCCDDRDL---L 587 (926)
T ss_pred hhhHHHHHHHHHHHhcCchHHhhhccHHHHHHHHhcCCcchhHhhhheeecccCchHHHHHHHHheeeeEecCcch---h
Confidence 345677788788877776666553 222 2233344467788887666666665421 1234332 5
Q ss_pred HHHHHHHhcCCCcccccchhhHHHHHHHHh-hccccCCC---C-CCCcchHHHHHHHHHHHHHHHHhcCCCcchHHHHHH
Q psy11212 344 MRMVKALLDNQSLYLEKYLHELLPSILSCI-VSKQLCGR---P-EADNHWALRDFASRLLTQVSKNFNTSTNNIQTRVTR 418 (448)
Q Consensus 344 l~~v~ALl~N~~l~iepYlHqLlp~lLtCl-l~k~l~~~---~-~~~~hw~LRd~AA~lL~~I~~~~~~sy~~L~~RI~~ 418 (448)
.+.++-|..|-+.|...=..--+- +-|. -+.+.|.+ + -.+..=-+|+-|---++.|.-++...-+.=-.||.+
T Consensus 588 v~tvelLs~shN~hVR~g~AvaLG--iacag~G~~~a~diL~~L~~D~~dfVRQ~AmIa~~mIl~Q~n~~Lnp~v~~I~k 665 (926)
T COG5116 588 VGTVELLSESHNFHVRAGVAVALG--IACAGTGDKVATDILEALMYDTNDFVRQSAMIAVGMILMQCNPELNPNVKRIIK 665 (926)
T ss_pred hHHHHHhhhccchhhhhhhHHHhh--hhhcCCccHHHHHHHHHHhhCcHHHHHHHHHHHHHHHHhhcCcccChhHHHHHH
Confidence 667777777666554321111111 1121 01111100 0 001122368777788899999999888888899999
Q ss_pred HHHHHhcCCCCCccchhhhhhh
Q psy11212 419 LLSNAINDPKINFPSLYGAIAG 440 (448)
Q Consensus 419 tl~k~l~d~~~~l~t~YGAl~G 440 (448)
.|.+++.|-+.+-...+||+.|
T Consensus 666 ~f~~vI~~Khe~glaklGA~la 687 (926)
T COG5116 666 KFNRVIVDKHESGLAKLGAVLA 687 (926)
T ss_pred HHHHHHhhhhHhHHHHHHHHHH
Confidence 9999999998888999999875
No 104
>PF12170 DNA_pol3_tau_5: DNA polymerase III tau subunit V interacting with alpha; InterPro: IPR021029 This domain family is found in bacteria, and is approximately 140 amino acids in length. The family is found in association with PF00004 from PFAM. Domains I-III are shared between the tau and the gamma subunits, while most of the DnaB-binding Domain IV and all of the alpha-interacting Domain V are unique to tau. The extreme C-terminal region of this domain 5 is the part which interacts with the alpha subunit of the DNA polymerase III holoenzyme [, ]. ; GO: 0003887 DNA-directed DNA polymerase activity; PDB: 2AYA_A.
Probab=21.29 E-value=2e+02 Score=26.16 Aligned_cols=54 Identities=22% Similarity=0.306 Sum_probs=25.9
Q ss_pred eeeeccccccchHHHHHHHHHHHHHhcC----------CC--------------HHHHHHHHHhhhcCCCcccchhhH
Q psy11212 267 VHVKQLATHELSVEQQLYYKEITEACVG----------SD--------------EGRRFEALQSLATDPGLHEMLPRM 320 (448)
Q Consensus 267 v~vkp~~~h~LSkElQlyf~~It~al~~----------~~--------------e~~r~~aL~sL~tD~gL~qLlPyf 320 (448)
..++|..+|..+...+.=...-.....| .+ .+++..|-++|.+||.++.|.=.|
T Consensus 56 L~L~~~q~HL~~~~a~~~L~~ALs~~~g~~i~L~I~~~~~~~~~TP~e~r~~i~~erl~qA~~~l~~D~~Vq~L~~~F 133 (142)
T PF12170_consen 56 LHLRPSQKHLNNDSAQEQLQQALSEYLGEPIKLSIEEGDNPARETPLEIRQRIYQERLQQAKQSLQNDPNVQFLQQRF 133 (142)
T ss_dssp EEE-GGGTTT--HHHHHHHHHHHHHHHSS--EEEEEE---TTS--HHHHHHHHHHHHHHHHHHHHHHT-S--------
T ss_pred EEEChhhHhhCCHHHHHHHHHHHHHHhCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHcCHHHHHHHHHh
Confidence 4678888888888765444433332222 11 135578999999999999885443
No 105
>PF04192 Utp21: Utp21 specific WD40 associated putative domain ; InterPro: IPR007319 A large ribonuclear protein complex is required for the processing of the small-ribosomal-subunit rRNA - the small-subunit (SSU) processome [, ]. This preribosomal complex contains the U3 snoRNA and at least 40 proteins, which have the following properties: They are nucleolar. They are able to coimmunoprecipitate with the U3 snoRNA and Mpp10 (a protein specific to the SSU processome). They are required for 18S rRNA biogenesis. There appears to be a linkage between polymerase I transcription and the formation of the SSU processome; as some, but not all, of the SSU processome components are required for pre-rRNA transcription initiation. These SSU processome components have been termed t-Utps. They form a pre-complex with pre-18S rRNA in the absence of snoRNA U3 and other SSU processome components. It has been proposed that the t-Utp complex proteins are both rDNA and rRNA binding proteins that are involved in the initiation of pre18S rRNA transcription. Initially binding to rDNA then associating with the 5' end of the nascent pre18S rRNA. The t-Utpcomplex forms the nucleus around which the rest of the SSU processome components, including snoRNA U3, assemble []. From electron microscopy the SSU processome may correspond to the terminal knobs visualized at the 5' ends of nascent 18S rRNA. Utp21 is a component of the SSU processome, which is required for pre-18S rRNA processing. It interacts with Utp18 [].; GO: 0006364 rRNA processing, 0032040 small-subunit processome
Probab=20.98 E-value=2.5e+02 Score=27.73 Aligned_cols=58 Identities=17% Similarity=0.215 Sum_probs=40.8
Q ss_pred HHHHHHHhcCCCHHHHHHHHHhhh--cCCCcccchhhHHHHHHhhhhhccccCCHHHHHHHHHH
Q psy11212 285 YKEITEACVGSDEGRRFEALQSLA--TDPGLHEMLPRMCTFIAEGVKVNVVQNNLALLIYLMRM 346 (448)
Q Consensus 285 f~~It~al~~~~e~~r~~aL~sL~--tD~gL~qLlPyfv~FI~e~V~~nl~~~nl~~L~~ll~~ 346 (448)
|..+++-+.+-.+..-..-+.+|. +..++.. +-.|++|+...++.+ +|.+.+...|.+
T Consensus 126 y~~~~~~LkslsPS~iDlEIRsL~~~~~~~~~~-l~~Fl~~l~~~L~~~---~dfElvQa~l~v 185 (237)
T PF04192_consen 126 YSEFLEYLKSLSPSAIDLEIRSLSPESGGSYEE-LVSFLRFLTYRLKSR---RDFELVQAYLSV 185 (237)
T ss_pred HHHHHHHHHhCChhHHHHHHHhccCccCCcHHH-HHHHHHHHHHHHHcC---CCHHHHHHHHHH
Confidence 677777777766666677778888 5445554 667899999998866 688755444433
No 106
>COG5094 TAF9 Transcription initiation factor TFIID, subunit TAF9 (also component of histone acetyltransferase SAGA) [Transcription]
Probab=20.82 E-value=76 Score=28.47 Aligned_cols=34 Identities=21% Similarity=0.205 Sum_probs=28.3
Q ss_pred cchhhhhhhhhhhhcccCCccCCh---hHHHHHHHhc
Q psy11212 119 KSAQKLESISFKCMSHSKRVKLLP---VDIDDALNMK 152 (448)
Q Consensus 119 ~~~q~l~~~A~Kfm~hskR~kLt~---~DIn~AL~~~ 152 (448)
.-.|.++..|.-|..|+.|...++ +||..|+..+
T Consensus 46 RYTq~vl~Dalvya~htgrg~~a~l~veDvrLA~at~ 82 (145)
T COG5094 46 RYTQDVLEDALVYAKHTGRGHIATLGVEDVRLALATK 82 (145)
T ss_pred HHHHHHHHHHHHHHHhcCCCccCcccHHHHHHHHHHH
Confidence 346888999999999999865555 9999999876
No 107
>PF04793 Herpes_BBRF1: BRRF1-like protein; InterPro: IPR006878 Most proteins in this entry are uncharacterised viral proteins translated from gene 49. The UL6 locus of pseudorabies virus (PRV) has a gene cluster with homology to herpes simplex virus UL5, UL6, UL7 and UL8, Epstein-Barr virus BBRF1 and BBRF2, and Kaposi sarcoma-associated herpes virus ORF43 and ORF42 [].
Probab=20.34 E-value=6.3e+02 Score=25.80 Aligned_cols=125 Identities=13% Similarity=0.185 Sum_probs=77.1
Q ss_pred cchHHHHHHHHHHHHHhcCCCHHHHHHHHHhhhcCCCcccchhhHHHHHHhhhhhccccCCHHHHHHHHHHHHHHhcCCC
Q psy11212 276 ELSVEQQLYYKEITEACVGSDEGRRFEALQSLATDPGLHEMLPRMCTFIAEGVKVNVVQNNLALLIYLMRMVKALLDNQS 355 (448)
Q Consensus 276 ~LSkElQlyf~~It~al~~~~e~~r~~aL~sL~tD~gL~qLlPyfv~FI~e~V~~nl~~~nl~~L~~ll~~v~ALl~N~~ 355 (448)
.+..|--+|.-||..++.-.-+.++ .++..+ ...+=||...+.+- .+-..-..+-.+...|.....
T Consensus 29 ~~~leR~~fL~kv~q~L~qhr~~E~-~Ivp~i------~~ni~y~L~~L~~~-------~~~~~~~~i~~~L~~l~~~~d 94 (284)
T PF04793_consen 29 LLALERGLFLLKVCQVLMQHRQSEP-FIVPKI------RSNIIYFLEELKEL-------SPGDCQEAIKEILDHLEEAGD 94 (284)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhh-hHHHHH------HHHHHHHHHHHHhh-------CChhHHHHHHHHHHHHHhCCC
Confidence 3446667899999998875433222 222222 22334444444322 223333345566667788999
Q ss_pred cccccchhhHHHHHHHHhhc--ccc--CCCCCCCcchH------HHHHHHHHHHHHHHHhcCCCcchHH
Q psy11212 356 LYLEKYLHELLPSILSCIVS--KQL--CGRPEADNHWA------LRDFASRLLTQVSKNFNTSTNNIQT 414 (448)
Q Consensus 356 l~iepYlHqLlp~lLtCll~--k~l--~~~~~~~~hw~------LRd~AA~lL~~I~~~~~~sy~~L~~ 414 (448)
++|+.+|-..+|-++.|==- ... +........|. +|+...+.+..+++.+..+-++++.
T Consensus 95 ~~L~~~L~~~l~~ll~~~yP~~~~~~~~~~~~~~~~Wcl~~l~~vs~~~r~~v~~~l~~~~~smpS~~a 163 (284)
T PF04793_consen 95 SNLERELAKGLPKLLGCKYPHILNVWPVSPEQPYHEWCLHHLLGVSEEYRKKVSETLDDKHTSMPSFQA 163 (284)
T ss_pred cchHHHHHHHHHHHHhhhCCCcCCCCccCCCCChhhHHHhhHhhcCHHHHHHHHHHHhccccccHhHHH
Confidence 99999999999999988522 112 11111223454 5677888889998888888888864
Done!