Query psy4691
Match_columns 380
No_of_seqs 214 out of 454
Neff 6.0
Searched_HMMs 46136
Date Fri Aug 16 23:26:05 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy4691.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4691hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2030|consensus 100.0 3.7E-92 7.9E-97 732.5 26.6 342 11-380 232-632 (911)
2 COG1293 Predicted RNA-binding 100.0 6.5E-36 1.4E-40 314.2 15.7 239 80-367 266-564 (564)
3 PF05833 FbpA: Fibronectin-bin 99.7 1.2E-17 2.6E-22 171.7 3.5 175 16-280 233-412 (455)
4 PF05670 DUF814: Domain of unk 97.4 9.2E-05 2E-09 60.4 2.2 22 315-336 69-90 (90)
5 COG1293 Predicted RNA-binding 95.9 0.14 3E-06 55.0 14.0 119 75-257 257-376 (564)
6 COG1730 GIM5 Predicted prefold 91.7 9.4 0.0002 34.1 15.6 38 244-284 93-130 (145)
7 cd00584 Prefoldin_alpha Prefol 70.5 66 0.0014 27.3 12.9 33 244-279 86-118 (129)
8 PF03993 DUF349: Domain of Unk 63.2 39 0.00084 25.8 6.9 16 80-95 6-21 (77)
9 PF04568 IATP: Mitochondrial A 62.9 26 0.00056 29.4 6.1 27 104-130 69-99 (100)
10 cd00890 Prefoldin Prefoldin is 58.3 1.1E+02 0.0024 25.6 15.5 33 244-279 86-118 (129)
11 PRK03947 prefoldin subunit alp 57.0 1.3E+02 0.0028 26.0 16.3 28 244-271 93-120 (140)
12 KOG0687|consensus 50.4 70 0.0015 32.6 7.8 28 139-166 152-179 (393)
13 TIGR00293 prefoldin, archaeal 48.8 1.6E+02 0.0035 24.7 13.0 21 244-264 85-105 (126)
14 PF09726 Macoilin: Transmembra 46.8 1.9E+02 0.004 32.4 11.2 53 104-159 549-601 (697)
15 PF02757 YLP: YLP motif; Inte 44.1 12 0.00026 18.1 0.7 6 356-361 3-8 (9)
16 PF02996 Prefoldin: Prefoldin 42.1 2E+02 0.0043 23.8 10.0 40 232-279 69-108 (120)
17 PF15066 CAGE1: Cancer-associa 39.3 3.2E+02 0.0069 29.1 10.8 63 104-172 314-376 (527)
18 COG5187 RPN7 26S proteasome re 38.7 1.6E+02 0.0034 29.9 8.1 29 138-166 162-190 (412)
19 PF01519 DUF16: Protein of unk 37.6 1.9E+02 0.0042 24.4 7.4 47 105-158 51-97 (102)
20 PF05524 PEP-utilisers_N: PEP- 36.6 1.7E+02 0.0037 24.5 7.2 54 108-161 36-93 (123)
21 PF04048 Sec8_exocyst: Sec8 ex 36.5 1.3E+02 0.0028 26.4 6.6 15 76-90 62-76 (142)
22 PF15463 ECM11: Extracellular 36.4 2.9E+02 0.0063 24.1 9.9 67 87-156 64-133 (139)
23 PF06191 DUF995: Protein of un 34.7 28 0.0006 31.1 2.1 24 297-320 67-91 (145)
24 PF07462 MSP1_C: Merozoite sur 34.4 6.4E+02 0.014 27.4 14.2 41 238-278 120-161 (574)
25 TIGR02894 DNA_bind_RsfA transc 34.2 3.7E+02 0.0079 24.6 9.3 53 105-157 102-154 (161)
26 PF10168 Nup88: Nuclear pore c 34.1 3.6E+02 0.0078 30.2 11.0 67 80-148 540-606 (717)
27 PF05384 DegS: Sensor protein 33.6 3.7E+02 0.0079 24.4 10.2 20 110-129 101-120 (159)
28 PF14193 DUF4315: Domain of un 30.4 3E+02 0.0064 22.3 9.8 58 104-176 5-62 (83)
29 KOG0018|consensus 28.9 1E+03 0.022 28.1 13.5 55 106-160 233-287 (1141)
30 KOG4460|consensus 28.1 3.9E+02 0.0085 29.2 9.4 66 77-144 560-625 (741)
31 cd00584 Prefoldin_alpha Prefol 27.6 2.6E+02 0.0057 23.6 7.0 6 80-85 84-89 (129)
32 PF08336 P4Ha_N: Prolyl 4-Hydr 26.4 4.1E+02 0.0089 22.7 9.2 23 149-174 61-83 (134)
33 PF02770 Acyl-CoA_dh_M: Acyl-C 26.0 64 0.0014 23.1 2.4 27 347-373 25-51 (52)
34 PF02996 Prefoldin: Prefoldin 25.8 2.9E+02 0.0063 22.7 6.8 7 80-86 74-80 (120)
35 PF07889 DUF1664: Protein of u 25.2 4.6E+02 0.01 22.9 10.0 26 105-130 66-91 (126)
36 PF13747 DUF4164: Domain of un 23.9 4E+02 0.0086 21.6 8.3 23 105-127 37-59 (89)
37 PF15437 PGBA_C: Plasminogen-b 23.8 1.9E+02 0.0041 23.5 4.8 18 239-256 33-50 (86)
38 smart00685 DM14 Repeats in fly 23.7 3E+02 0.0065 20.9 5.7 48 118-181 3-50 (59)
39 PF13747 DUF4164: Domain of un 23.2 4.1E+02 0.0089 21.6 10.3 53 77-130 16-69 (89)
40 PF11221 Med21: Subunit 21 of 22.9 3.2E+02 0.0069 24.0 6.7 70 80-158 66-138 (144)
41 PF13863 DUF4200: Domain of un 22.3 4.6E+02 0.01 21.8 9.0 37 104-140 78-114 (126)
42 PF05833 FbpA: Fibronectin-bin 22.1 20 0.00043 37.1 -1.3 44 104-152 300-343 (455)
43 PF08580 KAR9: Yeast cortical 21.8 4.5E+02 0.0097 29.3 9.0 24 146-169 106-129 (683)
44 PF04363 DUF496: Protein of un 21.5 4.7E+02 0.01 21.6 7.2 19 157-175 44-62 (95)
45 PF07889 DUF1664: Protein of u 21.4 5.5E+02 0.012 22.4 10.0 44 104-147 51-94 (126)
46 PF10779 XhlA: Haemolysin XhlA 21.1 3.9E+02 0.0084 20.5 8.9 29 106-134 5-33 (71)
47 PRK13729 conjugal transfer pil 20.9 3.5E+02 0.0077 28.8 7.5 11 340-350 210-220 (475)
48 PRK05423 hypothetical protein; 20.7 4.9E+02 0.011 21.8 6.8 18 158-175 52-69 (104)
49 KOG0994|consensus 20.0 1.6E+03 0.035 27.2 13.3 46 125-170 1451-1497(1758)
No 1
>KOG2030|consensus
Probab=100.00 E-value=3.7e-92 Score=732.54 Aligned_cols=342 Identities=51% Similarity=0.820 Sum_probs=313.9
Q ss_pred HhhcccHHHHHHHccCCCCccEEEeecCCCCCCcccceecccceecccCCCCCCcccccccCCcccccCCHHHHHHHHhc
Q psy4691 11 IHKCGPTLHVLVSNLNVSSQGYLVQKKEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFS 90 (380)
Q Consensus 11 ~~~~~~~~~~l~~~~~~~~~Gyiv~k~~~~~~~~~~~~~~~~F~P~~~~~~~~P~~l~~~~~~~~~~f~SFNeAlDeFFs 90 (380)
+..++.++..|+.... .+.|||...++..+.+.. .++|++||| + +.++.......|++||+|+|+|||
T Consensus 232 v~~~eE~~~elit~~~-~~~Gyi~~~k~~~~~s~~-~~~y~~f~p-~---------~~~fKs~~~k~fetf~ea~Def~S 299 (911)
T KOG2030|consen 232 VKDQEEEDRELITGKL-GSKGYILEEKEKKPISTK-EFIYDEFHP-L---------GVQFKSEPVKKFETFNEAVDEFFS 299 (911)
T ss_pred HHHHHHHHHHHHhCCC-cccchhHHhhcccCCCcc-ceeeccccc-c---------ccccccchhhhccchhhHHHHHHH
Confidence 4445567777877643 588999988877755433 899999999 4 577888888999999999999999
Q ss_pred chhhhhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHhcCCChHHHHHH
Q psy4691 91 TAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEAR 170 (380)
Q Consensus 91 ~~e~~k~~~k~~~~~~~~ikKLek~~~~qekrI~~le~~~E~~~~kgeLI~aN~~~Ve~~i~~v~~~~~~~~~W~~I~~~ 170 (380)
.+++++..++...++.++.+||.+++.+|+++++.+++.++.++++|+||+.|..+|+.+|.+|++++++||+|.+|++|
T Consensus 300 ~IEsqk~~lr~~~~E~qa~kKld~vr~Dq~~rvq~Lqq~qe~~~~kAelIe~N~eLVe~~il~I~s~la~~m~W~dieKL 379 (911)
T KOG2030|consen 300 TIESQKSELRVKNQELQAEKKLDKVRKDQKERVQELQQVQELNRRKAELIEPNPELVEAAILAIQSALAQQMDWKDIEKL 379 (911)
T ss_pred HHhhhhHHHHHHhhHhHHHhhhhcchhhHHHHHHHHHHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHccCCcHhHHHH
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHhhcCCccccccccccccccEEEEEeccccccccccccccccCCCCCCCCCCcCCCCCeEEEEecCCCHHHHHHHHH
Q psy4691 171 VKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFF 250 (380)
Q Consensus 171 i~~~~~~g~pva~~I~~l~l~~n~i~l~L~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~V~idl~ls~~~NA~~yy 250 (380)
++.+|++|||+|..|.++++..|.+++.|.++.+ | + +|+....+-+.|+|||+||||+||++||
T Consensus 380 ik~eqKkGn~vAk~i~~l~l~~n~~t~~L~d~~d-d-----------~----~de~k~~e~~~VeiDLslsA~aNArr~y 443 (911)
T KOG2030|consen 380 IKSEQKKGNPVAKSIDKLKLEKNEATLRLKDPED-D-----------N----DDEKKSSEVIVVEIDLSLSAFANARRYY 443 (911)
T ss_pred HHHHHhcCchHhhhhhHHHHhhhhheeecCCccc-c-----------c----chhhccccceeeeeeccccchhhHHHHH
Confidence 9999999999999999999999999999993221 0 0 0112234458899999999999999999
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhhhhhhhhccccccccccceeeEEecCCcee----------------
Q psy4691 251 DLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLG---------------- 314 (380)
Q Consensus 251 ~~~K~~~~K~~kt~~~~~kalk~~e~k~~~~l~~~~~~~~~~~~Rk~~WFEKF~WFiSSeg~LV---------------- 314 (380)
+++|+++.|++||..++++|||++++|++++|++......+.+.|++|||||||||||||||||
T Consensus 444 ~~kk~aa~K~kKT~~a~eKAlK~~e~Ki~q~lk~~~~~~~i~k~Rk~~wFEKf~wFiSSEg~LVi~GrdaqQnEllvkky 523 (911)
T KOG2030|consen 444 EMKKEAAEKIKKTVDASEKALKSAERKIEQQLKQVKTVSRIKKIRKVYWFEKFHWFISSEGYLVIGGRDAQQNELLVKKY 523 (911)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccceeehheeEEEecCcEEEEcCCChhhhhHHHHhh
Confidence 9999999999999999999999999999999999888889999999999999999999999999
Q ss_pred -------------------------------------------cccccCCccceEEEEeCCCccccCCCCcccCCceeEE
Q psy4691 315 -------------------------------------------NVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMI 351 (380)
Q Consensus 315 -------------------------------------------S~AW~~k~~~~a~WV~~~QVSKt~psGeyL~~GsFmI 351 (380)
|.||+++++++||||||+|||||||||||||+|||||
T Consensus 524 ~~~~DiY~had~~gaSsviIkN~~~~eipp~TL~eAg~ma~~~S~aWdakvvssaWwv~~dqVSKtaptgeyL~~GSFmI 603 (911)
T KOG2030|consen 524 LEPGDIYVHADLHGASSVIIKNPPKTEIPPKTLEEAGSMALCYSKAWDAKVVSSAWWVYPDQVSKTAPTGEYLPTGSFMI 603 (911)
T ss_pred CCCCCeEEecccCCCceEEEeCCCCCCCChhhHHHHHHHHhHHhhhhcccccccceEEecccccccCCCCccccccceEE
Confidence 9999999999999999999999999999999999999
Q ss_pred ecccccCCCCCceeEEEEEEEeccCccCC
Q psy4691 352 RGKKNFFPPCQLAMGISFLFKLEESSISR 380 (380)
Q Consensus 352 rGkKnyl~p~~L~mg~G~lf~~~~~~~~~ 380 (380)
|||||||||++|+|||||||+|++++++|
T Consensus 604 rgkkN~lpp~~LvmG~GlLfrldes~~E~ 632 (911)
T KOG2030|consen 604 RGKKNFLPPHQLVMGLGLLFRLDESSIER 632 (911)
T ss_pred ecccCCCCchhheecceeEEEeccchhhh
Confidence 99999999999999999999999999876
No 2
>COG1293 Predicted RNA-binding protein homologous to eukaryotic snRNP [Transcription]
Probab=100.00 E-value=6.5e-36 Score=314.24 Aligned_cols=239 Identities=27% Similarity=0.431 Sum_probs=186.2
Q ss_pred CHHHHHHHHhcchhhhhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHh
Q psy4691 80 SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIA 159 (380)
Q Consensus 80 SFNeAlDeFFs~~e~~k~~~k~~~~~~~~ikKLek~~~~qekrI~~le~~~E~~~~kgeLI~aN~~~Ve~~i~~v~~~~~ 159 (380)
+||+++|+||...............+...++||++++..|++.++++++.++.++.+|+||++|+++|+..+..|+.+..
T Consensus 266 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~kl~~~i~~~~~~~~~~~~~~~~~r~~g~ll~an~~~i~~~~~~v~~~~~ 345 (564)
T COG1293 266 LFNEALDEKFERDKIKQLASELEKKLEKELKKLENKLEKQEDELEELEKAAEELRQKGELLYANLQLIEEGLKSVRLADF 345 (564)
T ss_pred HHHHHHHHHhhhhhHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhehhhh
Confidence 79999999999988765555567788999999999999999999999988889999999999999999999999987654
Q ss_pred cCCChHHHHHHHHHHhhcCCccccccccccccccEEEEEeccccccccccccccccCCCCCCCCCCcCCCCCeEEEEecC
Q psy4691 160 NQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLD 239 (380)
Q Consensus 160 ~~~~W~~I~~~i~~~~~~g~pva~~I~~l~l~~n~i~l~L~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~V~idl~ 239 (380)
.+ +.+|.--+ .....|++ ......++...| ++..|.|+..
T Consensus 346 ~~--~~~i~i~l---~~~~~~~~-----~~~~~~~~~~kl------------------------------k~~~~~~~~~ 385 (564)
T COG1293 346 YG--NEEIKIEL---DKSKTPSE-----NAQRYFKKYKKL------------------------------KGAKVNLDRQ 385 (564)
T ss_pred cc--ccceeecc---CcCcccch-----hhHHHhhhhhhc------------------------------cCceeehhhh
Confidence 43 22221100 00000000 000112222222 2578999999
Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHhHHhhhhhhhhccccccccccceeeEEecCCcee----
Q psy4691 240 LSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAE-KKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLG---- 314 (380)
Q Consensus 240 ls~~~NA~~yy~~~K~~~~K~~kt~~~~~kalk~~e-~k~~~~l~~~~~~~~~~~~Rk~~WFEKF~WFiSSeg~LV---- 314 (380)
+++.+|+..||++++++.+|... .++++.++ +.+++.+.+ ....+.|+++||||||||+|||||||
T Consensus 386 ~~~~~~~~~y~e~~~~~lek~~~-----~~~ieei~ee~ie~~~~~----~~k~~~kkk~~~ek~~~~~ss~Gf~vi~Gr 456 (564)
T COG1293 386 LSELKEAIAYYESAKTALEKAEG-----KKAIEEIREELIEEGLLK----SKKKKRKKKEWFEKFRWFVSSDGFLVIGGR 456 (564)
T ss_pred hhhhHHHHHHHHHHHHHHHhccc-----hhhHHHHHHHHHHHHHhh----hhHHhhhhhhhcccceeeeccCCeEEEEec
Confidence 99999999999999888776654 44555554 333333332 34556789999999999999999999
Q ss_pred ------------------------------------------------------cccccCCc-cceEEEEeCCCccccCC
Q psy4691 315 ------------------------------------------------------NVAWDAKV-VTNAWWVKADQVSKTAP 339 (380)
Q Consensus 315 ------------------------------------------------------S~AW~~k~-~~~a~WV~~~QVSKt~p 339 (380)
|+||.++. ++|+|||++.||||+||
T Consensus 457 na~qNe~l~~k~~~~~DlwfHa~~~~gshvvik~~~~~~~e~ti~eAA~~Aa~~Ska~~~~~v~vd~t~vk~vqv~K~a~ 536 (564)
T COG1293 457 NAKQNEELVKKYAEKDDLWFHADDIPGSHVVIKTEGKEPSEETILEAAQLAASYSKAWKSGLVPVDYTWVKPVQVPKGAK 536 (564)
T ss_pred CcccchHHHHhhcccCcEEEEccCCCCCeEEEeCCCCCCChHHHHHHHHHHHHhchHhhcCCCceEEEEEcccccCCCCC
Confidence 99999995 99999999999999999
Q ss_pred CCcccCCceeEEecccccCCCCCceeEE
Q psy4691 340 TGEFLTTGSFMIRGKKNFFPPCQLAMGI 367 (380)
Q Consensus 340 sGeyL~~GsFmIrGkKnyl~p~~L~mg~ 367 (380)
||+||++|+|||+|++||+..++|.+++
T Consensus 537 ~G~vl~~g~~~I~~k~~~~~~~~lk~~~ 564 (564)
T COG1293 537 SGEVLYKGQKTIRGKRDYITIVKLKLAV 564 (564)
T ss_pred CceEEecCcEEEEEccccccccccccCC
Confidence 9999999999999999999999988763
No 3
>PF05833 FbpA: Fibronectin-binding protein A N-terminus (FbpA); InterPro: IPR008616 This family consists of the N-terminal region of the prokaryotic fibronectin-binding protein, the C-terminal region is IPR008532 from INTERPRO. Fibronectin binding is considered to be an important virulence factor in streptococcal infections. Fibronectin is a dimeric glycoprotein that is present in a soluble form in plasma and extracellular fluids; it is also present in a fibrillar form on cell surfaces. Both the soluble and cellular forms of fibronectin may be incorporated into the extracellular tissue matrix. While fibronectin has critical roles in eukaryotic cellular processes, such as adhesion, migration and differentiation, it is also a substrate for the attachment of bacteria. The binding of pathogenic Streptococcus pyogenes and Staphylococcus aureus to epithelial cells via fibronectin facilitates their internalisation and systemic spread within the host [].; PDB: 3DOA_A 2ZBK_F 2HKJ_A 1Z5B_A 1Z5C_B 1MX0_F 1Z5A_A 1MU5_A 1Z59_A.
Probab=99.68 E-value=1.2e-17 Score=171.74 Aligned_cols=175 Identities=25% Similarity=0.422 Sum_probs=28.6
Q ss_pred cHHHHHHHccC-CCCccEEEeecCCCCCCcccceecccceecccCCCCCCcccccccCCcccccCCHHHHHHHHhcchhh
Q psy4691 16 PTLHVLVSNLN-VSSQGYLVQKKEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAES 94 (380)
Q Consensus 16 ~~~~~l~~~~~-~~~~Gyiv~k~~~~~~~~~~~~~~~~F~P~~~~~~~~P~~l~~~~~~~~~~f~SFNeAlDeFFs~~e~ 94 (380)
..+..|++... +...|||+...+.. ..|+||. +.++.+....+|+|||+|||+||+..+.
T Consensus 233 ~~~~~l~~~l~~~~~~p~l~~~~~~~----------~~f~~~~---------l~~~~~~~~~~f~s~~~ald~yf~~~~~ 293 (455)
T PF05833_consen 233 EAIRELLNELEEGQFKPYLYYDDGKP----------KDFSPFP---------LKQYEELEVKEFDSFNEALDEYFSEKEE 293 (455)
T ss_dssp HHHHHHHHHHT---S----EEE---------------EEESS-----------TT---S--EE-SSHHHHHHHHT-----
T ss_pred HHHHHhhhhcccccCccEEEEecCcc----------cEEEEEe---------ccccccccccCCCCHHHHHHHHHHHHHH
Confidence 44556666554 35778998876533 6888885 7777777789999999999999998875
Q ss_pred h-hHhH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHhcCCChHHHHHH
Q psy4691 95 H-KIDL---KAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEAR 170 (380)
Q Consensus 95 ~-k~~~---k~~~~~~~~ikKLek~~~~qekrI~~le~~~E~~~~kgeLI~aN~~~Ve~~i~~v~~~~~~~~~W~~I~~~ 170 (380)
+ ...+ +.....+..+++++++++.+++.++.++ ..+.++.+|+||.+|.+.+ +.|++|.++..+
T Consensus 294 ~~~~~~~~~~l~k~l~~~~~klek~l~~l~~~~~~~~-~~~~~~~~gelL~a~~~~i-----------~~g~~~~~l~~~ 361 (455)
T PF05833_consen 294 EERLEQKKKRLEKKLEKKIKKLEKKLEKLEEELEESE-KAENYREYGELLLANLHQI-----------KKGMKWVELEDF 361 (455)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-hhhHHHHHHHHHHHHHHHh-----------hCCCCEEEhhhh
Confidence 5 2222 2222456667777777777777777777 4477888888888888777 899999999887
Q ss_pred HHHHhhcCCccccccccccccccEEEEEeccccccccccccccccCCCCCCCCCCcCCCCCeEEEEecCCCHHHHHHHHH
Q psy4691 171 VKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFF 250 (380)
Q Consensus 171 i~~~~~~g~pva~~I~~l~l~~n~i~l~L~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~V~idl~ls~~~NA~~yy 250 (380)
+.. . .++.|+||+++|+++||..||
T Consensus 362 ~~~-~------------------------------------------------------~~i~I~Ld~~~s~~eNA~~yf 386 (455)
T PF05833_consen 362 YEE-G------------------------------------------------------EEIEIPLDPSLSPSENAQKYF 386 (455)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred hcc-C------------------------------------------------------CceEEEECCCCCHHHHHHHHH
Confidence 632 1 148899999999999999999
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy4691 251 DLKRSAAKKQQKTIQSTEKALKSAEKKTKQ 280 (380)
Q Consensus 251 ~~~K~~~~K~~kt~~~~~kalk~~e~k~~~ 280 (380)
+++|++..| ++.++.+++.+++++..
T Consensus 387 ~k~kK~k~k----~~~~~~~i~~~~~el~~ 412 (455)
T PF05833_consen 387 KKYKKLKRK----IEKLEERIEEAEKELEY 412 (455)
T ss_dssp ------------------------------
T ss_pred HHHHHHHHH----HHHHHHHHHHHHHHHHH
Confidence 999766544 45555555555555443
No 4
>PF05670 DUF814: Domain of unknown function (DUF814); InterPro: IPR008532 This domain occurs in proteins that have been annotated as Fibronectin/fibrinogen binding protein by similarity. This annotation comes from O34693 from SWISSPROT where the N-terminal region is involved in this activity []. Hence the activity of this C-terminal domain is unknown. This domain contains a conserved motif D/E-X-W/Y-X-H that may be functionally important.
Probab=97.38 E-value=9.2e-05 Score=60.38 Aligned_cols=22 Identities=18% Similarity=0.284 Sum_probs=21.0
Q ss_pred cccccCCccceEEEEeCCCccc
Q psy4691 315 NVAWDAKVVTNAWWVKADQVSK 336 (380)
Q Consensus 315 S~AW~~k~~~~a~WV~~~QVSK 336 (380)
|+||+.+..+.++|..+++|+|
T Consensus 69 Ska~~~~~~v~V~yt~~k~v~K 90 (90)
T PF05670_consen 69 SKAWKKGEKVEVDYTQGKYVKK 90 (90)
T ss_pred CHhhccCCCeEEEEeehHhccC
Confidence 9999999999999999999987
No 5
>COG1293 Predicted RNA-binding protein homologous to eukaryotic snRNP [Transcription]
Probab=95.94 E-value=0.14 Score=55.04 Aligned_cols=119 Identities=18% Similarity=0.212 Sum_probs=89.0
Q ss_pred ccccCCHHHHHHHHhcchhhh-hHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHH
Q psy4691 75 IQEFESFAAAVDEFFSTAESH-KIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILA 153 (380)
Q Consensus 75 ~~~f~SFNeAlDeFFs~~e~~-k~~~k~~~~~~~~ikKLek~~~~qekrI~~le~~~E~~~~kgeLI~aN~~~Ve~~i~~ 153 (380)
...|.+++..+|+||...... ++.+... ...+++++.+..++++|+.++++.+.++..++......++|-.-+..
T Consensus 257 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~l~~~~~kl~~~i~~~~~~~~~~~~~~~~~r~~g~ll~an~~~ 332 (564)
T COG1293 257 LKAYADLEKLFNEALDEKFERDKIKQLAS----ELEKKLEKELKKLENKLEKQEDELEELEKAAEELRQKGELLYANLQL 332 (564)
T ss_pred ccccchhhHHHHHHHHHHhhhhhHHHhHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 356788999999999888765 3333332 46789999999999999999999999999998999999888776666
Q ss_pred HHHHHhcCCChHHHHHHHHHHhhcCCccccccccccccccEEEEEeccccccccccccccccCCCCCCCCCCcCCCCCeE
Q psy4691 154 IRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSL 233 (380)
Q Consensus 154 v~~~~~~~~~W~~I~~~i~~~~~~g~pva~~I~~l~l~~n~i~l~L~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 233 (380)
|..+... +.+.+.. ..+.+.
T Consensus 333 i~~~~~~------------------------------------v~~~~~~------------------------~~~~i~ 352 (564)
T COG1293 333 IEEGLKS------------------------------------VRLADFY------------------------GNEEIK 352 (564)
T ss_pred hhhhhhh------------------------------------hehhhhc------------------------ccccee
Confidence 6554322 1111000 012478
Q ss_pred EEEecCCCHHHHHHHHHHHHHHHH
Q psy4691 234 VDVDLDLSAYANAKRFFDLKRSAA 257 (380)
Q Consensus 234 V~idl~ls~~~NA~~yy~~~K~~~ 257 (380)
|.+|.+.++-+|+.+||...++..
T Consensus 353 i~l~~~~~~~~~~~~~~~~~~klk 376 (564)
T COG1293 353 IELDKSKTPSENAQRYFKKYKKLK 376 (564)
T ss_pred eccCcCcccchhhHHHhhhhhhcc
Confidence 888888888899999999988775
No 6
>COG1730 GIM5 Predicted prefoldin, molecular chaperone implicated in de novo protein folding [Posttranslational modification, protein turnover, chaperones]
Probab=91.72 E-value=9.4 Score=34.12 Aligned_cols=38 Identities=18% Similarity=0.314 Sum_probs=23.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHh
Q psy4691 244 ANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKD 284 (380)
Q Consensus 244 ~NA~~yy~~~K~~~~K~~kt~~~~~kalk~~e~k~~~~l~~ 284 (380)
..|-.|+++++ .++.+.++....+|..+.+++.+....
T Consensus 93 ~eAie~l~k~~---~~l~~~~~~l~~~l~~l~~~~~~l~~~ 130 (145)
T COG1730 93 DEAIEFLKKRI---EELEKAIEKLQQALAELAQRIEQLEQE 130 (145)
T ss_pred HHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 56888888874 444566666666666666655554333
No 7
>cd00584 Prefoldin_alpha Prefoldin alpha subunit; Prefoldin is a hexameric molecular chaperone complex, found in both eukaryotes and archaea, that binds and stabilizes newly synthesized polypeptides allowing them to fold correctly. The complex contains two alpha and four beta subunits, the two subunits being evolutionarily related. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the structure of the hexamer. The structure of the complex consists of a double beta barrel assembly with six protruding coiled-coils.
Probab=70.55 E-value=66 Score=27.31 Aligned_cols=33 Identities=24% Similarity=0.288 Sum_probs=18.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy4691 244 ANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTK 279 (380)
Q Consensus 244 ~NA~~yy~~~K~~~~K~~kt~~~~~kalk~~e~k~~ 279 (380)
..|..|++++.+...+.. +...+.+......+.
T Consensus 86 ~eA~~~l~~r~~~l~~~~---~~l~~~l~~l~~~~~ 118 (129)
T cd00584 86 EEAIEFLDKKIEELTKQI---EKLQKELAKLKDQIN 118 (129)
T ss_pred HHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHH
Confidence 568899999866654443 333334444433433
No 8
>PF03993 DUF349: Domain of Unknown Function (DUF349); InterPro: IPR007139 This motif is found singly or as up to five tandem repeats in a small set of bacterial proteins. There are two or three alpha-helices, and possibly a beta-strand.
Probab=63.16 E-value=39 Score=25.81 Aligned_cols=16 Identities=38% Similarity=0.640 Sum_probs=13.0
Q ss_pred CHHHHHHHHhcchhhh
Q psy4691 80 SFAAAVDEFFSTAESH 95 (380)
Q Consensus 80 SFNeAlDeFFs~~e~~ 95 (380)
.|..|+|.||......
T Consensus 6 ~F~~a~~~~~~~~~~~ 21 (77)
T PF03993_consen 6 RFRAACDAFFDRRKEF 21 (77)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 5899999999887654
No 9
>PF04568 IATP: Mitochondrial ATPase inhibitor, IATP; InterPro: IPR007648 ATP synthase inhibitor prevents the enzyme from switching to ATP hydrolysis during collapse of the electrochemical gradient, for example during oxygen deprivation [] ATP synthase inhibitor forms a one to one complex with the F1 ATPase, possibly by binding at the alpha-beta interface. It is thought to inhibit ATP synthesis by preventing the release of ATP []. The minimum inhibitory region for bovine inhibitor (P01096 from SWISSPROT) is from residues 39 to 72 []. The inhibitor has two oligomeric states, dimer (the active state) and tetramer. At low pH , the inhibitor forms a dimer via antiparallel coiled coil interactions between the C-terminal regions of two monomers. At high pH, the inhibitor forms tetramers and higher oligomers by coiled coil interactions involving the N terminus and inhibitory region, thus preventing the inhibitory activity []. ; GO: 0004857 enzyme inhibitor activity, 0045980 negative regulation of nucleotide metabolic process, 0005739 mitochondrion; PDB: 1GMJ_B 1OHH_H 1HF9_B 2V7Q_J.
Probab=62.87 E-value=26 Score=29.39 Aligned_cols=27 Identities=30% Similarity=0.562 Sum_probs=16.4
Q ss_pred HHHHHHHHHHHHHHH----HHHHHHHHHHHH
Q psy4691 104 QERDALKKLENVKRD----HETRLSALEQTQ 130 (380)
Q Consensus 104 ~~~~~ikKLek~~~~----qekrI~~le~~~ 130 (380)
++.+.+++|...+.+ |++.|+++++..
T Consensus 69 ~EkEqL~~Lk~kl~~e~~~~~k~i~~le~~I 99 (100)
T PF04568_consen 69 KEKEQLKKLKEKLKEEIEHHRKEIDELEKHI 99 (100)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 333444555555555 888888888553
No 10
>cd00890 Prefoldin Prefoldin is a hexameric molecular chaperone complex, found in both eukaryotes and archaea, that binds and stabilizes newly synthesized polypeptides allowing them to fold correctly. The complex contains two alpha and four beta subunits, the two subunits being evolutionarily related. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the structure of the hexamer. The structure of the complex consists of a double beta barrel assembly with six protruding coiled-coils.
Probab=58.27 E-value=1.1e+02 Score=25.55 Aligned_cols=33 Identities=21% Similarity=0.328 Sum_probs=19.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy4691 244 ANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTK 279 (380)
Q Consensus 244 ~NA~~yy~~~K~~~~K~~kt~~~~~kalk~~e~k~~ 279 (380)
..|..++++++....+..+.+ .+.+..+..++.
T Consensus 86 ~eA~~~l~~r~~~l~~~~~~l---~~~~~~~~~~~~ 118 (129)
T cd00890 86 EEAIEFLKKRLETLEKQIEKL---EKQLEKLQDQIT 118 (129)
T ss_pred HHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHH
Confidence 678899999866655443333 334444443433
No 11
>PRK03947 prefoldin subunit alpha; Reviewed
Probab=57.00 E-value=1.3e+02 Score=25.97 Aligned_cols=28 Identities=14% Similarity=0.273 Sum_probs=17.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy4691 244 ANAKRFFDLKRSAAKKQQKTIQSTEKAL 271 (380)
Q Consensus 244 ~NA~~yy~~~K~~~~K~~kt~~~~~kal 271 (380)
.-|..+++++.....+..+++...-..+
T Consensus 93 ~eA~~~l~~~~~~l~~~~~~l~~~l~~~ 120 (140)
T PRK03947 93 DEAIEILDKRKEELEKALEKLEEALQKL 120 (140)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 6799999998777655544444433333
No 12
>KOG0687|consensus
Probab=50.37 E-value=70 Score=32.65 Aligned_cols=28 Identities=21% Similarity=0.313 Sum_probs=25.0
Q ss_pred HHHHhHHHHHHHHHHHHHHHhcCCChHH
Q psy4691 139 LIINNQESVDAAILAIRQDIANQLSWED 166 (380)
Q Consensus 139 LI~aN~~~Ve~~i~~v~~~~~~~~~W~~ 166 (380)
+++.+...|.+-|..++.++++|=||+-
T Consensus 152 lfy~D~~lV~~~iekak~liE~GgDWeR 179 (393)
T KOG0687|consen 152 LFYLDHDLVTESIEKAKSLIEEGGDWER 179 (393)
T ss_pred HhhccHHHHHHHHHHHHHHHHhCCChhh
Confidence 5678888999999999999999999975
No 13
>TIGR00293 prefoldin, archaeal alpha subunit/eukaryotic subunit 5. This model finds a set of small proteins from the Archaea and from Aquifex aeolicus that may represent two orthologous groups. The proteins are predicted to be mostly coiled coil, and may hit large numbers of proteins that contain coiled coil regions.
Probab=48.81 E-value=1.6e+02 Score=24.74 Aligned_cols=21 Identities=14% Similarity=0.235 Sum_probs=13.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q psy4691 244 ANAKRFFDLKRSAAKKQQKTI 264 (380)
Q Consensus 244 ~NA~~yy~~~K~~~~K~~kt~ 264 (380)
.-|..|++++.+...+..+.+
T Consensus 85 ~eA~~~l~~~~~~l~~~~~~l 105 (126)
T TIGR00293 85 EEAIEFLKKRIEELEKAIEKL 105 (126)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 668899998866654443333
No 14
>PF09726 Macoilin: Transmembrane protein; InterPro: IPR019130 This entry represents the multi-pass transmembrane protein Macoilin, which is highly conserved in eukaryotes. ; GO: 0016021 integral to membrane
Probab=46.79 E-value=1.9e+02 Score=32.38 Aligned_cols=53 Identities=17% Similarity=0.183 Sum_probs=29.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHh
Q psy4691 104 QERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIA 159 (380)
Q Consensus 104 ~~~~~ikKLek~~~~qekrI~~le~~~E~~~~kgeLI~aN~~~Ve~~i~~v~~~~~ 159 (380)
+.+.++++|+.-++.-++++..++.+...++.+- ..|-..+|.++.+++.+.+
T Consensus 549 ~lE~E~~~lr~elk~kee~~~~~e~~~~~lr~~~---~e~~~~~e~L~~aL~amqd 601 (697)
T PF09726_consen 549 QLESELKKLRRELKQKEEQIRELESELQELRKYE---KESEKDTEVLMSALSAMQD 601 (697)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---hhhhhhHHHHHHHHHHHHH
Confidence 4445566666666666666666666655554431 2233445555555555443
No 15
>PF02757 YLP: YLP motif; InterPro: IPR004019 The YLP motif is found in one or several copies in various Drosophila proteins. Its function is unknown, however the presence of completely conserved tyrosine residues and its presence in the human Erbb-2 and ErbB-4 receptor protein-tyrosine kinases (2.7.10.1 from EC) may suggest it could be a substrate for tyrosine kinases. ErbBs (1-4) are single-pass transmembrane proteins that activate a wide variety of signalling pathways, including those involved in proliferation, migration, differentiation, survival, and apoptosis; they are frequently misregulated in cancer []. ErbB-2 is an essential component of a neuregulin-receptor complex, although neuregulins do not interact with it alone. ErbB-4 specifically binds and is activated by neuregulins, NRG-2, NRG-3, heparin-binding EGF-like growth factor, betacellulin and NTAK [].
Probab=44.13 E-value=12 Score=18.15 Aligned_cols=6 Identities=33% Similarity=1.287 Sum_probs=3.0
Q ss_pred ccCCCC
Q psy4691 356 NFFPPC 361 (380)
Q Consensus 356 nyl~p~ 361 (380)
.||||+
T Consensus 3 eYLpP~ 8 (9)
T PF02757_consen 3 EYLPPV 8 (9)
T ss_pred cccCCC
Confidence 355554
No 16
>PF02996 Prefoldin: Prefoldin subunit; InterPro: IPR004127 This entry comprises of several prefoldin subunits. Prefoldin (PFD) is a chaperone that interacts exclusively with type II chaperonins, hetero-oligomers lacking an obligate co-chaperonin that are found only in eukaryotes (chaperonin-containing T-complex polypeptide-1 (CCT)) and archaea. Eukaryotic PFD is a multi-subunit complex containing six polypeptides in the molecular mass range of 14-23 kDa. In archaea, on the other hand, PFD is composed of two types of subunits, two alpha and four beta. The six subunits associate to form two back-to-back up-and-down eight-stranded barrels, from which hang six coiled coils. Each subunit contributes one (beta subunits) or two (alpha subunits) beta hairpin turns to the barrels. The coiled coils are formed by the N and C termini of an individual subunit. Overall, this unique arrangement resembles a jellyfish. The eukaryotic PFD hexamer is composed of six different subunits; however, these can be grouped into two alpha-like (PFD3 and -5) and four beta-like (PFD1, -2, -4, and -6) subunits based on amino acid sequence similarity with their archaeal counterparts. Eukaryotic PFD has a six-legged structure similar to that seen in the archaeal homologue [, ]. This family contains the archaeal alpha subunit, eukaryotic prefoldin subunits 3 and 5 and the UXT (ubiquitously expressed transcript) family. Eukaryotic PFD has been shown to bind both actin and tubulin co-translationally. The chaperone then delivers the target protein to CCT, interacting with the chaperonin through the tips of the coiled coils. No authentic target proteins of any archaeal PFD have been identified, to date.; GO: 0051082 unfolded protein binding, 0006457 protein folding, 0016272 prefoldin complex; PDB: 1FXK_C 2ZDI_C.
Probab=42.12 E-value=2e+02 Score=23.77 Aligned_cols=40 Identities=20% Similarity=0.332 Sum_probs=22.9
Q ss_pred eEEEEecCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy4691 232 SLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTK 279 (380)
Q Consensus 232 ~~V~idl~ls~~~NA~~yy~~~K~~~~K~~kt~~~~~kalk~~e~k~~ 279 (380)
+-|+.++ ..|..|+++|.+...+. ++.....++.+..++.
T Consensus 69 ~~vE~s~-----~eA~~~l~~r~~~l~~~---~~~l~~~~~~~~~~~~ 108 (120)
T PF02996_consen 69 YYVEMSL-----EEAIEFLKKRIKELEEQ---LEKLEKELAELQAQIE 108 (120)
T ss_dssp EEEEEEH-----HHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHH
T ss_pred eEEEecH-----HHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHH
Confidence 5555554 67999999996665443 3333444444443433
No 17
>PF15066 CAGE1: Cancer-associated gene protein 1 family
Probab=39.25 E-value=3.2e+02 Score=29.13 Aligned_cols=63 Identities=17% Similarity=0.241 Sum_probs=47.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHhcCCChHHHHHHHH
Q psy4691 104 QERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK 172 (380)
Q Consensus 104 ~~~~~ikKLek~~~~qekrI~~le~~~E~~~~kgeLI~aN~~~Ve~~i~~v~~~~~~~~~W~~I~~~i~ 172 (380)
.....+.||...-++|+-+|..|+-.---+..+-+-|...+..=+-.++.||.+.. .|+++|.
T Consensus 314 aLNEvL~kLk~tn~kQq~~IqdLq~sN~yLe~kvkeLQ~k~~kQqvfvDiinkLk~------niEeLIe 376 (527)
T PF15066_consen 314 ALNEVLQKLKHTNRKQQNRIQDLQCSNLYLEKKVKELQMKITKQQVFVDIINKLKE------NIEELIE 376 (527)
T ss_pred HHHHHHHHHHhhhHHHHHHHHHhhhccHHHHHHHHHHHHHhhhhhHHHHHHHHHHH------HHHHHHH
Confidence 56678899999999999999999866566666677777776666667777877753 4777763
No 18
>COG5187 RPN7 26S proteasome regulatory complex component, contains PCI domain [Posttranslational modification, protein turnover, chaperones]
Probab=38.68 E-value=1.6e+02 Score=29.86 Aligned_cols=29 Identities=21% Similarity=0.363 Sum_probs=25.9
Q ss_pred HHHHHhHHHHHHHHHHHHHHHhcCCChHH
Q psy4691 138 ELIINNQESVDAAILAIRQDIANQLSWED 166 (380)
Q Consensus 138 eLI~aN~~~Ve~~i~~v~~~~~~~~~W~~ 166 (380)
-+|+.+.-.|++.|.+++..++.|=||+.
T Consensus 162 g~~y~d~~vV~e~lE~~~~~iEkGgDWeR 190 (412)
T COG5187 162 GLIYGDRKVVEESLEVADDIIEKGGDWER 190 (412)
T ss_pred HHhhccHHHHHHHHHHHHHHHHhCCCHHh
Confidence 36788888999999999999999999975
No 19
>PF01519 DUF16: Protein of unknown function DUF16; InterPro: IPR002862 Proteins that contain this domain are of unknown function. It appears to be confined to proteins from Mycoplasma pneumoniae [].; PDB: 2BA2_C.
Probab=37.56 E-value=1.9e+02 Score=24.36 Aligned_cols=47 Identities=11% Similarity=0.251 Sum_probs=31.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHH
Q psy4691 105 ERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDI 158 (380)
Q Consensus 105 ~~~~ikKLek~~~~qekrI~~le~~~E~~~~kgeLI~aN~~~Ve~~i~~v~~~~ 158 (380)
....+.+|......|-+.|.++..++. ..|+.| .+|.+.|..|+.=|
T Consensus 51 qgeqI~kL~e~V~~QGEqIkel~~e~k---~qgktL----~~I~~~L~~inkRL 97 (102)
T PF01519_consen 51 QGEQINKLTEKVDKQGEQIKELQVEQK---AQGKTL----QLILKTLQSINKRL 97 (102)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHH----HHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHH----HHHHHHHHHHHHHH
Confidence 345678888889999999999985554 334433 44666666665543
No 20
>PF05524 PEP-utilisers_N: PEP-utilising enzyme, N-terminal; InterPro: IPR008731 This sequence identifies proteins which are a component of the phosphoenolpyruvate:sugar phosphotransferase system (PTS), a major carbohydrate active transport system. The PTS system is found throughout the bacterial kingdom, and is responsible for the coupled phosphorylation and translocation of numerous sugars across the cytoplasmic membrane []. This entry represents the N-terminal domain of enzyme I (EIN) which transfers the phosphoryl group from phosphoenolpyruvate (PEP) to the phosphoryl carrier protein (HPr) which in turn phosphorylates a group of membrane-associated proteins, known as enzyme II. The N-terminal domain of EI (EIN) extends from residues 1 to 259 and can be phosphorylated in a fully reversible manner by phosphorylated HPr. EIN, however, cannot be autophosphorylated by PEP [, ].; GO: 0005351 sugar:hydrogen symporter activity, 0008965 phosphoenolpyruvate-protein phosphotransferase activity, 0009401 phosphoenolpyruvate-dependent sugar phosphotransferase system, 0005737 cytoplasm; PDB: 2WQD_A 2XDF_B 2HWG_A 3EZB_A 2L5H_A 3EZA_A 1EZB_A 2EZA_A 1EZA_A 1EZC_A ....
Probab=36.55 E-value=1.7e+02 Score=24.48 Aligned_cols=54 Identities=15% Similarity=0.201 Sum_probs=27.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHhHHHHHH--HHHHHHHHHhcC
Q psy4691 108 ALKKLENVKRDHETRLSALEQTQLV--DKEKAELIINNQESVDA--AILAIRQDIANQ 161 (380)
Q Consensus 108 ~ikKLek~~~~qekrI~~le~~~E~--~~~kgeLI~aN~~~Ve~--~i~~v~~~~~~~ 161 (380)
++.||.+.+..-...|..+.+.... ....++++.+|+.+++. .++.|...+.+|
T Consensus 36 E~~rl~~Al~~~~~eL~~l~~~~~~~~~~~~a~If~ah~~~L~D~~l~~~v~~~I~~~ 93 (123)
T PF05524_consen 36 EIERLEQALEKAREELEQLAERAESKLGEEEAAIFEAHLMMLEDPELIDEVEELIREG 93 (123)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHCHSSCTHHHHHHHHHHT-HHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHhcCHhHHHHHHHHHhcc
Confidence 4455555555555555555433111 12344677777777655 555565555544
No 21
>PF04048 Sec8_exocyst: Sec8 exocyst complex component specific domain; InterPro: IPR007191 Sec8 is a component of the exocyst complex involved in the docking of exocystic vesicles with a fusion site on the plasma membrane. The exocyst complex is composed of Sec3, Sec5, Sec6, Sec8, Sec10, Sec15, Exo70 and Exo84.; GO: 0006904 vesicle docking involved in exocytosis, 0015031 protein transport, 0000145 exocyst
Probab=36.46 E-value=1.3e+02 Score=26.36 Aligned_cols=15 Identities=7% Similarity=0.536 Sum_probs=9.3
Q ss_pred cccCCHHHHHHHHhc
Q psy4691 76 QEFESFAAAVDEFFS 90 (380)
Q Consensus 76 ~~f~SFNeAlDeFFs 90 (380)
.+|..||.+|-.|=.
T Consensus 62 eh~q~Fn~sI~sy~~ 76 (142)
T PF04048_consen 62 EHYQGFNSSIGSYSQ 76 (142)
T ss_pred HHHHHHHHHHHHHHH
Confidence 456667777776533
No 22
>PF15463 ECM11: Extracellular mutant protein 11
Probab=36.43 E-value=2.9e+02 Score=24.09 Aligned_cols=67 Identities=19% Similarity=0.322 Sum_probs=43.0
Q ss_pred HHhcchhhh---hHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHH
Q psy4691 87 EFFSTAESH---KIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQ 156 (380)
Q Consensus 87 eFFs~~e~~---k~~~k~~~~~~~~ikKLek~~~~qekrI~~le~~~E~~~~kgeLI~aN~~~Ve~~i~~v~~ 156 (380)
.||+...-. ..-....++....+.||...+.......+.|+.+ ....+++|......|+.-|.-|+.
T Consensus 64 ~~fs~ls~~eWe~~Gd~~l~qf~~l~~kl~~~R~~~r~~~~~fe~e---I~~R~eav~~~~~~l~~kL~~mk~ 133 (139)
T PF15463_consen 64 EFFSNLSFDEWEEAGDWFLEQFSELMQKLKEARRKLRKKFAVFEDE---INRRAEAVRAQGEQLDRKLEKMKE 133 (139)
T ss_pred HHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 355555433 1222333456667788888888888888888733 446677777777777777666653
No 23
>PF06191 DUF995: Protein of unknown function (DUF995); InterPro: IPR009337 This is a family of uncharacterised Proteobacteria proteins.
Probab=34.65 E-value=28 Score=31.14 Aligned_cols=24 Identities=17% Similarity=0.497 Sum_probs=17.7
Q ss_pred cccccceeeEEecCCcee-cccccC
Q psy4691 297 VYWFEKFYWFISSENYLG-NVAWDA 320 (380)
Q Consensus 297 ~~WFEKF~WFiSSeg~LV-S~AW~~ 320 (380)
...|=.=+||||.+|-|| ...|.+
T Consensus 67 ~~s~g~GrW~Vt~~GklC~~a~W~~ 91 (145)
T PF06191_consen 67 SESYGEGRWFVTDNGKLCFRATWHS 91 (145)
T ss_pred cceeeeeEEEECCCCCEEEEeEEEC
Confidence 355566699999999999 445544
No 24
>PF07462 MSP1_C: Merozoite surface protein 1 (MSP1) C-terminus; InterPro: IPR010901 This entry represents the C-terminal region of merozoite surface protein 1 (MSP1), which is found in a number of Plasmodium species. MSP-1 is a 200 kDa protein expressed on the surface of the Plasmodium vivax merozoite. MSP-1 of Plasmodium species is synthesised as a high-molecular-weight precursor and then processed into several fragments. At the time of red cell invasion by the merozoite, only the 19 kDa C-terminal fragment (MSP-119), which contains two epidermal growth factor-like domains, remains on the surface. Antibodies against MSP-119 inhibit merozoite entry into red cells, and immunisation with MSP-119 protects monkeys from challenging infections. Hence, MSP-119 is considered a promising vaccine candidate [].; GO: 0009405 pathogenesis, 0016020 membrane
Probab=34.37 E-value=6.4e+02 Score=27.44 Aligned_cols=41 Identities=34% Similarity=0.403 Sum_probs=33.5
Q ss_pred cCCCHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHH
Q psy4691 238 LDLSAYANAKRFFDLKRSAAKKQ-QKTIQSTEKALKSAEKKT 278 (380)
Q Consensus 238 l~ls~~~NA~~yy~~~K~~~~K~-~kt~~~~~kalk~~e~k~ 278 (380)
-..|+-.|-..||++||.+++|. +-|+++++.-||=-..++
T Consensus 120 ~p~~vl~n~~vfFnKkrEaek~eveNtlkNt~iLlkyYKa~~ 161 (574)
T PF07462_consen 120 NPTSVLKNFTVFFNKKREAEKKEVENTLKNTEILLKYYKARA 161 (574)
T ss_pred CcHHHHHHHHHHHhhhhHHHHHHHHHHhhhHHHHHHHHHHHH
Confidence 34688899999999999998877 788888888888755444
No 25
>TIGR02894 DNA_bind_RsfA transcription factor, RsfA family. In a subset of endospore-forming members of the Firmcutes, members of this protein family are found, several to a genome. Two very strongly conserved sequences regions are separated by a highly variable linker region. Much of the linker region was excised from the seed alignment for this model. A characterized member is the prespore-specific transcription RsfA from Bacillus subtilis, previously called YwfN, which is controlled by sigma factor F and seems to fine-tune expression of some genes in the sigma-F regulon. A paralog in Bacillus subtilis is designated YlbO.
Probab=34.17 E-value=3.7e+02 Score=24.56 Aligned_cols=53 Identities=13% Similarity=0.109 Sum_probs=35.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHH
Q psy4691 105 ERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQD 157 (380)
Q Consensus 105 ~~~~ikKLek~~~~qekrI~~le~~~E~~~~kgeLI~aN~~~Ve~~i~~v~~~ 157 (380)
...+.++|+..+..++.+++.|+.+.+.+...-..|..-|..+=.+|+-.|.+
T Consensus 102 ~~~e~~~l~~e~~~l~~~~e~Le~e~~~L~~~~~~~~eDY~~L~~Im~RARkl 154 (161)
T TIGR02894 102 LQKENERLKNQNESLQKRNEELEKELEKLRQRLSTIEEDYQTLIDIMDRARKL 154 (161)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34455666666677777777777776777666777777777666666666654
No 26
>PF10168 Nup88: Nuclear pore component; InterPro: IPR019321 Nup88 can be divided into two structural domains; the N-terminal two-thirds of the protein have no obvious structural motifs. It is, however, where it binds to Nup98; one of the components of the nuclear pore. The C-terminal end is a predicted coiled-coil domain []. Nup88 is over expressed in tumour cells [].
Probab=34.07 E-value=3.6e+02 Score=30.24 Aligned_cols=67 Identities=18% Similarity=0.144 Sum_probs=46.5
Q ss_pred CHHHHHHHHhcchhhhhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHH
Q psy4691 80 SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVD 148 (380)
Q Consensus 80 SFNeAlDeFFs~~e~~k~~~k~~~~~~~~ikKLek~~~~qekrI~~le~~~E~~~~kgeLI~aN~~~Ve 148 (380)
-+.+|+.-|....-. + ..++....+..+..|...+++|.++|..++++.+..+..++.|...+..+.
T Consensus 540 lL~~a~~vlreeYi~-~-~~~ar~ei~~rv~~Lk~~~e~Ql~~L~~l~e~~~~l~~~ae~LaeR~e~a~ 606 (717)
T PF10168_consen 540 LLSQATKVLREEYIE-K-QDLAREEIQRRVKLLKQQKEQQLKELQELQEERKSLRESAEKLAERYEEAK 606 (717)
T ss_pred HHHHHHHHHHHHHHH-H-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 355555555432111 1 123444667777888999999999999999999999999998887666553
No 27
>PF05384 DegS: Sensor protein DegS; InterPro: IPR008595 This is a group of Bacillus DegS proteins. The DegS-DegU two-component regulatory system of Bacillus subtilis controls various processes that characterise the transition from the exponential to the stationary growth phase, including the induction of extracellular degradative enzymes, expression of late competence genes and down-regulation of the sigma D regulon []. The entry also contains one sequence Q8R9D3 from SWISSPROT from Thermoanaerobacter tengcongensis which is described as a sensory transduction histidine kinase.; GO: 0016301 kinase activity, 0007165 signal transduction
Probab=33.62 E-value=3.7e+02 Score=24.39 Aligned_cols=20 Identities=40% Similarity=0.577 Sum_probs=11.1
Q ss_pred HHHHHHHHHHHHHHHHHHHH
Q psy4691 110 KKLENVKRDHETRLSALEQT 129 (380)
Q Consensus 110 kKLek~~~~qekrI~~le~~ 129 (380)
+.|..++.+++.++..+...
T Consensus 101 ~qLr~rRD~LErrl~~l~~t 120 (159)
T PF05384_consen 101 KQLRERRDELERRLRNLEET 120 (159)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 44555556666665555533
No 28
>PF14193 DUF4315: Domain of unknown function (DUF4315)
Probab=30.37 E-value=3e+02 Score=22.31 Aligned_cols=58 Identities=17% Similarity=0.264 Sum_probs=40.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHhcCCChHHHHHHHHHHhh
Q psy4691 104 QERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQR 176 (380)
Q Consensus 104 ~~~~~ikKLek~~~~qekrI~~le~~~E~~~~kgeLI~aN~~~Ve~~i~~v~~~~~~~~~W~~I~~~i~~~~~ 176 (380)
++...+.|...++.+++.++..|+.+... ..|. +|+.+||++ +|+=.++..+++..+.
T Consensus 5 Ki~~eieK~k~Kiae~Q~rlK~Le~qk~E--------~EN~----EIv~~VR~~---~mtp~eL~~~L~~~~~ 62 (83)
T PF14193_consen 5 KIRAEIEKTKEKIAELQARLKELEAQKTE--------AENL----EIVQMVRSM---KMTPEELAAFLRAMKS 62 (83)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHH----HHHHHHHHc---CCCHHHHHHHHHHHHh
Confidence 45667778888888888888888744222 2444 455666654 6899999999987754
No 29
>KOG0018|consensus
Probab=28.93 E-value=1e+03 Score=28.13 Aligned_cols=55 Identities=22% Similarity=0.254 Sum_probs=43.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHhc
Q psy4691 106 RDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIAN 160 (380)
Q Consensus 106 ~~~ikKLek~~~~qekrI~~le~~~E~~~~kgeLI~aN~~~Ve~~i~~v~~~~~~ 160 (380)
...++++...+..++.++...+.++...+....-|+.|+..|+..+..-+..++.
T Consensus 233 ~~els~~~~ei~~~~~~~d~~e~ei~~~k~e~~ki~re~~~~Dk~i~~ke~~l~e 287 (1141)
T KOG0018|consen 233 NDELSRLNAEIPKLKERMDKKEREIRVRKKERGKIRRELQKVDKKISEKEEKLAE 287 (1141)
T ss_pred hHHHHHHhhhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 3456677777888888888888888888777778999999999988888777754
No 30
>KOG4460|consensus
Probab=28.15 E-value=3.9e+02 Score=29.20 Aligned_cols=66 Identities=15% Similarity=0.059 Sum_probs=43.5
Q ss_pred ccCCHHHHHHHHhcchhhhhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhH
Q psy4691 77 EFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQ 144 (380)
Q Consensus 77 ~f~SFNeAlDeFFs~~e~~k~~~k~~~~~~~~ikKLek~~~~qekrI~~le~~~E~~~~kgeLI~aN~ 144 (380)
.|.=.|+|+.-|--..- +..-......+..+.-|..++++|-++|..++++.+..++.++-+..-+
T Consensus 560 ~~~lL~~a~~vfrEqYi--~~~dlV~~e~qrH~~~l~~~k~~QlQ~l~~~~eer~~i~e~a~~La~R~ 625 (741)
T KOG4460|consen 560 CLQLLSRATQVFREQYI--LKQDLVKEEIQRHVKLLCDQKKKQLQDLSYCREERKSLREMAERLADRY 625 (741)
T ss_pred HHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34446677765532221 1112233456677788899999999999999988888888877765444
No 31
>cd00584 Prefoldin_alpha Prefoldin alpha subunit; Prefoldin is a hexameric molecular chaperone complex, found in both eukaryotes and archaea, that binds and stabilizes newly synthesized polypeptides allowing them to fold correctly. The complex contains two alpha and four beta subunits, the two subunits being evolutionarily related. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the structure of the hexamer. The structure of the complex consists of a double beta barrel assembly with six protruding coiled-coils.
Probab=27.59 E-value=2.6e+02 Score=23.57 Aligned_cols=6 Identities=17% Similarity=0.445 Sum_probs=3.0
Q ss_pred CHHHHH
Q psy4691 80 SFAAAV 85 (380)
Q Consensus 80 SFNeAl 85 (380)
++.+|+
T Consensus 84 ~~~eA~ 89 (129)
T cd00584 84 DLEEAI 89 (129)
T ss_pred cHHHHH
Confidence 355555
No 32
>PF08336 P4Ha_N: Prolyl 4-Hydroxylase alpha-subunit, N-terminal region; InterPro: IPR013547 The members found in this entry are eukaryotic proteins, and include all three isoforms of the prolyl 4-hydroxylase alpha subunit. This enzyme (1.14.11.2 from EC) is important in the post-translational modification of collagen, as it catalyses the formation of 4-hydroxyproline. In vertebrates, the complete enzyme is an alpha2-beta2 tetramer; the beta-subunit is identical to protein disulphide isomerase [, , , ]. The function of the N-terminal region featured in this family does not seem to be known. ; GO: 0004656 procollagen-proline 4-dioxygenase activity, 0016702 oxidoreductase activity, acting on single donors with incorporation of molecular oxygen, incorporation of two atoms of oxygen, 0055114 oxidation-reduction process, 0005783 endoplasmic reticulum
Probab=26.43 E-value=4.1e+02 Score=22.71 Aligned_cols=23 Identities=26% Similarity=0.319 Sum_probs=16.6
Q ss_pred HHHHHHHHHHhcCCChHHHHHHHHHH
Q psy4691 149 AAILAIRQDIANQLSWEDIEARVKQA 174 (380)
Q Consensus 149 ~~i~~v~~~~~~~~~W~~I~~~i~~~ 174 (380)
.+...|+.+. .||..++.+++..
T Consensus 61 naF~LIrRl~---~dW~~~~~~~~~~ 83 (134)
T PF08336_consen 61 NAFSLIRRLH---QDWPKWEKLMEQP 83 (134)
T ss_pred HHHHHHHHHH---HhhhhHHHHHHHh
Confidence 4555666664 5899999998665
No 33
>PF02770 Acyl-CoA_dh_M: Acyl-CoA dehydrogenase, middle domain; InterPro: IPR006091 Acyl-CoA dehydrogenases (1.3.99.3 from EC) are a family of flavoproteins that catalyse the alpha,beta-dehydrogenation of acyl-CoA thioesters to the corresponding trans 2,3-enoyl CoA-products with the concomitant reduction of enzyme-bound FAD. Different family members share a high sequence identity, catalytic mechanisms, and structural properties, but differ in the position of their catalytic bases and in their substrate binding specificity. Butyryl-CoA dehydrogenase [] prefers short chain substrates, medium chain- and long-chain acyl-CoA dehydrogenases prefer medium and long chain substrates, respectively, and Isovaleryl-CoA dehydrogenase [] prefers branched-chain substrates. The monomeric enzyme is folded into three domains of approximately equal size, where the N-terminal domain is all-alpha, the middle domain is an open (5,8) barrel, and the C-terminal domain is a four-helical bundle. The constituent families differ in the numbers of C-terminal domains. This entry represents the middle beta-barrel domain found in medium chain acyl-CoA dehydrogenases, as well as in the related peroxisomal acyl-CoA oxidase-II enzymes. Acyl-CoA oxidase (ACO; 1.3.3.6 from EC) catalyzes the first and rate-determining step of the peroxisomal beta-oxidation of fatty acids [].; GO: 0003995 acyl-CoA dehydrogenase activity, 0055114 oxidation-reduction process; PDB: 3MDD_B 1UDY_C 3MDE_B 2UXW_A 3B96_A 1SIQ_A 1SIR_A 2R0N_A 2R0M_A 2D29_B ....
Probab=25.99 E-value=64 Score=23.07 Aligned_cols=27 Identities=15% Similarity=0.249 Sum_probs=19.9
Q ss_pred ceeEEecccccCCCCCceeEEEEEEEe
Q psy4691 347 GSFMIRGKKNFFPPCQLAMGISFLFKL 373 (380)
Q Consensus 347 GsFmIrGkKnyl~p~~L~mg~G~lf~~ 373 (380)
|.|+|.|.|.|+...+..=.+-++-++
T Consensus 25 ~~~~L~G~K~~v~~~~~a~~~~v~ar~ 51 (52)
T PF02770_consen 25 DGYVLNGEKRFVSNAPDADWFLVFART 51 (52)
T ss_dssp TEEEEEEEEEEEETTTTESEEEEEEEE
T ss_pred ceEEEeeEEEEECCcCccCEEEEEEEE
Confidence 349999999999988766555555544
No 34
>PF02996 Prefoldin: Prefoldin subunit; InterPro: IPR004127 This entry comprises of several prefoldin subunits. Prefoldin (PFD) is a chaperone that interacts exclusively with type II chaperonins, hetero-oligomers lacking an obligate co-chaperonin that are found only in eukaryotes (chaperonin-containing T-complex polypeptide-1 (CCT)) and archaea. Eukaryotic PFD is a multi-subunit complex containing six polypeptides in the molecular mass range of 14-23 kDa. In archaea, on the other hand, PFD is composed of two types of subunits, two alpha and four beta. The six subunits associate to form two back-to-back up-and-down eight-stranded barrels, from which hang six coiled coils. Each subunit contributes one (beta subunits) or two (alpha subunits) beta hairpin turns to the barrels. The coiled coils are formed by the N and C termini of an individual subunit. Overall, this unique arrangement resembles a jellyfish. The eukaryotic PFD hexamer is composed of six different subunits; however, these can be grouped into two alpha-like (PFD3 and -5) and four beta-like (PFD1, -2, -4, and -6) subunits based on amino acid sequence similarity with their archaeal counterparts. Eukaryotic PFD has a six-legged structure similar to that seen in the archaeal homologue [, ]. This family contains the archaeal alpha subunit, eukaryotic prefoldin subunits 3 and 5 and the UXT (ubiquitously expressed transcript) family. Eukaryotic PFD has been shown to bind both actin and tubulin co-translationally. The chaperone then delivers the target protein to CCT, interacting with the chaperonin through the tips of the coiled coils. No authentic target proteins of any archaeal PFD have been identified, to date.; GO: 0051082 unfolded protein binding, 0006457 protein folding, 0016272 prefoldin complex; PDB: 1FXK_C 2ZDI_C.
Probab=25.81 E-value=2.9e+02 Score=22.74 Aligned_cols=7 Identities=29% Similarity=0.591 Sum_probs=3.3
Q ss_pred CHHHHHH
Q psy4691 80 SFAAAVD 86 (380)
Q Consensus 80 SFNeAlD 86 (380)
|+.+|++
T Consensus 74 s~~eA~~ 80 (120)
T PF02996_consen 74 SLEEAIE 80 (120)
T ss_dssp EHHHHHH
T ss_pred cHHHHHH
Confidence 3445544
No 35
>PF07889 DUF1664: Protein of unknown function (DUF1664); InterPro: IPR012458 The members of this family are hypothetical plant proteins of unknown function. The region featured in this family is approximately 100 amino acids long.
Probab=25.15 E-value=4.6e+02 Score=22.85 Aligned_cols=26 Identities=0% Similarity=0.125 Sum_probs=12.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy4691 105 ERDALKKLENVKRDHETRLSALEQTQ 130 (380)
Q Consensus 105 ~~~~ikKLek~~~~qekrI~~le~~~ 130 (380)
...++..+..++++|.+..+..+++.
T Consensus 66 LsqRId~vd~klDe~~ei~~~i~~eV 91 (126)
T PF07889_consen 66 LSQRIDRVDDKLDEQKEISKQIKDEV 91 (126)
T ss_pred HHHHHHHHHhhHHHHHHHHHHHHHHH
Confidence 34444555555555555555554443
No 36
>PF13747 DUF4164: Domain of unknown function (DUF4164)
Probab=23.87 E-value=4e+02 Score=21.64 Aligned_cols=23 Identities=13% Similarity=0.265 Sum_probs=10.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Q psy4691 105 ERDALKKLENVKRDHETRLSALE 127 (380)
Q Consensus 105 ~~~~ikKLek~~~~qekrI~~le 127 (380)
....+.+|..-+..+..++.+.+
T Consensus 37 ~e~ei~~l~~dr~rLa~eLD~~~ 59 (89)
T PF13747_consen 37 LEEEIQRLDADRSRLAQELDQAE 59 (89)
T ss_pred HHHHHHHHHhhHHHHHHHHHhHH
Confidence 33344444444444444444444
No 37
>PF15437 PGBA_C: Plasminogen-binding protein pgbA C-terminal
Probab=23.78 E-value=1.9e+02 Score=23.51 Aligned_cols=18 Identities=28% Similarity=0.368 Sum_probs=8.3
Q ss_pred CCCHHHHHHHHHHHHHHH
Q psy4691 239 DLSAYANAKRFFDLKRSA 256 (380)
Q Consensus 239 ~ls~~~NA~~yy~~~K~~ 256 (380)
.||+-.-=+++-+.+|++
T Consensus 33 kl~~kEekrrLkeEkKkA 50 (86)
T PF15437_consen 33 KLSPKEEKRRLKEEKKKA 50 (86)
T ss_pred cCCHHHHHHHHHHHHHHH
Confidence 445544444444444433
No 38
>smart00685 DM14 Repeats in fly CG4713, worm Y37H9A.3 and human FLJ20241.
Probab=23.74 E-value=3e+02 Score=20.88 Aligned_cols=48 Identities=23% Similarity=0.300 Sum_probs=29.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHhcCCChHHHHHHHHHHhhcCCcc
Q psy4691 118 DHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPV 181 (380)
Q Consensus 118 ~qekrI~~le~~~E~~~~kgeLI~aN~~~Ve~~i~~v~~~~~~~~~W~~I~~~i~~~~~~g~pv 181 (380)
-++.|++.|+..+..-+..|+ +++|....|.+ ..++.+|+.. +.|.||
T Consensus 3 ~L~~R~~~yk~Aa~~AK~~gd--------~~kAr~~~R~~-------K~~~~~I~~~-~aG~pV 50 (59)
T smart00685 3 LLQQRQEQYKQAALQAKRAGD--------EEKARRHLRIA-------KQFDDAIKAA-RAGRPV 50 (59)
T ss_pred HHHHHHHHHHHHHHHHHHcCC--------HHHHHHHHHHH-------hhHHHHHHHH-HCCCCC
Confidence 456777777777777766666 45555555554 2355666554 356665
No 39
>PF13747 DUF4164: Domain of unknown function (DUF4164)
Probab=23.17 E-value=4.1e+02 Score=21.56 Aligned_cols=53 Identities=17% Similarity=0.254 Sum_probs=32.6
Q ss_pred ccCCHHHHHHHHhcchhhh-hHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy4691 77 EFESFAAAVDEFFSTAESH-KIDLKAVQQERDALKKLENVKRDHETRLSALEQTQ 130 (380)
Q Consensus 77 ~f~SFNeAlDeFFs~~e~~-k~~~k~~~~~~~~ikKLek~~~~qekrI~~le~~~ 130 (380)
.++++..+|+.-|...... .++... +....-..+|...+...+.+...++..+
T Consensus 16 aid~LE~~v~~r~~~~~~~~~~e~ei-~~l~~dr~rLa~eLD~~~ar~~~Le~~~ 69 (89)
T PF13747_consen 16 AIDRLEKAVDRRLERDRKRDELEEEI-QRLDADRSRLAQELDQAEARANRLEEAN 69 (89)
T ss_pred HHHHHHHHHHHHHHhhhhhhhHHHHH-HHHHhhHHHHHHHHHhHHHHHHHHHHHH
Confidence 3555666666666655544 222222 4556666778888888888877777443
No 40
>PF11221 Med21: Subunit 21 of Mediator complex; InterPro: IPR021384 The Mediator complex is a coactivator involved in the regulated transcription of nearly all RNA polymerase II-dependent genes. Mediator functions as a bridge to convey information from gene-specific regulatory proteins to the basal RNA polymerase II transcription machinery. The Mediator complex, having a compact conformation in its free form, is recruited to promoters by direct interactions with regulatory proteins and serves for the assembly of a functional preinitiation complex with RNA polymerase II and the general transcription factors. On recruitment the Mediator complex unfolds to an extended conformation and partially surrounds RNA polymerase II, specifically interacting with the unphosphorylated form of the C-terminal domain (CTD) of RNA polymerase II. The Mediator complex dissociates from the RNA polymerase II holoenzyme and stays at the promoter when transcriptional elongation begins. The Mediator complex is composed of at least 31 subunits: MED1, MED4, MED6, MED7, MED8, MED9, MED10, MED11, MED12, MED13, MED13L, MED14, MED15, MED16, MED17, MED18, MED19, MED20, MED21, MED22, MED23, MED24, MED25, MED26, MED27, MED29, MED30, MED31, CCNC, CDK8 and CDC2L6/CDK11. The subunits form at least three structurally distinct submodules. The head and the middle modules interact directly with RNA polymerase II, whereas the elongated tail module interacts with gene-specific regulatory proteins. Mediator containing the CDK8 module is less active than Mediator lacking this module in supporting transcriptional activation. The head module contains: MED6, MED8, MED11, SRB4/MED17, SRB5/MED18, ROX3/MED19, SRB2/MED20 and SRB6/MED22. The middle module contains: MED1, MED4, NUT1/MED5, MED7, CSE2/MED9, NUT2/MED10, SRB7/MED21 and SOH1/MED31. CSE2/MED9 interacts directly with MED4. The tail module contains: MED2, PGD1/MED3, RGR1/MED14, GAL11/MED15 and SIN4/MED16. The CDK8 module contains: MED12, MED13, CCNC and CDK8. Individual preparations of the Mediator complex lacking one or more distinct subunits have been variously termed ARC, CRSP, DRIP, PC2, SMCC and TRAP. Med21 has been known as Srb7 in yeasts, hSrb7 in humans and Trap 19 in Drosophila. The heterodimer of the two subunits Med7 and Med21 appears to act as a hinge between the middle and the tail regions of Mediator []. ; PDB: 1YKE_B 1YKH_B.
Probab=22.91 E-value=3.2e+02 Score=23.99 Aligned_cols=70 Identities=16% Similarity=0.233 Sum_probs=36.4
Q ss_pred CHHHHHHHHhcchhhhhHhHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHH
Q psy4691 80 SFAAAVDEFFSTAESHKIDLKAVQQERDALKKL---ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQ 156 (380)
Q Consensus 80 SFNeAlDeFFs~~e~~k~~~k~~~~~~~~ikKL---ek~~~~qekrI~~le~~~E~~~~kgeLI~aN~~~Ve~~i~~v~~ 156 (380)
.|...+.+|=..+... ..++..-|+.| +..-++|.++|.+|+ ++++..+..+..=+..-+..+..|+.
T Consensus 66 ~~~~~~~elA~dIi~k------akqIe~LIdsLPg~~~see~Q~~~i~~L~---~E~~~~~~el~~~v~e~e~ll~~v~~ 136 (144)
T PF11221_consen 66 EFEENIKELATDIIRK------AKQIEYLIDSLPGIEVSEEEQLKRIKELE---EENEEAEEELQEAVKEAEELLKQVQE 136 (144)
T ss_dssp HHHHHHHHHHHHHHHH------HHHHHHHHHHSTTSSS-HHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred hHHHHHHHHHHHHHHH------HHHHHHHHHhCCCCCCCHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4777777664433321 11222222322 344566777777776 34555555555555555666666665
Q ss_pred HH
Q psy4691 157 DI 158 (380)
Q Consensus 157 ~~ 158 (380)
++
T Consensus 137 ~i 138 (144)
T PF11221_consen 137 LI 138 (144)
T ss_dssp HH
T ss_pred HH
Confidence 54
No 41
>PF13863 DUF4200: Domain of unknown function (DUF4200)
Probab=22.27 E-value=4.6e+02 Score=21.80 Aligned_cols=37 Identities=22% Similarity=0.299 Sum_probs=24.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy4691 104 QERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI 140 (380)
Q Consensus 104 ~~~~~ikKLek~~~~qekrI~~le~~~E~~~~kgeLI 140 (380)
.....+.+|...+..+...+..++.....|..+.+.|
T Consensus 78 ~k~~ei~~l~~~l~~l~~~~~k~e~~l~~~~~Y~~fL 114 (126)
T PF13863_consen 78 EKEAEIKKLKAELEELKSEISKLEEKLEEYKKYEEFL 114 (126)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4556667777777777777777776666666665554
No 42
>PF05833 FbpA: Fibronectin-binding protein A N-terminus (FbpA); InterPro: IPR008616 This family consists of the N-terminal region of the prokaryotic fibronectin-binding protein, the C-terminal region is IPR008532 from INTERPRO. Fibronectin binding is considered to be an important virulence factor in streptococcal infections. Fibronectin is a dimeric glycoprotein that is present in a soluble form in plasma and extracellular fluids; it is also present in a fibrillar form on cell surfaces. Both the soluble and cellular forms of fibronectin may be incorporated into the extracellular tissue matrix. While fibronectin has critical roles in eukaryotic cellular processes, such as adhesion, migration and differentiation, it is also a substrate for the attachment of bacteria. The binding of pathogenic Streptococcus pyogenes and Staphylococcus aureus to epithelial cells via fibronectin facilitates their internalisation and systemic spread within the host [].; PDB: 3DOA_A 2ZBK_F 2HKJ_A 1Z5B_A 1Z5C_B 1MX0_F 1Z5A_A 1MU5_A 1Z59_A.
Probab=22.09 E-value=20 Score=37.11 Aligned_cols=44 Identities=23% Similarity=0.283 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHH
Q psy4691 104 QERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAIL 152 (380)
Q Consensus 104 ~~~~~ikKLek~~~~qekrI~~le~~~E~~~~kgeLI~aN~~~Ve~~i~ 152 (380)
..+...+++++.+.+++++++.++++.+..+.. .++..+.++|.
T Consensus 300 ~~~~l~k~l~~~~~klek~l~~l~~~~~~~~~~-----~~~~~~gelL~ 343 (455)
T PF05833_consen 300 KKKRLEKKLEKKIKKLEKKLEKLEEELEESEKA-----ENYREYGELLL 343 (455)
T ss_dssp -------------------------------------------------
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-----hHHHHHHHHHH
Confidence 444555779999999999999999776666544 45555544443
No 43
>PF08580 KAR9: Yeast cortical protein KAR9; InterPro: IPR013889 The KAR9 protein in Saccharomyces cerevisiae (Baker's yeast) is a cytoskeletal protein required for karyogamy, correct positioning of the mitotic spindle and for orientation of cytoplasmic microtubules []. KAR9 localises at the shmoo tip in mating cells and at the tip of the growing bud in anaphase [].
Probab=21.75 E-value=4.5e+02 Score=29.35 Aligned_cols=24 Identities=13% Similarity=0.574 Sum_probs=18.3
Q ss_pred HHHHHHHHHHHHHhcCCChHHHHH
Q psy4691 146 SVDAAILAIRQDIANQLSWEDIEA 169 (380)
Q Consensus 146 ~Ve~~i~~v~~~~~~~~~W~~I~~ 169 (380)
.|.+.|..|+.-++-.|.|.||..
T Consensus 106 ~vK~~L~~vK~qveiAmE~~EL~~ 129 (683)
T PF08580_consen 106 DVKKTLISVKKQVEIAMEWEELWN 129 (683)
T ss_pred HHHHHHHHHHHHHHHHHhHHHHHH
Confidence 466777778888888888888764
No 44
>PF04363 DUF496: Protein of unknown function (DUF496); InterPro: IPR007458 Members of this family are uncharacterised proteins.
Probab=21.54 E-value=4.7e+02 Score=21.63 Aligned_cols=19 Identities=32% Similarity=0.396 Sum_probs=14.1
Q ss_pred HHhcCCChHHHHHHHHHHh
Q psy4691 157 DIANQLSWEDIEARVKQAQ 175 (380)
Q Consensus 157 ~~~~~~~W~~I~~~i~~~~ 175 (380)
-+..+|+-.+|..+|..-+
T Consensus 44 YI~~~Ms~edi~~II~nMr 62 (95)
T PF04363_consen 44 YIKPDMSIEDIRAIIENMR 62 (95)
T ss_pred HccCCCCHHHHHHHHHHHH
Confidence 3467899999988886543
No 45
>PF07889 DUF1664: Protein of unknown function (DUF1664); InterPro: IPR012458 The members of this family are hypothetical plant proteins of unknown function. The region featured in this family is approximately 100 amino acids long.
Probab=21.42 E-value=5.5e+02 Score=22.38 Aligned_cols=44 Identities=18% Similarity=0.221 Sum_probs=22.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHH
Q psy4691 104 QERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESV 147 (380)
Q Consensus 104 ~~~~~ikKLek~~~~qekrI~~le~~~E~~~~kgeLI~aN~~~V 147 (380)
+....-..|...++.+..||+.+...+++.......|......|
T Consensus 51 ql~~vs~~l~~tKkhLsqRId~vd~klDe~~ei~~~i~~eV~~v 94 (126)
T PF07889_consen 51 QLEQVSESLSSTKKHLSQRIDRVDDKLDEQKEISKQIKDEVTEV 94 (126)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHH
Confidence 33344444555555555566665555555555555554444333
No 46
>PF10779 XhlA: Haemolysin XhlA; InterPro: IPR019715 Haemolysin XhlA is a cell-surface associated haemolysin that lyses the two most prevalent types of insect immune cells (granulocytes and plasmatocytes) as well as rabbit and horse erythrocytes [].
Probab=21.06 E-value=3.9e+02 Score=20.49 Aligned_cols=29 Identities=24% Similarity=0.538 Sum_probs=17.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy4691 106 RDALKKLENVKRDHETRLSALEQTQLVDK 134 (380)
Q Consensus 106 ~~~ikKLek~~~~qekrI~~le~~~E~~~ 134 (380)
+..+.+++..++.+++++..++...+.+.
T Consensus 5 ~e~l~~ie~~l~~~~~~i~~lE~~~~~~e 33 (71)
T PF10779_consen 5 KEKLNRIETKLDNHEERIDKLEKRDAANE 33 (71)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44556666666666666666665544443
No 47
>PRK13729 conjugal transfer pilus assembly protein TraB; Provisional
Probab=20.86 E-value=3.5e+02 Score=28.82 Aligned_cols=11 Identities=27% Similarity=0.643 Sum_probs=7.2
Q ss_pred CCcccCCceeE
Q psy4691 340 TGEFLTTGSFM 350 (380)
Q Consensus 340 sGeyL~~GsFm 350 (380)
+-.|||.|||+
T Consensus 210 ~~~YiPsgSf~ 220 (475)
T PRK13729 210 SLPYIPSGSFA 220 (475)
T ss_pred cCceeCCCCeE
Confidence 33577777774
No 48
>PRK05423 hypothetical protein; Provisional
Probab=20.74 E-value=4.9e+02 Score=21.81 Aligned_cols=18 Identities=22% Similarity=0.357 Sum_probs=13.1
Q ss_pred HhcCCChHHHHHHHHHHh
Q psy4691 158 IANQLSWEDIEARVKQAQ 175 (380)
Q Consensus 158 ~~~~~~W~~I~~~i~~~~ 175 (380)
+..+|+-++|..+|..-+
T Consensus 52 Ik~~Ms~e~i~~II~nMr 69 (104)
T PRK05423 52 IKPGMSIEEIQGIIANMK 69 (104)
T ss_pred cCCCCCHHHHHHHHHHHH
Confidence 367888888888875543
No 49
>KOG0994|consensus
Probab=20.04 E-value=1.6e+03 Score=27.22 Aligned_cols=46 Identities=9% Similarity=0.102 Sum_probs=29.4
Q ss_pred HHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHhc-CCChHHHHHH
Q psy4691 125 ALEQTQLVDKEKAELIINNQESVDAAILAIRQDIAN-QLSWEDIEAR 170 (380)
Q Consensus 125 ~le~~~E~~~~kgeLI~aN~~~Ve~~i~~v~~~~~~-~~~W~~I~~~ 170 (380)
.+++..+.-.....-..++..+++.+|..|+..+.+ +.|=+.|+.+
T Consensus 1451 ~Aq~~~~~a~as~~q~~~s~~el~~Li~~v~~Flt~~~adp~si~~v 1497 (1758)
T KOG0994|consen 1451 SAQRALEQANASRSQMEESNRELRNLIQQVRDFLTQPDADPDSIEEV 1497 (1758)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCHHHHHHH
Confidence 333333444445555667777888888888888754 4566677654
Done!