Query         psy4691
Match_columns 380
No_of_seqs    214 out of 454
Neff          6.0 
Searched_HMMs 46136
Date          Fri Aug 16 23:26:05 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy4691.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4691hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG2030|consensus              100.0 3.7E-92 7.9E-97  732.5  26.6  342   11-380   232-632 (911)
  2 COG1293 Predicted RNA-binding  100.0 6.5E-36 1.4E-40  314.2  15.7  239   80-367   266-564 (564)
  3 PF05833 FbpA:  Fibronectin-bin  99.7 1.2E-17 2.6E-22  171.7   3.5  175   16-280   233-412 (455)
  4 PF05670 DUF814:  Domain of unk  97.4 9.2E-05   2E-09   60.4   2.2   22  315-336    69-90  (90)
  5 COG1293 Predicted RNA-binding   95.9    0.14   3E-06   55.0  14.0  119   75-257   257-376 (564)
  6 COG1730 GIM5 Predicted prefold  91.7     9.4  0.0002   34.1  15.6   38  244-284    93-130 (145)
  7 cd00584 Prefoldin_alpha Prefol  70.5      66  0.0014   27.3  12.9   33  244-279    86-118 (129)
  8 PF03993 DUF349:  Domain of Unk  63.2      39 0.00084   25.8   6.9   16   80-95      6-21  (77)
  9 PF04568 IATP:  Mitochondrial A  62.9      26 0.00056   29.4   6.1   27  104-130    69-99  (100)
 10 cd00890 Prefoldin Prefoldin is  58.3 1.1E+02  0.0024   25.6  15.5   33  244-279    86-118 (129)
 11 PRK03947 prefoldin subunit alp  57.0 1.3E+02  0.0028   26.0  16.3   28  244-271    93-120 (140)
 12 KOG0687|consensus               50.4      70  0.0015   32.6   7.8   28  139-166   152-179 (393)
 13 TIGR00293 prefoldin, archaeal   48.8 1.6E+02  0.0035   24.7  13.0   21  244-264    85-105 (126)
 14 PF09726 Macoilin:  Transmembra  46.8 1.9E+02   0.004   32.4  11.2   53  104-159   549-601 (697)
 15 PF02757 YLP:  YLP motif;  Inte  44.1      12 0.00026   18.1   0.7    6  356-361     3-8   (9)
 16 PF02996 Prefoldin:  Prefoldin   42.1   2E+02  0.0043   23.8  10.0   40  232-279    69-108 (120)
 17 PF15066 CAGE1:  Cancer-associa  39.3 3.2E+02  0.0069   29.1  10.8   63  104-172   314-376 (527)
 18 COG5187 RPN7 26S proteasome re  38.7 1.6E+02  0.0034   29.9   8.1   29  138-166   162-190 (412)
 19 PF01519 DUF16:  Protein of unk  37.6 1.9E+02  0.0042   24.4   7.4   47  105-158    51-97  (102)
 20 PF05524 PEP-utilisers_N:  PEP-  36.6 1.7E+02  0.0037   24.5   7.2   54  108-161    36-93  (123)
 21 PF04048 Sec8_exocyst:  Sec8 ex  36.5 1.3E+02  0.0028   26.4   6.6   15   76-90     62-76  (142)
 22 PF15463 ECM11:  Extracellular   36.4 2.9E+02  0.0063   24.1   9.9   67   87-156    64-133 (139)
 23 PF06191 DUF995:  Protein of un  34.7      28  0.0006   31.1   2.1   24  297-320    67-91  (145)
 24 PF07462 MSP1_C:  Merozoite sur  34.4 6.4E+02   0.014   27.4  14.2   41  238-278   120-161 (574)
 25 TIGR02894 DNA_bind_RsfA transc  34.2 3.7E+02  0.0079   24.6   9.3   53  105-157   102-154 (161)
 26 PF10168 Nup88:  Nuclear pore c  34.1 3.6E+02  0.0078   30.2  11.0   67   80-148   540-606 (717)
 27 PF05384 DegS:  Sensor protein   33.6 3.7E+02  0.0079   24.4  10.2   20  110-129   101-120 (159)
 28 PF14193 DUF4315:  Domain of un  30.4   3E+02  0.0064   22.3   9.8   58  104-176     5-62  (83)
 29 KOG0018|consensus               28.9   1E+03   0.022   28.1  13.5   55  106-160   233-287 (1141)
 30 KOG4460|consensus               28.1 3.9E+02  0.0085   29.2   9.4   66   77-144   560-625 (741)
 31 cd00584 Prefoldin_alpha Prefol  27.6 2.6E+02  0.0057   23.6   7.0    6   80-85     84-89  (129)
 32 PF08336 P4Ha_N:  Prolyl 4-Hydr  26.4 4.1E+02  0.0089   22.7   9.2   23  149-174    61-83  (134)
 33 PF02770 Acyl-CoA_dh_M:  Acyl-C  26.0      64  0.0014   23.1   2.4   27  347-373    25-51  (52)
 34 PF02996 Prefoldin:  Prefoldin   25.8 2.9E+02  0.0063   22.7   6.8    7   80-86     74-80  (120)
 35 PF07889 DUF1664:  Protein of u  25.2 4.6E+02    0.01   22.9  10.0   26  105-130    66-91  (126)
 36 PF13747 DUF4164:  Domain of un  23.9   4E+02  0.0086   21.6   8.3   23  105-127    37-59  (89)
 37 PF15437 PGBA_C:  Plasminogen-b  23.8 1.9E+02  0.0041   23.5   4.8   18  239-256    33-50  (86)
 38 smart00685 DM14 Repeats in fly  23.7   3E+02  0.0065   20.9   5.7   48  118-181     3-50  (59)
 39 PF13747 DUF4164:  Domain of un  23.2 4.1E+02  0.0089   21.6  10.3   53   77-130    16-69  (89)
 40 PF11221 Med21:  Subunit 21 of   22.9 3.2E+02  0.0069   24.0   6.7   70   80-158    66-138 (144)
 41 PF13863 DUF4200:  Domain of un  22.3 4.6E+02    0.01   21.8   9.0   37  104-140    78-114 (126)
 42 PF05833 FbpA:  Fibronectin-bin  22.1      20 0.00043   37.1  -1.3   44  104-152   300-343 (455)
 43 PF08580 KAR9:  Yeast cortical   21.8 4.5E+02  0.0097   29.3   9.0   24  146-169   106-129 (683)
 44 PF04363 DUF496:  Protein of un  21.5 4.7E+02    0.01   21.6   7.2   19  157-175    44-62  (95)
 45 PF07889 DUF1664:  Protein of u  21.4 5.5E+02   0.012   22.4  10.0   44  104-147    51-94  (126)
 46 PF10779 XhlA:  Haemolysin XhlA  21.1 3.9E+02  0.0084   20.5   8.9   29  106-134     5-33  (71)
 47 PRK13729 conjugal transfer pil  20.9 3.5E+02  0.0077   28.8   7.5   11  340-350   210-220 (475)
 48 PRK05423 hypothetical protein;  20.7 4.9E+02   0.011   21.8   6.8   18  158-175    52-69  (104)
 49 KOG0994|consensus               20.0 1.6E+03   0.035   27.2  13.3   46  125-170  1451-1497(1758)

No 1  
>KOG2030|consensus
Probab=100.00  E-value=3.7e-92  Score=732.54  Aligned_cols=342  Identities=51%  Similarity=0.820  Sum_probs=313.9

Q ss_pred             HhhcccHHHHHHHccCCCCccEEEeecCCCCCCcccceecccceecccCCCCCCcccccccCCcccccCCHHHHHHHHhc
Q psy4691          11 IHKCGPTLHVLVSNLNVSSQGYLVQKKEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFS   90 (380)
Q Consensus        11 ~~~~~~~~~~l~~~~~~~~~Gyiv~k~~~~~~~~~~~~~~~~F~P~~~~~~~~P~~l~~~~~~~~~~f~SFNeAlDeFFs   90 (380)
                      +..++.++..|+.... .+.|||...++..+.+.. .++|++||| +         +.++.......|++||+|+|+|||
T Consensus       232 v~~~eE~~~elit~~~-~~~Gyi~~~k~~~~~s~~-~~~y~~f~p-~---------~~~fKs~~~k~fetf~ea~Def~S  299 (911)
T KOG2030|consen  232 VKDQEEEDRELITGKL-GSKGYILEEKEKKPISTK-EFIYDEFHP-L---------GVQFKSEPVKKFETFNEAVDEFFS  299 (911)
T ss_pred             HHHHHHHHHHHHhCCC-cccchhHHhhcccCCCcc-ceeeccccc-c---------ccccccchhhhccchhhHHHHHHH
Confidence            4445567777877643 588999988877755433 899999999 4         577888888999999999999999


Q ss_pred             chhhhhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHhcCCChHHHHHH
Q psy4691          91 TAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEAR  170 (380)
Q Consensus        91 ~~e~~k~~~k~~~~~~~~ikKLek~~~~qekrI~~le~~~E~~~~kgeLI~aN~~~Ve~~i~~v~~~~~~~~~W~~I~~~  170 (380)
                      .+++++..++...++.++.+||.+++.+|+++++.+++.++.++++|+||+.|..+|+.+|.+|++++++||+|.+|++|
T Consensus       300 ~IEsqk~~lr~~~~E~qa~kKld~vr~Dq~~rvq~Lqq~qe~~~~kAelIe~N~eLVe~~il~I~s~la~~m~W~dieKL  379 (911)
T KOG2030|consen  300 TIESQKSELRVKNQELQAEKKLDKVRKDQKERVQELQQVQELNRRKAELIEPNPELVEAAILAIQSALAQQMDWKDIEKL  379 (911)
T ss_pred             HHhhhhHHHHHHhhHhHHHhhhhcchhhHHHHHHHHHHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHccCCcHhHHHH
Confidence            99999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             HHHHhhcCCccccccccccccccEEEEEeccccccccccccccccCCCCCCCCCCcCCCCCeEEEEecCCCHHHHHHHHH
Q psy4691         171 VKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFF  250 (380)
Q Consensus       171 i~~~~~~g~pva~~I~~l~l~~n~i~l~L~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~V~idl~ls~~~NA~~yy  250 (380)
                      ++.+|++|||+|..|.++++..|.+++.|.++.+ |           +    +|+....+-+.|+|||+||||+||++||
T Consensus       380 ik~eqKkGn~vAk~i~~l~l~~n~~t~~L~d~~d-d-----------~----~de~k~~e~~~VeiDLslsA~aNArr~y  443 (911)
T KOG2030|consen  380 IKSEQKKGNPVAKSIDKLKLEKNEATLRLKDPED-D-----------N----DDEKKSSEVIVVEIDLSLSAFANARRYY  443 (911)
T ss_pred             HHHHHhcCchHhhhhhHHHHhhhhheeecCCccc-c-----------c----chhhccccceeeeeeccccchhhHHHHH
Confidence            9999999999999999999999999999993221 0           0    0112234458899999999999999999


Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhhhhhhhhccccccccccceeeEEecCCcee----------------
Q psy4691         251 DLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLG----------------  314 (380)
Q Consensus       251 ~~~K~~~~K~~kt~~~~~kalk~~e~k~~~~l~~~~~~~~~~~~Rk~~WFEKF~WFiSSeg~LV----------------  314 (380)
                      +++|+++.|++||..++++|||++++|++++|++......+.+.|++|||||||||||||||||                
T Consensus       444 ~~kk~aa~K~kKT~~a~eKAlK~~e~Ki~q~lk~~~~~~~i~k~Rk~~wFEKf~wFiSSEg~LVi~GrdaqQnEllvkky  523 (911)
T KOG2030|consen  444 EMKKEAAEKIKKTVDASEKALKSAERKIEQQLKQVKTVSRIKKIRKVYWFEKFHWFISSEGYLVIGGRDAQQNELLVKKY  523 (911)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhcccceeehheeEEEecCcEEEEcCCChhhhhHHHHhh
Confidence            9999999999999999999999999999999999888889999999999999999999999999                


Q ss_pred             -------------------------------------------cccccCCccceEEEEeCCCccccCCCCcccCCceeEE
Q psy4691         315 -------------------------------------------NVAWDAKVVTNAWWVKADQVSKTAPTGEFLTTGSFMI  351 (380)
Q Consensus       315 -------------------------------------------S~AW~~k~~~~a~WV~~~QVSKt~psGeyL~~GsFmI  351 (380)
                                                                 |.||+++++++||||||+|||||||||||||+|||||
T Consensus       524 ~~~~DiY~had~~gaSsviIkN~~~~eipp~TL~eAg~ma~~~S~aWdakvvssaWwv~~dqVSKtaptgeyL~~GSFmI  603 (911)
T KOG2030|consen  524 LEPGDIYVHADLHGASSVIIKNPPKTEIPPKTLEEAGSMALCYSKAWDAKVVSSAWWVYPDQVSKTAPTGEYLPTGSFMI  603 (911)
T ss_pred             CCCCCeEEecccCCCceEEEeCCCCCCCChhhHHHHHHHHhHHhhhhcccccccceEEecccccccCCCCccccccceEE
Confidence                                                       9999999999999999999999999999999999999


Q ss_pred             ecccccCCCCCceeEEEEEEEeccCccCC
Q psy4691         352 RGKKNFFPPCQLAMGISFLFKLEESSISR  380 (380)
Q Consensus       352 rGkKnyl~p~~L~mg~G~lf~~~~~~~~~  380 (380)
                      |||||||||++|+|||||||+|++++++|
T Consensus       604 rgkkN~lpp~~LvmG~GlLfrldes~~E~  632 (911)
T KOG2030|consen  604 RGKKNFLPPHQLVMGLGLLFRLDESSIER  632 (911)
T ss_pred             ecccCCCCchhheecceeEEEeccchhhh
Confidence            99999999999999999999999999876


No 2  
>COG1293 Predicted RNA-binding protein homologous to eukaryotic snRNP [Transcription]
Probab=100.00  E-value=6.5e-36  Score=314.24  Aligned_cols=239  Identities=27%  Similarity=0.431  Sum_probs=186.2

Q ss_pred             CHHHHHHHHhcchhhhhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHh
Q psy4691          80 SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIA  159 (380)
Q Consensus        80 SFNeAlDeFFs~~e~~k~~~k~~~~~~~~ikKLek~~~~qekrI~~le~~~E~~~~kgeLI~aN~~~Ve~~i~~v~~~~~  159 (380)
                      +||+++|+||...............+...++||++++..|++.++++++.++.++.+|+||++|+++|+..+..|+.+..
T Consensus       266 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~kl~~~i~~~~~~~~~~~~~~~~~r~~g~ll~an~~~i~~~~~~v~~~~~  345 (564)
T COG1293         266 LFNEALDEKFERDKIKQLASELEKKLEKELKKLENKLEKQEDELEELEKAAEELRQKGELLYANLQLIEEGLKSVRLADF  345 (564)
T ss_pred             HHHHHHHHHhhhhhHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhehhhh
Confidence            79999999999988765555567788999999999999999999999988889999999999999999999999987654


Q ss_pred             cCCChHHHHHHHHHHhhcCCccccccccccccccEEEEEeccccccccccccccccCCCCCCCCCCcCCCCCeEEEEecC
Q psy4691         160 NQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLD  239 (380)
Q Consensus       160 ~~~~W~~I~~~i~~~~~~g~pva~~I~~l~l~~n~i~l~L~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~V~idl~  239 (380)
                      .+  +.+|.--+   .....|++     ......++...|                              ++..|.|+..
T Consensus       346 ~~--~~~i~i~l---~~~~~~~~-----~~~~~~~~~~kl------------------------------k~~~~~~~~~  385 (564)
T COG1293         346 YG--NEEIKIEL---DKSKTPSE-----NAQRYFKKYKKL------------------------------KGAKVNLDRQ  385 (564)
T ss_pred             cc--ccceeecc---CcCcccch-----hhHHHhhhhhhc------------------------------cCceeehhhh
Confidence            43  22221100   00000000     000112222222                              2578999999


Q ss_pred             CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHhHHhhhhhhhhccccccccccceeeEEecCCcee----
Q psy4691         240 LSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAE-KKTKQTLKDVQTMTNINKARKVYWFEKFYWFISSENYLG----  314 (380)
Q Consensus       240 ls~~~NA~~yy~~~K~~~~K~~kt~~~~~kalk~~e-~k~~~~l~~~~~~~~~~~~Rk~~WFEKF~WFiSSeg~LV----  314 (380)
                      +++.+|+..||++++++.+|...     .++++.++ +.+++.+.+    ....+.|+++||||||||+|||||||    
T Consensus       386 ~~~~~~~~~y~e~~~~~lek~~~-----~~~ieei~ee~ie~~~~~----~~k~~~kkk~~~ek~~~~~ss~Gf~vi~Gr  456 (564)
T COG1293         386 LSELKEAIAYYESAKTALEKAEG-----KKAIEEIREELIEEGLLK----SKKKKRKKKEWFEKFRWFVSSDGFLVIGGR  456 (564)
T ss_pred             hhhhHHHHHHHHHHHHHHHhccc-----hhhHHHHHHHHHHHHHhh----hhHHhhhhhhhcccceeeeccCCeEEEEec
Confidence            99999999999999888776654     44555554 333333332    34556789999999999999999999    


Q ss_pred             ------------------------------------------------------cccccCCc-cceEEEEeCCCccccCC
Q psy4691         315 ------------------------------------------------------NVAWDAKV-VTNAWWVKADQVSKTAP  339 (380)
Q Consensus       315 ------------------------------------------------------S~AW~~k~-~~~a~WV~~~QVSKt~p  339 (380)
                                                                            |+||.++. ++|+|||++.||||+||
T Consensus       457 na~qNe~l~~k~~~~~DlwfHa~~~~gshvvik~~~~~~~e~ti~eAA~~Aa~~Ska~~~~~v~vd~t~vk~vqv~K~a~  536 (564)
T COG1293         457 NAKQNEELVKKYAEKDDLWFHADDIPGSHVVIKTEGKEPSEETILEAAQLAASYSKAWKSGLVPVDYTWVKPVQVPKGAK  536 (564)
T ss_pred             CcccchHHHHhhcccCcEEEEccCCCCCeEEEeCCCCCCChHHHHHHHHHHHHhchHhhcCCCceEEEEEcccccCCCCC
Confidence                                                                  99999995 99999999999999999


Q ss_pred             CCcccCCceeEEecccccCCCCCceeEE
Q psy4691         340 TGEFLTTGSFMIRGKKNFFPPCQLAMGI  367 (380)
Q Consensus       340 sGeyL~~GsFmIrGkKnyl~p~~L~mg~  367 (380)
                      ||+||++|+|||+|++||+..++|.+++
T Consensus       537 ~G~vl~~g~~~I~~k~~~~~~~~lk~~~  564 (564)
T COG1293         537 SGEVLYKGQKTIRGKRDYITIVKLKLAV  564 (564)
T ss_pred             CceEEecCcEEEEEccccccccccccCC
Confidence            9999999999999999999999988763


No 3  
>PF05833 FbpA:  Fibronectin-binding protein A N-terminus (FbpA);  InterPro: IPR008616 This family consists of the N-terminal region of the prokaryotic fibronectin-binding protein, the C-terminal region is IPR008532 from INTERPRO. Fibronectin binding is considered to be an important virulence factor in streptococcal infections. Fibronectin is a dimeric glycoprotein that is present in a soluble form in plasma and extracellular fluids; it is also present in a fibrillar form on cell surfaces. Both the soluble and cellular forms of fibronectin may be incorporated into the extracellular tissue matrix. While fibronectin has critical roles in eukaryotic cellular processes, such as adhesion, migration and differentiation, it is also a substrate for the attachment of bacteria. The binding of pathogenic Streptococcus pyogenes and Staphylococcus aureus to epithelial cells via fibronectin facilitates their internalisation and systemic spread within the host [].; PDB: 3DOA_A 2ZBK_F 2HKJ_A 1Z5B_A 1Z5C_B 1MX0_F 1Z5A_A 1MU5_A 1Z59_A.
Probab=99.68  E-value=1.2e-17  Score=171.74  Aligned_cols=175  Identities=25%  Similarity=0.422  Sum_probs=28.6

Q ss_pred             cHHHHHHHccC-CCCccEEEeecCCCCCCcccceecccceecccCCCCCCcccccccCCcccccCCHHHHHHHHhcchhh
Q psy4691          16 PTLHVLVSNLN-VSSQGYLVQKKEKRPNSEDYIVANMEFHPMILQPGSTPSVMRQHQGFPIQEFESFAAAVDEFFSTAES   94 (380)
Q Consensus        16 ~~~~~l~~~~~-~~~~Gyiv~k~~~~~~~~~~~~~~~~F~P~~~~~~~~P~~l~~~~~~~~~~f~SFNeAlDeFFs~~e~   94 (380)
                      ..+..|++... +...|||+...+..          ..|+||.         +.++.+....+|+|||+|||+||+..+.
T Consensus       233 ~~~~~l~~~l~~~~~~p~l~~~~~~~----------~~f~~~~---------l~~~~~~~~~~f~s~~~ald~yf~~~~~  293 (455)
T PF05833_consen  233 EAIRELLNELEEGQFKPYLYYDDGKP----------KDFSPFP---------LKQYEELEVKEFDSFNEALDEYFSEKEE  293 (455)
T ss_dssp             HHHHHHHHHHT---S----EEE---------------EEESS-----------TT---S--EE-SSHHHHHHHHT-----
T ss_pred             HHHHHhhhhcccccCccEEEEecCcc----------cEEEEEe---------ccccccccccCCCCHHHHHHHHHHHHHH
Confidence            44556666554 35778998876533          6888885         7777777789999999999999998875


Q ss_pred             h-hHhH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHhcCCChHHHHHH
Q psy4691          95 H-KIDL---KAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEAR  170 (380)
Q Consensus        95 ~-k~~~---k~~~~~~~~ikKLek~~~~qekrI~~le~~~E~~~~kgeLI~aN~~~Ve~~i~~v~~~~~~~~~W~~I~~~  170 (380)
                      + ...+   +.....+..+++++++++.+++.++.++ ..+.++.+|+||.+|.+.+           +.|++|.++..+
T Consensus       294 ~~~~~~~~~~l~k~l~~~~~klek~l~~l~~~~~~~~-~~~~~~~~gelL~a~~~~i-----------~~g~~~~~l~~~  361 (455)
T PF05833_consen  294 EERLEQKKKRLEKKLEKKIKKLEKKLEKLEEELEESE-KAENYREYGELLLANLHQI-----------KKGMKWVELEDF  361 (455)
T ss_dssp             --------------------------------------------------------------------------------
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-hhhHHHHHHHHHHHHHHHh-----------hCCCCEEEhhhh
Confidence            5 2222   2222456667777777777777777777 4477888888888888777           899999999887


Q ss_pred             HHHHhhcCCccccccccccccccEEEEEeccccccccccccccccCCCCCCCCCCcCCCCCeEEEEecCCCHHHHHHHHH
Q psy4691         171 VKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSLVDVDLDLSAYANAKRFF  250 (380)
Q Consensus       171 i~~~~~~g~pva~~I~~l~l~~n~i~l~L~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~V~idl~ls~~~NA~~yy  250 (380)
                      +.. .                                                      .++.|+||+++|+++||..||
T Consensus       362 ~~~-~------------------------------------------------------~~i~I~Ld~~~s~~eNA~~yf  386 (455)
T PF05833_consen  362 YEE-G------------------------------------------------------EEIEIPLDPSLSPSENAQKYF  386 (455)
T ss_dssp             --------------------------------------------------------------------------------
T ss_pred             hcc-C------------------------------------------------------CceEEEECCCCCHHHHHHHHH
Confidence            632 1                                                      148899999999999999999


Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy4691         251 DLKRSAAKKQQKTIQSTEKALKSAEKKTKQ  280 (380)
Q Consensus       251 ~~~K~~~~K~~kt~~~~~kalk~~e~k~~~  280 (380)
                      +++|++..|    ++.++.+++.+++++..
T Consensus       387 ~k~kK~k~k----~~~~~~~i~~~~~el~~  412 (455)
T PF05833_consen  387 KKYKKLKRK----IEKLEERIEEAEKELEY  412 (455)
T ss_dssp             ------------------------------
T ss_pred             HHHHHHHHH----HHHHHHHHHHHHHHHHH
Confidence            999766544    45555555555555443


No 4  
>PF05670 DUF814:  Domain of unknown function (DUF814);  InterPro: IPR008532 This domain occurs in proteins that have been annotated as Fibronectin/fibrinogen binding protein by similarity. This annotation comes from O34693 from SWISSPROT where the N-terminal region is involved in this activity []. Hence the activity of this C-terminal domain is unknown. This domain contains a conserved motif D/E-X-W/Y-X-H that may be functionally important.
Probab=97.38  E-value=9.2e-05  Score=60.38  Aligned_cols=22  Identities=18%  Similarity=0.284  Sum_probs=21.0

Q ss_pred             cccccCCccceEEEEeCCCccc
Q psy4691         315 NVAWDAKVVTNAWWVKADQVSK  336 (380)
Q Consensus       315 S~AW~~k~~~~a~WV~~~QVSK  336 (380)
                      |+||+.+..+.++|..+++|+|
T Consensus        69 Ska~~~~~~v~V~yt~~k~v~K   90 (90)
T PF05670_consen   69 SKAWKKGEKVEVDYTQGKYVKK   90 (90)
T ss_pred             CHhhccCCCeEEEEeehHhccC
Confidence            9999999999999999999987


No 5  
>COG1293 Predicted RNA-binding protein homologous to eukaryotic snRNP [Transcription]
Probab=95.94  E-value=0.14  Score=55.04  Aligned_cols=119  Identities=18%  Similarity=0.212  Sum_probs=89.0

Q ss_pred             ccccCCHHHHHHHHhcchhhh-hHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHH
Q psy4691          75 IQEFESFAAAVDEFFSTAESH-KIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILA  153 (380)
Q Consensus        75 ~~~f~SFNeAlDeFFs~~e~~-k~~~k~~~~~~~~ikKLek~~~~qekrI~~le~~~E~~~~kgeLI~aN~~~Ve~~i~~  153 (380)
                      ...|.+++..+|+||...... ++.+...    ...+++++.+..++++|+.++++.+.++..++......++|-.-+..
T Consensus       257 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~l~~~~~kl~~~i~~~~~~~~~~~~~~~~~r~~g~ll~an~~~  332 (564)
T COG1293         257 LKAYADLEKLFNEALDEKFERDKIKQLAS----ELEKKLEKELKKLENKLEKQEDELEELEKAAEELRQKGELLYANLQL  332 (564)
T ss_pred             ccccchhhHHHHHHHHHHhhhhhHHHhHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            356788999999999888765 3333332    46789999999999999999999999999998999999888776666


Q ss_pred             HHHHHhcCCChHHHHHHHHHHhhcCCccccccccccccccEEEEEeccccccccccccccccCCCCCCCCCCcCCCCCeE
Q psy4691         154 IRQDIANQLSWEDIEARVKQAQRHNDPVASIIKQLKLNINHITLLLRVHKERSNIKDSWVLHHDNADPFADSSETKKPSL  233 (380)
Q Consensus       154 v~~~~~~~~~W~~I~~~i~~~~~~g~pva~~I~~l~l~~n~i~l~L~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  233 (380)
                      |..+...                                    +.+.+..                        ..+.+.
T Consensus       333 i~~~~~~------------------------------------v~~~~~~------------------------~~~~i~  352 (564)
T COG1293         333 IEEGLKS------------------------------------VRLADFY------------------------GNEEIK  352 (564)
T ss_pred             hhhhhhh------------------------------------hehhhhc------------------------ccccee
Confidence            6554322                                    1111000                        012478


Q ss_pred             EEEecCCCHHHHHHHHHHHHHHHH
Q psy4691         234 VDVDLDLSAYANAKRFFDLKRSAA  257 (380)
Q Consensus       234 V~idl~ls~~~NA~~yy~~~K~~~  257 (380)
                      |.+|.+.++-+|+.+||...++..
T Consensus       353 i~l~~~~~~~~~~~~~~~~~~klk  376 (564)
T COG1293         353 IELDKSKTPSENAQRYFKKYKKLK  376 (564)
T ss_pred             eccCcCcccchhhHHHhhhhhhcc
Confidence            888888888899999999988775


No 6  
>COG1730 GIM5 Predicted prefoldin, molecular chaperone implicated in de novo protein folding [Posttranslational modification, protein turnover, chaperones]
Probab=91.72  E-value=9.4  Score=34.12  Aligned_cols=38  Identities=18%  Similarity=0.314  Sum_probs=23.8

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHh
Q psy4691         244 ANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTKQTLKD  284 (380)
Q Consensus       244 ~NA~~yy~~~K~~~~K~~kt~~~~~kalk~~e~k~~~~l~~  284 (380)
                      ..|-.|+++++   .++.+.++....+|..+.+++.+....
T Consensus        93 ~eAie~l~k~~---~~l~~~~~~l~~~l~~l~~~~~~l~~~  130 (145)
T COG1730          93 DEAIEFLKKRI---EELEKAIEKLQQALAELAQRIEQLEQE  130 (145)
T ss_pred             HHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            56888888874   444566666666666666655554333


No 7  
>cd00584 Prefoldin_alpha Prefoldin alpha subunit; Prefoldin is a hexameric molecular chaperone complex, found in both eukaryotes and archaea, that binds and stabilizes newly synthesized polypeptides allowing them to fold correctly.  The complex contains two alpha and four beta subunits, the two subunits being evolutionarily related. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the structure of the hexamer. The structure of the complex consists of a double beta barrel assembly with six protruding coiled-coils.
Probab=70.55  E-value=66  Score=27.31  Aligned_cols=33  Identities=24%  Similarity=0.288  Sum_probs=18.7

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy4691         244 ANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTK  279 (380)
Q Consensus       244 ~NA~~yy~~~K~~~~K~~kt~~~~~kalk~~e~k~~  279 (380)
                      ..|..|++++.+...+..   +...+.+......+.
T Consensus        86 ~eA~~~l~~r~~~l~~~~---~~l~~~l~~l~~~~~  118 (129)
T cd00584          86 EEAIEFLDKKIEELTKQI---EKLQKELAKLKDQIN  118 (129)
T ss_pred             HHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHH
Confidence            568899999866654443   333334444433433


No 8  
>PF03993 DUF349:  Domain of Unknown Function (DUF349);  InterPro: IPR007139 This motif is found singly or as up to five tandem repeats in a small set of bacterial proteins. There are two or three alpha-helices, and possibly a beta-strand.
Probab=63.16  E-value=39  Score=25.81  Aligned_cols=16  Identities=38%  Similarity=0.640  Sum_probs=13.0

Q ss_pred             CHHHHHHHHhcchhhh
Q psy4691          80 SFAAAVDEFFSTAESH   95 (380)
Q Consensus        80 SFNeAlDeFFs~~e~~   95 (380)
                      .|..|+|.||......
T Consensus         6 ~F~~a~~~~~~~~~~~   21 (77)
T PF03993_consen    6 RFRAACDAFFDRRKEF   21 (77)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            5899999999887654


No 9  
>PF04568 IATP:  Mitochondrial ATPase inhibitor, IATP;  InterPro: IPR007648  ATP synthase inhibitor prevents the enzyme from switching to ATP hydrolysis during collapse of the electrochemical gradient, for example during oxygen deprivation [] ATP synthase inhibitor forms a one to one complex with the F1 ATPase, possibly by binding at the alpha-beta interface. It is thought to inhibit ATP synthesis by preventing the release of ATP []. The minimum inhibitory region for bovine inhibitor (P01096 from SWISSPROT) is from residues 39 to 72 []. The inhibitor has two oligomeric states, dimer (the active state) and tetramer. At low pH , the inhibitor forms a dimer via antiparallel coiled coil interactions between the C-terminal regions of two monomers. At high pH, the inhibitor forms tetramers and higher oligomers by coiled coil interactions involving the N terminus and inhibitory region, thus preventing the inhibitory activity []. ; GO: 0004857 enzyme inhibitor activity, 0045980 negative regulation of nucleotide metabolic process, 0005739 mitochondrion; PDB: 1GMJ_B 1OHH_H 1HF9_B 2V7Q_J.
Probab=62.87  E-value=26  Score=29.39  Aligned_cols=27  Identities=30%  Similarity=0.562  Sum_probs=16.4

Q ss_pred             HHHHHHHHHHHHHHH----HHHHHHHHHHHH
Q psy4691         104 QERDALKKLENVKRD----HETRLSALEQTQ  130 (380)
Q Consensus       104 ~~~~~ikKLek~~~~----qekrI~~le~~~  130 (380)
                      ++.+.+++|...+.+    |++.|+++++..
T Consensus        69 ~EkEqL~~Lk~kl~~e~~~~~k~i~~le~~I   99 (100)
T PF04568_consen   69 KEKEQLKKLKEKLKEEIEHHRKEIDELEKHI   99 (100)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence            333444555555555    888888888553


No 10 
>cd00890 Prefoldin Prefoldin is a hexameric molecular chaperone complex, found in both eukaryotes and archaea, that binds and stabilizes newly synthesized polypeptides allowing them to fold correctly.  The complex contains two alpha and four beta subunits, the two subunits being evolutionarily related. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the structure of the hexamer. The structure of the complex consists of a double beta barrel assembly with six protruding coiled-coils.
Probab=58.27  E-value=1.1e+02  Score=25.55  Aligned_cols=33  Identities=21%  Similarity=0.328  Sum_probs=19.1

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy4691         244 ANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTK  279 (380)
Q Consensus       244 ~NA~~yy~~~K~~~~K~~kt~~~~~kalk~~e~k~~  279 (380)
                      ..|..++++++....+..+.+   .+.+..+..++.
T Consensus        86 ~eA~~~l~~r~~~l~~~~~~l---~~~~~~~~~~~~  118 (129)
T cd00890          86 EEAIEFLKKRLETLEKQIEKL---EKQLEKLQDQIT  118 (129)
T ss_pred             HHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHH
Confidence            678899999866655443333   334444443433


No 11 
>PRK03947 prefoldin subunit alpha; Reviewed
Probab=57.00  E-value=1.3e+02  Score=25.97  Aligned_cols=28  Identities=14%  Similarity=0.273  Sum_probs=17.8

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy4691         244 ANAKRFFDLKRSAAKKQQKTIQSTEKAL  271 (380)
Q Consensus       244 ~NA~~yy~~~K~~~~K~~kt~~~~~kal  271 (380)
                      .-|..+++++.....+..+++...-..+
T Consensus        93 ~eA~~~l~~~~~~l~~~~~~l~~~l~~~  120 (140)
T PRK03947         93 DEAIEILDKRKEELEKALEKLEEALQKL  120 (140)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            6799999998777655544444433333


No 12 
>KOG0687|consensus
Probab=50.37  E-value=70  Score=32.65  Aligned_cols=28  Identities=21%  Similarity=0.313  Sum_probs=25.0

Q ss_pred             HHHHhHHHHHHHHHHHHHHHhcCCChHH
Q psy4691         139 LIINNQESVDAAILAIRQDIANQLSWED  166 (380)
Q Consensus       139 LI~aN~~~Ve~~i~~v~~~~~~~~~W~~  166 (380)
                      +++.+...|.+-|..++.++++|=||+-
T Consensus       152 lfy~D~~lV~~~iekak~liE~GgDWeR  179 (393)
T KOG0687|consen  152 LFYLDHDLVTESIEKAKSLIEEGGDWER  179 (393)
T ss_pred             HhhccHHHHHHHHHHHHHHHHhCCChhh
Confidence            5678888999999999999999999975


No 13 
>TIGR00293 prefoldin, archaeal alpha subunit/eukaryotic subunit 5. This model finds a set of small proteins from the Archaea and from Aquifex aeolicus that may represent two orthologous groups. The proteins are predicted to be mostly coiled coil, and may hit large numbers of proteins that contain coiled coil regions.
Probab=48.81  E-value=1.6e+02  Score=24.74  Aligned_cols=21  Identities=14%  Similarity=0.235  Sum_probs=13.8

Q ss_pred             HHHHHHHHHHHHHHHHHHHHH
Q psy4691         244 ANAKRFFDLKRSAAKKQQKTI  264 (380)
Q Consensus       244 ~NA~~yy~~~K~~~~K~~kt~  264 (380)
                      .-|..|++++.+...+..+.+
T Consensus        85 ~eA~~~l~~~~~~l~~~~~~l  105 (126)
T TIGR00293        85 EEAIEFLKKRIEELEKAIEKL  105 (126)
T ss_pred             HHHHHHHHHHHHHHHHHHHHH
Confidence            668899998866654443333


No 14 
>PF09726 Macoilin:  Transmembrane protein;  InterPro: IPR019130  This entry represents the multi-pass transmembrane protein Macoilin, which is highly conserved in eukaryotes. ; GO: 0016021 integral to membrane
Probab=46.79  E-value=1.9e+02  Score=32.38  Aligned_cols=53  Identities=17%  Similarity=0.183  Sum_probs=29.5

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHh
Q psy4691         104 QERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIA  159 (380)
Q Consensus       104 ~~~~~ikKLek~~~~qekrI~~le~~~E~~~~kgeLI~aN~~~Ve~~i~~v~~~~~  159 (380)
                      +.+.++++|+.-++.-++++..++.+...++.+-   ..|-..+|.++.+++.+.+
T Consensus       549 ~lE~E~~~lr~elk~kee~~~~~e~~~~~lr~~~---~e~~~~~e~L~~aL~amqd  601 (697)
T PF09726_consen  549 QLESELKKLRRELKQKEEQIRELESELQELRKYE---KESEKDTEVLMSALSAMQD  601 (697)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---hhhhhhHHHHHHHHHHHHH
Confidence            4445566666666666666666666655554431   2233445555555555443


No 15 
>PF02757 YLP:  YLP motif;  InterPro: IPR004019 The YLP motif is found in one or several copies in various Drosophila proteins. Its function is unknown, however the presence of completely conserved tyrosine residues and its presence in the human Erbb-2 and ErbB-4 receptor protein-tyrosine kinases (2.7.10.1 from EC) may suggest it could be a substrate for tyrosine kinases. ErbBs (1-4) are single-pass transmembrane proteins that activate a wide variety of signalling pathways, including those involved in proliferation, migration, differentiation, survival, and apoptosis; they are frequently misregulated in cancer []. ErbB-2 is an essential component of a neuregulin-receptor complex, although neuregulins do not interact with it alone. ErbB-4 specifically binds and is activated by neuregulins, NRG-2, NRG-3, heparin-binding EGF-like growth factor, betacellulin and NTAK [].
Probab=44.13  E-value=12  Score=18.15  Aligned_cols=6  Identities=33%  Similarity=1.287  Sum_probs=3.0

Q ss_pred             ccCCCC
Q psy4691         356 NFFPPC  361 (380)
Q Consensus       356 nyl~p~  361 (380)
                      .||||+
T Consensus         3 eYLpP~    8 (9)
T PF02757_consen    3 EYLPPV    8 (9)
T ss_pred             cccCCC
Confidence            355554


No 16 
>PF02996 Prefoldin:  Prefoldin subunit;  InterPro: IPR004127 This entry comprises of several prefoldin subunits. Prefoldin (PFD) is a chaperone that interacts exclusively with type II chaperonins, hetero-oligomers lacking an obligate co-chaperonin that are found only in eukaryotes (chaperonin-containing T-complex polypeptide-1 (CCT)) and archaea. Eukaryotic PFD is a multi-subunit complex containing six polypeptides in the molecular mass range of 14-23 kDa. In archaea, on the other hand, PFD is composed of two types of subunits, two alpha and four beta. The six subunits associate to form two back-to-back up-and-down eight-stranded barrels, from which hang six coiled coils. Each subunit contributes one (beta subunits) or two (alpha subunits) beta hairpin turns to the barrels. The coiled coils are formed by the N and C termini of an individual subunit. Overall, this unique arrangement resembles a jellyfish. The eukaryotic PFD hexamer is composed of six different subunits; however, these can be grouped into two alpha-like (PFD3 and -5) and four beta-like (PFD1, -2, -4, and -6) subunits based on amino acid sequence similarity with their archaeal counterparts. Eukaryotic PFD has a six-legged structure similar to that seen in the archaeal homologue [, ]. This family contains the archaeal alpha subunit, eukaryotic prefoldin subunits 3 and 5 and the UXT (ubiquitously expressed transcript) family.   Eukaryotic PFD has been shown to bind both actin and tubulin co-translationally. The chaperone then delivers the target protein to CCT, interacting with the chaperonin through the tips of the coiled coils. No authentic target proteins of any archaeal PFD have been identified, to date.; GO: 0051082 unfolded protein binding, 0006457 protein folding, 0016272 prefoldin complex; PDB: 1FXK_C 2ZDI_C.
Probab=42.12  E-value=2e+02  Score=23.77  Aligned_cols=40  Identities=20%  Similarity=0.332  Sum_probs=22.9

Q ss_pred             eEEEEecCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy4691         232 SLVDVDLDLSAYANAKRFFDLKRSAAKKQQKTIQSTEKALKSAEKKTK  279 (380)
Q Consensus       232 ~~V~idl~ls~~~NA~~yy~~~K~~~~K~~kt~~~~~kalk~~e~k~~  279 (380)
                      +-|+.++     ..|..|+++|.+...+.   ++.....++.+..++.
T Consensus        69 ~~vE~s~-----~eA~~~l~~r~~~l~~~---~~~l~~~~~~~~~~~~  108 (120)
T PF02996_consen   69 YYVEMSL-----EEAIEFLKKRIKELEEQ---LEKLEKELAELQAQIE  108 (120)
T ss_dssp             EEEEEEH-----HHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHH
T ss_pred             eEEEecH-----HHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHH
Confidence            5555554     67999999996665443   3333444444443433


No 17 
>PF15066 CAGE1:  Cancer-associated gene protein 1 family
Probab=39.25  E-value=3.2e+02  Score=29.13  Aligned_cols=63  Identities=17%  Similarity=0.241  Sum_probs=47.8

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHhcCCChHHHHHHHH
Q psy4691         104 QERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVK  172 (380)
Q Consensus       104 ~~~~~ikKLek~~~~qekrI~~le~~~E~~~~kgeLI~aN~~~Ve~~i~~v~~~~~~~~~W~~I~~~i~  172 (380)
                      .....+.||...-++|+-+|..|+-.---+..+-+-|...+..=+-.++.||.+..      .|+++|.
T Consensus       314 aLNEvL~kLk~tn~kQq~~IqdLq~sN~yLe~kvkeLQ~k~~kQqvfvDiinkLk~------niEeLIe  376 (527)
T PF15066_consen  314 ALNEVLQKLKHTNRKQQNRIQDLQCSNLYLEKKVKELQMKITKQQVFVDIINKLKE------NIEELIE  376 (527)
T ss_pred             HHHHHHHHHHhhhHHHHHHHHHhhhccHHHHHHHHHHHHHhhhhhHHHHHHHHHHH------HHHHHHH
Confidence            56678899999999999999999866566666677777776666667777877753      4777763


No 18 
>COG5187 RPN7 26S proteasome regulatory complex component, contains PCI domain [Posttranslational modification, protein turnover, chaperones]
Probab=38.68  E-value=1.6e+02  Score=29.86  Aligned_cols=29  Identities=21%  Similarity=0.363  Sum_probs=25.9

Q ss_pred             HHHHHhHHHHHHHHHHHHHHHhcCCChHH
Q psy4691         138 ELIINNQESVDAAILAIRQDIANQLSWED  166 (380)
Q Consensus       138 eLI~aN~~~Ve~~i~~v~~~~~~~~~W~~  166 (380)
                      -+|+.+.-.|++.|.+++..++.|=||+.
T Consensus       162 g~~y~d~~vV~e~lE~~~~~iEkGgDWeR  190 (412)
T COG5187         162 GLIYGDRKVVEESLEVADDIIEKGGDWER  190 (412)
T ss_pred             HHhhccHHHHHHHHHHHHHHHHhCCCHHh
Confidence            36788888999999999999999999975


No 19 
>PF01519 DUF16:  Protein of unknown function DUF16;  InterPro: IPR002862 Proteins that contain this domain are of unknown function. It appears to be confined to proteins from Mycoplasma pneumoniae [].; PDB: 2BA2_C.
Probab=37.56  E-value=1.9e+02  Score=24.36  Aligned_cols=47  Identities=11%  Similarity=0.251  Sum_probs=31.3

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHH
Q psy4691         105 ERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDI  158 (380)
Q Consensus       105 ~~~~ikKLek~~~~qekrI~~le~~~E~~~~kgeLI~aN~~~Ve~~i~~v~~~~  158 (380)
                      ....+.+|......|-+.|.++..++.   ..|+.|    .+|.+.|..|+.=|
T Consensus        51 qgeqI~kL~e~V~~QGEqIkel~~e~k---~qgktL----~~I~~~L~~inkRL   97 (102)
T PF01519_consen   51 QGEQINKLTEKVDKQGEQIKELQVEQK---AQGKTL----QLILKTLQSINKRL   97 (102)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHH----HHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHH----HHHHHHHHHHHHHH
Confidence            345678888889999999999985554   334433    44666666665543


No 20 
>PF05524 PEP-utilisers_N:  PEP-utilising enzyme, N-terminal;  InterPro: IPR008731  This sequence identifies proteins which are a component of the phosphoenolpyruvate:sugar phosphotransferase system (PTS), a major carbohydrate active transport system. The PTS system is found throughout the bacterial kingdom, and is responsible for the coupled phosphorylation and translocation of numerous sugars across the cytoplasmic membrane []. This entry represents the N-terminal domain of enzyme I (EIN) which transfers the phosphoryl group from phosphoenolpyruvate (PEP) to the phosphoryl carrier protein (HPr) which in turn phosphorylates a group of membrane-associated proteins, known as enzyme II. The N-terminal domain of EI (EIN) extends from residues 1 to 259 and can be phosphorylated in a fully reversible manner by phosphorylated HPr. EIN, however, cannot be autophosphorylated by PEP [, ].; GO: 0005351 sugar:hydrogen symporter activity, 0008965 phosphoenolpyruvate-protein phosphotransferase activity, 0009401 phosphoenolpyruvate-dependent sugar phosphotransferase system, 0005737 cytoplasm; PDB: 2WQD_A 2XDF_B 2HWG_A 3EZB_A 2L5H_A 3EZA_A 1EZB_A 2EZA_A 1EZA_A 1EZC_A ....
Probab=36.55  E-value=1.7e+02  Score=24.48  Aligned_cols=54  Identities=15%  Similarity=0.201  Sum_probs=27.8

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHhHHHHHH--HHHHHHHHHhcC
Q psy4691         108 ALKKLENVKRDHETRLSALEQTQLV--DKEKAELIINNQESVDA--AILAIRQDIANQ  161 (380)
Q Consensus       108 ~ikKLek~~~~qekrI~~le~~~E~--~~~kgeLI~aN~~~Ve~--~i~~v~~~~~~~  161 (380)
                      ++.||.+.+..-...|..+.+....  ....++++.+|+.+++.  .++.|...+.+|
T Consensus        36 E~~rl~~Al~~~~~eL~~l~~~~~~~~~~~~a~If~ah~~~L~D~~l~~~v~~~I~~~   93 (123)
T PF05524_consen   36 EIERLEQALEKAREELEQLAERAESKLGEEEAAIFEAHLMMLEDPELIDEVEELIREG   93 (123)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHHHCHSSCTHHHHHHHHHHT-HHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHhcCHhHHHHHHHHHhcc
Confidence            4455555555555555555433111  12344677777777655  555565555544


No 21 
>PF04048 Sec8_exocyst:  Sec8 exocyst complex component specific domain;  InterPro: IPR007191 Sec8 is a component of the exocyst complex involved in the docking of exocystic vesicles with a fusion site on the plasma membrane. The exocyst complex is composed of Sec3, Sec5, Sec6, Sec8, Sec10, Sec15, Exo70 and Exo84.; GO: 0006904 vesicle docking involved in exocytosis, 0015031 protein transport, 0000145 exocyst
Probab=36.46  E-value=1.3e+02  Score=26.36  Aligned_cols=15  Identities=7%  Similarity=0.536  Sum_probs=9.3

Q ss_pred             cccCCHHHHHHHHhc
Q psy4691          76 QEFESFAAAVDEFFS   90 (380)
Q Consensus        76 ~~f~SFNeAlDeFFs   90 (380)
                      .+|..||.+|-.|=.
T Consensus        62 eh~q~Fn~sI~sy~~   76 (142)
T PF04048_consen   62 EHYQGFNSSIGSYSQ   76 (142)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            456667777776533


No 22 
>PF15463 ECM11:  Extracellular mutant protein 11
Probab=36.43  E-value=2.9e+02  Score=24.09  Aligned_cols=67  Identities=19%  Similarity=0.322  Sum_probs=43.0

Q ss_pred             HHhcchhhh---hHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHH
Q psy4691          87 EFFSTAESH---KIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQ  156 (380)
Q Consensus        87 eFFs~~e~~---k~~~k~~~~~~~~ikKLek~~~~qekrI~~le~~~E~~~~kgeLI~aN~~~Ve~~i~~v~~  156 (380)
                      .||+...-.   ..-....++....+.||...+.......+.|+.+   ....+++|......|+.-|.-|+.
T Consensus        64 ~~fs~ls~~eWe~~Gd~~l~qf~~l~~kl~~~R~~~r~~~~~fe~e---I~~R~eav~~~~~~l~~kL~~mk~  133 (139)
T PF15463_consen   64 EFFSNLSFDEWEEAGDWFLEQFSELMQKLKEARRKLRKKFAVFEDE---INRRAEAVRAQGEQLDRKLEKMKE  133 (139)
T ss_pred             HHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHH
Confidence            355555433   1222333456667788888888888888888733   446677777777777777666653


No 23 
>PF06191 DUF995:  Protein of unknown function (DUF995);  InterPro: IPR009337 This is a family of uncharacterised Proteobacteria proteins.
Probab=34.65  E-value=28  Score=31.14  Aligned_cols=24  Identities=17%  Similarity=0.497  Sum_probs=17.7

Q ss_pred             cccccceeeEEecCCcee-cccccC
Q psy4691         297 VYWFEKFYWFISSENYLG-NVAWDA  320 (380)
Q Consensus       297 ~~WFEKF~WFiSSeg~LV-S~AW~~  320 (380)
                      ...|=.=+||||.+|-|| ...|.+
T Consensus        67 ~~s~g~GrW~Vt~~GklC~~a~W~~   91 (145)
T PF06191_consen   67 SESYGEGRWFVTDNGKLCFRATWHS   91 (145)
T ss_pred             cceeeeeEEEECCCCCEEEEeEEEC
Confidence            355566699999999999 445544


No 24 
>PF07462 MSP1_C:  Merozoite surface protein 1 (MSP1) C-terminus;  InterPro: IPR010901 This entry represents the C-terminal region of merozoite surface protein 1 (MSP1), which is found in a number of Plasmodium species. MSP-1 is a 200 kDa protein expressed on the surface of the Plasmodium vivax merozoite. MSP-1 of Plasmodium species is synthesised as a high-molecular-weight precursor and then processed into several fragments. At the time of red cell invasion by the merozoite, only the 19 kDa C-terminal fragment (MSP-119), which contains two epidermal growth factor-like domains, remains on the surface. Antibodies against MSP-119 inhibit merozoite entry into red cells, and immunisation with MSP-119 protects monkeys from challenging infections. Hence, MSP-119 is considered a promising vaccine candidate [].; GO: 0009405 pathogenesis, 0016020 membrane
Probab=34.37  E-value=6.4e+02  Score=27.44  Aligned_cols=41  Identities=34%  Similarity=0.403  Sum_probs=33.5

Q ss_pred             cCCCHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHH
Q psy4691         238 LDLSAYANAKRFFDLKRSAAKKQ-QKTIQSTEKALKSAEKKT  278 (380)
Q Consensus       238 l~ls~~~NA~~yy~~~K~~~~K~-~kt~~~~~kalk~~e~k~  278 (380)
                      -..|+-.|-..||++||.+++|. +-|+++++.-||=-..++
T Consensus       120 ~p~~vl~n~~vfFnKkrEaek~eveNtlkNt~iLlkyYKa~~  161 (574)
T PF07462_consen  120 NPTSVLKNFTVFFNKKREAEKKEVENTLKNTEILLKYYKARA  161 (574)
T ss_pred             CcHHHHHHHHHHHhhhhHHHHHHHHHHhhhHHHHHHHHHHHH
Confidence            34688899999999999998877 788888888888755444


No 25 
>TIGR02894 DNA_bind_RsfA transcription factor, RsfA family. In a subset of endospore-forming members of the Firmcutes, members of this protein family are found, several to a genome. Two very strongly conserved sequences regions are separated by a highly variable linker region. Much of the linker region was excised from the seed alignment for this model. A characterized member is the prespore-specific transcription RsfA from Bacillus subtilis, previously called YwfN, which is controlled by sigma factor F and seems to fine-tune expression of some genes in the sigma-F regulon. A paralog in Bacillus subtilis is designated YlbO.
Probab=34.17  E-value=3.7e+02  Score=24.56  Aligned_cols=53  Identities=13%  Similarity=0.109  Sum_probs=35.6

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHH
Q psy4691         105 ERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQD  157 (380)
Q Consensus       105 ~~~~ikKLek~~~~qekrI~~le~~~E~~~~kgeLI~aN~~~Ve~~i~~v~~~  157 (380)
                      ...+.++|+..+..++.+++.|+.+.+.+...-..|..-|..+=.+|+-.|.+
T Consensus       102 ~~~e~~~l~~e~~~l~~~~e~Le~e~~~L~~~~~~~~eDY~~L~~Im~RARkl  154 (161)
T TIGR02894       102 LQKENERLKNQNESLQKRNEELEKELEKLRQRLSTIEEDYQTLIDIMDRARKL  154 (161)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            34455666666677777777777776777666777777777666666666654


No 26 
>PF10168 Nup88:  Nuclear pore component;  InterPro: IPR019321  Nup88 can be divided into two structural domains; the N-terminal two-thirds of the protein have no obvious structural motifs. It is, however, where it binds to Nup98; one of the components of the nuclear pore. The C-terminal end is a predicted coiled-coil domain []. Nup88 is over expressed in tumour cells []. 
Probab=34.07  E-value=3.6e+02  Score=30.24  Aligned_cols=67  Identities=18%  Similarity=0.144  Sum_probs=46.5

Q ss_pred             CHHHHHHHHhcchhhhhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHH
Q psy4691          80 SFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVD  148 (380)
Q Consensus        80 SFNeAlDeFFs~~e~~k~~~k~~~~~~~~ikKLek~~~~qekrI~~le~~~E~~~~kgeLI~aN~~~Ve  148 (380)
                      -+.+|+.-|....-. + ..++....+..+..|...+++|.++|..++++.+..+..++.|...+..+.
T Consensus       540 lL~~a~~vlreeYi~-~-~~~ar~ei~~rv~~Lk~~~e~Ql~~L~~l~e~~~~l~~~ae~LaeR~e~a~  606 (717)
T PF10168_consen  540 LLSQATKVLREEYIE-K-QDLAREEIQRRVKLLKQQKEQQLKELQELQEERKSLRESAEKLAERYEEAK  606 (717)
T ss_pred             HHHHHHHHHHHHHHH-H-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            355555555432111 1 123444667777888999999999999999999999999998887666553


No 27 
>PF05384 DegS:  Sensor protein DegS;  InterPro: IPR008595 This is a group of Bacillus DegS proteins. The DegS-DegU two-component regulatory system of Bacillus subtilis controls various processes that characterise the transition from the exponential to the stationary growth phase, including the induction of extracellular degradative enzymes, expression of late competence genes and down-regulation of the sigma D regulon []. The entry also contains one sequence Q8R9D3 from SWISSPROT from Thermoanaerobacter tengcongensis which is described as a sensory transduction histidine kinase.; GO: 0016301 kinase activity, 0007165 signal transduction
Probab=33.62  E-value=3.7e+02  Score=24.39  Aligned_cols=20  Identities=40%  Similarity=0.577  Sum_probs=11.1

Q ss_pred             HHHHHHHHHHHHHHHHHHHH
Q psy4691         110 KKLENVKRDHETRLSALEQT  129 (380)
Q Consensus       110 kKLek~~~~qekrI~~le~~  129 (380)
                      +.|..++.+++.++..+...
T Consensus       101 ~qLr~rRD~LErrl~~l~~t  120 (159)
T PF05384_consen  101 KQLRERRDELERRLRNLEET  120 (159)
T ss_pred             HHHHHHHHHHHHHHHHHHHH
Confidence            44555556666665555533


No 28 
>PF14193 DUF4315:  Domain of unknown function (DUF4315)
Probab=30.37  E-value=3e+02  Score=22.31  Aligned_cols=58  Identities=17%  Similarity=0.264  Sum_probs=40.4

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHhcCCChHHHHHHHHHHhh
Q psy4691         104 QERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQR  176 (380)
Q Consensus       104 ~~~~~ikKLek~~~~qekrI~~le~~~E~~~~kgeLI~aN~~~Ve~~i~~v~~~~~~~~~W~~I~~~i~~~~~  176 (380)
                      ++...+.|...++.+++.++..|+.+...        ..|.    +|+.+||++   +|+=.++..+++..+.
T Consensus         5 Ki~~eieK~k~Kiae~Q~rlK~Le~qk~E--------~EN~----EIv~~VR~~---~mtp~eL~~~L~~~~~   62 (83)
T PF14193_consen    5 KIRAEIEKTKEKIAELQARLKELEAQKTE--------AENL----EIVQMVRSM---KMTPEELAAFLRAMKS   62 (83)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHH----HHHHHHHHc---CCCHHHHHHHHHHHHh
Confidence            45667778888888888888888744222        2444    455666654   6899999999987754


No 29 
>KOG0018|consensus
Probab=28.93  E-value=1e+03  Score=28.13  Aligned_cols=55  Identities=22%  Similarity=0.254  Sum_probs=43.8

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHhc
Q psy4691         106 RDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIAN  160 (380)
Q Consensus       106 ~~~ikKLek~~~~qekrI~~le~~~E~~~~kgeLI~aN~~~Ve~~i~~v~~~~~~  160 (380)
                      ...++++...+..++.++...+.++...+....-|+.|+..|+..+..-+..++.
T Consensus       233 ~~els~~~~ei~~~~~~~d~~e~ei~~~k~e~~ki~re~~~~Dk~i~~ke~~l~e  287 (1141)
T KOG0018|consen  233 NDELSRLNAEIPKLKERMDKKEREIRVRKKERGKIRRELQKVDKKISEKEEKLAE  287 (1141)
T ss_pred             hHHHHHHhhhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence            3456677777888888888888888888777778999999999988888777754


No 30 
>KOG4460|consensus
Probab=28.15  E-value=3.9e+02  Score=29.20  Aligned_cols=66  Identities=15%  Similarity=0.059  Sum_probs=43.5

Q ss_pred             ccCCHHHHHHHHhcchhhhhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhH
Q psy4691          77 EFESFAAAVDEFFSTAESHKIDLKAVQQERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQ  144 (380)
Q Consensus        77 ~f~SFNeAlDeFFs~~e~~k~~~k~~~~~~~~ikKLek~~~~qekrI~~le~~~E~~~~kgeLI~aN~  144 (380)
                      .|.=.|+|+.-|--..-  +..-......+..+.-|..++++|-++|..++++.+..++.++-+..-+
T Consensus       560 ~~~lL~~a~~vfrEqYi--~~~dlV~~e~qrH~~~l~~~k~~QlQ~l~~~~eer~~i~e~a~~La~R~  625 (741)
T KOG4460|consen  560 CLQLLSRATQVFREQYI--LKQDLVKEEIQRHVKLLCDQKKKQLQDLSYCREERKSLREMAERLADRY  625 (741)
T ss_pred             HHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            34446677765532221  1112233456677788899999999999999988888888877765444


No 31 
>cd00584 Prefoldin_alpha Prefoldin alpha subunit; Prefoldin is a hexameric molecular chaperone complex, found in both eukaryotes and archaea, that binds and stabilizes newly synthesized polypeptides allowing them to fold correctly.  The complex contains two alpha and four beta subunits, the two subunits being evolutionarily related. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the structure of the hexamer. The structure of the complex consists of a double beta barrel assembly with six protruding coiled-coils.
Probab=27.59  E-value=2.6e+02  Score=23.57  Aligned_cols=6  Identities=17%  Similarity=0.445  Sum_probs=3.0

Q ss_pred             CHHHHH
Q psy4691          80 SFAAAV   85 (380)
Q Consensus        80 SFNeAl   85 (380)
                      ++.+|+
T Consensus        84 ~~~eA~   89 (129)
T cd00584          84 DLEEAI   89 (129)
T ss_pred             cHHHHH
Confidence            355555


No 32 
>PF08336 P4Ha_N:  Prolyl 4-Hydroxylase alpha-subunit, N-terminal region;  InterPro: IPR013547 The members found in this entry are eukaryotic proteins, and include all three isoforms of the prolyl 4-hydroxylase alpha subunit. This enzyme (1.14.11.2 from EC) is important in the post-translational modification of collagen, as it catalyses the formation of 4-hydroxyproline. In vertebrates, the complete enzyme is an alpha2-beta2 tetramer; the beta-subunit is identical to protein disulphide isomerase [, , , ]. The function of the N-terminal region featured in this family does not seem to be known. ; GO: 0004656 procollagen-proline 4-dioxygenase activity, 0016702 oxidoreductase activity, acting on single donors with incorporation of molecular oxygen, incorporation of two atoms of oxygen, 0055114 oxidation-reduction process, 0005783 endoplasmic reticulum
Probab=26.43  E-value=4.1e+02  Score=22.71  Aligned_cols=23  Identities=26%  Similarity=0.319  Sum_probs=16.6

Q ss_pred             HHHHHHHHHHhcCCChHHHHHHHHHH
Q psy4691         149 AAILAIRQDIANQLSWEDIEARVKQA  174 (380)
Q Consensus       149 ~~i~~v~~~~~~~~~W~~I~~~i~~~  174 (380)
                      .+...|+.+.   .||..++.+++..
T Consensus        61 naF~LIrRl~---~dW~~~~~~~~~~   83 (134)
T PF08336_consen   61 NAFSLIRRLH---QDWPKWEKLMEQP   83 (134)
T ss_pred             HHHHHHHHHH---HhhhhHHHHHHHh
Confidence            4555666664   5899999998665


No 33 
>PF02770 Acyl-CoA_dh_M:  Acyl-CoA dehydrogenase, middle domain;  InterPro: IPR006091  Acyl-CoA dehydrogenases (1.3.99.3 from EC) are a family of flavoproteins that catalyse the alpha,beta-dehydrogenation of acyl-CoA thioesters to the corresponding trans 2,3-enoyl CoA-products with the concomitant reduction of enzyme-bound FAD. Different family members share a high sequence identity, catalytic mechanisms, and structural properties, but differ in the position of their catalytic bases and in their substrate binding specificity. Butyryl-CoA dehydrogenase [] prefers short chain substrates, medium chain- and long-chain acyl-CoA dehydrogenases prefer medium and long chain substrates, respectively, and Isovaleryl-CoA dehydrogenase [] prefers branched-chain substrates.  The monomeric enzyme is folded into three domains of approximately equal size, where the N-terminal domain is all-alpha, the middle domain is an open (5,8) barrel, and the C-terminal domain is a four-helical bundle. The constituent families differ in the numbers of C-terminal domains. This entry represents the middle beta-barrel domain found in medium chain acyl-CoA dehydrogenases, as well as in the related peroxisomal acyl-CoA oxidase-II enzymes. Acyl-CoA oxidase (ACO; 1.3.3.6 from EC) catalyzes the first and rate-determining step of the peroxisomal beta-oxidation of fatty acids [].; GO: 0003995 acyl-CoA dehydrogenase activity, 0055114 oxidation-reduction process; PDB: 3MDD_B 1UDY_C 3MDE_B 2UXW_A 3B96_A 1SIQ_A 1SIR_A 2R0N_A 2R0M_A 2D29_B ....
Probab=25.99  E-value=64  Score=23.07  Aligned_cols=27  Identities=15%  Similarity=0.249  Sum_probs=19.9

Q ss_pred             ceeEEecccccCCCCCceeEEEEEEEe
Q psy4691         347 GSFMIRGKKNFFPPCQLAMGISFLFKL  373 (380)
Q Consensus       347 GsFmIrGkKnyl~p~~L~mg~G~lf~~  373 (380)
                      |.|+|.|.|.|+...+..=.+-++-++
T Consensus        25 ~~~~L~G~K~~v~~~~~a~~~~v~ar~   51 (52)
T PF02770_consen   25 DGYVLNGEKRFVSNAPDADWFLVFART   51 (52)
T ss_dssp             TEEEEEEEEEEEETTTTESEEEEEEEE
T ss_pred             ceEEEeeEEEEECCcCccCEEEEEEEE
Confidence            349999999999988766555555544


No 34 
>PF02996 Prefoldin:  Prefoldin subunit;  InterPro: IPR004127 This entry comprises of several prefoldin subunits. Prefoldin (PFD) is a chaperone that interacts exclusively with type II chaperonins, hetero-oligomers lacking an obligate co-chaperonin that are found only in eukaryotes (chaperonin-containing T-complex polypeptide-1 (CCT)) and archaea. Eukaryotic PFD is a multi-subunit complex containing six polypeptides in the molecular mass range of 14-23 kDa. In archaea, on the other hand, PFD is composed of two types of subunits, two alpha and four beta. The six subunits associate to form two back-to-back up-and-down eight-stranded barrels, from which hang six coiled coils. Each subunit contributes one (beta subunits) or two (alpha subunits) beta hairpin turns to the barrels. The coiled coils are formed by the N and C termini of an individual subunit. Overall, this unique arrangement resembles a jellyfish. The eukaryotic PFD hexamer is composed of six different subunits; however, these can be grouped into two alpha-like (PFD3 and -5) and four beta-like (PFD1, -2, -4, and -6) subunits based on amino acid sequence similarity with their archaeal counterparts. Eukaryotic PFD has a six-legged structure similar to that seen in the archaeal homologue [, ]. This family contains the archaeal alpha subunit, eukaryotic prefoldin subunits 3 and 5 and the UXT (ubiquitously expressed transcript) family.   Eukaryotic PFD has been shown to bind both actin and tubulin co-translationally. The chaperone then delivers the target protein to CCT, interacting with the chaperonin through the tips of the coiled coils. No authentic target proteins of any archaeal PFD have been identified, to date.; GO: 0051082 unfolded protein binding, 0006457 protein folding, 0016272 prefoldin complex; PDB: 1FXK_C 2ZDI_C.
Probab=25.81  E-value=2.9e+02  Score=22.74  Aligned_cols=7  Identities=29%  Similarity=0.591  Sum_probs=3.3

Q ss_pred             CHHHHHH
Q psy4691          80 SFAAAVD   86 (380)
Q Consensus        80 SFNeAlD   86 (380)
                      |+.+|++
T Consensus        74 s~~eA~~   80 (120)
T PF02996_consen   74 SLEEAIE   80 (120)
T ss_dssp             EHHHHHH
T ss_pred             cHHHHHH
Confidence            3445544


No 35 
>PF07889 DUF1664:  Protein of unknown function (DUF1664);  InterPro: IPR012458 The members of this family are hypothetical plant proteins of unknown function. The region featured in this family is approximately 100 amino acids long. 
Probab=25.15  E-value=4.6e+02  Score=22.85  Aligned_cols=26  Identities=0%  Similarity=0.125  Sum_probs=12.8

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy4691         105 ERDALKKLENVKRDHETRLSALEQTQ  130 (380)
Q Consensus       105 ~~~~ikKLek~~~~qekrI~~le~~~  130 (380)
                      ...++..+..++++|.+..+..+++.
T Consensus        66 LsqRId~vd~klDe~~ei~~~i~~eV   91 (126)
T PF07889_consen   66 LSQRIDRVDDKLDEQKEISKQIKDEV   91 (126)
T ss_pred             HHHHHHHHHhhHHHHHHHHHHHHHHH
Confidence            34444555555555555555554443


No 36 
>PF13747 DUF4164:  Domain of unknown function (DUF4164)
Probab=23.87  E-value=4e+02  Score=21.64  Aligned_cols=23  Identities=13%  Similarity=0.265  Sum_probs=10.3

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHH
Q psy4691         105 ERDALKKLENVKRDHETRLSALE  127 (380)
Q Consensus       105 ~~~~ikKLek~~~~qekrI~~le  127 (380)
                      ....+.+|..-+..+..++.+.+
T Consensus        37 ~e~ei~~l~~dr~rLa~eLD~~~   59 (89)
T PF13747_consen   37 LEEEIQRLDADRSRLAQELDQAE   59 (89)
T ss_pred             HHHHHHHHHhhHHHHHHHHHhHH
Confidence            33344444444444444444444


No 37 
>PF15437 PGBA_C:  Plasminogen-binding protein pgbA C-terminal
Probab=23.78  E-value=1.9e+02  Score=23.51  Aligned_cols=18  Identities=28%  Similarity=0.368  Sum_probs=8.3

Q ss_pred             CCCHHHHHHHHHHHHHHH
Q psy4691         239 DLSAYANAKRFFDLKRSA  256 (380)
Q Consensus       239 ~ls~~~NA~~yy~~~K~~  256 (380)
                      .||+-.-=+++-+.+|++
T Consensus        33 kl~~kEekrrLkeEkKkA   50 (86)
T PF15437_consen   33 KLSPKEEKRRLKEEKKKA   50 (86)
T ss_pred             cCCHHHHHHHHHHHHHHH
Confidence            445544444444444433


No 38 
>smart00685 DM14 Repeats in fly CG4713, worm Y37H9A.3 and human FLJ20241.
Probab=23.74  E-value=3e+02  Score=20.88  Aligned_cols=48  Identities=23%  Similarity=0.300  Sum_probs=29.1

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHhcCCChHHHHHHHHHHhhcCCcc
Q psy4691         118 DHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQDIANQLSWEDIEARVKQAQRHNDPV  181 (380)
Q Consensus       118 ~qekrI~~le~~~E~~~~kgeLI~aN~~~Ve~~i~~v~~~~~~~~~W~~I~~~i~~~~~~g~pv  181 (380)
                      -++.|++.|+..+..-+..|+        +++|....|.+       ..++.+|+.. +.|.||
T Consensus         3 ~L~~R~~~yk~Aa~~AK~~gd--------~~kAr~~~R~~-------K~~~~~I~~~-~aG~pV   50 (59)
T smart00685        3 LLQQRQEQYKQAALQAKRAGD--------EEKARRHLRIA-------KQFDDAIKAA-RAGRPV   50 (59)
T ss_pred             HHHHHHHHHHHHHHHHHHcCC--------HHHHHHHHHHH-------hhHHHHHHHH-HCCCCC
Confidence            456777777777777766666        45555555554       2355666554 356665


No 39 
>PF13747 DUF4164:  Domain of unknown function (DUF4164)
Probab=23.17  E-value=4.1e+02  Score=21.56  Aligned_cols=53  Identities=17%  Similarity=0.254  Sum_probs=32.6

Q ss_pred             ccCCHHHHHHHHhcchhhh-hHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy4691          77 EFESFAAAVDEFFSTAESH-KIDLKAVQQERDALKKLENVKRDHETRLSALEQTQ  130 (380)
Q Consensus        77 ~f~SFNeAlDeFFs~~e~~-k~~~k~~~~~~~~ikKLek~~~~qekrI~~le~~~  130 (380)
                      .++++..+|+.-|...... .++... +....-..+|...+...+.+...++..+
T Consensus        16 aid~LE~~v~~r~~~~~~~~~~e~ei-~~l~~dr~rLa~eLD~~~ar~~~Le~~~   69 (89)
T PF13747_consen   16 AIDRLEKAVDRRLERDRKRDELEEEI-QRLDADRSRLAQELDQAEARANRLEEAN   69 (89)
T ss_pred             HHHHHHHHHHHHHHhhhhhhhHHHHH-HHHHhhHHHHHHHHHhHHHHHHHHHHHH
Confidence            3555666666666655544 222222 4556666778888888888877777443


No 40 
>PF11221 Med21:  Subunit 21 of Mediator complex;  InterPro: IPR021384 The Mediator complex is a coactivator involved in the regulated transcription of nearly all RNA polymerase II-dependent genes. Mediator functions as a bridge to convey information from gene-specific regulatory proteins to the basal RNA polymerase II transcription machinery. The Mediator complex, having a compact conformation in its free form, is recruited to promoters by direct interactions with regulatory proteins and serves for the assembly of a functional preinitiation complex with RNA polymerase II and the general transcription factors. On recruitment the Mediator complex unfolds to an extended conformation and partially surrounds RNA polymerase II, specifically interacting with the unphosphorylated form of the C-terminal domain (CTD) of RNA polymerase II. The Mediator complex dissociates from the RNA polymerase II holoenzyme and stays at the promoter when transcriptional elongation begins.  The Mediator complex is composed of at least 31 subunits: MED1, MED4, MED6, MED7, MED8, MED9, MED10, MED11, MED12, MED13, MED13L, MED14, MED15, MED16, MED17, MED18, MED19, MED20, MED21, MED22, MED23, MED24, MED25, MED26, MED27, MED29, MED30, MED31, CCNC, CDK8 and CDC2L6/CDK11.  The subunits form at least three structurally distinct submodules. The head and the middle modules interact directly with RNA polymerase II, whereas the elongated tail module interacts with gene-specific regulatory proteins. Mediator containing the CDK8 module is less active than Mediator lacking this module in supporting transcriptional activation.   The head module contains: MED6, MED8, MED11, SRB4/MED17, SRB5/MED18, ROX3/MED19, SRB2/MED20 and SRB6/MED22.  The middle module contains: MED1, MED4, NUT1/MED5, MED7, CSE2/MED9, NUT2/MED10, SRB7/MED21 and SOH1/MED31. CSE2/MED9 interacts directly with MED4.  The tail module contains: MED2, PGD1/MED3, RGR1/MED14, GAL11/MED15 and SIN4/MED16.  The CDK8 module contains: MED12, MED13, CCNC and CDK8.   Individual preparations of the Mediator complex lacking one or more distinct subunits have been variously termed ARC, CRSP, DRIP, PC2, SMCC and TRAP.  Med21 has been known as Srb7 in yeasts, hSrb7 in humans and Trap 19 in Drosophila. The heterodimer of the two subunits Med7 and Med21 appears to act as a hinge between the middle and the tail regions of Mediator []. ; PDB: 1YKE_B 1YKH_B.
Probab=22.91  E-value=3.2e+02  Score=23.99  Aligned_cols=70  Identities=16%  Similarity=0.233  Sum_probs=36.4

Q ss_pred             CHHHHHHHHhcchhhhhHhHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHH
Q psy4691          80 SFAAAVDEFFSTAESHKIDLKAVQQERDALKKL---ENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAILAIRQ  156 (380)
Q Consensus        80 SFNeAlDeFFs~~e~~k~~~k~~~~~~~~ikKL---ek~~~~qekrI~~le~~~E~~~~kgeLI~aN~~~Ve~~i~~v~~  156 (380)
                      .|...+.+|=..+...      ..++..-|+.|   +..-++|.++|.+|+   ++++..+..+..=+..-+..+..|+.
T Consensus        66 ~~~~~~~elA~dIi~k------akqIe~LIdsLPg~~~see~Q~~~i~~L~---~E~~~~~~el~~~v~e~e~ll~~v~~  136 (144)
T PF11221_consen   66 EFEENIKELATDIIRK------AKQIEYLIDSLPGIEVSEEEQLKRIKELE---EENEEAEEELQEAVKEAEELLKQVQE  136 (144)
T ss_dssp             HHHHHHHHHHHHHHHH------HHHHHHHHHHSTTSSS-HHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             hHHHHHHHHHHHHHHH------HHHHHHHHHhCCCCCCCHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            4777777664433321      11222222322   344566777777776   34555555555555555666666665


Q ss_pred             HH
Q psy4691         157 DI  158 (380)
Q Consensus       157 ~~  158 (380)
                      ++
T Consensus       137 ~i  138 (144)
T PF11221_consen  137 LI  138 (144)
T ss_dssp             HH
T ss_pred             HH
Confidence            54


No 41 
>PF13863 DUF4200:  Domain of unknown function (DUF4200)
Probab=22.27  E-value=4.6e+02  Score=21.80  Aligned_cols=37  Identities=22%  Similarity=0.299  Sum_probs=24.8

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy4691         104 QERDALKKLENVKRDHETRLSALEQTQLVDKEKAELI  140 (380)
Q Consensus       104 ~~~~~ikKLek~~~~qekrI~~le~~~E~~~~kgeLI  140 (380)
                      .....+.+|...+..+...+..++.....|..+.+.|
T Consensus        78 ~k~~ei~~l~~~l~~l~~~~~k~e~~l~~~~~Y~~fL  114 (126)
T PF13863_consen   78 EKEAEIKKLKAELEELKSEISKLEEKLEEYKKYEEFL  114 (126)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            4556667777777777777777776666666665554


No 42 
>PF05833 FbpA:  Fibronectin-binding protein A N-terminus (FbpA);  InterPro: IPR008616 This family consists of the N-terminal region of the prokaryotic fibronectin-binding protein, the C-terminal region is IPR008532 from INTERPRO. Fibronectin binding is considered to be an important virulence factor in streptococcal infections. Fibronectin is a dimeric glycoprotein that is present in a soluble form in plasma and extracellular fluids; it is also present in a fibrillar form on cell surfaces. Both the soluble and cellular forms of fibronectin may be incorporated into the extracellular tissue matrix. While fibronectin has critical roles in eukaryotic cellular processes, such as adhesion, migration and differentiation, it is also a substrate for the attachment of bacteria. The binding of pathogenic Streptococcus pyogenes and Staphylococcus aureus to epithelial cells via fibronectin facilitates their internalisation and systemic spread within the host [].; PDB: 3DOA_A 2ZBK_F 2HKJ_A 1Z5B_A 1Z5C_B 1MX0_F 1Z5A_A 1MU5_A 1Z59_A.
Probab=22.09  E-value=20  Score=37.11  Aligned_cols=44  Identities=23%  Similarity=0.283  Sum_probs=0.0

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHH
Q psy4691         104 QERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESVDAAIL  152 (380)
Q Consensus       104 ~~~~~ikKLek~~~~qekrI~~le~~~E~~~~kgeLI~aN~~~Ve~~i~  152 (380)
                      ..+...+++++.+.+++++++.++++.+..+..     .++..+.++|.
T Consensus       300 ~~~~l~k~l~~~~~klek~l~~l~~~~~~~~~~-----~~~~~~gelL~  343 (455)
T PF05833_consen  300 KKKRLEKKLEKKIKKLEKKLEKLEEELEESEKA-----ENYREYGELLL  343 (455)
T ss_dssp             -------------------------------------------------
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-----hHHHHHHHHHH
Confidence            444555779999999999999999776666544     45555544443


No 43 
>PF08580 KAR9:  Yeast cortical protein KAR9;  InterPro: IPR013889  The KAR9 protein in Saccharomyces cerevisiae (Baker's yeast) is a cytoskeletal protein required for karyogamy, correct positioning of the mitotic spindle and for orientation of cytoplasmic microtubules []. KAR9 localises at the shmoo tip in mating cells and at the tip of the growing bud in anaphase []. 
Probab=21.75  E-value=4.5e+02  Score=29.35  Aligned_cols=24  Identities=13%  Similarity=0.574  Sum_probs=18.3

Q ss_pred             HHHHHHHHHHHHHhcCCChHHHHH
Q psy4691         146 SVDAAILAIRQDIANQLSWEDIEA  169 (380)
Q Consensus       146 ~Ve~~i~~v~~~~~~~~~W~~I~~  169 (380)
                      .|.+.|..|+.-++-.|.|.||..
T Consensus       106 ~vK~~L~~vK~qveiAmE~~EL~~  129 (683)
T PF08580_consen  106 DVKKTLISVKKQVEIAMEWEELWN  129 (683)
T ss_pred             HHHHHHHHHHHHHHHHHhHHHHHH
Confidence            466777778888888888888764


No 44 
>PF04363 DUF496:  Protein of unknown function (DUF496);  InterPro: IPR007458 Members of this family are uncharacterised proteins.
Probab=21.54  E-value=4.7e+02  Score=21.63  Aligned_cols=19  Identities=32%  Similarity=0.396  Sum_probs=14.1

Q ss_pred             HHhcCCChHHHHHHHHHHh
Q psy4691         157 DIANQLSWEDIEARVKQAQ  175 (380)
Q Consensus       157 ~~~~~~~W~~I~~~i~~~~  175 (380)
                      -+..+|+-.+|..+|..-+
T Consensus        44 YI~~~Ms~edi~~II~nMr   62 (95)
T PF04363_consen   44 YIKPDMSIEDIRAIIENMR   62 (95)
T ss_pred             HccCCCCHHHHHHHHHHHH
Confidence            3467899999988886543


No 45 
>PF07889 DUF1664:  Protein of unknown function (DUF1664);  InterPro: IPR012458 The members of this family are hypothetical plant proteins of unknown function. The region featured in this family is approximately 100 amino acids long. 
Probab=21.42  E-value=5.5e+02  Score=22.38  Aligned_cols=44  Identities=18%  Similarity=0.221  Sum_probs=22.6

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHH
Q psy4691         104 QERDALKKLENVKRDHETRLSALEQTQLVDKEKAELIINNQESV  147 (380)
Q Consensus       104 ~~~~~ikKLek~~~~qekrI~~le~~~E~~~~kgeLI~aN~~~V  147 (380)
                      +....-..|...++.+..||+.+...+++.......|......|
T Consensus        51 ql~~vs~~l~~tKkhLsqRId~vd~klDe~~ei~~~i~~eV~~v   94 (126)
T PF07889_consen   51 QLEQVSESLSSTKKHLSQRIDRVDDKLDEQKEISKQIKDEVTEV   94 (126)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHH
Confidence            33344444555555555566665555555555555554444333


No 46 
>PF10779 XhlA:  Haemolysin XhlA;  InterPro: IPR019715 Haemolysin XhlA is a cell-surface associated haemolysin that lyses the two most prevalent types of insect immune cells (granulocytes and plasmatocytes) as well as rabbit and horse erythrocytes []. 
Probab=21.06  E-value=3.9e+02  Score=20.49  Aligned_cols=29  Identities=24%  Similarity=0.538  Sum_probs=17.2

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy4691         106 RDALKKLENVKRDHETRLSALEQTQLVDK  134 (380)
Q Consensus       106 ~~~ikKLek~~~~qekrI~~le~~~E~~~  134 (380)
                      +..+.+++..++.+++++..++...+.+.
T Consensus         5 ~e~l~~ie~~l~~~~~~i~~lE~~~~~~e   33 (71)
T PF10779_consen    5 KEKLNRIETKLDNHEERIDKLEKRDAANE   33 (71)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            44556666666666666666665544443


No 47 
>PRK13729 conjugal transfer pilus assembly protein TraB; Provisional
Probab=20.86  E-value=3.5e+02  Score=28.82  Aligned_cols=11  Identities=27%  Similarity=0.643  Sum_probs=7.2

Q ss_pred             CCcccCCceeE
Q psy4691         340 TGEFLTTGSFM  350 (380)
Q Consensus       340 sGeyL~~GsFm  350 (380)
                      +-.|||.|||+
T Consensus       210 ~~~YiPsgSf~  220 (475)
T PRK13729        210 SLPYIPSGSFA  220 (475)
T ss_pred             cCceeCCCCeE
Confidence            33577777774


No 48 
>PRK05423 hypothetical protein; Provisional
Probab=20.74  E-value=4.9e+02  Score=21.81  Aligned_cols=18  Identities=22%  Similarity=0.357  Sum_probs=13.1

Q ss_pred             HhcCCChHHHHHHHHHHh
Q psy4691         158 IANQLSWEDIEARVKQAQ  175 (380)
Q Consensus       158 ~~~~~~W~~I~~~i~~~~  175 (380)
                      +..+|+-++|..+|..-+
T Consensus        52 Ik~~Ms~e~i~~II~nMr   69 (104)
T PRK05423         52 IKPGMSIEEIQGIIANMK   69 (104)
T ss_pred             cCCCCCHHHHHHHHHHHH
Confidence            367888888888875543


No 49 
>KOG0994|consensus
Probab=20.04  E-value=1.6e+03  Score=27.22  Aligned_cols=46  Identities=9%  Similarity=0.102  Sum_probs=29.4

Q ss_pred             HHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHhc-CCChHHHHHH
Q psy4691         125 ALEQTQLVDKEKAELIINNQESVDAAILAIRQDIAN-QLSWEDIEAR  170 (380)
Q Consensus       125 ~le~~~E~~~~kgeLI~aN~~~Ve~~i~~v~~~~~~-~~~W~~I~~~  170 (380)
                      .+++..+.-.....-..++..+++.+|..|+..+.+ +.|=+.|+.+
T Consensus      1451 ~Aq~~~~~a~as~~q~~~s~~el~~Li~~v~~Flt~~~adp~si~~v 1497 (1758)
T KOG0994|consen 1451 SAQRALEQANASRSQMEESNRELRNLIQQVRDFLTQPDADPDSIEEV 1497 (1758)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCHHHHHHH
Confidence            333333444445555667777888888888888754 4566677654


Done!