Query         038921
Match_columns 197
No_of_seqs    128 out of 224
Neff          3.9 
Searched_HMMs 46136
Date          Fri Mar 29 04:35:17 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/038921.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/038921hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF03791 KNOX2:  KNOX2 domain ; 100.0   1E-28 2.2E-33  171.5   6.9   51  138-188     2-52  (52)
  2 PF03790 KNOX1:  KNOX1 domain ;  99.9 4.5E-24 9.7E-29  144.5   4.0   44   87-130     1-44  (45)
  3 KOG0773 Transcription factor M  98.3 1.9E-07 4.1E-12   83.1   1.6  105   87-191    50-164 (342)
  4 PF11348 DUF3150:  Protein of u  65.2     9.7 0.00021   33.8   4.2   46  105-164    61-118 (257)
  5 PF03792 PBC:  PBC domain;  Int  63.0      47   0.001   29.0   7.8   41   87-127    27-78  (191)
  6 PF06295 DUF1043:  Protein of u  61.1      15 0.00032   29.3   4.2   35  152-186    35-69  (128)
  7 KOG0774 Transcription factor P  60.6      65  0.0014   30.0   8.6   79   85-163    27-125 (334)
  8 PF11288 DUF3089:  Protein of u  60.5     3.1 6.7E-05   36.2   0.2   30   85-118   109-138 (207)
  9 COG3105 Uncharacterized protei  54.8      33 0.00073   28.5   5.3   42  146-187    38-79  (138)
 10 KOG4420 Uncharacterized conser  46.4      27 0.00059   32.4   3.9   49  148-196   177-241 (325)
 11 PF05190 MutS_IV:  MutS family   44.8      23 0.00051   24.9   2.7   25  142-166     1-25  (92)
 12 PF13097 CENP-U:  CENP-A nucleo  44.1      69  0.0015   27.6   5.8   42  145-186   104-147 (175)
 13 PRK11677 hypothetical protein;  41.8      87  0.0019   25.7   5.8   42  145-186    32-73  (134)
 14 PF07425 Pardaxin:  Pardaxin;    37.8      22 0.00048   22.7   1.4   19   89-107     5-23  (33)
 15 PF11014 DUF2852:  Protein of u  36.2      12 0.00026   30.2   0.0   35  117-153    78-112 (115)
 16 PF05016 Plasmid_stabil:  Plasm  35.5   1E+02  0.0022   21.1   4.7   22  171-192    24-45  (90)
 17 KOG4460 Nuclear pore complex,   32.8      79  0.0017   32.2   5.0   37  148-187   569-605 (741)
 18 PF15225 IL32:  Interleukin 32   32.2 2.6E+02  0.0056   22.2   7.3   50  118-169     2-62  (104)
 19 PF02290 SRP14:  Signal recogni  30.4      41 0.00088   25.6   2.1   18  143-160    71-88  (93)
 20 PF03401 TctC:  Tripartite tric  29.4 1.1E+02  0.0023   26.8   4.8   18  111-128   211-228 (274)
 21 cd03204 GST_C_GDAP1 GST_C fami  28.2      96  0.0021   24.0   3.9   33  152-184     5-46  (111)
 22 KOG1070 rRNA processing protei  28.1 1.7E+02  0.0038   32.9   6.8   91   86-185  1444-1538(1710)
 23 PF12057 DUF3538:  Domain of un  26.4 1.2E+02  0.0025   24.8   4.1   41  147-187    15-69  (120)
 24 PF13713 BRX_N:  Transcription   26.0      75  0.0016   21.1   2.5   20  170-189     8-27  (39)
 25 KOG4445 Uncharacterized conser  24.2 1.6E+02  0.0034   28.0   5.1   43  146-188   135-179 (368)
 26 KOG2828 Acetyl-CoA hydrolase [  24.1      42 0.00091   32.6   1.4   23   92-114   429-451 (454)
 27 TIGR00270 conserved hypothetic  23.7      97  0.0021   25.6   3.3   43  145-187    61-103 (154)
 28 PF01563 Alpha_E3_glycop:  Alph  22.5      45 0.00098   23.9   1.0   23   89-111    28-50  (56)
 29 COG5661 Predicted secreted Zn-  21.8 2.3E+02   0.005   25.1   5.3   42  142-186   109-150 (210)
 30 PF10112 Halogen_Hydrol:  5-bro  21.3 2.2E+02  0.0048   23.6   5.0   43  145-187   125-173 (199)
 31 PHA03397 vlf-1 very late expre  21.3      67  0.0014   30.3   2.1   29   86-114    12-40  (363)
 32 cd03209 GST_C_Mu GST_C family,  20.9 2.8E+02  0.0061   20.4   5.1   14  171-184    39-52  (121)
 33 PF13865 FoP_duplication:  C-te  20.6      65  0.0014   23.4   1.6    8  144-151    46-53  (74)
 34 PF10152 DUF2360:  Predicted co  20.6 2.9E+02  0.0062   22.5   5.5   42  147-188     1-49  (148)
 35 PF14689 SPOB_a:  Sensor_kinase  20.6   3E+02  0.0065   19.0   5.5   37  151-187     9-57  (62)

No 1  
>PF03791 KNOX2:  KNOX2 domain ;  InterPro: IPR005541 The MEINOX region is comprised of two domains, KNOX1 and KNOX2. KNOX1 plays a role in suppressing target gene expression. KNOX2, essential for function, is thought to be necessary for homo-dimerization [].; GO: 0003677 DNA binding, 0005634 nucleus
Probab=99.95  E-value=1e-28  Score=171.48  Aligned_cols=51  Identities=55%  Similarity=0.935  Sum_probs=49.3

Q ss_pred             CCCCCCCChhhhHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHhhc
Q 038921          138 PTCFGADPELDEFMETYSDMLAKYKSDLARPFDEATTFLNKVELQLRNLCT  188 (197)
Q Consensus       138 ~~~~g~DPELDqFMeaYc~mL~kykEEL~rPf~EA~~F~~~iE~QL~~L~~  188 (197)
                      ++++|+|||||||||+||.||+||||||+|||+||++|||+||+||++|||
T Consensus         2 ~~~~~~dpELDqFMeaYc~~L~kykeeL~~p~~EA~~f~~~ie~qL~~Lt~   52 (52)
T PF03791_consen    2 SSSIGADPELDQFMEAYCDMLVKYKEELQRPFQEAMEFCREIEQQLSSLTG   52 (52)
T ss_pred             CCCCCCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC
Confidence            567899999999999999999999999999999999999999999999996


No 2  
>PF03790 KNOX1:  KNOX1 domain ;  InterPro: IPR005540 The MEINOX region is comprised of two domains, KNOX1 and KNOX2. KNOX1 plays a role in suppressing target gene expression. KNOX2, essential for function, is thought to be necessary for homo-dimerization [].; GO: 0003677 DNA binding, 0005634 nucleus
Probab=99.89  E-value=4.5e-24  Score=144.49  Aligned_cols=44  Identities=70%  Similarity=1.090  Sum_probs=41.5

Q ss_pred             HHHHHHHhhCCChHHHHHHHHhccccCCchhhHhHHHHHHhhhh
Q 038921           87 TVIKAKIASHPCYPRLLEAYIDCQKVGAPPEIAYLLDAIRSEKD  130 (197)
Q Consensus        87 ~~iKAkI~sHPlYp~LL~AyidC~KVGAPpev~~~Ldei~~~~~  130 (197)
                      +.|||||++||+||+||+|||+|||||||||++++||++.++++
T Consensus         1 e~iKA~I~~HP~Y~~Ll~Ayi~C~KVGAP~e~~~~L~e~~~~~~   44 (45)
T PF03790_consen    1 EAIKAKIASHPLYPRLLAAYIDCQKVGAPPEVVARLDEILAESQ   44 (45)
T ss_pred             ChHHHHHHcCCCcHHHHHHHHHHHhcCCCHHHHHHHHHHHHHhc
Confidence            36999999999999999999999999999999999999998764


No 3  
>KOG0773 consensus Transcription factor MEIS1 and related HOX domain proteins [Transcription]
Probab=98.32  E-value=1.9e-07  Score=83.08  Aligned_cols=105  Identities=15%  Similarity=0.099  Sum_probs=86.9

Q ss_pred             HHHHHHHhhCCChHHHHHHHHhccccCCchhhHhHHHHHHhhhhhh--cc-CCC-----CCCCCCCChhhhHHHHHHHHH
Q 038921           87 TVIKAKIASHPCYPRLLEAYIDCQKVGAPPEIAYLLDAIRSEKDLY--RR-DSV-----PTCFGADPELDEFMETYSDML  158 (197)
Q Consensus        87 ~~iKAkI~sHPlYp~LL~AyidC~KVGAPpev~~~Ldei~~~~~~~--~~-~~~-----~~~~g~DPELDqFMeaYc~mL  158 (197)
                      ..+|+.+.+||+|..++.||++|++++.+.+.+.+.++..+.....  ++ +++     ....+.+++|+.||..|+.+|
T Consensus        50 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~s~~~~~~~~~~~~~~~~~k~~~~l  129 (342)
T KOG0773|consen   50 ASSKYLTAAQELLDEFCSAGLDCLKGKMPYDPVPRSPASLSPPEDKGARRGNATRESATLKAWLEEHRLNPYPSKLEKIL  129 (342)
T ss_pred             ccccccccchhHHhHHhhccccccccccCcCccccccccccCccccccccccccccccccccchhhhhhccCchHHHHHH
Confidence            3589999999999999999999999999999998877665544321  11 111     124568999999999999999


Q ss_pred             HHHHHHhhchH--HHHHHHHHHHHHHHHHhhcCCC
Q 038921          159 AKYKSDLARPF--DEATTFLNKVELQLRNLCTTTA  191 (197)
Q Consensus       159 ~kykEEL~rPf--~EA~~F~~~iE~QL~~L~~~~~  191 (197)
                      ..++..|+..+  ++++.+.++++..+...++.+.
T Consensus       130 l~~~~~~~~~~~~~~~~~a~r~~~~~~~~~~~~~~  164 (342)
T KOG0773|consen  130 LAVITKLTLTQVSTWFANARRRLKKELKMTWGPTP  164 (342)
T ss_pred             HHHHHHhhhhhHHHHHHHHHHHHHhccCCCCCCcc
Confidence            99999999998  7999999999999999887653


No 4  
>PF11348 DUF3150:  Protein of unknown function (DUF3150);  InterPro: IPR021496  This bacterial family of proteins with unknown function appears to be restricted to Proteobacteria. 
Probab=65.24  E-value=9.7  Score=33.82  Aligned_cols=46  Identities=28%  Similarity=0.498  Sum_probs=30.5

Q ss_pred             HHHhccccCC--------chhh----HhHHHHHHhhhhhhccCCCCCCCCCCChhhhHHHHHHHHHHHHHHH
Q 038921          105 AYIDCQKVGA--------PPEI----AYLLDAIRSEKDLYRRDSVPTCFGADPELDEFMETYSDMLAKYKSD  164 (197)
Q Consensus       105 AyidC~KVGA--------Ppev----~~~Ldei~~~~~~~~~~~~~~~~g~DPELDqFMeaYc~mL~kykEE  164 (197)
                      |.=-|.|||.        |.+.    ..+|++++.++              +.+.++|+..|=..+..+..+
T Consensus        61 A~r~~~~~G~rFlgG~aVP~~~~~~l~~~L~~i~~eF--------------~~~k~~Fl~~Yd~~i~~w~~~  118 (257)
T PF11348_consen   61 AERLCLKVGVRFLGGYAVPEDKAEELAEELEDIKTEF--------------EQEKQDFLANYDQAIEEWIDR  118 (257)
T ss_pred             HHHHHHHcCCcccceeEcCHHHHHHHHHHHHHHHHHH--------------HHHHHHHHHHHHHHHHHHHHH
Confidence            4456899998        4443    44444555544              345678888888888887777


No 5  
>PF03792 PBC:  PBC domain;  InterPro: IPR005542 Pbx proteins are members of the TALE (three-amino-acid loop extension) family of atypical homeodomain proteins, whose members are characterised by a three-residue insertion in the first helix of the homeodomain involved in their interaction with Hox proteins. Examination of Pbx1 has shown that, in addition to the homeodomain, a short 16-residue C-terminal tail is essential for maximal cooperative interactions with Hox partners as well as for maximal monomeric binding of Pbx1 to DNA.  The PBX domain is a bipartite acidic domain [].; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0005634 nucleus
Probab=62.99  E-value=47  Score=29.02  Aligned_cols=41  Identities=22%  Similarity=0.306  Sum_probs=31.2

Q ss_pred             HHHHHHHhhCCChHHHHHHHHhcc-ccCC---------chh-hHhHHHHHHh
Q 038921           87 TVIKAKIASHPCYPRLLEAYIDCQ-KVGA---------PPE-IAYLLDAIRS  127 (197)
Q Consensus        87 ~~iKAkI~sHPlYp~LL~AyidC~-KVGA---------Ppe-v~~~Ldei~~  127 (197)
                      ...|-.|-+||+||.|-+..++-. |++-         ||| ++.|||.+-.
T Consensus        27 qa~K~~l~~hr~k~ALfsVLcE~KEkt~LSir~~qee~p~dpQl~RLDNML~   78 (191)
T PF03792_consen   27 QARKHALNCHRMKPALFSVLCEIKEKTVLSIRNIQEEDPPDPQLMRLDNMLL   78 (191)
T ss_pred             HHhchhhcCCCCchhhHHHHHHHHhhcCccccccCCcCCCchhhhhhhcchh
Confidence            789999999999999999987765 2221         444 7788887644


No 6  
>PF06295 DUF1043:  Protein of unknown function (DUF1043);  InterPro: IPR009386 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=61.09  E-value=15  Score=29.29  Aligned_cols=35  Identities=20%  Similarity=0.436  Sum_probs=29.1

Q ss_pred             HHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHh
Q 038921          152 ETYSDMLAKYKSDLARPFDEATTFLNKVELQLRNL  186 (197)
Q Consensus       152 eaYc~mL~kykEEL~rPf~EA~~F~~~iE~QL~~L  186 (197)
                      +.=-.-|..||.++..+|..+...+++|...-+.|
T Consensus        35 ~~~k~el~~yk~~V~~HF~~ta~Ll~~l~~~Y~~l   69 (128)
T PF06295_consen   35 EQAKQELEQYKQEVNDHFAQTAELLDNLTQDYQKL   69 (128)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            34455689999999999999999999998877665


No 7  
>KOG0774 consensus Transcription factor PBX and related HOX domain proteins [Transcription]
Probab=60.60  E-value=65  Score=30.01  Aligned_cols=79  Identities=20%  Similarity=0.329  Sum_probs=46.7

Q ss_pred             hhHHHHHHHhhCCChHHHHHHHHhcc-ccC---------Cchh-hHhHHHHHHhhhhhhccC----CCCCCCCCC-----
Q 038921           85 VSTVIKAKIASHPCYPRLLEAYIDCQ-KVG---------APPE-IAYLLDAIRSEKDLYRRD----SVPTCFGAD-----  144 (197)
Q Consensus        85 ~s~~iKAkI~sHPlYp~LL~AyidC~-KVG---------APpe-v~~~Ldei~~~~~~~~~~----~~~~~~g~D-----  144 (197)
                      +..+.|-.|-+||+||.|.+--++-. |++         .||| ++-|||.+.-.-.+..+.    ......|.|     
T Consensus        27 eaqa~K~~lnch~mk~AlfsVLcE~KeKt~lsir~~qdeep~dpqlmRLDnML~AEGVagPekgga~~~~Asgg~hsdYR  106 (334)
T KOG0774|consen   27 EAQARKHALNCHRMKPALFSVLCEIKEKTVLSIRGMQDEEPPDPQLMRLDNMLLAEGVAGPEKGGARAAAASGGDHSDYR  106 (334)
T ss_pred             hHHhhhhccccccchHHHHHHHHHhhhhheeeeccccccCCCChHHHHHHHHHHHhcccCccccchhhhhccCCChHHHH
Confidence            34589999999999999998776543 222         3777 899999864432222211    001111222     


Q ss_pred             ChhhhHHHHHHHHHHHHHH
Q 038921          145 PELDEFMETYSDMLAKYKS  163 (197)
Q Consensus       145 PELDqFMeaYc~mL~kykE  163 (197)
                      ..|-|--.-|-..|.||.+
T Consensus       107 ~kL~qiR~iy~~Elekyeq  125 (334)
T KOG0774|consen  107 AKLLQIRQIYHNELEKYEQ  125 (334)
T ss_pred             HHHHHHHHHHHHHHHHHHH
Confidence            3455555666666666654


No 8  
>PF11288 DUF3089:  Protein of unknown function (DUF3089);  InterPro: IPR021440  This family of proteins has no known function. 
Probab=60.46  E-value=3.1  Score=36.18  Aligned_cols=30  Identities=33%  Similarity=0.658  Sum_probs=27.1

Q ss_pred             hhHHHHHHHhhCCChHHHHHHHHhccccCCchhh
Q 038921           85 VSTVIKAKIASHPCYPRLLEAYIDCQKVGAPPEI  118 (197)
Q Consensus        85 ~s~~iKAkI~sHPlYp~LL~AyidC~KVGAPpev  118 (197)
                      ....||.+|..+|+..+||+||+    ||.|.-+
T Consensus       109 l~~LL~e~~~~~pl~~rLVAAYl----iG~~v~~  138 (207)
T PF11288_consen  109 LLRLLKEEIAGDPLRKRLVAAYL----IGYPVTV  138 (207)
T ss_pred             HHHHHHHHhcCchHHhhhheeee----cCccccH
Confidence            36789999999999999999999    9998765


No 9  
>COG3105 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=54.78  E-value=33  Score=28.54  Aligned_cols=42  Identities=19%  Similarity=0.400  Sum_probs=34.4

Q ss_pred             hhhhHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHhh
Q 038921          146 ELDEFMETYSDMLAKYKSDLARPFDEATTFLNKVELQLRNLC  187 (197)
Q Consensus       146 ELDqFMeaYc~mL~kykEEL~rPf~EA~~F~~~iE~QL~~L~  187 (197)
                      -|-.=|+.--.-|..||+||.+||.+..+.+++|-+.-+.|-
T Consensus        38 ~~q~ELe~~K~~ld~~rqel~~HFa~sAeLlktl~~dYqkly   79 (138)
T COG3105          38 KLQYELEKVKAQLDEYRQELVKHFARSAELLKTLAQDYQKLY   79 (138)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            344446777788999999999999999999999988777663


No 10 
>KOG4420 consensus Uncharacterized conserved protein (Ganglioside-induced differentiation associated protein 1, GDAP1) [Function unknown]
Probab=46.36  E-value=27  Score=32.44  Aligned_cols=49  Identities=27%  Similarity=0.411  Sum_probs=38.8

Q ss_pred             hhHHHHHHHHHHHHHHHhhc---------hHHHHHHHHHHHHHHHHH-------hhcCCCccccC
Q 038921          148 DEFMETYSDMLAKYKSDLAR---------PFDEATTFLNKVELQLRN-------LCTTTASVRSL  196 (197)
Q Consensus       148 DqFMeaYc~mL~kykEEL~r---------Pf~EA~~F~~~iE~QL~~-------L~~~~~s~~~~  196 (197)
                      ++--++|...+.|....|-+         -.+|-.+|+..+|+.|..       |||...|+..|
T Consensus       177 pdla~ay~akqkkl~~kl~~hdd~s~lkkild~l~~~Ld~VEteLe~r~~~~~wL~G~efslADV  241 (325)
T KOG4420|consen  177 PDLAEAYLAKQKKLMAKLLEHDDVSYLKKILDELAMVLDQVETELEKRKLCELWLCGCEFSLADV  241 (325)
T ss_pred             chhhHHHHHHHHHHHHHHHhcccHHHHHHHHHHHHHHHHHHHHHHhhccccceeeccccchHHHH
Confidence            44458999888888766554         468999999999999998       78888776554


No 11 
>PF05190 MutS_IV:  MutS family domain IV C-terminus.;  InterPro: IPR007861 Mismatch repair contributes to the overall fidelity of DNA replication and is essential for combating the adverse effects of damage to the genome. It involves the correction of mismatched base pairs that have been missed by the proofreading element of the DNA polymerase complex. The post-replicative Mismatch Repair System (MMRS) of Escherichia coli involves MutS (Mutator S), MutL and MutH proteins, and acts to correct point mutations or small insertion/deletion loops produced during DNA replication []. MutS and MutL are involved in preventing recombination between partially homologous DNA sequences. The assembly of MMRS is initiated by MutS, which recognises and binds to mispaired nucleotides and allows further action of MutL and MutH to eliminate a portion of newly synthesized DNA strand containing the mispaired base []. MutS can also collaborate with methyltransferases in the repair of O(6)-methylguanine damage, which would otherwise pair with thymine during replication to create an O(6)mG:T mismatch []. MutS exists as a dimer, where the two monomers have different conformations and form a heterodimer at the structural level []. Only one monomer recognises the mismatch specifically and has ADP bound. Non-specific major groove DNA-binding domains from both monomers embrace the DNA in a clamp-like structure. Mismatch binding induces ATP uptake and a conformational change in the MutS protein, resulting in a clamp that translocates on DNA.  MutS is a modular protein with a complex structure [], and is composed of:   N-terminal mismatch-recognition domain, which is similar in structure to tRNA endonuclease. Connector domain, which is similar in structure to Holliday junction resolvase ruvC. Core domain, which is composed of two separate subdomains that join together to form a helical bundle; from within the core domain, two helices act as levers that extend towards (but do not touch) the DNA. Clamp domain, which is inserted between the two subdomains of the core domain at the top of the lever helices; the clamp domain has a beta-sheet structure. ATPase domain (connected to the core domain), which has a classical Walker A motif. HTH (helix-turn-helix) domain, which is involved in dimer contacts.   The MutS family of proteins is named after the Salmonella typhimurium MutS protein involved in mismatch repair. Homologues of MutS have been found in many species including eukaryotes (MSH 1, 2, 3, 4, 5, and 6 proteins), archaea and bacteria, and together these proteins have been grouped into the MutS family. Although many of these proteins have similar activities to the E. coli MutS, there is significant diversity of function among the MutS family members. Human MSH has been implicated in non-polyposis colorectal carcinoma (HNPCC) and is a mismatch binding protein [].This diversity is even seen within species, where many species encode multiple MutS homologues with distinct functions []. Inter-species homologues may have arisen through frequent ancient horizontal gene transfer of MutS (and MutL) from bacteria to archaea and eukaryotes via endosymbiotic ancestors of mitochondria and chloroplasts [].  This entry represents the clamp domain (domain 4) found in proteins of the MutS family. The clamp domain is inserted within the core domain at the top of the lever helices. It has a beta-sheet structure [].; GO: 0005524 ATP binding, 0030983 mismatched DNA binding, 0006298 mismatch repair; PDB: 2WTU_A 1OH7_A 1OH5_B 1W7A_B 1NG9_A 1OH8_B 1WBD_A 1WB9_A 3K0S_A 1OH6_A ....
Probab=44.83  E-value=23  Score=24.87  Aligned_cols=25  Identities=36%  Similarity=0.550  Sum_probs=18.5

Q ss_pred             CCCChhhhHHHHHHHHHHHHHHHhh
Q 038921          142 GADPELDEFMETYSDMLAKYKSDLA  166 (197)
Q Consensus       142 g~DPELDqFMeaYc~mL~kykEEL~  166 (197)
                      |-|++||+-.+.|.++.....+.+.
T Consensus         1 g~d~~Ld~~~~~~~~~~~~l~~~~~   25 (92)
T PF05190_consen    1 GFDEELDELREEYEEIEEELEELLE   25 (92)
T ss_dssp             TSSHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             CCCHHHHHHHHHHHHHHHHHHHHHH
Confidence            4689999999998887666554443


No 12 
>PF13097 CENP-U:  CENP-A nucleosome associated complex (NAC) subunit
Probab=44.12  E-value=69  Score=27.59  Aligned_cols=42  Identities=21%  Similarity=0.405  Sum_probs=36.2

Q ss_pred             ChhhhHHHHHHHHHHHHHHHhhchH-HHHH-HHHHHHHHHHHHh
Q 038921          145 PELDEFMETYSDMLAKYKSDLARPF-DEAT-TFLNKVELQLRNL  186 (197)
Q Consensus       145 PELDqFMeaYc~mL~kykEEL~rPf-~EA~-~F~~~iE~QL~~L  186 (197)
                      -|||=...++-.++.+||+.+.-.+ .+|+ .|+..+..||-.+
T Consensus       104 tELDVvL~~FEk~~~eYkq~ieS~~cr~AI~~F~~~~keqL~~~  147 (175)
T PF13097_consen  104 TELDVVLSAFEKTALEYKQSIESKICRKAINKFYSNFKEQLIEM  147 (175)
T ss_pred             hHHHHHHHHHHHHHHHHHHhhccHHHHHHHHHHHHHHHHHHHHH
Confidence            4999999999999999999998876 6664 6999999888654


No 13 
>PRK11677 hypothetical protein; Provisional
Probab=41.79  E-value=87  Score=25.67  Aligned_cols=42  Identities=21%  Similarity=0.392  Sum_probs=32.4

Q ss_pred             ChhhhHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHh
Q 038921          145 PELDEFMETYSDMLAKYKSDLARPFDEATTFLNKVELQLRNL  186 (197)
Q Consensus       145 PELDqFMeaYc~mL~kykEEL~rPf~EA~~F~~~iE~QL~~L  186 (197)
                      .+|.+=.+.=-.-|..||+|+..||.++..-+++|-.+-+.|
T Consensus        32 ~~le~eLe~~k~ele~YkqeV~~HFa~TA~Ll~~L~~~Y~~L   73 (134)
T PRK11677         32 QALQYELEKNKAELEEYRQELVSHFARSAELLDTMAKDYRQL   73 (134)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            344444455556799999999999999999999888776655


No 14 
>PF07425 Pardaxin:  Pardaxin;  InterPro: IPR009990 This family consists of several Pardaxin proteins. Pardaxin, a 33-amino-acid pore-forming polypeptide toxin isolated from the Red Sea Moses sole Pardachirus marmoratus, has a helix-hinge-helix structure. This is a common structural motif found both in antibacterial peptides that can act selectively on bacterial membranes (e.g., cecropin), and in cytotoxic peptides that can lyse both mammalian and bacterial cells (e.g., melittin). Pardaxin possesses a high antibacterial activity with a significantly reduced haemolytic activity towards human red blood cells compared with melittin []. Pardaxin has also been found to have a shark repellent action [].; GO: 0005576 extracellular region; PDB: 1XC0_A 2KNS_A.
Probab=37.78  E-value=22  Score=22.67  Aligned_cols=19  Identities=37%  Similarity=0.436  Sum_probs=15.7

Q ss_pred             HHHHHhhCCChHHHHHHHH
Q 038921           89 IKAKIASHPCYPRLLEAYI  107 (197)
Q Consensus        89 iKAkI~sHPlYp~LL~Ayi  107 (197)
                      +--||++-|++..||+|--
T Consensus         5 lipkiissplfktllsavg   23 (33)
T PF07425_consen    5 LIPKIISSPLFKTLLSAVG   23 (33)
T ss_dssp             CHHHHCCTTTCHHHHHHHH
T ss_pred             hhhHHHccHHHHHHHHHHH
Confidence            3468999999999998843


No 15 
>PF11014 DUF2852:  Protein of unknown function (DUF2852);  InterPro: IPR021273  This bacterial family of proteins has no known function. 
Probab=36.20  E-value=12  Score=30.19  Aligned_cols=35  Identities=17%  Similarity=0.185  Sum_probs=21.2

Q ss_pred             hhHhHHHHHHhhhhhhccCCCCCCCCCCChhhhHHHH
Q 038921          117 EIAYLLDAIRSEKDLYRRDSVPTCFGADPELDEFMET  153 (197)
Q Consensus       117 ev~~~Ldei~~~~~~~~~~~~~~~~g~DPELDqFMea  153 (197)
                      +.+.+||+.+++++.+-..+  ..-.+..|.|+||..
T Consensus        78 ~tL~RLEeEq~eF~~Fl~rL--R~AKDk~EFD~FM~~  112 (115)
T PF11014_consen   78 DTLRRLEEEQREFEDFLERL--RRAKDKEEFDQFMAE  112 (115)
T ss_pred             HHHHHHHHHHHHHHHHHHHH--HHhhhHHHHHHHHHh
Confidence            46778888888775431110  001245799999974


No 16 
>PF05016 Plasmid_stabil:  Plasmid stabilisation system protein;  InterPro: IPR007712 Members of this family are involved in plasmid stabilisation. The exact molecular function of this protein is not known.; PDB: 3KXE_A 1WMI_A 3KIQ_y 3KIX_y 2KC9_A 3KHA_B 3KIS_y 3KIU_y 2KC8_A 1Z8M_A ....
Probab=35.49  E-value=1e+02  Score=21.07  Aligned_cols=22  Identities=18%  Similarity=0.377  Sum_probs=18.9

Q ss_pred             HHHHHHHHHHHHHHHhhcCCCc
Q 038921          171 EATTFLNKVELQLRNLCTTTAS  192 (197)
Q Consensus       171 EA~~F~~~iE~QL~~L~~~~~s  192 (197)
                      -|..|...|+.++..|...+..
T Consensus        24 ~a~~~~~~i~~~i~~l~~~P~~   45 (90)
T PF05016_consen   24 AAERFIDAIEKAIERLSENPEM   45 (90)
T ss_dssp             HHHHHHHHHHHHHHHHHCCCCT
T ss_pred             HHHHHHHHHHHHHHhccccccc
Confidence            4999999999999999876643


No 17 
>KOG4460 consensus Nuclear pore complex, Nup88/rNup84 component [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=32.80  E-value=79  Score=32.24  Aligned_cols=37  Identities=22%  Similarity=0.278  Sum_probs=31.0

Q ss_pred             hhHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHhh
Q 038921          148 DEFMETYSDMLAKYKSDLARPFDEATTFLNKVELQLRNLC  187 (197)
Q Consensus       148 DqFMeaYc~mL~kykEEL~rPf~EA~~F~~~iE~QL~~L~  187 (197)
                      .-|-|.|..-=.+-|+|++|++   ...|+.+++||+.|.
T Consensus       569 ~vfrEqYi~~~dlV~~e~qrH~---~~l~~~k~~QlQ~l~  605 (741)
T KOG4460|consen  569 QVFREQYILKQDLVKEEIQRHV---KLLCDQKKKQLQDLS  605 (741)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHH
Confidence            4588888888888899999986   467899999999885


No 18 
>PF15225 IL32:  Interleukin 32
Probab=32.22  E-value=2.6e+02  Score=22.19  Aligned_cols=50  Identities=22%  Similarity=0.248  Sum_probs=31.8

Q ss_pred             hHhHHHHHHhhhhhh-ccCCCCCCCCCCChh----------hhHHHHHHHHHHHHHHHhhchH
Q 038921          118 IAYLLDAIRSEKDLY-RRDSVPTCFGADPEL----------DEFMETYSDMLAKYKSDLARPF  169 (197)
Q Consensus       118 v~~~Ldei~~~~~~~-~~~~~~~~~g~DPEL----------DqFMeaYc~mL~kykEEL~rPf  169 (197)
                      ..++|++.+++...+ +|+.  .--.+||++          |.-|+-+|.||+....-+++-+
T Consensus         2 l~PLLeeeRd~LR~RgnRsp--vp~~EDp~te~~eP~Esf~dkvmR~FqamlqrLQ~ww~~vl   62 (104)
T PF15225_consen    2 LTPLLEEERDELRCRGNRSP--VPDVEDPATEVWEPGESFCDKVMRWFQAMLQRLQTWWQAVL   62 (104)
T ss_pred             CccccHHHHHHHHhccCCCC--CCCCCCcccccCCcchhHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            456788888775432 2221  111245544          6789999999999887777664


No 19 
>PF02290 SRP14:  Signal recognition particle 14kD protein;  InterPro: IPR003210  The signal recognition particle (SRP) is a multimeric protein, which along with its conjugate receptor (SR), is involved in targeting secretory proteins to the rough endoplasmic reticulum (RER) membrane in eukaryotes, or to the plasma membrane in prokaryotes [, ]. SRP recognises the signal sequence of the nascent polypeptide on the ribosome, retards its elongation, and docks the SRP-ribosome-polypeptide complex to the RER membrane via the SR receptor. Eukaryotic SRP consists of six polypeptides (SRP9, SRP14, SRP19, SRP54, SRP68 and SRP72) and a single 300 nucleotide 7S RNA molecule. The RNA component catalyses the interaction of SRP with its SR receptor []. In higher eukaryotes, the SRP complex consists of the Alu domain and the S domain linked by the SRP RNA. The Alu domain consists of a heterodimer of SRP9 and SRP14 bound to the 5' and 3' terminal sequences of SRP RNA. This domain is necessary for retarding the elongation of the nascent polypeptide chain, which gives SRP time to dock the ribosome-polypeptide complex to the RER membrane. In archaea, the SRP complex contains 7S RNA like its eukaryotic counterpart, yet only includes two of the six protein subunits found in the eukarytic complex: SRP19 and SRP54 []. This entry represents the 14 kDa SRP14 component. Both SRP9 and SRP14 have the same (beta)-alpha-beta(3)-alpha fold. The heterodimer has pseudo two-fold symmetry and is saddle-like, consisting of a curved six-stranded beta-sheet that has four helices packed on the convex side and an exposed concave surface lined with positively charged residues. The SRP9/SRP14 heterodimer is essential for SRP RNA binding, mediating the pausing of synthesis of ribosome associated nascent polypeptides that have been engaged by the targeting domain of SRP [].; GO: 0008312 7S RNA binding, 0030942 endoplasmic reticulum signal peptide binding, 0006614 SRP-dependent cotranslational protein targeting to membrane, 0005786 signal recognition particle, endoplasmic reticulum targeting; PDB: 1914_A 1RY1_D 1E8O_B 2W9J_B.
Probab=30.45  E-value=41  Score=25.57  Aligned_cols=18  Identities=28%  Similarity=0.667  Sum_probs=15.1

Q ss_pred             CCChhhhHHHHHHHHHHH
Q 038921          143 ADPELDEFMETYSDMLAK  160 (197)
Q Consensus       143 ~DPELDqFMeaYc~mL~k  160 (197)
                      .-.+|++|..+|+.+|..
T Consensus        71 ~~~~l~~F~~~Y~~v~K~   88 (93)
T PF02290_consen   71 DPDDLDKFWQSYANVLKA   88 (93)
T ss_dssp             ETTCHHHHHHHHHHHHHH
T ss_pred             CHHHHHHHHHHHHHHHHh
Confidence            346999999999999863


No 20 
>PF03401 TctC:  Tripartite tricarboxylate transporter family receptor;  InterPro: IPR005064  Bordetella pertussis, the causative agent of human whooping cough (pertussis), is an obligate human pathogen with diverse high-affinity transport systems for the assimilation of iron, a biometal that is essential for growth []. Periplasmic binding proteins of a new family, particularly well represented in this organism (and more generally in beta-proteobacteria), have been called Bug receptors []. They adopt a characteristic Venus flytrap fold with two globular domains bisected by a ligand-binding cleft. The family is specific for carboxylated solutes, with a characteristic mode of binding involving two highly conserved beta strand-beta turn-alpha helix motifs originating from each domain. These two motifs form hydrogen bonds with a carboxylate group of the ligand, both directly and via conserved water molecules, and have thus been termed the carboxylate pincers. Domain 1 recognises the ligand and the carboxylate group serves as an initial anchoring point. Domain 2 discriminates between productively and non-productively bound ligands as proper interactions with this domain is needed for the of the closed conformation []. BugE has a glutamate bound ligand. No charged residues are involved in glutamate binding by BugE, unlike what has been described for all glutamate receptors reported so far. The Bug architecture is highly conserved despite limited sequence identity [].; GO: 0030288 outer membrane-bounded periplasmic space; PDB: 2QPQ_C 2DVZ_A 2F5X_A.
Probab=29.40  E-value=1.1e+02  Score=26.80  Aligned_cols=18  Identities=28%  Similarity=0.300  Sum_probs=13.1

Q ss_pred             ccCCchhhHhHHHHHHhh
Q 038921          111 KVGAPPEIAYLLDAIRSE  128 (197)
Q Consensus       111 KVGAPpev~~~Ldei~~~  128 (197)
                      +-|+|+|++.+|.++-++
T Consensus       211 p~gtp~~~~~~l~~a~~~  228 (274)
T PF03401_consen  211 PKGTPDEIVDKLADAIKK  228 (274)
T ss_dssp             ETTS-HHHHHHHHHHHHH
T ss_pred             CCCCCHHHHHHHHHHHHH
Confidence            348999999999876553


No 21 
>cd03204 GST_C_GDAP1 GST_C family, Ganglioside-induced differentiation-associated protein 1 (GDAP1) subfamily; GDAP1 was originally identified as a highly expressed gene at the differentiated stage of GD3 synthase-transfected cells. More recently, mutations in GDAP1 have been reported to cause both axonal and demyelinating autosomal-recessive Charcot-Marie-Tooth (CMT) type 4A neuropathy. CMT is characterized by slow and progressive weakness and atrophy of muscles. Sequence analysis of GDAP1 shows similarities and differences with GSTs; it appears to contain both N-terminal thioredoxin-fold and C-terminal alpha helical domains of GSTs, however, it also contains additional C-terminal transmembrane domains unlike GSTs. GDAP1 is mainly expressed in neuronal cells and is localized in the mitochondria through its transmembrane domains. It does not exhibit GST activity using standard substrates.
Probab=28.21  E-value=96  Score=24.03  Aligned_cols=33  Identities=33%  Similarity=0.426  Sum_probs=23.7

Q ss_pred             HHHHHHHHHHHHHhhch---------HHHHHHHHHHHHHHHH
Q 038921          152 ETYSDMLAKYKSDLARP---------FDEATTFLNKVELQLR  184 (197)
Q Consensus       152 eaYc~mL~kykEEL~rP---------f~EA~~F~~~iE~QL~  184 (197)
                      ++|..-..|.|.+|..|         +.+....+..||++|.
T Consensus         5 ~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~l~~LE~~L~   46 (111)
T cd03204           5 TAYIAKQKKLKSKLLDHDNVEYLKKILDELEMVLDQVEQELQ   46 (111)
T ss_pred             HHHHHHHHHHHHHHHhcccHHHHHHHHHHHHHHHHHHHHHHH
Confidence            56666666777666555         4677778888899886


No 22 
>KOG1070 consensus rRNA processing protein Rrp5 [RNA processing and modification]
Probab=28.07  E-value=1.7e+02  Score=32.88  Aligned_cols=91  Identities=18%  Similarity=0.227  Sum_probs=64.8

Q ss_pred             hHHHHHHHhhCCChHHHHHHHHhccccCCchhhHhHHHHHHhhhhhhccCCCCCCCCCCChhhhHHHHHHHHHHHHH--H
Q 038921           86 STVIKAKIASHPCYPRLLEAYIDCQKVGAPPEIAYLLDAIRSEKDLYRRDSVPTCFGADPELDEFMETYSDMLAKYK--S  163 (197)
Q Consensus        86 s~~iKAkI~sHPlYp~LL~AyidC~KVGAPpev~~~Ldei~~~~~~~~~~~~~~~~g~DPELDqFMeaYc~mL~kyk--E  163 (197)
                      .+.+-..|.+||.=.-+.--||.-+=      .+..|+++|.-.   .|.....-+.+..|++.---||.++...|.  +
T Consensus      1444 aeDferlvrssPNSSi~WI~YMaf~L------elsEiekAR~ia---erAL~tIN~REeeEKLNiWiA~lNlEn~yG~ee 1514 (1710)
T KOG1070|consen 1444 AEDFERLVRSSPNSSILWIRYMAFHL------ELSEIEKARKIA---ERALKTINFREEEEKLNIWIAYLNLENAYGTEE 1514 (1710)
T ss_pred             HHHHHHHHhcCCCcchHHHHHHHHHh------hhhhhHHHHHHH---HHHhhhCCcchhHHHHHHHHHHHhHHHhhCcHH
Confidence            44566789999999999988986543      233444433332   222223345678899999999999999997  8


Q ss_pred             HhhchHHHHHHHHH--HHHHHHHH
Q 038921          164 DLARPFDEATTFLN--KVELQLRN  185 (197)
Q Consensus       164 EL~rPf~EA~~F~~--~iE~QL~~  185 (197)
                      .|++-|..|-.+|+  .+...|..
T Consensus      1515 sl~kVFeRAcqycd~~~V~~~L~~ 1538 (1710)
T KOG1070|consen 1515 SLKKVFERACQYCDAYTVHLKLLG 1538 (1710)
T ss_pred             HHHHHHHHHHHhcchHHHHHHHHH
Confidence            89999999999998  44444443


No 23 
>PF12057 DUF3538:  Domain of unknown function (DUF3538);  InterPro: IPR021925  This presumed domain is functionally uncharacterised. This domain is found in eukaryotes. This domain is about 120 amino acids in length. This domain is found associated with PF00240 from PFAM. This domain has a conserved SDL sequence motif. 
Probab=26.36  E-value=1.2e+02  Score=24.76  Aligned_cols=41  Identities=32%  Similarity=0.599  Sum_probs=30.1

Q ss_pred             hhhHHHHHHHHHH-----HH------HHHhhchH---HHHHHHHHHHHHHHHHhh
Q 038921          147 LDEFMETYSDMLA-----KY------KSDLARPF---DEATTFLNKVELQLRNLC  187 (197)
Q Consensus       147 LDqFMeaYc~mL~-----ky------kEEL~rPf---~EA~~F~~~iE~QL~~L~  187 (197)
                      |-=||+.|+++|.     .|      +|+=+|-|   -||..++..-..-|++|-
T Consensus        15 L~Pflery~~iL~~~~~~e~~~~~~~re~~QRi~d~VsEalh~lsHA~hAlSDL~   69 (120)
T PF12057_consen   15 LQPFLERYHEILQEDPSFEYENNTQEREEDQRIFDLVSEALHYLSHAYHALSDLM   69 (120)
T ss_pred             HhHHHHHHHHHHhcCCCCCCCCCccchhhHhHHHHHHHHHHHHHHHHHHHHHHHh
Confidence            3448888998886     34      55556665   588888888888888874


No 24 
>PF13713 BRX_N:  Transcription factor BRX N-terminal domain
Probab=25.99  E-value=75  Score=21.08  Aligned_cols=20  Identities=15%  Similarity=0.370  Sum_probs=17.0

Q ss_pred             HHHHHHHHHHHHHHHHhhcC
Q 038921          170 DEATTFLNKVELQLRNLCTT  189 (197)
Q Consensus       170 ~EA~~F~~~iE~QL~~L~~~  189 (197)
                      .-|.+|++.|..||++++..
T Consensus         8 kaaKe~IKsLt~QlK~maek   27 (39)
T PF13713_consen    8 KAAKEVIKSLTAQLKDMAEK   27 (39)
T ss_pred             HHHHHHHHHHHHHHHHHHHh
Confidence            46889999999999999753


No 25 
>KOG4445 consensus Uncharacterized conserved protein, contains RWD domain [Function unknown]
Probab=24.24  E-value=1.6e+02  Score=28.03  Aligned_cols=43  Identities=16%  Similarity=0.268  Sum_probs=35.2

Q ss_pred             hhhhHHHHHHH--HHHHHHHHhhchHHHHHHHHHHHHHHHHHhhc
Q 038921          146 ELDEFMETYSD--MLAKYKSDLARPFDEATTFLNKVELQLRNLCT  188 (197)
Q Consensus       146 ELDqFMeaYc~--mL~kykEEL~rPf~EA~~F~~~iE~QL~~L~~  188 (197)
                      +-|.||--||.  -|+.....|.+-|++|-.-...+-.|.+.+|.
T Consensus       135 ~C~Hy~H~~ClaRyl~~~~~~lrqe~q~~~~~~qh~~~~~eavcp  179 (368)
T KOG4445|consen  135 ACDHYMHFACLARYLTECLTGLRQEIQDAQKERQHMKEQVEAVCP  179 (368)
T ss_pred             hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHhhhhh
Confidence            67999999995  36666666777788899989999999999974


No 26 
>KOG2828 consensus Acetyl-CoA hydrolase [Energy production and conversion]
Probab=24.12  E-value=42  Score=32.60  Aligned_cols=23  Identities=35%  Similarity=0.343  Sum_probs=20.4

Q ss_pred             HHhhCCChHHHHHHHHhccccCC
Q 038921           92 KIASHPCYPRLLEAYIDCQKVGA  114 (197)
Q Consensus        92 kI~sHPlYp~LL~AyidC~KVGA  114 (197)
                      +|..||.|..|+.+|.|-.||=.
T Consensus       429 ~i~~p~dre~L~k~afdr~kvmp  451 (454)
T KOG2828|consen  429 QICAPPDREALLKAAFDRAKVMP  451 (454)
T ss_pred             HhhCCchHHHHHHHHHHHHhccc
Confidence            68999999999999999888743


No 27 
>TIGR00270 conserved hypothetical protein TIGR00270.
Probab=23.69  E-value=97  Score=25.60  Aligned_cols=43  Identities=9%  Similarity=0.219  Sum_probs=30.9

Q ss_pred             ChhhhHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHhh
Q 038921          145 PELDEFMETYSDMLAKYKSDLARPFDEATTFLNKVELQLRNLC  187 (197)
Q Consensus       145 PELDqFMeaYc~mL~kykEEL~rPf~EA~~F~~~iE~QL~~L~  187 (197)
                      ++++++++.|-..|.++++++.-..+|-..-+.-=.+.++.+-
T Consensus        61 d~~~~l~~~~g~~Ir~~Re~~glSqeeLA~~lgvs~s~IsriE  103 (154)
T TIGR00270        61 DTTEELVEDYGIIIRREREKRGWSQEQLAKKIQEKESLIKKIE  103 (154)
T ss_pred             chHHHHHHHHHHHHHHHHHHcCCCHHHHHHHhCCCHHHHHHHH
Confidence            4788999999999999999887776665554444444444443


No 28 
>PF01563 Alpha_E3_glycop:  Alphavirus E3 glycoprotein;  InterPro: IPR002533 Alphaviruses are enveloped RNA viruses that use arthropods such as mosquitoes for transmission to their vertebrate hosts, and include Semliki Forest and Sindbis viruses []. Alphaviruses consist of three structural proteins: the core nucleocapsid protein C, and the envelope proteins P62 and E1 (IPR002548 from INTERPRO) that associate as a heterodimer. The viral membrane-anchored surface glycoproteins are responsible for receptor recognition and entry into target cells through membrane fusion. The proteolytic maturation of P62 into E2 (IPR000936 from INTERPRO) and E3 causes a change in the viral surface. Together the E1, E2, and sometimes E3 glycoprotein "spikes" form an E1/E2 dimer or an E1/E2/E3 trimer, where E2 extends from the centre to the vertices, E1 fills the space between the vertices, and E3, if present, is at the distal end of the spike [, ]. Upon exposure of the virus to the acidity of the endosome, E1 dissociates from E2 to form an E1 homotrimer, which is necessary for the fusion step to drive the cellular and viral membranes together []. This entry represents the alphaviral E3 glycoprotein. Most alphaviruses lose the peripheral protein E3, but in Semliki viruses it remains associated with the viral surface.; GO: 0004252 serine-type endopeptidase activity, 0019028 viral capsid, 0055036 virion membrane; PDB: 3J0C_H 3J0G_N 3N40_P 3N41_A 3N44_A 3N42_A 3N43_A.
Probab=22.53  E-value=45  Score=23.95  Aligned_cols=23  Identities=30%  Similarity=0.567  Sum_probs=17.0

Q ss_pred             HHHHHhhCCChHHHHHHHHhccc
Q 038921           89 IKAKIASHPCYPRLLEAYIDCQK  111 (197)
Q Consensus        89 iKAkI~sHPlYp~LL~AyidC~K  111 (197)
                      |=..=+.||-|-.||+|-+.|.-
T Consensus        28 mLe~Nvd~p~Y~~LL~a~l~C~~   50 (56)
T PF01563_consen   28 MLEDNVDNPGYDDLLEAVLRCPS   50 (56)
T ss_dssp             HHHCTSSSTTHHHHHHHHCC--S
T ss_pred             HHHHhCCCccHHHHHHHHhhCCC
Confidence            44445789999999999999964


No 29 
>COG5661 Predicted secreted Zn-dependent protease [Posttranslational modification, protein turnover, chaperones]
Probab=21.77  E-value=2.3e+02  Score=25.14  Aligned_cols=42  Identities=26%  Similarity=0.260  Sum_probs=35.8

Q ss_pred             CCCChhhhHHHHHHHHHHHHHHHhhchHHHHHHHHHHHHHHHHHh
Q 038921          142 GADPELDEFMETYSDMLAKYKSDLARPFDEATTFLNKVELQLRNL  186 (197)
Q Consensus       142 g~DPELDqFMeaYc~mL~kykEEL~rPf~EA~~F~~~iE~QL~~L  186 (197)
                      ++||||-.|-.+|-.-+.+-+   +++.+=|..--++||.+|.+|
T Consensus       109 ~a~pElalvwdtlladikrHE---ErH~eiar~har~mEkaL~sl  150 (210)
T COG5661         109 KADPELALVWDTLLADIKRHE---ERHAEIARYHAREMEKALLSL  150 (210)
T ss_pred             CCCHhHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHhc
Confidence            589999999999877666554   478888999999999999998


No 30 
>PF10112 Halogen_Hydrol:  5-bromo-4-chloroindolyl phosphate hydrolysis protein;  InterPro: IPR018770 This entry consists of prokaryotic proteins that mediate the hydrolysis of 5-bromo-4-chloroindolyl phosphate bonds. 
Probab=21.32  E-value=2.2e+02  Score=23.60  Aligned_cols=43  Identities=26%  Similarity=0.414  Sum_probs=29.4

Q ss_pred             ChhhhHHHHHH----HHHHHHHHHhhchH--HHHHHHHHHHHHHHHHhh
Q 038921          145 PELDEFMETYS----DMLAKYKSDLARPF--DEATTFLNKVELQLRNLC  187 (197)
Q Consensus       145 PELDqFMeaYc----~mL~kykEEL~rPf--~EA~~F~~~iE~QL~~L~  187 (197)
                      ...+.||..|-    +++.||.+==.+|.  .|+...+.++|.-|..|+
T Consensus       125 ~~a~~Fl~~yLp~~~~l~~kY~~l~~~~~~~~~~~~~l~e~~~~L~~l~  173 (199)
T PF10112_consen  125 TQARKFLYYYLPTAVKLLEKYAELESQPVKSEEIKQSLEEIEETLDTLN  173 (199)
T ss_pred             HHHHHHHHHHhhHHHHHHHHHHHHHhccCCChhHHHHHHHHHHHHHHHH
Confidence            57899999995    56777877555665  555566666666666554


No 31 
>PHA03397 vlf-1 very late expression factor 1; Provisional
Probab=21.32  E-value=67  Score=30.27  Aligned_cols=29  Identities=28%  Similarity=0.415  Sum_probs=25.6

Q ss_pred             hHHHHHHHhhCCChHHHHHHHHhccccCC
Q 038921           86 STVIKAKIASHPCYPRLLEAYIDCQKVGA  114 (197)
Q Consensus        86 s~~iKAkI~sHPlYp~LL~AyidC~KVGA  114 (197)
                      -..-|.+|-+||+|+...+.=++=||-+-
T Consensus        12 ~~~w~~~i~~~~~f~~~~~~~~~rq~~~~   40 (363)
T PHA03397         12 YNIWKLVIQNHPRFEEVFDLAIDRQKCTP   40 (363)
T ss_pred             HHHHHHHHhccchHHHHHHHHHHhhcCCC
Confidence            45789999999999999999999999543


No 32 
>cd03209 GST_C_Mu GST_C family, Class Mu subfamily; GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins, and products of oxidative stress. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. The class Mu subfamily is composed of eukaryotic GSTs. In rats, at least six distinct class Mu subunits have been identified, with homologous genes in humans for five of these subunits. Class Mu GSTs can form homodimers and heterodimers, giving a large number of possible isoenzymes that can be formed, all with overlapping activities but different substrate specificities. They are the m
Probab=20.95  E-value=2.8e+02  Score=20.42  Aligned_cols=14  Identities=14%  Similarity=-0.088  Sum_probs=8.0

Q ss_pred             HHHHHHHHHHHHHH
Q 038921          171 EATTFLNKVELQLR  184 (197)
Q Consensus       171 EA~~F~~~iE~QL~  184 (197)
                      +....+..+|.+|.
T Consensus        39 ~~~~~l~~le~~L~   52 (121)
T cd03209          39 KLPDKLKLFSDFLG   52 (121)
T ss_pred             HHHHHHHHHHHHhC
Confidence            33445666666664


No 33 
>PF13865 FoP_duplication:  C-terminal duplication domain of Friend of PRMT1
Probab=20.65  E-value=65  Score=23.44  Aligned_cols=8  Identities=63%  Similarity=1.111  Sum_probs=4.2

Q ss_pred             CChhhhHH
Q 038921          144 DPELDEFM  151 (197)
Q Consensus       144 DPELDqFM  151 (197)
                      |.|||+||
T Consensus        46 DaELD~Ym   53 (74)
T PF13865_consen   46 DAELDAYM   53 (74)
T ss_pred             HHHHHHHH
Confidence            44555555


No 34 
>PF10152 DUF2360:  Predicted coiled-coil domain-containing protein (DUF2360);  InterPro: IPR019309 This entry represents a component of the WASH complex. The WASH complex is present at the surface of endosomes and recruits and activates the Arp2/3 complex to induce actin polymerisation. The WASH complex plays a key role in the fission of tubules that serve as transport intermediates during endosome sorting []. The WASH complex's subunit structure: F-actin-capping protein subunit alpha (CAPZA1, CAPZA2 or CAPZA3), F-actin-capping protein subunit beta (CAPZB), WASH (WASH1, WASH2P, WASH3P, WASH4P, WASH5P or WASH6P), FAM21 (FAM21A, FAM21B or FAM21C), KIAA1033, KIAA0196 (strumpellin) and CCDC53.
Probab=20.61  E-value=2.9e+02  Score=22.51  Aligned_cols=42  Identities=17%  Similarity=0.320  Sum_probs=28.1

Q ss_pred             hhhHHHHHHHHHHHHHHH-------hhchHHHHHHHHHHHHHHHHHhhc
Q 038921          147 LDEFMETYSDMLAKYKSD-------LARPFDEATTFLNKVELQLRNLCT  188 (197)
Q Consensus       147 LDqFMeaYc~mL~kykEE-------L~rPf~EA~~F~~~iE~QL~~L~~  188 (197)
                      +.+|+..=|..|.+|--.       +++-++.+-.-+.=+|..|.+|-+
T Consensus         1 vN~Fi~~tv~fLN~F~~~cE~kL~~~e~~Lq~~E~~l~iLEaKL~SIpg   49 (148)
T PF10152_consen    1 VNHFIVHTVQFLNRFASVCEEKLSDMEQRLQRLEATLNILEAKLSSIPG   49 (148)
T ss_pred             CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc
Confidence            357888888888888533       344456666666667777777643


No 35 
>PF14689 SPOB_a:  Sensor_kinase_SpoOB-type, alpha-helical domain; PDB: 1F51_C 2FTK_B 1IXM_B.
Probab=20.57  E-value=3e+02  Score=19.03  Aligned_cols=37  Identities=19%  Similarity=0.372  Sum_probs=28.1

Q ss_pred             HHHHHHHHHHHHHHhhchH------------HHHHHHHHHHHHHHHHhh
Q 038921          151 METYSDMLAKYKSDLARPF------------DEATTFLNKVELQLRNLC  187 (197)
Q Consensus       151 MeaYc~mL~kykEEL~rPf------------~EA~~F~~~iE~QL~~L~  187 (197)
                      ++.+.+.|++++=+.-+++            ++|..+++++-.+++.+.
T Consensus         9 ~~~~~~~lR~~RHD~~NhLqvI~gllqlg~~~~a~eYi~~~~~~~~~~s   57 (62)
T PF14689_consen    9 LEELIDSLRAQRHDFLNHLQVIYGLLQLGKYEEAKEYIKELSKDLQQES   57 (62)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHhHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHH
Confidence            3556677777777777764            799999999988887763


Done!