Query         psy12912
Match_columns 185
No_of_seqs    105 out of 161
Neff          4.7 
Searched_HMMs 46136
Date          Fri Aug 16 22:56:16 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy12912.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/12912hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG3284|consensus              100.0 6.8E-83 1.5E-87  530.5  14.6  183    3-185     8-196 (213)
  2 PF03997 VPS28:  VPS28 protein; 100.0 4.9E-77 1.1E-81  495.7   9.7  164   22-185     1-170 (188)
  3 PF09454 Vps23_core:  Vps23 cor  93.3     0.4 8.7E-06   33.9   6.3   48   10-57      6-53  (65)
  4 PF09851 SHOCT:  Short C-termin  89.9    0.48   1E-05   28.8   3.1   29   29-57      2-30  (31)
  5 PF06464 DMAP_binding:  DMAP1-b  61.2      18 0.00039   27.8   4.6   35   28-62      9-43  (111)
  6 PF14090 HTH_39:  Helix-turn-he  52.4      24 0.00051   24.6   3.6   41   59-100     2-45  (70)
  7 cd00155 RasGEF Guanine nucleot  47.9 1.2E+02  0.0026   25.0   7.8  115   19-171    91-222 (237)
  8 cd01145 TroA_c Periplasmic bin  47.2      28  0.0006   28.7   3.8   53   49-102   146-202 (203)
  9 PF14056 DUF4250:  Domain of un  45.2      21 0.00045   24.7   2.3   29   69-97     17-46  (55)
 10 PF05120 GvpG:  Gas vesicle pro  45.1      89  0.0019   23.0   5.8   48   14-65     23-70  (79)
 11 COG3462 Predicted membrane pro  44.2      34 0.00073   27.1   3.6   26   30-55     89-114 (117)
 12 cd00672 CysRS_core catalytic c  42.6      29 0.00063   29.3   3.3   60   36-102    73-132 (213)
 13 PF08044 DUF1707:  Domain of un  40.4      62  0.0014   21.8   4.1   27   31-57     11-37  (53)
 14 COG5150 Class 2 transcription   39.0      62  0.0013   26.4   4.5   30   28-66     71-100 (148)
 15 PF00157 Pou:  Pou domain - N-t  34.4      48   0.001   24.3   3.0   64   74-145     8-72  (75)
 16 PRK08719 ribonuclease H; Revie  34.0      47   0.001   26.4   3.1   43   18-62     48-91  (147)
 17 PF08365 IGF2_C:  Insulin-like   30.2      22 0.00048   24.8   0.6   16   88-103    11-26  (56)
 18 PF11740 KfrA_N:  Plasmid repli  29.0      68  0.0015   23.8   3.1   59   44-102     1-64  (120)
 19 PF07240 Turandot:  Stress-indu  28.8      38 0.00083   25.4   1.7   43   52-99     11-55  (85)
 20 COG2257 Uncharacterized homolo  27.4      62  0.0013   24.7   2.6   18   16-33     69-86  (92)
 21 PF02725 Paramyxo_NS_C:  Non-st  27.4      89  0.0019   26.2   3.7   66   73-151    91-162 (165)
 22 COG4496 Uncharacterized protei  27.2 1.2E+02  0.0025   23.5   4.0   41   16-56      5-47  (100)
 23 PF05816 TelA:  Toxic anion res  26.7 1.8E+02  0.0039   26.1   5.9   53  110-166    22-77  (333)
 24 PF09756 DDRGK:  DDRGK domain;   25.5   1E+02  0.0022   26.1   3.8   47   56-105   100-150 (188)
 25 PLN02618 tryptophan synthase,   24.6      94   0.002   29.1   3.8   34   27-61     30-63  (410)
 26 COG1725 Predicted transcriptio  24.6 3.3E+02   0.007   21.7   6.3   91   49-145    13-121 (125)
 27 cd07977 TFIIE_beta_winged_heli  23.5      87  0.0019   22.5   2.7   36  130-173    18-54  (75)
 28 PF11112 PyocinActivator:  Pyoc  22.8 1.2E+02  0.0027   21.9   3.4   38   55-99      4-44  (76)
 29 PRK04346 tryptophan synthase s  22.7 1.1E+02  0.0024   28.5   3.8   33   27-60     22-54  (397)
 30 PF06757 Ins_allergen_rp:  Inse  22.5 4.2E+02   0.009   21.5   6.9   85   16-101     2-90  (179)
 31 PF12668 DUF3791:  Protein of u  20.9      80  0.0017   21.5   2.0   24   75-98      8-32  (62)
 32 PRK06548 ribonuclease H; Provi  20.8      83  0.0018   25.7   2.4   61   19-79     41-104 (161)
 33 PF04651 Pox_A12:  Poxvirus A12  20.7   1E+02  0.0023   26.3   3.0   27   44-70      9-35  (189)
 34 cd01019 ZnuA Zinc binding prot  20.2 2.7E+02  0.0058   24.2   5.6   39   49-87    159-201 (286)

No 1  
>KOG3284|consensus
Probab=100.00  E-value=6.8e-83  Score=530.45  Aligned_cols=183  Identities=63%  Similarity=1.057  Sum_probs=180.4

Q ss_pred             CccccccCCCchHHHhhhhHHHHHHHHHHHHHHHHhhhccCCChHHHHHHHHHHHHHHHHHHHhhcCCCCCCHHHHHHHh
Q psy12912          3 NLEEVKLFTNTKEREKFDNQADLYAVINTLQNLEKAYIRDCVTPKEYTAACSKLLVQYKAAFKQVQGEEYPTVEHFVKKY   82 (185)
Q Consensus         3 ~~~EV~L~~~~~eRe~ye~lAelysII~tle~LEkAyirD~It~~eYt~~c~kLl~Qyk~~~~~v~~~~~~~l~~F~~~y   82 (185)
                      +++|||||+|++|||+|||||||||||+|+|+|||||+||+|++.|||++|.|||.|||+++++++|.+|++||.|+++|
T Consensus         8 l~~Evkl~~N~rerE~~enlseLyaIi~ale~LEKAyirD~is~sey~s~c~kLi~Q~k~~~~~~~~~~f~SiE~Fc~ky   87 (213)
T KOG3284|consen    8 LYEEVKLFNNAREREVYENLSELYAIIKALEQLEKAYIRDCISPSEYTSECSKLIVQYKVAFRSVQGTEFPSIEDFCKKY   87 (213)
T ss_pred             HHHHHHHhhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHccCCHHHHHHHHHHHHHHHHHHHHHhcccccCcHHHHHHHH
Confidence            57899999999999999999999999999999999999999999999999999999999999999988999999999999


Q ss_pred             cCCChHHHhhhhhcCCCceec------CCCCchhHHHhhhhhhHHHHhhhccccchhhhhcchHHHHHHHhhhcCCCCCC
Q psy12912         83 RLDCPSALERIREDRPITIKD------DKGNTSKCIADIVSLFITTMDKLRLDIKAMDELQPDLRDLMDTMNRLSILPSG  156 (185)
Q Consensus        83 ~l~cp~A~~Rl~~G~PaTie~------~~~~~~~~Iae~t~~FIT~MDaLKL~~~a~DqLhPlL~dL~~sln~~~~~p~d  156 (185)
                      +|+||+|++||++|+|+|++|      ++|+++|+|||+||||||+|||||||++|||||||+|+||+.|||+++.+|+|
T Consensus        88 rl~cp~Ai~Ri~~~~piT~e~~ia~s~dk~~~ak~IAe~v~nFIT~mDaLrLn~~A~Dql~PlL~dL~~smnrls~~p~d  167 (213)
T KOG3284|consen   88 RLDCPAAIERIREGRPITVEDRIAPSADKGNSAKCIAEIVQNFITVMDALRLNINAVDQLYPLLSDLSASMNRLSRLPPD  167 (213)
T ss_pred             ccCChHHHHHHHcCCCCcccccccccCCcccHHHHHHHHHHHHHHHHHHHHHhhHHHHhhcchHHHHHHHHHhhccCCcc
Confidence            999999999999999999999      78999999999999999999999999999999999999999999999999999


Q ss_pred             CcchhhHHHHHHHHhcCCCCccCCccccC
Q psy12912        157 FEGKEKVSGWLTTLESMSASDELNETQFN  185 (185)
Q Consensus       157 f~gr~kl~~Wl~kLn~M~AsdeL~eeq~~  185 (185)
                      |+||.||++||++||+|.|||||+|.|+|
T Consensus       168 fe~r~Kv~~Wl~rls~M~asDeL~e~q~R  196 (213)
T KOG3284|consen  168 FEGRTKVKQWLIRLSKMSASDELTEQQVR  196 (213)
T ss_pred             chhHHHHHHHHHHHhcccccccccHHHHH
Confidence            99999999999999999999999999987


No 2  
>PF03997 VPS28:  VPS28 protein;  InterPro: IPR007143 The Endosomal Sorting Complex Required for Transport (ESCRT) complexes form the machinery driving protein sorting from endosomes to lysosomes. ESCRT complexes are central to receptor down-regulation, lysosome biogenesis, and budding of HIV. Yeast ESCRT-I consists of three protein subunits, VPS23, VPS28, and VPS37. In humans, ESCRT-I comprises TSG101, VPS28, and one of four potential human VPS37 homologues. The main role of ESCRT-I is to recognise ubiquitinated cargo via the UEV domain of the VPS23/TSG101 subunit. The assembly of the ESCRT-I complex is directed by the C-terminal steadiness box (SB) of VPS23, the N-terminal half of VPS28, and the C-terminal half of VPS37. The structure is primarily composed of three long, parallel helical hairpins, each corresponding to a different subunit. The additional domains and motifs extending beyond the core serve as gripping tools for ESCRT-I critical functions [, ].; PDB: 2J9W_B 2J9U_C 2G3K_A 2F66_E 2F6M_D 2J9V_A 2CAZ_E 2P22_B.
Probab=100.00  E-value=4.9e-77  Score=495.71  Aligned_cols=164  Identities=54%  Similarity=0.908  Sum_probs=142.5

Q ss_pred             HHHHHHHHHHHHHHHHhhhccCCChHHHHHHHHHHHHHHHHHHHhhcCCCCCCHHHHHHHhcCCChHHHhhhhhcCCCce
Q psy12912         22 QADLYAVINTLQNLEKAYIRDCVTPKEYTAACSKLLVQYKAAFKQVQGEEYPTVEHFVKKYRLDCPSALERIREDRPITI  101 (185)
Q Consensus        22 lAelysII~tle~LEkAyirD~It~~eYt~~c~kLl~Qyk~~~~~v~~~~~~~l~~F~~~y~l~cp~A~~Rl~~G~PaTi  101 (185)
                      ||||||||+|||+||||||||+||++|||++|+|||+|||++++.++++.|+||++|+++|+|+||+|++||++|+|+||
T Consensus         1 LAeLysII~tle~LEkayikD~It~~eYt~~c~kLl~Qyk~~~~~~~~~~~~~le~F~~~y~l~cp~A~~Rl~~G~P~Ti   80 (188)
T PF03997_consen    1 LAELYSIIKTLEHLEKAYIKDSITEKEYTTACNKLLNQYKTILKQLKDDEFPDLEEFMKKYNLDCPAALERLREGVPATI   80 (188)
T ss_dssp             HHHHHHHHHHHHHHHHHHHTTSS-HHHHHHHHHHHHHHHHHHHTSTTHHHHHHHHHHHHHTTS-HHHHHHHHHCTSS---
T ss_pred             ChHHHHHHHHHHHHHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHcccccCCCHHHHHHHhcccCChHHHHHHcCCCCch
Confidence            79999999999999999999999999999999999999999999986547899999999999999999999999999999


Q ss_pred             ec------CCCCchhHHHhhhhhhHHHHhhhccccchhhhhcchHHHHHHHhhhcCCCCCCCcchhhHHHHHHHHhcCCC
Q psy12912        102 KD------DKGNTSKCIADIVSLFITTMDKLRLDIKAMDELQPDLRDLMDTMNRLSILPSGFEGKEKVSGWLTTLESMSA  175 (185)
Q Consensus       102 e~------~~~~~~~~Iae~t~~FIT~MDaLKL~~~a~DqLhPlL~dL~~sln~~~~~p~df~gr~kl~~Wl~kLn~M~A  175 (185)
                      +|      +++++||+|||+||||||+|||||||++|||||||+|+||+.||||++.+|+||+||+||++|+++||+|+|
T Consensus        81 e~~~~~~~~~~~~ak~Vae~t~~FIT~mDaLKLn~~a~DqLhPlL~dL~~slnr~~~~~~dfe~r~kl~~Wl~~Ln~m~a  160 (188)
T PF03997_consen   81 EHRISSSSDKGNSAKLVAEATQNFITLMDALKLNYRAKDQLHPLLSDLMQSLNRVTDLPPDFEGRSKLVEWLIKLNGMKA  160 (188)
T ss_dssp             -----------CHHHHHHHHHHHHHHHHHHHHTT--BHHHHHHHHHHHHHHHHHCTTS-TT-CCHHHHHHHHHHHHTS-T
T ss_pred             hhhcccccCCchHHHHHHHHhChhhhhhHHHhccchhHhhHhhHHHHHHHHHhccCCCCCCCccHHHHHHHHHHHhCCCc
Confidence            99      467789999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             CccCCccccC
Q psy12912        176 SDELNETQFN  185 (185)
Q Consensus       176 sdeL~eeq~~  185 (185)
                      +|||||+|+|
T Consensus       161 sdeL~e~q~r  170 (188)
T PF03997_consen  161 SDELSEEQAR  170 (188)
T ss_dssp             T-B--HHHHH
T ss_pred             ccccCHHHHH
Confidence            9999999986


No 3  
>PF09454 Vps23_core:  Vps23 core domain;  InterPro: IPR017916 The Endosomal Sorting Complex Required for Transport (ESCRT) complexes form the machinery driving protein sorting from endosomes to lysosomes. ESCRT complexes are central to receptor down-regulation, lysosome biogenesis, and budding of HIV. Yeast ESCRT-I consists of three protein subunits, VPS23, VPS28, and VPS37. In humans, ESCRT-I comprises TSG101, VPS28, and one of four potential human VPS37 homologues. The main role of ESCRT-I is to recognise ubiquitinated cargo via the UEV domain of the VPS23/TSG101 subunit. The assembly of the ESCRT-I complex is directed by the C-terminal steadiness box (SB) of VPS23, the N-terminal half of VPS28, and the C-terminal half of VPS37. The structure is primarily composed of three long, parallel helical hairpins, each corresponding to a different subunit. The additional domains and motifs extending beyond the core serve as gripping tools for ESCRT-I critical functions [, ]. This entry represents the Steadiness box domain.; PDB: 2CAZ_A 2F66_D 2F6M_A 2P22_A.
Probab=93.33  E-value=0.4  Score=33.92  Aligned_cols=48  Identities=15%  Similarity=0.205  Sum_probs=42.2

Q ss_pred             CCCchHHHhhhhHHHHHHHHHHHHHHHHhhhccCCChHHHHHHHHHHH
Q psy12912         10 FTNTKEREKFDNQADLYAVINTLQNLEKAYIRDCVTPKEYTAACSKLL   57 (185)
Q Consensus        10 ~~~~~eRe~ye~lAelysII~tle~LEkAyirD~It~~eYt~~c~kLl   57 (185)
                      -.++..++.||-.|+-.+|==|+.+|.+|+-++.|+-+.|-.....|=
T Consensus         6 ~~~~l~~Ql~el~Aed~AieDtiy~L~~al~~g~I~~d~~lK~vR~La   53 (65)
T PF09454_consen    6 AEDPLSNQLYELVAEDHAIEDTIYYLDRALQRGSIDLDTFLKQVRSLA   53 (65)
T ss_dssp             -SSHHHHHHHHHHHHHHHHHHHHHHHHHHHHTTSS-HHHHHHHHHHHH
T ss_pred             cCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHHHHH
Confidence            467889999999999999999999999999999999999988876653


No 4  
>PF09851 SHOCT:  Short C-terminal domain;  InterPro: IPR018649  This family of hypothetical prokaryotic proteins has no known function. 
Probab=89.89  E-value=0.48  Score=28.76  Aligned_cols=29  Identities=24%  Similarity=0.458  Sum_probs=25.7

Q ss_pred             HHHHHHHHHhhhccCCChHHHHHHHHHHH
Q psy12912         29 INTLQNLEKAYIRDCVTPKEYTAACSKLL   57 (185)
Q Consensus        29 I~tle~LEkAyirD~It~~eYt~~c~kLl   57 (185)
                      .-.|+.|...|-++.||++||...=.+||
T Consensus         2 ~~~L~~L~~l~~~G~IseeEy~~~k~~ll   30 (31)
T PF09851_consen    2 EDRLEKLKELYDKGEISEEEYEQKKARLL   30 (31)
T ss_pred             hHHHHHHHHHHHcCCCCHHHHHHHHHHHh
Confidence            45688999999999999999999888876


No 5  
>PF06464 DMAP_binding:  DMAP1-binding Domain;  InterPro: IPR010506 This domain binds DMAP1, a transcriptional co-repressor.; GO: 0008134 transcription factor binding, 0005634 nucleus
Probab=61.20  E-value=18  Score=27.78  Aligned_cols=35  Identities=37%  Similarity=0.417  Sum_probs=31.7

Q ss_pred             HHHHHHHHHHhhhccCCChHHHHHHHHHHHHHHHH
Q psy12912         28 VINTLQNLEKAYIRDCVTPKEYTAACSKLLVQYKA   62 (185)
Q Consensus        28 II~tle~LEkAyirD~It~~eYt~~c~kLl~Qyk~   62 (185)
                      +-..|..||.-|-.+.||.+.|...=.+||.+|-.
T Consensus         9 vq~~L~~L~~el~~GdiT~KGY~kkr~~LL~~yl~   43 (111)
T PF06464_consen    9 VQNRLQELDLELEEGDITQKGYEKKRSKLLAPYLP   43 (111)
T ss_pred             HHHHHHHHHHhhhcCcchHHHHHHHHHHHHHHHHh
Confidence            44578899999999999999999999999999974


No 6  
>PF14090 HTH_39:  Helix-turn-helix domain
Probab=52.41  E-value=24  Score=24.61  Aligned_cols=41  Identities=22%  Similarity=0.297  Sum_probs=25.9

Q ss_pred             HHHHHHHhh-cCCCCCCHHHHHHHhcCCChHHH-hhhh-hcCCCc
Q psy12912         59 QYKAAFKQV-QGEEYPTVEHFVKKYRLDCPSAL-ERIR-EDRPIT  100 (185)
Q Consensus        59 Qyk~~~~~v-~~~~~~~l~~F~~~y~l~cp~A~-~Rl~-~G~PaT  100 (185)
                      |.+.++..+ .+..+.++| +...|+.-||+|. ..|+ .|.|+-
T Consensus         2 Q~~rIL~~L~~~~~it~~e-a~~~~gi~~~~aRI~eLR~~G~~I~   45 (70)
T PF14090_consen    2 QCKRILAALRRGGSITTLE-ARRELGIMRLAARISELRKKGYPIV   45 (70)
T ss_pred             HHHHHHHHHHcCCCcCHHH-HHHHcCCCCHHHHHHHHHHcCCeee
Confidence            555566655 344444444 4588888888774 4664 599983


No 7  
>cd00155 RasGEF Guanine nucleotide exchange factor for Ras-like small GTPases. Small GTP-binding proteins of the Ras superfamily function as molecular switches in fundamental events such as signal transduction, cytoskeleton dynamics and intracellular trafficking. Guanine-nucleotide-exchange factors (GEFs) positively regulate these GTP-binding proteins in response to a variety of signals. GEFs catalyze the dissociation of GDP from the inactive GTP-binding proteins. GTP can then bind and induce structural changes that allow interaction with effectors.
Probab=47.91  E-value=1.2e+02  Score=24.99  Aligned_cols=115  Identities=18%  Similarity=0.259  Sum_probs=62.5

Q ss_pred             hhhHHHHHHHHHHHHH-----HHHhhhccCCCh--HHHHHHHHHHHH---HHHHHHHhh-----cCCCCCCHHHHHHHhc
Q psy12912         19 FDNQADLYAVINTLQN-----LEKAYIRDCVTP--KEYTAACSKLLV---QYKAAFKQV-----QGEEYPTVEHFVKKYR   83 (185)
Q Consensus        19 ye~lAelysII~tle~-----LEkAyirD~It~--~eYt~~c~kLl~---Qyk~~~~~v-----~~~~~~~l~~F~~~y~   83 (185)
                      .-|+.-+++|+.+|.+     |.+.+  +.++.  .++.....++..   .|+.+.+.+     ...-+|-+--|.+.. 
T Consensus        91 l~Nfns~~aI~~~L~~~~I~RL~~tw--~~l~~~~~~~~~~l~~l~~~~~n~~~yr~~l~~~~~~~p~IP~lg~~l~dl-  167 (237)
T cd00155          91 LNNFNSLMAIVSALSSSPISRLKKTW--EVLSSKLKKLFEELEELVDPSRNFKNYRKLLKSVGPNPPCVPFLGVYLKDL-  167 (237)
T ss_pred             hCCHHHHHHHHHHhCCccchhHHHHH--HHCCHHHHHHHHHHHHHhCcHhhHHHHHHHHHhcCCCCCCeeehhHHHHHH-
Confidence            5677778888888864     44432  23443  334444445554   555444333     223456666666653 


Q ss_pred             CCChHHHhhhhhcCCCceecCCCCc--hhHHHhhhhhhHHHHhhhccccchhhhhcchHHHHHHHhhhcCCCCCCCcchh
Q psy12912         84 LDCPSALERIREDRPITIKDDKGNT--SKCIADIVSLFITTMDKLRLDIKAMDELQPDLRDLMDTMNRLSILPSGFEGKE  161 (185)
Q Consensus        84 l~cp~A~~Rl~~G~PaTie~~~~~~--~~~Iae~t~~FIT~MDaLKL~~~a~DqLhPlL~dL~~sln~~~~~p~df~gr~  161 (185)
                             .++..|.|.+++..--|-  -+.+++.+..|..+                            ...|-.|....
T Consensus       168 -------~~~~e~~~~~~~~~~iN~~K~~~i~~~i~~~~~~----------------------------Q~~~Y~~~~~~  212 (237)
T cd00155         168 -------TFLHEGNPDFLEGNLVNFEKRRKIAEILREIRQL----------------------------QSNSYELNRDE  212 (237)
T ss_pred             -------HHHHccCCccCcCCCccHHHHHHHHHHHHHHHHH----------------------------hhCCCCCCCcH
Confidence                   457778887765222222  23355555554433                            22345566666


Q ss_pred             hHHHHHHHHh
Q psy12912        162 KVSGWLTTLE  171 (185)
Q Consensus       162 kl~~Wl~kLn  171 (185)
                      .+++|+.++.
T Consensus       213 ~iq~~l~~~~  222 (237)
T cd00155         213 DILAFLWKLL  222 (237)
T ss_pred             HHHHHHHhcc
Confidence            7777776665


No 8  
>cd01145 TroA_c Periplasmic binding protein TroA_c.  These proteins are predicted to function as initial receptors in the ABC metal ion uptake in eubacteria and archaea.  They belong to the TroA superfamily of helical backbone metal receptor proteins that share a distinct fold and ligand binding mechanism.  A typical TroA protein is comprised of two globular subdomains connected by a single helix and can bind their ligands in the cleft between these domains.
Probab=47.22  E-value=28  Score=28.66  Aligned_cols=53  Identities=19%  Similarity=0.282  Sum_probs=32.1

Q ss_pred             HHHHHHHHHHHHHHHHHhhcCCCC----CCHHHHHHHhcCCChHHHhhhhhcCCCcee
Q psy12912         49 YTAACSKLLVQYKAAFKQVQGEEY----PTVEHFVKKYRLDCPSALERIREDRPITIK  102 (185)
Q Consensus        49 Yt~~c~kLl~Qyk~~~~~v~~~~~----~~l~~F~~~y~l~cp~A~~Rl~~G~PaTie  102 (185)
                      |......|-.+++..++.+++..|    +.+.-|++.|++.. .++-.+..|+|.+..
T Consensus       146 ~~~~l~~l~~~~~~~l~~~~~~~~v~~H~af~Y~~~~yGl~~-~~~~~~~~~~~p~~~  202 (203)
T cd01145         146 FLAKLNKLLREWERQFEGLKGIQVVAYHPSYQYLADWLGIEV-VASLEPLPELPPTSS  202 (203)
T ss_pred             HHHHHHHHHHHHHHHhhccCCCeEEEecccHHHHHHHcCCce-eeeeccCCCCCCCCC
Confidence            444444444445544444433322    56999999999985 555566677776653


No 9  
>PF14056 DUF4250:  Domain of unknown function (DUF4250)
Probab=45.18  E-value=21  Score=24.70  Aligned_cols=29  Identities=17%  Similarity=0.406  Sum_probs=24.9

Q ss_pred             CCCCCCHHHHHHHhcCCChHHHhhhhh-cC
Q psy12912         69 GEEYPTVEHFVKKYRLDCPSALERIRE-DR   97 (185)
Q Consensus        69 ~~~~~~l~~F~~~y~l~cp~A~~Rl~~-G~   97 (185)
                      .++++|+++||..|+++...=.+||.. |.
T Consensus        17 RD~~~sLd~Lc~~~~id~~~l~~kL~~~Gy   46 (55)
T PF14056_consen   17 RDEYSSLDELCYDYDIDKEELEEKLASIGY   46 (55)
T ss_pred             HhccCCHHHHHHHhCCCHHHHHHHHHHcCC
Confidence            477899999999999999999999953 54


No 10 
>PF05120 GvpG:  Gas vesicle protein G ;  InterPro: IPR007804 Gas vesicles are intracellular, protein-coated, and hollow organelles found in cyanobacteria and halophilic archaea. They are permeable to ambient gases by diffusion and provide buoyancy, enabling cells to move upwards in water to access oxygen and/or light. Proteins containing this family are involved in the formation of gas vesicles []. 
Probab=45.12  E-value=89  Score=23.01  Aligned_cols=48  Identities=25%  Similarity=0.337  Sum_probs=39.9

Q ss_pred             hHHHhhhhHHHHHHHHHHHHHHHHhhhccCCChHHHHHHHHHHHHHHHHHHH
Q psy12912         14 KEREKFDNQADLYAVINTLQNLEKAYIRDCVTPKEYTAACSKLLVQYKAAFK   65 (185)
Q Consensus        14 ~eRe~ye~lAelysII~tle~LEkAyirD~It~~eYt~~c~kLl~Qyk~~~~   65 (185)
                      .||+.||-    =.|=.-|..|+.+|-.+-||++||...=..||.++..+.+
T Consensus        23 Ae~E~~Dp----~~i~~~L~~L~~~~e~GEIseeEf~~~E~eLL~rL~~~~~   70 (79)
T PF05120_consen   23 AERELYDP----AAIRRELAELQEALEAGEISEEEFERREDELLDRLEEARR   70 (79)
T ss_pred             HHHHHcCH----HHHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHHHHHHHH
Confidence            46666643    2466778899999999999999999999999999987754


No 11 
>COG3462 Predicted membrane protein [Function unknown]
Probab=44.19  E-value=34  Score=27.13  Aligned_cols=26  Identities=19%  Similarity=0.316  Sum_probs=21.8

Q ss_pred             HHHHHHHHhhhccCCChHHHHHHHHH
Q psy12912         30 NTLQNLEKAYIRDCVTPKEYTAACSK   55 (185)
Q Consensus        30 ~tle~LEkAyirD~It~~eYt~~c~k   55 (185)
                      .|.|-|-..|.|+-||++||+.--..
T Consensus        89 RA~eIlkER~AkGEItEEEY~r~~~~  114 (117)
T COG3462          89 RAEEILKERYAKGEITEEEYRRIIRT  114 (117)
T ss_pred             HHHHHHHHHHhcCCCCHHHHHHHHHH
Confidence            46788888999999999999986554


No 12 
>cd00672 CysRS_core catalytic core domain of cysteinyl tRNA synthetase. Cysteinyl tRNA synthetase (CysRS) catalytic core domain. This class I enzyme is a monomer which aminoacylates the 2'-OH of the nucleotide at the 3' of the appropriate tRNA. The core domain is based on the Rossman fold and is responsible for the ATP-dependent formation of the enzyme bound aminoacyl-adenylate. It contains the characteristic class I HIGH and KMSKS motifs, which are involved in ATP binding.
Probab=42.60  E-value=29  Score=29.31  Aligned_cols=60  Identities=12%  Similarity=0.168  Sum_probs=38.4

Q ss_pred             HHhhhccCCChHHHHHHHHHHHHHHHHHHHhhcCCCCCCHHHHHHHhcCCChHHHhhhhhcCCCcee
Q psy12912         36 EKAYIRDCVTPKEYTAACSKLLVQYKAAFKQVQGEEYPTVEHFVKKYRLDCPSALERIREDRPITIK  102 (185)
Q Consensus        36 EkAyirD~It~~eYt~~c~kLl~Qyk~~~~~v~~~~~~~l~~F~~~y~l~cp~A~~Rl~~G~PaTie  102 (185)
                      +++=.+..+++.||+...   ..+|+..++.+. =.+|  +.|...|++.|.+...+. -|-|..|-
T Consensus        73 ~~~A~~~g~~p~e~~~~~---~~~f~~~~~~l~-i~~~--d~~~rtWh~ec~am~~~~-lg~~~dih  132 (213)
T cd00672          73 IKRAREEGLSWKEVADYY---TKEFFEDMKALN-VLPP--DVVPRVWHIECSAMAMKY-LGETFDIH  132 (213)
T ss_pred             HHHHHHcCCCHHHHHHHH---HHHHHHHHHHcC-CCCC--CcceeehhHHHHHHHHHH-cCCCccEE
Confidence            333344568888776655   566777777662 2222  557777999999766653 46677765


No 13 
>PF08044 DUF1707:  Domain of unknown function (DUF1707);  InterPro: IPR012551 This domain is found in a variety of actinomycetales proteins. All of the proteins containing this domain are hypothetical and probably membrane bound or associated. Currently, it is unclear to the function of this domain.
Probab=40.36  E-value=62  Score=21.82  Aligned_cols=27  Identities=11%  Similarity=0.203  Sum_probs=24.2

Q ss_pred             HHHHHHHhhhccCCChHHHHHHHHHHH
Q psy12912         31 TLQNLEKAYIRDCVTPKEYTAACSKLL   57 (185)
Q Consensus        31 tle~LEkAyirD~It~~eYt~~c~kLl   57 (185)
                      +++.|-.||-.+-++..||..-|.+..
T Consensus        11 ~~~~L~~a~a~GrL~~~Ef~~R~~~a~   37 (53)
T PF08044_consen   11 AVDLLRAAFAEGRLSLDEFDERLDAAY   37 (53)
T ss_pred             HHHHHHHHHHCCCCCHHHHHHHHHHHH
Confidence            688999999999999999999887654


No 14 
>COG5150 Class 2 transcription repressor NC2, beta subunit (Dr1) [Transcription]
Probab=39.01  E-value=62  Score=26.38  Aligned_cols=30  Identities=43%  Similarity=0.598  Sum_probs=24.2

Q ss_pred             HHHHHHHHHHhhhccCCChHHHHHHHHHHHHHHHHHHHh
Q psy12912         28 VINTLQNLEKAYIRDCVTPKEYTAACSKLLVQYKAAFKQ   66 (185)
Q Consensus        28 II~tle~LEkAyirD~It~~eYt~~c~kLl~Qyk~~~~~   66 (185)
                      ||+||+-||         -++|.+.|.+....||...++
T Consensus        71 viKALenLe---------f~eyi~~~~e~~~n~k~~qK~  100 (148)
T COG5150          71 VIKALENLE---------FEEYIESCMEEHENYKSYQKQ  100 (148)
T ss_pred             HHHHHHhcc---------HHHHHHHHHHHHHHHHHHHhh
Confidence            566666665         578999999999999998765


No 15 
>PF00157 Pou:  Pou domain - N-terminal to homeobox domain;  InterPro: IPR000327 POU proteins are eukaryotic transcription factors containing a bipartite DNA binding domain referred to as the POU domain. The acronym POU (pronounced 'pow') is derived from the names of three mammalian transcription factors, the pituitary-specific Pit-1, the octamer-binding proteins Oct-1 and Oct-2, and the neural Unc-86 from Caenorhabditis elegans. POU domain genes have been identified in diverse organisms including nematodes, flies, amphibians, fish and mammals but have not been yet identified in plants and fungi. The various members of the POU family have a wide variety of functions, all of which are related to the function of the neuroendocrine system [] and the development of an organism []. Some other genes are also regulated, including those for immunoglobulin light and heavy chains (Oct-2) [, ], and trophic hormone genes, such as those for prolactin and growth hormone (Pit-1).  The POU domain is a bipartite domain composed of two subunits separated by a non-conserved region of 15-55 aa. The N-terminal subunit is known as the POU-specific (POUs) domain (IPR000327 from INTERPRO), while the C-terminal subunit is a homeobox domain (IPR007103 from INTERPRO). 3D structures of complexes including both POU subdomains bound to DNA are available. Both subdomains contain the structural motif 'helix-turn-helix', which directly associates with the two components of bipartite DNA binding sites, and both are required for high affinity sequence-specific DNA-binding. The domain may also be involved in protein-protein interactions []. The subdomains are connected by a flexible linker [, , ]. In proteins a POU-specific domain is always accompanied by a homeodomain. Despite of the lack of sequence homology, 3D structure of POUs is similar to 3D structure of bacteriophage lambda repressor and other members of HTH_3 family [, ]. This entry represents the POU-specific subunit of the POU domain.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent; PDB: 3D1N_O 1AU7_A 3L1P_A 2XSD_C 1O4X_A 1HF0_B 1GT0_C 1POU_A 1CQT_B 1E3O_C ....
Probab=34.41  E-value=48  Score=24.29  Aligned_cols=64  Identities=20%  Similarity=0.319  Sum_probs=43.0

Q ss_pred             CHHHHHHHhcCCChHHHhhhhhcCCCceecCCCCchhH-HHhhhhhhHHHHhhhccccchhhhhcchHHHHHH
Q psy12912         74 TVEHFVKKYRLDCPSALERIREDRPITIKDDKGNTSKC-IADIVSLFITTMDKLRLDIKAMDELQPDLRDLMD  145 (185)
Q Consensus        74 ~l~~F~~~y~l~cp~A~~Rl~~G~PaTie~~~~~~~~~-Iae~t~~FIT~MDaLKL~~~a~DqLhPlL~dL~~  145 (185)
                      .+++|.+.|+      ..||+-|.--+--  +...++. -.+.+|..|.-+-+|-|..+.+=.|.|+|..-+.
T Consensus         8 ele~Fa~~fk------~rRi~LG~TQ~dV--g~al~~~~G~~~SQttI~RFE~L~LS~kn~~klkP~L~kWL~   72 (75)
T PF00157_consen    8 ELEQFAKEFK------QRRIKLGYTQADV--GAALGRLYGKEFSQTTICRFEALQLSFKNMCKLKPLLEKWLE   72 (75)
T ss_dssp             HHHHHHHHHH------HHHHHTT--HHHH--HHHHHHHHSSGGSHHHHHHHHTTTSCHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHH------HhhhhcccCHHHH--hHHHHHhcCccccchhhhhhHhcccCHHHHHHHHHHHHHHHH
Confidence            4889999985      6788887651110  0001111 1235788899999999999999999999976543


No 16 
>PRK08719 ribonuclease H; Reviewed
Probab=33.98  E-value=47  Score=26.41  Aligned_cols=43  Identities=16%  Similarity=0.239  Sum_probs=34.6

Q ss_pred             hhhhHHHHHHHHHHHHHHHHhhhccCCChHHHH-HHHHHHHHHHHH
Q psy12912         18 KFDNQADLYAVINTLQNLEKAYIRDCVTPKEYT-AACSKLLVQYKA   62 (185)
Q Consensus        18 ~ye~lAelysII~tle~LEkAyirD~It~~eYt-~~c~kLl~Qyk~   62 (185)
                      ...+-|||-+|+.+|+.+.+.-  -..|+++|. ..+++-+.+++.
T Consensus        48 ~Tnn~aEl~A~~~aL~~~~~~~--~i~tDS~yvi~~i~~~~~~W~~   91 (147)
T PRK08719         48 TDNAELELLALIEALEYARDGD--VIYSDSDYCVRGFNEWLDTWKQ   91 (147)
T ss_pred             ccHHHHHHHHHHHHHHHcCCCC--EEEechHHHHHHHHHHHHHHHh
Confidence            4678899999999999987763  477889998 566777888864


No 17 
>PF08365 IGF2_C:  Insulin-like growth factor II E-peptide;  InterPro: IPR013576  The insulin family of proteins groups together several evolutionarily related active peptides []: these include insulin [, ], relaxin [, ], insect prothoracicotropic hormone (bombyxin) [], insulin-like growth factors (IGF1 and IGF2) [, ], mammalian Leydig cell-specific insulin-like peptide (gene INSL3), early placenta insulin-like peptide (ELIP) (gene INSL4), locust insulin-related peptide (LIRP), molluscan insulin-related peptides (MIP), and Caenorhabditis elegans insulin-like peptides. The 3D structures of a number of family members have been determined [, , ]. The fold comprises two polypeptide chains (A and B) linked by two disulphide bonds: all share a conserved arrangement of 4 cysteines in their A chain, the first of which is linked by a disulphide bond to the third, while the second and fourth are linked by interchain disulphide bonds to cysteines in the B chain.   Insulin is found in many animals, and is involved in the regulation of normal glucose homeostasis. It also has other specific physiological effects, such as increasing the permeability of cells to monosaccharides, amino acids and fatty acids, and accelerating glycolysis and glycogen synthesis in the liver []. Insulin exerts its effects by interaction with a cell-surface receptor, which may also result in the promotion of cell growth [].   Insulin is synthesised as a prepropeptide from which an endoplasmic reticulum-targeting sequence is cleaved to yield proinsulin. The sequence of prosinsulin contains 2 well-conserved regions (designated A and B), separated by an intervening connecting region (C), which is variable between species []. The connecting region is cleaved, liberating the active protein, which contains the A and B chains, held together by 2 disulphide bonds [].  Insulin-like Growth Factor Binding Proteins (IGFBP) are a group of vertebrate secreted proteins, which bind to IGF-I and IGF-II with high affinity and modulate the biological actions of IGFs. The IGFBP family has six distinct subgroups, IGFBP-1 through 6, based on conservation of gene (intron-exon) organisation, structural similarity, and binding affinity for IGFs. Across species, IGFBP-5 exhibits the most sequence conservation, while IGFBP-6 exhibits the least sequence conservation. The IGFBPs contain inhibitor domain homologues, which are related to MEROPS protease inhibitor family I31 (equistatin, clan IX).  All IGFBPs share a common domain architecture (IPR000867 from INTERPRO:IPR000716 from INTERPRO). While the N-terminal (IPR000867 from INTERPRO, IGF binding protein domain), and the C-terminal (IPR000716 from INTERPRO, thyroglobulin type-1 repeat) domains are conserved across vertebrate species, the mid-region is highly variable with respect to protease cleavage sites and phosphorylation and glycosylation sites. IGFBPs contain 16-18 conserved cysteines located in the N-terminal and the C-terminal regions, which form 8-9 disulphide bonds [].   As demonstrated for human IGFBP-5, the N terminus is the primary binding site for IGF. This region, comprised of Val49, Tyr50, Pro62 and Lys68-Leu75, forms a hydrophobic patch on the surface of the protein []. The C terminus is also required for high affinity IGF binding, as well as for binding to the extracellular matrix [] and for nuclear translocation [, ] of IGFBP-3 and -5.   IGFBPs are unusually pleiotropic molecules. Like other binding proteins, IGFBP can prolong the half-life of IGFs via high affinity binding of the ligands. In addition to functioning as simple carrier proteins, serum IGFBPs also serve to regulate the endocrine and paracrine/autocrine actions of IGF by modulating the IGF available to bind to signalling IGF-I receptors [, ]. Furthermore, IGFBPs can function as growth modulators independent of IGFs. For example, IGFBP-5 stimulates markers of bone formation in osteoblasts lacking functional IGFs []. The binding of IGFBP to its putative receptor on the cell membrane may stimulate the signalling pathway independent of an IGF receptor, to mediate the effects of IGFBPs in certain target cell types. IGFBP-1 and -2, but not other IGFBPs, contain a C-terminal Arg-Gly-Asp integrin-binding motif. Thus, IGFBP-1 can also stimulate cell migration of CHO and human trophoblast cells through an action mediated by alpha 5 beta 1 integrin []. Finally, IGFBPs transported into the nucleus (via the nuclear localisation signal) may also exert IGF-independent effects by transcriptional activation of genes. This domain is the C-terminal domain of insulin-like growth factor II proteins (IGF-2, also see IPR004825 from INTERPRO) in vertebrates and seems to represent the E-peptide [, ]. 
Probab=30.23  E-value=22  Score=24.83  Aligned_cols=16  Identities=19%  Similarity=0.493  Sum_probs=13.4

Q ss_pred             HHHhhhhhcCCCceec
Q psy12912         88 SALERIREDRPITIKD  103 (185)
Q Consensus        88 ~A~~Rl~~G~PaTie~  103 (185)
                      .|.+||+.|+||-...
T Consensus        11 ksaqRLRRG~PaiLRa   26 (56)
T PF08365_consen   11 KSAQRLRRGLPAILRA   26 (56)
T ss_pred             HHHHHHHccchHHHHh
Confidence            5889999999987753


No 18 
>PF11740 KfrA_N:  Plasmid replication region DNA-binding N-term;  InterPro: IPR021104  The KfrA family of protiens are encoded on plasmids, generally in or near gene clusters invloved in stable inheritance functions. These proteins are thought to form an all-helical structure, consisting of an N-terminal helix-turn-helix DNA binding domain and an extended coiled-coil tail. The best-characterised KfrA protein, encoded on the broad host-range Plasmid RK2, is a site-specific DNA-binding protein whose operator overlaps its own promoter. The DNA-binding domain is essential for function, while the coiled-coil domain is probably responsible for formation of multimers, and may provide an example of a bridge to host structures required for plasmid partitioning []. This entry represents the N-terminal DNA-binding domain.
Probab=29.00  E-value=68  Score=23.81  Aligned_cols=59  Identities=20%  Similarity=0.327  Sum_probs=39.4

Q ss_pred             CChHHHHHHHHHHHHHHH-----HHHHhhcCCCCCCHHHHHHHhcCCChHHHhhhhhcCCCcee
Q psy12912         44 VTPKEYTAACSKLLVQYK-----AAFKQVQGEEYPTVEHFVKKYRLDCPSALERIREDRPITIK  102 (185)
Q Consensus        44 It~~eYt~~c~kLl~Qyk-----~~~~~v~~~~~~~l~~F~~~y~l~cp~A~~Rl~~G~PaTie  102 (185)
                      ||.++...+|.+|+.+=+     .+...+-+...++|..+++.|+-.-+....--..++|..+.
T Consensus         1 IT~e~V~~Aa~~L~~~G~~pT~~~Vr~~lG~GS~~ti~~~l~~w~~~~~~~~~~~~~~lP~~l~   64 (120)
T PF11740_consen    1 ITYEDVIEAADELLAAGKKPTVRAVRERLGGGSMSTISKHLKEWREEREAQVSEAAPDLPEALQ   64 (120)
T ss_pred             CcHHHHHHHHHHHHHcCCCCCHHHHHHHHCCCCHHHHHHHHHHHHHhhhccccccccCCChhHH
Confidence            788888899999997643     22233433567889999999987766665222346665553


No 19 
>PF07240 Turandot:  Stress-inducible humoral factor Turandot;  InterPro: IPR010825 This family consists of several Drosophila species specific Turandot proteins. The Turandot A (TotA) gene encodes a humoral factor, which is secreted from the fat body and accumulates in the body fluids. TotA is strongly induced upon bacterial challenge, as well as by other types of stress such as high temperature, mechanical pressure, dehydration, UV irradiation, and oxidative agents. It is also upregulated during metamorphosis and at high age. Flies that overexpress TotA show prolonged survival and retain normal activity at otherwise lethal temperatures. Although TotA is only induced by severe stress, it responds to a much wider range of stimuli than heat shock genes such as hsp70 or immune genes such as Cecropin A1 [].
Probab=28.80  E-value=38  Score=25.41  Aligned_cols=43  Identities=12%  Similarity=0.390  Sum_probs=29.0

Q ss_pred             HHHHHHHHHHHHHHhhc--CCCCCCHHHHHHHhcCCChHHHhhhhhcCCC
Q psy12912         52 ACSKLLVQYKAAFKQVQ--GEEYPTVEHFVKKYRLDCPSALERIREDRPI   99 (185)
Q Consensus        52 ~c~kLl~Qyk~~~~~v~--~~~~~~l~~F~~~y~l~cp~A~~Rl~~G~Pa   99 (185)
                      ..-.||.=|+-+-..++  .+.-.+++.|+++|.     +-.++--|+|+
T Consensus        11 ni~eLi~fY~ky~~~~~L~~~~r~~~d~~i~~y~-----~~~~lVDGvPa   55 (85)
T PF07240_consen   11 NIQELIAFYEKYSPRLPLTPQDRQRIDRFIRRYK-----EENNLVDGVPA   55 (85)
T ss_pred             hHHHHHHHHHHcCccCCCCHHHHHHHHHHHHHHH-----HHhhcccCcCC
Confidence            45578888887654431  111236999999997     55557789996


No 20 
>COG2257 Uncharacterized homolog of the cytoplasmic domain of flagellar protein FhlB [Function unknown]
Probab=27.42  E-value=62  Score=24.73  Aligned_cols=18  Identities=22%  Similarity=0.495  Sum_probs=15.7

Q ss_pred             HHhhhhHHHHHHHHHHHH
Q psy12912         16 REKFDNQADLYAVINTLQ   33 (185)
Q Consensus        16 Re~ye~lAelysII~tle   33 (185)
                      -+.|+..||+|+.|..+|
T Consensus        69 eelY~vVAEifafi~~~~   86 (92)
T COG2257          69 EELYEVVAEIFAFIYEVD   86 (92)
T ss_pred             HHHHHHHHHHHHHHHHHH
Confidence            378999999999998776


No 21 
>PF02725 Paramyxo_NS_C:  Non-structural protein C;  InterPro: IPR003875 This family consists of the polymerase accessory protein C from members of the paramyxoviridae.
Probab=27.40  E-value=89  Score=26.21  Aligned_cols=66  Identities=21%  Similarity=0.115  Sum_probs=39.1

Q ss_pred             CCHHHHHHHhcCCChHHHhhhhhcCCCceecC------CCCchhHHHhhhhhhHHHHhhhccccchhhhhcchHHHHHHH
Q psy12912         73 PTVEHFVKKYRLDCPSALERIREDRPITIKDD------KGNTSKCIADIVSLFITTMDKLRLDIKAMDELQPDLRDLMDT  146 (185)
Q Consensus        73 ~~l~~F~~~y~l~cp~A~~Rl~~G~PaTie~~------~~~~~~~Iae~t~~FIT~MDaLKL~~~a~DqLhPlL~dL~~s  146 (185)
                      -|+-.||.+       |+.||++|.|.|..=-      -.++.    |=..|.-|+|=-|..=+  =.+++|+..||+-+
T Consensus        91 YsviMFmiT-------aVkRLrEskmLT~sWf~Qal~~i~~S~----eE~~~L~~Am~~la~lI--P~e~l~ltGDllp~  157 (165)
T PF02725_consen   91 YSVIMFMIT-------AVKRLRESKMLTLSWFNQALMVICQSQ----EEKENLHTAMWILANLI--PREMLYLTGDLLPA  157 (165)
T ss_pred             hhHHHHHHH-------HHHHHhccccchHHHHHHHHHHHcccH----HHHHHHHHHHHHHHHHh--hHHHHHhhhhHHHH
Confidence            468899997       9999999999997410      00110    00111222222221111  24688999999999


Q ss_pred             hhhcC
Q psy12912        147 MNRLS  151 (185)
Q Consensus       147 ln~~~  151 (185)
                      |+...
T Consensus       158 l~~~~  162 (165)
T PF02725_consen  158 LMSTE  162 (165)
T ss_pred             Hhhcc
Confidence            98765


No 22 
>COG4496 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=27.16  E-value=1.2e+02  Score=23.45  Aligned_cols=41  Identities=27%  Similarity=0.397  Sum_probs=31.5

Q ss_pred             HHhhhhHHHHHHHHHHHHHHHHh--hhccCCChHHHHHHHHHH
Q psy12912         16 REKFDNQADLYAVINTLQNLEKA--YIRDCVTPKEYTAACSKL   56 (185)
Q Consensus        16 Re~ye~lAelysII~tle~LEkA--yirD~It~~eYt~~c~kL   56 (185)
                      |-+-..|.+||--|.+|+-+|..  |.+|..|.+|-.+...||
T Consensus         5 klr~~~Ld~l~dailtL~n~eecy~FfdDlcTinEiqslaqRl   47 (100)
T COG4496           5 KLRGAALDELFDAILTLENLEECYAFFDDLCTINEIQSLAQRL   47 (100)
T ss_pred             chhhHHHHHHHHHHHHhccHHHHHHHHHhhcCHHHHHHHHHHH
Confidence            33456788999999999999975  678888888866655443


No 23 
>PF05816 TelA:  Toxic anion resistance protein (TelA);  InterPro: IPR008863 This family consists of several prokaryotic TelA like proteins. TelA and KlA are associated with tellurite resistance [] and plasmid fertility inhibition [].
Probab=26.73  E-value=1.8e+02  Score=26.10  Aligned_cols=53  Identities=25%  Similarity=0.345  Sum_probs=33.6

Q ss_pred             hHHHhhhhhhH-HHHhhhccccchhhhhcchHHHHHHHhhhcCCCCCCC--cchhhHHHH
Q psy12912        110 KCIADIVSLFI-TTMDKLRLDIKAMDELQPDLRDLMDTMNRLSILPSGF--EGKEKVSGW  166 (185)
Q Consensus       110 ~~Iae~t~~FI-T~MDaLKL~~~a~DqLhPlL~dL~~sln~~~~~p~df--~gr~kl~~W  166 (185)
                      +...+-...|- .+++.++-.  ..+++--.|.+|+.-++.+.  |.+|  .++.++-+|
T Consensus        22 ~~~~~~~a~~s~~iL~~v~~~--d~~~vg~~L~~L~~~~~~~d--p~~~~~~~~~~~l~k   77 (333)
T PF05816_consen   22 AEAQEKIAQFSDRILDRVRNK--DSGEVGELLNELRKEMDELD--PSELKDEKKKGFLGK   77 (333)
T ss_pred             HHHHHHHHHHHHHHHHHHHHh--ccchHhHHHHHHHHHHHhCC--hhhhhhhhhhhHHHH
Confidence            33444444554 556664433  77788889999999999997  5565  344344444


No 24 
>PF09756 DDRGK:  DDRGK domain;  InterPro: IPR019153  This is a family of proteins of approximately 300 residues. They contain a highly conserved DDRGK motif. The function is unknown. ; PDB: 1WI9_A.
Probab=25.49  E-value=1e+02  Score=26.12  Aligned_cols=47  Identities=23%  Similarity=0.421  Sum_probs=32.5

Q ss_pred             HHHHHHHHHHhhcCCCCCCHHHHHHHhcCCChHHHhhhh----hcCCCceecCC
Q psy12912         56 LLVQYKAAFKQVQGEEYPTVEHFVKKYRLDCPSALERIR----EDRPITIKDDK  105 (185)
Q Consensus        56 Ll~Qyk~~~~~v~~~~~~~l~~F~~~y~l~cp~A~~Rl~----~G~PaTie~~~  105 (185)
                      ||..|-.+++.-   .+..|+.....|+|.-+-+++||+    .|.=..|-++.
T Consensus       100 lL~~Fi~yIK~~---Kvv~ledla~~f~l~t~~~i~ri~~L~~~g~ltGv~Ddr  150 (188)
T PF09756_consen  100 LLQEFINYIKEH---KVVNLEDLAAEFGLRTQDVINRIQELEAEGRLTGVIDDR  150 (188)
T ss_dssp             HHHHHHHHHHH----SEE-HHHHHHHH-S-HHHHHHHHHHHHHHSSS-EEE-TT
T ss_pred             HHHHHHHHHHHc---ceeeHHHHHHHcCCCHHHHHHHHHHHHHCCCceeeEcCC
Confidence            899998777653   367899999999999999999995    36555555543


No 25 
>PLN02618 tryptophan synthase, beta chain
Probab=24.60  E-value=94  Score=29.13  Aligned_cols=34  Identities=24%  Similarity=0.408  Sum_probs=29.5

Q ss_pred             HHHHHHHHHHHhhhccCCChHHHHHHHHHHHHHHH
Q psy12912         27 AVINTLQNLEKAYIRDCVTPKEYTAACSKLLVQYK   61 (185)
Q Consensus        27 sII~tle~LEkAyirD~It~~eYt~~c~kLl~Qyk   61 (185)
                      .++.+|+.||++|.+ ...+.+|..+-..++.+|-
T Consensus        30 ~~~~~~~~~~~~~~~-~~~~~~f~~~~~~~l~~~v   63 (410)
T PLN02618         30 TLMTALSELEAAFNA-LATDPEFQEELAGILKDYV   63 (410)
T ss_pred             HHHHHHHHHHHHHHH-HhcChhhHHHHHHHHHHhc
Confidence            468899999999976 7788889999999999884


No 26 
>COG1725 Predicted transcriptional regulators [Transcription]
Probab=24.57  E-value=3.3e+02  Score=21.66  Aligned_cols=91  Identities=18%  Similarity=0.343  Sum_probs=56.3

Q ss_pred             HHHHHHHHHHHHHHHHH-hh-cCCCCCCHHHHHHHhcCC---ChHHHhhh-hhcCCCceecCC------CC------chh
Q psy12912         49 YTAACSKLLVQYKAAFK-QV-QGEEYPTVEHFVKKYRLD---CPSALERI-REDRPITIKDDK------GN------TSK  110 (185)
Q Consensus        49 Yt~~c~kLl~Qyk~~~~-~v-~~~~~~~l~~F~~~y~l~---cp~A~~Rl-~~G~PaTie~~~------~~------~~~  110 (185)
                      |+.-+++...+-   .+ .+ .|+..||+-++....+++   ..+|.+=| +.|+=-|....+      +.      ...
T Consensus        13 Y~QI~~qIk~~I---~~g~l~pGdkLPSvRelA~~~~VNpnTv~raY~eLE~eG~i~t~rg~G~fV~~~~~~~~~~~~~~   89 (125)
T COG1725          13 YEQIANQIKEQI---ASGELKPGDKLPSVRELAKDLGVNPNTVQRAYQELEREGIVETKRGKGTFVTEDAKEILDQLKRE   89 (125)
T ss_pred             HHHHHHHHHHHH---HhCCcCCCCCCCcHHHHHHHhCCCHHHHHHHHHHHHHCCCEEEecCeeEEEcCCchhhHHHHHHH
Confidence            666554444332   12 23 478899999999999986   34666666 357765544211      11      234


Q ss_pred             HHHhhhhhhHHHHhhhccccchhhhhcchHHHHHH
Q psy12912        111 CIADIVSLFITTMDKLRLDIKAMDELQPDLRDLMD  145 (185)
Q Consensus       111 ~Iae~t~~FIT~MDaLKL~~~a~DqLhPlL~dL~~  145 (185)
                      .+.+.+.+||.-|=++-+   +.++++.++.+...
T Consensus        90 ~~~~~l~~~I~~~~~~G~---s~eei~~~~~~~~~  121 (125)
T COG1725          90 LAEEELEEFIEEAKALGL---SLEEILELLKEIYE  121 (125)
T ss_pred             HHHHHHHHHHHHHHHcCC---CHHHHHHHHHHHHh
Confidence            577778888865444443   57888888877654


No 27 
>cd07977 TFIIE_beta_winged_helix TFIIE_beta_winged_helix domain, located at the central core region of TFIIE beta, with double-stranded DNA binding activity. Transcription Factor IIE (TFIIE) beta winged-helix (or forkhead) domain is located at the central core region of TFIIE beta. The winged-helix is a form of helix-turn-helix (HTH) domain which typically binds DNA with the 3rd helix. The winged-helix domain is distinguished by the presence of a C-terminal beta-strand hairpin unit (the wing) that packs against the cleft of the tri-helical core. Although most winged-helix domains are multi-member families, TFIIE beta winged-helix domain is typically found as a single orthologous group. TFIIE is one of the six eukaryotic general transcription factors (TFIIA, TFIIB, TFIID, TFIIE, TFIIF and TFIIH) that are required for transcription initiation of protein-coding genes. TFIIE is a heterotetramer consisting of two copies each of alpha and beta subunits. TFIIE beta contains several functional 
Probab=23.54  E-value=87  Score=22.49  Aligned_cols=36  Identities=22%  Similarity=0.427  Sum_probs=23.9

Q ss_pred             cchhhhhcch-HHHHHHHhhhcCCCCCCCcchhhHHHHHHHHhcC
Q psy12912        130 IKAMDELQPD-LRDLMDTMNRLSILPSGFEGKEKVSGWLTTLESM  173 (185)
Q Consensus       130 ~~a~DqLhPl-L~dL~~sln~~~~~p~df~gr~kl~~Wl~kLn~M  173 (185)
                      |+.++| ||+ +.|++.-++-+.       ...++++||......
T Consensus        18 mK~r~~-~Plt~~EIl~~ls~~d-------~~~~~~~~L~~~~~~   54 (75)
T cd07977          18 MKKRHQ-HPLTLDEILDYLSLLD-------IGPKLKEWLKSEALV   54 (75)
T ss_pred             HHhcCC-CCccHHHHHHHHhccC-------ccHHHHHHHHhhhhc
Confidence            457777 886 566766666533       237788999876654


No 28 
>PF11112 PyocinActivator:  Pyocin activator protein PrtN
Probab=22.82  E-value=1.2e+02  Score=21.95  Aligned_cols=38  Identities=24%  Similarity=0.406  Sum_probs=27.4

Q ss_pred             HHHHHHHHHHHhhcCCCCCCHHHHHHHh--cCCChHHHhhhhhc-CCC
Q psy12912         55 KLLVQYKAAFKQVQGEEYPTVEHFVKKY--RLDCPSALERIRED-RPI   99 (185)
Q Consensus        55 kLl~Qyk~~~~~v~~~~~~~l~~F~~~y--~l~cp~A~~Rl~~G-~Pa   99 (185)
                      -|+.||-       |+....+++.+++|  +|.-..|..+++.| +|.
T Consensus         4 lLma~~~-------~~~~IpL~~v~~~yf~~lt~~~a~rk~~~g~lpl   44 (76)
T PF11112_consen    4 LLMAQYF-------GDPVIPLEEVCEDYFPHLTPKTAKRKANAGELPL   44 (76)
T ss_pred             HHHHHHc-------CCCCCcHHHHHHHHHccCCHHHHHHHHHCCCCCC
Confidence            4566662       45567799999999  99777888888764 444


No 29 
>PRK04346 tryptophan synthase subunit beta; Validated
Probab=22.72  E-value=1.1e+02  Score=28.52  Aligned_cols=33  Identities=30%  Similarity=0.516  Sum_probs=29.0

Q ss_pred             HHHHHHHHHHHhhhccCCChHHHHHHHHHHHHHH
Q psy12912         27 AVINTLQNLEKAYIRDCVTPKEYTAACSKLLVQY   60 (185)
Q Consensus        27 sII~tle~LEkAyirD~It~~eYt~~c~kLl~Qy   60 (185)
                      .++.+|+.||++|.+ ...+.+|..+-..|+..|
T Consensus        22 ~~~~~~~~~~~~~~~-~~~~~~f~~~~~~~~~~~   54 (397)
T PRK04346         22 TLMPALEELEEAYEK-AKNDPEFQAELDYLLKNY   54 (397)
T ss_pred             HHHHHHHHHHHHHHH-HhcCHHHHHHHHHHHHHh
Confidence            467899999999977 777888999999999998


No 30 
>PF06757 Ins_allergen_rp:  Insect allergen related repeat, nitrile-specifier detoxification;  InterPro: IPR010629 This entry represents several insect specific allergen repeats. These repeats are commonly found in various proteins from cockroaches, fruit flies and mosquitos. It has been suggested that the repeat sequences have evolved by duplication of an ancestral amino acid domain, which may have arisen from the mitochondrial energy transfer proteins [].  This family exemplifies a case of novel gene evolution. The case in point is the arms-race between plants and their infective insective herbivores in the area of the glucosinolate-myrosinase system. Brassicas have developed the glucosinolate-myrosinase system as chemical defence mechanism against the insects, and consequently the insects have adapted to produce a detoxifying molecule, nitrile-specifier protein (NSP). NSP is present in the Pieris rapae (Cabbage white butterfly). NSP is structurally different from and has no amino acid homology to any known detoxifying enzymes, and it appears to have arisen by a process of domain and gene duplication of a sequence of unknown function that is widespread in insect species and referred to as insect-allergen-repeat protein. Thus this family is found either as a single domain or as a multiple repeat-domain []. 
Probab=22.48  E-value=4.2e+02  Score=21.45  Aligned_cols=85  Identities=18%  Similarity=0.212  Sum_probs=56.5

Q ss_pred             HHhhhhHHHHHHHHHHHHHHHHhhhccCCChHHHHHHHHHHHHH-HHHHHHhhc-CCCCCCHHHHHHHhcCCChHHHhhh
Q psy12912         16 REKFDNQADLYAVINTLQNLEKAYIRDCVTPKEYTAACSKLLVQ-YKAAFKQVQ-GEEYPTVEHFVKKYRLDCPSALERI   93 (185)
Q Consensus        16 Re~ye~lAelysII~tle~LEkAyirD~It~~eYt~~c~kLl~Q-yk~~~~~v~-~~~~~~l~~F~~~y~l~cp~A~~Rl   93 (185)
                      |...+.+.++..+|..-+ +..-..+=...+.|+-.+..=|-+. |+.++..+. -.++..+-.|.+..+++....++++
T Consensus         2 ~~L~~d~~dfl~lIp~~~-i~~i~~~Y~~~D~efq~~~~yl~s~~f~~l~~~l~~~pE~~~l~~yL~~~gldv~~~i~~i   80 (179)
T PF06757_consen    2 RSLQEDFQDFLDLIPMEE-IQDIVQRYYLEDAEFQAAVRYLNSSEFKQLWQQLEALPEVKALLDYLESAGLDVYYYINQI   80 (179)
T ss_pred             hhHHHHHHHHHHhcCHHH-HHHHHHHHHHcCHHHHHHHHHHcChHHHHHHHHHHcCHHHHHHHHHHHHCCCCHHHHHHHH
Confidence            456778888888887554 4433333335566665555444332 555555442 2356679999999999999999999


Q ss_pred             h--hcCCCce
Q psy12912         94 R--EDRPITI  101 (185)
Q Consensus        94 ~--~G~PaTi  101 (185)
                      .  .|+|.+.
T Consensus        81 ~~~l~~~~~~   90 (179)
T PF06757_consen   81 NDLLGLPPLN   90 (179)
T ss_pred             HHHHcCCcCC
Confidence            5  4988664


No 31 
>PF12668 DUF3791:  Protein of unknown function (DUF3791);  InterPro: IPR024269 This entry represents proteins of unknown function.
Probab=20.91  E-value=80  Score=21.48  Aligned_cols=24  Identities=21%  Similarity=0.318  Sum_probs=20.2

Q ss_pred             HHHHHHHhcCCChHHHhhhh-hcCC
Q psy12912         75 VEHFVKKYRLDCPSALERIR-EDRP   98 (185)
Q Consensus        75 l~~F~~~y~l~cp~A~~Rl~-~G~P   98 (185)
                      |+.|++++++.-+.|.+++. .|+-
T Consensus         8 Ie~~A~~~~~s~~ea~~~~~~~~~~   32 (62)
T PF12668_consen    8 IEEFAKKLNISGEEAYNYFKRSGVI   32 (62)
T ss_pred             HHHHHHHHCcCHHHHHHHHHHcCcH
Confidence            78999999999999999995 3543


No 32 
>PRK06548 ribonuclease H; Provisional
Probab=20.80  E-value=83  Score=25.72  Aligned_cols=61  Identities=18%  Similarity=0.177  Sum_probs=35.5

Q ss_pred             hhhHHHHHHHHHHHHHHHHhhh-ccCCChHHHHHHHH-HHHHHHHHH-HHhhcCCCCCCHHHHH
Q psy12912         19 FDNQADLYAVINTLQNLEKAYI-RDCVTPKEYTAACS-KLLVQYKAA-FKQVQGEEYPTVEHFV   79 (185)
Q Consensus        19 ye~lAelysII~tle~LEkAyi-rD~It~~eYt~~c~-kLl~Qyk~~-~~~v~~~~~~~l~~F~   79 (185)
                      -.|-|||.++|.+|+.+..... =-..|+++|.--|- +=+..+|.- ++.-.|..+.+.|-+.
T Consensus        41 TNnraEl~Aii~aL~~~~~~~~~v~I~TDS~yvi~~i~~W~~~Wk~~gWk~s~G~pV~N~dL~~  104 (161)
T PRK06548         41 TNNIAELTAVRELLIATRHTDRPILILSDSKYVINSLTKWVYSWKMRKWRKADGKPVLNQEIIQ  104 (161)
T ss_pred             CHHHHHHHHHHHHHHhhhcCCceEEEEeChHHHHHHHHHHHHHHHHCCCcccCCCccccHHHHH
Confidence            4567999999999987754311 13478899976653 345555421 1122234455555553


No 33 
>PF04651 Pox_A12:  Poxvirus A12 protein;  InterPro: IPR006744   This family contains vaccinia virus protein A12 and its homologues. VVA12 is a virion protein though its function is unknown.
Probab=20.67  E-value=1e+02  Score=26.29  Aligned_cols=27  Identities=19%  Similarity=0.399  Sum_probs=23.3

Q ss_pred             CChHHHHHHHHHHHHHHHHHHHhhcCC
Q psy12912         44 VTPKEYTAACSKLLVQYKAAFKQVQGE   70 (185)
Q Consensus        44 It~~eYt~~c~kLl~Qyk~~~~~v~~~   70 (185)
                      =+.++|.++.+||--|.|+++.++.++
T Consensus         9 SSYDDYIeTinKitPQLrtlLahI~ge   35 (189)
T PF04651_consen    9 SSYDDYIETINKITPQLRTLLAHISGE   35 (189)
T ss_pred             ccHHHHHHHHHHhCHHHHHHHHHhccc
Confidence            356899999999999999999998543


No 34 
>cd01019 ZnuA Zinc binding protein ZnuA. These proteins have been shown to function as initial receptors in the ABC uptake of Zn2+.  They belong to the TroA superfamily of periplasmic metal binding proteins that share a distinct fold and ligand binding mechanism.  They are comprised of two globular subdomains connected by a single helix and bind their specific ligands in the cleft between these domains.  A typical TroA protein is comprised of two globular subdomains connected by a single helix and can bind the metal ion in the cleft between these domains. In addition, these proteins sometimes have a low complexity region containing a metal-binding histidine-rich motif (repetitive HDH sequence).
Probab=20.16  E-value=2.7e+02  Score=24.17  Aligned_cols=39  Identities=21%  Similarity=0.315  Sum_probs=22.5

Q ss_pred             HHHHHHHHHHHHHHHHHhhcCCCC----CCHHHHHHHhcCCCh
Q psy12912         49 YTAACSKLLVQYKAAFKQVQGEEY----PTVEHFVKKYRLDCP   87 (185)
Q Consensus        49 Yt~~c~kLl~Qyk~~~~~v~~~~~----~~l~~F~~~y~l~cp   87 (185)
                      |.+....|..+|+..++.+++..+    +.+.-|+++|++..-
T Consensus       159 ~~~~L~~l~~~~~~~~~~~~~~~~v~~H~af~Yl~~~~gl~~~  201 (286)
T cd01019         159 FNARLAELDATIKERLAPVKTKPFFVFHDAYGYFEKRYGLTQA  201 (286)
T ss_pred             HHHHHHHHHHHHHHHhhccCCCeEEEecccHHHHHHHcCCcee
Confidence            444444444445555544432222    468999999999743


Done!