Query         psy8079
Match_columns 135
No_of_seqs    103 out of 1042
Neff          5.7 
Searched_HMMs 46136
Date          Fri Aug 16 17:52:08 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy8079.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/8079hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF00379 Chitin_bind_4:  Insect  99.7 5.5E-17 1.2E-21  102.3   7.7   52   35-91      1-52  (52)
  2 PF06004 DUF903:  Bacterial pro  83.7     2.1 4.6E-05   26.8   3.6   39   37-83      2-40  (50)
  3 PRK15401 alpha-ketoglutarate-d  59.5      24 0.00052   28.2   5.1   49   68-116    46-105 (213)
  4 PHA02698 hypothetical protein;  56.7     7.2 0.00016   26.9   1.5   21   89-109    19-39  (89)
  5 PF10902 DUF2693:  Protein of u  53.8      51  0.0011   22.7   5.3   57   33-93      1-63  (83)
  6 COG0450 AhpC Peroxiredoxin [Po  48.5      53  0.0012   26.1   5.4   61   43-118   106-166 (194)
  7 KOG1691|consensus               40.5 1.5E+02  0.0033   23.9   6.9   32   64-95     79-110 (210)
  8 COG4786 FlgG Flagellar basal b  39.8      50  0.0011   27.4   4.2   56   47-102    80-138 (265)
  9 PRK05461 apaG CO2+/MG2+ efflux  36.1      85  0.0018   23.0   4.5   21   62-82     93-113 (127)
 10 PF09244 DUF1964:  Domain of un  34.9      42  0.0009   22.2   2.4   21   68-88      4-24  (68)
 11 TIGR01643 YD_repeat_2x YD repe  34.1      71  0.0015   17.9   3.2   21   71-93      9-29  (42)
 12 cd07240 ED_TypeI_classII_N N-t  32.8      69  0.0015   20.9   3.4   20   69-88     98-117 (117)
 13 PF12118 SprA-related:  SprA-re  31.2      75  0.0016   26.7   4.0   17   69-88    175-192 (294)
 14 cd07252 BphC1-RGP6_N_like N-te  30.7      70  0.0015   21.6   3.2   21   68-88    100-120 (120)
 15 PF05423 Mycobact_memb:  Mycoba  30.5   1E+02  0.0022   23.0   4.2   24   66-90     51-74  (140)
 16 PHA02978 hypothetical protein;  29.7      46 0.00099   24.5   2.2   15   69-83     76-90  (135)
 17 PF05899 Cupin_3:  Protein of u  28.8 1.5E+02  0.0032   19.2   4.4   26   62-89     28-53  (74)
 18 PF13511 DUF4124:  Domain of un  27.3      54  0.0012   20.1   2.0   18   67-87     13-30  (60)
 19 PRK14818 NADH dehydrogenase su  25.5      56  0.0012   25.5   2.1   21   97-117   133-153 (173)
 20 PF10929 DUF2811:  Protein of u  25.3      73  0.0016   20.6   2.3   20  103-122     5-25  (57)
 21 PF03562 MltA:  MltA specific i  24.4      53  0.0011   25.2   1.8   54   64-121    82-143 (158)
 22 COG3422 Uncharacterized conser  23.0 1.1E+02  0.0023   19.9   2.7   20   27-46      7-26  (59)
 23 TIGR03795 chp_BMA0021 conserve  22.5 1.4E+02  0.0031   21.7   3.7   23   98-120    89-112 (114)
 24 PF00659 POLO_box:  POLO box du  22.2 2.1E+02  0.0046   17.9   5.5   13   34-46      4-16  (68)
 25 cd08344 MhqB_like_N N-terminal  22.1 1.1E+02  0.0024   20.3   2.9   17   69-85     93-109 (112)
 26 PF09242 FCSD-flav_bind:  Flavo  21.9      92   0.002   20.4   2.4   29   66-94      7-36  (71)
 27 PF10681 Rot1:  Chaperone for p  20.1 3.8E+02  0.0082   21.7   5.9   60   28-94     28-93  (212)

No 1  
>PF00379 Chitin_bind_4:  Insect cuticle protein;  InterPro: IPR000618 Insect cuticle is composed of proteins and chitin. The cuticular proteins seem to be specific to the type of cuticle (flexible or stiff) that occur at stages of the insect development. The proteins found in the flexible cuticle of larva and pupa of different insects share a conserved C-terminal section [] such a region is also found in the soft endocuticle of adults insects [] as well as in other cuticular proteins including in arachnids []. In addition, cuticular proteins share hydrophobic regions dominated by tetrapeptide repeats (A-A-P-A/V), which are presumed to be functionally important [, ]. Many insect cuticle proteins also include a 35-36 amino acid motif known as the R and R consensus. An extended form of this motif has been shown [] to bind chitin. It has no sequence similiarity to the cysteine-containing chitin-binding domain of chitinases and some peritrophic membrane proteins, suggesting that arthropods have two distinct classes of chitin-binding proteins, those with the chitin-binding domains found in lectins, chitinases and peritrophic membranes (cysCBD), and those with the type of chitin-binding domains found in cuticular proteins (non-cysCBD) []. The cuticle protein signature has been found in locust cuticle proteins 7 (LM-7), 8 (LM-8), 19 (LM-19) and endocuticle structural glycoprotein ABD-4; Hyalophora cecropia (Cecropia moth) cuticle proteins 12 and 66; Drosophila melanogaster (Fruit fly) larval cuticles proteins I, II, III and IV (LCP1 to LCP4); drosophila pupal cuticle proteins PCP, EDG-78E and EDG-84E; Manduca sexta (Tobacco hawkmoth) cuticle protein LCP-14; Tenebrio molitor (Yellow mealworm) cuticle proteins ACP-20, A1A, A2B and A3A; and Araneus diadematus (Spider) cuticle proteins ACP 11.9, ACP 12.4, ACP 12.6, ACP 15.5 and ACP 15.7.; GO: 0042302 structural constituent of cuticle
Probab=99.71  E-value=5.5e-17  Score=102.26  Aligned_cols=52  Identities=40%  Similarity=0.788  Sum_probs=46.4

Q ss_pred             eeEEEecCCCceEEEeeeeccCCCCCcccceEEEEEEEECCCCCEEEEEEEeCCCCc
Q psy8079          35 YQYSFQTSNGIAAQESGFLKNAGNPEAEVQVAQGAYTYTGADGNPYSLQYTADEEGF   91 (135)
Q Consensus        35 Y~f~Ye~~dG~~~~E~G~~~~~G~~~~~~~~v~GsYsy~~pdG~~~~V~Y~ADenGf   91 (135)
                      |.|+|+++|| .++|++.   .++ ++..++|+|+|+|++|||+.++|+|+||++||
T Consensus         1 Y~f~Y~~~dg-~~~e~~~---~~~-~~~~~~v~GsY~y~~pdG~~~~V~Y~Ad~~Gf   52 (52)
T PF00379_consen    1 YSFGYETSDG-SRQEEGR---PET-EDEGGVVRGSYSYIDPDGQTRTVTYVADENGF   52 (52)
T ss_pred             CeeEeEcCCC-CEEEEEE---ccc-CCCCCEEEEEEEEECCCCCEEEEEEECCCCCC
Confidence            7899999999 7888887   222 56779999999999999999999999999998


No 2  
>PF06004 DUF903:  Bacterial protein of unknown function (DUF903);  InterPro: IPR010305 This family consists of several small bacterial proteins several of which are classified as putative lipoproteins. The function of this family is unknown.; PDB: 2RB6_B 3BDU_F 2RA2_F 2RD1_C 3FIF_H 2JN0_A 2K57_A.
Probab=83.68  E-value=2.1  Score=26.84  Aligned_cols=39  Identities=31%  Similarity=0.500  Sum_probs=28.4

Q ss_pred             EEEecCCCceEEEeeeeccCCCCCcccceEEEEEEEECCCCCEEEEE
Q psy8079          37 YSFQTSNGIAAQESGFLKNAGNPEAEVQVAQGAYTYTGADGNPYSLQ   83 (135)
Q Consensus        37 f~Ye~~dG~~~~E~G~~~~~G~~~~~~~~v~GsYsy~~pdG~~~~V~   83 (135)
                      |--.+.||....-.|..+..        .-.|.|+|.+.+|+...|+
T Consensus         2 yvi~t~dG~~i~t~gkP~~D--------~~tG~~~y~D~~G~~~qIn   40 (50)
T PF06004_consen    2 YVITTNDGRTIVTDGKPKYD--------KDTGMYSYTDADGKEQQIN   40 (50)
T ss_dssp             EEEEETTSEEEEESSEEEEE--------TTTTEEEEEBTTS-EEEEE
T ss_pred             cEEEeCCCcEEEeCCCcccc--------CCCCcEEEECCCCCEEEEc
Confidence            34567788877777776632        2579999999999998875


No 3  
>PRK15401 alpha-ketoglutarate-dependent dioxygenase AlkB; Provisional
Probab=59.54  E-value=24  Score=28.22  Aligned_cols=49  Identities=20%  Similarity=0.306  Sum_probs=34.0

Q ss_pred             EEEEEECCCCCEEEE--------EEEeCCCCccccCCC--CCCC-CCChHHHHHHHHHHH
Q psy8079          68 GAYTYTGADGNPYSL--------QYTADEEGFKPSANY--LPTP-PPIPAEIVKALEFLA  116 (135)
Q Consensus        68 GsYsy~~pdG~~~~V--------~Y~ADenGf~~~g~~--lP~~-pp~p~~i~~al~~~~  116 (135)
                      ..+....|.|...+|        -|++|+.||+=.+.+  -+.| ||+|+.+.+....++
T Consensus        46 p~~~~~~~gg~~msv~mt~~G~~~W~~d~~~YrYs~~~~~~~~pwp~~P~~l~~L~~~~~  105 (213)
T PRK15401         46 PFRHMVTPGGYTMSVAMTNCGALGWVTDRRGYRYSPIDPLTGKPWPAMPASFLALAQRAA  105 (213)
T ss_pred             CccceecCCCCcceeEEeccccceEecCCCCcccCCcCCCCCCCCCCchHHHHHHHHHHH
Confidence            345667788876666        799999999998754  3444 678877666555443


No 4  
>PHA02698 hypothetical protein; Provisional
Probab=56.74  E-value=7.2  Score=26.87  Aligned_cols=21  Identities=29%  Similarity=0.725  Sum_probs=16.9

Q ss_pred             CCccccCCCCCCCCCChHHHH
Q psy8079          89 EGFKPSANYLPTPPPIPAEIV  109 (135)
Q Consensus        89 nGf~~~g~~lP~~pp~p~~i~  109 (135)
                      .|-|+.|+..|+|-|+|+.|-
T Consensus        19 ddvqvnGDlmPtp~p~peeV~   39 (89)
T PHA02698         19 DDVQVNGDLMPTPEPTPEEVP   39 (89)
T ss_pred             hccccCCcccCCCCCChhhhc
Confidence            367888999999988887653


No 5  
>PF10902 DUF2693:  Protein of unknown function (DUF2693);  InterPro: IPR024401 This family of proteins is found in bacteria and bacteriophages. Its function is unknown. 
Probab=53.78  E-value=51  Score=22.71  Aligned_cols=57  Identities=11%  Similarity=0.219  Sum_probs=36.2

Q ss_pred             CCeeEEEecCCCceEEEeeeeccCCCCCcccceEEEEEEEECCCC------CEEEEEEEeCCCCccc
Q psy8079          33 GSYQYSFQTSNGIAAQESGFLKNAGNPEAEVQVAQGAYTYTGADG------NPYSLQYTADEEGFKP   93 (135)
Q Consensus        33 G~Y~f~Ye~~dG~~~~E~G~~~~~G~~~~~~~~v~GsYsy~~pdG------~~~~V~Y~ADenGf~~   93 (135)
                      |.-.|-|+-.||+.|.-.|+++..--    .....|.+.+..+..      ....+-|--+.+|||-
T Consensus         1 G~h~~~feK~DGtiR~a~gT~d~~li----~~~~~~~~~~~~~~~~~rkes~~s~~yfDve~~~WRS   63 (83)
T PF10902_consen    1 GEHEFVFEKVDGTIRVAKGTRDPDLI----PYEEKGKEMFPPPTKPERKESTTSVRYFDVEKKGWRS   63 (83)
T ss_pred             CcEEEEEEecCCCEEEEEEecCcccc----cceecCcccccCCCCccccCCcceEEEEEeccCceee
Confidence            45678999999999999999874211    124555555543332      1334456666778774


No 6  
>COG0450 AhpC Peroxiredoxin [Posttranslational modification, protein turnover, chaperones]
Probab=48.49  E-value=53  Score=26.11  Aligned_cols=61  Identities=21%  Similarity=0.275  Sum_probs=40.7

Q ss_pred             CCceEEEeeeeccCCCCCcccceEEEEEEEECCCCCEEEEEEEeCCCCccccCCCCCCCCCChHHHHHHHHHHHcC
Q psy8079          43 NGIAAQESGFLKNAGNPEAEVQVAQGAYTYTGADGNPYSLQYTADEEGFKPSANYLPTPPPIPAEIVKALEFLASQ  118 (135)
Q Consensus        43 dG~~~~E~G~~~~~G~~~~~~~~v~GsYsy~~pdG~~~~V~Y~ADenGf~~~g~~lP~~pp~p~~i~~al~~~~~~  118 (135)
                      ++.....-|.+.     ++.++.++|.|= |||||..+.+.+.....|=.+.         -.-.+++||++.++|
T Consensus       106 ~~~vs~~ygvl~-----~~~g~a~R~~FI-IDp~g~ir~~~v~~~~iGRn~d---------EilR~idAlq~~~~h  166 (194)
T COG0450         106 KGEIARAYGVLH-----PEEGLALRGTFI-IDPDGVIRHILVNPLTIGRNVD---------EILRVIDALQFVAKH  166 (194)
T ss_pred             chhHHHHcCCcc-----cCCCcceeEEEE-ECCCCeEEEEEEecCCCCcCHH---------HHHHHHHHHHHHHHh
Confidence            344444455544     345578999974 8999999998888776553332         012467788898887


No 7  
>KOG1691|consensus
Probab=40.54  E-value=1.5e+02  Score=23.86  Aligned_cols=32  Identities=31%  Similarity=0.649  Sum_probs=27.6

Q ss_pred             ceEEEEEEEECCCCCEEEEEEEeCCCCccccC
Q psy8079          64 QVAQGAYTYTGADGNPYSLQYTADEEGFKPSA   95 (135)
Q Consensus        64 ~~v~GsYsy~~pdG~~~~V~Y~ADenGf~~~g   95 (135)
                      .+.+|+|+|+..+-..+.+.++||.-|.+|..
T Consensus        79 nv~~gqFaFta~e~~~y~~Cf~~~~~~~~p~~  110 (210)
T KOG1691|consen   79 NVTKGQFAFTAEESGMYEACFTADVPGHKPET  110 (210)
T ss_pred             ccccceEEEEeccCCcEEEEEecccCCCCCCc
Confidence            34599999999999999999999988888744


No 8  
>COG4786 FlgG Flagellar basal body rod protein [Cell motility and secretion]
Probab=39.83  E-value=50  Score=27.43  Aligned_cols=56  Identities=27%  Similarity=0.383  Sum_probs=38.1

Q ss_pred             EEEeeeeccCCCCCcccceEEEEEEEECCCC-CEEEE--EEEeCCCCccccCCCCCCCC
Q psy8079          47 AQESGFLKNAGNPEAEVQVAQGAYTYTGADG-NPYSL--QYTADEEGFKPSANYLPTPP  102 (135)
Q Consensus        47 ~~E~G~~~~~G~~~~~~~~v~GsYsy~~pdG-~~~~V--~Y~ADenGf~~~g~~lP~~p  102 (135)
                      .-+.|.++..+.+-|-.+.-.|+|....||| ..||=  .++-|++||-+..+-.|+-+
T Consensus        80 ~~tqGsl~~T~n~lD~AI~g~gfF~I~~~dG~~~YTR~G~F~~d~~G~LVT~~G~~vl~  138 (265)
T COG4786          80 VFTQGSLQKTDNPLDLAITGDGFFQIQTPDGTIAYTRDGSFTVDEEGQLVTSNGYPVLD  138 (265)
T ss_pred             eccccCceecCCccceEEcCCceEEEEcCCCCEEEeeCCceeECCCCCEEeCCCCCccC
Confidence            4456777766664455566678899999999 55552  56668888888776555544


No 9  
>PRK05461 apaG CO2+/MG2+ efflux protein ApaG; Reviewed
Probab=36.05  E-value=85  Score=23.05  Aligned_cols=21  Identities=24%  Similarity=0.499  Sum_probs=16.9

Q ss_pred             ccceEEEEEEEECCCCCEEEE
Q psy8079          62 EVQVAQGAYTYTGADGNPYSL   82 (135)
Q Consensus        62 ~~~~v~GsYsy~~pdG~~~~V   82 (135)
                      ....++|+|.+++++|....|
T Consensus        93 p~G~M~G~y~~~~~~G~~F~v  113 (127)
T PRK05461         93 PSGTMQGHYQMVDEDGERFEV  113 (127)
T ss_pred             CCEEEEEEEEEEeCCCCEEEE
Confidence            446899999999999986543


No 10 
>PF09244 DUF1964:  Domain of unknown function (DUF1964);  InterPro: IPR015325 This domain is C-terminal to the catalytic sucrose phosphorylase beta/alpha barrel domain. It adopts a beta-sandwich fold, with Greek-key topology and is functionally uncharacterised []. ; PDB: 1R7A_B 2GDU_A 2GDV_A.
Probab=34.87  E-value=42  Score=22.23  Aligned_cols=21  Identities=14%  Similarity=0.504  Sum_probs=14.7

Q ss_pred             EEEEEECCCCCEEEEEEEeCC
Q psy8079          68 GAYTYTGADGNPYSLQYTADE   88 (135)
Q Consensus        68 GsYsy~~pdG~~~~V~Y~ADe   88 (135)
                      |.|||--.++..++.+|++.+
T Consensus         4 G~FSy~~dgdtSitf~W~g~~   24 (68)
T PF09244_consen    4 GEFSYEADGDTSITFTWTGAT   24 (68)
T ss_dssp             SEEEEEEETTTEEEEEEE-SS
T ss_pred             ceeeEecCCCcEEEEEEeccc
Confidence            667777777777777887764


No 11 
>TIGR01643 YD_repeat_2x YD repeat (two copies). This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.
Probab=34.14  E-value=71  Score=17.88  Aligned_cols=21  Identities=33%  Similarity=0.379  Sum_probs=13.2

Q ss_pred             EEECCCCCEEEEEEEeCCCCccc
Q psy8079          71 TYTGADGNPYSLQYTADEEGFKP   93 (135)
Q Consensus        71 sy~~pdG~~~~V~Y~ADenGf~~   93 (135)
                      +.++|+|...+..|  |..|-..
T Consensus         9 ~~~~p~G~~~~~~Y--D~~Grl~   29 (42)
T TIGR01643         9 GSTDADGTTTRYTY--DAAGRLV   29 (42)
T ss_pred             EEECCCCCEEEEEE--CCCCCEE
Confidence            35678888766666  5555443


No 12 
>cd07240 ED_TypeI_classII_N N-terminal domain of type I, class II extradiol dioxygenases; non-catalytic domain. This family contains the N-terminal, non-catalytic, domain of type I, class II extradiol dioxygenases. Dioxygenases catalyze the incorporation of both atoms of molecular oxygen into substrates using a variety of reaction mechanisms, resulting in the cleavage of aromatic rings. Two major groups of dioxygenases have been identified according to the cleavage site; extradiol enzymes cleave the aromatic ring between a hydroxylated carbon and an adjacent non-hydroxylated carbon, whereas intradiol enzymes cleave the aromatic ring between two hydroxyl groups. Extradiol dioxygenases are classified into type I and type II enzymes. Type I extradiol dioxygenases include class I and class II enzymes. These two classes of enzymes show sequence similarity; the two-domain class II enzymes evolved from a class I enzyme through gene duplication. The extradiol dioxygenases represented in this fa
Probab=32.78  E-value=69  Score=20.92  Aligned_cols=20  Identities=25%  Similarity=0.413  Sum_probs=15.8

Q ss_pred             EEEEECCCCCEEEEEEEeCC
Q psy8079          69 AYTYTGADGNPYSLQYTADE   88 (135)
Q Consensus        69 sYsy~~pdG~~~~V~Y~ADe   88 (135)
                      ++.+.||||..+.|.+..|+
T Consensus        98 ~~~~~DP~G~~ie~~~~~~~  117 (117)
T cd07240          98 GLRFQDPDGHLLELFVEADK  117 (117)
T ss_pred             EEEEECCCCCEEEEEEccCC
Confidence            45688999999998876553


No 13 
>PF12118 SprA-related:  SprA-related family;  InterPro: IPR021973  Most members of this uncharacterised protein family are annotated as being SprA-related. 
Probab=31.21  E-value=75  Score=26.74  Aligned_cols=17  Identities=41%  Similarity=0.968  Sum_probs=13.1

Q ss_pred             EEEEE-CCCCCEEEEEEEeCC
Q psy8079          69 AYTYT-GADGNPYSLQYTADE   88 (135)
Q Consensus        69 sYsy~-~pdG~~~~V~Y~ADe   88 (135)
                      +|+|. +|||+.|-   |.+|
T Consensus       175 sy~y~~GpDg~~Ya---vgGe  192 (294)
T PF12118_consen  175 SYSYQRGPDGKRYA---VGGE  192 (294)
T ss_pred             ceeeeeCCCCceee---ccce
Confidence            67775 99999888   5556


No 14 
>cd07252 BphC1-RGP6_N_like N-terminal domain of 2,3-dihydroxybiphenyl 1,2-dioxygenase (BphC, EC 1.13.11.39) 1 from Rhodococcus globerulus P6 (BphC1-RGP6) and similar proteins. This subfamily contains the N-terminal, non-catalytic, domain of BphC1-RGP6 and similar proteins. BphC catalyzes the extradiol ring cleavage reaction of 2,3-dihydroxybiphenyl, the third step in the polychlorinated biphenyls (PCBs) degradation pathway (bph pathway). This subfamily of BphCs belongs to the type I extradiol dioxygenase family, which require a metal in the active site in its catalytic mechanism. Polychlorinated biphenyl degrading bacteria demonstrate a multiplicity of 2,3-dihydroxybiphenyl 1,2-dioxygenases. For example, three types of BphC enzymes have been found in Rhodococcus globerulus (BphC1-RGP6 - BphC3-RGP6), all three enzymes are type I extradiol dioxygenases. BphC1-RGP6 has an internal duplication, it is a two-domain dioxygenase which forms octamers, and has Fe(II) at the catalytic site. Its N-
Probab=30.70  E-value=70  Score=21.58  Aligned_cols=21  Identities=29%  Similarity=0.502  Sum_probs=16.6

Q ss_pred             EEEEEECCCCCEEEEEEEeCC
Q psy8079          68 GAYTYTGADGNPYSLQYTADE   88 (135)
Q Consensus        68 GsYsy~~pdG~~~~V~Y~ADe   88 (135)
                      .++.+.||||..+.+.|.-+|
T Consensus       100 ~~~~~~DPdG~~iE~~~~~~~  120 (120)
T cd07252         100 GLIRFADPDGNRHELFWGPKE  120 (120)
T ss_pred             EEEEEECCCCCEEEEEecccC
Confidence            367899999999998876543


No 15 
>PF05423 Mycobact_memb:  Mycobacterium membrane protein;  InterPro: IPR008693 This family contains several membrane proteins from Mycobacterium species [].
Probab=30.49  E-value=1e+02  Score=23.00  Aligned_cols=24  Identities=29%  Similarity=0.592  Sum_probs=18.5

Q ss_pred             EEEEEEEECCCCCEEEEEEEeCCCC
Q psy8079          66 AQGAYTYTGADGNPYSLQYTADEEG   90 (135)
Q Consensus        66 v~GsYsy~~pdG~~~~V~Y~ADenG   90 (135)
                      .+=.|+-.++.|...+|+|. |++|
T Consensus        51 k~V~YEV~G~~G~~~~I~Y~-D~~~   74 (140)
T PF05423_consen   51 KTVTYEVTGPPGSTATISYL-DADG   74 (140)
T ss_pred             cEEEEEEEcCCCCeEEEEEE-cCCC
Confidence            34468888899999999997 5544


No 16 
>PHA02978 hypothetical protein; Provisional
Probab=29.71  E-value=46  Score=24.47  Aligned_cols=15  Identities=20%  Similarity=0.525  Sum_probs=12.3

Q ss_pred             EEEEECCCCCEEEEE
Q psy8079          69 AYTYTGADGNPYSLQ   83 (135)
Q Consensus        69 sYsy~~pdG~~~~V~   83 (135)
                      .+||.+|||..++|+
T Consensus        76 y~sy~~~~gisiqvs   90 (135)
T PHA02978         76 YFSYADPDGISIQVS   90 (135)
T ss_pred             EEEecCCCceEEEEe
Confidence            567899999988874


No 17 
>PF05899 Cupin_3:  Protein of unknown function (DUF861);  InterPro: IPR008579 The function of the proteins in this entry are unknown. They contain the conserved barrel domain of the 'cupin' superfamily and members are specific to plants and bacteria.; PDB: 1RC6_A 3MYX_A 1O5U_A 2K9Z_A 1LKN_A 3ES4_A 1SFN_B 3BCW_A.
Probab=28.83  E-value=1.5e+02  Score=19.17  Aligned_cols=26  Identities=27%  Similarity=0.422  Sum_probs=20.4

Q ss_pred             ccceEEEEEEEECCCCCEEEEEEEeCCC
Q psy8079          62 EVQVAQGAYTYTGADGNPYSLQYTADEE   89 (135)
Q Consensus        62 ~~~~v~GsYsy~~pdG~~~~V~Y~ADen   89 (135)
                      -..++.|+-..++.+|..  +++.|++-
T Consensus        28 ~~~vleG~v~it~~~G~~--~~~~aGD~   53 (74)
T PF05899_consen   28 FFYVLEGEVTITDEDGET--VTFKAGDA   53 (74)
T ss_dssp             EEEEEEEEEEEEETTTEE--EEEETTEE
T ss_pred             EEEEEEeEEEEEECCCCE--EEEcCCcE
Confidence            347899999999999996  55677653


No 18 
>PF13511 DUF4124:  Domain of unknown function (DUF4124)
Probab=27.30  E-value=54  Score=20.06  Aligned_cols=18  Identities=17%  Similarity=0.486  Sum_probs=13.6

Q ss_pred             EEEEEEECCCCCEEEEEEEeC
Q psy8079          67 QGAYTYTGADGNPYSLQYTAD   87 (135)
Q Consensus        67 ~GsYsy~~pdG~~~~V~Y~AD   87 (135)
                      ..-|+|+|.+|+.   .|+-.
T Consensus        13 a~vYk~~D~~G~v---~ysd~   30 (60)
T PF13511_consen   13 AEVYKWVDENGVV---HYSDT   30 (60)
T ss_pred             ccEEEEECCCCCE---EECcc
Confidence            4679999999984   55555


No 19 
>PRK14818 NADH dehydrogenase subunit B; Provisional
Probab=25.46  E-value=56  Score=25.54  Aligned_cols=21  Identities=33%  Similarity=0.634  Sum_probs=15.5

Q ss_pred             CCCCCCCChHHHHHHHHHHHc
Q psy8079          97 YLPTPPPIPAEIVKALEFLAS  117 (135)
Q Consensus        97 ~lP~~pp~p~~i~~al~~~~~  117 (135)
                      .+|-=||.|++|+.+|..+.+
T Consensus       133 yIPGCPP~PeaIl~gil~L~~  153 (173)
T PRK14818        133 YVPGCPPRPEALTEGLLRLQE  153 (173)
T ss_pred             EccCCCCCHHHHHHHHHHHHH
Confidence            566679999999888754443


No 20 
>PF10929 DUF2811:  Protein of unknown function (DUF2811);  InterPro: IPR021231  This is a bacterial family of uncharacterised proteins. 
Probab=25.28  E-value=73  Score=20.56  Aligned_cols=20  Identities=30%  Similarity=0.772  Sum_probs=16.4

Q ss_pred             CChHHHHHHHH-HHHcCCCCC
Q psy8079         103 PIPAEIVKALE-FLASQPKID  122 (135)
Q Consensus       103 p~p~~i~~al~-~~~~~p~~~  122 (135)
                      .||+.+.+++. ||.+||.=+
T Consensus         5 eiPe~L~~~m~~fie~hP~WD   25 (57)
T PF10929_consen    5 EIPEDLHQAMKDFIETHPNWD   25 (57)
T ss_pred             cccHHHHHHHHHHHHcCCCch
Confidence            47888888987 999999754


No 21 
>PF03562 MltA:  MltA specific insert domain;  InterPro: IPR005300 This group of proteins includes MltA; a membrane-bound, murein degrading transglycosylase enzyme which plays an important role in the controlled growth of the stress-bearing sacculus of Escherichia coli [, ].; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds; PDB: 2PI8_D 2AE0_X 2PIC_A 2GAE_A 2PJJ_A 3CZB_A 2G6G_A 2PNW_A 2G5D_A.
Probab=24.43  E-value=53  Score=25.15  Aligned_cols=54  Identities=17%  Similarity=0.308  Sum_probs=33.1

Q ss_pred             ceEEEEEEEECCCCCEEEEEEEeCCCC--ccccCC------CCCCCCCChHHHHHHHHHHHcCCCC
Q psy8079          64 QVAQGAYTYTGADGNPYSLQYTADEEG--FKPSAN------YLPTPPPIPAEIVKALEFLASQPKI  121 (135)
Q Consensus        64 ~~v~GsYsy~~pdG~~~~V~Y~ADenG--f~~~g~------~lP~~pp~p~~i~~al~~~~~~p~~  121 (135)
                      +.|+||=.-..|||....|.|- +.||  |+..|.      .|+...-   ..+...+|+++||.+
T Consensus        82 lqIQGSGrl~l~DG~~~rvgYA-g~NG~pY~sIGr~Li~~G~i~~~~~---Smq~Ir~wl~~~P~~  143 (158)
T PF03562_consen   82 LQIQGSGRLRLPDGSTVRVGYA-GQNGHPYTSIGRLLIDRGEIPREQM---SMQAIRAWLRAHPEE  143 (158)
T ss_dssp             HHHHSEEEEE-TTSSEEEEEEE-EE-SS----HHHHHHHTTSS-TTS----SHHHHHHHHHHTGGG
T ss_pred             eeecCcEEEEcCCCCEEEEEEc-ccCCCccccHHHHHHHcCCcChhhC---CHHHHHHHHHHCHHH
Confidence            5688999999999999999995 4566  454443      3432211   133345799999875


No 22 
>COG3422 Uncharacterized conserved protein [Function unknown]
Probab=23.02  E-value=1.1e+02  Score=19.93  Aligned_cols=20  Identities=20%  Similarity=0.507  Sum_probs=15.5

Q ss_pred             cccCCCCCeeEEEecCCCce
Q psy8079          27 TQVNPDGSYQYSFQTSNGIA   46 (135)
Q Consensus        27 ~~~~~dG~Y~f~Ye~~dG~~   46 (135)
                      -..+.+|.|.|.+..+||..
T Consensus         7 i~kdk~Ge~rfrlkA~N~ei   26 (59)
T COG3422           7 IYKDKAGEYRFRLKAANGEI   26 (59)
T ss_pred             EEEcCCCcEEEEEEccCccE
Confidence            34578899999999888754


No 23 
>TIGR03795 chp_BMA0021 conserved hypothetical protein, BMA_0021 family. Members of this protein family are found sparsely, mostly in members of the genus Burkholderia. Members often occur as tandem homologous genes, such as BMA_0021 and BMA_0022 in Burkholderia mallei ATCC 23344. The genes regularly are encoded near the so-called docking protein of TOMM (thiazole/oxazole-modified microcins) biosynthetic clusters, suggesting a role in bacteriocin biosynthesis. The function is unknown.
Probab=22.47  E-value=1.4e+02  Score=21.69  Aligned_cols=23  Identities=30%  Similarity=0.439  Sum_probs=13.1

Q ss_pred             CCCCCCChHHHHHHH-HHHHcCCC
Q psy8079          98 LPTPPPIPAEIVKAL-EFLASQPK  120 (135)
Q Consensus        98 lP~~pp~p~~i~~al-~~~~~~p~  120 (135)
                      ||.+|..-+...-|| +|+.+|+.
T Consensus        89 iP~aP~~~~~~a~ALAayn~~~~~  112 (114)
T TIGR03795        89 IPPAPEHLAEEAIALAAYNDAGPA  112 (114)
T ss_pred             cCCCCCccchHHHHHHHHhccCcc
Confidence            777655433232344 48888874


No 24 
>PF00659 POLO_box:  POLO box duplicated region;  InterPro: IPR000959 A subgroup of serine/threonine protein kinases, Polo or Polo-like kinases play multiple roles during the cell cycle. Polo kinases are required at several key points through mitosis, starting from control of the G2/M transition through phosphorylation of Cdc25C and mitotic cyclins. Polo kinases are characterised by an amino terminal catalytic domain, and a carboxy terminal non-catalytic domain consisting of three blocks of conserved sequences known as polo boxes which form one single functional domain []. The domain is named after its founding member encoded by the polo gene of Drosophila melanogaster []. This domain of around 70 amino acids has been found in species ranging from yeast to mammals. Polo boxes appear to mediate interaction with multiple proteins through protein:protein interactions; some but not all of these proteins are substrates for the kinase domain of the molecule [].  The crystal structure of the polo domain of the murine protein, Sak, is dimeric, consisting of two alpha-helices and two six-stranded beta-sheets []. The topology of one polypeptide subunit of the dimer consists of, from its N- to C terminus, an extended strand segment, five beta-strands, one alpha-helix (A) and a C-terminal beta-strand. Beta-strands from one subunit form a contiguous antiparallel beta-sheet with beta-strands from the second subunit. The two beta-sheets pack with a crossing angle of 110 degrees, orienting the hydrophobic surfaces inward and the hydrophilic surfaces outward. Helix A, which is colinear with beta-strand 6 of the same polypeptide, buries a large portion of the non-overlapping hydrophobic beta-sheet surfaces. Interactions involving helices A comprise a majority of the hydrophobic core structure and also the dimer interface. Point mutations in the Polo box of the budding yeast Cdc5 protein abolish the ability of overexpressed Cdc5 to interact with the spindle poles and to organise cytokinetic structures [].; GO: 0005515 protein binding; PDB: 1MBY_B 3P37_A 3MHN_A 1Q4K_A 3HIK_A 3Q1I_A 3P35_A 3MHQ_A 1UMW_B 3MQ8_B ....
Probab=22.22  E-value=2.1e+02  Score=17.89  Aligned_cols=13  Identities=38%  Similarity=0.646  Sum_probs=11.0

Q ss_pred             CeeEEEecCCCce
Q psy8079          34 SYQYSFQTSNGIA   46 (135)
Q Consensus        34 ~Y~f~Ye~~dG~~   46 (135)
                      .|.+.|..+||+.
T Consensus         4 k~gi~~~LSng~v   16 (68)
T PF00659_consen    4 KYGIGYQLSNGTV   16 (68)
T ss_dssp             CSEEEEEETTSEE
T ss_pred             ceEEEEEEeCCCE
Confidence            4789999999984


No 25 
>cd08344 MhqB_like_N N-terminal domain of MhqB, a type I extradiol dioxygenase, and similar proteins. This subfamily contains the N-terminal, non-catalytic, domain of Burkholderia sp. NF100 MhqB and similar proteins. MhqB is a type I extradiol dioxygenase involved in the catabolism of methylhydroquinone, an intermediate in the degradation of fenitrothion. The purified enzyme has shown extradiol ring cleavage activity toward 3-methylcatechol. Fe2+ was suggested as a cofactor, the same as most other enzymes in the family. Burkholderia sp. NF100 MhqB is encoded on the plasmid pNF1. The type I family of extradiol dioxygenases contains two structurally homologous barrel-shaped domains at the N- and C-terminal. The active-site metal is located in the C-terminal barrel and plays an essential role in the catalytic mechanism.
Probab=22.12  E-value=1.1e+02  Score=20.28  Aligned_cols=17  Identities=18%  Similarity=0.335  Sum_probs=14.0

Q ss_pred             EEEEECCCCCEEEEEEE
Q psy8079          69 AYTYTGADGNPYSLQYT   85 (135)
Q Consensus        69 sYsy~~pdG~~~~V~Y~   85 (135)
                      ++.+.||||..+.+.+-
T Consensus        93 ~~~~~DP~Gn~iel~~~  109 (112)
T cd08344          93 GVWFRDPDGNLLQVKVA  109 (112)
T ss_pred             EEEEECCCCCEEEEecC
Confidence            47799999999998753


No 26 
>PF09242 FCSD-flav_bind:  Flavocytochrome c sulphide dehydrogenase, flavin-binding;  InterPro: IPR015323 This entry represents the flavin-binding domain of flavocytochrome c sulphide dehydrogenase (FCSD), enzymes found in sulphur-oxidising bacteria such as the purple phototrophic bacteria Chromatium vinosum [, ]. These enzymes are complexes of flavoprotein and a dihaem cytochrome that carry out hydrogen sulphide-dependent cytochrome C reduction. The dihaem cytochrome folds into two domains, each of which resembles mitochondrial cytochrome c, with the two haem groups bound to the interior of the subunit. The flavoprotein subunit has a glutathione reductase-like fold consisting of a beta(3,4)-alpha(3) core, and an alpha+beta sandwich. The active site of the flavoprotein subunit contains a catalytically important disulphide bridge located above the pyrimidine portion of the flavin ring []. Electrons are transferred from the flavin to one of the haem groups in the cytochrome. This entry represents a flavoprotein domain required for binding to flavin, and subsequent electron transfer. ; PDB: 1FCD_A.
Probab=21.91  E-value=92  Score=20.44  Aligned_cols=29  Identities=17%  Similarity=0.211  Sum_probs=19.6

Q ss_pred             EEEEEEEECCC-CCEEEEEEEeCCCCcccc
Q psy8079          66 AQGAYTYTGAD-GNPYSLQYTADEEGFKPS   94 (135)
Q Consensus        66 v~GsYsy~~pd-G~~~~V~Y~ADenGf~~~   94 (135)
                      +.==||+++|| |..+.-.|..++.++...
T Consensus         7 ~NtCYSlv~p~~aIsVa~vy~~~~~~i~~v   36 (71)
T PF09242_consen    7 ANTCYSLVAPDYAISVAGVYRYDDGKIVEV   36 (71)
T ss_dssp             EEEEEEEEETTEEEEEEEEEEE-TTTEEE-
T ss_pred             ceeEEEEeCCCceEEEEEEEEECCCeEEEe
Confidence            44468888886 777777888888775443


No 27 
>PF10681 Rot1:  Chaperone for protein-folding within the ER, fungal;  InterPro: IPR019623  This conserved fungal family is an essential molecular chaperone in the endoplasmic reticulum. Molecular chaperones transiently interact with unfolded proteins to inhibit their self-aggregation and to support their folding and/or assembly. Rot1 is a general chaperone with some substrate specificity, its substrates being the structurally unrelated Kre5 Kre6 Big1 Atg22, which are type I, type II, and polytopic membrane proteins. The dependencies of each for Rot1 do not share similarities. However, their folding does require BiP, and one of these proteins was simultaneously associated with both Rot1 and BiP. In addition, Rot1 may cooperate with BiP/Kar2 in the folding of Kre6 []. 
Probab=20.06  E-value=3.8e+02  Score=21.66  Aligned_cols=60  Identities=23%  Similarity=0.409  Sum_probs=31.8

Q ss_pred             ccCCCCCeeEEEecCCCceEEEeeeeccCCCCC------cccceEEEEEEEECCCCCEEEEEEEeCCCCcccc
Q psy8079          28 QVNPDGSYQYSFQTSNGIAAQESGFLKNAGNPE------AEVQVAQGAYTYTGADGNPYSLQYTADEEGFKPS   94 (135)
Q Consensus        28 ~~~~dG~Y~f~Ye~~dG~~~~E~G~~~~~G~~~------~~~~~v~GsYsy~~pdG~~~~V~Y~ADenGf~~~   94 (135)
                      +.+.-| .+|+| +.||-..+..  ....+++.      ..-+-++|.|+.. ++|.+...=+..|  |.|..
T Consensus        28 eP~~~G-ISYSF-T~DG~~EeA~--Yr~~~Np~~p~C~~a~l~wQHGtY~l~-~nGsl~L~P~~~D--GrQl~   93 (212)
T PF10681_consen   28 EPSLPG-ISYSF-TEDGYFEEAQ--YRVTSNPTNPSCPTAVLIWQHGTYELN-SNGSLTLTPFAVD--GRQLV   93 (212)
T ss_pred             CCCCCc-eeEEE-cCCCeeeEEE--EEEccCCCCCCCCceEEEEecceEEEC-CCCcEEEeecCCC--Cceec
Confidence            444555 55555 8888553333  33333321      1223568999865 6676665555444  44443


Done!