Query         033621
Match_columns 115
No_of_seqs    111 out of 371
Neff          6.1 
Searched_HMMs 46136
Date          Fri Mar 29 04:23:15 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/033621.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/033621hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG3343 Vesicle coat complex C 100.0   6E-33 1.3E-37  202.8  11.1  102    1-106     3-105 (175)
  2 COG5030 APS2 Clathrin adaptor  100.0 1.3E-28 2.8E-33  176.7  11.9   99    6-107     1-99  (152)
  3 KOG0936 Clathrin adaptor compl 100.0 8.4E-28 1.8E-32  175.2  10.7   98    6-106     1-104 (182)
  4 KOG0934 Clathrin adaptor compl  99.9 3.4E-26 7.3E-31  163.8   8.4   99    6-107     1-99  (145)
  5 PF01217 Clat_adaptor_s:  Clath  99.9 2.2E-24 4.7E-29  153.5  14.6   98    6-106     1-98  (141)
  6 COG5541 RET3 Vesicle coat comp  99.9 1.5E-24 3.2E-29  158.8  10.6  102    2-107     3-113 (187)
  7 KOG0935 Clathrin adaptor compl  99.9 1.8E-22 3.9E-27  142.4  12.0  100    6-107     1-100 (143)
  8 KOG0938 Adaptor complexes medi  96.6   0.036 7.8E-07   45.9  10.7   93    6-104     1-93  (446)
  9 KOG0937 Adaptor complexes medi  96.6   0.064 1.4E-06   45.1  12.0   94    7-104     1-94  (424)
 10 PF08923 MAPKK1_Int:  Mitogen-a  96.4   0.055 1.2E-06   38.0   9.3   97    4-102    14-115 (119)
 11 PF15001 AP-5_subunit_s1:  AP-5  96.0    0.13 2.8E-06   39.1   9.9   44   61-104   102-145 (189)
 12 PF03164 Mon1:  Trafficking pro  94.1     1.5 3.2E-05   36.6  12.0   86    8-98     12-98  (415)
 13 KOG2740 Clathrin-associated pr  92.9    0.35 7.6E-06   40.3   6.3   98    6-107     1-99  (418)
 14 PF03259 Robl_LC7:  Roadblock/L  85.6     4.9 0.00011   25.2   6.2   74    3-78     11-90  (91)
 15 KOG2635 Medium subunit of clat  84.8      15 0.00033   31.6  10.2   91    6-103     2-94  (512)
 16 COG3322 Predicted periplasmic   76.0      10 0.00022   30.6   6.1   23    6-28    103-125 (295)
 17 PF09695 YtfJ_HI0045:  Bacteria  67.6      15 0.00033   27.3   4.9   30    8-40    126-155 (160)
 18 COG2018 Uncharacterized distan  65.3      46   0.001   23.5   7.7   80    3-84     15-100 (119)
 19 PF05228 CHASE4:  CHASE4 domain  54.2      42 0.00092   23.0   5.2   24    4-27     47-72  (161)
 20 PF09561 RE_HpaII:  HpaII restr  48.1      85  0.0018   26.2   6.7   57    6-75    291-347 (355)
 21 PF05323 Pox_A21:  Poxvirus A21  43.0      65  0.0014   22.5   4.5   38   30-71     20-57  (111)
 22 PF13774 Longin:  Regulated-SNA  42.9      83  0.0018   19.6   8.5   63   40-104     3-65  (83)
 23 PF04315 DUF462:  Protein of un  42.5      21 0.00046   26.6   2.2   20   21-40     65-84  (164)
 24 cd03016 PRX_1cys Peroxiredoxin  40.0      57  0.0012   24.1   4.2   22    6-27    116-137 (203)
 25 cd04670 Nudix_Hydrolase_12 Mem  36.7 1.2E+02  0.0027   19.9   5.2   41    9-49      6-57  (127)
 26 COG0450 AhpC Peroxiredoxin [Po  36.3 1.2E+02  0.0026   23.2   5.5   23    6-28    123-145 (194)
 27 PF13503 DUF4123:  Domain of un  34.3      61  0.0013   21.4   3.3   18   11-28     89-106 (124)
 28 PRK13190 putative peroxiredoxi  34.3      58  0.0013   24.1   3.5   23    6-28    116-138 (202)
 29 COG3054 Predicted transcriptio  33.3      46 0.00099   25.0   2.7   17    8-24    148-164 (184)
 30 PRK13599 putative peroxiredoxi  32.5      56  0.0012   24.7   3.2   22    6-27    118-139 (215)
 31 PHA03023 hypothetical protein;  31.6 1.1E+02  0.0024   21.4   4.2   39   30-72     20-59  (112)
 32 PHA03045 IMV membrane protein;  31.6 1.2E+02  0.0026   21.3   4.4   39   30-72     20-58  (113)
 33 PF10623 PilI:  Plasmid conjuga  30.1      76  0.0017   21.0   3.0   15    9-23     10-24  (83)
 34 cd04669 Nudix_Hydrolase_11 Mem  29.5 1.7E+02  0.0036   19.3   4.9   39    9-47      4-53  (121)
 35 cd04685 Nudix_Hydrolase_26 Mem  29.3 1.9E+02  0.0041   19.8   5.2   41    9-49      4-58  (133)
 36 PF01820 Dala_Dala_lig_N:  D-al  27.9 1.5E+02  0.0032   20.1   4.4   28   73-101     3-30  (117)
 37 cd03671 Ap4A_hydrolase_plant_l  27.4 2.1E+02  0.0045   19.6   5.2   41    9-49      7-57  (147)
 38 PF14903 WG_beta_rep:  WG conta  27.4      68  0.0015   16.3   2.1   16   12-27      3-18  (35)
 39 cd04678 Nudix_Hydrolase_19 Mem  26.8 1.7E+02  0.0037   19.2   4.5   40    9-48      6-57  (129)
 40 PRK13191 putative peroxiredoxi  26.8      81  0.0018   23.8   3.2   23    5-27    122-144 (215)
 41 PF15404 PH_4:  Pleckstrin homo  26.2      42 0.00092   25.3   1.5   29   42-70      5-33  (185)
 42 cd08151 AOS Allene oxide synth  26.1      83  0.0018   25.3   3.3   23    3-25    158-180 (328)
 43 PRK00714 RNA pyrophosphohydrol  25.3 2.2E+02  0.0047   20.0   5.1   41    9-49     12-62  (156)
 44 PF03456 uDENN:  uDENN domain;   25.2 1.1E+02  0.0023   18.3   3.0   19    4-22     41-59  (65)
 45 PF00989 PAS:  PAS fold;  Inter  24.7      82  0.0018   19.2   2.5   16    7-22     11-26  (113)
 46 KOG0997 Uncharacterized conser  24.5 4.9E+02   0.011   22.9  10.1   70    8-83    118-189 (523)
 47 cd03430 GDPMH GDP-mannose glyc  24.2 2.4E+02  0.0052   19.4   5.0   41    8-48     15-67  (144)
 48 PRK13189 peroxiredoxin; Provis  24.0      96  0.0021   23.4   3.1   22    6-27    125-146 (222)
 49 cd03018 PRX_AhpE_like Peroxire  23.6 1.8E+02  0.0038   19.5   4.2   19    9-27    116-134 (149)
 50 PF15603 Imm45:  Immunity prote  23.2 1.2E+02  0.0026   19.9   3.1   32   80-111    49-80  (82)
 51 PF13426 PAS_9:  PAS domain; PD  22.9      94   0.002   18.5   2.5   15    8-22      2-16  (104)
 52 KOG0559 Dihydrolipoamide succi  22.7      98  0.0021   26.3   3.1   21   60-80    308-328 (457)
 53 PLN02609 catalase               21.0 2.5E+02  0.0054   24.4   5.3   49    2-50    202-263 (492)
 54 cd08156 catalase_clade_3 Clade  20.6 1.2E+02  0.0025   25.8   3.3   48    3-50    146-206 (429)

No 1  
>KOG3343 consensus Vesicle coat complex COPI, zeta subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=100.00  E-value=6e-33  Score=202.82  Aligned_cols=102  Identities=51%  Similarity=0.823  Sum_probs=96.9

Q ss_pred             CCCcceEeEEEEEcCCCCEEEEeeCCCCCCC-hHHHHHHHHHHHHHHhcCCCccccceEEecCeEEEEEEeceEEEEEEe
Q 033621            1 MDTCPSIKNILLLDSEGKRVAVKYYSNDWPT-NSAKEAFEKAVFTKTQKTNARTEAEIAMLEGNVIVYKFVQDLHFFVTG   79 (115)
Q Consensus         1 ~~sl~~I~ailIld~~Gkr~~sKyY~~~~~~-~~~q~~fE~~i~~k~~~~~~~~~~~i~~~~~~~vVyr~~adl~f~vv~   79 (115)
                      +.|+|.|+|++|+|++|+|+++|||++++++ .++|++|||.+|+|++|.+    +||.++||+++|||.+.||+||++|
T Consensus         3 ~~sly~vk~iliLD~~G~Ri~aKYY~~~~~s~vkeqkaFEK~lF~KT~kt~----~eI~~ldg~~vvYk~~~Dl~fyv~G   78 (175)
T KOG3343|consen    3 PLSLYTVKAILILDSDGKRILAKYYDDPHPSTVKEQKAFEKNLFSKTSKTE----SEILLLDGNTVVYKSVIDLHFYVVG   78 (175)
T ss_pred             ccchhhhheEEEEcCCCCEeeeeecCCCcchhHHHHHHHHHHHhccccccc----ceeEEecCcEEEEEecccEEEEEec
Confidence            3589999999999999999999999998877 9999999999999999864    6999999999999999999999999


Q ss_pred             cCCcchHHHHHHHHHHHHHHHhhhCCc
Q 033621           80 GEDENELILATVLQGFFDAVGLLLRLL  106 (115)
Q Consensus        80 ~~~ENEL~l~~~i~~~veaLd~~f~~~  106 (115)
                      +.+||||+|.++++|++|||+.+|++.
T Consensus        79 ~~~ENEl~L~svL~~l~dal~llLr~n  105 (175)
T KOG3343|consen   79 SEEENELMLMSVLTCLFDALSLLLRKN  105 (175)
T ss_pred             CcchhHHHHHHHHHHHHHHHHHHHHhC
Confidence            999999999999999999999999864


No 2  
>COG5030 APS2 Clathrin adaptor complex, small subunit [Intracellular trafficking and secretion]
Probab=99.96  E-value=1.3e-28  Score=176.72  Aligned_cols=99  Identities=20%  Similarity=0.381  Sum_probs=93.9

Q ss_pred             eEeEEEEEcCCCCEEEEeeCCCCCCChHHHHHHHHHHHHHHhcCCCccccceEEecCeEEEEEEeceEEEEEEecCCcch
Q 033621            6 SIKNILLLDSEGKRVAVKYYSNDWPTNSAKEAFEKAVFTKTQKTNARTEAEIAMLEGNVIVYKFVQDLHFFVTGGEDENE   85 (115)
Q Consensus         6 ~I~ailIld~~Gkr~~sKyY~~~~~~~~~q~~fE~~i~~k~~~~~~~~~~~i~~~~~~~vVyr~~adl~f~vv~~~~ENE   85 (115)
                      ||+++||+|++||+|++|||+|  .+..+|+++.++|++.++.|+++ ++++..+.+.++|||+||+|||+++.+.+|||
T Consensus         1 ~i~~vli~nrqgk~RL~K~yt~--~~~~e~~kli~~i~~lIs~R~~k-e~N~~e~k~~kiVYrrYA~LyF~f~Vd~~dnE   77 (152)
T COG5030           1 MIKFVLIFNRQGKPRLVKWYTP--VSDPEQAKLIADIYELISARKPK-ESNFIEGKNEKIVYRRYATLYFVFGVDNDDNE   77 (152)
T ss_pred             CeEEEEEEcCCCceeeeEeecc--CCcHHHHHHHHHHHHHHHcCCch-hcccccccCcEEEeeecCcEEEEEEEcCCCCc
Confidence            7999999999999999999987  47888999999999999999865 78999999999999999999999999999999


Q ss_pred             HHHHHHHHHHHHHHHhhhCCcc
Q 033621           86 LILATVLQGFFDAVGLLLRLLH  107 (115)
Q Consensus        86 L~l~~~i~~~veaLd~~f~~~~  107 (115)
                      |+++++||.|||+||++|+|+-
T Consensus        78 l~iL~lIh~FVE~lDr~FgnVC   99 (152)
T COG5030          78 LIILELIHNFVEILDRFFGNVC   99 (152)
T ss_pred             chHHHHHHHHHHHHHHHhccce
Confidence            9999999999999999999863


No 3  
>KOG0936 consensus Clathrin adaptor complex, small subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.95  E-value=8.4e-28  Score=175.21  Aligned_cols=98  Identities=20%  Similarity=0.347  Sum_probs=88.8

Q ss_pred             eEeEEEEEcCCCCEEEEeeCCCCCCChHHHHHHHHHHHHHHhcCCCccccceEE------ecCeEEEEEEeceEEEEEEe
Q 033621            6 SIKNILLLDSEGKRVAVKYYSNDWPTNSAKEAFEKAVFTKTQKTNARTEAEIAM------LEGNVIVYKFVQDLHFFVTG   79 (115)
Q Consensus         6 ~I~ailIld~~Gkr~~sKyY~~~~~~~~~q~~fE~~i~~k~~~~~~~~~~~i~~------~~~~~vVyr~~adl~f~vv~   79 (115)
                      ||+|++|+|++||+|++|||+|  .+.+.|+++.+++|..+++|+. +-|+++.      -.+..++||+||+|||+++.
T Consensus         1 MI~AvlifNn~gkPRL~KFY~p--~~~~~Qq~lir~vf~lvs~R~~-n~~nFLe~~~l~g~~d~rlIYrhYATLYFvfvv   77 (182)
T KOG0936|consen    1 MIKAVLIFNNKGKPRLVKFYTP--VDEEKQQQLIREVFHLVSKRPD-NVCNFLEGNSLIGGSDNRLIYRHYATLYFVFVV   77 (182)
T ss_pred             CeeEEEEecCCCCcceeeecCc--CChHHHHHHHHHHHHHHHcCCc-hHhhhhccccccCCccceeehheeeeEEEEEEE
Confidence            8999999999999999999998  4899999999999999999984 1233333      25899999999999999999


Q ss_pred             cCCcchHHHHHHHHHHHHHHHhhhCCc
Q 033621           80 GEDENELILATVLQGFFDAVGLLLRLL  106 (115)
Q Consensus        80 ~~~ENEL~l~~~i~~~veaLd~~f~~~  106 (115)
                      |++||||+++|+||.|||+||++|.|+
T Consensus        78 D~sEsEL~iLDLIQvfVEtLDkCF~nV  104 (182)
T KOG0936|consen   78 DSSESELGILDLIQVFVETLDKCFENV  104 (182)
T ss_pred             cCCcchhHHHHHHHHHHHHHHHHHHhh
Confidence            999999999999999999999999986


No 4  
>KOG0934 consensus Clathrin adaptor complex, small subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.93  E-value=3.4e-26  Score=163.76  Aligned_cols=99  Identities=17%  Similarity=0.325  Sum_probs=94.2

Q ss_pred             eEeEEEEEcCCCCEEEEeeCCCCCCChHHHHHHHHHHHHHHhcCCCccccceEEecCeEEEEEEeceEEEEEEecCCcch
Q 033621            6 SIKNILLLDSEGKRVAVKYYSNDWPTNSAKEAFEKAVFTKTQKTNARTEAEIAMLEGNVIVYKFVQDLHFFVTGGEDENE   85 (115)
Q Consensus         6 ~I~ailIld~~Gkr~~sKyY~~~~~~~~~q~~fE~~i~~k~~~~~~~~~~~i~~~~~~~vVyr~~adl~f~vv~~~~ENE   85 (115)
                      ||+++|+.++|||.|++|||++  .+.++|+++++++.+.+.+|+.+ .|+++.+.++++|||+||+|+|+++.+.+|||
T Consensus         1 mi~f~LlvsrQGk~rL~k~y~~--~~~~er~~i~re~i~~~Lar~pk-~csfie~kd~kvVyrryasl~f~~~v~~~dNE   77 (145)
T KOG0934|consen    1 MIKFFLLVSRQGKTRLQKWYEA--LSIKERKKIERELIKSVLARKPK-MCSFIEYKDEKVVYRRYASLFFCVGVEDNDNE   77 (145)
T ss_pred             CeEEEEEEeccCceehhHHHhh--hcHHHHHHHHHHHHHHHHhCCcc-cccchhccCceehhhhhhhEEEEEEEecCCch
Confidence            7999999999999999999997  48999999999999999999754 68999999999999999999999999999999


Q ss_pred             HHHHHHHHHHHHHHHhhhCCcc
Q 033621           86 LILATVLQGFFDAVGLLLRLLH  107 (115)
Q Consensus        86 L~l~~~i~~~veaLd~~f~~~~  107 (115)
                      |+++++||.+||.||++|+++-
T Consensus        78 L~~LE~IH~~vE~lDkYFg~VC   99 (145)
T KOG0934|consen   78 LAILEFIHNYVELLDKYFGSVC   99 (145)
T ss_pred             hhHHHHHHHHHHHHHHHhccce
Confidence            9999999999999999999874


No 5  
>PF01217 Clat_adaptor_s:  Clathrin adaptor complex small chain;  InterPro: IPR022775 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer [].  Clathrin coats contain both clathrin and adaptor complexes that link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. The two major types of clathrin adaptor complexes are the heterotetrameric adaptor protein (AP) complexes, and the monomeric GGA (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) adaptors []. All AP complexes are heterotetramers composed of two large subunits (adaptins), a medium subunit (mu) and a small subunit (sigma). Each subunit has a specific function. Adaptin subunits recognise and bind to clathrin through their hinge region (clathrin box), and recruit accessory proteins that modulate AP function through their C-terminal appendage domains. By contrast, GGAs are monomers composed of four domains, which have functions similar to AP subunits: an N-terminal VHS (Vps27p/Hrs/Stam) domain, a GAT (GGA and Tom1) domain, a hinge region, and a C-terminal GAE (gamma-adaptin ear) domain. The GAE domain is similar to the AP gamma-adaptin ear domain, being responsible for the recruitment of accessory proteins that regulate clathrin-mediated endocytosis []. While clathrin mediates endocytic protein transport from ER to Golgi, coatomers (COPI, COPII) primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits.  This entry represents the small sigma and mu subunits of various adaptins from different AP clathrin adaptor complexes (including AP1, AP2, AP3 and AP4), and the zeta and delta subunits of various coatomer (COP) adaptors. The small sigma subunit of AP proteins have been characterised in several species [, , , ]. The sigma subunit plays a role in protein sorting in the late-Golgi/trans-Golgi network (TGN) and/or endosomes. The zeta subunit of coatomers (zeta-COP) is required for coatomer binding to Golgi membranes and for coat-vesicle assembly [, ]. More information about these proteins can be found at Protein of the Month: Clathrin [].; PDB: 1W63_W 2JKR_I 2VGL_S 2JKT_I 2XA7_S 2HF6_A 3TJZ_C.
Probab=99.93  E-value=2.2e-24  Score=153.46  Aligned_cols=98  Identities=32%  Similarity=0.506  Sum_probs=86.0

Q ss_pred             eEeEEEEEcCCCCEEEEeeCCCCCCChHHHHHHHHHHHHHHhcCCCccccceEEecCeEEEEEEeceEEEEEEecCCcch
Q 033621            6 SIKNILLLDSEGKRVAVKYYSNDWPTNSAKEAFEKAVFTKTQKTNARTEAEIAMLEGNVIVYKFVQDLHFFVTGGEDENE   85 (115)
Q Consensus         6 ~I~ailIld~~Gkr~~sKyY~~~~~~~~~q~~fE~~i~~k~~~~~~~~~~~i~~~~~~~vVyr~~adl~f~vv~~~~ENE   85 (115)
                      ||+|++|+|++|+||++|||.+.  +.++++++++++.+++.++++. .++++.++++.+|||+++||+|+++++.+|||
T Consensus         1 MI~~i~i~n~~G~~i~~k~y~~~--~~~~~~~~~~~~~~~~~~~~~~-~~~i~~~~~~~~vy~~~~dl~~~~v~~~~eNe   77 (141)
T PF01217_consen    1 MIKAILILNSQGKRILSKYYRDV--SEEERQKLFEKFIKKKSSRNSK-QSPIFEHDNYRIVYKRYSDLYFVVVGDENENE   77 (141)
T ss_dssp             SEEEEEEEETTSEEEEEEESSTS--TSHHHHHHHHHHHHHHHTSSSS-STSEEEETTEEEEEEEETTEEEEEEESSTSBH
T ss_pred             CEEEEEEEcCCCCEEEehhcCCc--cHHHHHHHHHHHHHHHHhcccc-cceeeecccceeeeEeeccEEEEEEeecccch
Confidence            89999999999999999999653  5556666667777777777532 38999999999999999999999999999999


Q ss_pred             HHHHHHHHHHHHHHHhhhCCc
Q 033621           86 LILATVLQGFFDAVGLLLRLL  106 (115)
Q Consensus        86 L~l~~~i~~~veaLd~~f~~~  106 (115)
                      +++++++|.++++|+.+|++.
T Consensus        78 l~~~e~l~~~v~~l~~~~~~v   98 (141)
T PF01217_consen   78 LLLLEFLHRLVEVLDDYFGNV   98 (141)
T ss_dssp             HHHHHHHHHHHHHHHHHHSS-
T ss_pred             HHHHHHHHHhhhhhhhhhccc
Confidence            999999999999999999964


No 6  
>COG5541 RET3 Vesicle coat complex COPI, zeta subunit [Posttranslational modification, protein turnover, chaperones]
Probab=99.92  E-value=1.5e-24  Score=158.76  Aligned_cols=102  Identities=25%  Similarity=0.407  Sum_probs=96.0

Q ss_pred             CCcceEeEEEEEcCCCCEEEEeeCCCCC---------CChHHHHHHHHHHHHHHhcCCCccccceEEecCeEEEEEEece
Q 033621            2 DTCPSIKNILLLDSEGKRVAVKYYSNDW---------PTNSAKEAFEKAVFTKTQKTNARTEAEIAMLEGNVIVYKFVQD   72 (115)
Q Consensus         2 ~sl~~I~ailIld~~Gkr~~sKyY~~~~---------~~~~~q~~fE~~i~~k~~~~~~~~~~~i~~~~~~~vVyr~~ad   72 (115)
                      .|||.|+|+||+|+||+|+++|||.|++         .+.+++++||+++++|+.+..    .+|.+++|++|+|+++.|
T Consensus         3 lsLy~v~a~LilDsqGeriy~kyy~pph~~eg~~~vFnsvkkekefek~l~eKt~k~~----~~Il~f~d~lV~~k~~~d   78 (187)
T COG5541           3 LSLYDVEALLILDSQGERIYRKYYQPPHRSEGHQLVFNSVKKEKEFEKKLAEKTAKDR----ESILMFYDRLVMCKRLDD   78 (187)
T ss_pred             cchheeeeeEEecCCccchhhhhcCCcccccccchhhcchhHHHHHHHHHHHHhhcCc----cceeeEcceeeeeeeehh
Confidence            3799999999999999999999999974         577899999999999999875    499999999999999999


Q ss_pred             EEEEEEecCCcchHHHHHHHHHHHHHHHhhhCCcc
Q 033621           73 LHFFVTGGEDENELILATVLQGFFDAVGLLLRLLH  107 (115)
Q Consensus        73 l~f~vv~~~~ENEL~l~~~i~~~veaLd~~f~~~~  107 (115)
                      +.+|++++-+|||..+.+++..+..||+.+++..+
T Consensus        79 v~~yiv~~meeNE~~l~q~f~~ir~Al~li~k~~~  113 (187)
T COG5541          79 VLLYIVSPMEENEPFLGQVFDEIRAALILIVKTPT  113 (187)
T ss_pred             EEEEEecccccccHHHHHHHHHHHHHHHHHHcCCc
Confidence            99999999999999999999999999999998754


No 7  
>KOG0935 consensus Clathrin adaptor complex, small subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.89  E-value=1.8e-22  Score=142.44  Aligned_cols=100  Identities=17%  Similarity=0.229  Sum_probs=90.9

Q ss_pred             eEeEEEEEcCCCCEEEEeeCCCCCCChHHHHHHHHHHHHHHhcCCCccccceEEecCeEEEEEEeceEEEEEEecCCcch
Q 033621            6 SIKNILLLDSEGKRVAVKYYSNDWPTNSAKEAFEKAVFTKTQKTNARTEAEIAMLEGNVIVYKFVQDLHFFVTGGEDENE   85 (115)
Q Consensus         6 ~I~ailIld~~Gkr~~sKyY~~~~~~~~~q~~fE~~i~~k~~~~~~~~~~~i~~~~~~~vVyr~~adl~f~vv~~~~ENE   85 (115)
                      ||++|||.|++||+|++|||-|.  +..++++.+.++++.++.|+++-.+.+....+.++++|+||+|+|+++.|..+||
T Consensus         1 mi~FILiqNr~Gk~RLak~yv~~--dd~ek~~~~~~vh~lvs~Rd~K~~~~~~~~~~~~~~~rryagLyf~~~vd~tDne   78 (143)
T KOG0935|consen    1 MIRFILIQNRAGKTRLAKWYVQF--DDDEKQKLIEEVHALVTVRDAKHTNFVEFRNFKIIYRRRYAGLYFCICVDVTDNE   78 (143)
T ss_pred             CeEEEEEEccccceeheeeeecc--CchHHHHHHHHHHHHHhhccchhhhheeeeeceEEEEEeeCCEEEEEEEecCCch
Confidence            79999999999999999999773  5556788899999999999987667777788888888899999999999999999


Q ss_pred             HHHHHHHHHHHHHHHhhhCCcc
Q 033621           86 LILATVLQGFFDAVGLLLRLLH  107 (115)
Q Consensus        86 L~l~~~i~~~veaLd~~f~~~~  107 (115)
                      ++.++.||.|||.||.+|+|+-
T Consensus        79 layLe~IHlFVEvLd~fF~NVC  100 (143)
T KOG0935|consen   79 LAYLEHIHLFVEVLDEFFHNVC  100 (143)
T ss_pred             HHHHHHHHHHHHHHHHHhcccc
Confidence            9999999999999999999973


No 8  
>KOG0938 consensus Adaptor complexes medium subunit family [Intracellular trafficking, secretion, and vesicular transport]
Probab=96.63  E-value=0.036  Score=45.92  Aligned_cols=93  Identities=10%  Similarity=0.132  Sum_probs=74.0

Q ss_pred             eEeEEEEEcCCCCEEEEeeCCCCCCChHHHHHHHHHHHHHHhcCCCccccceEEecCeEEEEEEeceEEEEEEecCCcch
Q 033621            6 SIKNILLLDSEGKRVAVKYYSNDWPTNSAKEAFEKAVFTKTQKTNARTEAEIAMLEGNVIVYKFVQDLHFFVTGGEDENE   85 (115)
Q Consensus         6 ~I~ailIld~~Gkr~~sKyY~~~~~~~~~q~~fE~~i~~k~~~~~~~~~~~i~~~~~~~vVyr~~adl~f~vv~~~~ENE   85 (115)
                      ||.|++|+|..|..++.|.|...   ++ +..  -++|+--.-.+.-.-+++...++...+|-+..+|+++.+..++.|=
T Consensus         1 misglfi~n~rGevlink~fr~d---lk-rs~--~diFRv~vi~n~d~r~PV~~igsttf~~~r~~nl~lvaitksN~Nv   74 (446)
T KOG0938|consen    1 MISGLFIYNLRGEVLINKTFRDD---LK-RSI--VDIFRVQVINNLDVRSPVLTIGSTTFHHIRSSNLWLVAITKSNANV   74 (446)
T ss_pred             CcceEEEEeccCcEEEehhhhhh---hh-hhH--HHHHHHhhhhccccCCCeeEecceeEEEEeeccEEEEEEecCCCch
Confidence            78999999999999999999753   32 111  2233322211111236899999999999999999999999999999


Q ss_pred             HHHHHHHHHHHHHHHhhhC
Q 033621           86 LILATVLQGFFDAVGLLLR  104 (115)
Q Consensus        86 L~l~~~i~~~veaLd~~f~  104 (115)
                      .+..+++--+.+.+..+|+
T Consensus        75 a~v~eFl~kl~avm~aYfg   93 (446)
T KOG0938|consen   75 AAVFEFLYKLDAVMNAYFG   93 (446)
T ss_pred             hhHHHHHHHHHHHHHHHhc
Confidence            9999999999999999998


No 9  
>KOG0937 consensus Adaptor complexes medium subunit family [Intracellular trafficking, secretion, and vesicular transport]
Probab=96.55  E-value=0.064  Score=45.14  Aligned_cols=94  Identities=17%  Similarity=0.281  Sum_probs=77.4

Q ss_pred             EeEEEEEcCCCCEEEEeeCCCCCCChHHHHHHHHHHHHHHhcCCCccccceEEecCeEEEEEEeceEEEEEEecCCcchH
Q 033621            7 IKNILLLDSEGKRVAVKYYSNDWPTNSAKEAFEKAVFTKTQKTNARTEAEIAMLEGNVIVYKFVQDLHFFVTGGEDENEL   86 (115)
Q Consensus         7 I~ailIld~~Gkr~~sKyY~~~~~~~~~q~~fE~~i~~k~~~~~~~~~~~i~~~~~~~vVyr~~adl~f~vv~~~~ENEL   86 (115)
                      +.|+.++|..|+.+.+|=|... -+...=-+|=+.++.+. ..+  ...+++..+|...+|=+..++++.+++..+-|=-
T Consensus         1 ~sa~fild~~G~~lisr~yr~d-v~~s~~~~F~~~l~~~e-~~~--~~~p~l~~~g~~~~~ik~s~lylv~~~~~n~~a~   76 (424)
T KOG0937|consen    1 ASAVFILDHKGEVLISRDYRGD-VPMSSTEKFFRKLFEKE-EGD--ESPPFLVHDGSRFIHIKHSNLYLVAGTRPNVSAA   76 (424)
T ss_pred             CceEEEEcCCCcEeEeeccccc-CChhhhhhHHHHHhhhc-ccC--CCCCeEEeCCceEEEEeecceEEEEEeccCCCHH
Confidence            4689999999999999999764 23444455665666665 222  1357889999999999999999999998999999


Q ss_pred             HHHHHHHHHHHHHHhhhC
Q 033621           87 ILATVLQGFFDAVGLLLR  104 (115)
Q Consensus        87 ~l~~~i~~~veaLd~~f~  104 (115)
                      .++++++.+++.+..+|+
T Consensus        77 ~v~~~l~~~~~v~~~y~~   94 (424)
T KOG0937|consen   77 LVLSFLYAVADVFGDYLS   94 (424)
T ss_pred             HHHHHHHHHHHHHHHHhc
Confidence            999999999999999998


No 10 
>PF08923 MAPKK1_Int:  Mitogen-activated protein kinase kinase 1 interacting;  InterPro: IPR015019 Protein phosphorylation, which plays a key role in most cellular activities, is a reversible process mediated by protein kinases and phosphoprotein phosphatases. Protein kinases catalyse the transfer of the gamma phosphate from nucleotide triphosphates (often ATP) to one or more amino acid residues in a protein substrate side chain, resulting in a conformational change affecting protein function. Phosphoprotein phosphatases catalyse the reverse process. Protein kinases fall into three broad classes, characterised with respect to substrate specificity []:   Serine/threonine-protein kinases Tyrosine-protein kinases Dual specific protein kinases (e.g. MEK - phosphorylates both Thr and Tyr on target proteins)   Protein kinase function has been evolutionarily conserved from Escherichia coli to human []. Protein kinases play a role in a multitude of cellular processes, including division, proliferation, apoptosis, and differentiation []. Phosphorylation usually results in a functional change of the target protein by changing enzyme activity, cellular location, or association with other proteins. The catalytic subunits of protein kinases are highly conserved, and several structures have been solved [], leading to large screens to develop kinase-specific inhibitors for the treatments of a number of diseases []. This entry represents Mitogen-activated protein kinase kinase 1 interacting protein, which is a small subcellular adaptor protein required for MAPK signalling and ERK1/2 activation. The overall topology of this domain has a central five-stranded beta-sheet sandwiched between a two alpha-helix and a one alpha-helix layer []. ; PDB: 1VEU_A 1VET_A 1SKO_A 2ZL1_A 3CPT_A.
Probab=96.40  E-value=0.055  Score=38.04  Aligned_cols=97  Identities=13%  Similarity=0.198  Sum_probs=66.8

Q ss_pred             cceEeEEEEEcCCCCEEEEeeCCCCCCChHHHHHHHHHHHHH----HhcCC-CccccceEEecCeEEEEEEeceEEEEEE
Q 033621            4 CPSIKNILLLDSEGKRVAVKYYSNDWPTNSAKEAFEKAVFTK----TQKTN-ARTEAEIAMLEGNVIVYKFVQDLHFFVT   78 (115)
Q Consensus         4 l~~I~ailIld~~Gkr~~sKyY~~~~~~~~~q~~fE~~i~~k----~~~~~-~~~~~~i~~~~~~~vVyr~~adl~f~vv   78 (115)
                      .+-+.+|+|-|+||-+++.=..+.. ++..-+..|... |..    ++|.+ +++..=+..|+++.++--....+++.++
T Consensus        14 v~Gl~~I~itDrDGvpi~~v~~~~~-~~~~~~~~~~~t-f~~a~~Q~~KL~lG~nk~ii~~Y~~~qvv~~~~~pl~it~i   91 (119)
T PF08923_consen   14 VDGLQAIVITDRDGVPIAKVSSDSA-PESAMRPSLLST-FAMAIDQASKLGLGKNKSIIAYYDSYQVVQFNKLPLYITFI   91 (119)
T ss_dssp             STTEEEEEEEETTS-EEEEEE-TTS--GGGGSHHHHCC-HHHHHHHHTTSSS-SEEEEEEEESSEEEEEEEETTEEEEEE
T ss_pred             cCCeEEEEEECCCCcEEEEecCCCC-cchhhhhHHHHH-HHHHhhcccccCCCCceEEEEEeCCEEEEEEeCCCeEEEEE
Confidence            4558999999999999888555542 333444455433 322    33432 2333457779999999999999999999


Q ss_pred             ecCCcchHHHHHHHHHHHHHHHhh
Q 033621           79 GGEDENELILATVLQGFFDAVGLL  102 (115)
Q Consensus        79 ~~~~ENEL~l~~~i~~~veaLd~~  102 (115)
                      ++++-|==.++++-+-+.+.|..+
T Consensus        92 as~~aN~G~il~l~~~L~~~l~~l  115 (119)
T PF08923_consen   92 ASSNANTGLILSLEEELAPILNEL  115 (119)
T ss_dssp             EETTS-HHHHHHHHHHHHHHHHHH
T ss_pred             ecCCCCHHHHHHhHHHHHHHHHHH
Confidence            999999888888877777776643


No 11 
>PF15001 AP-5_subunit_s1:  AP-5 complex subunit sigma-1
Probab=95.97  E-value=0.13  Score=39.05  Aligned_cols=44  Identities=23%  Similarity=0.275  Sum_probs=38.6

Q ss_pred             cCeEEEEEEeceEEEEEEecCCcchHHHHHHHHHHHHHHHhhhC
Q 033621           61 EGNVIVYKFVQDLHFFVTGGEDENELILATVLQGFFDAVGLLLR  104 (115)
Q Consensus        61 ~~~~vVyr~~adl~f~vv~~~~ENEL~l~~~i~~~veaLd~~f~  104 (115)
                      +...+|++...++.|.+|.+..||=+..-.+++.++..|...++
T Consensus       102 ~~k~vvW~~v~~l~ftLVce~hEN~lLa~~~L~~~~~~l~~~~~  145 (189)
T PF15001_consen  102 TPKIVVWLGVGSLCFTLVCEPHENRLLAENTLRLFIRHLLEHLK  145 (189)
T ss_pred             CCcEEEeeccCCEEEEEEecCchhHHHHHHHHHHHHHHHHHHHH
Confidence            45789999999999999999999999999999999888865553


No 12 
>PF03164 Mon1:  Trafficking protein Mon1;  InterPro: IPR004353 Members of this family have been called SAND proteins [] although these proteins do not contain a SAND domain. In Saccharomyces cerevisiae a protein complex of Mon1 and Ccz1 functions with the small GTPase Ypt7 to mediate vesicle trafficking to the vacuole [, ]. The Mon1/Ccz1 complex is conserved in eukaryotic evolution and members of this family (previously known as DUF254) are distant homologues to domains of known structure that assemble into cargo vesicle adapter (AP) complexes [, ].
Probab=94.13  E-value=1.5  Score=36.62  Aligned_cols=86  Identities=15%  Similarity=0.181  Sum_probs=58.6

Q ss_pred             eEEEEEcCCCCEEEEeeCCCCCCChHHHHHHHHHHHHHHhcCCCccc-cceEEecCeEEEEEEeceEEEEEEecCCcchH
Q 033621            8 KNILLLDSEGKRVAVKYYSNDWPTNSAKEAFEKAVFTKTQKTNARTE-AEIAMLEGNVIVYKFVQDLHFFVTGGEDENEL   86 (115)
Q Consensus         8 ~ailIld~~Gkr~~sKyY~~~~~~~~~q~~fE~~i~~k~~~~~~~~~-~~i~~~~~~~vVyr~~adl~f~vv~~~~ENEL   86 (115)
                      +=++|+.+.||+++++| .    ++++=..+-.-+...++--...++ -.-+..+++++||..-+-|+++.+...+|.+-
T Consensus        12 kh~fIlS~AGKPIysr~-G----~e~~l~~~~g~~~aiiS~~~~~~d~l~~i~~~~~~ivfl~r~pl~lv~vS~~~e~~~   86 (415)
T PF03164_consen   12 KHFFILSSAGKPIYSRY-G----DEDKLSSLMGVIQAIISFFQSNGDELRSIRAGDHRIVFLNRGPLILVAVSKTGESES   86 (415)
T ss_pred             CeEEEECCCCceeEEec-C----ChHHHHHHHHHHHHHHHHHHhCCCcEEEEEeCCEEEEEEecCCEEEEEEcCCcCCHH
Confidence            45789999999999988 2    333333444444333332211111 23455789999999999999999999999987


Q ss_pred             HHHHHHHHHHHH
Q 033621           87 ILATVLQGFFDA   98 (115)
Q Consensus        87 ~l~~~i~~~vea   98 (115)
                      .|..-++.++.-
T Consensus        87 ~l~~qL~~ly~q   98 (415)
T PF03164_consen   87 QLRKQLDYLYSQ   98 (415)
T ss_pred             HHHHHHHHHHHH
Confidence            777766666543


No 13 
>KOG2740 consensus Clathrin-associated protein medium chain [Intracellular trafficking, secretion, and vesicular transport]
Probab=92.92  E-value=0.35  Score=40.32  Aligned_cols=98  Identities=14%  Similarity=0.201  Sum_probs=63.1

Q ss_pred             eEeEEEEEcCCCCEEEEeeCCCCCCChHHHHHHHHHHHHHHhcCCCccccceEEecCeEEEEEEeceE-EEEEEecCCcc
Q 033621            6 SIKNILLLDSEGKRVAVKYYSNDWPTNSAKEAFEKAVFTKTQKTNARTEAEIAMLEGNVIVYKFVQDL-HFFVTGGEDEN   84 (115)
Q Consensus         6 ~I~ailIld~~Gkr~~sKyY~~~~~~~~~q~~fE~~i~~k~~~~~~~~~~~i~~~~~~~vVyr~~adl-~f~vv~~~~EN   84 (115)
                      ||+++.+.|.+|+..+-|--...    ..-+..+...|+.-++....-+.....--.+--++.-+.+. +|+.+.-.+--
T Consensus         1 mi~siflidtsg~l~lek~~~g~----t~~rsic~~f~e~~~~~~~~e~~ppvi~~p~hylfsv~~~~i~~~~~st~e~p   76 (418)
T KOG2740|consen    1 MILSIFLIDTSGDLLLEKHLKGS----TVVRSICDYFFEDQSSDDDLEHVPPVISTPHHYLFSVYRDLIFFCAVSTVETP   76 (418)
T ss_pred             CeeEEEEEcCCchhhhhHhhCCc----eeeeehHHHHHHhhhhccccccCCceecCCceeeeeeeccCcEEEEEEeccCC
Confidence            79999999999999887765421    12344555555554444321123344444444455555554 44444444445


Q ss_pred             hHHHHHHHHHHHHHHHhhhCCcc
Q 033621           85 ELILATVLQGFFDAVGLLLRLLH  107 (115)
Q Consensus        85 EL~l~~~i~~~veaLd~~f~~~~  107 (115)
                      -|...++++.+++.|..+|+..+
T Consensus        77 PL~~iefL~rv~dv~~eyFg~~s   99 (418)
T KOG2740|consen   77 PLMVIEFLHRVVDVLLEYFGGLS   99 (418)
T ss_pred             ChhHHHHHHHHHHHHHHHhcccC
Confidence            89999999999999999999654


No 14 
>PF03259 Robl_LC7:  Roadblock/LC7 domain;  InterPro: IPR004942 This family includes proteins that are about 100 amino acids long and have been shown to be related []. Members of this family of proteins are associated with both flagellar outer arm dynein and Drosophila and rat brain cytoplasmic dynein. It is proposed that roadblock/LC7 family members may modulate specific dynein functions []. This family also includes Golgi-associated MP1 adapter protein (Q9Y2Q5 from SWISSPROT) and MglB from Myxococcus xanthus (Q50883 from SWISSPROT), a protein involved in gliding motility []. However the family also includes members from non-motile bacteria such as Streptomyces coelicolor, suggesting that the protein may play a structural or regulatory role.; PDB: 2B95_B 1Z09_A 2E8J_B 2HZ5_B 3KYE_A 2ZL1_B 1SKO_B 3CPT_B 1VEU_B 1VET_B ....
Probab=85.58  E-value=4.9  Score=25.20  Aligned_cols=74  Identities=19%  Similarity=0.220  Sum_probs=41.7

Q ss_pred             CcceEeEEEEEcCCCCEEEEeeCCCCCCChHHHHHHHHHHHHHHhcCCC---ccc--cceEEecCeEEEEEEece-EEEE
Q 033621            3 TCPSIKNILLLDSEGKRVAVKYYSNDWPTNSAKEAFEKAVFTKTQKTNA---RTE--AEIAMLEGNVIVYKFVQD-LHFF   76 (115)
Q Consensus         3 sl~~I~ailIld~~Gkr~~sKyY~~~~~~~~~q~~fE~~i~~k~~~~~~---~~~--~~i~~~~~~~vVyr~~ad-l~f~   76 (115)
                      +.+-|++.+++|++|..+.+  ..+...+.+.=.+.-..++..+.+...   .++  .=.+..++..++....++ .+++
T Consensus        11 ~~~gv~~~~l~~~dG~~i~~--~~~~~~~~~~~aa~~a~~~~~~~~~~~~l~~~~~~~v~i~~~~~~i~i~~~~~~~~L~   88 (91)
T PF03259_consen   11 SVPGVRGAVLVDKDGLVIAS--SGIDDDDAEKLAAMAASLLAAAEKLAKELGEGELEQVRIETEKGEIIITPVGDFYLLV   88 (91)
T ss_dssp             HSTTEEEEEEEETTSEEEEE--TSSSHHHHHHHHHHHHHHHHHHHHHHHHHTTSSEEEEEEEESSEEEEEEECSTCEEEE
T ss_pred             CCCCeeEEEEEcCCCCEEEE--ecCCcccHHHHHHHHHHHHHHHHHHHHHhCCCCcEEEEEEECCCEEEEEEcCCCEEEE
Confidence            46789999999999999998  332111122222333444444443211   111  124446777777777777 5555


Q ss_pred             EE
Q 033621           77 VT   78 (115)
Q Consensus        77 vv   78 (115)
                      ++
T Consensus        89 v~   90 (91)
T PF03259_consen   89 VL   90 (91)
T ss_dssp             EE
T ss_pred             EE
Confidence            54


No 15 
>KOG2635 consensus Medium subunit of clathrin adaptor complex [Intracellular trafficking, secretion, and vesicular transport]
Probab=84.81  E-value=15  Score=31.63  Aligned_cols=91  Identities=13%  Similarity=0.162  Sum_probs=67.3

Q ss_pred             eEeEEEEEcCCCCEEEEeeCCCCCCChHHHHHHHHHH--HHHHhcCCCccccceEEecCeEEEEEEeceEEEEEEecCCc
Q 033621            6 SIKNILLLDSEGKRVAVKYYSNDWPTNSAKEAFEKAV--FTKTQKTNARTEAEIAMLEGNVIVYKFVQDLHFFVTGGEDE   83 (115)
Q Consensus         6 ~I~ailIld~~Gkr~~sKyY~~~~~~~~~q~~fE~~i--~~k~~~~~~~~~~~i~~~~~~~vVyr~~adl~f~vv~~~~E   83 (115)
                      ++-|..+....||.++++=+..  +   .|.++|.-|  |.|.-...  .+.-++.-+...-||+-...+|++++...+.
T Consensus         2 vvlaa~i~t~~Gk~ivsRqf~~--M---sr~RIEgLl~aFpkLv~~~--~qhT~vEt~~VRYVYqP~d~lY~vLITtk~S   74 (512)
T KOG2635|consen    2 VVLAASINTKTGKAIVSRQFRE--M---SRSRIEGLLAAFPKLVSAG--KQHTFVETDSVRYVYQPLDNLYIVLITTKQS   74 (512)
T ss_pred             eEEEEEEeecCCceeeehHhHh--h---hHHHHHHHHHHhHHhhccC--CCccEEecccEEEEEEecccEEEEEEecccc
Confidence            3567788999999998876653  2   255666554  34444322  2467888899999999999999999999999


Q ss_pred             chHHHHHHHHHHHHHHHhhh
Q 033621           84 NELILATVLQGFFDAVGLLL  103 (115)
Q Consensus        84 NEL~l~~~i~~~veaLd~~f  103 (115)
                      |=|-=++.|+-|......+.
T Consensus        75 NIleDl~TL~Lfskvipey~   94 (512)
T KOG2635|consen   75 NILEDLETLRLFSKVIPEYC   94 (512)
T ss_pred             chhhHHHHHHHHHHhchhhh
Confidence            98888888877766555443


No 16 
>COG3322 Predicted periplasmic ligand-binding sensor domain [Signal transduction mechanisms]
Probab=75.98  E-value=10  Score=30.62  Aligned_cols=23  Identities=22%  Similarity=0.463  Sum_probs=20.8

Q ss_pred             eEeEEEEEcCCCCEEEEeeCCCC
Q 033621            6 SIKNILLLDSEGKRVAVKYYSND   28 (115)
Q Consensus         6 ~I~ailIld~~Gkr~~sKyY~~~   28 (115)
                      -+.+++++|.+|+.+++|.+++.
T Consensus       103 g~d~vf~vd~~G~~vy~~~~d~~  125 (295)
T COG3322         103 GLDGVFVVDPSGKLVYSKLVDQE  125 (295)
T ss_pred             CccEEEEECCCCCEEEEeeeccc
Confidence            47899999999999999999874


No 17 
>PF09695 YtfJ_HI0045:  Bacterial protein of unknown function (YtfJ_HI0045);  InterPro: IPR006513 These are sequences from gammaproteobacteria that are related to the Escherichia coli protein, YtfJ. 
Probab=67.60  E-value=15  Score=27.26  Aligned_cols=30  Identities=23%  Similarity=0.225  Sum_probs=22.5

Q ss_pred             eEEEEEcCCCCEEEEeeCCCCCCChHHHHHHHH
Q 033621            8 KNILLLDSEGKRVAVKYYSNDWPTNSAKEAFEK   40 (115)
Q Consensus         8 ~ailIld~~Gkr~~sKyY~~~~~~~~~q~~fE~   40 (115)
                      .+|+++|++|+.+++|.=.   .+.++.+++-.
T Consensus       126 SaiiVlDK~G~V~F~k~G~---Ls~~Ev~qVi~  155 (160)
T PF09695_consen  126 SAIIVLDKQGKVQFVKEGA---LSPAEVQQVIA  155 (160)
T ss_pred             ceEEEEcCCccEEEEECCC---CCHHHHHHHHH
Confidence            5899999999999998643   46666655543


No 18 
>COG2018 Uncharacterized distant relative of homeotic protein bithoraxoid [General function prediction only]
Probab=65.31  E-value=46  Score=23.53  Aligned_cols=80  Identities=19%  Similarity=0.322  Sum_probs=53.2

Q ss_pred             CcceEeEEEEEcCCCCEEEEeeCCCCCCChHHHHHHHHHHHHHHhcCCC---ccccceEE---ecCeEEEEEEeceEEEE
Q 033621            3 TCPSIKNILLLDSEGKRVAVKYYSNDWPTNSAKEAFEKAVFTKTQKTNA---RTEAEIAM---LEGNVIVYKFVQDLHFF   76 (115)
Q Consensus         3 sl~~I~ailIld~~Gkr~~sKyY~~~~~~~~~q~~fE~~i~~k~~~~~~---~~~~~i~~---~~~~~vVyr~~adl~f~   76 (115)
                      +++-|++.+|.+.+|=.+.+..-.+  .+.+.=.+.-..++....+...   -++-+-+.   -++++++|+.-.+.++.
T Consensus        15 ~v~gv~ga~Ivs~DGL~ia~~~p~~--~d~e~vaA~~a~~~g~~er~~~~l~~g~leqi~I~g~~g~i~l~~~g~~~il~   92 (119)
T COG2018          15 NVPGVRGALVVSKDGLPIAAELPGN--VDAEIVAAMAATALGLAERAADELGGGELEQIMIEGKKGKILLYDAGDDAILV   92 (119)
T ss_pred             hccCceEEEEEccCCceEeecCCCc--ccHHHHHHHHHHHHHHhHHHHHHhCCCCceEEEEeccccEEEEEEcCCceEEE
Confidence            3788999999999999999866443  2334445555666665554332   11223333   35678888887799999


Q ss_pred             EEecCCcc
Q 033621           77 VTGGEDEN   84 (115)
Q Consensus        77 vv~~~~EN   84 (115)
                      ++.+...|
T Consensus        93 ~~a~~~~n  100 (119)
T COG2018          93 VLADEGTN  100 (119)
T ss_pred             EEcCCCCc
Confidence            98887766


No 19 
>PF05228 CHASE4:  CHASE4 domain;  InterPro: IPR007892 CHASE4 is an extracellular sensory domain, which is present in various classes of transmembrane receptors that are upstream of signal transduction pathways in prokaryotes. Specifically, CHASE4 domains are found in histidine kinases in archaea and in predicted diguanylate cyclases/phosphodiesterases in bacteria. Environmental factors that are recognised by CHASE4 domains are not known at this time [].
Probab=54.19  E-value=42  Score=23.03  Aligned_cols=24  Identities=25%  Similarity=0.362  Sum_probs=19.9

Q ss_pred             cceEeEEEEEcCCCCEEE--EeeCCC
Q 033621            4 CPSIKNILLLDSEGKRVA--VKYYSN   27 (115)
Q Consensus         4 l~~I~ailIld~~Gkr~~--sKyY~~   27 (115)
                      -..+.+++++|.+|+.++  ++.++.
T Consensus        47 ~~~~d~~~~~d~~g~~~~~~~~~~~~   72 (161)
T PF05228_consen   47 NFGLDLIFILDPDGRVLYSSSKGYDF   72 (161)
T ss_pred             hcCccEEEEEcCCCCEEEEeccCccc
Confidence            356889999999999999  766655


No 20 
>PF09561 RE_HpaII:  HpaII restriction endonuclease;  InterPro: IPR019062 There are four classes of restriction endonucleases: types I, II,III and IV. All types of enzymes recognise specific short DNA sequences and carry out the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. They differ in their recognition sequence, subunit composition, cleavage position, and cofactor requirements [, ], as summarised below:   Type I enzymes (3.1.21.3 from EC) cleave at sites remote from recognition site; require both ATP and S-adenosyl-L-methionine to function; multifunctional protein with both restriction and methylase (2.1.1.72 from EC) activities. Type II enzymes (3.1.21.4 from EC) cleave within or at short specific distances from recognition site; most require magnesium; single function (restriction) enzymes independent of methylase. Type III enzymes (3.1.21.5 from EC) cleave at sites a short distance from recognition site; require ATP (but doesn't hydrolyse it); S-adenosyl-L-methionine stimulates reaction but is not required; exists as part of a complex with a modification methylase methylase (2.1.1.72 from EC). Type IV enzymes target methylated DNA.   Type II restriction endonucleases (3.1.21.4 from EC) are components of prokaryotic DNA restriction-modification mechanisms that protect the organism against invading foreign DNA. These site-specific deoxyribonucleases catalyse the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. Of the 3000 restriction endonucleases that have been characterised, most are homodimeric or tetrameric enzymes that cleave target DNA at sequence-specific sites close to the recognition site. For homodimeric enzymes, the recognition site is usually a palindromic sequence 4-8 bp in length. Most enzymes require magnesium ions as a cofactor for catalysis. Although they can vary in their mode of recognition, many restriction endonucleases share a similar structural core comprising four beta-strands and one alpha-helix, as well as a similar mechanism of cleavage, suggesting a common ancestral origin []. However, there is still considerable diversity amongst restriction endonucleases [, ]. The target site recognition process triggers large conformational changes of the enzyme and the target DNA, leading to the activation of the catalytic centres. Like other DNA binding proteins, restriction enzymes are capable of non-specific DNA binding as well, which is the prerequisite for efficient target site location by facilitated diffusion. Non-specific binding usually does not involve interactions with the bases but only with the DNA backbone [].   This family includes HpaII, which recognises the double-stranded sequence CCGG and cleaves after C-1. 
Probab=48.06  E-value=85  Score=26.15  Aligned_cols=57  Identities=12%  Similarity=0.302  Sum_probs=40.6

Q ss_pred             eEeEEEEEcCCCCEEEEeeCCCCCCChHHHHHHHHHHHHHHhcCCCccccceEEecCeEEEEEEeceEEE
Q 033621            6 SIKNILLLDSEGKRVAVKYYSNDWPTNSAKEAFEKAVFTKTQKTNARTEAEIAMLEGNVIVYKFVQDLHF   75 (115)
Q Consensus         6 ~I~ailIld~~Gkr~~sKyY~~~~~~~~~q~~fE~~i~~k~~~~~~~~~~~i~~~~~~~vVyr~~adl~f   75 (115)
                      ..-|.+|++.+|..++=+-|+        |..||.-+|.-|+=..+     =..=.++--+||.-+..||
T Consensus       291 a~gGyivV~~dGevlcYHiy~--------rn~FedYL~~NTkle~~-----SssRh~fG~iy~enG~~yf  347 (355)
T PF09561_consen  291 ATGGYIVVKEDGEVLCYHIYN--------RNEFEDYLYNNTKLETP-----SSSRHKFGFIYRENGEYYF  347 (355)
T ss_pred             ccceEEEEeCCCCEEEEEehh--------HHHHHHHHHhcccccCC-----CCCcccceEEEEECCEEEE
Confidence            356899999999998876664        78999999998873221     0111456668887777665


No 21 
>PF05323 Pox_A21:  Poxvirus A21 Protein;  InterPro: IPR007987 This family consists of several poxvirus A21 proteins.
Probab=42.95  E-value=65  Score=22.54  Aligned_cols=38  Identities=21%  Similarity=0.389  Sum_probs=31.9

Q ss_pred             CChHHHHHHHHHHHHHHhcCCCccccceEEecCeEEEEEEec
Q 033621           30 PTNSAKEAFEKAVFTKTQKTNARTEAEIAMLEGNVIVYKFVQ   71 (115)
Q Consensus        30 ~~~~~q~~fE~~i~~k~~~~~~~~~~~i~~~~~~~vVyr~~a   71 (115)
                      |...++.+.|+..|.+.++.+    ..+.-.||+...|+.-.
T Consensus        20 P~I~eKLr~E~~ay~~~k~~~----~~~~CVdd~lf~y~f~~   57 (111)
T PF05323_consen   20 PAISEKLRLEHDAYVKYKNLK----KKFICVDDRLFSYKFNT   57 (111)
T ss_pred             HHHHHHHHHHHHHHHHHHhcC----CcEEEECCEEEEEEccc
Confidence            456789999999999999874    47888999999998644


No 22 
>PF13774 Longin:  Regulated-SNARE-like domain; PDB: 1IOU_A 3BW6_A 1H8M_A 3EGX_C 2NUP_C 3EGD_C 2NUT_C 3KYQ_A 1IFQ_B 2VX8_D ....
Probab=42.89  E-value=83  Score=19.65  Aligned_cols=63  Identities=5%  Similarity=0.027  Sum_probs=45.9

Q ss_pred             HHHHHHHhcCCCccccceEEecCeEEEEEEeceEEEEEEecCCcchHHHHHHHHHHHHHHHhhhC
Q 033621           40 KAVFTKTQKTNARTEAEIAMLEGNVIVYKFVQDLHFFVTGGEDENELILATVLQGFFDAVGLLLR  104 (115)
Q Consensus        40 ~~i~~k~~~~~~~~~~~i~~~~~~~vVyr~~adl~f~vv~~~~ENEL~l~~~i~~~veaLd~~f~  104 (115)
                      ++|-+++.+.+.  .-..+..+++..=|..-.++.+++++|.+-..-.-..+|+.+.+....-++
T Consensus         3 ~~il~~i~~~~~--~k~s~~~~~~~fh~~~~~~i~~~citd~~~~~r~aF~fL~~i~~~F~~~~~   65 (83)
T PF13774_consen    3 RKILKRIPPNGN--SKMSYESGNYVFHYLVEDGIAYLCITDKSYPKRVAFAFLEEIKQEFIQTYG   65 (83)
T ss_dssp             HHHHHTS-TTSE--SEEEEEETTEEEEEEEETTEEEEEEEETTS-HHHHHHHHHHHHHHHHHHCT
T ss_pred             HHHHHhcCCCCC--CeEEEEECCEEEEEEEcCCeEEEEEEcCCCCcchHHHHHHHHHHHHHHHcC
Confidence            455555554331  136777899999999999999999999998888888888877777666555


No 23 
>PF04315 DUF462:  Protein of unknown function, DUF462;  InterPro: IPR007411 This family consists of bacterial proteins of uncharacterised function.
Probab=42.50  E-value=21  Score=26.59  Aligned_cols=20  Identities=20%  Similarity=0.277  Sum_probs=16.4

Q ss_pred             EEeeCCCCCCChHHHHHHHH
Q 033621           21 AVKYYSNDWPTNSAKEAFEK   40 (115)
Q Consensus        21 ~sKyY~~~~~~~~~q~~fE~   40 (115)
                      +.=||.|..-+.++|++||+
T Consensus        65 fGYWY~PDGR~~~qQ~~FE~   84 (164)
T PF04315_consen   65 FGYWYCPDGRDAEQQAEFEQ   84 (164)
T ss_pred             CCCCcCCCCCCHHHHHHHHh
Confidence            34588887778999999997


No 24 
>cd03016 PRX_1cys Peroxiredoxin (PRX) family, 1-cys PRX subfamily; composed of PRXs containing only one conserved cysteine, which serves as the peroxidatic cysteine. They are homodimeric thiol-specific antioxidant (TSA) proteins that confer a protective role in cells by reducing and detoxifying hydrogen peroxide, peroxynitrite, and organic hydroperoxides. As with all other PRXs, a cysteine sulfenic acid intermediate is formed upon reaction of 1-cys PRX with its substrates. Having no resolving cysteine, the oxidized enzyme is resolved by an external small-molecule or protein reductant such as thioredoxin or glutaredoxin. Similar to typical 2-cys PRX, 1-cys PRX forms a functional dimeric unit with a B-type interface, as well as a decameric structure which is stabilized in the reduced form of the enzyme. Other oligomeric forms, tetramers and hexamers, have also been reported. Mammalian 1-cys PRX is localized cellularly in the cytosol and is expressed at high levels in brain, eye, testes an
Probab=39.96  E-value=57  Score=24.10  Aligned_cols=22  Identities=18%  Similarity=0.587  Sum_probs=19.1

Q ss_pred             eEeEEEEEcCCCCEEEEeeCCC
Q 033621            6 SIKNILLLDSEGKRVAVKYYSN   27 (115)
Q Consensus         6 ~I~ailIld~~Gkr~~sKyY~~   27 (115)
                      .+.+.+|+|.+|+.+...+|..
T Consensus       116 ~~r~~fiID~~G~I~~~~~~~~  137 (203)
T cd03016         116 TVRAVFIIDPDKKIRLILYYPA  137 (203)
T ss_pred             eeeEEEEECCCCeEEEEEecCC
Confidence            3678999999999999998864


No 25 
>cd04670 Nudix_Hydrolase_12 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate 
Probab=36.71  E-value=1.2e+02  Score=19.87  Aligned_cols=41  Identities=15%  Similarity=0.161  Sum_probs=28.3

Q ss_pred             EEEEEcCCCCEEEEeeCC--CC---------CCChHHHHHHHHHHHHHHhcC
Q 033621            9 NILLLDSEGKRVAVKYYS--ND---------WPTNSAKEAFEKAVFTKTQKT   49 (115)
Q Consensus         9 ailIld~~Gkr~~sKyY~--~~---------~~~~~~q~~fE~~i~~k~~~~   49 (115)
                      +.+|+|.+|+.++.|.+.  +.         -....-..+..|++.+.+.-.
T Consensus         6 ~~~v~~~~~~vLl~~r~~~~~~~w~~PGG~ve~gEt~~~aa~RE~~EE~Gl~   57 (127)
T cd04670           6 GGLVLNEKNEVLVVQERNKTPNGWKLPGGLVDPGEDIFDGAVREVLEETGID   57 (127)
T ss_pred             EEEEEcCCCeEEEEEccCCCCCcEECCCccCCCCCCHHHHHHHHHHHHHCCC
Confidence            346778889999887665  21         034456788889988887743


No 26 
>COG0450 AhpC Peroxiredoxin [Posttranslational modification, protein turnover, chaperones]
Probab=36.31  E-value=1.2e+02  Score=23.16  Aligned_cols=23  Identities=9%  Similarity=0.254  Sum_probs=21.1

Q ss_pred             eEeEEEEEcCCCCEEEEeeCCCC
Q 033621            6 SIKNILLLDSEGKRVAVKYYSND   28 (115)
Q Consensus         6 ~I~ailIld~~Gkr~~sKyY~~~   28 (115)
                      .++|++|+|.+|.+|..-+|+.+
T Consensus       123 a~R~~FIIDp~g~ir~~~v~~~~  145 (194)
T COG0450         123 ALRGTFIIDPDGVIRHILVNPLT  145 (194)
T ss_pred             ceeEEEEECCCCeEEEEEEecCC
Confidence            68999999999999999999864


No 27 
>PF13503 DUF4123:  Domain of unknown function (DUF4123)
Probab=34.31  E-value=61  Score=21.38  Aligned_cols=18  Identities=17%  Similarity=0.466  Sum_probs=14.6

Q ss_pred             EEEcCCCCEEEEeeCCCC
Q 033621           11 LLLDSEGKRVAVKYYSND   28 (115)
Q Consensus        11 lIld~~Gkr~~sKyY~~~   28 (115)
                      .+...+|+..+-|||+|.
T Consensus        89 ~v~~~~g~~~llRfyDpr  106 (124)
T PF13503_consen   89 QVRLPDGRSVLLRFYDPR  106 (124)
T ss_pred             eeECCCCCEEEEEcCCHH
Confidence            455667999999999983


No 28 
>PRK13190 putative peroxiredoxin; Provisional
Probab=34.27  E-value=58  Score=24.12  Aligned_cols=23  Identities=13%  Similarity=0.459  Sum_probs=20.3

Q ss_pred             eEeEEEEEcCCCCEEEEeeCCCC
Q 033621            6 SIKNILLLDSEGKRVAVKYYSND   28 (115)
Q Consensus         6 ~I~ailIld~~Gkr~~sKyY~~~   28 (115)
                      .+.+.+|+|.+|+.+.+.+|...
T Consensus       116 ~~p~~fiId~~G~I~~~~~~~~~  138 (202)
T PRK13190        116 TVRGVFIIDPNQIVRWMIYYPAE  138 (202)
T ss_pred             EEeEEEEECCCCEEEEEEEeCCC
Confidence            47899999999999999988764


No 29 
>COG3054 Predicted transcriptional regulator [General function prediction only]
Probab=33.27  E-value=46  Score=24.99  Aligned_cols=17  Identities=29%  Similarity=0.495  Sum_probs=15.0

Q ss_pred             eEEEEEcCCCCEEEEee
Q 033621            8 KNILLLDSEGKRVAVKY   24 (115)
Q Consensus         8 ~ailIld~~Gkr~~sKy   24 (115)
                      .||+++|++|+.-++|=
T Consensus       148 SaivVlDk~G~Vkfvke  164 (184)
T COG3054         148 SAVVVLDKDGRVKFVKE  164 (184)
T ss_pred             ceEEEEcCCCcEEEEec
Confidence            48999999999988874


No 30 
>PRK13599 putative peroxiredoxin; Provisional
Probab=32.48  E-value=56  Score=24.68  Aligned_cols=22  Identities=18%  Similarity=0.646  Sum_probs=19.4

Q ss_pred             eEeEEEEEcCCCCEEEEeeCCC
Q 033621            6 SIKNILLLDSEGKRVAVKYYSN   27 (115)
Q Consensus         6 ~I~ailIld~~Gkr~~sKyY~~   27 (115)
                      .+.+++|+|.+|+.+.+-+|..
T Consensus       118 ~~R~tfIID~dG~Ir~~~~~p~  139 (215)
T PRK13599        118 TVRAVFIVDDKGTIRLIMYYPQ  139 (215)
T ss_pred             eeeEEEEECCCCEEEEEEEcCC
Confidence            5899999999999999987754


No 31 
>PHA03023 hypothetical protein; Provisional
Probab=31.63  E-value=1.1e+02  Score=21.43  Aligned_cols=39  Identities=13%  Similarity=0.228  Sum_probs=32.0

Q ss_pred             CChHHHHHHHHHHHHHHh-cCCCccccceEEecCeEEEEEEece
Q 033621           30 PTNSAKEAFEKAVFTKTQ-KTNARTEAEIAMLEGNVIVYKFVQD   72 (115)
Q Consensus        30 ~~~~~q~~fE~~i~~k~~-~~~~~~~~~i~~~~~~~vVyr~~ad   72 (115)
                      |...++-+.|++.|.+.+ +.+    .++.-.+|+...|+.-..
T Consensus        20 p~I~eKLr~E~~ay~~y~~~L~----~~~~Cv~d~lf~y~f~~~   59 (112)
T PHA03023         20 NKELNKLELDKKIFENAEDDLK----TTISCINDHWFIVKRNSR   59 (112)
T ss_pred             HHHHHHHHHHHHHHHHHHHhcC----CcEEEECCEEEEEEcCch
Confidence            456789999999999998 443    378889999999998665


No 32 
>PHA03045 IMV membrane protein; Provisional
Probab=31.58  E-value=1.2e+02  Score=21.33  Aligned_cols=39  Identities=21%  Similarity=0.336  Sum_probs=31.2

Q ss_pred             CChHHHHHHHHHHHHHHhcCCCccccceEEecCeEEEEEEece
Q 033621           30 PTNSAKEAFEKAVFTKTQKTNARTEAEIAMLEGNVIVYKFVQD   72 (115)
Q Consensus        30 ~~~~~q~~fE~~i~~k~~~~~~~~~~~i~~~~~~~vVyr~~ad   72 (115)
                      |...++.+.|+..|.+-++..    .++.-.||+.+.|+.-..
T Consensus        20 p~I~eKLr~E~~ay~~~~~l~----~~~~CVdd~~f~y~f~~~   58 (113)
T PHA03045         20 PAISEKLRREYDAYTKYKQLK----KNFICVDDRLFSYDFTTS   58 (113)
T ss_pred             HHHHHHHHHHHHHHHHHHHcC----CcEEEECCEEEEEEccch
Confidence            456789999999999966554    368889999999987554


No 33 
>PF10623 PilI:  Plasmid conjugative transfer protein PilI;  InterPro: IPR018897  The thin pilus of plasmid R64 belongs to the type IV family and is required for liquid matings. PilI is one of 14 genes that have been identified as being involved in biogenesis of the R64 thin pilus []. 
Probab=30.14  E-value=76  Score=21.04  Aligned_cols=15  Identities=13%  Similarity=0.330  Sum_probs=12.9

Q ss_pred             EEEEEcCCCCEEEEe
Q 033621            9 NILLLDSEGKRVAVK   23 (115)
Q Consensus         9 ailIld~~Gkr~~sK   23 (115)
                      -||++|++|++.+.-
T Consensus        10 ~VLVv~n~c~~kL~~   24 (83)
T PF10623_consen   10 QVLVVSNHCERKLFD   24 (83)
T ss_pred             EEEEEeCCcceeEee
Confidence            489999999999873


No 34 
>cd04669 Nudix_Hydrolase_11 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate 
Probab=29.50  E-value=1.7e+02  Score=19.32  Aligned_cols=39  Identities=10%  Similarity=0.276  Sum_probs=26.8

Q ss_pred             EEEEEcCCCCEEEEeeCCCC-------C----CChHHHHHHHHHHHHHHh
Q 033621            9 NILLLDSEGKRVAVKYYSND-------W----PTNSAKEAFEKAVFTKTQ   47 (115)
Q Consensus         9 ailIld~~Gkr~~sKyY~~~-------~----~~~~~q~~fE~~i~~k~~   47 (115)
                      +++|+|.+|+.++.|-+.+.       .    ....-+.+..|++...+.
T Consensus         4 ~~ii~~~~~~vLL~~r~~~~~~~w~lPGG~ve~gEs~~~a~~REl~EEtG   53 (121)
T cd04669           4 SIVIINDQGEILLIRRIKPGKTYYVFPGGGIEEGETPEEAAKREALEELG   53 (121)
T ss_pred             EEEEEeCCCEEEEEEEecCCCCcEECCceeccCCCCHHHHHHHHHHHhhC
Confidence            45677778999998865431       1    344567888888888775


No 35 
>cd04685 Nudix_Hydrolase_26 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily requires a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate
Probab=29.29  E-value=1.9e+02  Score=19.80  Aligned_cols=41  Identities=22%  Similarity=0.308  Sum_probs=29.9

Q ss_pred             EEEEEcCCCCEEEEeeCCCC--------------CCChHHHHHHHHHHHHHHhcC
Q 033621            9 NILLLDSEGKRVAVKYYSND--------------WPTNSAKEAFEKAVFTKTQKT   49 (115)
Q Consensus         9 ailIld~~Gkr~~sKyY~~~--------------~~~~~~q~~fE~~i~~k~~~~   49 (115)
                      .++++|.+|+.++.|...+.              -....-..+..|++...+.-.
T Consensus         4 ~~~i~~~~g~vLl~r~~~~~~~~~~~w~~PgG~ve~gE~~~~a~~Re~~EE~G~~   58 (133)
T cd04685           4 RVVLLDPDDRVLLLRGDDPDSPGPDWWFTPGGGVEPGESPEQAARRELREETGIT   58 (133)
T ss_pred             EEEEEcCCCeEEEEEEeCCCCCCCCEEECCcCCCCCCCCHHHHHHHHHHHHHCCc
Confidence            57889999999998876431              034456788888888887644


No 36 
>PF01820 Dala_Dala_lig_N:  D-ala D-ala ligase N-terminus;  InterPro: IPR011127 This entry represents the N-terminal region of the D-alanine--D-alanine ligase enzyme (6.3.2.4 from EC) which is thought to be involved in substrate binding []. D-Alanine is one of the central molecules of the cross-linking step of peptidoglycan assembly. There are three enzymes involved in the D-alanine branch of peptidoglycan biosynthesis: the pyridoxal phosphate-dependent D-alanine racemase (Alr), the ATP-dependent D-alanine:D-alanine ligase (Ddl), and the ATP-dependent D-alanine:D-alanine-adding enzyme (MurF) [].; GO: 0008716 D-alanine-D-alanine ligase activity, 0009252 peptidoglycan biosynthetic process, 0005618 cell wall; PDB: 4EG0_B 3E5N_A 3RFC_A 3R5F_A 1IOV_A 1IOW_A 2DLN_A 3Q1K_D 3I12_C 3N8D_B ....
Probab=27.87  E-value=1.5e+02  Score=20.07  Aligned_cols=28  Identities=18%  Similarity=0.363  Sum_probs=24.0

Q ss_pred             EEEEEEecCCcchHHHHHHHHHHHHHHHh
Q 033621           73 LHFFVTGGEDENELILATVLQGFFDAVGL  101 (115)
Q Consensus        73 l~f~vv~~~~ENEL~l~~~i~~~veaLd~  101 (115)
                      +.+++.|.+.|-|..+.+.-+ +.++|+.
T Consensus         3 v~vlfGG~S~EheVSl~Sa~~-v~~~L~~   30 (117)
T PF01820_consen    3 VAVLFGGRSSEHEVSLRSARN-VYEALDK   30 (117)
T ss_dssp             EEEEEETSSTTHHHHHHHHHH-HHHHSHT
T ss_pred             EEEEeccCchhHHHHHHHHHH-HHHHHhh
Confidence            678889999999999999877 7788875


No 37 
>cd03671 Ap4A_hydrolase_plant_like Diadenosine tetraphosphate (Ap4A) hydrolase is a member of the Nudix hydrolase superfamily. Members of this family are well represented in a variety of prokaryotic and eukaryotic organisms. Phylogenetic analysis reveals two distinct subgroups where plant enzymes fall into one group (represented by this subfamily) and fungi/animals/archaea enzymes fall into another. Bacterial enzymes are found in both subfamilies. Ap4A is a potential by-product of aminoacyl tRNA synthesis, and accumulation of Ap4A has been implicated in a range of biological events, such as DNA replication, cellular differentiation, heat shock, metabolic stress, and apoptosis. Ap4A hydrolase cleaves Ap4A asymmetrically into ATP and AMP. It is important in the invasive properties of bacteria and thus presents a potential target for the inhibition of such invasive bacteria. Besides the signature nudix motif (G[X5]E[X7]REUXEEXGU where U is Ile, Leu, or Val), Ap4A hydrolase is structurally 
Probab=27.41  E-value=2.1e+02  Score=19.58  Aligned_cols=41  Identities=12%  Similarity=0.160  Sum_probs=30.1

Q ss_pred             EEEEEcCCCCEEEEeeCCCC-C---------CChHHHHHHHHHHHHHHhcC
Q 033621            9 NILLLDSEGKRVAVKYYSND-W---------PTNSAKEAFEKAVFTKTQKT   49 (115)
Q Consensus         9 ailIld~~Gkr~~sKyY~~~-~---------~~~~~q~~fE~~i~~k~~~~   49 (115)
                      +++|+|.+|+.++.|.+... .         +...-..+..|++.+-+.-+
T Consensus         7 ~~ii~~~~~~vLL~~r~~~~~~W~~PgG~~e~gE~~~~aA~REv~EEtGl~   57 (147)
T cd03671           7 GVVLFNEDGKVFVGRRIDTPGAWQFPQGGIDEGEDPEQAALRELEEETGLD   57 (147)
T ss_pred             EEEEEeCCCEEEEEEEcCCCCCEECCcCCCCCCcCHHHHHHHHHHHHHCCC
Confidence            57788989999998877642 1         33456788888888887754


No 38 
>PF14903 WG_beta_rep:  WG containing repeat
Probab=27.36  E-value=68  Score=16.33  Aligned_cols=16  Identities=31%  Similarity=0.484  Sum_probs=13.0

Q ss_pred             EEcCCCCEEEEeeCCC
Q 033621           12 LLDSEGKRVAVKYYSN   27 (115)
Q Consensus        12 Ild~~Gkr~~sKyY~~   27 (115)
                      ++|.+|+.++.--|+.
T Consensus         3 ~id~~G~~vi~~~yd~   18 (35)
T PF14903_consen    3 YIDKNGKIVIPPKYDE   18 (35)
T ss_pred             EEeCCCCEEEEccccC
Confidence            5799999998877764


No 39 
>cd04678 Nudix_Hydrolase_19 Members of the Nudix hydrolase superfamily catalyze the hydrolysis of NUcleoside DIphosphates linked to other moieties, X. Enzymes belonging to this superfamily require a divalent cation, such as Mg2+ or Mn2+, for their activity and contain a highly conserved 23-residue nudix motif (GX5EX7REUXEEXGU, where U = I, L or V), which functions as a metal binding and catalytic site. Substrates of nudix hydrolases include intact and oxidatively damaged nucleoside triphosphates, dinucleoside polyphosphates, nucleotide-sugars and dinucleotide enzymes. These substrates are metabolites or cell signaling molecules that require regulation during different stages of the cell cycle or during periods of stress. In general, the role of the nudix hydrolase is to sanitize the nucleotide pools and to maintain cell viability, thereby serving as surveillance & "house-cleaning" enzymes. Substrate specificity is used to define families within the superfamily. Differences in substrate 
Probab=26.79  E-value=1.7e+02  Score=19.17  Aligned_cols=40  Identities=20%  Similarity=0.192  Sum_probs=26.7

Q ss_pred             EEEEEcCCCCEEEEeeCC----C----C----CCChHHHHHHHHHHHHHHhc
Q 033621            9 NILLLDSEGKRVAVKYYS----N----D----WPTNSAKEAFEKAVFTKTQK   48 (115)
Q Consensus         9 ailIld~~Gkr~~sKyY~----~----~----~~~~~~q~~fE~~i~~k~~~   48 (115)
                      +++|.|.+|+.++.|=-.    .    +    -+...-..+..|++.+.+.-
T Consensus         6 ~~ii~~~~~~iLl~~r~~~~~~~~w~~PGG~ve~gEt~~~Aa~REl~EE~Gl   57 (129)
T cd04678           6 GVFVLNPKGKVLLGKRKGSHGAGTWALPGGHLEFGESFEECAAREVLEETGL   57 (129)
T ss_pred             EEEEECCCCeEEEEeccCCCCCCeEECCcccccCCCCHHHHHHHHHHHHhCC
Confidence            567788889988887331    1    1    13445678888888877664


No 40 
>PRK13191 putative peroxiredoxin; Provisional
Probab=26.77  E-value=81  Score=23.76  Aligned_cols=23  Identities=17%  Similarity=0.576  Sum_probs=20.3

Q ss_pred             ceEeEEEEEcCCCCEEEEeeCCC
Q 033621            5 PSIKNILLLDSEGKRVAVKYYSN   27 (115)
Q Consensus         5 ~~I~ailIld~~Gkr~~sKyY~~   27 (115)
                      ..+.+.+|+|.+|+.+..-+|+.
T Consensus       122 ~~~r~tfIID~~G~Ir~~~~~~~  144 (215)
T PRK13191        122 ATVRAVFIVDDKGTVRLILYYPM  144 (215)
T ss_pred             ceeEEEEEECCCCEEEEEEecCC
Confidence            35789999999999999999874


No 41 
>PF15404 PH_4:  Pleckstrin homology domain
Probab=26.18  E-value=42  Score=25.31  Aligned_cols=29  Identities=14%  Similarity=0.417  Sum_probs=23.6

Q ss_pred             HHHHHhcCCCccccceEEecCeEEEEEEe
Q 033621           42 VFTKTQKTNARTEAEIAMLEGNVIVYKFV   70 (115)
Q Consensus        42 i~~k~~~~~~~~~~~i~~~~~~~vVyr~~   70 (115)
                      ||+|-.+.+.+....+++..|++|+|+.+
T Consensus         5 LY~K~~khs~F~~~~vvL~~G~Li~f~~~   33 (185)
T PF15404_consen    5 LYQKPRKHSTFKKYFVVLIPGFLILFQLF   33 (185)
T ss_pred             eeecCCCCCCceEEEEEEeCCEEEEEEEE
Confidence            56777777777678999999999999974


No 42 
>cd08151 AOS Allene oxide synthase. Allene oxide synthase converts a fatty acid hydroperoxide to an allene oxide, which is an unstable epoxide. In corals, the enzyme is part of a eiconaosid synthesis pathway that is initiated by a lipoxygenase, which generates the fatty acid hydroperoxides in the first step. The structure of allene oxide synthase closely resembles that of catalase, but allene oxide synthase does not have catalase activity.
Probab=26.10  E-value=83  Score=25.34  Aligned_cols=23  Identities=13%  Similarity=0.069  Sum_probs=20.4

Q ss_pred             CcceEeEEEEEcCCCCEEEEeeC
Q 033621            3 TCPSIKNILLLDSEGKRVAVKYY   25 (115)
Q Consensus         3 sl~~I~ailIld~~Gkr~~sKyY   25 (115)
                      .=+.++++-..|.+|+++++||=
T Consensus       158 ~y~s~~~f~f~~~~G~~~~vK~~  180 (328)
T cd08151         158 HYYSQICYEFVALDGKSRYARFR  180 (328)
T ss_pred             cceeEeeEEEECCCCCEEEEEEE
Confidence            34678999999999999999996


No 43 
>PRK00714 RNA pyrophosphohydrolase; Reviewed
Probab=25.34  E-value=2.2e+02  Score=19.98  Aligned_cols=41  Identities=15%  Similarity=0.323  Sum_probs=29.3

Q ss_pred             EEEEEcCCCCEEEEeeCCCCC----------CChHHHHHHHHHHHHHHhcC
Q 033621            9 NILLLDSEGKRVAVKYYSNDW----------PTNSAKEAFEKAVFTKTQKT   49 (115)
Q Consensus         9 ailIld~~Gkr~~sKyY~~~~----------~~~~~q~~fE~~i~~k~~~~   49 (115)
                      +++++|.+|+.++.|....+.          +...-..+..|++.+.+.-.
T Consensus        12 ~~~i~~~~g~vLL~~r~~~~~~w~~P~G~~~~gE~~~~aa~REl~EEtG~~   62 (156)
T PRK00714         12 GIILLNRQGQVFWGRRIGQGHSWQFPQGGIDPGETPEQAMYRELYEEVGLR   62 (156)
T ss_pred             EEEEEecCCEEEEEEEcCCCCeEECCcccCCCCcCHHHHHHHHHHHHhCCC
Confidence            567999999999988865421          34445777888888877643


No 44 
>PF03456 uDENN:  uDENN domain;  InterPro: IPR005113 This region is always found associated with IPR001194 from INTERPRO. It is predicted to form an all beta domain [].; PDB: 3TW8_A.
Probab=25.25  E-value=1.1e+02  Score=18.31  Aligned_cols=19  Identities=26%  Similarity=0.424  Sum_probs=13.5

Q ss_pred             cceEeEEEEEcCCCCEEEE
Q 033621            4 CPSIKNILLLDSEGKRVAV   22 (115)
Q Consensus         4 l~~I~ailIld~~Gkr~~s   22 (115)
                      .++...+++-|.+|+|++.
T Consensus        41 ~~~~f~FvLT~~~G~r~Yg   59 (65)
T PF03456_consen   41 PPQFFSFVLTDEDGSRLYG   59 (65)
T ss_dssp             SSCEEEEEEE-TTS-EEEE
T ss_pred             CCeEEEEEEECCCCCEEEE
Confidence            4567788999999999864


No 45 
>PF00989 PAS:  PAS fold;  InterPro: IPR013767 PAS domains are involved in many signalling proteins where they are used as a signal sensor domain []. PAS domains appear in archaea, bacteria and eukaryotes. Several PAS-domain proteins are known to detect their signal by way of an associated cofactor. Haeme, flavin, and a 4-hydroxycinnamyl chromophore are used in different proteins. The PAS domain was named after three proteins that it occurs in:  Per- period circadian protein Arnt- Ah receptor nuclear translocator protein Sim- single-minded protein. PAS domains are often associated with PAC domains IPR001610 from INTERPRO. It appears that these domains are directly linked, and that together they form the conserved 3D PAS fold. The division between the PAS and PAC domains is caused by major differences in sequences in the region connecting these two motifs []. In human PAS kinase, this region has been shown to be very flexible, and adopts different conformations depending on the bound ligand []. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels [].; GO: 0006355 regulation of transcription, DNA-dependent; PDB: 2GJ3_A 4F3L_B 1XFN_A 1OTD_A 2PYR_A 1KOU_A 1XFQ_A 2ZOI_A 2ZOH_A 1OTA_A ....
Probab=24.70  E-value=82  Score=19.24  Aligned_cols=16  Identities=25%  Similarity=0.551  Sum_probs=13.8

Q ss_pred             EeEEEEEcCCCCEEEE
Q 033621            7 IKNILLLDSEGKRVAV   22 (115)
Q Consensus         7 I~ailIld~~Gkr~~s   22 (115)
                      -.+++++|.+|+.++.
T Consensus        11 ~~~i~~~d~~g~I~~~   26 (113)
T PF00989_consen   11 PDGIFVIDEDGRILYV   26 (113)
T ss_dssp             SSEEEEEETTSBEEEE
T ss_pred             CceEEEEeCcCeEEEE
Confidence            3589999999999876


No 46 
>KOG0997 consensus Uncharacterized conserved protein Sand [Function unknown]
Probab=24.52  E-value=4.9e+02  Score=22.91  Aligned_cols=70  Identities=19%  Similarity=0.207  Sum_probs=44.5

Q ss_pred             eEEEEEcCCCCEEEEeeCCCC--CCChHHHHHHHHHHHHHHhcCCCccccceEEecCeEEEEEEeceEEEEEEecCCc
Q 033621            8 KNILLLDSEGKRVAVKYYSND--WPTNSAKEAFEKAVFTKTQKTNARTEAEIAMLEGNVIVYKFVQDLHFFVTGGEDE   83 (115)
Q Consensus         8 ~ailIld~~Gkr~~sKyY~~~--~~~~~~q~~fE~~i~~k~~~~~~~~~~~i~~~~~~~vVyr~~adl~f~vv~~~~E   83 (115)
                      |-++|+.+.||+++++|=+..  .++...++++..- ++   ..+.  .---+..+|+++||-+-+-+.++.+...++
T Consensus       118 khifilseaGKPiyS~~gs~e~l~stmGv~~alISf-~q---~~~~--~i~si~a~g~k~vfl~kspl~LvA~s~t~~  189 (523)
T KOG0997|consen  118 KHIFILSEAGKPIYSRHGSDEALSSTMGVMQALISF-FQ---VSGL--AITSIHAFGNKLVFLQKSPLLLVAVSRTSQ  189 (523)
T ss_pred             ceEEEEecCCCceeeecCcHHHHHHHHHHHHHHHHH-Hh---hCCc--eEEEEEecCceEEEEecCcEEEEEEccccc
Confidence            458999999999999986432  1233334443332 11   1110  112334689999999999999999988655


No 47 
>cd03430 GDPMH GDP-mannose glycosyl hydrolase (AKA GDP-mannose mannosyl hydrolase (GDPMH)) is a member of the Nudix hydrolase superfamily. This class of enzymes is unique from other members of the superfamily in two aspects. First, it contains a modified Nudix signature sequence. The slight changes to the conserved sequence motif, GX5EX7REUXEEXGU, where U = I, L or V), are believed to contribute to the removal of all magnesium binding sites but one, retaining only the metal site that coordinates the pyrophosphate of the substrate. Secondly, it is not a pyrophosphatase that substitutes at a phosphorus; instead, it hydrolyzes nucleotide sugars such as GDP-mannose to GDP and mannose, cleaving the phosphoglycosyl bond by substituting at a carbon position. GDP-mannose provides mannosyl components for cell wall synthesis and is required for the synthesis of other glycosyl donors (such as GDP-fucose and colitose) for the cell wall. The importance of GDP-sugar hydrolase activities is thus close
Probab=24.24  E-value=2.4e+02  Score=19.41  Aligned_cols=41  Identities=15%  Similarity=0.292  Sum_probs=26.9

Q ss_pred             eEEEEEcCCCCEEEEe---------eCCCC---CCChHHHHHHHHHHHHHHhc
Q 033621            8 KNILLLDSEGKRVAVK---------YYSND---WPTNSAKEAFEKAVFTKTQK   48 (115)
Q Consensus         8 ~ailIld~~Gkr~~sK---------yY~~~---~~~~~~q~~fE~~i~~k~~~   48 (115)
                      -+++|+|.+|+.++.|         |.-|.   -+...-..+..|++.+.+.-
T Consensus        15 v~~vI~~~~g~vLl~~R~~~p~~g~w~lPGG~ve~gEs~~~aa~RE~~EE~Gl   67 (144)
T cd03430          15 IDLIVENEDGQYLLGKRTNRPAQGYWFVPGGRIRKNETLTEAFERIAKDELGL   67 (144)
T ss_pred             EEEEEEeCCCeEEEEEccCCCCCCcEECCCceecCCCCHHHHHHHHHHHHHCC
Confidence            3566678899999876         22221   13455678888888887763


No 48 
>PRK13189 peroxiredoxin; Provisional
Probab=24.02  E-value=96  Score=23.44  Aligned_cols=22  Identities=23%  Similarity=0.670  Sum_probs=19.5

Q ss_pred             eEeEEEEEcCCCCEEEEeeCCC
Q 033621            6 SIKNILLLDSEGKRVAVKYYSN   27 (115)
Q Consensus         6 ~I~ailIld~~Gkr~~sKyY~~   27 (115)
                      .+.+.+|+|.+|+.+...+|+.
T Consensus       125 ~~r~tfIID~~G~Ir~~~~~~~  146 (222)
T PRK13189        125 TVRAVFIIDPKGIIRAILYYPQ  146 (222)
T ss_pred             ceeEEEEECCCCeEEEEEecCC
Confidence            5789999999999999988865


No 49 
>cd03018 PRX_AhpE_like Peroxiredoxin (PRX) family, AhpE-like subfamily; composed of proteins similar to Mycobacterium tuberculosis AhpE. AhpE is described as a 1-cys PRX because of the absence of a resolving cysteine. The structure and sequence of AhpE, however, show greater similarity to 2-cys PRXs than 1-cys PRXs. PRXs are thiol-specific antioxidant (TSA) proteins that confer a protective role in cells through their peroxidase activity in which hydrogen peroxide, peroxynitrate, and organic hydroperoxides are reduced and detoxified using reducing equivalents derived from either thioredoxin, glutathione, trypanothione and AhpF. The first step of catalysis is the nucleophilic attack by the peroxidatic cysteine on the peroxide leading to the formation of a cysteine sulfenic acid intermediate. The absence of a resolving cysteine suggests that functional AhpE is regenerated by an external reductant. The solution behavior and crystal structure of AhpE show that it forms dimers and octamers.
Probab=23.59  E-value=1.8e+02  Score=19.48  Aligned_cols=19  Identities=11%  Similarity=0.191  Sum_probs=14.6

Q ss_pred             EEEEEcCCCCEEEEeeCCC
Q 033621            9 NILLLDSEGKRVAVKYYSN   27 (115)
Q Consensus         9 ailIld~~Gkr~~sKyY~~   27 (115)
                      ..+++|++|+.+...+...
T Consensus       116 ~~~lid~~G~v~~~~~~~~  134 (149)
T cd03018         116 AVFVIDRDGIIRYAWVSDD  134 (149)
T ss_pred             eEEEECCCCEEEEEEecCC
Confidence            6799999998777765543


No 50 
>PF15603 Imm45:  Immunity protein 45
Probab=23.18  E-value=1.2e+02  Score=19.92  Aligned_cols=32  Identities=19%  Similarity=0.103  Sum_probs=28.1

Q ss_pred             cCCcchHHHHHHHHHHHHHHHhhhCCcceEEE
Q 033621           80 GEDENELILATVLQGFFDAVGLLLRLLHYVIV  111 (115)
Q Consensus        80 ~~~ENEL~l~~~i~~~veaLd~~f~~~~~~~~  111 (115)
                      ++.|||-+--.-++.++++....|.+++....
T Consensus        49 ~P~e~~~it~~e~q~II~aI~~~~~~~~~~v~   80 (82)
T PF15603_consen   49 PPHENEPITIAERQKIIEAIEKYFSERGMTVD   80 (82)
T ss_pred             CCCCCcccCHHHHHHHHHHHHHHHhcCceEEE
Confidence            46889888888999999999999999988764


No 51 
>PF13426 PAS_9:  PAS domain; PDB: 3ULF_B 3UE6_E 2Z6D_B 2Z6C_B 3P7N_B 1LL8_A 3MJQ_A 3BWL_A 4EET_B 4EEP_A ....
Probab=22.94  E-value=94  Score=18.54  Aligned_cols=15  Identities=33%  Similarity=0.698  Sum_probs=12.8

Q ss_pred             eEEEEEcCCCCEEEE
Q 033621            8 KNILLLDSEGKRVAV   22 (115)
Q Consensus         8 ~ailIld~~Gkr~~s   22 (115)
                      .+++++|.+|..+.+
T Consensus         2 ~~i~i~d~~g~i~~~   16 (104)
T PF13426_consen    2 DGIFILDPDGRILYV   16 (104)
T ss_dssp             SEEEEEETTSBEEEE
T ss_pred             EEEEEECCcCcEEeh
Confidence            378999999999876


No 52 
>KOG0559 consensus Dihydrolipoamide succinyltransferase (2-oxoglutarate dehydrogenase, E2 subunit) [Energy production and conversion]
Probab=22.73  E-value=98  Score=26.31  Aligned_cols=21  Identities=29%  Similarity=0.455  Sum_probs=18.3

Q ss_pred             ecCeEEEEEEeceEEEEEEec
Q 033621           60 LEGNVIVYKFVQDLHFFVTGG   80 (115)
Q Consensus        60 ~~~~~vVyr~~adl~f~vv~~   80 (115)
                      .||.-||||.|-|+.+.|..+
T Consensus       308 Idg~~iVYRDyvDISvAVaTp  328 (457)
T KOG0559|consen  308 IDGDDIVYRDYVDISVAVATP  328 (457)
T ss_pred             ecCCeeEEeecceeEEEeecC
Confidence            577799999999999998765


No 53 
>PLN02609 catalase
Probab=20.98  E-value=2.5e+02  Score=24.43  Aligned_cols=49  Identities=12%  Similarity=0.106  Sum_probs=35.8

Q ss_pred             CCcceEeEEEEEcCCCCEEEEeeCCCC-----CCChHHHHHH--------HHHHHHHHhcCC
Q 033621            2 DTCPSIKNILLLDSEGKRVAVKYYSND-----WPTNSAKEAF--------EKAVFTKTQKTN   50 (115)
Q Consensus         2 ~sl~~I~ailIld~~Gkr~~sKyY~~~-----~~~~~~q~~f--------E~~i~~k~~~~~   50 (115)
                      +.=+.+++.-+.|.+|++.++||-=.+     ..+.++.+++        .++|++.+.+-+
T Consensus       202 ~~~~gvhtF~~vn~~G~~~~Vk~h~~p~~G~~~l~~eeA~~~~g~~~d~l~~DL~~~I~~G~  263 (492)
T PLN02609        202 MEGFGVHTYKLINKAGKAHYVKFHWKPTCGVKNLLDEEAVRVGGSNHSHATQDLYDSIAAGN  263 (492)
T ss_pred             CCcccccceEEECCCCCEEEEEEEEEeCCCCCCCCHHHHHHhhCCCCcHHHHHHHHHHhcCC
Confidence            345679999999999999999997322     1244444444        788999998754


No 54 
>cd08156 catalase_clade_3 Clade 3 of the heme-binding enzyme catalase. Catalase is a ubiquitous enzyme found in both prokaryotes and eukaryotes, which is involved in the protection of cells from the toxic effects of peroxides. It catalyzes the conversion of hydrogen peroxide to water and molecular oxygen. Catalases also utilize hydrogen peroxide to oxidize various substrates such as alcohol or phenols. Clade 3 catalases are the most abundant subfamily and are found in all three kingdoms of life; they have a relatively small subunit size of 43 to 75 kDa, and bind a protoheme IX (heme b) group buried deep inside the structure. Clade 3 catalases also bind NADPH as a second redox-active cofactor. They form tetramers, and in eukaryotic cells, catalases are located in peroxisomes.
Probab=20.63  E-value=1.2e+02  Score=25.85  Aligned_cols=48  Identities=17%  Similarity=0.264  Sum_probs=33.7

Q ss_pred             CcceEeEEEEEcCCCCEEEEeeCCCC---C--CChHHHHH--------HHHHHHHHHhcCC
Q 033621            3 TCPSIKNILLLDSEGKRVAVKYYSND---W--PTNSAKEA--------FEKAVFTKTQKTN   50 (115)
Q Consensus         3 sl~~I~ailIld~~Gkr~~sKyY~~~---~--~~~~~q~~--------fE~~i~~k~~~~~   50 (115)
                      .=+.+++.-..|.+|++.++||-=.+   .  .+.++.++        +.+++++.+.+-+
T Consensus       146 ~~~gvhtf~~vn~~G~~~~Vk~h~~p~~G~~~l~~eea~~~~g~~~d~l~~dL~~~I~~G~  206 (429)
T cd08156         146 NGYGSHTFSLVNAKGERFWVKFHFKTDQGIKNLTNEEAAELAGEDPDYAQRDLFEAIERGD  206 (429)
T ss_pred             CccccceEEEEcCCCCEEEEEEEEEeCCCCCCCCHHHHHHhhCCCCCHHHHHHHHHHhcCC
Confidence            34679999999999999999996221   1  23333332        4678999988765


Done!