Query         psy16824
Match_columns 85
No_of_seqs    106 out of 153
Neff          4.0 
Searched_HMMs 46136
Date          Fri Aug 16 22:59:29 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy16824.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/16824hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF04062 P21-Arc:  ARP2/3 compl 100.0 9.3E-49   2E-53  286.6   2.5   79    1-79     97-175 (175)
  2 KOG3155|consensus              100.0   5E-44 1.1E-48  260.6   5.2   79    1-79     95-173 (173)
  3 cd00126 PAH Pancreatic Hormone  85.4     0.9 1.9E-05   26.0   2.3   22   26-47      8-29  (36)
  4 PF00159 Hormone_3:  Pancreatic  83.9     1.3 2.8E-05   25.3   2.5   22   26-47      8-29  (36)
  5 smart00309 PAH Pancreatic horm  82.9     1.3 2.9E-05   25.3   2.3   22   26-47      8-29  (36)
  6 PF09254 Endonuc-FokI_C:  Restr  81.5     2.3 4.9E-05   32.2   3.7   32   29-71     89-120 (189)
  7 COG3765 WzzB Chain length dete  57.8      22 0.00048   29.1   4.6   47    9-55    143-190 (347)
  8 TIGR03779 Bac_Flav_CT_M Bacter  56.0      17 0.00036   30.2   3.7   36   19-55    133-180 (410)
  9 PF08990 Docking:  Erythronolid  55.2      11 0.00025   20.1   1.8   17   31-47      3-19  (27)
 10 COG1298 FlhA Flagellar biosynt  53.9      27 0.00058   31.1   4.8   46    8-54    445-515 (696)
 11 PF08606 Prp19:  Prp19/Pso4-lik  49.9      33 0.00071   22.2   3.6   19   33-51     25-43  (70)
 12 PF00631 G-gamma:  GGL domain;   45.0      27 0.00059   21.2   2.6   19   30-48      2-20  (68)
 13 PF02831 gpW:  gpW;  InterPro:   41.3      22 0.00047   22.6   1.8   19   30-48     38-56  (68)
 14 PHA02865 MHC-like TNF binding   39.5      13 0.00028   30.2   0.7   11   11-21    261-271 (338)
 15 TIGR03142 cytochro_ccmI cytoch  39.0      28 0.00061   23.2   2.2   16   40-55     63-78  (117)
 16 COG2232 Predicted ATP-dependen  38.8      44 0.00095   27.9   3.6   28    8-35    334-364 (389)
 17 PRK06553 lipid A biosynthesis   37.5      21 0.00045   27.2   1.5   35   13-51    153-187 (308)
 18 PRK08943 lipid A biosynthesis   36.0      23 0.00049   27.0   1.5   34   14-51    152-185 (314)
 19 cd05493 Bromo_ALL-1 Bromodomai  35.8      41 0.00088   23.9   2.7   19   48-66    113-131 (131)
 20 PF08802 CytB6-F_Fe-S:  Cytochr  35.3       7 0.00015   22.7  -1.1   13   69-81      6-18  (39)
 21 PF13709 DUF4159:  Domain of un  35.0      34 0.00073   25.2   2.2   14   28-41     68-81  (207)
 22 PF11300 DUF3102:  Protein of u  34.9      60  0.0013   22.9   3.4   21   32-52      7-34  (130)
 23 COG4114 FhuF Uncharacterized F  34.8      40 0.00087   26.6   2.7   41   32-72    176-223 (251)
 24 COG5500 Predicted integral mem  33.8      29 0.00062   25.5   1.7   26   17-42    108-135 (159)
 25 PF00140 Sigma70_r1_2:  Sigma-7  33.7      48   0.001   18.1   2.2   13   32-44      1-13  (37)
 26 PF13006 Nterm_IS4:  Insertion   33.6      39 0.00084   22.8   2.2   21   36-56     77-97  (98)
 27 PRK15202 type III secretion ch  33.4      47   0.001   23.5   2.6   34   17-50     77-116 (117)
 28 PF10372 YojJ:  Bacterial membr  32.8      61  0.0013   20.8   2.9   16   32-47     12-27  (70)
 29 PF11464 Rbsn:  Rabenosyn Rab b  31.8      58  0.0013   19.1   2.4   15   31-45      4-18  (42)
 30 PF04536 TPM:  TLP18.3, Psb32 a  31.6      24 0.00052   22.3   0.9   24   28-51      3-26  (119)
 31 cd00555 Maf Nucleotide binding  31.2      37 0.00081   24.5   1.9   16   20-35     74-89  (180)
 32 PF08938 HBS1_N:  HBS1 N-termin  31.0     8.3 0.00018   24.4  -1.4   22   28-49     21-42  (79)
 33 KOG1205|consensus               29.6      40 0.00087   26.5   2.0   34   10-48    150-183 (282)
 34 PF05438 TRH:  Thyrotropin-rele  28.8      44 0.00095   25.8   2.0   47    2-48      7-60  (212)
 35 cd07624 BAR_SNX7_30 The Bin/Am  28.3      97  0.0021   22.4   3.7   29   19-47      3-31  (200)
 36 PRK06946 lipid A biosynthesis   28.1      39 0.00084   25.5   1.6   32   16-51    133-164 (293)
 37 PRK00234 Maf-like protein; Rev  27.9      47   0.001   24.3   2.0   16   20-35     76-91  (192)
 38 PF12891 Glyco_hydro_44:  Glyco  27.9      26 0.00056   27.2   0.6   20   33-52    206-228 (239)
 39 KOG2940|consensus               27.7      39 0.00084   27.3   1.6   17   40-56     52-68  (325)
 40 PF11544 Spc42p:  Spindle pole   27.6      80  0.0017   20.7   2.8   19   29-47     18-36  (76)
 41 TIGR02208 lipid_A_msbB lipid A  27.2      30 0.00066   26.2   0.9   34   14-51    143-176 (305)
 42 PRK00078 Maf-like protein; Rev  27.0      49  0.0011   24.2   1.9   16   20-35     78-93  (192)
 43 PF03221 HTH_Tnp_Tc5:  Tc5 tran  26.4 1.3E+02  0.0028   17.0   3.5   45   29-73      2-60  (66)
 44 PF14357 DUF4404:  Domain of un  26.1      50  0.0011   21.4   1.7   13   35-47      2-14  (85)
 45 PF09396 Thrombin_light:  Throm  25.6      73  0.0016   19.4   2.2   28   13-40     18-47  (49)
 46 PRK00648 Maf-like protein; Rev  25.5      54  0.0012   23.9   1.9   16   20-35     79-94  (191)
 47 PF11035 SnAPC_2_like:  Small n  25.4 1.2E+02  0.0027   25.0   4.1   43   29-72     57-109 (344)
 48 cd00068 GGL G protein gamma su  25.4      58  0.0013   19.5   1.8   17   33-49      2-18  (57)
 49 PRK00884 Maf-like protein; Rev  25.3      54  0.0012   24.1   1.9   16   20-35     76-91  (194)
 50 PF09509 Hypoth_Ymh:  Protein o  25.3 1.1E+02  0.0024   20.9   3.4   25   33-57     28-58  (125)
 51 COG2207 AraC AraC-type DNA-bin  25.3      99  0.0021   18.8   2.9   19   34-52     64-82  (127)
 52 TIGR00172 maf MAF protein. Thi  25.1      55  0.0012   23.8   1.9   16   20-35     77-92  (183)
 53 PRK00032 Maf-like protein; Rev  24.9      56  0.0012   23.9   1.9   16   20-35     77-92  (190)
 54 smart00224 GGL G protein gamma  24.8      59  0.0013   19.8   1.7   16   33-48      2-17  (63)
 55 PF06183 DinI:  DinI-like famil  24.8      83  0.0018   19.5   2.4   18   38-56      5-22  (65)
 56 PRK04056 Maf-like protein; Rev  24.6      58  0.0013   23.6   1.9   16   20-35     75-90  (180)
 57 KOG4119|consensus               24.5      95  0.0021   20.0   2.7   21   28-48      5-25  (71)
 58 COG1694 MazG Predicted pyropho  24.5 1.1E+02  0.0023   19.6   3.0   23   33-55     44-68  (102)
 59 PRK14364 Maf-like protein; Pro  24.4      58  0.0013   23.6   1.9   16   20-35     71-86  (181)
 60 PRK00148 Maf-like protein; Rev  24.0      60  0.0013   23.8   1.9   16   20-35     75-90  (194)
 61 PRK01441 Maf-like protein; Rev  24.0      61  0.0013   24.0   2.0   16   20-35     86-101 (207)
 62 PRK08905 lipid A biosynthesis   23.9      37 0.00079   25.6   0.8   31   17-51    124-154 (289)
 63 PF09720 Unstab_antitox:  Putat  23.2      92   0.002   17.9   2.3   31   43-74      7-37  (54)
 64 PF12019 GspH:  Type II transpo  23.1 1.3E+02  0.0028   18.0   3.1   22   31-52      1-22  (114)
 65 PRK14366 Maf-like protein; Pro  23.1      64  0.0014   23.8   1.9   16   20-35     80-95  (195)
 66 TIGR03418 chol_sulf_TF putativ  23.0      71  0.0015   22.9   2.1   21   34-54     31-51  (291)
 67 PRK11716 DNA-binding transcrip  22.5      32  0.0007   23.9   0.3   20   35-54      8-27  (269)
 68 PRK10219 DNA-binding transcrip  22.5 1.2E+02  0.0025   19.1   2.9   19   34-52     49-67  (107)
 69 KOG3382|consensus               22.4   1E+02  0.0022   22.6   2.8   26   33-58     16-42  (151)
 70 PF12174 RST:  RCD1-SRO-TAF4 (R  22.4 1.2E+02  0.0025   19.2   2.8   18   38-55     47-64  (70)
 71 cd04789 HTH_Cfa Helix-Turn-Hel  22.3      57  0.0012   21.1   1.4   33   15-47     56-88  (102)
 72 PF02099 Josephin:  Josephin;    22.3      34 0.00074   24.5   0.4   28   17-45    121-148 (157)
 73 PF15601 Imm42:  Immunity prote  22.3   1E+02  0.0022   22.0   2.7   40   16-55     37-78  (134)
 74 PRK14363 Maf-like protein; Pro  22.1      70  0.0015   23.8   2.0   16   20-35     75-90  (204)
 75 PF10229 DUF2246:  Uncharacteri  21.9      45 0.00097   26.3   1.0   11    8-18     51-61  (278)
 76 PRK11546 zraP zinc resistance   21.9 1.3E+02  0.0029   21.6   3.3   29   19-47     77-106 (143)
 77 PRK06012 flhA flagellar biosyn  21.8 1.7E+02  0.0036   26.0   4.5   45    9-54    447-516 (697)
 78 PRK13348 chromosome replicatio  21.8      42 0.00091   24.1   0.7   20   34-53     32-51  (294)
 79 PF02545 Maf:  Maf-like protein  21.8      64  0.0014   23.6   1.7   16   20-35     78-93  (195)
 80 PF03190 Thioredox_DsbH:  Prote  21.8      55  0.0012   23.8   1.3   32   13-44     99-142 (163)
 81 PF01231 IDO:  Indoleamine 2,3-  21.8      74  0.0016   26.0   2.2   18   34-51    405-422 (422)
 82 PRK06628 lipid A biosynthesis   21.7      59  0.0013   24.5   1.6   32   16-51    137-168 (290)
 83 COG4047 Uncharacterized protei  21.6      70  0.0015   25.1   1.9   13   59-71     56-68  (243)
 84 PLN03143 nudix hydrolase; Prov  21.2      89  0.0019   24.6   2.5   42    7-48     32-79  (291)
 85 COG1503 eRF1 Peptide chain rel  21.0      63  0.0014   27.0   1.7   29   14-47     24-52  (411)
 86 PF14852 Fis1_TPR_N:  Fis1 N-te  20.8   1E+02  0.0023   16.9   2.1   14   44-57     20-33  (35)
 87 PRK13480 3'-5' exoribonuclease  20.7   1E+02  0.0022   24.4   2.7   40   16-55     95-136 (314)
 88 PRK14368 Maf-like protein; Pro  20.5      76  0.0016   23.3   1.9   16   20-35     79-94  (193)
 89 PRK14365 Maf-like protein; Pro  20.4      78  0.0017   23.3   1.9   16   20-35     77-92  (197)
 90 PRK13456 DNA protection protei  20.4 1.5E+02  0.0033   22.2   3.5   43   32-75     51-99  (186)
 91 PRK13503 transcriptional activ  20.2 1.2E+02  0.0026   21.7   2.9   19   34-52    215-233 (278)
 92 PRK14361 Maf-like protein; Pro  20.2      79  0.0017   23.1   1.9   16   20-35     72-87  (187)
 93 PF12785 VESA1_N:  Variant eryt  20.2      39 0.00084   28.8   0.3   30   19-48     39-68  (462)
 94 PRK01526 Maf-like protein; Rev  20.1      79  0.0017   23.5   1.9   15   21-35     85-99  (205)
 95 COG3248 Tsx Nucleoside-binding  20.0      20 0.00043   28.6  -1.3   27    8-35    153-180 (284)

No 1  
>PF04062 P21-Arc:  ARP2/3 complex ARPC3 (21 kDa) subunit;  InterPro: IPR007204 The Arp2/3 complex is a seven-protein assembly that is critical for actin nucleation and branching in cells. Arp2/3 nucleates new actin filaments while bound to existing filaments, thus creating a branched network []. The complex consists of Arp2, Arp3, p41, p34, p21, p20 and p16. Subunits p34 and p20 constitute the core of the structure, with the remaining subunits located peripherally []. This entry describes the p21 subunit. Proteins such as WASp and Scar1 may mediate receptor signalling through interactions with p21-Arc, resulting in the activation of Arc2/3 complex activity [].; GO: 0030833 regulation of actin filament polymerization, 0005856 cytoskeleton; PDB: 3DWL_J 2P9P_E 2P9N_E 2P9K_E 1TYQ_E 1U2V_E 3RSE_E 2P9U_E 3DXM_E 2P9S_E ....
Probab=100.00  E-value=9.3e-49  Score=286.58  Aligned_cols=79  Identities=65%  Similarity=1.263  Sum_probs=62.6

Q ss_pred             CccccccCcCCCCCCCCCCcccccCCCChhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCChhhhhhccchhhhhcc
Q psy16824          1 MYSLALAKFDLPGEAGFPLNSVYAKPQSQNEADLMKNYLTQVRQETGLRVCERVFNTPDGKPSKWWLCFSKKRFMDKSL   79 (85)
Q Consensus         1 l~~lal~~F~iPGd~gFpLn~~y~~P~~~~E~d~lr~Yl~QlRqE~~~RL~e~vy~~~~~~psKwWm~F~KRkFM~KsL   79 (85)
                      |+++|+++|+||||+|||||++|++|+|++|+|+||+||+|+|||+|.|||||||++++++||||||||+|||||||||
T Consensus        97 l~~lal~~F~iPGd~gFpLn~~y~~P~~~~e~d~lR~Yl~QlRqElg~RL~ervf~~~~~~psKwW~~FaKRkFM~KsL  175 (175)
T PF04062_consen   97 LYNLALDNFPIPGDPGFPLNSLYAKPANRQEADLLRQYLTQLRQELGLRLLERVFDDGDGKPSKWWMCFAKRKFMNKSL  175 (175)
T ss_dssp             HHHHHHS----TTSTT-TTTTTS---SSHHHHHHHHHHHHHHHHHHCCCCCHHHB-SSSS-B-HCCCCCCC--GGG--T
T ss_pred             HHHHHHhcCCCCCCCCCCccccccCCCChhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCChhhhhhhhhccccccC
Confidence            5899999999999999999999999999999999999999999999999999999988899999999999999999997


No 2  
>KOG3155|consensus
Probab=100.00  E-value=5e-44  Score=260.61  Aligned_cols=79  Identities=62%  Similarity=1.228  Sum_probs=77.5

Q ss_pred             CccccccCcCCCCCCCCCCcccccCCCChhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCChhhhhhccchhhhhcc
Q psy16824          1 MYSLALAKFDLPGEAGFPLNSVYAKPQSQNEADLMKNYLTQVRQETGLRVCERVFNTPDGKPSKWWLCFSKKRFMDKSL   79 (85)
Q Consensus         1 l~~lal~~F~iPGd~gFpLn~~y~~P~~~~E~d~lr~Yl~QlRqE~~~RL~e~vy~~~~~~psKwWm~F~KRkFM~KsL   79 (85)
                      |++||+++|+||||+|||||+||++|.+++|+|.||+||+|+|||+|.|||++||++.+++|||||+||+|||||||||
T Consensus        95 l~~lal~~~~iPGe~GFPln~ly~~p~s~~~~e~mr~Yl~Q~RqE~~~RL~~~v~~~~~d~PsKwW~cF~kRrFMnksl  173 (173)
T KOG3155|consen   95 LYNLALENFPIPGEPGFPLNALYTLPASKQDAELMRAYLQQFRQETGLRLLEKVYDTPKDKPSKWWLCFAKRRFMNKSL  173 (173)
T ss_pred             HHHhhhhcccCCCCCCCCchhhhcCCCCCCchHHHHHHHHHHHHHhhhhHHHHhcCCCCCCcchhHHHHHHHHhhhccC
Confidence            6899999999999999999999999999999999999999999999999999999999999999999999999999997


No 3  
>cd00126 PAH Pancreatic Hormone domain, a regulator of pancreatic and gastrointestinal functions; neuropeptide Y (NPY)b, peptide YY (PYY), and pancreatic polypetide (PP) are closely related; propeptide is enzymatically cleaved to yield the mature active peptide with amidated C-terminal ends; receptor binding and activation functions may reside in the N- and C-termini respectively; occurs in neurons, intestinal endocrine cells, and pancreas; exist as monomers and dimers
Probab=85.44  E-value=0.9  Score=26.03  Aligned_cols=22  Identities=18%  Similarity=0.483  Sum_probs=17.0

Q ss_pred             CCChhHHHHHHHHHHHHHHHHH
Q psy16824         26 PQSQNEADLMKNYLTQVRQETG   47 (85)
Q Consensus        26 P~~~~E~d~lr~Yl~QlRqE~~   47 (85)
                      |++-.-.|.|++|+++||+=+.
T Consensus         8 Pg~~a~~eel~~Y~~~L~~Yin   29 (36)
T cd00126           8 PGDDASPEELRQYLAALREYIN   29 (36)
T ss_pred             CCCCCCHHHHHHHHHHHHHHHH
Confidence            4555557999999999998653


No 4  
>PF00159 Hormone_3:  Pancreatic hormone peptide;  InterPro: IPR001955 Pancreatic hormone (PP) [] is a peptide synthesized in pancreatic islets of Langherhans, which acts as a regulator of pancreatic and gastrointestinal functions. The hormone is produced as a larger propeptide, which is enzymatically cleaved to yield the mature active peptide: this is 36 amino acids in length [] and has an amidated C terminus []. The hormone has a globular structure, residues 2-8 forming a left-handed poly-proline-II-like helix, residues 9-13 a beta turn, and 14-32 an alpha-helix,held close to the first helix by hydrophobic interactions []. Unlike glucagon, another peptide hormone, the structure of pancreatic peptide is preserved in aqueous solution []. Both N and C termini are required for activity: receptor binding and activation functions may reside in the N and C termini respectively []. Pancreatic hormone is part of a wider family of active peptides that includes:  Neuropeptide Y (NPY) [], one of the most abundant peptides in the mammalian nervous system. NPY is implicated in the control of feeding and the secretion of the gonadotrophin-releasing hormone. Peptide YY (PYY) []. PPY is a gut peptide that inhibits exocrine pancreatic secretion, has a vasoconstrictory action and inhibits jejunal and colonic mobility. Various NPY and PYY-like polypeptides from fish and amphibians [, ]. Neuropeptide F (NPF) from invertebrates such as worms and snail. Skin peptide Tyr-Tyr (SPYY) from the frog Phyllomedusa bicolor. SPYY shows a large spectra of antibacterial and antifungal activity.  All these peptides are 36 to 39 amino acids long. Like most active peptides, their C-terminal is amidated and they are synthesized as larger protein precursors.; GO: 0005179 hormone activity, 0005576 extracellular region; PDB: 1LJV_A 1BBA_A 1V1D_A 1PPT_A 2H3T_A 2H4B_A 2BF9_A 2H3S_B 1K8V_A 2DF0_A ....
Probab=83.94  E-value=1.3  Score=25.29  Aligned_cols=22  Identities=14%  Similarity=0.315  Sum_probs=16.3

Q ss_pred             CCChhHHHHHHHHHHHHHHHHH
Q psy16824         26 PQSQNEADLMKNYLTQVRQETG   47 (85)
Q Consensus        26 P~~~~E~d~lr~Yl~QlRqE~~   47 (85)
                      |++-.-.|.|++|+.+||+=+.
T Consensus         8 P~~~aspeel~~Y~~~L~~Y~~   29 (36)
T PF00159_consen    8 PGDFASPEELAQYYAALRHYIN   29 (36)
T ss_dssp             SSTTSSHHHHHHHHHHHHHHHH
T ss_pred             CCCCCCHHHHHHHHHHHHHHHH
Confidence            4444555789999999998653


No 5  
>smart00309 PAH Pancreatic hormones / neuropeptide F / peptide YY family. Pancreatic hormone is a regulator of pancreatic and gastrointestinal functions.
Probab=82.87  E-value=1.3  Score=25.32  Aligned_cols=22  Identities=18%  Similarity=0.465  Sum_probs=16.2

Q ss_pred             CCChhHHHHHHHHHHHHHHHHH
Q psy16824         26 PQSQNEADLMKNYLTQVRQETG   47 (85)
Q Consensus        26 P~~~~E~d~lr~Yl~QlRqE~~   47 (85)
                      |++..-.|.|++|+.+||+=+.
T Consensus         8 Pg~~a~~e~l~~Y~~~L~~Yin   29 (36)
T smart00309        8 PGDDASPEDLRQYLAALREYIN   29 (36)
T ss_pred             CCCCCCHHHHHHHHHHHHHHHH
Confidence            4444445779999999998653


No 6  
>PF09254 Endonuc-FokI_C:  Restriction endonuclease FokI, C terminal;  InterPro: IPR015334 There are four classes of restriction endonucleases: types I, II,III and IV. All types of enzymes recognise specific short DNA sequences and carry out the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. They differ in their recognition sequence, subunit composition, cleavage position, and cofactor requirements [, ], as summarised below:   Type I enzymes (3.1.21.3 from EC) cleave at sites remote from recognition site; require both ATP and S-adenosyl-L-methionine to function; multifunctional protein with both restriction and methylase (2.1.1.72 from EC) activities. Type II enzymes (3.1.21.4 from EC) cleave within or at short specific distances from recognition site; most require magnesium; single function (restriction) enzymes independent of methylase. Type III enzymes (3.1.21.5 from EC) cleave at sites a short distance from recognition site; require ATP (but doesn't hydrolyse it); S-adenosyl-L-methionine stimulates reaction but is not required; exists as part of a complex with a modification methylase methylase (2.1.1.72 from EC). Type IV enzymes target methylated DNA.   Type II restriction endonucleases (3.1.21.4 from EC) are components of prokaryotic DNA restriction-modification mechanisms that protect the organism against invading foreign DNA. These site-specific deoxyribonucleases catalyse the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. Of the 3000 restriction endonucleases that have been characterised, most are homodimeric or tetrameric enzymes that cleave target DNA at sequence-specific sites close to the recognition site. For homodimeric enzymes, the recognition site is usually a palindromic sequence 4-8 bp in length. Most enzymes require magnesium ions as a cofactor for catalysis. Although they can vary in their mode of recognition, many restriction endonucleases share a similar structural core comprising four beta-strands and one alpha-helix, as well as a similar mechanism of cleavage, suggesting a common ancestral origin []. However, there is still considerable diversity amongst restriction endonucleases [, ]. The target site recognition process triggers large conformational changes of the enzyme and the target DNA, leading to the activation of the catalytic centres. Like other DNA binding proteins, restriction enzymes are capable of non-specific DNA binding as well, which is the prerequisite for efficient target site location by facilitated diffusion. Non-specific binding usually does not involve interactions with the bases but only with the DNA backbone [].  This entry represents the C-terminal domain of FokI restriction endonucleases, which adopts a structure consisting of an alpha/beta/alpha core containing a five-stranded beta-sheet. FokI recognises the double-stranded DNA sequence 5'-GGATG-3' and cleave DNA phosphodiester groups 9 base pairs away on this strand and 13 base pairs away on the complementary strand [, ].; GO: 0003677 DNA binding, 0009036 Type II site-specific deoxyribonuclease activity, 0009307 DNA restriction-modification system; PDB: 1FOK_A 2FOK_B.
Probab=81.55  E-value=2.3  Score=32.17  Aligned_cols=32  Identities=28%  Similarity=0.677  Sum_probs=21.5

Q ss_pred             hhHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCChhhhhhcc
Q psy16824         29 QNEADLMKNYLTQVRQETGLRVCERVFNTPDGKPSKWWLCFSK   71 (85)
Q Consensus        29 ~~E~d~lr~Yl~QlRqE~~~RL~e~vy~~~~~~psKwWm~F~K   71 (85)
                      -.++|.|--|+.+-|.-           ++.-.|+|||-+|-+
T Consensus        89 i~q~DeM~RYI~~n~~R-----------d~~~npnkWWe~f~~  120 (189)
T PF09254_consen   89 ISQADEMIRYIEENQER-----------DEKRNPNKWWENFPE  120 (189)
T ss_dssp             HHHHHHHHHHHHHHHH-------------TTTSTT-GGGGS-T
T ss_pred             cccHHHHHHHHHHHhcc-----------cccCCCchhHHhccc
Confidence            46778888888877642           234569999999976


No 7  
>COG3765 WzzB Chain length determinant protein [Cell envelope biogenesis, outer membrane]
Probab=57.81  E-value=22  Score=29.13  Aligned_cols=47  Identities=17%  Similarity=0.306  Sum_probs=37.4

Q ss_pred             cCCCCCCCCC-CcccccCCCChhHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy16824          9 FDLPGEAGFP-LNSVYAKPQSQNEADLMKNYLTQVRQETGLRVCERVF   55 (85)
Q Consensus         9 F~iPGd~gFp-Ln~~y~~P~~~~E~d~lr~Yl~QlRqE~~~RL~e~vy   55 (85)
                      |..+|+.+++ ++..|+.+....-.+.+++|++++=|.....+++++=
T Consensus       143 ~~~~~~~~~~~~~vs~~a~t~edAq~~L~gyI~~~s~~v~~el~~~l~  190 (347)
T COG3765         143 FKPGGFDLATNLTVSFTAETAEDAQDLLRGYIAFVSQKVAQELLDNLK  190 (347)
T ss_pred             ccCCccccchhheeeeecCCcHHHHHHHHHHHHHHhHHHHHHHHHHHH
Confidence            4456666665 8888988877667799999999999999888887764


No 8  
>TIGR03779 Bac_Flav_CT_M Bacteroides conjugative transposon TraM protein. Members of this protein family are designated TraM and are found in a proposed transfer region of a class of conjugative transposon found in the Bacteroides lineage.
Probab=56.00  E-value=17  Score=30.22  Aligned_cols=36  Identities=19%  Similarity=0.456  Sum_probs=28.0

Q ss_pred             CcccccCCCChhHHHHHHHHHHHHHHHHH------------HHHHHHhc
Q psy16824         19 LNSVYAKPQSQNEADLMKNYLTQVRQETG------------LRVCERVF   55 (85)
Q Consensus        19 Ln~~y~~P~~~~E~d~lr~Yl~QlRqE~~------------~RL~e~vy   55 (85)
                      |+++|..|.+..|.+ ++.=+.|++.++.            ..|+||-|
T Consensus       133 LgsFYe~~~~~~e~e-l~~~veel~~~l~~~~~~~~~~~eq~almEKsy  180 (410)
T TIGR03779       133 LGSFYEYPKTDEEKE-LLREVEELESRLATEPSPAPELEEQLALMEKSY  180 (410)
T ss_pred             hhhhccCCCCchhHH-HHHHHHHHHHHHhhhcccccchhHHHHHHHHhH
Confidence            778999998888888 7777888887664            56777666


No 9  
>PF08990 Docking:  Erythronolide synthase docking;  InterPro: IPR015083 The N-terminal docking domain found in modular polyketide synthase assumes an alpha-helical structure, wherein two alpha-helices are connected by a short loop. Two such N-terminal domains dimerise to form amphipathic parallel alpha-helical coiled coils: dimerisation is essential for protein function []. ; GO: 0016740 transferase activity, 0048037 cofactor binding; PDB: 2HG4_E.
Probab=55.21  E-value=11  Score=20.09  Aligned_cols=17  Identities=29%  Similarity=0.567  Sum_probs=12.8

Q ss_pred             HHHHHHHHHHHHHHHHH
Q psy16824         31 EADLMKNYLTQVRQETG   47 (85)
Q Consensus        31 E~d~lr~Yl~QlRqE~~   47 (85)
                      ++|++|.||++.=-||-
T Consensus         3 ~e~kLr~YLkr~t~eL~   19 (27)
T PF08990_consen    3 NEDKLRDYLKRVTAELR   19 (27)
T ss_dssp             -HCHHHHHHHHHHHHHH
T ss_pred             cHHHHHHHHHHHHHHHH
Confidence            56889999998776663


No 10 
>COG1298 FlhA Flagellar biosynthesis pathway, component FlhA [Cell motility and secretion / Intracellular trafficking and secretion]
Probab=53.89  E-value=27  Score=31.09  Aligned_cols=46  Identities=24%  Similarity=0.535  Sum_probs=36.1

Q ss_pred             CcCCCC----CCCCCCcccccCCCChhHH-------------------HHHHHHHHHH--HHHHHHHHHHHh
Q psy16824          8 KFDLPG----EAGFPLNSVYAKPQSQNEA-------------------DLMKNYLTQV--RQETGLRVCERV   54 (85)
Q Consensus         8 ~F~iPG----d~gFpLn~~y~~P~~~~E~-------------------d~lr~Yl~Ql--RqE~~~RL~e~v   54 (85)
                      .=.|||    ||-|=|.+++-.+..++|+                   |.+|+++.+|  |||. ..|+|++
T Consensus       445 ~~~i~G~~t~ePaFGl~a~WI~~~~ke~A~~~GYtVvd~~sVi~THl~evir~~a~eLl~rqev-q~Lld~l  515 (696)
T COG1298         445 TGEIPGEETKDPAFGLPAYWIEEDQKEEAQILGYTVVDPSSVIATHLSEVIRNHAHELLGRQEV-QQLLDRL  515 (696)
T ss_pred             CCCCCCccccCCCCCCceeecCchhhhHHHhcCCeeecchHHHHHhHHHHHHHhHHHHhHHHHH-HHHHHHH
Confidence            456888    7889999999999888888                   7788888877  7776 5555544


No 11 
>PF08606 Prp19:  Prp19/Pso4-like;  InterPro: IPR013915  This region is found specifically in PRP19-like protein. The region represented by this protein covers the sequence implicated in self-interaction and a coiled-coiled motif []. PRP19-like proteins form an oligomer that is necessary for spliceosome assembly []. 
Probab=49.91  E-value=33  Score=22.20  Aligned_cols=19  Identities=26%  Similarity=0.254  Sum_probs=15.2

Q ss_pred             HHHHHHHHHHHHHHHHHHH
Q psy16824         33 DLMKNYLTQVRQETGLRVC   51 (85)
Q Consensus        33 d~lr~Yl~QlRqE~~~RL~   51 (85)
                      =.+|+=|.|+||||+..|-
T Consensus        25 f~LRk~l~~~rqELs~aLY   43 (70)
T PF08606_consen   25 FTLRKQLDQTRQELSHALY   43 (70)
T ss_pred             HHHHHHHHHHHHHHHHHHH
Confidence            4589999999999986543


No 12 
>PF00631 G-gamma:  GGL domain;  InterPro: IPR015898 This entry represents the G protein gamma subunit and the GGL (G protein gamma-like) domain, which are related in sequence and are comprised of an extended alpha-helical polypeptide. The G protein gamma subunit forms a stable dimer with the beta subunit, but it does not make any contact with the alpha subunit, which contacts the opposite face of the beta subunit. The GGL domain is found in several RGS (regulators of G protein signaling) proteins. GGL domains can interact with beta subunits to form novel dimers that prevent gamma subunit binding, and may prevent heterotrimer formation by inhibiting alpha subunit binding. The interaction between G protein beta-5 neuro-specific isoforms and RGS GGL domains may represent a general mode of binding between beta-propeller proteins and their partners [].; GO: 0004871 signal transducer activity, 0007186 G-protein coupled receptor protein signaling pathway, 0005834 heterotrimeric G-protein complex; PDB: 3PSC_G 3SN6_G 1OMW_G 2BCJ_G 1GG2_G 3PVW_G 3PVU_G 3AH8_G 3CIK_G 1GP2_G ....
Probab=45.00  E-value=27  Score=21.24  Aligned_cols=19  Identities=37%  Similarity=0.431  Sum_probs=15.6

Q ss_pred             hHHHHHHHHHHHHHHHHHH
Q psy16824         30 NEADLMKNYLTQVRQETGL   48 (85)
Q Consensus        30 ~E~d~lr~Yl~QlRqE~~~   48 (85)
                      ++.+.++.-+.|||+|+..
T Consensus         2 ~~~~~l~~ei~~L~~el~~   20 (68)
T PF00631_consen    2 QEKDQLKREIEQLRQELER   20 (68)
T ss_dssp             THHHHHHHHHHHHHHHHTS
T ss_pred             hHHHHHHHHHHHHHHHHcc
Confidence            5677888888999999873


No 13 
>PF02831 gpW:  gpW;  InterPro: IPR004174 GpW is a 68 residue protein known to be present in phage particles. Extracts of phage-infected cells lacking GpW contain DNA-filled heads, and active tails, but no infectious virions. GpW is required for the addition of GpFII to the head, which is, in turn, required for the attachment of tails. Since GpFII and tails are known to be attached at the connector, GpW is also likely to assemble at this site. The addition of GpW to filled heads increases the DNase resistance of the packaged DNA, suggesting that GpW either forms a plug at the connector to prevent ejection of the DNA, or binds directly to the DNA. The large number of positively charged residues in GpW (its calculated pI is 10.8) is consistent with a role in DNA interaction [].; GO: 0019067 viral assembly, maturation, egress, and release; PDB: 2L6Q_A 2L6R_A 1HYW_A.
Probab=41.35  E-value=22  Score=22.59  Aligned_cols=19  Identities=16%  Similarity=0.461  Sum_probs=15.9

Q ss_pred             hHHHHHHHHHHHHHHHHHH
Q psy16824         30 NEADLMKNYLTQVRQETGL   48 (85)
Q Consensus        30 ~E~d~lr~Yl~QlRqE~~~   48 (85)
                      .....|+.|+.+|..+||.
T Consensus        38 a~i~~L~~yI~~L~~~Lg~   56 (68)
T PF02831_consen   38 ANIGDLRAYIQQLEAQLGI   56 (68)
T ss_dssp             GGHHHHHHHHHHHHHHTTS
T ss_pred             CCHHHHHHHHHHHHHHhCc
Confidence            4557799999999999984


No 14 
>PHA02865 MHC-like TNF binding protein; Provisional
Probab=39.54  E-value=13  Score=30.16  Aligned_cols=11  Identities=64%  Similarity=1.226  Sum_probs=9.2

Q ss_pred             CCCCCCCCCcc
Q psy16824         11 LPGEAGFPLNS   21 (85)
Q Consensus        11 iPGd~gFpLn~   21 (85)
                      =|||||||+|.
T Consensus       261 e~ge~~~~~~~  271 (338)
T PHA02865        261 EPGEPGFPTNT  271 (338)
T ss_pred             cccccccccce
Confidence            36999999984


No 15 
>TIGR03142 cytochro_ccmI cytochrome c-type biogenesis protein CcmI. This TPR repeat-containing protein is the CcmI protein (also called CycH) of c-type cytochrome biogenesis. CcmI is thought to act as an apo-cytochrome c chaperone. This model describes the N-terminal region of the protein, Members of this protein family
Probab=39.00  E-value=28  Score=23.25  Aligned_cols=16  Identities=19%  Similarity=0.156  Sum_probs=12.8

Q ss_pred             HHHHHHHHHHHHHHhc
Q psy16824         40 TQVRQETGLRVCERVF   55 (85)
Q Consensus        40 ~QlRqE~~~RL~e~vy   55 (85)
                      .|+|+|+..||++-+=
T Consensus        63 ~~~~~El~rrLL~d~~   78 (117)
T TIGR03142        63 EAARAELQRRLLADIP   78 (117)
T ss_pred             HHHHHHHHHHHHHCcc
Confidence            6788999999997543


No 16 
>COG2232 Predicted ATP-dependent carboligase related to biotin carboxylase [General function prediction only]
Probab=38.84  E-value=44  Score=27.86  Aligned_cols=28  Identities=39%  Similarity=0.565  Sum_probs=23.2

Q ss_pred             CcCCCC---CCCCCCcccccCCCChhHHHHH
Q psy16824          8 KFDLPG---EAGFPLNSVYAKPQSQNEADLM   35 (85)
Q Consensus         8 ~F~iPG---d~gFpLn~~y~~P~~~~E~d~l   35 (85)
                      ..|-||   +.|-||-+++++-.++++++.|
T Consensus       334 DiP~~GtviekgePl~sviA~~nt~~~a~~~  364 (389)
T COG2232         334 DIPRPGTVIEKGEPLCSVIASSNTRSGAESM  364 (389)
T ss_pred             cCCCCCcccCCCCceeeeeeccCCHHHHHHH
Confidence            456666   8999999999998888888766


No 17 
>PRK06553 lipid A biosynthesis lauroyl acyltransferase; Provisional
Probab=37.51  E-value=21  Score=27.18  Aligned_cols=35  Identities=11%  Similarity=0.221  Sum_probs=26.3

Q ss_pred             CCCCCCCcccccCCCChhHHHHHHHHHHHHHHHHHHHHH
Q psy16824         13 GEAGFPLNSVYAKPQSQNEADLMKNYLTQVRQETGLRVC   51 (85)
Q Consensus        13 Gd~gFpLn~~y~~P~~~~E~d~lr~Yl~QlRqE~~~RL~   51 (85)
                      +.-|+|++.+|.+..|+    .+..++.++|...|.+++
T Consensus       153 ~~~~~~~~~vyr~~~n~----~~d~~i~~~R~~~g~~~i  187 (308)
T PRK06553        153 AAFGLDVTVLFRPPNNP----YAARKVLEARRTTMGGLV  187 (308)
T ss_pred             HHcCCceEEEEecCCCh----HHHHHHHHHHHHcCCCcc
Confidence            34578999999887764    566778888888776654


No 18 
>PRK08943 lipid A biosynthesis (KDO)2-(lauroyl)-lipid IVA acyltransferase; Validated
Probab=35.97  E-value=23  Score=27.02  Aligned_cols=34  Identities=21%  Similarity=0.422  Sum_probs=26.5

Q ss_pred             CCCCCCcccccCCCChhHHHHHHHHHHHHHHHHHHHHH
Q psy16824         14 EAGFPLNSVYAKPQSQNEADLMKNYLTQVRQETGLRVC   51 (85)
Q Consensus        14 d~gFpLn~~y~~P~~~~E~d~lr~Yl~QlRqE~~~RL~   51 (85)
                      .-|+|+..||.+..|+    .+..++.+.|+..|.+++
T Consensus       152 ~~~~~~~~vyr~~~n~----~~d~~~~~~R~~~g~~~i  185 (314)
T PRK08943        152 SQGQPMAAMFHNQRNP----LFDWLWNRVRRRFGGRLH  185 (314)
T ss_pred             hcCCCccEEEeCCCCH----HHHHHHHHHHhhcCCeee
Confidence            3478999999987764    567788899998876655


No 19 
>cd05493 Bromo_ALL-1 Bromodomain, ALL-1 like proteins. ALL-1 is a vertebrate homologue of Drosophila trithorax and is often affected in chromosomal rearrangements that are linked to acute leukemias, such as acute lymphocytic leukemia (ALL). Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine.
Probab=35.79  E-value=41  Score=23.92  Aligned_cols=19  Identities=26%  Similarity=0.566  Sum_probs=16.6

Q ss_pred             HHHHHHhcCCCCCCCChhh
Q psy16824         48 LRVCERVFNTPDGKPSKWW   66 (85)
Q Consensus        48 ~RL~e~vy~~~~~~psKwW   66 (85)
                      .|++|+||.--+.+.+|+|
T Consensus       113 ik~me~vf~Wf~~~d~k~W  131 (131)
T cd05493         113 IKLMESVFPWFNSEDPKLW  131 (131)
T ss_pred             HHHHHHhcccccccCCCCC
Confidence            7899999987777888888


No 20 
>PF08802 CytB6-F_Fe-S:  Cytochrome B6-F complex Fe-S subunit ;  InterPro: IPR014909 The cytochrome b6-f complex mediates electron transfer between photosystem II (PSII) and photosystem I (PSI), cyclic electron flow around PSI, and state transitions. The cytochrome b6-f complex has 4 large subunits, these are: cytochrome b6, subunit IV (17 kDa polypeptide, PetD), cytochrome f and the Rieske protein, while the 4 small subunits are: PetG, PetL, PetM and PetN. The complex functions as a dimer.  This protein corresponds to the alpha helical transmembrane domain of the cytochrome b6-f complex Rieske iron-sulphur subunit. ; GO: 0009496 plastoquinol-plastocyanin reductase activity, 0051537 2 iron, 2 sulfur cluster binding, 0055114 oxidation-reduction process, 0042651 thylakoid membrane; PDB: 1Q90_R 1VF5_D 2E75_D 2E74_D 2E76_D 2D2C_Q 2ZT9_D.
Probab=35.33  E-value=7  Score=22.68  Aligned_cols=13  Identities=23%  Similarity=0.473  Sum_probs=10.1

Q ss_pred             hccchhhhhcccC
Q psy16824         69 FSKKRFMDKSLTA   81 (85)
Q Consensus        69 F~KRkFM~KsLs~   81 (85)
                      +.||+|||.-+.|
T Consensus         6 m~RR~lmN~ll~G   18 (39)
T PF08802_consen    6 MSRRQLMNLLLGG   18 (39)
T ss_dssp             HHHHHHHHHHHHH
T ss_pred             hhHHHHHHHHHHh
Confidence            6899999986543


No 21 
>PF13709 DUF4159:  Domain of unknown function (DUF4159)
Probab=34.98  E-value=34  Score=25.16  Aligned_cols=14  Identities=29%  Similarity=0.517  Sum_probs=12.1

Q ss_pred             ChhHHHHHHHHHHH
Q psy16824         28 SQNEADLMKNYLTQ   41 (85)
Q Consensus        28 ~~~E~d~lr~Yl~Q   41 (85)
                      +.+|.+.||.||++
T Consensus        68 s~~e~~~Lr~Yl~~   81 (207)
T PF13709_consen   68 SDEEIANLRRYLEN   81 (207)
T ss_pred             CHHHHHHHHHHHHc
Confidence            67899999999865


No 22 
>PF11300 DUF3102:  Protein of unknown function (DUF3102);  InterPro: IPR021451 This entry is represented by Streptococcus phage 7201, Orf2. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.
Probab=34.87  E-value=60  Score=22.94  Aligned_cols=21  Identities=33%  Similarity=0.294  Sum_probs=17.0

Q ss_pred             HHHHHHHHHHHHH-------HHHHHHHH
Q psy16824         32 ADLMKNYLTQVRQ-------ETGLRVCE   52 (85)
Q Consensus        32 ~d~lr~Yl~QlRq-------E~~~RL~e   52 (85)
                      +..++.|.+|.++       |+|.||++
T Consensus         7 a~EIn~~~~q~~~~~l~~~iEIGrrL~e   34 (130)
T PF11300_consen    7 AAEINAIKRQTGQILLESAIEIGRRLKE   34 (130)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3457788888887       89999998


No 23 
>COG4114 FhuF Uncharacterized Fe-S protein [General function prediction only]
Probab=34.82  E-value=40  Score=26.60  Aligned_cols=41  Identities=27%  Similarity=0.487  Sum_probs=31.1

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHH-----hc--CCCCCCCChhhhhhccc
Q psy16824         32 ADLMKNYLTQVRQETGLRVCER-----VF--NTPDGKPSKWWLCFSKK   72 (85)
Q Consensus        32 ~d~lr~Yl~QlRqE~~~RL~e~-----vy--~~~~~~psKwWm~F~KR   72 (85)
                      .-.|-=||+.+||++|.++++-     +|  -=.+|.+|-+|.|...|
T Consensus       176 gyli~wyl~e~k~~lg~~~~~s~r~~lf~e~ll~nG~dnPl~rtv~~r  223 (251)
T COG4114         176 GYLINWYLTEMKQLLGEDLVESLRHALFFEKLLPNGQDNPLWRTVVLR  223 (251)
T ss_pred             HHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHhccCCCCCcHHHHHHHh
Confidence            3456779999999999999872     22  11458899999998877


No 24 
>COG5500 Predicted integral membrane protein [Function unknown]
Probab=33.80  E-value=29  Score=25.50  Aligned_cols=26  Identities=42%  Similarity=0.636  Sum_probs=20.3

Q ss_pred             CCCccccc--CCCChhHHHHHHHHHHHH
Q psy16824         17 FPLNSVYA--KPQSQNEADLMKNYLTQV   42 (85)
Q Consensus        17 FpLn~~y~--~P~~~~E~d~lr~Yl~Ql   42 (85)
                      -|||.-.+  .|++.+-++.|.+||+|-
T Consensus       108 VPLNdALAaa~Pas~~ga~lW~~yLt~W  135 (159)
T COG5500         108 VPLNDALAAANPASTDGAKLWTSYLTNW  135 (159)
T ss_pred             ccchHHHhhcCCCcccHHHHHHHHHHhH
Confidence            47776543  378899999999999973


No 25 
>PF00140 Sigma70_r1_2:  Sigma-70 factor, region 1.2;  InterPro: IPR009042 The bacterial core RNA polymerase complex, which consists of five subunits, is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme []. RNA polymerase recruits alternative sigma factors as a means of switching on specific regulons. Most bacteria express a multiplicity of sigma factors. Two of these factors, sigma-70 (gene rpoD), generally known as the major or primary sigma factor, and sigma-54 (gene rpoN or ntrA) direct the transcription of a wide variety of genes. The other sigma factors, known as alternative sigma factors, are required for the transcription of specific subsets of genes.  With regard to sequence similarity, sigma factors can be grouped into two classes, the sigma-54 and sigma-70 families. Sequence alignments of the sigma70 family members reveal four conserved regions that can be further divided into subregions eg. sub-region 2.2, which may be involved in the binding of the sigma factor to the core RNA polymerase; and sub-region 4.2, which seems to harbor a DNA-binding 'helix-turn-helix' motif involved in binding the conserved -35 region of promoters recognised by the major sigma factors [, ]. ; GO: 0003677 DNA binding, 0003700 sequence-specific DNA binding transcription factor activity, 0016987 sigma factor activity, 0006352 transcription initiation, DNA-dependent, 0006355 regulation of transcription, DNA-dependent; PDB: 1SMY_F 1IW7_P 1SIG_A 3IYD_F 2BE5_F 2A6E_F 2CW0_F 2A69_P 2A6H_P 3DXJ_P ....
Probab=33.67  E-value=48  Score=18.13  Aligned_cols=13  Identities=23%  Similarity=0.600  Sum_probs=10.7

Q ss_pred             HHHHHHHHHHHHH
Q psy16824         32 ADLMKNYLTQVRQ   44 (85)
Q Consensus        32 ~d~lr~Yl~QlRq   44 (85)
                      .|.++.||++++.
T Consensus         1 ~D~l~~Yl~ei~~   13 (37)
T PF00140_consen    1 SDSLRLYLKEIGR   13 (37)
T ss_dssp             HHHHHHHHHHHHH
T ss_pred             CcHHHHHHHHHcC
Confidence            3789999999974


No 26 
>PF13006 Nterm_IS4:  Insertion element 4 transposase N-terminal;  InterPro: IPR024473 This entry represents the N-terminal domain of Insertion Element 4 transposases [].
Probab=33.56  E-value=39  Score=22.76  Aligned_cols=21  Identities=29%  Similarity=0.378  Sum_probs=18.7

Q ss_pred             HHHHHHHHHHHHHHHHHHhcC
Q psy16824         36 KNYLTQVRQETGLRVCERVFN   56 (85)
Q Consensus        36 r~Yl~QlRqE~~~RL~e~vy~   56 (85)
                      ++-++|.||-||..-++.+|.
T Consensus        77 ~SaitqARqRLG~~pl~~LF~   97 (98)
T PF13006_consen   77 PSAITQARQRLGAEPLRWLFE   97 (98)
T ss_pred             hHHHHHHHHHcCHHHHHHHhc
Confidence            567899999999999999994


No 27 
>PRK15202 type III secretion chaperone protein SigE; Provisional
Probab=33.44  E-value=47  Score=23.47  Aligned_cols=34  Identities=24%  Similarity=0.442  Sum_probs=25.5

Q ss_pred             CCCcccccCCCChhHHHHHH------HHHHHHHHHHHHHH
Q psy16824         17 FPLNSVYAKPQSQNEADLMK------NYLTQVRQETGLRV   50 (85)
Q Consensus        17 FpLn~~y~~P~~~~E~d~lr------~Yl~QlRqE~~~RL   50 (85)
                      =-|-++|.-|+...|+|.+-      +-.+|||||...|.
T Consensus        77 t~L~Al~rlp~~s~~ee~~~g~e~fIs~vr~L~~~~a~r~  116 (117)
T PRK15202         77 TALVALYRLPQTSTEEEALTGFELFISNVKQLKEEYARRM  116 (117)
T ss_pred             ceEEEeeecCCCCcHHHHHhhHHHHHHHHHHHHHHHhhcc
Confidence            34778899998777776543      44689999998874


No 28 
>PF10372 YojJ:  Bacterial membrane-spanning protein N-terminus;  InterPro: IPR019457  This entry is found at the N terminus of a family of putative membrane-spanning bacterial proteins. These proteins often contain IPR003390 from INTERPRO towards the C terminus. ; PDB: 2FB5_A.
Probab=32.79  E-value=61  Score=20.76  Aligned_cols=16  Identities=31%  Similarity=0.634  Sum_probs=14.4

Q ss_pred             HHHHHHHHHHHHHHHH
Q psy16824         32 ADLMKNYLTQVRQETG   47 (85)
Q Consensus        32 ~d~lr~Yl~QlRqE~~   47 (85)
                      +..+++||.|+.++++
T Consensus        12 K~~lk~~L~~I~~~~~   27 (70)
T PF10372_consen   12 KEQLKQYLEQIEEEIS   27 (70)
T ss_dssp             HHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            4789999999999997


No 29 
>PF11464 Rbsn:  Rabenosyn Rab binding domain;  InterPro: IPR021565  Rabenosyn-5 (Rbsn) is a multivalent effector with interacts with the Rab family.Rsbn contains distinct Rab4 and Rab5 binding sites within residues 264-500 and 627-784 respectively []. Rab proteins are GTPases involved in the regulation of all stages of membrane trafficking []. ; PDB: 1Z0K_B 1YZM_A 1Z0J_B.
Probab=31.78  E-value=58  Score=19.09  Aligned_cols=15  Identities=13%  Similarity=0.589  Sum_probs=11.9

Q ss_pred             HHHHHHHHHHHHHHH
Q psy16824         31 EADLMKNYLTQVRQE   45 (85)
Q Consensus        31 E~d~lr~Yl~QlRqE   45 (85)
                      ....+++|++|.|..
T Consensus         4 Qi~~I~~~I~qAk~~   18 (42)
T PF11464_consen    4 QINIIESYIKQAKAA   18 (42)
T ss_dssp             HHHHHHHHHHHHHHT
T ss_pred             HHHHHHHHHHHHHHh
Confidence            457789999998864


No 30 
>PF04536 TPM:  TLP18.3, Psb32 and MOLO-1 founding proteins of phosphatase;  InterPro: IPR007621 This is a family of uncharacterised proteins. They are found in both eukarya and eubacteria. In eubacteria the region is towards the N-terminal of the protein and is accompanied by an N-terminal signal sequence. The C-terminal of eubacterial proteins typically contains one or more putative transmembrane regions. In eukaryotes the region is not accompanied by a signal sequence.; PDB: 3PTJ_A 3PW9_A 3PVH_A 2KPT_A 2KW7_A.
Probab=31.61  E-value=24  Score=22.29  Aligned_cols=24  Identities=25%  Similarity=0.527  Sum_probs=18.8

Q ss_pred             ChhHHHHHHHHHHHHHHHHHHHHH
Q psy16824         28 SQNEADLMKNYLTQVRQETGLRVC   51 (85)
Q Consensus        28 ~~~E~d~lr~Yl~QlRqE~~~RL~   51 (85)
                      +++|.+.+.+.++++.+++|.+++
T Consensus         3 s~~~~~~l~~~l~~~~~~t~~~i~   26 (119)
T PF04536_consen    3 SQEERERLNQALAKLEKKTGVQIV   26 (119)
T ss_dssp             -HHHHHHHHHHHHHHHHHC--EEE
T ss_pred             CHHHHHHHHHHHHHHHHhhCCEEE
Confidence            467889999999999999998774


No 31 
>cd00555 Maf Nucleotide binding protein Maf. Maf has been implicated in inhibition of septum formation in eukaryotes, bacteria and archaea, but homologs in B.subtilis and S.cerevisiae are nonessential for cell division. Maf has been predicted to be a nucleotide- or nucleic acid-binding protein with structural similarity to the hypoxanthine/xanthine NTP pyrophosphatase Ham1 from Methanococcus jannaschii, RNase H from Escherichia coli, and some other nucleotide or RNA-binding proteins.
Probab=31.20  E-value=37  Score=24.47  Aligned_cols=16  Identities=31%  Similarity=0.547  Sum_probs=12.7

Q ss_pred             cccccCCCChhHHHHH
Q psy16824         20 NSVYAKPQSQNEADLM   35 (85)
Q Consensus        20 n~~y~~P~~~~E~d~l   35 (85)
                      |.++.||.|.+|+..|
T Consensus        74 g~il~KP~~~~eA~~~   89 (180)
T cd00555          74 GRILGKPKDREEAREM   89 (180)
T ss_pred             CEEEcCCCCHHHHHHH
Confidence            4689999999888543


No 32 
>PF08938 HBS1_N:  HBS1 N-terminus;  InterPro: IPR015033 This domain is found in various eukaryotic HBS1-like proteins. ; PDB: 1UFZ_A 3IZQ_1.
Probab=30.96  E-value=8.3  Score=24.36  Aligned_cols=22  Identities=27%  Similarity=0.286  Sum_probs=18.8

Q ss_pred             ChhHHHHHHHHHHHHHHHHHHH
Q psy16824         28 SQNEADLMKNYLTQVRQETGLR   49 (85)
Q Consensus        28 ~~~E~d~lr~Yl~QlRqE~~~R   49 (85)
                      +..|.+.|++-|-|+|..||.-
T Consensus        21 s~ed~~~L~~~l~~vr~~Lg~~   42 (79)
T PF08938_consen   21 SPEDQAQLYSCLPQVREVLGDY   42 (79)
T ss_dssp             TCHHHHHHCHHCCCHHHHCCCC
T ss_pred             CHHHHHHHHHHHHHHHHHHccc
Confidence            4578899999999999999853


No 33 
>KOG1205|consensus
Probab=29.59  E-value=40  Score=26.46  Aligned_cols=34  Identities=21%  Similarity=0.511  Sum_probs=27.7

Q ss_pred             CCCCCCCCCCcccccCCCChhHHHHHHHHHHHHHHHHHH
Q psy16824         10 DLPGEAGFPLNSVYAKPQSQNEADLMKNYLTQVRQETGL   48 (85)
Q Consensus        10 ~iPGd~gFpLn~~y~~P~~~~E~d~lr~Yl~QlRqE~~~   48 (85)
                      +|=|--+.|++++|.+.+     =.+.+|..-||+|+..
T Consensus       150 SiaG~~~~P~~~~Y~ASK-----~Al~~f~etLR~El~~  183 (282)
T KOG1205|consen  150 SIAGKMPLPFRSIYSASK-----HALEGFFETLRQELIP  183 (282)
T ss_pred             ccccccCCCcccccchHH-----HHHHHHHHHHHHHhhc
Confidence            455777889999998643     5699999999999976


No 34 
>PF05438 TRH:  Thyrotropin-releasing hormone (TRH);  InterPro: IPR008857 This family consists of several thyrotropin-releasing hormone (TRH) proteins. Thyrotropin-Releasing Hormone (TRH; pyroGlu-His-Pro-NH2), originally isolated as a hypothalamic neuropeptide hormone, most likely acts also as a neuromodulator and/or neurotransmitter in the central nervous system (CNS). This interpretation is supported by the identification of a peptidase localised on the surface of neuronal cells which has been termed TRH-degrading ectoenzyme (TRH-DE) since it selectively inactivates TRH. TRH has been used clinically for the treatment of spinocerebellar degeneration and disturbance of consciousness in humans [].; GO: 0005184 neuropeptide hormone activity, 0009755 hormone-mediated signaling pathway, 0005576 extracellular region
Probab=28.78  E-value=44  Score=25.79  Aligned_cols=47  Identities=19%  Similarity=0.312  Sum_probs=32.4

Q ss_pred             ccccccCcCCCCCCCCCCccccc---CCCC----hhHHHHHHHHHHHHHHHHHH
Q psy16824          2 YSLALAKFDLPGEAGFPLNSVYA---KPQS----QNEADLMKNYLTQVRQETGL   48 (85)
Q Consensus         2 ~~lal~~F~iPGd~gFpLn~~y~---~P~~----~~E~d~lr~Yl~QlRqE~~~   48 (85)
                      .+|++.++.|||-.+.|=-+.-.   ++.+    ++|.-.||+-|+.++.|-+.
T Consensus         7 l~L~l~~~~v~gg~~~~~e~~~~~~~~~lDdlLqrAe~llLrs~L~k~~~e~~~   60 (212)
T PF05438_consen    7 LALTLCNTGVPGGQAAQEEAETGEETPGLDDLLQRAESLLLRSILQKLQEEQGN   60 (212)
T ss_pred             HHHHHhhccccCCccCccccccccCCccHHHHHHHHHHHHHHHHHHHHHhhhcc
Confidence            46778899999988877333322   1211    45556788899999988876


No 35 
>cd07624 BAR_SNX7_30 The Bin/Amphiphysin/Rvs (BAR) domain of Sorting Nexins 7 and 30. BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions. Sorting nexins (SNXs) are Phox homology (PX) domain containing proteins that are involved in regulating membrane traffic and protein sorting in the endosomal system. SNXs differ from each other in their lipid-binding specificity, subcellular localization and specific function in the endocytic pathway. A subset of SNXs also contain BAR domains. The PX-BAR structural unit determines the specific membrane targeting of SNXs. This subfamily consists of SNX7, SNX30, and similar proteins. The specific functions of SNX7 and SNX30 have not been elucidated. BAR domains form dimers that bind to membranes, induce membrane bending and curvature, and may also be involved in protein-protein interactions.
Probab=28.29  E-value=97  Score=22.38  Aligned_cols=29  Identities=28%  Similarity=0.418  Sum_probs=22.1

Q ss_pred             CcccccCCCChhHHHHHHHHHHHHHHHHH
Q psy16824         19 LNSVYAKPQSQNEADLMKNYLTQVRQETG   47 (85)
Q Consensus        19 Ln~~y~~P~~~~E~d~lr~Yl~QlRqE~~   47 (85)
                      +++.|.-..-..|=+.++.|+.+|-+-|+
T Consensus         3 ~~~~~~~~~~d~eF~e~~eyi~~L~~~l~   31 (200)
T cd07624           3 TSTMYLLKNRSPEFDKMNEYLTLFGEKLG   31 (200)
T ss_pred             ccchhhhcCCCccHHHHHHHHHHHHHHhH
Confidence            46666554445677999999999998886


No 36 
>PRK06946 lipid A biosynthesis lauroyl acyltransferase; Provisional
Probab=28.06  E-value=39  Score=25.50  Aligned_cols=32  Identities=16%  Similarity=0.229  Sum_probs=25.1

Q ss_pred             CCCCcccccCCCChhHHHHHHHHHHHHHHHHHHHHH
Q psy16824         16 GFPLNSVYAKPQSQNEADLMKNYLTQVRQETGLRVC   51 (85)
Q Consensus        16 gFpLn~~y~~P~~~~E~d~lr~Yl~QlRqE~~~RL~   51 (85)
                      ++|+..||.+..|+    .+-.++.++|...|.+++
T Consensus       133 ~~~~~~vyr~~~n~----~~d~~~~~~R~~~g~~~i  164 (293)
T PRK06946        133 RRRVGSLYTPMSNP----LLDAIAKAARGRFGAEMV  164 (293)
T ss_pred             cCCceEEeeCCCCH----HHHHHHHHHHHhcCCCcc
Confidence            57899999887764    566778899998876665


No 37 
>PRK00234 Maf-like protein; Reviewed
Probab=27.93  E-value=47  Score=24.26  Aligned_cols=16  Identities=19%  Similarity=0.308  Sum_probs=12.7

Q ss_pred             cccccCCCChhHHHHH
Q psy16824         20 NSVYAKPQSQNEADLM   35 (85)
Q Consensus        20 n~~y~~P~~~~E~d~l   35 (85)
                      +.++.||.|.+|+..|
T Consensus        76 g~Il~KP~~~~eA~~m   91 (192)
T PRK00234         76 GQILGKPHTFERAREQ   91 (192)
T ss_pred             CEECCCCCCHHHHHHH
Confidence            3689999999888544


No 38 
>PF12891 Glyco_hydro_44:  Glycoside hydrolase family 44;  InterPro: IPR024745 This is a family of putative bacterial glycoside hydrolases.; PDB: 3IK2_A 3ZQ9_A 2YJQ_B 2YKK_A 2YIH_A 2EEX_A 2EQD_A 2E0P_A 2E4T_A 2EO7_A ....
Probab=27.90  E-value=26  Score=27.20  Aligned_cols=20  Identities=35%  Similarity=0.675  Sum_probs=15.5

Q ss_pred             HHHHHHHHHHHHH---HHHHHHH
Q psy16824         33 DLMKNYLTQVRQE---TGLRVCE   52 (85)
Q Consensus        33 d~lr~Yl~QlRqE---~~~RL~e   52 (85)
                      +.|.=||+|+|++   .|.|||+
T Consensus       206 ~fl~wyL~qm~~~~~~~G~RLLD  228 (239)
T PF12891_consen  206 DFLPWYLDQMKEAEKSTGKRLLD  228 (239)
T ss_dssp             SHHHHHHHHHHHHHHHHTS-S-S
T ss_pred             chHHHHHHHHHHhhhhcCceeee
Confidence            4889999999975   7899998


No 39 
>KOG2940|consensus
Probab=27.70  E-value=39  Score=27.33  Aligned_cols=17  Identities=35%  Similarity=0.728  Sum_probs=14.7

Q ss_pred             HHHHHHHHHHHHHHhcC
Q psy16824         40 TQVRQETGLRVCERVFN   56 (85)
Q Consensus        40 ~QlRqE~~~RL~e~vy~   56 (85)
                      +-|.+|.|-||.+|||+
T Consensus        52 dylkeeig~rlaDrvfD   68 (325)
T KOG2940|consen   52 DYLKEEIGDRLADRVFD   68 (325)
T ss_pred             hHHHHHHHHHHHHHHHH
Confidence            44679999999999995


No 40 
>PF11544 Spc42p:  Spindle pole body component Spc42p;  InterPro: IPR021611  Spc42p is a 42kDa component of the S.cerevisiae spindle body that localises to the electron dense central region of the SPB. Spc42p is a phosphoprotein which forms a polymeric layer at the periphery of the SPB central plaque. This functions during SPB duplication and also facilitates the attachment of the SPB to the nuclear membrane. ; PDB: 2Q6Q_B.
Probab=27.58  E-value=80  Score=20.72  Aligned_cols=19  Identities=21%  Similarity=0.204  Sum_probs=16.0

Q ss_pred             hhHHHHHHHHHHHHHHHHH
Q psy16824         29 QNEADLMKNYLTQVRQETG   47 (85)
Q Consensus        29 ~~E~d~lr~Yl~QlRqE~~   47 (85)
                      .+|-|.+..|+.+||.+|-
T Consensus        18 ~eEI~rLn~lv~sLR~KLi   36 (76)
T PF11544_consen   18 QEEIDRLNILVGSLRGKLI   36 (76)
T ss_dssp             HHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHHHHH
Confidence            4567999999999999874


No 41 
>TIGR02208 lipid_A_msbB lipid A biosynthesis (KDO)2-(lauroyl)-lipid IVA acyltransferase. This family consists of MsbB in E. coli and closely related proteins in other species. MsbB is homologous to HtrB (TIGR02207) and acts immediately after it in the biosynthesis of KDO-2 lipid A (also called Re LPS and Re endotoxin). These two enzymes act after creation of KDO-2 lipid IV-A by addition of the KDO sugars.
Probab=27.17  E-value=30  Score=26.17  Aligned_cols=34  Identities=21%  Similarity=0.419  Sum_probs=26.1

Q ss_pred             CCCCCCcccccCCCChhHHHHHHHHHHHHHHHHHHHHH
Q psy16824         14 EAGFPLNSVYAKPQSQNEADLMKNYLTQVRQETGLRVC   51 (85)
Q Consensus        14 d~gFpLn~~y~~P~~~~E~d~lr~Yl~QlRqE~~~RL~   51 (85)
                      .-|.|+..||.+..|+    .+..++.+.|...|.+++
T Consensus       143 ~~~~~~~~v~r~~~n~----~~d~~~~~~R~~~g~~~i  176 (305)
T TIGR02208       143 SQGLPMVTMFNNHKNP----LFDWLWNRVRSRFGGHVY  176 (305)
T ss_pred             hcCCCceEEeeCCCCH----HHHHHHHHHHhcCCCcee
Confidence            3478999999988774    467778889988776655


No 42 
>PRK00078 Maf-like protein; Reviewed
Probab=26.99  E-value=49  Score=24.20  Aligned_cols=16  Identities=31%  Similarity=0.457  Sum_probs=12.7

Q ss_pred             cccccCCCChhHHHHH
Q psy16824         20 NSVYAKPQSQNEADLM   35 (85)
Q Consensus        20 n~~y~~P~~~~E~d~l   35 (85)
                      +.++.||.|++|+..|
T Consensus        78 g~ilgKP~~~eeA~~~   93 (192)
T PRK00078         78 GKVLGKPKDEEDAFEM   93 (192)
T ss_pred             CEEeCCCCCHHHHHHH
Confidence            4689999999888554


No 43 
>PF03221 HTH_Tnp_Tc5:  Tc5 transposase DNA-binding domain;  InterPro: IPR006600 This entry represents a DNA-binding helix-turn-helix domain found in the pogo family of transposable elements, the centromere protein Cenp-B, and yeast PCD2. There is extensive sequence similarity between Cenp-B and transposase proteins encoded by the pogo superfamily of transposable elements, which includes the human Tigger and Jerky elements []. The HTH domain is composed of three alpha-helices, with the second and third helices connected via a turn comprise the helix-turn-helix motif. Helix 3 is termed the recognition helix as it binds the DNA major groove, as in other HTHs []. This conserved DNA-binding domain is found in the following proteins:   Cenp-B (major centromere autoantigen B or centromere protein B), which appears to organise arrays of centromere satellite DNA into a higher order structure that then direct centromere formation and kinetochore assembly in mammalian chromosomes. The N terminus of Cenp-B contains two DNA-binding HTH domains, which bind to adjacent major grooves of DNA: a psq-type HTH domain followed by a CenpB-type HTH domain, which together bind specifically to the Cenp-B box, which occurs in alpha-satellite DNA in human centromeres [].      Pogo family transposable elements includes both Tigger and Jerky elements []. Pogo contains two open reading frames flanked by inverted repeats. The N-terminal region of pogo transposase contains a Cenp-B-type HTH DNA-binding domain []. Mammalian jerky protein, involved in epileptic seizures in mice [].     PDC2 (Pyruvate DeCarboxylase 2), which is a transcription factor required for the synthesis of the glycolytic enzyme pyruvate decarboxylase, required for high level expression of both the THI and the PDC genes. PDC2 may be important for a high basal level of PDC gene expression or play a positive role in the autoregulation control of PDC1 and PDC5 [, ].  ; PDB: 1HLV_A 1IUF_A.
Probab=26.43  E-value=1.3e+02  Score=16.95  Aligned_cols=45  Identities=18%  Similarity=0.367  Sum_probs=21.5

Q ss_pred             hhHHHHHHHHHHHHH-----------HHHHHHHHHHhcCC--CCCC-CChhhhhhccch
Q psy16824         29 QNEADLMKNYLTQVR-----------QETGLRVCERVFNT--PDGK-PSKWWLCFSKKR   73 (85)
Q Consensus        29 ~~E~d~lr~Yl~QlR-----------qE~~~RL~e~vy~~--~~~~-psKwWm~F~KRk   73 (85)
                      ++.++.|-.|+.+.+           ++.+..|.+.-...  ...+ .+.|+-.|-||-
T Consensus         2 ~~~E~~L~~wi~~~~~~g~~vt~~~i~~~A~~i~~~~~~~~~~~~~~s~~W~~~F~~Rh   60 (66)
T PF03221_consen    2 PELEKALVEWIKRMRRKGFPVTREMIREKAKEIAELAKSPGPPEFKASKGWLDRFKKRH   60 (66)
T ss_dssp             HHHHHHHHHHHHHHCGCT---SCHHHHHHHHHHHH-SCCCT-TT-S--CHHHHHHHHHT
T ss_pred             HHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHhhcccccCcCCcccHHHHHHHHHc
Confidence            345566667776653           23334443221211  2223 446777787774


No 44 
>PF14357 DUF4404:  Domain of unknown function (DUF4404)
Probab=26.14  E-value=50  Score=21.36  Aligned_cols=13  Identities=23%  Similarity=0.442  Sum_probs=10.7

Q ss_pred             HHHHHHHHHHHHH
Q psy16824         35 MKNYLTQVRQETG   47 (85)
Q Consensus        35 lr~Yl~QlRqE~~   47 (85)
                      +++.|.+||+||.
T Consensus         2 L~~~L~~L~~eL~   14 (85)
T PF14357_consen    2 LQELLEKLHQELE   14 (85)
T ss_pred             HHHHHHHHHHHHh
Confidence            6788888888887


No 45 
>PF09396 Thrombin_light:  Thrombin light chain;  InterPro: IPR018992 Thrombin is a Na+-activated, allosteric serine protease that functions in blood homeostasis, inflammation and wound healing []. Thrombin (or coagulation factor II) is an enzyme that cleaves bonds after Arg and Lys, converts fibrinogen to fibrin and activates factors V, VII, VIII, and (in complex with thrombomodulin) protein C []. Sodium binding is the major driving force behind the procoagulant, prothrombotic and signaling functions of the enzyme, but is dispensable for cleavage of the anticoagulant protein C []. Prothrombin is activated on the surface of a phospholipid membrane where factor Xa removes the activation peptide and cleaves the remaining part into light and heavy chains. This domain corresponds to the light chain of thrombin. ; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis, 0007596 blood coagulation, 0005576 extracellular region; PDB: 3HKI_D 2OCV_A 3EDX_A 3HK6_C 2PV9_A 2PUX_A 3HK3_A 1HRT_L 1TOC_G 1ETT_L ....
Probab=25.58  E-value=73  Score=19.43  Aligned_cols=28  Identities=29%  Similarity=0.629  Sum_probs=17.2

Q ss_pred             CCCCCCCcccccCC--CChhHHHHHHHHHH
Q psy16824         13 GEAGFPLNSVYAKP--QSQNEADLMKNYLT   40 (85)
Q Consensus        13 Gd~gFpLn~~y~~P--~~~~E~d~lr~Yl~   40 (85)
                      ||+.=-|.-||.+-  .|..|.|+|.+|+.
T Consensus        18 GEadCG~RPlFEkk~~~D~~E~ELleSY~~   47 (49)
T PF09396_consen   18 GEADCGLRPLFEKKSKKDKTEKELLESYIG   47 (49)
T ss_dssp             SGTTTT--TTTGGGT---TTHHHHHHHHHC
T ss_pred             cccccCcchhhhhhcccccCHHHHHHHHhc
Confidence            45555555566554  67899999999974


No 46 
>PRK00648 Maf-like protein; Reviewed
Probab=25.50  E-value=54  Score=23.95  Aligned_cols=16  Identities=38%  Similarity=0.509  Sum_probs=12.6

Q ss_pred             cccccCCCChhHHHHH
Q psy16824         20 NSVYAKPQSQNEADLM   35 (85)
Q Consensus        20 n~~y~~P~~~~E~d~l   35 (85)
                      +.++.||.|.+|+..|
T Consensus        79 g~ilgKP~~~~eA~~~   94 (191)
T PRK00648         79 GKVLGKPKDEEEAVEM   94 (191)
T ss_pred             CEEeCCCCCHHHHHHH
Confidence            3689999999888544


No 47 
>PF11035 SnAPC_2_like:  Small nuclear RNA activating complex subunit 2-like;  InterPro: IPR021281  This family of proteins is SnAPC subunit 2-like. SnAPC allows the transcription of human small nuclear RNA genes to occur by recognition of the proximal sequence element []. 
Probab=25.38  E-value=1.2e+02  Score=24.95  Aligned_cols=43  Identities=19%  Similarity=0.469  Sum_probs=32.1

Q ss_pred             hhHHHHHHHHHHHHHHHHHHHHHHHhcCC----------CCCCCChhhhhhccc
Q psy16824         29 QNEADLMKNYLTQVRQETGLRVCERVFNT----------PDGKPSKWWLCFSKK   72 (85)
Q Consensus        29 ~~E~d~lr~Yl~QlRqE~~~RL~e~vy~~----------~~~~psKwWm~F~KR   72 (85)
                      +.|+| +|.||+||+.-.....+.+|+..          +.-.|..-||-.+++
T Consensus        57 Rs~aE-I~~fl~~LK~rvareaiqkv~~~g~~~~R~~e~q~paPIEvW~dla~k  109 (344)
T PF11035_consen   57 RSEAE-IRDFLQQLKGRVAREAIQKVHPGGLKGPRRREAQPPAPIEVWMDLAEK  109 (344)
T ss_pred             cCHHH-HHHHHHHHHHHHHHHHHHHhcccccccccccccCCCccHHHHHHHHHH
Confidence            45544 89999999999999999999842          123477788876653


No 48 
>cd00068 GGL G protein gamma subunit-like motifs, the alpha-helical G-gamma chain dimerizes with the G-beta propeller subunit as part of the heterotrimeric G-protein complex; involved in signal transduction via G-protein-coupled receptors
Probab=25.37  E-value=58  Score=19.46  Aligned_cols=17  Identities=29%  Similarity=0.403  Sum_probs=13.6

Q ss_pred             HHHHHHHHHHHHHHHHH
Q psy16824         33 DLMKNYLTQVRQETGLR   49 (85)
Q Consensus        33 d~lr~Yl~QlRqE~~~R   49 (85)
                      +.++.=+.|||+|++..
T Consensus         2 ~~~~~~veqLr~el~~~   18 (57)
T cd00068           2 DQLKKEVEQLRKELSRE   18 (57)
T ss_pred             HHHHHHHHHHHHHHCCc
Confidence            56788889999999843


No 49 
>PRK00884 Maf-like protein; Reviewed
Probab=25.35  E-value=54  Score=24.12  Aligned_cols=16  Identities=19%  Similarity=0.343  Sum_probs=12.8

Q ss_pred             cccccCCCChhHHHHH
Q psy16824         20 NSVYAKPQSQNEADLM   35 (85)
Q Consensus        20 n~~y~~P~~~~E~d~l   35 (85)
                      |.++.||.|.+|+..|
T Consensus        76 g~ilgKP~~~eeA~~~   91 (194)
T PRK00884         76 GEITGKPLTEENARAQ   91 (194)
T ss_pred             CEEecCCCCHHHHHHH
Confidence            3689999999888554


No 50 
>PF09509 Hypoth_Ymh:  Protein of unknown function (Hypoth_ymh);  InterPro: IPR012654 This entry consists of a relatively rare prokaryotic protein family (about 8 occurrences per 200 genomes). Genes for members of this family appear to be associated variously with phage and plasmid regions, restriction system loci, transposons, and housekeeping genes. Their function is unknown.
Probab=25.31  E-value=1.1e+02  Score=20.90  Aligned_cols=25  Identities=24%  Similarity=0.578  Sum_probs=22.3

Q ss_pred             HHHHHHHHHHHHHHHH------HHHHHhcCC
Q psy16824         33 DLMKNYLTQVRQETGL------RVCERVFNT   57 (85)
Q Consensus        33 d~lr~Yl~QlRqE~~~------RL~e~vy~~   57 (85)
                      |..+.+-..+|+-+|.      .|++.+|..
T Consensus        28 eA~K~l~~rvR~~sg~~~~dG~~L~~~aFs~   58 (125)
T PF09509_consen   28 EAVKSLEDRVREKSGLNDLDGADLMDQAFSS   58 (125)
T ss_pred             HHHHHHHHHHHHHHCCCCcchHHHHHHHhCC
Confidence            7788999999999998      999999973


No 51 
>COG2207 AraC AraC-type DNA-binding domain-containing proteins [Transcription]
Probab=25.30  E-value=99  Score=18.81  Aligned_cols=19  Identities=32%  Similarity=0.399  Sum_probs=16.3

Q ss_pred             HHHHHHHHHHHHHHHHHHH
Q psy16824         34 LMKNYLTQVRQETGLRVCE   52 (85)
Q Consensus        34 ~lr~Yl~QlRqE~~~RL~e   52 (85)
                      ...+|+.+.|.+.+.+|+.
T Consensus        64 s~~~~~~~~Rl~~A~~lL~   82 (127)
T COG2207          64 SPSQYLRQLRLEEARRLLR   82 (127)
T ss_pred             CHHHHHHHHHHHHHHHHHH
Confidence            3568999999999999885


No 52 
>TIGR00172 maf MAF protein. This nonessential gene causes inhibition of septation when overexpressed. A member of the family is found in the Archaeon Pyrococcus horikoshii and another in the round worm Caenorhabditis elegans.
Probab=25.06  E-value=55  Score=23.77  Aligned_cols=16  Identities=31%  Similarity=0.661  Sum_probs=12.8

Q ss_pred             cccccCCCChhHHHHH
Q psy16824         20 NSVYAKPQSQNEADLM   35 (85)
Q Consensus        20 n~~y~~P~~~~E~d~l   35 (85)
                      |.++.||.|.+|+..|
T Consensus        77 g~ilgKP~~~~eA~~~   92 (183)
T TIGR00172        77 GEIYGKPKDKEEAAEF   92 (183)
T ss_pred             CEEecCCCCHHHHHHH
Confidence            4689999999888554


No 53 
>PRK00032 Maf-like protein; Reviewed
Probab=24.88  E-value=56  Score=23.92  Aligned_cols=16  Identities=31%  Similarity=0.505  Sum_probs=12.6

Q ss_pred             cccccCCCChhHHHHH
Q psy16824         20 NSVYAKPQSQNEADLM   35 (85)
Q Consensus        20 n~~y~~P~~~~E~d~l   35 (85)
                      +.++.||.|++|+..|
T Consensus        77 g~IlgKP~~~eeA~~~   92 (190)
T PRK00032         77 GEVLEKPRDAADAAAM   92 (190)
T ss_pred             CEEecCCCCHHHHHHH
Confidence            3689999999888544


No 54 
>smart00224 GGL G protein gamma subunit-like motifs.
Probab=24.82  E-value=59  Score=19.83  Aligned_cols=16  Identities=25%  Similarity=0.410  Sum_probs=13.3

Q ss_pred             HHHHHHHHHHHHHHHH
Q psy16824         33 DLMKNYLTQVRQETGL   48 (85)
Q Consensus        33 d~lr~Yl~QlRqE~~~   48 (85)
                      +.+|.=+.|||+|++.
T Consensus         2 ~~~~~~ve~Lr~el~~   17 (63)
T smart00224        2 DQLRKEVEQLRKELSR   17 (63)
T ss_pred             hHHHHHHHHHHHHHCC
Confidence            5678888999999984


No 55 
>PF06183 DinI:  DinI-like family;  InterPro: IPR010391 This family of short proteins includes DNA-damage-inducible protein I (DinI) and related proteins. The SOS response, a set of cellular phenomena exhibited by eubacteria, is initiated by various causes that include DNA damage-induced replication arrest, and is positively regulated by the co- protease activity of RecA. Escherichia coli DinI, a LexA-regulated SOS gene product, shuts off the initiation of the SOS response when overexpressed in vivo. Biochemical and genetic studies indicated that DinI physically interacts with RecA to inhibit its co-protease activity []. The structure of DinI is known [].; PDB: 1GHH_A.
Probab=24.79  E-value=83  Score=19.53  Aligned_cols=18  Identities=17%  Similarity=0.211  Sum_probs=13.1

Q ss_pred             HHHHHHHHHHHHHHHHhcC
Q psy16824         38 YLTQVRQETGLRVCERVFN   56 (85)
Q Consensus        38 Yl~QlRqE~~~RL~e~vy~   56 (85)
                      -+..|++||..||.. +|+
T Consensus         5 a~~AL~~EL~kRl~~-~yP   22 (65)
T PF06183_consen    5 ALEALESELTKRLHR-QYP   22 (65)
T ss_dssp             HHHHHHHHHHHHHHH-H-S
T ss_pred             HHHHHHHHHHHHHHH-HCC
Confidence            467899999988775 554


No 56 
>PRK04056 Maf-like protein; Reviewed
Probab=24.57  E-value=58  Score=23.64  Aligned_cols=16  Identities=31%  Similarity=0.449  Sum_probs=12.4

Q ss_pred             cccccCCCChhHHHHH
Q psy16824         20 NSVYAKPQSQNEADLM   35 (85)
Q Consensus        20 n~~y~~P~~~~E~d~l   35 (85)
                      +.++.||.|.+|+..|
T Consensus        75 g~ilgKP~~~~eA~~~   90 (180)
T PRK04056         75 NKILRKAKDKEEAREM   90 (180)
T ss_pred             CEEecCCCCHHHHHHH
Confidence            3689999999887543


No 57 
>KOG4119|consensus
Probab=24.51  E-value=95  Score=20.04  Aligned_cols=21  Identities=24%  Similarity=0.360  Sum_probs=17.2

Q ss_pred             ChhHHHHHHHHHHHHHHHHHH
Q psy16824         28 SQNEADLMKNYLTQVRQETGL   48 (85)
Q Consensus        28 ~~~E~d~lr~Yl~QlRqE~~~   48 (85)
                      +.++.+.+|.-.+|||.|++.
T Consensus         5 ~~~~~~q~k~~VeqLk~e~~~   25 (71)
T KOG4119|consen    5 SNSKKPQMKKEVEQLKLEANI   25 (71)
T ss_pred             cccchHHHHHHHHHHHHHHHh
Confidence            345678899999999999974


No 58 
>COG1694 MazG Predicted pyrophosphatase [General function prediction only]
Probab=24.47  E-value=1.1e+02  Score=19.60  Aligned_cols=23  Identities=17%  Similarity=0.443  Sum_probs=17.1

Q ss_pred             HHHHHH--HHHHHHHHHHHHHHHhc
Q psy16824         33 DLMKNY--LTQVRQETGLRVCERVF   55 (85)
Q Consensus        33 d~lr~Y--l~QlRqE~~~RL~e~vy   55 (85)
                      |.++.+  +..+++|||..|...++
T Consensus        44 eai~~~~~~~~l~eELgDvL~~v~~   68 (102)
T COG1694          44 EAIRKEEDLEDLKEELGDVLADVLF   68 (102)
T ss_pred             HHHHhcCcHHHHHHHHHHHHHHHHH
Confidence            444444  68899999998888665


No 59 
>PRK14364 Maf-like protein; Provisional
Probab=24.40  E-value=58  Score=23.62  Aligned_cols=16  Identities=25%  Similarity=0.343  Sum_probs=12.4

Q ss_pred             cccccCCCChhHHHHH
Q psy16824         20 NSVYAKPQSQNEADLM   35 (85)
Q Consensus        20 n~~y~~P~~~~E~d~l   35 (85)
                      +.++.||.|.+|+..|
T Consensus        71 g~ilgKP~~~eeA~~~   86 (181)
T PRK14364         71 GQIIGKPDSKQHAFDI   86 (181)
T ss_pred             CEEecCCCCHHHHHHH
Confidence            3589999999888543


No 60 
>PRK00148 Maf-like protein; Reviewed
Probab=23.96  E-value=60  Score=23.81  Aligned_cols=16  Identities=25%  Similarity=0.302  Sum_probs=12.5

Q ss_pred             cccccCCCChhHHHHH
Q psy16824         20 NSVYAKPQSQNEADLM   35 (85)
Q Consensus        20 n~~y~~P~~~~E~d~l   35 (85)
                      |.++.||.|.+|+..|
T Consensus        75 g~Il~KP~~~eeA~~~   90 (194)
T PRK00148         75 GRLLGKPHTPEEAIER   90 (194)
T ss_pred             CEEecCCCCHHHHHHH
Confidence            3689999999888543


No 61 
>PRK01441 Maf-like protein; Reviewed
Probab=23.96  E-value=61  Score=24.02  Aligned_cols=16  Identities=19%  Similarity=0.237  Sum_probs=12.7

Q ss_pred             cccccCCCChhHHHHH
Q psy16824         20 NSVYAKPQSQNEADLM   35 (85)
Q Consensus        20 n~~y~~P~~~~E~d~l   35 (85)
                      +.++.||.|++|+..|
T Consensus        86 g~il~KP~~~~eA~~~  101 (207)
T PRK01441         86 RRILPKAELVDEASQC  101 (207)
T ss_pred             CEEcCCCCCHHHHHHH
Confidence            3689999999888554


No 62 
>PRK08905 lipid A biosynthesis lauroyl acyltransferase; Validated
Probab=23.90  E-value=37  Score=25.58  Aligned_cols=31  Identities=19%  Similarity=0.532  Sum_probs=25.0

Q ss_pred             CCCcccccCCCChhHHHHHHHHHHHHHHHHHHHHH
Q psy16824         17 FPLNSVYAKPQSQNEADLMKNYLTQVRQETGLRVC   51 (85)
Q Consensus        17 FpLn~~y~~P~~~~E~d~lr~Yl~QlRqE~~~RL~   51 (85)
                      +|+..+|.+..|    ..+..++.++|++.|.+++
T Consensus       124 ~~~~~v~r~~~n----~~~~~~~~~~R~~~g~~~i  154 (289)
T PRK08905        124 FPLTAMFRPPRK----AALRPLMEAGRARGNMRTA  154 (289)
T ss_pred             CCceEEEECCCC----HHHHHHHHHHhcccCCcee
Confidence            799999987665    4577788999999887776


No 63 
>PF09720 Unstab_antitox:  Putative addiction module component;  InterPro: IPR013406  This entry defines several short bacterial proteins, typically about 75 amino acids long, which are always found as part of a pair (at least) of small genes. The other protein in the pair always belongs to a family of plasmid stabilisation proteins (IPR007712 from INTERPRO). It is likely that this protein and its partner comprise some form of addiction module - a pair of genes consisting of a stable toxin and an unstable antitoxin which mediate programmed cell death [] - although these gene pairs are usually found on the bacterial main chromosome.
Probab=23.18  E-value=92  Score=17.90  Aligned_cols=31  Identities=19%  Similarity=0.431  Sum_probs=23.5

Q ss_pred             HHHHHHHHHHHhcCCCCCCCChhhhhhccchh
Q psy16824         43 RQETGLRVCERVFNTPDGKPSKWWLCFSKKRF   74 (85)
Q Consensus        43 RqE~~~RL~e~vy~~~~~~psKwWm~F~KRkF   74 (85)
                      |.+|...|++.+=.+ +..-+.||..-..|++
T Consensus         7 r~~L~e~L~~sl~~~-~~~~~~~w~~el~rR~   37 (54)
T PF09720_consen    7 RAELAEELWDSLDDP-DSEVEAWWKEELERRL   37 (54)
T ss_pred             HHHHHHHHHHHhccc-cccCcHHHHHHHHHHH
Confidence            667777888776654 3367899999988886


No 64 
>PF12019 GspH:  Type II transport protein GspH;  InterPro: IPR022346  GspH is involved in bacterial type II export systems []. Like all pilins, GspH has an N terminus alpha helix []. This helix is followed by nine beta strands forming two beta sheets, one of five antiparallel strands and one of four antiparallel strands []. GspH is a minor pseudopilin; it is expressed much less than other pseudopilins in the type II secretion pilus (major pilins) []. The function and localisation of minor pseudo-pilins are still to be fully unraveled []. It has been suggested that some minor pseudopilins may assemble either into the base or the tip of pili, or both. They function as initiators or regulators of pilus biogenesis and dynamics, and/or as adaptors between various pseudopilin component and other members of the T2SS []. ; GO: 0008565 protein transporter activity, 0015628 protein secretion by the type II secretion system, 0015627 type II protein secretion system complex; PDB: 2QV8_B 2KNQ_A.
Probab=23.13  E-value=1.3e+02  Score=17.96  Aligned_cols=22  Identities=23%  Similarity=0.092  Sum_probs=16.6

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHH
Q psy16824         31 EADLMKNYLTQVRQETGLRVCE   52 (85)
Q Consensus        31 E~d~lr~Yl~QlRqE~~~RL~e   52 (85)
                      |++.|.+.|++.|.|.-.|=-.
T Consensus         1 aA~~l~~~l~~Ar~eAi~~~~~   22 (114)
T PF12019_consen    1 AANRLAAALQLARSEAIRRGRP   22 (114)
T ss_dssp             -HHHHHHHHHHHHHHHHHHSS-
T ss_pred             CHHHHHHHHHHHHHHHHHCCCE
Confidence            5788999999999998655444


No 65 
>PRK14366 Maf-like protein; Provisional
Probab=23.06  E-value=64  Score=23.83  Aligned_cols=16  Identities=13%  Similarity=0.316  Sum_probs=12.9

Q ss_pred             cccccCCCChhHHHHH
Q psy16824         20 NSVYAKPQSQNEADLM   35 (85)
Q Consensus        20 n~~y~~P~~~~E~d~l   35 (85)
                      +.++.||.|.+|+..|
T Consensus        80 g~ilgKP~~~eeA~~m   95 (195)
T PRK14366         80 RRILLKAETEEQAEEY   95 (195)
T ss_pred             CEEecCCCCHHHHHHH
Confidence            3689999999888554


No 66 
>TIGR03418 chol_sulf_TF putative choline sulfate-utilization transcription factor. Members of this protein family are transcription factors of the LysR family. Their genes typically are divergently transcribed from choline-sulfatase genes. That enzyme makes choline, a precursor to the osmoprotectant glycine-betaine, available by hydrolysis of choline sulfate.
Probab=23.02  E-value=71  Score=22.87  Aligned_cols=21  Identities=19%  Similarity=0.311  Sum_probs=17.9

Q ss_pred             HHHHHHHHHHHHHHHHHHHHh
Q psy16824         34 LMKNYLTQVRQETGLRVCERV   54 (85)
Q Consensus        34 ~lr~Yl~QlRqE~~~RL~e~v   54 (85)
                      .+-..|++|..|+|.+|.+|-
T Consensus        31 avS~~Ik~LE~~lg~~LF~R~   51 (291)
T TIGR03418        31 AVSQQVKRLEEELGTPLFERG   51 (291)
T ss_pred             HHHHHHHHHHHHhCcHHhhcC
Confidence            377889999999999999873


No 67 
>PRK11716 DNA-binding transcriptional regulator IlvY; Provisional
Probab=22.54  E-value=32  Score=23.87  Aligned_cols=20  Identities=15%  Similarity=0.285  Sum_probs=17.0

Q ss_pred             HHHHHHHHHHHHHHHHHHHh
Q psy16824         35 MKNYLTQVRQETGLRVCERV   54 (85)
Q Consensus        35 lr~Yl~QlRqE~~~RL~e~v   54 (85)
                      +-+.|++|++|+|..|.+|.
T Consensus         8 vS~~I~~LE~~lg~~Lf~R~   27 (269)
T PRK11716          8 LSRQIQRLEEELGQPLFVRD   27 (269)
T ss_pred             HHHHHHHHHHHhCCeeEEec
Confidence            77889999999998888763


No 68 
>PRK10219 DNA-binding transcriptional regulator SoxS; Provisional
Probab=22.47  E-value=1.2e+02  Score=19.08  Aligned_cols=19  Identities=16%  Similarity=0.125  Sum_probs=16.3

Q ss_pred             HHHHHHHHHHHHHHHHHHH
Q psy16824         34 LMKNYLTQVRQETGLRVCE   52 (85)
Q Consensus        34 ~lr~Yl~QlRqE~~~RL~e   52 (85)
                      ...+|+.++|-|-+.+++.
T Consensus        49 s~~~~i~~~Rl~~a~~~L~   67 (107)
T PRK10219         49 TLGDYIRQRRLLLAAVELR   67 (107)
T ss_pred             CHHHHHHHHHHHHHHHHHH
Confidence            4789999999999888875


No 69 
>KOG3382|consensus
Probab=22.41  E-value=1e+02  Score=22.65  Aligned_cols=26  Identities=23%  Similarity=0.586  Sum_probs=19.8

Q ss_pred             HHHHHHHHHHHHHHHHHHH-HHhcCCC
Q psy16824         33 DLMKNYLTQVRQETGLRVC-ERVFNTP   58 (85)
Q Consensus        33 d~lr~Yl~QlRqE~~~RL~-e~vy~~~   58 (85)
                      +.|...+|++|+|=|.|-+ .+.|..+
T Consensus        16 ~~l~k~~qmire~Gglr~~~~k~yrtd   42 (151)
T KOG3382|consen   16 KGLGKFMQMIREEGGLRCLLDKLYRTD   42 (151)
T ss_pred             HHHHHHHHHHHhhccHHHHHHHHHhcc
Confidence            5688899999999998854 4666443


No 70 
>PF12174 RST:  RCD1-SRO-TAF4 (RST) plant domain;  InterPro: IPR022003  This domain is found in many plant proteins including SROs and RCD1s; it is required for interaction with multiple plant transcription factors. 
Probab=22.39  E-value=1.2e+02  Score=19.18  Aligned_cols=18  Identities=17%  Similarity=0.379  Sum_probs=13.5

Q ss_pred             HHHHHHHHHHHHHHHHhc
Q psy16824         38 YLTQVRQETGLRVCERVF   55 (85)
Q Consensus        38 Yl~QlRqE~~~RL~e~vy   55 (85)
                      .++++|+..|.+|+-.+.
T Consensus        47 fvr~lR~IVGD~lL~s~I   64 (70)
T PF12174_consen   47 FVRKLRQIVGDQLLRSAI   64 (70)
T ss_pred             HHHHHHHHHHHHHHHHHH
Confidence            467888888888876554


No 71 
>cd04789 HTH_Cfa Helix-Turn-Helix DNA binding domain of the Cfa transcription regulator. Putative helix-turn-helix (HTH) MerR-like transcription regulator; the N-terminal domain of Cfa, a cyclopropane fatty acid synthase and other related methyltransferases. Based on sequence similarity, these proteins are predicted to function as transcription regulators that mediate responses to stress in eubacteria. They belong to the MerR superfamily of transcription regulators that promote transcription of various stress regulons by reconfiguring the operator sequence located between the -35 and -10 promoter elements. A typical MerR regulator is comprised of distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their N-terminal domains are homologous and contain a DNA-binding winged HTH motif, while the C-terminal domains are often dissimilar and bind specific coactivator molecules such as metal ions, drugs, and organic substrates.
Probab=22.32  E-value=57  Score=21.12  Aligned_cols=33  Identities=18%  Similarity=0.276  Sum_probs=17.5

Q ss_pred             CCCCCcccccCCCChhHHHHHHHHHHHHHHHHH
Q psy16824         15 AGFPLNSVYAKPQSQNEADLMKNYLTQVRQETG   47 (85)
Q Consensus        15 ~gFpLn~~y~~P~~~~E~d~lr~Yl~QlRqE~~   47 (85)
                      .||||..+-.-=......+.+.+.+.++.+++.
T Consensus        56 ~G~~l~ei~~~l~~~~~~~~l~~~~~~l~~~i~   88 (102)
T cd04789          56 GGLSLKECLACLQGKLTRSLLLERLSSLAEQIA   88 (102)
T ss_pred             CCCCHHHHHHHHcCCcHHHHHHHHHHHHHHHHH
Confidence            466665543221222334666777777766664


No 72 
>PF02099 Josephin:  Josephin;  InterPro: IPR006155 Human genes containing triplet repeats can markedly expand in length, leading to neuropsychiatric disease. Expansion of triplet repeats explains the phenomenon of anticipation, i.e. the increasing severity or earlier age of onset in successive generations in a pedigree []. A novel gene containing CAG repeats has been identified and mapped to chromosome 14q32.1, the genetic locus for Machado-Joseph disease (MJD). Normally, the gene contains 13-36 CAG repeats, but most clinically diagnosed patients and all affected members of a family with the clinical and pathological diagnosis of MJD show expansion of the repeat number, from 68-79 []. Similar abnormalities in related genes may give rise to diseases similar to MJD. MJD is a neurodegenerative disorder characterised by cerebellar ataxia, pyramidal and extra-pyramidal signs, peripheral nerve palsy, external ophtalmoplegia, facial and lingual fasciculation and bulging. The disease is autosomal dominant, with late onset of symptoms, generally after the fourth decade.; GO: 0008242 omega peptidase activity; PDB: 3O65_G 1YZB_A 2JRI_A 2DOS_A 2AGA_A.
Probab=22.32  E-value=34  Score=24.47  Aligned_cols=28  Identities=21%  Similarity=0.413  Sum_probs=20.9

Q ss_pred             CCCcccccCCCChhHHHHHHHHHHHHHHH
Q psy16824         17 FPLNSVYAKPQSQNEADLMKNYLTQVRQE   45 (85)
Q Consensus        17 FpLn~~y~~P~~~~E~d~lr~Yl~QlRqE   45 (85)
                      |=||+....|.--. .+.++.||+|+..|
T Consensus       121 yNLDS~l~~P~~i~-~~~l~~fL~~l~~~  148 (157)
T PF02099_consen  121 YNLDSKLKEPELIS-DFYLSAFLQQLQSE  148 (157)
T ss_dssp             EEECTTTSS-EEE--HHHHHHHHHHHHCC
T ss_pred             EeccCCCCCCcccC-HHHHHHHHHHHHhC
Confidence            55899999996554 47799999998765


No 73 
>PF15601 Imm42:  Immunity protein 42
Probab=22.29  E-value=1e+02  Score=21.97  Aligned_cols=40  Identities=23%  Similarity=0.266  Sum_probs=33.4

Q ss_pred             CCC--CcccccCCCChhHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy16824         16 GFP--LNSVYAKPQSQNEADLMKNYLTQVRQETGLRVCERVF   55 (85)
Q Consensus        16 gFp--Ln~~y~~P~~~~E~d~lr~Yl~QlRqE~~~RL~e~vy   55 (85)
                      -||  +|.+|..==+.+|.+...+=|.|+|+|+...--+.|-
T Consensus        37 kfP~Lm~~LY~g~L~~~~~~~A~~eL~~I~~~l~~~~p~~Vi   78 (134)
T PF15601_consen   37 KFPLLMNELYRGYLRYEELEKALKELEEIRKELKKFPPSEVI   78 (134)
T ss_pred             cchHHHHHHHcCCCCHHHHHHHHHHHHHHHHHHhcCChhhhe
Confidence            477  7888988778899999999999999999877666555


No 74 
>PRK14363 Maf-like protein; Provisional
Probab=22.07  E-value=70  Score=23.80  Aligned_cols=16  Identities=38%  Similarity=0.528  Sum_probs=12.8

Q ss_pred             cccccCCCChhHHHHH
Q psy16824         20 NSVYAKPQSQNEADLM   35 (85)
Q Consensus        20 n~~y~~P~~~~E~d~l   35 (85)
                      +.++.||.|.+|+..|
T Consensus        75 g~IlgKP~~~eeA~~~   90 (204)
T PRK14363         75 GNILGKPESLEEAKGM   90 (204)
T ss_pred             CEEcCCCCCHHHHHHH
Confidence            4689999999888554


No 75 
>PF10229 DUF2246:  Uncharacterized conserved protein (DUF2246);  InterPro: IPR019362  This entry represents conserved proteins found in the metazoa but absent from fungi. They are all approximately 300 residues in length and have no known function. 
Probab=21.91  E-value=45  Score=26.26  Aligned_cols=11  Identities=55%  Similarity=1.244  Sum_probs=9.9

Q ss_pred             CcCCCCCCCCC
Q psy16824          8 KFDLPGEAGFP   18 (85)
Q Consensus         8 ~F~iPGd~gFp   18 (85)
                      +|++||+.||.
T Consensus        51 rf~lpGnvg~~   61 (278)
T PF10229_consen   51 RFPLPGNVGFD   61 (278)
T ss_pred             CCCCCCcccch
Confidence            69999999986


No 76 
>PRK11546 zraP zinc resistance protein; Provisional
Probab=21.89  E-value=1.3e+02  Score=21.59  Aligned_cols=29  Identities=10%  Similarity=0.224  Sum_probs=16.2

Q ss_pred             CcccccCCCChhH-HHHHHHHHHHHHHHHH
Q psy16824         19 LNSVYAKPQSQNE-ADLMKNYLTQVRQETG   47 (85)
Q Consensus        19 Ln~~y~~P~~~~E-~d~lr~Yl~QlRqE~~   47 (85)
                      ||+++..|....+ ...+..=|.+||+.|.
T Consensus        77 LnALl~~~~pD~~kI~aL~kEI~~Lr~kL~  106 (143)
T PRK11546         77 YNALLTANPPDSSKINAVAKEMENLRQSLD  106 (143)
T ss_pred             HHHHHcCCCCCHHHHHHHHHHHHHHHHHHH
Confidence            6777777654444 3555555555555444


No 77 
>PRK06012 flhA flagellar biosynthesis protein FlhA; Validated
Probab=21.84  E-value=1.7e+02  Score=25.95  Aligned_cols=45  Identities=31%  Similarity=0.548  Sum_probs=32.1

Q ss_pred             cCCCC----CCCCCCcccccCCCChhHHH-------------------HHHHHHHHH--HHHHHHHHHHHh
Q psy16824          9 FDLPG----EAGFPLNSVYAKPQSQNEAD-------------------LMKNYLTQV--RQETGLRVCERV   54 (85)
Q Consensus         9 F~iPG----d~gFpLn~~y~~P~~~~E~d-------------------~lr~Yl~Ql--RqE~~~RL~e~v   54 (85)
                      -.+||    ||.|=+.+++-.|..+++++                   .+|.|+..+  |||+ .+|++++
T Consensus       447 ~~~~G~~~~eP~fg~~a~WI~~~~~~~a~~~Gytvvdp~svi~THLsevir~~a~ellgrQEv-q~LLD~L  516 (697)
T PRK06012        447 GELPGIPTKEPAFGLPAVWIDEALREQAQLLGYTVVDPSTVVATHLTEVIKNHAAELLGRQEV-QQLLDRL  516 (697)
T ss_pred             cCCCCccccCCCCCCCeEEeChhhHHHHHHcCCeeecHHHHHHHHHHHHHHHhHHHHhCHHHH-HHHHHHH
Confidence            45777    77899999999888777766                   566676666  7777 5555543


No 78 
>PRK13348 chromosome replication initiation inhibitor protein; Provisional
Probab=21.81  E-value=42  Score=24.14  Aligned_cols=20  Identities=10%  Similarity=0.204  Sum_probs=17.3

Q ss_pred             HHHHHHHHHHHHHHHHHHHH
Q psy16824         34 LMKNYLTQVRQETGLRVCER   53 (85)
Q Consensus        34 ~lr~Yl~QlRqE~~~RL~e~   53 (85)
                      .+-+.|++|++|+|.+|.+|
T Consensus        32 avS~~i~~LE~~lg~~Lf~R   51 (294)
T PRK13348         32 AVSQRIKALEESLGQPLLVR   51 (294)
T ss_pred             HHHHHHHHHHHHhCceeeec
Confidence            37788999999999999887


No 79 
>PF02545 Maf:  Maf-like protein;  InterPro: IPR003697 Maf is a putative inhibitor of septum formation in eukaryotes, bacteria, and archaea. The Maf protein shares substantial amino acid sequence identity with the Escherichia coli OrfE protein [].; GO: 0005737 cytoplasm; PDB: 2P5X_A 1EXC_B 1EX2_A.
Probab=21.81  E-value=64  Score=23.57  Aligned_cols=16  Identities=31%  Similarity=0.559  Sum_probs=10.4

Q ss_pred             cccccCCCChhHHHHH
Q psy16824         20 NSVYAKPQSQNEADLM   35 (85)
Q Consensus        20 n~~y~~P~~~~E~d~l   35 (85)
                      +.++.||.|.+|+..|
T Consensus        78 g~Il~KP~~~eeA~~~   93 (195)
T PF02545_consen   78 GEILGKPKDEEEAREM   93 (195)
T ss_dssp             TEEE-S-SSHHHHHHH
T ss_pred             eEEEeCCCCHHHHHHH
Confidence            4689999999888443


No 80 
>PF03190 Thioredox_DsbH:  Protein of unknown function, DUF255;  InterPro: IPR004879 This is a group of uncharacterised proteins.; PDB: 3IRA_A.
Probab=21.76  E-value=55  Score=23.82  Aligned_cols=32  Identities=22%  Similarity=0.487  Sum_probs=15.1

Q ss_pred             CCCCCCCcc------------cccCCCChhHHHHHHHHHHHHHH
Q psy16824         13 GEAGFPLNS------------VYAKPQSQNEADLMKNYLTQVRQ   44 (85)
Q Consensus        13 Gd~gFpLn~------------~y~~P~~~~E~d~lr~Yl~QlRq   44 (85)
                      |..|+||+-            -|-+|.++.....+.+.|.++++
T Consensus        99 ~~gGwPl~vfltPdg~p~~~~tY~P~~~~~g~~~f~~~l~~i~~  142 (163)
T PF03190_consen   99 GSGGWPLTVFLTPDGKPFFGGTYFPPEDRYGRPGFLQLLERIAE  142 (163)
T ss_dssp             S---SSEEEEE-TTS-EEEEESS--SS-BTTB--HHHHHHHHHH
T ss_pred             CCCCCCceEEECCCCCeeeeeeecCCCCCCCCccHHHHHHHHHH
Confidence            667888874            45666655555566666666653


No 81 
>PF01231 IDO:  Indoleamine 2,3-dioxygenase;  InterPro: IPR000898 Indoleamine 2,3-dioxgyenase (IDO, 1.13.11.42 from EC) [] is a cytosolic haem protein which, together with the hepatic enzyme tryptophan 2,3-dioxygenase, catalyzes the conversion of tryptophan and other indole derivatives to kynurenines. The physiological role of IDO is not fully understood but is of great interest, because IDO is widely distributed in human tissues, can be up-regulated via cytokines such as interferon-gamma, and can thereby modulate the levels of tryptophan, which is vital for cell growth. The degradative action of IDO on tryptophan leads to cell death by starvation of this essential and relatively scarce amino acid. IDO is a haem-containing enzyme of about 400 amino acids. Site-directed mutagenesis showed His346 (P14902 from SWISSPROT) to be essential for haem binding, indicating that this histidine residue may be the proximal ligand. Mutation of Asp274 also compromised the ability of IDO to bind haem, suggesting that Asp274 may coordinate to haem directly as the distal ligand or is essential in maintaining the conformation of the haem pocket []. Other proteins that are evolutionarily related to IDO include yeast hypothetical protein YJR078w; and myoglobin from the red muscle of the archaeogastropodic molluscs, Nordotis madaka (Giant abalone) and Sulculus diversicolor [, ]. These unusual globins lack enzymatic activity but have kept the haem group.; GO: 0020037 heme binding; PDB: 2D0U_A 2D0T_A.
Probab=21.76  E-value=74  Score=25.97  Aligned_cols=18  Identities=28%  Similarity=0.303  Sum_probs=13.8

Q ss_pred             HHHHHHHHHHHHHHHHHH
Q psy16824         34 LMKNYLTQVRQETGLRVC   51 (85)
Q Consensus        34 ~lr~Yl~QlRqE~~~RL~   51 (85)
                      -+-.||+|+|.|+...++
T Consensus       405 ~l~~fLK~~Rd~T~~a~~  422 (422)
T PF01231_consen  405 DLMPFLKQCRDETKEALI  422 (422)
T ss_dssp             HHHHHHHHHHHHHHHT--
T ss_pred             hHHHHHHHHHHHHHhhcC
Confidence            467899999999987654


No 82 
>PRK06628 lipid A biosynthesis lauroyl acyltransferase; Provisional
Probab=21.75  E-value=59  Score=24.47  Aligned_cols=32  Identities=16%  Similarity=0.262  Sum_probs=25.1

Q ss_pred             CCCCcccccCCCChhHHHHHHHHHHHHHHHHHHHHH
Q psy16824         16 GFPLNSVYAKPQSQNEADLMKNYLTQVRQETGLRVC   51 (85)
Q Consensus        16 gFpLn~~y~~P~~~~E~d~lr~Yl~QlRqE~~~RL~   51 (85)
                      |.|+..||.+..|    ..+..++.+.|...|.+++
T Consensus       137 ~~~~~~vyr~~~n----~~~d~~~~~~R~~~g~~~i  168 (290)
T PRK06628        137 YPKVAVIYRKANN----PYVNKLVNESRAGDKLRLI  168 (290)
T ss_pred             CCCeeEEEecCCC----HHHHHHHHHHHHhcCCcee
Confidence            5589999988766    4577778889998887776


No 83 
>COG4047 Uncharacterized protein conserved in archaea [Function unknown]
Probab=21.58  E-value=70  Score=25.11  Aligned_cols=13  Identities=46%  Similarity=1.168  Sum_probs=11.3

Q ss_pred             CCCCChhhhhhcc
Q psy16824         59 DGKPSKWWLCFSK   71 (85)
Q Consensus        59 ~~~psKwWm~F~K   71 (85)
                      .++.-.||.||+|
T Consensus        56 tgkGEewW~eF~k   68 (243)
T COG4047          56 TGKGEEWWWEFAK   68 (243)
T ss_pred             ccCcHHHHHHHHH
Confidence            3788899999997


No 84 
>PLN03143 nudix hydrolase; Provisional
Probab=21.21  E-value=89  Score=24.61  Aligned_cols=42  Identities=29%  Similarity=0.552  Sum_probs=29.4

Q ss_pred             cCcCCCCCCCCCCcccccCCCChhHH------HHHHHHHHHHHHHHHH
Q psy16824          7 AKFDLPGEAGFPLNSVYAKPQSQNEA------DLMKNYLTQVRQETGL   48 (85)
Q Consensus         7 ~~F~iPGd~gFpLn~~y~~P~~~~E~------d~lr~Yl~QlRqE~~~   48 (85)
                      ..|.||+.++-|+.-..++--+.++-      ..+++||..+.++++.
T Consensus        32 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~   79 (291)
T PLN03143         32 HSITLPGQPGQPVLVVAAPGISSSDFRKAIDSSLFRQWLKNLQSESGI   79 (291)
T ss_pred             eEEEccCCCCCceeEecCCCCCHHHHHhHhcChHHHHHHHHhhhcccc
Confidence            35889998888887665443333333      3469999999998764


No 85 
>COG1503 eRF1 Peptide chain release factor 1 (eRF1) [Translation, ribosomal structure and biogenesis]
Probab=20.97  E-value=63  Score=27.01  Aligned_cols=29  Identities=24%  Similarity=0.436  Sum_probs=23.8

Q ss_pred             CCCCCCcccccCCCChhHHHHHHHHHHHHHHHHH
Q psy16824         14 EAGFPLNSVYAKPQSQNEADLMKNYLTQVRQETG   47 (85)
Q Consensus        14 d~gFpLn~~y~~P~~~~E~d~lr~Yl~QlRqE~~   47 (85)
                      -.|+-|.++|-+|..     .+-.|.+++|+|.+
T Consensus        24 g~gt~lislyIPp~~-----qisdv~~~lr~e~s   52 (411)
T COG1503          24 GRGTELISLYIPPDR-----QISDVVNRLRDEYS   52 (411)
T ss_pred             CCCceEEEEecCCCC-----chHHHHHHHHhhcc
Confidence            467889999998764     38889999999987


No 86 
>PF14852 Fis1_TPR_N:  Fis1 N-terminal tetratricopeptide repeat; PDB: 1IYG_A 3UUX_C 1Y8M_A 2PQR_A 2PQN_A 3O48_A 1PC2_A 1NZN_A.
Probab=20.79  E-value=1e+02  Score=16.92  Aligned_cols=14  Identities=21%  Similarity=0.586  Sum_probs=11.5

Q ss_pred             HHHHHHHHHHhcCC
Q psy16824         44 QETGLRVCERVFNT   57 (85)
Q Consensus        44 qE~~~RL~e~vy~~   57 (85)
                      +..|..|++.+|.+
T Consensus        20 ~~~Gi~lLe~l~~~   33 (35)
T PF14852_consen   20 QQEGIALLEELYRD   33 (35)
T ss_dssp             HHHHHHHHHHHCCC
T ss_pred             HHHHHHHHHHHHhc
Confidence            45789999999964


No 87 
>PRK13480 3'-5' exoribonuclease YhaM; Provisional
Probab=20.68  E-value=1e+02  Score=24.41  Aligned_cols=40  Identities=15%  Similarity=0.371  Sum_probs=30.2

Q ss_pred             CCCCccccc-CCCChhHH-HHHHHHHHHHHHHHHHHHHHHhc
Q psy16824         16 GFPLNSVYA-KPQSQNEA-DLMKNYLTQVRQETGLRVCERVF   55 (85)
Q Consensus        16 gFpLn~~y~-~P~~~~E~-d~lr~Yl~QlRqE~~~RL~e~vy   55 (85)
                      .+.+..+.. .|.+.+|. +.+++|+.+++.+.=.+|++.+|
T Consensus        95 ~~d~~dfvp~s~~~~e~l~~el~~~i~~i~~~~l~~l~~~~~  136 (314)
T PRK13480         95 GVKISDFVEKAPLSKEEMQEEITQYIFEMENPNIQRITRHLL  136 (314)
T ss_pred             cCCHHHcCCCCCCCHHHHHHHHHHHHHHcCCHHHHHHHHHHH
Confidence            455555553 45566555 77899999999998899999999


No 88 
>PRK14368 Maf-like protein; Provisional
Probab=20.50  E-value=76  Score=23.34  Aligned_cols=16  Identities=25%  Similarity=0.470  Sum_probs=12.7

Q ss_pred             cccccCCCChhHHHHH
Q psy16824         20 NSVYAKPQSQNEADLM   35 (85)
Q Consensus        20 n~~y~~P~~~~E~d~l   35 (85)
                      +.++.||.|.+|+..|
T Consensus        79 g~ilgKP~~~~eA~~~   94 (193)
T PRK14368         79 GEIMGKPKDEADAVRM   94 (193)
T ss_pred             CEEecCCCCHHHHHHH
Confidence            3589999999888554


No 89 
>PRK14365 Maf-like protein; Provisional
Probab=20.43  E-value=78  Score=23.32  Aligned_cols=16  Identities=38%  Similarity=0.532  Sum_probs=12.3

Q ss_pred             cccccCCCChhHHHHH
Q psy16824         20 NSVYAKPQSQNEADLM   35 (85)
Q Consensus        20 n~~y~~P~~~~E~d~l   35 (85)
                      +.++.||.|.+|+..|
T Consensus        77 g~Il~KP~~~~eA~~~   92 (197)
T PRK14365         77 GEVLGKPASPENAEEM   92 (197)
T ss_pred             CEEecCCCCHHHHHHH
Confidence            3589999999888443


No 90 
>PRK13456 DNA protection protein DPS; Provisional
Probab=20.41  E-value=1.5e+02  Score=22.20  Aligned_cols=43  Identities=9%  Similarity=0.112  Sum_probs=28.1

Q ss_pred             HHHHHHHHHHHH-HHHH--HHHHHHhcC---CCCCCCChhhhhhccchhh
Q psy16824         32 ADLMKNYLTQVR-QETG--LRVCERVFN---TPDGKPSKWWLCFSKKRFM   75 (85)
Q Consensus        32 ~d~lr~Yl~QlR-qE~~--~RL~e~vy~---~~~~~psKwWm~F~KRkFM   75 (85)
                      .+.+...|.+.+ +|++  .+|.+|++.   +.+..|.+|| ..+++.+.
T Consensus        51 ~e~V~e~le~a~~EEl~HA~~lAeRI~qLGG~P~~~p~~~~-~ls~~~~~   99 (186)
T PRK13456         51 GEGLKEIAEDARLEDRNHFEALVPRIYELGGKLPRDIREFH-DISACPDA   99 (186)
T ss_pred             cHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCChHHHh-hhhcCccc
Confidence            355678889999 7776  688889983   2333455554 55566553


No 91 
>PRK13503 transcriptional activator RhaS; Provisional
Probab=20.21  E-value=1.2e+02  Score=21.67  Aligned_cols=19  Identities=16%  Similarity=0.242  Sum_probs=16.7

Q ss_pred             HHHHHHHHHHHHHHHHHHH
Q psy16824         34 LMKNYLTQVRQETGLRVCE   52 (85)
Q Consensus        34 ~lr~Yl~QlRqE~~~RL~e   52 (85)
                      ...+|+++.|-+-+.|||.
T Consensus       215 S~~~yi~~~Rl~~A~~LL~  233 (278)
T PRK13503        215 TPQRYLNRLRLLKARHLLR  233 (278)
T ss_pred             CHHHHHHHHHHHHHHHHHH
Confidence            4689999999999999886


No 92 
>PRK14361 Maf-like protein; Provisional
Probab=20.20  E-value=79  Score=23.09  Aligned_cols=16  Identities=25%  Similarity=0.470  Sum_probs=12.5

Q ss_pred             cccccCCCChhHHHHH
Q psy16824         20 NSVYAKPQSQNEADLM   35 (85)
Q Consensus        20 n~~y~~P~~~~E~d~l   35 (85)
                      +.++.||.|.+|+..|
T Consensus        72 g~ilgKP~~~eeA~~~   87 (187)
T PRK14361         72 GVLLAKPADEAENEAF   87 (187)
T ss_pred             CEEecCCCCHHHHHHH
Confidence            3689999999888544


No 93 
>PF12785 VESA1_N:  Variant erythrocyte surface antigen-1;  InterPro: IPR024751 This entry represents variant erythrocyte surface antigen 1, versions a and b, of Babesia. Babesia bovis is a tick-borne, intra-erythrocytic, protozoal parasite of cattle that shares many lifestyle parallels with the most virulent of the human malarial parasites, Plasmodium falciparum. Babesia uses antigenic variation to establish consistent infections of long duration. The two variants of VESA1, a and b, are expressed from different but closely related genes, and variation is achieved through the involvement of a segmental gene conversion mechanism and low-frequency epigenetic in situ switching of transcriptional activity from the VESA1 gene-pair to a possible other gene pair [].
Probab=20.19  E-value=39  Score=28.79  Aligned_cols=30  Identities=20%  Similarity=0.374  Sum_probs=20.1

Q ss_pred             CcccccCCCChhHHHHHHHHHHHHHHHHHH
Q psy16824         19 LNSVYAKPQSQNEADLMKNYLTQVRQETGL   48 (85)
Q Consensus        19 Ln~~y~~P~~~~E~d~lr~Yl~QlRqE~~~   48 (85)
                      ||.+|.--..-.-..+.|.||.||-|-|+.
T Consensus        39 ln~lfslvqglggtavvrtyidqlaqvLsa   68 (462)
T PF12785_consen   39 LNGLFSLVQGLGGTAVVRTYIDQLAQVLSA   68 (462)
T ss_pred             HHhHHHHHhccCCceeHHHHHHHHHHHHHH
Confidence            555554433333346789999999999964


No 94 
>PRK01526 Maf-like protein; Reviewed
Probab=20.15  E-value=79  Score=23.47  Aligned_cols=15  Identities=13%  Similarity=0.173  Sum_probs=12.1

Q ss_pred             ccccCCCChhHHHHH
Q psy16824         21 SVYAKPQSQNEADLM   35 (85)
Q Consensus        21 ~~y~~P~~~~E~d~l   35 (85)
                      .++.||.|.+|+..|
T Consensus        85 ~IlgKP~~~~eA~~m   99 (205)
T PRK01526         85 RILPKATTYEEVKNC   99 (205)
T ss_pred             EEecCCCCHHHHHHH
Confidence            589999999888544


No 95 
>COG3248 Tsx Nucleoside-binding outer membrane protein [Cell envelope biogenesis, outer membrane]
Probab=20.03  E-value=20  Score=28.60  Aligned_cols=27  Identities=41%  Similarity=0.410  Sum_probs=22.0

Q ss_pred             CcCCCCCCCCCCcccccCC-CChhHHHHH
Q psy16824          8 KFDLPGEAGFPLNSVYAKP-QSQNEADLM   35 (85)
Q Consensus         8 ~F~iPGd~gFpLn~~y~~P-~~~~E~d~l   35 (85)
                      ...+|+=-+|.|| +|.+| .|++|.+-+
T Consensus       153 Di~~p~f~~~~LN-~Y~kydgNr~~w~g~  180 (284)
T COG3248         153 DIDTPLFMSLSLN-VYAKYDGNRNEWDGY  180 (284)
T ss_pred             ccccCCcceEEEE-EEeccCCCccccCce
Confidence            4788999999999 99999 677776544


Done!