Query         psy18141
Match_columns 113
No_of_seqs    118 out of 172
Neff          4.4 
Searched_HMMs 46136
Date          Fri Aug 16 23:40:23 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy18141.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/18141hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG4100|consensus              100.0 1.1E-46 2.3E-51  273.2   9.8   99    1-100    17-115 (125)
  2 PF13233 Complex1_LYR_2:  Compl  99.3   4E-12 8.6E-17   87.6   5.3   85    1-89      6-103 (104)
  3 PF05347 Complex1_LYR:  Complex  98.4   3E-07 6.5E-12   57.1   3.4   50    1-50      8-59  (59)
  4 PF13232 Complex1_LYR_1:  Compl  98.2 3.3E-06 7.2E-11   53.4   4.5   51    1-51      8-60  (61)
  5 KOG4620|consensus               78.9     1.9 4.2E-05   29.7   2.4   34    1-34     16-49  (80)
  6 PF13220 DUF4028:  Protein of u  73.9     2.1 4.6E-05   28.2   1.4   20   13-32     17-36  (65)
  7 PF14304 CSTF_C:  Transcription  70.1     5.6 0.00012   24.8   2.6   27   69-95     16-42  (46)
  8 KOG3801|consensus               52.5      58  0.0012   23.2   5.5   57    1-57     14-72  (94)
  9 PF07813 LTXXQ:  LTXXQ motif fa  49.9      22 0.00048   22.9   3.0   24   77-100    14-37  (100)
 10 PF11460 DUF3007:  Protein of u  48.1      39 0.00084   24.4   4.2   37   39-87     63-99  (104)
 11 PF06299 DUF1045:  Protein of u  47.1      11 0.00024   28.8   1.3   43    9-51     86-136 (160)
 12 PF04282 DUF438:  Family of unk  47.0      26 0.00056   23.5   2.9   30    3-39      6-36  (71)
 13 KOG3346|consensus               40.9      15 0.00033   28.6   1.3   37   19-55    145-182 (185)
 14 PRK10363 cpxP periplasmic repr  40.0      25 0.00054   27.2   2.3   20   77-96     50-69  (166)
 15 PF14659 Phage_int_SAM_3:  Phag  39.2      54  0.0012   18.9   3.3   29   16-44     27-57  (58)
 16 PLN03085 nucleobase:cation sym  35.9      72  0.0016   25.8   4.4   69    8-94     70-139 (221)
 17 PRK10455 periplasmic protein;   35.5      31 0.00067   26.0   2.2   20   77-96     56-75  (161)
 18 PF13801 Metal_resist:  Heavy-m  35.1      38 0.00083   22.0   2.3   21   75-95     39-59  (125)
 19 PF00715 IL2:  Interleukin 2 Th  34.2      41 0.00088   25.6   2.6   20   33-52    125-144 (145)
 20 PRK12750 cpxP periplasmic repr  33.6      49  0.0011   25.1   3.0   20   74-93    133-152 (170)
 21 PF14794 DUF4479:  Domain of un  33.1      40 0.00087   22.4   2.2   19   75-93     45-63  (73)
 22 PRK12750 cpxP periplasmic repr  32.1      44 0.00095   25.4   2.5   18   77-94     52-69  (170)
 23 TIGR00787 dctP tripartite ATP-  31.6      56  0.0012   25.1   3.1   32   68-99    209-240 (257)
 24 PF11647 PMT_C:  C-terminal reg  31.2      67  0.0014   21.1   3.0   16   79-94     37-52  (66)
 25 smart00189 IL2 Interleukin-2 f  30.8      48   0.001   25.3   2.5   19   34-52    132-150 (154)
 26 COG0334 GdhA Glutamate dehydro  28.9 1.3E+02  0.0027   26.5   5.0   26   26-52     69-94  (411)
 27 PRK12751 cpxP periplasmic stre  28.0      68  0.0015   24.4   2.9   25   69-93    121-145 (162)
 28 KOG2712|consensus               27.7      45 0.00098   24.3   1.8   19   77-95     82-100 (108)
 29 PF08158 NUC130_3NT:  NUC130/3N  27.6      35 0.00076   21.4   1.1   15    2-16     22-37  (52)
 30 PRK07080 hypothetical protein;  27.4      65  0.0014   27.3   2.9   33   24-56    174-209 (317)
 31 PF00046 Homeobox:  Homeobox do  26.7 1.1E+02  0.0023   18.0   3.2   25   77-101     6-30  (57)
 32 smart00496 IENR2 Intron-encode  26.7      35 0.00076   18.9   0.9   18   66-83      1-18  (26)
 33 KOG4049|consensus               26.6      53  0.0011   26.9   2.2   33   17-52    172-204 (248)
 34 PF11304 DUF3106:  Protein of u  26.2 1.6E+02  0.0034   20.6   4.4   35    9-47     58-92  (107)
 35 PHA02902 putative IMV membrane  25.1      58  0.0012   22.0   1.8   27   64-90     38-66  (70)
 36 PF12616 DUF3775:  Protein of u  24.6      74  0.0016   21.4   2.3   22   74-95      9-30  (75)
 37 PF10655 DUF2482:  Hypothetical  24.1      61  0.0013   23.2   1.9   29   68-96      6-34  (100)
 38 KOG1070|consensus               23.8      97  0.0021   31.7   3.7   41   10-50   1641-1683(1710)
 39 PF06937 EURL:  EURL protein;    23.7      88  0.0019   26.3   3.0   41   68-108   209-250 (285)
 40 cd08816 CARD_RIG-I_1 Caspase a  23.3      56  0.0012   23.0   1.6   15   31-45     18-32  (89)
 41 PRK00306 50S ribosomal protein  22.6 1.4E+02  0.0031   19.0   3.3   19   73-91      4-23  (66)
 42 PF14842 FliG_N:  FliG N-termin  22.4      59  0.0013   22.5   1.6   40   65-104    17-56  (108)
 43 PF08863 YolD:  YolD-like prote  22.4      72  0.0016   20.6   1.9   20   80-99      2-21  (92)
 44 PF08035 Op_neuropeptide:  Opio  22.3      12 0.00027   21.6  -1.5   10   39-48      3-12  (31)
 45 PF12651 RHH_3:  Ribbon-helix-h  22.3 1.4E+02   0.003   17.7   3.0   29   77-105     8-36  (44)
 46 PF13024 DUF3884:  Protein of u  22.2      44 0.00096   22.7   0.9   11   12-22     22-32  (77)
 47 PF06853 DUF1249:  Protein of u  22.2      77  0.0017   22.8   2.2   15   35-49     98-112 (120)
 48 smart00309 PAH Pancreatic horm  21.9 1.4E+02  0.0029   17.7   2.8   22   31-52     11-32  (36)
 49 PF06207 DUF1002:  Protein of u  21.7 1.1E+02  0.0023   24.5   3.1   23   78-100   192-214 (225)
 50 PF12767 SAGA-Tad1:  Transcript  21.7      64  0.0014   25.4   1.8   31   42-83     25-60  (252)
 51 TIGR03759 conj_TIGR03759 integ  21.1 1.8E+02   0.004   23.2   4.2   43   41-91     16-61  (200)

No 1  
>KOG4100|consensus
Probab=100.00  E-value=1.1e-46  Score=273.17  Aligned_cols=99  Identities=51%  Similarity=0.798  Sum_probs=93.0

Q ss_pred             ChhhcchhcCChHHHHhhhHHHHHHHhhhcCCCHHHHHHHHHHHHHHHHHHHHHhcccCCCCCCcccCCCCHHHHhhcCH
Q psy18141          1 MEMATLHRGLPPELQILGHQYVRDEFRRHKGCNPSEAQVFMNEWAKYAVNLAEQLGIRGPKFAKPLGQSLDEQALNDLRD   80 (113)
Q Consensus         1 RrILRlHR~LP~~mR~LGD~YVK~EFRrHk~~np~~~~~Fl~eW~~Y~~~L~~Q~~~~g~~~~~~~G~~L~~~~le~lsd   80 (113)
                      |||||+||.||+++|+|||+||||||||||+|||.|+++||+||+.||.+|++|+++.|.. ++.+|.+||++.+++|||
T Consensus        17 kriLrlHr~lp~~~R~lGD~YVkdEFrrHk~vnp~~~~~FlteW~~Ya~~l~qql~~~g~~-K~~~g~~ld~d~le~l~d   95 (125)
T KOG4100|consen   17 KRILRLHRGLPAELRALGDQYVKDEFRRHKTVNPLEAQGFLTEWERYAVALSQQLSSAGKW-KGEIGSDLDSDKLEQLSD   95 (125)
T ss_pred             HHHHHHHccCChHHHHHHHHHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHhhhcCcc-cccccccCCHHHHHHcCH
Confidence            6899999999999999999999999999999999999999999999999999999876654 579999999999999999


Q ss_pred             HHHHHHHHHHHHhcCCCcch
Q psy18141         81 DQIAQLYELMLAATQPEVLE  100 (113)
Q Consensus        81 EQi~QLyEL~~ea~k~~~~~  100 (113)
                      |||+||||||+||+++..++
T Consensus        96 eqi~QLyELm~ea~k~~~~~  115 (125)
T KOG4100|consen   96 EQIGQLYELMKEAQKHMGDD  115 (125)
T ss_pred             HHHHHHHHHHHHHHhhhhhh
Confidence            99999999999999975543


No 2  
>PF13233 Complex1_LYR_2:  Complex1_LYR-like
Probab=99.30  E-value=4e-12  Score=87.57  Aligned_cols=85  Identities=32%  Similarity=0.433  Sum_probs=64.7

Q ss_pred             ChhhcchhcCChHHH--------HhhhHHH----HHHHhhhcCCC-HHHHHHHHHHHHHHHHHHHHHhcccCCCCCCccc
Q psy18141          1 MEMATLHRGLPPELQ--------ILGHQYV----RDEFRRHKGCN-PSEAQVFMNEWAKYAVNLAEQLGIRGPKFAKPLG   67 (113)
Q Consensus         1 RrILRlHR~LP~~mR--------~LGD~YV----K~EFRrHk~~n-p~~~~~Fl~eW~~Y~~~L~~Q~~~~g~~~~~~~G   67 (113)
                      |+|||.|+..|....        +.||.||    ++||++|+.++ +..+..|+.+|+.|+..|.+|..-..--..-..|
T Consensus         6 R~lLRel~~~~~~~~~~~~~~~~~~~~~~v~~~~~~ef~~~~~~~~~~~~~~~~~~~~~y~~~L~~qr~y~eL~erYnpg   85 (104)
T PF13233_consen    6 RSLLRELRRYPRRSKIHQLKAPRSPGDQYVLEQARAEFRRHKSANDEEEAQEFLQEWENYATFLKNQREYKELLERYNPG   85 (104)
T ss_pred             HHHHHHHHhhcchhhhhhhhchhHHHHHHHHHHHHHHHhccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc
Confidence            678999987665542        8999999    99999999997 9999999999999999999887310000123678


Q ss_pred             CCCCHHHHhhcCHHHHHHHHHH
Q psy18141         68 QSLDEQALNDLRDDQIAQLYEL   89 (113)
Q Consensus        68 ~~L~~~~le~lsdEQi~QLyEL   89 (113)
                      .+||+++.  +  ++..+++.|
T Consensus        86 ~~m~qee~--v--~~tA~~VGL  103 (104)
T PF13233_consen   86 LDMTQEER--V--RRTARRVGL  103 (104)
T ss_pred             cCCCHHHH--H--HHHHHHhcC
Confidence            88998876  2  555555543


No 3  
>PF05347 Complex1_LYR:  Complex 1 protein (LYR family);  InterPro: IPR008011 This family of short proteins includes proteins from the NADH-ubiquinone oxidoreductase complex I. The family includes the B14 subunit from bovine NADH-ubiquinone oxidoreductase B14 subunit Q02366 from SWISSPROT, and the B22 subunit from the human enzyme Q9Y6M9 from SWISSPROT. The family has been named LYR after a highly conserved tripeptide motif close to the N terminus of these proteins.  Members of this family also found in yeast which do contain this complex. In these organisms they are believed to be be required for iron-sulphur custer biogenesis.
Probab=98.40  E-value=3e-07  Score=57.14  Aligned_cols=50  Identities=14%  Similarity=0.265  Sum_probs=45.3

Q ss_pred             ChhhcchhcCChH-HHHhhhHHHHHHHhhhcCC-CHHHHHHHHHHHHHHHHH
Q psy18141          1 MEMATLHRGLPPE-LQILGHQYVRDEFRRHKGC-NPSEAQVFMNEWAKYAVN   50 (113)
Q Consensus         1 RrILRlHR~LP~~-mR~LGD~YVK~EFRrHk~~-np~~~~~Fl~eW~~Y~~~   50 (113)
                      |.+||+.+.+|.+ .|...-.+|++||+.|++. +|..|..+++.+..|+.+
T Consensus         8 R~lLR~~~~~~~~~~r~~~~~~iR~~Fr~n~~~~d~~~I~~~l~~g~~~l~~   59 (59)
T PF05347_consen    8 RQLLRAARSFPDDSEREYIRAEIRQEFRKNRNETDPEKIEELLKKGEEELEM   59 (59)
T ss_pred             HHHHHHHHHcCCcchHHHHHHHHHHHHHhcCCCCCHHHHHHHHHHHHHHhcC
Confidence            6899999988865 4999999999999999998 999999999999999863


No 4  
>PF13232 Complex1_LYR_1:  Complex1_LYR-like
Probab=98.18  E-value=3.3e-06  Score=53.36  Aligned_cols=51  Identities=22%  Similarity=0.304  Sum_probs=46.5

Q ss_pred             ChhhcchhcCC-hHHHHhhhHHHHHHHhhhcCC-CHHHHHHHHHHHHHHHHHH
Q psy18141          1 MEMATLHRGLP-PELQILGHQYVRDEFRRHKGC-NPSEAQVFMNEWAKYAVNL   51 (113)
Q Consensus         1 RrILRlHR~LP-~~mR~LGD~YVK~EFRrHk~~-np~~~~~Fl~eW~~Y~~~L   51 (113)
                      |.+||..+.+| +..|.--..+|+++||+|+++ +|..|..++.+|+.-+..|
T Consensus         8 R~lLR~~~~~~~~~~r~~~~~~ir~~Fr~~~~~td~~~i~~~l~~~~~~L~~l   60 (61)
T PF13232_consen    8 RQLLREASKFPDYNFRSYFRRRIRDRFRRNKNVTDPEKIAKLLKEGRKELELL   60 (61)
T ss_pred             HHHHHHhhhcCCcchHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHHc
Confidence            67899999888 788988899999999999998 9999999999999988765


No 5  
>KOG4620|consensus
Probab=78.93  E-value=1.9  Score=29.68  Aligned_cols=34  Identities=26%  Similarity=0.376  Sum_probs=24.4

Q ss_pred             ChhhcchhcCChHHHHhhhHHHHHHHhhhcCCCH
Q psy18141          1 MEMATLHRGLPPELQILGHQYVRDEFRRHKGCNP   34 (113)
Q Consensus         1 RrILRlHR~LP~~mR~LGD~YVK~EFRrHk~~np   34 (113)
                      |+.||+-+.=|.+-..==-.||..|||.|.+.+-
T Consensus        16 R~~lraa~~Kp~~~~~~~m~fvh~EFrk~~~lpr   49 (80)
T KOG4620|consen   16 RDLLRAARGKPGAEARRWMAFVHAEFRKHAGLPR   49 (80)
T ss_pred             HHHHHHhcCCCchHHHHHHHHHHHHHHHhcCCcH
Confidence            4667777766654444445799999999999843


No 6  
>PF13220 DUF4028:  Protein of unknown function (DUF4028)
Probab=73.92  E-value=2.1  Score=28.16  Aligned_cols=20  Identities=30%  Similarity=0.454  Sum_probs=16.6

Q ss_pred             HHHHhhhHHHHHHHhhhcCC
Q psy18141         13 ELQILGHQYVRDEFRRHKGC   32 (113)
Q Consensus        13 ~mR~LGD~YVK~EFRrHk~~   32 (113)
                      .+-.=||+|||.=|+.|.+-
T Consensus        17 vqnkng~~yvkkwf~khen~   36 (65)
T PF13220_consen   17 VQNKNGDQYVKKWFSKHENN   36 (65)
T ss_pred             EecccHhHHHHHHHHhccch
Confidence            34456999999999999975


No 7  
>PF14304 CSTF_C:  Transcription termination and cleavage factor C-terminal; PDB: 2J8P_A.
Probab=70.09  E-value=5.6  Score=24.83  Aligned_cols=27  Identities=26%  Similarity=0.324  Sum_probs=22.2

Q ss_pred             CCCHHHHhhcCHHHHHHHHHHHHHhcC
Q psy18141         69 SLDEQALNDLRDDQIAQLYELMLAATQ   95 (113)
Q Consensus        69 ~L~~~~le~lsdEQi~QLyEL~~ea~k   95 (113)
                      .|+++.++.|.++|..+.-.|++...+
T Consensus        16 ~Lt~eQI~~LPp~qR~~I~~Lr~ql~~   42 (46)
T PF14304_consen   16 QLTPEQINALPPDQRQQILQLRQQLMR   42 (46)
T ss_dssp             TS-HHHHHTS-HHHHTHHHHHHHHHH-
T ss_pred             cCCHHHHHhCCHHHHHHHHHHHHHHHh
Confidence            799999999999999999999987664


No 8  
>KOG3801|consensus
Probab=52.48  E-value=58  Score=23.19  Aligned_cols=57  Identities=21%  Similarity=0.243  Sum_probs=47.9

Q ss_pred             ChhhcchhcCC-hHHHHhhhHHHHHHHhhhcCC-CHHHHHHHHHHHHHHHHHHHHHhcc
Q psy18141          1 MEMATLHRGLP-PELQILGHQYVRDEFRRHKGC-NPSEAQVFMNEWAKYAVNLAEQLGI   57 (113)
Q Consensus         1 RrILRlHR~LP-~~mR~LGD~YVK~EFRrHk~~-np~~~~~Fl~eW~~Y~~~L~~Q~~~   57 (113)
                      |.+||.-+.+| =..|+.--.=.+|=||..|++ +|.++..-+.|=+.=+..|..|...
T Consensus        14 r~~lr~s~qfp~YNyReY~~RrtRD~Fr~Nkn~~Dp~e~~~l~~eakk~LevikRQ~ii   72 (94)
T KOG3801|consen   14 RNLLRESKQFPQYNYREYFQRRTRDTFRANKNVCDPAEIKKLYKEAKKQLEVIKRQSII   72 (94)
T ss_pred             HHHHHHHhhCCcccHHHHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            34666666677 677888888899999999998 9999999999999999999988753


No 9  
>PF07813 LTXXQ:  LTXXQ motif family protein;  InterPro: IPR012899 This five residue motif is found in a number of bacterial proteins bearing similarity to the protein CpxP (P32158 from SWISSPROT). This is a periplasmic protein that aids in combating extracytoplasmic protein-mediated toxicity, and may also be involved in the response to alkaline pH []. Another member of this family, Spy (P77754 from SWISSPROT) is also a periplasmic protein that may be involved in the response to stress []. The homology between CpxP and Spy may indicate that these two proteins are functionally related []. The motif is found repeated twice in many members of this entry. ; GO: 0042597 periplasmic space; PDB: 3ITF_B 3QZC_B 3OEO_D 3O39_A.
Probab=49.89  E-value=22  Score=22.93  Aligned_cols=24  Identities=21%  Similarity=0.101  Sum_probs=17.5

Q ss_pred             hcCHHHHHHHHHHHHHhcCCCcch
Q psy18141         77 DLRDDQIAQLYELMLAATQPEVLE  100 (113)
Q Consensus        77 ~lsdEQi~QLyEL~~ea~k~~~~~  100 (113)
                      +||++|..++..|+.+........
T Consensus        14 ~LT~eQ~~~~~~i~~~~~~~~~~~   37 (100)
T PF07813_consen   14 NLTDEQKAKWRAIRQAMKAKMKPL   37 (100)
T ss_dssp             --THHHHHHHHHHHHHHCTTS---
T ss_pred             CCCHHHHHHHHHHHHHHHHHHHhh
Confidence            589999999999999988776655


No 10 
>PF11460 DUF3007:  Protein of unknown function (DUF3007);  InterPro: IPR021562  This is a family of uncharacterised proteins found in bacteria and eukaryotes. 
Probab=48.09  E-value=39  Score=24.38  Aligned_cols=37  Identities=16%  Similarity=0.408  Sum_probs=24.0

Q ss_pred             HHHHHHHHHHHHHHHHhcccCCCCCCcccCCCCHHHHhhcCHHHHHHHH
Q psy18141         39 VFMNEWAKYAVNLAEQLGIRGPKFAKPLGQSLDEQALNDLRDDQIAQLY   87 (113)
Q Consensus        39 ~Fl~eW~~Y~~~L~~Q~~~~g~~~~~~~G~~L~~~~le~lsdEQi~QLy   87 (113)
                      -|+.+|++|-+...+..           ..+| +..++.||||.+.+|.
T Consensus        63 Ty~~Q~k~Ye~a~~~~~-----------~~~l-qkRle~l~~eE~~~L~   99 (104)
T PF11460_consen   63 TYMQQRKDYEEAVDQLT-----------NEEL-QKRLEELSPEELEALQ   99 (104)
T ss_pred             cHHHHHHHHHHHHHHHh-----------HHHH-HHHHHhCCHHHHHHHH
Confidence            48899999988775321           1112 2237888888877664


No 11 
>PF06299 DUF1045:  Protein of unknown function (DUF1045);  InterPro: IPR009389 This family consists of several hypothetical proteins from Agrobacterium, Rhizobium and Brucella species. The function of this family is unknown.
Probab=47.10  E-value=11  Score=28.80  Aligned_cols=43  Identities=33%  Similarity=0.425  Sum_probs=33.8

Q ss_pred             cCChHHHHh----hhHHHHHHHhhhcCC----CHHHHHHHHHHHHHHHHHH
Q psy18141          9 GLPPELQIL----GHQYVRDEFRRHKGC----NPSEAQVFMNEWAKYAVNL   51 (113)
Q Consensus         9 ~LP~~mR~L----GD~YVK~EFRrHk~~----np~~~~~Fl~eW~~Y~~~L   51 (113)
                      +|.+.++.+    |.=||-+|||=|++.    ++........-|+.+..-+
T Consensus        86 ~Ls~~Q~~~L~rWGYPYV~deFRFHmTLTg~l~~~~~~~~~~~l~~~f~~~  136 (160)
T PF06299_consen   86 GLSPRQRANLERWGYPYVMDEFRFHMTLTGRLDPAERARVEAALEAHFAPL  136 (160)
T ss_pred             cCCHHHHHHHHHhCCCceeCcCEeeEEeCCCCCHHHHHHHHHHHHHHHHhh
Confidence            466666654    888999999999975    5668888888888877765


No 12 
>PF04282 DUF438:  Family of unknown function (DUF438);  InterPro: IPR007380 This is a a group of uncharacterised proteins.
Probab=46.96  E-value=26  Score=23.47  Aligned_cols=30  Identities=33%  Similarity=0.561  Sum_probs=23.0

Q ss_pred             hhcchhcCChHHHHhhhHHHHHHHhhhcC-CCHHHHHH
Q psy18141          3 MATLHRGLPPELQILGHQYVRDEFRRHKG-CNPSEAQV   39 (113)
Q Consensus         3 ILRlHR~LP~~mR~LGD~YVK~EFRrHk~-~np~~~~~   39 (113)
                      |+|+|.+=+++       =||.+|..+-. ++|.+|..
T Consensus         6 i~~Lh~G~~~e-------~vk~~F~~~~~~Vs~~EI~~   36 (71)
T PF04282_consen    6 IKRLHEGEDPE-------EVKEEFKKLFSDVSASEISA   36 (71)
T ss_pred             HHHHhCCCCHH-------HHHHHHHHHHCCCCHHHHHH
Confidence            67888875554       48999999985 59998853


No 13 
>KOG3346|consensus
Probab=40.89  E-value=15  Score=28.63  Aligned_cols=37  Identities=22%  Similarity=0.496  Sum_probs=31.4

Q ss_pred             hHHHHHHHhhhcCC-CHHHHHHHHHHHHHHHHHHHHHh
Q psy18141         19 HQYVRDEFRRHKGC-NPSEAQVFMNEWAKYAVNLAEQL   55 (113)
Q Consensus        19 D~YVK~EFRrHk~~-np~~~~~Fl~eW~~Y~~~L~~Q~   55 (113)
                      +.+=..||.-|..- +|.-+..|.++|..|+-.+.+++
T Consensus       145 ~~F~~~~F~~~~~lg~PvA~~~f~aq~d~~~~~~~~~~  182 (185)
T KOG3346|consen  145 GNFNTRKFAKKYELGTPVAGNFFQAQWDDYVPKLYKQL  182 (185)
T ss_pred             cceeHHHHHHHhccCCchhhheehhhcchhhHHHHHhh
Confidence            45677899966655 99999999999999999998765


No 14 
>PRK10363 cpxP periplasmic repressor CpxP; Reviewed
Probab=39.98  E-value=25  Score=27.17  Aligned_cols=20  Identities=30%  Similarity=0.390  Sum_probs=17.7

Q ss_pred             hcCHHHHHHHHHHHHHhcCC
Q psy18141         77 DLRDDQIAQLYELMLAATQP   96 (113)
Q Consensus        77 ~lsdEQi~QLyEL~~ea~k~   96 (113)
                      .|||+|..|+.+||++....
T Consensus        50 dLTdaQRqQmRdLm~~~r~~   69 (166)
T PRK10363         50 SLTEHQRQQMRDLMQQARHE   69 (166)
T ss_pred             CCCHHHHHHHHHHHHHHHhc
Confidence            68999999999999988754


No 15 
>PF14659 Phage_int_SAM_3:  Phage integrase, N-terminal SAM-like domain; PDB: 2KD1_A 2KOB_A 2KHQ_A 3LYS_E 2KIW_A 2KKP_A.
Probab=39.15  E-value=54  Score=18.94  Aligned_cols=29  Identities=24%  Similarity=0.331  Sum_probs=13.7

Q ss_pred             HhhhHHHHHHHhhhc--CCCHHHHHHHHHHH
Q psy18141         16 ILGHQYVRDEFRRHK--GCNPSEAQVFMNEW   44 (113)
Q Consensus        16 ~LGD~YVK~EFRrHk--~~np~~~~~Fl~eW   44 (113)
                      ..-+.||...|-.-+  +++|.++..|+.++
T Consensus        27 ~~~~~~i~p~~g~~~i~~It~~~i~~~~~~l   57 (58)
T PF14659_consen   27 SIIKNHILPYFGNKKIKDITPRDIQNFINEL   57 (58)
T ss_dssp             HHHHHHHHHHTTSSBGGG--HHHHHHHHHHH
T ss_pred             HHHHHHHHHHHCcCcHHHCCHHHHHHHHHHc
Confidence            344455555553322  23666666666554


No 16 
>PLN03085 nucleobase:cation symporter-1; Provisional
Probab=35.89  E-value=72  Score=25.84  Aligned_cols=69  Identities=23%  Similarity=0.207  Sum_probs=34.4

Q ss_pred             hcCChHHHHhhhHHHHHHHhhhcCCCHHHHHHHHHHHHHHHHHHHHHhcccCCC-CCCcccCCCCHHHHhhcCHHHHHHH
Q psy18141          8 RGLPPELQILGHQYVRDEFRRHKGCNPSEAQVFMNEWAKYAVNLAEQLGIRGPK-FAKPLGQSLDEQALNDLRDDQIAQL   86 (113)
Q Consensus         8 R~LP~~mR~LGD~YVK~EFRrHk~~np~~~~~Fl~eW~~Y~~~L~~Q~~~~g~~-~~~~~G~~L~~~~le~lsdEQi~QL   86 (113)
                      |+|||++|--|+ .|++|                +.|..-+..-....-.+|.= .-...|+.|+-++ +.+-|.-.-.+
T Consensus        70 ~~~~~~~~~~~~-~~~~~----------------~~i~~laE~rI~eAm~~GeFdNLpG~GKPL~ldd-~p~vp~e~r~~  131 (221)
T PLN03085         70 RKLPPELRGRRN-SVRSE----------------TDIINVVEQRIWHSMEEGHFENLPGKGKPLNLST-NPHADPAEDTL  131 (221)
T ss_pred             ccCChhhccccc-cccch----------------HhHHHHHHHHHHHHHHcCCCCCCCCCCCCCCCcc-cCCCCHHHHHH
Confidence            689999987665 35554                33444444332222122211 1124677776544 33333334456


Q ss_pred             HHHHHHhc
Q psy18141         87 YELMLAAT   94 (113)
Q Consensus        87 yEL~~ea~   94 (113)
                      |.+|+.+-
T Consensus       132 nRILknaG  139 (221)
T PLN03085        132 YRILSKNG  139 (221)
T ss_pred             HHHHHHCC
Confidence            66666554


No 17 
>PRK10455 periplasmic protein; Reviewed
Probab=35.53  E-value=31  Score=26.03  Aligned_cols=20  Identities=30%  Similarity=0.416  Sum_probs=17.4

Q ss_pred             hcCHHHHHHHHHHHHHhcCC
Q psy18141         77 DLRDDQIAQLYELMLAATQP   96 (113)
Q Consensus        77 ~lsdEQi~QLyEL~~ea~k~   96 (113)
                      +|||+|..|+..||++....
T Consensus        56 ~LT~~Qrqqir~im~~~r~~   75 (161)
T PRK10455         56 NLTDAQKQQIRDIMKAQRDQ   75 (161)
T ss_pred             CCCHHHHHHHHHHHHHHHHh
Confidence            89999999999999887543


No 18 
>PF13801 Metal_resist:  Heavy-metal resistance; PDB: 3EPV_C 2Y3D_A 2Y3H_D 2Y3G_B 2Y3B_A 2Y39_A 3LAY_H.
Probab=35.07  E-value=38  Score=22.00  Aligned_cols=21  Identities=33%  Similarity=0.306  Sum_probs=16.3

Q ss_pred             HhhcCHHHHHHHHHHHHHhcC
Q psy18141         75 LNDLRDDQIAQLYELMLAATQ   95 (113)
Q Consensus        75 le~lsdEQi~QLyEL~~ea~k   95 (113)
                      .-.||+||..++.+++.+...
T Consensus        39 ~l~Lt~eQ~~~l~~~~~~~~~   59 (125)
T PF13801_consen   39 MLNLTPEQQAKLRALMDEFRQ   59 (125)
T ss_dssp             HS-TTHHHHHHHHHHHHHHHH
T ss_pred             hcCCCHHHHHHHHHHHHHHHH
Confidence            457899999999998887664


No 19 
>PF00715 IL2:  Interleukin 2 This family is a subset of the SCOP family.;  InterPro: IPR000779 T-Lymphocytes regulate the growth and differentiation of certain lymphopoietic and haemopoietic cells through the release of various secreted protein factors []. These factors, which include interleukin-2 (IL2), are secreted by lectin- or antigen-stimulated T-cells, and have various physiological effects. IL2 is a lymphokine that induces the proliferation of responsive T-cells. In addition, it acts on some B-cells, via receptor-specific binding [], as a growth factor and antibody production stimulant []. The protein is secreted as a single glycosylated polypeptide, and cleavage of a signal sequence is required for its activity []. Solution NMR suggests that the structure of IL2 comprises a bundle of 4 helices (termed A-D), flanked by 2 shorter helices and several poorly-defined loops. Residues in helix A, and in the loop region between helices A and B, are important for receptor binding. Secondary structure analysis has suggested similarity to IL4 and granulocyte-macrophage colony stimulating factor (GMCSF) [].; GO: 0005134 interleukin-2 receptor binding, 0008083 growth factor activity, 0006955 immune response, 0005576 extracellular region; PDB: 3QAZ_M 2B5I_A 1Z92_A 3QB1_F 2ERJ_H 1NBP_A 1PY2_C 1IRL_A 1QVN_A 3INK_C ....
Probab=34.20  E-value=41  Score=25.59  Aligned_cols=20  Identities=15%  Similarity=0.517  Sum_probs=16.5

Q ss_pred             CHHHHHHHHHHHHHHHHHHH
Q psy18141         33 NPSEAQVFMNEWAKYAVNLA   52 (113)
Q Consensus        33 np~~~~~Fl~eW~~Y~~~L~   52 (113)
                      ...-|++||..|=.+|..|-
T Consensus       125 et~tiveFln~WItFCQsii  144 (145)
T PF00715_consen  125 ETVTIVEFLNKWITFCQSII  144 (145)
T ss_dssp             SEBEHHHHHHHHHHHHHHHH
T ss_pred             ceehHHHHHHHHHHHHHHhh
Confidence            35568999999999998763


No 20 
>PRK12750 cpxP periplasmic repressor CpxP; Reviewed
Probab=33.55  E-value=49  Score=25.12  Aligned_cols=20  Identities=30%  Similarity=0.258  Sum_probs=16.7

Q ss_pred             HHhhcCHHHHHHHHHHHHHh
Q psy18141         74 ALNDLRDDQIAQLYELMLAA   93 (113)
Q Consensus        74 ~le~lsdEQi~QLyEL~~ea   93 (113)
                      .+.-|+|||..+|.+++++=
T Consensus       133 ~~~vLTpEQRak~~e~~~~r  152 (170)
T PRK12750        133 MLSILTPEQKAKFQELQQER  152 (170)
T ss_pred             HHHhCCHHHHHHHHHHHHHH
Confidence            37789999999999998764


No 21 
>PF14794 DUF4479:  Domain of unknown function (DUF4479); PDB: 3BU2_C.
Probab=33.15  E-value=40  Score=22.40  Aligned_cols=19  Identities=26%  Similarity=0.298  Sum_probs=12.2

Q ss_pred             HhhcCHHHHHHHHHHHHHh
Q psy18141         75 LNDLRDDQIAQLYELMLAA   93 (113)
Q Consensus        75 le~lsdEQi~QLyEL~~ea   93 (113)
                      .=.||+||+.+|.++-.++
T Consensus        45 ~V~Lt~eqv~~LN~~l~~~   63 (73)
T PF14794_consen   45 QVFLTEEQVAKLNQALQKA   63 (73)
T ss_dssp             -----HHHHHHHHHHHHHT
T ss_pred             CEEcCHHHHHHHHHHHHHc
Confidence            4479999999999987765


No 22 
>PRK12750 cpxP periplasmic repressor CpxP; Reviewed
Probab=32.07  E-value=44  Score=25.38  Aligned_cols=18  Identities=44%  Similarity=0.407  Sum_probs=15.2

Q ss_pred             hcCHHHHHHHHHHHHHhc
Q psy18141         77 DLRDDQIAQLYELMLAAT   94 (113)
Q Consensus        77 ~lsdEQi~QLyEL~~ea~   94 (113)
                      .|||+|..||..++++..
T Consensus        52 ~LTdeQk~qik~i~~~~r   69 (170)
T PRK12750         52 DLTDAQKEQLKEMREANR   69 (170)
T ss_pred             CCCHHHHHHHHHHHHHHH
Confidence            689999999999886654


No 23 
>TIGR00787 dctP tripartite ATP-independent periplasmic transporter solute receptor, DctP family. TRAP-T (Tripartite ATP-independent Periplasmic Transporter) family proteins generally consist of three components, and these systems have so far been found in Gram-negative bacteria, Gram-postive bacteria and archaea. The best characterized example is the DctPQM system of Rhodobacter capsulatus, a C4 dicarboxylate (malate, fumarate, succinate) transporter. This model represents the DctP family, one of at least three major families of extracytoplasmic solute receptor for TRAP family transporters. Other are the SnoM family (see pfam03480) and TAXI (TRAP-associated extracytoplasmic immunogenic) family.
Probab=31.58  E-value=56  Score=25.14  Aligned_cols=32  Identities=16%  Similarity=0.024  Sum_probs=26.2

Q ss_pred             CCCCHHHHhhcCHHHHHHHHHHHHHhcCCCcc
Q psy18141         68 QSLDEQALNDLRDDQIAQLYELMLAATQPEVL   99 (113)
Q Consensus        68 ~~L~~~~le~lsdEQi~QLyEL~~ea~k~~~~   99 (113)
                      --++.+..+.||+||...|.+...++.....+
T Consensus       209 ~~~n~~~~~~L~~e~q~~i~~a~~~~~~~~~~  240 (257)
T TIGR00787       209 VVVNKAFWKSLPPDLQAVVKEAAKEAGEYQRK  240 (257)
T ss_pred             EEEeHHHHhcCCHHHHHHHHHHHHHHHHHHHH
Confidence            45678999999999999999998888654444


No 24 
>PF11647 PMT_C:  C-terminal region of Pasteurella multocida toxin residues 569-1285;  InterPro: IPR020972  This entry represents the C-terminal domain of Pasteurella multocida toxin (PMT) which displays a Trojan horse-like shape with three domains, C1, C2 and C3. The C3 domain possesses the Cys-His-Asp catalytic triad. PMT is an enzyme toxin carrying the cysteine protease-like catalytic triad which functions on the cytoplasmic face of the plasma membrane of target cells [].  This entry is also found in the Vibrio cholerae RTX toxin [], a bacterial toxin that self-process by a cysteine peptidase mechanism. These cysteine peptidases belong to MEROPS peptidase family C80 (RTX self-cleaving toxin, clan CD). ; PDB: 4ERR_A 2EC5_B 2EBH_X 2EBF_X.
Probab=31.15  E-value=67  Score=21.14  Aligned_cols=16  Identities=25%  Similarity=0.376  Sum_probs=14.0

Q ss_pred             CHHHHHHHHHHHHHhc
Q psy18141         79 RDDQIAQLYELMLAAT   94 (113)
Q Consensus        79 sdEQi~QLyEL~~ea~   94 (113)
                      ..+++..||+|++...
T Consensus        37 ~~~~V~~l~~L~~~ie   52 (66)
T PF11647_consen   37 GREGVETLYELRKQIE   52 (66)
T ss_dssp             CHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            5999999999998764


No 25 
>smart00189 IL2 Interleukin-2 family. Interleukin-2 is a cytokine produced by T-helper cells in response to antigenic or mitogenic stimulation. This protein is required for T-cell proliferation and other activities crucial to the regulation of the immune response.
Probab=30.84  E-value=48  Score=25.31  Aligned_cols=19  Identities=16%  Similarity=0.511  Sum_probs=16.2

Q ss_pred             HHHHHHHHHHHHHHHHHHH
Q psy18141         34 PSEAQVFMNEWAKYAVNLA   52 (113)
Q Consensus        34 p~~~~~Fl~eW~~Y~~~L~   52 (113)
                      ..-|+.||+.|-.+|..|-
T Consensus       132 t~tivEFLn~WItfCQsi~  150 (154)
T smart00189      132 SVTIVEFLNRWIAFCQSII  150 (154)
T ss_pred             eehHHHHHHHHHHHHHHHH
Confidence            3468999999999999875


No 26 
>COG0334 GdhA Glutamate dehydrogenase/leucine dehydrogenase [Amino acid transport and metabolism]
Probab=28.94  E-value=1.3e+02  Score=26.46  Aligned_cols=26  Identities=23%  Similarity=0.273  Sum_probs=21.8

Q ss_pred             HhhhcCCCHHHHHHHHHHHHHHHHHHH
Q psy18141         26 FRRHKGCNPSEAQVFMNEWAKYAVNLA   52 (113)
Q Consensus        26 FRrHk~~np~~~~~Fl~eW~~Y~~~L~   52 (113)
                      .|-|-.+|.+++. ||+-|..|-..|.
T Consensus        69 iRfhP~v~~~ev~-~Ls~~MT~Knal~   94 (411)
T COG0334          69 VRFHPYVTLEEVK-ALSFWMTLKNALA   94 (411)
T ss_pred             eecCCCCCHHHHH-HHHHHHHHHHHHh
Confidence            3668888888777 9999999999886


No 27 
>PRK12751 cpxP periplasmic stress adaptor protein CpxP; Reviewed
Probab=27.97  E-value=68  Score=24.36  Aligned_cols=25  Identities=20%  Similarity=0.118  Sum_probs=19.2

Q ss_pred             CCCHHHHhhcCHHHHHHHHHHHHHh
Q psy18141         69 SLDEQALNDLRDDQIAQLYELMLAA   93 (113)
Q Consensus        69 ~L~~~~le~lsdEQi~QLyEL~~ea   93 (113)
                      +.-.+.++-|+|||..+|.+.+++-
T Consensus       121 ~~~~qmy~lLTPEQra~l~~~~e~r  145 (162)
T PRK12751        121 KVRNQMYNLLTPEQKEALNKKHQER  145 (162)
T ss_pred             HHHHHHHHcCCHHHHHHHHHHHHHH
Confidence            3345568889999999999888764


No 28 
>KOG2712|consensus
Probab=27.75  E-value=45  Score=24.26  Aligned_cols=19  Identities=21%  Similarity=0.186  Sum_probs=16.3

Q ss_pred             hcCHHHHHHHHHHHHHhcC
Q psy18141         77 DLRDDQIAQLYELMLAATQ   95 (113)
Q Consensus        77 ~lsdEQi~QLyEL~~ea~k   95 (113)
                      .||.+|..+|.+++.++.+
T Consensus        82 SLs~~qW~~Lk~~~~eId~  100 (108)
T KOG2712|consen   82 SLSLEQWSKLKEHIEEIDK  100 (108)
T ss_pred             ccCHHHHHHHHHHHHHHHH
Confidence            5788999999999998764


No 29 
>PF08158 NUC130_3NT:  NUC130/3NT domain;  InterPro: IPR012977 This N-terminal domain is found in a novel nucleolar protein family defined by NUC130/133 [].
Probab=27.62  E-value=35  Score=21.40  Aligned_cols=15  Identities=27%  Similarity=0.578  Sum_probs=11.7

Q ss_pred             hhhcchh-cCChHHHH
Q psy18141          2 EMATLHR-GLPPELQI   16 (113)
Q Consensus         2 rILRlHR-~LP~~mR~   16 (113)
                      .||+-|- .|+|+||.
T Consensus        22 ~lL~~~~~~L~p~lR~   37 (52)
T PF08158_consen   22 DLLRNHHTVLDPDLRM   37 (52)
T ss_pred             HHHHhccccCCHHHHH
Confidence            4677776 69999985


No 30 
>PRK07080 hypothetical protein; Validated
Probab=27.40  E-value=65  Score=27.30  Aligned_cols=33  Identities=24%  Similarity=0.518  Sum_probs=29.4

Q ss_pred             HHHhhhcCC---CHHHHHHHHHHHHHHHHHHHHHhc
Q psy18141         24 DEFRRHKGC---NPSEAQVFMNEWAKYAVNLAEQLG   56 (113)
Q Consensus        24 ~EFRrHk~~---np~~~~~Fl~eW~~Y~~~L~~Q~~   56 (113)
                      .|||++-=+   ||+++..|...|-.|...|..-++
T Consensus       174 ~~F~mrE~V~iGt~e~v~~~r~~w~e~~~~l~~~Lg  209 (317)
T PRK07080        174 QLFRMREYVRIGTPEQIVAFRQSWIERGTAMADALG  209 (317)
T ss_pred             hheeeeEEEEecCHHHHHHHHHHHHHHHHHHHHHhC
Confidence            589998865   999999999999999999987775


No 31 
>PF00046 Homeobox:  Homeobox domain not present here.;  InterPro: IPR001356 The homeobox domain was first identified in a number of drosophila homeotic and segmentation proteins, but is now known to be well-conserved in many other animals, including vertebrates [, , ]. Hox genes encode homeodomain-containing transcriptional regulators that operate differential genetic programs along the anterior-posterior axis of animal bodies []. The domain binds DNA through a helix-turn-helix (HTH) structure. The HTH motif is characterised by two alpha-helices, which make intimate contacts with the DNA and are joined by a short turn. The second helix binds to DNA via a number of hydrogen bonds and hydrophobic interactions, which occur between specific side chains and the exposed bases and thymine methyl groups within the major groove of the DNA []. The first helix helps to stabilise the structure. The motif is very similar in sequence and structure in a wide range of DNA-binding proteins (e.g., cro and repressor proteins, homeotic proteins, etc.). One of the principal differences between HTH motifs in these different proteins arises from the stereo-chemical requirement for glycine in the turn which is needed to avoid steric interference of the beta-carbon with the main chain: for cro and repressor proteins the glycine appears to be mandatory, while for many of the homeotic and other DNA-binding proteins the requirement is relaxed.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0043565 sequence-specific DNA binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2DA3_A 1LFB_A 2LFB_A 2ECB_A 2DA5_A 3D1N_O 3A03_A 2XSD_C 3CMY_A 1AHD_P ....
Probab=26.73  E-value=1.1e+02  Score=17.98  Aligned_cols=25  Identities=20%  Similarity=0.236  Sum_probs=18.8

Q ss_pred             hcCHHHHHHHHHHHHHhcCCCcchH
Q psy18141         77 DLRDDQIAQLYELMLAATQPEVLEA  101 (113)
Q Consensus        77 ~lsdEQi~QLyEL~~ea~k~~~~~~  101 (113)
                      .+|++|+..|.+.......|.....
T Consensus         6 ~~t~~q~~~L~~~f~~~~~p~~~~~   30 (57)
T PF00046_consen    6 RFTKEQLKVLEEYFQENPYPSKEER   30 (57)
T ss_dssp             SSSHHHHHHHHHHHHHSSSCHHHHH
T ss_pred             CCCHHHHHHHHHHHHHhcccccccc
Confidence            5899999999999998555544433


No 32 
>smart00496 IENR2 Intron-encoded nuclease repeat 2. Short helical motif of unknown function (unpublished results).
Probab=26.68  E-value=35  Score=18.87  Aligned_cols=18  Identities=11%  Similarity=0.250  Sum_probs=14.3

Q ss_pred             ccCCCCHHHHhhcCHHHH
Q psy18141         66 LGQSLDEQALNDLRDDQI   83 (113)
Q Consensus        66 ~G~~L~~~~le~lsdEQi   83 (113)
                      +|+..|++...+||+-..
T Consensus         1 ~GkkhSEETK~KMSea~~   18 (26)
T smart00496        1 KGKKHSEETKKKMSEAAX   18 (26)
T ss_pred             CCCCCCHHHHHHHHHhhc
Confidence            478899999999888643


No 33 
>KOG4049|consensus
Probab=26.63  E-value=53  Score=26.92  Aligned_cols=33  Identities=36%  Similarity=0.467  Sum_probs=27.7

Q ss_pred             hhhHHHHHHHhhhcCCCHHHHHHHHHHHHHHHHHHH
Q psy18141         17 LGHQYVRDEFRRHKGCNPSEAQVFMNEWAKYAVNLA   52 (113)
Q Consensus        17 LGD~YVK~EFRrHk~~np~~~~~Fl~eW~~Y~~~L~   52 (113)
                      -||.=++.||   .|.++.++..|..||+.=+.-+.
T Consensus       172 tGd~eerQef---iNLdEseA~qfd~ew~se~s~~~  204 (248)
T KOG4049|consen  172 TGDVEERQEF---INLDESEAPQFDTEWASEASAIM  204 (248)
T ss_pred             cccHHHHHHh---hcccccchhhHhHHHHHHHHhcc
Confidence            4899999998   47788899999999998777654


No 34 
>PF11304 DUF3106:  Protein of unknown function (DUF3106);  InterPro: IPR021455  Some members in this family of proteins are annotated as transmembrane proteins however this cannot be confirmed. Currently no function is known. 
Probab=26.24  E-value=1.6e+02  Score=20.63  Aligned_cols=35  Identities=29%  Similarity=0.624  Sum_probs=21.9

Q ss_pred             cCChHHHHhhhHHHHHHHhhhcCCCHHHHHHHHHHHHHH
Q psy18141          9 GLPPELQILGHQYVRDEFRRHKGCNPSEAQVFMNEWAKY   47 (113)
Q Consensus         9 ~LP~~mR~LGD~YVK~EFRrHk~~np~~~~~Fl~eW~~Y   47 (113)
                      .|||+.|.    -|+..|+.=+..+|++=......|..|
T Consensus        58 ~LspeqR~----~~R~~~~~~~~Lpp~qR~~lr~~w~~y   92 (107)
T PF11304_consen   58 ALSPEQRQ----QARENYQRFKQLPPEQRQALRARWEAY   92 (107)
T ss_pred             hCCHHHHH----HHHHHHHHHHcCCHHHHHHHHHHHHHH
Confidence            36666654    356666666666666666666666666


No 35 
>PHA02902 putative IMV membrane protein; Provisional
Probab=25.10  E-value=58  Score=21.97  Aligned_cols=27  Identities=30%  Similarity=0.546  Sum_probs=19.6

Q ss_pred             CcccCCCCHHH--HhhcCHHHHHHHHHHH
Q psy18141         64 KPLGQSLDEQA--LNDLRDDQIAQLYELM   90 (113)
Q Consensus        64 ~~~G~~L~~~~--le~lsdEQi~QLyEL~   90 (113)
                      ..+|.-|..+-  -++|+|+|+.-|..|-
T Consensus        38 ~~~~~~l~~d~~F~D~lTpDQirAlHrlv   66 (70)
T PHA02902         38 ERFGDTLEDDPLFKDSLTPDQIKALHRLV   66 (70)
T ss_pred             ccccccCCCCchhhccCCHHHHHHHHHHH
Confidence            45666664333  5799999999998874


No 36 
>PF12616 DUF3775:  Protein of unknown function (DUF3775);  InterPro: IPR022254  This domain family is found in bacteria, and is approximately 80 amino acids in length. There is a single completely conserved residue G that may be functionally important. 
Probab=24.57  E-value=74  Score=21.36  Aligned_cols=22  Identities=32%  Similarity=0.466  Sum_probs=18.2

Q ss_pred             HHhhcCHHHHHHHHHHHHHhcC
Q psy18141         74 ALNDLRDDQIAQLYELMLAATQ   95 (113)
Q Consensus        74 ~le~lsdEQi~QLyEL~~ea~k   95 (113)
                      -++.||++|...|.-||==-..
T Consensus         9 ~I~~l~~deqaeLvALmwiGRG   30 (75)
T PF12616_consen    9 FIEDLNEDEQAELVALMWIGRG   30 (75)
T ss_pred             HHHhCCHHHHHHHHHHHHhcCC
Confidence            4889999999999999965443


No 37 
>PF10655 DUF2482:  Hypothetical protein of unknown function (DUF2482);  InterPro: IPR018917 This entry is represented by Bacteriophage 80, Orf10. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.  All the members of this very small, very short family are derived from bacteriophages, of the SA bacteriophages 11, Mu50B, system, and from the Staphylococcal_phi-Mu50B-like_prophages subsystem. All members are hypothetical proteins. 
Probab=24.13  E-value=61  Score=23.25  Aligned_cols=29  Identities=21%  Similarity=0.439  Sum_probs=24.6

Q ss_pred             CCCCHHHHhhcCHHHHHHHHHHHHHhcCC
Q psy18141         68 QSLDEQALNDLRDDQIAQLYELMLAATQP   96 (113)
Q Consensus        68 ~~L~~~~le~lsdEQi~QLyEL~~ea~k~   96 (113)
                      ++|+.+++..|=.|--+-||+|-.|+.+.
T Consensus         6 KdMTqeelr~llseK~~ELydL~~eI~kE   34 (100)
T PF10655_consen    6 KDMTQEELRDLLSEKNGELYDLANEIDKE   34 (100)
T ss_pred             hhhhHHHHHHHHHHhhHHHHHHHHHhccc
Confidence            57888888888788889999999999875


No 38 
>KOG1070|consensus
Probab=23.77  E-value=97  Score=31.67  Aligned_cols=41  Identities=24%  Similarity=0.475  Sum_probs=35.3

Q ss_pred             CChHHHHhhhHHHHHHHhh--hcCCCHHHHHHHHHHHHHHHHH
Q psy18141         10 LPPELQILGHQYVRDEFRR--HKGCNPSEAQVFMNEWAKYAVN   50 (113)
Q Consensus        10 LP~~mR~LGD~YVK~EFRr--Hk~~np~~~~~Fl~eW~~Y~~~   50 (113)
                      +.-+++..++.||++=|+|  |++..|..+.-|+.-|=.|-..
T Consensus      1641 id~eik~~~~~~vR~lfeRvi~l~l~~kkmKfffKkwLeyEk~ 1683 (1710)
T KOG1070|consen 1641 IDMEIKHGDIKYVRDLFERVIELKLSIKKMKFFFKKWLEYEKS 1683 (1710)
T ss_pred             HHHHHccCCHHHHHHHHHHHHhcCCChhHhHHHHHHHHHHHHh
Confidence            3457888999999999999  5566999999999999999764


No 39 
>PF06937 EURL:  EURL protein;  InterPro: IPR009704 This family consists of several animal EURL proteins. EURL is preferentially expressed in chick retinal precursor cells as well as in the anterior epithelial cells of the lens at early stages of development. EURL transcripts are found primarily in the peripheral dorsal retina, i.e., the most undifferentiated part of the dorsal retina. EURL transcripts are also detected in the lens at stage 18 and remain abundant in the proliferating epithelial cells of the lens until at least day 11. The distribution pattern of EURL in the developing retina and lens suggest a role before the events leading to cell determination and differentiation [].
Probab=23.69  E-value=88  Score=26.26  Aligned_cols=41  Identities=22%  Similarity=0.233  Sum_probs=33.0

Q ss_pred             CCCCHHHHhhcCHHHHHHHHH-HHHHhcCCCcchHHHHHHHH
Q psy18141         68 QSLDEQALNDLRDDQIAQLYE-LMLAATQPEVLEAEEAQAKA  108 (113)
Q Consensus        68 ~~L~~~~le~lsdEQi~QLyE-L~~ea~k~~~~~~~~~~~~~  108 (113)
                      ...|.++|+.|+-+++.||.+ |.+.++..+.+..-..|.|-
T Consensus       209 ~~~SrEeL~~Mt~~EL~qL~~~L~~qIq~vfeeLt~~vQEKD  250 (285)
T PF06937_consen  209 PHYSREELNSMTLDELKQLNEKLLQQIQDVFEELTQQVQEKD  250 (285)
T ss_pred             cccCHHHhhhCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            467889999999999999975 78888888777776666553


No 40 
>cd08816 CARD_RIG-I_1 Caspase activation and recruitment domain found in RIG-I, first repeat. Caspase activation and recruitment domain (CARD) found in RIG-I (Retinoic acid Inducible Gene I, also known as Ddx58), first repeat. RIG-I is a cytoplasmic RNA helicase that plays an important role in host antiviral response by sensing incoming viral RNA. RIG-I contains two N-terminal CARD domains and a C-terminal RNA helicase. Upon activation, the signal is transferred to downstream pathways via the adaptor molecule IPS-1 (MAVS, VISA, CARDIF), leading to the induction of type I interferons. Although very similar in sequence, RIG-I recognizes different sets of viruses compared to MDA5, a related RNA helicase. RIG-I associates with IPS-1 through a CARD-CARD interaction. In general, CARDs are death domains (DDs) found associated with caspases. They are known to be important in the signaling pathways for apoptosis, inflammation, and host-defense mechanisms. DDs are protein-protein interaction doma
Probab=23.27  E-value=56  Score=23.03  Aligned_cols=15  Identities=33%  Similarity=0.454  Sum_probs=13.2

Q ss_pred             CCCHHHHHHHHHHHH
Q psy18141         31 GCNPSEAQVFMNEWA   45 (113)
Q Consensus        31 ~~np~~~~~Fl~eW~   45 (113)
                      ..+|..|.+||+.|=
T Consensus        18 tl~P~yIl~~m~~~~   32 (89)
T cd08816          18 ILRPSYILGFMTTWL   32 (89)
T ss_pred             hhchHHHHHHHHHhc
Confidence            459999999999994


No 41 
>PRK00306 50S ribosomal protein L29; Reviewed
Probab=22.61  E-value=1.4e+02  Score=19.00  Aligned_cols=19  Identities=26%  Similarity=0.461  Sum_probs=8.6

Q ss_pred             HHHhhcCHHHH-HHHHHHHH
Q psy18141         73 QALNDLRDDQI-AQLYELML   91 (113)
Q Consensus        73 ~~le~lsdEQi-~QLyEL~~   91 (113)
                      .++..||++++ ..|.+|.+
T Consensus         4 ~elr~ls~~eL~~~l~~lkk   23 (66)
T PRK00306          4 KELRELSVEELNEKLLELKK   23 (66)
T ss_pred             HHHhhCCHHHHHHHHHHHHH
Confidence            34555555554 33344433


No 42 
>PF14842 FliG_N:  FliG N-terminal domain; PDB: 3HJL_A 3AJC_A 3USY_B.
Probab=22.44  E-value=59  Score=22.46  Aligned_cols=40  Identities=23%  Similarity=0.322  Sum_probs=26.3

Q ss_pred             cccCCCCHHHHhhcCHHHHHHHHHHHHHhcCCCcchHHHH
Q psy18141         65 PLGQSLDEQALNDLRDDQIAQLYELMLAATQPEVLEAEEA  104 (113)
Q Consensus        65 ~~G~~L~~~~le~lsdEQi~QLyEL~~ea~k~~~~~~~~~  104 (113)
                      .+|.+-..+.+..|+++++.+|-.-|-....+..+..+..
T Consensus        17 ~Lgee~Aa~vlk~l~~~ei~~i~~~ma~l~~v~~~~~~~V   56 (108)
T PF14842_consen   17 ALGEEAAAEVLKHLDEEEIERISREMAKLGSVSPEEVEEV   56 (108)
T ss_dssp             HS-HHHHHHHHHHS-HHHHHHHHHHHHT-----HHHHHHH
T ss_pred             HHCHHHHHHHHccCCHHHHHHHHHHHHccCCCCHHHHHHH
Confidence            3677778888999999999999888888777776665554


No 43 
>PF08863 YolD:  YolD-like protein;  InterPro: IPR014962 These proteins are functionally uncharacterised. However it has been predicted that these proteins are functionally equivalent to the UmuD subunit of polymerase V from Gram-negative bacteria []. 
Probab=22.40  E-value=72  Score=20.57  Aligned_cols=20  Identities=20%  Similarity=0.180  Sum_probs=10.1

Q ss_pred             HHHHHHHHHHHHHhcCCCcc
Q psy18141         80 DDQIAQLYELMLAATQPEVL   99 (113)
Q Consensus        80 dEQi~QLyEL~~ea~k~~~~   99 (113)
                      |||...|.+++.+..+....
T Consensus         2 PEH~e~L~~~~~e~~k~~kp   21 (92)
T PF08863_consen    2 PEHKEALRELIKEQNKVEKP   21 (92)
T ss_pred             ChHHHHHHHHHHHhcccCCC
Confidence            34555555555555544433


No 44 
>PF08035 Op_neuropeptide:  Opioids neuropeptide;  InterPro: IPR013532 Pro-opiomelanocortin is present in high levels in the pituitary and is processed into 3 major peptide families: adrenocorticotrophin (ACTH); alpha-, beta- and gamma-melanocyte- stimulating hormones (MSH); and beta-endorphin []. ACTH regulates the synthesis and release of glucocorticoids and, to some extent, aldosterone in the adrenal cortex. It is synthesised and released in response to corticotrophin-releasing factor at times of stress (i.e. heat, cold, infection, etc.), its release leading to increased metabolism. The action of MSH in man is poorly understood, but it may be involved in temperature regulation []. Full activity of ACTH resides in the first 20 N-terminal amino acids, the first 13 of which are identical to alpha-MSH [, ]. This region corresponds to the conserved YGG motif that is found in a wide variety of opioid neuropeptides such as enkephalin
Probab=22.35  E-value=12  Score=21.61  Aligned_cols=10  Identities=30%  Similarity=0.780  Sum_probs=8.0

Q ss_pred             HHHHHHHHHH
Q psy18141         39 VFMNEWAKYA   48 (113)
Q Consensus        39 ~Fl~eW~~Y~   48 (113)
                      +||++|+.+.
T Consensus         3 GFM~s~~e~s   12 (31)
T PF08035_consen    3 GFMKSWDERS   12 (31)
T ss_pred             cccccccccc
Confidence            7999998764


No 45 
>PF12651 RHH_3:  Ribbon-helix-helix domain
Probab=22.28  E-value=1.4e+02  Score=17.72  Aligned_cols=29  Identities=24%  Similarity=0.186  Sum_probs=22.9

Q ss_pred             hcCHHHHHHHHHHHHHhcCCCcchHHHHH
Q psy18141         77 DLRDDQIAQLYELMLAATQPEVLEAEEAQ  105 (113)
Q Consensus        77 ~lsdEQi~QLyEL~~ea~k~~~~~~~~~~  105 (113)
                      .+++|...+|.+|-.++.-+....-++|-
T Consensus         8 ~l~~el~~~L~~ls~~t~i~~S~Ll~eAl   36 (44)
T PF12651_consen    8 SLDKELYEKLKELSEETGIPKSKLLREAL   36 (44)
T ss_pred             ecCHHHHHHHHHHHHHHCCCHHHHHHHHH
Confidence            46778889999999999887777766664


No 46 
>PF13024 DUF3884:  Protein of unknown function (DUF3884)
Probab=22.19  E-value=44  Score=22.74  Aligned_cols=11  Identities=64%  Similarity=1.047  Sum_probs=10.0

Q ss_pred             hHHHHhhhHHH
Q psy18141         12 PELQILGHQYV   22 (113)
Q Consensus        12 ~~mR~LGD~YV   22 (113)
                      |+|.+||+=||
T Consensus        22 ~~lk~LG~W~~   32 (77)
T PF13024_consen   22 PELKSLGKWYV   32 (77)
T ss_pred             HHHHHhhceee
Confidence            69999999888


No 47 
>PF06853 DUF1249:  Protein of unknown function (DUF1249);  InterPro: IPR009659 This family consists of several hypothetical bacterial proteins of around 150 residues in length. The function of this family is unknown.
Probab=22.15  E-value=77  Score=22.78  Aligned_cols=15  Identities=27%  Similarity=0.773  Sum_probs=13.1

Q ss_pred             HHHHHHHHHHHHHHH
Q psy18141         35 SEAQVFMNEWAKYAV   49 (113)
Q Consensus        35 ~~~~~Fl~eW~~Y~~   49 (113)
                      -++-.||.+|=.||-
T Consensus        98 ~q~N~FL~eWL~~CL  112 (120)
T PF06853_consen   98 WQLNRFLAEWLRYCL  112 (120)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            367899999999996


No 48 
>smart00309 PAH Pancreatic hormones / neuropeptide F / peptide YY family. Pancreatic hormone is a regulator of pancreatic and gastrointestinal functions.
Probab=21.91  E-value=1.4e+02  Score=17.70  Aligned_cols=22  Identities=9%  Similarity=0.357  Sum_probs=18.3

Q ss_pred             CCCHHHHHHHHHHHHHHHHHHH
Q psy18141         31 GCNPSEAQVFMNEWAKYAVNLA   52 (113)
Q Consensus        31 ~~np~~~~~Fl~eW~~Y~~~L~   52 (113)
                      +++|+++..++++-+.|...+.
T Consensus        11 ~a~~e~l~~Y~~~L~~Yinlit   32 (36)
T smart00309       11 DASPEDLRQYLAALREYINLIT   32 (36)
T ss_pred             CCCHHHHHHHHHHHHHHHHHhc
Confidence            3588999999999999988774


No 49 
>PF06207 DUF1002:  Protein of unknown function (DUF1002);  InterPro: IPR009343 This protein family has no known function. Its members are about 300 amino acids in length. It has so far been detected in Firmicute bacteria and some archaebacteria.
Probab=21.74  E-value=1.1e+02  Score=24.51  Aligned_cols=23  Identities=30%  Similarity=0.322  Sum_probs=20.5

Q ss_pred             cCHHHHHHHHHHHHHhcCCCcch
Q psy18141         78 LRDDQIAQLYELMLAATQPEVLE  100 (113)
Q Consensus        78 lsdEQi~QLyEL~~ea~k~~~~~  100 (113)
                      ||++|+.+|..||....+..-+.
T Consensus       192 ls~~q~~~i~~l~~~~~~~~~~~  214 (225)
T PF06207_consen  192 LSDEQIQQIVNLMKKIQNLNIDW  214 (225)
T ss_pred             CCHHHHHHHHHHHHHHHcCCCCH
Confidence            89999999999999988876665


No 50 
>PF12767 SAGA-Tad1:  Transcriptional regulator of RNA polII, SAGA, subunit;  InterPro: IPR024738 The yeast Spt-Ada-Gcn5-Acetyl (SAGA) transferase complex is a multifunctional coactivator involved in multiple cellular processes [], including regulation of transcription by RNA polymerase II [, ]. It is formed of five major modular subunits and shows a high degree of structural conservation to human TFTC and STAGA []. This entry represents Ada1 (known as Tada1 in higher eukaryotes), one of the subunits that constitute the SAGA core. It also functions as a component of the SALSA and SLIK complexes. ; GO: 0070461 SAGA-type complex
Probab=21.73  E-value=64  Score=25.39  Aligned_cols=31  Identities=23%  Similarity=0.533  Sum_probs=23.5

Q ss_pred             HHHHHHHHHHHHHhcccCCCCCCcccCCCCHHHHhh-----cCHHHH
Q psy18141         42 NEWAKYAVNLAEQLGIRGPKFAKPLGQSLDEQALND-----LRDDQI   83 (113)
Q Consensus        42 ~eW~~Y~~~L~~Q~~~~g~~~~~~~G~~L~~~~le~-----lsdEQi   83 (113)
                      ..|..|...|..           |+..+||.+++++     |..|++
T Consensus        25 ~~~~~Y~~~l~~-----------fl~~klsk~Efd~~~~~~L~~~~~   60 (252)
T PF12767_consen   25 DRWKKYFQSLKR-----------FLSGKLSKEEFDKECRRILGRENV   60 (252)
T ss_pred             HHHHHHHHHHHH-----------HHHhccCHHHHHHHHHHHhChhHH
Confidence            679999999974           6777888887664     556664


No 51 
>TIGR03759 conj_TIGR03759 integrating conjugative element protein, PFL_4693 family. Members of this protein family, such as model protein PFL_4693 from Pseudomonas fluorescens Pf-5, belong to extended genomic regions that appear to be spread by conjugative transfer. Most members have a predicted N-terminal signal sequence. The function is unknown.
Probab=21.07  E-value=1.8e+02  Score=23.15  Aligned_cols=43  Identities=23%  Similarity=0.462  Sum_probs=32.4

Q ss_pred             HHHHHHHHHHHHHHhcccCCCCCCcccCCCCHHH---HhhcCHHHHHHHHHHHH
Q psy18141         41 MNEWAKYAVNLAEQLGIRGPKFAKPLGQSLDEQA---LNDLRDDQIAQLYELML   91 (113)
Q Consensus        41 l~eW~~Y~~~L~~Q~~~~g~~~~~~~G~~L~~~~---le~lsdEQi~QLyEL~~   91 (113)
                      -.||+.|...+.      |+  .+.....|||=.   ++.=||+......||.-
T Consensus        16 ~~Ew~RY~~Lmq------G~--rG~~SPgLdPLtaLGIeArsd~ERrryAEl~v   61 (200)
T TIGR03759        16 EDEWQRYQQLMQ------GP--RGVYSPGLDPLTALGIEARSDEERRRYAELWV   61 (200)
T ss_pred             HHHHHHHHHHhc------CC--ccCcCCCCChhhhhccccCCHHHHHHHHHHHH
Confidence            369999999875      32  245556687766   78889999999999864


Done!