Query         psy7032
Match_columns 136
No_of_seqs    116 out of 200
Neff          4.9 
Searched_HMMs 46136
Date          Sat Aug 17 01:01:37 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy7032.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/7032hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 TIGR00870 trp transient-recept  99.9 2.2E-27 4.8E-32  213.6   6.2  112    5-116   621-743 (743)
  2 KOG3609|consensus               99.9 3.5E-27 7.5E-32  216.4   7.3  125    5-129   617-753 (822)
  3 KOG3614|consensus               93.2    0.45 9.7E-06   47.6   8.4   38    5-43   1048-1085(1381)
  4 COG5466 Predicted small metal-  76.8     5.4 0.00012   26.6   4.0   34   75-112    25-58  (59)
  5 PF13624 SurA_N_3:  SurA N-term  75.8      14  0.0003   26.9   6.4   59   72-130    74-139 (154)
  6 KOG4807|consensus               73.6     3.8 8.3E-05   37.1   3.4   29   98-126   516-544 (593)
  7 PRK00059 prsA peptidylprolyl i  71.1      26 0.00056   29.1   7.7   56   73-128    85-148 (336)
  8 PF14394 DUF4423:  Domain of un  62.0      11 0.00024   29.3   3.5   29   96-124   127-155 (171)
  9 PF09388 SpoOE-like:  Spo0E lik  61.4      13 0.00028   22.6   3.1   27  105-131     1-27  (45)
 10 COG4224 Uncharacterized protei  60.8      25 0.00053   24.6   4.7   41   78-123     2-42  (77)
 11 PRK06835 DNA replication prote  55.0      69  0.0015   27.3   7.5   57   73-129    37-93  (329)
 12 PF08880 QLQ:  QLQ;  InterPro:   52.7      20 0.00044   21.5   2.8   19   97-115     2-20  (37)
 13 PF10845 DUF2576:  Protein of u  52.7      24 0.00051   22.6   3.2   28  101-128    11-40  (48)
 14 PF03210 Paramyx_P_V_C:  Paramy  51.9      11 0.00023   29.1   1.9   39   72-110   111-154 (155)
 15 PF03564 DUF1759:  Protein of u  49.4      96  0.0021   22.3   6.9   59   70-129    52-117 (145)
 16 PF10146 zf-C4H2:  Zinc finger-  47.4      41 0.00088   27.6   4.7   13   73-85     36-48  (230)
 17 PF09312 SurA_N:  SurA N-termin  47.2      71  0.0015   22.8   5.5   55   74-128    43-101 (118)
 18 PF08105 Antimicrobial10:  Metc  46.4     9.7 0.00021   24.7   0.8    9   26-34     35-43  (52)
 19 cd08807 CARD_CARD10_CARMA3 Cas  45.7      25 0.00055   25.1   2.9   32   95-126    28-64  (86)
 20 cd08806 CARD_CARD14_CARMA2 Cas  45.2      29 0.00063   24.8   3.1   32   95-126    28-64  (86)
 21 TIGR03720 exospor_lead exospor  43.9      13 0.00027   21.0   0.9   11   25-35      7-19  (26)
 22 PF15344 FAM217:  FAM217 family  43.2      11 0.00024   31.3   0.8   10   26-35     50-59  (233)
 23 PF09366 DUF1997:  Protein of u  42.7      35 0.00076   25.7   3.5   14   25-38    110-123 (158)
 24 PF10073 DUF2312:  Uncharacteri  42.5      92   0.002   21.6   5.2   40   76-130     4-43  (74)
 25 PF11176 DUF2962:  Protein of u  42.4      41 0.00089   25.8   3.9   14   75-88     68-81  (155)
 26 KOG3609|consensus               41.6       5 0.00011   38.6  -1.5   43   70-112   734-776 (822)
 27 TIGR02147 Fsuc_second hypothet  40.4      38 0.00081   28.4   3.6   29   96-124   225-253 (271)
 28 TIGR01219 Pmev_kin_ERG8 phosph  40.2 2.4E+02  0.0051   25.5   8.8   28  101-128   352-379 (454)
 29 PF08776 VASP_tetra:  VASP tetr  39.0      58  0.0013   20.1   3.4   21  102-122    12-32  (40)
 30 PF13789 DUF4181:  Domain of un  38.8     3.6 7.9E-05   29.6  -2.3   14    2-15     72-85  (110)
 31 cd08809 CARD_CARD9 Caspase act  37.5      45 0.00098   23.8   3.1   33   95-127    28-65  (86)
 32 PRK10788 periplasmic folding c  33.0      78  0.0017   28.8   4.7   55   74-128    88-149 (623)
 33 PF13801 Metal_resist:  Heavy-m  31.4      94   0.002   20.8   3.9   28   95-122    81-108 (125)
 34 PRK10770 peptidyl-prolyl cis-t  30.7 2.4E+02  0.0052   24.0   7.1   55   73-127    48-106 (413)
 35 PF05121 GvpK:  Gas vesicle pro  29.8 2.1E+02  0.0046   20.4   6.2   53   76-131    23-75  (88)
 36 PRK01631 hypothetical protein;  29.7 1.6E+02  0.0034   20.6   4.7   40   78-122     1-40  (76)
 37 TIGR00496 frr ribosome recycli  29.4 2.8E+02   0.006   21.7   7.3   48   72-119   113-160 (176)
 38 PRK07857 hypothetical protein;  29.2      86  0.0019   23.0   3.5   30   95-124    22-51  (106)
 39 PF05291 Bystin:  Bystin;  Inte  28.1      60  0.0013   28.0   2.9   34   11-45     50-83  (301)
 40 PRK10132 hypothetical protein;  27.9      87  0.0019   22.8   3.4   26  100-125    11-36  (108)
 41 PF08172 CASP_C:  CASP C termin  27.6      74  0.0016   26.3   3.3   25  100-124    99-123 (248)
 42 PF04728 LPP:  Lipoprotein leuc  27.5 1.1E+02  0.0024   20.1   3.5   21  101-121    17-37  (56)
 43 PRK13723 conjugal transfer pil  27.4 2.6E+02  0.0055   25.4   6.9   51   73-123   358-413 (451)
 44 PF09860 DUF2087:  Uncharacteri  26.4 2.1E+02  0.0045   19.3   5.1   25   95-119    27-55  (71)
 45 PF07889 DUF1664:  Protein of u  26.0      68  0.0015   24.2   2.6   24   98-121    86-109 (126)
 46 TIGR02698 CopY_TcrY copper tra  26.0 2.6E+02  0.0057   20.4   5.7   40   72-111    71-115 (130)
 47 PF09851 SHOCT:  Short C-termin  25.7 1.2E+02  0.0025   17.1   3.0   18   94-111    13-30  (31)
 48 TIGR03575 selen_PSTK_euk L-ser  25.6 2.6E+02  0.0057   24.1   6.5   47   75-125   259-305 (340)
 49 PF04353 Rsd_AlgQ:  Regulator o  25.0 1.9E+02   0.004   22.6   4.9   38    8-46     20-59  (153)
 50 PF14838 INTS5_C:  Integrator c  24.8      86  0.0019   30.0   3.5   81    7-87    597-694 (696)
 51 PRK12807 flagellin; Provisiona  24.2 3.5E+02  0.0075   22.3   6.7   32   93-124    97-128 (287)
 52 KOG3870|consensus               23.9 1.4E+02   0.003   27.1   4.5   21   99-119    59-79  (434)
 53 KOG0289|consensus               22.6      43 0.00093   30.7   1.1   30   78-107    24-54  (506)
 54 PF15397 DUF4618:  Domain of un  22.4 1.2E+02  0.0026   25.5   3.7   19  101-119   200-218 (258)
 55 cd00520 RRF Ribosome recycling  22.1 3.8E+02  0.0083   20.8   7.2   49   71-119   117-165 (179)
 56 PF14389 Lzipper-MIP1:  Leucine  22.0 1.1E+02  0.0023   21.3   2.9   20  100-119    67-86  (88)
 57 TIGR01801 CM_A chorismate muta  21.4 1.3E+02  0.0028   21.5   3.2   26   99-124     2-27  (102)
 58 PF01988 VIT1:  VIT family;  In  21.1      93   0.002   24.5   2.7   16   97-112    93-108 (213)
 59 PF14164 YqzH:  YqzH-like prote  21.0 2.7E+02   0.006   18.7   4.7   18   94-111    22-39  (64)
 60 PRK09269 chorismate mutase; Pr  20.8 4.3E+02  0.0092   21.1   6.4   45   79-123   101-145 (193)
 61 PHA02554 13 neck protein; Prov  20.6   2E+02  0.0043   25.0   4.7   42   79-120   246-302 (311)
 62 PF01152 Bac_globin:  Bacterial  20.4 2.5E+02  0.0053   19.5   4.5   43   77-119    77-119 (120)
 63 PF06757 Ins_allergen_rp:  Inse  20.2      95  0.0021   23.8   2.5   53   78-130    20-75  (179)

No 1  
>TIGR00870 trp transient-receptor-potential calcium channel protein. after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
Probab=99.94  E-value=2.2e-27  Score=213.61  Aligned_cols=112  Identities=30%  Similarity=0.551  Sum_probs=88.9

Q ss_pred             CCccccchhhhhHHHHHhcCCCCCCCCcccccChHHHHHHHhhhhhhcccccc-HHH-HH--------HHHH-hhhhhhh
Q psy7032           5 RQIDKMRKDSIKLWLSYFDDSPTLPPPFNIFPTTKMFLKMFGLRQIDKMRKDS-IKR-KQ--------QARQ-EKERDFR   73 (136)
Q Consensus         5 ~~~~eWKFaRtkLwmsyf~~~~tlPpPFNliPspk~~~~l~~~~~~~~~~~~s-~~r-~~--------~~~~-~~~~~~~   73 (136)
                      +++.||||+|+++||+|++.+.|+||||||||+||+++|++.+..+..+.... ..+ ..        .... ..+.+.+
T Consensus       621 ~~~~~wk~~r~~~~~~y~~~~~~~p~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  700 (743)
T TIGR00870       621 DADEEWKFQRAKLWMSYEREGGTCPPPFNIIPGPKSFVGLFKRIEKHDGKKRQRWCRRVEEVNWTTWERKAETLIEDGLH  700 (743)
T ss_pred             hhHHHHHHHHHHHHHHHHhcCCCCCCCceeccchHHHHHHHhhhhcccCcchHhHhhhhhhhccchhhhhhhhccccchh
Confidence            56899999999999999999999999999999999999988665433222110 001 00        0001 1122367


Q ss_pred             HHHHHHHHHHHHHHHHHHhhcCCCCChhHHHHHHhhHHHHHHH
Q psy7032          74 YAAVMRALVWRYICLQHRQADTNPVTEDDVNEVKGEITAMRYE  116 (136)
Q Consensus        74 y~~VMr~LVkRYi~~~qr~~e~~~vtE~dl~EiKQDISslRyE  116 (136)
                      |+++|++||+|||++++++.+++++||||++||||||||||||
T Consensus       701 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~  743 (743)
T TIGR00870       701 YQRVMKRLIKRYVLAEQRPRDDEGTTEEETKELKQDISSLRFE  743 (743)
T ss_pred             HHHHHHhhhhhhhhhhccccccCCCChhhHHHHHHHhhhccCC
Confidence            9999999999999999999999999999999999999999997


No 2  
>KOG3609|consensus
Probab=99.94  E-value=3.5e-27  Score=216.39  Aligned_cols=125  Identities=22%  Similarity=0.277  Sum_probs=99.1

Q ss_pred             CCccccchhhhhHHHHHhcCCCCCCCCcccccChHHHHHHHhhhhhhccccc---cHHHHHHH--------HHhhhhhhh
Q psy7032           5 RQIDKMRKDSIKLWLSYFDDSPTLPPPFNIFPTTKMFLKMFGLRQIDKMRKD---SIKRKQQA--------RQEKERDFR   73 (136)
Q Consensus         5 ~~~~eWKFaRtkLwmsyf~~~~tlPpPFNliPspk~~~~l~~~~~~~~~~~~---s~~r~~~~--------~~~~~~~~~   73 (136)
                      ++|+||||||||||||||++|.||||||||||+||+++|+..+.+++.+...   +.++..+.        .....+...
T Consensus       617 ~ADvEWKFARAKLw~syF~eg~tLPpPFNlipspks~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~  696 (822)
T KOG3609|consen  617 DADVEWKFARAKLWMSYFDEGMALPPPFNLIPSPKSFVYRNLRLRASIWKEKPANDDVEDQAADMLLVVFAAFVARKEEK  696 (822)
T ss_pred             cchhHHHHHHHHHHHHHhcCCCCCCCCcccccchHHHHHHHhhhhcchhccCcccccchhhhHhHHHHHhhhhccchhhc
Confidence            6899999999999999999999999999999999999998866554432222   11111111        122345679


Q ss_pred             HHHHHHHHHHHHHHHHHHhhc-CCCCChhHHHHHHhhHHHHHHHHHHHHHhcCCCCC
Q psy7032          74 YAAVMRALVWRYICLQHRQAD-TNPVTEDDVNEVKGEITAMRYELMNVLEKNGMDTS  129 (136)
Q Consensus        74 y~~VMr~LVkRYi~~~qr~~e-~~~vtE~dl~EiKQDISslRyEl~e~l~~~gm~~~  129 (136)
                      |+.+|+..+.||..+..+... ..+.++...++++||++++|+|+++.++...+.+.
T Consensus       697 ~~~~~~~~~~r~~~~~~~~~~~t~g~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~  753 (822)
T KOG3609|consen  697 YKSIWRSEFHREEKQPPRVTFLTTGAHSFNMDNTFEGFKMRRSETMEALGGKEYTTV  753 (822)
T ss_pred             chhhhhhHHHHHHHHHHHHHhhccchhhhhhhhhhhccchhHHHHHHHhcCceeehh
Confidence            999999999999999888765 77999999999999999999999999975444333


No 3  
>KOG3614|consensus
Probab=93.16  E-value=0.45  Score=47.64  Aligned_cols=38  Identities=29%  Similarity=0.543  Sum_probs=29.9

Q ss_pred             CCccccchhhhhHHHHHhcCCCCCCCCcccccChHHHHH
Q psy7032           5 RQIDKMRKDSIKLWLSYFDDSPTLPPPFNIFPTTKMFLK   43 (136)
Q Consensus         5 ~~~~eWKFaRtkLwmsyf~~~~tlPpPFNliPspk~~~~   43 (136)
                      +.+.=|||-|=+|-|.|=+ -..|||||=||-..-.+..
T Consensus      1048 ~sd~iWkFQRY~limeyh~-rP~LPPPfiilsHi~l~~~ 1085 (1381)
T KOG3614|consen 1048 NSDQIWKFQRYSLIMEYHS-RPALPPPFIILSHIYLLLK 1085 (1381)
T ss_pred             hHHHHHHHHHHHHHHHhhc-CCCCCCCcHHHHHHHHHHH
Confidence            3456799999999999876 5589999988876655443


No 4  
>COG5466 Predicted small metal-binding protein [Function unknown]
Probab=76.77  E-value=5.4  Score=26.62  Aligned_cols=34  Identities=29%  Similarity=0.404  Sum_probs=24.7

Q ss_pred             HHHHHHHHHHHHHHHHHhhcCCCCChhHHHHHHhhHHH
Q psy7032          75 AAVMRALVWRYICLQHRQADTNPVTEDDVNEVKGEITA  112 (136)
Q Consensus        75 ~~VMr~LVkRYi~~~qr~~e~~~vtE~dl~EiKQDISs  112 (136)
                      ++||+++|-    .+...-..+.+.||+++-|||-|+.
T Consensus        25 ~Ev~~~iv~----H~k~~Hg~t~I~ed~in~Ik~rIk~   58 (59)
T COG5466          25 AEVMRRIVE----HAKEAHGETEIREDMINKIKSRIKK   58 (59)
T ss_pred             HHHHHHHHH----HHHHhcCCccccHHHHHHHHHHhhc
Confidence            567777763    3333334788999999999999874


No 5  
>PF13624 SurA_N_3:  SurA N-terminal domain; PDB: 3NRK_A.
Probab=75.81  E-value=14  Score=26.88  Aligned_cols=59  Identities=19%  Similarity=0.260  Sum_probs=31.8

Q ss_pred             hhHHHHHHHHHHHHHHHHHHhhcCCCCChhHHHHHHhhHHHH-------HHHHHHHHHhcCCCCCc
Q psy7032          72 FRYAAVMRALVWRYICLQHRQADTNPVTEDDVNEVKGEITAM-------RYELMNVLEKNGMDTSS  130 (136)
Q Consensus        72 ~~y~~VMr~LVkRYi~~~qr~~e~~~vtE~dl~EiKQDISsl-------RyEl~e~l~~~gm~~~~  130 (136)
                      .-.+.++..||.+.+....-++..-.|++++|.+.=+.|-.|       +-.+.+.|+++||+...
T Consensus        74 ~~~~~~l~~lI~~~ll~q~A~~~gi~vsd~ev~~~i~~~~~f~~~g~~~~~~f~~~L~~~g~t~~~  139 (154)
T PF13624_consen   74 QLKQQVLDQLIDQKLLLQEAKKLGISVSDAEVDDAIKQIPAFQENGKFDKEAFEEFLKQQGMTEEE  139 (154)
T ss_dssp             -HHHHHHHHHHHHHHHHHHHHHTT----HHHHHHHHHH--HHHHH----HHHHHHHHH--------
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHhhccccc
Confidence            456779999999999886666666777888887655554444       23456678888886543


No 6  
>KOG4807|consensus
Probab=73.60  E-value=3.8  Score=37.07  Aligned_cols=29  Identities=28%  Similarity=0.460  Sum_probs=25.5

Q ss_pred             CChhHHHHHHhhHHHHHHHHHHHHHhcCC
Q psy7032          98 VTEDDVNEVKGEITAMRYELMNVLEKNGM  126 (136)
Q Consensus        98 vtE~dl~EiKQDISslRyEl~e~l~~~gm  126 (136)
                      |-|.+|.=|||+|||||-||.-.+++..+
T Consensus       516 VKEsEiQYLKqEissLkDELQtalrDKky  544 (593)
T KOG4807|consen  516 VKESEIQYLKQEISSLKDELQTALRDKKY  544 (593)
T ss_pred             hhHHHHHHHHHHHHHHHHHHHHHHhhhhc
Confidence            45899999999999999999999987654


No 7  
>PRK00059 prsA peptidylprolyl isomerase; Provisional
Probab=71.11  E-value=26  Score=29.06  Aligned_cols=56  Identities=18%  Similarity=0.321  Sum_probs=40.4

Q ss_pred             hHHHHHHHHHHHHHHHHHHhhcCCCCChhHHH-HHHhhHHHHHH-------HHHHHHHhcCCCC
Q psy7032          73 RYAAVMRALVWRYICLQHRQADTNPVTEDDVN-EVKGEITAMRY-------ELMNVLEKNGMDT  128 (136)
Q Consensus        73 ~y~~VMr~LVkRYi~~~qr~~e~~~vtE~dl~-EiKQDISslRy-------El~e~l~~~gm~~  128 (136)
                      .+..|+.+||.+.+....-+...-.|++++|. +|.+.|..|.-       .+...|...||+.
T Consensus        85 ~~~~vL~~LI~~~ll~q~a~~~gi~vsd~ei~~~i~~~~~~~~~~~~~~~~~~~~~L~~~g~t~  148 (336)
T PRK00059         85 QKEQILDSLITEKVLLQKAKELKLIPSEEELNKEVDKKINEIKKQFNNDEEQFEEALKATGFTE  148 (336)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHcCCCcCHHHHHHHHHHHHHHHHHhcCCCHHHHHHHHHHcCCCH
Confidence            46789999999999886665556667888874 57777776643       4666777778753


No 8  
>PF14394 DUF4423:  Domain of unknown function (DUF4423)
Probab=61.98  E-value=11  Score=29.28  Aligned_cols=29  Identities=24%  Similarity=0.504  Sum_probs=26.3

Q ss_pred             CCCChhHHHHHHhhHHHHHHHHHHHHHhc
Q psy7032          96 NPVTEDDVNEVKGEITAMRYELMNVLEKN  124 (136)
Q Consensus        96 ~~vtE~dl~EiKQDISslRyEl~e~l~~~  124 (136)
                      -+|+++++.+|++-|-.||.++..+..+.
T Consensus       127 ~~vs~~~~~ki~~~i~~fRk~i~~i~~~~  155 (171)
T PF14394_consen  127 MSVSREDYEKIKKEIREFRKKIIAIAEED  155 (171)
T ss_pred             EEeCHHHHHHHHHHHHHHHHHHHHHHhcC
Confidence            46789999999999999999999998874


No 9  
>PF09388 SpoOE-like:  Spo0E like sporulation regulatory protein;  InterPro: IPR018540  Spore formation is an extreme response to starvation and can also be a component of disease transmission. Sporulation is controlled by an expanded two-component system where starvation signals result in sensor kinase activation and phosphorylation of the master sporulation response regulator Spo0A. Phosphatases such as Spo0E dephosphorylate Spo0A thereby inhibiting sporulation. This is a family of Spo0E-like phosphatases. The structure of a Bacillus anthracis member of this family has revealed an anti-parallel alpha-helical structure []. ; PDB: 2BZB_B 2C0S_A.
Probab=61.36  E-value=13  Score=22.65  Aligned_cols=27  Identities=33%  Similarity=0.500  Sum_probs=23.4

Q ss_pred             HHHhhHHHHHHHHHHHHHhcCCCCCcc
Q psy7032         105 EVKGEITAMRYELMNVLEKNGMDTSSA  131 (136)
Q Consensus       105 EiKQDISslRyEl~e~l~~~gm~~~~~  131 (136)
                      +|.+.|..+|.+|.+.....|+.-|.+
T Consensus         1 ~L~~~Ie~~R~~L~~~~~~~~l~~~~v   27 (45)
T PF09388_consen    1 ELLEEIEELRQELNELAEKKGLTDPEV   27 (45)
T ss_dssp             HHHHHHHHHHHHHHHHHHHCCTTCHHH
T ss_pred             CHHHHHHHHHHHHHHHHHHcCCCCHHH
Confidence            578899999999999999988876654


No 10 
>COG4224 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=60.79  E-value=25  Score=24.65  Aligned_cols=41  Identities=17%  Similarity=0.255  Sum_probs=26.2

Q ss_pred             HHHHHHHHHHHHHHhhcCCCCChhHHHHHHhhHHHHHHHHHHHHHh
Q psy7032          78 MRALVWRYICLQHRQADTNPVTEDDVNEVKGEITAMRYELMNVLEK  123 (136)
Q Consensus        78 Mr~LVkRYi~~~qr~~e~~~vtE~dl~EiKQDISslRyEl~e~l~~  123 (136)
                      |..++-| |-..-.++...++||++..    +=..||.|-++.++.
T Consensus         2 ~~~~i~r-INeLAkk~K~~gLTeeEk~----eQ~~LR~eYl~~fr~   42 (77)
T COG4224           2 SKKKIAR-INELAKKKKEEGLTEEEKK----EQAKLRREYLESFRG   42 (77)
T ss_pred             cHHHHHH-HHHHHHHhcccCCCHHHHH----HHHHHHHHHHHHHHH
Confidence            3455556 5444556678999999854    445666666666653


No 11 
>PRK06835 DNA replication protein DnaC; Validated
Probab=55.05  E-value=69  Score=27.32  Aligned_cols=57  Identities=21%  Similarity=0.230  Sum_probs=40.2

Q ss_pred             hHHHHHHHHHHHHHHHHHHhhcCCCCChhHHHHHHhhHHHHHHHHHHHHHhcCCCCC
Q psy7032          73 RYAAVMRALVWRYICLQHRQADTNPVTEDDVNEVKGEITAMRYELMNVLEKNGMDTS  129 (136)
Q Consensus        73 ~y~~VMr~LVkRYi~~~qr~~e~~~vtE~dl~EiKQDISslRyEl~e~l~~~gm~~~  129 (136)
                      +..++-..+...=|..++.--..+.-.+..+.++|+.|..|+.+--++|..+|.+..
T Consensus        37 ~~~~id~~i~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~l~~~~~~lL~~~g~~~d   93 (329)
T PRK06835         37 EIAEIDDEIAKLGIKLSRAILKNPDKKEETLKELKEKITDLRVKKAELLVSNGYPPD   93 (329)
T ss_pred             cHHHHHHHHHHHHHHHHHHHhcCCCccHHHHHHHHHHHHHHHHHHHHHHHHcCCChh
Confidence            444555555555555444433322223789999999999999999999999998744


No 12 
>PF08880 QLQ:  QLQ;  InterPro: IPR014978 QLQ is named after the conserved Gln, Leu, Gln motif. QLQ is found at the N terminus of SWI2/SNF2 protein, which has been shown to be involved in protein-protein interactions. QLQ has been postulated to be involved in mediating protein interactions []. ; GO: 0005524 ATP binding, 0016818 hydrolase activity, acting on acid anhydrides, in phosphorus-containing anhydrides, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=52.71  E-value=20  Score=21.47  Aligned_cols=19  Identities=32%  Similarity=0.676  Sum_probs=17.4

Q ss_pred             CCChhHHHHHHhhHHHHHH
Q psy7032          97 PVTEDDVNEVKGEITAMRY  115 (136)
Q Consensus        97 ~vtE~dl~EiKQDISslRy  115 (136)
                      ..|.+-+++||..|..+||
T Consensus         2 ~FT~~Ql~~L~~Qi~ayK~   20 (37)
T PF08880_consen    2 PFTPAQLQELRAQILAYKY   20 (37)
T ss_pred             CCCHHHHHHHHHHHHHHHH
Confidence            4688999999999999999


No 13 
>PF10845 DUF2576:  Protein of unknown function (DUF2576);  InterPro: IPR022556  The function of this viral family of proteins is unknown. The entry contains Orf5 from Autographa californica nuclear polyhedrosis virus (AcMNPV).
Probab=52.67  E-value=24  Score=22.56  Aligned_cols=28  Identities=18%  Similarity=0.447  Sum_probs=22.3

Q ss_pred             hHHHHHHhhHHHHHHHHHHHHHhc--CCCC
Q psy7032         101 DDVNEVKGEITAMRYELMNVLEKN--GMDT  128 (136)
Q Consensus       101 ~dl~EiKQDISslRyEl~e~l~~~--gm~~  128 (136)
                      =|-++||+|+.|||.-|.|+-..+  |++.
T Consensus        11 ydreqlrrelnsLR~~vhelctRs~t~fDc   40 (48)
T PF10845_consen   11 YDREQLRRELNSLRRSVHELCTRSTTGFDC   40 (48)
T ss_pred             cCHHHHHHHHHHHHHHHHHHHHhcCCCcch
Confidence            356789999999999999887765  4543


No 14 
>PF03210 Paramyx_P_V_C:  Paramyxovirus P/V phosphoprotein C-terminal;  InterPro: IPR004897 Paramyxoviral P genes are able to generate more than one product, using alternative reading frames and RNA editing. The P gene encodes the structural phosphoprotein P. In addition, it encodes several non-structural proteins present in the infected cell but not in the virus particle. This family includes phosphoprotein P and the non-structural phosphoprotein V from different paramyxoviruses. Phosphoprotein P is a modular protein organised into two moieties that are both functionally and structurally distinct: a well-conserved C-terminal moiety that contains all the regions required for transcription, and a poorly conserved, intrinsically unstructured N-terminal moiety that provides several additional functions required for replication. The N-terminal moiety is responsible for binding to newly synthesized free N(0) (nucleoprotein that has not yet bound RNA), in order to prevent the binding of N(0) to cellular RNA. The C-terminal moiety consists of an oligomerisation domain, an N-RNA (nucleoprotein-RNA)-binding domain and an L polymerase-binding domain [, ]. Phosphoprotein P is essential for the activity of the RNA polymerase complex which it forms with another subunit, L IPR001016 from INTERPRO. Although all the catalytic activities of the polymerase are associated with the L subunit, its function requires specific interactions with phosphoprotein P []. The P and V phosphoproteins are amino co-terminal, but diverge at their C-termini. This difference is generated by an RNA-editing mechanism in which one or two non-templated G residues are inserted into P-gene-derived mRNA. In Measles virus and Sendai virus, one G residue is inserted and the edited transcript encodes the V protein. In Mumps virus, Simian virus 5 and Newcastle disease virus, two G residues are inserted, and the edited transcript codes for the P protein []. Being phosphoproteins, both P and V are rich in serine and threonine residues over their whole lengths. In addition, the V proteins are rich in cysteine residues at the C-termini [].; PDB: 1T6O_A 2K9D_A 1OKS_A 3BBZ_B.
Probab=51.93  E-value=11  Score=29.13  Aligned_cols=39  Identities=26%  Similarity=0.280  Sum_probs=27.7

Q ss_pred             hhHHHHHHHHHHHHHHHHHHhhc-----CCCCChhHHHHHHhhH
Q psy7032          72 FRYAAVMRALVWRYICLQHRQAD-----TNPVTEDDVNEVKGEI  110 (136)
Q Consensus        72 ~~y~~VMr~LVkRYi~~~qr~~e-----~~~vtE~dl~EiKQDI  110 (136)
                      .-|..+++.|++-.+.......+     +.--|++||++||+.|
T Consensus       111 s~~k~~v~aLI~~~i~~~~~r~kl~~~l~~a~t~~dl~~ikr~I  154 (155)
T PF03210_consen  111 SGEKDTVRALIKSCIKDPDKRAKLLKKLNQAKTEEDLKRIKRMI  154 (155)
T ss_dssp             --HHHHHHHHHHHSTTSHHHHHHHHHHHHC--SHHHHHHHHHHH
T ss_pred             chHHHHHHHHHHHcCCCHHHHHHHHHHHHHcCCHHHHHHHHHHh
Confidence            45678999999999886544332     4556899999999987


No 15 
>PF03564 DUF1759:  Protein of unknown function (DUF1759);  InterPro: IPR005312 This is a small family of proteins of unknown function. 
Probab=49.44  E-value=96  Score=22.28  Aligned_cols=59  Identities=15%  Similarity=0.237  Sum_probs=42.9

Q ss_pred             hhhhHHHHHHHHHHHHHHHH-----HHh-hcCCC-CChhHHHHHHhhHHHHHHHHHHHHHhcCCCCC
Q psy7032          70 RDFRYAAVMRALVWRYICLQ-----HRQ-ADTNP-VTEDDVNEVKGEITAMRYELMNVLEKNGMDTS  129 (136)
Q Consensus        70 ~~~~y~~VMr~LVkRYi~~~-----qr~-~e~~~-vtE~dl~EiKQDISslRyEl~e~l~~~gm~~~  129 (136)
                      .+..|+..+..|-+||=...     ... -.+-+ +.++|..+|+..+..+..-+ ..|+..|.++.
T Consensus        52 ~~~~Y~~a~~~L~~~yg~~~~i~~~~~~~l~~l~~~~~~d~~~L~~~~~~v~~~i-~~L~~lg~~~~  117 (145)
T PF03564_consen   52 SEENYEEAWELLEERYGNPRRIIQALLEELRNLPPISNDDPEALRSLVDKVNNCI-RALKALGVNVD  117 (145)
T ss_pred             cchhhHHHHHHHHHHhCCchHHHHHHHHHHhccccccchhHHHHHHHHHHHHHHH-HHHHHcCCCCC
Confidence            45789999999999995432     222 12333 78899999999998888766 57777776654


No 16 
>PF10146 zf-C4H2:  Zinc finger-containing protein ;  InterPro: IPR018482 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents a family of proteins which appears to have a highly conserved zinc finger domain at the C-terminal end, described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is predicted to contain a coiled coil. Members of this family are annotated as being tumour-associated antigen HCA127 in humans, but this could not be confirmed.
Probab=47.41  E-value=41  Score=27.64  Aligned_cols=13  Identities=23%  Similarity=0.301  Sum_probs=5.8

Q ss_pred             hHHHHHHHHHHHH
Q psy7032          73 RYAAVMRALVWRY   85 (136)
Q Consensus        73 ~y~~VMr~LVkRY   85 (136)
                      .|..-|..|..-+
T Consensus        36 e~~kE~~~L~~Er   48 (230)
T PF10146_consen   36 EYRKEMEELLQER   48 (230)
T ss_pred             HHHHHHHHHHHHH
Confidence            4444444444433


No 17 
>PF09312 SurA_N:  SurA N-terminal domain;  InterPro: IPR015391 The correct folding of outer membrane proteins in Gram negative bacteria is facilitated by the survival protein SurA []. This entry represents the domain found at the N terminus of the chaperone SurA. It is a helical domain of unknown function. The C terminus of the SurA protein folds back and forms part of this domain also but is not included in the current alignment. ; PDB: 3RGC_B 2PV3_B 1M5Y_A.
Probab=47.21  E-value=71  Score=22.80  Aligned_cols=55  Identities=22%  Similarity=0.280  Sum_probs=36.1

Q ss_pred             HHHHHHHHHHHHHHHHHHhhcCCCCChhHHHHHHhhHHHHH----HHHHHHHHhcCCCC
Q psy7032          74 YAAVMRALVWRYICLQHRQADTNPVTEDDVNEVKGEITAMR----YELMNVLEKNGMDT  128 (136)
Q Consensus        74 y~~VMr~LVkRYi~~~qr~~e~~~vtE~dl~EiKQDISslR----yEl~e~l~~~gm~~  128 (136)
                      ...|+..||-..+....-+..+=.||+++|++.=++|.+-.    -++...|...|++.
T Consensus        43 ~~qvLd~LI~e~L~~q~ak~~gI~vsd~evd~~i~~ia~~n~ls~~ql~~~L~~~G~s~  101 (118)
T PF09312_consen   43 RKQVLDQLIDEKLQLQEAKRLGIKVSDEEVDEAIANIAKQNNLSVEQLRQQLEQQGISY  101 (118)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHCT----HHHHHHHHHHHHHHTT--HHHHHHHCHHCT--H
T ss_pred             HHHHHHHHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHHcCCCHHHHHHHHHHcCCCH
Confidence            46799999999998766666677889999988777776543    26666666777653


No 18 
>PF08105 Antimicrobial10:  Metchnikowin family;  InterPro: IPR012513 This family consists of the metchnikowin family of antimicrobial peptides from Drosophila. metchnikowin is a proline-rich peptide whose expression is immune-inducible. Induction of the metchnikowin gene expression can be mediated either by the TOLL pathway or by the imd gene product. The metchnikowin peptide is unique among the Drosophila antimicrobial peptides in that it is active against both bacteria and fungi [].
Probab=46.39  E-value=9.7  Score=24.71  Aligned_cols=9  Identities=56%  Similarity=0.866  Sum_probs=7.4

Q ss_pred             CCCCCCccc
Q psy7032          26 PTLPPPFNI   34 (136)
Q Consensus        26 ~tlPpPFNl   34 (136)
                      .|-|.|||=
T Consensus        35 DTRPSPFNP   43 (52)
T PF08105_consen   35 DTRPSPFNP   43 (52)
T ss_pred             CCCCCCCCC
Confidence            389999994


No 19 
>cd08807 CARD_CARD10_CARMA3 Caspase activation and recruitment domain of CARD10-like proteins. Caspase activation and recruitment domain (CARD) similar to that found in CARD10, also known as CARMA3 (caspase recruitment domain-containing membrane-associated guanylate kinase protein 3) or BIMP1. The CARMA3-BCL10-MALT1 signalosome plays a role in the GPCR-induced NF-kB activation. CARMA3 is more widely expressed than CARMA1, which is found only in hematopoietic cells. In endothelial and smooth muscle cells, CARMA3-mediated NF-kB activation induces pro-inflammatory signals within the vasculature and is a key factor in atherogenesis. In bronchial epithelial cells, CARMA3-mediated NF-kB signaling is important for the development of allergic airway inflammation. In general, CARDs are death domains (DDs) found associated with caspases. They are known to be important in the signaling pathways for apoptosis, inflammation, and host-defense mechanisms. DDs are protein-protein interaction domains fo
Probab=45.69  E-value=25  Score=25.07  Aligned_cols=32  Identities=25%  Similarity=0.388  Sum_probs=25.5

Q ss_pred             CCCCChhHHHHHHhhHH---HHHH--HHHHHHHhcCC
Q psy7032          95 TNPVTEDDVNEVKGEIT---AMRY--ELMNVLEKNGM  126 (136)
Q Consensus        95 ~~~vtE~dl~EiKQDIS---slRy--El~e~l~~~gm  126 (136)
                      +.-+||.|..||..+.+   +-+.  -|+|+|...|-
T Consensus        28 ~~VL~~~deEEI~~~~~~~~~~~k~~~LLDIL~trG~   64 (86)
T cd08807          28 CRVIDEQDEEEVLNSYRFPCRINRTGRLMDILRGRGK   64 (86)
T ss_pred             hCCCChhhHHHHHhccchhhHHHHHHHHHHHHHhcCc
Confidence            45569999999999988   5554  47999998883


No 20 
>cd08806 CARD_CARD14_CARMA2 Caspase activation and recruitment domain of CARD14-like proteins. Caspase activation and recruitment domain (CARD) similar to that found in CARD14, also known as BIMP2 or CARMA2 (caspase recruitment domain-containing membrane-associated guanylate kinase protein 2). CARD14 has been identified as a novel member of the MAGUK (membrane-associated guanylate kinase) family that functions as upstream activators of BCL10 (B-cell lymphoma 10) and NF-kB signaling. In general, CARDs are death domains (DDs) found associated with caspases. They are known to be important in the signaling pathways for apoptosis, inflammation, and host-defense mechanisms. DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members of the DD superfamily including PYRIN and DED (Death Effector Domain). They serve as adaptors in signaling pathways 
Probab=45.21  E-value=29  Score=24.76  Aligned_cols=32  Identities=19%  Similarity=0.231  Sum_probs=25.0

Q ss_pred             CCCCChhHHHHHHhhHHHHHHH-----HHHHHHhcCC
Q psy7032          95 TNPVTEDDVNEVKGEITAMRYE-----LMNVLEKNGM  126 (136)
Q Consensus        95 ~~~vtE~dl~EiKQDISslRyE-----l~e~l~~~gm  126 (136)
                      +.-+|++|..||..+.+.-...     |+|+|...|=
T Consensus        28 ~~IL~~~deeeIls~~t~~~r~~k~g~LLDIL~trG~   64 (86)
T cd08806          28 AKVLDQLDEEEVLHSPRLTNRAMRVGHLLDLLKTRGK   64 (86)
T ss_pred             cCCCChhhHHHHHccchHHHHHHHHHHHHHHHHhcCc
Confidence            4456999999999998865553     6899998874


No 21 
>TIGR03720 exospor_lead exosporium leader peptide. This domain is found as a leader peptide in at least two proteins targeted to the exosporium, a structure that occurs as the outermost layer of Bacillus anthracis, B. cereus, and B. thuringiensis spores. The exosporium consists of a basal layer and a nap of hair-like filaments. BclA, the major protein of the nap filaments, is targeted there by this leader peptide.
Probab=43.92  E-value=13  Score=21.05  Aligned_cols=11  Identities=64%  Similarity=1.325  Sum_probs=8.3

Q ss_pred             CCCCCC--Ccccc
Q psy7032          25 SPTLPP--PFNIF   35 (136)
Q Consensus        25 ~~tlPp--PFNli   35 (136)
                      |.||||  ||.+.
T Consensus         7 GPTlPpippft~P   19 (26)
T TIGR03720         7 GPTLPPIPPFTLP   19 (26)
T ss_pred             CCCCCCCCCcccc
Confidence            779987  77764


No 22 
>PF15344 FAM217:  FAM217 family
Probab=43.17  E-value=11  Score=31.34  Aligned_cols=10  Identities=60%  Similarity=1.215  Sum_probs=8.3

Q ss_pred             CCCCCCcccc
Q psy7032          26 PTLPPPFNIF   35 (136)
Q Consensus        26 ~tlPpPFNli   35 (136)
                      ..||||||-.
T Consensus        50 DFLPPPFNSw   59 (233)
T PF15344_consen   50 DFLPPPFNSW   59 (233)
T ss_pred             CCCCCCCcch
Confidence            4799999965


No 23 
>PF09366 DUF1997:  Protein of unknown function (DUF1997);  InterPro: IPR018971  This family of proteins are functionally uncharacterised. 
Probab=42.70  E-value=35  Score=25.74  Aligned_cols=14  Identities=50%  Similarity=1.146  Sum_probs=11.5

Q ss_pred             CCCCCCCcccccCh
Q psy7032          25 SPTLPPPFNIFPTT   38 (136)
Q Consensus        25 ~~tlPpPFNliPsp   38 (136)
                      +..+||||.++|.+
T Consensus       110 ~v~~P~~~~~~P~~  123 (158)
T PF09366_consen  110 SVELPPPFRLLPES  123 (158)
T ss_pred             EEEcChhHHhCCHH
Confidence            34689999999985


No 24 
>PF10073 DUF2312:  Uncharacterized protein conserved in bacteria (DUF2312);  InterPro: IPR018753 This entry is represented by Azospirillum phage Cd, Gp10. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.  Members of this family of hypothetical bacterial proteins have no known function. 
Probab=42.52  E-value=92  Score=21.59  Aligned_cols=40  Identities=23%  Similarity=0.412  Sum_probs=26.9

Q ss_pred             HHHHHHHHHHHHHHHHhhcCCCCChhHHHHHHhhHHHHHHHHHHHHHhcCCCCCc
Q psy7032          76 AVMRALVWRYICLQHRQADTNPVTEDDVNEVKGEITAMRYELMNVLEKNGMDTSS  130 (136)
Q Consensus        76 ~VMr~LVkRYi~~~qr~~e~~~vtE~dl~EiKQDISslRyEl~e~l~~~gm~~~~  130 (136)
                      .-++..|.|...-           |+|...|..||.    ++....+.+|+|+.-
T Consensus         4 ~~Lr~~ieRiErL-----------EeEk~~i~~dik----dVyaEAK~~GfD~K~   43 (74)
T PF10073_consen    4 EQLRQFIERIERL-----------EEEKKAISDDIK----DVYAEAKGNGFDTKA   43 (74)
T ss_pred             HHHHHHHHHHHHH-----------HHHHHHHHHHHH----HHHHHHHhCCCCHHH
Confidence            3466677776655           677666777775    566667888888753


No 25 
>PF11176 DUF2962:  Protein of unknown function (DUF2962);  InterPro: IPR021346  This eukaryotic family of proteins has no known function. ; PDB: 2KKM_A.
Probab=42.35  E-value=41  Score=25.83  Aligned_cols=14  Identities=21%  Similarity=0.349  Sum_probs=10.6

Q ss_pred             HHHHHHHHHHHHHH
Q psy7032          75 AAVMRALVWRYICL   88 (136)
Q Consensus        75 ~~VMr~LVkRYi~~   88 (136)
                      ..-|+.||.+||..
T Consensus        68 ~~e~~~lI~~yl~R   81 (155)
T PF11176_consen   68 LEEIHELIERYLHR   81 (155)
T ss_dssp             HHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHhc
Confidence            45678888888865


No 26 
>KOG3609|consensus
Probab=41.58  E-value=5  Score=38.65  Aligned_cols=43  Identities=14%  Similarity=0.026  Sum_probs=39.2

Q ss_pred             hhhhHHHHHHHHHHHHHHHHHHhhcCCCCChhHHHHHHhhHHH
Q psy7032          70 RDFRYAAVMRALVWRYICLQHRQADTNPVTEDDVNEVKGEITA  112 (136)
Q Consensus        70 ~~~~y~~VMr~LVkRYi~~~qr~~e~~~vtE~dl~EiKQDISs  112 (136)
                      ....++++|.+|..+|....|-..+.+.+++....+.+||+++
T Consensus       734 ~~~~~~e~~~~~~~~~~~~~q~~~~~~~~~~~~~~~~~~~~~~  776 (822)
T KOG3609|consen  734 FKMRRSETMEALGGKEYTTVQYYVPKDKTPGPAAAADAGTKVA  776 (822)
T ss_pred             cchhHHHHHHHhcCceeehhhhccccCcCCCccchhhhccccc
Confidence            3468999999999999999999999999999999999999993


No 27 
>TIGR02147 Fsuc_second hypothetical protein, TIGR02147. This family consists of the 40 members of a paralogous protein family in the rumen anaerobe Fibrobacter succinogenes S85. Member proteins are about 270 residues long and appear to lack signal sequences and transmembrane helices. The only perfectly conserved residue is a glycine in an otherwise poorly conserved region, suggesting members are not enzymes. The family is not characterized.
Probab=40.40  E-value=38  Score=28.37  Aligned_cols=29  Identities=21%  Similarity=0.392  Sum_probs=25.4

Q ss_pred             CCCChhHHHHHHhhHHHHHHHHHHHHHhc
Q psy7032          96 NPVTEDDVNEVKGEITAMRYELMNVLEKN  124 (136)
Q Consensus        96 ~~vtE~dl~EiKQDISslRyEl~e~l~~~  124 (136)
                      -+++++++.+|++-|-.||-+++.+..+.
T Consensus       225 ~~i~~~~~~~i~~~i~~fRk~i~~i~~~~  253 (271)
T TIGR02147       225 FGISEEAYKEIVKKIQEFRKEVLAIATKD  253 (271)
T ss_pred             EecCHHHHHHHHHHHHHHHHHHHHHHhcC
Confidence            46679999999999999999999877654


No 28 
>TIGR01219 Pmev_kin_ERG8 phosphomevalonate kinase, ERG8-type, eukaryotic branch. This enzyme is part of the mevalonate pathway, one of two alternative pathways for the biosynthesis of IPP. In an example of nonorthologous gene displacement, two different types of phosphomevalonate kinase are found - the animal type and this ERG8 type. This model represents plant and fungal forms of the ERG8 type of phosphomevalonate kinase.
Probab=40.25  E-value=2.4e+02  Score=25.49  Aligned_cols=28  Identities=14%  Similarity=0.325  Sum_probs=24.0

Q ss_pred             hHHHHHHhhHHHHHHHHHHHHHhcCCCC
Q psy7032         101 DDVNEVKGEITAMRYELMNVLEKNGMDT  128 (136)
Q Consensus       101 ~dl~EiKQDISslRyEl~e~l~~~gm~~  128 (136)
                      .|+.+|++-|...|..|.++=...|.++
T Consensus       352 ~~~~~i~~~i~~~R~~Lr~~~~~sgv~I  379 (454)
T TIGR01219       352 EELLEAREAMLRIRRLMRQITEEASVDI  379 (454)
T ss_pred             ccHHHHHHHHHHHHHHHHHhhHhcCCcc
Confidence            3899999999999999988888887665


No 29 
>PF08776 VASP_tetra:  VASP tetramerisation domain;  InterPro: IPR014885 Vasodilator-stimulated phosphoprotein (VASP) is an actin cytoskeletal regulatory protein. This region corresponds to the tetramerisation domain which forms a right handed alpha helical coiled coil structure []. ; PDB: 1USE_A 1USD_A.
Probab=38.97  E-value=58  Score=20.13  Aligned_cols=21  Identities=19%  Similarity=0.449  Sum_probs=11.9

Q ss_pred             HHHHHHhhHHHHHHHHHHHHH
Q psy7032         102 DVNEVKGEITAMRYELMNVLE  122 (136)
Q Consensus       102 dl~EiKQDISslRyEl~e~l~  122 (136)
                      =|.|.+..+..++.|+++.++
T Consensus        12 IL~EvrkEl~K~K~EIIeA~~   32 (40)
T PF08776_consen   12 ILEEVRKELQKVKEEIIEAIR   32 (40)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHHHHHHH
Confidence            344555555666666666554


No 30 
>PF13789 DUF4181:  Domain of unknown function (DUF4181)
Probab=38.83  E-value=3.6  Score=29.62  Aligned_cols=14  Identities=21%  Similarity=0.290  Sum_probs=11.3

Q ss_pred             CCCCCccccchhhh
Q psy7032           2 FGLRQIDKMRKDSI   15 (136)
Q Consensus         2 ~~~~~~~eWKFaRt   15 (136)
                      +++++-+||||+|-
T Consensus        72 ~~~ra~mEWKy~re   85 (110)
T PF13789_consen   72 FCFRAFMEWKYDRE   85 (110)
T ss_pred             HHHHHHHHHHhccc
Confidence            46788899999873


No 31 
>cd08809 CARD_CARD9 Caspase activation and recruitment domain of CARD9-like proteins. Caspase activation and recruitment domain (CARD) similar to that found in CARD9. CARD9 is a central regulator of innate immunity and is highly expressed in dendritic cells and macrophages. Together with BCL10 (B-cell lymphoma 10) and Malt1 (mucosa-associated lymphoid tissue-lymphoma-translocation gene 1), it forms the M-CBM signalosome (the CBM complex in myeloid immune cells), which mediates activation of NF-kB and MAPK by ITAM-coupled receptors expressed on immune cells. CARD9 associates with BCL10 via a CARD-CARD interaction. In general, CARDs are death domains (DDs) found associated with caspases. They are known to be important in the signaling pathways for apoptosis, inflammation, and host-defense mechanisms. DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating wi
Probab=37.51  E-value=45  Score=23.80  Aligned_cols=33  Identities=21%  Similarity=0.482  Sum_probs=24.3

Q ss_pred             CCCCChhHHHHHHhh---HHHHHH--HHHHHHHhcCCC
Q psy7032          95 TNPVTEDDVNEVKGE---ITAMRY--ELMNVLEKNGMD  127 (136)
Q Consensus        95 ~~~vtE~dl~EiKQD---ISslRy--El~e~l~~~gm~  127 (136)
                      +.-++++|++||..+   .++-+.  -|+|+|...|-.
T Consensus        28 ~~Vl~~~deEeI~~~~~~~~~~~kvg~LLDIL~~rG~~   65 (86)
T cd08809          28 CKVLNSDDEEQVLNDPSLVIRKRKVGVLLDILQRTGLK   65 (86)
T ss_pred             hCCCChhhHHHHHcCcccchHHHHHHHHHHHHHhcCch
Confidence            445699999999999   334333  478999988843


No 32 
>PRK10788 periplasmic folding chaperone; Provisional
Probab=33.04  E-value=78  Score=28.79  Aligned_cols=55  Identities=15%  Similarity=0.172  Sum_probs=42.1

Q ss_pred             HHHHHHHHHHHHHHHHHHhhcCCCCChhHHHHHHhhHHHHHH-------HHHHHHHhcCCCC
Q psy7032          74 YAAVMRALVWRYICLQHRQADTNPVTEDDVNEVKGEITAMRY-------ELMNVLEKNGMDT  128 (136)
Q Consensus        74 y~~VMr~LVkRYi~~~qr~~e~~~vtE~dl~EiKQDISslRy-------El~e~l~~~gm~~  128 (136)
                      .+.|+..||.+-+....-++-.-.|++++|.+.=+.+..|.-       .+...|+++||+.
T Consensus        88 ~~qvl~~LI~~~Ll~q~A~~lgi~vsd~ev~~~I~~~p~Fq~~G~Fd~~~y~~~L~~~g~t~  149 (623)
T PRK10788         88 RQQVLNRLIDEALLDQYARELGLGISDEQVKQAIFATPAFQTDGKFDNNKYLAILNQMGMTA  149 (623)
T ss_pred             HHHHHHHHHHHHHHHHHHHHcCCCcCHHHHHHHHHhCcccccCCCcCHHHHHHHHHHcCCCH
Confidence            478999999999998777777788899998876666655532       2567888889853


No 33 
>PF13801 Metal_resist:  Heavy-metal resistance; PDB: 3EPV_C 2Y3D_A 2Y3H_D 2Y3G_B 2Y3B_A 2Y39_A 3LAY_H.
Probab=31.41  E-value=94  Score=20.79  Aligned_cols=28  Identities=21%  Similarity=0.205  Sum_probs=20.3

Q ss_pred             CCCCChhHHHHHHhhHHHHHHHHHHHHH
Q psy7032          95 TNPVTEDDVNEVKGEITAMRYELMNVLE  122 (136)
Q Consensus        95 ~~~vtE~dl~EiKQDISslRyEl~e~l~  122 (136)
                      .+.++++.|..+-++|+..+.++.....
T Consensus        81 ~~~~D~~~i~a~~~~~~~~~~~l~~~~~  108 (125)
T PF13801_consen   81 APPPDEAAIEALLEEIREAQAELRQERL  108 (125)
T ss_dssp             CSSS-HHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             CCCCCHHHHHHHHHHHHHHHHHHHHHHH
Confidence            5677888888888888888887765543


No 34 
>PRK10770 peptidyl-prolyl cis-trans isomerase SurA; Provisional
Probab=30.74  E-value=2.4e+02  Score=24.02  Aligned_cols=55  Identities=16%  Similarity=0.260  Sum_probs=35.5

Q ss_pred             hHHHHHHHHHHHHHHHHHHhhcCCCCChhHHHHHHhhHHHH----HHHHHHHHHhcCCC
Q psy7032          73 RYAAVMRALVWRYICLQHRQADTNPVTEDDVNEVKGEITAM----RYELMNVLEKNGMD  127 (136)
Q Consensus        73 ~y~~VMr~LVkRYi~~~qr~~e~~~vtE~dl~EiKQDISsl----RyEl~e~l~~~gm~  127 (136)
                      -.+.++.+||.+.+....-+...-.|++++|++-=+.+..-    .-++...|..+|++
T Consensus        48 l~~~~l~~Li~~~Ll~q~A~~~gi~vsd~ev~~~i~~~~~~~~~~~~~~~~~L~~~g~~  106 (413)
T PRK10770         48 LRHQILERLIMDNIILQMAQKMGVKISDEQLDQAIANIAAQNNMTLDQMRSRLAYDGLN  106 (413)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHcCCCcCHHHHHHHHHHHHHHCCCCHHHHHHHHHHcCCC
Confidence            34678999999999987777777788888886633333220    11344555566653


No 35 
>PF05121 GvpK:  Gas vesicle protein K  ;  InterPro: IPR007805 Gas vesicles are intracellular, protein-coated, and hollow organelles found in cyanobacteria and halophilic archaea. They are permeable to ambient gases by diffusion and provide buoyancy, enabling cells to move upwards in liquid to access oxygen and/or light. Proteins containing this domain are involved in the formation of gas vesicles [].; GO: 0031412 gas vesicle organization
Probab=29.85  E-value=2.1e+02  Score=20.45  Aligned_cols=53  Identities=13%  Similarity=0.328  Sum_probs=41.4

Q ss_pred             HHHHHHHHHHHHHHHHhhcCCCCChhHHHHHHhhHHHHHHHHHHHHHhcCCCCCcc
Q psy7032          76 AVMRALVWRYICLQHRQADTNPVTEDDVNEVKGEITAMRYELMNVLEKNGMDTSSA  131 (136)
Q Consensus        76 ~VMr~LVkRYi~~~qr~~e~~~vtE~dl~EiKQDISslRyEl~e~l~~~gm~~~~~  131 (136)
                      +++++|..|   ++.|..+...+|++++..+=..+-.+.-.+-++...-|.+..+.
T Consensus        23 ELLRqlmEr---QAiRRme~G~Lse~qiErlG~tLm~Le~~~~~l~~~~gl~~~dL   75 (88)
T PF05121_consen   23 ELLRQLMER---QAIRRMEAGSLSEEQIERLGETLMKLEEAMEELCERFGLTPEDL   75 (88)
T ss_pred             HHHHHHHHH---HHHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHcCCCHHHh
Confidence            455666655   34666788889999999999999999999998888877765543


No 36 
>PRK01631 hypothetical protein; Provisional
Probab=29.73  E-value=1.6e+02  Score=20.56  Aligned_cols=40  Identities=18%  Similarity=0.351  Sum_probs=22.4

Q ss_pred             HHHHHHHHHHHHHHhhcCCCCChhHHHHHHhhHHHHHHHHHHHHH
Q psy7032          78 MRALVWRYICLQHRQADTNPVTEDDVNEVKGEITAMRYELMNVLE  122 (136)
Q Consensus        78 Mr~LVkRYi~~~qr~~e~~~vtE~dl~EiKQDISslRyEl~e~l~  122 (136)
                      |+.++.|-=.- -+++.+.|+|+++..|-+    .||.|-++.++
T Consensus         1 m~~ii~RINeL-akK~K~~gLT~eE~~Eq~----~LR~eYl~~fR   40 (76)
T PRK01631          1 MKNILFRINEL-SKKEKATGLTVDEKQEQQ----MLRQNYTQTFR   40 (76)
T ss_pred             ChhHHHHHHHH-HHHhcccCCCHHHHHHHH----HHHHHHHHHHH
Confidence            45566664333 234447899998866543    34444444444


No 37 
>TIGR00496 frr ribosome recycling factor. This model finds only eubacterial proteins. Mitochondrial and/or chloroplast forms might be expected but are not currently known. This protein was previously called ribosome releasing factor. By releasing ribosomes from mRNA at the end of protein biosynthesis, it prevents inappropriate translation from 3-prime regions of the mRNA and frees the ribosome for new rounds of translation. EGAD|53116|YHR038W is part of the frr superfamily.
Probab=29.39  E-value=2.8e+02  Score=21.69  Aligned_cols=48  Identities=13%  Similarity=0.145  Sum_probs=33.5

Q ss_pred             hhHHHHHHHHHHHHHHHHHHhhcCCCCChhHHHHHHhhHHHHHHHHHH
Q psy7032          72 FRYAAVMRALVWRYICLQHRQADTNPVTEDDVNEVKGEITAMRYELMN  119 (136)
Q Consensus        72 ~~y~~VMr~LVkRYi~~~qr~~e~~~vtE~dl~EiKQDISslRyEl~e  119 (136)
                      +.+..-+|++=+.+.........+.+++|||+..+..+|..+=-+-..
T Consensus       113 E~aKv~iRniRr~~~~~iKk~~k~~~iseD~~k~~~~~iQkltd~~i~  160 (176)
T TIGR00496       113 EQAKVAVRNVRRDANDKVKKLEKDKEISEDEERRLQEEIQKLTDEYIK  160 (176)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHhhcCCCChhHHHHHHHHHHHHHHHHHH
Confidence            455566666666666555544456779999999999999988665443


No 38 
>PRK07857 hypothetical protein; Provisional
Probab=29.18  E-value=86  Score=23.00  Aligned_cols=30  Identities=17%  Similarity=0.609  Sum_probs=25.2

Q ss_pred             CCCCChhHHHHHHhhHHHHHHHHHHHHHhc
Q psy7032          95 TNPVTEDDVNEVKGEITAMRYELMNVLEKN  124 (136)
Q Consensus        95 ~~~vtE~dl~EiKQDISslRyEl~e~l~~~  124 (136)
                      .++.+..+|.++++.|..+=.||++.|..+
T Consensus        22 ~~p~~~~~L~~lR~eID~ID~eIl~LL~eR   51 (106)
T PRK07857         22 DDPLSDAEIDELREEIDRLDAEILALVKRR   51 (106)
T ss_pred             CCCcchhhHHHHHHHHHHHHHHHHHHHHHH
Confidence            356678999999999999999999888653


No 39 
>PF05291 Bystin:  Bystin;  InterPro: IPR007955 Trophinin and tastin form a cell adhesion molecule complex that potentially mediates an initial attachment of the blastocyst to uterine epithelial cells at the time of implantation. Trophinin and tastin bind to an intermediary cytoplasmic protein called bystin. Bystin may be involved in implantation and trophoblast invasion because bystin is found with trophinin and tastin in the cells at human implantation sites and also in the intermediate trophoblasts at invasion front in the placenta from early pregnancy []. This family also includes the Saccharomyces cerevisiae protein ENP1. ENP1 is an essential protein in S. cerevisiae and is localised in the nucleus []. It is thought that ENP1 plays a direct role in the early steps of rRNA processing as enp1 defective S. cerevisiae cannot synthesise 20S pre-rRNA and hence 18S rRNA, which leads to reduced formation of 40S ribosomal subunits [].
Probab=28.08  E-value=60  Score=28.00  Aligned_cols=34  Identities=21%  Similarity=0.377  Sum_probs=27.1

Q ss_pred             chhhhhHHHHHhcCCCCCCCCcccccChHHHHHHH
Q psy7032          11 RKDSIKLWLSYFDDSPTLPPPFNIFPTTKMFLKMF   45 (136)
Q Consensus        11 KFaRtkLwmsyf~~~~tlPpPFNliPspk~~~~l~   45 (136)
                      -|.....+||-+.- +-||-||.|||+-+.+-.++
T Consensus        50 vY~~vG~~Ls~YrS-GkLPKafKiiP~l~nWEeiL   83 (301)
T PF05291_consen   50 VYTKVGELLSRYRS-GKLPKAFKIIPSLPNWEEIL   83 (301)
T ss_pred             HHHHHHHHHhhcCC-CCCCcCeecccCchhHHHHH
Confidence            46667788888884 48999999999988876555


No 40 
>PRK10132 hypothetical protein; Provisional
Probab=27.94  E-value=87  Score=22.83  Aligned_cols=26  Identities=23%  Similarity=0.509  Sum_probs=21.5

Q ss_pred             hhHHHHHHhhHHHHHHHHHHHHHhcC
Q psy7032         100 EDDVNEVKGEITAMRYELMNVLEKNG  125 (136)
Q Consensus       100 E~dl~EiKQDISslRyEl~e~l~~~g  125 (136)
                      |+++..|++||.+|--++=++|+..|
T Consensus        11 ~~q~e~L~~Dl~~L~~~le~ll~~~~   36 (108)
T PRK10132         11 DDGVQDIQNDVNQLADSLESVLKSWG   36 (108)
T ss_pred             hhHHHHHHHHHHHHHHHHHHHHHHHh
Confidence            57788999999999998888887655


No 41 
>PF08172 CASP_C:  CASP C terminal;  InterPro: IPR012955 This domain is the C-terminal region of the CASP family of proteins. These are Golgi membrane proteins which are thought to have a role in vesicle transport [].; GO: 0006891 intra-Golgi vesicle-mediated transport, 0030173 integral to Golgi membrane
Probab=27.62  E-value=74  Score=26.34  Aligned_cols=25  Identities=20%  Similarity=0.424  Sum_probs=16.0

Q ss_pred             hhHHHHHHhhHHHHHHHHHHHHHhc
Q psy7032         100 EDDVNEVKGEITAMRYELMNVLEKN  124 (136)
Q Consensus       100 E~dl~EiKQDISslRyEl~e~l~~~  124 (136)
                      |+++...+|.|++||.|+-....+|
T Consensus        99 E~elr~~~~~~~~L~~Ev~~L~~DN  123 (248)
T PF08172_consen   99 EEELRKQQQTISSLRREVESLRADN  123 (248)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            5667777777777777765544443


No 42 
>PF04728 LPP:  Lipoprotein leucine-zipper;  InterPro: IPR006817 This repeating sequence, NAKVDQLSNDV, is found in the enterobacterial outer membrane lipoprotein LPP. The outer membrane lipoprotein is the most abundant protein in an Escherichia coli cell. The messenger RNA for the lipoprotein of the E. coli outer membrane codes for a putative precursor, prolipoprotein, which has 20 additional amino acid residues extending from the amino terminus of the lipoprotein.; GO: 0019867 outer membrane; PDB: 1JCC_A 2GUV_C 2GUS_A 1JCD_A 1KFM_A 1T8Z_D 1KFN_A 1EQ7_A.
Probab=27.53  E-value=1.1e+02  Score=20.09  Aligned_cols=21  Identities=14%  Similarity=0.411  Sum_probs=16.2

Q ss_pred             hHHHHHHhhHHHHHHHHHHHH
Q psy7032         101 DDVNEVKGEITAMRYELMNVL  121 (136)
Q Consensus       101 ~dl~EiKQDISslRyEl~e~l  121 (136)
                      ..+..|.+||.++|-++....
T Consensus        17 ~kvdqLs~dv~~lr~~v~~ak   37 (56)
T PF04728_consen   17 SKVDQLSSDVNALRADVQAAK   37 (56)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHHHHHHH
Confidence            567788888888888886554


No 43 
>PRK13723 conjugal transfer pilus assembly protein TraH; Provisional
Probab=27.43  E-value=2.6e+02  Score=25.43  Aligned_cols=51  Identities=8%  Similarity=0.147  Sum_probs=33.8

Q ss_pred             hHHHHH-HHHHHHHHHHHH----HhhcCCCCChhHHHHHHhhHHHHHHHHHHHHHh
Q psy7032          73 RYAAVM-RALVWRYICLQH----RQADTNPVTEDDVNEVKGEITAMRYELMNVLEK  123 (136)
Q Consensus        73 ~y~~VM-r~LVkRYi~~~q----r~~e~~~vtE~dl~EiKQDISslRyEl~e~l~~  123 (136)
                      .|.+++ -.++-.|+...-    ....+...+++++.+++++|...|.++.++..+
T Consensus       358 ~~a~~IA~dll~~yL~~~l~~~~~al~~~~~~~~~~~~~~~~l~~a~~~~~~~~~~  413 (451)
T PRK13723        358 QLTDYIGYDILLQYIQELIQQARAMLATGNYPEAVMDHLRENLNQAQRQIAAFQSQ  413 (451)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            344433 345555555443    333355678999999999999999988766543


No 44 
>PF09860 DUF2087:  Uncharacterized protein conserved in bacteria (DUF2087);  InterPro: IPR018656  This domain, found in various hypothetical prokaryotic proteins and transcriptional activators, has no known function. 
Probab=26.37  E-value=2.1e+02  Score=19.25  Aligned_cols=25  Identities=24%  Similarity=0.425  Sum_probs=19.1

Q ss_pred             CCCCChhHHHHHH----hhHHHHHHHHHH
Q psy7032          95 TNPVTEDDVNEVK----GEITAMRYELMN  119 (136)
Q Consensus        95 ~~~vtE~dl~EiK----QDISslRyEl~e  119 (136)
                      ....||.|||++=    .|-+.||-+|.|
T Consensus        27 g~~y~E~EVN~~L~~~~~D~a~LRR~LVd   55 (71)
T PF09860_consen   27 GREYSEKEVNEILKRFFDDYATLRRYLVD   55 (71)
T ss_pred             CCccCHHHHHHHHHHHcccHHHHHHHHHH
Confidence            4556888888764    477899999876


No 45 
>PF07889 DUF1664:  Protein of unknown function (DUF1664);  InterPro: IPR012458 The members of this family are hypothetical plant proteins of unknown function. The region featured in this family is approximately 100 amino acids long. 
Probab=26.03  E-value=68  Score=24.15  Aligned_cols=24  Identities=21%  Similarity=0.408  Sum_probs=19.6

Q ss_pred             CChhHHHHHHhhHHHHHHHHHHHH
Q psy7032          98 VTEDDVNEVKGEITAMRYELMNVL  121 (136)
Q Consensus        98 vtE~dl~EiKQDISslRyEl~e~l  121 (136)
                      -+.+|+.+++.|++.+++++-.+-
T Consensus        86 ~i~~eV~~v~~dv~~i~~dv~~v~  109 (126)
T PF07889_consen   86 QIKDEVTEVREDVSQIGDDVDSVQ  109 (126)
T ss_pred             HHHHHHHHHHhhHHHHHHHHHHHH
Confidence            357899999999999999886554


No 46 
>TIGR02698 CopY_TcrY copper transport repressor, CopY/TcrY family. This family includes metal-fist type transcriptional repressors of copper transport systems such as copYZAB of Enterococcus hirae and tcrYAZB (transferble copper resistance) of an Enterocuccus faecium plasmid. High levels of copper can displace zinc and prevent binding by the repressor, activating efflux by copper resistance transporters. The most closely related proteins excluded by this model are antibiotic resistance regulators including the methicillin resistance regulatory protein MecI.
Probab=25.97  E-value=2.6e+02  Score=20.42  Aligned_cols=40  Identities=10%  Similarity=0.165  Sum_probs=24.9

Q ss_pred             hhHHHHHHHHHHHHHHH----HH-HhhcCCCCChhHHHHHHhhHH
Q psy7032          72 FRYAAVMRALVWRYICL----QH-RQADTNPVTEDDVNEVKGEIT  111 (136)
Q Consensus        72 ~~y~~VMr~LVkRYi~~----~q-r~~e~~~vtE~dl~EiKQDIS  111 (136)
                      ...+..++.++.++.--    .- .=.+++.++++|+.+|+|=|.
T Consensus        71 e~~~~~~~~~~~~~f~gs~~~ll~~l~~~~~ls~eele~L~~li~  115 (130)
T TIGR02698        71 EAVENAAQELFSRICSRKVGAVIADLIEESPLSQTDIEKLEKLLS  115 (130)
T ss_pred             HHHHHHHHHHHHHHHCCCHHHHHHHHHhcCCCCHHHHHHHHHHHH
Confidence            34466777777766321    11 112357799999999998443


No 47 
>PF09851 SHOCT:  Short C-terminal domain;  InterPro: IPR018649  This family of hypothetical prokaryotic proteins has no known function. 
Probab=25.72  E-value=1.2e+02  Score=17.10  Aligned_cols=18  Identities=17%  Similarity=0.440  Sum_probs=14.5

Q ss_pred             cCCCCChhHHHHHHhhHH
Q psy7032          94 DTNPVTEDDVNEVKGEIT  111 (136)
Q Consensus        94 e~~~vtE~dl~EiKQDIS  111 (136)
                      +...+|++|..+.|+.|-
T Consensus        13 ~~G~IseeEy~~~k~~ll   30 (31)
T PF09851_consen   13 DKGEISEEEYEQKKARLL   30 (31)
T ss_pred             HcCCCCHHHHHHHHHHHh
Confidence            456789999999998773


No 48 
>TIGR03575 selen_PSTK_euk L-seryl-tRNA(Sec) kinase, eukaryotic. Members of this protein are L-seryl-tRNA(Sec) kinase. This enzyme is part of a two-step pathway in Eukaryota and Archaea for performing selenocysteine biosynthesis by changing serine misacylated on selenocysteine-tRNA to selenocysteine. This enzyme performs the first step, phosphorylation of the OH group of the serine side chain. This family represents eukaryotic proteins with this activity.
Probab=25.61  E-value=2.6e+02  Score=24.11  Aligned_cols=47  Identities=6%  Similarity=0.217  Sum_probs=35.1

Q ss_pred             HHHHHHHHHHHHHHHHHhhcCCCCChhHHHHHHhhHHHHHHHHHHHHHhcC
Q psy7032          75 AAVMRALVWRYICLQHRQADTNPVTEDDVNEVKGEITAMRYELMNVLEKNG  125 (136)
Q Consensus        75 ~~VMr~LVkRYi~~~qr~~e~~~vtE~dl~EiKQDISslRyEl~e~l~~~g  125 (136)
                      ..+.+..|...+.+.+.    +.++..+++.+-+.++.+|.|+|+.++...
T Consensus       259 D~~tr~ii~~i~~~~k~----~~~~~~~~~~~~~~l~~~rk~~L~~~r~~~  305 (340)
T TIGR03575       259 DQTLRRIISQTMREAKD----EQASAYNLKLLAEELNKLKADFLEDLRQGN  305 (340)
T ss_pred             HHHHHHHHHHHHHHHHh----ccCChHHHHHHHHHHHHHHHHHHHHHHhcc
Confidence            34666777776654332    236788999999999999999999998654


No 49 
>PF04353 Rsd_AlgQ:  Regulator of RNA polymerase sigma(70) subunit, Rsd/AlgQ;  InterPro: IPR007448 This family includes bacterial transcriptional regulators that are thought to act through an interaction with the conserved region 4 of the sigma(70) subunit of RNA polymerase. The Pseudomonas aeruginosa homologue, AlgQ, positively regulates virulence gene expression and is associated with the mucoid phenotype observed in P. aeruginosa isolates from cystic fibrosis patients.; GO: 0006355 regulation of transcription, DNA-dependent; PDB: 2P7V_A.
Probab=25.04  E-value=1.9e+02  Score=22.55  Aligned_cols=38  Identities=16%  Similarity=0.275  Sum_probs=21.9

Q ss_pred             cccchhhhhHHHHHhcCCCCCCCC--cccccChHHHHHHHh
Q psy7032           8 DKMRKDSIKLWLSYFDDSPTLPPP--FNIFPTTKMFLKMFG   46 (136)
Q Consensus         8 ~eWKFaRtkLwmsyf~~~~tlPpP--FNliPspk~~~~l~~   46 (136)
                      .-|--+|-+|+..|..-. -+||.  =+-.|+++.+..+|.
T Consensus        20 D~WL~~Rq~Llv~Yc~L~-gl~p~~~~~~~~~~~~l~~FCq   59 (153)
T PF04353_consen   20 DRWLEERQQLLVAYCKLA-GLKPYKESLTLPSEEALQNFCQ   59 (153)
T ss_dssp             HHHHHHHHHHHHHHHHHH-S-------------HHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHhHh-cCCCCccccCCCCHHHHHHHHH
Confidence            358899999999999743 24443  467899999988885


No 50 
>PF14838 INTS5_C:  Integrator complex subunit 5 C-terminus
Probab=24.77  E-value=86  Score=30.02  Aligned_cols=81  Identities=15%  Similarity=0.181  Sum_probs=48.7

Q ss_pred             ccccchhhhhHHHHHhcCCCCCCCCcccc------cChHHHHHHHhhhhhhcccc--ccHHHH--HH-----HH--Hhhh
Q psy7032           7 IDKMRKDSIKLWLSYFDDSPTLPPPFNIF------PTTKMFLKMFGLRQIDKMRK--DSIKRK--QQ-----AR--QEKE   69 (136)
Q Consensus         7 ~~eWKFaRtkLwmsyf~~~~tlPpPFNli------Pspk~~~~l~~~~~~~~~~~--~s~~r~--~~-----~~--~~~~   69 (136)
                      +--|-+..|..|+.-+..|..||||++.+      -+|.-+++++.-......+.  ++...-  +.     .+  ....
T Consensus       597 ~~~~~l~~t~~Lv~lm~~gqlLPppL~~~~el~~~ltp~Ei~~lL~~cIW~y~kdh~Psp~~f~~~~~~g~~~RD~~~~~  676 (696)
T PF14838_consen  597 SSPWHLEATCKLVELMAKGQLLPPPLSYIHELFPYLTPHEIYLLLLSCIWNYMKDHVPSPALFVFNDETGLFWRDFSREK  676 (696)
T ss_pred             CcHHHHHHHHHHHHHHHhcCcCCchHHHHHHHHHhcCHHHHHHHHHHHHHHHHhhCCCCHHHHhcCcccCccccCccccc
Confidence            34577788999999999999999999976      45666666553211111110  111000  00     00  0111


Q ss_pred             hhhhHHHHHHHHHHHHHH
Q psy7032          70 RDFRYAAVMRALVWRYIC   87 (136)
Q Consensus        70 ~~~~y~~VMr~LVkRYi~   87 (136)
                      ....|.++.+.++.|-|.
T Consensus       677 ~~~~y~~~lr~v~h~nI~  694 (696)
T PF14838_consen  677 PPEKYTETLRSVLHRNID  694 (696)
T ss_pred             ChhHHHHHHHHHHHhhhh
Confidence            457899999999988774


No 51 
>PRK12807 flagellin; Provisional
Probab=24.16  E-value=3.5e+02  Score=22.28  Aligned_cols=32  Identities=6%  Similarity=0.217  Sum_probs=27.4

Q ss_pred             hcCCCCChhHHHHHHhhHHHHHHHHHHHHHhc
Q psy7032          93 ADTNPVTEDDVNEVKGEITAMRYELMNVLEKN  124 (136)
Q Consensus        93 ~e~~~vtE~dl~EiKQDISslRyEl~e~l~~~  124 (136)
                      .-+...+.+|..-|.++|..|+-+|..+..+.
T Consensus        97 a~ngt~s~~dr~ai~~Ei~~l~~~i~~~a~~t  128 (287)
T PRK12807         97 SSNGTNTAENQSALQKEFAELQEQIDYIAKNT  128 (287)
T ss_pred             HccCCCCHHHHHHHHHHHHHHHHHHHHHHHhC
Confidence            44566789999999999999999999998643


No 52 
>KOG3870|consensus
Probab=23.86  E-value=1.4e+02  Score=27.05  Aligned_cols=21  Identities=19%  Similarity=0.428  Sum_probs=18.3

Q ss_pred             ChhHHHHHHhhHHHHHHHHHH
Q psy7032          99 TEDDVNEVKGEITAMRYELMN  119 (136)
Q Consensus        99 tE~dl~EiKQDISslRyEl~e  119 (136)
                      -+.|+++|...++.||||++.
T Consensus        59 ~~~d~k~ii~~l~~Lr~ei~~   79 (434)
T KOG3870|consen   59 VVEDGKQIIKLLSKLRYEIQT   79 (434)
T ss_pred             HHHHHHHHHHHHHHHHHHHhc
Confidence            368899999999999999964


No 53 
>KOG0289|consensus
Probab=22.58  E-value=43  Score=30.65  Aligned_cols=30  Identities=20%  Similarity=0.413  Sum_probs=21.8

Q ss_pred             HHHHHHHHHHHHHHhhc-CCCCChhHHHHHH
Q psy7032          78 MRALVWRYICLQHRQAD-TNPVTEDDVNEVK  107 (136)
Q Consensus        78 Mr~LVkRYi~~~qr~~e-~~~vtE~dl~EiK  107 (136)
                      =|+|+.+||...-.+-= +++++++||-+||
T Consensus        24 EkrLIEqyI~e~G~DPIt~~pLs~eelV~Ik   54 (506)
T KOG0289|consen   24 EKRLIEQYIAETGKDPITNEPLSIEELVEIK   54 (506)
T ss_pred             HHHHHHHHHHHcCCCCCCCCcCCHHHeeecc
Confidence            37899999977555432 6677888887765


No 54 
>PF15397 DUF4618:  Domain of unknown function (DUF4618)
Probab=22.40  E-value=1.2e+02  Score=25.50  Aligned_cols=19  Identities=26%  Similarity=0.587  Sum_probs=12.9

Q ss_pred             hHHHHHHhhHHHHHHHHHH
Q psy7032         101 DDVNEVKGEITAMRYELMN  119 (136)
Q Consensus       101 ~dl~EiKQDISslRyEl~e  119 (136)
                      +++.+++.+|+-||.||-.
T Consensus       200 e~i~el~e~I~~L~~eV~~  218 (258)
T PF15397_consen  200 EEIDELEEEIPQLRAEVEQ  218 (258)
T ss_pred             HHHHHHHHHHHHHHHHHHH
Confidence            5667777777777777633


No 55 
>cd00520 RRF Ribosome recycling factor (RRF). Ribosome recycling factor dissociates the posttermination complex, composed of the ribosome, deacylated tRNA, and mRNA, after termination of translation.  Thus ribosomes are "recycled" and ready for another round of protein synthesis.  RRF is believed to bind the ribosome at the A-site in a manner that mimics tRNA, but the specific mechanisms remain unclear.  RRF is essential for bacterial growth.  It is not necessary for cell growth in archaea or eukaryotes, but is found in mitochondria or chloroplasts of some eukaryotic species.
Probab=22.13  E-value=3.8e+02  Score=20.81  Aligned_cols=49  Identities=10%  Similarity=0.159  Sum_probs=34.8

Q ss_pred             hhhHHHHHHHHHHHHHHHHHHhhcCCCCChhHHHHHHhhHHHHHHHHHH
Q psy7032          71 DFRYAAVMRALVWRYICLQHRQADTNPVTEDDVNEVKGEITAMRYELMN  119 (136)
Q Consensus        71 ~~~y~~VMr~LVkRYi~~~qr~~e~~~vtE~dl~EiKQDISslRyEl~e  119 (136)
                      -+.+..-+|++=+.+...........+++|||+.-...+|..+=-+-..
T Consensus       117 ~E~~Kv~iRniR~~~~~~lKk~~k~~~iseD~~k~~~~~iqkltd~~i~  165 (179)
T cd00520         117 AEEAKVAIRNIRRDANDKIKKLEKEKEISEDEVKKAEEDLQKLTDEYIK  165 (179)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHhhccCCCCchHHHHHHHHHHHHHHHHHH
Confidence            3456666777766666665555556678999999999999887665443


No 56 
>PF14389 Lzipper-MIP1:  Leucine-zipper of ternary complex factor MIP1
Probab=22.04  E-value=1.1e+02  Score=21.32  Aligned_cols=20  Identities=15%  Similarity=0.395  Sum_probs=16.8

Q ss_pred             hhHHHHHHhhHHHHHHHHHH
Q psy7032         100 EDDVNEVKGEITAMRYELMN  119 (136)
Q Consensus       100 E~dl~EiKQDISslRyEl~e  119 (136)
                      |.||.-+.|.+++|++.+.+
T Consensus        67 E~eV~~LE~~v~~L~~~l~~   86 (88)
T PF14389_consen   67 EAEVAKLEQKVLSLYRQLFQ   86 (88)
T ss_pred             HHHHHHHHHHHHHHHHHHHh
Confidence            78888888999999888765


No 57 
>TIGR01801 CM_A chorismate mutase domain of gram positive AroA protein. This model represents a small clade of chorismate mutase domains N-terminally fused to the first enzyme in the chorismate pathway, 2-dehydro-3-deoxyphosphoheptanoate aldolase (DAHP synthetase, AroA) which are found in some gram positive species and Deinococcus. Only in Deinococcus, where this domain is the sole CM domain in the genome can a trusted assignment of function be made. In the other species there is at least one other trusted CM domain present. The similarity between the Deinococcus gene and the others in this clade is sufficiently strong (~44% identity), that the whole clade can be trusted to be functional. The possibility exists, however, that in the gram positive species the fusion to the first enzyme in the pathway has evolved a separate, regulatory role.
Probab=21.41  E-value=1.3e+02  Score=21.47  Aligned_cols=26  Identities=4%  Similarity=0.473  Sum_probs=20.7

Q ss_pred             ChhHHHHHHhhHHHHHHHHHHHHHhc
Q psy7032          99 TEDDVNEVKGEITAMRYELMNVLEKN  124 (136)
Q Consensus        99 tE~dl~EiKQDISslRyEl~e~l~~~  124 (136)
                      |+.++.++++.|..+=.||++.|..+
T Consensus         2 ~~~~L~~lR~~ID~ID~eIl~LL~eR   27 (102)
T TIGR01801         2 SNQSLEDLRAEVDQLNRQILALISRR   27 (102)
T ss_pred             CcchHHHHHHHHHHHHHHHHHHHHHH
Confidence            45578899999999998888877643


No 58 
>PF01988 VIT1:  VIT family;  InterPro: IPR008217 Proteins containing this entry have no known function and are predicted to be integral membrane proteins. They include the Ccc1 protein from Saccharomyces cerevisiae (Baker's yeast) (P47818 from SWISSPROT) that may have a role in regulating calcium levels [].
Probab=21.10  E-value=93  Score=24.52  Aligned_cols=16  Identities=25%  Similarity=0.557  Sum_probs=9.9

Q ss_pred             CCChhHHHHHHhhHHH
Q psy7032          97 PVTEDDVNEVKGEITA  112 (136)
Q Consensus        97 ~vtE~dl~EiKQDISs  112 (136)
                      |++++|..++-.++++
T Consensus        93 Gl~~~~a~~i~~~l~~  108 (213)
T PF01988_consen   93 GLSEEDAEEIAEELSK  108 (213)
T ss_pred             CCCHHHHHHHHHHHHh
Confidence            5666666666666555


No 59 
>PF14164 YqzH:  YqzH-like protein
Probab=21.02  E-value=2.7e+02  Score=18.74  Aligned_cols=18  Identities=17%  Similarity=0.431  Sum_probs=7.9

Q ss_pred             cCCCCChhHHHHHHhhHH
Q psy7032          94 DTNPVTEDDVNEVKGEIT  111 (136)
Q Consensus        94 e~~~vtE~dl~EiKQDIS  111 (136)
                      +..+++.+|.++|.+-|.
T Consensus        22 ~~~pls~~E~~~L~~~i~   39 (64)
T PF14164_consen   22 ECMPLSDEEWEELCKHIQ   39 (64)
T ss_pred             cCCCCCHHHHHHHHHHHH
Confidence            344444444444444443


No 60 
>PRK09269 chorismate mutase; Provisional
Probab=20.80  E-value=4.3e+02  Score=21.11  Aligned_cols=45  Identities=11%  Similarity=0.290  Sum_probs=32.3

Q ss_pred             HHHHHHHHHHHHHhhcCCCCChhHHHHHHhhHHHHHHHHHHHHHh
Q psy7032          79 RALVWRYICLQHRQADTNPVTEDDVNEVKGEITAMRYELMNVLEK  123 (136)
Q Consensus        79 r~LVkRYi~~~qr~~e~~~vtE~dl~EiKQDISslRyEl~e~l~~  123 (136)
                      +.+-.+|+..-....+....+-.||.+++..|..|--+|+..+..
T Consensus       101 k~iQ~~~~a~W~~~~~~~~~~~~dL~~~Rp~l~~L~~~il~~l~~  145 (193)
T PRK09269        101 KLVQYALLARWRLAGAAPPGPRPDLASIRPRLDRLQQELLDALAD  145 (193)
T ss_pred             HHHHHHHHHHHHhCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHH
Confidence            344456665555443434446789999999999999999988875


No 61 
>PHA02554 13 neck protein; Provisional
Probab=20.59  E-value=2e+02  Score=25.02  Aligned_cols=42  Identities=21%  Similarity=0.220  Sum_probs=29.2

Q ss_pred             HHHHHHHHHHHHHhh-----------c-CCCCCh---hHHHHHHhhHHHHHHHHHHH
Q psy7032          79 RALVWRYICLQHRQA-----------D-TNPVTE---DDVNEVKGEITAMRYELMNV  120 (136)
Q Consensus        79 r~LVkRYi~~~qr~~-----------e-~~~vtE---~dl~EiKQDISslRyEl~e~  120 (136)
                      -+.||+|.++.-++.           + -.|||-   .=+.|-.|||..||.||+..
T Consensus       246 nrWvKdYaTAL~Ke~wGq~L~K~qGmqLPGGITldG~~i~eeA~~eie~Lr~eL~~~  302 (311)
T PHA02554        246 NRWVKDYATALVKEQWGQILAKHQGMQLPGGVTLDGRRLIEEARQEKEKLREELLLL  302 (311)
T ss_pred             hHHHHHHHHHHHHHHHHHHHHHccCcccCCceeechHhHHHHHHHHHHHHHHHHHhc
Confidence            468888888753321           1 245653   44678899999999999754


No 62 
>PF01152 Bac_globin:  Bacterial-like globin;  InterPro: IPR001486 Globins are haem-containing proteins involved in binding and/or transporting oxygen. They belong to a very large and well studied family that is widely distributed in many organisms []. Globins have evolved from a common ancestor and can be divided into three groups: single-domain globins, and two types of chimeric globins, flavohaemoglobins and globin-coupled sensors. Bacteria have all three types of globins, while archaea lack flavohaemoglobins, and eukaryotes lack globin-coupled sensors []. Several functionally different haemoglobins can coexist in the same species. The major types of globins include:   Haemoglobin (Hb): trimer of two alpha and two beta chains, although embryonic and foetal forms can substitute the alpha or beta chain for ones with higher oxygen affinity, such as gamma, delta, epsilon or zeta chains. Hb transports oxygen from lungs to other tissues in vertebrates []. Hb proteins are also present in unicellular organisms where they act as enzymes or sensors []. Myoglobin (Mb): monomeric protein responsible for oxygen storage in vertebrate muscle [].  Neuroglobin: a myoglobin-like haemprotein expressed in vertebrate brain and retina, where it is involved in neuroprotection from damage due to hypoxia or ischemia []. Neuroglobin belongs to a branch of the globin family that diverged early in evolution.  Cytoglobin: an oxygen sensor expressed in multiple tissues. Related to neuroglobin []. Erythrocruorin: highly cooperative extracellular respiratory proteins found in annelids and arthropods that are assembled from as many as 180 subunit into hexagonal bilayers []. Leghaemoglobin (legHb or symbiotic Hb): occurs in the root nodules of leguminous plants, where it facilitates the diffusion of oxygen to symbiotic bacteriods in order to promote nitrogen fixation. Non-symbiotic haemoglobin (NsHb): occurs in non-leguminous plants, and can be over-expressed in stressed plants []. Flavohaemoglobins (FHb): chimeric, with an N-terminal globin domain and a C-terminal ferredoxin reductase-like NAD/FAD-binding domain. FHb provides protection against nitric oxide via its C-terminal domain, which transfers electrons to haem in the globin []. Globin-coupled sensors: chimeric, with an N-terminal myoglobin-like domain and a C-terminal domain that resembles the cytoplasmic signalling domain of bacterial chemoreceptors. They bind oxygen, and act to initiate an aerotactic response or regulate gene expression [, ].  Protoglobin: a single domain globin found in archaea that is related to the N-terminal domain of globin-coupled sensors []. Truncated 2/2 globin: lack the first helix, giving them a 2-over-2 instead of the canonical 3-over-3 alpha-helical sandwich fold. Can be divided into three main groups (I, II and II) based on structural features [].   This entry represents a group of haemoglobin-like proteins found in eubacteria, cyanobacteria, protozoa, algae and plants, but not in animals or yeast. These proteins have a truncated 2-over-2 rather than the canonical 3-over-3 alpha-helical sandwich fold []. This entry includes:   HbN (or GlbN): a truncated haemoglobin-like protein that binds oxygen cooperatively with a very high affinity and a slow dissociation rate, which may exclude it from oxygen transport. It appears to be involved in bacterial nitric oxide detoxification and in nitrosative stress []. Cyanoglobin (or GlbN): a truncated haemoprotein found in cyanobacteria that has high oxygen affinity, and which appears to serve as part of a terminal oxidase, rather than as a respiratory pigment []. HbO (or GlbO): a truncated haemoglobin-like protein with a lower oxygen affinity than HbN. HbO associates with the bacterial cell membrane, where it significantly increases oxygen uptake over membranes lacking this protein. HbO appears to interact with a terminal oxidase, and could participate in an oxygen/electron-transfer process that facilitates oxygen transfer during aerobic metabolism []. Glb3: a nuclear-encoded truncated haemoglobin from plants that appears more closely related to HbO than HbN. Glb3 from Arabidopsis thaliana (Mouse-ear cress) exhibits an unusual concentration-independent binding of oxygen and carbon dioxide [].  ; GO: 0019825 oxygen binding, 0015671 oxygen transport; PDB: 2BKM_B 1UVY_A 1DLW_A 2XYK_B 2IG3_A 2GKM_B 1S61_A 1S56_B 1RTE_B 2GLN_A ....
Probab=20.37  E-value=2.5e+02  Score=19.51  Aligned_cols=43  Identities=19%  Similarity=0.329  Sum_probs=32.9

Q ss_pred             HHHHHHHHHHHHHHHhhcCCCCChhHHHHHHhhHHHHHHHHHH
Q psy7032          77 VMRALVWRYICLQHRQADTNPVTEDDVNEVKGEITAMRYELMN  119 (136)
Q Consensus        77 VMr~LVkRYi~~~qr~~e~~~vtE~dl~EiKQDISslRyEl~e  119 (136)
                      |......+++...+..-++.+|.+.+++++-+-+.++|..+..
T Consensus        77 it~~~f~~~~~~~~~al~~~~v~~~~~~~~~~~~~~~~~~i~n  119 (120)
T PF01152_consen   77 ITEEHFDRWLELLKQALDELGVPEELIDELLARLESLRDDIVN  119 (120)
T ss_dssp             BBHHHHHHHHHHHHHHHHHTTCTHHHHHHHHHHHHHHHHHHHH
T ss_pred             CCHHHHHHHHHHHHHHHHHhCCCHHHHHHHHHHHHHHHHHhcC
Confidence            3344455555566667778899999999999999999998864


No 63 
>PF06757 Ins_allergen_rp:  Insect allergen related repeat, nitrile-specifier detoxification;  InterPro: IPR010629 This entry represents several insect specific allergen repeats. These repeats are commonly found in various proteins from cockroaches, fruit flies and mosquitos. It has been suggested that the repeat sequences have evolved by duplication of an ancestral amino acid domain, which may have arisen from the mitochondrial energy transfer proteins [].  This family exemplifies a case of novel gene evolution. The case in point is the arms-race between plants and their infective insective herbivores in the area of the glucosinolate-myrosinase system. Brassicas have developed the glucosinolate-myrosinase system as chemical defence mechanism against the insects, and consequently the insects have adapted to produce a detoxifying molecule, nitrile-specifier protein (NSP). NSP is present in the Pieris rapae (Cabbage white butterfly). NSP is structurally different from and has no amino acid homology to any known detoxifying enzymes, and it appears to have arisen by a process of domain and gene duplication of a sequence of unknown function that is widespread in insect species and referred to as insect-allergen-repeat protein. Thus this family is found either as a single domain or as a multiple repeat-domain []. 
Probab=20.16  E-value=95  Score=23.83  Aligned_cols=53  Identities=21%  Similarity=0.322  Sum_probs=34.2

Q ss_pred             HHHHHHHHHHH-HHHhhcCCCCChhHHHHHHhhHHHHHH--HHHHHHHhcCCCCCc
Q psy7032          78 MRALVWRYICL-QHRQADTNPVTEDDVNEVKGEITAMRY--ELMNVLEKNGMDTSS  130 (136)
Q Consensus        78 Mr~LVkRYi~~-~qr~~e~~~vtE~dl~EiKQDISslRy--El~e~l~~~gm~~~~  130 (136)
                      +..|+.+|+.. .+-.+--+=++-.+++++.+++.++--  ++++.|+++|.++..
T Consensus        20 i~~i~~~Y~~~D~efq~~~~yl~s~~f~~l~~~l~~~pE~~~l~~yL~~~gldv~~   75 (179)
T PF06757_consen   20 IQDIVQRYYLEDAEFQAAVRYLNSSEFKQLWQQLEALPEVKALLDYLESAGLDVYY   75 (179)
T ss_pred             HHHHHHHHHHcCHHHHHHHHHHcChHHHHHHHHHHcCHHHHHHHHHHHHCCCCHHH
Confidence            67788888542 222211122234578899988887542  788888999988765


Done!