Query         psy2167
Match_columns 79
No_of_seqs    104 out of 195
Neff          5.5 
Searched_HMMs 46136
Date          Fri Aug 16 18:01:57 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy2167.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/2167hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG3547|consensus              100.0 2.4E-33 5.1E-38  218.2   5.9   75    5-79    175-250 (450)
  2 PF01062 Bestrophin:  Bestrophi  99.0 1.1E-10 2.3E-15   84.4   1.1   72    5-78    160-231 (293)
  3 COG3781 Predicted membrane pro  82.2     2.8   6E-05   32.1   4.4   58   16-76    164-223 (306)
  4 PF08828 DSX_dimer:  Doublesex   71.5     2.3 4.9E-05   25.6   1.1   26   42-67      4-29  (62)
  5 PF02284 COX5A:  Cytochrome c o  65.0       7 0.00015   25.9   2.5   30   38-67     13-49  (108)
  6 PF13624 SurA_N_3:  SurA N-term  63.2      15 0.00033   23.7   3.9   32   19-50     87-118 (154)
  7 PF11848 DUF3368:  Domain of un  62.7      14  0.0003   20.4   3.1   24   19-45      6-29  (48)
  8 PF11464 Rbsn:  Rabenosyn Rab b  58.8      11 0.00023   21.0   2.1   28   18-45      8-35  (42)
  9 PF14394 DUF4423:  Domain of un  48.4      21 0.00046   24.6   2.8   23   33-55    130-152 (171)
 10 PF09491 RE_AlwI:  AlwI restric  46.0      38 0.00083   26.4   4.2   45   10-54     56-112 (435)
 11 cd01421 IMPCH Inosine monophos  44.0      54  0.0012   23.4   4.3   33   20-52    126-169 (187)
 12 KOG1562|consensus               43.8      57  0.0012   25.5   4.7   56   18-74    218-278 (337)
 13 PF04677 CwfJ_C_1:  Protein sim  42.7      19  0.0004   23.6   1.7   17   39-55     65-81  (121)
 14 KOG4077|consensus               42.4      13 0.00028   25.8   1.0   21   46-66     67-87  (149)
 15 PF13310 Virulence_RhuM:  Virul  42.2      57  0.0012   24.6   4.4   37   16-52     47-94  (260)
 16 PF00681 Plectin:  Plectin repe  40.5      33 0.00071   18.5   2.3   14   22-35     24-37  (45)
 17 PF09202 Rio2_N:  Rio2, N-termi  39.3      28 0.00061   21.5   2.0   19   47-65     12-30  (82)
 18 PF06424 PRP1_N:  PRP1 splicing  39.1      19 0.00041   24.5   1.3   27   39-65    101-127 (133)
 19 KOG2477|consensus               38.0      21 0.00046   29.7   1.7   19   37-55    460-478 (628)
 20 PF02787 CPSase_L_D3:  Carbamoy  37.7      65  0.0014   21.1   3.7   37   19-57     62-102 (123)
 21 cd00923 Cyt_c_Oxidase_Va Cytoc  35.9      19 0.00041   23.7   0.9   23   45-67     24-46  (103)
 22 PF11892 DUF3412:  Domain of un  35.8      32 0.00069   23.3   2.0   20   41-60      2-21  (123)
 23 PRK12275 hypothetical protein;  33.5      80  0.0017   19.9   3.6   35   20-55     78-112 (116)
 24 COG3462 Predicted membrane pro  32.9      54  0.0012   22.0   2.7   10   13-22     53-62  (117)
 25 PF07017 PagP:  Antimicrobial p  32.8      15 0.00032   25.5   0.1   13   56-68    105-117 (147)
 26 PF09111 SLIDE:  SLIDE;  InterP  31.5      74  0.0016   20.9   3.2   33   21-59     60-92  (118)
 27 COG1422 Predicted membrane pro  30.7      19  0.0004   26.2   0.3   49   22-75     88-136 (201)
 28 TIGR02147 Fsuc_second hypothet  30.3      57  0.0012   24.3   2.8   22   33-54    228-249 (271)
 29 PF10303 DUF2408:  Protein of u  29.9 1.2E+02  0.0026   20.2   4.1   26   30-55    100-125 (134)
 30 PF14767 RPA_interact_M:  Repli  29.6      56  0.0012   20.3   2.3   24   38-61      6-33  (83)
 31 KOG3326|consensus               29.4      55  0.0012   22.9   2.4   24   19-42     61-84  (154)
 32 PF08671 SinI:  Anti-repressor   29.3      49  0.0011   17.0   1.7   13   19-31      6-18  (30)
 33 PF10845 DUF2576:  Protein of u  28.4      41 0.00089   19.2   1.4   23   34-56     10-32  (48)
 34 KOG4095|consensus               26.9      74  0.0016   22.5   2.7   27   36-62     17-51  (165)
 35 PF10474 DUF2451:  Protein of u  26.2   1E+02  0.0022   22.3   3.5   44   13-56     67-113 (234)
 36 COG3969 Predicted phosphoadeno  25.4 1.2E+02  0.0026   24.2   3.9   44   23-66     49-100 (407)
 37 PF07026 DUF1317:  Protein of u  25.1      94   0.002   18.6   2.5   23   31-53     32-54  (60)
 38 PF05635 23S_rRNA_IVP:  23S rRN  25.0 1.6E+02  0.0034   18.2   3.8   33   20-53     77-109 (110)
 39 TIGR02436 conserved hypothetic  25.0 1.4E+02  0.0031   19.0   3.6   35   21-56     72-106 (111)
 40 PF04437 RINT1_TIP1:  RINT-1 /   24.5 2.5E+02  0.0054   22.1   5.6   50   15-64     68-125 (494)
 41 PF09779 Ima1_N:  Ima1 N-termin  23.0      68  0.0015   21.2   1.9   22   33-54     98-119 (131)
 42 KOG2351|consensus               22.6      93   0.002   21.3   2.5   21   28-49    112-132 (134)
 43 cd07311 terB_like_1 tellurium   22.3 1.5E+02  0.0033   20.1   3.5   21   14-34     23-43  (150)
 44 PTZ00233 variable surface prot  22.3      62  0.0014   26.5   1.8   34   22-58     67-100 (509)
 45 PF04714 BCL_N:  BCL7, N-termin  21.7 1.4E+02  0.0029   17.4   2.7   27   36-62     16-50  (52)
 46 PF00458 WHEP-TRS:  WHEP-TRS do  21.6 1.8E+02  0.0038   16.8   4.3   42   20-63     10-51  (56)
 47 COG2089 SpsE Sialic acid synth  21.4      37 0.00081   26.6   0.4   52   12-64     86-139 (347)
 48 PF05791 Bacillus_HBL:  Bacillu  20.9      99  0.0021   21.4   2.4   18   36-53    125-142 (184)
 49 PLN03075 nicotianamine synthas  20.8 1.3E+02  0.0029   22.7   3.3   35   18-56     38-72  (296)
 50 KOG2787|consensus               20.8 1.6E+02  0.0034   23.5   3.7   47   16-66    164-210 (403)
 51 COG3943 Virulence protein [Gen  20.5 2.7E+02  0.0058   21.4   4.8   40   15-54    103-151 (329)
 52 PF14637 FNIP_M:  Folliculin-in  20.4 2.2E+02  0.0048   20.9   4.3   39   19-57     51-89  (232)

No 1  
>KOG3547|consensus
Probab=99.98  E-value=2.4e-33  Score=218.20  Aligned_cols=75  Identities=37%  Similarity=0.652  Sum_probs=71.5

Q ss_pred             CCCCC-CceeeHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHhcccccCcceecchhheeeeeEeeecccC
Q psy2167           5 RSPMA-KYWMPLVWATNIINRARKEALISSDHVVQTLLYELSEHRRKLGSIISYDTVCVPLVYTQQNFLDAPTKHY   79 (79)
Q Consensus         5 ~~~~~-kywvPi~Wa~~ll~~ar~eg~i~~~~~~~~l~~ei~~FR~~l~~L~~yDWVpIPLvY~QvV~lAV~~Yf~   79 (79)
                      ..+++ |||+||+|+.+|+++|+++|+|++|..++.+.+||++||++|++|++||||||||||||||+||||+||+
T Consensus       175 ~~~~~~kyw~Pi~Wa~sl~~~a~~~g~i~~d~~~~~~~~ei~~fR~~l~~L~~yDwVpIPL~YpQvV~lAVr~YF~  250 (450)
T KOG3547|consen  175 VDSKYMKYWVPINWAFSLLRKARKEGRIESDYLYNSLLGEIKKFRTGLSLLCNYDWVPIPLVYPQVVFLAVRSYFF  250 (450)
T ss_pred             ccccccceeecHHHHHHHHHHHHHcCCccccHHHHHHHHHHHHHHHhhccccCCCceeccccccceeeeehHHhHH
Confidence            34444 9999999999999999999999999999999999999999999999999999999999999999999995


No 2  
>PF01062 Bestrophin:  Bestrophin, RFP-TM, chloride channel;  InterPro: IPR021134 Bestrophin is a 68kDa basolateral plasma membrane protein expressed in retinal pigment epithelial cells (RPE). It is encoded by the VMD2 gene, which is mutated in Best macular dystrophy, a disease characterised by a depressed light peak in the electrooculogram []. VMD2 encodes a 585-amino acid protein with an approximate mass of 68 kDa which has been designated bestrophin. Bestrophin shares homology with the Caenorhabditis elegans RFP gene family, named for the presence of a conserved arginine (R), phenylalanine (F), proline (P), amino acid sequence motif. Bestrophin is a plasma membrane protein, localised to the basolateral surface of RPE cells consistent with a role for bestrophin in the generation or regulation of the EOG light peak. Bestrophin and other RFP family members represent a new class of chloride channels, indicating a direct role for bestrophin in generating the light peak []. The VMD2 gene underlying Best disease was shown to represent the first human member of the RFP-TM protein family. More than 97% of the disease-causing mutations are located in the N-terminal RFP-TM domain implying important functional properties []. This entry also includes a number of hypothetical proteins belonging to protein family UPF0187.
Probab=98.99  E-value=1.1e-10  Score=84.37  Aligned_cols=72  Identities=21%  Similarity=0.287  Sum_probs=66.2

Q ss_pred             CCCCCCceeeHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHhcccccCcceecchhheeeeeEeeeccc
Q psy2167           5 RSPMAKYWMPLVWATNIINRARKEALISSDHVVQTLLYELSEHRRKLGSIISYDTVCVPLVYTQQNFLDAPTKH   78 (79)
Q Consensus         5 ~~~~~kywvPi~Wa~~ll~~ar~eg~i~~~~~~~~l~~ei~~FR~~l~~L~~yDWVpIPLvY~QvV~lAV~~Yf   78 (79)
                      +.++...++|..|+..+ ++++++|. .++.....+.+++.+|++.++.+..++++|||++|+|.+...+++|+
T Consensus       160 ~~~~~p~~i~~~~~~~l-~~~~~~g~-~~~~~~~~l~~~l~~l~~~~g~~eri~~TPiP~~Y~~~~~~~l~~y~  231 (293)
T PF01062_consen  160 SADNPPNWILIQWSQRL-REARREGL-IDSFQLAQLDNELNALRDAQGGCERIKNTPIPFAYTQHLSRFLYIYL  231 (293)
T ss_pred             hhcCcHHHHHHHHHHHH-HHHHHcCC-CChHHHHHHHHHHHHHHHHHhhhheeCCCcCCHHHHHHHHHHHHHHH
Confidence            34677899999999999 99999999 55668999999999999999999999999999999999999998886


No 3  
>COG3781 Predicted membrane protein [Function unknown]
Probab=82.21  E-value=2.8  Score=32.06  Aligned_cols=58  Identities=14%  Similarity=0.213  Sum_probs=43.3

Q ss_pred             HHHHHHHHHHHHhcCCCCHH--HHHHHHHHHHHHHHHhcccccCcceecchhheeeeeEeeec
Q psy2167          16 VWATNIINRARKEALISSDH--VVQTLLYELSEHRRKLGSIISYDTVCVPLVYTQQNFLDAPT   76 (79)
Q Consensus        16 ~Wa~~ll~~ar~eg~i~~~~--~~~~l~~ei~~FR~~l~~L~~yDWVpIPLvY~QvV~lAV~~   76 (79)
                      -|.-.-++.-|++|.+++-.  .+++.++.|.+--++|+.+-+   .|||++|+=.+.=.||.
T Consensus       164 ~wmGe~l~~q~r~g~l~~~~~~sl~~~L~~~s~vlggCERI~~---TPiPfAYtl~lhrtvyl  223 (306)
T COG3781         164 LWMGEWLQQQRRNGQLDAIQFTSLDRRLNSISAVLGGCERIAY---TPIPFAYTLHLHRTVYL  223 (306)
T ss_pred             HHHHHHHHHHHhcCCccHHHHHHHHHHHHHHHHHHHhHHHHhc---CCccHHHHHHHHHHHHH
Confidence            48888999999999987542  456677777788888888765   89999998544433433


No 4  
>PF08828 DSX_dimer:  Doublesex dimerisation domain;  InterPro: IPR014932 Doublesex (DSX) is a transcription factor that regulates somatic sexual differences in Drosophila. The structure has revealed a novel dimeric arrangement of ubiquitin-associated folds that has not previously been identified in a transcription factor []. ; PDB: 1ZV1_B 2JZ0_A 2JZ1_B.
Probab=71.55  E-value=2.3  Score=25.63  Aligned_cols=26  Identities=35%  Similarity=0.404  Sum_probs=20.9

Q ss_pred             HHHHHHHHHhcccccCcceecchhhe
Q psy2167          42 YELSEHRRKLGSIISYDTVCVPLVYT   67 (79)
Q Consensus        42 ~ei~~FR~~l~~L~~yDWVpIPLvY~   67 (79)
                      +++.++=++|-.-+.|-|--+||+|.
T Consensus         4 e~Ll~~cqkLlEkf~YpWEmmpLmyV   29 (62)
T PF08828_consen    4 EELLERCQKLLEKFRYPWEMMPLMYV   29 (62)
T ss_dssp             HHHHHHHHHHHHHTT--GGGHHHHHH
T ss_pred             HHHHHHHHHHHHHhCCCHHHHHHHHH
Confidence            67788888888899999999999994


No 5  
>PF02284 COX5A:  Cytochrome c oxidase subunit Va;  InterPro: IPR003204 Cytochrome c oxidase (1.9.3.1 from EC) is an oligomeric enzymatic complex which is a component of the respiratory chain complex and is involved in the transfer of electrons from cytochrome c to oxygen []. In eukaryotes this enzyme complex is located in the mitochondrial inner membrane; in aerobic prokaryotes it is found in the plasma membrane.  In eukaryotes, in addition to the three large subunits, I, II and III, that form the catalytic centre of the enzyme complex, there are a variable number of small polypeptidic subunits. One of these subunits is known as Va.; GO: 0004129 cytochrome-c oxidase activity; PDB: 2DYR_R 3AG1_E 3ABL_E 1V54_R 2EIJ_R 1OCR_E 2DYS_E 2EIM_E 2OCC_E 3ASN_R ....
Probab=65.04  E-value=7  Score=25.87  Aligned_cols=30  Identities=27%  Similarity=0.355  Sum_probs=20.9

Q ss_pred             HHHHHHHH-------HHHHHhcccccCcceecchhhe
Q psy2167          38 QTLLYELS-------EHRRKLGSIISYDTVCVPLVYT   67 (79)
Q Consensus        38 ~~l~~ei~-------~FR~~l~~L~~yDWVpIPLvY~   67 (79)
                      .+-.+..+       +.|++|.+|+.||.||=|=+..
T Consensus        13 ary~~~F~~~~iD~we~rrglN~l~~~DlVP~P~ii~   49 (108)
T PF02284_consen   13 ARYEKYFNRPDIDGWELRRGLNNLFGYDLVPEPKIIE   49 (108)
T ss_dssp             HHHHHHHH-TT--HHHHHHHHHHHTTSSB---HHHHH
T ss_pred             HHHHHHhCCccccHHHHHHHHHHHhccccCCChHHHH
Confidence            44455555       8999999999999999987654


No 6  
>PF13624 SurA_N_3:  SurA N-terminal domain; PDB: 3NRK_A.
Probab=63.18  E-value=15  Score=23.68  Aligned_cols=32  Identities=16%  Similarity=0.265  Sum_probs=21.0

Q ss_pred             HHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHH
Q psy2167          19 TNIINRARKEALISSDHVVQTLLYELSEHRRK   50 (79)
Q Consensus        19 ~~ll~~ar~eg~i~~~~~~~~l~~ei~~FR~~   50 (79)
                      --+..+|++.|.--||..++..+++++.|..+
T Consensus        87 ~ll~q~A~~~gi~vsd~ev~~~i~~~~~f~~~  118 (154)
T PF13624_consen   87 KLLLQEAKKLGISVSDAEVDDAIKQIPAFQEN  118 (154)
T ss_dssp             HHHHHHHHHTT----HHHHHHHHHH--HHHHH
T ss_pred             HHHHHHHHHcCCCCCHHHHHHHHHHHHHHHHC
Confidence            34678899999988999999999998888654


No 7  
>PF11848 DUF3368:  Domain of unknown function (DUF3368);  InterPro: IPR021799  This domain is functionally uncharacterised. This domain is found in bacteria and archaea. This presumed domain is about 50 amino acids in length. 
Probab=62.73  E-value=14  Score=20.40  Aligned_cols=24  Identities=38%  Similarity=0.452  Sum_probs=17.4

Q ss_pred             HHHHHHHHHhcCCCCHHHHHHHHHHHH
Q psy2167          19 TNIINRARKEALISSDHVVQTLLYELS   45 (79)
Q Consensus        19 ~~ll~~ar~eg~i~~~~~~~~l~~ei~   45 (79)
                      +-++.+|.++|.|+   ....+++++.
T Consensus         6 lGiL~~Ak~~GlI~---~~~~~l~~l~   29 (48)
T PF11848_consen    6 LGILLLAKRRGLIS---EVKPLLDRLQ   29 (48)
T ss_pred             HHHHHHHHHcCChh---hHHHHHHHHH
Confidence            35788899999998   3556666663


No 8  
>PF11464 Rbsn:  Rabenosyn Rab binding domain;  InterPro: IPR021565  Rabenosyn-5 (Rbsn) is a multivalent effector with interacts with the Rab family.Rsbn contains distinct Rab4 and Rab5 binding sites within residues 264-500 and 627-784 respectively []. Rab proteins are GTPases involved in the regulation of all stages of membrane trafficking []. ; PDB: 1Z0K_B 1YZM_A 1Z0J_B.
Probab=58.77  E-value=11  Score=21.03  Aligned_cols=28  Identities=18%  Similarity=0.191  Sum_probs=18.4

Q ss_pred             HHHHHHHHHHhcCCCCHHHHHHHHHHHH
Q psy2167          18 ATNIINRARKEALISSDHVVQTLLYELS   45 (79)
Q Consensus        18 a~~ll~~ar~eg~i~~~~~~~~l~~ei~   45 (79)
                      .-+.+..|++.|+++.-..+..-+.||.
T Consensus         8 I~~~I~qAk~~~r~dEV~~L~~NL~EL~   35 (42)
T PF11464_consen    8 IESYIKQAKAARRFDEVATLEENLRELQ   35 (42)
T ss_dssp             HHHHHHHHHHTT-HHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHhcCcHHHHHHHHHHHHHH
Confidence            3467889999999886555555555543


No 9  
>PF14394 DUF4423:  Domain of unknown function (DUF4423)
Probab=48.42  E-value=21  Score=24.63  Aligned_cols=23  Identities=26%  Similarity=0.314  Sum_probs=19.7

Q ss_pred             CHHHHHHHHHHHHHHHHHhcccc
Q psy2167          33 SDHVVQTLLYELSEHRRKLGSII   55 (79)
Q Consensus        33 ~~~~~~~l~~ei~~FR~~l~~L~   55 (79)
                      |+..+.++.+||.+||+++..+-
T Consensus       130 s~~~~~ki~~~i~~fRk~i~~i~  152 (171)
T PF14394_consen  130 SREDYEKIKKEIREFRKKIIAIA  152 (171)
T ss_pred             CHHHHHHHHHHHHHHHHHHHHHH
Confidence            45689999999999999987654


No 10 
>PF09491 RE_AlwI:  AlwI restriction endonuclease;  InterPro: IPR018573 There are four classes of restriction endonucleases: types I, II,III and IV. All types of enzymes recognise specific short DNA sequences and carry out the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. They differ in their recognition sequence, subunit composition, cleavage position, and cofactor requirements [, ], as summarised below:   Type I enzymes (3.1.21.3 from EC) cleave at sites remote from recognition site; require both ATP and S-adenosyl-L-methionine to function; multifunctional protein with both restriction and methylase (2.1.1.72 from EC) activities. Type II enzymes (3.1.21.4 from EC) cleave within or at short specific distances from recognition site; most require magnesium; single function (restriction) enzymes independent of methylase. Type III enzymes (3.1.21.5 from EC) cleave at sites a short distance from recognition site; require ATP (but doesn't hydrolyse it); S-adenosyl-L-methionine stimulates reaction but is not required; exists as part of a complex with a modification methylase methylase (2.1.1.72 from EC). Type IV enzymes target methylated DNA.   Type II restriction endonucleases (3.1.21.4 from EC) are components of prokaryotic DNA restriction-modification mechanisms that protect the organism against invading foreign DNA. These site-specific deoxyribonucleases catalyse the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. Of the 3000 restriction endonucleases that have been characterised, most are homodimeric or tetrameric enzymes that cleave target DNA at sequence-specific sites close to the recognition site. For homodimeric enzymes, the recognition site is usually a palindromic sequence 4-8 bp in length. Most enzymes require magnesium ions as a cofactor for catalysis. Although they can vary in their mode of recognition, many restriction endonucleases share a similar structural core comprising four beta-strands and one alpha-helix, as well as a similar mechanism of cleavage, suggesting a common ancestral origin []. However, there is still considerable diversity amongst restriction endonucleases [, ]. The target site recognition process triggers large conformational changes of the enzyme and the target DNA, leading to the activation of the catalytic centres. Like other DNA binding proteins, restriction enzymes are capable of non-specific DNA binding as well, which is the prerequisite for efficient target site location by facilitated diffusion. Non-specific binding usually does not involve interactions with the bases but only with the DNA backbone [].   This entry includes AlwI (recognises GGATC), Bsp6I (recognises GC^NGC), BstNBI (recognises GASTC), PleI(recognises GAGTC) and MlyI (recognises GAGTC) restriction endonucleases. ; PDB: 2EWF_A 2P14_A.
Probab=46.01  E-value=38  Score=26.40  Aligned_cols=45  Identities=16%  Similarity=0.366  Sum_probs=31.7

Q ss_pred             CceeeHHHHHHHHHHHHHhcCCC------------CHHHHHHHHHHHHHHHHHhccc
Q psy2167          10 KYWMPLVWATNIINRARKEALIS------------SDHVVQTLLYELSEHRRKLGSI   54 (79)
Q Consensus        10 kywvPi~Wa~~ll~~ar~eg~i~------------~~~~~~~l~~ei~~FR~~l~~L   54 (79)
                      .+.-|+.+++.++.+-++++.+.            ++..++.++++|.+||..-...
T Consensus        56 ~~~~Pfr~lL~ll~~l~~~~~ls~~E~~~~v~~~~~~~~~~~lv~~I~~fR~~~~~~  112 (435)
T PF09491_consen   56 FYIYPFRFLLKLLNELELDSYLSKEEFAYFVQTISDDNDIEELVNKILDFRKERENK  112 (435)
T ss_dssp             -BB-HHHHHHHHHHHH---S-B-HHHHHHHTTT--BGGGHHHHHHHHHHHHHHHHHS
T ss_pred             eeeehHHHHHHHHHHHhcccccChhhheeEEEEccchhhHHHHHHHHHHHHHhhhhh
Confidence            68899999999999998777653            3456788999999999876543


No 11 
>cd01421 IMPCH Inosine monophosphate cyclohydrolase domain. This is the N-terminal domain in the purine biosynthesis pathway protein ATIC (purH). The bifunctional ATIC protein contains a C-terminal  ATIC formylase domain that formylates 5-aminoimidazole-4-carboxamide-ribonucleotide. The IMPCH domain then converts the formyl-5-aminoimidazole-4-carboxamide-ribonucleotide to inosine monophosphate. This is the final step in de novo purine production.
Probab=44.01  E-value=54  Score=23.43  Aligned_cols=33  Identities=27%  Similarity=0.352  Sum_probs=25.6

Q ss_pred             HHHHHHHHhcC----CCCHHHHHHHHHHHHH-------HHHHhc
Q psy2167          20 NIINRARKEAL----ISSDHVVQTLLYELSE-------HRRKLG   52 (79)
Q Consensus        20 ~ll~~ar~eg~----i~~~~~~~~l~~ei~~-------FR~~l~   52 (79)
                      +++|-|.|-..    +.++..+..+++||++       +|.+|+
T Consensus       126 smlRaAAKN~~~V~vv~dp~dY~~v~~~l~~~g~~~~~~R~~lA  169 (187)
T cd01421         126 SLLRAAAKNYKDVTVLVDPADYQKVLEELKSNGSISEETRRRLA  169 (187)
T ss_pred             HHHHHHHhcCCCeEEEcCHHHHHHHHHHHHhcCCCCHHHHHHHH
Confidence            56776666552    5688899999999986       888875


No 12 
>KOG1562|consensus
Probab=43.77  E-value=57  Score=25.49  Aligned_cols=56  Identities=13%  Similarity=0.038  Sum_probs=43.0

Q ss_pred             HHHHHHHHHHhcCC-----CCHHHHHHHHHHHHHHHHHhcccccCcceecchhheeeeeEee
Q psy2167          18 ATNIINRARKEALI-----SSDHVVQTLLYELSEHRRKLGSIISYDTVCVPLVYTQQNFLDA   74 (79)
Q Consensus        18 a~~ll~~ar~eg~i-----~~~~~~~~l~~ei~~FR~~l~~L~~yDWVpIPLvY~QvV~lAV   74 (79)
                      .+.++.+|++.|=+     ++-....+++.|+++|=.....+..|-|..+| -||=..+.-+
T Consensus       218 ~~~~v~~aLk~dgv~~~q~ec~wl~~~~i~e~r~~~~~~f~~t~ya~ttvP-Typsg~igf~  278 (337)
T KOG1562|consen  218 YFGLVLDALKGDGVVCTQGECMWLHLDYIKEGRSFCYVIFDLTAYAITTVP-TYPSGRIGFM  278 (337)
T ss_pred             HHHHHHHhhCCCcEEEEecceehHHHHHHHHHHHhHHHhcCccceeeecCC-CCccceEEEE
Confidence            35677788876632     34457789999999999999999999999999 6775554433


No 13 
>PF04677 CwfJ_C_1:  Protein similar to CwfJ C-terminus 1;  InterPro: IPR006768 This group of sequences contain a conserved C-terminal domain which is found in the Schizosaccharomyces pombe (Fission yeast) protein Cwf19 (Q09909 from SWISSPROT) and its homologues. Cwf19 is part of the Cdc5p complex involved in mRNA splicing []. This domain is found in association with IPR006767 from INTERPRO, which is generally C-terminal and adjacent to this domain. 
Probab=42.66  E-value=19  Score=23.64  Aligned_cols=17  Identities=12%  Similarity=0.319  Sum_probs=15.0

Q ss_pred             HHHHHHHHHHHHhcccc
Q psy2167          39 TLLYELSEHRRKLGSII   55 (79)
Q Consensus        39 ~l~~ei~~FR~~l~~L~   55 (79)
                      +.++||++||++|...+
T Consensus        65 ~~~~Ei~~f~~~L~~mf   81 (121)
T PF04677_consen   65 EVWEEIRNFQKSLRKMF   81 (121)
T ss_pred             HHHHHHHHHHHHHHHHH
Confidence            78999999999998764


No 14 
>KOG4077|consensus
Probab=42.35  E-value=13  Score=25.80  Aligned_cols=21  Identities=38%  Similarity=0.575  Sum_probs=18.5

Q ss_pred             HHHHHhcccccCcceecchhh
Q psy2167          46 EHRRKLGSIISYDTVCVPLVY   66 (79)
Q Consensus        46 ~FR~~l~~L~~yDWVpIPLvY   66 (79)
                      +.|+.|.+|..||-||=|=+-
T Consensus        67 EvrkglN~l~~yDlVP~pkvI   87 (149)
T KOG4077|consen   67 EVRKGLNNLFDYDLVPSPKVI   87 (149)
T ss_pred             HHHHHHHhhhccccCCChHHH
Confidence            679999999999999988654


No 15 
>PF13310 Virulence_RhuM:  Virulence protein RhuM family
Probab=42.24  E-value=57  Score=24.60  Aligned_cols=37  Identities=19%  Similarity=0.346  Sum_probs=28.2

Q ss_pred             HHHHHHHHHHHHhcCCCCH-----------HHHHHHHHHHHHHHHHhc
Q psy2167          16 VWATNIINRARKEALISSD-----------HVVQTLLYELSEHRRKLG   52 (79)
Q Consensus        16 ~Wa~~ll~~ar~eg~i~~~-----------~~~~~l~~ei~~FR~~l~   52 (79)
                      .||.+.|++-..+|..-+|           ..+..+++.|++.|.+=.
T Consensus        47 ~WAt~~Lkey~~KGf~~d~erLk~~~~~~~dyf~ell~rIr~IRaSEr   94 (260)
T PF13310_consen   47 QWATKVLKEYLIKGFVLDDERLKNGGVFGKDYFDELLERIRDIRASER   94 (260)
T ss_pred             HHHHHhHHHHHHhhhhhhHHHHHccCcccHHHHHHHHHHHHhhHHHHH
Confidence            5999999988877765443           356788899998887643


No 16 
>PF00681 Plectin:  Plectin repeat;  InterPro: IPR001101 Plectin may have a role in cross-linking intermediate filaments, in inter-linking intermediate filaments with microtubules and microfilaments and in anchoring intermediate filaments to the plasma and nuclear membranes. Plectin is recruited into hemidesmosomes, multiprotein complexes that facilitate adhesion of epithelia to the basement membrane, thereby providing linkage between the intracellular keratin filaments to the laminins of the extracellular matrix. Plectin binds to hemidesmosomes through association of its actin-binding domain with the first pair of fibronectin type III repeats and a small part of the connecting segment of the integrin-beta4 subunit, the latter (integrin-alpha6,beta4) acting as a receptor for the extracellular matrix component laminin-5. The plectin repeat is also seen in the cell adhesion junction plaque proteins, desmoplakin, envoplakin, and bullous pemphigoid antigen. The domains in plakins show considerable sequence homology. The N terminus consists of a plakin domain containing a number of subdomains with high alpha-helical content, while the central coiled-coil domain is composed of heptad repeats involved in the dimerisation of plakin, and the C terminus contains one or more homologous repeat sequences referred to plectin repeats []. This entry represents the plectin repeats found in the C terminus of plakin proteins.; GO: 0005856 cytoskeleton; PDB: 1LM7_A 1LM5_A.
Probab=40.53  E-value=33  Score=18.48  Aligned_cols=14  Identities=36%  Similarity=0.522  Sum_probs=10.8

Q ss_pred             HHHHHHhcCCCCHH
Q psy2167          22 INRARKEALISSDH   35 (79)
Q Consensus        22 l~~ar~eg~i~~~~   35 (79)
                      +.+|.++|.|+++.
T Consensus        24 v~~A~~~glId~~~   37 (45)
T PF00681_consen   24 VEEAIQRGLIDSDT   37 (45)
T ss_dssp             HHHHHHTTSS-HHH
T ss_pred             HHHHHHCCCcCHHH
Confidence            46899999999874


No 17 
>PF09202 Rio2_N:  Rio2, N-terminal;  InterPro: IPR015285 This N-terminal domain is found in RIO2 kinases, and is structurally homologous to the winged helix (wHTH) domain. It adopts a structure consisting of four alpha helices followed by two beta strands and a fifth alpha helix. The domain confers DNA binding properties to the protein, as per other winged helix domains []. ; GO: 0004674 protein serine/threonine kinase activity, 0005524 ATP binding, 0006468 protein phosphorylation; PDB: 1TQP_A 1ZAR_A 1TQI_A 1ZAO_A 1TQM_A.
Probab=39.28  E-value=28  Score=21.48  Aligned_cols=19  Identities=16%  Similarity=0.023  Sum_probs=11.6

Q ss_pred             HHHHhcccccCcceecchh
Q psy2167          47 HRRKLGSIISYDTVCVPLV   65 (79)
Q Consensus        47 FR~~l~~L~~yDWVpIPLv   65 (79)
                      .+.=-....+|.|||+++.
T Consensus        12 L~aiE~gmk~hE~VP~~~I   30 (82)
T PF09202_consen   12 LRAIEMGMKNHEWVPLELI   30 (82)
T ss_dssp             HHHHHTTTTT-SSEEHHHH
T ss_pred             HHHHHHcccCCccCCHHHH
Confidence            3333445778889888764


No 18 
>PF06424 PRP1_N:  PRP1 splicing factor, N-terminal;  InterPro: IPR010491 This domain is specific to the N-terminal part of the prp1 splicing factor, which is involved in mRNA splicing (and possibly also poly(A)+ RNA nuclear export and cell cycle progression). This domain is specific to the N terminus of the RNA splicing factor encoded by prp1 []. It is involved in mRNA splicing and possibly also poly(A)and RNA nuclear export and cell cycle progression.; GO: 0000398 nuclear mRNA splicing, via spliceosome, 0005634 nucleus
Probab=39.11  E-value=19  Score=24.48  Aligned_cols=27  Identities=11%  Similarity=0.261  Sum_probs=24.1

Q ss_pred             HHHHHHHHHHHHhcccccCcceecchh
Q psy2167          39 TLLYELSEHRRKLGSIISYDTVCVPLV   65 (79)
Q Consensus        39 ~l~~ei~~FR~~l~~L~~yDWVpIPLv   65 (79)
                      +|-+.+.+++.+|+.+..=+|-+||=|
T Consensus       101 kI~~QFaDLKR~La~VS~eeW~~IPE~  127 (133)
T PF06424_consen  101 KIQQQFADLKRSLATVSEEEWENIPEA  127 (133)
T ss_pred             hHHHHHHHHHHHHccCCHHHHhcCCcc
Confidence            577889999999999999999999943


No 19 
>KOG2477|consensus
Probab=37.98  E-value=21  Score=29.70  Aligned_cols=19  Identities=16%  Similarity=0.258  Sum_probs=16.2

Q ss_pred             HHHHHHHHHHHHHHhcccc
Q psy2167          37 VQTLLYELSEHRRKLGSII   55 (79)
Q Consensus        37 ~~~l~~ei~~FR~~l~~L~   55 (79)
                      =...+|||++||++|.+++
T Consensus       460 DEdvWDEIrnfrKcL~~Mf  478 (628)
T KOG2477|consen  460 DEDVWDEIRNFRKCLALMF  478 (628)
T ss_pred             chHHHHHHHHHHHHHHHHH
Confidence            3578999999999998765


No 20 
>PF02787 CPSase_L_D3:  Carbamoyl-phosphate synthetase large chain, oligomerisation domain;  InterPro: IPR005480 Carbamoyl phosphate synthase (CPSase) is a heterodimeric enzyme composed of a small and a large subunit (with the exception of CPSase III, see below). CPSase catalyses the synthesis of carbamoyl phosphate from biocarbonate, ATP and glutamine (6.3.5.5 from EC) or ammonia (6.3.4.16 from EC), and represents the first committed step in pyrimidine and arginine biosynthesis in prokaryotes and eukaryotes, and in the urea cycle in most terrestrial vertebrates [, ]. CPSase has three active sites, one in the small subunit and two in the large subunit. The small subunit contains the glutamine binding site and catalyses the hydrolysis of glutamine to glutamate and ammonia. The large subunit has two homologous carboxy phosphate domains, both of which have ATP-binding sites; however, the N-terminal carboxy phosphate domain catalyses the phosphorylation of biocarbonate, while the C-terminal domain catalyses the phosphorylation of the carbamate intermediate []. The carboxy phosphate domain found duplicated in the large subunit of CPSase is also present as a single copy in the biotin-dependent enzymes acetyl-CoA carboxylase (6.4.1.2 from EC) (ACC), propionyl-CoA carboxylase (6.4.1.3 from EC) (PCCase), pyruvate carboxylase (6.4.1.1 from EC) (PC) and urea carboxylase (6.3.4.6 from EC). Most prokaryotes carry one form of CPSase that participates in both arginine and pyrimidine biosynthesis, however certain bacteria can have separate forms. The large subunit in bacterial CPSase has four structural domains: the carboxy phosphate domain 1, the oligomerisation domain, the carbamoyl phosphate domain 2 and the allosteric domain []. CPSase heterodimers from Escherichia coli contain two molecular tunnels: an ammonia tunnel and a carbamate tunnel. These inter-domain tunnels connect the three distinct active sites, and function as conduits for the transport of unstable reaction intermediates (ammonia and carbamate) between successive active sites []. The catalytic mechanism of CPSase involves the diffusion of carbamate through the interior of the enzyme from the site of synthesis within the N-terminal domain of the large subunit to the site of phosphorylation within the C-terminal domain. Eukaryotes have two distinct forms of CPSase: a mitochondrial enzyme (CPSase I) that participates in both arginine biosynthesis and the urea cycle; and a cytosolic enzyme (CPSase II) involved in pyrimidine biosynthesis. CPSase II occurs as part of a multi-enzyme complex along with aspartate transcarbamoylase and dihydroorotase; this complex is referred to as the CAD protein []. The hepatic expression of CPSase is transcriptionally regulated by glucocorticoids and/or cAMP []. There is a third form of the enzyme, CPSase III, found in fish, which uses glutamine as a nitrogen source instead of ammonia []. CPSase III is closely related to CPSase I, and is composed of a single polypeptide that may have arisen from gene fusion of the glutaminase and synthetase domains [].  This entry represents the oligomerisation domain found in the large subunit of carbamoyl phosphate synthases as well as in certain other carboxy phsophate domain-containing enzymes.; GO: 0006807 nitrogen compound metabolic process; PDB: 1M6V_C 1CS0_C 1C30_E 1C3O_G 1BXR_A 1T36_E 1A9X_A 1KEE_G 1CE8_A 1JDB_H ....
Probab=37.67  E-value=65  Score=21.13  Aligned_cols=37  Identities=24%  Similarity=0.427  Sum_probs=25.1

Q ss_pred             HHHHHHHHHhcCCCCHHHHHHHH----HHHHHHHHHhcccccC
Q psy2167          19 TNIINRARKEALISSDHVVQTLL----YELSEHRRKLGSIISY   57 (79)
Q Consensus        19 ~~ll~~ar~eg~i~~~~~~~~l~----~ei~~FR~~l~~L~~y   57 (79)
                      -.++++|.+.|  -||.....++    +|+++.|.+.+..-.|
T Consensus        62 ~~~L~~aK~~G--FsD~~IA~l~~~~e~~vr~~R~~~~i~p~y  102 (123)
T PF02787_consen   62 PELLRKAKRLG--FSDRQIARLWGVSEEEVRELRKEHGIVPVY  102 (123)
T ss_dssp             HHHHHHHHHTT----HHHHHHHHTS-HHHHHHHHHHHT---EE
T ss_pred             HHHHHHHHHcC--CCHHHHHhccCCCHHHHHHHHHHcCCceee
Confidence            45889999999  5676665554    5899999998876555


No 21 
>cd00923 Cyt_c_Oxidase_Va Cytochrome c oxidase subunit Va. Cytochrome c oxidase (CcO), the terminal oxidase in the respiratory chains of eukaryotes and most bacteria, is a multi-chain transmembrane protein located in the inner membrane of mitochondria and the cell membrane of prokaryotes. It catalyzes the reduction of O2 and simultaneously pumps protons across the membrane. The number of subunits varies from three to five in bacteria and up to 13 in mammalian mitochondria. Subunits I, II, and III of mammalian CcO are encoded within the mitochondrial genome and the remaining 10 subunits are encoded within the nuclear genome. Found only in eukaryotes, subunit Va is one of three mammalian subunits that lacks a transmembrane region. Subunit Va is located on the matrix side of the membrane and binds thyroid hormone T2, releasing allosteric inhibition caused by the binding of ATP to subunit IV and allowing high turnover at elevated intramitochondrial ATP/ADP ratios.
Probab=35.95  E-value=19  Score=23.69  Aligned_cols=23  Identities=39%  Similarity=0.492  Sum_probs=19.5

Q ss_pred             HHHHHHhcccccCcceecchhhe
Q psy2167          45 SEHRRKLGSIISYDTVCVPLVYT   67 (79)
Q Consensus        45 ~~FR~~l~~L~~yDWVpIPLvY~   67 (79)
                      =+.|++|.+|+.||-||=|=+-.
T Consensus        24 we~rr~mN~l~~~DlVP~P~ii~   46 (103)
T cd00923          24 WELRRGLNNLFGYDLVPEPKVIE   46 (103)
T ss_pred             HHHHHHHHHHhccccCCCcHHHH
Confidence            36799999999999999986543


No 22 
>PF11892 DUF3412:  Domain of unknown function (DUF3412);  InterPro: IPR021826  This presumed domain is functionally uncharacterised. This domain is found in bacteria. This domain is about 120 amino acids in length. This domain is found associated with PF03641 from PFAM. ; PDB: 3BQ9_A 2PMB_D 3GH1_D.
Probab=35.76  E-value=32  Score=23.27  Aligned_cols=20  Identities=25%  Similarity=0.381  Sum_probs=15.5

Q ss_pred             HHHHHHHHHHhcccccCcce
Q psy2167          41 LYELSEHRRKLGSIISYDTV   60 (79)
Q Consensus        41 ~~ei~~FR~~l~~L~~yDWV   60 (79)
                      +++++++|...+.-+.|.|-
T Consensus         2 m~~V~~~R~~~~Day~fNW~   21 (123)
T PF11892_consen    2 MEEVKEYRRETGDAYYFNWS   21 (123)
T ss_dssp             HHHHHHHHHHTT--SSS-TS
T ss_pred             chHHHHHHHhcCCceeeeee
Confidence            68999999999999999993


No 23 
>PRK12275 hypothetical protein; Reviewed
Probab=33.45  E-value=80  Score=19.90  Aligned_cols=35  Identities=20%  Similarity=0.320  Sum_probs=24.6

Q ss_pred             HHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHhcccc
Q psy2167          20 NIINRARKEALISSDHVVQTLLYELSEHRRKLGSII   55 (79)
Q Consensus        20 ~ll~~ar~eg~i~~~~~~~~l~~ei~~FR~~l~~L~   55 (79)
                      ..++-|++.|+|+.+ .+..+.+++.+...-++.+.
T Consensus        78 ~~L~la~~~~~i~~~-~~~~l~~~~~ei~kml~~~i  112 (116)
T PRK12275         78 TQLYIAKELGYITKE-QYESILQEYDEIAKMLNGLI  112 (116)
T ss_pred             HHHHHHHHcCCCCHH-HHHHHHHHHHHHHHHHHHHH
Confidence            456778889998754 56777777777766665543


No 24 
>COG3462 Predicted membrane protein [Function unknown]
Probab=32.88  E-value=54  Score=21.96  Aligned_cols=10  Identities=50%  Similarity=1.198  Sum_probs=5.5

Q ss_pred             eeHHHHHHHH
Q psy2167          13 MPLVWATNII   22 (79)
Q Consensus        13 vPi~Wa~~ll   22 (79)
                      +||-|+.+++
T Consensus        53 mpI~~~vvli   62 (117)
T COG3462          53 MPIFWAVVLI   62 (117)
T ss_pred             HHHHHHHHHH
Confidence            4566665543


No 25 
>PF07017 PagP:  Antimicrobial peptide resistance and lipid A acylation protein PagP;  InterPro: IPR009746 This family consists of several bacterial antimicrobial peptide resistance and lipid A acylation (PagP) proteins. The bacterial outer membrane enzyme PagP transfers a palmitate chain from a phospholipid to lipid A. In a number of pathogenic Gram-negative bacteria, PagP confers resistance to certain cationic antimicrobial peptides produced during the host innate immune response.; PDB: 1MM5_A 1MM4_A 1THQ_A 3GP6_A.
Probab=32.82  E-value=15  Score=25.53  Aligned_cols=13  Identities=15%  Similarity=0.383  Sum_probs=9.7

Q ss_pred             cCcceecchhhee
Q psy2167          56 SYDTVCVPLVYTQ   68 (79)
Q Consensus        56 ~yDWVpIPLvY~Q   68 (79)
                      .|++||+|++.|=
T Consensus       105 d~~yiP~P~~LPl  117 (147)
T PF07017_consen  105 DYHYIPFPAILPL  117 (147)
T ss_dssp             CCTT-EEEEEEEE
T ss_pred             cccccccceeeee
Confidence            5789999998773


No 26 
>PF09111 SLIDE:  SLIDE;  InterPro: IPR015195 The SLIDE domain adopts a secondary structure comprising a main core of three alpha-helices. It has a role in DNA binding, contacting DNA target sites similar to c-Myb (IPR014778 from INTERPRO) repeats or homeodomains []. ; GO: 0003676 nucleic acid binding, 0005524 ATP binding, 0016818 hydrolase activity, acting on acid anhydrides, in phosphorus-containing anhydrides, 0006338 chromatin remodeling, 0005634 nucleus; PDB: 2NOG_A 2Y9Y_A 2Y9Z_A 1OFC_X.
Probab=31.49  E-value=74  Score=20.91  Aligned_cols=33  Identities=12%  Similarity=0.199  Sum_probs=23.7

Q ss_pred             HHHHHHHhcCCCCHHHHHHHHHHHHHHHHHhcccccCcc
Q psy2167          21 IINRARKEALISSDHVVQTLLYELSEHRRKLGSIISYDT   59 (79)
Q Consensus        21 ll~~ar~eg~i~~~~~~~~l~~ei~~FR~~l~~L~~yDW   59 (79)
                      |+....+-|+ .+|..+..+-++|++     .-++.|||
T Consensus        60 Ll~~~~~~G~-~~~~~~e~Ik~~Ir~-----~p~FrFDw   92 (118)
T PF09111_consen   60 LLCMLYKYGY-DAEGNWEKIKQEIRE-----SPLFRFDW   92 (118)
T ss_dssp             HHHHHHHHTT-TSTTHHHHHHHHHHH------CGGCT-H
T ss_pred             HHHHHHHhCC-CCCchHHHHHHHHHh-----CCCcccch
Confidence            5667778998 445588888888875     56777887


No 27 
>COG1422 Predicted membrane protein [Function unknown]
Probab=30.70  E-value=19  Score=26.23  Aligned_cols=49  Identities=10%  Similarity=0.102  Sum_probs=33.7

Q ss_pred             HHHHHHhcCCCCHHHHHHHHHHHHHHHHHhcccccCcceecchhheeeeeEeee
Q psy2167          22 INRARKEALISSDHVVQTLLYELSEHRRKLGSIISYDTVCVPLVYTQQNFLDAP   75 (79)
Q Consensus        22 l~~ar~eg~i~~~~~~~~l~~ei~~FR~~l~~L~~yDWVpIPLvY~QvV~lAV~   75 (79)
                      .++|+++|   ++..++++-+|=++.+..-..+.+-..-  |.+|+=++++-++
T Consensus        88 ~~eA~~~~---d~~~lkkLq~~qmem~~~Q~elmk~qfk--PM~~~~v~tI~~F  136 (201)
T COG1422          88 FREAQESG---DMKKLKKLQEKQMEMMDDQRELMKMQFK--PMLYISVLTIPFF  136 (201)
T ss_pred             HHHHHHhC---CHHHHHHHHHHHHHHHHHHHHHHHHhhh--hHHHHHHHHHHHH
Confidence            35677777   6778888888777777776666665555  7777766665443


No 28 
>TIGR02147 Fsuc_second hypothetical protein, TIGR02147. This family consists of the 40 members of a paralogous protein family in the rumen anaerobe Fibrobacter succinogenes S85. Member proteins are about 270 residues long and appear to lack signal sequences and transmembrane helices. The only perfectly conserved residue is a glycine in an otherwise poorly conserved region, suggesting members are not enzymes. The family is not characterized.
Probab=30.29  E-value=57  Score=24.30  Aligned_cols=22  Identities=18%  Similarity=0.410  Sum_probs=19.0

Q ss_pred             CHHHHHHHHHHHHHHHHHhccc
Q psy2167          33 SDHVVQTLLYELSEHRRKLGSI   54 (79)
Q Consensus        33 ~~~~~~~l~~ei~~FR~~l~~L   54 (79)
                      ++..+.++.+||.+||+++..+
T Consensus       228 ~~~~~~~i~~~i~~fRk~i~~i  249 (271)
T TIGR02147       228 SEEAYKEIVKKIQEFRKEVLAI  249 (271)
T ss_pred             CHHHHHHHHHHHHHHHHHHHHH
Confidence            4568999999999999998754


No 29 
>PF10303 DUF2408:  Protein of unknown function (DUF2408);  InterPro: IPR018810  This entry represents a family of proteins conserved in fungi whose function is unknown. 
Probab=29.93  E-value=1.2e+02  Score=20.22  Aligned_cols=26  Identities=19%  Similarity=0.319  Sum_probs=21.6

Q ss_pred             CCCCHHHHHHHHHHHHHHHHHhcccc
Q psy2167          30 LISSDHVVQTLLYELSEHRRKLGSII   55 (79)
Q Consensus        30 ~i~~~~~~~~l~~ei~~FR~~l~~L~   55 (79)
                      -..-+..++.+.+.+.+.|++|.+|.
T Consensus       100 ~~~~~~~l~~iY~~L~~ik~~LE~L~  125 (134)
T PF10303_consen  100 EIEVDPSLQPIYDQLIDIKNTLENLL  125 (134)
T ss_pred             cccccHHHHHHHHHHHHHHHHHHhhh
Confidence            44556689999999999999999874


No 30 
>PF14767 RPA_interact_M:  Replication protein A interacting middle
Probab=29.59  E-value=56  Score=20.27  Aligned_cols=24  Identities=21%  Similarity=0.227  Sum_probs=18.6

Q ss_pred             HHHHHHHHHHHH---Hhc-ccccCccee
Q psy2167          38 QTLLYELSEHRR---KLG-SIISYDTVC   61 (79)
Q Consensus        38 ~~l~~ei~~FR~---~l~-~L~~yDWVp   61 (79)
                      .-+-+|++++|+   +.. .|++|||.+
T Consensus         6 eVm~eE~~~L~~~~~~~~d~lw~~~~~~   33 (83)
T PF14767_consen    6 EVMEEEWKALQSSSSGLPDSLWEYDGLP   33 (83)
T ss_pred             HHHHHHHHHHHhcCCCchhhhhccccch
Confidence            345679999999   444 899999975


No 31 
>KOG3326|consensus
Probab=29.38  E-value=55  Score=22.92  Aligned_cols=24  Identities=13%  Similarity=0.403  Sum_probs=19.3

Q ss_pred             HHHHHHHHHhcCCCCHHHHHHHHH
Q psy2167          19 TNIINRARKEALISSDHVVQTLLY   42 (79)
Q Consensus        19 ~~ll~~ar~eg~i~~~~~~~~l~~   42 (79)
                      ..|+++.|++|..+.|..+....+
T Consensus        61 aRLlYqSrKRGiLEnDLLLs~Fa~   84 (154)
T KOG3326|consen   61 ARLLYQSRKRGILENDLLLSNFAA   84 (154)
T ss_pred             HHHHHHHhhcchhHHHHHHHHHHH
Confidence            579999999999999976655443


No 32 
>PF08671 SinI:  Anti-repressor SinI;  InterPro: IPR010981 The SinR repressor is part of a group of Sin (sporulation inhibition) proteins in Bacillus subtilis that regulate the commitment to sporulation in response to extreme adversity []. SinR is a tetrameric repressor protein that binds to the promoters of genes essential for entry into sporulation and prevents their transcription. This repression is overcome through the activity of SinI, which disrupts the SinR tetramer through the formation of a SinI-SinR heterodimer, thereby allowing sporulation to proceed. The SinR structure consists of two domains: a dimerisation domain stabilised by a hydrophobic core, and a DNA-binding domain that is identical to domains of the bacteriophage 434 CI and Cro proteins that regulate prophage induction. The dimerisation domain is a four-helical bundle formed from two helices from the C-terminal residues of SinR and two helices from the central residues of SinI. These regions in SinR and SinI are similar in both structure and sequence. The interaction of SinR monomers to form tetramers is weaker than between SinR and SinI, since SinI can effectively disrupt SinR tetramers. This entry represents the dimerisation domain in both SinI and SinR proteins.; GO: 0005488 binding, 0006355 regulation of transcription, DNA-dependent; PDB: 1B0N_A 2YAL_A.
Probab=29.35  E-value=49  Score=16.96  Aligned_cols=13  Identities=23%  Similarity=0.465  Sum_probs=9.7

Q ss_pred             HHHHHHHHHhcCC
Q psy2167          19 TNIINRARKEALI   31 (79)
Q Consensus        19 ~~ll~~ar~eg~i   31 (79)
                      ..|+.+|++.|.=
T Consensus         6 ~~Li~eA~~~Gls   18 (30)
T PF08671_consen    6 VELIKEAKESGLS   18 (30)
T ss_dssp             HHHHHHHHHTT--
T ss_pred             HHHHHHHHHcCCC
Confidence            4789999999953


No 33 
>PF10845 DUF2576:  Protein of unknown function (DUF2576);  InterPro: IPR022556  The function of this viral family of proteins is unknown. The entry contains Orf5 from Autographa californica nuclear polyhedrosis virus (AcMNPV).
Probab=28.42  E-value=41  Score=19.17  Aligned_cols=23  Identities=26%  Similarity=0.283  Sum_probs=19.4

Q ss_pred             HHHHHHHHHHHHHHHHHhccccc
Q psy2167          34 DHVVQTLLYELSEHRRKLGSIIS   56 (79)
Q Consensus        34 ~~~~~~l~~ei~~FR~~l~~L~~   56 (79)
                      |+.-.++-.|+++.|.++..||.
T Consensus        10 dydreqlrrelnsLR~~vhelct   32 (48)
T PF10845_consen   10 DYDREQLRRELNSLRRSVHELCT   32 (48)
T ss_pred             ccCHHHHHHHHHHHHHHHHHHHH
Confidence            55667899999999999988864


No 34 
>KOG4095|consensus
Probab=26.89  E-value=74  Score=22.46  Aligned_cols=27  Identities=22%  Similarity=0.206  Sum_probs=19.7

Q ss_pred             HHHHHHHHHHHHHHH--------hcccccCcceec
Q psy2167          36 VVQTLLYELSEHRRK--------LGSIISYDTVCV   62 (79)
Q Consensus        36 ~~~~l~~ei~~FR~~--------l~~L~~yDWVpI   62 (79)
                      ..+++++.|.+.|.-        =-.|.-|-||||
T Consensus        17 DIKkVMaaiEKVRrWEKKwVtvgDTslRIyKWVPV   51 (165)
T KOG4095|consen   17 DIKKVMAAIEKVRRWEKKWVTVGDTSLRIYKWVPV   51 (165)
T ss_pred             HHHHHHHHHHHHHHHhhheEeecccceEEEEeeec
Confidence            477777777777753        246788999997


No 35 
>PF10474 DUF2451:  Protein of unknown function C-terminus (DUF2451);  InterPro: IPR019514  This protein is found in eukaryotes but its function is not known. The N-terminal domain of some members is PF10475 from PFAM (DUF2450). 
Probab=26.18  E-value=1e+02  Score=22.27  Aligned_cols=44  Identities=23%  Similarity=0.309  Sum_probs=30.8

Q ss_pred             eeHHHHHHHHHHHH---HhcCCCCHHHHHHHHHHHHHHHHHhccccc
Q psy2167          13 MPLVWATNIINRAR---KEALISSDHVVQTLLYELSEHRRKLGSIIS   56 (79)
Q Consensus        13 vPi~Wa~~ll~~ar---~eg~i~~~~~~~~l~~ei~~FR~~l~~L~~   56 (79)
                      +++....+.+...+   +|--......+..+++|+..|+.+|..+..
T Consensus        67 l~~~~i~~~Ia~vKWdvkev~~qhs~YVd~l~~~~~~f~~rL~~i~~  113 (234)
T PF10474_consen   67 LDLEQILNSIANVKWDVKEVMSQHSSYVDQLVQEFQQFSERLDEISK  113 (234)
T ss_pred             cCHHHHHHHHHHcCCCCCCCCCccCHHHHHHHHHHHHHHHHHHHHHh
Confidence            45566666665554   333344456889999999999999977763


No 36 
>COG3969 Predicted phosphoadenosine phosphosulfate sulfotransferase [General function prediction only]
Probab=25.35  E-value=1.2e+02  Score=24.24  Aligned_cols=44  Identities=23%  Similarity=0.280  Sum_probs=31.0

Q ss_pred             HHHHHhcC-------CCCHHHHHHHHHHHHHHHHHhcccccC-cceecchhh
Q psy2167          23 NRARKEAL-------ISSDHVVQTLLYELSEHRRKLGSIISY-DTVCVPLVY   66 (79)
Q Consensus        23 ~~ar~eg~-------i~~~~~~~~l~~ei~~FR~~l~~L~~y-DWVpIPLvY   66 (79)
                      .-||+-|+       |+=+..+..-.|-+.+-|++....... =||++||.-
T Consensus        49 ~~ar~~~~~~i~VlfiD~E~QYs~TidyV~em~~~~~dv~~~~yWvcLPl~t  100 (407)
T COG3969          49 EVARENGRDKISVLFIDWEAQYSCTIDYVQEMRESYHDVIETFYWVCLPLTT  100 (407)
T ss_pred             HHHHHhCCCceEEEEEcchhhhhhHHHHHHHHHhcccCccccceEEEeehhc
Confidence            44555565       444556777788888888876666664 899999964


No 37 
>PF07026 DUF1317:  Protein of unknown function (DUF1317);  InterPro: IPR009750 This entry is represented by Bacteriophage lambda, Xis. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.
Probab=25.12  E-value=94  Score=18.55  Aligned_cols=23  Identities=26%  Similarity=0.247  Sum_probs=19.1

Q ss_pred             CCCHHHHHHHHHHHHHHHHHhcc
Q psy2167          31 ISSDHVVQTLLYELSEHRRKLGS   53 (79)
Q Consensus        31 i~~~~~~~~l~~ei~~FR~~l~~   53 (79)
                      ++++.-.+++-+|+++++.++..
T Consensus        32 i~NPlkAqR~AE~~n~~~~~l~~   54 (60)
T PF07026_consen   32 ITNPLKAQRLAEELNSKQVNLCD   54 (60)
T ss_pred             EcCHHHHHHHHHHHHhhHhhhhh
Confidence            46677789999999999998864


No 38 
>PF05635 23S_rRNA_IVP:  23S rRNA-intervening sequence protein;  InterPro: IPR008815 This family consists of bacterial proteins encoded within an intervening sequence present within some 23S rRNA genes[]. The function of these proteins is not known, but a structural study indicates that each momonmer folds into an antiparallel four-helix bundle, while the overall protein is a homopentamer with a toroid-shaped structure containing a tapered central channel [].; PDB: 2GSC_E 2RLD_D.
Probab=25.04  E-value=1.6e+02  Score=18.23  Aligned_cols=33  Identities=27%  Similarity=0.266  Sum_probs=22.6

Q ss_pred             HHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHhcc
Q psy2167          20 NIINRARKEALISSDHVVQTLLYELSEHRRKLGS   53 (79)
Q Consensus        20 ~ll~~ar~eg~i~~~~~~~~l~~ei~~FR~~l~~   53 (79)
                      ..+.-|++.|.|..+ .+..+.+|+.+....+..
T Consensus        77 ~~L~~a~~~~~i~~~-~~~~l~~~~~ei~~~L~~  109 (110)
T PF05635_consen   77 YWLELARDLGYISEE-EYEELKKELEEISKMLNG  109 (110)
T ss_dssp             HHHHHHHHTTSS-HH-HHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHcCCCCHH-HHHHHHHHHHHHHHHHcC
Confidence            456778888888854 567788877777766543


No 39 
>TIGR02436 conserved hypothetical protein TIGR02436. This family consists of a few small, well-conserved proteins found so far in Bacteroides thetaiotaomicron VPI-5482, Nostoc sp. PCC 7120, Clostridium tetani E88, Chlorobium tepidum TLS, and Prevotella ruminicola 23. The function is unknown.
Probab=24.97  E-value=1.4e+02  Score=18.95  Aligned_cols=35  Identities=14%  Similarity=0.116  Sum_probs=25.6

Q ss_pred             HHHHHHHhcCCCCHHHHHHHHHHHHHHHHHhccccc
Q psy2167          21 IINRARKEALISSDHVVQTLLYELSEHRRKLGSIIS   56 (79)
Q Consensus        21 ll~~ar~eg~i~~~~~~~~l~~ei~~FR~~l~~L~~   56 (79)
                      -+.-|++.|+++.+ .+..+.+|+.+..+-|..+.+
T Consensus        72 ~L~la~~~~~l~~~-~~~~l~~e~~ei~~~L~~li~  106 (111)
T TIGR02436        72 WIRLLIQTELIPEN-RYESIKGDCLELLKILAATVV  106 (111)
T ss_pred             HHHHHHHcCCCCHH-HHHHHHHHHHHHHHHHHHHHH
Confidence            45668889999755 578888888887777765543


No 40 
>PF04437 RINT1_TIP1:  RINT-1 / TIP-1 family;  InterPro: IPR007528 This family includes RINT-1, a Rad50 interacting protein which participates in radiation induced checkpoint control [], that interacts with Rad50 only during late S and G2/M phases. RINT1 also functions in membrane trafficking from the endoplasmic reticulum(ER) to the Golgi complex in interphase cells [, , ]. In addition to this, the TIP-1 protein, which is involved in the retrograde transport from the Golgi to the ER []. They share a similar domain organisation with an N-terminal leucine heptad repeat rich coiled coil and an ~500-residue C-terminal RINT1/TIP20 domain, which might be a protein-protein interaction module necessary for the formation of functional complexes.; PDB: 3FHN_A.
Probab=24.51  E-value=2.5e+02  Score=22.09  Aligned_cols=50  Identities=16%  Similarity=0.286  Sum_probs=30.7

Q ss_pred             HHHHHHHHHHHHHh------cCCCCHHHHHHHHHHHHHHHHHhcccccCc--ceecch
Q psy2167          15 LVWATNIINRARKE------ALISSDHVVQTLLYELSEHRRKLGSIISYD--TVCVPL   64 (79)
Q Consensus        15 i~Wa~~ll~~ar~e------g~i~~~~~~~~l~~ei~~FR~~l~~L~~yD--WVpIPL   64 (79)
                      ..|+..|+.-++++      ....++..+..+++|+..|-+.+....+|+  |.|+++
T Consensus        68 ~~fi~~ll~~~~~Kl~~~l~~~~~~~~~l~HlI~e~~~FD~~L~~~~~y~~d~~~~~~  125 (494)
T PF04437_consen   68 EEFIRGLLPPVREKLRSDLPELLDDPSLLSHLIDEILSFDKELRSLYGYPGDWQGSTL  125 (494)
T ss_dssp             HHHHHHHHHHHHHHHHHHH--TTS-HHHHHHHHHHHHHHHHHHHHTS---S------C
T ss_pred             HHHHHHHHHHHHHHHHHHHHhhccChhHHHHHHHHHHHHHHHHHHHcCCCCccchhHH
Confidence            45666666655532      245677789999999999999999999886  344443


No 41 
>PF09779 Ima1_N:  Ima1 N-terminal domain;  InterPro: IPR018617  Members of this family of uncharacterised novel proteins have no known function. 
Probab=23.01  E-value=68  Score=21.20  Aligned_cols=22  Identities=32%  Similarity=0.237  Sum_probs=18.3

Q ss_pred             CHHHHHHHHHHHHHHHHHhccc
Q psy2167          33 SDHVVQTLLYELSEHRRKLGSI   54 (79)
Q Consensus        33 ~~~~~~~l~~ei~~FR~~l~~L   54 (79)
                      ++..+....+|+.+||.+|...
T Consensus        98 e~p~y~~~~~e~~~Yr~~LE~r  119 (131)
T PF09779_consen   98 EDPEYANYEEELPEYRRSLEQR  119 (131)
T ss_pred             CCccHHHHHHHHHHHHHHHHHH
Confidence            3457889999999999999864


No 42 
>KOG2351|consensus
Probab=22.56  E-value=93  Score=21.31  Aligned_cols=21  Identities=33%  Similarity=0.455  Sum_probs=16.1

Q ss_pred             hcCCCCHHHHHHHHHHHHHHHH
Q psy2167          28 EALISSDHVVQTLLYELSEHRR   49 (79)
Q Consensus        28 eg~i~~~~~~~~l~~ei~~FR~   49 (79)
                      +++|++| .++.++++|..-|+
T Consensus       112 ~nkidD~-~le~iL~dls~lr~  132 (134)
T KOG2351|consen  112 ENKIDDD-ELEQILKDLSTLRT  132 (134)
T ss_pred             ccccCHH-HHHHHHHHHHHHHh
Confidence            4566644 89999999988774


No 43 
>cd07311 terB_like_1 tellurium resistance terB-like protein, subgroup 1. This family includes several uncharacterized bacterial proteins. The prototype of this CD is tellurite resistance protein from Nostoc punctiforme that belongs to COG3793. Its precise biological function and its mechanism responsible for tellurium resistance still remains rather poorly understood.
Probab=22.26  E-value=1.5e+02  Score=20.06  Aligned_cols=21  Identities=19%  Similarity=0.409  Sum_probs=18.7

Q ss_pred             eHHHHHHHHHHHHHhcCCCCH
Q psy2167          14 PLVWATNIINRARKEALISSD   34 (79)
Q Consensus        14 Pi~Wa~~ll~~ar~eg~i~~~   34 (79)
                      |..|..+|+.-|+..|.++..
T Consensus        23 ~~~~~~~Ll~iAkADG~Vse~   43 (150)
T cd07311          23 KLAYLKALLVCAKGDGVISPE   43 (150)
T ss_pred             HHHHHHHHHHHHHcCCCCCHH
Confidence            578999999999999999854


No 44 
>PTZ00233 variable surface protein Vir18; Provisional
Probab=22.26  E-value=62  Score=26.53  Aligned_cols=34  Identities=12%  Similarity=0.210  Sum_probs=25.7

Q ss_pred             HHHHHHhcCCCCHHHHHHHHHHHHHHHHHhcccccCc
Q psy2167          22 INRARKEALISSDHVVQTLLYELSEHRRKLGSIISYD   58 (79)
Q Consensus        22 l~~ar~eg~i~~~~~~~~l~~ei~~FR~~l~~L~~yD   58 (79)
                      |+.|...|+|..+ +...  +.|++|+.+|-..-++-
T Consensus        67 LK~Cy~~~~i~~~-Li~D--d~IK~F~~kC~~~~~C~  100 (509)
T PTZ00233         67 LKHCYDNGYINSD-LNND--EKIKNFKSKCNNKGTCN  100 (509)
T ss_pred             hhhhhhcCccccc-ccch--HhHHHHHHhCCCCccCC
Confidence            5778888998887 3444  89999999998654443


No 45 
>PF04714 BCL_N:  BCL7, N-terminal conserver region;  InterPro: IPR006804 The members of this group of sequences contain a conserved N-terminal domain which is found in the BCL7 family. The function of BCL7 proteins is unknown, though they may be involved in early development. Notably, BCL7B is commonly hemizygously deleted in patients with Williams syndrome [].
Probab=21.72  E-value=1.4e+02  Score=17.38  Aligned_cols=27  Identities=19%  Similarity=0.180  Sum_probs=18.6

Q ss_pred             HHHHHHHHHHHHHHH--------hcccccCcceec
Q psy2167          36 VVQTLLYELSEHRRK--------LGSIISYDTVCV   62 (79)
Q Consensus        36 ~~~~l~~ei~~FR~~--------l~~L~~yDWVpI   62 (79)
                      .++.++..|...|.-        =..|.-|-||||
T Consensus        16 diKrv~~~iekVr~wEKKWVtv~dtslriyKWVPv   50 (52)
T PF04714_consen   16 DIKRVMASIEKVRKWEKKWVTVGDTSLRIYKWVPV   50 (52)
T ss_pred             HHHHHHHHHHHHHHHhhceEEeccceEEEEEEEec
Confidence            467777777666653        245778999997


No 46 
>PF00458 WHEP-TRS:  WHEP-TRS domain;  InterPro: IPR000738 A conserved domain of 46 amino acids, called WHEP-TRS has been shown [] to exist in a number of higher eukaryote aminoacyl-transfer RNA synthetases. This domain is present one to six times in the several enzymes. There are three copies in mammalian multifunctional aminoacyl-tRNA synthetase in a region that separates the N-terminal glutamyl-tRNA synthetase domain from the C-terminal prolyl-tRNA synthetase domain, and six copies in the intercatalytic region of the Drosophila enzyme. The domain is found at the N-terminal extremity of the mammalian tryptophanyl- tRNA synthetase and histidyl-tRNA synthetase, and the mammalian, insect, nematode and plant glycyl- tRNA synthetases []. This domain could contain a central alpha-helical region and may play a role in the association of tRNA-synthetases into multienzyme complexes.; GO: 0004812 aminoacyl-tRNA ligase activity, 0005524 ATP binding, 0006418 tRNA aminoacylation for protein translation; PDB: 1R1B_A 1D2D_A 1X59_A 1FYJ_A 2ZT8_A 2ZXF_A 2ZT7_A 2ZT6_A 2PME_A 2ZT5_A ....
Probab=21.65  E-value=1.8e+02  Score=16.76  Aligned_cols=42  Identities=12%  Similarity=0.160  Sum_probs=31.9

Q ss_pred             HHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHhcccccCcceecc
Q psy2167          20 NIINRARKEALISSDHVVQTLLYELSEHRRKLGSIISYDTVCVP   63 (79)
Q Consensus        20 ~ll~~ar~eg~i~~~~~~~~l~~ei~~FR~~l~~L~~yDWVpIP   63 (79)
                      +.+|+-..+|  .+......-+++|.+.+..+..+..-||.|=.
T Consensus        10 ~~VR~LKa~k--a~k~~i~~aV~~Ll~LK~~~~~~tg~~~~~~~   51 (56)
T PF00458_consen   10 DKVRKLKAEK--ADKEEIDAAVAKLLELKAELKELTGKDYKPGK   51 (56)
T ss_dssp             HHHHHHHHTT----HHHHHHHHHHHHHHHHHHHHHCSSSTTSSS
T ss_pred             HHHHHHHccC--CcHHHHHHHHHHHHHHHHHHHHhhCCCCCCCC
Confidence            4566666666  34446899999999999999999999987743


No 47 
>COG2089 SpsE Sialic acid synthase [Cell envelope biogenesis, outer membrane]
Probab=21.41  E-value=37  Score=26.61  Aligned_cols=52  Identities=29%  Similarity=0.270  Sum_probs=34.8

Q ss_pred             eeeHHHHHHHHHHHHHhcCC--CCHHHHHHHHHHHHHHHHHhcccccCcceecch
Q psy2167          12 WMPLVWATNIINRARKEALI--SSDHVVQTLLYELSEHRRKLGSIISYDTVCVPL   64 (79)
Q Consensus        12 wvPi~Wa~~ll~~ar~eg~i--~~~~~~~~l~~ei~~FR~~l~~L~~yDWVpIPL   64 (79)
                      =+|..|-..|...||++|.+  .|+ .=..-++++.++--..-.+=+++-...||
T Consensus        86 ~~p~e~~~~Lke~a~~~Gi~~~SSP-fd~~svd~l~~~~~~ayKIaS~E~~~~pl  139 (347)
T COG2089          86 ETPLEWHAQLKEYARKRGIIFFSSP-FDLTAVDLLESLNPPAYKIASGEINDLPL  139 (347)
T ss_pred             cCCHHHHHHHHHHHHHcCeEEEecC-CCHHHHHHHHhcCCCeEEecCccccChHH
Confidence            37999999999999999975  333 33445566666555555555555555554


No 48 
>PF05791 Bacillus_HBL:  Bacillus haemolytic enterotoxin (HBL);  InterPro: IPR008414 This family consists of several Bacillus haemolytic enterotoxins (HblC, HblD, HblA, NheA, and NheB), which can cause food poisoning in humans []. Haemolysin BL (encoded by HBL) and non-haemolytic enterotoxin (encoded by NHE), represent the major enterotoxins produced by Bacillus cereus. Most of the cytotoxic activity of B. cereus isolates has been attributed to the level of Nhe, which may indicate a highly diarrheic potential []. The exact mechanism by which B. cereus causes diarrhoea is unknown. Hbl, cytotoxin K (CytK) and Nhe are all putative causes. Both Hbl and Nhe are three-component cytotoxins and maximal cytotoxicity of Nhe against epithelia is dependent on all three components. Nhe has haemolytic activity against erythrocytes from a variety of species. It is possible that the common structural and functional properties of these toxins indicate that the Hbl/Nhe and ClyA families of toxins constitute a superfamily of pore-forming cytotoxins []. The high virulence of some strains is thought to be due to the greater cytotoxic activity of CytK-1 compared to CytK-2, and to a high level of cytK expression []. Haemolysin BL and non-haemolytic enterotoxin production are both influenced by pH and micro []. This entry is found in cytotoxic proteins that form part of the enterotoxin complex and bind to erythrocytes. HblA is composed of a binding component, B, and two lytic components, L1 and L2. All three subunits act synergically to cause hemolysis.; GO: 0009405 pathogenesis, 0016020 membrane; PDB: 2NRJ_A.
Probab=20.87  E-value=99  Score=21.40  Aligned_cols=18  Identities=39%  Similarity=0.560  Sum_probs=14.9

Q ss_pred             HHHHHHHHHHHHHHHhcc
Q psy2167          36 VVQTLLYELSEHRRKLGS   53 (79)
Q Consensus        36 ~~~~l~~ei~~FR~~l~~   53 (79)
                      ..+.+++||++||.+|..
T Consensus       125 ~~~~~i~~L~~f~~~l~~  142 (184)
T PF05791_consen  125 KVQALINELNDFKDKLQK  142 (184)
T ss_dssp             HHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHHHH
Confidence            577889999999998753


No 49 
>PLN03075 nicotianamine synthase; Provisional
Probab=20.82  E-value=1.3e+02  Score=22.72  Aligned_cols=35  Identities=23%  Similarity=0.217  Sum_probs=24.3

Q ss_pred             HHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHhccccc
Q psy2167          18 ATNIINRARKEALISSDHVVQTLLYELSEHRRKLGSIIS   56 (79)
Q Consensus        18 a~~ll~~ar~eg~i~~~~~~~~l~~ei~~FR~~l~~L~~   56 (79)
                      +.+|+..+....    +.....+.+||.++|.+|..||+
T Consensus        38 f~~Lv~~c~~~~----~~~~~~l~~~i~~~~~~l~~l~~   72 (296)
T PLN03075         38 FTQLVSTCIPPS----SIDVTKLCEEIQEMRSKLIKLCG   72 (296)
T ss_pred             HHHHHHHhCCCC----cchHHHhhHHHHHHHHHHHHHHH
Confidence            344555554333    23456899999999999998886


No 50 
>KOG2787|consensus
Probab=20.75  E-value=1.6e+02  Score=23.53  Aligned_cols=47  Identities=21%  Similarity=0.416  Sum_probs=37.3

Q ss_pred             HHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHhcccccCcceecchhh
Q psy2167          16 VWATNIINRARKEALISSDHVVQTLLYELSEHRRKLGSIISYDTVCVPLVY   66 (79)
Q Consensus        16 ~Wa~~ll~~ar~eg~i~~~~~~~~l~~ei~~FR~~l~~L~~yDWVpIPLvY   66 (79)
                      -|+.-.+++-.-+|.|.+| .+..++++|-+==..++.=-+   -|=||.|
T Consensus       164 L~a~lflNk~ig~~ti~~~-~i~~i~~~I~~sGr~~a~k~~---~~cPLmY  210 (403)
T KOG2787|consen  164 LWACLFLNKYIGQETIPDD-DIRSIVQAILTSGRELAKKEN---SPCPLMY  210 (403)
T ss_pred             HHHHHHHHhhcCCCcCCHH-HHHHHHHHHHHHHHHHHhcCC---CCCCchh
Confidence            5999999999999999876 789999998766555665554   6677776


No 51 
>COG3943 Virulence protein [General function prediction only]
Probab=20.50  E-value=2.7e+02  Score=21.40  Aligned_cols=40  Identities=15%  Similarity=0.295  Sum_probs=31.7

Q ss_pred             HHHHHHHHHHHHHhcCCCCHH---------HHHHHHHHHHHHHHHhccc
Q psy2167          15 LVWATNIINRARKEALISSDH---------VVQTLLYELSEHRRKLGSI   54 (79)
Q Consensus        15 i~Wa~~ll~~ar~eg~i~~~~---------~~~~l~~ei~~FR~~l~~L   54 (79)
                      -+||..-+.+---+|...+|-         ....+++-|++.|.+-..+
T Consensus       103 rqwatq~lqeyl~kgfvmdderlk~~~~gdy~~emlerirdiraser~v  151 (329)
T COG3943         103 RQWATQTLQEYLIKGFVMDDERLKNPPVGDYFKEMLERIRDIRASERRV  151 (329)
T ss_pred             HHHHHHHHHHHHHhhcccchhhccCCCcchHHHHHHHHHHHHHHHHHHH
Confidence            379999999988888776652         5678999999999886543


No 52 
>PF14637 FNIP_M:  Folliculin-interacting protein middle domain
Probab=20.40  E-value=2.2e+02  Score=20.91  Aligned_cols=39  Identities=10%  Similarity=0.165  Sum_probs=30.7

Q ss_pred             HHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHhcccccC
Q psy2167          19 TNIINRARKEALISSDHVVQTLLYELSEHRRKLGSIISY   57 (79)
Q Consensus        19 ~~ll~~ar~eg~i~~~~~~~~l~~ei~~FR~~l~~L~~y   57 (79)
                      ++=++.+.++.....-..++.+.+++.+||+.+..|.+-
T Consensus        51 m~rL~~~i~~a~~~~~~f~~~i~~A~~~f~~~i~~L~~a   89 (232)
T PF14637_consen   51 MSRLRSAIEKAYSRRQQFLSIIYEAWEEFRQSICDLYTA   89 (232)
T ss_pred             HHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhcC
Confidence            344556667777776778999999999999999988754


Done!