Query         psy623
Match_columns 69
No_of_seqs    142 out of 1065
Neff          8.5 
Searched_HMMs 46136
Date          Fri Aug 16 21:06:33 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy623.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/623hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG0417|consensus               99.9 1.4E-22   3E-27  117.7   6.6   57    9-66      2-58  (148)
  2 COG5078 Ubiquitin-protein liga  99.9 6.7E-22 1.5E-26  116.3   8.8   59    8-66      5-63  (153)
  3 KOG0419|consensus               99.8 3.7E-21   8E-26  110.0   6.7   60    5-65      1-60  (152)
  4 KOG0426|consensus               99.8 7.6E-21 1.6E-25  109.0   6.3   61    6-66      2-62  (165)
  5 PTZ00390 ubiquitin-conjugating  99.8 5.6E-20 1.2E-24  108.3   8.2   57    9-66      3-59  (152)
  6 PLN00172 ubiquitin conjugating  99.8 6.6E-20 1.4E-24  107.4   8.2   57    9-66      2-58  (147)
  7 KOG0894|consensus               99.8 3.8E-20 8.3E-25  113.0   6.6   60    6-66      3-62  (244)
  8 PF00179 UQ_con:  Ubiquitin-con  99.7 1.4E-17   3E-22   96.4   7.0   55   12-66      1-55  (140)
  9 cd00195 UBCc Ubiquitin-conjuga  99.7 3.4E-17 7.5E-22   94.9   7.6   55   11-66      2-56  (141)
 10 KOG0425|consensus               99.7 4.1E-17 8.9E-22   95.7   7.0   58    9-66      6-63  (171)
 11 KOG0421|consensus               99.7 1.2E-16 2.5E-21   92.9   7.1   62    4-66     25-86  (175)
 12 smart00212 UBCc Ubiquitin-conj  99.7 6.3E-16 1.4E-20   90.0   7.5   56   11-66      1-56  (145)
 13 KOG0418|consensus               99.6 5.4E-16 1.2E-20   93.2   6.4   57    9-66      4-63  (200)
 14 KOG0424|consensus               99.6 5.4E-15 1.2E-19   85.8   6.9   61    5-65      1-65  (158)
 15 KOG0428|consensus               99.6 2.5E-15 5.5E-20   93.8   5.0   58    7-66     10-67  (314)
 16 KOG0427|consensus               99.5   4E-14 8.8E-19   81.3   6.7   60    6-67     13-72  (161)
 17 KOG0416|consensus               99.3 3.4E-12 7.3E-17   76.0   5.2   55    8-66      3-57  (189)
 18 KOG0423|consensus               99.3 1.8E-12 3.8E-17   77.6   2.5   57    7-64      9-65  (223)
 19 KOG0422|consensus               99.2 1.4E-11   3E-16   71.3   4.2   57    9-66      3-59  (153)
 20 KOG0895|consensus               98.7 2.7E-08 5.9E-13   71.8   6.2   59    7-66    281-339 (1101)
 21 KOG0420|consensus               98.5 3.2E-07   7E-12   55.0   5.5   57    7-66     27-86  (184)
 22 KOG0429|consensus               98.5 5.3E-07 1.2E-11   56.0   6.0   53   12-66     23-75  (258)
 23 KOG0895|consensus               98.4 1.6E-07 3.5E-12   68.0   2.7   55   11-66    854-908 (1101)
 24 KOG0896|consensus               98.3 1.1E-06 2.3E-11   50.8   4.2   57    8-64      5-64  (138)
 25 smart00340 HALZ homeobox assoc  88.1    0.55 1.2E-05   22.2   1.9   15    9-23     20-34  (44)
 26 PF11619 P53_C:  Transcription   72.9     8.5 0.00018   19.9   3.2   24   36-66      2-25  (71)
 27 PF15241 Cylicin_N:  Cylicin N-  64.5     3.7 8.1E-05   23.2   0.9   11   59-69     26-36  (110)
 28 PF04881 Adeno_GP19K:  Adenovir  60.5      11 0.00023   22.1   2.3   30   35-64     44-74  (139)
 29 PF12065 DUF3545:  Protein of u  59.6     8.4 0.00018   19.4   1.6   11   11-21     37-47  (59)
 30 PF12075 KN_motif:  KN motif;    58.8     4.5 9.7E-05   18.7   0.5   12   50-64      5-16  (39)
 31 PF08203 RNA_polI_A14:  Yeast R  57.6       9 0.00019   20.3   1.6   13    8-20     59-71  (76)
 32 PF04904 NCD1:  NAB conserved r  53.0      23  0.0005   18.9   2.7   17    8-24     61-77  (82)
 33 PF09929 DUF2161:  Uncharacteri  49.5      35 0.00077   19.5   3.3   24    4-27     23-46  (118)
 34 PF04314 DUF461:  Protein of un  48.5      29 0.00064   19.0   2.8   27   40-66     77-103 (110)
 35 TIGR02422 protocat_beta protoc  48.1      19 0.00042   22.6   2.2   18   38-55    203-220 (220)
 36 PF12011 DUF3503:  Domain of un  47.2      23  0.0005   21.6   2.4   46   13-64     23-68  (170)
 37 PF05709 Sipho_tail:  Phage tai  45.7      67  0.0015   19.4   6.1   57    8-66     52-110 (249)
 38 PF12259 DUF3609:  Protein of u  45.7      19 0.00041   24.3   2.1   55    9-66     33-87  (361)
 39 PF01107 MP:  Viral movement pr  44.9      69  0.0015   19.3   4.8   37   26-64    130-167 (189)
 40 COG2847 Copper(I)-binding prot  44.8      67  0.0014   19.2   4.1   27   40-66    104-130 (151)
 41 KOG3285|consensus               43.3      73  0.0016   19.8   4.1   43    7-50    118-160 (203)
 42 PF02970 TBCA:  Tubulin binding  40.9      37 0.00081   18.2   2.4   15    7-21      6-20  (90)
 43 PF14824 Sirohm_synth_M:  Siroh  40.3      36 0.00078   14.8   2.3   16    5-20     14-29  (30)
 44 PF14501 HATPase_c_5:  GHKL dom  39.2      26 0.00056   18.5   1.6   13   53-65     88-100 (100)
 45 PF14436 EndoU_bacteria:  Bacte  38.7      46   0.001   18.6   2.7   18   51-68     42-59  (128)
 46 PF12915 DUF3833:  Protein of u  35.2      85  0.0018   19.0   3.5   25   35-63    107-131 (164)
 47 PF06675 DUF1177:  Protein of u  34.7      24 0.00052   23.0   1.2   17   48-64    180-196 (276)
 48 PF07547 RSD-2:  RSD-2 N-termin  34.6      73  0.0016   17.1   2.9   28   38-65     16-44  (84)
 49 cd02860 Pullulanase_N_term Pul  33.6      79  0.0017   16.7   3.4   24   37-64     45-68  (100)
 50 PF12627 PolyA_pol_RNAbd:  Prob  33.4      54  0.0012   15.7   2.2   19    7-25     21-39  (64)
 51 PF00868 Transglut_N:  Transglu  33.4      85  0.0018   17.5   3.2   21   41-63     95-115 (118)
 52 cd02848 Chitinase_N_term Chiti  33.2      47   0.001   18.7   2.1   12   53-64     77-88  (106)
 53 TIGR03601 B_an_ocin probable h  31.5      49  0.0011   17.3   1.8   22   13-42      4-25  (79)
 54 TIGR01633 phi3626_gp14_N putat  31.1      94   0.002   16.8   7.0   57    9-67     63-121 (124)
 55 PF06305 DUF1049:  Protein of u  31.0      59  0.0013   15.9   2.1   15    9-23     49-63  (68)
 56 CHL00130 rbcS ribulose-1,5-bis  29.9 1.2E+02  0.0027   17.8   4.9   33    9-43     60-93  (138)
 57 PF08872 KGK:  KGK domain;  Int  29.7 1.1E+02  0.0024   17.3   3.8   25   41-65     67-93  (114)
 58 PF09943 DUF2175:  Uncharacteri  29.0      68  0.0015   17.9   2.3   20   40-61      2-21  (101)
 59 PF07691 PA14:  PA14 domain;  I  28.6 1.1E+02  0.0023   16.9   3.1   20   39-63     48-67  (145)
 60 PHA00672 hypothetical protein   27.7      55  0.0012   19.2   1.8   20   48-67     38-57  (152)
 61 KOG0680|consensus               26.7 1.8E+02  0.0039   20.0   4.2   44    8-54    332-377 (400)
 62 COG5402 Uncharacterized conser  26.7 1.6E+02  0.0036   18.2   3.9   29   40-69    139-173 (194)
 63 PF14135 DUF4302:  Domain of un  25.3 1.8E+02  0.0039   18.2   5.4   46    8-65      9-54  (235)
 64 PF13115 YtkA:  YtkA-like        25.0      69  0.0015   16.3   1.8   14   53-66     64-77  (86)
 65 PF05015 Plasmid_killer:  Plasm  24.7 1.2E+02  0.0027   16.1   5.4   39    8-46     25-70  (93)
 66 KOG2193|consensus               24.2 1.1E+02  0.0025   21.7   3.1   40   15-54    431-470 (584)
 67 COG3866 PelB Pectate lyase [Ca  23.7 1.1E+02  0.0024   20.7   2.8   25   39-63    197-222 (345)
 68 PHA00458 single-stranded DNA-b  23.7 1.3E+02  0.0029   19.2   3.1   16   52-67     33-48  (233)
 69 COG2880 Uncharacterized protei  23.7      20 0.00042   18.6  -0.5    9   52-60     10-18  (67)
 70 PF15586 Imm47:  Immunity prote  23.4 1.5E+02  0.0033   16.7   4.2   28   34-62     22-49  (116)
 71 PF03366 YEATS:  YEATS family;   23.3      23 0.00049   18.8  -0.3   11   40-50      3-13  (84)
 72 PF02922 CBM_48:  Carbohydrate-  23.3 1.2E+02  0.0025   15.2   3.7   27   36-65     46-73  (85)
 73 PTZ00281 actin; Provisional     23.0 1.8E+02   0.004   19.3   3.8   35    7-42    308-342 (376)
 74 PF15590 Imm15:  Immunity prote  22.4 1.3E+02  0.0029   15.6   3.1   39   15-57     13-51  (69)
 75 PF04447 DUF550:  Protein of un  22.0 1.1E+02  0.0023   17.1   2.1   17    8-24     22-38  (100)
 76 PF13086 AAA_11:  AAA domain; P  21.1      67  0.0014   18.8   1.4    9   43-51     20-28  (236)
 77 PF01954 DUF104:  Protein of un  21.0      36 0.00079   17.0   0.2    9   52-60      8-16  (60)
 78 PF09036 Bcr-Abl_Oligo:  Bcr-Ab  21.0 1.2E+02  0.0027   16.0   2.2   12    8-19     40-51  (79)
 79 PF01774 UreD:  UreD urease acc  20.8 1.5E+02  0.0032   18.2   2.9   23   43-65      2-24  (209)
 80 PF13119 DUF3973:  Domain of un  20.7      46   0.001   15.4   0.5   12   47-58     15-26  (41)
 81 KOG0744|consensus               20.7   1E+02  0.0022   21.3   2.3   16   36-51    171-188 (423)
 82 COG3254 Uncharacterized conser  20.6      31 0.00067   19.4  -0.1   12   23-35     36-47  (105)
 83 cd03463 3,4-PCD_alpha Protocat  20.6 1.8E+02  0.0038   17.8   3.2   17   37-53    167-183 (185)
 84 PF14172 DUF4309:  Domain of un  20.4      86  0.0019   18.1   1.7   17   53-69    108-125 (134)
 85 TIGR03517 GldM_gliding gliding  20.3 1.1E+02  0.0023   21.9   2.4   28   37-64    231-258 (523)

No 1  
>KOG0417|consensus
Probab=99.88  E-value=1.4e-22  Score=117.74  Aligned_cols=57  Identities=32%  Similarity=0.641  Sum_probs=54.1

Q ss_pred             HHHHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623            9 ALRALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWYK   66 (69)
Q Consensus         9 ~~~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp~   66 (69)
                      +.+||.||+++++.+++++|++. ++++|+++|+|+|.||.+||||||+|+|.|.||.
T Consensus         2 a~~RI~kE~~~l~~dp~~~~~~~-~~~dnl~~w~a~I~GP~~SpYEgG~F~l~I~~p~   58 (148)
T KOG0417|consen    2 ASKRIIKELQDLLRDPPPGCSAG-PVGDNLFHWQATILGPPGSPYEGGVFFLEIHFPE   58 (148)
T ss_pred             cHHHHHHHHHHHhcCCCCCCccC-CCCCceeeEEEEEECCCCCCcCCCEEEEEEECCC
Confidence            34699999999999999999999 9999999999999999999999999999999974


No 2  
>COG5078 Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones]
Probab=99.87  E-value=6.7e-22  Score=116.30  Aligned_cols=59  Identities=49%  Similarity=0.849  Sum_probs=54.5

Q ss_pred             HHHHHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623            8 SALRALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWYK   66 (69)
Q Consensus         8 ~~~~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp~   66 (69)
                      .+.+||++|+++++++++++|++.+.+++|+++|+++|.||+|||||||+|++.|+||.
T Consensus         5 ~a~~RL~kE~~~l~~~~~~~~~a~p~~d~~l~~w~~~i~GP~dtpYegg~f~~~l~fP~   63 (153)
T COG5078           5 SALKRLLKELKKLQKDPPPGISAGPVDDDNLFHWEATITGPPDTPYEGGIFKLTLEFPE   63 (153)
T ss_pred             hHHHHHHHHHHHHhcCCCCceEEEECCCCcceeEEEEEECCCCCCcCCCEEEEEEECCC
Confidence            38899999999999999999999955555999999999999999999999999999964


No 3  
>KOG0419|consensus
Probab=99.85  E-value=3.7e-21  Score=110.05  Aligned_cols=60  Identities=37%  Similarity=0.733  Sum_probs=56.7

Q ss_pred             CchHHHHHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEec
Q psy623            5 PTSSALRALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWY   65 (69)
Q Consensus         5 ~~~~~~~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp   65 (69)
                      |+..+.+||++|+++++++++.|++.. +.++|+..|.+.|.||++|||+||+|+|.|+|+
T Consensus         1 MstpArrrLmrDfkrlqedpp~gisa~-P~~~niM~W~a~I~Gp~~tp~e~gtFkLtl~Ft   60 (152)
T KOG0419|consen    1 MSTPARRRLMRDFKRLQEDPPAGISAA-PVENNIMEWNAVIFGPQDTPFEGGTFKLTLEFT   60 (152)
T ss_pred             CCchHHHHHHHHHHHhhcCCCCCccCC-CCccceeeeeeeEEcCCCCCcCCceEEEEEEcc
Confidence            456789999999999999999999999 889999999999999999999999999999993


No 4  
>KOG0426|consensus
Probab=99.84  E-value=7.6e-21  Score=108.99  Aligned_cols=61  Identities=41%  Similarity=0.721  Sum_probs=57.8

Q ss_pred             chHHHHHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623            6 TSSALRALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWYK   66 (69)
Q Consensus         6 ~~~~~~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp~   66 (69)
                      ...+.+||++||++|..++++||.+.|.++||++.|.+.|.||++|+|+||+|-.++.||+
T Consensus         2 ~~~AlkRLm~EykqLt~~~P~GIvAgP~~EdnfF~W~cLI~GP~~T~f~~GvfpA~l~FP~   62 (165)
T KOG0426|consen    2 AGTALKRLMAEYKQLTLNPPEGIVAGPINEDNFFEWECLIQGPEDTCFEGGVFPARLSFPL   62 (165)
T ss_pred             chhHHHHHHHHHHHHccCCCCcceeCCCCccceeeeeeeeeCCCCCcccCCccceeeecCC
Confidence            3468999999999999999999999988999999999999999999999999999999986


No 5  
>PTZ00390 ubiquitin-conjugating enzyme; Provisional
Probab=99.82  E-value=5.6e-20  Score=108.26  Aligned_cols=57  Identities=23%  Similarity=0.504  Sum_probs=53.8

Q ss_pred             HHHHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623            9 ALRALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWYK   66 (69)
Q Consensus         9 ~~~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp~   66 (69)
                      +.+||++|++++.+++++++.+. +.++|+++|+++|.||++|||+||.|+++|.||.
T Consensus         3 ~~kRl~~E~~~l~~~~~~~i~~~-~~~~d~~~w~~~i~GP~~tpY~gg~f~~~i~~p~   59 (152)
T PTZ00390          3 ISKRIEKETQNLANDPPPGIKAE-PDPGNYRHFKILMEGPDGTPYEGGYYKLELFLPE   59 (152)
T ss_pred             HHHHHHHHHHHHHhCCCCCeEEE-ECCCCccEEEEEEEcCCCCCCcCcEEEEEEECcc
Confidence            56899999999999999999998 7788999999999999999999999999999975


No 6  
>PLN00172 ubiquitin conjugating enzyme; Provisional
Probab=99.82  E-value=6.6e-20  Score=107.40  Aligned_cols=57  Identities=32%  Similarity=0.651  Sum_probs=53.5

Q ss_pred             HHHHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623            9 ALRALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWYK   66 (69)
Q Consensus         9 ~~~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp~   66 (69)
                      +.+||+||++++++++++++.+. ++++|+++|+++|.||++|||+||.|+++|.||.
T Consensus         2 a~~Rl~kE~~~l~~~~~~~~~~~-~~~~nl~~w~~~i~GP~~tpyegg~f~~~i~fp~   58 (147)
T PLN00172          2 ATKRIQKEHKDLLKDPPSNCSAG-PSDENLFRWTASIIGPSDSPYAGGVFFLSILFPP   58 (147)
T ss_pred             hHHHHHHHHHHHHhCCCCCeEEE-ECCCChheEEEEEECCCCCCCCCCEEEEEEECCc
Confidence            35899999999999999999999 7888999999999999999999999999999974


No 7  
>KOG0894|consensus
Probab=99.82  E-value=3.8e-20  Score=113.02  Aligned_cols=60  Identities=32%  Similarity=0.666  Sum_probs=57.4

Q ss_pred             chHHHHHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623            6 TSSALRALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWYK   66 (69)
Q Consensus         6 ~~~~~~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp~   66 (69)
                      +..+.+||+|||+.|.++|.+.|.+. +.++|+.+||.+|.||+||||+||.|+.+|.||+
T Consensus         3 ~k~a~kRl~keY~~l~k~Pv~~i~A~-P~p~nILEWHYvl~GpedTPy~GG~YhGkl~FP~   62 (244)
T KOG0894|consen    3 SKAAVKRLQKEYRALCKDPVPYIVAR-PNPNNILEWHYVLRGPEDTPYYGGYYHGKLIFPP   62 (244)
T ss_pred             chHHHHHHHHHHHHHHhCCchhhccC-CCccceeeeEEEeeCCCCCCccCceeeeEEeCCC
Confidence            45799999999999999999999999 9999999999999999999999999999999975


No 8  
>PF00179 UQ_con:  Ubiquitin-conjugating enzyme;  InterPro: IPR000608 The post-translational attachment of ubiquitin (IPR000626 from INTERPRO) to proteins (ubiquitinylation) alters the function, location or trafficking of a protein, or targets it to the 26S proteasome for degradation [, , ]. Ubiquitinylation is an ATP-dependent process that involves the action of at least three enzymes: a ubiquitin-activating enzyme (E1, IPR000011 from INTERPRO), a ubiquitin-conjugating enzyme (E2), and a ubiquitin ligase (E3, IPR000569 from INTERPRO, IPR003613 from INTERPRO), which work sequentially in a cascade []. The E1 enzyme mediates an ATP-dependent transfer of a thioester-linked ubiquitin molecule to a cysteine residue on the E2 enzyme. The E2 enzyme (6.3.2.19 from EC) then either transfers the ubiquitin moiety directly to a substrate, or to an E3 ligase, which can also ubiquitinylate a substrate. There are several different E2 enzymes (over 30 in humans), which are broadly grouped into four classes, all of which have a core catalytic domain (containing the active site cysteine), and some of which have short N- and C-terminal amino acid extensions: class I enzymes consist of just the catalytic core domain (UBC), class II possess a UBC and a C-terminal extension, class III possess a UBC and an N-terminal extension, and class IV possess a UBC and both N- and C-terminal extensions. These extensions appear to be important for some subfamily function, including E2 localisation and protein-protein interactions []. In addition, there are proteins with an E2-like fold that are devoid of catalytic activity, but which appear to assist in poly-ubiquitin chain formation.; GO: 0016881 acid-amino acid ligase activity; PDB: 2AAK_A 3SY2_C 1FBV_C 3SQV_C 1C4Z_D 1JAT_B 2GMI_B 2H2Y_D 2R0J_A 3E95_B ....
Probab=99.73  E-value=1.4e-17  Score=96.41  Aligned_cols=55  Identities=49%  Similarity=0.877  Sum_probs=50.8

Q ss_pred             HHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623           12 ALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWYK   66 (69)
Q Consensus        12 Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp~   66 (69)
                      ||++|+++++++++.++.+.+.+++|+++|+++|.||++|||+||.|+|.|.||.
T Consensus         1 Rl~~E~~~l~~~~~~~~~~~~~~~~~~~~w~~~i~gp~~t~y~gg~f~~~i~~p~   55 (140)
T PF00179_consen    1 RLQKELKELQKNPPPGISVQPSEDDNLFEWHVTIFGPPGTPYEGGIFKFRISFPP   55 (140)
T ss_dssp             HHHHHHHHHHHSHTTTEEEEEESTTETTEEEEEEEBETTSTTTTSEEEEEEEETT
T ss_pred             CHHHHHHHHhhCCCCCEEEEECCCCChheEEEEEeccCccceecccccccccccc
Confidence            8999999999999999999944445999999999999999999999999999975


No 9  
>cd00195 UBCc Ubiquitin-conjugating enzyme E2, catalytic (UBCc) domain. This is part of the ubiquitin-mediated protein degradation pathway in which a thiol-ester linkage forms between a conserved cysteine and the C-terminus of ubiquitin and complexes with ubiquitin protein ligase enzymes, E3.  This pathway regulates many fundamental cellular processes.  There are also other E2s which form thiol-ester linkages without the use of E3s as well as several UBC homologs (TSG101, Mms2, Croc-1 and similar proteins) which lack the active site cysteine essential for ubiquitination and appear to function in DNA repair pathways which were omitted from the scope of this CD.
Probab=99.72  E-value=3.4e-17  Score=94.89  Aligned_cols=55  Identities=38%  Similarity=0.753  Sum_probs=52.1

Q ss_pred             HHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623           11 RALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWYK   66 (69)
Q Consensus        11 ~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp~   66 (69)
                      +||++|+++++++++.++.+. ++++|+++|+++|.||++|||+||.|++.|.||+
T Consensus         2 ~Rl~~E~~~l~~~~~~~~~v~-~~~~~~~~w~~~i~g~~~t~y~g~~~~~~~~~p~   56 (141)
T cd00195           2 KRLQKELKDLKKDPPSGISAE-PVEENLLEWHGTIRGPPDTPYEGGIFKLDIEFPE   56 (141)
T ss_pred             chHHHHHHHHHhCCCCCeEEE-ECCCChhEEEEEEecCCCCCccCCEEEEEEECCC
Confidence            699999999999999999999 7788999999999999999999999999999974


No 10 
>KOG0425|consensus
Probab=99.71  E-value=4.1e-17  Score=95.66  Aligned_cols=58  Identities=59%  Similarity=1.019  Sum_probs=54.1

Q ss_pred             HHHHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623            9 ALRALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWYK   66 (69)
Q Consensus         9 ~~~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp~   66 (69)
                      +..-|.++|++|++++.+++++...++.|+++|.+.|.||+||.|+||.|+..+.||.
T Consensus         6 a~~ll~~qlk~L~~~pv~gf~~glvd~~dif~WeV~i~gppdTlYeGG~FkA~m~FP~   63 (171)
T KOG0425|consen    6 ASLLLLKQLKELQEEPVEGFSVGLVDDSDIFEWEVAIIGPPDTLYEGGFFKAHMKFPQ   63 (171)
T ss_pred             hHHHHHHHHHHHhcCCCCccccccccCCceeEEEEEEEcCCCccccCceeEEEEeCcc
Confidence            6677899999999999999999977788999999999999999999999999999975


No 11 
>KOG0421|consensus
Probab=99.69  E-value=1.2e-16  Score=92.88  Aligned_cols=62  Identities=31%  Similarity=0.485  Sum_probs=57.0

Q ss_pred             CCchHHHHHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623            4 VPTSSALRALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWYK   66 (69)
Q Consensus         4 ~~~~~~~~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp~   66 (69)
                      .......+||++||..|.....++|++. +++||++.|..+|.||.||+|+|-.|++.+.||.
T Consensus        25 ~~~~~V~KRLq~ELm~Lmms~~~gISAF-P~~dnlf~WvGtItGp~dTvyegl~yklSl~Fp~   86 (175)
T KOG0421|consen   25 VDGHSVTKRLQSELMGLMMSNTPGISAF-PESDNLFKWVGTITGPKDTVYEGLKYKLSLSFPN   86 (175)
T ss_pred             ccCchHHHHHHHHHHHHHhcCCCCcccC-cCcCceeEEeeEeeCCCCccccCcEEEEEEecCC
Confidence            3445678999999999999999999999 8889999999999999999999999999999974


No 12 
>smart00212 UBCc Ubiquitin-conjugating enzyme E2, catalytic domain homologues. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved  cysteine residue of UBC homologues. This pathway functions in regulating many fundamental processes required for cell viability.TSG101 is one of several UBC homologues that lacks this active site cysteine.
Probab=99.66  E-value=6.3e-16  Score=89.96  Aligned_cols=56  Identities=39%  Similarity=0.732  Sum_probs=50.7

Q ss_pred             HHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623           11 RALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWYK   66 (69)
Q Consensus        11 ~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp~   66 (69)
                      +||++|++++++++++++.+.+.+++|+++|+++|.||++|||+||.|++.|.||.
T Consensus         1 ~Rl~~E~~~~~~~~~~~~~v~~~~~~~~~~w~~~i~gp~~~~y~g~~f~~~l~~p~   56 (145)
T smart00212        1 KRLLKELKELLKDPPPGISAYPVDEDNLLEWTGTIVGPPGTPYEGGIFKLTIEFPP   56 (145)
T ss_pred             ChHHHHHHHHHhCCCCCeEEEECCCCChheEEEEEEcCCCCCcCCcEEEEEEECCc
Confidence            59999999999999999998844445999999999999999999999999999964


No 13 
>KOG0418|consensus
Probab=99.65  E-value=5.4e-16  Score=93.20  Aligned_cols=57  Identities=32%  Similarity=0.524  Sum_probs=52.3

Q ss_pred             HHHHHHHHHHHhhhCC---CCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623            9 ALRALSMEYKSLQEEP---VEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWYK   66 (69)
Q Consensus         9 ~~~Rl~kEl~~l~~~~---~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp~   66 (69)
                      +.+||++|++++..++   ..+|.+. ..++|+.+....|.||+|||||||+|.++|+||-
T Consensus         4 ~~~ri~~e~k~v~~~~eisq~~I~ve-~vn~~~~~ikG~I~GP~~TPYEGG~FeldI~iPe   63 (200)
T KOG0418|consen    4 AFKRINREQKEVLDDPEISQAGIIVE-MVNENLKEIKGHIAGPEDTPYEGGVFELDIKIPE   63 (200)
T ss_pred             HHHHHHHHHHHhccChhhhhcceEEE-EccCChhhceeEecCCCCCCCCCceEEEEEecCC
Confidence            7799999999999887   5789998 7778999999999999999999999999999963


No 14 
>KOG0424|consensus
Probab=99.60  E-value=5.4e-15  Score=85.78  Aligned_cols=61  Identities=33%  Similarity=0.492  Sum_probs=54.3

Q ss_pred             CchHHHHHHHHHHHHhhhCCCCCEEEEecCC----CCcceEEEEEECCCCCCCCCcEEEEEEEec
Q psy623            5 PTSSALRALSMEYKSLQEEPVEGFRVKLVND----DNLFEWEVAIFGPPDTLYQGGYFKVCVVWY   65 (69)
Q Consensus         5 ~~~~~~~Rl~kEl~~l~~~~~~~~~~~~~~~----~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp   65 (69)
                      |+..+..||+.|-+.+.++.+-+|.+.|..+    .|+..|++.|.|+++|+||||.|.+.+.||
T Consensus         1 ~s~~~~~rl~eErk~wrk~hp~gf~AkP~~~~dg~~nl~~Wec~IPG~~~t~wEGg~y~l~v~F~   65 (158)
T KOG0424|consen    1 MSGIALNRLAEERKKWRKDHPFGFYAKPVKNADGTLNLMNWECGIPGKKGTPWEGGLYKLTVNFP   65 (158)
T ss_pred             CcchHHHHHHHHHHHHhhcCCCceeeeccCCCCCcceeEEEEeecCCCCCCcCcCceEEEEEeCC
Confidence            3456889999999999999999999996432    379999999999999999999999999993


No 15 
>KOG0428|consensus
Probab=99.58  E-value=2.5e-15  Score=93.77  Aligned_cols=58  Identities=34%  Similarity=0.680  Sum_probs=53.7

Q ss_pred             hHHHHHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623            7 SSALRALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWYK   66 (69)
Q Consensus         7 ~~~~~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp~   66 (69)
                      .++.+||+||.++|+ +|...+... +-+||+++|+.+|.||.||-|+||+|+.+|.||.
T Consensus        10 npaVkRlmkEa~El~-~Ptd~yha~-plEdNlFEWhFtiRGp~dtdFeGGiYHGRI~lPa   67 (314)
T KOG0428|consen   10 NPAVKRLMKEAAELK-DPTDHYHAQ-PLEDNLFEWHFTIRGPPDTDFEGGIYHGRIVLPA   67 (314)
T ss_pred             CHHHHHHHHHHHHhc-Cchhhhhhc-cchhceeeEEEEeeCCCCCCccCceeeeeEecCC
Confidence            478999999999998 788888888 8899999999999999999999999999999974


No 16 
>KOG0427|consensus
Probab=99.53  E-value=4e-14  Score=81.29  Aligned_cols=60  Identities=28%  Similarity=0.517  Sum_probs=55.0

Q ss_pred             chHHHHHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEecCC
Q psy623            6 TSSALRALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWYKC   67 (69)
Q Consensus         6 ~~~~~~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp~~   67 (69)
                      +..+.+||+|||.+++.++++|+...  ..||+.+|.+.+.|.++|.|+|.+|.|.++||.+
T Consensus        13 s~~at~RLqKEl~e~q~~pP~G~~~~--v~dnlqqWii~v~Ga~GTLYa~e~~qLq~~F~~~   72 (161)
T KOG0427|consen   13 SKIATNRLQKELSEWQNNPPTGFKHR--VTDNLQQWIIEVTGAPGTLYANETYQLQVEFPEH   72 (161)
T ss_pred             HHHHHHHHHHHHHHHhcCCCCcceee--cccchheeEEEEecCCceeecCcEEEEEEecCCC
Confidence            45678999999999999999999987  4679999999999999999999999999999864


No 17 
>KOG0416|consensus
Probab=99.33  E-value=3.4e-12  Score=76.01  Aligned_cols=55  Identities=29%  Similarity=0.479  Sum_probs=47.7

Q ss_pred             HHHHHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623            8 SALRALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWYK   66 (69)
Q Consensus         8 ~~~~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp~   66 (69)
                      ...+||..|+.+|...   +..+. ..++++.+++|.+.||.+|||+||+|+++|++|.
T Consensus         3 ~~~rRid~Dv~KL~~s---~yeV~-~ind~m~ef~V~f~GP~ds~YegGvWkv~V~lPd   57 (189)
T KOG0416|consen    3 SGKRRIDTDVMKLLMS---DYEVT-IINDGMQEFYVKFHGPKDSPYEGGVWKVRVELPD   57 (189)
T ss_pred             CcccchhhHHHHHHhc---CCeEE-EecCcccEEEEEeeCCCCCcccCceEEEEEECCC
Confidence            3458999999998653   46777 7788999999999999999999999999999974


No 18 
>KOG0423|consensus
Probab=99.28  E-value=1.8e-12  Score=77.57  Aligned_cols=57  Identities=32%  Similarity=0.601  Sum_probs=53.0

Q ss_pred             hHHHHHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEe
Q psy623            7 SSALRALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVW   64 (69)
Q Consensus         7 ~~~~~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~f   64 (69)
                      ....+-+.||++.+..+|+.||.+. ++++|+...++.|.||.+|||++|.|++++.+
T Consensus         9 p~vik~~~kEl~~l~~~PPdGIKV~-~NeeD~tdiqa~IeGP~GTPYa~GlFRmKL~L   65 (223)
T KOG0423|consen    9 PNVIKQLAKELKSLDESPPDGIKVV-VNEEDFTDIQADIEGPVGTPYANGLFRMKLAL   65 (223)
T ss_pred             hHHHHHHHHHHHhcccCCCCceEEe-cChHHhHHHHhhccCCCCCccccceeeehhhh
Confidence            3566789999999999999999999 99999999999999999999999999999876


No 19 
>KOG0422|consensus
Probab=99.23  E-value=1.4e-11  Score=71.35  Aligned_cols=57  Identities=28%  Similarity=0.455  Sum_probs=50.0

Q ss_pred             HHHHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623            9 ALRALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWYK   66 (69)
Q Consensus         9 ~~~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp~   66 (69)
                      +.+||.|||.+|+++....+.-...+++|+..|++.|. |+..||..|.|+++|.||.
T Consensus         3 a~~Rl~kEL~dl~~~~~~~~rn~~~~e~nll~wt~lli-pd~ppY~kgaF~l~I~fp~   59 (153)
T KOG0422|consen    3 APRRLRKELADLQKNKMKFFRNIEVDEANLLKWTGLLI-PDKPPYNKGAFRLEIDFPV   59 (153)
T ss_pred             hhHHHHHHHHHHHhccHHHHhhhhcccccceeEEeEec-CCCCCccCcceEEEeeCCC
Confidence            56899999999999988877544367889999999999 9999999999999999964


No 20 
>KOG0895|consensus
Probab=98.75  E-value=2.7e-08  Score=71.82  Aligned_cols=59  Identities=22%  Similarity=0.322  Sum_probs=55.4

Q ss_pred             hHHHHHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623            7 SSALRALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWYK   66 (69)
Q Consensus         7 ~~~~~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp~   66 (69)
                      ....+|+++|++-+.++.++++.+. +.+..+...++.|.||.+|||++|+|.|+|.||.
T Consensus       281 ~~~skrv~ke~~llskdlpEgifvr-p~e~RMd~I~alIig~~gtPy~~glf~Fdiq~P~  339 (1101)
T KOG0895|consen  281 KNWSKKVAKELKLLSKDLPEGIFVR-PDEGRMDLIKALIIGPDGTPYADGLFLFDIQFPD  339 (1101)
T ss_pred             hhhHHHHHHHhhhhcccCCCCcccc-ccccccceeeeEEecCCCCCCcCCceeeEeecCC
Confidence            4577899999999999999999998 8899999999999999999999999999999985


No 21 
>KOG0420|consensus
Probab=98.53  E-value=3.2e-07  Score=54.96  Aligned_cols=57  Identities=28%  Similarity=0.380  Sum_probs=41.5

Q ss_pred             hHHHHHHHHHHHHhhhCCCCCEEEEecC-CCCcc--eEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623            7 SSALRALSMEYKSLQEEPVEGFRVKLVN-DDNLF--EWEVAIFGPPDTLYQGGYFKVCVVWYK   66 (69)
Q Consensus         7 ~~~~~Rl~kEl~~l~~~~~~~~~~~~~~-~~nl~--~w~~~i~GP~~tpY~gg~f~l~i~fp~   66 (69)
                      +.+.-||++|+-++.  .++.+++..++ .+++.  +++++|. |+++.|+||.|+|++++|.
T Consensus        27 s~a~lrl~~di~eln--Lp~t~~~s~~~~~~d~~~~~~elti~-PdEGyY~gGkf~F~~~v~~   86 (184)
T KOG0420|consen   27 SAALLRLKKDILELN--LPPTCSLSFPDSPDDLNNLEFELTIT-PDEGYYQGGKFRFKFKVPN   86 (184)
T ss_pred             cHHHHHHHhhhhhcc--CCCccccccccCCcccccceEEEEEc-cCcceecCceEEEEEECCC
Confidence            456678888887775  45555543233 33333  5999999 9999999999999999964


No 22 
>KOG0429|consensus
Probab=98.49  E-value=5.3e-07  Score=56.01  Aligned_cols=53  Identities=26%  Similarity=0.412  Sum_probs=46.0

Q ss_pred             HHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623           12 ALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWYK   66 (69)
Q Consensus        12 Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp~   66 (69)
                      .|..|+....+.+.+||.+. |+-.|-+.|.++|.+ ..+.|+||+|+|+|-+|.
T Consensus        23 ~llAEf~lV~~ekL~gIyvi-PSyan~l~WFGViFv-r~GiyaggVFRFtIliPd   75 (258)
T KOG0429|consen   23 ALLAEFVLVCREKLDGIYVI-PSYANKLLWFGVIFV-RKGIYAGGVFRFTILIPD   75 (258)
T ss_pred             HHHHHHHHHHhccCCceEEc-ccccccceEEEEEEE-ecccccCceEEEEEEcCc
Confidence            46677778888899999999 888999999999994 456899999999999874


No 23 
>KOG0895|consensus
Probab=98.42  E-value=1.6e-07  Score=67.96  Aligned_cols=55  Identities=24%  Similarity=0.319  Sum_probs=47.4

Q ss_pred             HHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623           11 RALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWYK   66 (69)
Q Consensus        11 ~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp~   66 (69)
                      +..+.|+.-|..+.+.+|.+. ..++.+.-..+.|.||.+|||++|.|.|.|.||.
T Consensus       854 ~~~~~~~~~~~~~~~~~~~vr-~~e~r~d~~~~~~~g~~~tpy~~~~f~fd~~~~~  908 (1101)
T KOG0895|consen  854 KKVQTEWKILPLSLPSGIFVR-AYEDRMDLLRAVIVGAAGTPYQDGLFFFDFQFPQ  908 (1101)
T ss_pred             HHHHHHHHhhhccCCCceEEE-echHHHHHHHHHhhCCCCCccccceEEEEeecCC
Confidence            334456667777888999999 8888888889999999999999999999999975


No 24 
>KOG0896|consensus
Probab=98.33  E-value=1.1e-06  Score=50.82  Aligned_cols=57  Identities=23%  Similarity=0.360  Sum_probs=43.4

Q ss_pred             HHHHHHHHHHHHhhhCCCC-CEEEEecCCCC--cceEEEEEECCCCCCCCCcEEEEEEEe
Q psy623            8 SALRALSMEYKSLQEEPVE-GFRVKLVNDDN--LFEWEVAIFGPPDTLYQGGYFKVCVVW   64 (69)
Q Consensus         8 ~~~~Rl~kEl~~l~~~~~~-~~~~~~~~~~n--l~~w~~~i~GP~~tpY~gg~f~l~i~f   64 (69)
                      +...||.+|+.+-++.--+ .++...-+++|  +..|..+|.||+.|+||+.+|.|+|+.
T Consensus         5 Prnfrlleele~g~kg~g~~~~s~gl~d~~dmtl~rWtg~IiGPprT~yEnRiysLKI~C   64 (138)
T KOG0896|consen    5 PRNFRLLEELEEGEKGIGDGTVSWGLEDDDDMTLTRWTGTIIGPPRTMYENRIYSLKIEC   64 (138)
T ss_pred             ccchhhhhhhccccccccCceeeccccCCCcceEeeeccceeCCCCcccccceeeEEEec
Confidence            3556888888887664433 35555223444  679999999999999999999999987


No 25 
>smart00340 HALZ homeobox associated leucin zipper.
Probab=88.11  E-value=0.55  Score=22.16  Aligned_cols=15  Identities=27%  Similarity=0.242  Sum_probs=12.4

Q ss_pred             HHHHHHHHHHHhhhC
Q psy623            9 ALRALSMEYKSLQEE   23 (69)
Q Consensus         9 ~~~Rl~kEl~~l~~~   23 (69)
                      -++||++|+++|+.-
T Consensus        20 eNrRL~ke~~eLral   34 (44)
T smart00340       20 ENRRLQKEVQELRAL   34 (44)
T ss_pred             HHHHHHHHHHHHHhc
Confidence            468999999999753


No 26 
>PF11619 P53_C:  Transcription factor P53 - C terminal domain;  InterPro: IPR024631 The p53 tumour suppressor [, , , , ] is a protein found in increased amounts in a wide variety of transformed cells. It is also detectable in many proliferating non-transformed cells, but it is undetectable or present at low levels in resting cells. It is frequently mutated or inactivated in many types of cancer. p53 seems to act as a tumour suppressor in some, but probably not all, tumour types. p53 has been implicated in cell cycle regulation, particularly in the monitoring of genomic DNA integrity prior to replication; for this reason it has been dubbed `guardian of the genome'.  p53 is a sequence-specific DNA-binding protein and transcription factor. The structure of p53 comprises 4 domains: an N-terminal transactivation domain; a central DNA-binding domain; an oligomerisation domain; and a C-terminal, basic, regulatory domain [, ]. The structure of the oligomerisation domain consists of a dimer of dimers, each dimer consisting of 2 anti-parallel alpha-helices and an anti-parallel beta-sheet. The sheets lie on opposite sides of the tetramer and the helices form an unusual 4-helix bundle [, ]. While the majority of p53 mutations found in human cancers are located in the DNA-binding domain, some are also found in the oligomerisation domain. This entry represents the C-terminal domain of Drosophila transcription factor p53. While the rest of the protein is quite conserved between the different transcription factors such as p53 and p73, the C-terminal domain is highly divergent. The Drosophila p53 structure is characterised by an additional N-terminal beta-strand and a C-terminal helix [].; PDB: 2RP4_B.
Probab=72.90  E-value=8.5  Score=19.87  Aligned_cols=24  Identities=21%  Similarity=0.474  Sum_probs=16.7

Q ss_pred             CCcceEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623           36 DNLFEWEVAIFGPPDTLYQGGYFKVCVVWYK   66 (69)
Q Consensus        36 ~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp~   66 (69)
                      ++-.+|.|.-.       .+|-|+|.|++|+
T Consensus         2 ~~~~dW~Vsrt-------~dGdYrL~itcp~   25 (71)
T PF11619_consen    2 NSAADWEVSRT-------LDGDYRLVITCPK   25 (71)
T ss_dssp             TTT-S-EEEEE-------TTTCEEEEEEESS
T ss_pred             Cccccceeeec-------cCCceEEEEecCc
Confidence            34567888866       2578999999986


No 27 
>PF15241 Cylicin_N:  Cylicin N-terminus
Probab=64.48  E-value=3.7  Score=23.17  Aligned_cols=11  Identities=9%  Similarity=0.168  Sum_probs=8.8

Q ss_pred             EEEEEecCCCC
Q psy623           59 KVCVVWYKCSQ   69 (69)
Q Consensus        59 ~l~i~fp~~~~   69 (69)
                      +|.+.|||+||
T Consensus        26 hFaLtFPKPpr   36 (110)
T PF15241_consen   26 HFALTFPKPPR   36 (110)
T ss_pred             heeeccCCCCC
Confidence            57788999886


No 28 
>PF04881 Adeno_GP19K:  Adenovirus GP19K;  InterPro: IPR006965 This 19 kDa glycoprotein binds the major histocompatibility (MHC) class I antigens in the endoplasmic reticulum (ER). The ER retention signal at the C terminus of Gp19K causes retention of the complex in the ER, preventing lysis of the cell by cytotoxic T-lymphocytes [].; GO: 0005537 mannose binding, 0050690 regulation of defense response to virus by virus
Probab=60.54  E-value=11  Score=22.11  Aligned_cols=30  Identities=13%  Similarity=0.275  Sum_probs=21.0

Q ss_pred             CCCcceEEEEEECCCCCCCC-CcEEEEEEEe
Q psy623           35 DDNLFEWEVAIFGPPDTLYQ-GGYFKVCVVW   64 (69)
Q Consensus        35 ~~nl~~w~~~i~GP~~tpY~-gg~f~l~i~f   64 (69)
                      +.|-..|.|++.|+++|++. +-+|-+.+-|
T Consensus        44 PGd~~~ytVtV~G~dGs~~~~n~tf~~~FiF   74 (139)
T PF04881_consen   44 PGDPEWYTVTVQGPDGSIRKSNNTFMYKFIF   74 (139)
T ss_pred             CCCCcceEEEEECCCCcceeccccchheeeH
Confidence            44567888999999998865 4455555444


No 29 
>PF12065 DUF3545:  Protein of unknown function (DUF3545);  InterPro: IPR021932  This family of proteins is functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 60 to 77 amino acids in length. This protein has two completely conserved residues (R and L) that may be functionally important. 
Probab=59.64  E-value=8.4  Score=19.42  Aligned_cols=11  Identities=18%  Similarity=0.383  Sum_probs=7.8

Q ss_pred             HHHHHHHHHhh
Q psy623           11 RALSMEYKSLQ   21 (69)
Q Consensus        11 ~Rl~kEl~~l~   21 (69)
                      .||.|||+++.
T Consensus        37 ~rL~kEL~d~D   47 (59)
T PF12065_consen   37 QRLRKELQDMD   47 (59)
T ss_pred             HHHHHHHHHcc
Confidence            57777877764


No 30 
>PF12075 KN_motif:  KN motif;  InterPro: IPR021939  This small motif is found at the N terminus of Kank proteins and has been called the KN (for Kank N-terminal) motif. This protein is found in eukaryotes. Proteins in this family are typically between 413 to 1202 amino acids in length. This protein is found associated with PF00023 from PFAM. This protein has two conserved sequence motifs: TPYG and LDLDF. Kank1 was obtained by positional cloning of a tumor suppressor gene in renal cell carcinoma, while the other members were found by homology search. The family is involved in the regulation of actin polymerisation and cell motility through signaling pathways containing PI3K/Akt and/or unidentified modulators/effectors []. 
Probab=58.76  E-value=4.5  Score=18.71  Aligned_cols=12  Identities=25%  Similarity=0.706  Sum_probs=8.8

Q ss_pred             CCCCCCcEEEEEEEe
Q psy623           50 DTLYQGGYFKVCVVW   64 (69)
Q Consensus        50 ~tpY~gg~f~l~i~f   64 (69)
                      +|||  | |++.+.|
T Consensus         5 ~tPY--G-yhiDLDF   16 (39)
T PF12075_consen    5 ETPY--G-YHIDLDF   16 (39)
T ss_pred             cCCc--c-eeecchH
Confidence            4788  3 8888877


No 31 
>PF08203 RNA_polI_A14:  Yeast RNA polymerase I subunit RPA14;  InterPro: IPR013239 Saccharomyces cerevisiae RNA polymerase I (Pol I) is a complex consisting of 14 subunits. Subunit RPA14 forms part of a Pol I subcomplex consisting of RPA14 and and RPA43. The RPA14 and RPA43 heterodimer is proposed to play a role in the recruitment of Pol I to the promoter []. ; PDB: 2RF4_F.
Probab=57.56  E-value=9  Score=20.28  Aligned_cols=13  Identities=23%  Similarity=0.493  Sum_probs=10.5

Q ss_pred             HHHHHHHHHHHHh
Q psy623            8 SALRALSMEYKSL   20 (69)
Q Consensus         8 ~~~~Rl~kEl~~l   20 (69)
                      .-.|||+|+++-|
T Consensus        59 SQLKRiQRdlrGL   71 (76)
T PF08203_consen   59 SQLKRIQRDLRGL   71 (76)
T ss_dssp             HHHHHHHHHHHHS
T ss_pred             HHHHHHHHhhCCC
Confidence            3568999999876


No 32 
>PF04904 NCD1:  NAB conserved region 1 (NCD1);  InterPro: IPR006988 Nab1 and Nab2 are co-repressors that specifically interact with and repress transcription mediated by the three members of the NGFI-A (Egr-1, Krox24, zif/268) family of eukaryotic (metazoa) transcription factors []. This entry represents the N-terminal NAB domain, which interacts with the EGR1 inhibitory domain (R1) []. It may also mediate multimerisation.; GO: 0045892 negative regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=53.02  E-value=23  Score=18.89  Aligned_cols=17  Identities=18%  Similarity=0.303  Sum_probs=14.6

Q ss_pred             HHHHHHHHHHHHhhhCC
Q psy623            8 SALRALSMEYKSLQEEP   24 (69)
Q Consensus         8 ~~~~Rl~kEl~~l~~~~   24 (69)
                      ...+|++|-+.++..+|
T Consensus        61 LHVrRlqKAL~ew~~~p   77 (82)
T PF04904_consen   61 LHVRRLQKALQEWSTNP   77 (82)
T ss_pred             HHHHHHHHHHHHHhcCh
Confidence            57899999999998765


No 33 
>PF09929 DUF2161:  Uncharacterized conserved protein (DUF2161);  InterPro: IPR018679 This family of various hypothetical prokaryotic proteins has no known function.
Probab=49.47  E-value=35  Score=19.55  Aligned_cols=24  Identities=17%  Similarity=0.206  Sum_probs=17.8

Q ss_pred             CCchHHHHHHHHHHHHhhhCCCCC
Q psy623            4 VPTSSALRALSMEYKSLQEEPVEG   27 (69)
Q Consensus         4 ~~~~~~~~Rl~kEl~~l~~~~~~~   27 (69)
                      ..+.....||.+|++....++..|
T Consensus        23 Rk~~krr~rLl~Ef~rR~GDpn~G   46 (118)
T PF09929_consen   23 RKNKKRRSRLLREFQRRSGDPNVG   46 (118)
T ss_pred             CCCHHHHHHHHHHHHHhcCCCCCC
Confidence            345567789999999988776543


No 34 
>PF04314 DUF461:  Protein of unknown function (DUF461);  InterPro: IPR007410 This entry represents a domain found in of proteins of unknown function, including DR1885 from Deinococcus radiodurans and CC3502 from Caulobacter crescentus (Caulobacter vibrioides), which share a potential metal binding motif H(M)X10MX21HXM. DR1885 was found to bind copper(I) through a histidine and three Mets in a cupredoxin-like fold []. The surface location of the copper-binding site as well as the type of coordination are well poised for metal transfer chemistry, suggesting that DR1885 might transfer copper, taking the role of Cox17 in bacteria (Cox17 being an accessory protein required for correct assembly of eukaryotic cyochrome c oxidase). ; PDB: 2K6W_A 2K6Z_A 2K6Y_A 2K70_A 1X9L_A 2JQA_A.
Probab=48.48  E-value=29  Score=19.04  Aligned_cols=27  Identities=11%  Similarity=0.288  Sum_probs=21.3

Q ss_pred             eEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623           40 EWEVAIFGPPDTLYQGGYFKVCVVWYK   66 (69)
Q Consensus        40 ~w~~~i~GP~~tpY~gg~f~l~i~fp~   66 (69)
                      -.++.+.|++..+=+|..+.+.+.|-.
T Consensus        77 g~HlmL~g~~~~l~~G~~v~ltL~f~~  103 (110)
T PF04314_consen   77 GYHLMLMGLKRPLKPGDTVPLTLTFED  103 (110)
T ss_dssp             CCEEEEECESS-B-TTEEEEEEEEETT
T ss_pred             CEEEEEeCCcccCCCCCEEEEEEEECC
Confidence            368889999999999999999999843


No 35 
>TIGR02422 protocat_beta protocatechuate 3,4-dioxygenase, beta subunit. This model represents the beta chain of protocatechuate 3,4-dioxygenase. The most closely related family outside this family is that of the alpha chain (TIGR02423), typically encoded in an adjacent locus. This enzyme acts in the degradation of aromatic compounds by way of p-hydroxybenzoate to succinate and acetyl-CoA.
Probab=48.10  E-value=19  Score=22.65  Aligned_cols=18  Identities=11%  Similarity=0.410  Sum_probs=16.0

Q ss_pred             cceEEEEEECCCCCCCCC
Q psy623           38 LFEWEVAIFGPPDTLYQG   55 (69)
Q Consensus        38 l~~w~~~i~GP~~tpY~g   55 (69)
                      ..+|.+.+.|..+|+||+
T Consensus       203 ~y~FDI~Lqg~~etvf~~  220 (220)
T TIGR02422       203 GYRFDIVLRGRRATPFEN  220 (220)
T ss_pred             eEEEEEEEcCCCcCcCCC
Confidence            379999999999999985


No 36 
>PF12011 DUF3503:  Domain of unknown function (DUF3503);  InterPro: IPR021892  RNA helicase NPH-II or I8 is found in Poxviridae. It is essential for viral replication. Plays an important role during transcription of early mRNAs, presumably by preventing R-loop formation behind the elongating RNA polymerase. Acts as NTP-dependent helicase that catalyzes unidirectional unwinding of 3'tailed duplex RNAs. It might also play a role in the export of newly synthesized mRNA chains out of the core into the cytoplasm and is required for propagation of viral particles[].; GO: 0017111 nucleoside-triphosphatase activity
Probab=47.22  E-value=23  Score=21.57  Aligned_cols=46  Identities=17%  Similarity=0.214  Sum_probs=28.4

Q ss_pred             HHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEe
Q psy623           13 LSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVW   64 (69)
Q Consensus        13 l~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~f   64 (69)
                      -++|+.++..+....|+.. .-+=--++|+-...     ....|+|+|.+++
T Consensus        23 Sq~E~~~m~~~~~~~Fs~A-vFPliKHRW~~a~v-----~~~~~~ykLn~E~   68 (170)
T PF12011_consen   23 SQKEFEEMPKEERKAFSYA-VFPLIKHRWKGAYV-----VRDNRVYKLNVES   68 (170)
T ss_pred             cHHHHHhcChhhhhcceee-eeeeehhcccCceE-----EEECCEEEEEeeh
Confidence            4678888876555554433 22223478875443     2557899999985


No 37 
>PF05709 Sipho_tail:  Phage tail protein;  InterPro: IPR008841 This family consists of several Siphovirus and other phage tail component proteins as well as some bacterial proteins of unknown function. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.; PDB: 4DIV_X 2X8K_C.
Probab=45.69  E-value=67  Score=19.41  Aligned_cols=57  Identities=18%  Similarity=0.195  Sum_probs=32.0

Q ss_pred             HHHHHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCC--CCCCCcEEEEEEEecC
Q psy623            8 SALRALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPD--TLYQGGYFKVCVVWYK   66 (69)
Q Consensus         8 ~~~~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~--tpY~gg~f~l~i~fp~   66 (69)
                      ....++.+++..+.... ....+. .+++.-..|.+.+.+..+  .-...|.|.+.+.+|.
T Consensus        52 ~~~~~~~~~l~~~l~~~-~~~~l~-f~d~p~~~y~~~~~~~~~~~~~~~~~~~ti~f~c~d  110 (249)
T PF05709_consen   52 EDFEQKRRELASWLNPK-EPVKLI-FDDDPDKYYYAKVSGSPDPDEGNNSGTFTITFTCPD  110 (249)
T ss_dssp             HHHHHHHHHHHHHH--S-S-EEEE-ETTSTT-EEEEEEEEEEE--SSSSCEEEEEEEEEEE
T ss_pred             HHHHHHHHHHHHhhCcC-CCEEEE-EECCCCEEEEEEECCcccccccceeEEEEEEEEECC
Confidence            34456677888776433 337777 566656788888776422  2233446666666543


No 38 
>PF12259 DUF3609:  Protein of unknown function (DUF3609);  InterPro: IPR022048  This domain family is found in eukaryotes and viruses, and is typically between 348 and 360 amino acids in length. 
Probab=45.65  E-value=19  Score=24.29  Aligned_cols=55  Identities=9%  Similarity=0.028  Sum_probs=31.0

Q ss_pred             HHHHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623            9 ALRALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWYK   66 (69)
Q Consensus         9 ~~~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp~   66 (69)
                      .-++|..|+.++....+.+..+- ++..++....-.+..-  ....+...-|.|++|-
T Consensus        33 sP~~L~~em~~V~~~L~~~~~lp-~~~~~m~~ly~l~~v~--~~~~~~~liF~i~VPL   87 (361)
T PF12259_consen   33 SPKQLLDEMKNVSSHLPRDWSLP-LEKSNMHDLYRLIKVH--FITPDNKLIFVIEVPL   87 (361)
T ss_pred             CHHHHHHHHHHHHhcCCcccccC-ccccHHHHHhhhEEEE--EEeeCCEEEEEEEece
Confidence            45799999999987776666543 4444444333222110  2233556666666653


No 39 
>PF01107 MP:  Viral movement protein (MP);  InterPro: IPR001022 The movement protein of tobamoviruses is necessary for the initial cell-to-cell movement during the early stages of a viral infection. This movement is active, and involves the interaction of the movement protein with the plasmodesmata. The movement protein possesses the ability to bind to RNA to achieve its role []. The N terminus contains two particularly well-conserved regions, substitutions in one of these results in temperature-sensitive cell-to-cell movement. The C terminus contains three sub-regions characterised by the distributions of charged amino acid residues [].; GO: 0003676 nucleic acid binding, 0006810 transport
Probab=44.92  E-value=69  Score=19.31  Aligned_cols=37  Identities=8%  Similarity=0.142  Sum_probs=25.6

Q ss_pred             CCEEEEecCC-CCcceEEEEEECCCCCCCCCcEEEEEEEe
Q psy623           26 EGFRVKLVND-DNLFEWEVAIFGPPDTLYQGGYFKVCVVW   64 (69)
Q Consensus        26 ~~~~~~~~~~-~nl~~w~~~i~GP~~tpY~gg~f~l~i~f   64 (69)
                      |++.+. .++ +.-..|.+.+. -.+..++.|..-|.|++
T Consensus       130 Pn~~vs-l~D~~~~~~l~l~~~-~~g~~~~~G~~p~SI~~  167 (189)
T PF01107_consen  130 PNYSVS-LDDPNLNDSLQLSVK-FEGVDMEDGSHPFSIEY  167 (189)
T ss_pred             CCcEEE-ecchhcCCcEEEEEE-ccccccCCCcEeEEEEE
Confidence            455555 434 34567777776 66667999999998876


No 40 
>COG2847 Copper(I)-binding protein [Inorganic ion transport and    metabolism]
Probab=44.84  E-value=67  Score=19.17  Aligned_cols=27  Identities=22%  Similarity=0.370  Sum_probs=23.6

Q ss_pred             eEEEEEECCCCCCCCCcEEEEEEEecC
Q psy623           40 EWEVAIFGPPDTLYQGGYFKVCVVWYK   66 (69)
Q Consensus        40 ~w~~~i~GP~~tpY~gg~f~l~i~fp~   66 (69)
                      -.++.|.|++-..=+|..|.++++|-+
T Consensus       104 gyHvMlm~lK~pl~eGd~v~vtL~f~~  130 (151)
T COG2847         104 GYHVMLMGLKKPLKEGDKVPVTLKFEK  130 (151)
T ss_pred             CEEEEEeccCCCccCCCEEEEEEEEec
Confidence            467889999999999999999999954


No 41 
>KOG3285|consensus
Probab=43.27  E-value=73  Score=19.80  Aligned_cols=43  Identities=12%  Similarity=0.209  Sum_probs=26.4

Q ss_pred             hHHHHHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCC
Q psy623            7 SSALRALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPD   50 (69)
Q Consensus         7 ~~~~~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~   50 (69)
                      .-..+||++|++.+.+.--..++.- |.=+-...|.+.+..-.|
T Consensus       118 ~k~~~~iq~EIraviRQItasVtfL-P~Le~~ctFdvLiyTdkD  160 (203)
T KOG3285|consen  118 VKDLKRIQNEIRAVIRQITASVTFL-PLLEEICTFDVLIYTDKD  160 (203)
T ss_pred             hhHHHHHHHHHHHHHHHHhhheeec-ccccceeEEEEEEEeCCC
Confidence            3467899999999876554555554 322333566666654444


No 42 
>PF02970 TBCA:  Tubulin binding cofactor A;  InterPro: IPR004226 The folding pathway of tubulins includes highly specific interactions with a series of cofactors (A, B, C, D and E) after they are released from the eukaryotic chaperonin CCT. Cofactors A and D capture and stabilise tubulin in a quasi-native conformation. Cofactor E binds to the cofactor D-tubulin complex, and interaction with cofactor C then causes the release of tubulin poypeptides in the native state. This family is the tubulin-specific chaperone A.; GO: 0051082 unfolded protein binding, 0007021 tubulin complex assembly, 0005874 microtubule; PDB: 3MXZ_A 1QSD_A 1H7C_A.
Probab=40.86  E-value=37  Score=18.20  Aligned_cols=15  Identities=20%  Similarity=0.242  Sum_probs=11.3

Q ss_pred             hHHHHHHHHHHHHhh
Q psy623            7 SSALRALSMEYKSLQ   21 (69)
Q Consensus         7 ~~~~~Rl~kEl~~l~   21 (69)
                      ..+.+||.||+....
T Consensus         6 t~~vkRL~KE~~~Y~   20 (90)
T PF02970_consen    6 TGVVKRLLKEEASYE   20 (90)
T ss_dssp             HHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHH
Confidence            357899999987654


No 43 
>PF14824 Sirohm_synth_M:  Sirohaem biosynthesis protein central; PDB: 1KYQ_B.
Probab=40.33  E-value=36  Score=14.75  Aligned_cols=16  Identities=19%  Similarity=0.237  Sum_probs=10.2

Q ss_pred             CchHHHHHHHHHHHHh
Q psy623            5 PTSSALRALSMEYKSL   20 (69)
Q Consensus         5 ~~~~~~~Rl~kEl~~l   20 (69)
                      ++...+++|.+|+++.
T Consensus        14 ~sP~la~~iR~~ie~~   29 (30)
T PF14824_consen   14 KSPRLARLIRKEIERL   29 (30)
T ss_dssp             S-HHHHHHHHHHHHHH
T ss_pred             CChHHHHHHHHHHHHh
Confidence            4555667788887753


No 44 
>PF14501 HATPase_c_5:  GHKL domain
Probab=39.15  E-value=26  Score=18.53  Aligned_cols=13  Identities=31%  Similarity=0.388  Sum_probs=8.0

Q ss_pred             CCCcEEEEEEEec
Q psy623           53 YQGGYFKVCVVWY   65 (69)
Q Consensus        53 Y~gg~f~l~i~fp   65 (69)
                      .+++.|.++|.||
T Consensus        88 ~~~~~f~~~i~ip  100 (100)
T PF14501_consen   88 SEDGIFTVKIVIP  100 (100)
T ss_pred             EECCEEEEEEEEC
Confidence            3456666766665


No 45 
>PF14436 EndoU_bacteria:  Bacterial EndoU nuclease
Probab=38.69  E-value=46  Score=18.58  Aligned_cols=18  Identities=11%  Similarity=-0.044  Sum_probs=15.2

Q ss_pred             CCCCCcEEEEEEEecCCC
Q psy623           51 TLYQGGYFKVCVVWYKCS   68 (69)
Q Consensus        51 tpY~gg~f~l~i~fp~~~   68 (69)
                      .++.+|+|.+.+.++...
T Consensus        42 ~~~~~GVy~~~v~i~~~~   59 (128)
T PF14436_consen   42 PPNPNGVYTAKVEIPGPD   59 (128)
T ss_pred             ccCCCCeEEEEEEEecCC
Confidence            489999999999997654


No 46 
>PF12915 DUF3833:  Protein of unknown function (DUF3833);  InterPro: IPR024409 This is a family of uncharacterised proteins found in Proteobacteria.
Probab=35.19  E-value=85  Score=18.97  Aligned_cols=25  Identities=20%  Similarity=0.296  Sum_probs=19.1

Q ss_pred             CCCcceEEEEEECCCCCCCCCcEEEEEEE
Q psy623           35 DDNLFEWEVAIFGPPDTLYQGGYFKVCVV   63 (69)
Q Consensus        35 ~~nl~~w~~~i~GP~~tpY~gg~f~l~i~   63 (69)
                      ..|-++|+..+.-    |+.|+.+.+++.
T Consensus       107 ~Gnal~w~Y~l~l----p~~g~~~~v~fd  131 (164)
T PF12915_consen  107 AGNALNWRYDLDL----PVDGSTYKVNFD  131 (164)
T ss_pred             eccEEEEEEEEEe----eeCCcEEEEEEE
Confidence            4578899999884    567788888764


No 47 
>PF06675 DUF1177:  Protein of unknown function (DUF1177);  InterPro: IPR009561 This family consists of several hypothetical archaeal and bacterial proteins of around 300 residues in length. The function of this family is unknown.
Probab=34.67  E-value=24  Score=23.00  Aligned_cols=17  Identities=18%  Similarity=0.327  Sum_probs=13.4

Q ss_pred             CCCCCCCCcEEEEEEEe
Q psy623           48 PPDTLYQGGYFKVCVVW   64 (69)
Q Consensus        48 P~~tpY~gg~f~l~i~f   64 (69)
                      -+=|||.+|+|++.-.+
T Consensus       180 qDITPygNgvyHiNSIl  196 (276)
T PF06675_consen  180 QDITPYGNGVYHINSIL  196 (276)
T ss_pred             cccccCCCCceeeeccc
Confidence            34499999999998554


No 48 
>PF07547 RSD-2:  RSD-2 N-terminal domain;  InterPro: IPR011508 This domain is found in three copies at the N terminus of the Caenorhabditis elegans RSD-2 protein. RSD-2 (RNAi spreading defective) is involved in systemic RNAi []. Mutations in the rsd-2 gene do not affect somatic genes but only germline expressed genes [].
Probab=34.57  E-value=73  Score=17.10  Aligned_cols=28  Identities=11%  Similarity=0.220  Sum_probs=21.7

Q ss_pred             cceEEEEEECCC-CCCCCCcEEEEEEEec
Q psy623           38 LFEWEVAIFGPP-DTLYQGGYFKVCVVWY   65 (69)
Q Consensus        38 l~~w~~~i~GP~-~tpY~gg~f~l~i~fp   65 (69)
                      ...|++.+.-++ .+.=.+++|+..|.+-
T Consensus        16 yad~yG~V~~s~~~~~~~~~vyk~~I~vn   44 (84)
T PF07547_consen   16 YADWYGLVDCSDVESMERGTVYKTEITVN   44 (84)
T ss_pred             eccccCeEEcccccccccccEEEEEEEEE
Confidence            347888887676 6777788999999874


No 49 
>cd02860 Pullulanase_N_term Pullulanase domain N-terminus. Pullulanase (AKA dextrinase; alpha-dextrin endo-1,6-alpha glucosidase) is an enzyme with action similar to that of isoamylase; it cleaves 1,6-alpha-glucosidic linkages in pullulan, amylopectin, and glycogen, and in alpha-and beta-amylase limit-dextrins of amylopectin and glycogen.  The N-terminus of pullulanase may be related to the immunoglobulin and/or fibronectin type III superfamilies.  These domains are associated with different types of catalytic domains at  either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions.  Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitobiase, and chitinase.
Probab=33.56  E-value=79  Score=16.69  Aligned_cols=24  Identities=25%  Similarity=0.328  Sum_probs=16.0

Q ss_pred             CcceEEEEEECCCCCCCCCcEEEEEEEe
Q psy623           37 NLFEWEVAIFGPPDTLYQGGYFKVCVVW   64 (69)
Q Consensus        37 nl~~w~~~i~GP~~tpY~gg~f~l~i~f   64 (69)
                      .--.|++.+.|.    .+|..|.+.|.-
T Consensus        45 ~~gvw~~~v~~~----~~g~~Y~y~i~~   68 (100)
T cd02860          45 ENGVWSVTLDGD----LEGYYYLYEVKV   68 (100)
T ss_pred             CCCEEEEEeCCc----cCCcEEEEEEEE
Confidence            345788887753    566677777754


No 50 
>PF12627 PolyA_pol_RNAbd:  Probable RNA and SrmB- binding site of polymerase A; PDB: 1OU5_B 3H38_A 3H3A_B 3H39_B 3H37_A 3AQN_A 3AQK_A 3AQM_B 3AQL_B 1MIY_A ....
Probab=33.41  E-value=54  Score=15.73  Aligned_cols=19  Identities=11%  Similarity=0.170  Sum_probs=13.8

Q ss_pred             hHHHHHHHHHHHHhhhCCC
Q psy623            7 SSALRALSMEYKSLQEEPV   25 (69)
Q Consensus         7 ~~~~~Rl~kEl~~l~~~~~   25 (69)
                      ..+..||..|+..+...+.
T Consensus        21 ~is~ERi~~El~kil~~~~   39 (64)
T PF12627_consen   21 KISKERIREELEKILSSPN   39 (64)
T ss_dssp             GS-HHHHHHHHHHHHTSTT
T ss_pred             cCCHHHHHHHHHHHHcCCC
Confidence            4567899999999876543


No 51 
>PF00868 Transglut_N:  Transglutaminase family;  InterPro: IPR001102 Synonym(s): Protein-glutamine gamma-glutamyltransferase, Fibrinoligase, TGase Protein-glutamine gamma-glutamyltransferases (2.3.2.13 from EC) (TGase) are calcium-dependent enzymes that catalyse the cross-linking of proteins by promoting the formation of isopeptide bonds between the gamma-carboxyl group of a glutamine in one polypeptide chain and the epsilon-amino group of a lysine in a second polypeptide chain. TGases also catalyse the conjugation of polyamines to proteins [, ]. Transglutaminases are widely distributed in various organs, tissues and body fluids. The best known transglutaminase is blood coagulation factor XIII, a plasma tetrameric protein composed of two catalytic A subunits and two non-catalytic B subunits. Factor XIII is responsible for cross-linking fibrin chains, thus stabilising the fibrin clot. There are commonly three domains: N-terminal, middle (IPR013808 from INTERPRO) and C-terminal (IPR013807 from INTERPRO). This entry represents the N-terminal domain found in transglutaminases.; GO: 0018149 peptide cross-linking; PDB: 1L9N_B 1NUF_A 1NUD_A 1NUG_B 1L9M_A 1KV3_C 3S3S_A 2Q3Z_A 3LY6_A 3S3P_A ....
Probab=33.38  E-value=85  Score=17.54  Aligned_cols=21  Identities=24%  Similarity=0.428  Sum_probs=9.1

Q ss_pred             EEEEEECCCCCCCCCcEEEEEEE
Q psy623           41 WEVAIFGPPDTLYQGGYFKVCVV   63 (69)
Q Consensus        41 w~~~i~GP~~tpY~gg~f~l~i~   63 (69)
                      ..+.|.-|.+.+-  |.|+|.|.
T Consensus        95 ~tv~V~spa~A~V--G~y~l~v~  115 (118)
T PF00868_consen   95 VTVSVTSPANAPV--GRYKLSVE  115 (118)
T ss_dssp             EEEEEE--TTS----EEEEEEEE
T ss_pred             EEEEEECCCCCce--EEEEEEEE
Confidence            3444444555554  66666654


No 52 
>cd02848 Chitinase_N_term Chitinase N-terminus domain. Chitinases hydrolyze the abundant natural biopolymer chitin, producing smaller chito-oligosaccharides. Chitin consists of multiple N-acetyl-D-glucosamine (NAG) residues connected via beta-1,4-glycosidic linkages and is an important structural element of fungal cell wall and arthropod exoskeletons. On the basis of the mode of chitin hydrolysis, chitinases are classified as random, endo-, and exo-chitinases and based on sequence criteria, chitinases belong to families 18 and 19 of glycosyl hydrolases.  The N-terminus of chitinase may be related to the immunoglobulin and/or fibronectin type III superfamilies. These domains are associated with different types of catalytic domains at  either the N-terminal or C-terminal end and may be involved in homodimeric/tetrameric/dodecameric interactions. Members of this family include members of the alpha amylase family, sialidase, galactose oxidase, cellulase, cellulose, hyaluronate lyase, chitob
Probab=33.24  E-value=47  Score=18.68  Aligned_cols=12  Identities=25%  Similarity=0.487  Sum_probs=10.3

Q ss_pred             CCCcEEEEEEEe
Q psy623           53 YQGGYFKVCVVW   64 (69)
Q Consensus        53 Y~gg~f~l~i~f   64 (69)
                      =+||.|.+.|++
T Consensus        77 ~kgG~y~m~V~l   88 (106)
T cd02848          77 GKGGRYQMQVAL   88 (106)
T ss_pred             CCCCeEEEEEEE
Confidence            349999999987


No 53 
>TIGR03601 B_an_ocin probable heterocycle-containing bacteriocin, BA_2677 family. Numerous bacteria encode systems for producing bacteriocins by extensive modification of ribosomally produced precursors. These proteins are recognizable in part by proximity to the modification proteins, and in part by small size, with leader peptide-like N-terminal sequence followed by low-complexity sequence rich in Cys, Gly, and/or Ser. This protein family represents a probable member of the class, though previously unrecognized because it is not encoded adjacent to its modification proteins.
Probab=31.51  E-value=49  Score=17.29  Aligned_cols=22  Identities=23%  Similarity=0.451  Sum_probs=13.2

Q ss_pred             HHHHHHHhhhCCCCCEEEEecCCCCcceEE
Q psy623           13 LSMEYKSLQEEPVEGFRVKLVNDDNLFEWE   42 (69)
Q Consensus        13 l~kEl~~l~~~~~~~~~~~~~~~~nl~~w~   42 (69)
                      .++||+.|..+.        ...+|+..|.
T Consensus         4 FqqELq~L~lnd--------y~~gnvvyWD   25 (79)
T TIGR03601         4 FQQELQSLSLND--------YQTGNVVYWD   25 (79)
T ss_pred             HHHHHHhccccc--------ccCCcEEEec
Confidence            578888876543        3344555564


No 54 
>TIGR01633 phi3626_gp14_N putative phage tail component, N-terminal domain. This model represents the best-conserved region of about 125 amino acids, toward the N-terminus, of a family of proteins from temperate phage of a number of Gram-positive bacteria. These phage proteins range in length from 230 to 525 amino acids.
Probab=31.10  E-value=94  Score=16.82  Aligned_cols=57  Identities=14%  Similarity=-0.020  Sum_probs=33.3

Q ss_pred             HHHHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCC--CCCCCcEEEEEEEecCC
Q psy623            9 ALRALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPD--TLYQGGYFKVCVVWYKC   67 (69)
Q Consensus         9 ~~~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~--tpY~gg~f~l~i~fp~~   67 (69)
                      ..+.+.++++.+.... ....+. .+++.=..|.+.+.+..+  .....|.+.+.+.+|.|
T Consensus        63 ~~~~~~~~l~~~L~~~-~~~~L~-f~dePd~yy~a~~~~~~~~~~~~~~~~~titF~c~dP  121 (124)
T TIGR01633        63 DLRELFRELAGWLNSQ-EPVPLI-FSDEPDKTYYARVDEEIDLDEDTTFGKGTLNFICPDP  121 (124)
T ss_pred             HHHHHHHHHHHHhCCC-CCcceE-eccCCCcEEEEEEcCccCHHHhhcccEEEEEEEecCC
Confidence            4556777788876533 223444 333333577788776432  22345788888887664


No 55 
>PF06305 DUF1049:  Protein of unknown function (DUF1049);  InterPro: IPR010445 This entry consists of several hypothetical bacterial proteins of unknown function.
Probab=30.96  E-value=59  Score=15.88  Aligned_cols=15  Identities=40%  Similarity=0.581  Sum_probs=9.9

Q ss_pred             HHHHHHHHHHHhhhC
Q psy623            9 ALRALSMEYKSLQEE   23 (69)
Q Consensus         9 ~~~Rl~kEl~~l~~~   23 (69)
                      ..+|+.+|+++++++
T Consensus        49 ~~~~~~k~l~~le~e   63 (68)
T PF06305_consen   49 RIRRLRKELKKLEKE   63 (68)
T ss_pred             HHHHHHHHHHHHHHH
Confidence            456777777777653


No 56 
>CHL00130 rbcS ribulose-1,5-bisphosphate carboxylase/oxygenase small subunit; Reviewed
Probab=29.87  E-value=1.2e+02  Score=17.82  Aligned_cols=33  Identities=18%  Similarity=0.118  Sum_probs=22.5

Q ss_pred             HHHHHHHHHHHhhhCCCCC-EEEEecCCCCcceEEE
Q psy623            9 ALRALSMEYKSLQEEPVEG-FRVKLVNDDNLFEWEV   43 (69)
Q Consensus         9 ~~~Rl~kEl~~l~~~~~~~-~~~~~~~~~nl~~w~~   43 (69)
                      ....+++|+.++++.-+.. |.+  .--||...|++
T Consensus        60 D~~~Vl~Ei~~CrkayP~~yIRl--~gFDn~rq~Q~   93 (138)
T CHL00130         60 DPAAVMFEINECRKQKPNGYIKV--NAFDASRGVES   93 (138)
T ss_pred             CHHHHHHHHHHHHHHCCCcEEEE--EEeeCCCcEEE
Confidence            3457899999998865443 344  34567777777


No 57 
>PF08872 KGK:  KGK domain;  InterPro: IPR014971 This protein is found in one or two copies in cyanobacterial proteins. It is named after a short sequence motif. 
Probab=29.75  E-value=1.1e+02  Score=17.27  Aligned_cols=25  Identities=16%  Similarity=0.223  Sum_probs=19.5

Q ss_pred             EEEEEECCCCCCCCCcEEEE--EEEec
Q psy623           41 WEVAIFGPPDTLYQGGYFKV--CVVWY   65 (69)
Q Consensus        41 w~~~i~GP~~tpY~gg~f~l--~i~fp   65 (69)
                      ..|.|..+...-|.-|.+++  .++|-
T Consensus        67 i~CeiL~~g~~~W~kGK~ri~~~leF~   93 (114)
T PF08872_consen   67 IECEILRFGSKGWQKGKVRIKVSLEFI   93 (114)
T ss_pred             eeEEEeccCCCCCccceEEEEEEEEEe
Confidence            45667777777899999999  77773


No 58 
>PF09943 DUF2175:  Uncharacterized protein conserved in archaea (DUF2175);  InterPro: IPR018686  This family of various hypothetical archaeal proteins has no known function. 
Probab=29.03  E-value=68  Score=17.88  Aligned_cols=20  Identities=30%  Similarity=0.823  Sum_probs=15.6

Q ss_pred             eEEEEEECCCCCCCCCcEEEEE
Q psy623           40 EWEVAIFGPPDTLYQGGYFKVC   61 (69)
Q Consensus        40 ~w~~~i~GP~~tpY~gg~f~l~   61 (69)
                      .|.+.|-|  +..|+|-.|.|.
T Consensus         2 kWkC~iCg--~~I~~gqlFTF~   21 (101)
T PF09943_consen    2 KWKCYICG--KPIYEGQLFTFT   21 (101)
T ss_pred             ceEEEecC--CeeeecceEEEe
Confidence            68888876  368999888775


No 59 
>PF07691 PA14:  PA14 domain;  InterPro: IPR011658 The PA14 domain forms an insert in bacterial beta-glucosidases, other glycosidases, glycosyltransferases, proteases, amidases, yeast adhesins and bacterial toxins, including anthrax protective antigen (PA). The domain also occurs in a Dictyostelium pre-spore cell-inducing factor Psi and in fibrocystin, the mammalian protein whose mutation leads to polycystic kidney and hepatic disease. The crystal structure of PA shows that this domain (named PA14 after its location in the PA20 pro-peptide) has a beta-barrel structure. The PA14 domain sequence suggests a binding function, rather than a catalytic role. The PA14 domain distribution is compatible with carbohydrate binding [].; PDB: 2XVG_A 2XVK_A 2XVL_A 2XJU_A 2XJT_A 2XJQ_A 2XJS_A 2XJV_A 2XJP_A 2XJR_A ....
Probab=28.56  E-value=1.1e+02  Score=16.85  Aligned_cols=20  Identities=15%  Similarity=0.404  Sum_probs=13.3

Q ss_pred             ceEEEEEECCCCCCCCCcEEEEEEE
Q psy623           39 FEWEVAIFGPPDTLYQGGYFKVCVV   63 (69)
Q Consensus        39 ~~w~~~i~GP~~tpY~gg~f~l~i~   63 (69)
                      .+|...|.-|.+     |.|.|.+.
T Consensus        48 ~~~~G~~~~~~~-----G~y~f~~~   67 (145)
T PF07691_consen   48 VRWTGYFKPPET-----GTYTFSLT   67 (145)
T ss_dssp             EEEEEEEEESSS-----EEEEEEEE
T ss_pred             EEEEEEEecccC-----ceEEEEEE
Confidence            478888883333     77777664


No 60 
>PHA00672 hypothetical protein
Probab=27.75  E-value=55  Score=19.24  Aligned_cols=20  Identities=15%  Similarity=0.325  Sum_probs=15.6

Q ss_pred             CCCCCCCCcEEEEEEEecCC
Q psy623           48 PPDTLYQGGYFKVCVVWYKC   67 (69)
Q Consensus        48 P~~tpY~gg~f~l~i~fp~~   67 (69)
                      |-.--|++|+|-=+|.+|+-
T Consensus        38 pv~H~Fs~GvYARei~IPkG   57 (152)
T PHA00672         38 PTAHLFHAGVYARTIRIPAG   57 (152)
T ss_pred             hhhhhhccceeEEEEeccCc
Confidence            33346889999999999974


No 61 
>KOG0680|consensus
Probab=26.69  E-value=1.8e+02  Score=20.00  Aligned_cols=44  Identities=20%  Similarity=0.222  Sum_probs=24.7

Q ss_pred             HHHHHHHHHHHHhhhCCCCC--EEEEecCCCCcceEEEEEECCCCCCCC
Q psy623            8 SALRALSMEYKSLQEEPVEG--FRVKLVNDDNLFEWEVAIFGPPDTLYQ   54 (69)
Q Consensus         8 ~~~~Rl~kEl~~l~~~~~~~--~~~~~~~~~nl~~w~~~i~GP~~tpY~   54 (69)
                      .-+.||.+|++.+.   |..  +++..+.+.-.+-|++.-.-++.+-|+
T Consensus       332 gF~~RL~~Elr~l~---P~d~~v~V~~p~dp~~~~W~~g~~~~~~~~~~  377 (400)
T KOG0680|consen  332 GFRQRLARELRSLL---PADWEVSVSVPEDPITFAWEGGSEFAKTDSFE  377 (400)
T ss_pred             chHHHHHHHHHhhC---CccceEEEecCCCcceeeehhccccccCcchh
Confidence            46789999999874   333  344423222356777654434444343


No 62 
>COG5402 Uncharacterized conserved protein [Function unknown]
Probab=26.68  E-value=1.6e+02  Score=18.18  Aligned_cols=29  Identities=21%  Similarity=0.468  Sum_probs=19.5

Q ss_pred             eEEEEEECCCCCCCC------CcEEEEEEEecCCCC
Q psy623           40 EWEVAIFGPPDTLYQ------GGYFKVCVVWYKCSQ   69 (69)
Q Consensus        40 ~w~~~i~GP~~tpY~------gg~f~l~i~fp~~~~   69 (69)
                      .|.++|. |...|+.      +|.|.|-++|...||
T Consensus       139 ~~~I~vs-p~a~pgnWl~~~g~g~f~lVltlydtp~  173 (194)
T COG5402         139 SWVITVS-PDAQPGNWLPVRGEGNFILVLTLYDTPQ  173 (194)
T ss_pred             cEEEEEC-CCCCCCceeEecCCccEEEEEEeecCcc
Confidence            3444444 6666654      688999999877765


No 63 
>PF14135 DUF4302:  Domain of unknown function (DUF4302)
Probab=25.30  E-value=1.8e+02  Score=18.16  Aligned_cols=46  Identities=13%  Similarity=0.308  Sum_probs=24.7

Q ss_pred             HHHHHHHHHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcEEEEEEEec
Q psy623            8 SALRALSMEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGYFKVCVVWY   65 (69)
Q Consensus         8 ~~~~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~f~l~i~fp   65 (69)
                      .+..||...++++++..        ..  .-.-|.+....-.+.-| || |.+-++|-
T Consensus         9 s~~eR~~e~~~~~k~~L--------~~--a~~GW~~~yyp~~~~~~-GG-y~f~~kF~   54 (235)
T PF14135_consen    9 SPAERINEALAEYKKIL--------TS--APNGWKLEYYPKTDQSY-GG-YTFLMKFD   54 (235)
T ss_pred             CHHHHHHHHHHHHHHHH--------hc--CCCceEEEEECCCCccC-Cc-EEEEEEEC
Confidence            45678887777665532        01  11236666662222224 33 77777773


No 64 
>PF13115 YtkA:  YtkA-like
Probab=24.97  E-value=69  Score=16.30  Aligned_cols=14  Identities=14%  Similarity=0.195  Sum_probs=11.0

Q ss_pred             CCCcEEEEEEEecC
Q psy623           53 YQGGYFKVCVVWYK   66 (69)
Q Consensus        53 Y~gg~f~l~i~fp~   66 (69)
                      .+.|.|...+.|+-
T Consensus        64 ~~~G~Y~~~~~f~m   77 (86)
T PF13115_consen   64 TGPGVYEAEVTFSM   77 (86)
T ss_pred             CCCCeEEEEeecCC
Confidence            46799999988863


No 65 
>PF05015 Plasmid_killer:  Plasmid maintenance system killer protein;  InterPro: IPR007711 Several plasmids with proteic killer gene systems have been reported. All of them encode a stable toxin and an unstable antidote. Upon loss of the plasmid, the less stable inhibitor is inactivated more rapidly than the toxin, allowing the toxin to be activated. The activation of those systems result in cell filamentation and cessation of viable cell production. It has been verified that both the stable killer and the unstable inhibitor of the systems are short polypeptides. This family corresponds to the toxin.
Probab=24.69  E-value=1.2e+02  Score=16.05  Aligned_cols=39  Identities=15%  Similarity=0.251  Sum_probs=19.2

Q ss_pred             HHHHHHHHHHHHhhhC-------CCCCEEEEecCCCCcceEEEEEE
Q psy623            8 SALRALSMEYKSLQEE-------PVEGFRVKLVNDDNLFEWEVAIF   46 (69)
Q Consensus         8 ~~~~Rl~kEl~~l~~~-------~~~~~~~~~~~~~nl~~w~~~i~   46 (69)
                      ...+++.+=|..|...       .+++....++..+.--.|.+.|.
T Consensus        25 ~~~~~~~~~L~~L~aa~~~~dl~~~p~~r~h~L~G~~~g~~Si~i~   70 (93)
T PF05015_consen   25 DIAKKLRRRLDQLDAATSLEDLRSPPSNRLHKLKGDRKGQWSIRIN   70 (93)
T ss_pred             HHHHHHHHHHHHHHhCCCHHHHhcCcCCCcccccCCCCCcEEEEeC
Confidence            3455666666555432       23355555344443445555555


No 66 
>KOG2193|consensus
Probab=24.17  E-value=1.1e+02  Score=21.66  Aligned_cols=40  Identities=20%  Similarity=0.290  Sum_probs=29.9

Q ss_pred             HHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCC
Q psy623           15 MEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQ   54 (69)
Q Consensus        15 kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~   54 (69)
                      +-++.|.+.....+.+.++.-.+..+-.|+|.||+.--|+
T Consensus       431 ~hIKql~RfagASiKIappE~pdvseRMViItGppeaqfK  470 (584)
T KOG2193|consen  431 QHIKQLSRFAGASIKIAPPEIPDVSERMVIITGPPEAQFK  470 (584)
T ss_pred             hhHHHHHHhccceeeecCCCCCCcceeEEEecCChHHHHh
Confidence            3466676666677888877767788888999999875554


No 67 
>COG3866 PelB Pectate lyase [Carbohydrate transport and metabolism]
Probab=23.74  E-value=1.1e+02  Score=20.67  Aligned_cols=25  Identities=24%  Similarity=0.441  Sum_probs=19.9

Q ss_pred             ceEEEEEECCCCC-CCCCcEEEEEEE
Q psy623           39 FEWEVAIFGPPDT-LYQGGYFKVCVV   63 (69)
Q Consensus        39 ~~w~~~i~GP~~t-pY~gg~f~l~i~   63 (69)
                      .+|.+.|.|-.|+ -|.+|.+++.+.
T Consensus       197 dh~Kssl~G~sD~~~~~~~~~kvT~h  222 (345)
T COG3866         197 DHDKSSLLGSSDSSNYDDGKYKVTIH  222 (345)
T ss_pred             cCCeeeeeccCCcccccCCceeEEEe
Confidence            3789999986654 788999998875


No 68 
>PHA00458 single-stranded DNA-binding protein
Probab=23.69  E-value=1.3e+02  Score=19.22  Aligned_cols=16  Identities=19%  Similarity=0.243  Sum_probs=13.6

Q ss_pred             CCCCcEEEEEEEecCC
Q psy623           52 LYQGGYFKVCVVWYKC   67 (69)
Q Consensus        52 pY~gg~f~l~i~fp~~   67 (69)
                      -.+.|.|++.+.|++.
T Consensus        33 ~n~rG~Ykv~lt~~~d   48 (233)
T PHA00458         33 FNPRGVYKVSLTVSND   48 (233)
T ss_pred             cCCCceeEEEEEecCC
Confidence            4678999999999864


No 69 
>COG2880 Uncharacterized protein conserved in archaea [Function unknown]
Probab=23.66  E-value=20  Score=18.56  Aligned_cols=9  Identities=33%  Similarity=0.800  Sum_probs=6.8

Q ss_pred             CCCCcEEEE
Q psy623           52 LYQGGYFKV   60 (69)
Q Consensus        52 pY~gg~f~l   60 (69)
                      .|++|+|+-
T Consensus        10 iYEnGVfKP   18 (67)
T COG2880          10 IYENGVLKP   18 (67)
T ss_pred             HHhcccccc
Confidence            588888863


No 70 
>PF15586 Imm47:  Immunity protein 47
Probab=23.38  E-value=1.5e+02  Score=16.68  Aligned_cols=28  Identities=32%  Similarity=0.493  Sum_probs=18.1

Q ss_pred             CCCCcceEEEEEECCCCCCCCCcEEEEEE
Q psy623           34 NDDNLFEWEVAIFGPPDTLYQGGYFKVCV   62 (69)
Q Consensus        34 ~~~nl~~w~~~i~GP~~tpY~gg~f~l~i   62 (69)
                      +.++...|--...||.+..= +-.|.+.|
T Consensus        22 d~~~F~~~l~l~IG~~~~~G-~d~F~v~V   49 (116)
T PF15586_consen   22 DPDNFCFWLELDIGPEGEDG-SDYFQVFV   49 (116)
T ss_pred             CCcceEEEEEEEECCCCCCc-cceEEEEE
Confidence            45566666666779988773 34466655


No 71 
>PF03366 YEATS:  YEATS family;  InterPro: IPR005033  Named the YEATS family, after `YNK7', `ENL', `AF-9', and `TFIIF small subunit', this family also contains the GAS41 protein. All these proteins are thought to have a transcription stimulatory activity.; GO: 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus; PDB: 3QRL_A 2L7E_A 3FK3_C 3RLS_A.
Probab=23.33  E-value=23  Score=18.84  Aligned_cols=11  Identities=27%  Similarity=0.673  Sum_probs=5.2

Q ss_pred             eEEEEEECCCC
Q psy623           40 EWEVAIFGPPD   50 (69)
Q Consensus        40 ~w~~~i~GP~~   50 (69)
                      +|.+.+.|+.+
T Consensus         3 ~W~v~Vr~~~~   13 (84)
T PF03366_consen    3 KWTVYVRGLDN   13 (84)
T ss_dssp             EEEEEEEECCC
T ss_pred             EEEEEEEeCCC
Confidence            45555544443


No 72 
>PF02922 CBM_48:  Carbohydrate-binding module 48 (Isoamylase N-terminal domain);  InterPro: IPR004193 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Enzymes containing this domain belong to family 13 (GH13 from CAZY) of the glycosyl hydrolases. This domain is found in a range of enzymes that act on branched substrates ie. isoamylase, pullulanase and branching enzyme. Isoamylase hydrolyses 1,6-alpha-D-glucosidic branch linkages in glycogen, amylopectin and dextrin; 1,4-alpha-glucan branching enzyme functions in the formation of 1,6-glucosidic linkages of glycogen; and pullulanase is a starch-debranching enzyme.; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 2BHZ_A 2BY2_A 2BY3_A 2BXY_A 2BY1_A 2BHY_A 2BHU_A 2BXZ_A 2BY0_A 2FHB_A ....
Probab=23.30  E-value=1.2e+02  Score=15.24  Aligned_cols=27  Identities=37%  Similarity=0.539  Sum_probs=16.3

Q ss_pred             CCcceEEEEEECCCCCCCCCc-EEEEEEEec
Q psy623           36 DNLFEWEVAIFGPPDTLYQGG-YFKVCVVWY   65 (69)
Q Consensus        36 ~nl~~w~~~i~GP~~tpY~gg-~f~l~i~fp   65 (69)
                      ++--.|++.|.+   ..-+|+ .|++.|+.+
T Consensus        46 ~~~G~w~~~~~~---~~~~g~~~Y~y~i~~~   73 (85)
T PF02922_consen   46 DDDGVWEVTVPG---DLPPGGYYYKYRIDGD   73 (85)
T ss_dssp             CTTTEEEEEEEG---CGTTTT-EEEEEEEET
T ss_pred             CCCCEEEEEEcC---CcCCCCEEEEEEEEeC
Confidence            344578888875   122234 788887765


No 73 
>PTZ00281 actin; Provisional
Probab=22.97  E-value=1.8e+02  Score=19.32  Aligned_cols=35  Identities=9%  Similarity=0.157  Sum_probs=20.2

Q ss_pred             hHHHHHHHHHHHHhhhCCCCCEEEEecCCCCcceEE
Q psy623            7 SSALRALSMEYKSLQEEPVEGFRVKLVNDDNLFEWE   42 (69)
Q Consensus         7 ~~~~~Rl~kEl~~l~~~~~~~~~~~~~~~~nl~~w~   42 (69)
                      ..-..||++|++.+.... ..+.+....+.+..-|.
T Consensus       308 ~Gf~~RL~~El~~~~p~~-~~v~v~~~~~r~~~aW~  342 (376)
T PTZ00281        308 PGIADRMNKELTALAPST-MKIKIIAPPERKYSVWI  342 (376)
T ss_pred             cCHHHHHHHHHHHhCCCC-cceEEecCCCCceeEEE
Confidence            456689999999875321 23444422333455565


No 74 
>PF15590 Imm15:  Immunity protein 15
Probab=22.39  E-value=1.3e+02  Score=15.58  Aligned_cols=39  Identities=18%  Similarity=0.350  Sum_probs=20.7

Q ss_pred             HHHHHhhhCCCCCEEEEecCCCCcceEEEEEECCCCCCCCCcE
Q psy623           15 MEYKSLQEEPVEGFRVKLVNDDNLFEWEVAIFGPPDTLYQGGY   57 (69)
Q Consensus        15 kEl~~l~~~~~~~~~~~~~~~~nl~~w~~~i~GP~~tpY~gg~   57 (69)
                      +-|+++...+. +......|..+=..|.-+   +..+-|+||-
T Consensus        13 n~Lqkv~~~~d-~We~~y~DP~D~r~W~~~---~~~s~~hGGG   51 (69)
T PF15590_consen   13 NRLQKVASSPD-GWETLYQDPRDGRYWEKS---YPESHMHGGG   51 (69)
T ss_pred             HHHHHHhcCCc-chhhhccCCCCCceeEEe---cCcccccCCC
Confidence            34555554332 443332555555666543   6667788773


No 75 
>PF04447 DUF550:  Protein of unknown function (DUF550);  InterPro: IPR007538 This entry represents the N terminus of a protein of unknown function, found in a range of Proteobacteria and a few P22-like dsDNA virus particles.
Probab=21.97  E-value=1.1e+02  Score=17.13  Aligned_cols=17  Identities=29%  Similarity=0.374  Sum_probs=14.3

Q ss_pred             HHHHHHHHHHHHhhhCC
Q psy623            8 SALRALSMEYKSLQEEP   24 (69)
Q Consensus         8 ~~~~Rl~kEl~~l~~~~   24 (69)
                      ...+.|.||+.++..+|
T Consensus        22 g~lkHl~kE~~E~~~~p   38 (100)
T PF04447_consen   22 GPLKHLSKEALEAEAAP   38 (100)
T ss_pred             hHHHHHHHHHHHHHhCC
Confidence            46789999999998775


No 76 
>PF13086 AAA_11:  AAA domain; PDB: 2XZL_A 2XZO_A 2WJY_A 2WJV_A 2XZP_A 2GK6_A 2GK7_A 2GJK_A.
Probab=21.08  E-value=67  Score=18.83  Aligned_cols=9  Identities=56%  Similarity=0.911  Sum_probs=6.2

Q ss_pred             EEEECCCCC
Q psy623           43 VAIFGPPDT   51 (69)
Q Consensus        43 ~~i~GP~~t   51 (69)
                      ..|.||+||
T Consensus        20 ~~i~GpPGT   28 (236)
T PF13086_consen   20 TLIQGPPGT   28 (236)
T ss_dssp             EEEE-STTS
T ss_pred             EEEECCCCC
Confidence            467899987


No 77 
>PF01954 DUF104:  Protein of unknown function DUF104;  InterPro: IPR008203 This family includes short archaebacterial proteins of unknown function. Archaeoglobus fulgidus has twelve copies of this protein, with several being clustered together in the genome.; PDB: 2NWT_A.
Probab=21.04  E-value=36  Score=16.99  Aligned_cols=9  Identities=33%  Similarity=0.800  Sum_probs=7.3

Q ss_pred             CCCCcEEEE
Q psy623           52 LYQGGYFKV   60 (69)
Q Consensus        52 pY~gg~f~l   60 (69)
                      .|++|+|+.
T Consensus         8 iYe~GvlkP   16 (60)
T PF01954_consen    8 IYENGVLKP   16 (60)
T ss_dssp             EEETTEEEE
T ss_pred             EEECCEEEE
Confidence            589999874


No 78 
>PF09036 Bcr-Abl_Oligo:  Bcr-Abl oncoprotein oligomerisation domain;  InterPro: IPR015123 This entry represents the oligomerisation domain of the breakpoint cluster region oncoprotein Bcr, and the Bcr/Abl (Abelson-leukemia-virus) fusion protein created by a reciprocal (9;22) fusion []. Brc displays serine/threonine protein kinase activity (2.7.11.1 from EC), acting as a GTPase-activating protein for RAC1 and CDC42. Brc promotes the exchange of RAC or CDC42-bound GDP by GTP, thereby activating them []. The Bcr/Abl fusion protein loses some of the regulatory function of Bcr with regards to small Rho-like GTPases with negative consequences on cell motility, in particular on the capacity to adhere to endothelial cells []. The Bcr, Bcr/Abl oncoprotein oligomerisation domain consists of a short N-terminal helix (alpha-1), a flexible loop and a long C-terminal helix (alpha-2). Together these form an N-shaped structure, with the loop allowing the two helices to assume a parallel orientation. The monomeric domains associate into a dimer through the formation of an antiparallel coiled coil between the alpha-2 helices and domain swapping of two alpha-1 helices, where one alpha-1 helix swings back and packs against the alpha-2 helix from the second monomer. Two dimers then associate into a tetramer. The oligomerisation domain is essential for the oncogenicity of the Bcr-Abl protein []. ; GO: 0004674 protein serine/threonine kinase activity, 0005096 GTPase activator activity, 0006468 protein phosphorylation, 0007165 signal transduction; PDB: 1K1F_C.
Probab=20.98  E-value=1.2e+02  Score=16.01  Aligned_cols=12  Identities=25%  Similarity=0.437  Sum_probs=8.7

Q ss_pred             HHHHHHHHHHHH
Q psy623            8 SALRALSMEYKS   19 (69)
Q Consensus         8 ~~~~Rl~kEl~~   19 (69)
                      .+.+||.+|+.+
T Consensus        40 ~sirrLeqevnk   51 (79)
T PF09036_consen   40 ASIRRLEQEVNK   51 (79)
T ss_dssp             HHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHH
Confidence            467888888754


No 79 
>PF01774 UreD:  UreD urease accessory protein;  InterPro: IPR002669 UreD is a urease accessory protein. Urease IPR005848 from INTERPRO hydrolyses urea into ammonia and carbamic acid []. UreD is involved in activation of the urease enzyme via the UreD-UreF-UreG-urease complex [] and is required for urease nickel metallocentre assembly []. See also UreF IPR002639 from INTERPRO, UreG IPR004400 from INTERPRO.; GO: 0016151 nickel ion binding, 0006807 nitrogen compound metabolic process; PDB: 3SF5_D.
Probab=20.84  E-value=1.5e+02  Score=18.17  Aligned_cols=23  Identities=17%  Similarity=0.320  Sum_probs=13.8

Q ss_pred             EEEECCCCCCCCCcEEEEEEEec
Q psy623           43 VAIFGPPDTLYQGGYFKVCVVWY   65 (69)
Q Consensus        43 ~~i~GP~~tpY~gg~f~l~i~fp   65 (69)
                      +.+..|.+++..|..+.++|++-
T Consensus         2 ~yl~~~~gGl~~GD~l~~~i~v~   24 (209)
T PF01774_consen    2 VYLMNPSGGLLGGDRLRIDITVG   24 (209)
T ss_dssp             EEEEE--S-B-TT-EEEEEEEE-
T ss_pred             EEEEeCCCcCcCCCEEEEEEEEC
Confidence            45666888999999999999873


No 80 
>PF13119 DUF3973:  Domain of unknown function (DUF3973)
Probab=20.74  E-value=46  Score=15.42  Aligned_cols=12  Identities=25%  Similarity=0.958  Sum_probs=7.0

Q ss_pred             CCCCCCCCCcEE
Q psy623           47 GPPDTLYQGGYF   58 (69)
Q Consensus        47 GP~~tpY~gg~f   58 (69)
                      ++.+..|++|.|
T Consensus        15 ~~~~~iF~ngfY   26 (41)
T PF13119_consen   15 GINDKIFKNGFY   26 (41)
T ss_pred             cchhhhhhCcee
Confidence            355556666655


No 81 
>KOG0744|consensus
Probab=20.74  E-value=1e+02  Score=21.29  Aligned_cols=16  Identities=44%  Similarity=1.099  Sum_probs=11.8

Q ss_pred             CCcceEE--EEEECCCCC
Q psy623           36 DNLFEWE--VAIFGPPDT   51 (69)
Q Consensus        36 ~nl~~w~--~~i~GP~~t   51 (69)
                      .|+-.|+  +.+.||++|
T Consensus       171 tnlIt~NRliLlhGPPGT  188 (423)
T KOG0744|consen  171 TNLITWNRLILLHGPPGT  188 (423)
T ss_pred             CceeeeeeEEEEeCCCCC
Confidence            5677787  445799987


No 82 
>COG3254 Uncharacterized conserved protein [Function unknown]
Probab=20.63  E-value=31  Score=19.38  Aligned_cols=12  Identities=0%  Similarity=0.382  Sum_probs=6.0

Q ss_pred             CCCCCEEEEecCC
Q psy623           23 EPVEGFRVKLVND   35 (69)
Q Consensus        23 ~~~~~~~~~~~~~   35 (69)
                      ....++++. +++
T Consensus        36 agi~nYSIf-Lde   47 (105)
T COG3254          36 AGIRNYSIF-LDE   47 (105)
T ss_pred             cCCceeEEE-ecC
Confidence            344555665 443


No 83 
>cd03463 3,4-PCD_alpha Protocatechuate 3,4-dioxygenase (3,4-PCD) , alpha subunit. 3,4-PCD catalyzes the oxidative ring cleavage of 3,4-dihydroxybenzoate to produce beta-carboxy-cis,cis-muconate. 3,4-PCDs are large aggregates of 12 protomers, each composed of an alpha- and beta-subunit and an Fe3+ ion bound in the beta-subunit at the alpha-subunit-beta-subunit interface. 3,4-PCD is a member of the aromatic dioxygenases which are non-heme iron intradiol-cleaving enzymes that break the C1-C2 bond and utilize Fe3+.
Probab=20.59  E-value=1.8e+02  Score=17.79  Aligned_cols=17  Identities=12%  Similarity=0.595  Sum_probs=13.5

Q ss_pred             CcceEEEEEECCCCCCC
Q psy623           37 NLFEWEVAIFGPPDTLY   53 (69)
Q Consensus        37 nl~~w~~~i~GP~~tpY   53 (69)
                      ..++|.+++.|..+|+|
T Consensus       167 ~~~~fdIvL~g~~~t~~  183 (185)
T cd03463         167 GAYRFDIRLQGEGETVF  183 (185)
T ss_pred             CeEEEEEEecCCCcccc
Confidence            45788888888888876


No 84 
>PF14172 DUF4309:  Domain of unknown function (DUF4309)
Probab=20.37  E-value=86  Score=18.11  Aligned_cols=17  Identities=29%  Similarity=0.538  Sum_probs=13.0

Q ss_pred             CCCcE-EEEEEEecCCCC
Q psy623           53 YQGGY-FKVCVVWYKCSQ   69 (69)
Q Consensus        53 Y~gg~-f~l~i~fp~~~~   69 (69)
                      |.-|- |+|.+.|++.++
T Consensus       108 Y~~g~~Y~L~Fvf~~~t~  125 (134)
T PF14172_consen  108 YKAGSDYKLEFVFPKPTS  125 (134)
T ss_pred             EecCCceEEEEEEcCCCC
Confidence            55566 999999988653


No 85 
>TIGR03517 GldM_gliding gliding motility-associated protein GldM. This protein family, GldM, is named for the member from Flavobacterium johnsoniae, which is required for a type of rapid gliding motility found in certain members of the Bacteriodetes. However, members are found also in several members of the Bacteriodetes that appear not to be motile. The best conserved region, toward the N-terminus, is centered on a highly hydrobobic probable transmembrane helix. Two paralogs are found in Cytophaga hutchinsonii.
Probab=20.27  E-value=1.1e+02  Score=21.94  Aligned_cols=28  Identities=14%  Similarity=0.381  Sum_probs=23.1

Q ss_pred             CcceEEEEEECCCCCCCCCcEEEEEEEe
Q psy623           37 NLFEWEVAIFGPPDTLYQGGYFKVCVVW   64 (69)
Q Consensus        37 nl~~w~~~i~GP~~tpY~gg~f~l~i~f   64 (69)
                      ++.+..+.+.-.....|+|..|..+|.+
T Consensus       231 ~~d~~~a~V~~~s~~v~~Ge~~~a~vvL  258 (523)
T TIGR03517       231 SVDNYQAIVIPKSDAVFAGETYEAEVVL  258 (523)
T ss_pred             cccceeEEEEcCCceeecCCeEEEEEEE
Confidence            4567778888677789999999999987


Done!